Estimating diagnostic uncertainty in artificial intelligence assisted pathology using conformal prediction

Henrik Olsson; Kimmo Kartasalo; Nita Mulliqi; Marco Capuccini; Pekka Ruusuvuori; Hemamali Samaratunga; Brett Delahunt; Cecilia Lindskog; Emiel A M Janssen; Anders Blilie; ISUP Prostate Imagebase Expert Panel; Lars Egevad; Ola Spjuth; Martin Eklund

doi:10.1038/s41467-022-34945-8

Estimating diagnostic uncertainty in artificial intelligence assisted pathology using conformal prediction

Nat Commun. 2022 Dec 15;13(1):7761. doi: 10.1038/s41467-022-34945-8.

Authors

Affiliations

¹ Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden. henrik.olsson@ki.se.
² Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden.
³ Department of Pharmaceutical Biosciences, Uppsala University, Uppsala, Sweden.
⁴ Institute of Biomedicine, University of Turku, Turku, Finland.
⁵ Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland.
⁶ Aquesta Uropathology and University of Queensland, Brisbane, QLD, Australia.
⁷ Department of Pathology and Molecular Medicine, Wellington School of Medicine and Health Sciences, University of Otago, Wellington, New Zealand.
⁸ Department of Immunology, Genetics and Pathology, Uppsala University, Uppsala, Sweden.
⁹ Department of Pathology, Stavanger University Hospital, Stavanger, Norway.
¹⁰ Faculty of Science and Technology, University of Stavanger, Stavanger, Norway.
¹¹ Department of Oncology Pathology, Karolinska Institutet, Solna, Sweden.

Abstract

Unreliable predictions can occur when an artificial intelligence (AI) system is presented with data it has not been exposed to during training. We demonstrate the use of conformal prediction to detect unreliable predictions, using histopathological diagnosis and grading of prostate biopsies as example. We digitized 7788 prostate biopsies from 1192 men in the STHLM3 diagnostic study, used for training, and 3059 biopsies from 676 men used for testing. With conformal prediction, 1 in 794 (0.1%) predictions is incorrect for cancer diagnosis (compared to 14 errors [2%] without conformal prediction) while 175 (22%) of the predictions are flagged as unreliable when the AI-system is presented with new data from the same lab and scanner that it was trained on. Conformal prediction could with small samples (N = 49 for external scanner, N = 10 for external lab and scanner, and N = 12 for external lab, scanner and pathology assessment) detect systematic differences in external data leading to worse predictive performance. The AI-system with conformal prediction commits 3 (2%) errors for cancer detection in cases of atypical prostate tissue compared to 44 (25%) without conformal prediction, while the system flags 143 (80%) unreliable predictions. We conclude that conformal prediction can increase patient safety of AI-systems.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Artificial Intelligence*
Biopsy
Humans
Male
Neoplasms*
Prostate
Uncertainty