Assessment of image quality on the diagnostic performance of clinicians and deep learning models: Cross-sectional comparative reader study

A I Oloruntoba; M Asghari-Jafarabadi; M Sashindranath; Å Ingvar; N R Adler; C Vico-Alonso; L Niklasson; A L Caixinha; E Hiscutt; Z Holmes; K B Assersen; S Adamson; T Jegathees; T Bertelsen; V Velasco-Tamariz; T Helkkula; S Kristiansen; R Toholka; M S Goh; A Chamberlain; C McCormack; T Vestergaard; D Mehta; T D Nguyen; Z Ge; H P Soyer; V Mar

doi:10.1111/jdv.20462

Assessment of image quality on the diagnostic performance of clinicians and deep learning models: Cross-sectional comparative reader study

J Eur Acad Dermatol Venereol. 2024 Dec 10. doi: 10.1111/jdv.20462. Online ahead of print.

Authors

A I Oloruntoba¹, M Asghari-Jafarabadi¹, M Sashindranath¹, Å Ingvar^{1

2

3

4}, N R Adler^{1

2}, C Vico-Alonso¹, L Niklasson⁵, A L Caixinha⁶, E Hiscutt², Z Holmes², K B Assersen⁵, S Adamson², T Jegathees², T Bertelsen^{6

7}, V Velasco-Tamariz⁷, T Helkkula^{3

4}, S Kristiansen^{3

4}, R Toholka⁸, M S Goh⁸, A Chamberlain², C McCormack⁸, T Vestergaard⁵, D Mehta⁹, T D Nguyen⁹, Z Ge^{9

10

11}, H P Soyer¹², V Mar^{1

2}

Affiliations

¹ School of Public Health and Preventive Medicine, Monash University, Melbourne, Victoria, Australia.
² Victorian Melanoma Service, Alfred Health, Melbourne, Victoria, Australia.
³ Department of Dermatology, Skåne University Hospital, Lund, Sweden.
⁴ Department of Clinical Sciences, Lund University, Lund, Sweden.
⁵ Department of Dermatology and Allergy Centre, Odense University Hospital, Odense, Denmark.
⁶ Department of Dermatology and Venereology, Aarhus University Hospital, Aarhus, Denmark.
⁷ Department of Dermatology, Hospital Universitario 12 de Octubre, Madrid, Spain.
⁸ Department of Surgical Oncology (Dermatology), Peter MacCallum Cancer Centre, Melbourne, Victoria, Australia.
⁹ Monash eResearch Centre, Monash University, Clayton, Melbourne, Victoria, Australia.
¹⁰ Airdoc-Monash Research, Monash University, Clayton, Melbourne, Victoria, Australia.
¹¹ NVIDIA Artificial Intelligence Tech Centre, Monash University, Clayton, Melbourne, Victoria, Australia.
¹² Frazer Institute, The University of Queensland, Dermatology Research Centre, Brisbane, Queensland, Australia.

PMID: 39655640
DOI: 10.1111/jdv.20462

Abstract

Background: Skin cancer is a prevalent and clinically significant condition, with early and accurate diagnosis being crucial for improved patient outcomes. Dermoscopy and artificial intelligence (AI) hold promise in enhancing diagnostic accuracy. However, the impact of image quality, particularly high dynamic range (HDR) conversion in smartphone images, on diagnostic performance remains poorly understood.

Objective: This study aimed to investigate the effect of varying image qualities, including HDR-enhanced dermoscopic images, on the diagnostic capabilities of clinicians and a convolutional neural network (CNN) model.

Methods: Eighteen dermatology clinicians assessed 303 images of 101 skin lesions that were categorized into three image quality groups: low quality (LQ), high quality (HQ) and enhanced quality (EQ) produced using HDR-style conversion. Clinicians participated in a two part reader study that required their diagnosis, management and confidence level for each image assessed.

Results: In the binary classification of lesions, clinicians had the greatest diagnostic performance with HQ images, with sensitivity (77.3%; CI 69.1-85.5), specificity (63.1%; CI 53.7-72.5) and accuracy (70.2%; CI 61.3-79.1). For the multiclass classification, the overall performance was also best with HQ images, attaining the greatest specificity (91.9%; CI 83.2-95.0) and accuracy (51.5%; CI 48.4-54.7). Clinicians had a superior performance (median correct diagnoses) to the CNN model for the binary classification of LQ and EQ images, but their performance was comparable on the HQ images. However, in the multiclass classification, the CNN model significantly outperformed the clinicians on HQ images (p < 0.01).

Conclusion: This study highlights the importance of image quality on the diagnostic performance of clinicians and deep learning models. This has significant implications for telehealth reporting and triage.