DeepSeeNet: A Deep Learning Model for Automated Classification of Patient-based Age-related Macular Degeneration Severity from Color Fundus Photographs

Yifan Peng; Shazia Dharssi; Qingyu Chen; Tiarnan D Keenan; Elvira Agrón; Wai T Wong; Emily Y Chew; Zhiyong Lu

doi:10.1016/j.ophtha.2018.11.015

DeepSeeNet: A Deep Learning Model for Automated Classification of Patient-based Age-related Macular Degeneration Severity from Color Fundus Photographs

Ophthalmology. 2019 Apr;126(4):565-575. doi: 10.1016/j.ophtha.2018.11.015. Epub 2018 Nov 22.

Authors

Yifan Peng¹, Shazia Dharssi², Qingyu Chen¹, Tiarnan D Keenan³, Elvira Agrón³, Wai T Wong³, Emily Y Chew⁴, Zhiyong Lu⁵

Affiliations

¹ National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland.
² National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland; National Eye Institute, National Institutes of Health, Bethesda, Maryland.
³ National Eye Institute, National Institutes of Health, Bethesda, Maryland.
⁴ National Eye Institute, National Institutes of Health, Bethesda, Maryland. Electronic address: echew@nei.nih.gov.
⁵ National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland. Electronic address: zhiyong.lu@nih.gov.

Abstract

Purpose: In assessing the severity of age-related macular degeneration (AMD), the Age-Related Eye Disease Study (AREDS) Simplified Severity Scale predicts the risk of progression to late AMD. However, its manual use requires the time-consuming participation of expert practitioners. Although several automated deep learning systems have been developed for classifying color fundus photographs (CFP) of individual eyes by AREDS severity score, none to date has used a patient-based scoring system that uses images from both eyes to assign a severity score.

Design: DeepSeeNet, a deep learning model, was developed to classify patients automatically by the AREDS Simplified Severity Scale (score 0-5) using bilateral CFP.

Participants: DeepSeeNet was trained on 58 402 and tested on 900 images from the longitudinal follow-up of 4549 participants from AREDS. Gold standard labels were obtained using reading center grades.

Methods: DeepSeeNet simulates the human grading process by first detecting individual AMD risk factors (drusen size, pigmentary abnormalities) for each eye and then calculating a patient-based AMD severity score using the AREDS Simplified Severity Scale.

Main outcome measures: Overall accuracy, specificity, sensitivity, Cohen's kappa, and area under the curve (AUC). The performance of DeepSeeNet was compared with that of retinal specialists.

Results: DeepSeeNet performed better on patient-based classification (accuracy = 0.671; kappa = 0.558) than retinal specialists (accuracy = 0.599; kappa = 0.467) with high AUC in the detection of large drusen (0.94), pigmentary abnormalities (0.93), and late AMD (0.97). DeepSeeNet also outperformed retinal specialists in the detection of large drusen (accuracy 0.742 vs. 0.696; kappa 0.601 vs. 0.517) and pigmentary abnormalities (accuracy 0.890 vs. 0.813; kappa 0.723 vs. 0.535) but showed lower performance in the detection of late AMD (accuracy 0.967 vs. 0.973; kappa 0.663 vs. 0.754).

Conclusions: By simulating the human grading process, DeepSeeNet demonstrated high accuracy with increased transparency in the automated assignment of individual patients to AMD risk categories based on the AREDS Simplified Severity Scale. These results highlight the potential of deep learning to assist and enhance clinical decision-making in patients with AMD, such as early AMD detection and risk prediction for developing late AMD. DeepSeeNet is publicly available on https://github.com/ncbi-nlp/DeepSeeNet.

Publication types

Comparative Study
Multicenter Study
Research Support, N.I.H., Intramural

MeSH terms

Aged
Aged, 80 and over
Area Under Curve
Deep Learning*
Diagnosis, Computer-Assisted / methods*
Diagnostic Techniques, Ophthalmological*
Disease Progression
Female
Geographic Atrophy / classification*
Geographic Atrophy / diagnosis*
Humans
Male
Middle Aged
Models, Theoretical*
Photography / methods*
Prospective Studies
Reproducibility of Results
Retinal Drusen / classification
Retinal Drusen / diagnosis
Risk Factors
Sensitivity and Specificity
Severity of Illness Index

Abstract

Publication types

MeSH terms

Grants and funding