Dementia prediction in the general population using clinically accessible variables: a proof-of-concept study using machine learning. The AGES-Reykjavik study

Emma L Twait; Constanza L Andaur Navarro; Vilmunur Gudnason; Yi-Han Hu; Lenore J Launer; Mirjam I Geerlings

doi:10.1186/s12911-023-02244-x

Dementia prediction in the general population using clinically accessible variables: a proof-of-concept study using machine learning. The AGES-Reykjavik study

BMC Med Inform Decis Mak. 2023 Aug 28;23(1):168. doi: 10.1186/s12911-023-02244-x.

Authors

Emma L Twait^{1

2

3

4}, Constanza L Andaur Navarro¹, Vilmunur Gudnason^{5

6}, Yi-Han Hu⁷, Lenore J Launer⁷, Mirjam I Geerlings^{8

9

10

11

12}

Affiliations

¹ Department of Epidemiology, Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht and Utrecht University, Utrecht, the Netherlands.
² Department of General Practice, Amsterdam UMC, location Vrije Universiteit Amsterdam, De Boelelaan 1117, Amsterdam, the Netherlands.
³ Amsterdam Public Health, Aging & Later life and Personalized Medicine, Amsterdam, the Netherlands.
⁴ Amsterdam Neuroscience, Neurodegeneration and Mood, Anxiety, Psychosis, Stress, and Sleep, Amsterdam, the Netherlands.
⁵ Faculty of Medicine, University of Iceland, Reykjavik, Iceland.
⁶ The Icelandic Heart Association, Kopavogur, Iceland.
⁷ Laboratory of Epidemiology and Population Sciences, National Institute on Aging, Baltimore, MD, USA.
⁸ Department of Epidemiology, Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht and Utrecht University, Utrecht, the Netherlands. m.i.geerlings@amsterdamumc.nl.
⁹ Amsterdam Public Health, Aging & Later life and Personalized Medicine, Amsterdam, the Netherlands. m.i.geerlings@amsterdamumc.nl.
¹⁰ Amsterdam Neuroscience, Neurodegeneration and Mood, Anxiety, Psychosis, Stress, and Sleep, Amsterdam, the Netherlands. m.i.geerlings@amsterdamumc.nl.
¹¹ Laboratory of Epidemiology and Population Sciences, National Institute on Aging, Baltimore, MD, USA. m.i.geerlings@amsterdamumc.nl.
¹² Department of General Practice, Amsterdam UMC, location University of Amsterdam, Meibergdreef 9, Amsterdam, the Netherlands. m.i.geerlings@amsterdamumc.nl.

Abstract

Background: Early identification of dementia is crucial for prompt intervention for high-risk individuals in the general population. External validation studies on prognostic models for dementia have highlighted the need for updated models. The use of machine learning in dementia prediction is in its infancy and may improve predictive performance. The current study aimed to explore the difference in performance of machine learning algorithms compared to traditional statistical techniques, such as logistic and Cox regression, for prediction of all-cause dementia. Our secondary aim was to assess the feasibility of only using clinically accessible predictors rather than MRI predictors.

Methods: Data are from 4,793 participants in the population-based AGES-Reykjavik Study without dementia or mild cognitive impairment at baseline (mean age: 76 years, % female: 59%). Cognitive, biometric, and MRI assessments (total: 59 variables) were collected at baseline, with follow-up of incident dementia diagnoses for a maximum of 12 years. Machine learning algorithms included elastic net regression, random forest, support vector machine, and elastic net Cox regression. Traditional statistical methods for comparison were logistic and Cox regression. Model 1 was fit using all variables and model 2 was after feature selection using the Boruta package. A third model explored performance when leaving out neuroimaging markers (clinically accessible model). Ten-fold cross-validation, repeated ten times, was implemented during training. Upsampling was used to account for imbalanced data. Tuning parameters were optimized for recalibration automatically using the caret package in R.

Results: 19% of participants developed all-cause dementia. Machine learning algorithms were comparable in performance to logistic regression in all three models. However, a slight added performance was observed in the elastic net Cox regression in the third model (c = 0.78, 95% CI: 0.78-0.78) compared to the traditional Cox regression (c = 0.75, 95% CI: 0.74-0.77).

Conclusions: Supervised machine learning only showed added benefit when using survival techniques. Removing MRI markers did not significantly worsen our model's performance. Further, we presented the use of a nomogram using machine learning methods, showing transportability for the use of machine learning models in clinical practice. External validation is needed to assess the use of this model in other populations. Identifying high-risk individuals will amplify prevention efforts and selection for clinical trials.

Keywords: Dementia; Machine learning; Prediction model.

Publication types

Research Support, N.I.H., Intramural
Research Support, Non-U.S. Gov't
Research Support, N.I.H., Extramural

MeSH terms

Aged
Algorithms
Dementia* / diagnosis
Dementia* / epidemiology
Female
Humans
Machine Learning*
Male
Proof of Concept Study
Supervised Machine Learning

Abstract

Publication types

MeSH terms

Grants and funding