Improved diabetes screening using an extended predictive feature search

Simon Lebech Cichosz; Mette Dencker Johansen; Niels Ejskjaer; Troels Krarup Hansen; Ole K Hejlesen

doi:10.1089/dia.2013.0255

Diabetes Technol Ther. 2014 Mar;16(3):166-71. doi: 10.1089/dia.2013.0255. Epub 2013 Nov 13.

Authors

Simon Lebech Cichosz¹, Mette Dencker Johansen, Niels Ejskjaer, Troels Krarup Hansen, Ole K Hejlesen

Affiliation

¹ Department of Health Science and Technology, Aalborg University, Aalborg, Denmark.

PMID: 24224751
DOI: 10.1089/dia.2013.0255

Abstract

Background: Screening entire populations for diabetes is not cost-effective. Hence, an efficient screening process must select those people who are at high risk for diabetes. In this study, we investigated whether screening procedures could be improved using an extended predictive feature search.

Materials and methods: To develop our model and identify persons with prevalent diabetes, we used data from the 2005-2010 cycles of the National Health and Nutrition Examination Survey, which had not been explored for this purpose before. We evaluated all combinations of predictors to identify the optimal subset, and we used a linear logistic classification model to predict diabetes. V-fold cross-validation was used both for selecting variables and for validating the final models. The new model was compared with two established models.
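The search described above can be illustrated with a minimal sketch. It is not the authors' implementation: the data are synthetic, the predictor names (`x0`, `x1`, `x2`) are hypothetical, the logistic model is fitted by plain gradient descent, and mean cross-validated AUC is assumed as the selection criterion, which the abstract does not specify.

```python
import itertools
import math
import random


def fit_logistic(X, y, lr=0.5, epochs=100):
    """Fit logistic regression by batch gradient descent; returns [bias, w1, ...]."""
    n, d = len(X), len(X[0])
    w = [0.0] * (d + 1)
    for _ in range(epochs):
        grad = [0.0] * (d + 1)
        for xi, yi in zip(X, y):
            z = w[0] + sum(wj * xj for wj, xj in zip(w[1:], xi))
            err = 1.0 / (1.0 + math.exp(-z)) - yi
            grad[0] += err
            for j, xj in enumerate(xi):
                grad[j + 1] += err * xj
        w = [wj - lr * gj / n for wj, gj in zip(w, grad)]
    return w


def linear_score(w, xi):
    """Linear predictor; it ranks cases the same way as the predicted probability."""
    return w[0] + sum(wj * xj for wj, xj in zip(w[1:], xi))


def auc(scores, labels):
    """AUC as the chance that a random positive outranks a random negative."""
    pos = [s for s, l in zip(scores, labels) if l == 1]
    neg = [s for s, l in zip(scores, labels) if l == 0]
    wins = sum((p > q) + 0.5 * (p == q) for p in pos for q in neg)
    return wins / (len(pos) * len(neg))


def cv_auc(X, y, cols, v=5, seed=0):
    """Mean held-out AUC over v folds for one subset of predictor columns."""
    idx = list(range(len(X)))
    random.Random(seed).shuffle(idx)
    folds = [idx[k::v] for k in range(v)]
    aucs = []
    for k in range(v):
        train = [i for j, fold in enumerate(folds) if j != k for i in fold]
        w = fit_logistic([[X[i][c] for c in cols] for i in train],
                         [y[i] for i in train])
        scores = [linear_score(w, [X[i][c] for c in cols]) for i in folds[k]]
        aucs.append(auc(scores, [y[i] for i in folds[k]]))
    return sum(aucs) / v


def best_subset(X, y, names, v=5):
    """Score every non-empty predictor combination and keep the best one."""
    best_score, best_cols = -1.0, ()
    for r in range(1, len(names) + 1):
        for cols in itertools.combinations(range(len(names)), r):
            score = cv_auc(X, y, cols, v)
            if score > best_score:
                best_score, best_cols = score, cols
    return best_score, [names[c] for c in best_cols]


# Synthetic stand-in data (assumption): x0 and x1 carry signal, x2 is noise.
rng = random.Random(42)
names = ["x0", "x1", "x2"]
X = [[rng.gauss(0, 1) for _ in names] for _ in range(200)]
y = [1 if rng.random() < 1 / (1 + math.exp(-(2 * row[0] + row[1]))) else 0
     for row in X]

best_auc, best_names = best_subset(X, y, names)
print(best_auc, best_names)
```

The number of subsets grows as 2^p - 1 for p candidate predictors, so an exhaustive search of this kind is only practical for a modest number of candidates.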

Results: In total, 5,398 participants were included in this study. Among these, 478 participants had unidentified diabetes. The established models had areas under the receiver operating characteristic curve (AUC) of 0.74 and 0.71, compared with an AUC of 0.78 for the new model, a significant difference (P<0.05). At a proposed cutoff point, the established models yielded sensitivities/specificities of 63%/72% and 40%/72%, respectively, compared with 70%/72% for the new model.

Conclusions: Our data indicate that, when predictor combinations are investigated extensively, simple healthcare and economic information, such as the ratio of family income to poverty, can add value in deciding who is at risk of unknown diabetes.

Publication types

Evaluation Study

MeSH terms

Adult
Area Under Curve
Blood Glucose / metabolism*
Cost-Benefit Analysis
Diabetes Mellitus, Type 2 / diagnosis*
Fasting / blood
Feasibility Studies
Female
Humans
Logistic Models
Male
Mass Screening* / economics
Mass Screening* / methods
Middle Aged
Nutrition Surveys
Patient Selection
Predictive Value of Tests
Prevalence
Risk Assessment
Risk Factors
Sensitivity and Specificity
Socioeconomic Factors
Waist Circumference*

Substances

Blood Glucose