Large language multimodal models for new-onset type 2 diabetes prediction using five-year cohort electronic health records

Jun-En Ding; Phan Nguyen Minh Thao; Wen-Chih Peng; Jian-Zhe Wang; Chun-Cheng Chug; Min-Chen Hsieh; Yun-Chien Tseng; Ling Chen; Dongsheng Luo; Chenwei Wu; Chi-Te Wang; Chih-Ho Hsu; Yi-Tui Chen; Pei-Fu Chen; Feng Liu; Fang-Ming Hung

doi:10.1038/s41598-024-71020-2

Large language multimodal models for new-onset type 2 diabetes prediction using five-year cohort electronic health records

Sci Rep. 2024 Sep 6;14(1):20774. doi: 10.1038/s41598-024-71020-2.

Authors

Jun-En Ding^#¹, Phan Nguyen Minh Thao^#², Wen-Chih Peng², Jian-Zhe Wang², Chun-Cheng Chug², Min-Chen Hsieh², Yun-Chien Tseng², Ling Chen³, Dongsheng Luo⁴, Chenwei Wu⁵, Chi-Te Wang⁶, Chih-Ho Hsu⁷, Yi-Tui Chen⁸, Pei-Fu Chen^{9

10}, Feng Liu¹, Fang-Ming Hung^{11

12}

Affiliations

¹ School of Systems and Enterprises, Stevens Institute of Technology, Hoboken, USA.
² Department of Computer Science, National Yang Ming Chiao Tung University, Hsinchu City, Taiwan.
³ Institute of Hospital and Health Care Administration, National Yang Ming Chiao Tung University, Taipei City, Taiwan.
⁴ School of Computing and Information Science, Florida International University, Miami, USA.
⁵ Electrical Engineering and Computer Science, University of Michigan, Ann Arbor, MI, USA.
⁶ Center of Artificial Intelligence, Far Eastern Memorial Hospital, New Taipei City, Taiwan.
⁷ Department of Surgery, Far Eastern Memorial Hospital, New Taipei City, Taiwan.
⁸ Smart Healthcare Interdisciplinary College, National Taipei University of Nursing and Health Sciences, Taipei City, Taiwan.
⁹ Department of Anesthesiology, Far Eastern Memorial Hospital, New Taipei City, Taiwan.
¹⁰ Department of Electrical Engineering, Yuan Ze University, Taoyuan, Taiwan.
¹¹ Surgical Trauma Intensive Care Unit, Far Eastern Memorial Hospital, New Taipei City, Taiwan. philip@mail.femh.org.tw.
¹² Smart Healthcare Interdisciplinary College, National Taipei University of Nursing and Health Sciences, Taipei City, Taiwan. philip@mail.femh.org.tw.

^# Contributed equally.

Abstract

Type 2 diabetes mellitus (T2DM) is a prevalent health challenge faced by countries worldwide. In this study, we propose a novel large language multimodal models (LLMMs) framework incorporating multimodal data from clinical notes and laboratory results for diabetes risk prediction. We collected five years of electronic health records (EHRs) dating from 2017 to 2021 from a Taiwan hospital database. This dataset included 1,420,596 clinical notes, 387,392 laboratory results, and more than 1505 laboratory test items. Our method combined a text embedding encoder and multi-head attention layer to learn laboratory values, and utilized a deep neural network (DNN) module to merge blood features with chronic disease semantics into a latent space. In our experiments, we observed that integrating clinical notes with predictions based on textual laboratory values significantly enhanced the predictive capability of the unimodal model in the early detection of T2DM. Moreover, we achieved an area greater than 0.70 under the receiver operating characteristic curve (AUC) for new-onset T2DM prediction, demonstrating the effectiveness of leveraging textual laboratory data for training and inference in LLMs and improving the accuracy of new-onset diabetes prediction.

MeSH terms

Cohort Studies
Databases, Factual
Deep Learning
Diabetes Mellitus, Type 2* / epidemiology
Electronic Health Records*
Female
Humans
Male
Middle Aged
Neural Networks, Computer
ROC Curve
Taiwan / epidemiology