Use of an electronic medical record for the identification of research subjects with diabetes mellitus

Clin Med Res. 2007 Mar;5(1):1-7. doi: 10.3121/cmr.2007.726.

Abstract

Diabetes mellitus is a rapidly increasing and costly public health problem. Large studies are needed to understand the complex gene-environment interactions that lead to diabetes and its complications. The Marshfield Clinic Personalized Medicine Research Project (PMRP) represents one of the largest population-based DNA biobanks in the United States. As part of an effort to begin phenotyping common diseases within the PMRP, we now report on the construction of a diabetes case-finding algorithm using electronic medical record data from adult subjects aged ≥50 years living in one of the target PMRP ZIP codes. Based upon diabetic diagnostic codes alone, we observed a false-positive case rate ranging from 3.0% (in subjects with the highest glycosylated hemoglobin values) to 44.4% (in subjects with the lowest glycosylated hemoglobin values). We therefore developed an improved case-finding algorithm that utilizes diabetic diagnostic codes in combination with clinical laboratory data and medication history. This algorithm yielded an estimated prevalence of 24.2% for diabetes mellitus in adult subjects aged ≥50 years.
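
The abstract does not state the algorithm's actual rules or thresholds, so the following Python sketch only illustrates the general idea of combining diagnostic codes with laboratory data and medication history. Every name and cutoff in it (`SubjectRecord`, `is_probable_case`, `ASSUMED_HBA1C_CUTOFF`, the one-code minimum) is a hypothetical assumption made for illustration, not the published PMRP method.

```python
from dataclasses import dataclass
from typing import List, Optional

# Illustrative assumptions only; the abstract does not report these values.
ASSUMED_MIN_AGE = 50            # study population: adults aged >= 50 years
ASSUMED_MIN_CODE_COUNT = 1      # hypothetical minimum number of diabetic diagnostic codes
ASSUMED_HBA1C_CUTOFF = 6.5      # hypothetical glycosylated hemoglobin cutoff, in percent


@dataclass
class SubjectRecord:
    """Hypothetical summary of one subject's electronic medical record."""
    subject_id: str
    age: int
    diabetes_code_count: int          # number of diabetic diagnostic codes on file
    max_hba1c: Optional[float]        # highest recorded HbA1c (%), None if never measured
    on_diabetes_medication: bool      # any diabetes medication in the medication history


def is_probable_case(rec: SubjectRecord) -> bool:
    """Flag a subject only when a diagnostic code is corroborated by
    laboratory data or medication history (assumed rule, for illustration)."""
    if rec.age < ASSUMED_MIN_AGE:
        return False
    has_code = rec.diabetes_code_count >= ASSUMED_MIN_CODE_COUNT
    lab_support = rec.max_hba1c is not None and rec.max_hba1c >= ASSUMED_HBA1C_CUTOFF
    return has_code and (lab_support or rec.on_diabetes_medication)


def estimated_prevalence(records: List[SubjectRecord]) -> float:
    """Share of age-eligible subjects flagged as probable cases."""
    eligible = [r for r in records if r.age >= ASSUMED_MIN_AGE]
    if not eligible:
        return 0.0
    return sum(is_probable_case(r) for r in eligible) / len(eligible)


# Made-up records, not study data.
records = [
    SubjectRecord("A", 60, 3, 8.1, True),    # code + lab + medication
    SubjectRecord("B", 62, 1, 5.4, False),   # code only, no corroboration
    SubjectRecord("C", 55, 0, None, False),  # no evidence of diabetes
    SubjectRecord("D", 45, 2, 9.0, True),    # below the age cutoff
]
flagged = [r.subject_id for r in records if is_probable_case(r)]
```

In this sketch, subject "B" carries a diagnostic code but has a low glycosylated hemoglobin value and no medication history, which is the kind of record the abstract identifies as most prone to false positives when codes are used alone.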

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adult
  • Algorithms
  • Biomarkers / chemistry
  • Cohort Studies
  • DNA / chemistry
  • Diabetes Mellitus / diagnosis*
  • Diabetes Mellitus / epidemiology*
  • False Positive Reactions
  • Glycosylation
  • Humans
  • Medical Records Systems, Computerized*
  • Middle Aged
  • Natural Language Processing
  • Phenotype
  • Prevalence
  • Sulfonylurea Compounds / metabolism

Substances

  • Biomarkers
  • Sulfonylurea Compounds
  • DNA