Proteogenomic analysis integrated with electronic health records data reveals disease-associated variants in Black Americans

J Clin Invest. 2024 Sep 24;134(21):e181802. doi: 10.1172/JCI181802.

Abstract

BACKGROUND: Most GWAS of plasma proteomics have focused on White individuals of European ancestry, limiting biological insight from other ancestry-enriched protein quantitative trait loci (pQTLs).

METHODS: We conducted a discovery GWAS of approximately 3,000 plasma proteins measured by the antibody-based Olink platform in 1,054 Black adults from the Jackson Heart Study (JHS) and validated our findings in the Multi-Ethnic Study of Atherosclerosis (MESA). The genetic architecture of identified pQTLs was further explored through fine mapping and admixture association analysis. Finally, using our pQTL findings, we performed a phenome-wide association study (PheWAS) across 2 large multiethnic electronic health record (EHR) systems in All of Us and BioMe.

RESULTS: We identified 1,002 pQTLs for 925 protein assays. Fine mapping and admixture analyses suggested allelic heterogeneity of the plasma proteome across diverse populations. We identified associations for variants enriched in African ancestry, many in diseases that lack precise biomarkers, including cis-pQTLs for cathepsin L (CTSL) and Siglec-9, which were linked with sarcoidosis and non-Hodgkin's lymphoma, respectively. We found concordant associations across clinical diagnoses and laboratory measurements, elucidating disease pathways, including a cis-pQTL associated with circulating CD58, WBC count, and multiple sclerosis.

CONCLUSIONS: Our findings emphasize the value of leveraging diverse populations to enhance biological insights from proteomics GWAS, and we have made this resource readily available as an interactive web portal.

FUNDING: NIH K08 HL161445-01A1; 5T32HL160522-03; HHSN268201600034I; HL133870.

Keywords: Genetic diseases; Genetics; Immunology; Population genetics; Proteomics.

MeSH terms

  • Adult
  • Aged
  • Black or African American* / genetics
  • Blood Proteins / genetics
  • Electronic Health Records*
  • Female
  • Genome-Wide Association Study*
  • Humans
  • Male
  • Middle Aged
  • Proteogenomics* / methods
  • Quantitative Trait Loci

Substances

  • Blood Proteins

Grants and funding