Evaluation of polygenic scoring methods in five biobanks shows larger variation between biobanks than methods and finds benefits of ensemble learning

Remo Monti; Lisa Eick; Georgi Hudjashov; Kristi Läll; Stavroula Kanoni; Brooke N Wolford; Benjamin Wingfield; Oliver Pain; Sophie Wharrie; Bradley Jermy; Aoife McMahon; Tuomo Hartonen; Henrike Heyne; Nina Mars; Samuel Lambert; Genes and Health Research Team; Kristian Hveem; Michael Inouye; David A van Heel; Reedik Mägi; Pekka Marttinen; Samuli Ripatti; Andrea Ganna; Christoph Lippert

doi:10.1016/j.ajhg.2024.06.003

Evaluation of polygenic scoring methods in five biobanks shows larger variation between biobanks than methods and finds benefits of ensemble learning

Am J Hum Genet. 2024 Jul 11;111(7):1431-1447. doi: 10.1016/j.ajhg.2024.06.003. Epub 2024 Jun 21.

Authors

Remo Monti¹, Lisa Eick², Georgi Hudjashov³, Kristi Läll³, Stavroula Kanoni⁴, Brooke N Wolford⁵, Benjamin Wingfield⁶, Oliver Pain⁷, Sophie Wharrie⁸, Bradley Jermy², Aoife McMahon⁶, Tuomo Hartonen², Henrike Heyne⁹, Nina Mars¹⁰, Samuel Lambert¹¹; Genes and Health Research Team; Kristian Hveem¹², Michael Inouye¹³, David A van Heel¹⁴, Reedik Mägi³, Pekka Marttinen⁸, Samuli Ripatti¹⁵, Andrea Ganna¹⁶, Christoph Lippert¹⁷

Affiliations

¹ Hasso Plattner Institute, University of Potsdam, Digital Engineering Faculty, Potsdam, Germany; Max-Delbrück-Center for Molecular Medicine in the Helmholtz Association, Berlin Institute for Medical Systems Biology, Berlin, Germany.
² Institute for Molecular Medicine Finland, Helsinki Institute of Life Science, University of Helsinki, Helsinki, Finland.
³ Estonian Genome Centre, Institute of Genomics, University of Tartu, Tartu, Estonia.
⁴ William Harvey Research Institute, Barts and the London School of Medicine and Dentistry, Queen Mary University of London, London, UK.
⁵ K.G. Jebsen Center for Genetic Epidemiology, Department of Public Health and Nursing, Faculty of Medicine and Health, Norwegian University of Science and Technology, Trondheim, Norway.
⁶ European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK.
⁷ Maurice Wohl Clinical Neuroscience Institute, Department of Basic and Clinical Neuroscience; Institute of Psychiatry, Psychology and Neuroscience; King's College London, London, UK.
⁸ Aalto University, Department of Computer Science, Espoo, Finland.
⁹ Hasso Plattner Institute, University of Potsdam, Digital Engineering Faculty, Potsdam, Germany.
¹⁰ Institute for Molecular Medicine Finland, Helsinki Institute of Life Science, University of Helsinki, Helsinki, Finland; Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA; Stanley Center for Psychiatric Research and Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
¹¹ Levanger Hospital, Nord-Trøndelag Hospital Trust, Levanger, Norway; Cambridge Baker Systems Genomics Initiative, Baker Heart and Diabetes Institute, Melbourne, VIC, Australia; British Heart Foundation Cardiovascular Epidemiology Unit, Department of Public Health and Primary Care, University of Cambridge, Cambridge, UK; British Heart Foundation Cambridge Centre of Research Excellence, School of Clinical Medicine, University of Cambridge, Cambridge, UK.
¹² K.G. Jebsen Center for Genetic Epidemiology, Department of Public Health and Nursing, Faculty of Medicine and Health, Norwegian University of Science and Technology, Trondheim, Norway; Levanger Hospital, Nord-Trøndelag Hospital Trust, Levanger, Norway.
¹³ Cambridge Baker Systems Genomics Initiative, Department of Public Health and Primary Care, University of Cambridge, Cambridge, UK; Cambridge Baker Systems Genomics Initiative, Baker Heart and Diabetes Institute, Melbourne, VIC, Australia; British Heart Foundation Cardiovascular Epidemiology Unit, Department of Public Health and Primary Care, University of Cambridge, Cambridge, UK; Victor Phillip Dahdaleh Heart and Lung Research Institute, University of Cambridge, Cambridge, UK; British Heart Foundation Cambridge Centre of Research Excellence, School of Clinical Medicine, University of Cambridge, Cambridge, UK; Health Data Research UK Cambridge, Wellcome Genome Campus and University of Cambridge, Cambridge, UK.
¹⁴ Blizard Institute, Queen Mary University of London, London, UK.
¹⁵ Institute for Molecular Medicine Finland, Helsinki Institute of Life Science, University of Helsinki, Helsinki, Finland; Department of Public Health, University of Helsinki, Helsinki, Finland; Department of Public Health, University of Helsinki, Helsinki, Finland.
¹⁶ Institute for Molecular Medicine Finland, Helsinki Institute of Life Science, University of Helsinki, Helsinki, Finland; Massachusetts General Hospital and Broad Institute of MIT and Harvard, Cambridge, MA, USA.
¹⁷ Hasso Plattner Institute, University of Potsdam, Digital Engineering Faculty, Potsdam, Germany; Windreich Department of Artificial Intelligence and Human Health, Icahn School of Medicine at Mount Sinai, New York, NY, USA; Hasso Plattner Institute for Digital Health at Mount Sinai, Icahn School of Medicine at Mount Sinai, New York, NY, USA; Department of Diagnostic, Molecular, and Interventional Radiology, Icahn School of Medicine at Mount Sinai, New York, NY, USA. Electronic address: christoph.lippert@hpi.de.

PMID: 38908374
PMCID: PMC11267524 (available on 2025-01-11)
DOI: 10.1016/j.ajhg.2024.06.003

Abstract

Methods of estimating polygenic scores (PGSs) from genome-wide association studies are increasingly utilized. However, independent method evaluation is lacking, and method comparisons are often limited. Here, we evaluate polygenic scores derived via seven methods in five biobank studies (totaling about 1.2 million participants) across 16 diseases and quantitative traits, building on a reference-standardized framework. We conducted meta-analyses to quantify the effects of method choice, hyperparameter tuning, method ensembling, and the target biobank on PGS performance. We found that no single method consistently outperformed all others. PGS effect sizes were more variable between biobanks than between methods within biobanks when methods were well tuned. Differences between methods were largest for the two investigated autoimmune diseases, seropositive rheumatoid arthritis and type 1 diabetes. For most methods, cross-validation was more reliable for tuning hyperparameters than automatic tuning (without the use of target data). For a given target phenotype, elastic net models combining PGS across methods (ensemble PGS) tuned in the UK Biobank provided consistent, high, and cross-biobank transferable performance, increasing PGS effect sizes (β coefficients) by a median of 5.0% relative to LDpred2 and MegaPRS (the two best-performing single methods when tuned with cross-validation). Our interactively browsable online-results and open-source workflow prspipe provide a rich resource and reference for the analysis of polygenic scoring methods across biobanks.

Keywords: GWAS; PGS; autoimmune diseases; biobank studies; cross-biobank analysis; ensemble learning; genetic risk; genetic variability; genome-wide association studies; method evaluation; phenotype prediction; polygenic scores.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Biological Specimen Banks*
Diabetes Mellitus, Type 1 / genetics
Genome-Wide Association Study*
Humans
Machine Learning
Multifactorial Inheritance* / genetics
Phenotype
Polymorphism, Single Nucleotide