Leveraging genetic correlations and multiple populations to improve genetic risk prediction for non-European populations

Res Sq [Preprint]. 2023 Dec 25:rs.3.rs-3741763. doi: 10.21203/rs.3.rs-3741763/v1.

Abstract

The disparity in genetic risk prediction accuracy between European and non-European individuals highlights a critical challenge in health inequality. To bridge this gap, we introduce JointPRS, a novel method that models multiple populations jointly to improve genetic risk predictions for non-European individuals. JointPRS has three key features. First, it encompasses all diverse populations to improve prediction accuracy, rather than relying solely on the target population with a singular auxiliary European group. Second, it autonomously estimates and leverages chromosome-wise cross-population genetic correlations to infer the effect sizes of genetic variants. Lastly, it provides an auto version that has comparable performance to the tuning version to accommodate the situation with no validation dataset. Through extensive simulations and real data applications to 22 quantitative traits and four binary traits in East Asian populations, nine quantitative traits and one binary trait in African populations, and four quantitative traits in South Asian populations, we demonstrate that JointPRS outperforms state-of-art methods, improving the prediction accuracy for both quantitative and binary traits in non-European populations.

Keywords: Bayesian high-dimensional regression; Continuous shrinkage prior; Cross-population genetic correlation; Multi-population genetic risk prediction.

Publication types

  • Preprint