networkGWAS: a network-based approach to discover genetic associations

Giulia Muzio; Leslie O'Bray; Laetitia Meng-Papaxanthos; Juliane Klatt; Krista Fischer; Karsten Borgwardt

doi:10.1093/bioinformatics/btad370

networkGWAS: a network-based approach to discover genetic associations

Bioinformatics. 2023 Jun 1;39(6):btad370. doi: 10.1093/bioinformatics/btad370.

Authors

Giulia Muzio^{1

2}, Leslie O'Bray^{1

2}, Laetitia Meng-Papaxanthos^{1

2

3}, Juliane Klatt^{1

2}, Krista Fischer^{4

5}, Karsten Borgwardt^{1

2

6}

Affiliations

¹ Machine Learning and Computational Biology Lab, Department of Biosystems Science and Engineering, ETH Zurich, 4058 Basel, Switzerland.
² Swiss Institute for Bioinformatics (SIB), 1015 Lausanne, Switzerland.
³ Google Research, Brain Team, 8002 Zürich, Switzerland.
⁴ Institute of Mathematics and Statistics, University of Tartu, 51009 Tartu, Estonia.
⁵ Institute of Genomics, University of Tartu, 51010 Tartu, Estonia.
⁶ Department of Machine Learning and Systems Biology, Max Planck Institute of Biochemistry, 82152 Martinsried, Germany.

Abstract

Motivation: While the search for associations between genetic markers and complex traits has led to the discovery of tens of thousands of trait-related genetic variants, the vast majority of these only explain a small fraction of the observed phenotypic variation. One possible strategy to overcome this while leveraging biological prior is to aggregate the effects of several genetic markers and to test entire genes, pathways or (sub)networks of genes for association to a phenotype. The latter, network-based genome-wide association studies, in particular suffer from a vast search space and an inherent multiple testing problem. As a consequence, current approaches are either based on greedy feature selection, thereby risking that they miss relevant associations, or neglect doing a multiple testing correction, which can lead to an abundance of false positive findings.

Results: To address the shortcomings of current approaches of network-based genome-wide association studies, we propose networkGWAS, a computationally efficient and statistically sound approach to network-based genome-wide association studies using mixed models and neighborhood aggregation. It allows for population structure correction and for well-calibrated P-values, which are obtained through circular and degree-preserving network permutations. networkGWAS successfully detects known associations on diverse synthetic phenotypes, as well as known and novel genes in phenotypes from Saccharomycescerevisiae and Homo sapiens. It thereby enables the systematic combination of gene-based genome-wide association studies with biological network information.

Availability and implementation: https://github.com/BorgwardtLab/networkGWAS.git.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Genetic Markers
Genome-Wide Association Study*
Humans
Phenotype
Polymorphism, Single Nucleotide
Population Groups*

Substances

Genetic Markers