Systematic single-variant and gene-based association testing of thousands of phenotypes in 394,841 UK Biobank exomes

Cell Genom. 2022 Aug 15;2(9):100168. doi: 10.1016/j.xgen.2022.100168. eCollection 2022 Sep 14.

Abstract

Genome-wide association studies have successfully discovered thousands of common variants associated with human diseases and traits, but the landscape of rare variation in human disease has not been explored at scale. Exome-sequencing studies of population biobanks provide an opportunity to systematically evaluate the impact of rare coding variation across a wide range of phenotypes and to discover genes and allelic series relevant to human health and disease. Here, we present results from systematic association analyses of 4,529 phenotypes using single-variant and gene-based tests in 394,841 UK Biobank individuals with exome-sequence data. We find that the discovery of genetic associations is tightly linked to frequency and is correlated with metrics of deleteriousness and natural selection. We highlight biological findings elucidated by these data and release the dataset as a public resource alongside the Genebass browser for rapidly exploring rare-variant association results.

Keywords: GWAS; PheWAS; biobanks; exome sequencing; rare variant association studies; rare variants.