Identifying cross-disease components of genetic risk across hospital data in the UK Biobank

Research output: Contribution to journal › Journal article › Research › peer-review

Adrian Cortes
Patrick K. Albers
Calliope A. Dendrou
Lars Fugger
Gil McVean

Genetic risk factors frequently affect multiple common human diseases, providing insight into shared pathophysiological pathways and opportunities for therapeutic development. However, systematic identification of genetic profiles of disease risk is limited both by the availability of comprehensive clinical data on population-scale cohorts and by the lack of suitable statistical methodology that can handle the scale of, and the differential power inherent in, multi-phenotype data. Here, we develop a disease-agnostic approach to cluster the genetic risk profiles for 3,025 genome-wide independent loci across 19,155 disease classification codes from 320,644 participants in the UK Biobank, representing a large and heterogeneous population. We identify 339 distinct disease association profiles and use multiple approaches to link clusters to the underlying biological pathways. We show how clusters can decompose the variance and covariance in risk for disease, thereby identifying underlying biological processes and their impact. We demonstrate the use of clusters in defining disease relationships and their potential in informing therapeutic strategies.

Original language	English
Journal	Nature Genetics
Volume	52
Issue number	1
Pages (from-to)	126-134
Number of pages	9
ISSN	1061-4036
DOIs	https://doi.org/10.1038/s41588-019-0550-4
Publication status	Published - 2020
