Deep integrative models for large-scale human genomics

Research output: Contribution to journalJournal articleResearchpeer-review

Standard

Deep integrative models for large-scale human genomics. / Sigurdsson, Arnór I; Louloudis, Ioannis; Banasik, Karina; Westergaard, David; Winther, Ole; Lund, Ole; Ostrowski, Sisse Rye; Erikstrup, Christian; Pedersen, Ole Birger; Nyegaard, Mette; Brunak, Søren; Vilhjálmsson, Bjarni J.; Rasmussen, Simon.

In: Nucleic Acids Symposium Series, Vol. 51, No. 12, 2023.

Research output: Contribution to journalJournal articleResearchpeer-review

Harvard

Sigurdsson, AI, Louloudis, I, Banasik, K, Westergaard, D, Winther, O, Lund, O, Ostrowski, SR, Erikstrup, C, Pedersen, OB, Nyegaard, M, Brunak, S, Vilhjálmsson, BJ & Rasmussen, S 2023, 'Deep integrative models for large-scale human genomics', Nucleic Acids Symposium Series, vol. 51, no. 12. https://doi.org/10.1093/nar/gkad373

APA

Sigurdsson, A. I., Louloudis, I., Banasik, K., Westergaard, D., Winther, O., Lund, O., Ostrowski, S. R., Erikstrup, C., Pedersen, O. B., Nyegaard, M., Brunak, S., Vilhjálmsson, BJ., & Rasmussen, S. (2023). Deep integrative models for large-scale human genomics. Nucleic Acids Symposium Series, 51(12). https://doi.org/10.1093/nar/gkad373

Vancouver

Sigurdsson AI, Louloudis I, Banasik K, Westergaard D, Winther O, Lund O et al. Deep integrative models for large-scale human genomics. Nucleic Acids Symposium Series. 2023;51(12). https://doi.org/10.1093/nar/gkad373

Author

Sigurdsson, Arnór I ; Louloudis, Ioannis ; Banasik, Karina ; Westergaard, David ; Winther, Ole ; Lund, Ole ; Ostrowski, Sisse Rye ; Erikstrup, Christian ; Pedersen, Ole Birger ; Nyegaard, Mette ; Brunak, Søren ; Vilhjálmsson, Bjarni J. ; Rasmussen, Simon. / Deep integrative models for large-scale human genomics. In: Nucleic Acids Symposium Series. 2023 ; Vol. 51, No. 12.

Bibtex

@article{a4bff77691434f4b8cdbec9d06cabedc,
title = "Deep integrative models for large-scale human genomics",
abstract = "Polygenic risk scores (PRSs) are expected to play a critical role in precision medicine. Currently, PRS predictors are generally based on linear models using summary statistics, and more recently individual-level data. However, these predictors mainly capture additive relationships and are limited in data modalities they can use. We developed a deep learning framework (EIR) for PRS prediction which includes a model, genome-local-net (GLN), specifically designed for large-scale genomics data. The framework supports multi-task learning, automatic integration of other clinical and biochemical data, and model explainability. When applied to individual-level data from the UK Biobank, the GLN model demonstrated a competitive performance compared to established neural network architectures, particularly for certain traits, showcasing its potential in modeling complex genetic relationships. Furthermore, the GLN model outperformed linear PRS methods for Type 1 Diabetes, likely due to modeling non-additive genetic effects and epistasis. This was supported by our identification of widespread non-additive genetic effects and epistasis in the context of T1D. Finally, we constructed PRS models that integrated genotype, blood, urine, and anthropometric data and found that this improved performance for 93% of the 290 diseases and disorders considered. EIR is available at https://github.com/arnor-sigurdsson/EIR.",
author = "Sigurdsson, {Arn{\'o}r I} and Ioannis Louloudis and Karina Banasik and David Westergaard and Ole Winther and Ole Lund and Ostrowski, {Sisse Rye} and Christian Erikstrup and Pedersen, {Ole Birger} and Mette Nyegaard and S{\o}ren Brunak and Bjarni J. Vilhj{\'a}lmsson and Simon Rasmussen",
year = "2023",
doi = "10.1093/nar/gkad373",
language = "English",
volume = "51",
journal = "Nucleic acids symposium series",
issn = "0261-3166",
publisher = "Oxford University Press",
number = "12",

}

RIS

TY - JOUR

T1 - Deep integrative models for large-scale human genomics

AU - Sigurdsson, Arnór I

AU - Louloudis, Ioannis

AU - Banasik, Karina

AU - Westergaard, David

AU - Winther, Ole

AU - Lund, Ole

AU - Ostrowski, Sisse Rye

AU - Erikstrup, Christian

AU - Pedersen, Ole Birger

AU - Nyegaard, Mette

AU - Brunak, Søren

AU - Vilhjálmsson, Bjarni J.

AU - Rasmussen, Simon

PY - 2023

Y1 - 2023

N2 - Polygenic risk scores (PRSs) are expected to play a critical role in precision medicine. Currently, PRS predictors are generally based on linear models using summary statistics, and more recently individual-level data. However, these predictors mainly capture additive relationships and are limited in data modalities they can use. We developed a deep learning framework (EIR) for PRS prediction which includes a model, genome-local-net (GLN), specifically designed for large-scale genomics data. The framework supports multi-task learning, automatic integration of other clinical and biochemical data, and model explainability. When applied to individual-level data from the UK Biobank, the GLN model demonstrated a competitive performance compared to established neural network architectures, particularly for certain traits, showcasing its potential in modeling complex genetic relationships. Furthermore, the GLN model outperformed linear PRS methods for Type 1 Diabetes, likely due to modeling non-additive genetic effects and epistasis. This was supported by our identification of widespread non-additive genetic effects and epistasis in the context of T1D. Finally, we constructed PRS models that integrated genotype, blood, urine, and anthropometric data and found that this improved performance for 93% of the 290 diseases and disorders considered. EIR is available at https://github.com/arnor-sigurdsson/EIR.

AB - Polygenic risk scores (PRSs) are expected to play a critical role in precision medicine. Currently, PRS predictors are generally based on linear models using summary statistics, and more recently individual-level data. However, these predictors mainly capture additive relationships and are limited in data modalities they can use. We developed a deep learning framework (EIR) for PRS prediction which includes a model, genome-local-net (GLN), specifically designed for large-scale genomics data. The framework supports multi-task learning, automatic integration of other clinical and biochemical data, and model explainability. When applied to individual-level data from the UK Biobank, the GLN model demonstrated a competitive performance compared to established neural network architectures, particularly for certain traits, showcasing its potential in modeling complex genetic relationships. Furthermore, the GLN model outperformed linear PRS methods for Type 1 Diabetes, likely due to modeling non-additive genetic effects and epistasis. This was supported by our identification of widespread non-additive genetic effects and epistasis in the context of T1D. Finally, we constructed PRS models that integrated genotype, blood, urine, and anthropometric data and found that this improved performance for 93% of the 290 diseases and disorders considered. EIR is available at https://github.com/arnor-sigurdsson/EIR.

U2 - 10.1093/nar/gkad373

DO - 10.1093/nar/gkad373

M3 - Journal article

C2 - 37224538

VL - 51

JO - Nucleic acids symposium series

JF - Nucleic acids symposium series

SN - 0261-3166

IS - 12

ER -

ID: 358092861