Identifying interactions in omics data for clinical biomarker discovery using symbolic regression

Research output: Contribution to journal › Journal article › Research › peer-review

Standard

Identifying interactions in omics data for clinical biomarker discovery using symbolic regression. / Christensen, Niels Johan; Demharter, Samuel; Machado, Meera; Pedersen, Lykke; Salvatore, Marco; Stentoft-Hansen, Valdemar; Iglesias, Miquel Triana.

In: Bioinformatics, Vol. 38, No. 15, 405, 2022, p. 3749-3758.

Research output: Contribution to journal › Journal article › Research › peer-review

Harvard

Christensen, NJ, Demharter, S, Machado, M, Pedersen, L, Salvatore, M, Stentoft-Hansen, V & Iglesias, MT 2022, 'Identifying interactions in omics data for clinical biomarker discovery using symbolic regression', Bioinformatics, vol. 38, no. 15, 405, pp. 3749-3758. https://doi.org/10.1093/bioinformatics/btac405

APA

Christensen, N. J., Demharter, S., Machado, M., Pedersen, L., Salvatore, M., Stentoft-Hansen, V., & Iglesias, M. T. (2022). Identifying interactions in omics data for clinical biomarker discovery using symbolic regression. Bioinformatics, 38(15), 3749-3758. [405]. https://doi.org/10.1093/bioinformatics/btac405

Vancouver

Christensen NJ, Demharter S, Machado M, Pedersen L, Salvatore M, Stentoft-Hansen V et al. Identifying interactions in omics data for clinical biomarker discovery using symbolic regression. Bioinformatics. 2022;38(15):3749-3758. 405. https://doi.org/10.1093/bioinformatics/btac405

Author

Christensen, Niels Johan ; Demharter, Samuel ; Machado, Meera ; Pedersen, Lykke ; Salvatore, Marco ; Stentoft-Hansen, Valdemar ; Iglesias, Miquel Triana. / Identifying interactions in omics data for clinical biomarker discovery using symbolic regression. In: Bioinformatics. 2022 ; Vol. 38, No. 15. pp. 3749-3758.

Bibtex

@article{157aa035f7034324afe691ab54054dc7,

title = "Identifying interactions in omics data for clinical biomarker discovery using symbolic regression",

abstract = "Motivation The identification of predictive biomarker signatures from omics and multi-omics data for clinical applications is an active area of research. Recent developments in assay technologies and machine learning (ML) methods have led to significant improvements in predictive performance. However, most high-performing ML methods suffer from complex architectures and lack interpretability.Results We present the application of a novel symbolic-regression-based algorithm, the QLattice, on a selection of clinical omics datasets. This approach generates parsimonious high-performing models that can both predict disease outcomes and reveal putative disease mechanisms, demonstrating the importance of selecting maximally relevant and minimally redundant features in omics-based machine-learning applications. The simplicity and high-predictive power of these biomarker signatures make them attractive tools for high-stakes applications in areas such as primary care, clinical decision-making and patient stratification.",

keywords = "DIFFERENTIAL EXPRESSION, INFERENCE, MODEL",

author = "Christensen, {Niels Johan} and Samuel Demharter and Meera Machado and Lykke Pedersen and Marco Salvatore and Valdemar Stentoft-Hansen and Iglesias, {Miquel Triana}",

year = "2022",

doi = "10.1093/bioinformatics/btac405",

language = "English",

volume = "38",

pages = "3749--3758",

journal = "Bioinformatics (Online)",

issn = "1367-4811",

publisher = "Oxford University Press",

number = "15",

}

RIS

TY - JOUR

T1 - Identifying interactions in omics data for clinical biomarker discovery using symbolic regression

AU - Christensen, Niels Johan

AU - Demharter, Samuel

AU - Machado, Meera

AU - Pedersen, Lykke

AU - Salvatore, Marco

AU - Stentoft-Hansen, Valdemar

AU - Iglesias, Miquel Triana

PY - 2022

Y1 - 2022

N2 - Motivation The identification of predictive biomarker signatures from omics and multi-omics data for clinical applications is an active area of research. Recent developments in assay technologies and machine learning (ML) methods have led to significant improvements in predictive performance. However, most high-performing ML methods suffer from complex architectures and lack interpretability.Results We present the application of a novel symbolic-regression-based algorithm, the QLattice, on a selection of clinical omics datasets. This approach generates parsimonious high-performing models that can both predict disease outcomes and reveal putative disease mechanisms, demonstrating the importance of selecting maximally relevant and minimally redundant features in omics-based machine-learning applications. The simplicity and high-predictive power of these biomarker signatures make them attractive tools for high-stakes applications in areas such as primary care, clinical decision-making and patient stratification.

AB - Motivation The identification of predictive biomarker signatures from omics and multi-omics data for clinical applications is an active area of research. Recent developments in assay technologies and machine learning (ML) methods have led to significant improvements in predictive performance. However, most high-performing ML methods suffer from complex architectures and lack interpretability.Results We present the application of a novel symbolic-regression-based algorithm, the QLattice, on a selection of clinical omics datasets. This approach generates parsimonious high-performing models that can both predict disease outcomes and reveal putative disease mechanisms, demonstrating the importance of selecting maximally relevant and minimally redundant features in omics-based machine-learning applications. The simplicity and high-predictive power of these biomarker signatures make them attractive tools for high-stakes applications in areas such as primary care, clinical decision-making and patient stratification.

KW - DIFFERENTIAL EXPRESSION

KW - INFERENCE

KW - MODEL

U2 - 10.1093/bioinformatics/btac405

DO - 10.1093/bioinformatics/btac405

M3 - Journal article

C2 - 35731214

VL - 38

SP - 3749

EP - 3758

JO - Bioinformatics (Online)

JF - Bioinformatics (Online)

SN - 1367-4811

IS - 15

M1 - 405

ER -

ID: 314353858