Predicting Structural Motifs of Glycosaminoglycans using Cryogenic Infrared Spectroscopy and Random Forest

Research output: Contribution to journalJournal articleResearchpeer-review

Standard

Predicting Structural Motifs of Glycosaminoglycans using Cryogenic Infrared Spectroscopy and Random Forest. / Riedel, Jerome; Lettow, Maike; Grabarics, Márkó; Götze, Michael; Miller, Rebecca L.; Boons, Geert Jan; Meijer, Gerard; von Helden, Gert; Szekeres, Gergo Peter; Pagel, Kevin.

In: Journal of the American Chemical Society, Vol. 145, No. 14, 2023, p. 7859-7868.

Research output: Contribution to journalJournal articleResearchpeer-review

Harvard

Riedel, J, Lettow, M, Grabarics, M, Götze, M, Miller, RL, Boons, GJ, Meijer, G, von Helden, G, Szekeres, GP & Pagel, K 2023, 'Predicting Structural Motifs of Glycosaminoglycans using Cryogenic Infrared Spectroscopy and Random Forest', Journal of the American Chemical Society, vol. 145, no. 14, pp. 7859-7868. https://doi.org/10.1021/jacs.2c12762

APA

Riedel, J., Lettow, M., Grabarics, M., Götze, M., Miller, R. L., Boons, G. J., Meijer, G., von Helden, G., Szekeres, G. P., & Pagel, K. (2023). Predicting Structural Motifs of Glycosaminoglycans using Cryogenic Infrared Spectroscopy and Random Forest. Journal of the American Chemical Society, 145(14), 7859-7868. https://doi.org/10.1021/jacs.2c12762

Vancouver

Riedel J, Lettow M, Grabarics M, Götze M, Miller RL, Boons GJ et al. Predicting Structural Motifs of Glycosaminoglycans using Cryogenic Infrared Spectroscopy and Random Forest. Journal of the American Chemical Society. 2023;145(14):7859-7868. https://doi.org/10.1021/jacs.2c12762

Author

Riedel, Jerome ; Lettow, Maike ; Grabarics, Márkó ; Götze, Michael ; Miller, Rebecca L. ; Boons, Geert Jan ; Meijer, Gerard ; von Helden, Gert ; Szekeres, Gergo Peter ; Pagel, Kevin. / Predicting Structural Motifs of Glycosaminoglycans using Cryogenic Infrared Spectroscopy and Random Forest. In: Journal of the American Chemical Society. 2023 ; Vol. 145, No. 14. pp. 7859-7868.

Bibtex

@article{d63d5582c7a04b8ab12be128f92358a4,
title = "Predicting Structural Motifs of Glycosaminoglycans using Cryogenic Infrared Spectroscopy and Random Forest",
abstract = "In recent years, glycosaminoglycans (GAGs) have emerged into the focus of biochemical and biomedical research due to their importance in a variety of physiological processes. These molecules show great diversity, which makes their analysis highly challenging. A promising tool for identifying the structural motifs and conformation of shorter GAG chains is cryogenic gas-phase infrared (IR) spectroscopy. In this work, the cryogenic gas-phase IR spectra of mass-selected heparan sulfate (HS) di-, tetra-, and hexasaccharide ions were recorded to extract vibrational features that are characteristic to structural motifs. The data were augmented with chondroitin sulfate (CS) disaccharide spectra to assemble a training library for random forest (RF) classifiers. These were used to discriminate between GAG classes (CS or HS) and different sulfate positions (2-O-, 4-O-, 6-O-, and N-sulfation). With optimized data preprocessing and RF modeling, a prediction accuracy of >97% was achieved for HS tetra- and hexasaccharides based on a training set of only 21 spectra. These results exemplify the importance of combining gas-phase cryogenic IR ion spectroscopy with machine learning to improve the future analytical workflow for GAG sequencing and that of other biomolecules, such as metabolites.",
author = "Jerome Riedel and Maike Lettow and M{\'a}rk{\'o} Grabarics and Michael G{\"o}tze and Miller, {Rebecca L.} and Boons, {Geert Jan} and Gerard Meijer and {von Helden}, Gert and Szekeres, {Gergo Peter} and Kevin Pagel",
note = "Publisher Copyright: {\textcopyright} 2023 The Authors. Published by American Chemical Society.",
year = "2023",
doi = "10.1021/jacs.2c12762",
language = "English",
volume = "145",
pages = "7859--7868",
journal = "Journal of the American Chemical Society",
issn = "0002-7863",
publisher = "ACS Publications",
number = "14",

}

RIS

TY - JOUR

T1 - Predicting Structural Motifs of Glycosaminoglycans using Cryogenic Infrared Spectroscopy and Random Forest

AU - Riedel, Jerome

AU - Lettow, Maike

AU - Grabarics, Márkó

AU - Götze, Michael

AU - Miller, Rebecca L.

AU - Boons, Geert Jan

AU - Meijer, Gerard

AU - von Helden, Gert

AU - Szekeres, Gergo Peter

AU - Pagel, Kevin

N1 - Publisher Copyright: © 2023 The Authors. Published by American Chemical Society.

PY - 2023

Y1 - 2023

N2 - In recent years, glycosaminoglycans (GAGs) have emerged into the focus of biochemical and biomedical research due to their importance in a variety of physiological processes. These molecules show great diversity, which makes their analysis highly challenging. A promising tool for identifying the structural motifs and conformation of shorter GAG chains is cryogenic gas-phase infrared (IR) spectroscopy. In this work, the cryogenic gas-phase IR spectra of mass-selected heparan sulfate (HS) di-, tetra-, and hexasaccharide ions were recorded to extract vibrational features that are characteristic to structural motifs. The data were augmented with chondroitin sulfate (CS) disaccharide spectra to assemble a training library for random forest (RF) classifiers. These were used to discriminate between GAG classes (CS or HS) and different sulfate positions (2-O-, 4-O-, 6-O-, and N-sulfation). With optimized data preprocessing and RF modeling, a prediction accuracy of >97% was achieved for HS tetra- and hexasaccharides based on a training set of only 21 spectra. These results exemplify the importance of combining gas-phase cryogenic IR ion spectroscopy with machine learning to improve the future analytical workflow for GAG sequencing and that of other biomolecules, such as metabolites.

AB - In recent years, glycosaminoglycans (GAGs) have emerged into the focus of biochemical and biomedical research due to their importance in a variety of physiological processes. These molecules show great diversity, which makes their analysis highly challenging. A promising tool for identifying the structural motifs and conformation of shorter GAG chains is cryogenic gas-phase infrared (IR) spectroscopy. In this work, the cryogenic gas-phase IR spectra of mass-selected heparan sulfate (HS) di-, tetra-, and hexasaccharide ions were recorded to extract vibrational features that are characteristic to structural motifs. The data were augmented with chondroitin sulfate (CS) disaccharide spectra to assemble a training library for random forest (RF) classifiers. These were used to discriminate between GAG classes (CS or HS) and different sulfate positions (2-O-, 4-O-, 6-O-, and N-sulfation). With optimized data preprocessing and RF modeling, a prediction accuracy of >97% was achieved for HS tetra- and hexasaccharides based on a training set of only 21 spectra. These results exemplify the importance of combining gas-phase cryogenic IR ion spectroscopy with machine learning to improve the future analytical workflow for GAG sequencing and that of other biomolecules, such as metabolites.

U2 - 10.1021/jacs.2c12762

DO - 10.1021/jacs.2c12762

M3 - Journal article

C2 - 37000483

AN - SCOPUS:85151828266

VL - 145

SP - 7859

EP - 7868

JO - Journal of the American Chemical Society

JF - Journal of the American Chemical Society

SN - 0002-7863

IS - 14

ER -

ID: 345415254