A proteomics sample metadata representation for multiomics integration and big data analysis

Research output: Contribution to journal › Journal article › Research › peer-review

Chengxin Dai
Anja Füllgrabe
Julianus Pfeuffer
Elizaveta M. Solovyeva
Jingwen Deng
Pablo Moreno
Selvakumar Kamatchinathan
Deepti Jaiswal Kundu
Nancy George
Silvie Fexova
Björn Grüning
Melanie Christine Föll
Johannes Griss
Marc Vaudel
Enrique Audain
Michael Turewicz
Martin Eisenacher
Julian Uszkoreit
Tim Van Den Bossche
Veit Schwämmle
Stefan Schulze
David Bouyssié
Savita Jayaram
Vinay Kumar Duggineni
Patroklos Samaras
Mathias Wilhelm
Meena Choi
Mingxun Wang
Oliver Kohlbacher
Alvis Brazma
Irene Papatheodorou
Nuno Bandeira
Eric W. Deutsch
Juan Antonio Vizcaíno
Mingze Bai
Timo Sachsenberg
Lev I. Levitsky
Yasset Perez-Riverol

The amount of public proteomics data is rapidly increasing, but there is no standardized format to describe the sample metadata and their relationship with the dataset files in a way that fully supports their understanding or reanalysis. Here we propose to develop the transcriptomics data format MAGE-TAB into a standard representation for proteomics sample metadata. We implement MAGE-TAB-Proteomics in a crowdsourcing project to manually curate over 200 public datasets. We also describe tools and libraries to validate and submit sample metadata-related information to the PRIDE repository. We expect that these developments will improve the reproducibility and facilitate the reanalysis and integration of public proteomics datasets.

Original language	English
Article number	5854
Journal	Nature Communications
Volume	12
Issue number	1
Number of pages	8
ISSN	2041-1723
DOIs	https://doi.org/10.1038/s41467-021-26111-3
Publication status	Published - 2021
