PaleoProPhyler: a reproducible pipeline for phylogenetic inference using ancient proteins

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningfagfællebedømt

Dokumenter

  • Fulltext

    Forlagets udgivne version, 4,47 MB, PDF-dokument

Ancient proteins from fossilized or semi-fossilized remains can yield phylogenetic information at broad temporal horizons, in some cases even millions of years into the past. In recent years, peptides extracted from archaic hominins and long-extinct mega-fauna have enabled unprecedented insights into their evolutionary history. In contrast to the field of ancient DNA-where several computational methods exist to process and analyze sequencing data-few tools exist for handling ancient protein sequence data. In-stead, most studies rely on loosely combined custom scripts, which makes it difficult to reproduce results or share methodologies across research groups. Here, we present PaleoProPhyler: a new fully reproducible pipeline for aligning ancient peptide data and subsequently performing phylogenetic analyses. The pipeline can not only process various forms of proteomic data, but also easily harness genetic data in different formats (CRAM, BAM, VCF) and translate it, allowing the user to create reference panels for phyloproteomic analyses. We describe the various steps of the pipeline and its many functionalities, and provide some examples of how to use it. PaleoProPhyler allows re-searchers with little bioinformatics experience to efficiently analyze palaeoproteomic sequences, so as to derive insights from this valuable source of evolutionary data.

OriginalsprogEngelsk
Artikelnummere112
TidsskriftPeer Community Journal
Vol/bind3
Antal sider12
ISSN2804-3871
DOI
StatusUdgivet - 2023

Bibliografisk note

Funding Information:
The project was funded by the European Union\u2019s EU Framework Programme for Research and Innovation Horizon 2020, under Grant Agreement No. 861389-PUSHH. FR was additionally supported by a Villum Young Investigator Grant (project no. 00025300), a COREX ERC Synergy grant (ID 951385) and a Novo Nordisk Fonden Data Science Ascending Investigator Award (NNF22OC0076816). E.C. was additionally supported by the European Research Council (ERC) through the ERC Advanced Grant \u201DBACKWARD\u201D, under the European Union\u2019s Horizon 2020 research and innovation program (grant agreement No. 101021361).

Publisher Copyright:
© 2023, Centre Mersenne. All rights reserved.

ID: 401794564