JASPAR 2022: the 9th release of the open-access database of transcription factor binding profiles

Research output: Contribution to journalJournal articleResearchpeer-review

Documents

  • Fulltext

    Final published version, 1.41 MB, PDF document

  • Jaime A. Castro-Mondragon
  • Rafael Riudavets-Puig
  • Ieva Rauluseviciute
  • Roza Berhanu Lemma
  • Laura Turchi
  • Romain Blanc-Mathieu
  • Jeremy Lucas
  • Paul Boddie
  • Aziz Khan
  • Nicolás Manosalva Perez
  • Oriol Fornes
  • Tiffany Y. Leung
  • Alejandro Aguirre
  • Fayrouz Hammal
  • Daniel Schmelter
  • Damir Baranasic
  • Benoit Ballester
  • Boris Lenhard
  • Klaas Vandepoele
  • Wyeth W. Wasserman
  • François Parcy
  • Anthony Mathelier

JASPAR (http://jaspar.genereg.net/) is an open-access database containing manually curated, non-redundant transcription factor (TF) binding profiles for TFs across six taxonomic groups. In this 9th release, we expanded the CORE collection with 341 new profiles (148 for plants, 101 for vertebrates, 85 for urochordates, and 7 for insects), which corresponds to a 19% expansion over the previous release. We added 298 new profiles to the Unvalidated collection when no orthogonal evidence was found in the literature. All the profiles were clustered to provide familial binding profiles for each taxonomic group. Moreover, we revised the structural classification of DNA binding domains to consider plant-specific TFs. This release introduces word clouds to represent the scientific knowledge associated with each TF. We updated the genome tracks of TFBSs predicted with JASPAR profiles in eight organisms; the human and mouse TFBS predictions can be visualized as native tracks in the UCSC Genome Browser. Finally, we provide a new tool to perform JASPAR TFBS enrichment analysis in user-provided genomic regions. All the data is accessible through the JASPAR website, its associated RESTful API, the R/Bioconductor data package, and a new Python package, pyJASPAR, that facilitates serverless access to the data.

Original languageEnglish
Book seriesNucleic acids symposium series
Volume50
Issue numberD1
Pages (from-to)D165-D173
Number of pages9
ISSN0305-1048
DOIs
Publication statusPublished - 2022

    Research areas

  • DNA SHAPE-FEATURES, ENRICHMENT ANALYSIS, MOTIF DATABASE, TFBSSHAPE, MOUSE, TOOLS, GENE

Number of downloads are based on statistics from Google Scholar and www.ku.dk


No data available

ID: 297056593