eggNOG v2.0: extending the evolutionary genealogy of genes with enhanced non-supervised orthologous groups, species and functional annotations

Research output: Contribution to journalJournal articleResearchpeer-review

Standard

eggNOG v2.0: extending the evolutionary genealogy of genes with enhanced non-supervised orthologous groups, species and functional annotations. / Muller, J; Szklarczyk, D; Julien, P; Letunic, I; Roth, A; Kuhn, M; Powell, S; von Mering, C; Doerks, T; Jensen, L J; Bork, P.

In: Nucleic Acids Symposium Series, Vol. 38, No. Database issue, 01.01.2010, p. D190-5.

Research output: Contribution to journalJournal articleResearchpeer-review

Harvard

Muller, J, Szklarczyk, D, Julien, P, Letunic, I, Roth, A, Kuhn, M, Powell, S, von Mering, C, Doerks, T, Jensen, LJ & Bork, P 2010, 'eggNOG v2.0: extending the evolutionary genealogy of genes with enhanced non-supervised orthologous groups, species and functional annotations', Nucleic Acids Symposium Series, vol. 38, no. Database issue, pp. D190-5. https://doi.org/10.1093/nar/gkp951

APA

Muller, J., Szklarczyk, D., Julien, P., Letunic, I., Roth, A., Kuhn, M., Powell, S., von Mering, C., Doerks, T., Jensen, L. J., & Bork, P. (2010). eggNOG v2.0: extending the evolutionary genealogy of genes with enhanced non-supervised orthologous groups, species and functional annotations. Nucleic Acids Symposium Series, 38(Database issue), D190-5. https://doi.org/10.1093/nar/gkp951

Vancouver

Muller J, Szklarczyk D, Julien P, Letunic I, Roth A, Kuhn M et al. eggNOG v2.0: extending the evolutionary genealogy of genes with enhanced non-supervised orthologous groups, species and functional annotations. Nucleic Acids Symposium Series. 2010 Jan 1;38(Database issue):D190-5. https://doi.org/10.1093/nar/gkp951

Author

Muller, J ; Szklarczyk, D ; Julien, P ; Letunic, I ; Roth, A ; Kuhn, M ; Powell, S ; von Mering, C ; Doerks, T ; Jensen, L J ; Bork, P. / eggNOG v2.0: extending the evolutionary genealogy of genes with enhanced non-supervised orthologous groups, species and functional annotations. In: Nucleic Acids Symposium Series. 2010 ; Vol. 38, No. Database issue. pp. D190-5.

Bibtex

@article{e4b17a09598c42eb9e5d4f29128e5666,
title = "eggNOG v2.0: extending the evolutionary genealogy of genes with enhanced non-supervised orthologous groups, species and functional annotations",
abstract = "The identification of orthologous relationships forms the basis for most comparative genomics studies. Here, we present the second version of the eggNOG database, which contains orthologous groups (OGs) constructed through identification of reciprocal best BLAST matches and triangular linkage clustering. We applied this procedure to 630 complete genomes (529 bacteria, 46 archaea and 55 eukaryotes), which is a 2-fold increase relative to the previous version. The pipeline yielded 224,847 OGs, including 9724 extended versions of the original COG and KOG. We computed OGs for different levels of the tree of life; in addition to the species groups included in our first release (i.e. fungi, metazoa, insects, vertebrates and mammals), we have now constructed OGs for archaea, fishes, rodents and primates. We automatically annotate the non-supervised orthologous groups (NOGs) with functional descriptions, protein domains, and functional categories as defined initially for the COG/KOG database. In-depth analysis is facilitated by precomputed high-quality multiple sequence alignments and maximum-likelihood trees for each of the available OGs. Altogether, eggNOG covers 2,242 035 proteins (built from 2,590,259 proteins) and provides a broad functional description for at least 1,966,709 (88%) of them. Users can access the complete set of orthologous groups via a web interface at: http://eggnog.embl.de.",
keywords = "Amino Acid Motifs, Animals, Archaea, Computational Biology, Databases, Genetic, Databases, Nucleic Acid, Databases, Protein, Fishes, Genome, Bacterial, Humans, Information Storage and Retrieval, Internet, Primates, Protein Structure, Tertiary, Rats, Software",
author = "J Muller and D Szklarczyk and P Julien and I Letunic and A Roth and M Kuhn and S Powell and {von Mering}, C and T Doerks and Jensen, {L J} and P Bork",
year = "2010",
month = jan,
day = "1",
doi = "10.1093/nar/gkp951",
language = "English",
volume = "38",
pages = "D190--5",
journal = "Nucleic acids symposium series",
issn = "0261-3166",
publisher = "Oxford University Press",
number = "Database issue",

}

RIS

TY - JOUR

T1 - eggNOG v2.0: extending the evolutionary genealogy of genes with enhanced non-supervised orthologous groups, species and functional annotations

AU - Muller, J

AU - Szklarczyk, D

AU - Julien, P

AU - Letunic, I

AU - Roth, A

AU - Kuhn, M

AU - Powell, S

AU - von Mering, C

AU - Doerks, T

AU - Jensen, L J

AU - Bork, P

PY - 2010/1/1

Y1 - 2010/1/1

N2 - The identification of orthologous relationships forms the basis for most comparative genomics studies. Here, we present the second version of the eggNOG database, which contains orthologous groups (OGs) constructed through identification of reciprocal best BLAST matches and triangular linkage clustering. We applied this procedure to 630 complete genomes (529 bacteria, 46 archaea and 55 eukaryotes), which is a 2-fold increase relative to the previous version. The pipeline yielded 224,847 OGs, including 9724 extended versions of the original COG and KOG. We computed OGs for different levels of the tree of life; in addition to the species groups included in our first release (i.e. fungi, metazoa, insects, vertebrates and mammals), we have now constructed OGs for archaea, fishes, rodents and primates. We automatically annotate the non-supervised orthologous groups (NOGs) with functional descriptions, protein domains, and functional categories as defined initially for the COG/KOG database. In-depth analysis is facilitated by precomputed high-quality multiple sequence alignments and maximum-likelihood trees for each of the available OGs. Altogether, eggNOG covers 2,242 035 proteins (built from 2,590,259 proteins) and provides a broad functional description for at least 1,966,709 (88%) of them. Users can access the complete set of orthologous groups via a web interface at: http://eggnog.embl.de.

AB - The identification of orthologous relationships forms the basis for most comparative genomics studies. Here, we present the second version of the eggNOG database, which contains orthologous groups (OGs) constructed through identification of reciprocal best BLAST matches and triangular linkage clustering. We applied this procedure to 630 complete genomes (529 bacteria, 46 archaea and 55 eukaryotes), which is a 2-fold increase relative to the previous version. The pipeline yielded 224,847 OGs, including 9724 extended versions of the original COG and KOG. We computed OGs for different levels of the tree of life; in addition to the species groups included in our first release (i.e. fungi, metazoa, insects, vertebrates and mammals), we have now constructed OGs for archaea, fishes, rodents and primates. We automatically annotate the non-supervised orthologous groups (NOGs) with functional descriptions, protein domains, and functional categories as defined initially for the COG/KOG database. In-depth analysis is facilitated by precomputed high-quality multiple sequence alignments and maximum-likelihood trees for each of the available OGs. Altogether, eggNOG covers 2,242 035 proteins (built from 2,590,259 proteins) and provides a broad functional description for at least 1,966,709 (88%) of them. Users can access the complete set of orthologous groups via a web interface at: http://eggnog.embl.de.

KW - Amino Acid Motifs

KW - Animals

KW - Archaea

KW - Computational Biology

KW - Databases, Genetic

KW - Databases, Nucleic Acid

KW - Databases, Protein

KW - Fishes

KW - Genome, Bacterial

KW - Humans

KW - Information Storage and Retrieval

KW - Internet

KW - Primates

KW - Protein Structure, Tertiary

KW - Rats

KW - Software

U2 - 10.1093/nar/gkp951

DO - 10.1093/nar/gkp951

M3 - Journal article

C2 - 19900971

VL - 38

SP - D190-5

JO - Nucleic acids symposium series

JF - Nucleic acids symposium series

SN - 0261-3166

IS - Database issue

ER -

ID: 33770138