Systematic association of genes to phenotypes by genome and literature mining

Research output: Contribution to journalJournal articleResearchpeer-review

Standard

Systematic association of genes to phenotypes by genome and literature mining. / Korbel, Jan O; Doerks, Tobias; Jensen, Lars J; Perez-Iratxeta, Carolina; Kaczanowski, Szymon; Hooper, Sean D; Andrade, Miguel A; Bork, Peer.

In: P L o S Biology (Online), Vol. 3, No. 5, 2005, p. e134.

Research output: Contribution to journalJournal articleResearchpeer-review

Harvard

Korbel, JO, Doerks, T, Jensen, LJ, Perez-Iratxeta, C, Kaczanowski, S, Hooper, SD, Andrade, MA & Bork, P 2005, 'Systematic association of genes to phenotypes by genome and literature mining', P L o S Biology (Online), vol. 3, no. 5, pp. e134. https://doi.org/10.1371/journal.pbio.0030134

APA

Korbel, J. O., Doerks, T., Jensen, L. J., Perez-Iratxeta, C., Kaczanowski, S., Hooper, S. D., Andrade, M. A., & Bork, P. (2005). Systematic association of genes to phenotypes by genome and literature mining. P L o S Biology (Online), 3(5), e134. https://doi.org/10.1371/journal.pbio.0030134

Vancouver

Korbel JO, Doerks T, Jensen LJ, Perez-Iratxeta C, Kaczanowski S, Hooper SD et al. Systematic association of genes to phenotypes by genome and literature mining. P L o S Biology (Online). 2005;3(5):e134. https://doi.org/10.1371/journal.pbio.0030134

Author

Korbel, Jan O ; Doerks, Tobias ; Jensen, Lars J ; Perez-Iratxeta, Carolina ; Kaczanowski, Szymon ; Hooper, Sean D ; Andrade, Miguel A ; Bork, Peer. / Systematic association of genes to phenotypes by genome and literature mining. In: P L o S Biology (Online). 2005 ; Vol. 3, No. 5. pp. e134.

Bibtex

@article{19375815163641d18c8217124bdc7a22,
title = "Systematic association of genes to phenotypes by genome and literature mining",
abstract = "One of the major challenges of functional genomics is to unravel the connection between genotype and phenotype. So far no global analysis has attempted to explore those connections in the light of the large phenotypic variability seen in nature. Here, we use an unsupervised, systematic approach for associating genes and phenotypic characteristics that combines literature mining with comparative genome analysis. We first mine the MEDLINE literature database for terms that reflect phenotypic similarities of species. Subsequently we predict the likely genomic determinants: genes specifically present in the respective genomes. In a global analysis involving 92 prokaryotic genomes we retrieve 323 clusters containing a total of 2,700 significant gene-phenotype associations. Some clusters contain mostly known relationships, such as genes involved in motility or plant degradation, often with additional hypothetical proteins associated with those phenotypes. Other clusters comprise unexpected associations; for example, a group of terms related to food and spoilage is linked to genes predicted to be involved in bacterial food poisoning. Among the clusters, we observe an enrichment of pathogenicity-related associations, suggesting that the approach reveals many novel genes likely to play a role in infectious diseases.",
author = "Korbel, {Jan O} and Tobias Doerks and Jensen, {Lars J} and Carolina Perez-Iratxeta and Szymon Kaczanowski and Hooper, {Sean D} and Andrade, {Miguel A} and Peer Bork",
year = "2005",
doi = "10.1371/journal.pbio.0030134",
language = "English",
volume = "3",
pages = "e134",
journal = "PLoS Biology",
issn = "1544-9173",
publisher = "Public Library of Science",
number = "5",

}

RIS

TY - JOUR

T1 - Systematic association of genes to phenotypes by genome and literature mining

AU - Korbel, Jan O

AU - Doerks, Tobias

AU - Jensen, Lars J

AU - Perez-Iratxeta, Carolina

AU - Kaczanowski, Szymon

AU - Hooper, Sean D

AU - Andrade, Miguel A

AU - Bork, Peer

PY - 2005

Y1 - 2005

N2 - One of the major challenges of functional genomics is to unravel the connection between genotype and phenotype. So far no global analysis has attempted to explore those connections in the light of the large phenotypic variability seen in nature. Here, we use an unsupervised, systematic approach for associating genes and phenotypic characteristics that combines literature mining with comparative genome analysis. We first mine the MEDLINE literature database for terms that reflect phenotypic similarities of species. Subsequently we predict the likely genomic determinants: genes specifically present in the respective genomes. In a global analysis involving 92 prokaryotic genomes we retrieve 323 clusters containing a total of 2,700 significant gene-phenotype associations. Some clusters contain mostly known relationships, such as genes involved in motility or plant degradation, often with additional hypothetical proteins associated with those phenotypes. Other clusters comprise unexpected associations; for example, a group of terms related to food and spoilage is linked to genes predicted to be involved in bacterial food poisoning. Among the clusters, we observe an enrichment of pathogenicity-related associations, suggesting that the approach reveals many novel genes likely to play a role in infectious diseases.

AB - One of the major challenges of functional genomics is to unravel the connection between genotype and phenotype. So far no global analysis has attempted to explore those connections in the light of the large phenotypic variability seen in nature. Here, we use an unsupervised, systematic approach for associating genes and phenotypic characteristics that combines literature mining with comparative genome analysis. We first mine the MEDLINE literature database for terms that reflect phenotypic similarities of species. Subsequently we predict the likely genomic determinants: genes specifically present in the respective genomes. In a global analysis involving 92 prokaryotic genomes we retrieve 323 clusters containing a total of 2,700 significant gene-phenotype associations. Some clusters contain mostly known relationships, such as genes involved in motility or plant degradation, often with additional hypothetical proteins associated with those phenotypes. Other clusters comprise unexpected associations; for example, a group of terms related to food and spoilage is linked to genes predicted to be involved in bacterial food poisoning. Among the clusters, we observe an enrichment of pathogenicity-related associations, suggesting that the approach reveals many novel genes likely to play a role in infectious diseases.

U2 - 10.1371/journal.pbio.0030134

DO - 10.1371/journal.pbio.0030134

M3 - Journal article

C2 - 15799710

VL - 3

SP - e134

JO - PLoS Biology

JF - PLoS Biology

SN - 1544-9173

IS - 5

ER -

ID: 40749431