A data-driven genome annotation approach for cassava
Research output: Contribution to journal › Journal article › Research › peer-review
Standard
A data-driven genome annotation approach for cassava. / Chenna, Swetha; Ivanov, Maxim; Nielsen, Tue Kjærgaard; Chalenko, Karina; Olsen, Evy; Jørgensen, Kirsten; Sandelin, Albin; Marquardt, Sebastian.
In: Plant Journal, Vol. 119, 2024, p. 1596-1612.Research output: Contribution to journal › Journal article › Research › peer-review
Harvard
APA
Vancouver
Author
Bibtex
}
RIS
TY - JOUR
T1 - A data-driven genome annotation approach for cassava
AU - Chenna, Swetha
AU - Ivanov, Maxim
AU - Nielsen, Tue Kjærgaard
AU - Chalenko, Karina
AU - Olsen, Evy
AU - Jørgensen, Kirsten
AU - Sandelin, Albin
AU - Marquardt, Sebastian
N1 - © 2024 The Author(s). The Plant Journal published by Society for Experimental Biology and John Wiley & Sons Ltd.
PY - 2024
Y1 - 2024
N2 - Genome annotation files play a critical role in dictating the quality of downstream analyses by providing essential predictions for gene positions and structures. These files are pivotal in decoding the complex information encoded within DNA sequences. Here, we generated experimental data resolving RNA 5'- and 3'-ends as well as full-length RNAs for cassava TME12 sticklings in ambient temperature and cold. We used these data to generate genome annotation files using the TranscriptomeReconstructoR (TR) tool. A careful comparison to high-quality genome annotations suggests that our new TR genome annotations identified additional genes, resolved the transcript boundaries more accurately and identified additional RNA isoforms. We enhanced existing cassava genome annotation files with the information from TR that maintained the different transcript models as RNA isoforms. The resultant merged annotation was subsequently utilized for comprehensive analysis. To examine the effects of genome annotation files on gene expression studies, we compared the detection of differentially expressed genes during cold using the same RNA-seq data but alternative genome annotation files. We found that our merged genome annotation that included cold-specific TR gene models identified about twice as many cold-induced genes. These data indicate that environmentally induced genes may be missing in off-the-shelf genome annotation files. In conclusion, TR offers the opportunity to enhance crop genome annotations with implications for the discovery of differentially expressed candidate genes during plant-environment interactions.
AB - Genome annotation files play a critical role in dictating the quality of downstream analyses by providing essential predictions for gene positions and structures. These files are pivotal in decoding the complex information encoded within DNA sequences. Here, we generated experimental data resolving RNA 5'- and 3'-ends as well as full-length RNAs for cassava TME12 sticklings in ambient temperature and cold. We used these data to generate genome annotation files using the TranscriptomeReconstructoR (TR) tool. A careful comparison to high-quality genome annotations suggests that our new TR genome annotations identified additional genes, resolved the transcript boundaries more accurately and identified additional RNA isoforms. We enhanced existing cassava genome annotation files with the information from TR that maintained the different transcript models as RNA isoforms. The resultant merged annotation was subsequently utilized for comprehensive analysis. To examine the effects of genome annotation files on gene expression studies, we compared the detection of differentially expressed genes during cold using the same RNA-seq data but alternative genome annotation files. We found that our merged genome annotation that included cold-specific TR gene models identified about twice as many cold-induced genes. These data indicate that environmentally induced genes may be missing in off-the-shelf genome annotation files. In conclusion, TR offers the opportunity to enhance crop genome annotations with implications for the discovery of differentially expressed candidate genes during plant-environment interactions.
U2 - 10.1111/tpj.16856
DO - 10.1111/tpj.16856
M3 - Journal article
C2 - 38831668
VL - 119
SP - 1596
EP - 1612
JO - Plant Journal
JF - Plant Journal
SN - 0960-7412
ER -
ID: 394381289