Bitte benutzen Sie diese Kennung, um auf die Ressource zu verweisen: http://dx.doi.org/10.25673/122699
Titel: Improved reconstruction of transcripts and coding sequences from RNA-seq data
Autor(en): Grau, JanIn der Gemeinsamen Normdatei der DNB nachschlagen
Weise, Deborah
Panster, Marika
Schattat, Martin HartmutIn der Gemeinsamen Normdatei der DNB nachschlagen
Keilwagen, JensIn der Gemeinsamen Normdatei der DNB nachschlagen
Erscheinungsdatum: 2026
Art: Artikel
Sprache: Englisch
Zusammenfassung: Annotation of genes and transcripts is a key prerequisite for understanding the information that is encoded in newly sequenced genomes. One source of information suited for this purpose is RNA-seq data mapped to the respective genome sequence. RNA-seq-based approaches for transcript reconstruction generate transcript models from these data by combining regions of contiguous coverage (exons) and split read mappings (introns). Understanding phenotypes as a consequence of proteins encoded in a genome further requires the annotation of coding sequences within transcript models. We present GeMoSeq, a novel approach for transcript reconstruction from RNA-seq data that combines combinatorial enumeration of candidate transcripts with heuristics for splitting candidate transcripts into regions of contiguous coverage and subsequent likelihood-based quantification. Prediction of coding sequences is an integral part of the GeMoSeq algorithm. We benchmark GeMoSeq against previous approaches using a large collection of public RNA-seq data for seven species. For the majority of species, we observe an improved prediction performance of GeMoSeq, especially on the level of coding sequences and for species with dense genomes. We combine GeMoSeq with the homology-based approach GeMoMa to re-annotate two recently sequenced genomes of Nicotiana benthamiana lab strains, which illustrates the main purpose of GeMoSeq: the initial annotation of newly sequenced genomes with protein-coding genes.
URI: https://opendata.uni-halle.de//handle/1981185920/124644
http://dx.doi.org/10.25673/122699
Open-Access: Open-Access-Publikation
Nutzungslizenz: (CC BY-NC 4.0) Creative Commons Namensnennung - Nicht kommerziell 4.0 International(CC BY-NC 4.0) Creative Commons Namensnennung - Nicht kommerziell 4.0 International
Journal Titel: Nucleic acids research
Verlag: Oxford Univ. Press
Verlagsort: Oxford
Band: 54
Heft: 4
Originalveröffentlichung: 10.1093/nar/gkag091
Enthalten in den Sammlungen:Open Access Publikationen der MLU

Dateien zu dieser Ressource:
Datei GrößeFormat 
gkag091.pdf846.26 kBAdobe PDFÖffnen/Anzeigen