Please use this identifier to cite or link to this item:
http://dx.doi.org/10.25673/122699| Title: | Improved reconstruction of transcripts and coding sequences from RNA-seq data |
| Author(s): | Grau, Jan Weise, Deborah Panster, Marika Schattat, Martin Hartmut Keilwagen, Jens |
| Issue Date: | 2026 |
| Type: | Article |
| Language: | English |
| Abstract: | Annotation of genes and transcripts is a key prerequisite for understanding the information that is encoded in newly sequenced genomes. One source of information suited for this purpose is RNA-seq data mapped to the respective genome sequence. RNA-seq-based approaches for transcript reconstruction generate transcript models from these data by combining regions of contiguous coverage (exons) and split read mappings (introns). Understanding phenotypes as a consequence of proteins encoded in a genome further requires the annotation of coding sequences within transcript models. We present GeMoSeq, a novel approach for transcript reconstruction from RNA-seq data that combines combinatorial enumeration of candidate transcripts with heuristics for splitting candidate transcripts into regions of contiguous coverage and subsequent likelihood-based quantification. Prediction of coding sequences is an integral part of the GeMoSeq algorithm. We benchmark GeMoSeq against previous approaches using a large collection of public RNA-seq data for seven species. For the majority of species, we observe an improved prediction performance of GeMoSeq, especially on the level of coding sequences and for species with dense genomes. We combine GeMoSeq with the homology-based approach GeMoMa to re-annotate two recently sequenced genomes of Nicotiana benthamiana lab strains, which illustrates the main purpose of GeMoSeq: the initial annotation of newly sequenced genomes with protein-coding genes. |
| URI: | https://opendata.uni-halle.de//handle/1981185920/124644 http://dx.doi.org/10.25673/122699 |
| Open Access: | Open access publication |
| License: | (CC BY-NC 4.0) Creative Commons Attribution NonCommercial 4.0 |
| Journal Title: | Nucleic acids research |
| Publisher: | Oxford Univ. Press |
| Publisher Place: | Oxford |
| Volume: | 54 |
| Issue: | 4 |
| Original Publication: | 10.1093/nar/gkag091 |
| Appears in Collections: | Open Access Publikationen der MLU |
Files in This Item:
| File | Size | Format | |
|---|---|---|---|
| gkag091.pdf | 846.26 kB | Adobe PDF | View/Open |
Open access publication