Systematic assessment of long-read RNA-seq methods for transcript identification and quantification

Nature Methods
By: , and 

Links

Abstract

The Long-read RNA-Seq Genome Annotation Assessment Project Consortium was formed to evaluate the effectiveness of long-read approaches for transcriptome analysis. Using different protocols and sequencing platforms, the consortium generated over 427 million long-read sequences from complementary DNA and direct RNA datasets, encompassing human, mouse and manatee species. Developers utilized these data to address challenges in transcript isoform detection, quantification and de novo transcript detection. The study revealed that libraries with longer, more accurate sequences produce more accurate transcripts than those with increased read depth, whereas greater read depth improved quantification accuracy. In well-annotated genomes, tools based on reference sequences demonstrated the best performance. Incorporating additional orthogonal data and replicate samples is advised when aiming to detect rare and novel transcripts or using reference-free approaches. This collaborative study offers a benchmark for current practices and provides direction for future method development in transcriptome analysis.

Publication type Article
Publication Subtype Journal Article
Title Systematic assessment of long-read RNA-seq methods for transcript identification and quantification
Series title Nature Methods
DOI 10.1038/s41592-024-02298-3
Volume 21
Year Published 2024
Language English
Publisher Springer Nature
Contributing office(s) Wetland and Aquatic Research Center
Description 15 p.
First page 1349
Last page 1363
Google Analytic Metrics Metrics page
Additional publication details