TY - JOUR
T1 - A multi-sample approach increases the accuracy of transcript assembly
AU - Song, Li
AU - Sabunciyan, Sarven
AU - Yang, Guangyu
AU - Florea, Liliana
N1 - Publisher Copyright:
© 2019, The Author(s).
PY - 2019/12/1
Y1 - 2019/12/1
N2 - Transcript assembly from RNA-seq reads is a critical step in gene expression and subsequent functional analyses. Here we present PsiCLASS, an accurate and efficient transcript assembler based on an approach that simultaneously analyzes multiple RNA-seq samples. PsiCLASS combines mixture statistical models for exonic feature selection across multiple samples with splice graph based dynamic programming algorithms and a weighted voting scheme for transcript selection. PsiCLASS achieves significantly better sensitivity-precision tradeoff, and renders precision up to 2-3 fold higher than the StringTie system and Scallop plus TACO, the two best current approaches. PsiCLASS is efficient and scalable, assembling 667 GEUVADIS samples in 9 h, and has robust accuracy with large numbers of samples.
AB - Transcript assembly from RNA-seq reads is a critical step in gene expression and subsequent functional analyses. Here we present PsiCLASS, an accurate and efficient transcript assembler based on an approach that simultaneously analyzes multiple RNA-seq samples. PsiCLASS combines mixture statistical models for exonic feature selection across multiple samples with splice graph based dynamic programming algorithms and a weighted voting scheme for transcript selection. PsiCLASS achieves significantly better sensitivity-precision tradeoff, and renders precision up to 2-3 fold higher than the StringTie system and Scallop plus TACO, the two best current approaches. PsiCLASS is efficient and scalable, assembling 667 GEUVADIS samples in 9 h, and has robust accuracy with large numbers of samples.
UR - http://www.scopus.com/inward/record.url?scp=85074297526&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85074297526&partnerID=8YFLogxK
U2 - 10.1038/s41467-019-12990-0
DO - 10.1038/s41467-019-12990-0
M3 - Article
C2 - 31676772
AN - SCOPUS:85074297526
SN - 2041-1723
VL - 10
JO - Nature communications
JF - Nature communications
IS - 1
M1 - 5000
ER -