TY - GEN
T1 - JULiP
T2 - 6th IEEE International Conference on Computational Advances in Bio and Medical Sciences, ICCABS 2016
AU - Yang, Guangyu
AU - Florea, Liliana
N1 - Publisher Copyright:
© 2016 IEEE.
PY - 2016/12/30
Y1 - 2016/12/30
N2 - Accurate alternative splicing detection and transcript reconstruction are essential to characterize gene regulation and function and to understand development and disease. However, current methods for extracting splicing variation from RNA-seq data only analyze signals from a single sample, which limits transcript reconstruction and fails to detect a complete set of alternative splicing events. We developed a novel feature selection method, JULiP, that analyzes information across multiple samples to identify alternative splicing variation in the form of splice junctions (introns). It formulates the selection problem as a regularized program, utilizing the latent information from multiple RNA-seq samples to construct an accurate and comprehensive intron set. JULiP is highly accurate, and could detect thousands more introns in any one sample, >30% more than the most sensitive single-sample method, and 10% more introns than in the cumulative set of samples, at higher or comparable precision (>98%). Tested assemblers included Cufflinks, CLASS2, StringTie and FlipFlop, and the multi-sample assembler ISP. JULiP is multi-threaded and parallelized, taking only one minute to analyze up to 100 data sets on a multi-computer cluster, and can easily scale up to allow analyses of hundreds and thousands of RNA-seq samples.
AB - Accurate alternative splicing detection and transcript reconstruction are essential to characterize gene regulation and function and to understand development and disease. However, current methods for extracting splicing variation from RNA-seq data only analyze signals from a single sample, which limits transcript reconstruction and fails to detect a complete set of alternative splicing events. We developed a novel feature selection method, JULiP, that analyzes information across multiple samples to identify alternative splicing variation in the form of splice junctions (introns). It formulates the selection problem as a regularized program, utilizing the latent information from multiple RNA-seq samples to construct an accurate and comprehensive intron set. JULiP is highly accurate, and could detect thousands more introns in any one sample, >30% more than the most sensitive single-sample method, and 10% more introns than in the cumulative set of samples, at higher or comparable precision (>98%). Tested assemblers included Cufflinks, CLASS2, StringTie and FlipFlop, and the multi-sample assembler ISP. JULiP is multi-threaded and parallelized, taking only one minute to analyze up to 100 data sets on a multi-computer cluster, and can easily scale up to allow analyses of hundreds and thousands of RNA-seq samples.
UR - http://www.scopus.com/inward/record.url?scp=85011015679&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85011015679&partnerID=8YFLogxK
U2 - 10.1109/ICCABS.2016.7802769
DO - 10.1109/ICCABS.2016.7802769
M3 - Conference contribution
AN - SCOPUS:85011015679
T3 - 2016 IEEE 6th International Conference on Computational Advances in Bio and Medical Sciences, ICCABS 2016
BT - 2016 IEEE 6th International Conference on Computational Advances in Bio and Medical Sciences, ICCABS 2016
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 13 October 2016 through 15 October 2016
ER -