Differential gene expression analysis tools exhibit substandard performance for long non-coding RNA-sequencing data.
Alemu Takele AssefaKatrijn De PaepeCeline EveraertPieter MestdaghOlivier ThasJo VandesompelePublished in: Genome biology (2018)
Overall, linear modeling with empirical Bayes moderation (limma) and a non-parametric approach (SAMSeq) showed good control of the false discovery rate and reasonable sensitivity. Of note, for achieving a sensitivity of at least 50%, more than 80 samples are required when studying expression levels in realistic settings such as in clinical cancer research. About half of the methods showed a substantial excess of false discoveries, making these methods unreliable for DE analysis and jeopardizing reproducible science. The detailed results of our study can be consulted through a user-friendly web application, giving guidance on selection of the optimal DE tool ( http://statapps.ugent.be/tools/AppDGE/ ).