DiffSegR: an RNA-seq data driven method for differential expression analysis using changepoint detection.
Arnaud LiehrmannEtienne DelannoyAlexandra Launay-AvonElodie GilbaultOlivier LoudetBenoît CastandetGuillem J RigaillPublished in: NAR genomics and bioinformatics (2023)
To fully understand gene regulation, it is necessary to have a thorough understanding of both the transcriptome and the enzymatic and RNA-binding activities that shape it. While many RNA-Seq-based tools have been developed to analyze the transcriptome, most only consider the abundance of sequencing reads along annotated patterns (such as genes). These annotations are typically incomplete, leading to errors in the differential expression analysis. To address this issue, we present DiffSegR - an R package that enables the discovery of transcriptome-wide expression differences between two biological conditions using RNA-Seq data. DiffSegR does not require prior annotation and uses a multiple changepoints detection algorithm to identify the boundaries of differentially expressed regions in the per-base log 2 fold change. In a few minutes of computation, DiffSegR could rightfully predict the role of chloroplast ribonuclease Mini-III in rRNA maturation and chloroplast ribonuclease PNPase in (3'/5')-degradation of rRNA, mRNA and tRNA precursors as well as intron accumulation. We believe DiffSegR will benefit biologists working on transcriptomics as it allows access to information from a layer of the transcriptome overlooked by the classical differential expression analysis pipelines widely used today. DiffSegR is available at https://aliehrmann.github.io/DiffSegR/index.html.
Keyphrases
- rna seq
- single cell
- high throughput
- genome wide identification
- loop mediated isothermal amplification
- poor prognosis
- machine learning
- label free
- small molecule
- real time pcr
- healthcare
- genome wide
- deep learning
- gene expression
- patient safety
- dna methylation
- emergency department
- big data
- adverse drug
- microbial community
- quality improvement
- sensitive detection
- long non coding rna
- bioinformatics analysis