Finding associations in a heterogeneous setting: statistical test for aberration enrichment.
Aziz M MezliniSudeshna DasAnna GoldenbergPublished in: Genome medicine (2021)
Most two-group statistical tests find broad patterns such as overall shifts in mean, median, or variance. These tests may not have enough power to detect effects in a small subset of samples, e.g., a drug that works well only on a few patients. We developed a novel statistical test targeting such effects relevant for clinical trials, biomarker discovery, feature selection, etc. We focused on finding meaningful associations in complex genetic diseases in gene expression, miRNA expression, and DNA methylation. Our test outperforms traditional statistical tests in simulated and experimental data and detects potentially disease-relevant genes with heterogeneous effects.
Keyphrases
- gene expression
- dna methylation
- clinical trial
- end stage renal disease
- small molecule
- chronic kidney disease
- machine learning
- newly diagnosed
- randomized controlled trial
- deep learning
- high throughput
- prognostic factors
- artificial intelligence
- copy number
- electronic health record
- open label
- cancer therapy
- adverse drug
- long non coding rna
- neural network