Efficient and accurate causal inference with hidden confounders from genome-transcriptome variation data.
Lingfei WangTom MichoelPublished in: PLoS computational biology (2017)
Mapping gene expression as a quantitative trait using whole genome-sequencing and transcriptome analysis allows to discover the functional consequences of genetic variation. We developed a novel method and ultra-fast software Findr for higly accurate causal inference between gene expression traits using cis-regulatory DNA variations as causal anchors, which improves current methods by taking into consideration hidden confounders and weak regulations. Findr outperformed existing methods on the DREAM5 Systems Genetics challenge and on the prediction of microRNA and transcription factor targets in human lymphoblastoid cells, while being nearly a million times faster. Findr is publicly available at https://github.com/lingfeiwang/findr.
Keyphrases
- gene expression
- high resolution
- genome wide
- transcription factor
- dna methylation
- single cell
- induced apoptosis
- endothelial cells
- cell cycle arrest
- rna seq
- electronic health record
- circulating tumor
- cell free
- induced pluripotent stem cells
- dna binding
- data analysis
- endoplasmic reticulum stress
- cell death
- single molecule
- pluripotent stem cells
- machine learning
- nucleic acid
- signaling pathway
- oxidative stress
- circulating tumor cells
- pi k akt
- high density