Machine learning for deciphering cell heterogeneity and gene regulation.
Michael SchererFlorian SchmidtOlga LazarevaJörn WalterJan BaumbachMarcel H SchulzMarkus ListPublished in: Nature computational science (2021)
Epigenetics studies inheritable and reversible modifications of DNA that allow cells to control gene expression throughout their development and in response to environmental conditions. In computational epigenomics, machine learning is applied to study various epigenetic mechanisms genome wide. Its aim is to expand our understanding of cell differentiation, that is their specialization, in health and disease. Thus far, most efforts focus on understanding the functional encoding of the genome and on unraveling cell-type heterogeneity. Here, we provide an overview of state-of-the-art computational methods and their underlying statistical concepts, which range from matrix factorization and regularized linear regression to deep learning methods. We further show how the rise of single-cell technology leads to new computational challenges and creates opportunities to further our understanding of epigenetic regulation.
Keyphrases
- single cell
- machine learning
- gene expression
- dna methylation
- genome wide
- deep learning
- rna seq
- artificial intelligence
- high throughput
- induced apoptosis
- public health
- big data
- healthcare
- mental health
- cell cycle arrest
- human health
- stem cells
- single molecule
- quality improvement
- convolutional neural network
- health promotion
- cell death
- signaling pathway
- endoplasmic reticulum stress
- oxidative stress
- climate change
- copy number
- bone marrow
- pi k akt
- circulating tumor cells