Bound2Learn: a machine learning approach for classification of DNA-bound proteins from single-molecule tracking experiments.
Nitin KapadiaZiad W El-HajjRodrigo Reyes-LamothePublished in: Nucleic acids research (2021)
DNA-bound proteins are essential elements for the maintenance, regulation, and use of the genome. The time they spend bound to DNA provides useful information on their stability within protein complexes and insight into the understanding of biological processes. Single-particle tracking allows for direct visualization of protein-DNA kinetics, however, identifying whether a molecule is bound to DNA can be non-trivial. Further complications arise when tracking molecules for extended durations in processes with slow kinetics. We developed a machine learning approach, termed Bound2Learn, using output from a widely used tracking software, to robustly classify tracks in order to accurately estimate residence times. We validated our approach in silico, and in live-cell data from Escherichia coli and Saccharomyces cerevisiae. Our method has the potential for broad utility and is applicable to other organisms.
Keyphrases
- single molecule
- machine learning
- circulating tumor
- cell free
- escherichia coli
- saccharomyces cerevisiae
- living cells
- atomic force microscopy
- big data
- deep learning
- artificial intelligence
- genome wide
- risk factors
- circulating tumor cells
- gene expression
- staphylococcus aureus
- health information
- amino acid
- cystic fibrosis
- small molecule
- human health