KDeep: a new memory-efficient data extraction method for accurately predicting DNA/RNA transcription factor binding sites.
Saeedeh Akbari Rokn AbadiSeyedehFatemeh TabatabaeiSomayyeh KoohiPublished in: Journal of translational medicine (2023)
This paper addresses the crucial task of identifying DNA/RNA binding sites, which has implications in drug/vaccine design, protein engineering, and cancer research. Existing methods utilize complex neural network structures, diverse input types, and machine learning techniques for feature extraction. However, the growing volume of sequences poses processing challenges. This study introduces KDeep, employing a CNN-LSTM architecture with a novel encoding method called 2Lk. 2Lk enhances prediction accuracy, reduces memory consumption by up to 84%, reduces trainable parameters, and improves interpretability by approximately 79% compared to state-of-the-art approaches. KDeep offers a promising solution for accurate and efficient binding site prediction.
Keyphrases
- neural network
- machine learning
- nucleic acid
- transcription factor
- circulating tumor
- cell free
- working memory
- single molecule
- big data
- high resolution
- papillary thyroid
- artificial intelligence
- convolutional neural network
- electronic health record
- squamous cell
- deep learning
- emergency department
- mass spectrometry
- binding protein
- amino acid
- adverse drug
- solid state