Login / Signup

CatNet: Sequence-based deep learning with cross-attention mechanism for identifying endocrine-disrupting chemicals.

Lu ZhaoQiao XueHuazhou ZhangYuxing HaoHang YiXian LiuWenxiao PanJianjie FuAiqian Zhang
Published in: Journal of hazardous materials (2023)
Endocrine-disrupting chemicals (EDCs) pose significant environmental and health risks due to their potential to interfere with nuclear receptors (NRs), key regulators of physiological processes. Despite the evident risks, the majority of existing research narrows its focus on the interaction between compounds and the individual NR target, neglecting a comprehensive assessment across the entire NR family. In response, this study assembled a comprehensive human NR dataset, capturing 49,244 interactions between 35,467 unique compounds and 42 NRs. We introduced a cross-attention network framework, "CatNet", innovatively integrating compound and protein representations through cross-attention mechanisms. The results showed that CatNet model achieved excellent performance with an area under the receiver operating characteristic curve (AUC ROC ) = 0.916 on the test set, and exhibited reliable generalization on unseen compound-NR pairs. A distinguishing feature of our research is its capacity to expand to novel targets. Beyond its predictive accuracy, CatNet offers a valuable mechanistic perspective on compound-NR interactions through feature visualization. Augmenting the utility of our research, we have also developed a graphical user interface, empowering researchers to predict chemical binding to diverse NRs. Our model enables the prediction of human NR-related EDCs and shows the potential to identify EDCs related to other targets.
Keyphrases
  • deep learning
  • working memory
  • endothelial cells
  • human health
  • machine learning
  • induced pluripotent stem cells
  • risk assessment
  • pluripotent stem cells
  • transcription factor
  • amino acid