Login / Signup

TM-search: An Efficient and Effective Tool for Protein Structure Database Search.

Zi LiuChengxin ZhangQidi ZhangYang ZhangDong-Jun Yu
Published in: Journal of chemical information and modeling (2024)
The quickly increasing size of the Protein Data Bank is challenging biologists to develop a more scalable protein structure alignment tool for fast structure database search. Although many protein structure search algorithms and programs have been designed and implemented for this purpose, most require a large amount of computational time. We propose a novel protein structure search approach, TM-search, which is based on the pairwise structure alignment program TM-align and a new iterative clustering algorithm. Benchmark tests demonstrate that TM-search is 27 times faster than a TM-align full database search while still being able to identify ∼90% of all high TM-score hits, which is 2-10 times more than other existing programs such as Foldseek, Dali, and PSI-BLAST.
Keyphrases
  • protein protein
  • amino acid
  • machine learning
  • public health
  • binding protein
  • small molecule
  • big data
  • single cell
  • rna seq
  • artificial intelligence
  • data analysis