HiTE: a fast and accurate dynamic boundary adjustment approach for full-length transposable element detection and annotation.
Kang HuPeng NiMinghua XuYou ZouJianye ChangXin GaoYaohang LiJue RuanBin HuJian-Xin WangPublished in: Nature communications (2024)
Recent advancements in genome assembly have greatly improved the prospects for comprehensive annotation of Transposable Elements (TEs). However, existing methods for TE annotation using genome assemblies suffer from limited accuracy and robustness, requiring extensive manual editing. In addition, the currently available gold-standard TE databases are not comprehensive, even for extensively studied species, highlighting the critical need for an automated TE detection method to supplement existing repositories. In this study, we introduce HiTE, a fast and accurate dynamic boundary adjustment approach designed to detect full-length TEs. The experimental results demonstrate that HiTE outperforms RepeatModeler2, the state-of-the-art tool, across various species. Furthermore, HiTE has identified numerous novel transposons with well-defined structures containing protein-coding domains, some of which are directly inserted within crucial genes, leading to direct alterations in gene expression. A Nextflow version of HiTE is also available, with enhanced parallelism, reproducibility, and portability.
Keyphrases
- gene expression
- genome wide
- high resolution
- rna seq
- loop mediated isothermal amplification
- crispr cas
- real time pcr
- label free
- multidrug resistant
- single cell
- genetic diversity
- protein protein
- current status
- big data
- machine learning
- silver nanoparticles
- bioinformatics analysis
- psychometric properties
- genome wide identification