Transposable element annotation in non-model species: The benefits of species-specific repeat libraries using semi-automated EDTA and DeepTE de novo pipelines.

Ellen A Bell, Christopher L Butler, Claudio Oliveira, Sarah Marburger, Levi Yant, Martin I Taylor
Published in: Molecular Ecology Resources (2021)
Transposable elements (TEs) are significant genomic components that can be detected either through sequence homology against existing databases or de novo, with the latter potentially reducing the risk of underestimating TE abundance. Here, we describe the semi-automated generation of a de novo TE library using the newly developed EDTA pipeline and DeepTE classifier in a non-model teleost (Corydoras fulleri). Using both genomic and transcriptomic data, we assess this de novo pipeline's performance across four TE-based metrics: (i) abundance, (ii) composition, (iii) fragmentation, and (iv) age distributions. We then compare the results to those obtained with a curated teleost library (Danio rerio). We identify quantitative differences in these metrics and highlight how TE library choice can have major impacts on TE-based estimates in non-model species.
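To make the four metrics concrete, the following is a minimal Python sketch of how each could be computed from a list of TE annotations. It is not the authors' method: the record layout (class, start, end, percent divergence), the toy values, the genome size, and all function names are assumptions made for illustration. Mean copy length is used here only as a crude stand-in for fragmentation, and divergence bins as a common proxy for TE age.

```python
from collections import defaultdict

# Hypothetical annotation records: (te_class, start, end, divergence_percent).
# Coordinates are half-open and assumed non-overlapping; values are made up.
ANNOTATIONS = [
    ("DNA", 100, 400, 5.0),
    ("LTR", 1000, 1600, 12.0),
    ("DNA", 2000, 2100, 22.0),
    ("LINE", 3000, 3500, 8.0),
]
GENOME_SIZE = 10_000  # hypothetical assembly length in bp


def te_abundance(records, genome_size):
    """(i) Fraction of the genome annotated as TE."""
    return sum(end - start for _, start, end, _ in records) / genome_size


def te_composition(records):
    """(ii) Share of total TE base pairs contributed by each TE class."""
    per_class = defaultdict(int)
    for te_class, start, end, _ in records:
        per_class[te_class] += end - start
    total = sum(per_class.values())
    return {te_class: bp / total for te_class, bp in per_class.items()}


def mean_copy_length(records):
    """(iii) Mean annotated copy length per class, a crude fragmentation proxy."""
    lengths = defaultdict(list)
    for te_class, start, end, _ in records:
        lengths[te_class].append(end - start)
    return {te_class: sum(v) / len(v) for te_class, v in lengths.items()}


def divergence_histogram(records, bin_width=10):
    """(iv) TE base pairs per divergence bin, keyed by the bin's lower edge."""
    bins = defaultdict(int)
    for _, start, end, divergence in records:
        bins[int(divergence // bin_width) * bin_width] += end - start
    return dict(bins)


if __name__ == "__main__":
    print("abundance:", te_abundance(ANNOTATIONS, GENOME_SIZE))
    print("composition:", te_composition(ANNOTATIONS))
    print("mean copy length:", mean_copy_length(ANNOTATIONS))
    print("divergence bins:", divergence_histogram(ANNOTATIONS))
```

Running the same functions on annotations produced with two different libraries (for example a de novo library and a curated one) would give paired values for each metric, which is the kind of comparison the abstract describes.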
Keyphrases
  • machine learning
  • deep learning
  • high throughput
  • high resolution
  • copy number
  • antibiotic resistance genes
  • genetic diversity
  • mass spectrometry