Login / Signup

Sequence, Structure, and Functional Space of Drosophila De Novo Proteins.

Lasse MiddendorfBharat Ravi IyengarLars A Eicholt
Published in: Genome biology and evolution (2024)
During de novo emergence, new protein coding genes emerge from previously nongenic sequences. The de novo proteins they encode are dissimilar in composition and predicted biochemical properties to conserved proteins. However, functional de novo proteins indeed exist. Both identification of functional de novo proteins and their structural characterization are experimentally laborious. To identify functional and structured de novo proteins in silico, we applied recently developed machine learning based tools and found that most de novo proteins are indeed different from conserved proteins both in their structure and sequence. However, some de novo proteins are predicted to adopt known protein folds, participate in cellular reactions, and to form biomolecular condensates. Apart from broadening our understanding of de novo protein evolution, our study also provides a large set of testable hypotheses for focused experimental studies on structure and function of de novo proteins in Drosophila.
Keyphrases
  • machine learning
  • gene expression
  • deep learning
  • protein protein
  • binding protein