Login / Signup

Diverse origins of near-identical antifreeze proteins in unrelated fish lineages provide insights into evolutionary mechanisms of new gene birth and protein sequence convergence.

Nathan RivesVinita LambaC-H Christina ChengXuan Zhuang
Published in: Molecular biology and evolution (2024)
Determining the origins of novel genes and the mechanisms driving the emergence of new functions is challenging yet crucial for understanding evolutionary innovations. Recently evolved fish antifreeze proteins (AFPs) offer a unique opportunity to explore these processes, particularly the near-identical type I AFP (AFPI) found in four phylogenetically divergent fish taxa. This study tested the hypothesis of protein sequence convergence beyond functional convergence in three unrelated AFPI-bearing fish lineages. Through comprehensive comparative analyses of newly sequenced genomes of winter flounder and grubby sculpin, along with available high-quality genomes of cunner and 14 other related species, the study revealed that near-identical AFPI proteins originated from distinct genetic precursors in each lineage. Each lineage independently evolved a de novo coding region for the novel ice-binding protein while repurposing fragments from their respective ancestors into potential regulatory regions, representing partial de novo origination-a process that bridges de novo gene formation and the neofunctionalization of duplicated genes. The study supports existing models of new gene origination and introduces new ones: the Innovation-Amplification-Divergence (IAD) model, where novel changes precede gene duplication; the newly proposed Duplication-Degeneration-Divergence (DDD) model, which describes new functions arising from degenerated pseudogenes; and the DDD gene fission model, where each new sibling gene differentially degenerate and renovate distinct functional domains from their parental gene. These findings highlight the diverse evolutionary pathways through which a novel functional gene with convergent sequences at the protein level can evolve across divergent species, advancing our understanding of the mechanistic intricacies in new gene formation.
Keyphrases
  • genome wide
  • genome wide identification
  • copy number
  • binding protein
  • dna methylation
  • gene expression
  • transcription factor
  • single cell
  • small molecule
  • drug discovery
  • human health