Login / Signup

Empirical evidence that complexity limits horizontal gene transfer.

Christina L BurchArtur RomanchukMichael KellyYingfang WuCorbin D Jones
Published in: Genome biology and evolution (2023)
Horizontal gene transfer (HGT) is a major contributor to bacterial genome evolution, generating phenotypic diversity, driving the expansion of protein families, and facilitating the evolution of new phenotypes, new metabolic pathways, and new species. Comparative studies of gene gain in bacteria suggest that the frequency with which individual genes successfully undergo HGT varies considerably and may be associated with the number of protein-protein interactions in which the gene participates, i.e., its connectivity. Two non-exclusive hypotheses have emerged to explain why transferability should decrease with connectivity: the Complexity Hypothesis (Jain, Rivera, & Lake, 1999) and the Balance Hypothesis (Papp, Pál, & Hurst, 2003). These hypotheses predict that the functional costs of HGT arise from a failure of divergent homologues to make normal protein-protein interactions or from gene mis-expression, respectively. Here we describe genome-wide assessments of these hypotheses in which we used 74 existing prokaryotic whole genome shotgun libraries to estimate rates of horizontal transfer of genes from taxonomically diverse prokaryotic donors into E. coli. We show that 1) transferability declines as connectivity increases, 2) transferability declines as the divergence between donor and recipient orthologs increases, and that 3) the magnitude of this negative effect of divergence on transferability increases with connectivity. These effects are particularly robust among the translational proteins, which span the widest range of connectivities. Whereas the Complexity Hypotheses explains all three of these observations, the Balance Hypothesis explains only the first one.
Keyphrases
  • genome wide
  • genome wide identification
  • dna methylation
  • copy number
  • resting state
  • functional connectivity
  • white matter
  • poor prognosis
  • gene expression
  • transcription factor
  • binding protein
  • bioinformatics analysis