Systematic identification of gene-altering programmed inversions across the bacterial domain.
Oren MilmanIdan YelinRoy KishonyPublished in: Nucleic acids research (2023)
Programmed chromosomal inversions allow bacteria to generate intra-population genotypic and functional heterogeneity, a bet-hedging strategy important in changing environments. Some programmed inversions modify coding sequences, producing different alleles in several gene families, most notably in specificity-determining genes such as Type I restriction-modification systems, where systematic searches revealed cross phylum abundance. Yet, a broad, gene-independent, systematic search for gene-altering programmed inversions has been absent, and little is known about their genomic sequence attributes and prevalence across gene families. Here, identifying intra-species variation in genomes of over 35 000 species, we develop a predictive model of gene-altering inversions, revealing key attributes of their genomic sequence attributes, including gene-pseudogene size asymmetry and orientation bias. The model predicted over 11,000 gene-altering loci covering known targeted gene families, as well as novel targeted families including Type II restriction-modification systems, a protein of unknown function, and a fusion-protein containing conjugative-pilus and phage tail domains. Publicly available long-read sequencing datasets validated representatives of these newly predicted inversion-targeted gene families, confirming intra-population genetic heterogeneity. Together, these results reveal gene-altering programmed inversions as a key strategy adopted across the bacterial domain, and highlight programmed inversions that modify Type II restriction-modification systems as a possible new mechanism for maintaining intra-population heterogeneity.