Correlation between CRISPR Loci Diversity in Three Enterobacterial Taxa.
Dumitrana IordacheGabriela-Maria BaciOana CăprițăAnca FarkasAndreea LupAnca Livia Butiuc-KeulPublished in: International journal of molecular sciences (2022)
CRISPR-Cas is an adaptive immunity system of prokaryotes, composed of CRISPR arrays and the associated proteins. The successive addition of spacer sequences in the CRISPR array has made the system a valuable molecular marker, with multiple applications. Due to the high degree of polymorphism of the CRISPR loci, their comparison in bacteria from various sources may provide insights into the evolution and spread of the CRISPR-Cas systems. The aim of this study was to establish a correlation between the enterobacterial CRISPR loci, the sequence of direct repeats (DR), and the number of spacer units, along with the geographical origin and collection source. For this purpose, 3474 genomes containing CRISPR loci from the CRISPRCasdb of Salmonella enterica, Escherichia coli, and Klebsiella pneumoniae were analyzed, and the information regarding the isolates was recorded from the NCBI database. The most prevalent was the I-E CRISPR-Cas system in all three studied taxa. E. coli also presents the I-F type, but in a much lesser percentage. The systems found in K. pneumoniae can be classified into I-E and I-E*. The I-E and I-F systems have two CRISPR loci, while I-E* has only one locus upstream of the Cas cluster. PCR primers have been developed in this study for each CRISPR locus. Distinct clustering was not evident, but statistically significant relationships occurred between the different CRISPR loci and the number of spacer units. For each of the queried taxa, the number of spacers was significantly different ( p < 0.01) by origin (Africa, Asia, Australia and Oceania, Europe, North America, and South America) but was not linked to the isolation source type (human, animal, plant, food, or laboratory strains).