LI-Detector: a Method for Curating Ordered Gene-Replacement Libraries.
Emily C A GoodallFaye C MorrisSamantha A McKeandRudi SullivanIsabel A WarnerEmma SheehanGabriela BoelterChristopher IckeAdam F CunninghamJeffrey A ColeManuel BanzhafJack A BryantIan R HendersonPublished in: Microbiology spectrum (2022)
In recent years the availability of genome sequence information has grown logarithmically resulting in the identification of a plethora of uncharacterized genes. To address this gap in functional annotation, many high-throughput screens have been devised to uncover novel gene functions. Gene-replacement libraries are one such tool that can be screened in a high-throughput way to link genotype and phenotype and are key community resources. However, for a phenotype to be attributed to a specific gene, there needs to be confidence in the genotype. Construction of large libraries can be laborious and occasionally errors will arise. Here, we present a rapid and accurate method for the validation of any ordered library where a gene has been replaced or disrupted by a uniform linear insertion (LI). We applied our method (LI-detector) to the well-known Keio library of Escherichia coli gene-deletion mutants. Our method identified 3,718 constructed mutants out of a total of 3,728 confirmed isolates, with a success rate of 99.7% for identifying the correct kanamycin cassette position. This data set provides a benchmark for the purity of the Keio mutants and a screening method for mapping the position of any linear insertion, such as an antibiotic resistance cassette in any ordered library. IMPORTANCE The construction of ordered gene replacement libraries requires significant investment of time and resources to create a valuable community resource. During construction, technical errors may result in a limited number of incorrect mutants being made. Such mutants may confound the output of subsequent experiments. Here, using the remarkable E. coli Keio knockout library, we describe a method to rapidly validate the construction of every mutant.
Keyphrases
- genome wide
- high throughput
- copy number
- genome wide identification
- escherichia coli
- healthcare
- dna methylation
- genome wide analysis
- emergency department
- wild type
- magnetic resonance imaging
- gene expression
- mass spectrometry
- cystic fibrosis
- staphylococcus aureus
- deep learning
- wastewater treatment
- patient safety
- single cell
- pseudomonas aeruginosa
- health information
- big data
- high density
- neural network