Machine learning analysis of RB-TnSeq fitness data predicts functional gene modules in Pseudomonas putida KT2440.
Andrew J BorchertAlissa C BleemHyun Gyu LimKevin RychelKeven D DooleyZoe A KellermyerTracy L HodgesBernhard O PalssonGregg T BeckhamPublished in: mSystems (2024)
randomly barcoded transposon insertion sequencing data were used as a proof of concept, this approach is applicable to any organism with existing functional genomics data sets and may serve as a useful tool for many valuable applications, such as guiding metabolic engineering efforts in other microbes or understanding functional relationships between virulence-associated genes in pathogenic microbes. Furthermore, this work demonstrates that comparison of data obtained from independent component analysis of transcriptomics and gene fitness datasets can elucidate regulatory-functional relationships between genes, which may have utility in a variety of applications, such as metabolic modeling, strain engineering, or identification of antimicrobial drug targets.
Keyphrases
- electronic health record
- genome wide
- big data
- machine learning
- genome wide identification
- single cell
- staphylococcus aureus
- physical activity
- body composition
- copy number
- pseudomonas aeruginosa
- transcription factor
- artificial intelligence
- dna methylation
- emergency department
- biofilm formation
- quality improvement
- genome wide analysis
- candida albicans