Two novel genes identified by large-scale transcriptomic analysis are essential for biofilm and rugose colony development of Vibrio vulnificus.
Hojun LeeHanhyeok ImSeung-Ho HwangDuhyun KoSang Ho ChoiPublished in: PLoS pathogens (2023)
Many pathogenic bacteria form biofilms to survive under environmental stresses and host immune defenses. Differential expression (DE) analysis of the genes in biofilm and planktonic cells under a single condition, however, has limitations to identify the genes essential for biofilm formation. Independent component analysis (ICA), a machine learning algorithm, was adopted to comprehensively identify the biofilm genes of Vibrio vulnificus, a fulminating human pathogen, in this study. ICA analyzed the large-scale transcriptome data of V. vulnificus cells under various biofilm and planktonic conditions and then identified a total of 72 sets of independently co-regulated genes, iModulons. Among the three iModulons specifically activated in biofilm cells, BrpT-iModulon mainly consisted of known genes of the regulon of BrpT, a transcriptional regulator controlling biofilm formation of V. vulnificus. Interestingly, the BrpT-iModulon additionally contained two novel genes, VV1_3061 and VV2_1694, designated as cabH and brpN, respectively. cabH and brpN were shared in other Vibrio species and not yet identified by DE analyses. Genetic and biochemical analyses revealed that cabH and brpN are directly up-regulated by BrpT. The deletion of cabH and brpN impaired the robust biofilm and rugose colony formation. CabH, structurally similar to the previously known calcium-binding matrix protein CabA, was essential for attachment to the surface. BrpN, carrying an acyltransferase-3 domain as observed in BrpL, played an important role in exopolysaccharide production. Altogether, ICA identified two novel genes, cabH and brpN, which are regulated by BrpT and essential for the development of robust biofilms and rugose colonies of V. vulnificus.
Keyphrases
- biofilm formation
- candida albicans
- pseudomonas aeruginosa
- staphylococcus aureus
- genome wide
- escherichia coli
- bioinformatics analysis
- machine learning
- genome wide identification
- induced apoptosis
- cystic fibrosis
- transcription factor
- cell cycle arrest
- gene expression
- deep learning
- endothelial cells
- risk assessment
- artificial intelligence
- cell death
- single cell
- oxidative stress
- small molecule
- dna methylation
- endoplasmic reticulum stress
- electronic health record
- heat shock