Machine learning uncovers independently regulated modules in the Bacillus subtilis transcriptome.
Kevin Rychel, Anand V Sastry, Bernhard O Palsson
Published in: Nature Communications (2020)
The transcriptional regulatory network (TRN) of Bacillus subtilis coordinates cellular functions of fundamental interest, including metabolism, biofilm formation, and sporulation. Here, we use unsupervised machine learning to modularize the transcriptome and quantitatively describe regulatory activity under diverse conditions, creating an unbiased summary of gene expression. We obtain 83 independently modulated gene sets that explain most of the variance in expression and demonstrate that 76% of them represent the effects of known regulators. The TRN structure and its condition-dependent activity uncover putative or recently discovered roles for at least five regulons, such as a relationship between histidine utilization and quorum sensing. The TRN also facilitates quantification of population-level sporulation states. As this TRN covers the majority of the transcriptome and concisely characterizes the global expression state, it could inform research on nearly every aspect of transcriptional regulation in B. subtilis.
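The quantities the abstract reports (gene sets, variance explained, and the share of sets matching known regulators) can be illustrated with a small sketch. The abstract does not name the algorithm, so this assumes only that the decomposition is linear: an expression matrix X (genes by conditions) approximated as M·A, where M holds gene weights per module and A holds module activities per condition. Every gene name, regulator name, number, threshold, and the Jaccard matching rule below is a hypothetical placeholder, not the authors' data or procedure.

```python
# Toy illustration only: all names, values, and thresholds are invented.

def matmul(m, a):
    """Multiply a genes x modules matrix by a modules x conditions matrix."""
    return [[sum(row[k] * a[k][j] for k in range(len(a)))
             for j in range(len(a[0]))] for row in m]

def explained_variance(x, x_hat):
    """Fraction of the total variance in x captured by the reconstruction x_hat."""
    values = [v for row in x for v in row]
    mean = sum(values) / len(values)
    ss_tot = sum((v - mean) ** 2 for v in values)
    ss_res = sum((v - w) ** 2
                 for row, row_hat in zip(x, x_hat)
                 for v, w in zip(row, row_hat))
    return 1.0 - ss_res / ss_tot

def gene_sets(genes, m, threshold):
    """For each module, keep the genes whose absolute weight exceeds the threshold."""
    return [{g for g, row in zip(genes, m) if abs(row[k]) > threshold}
            for k in range(len(m[0]))]

def fraction_matching(sets, regulons, min_jaccard=0.5):
    """Share of gene sets whose Jaccard overlap with some known regulon is high enough."""
    def jaccard(s, t):
        union = s | t
        return len(s & t) / len(union) if union else 0.0
    matched = sum(1 for s in sets
                  if any(jaccard(s, r) >= min_jaccard for r in regulons.values()))
    return matched / len(sets)

# Hypothetical gene weights (6 genes x 2 modules) and activities (2 modules x 3 conditions).
genes = ["g1", "g2", "g3", "g4", "g5", "g6"]
M = [[1.0, 0.0], [0.9, 0.1], [0.8, 0.0], [0.0, 1.0], [0.1, 0.9], [0.0, 0.05]]
A = [[1.0, 0.0, 2.0], [0.0, 1.0, 1.0]]

# Build a toy "observed" matrix as the reconstruction plus a small fixed residual.
X_hat = matmul(M, A)
X = [[v + (0.05 if (i + j) % 2 == 0 else -0.05) for j, v in enumerate(row)]
     for i, row in enumerate(X_hat)]

# Hypothetical known regulons used for comparison.
regulons = {"regulator_A": {"g1", "g2", "g3"}, "regulator_B": {"g4", "g6"}}

ev = explained_variance(X, X_hat)
sets = gene_sets(genes, M, threshold=0.5)
frac = fraction_matching(sets, regulons)

print(f"explained variance: {ev:.3f}")
print("gene sets:", [sorted(s) for s in sets])
print(f"fraction matching a known regulon: {frac:.2f}")
```

In this toy case the first module coincides with a hypothetical regulon and the second does not, which is the kind of comparison that separates modules reflecting known regulators from those suggesting putative roles.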
Keyphrases
- bacillus subtilis
- gene expression
- machine learning
- transcription factor
- biofilm formation
- genome wide
- poor prognosis
- rna seq
- single cell
- dna methylation
- staphylococcus aureus
- pseudomonas aeruginosa
- artificial intelligence
- candida albicans
- big data
- binding protein
- escherichia coli
- genome wide identification
- copy number
- deep learning
- heat shock
- heat shock protein