Mapping the Transcriptional and Fitness Landscapes of a Pathogenic E. coli Strain: The Effects of Organic Acid Stress under Aerobic and Anaerobic Conditions.
Francesca BushellJohn M J HerbertThippeswamy H SannasiddappaDaniel WarrenA Keith TurnerFrancesco FalcianiPeter A LundPublished in: Genes (2020)
Several methods are available to probe cellular responses to external stresses at the whole genome level. RNAseq can be used to measure changes in expression of all genes following exposure to stress, but gives no information about the contribution of these genes to an organism's ability to survive the stress. The relative contribution of each non-essential gene in the genome to the fitness of the organism under stress can be obtained using methods that use sequencing to estimate the frequencies of members of a dense transposon library grown under different conditions, for example by transposon-directed insertion sequencing (TraDIS). These two methods thus probe different aspects of the underlying biology of the organism. We were interested to determine the extent to which the data from these two methods converge on related genes and pathways. To do this, we looked at a combination of biologically meaningful stresses. The human gut contains different organic short-chain fatty acids (SCFAs) produced by fermentation of carbon compounds, and Escherichia coli is exposed to these in its passage through the gut. Their effect is likely to depend on both the ambient pH and the level of oxygen present. We, therefore, generated RNAseq and TraDIS data on a uropathogenic E. coli strain grown at either pH 7 or pH 5.5 in the presence or absence of three SCFAs (acetic, propionic and butyric), either aerobically or anaerobically. Our analysis identifies both known and novel pathways as being likely to be important under these conditions. There is no simple correlation between gene expression and fitness, but we found a significant overlap in KEGG pathways that are predicted to be enriched following analysis of the data from the two methods, and the majority of these showed a fitness signature that would be predicted from the gene expression data, assuming expression to be adaptive. Genes which are not in the E. coli core genome were found to be particularly likely to show a positive correlation between level of expression and contribution to fitness.
Keyphrases
- escherichia coli
- genome wide
- gene expression
- body composition
- physical activity
- dna methylation
- poor prognosis
- electronic health record
- big data
- genome wide identification
- fatty acid
- endothelial cells
- microbial community
- single cell
- biofilm formation
- binding protein
- copy number
- data analysis
- transcription factor
- long non coding rna
- living cells
- pseudomonas aeruginosa
- staphylococcus aureus
- heat stress
- multidrug resistant
- social media
- sewage sludge
- fluorescent probe