Complete genome sequence and annotation of the laboratory reference strain Shigella flexneri serotype 5a M90T and genome-wide transcriptional start site determination.
Ramón Cervantes-RiveraSophie TronnetAndrea PuharPublished in: BMC genomics (2020)
We provide the first complete genome for S. flexneri serotype 5a, specifically the laboratory reference strain M90T. Our work opens the possibility of employing S. flexneri M90T in high-quality systems biology studies such as transcriptomic and differential expression analyses or in genome evolution studies. Moreover, the catalogue of TSS that we report here can be used in molecular pathogenesis studies as a resource to know which genes are transcribed before infection of host cells. The genome sequence, together with the analysis of transcriptional start sites, is also a valuable tool for precise genetic manipulation of S. flexneri 5a M90T. Further, we present a new hybrid strategy to prepare gapless, highly accurate genome sequences. Unlike currently used hybrid strategies combining long- and short-read DNA sequencing technologies to maximize accuracy, our workflow using long-read DNA sequencing and short-read RNA sequencing provides the added value of using non-redundant technologies, which yield distinct, exploitable datasets.
Keyphrases
- genome wide
- single molecule
- dna methylation
- single cell
- rna seq
- copy number
- gene expression
- circulating tumor
- dengue virus
- case control
- induced apoptosis
- cell free
- heat shock
- cell proliferation
- klebsiella pneumoniae
- high resolution
- electronic health record
- escherichia coli
- zika virus
- endoplasmic reticulum stress
- signaling pathway
- mass spectrometry
- molecularly imprinted
- heat shock protein