Comparative Genomic Analysis Reveals Gene Content Diversity, Phylogenomic Contour, Putative Virulence Determinants, and Potential Diagnostic Markers within Pythium insidiosum Traits.
Weerayuth KittichotiratThidarat RujirawatPreecha PatumcharoenpolTheerapong KrajaejunPublished in: Journal of fungi (Basel, Switzerland) (2023)
Pythium insidiosum has successfully evolved into a human/animal filamentous pathogen, causing pythiosis, a life-threatening disease, worldwide. The specific rDNA-based genotype of P. insidiosum (clade I, II, or III) is associated with the different hosts and disease prevalence. Genome evolution of P. insidiosum can be driven by point mutations, pass vertically to the offspring, and diverge into distinct lineages, leading to different virulence, including the ability to be unrecognized by the host. We conducted comprehensive genomic comparisons of 10 P. insidiosum strains and 5 related Pythium species using our online "Gene Table" software to investigate the pathogen's evolutionary history and pathogenicity. In total, 245,378 genes were found in all 15 genomes and grouped into 45,801 homologous gene clusters. Gene contents among P. insidiosum strains varied by as much as 23%. Our results showed a strong agreement between the phylogenetic analysis of 166 core genes (88,017 bp) identified across all genomes and the hierarchical clustering analysis of gene presence/absence profiles, suggesting divergence of P. insidiosum into two groups, clade I/II and clade III strains, and the subsequent segregation of clade I and clade II. A stringent gene content comparison using the Pythium Gene Table provided 3263 core genes exclusively presented in all P. insidiosum strains but no other Pythium species, which could involve host-specific pathogenesis and serve as biomarkers for diagnostic purposes. More studies focusing on characterizing the biological function of the core genes (including the just-identified putative virulence genes encoding hemagglutinin/adhesin and reticulocyte-binding protein) are needed to explore the biology and pathogenicity of this pathogen.
Keyphrases
- genome wide
- genome wide identification
- copy number
- escherichia coli
- dna methylation
- genome wide analysis
- pseudomonas aeruginosa
- staphylococcus aureus
- biofilm formation
- transcription factor
- binding protein
- cystic fibrosis
- endothelial cells
- dna damage
- social media
- oxidative stress
- bioinformatics analysis
- risk assessment
- skeletal muscle
- high fat diet
- adipose tissue
- climate change
- human health
- pluripotent stem cells