AtSNP_TATAdb: Candidate Molecular Markers of Plant Advantages Related to Single Nucleotide Polymorphisms within Proximal Promoters of Arabidopsis thaliana L.
Anton G BogomolovKarina ZolotarevaSergey FilonovIrina ChadaevaDmitry RasskazovEkaterina SharypovaNikolay PodkolodnyyPetr PonomarenkoLudmila SavinkovaNatalya TverdokhlebBato KhandaevEkaterina KondratyukOlga PodkolodnayaElena ZemlyanskayaNikolay A KolchanovMikhail P PonomarenkoPublished in: International journal of molecular sciences (2024)
The mainstream of the post-genome target-assisted breeding in crop plant species includes biofortification such as high-throughput phenotyping along with genome-based selection. Therefore, in this work, we used the Web-service Plant_SNP_TATA_Z-tester, which we have previously developed, to run a uniform in silico analysis of the transcriptional alterations of 54,013 protein-coding transcripts from 32,833 Arabidopsis thaliana L. genes caused by 871,707 SNPs located in the proximal promoter region. The analysis identified 54,993 SNPs as significantly decreasing or increasing gene expression through changes in TATA-binding protein affinity to the promoters. The existence of these SNPs in highly conserved proximal promoters may be explained as intraspecific diversity kept by the stabilizing natural selection. To support this, we hand-annotated papers on some of the Arabidopsis genes possessing these SNPs or on their orthologs in other plant species and demonstrated the effects of changes in these gene expressions on plant vital traits. We integrated in silico estimates of the TBP-promoter affinity in the AtSNP_TATAdb knowledge base and showed their significant correlations with independent in vivo experimental data. These correlations appeared to be robust to variations in statistical criteria, genomic environment of TATA box regions, plants species and growing conditions.
Keyphrases
- genome wide
- arabidopsis thaliana
- dna methylation
- gene expression
- transcription factor
- binding protein
- copy number
- high throughput
- healthcare
- cell wall
- genome wide identification
- molecular docking
- climate change
- mental health
- single cell
- mass spectrometry
- protein protein
- electronic health record
- small molecule
- amino acid
- big data
- artificial intelligence
- data analysis