An Untargeted Metabolomics Approach to Study the Variation between Wild and Cultivated Soybeans.
Fakir Shahidullah TareqRaghavendhar R KothaSavithiry NatarajanJianghao SunDevanand L LuthriaPublished in: Molecules (Basel, Switzerland) (2023)
The differential metabolite profiles of four wild and ten cultivated soybeans genotypes were explored using an untargeted metabolomics approach. Ground soybean seed samples were extracted with methanol and water, and metabolic features were obtained using ultra-high-performance liquid chromatography coupled with high-resolution mass spectrometry (UHPLC-HRMS) in both positive and negative ion modes. The UHPLC-HRMS analysis of the two different extracts resulted in the putative identification of 98 metabolites belonging to several classes of phytochemicals, including isoflavones, organic acids, lipids, sugars, amino acids, saponins, and other compounds. The metabolic profile was significantly impacted by the polarity of the extraction solvent. Multivariate analysis showed a clear difference between wild and cultivated soybean cultivars. Unsupervised and supervised learning algorithms were applied to mine the generated data and to pinpoint metabolites differentiating wild and cultivated soybeans. The key identified metabolites differentiating wild and cultivated soybeans were isoflavonoids, free amino acids, and fatty acids. Catechin analogs, cynaroside, hydroxylated unsaturated fatty acid derivatives, amino acid, and uridine diphosphate- N -acetylglucosamine were upregulated in the methanol extract of wild soybeans. In contrast, isoflavonoids and other minor compounds were downregulated in the same soybean extract. This metabolic information will benefit breeders and biotechnology professionals to develop value-added soybeans with improved quality traits.
Keyphrases
- high resolution mass spectrometry
- ultra high performance liquid chromatography
- liquid chromatography
- amino acid
- mass spectrometry
- fatty acid
- ms ms
- gas chromatography
- tandem mass spectrometry
- machine learning
- genetic diversity
- simultaneous determination
- oxidative stress
- computed tomography
- electronic health record
- healthcare
- quality improvement
- dna methylation
- magnetic resonance imaging
- carbon dioxide
- big data