MetMiner: A user-friendly pipeline for large-scale plant metabolomics data analysis.
Xiao Wang, Shuang Liang, Wenqi Yang, Ke Yu, Fei Liang, Bing Zhao, Xiang Zhu, Chao Zhou, Luis Alejandro Jose Mur, Jeremy A Roberts, Junli Zhang, Xuebin Zhang
Published in: Journal of Integrative Plant Biology (2024)
The use of metabolomics to explore the metabolic mechanisms underlying plant fitness and adaptation to dynamic environments is growing, highlighting the need for an efficient and user-friendly toolkit tailored to the extensive datasets that metabolomics studies generate. Current protocols for metabolome data analysis often struggle with large-scale datasets or require programming skills. To address this, we present MetMiner (https://github.com/ShawnWx2019/MetMiner), a user-friendly, full-featured pipeline designed specifically for plant metabolomics data analysis. Built on R Shiny, MetMiner can be deployed on servers to use additional computational resources when processing large-scale datasets. MetMiner ensures transparency, traceability, and reproducibility throughout the analytical process. Its intuitive interface provides robust data interaction and graphical capabilities, enabling users without prior programming skills to engage deeply in data analysis. Additionally, we constructed a plant-specific mass spectrometry database and integrated it into the MetMiner pipeline to optimize metabolite annotation. We have also developed MDAtoolkits, a complete set of tools for statistical analysis, metabolite classification, and enrichment analysis, to facilitate the mining of biological meaning from the datasets. Moreover, we propose an iterative weighted gene co-expression network analysis strategy for efficient screening of biomarker metabolites in large-scale metabolomics data mining. In two case studies, we validated MetMiner's efficiency in data mining and its robustness in metabolite annotation. Together, these results indicate that the MetMiner pipeline is a promising solution for plant metabolomics analysis and a valuable, easy-to-use tool for the scientific community.
Keyphrases
- data analysis
- mass spectrometry
- liquid chromatography
- network analysis
- rna seq
- gas chromatography
- high resolution
- capillary electrophoresis
- high performance liquid chromatography
- machine learning
- genome wide
- artificial intelligence
- tandem mass spectrometry
- dna methylation
- ms ms
- copy number
- genome wide identification