Using clusterProfiler to characterize multiomics data.
Shuangbin XuErqiang HuYantong CaiZijing XieXiao LuoLi ZhanWenli TangQianwen WangBingdong LiuRui WangWenqin XieTianzhi WuLiwei XieGuangchuang YuPublished in: Nature protocols (2024)
With the advent of multiomics, software capable of multidimensional enrichment analysis has become increasingly crucial for uncovering gene set variations in biological processes and disease pathways. This is essential for elucidating disease mechanisms and identifying potential therapeutic targets. clusterProfiler stands out for its comprehensive utilization of databases and advanced visualization features. Importantly, clusterProfiler supports various biological knowledge, including Gene Ontology and Kyoto Encyclopedia of Genes and Genomes, through performing over-representation and gene set enrichment analyses. A key feature is that clusterProfiler allows users to choose from various graphical outputs to visualize results, enhancing interpretability. This protocol describes innovative ways in which clusterProfiler has been used for integrating metabolomics and metagenomics analyses, identifying and characterizing transcription factors under stress conditions, and annotating cells in single-cell studies. In all cases, the computational steps can be completed within ~2 min. clusterProfiler is released through the Bioconductor project and can be accessed via https://bioconductor.org/packages/clusterProfiler/ .
Keyphrases
- genome wide identification
- genome wide
- transcription factor
- copy number
- single cell
- induced apoptosis
- randomized controlled trial
- big data
- genome wide analysis
- dna methylation
- machine learning
- mass spectrometry
- gene expression
- oxidative stress
- high throughput
- rna seq
- risk assessment
- cell proliferation
- data analysis
- stress induced
- neural network