Application of individualized differential expression analysis in human cancer proteome.
Yachen LiuYalan LinWenxian YangYuxiang LinYujuan WuZheyang ZhangNuoqi LinXianlong WangMengsha TongRongshan YuPublished in: Briefings in bioinformatics (2022)
Liquid chromatography-mass spectrometry-based quantitative proteomics can measure the expression of thousands of proteins from biological samples and has been increasingly applied in cancer research. Identifying differentially expressed proteins (DEPs) between tumors and normal controls is commonly used to investigate carcinogenesis mechanisms. While differential expression analysis (DEA) at an individual level is desired to identify patient-specific molecular defects for better patient stratification, most statistical DEP analysis methods only identify deregulated proteins at the population level. To date, robust individualized DEA algorithms have been proposed for ribonucleic acid data, but their performance on proteomics data is underexplored. Herein, we performed a systematic evaluation on five individualized DEA algorithms for proteins on cancer proteomic datasets from seven cancer types. Results show that the within-sample relative expression orderings (REOs) of protein pairs in normal tissues were highly stable, providing the basis for individualized DEA for proteins using REOs. Moreover, individualized DEA algorithms achieve higher precision in detecting sample-specific deregulated proteins than population-level methods. To facilitate the utilization of individualized DEA algorithms in proteomics for prognostic biomarker discovery and personalized medicine, we provide Individualized DEP Analysis IDEPAXMBD (XMBD: Xiamen Big Data, a biomedical open software initiative in the National Institute for Data Science in Health and Medicine, Xiamen University, China.) (https://github.com/xmuyulab/IDEPA-XMBD), which is a user-friendly and open-source Python toolkit that integrates individualized DEA algorithms for DEP-associated deregulation pattern recognition.
Keyphrases
- machine learning
- big data
- mass spectrometry
- papillary thyroid
- liquid chromatography
- squamous cell
- deep learning
- artificial intelligence
- poor prognosis
- electronic health record
- healthcare
- high resolution
- public health
- gene expression
- data analysis
- squamous cell carcinoma
- small molecule
- mental health
- label free
- case report
- tandem mass spectrometry
- young adults
- childhood cancer
- gas chromatography
- risk assessment
- health information
- social media
- ms ms
- long non coding rna
- single cell
- rna seq
- solid phase extraction