BIRCH: An Automated Workflow for Evaluation, Correction, and Visualization of Batch Effect in Bottom-Up Mass Spectrometry-Based Proteomics Data.
Niveda SundararamanArchana BhatVidya VenkatramanAleksandra BinekZachary DwightNethika R AriyasingheSean EscopeteSandy Y JoungSusan ChengSarah J ParkerJustyna Fert-BoberJennifer E Van EykPublished in: Journal of proteome research (2023)
Recent surges in large-scale mass spectrometry (MS)-based proteomics studies demand a concurrent rise in methods to facilitate reliable and reproducible data analysis. Quantification of proteins in MS analysis can be affected by variations in technical factors such as sample preparation and data acquisition conditions leading to batch effects, which adds to noise in the data set. This may in turn affect the effectiveness of any biological conclusions derived from the data. Here we present Batch-effect Identification, Representation, and Correction of Heterogeneous data (BIRCH), a workflow for analysis and correction of batch effect through an automated, versatile, and easy to use web-based tool with the goal of eliminating technical variation. BIRCH also supports diagnosis of the data to check for the presence of batch effects, feasibility of batch correction, and imputation to deal with missing values in the data set. To illustrate the relevance of the tool, we explore two case studies, including an iPSC-derived cell study and a Covid vaccine study to show different context-specific use cases. Ultimately this tool can be used as an extremely powerful approach for eliminating technical bias while retaining biological bias, toward understanding disease mechanisms and potential therapeutics.
Keyphrases
- mass spectrometry
- electronic health record
- data analysis
- big data
- liquid chromatography
- sars cov
- anaerobic digestion
- ms ms
- machine learning
- small molecule
- radiation therapy
- risk assessment
- high performance liquid chromatography
- gas chromatography
- quantum dots
- artificial intelligence
- single cell
- mesenchymal stem cells
- respiratory syndrome coronavirus