Prediction of Protein Complexes in Trypanosoma brucei by Protein Correlation Profiling Mass Spectrometry and Machine Learning.
Thomas W M CrozierMichele TintiMark LaranceAngus I LamondMichael A J FergusonPublished in: Molecular & cellular proteomics : MCP (2017)
A disproportionate number of predicted proteins from the genome sequence of the protozoan parasite Trypanosoma brucei, an important human and animal pathogen, are hypothetical proteins of unknown function. This paper describes a protein correlation profiling mass spectrometry approach, using two size exclusion and one ion exchange chromatography systems, to derive sets of predicted protein complexes in this organism by hierarchical clustering and machine learning methods. These hypothesis-generating proteomic data are provided in an open access online data visualization environment (http://134.36.66.166:8083/complex_explorer). The data can be searched conveniently via a user friendly, custom graphical interface. We provide examples of both potential new subunits of known protein complexes and of novel trypanosome complexes of suggested function, contributing to improving the functional annotation of the trypanosome proteome. Data are available via ProteomeXchange with identifier PXD005968.
Keyphrases
- mass spectrometry
- machine learning
- big data
- protein protein
- electronic health record
- amino acid
- liquid chromatography
- endothelial cells
- single cell
- healthcare
- artificial intelligence
- high performance liquid chromatography
- risk assessment
- gas chromatography
- tandem mass spectrometry
- social media
- high speed
- climate change
- candida albicans
- low cost
- induced pluripotent stem cells
- simultaneous determination