Proteomic fingerprinting of Neotropical hard tick species (Acari: Ixodidae) using a self-curated mass spectra reference library.
Rolando A GittensAlejandro AlmanzaKelly L BennettLuis C MejíaJavier E Sánchez-GalánFernando MerchanJonathan KernMatthew J MillerHelen J EsserRobert HwangMay DongLuis Fernando De LeónEric ÁlvarezJose R LoaizaPublished in: PLoS neglected tropical diseases (2020)
Matrix-assisted laser desorption/ionization (MALDI) time-of-flight mass spectrometry is an analytical method that detects macromolecules that can be used for proteomic fingerprinting and taxonomic identification in arthropods. The conventional MALDI approach uses fresh laboratory-reared arthropod specimens to build a reference mass spectra library with high-quality standards required to achieve reliable identification. However, this may not be possible to accomplish in some arthropod groups that are difficult to rear under laboratory conditions, or for which only alcohol preserved samples are available. Here, we generated MALDI mass spectra of highly abundant proteins from the legs of 18 Neotropical species of adult field-collected hard ticks, several of which had not been analyzed by mass spectrometry before. We then used their mass spectra as fingerprints to identify each tick species by applying machine learning and pattern recognition algorithms that combined unsupervised and supervised clustering approaches. Both Principal Component Analysis (PCA) and Linear Discriminant Analysis (LDA) classification algorithms were able to identify spectra from different tick species, with LDA achieving the best performance when applied to field-collected specimens that did have an existing entry in a reference library of arthropod protein spectra. These findings contribute to the growing literature that ascertains mass spectrometry as a rapid and effective method to complement other well-established techniques for taxonomic identification of disease vectors, which is the first step to predict and manage arthropod-borne pathogens.
Keyphrases
- machine learning
- mass spectrometry
- liquid chromatography
- density functional theory
- artificial intelligence
- deep learning
- high performance liquid chromatography
- big data
- high resolution
- systematic review
- bioinformatics analysis
- genetic diversity
- young adults
- antimicrobial resistance
- small molecule
- protein protein
- rna seq
- childhood cancer