Login / Signup

Avant-garde: an automated data-driven DIA data curation tool.

Alvaro Sebastian Vaca JacomeRyan PecknerNicholas ShulmanKarsten KrugKatherine C DeRuffAdam OfficerKaren E ChristiansonBrendan MacLeanMichael J MacCossSteven A CarrJacob D Jaffe
Published in: Nature methods (2020)
Several challenges remain in data-independent acquisition (DIA) data analysis, such as to confidently identify peptides, define integration boundaries, remove interferences, and control false discovery rates. In practice, a visual inspection of the signals is still required, which is impractical with large datasets. We present Avant-garde as a tool to refine DIA (and parallel reaction monitoring) data. Avant-garde uses a novel data-driven scoring strategy: signals are refined by learning from the dataset itself, using all measurements in all samples to achieve the best optimization. We evaluate the performance of Avant-garde using benchmark DIA datasets and show that it can determine the quantitative suitability of a peptide peak, and reach the same levels of selectivity, accuracy, and reproducibility as manual validation. Avant-garde is complementary to existing DIA analysis engines and aims to establish a strong foundation for subsequent analysis of quantitative mass spectrometry data.
Keyphrases
  • data analysis
  • electronic health record
  • mass spectrometry
  • big data
  • primary care
  • healthcare
  • high throughput
  • liquid chromatography
  • rna seq
  • artificial intelligence
  • single cell
  • deep learning