Technical Note: mzML and imzML Libraries for Processing Mass Spectrometry Data with the High-Performance Programming Language Julia.
Ignacio Rosas-RománHéctor Guillén-AlonsoAbigail Moreno-PedrazaRobert WinklerPublished in: Analytical chemistry (2024)
Julia combines the virtues of high-level and low-level programming languages: The code is human-readable, and the performance of the created binaries competes with machine-orientated compilers. Thus, Julia is popular in "Big Data" sciences. Reading mass spectrometry (MS) data with Julia was impossible until now due to missing libraries. Here, we present a Julia library for importing mass spectrometry (MS) data in HUPO standard mzML and imzML formats and demonstrate its function with direct and ambient ionization MS, liquid chromatography-MS, and MS imaging data on standard platforms (Windows, Linux, and Mac OS). The processing speed of Julia for reading imzML MS imaging files was up to 214 times faster than the comparable code in R. Julia can remove bottlenecks for computationally demanding tasks in large-scale MS-Omics and MS imaging data processing workflows and supports their agile development. In addition, time-critical and complex data evaluation tasks become possible, such as following the real-time monitoring of biological processes and pattern recognition in large MS imaging projects. Our mzML/imzML libraries and code examples are available under the terms of the MIT license from https://github.com/CINVESTAV-LABI/julia_mzML_imzML.
Keyphrases
- mass spectrometry
- liquid chromatography
- big data
- high resolution
- gas chromatography
- high performance liquid chromatography
- multiple sclerosis
- ms ms
- electronic health record
- capillary electrophoresis
- high resolution mass spectrometry
- artificial intelligence
- tandem mass spectrometry
- machine learning
- working memory
- data analysis
- air pollution
- deep learning
- particulate matter
- induced pluripotent stem cells
- single cell
- fluorescence imaging