Technical Note: mzML and imzML Libraries for Processing Mass Spectrometry Data with the High-Performance Programming Language Julia.
Ignacio Rosas-RománHéctor Guillén-AlonsoAbigail Moreno-PedrazaRobert WinklerPublished in: Analytical chemistry (2024)
Julia combines the virtues of high-level and low-level programming languages: The code is human-readable, and the performance of the created binaries competes with machine-orientated compilers. Thus, Julia is popular in "Big Data" sciences. Reading mass spectrometry (MS) data with Julia was impossible until now due to missing libraries. Here, we present a Julia library for importing mass spectrometry (MS) data in HUPO standard mzML and imzML formats and demonstrate its function with direct and ambient ionization MS, liquid chromatography-MS, and MS imaging data on standard platforms (Windows, Linux, and Mac OS). The processing speed of Julia for reading imzML MS imaging files was up to 214 times faster than the comparable code in R. Julia can remove bottlenecks for computationally demanding tasks in large-scale MS-Omics and MS imaging data processing workflows and supports their agile development. In addition, time-critical and complex data evaluation tasks become possible, such as following the real-time monitoring of biological processes and pattern recognition in large MS imaging projects. Our mzML/imzML libraries and code examples are available under the terms of the MIT license from https://github.com/CINVESTAV-LABI/julia_mzML_imzML.
Keyphrases
- mass spectrometry
- liquid chromatography
- big data
- high resolution
- gas chromatography
- multiple sclerosis
- ms ms
- high performance liquid chromatography
- electronic health record
- capillary electrophoresis
- high resolution mass spectrometry
- tandem mass spectrometry
- artificial intelligence
- machine learning
- working memory
- air pollution
- autism spectrum disorder
- particulate matter
- photodynamic therapy
- solid phase extraction
- pluripotent stem cells