Login / Signup

Iterative Regression of Corrective Baselines (IRCB): A New Model for Quantitative Spectroscopy.

Matthew GlaceRoudabeh S Moazeni-PourasilDaniel W CookThomas D Roper
Published in: Journal of chemical information and modeling (2024)
In this work, a new model with broad utility for quantitative spectroscopy development is reported. A primary objective of this work is to create a novel modeling procedure that may allow for higher automation of the model development process. The fundamental concept is simple yet powerful even for complex spectra and is employed with no additional preprocessing. This approach is applicable for several types of spectroscopic data to develop regression models that have similar or greater quality than the current methods. The key modeling steps are a matrix transformation and subsequent feature selection process that are collectively referred to as iterative regression of corrective baselines (IRCB). The transformed matrix ( X transform ) is a linearized form of the original X data set. Features from X t ransform that are predictive of Y can be ranked and selected by ordinary least-squares regression. The best features (rows of X t ransform ) are linear depictions of Y that can be utilized to develop regression models with several machine learning models. The IRCB workflow is first detailed by using a case study of Fourier transform infrared (FTIR) spectroscopy for prepared solutions of a three-component mixture. Next, IRCB is applied and compared to benchmark results for the 2006 "Chimiométrie" near-infrared spectroscopy (NIR) soil composition challenge and Raman measurements of a simulated nuclear waste slurry.
Keyphrases