Distributional regression modeling via generalized additive models for location, scale, and shape: An overview through a data set from learning analytics.

Fernando Marmolejo-RamosMauricio TejoMarek BrabecJakub KuzilekSrecko JoksimovicVitomir KovanovicJorge GonzálezThomas KneibPeter BühlmannLucas KookGuillermo Briseño-SánchezRaydonal Ospina

Published in: Wiley interdisciplinary reviews. Data mining and knowledge discovery (2022)

The advent of technological developments is allowing to gather large amounts of data in several research fields. Learning analytics (LA)/educational data mining has access to big observational unstructured data captured from educational settings and relies mostly on unsupervised machine learning (ML) algorithms to make sense of such type of data. Generalized additive models for location, scale, and shape (GAMLSS) are a supervised statistical learning framework that allows modeling all the parameters of the distribution of the response variable with respect to the explanatory variables. This article overviews the power and flexibility of GAMLSS in relation to some ML techniques. Also, GAMLSS' capability to be tailored toward causality via causal regularization is briefly commented. This overview is illustrated via a data set from the field of LA. This article is categorized under:Application Areas > Education and LearningAlgorithmic Development > StatisticsTechnologies > Machine Learning.

Keyphrases