Bioinformatics and machine learning to support nanomaterial grouping.

Aileen BahlSabina HalappanavarWendel WohllebenPenny NymarkPekka KohonenHåkan Wallin Ulla Birgitte Vogel Andrea Haase

Published in: Nanotoxicology (2024)

Nanomaterials (NMs) offer plenty of novel functionalities. Moreover, their physicochemical properties can be fine-tuned to meet the needs of specific applications, leading to virtually unlimited numbers of NM variants. Hence, efficient hazard and risk assessment strategies building on New Approach Methodologies (NAMs) become indispensable. Indeed, the design, the development and implementation of NAMs has been a major topic in a substantial number of research projects. One of the promising strategies that can help to deal with the high number of NMs variants is grouping and read-across. Based on demonstrated structural and physicochemical similarity, NMs can be grouped and assessed together. Within an established NM group, read-across may be performed to fill in data gaps for data-poor variants using existing data for NMs within the group. Establishing a group requires a sound justification, usually based on a grouping hypothesis that links specific physicochemical properties to well-defined hazard endpoints. However, for NMs these interrelationships are only beginning to be understood. The aim of this review is to demonstrate the power of bioinformatics with a specific focus on Machine Learning (ML) approaches to unravel the NM Modes-of-Action (MoA) and identify the properties that are relevant to specific hazards, in support of grouping strategies. This review emphasizes the following messages: 1) ML supports identification of the most relevant properties contributing to specific hazards; 2) ML supports analysis of large omics datasets and identification of MoA patterns in support of hypothesis formulation in grouping approaches; 3) omics approaches are useful for shifting away from consideration of single endpoints towards a more mechanistic understanding across multiple endpoints gained from one experiment; and 4) approaches from other fields of Artificial Intelligence (AI) like Natural Language Processing or image analysis may support automated extraction and interlinkage of information related to NM toxicity. Here, existing ML models for predicting NM toxicity and for analyzing omics data in support of NM grouping are reviewed. Various challenges related to building robust models in the field of nanotoxicology exist and are also discussed.

Keyphrases