Login / Signup

Signposts for Aquatic Toxicity Evaluation in China: Text Mining using Event-Driven Taxonomy within and among Regions.

Fei ChengHuizhen LiBryan W BrooksJing You
Published in: Environmental science & technology (2021)
Selection of toxicity endpoints affects outcomes of risk assessment. Scientific decisions based on more holistic evidence is preferable for designing bioassay batteries rather than subjective selections, particularly when systems are poorly understood. Here, we propose a novel event-driven taxonomy (EDT)-based text mining tool to prioritize stressors likely to elicit water quality deterioration. The tool integrated automated literature collection, natural language processing using adverse outcome pathway-based toxicological terminologies and machine learning to classify event drivers (EDs). From aquatic toxicity assessments within China over the past decade, we gathered over 14 000 sources of information. With a dictionary that included 1039 toxicological terms, 15 bioassay-related modes of actions were mapped, yet less than half of the bioassays could be elucidated by available adverse outcome pathways. To fill these mechanistic knowledge gaps, we developed a Naïve Bayesian ED-classifier to annotate apical responses. The classifier's 4-fold cross-validation reached 74% accuracy and labeled 85% bioassays as 26 EDs. Narcosis, estrogen receptor-, and aryl hydrogen receptor-mediators were the major EDs in aquatic systems across China, whereas individual regions had distinct ED fingerprints. The EDT-based tool provides a promising diagnostic strategy to inform region-specific bioassay design and selection for water quality assessments in a big data era.
Keyphrases