Login / Signup

FFMDFPA: A FAIRification Framework for Materials Data with No-Code Flexible Semi-Structured Parser and Application Programming Interfaces.

Bing HeZhuming GongMaxim AvdeevSiqi Shi
Published in: Journal of chemical information and modeling (2023)
The FAIR Data Principles are guidelines to ensure Findability, Accessibility, Interoperability, and Reusability of digital resources, which are essential to accelerate data-driven materials science. Despite the development and growing adoption of the FAIR principles, appropriate implementation solutions and software to make data FAIR are still sparse, particularly in standardization of heterogeneous data and subsequent data access. Here, we introduce a FAIRification Framework for Materials Data with No-Code Flexible Semi-Structured Parser and API (FFMDFPA) (API, application programming interface) for raw data processing. Using a template-based parser, FFMDFPA can extract and transform semistructured data in various text formats, providing the flexibility to extend data manipulation without coding. Additionally, FFMDFPA provides a standardized API with efficient query syntax that facilitates seamless data sharing. Taking various text files generated by computational software as examples, we demonstrate the potential utility of FFMDFPA. This work offers important insights toward efficient utilization and reuse of materials data, and the data semantic manipulation implemented in the parser and API can be extended to textual data, which has implications for future data FAIRification.
Keyphrases
  • electronic health record
  • big data
  • machine learning
  • oxidative stress
  • public health
  • risk assessment
  • artificial intelligence
  • mass spectrometry
  • high resolution
  • health information
  • deep learning
  • smoking cessation