Login / Signup

MatKG: An autonomously generated knowledge graph in Material Science.

Vineeth VenugopalElsa A Olivetti
Published in: Scientific data (2024)
In this paper, we present MatKG, a knowledge graph in materials science that offers a repository of entities and relationships extracted from scientific literature. Using advanced natural language processing techniques, MatKG includes an array of entities, including materials, properties, applications, characterization and synthesis methods, descriptors, and symmetry phase labels. The graph is formulated based on statistical metrics, encompassing over 70,000 entities and 5.4 million unique triples. To enhance accessibility and utility, we have serialized MatKG in both CSV and RDF formats and made these, along with the code base, available to the research community. As the largest knowledge graph in materials science to date, MatKG provides structured organization of domain-specific data. Its deployment holds promise for various applications, including material discovery, recommendation systems, and advanced analytics.
Keyphrases
  • healthcare
  • convolutional neural network
  • public health
  • big data
  • neural network
  • high throughput
  • small molecule
  • autism spectrum disorder
  • electronic health record
  • deep learning
  • machine learning
  • mass spectrometry