Login / Signup

Model-based clustering for random hypergraphs.

Tin Lok James NgThomas Brendan Murphy
Published in: Advances in data analysis and classification (2021)
A probabilistic model for random hypergraphs is introduced to represent unary, binary and higher order interactions among objects in real-world problems. This model is an extension of the latent class analysis model that introduces two clustering structures for hyperedges and captures variation in the size of hyperedges. An expectation maximization algorithm with minorization maximization steps is developed to perform parameter estimation. Model selection using Bayesian Information Criterion is proposed. The model is applied to simulated data and two real-world data sets where interesting results are obtained.
Keyphrases
  • machine learning
  • healthcare
  • electronic health record
  • big data
  • mass spectrometry