Login / Signup

Assessing Search and Unsupervised Clustering Algorithms in Nested Sampling.

Lune MaillardFabio FinocchiMartino Trassinelli
Published in: Entropy (Basel, Switzerland) (2023)
Nested sampling is an efficient method for calculating Bayesian evidence in data analysis and partition functions of potential energies. It is based on an exploration using a dynamical set of sampling points that evolves to higher values of the sampled function. When several maxima are present, this exploration can be a very difficult task. Different codes implement different strategies. Local maxima are generally treated separately, applying cluster recognition of the sampling points based on machine learning methods. We present here the development and implementation of different search and clustering methods on the nested_fit code. Slice sampling and the uniform search method are added in addition to the random walk already implemented. Three new cluster recognition methods are also developed. The efficiency of the different strategies, in terms of accuracy and number of likelihood calls, is compared considering a series of benchmark tests, including model comparison and a harmonic energy potential. Slice sampling proves to be the most stable and accurate search strategy. The different clustering methods present similar results but with very different computing time and scaling. Different choices of the stopping criterion of the algorithm, another critical issue of nested sampling, are also investigated with the harmonic energy potential.
Keyphrases
  • machine learning
  • data analysis
  • single cell
  • healthcare
  • deep learning
  • rna seq
  • primary care
  • magnetic resonance imaging
  • density functional theory
  • big data
  • magnetic resonance
  • human health
  • quality improvement