Login / Signup

Characterizing multimorbidity in ALIVE: comparing single and ensemble clustering methods.

Jacqueline E RudolphBryan LauBecky L GenbergJing SunGregory D KirkShruti H Mehta
Published in: American journal of epidemiology (2024)
Multimorbidity, defined as having 2 or more chronic conditions, is a growing public health concern, but research in this area is complicated by the fact that multimorbidity is a highly heterogenous outcome. Individuals in a sample may have a differing number and varied combinations of conditions. Clustering methods, such as unsupervised machine learning algorithms, may allow us to tease out the unique multimorbidity phenotypes. However, many clustering methods exist, and choosing which to use is challenging because we do not know the true underlying clusters. Here, we demonstrate the use of 3 individual algorithms (partition around medoids, hierarchical clustering, and probabilistic clustering) and a clustering ensemble approach (which pools different clustering approaches) to identify multimorbidity clusters in the AIDS Linked to the Intravenous Experience cohort study. We show how the clusters can be compared based on cluster quality, interpretability, and predictive ability. In practice, it is critical to compare the clustering results from multiple algorithms and to choose the approach that performs best in the domain(s) that aligns with plans to use the clusters in future analyses.
Keyphrases
  • machine learning
  • single cell
  • rna seq
  • public health
  • deep learning
  • healthcare
  • artificial intelligence
  • primary care
  • convolutional neural network