Login / Signup

Identification and epidemiological characterization of Type-2 diabetes sub-population using an unsupervised machine learning approach.

Saptarshi BejJit SarkarSaikat BiswasPabitra MitraPartha ChakrabartiOlaf Wolkenhauer
Published in: Nutrition & diabetes (2022)
From a methodological perspective, we show that for diverse data types, frequent in epidemiological datasets, feature-type-distributed clustering using UMAP is effective as opposed to the conventional use of the UMAP algorithm. The application of UMAP-based clustering workflow for this type of dataset is novel in itself. Our findings demonstrate the presence of heterogeneity among Indian T2DM patients with regard to socio-demography and dietary patterns. From our analysis, we conclude that the existence of significant non-obese T2DM sub-populations characterized by younger age groups and economic disadvantage raises the need for different screening criteria for T2DM among rural Indian residents.
Keyphrases