A Bayesian hierarchical hidden Markov model for clustering and gene selection: Application to kidney cancer gene expression data.
Thierry ChekouoHimadri MukherjeePublished in: Biometrical journal. Biometrische Zeitschrift (2024)
We introduce a Bayesian approach for biclustering that accounts for the prior functional dependence between genes using hidden Markov models (HMMs). We utilize biological knowledge gathered from gene ontologies and the hidden Markov structure to capture the potential coexpression of neighboring genes. Our interpretable model-based clustering characterized each cluster of samples by three groups of features: overexpressed, underexpressed, and irrelevant features. The proposed methods have been implemented in an R package and are used to analyze both the simulated data and The Cancer Genome Atlas kidney cancer data.