Supervised clustering based on DPClusO: prediction of plant-disease relations using Jamu formulas of KNApSAcK database.
Sony Hartono WijayaHusnawati HusnawatiFarit Mochamad AfendiIrmanida BatubaraLatifah K DarusmanMd Altaf-Ul-AminTetsuo SatoNaoaki OnoTadao SugiuraShigehiko KanayaPublished in: BioMed research international (2014)
Indonesia has the largest medicinal plant species in the world and these plants are used as Jamu medicines. Jamu medicines are popular traditional medicines from Indonesia and we need to systemize the formulation of Jamu and develop basic scientific principles of Jamu to meet the requirement of Indonesian Healthcare System. We propose a new approach to predict the relation between plant and disease using network analysis and supervised clustering. At the preliminary step, we assigned 3138 Jamu formulas to 116 diseases of International Classification of Diseases (ver. 10) which belong to 18 classes of disease from National Center for Biotechnology Information. The correlation measures between Jamu pairs were determined based on their ingredient similarity. Networks are constructed and analyzed by selecting highly correlated Jamu pairs. Clusters were then generated by using the network clustering algorithm DPClusO. By using matching score of a cluster, the dominant disease and high frequency plant associated to the cluster are determined. The plant to disease relations predicted by our method were evaluated in the context of previously published results and were found to produce around 90% successful predictions.