Predicting Mycobacterium tuberculosis complex clades using knowledge-based Bayesian networks.
Minoo AminianDavid CouvinAmina ShabbeerKane HadleyScott VandenbergNalin RastogiKristin P BennettPublished in: BioMed research international (2014)
We develop a novel approach for incorporating expert rules into Bayesian networks for classification of Mycobacterium tuberculosis complex (MTBC) clades. The proposed knowledge-based Bayesian network (KBBN) treats sets of expert rules as prior distributions on the classes. Unlike prior knowledge-based support vector machine approaches which require rules expressed as polyhedral sets, KBBN directly incorporates the rules without any modification. KBBN uses data to refine rule-based classifiers when the rule set is incomplete or ambiguous. We develop a predictive KBBN model for 69 MTBC clades found in the SITVIT international collection. We validate the approach using two testbeds that model knowledge of the MTBC obtained from two different experts and large DNA fingerprint databases to predict MTBC genetic clades and sublineages. These models represent strains of MTBC using high-throughput biomarkers called spacer oligonucleotide types (spoligotypes), since these are routinely gathered from MTBC isolates of tuberculosis (TB) patients. Results show that incorporating rules into problems can drastically increase classification accuracy if data alone are insufficient. The SITVIT KBBN is publicly available for use on the World Wide Web.
Keyphrases
- mycobacterium tuberculosis
- healthcare
- deep learning
- high throughput
- pulmonary tuberculosis
- machine learning
- big data
- end stage renal disease
- electronic health record
- ejection fraction
- chronic kidney disease
- mental health
- clinical practice
- emergency department
- single molecule
- single cell
- circulating tumor
- artificial intelligence
- patient reported outcomes
- dna methylation
- data analysis
- circulating tumor cells