Integration of population-level data sources into an individual-level clinical prediction model for dengue virus test positivity.
Robert J WilliamsBenjamin BrintzGabriel Ribeiro Dos SantosAngkana T HuangDarunee BuddhariSurachai KaewhiranSopon IamsirithawornAlan L RothmanStephen J ThomasAaron FarmerStefan FernandezDerek A T CummingsKathryn B AndersonHenrik SaljeDaniel T LeungPublished in: Science advances (2024)
The differentiation of dengue virus (DENV) infection, a major cause of acute febrile illness in tropical regions, from other etiologies, may help prioritize laboratory testing and limit the inappropriate use of antibiotics. While traditional clinical prediction models focus on individual patient-level parameters, we hypothesize that for infectious diseases, population-level data sources may improve predictive ability. To create a clinical prediction model that integrates patient-extrinsic data for identifying DENV among febrile patients presenting to a hospital in Thailand, we fit random forest classifiers combining clinical data with climate and population-level epidemiologic data. In cross-validation, compared to a parsimonious model with the top clinical predictors, a model with the addition of climate data, reconstructed susceptibility estimates, force of infection estimates, and a recent case clustering metric significantly improved model performance.
Keyphrases
- dengue virus
- electronic health record
- climate change
- zika virus
- big data
- chronic kidney disease
- end stage renal disease
- case report
- drinking water
- ejection fraction
- intensive care unit
- emergency department
- newly diagnosed
- rna seq
- artificial intelligence
- aedes aegypti
- prognostic factors
- single cell
- acute respiratory distress syndrome
- respiratory failure
- aortic dissection
- drug induced
- chemotherapy induced
- water quality