Login / Signup

Regression analysis and variable selection for two-stage multiple-infection group testing data.

Juexin LinDewei WangQi Zheng
Published in: Statistics in medicine (2019)
Group testing, as a cost-effective strategy, has been widely used to perform large-scale screening for rare infections. Recently, the use of multiplex assays has transformed the goal of group testing from detecting a single disease to diagnosing multiple infections simultaneously. Existing research on multiple-infection group testing data either exclude individual covariate information or ignore possible retests on suspicious individuals. To incorporate both, we propose a new regression model. This new model allows us to perform a regression analysis for each infection using multiple-infection group testing data. Furthermore, we introduce an efficient variable selection method to reveal truly relevant risk factors for each disease. Our methodology also allows for the estimation of the assay sensitivity and specificity when they are unknown. We examine the finite sample performance of our method through extensive simulation studies and apply it to a chlamydia and gonorrhea screening data set to illustrate its practical usefulness.
Keyphrases
  • electronic health record
  • big data
  • high throughput
  • data analysis
  • men who have sex with men
  • genome wide
  • single cell
  • dna methylation
  • deep learning
  • real time pcr