Login / Signup

Using threshold regression to analyze survival data from complex surveys: With application to mortality linked NHANES III Phase II genetic data.

Yan LiTao XiaoDandan LiaoMei-Ling Ting Lee
Published in: Statistics in medicine (2017)
The Cox proportional hazards (PH) model is a common statistical technique used for analyzing time-to-event data. The assumption of PH, however, is not always appropriate in real applications. In cases where the assumption is not tenable, threshold regression (TR) and other survival methods, which do not require the PH assumption, are available and widely used. These alternative methods generally assume that the study data constitute simple random samples. In particular, TR has not been studied in the setting of complex surveys that involve (1) differential selection probabilities of study subjects and (2) intracluster correlations induced by multistage cluster sampling. In this paper, we extend TR procedures to account for complex sampling designs. The pseudo-maximum likelihood estimation technique is applied to estimate the TR model parameters. Computationally efficient Taylor linearization variance estimators that consider both the intracluster correlation and the differential selection probabilities are developed. The proposed methods are evaluated by using simulation experiments with various complex designs and illustrated empirically by using mortality-linked Third National Health and Nutrition Examination Survey Phase II genetic data.
Keyphrases
  • phase ii
  • electronic health record
  • clinical trial
  • big data
  • open label
  • cardiovascular events
  • cross sectional
  • cardiovascular disease
  • type diabetes
  • gene expression
  • genome wide
  • data analysis
  • copy number