Login / Signup

Know your population and know your model: Using model-based regression and poststratification to generalize findings beyond the observed sample.

Lauren KennedyAndrew Gelman
Published in: Psychological methods (2021)
Psychology research often focuses on interactions, and this has deep implications for inference from nonrepresentative samples. For the goal of estimating average treatment effects, we propose to fit a model allowing treatment to interact with background variables and then average over the distribution of these variables in the population. This can be seen as an extension of multilevel regression and poststratification (MRP), a method used in political science and other areas of survey research, where researchers wish to generalize from a sparse and possibly nonrepresentative sample to the general population. In this article, we discuss areas where this method can be used in the psychological sciences. We use our method to estimate the norming distribution for the Big Five Personality Scale using open source data. We argue that large open data sources like this and other collaborative data sources can potentially be combined with MRP to help resolve current challenges of generalizability and replication in psychology. (PsycInfo Database Record (c) 2021 APA, all rights reserved).
Keyphrases
  • big data
  • electronic health record
  • drinking water
  • public health
  • emergency department
  • combination therapy
  • quality improvement
  • neural network