Login / Signup

DLMM as a lossless one-shot algorithm for collaborative multi-site distributed linear mixed models.

Chongliang LuoMd Nazmul IslamNatalie E SheilsJohn BureshJenna Marie RepsMartijn J SchuemiePatrick B RyanMackenzie EdmondsonRui DuanJiayi TongArielle Marks-AnglinJiang BianZhaoyi ChenTalita Duarte-SallesSergio Fernández-BertolínThomas FalconerChungsoo KimRae Woong ParkStephen R PfohlNigam Haresh ShahAndrew E WilliamsHua XuYujia ZhouEbbing LautenbachJalpa A DoshiRachel M WernerDavid A AschYong Chen
Published in: Nature communications (2022)
Linear mixed models are commonly used in healthcare-based association analyses for analyzing multi-site data with heterogeneous site-specific random effects. Due to regulations for protecting patients' privacy, sensitive individual patient data (IPD) typically cannot be shared across sites. We propose an algorithm for fitting distributed linear mixed models (DLMMs) without sharing IPD across sites. This algorithm achieves results identical to those achieved using pooled IPD from multiple sites (i.e., the same effect size and standard error estimates), hence demonstrating the lossless property. The algorithm requires each site to contribute minimal aggregated data in only one round of communication. We demonstrate the lossless property of the proposed DLMM algorithm by investigating the associations between demographic and clinical characteristics and length of hospital stay in COVID-19 patients using administrative claims from the UnitedHealth Group Clinical Discovery Database. We extend this association study by incorporating 120,609 COVID-19 patients from 11 collaborative data sources worldwide.
Keyphrases