The Truth behind the Zeros: A New Approach to Principal Component Analysis of the Neuropsychiatric Inventory.
Kristoffer H HelltonJeffrey L CummingsAudun Osland Vik-MoJan Erik NordrehaugDag AarslandGeir SelbaekLasse Melvaer GiilPublished in: Multivariate behavioral research (2020)
Psychiatric syndromes in dementia are often derived from the Neuropsychiatric Inventory (NPI) using principal component analysis (PCA). The validity of this statistical approach can be questioned, since the excessive proportion of zeros and skewness of NPI items may distort the estimated relations between the items. We propose a novel version of PCA, ZIBP-PCA, where a zero-inflated bivariate Poisson (ZIBP) distribution models the pairwise covariance between the NPI items. We compared the performance of the method to classical PCA under zero-inflation using simulations, and in two dementia-cohorts (N = 830, N = 1349). Simulations showed that component loadings from PCA were biased due to zero-inflation, while the loadings of ZIBP-PCA remained unaffected. ZIBP-PCA obtained a simpler component structure of "psychosis," "mood" and "agitation" in both dementia-cohorts, compared to PCA. The principal components from ZIBP-PCA had component loadings as follows: First, the component interpreted as "psychosis" was loaded by the items delusions and hallucinations. Second, the "mood" component was loaded by depression and anxiety. Finally, the "agitation" component was loaded by irritability and aggression. In conclusion, PCA is not equipped to handle zero-inflation. Using the NPI, PCA fails to identify components with a valid interpretation, while ZIBP-PCA estimates simple and interpretable components to characterize the psychopathology of dementia.