Assessing the impact of areal unit selection and the modifiable areal unit problem on associative statistics between cases of tick-borne disease and entomological indices.
Collin O'ConnorMelissa A PrusinskiJared AldstadtRichard C FalcoJoAnne OliverJamie HaightKeith ToberLee Ann SpornJennifer L WhiteDustin BrissonP Bryon BackensonPublished in: Journal of medical entomology (2024)
The modifiable areal unit problem (MAUP) is a cause of statistical and visual bias when aggregating data according to spatial units, particularly when spatial units may be changed arbitrarily. The MAUP is a concern in vector-borne disease research when entomological metrics gathered from point-level sampling data are related to epidemiological data aggregated to administrative units like counties or ZIP Codes. Here, we assess the statistical impact of the MAUP when calculating correlations between randomly aggregated cases of anaplasmosis in New York State during 2017 and a geostatistical layer of an entomological risk index for Anaplasma phagocytophilum in blacklegged ticks (Ixodes scapularis Say, Acari: Ixodidae) collected during the fall of 2017. Correlations were also calculated using various administrative boundaries for comparison. We also demonstrate the impact of the MAUP on data visualization using choropleth maps and offer pycnophylactic interpolation as an alternative. Polygon simulations indicate that increasing the number of polygons decreases correlation coefficients and their variability. Correlation coefficients calculated using ZIP Code tabulation area and Census tract polygons were beyond 4 standard deviations from the mean of the simulated correlation coefficients. These results indicate that using smaller polygons may not best incorporate the geographical context of the tick-borne disease system, despite the tendency of researchers to strive for more granular spatial data and associations.