Login / Signup

Inventory statistics meet big data: complications for estimating numbers of species.

Ali KhalighifarLaura JiménezClaudia Nuñez-PenichetBenedictus FreemanKate IngenloffDaniel Jiménez-GarcíaAndrew Townsend Peterson
Published in: PeerJ (2020)
We point out complications inherent in biodiversity inventory metrics when applied to large-scale datasets. The number of units of inventory effort (e.g., days of inventory effort) in which a species is detected saturates, such that crucial numbers of detections of rare species approach zero. Any rare errors can then come to dominate species richness estimates, creating upward biases in estimates of species numbers. We document the problem via simulations of sampling from virtual biotas, illustrate its potential using a large empirical dataset (bird records from Cape May, NJ, USA), and outline the circumstances under which these problems may be expected to emerge.
Keyphrases
  • big data
  • machine learning
  • risk factors
  • genetic diversity
  • artificial intelligence
  • psychometric properties
  • south africa
  • emergency department
  • molecular dynamics
  • patient safety
  • quality improvement