Login / Signup

On the surprising effectiveness of a simple matrix exponential derivative approximation, with application to global SARS-CoV-2.

Gustavo DidierNathan E Glatt-HoltzAndrew J HolbrookAndrew F MageeMarc A Suchard
Published in: Proceedings of the National Academy of Sciences of the United States of America (2024)
The continuous-time Markov chain (CTMC) is the mathematical workhorse of evolutionary biology. Learning CTMC model parameters using modern, gradient-based methods requires the derivative of the matrix exponential evaluated at the CTMC's infinitesimal generator (rate) matrix. Motivated by the derivative's extreme computational complexity as a function of state space cardinality, recent work demonstrates the surprising effectiveness of a naive, first-order approximation for a host of problems in computational biology. In response to this empirical success, we obtain rigorous deterministic and probabilistic bounds for the error accrued by the naive approximation and establish a "blessing of dimensionality" result that is universal for a large class of rate matrices with random entries. Finally, we apply the first-order approximation within surrogate-trajectory Hamiltonian Monte Carlo for the analysis of the early spread of Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) across 44 geographic regions that comprise a state space of unprecedented dimensionality for unstructured (flexible) CTMC models within evolutionary biology.
Keyphrases
  • sars cov
  • respiratory syndrome coronavirus
  • monte carlo
  • randomized controlled trial
  • coronavirus disease
  • systematic review
  • genome wide
  • hiv infected
  • mental health
  • water soluble
  • gene expression
  • dna methylation