Login / Signup

propr: An R-package for Identifying Proportionally Abundant Features Using Compositional Data Analysis.

Thomas P QuinnMark F RichardsonDavid LovellTamsyn M Crowley
Published in: Scientific reports (2017)
In the life sciences, many assays measure only the relative abundances of components in each sample. Such data, called compositional data, require special treatment to avoid misleading conclusions. Awareness of the need for caution in analyzing compositional data is growing, including the understanding that correlation is not appropriate for relative data. Recently, researchers have proposed proportionality as a valid alternative to correlation for calculating pairwise association in relative data. Although the question of how to best measure proportionality remains open, we present here a computationally efficient R package that implements three measures of proportionality. In an effort to advance the understanding and application of proportionality analysis, we review the mathematics behind proportionality, demonstrate its application to genomic data, and discuss some ongoing challenges in the analysis of relative abundance data.
Keyphrases
  • data analysis
  • electronic health record
  • gene expression
  • machine learning
  • artificial intelligence
  • microbial community
  • single cell