Login / Signup

An application of compositional data analysis to multiomic time-series data.

Laura Sisk-HackworthScott T Kelley
Published in: NAR genomics and bioinformatics (2020)
Compositional data analysis (CoDA) methods have increased in popularity as a new framework for analyzing next-generation sequencing (NGS) data. CoDA methods, such as the centered log-ratio (clr) transformation, adjust for the compositional nature of NGS counts, which is not addressed by traditional normalization methods. CoDA has only been sparsely applied to NGS data generated from microbial communities or to multiple 'omics' datasets. In this study, we applied CoDA methods to analyze NGS and untargeted metabolomic datasets obtained from bacterial and fungal communities. Specifically, we used clr transformation to reanalyze NGS amplicon and metabolomics data from a study investigating the effects of building material type, moisture and time on microbial and metabolomic diversity. Compared to analysis of untransformed data, analysis of clr-transformed data revealed novel relationships and stronger associations between sample conditions and microbial and metabolic community profiles.
Keyphrases
  • data analysis
  • electronic health record
  • big data
  • healthcare
  • deep learning
  • high resolution
  • dna methylation
  • gas chromatography