Login / Signup

Simulating ComBat: how batch correction can lead to the systematic introduction of false positive results in DNA methylation microarray studies.

Tristan ZindlerStefan BleichAlexandra NeyaziStefan BleichEva Friedel
Published in: BMC bioinformatics (2020)
Using the approach described, we demonstrate, that using ComBat for batch correction in DNAm data can lead to false positive results under certain conditions and sample distributions. Our results are thus contrary to previous publications, considering a balanced sample distribution as unproblematic when using ComBat. We do not claim completeness in terms of reporting all technical conditions and possible solutions of the occurring problems as we approach the problem from a clinician's perspective and not from that of a computer scientist. With our approach of simulating data, we provide readers with a simple method to assess the probability of false positive findings in DNAm microarray data analysis pipelines.
Keyphrases
  • data analysis
  • dna methylation
  • electronic health record
  • mental health
  • big data
  • gene expression
  • nk cells
  • genome wide
  • deep learning
  • emergency department
  • anaerobic digestion
  • bioinformatics analysis
  • copy number