Login / Signup

Small sample sizes: A big data problem in high-dimensional data analysis.

Frank KonietschkeKarima SchwabMarkus Pauly
Published in: Statistical methods in medical research (2020)
In many experiments and especially in translational and preclinical research, sample sizes are (very) small. In addition, data designs are often high dimensional, i.e. more dependent than independent replications of the trial are observed. The present paper discusses the applicability of max t-test-type statistics (multiple contrast tests) in high-dimensional designs (repeated measures or multivariate) with small sample sizes. A randomization-based approach is developed to approximate the distribution of the maximum statistic. Extensive simulation studies confirm that the new method is particularly suitable for analyzing data sets with small sample sizes. A real data set illustrates the application of the methods.
Keyphrases
  • big data
  • data analysis
  • artificial intelligence
  • machine learning
  • electronic health record
  • magnetic resonance
  • randomized controlled trial
  • stem cells
  • study protocol
  • bone marrow
  • phase iii