Login / Signup

High-dimensional gene expression and morphology profiles of cells across 28,000 genetic and chemical perturbations.

Marzieh HaghighiJuan C CaicedoBeth A CiminiAnne E CarpenterShantanu Singh
Published in: Nature methods (2022)
Cells can be perturbed by various chemical and genetic treatments and the impact on gene expression and morphology can be measured via transcriptomic profiling and image-based assays, respectively. The patterns observed in these high-dimensional profile data can power a dozen applications in drug discovery and basic biology research, but both types of profiles are rarely available for large-scale experiments. Here, we provide a collection of four datasets with both gene expression and morphological profile data useful for developing and testing multimodal methodologies. Roughly a thousand features are measured for each of the two data types, across more than 28,000 chemical and genetic perturbations. We define biological problems that use the shared and complementary information in these two data modalities, provide baseline analysis and evaluation metrics for multi-omic applications, and make the data resource publicly available ( https://broad.io/rosetta/ ).
Keyphrases
  • gene expression
  • electronic health record
  • big data
  • dna methylation
  • induced apoptosis
  • drug discovery
  • genome wide
  • cell cycle arrest
  • mental health
  • healthcare
  • cell death
  • artificial intelligence
  • chronic pain