Three million images and morphological profiles of cells treated with matched chemical and genetic perturbations.
Srinivas Niranj ChandrasekaranBeth A CiminiAmy GoodaleLisa MillerMaria Kost-AlimovaNasim JamaliJohn G DoenchBriana FritchmanAdam SkepnerMichelle MelansonAlexandr A KalininJohn ArevaloMarzieh HaghighiJuan C CaicedoDaniel KuhnDesiree HernandezJames BerstlerHamdah Shafqat-AbbasiDavid E RootSusanne E SwalleySakshi GargShantanu SinghAnne E CarpenterPublished in: Nature methods (2024)
The identification of genetic and chemical perturbations with similar impacts on cell morphology can elucidate compounds' mechanisms of action or novel regulators of genetic pathways. Research on methods for identifying such similarities has lagged due to a lack of carefully designed and well-annotated image sets of cells treated with chemical and genetic perturbations. Here we create such a Resource dataset, CPJUMP1, in which each perturbed gene's product is a known target of at least two chemical compounds in the dataset. We systematically explore the directionality of correlations among perturbations that target the same protein encoded by a given gene, and we find that identifying matches between chemical and genetic perturbations is a challenging task. Our dataset and baseline analyses provide a benchmark for evaluating methods that measure perturbation similarities and impact, and more generally, learn effective representations of cellular state from microscopy images. Such advancements would accelerate the applications of image-based profiling of cellular states, such as uncovering drug mode of action or probing functional genomics.
Keyphrases
- genome wide
- copy number
- deep learning
- induced apoptosis
- single cell
- optical coherence tomography
- dna methylation
- cell cycle arrest
- single molecule
- gene expression
- transcription factor
- high resolution
- working memory
- newly diagnosed
- bone marrow
- molecular dynamics simulations
- signaling pathway
- high speed
- protein protein