Novel metrics reveal new structure and unappreciated heterogeneity in Caenorhabditis elegans development.
Gunalan NatesanTimothy HamiltonEric J DeedsPavak K ShahPublished in: PLoS computational biology (2023)
High throughput experimental approaches are increasingly allowing for the quantitative description of cellular and organismal phenotypes. Distilling these large volumes of complex data into meaningful measures that can drive biological insight remains a central challenge. In the quantitative study of development, for instance, one can resolve phenotypic measures for single cells onto their lineage history, enabling joint consideration of heritable signals and cell fate decisions. Most attempts to analyze this type of data, however, discard much of the information content contained within lineage trees. In this work we introduce a generalized metric, which we term the branch edit distance, that allows us to compare any two embryos based on phenotypic measurements in individual cells. This approach aligns those phenotypic measurements to the underlying lineage tree, providing a flexible and intuitive framework for quantitative comparisons between, for instance, Wild-Type (WT) and mutant developmental programs. We apply this novel metric to data on cell-cycle timing from over 1300 WT and RNAi-treated Caenorhabditis elegans embryos. Our new metric revealed surprising heterogeneity within this data set, including subtle batch effects in WT embryos and dramatic variability in RNAi-induced developmental phenotypes, all of which had been missed in previous analyses. Further investigation of these results suggests a novel, quantitative link between pathways that govern cell fate decisions and pathways that pattern cell cycle timing in the early embryo. Our work demonstrates that the branch edit distance we propose, and similar metrics like it, have the potential to revolutionize our quantitative understanding of organismal phenotype.
Keyphrases
- cell cycle
- cell fate
- single cell
- electronic health record
- high resolution
- cell proliferation
- high throughput
- induced apoptosis
- wild type
- big data
- cell cycle arrest
- cell death
- oxidative stress
- preterm infants
- mass spectrometry
- genome wide
- deep learning
- pregnant women
- endoplasmic reticulum stress
- drug induced
- gestational age
- health information
- human health
- pregnancy outcomes