Using pose estimation to identify regions and points on natural history specimens.
Yichen HeChristopher R CooneySteve MaddockGavin H ThomasPublished in: PLoS computational biology (2023)
A key challenge in mobilising growing numbers of digitised biological specimens for scientific research is finding high-throughput methods to extract phenotypic measurements on these datasets. In this paper, we test a pose estimation approach based on Deep Learning capable of accurately placing point labels to identify key locations on specimen images. We then apply the approach to two distinct challenges that each requires identification of key features in a 2D image: (i) identifying body region-specific plumage colouration on avian specimens and (ii) measuring morphometric shape variation in Littorina snail shells. For the avian dataset, 95% of images are correctly labelled and colour measurements derived from these predicted points are highly correlated with human-based measurements. For the Littorina dataset, more than 95% of landmarks were accurately placed relative to expert-labelled landmarks and predicted landmarks reliably captured shape variation between two distinct shell ecotypes ('crab' vs 'wave'). Overall, our study shows that pose estimation based on Deep Learning can generate high-quality and high-throughput point-based measurements for digitised image-based biodiversity datasets and could mark a step change in the mobilisation of such data. We also provide general guidelines for using pose estimation methods on large-scale biological datasets.
Keyphrases
- deep learning
- high throughput
- convolutional neural network
- artificial intelligence
- machine learning
- rna seq
- single cell
- endothelial cells
- oxidative stress
- epithelial mesenchymal transition
- big data
- clinical practice
- induced pluripotent stem cells
- signaling pathway
- optical coherence tomography
- pluripotent stem cells