Addressing persistent challenges in digital image analysis of cancerous tissues.
Sandhya PrabhakaranClarence YappGregory J BakerJohanna BeyerYoung Hwan ChangAllison L CreasonRobert KruegerJeremy MuhlichNathan Heath PattersonKevin SidakDamir SudarAdam J TaylorLuke TernesJakob TroidlYubin XieArtem SokolovDarren R Tysonnull nullPublished in: bioRxiv : the preprint server for biology (2023)
The National Cancer Institute (NCI) supports many research programs and consortia, many of which use imaging as a major modality for characterizing cancerous tissue. A trans-consortia Image Analysis Working Group (IAWG) was established in 2019 with a mission to disseminate imaging-related work and foster collaborations. In 2022, the IAWG held a virtual hackathon focused on addressing challenges of analyzing high dimensional datasets from fixed cancerous tissues. Standard image processing techniques have automated feature extraction, but the next generation of imaging data requires more advanced methods to fully utilize the available information. In this perspective, we discuss current limitations of the automated analysis of multiplexed tissue images, the first steps toward deeper understanding of these limitations, what possible solutions have been developed, any new or refined approaches that were developed during the Image Analysis Hackathon 2022, and where further effort is required. The outstanding problems addressed in the hackathon fell into three main themes: 1) challenges to cell type classification and assessment, 2) translation and visual representation of spatial aspects of high dimensional data, and 3) scaling digital image analyses to large (multi-TB) datasets. We describe the rationale for each specific challenge and the progress made toward addressing it during the hackathon. We also suggest areas that would benefit from more focus and offer insight into broader challenges that the community will need to address as new technologies are developed and integrated into the broad range of image-based modalities and analytical resources already in use within the cancer research community.
Keyphrases
- deep learning
- machine learning
- high resolution
- mental health
- artificial intelligence
- convolutional neural network
- healthcare
- big data
- gene expression
- electronic health record
- public health
- high throughput
- rna seq
- social media
- mycobacterium tuberculosis
- single cell
- optical coherence tomography
- liquid chromatography
- squamous cell