Human-interpretable image features derived from densely mapped cancer pathology slides predict diverse molecular phenotypes.
James A DiaoJason K WangWan Fung ChuiVictoria MountainSai Chowdary GullapallyRamprakash SrinivasanRichard N MitchellBenjamin GlassSara HoffmanSudha K RaoChirag MaheshwariAbhik LahiriAaditya PrakashRyan McLoughlinJennifer K KernerMurray B ResnickMichael C MontaltoAditya KhoslaIlan N WapinskiAndrew H BeckHunter L ElliottAmaro Taylor-WeinerPublished in: Nature communications (2021)
Computational methods have made substantial progress in improving the accuracy and throughput of pathology workflows for diagnostic, prognostic, and genomic prediction. Still, lack of interpretability remains a significant barrier to clinical integration. We present an approach for predicting clinically-relevant molecular phenotypes from whole-slide histopathology images using human-interpretable image features (HIFs). Our method leverages >1.6 million annotations from board-certified pathologists across >5700 samples to train deep learning models for cell and tissue classification that can exhaustively map whole-slide images at two and four micron-resolution. Cell- and tissue-type model outputs are combined into 607 HIFs that quantify specific and biologically-relevant characteristics across five cancer types. We demonstrate that these HIFs correlate with well-known markers of the tumor microenvironment and can predict diverse molecular signatures (AUROC 0.601-0.864), including expression of four immune checkpoint proteins and homologous recombination deficiency, with performance comparable to 'black-box' methods. Our HIF-based approach provides a comprehensive, quantitative, and interpretable window into the composition and spatial architecture of the tumor microenvironment.
Keyphrases
- deep learning
- endothelial cells
- convolutional neural network
- artificial intelligence
- papillary thyroid
- machine learning
- single molecule
- single cell
- dna repair
- cell therapy
- dna damage
- induced pluripotent stem cells
- squamous cell
- transcription factor
- pluripotent stem cells
- binding protein
- stem cells
- high resolution
- squamous cell carcinoma
- dna methylation
- mass spectrometry
- high speed
- genome wide
- lymph node metastasis
- bone marrow
- oxidative stress
- copy number