Teacher-student collaborated multiple instance learning for pan-cancer PDL1 expression prediction from histopathology slides.
Darui JinShangying LiangArtem ShmatkoAlexander ArnoldDavid HorstThomas G P GrünewaldMoritz GerstungXiangzhi BaiPublished in: Nature communications (2024)
Programmed cell death ligand 1 (PDL1), as an important biomarker, is quantified by immunohistochemistry (IHC) with few established histopathological patterns. Deep learning aids in histopathological assessment, yet heterogeneity and lacking spatially resolved annotations challenge precise analysis. Here, we present a weakly supervised learning approach using bulk RNA sequencing for PDL1 expression prediction from hematoxylin and eosin (H&E) slides. Our method extends the multiple instance learning paradigm with the teacher-student framework, which assigns dynamic pseudo-labels for intra-slide heterogeneity and retrieves unlabeled instances using temporal ensemble model distillation. The approach, evaluated on 12,299 slides across 20 solid tumor types, achieves a weighted average area under the curve of 0.83 on fresh-frozen and 0.74 on formalin-fixed specimens for 9 tumors with PDL1 as an established biomarker. Our method predicts PDL1 expression patterns, validated by IHC on 20 slides, offering insights into histologies relevant to PDL1. This demonstrates the potential of deep learning in identifying diverse histological patterns for molecular changes from H&E images.
Keyphrases
- deep learning
- poor prognosis
- single cell
- convolutional neural network
- machine learning
- artificial intelligence
- binding protein
- magnetic resonance
- long non coding rna
- squamous cell carcinoma
- magnetic resonance imaging
- computed tomography
- medical students
- young adults
- contrast enhanced
- antiretroviral therapy
- network analysis
- human health