Accurate deep learning model using semi-supervised learning and Noisy Student for cervical cancer screening in low magnification images.
Yuki KuritaShiori MeguroNaoko TsuyamaIsao KosugiYasunori EnomotoHideya KawasakiTakashi UemuraMichio KimuraToshihide IwashitaPublished in: PloS one (2023)
Deep learning technology has been used in the medical field to produce devices for clinical practice. Deep learning methods in cytology offer the potential to enhance cancer screening while also providing quantitative, objective, and highly reproducible testing. However, constructing high-accuracy deep learning models necessitates a significant amount of manually labeled data, which takes time. To address this issue, we used the Noisy Student Training technique to create a binary classification deep learning model for cervical cytology screening, which reduces the quantity of labeled data necessary. We used 140 whole-slide images from liquid-based cytology specimens, 50 of which were low-grade squamous intraepithelial lesions, 50 were high-grade squamous intraepithelial lesions, and 40 were negative samples. We extracted 56,996 images from the slides and then used them to train and test the model. We trained the EfficientNet using 2,600 manually labeled images to generate additional pseudo labels for the unlabeled data and then self-trained it within a student-teacher framework. Based on the presence or absence of abnormal cells, the created model was used to classify the images as normal or abnormal. The Grad-CAM approach was used to visualize the image components that contributed to the classification. The model achieved an area under the curve of 0.908, accuracy of 0.873, and F1-score of 0.833 with our test data. We also explored the optimal confidence threshold score and optimal augmentation approaches for low-magnification images. Our model efficiently classified normal and abnormal images at low magnification with high reliability, making it a promising screening tool for cervical cytology.
Keyphrases
- deep learning
- high grade
- low grade
- convolutional neural network
- artificial intelligence
- machine learning
- big data
- electronic health record
- fine needle aspiration
- clinical practice
- computed tomography
- cell proliferation
- high resolution
- mass spectrometry
- young adults
- ultrasound guided
- optical coherence tomography
- squamous cell carcinoma
- high intensity
- risk assessment
- signaling pathway
- soft tissue
- lymph node metastasis
- papillary thyroid