Gauze Detection and Segmentation in Minimally Invasive Surgery Video Using Convolutional Neural Networks.
Guillermo Sánchez-BrizuelaFrancisco-Javier Santos-CriadoDaniel Sanz-GobernadoEusebio de la Fuente-LópezJuan Carlos Fraile-MarineroJavier P TurielAna CisnalPublished in: Sensors (Basel, Switzerland) (2022)
Medical instruments detection in laparoscopic video has been carried out to increase the autonomy of surgical robots, evaluate skills or index recordings. However, it has not been extended to surgical gauzes. Gauzes can provide valuable information to numerous tasks in the operating room, but the lack of an annotated dataset has hampered its research. In this article, we present a segmentation dataset with 4003 hand-labelled frames from laparoscopic video. To prove the dataset potential, we analyzed several baselines: detection using YOLOv3, coarse segmentation, and segmentation with a U-Net. Our results show that YOLOv3 can be executed in real time but provides a modest recall. Coarse segmentation presents satisfactory results but lacks inference speed. Finally, the U-Net baseline achieves a good speed-quality compromise running above 30 FPS while obtaining an IoU of 0.85. The accuracy reached by U-Net and its execution speed demonstrate that precise and real-time gauze segmentation can be achieved, training convolutional neural networks on the proposed dataset.