论文标题
kvasir-instrument:胃肠道内窥镜检查中的诊断和治疗工具分割数据集
Kvasir-Instrument: Diagnostic and therapeutic tool segmentation dataset in gastrointestinal endoscopy
论文作者
论文摘要
使用手术工具定期筛查,活检并切除胃肠道(GI)病理。通常,在结肠镜检查期间或之后,没有对该程序和治疗区域或切除区域进行特定跟踪或分析。有关疾病边界的信息,切除区域的发展以及数量和大小丢失。这可能会导致治疗后的随访和重新评估困难。 To improve the current standard and also to foster more research on the topic we have released the ``Kvasir-Instrument'' dataset which consists of $590$ annotated frames containing GI procedure tools such as snares, balloons and biopsy forceps, etc. Beside of the images, the dataset includes ground truth masks and bounding boxes and has been verified by two expert GI endoscopists.此外,我们还为促进研究和算法开发的GI工具的分割提供了基准。我们使用经典的U-NET体系结构获得了0.9158的骰子系数评分,Jaccard指数为0.8578。对于DoubleUnet,观察到类似的骰子系数得分。定性结果表明,该模型对具有镜面和具有多种仪器的框架的图像不起作用,而在所有其他类型的图像上都观察到了两种方法的最佳结果。定性和定量结果都表明该模型的性能相当出色,但是有很大的进一步改进潜力。使用数据集进行基准测试为研究人员提供了为GI内窥镜检查的自动内窥镜诊断和治疗工具分割的领域。
Gastrointestinal (GI) pathologies are periodically screened, biopsied, and resected using surgical tools. Usually the procedures and the treated or resected areas are not specifically tracked or analysed during or after colonoscopies. Information regarding disease borders, development and amount and size of the resected area get lost. This can lead to poor follow-up and bothersome reassessment difficulties post-treatment. To improve the current standard and also to foster more research on the topic we have released the ``Kvasir-Instrument'' dataset which consists of $590$ annotated frames containing GI procedure tools such as snares, balloons and biopsy forceps, etc. Beside of the images, the dataset includes ground truth masks and bounding boxes and has been verified by two expert GI endoscopists. Additionally, we provide a baseline for the segmentation of the GI tools to promote research and algorithm development. We obtained a dice coefficient score of 0.9158 and a Jaccard index of 0.8578 using a classical U-Net architecture. A similar dice coefficient score was observed for DoubleUNet. The qualitative results showed that the model did not work for the images with specularity and the frames with multiple instruments, while the best result for both methods was observed on all other types of images. Both, qualitative and quantitative results show that the model performs reasonably good, but there is a large potential for further improvements. Benchmarking using the dataset provides an opportunity for researchers to contribute to the field of automatic endoscopic diagnostic and therapeutic tool segmentation for GI endoscopy.