上海交通大学学报(医学版) ›› 2022, Vol. 42 ›› Issue (5): 653-659.doi: 10.3969/j.issn.1674-8115.2022.05.014

• 论著 · 技术与方法 • 上一篇    下一篇

基于深度卷积神经网络的Barrett食管内镜图片分类模型的建立

林嘉希1,2(), 汪盛嘉3, 赵鑫3, 高欣1,2, 殷民月1,2, 朱锦舟1,2()   

  1. 1.苏州大学附属第一医院消化内科,苏州 215006
    2.江苏省苏州市消化病临床医学中心,苏州 215006
    3.苏州大学附属第一医院普外科,苏州 215006
  • 收稿日期:2022-01-07 接受日期:2022-03-25 出版日期:2022-05-28 发布日期:2022-05-07
  • 通讯作者: 朱锦舟 E-mail:jxlin@stu.suda.edu.cn;jzzhu@zju.edu.cn
  • 作者简介:林嘉希(1999—),男,本科生;电子信箱:jxlin@stu.suda.edu.cn
  • 基金资助:
    国家自然科学基金(82000540);苏州市“科教兴卫”青年科技项目(KJXW2019001)

Development of endoscopic image classification models of Barrett's esophagus based on deep convolutional neural networks

LIN Jiaxi1,2(), WANG Shengjia3, ZHAO Xin3, GAO Xin1,2, YIN Minyue1,2, ZHU Jinzhou1,2()   

  1. 1.Department of Gastroenterology, The First Affiliated Hospital of Soochow University, Suzhou 215006, China
    2.Suzhou Clinical Center of Digestive Disease, Jiangsu Province, Suzhou 215006, China
    3.Department of General Surgery, The First Affiliated Hospital of Soochow University, Suzhou 215006, China
  • Received:2022-01-07 Accepted:2022-03-25 Online:2022-05-28 Published:2022-05-07
  • Contact: ZHU Jinzhou E-mail:jxlin@stu.suda.edu.cn;jzzhu@zju.edu.cn
  • Supported by:
    National Natural Science Foundation of China(82000540);Youth Program of Suzhou Health Committee(KJXW2019001)

摘要:

目的·利用深度卷积神经网络算法,构建Barrett食管内镜图片分类模型并评估其分类能力。方法·收集苏州大学附属第一医院消化内镜中心及HyperKvasir数据库中的内镜下食管图片共806张,其中正常食管图片412张、Barrett食管图片394张。随机将所有图片分为训练集(85%)与验证集(15%)。利用于ImageNet数据库进行预训练的4种深度卷积神经网络[Xception、NASNet Large(NASNetL)、ResNet50V2(ResNet)及BigTransfer(BiT)],分别在训练集中进行迁移学习,建立Barrett食管内镜图片分类模型,并使用梯度加权分类激活映射对该4个模型的分类结果进行可视化解释。随后,于验证集中评价模型的分类能力。同时,收集高、低年资医师对验证集数据的分类结果,将其与模型的分类结果进行对比,进一步评估模型的分类能力。结果·成功构建了基于深度卷积神经网络的Barrett食管内镜图片的4个分类模型。利用梯度加权分类激活映射,以热力图形式实现了对模型分类结果的可视化解释。在验证集数据中,各模型均拥有较高的分类准确性与精确性,其平均分类准确性为0.852,平均分类精确性为0.846。NASNetL模型相较其余3种模型,拥有最高分类准确性(0.873)及最高分类精确性(0.867),是表现最优的模型。该模型对Barrett食管内镜图片的分类能力近似高年资医师,其分类准确性略低于高年资医师(0.881)而高于低年资医师(0.855);同时,该模型与高年资医师(Kappa=0.712,P=0.000)、低年资医师(Kappa=0.695,P=0.000)均具有较好的分类一致性。结论·利用深度卷积神经网络迁移学习构建的Barrett食管内镜图片分类模型具有较好的分类能力。

关键词: Barrett食管, 深度学习, 迁移学习, 消化内镜, 卷积神经网络

Abstract:

Objective·To develop classification models for endoscopic images of Barrett's esophagus using deep convolutional neural network, and evaluate its classification performance.

Methods·A total of 806 esophageal images were collected from Gastrointestinal Endoscopy Center of the First Affiliated Hospital of Soochow University and the HyperKvasir database, including 412 normal esophageal images and 394 Barrett's esophagus images. All the images were randomly divided into training set (85%) and validation set (15%). Four deep convolutional neural networks [Xception, NASNet Large (NASNetL), ResNet50V2 (ResNet) and BigTransfer (BiT)], which had been pre-trained in the ImageNet database, were used to perform transfer learning in the training set and develop endoscopic image classification models of Barrett's esophagus, respectively. The gradient-weighted class activation mapping (Grad-CAM) was used to visually interpret the classification results of the four models. In the validation set, the models' classification abilities were evaluated. Besides, to further evaluate the classification abilities of the models, the classification results of the validation set by senior and junior physicians were compared with the models'.

Results·The 4 endoscopic image classification models of Barrett's esophagus based on deep convolutional neural networks were successfully developed. The visual interpretation of the models' classification results was presented in the form of a heat map by using Grad-CAM. In the validation set, all models presented high classification accuracy (average accuracy = 0.852) and high classification precision (average precision = 0.846). Compared with the other three models, the NASNetL model had the highest classification accuracy (0.873) and the highest classification precision (0.867), and was the best model. The NASNetL model showed similar classification ability to the senior physician. Its classification accuracy was slightly lower than that of senior physician (0.881), and higher than that of junior physician (0.855). Meanwhile, it had good classification consistency with both senior physician (Kappa=0.712, P=0.000) and junior physician (Kappa=0.695, P=0.000).

Conclusion·The endoscopic image classification models of Barrett's esophagus, developed by deep convolutional neural networks transfer learning, show good classification ability.

Key words: Barrett's esophagus, deep learning, transfer learning, gastrointestinal endoscopy, convolutional neural network

中图分类号: