Clinical validation and application value exploration of multi-modal pulmonary nodule diagnosis model

doi: 10.3969/j.issn.1674-8115.2024.08.012

Journal of Shanghai Jiao Tong University (Medical Science), 2024, Vol. 44, Issue 8: 1030-1036.

• Original Article · Clinical Research •

XU Wanxing¹,², WANG Lin², GUO Qiaomei², WANG Xueqing², LOU Jiatao¹,²,³

1. Institute of Basic Medical Sciences, School of Medicine, Jiangsu University, Zhenjiang 212013, China
2. Clinical Laboratory Medicine Center, Shanghai General Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200080, China
3. College of Health Science and Technology, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China

Received: 2024-01-10 Accepted: 2024-04-30 Online: 2024-08-28 Published: 2024-08-27
Contact: LOU Jiatao E-mail: wanxing_Xu@163.com; loujiatao@126.com
About the author: XU Wanxing (1997-), female, master's student; e-mail: wanxing_Xu@163.com.
Supported by:
Innovation Group Project of Shanghai Municipal Health Commission (2019CXJQ03); Clinical Research Innovation Plan of Shanghai General Hospital (CTCCR-2021B06)

Abstract:

Objective · To validate the performance and explore the clinical application value of MPI-RF, a multi-modal pulmonary nodule diagnosis model that combines serum metabolic fingerprints, the protein biomarker carcinoembryonic antigen (CEA) and Image-AI via a random forest algorithm.

Methods · A total of 289 patients who presented to Shanghai Chest Hospital, Shanghai Jiao Tong University School of Medicine, with pulmonary nodules on low-dose helical computed tomography (LDCT) were enrolled. Based on postoperative pathological results, they were divided into a malignant nodule group (n=197) and a benign nodule group (n=92), and the basic information of the two groups was collected and compared. Preoperative serum CEA levels were measured by electrochemiluminescence, serum metabolic fingerprints were acquired by matrix-assisted laser desorption/ionization mass spectrometry (MALDI-MS), and image scores were calculated with the CT image artificial intelligence model Image-AI. The CEA data, serum metabolic fingerprint data and image scores were integrated and input into MPI-RF to calculate a malignancy probability score for each patient. Receiver operating characteristic (ROC) curves and the area under the curve (AUC) were used to evaluate model performance, and the DeLong test was used for comparisons. The analyses covered the diagnostic performance of MPI-RF in pulmonary nodules of different types (solid, pure ground-glass and part-solid) and sizes (diameter <8 mm and diameter ≥8 mm); a comparison of MPI-RF with the Mayo Clinic model, the Veterans Administration (VA) model and the Brock model; and a comparison of MPI-RF with the Lung Imaging Reporting and Data System (Lung-RADS) in distinguishing benign from malignant nodules.

Results · MPI-RF showed good diagnostic performance in differentiating benign from malignant pulmonary nodules (AUC=0.887, 95% CI 0.848‒0.925, sensitivity 81.22%, specificity 83.70%). By nodule type, the AUC of MPI-RF was 0.877 (95% CI 0.820‒0.934) for solid nodules, 0.858 (95% CI 0.771‒0.946) for part-solid nodules and 0.978 (95% CI 0.923‒1.000) for pure ground-glass nodules. By nodule size, the AUC of MPI-RF was 0.840 (95% CI 0.716‒0.963) for nodules <8 mm in diameter and 0.891 (95% CI 0.849‒0.933) for nodules ≥8 mm in diameter. The diagnostic performance of MPI-RF was better than that of the Mayo Clinic model, the VA model and the Brock model (all P=0.000). Compared with Lung-RADS, MPI-RF also performed better in the total sample and in each nodule type (all P=0.000).

Conclusion · MPI-RF is a well-performing model for the differential diagnosis of benign and malignant pulmonary nodules and has potential clinical application value.

Key words: machine learning, lung adenocarcinoma, metabolomics, multi-modal model, pulmonary nodule

CLC number: R734.2
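
The Methods compare AUCs with the DeLong test. As a rough illustration only (not the authors' code; the abstract does not say which software was used), the sketch below computes two AUCs for models scored on the same patients and the DeLong z statistic, using only the Python standard library. The function name `delong_test`, the 1/0 label encoding and the toy scores are all hypothetical.

```python
import math
from statistics import mean, variance


def _psi(pos_score, neg_score):
    """Pairwise kernel: 1 if the malignant case scores higher, 0.5 on a tie, else 0."""
    if pos_score > neg_score:
        return 1.0
    if pos_score == neg_score:
        return 0.5
    return 0.0


def _placements(pos, neg):
    """Per-case structural components: V10 for positives, V01 for negatives."""
    v10 = [mean([_psi(x, y) for y in neg]) for x in pos]
    v01 = [mean([_psi(x, y) for x in pos]) for y in neg]
    return v10, v01


def delong_test(labels, scores_a, scores_b):
    """Return (auc_a, auc_b, z, two_sided_p) for two models scored on the same cases.

    labels: 1 = malignant, 0 = benign (hypothetical encoding).
    Needs at least two cases in each class.
    """
    pos = [i for i, y in enumerate(labels) if y == 1]
    neg = [i for i, y in enumerate(labels) if y == 0]
    v10_a, v01_a = _placements([scores_a[i] for i in pos], [scores_a[i] for i in neg])
    v10_b, v01_b = _placements([scores_b[i] for i in pos], [scores_b[i] for i in neg])
    auc_a, auc_b = mean(v10_a), mean(v10_b)
    # The variance of the per-case differences equals S_aa + S_bb - 2*S_ab
    # taken from the DeLong covariance matrices of the two models.
    d10 = [a - b for a, b in zip(v10_a, v10_b)]
    d01 = [a - b for a, b in zip(v01_a, v01_b)]
    var = variance(d10) / len(pos) + variance(d01) / len(neg)
    if var == 0:
        raise ValueError("zero variance of the AUC difference; z is undefined")
    z = (auc_a - auc_b) / math.sqrt(var)
    p = math.erfc(abs(z) / math.sqrt(2))  # two-sided p-value from the normal distribution
    return auc_a, auc_b, z, p


# Toy data (hypothetical): four malignant cases followed by four benign cases.
labels = [1, 1, 1, 1, 0, 0, 0, 0]
model_a = [0.9, 0.8, 0.7, 0.35, 0.4, 0.3, 0.2, 0.1]
model_b = [0.6, 0.3, 0.55, 0.2, 0.5, 0.4, 0.1, 0.25]
auc_a, auc_b, z, p = delong_test(labels, model_a, model_b)
print(f"AUC A={auc_a:.4f}, AUC B={auc_b:.4f}, z={z:.3f}, p={p:.3f}")
```

Because both models are scored on the same patients, the variance is taken over per-case differences rather than treating the two AUCs as independent, which is the point of using a paired test here.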

XU Wanxing, WANG Lin, GUO Qiaomei, WANG Xueqing, LOU Jiatao. Clinical validation and application value exploration of multi-modal pulmonary nodule diagnosis model[J]. Journal of Shanghai Jiao Tong University (Medical Science), 2024, 44(8): 1030-1036.

Characteristic	Malignant nodule group (n=197)	Benign nodule group (n=92)	χ²/U/t value	P value
Age/year	53.92±10.95	51.92±10.70	1.457	0.815
Gender/n			10.213	0.000
Male	72	52
Female	125	40
CEA level/(ng·mL⁻¹)	1.99 (1.21, 2.83)	2.00 (1.28, 2.94)	8 791	0.682
Nodule size/n			15.300	0.000
<8 mm	16	23
≥8 mm	181	69
Nodule location/n			4.512	0.341
LUL	48	18
LLL	34	17
RUL	59	25
RML	15	14
RLL	41	18
Nodule type/n			27.954	0.000
Pure GGN	13	7
Part-solid nodule	107	20
Solid nodule	77	65
Spiculation/n			13.665	0.000
Yes	61	10
No	136	82
Lung-RADS grading/n			4.615	0.000
2	14	16
3	35	24
4A	32	21
4B	55	24
4X	61	7
Note: LUL, left upper lobe; LLL, left lower lobe; RUL, right upper lobe; RML, right middle lobe; RLL, right lower lobe; GGN, ground-glass nodule.
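
The χ² values for the 2×2 comparisons in this table can be checked from the counts alone. The sketch below assumes a Pearson χ² test without continuity correction (an assumption; the test variant is not stated on this page), and `chi_square_2x2` is a hypothetical helper name. Under that assumption the gender and spiculation rows reproduce the tabulated statistics.

```python
def chi_square_2x2(a, b, c, d):
    """Pearson chi-square statistic (no continuity correction) for [[a, b], [c, d]]."""
    n = a + b + c + d
    row1, row2 = a + b, c + d
    col1, col2 = a + c, b + d
    return n * (a * d - b * c) ** 2 / (row1 * row2 * col1 * col2)


# Counts are (malignant, benign) per row, read from the table above.
gender = chi_square_2x2(72, 52, 125, 40)       # male row, female row
spiculation = chi_square_2x2(61, 10, 136, 82)  # yes row, no row
print(f"gender: {gender:.3f}, spiculation: {spiculation:.3f}")
# gender: 10.213, spiculation: 13.665
```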

Model	Cut-off	AUC (95% CI)	Sensitivity/%	Specificity/%	PPV/%	NPV/%	Accuracy/%	P value①
MPI-RF	0.697	0.887 (0.848‒0.925)	81.22	83.70	91.43	67.52	82.01	‒
Mayo Clinic	0.319	0.682 (0.619‒0.745)	64.47	67.39	80.89	46.97	65.40	0.000
Brock	0.387	0.723 (0.664‒0.783)	51.78	86.96	89.47	45.71	62.98	0.000
VA	0.294	0.626 (0.561‒0.693)	45.69	83.70	85.71	41.85	57.79	0.000
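
The percentage columns in this table all follow from a 2×2 confusion matrix at the chosen cut-off. As a minimal sketch, the counts below are back-calculated from the Mayo Clinic row and the group sizes (197 malignant, 92 benign); they are an inference, not numbers given on this page, and `diagnostic_metrics` is a hypothetical helper.

```python
def diagnostic_metrics(tp, fn, tn, fp):
    """Return sensitivity, specificity, PPV, NPV and accuracy as percentages."""
    total = tp + fn + tn + fp
    return {
        "sensitivity": 100 * tp / (tp + fn),  # malignant nodules correctly flagged
        "specificity": 100 * tn / (tn + fp),  # benign nodules correctly cleared
        "ppv": 100 * tp / (tp + fp),          # flagged nodules that are malignant
        "npv": 100 * tn / (tn + fn),          # cleared nodules that are benign
        "accuracy": 100 * (tp + tn) / total,
    }


# Assumed counts for the Mayo Clinic row: 127 of 197 malignant nodules above
# the cut-off, 62 of 92 benign nodules below it.
mayo = diagnostic_metrics(tp=127, fn=70, tn=62, fp=30)
for name, value in mayo.items():
    print(f"{name}: {value:.2f}")
```

Note that PPV and NPV depend on the malignant-to-benign ratio of this surgical cohort (197:92), so they would shift in a population with a different prevalence even if sensitivity and specificity stayed the same.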

Item	AUC (95% CI)	Sensitivity/%	Specificity/%	PPV/%	NPV/%	Accuracy/%	P value
All nodules							0.000
MPI-RF	0.887 (0.848‒0.925)	81.22	83.70	91.43	67.52	82.01
Lung-RADS	0.593 (0.521‒0.665)	75.13	43.48	74.00	74.94	65.05
Solid nodule							0.000
MPI-RF	0.877 (0.820‒0.934)	80.52	84.62	86.11	78.57	82.39
Lung-RADS	0.636 (0.542‒0.729)	94.81	32.31	62.39	84.00	66.20
Part-solid nodule							0.000
MPI-RF	0.858 (0.771‒0.946)	88.79	80.00	95.96	57.14	87.40
Lung-RADS	0.641 (0.506‒0.776)	68.22	60.00	90.12	26.09	66.93
Pure GGN							0.000
MPI-RF	0.978 (0.923‒1.000)	92.31	100.00	100.00	87.50	95.00
Lung-RADS	0.577 (0.319‒0.835)	15.38	100.00	100.00	38.89	45.00
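
The Lung-RADS AUCs in this table are numerically consistent with treating Lung-RADS as a single-threshold (binary) test: with one operating point, the ROC curve is a two-segment polyline and its trapezoidal area reduces to (sensitivity + specificity)/2. This reading is an inference from the reported numbers, not something stated on this page, and `single_point_auc` is a hypothetical helper.

```python
def single_point_auc(sensitivity, specificity):
    """Trapezoidal area under the ROC polyline (0,0) -> (1 - spec, sens) -> (1,1)."""
    fpr, tpr = 1.0 - specificity, sensitivity
    points = [(0.0, 0.0), (fpr, tpr), (1.0, 1.0)]
    area = 0.0
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        area += (x1 - x0) * (y0 + y1) / 2
    return area


# Lung-RADS sensitivity and specificity (%) as listed in the table above.
lung_rads = {
    "All nodules": (75.13, 43.48),
    "Solid nodule": (94.81, 32.31),
    "Part-solid nodule": (68.22, 60.00),
    "Pure GGN": (15.38, 100.00),
}
for name, (sens, spec) in lung_rads.items():
    print(f"{name}: {single_point_auc(sens / 100, spec / 100):.3f}")
```

MPI-RF, by contrast, outputs a continuous probability score, so its ROC curve has many operating points and its AUC is not tied to a single sensitivity and specificity pair.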
