TY - JOUR
T1 - A deep-learning model for intracranial aneurysm detection on CT angiography images in China
T2 - a stepwise, multicentre, early-stage clinical validation study
AU - China Aneurysm AI Project Group
AU - Hu, Bin
AU - Shi, Zhao
AU - Lu, Li
AU - Miao, Zhongchang
AU - Wang, Hao
AU - Zhou, Zhen
AU - Zhang, Fandong
AU - Wang, Rongpin
AU - Luo, Xiao
AU - Xu, Feng
AU - Li, Sheng
AU - Fang, Xiangming
AU - Wang, Xiaodong
AU - Yan, Ge
AU - Lv, Fajin
AU - Zhang, Meng
AU - Sun, Qiu
AU - Cui, Guangbin
AU - Liu, Yubao
AU - Zhang, Shu
AU - Pan, Chengwei
AU - Hou, Zhibo
AU - Liang, Huiying
AU - Pan, Yuning
AU - Chen, Xiaoxia
AU - Li, Xiaorong
AU - Zhou, Fei
AU - Schoepf, U. Joseph
AU - Varga-Szemes, Akos
AU - Garrison Moore, W.
AU - Yu, Yizhou
AU - Hu, Chunfeng
AU - Zhang, Long Jiang
AU - Tan, Bin
AU - Liu, Feidi
AU - Chen, Feng
AU - Gu, Hongmei
AU - Hou, Mingli
AU - Xu, Rui
AU - Zuo, Rui
AU - Tao, Shumin
AU - Chen, Weiwei
AU - Chai, Xue
AU - Wang, Wulin
AU - Dai, Yongjian
AU - Chen, Yueqin
AU - Zhou, Changsheng
AU - Lu, Guang Ming
AU - Schoepf, U. Joseph
AU - Moore, W. Garrison
N1 - Publisher Copyright:
© 2024 The Author(s). Published by Elsevier Ltd. This is an Open Access article under the CC BY-NC-ND 4.0 license
PY - 2024/4
Y1 - 2024/4
N2 - Background: Artificial intelligence (AI) models in real-world implementation are scarce. Our study aimed to develop a CT angiography (CTA)-based AI model for intracranial aneurysm detection, assess how it helps clinicians improve diagnostic performance, and validate its application in real-world clinical implementation. Methods: We developed a deep-learning model using 16 546 head and neck CTA examination images from 14 517 patients at eight Chinese hospitals. Using an adapted, stepwise implementation and evaluation, 120 certified clinicians from 15 geographically different hospitals were recruited. Initially, the AI model was externally validated with images of 900 digital subtraction angiography-verified CTA cases (examinations) and compared with the performance of 24 clinicians who each viewed 300 of these cases (stage 1). Next, as a further external validation a multi-reader multi-case study enrolled 48 clinicians to individually review 298 digital subtraction angiography-verified CTA cases (stage 2). The clinicians reviewed each CTA examination twice (ie, with and without the AI model), separated by a 4-week washout period. Then, a randomised open-label comparison study enrolled 48 clinicians to assess the acceptance and performance of this AI model (stage 3). Finally, the model was prospectively deployed and validated in 1562 real-world clinical CTA cases. Findings: The AI model in the internal dataset achieved a patient-level diagnostic sensitivity of 0·957 (95% CI 0·939–0·971) and a higher patient-level diagnostic sensitivity than clinicians (0·943 [0·921–0·961] vs 0·658 [0·644–0·672]; p<0·0001) in the external dataset. In the multi-reader multi-case study, the AI-assisted strategy improved clinicians' diagnostic performance both on a per-patient basis (the area under the receiver operating characteristic curves [AUCs]; 0·795 [0·761–0·830] without AI vs 0·878 [0·850–0·906] with AI; p<0·0001) and a per-aneurysm basis (the area under the weighted alternative free-response receiver operating characteristic curves; 0·765 [0·732–0·799] vs 0·865 [0·839–0·891]; p<0·0001). Reading time decreased with the aid of the AI model (87·5 s vs 82·7 s, p<0·0001). In the randomised open-label comparison study, clinicians in the AI-assisted group had a high acceptance of the AI model (92·6% adoption rate), and a higher AUC when compared with the control group (0·858 [95% CI 0·850–0·866] vs 0·789 [0·780–0·799]; p<0·0001). In the prospective study, the AI model had a 0·51% (8/1570) error rate due to poor-quality CTA images and recognition failure. The model had a high negative predictive value of 0·998 (0·994–1·000) and significantly improved the diagnostic performance of clinicians; AUC improved from 0·787 (95% CI 0·766–0·808) to 0·909 (0·894–0·923; p<0·0001) and patient-level sensitivity improved from 0·590 (0·511–0·666) to 0·825 (0·759–0·880; p<0·0001). Interpretation: This AI model demonstrated strong clinical potential for intracranial aneurysm detection with improved clinician diagnostic performance, high acceptance, and practical implementation in real-world clinical cases. Funding: National Natural Science Foundation of China. Translation: For the Chinese translation of the abstract see Supplementary Materials section.
AB - Background: Artificial intelligence (AI) models in real-world implementation are scarce. Our study aimed to develop a CT angiography (CTA)-based AI model for intracranial aneurysm detection, assess how it helps clinicians improve diagnostic performance, and validate its application in real-world clinical implementation. Methods: We developed a deep-learning model using 16 546 head and neck CTA examination images from 14 517 patients at eight Chinese hospitals. Using an adapted, stepwise implementation and evaluation, 120 certified clinicians from 15 geographically different hospitals were recruited. Initially, the AI model was externally validated with images of 900 digital subtraction angiography-verified CTA cases (examinations) and compared with the performance of 24 clinicians who each viewed 300 of these cases (stage 1). Next, as a further external validation a multi-reader multi-case study enrolled 48 clinicians to individually review 298 digital subtraction angiography-verified CTA cases (stage 2). The clinicians reviewed each CTA examination twice (ie, with and without the AI model), separated by a 4-week washout period. Then, a randomised open-label comparison study enrolled 48 clinicians to assess the acceptance and performance of this AI model (stage 3). Finally, the model was prospectively deployed and validated in 1562 real-world clinical CTA cases. Findings: The AI model in the internal dataset achieved a patient-level diagnostic sensitivity of 0·957 (95% CI 0·939–0·971) and a higher patient-level diagnostic sensitivity than clinicians (0·943 [0·921–0·961] vs 0·658 [0·644–0·672]; p<0·0001) in the external dataset. In the multi-reader multi-case study, the AI-assisted strategy improved clinicians' diagnostic performance both on a per-patient basis (the area under the receiver operating characteristic curves [AUCs]; 0·795 [0·761–0·830] without AI vs 0·878 [0·850–0·906] with AI; p<0·0001) and a per-aneurysm basis (the area under the weighted alternative free-response receiver operating characteristic curves; 0·765 [0·732–0·799] vs 0·865 [0·839–0·891]; p<0·0001). Reading time decreased with the aid of the AI model (87·5 s vs 82·7 s, p<0·0001). In the randomised open-label comparison study, clinicians in the AI-assisted group had a high acceptance of the AI model (92·6% adoption rate), and a higher AUC when compared with the control group (0·858 [95% CI 0·850–0·866] vs 0·789 [0·780–0·799]; p<0·0001). In the prospective study, the AI model had a 0·51% (8/1570) error rate due to poor-quality CTA images and recognition failure. The model had a high negative predictive value of 0·998 (0·994–1·000) and significantly improved the diagnostic performance of clinicians; AUC improved from 0·787 (95% CI 0·766–0·808) to 0·909 (0·894–0·923; p<0·0001) and patient-level sensitivity improved from 0·590 (0·511–0·666) to 0·825 (0·759–0·880; p<0·0001). Interpretation: This AI model demonstrated strong clinical potential for intracranial aneurysm detection with improved clinician diagnostic performance, high acceptance, and practical implementation in real-world clinical cases. Funding: National Natural Science Foundation of China. Translation: For the Chinese translation of the abstract see Supplementary Materials section.
UR - https://www.scopus.com/pages/publications/85188566409
U2 - 10.1016/S2589-7500(23)00268-6
DO - 10.1016/S2589-7500(23)00268-6
M3 - 文章
C2 - 38519154
AN - SCOPUS:85188566409
SN - 2589-7500
VL - 6
SP - e261-e271
JO - The Lancet Digital Health
JF - The Lancet Digital Health
IS - 4
ER -