-
Artificial intelligence-based model for lymph node metastases detection on whole slide images in bladder cancer: a retrospective, multicentre, diagnostic study.
Accurate lymph node staging is important for the diagnosis and treatment of patients with bladder cancer. We aimed to develop a lymph node metastases diagnostic model (LNMDM) on whole slide images and to assess the clinical effect of an artificial intelligence-assisted (AI) workflow.
In this retrospective, multicentre, diagnostic study in China, we included consecutive patients with bladder cancer who had radical cystectomy and pelvic lymph node dissection, and from whom whole slide images of lymph node sections were available, for model development. We excluded patients with non-bladder cancer and concurrent surgery, or low-quality images. Patients from two hospitals (Sun Yat-sen Memorial Hospital of Sun Yat-sen University and Zhujiang Hospital of Southern Medical University, Guangzhou, Guangdong, China) were assigned before a cutoff date to a training set and after the date to internal validation sets for each hospital. Patients from three other hospitals (the Third Affiliated Hospital of Sun Yat-sen University, Nanfang Hospital of Southern Medical University, and the Third Affiliated Hospital of Southern Medical University, Guangzhou, Guangdong, China) were included as external validation sets. A validation subset of challenging cases from the five validation sets was used to compare performance between the LNMDM and pathologists, and two other datasets (breast cancer from the CAMELYON16 dataset and prostate cancer from the Sun Yat-sen Memorial Hospital of Sun Yat-sen University) were collected for a multi-cancer test. The primary endpoint was diagnostic sensitivity in the four prespecified groups (ie, the five validation sets, a single-lymph-node test set, the multi-cancer test set, and the subset for a performance comparison between the LNMDM and pathologists).
Between Jan 1, 2013 and Dec 31, 2021, 1012 patients with bladder cancer had radical cystectomy and pelvic lymph node dissection and were included (8177 images and 20 954 lymph nodes). We excluded 14 patients (165 images) with concurrent non-bladder cancer and also excluded 21 low-quality images. We included 998 patients and 7991 images (881 [88%] men; 117 [12%] women; median age 64 years [IQR 56-72]; ethnicity data not available; 268 [27%] with lymph node metastases) to develop the LNMDM. The area under the curve (AUC) for accurate diagnosis of the LNMDM ranged from 0·978 (95% CI 0·960-0·996) to 0·998 (0·996-1·000) in the five validation sets. Performance comparisons between the LNMDM and pathologists showed that the diagnostic sensitivity of the model (0·983 [95% CI 0·941-0·998]) substantially exceeded that of both junior pathologists (0·906 [0·871-0·934]) and senior pathologists (0·947 [0·919-0·968]), and that AI assistance improved sensitivity for both junior (from 0·906 without AI to 0·953 with AI) and senior (from 0·947 to 0·986) pathologists. In the multi-cancer test, the LNMDM maintained an AUC of 0·943 (95% CI 0·918-0·969) in breast cancer images and 0·922 (0·884-0·960) in prostate cancer images. In 13 patients, the LNMDM detected tumour micrometastases that had been missed by pathologists who had previously classified these patients' results as negative. Receiver operating characteristic curves showed that the LNMDM would enable pathologists to exclude 80-92% of negative slides while maintaining 100% sensitivity in clinical application.
We developed an AI-based diagnostic model that did well in detecting lymph node metastases, particularly micrometastases. The LNMDM showed substantial potential for clinical applications in improving the accuracy and efficiency of pathologists' work.
National Natural Science Foundation of China, the Science and Technology Planning Project of Guangdong Province, the National Key Research and Development Programme of China, and the Guangdong Provincial Clinical Research Centre for Urological Diseases.
Wu S
,Hong G
,Xu A
,Zeng H
,Chen X
,Wang Y
,Luo Y
,Wu P
,Liu C
,Jiang N
,Dang Q
,Yang C
,Liu B
,Shen R
,Chen Z
,Liao C
,Lin Z
,Wang J
,Lin T
... -
《-》
-
An artificial intelligence model for detecting pathological lymph node metastasis in prostate cancer using whole slide images: a retrospective, multicentre, diagnostic study.
The pathological examination of lymph node metastasis (LNM) is crucial for treating prostate cancer (PCa). However, the limitations with naked-eye detection and pathologist workload contribute to a high missed-diagnosis rate for nodal micrometastasis. We aimed to develop an artificial intelligence (AI)-based, time-efficient, and high-precision PCa LNM detector (ProCaLNMD) and evaluate its clinical application value.
In this multicentre, retrospective, diagnostic study, consecutive patients with PCa who underwent radical prostatectomy and pelvic lymph node dissection at five centres between Sep 2, 2013 and Apr 28, 2023 were included, and histopathological slides of resected lymph nodes were collected and digitised as whole-slide images for model development and validation. ProCaLNMD was trained at a dataset from a single centre (the Sun Yat-sen Memorial Hospital of Sun Yat-sen University [SYSMH]), and externally validated in the other four centres. A bladder cancer dataset from SYSMH was used to further validate ProCaLNMD, and an additional validation (human-AI comparison and collaboration study) containing consecutive patients with PCa from SYSMH was implemented to evaluate the application value of integrating ProCaLNMD into the clinical workflow. The primary endpoint was the area under the receiver operating characteristic curve (AUROC) of ProCaLNMD. In addition, the performance measures for pathologists with ProCaLNMD assistance was also assessed.
In total, 8225 slides from 1297 patients with PCa were collected and digitised. Overall, 8158 slides (18,761 lymph nodes) from 1297 patients with PCa (median age 68 years [interquartile range 64-73]; 331 [26%] with LNM) were used to train and validate ProCaLNMD. The AUROC of ProCaLNMD ranged from 0.975 (95% confidence interval 0.953-0.998) to 0.992 (0.982-1.000) in the training and validation datasets, with sensitivities > 0.955 and specificities > 0.921. ProCaLNMD also demonstrated an AUROC of 0.979 in the cross-cancer dataset. ProCaLNMD use triggered true reclassification in 43 (4.3%) slides in which micrometastatic tumour regions were initially missed by pathologists, thereby correcting 28 (8.5%) missed-diagnosed cases of previous routine pathological reports. In the human-AI comparison and collaboration study, the sensitivity of ProCaLNMD (0.983 [0.908-1.000]) surpassed that of two junior pathologists (0.862 [0.746-0.939], P = 0.023; 0.879 [0.767-0.950], P = 0.041) by 10-12% and showed no difference to that of two senior pathologists (both 0.983 [0.908-1.000], both P > 0.99). Furthermore, ProCaLNMD significantly boosted the diagnostic sensitivity of two junior pathologists (both P = 0.041) to the level of senior pathologists (both P > 0.99), and substantially reduced the four pathologists' slide reviewing time (-31%, P < 0.0001; -34%, P < 0.0001; -29%, P < 0.0001; and -27%, P = 0.00031).
ProCaLNMD demonstrated high diagnostic capabilities for identifying LNM in prostate cancer, reducing the likelihood of missed diagnoses by pathologists and decreasing the slide reviewing time, highlighting its potential for clinical application.
National Natural Science Foundation of China, the Science and Technology Planning Project of Guangdong Province, the National Key Research and Development Programme of China, the Guangdong Provincial Clinical Research Centre for Urological Diseases, and the Science and Technology Projects in Guangzhou.
Wu S
,Wang Y
,Hong G
,Luo Y
,Lin Z
,Shen R
,Zeng H
,Xu A
,Wu P
,Xiao M
,Li X
,Rao P
,Yang Q
,Feng Z
,He Q
,Jiang F
,Xie Y
,Liao C
,Huang X
,Chen R
,Lin T
... -
《EClinicalMedicine》
-
Deep learning-based artificial intelligence model to assist thyroid nodule diagnosis and management: a multicentre diagnostic study.
Strategies for integrating artificial intelligence (AI) into thyroid nodule management require additional development and testing. We developed a deep-learning AI model (ThyNet) to differentiate between malignant tumours and benign thyroid nodules and aimed to investigate how ThyNet could help radiologists improve diagnostic performance and avoid unnecessary fine needle aspiration.
ThyNet was developed and trained on 18 049 images of 8339 patients (training set) from two hospitals (the First Affiliated Hospital of Sun Yat-sen University, Guangzhou, China, and Sun Yat-sen University Cancer Center, Guangzhou, China) and tested on 4305 images of 2775 patients (total test set) from seven hospitals (the First Affiliated Hospital of Guangzhou University of Chinese Medicine, Guangzhou, China; the Sixth Affiliated Hospital of Sun Yat-sen University, Guangzhou, China; the Guangzhou Army General Hospital, Guangzhou, China; the Third Affiliated Hospital of Sun Yat-sen University, Guangzhou, China; the First Affiliated Hospital of Sun Yat-sen University; Sun Yat-sen University Cancer Center; and the First Affiliated Hospital of Guangxi Medical University, Nanning, China) in three stages. All nodules in the training and total test set were pathologically confirmed. The diagnostic performance of ThyNet was first compared with 12 radiologists (test set A); a ThyNet-assisted strategy, in which ThyNet assisted diagnoses made by radiologists, was developed to improve diagnostic performance of radiologists using images (test set B); the ThyNet assisted strategy was then tested in a real-world clinical setting (using images and videos; test set C). In a simulated scenario, the number of unnecessary fine needle aspirations avoided by ThyNet-assisted strategy was calculated.
The area under the receiver operating characteristic curve (AUROC) for accurate diagnosis of ThyNet (0·922 [95% CI 0·910-0·934]) was significantly higher than that of the radiologists (0·839 [0·834-0·844]; p<0·0001). Furthermore, ThyNet-assisted strategy improved the pooled AUROC of the radiologists from 0·837 (0·832-0·842) when diagnosing without ThyNet to 0·875 (0·871-0·880; p<0·0001) with ThyNet for reviewing images, and from 0·862 (0·851-0·872) to 0·873 (0·863-0·883; p<0·0001) in the clinical test, which used images and videos. In the simulated scenario, the number of fine needle aspirations decreased from 61·9% to 35·2% using the ThyNet-assisted strategy, while missed malignancy decreased from 18·9% to 17·0%.
The ThyNet-assisted strategy can significantly improve the diagnostic performance of radiologists and help reduce unnecessary fine needle aspirations for thyroid nodules.
National Natural Science Foundation of China and Guangzhou Science and Technology Project.
Peng S
,Liu Y
,Lv W
,Liu L
,Zhou Q
,Yang H
,Ren J
,Liu G
,Wang X
,Zhang X
,Du Q
,Nie F
,Huang G
,Guo Y
,Li J
,Liang J
,Hu H
,Xiao H
,Liu Z
,Lai F
,Zheng Q
,Wang H
,Li Y
,Alexander EK
,Wang W
,Xiao H
... -
《The Lancet Digital Health》
-
Real-time artificial intelligence for detection of upper gastrointestinal cancer by endoscopy: a multicentre, case-control, diagnostic study.
Upper gastrointestinal cancers (including oesophageal cancer and gastric cancer) are the most common cancers worldwide. Artificial intelligence platforms using deep learning algorithms have made remarkable progress in medical imaging but their application in upper gastrointestinal cancers has been limited. We aimed to develop and validate the Gastrointestinal Artificial Intelligence Diagnostic System (GRAIDS) for the diagnosis of upper gastrointestinal cancers through analysis of imaging data from clinical endoscopies.
This multicentre, case-control, diagnostic study was done in six hospitals of different tiers (ie, municipal, provincial, and national) in China. The images of consecutive participants, aged 18 years or older, who had not had a previous endoscopy were retrieved from all participating hospitals. All patients with upper gastrointestinal cancer lesions (including oesophageal cancer and gastric cancer) that were histologically proven malignancies were eligible for this study. Only images with standard white light were deemed eligible. The images from Sun Yat-sen University Cancer Center were randomly assigned (8:1:1) to the training and intrinsic verification datasets for developing GRAIDS, and the internal validation dataset for evaluating the performance of GRAIDS. Its diagnostic performance was evaluated using an internal and prospective validation set from Sun Yat-sen University Cancer Center (a national hospital) and additional external validation sets from five primary care hospitals. The performance of GRAIDS was also compared with endoscopists with three degrees of expertise: expert, competent, and trainee. The diagnostic accuracy, sensitivity, specificity, positive predictive value, and negative predictive value of GRAIDS and endoscopists for the identification of cancerous lesions were evaluated by calculating the 95% CIs using the Clopper-Pearson method.
1 036 496 endoscopy images from 84 424 individuals were used to develop and test GRAIDS. The diagnostic accuracy in identifying upper gastrointestinal cancers was 0·955 (95% CI 0·952-0·957) in the internal validation set, 0·927 (0·925-0·929) in the prospective set, and ranged from 0·915 (0·913-0·917) to 0·977 (0·977-0·978) in the five external validation sets. GRAIDS achieved diagnostic sensitivity similar to that of the expert endoscopist (0·942 [95% CI 0·924-0·957] vs 0·945 [0·927-0·959]; p=0·692) and superior sensitivity compared with competent (0·858 [0·832-0·880], p<0·0001) and trainee (0·722 [0·691-0·752], p<0·0001) endoscopists. The positive predictive value was 0·814 (95% CI 0·788-0·838) for GRAIDS, 0·932 (0·913-0·948) for the expert endoscopist, 0·974 (0·960-0·984) for the competent endoscopist, and 0·824 (0·795-0·850) for the trainee endoscopist. The negative predictive value was 0·978 (95% CI 0·971-0·984) for GRAIDS, 0·980 (0·974-0·985) for the expert endoscopist, 0·951 (0·942-0·959) for the competent endoscopist, and 0·904 (0·893-0·916) for the trainee endoscopist.
GRAIDS achieved high diagnostic accuracy in detecting upper gastrointestinal cancers, with sensitivity similar to that of expert endoscopists and was superior to that of non-expert endoscopists. This system could assist community-based hospitals in improving their effectiveness in upper gastrointestinal cancer diagnoses.
The National Key R&D Program of China, the Natural Science Foundation of Guangdong Province, the Science and Technology Program of Guangdong, the Science and Technology Program of Guangzhou, and the Fundamental Research Funds for the Central Universities.
Luo H
,Xu G
,Li C
,He L
,Luo L
,Wang Z
,Jing B
,Deng Y
,Jin Y
,Li Y
,Li B
,Tan W
,He C
,Seeruttun SR
,Wu Q
,Huang J
,Huang DW
,Chen B
,Lin SB
,Chen QM
,Yuan CM
,Chen HX
,Pu HY
,Zhou F
,He Y
,Xu RH
... -
《-》
-
Deep learning models for thyroid nodules diagnosis of fine-needle aspiration biopsy: a retrospective, prospective, multicentre study in China.
Accurately distinguishing between malignant and benign thyroid nodules through fine-needle aspiration cytopathology is crucial for appropriate therapeutic intervention. However, cytopathologic diagnosis is time consuming and hindered by the shortage of experienced cytopathologists. Reliable assistive tools could improve cytopathologic diagnosis efficiency and accuracy. We aimed to develop and test an artificial intelligence (AI)-assistive system for thyroid cytopathologic diagnosis according to the Thyroid Bethesda Reporting System.
11 254 whole-slide images (WSIs) from 4037 patients were used to train deep learning models. Among the selected WSIs, cell level was manually annotated by cytopathologists according to The Bethesda System for Reporting Thyroid Cytopathology (TBSRTC) guidelines of the second edition (2017 version). A retrospective dataset of 5638 WSIs of 2914 patients from four medical centres was used for validation. 469 patients were recruited for the prospective study of the performance of AI models and their 537 thyroid nodule samples were used. Cohorts for training and validation were enrolled between Jan 1, 2016, and Aug 1, 2022, and the prospective dataset was recruited between Aug 1, 2022, and Jan 1, 2023. The performance of our AI models was estimated as the area under the receiver operating characteristic (AUROC), sensitivity, specificity, accuracy, positive predictive value, and negative predictive value. The primary outcomes were the prediction sensitivity and specificity of the model to assist cyto-diagnosis of thyroid nodules.
The AUROC of TBSRTC III+ (which distinguishes benign from TBSRTC classes III, IV, V, and VI) was 0·930 (95% CI 0·921-0·939) for Sun Yat-sen Memorial Hospital of Sun Yat-sen University (SYSMH) internal validation and 0·944 (0·929 - 0·959), 0·939 (0·924-0·955), 0·971 (0·938-1·000) for The First People's Hospital of Foshan (FPHF), Sichuan Cancer Hospital & Institute (SCHI), and The Third Affiliated Hospital of Guangzhou Medical University (TAHGMU) medical centres, respectively. The AUROC of TBSRTC V+ (which distinguishes benign from TBSRTC classes V and VI) was 0·990 (95% CI 0·986-0·995) for SYSMH internal validation and 0·988 (0·980-0·995), 0·965 (0·953-0·977), and 0·991 (0·972-1·000) for FPHF, SCHI, and TAHGMU medical centres, respectively. For the prospective study at SYSMH, the AUROC of TBSRTC III+ and TBSRTC V+ was 0·977 and 0·981, respectively. With the assistance of AI, the specificity of junior cytopathologists was boosted from 0·887 (95% CI 0·8440-0·922) to 0·993 (0·974-0·999) and the accuracy was improved from 0·877 (0·846-0·904) to 0·948 (0·926-0·965). 186 atypia of undetermined significance samples from 186 patients with BRAF mutation information were collected; 43 of them harbour the BRAFV600E mutation. 91% (39/43) of BRAFV600E-positive atypia of undetermined significance samples were identified as malignant by the AI models.
In this study, we developed an AI-assisted model named the Thyroid Patch-Oriented WSI Ensemble Recognition (ThyroPower) system, which facilitates rapid and robust cyto-diagnosis of thyroid nodules, potentially enhancing the diagnostic capabilities of cytopathologists. Moreover, it serves as a potential solution to mitigate the scarcity of cytopathologists.
Guangdong Science and Technology Department.
For the Chinese translation of the abstract see Supplementary Materials section.
Wang J
,Zheng N
,Wan H
,Yao Q
,Jia S
,Zhang X
,Fu S
,Ruan J
,He G
,Chen X
,Li S
,Chen R
,Lai B
,Wang J
,Jiang Q
,Ouyang N
,Zhang Y
... -
《The Lancet Digital Health》