Whole-tumor 3D volumetric MRI-based radiomics approach for distinguishing between benign and malignant soft tissue tumors.
Our purpose was to differentiate between malignant from benign soft tissue neoplasms using a combination of MRI-based radiomics metrics and machine learning.
Our retrospective study identified 128 histologically diagnosed benign (n = 36) and malignant (n = 92) soft tissue lesions. 3D ROIs were manually drawn on 1 sequence of interest and co-registered to other sequences obtained during the same study. One thousand seven hundred eight radiomics features were extracted from each ROI. Univariate analyses with supportive ROC analyses were conducted to evaluate the discriminative power of predictive models constructed using Real Adaptive Boosting (Adaboost) and Random Forest (RF) machine learning approaches.
Univariate analyses demonstrated that 36.89% of individual radiomics varied significantly between benign and malignant lesions at the p ≤ 0.05 level. Adaboost and RF performed similarly well, with AUCs of 0.77 (95% CI 0.68-0.85) and 0.72 (95% CI 0.63-0.81), respectively, after 10-fold cross-validation. Restricting the machine learning models to only sequences extracted from T2FS and STIR sequences maintained comparable performance, with AUCs of 0.73 (95% CI 0.64-0.82) and 0.75 (95% CI 0.65-0.84), respectively.
Machine learning decision classifiers constructed from MRI-based radiomics features show promising ability to preoperatively discriminate between benign and malignant soft tissue masses. Our approach maintains applicability even when the dataset is restricted to T2FS and STIR fluid-sensitive sequences, which may bolster practicality in clinical application scenarios by eliminating the need for complex co-registrations for multisequence analysis.
• Predictive models constructed from MRI-based radiomics data and machine learning-augmented approaches yielded good discriminative power to correctly classify benign and malignant lesions on preoperative scans, with AUCs of 0.77 (95% CI 0.68-0.85) and 0.72 (95% CI 0.63-0.81) for Real Adaptive Boosting (Adaboost) and Random Forest (RF), respectively. • Restricting the models to only use metrics extracted from T2 fat-saturated (T2FS) and Short-Tau Inversion Recovery (STIR) sequences yielded similar performance, with AUCs of 0.73 (95% CI 0.64-0.82) and 0.75 (95% CI 0.65-0.84) for Adaboost and RF, respectively. • Radiomics-based machine learning decision classifiers constructed from multicentric data more closely mimic the real-world practice environment and warrant additional validation ahead of prospective implementation into clinical workflows.
Fields BKK
,Demirjian NL
,Hwang DH
,Varghese BA
,Cen SY
,Lei X
,Desai B
,Duddalwar V
,Matcuk GR Jr
... -
《-》
Ensemble learning-based radiomics with multi-sequence magnetic resonance imaging for benign and malignant soft tissue tumor differentiation.
Many previous studies focused on differentiating between benign and malignant soft tissue tumors using radiomics model based on various magnetic resonance imaging (MRI) sequences, but it is still unclear how to set up the input radiomic features from multiple MRI sequences. Here, we evaluated two types of radiomics models generated using different feature incorporation strategies. In order to differentiate between benign and malignant soft tissue tumors (STTs), we compared the diagnostic performance of an ensemble of random forest (R) models with single-sequence MRI inputs to R models with pooled multi-sequence MRI inputs. One-hundred twenty-five STT patients with preoperative MRI were retrospectively included and consisted of training (n = 100) and test (n = 25) sets. MRI included T1-weighted (T1-WI), T2-weighted (T2-WI), contrast-enhanced (CE)-T1-WI, diffusion-weighted images (DWIs, b = 800 sec/mm2) and apparent diffusion coefficient (ADC) maps. After tumor segmentation on each sequence, 100 original radiomic features were extracted from each sequence image and divided into three-feature sets: T features from T1- and T2-WI, CE features from CE-T1-WI, and D features from DWI and ADC maps. Four radiomics models were built using Lasso and R with four combinations of three-feature sets as inputs: T features (R-T), T+CE features (R-C), T+D features (R-D), and T+CE+D features (R-A) (Type-1 model). An ensemble model was built by soft voting of five, single-sequence-based R models (Type-2 model). AUC, sensitivity, specificity, and accuracy of each model was calculated with five-fold cross validation. In Type-1 model, AUC, sensitivity, specificity, and accuracy were 0.752, 71.8%, 61.1%, and 67.2% in R-T; 0.756, 76.1%, 70.4%, and 73.6% in R-C; 0.750, 77.5%, 63.0%, and 71.2% in R-D; and 0.749, 74.6%, 61.1%, and 68.8% R-A models, respectively. AUC, sensitivity, specificity, and accuracy of Type-2 model were 0.774, 76.1%, 68.5%, and 72.8%. In conclusion, an ensemble method is beneficial to incorporate features from multi-sequence MRI and showed diagnostic robustness for differentiating malignant STTs.
Lee S
,Lee SY
,Jung JY
,Nam Y
,Jeon HJ
,Jung CK
,Shin SH
,Chung YG
... -
《PLoS One》
Radiomics analysis of ultrasound images to discriminate between benign and malignant adnexal masses with solid ultrasound morphology.
Our primary aim was to identify radiomic ultrasound features that can distinguish benign from malignant adnexal masses with solid ultrasound morphology, and primary invasive from metastatic solid ovarian masses, and to develop ultrasound-based machine learning models that include radiomics features to discriminate between benign and malignant solid adnexal masses. Our secondary aim was to compare the diagnostic performance of our radiomics models with that of the ADNEX model and subjective assessment by an experienced ultrasound examiner.
This is a retrospective observational single center study. Patients with a histological diagnosis of an adnexal tumor with solid morphology at preoperative ultrasound examination performed between 2014 and 2021 were included. The patient cohort was split into training and validation sets with a ratio of 70:30 and with the same proportion of benign and malignant (borderline, primary invasive and metastatic) tumors in the two subsets. The extracted radiomic features belonged to two different families: intensity-based statistical features and textural features. Models to predict malignancy were built based on a random forest classifier, fine-tuned using 5-fold cross-validation over the training set, and tested on the held-out validation set. The variables used in model building were patient's age, and those radiomic features that were statistically significantly different between benign and malignant adnexal masses (Wilcoxon-Mann-Whitney Test with Benjamini-Hochberg correction for multiple comparisons) and assessed as not redundant based on the Pearson correlation coefficient. We describe discriminative ability as area under the receiver operating characteristics curve (AUC) and classification performance as sensitivity and specificity.
326 patients were identified and 775 preoperative ultrasound images were analyzed. 68 radiomic features were extracted, 52 differed statistically significantly between benign and malignant tumors in the training set, and 18 features were selected for inclusion in model building. The same 52 radiomic features differed statistically significantly between benign, primary invasive malignant and metastatic tumors. However, the values of the features manifested overlap between primary malignant and metastatic tumors and did not differ statistically significantly between them. In the validation set, 25/98 tumors (25.5%) were benign, 73/98 (74.5%) were malignant (6 borderline, 57 primary invasive, 10 metastases). In the validation set, a model including only radiomics features had an AUC of 0.80, and 78% sensitivity and 76% specificity at its optimal risk of malignancy cutoff (68% based on Youden's index). The corresponding results for a model including age and radiomics features were 0.79, 86% and 56% (cutoff 60% based on Youden's method), while those of the ADNEX model were 0.88, 99% and 64% (at 20% malignancy cutoff). Subjective assessment had sensitivity 99% and specificity 72%.
Even though our radiomics models had discriminative ability inferior to that of the ADNEX model, our results are promising enough to justify continued development of radiomics analysis of ultrasound images of adnexal masses. This article is protected by copyright. All rights reserved.
Moro F
,Vagni M
,Tran HE
,Bernardini F
,Mascilini F
,Ciccarone F
,Nero C
,Giannarelli D
,Boldrini L
,Fagotti A
,Scambia G
,Valentin L
,Testa AC
... -
《-》