Discovery of biomarkers in the psoriasis through machine learning and dynamic immune infiltration in three types of skin lesions.
Psoriasis is a chronic skin disease characterized by unique scaling plaques. However, during the acute phase, psoriatic lesions exhibit eczematous changes, making them difficult to distinguish from atopic dermatitis, which poses challenges for the selection of biological agents. This study aimed to identify potential diagnostic genes in psoriatic lesions and investigate their clinical significance.
GSE182740 datasets from the GEO database were analyzed for differential analysis; machine learning algorithms (SVM-RFE and LASSO regression models) are used to screen for diagnostic markers; CIBERSORTx is used to determine the dynamic changes of 22 different immune cell components in normal skin lesions, psoriatic non-lesional skin, and psoriatic lesional skin, as well as the expression of the diagnostic genes in 10 major immune cells, and real-time quantitative polymerase chain reaction (RT-qPCR) and immunohistochemistry are used to validate results.
We obtained 580 differentially expressed genes (DEGs) in the skin lesion and non-lesion of psoriasis patients, 813 DEGs in mixed patients between non-lesions and lesions, and 96 DEGs in the skin lesion and non-lesion of atopic dermatitis, respectively. Then 144 specific DEGs in psoriasis via a Veen diagram were identified. Ultimately, UGGT1, CCNE1, MMP9 and ARHGEF28 are identified for potential diagnostic genes from these 144 specific DEGs. The value of the selected diagnostic genes was verified by receiver operating characteristic (ROC) curves with expanded samples. The the area under the ROC curve (AUC) exceeded 0.7 for the four diagnosis genes. RT-qPCR results showed that compared to normal human epidermis, the expression of UGGT1, CCNE1, and MMP9 was significantly increased in patients with psoriasis, while ARHGEF28 expression was significantly decreased. Notably, the results of CIBERSORTx showed that CCNE1 was highly expressed in CD4+ T cells and neutrophils, ARHGEF28 was also expressed in mast cells. Additionally, CCNE1 was strongly correlated with IL-17/CXCL8/9/10 and CCL20. Immunohistochemical results showed increased nuclear expression of CCNE1 in psoriatic epidermal cells relative to normal.
Based on the performance of the four genes in ROC curves and their expression in immune cells from patients with psoriasis, we suggest that CCNE1 possess higher diagnostic value.
Zhou X
,Zhou H
,Luo X
,Wu RF
... -
《Frontiers in Immunology》
Development of machine learning models for diagnostic biomarker identification and immune cell infiltration analysis in PCOS.
Polycystic ovary syndrome (PCOS) is a common endocrine disorder affecting women of reproductive age. It is characterized by symptoms such as hyperandrogenemia, oligo or anovulation and polycystic ovarian, significantly impacting quality of life. However, the practical implementation of machine learning (ML) in PCOS diagnosis is hindered by the limitations related to data size and algorithmic models. To address this research gap, we have increased the sample size in our study and aim to utilize two ML algorithms to analyze and validate diagnostic biomarkers, as well as explore immune cell infiltration patterns in PCOS.
We performed RNA-seq analysis on granulosa cell, including 13 samples from normal controls and 25 samples from women with PCOS. The data from our study were combined with publicly available databases. Batch effects were corrected using the 'sva' package in R software. Differential expression analysis was performed to identify genes that exhibited significant differences between the two groups. These differentially expressed genes (DEGs) were further analyzed for Gene Ontology (GO) terms and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways. Hub genes were selected by intersecting the results of both methods after using LASSO and SVM-RFE for central gene selection for DEGs. Receiver Operating Characteristic (ROC) curves were employed to verify the accuracy of models by SVM and XGBoost. CIBERSORT analysis was performed to determine the relative abundances of immune cell populations. GSEA was analyzed to illustrate the expression patterns of genes within highly enriched functional pathways. RT-qPCR was used to validate the reliability of hub genes.
824 DEGs were found between the normal control and PCOS groups, including 376 upregulated and 448 downregulated genes. These DEGs were associated with endocytosis, salmonella infection and focal adhesion based on the KEGG enrichment analysis. Through overlapping LASSO and SVM-RFE algorithms, we identified four hub genes (CNTN2, CASR, CACNB3, MFAP2) that are significantly associated with the PCOS group. The diagnostic efficacy validation set using SVM and XGBoost yielded AUC values of 0.795 and 0.875, respectively, indicating their potential as diagnostic biomarkers. Consistent with the data analysis, the upregulation of CNTN2, CASR, CACNB3, and MFAP2 in PCOS was confirmed by RT-qPCR analysis on human granulosa cells. Furthermore, according to CIBERSORT analysis, a significant reduction in CD4 memory resting T cells was revealed in the PCOS group compared to the normal control group (P < 0.05).
This study identified CNTN2, CASR, CACNB3, and MFAP2 as potential diagnostic biomarkers for PCOS, which provides strong evidence for existing research on hub genes. Furthermore, the analysis of immune cell infiltration revealed the significant involvement of CD4 memory resting T cells in the onset and progression of PCOS. These findings shed light on potential mechanisms underlying PCOS pathogenesis and provide valuable insights for future research and therapeutic interventions.
Chen W
,Miao J
,Chen J
,Chen J
... -
《Journal of Ovarian Research》
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.
Survival estimation for patients with symptomatic skeletal metastases ideally should be made before a type of local treatment has already been determined. Currently available survival prediction tools, however, were generated using data from patients treated either operatively or with local radiation alone, raising concerns about whether they would generalize well to all patients presenting for assessment. The Skeletal Oncology Research Group machine-learning algorithm (SORG-MLA), trained with institution-based data of surgically treated patients, and the Metastases location, Elderly, Tumor primary, Sex, Sickness/comorbidity, and Site of radiotherapy model (METSSS), trained with registry-based data of patients treated with radiotherapy alone, are two of the most recently developed survival prediction models, but they have not been tested on patients whose local treatment strategy is not yet decided.
(1) Which of these two survival prediction models performed better in a mixed cohort made up both of patients who received local treatment with surgery followed by radiotherapy and who had radiation alone for symptomatic bone metastases? (2) Which model performed better among patients whose local treatment consisted of only palliative radiotherapy? (3) Are laboratory values used by SORG-MLA, which are not included in METSSS, independently associated with survival after controlling for predictions made by METSSS?
Between 2010 and 2018, we provided local treatment for 2113 adult patients with skeletal metastases in the extremities at an urban tertiary referral academic medical center using one of two strategies: (1) surgery followed by postoperative radiotherapy or (2) palliative radiotherapy alone. Every patient's survivorship status was ascertained either by their medical records or the national death registry from the Taiwanese National Health Insurance Administration. After applying a priori designated exclusion criteria, 91% (1920) were analyzed here. Among them, 48% (920) of the patients were female, and the median (IQR) age was 62 years (53 to 70 years). Lung was the most common primary tumor site (41% [782]), and 59% (1128) of patients had other skeletal metastases in addition to the treated lesion(s). In general, the indications for surgery were the presence of a complete pathologic fracture or an impending pathologic fracture, defined as having a Mirels score of ≥ 9, in patients with an American Society of Anesthesiologists (ASA) classification of less than or equal to IV and who were considered fit for surgery. The indications for radiotherapy were relief of pain, local tumor control, prevention of skeletal-related events, and any combination of the above. In all, 84% (1610) of the patients received palliative radiotherapy alone as local treatment for the target lesion(s), and 16% (310) underwent surgery followed by postoperative radiotherapy. Neither METSSS nor SORG-MLA was used at the point of care to aid clinical decision-making during the treatment period. Survival was retrospectively estimated by these two models to test their potential for providing survival probabilities. We first compared SORG to METSSS in the entire population. Then, we repeated the comparison in patients who received local treatment with palliative radiation alone. We assessed model performance by area under the receiver operating characteristic curve (AUROC), calibration analysis, Brier score, and decision curve analysis (DCA). The AUROC measures discrimination, which is the ability to distinguish patients with the event of interest (such as death at a particular time point) from those without. AUROC typically ranges from 0.5 to 1.0, with 0.5 indicating random guessing and 1.0 a perfect prediction, and in general, an AUROC of ≥ 0.7 indicates adequate discrimination for clinical use. Calibration refers to the agreement between the predicted outcomes (in this case, survival probabilities) and the actual outcomes, with a perfect calibration curve having an intercept of 0 and a slope of 1. A positive intercept indicates that the actual survival is generally underestimated by the prediction model, and a negative intercept suggests the opposite (overestimation). When comparing models, an intercept closer to 0 typically indicates better calibration. Calibration can also be summarized as log(O:E), the logarithm scale of the ratio of observed (O) to expected (E) survivors. A log(O:E) > 0 signals an underestimation (the observed survival is greater than the predicted survival); and a log(O:E) < 0 indicates the opposite (the observed survival is lower than the predicted survival). A model with a log(O:E) closer to 0 is generally considered better calibrated. The Brier score is the mean squared difference between the model predictions and the observed outcomes, and it ranges from 0 (best prediction) to 1 (worst prediction). The Brier score captures both discrimination and calibration, and it is considered a measure of overall model performance. In Brier score analysis, the "null model" assigns a predicted probability equal to the prevalence of the outcome and represents a model that adds no new information. A prediction model should achieve a Brier score at least lower than the null-model Brier score to be considered as useful. The DCA was developed as a method to determine whether using a model to inform treatment decisions would do more good than harm. It plots the net benefit of making decisions based on the model's predictions across all possible risk thresholds (or cost-to-benefit ratios) in relation to the two default strategies of treating all or no patients. The care provider can decide on an acceptable risk threshold for the proposed treatment in an individual and assess the corresponding net benefit to determine whether consulting with the model is superior to adopting the default strategies. Finally, we examined whether laboratory data, which were not included in the METSSS model, would have been independently associated with survival after controlling for the METSSS model's predictions by using the multivariable logistic and Cox proportional hazards regression analyses.
Between the two models, only SORG-MLA achieved adequate discrimination (an AUROC of > 0.7) in the entire cohort (of patients treated operatively or with radiation alone) and in the subgroup of patients treated with palliative radiotherapy alone. SORG-MLA outperformed METSSS by a wide margin on discrimination, calibration, and Brier score analyses in not only the entire cohort but also the subgroup of patients whose local treatment consisted of radiotherapy alone. In both the entire cohort and the subgroup, DCA demonstrated that SORG-MLA provided more net benefit compared with the two default strategies (of treating all or no patients) and compared with METSSS when risk thresholds ranged from 0.2 to 0.9 at both 90 days and 1 year, indicating that using SORG-MLA as a decision-making aid was beneficial when a patient's individualized risk threshold for opting for treatment was 0.2 to 0.9. Higher albumin, lower alkaline phosphatase, lower calcium, higher hemoglobin, lower international normalized ratio, higher lymphocytes, lower neutrophils, lower neutrophil-to-lymphocyte ratio, lower platelet-to-lymphocyte ratio, higher sodium, and lower white blood cells were independently associated with better 1-year and overall survival after adjusting for the predictions made by METSSS.
Based on these discoveries, clinicians might choose to consult SORG-MLA instead of METSSS for survival estimation in patients with long-bone metastases presenting for evaluation of local treatment. Basing a treatment decision on the predictions of SORG-MLA could be beneficial when a patient's individualized risk threshold for opting to undergo a particular treatment strategy ranged from 0.2 to 0.9. Future studies might investigate relevant laboratory items when constructing or refining a survival estimation model because these data demonstrated prognostic value independent of the predictions of the METSSS model, and future studies might also seek to keep these models up to date using data from diverse, contemporary patients undergoing both modern operative and nonoperative treatments.
Level III, diagnostic study.
Lee CC
,Chen CW
,Yen HK
,Lin YP
,Lai CY
,Wang JL
,Groot OQ
,Janssen SJ
,Schwab JH
,Hsu FM
,Lin WH
... -
《-》
Identification of a novel disulfidptosis-related gene signature in osteoarthritis using bioinformatics analysis and experimental validation.
Osteoarthritis (OA) is a degenerative bone disease characterized by the destruction of joint cartilage and synovial inflammation, involving intricate immune regulation processes. Disulfidptosis, a novel form of programmed cell death, has recently been identified; however, the effects and roles of disulfidptosis-related genes (DR-DEGs) in OA remain unclear. We obtained six OA datasets from the GEO database, using four as training sets and two as validation sets. Differential expression analysis was employed to identify DR-DEGs, and unique molecular subtypes of OA were constructed based on these DR-DEGs. Subsequently, the immune microenvironment of OA patients was comprehensively analyzed using the "CIBERSORT" algorithm for immune infiltration. Various machine learning algorithms were utilized to screen characteristic DR-DEGs, and nomogram models and ROC curves were built based on these genes. The scRNA dataset (GSE169454) was used to classify chondrocytes in OA samples into distinct cell types, further exploring the gene distribution and correlation of characteristic DR-DEGs with specific cell subpopulations. Moreover, the expression levels of four characteristic DR-DEGs were validated through OA cell models and rat models. In our study, we identified 10 DR-DEGs with significant differences in expression within OA samples. Based on these DR-DEGs, two distinct molecular subtypes were recognized (cluster 1 and 2). ZNF484 and NDUFS1 were found to be significantly overexpressed in subtype 1, while the infiltration abundance of activated mast cells was markedly elevated in subtype 2. Moreover, significant differences were observed in the infiltration proportions of 11 immune cell types between OA and control samples, with 9 DR-DEGs demonstrating substantial correlations with immune cell infiltration levels. Further analysis of the scRNA dataset revealed that SLC3A2 and NDUFC1 were predominantly expressed in the preHTC subpopulation. All 10 DR-DEGs exhibited notably higher expression in the EC subpopulation across various cell types. The proportion of EC subgroups with high SLC3A2 expression increased, mainly enriching pathways related to inflammation, such as the IL-17 signaling pathway and TGF-beta signaling pathway. Using machine learning, we identified four characteristic DR-DEGs, which, in combination with the nomogram and ROC models, demonstrated promising performance in the diagnosis of OA. Additionally, in vivo validation confirmed a significant elevation of PPM1F expression in OA models. This study identified DR-DEGs as potential biomarkers for the diagnosis and classification of OA and provided a preliminary understanding of their role in the immune microenvironment. However, further experimental and clinical studies are required to validate their diagnostic value and therapeutic potential.
Wei M
,Shi X
,Tang W
,Lv Q
,Wu Y
,Xu Y
... -
《Scientific Reports》