-
Exploration and verification a 13-gene diagnostic framework for ulcerative colitis across multiple platforms via machine learning algorithms.
Ulcerative colitis (UC) is a chronic inflammatory bowel disease with intricate pathogenesis and varied presentation. Accurate diagnostic tools are imperative to detect and manage UC. This study sought to construct a robust diagnostic model using gene expression profiles and to identify key genes that differentiate UC patients from healthy controls. Gene expression profiles from eight cohorts, encompassing a total of 335 UC patients and 129 healthy controls, were analyzed. A total of 7530 gene sets were computed using the GSEA method. Subsequent batch correction, PCA plots, and intersection analysis identified crucial pathways and genes. Machine learning, incorporating 101 algorithm combinations, was employed to develop diagnostic models. Verification was done using four external cohorts, adding depth to the sample repertoire. Evaluation of immune cell infiltration was undertaken through single-sample GSEA. All statistical analyses were conducted using R (Version: 4.2.2), with significance set at a P value below 0.05. Employing the GSEA method, 7530 gene sets were computed. From this, 19 intersecting pathways were discerned to be consistently upregulated across all cohorts, which pertained to cell adhesion, development, metabolism, immune response, and protein regulation. This corresponded to 83 unique genes. Machine learning insights culminated in the LASSO regression model, which outperformed others with an average AUC of 0.942. This model's efficacy was further ratified across four external cohorts, with AUC values ranging from 0.694 to 0.873 and significant Kappa statistics indicating its predictive accuracy. The LASSO logistic regression model highlighted 13 genes, with LCN2, ASS1, and IRAK3 emerging as pivotal. Notably, LCN2 showcased significantly heightened expression in active UC patients compared to both non-active patients and healthy controls (P < 0.05). Investigations into the correlation between these genes and immune cell infiltration in UC highlighted activated dendritic cells, with statistically significant positive correlations noted for LCN2 and IRAK3 across multiple datasets. Through comprehensive gene expression analysis and machine learning, a potent LASSO-based diagnostic model for UC was developed. Genes such as LCN2, ASS1, and IRAK3 hold potential as both diagnostic markers and therapeutic targets, offering a promising direction for future UC research and clinical application.
Wang J
,Li L
,Chen P
,He C
,Niu X
... -
《Scientific Reports》
-
Precision therapy for ulcerative colitis: insights from mitochondrial dysfunction interacting with the immune microenvironment.
Accumulating evidence reveals mitochondrial dysfunction exacerbates intestinal barrier dysfunction and inflammation. Despite the growing knowledge of mitochondrial dysfunction and ulcerative colitis (UC), the mechanism of mitochondrial dysfunction in UC remains to be fully explored.
We integrated 1137 UC colon mucosal samples from 12 multicenter cohorts worldwide to create a normalized compendium. Differentially expressed mitochondria-related genes (DE-MiRGs) in individuals with UC were identified using the "Limma" R package. Unsupervised consensus clustering was utilized to determine the intrinsic subtypes of UC driven by DE-MiRGs. Weighted gene co-expression network analysis was employed to investigate module genes related to UC. Four machine learning algorithms were utilized for screening DE-MiRGs in UC and construct MiRGs diagnostic models. The models were developed utilizing the over-sampled training cohort, followed by validation in both the internal test cohort and the external validation cohort. Immune cell infiltration was assessed using the Xcell and CIBERSORT algorithms, while potential biological mechanisms were explored through GSVA and GSEA algorithms. Hub genes were selected using the PPI network.
The study identified 108 DE-MiRGs in the colonic mucosa of patients with UC compared to healthy controls, showing significant enrichment in pathways associated with mitochondrial metabolism and inflammation. The MiRGs diagnostic models for UC were constructed based on 17 signature genes identified through various machine learning algorithms, demonstrated excellent predictive capabilities. Utilizing the identified DE-MiRGs from the normalized compendium, 941 patients with UC were stratified into three subtypes characterized by distinct cellular and molecular profiles. Specifically, the metabolic subtype demonstrated enrichment in epithelial cells, the immune-inflamed subtype displayed high enrichment in antigen-presenting cells and pathways related to pro-inflammatory activation, and the transitional subtype exhibited moderate activation across all signaling pathways. Importantly, the immune-inflamed subtype exhibited a stronger correlation with superior response to four biologics: infliximab, ustekinumab, vedolizumab, and golimumab compared to the metabolic subtype.
This analysis unveils the interplay between mitochondrial dysfunction and the immune microenvironment in UC, thereby offering novel perspectives on the potential pathogenesis of UC and precision treatment of UC patients, and identifying new therapeutic targets.
Zhang YF
,Fan MY
,Bai QR
,Zhao R
,Song S
,Wu L
,Lu JH
,Liu JW
,Wang Q
,Li Y
,Chen X
... -
《Frontiers in Immunology》
-
Evaluating the significance of ECSCR in the diagnosis of ulcerative colitis and drug efficacy assessment.
The main challenge in diagnosing and treating ulcerative colitis (UC) has prompted this study to discover useful biomarkers and understand the underlying molecular mechanisms.
In this study, transcriptomic data from intestinal mucosal biopsies underwent Robust Rank Aggregation (RRA) analysis to identify differential genes. These genes intersected with UC key genes from Weighted Gene Co-expression Network Analysis (WGCNA). Machine learning identified UC signature genes, aiding predictive model development. Validation involved external data for diagnostic, progression, and drug efficacy assessment, along with ELISA testing of clinical serum samples.
RRA integrative analysis identified 251 up-regulated and 211 down-regulated DEGs intersecting with key UC genes in WGCNA, yielding 212 key DEGs. Subsequently, five UC signature biomarkers were identified by machine learning based on the key DEGs-THY1, SLC6A14, ECSCR, FAP, and GPR109B. A logistic regression model incorporating these five genes was constructed. The AUC values for the model set and internal validation data were 0.995 and 0.959, respectively. Mechanistically, activation of the IL-17 signaling pathway, TNF signaling pathway, PI3K-Akt signaling pathway in UC was indicated by KEGG and GSVA analyses, which were positively correlated with the signature biomarkers. Additionally, the expression of the signature biomarkers was strongly correlated with various UC types and drug efficacy in different datasets. Notably, ECSCR was found to be upregulated in UC serum and exhibited a positive correlation with neutrophil levels in UC patients.
THY1, SLC6A14, ECSCR, FAP, and GPR109B can serve as potential biomarkers of UC and are closely related to signaling pathways associated with UC progression. The discovery of these markers provides valuable information for understanding the molecular mechanisms of UC.
Feng B
,Zhang Y
,Qiao L
,Tang Q
,Zhang Z
,Zhang S
,Qiu J
,Zhou X
,Huang C
,Liang Y
... -
《Frontiers in Immunology》
-
Identification of subclusters and prognostic genes based on GLS-associated molecular signature in ulcerative colitis.
Ulcerative colitis (UC) is a chronic and recurrent inflammatory disease that affects the colon and rectum. The response to treatment varies among individuals with UC. Therefore, the aim of this study was to identify and explore potential biomarkers for different subtypes of UC and examine their association with immune cell infiltration. We obtained UC RNA sequencing data from the GEO database, which included the training set GSE92415 and the validation set GSE87473 and GSE72514. UC patients were classified based on GLS and its associated genes using consensus clustering analysis. We identified differentially expressed genes (DEGs) in different UC subtypes through a differential expression analysis of the training cohort. Machine learning algorithms, including Weighted Gene Co-Expression Network Analysis (WGCNA), Least Absolute Shrinkage and Selection Operator (LASSO), and Support Vector Machine Recursive Feature Elimination (SVM-RFE), were utilized to identify marker genes for UC. The CIBERSORT algorithm was used to determine the abundance of various immune cells in UC and their correlation with UC signature genes. Finally, we validated the expression of GLS through in vivo and ex vivo experiments. The expression of GLS was found to be elevated in patients with UC compared to normal patients. GLS and its related genes were able to classify UC patients into two subtypes, C1 and C2. The C1 subtype, as compared to the C2 subtype, showed a higher Mayo score and poorer treatment response. A total of 18 DEGs were identified in both subtypes, including 7 up-regulated and 11 down-regulated genes. Four UC signature genes (CWH43, HEPACAM2, IL24, and PCK1) were identified and their diagnostic value was validated in a separate cohort (AUC > 0.85). Furthermore, we found that UC signature biomarkers were linked to the immune cell infiltration. CWH43, HEPACAM2, IL24, and PCK1 may serve as potential biomarkers for diagnosing different subtypes of UC, which could contribute to the development of targeted molecular therapy and immunotherapy for UC.
Xie Y
,Li J
,Tao Q
,Wu Y
,Liu Z
,Chen Y
,Zeng C
... -
《Scientific Reports》
-
Identification of Diagnosis and Typological Characteristics Associated with Ferroptosis for Ulcerative Colitis via Bioinformatics and Machine Learning.
To investigate and validate ferroptosis genes (FRGs) in ulcerative colitis (UC) for diagnostic, subtype, and biological agent reactivity, with the goal of providing a foundation for the identification of novel therapeutic targets and the rational use of infliximab in clinical practice.
UC datasets and FRGs were selected from the Gene Expression Omnibus (GEO) and FerrDb databases. WGCNA was used to identify characteristic genes of UC. LASSO and SVM models were used to discover key FRGs in UC. A nomogram was constructed for diagnosing UC using logistic regression (LR), We performed internal and external validation for the model. Furthermore, we constructed a hub-gene-signature prediction model for the effectiveness of infliximab in treating UC and deployed it on the website. Finally, the hub gene-drug interaction networks were constructed.
Nineteen ferroptosis-related genes associated with UC were identified through bioinformatics analysis. FTH1 and GPX4 were two of the down-regulated genes.The seventeen upregulated genes consisted of DUOX1, DUOX2, SOCS1, LPIN1, QSOX1, TRIM21, IDO1, SLC7A11, MUC1, HSPA5, SCD, ACSL3, NOS2, PARP9, PARP14, LCN2, and TRIB2. Five hub genes, including LCN2, QSOX1, MUC1, IDO1, and TRIB2, were acquried via machine learning. The mean auc of internal validation was 0.964 and 0.965 respectively, after using cross-validation and bootstrap in the training set based on the 5 hub-gene diagnostic models. In the external validation set, the AUC reached 0.976 and 0.858. RF model performs best in predicting infliximab effectiveness. In addition, we identified two ferroptosis subtypes. Cluster A mostly overlaps with the high-risk score group, with a hyperinflammatory phenotype.
This research indicated that five hub genes related to ferroptosis might be potential markers in diagnosing and predicting infliximab sensitivity for UC.
Wang W
,Song X
,Ding S
,Ma H
... -
《-》