-
Identification of novel mitophagy-related biomarkers for Kawasaki disease by integrated bioinformatics and machine-learning algorithms.
Kawasaki disease (KD) is a systemic vasculitis primarily affecting the coronary arteries in children. Despite growing attention to its symptoms and pathogenesis, the exact mechanisms of KD remain unclear. Mitophagy plays a critical role in inflammation regulation, however, its significance in KD has only been minimally explored. This study sought to identify crucial mitophagy-related biomarkers and their mechanisms in KD, focusing on their association with immune cells in peripheral blood.
This research used four datasets from the Gene Expression Omnibus (GEO) database that were categorized as the merged and validation datasets. Screening for differentially expressed mitophagy-related genes (DE-MRGs) was conducted, followed by Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analyses. A weighted gene co-expression network analysis (WGCNA) identified the hub module, while machine-learning algorithms [random forest-recursive feature elimination (RF-RFE) and support vector machine-recursive feature elimination (SVM-RFE)] pinpointed the hub genes. Receiver operating characteristic (ROC) curves were generated for these genes. Additionally, the CIBERSORT algorithm was used to assess the infiltration of 22 immune cell types to explore their correlations with hub genes. Interactions between transcription factors (TFs), genes, and Gene-microRNAs (miRNAs) of hub genes were mapped using the NetworkAnalyst platform. The expression difference of the hub genes was validated using quantitative reverse transcriptase polymerase chain reaction (qRT-PCR).
Initially, 306 DE-MRGs were identified between the KD patients and healthy controls. The enrichment analysis linked these MRGs to autophagy, mitochondrial function, and inflammation. The WGCNA revealed a hub module of 47 KD-associated DE-MRGs. The machine-learning algorithms identified cytoskeleton-associated protein 4 (CKAP4) and serine-arginine protein kinase 1 (SRPK1) as critical hub genes. In the merged dataset, the area under the curve (AUC) values for CKAP4 and SRPK1 were 0.933 [95% confidence interval (CI): 0.901 to 0.964] and 0.936 (95% CI: 0.906 to 0.966), respectively, indicating high diagnostic potential. The validation dataset results corroborated these findings with AUC values of 0.872 (95% CI: 0.741 to 1.000) for CKAP4 and 0.878 (95% CI: 0.750 to 1.000) for SRPK1. The CIBERSORT analysis connected CKAP4 and SRPK1 with specific immune cells, including activated cluster of differentiation 4 (CD4) memory T cells. TFs such as MAZ, SAP30, PHF8, KDM5B, miRNAs like hsa-mir-7-5p play essential roles in regulating these hub genes. The qRT-PCR results confirmed the differential expression of these genes between the KD patients and healthy controls.
CKAP4 and SRPK1 emerged as promising diagnostic biomarkers for KD. These genes potentially influence the progression of KD through mitophagy regulation.
Wang Y
,Liu Y
,Wang N
,Liu Z
,Qian G
,Li X
,Huang H
,Zhuo W
,Xu L
,Zhang J
,Lv H
,Gao Y
... -
《Translational Pediatrics》
-
Exploration of the shared diagnostic genes and mechanisms between periodontitis and primary Sjögren's syndrome by integrated comprehensive bioinformatics analysis and machine learning.
Accumulating evidence has showed a bidirectional link between periodontitis (PD) and primary Sjögren's syndrome (pSS), but the mechanisms of their occurrence remain unclear. Hence, this study aimed to investigate the shared diagnostic genes and potential mechanisms between PD and pSS using bioinformatics methods.
Gene expression data for PD and pSS were acquired from the Gene Expression Omnibus (GEO) database. Differential expression genes (DEGs) analysis and weighted gene co-expression network analysis (WGCNA) were utilized to search common genes. Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analysis were conducted to explore biological functions. Three machine learning algorithms (least absolute shrinkage and selection operator (LASSO), support vector machine recursive feature elimination (SVM-RFE), and random forest (RF)) were used to further identify shared diagnostic genes, and these genes were assessed via receiver operating characteristic (ROC) curves in discovery and validation datasets. CIBERSORT was employed for immune cell infiltration analysis. Transcription factors (TFs)-genes and miRNAs-genes regulatory networks were conducted by NetworkAnalyst. Finally, relevant drug targets were predicted by DSigDB.
Based on DEGs, 173 overlapping genes were obtained and primarily enriched in immune- and inflammation-related pathways. WGCNA revealed 34 common disease-related genes, which were enriched in similar biological pathways. Intersecting the DEGs with WGCNA results yielded 22 candidate genes. Moreover, three machine learning algorithms identified three shared genes (CSF2RB, CXCR4, and LYN) between PD and pSS, and these genes demonstrated good diagnostic performance (AUC>0.85) in both discovery and validation datasets. The immune cell infiltration analysis showed significant dysregulation in several immune cell populations. Regulatory network analysis highlighted that WRNIP1 and has-mir-155-5p might be pivotal co-regulators of the three shared gene expressions. Finally, the top 10 potential gene-targeted drugs were screened.
CSF2RB, CXCR4, and LYN may serve as potential biomarkers for the concurrent diagnosis of PD and pSS. Additionally, we identified common molecular mechanisms, TFs, miRNAs, and candidate drugs between PD and pSS, which may provide novel insights and targets for future research on the pathogenesis, diagnosis, and therapy of both diseases.
Wang S
,Wang Q
,Zhao K
,Zhang S
,Chen Z
... -
《-》
-
Identification of mitophagy-related biomarkers in human osteoporosis based on a machine learning model.
Su Y
,Yu G
,Li D
,Lu Y
,Ren C
,Xu Y
,Yang Y
,Zhang K
,Ma T
,Li Z
... -
《Frontiers in Physiology》
-
Identification of novel biomarkers and immune infiltration characteristics of ischemic stroke based on comprehensive bioinformatic analysis and machine learning.
Ischemic stroke (IS) is one of most common causes of disability in adults worldwide. However, there is still a lack of effective and reliable diagnostic markers and therapeutic targets in IS. Furthermore, immune cell dysfunction plays an important role in the pathogenesis of IS. Hence, in-depth research on immune-related targets in progressive IS is urgently needed.
Expression profile data from patients with IS were downloaded from the Gene Expression Omnibus (GEO) database. Then, differential expression analysis and weighted gene coexpression network analysis (WGCNA) were performed to identify the significant modules and differentially expressed genes (DEGs). Key genes were obtained and used in functional enrichment analyses by overlapping module genes and DEGs. Next, hub candidate genes were identified by utilizing three machine learning algorithms: least absolute shrinkage and selection operator (LASSO), random forest, and support vector machine-recursive feature elimination (SVM-RFE). Subsequently, a diagnostic model was constructed based on the hub genes, and receiver operating characteristic (ROC) curves were constructed to validate the performances of the predictive models and candidate genes. Finally, the immune cell infiltration landscape of IS was explored with the CIBERSORT deconvolution algorithm.
A total of 40 key DEGs were identified based on the intersection of the DEGs and module genes, and we found that these genes were mainly enriched in the regulation of lipolysis in adipocytes, neutrophil extracellular trap formation and complement and coagulation cascades. Based on the results from three advanced machine learning algorithms, we obtained 7 hub candidate genes (ABCA1, ARG1, C5AR1, CKAP4, HMFN0839, SDCBP and TLN1) as diagnostic biomarkers of IS and developed a reliable nomogram with high predictive performance (AUC = 0.987). In addition, immune cell infiltration dysregulation was implicated in IS, and compared with those in the normal group, IS patients had increased fractions of gamma delta T cells, monocytes, M0 macrophages, M2 macrophages and neutrophils and clearly lower percentages of naive B cells, CD8 T cells, CD4+ memory T cells, follicular helper T cells, regulatory T cells (Tregs) and resting dendritic cells. Furthermore, correlation analysis indicated a significant correlation between the hub genes and immune cells in progressive IS.
In conclusion, our study identified 7 hub genes as diagnostic biomarkers and established a reliable model to predict the occurrence of IS. Meanwhile, we explored the immune cell infiltration pattern and investigated the relationship between candidate genes and immune cells in the pathogenesis of IS. Hence, our study provides new insights into the diagnosis and treatment of IS.
Hu S
,Cai J
,Chen S
,Wang Y
,Ren L
... -
《-》
-
Identification of immune- and autophagy-related genes and effective diagnostic biomarkers in endometriosis: a bioinformatics analysis.
To identify autophagy- and immune-related hub genes affecting the diagnosis and treatment of endometriosis.
Gene expression data were downloaded from the Gene Expression Omnibus (GEO) (GSE11691 and GSE120103 for training, and GSE7305 for validation). By overlapping the differentially expressed genes (DEGs), Weighted gene co-expression network analysis (WGCNA) module genes, and autophagy-related genes (ARGs), and immune-related genes (IRGs) separately, hub genes were identified using the least absolute shrinkage and selection operator (LASSO)and support vector machine recursive feature elimination (SVM-RFE). The hub genes were analyzed by Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analyses. A hub gene-prediction model was constructed and assessed using five-fold cross-validation via five supervised machine-learning algorithms: random forest, the sequential minimal optimization (SMO), K-nearest neighbours (IBK), C4.5 decision tree (J48), and logistics regression. The area under the receiver operating characteristic curve (AUC) was adopted to assess the identification ability of characteristic genes.
1,116 DEGs were obtained from the training cohort, and 22 endometriosis-related IRGs were identified by overlapping the 1,116 DEGs, 3,222 module genes, and 1,793 IRGs. Meanwhile, 45 endometriosis-related ARGs were obtained (1,928 ARGs). Subsequently, nine IRG hub genes (BST2, CCL13, CD86, CSF1, FAM3C, GREM1, ISG20, PSMB8, and S100A11) and nine ARG hub genes (GSK3A, HTR2B, RAB3GAP1, ARFIP2, BNIP3, CSF1, MAOA, PPP1R13L, and SH3GLB2) were obtained by LASSO and SVM-RFE. GO analysis indicated that the ARG hub genes responded to the regulation of autophagy and mitochondrial outer membrane permeabilization, and KEGG enrichment analysis involved serotonergic and dopaminergic synapses. GO analysis also indicated that the IRG hub genes responded to the regulation of leukocyte proliferation and mononuclear cell migration, and KEGG analysis showed enrichment involved in viral protein interaction with cytokines and cytokine receptors. The AUC of the random-forest algorithm of ARGs was 0.975 in the training cohort and 0.940 in the validation cohort, and the AUC of the SMO algorithm of IRGs was 0.907 in the training cohort and 0.8 in the validation cohort.
Seventeen hub genes are closely associated with endometriosis. These genes are potential autophagy- and immune-related biomarkers for diagnosis and treatment of endometriosis.
Ji X
,Huang C
,Mao H
,Zhang Z
,Zhang X
,Yue B
,Li X
,Wu Q
... -
《-》