-
Identification of immune-associated biomarkers of diabetes nephropathy tubulointerstitial injury based on machine learning: a bioinformatics multi-chip integrated analysis.
Diabetic nephropathy (DN) is a major microvascular complication of diabetes and has become the leading cause of end-stage renal disease worldwide. A considerable number of DN patients have experienced irreversible end-stage renal disease progression due to the inability to diagnose the disease early. Therefore, reliable biomarkers that are helpful for early diagnosis and treatment are identified. The migration of immune cells to the kidney is considered to be a key step in the progression of DN-related vascular injury. Therefore, finding markers in this process may be more helpful for the early diagnosis and progression prediction of DN.
The gene chip data were retrieved from the GEO database using the search term ' diabetic nephropathy '. The ' limma ' software package was used to identify differentially expressed genes (DEGs) between DN and control samples. Gene set enrichment analysis (GSEA) was performed on genes obtained from the molecular characteristic database (MSigDB. The R package 'WGCNA' was used to identify gene modules associated with tubulointerstitial injury in DN, and it was crossed with immune-related DEGs to identify target genes. Gene ontology (GO) enrichment analysis and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis were performed on differentially expressed genes using the 'ClusterProfiler' software package in R. Three methods, least absolute shrinkage and selection operator (LASSO), support vector machine recursive feature elimination (SVM-RFE) and random forest (RF), were used to select immune-related biomarkers for diagnosis. We retrieved the tubulointerstitial dataset from the Nephroseq database to construct an external validation dataset. Unsupervised clustering analysis of the expression levels of immune-related biomarkers was performed using the 'ConsensusClusterPlus 'R software package. The urine of patients who visited Dongzhimen Hospital of Beijing University of Chinese Medicine from September 2021 to March 2023 was collected, and Elisa was used to detect the mRNA expression level of immune-related biomarkers in urine. Pearson correlation analysis was used to detect the effect of immune-related biomarker expression on renal function in DN patients.
Four microarray datasets from the GEO database are included in the analysis : GSE30122, GSE47185, GSE99340 and GSE104954. These datasets included 63 DN patients and 55 healthy controls. A total of 9415 genes were detected in the data set. We found 153 differentially expressed immune-related genes, of which 112 genes were up-regulated, 41 genes were down-regulated, and 119 overlapping genes were identified. GO analysis showed that they were involved in various biological processes including leukocyte-mediated immunity. KEGG analysis showed that these target genes were mainly involved in the formation of phagosomes in Staphylococcus aureus infection. Among these 119 overlapping genes, machine learning results identified AGR2, CCR2, CEBPD, CISH, CX3CR1, DEFB1 and FSTL1 as potential tubulointerstitial immune-related biomarkers. External validation suggested that the above markers showed diagnostic efficacy in distinguishing DN patients from healthy controls. Clinical studies have shown that the expression of AGR2, CX3CR1 and FSTL1 in urine samples of DN patients is negatively correlated with GFR, the expression of CX3CR1 and FSTL1 in urine samples of DN is positively correlated with serum creatinine, while the expression of DEFB1 in urine samples of DN is negatively correlated with serum creatinine. In addition, the expression of CX3CR1 in DN urine samples was positively correlated with proteinuria, while the expression of DEFB1 in DN urine samples was negatively correlated with proteinuria. Finally, according to the level of proteinuria, DN patients were divided into nephrotic proteinuria group (n = 24) and subrenal proteinuria group. There were significant differences in urinary AGR2, CCR2 and DEFB1 between the two groups by unpaired t test (P < 0.05).
Our study provides new insights into the role of immune-related biomarkers in DN tubulointerstitial injury and provides potential targets for early diagnosis and treatment of DN patients. Seven different genes ( AGR2, CCR2, CEBPD, CISH, CX3CR1, DEFB1, FSTL1 ), as promising sensitive biomarkers, may affect the progression of DN by regulating immune inflammatory response. However, further comprehensive studies are needed to fully understand their exact molecular mechanisms and functional pathways in DN.
Wang L
,Su J
,Liu Z
,Ding S
,Li Y
,Hou B
,Hu Y
,Dong Z
,Tang J
,Liu H
,Liu W
... -
《BioData Mining》
-
Identifying C1QB, ITGAM, and ITGB2 as potential diagnostic candidate genes for diabetic nephropathy using bioinformatics analysis.
Diabetic nephropathy (DN), the most intractable complication in diabetes patients, can lead to proteinuria and progressive reduction of glomerular filtration rate (GFR), which seriously affects the quality of life of patients and is associated with high mortality. However, the lack of accurate key candidate genes makes diagnosis of DN very difficult. This study aimed to identify new potential candidate genes for DN using bioinformatics, and elucidated the mechanism of DN at the cellular transcriptional level.
The microarray dataset GSE30529 was downloaded from the Gene Expression Omnibus Database (GEO), and the differentially expressed genes (DEGs) were screened by R software. We used Gene Ontology (GO), gene set enrichment analysis (GSEA), and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis to identify the signal pathways and genes. Protein-protein interaction (PPI) networks were constructed using the STRING database. The GSE30122 dataset was selected as the validation set. Receiver operating characteristic (ROC) curves were applied to evaluate the predictive value of genes. An area under curve (AUC) greater than 0.85 was considered to be of high diagnostic value. Several online databases were used to predict miRNAs and transcription factors (TFs) capable of binding hub genes. Cytoscape was used for constructing a miRNA-mRNA-TF network. The online database 'nephroseq' predicted the correlation between genes and kidney function. The serum level of creatinine, BUN, and albumin, and the urinary protein/creatinine ratio of the DN rat model were detected. The expression of hub genes was further verified through qPCR. Data were analyzed statistically using Student's t-test by the 'ggpubr' package.
A total of 463 DEGs were identified from GSE30529. According to enrichment analysis, DEGs were mainly enriched in the immune response, coagulation cascades, and cytokine signaling pathways. Twenty hub genes with the highest connectivity and several gene cluster modules were ensured using Cytoscape. Five high diagnostic hub genes were selected and verified by GSE30122. The MiRNA-mRNA-TF network suggested a potential RNA regulatory relationship. Hub gene expression was positively correlated with kidney injury. The level of serum creatinine and BUN in the DN group was higher than in the control group (unpaired t test, t = 3.391, df = 4, p = 0.0275, r = 0.861). Meanwhile, the DN group had a higher urinary protein/creatinine ratio (unpaired t test, t = 17.23, df = 16, p < 0.001, r = 0.974). QPCR results showed that the potential candidate genes for DN diagnosis included C1QB, ITGAM, and ITGB2.
We identified C1QB, ITGAM and ITGB2 as potential candidate genes for DN diagnosis and therapy and provided insight into the mechanisms of DN development at transcriptome level. We further completed the construction of miRNA-mRNA-TF network to propose potential RNA regulatory pathways adjusting disease progression in DN.
Hu Y
,Yu Y
,Dong H
,Jiang W
... -
《PeerJ》
-
Identification of a novel immune landscape signature as effective diagnostic markers related to immune cell infiltration in diabetic nephropathy.
The study aimed to identify core biomarkers related to diagnosis and immune microenvironment regulation and explore the immune molecular mechanism of diabetic nephropathy (DN) through bioinformatics analysis.
GSE30529, GSE99325, and GSE104954 were merged with removing batch effects, and different expression genes (DEGs) were screened at a criterion |log2FC| >0.5 and adjusted P <0.05. KEGG, GO, and GSEA analyses were performed. Hub genes were screened by conducting PPI networks and calculating node genes using five algorithms with CytoHubba, followed by LASSO and ROC analysis to accurately identify diagnostic biomarkers. In addition, two different GEO datasets, GSE175759 and GSE47184, and an experiment cohort with 30 controls and 40 DN patients detected by IHC, were used to validate the biomarkers. Moreover, ssGSEA was performed to analyze the immune microenvironment in DN. Wilcoxon test and LASSO regression were used to determine the core immune signatures. The correlation between biomarkers and crucial immune signatures was calculated by Spearman analysis. Finally, cMap was used to explore potential drugs treating renal tubule injury in DN patients.
A total of 509 DEGs, including 338 upregulated and 171 downregulated genes, were screened out. "chemokine signaling pathway" and "cell adhesion molecules" were enriched in both GSEA and KEGG analysis. CCR2, CX3CR1, and SELP, especially for the combination model of the three genes, were identified as core biomarkers with high diagnostic capabilities with striking AUC, sensitivity, and specificity in both merged and validated datasets and IHC validation. Immune infiltration analysis showed a notable infiltration advantage for APC co-stimulation, CD8+ T cells, checkpoint, cytolytic activity, macrophages, MHC class I, and parainflammation in the DN group. In addition, the correlation analysis showed that CCR2, CX3CR1, and SELP were strongly and positively correlated with checkpoint, cytolytic activity, macrophages, MHC class I, and parainflammation in the DN group. Finally, dilazep was screened out as an underlying compound for DN analyzed by CMap.
CCR2, CX3CR1, and SELP are underlying diagnostic biomarkers for DN, especially in their combination. APC co-stimulation, CD8+ T cells, checkpoint, cytolytic activity, macrophages, MHC class I, and parainflammation may participate in the occurrence and development of DN. At last, dilazep may be a promising drug for treating DN.
Zhou H
,Mu L
,Yang Z
,Shi Y
... -
《Frontiers in Immunology》
-
Identification of diagnostic biomarkers of rheumatoid arthritis based on machine learning-assisted comprehensive bioinformatics and its correlation with immune cells.
Rheumatoid arthritis (RA) is a chronic systemic autoimmune disease characterized by inflammatory cell infiltration, which can lead to chronic disability, joint destruction and loss of function. At present, the pathogenesis of RA is still unclear. The purpose of this study is to explore the potential biomarkers and immune molecular mechanisms of rheumatoid arthritis through machine learning-assisted bioinformatics analysis, in order to provide reference for the early diagnosis and treatment of RA disease.
RA gene chips were screened from the public gene GEO database, and batch correction of different groups of RA gene chips was performed using Strawberry Perl. DEGs were obtained using the limma package of R software, and functional enrichment analysis such as gene ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG), disease ontology (DO), and gene set (GSEA) were performed. Three machine learning methods, least absolute shrinkage and selection operator regression (LASSO), support vector machine recursive feature elimination (SVM-RFE) and random forest tree (Random Forest), were used to identify potential biomarkers of RA. The validation group data set was used to verify and further confirm its expression and diagnostic value. In addition, CIBERSORT algorithm was used to evaluate the infiltration of immune cells in RA and control samples, and the correlation between confirmed RA diagnostic biomarkers and immune cells was analyzed.
Through feature screening, 79 key DEGs were obtained, mainly involving virus response, Parkinson's pathway, dermatitis and cell junction components. A total of 29 hub genes were screened by LASSO regression, 34 hub genes were screened by SVM-RFE, and 39 hub genes were screened by Random Forest. Combined with the three algorithms, a total of 12 hub genes were obtained. Through the expression and diagnostic value verification in the validation group data set, 7 genes that can be used as diagnostic biomarkers for RA were preliminarily confirmed. At the same time, the correlation analysis of immune cells found that γδT cells, CD4+ memory activated T cells, activated dendritic cells and other immune cells were positively correlated with multiple RA diagnostic biomarkers, CD4+ naive T cells, regulatory T cells and other immune cells were negatively correlated with multiple RA diagnostic biomarkers.
The results of novel characteristic gene analysis of RA showed that KYNU, EVI2A, CD52, C1QB, BATF, AIM2 and NDC80 had good diagnostic and clinical value for the diagnosis of RA, and were closely related to immune cells. Therefore, these seven DEGs may become new diagnostic markers and immunotherapy markers for RA.
Mu KL
,Ran F
,Peng LQ
,Zhou LL
,Wu YT
,Shao MH
,Chen XG
,Guo CM
,Luo QM
,Wang TJ
,Liu YC
,Liu G
... -
《Heliyon》
-
Renal tubular gen e biomarkers identification based on immune infiltrates in focal segmental glomerulosclerosis.
Bai J
,Pu X
,Zhang Y
,Dai E
... -
《-》