-
Identification of prognostic gene signature associated with microenvironment of lung adenocarcinoma.
Lung cancer has the highest morbidity and mortality worldwide, and lung adenocarcinoma (LADC) is the most common pathological subtype. Accumulating evidence suggests the tumor microenvironment (TME) is correlated with the tumor progress and the patient's outcome. As the major components of TME, the tumor-infiltrated immune cells and stromal cells have attracted more and more attention. In this study, differentially expressed immune and stromal signature genes were used to construct a TME-related prognostic model for predicting the outcomes of LADC patients.
The expression profiles of LADC samples with clinical information were obtained from The Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO). The differentially expressed genes (DEGs) related to the TME of LADC were identified using TCGA dataset by Wilcoxon rank sum test. The prognostic effects of TME-related DEGs were analyzed using univariate Cox regression. Then, the least absolute shrinkage and selection operator (LASSO) regression was performed to reduce the overfit and the number of genes for further analysis. Next, the prognostic model was constructed by step multivariate Cox regression and risk score of each sample was calculated. Then, survival and Receiver Operating Characteristic (ROC) analyses were conducted to validate the model using TCGA and GEO datasets, respectively. The Kyoto Encyclopedia of Genes and Genomes analysis of gene signature was performed using Gene Set Enrichment Analysis (GSEA). Finally, the overall immune status, tumor purity and the expression profiles of HLA genes of high- and low-risk samples was further analyzed to reveal the potential mechanisms of prognostic effects of the model.
A total of 93 TME-related DEGs were identified, of which 23 DEGs were up-regulated and 70 DEGs were down-regulated. The univariate cox analysis indicated that 23 DEGs has the prognostic effects, the hazard ratio ranged from 0.65 to 1.25 (p < 0.05). Then, seven genes were screened out from the 23 DEGs by LASSO regression method and were further analyzed by step multivariate Cox regression. Finally, a three-gene (ADAM12, Bruton Tyrosine Kinase (BTK), ERG) signature was constructed, and ADAM12, BTK can be used as independent prognostic factors. The three-gene signature well stratified the LADC patients in both training (TCGA) and testing (GEO) datasets as high-risk and low-risk groups, the 3-year area under curve (AUC) of ROC curves of three GEO sets were 0.718 (GSE3141), 0.646 (GSE30219) and 0.643 (GSE50081). The GSEA analysis indicated that highly expressed ADAM12, BTK, ERG mainly correlated with the activation of pathways involving in focal adhesion, immune regulation. The immune analysis indicated that the low-risk group has more immune activities and higher expression of HLA genes than that of the high-risk group. In sum, we identified and constructed a three TME-related DEGs signature, which could be used to predict the prognosis of LADC patients.
Yue C
,Ma H
,Zhou Y
《PeerJ》
-
Identification of cuproptosis-related subtypes, construction of a prognosis model, and tumor microenvironment landscape in gastric cancer.
Cuproptosis is a novel identified regulated cell death (RCD), which is correlated with the development, treatment response and prognosis of cancer. However, the potential role of cuproptosis-related genes (CRGs) in the tumor microenvironment (TME) of gastric cancer (GC) remains unknown.
Transcriptome profiling, somatic mutation, somatic copy number alteration and clinical data of GC samples were downloaded from the Cancer Genome Atlas (TCGA) and the Gene Expression Omnibus (GEO) database to describe the alterations of CRGs from genetic and transcriptional fields. Differential, survival and univariate cox regression analyses of CRGs were carried out to investigate the role of CRGs in GC. Cuproptosis molecular subtypes were identified by using consensus unsupervised clustering analysis based on the expression profiles of CRGs, and further analyzed by GO and KEGG gene set variation analyses (GSVA). Genes in distinct molecular subtypes were also analyzed by GO and KEGG gene enrichment analyses (GSEA). Differentially expressed genes (DEGs) were screened out from distinct molecular subtypes and further analyzed by GO enrichment analysis and univariate cox regression analysis. Consensus clustering analysis of prognostic DEGs was performed to identify genomic subtypes. Next, patients were randomly categorized into the training and testing group at a ratio of 1:1. CRG Risk scoring system was constructed through logistic least absolute shrinkage and selection operator (LASSO) cox regression analysis, univariate and multivariate cox analyses in the training group and validated in the testing and combined groups. Real-time quantitative polymerase chain reaction (RT-qPCR) was used to evaluate the expression of key Risk scoring genes. Sensitivity and specificity of Risk scoring system were examined by using receiver operating characteristic (ROC) curves. pRRophetic package in R was used to investigate the therapeutic effects of drugs in high- and low- risk score group. Finally, the nomogram scoring system was developed to predict patients' survival through incorporating the clinicopathological features and CRG Risk score.
Most CRGs were up-regulated in tumor tissues and showed a relatively high mutation frequency. Survival and univariate cox regression analysis revealed that LIAS and FDX1 were significantly associated with GC patients' survival. After consensus unsupervised clustering analysis, GC patients were classified into two cuproptosis molecular subtypes, which were significantly associated with clinical features (gender, age, grade and TNM stage), prognosis, metabolic related pathways and immune cell infiltration in TME of GC. GO enrichment analyses of 84 DEGs, obtained from distinct molecular subtypes, revealed that DEGs primarily enriched in the regulation of metabolism and intracellular/extracellular structure in GC. Univariate cox regression analysis of 84 DEGs further screened out 32 prognostic DEGs. According to the expression profiles of 32 prognostic DEGs, patients were re-classified into two gene subtypes, which were significantly associated with patients' age, grade, T and N stage, and survival of patients. Nest, the Risk score system was constructed with moderate sensitivity and specificity. A high CRG Risk score, characterized by decreased microsatellite instability-high (MSI-H), tumor mutation burden (TMB) and cancer stem cell (CSC) index, and high stromal and immune score in TME, indicated poor survival. Four of five key Risk scoring genes expression were dysregulated in tumor compared with normal samples. Moreover, CRG Risk score was greatly related with sensitivity of multiple drugs. Finally, we established a highly accurate nomogram for promoting the clinical applicability of the CRG Risk scoring system.
Our comprehensive analysis of CRGs in GC demonstrated their potential roles in TME, clinicopathological features, and prognosis. These findings may improve our understanding of CRGs in GC and provide new perceptions for doctors to predict prognosis and develop more effective and personalized therapy strategies.
Wang J
,Qin D
,Tao Z
,Wang B
,Xie Y
,Wang Y
,Li B
,Cao J
,Qiao X
,Zhong S
,Hu X
... -
《Frontiers in Immunology》
-
Tumor microenvironment related novel signature predict lung adenocarcinoma survival.
Lung adenocarcinoma (LUAD) is the most common histological type of lung cancers, which is the primary cause of cancer-related mortality worldwide. Growing evidence has suggested that tumor microenvironment (TME) plays a pivotal role in tumorigenesis and progression. Hence, we investigate the correlation of TME related genes with LUAD prognosis.
The information of LUAD gene expression data was obtained from The Cancer Genome Atlas (TCGA). According to their immune/stromal scores calculated by the ESTIMATE algorithm, differentially expressed genes (DEGs) were identified. Then, we performed univariate Cox regression analysis on DEGs to obtain genes that are apparently bound up with LUAD survival (SurGenes). Functional annotation and protein-protein interaction (PPI) was also conducted on SurGenes. By validating the SurGenes with data sets of lung cancer from the Gene Expression Omnibus (GEO), 106 TME related SurGenes were generated. Further, intersection analysis was executed between the 106 TME related SurGenes and hub genes from PPI network, PTPRC and CD19 were obtained. Gene Set Enrichment Analysis and CIBERSORT analysis were performed on PTPRC and CD19. Based on the TCGA LUAD dataset, we conducted factor analysis and Step-wise multivariate Cox regression analysis for 106 TME related SurGenes to construct the prognostic model for LUAD survival prediction. The LUAD dataset in GEO (GSE68465) was used as the testing dataset to confirm the prognostic model. Multivariate Cox regression analysis was used between risk score from the prognostic model and clinical parameters.
A total of 106 TME related genes were collected in our research totally, which were markedly correlated with the overall survival (OS) of LUAD patient. Bioinformatics analysis suggest them mainly concentrated on immune response, cell adhesion, and extracellular matrix. More importantly, among 106 TME related SurGenes, PTPRC and CD19 were highly interconnected nodes among PPI network and correlated with immune activity, exhibiting significant prognostic potential. The prognostic model was a weighted linear combination of the 106 genes, by which the low-OS LUAD samples could be separated from the high-OS samples with success. This model was also able to rebustly predict the situation of survival (training set: p-value < 0.0001, area under the curve (AUC) = 0.649; testing set: p-value = 0.0009, AUC = 0.617). By combining with clinical parameters, the prognostic model was optimized. The AUC achieved 0.716 for 3 year and 0.699 for 5 year.
A series of TME-related prognostic genes were acquired in this research, which could reflect immune disorders within TME, and PTPRC and CD19 show the potential to be an indicator for LUAD prognosis and tumor microenvironment modulation. The prognostic model constructed base on those prognostic genes presented a high predictive ability, and may have clinical implications in the overall survival prediction of LUAD.
Chen J
,Zhou R
《PeerJ》
-
Construction of mRNA prognosis signature associated with differentially expressed genes in early stage of stomach adenocarcinomas based on TCGA and GEO datasets.
Stomach adenocarcinomas (STAD) are the most common malignancy of the human digestive system and represent the fourth leading cause of cancer-related deaths. As early-stage STAD are generally mild or asymptomatic, patients with advanced STAD have short overall survival. Early diagnosis of STAD has a considerable influence on clinical outcomes.
The mRNA expression data and clinical indicators of STAD and normal tissues were obtained from The Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO) database. The gene expression differences were analyzed by R packages, and gene function enrichment analysis was performed. Kaplan-Meier method and univariate Cox proportional risk regression analysis were used to screen differential expressed genes (DEGs) related to survival of STAD patients. Multivariate Cox proportional risk regression analysis was used to further screen and determine the prognostic DEGs in STAD patients, and to construct a multigene prognostic prediction signature. The accuracy of predictive signature was tested by receiver operating characteristic (ROC) curve software package, and the nomogram of patients with STAD was drawn. Cox regression was used to investigate the correlation between multigene prognostic signature and clinical factors. The predictive performance of this model was compared with two other models proposed in previous studies using KM survival analysis, ROC curve analysis, Harrell consistency index and decision curve analysis (DCA). qRT-PCR and Western blot were used to verify the expression levels of prognostic genes. The pathways and functions of possible involvement of features were predicted using the GSEA method.
A total of 569 early-stage specific DEGs were retrieved from TCGA-STAD dataset, including 229 up-regulated genes and 340 down-regulated genes. Enrichment analysis showed that the early-stage specific DEGs were associated with cytokine-cytokine receptor interaction, neuroactive ligand-receptor interaction, and calcium signaling pathway. Multiple Cox regression algorithm was used to identify 10 early-stage specific DEGs associated with overall survival (P < 0.01) of STAD patients, and a multi-mRNA prognosis signature was established. The patients were divided into high-risk group and low-risk group according to the risk score. In the training set, the prognostic signature was positively correlated with tumor size and stage (P < 0.05), survival curve (P < 0.001) and time-dependent ROC (AUC = 0.625). In the training dataset and test dataset, the both signatures had good predictive efficiencies. Cox regression and DCA analysis revealed that the prognostic signature was an independent factor and had a better predict effect than the conventional TNM stage classification method and the earlier published biomarkers on the prognosis of STAD patients.
In this study, based on the early-stage specifically expressed genes, the prognostic signature constructed through TCGA and GEO datasets may become an indicator for clinical prognosis assessment of STAD and a new strategy for targeted therapy in the future.
Jiang F
,Lin H
,Yan H
,Sun X
,Yang J
,Dong M
... -
《-》
-
Construction of a novel mRNA-signature prediction model for prognosis of bladder cancer based on a statistical analysis.
Bladder cancer (BC) is a common malignancy neoplasm diagnosed in advanced stages in most cases. It is crucial to screen ideal biomarkers and construct a more accurate prognostic model than conventional clinical parameters. The aim of this research was to develop and validate an mRNA-based signature for predicting the prognosis of patients with bladder cancer.
The RNA-seq data was downloaded from the Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO). Differentially expressed genes (DEGs) were screened in three datasets, and prognostic genes were identified from the training set of TCGA dataset. The common genes between DEGs and prognostic genes were narrowed down to six genes via Least Absolute Shrinkage and Selection Operator (LASSO) regression, and stepwise multivariate Cox regression. Then the gene-based risk score was calculated via Cox coefficient. Time-dependent receiver operating characteristic (ROC) and Kaplan-Meier (KM) survival analysis were used to assess the prognostic power of risk score. Multivariate Cox regression analysis was applied to construct a nomogram. Decision curve analysis (DCA), calibration curves, and time-dependent ROC were performed to assess the nomogram. Finally, functional enrichment of candidate genes was conducted to explore the potential biological pathways of candidate genes.
SORBS2, GPC2, SETBP1, FGF11, APOL1, and H1-2 were screened to be correlated with the prognosis of BC patients. A nomogram was constructed based on the risk score, pathological stage, and age. Then, the calibration plots for the 1-, 3-, 5-year OS were predicted well in entire TCGA-BLCA patients. Decision curve analysis (DCA) indicated that the clinical value of the nomogram was higher than the stage model and TNM model in predicting overall survival analysis. The time-dependent ROC curves indicated that the nomogram had higher predictive accuracy than the stage model and risk score model. The AUC of nomogram time-dependent ROC was 0.763, 0.805, and 0.806 for 1-year, 3-year, and 5-year, respectively. Functional enrichment analysis of candidate genes suggested several pathways and mechanisms related to cancer.
In this research, we developed an mRNA-based signature that incorporated clinical prognostic parameters to predict BC patient prognosis well, which may provide a novel prognosis assessment tool for clinical practice and explore several potential novel biomarkers related to the prognosis of patients with BC.
Li J
,Cao J
,Li P
,Yao Z
,Deng R
,Ying L
,Tian J
... -
《BMC CANCER》