Cuproptosis-related gene CEP55 as a biomarker of pancreatic adenocarcinoma via multi-omics techniques and experimental validation
Categoria dell'articolo: Research Article
Pubblicato online: 18 lug 2025
Pagine: 368 - 382
Ricevuto: 21 gen 2025
Accettato: 26 apr 2025
DOI: https://doi.org/10.2478/raon-2025-0042
Parole chiave
© 2025 Riyuan Zhang et al., published by Sciendo
This work is licensed under the Creative Commons Attribution 4.0 International License.
Pancreatic adenocarcinoma (PAAD) is a fatal disease.1–3 Due to the deep location of the pancreas in the body and the lack of noticeable early symptoms, detection and diagnosis of PAAD has been difficult2 and treatment has been limited.4 It is estimated that more than 80% of PAAD patients are already unsuitable for surgery and have distant metastases at the time of diagnosis, for which current treatment strategies often have little effect.5 Recently, immunotherapies, such as PD-1/PDL-1 or CTLA-4 inhibitors, are being extensively studied for their utility in major high mutation-load solid tumors.6 In addition, many studies have been reported on the immune microenvironment (IME) of PAAD and the relationship between its stromal cells and immune cells.7,8 However, there are still some gaps in key targets, inhibitors and improved clinical prediction models of PAAD.7
Dysregulation of apoptosis is an essential hallmark of cancer.9 Therefore, reconstructing the cell death program and inducing cancer cell death have been a promising development in the field of cancer therapy.10 In addition to the mainstream programmed cell death, death receptor-mediated apoptosis, ferroptosis11,12, pyroptosis13, parthana-tos14, as well as the newly revealed cuproptosis15 were also involved. It was shown that as a non-ap-optotic cell death pathway, copper can bind to and aggregate with lipoylated TCA cell cycle proteins, which then trigger proteotoxic stress as well as loss of Fe-S cluster proteins, leading to cell death.16 Given the close link between necroptosis and immuno-oncology17, the role of this novel cuproptosis model is also being explored in different cancers, such as hepatocellular carcinoma18, breast cancer19 and renal clear cell carcinoma20, in the context of the rise of immune checkpoint therapy20, among others. Metabolic recombination is one of the typical characteristics of PAAD, so cuproptosis-related genes (CRGs) may provide new prognostic markers and guide the development of new therapeutic regimens.21
In this study, we used single-cell data to screen for differential genes associated with cuproptosis, and the data of 177 PAAD samples in TCGA were screened by the WGCNA algorithm to identify gene modules associated with cuproptosis. Subsequently, through comprehensive bioinformatics analysis, we constructed a cuproptosis-related prognostic model for PAAD and classified PAAD patients into high-risk and low-risk groups based on cuproptosis-related risk scores. An evaluation of the cancer IME using seven methods and mutational analysis revealed the mutation types in the high and low risk groups. Further analysis revealed that
We used “
The single cell dataset GSE212966 for PAAD was downloaded from the GEO database. The dataset contains a total of 12 samples. The “Seurat” R package was used to analyze the single cell data. We pre-processed these data using the following standards: cells with less than 10% of mitochondrial genes, cells with the total number of genes >200 and genes with expression range of 200-7000 and being expressed in at least three cells were retained. The data normalization was performed using the
All CRGs genes were obtained from the study of Tsvetkov P
Weighted Co-Expression Network Analysis is a systematic biological method for characterizing patterns of correlation between genes in microarray samples.22 This method can be used to find clusters (modules) of highly related genes. In this study, we used the WGCNA method to find gene modules in PAAD that were highly correlated with cuproptosis to obtain CRGs.
A prognostic model containing 8 genes was constructed using Cox-lasso’s algorithm.23 Univariate Cox analysis was then conducted to screen for important key genes. Using the “glmnet” tool in the R package, LASSO Cox regression analysis is undertaken to perform further screening and construct prognostic models. After this, the CRGs scores were calculated using the formula, and patients in the TCGA PAAD cohort were divided into high-risk and low-risk groups based on the median, to explore the differences in prognosis between the two groups. Finally, we evaluated the accuracy of the model by receiver operating characteristic (ROC) analysis and principal components analysis (PCA).
Immunotherapy sensitivity scores for the TCGA-PAAD (The Cancer Genome Atlas Pancreatic Adenocarcinoma) cohort were obtained from The Cancer Immunome Atlas (TCIA;
GSE85916 in GEO was used as an external validation cohort. In this validation cohort, the CRGs score was calculated for each sample according to the formula of the model, and patients were divided into a high-risk group and a low-risk group based on the median of the scores. Survival analysis was performed to judge whether there was a difference in prognosis between these two subgroups. Next, we evaluated the stability of the model using ROC curves. PCA was used to explore whether the model could better group patients with PAAD.
We evaluated the IME of PAAD in high and low risk groups using seven algorithms including CIBERSORT, EPIC, Estimate, MCP_counter, Quanti-seq, TIMER, xCell and showed their results in the form of heat maps. We then performed intergroup mutation analysis and further analysis based on the results.
The prognostic nomogram was developed by integrating CRGs-related risk scores, with continuous variables normalized and categorical variables incorporated using appropriate reference levels. Model coefficients were transformed into a 0-100 point scoring system using the R package ‘rms’ to generate a clinically applicable visual predictive tool. The nomogram’s discriminative performance was rigorously assessed using ROC curve analysis at 1-, 3-, and 5-year follow-up intervals, with the area under the curve (AUC) and 95% confidence intervals calculated through 1000 bootstrap iterations to evaluate predictive accuracy for survival outcomes. To further validate the model’s clinical utility, decision curve analysis (DCA) was performed across a comprehensive range of threshold probabilities (0-100%), systematically comparing the net benefit of the nomogram against default “treat-all” and “treat-none” strategies while accounting for the clinical consequences of falsepositive and false-negative predictions.
In this study, we conducted a meticulous examination of the expression profiles of the 8 model genes at the single-cell level. This involved a comprehensive exploration of their individual expression patterns across diverse cellular contexts. Subsequently, to glean insights into their clinical relevance, we performed a rigorous single-gene prognostic analysis. This investigative approach allowed us to unravel the nuanced intricacies of each gene’s expression within individual cells and, importantly, assess their potential impact on patient prognosis.
The expression profile data from the TCGA-PAAD cohort was used to analyze the expressed levels of 79 common immune checkpoint-related genes between different CRGs-related risk scores, including
HPDE6-C7 and PAAD cell lines (ASPC1 and BXPC3) were purchased from ATCC (Manassas, USA). DMEM basal medium supplemented with 10% fetal bovine serum (Gbico), 100 μg/ml penicillin and 100 mg/ml streptomycin were used for culture, respectively. All cells were incubated in an incubator with a constant temperature of 5% CO2 at 37°C. Experiments were performed in six independent biological replicates (cultures derived from separate passages). RNA Extraction Kit (beyotime) was used to extract the whole RNA from pancreatic epithelial cells (HPDE6-C7) and pancreatic cancer cells (ASPC1 and BXPC3), and then reverse transcribed into cDNA. SYBR green method was used for RT-qPCR. GAPDH can be used as a reference for comparing the mRNA expression levels of corresponding genes.
The sequence of RNA primers is shown below:
GAPDH: Forward: 5’-GGAGCGAGATCCCTCCAAAAT-3’, Reverse: 5’-GGCTGTTGTCATACTTCTCATGG-3’; CEP55: Forward: 5’-AGTAAGTGGGGATCGAAGCCT-3’, Reverse: 5’-CTCAAGGACTCGAATTTTCTCCA-3’; KIF23: Forward: 5’-CCATAAAACCCAAACCTCCACA-3’, Reverse: 5’-CTATGGGAACGGCTGGACTC-3’; ARNTL2: Forward: 5’-ACTTGGTGCTGGTAGTATTGGA-3’, Reverse: 5’-TGTTGGACTCGAATCATCAAGG-3’; FAM111B: Forward: 5’-GCTAGCATGAATAGCATGAAGACA-3’, Reverse: 5’-GGATCCGCACTCCATAGG-3’; MRPL3: Forward: 5’-TGCTGCAATTAAACCAGGCAC-3’, Reverse: 5’-CGTTTGACCATGCGTAGCAG-3’; DHX30: Forward: 5’-CCAGCCTCGTGATGAGGAAT-3’, Reverse: 5’-GCTGGGCCCGATCTTTTCT-3’; MET: Forward: 5’-TGGGCACCGAAAGATAAACCT-3’, Reverse: 5’-CACTCCCCATTGCTCCTCTG-3’; KNSTRN: Forward: 5’-AGGGCCTTGATCCAGCTTTA-3’, Reverse: 5’-TACCTTTAAGGCCTGTAACTCC-3’;
siRNAs targeting
In this study, we evaluated cell proliferation and viability using the ASPC1 cell line with the CCK-8 (Cell Counting Kit-8) assay. Initially, ASPC1 cells were seeded in a 96-well plate at a density of 1×10^4 cells/well and incubated at 37°C with 5% CO2 for 24 hours to allow for attachment. Following this, varying concentrations of the test compounds were added, and the cells were incubated for an additional 24, 48, and 72 hours. Each condition was tested with six independent biological replicates. At the end of each treatment period, 10 μL of CCK-8 reagent was added to each well, and the plate was further incubated for 1 to 4 hours to enable the viable cells to reduce the reagent to a soluble orange formazan product. The optical density (OD) values were measured at a wavelength of 450 nm.
Transwell assay was performed to evaluate the migratory and invasive properties of ASPC1 cells under distinct experimental conditions:
In this study, the R software (Institute of Statistics and Mathematics, Vienna, Austria; version 4.1.2) was applied to all statistical analysis procedures. Quantitative data are expressed as mean ± SEM (standard error of the mean). Normality and homogeneity of variance were assessed using Shapiro-Wilk and Levene’s tests, respectively. For comparisons between two groups, we applied: (1) Student’s t-test for normally distributed data with equal variances; (2) Welch’s t-test for normally distributed data with unequal variances; or (3) the Wilcoxon rank-sum test (Mann-Whitney U test) for non-normally distributed data. For multiple group comparisons, we employed: (1) one-way ANOVA (with Tukey’s post hoc test) for normally distributed data with equal variances; (2) Welch’s ANOVA (with Games-Howell post hoc test) for normally distributed data with unequal variances; or (3) the Kruskal-Wallis test (with Dunn’s post hoc test) for non-normally distributed data. Each group contains 6 samples. It was statistically significant only when two-sided p value < 0.05.
Our workflow diagram is shown in Supplementary Figure 1.

Single cell sequencing analysis of GSE212966.
In the initial step, we integrated and scrutinized the PAAD single-cell sequencing dataset obtained from GEO. The quality control process of the single-cell analysis is depicted in Supplementary Figure 2. Illustrated in Supplementary Figure 2, these samples exhibited seamless integration without any discernible interval effects, paving the way for subsequent analysis. Utilizing the k-Nearest Neighbor (KNN) clustering algorithm, we categorized all cells into 14 clusters (Figure 1A). Additionally, seven distinct cell types were identified as dendritic cells (DC), monocyte, macrophage, endothelial cells, smooth muscle cells, natural killer (NK) cells, and epithelial cells (Figure 1B). Figure 1C illustrates the distribution of 10 CRGs genes within each cluster.

Construction and validation of cuproptosis-related prognostic model.
Cells were stratified into those exhibiting low and high expression related to cuproptosis, based on the median percentage of CRGs, as illustrated in the tSNE plot (Figure 1D). The high CRGs and low CRGs groups were subjected to differentially expressed analysis, leading to the identification of a specific gene.
In the TCGA cohort, cuproptosis-related score (Cup_score) was calculated by ssGSEA. Then, Cup_score related module were derived through WGCNA analysis of the transcriptome data from 177 pancreatic adenocarcinoma patients (Figure 1E–H). Employing a soft threshold of 16, a minimum module gene counts of 100, a deep Split value of 2, and merging modules with a similarity threshold below 0.5 resulted in a total of 11 nongrey modules (Figure 1F–G). Notably, MEred and MEcyan modules demonstrated a significant association with cuproptosis scores within the nongrey modules, as illustrated in Figure 1H.
Genes meeting a stringent p-value threshold of < 0.05 were selected from these three modules for subsequent analysis. Furthermore, differential expression analysis, followed by enrichment analysis of PAAD data from TCGA, revealed the pivotal involvement of immune-related processes. Key pathways such as the chemokine signalling pathway, cytokine-cytokine receptor interaction, and B cell receptor signalling pathway were identified as playing crucial roles in pancreatic adenocarcinoma, as depicted in Supplementary Figure 3.

The construction of a nomogram.
By intersecting differentially expressed genes identified through single-cell sequencing data analysis with CRGs obtained from WGCNA, a total of 773 genes were curated. Initial selection of genes associated with patient prognosis was performed through univariate COX analysis, with a significance threshold set at P < 0.05. Subsequently, in LASSO regression analysis, gene contraction exhibited optimal stability with minimal partial likelihood bias when eight genes were included (Figure 2A–B).
The final prognostic model comprised 8 genes (
To assess the stability of CRGs in prognostic evaluation for PAAD patients, ROC curve analysis was performed in both the training and validation cohorts. Shown in Figure 2E are ROC curves for 2-year, 3-year, and 5-year prognoses in the TCGA and GSE85916 cohorts, respectively. In the TCGA cohort, the area under the curve (AUC) values were 0.720, 0.757, and 0.846 at 2, 3, and 5 years, while in the validation cohort, the AUCs were 0.798, 0.824, and 0.795 at 2, 3, and 5 years (Figure 2F). These results affirm that CRGs exhibits high accuracy in predicting patient prognosis in both cohorts.
Finally, PCA analysis of the eight genes in the model, performed in the training and validation sets, respectively, revealed the model’s efficacy in effectively distinguishing PAAD patients into different groups (Figure 2G–H).
In order to accurately calculate the prognosis of PAAD patients, we constructed a Nomogram by cuproptosis-related scores. As the nomogram exhibited in Figure 3A, the mortality rates of patients at 1, 3, and 5 years were estimated to be 0.576, 0.973, and 0.994 according to their age and CRGs score. In essence, the column line plot is a visualization of the regression equation results that can be used to easily calculate the prognosis of PAAD patients and guide subsequent clinical decisions. To further evaluate the accuracy of this line plot, a ROC analysis was performed. The results showed that the area under the curve (AUC) was 0.71, 0.8, and 0.84 at 1, 3, and 5 years, respectively (Figure 3B). In order to evaluate the clinical utility of CRGs scores, we conducted decision curve analysis on this nomogram, and the results showed that they were above the reference line for a large threshold range, which had a good guiding effect (Figure 3C).
Through our comprehensive analysis, we’ve identified significant variations in patient outcomes within the CRGs subgroup. To unravel the etiology and guide immunotherapy, we utilized seven methods to explore differences in the cancer immune microenvironment. Results in Figure 4A depict a higher presence of immune cell infiltrates, predominantly T cells, B cells, and macrophages, in the low CRGs group. Statistical outcomes from 7 immune infiltration algorithms (Supplementary Figure 4) support these findings. Additionally, the expressed levels of immune checkpoint-related genes showed in Supplementary Figure 5A–B. We found that most of the immune checkpoint-related genes were high expressed in low CRGs group, such as

Immune microenvironment and mutation correlation analysis.

Single-cell sequencing analysis to investigate the cellular localization of 8 model genes.
Moving to genomic analysis, we examined the top 20 mutated genes in high and low CRGs groups. Mutation rates in the high and low CRGs group were tested separately. The results showed that
In order to further explore the different emphases of drug therapy for patients in high and low risk groups, 5 drugs associated with the copper ion metabolism or cuproptosis were selected for drug sensitivity analysis (Supplementary Figure 6). The results showed that Elesclomol had significant differences between high and low CRGs expression groups (Supplementary Figure 6A), and the IC50 of high risk group was lower, so patients with high CRG score were more sensitive to Elesclomol.

Cell experiment and screening of low cuproptosis-related genes (CRGs).
To investigate the expression patterns of hub genes across distinct cell types, we conducted a detailed single-gene analysis focusing on the eight identified hub genes (Figure 5A–H). The results, revealed that
Subsequently, a comprehensive prognostic analysis was performed for these eight genes. The findings, depicted in Figure 5I–P, unveiled that, with the exception of
The expression of CRGs in pancreatic epithelial cells (HPDE6-C7) and two pancreatic cancer cells (ASPC-1, BXPC-3) was detected by RT-qPCR. The results showed that except for
To investigate the functional role of

Pancreatic adenocarcinoma (PAAD) is a malignancy with a very poor prognosis, and the 5-year survival rate for this disease is statistically less than 7%.1,24 In the past, to prolong the survival of PAAD patients, we could only use surgical and chemotherapeutic treatments, although their effectiveness was in fact limited.25,26 In recent years, with the development of cancer immunotherapy in full swing27,28, more and more scholars have also started to put their eyes on the relationship between PAAD and immunotherapy.29,30 For example, Fengjiao Li
In this study, through an extensive analysis of PAAD data from the TCGA and GEO databases, we divided PAAD patients into high-risk and low-risk groups based on the subsequently calculated cuproptosis scores. The results showed that both in the TCGA and GEO cohorts, the high-risk group showed a poorer outcome. Since there are no studies to date on the association between CRG and the occurrence of PAAD37, we constructed an 8-gene model related to CRGs score based on differential expression analysis and WGCNA results. In addition, by using ROC curves, we found that the model also showed high accuracy in assessing the prognosis of PAAD patients at 2, 3 and 5 years. The results of the immune microenvironment and mutation correlation analysis showed similarity in mutated genes between high and low risk groups, while there were some differences in the mutation rates of the same genes. We identified
Despite the rapid development of cancer immunotherapy, the application of this approach in PAAD has had little success for a long time39, which is one of the reasons why the prognosis of PAAD is so poor. The TME, also known as the stromal compartment, is composed of cancer-associated fibroblasts (CAFS) with immune cells.40 The prevailing view is that the TME can activate and transform growth factor beta to drive the recruitment of immunosuppressive cells, thereby limiting immune cell infiltration and impairing their function in the tumor41, while the abundant stromal component in the tumor is one of the specific features of PAAD.42,43 Therefore, PAAD was considered to be low immunogenic.44 In recent years, however, new breakthrough points have been made in this thorny historical problem. A new model has been used to convert non-immunogenic PAAD into immunomodulatory immunogenic lesions45, while tertiary lymphoid structures (TLS) in the tumors of some PAAD patients have been identified to contribute to antitumor immunity.46 The role of various immunomarkers in PAAD has been extensively studied.47–49 It is important to understand the TME of PAAD based on new perspectives. In the present study, we found that immune-related processes play a crucial role in PAAD by differential expression analysis and enrichment analysis. Therefore, we utilized seven algorithms to explore differences in the cancer immune microenvironment. The results showed that the infiltration level of immune cells in the high CRGs group was significantly lower than that in the low CRGs group, and the statistically significant immune cells were also consistent with the pathway obtained by enrichment analysis. Thus, the high CRGs group may be more likely to benefit from immunotherapy.
While our multi-omics analyses and in vitro experiments have established
This study constructed a cuproptosis-related prognostic model for PAAD through multi-omics techniques. Moreover, the cancer immune microenvironment and tumor mutational burden of two CRGs groups were assessed. Finally,