Blood metabolites as predictors of skin cancer risk: a comprehensive analysis
Kategoria artykułu: Original Study
Data publikacji: 19 sie 2024
Zakres stron: 74 - 85
Otrzymano: 14 lut 2024
Przyjęty: 19 cze 2024
DOI: https://doi.org/10.2478/ahem-2004-0007
Słowa kluczowe
© 2024 Kaymin Wu et al., published by Sciendo
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
Skin cancer (SC), encompassing melanoma and non-melanoma subtypes such as basal cell carcinoma (BCC) and squamous cell carcinoma (SCC) [ 1 2 3], represents the most prevalent malignancy worldwide [1, 4]. The incidence of skin cancer has been increasing globally, with significant variations in occurrence and types across different geographic regions and populations. In 2019, the United States alone reported an estimated 2.8 million cases of BCC and 1.5 million cases of SCC, with 4472 deaths attributed to SCC [4]. Although deaths from BCC are rare, the high incidence rates of both BCC and SCC indicate the considerable burden of keratinocyte carcinomas (KCs). It is important to note that as KCs are not required to be reported to cancer registries, the actual incidence and mortality rates may be higher, suggesting a potential underestimation of the true impact of these cancers [5].
Melanoma, although accounting for a smaller percentage of skin cancer cases, is responsible for a significant proportion of skin cancer-related mortality [6]. This highlights the aggressive nature of melanoma and underscores the importance of early detection and effective treatment strategies. The increasing prevalence and burden of both melanoma and KCs globally necessitate enhanced research, awareness, and public health initiatives aimed at reducing the impact of skin cancer.
Plasma metabolites, encompassing both final products and intermediate substances of metabolic pathways, represent pivotal biomarkers with significant potential for elucidating the underlying pathophysiological mechanisms associated with skin cancer. Metabolomics offers a powerful approach to identifying these biomarkers, potentially enabling early detection and intervention [7, 8]. Emerging studies in this field have begun to unravel the complex metabolic alterations associated with different cancers, shedding light on potential new biomarkers for diagnosis and targets for treatment. MR studies provide a unique opportunity to explore the causal relationships between plasma metabolites and skin cancer, offering insights beyond conventional observational studies. By leveraging genetic variants associated with metabolite levels, MR studies can address confounding factors and reverse causation, thus enhancing the understanding of the etiological roles of these metabolites in skin cancer development [9]. Recent advances in this area have opened new avenues for understanding the genetic underpinnings of metabolic pathways in skin cancer pathogenesis. In recent studies, the bacterium identified as flavonoid decomposer sp90199495 has been linked to the metabolism of X-21849, a compound also implicated in the risk of pancreatic ductal adenocarcinoma (PDAC) [10]. Furthermore, comprehensive metabolic analysis has revealed a significant causal relationship between several metabolites—namely pyruvate, 1,6-anhydroglucose, nonadecanoate, 1-linoleoylglycerophosphoethanolamine, 2-hydroxystearate, and gamma-glutamylthreonine—and colorectal cancer (CRC). Notably, multivariable Mendelian randomization (MVMR) analysis has demonstrated that the genetic predispositions for elevated levels of pyruvate, 1-linoleoylglycerophosphoethanolamine, and gamma-glutamylthreonine have an independent and direct influence on CRC risk, distinct from other metabolites. These findings suggest novel biochemical pathways and potential therapeutic targets in CRC pathogenesis [11].
However, there is a notable gap in the literature regarding comprehensive Mendelian randomization studies that directly link plasma metabolites to skin cancer causation. This highlights the necessity for more rigorous and targeted research in this field to elucidate the potential causal pathways and contribute to the development of effective prevention and treatment strategies. The purpose of this study is to investigate the causal relationships between plasma metabolites and skin cancer risk using Mendelian randomization. By integrating metabolomic profiling with genetic data, we aim to identify specific metabolites that may contribute to skin cancer pathogenesis, thereby providing new insights into potential biomarkers and therapeutic targets for early detection and personalized treatment of this prevalent disease.
We utilized genomic predictors for 1,400 circulating metabolites identified through a GWAS involving 8,299 individuals of European descent, who are part of the Canadian Longitudinal Study of Aging (CLSA) [12]. The CLSA is a comprehensive study tracking over 50,000 Canadians, aged 45 to 85 at the time of enrollment, to collect a wide array of data spanning biological, medical, physiological, social, lifestyle, and economic factors [13]. The current research narrows down to 8,299 non-related Europeans from the CLSA cohort, all of whom have undergone genome-wide genotyping and assessment of their plasma metabolite levels. By concentrating on participants with European backgrounds, the study aims to minimize confounding influences arising from differences in genetic population structures
Genetic variants that meet the following criteria were selected as instrumental variables (IVs) (Figure 1):

Selection workflow
Initially, genetic variants were selected based on association with P < 1×10−5, a common practice in MR to enhance the variance captured, especially when limited SNPs are available for the exposure variable. Then, using a clumping process within the R software environment, we filtered for independent variants, maintaining a linkage disequilibrium threshold of r2 < 0.5 across a 5,000 kb range. Instrumental SNPs were further refined by discarding palindromic SNPs with a middle allele frequency (MAF) between 0.01 and 0.30, which can be problematic due to their ambiguity in A/T or G/C alleles. Additionally, SNPs with an MAF below 0.01 were omitted from the initial GWAS to enhance the reliability of our findings. Subsequently, the explanatory power of the instrumental variables (IVs) for metabolite levels was assessed using the R2 and F-statistic parameters. For robust MR analysis, an F-statistic exceeding 10 is generally recommended.
To determine genetic variants as suitable instrumental variables (IVs), it is imperative that three key assumptions are satisfied: (I) the genetic variant must exhibit an association with the exposure, (II) the genetic variant should not be linked with any confounders, whether they are known or unknown, and (III) the genetic variant must influence the outcome exclusively through the exposure variable, without any alternate routes. To adhere to the first assumption, we selected SNPs associated with the exposure using a genome-wide significance threshold of P < 5×10−8 as potential instruments and employed F-statistics to rule out any weak instruments, specifically those with an F-statistic lower than 10. For the third assumption, tools like MR-Egger and MR-PRESSO were utilized to evaluate horizontal pleiotropy. Regarding the second assumption, recent research suggests that while variants exhibiting horizontal pleiotropy are often considered confounders in MR analyses, they can be valuable for identifying alternative pathways of the trait under study. This can lead to a more nuanced understanding of the specific exposure-outcome hypothesis and, hence, a richer set of results. Therefore, we included all SNPs in our analysis [14].
To assess the association with skin cancer risk, we downloaded GWAS data from the FinnGen Database. After adjusting for age, sex, genetic relatedness, genotyping batch, and the first 10 principal components, we utilized 20,951 SC cases and 287,137 controls for this analysis.
In the pursuit of exploring the putative causative links between plasma metabolites as the exposure and skin cancer risk as the outcome, a two-sample MR study was conducted using IVW, executed through the “TwoSampleMR” R software package. The validity of the MR findings was appraised by adopting a false discovery rate (FDR) threshold of less than 0.05. To bolster the analysis, we employed the MR-Egger intercept test and the MR Pleiotropy Residual Sum and Outlier (MR-PRESSO) global test, which aid in discerning potential horizontal pleiotropy effects and in detecting any directional pleiotropy [151617]. Moreover, a suite of methods, including maximum likelihood, MR-Egger, simple median, and weighted median, were implemented to affirm the directionality and robustness of the results.
The comprehensive workflow of the study is depicted in Figure 1. We studied a total of 1,400 unique quantified metabolites. Through rigorous analysis, we identified 78 instrumental variables (IVs) that have a significant causal relationship with skin cancer risk, with the number of SNPs contributing to each IV ranging from 13 to 38 (Caffeine/linoleate (18:2n6) had the fewest IVs with 13 SNPs; X-23654 had the most with 38 SNPs). These IVs accounted for between 0.0023 and 0.17596 of the variances in their respective metabolites. Furthermore, the lowest F-statistic among these IVs was 19.51, indicating that all IVs were adequately robust for the MR analysis of the 78 metabolites, well above the commonly accepted threshold of an F-statistic greater than 10 (Additional file 1: Table S1).
The IVW method was utilized to establish the causal relationships between 1,400 metabolites and skin cancer, using GWAS data from the FinnGen Database (Risteys FinnGen R9). In total, 78 metabolites displayed notable causative associations with skin cancer at a PIVW < 0.05, as depicted in the forest plot (Additional file 2: Figure. S1). Additional file 3: Table S2 lists the plasma metabolites significantly associated with skin cancer as identified in the GWAS datasets. Additional file 1: Table S1 presents the characteristics of SNPs, including their genetic correlations with plasma metabolites and skin cancer. Among these, 36 metabolites were shown to significantly reduce the risk of skin cancer, while 42 metabolites exhibited a significant positive correlation with the disease (Additional file 4: Table S3). For these 78 metabolites, the MR-Egger analysis, weighted median analysis, simple mode analysis, and weighted mode analysis all indicated trends consistent with the IVW analysis. This consistency across different analytical approaches suggests robustness in the findings. For example, 1-arachidonylglycerol (20:4) displayed the following associations: (MR-Egger, p = 0.02, OR = 1.13, 95% CI 1.03 - 1.25; IVW, p = 0.0004, OR = 1.09, 95% CI 1.04 - 1.15), indicating a consistent trend across analyses (Figure. 2).

Scatter plot of 1-arachidonylglycerol
Horizontal pleiotropy was not deemed a significant issue for the vast majority of the metabolites identified, as indicated by the MR-Egger intercept (if PEgger-Intercept > 0.05) or the MR-PRESSO global test for pleiotropy (if PGlobalTest > 0.05). Notable exceptions such as metabolite X-12117, which had a PEgger-Intercept of 0.043, and gamma-glutamylthreonine as well as Picolinoylglycine, both of which had a PGlobalTest of less than 0.001 (Table 1). Sensitivity analyses confirmed the consistent direction of association for all 78 metabolites. The “leave-one-out” methodology further reinforced that the MR analysis was robust, and no single SNP was influential enough to skew the results significantly. Moreover, Cochran’s Q statistic suggested that significant heterogeneity was absent in the majority of cases, except for 9 plasma metabolites (Table 2). All results collectively reinforce the relative reliability of the causal association between these 78 plasma metabolites and skin cancer. We did not conduct a PhenoScanner search, as our aim, as previously mentioned, was to achieve as comprehensive a set of results as possible. Furthermore, the reverse MR analysis revealed a significant causal association between 2-hydroxyoctanoate (outcome) and skin cancer (exposure), with an odds ratio (OR) of 0.96, a confidence interval (CI) of 0.92 – 0.99, and a PIVW of 0.038. No significant heterogeneity or pleiotropy was detected between the two, and sensitivity analyses verified the consistent direction of 2-hydroxyoctanoate.
Metabolites with significant pleiotropy
X-12117 | Egger Intercept | 0.043 |
X-13695 | Egger Intercept | 0.049 |
Oleoyl-linoleoyl-glycerol (18:1 to 18:2) [2] / linoleoyl-arachidonoyl-glycerol (18:2 to 20:4) [2] | Egger Intercept | 0.024 |
Adenosine 5′-diphosphate (ADP) / cytidine | Egger Intercept | 0.025 |
Caffeine / linoleate (18:2n6) | Egger Intercept | 0.047 |
Gamma-glutamylthreonine | Global Test | <0.001 |
Picolinoylglycine | Global Test | <0.001 |
Hydroxypalmitoyl sphingomyelin (d18:1/16:0(OH)) | Global Test | 0.012 |
Cysteinylglycine | Global Test | 0.017 |
Hypotaurine | Global Test | 0.011 |
Cortisone / 4-cholesten-3-one | Global Test | 0.029 |
Hypotaurine / cysteine | Global Test | 0.003 |
Retinol (Vitamin A) / linoleoyl-arachidonoyl-glycerol (18:2 to 20:4) [2] | Global Test | 0.011 |
Metabolites with significant heterogeneity (MR Egger or Inverse variance weighted)
Metabolites | Method | Q | Q_df | Q_pval |
---|---|---|---|---|
1-stearoyl-2-oleoyl-gpc (18:0/18:1) | MR Egger | 45.43 | 31 | 0.045 * |
1-stearoyl-2-oleoyl-gpc (18:0/18:1) | Inverse variance weighted | 45.62 | 32 | 0.056 |
2-hydroxyarachidate | MR Egger | 39.57 | 20 | 0.006* |
2-hydroxyarachidate | Inverse variance weighted | 47.17 | 21 | 0.001* |
Hydroxypalmitoyl sphingomyelin (d18:1/16:0(OH)) | MR Egger | 48.40 | 29 | 0.013* |
Hydroxypalmitoyl sphingomyelin (d18:1/16:0(OH)) | Inverse variance weighted | 48.54 | 30 | 0.017* |
Cysteinylglycine | MR Egger | 31.72 | 17 | 0.016* |
Cysteinylglycine | Inverse variance weighted | 34.74 | 18 | 0.01* |
Hypotaurine | MR Egger | 45.67 | 26 | 0.01* |
Hypotaurine | Inverse variance weighted | 45.74 | 27 | 0.013* |
X-13431 | MR Egger | 39.43 | 25 | 0.033* |
X-13431 | Inverse variance weighted | 39.52 | 26 | 0.043* |
Cortisone / 4-cholesten-3-one | MR Egger | 39.82 | 29 | 0.087 |
Cortisone / 4-cholesten-3-one | Inverse variance weighted | 43.83 | 30 | 0.049* |
Hypotaurine / cysteine | MR Egger | 54.17 | 31 | 0.006* |
Hypotaurine / cysteine | Inverse variance weighted | 55.86 | 32 | 0.005* |
Retinol (Vitamin A) / linoleoyl-arachidonoyl-glycerol (18:2 to 20:4) [2] | MR Egger | 42.15 | 22 | 0.006* |
Retinol (Vitamin A) / linoleoyl-arachidonoyl-glycerol (18:2 to 20:4) [2] | Inverse variance weighted | 43.13 | 23 | 0.006* |
This Mendelian randomization study provides an unbiased assessment of the causal links between 1,400 plasma metabolites and skin cancer. We pinpointed 78 metabolites associated with the risk of skin cancer by employing genetic variants as investigative tools. Of these, 36 metabolites were found to significantly decrease the risk of skin cancer, while 42 showed a significant positive correlation with the disease. Reverse MR analysis also revealed a causal relationship between 2-hydroxyoctanoate and skin cancer, supporting the reliability of the analysis.
To our knowledge, this is the first MR study to assess the causal relationships between the latest set of 1,400 plasma metabolites and skin cancer. Previous studies have identified significant causal associations between plasma metabolites and various malignancies, which could help clarify the metabolic characteristics of cancers and further assess the potential role of metabolites in cancer risk evaluation. One Mendelian randomization study indicated a significant causal relationship between blood metabolites and pancreatic ductal adenocarcinoma (PDAC). Out of 483 metabolites, 44 unique metabolites were found to be significantly associated with PDAC risk, with the top four ranked as X:12798, X:11787, X:11308, and X:19141 [18]. Similarly, in a study on colorectal cancer (CRC), 6 out of 486 blood metabolites were found to have significant causal links with CRC. Multivariate MR analysis revealed that pyruvate, 1-linoleoylglycerophosphoethanolamine, and gamma-glutamylthreonine could independently influence CRC, aside from other metabolites [11]. Additionally, in breast cancer, researchers found two blood metabolites, high-density lipoprotein cholesterol (HDL-C) and acetate, to have present causal associations (from an analysis of 112 blood metabolites and 147,827 European individuals). In this MR study, the authors suggested that HDL-C and acetate might be promising targets for preventing breast cancer, though their advantages and disadvantages should be carefully considered [19]. For skin cancer, past research has hinted that a causal relationship may exist between childhood sunburn and the risk of malignant melanoma (MM) and non-melanoma skin cancer (NMSC), suggesting that enhancing screening and prevention of childhood sunburn could aid in the early detection and reduction of MM and NMSC risks [20]. Conventional carcinogenic factors, such as smoking, have also been found to have a significant causal link with skin cancer [21]. However, there is currently a lack of research on the causal relationship between metabolites and skin cancer. Our study’s initial findings indicate a causal relationship between plasma metabolites and skin cancer, suggesting that plasma metabolites may have predictive significance for the progression and prognosis of skin cancer. Our study fills a gap in the research on the correlation between plasma metabolites and skin cancer, indicating that from the perspective of metabolites, there is potential to provide new insights into the pathogenic mechanisms of skin cancer.
Previous research indicates that plasma metabolites play a crucial role in the pathogenesis and progression of cancer. For instance, it has been observed that patients with advanced cancer cachexia typically exhibit lower levels of triglycerides, a phenomenon attributed to enhanced triglyceride hydrolysis within the body [ 22, 23 24]. However, our study did not detect a significant reduction in triglycerides, possibly due to the inclusion of a study sample that was not limited to patients with advanced melanoma. Our analysis identified 78 metabolites associated with the risk of melanoma. Arachidonic acid, an omega-6 fatty acid prevalent in cell membranes, is primarily synthesized from dietary linoleic acid, an essential fatty acid, and serves as a precursor for inflammatory mediators such as leukotrienes [25, 26]. The conversion of linoleic acid to arachidonic acid is regulated by the rate-limiting enzyme Δ6-desaturase, and increasing blood concentrations of linoleic acid does not significantly impact arachidonic acid production. Consequently, the level of arachidonic acid typically remains stable. However, our study revealed that an elevated ratio of arachidonic acid to linoleic acid correlates with an increased risk of melanoma. This finding suggests that tumor-induced immune responses may consume substantial amounts of arachidonic acid for the synthesis of leukotrienes and other inflammatory mediators, thereby reducing linoleic acid levels and increasing the arachidonic acid to linoleic acid ratio [27, 28]. Additionally, we observed a correlation between increased serum caffeine levels and melanoma risk. Notably, an elevated ratio of caffeine to linoleate (18:2n6) is positively associated with melanoma risk, whereas an increased ratio of caffeine to theophylline is inversely related to melanoma risk. The former association may be due to the disproportionate consumption of linoleate, whereas the latter requires further investigation. Increased caffeine intake might potentially reduce melanoma incidence.
To ascertain genetic variants as suitable instrumental variables (IVs), three foundational assumptions must be fulfilled: (I) the genetic variant should be associated with the exposure, (II) the genetic variant must not be linked with any known or unknown confounders, and (III) the genetic variant should influence the outcome solely through the exposure, not via alternative pathways. For assumption II, the most common approach is to use PhenoScanner (
The strengths of our study lie in the utilization of high-quality metabolomics data, which has allowed for the identification of plasma metabolite associations with skin cancer (SC), thereby broadening our understanding of the disease’s genetic architecture. Additionally, the relatively new approach of two-sample MR leverages the associations between genetic instrumental variables (IVs) and exposures. GWAS summary data also enhances the statistical power in two-sample MR [35], setting the stage for future mechanistic studies and hinting at the potential for metabolomic interventions in skin cancer.
In conclusion, this study advances our understanding of skin cancer’s biological underpinnings by using a two-sample Mendelian randomization analysis to establish causal relationships between plasma metabolites and skin cancer risk. The significant associations discovered between numerous metabolites and skin cancer risk highlight the potential of metabolomics in identifying biomarkers for early detection and treatment. This research not only provides insights into the complex pathophysiology of skin cancer but also paves the way for personalized medicine, emphasizing the integration of genetic and metabolomic data. Future research should further explore the functions of these metabolites and their roles in the development of skin cancer, contributing to the development of new diagnostic and therapeutic strategies.
Several limitations must be considered for an accurate interpretation of our findings. First, skin cancer can be classified into melanoma and non-melanoma types; our study analyzed skin cancer as a whole. Hence, future research should aim for a more comprehensive investigation of skin cancer subtypes. Second, we have preliminarily identified 78 plasma metabolites associated with skin cancer, but the metabolic pathways of these metabolites were not elucidated in this analysis; future studies based on our results will delve deeper into these pathways. Third, while MR is proven to be an effective method for assessing causal relationships between human plasma metabolites and skin cancer, these findings should be verified through further research based on experimental data. Fourth, the validity of MR analysis greatly depends on the explanatory power of the IVs for the exposure, necessitating an expansion of the sample size to provide a more accurate assessment of the genetic influence on metabolites. Fifth, considering the data currently available in public databases, the data for European populations is relatively more comprehensive. This study only investigated European populations. Future research needs to focus on different ethnic groups. Finally, although this study identified multiple plasma metabolites contributing to skin cancer risk, additional research is needed to uncover their roles in the pathogenic mechanisms of the disease.