Nonalcoholic fatty liver disease (NAFLD) is the most common type of chronic liver disease with a prevalence rate of 25% worldwide (1). Several lifestyle-related factors are associated with incident fatty liver such as alcohol intake, lower physical activity, smoking, and shift work. Poor lifestyle choices are often the main cause of fatty liver, these include smoking, drinking, lack of physical activity, and shift work, etc. In addition, high triglycerides, type 2 diabetes mellitus, obesity, and hypertension are associated with incident fatty liver. Therefore, lifestyle modification is strongly recommended to prevent fatty liver (2, 3). It is difficult to detect this problem in the earlier stages of the disease, and may thus further develop into advanced liver diseases, such as cirrhosis and hepatocellular carcinoma, bringing forth clinical challenges to the treatment of NAFLD (4). In the literature, the severity of NAFLD in patients with type 2 diabetes and obesity will be significantly affected, increasing the degree of deterioration of liver fibrosis and the possibility of further development of end-stage liver disease (5, 6, 7). Likewise, studies have shown that when NAFLD patients suffer from cardiovascular diseases and dyslipidemia, these factors have a negative impact on the natural progression of NAFLD (8, 9, 10). Nearly 40% of patients with NAFLD die of complications, as previously reported (1). However, the detailed mechanisms under which NAFLD develops remain largely unknown. Diet adjustment and weight loss can improve NAFLD, but it is difficult to maintain. Moreover, the theory of insulin resistance has been widely accepted clinically. Insulin sensitizers have a certain therapeutic effect, but they can cause adverse reactions such as increased body weight and its therapeutic target is too limited. Therefore, this study aimed at finding new molecular targets to provide a theoretical basis for new and effective treatment methods of NAFLD.
Long non-coding RNA (lncRNA) is the main component of the human transcriptome. Long non-coding RNA plays an important role in regulating cell migration, proliferation, invasion, and metastasis. It can also be used as a diagnostic marker or therapeutic target for malignant tumors and other diseases. Competitive endogenous RNA (ceRNA) is a transcript with the same microRNA (miRNA) response element, which binds to miRNA to compete and regulate its target gene, thereby affecting the biological behavior of the disease. Studies have confirmed that the mutual regulation between lncRNA and miRNA and their downstream target genes plays an important role in the occurrence and development of diseases (11).
The inflammatory component of nonalcoholic steatohepatitis (NASH) is more difficult to capture with ultrasound-assisted techniques. Although more and more technologies are applied in clinical practice, such as quantitative and contrast-enhanced ultrasound, there are still many technical barriers to be broken; and not all technologies have been successful in clinical and research practice (12). Due to the limitations of liver biopsy, searching for non-invasive and reliable diagnostic biomarkers for NAFLD is a priority for current research. Bioinformatics has been widely used to explore biomarkers of different diseases, but NAFLD-related biomarkers need to be further explored to help the early diagnosis and prognosis evaluation of NAFLD (13).
In this study, human samples from the Gene Expression Omnibus (GEO) database were used to identify key genes related to NAFLD and non-NASH samples during the baseline and 1-year follow-up time point, and to explore the underlying mechanism of NAFLD and develop new NAFLD diagnostic biomarkers. Then, the lncRNA–miRNA–mRNA network related to NAFLD was constructed by mapping the differentially expressed RNAs (DERs) into a global triple network via starBase and miRcode databases. This was done to identify which RNAs can be used as sensitive and specific markers for NAFLD. Furthermore, Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analyses were performed to explore the potential regulatory functions of RNAs. Finally, the PharmGKB database was used to search and obtain gene-related drug molecules in the ceRNA regulatory network and then build a gene–drug connection network to screen out important gene molecules and KEGG signaling pathways involved in genes.
GES83452 (14) in the NCBI-GEO (
The mRNA and lncRNA in the GES83452 datasets were reannotated using the HUGO Gene Nomenclature Committee (
The miRNAs related to NAFLD included in the Human MicroRNA Disease Database (HMDD) database (
The screened target genes in the ceRNA regulatory network were submitted to DAVID 6.8 online tool to perform functional annotation based on GO biological processes and KEGG pathway enrichment analysis, the P value < 0.05 as the significance threshold.
The pharmacogenetics and pharmacogenomics knowledge base (PharmGKB) (
A total of 9698 mRNAs and 1116 lncRNAs were detected after data preprocessing, and a total of 561 DERs (48 lncRNA and 513 mRNA; 268 downregulated and 293 upregulated) were screened in the baseline time point group, and 1163 DERs (114 lncRNA and 1049 mRNA; 522 downregulated and 641 upregulated) were screened in the 1-year follow-up time point group, with FDR < 0.05 and |log2FC| > 0.5 as the cutof criteria. We identified all DERs shown in the volcano map according to the value of |log2FC| and displayed DERs on a heat map (Figure 1A, B). The expression values of the DERs were 2-way hierarchically clustered, and the color contrast indicated that there was a significant difference in expression levels between the NAFLD and non-NAFLD samples (Figure1 C, D). Subsequently, a total of 220 overlapping DERs were identified between the baseline and 1-year follow-up time points, which were used by the Venn diagram software (Figure 2).
The hierarchically clustering analysis of screened differentially expressed RNAs (DERs). Left: log2FC-log10 (FDR) volcano map for GSE83452 using the significant DERs. Blue and red dots indicate significant DERs. The horizontally dashed line indicates FDR < 0.05. Two vertical lines indicate |Log2FC| > 0.5. A: Baseline time points; B: 1-year follow-up time points. Right: Two-way hierarchically clustered heat map for GSE83452 using the DERs. Red: upregulated DERs. Blue: downregulated DERs. C: Baseline time points; D: 1-year follow-up time points.
Authentication of overlapping DERs in the GSE83452 datasets via Venn diagrams software. Blue represents DERs of the baseline time points group. Yellow represents DERs of the 1-year follow-up time points group.
In addition, the functional enrichment analysis of the overlapping DERs based on online DAVID analyses revealed 22 significantly related GO biological processes and 9 KEGG pathways, with
Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis for the differentially expressed RNAs (DERs).
Category | Term | Count | P-Value | Genes |
---|---|---|---|---|
Biology Process | GO:0006935~chemotaxis | 8 | 3.110E-04 | |
GO:0006636~unsaturated fatty acid biosynthetic process | 4 | 4.770E-04 | ||
GO:0007267~cell-cell signaling | 10 | 1.513E-03 | ||
GO:0007568~aging | 8 | 1.848E-03 | ||
GO:0007166~cell surface receptor signaling pathway | 10 | 2.538E-03 | ||
GO:0055114~oxidation-reduction process | 15 | 4.070E-03 | ||
GO:0016337~single organismal cell-cell adhesion | 6 | 4.312E-03 | ||
GO:0006629~lipid metabolic process | 7 | 6.425E-03 | ||
GO:0070098~chemokine-mediated signaling pathway | 5 | 6.698E-03 | ||
GO:0032496~response to lipopolysaccharide | 7 | 7.895E-03 | ||
GO:0010508~positive regulation of autophagy | 4 | 8.546E-03 | ||
GO:0035338~long-chain fatty-acyl-CoA biosynthetic process | 4 | 9.779E-03 | ||
GO:0006915~apoptotic process | 13 | 1.716E-02 | ||
GO:0002250~adaptive immune response | 6 | 2.033E-02 | ||
GO:0006959~humoral immune response | 4 | 2.223E-02 | ||
GO:0006968~cellular defense response | 4 | 2.767E-02 | ||
GO:0060326~cell chemotaxis | 4 | 3.124E-02 | ||
GO:0045766~positive regulation of angiogenesis | 5 | 3.346E-02 | ||
GO:0032868~response to insulin | 4 | 3.374E-02 | ||
GO:0008203~cholesterol metabolic process | 4 | 3.504E-02 | ||
GO:0001889~liver development | 4 | 4.331E-02 | ||
GO:0006954~inflammatory response | 9 | 4.788E-02 | ||
KEGG Pathway | hsa01212:Fatty acid metabolism | 6 | 2.300E-04 | |
hsa03320:PPAR signaling pathway | 6 | 1.090E-03 | ||
hsa04620:Toll-like receptor signaling pathway | 7 | 1.479E-03 | ||
hsa01040:Biosynthesis of unsaturated fatty acids | 4 | 2.350E-03 | ||
hsa04640:Hematopoietic cell lineage | 6 | 3.475E-03 | ||
hsa04152:AMPK signaling pathway | 6 | 1.465E-02 | ||
hsa04668:TNF signaling pathway | 5 | 3.695E-02 | ||
hsa00760:Nicotinate and nicotinamide metabolism | 3 | 4.529E-02 | ||
hsa04060:Cytokine-cytokine receptor interaction | 7 | 4.659E-02 |
A total of 77 miRNAs directly related to NAFLD were downloaded from the HMDD database. After the lncRNA and miRNA connection relationship pairs were downloaded and the regulation relationship between significantly DElncRNA and NAFLD-related DEmiRNAs were selected, a total of 74 connection pairs were retained to construct an lncRNA–miRNA connection network with a connection coefficient higher than 0.6. In addition, after the target genes of the miRNA were screened, we compared the regulated target genes with the significant DEmRNAs in target modules and retained the opposite relationship pairs of the expressions of significant differential direction. The miRNA–mRNA regulation network was constructed by using a total of 523 pairs of regulation relationships. Finally, as shown Figure 3, a ceRNA regulation network was constructed.
The lncRNA–miRNA–mRNA ceRNA network. Squares, triangles, and circles represent lncRNA, miRNA, and mRNA, respectively. Green and red dots indicate the significantly downregulated DERs at both baseline and 1-year follow-up time points, and the white dots indicate DERs whose expression difference direction has changed.
A total of 28 GO biological processes and 9 KEGG pathways of the mRNAs in the ceRNA regulatory network were obtained, with
Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis for the mRNA in the ceRNA regulatory network.
Category | Term | Count | P-Value | Genes |
---|---|---|---|---|
Biology Process | GO:0008610~lipid biosynthetic process | 12 | 1.15E-06 | |
GO:0008202~steroid metabolic process | 9 | 1.26E-05 | ||
GO:0006694~steroid biosynthetic process | 6 | 9.34E-05 | ||
GO:0006633~fatty acid biosynthetic process | 5 | 8.64E-04 | ||
GO:0046649~lymphocyte activation | 6 | 4.38E-03 | ||
GO:0016053~organic acid biosynthetic process | 5 | 9.79E-03 | ||
GO:0046394~carboxylic acid biosynthetic process | 5 | 9.79E-03 | ||
GO:0045321~leukocyte activation | 6 | 9.84E-03 | ||
GO:0006355~regulation of transcription, DNA-dependent | 18 | 1.19E-02 | ||
GO:0030334~regulation of cell migration | 5 | 1.31E-02 | ||
GO:0051252~regulation of RNA metabolic process | 18 | 1.47E-02 | ||
GO:0007267~cell-cell signaling | 9 | 1.52E-02 | ||
GO:0051094~positive regulation of developmental process | 6 | 1.71E-02 | ||
GO:0001775~cell activation | 6 | 1.93E-02 | ||
GO:0040012~regulation of locomotion | 5 | 2.00E-02 | ||
GO:0051270~regulation of cell motion | 5 | 2.04E-02 | ||
GO:0009719~response to endogenous stimulus | 7 | 2.19E-02 | ||
GO:0006631~fatty acid metabolic process | 5 | 2.21E-02 | ||
GO:0045860~positive regulation of protein kinase activity | 5 | 3.23E-02 | ||
GO:0060429~epithelium development | 5 | 3.42E-02 | ||
GO:0033674~positive regulation of kinase activity | 5 | 3.61E-02 | ||
GO:0051347~positive regulation of transferase activity | 5 | 4.06E-02 | ||
GO:0001568~blood vessel development | 5 | 4.33E-02 | ||
GO:0051254~positive regulation of RNA metabolic process | 7 | 4.51E-02 | ||
GO:0001944~vasculature development | 5 | 4.66E-02 | ||
GO:0009725~response to hormone stimulus | 6 | 4.80E-02 | ||
GO:0030030~cell projection organization | 6 | 4.84E-02 | ||
GO:0007243~protein kinase cascade | 6 | 4.94E-02 | ||
KEGG Pathway | hsa01040:Biosynthesis of unsaturated fatty acids | 4 | 2.39E-05 | |
hsa00900:Terpenoid backbone biosynthesis | 2 | 8.23E-03 | ||
hsa00534:Heparan sulfate biosynthesis | 2 | 1.38E-02 | ||
hsa04910:Insulin signaling pathway | 3 | 1.79E-02 | ||
hsa04060:Cytokine-cytokine receptor interaction | 4 | 1.86E-02 | ||
hsa04010:MAPK signaling pathway | 4 | 1.93E-02 | ||
hsa04630:Jak-STAT signaling pathway | 3 | 2.21E-02 | ||
hsa05200:Pathways in cancer | 4 | 2.87E-02 | ||
hsa03320:PPAR signaling pathway | 2 | 3.28E-02 |
The gene-related drug molecule connection pairs were downloaded from the PharmGKB database. A total of 154 connection pairs were obtained by selecting the parts related to the genes in the ceRNA regulatory network, and a gene–drug connection network was constructed (Figure 4). Compared with the pathway in which the RNAs significantly participated in the ceRNA regulatory network constructed in the previous step, the leptin receptor (LEPR) and
Gene–drug connection network. Squares represent drug molecules, circles represent genes, and green and red dots represent the significantly downregulated DERs at both baseline and 1-year follow-up time points.
The characteristics of NAFLD include necrotizing inflammation and lipid accumulation in the liver, as well as continuous improvement of living standards leading to over-nutrition. In addition, bad living habits lead to the incidence of NAFLD on a global scale (27). The specific mechanism of the transition from benign steatosis to steatohepatitis in NAFLD is not fully understood, and there are currently no pharmacological options for the treatment of NAFLD. Therefore, the treatment of NASH mainly depends on lifestyle changes, such as strengthening exercises, reducing weight, and a light diet (28). Although current studies have shown that weight loss improves the histological characteristics of NAFLD, most patients have not however achieved the goal of curing NAFLD. There are some potentially valuable molecules, nevertheless, which are currently being clinically evaluated (29). For example,
In this study, a total of 220 overlapping DERs were identified between the baseline and 1-year follow-up time points. In addition, functional enrichment analysis of overlapping DERs, based on online DAVID analyses, revealed 22 significantly related GO biological processes and 9 KEGG pathways, with
Our study revealed that peginterferon alfa-2a and peginterferon alfa-2b can downregulate the expression of
In conclusion, this study constructed and analyzed a ceRNA network, a network which may provide some evidence for future studies focusing on the molecular mechanisms of NALFD.