Introduction

The global prevalence of diabetes and cancer is increasing rapidly. In 2015, an estimated 415 million people were diagnosed with diabetes, and 90% of those were identified as having type 2 diabetes [1]. Recently, drugs used for the treatment of type 2 diabetes have been reported to affect the risk of cancer [2], suggesting that it is important to select appropriate hypoglycemic drugs that minimize the risk of cancer. Pioglitazone is a thiazolidinedione drug that regulates blood glucose homeostasis by binding to peroxisome proliferator-activated receptor gamma (PPARγ). This drug can improve insulin resistance in peripheral tissues and the liver, protect B-cell function in diabetic patients, and is considered to be beneficial for lipid homeostasis [3]. Various restrictions have been placed on the utilization of other thiazolidinedione drugs, including troglitazone and rosiglitazone, due to the resulting risk of hepatotoxicity or cardiovascular disease [4]. Pioglitazone is currently the only thiazolidinedione drug that is commonly used globally.

However, the safety of pioglitazone still warrants investigation. It has been reported that the use of pioglitazone increases the incidence of bladder cancer [5,6,7]. However, the association between pioglitazone and the risk of bladder cancer remains a source of debate among researchers [8, 9]. In addition to bladder cancer, researchers should also be aware that pioglitazone increases the risk of other types of cancers. In a study by Ferrara et al., pioglitazone use was found to be associated with an increased prostate cancer risk and pancreatic cancer risk, but it did not increase the risk of ten other cancers, including bladder cancer [10]. Also, a meta-analysis indicated that pioglitazone is associated with an increased incidence of prostate cancer and pancreatic cancer. Further investigation is needed to evaluate whether pioglitazone is a causal risk factor for these cancers.

In this research, we used DrugBank to obtain direct-target information for pioglitazone and investigate its potential pharmacological effects. Based on the four direct targets found for pioglitazone, 50 target-related genes were identified with the STRING database. We then performed pathway enrichment analysis for the targets and their associated genes using the STRING and WebGestalt databases. Interestingly, the results indicated that the prostate cancer pathway and the pancreatic cancer pathway were highly correlated with the functional network of pioglitazone. These pathways were thus screened, and the related genes were then used as queries in searches of the cBioPortal database to explore relevant genomic alterations. Overall, our research plan was to assess the risk of prostate cancer and pancreatic cancer associated with pioglitazone use via systematic network studies in order to determine the safety of pioglitazone in T2DM patients.

Methods

Drug-Target Search

The DrugBank database is a useful web-based bioinformatics tool that can provide detailed drug data, including pharmacological mechanisms and targets [11, 12]. In this study, DrugBank was used to retrieve information on pioglitazone, including the interactions between pioglitazone and its primary targets, in order to gain insight into the biological network of pioglitazone.

Network Generation and Pathway Enrichment Analysis

STRING v.10.5 is an open database that is used to explore protein–protein interactions and obtain information on interactions predicted by comparative genomics [13]. In the present research, the target proteins of pioglitazone were explored using STRING, and the results generated were then visualized using Cytoscape (version 3.6.0) [14]. In addition, KEGG pathway analysis of pioglitazone-associated genes was conducted using STRING, which yielded the top 25 statistically significant KEGG enrichment pathways. WebGestalt is an integrated enrichment analysis tool that allows flexible and accurate exploration of functionally enriched pathways [15]. In our study, the pathway enrichment analysis of pioglitazone-associated genes was further validated using WebGestalt.

Obtaining Cancer Genomics Data Linked to Pioglitazone Using cBioPortal

cBioPortal is a web-based tool for exploring multidimensional cancer genomics data obtained from various cancer samples [16, 17]. In our study, the cBioPortal database was used to explore the connectivity of pioglitazone-related genes across prostate cancer and pancreatic cancer studies.

Compliance with Ethics Guidelines

This article is based on previously conducted studies and does not contain any studies with human participants or animals performed by any of the authors.

Results

Characterization of the Biological Effects of Pioglitazone Using DrugBank and Visualization of the Pioglitazone Linkage Network Using STRING

In this research, we first queried DrugBank using pioglitazone as an input, which yielded an output of DB01132, categorizing pioglitazone as a blood glucose lowering agent, cytochrome P450 enzyme inhibitor/inducer, peroxisome proliferator-activated receptor alpha/gamma agonist, and thiazolidinedione (Table 1). Among these, the primary use of pioglitazone is to improve glycemic control, where it is used as an adjunct to diet and exercise. Table 2 presents four primary targets of pioglitazone, PPARG, PPARD, PPARA, and MAOB. To further extend our research and analysis, STRING was employed to obtain 50 pioglitazone-related target proteins. These data, including the four primary targets, were then grouped together using Cytoscape 3.6 to build a biological function network for visualizing the interactions of the pioglitazone-related proteins: the four primary direct targets and the secondary pioglitazone-associated proteins. PPARA had 44 protein targets, PPARD had 19 protein targets, PPARG had 35 protein targets, and MAOB had 2 protein targets; see Fig. 1. Three of the primary direct targets of pioglitazone (the exception being MAOB) had 17 genes in common.

Table 1 Characterization of pioglitazone using DrugBank
Table 2 Identification of the direct targets of pioglitazone using DrugBank
Fig. 1
figure 1

Drug-target interactome of pioglitazone. The drug pioglitazone is highlighted in yellow, the primary direct protein targets (PPARA, PPARD, PPARG, and MAOB) are highlighted in red, and the secondary pioglitazone-associated proteins are highlighted in pink

Analysis of Biologic Functions Related to Pioglitazone-Mediated Changes in Genes Using STRING and WebGestalt

To assess the functional attributes of the pioglitazone-mediated genes, we performed KEGG pathway analysis using STRING. The top 25 KEGG enrichment pathways related to pioglitazone included the PPAR signaling pathway (16 genes), adipocytokine signaling pathway (12 genes), thyroid hormone signaling pathway (11 genes), cancer pathways (13 genes), AMPK signaling pathway (9 genes), prostate cancer (8 genes), hepatitis B (9 genes), non-alcoholic fatty liver disease (8 genes), T-cell receptor signaling pathway (7 genes), inflammatory bowel disease (6 genes), HTLV-I infection (9 genes), FoxO signaling pathway (7 genes), leishmaniasis (6 genes), viral carcinogenesis (8 genes), chronic myeloid leukemia (6 genes), osteoclast differentiation (7 genes), Chagas disease (6 genes), transcriptional misregulation in cancer (7 genes), tuberculosis (7 genes), NOD-like receptor signaling pathway (5 genes), herpes simplex infection (7 genes), acute myeloid leukemia (5 genes), pancreatic cancer (5 genes), Epstein–Barr virus infection (7 genes), and B-cell receptor signaling pathway (5 genes) (Fig. 2). All identified enrichment pathways with biological functions were statistically significant, thus motivating further study. As shown in Table 3, the most common enrichment pathway was the PPAR signaling pathway, which is consistent with pioglitazone mainly exerting a hypoglycemic effect. In addition, both the PPAR and adipocytokine signaling pathways are known to regulate glucose and lipid homeostasis, which is also characteristic of pioglitazone. These observations confirm the reliability of our results. Extensive grouping via biologic functional analysis also indicated that pioglitazone-associated genes are closely linked to cancer and its related signaling pathways, such as the FoxO signaling pathway [18], viral carcinogenesis [19], and transcriptional misregulation in cancer [20]. Among the 25 signaling enrichment pathways, four were directly linked to specific types of cancers: prostate cancer, chronic myeloid leukemia, acute myeloid leukemia, and pancreatic cancer. Notably, Ferrara et al. found that the use of pioglitazone can increase the risk of prostate cancer and pancreatic cancer but not ten other cancers (not including chronic/acute myeloid leukemia). A meta-analysis also showed that pioglitazone use is associated with an increased risk of prostate cancer and pancreatic cancer [21]. These results are consistent with those from the pathway enrichment analysis of pioglitazone. However, whether the increased risk of prostate cancer and pancreatic cancer after treatment with pioglitazone is actually caused by the pioglitazone treatment was unclear. Therefore, the prostate cancer and pancreatic pathways were screened for further investigation.

Fig. 2
figure 2

KEGG pathway analysis of the potential functions of pioglitazone-associated genes in the pathway using STRING

Table 3 List of enriched pioglitazone-associated gene sets obtained using STRING

Next, to verify the enrichment pathway identified by STRING, we additionally conducted KEGG pathway enrichment using WebGestalt. The prostate cancer and pancreatic cancer pathways were also identified with statistical significance in this study (Table 4). A broad search indicated that eight genes in the prostate cancer pathway (CREBBP, EP300, HSP90AA1, NFKB1, NFKBIA, MAPK1, RB1, and RELA) and five genes in the pancreatic pathway (NFKB1, MAPK1, RB1, RELA, and TGFB1) were linked to pioglitazone. The CREBBP, EP300, and RELA genes in the prostate cancer pathway and the RELA gene in the pancreatic cancer pathway were all linked to three of the primary direct targets of pioglitazone.

Table 4 List of enriched pioglitazone-related gene sets obtained via WebGestalt

Mining Genomic Alterations Related to Pioglitazone-Associated Genes in Prostate Cancer Using cBioPortal

To further investigate the associations between pioglitazone-associated genes and the prostate cancer pathway, cBioPortal was used to reveal genomic alterations of pioglitazone-associated genes in prostate cancer. A total of 14 prostate cancer studies were included in cBioPortal. One study was only accepted provisionally, so we focused on the remaining 13 studies. Gene sets containing eight identified genes (EGF, EGFR, THBS1, VEGFA) associated with prostate cancer were analyzed in the 13 studies. The results showed that the most pronounced mutations detected in the identified gene sets occurred in Demichelis’ study, with 46.73% genetic alterations [22]. In that study, 50 cases (46.73%) had alterations in all eight genes; the alteration frequencies of the eight identified genes are shown in Fig. 3. For RELA (31%), CREBBP (25%), HSP90AA1 (25%), NFKBIA (21%), NFKB1 (12%), MAPK (10%), and EP300 (8%), the most common alterations were amplifications. For RB1 (12%), the majority of the alterations were deep deletions, with a small fraction of amplifications as well as missense mutations.

Fig. 3a–b
figure 3

Exploring the genetic alterations linked to the pioglitazone-related genes CREBBP, EP300, HSP90AA1, NFKB1, NFKBIA, MAPK1, RB1, and RELA in prostate cancer using cBioPortal. a Overview of variations in CREBBP, EP300, HSP90AA1, NFKB1, NFKBIA, MAPK1, RB1, and RELA in the genomic database across a series of prostate cancer samples. b Oncopoint: a visual display of genomic alterations in the eight identified genes (based on the Demichelis study [22]). Genomic alterations are summarized and displayed as % changes in the selected genes. Each row corresponds to a gene, and each column is a sample. Bars with different colors represent different genomic alterations

cBioPortal can also be used to perform interactive analysis and to construct networks that are altered in prostate cancer. Figure 4 shows a network including all neighbors of the eight identified genes CREBBP, EP300, HSP90AA1, NFKB1, NFKBIA, MAPK1, RB1, and RELA. To reduce the complexity of the analysis, we employed the alteration frequency within a particular prostate cancer study as a filter, so that only the genes that displayed high alteration frequencies were presented. The eight selected genes were found to be linked to AR using 49.5% alteration as a filter. Eleven genes, including MYC and NCOA2, were flagged up using 44.8% alteration as a filter. Thirteen gene clusters, including AR, MYC, NCOA2, HEY1, and LY96, were highlighted by applying 43.9% alteration as a filter. This integrated network revealed the variability in the pioglitazone-associated gene alterations among the prostate cancer samples in Demichelis’ study [22].

Fig. 4a–b
figure 4

A visual display of gene networks linked to CREBBP/EP300/HSP90AA1/NFKB1/NFKBIA/MAPK1/RB1/RELA in prostate cancer (based on the Demichelis study [22]). a Eight selected pioglitazone-related genes were employed as seed genes (circles with thick black outlines) to harvest all the other genes that were changed in prostate cancer samples using cBioPortal. b Neighboring genes associated with the eight selected genes were filtered by alteration (%). Darker red indicates an increased frequency of alterations in prostate cancer. In addition to the selected genes, the filter used also involved the highest genomic alteration frequency in the prostate cancer study

Mining Genomic Alterations Related to Pioglitazone-Associated Genes in Pancreatic Cancer Using cBioPortal

cBioPortal was also used to assess genomic alterations of pioglitazone-associated genes in pancreatic cancer. A total of four pancreatic cancer studies were included in cBioPortal. One study was only accepted provisionally, so the remaining three studies were used to accurately evaluate the genomic alterations of five selected genes: RB1, MAPK1, NFKB1, RELA, and TGFB1. As shown in Fig. 5, alterations ranging from 1.01 to 29.36% were detected for the selected genes. The study by Knudsen [23] displayed the most pronounced genomic alterations. In that study, 32 cases showed alterations in all five genes. TGFB1 (16%) showed the most prominent alterations, which included amplification with a few cases of missense mutation. For RELA (10%), the majority of the alterations were amplifications. For MAPK1 (7%), most of the alterations were amplifications, with a small fraction of deep deletions, missense mutations, and truncating mutations. For RB1 and NFKB1, only a few alterations were detected (5% and 2.8%, respectively). A network study was then conducted which included neighbors of the five selected genes (Fig. 6). Six genes, including TP53, were identified using 54.1% alterations as the filter. Seven gene clusters, including TP53 and SMAD4, were obtained with a filter of 43.1% alterations, while eight genes, including TP53, SMAD4, and CDKN2A, were highlighted by a filter of 41.2% alterations.

Fig. 5a–b
figure 5

Exploring genetic alterations linked to the pioglitazone-related genes RB1, MAPK1, NFKB1, RELA, and TGFB1 in pancreatic cancer using cBioPortal. a Overview of variations in RB1, MAPK1, NFKB1, RELA, and TGFB1 in the genomic database across a series of pancreatic cancer samples. b Oncopoint: a visual display of genomic alterations in the five identified genes (based on the Knudsen study [23])

Fig. 6a–b
figure 6

A visual display of gene networks linked to RB1/MAPK1/NFKB1/RELA/TGFB1 in pancreatic cancer (based on the Knudsen study [23]). a Five selected pioglitazone-related genes to explore all the other genes that were changed in pancreatic cancer samples using cBioPortal. b Neighboring genes associated with the five selected genes were filtered by alteration (%)

Discussion

Pioglitazone, a thiazolidinedione drug that regulates glucose homeostasis, has been widely used to treat diabetes since 1999 [24]. To our knowledge, the safety profile of pioglitazone has not been determined, including its affect on cancer risk. A cohort and case–control study showed that pioglitazone use is statistically significantly associated with an increased risk of prostate cancer and pancreatic cancer [10]. Meanwhile, a meta-analysis also reported that pioglitazone treatment may increase the risk of prostate cancer and pancreatic cancer [21]. Thus, it is necessary to investigate whether pioglitazone is a causal risk factor for these cancers.

In this study, we first constructed a biological network for pioglitazone to evaluate the relationship between pioglitazone and each type of cancer using open platforms. Four primary targets (PPARG, PPARD, PPARA, and MAOB) and another 50 pioglitazone-related proteins were identified using DrugBank and STRING, and then a KEGG pathway enrichment analysis was carried out based on the network for pioglitazone. As previously described, the relationships between pioglitazone use and prostate cancer risk as well as pancreatic cancer risk remain unclear. Our KEGG enrichment analysis showed that among the top 25 enrichment pathways, four were directly linked to specific types of cancers: prostate cancer, chronic myeloid leukemia, acute myeloid leukemia, and pancreatic cancer. In their study, Ferrara et al. discovered that pioglitazone use can increase the risk of prostate cancer and that of pancreatic cancer, but not the risk of ten other cancers (not including chronic/acute myeloid leukemia) [10]. Hongting et al. also found that the use of pioglitazone leads to an increased risk of prostate cancer as well as an increased risk of pancreatic cancer [21]. Therefore, the prostate cancer and pancreatic cancer pathways were screened to determine whether pioglitazone is a causal risk factor for these cancers. For the prostate cancer pathway, eight identified genes (CREBBP, EP300, HSP90AA1, NFKB1, NFKBIA, MAPK1, RB1, and RELA) were selected, and then their genomic alterations were explored across prostate cancer studies using cBioPortal. Notably, the overlapping genes EP300, CREBBP, and RELA were all found to be linked to three of the primary direct targets of pioglitazone. Additionally, the results showed that the majority of the alterations to EP300, CREBBP, and RELA in prostate cancer were amplifications. These amplifications trigger an upregulation of the expression of those genes, which is associated with the acceleration of carcinogenesis [25, 26]. Moreover, mutual exclusivity analysis showed that there were co-occurrence relationships between EP300 and CREBBP, EP300 and RELA, and CREBBP and RELA (data not shown), demonstrating the synergistic effects of these genes in prostate cancer development.

For the pancreatic cancer pathway, five overlapping genes (NFKB1, MAPK1, RB1, RELA, and TGFB1) were extracted for further analysis. RELA was found to be associated with three of the primary direct targets of pioglitazone, and was observed to be amplified in pancreatic cancer. This results in the overexpression of this gene, which is linked to the carcinogenesis of pancreatic cancer [27]. The TGFB1 gene presented the most pronounced alterations, which were mainly amplifications, thus promoting the expression of TGFB1 and accounting for the development of pancreatic cancer [28]. A co-occurrence relationship between RELA and TGFB1 was noted (data not shown). Based on alteration analysis of the genes related to pioglitazone, it is clear that EP300, CREBBP, and RELA are strongly amplified in prostate cancer and that RELA and TGFB1 are strongly amplified in pancreatic cancer, suggesting that the increased risk of cancer development associated with pioglitazone is due to the amplification of these pioglitazone-related genes. Therefore, the results of our network analysis correlate well with the increased prostate cancer risk and pancreatic cancer risk associated with pioglitazone use.

Further study is still needed to explore the impact of pioglitazone on cancer. The expression profiles of PPARγ in tumor tissues are different from those in normal tissues, and PPARγ agonists have been reported to suppress tumor cell growth [29, 30]. However, conflicting results regarding the pro-carcinogenic effects of PPARγ agonists such as pioglitazone can be found in several clinical studies [8, 24]. Therefore, whether pioglitazone affects cancer development via the PPARγ signaling pathway remains unclear. Our KEGG pathway analysis showed that the AMPK, FoxO, and viral carcinogenesis pathways are closely associated with pioglitazone. These pathways are involved in the development of cancer [18, 19, 31], so future research into the pioglitazone-induced carcinogenesis should also focus on those signaling pathways. In addition, there are various confounding factors in clinical studies, such as age, gender, race, lifestyle habits, occupational exposure, drug dose, the level of blood glucose control, and complications, which could contribute to the observed inconsistency of study results. The biological function network of pioglitazone that we constructed can eliminate these confounding factors, and the generated results may be close to the actual effects of pioglitazone on human beings. Our results also show that pioglitazone has the potential to affect chronic/acute myeloid leukemia—an association that has not received enough attention from researchers. Thus, constructing a gene functional network may be a useful supplemental method for evaluating the safety profiles of drugs.

It should be noted that hyperglycemia or insulin resistance also enhances the risk of malignancy [32], so using pioglitazone as an antidiabetic drug does have beneficial effects in this context too. However, pioglitazone may have other biological actions that contribute to the development of different types of cancers. The combination of these effects of pioglitazone may determine its impact on the risk of specific types of cancers. Therefore, when choosing an antidiabetic drug, priority should be given to drugs that can correct hyperglycemia without increasing the risk of cancer, assuming that all of the drugs considered permit similar control of glucose homeostasis.

There are several limitations of this study. First, although we comprehensively analyzed the observed effects of pioglitazone and its associations with proteins, this study lacks a functional validation experiment which directly demonstrates that pioglitazone does indeed increase the risk of prostate cancer and pancreatic cancer. Second, the results of the pathway enrichment analysis for specific types of cancer were obtained with the STRING database, but the interaction of pioglitazone-related targets does not imply the activation of partners; such interactions often have a neutral or sometimes negative impact on the partner’s function. The nature of the interactions require further investigation to clarify the molecular mechanism through which pioglitazone increases the risk of prostate cancer and pancreatic cancer. Finally, other cancer pathways revealed by pathway enrichment analysis have not yet been investigated. Whether the functional network between pioglitazone and prostate/pancreatic cancer presented in this study can be extended to other specific types of cancers remains to be explored. However, the results of this study may provide the foundations for further experimental research and help researchers to translate basic research into clinical applications.

In summary, we integrated and utilized open databases to establish a biological effect network for pioglitazone and thereby explore the safety of this drug. Scientific research should lead to the discovery of more pioglitazone-targeted proteins helps update the functional network constructed, and thereby provide novel insights into scientific research, and permit improved clinical guidance. For example, our results suggest that pioglitazone is likely to be a causal risk factor for prostate cancer and pancreatic cancer. Therefore, government regulators, doctors, and patients should evaluate the risk–benefit relationship of pioglitazone when choosing a suitable hypoglycemic drug. Overall, this study has demonstrated a simple and flexible method for applying information about the biological network of a drug and genomic alterations during cancer to the rational analysis of drug safety.

Conclusions

Our results suggest that pioglitazone is likely to be a causal risk factor for both prostate cancer and pancreatic cancer.