Pulmonary arterial hypertension (PAH) is a life-threatening condition. The aim of this study was to explore potential crucial genes and pathways associated with PAH based on integrative analyses of gene expression and to shed light on the identification of biomarker for PAH.
Gene expression profile of pulmonary tissues from 27 PAH patients and 22 normal controls were downloaded from public database (GSE53408 and GSE113439). After the identification of differentially expressed genes (DEGs), hub pathways and genes were identified based on the comprehensive evaluation of protein-protein interaction (PPI) network analysis, modular analysis and cytohubba’s analysis, and further validated in another PAH transcriptomic dataset (GSE33463). Potentially associated micro-RNAs (miRNAs) were also predicted.
A total of 521 DEGs were found between PAH and normal controls, including 432 up-regulated DEGs and 89 down-regulated DEGs. Functional enrichment analysis showed that these DEGs were mainly enriched in mitotic cell cycle process, mitotic cell cycle and microtubule cytoskeleton organization. Moreover, five key genes (CDK1, SMC2, SMC4, KIF23, and CENPE) were identified and then further validated in another transcriptomic dataset associated with special phenotypes of PAH. Furthermore, these hub genes were mainly enriched in promoting mitotic cell cycle process, which may be closely associated with the pathogenesis of PAH. We also found that the predicted miRNAs targeting these hub genes were found to be enriched in TGF-β and Hippo signaling pathway.
These findings are expected to gain a further insight into the development of PAH and provide a promising index for the detection of PAH.
Pulmonary arterial hypertension (PAH) is a severe chronic and progressive vascular disorder, predominantly influencing the arterial circulation and, in particular, the pulmonary arterioles . PAH is defined based on the elevation of the mean pulmonary arterial pressure (mPAP) above 25 mmHg at resting state and a pulmonary vascular resistance > 3 Wood units, as well as a pulmonary capillary wedge pressure (PCWP) < 15 mmHg at end expiration . However, this 6th World Symposium on Pulmonary Hypertension (WSPH) Task Force proposed to define the low limit of mPAP for PAH to be 20 mmHg  The increase in afterload puts great stress on the right ventricle (RV), leading to RV hypertrophy, and ultimately RV failure and death . It has a prevalence of about 20 cases in 1,000,000 population and particularly affects women with four times more than men . According to latest classification criteria, PAH can be divided into idiopathic PAH (IPAH), heritable PAH, drug- and toxin-induced PAH, PAH associated with other disease (connective tissue disease, HIV infection, portal hypertension, congenital heart disease, schistosomiasis) and so on . In the past several decades, lots of studies have been undertaken to clarify the mechanism of the progression of PAH and apply new promising target therapy, while the incidence and mortality rate of PAH remains high with poor prognosis [6, 7]. Hence, revealing the causes and underlying molecular mechanisms of the disease, as well as discovering molecular biomarkers for early diagnosis, prevention and personalized treatment, is especially important and highly demanded for PAH.
Microarray has been used to detect massive genes expression for more than 10 years, and is particularly suitable for DEGs screening . With the extensive using of microarray technology, an increasing number of chip data were produced and deposited into public databases including the largest public database: NCBI-Gene Expression Omnibus (NCBI-GEO) database. Several microarray studies have been conducted on PAH in recently years and hundreds of DEGs have been obtained [9, 10]. However, most analytical results are inconsistent among these studies mainly due to sample heterogeneity and study design. Thus, there has been no unified biomarker and commonly accepted biological mechanism acquired from these microarray studies for PAH.
Therefore, in our study, we obtained two original microarray datasets GSE53408 and GSE113439 (unpublished data) with the same platform of GPL6244 from GEO database [11, 12]. After merging the two datasets based on the same platform, we performed comprehensive biological functional analyses of DEGs from various angles, including Gene Ontology (GO) enrichment, pathway enrichment, protein-protein interaction (PPI) network and prediction of correlative miRNAs, which could largely overcome the disadvantages of previous single array studies. More importantly, identifying DEGs with their biological functions and key pathways will assist with providing more accurate and reliable biomarkers for early diagnosis of PAH.
Data attainment and preprocessing
The gene expression data of PAH were obtained from the NCBI-GEO database (http://www.ncbi.nlm.nih.gov/geo/). Respectively, two GEO series (GSE53408 and GSE113439) were chose in our study with the following selection criteria: (a) keywords of “pulmonary artery hypertension (PAH)” or “pulmonary hypertension (PH)”; (b) Inclusion of gene expression data of PAH and normal tissue samples with the same GEO platform; (c) excluding other diseases except PAH and normal tissues, such as pulmonary fibrosis or interstitial pneumonia (d) Datasets contained a minimum of 10 PAH and normal tissue samples and inclusion of > 5000 genes in the GEO platform.
The raw data were manipulated with the process of background adjustment, quantile normalization, logarithmic transformation and summarization by using the “Affy” package of R language . Afterwards, according to the annotation files provided by GPL6244, the expression matrix with the probe IDs were converted into gene symbols. The “Impute.Knn” function of “impute” package was applied to supplement missing value  and probes without a corresponding gene symbol were deleted and the average value was calculated as the final expression value for genes corresponding to more than one probe. Moerover, the “ComBat” function of “sva” package was used to remove known batch effects from microarray data  and quantile normalization within and between arrays on all samples was conducted using “normalizeBetweenArrays function” function.
Identification of differentially expressed genes
We used the limma package  to implement DEGs analysis, and used “princomp” function in R 3.6.0 to conduct a two-dimensional principle component analysis (PCA) and hierarchical clustering to visualize the similarities, as well as the differences between the PAH and the control samples. Subsequently, differential expression analysis was performed and a DEG was defined based on the following criteria: p value< 0.05 and the absolute value of log2 fold change (FC) > 1. The volcano plot, which visualized all DEGs between PAH and control, was performed with “ggplot2” package in R and clustering heatmap for the DEGs was drawn using the R software package “pheatmap”.
Functional analysis of DEGs
To investigate the biological function of DEGs in PAH, we conducted gene ontology (GO) analysis including biological processes (BP), cellular components (CC), and molecular functions (MF). “ClueGo” plug-in  integrated GO terms in Cystoscape, and created biological process networks with the up-regulated and down-regulated genes, respectively. Bonferroni step down method was used for correction and the threshold of p-value was 0.05. Significant pathway analysis was conducted by the function “Gene-list Enrichment” with same threshold of p-value in online websites of Kobas3.0 (http://kobas.cbi.pku.edu.cn) including four signaling pathway analysis: KEGG Pathway, Reactome, BioCyc, and PANTHER.
Identification and validation of hub genes
The PPI data of the DEGs was downloaded from STRING version 11.0. Then, the PPI network was set up and visualized by Cytoscape  software, and hub genes were detected according to levers of degree (the number of connections/interactions for each node) in PPI network. To further validate the key genes, the plug-in MCODE  was used to find out several functional modules based on MCODE score, which represented the degree of interrelation of DEGs. Subsequently, we also used plug-in cytoHubba of Cytoscape to explore important nodes in the PPI network by several topological algorithms including Betweenness, Bottle Neck, Closeness, Clustering Coefficient, Degree, DMNC, EcCentricity, EPC, MCC, MNC, Radiality and Stress . The top 50 genes identified by each topological algorithm were selected to found the shared genes more than 6 ways as the most important hub genes in the network . Finally, the shared hub genes detected by three methods (levers of degree in PPI network, MCODE score and topological algorithms in cytoHubba) were identified as the pivotal genes.
The transcriptomic data set of peripheral blood mononuclear cells (PBMCs) (GSE33463, Table S1), includes 30 IPAH, 19 patients with systemic sclerosis (SSc) without pulmonary hypertension, 42 scleroderma-associated PAH patients (SSc-PAH), and 8 patients with SSc complicated by interstitial lung disease and PH (SSc-PH-ILD), which were used to validate DEGs identified compared with 41 healthy individuals. By further analyzing expressional levers of pivotal genes with Wilcoxon and Kruskal−Wallis test, hub genes were validated and were exhibited as Violin diagram using R software package “ggpubr”. Subsequently, correlation analysis of hub genes was conducted by R software package “ggcorrplot”. To validate the function of hub-genes in distinguishing PAH cohorts from the control groups, the clustering heatmap for hub genes was drawn using the function “heatmap.2” of R software package “gplots”.
Identification of hub genes associated with respiratory tract diseases
The Comparative Toxicogenomics Database (CTD; http://ctdbase.org/), a premier public resource based on literature, was used to find curated associations between chemicals interactions, gene interactions, phenotypes, diseases, and environmental exposures. In the database, the Inference Score was calculated from original source articles to present the relationship of genes to diseases. Here, we used the CTD database to analyze the associations between hub genes and respiratory tract diseases, and identified their relationships based on ranks of Inference Score.
Prediction of miRNAs interacted with hub genes and function analysis
The miRNA-mRNA (hub genes) interaction networks were predicted based on Diana-microT-CDS (http://www.microrna.gr/microT-CDS/), TargetScan (http://www.targetscan.org/), miRDB (http://www.mirdb.org/) and mirDIPsoftware (http://ophid.utoronto.ca/mirDIP/) respectively. By setting “threshold as 0.7” in microT-CDS, “score class as very high (top1%) or high (top5%)” in mirDIP and “Total context ++ score” ≤ − 0.2 in TargetScan, we identified the intersection of four database as prediction of miRNAs for each hub gene. Cytoscape was applied to visualize the miRNA-mRNA interaction network and bubble diagram was selected for exhibiting function analysis of miRNA using online tools from Diana-miRPath v3.0 (http://www.micro rna.gr/miRPathv3).
DEGs in subgroups of PAH
Fig.S1 showed the workflow for identification, functional analysis and validation of DEGs in PAH. To get a list of PAH-related DEGs, we compared the gene expression profiles of lung tissues of PAH patients with samples from healthy volunteers. Based on the GPL6244 [HuGene-1_0-st] Affymetrix Human Gene 1.0 ST Array, the microarray data included 12 cases and 11 control samples from GSE53408, as well as 15 cases and 11 control samples from GSE113439 (Table S1).
We undertook quality control of these datasets, and observed that gene expression distribution of each sample from the two different resources were homogeneous and comparable after data preprocessing (Fig.S2 A). PCA analysis revealed that PAH samples and control were clearly separated into two distinct clusters (Fig. 1a), indicating the discriminative gene expression pattern of PAH. Based on the cut-off criteria (p < 0.05 and [logFC] > 1), 521 DEGs between PAH and normal controls were identified, including 432 up-regulated genes and 89 down-regulated genes displayed by volcano plot (Fig. 1b and Table S2). The expression of the top 100 DEGs listed by corrected P-values was depicted by heatmap (Fig. 1c), from which we can see significant different clustering between two groups.
Functional enrichment analysis of DEGs
To further interpret the biological processes associated with DEGs, ClueGo plug-in for Cytoscape was used to cluster GO terms that participate in the same biological function and visualize of the interactions inside each cluster, as well as between different groups. As shown in Fig. 2a and Table S3, the DEGs were significantly enriched in several biological processes (BP) potentially associated with PAH including mitotic cell cycle (GO: 0000278), mitotic cell cycle process (GO: 1903047) and microtubule cytoskeleton organization (GO: 0000226), all of which play essential roles in cell proliferation. As for cell component (CC), we found that candidate DEGs were mainly enriched in intracellular non-membrane-bounded organelle (GO:0043232), nuclear lumen (GO:0031981) and microtubule cytoskeleton (GO:0015630). In addition, the candidate DEGs were significantly enriched in ATP binding (GO: 0005524), adenyl ribonucleotide binding (GO: 0032559) and adenyl nucleotide binding (GO: 0030554) in the molecular function (MF) group.
Moreover, we also investigated the biological processes of all up-regulated and down-regulated genes which were exhibited severally in Fig. 2b and c, respectively. We found that up-regulated genes were also mainly enriched in cell cycle process (GO: 0022402) and mitotic cell cycle process (GO: 1903047) highly consistent with all DEGs. Nevertheless, down-regulated genes were mainly enriched in vasculogenesis (GO:0001570) and regulation of vasculogenesis (GO:2001212) with unapparent interaction. In summary, the GO analyses indicated that most of DEGs were significantly enriched in mitotic cell cycle process, microtubule cytoskeleton organization, intracellular non-membrane-bounded organelle, and ATP binding, all of which are associated with the cell proliferation [22, 23]. As shown in Table S4, pathway enrichment analysis showed that cell cycle, metabolism of proteins and cell cycle, mitotic were top 3 significantly enriched pathways according to the corrected P-Value, which were also consistent with GO term enrichment analysis.
Protein–protein interaction network (PPI), modular and CytoHubba analysis
To identify central attractors for DEGs in the physical interaction network and provide clues for further pathogenic mechanism of PAH, we constructed an interconnected PPI network of DEGs based on the Search Tool for the Retrieval of Interacting Genes (STRING) database (https://string-db.org/). A total of 521 DEGs were filtered into the network, forming 492 nodes and 2720 edges with average node degree of 11 (Fig.S3). Among the 492 nodes, the top 15 node genes listed by the number of connections were identified with degree of larger than 40 (each node had more than 40 connections/interactions), including TOP2A, TOP2B, CDK1, LRRK2, HSP90AA1, EPRS, POLR2B, SMC2, CHEK1, SMC4, KIF23, PLK4, KIF11, CDC6 and CENPE (Table S5). In the PPI network of top 100 ranked DEGs according to the P-values, we found that up-regulated genes have more significantly close interaction than down-regulated genes (Fig. 3a). Given that mitotic cell cycle process indicated the most prominent features in up-regulated DEGs based on GO term enrichment and microtubule cytoskeleton organization especially in microtubule had been found exist important interaction with PAH through text mining , we performed the interconnected interaction network of DEGs among these two biological processes. We found that there were 70 DEGs with a close interconnected relationship and 27 DEGs shared by these two biological processes (Fig. 3b).
For further modular analysis for DEGs in the PPI network, we identified 3 significant modules from the PPI network complex by using MCODE with MCODE score greater than 5 (Fig. 3c and Fig.S2 B). Pathway enrichment analysis using Kobas3.0 showed that Module 1, consisted of 31 nodes and 446 edges, were mainly associated with cell cycle, cell cycle mitotic and M phase. For Module 2, it was consisted of 20 nodes and 173 edges, which were mainly associated with major pathway of ribosomal RNA (rRNA) processing in the nucleolus and cytosol, as well as ribosome biogenesis in eukaryotes. In addition, the Module 3 were mainly associated with cellular response to heat stress, regulation of HSF1-mediated heat shock response and protein processing in endoplasmic reticulum, including 17 nodes and 65 edges (Table S6). Based on the levers of MCODE score, we identify 22 hub genes with largest node’s MCODE score 24 in Module 1(bold in Table S5) .
In addition to aforementioned two functional analysis, hub genes were also identified by CytoHubba analysis. We selected the top 50 key genes identified from each method of CytoHubba and found that there were 29 hub genes shared with more than 6 topological analysis methods (Table S7). Summarily, there were 9 key genes shared by all three analysis, including TOP2A, CDK1, SMC2, CHEK1, SMC4, KIF23, PLK4, CDC6 and CENPE (Fig. 3d).
Validation of hub genes and identification of correlative respiratory tract diseases
To further explore the potential role of these hub genes in PAH, we used the transcriptomic data set from a cohort of PBMCs (GSE33463) to validate the nine hub genes. As shown in Fig. 4, five hub genes (CDK1, SMC2, SMC4, KIF23 and CENPE) were further identified with significant increased expression in PAH patients compared to control (p = 0.00014 for CDK1, p = 1.1 × 10− 7 for SMC2, p = 6.5× 10− 8 for SMC4, p = 1.7× 10− 5 for KIF23, p = 2.9× 10− 7 for CENPE; Wilcoxon and Kruskal-Wallis test). Notably, we also found that the five hub genes have significant increased expression in several PAH subtypes compared to control, especially in IPAH and systemic sclerosis (SSc)-PAH for all five hub genes (Fig. 4). Moreover, except CDK1, there are 4 hub genes with increased expression in SSc compared to control. In addition, only SMC4 showed the higher expression in SSc-PAH-ILD than control. For other four hub genes (CDC6, CHEK1, TOP2A and PLK4), there were no significant difference between PAH patients and control (Fig.S4).
To further confirm the significant difference between PAH cohorts and control cohorts, we made the cluster analysis which revealed up-regulation of hub genes in cases while down-regulation in control (Fig. 5a). Correlation analysis showed that the five hub genes have positive correlation between each other with correlation coefficient greater that 0.8 (Fig. 5b). Moreover, we also performed the association analysis between hub genes and respiratory tract diseases based on the Inference Score from CTD database. Although lung neoplasms covered the largest area with high inference score in all hub genes, other respiratory tract diseases, including hypertension pulmonary, lung diseases interstitial and pulmonary fibrosis, also presented a degree of association with hub genes (Fig. 5c).
Prediction of miRNAs targeting hub genes and function analysis for miRNAs
To further explore the potential role of miRNAs in PAH, we found 30 probable miRNAs that target these five hub genes based on the prediction of Diana-microT-CDS, miRDB, TargetScan and mirDIP (Fig. 6a). Among these miRNAs, that there were 13 miRNAs targeting SMC2, 6 miRNAs targeting CENPE, 6 miRNAs targeting CDK1, 4 miRNAs targeting KIF23 and one miRNA targeting SMC4 (Fig. 6a). Subsequently, using Diana-miRPath v3.0, we conducted KEGG pathway enrichment analysis that exhibited miRNAs majorly focus on TGF − beta signaling pathway, signaling pathways regulating pluripotency of stem cells, Proteoglycans in cancer, Mucin type O − Glycan biosynthesis and Hippo signaling pathway (Fig. 6b-f).
With the development of high-throughput gene detection technology, gene expression studies have been conducted to reveal the molecular mechanisms of the progression of PAH, but the specific genetic changes of PAH are still not clear. Herein, we extracted and merged data from GSE53408 and GSE113439 datasets containing gene expression profiles of both PAH and normal tissues, and identified a total of 521 DEGs after a series of preprocessing process. Functional annotation indicated that these DEGs were mainly involved in mitotic cell cycle and microtubule cytoskeleton organization. By constructing the PPI network and further modular analysis, we identified some key genes and predicted miRNAs targeting hub genes with function analysis, all of which can provide new insights into the pathogenesis of PAH.
It has been known that the pathogenesis of PAH is complex and uncertain including the increasing ratio of endothelium-derived vasoconstrictors to vasodilators , apoptosis resistance, and hyperproliferation of pulmonary artery vascular smooth muscle cells (PASMC) , increasing plasma serotonin levels  and decreased function of voltage-gated potassium channels in PASMC . By integrated bioinformatical analysis, we identified the mitotic cell cycle process as the considerable biological process dealing with pathogenesis of PAH, which had been found as target sites for some drugs that could delay even overturn the progress of PAH through inhibiting the proliferation of vascular smooth muscle . For an instance, Kentaroet et al.  indicated protein tyrosine kinases inhibitors inhibit multiple steps of the vascular SMC cell cycle and a progressive irreversible endothelial cell dysfunction induced by Monocrotaline pyrrole, leading to inactivation of CDC2 kinase and irreversible cell-cycle arrest at the G2 checkpoint, which was ulteriorly certified by Thomas’s study . Moreover, microtubule cytoskeleton organization had been reported to regulate the essential processes of smooth muscle cell migration through cell contraction and focal adhesion assembly, which was associated with lung and vascular diseases . Further, Yunchao’s study documented that microtubule-active agents could influence the state of microtubule polymerization to modify the nitric oxide (NO) production in pulmonary artery endothelial cells (PAEC), which might well provide a promising avenue for the treatment of PAH .
In present studies, CDK1, also named CDC2, is one of Cyclin-dependent kinase family, which are essential conditioning agent of cell cycle progression . More importantly, previous study had discovered the indispensable role of CDK1 in G1/S and G2/M phase transitions of the eukaryotic cell cycle, as a vital catalytic subunit of M-phase promoting factor . It has been reported that the phosphorylation and dephosphorylation process of proteins encoded by CDK1 plays an essential role in regulating mitochondrial function, maintaining mitochondrial homeostasis and improving cell-cycle progression . It had been reported that PASMC mitochondria play an essential role in the process of PAH by regulating energy metabolism including glucose oxidation and increased cytoplasmic glycolysis represent, which were associated with oxidative stress mechanism in hypoxic microenvironment . Interestingly, recent studies have demonstrated an important interaction between the mitochondrial cycle and the cell cycle, leading to increased proliferation in human PAH PASMCs, which also verifies the interaction between cell cycle, especially mitochondrial cycle and PAH . Although most studies considered abnormal expression of CDK1 as risk factors for a variety of tumors, such as hepatocellular, breast and colorectal [37, 38], several researches had found that CDK1 participated in molecular mechanism of PAH, mainly influencing mitochondrial dynamics .
Similarly, SMC2 and SMC4 are two vital core subunits of condensin, which plays an essential role in mitotic chromosome condensation . Recently, it has been reported that the condensin complex also play role in DNA repair and transcriptional regulation during interphase. More importantly, Yoko’s study has found that SMC2 was transcriptionally regulated by MYCN, a carcinogenic gene with poor prognosis in MYCN-amplified neuroblastoma cells . There was also evidence indicating the transcription of SMC4 was activated by NF-κB through regulation of miR-16 and miR-21 in gastric cancer, and SMC4 could participate in chronic lymphocytic leukemia (CLL) with post-transcriptional regulation of miR-15/− 16 family members . Therefore, SMC2 and SMC4 might function in the regulation of cell proliferation based on aforementioned mechanisms, which was further confirmed by Verónica’s study that indicated that SMC2 transcription was directly activated by WNT signaling, as a key player in the mitotic cell division machinery .
KIF23, as both a regulator of cytokinesis and a motor enzyme of microtubules, is critical for the microtubule bundling during cytokinesis . In addition, KIF23 expression was regulated in cell cycle with peaks in G2/M phase, indicating the role of KIF23 in boosting cell hyperplasia . Based on above features, up-regulated KIF23 has been considered as one of biomarkers in multiple tumors including lung cancer, malignant pleural mesothelioma, glioma and hepatocellular carcinoma [46, 47].
CENPE, the largest kinesin of Kinesins families, has been found with roles in microtubule kinetochore capture, which contributes to chromosome congression and alignment . Sullivan’s study  also demonstrated CENP-C and CENP-E are necessary components of functional centromeres in human. Through traveling unidirectionally along microtubule tracks, CENPE participated in intracellular transport or cell division, which interpreted its role in cell cycle in our study . It also had been reported that CENPE was highly expressed in the G2/M phase of the cell cycle and promoted lung adenocarcinoma (LUAD) proliferation, regulated by FOXM1 in previous study . In addition, several studies had considered CENPE as drug targets and even entered Phase I and II clinical trials for threat of certain tumors . It is known to all that PAH is recognized as the most common complication of systemic sclerosis (SSc) due to severe vascular lesion. Notably, CENPE was increased in vascular progenitors and mature endothelial cells as one of the scleroderma autoantigens with IFI-16 in McMahan’s study , which supported its role in proliferation of vascular endothelial cells in our study.
In this study, we found that there was significant enrichment of DEGs in cell cycle associated with PAH through vascular smooth muscle cells proliferation. Notably, KEGG pathways analysis of predicted miRNAs also suggest similar potential biological mechanism for PAH. It had been reported that BMPR-II, a receptor of TGF-beta family, participated in formation of PAH by germline mutations [54, 55]. Through text mining, Goumans’s  study had also affirmed that TGF-β/ALK1 signaling could stimulate endothelial cells (EC) migration, proliferation and tube formation by inducing Smad1/5 activation, which played an essential role in vascular dysfunction . Moreover, Hippo signaling pathway  was reported as a master regulator of proliferation and apoptosis balance with function of regulating cell proliferation and inducing cell differentiation or apoptosis. However, excessive activation of Hippo signaling pathway by its major reciprocal effectors Yap/Taz would promote proliferation via regulating other transcriptional factors, as significant component of PAH progression in Tatiana V’s research . All of these studies supported our hypothesis on hub genes may participate in PAH through promoting biological process of vascular proliferation. Moreover, our study also found some potential pathways such as Signaling pathways regulating pluripotency of stem cells, which might provide new opinions on mechanism or therapeutic targets for PAH. Significantly, several studies have started to focus on implantation of Mesenchymal Stem Cells in postponing or improving the process of cardiovascular diseases including PAH, which also gives us several inspiration on novel therapeutic intervention for PAH .
However, there still are some limitations in this study. To reach a solid correlation between hub genes and PAH, further validation of enlarged samples would be necessary. Given the shortage of silicon analyses and some dataset validation, a series of in-depth experiments in vitro and vivo are needed in the future to confirm the specific functions of these genes in PAH and detailed pathway regulation.
In conclusion, a total of 432 up-regulated DEGs and 89 down-regulated DEGs were identified in PAH samples. The DEGs related to the up-regulation of cell cycle process (CDK1, SMC2, SMC4, KIF23, CENPE), which may activate the proliferation of PASM in the process of PAH. The predicted miRNAs were found enriched in TGF-β and Hippo signaling pathway. These findings are expected to a get a further insight into biomarkers for PAH diagnosis and molecular mechanisms of PAH pathogenesis.
Availability of data and materials
All the data generated or analyzed in this study have been included in this published article and its supplementary Table and Figure files. The raw matrix datasets can be downloaded from the NCBI-GEO database by searching “GSE53408” (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE53408), “GSE113439” (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE113439) and “GSE33463” (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE33463). GPL6244 annotation files can be required from the website: https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GPL6244.
Pulmonary arterial hypertension
Differentially expressed genes
Mean pulmonary arterial pressure
Pulmonary capillary wedge pressure
World symposium on pulmonary hypertension
Idiopathic pulmonary arterial hypertension
Log2 fold change
Peripheral blood monouclear cells
Comparative toxicogenomics database
The search tool for the retrieval of interacting genes
Pulmonary artery vascular smooth muscle cells
Pulmonary artery endothelial cells
Chronic lymphocytic leukemia
Tuder RM, Archer SL, Dorfmuller P, Erzurum SC, Guignabert C, Michelakis E, Rabinovitch M, Schermuly R, Stenmark KR, Morrell NW. Relevant issues in the pathology and pathobiology of pulmonary hypertension. J Am Coll Cardiol. 2013;62(25 Suppl):D4–12.
McLaughlin VV, Archer SL, Badesch DB, Barst RJ, Farber HW, Lindner JR, Mathier MA, Mcgoon MD, Park MH, Rosenson RS, et al. ACCF/AHA 2009 expert consensus document on pulmonary hypertension a report of the American college of cardiology foundation task force on expert consensus documents and the american heart association developed in collaboration with the American college of chest physicians; American Thoracic Society, Inc.; and the pulmonary hypertension association. J Am Coll Cardiol. 2009;53(17):1573–619.
Simonneau G, Montani D, Celermajer DS, Denton CP, Gatzoulis MA, Krowka M, Williams PG, Souza R. Haemodynamic definitions and updated clinical classification of pulmonary hypertension. Eur Respir J. 2019;53(1):1-13.
Vonk-Noordegraaf A, Haddad F, Chin KM, Forfia PR, Kawut SM, Lumens J, Naeije R, Newman J, Oudiz RJ, Provencher S, et al. Right heart adaptation to pulmonary arterial hypertension: physiology and pathobiology. J Am Coll Cardiol. 2013;62(25 Suppl):D22–33.
Hatton N, Ryan JJ. Sex differences in response to pulmonary arterial hypertension therapy : is What's good for the goose, good for the Gander? Chest. 2014;145(6):1184–6.
Machado RD, Southgate L, Eichstaedt CA, Aldred MA, Austin ED, Best DH, Chung WK, Benjamin N, Elliott CG, Eyries M, et al. Pulmonary arterial hypertension: a current perspective on established and emerging molecular genetic defects. Hum Mutat. 2015;36(12):1113–27.
Hoeper MM, Apitz C, Grunig E, Halank M, Ewert R, Kaemmerer H, Kabitz HJ, Kahler C, Klose H, Leuchte H, et al. Targeted therapy of pulmonary arterial hypertension: Updated recommendations from the Cologne Consensus Conference 2018. Int J Cardiol. 2018;272s:37–45.
Vogelstein B, Papadopoulos N, Velculescu VE, Zhou S, Diaz LA Jr, Kinzler KW. Cancer genome landscapes. Science. 2013;339(6127):1546–58.
Mura M, Anraku M, Yun Z, McRae K, Liu M, Waddell TK, Singer LG, Granton JT, Keshavjee S, de Perrot M. Gene expression profiling in the lungs of patients with pulmonary hypertension associated with pulmonary fibrosis. Chest. 2012;141(3):661–73.
Hemnes AR, Brittain EL, Trammell AW, Fessel JP, Austin ED, Penner N, Maynard KB, Gleaves L, Talati M, Absi T, et al. Evidence for right ventricular lipotoxicity in heritable pulmonary arterial hypertension. Am J Respir Crit Care Med. 2014;189(3):325–34.
Zhao Y, Peng J, Lu C, Hsin M, Mura M, Wu L, Chu L, Zamel R, Machuca T, Waddell T, et al. Metabolomic heterogeneity of pulmonary arterial hypertension. PLoS One. 2014;9(2):e88727.
Zhao YD, Yun HZH, Peng J, Yin L, Chu L, Wu L, Michalek R, Liu M, Keshavjee S, Waddell T, et al. De novo synthesize of bile acids in pulmonary arterial hypertension lung. Metabolomics. 2014;10(6):1169–75.
Gautier L, Cope L, Bolstad BM. Irizarry RA: affy--analysis of Affymetrix GeneChip data at the probe level. Bioinformatics. 2004;20(3):307–15.
Suyundikov A, Stevens JR, Corcoran C, Herrick J, Wolff RK, Slattery ML. Accounting for dependence induced by weighted KNN imputation in paired samples, motivated by a colorectal cancer study. PLoS One. 2015;10(4):e0119876.
Leek JT, Johnson WE, Parker HS, Jaffe AE, Storey JD. The sva package for removing batch effects and other unwanted variation in high-throughput experiments. Bioinformatics. 2012;28(6):882–3.
Ritchie ME, Phipson B, Wu D, Hu Y, Law CW, Shi W. Smyth GK: limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 2015;43(7):e47.
Bindea G, Mlecnik B, Hackl H, Charoentong P, Tosolini M, Kirilovsky A, Fridman WH, Pages F, Trajanoski Z, Galon J. ClueGO: a Cytoscape plug-in to decipher functionally grouped gene ontology and pathway annotation networks. Bioinformatics. 2009;25(8):1091–3.
Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, Amin N, Schwikowski B, Ideker T. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 2003;13(11):2498–504.
Bader GD, Hogue CW. An automated method for finding molecular complexes in large protein interaction networks. BMC Bioinformatics. 2003;4:2.
Chin CH, Chen SH, Wu HH, Ho CW, Ko MT, Lin CY. cytoHubba: identifying hub objects and sub-networks from complex interactome. BMC Syst Biol. 2014;8(Suppl 4):S11.
Sun D, Wan X, Pan BB, Sun Q, Ji XB, Zhang F, Zhang H, Cao CC. Bioinformatics analysis of genes and pathways of CD11b(+)/Ly6C(intermediate) macrophages after renal ischemia-reperfusion injury. Curr Med Sci. 2018;38(1):70–7.
Thomas HC, Lame MW, Wilson DW, Segall HJ. Cell cycle alterations associated with covalent binding of monocrotaline pyrrole to pulmonary artery endothelial cell DNA. Toxicol Appl Pharmacol. 1996;141(1):319–29.
Tang DD, Gerlach BD. The roles and regulation of the actin cytoskeleton, intermediate filaments and microtubules in smooth muscle cell migration. Respir Res. 2017;18(1):54.
Marsboom G, Toth PT, Ryan JJ, Hong Z, Wu X, Fang YH, Thenappan T, Piao L, Zhang HJ, Pogoriler J, et al. Dynamin-related protein 1-mediated mitochondrial mitotic fission permits hyperproliferation of vascular smooth muscle cells and offers a novel therapeutic target in pulmonary hypertension. Circ Res. 2012;110(11):1484–97.
Ni M, Liu X, Wu J, Zhang D, Tian J, Wang T, Liu S, Meng Z, Wang K, Duan X, et al. Identification of candidate biomarkers correlated with the pathogenesis and prognosis of non-small cell lung Cancer via integrated bioinformatics analysis. Front Genet. 2018;9:469.
Steudel W, Ichinose F, Huang PL, Hurford WE, Jones RC, Bevan JA, Fishman MC, Zapol WM. Pulmonary vasoconstriction and hypertension in mice with targeted disruption of the endothelial nitric oxide synthase (NOS 3) gene. Circ Res. 1997;81(1):34–41.
Ryan JJ, Marsboom G, Fang YH, Toth PT, Morrow E, Luo N, Piao L, Hong Z, Ericson K, Zhang HJ, et al. PGC1alpha-mediated mitofusin-2 deficiency in female rats and humans with pulmonary arterial hypertension. Am J Respir Crit Care Med. 2013;187(8):865–78.
Herve P, Launay JM, Scrobohaci ML, Brenot F, Simonneau G, Petitpretz P, Poubeau P, Cerrina J, Duroux P, Drouet L. Increased plasma serotonin in primary pulmonary hypertension. Am J Med. 1995;99(3):249–54.
Reeve H, Michelakis E, Nelson D, Weir EK, Archer S. Alterations in a redox oxygen sensing mechanism in chronic hypoxia. J Appl Physiol. 2001;90(6):2249–56.
Shimokado K, Umezawa K, Ogata J. Tyrosine kinase inhibitors inhibit multiple steps of the cell cycle of vascular smooth muscle cells. Exp Cell Res. 1995;220(2):266–73.
Thomas HC, Lame MW, Morin D, Wilson DW, Segall HJ. Prolonged cell-cycle arrest associated with altered cdc2 kinase in monocrotaline pyrrole-treated pulmonary artery endothelial cells. Am J Respir Cell Mol Biol. 1998;19(1):129–42.
Su Y, Zharikov SI, Block ER. Microtubule-active agents modify nitric oxide production in pulmonary artery endothelial cells. Am J Physiol Lung Cell Mol Physiol. 2002;282(6):L1183–9.
Xi Q, Huang M, Wang Y, Zhong J, Liu R, Xu G, Jiang L, Wang J, Fang Z, Yang S. The expression of CDK1 is associated with proliferation and can be a prognostic factor in epithelial ovarian cancer. Tumour Biol. 2015;36(7):4939–48.
Satyanarayana A, Hilton MB, Kaldis P. p21 inhibits Cdk1 in the absence of Cdk2 to maintain the G1/S phase DNA damage checkpoint. Mol Biol Cell. 2008;19(1):65–77.
Liu R, Fan M, Candas D, Qin L, Zhang X, Eldridge A, Zou JX, Zhang T, Juma S, Jin C, et al. CDK1-mediated SIRT3 activation enhances mitochondrial function and tumor Radioresistance. Mol Cancer Ther. 2015;14(9):2090–102.
Sutendra G, Michelakis ED. Pyruvate dehydrogenase kinase as a novel therapeutic target in oncology. Front Oncol. 2013;3:38.
Bednarek K, Kiwerska K, Szaumkessel M, Bodnar M, Kostrzewska-Poczekaj M, Marszalek A, Janiszewska J, Bartochowska A, Jackowska J, Wierzbicka M, et al. Recurrent CDK1 overexpression in laryngeal squamous cell carcinoma. Tumour Biol. 2016;37(8):11115–26.
Kim SJ, Nakayama S, Shimazu K, Tamaki Y, Akazawa K, Tsukamoto F, Torikoshi Y, Matsushima T, Shibayama M, Ishihara H, et al. Recurrence risk score based on the specific activity of CDK1 and CDK2 predicts response to neoadjuvant paclitaxel followed by 5-fluorouracil, epirubicin and cyclophosphamide in breast cancers. Ann Oncol. 2012;23(4):891–7.
Ryan J, Dasgupta A, Huston J, Chen KH, Archer SL. Mitochondrial dynamics in pulmonary arterial hypertension. J Mol Med (Berl). 2015;93(3):229–42.
Takemoto A, Kimura K, Yokoyama S, Hanaoka F. Cell cycle-dependent phosphorylation, nuclear localization, and activation of human condensin. J Biol Chem. 2004;279(6):4551–9.
Murakami-Tonami Y, Kishida S, Takeuchi I, Katou Y, Maris JM, Ichikawa H, Kondo Y, Sekido Y, Shirahige K, Murakami H, et al. Inactivation of SMC2 shows a synergistic lethal response in MYCN-amplified neuroblastoma cells. Cell Cycle. 2014;13(7):1115–31.
Allegra D, Bilan V, Garding A, Dohner H, Stilgenbauer S, Kuchenbauer F, Mertens D, Zucknick M. Defective DROSHA processing contributes to downregulation of MiR-15/−16 in chronic lymphocytic leukemia. Leukemia. 2014;28(1):98–107.
Davalos V, Suarez-Lopez L, Castano J, Messent A, Abasolo I, Fernandez Y, Guerra-Moreno A, Espin E, Armengol M, Musulen E, et al. Human SMC2 protein, a core subunit of human condensin complex, is a novel transcriptional target of the WNT signaling pathway and a new therapeutic target. J Biol Chem. 2012;287(52):43472–81.
Hutterer A, Glotzer M, Mishima M. Clustering of centralspindlin is essential for its accumulation to the central spindle and the midbody. Curr Biol. 2009;19(23):2043–9.
Seguin L, Liot C, Mzali R, Harada R, Siret A, Nepveu A, Bertoglio J. CUX1 and E2F1 regulate coordinated expression of the mitotic complex genes Ect2, MgcRacGAP, and MKLP1 in S phase. Mol Cell Biol. 2009;29(2):570–81.
Kato T, Lee D, Wu L, Patel P, Young AJ, Wada H, Hu HP, Ujiie H, Kaji M, Kano S, et al. Kinesin family members KIF11 and KIF23 as potential therapeutic targets in malignant pleural mesothelioma. Int J Oncol. 2016;49(2):448–56.
Sun L, Zhang C, Yang Z, Wu Y, Wang H, Bao Z, Jiang T. KIF23 is an independent prognostic biomarker in glioma, transcriptionally regulated by TCF-4. Oncotarget. 2016;7(17):24646–55.
Mao Y, Desai A, Cleveland DW. Microtubule capture by CENP-E silences BubR1-dependent mitotic checkpoint signaling. J Cell Biol. 2005;170(6):873–80.
Sullivan BA, Schwartz S. Identification of centromeric antigens in dicentric Robertsonian translocations: CENP-C and CENP-E are necessary components of functional centromeres. Hum Mol Genet. 1995;4(12):2189–97.
Kim Y, Holland AJ, Lan W, Cleveland DW. Aurora kinases and protein phosphatase 1 mediate chromosome congression through regulation of CENP-E. Cell. 2010;142(3):444–55.
Shan L, Zhao M, Lu Y, Ning H, Yang S, Song Y, Shi X. Chai W: [corrigendum] CENPE promotes lung adenocarcinoma proliferation and is directly regulated by FOXM1. Int J Oncol. 2019;55(6):1397.
Rath O, Kozielski F. Kinesins and cancer. Nat Rev Cancer. 2012;12(8):527–39.
McMahan ZH, Cottrell TR, Wigley FM, Antiochos B, Zambidis ET, Park TS, Halushka MK, Gutierrez-Alamillo L, Cimbro R, Rosen A, et al. Enrichment of scleroderma vascular disease-associated autoantigens in endothelial lineage cells. Arthritis Rheumatol. 2016;68(10):2540–9.
Thomson JR, Machado RD, Pauciulo MW, Morgan NV, Humbert M, Elliott GC, Ward K, Yacoub M, Mikhail G, Rogers P, et al. Sporadic primary pulmonary hypertension is associated with germline mutations of the gene encoding BMPR-II, a receptor member of the TGF-beta family. J Med Genet. 2000;37(10):741–5.
Machado RD, Aldred MA, James V, Harrison RE, Patel B, Schwalbe EC, Gruenig E, Janssen B, Koehler R, Seeger W, et al. Mutations of the TGF-beta type II receptor BMPR2 in pulmonary arterial hypertension. Hum Mutat. 2006;27(2):121–32.
Goumans MJ, Liu Z, ten Dijke P. TGF-beta signaling in vascular biology and dysfunction. Cell Res. 2009;19(1):116–27.
Goumans MJ, Valdimarsdottir G, Itoh S, Lebrin F, Larsson J, Mummery C, Karlsson S, ten Dijke P. Activin receptor-like kinase (ALK)1 is an antagonistic mediator of lateral TGFbeta/ALK5 signaling. Mol Cell. 2003;12(4):817–28.
Johnson R, Halder G. The two faces of Hippo: targeting the Hippo pathway for regenerative medicine and cancer treatment. Nat Rev Drug Discov. 2014;13(1):63–79.
Kudryashova TV, Goncharov DA, Pena A, Kelly N, Vanderpool R, Baust J, Kobir A, Shufesky W, Mora AL, Morelli AE, et al. HIPPO-integrin-linked kinase cross-talk controls self-sustaining proliferation and survival in pulmonary hypertension. Am J Respir Crit Care Med. 2016;194(7):866–77.
Luan Y, Zhang X, Kong F, Cheng GH, Qi TG, Zhang ZH. Mesenchymal stem cell prevention of vascular remodeling in high flow-induced pulmonary hypertension through a paracrine mechanism. Int Immunopharmacol. 2012;14(4):432–7.
We would like to thank Xin Liao and Senhong Ying (Institute of Genomic Medicine, Wenzhou Medical University) for suggestion in statistics of this study.
This study was supported by the National Science Foundation (81700062) and the Natural Science Foundation of Zhejiang Province grants (LQ16H010003), Science and Technology Project of Zhejiang Provincial Health Commission (2019RC050), Zhejiang Xinmiao Talents Program (2019R413082) and the General scientific projects of Zhejiang Education Department (Y201942208). The funders played no role in the design of the study and collection, analysis, and interpretation of data and in writing the manuscript. Publication costs were funded by the National Science Foundation (81700062).
Ethics approval and consent to participate
Consent for publication
All authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Additional file 2 Figure S2 A. Box diagram showes the homogeneous and comparable distribution of expression profile for each sample. The horizontal ordinate represents samples and the ordinate represents expression distribution of each sample. The green and red rectangles represent the case or control groups, respectively. B. Module 2 with MCODE score of 19.789 and Module 3 with MCODE score of 11.125 from the PPI network. The color shadow of nodes represents node’s Mcode_score (degree of connection of nodes).
Additional file 5 Table S1. Characteristics of the individual studies. Table S2. List of the top 100 DEGs according to the rank of adjusted P value (adj. P-value). Table S3. The enriched gene ontology (GO) categories of differentially expressed genes (DEGs). Table S4. The significantly Top 20 enriched pathways of differentially expressed genes (DEGs). Table S5. List of hub genes by PPI Degree and Modular Analysis. Table S6. Pathway enrichment analysis of Module genes function. Table S7. List of genes representing the top 50 key genes in more than 6 ways.
About this article
Cite this article
Luo, J., Li, H., Liu, Z. et al. Integrative analyses of gene expression profile reveal potential crucial roles of mitotic cell cycle and microtubule cytoskeleton in pulmonary artery hypertension. BMC Med Genomics 13, 86 (2020). https://doi.org/10.1186/s12920-020-00740-x
- Pulmonary arterial hypertension
- Differentially expressed gene
- Functional enrichment analysis
- Protein-protein interaction network