Integrated genome-wide association, coexpression network, and expression single nucleotide polymorphism analysis identifies novel pathway in allergic rhinitis
- 5.1k Downloads
Allergic rhinitis is a common disease whose genetic basis is incompletely explained. We report an integrated genomic analysis of allergic rhinitis.
We performed genome wide association studies (GWAS) of allergic rhinitis in 5633 ethnically diverse North American subjects. Next, we profiled gene expression in disease-relevant tissue (peripheral blood CD4+ lymphocytes) collected from subjects who had been genotyped. We then integrated the GWAS and gene expression data using expression single nucleotide (eSNP), coexpression network, and pathway approaches to identify the biologic relevance of our GWAS.
GWAS revealed ethnicity-specific findings, with 4 genome-wide significant loci among Latinos and 1 genome-wide significant locus in the GWAS meta-analysis across ethnic groups. To identify biologic context for these results, we constructed a coexpression network to define modules of genes with similar patterns of CD4+ gene expression (coexpression modules) that could serve as constructs of broader gene expression. 6 of the 22 GWAS loci with P-value ≤ 1x10−6 tagged one particular coexpression module (4.0-fold enrichment, P-value 0.0029), and this module also had the greatest enrichment (3.4-fold enrichment, P-value 2.6 × 10−24) for allergic rhinitis-associated eSNPs (genetic variants associated with both gene expression and allergic rhinitis). The integrated GWAS, coexpression network, and eSNP results therefore supported this coexpression module as an allergic rhinitis module. Pathway analysis revealed that the module was enriched for mitochondrial pathways (8.6-fold enrichment, P-value 4.5 × 10−72).
Our results highlight mitochondrial pathways as a target for further investigation of allergic rhinitis mechanism and treatment. Our integrated approach can be applied to provide biologic context for GWAS of other diseases.
KeywordsGenome-wide association study Allergic rhinitis Coexpression network Expression single-nucleotide polymorphism Coexpression module Pathway Mitochondria Hay fever Allergy
False discovery rate
Genome-wide association study
Expression quantitative trait loci
Expression single nucleotide polymorphism
Allergic rhinitis is an IgE-mediated inflammation of the upper airway that causes naso-ocular congestion, pruritis, rhinorrhea, and sneezing . Colloquially referred to as hay fever, seasonal allergies, and allergies, allergic rhinitis is one of the most common chronic diseases, affecting up to 30% of adults and 40% of children .
A genetic contribution to allergic rhinitis is evident, based on an increased incidence and prevalence of allergic rhinitis among twins and within atopic families [2, 3]. Despite the high population prevalence of allergic rhinitis, there have been relatively few studies of its genetic basis. The National Human Genome Research Institute catalogs just one genome wide association study (GWAS) of allergic rhinitis , for example, compared to 33 for asthma and 61 for diabetes . Candidate gene studies have been performed with variable effect sizes and levels of significance reported [3, 6, 7]. We are aware of three prior GWAS of allergic rhinitis. Andiappan et al. found no genome-wide significant loci in a GWAS of allergic rhinitis in 942 Chinese subjects . Ramasamy et al. reported one genome-wide significant locus in a GWAS meta-analysis of 12,898 Europeans . Hinds et al. reported 16 genome-wide significant loci for self-reported allergy in a GWAS meta-analysis of subjects of European ancestry . The functional implications of the identified loci were not directly examined in these studies. In a GWAS of allergen-specific IgE level (i.e. not allergic rhinitis), Bonnelykke et al. estimated that ten loci associated with allergen-specific IgE level accounted for 25% population-attributable risk for allergic rhinitis , but this was not from a direct study of allergic rhinitis. Of note, loci associated with allergen-specific IgE level have not been consistently associated with allergic rhinitis [4, 9]. Given that the genetic loci identified to date do not fully explain the estimated heritability of allergic rhinitis, it is likely that as yet unidentified genes and pathways contribute to allergic rhinitis pathogenesis.
GWAS results on their own, while helping to elucidate the etiology of disease, do not provide a rich context within which to interpret any finding [11, 12]. For example, for disease-associated SNPs in intergenic regions, the gene is not necessarily immediately known . Typically the closest gene is identified as the gene of interest, but that is not a foolproof algorithm, and the pathways affected by the genetic locus are also not necessarily immediately apparent . In addition, given the stringent P value thresholds that must be adopted in a GWAS to declare genome-wide significance, much of the data in a GWAS that may inform on disease is ignored because the association P values (and effect sizes) that reflect true associations cannot be distinguished from the noise .
Various methods have been tried to identify biologic context for loci identified by GWAS, including (1) expression quantitative trait loci (eQTL) mapping and expression single nucleotide polymorphism (eSNP) analysis [10, 15, 16, 17, 18], (2) network analysis [19, 20], and (3) pathway analysis [18, 21, 22, 23]. eQTL mapping and eSNP analysis are frequently used [15, 16, 17, 18]. The motivation for eQTL mapping and eSNP analysis is that genetic variation is more likely to impact a disease trait if it alters gene transcription. Linkage or association methods can be used to identify genetic loci influencing gene expression. The linkage-based identification of loci for gene expression is called eQTL mapping, and the association-based identification of SNPs affecting gene expression is called eSNP analysis . Because complex traits such as allergic rhinitis are unlikely to be governed by single genes or loci, however, eQTL and eSNP analyses alone may provide insufficient context. Network approaches can model vast networks of gene interactions that modulate disease [19, 20, 24]. Networks are formed by considering pairwise relationships between genes, including protein interaction relationships and coexpression relationships [14, 24]. Considering GWAS results in the context of whole-gene networks may thus provide the necessary context within which to interpret the disease role for a given gene or variant identified by GWAS. Finally, pathway analysis can help decipher the functional implications of coherent groups of genes with respect to gene ontology functional categories [18, 21, 22, 23]. Pathways representing specific biologic mechanisms may be overrepresented in genes identified by GWAS, thereby providing relevant biologic context for GWAS results.
Among all GWAS, some have reported findings without characterizing the effects of loci on gene expression and downstream biologic pathways [4, 8, 25], while others have incorporated eQTL/eSNP, network analysis, and pathway analysis individually to provide some evidence for downstream effect [15, 16, 17, 19, 21, 23]. Integrative approaches have elucidated biologic mechanisms and treatment targets in a number of disease areas including inflammatory bowel disease, Alzheimer’s disease, diabetes, heart disease, and obesity [15, 24, 26, 27, 28, 29], but similar strategies have not been widely applied to allergy. Further, the gene expression data used to support many GWAS are drawn from individuals distinct from those who were genotyped [16, 18, 19, 21, 23], rendering the analysis of any effects of genotype on gene expression indirect and potentially biased due to differences in subjects who were genotyped versus subjects with mRNA data. For example, while Hinds et al. performed GWAS to identify allergy-related loci in a sample of personal genetics company customers and birth cohort participants , they then identified expression quantitative trait loci (eQTL) among these loci using monocyte gene expression data from a distinct study cohort of heart disease.
We hypothesized that a genome-wide approach to allergic rhinitis integrating GWAS with eSNP, coexpression, and pathway analyses using gene expression data generated from disease-relevant tissue collected from the same individuals who were genotyped could enhance the power over standard GWAS to identify disease-relevant loci. Such an approach could not only provide more robust biological context, but also leverage data from cohorts that may not be large enough to yield high numbers of genome-wide significant GWAS results for complex traits such as allergic rhinitis. Here we present our integrated genomic analysis of allergic rhinitis, where we not only identified genome-wide significant genetic variants associated with allergic rhinitis, but also explored the biologic context for these results by profiling gene expression from CD4+ lymphocytes collected from genotyped subjects and performing expression single nucleotide polymorphism (eSNP), network, and pathway analyses. Our integrated approach identified a novel pathway in allergic rhinitis.
Baseline characteristics of North American subjects included in the study
African American/African Caribbean
550 K, 610 K
GWAS and meta-analysis
Four loci on chromosomes 2p22.3 near LINC0048, 3q29 near DLG1, 10p15.1 near AKR1E2, and 19q13.43 near ZNF776 were genome-wide significant among Latinos (Figure 3). The regional association plots for these loci are shown in Additional file 4: Figure S3. For context, each of these SNPs was directly genotyped in 2 of the 7 populations, imputation was performed using very conservative metrics , and the imputation scores for these SNPs demonstrated good confidence (Additional file 5: Table S2). The regional LD plots for these loci (Additional file 6: Figure S4) show that there were limited SNPs in LD with these genome-wide significant loci. The locus marked by rs7780001 on chromosome 7p21.1 near FERD3L was genome-wide significant in the meta-analysis across ethnic groups (P value 2.0 × 10−8; Figure 3 and Additional file 7: Figure S5) and had nominally significant associations in all three ethnic groups. The loci marked by rs2884670 on chromosome 12p13.32 near DYRK4 and rs7237244 on chromosome 18q11.2 near LAMA3 also had nominally significant associations in all three ethnic groups. Among the 17 loci previously identified by GWAS as associated with allergic rhinitis [4, 9], four were associated with allergic rhinitis with P value ≤ 0.05 in our study (Additional file 8: Table S3).
Individuals with allergic rhinitis frequently have comorbid asthma [1, 32]. Indeed, we observed that 2051 (76%) of those with allergic rhinitis had asthma, and 1195 (41%) of those without allergic rhinitis had asthma. As subphenotypes of AR based on asthma status are possible, we also performed secondary GWAS stratified by asthma status. These results are shown in the supplementary file (Additional file 9: Supplementary Results 1, Additional file 10: Table S4, and Additional file 11: Figure S6) and similarly showed ethnicity-specific findings. In Additional file 12: Table S5, we show the sample composition of the stratified analyses according to asthma status.
Genome-wide CD4+ gene expression and coexpression network to enhance GWAS
To assess the potential biological impact of the loci identified in our GWAS analyses, we collected and measured genome-wide gene expression in disease-relevant tissue (peripheral blood CD4+ lymphocytes) from 200 subjects who had undergone GWAS and constructed a gene coexpression network based on the gene expression data (Figure 1, blue box). We built the coexpression network to identify coexpressed gene modules (i.e. groups of genes with similar patterns of expression profiles and interconnectivity across the experimental samples), as these could serve as broader constructs of gene expression and provide a path to discover broader biologic context .
Integration of GWAS and CD4+ gene expression to explore biologic context for GWAS
To explore the biologic context for our GWAS results, we analyzed our GWAS and gene expression findings together (Figure 1, purple box).
GWAS loci that are eSNPs
We first performed eSNP analysis to assess for the association between genetic variation and gene expression (Figure 1, purple path). We then examined the GWAS and eSNP results together to identify GWAS loci that were eSNPs (Figure 1, turquoise path), as genetic variation that is associated with both the trait and gene expression is more likely to be biologically relevant than variants that are associated with the trait only with no effect on gene expression. We found that the 19q13.43 locus near ZNF776 was associated with allergic rhinitis (GWAS P value 5.0 × 10−8) as well as CD4+ gene expression (χ2 = 19.55, FDR-adjusted P value 0.00078). The other loci identified by GWAS were not associated with CD4+ gene expression.
Given the relatively modest size of our sample and the fact that we were examining a complex trait, we had anticipated that traditional GWAS would uncover only a small number of biologically relevant loci, even with the aid of eSNP analysis, as such an approach would rely upon detection of single variant associations with trait and expression. We therefore sought to leverage the CD4+ expression data more broadly through coexpression network and pathway analysis.
Coexpression modules tagged by GWAS loci
Compared to individual genes, coexpression modules identified through coexpression network analysis (Figure 4) can serve as more general constructs of gene expression, providing a path to discover broader context and related loci [14, 28, 31, 33, 34, 35, 36, 37, 38, 39, 40]. Motivated by the same rationale that genetic variation that is associated with both the trait and gene expression is more likely to be biologically relevant than variants associated with the trait only, we mapped GWAS loci to CD4+ coexpression modules and examined the modules that were tagged by GWAS loci (Figure 1, orange path). Specifically, we defined a GWAS locus as tagging a coexpression module if a coexpression module contained a gene within 250 kb of the locus. We found that 9 of the 22 GWAS loci tagged at least one coexpression module and 6 coexpression modules in total (Figure 3). These 6 modules (the brown, pink, magenta, red, midnight blue, and steel blue coexpression modules) tagged by GWAS loci were therefore considered candidate allergic rhinitis associated modules that could inform on allergic rhinitis biology. The 19q13.43 locus near ZNF776 was among the GWAS loci tagging coexpression modules, corroborating our eSNP results and illustrating the increased power of detection gained by using the coexpression module as a more general construct of gene expression. Of note, some of candidate allergic rhinitis associated modules were tagged by GWAS loci that would not have been considered remarkable by traditional criteria for genome-wide significance of individual loci (P value ≤ 5.0 × 10−8). Our approach of using GWAS loci to tag coexpression modules therefore allowed us to gain additional utility from our GWAS results.Among the 6 candidate allergic rhinitis associated modules tagged by GWAS loci, the brown module (representing mitochondrial pathways according to pathway analysis (Figure 4)) was tagged by 6 of the 22 GWAS loci (Figure 3). This proportion represented a significant enrichment (4.0-fold enrichment, P-value 0.0029) over chance, supporting a connection between these GWAS loci and mitochondrial pathway functions.
Coexpression module enrichment for allergic rhinitis-associated eSNPs
To further ascertain whether any of the 6 candidate allergic rhinitis associated modules were underlying allergic rhinitis susceptibility, we tested whether these modules were enriched for eSNPs that were also associated with allergic rhinitis (Figure 1, green path). eSNPs (i.e. SNPs associated with gene expression) represent functionally validated SNPs of interest in that they are associated with expression levels of genes in a cell type relevant to the disease under study . While such individual associations may not be meaningful, the pattern of associations enriched within a given coexpression module can provide strong statistical support for module involvement at the genetic level in the disease [14, 28, 31, 33, 34, 35, 36, 37, 38, 39, 40]. The additional association of eSNPs within a coexpression module with the disease of interest provides further statistical support that the coexpression module is involved in the disease, as the module is then not only enriched for eSNPs (e.g. SNPs associated with CD4+ gene expression), but more specifically, enriched for disease-associated eSNPs (e.g. SNPs associated with both CD4+ gene expression and allergic rhinitis).
In summary, pathway analysis of our results from the integration of GWAS and coexpression network analysis (showing a significantly high number of GWAS loci tagging the brown module), as well as from the integration of eSNP analysis and coexpression network results (showing greatest enrichment of the brown module for allergic rhinitis-associated eSNPs), both pointed to mitochondrial pathways as playing an important role in allergic rhinitis (Figure 1). Key to these results was that the coexpression network helped organize the expression traits into coherent, highly interconnected modules reflecting the biological processes at play in the tissue. By using GWAS loci to tag coexpression modules and then requiring these tagged coexpression modules to be enriched for eSNPs that were also associated with allergic rhinitis, we were able to place candidate GWAS associations in a more informed context that not only provided a biological context for GWAS interpretation, but enhanced confidence in the suggestive hits given an enrichment of multiple functional SNPs associating with the disease phenotype . Through this approach, we found consistent evidence that mitochondrial pathways likely play a role in the genetics and pathophysiology of allergic rhinitis.
A motivation for genome-wide studies is the desire to identify novel pathways and mechanisms in disease pathogenesis. A limitation of traditional genome-wide association studies is that statistically significant loci may be identified [4, 9], but the biological relevance of the individual or aggregate variants are often not evident [11, 12]. This is not overcome by replication of genotype associations, which has been the usual path taken to follow up GWAS findings, and one which has led to limited success [7, 9, 41]. Our study demonstrates the advantages of integrating network approaches with GWAS to identify and prioritize pathways and gene targets of biologic relevance. By integrating our GWAS findings with eSNP, coexpression, and pathway analyses using gene expression data from disease-relevant tissue generated from subjects who had undergone GWAS, we tested the potential biologic context of our GWAS findings through integrative methods and identified a novel pathway in allergic rhinitis—mitochondrial pathways. Our method allowed us to leverage data from multiple GWAS loci to identify biologic context for the aggregate findings. This is in contrast to traditional GWAS, where the implications of individual SNP associations are often challenging to define [11, 12]. Because complex traits such as allergic rhinitis are unlikely to be governed by single variants, strategies that capitalize on broader constructs of GWAS and gene expression results are more likely to yield informative disease context. We adopted such a strategy and were able to identify novel biologic context for allergic rhinitis. Further, our approach of integrating genotype and gene expression data generated from the same sample has not been widely applied to the study of allergic diseases. Our methods can be used to provide a richer biologic context for GWAS findings in other disease areas.
While mitochondrial pathways have not been associated with allergic rhinitis pathogenesis in traditional descriptions  or genetic studies of the disease [4, 8, 9], our findings and those from laboratory-based studies of airway dysfunction support a role for mitochondrial perturbations in allergic rhinitis pathogenesis. There is a strong link between upper (e.g. nasal) and lower (e.g. bronchial) airway disease pathogenesis , and mitochondrial perturbations have been observed to affect airway inflammation. Mitochondria are the major source of endogenous reactive oxygen species, which are required for normal function of the acquired immune response, including normal T-cell activation, B-cell differentiation, and T-cell and B-cell proliferation . Because alterations in the acquired immune response are observed in allergic inflammation and allergic rhinitis, mitochondrial disruption could play a role in allergic rhinitis. There are some experimental data in support of this hypothesis. OVA-induced allergic airway inflammation in BALB/c mice triggers mitochondrial dysfunction, including the reduction of cytochrome c oxidase activity in lung mitochondria, reduction in the expression of subunit III of cytochrome c oxidase in bronchial epithelium, appearance of cytochrome c in lung cytosol, and mitochondrial ultrastructural changes such as loss of cristae and swelling . Experiments using pollen, rather than ova, to induce allergic inflammation more akin to allergic rhinitis in humans, have also shown mitochondrial disturbance. Pollen grains and subpollen particles have intrinsic NADPH oxidases . Upon hydration in the airway epithelium they produce reactive oxygen species that induce oxidative stress . The pollen-induced oxidative stress damages mitochondrial respiratory chain proteins (specifically NADH dehydrogenase Fe-S protein (NDUFS) and ubiquinol-cytochrome c reductase core (UQCRC)) in human airway epithelial cells, triggers reactive oxygen species production from mitochondrial respiratory chain complex III, and induces mitochondrial dysfunction in complex III .
Mitochondrial changes induced by pollen can serve as the second hit leading to allergic inflammation if there is preexisting mitochondrial dysfunction. Indeed, when treated intranasally with pollen extract, mice with mitochondrial dysfunction (induced deficiency in UQCRC2) demonstrated evidence of allergic airway inflammation, in contrast to UQCRC2-sufficient control mice challenged with the same pollen extract . Specifically, these mice with mitochondrial dysfunction exhibited a 4.4 fold increase in bronchoalveolar lavage eosinophil counts, increased accumulation of peribronchial inflammatory cells, enhanced mucous cell metaplasia in airway epithelium, and increased airway hyperresponsiveness . Although these studies characterized changes in the lower airway, the strong link between upper and lower airway disease pathogenesis  suggests that analogous changes in the upper airway could cause individuals with preexisting mitochondrial dysfunction to develop allergic inflammation leading to allergic rhinitis with pollen exposure. Our results highlight this as a potential mechanistic area, as 27% (6/22) of the genetic loci for allergic rhinitis that we identified by genome-wide association analysis tagged a gene coexpression module that was not only markedly enriched for eSNPs associated with allergic rhinitis (3.4-fold, FDR-adjusted P-value 2.6 × 10−24), but also significantly enriched for mitochondrial pathways by pathway analysis (8.6-fold enrichment, FDR-adjusted P value 4.5 × 10−72).
Population-based studies additionally support a role for mitochondrial pathways in allergic rhinitis pathogenesis. Mitochondria are the primary sites of oxidative reactions. Levels of malondialdehyde (a marker of oxidative stress) are higher, and levels of reduced glutathione (an antioxidant) are lower in the exhaled nasal condensates of allergic rhinitis subjects compared to healthy controls . The epidemiological link between maternal history of atopy (as opposed to paternal history of atopy) and greater risk for allergic rhinitis in offspring [49, 50] may be explained by the fact that mitochondria are maternally transmitted. Consistent with this, mitochondrial haplotypes are associated with intermediate phenotypes of allergic rhinitis, including total serum IgE levels and skin prick test reactivity .
Our results suggest that reducing mitochondrial dysfunction could improve allergic rhinitis. In murine models of allergic airway inflammation, intratracheal administration of an antioxidant known to enter mitochondria and protect the electron chain from oxidative damage  decreased allergen-induced airway hyperreactivity and airway inflammation severity, as shown by reduced numbers of inflammatory cells in bronchoalveolar fluid . Again, these studies focused on lower airway disease, but the strong link between upper and lower airway disease pathogenesis  suggests that it may be possible to achieve similar results if the upper airway were targeted in allergic rhinitis treatment.
Our GWAS of allergic rhinitis was the first to examine ethnically diverse subjects for this complex trait, and our study revealed susceptibility loci that were specific to ethnicity. This is consistent with genome-wide association studies of other complex diseases-- such as asthma [25, 54] and obesity [55, 56]-- that have also demonstrated ethnicity-specific effects. Given the possibility that allergic rhinitis with comorbid asthma vs. allergic rhinitis without comorbid asthma may be distinct disease subphenotypes , we also performed secondary GWAS analyses stratified by asthma status. These results similarly showed ethnicity specific findings. Our findings support the utility of studying ancestrally-diverse populations in genome-wide studies.
We recognize the limitations of our study. We defined allergic rhinitis using criteria commonly employed in population-based and genetic studies of allergic rhinitis, which are based on questionnaire without objective markers [1, 4, 9]. For our eSNP and coexpression network analyses, it would have been ideal to have profiled gene expression for all 5633 subjects who participated in the GWAS, but CD4+ lymphocytes were not available from all subjects. We had CD4+ lymphocytes from European-American CAMP subjects only, and it is possible that coexpression results would have differed had we additionally had expression profiles from subjects of other ethnic backgrounds, as gene expression can vary by ethnicity. While expression differences can change with ethnicity, the connectivity structure is expected to be much more highly conserved, however, and is even seen across species . Additionally, we recognize that our coexpression network may have yielded distinct results had we chosen a different tissue for gene expression profiling; we had chosen to study peripheral blood CD4+ lymphocytes given their central role in allergic disease . Despite these limitations, we were able to implement an integrative analysis of our GWAS, coexpression network, and eSNP results, leading to the identification of a novel biologic pathway in allergic rhinitis. Our strategy created an informed biological context for our GWAS that may be used to better understand allergic rhinitis. Further, our methods may be implemented to provide biologic context for GWAS of other diseases.
Our GWAS of allergic rhinitis of 5633 ethnically diverse subjects demonstrated ethnicity-specific, genome-wide significant findings. To determine the potential biological impact of the variants identified in our GWAS, we integrated eSNP, coexpression network, and pathway analyses using gene expression data generated from subjects who had undergone GWAS. Our integrated approach identified mitochondrial pathways as important in allergic rhinitis, and our strategy may prove useful to studying other diseases.
Each study was approved by the Institutional Review Board of the corresponding institution. Informed consent was obtained for all study participants, and where appropriate, informed assent from minors and informed consent from their parent were obtained.
Subjects, genotyping, and phenotyping
Subjects were recruited from EVE Consortium centers in the United States, Mexico, and Barbados. Detailed descriptions of the individual studies, genotyping platforms, and quality control protocols have been previously described . Of note, SNPs with imputation quality scores below a threshold (Rsq < 0.3) were removed from the analysis. We included subjects who were specifically assessed for allergic rhinitis, and these came from 7 study centers (Figure 1). Allergic rhinitis status was considered positive if a subject reported a history of allergic rhinitis ever, defined as hay fever or runny/stuffy nose with sneezing or itching when the subject did not also have a cold or flu.
GWAS and meta-analysis
Summary files on a common set of SNPs were shared among the EVE Consortium investigators. Genotype imputation using HapMap reference panels and Markov Chain Haplotyping software (MaCH)  were performed in each sample as previously described . We pooled the imputed genotype data for each ethnic group (European American, Latino, African American/African Caribbean) (Figure 1). To adjust for potential population stratification, we used Eigenstrat  to create principal components for each ethnic group. Within each ethnic group, we tested for the association of SNPs with allergic rhinitis by constructing a test statistic that had a standard normal distribution under the null hypothesis of no association and captured the direction of the effect. Models were implemented in PLINK  and controlled for age, sex, and principal components. To allow for comparability with previous GWAS of allergic rhinitis [4, 8, 9], we did not include asthma status as a covariate. To assess for the effects of SNPs across ethnic groups, we then calculated a meta-analysis statistic as a combination of the individual ethnic study scores using METAL .
Recognizing the potential subphenotypes of isolated allergic rhinitis vs. allergic rhinitis with comorbid asthma [1, 32], we additionally performed secondary GWAS and meta-analyses in subjects stratified by asthma status using methods analogous to the above.
Genome-wide CD4+ gene expression
We collected peripheral blood CD4+ lymphocytes from 200 subjects who had undergone GWAS. These 200 subjects were from Childhood Asthma Management Cohort (CAMP) cohort , one of the member centers of the EVE Consortium (Figure 1). We focused on this sample subset because of biospecimen availability. Peripheral blood was collected into BD Vacutainer CPT tubes (BD Diagnostics, Franklin Lakes, New Jersey) and placed on ice. Samples were centrifuged within 1 hour of collection for 20 minutes at 1700RCF, followed by mononuclear cell layer isolation and suspension in 10 ml of PBS. We isolated CD4+ lymphocytes using anti-CD4+ microbeads by column separation (Miltenyi Biotec, Auburn, CA) using 20 μl anti-CD4+ Micro beads per 106 total cells. To extract total RNA, we used the RNeasy Mini Protocol (QIAGEN, Valencia, CA) and stored at −80°C. We generated expression profiles with the Illumina HumanRef8 v2 BeadChip arrays (Illumina, San Diego CA). Expression data were log2 transformed and quantile normalized.
Coexpression network analysis
We performed weighted gene coexpression network analysis to identify coexpressed gene modules . We used a previously applied, well-established, well-recognized, and validated method to construct the coexpression network [14, 28, 31, 33, 34, 35, 36, 37, 38, 39, 40]. For module detection, we used average linkage hierarchical clustering of a topological overlap matrix based on an adjacency matrix that is comprised of power-transformed correlations between gene expression profiles . To cut branches of the tree into gene modules, we used the dynamic tree cutting algorithm, which iteratively searches for stable branch sizes and chooses clusters based on the shape of each dendrogram branch . This algorithm allows manipulation of several parameters controlling the resultant cluster size and cohesiveness. The modules identified from the coexpression network were then carried forward into the integrative analysis.
To provide support for the specificity of our coexpression network, we generated multiple random coexpression networks where gene assignments were randomized (Random Networks 1–3), as well as random networks where the gene expression levels were randomized (Random Networks 4–6). We carried these random networks forward into the integrated analysis as well.
Integration of GWAS and CD4+ gene expression
We defined a GWAS locus (P value for association ≤ 1 × 10−6) as tagging a coexpression module if a coexpression module contained a gene within 250 kb of the locus. Coexpression modules tagged by GWAS loci were identified as candidate allergic rhinitis associated modules, and then assessed for enrichment of eSNPs that were also nominally associated with allergic rhinitis. A SNP was considered an eSNP if the SNP was located within 1 megabase of the corresponding gene, and the association between genotype and gene expression was significant at a 10% false discovery rate (FDR) (P value ≤ 1 × 10−4). For modules tagged by at least 1 GWAS locus, we used the Fisher’s exact test to assess whether a module was enriched for eSNPs that were also nominally associated (P value ≤ 0.01) with allergic rhinitis. The composition of modules was then assessed by pathway analysis using defined gene ontologies (GO) via the DAVID analysis tool [64, 65]. Overrepresentation of canonical pathways and biological processes in modules was measured via the Fisher’s exact test. P values from this test were FDR-adjusted given the number of modules and functional categories tested. Networks were visualized using the Cytoscape network visualization tool .
We thank all the participants who made this study possible. We appreciate contributions from Brooke Schuemann, Barbara Klanderman, PhD, Jody S. Sylvia, Ute Geigenmuller, PhD, Roxanne Kelly, M.B.A., Jose Rodriguez Santana, M.D, William Rodriguez Cintron, M.D., Rocio Chapela, M.D., Jean Ford, M.D., Shannon Thyne, M.D., and Pedro C. Avila, M.D. This work was supported by grants from the National Institutes of Allergy and Infectious Disease (K08AI093538 to S.B.; AI070503 to C.O.; AI079139 and AI061774 to L.K.W.; and AI077439 to E.G.B.), the National Heart, Lung, and Blood Institute (HL101651 to C.O. and D.L.N.; HL087665 to D.L.N.; HL085197, HL087665, HL072414, and HL49596 to C.O.; HL064307 and HL064313 to F.D.M.; HL075419, HL65899, HL083069, HL066289, HL087680, HL101543, and HL101651 to S.T.W.; HL079055 to L.K.W.; HL087699, to K.C.B.; HL061768, HL076647, P30ES007048, P01ES011627 to F.D.G.; HL087680 to W.J.G.; HL078885 and HL088133 to E.G.B.; HL087665 to D.A.M.; and HL069167 to E.R.B), the National Institute of Diabetes and Digestive and Kidney Diseases to L.K.W. (DK064695); the National Institutes of Environmental Health Sciences (ES020801 and ES022719 to W.J.G.; ES007048, ES009581, R826708, RD831861, and ES011627 to F.D.G.; ES015794 to E.G.B.; and the Division of Intramural Research, Z01 ES049019 to S.J.L.) of the National Institutes of Health. Also supported by the American Asthma Foundation and the Fund for Henry Ford Hospital (to L.K.W.), Mary Beryl Patch Turnbull Scholar Program (to K.C.B.); the Flight Attendant Medical Research Institute (FAMRI), RWJF Amos Medical Faculty Development Award, the American Asthma Foundation, the Sandler Foundation (to E.G.B.), and Fundación Ramón Areces (to M.P.Y.). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
- 1.Wallace DV, Dykewicz MS, Bernstein DI, Blessing-Moore J, Cox L, Khan DA, Lang DM, Nicklas RA, Oppenheimer J, Portnoy JM, Randolph CC, Schuller D, Spector SL, Tilles SA: The diagnosis and management of rhinitis: an updated practice parameter. J Allergy Clin Immunol. 2008, 122: S1-S84.CrossRefPubMedGoogle Scholar
- 4.Ramasamy A, Curjuric I, Coin LJ, Kumar A, McArdle WL, Imboden M, Leynaert B, Kogevinas M, Schmid-Grendelmeier P, Pekkanen J, Wjst M, Bircher AJ, Sovio U, Rochat T, Hartikainen AL, Balding DJ, Jarvelin MR, Probst-Hensch N, Strachan DP, Jarvis DL: A genome-wide meta-analysis of genetic variants associated with allergic rhinitis and grass sensitization and their interaction with birth order. J Allergy Clin Immunol. 2011, 128: 996-1005.CrossRefPubMedGoogle Scholar
- 5.Hindorff LA MJEBI, Morales J, Junkins HA, Hall PN, Klemm AK, Manolio TA, (European Bioinformatics Institute): A Catalog of Published Genome-Wide Association Studies. Available at: http://www.genome.gov/gwastudies Accessed September 19, 2013 2013
- 9.Hinds DA, McMahon G, Kiefer AK, Do CB, Eriksson N, Evans DM, St Pourcain B, Ring SM, Mountain JL, Francke U, Davey-Smith G, Timpson NJ, Tung JY: A genome-wide association meta-analysis of self-reported allergy identifies shared and allergy-specific susceptibility loci. Nat Genet. 2013, 45: 907-911.CrossRefPubMedPubMedCentralGoogle Scholar
- 10.Bonnelykke K, Matheson MC, Pers TH, Granell R, Strachan DP, Alves AC, Linneberg A, Curtin JA, Warrington NM, Standl M, Kerkhof M, Jonsdottir I, Bukvic BK, Kaakinen M, Sleimann P, Thorleifsson G, Thorsteinsdottir U, Schramm K, Baltic S, Kreiner-Moller E, Simpson A, St Pourcain B, Coin L, Hui J, Walters EH, Tiesler CM, Duffy DL, Jones G, Ring SM, McArdle WL, et al: Meta-analysis of genome-wide association studies identifies ten loci influencing allergic sensitization. Nat Genet. 2013, 45: 902-906.CrossRefPubMedPubMedCentralGoogle Scholar
- 15.Schadt EE, Molony C, Chudin E, Hao K, Yang X, Lum PY, Kasarskis A, Zhang B, Wang S, Suver C, Zhu J, Millstein J, Sieberts S, Lamb J, GuhaThakurta D, Derry J, Storey JD, Avila-Campillo I, Kruger MJ, Johnson JM, Rohl CA, van Nas A, Mehrabian M, Drake TA, Lusis AJ, Smith RC, Guengerich FP, Strom SC, Schuetz E, Rushmore TH: Mapping the genetic architecture of gene expression in human liver. PLoS Biol. 2008, 6: e107.CrossRefPubMedPubMedCentralGoogle Scholar
- 16.Greenawalt DM, Sieberts SK, Cornelis MC, Girman CJ, Zhong H, Yang X, Guinney J, Qi L, Hu FB: Integrating genetic association, genetics of gene expression, and single nucleotide polymorphism set analysis to identify susceptibility Loci for type 2 diabetes mellitus. Am J Epidemiol. 2012, 176: 423-430.CrossRefPubMedPubMedCentralGoogle Scholar
- 17.Zhang X, Johnson AD, Hendricks AE, Hwang SJ, Tanriverdi K, Ganesh SK, Smith NL, Peyser PA, Freedman JE, O'Donnell CJ: Genetic associations with expression for genes implicated in GWAS studies for atherosclerotic cardiovascular disease and blood phenotypes. Hum Mol Genet. 2013, 23: 782-795.CrossRefPubMedPubMedCentralGoogle Scholar
- 18.Anttila V, Winsvold BS, Gormley P, Kurth T, Bettella F, McMahon G, Kallela M, Malik R, de Vries B, Terwindt G, Medland SE, Todt U, McArdle WL, Quaye L, Koiranen M, Ikram MA, Lehtimaki T, Stam AH, Ligthart L, Wedenoja J, Dunham I, Neale BM, Palta P, Hamalainen E, Schurks M, Rose LM, Buring JE, Ridker PM, Steinberg S, Stefansson H: Genome-wide meta-analysis identifies new susceptibility loci for migraine. Nat Genet. 2013, 45: 912-917.CrossRefPubMedPubMedCentralGoogle Scholar
- 24.Zhang B, Gaiteri C, Bodea LG, Wang Z, McElwee J, Podtelezhnikov AA, Zhang C, Xie T, Tran L, Dobrin R, Fluder E, Clurman B, Melquist S, Narayanan M, Suver C, Shah H, Mahajan M, Gillis T, Mysore J, MacDonald ME, Lamb JR, Bennett DA, Molony C, Stone DJ, Gudnason V, Myers AJ, Schadt EE, Neumann H, Zhu J, Emilsson V: Integrated systems approach identifies genetic nodes and networks in late-onset Alzheimer's disease. Cell. 2013, 153: 707-720.CrossRefPubMedPubMedCentralGoogle Scholar
- 25.Torgerson DG, Ampleford EJ, Chiu GY, Gauderman WJ, Gignoux CR, Graves PE, Himes BE, Levin AM, Mathias RA, Hancock DB, Baurley JW, Eng C, Stern DA, Celedon JC, Rafaels N, Capurso D, Conti DV, Roth LA, Soto-Quiros M, Togias A, Li X, Myers RA, Romieu I, Berg DJ, Hu D, Hansel NN, Hernandez RD, Israel E, Salam MT, Galanter J: Meta-analysis of genome-wide association studies of asthma in ethnically diverse North American populations. Nat Genet. 2011, 43: 887-892.CrossRefPubMedPubMedCentralGoogle Scholar
- 28.Kang HP, Yang X, Chen R, Zhang B, Corona E, Schadt EE, Butte AJ: Integration of disease-specific single nucleotide polymorphisms, expression quantitative trait loci and coexpression networks reveal novel candidate genes for type 2 diabetes. Diabetologia. 2012, 55: 2205-2213.CrossRefPubMedPubMedCentralGoogle Scholar
- 29.Jostins L, Ripke S, Weersma RK, Duerr RH, McGovern DP, Hui KY, Lee JC, Schumm LP, Sharma Y, Anderson CA, Essers J, Mitrovic M, Ning K, Cleynen I, Theatre E, Spain SL, Raychaudhuri S, Goyette P, Wei Z, Abraham C, Achkar JP, Ahmad T, Amininejad L, Ananthakrishnan AN, Andersen V, Andrews JM, Baidoo L, Balschun T, Bampton PA, Bitton A: Host-microbe interactions have shaped the genetic architecture of inflammatory bowel disease. Nature. 2012, 491: 119-124.CrossRefPubMedPubMedCentralGoogle Scholar
- 32.Bousquet J, Schunemann HJ, Samolinski B, Demoly P, Baena-Cagnani CE, Bachert C, Bonini S, Boulet LP, Bousquet PJ, Brozek JL, Canonica GW, Casale TB, Cruz AA, Fokkens WJ, Fonseca JA, van Wijk RG, Grouse L, Haahtela T, Khaltaev N, Kuna P, Lockey RF, Lodrup Carlsen KC, Mullol J, Naclerio R, O'Hehir RE, Ohta K, Palkonen S, Papadopoulos NG, Passalacqua G, Pawankar R: Allergic Rhinitis and its Impact on Asthma (ARIA): achievements in 10 years and future needs. J Allergy Clin Immunol. 2012, 130: 1049-1062.CrossRefPubMedGoogle Scholar
- 33.Sivendran S, Chang R, Pham L, Phelps RG, Harcharik ST, Hall LD, Bernardo SG, Moskalenko MM, Sivendran M, Fu Y, De Moll EH, Pan M, Moon JY, Arora S, Cohain A, Difeo A, Ferringer TC, Tismenetsky M, Tsui CL, Friedlander PA, Parides MK, Banchereau J, Chaussabel D, Lebwohl MG, Wolchok JD, Bhardwaj N, Burakoff SJ, Oh WK, Palucka K, Merad M: Dissection of Immune Gene Networks in Primary Melanoma Tumors Critical for Antitumor Surveillance of Patients with Stage II-III Resectable Disease. J Invest Dermatol. 2014, doi:10.1038/jid.2014.85. [Epub ahead of print]Google Scholar
- 36.Yang X, Zhang B, Molony C, Chudin E, Hao K, Zhu J, Gaedigk A, Suver C, Zhong H, Leeder JS, Guengerich FP, Strom SC, Schuetz E, Rushmore TH, Ulrich RG, Slatter JG, Schadt EE, Kasarskis A, Lum PY: Systematic genetic and genomic analysis of cytochrome P450 enzyme activities in human liver. Genome Res. 2010, 20: 1020-1036.CrossRefPubMedPubMedCentralGoogle Scholar
- 41.Andiappan AK, De Wang Y, Anantharaman R, Suri BK, Lee BT, Rotzschke O, Liu J, Chew FT: Replication of genome-wide association study loci for allergic rhinitis and house dust mite sensitization in an Asian population of ethnic Chinese in Singapore. J Allergy Clin Immunol. 2013, 131: 1431-1433. e1438CrossRefPubMedGoogle Scholar
- 46.Dharajiya N, Choudhury BK, Bacsi A, Boldogh I, Alam R, Sur S: Inhibiting pollen reduced nicotinamide adenine dinucleotide phosphate oxidase-induced signal by intrapulmonary administration of antioxidants blocks allergic airway inflammation. J Allergy Clin Immunol. 2007, 119: 646-653.CrossRefPubMedPubMedCentralGoogle Scholar
- 52.Reboucas JS, Spasojevic I, Batinic-Haberle I: Pure manganese (III) 5,10,15,20-tetrakis (4-benzoic acid) porphyrin (MnTBAP) is not a superoxide dismutase mimic in aqueous systems: a case of structure-activity relationship as a watchdog mechanism in experimental therapeutics and biology. J Biol Inorg Chem. 2008, 13: 289-302.CrossRefPubMedGoogle Scholar
- 54.Galanter JM, Torgerson D, Gignoux CR, Sen S, Roth LA, Via M, Aldrich MC, Eng C, Huntsman S, Rodriguez-Santana J, Rodriguez-Cintron W, Chapela R, Ford JG, Burchard EG: Cosmopolitan and ethnic-specific replication of genetic risk factors for asthma in 2 Latino populations. J Allergy Clin Immunol. 2011, 128: 37-43. e12CrossRefPubMedPubMedCentralGoogle Scholar
- 55.Monda KL, Chen GK, Taylor KC, Palmer C, Edwards TL, Lange LA, Ng MC, Adeyemo AA, Allison MA, Bielak LF, Chen G, Graff M, Irvin MR, Rhie SK, Li G, Liu Y, Liu Y, Lu Y, Nalls MA, Sun YV, Wojczynski MK, Yanek LR, Aldrich MC, Ademola A, Amos CI, Bandera EV, Bock CH, Britton A, Broeckel U, Cai Q: A meta-analysis identifies new loci associated with body mass index in individuals of African ancestry. Nat Genet. 2013, 45: 690-696.CrossRefPubMedPubMedCentralGoogle Scholar
- 56.Wen W, Cho YS, Zheng W, Dorajoo R, Kato N, Qi L, Chen CH, Delahanty RJ, Okada Y, Tabara Y, Gu D, Zhu D, Haiman CA, Mo Z, Gao YT, Saw SM, Go MJ, Takeuchi F, Chang LC, Kokubo Y, Liang J, Hao M, Le Marchand L, Zhang Y, Hu Y, Wong TY, Long J, Han BG, Kubo M, Yamamoto K: Meta-analysis identifies common variants associated with body mass index in east Asians. Nat Genet. 2012, 44: 307-311.CrossRefPubMedPubMedCentralGoogle Scholar
- 57.Emilsson V, Thorleifsson G, Zhang B, Leonardson AS, Zink F, Zhu J, Carlson S, Helgason A, Walters GB, Gunnarsdottir S, Mouy M, Steinthorsdottir V, Eiriksdottir GH, Bjornsdottir G, Reynisdottir I, Gudbjartsson D, Helgadottir A, Jonasdottir A, Jonasdottir A, Styrkarsdottir U, Gretarsdottir S, Magnusson KP, Stefansson H, Fossdal R, Kristjansson K, Gislason HG, Stefansson T, Leifsson BG, Thorsteinsdottir U, Lamb JR: Genetics of gene expression and its effect on disease. Nature. 2008, 452: 423-428.CrossRefPubMedGoogle Scholar
- 62.Murphy A, Chu JH, Xu M, Carey VJ, Lazarus R, Liu A, Szefler SJ, Strunk R, Demuth K, Castro M, Hansel NN, Diette GB, Vonakis BM, Adkinson NF, Klanderman BJ, Senter-Sylvia J, Ziniti J, Lange C, Pastinen T, Raby BA: Mapping of numerous disease-associated expression polymorphisms in primary peripheral blood CD4+ lymphocytes. Hum Mol Genet. 2010, 19: 4745-4757.CrossRefPubMedPubMedCentralGoogle Scholar
- The pre-publication history for this paper can be accessed here:http://www.biomedcentral.com/1755-8794/7/48/prepub
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.