Background

Hepatitis C virus (HCV) infection is one of the most common chronic blood-borne infections in the world. The estimated HCV prevalence was 2.35% in 2010, affecting 160 million individuals, 29 million of whom were in China [1],[2]. One significant characteristic of HCV infection is its high possibility of chronicity, which usually leads to cirrhosis and hepatocellular carcinoma [3],[4].

The human leukocyte antigen (HLA) genomic region at chromosomal position 6p21 encodes many genes, which are important for immune system [5]. These genes are categorized into three basic groups: class I (HLA-A, -B, and -C), class II (HLA-DR, -DQ, and -DP), and class III. Previous studies have focused on classical class-I and -II regions and showed that polymorphisms in these regions were associated with chronic hepatitis C [6]–[8]. There are also some non-classic genes located among those classic regions and genetic functions of those genes are not fully understood yet. For example, TAP (the transporter associated with antigen processing) and LMP (large molecular weight proteasome) are located in the HLA-DP and -DQ interval and encode proteins that are involved in antigen processing and presentation. Our earlier work indentified some genomic variants of TAP and LMP that were associated with chronic hepatitis B and hepatitis C [9],[10].

Among the genes located in the HLA class-II DQ-DP interval, eight genes are categorized into antigen processing/presentation-related genes, including TAP1, TAP2, LMP2, LMP7, HLA-DMA, HLA-DMB, HLA-DOA, and HLA-DOB [11],[12]. TAP1 and TAP2 encode the two subunits of TAP. During antigen presentation, TAP works with its binding protein TAPBP, which is encoded by tapasin located near the centromere of chromosome 6 [13]. LMP2 and LMP7 encode LMP. TAP and LMP transport antigenic peptides from the cytosol into the endoplasmic reticulum in an ATP-dependent manner [14]. HLA-DMA and HLA-DMB encode two chains of DM complex, which is required for the assembly of antigenic peptides with the HLA class-II molecules [15]. DO, a protein complex translated from HLA-DOA and HLA-DOB, negatively regulates the activity of DM [16],[17].

Very few studies have investigated the influence of variants in these nine genes on HCV infection [10],[18]. Considering the importance of antigen presentation during immune response to HCV as well as what we have observed previously, we hypothesized that genomic variants of these genes may play a role in generating immune responses and contribute to the outcomes of HCV infection. The current study was to reveal any possible association of these nine genes with HCV in a Chinese population of former paid-blood donors.

Methods

Participants

All participants were recruited from a population of former paid-blood donors in Zhenjiang, Jiangsu Province, China. These participants had a high risk of acquiring HCV infection due to improper use of needles and syringes. The study protocol was approved by the institutional review committee of Nanjing Medical University. Interviews for donation history and other risk factors were conducted with signed informed consent from April 2010 to January 2013. An approximately 10 mL of blood sample was then collected for serological tests and host DNA genotyping. Those eligible for the study were excluded if they were positive of any of the following conditions: hepatitis B surface antigen (HBsAg), other types of liver diseases, alcoholic diseases, metabolic liver diseases, and previous interferon and/or ribavirin therapy. The subjects tested to be anti-HCV negative were defined as the uninfected ones. The subjects tested to be anti-HCV positive were defined as the infected cases. The infected cases were further classified into chronic cases (HCV RNA positive) and spontaneous resolvers (HCV RNA negative).

Laboratory analysis and genotyping of SNPs

Sera HBsAg and HCV antibody (anti-HCV) were detected by enzyme-linked immunosorbent assay (Beijing Wantai Biological Pharmacy Engineering Co., Ltd., Beijing, China) following the manufacturer’s instructions. Blood biochemical tests were undertaken by Roche MODULE P800 Automatic Biochemical Analyzer (Roche Co., Ltd., Shanghai, China). Total RNA was extracted from serum using Trizol LS Reagent (TaKaRa Biotechnology Co., Ltd., Dalian, China), and HCV RNA was detected by RT-PCR with specific primers using RT-PCR kit (TaKaRa Biotechnology Co., Ltd., Dalian, China) [10].

DNA extraction was performed by protease K digestion and phenol-chloroform purification as described previously [10]. The information of single-nucleotide polymorphisms (SNPs) in 9 candidate genes (HLA-DMA, HLA-DMB, HLA-DOA, HLA-DOB, TAP1, TAP2, LMP2, LMP7, and tapasin) were obtained from the NCBI dbSNP database (http://www.ncbi.nlm.nih.gov/SNP) and the Chinese Han population (CHB) database of HapMap (http://www.hapmap.org). All the SNPs were filtered with the criteria: (1) MAF (minor allele frequency) ≥ 0.05; (2) Hardy-Weinberg equilibrium test P value ≥ 0.05. Tag SNPs were selected to represent a group of variants with strong linkage disequilibrium (LD). A total of 34 SNPs were chosen for genotyping. The chosen SNPs were listed in both Additional file 1: Table S1 and S2. Genotyping was performed by the TaqMan allelic discrimination assay on ABI PRISM 7900HT Sequence Detection system (Applied Biosystems, San Diego, CA, USA). The information of primers and probes was shown in Additional file 1: Table S1. Two blank controls and five repeated samples were assigned into each 384-well format for quality control, and a 100% concordance was achieved. The success rates of genotyping for 34 SNPs were all above 95%. The samples failed for genotyping were excluded from the statistical analyses. The observed genotype frequencies of these SNPs in the remaining subjects with different HCV status were all in Hardy-Weinberg equilibrium.

Statistical analysis

Differences in the general demographic characteristics were calculated by the Student t test or One-Way ANOVA and the chi-square (χ 2) test. The associations of SNPs with HCV susceptibility and HCV clearance were estimated by the odds ratios (ORs) and 95% confidence intervals (CIs) using both univariate and multivariate logistic regression analysis. Adjustment for age, gender, history of blood/plasma donation, and times of donation was conducted during regression analysis. The χ 2-based Q test was used for homogeneity among strata by selected variables. The trend analysis was assessed with Cochran-Armitage trend test. LD parameters (i.e., D’ and r2) were analyzed by Haploview. The haplotype frequencies based on the observed genotypes were estimated by PHASE software (v2.1) [19],[20]. False discovery rate (FDR) correction was performed as described in reference [21]. All the statistical analyses were carried out by SAS 9.1.3 software (SAS Institute, Cary, NC, USA), and P < 0.05 in a two-sided test was considered as statistical significance.

Results

Demographic and selected variables of participants

All enrolled participants were divided into three groups based on their anti-HCV and HCV RNA status, including 482 anti-HCV negative subjects (uninfected subjects), 193 spontaneous clearance cases (anti-HCV positive and HCV RNA negative, also called resolvers), and 532 persistent HCV cases (both anti-HCV and HCV RNA positive, also called chronic cases). The later two groups were combined as HCV infected subjects. Some demographic and selected characteristics were shown in Table 1. Distribution of age and gender was not different among groups. Consistent with current knowledge of HCV transmission pattern, experience of plasma donation was associated with HCV infection and the risk of HCV chronicity was increased with the times of plasma donation. The level of aspartate aminotransferase (AST) and alanine aminotransferase (ALT) indicated that HCV infected subjects had worse liver function compared with non-infected subjects.

Table 1 Demographic and selected variables in subjects with different HCV infection outcomes

Association of candidate SNPs with HCV infection outcomes

Three genetic models (dominant, recessive, and additive models) were used in analysis of each SNP. Significance in any model was considered as a possible association of these SNPs with HCV infection (Additional file 1: Table S2). After adjustment for age, gender, experience of blood/plasma donation, and times of blood/plasma donation, logistic regression analyses showed that variants in HLA-DMA, HLA-DOA, HLA-DOB, LMP2, and LMP7 were associated with outcomes of HCV infection.

The allelic frequencies of candidate genes were firstly compared among the uninfected controls and infected cases (including both natural clearance cases and chronic cases). HLA-DMA rs1063478-T mutant had a decreased frequency in infected cases compared with C wild type. HLA-DOA rs2284191-A (vs. G) and HLA-DOB rs11244-T (vs. C) variants were positively related with anti-HCV (Table 2a). Conditional logistic regression analysis was performed to test the independence of these SNPs. The effect of rs11244 was weakened (P = 0.078) after being conditioned on the other two SNPs, so it was excluded from further analysis of combined effect (Additional file 1: Table S3a). The combined effect of two independent SNPs (rs1063478-T and rs2284191-G) was analyzed by Cochran-Armitage’s trend test. The results showed an increased protection effect with more favorable alleles (P = 0.037). Carrying three favorable alleles offered the highest protective effect (OR = 0.46, 95% CI = 0.27-0.78), as showed in Table 3a.

Table 2 Association of selected SNPs with HCV infection outcomes
Table 3 Cumulative effects of selected SNPs on HCV infection outcomes

The ability of viral clearance was then compared among the spontaneous infection subjects and the persistent infection subjects. LMP2 rs17587-A (vs. G), LMP7 rs2071543-A (vs. C), HLA-DOA rs2284191-A (vs. G), rs376892-T (vs. C), rs416622-A (vs. G), and HLA-DOB rs7383287-G (vs. A) were positively associated with HCV clearance (Table 2b). The effect of rs17587 and rs7383287 on HCV clearance remained obvious after being conditioned on the other five SNPs (P = 0.03 for rs17587 and P = 0.007 for rs7383287) (Additional file 1: Table S3b). Therefore, the independent SNPs of LMP2 rs17587 and HLA-DOB rs7383287 were analyzed in Cochran-Armitage’s trend test. There was also an increased protection effect with more favorable alleles (P = 0.001). Subjects carrying two favorable alleles had a 58% decrease in risk of HCV persistent infection (OR = 0.42, 95% CI = 0.26-0.66) (Table 3b).

Stratified analysis of independent SNPs

The association between the combined effect of the independent SNPs and HCV infection was further evaluated by adjustment for confounding factors including age, gender, experience of blood/plasma donation, and times of blood/plasma donation. The results were shown in Additional file 1: Table S4.

The combined protective effect of HLA-DMA rs1063478-T and HLA-DOA rs2284191-G was more prominent in female subgroup (OR = 0.68, 95% CI = 0.49-0.94). Subjects < 60 years favored more from the protective effect of rs1063478-T and rs2284191-G as compared with subjects ≥ 60 years (OR = 0.62, 95% CI = 0.43-0.88). The protective effect of carrying rs1063478-T and rs2284191-G was also more obvious in subjects with experience of blood donation (OR = 0.69, 95% CI = 0.51-0.94), subjects with blood donation ≥ 30 times (OR = 0.61, 95% CI = 0.40-0.93), subjects with experience of plasma donation (OR = 0.66, 95% CI = 0.49-0.90), and subjects with plasma donation ≥ 50 times (OR = 0.67, 95% CI = 0.47-0.95). Heterogeneity test showed that heterogeneity in every two strata was significant for gender and plasma donation (P = 0.023 and 0.024, respectively).

The combined protective effect of LMP2 rs17587-A and HLA-DOB rs7383287-G was more pronounced in female subjects (OR = 0.63, 95% CI = 0.43-0.93), subjects < 60 years (OR = 0.49, 95% CI = 0.31-0.78), subjects with experience of blood donation (OR = 0.67, 95% CI = 0.47-0.97), subjects with blood donation < 30 times (OR = 0.50, 95% CI = 0.31-0.81), subjects with experience of plasma donation (ever vs. never, OR = 0.61, 95% CI = 0.43-0.87), and subjects with plasma donation ≥ 50 times (OR = 0.46, 95% CI = 0.27-0.78). No obvious evidence of heterogeneity associations was observed.

Interaction analysis

The interaction between the meaningful SNPs and potential risk factors was also analyzed. The results were shown in Table 4. Significant multiplicative interactions on HCV susceptibility were found between the combined effects of rs1063478 and rs2284191 and plasma donation (P interaction = 0.020). Compared to subjects carrying 0-2 favorable alleles and with experience of plasma donation, subjects carrying 3-4 favorable alleles but without plasma donation had a decreased detection rate of anti-HCV (OR = 0.18, 95% CI = 0.10-0.32). There was also a multiplicative interaction between rs7383287 genotypes and exposure of blood donation (P interaction = 0.040). Compared to subjects carrying rs7383287AA genotypes and with experience of blood donation, subjects carrying AG/AA genotypes but without blood donation had a 66% decrease in risk of HCV persistent infection (OR = 0.34, 95% CI = 0.15-0.78).

Table 4 Interaction analysis between favorable SNPs and selected risk factors

Haplotype analysis

Because high linkage disequilibrium exists in HLA region, we also analyzed the LD among the candidate SNPs (detailed information showed in Additional file 1: Table S5) and then we performed haplotype analysis.

In the above analyses, rs1063478, rs2284191, and rs11244 were identified to be related with anti-HCV status. Compared with the most frequent CGC haplotype, the haplotype with rs1063478-T (TGC) was associated with a protective effect (P < 0.001), while the haplotype carrying rs2284191-A (CAC) indicated a risk effect of HCV infection (P < 0.001) (Table 5a).

Table 5 Haplotype analysis with different HCV outcomes

Six SNPs were identified to be related with chronicity of HCV infection, including rs17587, rs2071543, rs2284191, rs7383287, rs376892 and rs416622. The most frequent haplotype was GCGACG. No difference of viral clearance was found between reference haplotype GCGACG and haplotype carrying rs2071543 (GAGACG). Haplotypes carrying rs7383287-G (GCGGCG) and rs17587-A (ACGACG) were more frequently found in spontaneous clearance group than in chronic infection group (P < 0.001 and P = 0.001, respectively). Haplotypes carrying the other alleles enhanced the risk of chronic HCV infection. The results were shown in Table 5b.

Discussion

HCV infection is now considered as curable and therefore lots of studies including several GWAS have focused on genetic polymorphisms and treatment response [22],[23]. Comparatively fewer data reported association of genetic variants with HCV infection outcomes. A recent GWAS study demonstrated that SNPs near interleukin-28B (IL-28B) and DQB might explain approximately 15% of spontaneous resolution of HCV infection with European and African ancestry [24]. This GWAS together with other studies have revealed that HLA is one of the most important regions with respect to viral hepatitis infection. However, there are still some shortages of relative studies about HCV infection outcomes. One major difficulty is owing to sample collection because HCV prevalence in general population is very low. A detailed review compared studies from different ethnic groups and speculated that the chance was limited to detect globally common HLA haplotypes due to high diversity in HLA loci [25]. Current GWAS data are based on mixed populations and most published high quality studies were conducted in non-Chinese population. Conventional genotyping for precise detection of HLA loci in specific population is definitely necessary to complement GWAS approach. The present study investigated a HLA region with little attention and defined several new genomic loci that were related with HCV susceptibility and clearance in Chinese Han population. We selected and analyzed 34 tagging-SNPs in 9 candidate genes (HLA-DMA, HLA-DMB, HLA-DOA, HLA-DOB, TAP1, TAP2, LMP2, LMP7, and tapasin), which are involved in antigen processing and presentation. The results indicated that HLA-DMA rs1063478 and HLA-DOA rs2284191 were independent factor of being anti-HCV positive, while HLA-DOB rs7383287 and LMP2 rs17587 were independent factor of infection chronicity. Stratified analyses and haplotype analyses all showed that rs1063478-T mutant and rs2284191-G wild type had protective effect from HCV infection, and wild type rs17587-A with rs7383287-G could help clear virus.

Variants in HLA-DMA, HLA-DOA, and HLA-DOB were reported for the first time to play a role in HCV infection. HLA-DMA rs1063478 C > T makes a missense mutation. Mutation may change the activity of the encoding protein and influence antigen presentation process. HLA-DOA rs2284191 (G > A) was in intron region. HLA-DOB rs7383287 (A > G) make a synonymous mutation. It is hard to explain the biological plausibility of those two SNPs without further studies. However, HLA-DOB rs7383287 seems to be a strong biomarker of HCV clearance, because the association was significant in dominant, recessive, and additive all three models (Table 2b).

LMP2 rs17587 G > A also makes a missense mutation. Based on our previous study conducted in injecting drug users, LMP2 rs17587 had no relationship with the outcomes of HCV infection, which was not in accordance with this study [10]. Another study in a European Caucasian population found that G allele in exon 4 of tapasin was associated with outcomes of HCV infection, which was not observed in out study [18]. The discrepancy among these studies may be due to the different study design as well as the participants with dissimilar physical conditions and genetic background. The inconsistency among studies also illustrates that collecting data from specific population should help reveal unique disease-related effects in a given genetic background rather than in generalized population setting.

Some methodological issues need to be mentioned. Firstly, selection bias may be induced during sample collection. Although only subjects with risk exposure of paid-blood and/or plasma donation were recruited, some uninfected controls may never have contact with the pathogen. Therefore the uninfected status may not be owned to carrying protective alleles and the estimated risk association would be inaccurate. Nevertheless, considering the frequency and lasting period of blood donation, most subjects should have exposure history and this bias might be minimal. The selected population also has other advantages. The subjects were exposed during the same period and their infection outcomes were steady after decade. The subjects live in the same area and share similar environment exposure. All the advantages make the current population one of the best choice for studying the natural history of HCV infection. Secondly, 34 SNPs were included in the present study, which may increase the false positive results because of multiple comparisons. Therefore the association found in this study could be a false-positive result. For this reason we performed FDR correction for all SNPs, which was provided in Additional file 1: Table S2. As a result, HLA-DOB rs7383287 and LMP2 rs17587 were highly possible to be related with HCV chronicity because the FDR was pretty small in all three genetic models (FDR < 0.3 as significance). Although we analyzed 34 SNPs, other SNPs covering more regions might also contribute to the risk of HCV chronicity. It should be noticed that the association strength of HLA-DOB rs7383287 and LMP2 rs17587 could be weakened in a larger scale study as GWAS. Frankly speaking, future work with more subjects and covering more SNPs is needed to confirm this association as well as to explore other potential loci. Finally, HCV viral genotype may be a confounding factor and we did not include in analysis. Based on our current knowledge, HCV 1b strain is the dominant strain in China and accounts for over half of the HCV cases we collected [9],[10],[26],[27]. We did not adjust viral genotype in analysis due to insufficient subgroup sample size. More samples are needed for further stratified analysis.

Conclusion

Our results suggested that genetic variants in antigen presentation pathway had influence on susceptibility to HCV infection and viral clearance. HLA-DMA rs1063478, HLA-DOA rs2284191, and HLA-DOB rs7383287 were identified as novel loci in Chinese population that were involved in HCV infection.

Additional file