Keywords

FormalPara Core Message

There is a need for novel vaccine technologies where existing viral vaccine types (viruses, killed or inactivated viruses, and conjugate or subunits) are unsuitable against many viruses. Hence, short peptide (10–20 residues) vaccine candidates are considered promising solutions in recent years. These function on the principle of short epitopes developed through the binding of CD8+/CD4+-specific HLA alleles (12542 known so far). Thus, the specific binding of short peptide antigens to HLA alleles is rate limiting with high sensitivity in producing T-cell-mediated immune responses. Identification of HLA allele-specific antigen peptide binding is mathematically combinatorial and thus complex. Therefore, prediction of HLA allele-specific peptide binding is critical. Recent advancement in immune-informatics technologies with the aid of known X-ray-determined HLA-peptide structure data provides solutions for the accurate identification of short peptides as vaccine candidates for further consideration. Thus, we document the possibilities and challenges in the prediction, large-scale screening, development, and validation of short peptide vaccine candidates in this chapter.

1 Introduction

The types of approved viral vaccines include live attenuated viruses, killed/inactivated viruses, and conjugate/subunits. However, these types of vaccine technologies may prove unsuitable against some viruses. In some cases, there is interest in the development of short peptide vaccines to fill the gaps. For example, the use of live attenuated HIV-1/AIDS vaccines is not as yet approved due to safety concerns [1]. There are several subunit vaccines under consideration and evaluation. However, one of these, the NIAID and Merck Co.-sponsored 2004 STEP (HVTN 502 or Merck V520-023) trial using three recombinant adenovirus-5 (rAD5) vectors containing HIV-1 genes Ad5-gag, Ad5-pol, and Ad5-Nef, did not show promising results [2]. This has led to the development of a multifaceted strategy for HIV-1/AIDS vaccine development. However, encouraging results were observed with four priming injections of a recombinant canary pox vector (ALVAC-HIV) and two booster injections of gp120 subunit (AIDSVAX-B/E) in a community-based, randomized, multicenter, double-blind, placebo-controlled efficacy trial (NCT00223080) in Thailand [3]. The main concern following this study was that this vaccine did not affect the degree of viremia or the CD4 T-cell count in patients who later seroconverted. Further studies indicated that the challenges with the development of an HIV-1/AIDS vaccine are viral diversity and host-virus molecular mimicry [46]. Nonetheless, there is considerable amount of interest to develop gp160 (gp120-gp41 complex) TRIMER envelope (ENV) protein as a potential vaccine candidate [4].

The production of an HIV-1 ENV spike protein trimer complex is nontrivial due to protein size, protein type, sequence composition, and residue charge polarity. Therefore, the need for the consideration of alternative approaches for vaccine development such as T-cell-based HLA-specific short peptide vaccines is promising [6, 7]. The LANL HIV molecular immunology database provides comprehensive information on all known T-cell epitopes in the literature [8]. Thus, these resources in combination with other predictive advancements described in this chapter are collectively useful for the design, development, evaluation, and validation of short peptide vaccine candidates.

2 Methodology

2.1 Structural Data

A structural dataset of complexes for class I HLA-peptide (Table 1.1) and class II HLA-peptide (Table 1.2) is created from the protein databank (PDB) [9]. The characteristic features of the datasets are presented in Tables 1.1 and 1.2.

Table 1.1 Dataset of class 1 HLA-peptide structures downloaded from PDB
Table 1.2 Dataset of class 2 HLA-peptide structures downloaded from PDB

2.2 Structural Superposition of HLA Molecules

The peptide-binding grooves of both class I HLA (Fig. 1.1a) and class II HLA (Fig. 1.1c) molecules were superimposed using the molecular overlay option in the Discovery Studio software from Accelrys® [10].

Fig. 1.1
figure 1

The structural basis for short peptide vaccine design is illustrated. The allele-specific nomenclature defined, ethnicity profiled using known HLA sequences at the IMGT/HLA database [11], and the striking backbone structural similarity of antigen peptides at the HLA binding groove is the bottleneck. This is generated with using a dataset (Tables 1.1 and 1.2) of HLA-peptide complexes (67 class I and 16 class II) retrieved from protein databank (PDB) [9] using with Discovery Studio® (Accelrys Inc.) [10]. (a) The peptide-binding groove (superimposed) in class I HLA is structurally similar among known alleles and complexes. (b) The peptide-binding groove (superimposed) in class II HLA is structurally similar among known alleles and complexes; (c) class I HLA-bound peptides overlay showing structural constraints (bend peptides) at the groove; (d) class II bound peptides overlay showing extended conformation at the groove. This clearly suggests that class I (panel c) and class II (panel d) bound peptides do not have identical binding patterns at the groove

2.3 Molecular Overlay of HLA-Bound Peptides

HLA-bound peptides in the groove of both class I HLA (Fig. 1.1b) and class II HLA (Fig. 1.1d) molecules were overlaid using the molecular overlay option in the Discovery Studio software from Accelrys® [10].

2.4 Accessible Surface Area Calculations

Accessible surface area (ASA) was calculated using the WINDOWS software Surface Racer [12] with Lee and Richard implementation [13]. A probe radius of 1.4 Å was used for ASA calculation.

2.5 Relative Binding Measure

Relative binding measure (RBM) is defined as the percentage ASA Å2 of residues in the peptide at the corresponding positions buried as a result of binding with the HLA groove. This is the percentage change in ASA (ΔASA) of the position-specific peptide residues upon complex formation with the HLA groove (Fig. 1.2).

Fig. 1.2
figure 2

The peptide binding pattern at the groove is illustrated as function of residue position for class I and class II alleles using a dataset (Tables 1.1 and 1.2) of HLA-peptide complexes (67 class I and 16 class II) retrieved from protein databank (PDB). This dataset is represented by several class I and class II alleles (see Tables 1.1 and 1.2). The peptide lengthwise distribution of the binding pattern is shown as relative binding measure using change in solvent-accessible surface area upon complex formation with the HLA groove

3 Results and Discussion

3.1 HLA-Peptide Binding Prediction for T-Cell Epitope Design

The rate-limiting step in T-cell epitope design is allele-specific HLA-peptide binding prediction. The number of known HLA alleles is over 12542 in number as of March 2015 at the IMGT/HLA database [11]. Hence, a number of methods have been formulated so far and optimized for HLA-peptide binding prediction during the last two decades. Structural information on HLA-peptide complexes has increased our understanding of their binding patterns (Tables 1.1 and 1.2). The HLA-binding groove is structurally similar among class I (Fig. 1.1a) and class II (Fig. 1.1b) alleles. The class I (Fig. 1.1c) and class II (Fig. 1.1d) bound peptides do not show an identical binding pattern at the groove. A detailed illustration of peptide binding patterns (Fig. 1.2) at the groove of class I and class II alleles provides valuable insights using mean and deviation profiles (Fig. 1.3).

Fig. 1.3
figure 3

The mean peptide binding pattern with standard deviation (SD) at the groove is illustrated as function of residue position for class I and class II alleles using a dataset (Tables 1.1 and 1.2) of HLA-peptide complexes (67 class I and 16 class II) retrieved from protein databank (PDB). This provides insight into the understanding of the nature of peptide binding at the groove towards the design of an effective T-cell epitope candidate

A comprehensive description of HLA-peptide binding prediction is documented [14, 15]. Lee and McConnell [16] proposed a general model of invariant chain association with class II HLA using the side-chain packing technique on a known structural template complex with self-consistent ensemble optimization (SCEO) [17, 18] using the program CARA in the molecular visualization/modeling software LOOK (Molecular Application Group (1995), Palo Alto, CA) [16, 19]. This was an important development in the field and the approach was extended to a large dataset of known HLA-binding peptides. Kangueane et al. [20] collected over 126 class I peptides with known IC50 values from literature with defined HLA allele specificity. These peptides were modeled using available templates for a large-scale assessment of peptide binding to defined HLA alleles. Thus, a structural framework was established for discriminating allele-specific binders from non-binders using rules derived from a dataset of HLA-peptide complexes. This procedure was promising.

An extended dataset of class 1 and class 2 complexes were manually created, curated, and analyzed for insights into HLA-peptide binding patterns at the groove [21]. These studies lead to a detailed analysis of the HLA-peptide interface at the groove and the importance of peptide side chain and backbone atomic interactions were realized [22]. Meanwhile, the amount of structural data on HLA-peptide complexes was increasing in size leading to the development of an online database [23]. Thus, information gleaned from HLA-peptide structural complexes helped to identify common pockets among alleles in the binding groove and provided insights into functional overlap among them [24]. The need for a simple, robust, generic HLA-peptide binding prediction was evident. Therefore, a model was formulated by defining virtual pockets at the peptide-binding groove using information gleaned from a structural dataset of HLA-peptide complexes [25]. The model (average accuracy of 60 %) was superior because of its application to any given class I allele whose sequence is clearly defined. The model (53 % accuracy) was then extended for class II prediction using a class II-specific HLA-peptide structural dataset [26].

The techniques thus far established are highly promising towards short peptide vaccine design and development [27, 28]. Nonetheless, it was observed that alleles are covered within few HLA supertypes, where different members of a supertype bind similar peptides, yet exhibiting distinct repertoires [29]. These principles led to the development of frameworks to group alleles into HLA supertypes [30, 31], understand their structural basis [32], and cluster alleles based on electrostatic potential at the groove [33]. These observations should aid in the design of peptide vaccine candidates for viruses including HIV/AIDS [5, 6]. Further, for example, the importance of protein modifications to enhance HIV-1 ENV trimer spike protein vaccine across multiple clades, blood, and brain is discussed [4]. Currently available types of vaccine technology [34, 35], such as live virus, killed virus, and conjugate vaccines, have failed to produce a promising vaccine against several clinically important viruses, including HIV/AIDS [36]. Therefore, short peptide vaccines are promising solutions for viral vaccine development. It should be noted that there are many other viruses for which vaccines are needed. Examples of additional viruses for which there are no vaccines available, vaccines are still under development, vaccine failures occurred, or more effective vaccines are needed include RSV, measles, HBV, WNV, Coronaviruses, H5N1 influenza virus, HCV, Adenovirus, Hantavirus, and Filoviruses [3747].

4 Conclusion

The design and development of short peptide cocktail vaccines is a possibility in the near future. This function on the principle of short epitopes developed through the binding of CD8+/CD4+-specific HLA alleles. HLA molecules are specific within ethnic populations and are polymorphic with more than 12542 known alleles as of March 2015. Thus, the binding of short peptide antigens to HLA alleles is rate limiting yet specific, with high sensitivity, while producing T-cell-mediated immune responses. Our understanding of this specific peptide binding to HLA alleles has improved using known HLA-peptide complexes. There is a search for superantigen peptides covering major HLA supertypes. Thus, peptide-binding predictions with large coverage, accuracy, sensitivity, and specificity are essential for vaccine candidate design and development. It should be noted that available HLA-peptide binding prediction methods are highly promising in these directions.