Mapping the sequence mutations of the 2009 H1N1 influenza A virus neuraminidase relative to drug and antibody binding sites
- 28k Downloads
In this work, we study the consequences of sequence variations of the "2009 H1N1" (swine or Mexican flu) influenza A virus strain neuraminidase for drug treatment and vaccination. We find that it is phylogenetically more closely related to European H1N1 swine flu and H5N1 avian flu rather than to the H1N1 counterparts in the Americas. Homology-based 3D structure modeling reveals that the novel mutations are preferentially located at the protein surface and do not interfere with the active site. The latter is the binding cavity for 3 currently used neuraminidase inhibitors: oseltamivir (Tamiflu®), zanamivir (Relenza®) and peramivir; thus, the drugs should remain effective for treatment. However, the antigenic regions of the neuraminidase relevant for vaccine development, serological typing and passive antibody treatment can differ from those of previous strains and already vary among patients.
This article was reviewed by Sandor Pongor and L. Aravind.
KeywordsOseltamivir Zanamivir Neuraminidase Inhibitor H1N1 Strain Antibody Binding Site
The recent epidemic of the "2009 H1N1" influenza A virus (also called swine or Mexican flu) has put the world on alert since a new swine flu strain (naturally hosted by pigs) has crossed the species barrier to human and, apparently, acquired the capability for human to human transmission [1, 2]. Given earlier experiences with risks of viral pandemics such as SARS and the avian flu , global control and public health surveillance mechanisms provided sequences of the new flu strain in public sequence databases within weeks of the outbreak. Here, we analyze the protein sequence of its neuraminidase with respect to similarities and differences to known strains and implications on drug treatment and vaccination.
Domain architecture and posttranslational modifications
Next, a presumably unstructured linker region (residues 35 to 82) connects the membrane anchor to the catalytic neuraminidase domain (residues 83 to 469; Figure 1). Such unstructured linker regions are rich in small and polar residues and often harbour sites for posttranslational modifications [10, 11]. Probable posttranslational modification sites in the neuraminidase of the new strain are glycosylation motifs involving N88, N146 and N235, which correspond to residues that are also glycosylated in other subtype neuraminidases . However, the minimal and non-specific consensus motif of glycosylation sites (Nx [ST]) is found in total 8 times in the new strain sequence with an apparent clustering (50%) in the unstructured linker region (Figure 1). Interestingly, another putative novel glycosylation site N386, which is unique to the new strain, would be accessible on the surface, as seen in the structural models.
Phylogenetic relation of new NA to known subtypes
Influenza A virus protein sequences were downloaded from NCBI (as of April 29th). Neuraminidases were identified by BLAST (E-value < 0.001)  using the representative NA of the new strain as query [Genbank: ACP41107.1 http://www.ncbi.nlm.nih.gov/protein/227809834]. Redundancy was removed with cd-hit at a level of maximal 90% sequence identity , the remaining sequences were aligned with MAFFT (using L-INS-I settings ) and the resulting multiple alignment was visualized and annotated in Jalview . A neighbour joining tree with pairwise gap deletion, Poisson correction as distance measure and 500 bootstrap replicates (generated with MEGA ) produces robust groupings consistent with previous studies  for the known NA subtypes (clustering of N1, N4, N5+N8 on one side and N2, N3, N6+N7+N9 on the other) and reliably places the new NA with other N1s. Interestingly, inside the N1 cluster, the new NA appeared close to the N1 of H5N1 avian flu viruses. The alignment and corresponding phylogenetic tree are available at http://mendel.bii.a-star.edu.sg/SEQUENCES/H1N1/.
Moreover, neuroaminidases of these new H1N1 swine flu examples are more similar to H5N1 avian flu strains than other H1N1 variants found in the Americas or than that of the historic strains such as the 1918 Spanish flu . This is surprising since avian flu strains typically have different hemaglutinin (HA) subtypes (e.g. H5N1). Combinations of HA and NA subtypes need to be fine-tuned to recognize the same type of sialic acid modifications to allow smooth interplay of the two proteins, which is important for the viral cycle . These results support the notion that, also inside the family of N1 subtypes, a clear distinction can be made between avian-like H1N1, such as the one from the current outbreak, and other existing H1N1 strains.
Structural modelling and mapping of new mutations
The crystal structures of both the historic 1918 NA as well as the avian flu NA are available in complex with currently used drugs. We created a homology model of the new 2009 swine flu NA to map the sequence differences to the three-dimensional structure templates. Using Modeller , the sequence of the new neuraminidase [Genbank Accession: ACP41107.1 http://www.ncbi.nlm.nih.gov/protein/227809834] was modelled 50 times onto multiple templates (PDB: 2hu4, 3ckz, 3b7e, 3beq) and the resulting best model (as judged by DOPE score) further refined with short simulated annealing MD simulations in the presence of a bound inhibitor (zanamivir, oseltamivir or peramivir) as implemented in the Yasara Structure package . The final atom-resolution models are available in PDB format at http://mendel.bii.a-star.edu.sg/SEQUENCES/H1N1/.
It has to be noted that indirect effects of the mutations that may alter the binding pocket also from a greater distance are difficult to assess and cannot be excluded. To this extent, we have analysed coevolution patterns in an extensive alignment of more than 6000 non-identical Influenza A neuraminidases to eventually identify connected networks of residues using the SCA algorithm  as implemented in , but no network that would connect the surface directly to the core and catalytic site was found (see supplementary material). In fact, all positions of the observed mutations are at the surface and naturally variable, as judged by the conservation and SCA analysis, which would rather indicate that they do not have an effect on the structure of the more distant binding pocket.
Thus, we conclude that the drug binding pocket remains unchanged in the new strain and, hence, the binding behaviour of neuraminidase inhibitors such as oseltamivir (Tamiflu®) and zanamivir (Relenza®) should be unaffected. Indeed, initial clinical reports suggest that the new virus is susceptible to the two drugs . Our findings support this notion and provide a molecular mechanism. Furthermore, the third currently tested neuraminidase inhibitor, peramivir, should also be effective since it also shares the same binding pocket.
Next, we review how the new mutations affect vaccine development through altering antibody interactions as well as antigenic regions. There are 3 crystal structures of related neuraminidases in complex with antibodies [41, 42, 43]. In Figure 5, we annotate residues that are within 3 Å distance to the respective bound antibody and, hence, crucial for the interaction. Interestingly, residues in sites recognized by both NC41 and NC10 antibodies appear mutated in the new strain. This would suggest that these old antibodies (that were originally directed against N9 neuraminidase) would probably not bind to NA of the new strain. Nevertheless, using the same regions as epitope may be a viable option for novel vaccine development.
Additionally, several other known antigenic regions, partially derived from surviving patients of previous flu outbreaks (e.g. H5N1), are reported in the literature [44, 45, 46] and their location is indicated in the alignment (Figure 5). Another extensive source of epitopes and antigens including neuraminidase of influenza A viruses is the immune epitope database (IEDB)  and the complete mapping of epitopes for the new H1N1 NA sequence is available at http://mendel.bii.a-star.edu.sg/SEQUENCES/H1N1/. While several of the new mutations are found in antigenic regions, it is also apparent that they often occur on positions that are hardly conserved among different NA subtypes. Consequently, these regions are evolutionarily more flexible and may mutate fast. This increases the risk of evading antibody responses of human hosts acquired during previous flu infection or from vaccination.
Sequence variation among patients with the same 2009 H1N1 strain
After the first wave of new patient sequences arrived, it becomes clear that there are at least two major lineages that are distinguishable by only few mutations. Most notably, N248 has mutated to Aspartate (D248) in the New York infection cluster. All intra-strain mutations available before May 8th 2009 are indicated in Figures 4B and 5. As expected, they are predominantly found on the surface and in regions that are known to be variable from the conservation analysis. While the drug binding pocket remains unaffected by these most recent mutations, the N248D substitution changes a central part of an antibody recognition site. This has important consequences for vaccine development, forcing to either avoid this epitope or produce combined vaccines to account for the epitope variation observed in different patient groups. Although the mutation pattern will become less transparent over time it may still serve to delineate chains of transmission, retrospectively.
In summary, we provide a sequence analysis and structural modelling of the neuraminidase from the 2009 H1N1 swine flu outbreak. Besides mapping of phylogenetic relationships to other strains, we find that the sequence variation in the new strain does not seem to affect the drug binding site but may very well alter common epitopes. To allow quick analysis of future mutations that could produce drug or vaccine-resistant strains, we provide a tool for 3D visualization of the neuraminidase structure models with mapping of drug and antibody recognition sites on the supplementary webpage http://mendel.bii.a-star.edu.sg/SEQUENCES/H1N1/.
Sandor Pongor, International Centre for Genetic Engineering and Biotechnology, Trieste, Italy
In this work, the authors carry out a predictive analysis of the "2009 H1N1" (swine or Mexican flu) influenza A virus strain, based on phylogenetic analysis and 3D homology modelling. The results show that this strain is phylogenetically more closely related to European H1N1 swine flu and H5N1 avian flu rather than to the H1N1 counterparts in the Americas.
Homology-modeling of the neuraminidase reveals that the novel mutations are not likely to interfere with the active site so the currently used neuraminidase inhibitors (oseltamivir, zanamivir and peramivir) will be effective against the new virus strain.
The subject is very timely and the approach is adequate. The authors may want to include analysis of more patient data that were published since the analysis was completed. More and more sequences are being published from all over the world that might be worthwhile to include into this analysis. The authors may consider establishing a periodically updated homepage, if appropriate. In summary, the analysis is careful and carried out in a commendable fashion, and the findings are highly significant.
Indeed, there have been additional "2009 H1N1" neuraminidase sequences since the submission of this manuscript for review. Between April 29th and May 8th, 45 new sequences became available. Overall, 3 mutations occur in multiple (S95G, V106I and N248D) while 2 mutations are restricted to single patient virus isolates (V83M, V241I), so far. We have included a section about these mutations identified in new patient sequences and mapped them to the structure. As discussed in the main text, the intra-strain variation is typically found on the surface and at positions expected to be variable as judged by the conservation analysis among all NA subtypes. Interestingly, one of the mutations among patients (N248D) is critically affecting one of the antibody binding sites.
As the virus will continue to evolve, new mutations will become available and the best way we have found to allow quick mapping and update of new sequence variation in respect to drug and antibody binding sites is to give full access to users/readers via a 3D structure visualization tool at the supplementary webpage http://mendel.bii.a-star.edu.sg/SEQUENCES/H1N1/ (instructions to map new mutations are given, including an example).
Aravind, National Center for Biotechnology Information, Bethesda, MD, USA
Maurer-Stroh et al discuss sequence features of the neuraminidase gene of the virus behind the latest influenza outbreak, 2009 H1N1. Of the features of interest they note that this strain has acquired a novel glycosylation consensus site that could be surface accessible. Of course, it remains unknown if this site is indeed used for modification. The key finding in the paper is that the binding cavity for neuraminidase inhibitors is unaffected as suggested by molecular modeling. This finding is of significance in the current situation of an outbreak with pandemic potential. However, one issue needs to be highlighted in this regard – mutations far away from the binding site can potentially affect the shape and or binding affinity of the binding pocket. These are not always captured by homology models, especially the issue of affinity. In principle, authors could use a conservation patterns or co-evolution measures (e.g. as in PMID: 10514373) to determine if there are interaction chains that might connect distant residues to the active site. In the least it would be useful to provide the caveat of distant changes affecting affinity in the current paper.
We totally agree with the referee that also mutations at a greater distance may affect the binding pocket under special circumstances and we have added a new paragraph to the manuscript. However, we have to admit that it remains essentially impossible to quantify the influence of non-direct interactions by theoretical means unambiguously. In our experience, co-evolution measures such as the one proposed and several others (PMID: 18056067) produce high rates of false positives and, therefore, the interpretation is difficult. We did the requested analysis with Ranganathan's SCA algorithm to eventually identify connected networks of residues using the webserver implementation at the Gerstein lab, but no network that would connect the surface directly to the core and catalytic site was found (see supplementary webpage for full details). In fact, all positions of the observed mutations are at the surface and naturally variable, as judged by the conservation and SCA analysis, which would rather indicate that they do not have an effect on the structure of the more distant binding pocket. A possible problem why the SCA could not work in this case is the high level of sequence similarity among the neuraminidases which only gives limited numbers of informative correlated mutations. This is a totally different scenario from alignments of highly divergent sequences that still have the same fold, such as the small PDZ domains analyzed by Ranganathan. In the case of a diverse family with shared fold, correlated mutations are indicative of allowed fluctuations also among structurally important residues. However, with the neuraminidases, most variation can be attributed to surface residues, which makes sense given the pressure to avoid immune responses. Additionally, sequence sampling of neuraminidases is not independent but biased by transmission chains and clusters of outbreaks. A possible but also not necessarily more precise alternative to judge indirect effects on drug binding would be to run free energy simulations with the bound drug to judge changes of affinity caused by the mutations but this is a tricky and time-consuming endeavour that would burst the scope of this current manuscript.
unspecific consensus motif: change to "non-specific"
Phylogenetic analysis: While for sequences at this range of similarity Poisson correction may not have negative consequences it is definitely better to repeat the analysis with JTT and variable rates to see if the clustering remains the same or changes drastically.
We confirmed that the tree clustering inside the N1 subtype family is robust regarding different tree generation methods and added this also to the text. We also provide the suggested JTT distance tree with variable rates as supplementary at http://mendel.bii.a-star.edu.sg/SEQUENCES/H1N1/.
" This indicates that, similar to the current outbreak, scenarios of breaches in the species barrier between human and pigs have already arisen out of closely related H1N1 strains as also evidenced by further reports in the literature [26, 27]." The wording of this sentence is somewhat unclear. It appears that the authors wish to state that H1N1 like strains have crossed species boundaries on other occasions, but this is not necessarily clear in the sentence.
changed to "This indicates that, similar to the current outbreak, closely related H1N1 strains have crossed species boundaries on previous occasions as also evidenced by further reports in the literature".
"These results support the notion that, also inside the family of N1 subtypes, a clear distinction can be made to distinguish avian-like H1N1, such as the one from the current outbreak, from other existing H1N1 strains." What would be the explanation for this? A recent recombination between an avian-like H1N1 or has it diverged from other avian like H1N1 with the recombination occurring much earlier. Could this information be superimposed in phylogenetic context on current figure 3?
This is, of course, a very interesting question and difficult to deduce from the phylogenetic tree of a single protein without molecular clock, as in our case. However, this and similar questions have already been analyzed to quite some detail for the existing strains. The current knowledge of the scenario for H1N1 is that around the late 1970s to early 1980s, a human-avian reassortant virus started to be detected in European pigs as host (PMID: 8091678). Then, this avian-like swine flu started to move from the European continent to the UK in the early 90s (PMID: 9049404). More reassortments and emergence of the N2 subtype that quickly spread is also well documented. This phylogenetic analysis plugs the new H1N1 strain into the already known clusterings among the NA subtypes and within the N1 family.
- 4.Ooi HS, Kwo CY, Wildpaner M, Sirota FL, Eisenhaber B, Maurer-Stroh S, Wong WC, Schleiffer A, Eisenhaber F, Schneider G: ANNIE: integrated de novo protein sequence annotation. Nucleic Acids Res. 2009.Google Scholar
- 7.Blok J, Air GM, Laver WG, Ward CW, Lilley GG, Woods EF, Roxburgh CM, Inglis AS: Studies on the size, chemical composition, and partial sequence of the neuraminidase (NA) from type A influenza viruses show that the N-terminal region of the NA is not processed and serves to anchor the NA in the viral membrane. Virology. 1982, 119: 109-121.PubMedCrossRefGoogle Scholar
- 11.Eisenhaber F, Eisenhaber B, Maurer-Stroh S: Prediction of Post-translational modifications from amino acid sequence: Problems, pitfalls, methodological hints. Bioinformatics and Genomes: Current Perspectives. Edited by: Andrade MM. 2003, Wymondham: Horizon Scientific Press, 81-105.Google Scholar
- 27.Komadina N, Roque V, Thawatsupha P, Rimando-Magalong J, Waicharoen S, Bomasang E, Sawanpanyalert P, Rivera M, Iannello P, Hurt AC, Barr IG: Genetic analysis of two influenza A (H1) swine viruses isolated from humans in Thailand and the Philippines. Virus Genes. 2007, 35: 161-165.PubMedCrossRefGoogle Scholar
- 40.Centers for Disease Control and Prevention (CDC): Update: drug susceptibility of swine-origin influenza A (H1N1) viruses, April 2009. MMWR Morb Mortal Wkly Rep. 2009, 58: 433-435.Google Scholar
- 46.Ho HT, Qian HL, He F, Meng T, Szyporta M, Prabhu N, Prabakaran M, Chan KP, Kwang J: Rapid Detection of H5N1 Subtype Influenza Viruses by Antigen-Capture Enzyme-Linked Immunosorbent Assay Using H5 and N1 specific Monoclonal Antibodies. Clin Vaccine Immunol. 2009, 16: 726-32.PubMedPubMedCentralCrossRefGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.