Introduction

Influenza is a common respiratory infection caused by influenza virus. Influenza has been a serious threat to public health worldwide since its emergence. According to the World Health Organization (WHO), the annual epidemic of seasonal human influenza virus can affect 5-10% of adults, causing approximately 3 to 5 million cases of serious disease and approximately 290,000 to 650,000 deaths (WHO, 2018). Influenza is ranked first among the world's top 10 public health threats [27]. Since 2009, when the influenza H1N1 virus caused the first influenza pandemic of the 21st century, the influenza A(H1N1)pdm09 virus has become one of the main subtypes of seasonal influenza viruses. Hemagglutinin (HA) and neuraminidase (NA) are two important proteins on the surface of influenza virus. Since the gene sequence of HA is prone to mutation, HA gene mutations are one of the primary causes of variation in the antigenicity and pathogenicity of the virus [14]. NA plays a key role in the replication and transmission of influenza virus, and NA gene mutations are closely associated with resistance to antiviral drugs [15].

Vaccination is the most effective measure to prevent influenza, and neuraminidase inhibitors (NAIs) are effective drugs to treat influenza. In 2017, WHO recommended using the A/Michigan/45/2015 strain to replace the A/California/07/2009 strain in influenza A(H1N1)pdm09 vaccines for the Northern Hemisphere, suggesting a certain antigenic drift of influenza A(H1N1)pdm09 virus compared with the previous period [1, 2]. Although WHO updates the universal influenza vaccine every few years, a mismatch between influenza virus vaccine strains and epidemic strains still exists. The influenza vaccine can effectively protect only 50%-60% of the population, and the coverage rate of the influenza vaccine in mainland China is only 23.2% [31]. In addition, some mutant influenza H1N1 viruses are resistant to adamantanes and NAIs [12, 30]. These data indicate that analysing the dynamics of molecular evolution and antigenic mutations is important for predicting influenza epidemics and assessing the effectiveness of vaccines and drugs.

In the present study, we collected and extracted RNA of influenza A(H1N1)pdm09 strains isolated from Peking University People’s Hospital (PKUPH) during the influenza epidemic seasons from November 2017 to March 2018 and from November 2018 to March 2019. We mainly examined the molecular evolution and genetic properties of the HA and NA genes of isolates from in Beijing, China, which has not been reported previously. The findings of our study will help us understand the evolution process and mutations of HA and NA genes of the influenza H1N1 virus more comprehensively and provide a basis for the prediction and prevention of influenza epidemics.

Materials and methods

Case definition and nasal specimens

The study population included patients with influenza-like illness (ILI) admitted to PKUPH from November 2017 to March 2018 and from November 2018 to March 2019. ILI is defined in strict accordance with WHO standards as an acute respiratory illness with a measured temperature of ≥ 38 °C and cough, with onset within the preceding 10 days [11]. Nasal swabs were immediately placed in virus transport medium tubes and stored at -80 °C until analysis. A total of 32,240 ILI nasal swabs were collected.

Data collection

The Chinese National Influenza Center (CNIC) monitors patients with ILI nationwide through its Influenza Laboratory Surveillance Network, which collects and tests thousands of specimens from ILI patients in sentinel hospitals nationwide on a weekly basis. We summarized the weekly CNIC data of ILI and confirmed influenza cases during the influenza epidemic from July 2017 to June 2019 to understand the epidemic situation of influenza in North China. We also collected HA and NA gene sequences of influenza A(H1N1)pdm09 strains isolated from PKUPH during the period from 2012 to 2017.

Viral RNA extraction, gene amplification, and sequencing

We extracted viral RNA from 140 μl of nasal swabs, using a QIAamp Viral RNA Mini Kit (QIAGEN, Germany) according to the manufacturer’s instructions. Reverse transcription was performed at 50 °C for 50 min using the universal primer 5’-AGCAAAAGCAGGTAG-3’ [35]. For detection of influenza A and B viruses, we used the forward primer and the reverse primer for amplification of the matrix gene [9] (Table 1). Amplification reactions were carried out under the following conditions: 95 °C for 15 min; followed by 40 cycles of 94 °C for 30 s, 46 °C for 15 s, and 72 °C for 30 s; and then 42 °C for 10 s.

Table 1 Primers used for influenza virus detection and specific primers used for amplification of HA and NA genes of the influenza A(H1N1)pdm09 virus

We used H1- and N1-specific primers to amplify HA and NA genes of the influenza A(H1N1)pdm09 viruses [6] (Table 1). The cycling conditions were 94 °C for 3 min; followed by 40 cycles of 94 °C for 30 s, 50 °C for 30 s, and 72 °C for 1.5 min; and a final extension step at 72 °C for 7 min. Fifty-one A(H1N1)pdm09 virus strains were detected, and their products were sequenced using the Sanger method. We obtained a total of 38 HA and 46 NA sequences, and all sequences were deposited in the NCBI GenBank database (GenBank accession numbers: MN853421-MN853458 and MN853485-MN853530).

Phylogenetic analysis

For analysis of the molecular evolution of influenza A(H1N1)pdm09 virus, we determined the HA and NA gene sequences of strains from PKUPH as described above and compared them with other strains collected in previous years. The reference sequences were obtained from NCBI. For each year, from the large collection of global isolates, 2-3 sequences were randomly selected during 2009–2019. A total of 38 HA and 46 NA sequences from PKUPH (Beijing) isolates and 24 HA and 24 NA sequences from global strains were sampled independently for our phylogenetic analysis. ClustalW 2.1 was used for multiple sequence alignments. Phylogenetic analysis of gene segments was performed using MEGA 7.0 software and Tamura’s 3-parameter model with gamma-distributed rates. The reliability of the maximum-likelihood tree was assessed by bootstrap analysis with 1000 replications [18, 29].

Characterization of HA and NA sequences

The A/California/7/2009 and A/Michigan/45/2015 strains were used as the reference strains in this study. All 38 HA and 46 NA amino acid sequences from the PKUPH (Beijing) strains were aligned with their homologs from various countries using MEGA 7.0. All amino acid mutation sites were confirmed, and key mutation sites were screened.

Results

Distribution of influenza

In this study, we collected data from 32,240 ILI patients treated by the Department of Infectious Disease of PKUPH from November 2017 to March 2019. By RT-PCR, we determined that 2978 ILI patients were positive for influenza virus, 2441 (82.0%) of whom were positive for influenza A virus, while 537 (18.0%) were positive for influenza B virus. We collected data for ILI and confirmed influenza cases in northern China from July 2017 to June 2019 to determine the distribution of influenza. The distribution of ILI cases was consistent with that of confirmed influenza cases. According to the CNIC, during the 2017-2018 influenza season in Beijing, influenza A(H1N1)pdm09 and A(H3N2) viruses co-circulated with the predominance of influenza B virus, with a peak period that commenced at the 47th week of 2017 and lasted approximately 2 months (Fig. 1). However, during the 2018-2019 influenza season in Beijing, influenza A(H1N1)pdm09 virus became the completely dominant subtype in the first peak period, and subsequently, the number of cases of influenza B increased rapidly; influenza A(H1N1)pdm09, A(H3N2), and B viruses constituted the second peak (Fig. 1).

Fig. 1
figure 1

The epidemic situation in northern China from July 2017 to June 2019. Influenza B virus was the most dominant subtype during the 2017-2018 influenza season, while the influenza A(H1N1)pdm09 virus became the completely predominant subtypes in the peak phase of the 2018-2019 influenza season. The influenza data from northern China was obtained from the Chinese National Influenza Center.

Sequence comparisons

Among the Beijing influenza A(H1N1)pdm09 strains from PKUPH obtained during the 2017–2018 and 2018-2019 influenza seasons, we found that the nucleotide sequence identity values for of HA and NA were 96.4%-99.4% and 92.2%-100%, respectively, and the amino acid sequence identity values for HA and NA were 96.1%-99.3% and 91.3%-99.6%, respectively. When comparing the gene sequences of the 51 strains of influenza A(H1N1)pdm09 virus from this study with those of the influenza A/California/07/2009 reference strain, we observed that the HA nucleotide and amino acid sequences were 94.7%-96.6% and 94.2%-96.3% identical, respectively; and the NA nucleotide and amino acid sequences were 90.3%-97.1% and 88.1%-95.3% identical, respectively. When comparing the gene sequences of the 51 strains of influenza A(H1N1)pdm09 virus from this study with those of the influenza A/Michigan/45/2015 vaccine strain, the HA nucleotide and amino acid sequences were 96.5%-98.6% and 96.6%-98.9% identical, respectively, and the NA nucleotide and amino acid sequences were 92.1%-99.0% and 90.9%-98.9% identical, respectively. The isolates obtained from PKUPH were more similar to the influenza A/Michigan/45/2015 strain than to the influenza A/California/07/2009 strain.

Evolutionary and mutational analysis of influenza A(H1N1)pdm09 virus

We constructed phylogenetic trees for the HA and NA genes of the 51 strains of influenza A(H1N1)pdm09 virus from PKUPH (Beijing), the vaccine strains A/California/07/2009 and A/Michigan/45/2015, and the viruses circulating in other regions of the world (Figs. 2 and 3). HA phylogenetic analysis showed that all PKUPH (Beijing) strains formed a distinct cluster, belonging to subclade 6B.1 (Fig. 2). This subclade includes the currently recommended vaccine virus A/Michigan/45/2015 with the amino acid substitutions S162N and I216T (potential gain of glycosylation; H1 numbering) [7]. However, all of the isolates differed from the vaccine virus A/Michigan/45/2015, due to the HA antigenic epitope substitutions S91R, S181T and I312V (H1 numbering), indicating that the isolates belong to the 6B.1A subclade. Additionally, differences were found between strains isolated from November 2017 to March 2018 and from November 2018 to March 2019. The former group was defined by the amino acid substitutions Y10C, D52N, H155Y and A273I (H1 numbering).

Fig. 2
figure 2

Phylogenetic tree based on HA nucleotide sequences of influenza A(H1N1)pdm09 virus from Beijing and global strains reported from 2009 to 2019. “▲” represents the vaccine strains. “●” represents the isolates from this study. The tree was constructed by the maximum-likelihood method in MEGA 7.0 with 1000 bootstrap replicates; nodes with bootstrap values >70 are shown. The 2017–2019 Beijing strains are indicated in different colours (strains from 2017 in the 2017-2018 influenza season are shown in red, strains from 2018 in the 2017-2018 influenza season are shown in yellow, strains from 2018 in the 2018-2019 influenza season are shown in blue, and strains from 2019 in the 2018-2019 influenza season are shown in green).

Fig. 3
figure 3

Phylogenetic tree based on NA nucleotide sequences of influenza A(H1N1)pdm09 virus from Beijing and global strains reported from 2009 to 2019. “▲” represents the vaccine strain. “●” represents the isolates from this study. The tree ws constructed by the maximum-likelihood method in MEGA 7.0 with 1000 bootstrap replicates; nodes with bootstrap values >70 are shown. The 2017–2019 Beijing strains are indicated in different colours (strains from 2017 in the 2017-2018 influenza season are shown in red, strains from 2018 in the influenza 2017-2018 season are shown in yellow, strains from 2018 in the 2018-2019 influenza season are shown in blue, and strains from 2019 in the 2018-2019 influenza season are shown in green).

A similar evolutionary characteristic was observed when analysing the NA sequences (Fig. 3). The NA nucleotide sequences of the 2018–2019 viruses displayed a drift relative to the 2017–2018 strain. All of the isolates were similar to epidemic strains from the USA, Germany, France and South Korea that were isolated during the same period.

Analysis of amino acid sequence variations in the HA and NA proteins

In this study, we compared the amino acid sequences of the HA protein between the PKUPH (Beijing) A(H1N1)pdm09 strains and vaccine strain A/Michigan/45/2015. The entire article follows the H1 numbering system. All isolates had the S91R, S181T and I312V mutations, which were the characteristic mutations of the influenza A(H1N1)pdm09 virus of these two seasons (Table 2). In the antigenic epitopes, we found that all isolates of the 2017‐2018 and 2018-2019 seasons harboured the following mutations: S181T (100%) in the Sa site; T202I (28.2%), a less common mutation, in the Sb site; H155Y (23.1%), a less common mutation, in the Ca site; and S91R (100%), a common mutation, in the Cb site (Table 3). In addition, L178I (2.5%) and P154S (2.5%), rare mutations that were detected in one isolate, were found in the Sa site and Cb site, respectively. The receptor-binding region (RBD) of HA consists of three domains, including the 130 loop, the 220 loop, and the 190 helix [23], which are associated with adaptability to α2–6-linked glycans, which are receptors for human influenza viruses In the RBD of HA, 3.9% of isolates had the E189G mutation, and other sites did not mutate. D222G is an additional mutation in HA that has been reported to enhance the binding to α2–6-linked glycans in the human lower respiratory tract [8]. However, we did not observe this mutation in the isolated A(H1N1)pdm09 strains. We detected nine potential glycosylation sites in the HA gene: NNS27-29, NST28-30, NVT40-42, NGT104-106, NQT179-181, NXT293-295, NST304-306, NGT498-500, and NGS557-559. At the potential glycosylation sites, NTT at amino acid 293 was mutated to NAT in a small number of strains (5.1%). We also found that all of the strains had a new glycosylation site 179 (NQT), which is located in the antigenic epitope. The 179 glycosylation site began to appear in 2015 and became the main feature of all strains isolated in the following years.

Table 2 Amino acid mutations in the HA and NA protein of influenza A(H1N1)pdm09 virus strains isolated in Beijing during the 2017–2019 influenza seasons compared to the A/Michigan/45/2015 vaccine strain
Table 3 Amino acid substitutions in the HA protein of influenza A(H1N1)pdm09 virus strains isolated in Beijing during the 2017–2019 influenza seasons compared to the A/Michigan/45/2015 vaccine strain

We analysed the NA protein sequences of the PKUPH (Beijing) A(H1N1)pdm09 strains and compared them to the NA protein sequence of the vaccine strain A/Michigan/45/2015. The alignment of the NA amino acid sequences showed that all isolated strains had the amino acid substitutions V81A and N449D (Table 2). We detected twelve mutations that were in the NA antigenic sites, which are represented by NA residues 83–143 (I99V, I117M), 156–190 (I188T, N189S), 252–303 (V267A, G298V), 330, 332, 340–345, 368, 370, 387–395 (I389K), 400 (S400T), 431–435, and 448–468 (N449D, S450R, D451N, D451E) [4] (Table 4). The catalytic sites in the NA protein include R118, D151, R152, R225, E277, R293, R368, and Y402, and the framework sites supporting the catalytic residues include E119, R156, W179, S180, D/N199, I223, E228, H275, E278, N295 and E425 [4, 26]. However, most of these residues were highly conserved in strains isolated from the 2017–2018 and 2018-2019 influenza seasons. The binding site of the NA neutralizing antibody is the structural epitope around the catalytic centre of NA. Around that location, we found that one strain had a rare I177M mutation, and all of the strains had the common N369K mutation. In addition, we observed that three potential N-linked glycosylation sites had variations in NA (Table 5). Importantly, two strains had an I223 mutation, which is associated with reduced susceptibility to several NAIs [24]. None of the analysed strains had the oseltamivir resistance mutation H275Y.

Table 4 Amino acid substitutions in the NA antigenic epitope and framework sites of influenza A(H1N1)pdm09 virus strains isolated in Beijing during the 2017–2019 influenza seasons compared to the A/Michigan/45/2015 vaccine strain
Table 5 The changes in glycosylation sites in the HA and NA genes of influenza A(H1N1)pdm09. Changes that distinguish strains from the 2017–2019 influenza seasons from the vaccine strains are shown in bold.

Annual changes of amino acid substitutions

We collected HA and NA gene sequences of influenza A(H1N1)pdm09 strains from the data bank of PKUPH during the period from 2012 to 2017 and compared the gene sequences from 2012-2017 and 2017-2019 to the vaccine strains A/California/07/2009 and A/Michigan/45/2015. In our study, for HA, the amino acid substitution H155Y appeared in the 2017-2018 influenza season and disappeared in the 2018-2019 influenza season, representing a transient change (Table 6). At site 202, Thr mutated to Ile, which was detected in the 2017-2018 and 2018-2019 influenza seasons (Table 6). The I233T mutation has been extant since 2017 and has been a feature of subsequent seasons (Table 6). The changes of other amino acid substitutions in the HA protein sequence are shown in Table 6, including sites 146, 200, 273 and 421.

Table 6 Amino acid substitutions of HA observed from 2012 to 2019. Mutations in HA from 2017 to 2019 are shown in bold

From 2012 to 2019, changes in amino acid substitutions in the NA protein are shown in Table 7. Among them, mutations at sites 51, 74, 77, 188, 314 and 416 appeared in the 2017-2018 influenza season. Mutations at sites 189 and 389 followed, appearing since the 2018-2019 influenza season (Table 7). During these seven seasons, amino acids at sites 45, 449, 450 and 451 have changed constantly (Table 7).

Table 7 Amino acid substitutions in NA observed from 2012 to 2019. Mutations in NA from 2017 to 2019 are shown in bold.

The N146D, S200P, S202I and A273T mutations in HA appeared along with Q51K, F74S and D416N mutations in NA in six strains during the 2017–2018 and 2018-2019 influenza seasons. H155Y and A273I in HA were detected in nine strains in the same period (Tables 5 and 6).

Discussion

Since the influenza pandemic of 2009, the influenza A(H1N1)pdm09 virus has gradually replaced the seasonal influenza A(H1N1) virus and become one of the main subtypes of the influenza epidemic in winter and spring worldwide. Influenza A(H1N1)pdm09 virus adapts to new environments and hosts by constantly changing [13]. Each country has a different start time and major subtype of the influenza season each year. For example, the distribution of influenza strain subtypes in the Northern Hemisphere during the 2017–2018 influenza season indicates that the influenza epidemiology of China was similar to that of other regions in Asia but was different from that of North America and European countries [2].

The influenza vaccine is the primary measure to prevent influenza virus infections. Each year, WHO collects data on the antigenic characteristics of circulating viruses and analyzes the evolutionary history of HA to choose influenza vaccine strains for the Northern and Southern Hemispheres. Sequence similarity comparisons revealed that the isolates from 2017 to 2019 in this study were divergent from the vaccine strain A/Michigan/45/2015 and that the similarity between the isolates and the vaccine virus decreased year by year. All of the isolates differed from the vaccine virus A/Michigan/45/2015, with the HA antigenic epitope substitutions S91R, S181T and I312V, potentially indicating that the surface antigen of these strains had constantly mutated and gradually evolved. This suggests that the current vaccines probably provide only moderate protection against the influenza A(H1N1)pdm09 virus, and such protection might have been reduced or even absent in the last season.

During infection, HA binds to sialic acid receptors on the host cell surface and allows the virus to enter the cell by mediating membrane fusion [32]. We found some common and less common mutations in HA, which may lead to changes in the antigen specificity, host adaptability, and pathogenicity of influenza virus. The HA protein of the influenza A(H1N1)pdm09 virus has four antigenic epitopes, Sa, Sb, Ca and Cb [21]. Amino acid substitutions were detected in these four antigenic sites. Studies have shown that neutralizing antibodies protect against influenza virus infection and transmission by blocking the binding of HA to cell receptors and inhibiting membrane fusion [33]. Therefore, amino acid substitutions in of HA antigen epitopes may result in immune escape by the virus. In addition, other mutations (V169I, E189G and S200P) are close to antigenic sites and therefore may indirectly influence antigenicity and virulence. Interestingly, we found that the fixed mutation S202T was replaced by S202I in eleven (28%) isolated strains. Whether this change (Thr to Ile) alters the fitness of influenza virus remains to be investigated.

The NA protein catalyses the hydrolysis of sialic acid, which reduces the affinity of new virus particles for the host cell and promotes the release of the virus [20]. Currently, NAIs are the most effective treatment for influenza. The catalytic sites and framework sites of NA are usually conserved, but mutations at these sites, such as H275Y, I223R and E119G, directly affect the effect of NAIs [15]. In our study, we detected a drug-resistance mutation, I223R, in one strain that was isolated from a patient with nephrotic syndrome who had been taking an immunosuppressor for a long time. The mechanism of resistance to oseltamivir was due to the loss of key hydrogen bonds between the inhibitor and residues in the 150-loop [34]. When the I223R mutation in NA occurs, the structure of the NA protein transitions from a closed conformation to an open conformation. Oseltamivir binds weakly to the open conformation of NA. Although the influenza A(H1N1)pdm09 virus with H275Y in NA is defective in its biological characteristics, drug-resistance mutations (E119D, H275Y and I223R) are often detected in viruses from immunosuppressed populations [19, 28, 30]. Therefore, these mutated viruses may need a friendly environment to survive. This might explain why influenza viruses are more susceptible to drug-resistance mutations in immunosuppressed populations.

We also found some mutations that change over time by comparing the relevant data of PKUPH over the last seven years. Glycosylation of the HA head antigen sites not only changes the properties of the HA protein but also increases the pathogenicity of the virus by hiding it from immune recognition [5]. We detected a new potential glycosylation site, NQT, at site 179, which has been reported in studies from other countries [17]. The glycosylation of site 179 did not appear in China before 2015 [9, 10] but became the main feature of all strains isolated during the 2015-2019 influenza seasons. The appearance of this glycosylation site may be related to vaccine pressure. Interestingly, it has been reported that no amino acid substitutions occurred at I233 of the HA protein of the influenza A(H1N1)pdm09 virus in China before 2017; however, in our study, the I233T mutation in HA has been extant since 2017 and has been a feature in subsequent seasons (Table 6). It is known that the I233 mutation can change the structure of the HA protein and reduce its pH stability. By disrupting hydrophobic interactions between I233 and V237, mutations at position 233 may reduce the pH stability of the HA protein. Such intramolecular changes may alter the conformation of a loop formed by amino acids 230–242, thereby reducing receptor specificity [16]. Whether the I233T mutation affects the adaptability of the virus requires further study.

Furthermore, co-occurrence of mutations in the HA and NA genes in the influenza A(H1N1)pdm09 virus were detected in this study, suggesting that these substitutions occurred simultaneously. The amino acids substitutions S220T in HA and N248D in NA were found in our study, and these are the main molecular markers that distinguish the beginning and the peak of the 2009-2010 influenza A(H1N1) season [22]. The N146D, S200P, S202I, A273T and N277D mutations in HA appeared along with the Q51K, F74S and D416N mutations in NA in six strains during the 2017–2019 influenza seasons. Similarly, the co-mutations H155Y and A273I in HA were detected in nine strains in the same period, indicating that co-mutations in HA and NA may not occur independently but are probably associated with each other [8]. Importantly, the newly emerged substitutions (HA: 91, 181 and 312; NA: 77, 81, 188 and 449) in almost all isolates from the 2017-2018 and 2018-2019 influenza seasons were observed in the strains circulating globally from 2017–2019 but were not found in the strains before 2017. Previous studies showed that the viability of the strain with the H275Y mutation in NA decreased to a certain extent, but this strain has persisted in recent years and remained adaptable, because of the compensating effects of other mutations [3, 25]. Co-mutation strains (HA: S91R, S181T and I312V; NA: G77R, V81A, I188T and N449D) may increase the fitness of the virus in a new environment and host. The increased fitness and decreased sequence similarity of co-mutation strains to the A/Michigan/45/2015 strain may lead to a reduced efficacy of the WHO-recommended vaccine. The co-mutations of the HA and NA proteins of the influenza A(H1N1)pdm09 virus need to be studied further, and attention should be paid particularly to the accumulation of mutations that may cause an epidemic of new influenza A(H1N1)pdm09 lineages.

Conclusions

The present work further enhances our understanding of the winter influenza epidemic in China and the Northern Hemisphere and thus provides insights for prediction of the next epidemic strain of the influenza A(H1N1)pdm09 virus. Mutations of HA and NA, including new glycosylation and drug-resistance mutations, were detected in this study, and the biological characteristics of these mutations require further study. Furthermore, co-mutations in the HA and NA proteins, which may affect the adaptability of the influenza virus to the environment and the host, need to be taken seriously, and more research is needed to overcome the influenza problem.