Figures
Abstract
Humans experience many influenza infections over their lives, resulting in complex and varied immunological histories. Although experimental and quantitative analyses have improved our understanding of the immunological processes defining an individual’s antibody repertoire, how these within-host processes are linked to population-level influenza epidemiology in humans remains unclear. Here, we used a multilevel mathematical model to jointly infer antibody dynamics and individual-level lifetime influenza A/H3N2 infection histories for 1,130 individuals in Guangzhou, China, using 67,683 haemagglutination inhibition (HI) assay measurements against 20 A/H3N2 strains from repeat serum samples collected between 2009 and 2015. These estimated infection histories allowed us to reconstruct historical seasonal influenza patterns in humans and to investigate how influenza incidence varies over time, space, and age in this population. We estimated median annual influenza infection rates to be approximately 19% from 1968 to 2015, but with substantial variation between years; 88% of individuals were estimated to have been infected at least once during the study period (2009 to 2015), and 20% were estimated to have 3 or more infections in that time. We inferred decreasing infection rates with increasing age, and found that annual attack rates were highly correlated across all locations, regardless of their distance, suggesting that age has a stronger impact than fine-scale spatial effects in determining an individual’s antibody profile. Finally, we reconstructed each individual’s expected antibody profile over their lifetime and inferred an age-stratified relationship between probability of infection and HI titre. Our analyses show how multi-strain serological panels provide rich information on long-term epidemiological trends, within-host processes, and immunity when analysed using appropriate inference methods, and adds to our understanding of the life course epidemiology of influenza A/H3N2.
Citation: Hay JA, Zhu H, Jiang CQ, Kwok KO, Shen R, Kucharski A, et al. (2024) Reconstructed influenza A/H3N2 infection histories reveal variation in incidence and antibody dynamics over the life course. PLoS Biol 22(11): e3002864. https://doi.org/10.1371/journal.pbio.3002864
Academic Editor: Jonathan Dushoff, McMaster University, CANADA
Received: March 15, 2024; Accepted: September 26, 2024; Published: November 7, 2024
Copyright: © 2024 Hay et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All code and data required to reproduce the analyses are available at https://doi.org/10.5281/zenodo.12795911.
Funding: JAH is supported by a Wellcome Trust Early Career Award (grant 225001/Z/22/Z). This study was supported by grants from the NIH R56AG048075 (DATC, JL), NIH R01AI114703 (DATC, BY), and the Wellcome Trust 00861/Z/16/Z (SR) and 200187/Z/15/Z (SR). This work was also supported by research grants from Guangdong Government HZQB-KCZYZ‐2021014 and 2019B121205009 (HZ). DATC, KOK, JL, JMR, and SR acknowledge support from the National Institutes of Health Fogarty Institute (R01TW0008246 and R01AI160780). JMR acknowledges support from the Medical Research Council (MR/V038613/1; MR/Z504373/1). KOK acknowledges support from Health and Medical Research Fund (reference numbers: INF-CUHK-1, 17160302, 18170312, 22210232, CID-CUHK-A, COVID1903008), General Research Fund (reference numbers: 14112818, 24104920), Group Research Scheme of The Chinese University of Hong Kong, and Funding Allocation to Faculties by Research Committee of The Chinese University of Hong Kong. The funders played no role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: DATC declares research funding from Merck, Sharp and Dohme and from Pfizer for research unrelated to this manuscript. All other authors have declared that no competing interests exist.
Abbreviations: CoV, coefficient of variation; CrI, credible interval; HI, haemagglutination inhibition; ILI, influenza-like illness; MCMC, Markov chain Monte; MDCK, Madin-Darby canine kidney; RDE, receptor-destroying enzyme; TCID50, 50% tissue culture infectious dose
Introduction
Patterns of influenza infections in humans are highly varied across time, space, and demography [1,2]. Recurrent epidemics occur because influenza viruses undergo an evolutionary process of antigenic drift, whereby new strains escape preexisting host immunity through the accumulation of mutations in immunodominant surface glycoproteins leading to rapid turnover of lineages, with specific strains persisting for 1 to 2 years [3,4]. Because individuals are alive at different times and locations, they are exposed to different strains and thus each individual has a distinct immunological history [5,6]. As a result, serological data suggest that humans are infected with a new A/H3N2 influenza strain approximately every 5 years, with less frequent infections, or at least less frequent detectable antibody boosts, as individuals enter middle age [7,8].
A better understanding of who, where, and when influenza infections are likely to occur would aid in public health planning, nowcasting, and forecasting [9,10]. However, it is not just antigenic variation and evolution that contributes to variation in influenza incidence, but a combination of individual and population level factors [11,12]. Birth cohorts [13–15], contact and movement patterns [16–18], climatic variation [19,20], school terms [21,22], city structure [23,24], and household structure [25,26] have all been shown to be associated with variation in influenza incidence. However, variation in surveillance quality and consistency across locations and over time makes it difficult to identify individual-level or population-specific effects over a longer time period using routine influenza-like illness (ILI) surveillance data [27,28]. These limitations may be overcome by using serological data, where unobserved past infections and vaccinations leave a signature in an individual’s measurable antibody profile [29–31].
For influenza, measured antibody levels are the result of complex interactions of immunological responses from all past exposures [6,32]. Hence, accurate inferences of individual infection histories require models of antibody kinetics to determine the number and timing of past exposures to multiple influenza strains [8,13,33–35]. These models can be complicated, as immunological interactions of antigenic drift with immune memory occur through imprinting effects, whereby the set and order of strains in an individual’s previous exposure history influences which epitopes are targeted and the magnitude of their antibody response to subsequent exposures [6,32]. Estimating influenza infection histories from serological data therefore presents a decoding problem, as the space of possible exposure histories which could lead to an observed antibody landscape is large, and observed antibody titres are highly variable due to within-host and laboratory-level effects. Although inferences which account for these mechanisms have provided rich insights into individual-level life course immune profiles, most attempts have been in relatively small cohorts or using small panels of influenza strains, limiting the conclusions which can be drawn about population-level influenza epidemiology [13,36,37].
Here, we applied an infection history inference method to data from a large serosurvey to reconstruct lifetime individual infection histories and population-level incidence of A/H3N2 influenza in Guangzhou, China [35,36,38]. Infection histories were inferred based on individual-level antibody profiles to a panel of 20 influenza A/H3N2 strains representing viruses that first circulated from 1968 onward. The study population comes from a range of age groups, social backgrounds, and geographical areas, thereby providing an ideal dataset to investigate predictors of influenza infection and small-scale spatial variation. In fitting the model, we also obtained parameter estimates for the underlying antibody kinetics model which allows us to elucidate long- and short-term antibody dynamics. Together, these results provide detailed spatiotemporal insights into the historical epidemiological and immunological dynamics of influenza A/H3N2
Results
Description of participant data
We measured 67,683 HI titres against 20 A/H3N2 strains isolated between 1968 and 2014 at 2 to 3 year intervals from serum samples collected between 2009-12-22 and 2015-06-02 (S1 Fig) as part of the Fluscape cohort study (1,130 individuals, 2 samples each) (S2 and S3 Figs). Sampling was done over 4 study rounds, with a mean time between serum sample collection of 3.87 years (standard deviation: 0.780; 95% quantiles: 1.68 to 4.74 years). This cohort covered 40 unique locations and 651 unique households in a 60 kilometre transect from Guangzhou, China. Participant ages at the most recent sampling round ranged from 6 years to 97 years, with a median of 50 years. Vaccination rates were low in this cohort (S1 Table), consistent with low reported vaccine coverage rates in mainland China, particularly in older individuals [39,40]. We refer to an individual’s set of antibody titres against all strains in the panel as their antibody profile. All individual antibody profiles and observed changes in titre are shown in S20 and S21 Figs. Summary statistics of these profiles and full study details have been described elsewhere [38,41].
Antibody titres vary by age and in space
We saw 5 broad patterns of seropositivity (HI titre > = 1:40) when stratifying the antibody data by age group and strain (Fig 1). First, individuals had mostly low or undetectable titres against strains that circulated before they were born (cells below the black lines in Fig 1), though many individuals were seropositive against the strain which circulated in the years immediately prior to birth. Second, in the youngest age group (0 to 10 years old), many individuals were also seropositive to pre-birth strains that circulated further back in time (A/California/2004, A/Fujian/2002, A/Fujian/2000, and A/Victoria/1998), indicating the presence of cross-reactive antibodies as these individuals could not have been exposed to those strains (Fig 1C). Third, seroprevalence tended to be high among individuals who were young when a strain was first isolated (cells just above the black line in Fig 1) compared to individuals who were older at the time that the strain was first isolated. Fourth, the total proportion of individuals who seroconverted between sample collection dates was high against recent strains at nearly 50% across all age groups. Finally, some strains exhibited systematically higher titres than others, for example, titres against A/Fujian/2002 and A/Mississippi/1985 were higher than all other strains and particularly high for individuals who were under 10 years old at the time of their first circulation.
Solid black line divides age groups that were alive or not at the time of strain circulation. Seropositivity was defined as having an HI titre of ≥1:40 (a log titre of 3). (A) First serum sample. (B) Second serum sample. (C) Seroconversion between samples, defined as a ≥4-fold increase in HI titre. The data underlying this figure can be found at https://doi.org/10.5281/zenodo.12795911.
Inferring antibody kinetics and individual infection histories from antibody profiles
To infer the set of A/H3N2 influenza strains each individual was infected with, we further developed and used the serosolver R-package to fit an infection history and antibody kinetics model to all individual-level antibody profiles [35]. Briefly, the method finds the combination of influenza strains which an individual is most likely to have encountered conditional on their antibody profile, accounting for cross-reactive, transient antibody boosting, and antigenic seniority arising from repeated exposures to antigenically related strains (see Materials and methods). Individuals can be infected with the strain assumed to be circulating in each time period provided they were alive at the time of circulation. Individuals have distinct infection histories, but the parameters governing post-infection antibody kinetics were assumed to be universal, and individual infection probabilities were assumed to arise from a single population-level infection probability parameter per time period, regardless of location. A crucial component of the model is the observation level, which accounts for the fact that some strains elicit systematically higher or lower titres than others in the HI assay (see S1 Text).
We routinely fitted the same model to both the Fluscape cohort data and to previously published antibody profile data from a cohort of 69 individuals in Ha Nam, Viet Nam (see Materials and methods and [37]), and compared the inferred antibody kinetics and epidemic dynamics. Posterior distributions for the model predicted titres and infection histories compared to observed titres are shown for 5 randomly selected individuals from the Fluscape cohort in S4 Fig, and from the Ha Nam, Viet Nam cohort in S5 Fig. Overall, we imputed a total of 10,558 (posterior median; 95% credible interval (CrI): 10,394 to 10,750) distinct infections (see Post-processing of infection history posteriors in Materials and methods) across all individuals and times from the Fluscape data and 547 (posterior median; 95% CrI: 519 to 574) infections from the Ha Nam, Viet Nam data. We also ran varied scenario analyses using simulated data closely resembling the Fluscape data, demonstrating that our inference system was able to accurately recover infection histories, attack rates, and antibody kinetics parameters under a range of assumptions and model misspecifications (S2 Text).
Although there were no virologically confirmed infections reported in the Fluscape study with which to validate our infection history estimates, individuals did self-report influenza vaccination at each study visit, but only within a window of time rather than on a specific date (e.g., in the preceding calendar year; see Materials and methods). We estimated antibody boosting events (infections) with >25% posterior probability for 68.9% of time windows in which individuals self-reported influenza vaccination (62 of 90 windows from 77 individuals; S22 and S23 Figs), compared to 43.1% (mean of null simulations; 95% quantiles: 32.8% to 51.7%) of randomly selected time windows of the same duration from randomly selected individuals, suggesting that our model was more likely to infer antibody boosting events that coincided with self-reported vaccination than in randomly selected time periods. Vaccination windows in which no vaccination/infection was inferred tended to either have antibody boosts identified soon after the reported vaccination, suggesting either delayed boosting or recall error, or very low antibody titres to recent influenza strains, suggesting either inaccurate recall or no vaccine-induced antibody boosting (S23 Fig).
Antibody kinetics parameters
We obtained estimates for the antibody kinetics model parameters assumed to underlie the generation of observed influenza antibody titres. We estimated parameter values consistent with an initial, broadly reactive antibody response that decays within approximately 1 year to leave an antigenically narrow, persistent antibody response. These estimates are in line with previous estimates using a similar method applied to a smaller pilot serosurvey from the Fluscape cohort [36], as well as from model fits to the Ha Nam, Viet Nam cohort data (Table 1). However, there were some differences in the estimated magnitude of the antibody response between the Fluscape and Ha Nam data sets. Estimates from the Fluscape data suggested that infection elicited an antigenically narrow, long-term antibody boost of 1.42 log HI units (posterior median; 95% CrI: 1.39 to 1.44) compared to 1.96 log HI units (posterior median; 95% CrI: 1.89 to 2.03) from the Ha Nam data set, and an additional antigenically broad, short-term boost of 2.06 log HI units (posterior median; 95% CrI: 1.99 to 2.14) compared to 2.65 log HI units (posterior median; 95% CrI: 2.43 to 2.92). The estimated waning rate of the short-term response was similar from the 2 data sets, suggesting that the short-term response took 0.995 years (posterior median; 95% CrI: 0.963 to 1.03) to fully subside based on the Fluscape data and 1.33 years (posterior median; 95% CrI: 1.22 to 1.45) based on the Ha Nam data. Finally, the observation error standard deviation was estimated to be larger for the Ha Nam data set at 1.29 (posterior median; 95% CrI: 1.27 to 1.31) versus 0.636 (posterior median; 95% CrI: 0.631 to 0.641) (posterior medians and 95% CrI), though this is likely attributable to the different assumed observation model and measurement of more influenza strains in the latter data set.
The data underlying this table can be found at https://doi.org/10.5281/zenodo.12795911.
Inferred historical and contemporary attack rates
By combining all individual-level inferred infection histories, we obtained A/H3N2 incidence estimates in the Fluscape cohort for each 3-month window since 1968 (Fig 2A). The estimated median quarterly sample attack rate was 3.54% (median across all posterior samples; 95% CrI: 3.08% to 4.01%). This corresponded to a median annual attack rate of 19.1% (posterior median; 95% CrI: 17.2% to 20.9%), defined as the proportion of alive individuals who experienced at least 1 infection within a calendar year. This was comparable to estimates from the Ha Nam, Viet Nam data set, which gave an estimated median annual attack rate of 18.6% (posterior median; 95% CrI: 14.5% to 22.9%).
Quarterly A/H3N2 attack rate estimates in the Fluscape cohort varied over time, ranging from a minimum of 0.459% (posterior median; 95% CrI: 0.00% to 2.53%) in Q1-1977 (during the re-emergence of A/H1N1) to a maximum of 64.8% in Q1-1968 (posterior median; 95% CrI: 54.2% to 71.3%) (at the beginning of the A/H3N2 pandemic). The attack rate estimate for Q1-1985 was unusually high (51.8% posterior median; 95% CrI: 0.207% to 63.8%), suggesting that there might be residual bias from systematically higher titres for the A/Mississippi/1985 virus not captured by the antibody kinetics and measurement models. Periods of high and low attack rates were reasonably well synchronised between the Fluscape and Ha Nam data sets, though attack rates were estimated to be significantly higher in Ha Nam from 2000 to 2003, and significantly higher in the Fluscape cohort from 2010 to 2012 (S6 Fig). For example, both data sets gave very high attack rates for 1968, 1989, and 2009, and similar attack rates from 1970 to 1985 (though with substantial uncertainty in the Ha Nam estimates due to the much smaller sample size). There were also periods with clear differences—attack rate estimates were much lower in the Fluscape cohort in the early 2000s, and higher during 2010 to 2012.
Quarterly attack rate estimates were substantially higher in the Fluscape cohort after the first serum sample in Q4-2009 than the overall average, with the median quarterly attack rate since Q4-2009 estimated to be 7.79% (median of posterior samples for median per-quarter attack rate; 95% CrI: 3.46% to 21.7%) versus 2.98% before (95% CrI: 0.748% to 13.7%). Annual attack rates (defined as the proportion of individuals who were inferred to have been infected at least once per year) fluctuated, with lower attack rates in 2010, 2012, and 2013, and high attack rates in 2011 and 2014 (Table 2). These attack rate estimates were of a similar magnitude to the proportion of individuals that seroconverted between study visits against strains that circulated during that time. Surveillance data collected from the Guangdong Provincial Centre for Disease Control and Prevention influenza surveillance system during the same time period reported slightly different dynamics, with an increased number of A/H3N2 virus isolates in Q2/Q3-2009, Q3-2010, Q4-2011, and Q1/Q2-2012, as well as a higher proportion of A/H3N2 isolates in 2011 and 2012 [42]. However, direct comparison to our estimated attack rates is difficult, as the influenza surveillance network only collects nasopharyngeal swab samples from patients presenting to sentinel hospitals with ILI, whereas our estimates relate to all exposure events regardless of symptoms.
A substantial proportion of people were estimated to have been reinfected within a single year, with higher reinfection rates in years with higher overall attack rates; 33.9% (posterior median; 95% CrI: 27.0% to 41.0%) of all reinfections between 1968 and 2015 occurred since 2008, whereas only 14.9% of possible infection events were in this time frame, suggesting that reinfections were disproportionately more likely in recent time periods. Annual reinfection rates, defined as having been infected at least once in a year for multiple years, were also high for recent years (Table 3).
(A) Model predicted per-capita incidence per quarter. Attack rates were estimated by dividing the number of inferred infections by the number alive in each 3-month period. Red line shows the posterior median estimate from 1,000 posterior samples. Dark and light red shaded regions show 50% and 95% credible intervals respectively from 1,000 posterior samples. Grey shaded box shows duration of the Fluscape study—the improved precision is due to the inclusion of sera bracketing this time period. Asterisks mark times from which a strain included in the HI panel was first isolated. (B) Inferred infection histories for each individual. Each row represents an individual ordered by increasing age in years. Each column represents the time of a potential infection. Cells are shaded based on the proportion of posterior samples with an infection at that time. The grey areas show time periods prior to each individual’s birth. The data underlying this figure can be found at https://doi.org/10.5281/zenodo.12795911.
Percentages shown are posterior median and 95% credible intervals. “Attack rate” was defined as the proportion of individuals who were infected at least once in that year. “Seroconverted” gives the percentage of individuals that seroconverted to the measured strain, with ranges showing 95% binomial confidence intervals. “Reinfected” gives the percentage of people that were infected more than once in a year. The data underlying this table can be found at https://doi.org/10.5281/zenodo.12795911.
The data underlying this Table can be found at https://doi.org/10.5281/zenodo.12795911.
Attack rates varied in space but were not clearly associated with proximity
Previous analyses from this cohort found that antibody titres varied significantly between study locations after accounting for differences in demographics, suggesting that there may be differences in influenza A/H3N2 epidemiological dynamics between locations [1]. We grouped the posterior draws for the Fluscape infection histories by study location to investigate spatial patterns in attack rates. Attack rates exhibited variation between locations and over time (S7 Fig), though the timing of high attack rate periods was synchronised. S8 Fig shows snapshots from a video of attack rates over time across the study region, demonstrating that the periods of high influenza incidence are similar across the study locations, but that there is some variation in the timing and magnitude of incidence (full video in S1 Video). The overall coefficient of variation (CoV) for per-quarter attack rates across all locations and times was 1.12 (posterior median; 95% CrI: 1.04 to 1.19). This was reduced to 0.493 (posterior median; 95% CrI: 0.458 to 0.535) after aggregating the infection histories into per-year attack rates, defined as the proportion of individuals infected at least once in a calendar year. There was a strong negative correlation between the estimated posterior median CoV and per-quarter attack rate (Pearson correlation coefficient of −0.829).
To understand if this variation reflects epidemiological differences between locations or simply sampling variation, we generated a comparable null simulation where no significant spatial variation in incidence would be expected (see Spatial correlation in inferred attack rates in Materials and methods). The mean CoV for these simulations was 0.300 (95% quantiles: 0.0307 to 1.22). Given the overlapping uncertainty intervals of the model estimates with the simulations, we cannot exclude the possibility that the estimated attack rates from the Fluscape data exhibited no more variation overall than would be expected by chance if all attack rates were drawn from the same binomial distribution. S9 Fig demonstrates that this pattern is maintained across time, though with higher variation in time periods with low infection rates. Fitting a spatial nonparametric correlation function—a model describing correlation of different spatial units over time as a function of their distance to each other—revealed high correlation in attack rates between locations which showed no clear association with increasing distance from one another (S9D Fig). This consistent correlation across space was also observed when subsetting attack rates by recent (from Q1-2009 onward) or historical times (from Q1-1968 to Q1-2009), when considering either per-quarter or per-year attack rates, and also when considering the proportion of individuals in a population who seroconverted against all or only recent strains.
Age-specific infection patterns
Periods of high infection probability were largely synchronised across all individuals regardless of age, though individuals typically experienced more frequent infections in the years immediately following birth (S10 Fig). Two age-specific patterns emerged. First, almost all individuals who were alive in 1968 were almost certainly infected in or around 1968, demonstrated by the high posterior probability of infection across all individuals alive at that time. Second, the posterior probability that an individual was infected soon after birth was consistently high, demonstrated by the lower edge of the heatmap in Figs 2B and S10. The posterior mean, median, and 95% CrI on the age of first infection was 1.36, 0.75, and 0.00–6.00 years, respectively (estimates using 1,000 posterior samples for all individuals born since 1968). We note that although these augmented infection histories suggest most individuals were infected soon after birth, our model requires a number of assumptions regarding cross-reactivity to pre-birth strains and antigenic distance between strains which limits our ability to draw this conclusion, particularly given contrasting previous findings from longitudinal serological studies in children [43].
We calculated each individual’s age at the time of each infection and found that the number of infections per 10 year period decreased through childhood and became stable in adulthood (Figs 3C and S11). Overall, individuals were estimated to be infected 2.12 times per 10 year period (posterior median; 95% CrI: 1.06 to 7.86), in line with previous estimates (Fig 3D) [36]. Infection frequency patterns and trends with respect to age were similar under different assumptions for the infection history model, suggesting that these findings were driven by features of the data and not an artifact of the model structure (S12 Fig; [35]).
(A) Pointrange plot shows posterior median and 95% CrI on the total number of lifetime infections for each individual in the Fluscape cohort, ordered by increasing age at time of sampling. (B) Distribution of the total number of infections across all individuals based on the posterior median total number of infections. (C) Posterior median number of infections per 10 year period stratified by age group at the time of infection, excluding individuals who spent less than 2 years in that age group and including only time periods prior to the first serum sample in Q4-2009 (see S11 Fig for explanation and comparison using all time periods). Text shows sample size within each age group—note this does not sum to the number of individuals in the sample, as individuals contribute to multiple age groups during their lifetime. (D) Distribution of individual posterior median number of infections per 10 years alive. The data underlying this figure can be found at https://doi.org/10.5281/zenodo.12795911.
Relationship between titre and probability of infection
Our model fits estimated not only infection histories, but also each individual’s expected HI titre against all strains at each point in time, and thus we were able to estimate the relationship between probability of infection and model-predicted latent HI titre (Fig 4). For each sample from the posterior, we found the proportion of time periods (across all times and individuals) where infection was estimated to have occurred, stratified by log HI titre against the circulating strain in that time period. There was a clear pattern of decreasing relative risk of infection as a function of increasing titre (Fig 5). An HI titre of 1:40 (log titre of 3) corresponded to a risk of infection of 0.559 (posterior median; 95% CrI: 0.514 to 0.605) relative to individuals with no detectable titre, consistent with prior evidence from deliberate infection experiments [44]. This pattern appeared to vary with age. An HI titre of 1:40 gave a relative risk of infection of 0.374 (posterior median; 95% CrI: 0.312 to 0.427) in the 0 to 10 age group, but only 0.789 (posterior median; 95% CrI: 0.638 to 0.955) in the 60+ age group.
Each subplot shows one randomly selected individual. X-axis shows time since birth. Blue line and shaded region show model-predicted, true latent titre against the strain assumed to be circulating at each time period (posterior median and 95% CrI). Note that titres are continuous and represent latent, true values, not observations. Orange lines indicate times of high posterior probability of infection. Grey regions show times before birth and after the last serum sample. The data underlying this figure can be found at https://doi.org/10.5281/zenodo.12795911.
Top left panel shows the relative risk of infection at all time points stratified by model-predicted HI titre against the circulating strain just before infection for all age groups. Remaining plots show the same relationship but stratified by age group in 10-year bands at the time of infection. Solid lines and shaded regions show posterior median and 95% CrI. Note that the uncertainty intervals reflect uncertainty in the imputed infection states and latent antibody titres; the relationships shown here are empirically calculated from the serosolver estimates. Wide uncertainty intervals at higher titres reflect limited data as few individuals reach such high titres; the posterior median for number of individuals with HI titre of 1:640 or greater at the time of infection was less than 10 for all but the youngest age group. Vertical dashed line shows HI titre 1:40. Horizontal grey dashed line shows 50% protective titre. The data underlying this figure can be found at https://doi.org/10.5281/zenodo.12795911.
Discussion
Influenza A/H3N2 infection histories and attack rates exhibited substantial variation across time and locations in a cohort of 1,130 individuals around Guangzhou, China. We considered that variation in influenza antibody titres may be generated in 3 ways: (i) exposure to different combinations of viruses at different times; (ii) time-dependent antibody kinetics observed at different times relative to an exposure; (iii) random and strain-specific, systematic variation in the HI assay. We showed that accounting for these mechanisms in a modelling framework allowed for the reconstruction of each individual’s complete A/H3N2 infection history since birth conditional on their antibody profile. We reconstructed population-wide and location-specific historical infection, or, more precisely, seroincidence rates from these infection histories, finding that influenza infection incidence may be higher than suggested by routine surveillance. Also, estimates of each individual’s true antibody titre against circulating strains for each 3-month period since birth were generated, showing that elevated antibody titres were associated with a substantial reduction in infection risk that became less effective with increasing age.
We estimated that the incidence of influenza A/H3N2 infections, or at least detectable serological responses, in both the Fluscape cohort and a smaller data set from Ha Nam, Viet Nam was around 19% on average per calendar year since 1968. Results from urban and rural South African communities (the PHIRST cohort) demonstrated similarly high influenza incidence rates under routine RT-PCR testing for influenza A or B [45]. Similarly high incidence rates based on either anti-haemagglutinin or anti-neuraminidase seroconversion were also detected in New Zealand (the SHIVERS cohort) [46]. Our results and those from the PHIRST and SHIVERS studies are at the higher end of annual influenza incidence rate estimates [46–48] and suggest that some individuals may be infected with the same subtype multiple times within a year. Our annual seroincidence rate estimates were particularly high during the Fluscape study period in 2010 to 2014, ranging from 20% to 61% of individuals infected at least once per year, though the cutoff of an influenza season is not as clearly defined for this region as it is for temperate regions [2,49]. These values are higher than previous estimates based on serological data from Hong Kong for the same time period at 7% to 19% [34,50,51]. This increased incidence during recent time periods could reflect the changing antigenic diversity of influenza A/H3N2 during this time, with the emergence of multiple antigenically distinct clades providing an increased force of infection from strains with varying levels of immune escape leading to more frequent infections [52–54]. Our model is limited in capturing this changing epidemiology, as it cannot account for co-circulation of competing strains at the same time, and our HI panel did not have strains representing these multiple clades. However, if individuals in this cohort were infected with strains that were not measured in the HI panel it is likely the model would underestimate infection incidence; antigenic mismatch between the infection strain and the measured strain would lead to lower observed titres, which will lead to a lower posterior probability of infection in the model.
The estimated annual attack rates were similar between the 2 cohorts, but not for all years. A possible driver of these differences is climate, despite these 2 locations being relatively close: Ha Nam, Viet Nam has a tropical climate with no clear influenza seasons [55], whereas Guangdong, China has a subtropical climate with clear peaks of influenza typically during the summer months, though with peaks in winter/spring in recent years [56–58]. It is possible that differences in climate and behaviour partly explain differences in seasonal dynamics between the 2 locations [59–61], though past studies have been unable to reliably compare overall disease burden between locations due to differences in influenza case detection and reporting without further modelling [62]. Differences in influenza epidemiology between Ha Na, Viet Nam and Guangdong, China could be due to a combination of multiple factors including contact rates, climate effects on virus survival and behaviour, immunological history, and dominance of different A/H3N2 strains over time.
Reconstruction of lifetime infection histories from this large study population covering all age groups revealed that the frequency of influenza A/H3N2 serological responses is initially high and decreases with age and then remains stable at around 2 infections per decade through most of adult life. The pattern is consistent with previous findings from age-stratified epidemiological data [8,34,45,46,48,63], regression analyses of randomized control trial data [64], theoretical models incorporating age-specific differences in social behaviour [65,66], and data from the Fluscape pilot study [7]. Reassuringly, the clear age patterns we find here were maintained under different assumptions for the infection history model, suggesting that these patterns emerge from information in the data rather than through the model structure. However, it is important to note that we have estimated the incidence of detectable antibody responses and not necessarily clinically relevant infections—not all infections lead to seroconversions and not all antibody boosts reflect detectable virus shedding [67,68]. Similarly, although our inference method did not distinguish between vaccination and infection, vaccination coverage in this cohort was very low and thus this assumption is unlikely to bias our infection history estimates substantially.
In lieu of virologically confirmed influenza infections with which to validate our model, we used self-reported vaccination to approximate true antibody boosting events, finding that our model was more likely to impute antibody boosts during self-reported vaccination windows than in randomly selected time periods with no reported vaccination. Although this provides some support for the validity of our approach, self-reported vaccination status is not always an accurate measure of true vaccination status due to recall bias, and our model does not account for different antibody kinetics following infection and vaccination. Indeed, for most of the self-reported vaccinations where our model found limited evidence for an antibody boosting event showed limited antibody responses against recent strains; both of these limitations make it difficult to evaluate the validity of our approach based on vaccination data alone.
Although the spatial dynamics of influenza at a large scale are fairly well understood through the use of molecular data, explaining spatial dynamics at smaller scales has revealed contrasting results and remains the focus of ongoing work [12,18,22,69]. Comparable spatiotemporal analyses of seasonal ILI incidence surveillance data in Norway, Sweden, Denmark, and the USA—regions with clearer ILI seasons than Guangdong—showed high spatial correlation in epidemic phase timing and amplitude between locations thousands of kilometres apart that declined with distance [70]. Our estimates show a high correlation of attack rates in space that did not change as a function of distance at a small spatial scale (less than 60 km), suggesting that an individual’s life-course of seasonal influenza A/H3N2 infections is largely determined by when they were born and the epidemiology of their wider region, rather than precisely where they live. However, our results are limited as we do not account for the fact that individuals move over their lives, and thus their location recorded at the time their serum sample collected may not match their location earlier in life.
We inferred a negative correlation between log HI titre at the time of exposure and probability of seroresponse in line with the historic deliberate infection data of 50% protection for an HI titre of 1:40 [44,71–73]. We found that this relationship became weaker with age, consistent with recent work showing that although HI titre is a good correlate of protection in children, it is less robust in adults [8]. Non-HI-mediated protection, such as non-haemagglutinin head targeted antibodies or cellular responses, are likely to explain a greater amount of variability in immunity as individuals age [46,74]. The magnitude of our estimates do contrast with other work, which has found that the 50% protective titre threshold may be higher than 1:40 in young children [75,76]. As stated by Hobson and colleagues in 1972, care must be taken in assigning causality to the titre-mediated infection risk estimated here. In the present model, titres necessarily decreased over time following infection due to antigenic drift and short-term waning. If protection is governed by non-HI immunity that wanes at a similar rate, then the same association between titre and relative risk could be observed. Non-HI protection may also explain the findings of decreased infection frequency at older age despite decreasing titre-mediated protection, for example, via non-haemagglutinin head targeted antibodies or cellular responses [46,74]. We note that our estimates are only of relative rather than absolute risk of infection; predicting the future probability of infection for an individual would additionally require knowing the force of infection (probability of exposure) as well as knowing which strain will circulate.
Finally, we generated estimates for post-exposure antibody kinetics from 2 serological data sets. There were a number of similarities and differences in the estimated antibody kinetics parameters based on the Fluscape and Ha Nam cohort data sets. Although differences in tested strains and between-lab protocols limits direct comparison [77], both data sets gave qualitatively similar estimates suggesting a transient, antigenically broad antibody boost that waned within 2 years following exposure to leave a persistent, antigenically narrow antibody boost. The antigenic breadth of the short-term boost was broader in the Fluscape estimates, though this may be due to the inclusion of fewer recent strains and more older individuals than in the Ha Nam cohort, requiring the model to give greater weighting to boosting of historical strains (back-boosting) [37]. We estimated that the transient arm of the seroresponse waned within 1 year based on the Fluscape data compared to 1.33 years based on the Ha Nam cohort data, which is broadly in line with previous observations of antibody titres waning to near baseline approximately 1 year post vaccination [37,78–80]. Overall, this supports previous work suggesting that accounting for sub 4-fold rises in titre, issues arising from non-bracketed sera, and measurement effects may lead to greater sensitivity in identifying infections from paired titres [33,81,82].
Our study has a number of limitations. A key challenge underpinning almost all efforts to analyse quantitative antibody titre data is the variation in titres arising from laboratory processes rather than underlying epidemiology and immunology [77]. Although we adjusted our observation model to account for systematically lower or higher observed titres to particular strains, which may lead to under- or overestimation of attack rates, unaccounted for biases may still remain. External validation or estimation of the offset terms would be useful, but is difficult to do across studies given variability in laboratory protocols and serum potency [77]. We performed paired tests of the serum samples from the 2 study visits to minimise batch effects, though it is possible that the increase in titres and broader reactivity between study rounds (S2 and S3 Figs) might reflect systematic measurement bias rather than increased recent infections as inferred by our model. For example, the delay between sample collection and testing for visit 1 samples might have led to some decline in antibody concentration within the samples. Also, our data were limited in their strain coverage, as we measured titres to only 1 strain for each approximately 2-year period and assumed that that A/H3N2 antigenic evolution followed a smooth rather than punctuated trajectory through antigenic space, and thus we may infer infections with antigenic variants that an individual has never been exposed to [3]. As a sensitivity analysis (not shown), we did attempt to fit the serosolver model instead assuming punctuated changes through antigenic space, but we were unable to produce converged model fits due to the discretization of the parameter space and thus we do not present these results. However, we did explore the impact of this potential model misspecification using simulation-recovery experiments, and found that fitting a model assuming continuous antigenic changes over time when data were generated under a punctuated antigenic evolution model did not substantially bias antibody kinetics parameter or attack rate estimates, with the exception of slightly overestimating the short-term antibody waning rate (S2 Text). These assumptions about antigenic evolution, in addition to our simple way of modelling cross-reactive antibody boosting, might explain why our estimates for the age at first infection were much younger than found in serological data from longitudinal cohort studies of children, which are more reliable for estimating the precise timing of these early childhood infections [43]. They may also partially explain why the inferred seroresponse rates did not strictly align with timings of high incidence based on viral isolate data in Guangdong and Hong Kong for the same time period [42,50].
Another limitation is the simplification of the antibody kinetics model used here. The assumption of a fixed-effect term on boosting and waning for all individuals and infection events masks a substantial amount of individual-level and strain-level variation in kinetics which would be better described by random-effects terms. This is a necessary simplification to ensure identifiability of the post-exposure kinetics parameters while simultaneously inferring hundreds of thousands of latent infection states. Similarly, there are a number of immunological mechanisms, such as titre-dependent boosting and titre ceiling effects which we did not include in the model due to identifiability issues [37,83]. Our model is also limited in explaining back-boosting, as we assumed that cross-reactivity extends linearly across antigenic space. In the short-term antibody boosting arm, cross-reactivity was found to boost the entirety of antigenic space regardless of an individual’s age to account for the clear back-boosting of titres against strains encountered early in life. However, this model may be inappropriate for younger individuals, whose immune systems may not have encountered any of these historical antigens and would therefore have no targeted memory B-cells to stimulate. A model that distinguishes boosting of heterologous antibodies through back-boosting of the memory response as opposed to cross-reactive antibodies from a de novo response through targeting shared epitopes would provide a more realistic model of the observed, antigenically broad short-term boost. Finally, we did not model the kinetics of maternal antibodies, which typically wane within the first year of life [84]. The youngest individual in our dataset was 6 years old at the time of sampling, and thus all maternal antibodies would have waned by the time of observation and therefore did not contribute towards the observed antibody titres.
Antibody landscapes based on traditional HI assays, as well as multiplex antigen arrays and deep mutational scanning data, are useful tools for understanding how immunity develops following repeated infection and vaccination to antigenically related viruses [85–90]. The approach and results shown here demonstrate how these antibody profiles can be used to reconstruct lifetime infection histories at a fine spatial scale, providing a new source of augmented data with which to understand long-term epidemiological trends for influenza and other antigenically variable pathogens such as SARS-CoV-2 [91–93].
Materials and methods
Ethics statement
Study protocols and instruments were approved by the following institutional review boards: Johns Hopkins Bloomberg School of Public Health, University of Hong Kong, Guangzhou No. 12 Hospital, and Shantou University. Written informed consent was obtained from all participants over 12 years old, and verbal assent was obtained from participants 12 years old or younger. Written permission of a legally authorised representative was obtained for all participants under 18 years old.
Cohort description—Fluscape
Influenza haemagglutination inhibition (HI) titres were obtained from a previously described cohort in Guangzhou, China, called the Fluscape study (described in detail in a cohort profile [38]) [41]. The Fluscape study is a serological, contact, and demographic survey in and near Guangzhou, China. The study covers 40 locations randomly selected from a 60 km cone-shaped transect extending from Guangzhou city center into the surrounding rural area. Latitudes and longitudes of each study location were assigned based on a central place (e.g., a street or village committee center). A total of 60 households within 1 km of each chosen location were randomly selected and contacted one-by-one until 20 households with at least 1 member willing to provide a blood sample and answer the survey questionnaire were contacted. The majority of locations were classified as rural (30/40), were between 20 and 80 min travel time from Guangzhou city center, and had a population density between 256 and 367,346 persons per 9 km grid cell. All household members aged 2 or above were eligible to participate, and 5 ml of blood was taken for each blood sample from each visit. Here, we used data from a subset of serum samples, capturing 1,130 individuals who had 2 serum samples taken and analysed from 2 different rounds of sampling between 2009 and 2015 inclusive. The month of sampling, age of participant, vaccination history, and other socioeconomic covariates were available for all individuals.
Serological data
HI assays were performed for each sample to measure antibody titres against 20 A/H3N2 strains that circulated between 1968 and 2014 inclusive with approximately 2 year spacing between circulation years (A/Hong Kong/1968, A/England/1972, A/Victoria/1975, A/Texas/1977, A/Bangkok/1979, A/Philippines/1982, A/Mississippi/1985,A/Sichuan/1987, A/Beijing/1989, A/Beijing/1992, A/Wuhan/1995, A/Victoria/1998, A/Fujian/2000, A/Fujian/2002, A/California/2004, A/Brisbane/2007, A/Perth/2009, A/Victoria/2009, A/Texas/2012, A/Hong Kong/2014). The 50% tissue culture infectious dose (TCID50) for each virus was determined using Madin-Darby canine kidney (MDCK) cells, calculated using the Reed–Muench method [94]. Blood samples were kept at 4°C on the day of collection until extracted sera was frozen at −80°C until testing. Sera were thawed and treated with receptor-destroying enzyme (RDE) derived from Vibrio cholerae to remove nonspecific inhibitors prior to incubation at 56°C for 30 min, and then absorbed with Turkey red blood cells to remove substances which might lead to nonspecific agglutination. HI assays were conducted in 96-well microtiter plates with 0.5% turkey erythrocytes using 4 haemagglutination units. Sera from each individual’s first and second sample were tested side-by-side on the same plate. Repeat titres were generated for all of the strains tested from the second serum sample on a separate plate (23,686 repeat measurements in total), but were not run for the first sample due to insufficient sample volume. Titres were tested in serial 2-fold dilutions from 1:10 to 1:1,280, with the reciprocal of the highest dilution at which haemagglutination was inhibited recorded as the titre, and undetectable titres recorded as <1:10. The full laboratory protocol is described in [95].
For all analyses, titres were transformed to a log2 scale (i.e., 2-fold dilutions), where y = log2(D/5), giving log titres between 0 and 8 (undetectable titres were treated as a 0 log titre). Seroconversion between study visits was defined as a 4-fold rise in titre, equivalent to a ≥2 unit increase on the log scale. Seropositivity was defined as having a titre of ≥1:40 (log titre ≥3). Further details on laboratory testing have been described previously [41].
Summary of model
The overall inference task is to obtain estimates for the joint posterior distribution of antibody kinetics parameters (Θ), infection histories (Z) for all n individuals, and the attack rate within each of m possible discrete infection periods conditional on the set of observed HI titres (Y). Crucially, only Y is observed, so we must infer (or augment) the values of Z as latent features. Throughout the remainder of the methods, we use capital letters to represent random variables and bold letters to represent vectors of random variables. A detailed description of the inference problem and approach is described in [35], but can be summarised as sampling from the posterior distribution: where θ is the vector of antibody kinetics parameters that describes the link between Z and Y. The set of times t1,i to tmax,i gives the time periods when a serum sample was obtained from individual i. P(Y |Z, θ) is defined by the antibody kinetics and observation model, and is the infection history prior, described below.
An individual’s entire infection history is given as a vector of unobserved binary variables, Zi = [Zi,1, Zi,2, Zi,j]. Each infection event, Zi,j, is the outcome of a single Bernoulli trial, where Zi,j = 1 indicates that individual i was infected with the strain circulating in discrete time period j, Zi,j = 0 indicates that they were not. The entire infection history matrix Z for all n individuals across all m time windows was therefore represented by an n by m binary matrix. We estimated infection histories at a 3-monthly resolution, such that each individual had an unobserved infection state for each 3-month period j since birth. Individuals could be infected from the first quarter after they were born. We did not attempt to impute infection states occurring after an individual’s last serum sample. The attack rate is then given as , where Nj gives the number of individuals alive in time period j.
Infections lead to the production of antibodies (“seroresponses”) that undergo longitudinal and cross-reactive kinetics. The vector of true, latent antibody titres across all time periods is given as Ai = [Ai,1, Ai,2,…, Ai,j], which is generated from an antibody kinetics process with parameters θ described below. The process generating measured titres from the latent antibody titres is modelled through an observation level. The vector of observations is given as Yi = [Yi,1, Yi,2,…, Yi,t]. Note that the time index for Yi is different to Ai and Zi, as observations are only made at a subset of t times, whereas latent infection states and antibody titres must be represented at all j times. Yi,t is also itself a vector, as it contains HI titres against all strains measured from that serum sample. A schematic of the full model is shown in S13 Fig and an example simulated antibody landscape over time is shown in S14 Fig.
Note that the estimated attack rates and infection histories incorporate uncertainty in whether elevated titres against a particular strain result from infection with that strain and/or from cross-reactive antibodies from infection with a different, antigenically related strain. For example, a high titre measurement might be explained by a single infection with that strain (leading to a large boost) or by multiple infections with other, antigenically distant strains (leading to multiple small boosts). The antibody kinetics model accounts for both possibilities by including a homologous antibody boosting parameter and a model for cross-reactive antibody boosting as a function of antigenic distance (see below). This structure also allows us to estimate infection states during years from which we do not have a representative influenza strain in the HI panel. For example, in the time period 2010 to 2014 (Table 2), the model samples possible infection histories for that time period where elevated titres against A/Perth/2009, A/Victoria/2009, A/Texas/2012, and A/Hong Kong/2014 could reflect strain-specific antibody boosting from infections in 2010, 2012, and 2014, or cross-reactive antibodies from infection in 2011 and 2013. Hence, we do not present a single estimate for the most plausible infection history, but rather incorporate this uncertainty into the attack rate estimates by using multiple samples from the posterior for each individual’s infection history.
Antibody kinetics, antigenic map, and observation model
We used an existing deterministic model to describe the generation of observed antibody titres following exposure to an influenza strain [35,36]. The model has 3 components: (i) the antibody kinetics model; (ii) the antigenic map; and (iii) the observation model. The antibody kinetics model describes linear short- (μs) and long-term (μl) antibody boosting on the log scale immediately following infection. The short-term boost wanes over time such that the remaining short-term boost j − k time periods after infection at time k is given by μsw(k,j) = μsmax{0, 1 − ω(j-k)}. Long-term boosting is persistent. In addition to boosting antibodies against the infecting strain, infection also elicits the production of cross-reactive antibodies against antigenically related strains. Cross-reactivity is assumed to decrease linearly with antigenic distance by a factor of dl(k, j) = max(0, 1 − σδk,j), where δk,j represents the antigenic distance between the infecting strain k and the measured strain j. The short- and long-term boost for a given strain is therefore given by μldl(k, j) = max{0, μl(1 − σlδk,j)} and μsds(k, j) = max{0, μs(1 − σsδk,j)}, respectively. Finally, antigenic seniority by suppression was included, wherein the full amount of boosting decreased linearly by a proportion τ for every infection following the first. Boosting was scaled by s(Zi,j, j) = max{0,(1 − τ(Nj − 1))} after each infection, where Nj − 1 is the number of previous infections before strain j. The full model, f(A|Z,θ) (where θ = {μs, μl, ω, σl, σs, τ}), for expected titre for individual i measured at time t against strain j, Ai,j,t, is given by:
A key component of the model is the antigenic map specifying the antigenic distance between A/H3N2 strains and thus their cross-reactivity measured by the HI assay. That is, infection with strain A generates cross-reactive antibodies which also recognise epitopes on strain B, where the degree of cross-reactivity can be modelled as the antigenic distance between the 2 strains. In the serosolver model, antigenic distance between strain k and strain j, δk,j, was given by the Euclidean distance between them on the antigenic map. In theory, we might jointly estimate the antigenic map alongside the other model parameters, but at present this is computationally infeasible and thus we assume a fixed antigenic map for model fitting.
We used the antigenic coordinates of the 20 strains measured in the Fluscape study to represent the strains circulating in each time period (S15 Fig). We fit a cubic smoothing spline with low amounts of smoothing (smoothing parameter = 0.3) through the coordinates to provide a comparable model to previous analyses and to smooth over large jumps in the posterior surface, which greatly aids in model convergence by smoothing over multiple posterior modes (see S2 Text) [36]. We also attempted to fit a version of the model assuming punctuated rather than continuous evolution through antigenic space, placing strains into clusters based on previous analyses of A/H3N2 strains in China [96]. Although this punctuated version of the cross-reactivity model may be more realistic [13], the posterior distribution under this model was multimodal and thus we were unable to produce reliably converged model fits.
We assumed that log HI titres observed at time t were normally distributed with mean Ai,j,t and variance ε as in [35,36], with censoring to account for the upper and lower bounds of the assay. The probability of observing an empirical titre at time t within the limits of a particular assay Yi,j,t ∈ {0,…, qmax} given expected titre Ai,j,t is given by: where qmax = 8 and , the probability density function of the normal distribution and ε is the standard deviation.
Fitting the model as described so far to the HI titre data led to systematic under- or overestimation of titres to certain strains (S16 and S17 Figs). We therefore introduced fixed, strain-specific measurement offsets similar to [97]. The procedure for estimating these offsets is described in S1 Text. In short, we added fixed offsets to each predicted titre as: , where χj is the measurement offset for strain j. These additional offset parameters aim to capture the residual observation error not explained by the estimated infection histories, antibody kinetics, and normally distributed observation error.
Infection history prior
The choice of prior for the infection history matrix Z is discussed in detail in [35]. The crux of the problem is that the choice of prior P(Z) determines not only P(Zi,j = 1), but also the prior distribution of total number of lifetime infections, attack rate in a given time period, and time between infections. As we are interested here in reconstructing historical attack rates, we chose to place a Beta prior on the probability of infection in a given time window (prior version 2 in serosolver). The infection history matrix Z is then a Beta-Bernoulli distributed variable such that: where B is the Beta function; kj = ∑iZi,j is the total number of infections across all individuals during time period j, and nj is the number of individuals that could be infected during time period j. Values for α and β can then be set to give known priors and variance on the total number of infections as and . When α = β, the attack rate prior has an expectation of 0.5n, and the variance may be decreased by increasing α and β. Here, we set α = β = 1. This choice of prior also implicitly assumes that the total number of lifetime infections for an individual is binomially distributed with success probability p = α/(α+β) and N = mi, where mi is the number of time periods that individual i could be infected.
Inference using Markov chain Monte Carlo
All models were fitted using the Markov chain Monte Carlo (MCMC) algorithm implemented in the serosolver R package. This is a custom, adaptive Metropolis–Hastings algorithm with alternating univariate normal proposals for the model parameters θ and custom proposals Z for the infection history states. Step sizes for all antibody kinetics parameter proposals were also scaled automatically during the burn in to achieve an acceptance rate of 0.44 for all parameters. Uniform priors were placed on all antibody kinetics parameters θ shown in S4 Table. Infection history state proposals were randomly chosen between one of 2 options: (i) select 2 potential infection times 12 time periods apart and swap infection states for all individuals that could have been infected at these times; (ii) randomly select 20% of individuals and for each individual, with 50% probability, either sample new infection state values for all possible infection times, or choose 2 potential infection times 12 time units apart and swap their values.
For the main model results, 5 chains were run for 50,000,000 iterations, with the first 20,000,000 discarded as burn-in, to achieve an effective sample size of >200 for all inferred parameters. Convergence was assessed visually and using the Gelman-Rubin diagnostic criteria () with the coda R-package [98]. Note that the huge number of iterations is due to the need to impute nearly 4*N*T = 4*1130*190 = 858,800 latent binary variables in Z (i.e., the infection state for each individual in each 3-month window since 1968, slightly less as individuals could not be infected prior to birth). The ability for the model to recover ground-truth parameters was explored through simulation-recovery, described in S2 Text.
Post-processing of infection history posteriors
When fitting to the Fluscape data set, the serosolver model occasionally imputed continuous runs of repeated infections in adjacent 3-month windows (e.g., imputing infections in Q1-1968, Q2-1968, and Q3-1968), reflecting either genuine repeat infections or the model explaining titres that were higher than a single antibody boost could explain. These infection runs were relatively rare, but were most common for infection in young children and occasionally for recent time periods (S18 and S19 Figs). Runs of repeat infections were not estimated when fitting to simulated data, suggesting that their occurrence in the Fluscape data represents either genuine repeat infections early in life or the model explaining titres that were higher than a single antibody boost could explain. Although these reinfections may be real, particularly among children [99,100], this may be a limitation of the model, which sometimes explains high antibody titres through multiple infections rather than from a single infection eliciting a large boost. Our pipeline therefore included a post-processing step to count these runs as single infection events starting on the date of the first infection in the run. This reduced the total number of distinct infection events from 11,544 infections (posterior median; 95% CrI: 11,358 to 11,736) to 10,558 (posterior median; 95% CrI: 10,394 to 10,750). The attack rate estimates and reinfection rates from the posterior samples prior to removing the runs of consecutive infections are shown in S2 Table, showing that although the post-processing step only slightly reduced overall attack rate estimates, it substantially reduced the reinfection rate estimates, which were as high as 8% in 2013 prior to removing consecutive infections.
We also carried out an additional data augmentation step when calculating annual attack rates for more recent time periods, as we are unable to infer infection states after the latest serum sample for each individual. For example, if an individual’s final serum sample is in Q3-2014 and they have been estimated to have no infections in Q1 or Q2, it is not known if the individual should contribute to the numerator and/or denominator when calculating the annual attack rate for 2014. However, because the model assumes that the per-time infection probability is Beta-distributed, it is straightforward to sample new infection states from the model prior to seeing any antibody data using the probability of infection given by (described in [35]):
Where Z−i,j gives the infection states of all individuals other than i at time j, α, and β are parameters of the Beta prior (assumed to both be 1), kj is the number of infections in Z-i,j and n is the number of alive individuals at time point j. For every individual with an unknown infection state following their final serum sample, we resample a new infection state using the above formula for each posterior draw.
Fitting to longitudinal HI titre data from Ha Nam, Viet Nam
To provide a comparison for the model fits to the Fluscape data, we fit the same model to a publicly available data set from Ha Nam, Viet Nam. This data set consisted of 69 participants, each with between 1 and 6 (inclusive) serum samples taken annually from 2007 to 2012 as described previously [37]. In this cohort, HI assays were performed against a panel of up to 57 A/H3N2 strains isolated between 1968 and 2008, with greater sampling of titres against more recent strains. This represents a comparable data set with different dimensions: a much smaller sample size, but far more titres and samples tested per individual. Unlike the Fluscape data, we only had access to the year of sample collection, and thus we could only estimate infection histories at an annual resolution. Furthermore, we did not have dates of birth available for the individuals, and thus we assumed that all individuals were born prior to 1968, noting that age signals will still be detected based on the individual’s antibody profile. For this analysis, we did not include the strain-specific measurement offsets in the observation model, as there were multiple strains tested for each time period. We ran the MCMC algorithm for 5 chains each for 1,500,000 iterations and discarded the first 500,000 iterations as burn-in, which was sufficient to achieve effective sample sizes of >200 and upper 95% confidence intervals of the values of less than 1.1 for all estimated parameters.
Spatial correlation in inferred attack rates
To test for patterns of influenza incidence in space, nonparametric correlation as a function of distance was tested using the Sncf function from the ncf R-package [101], where observations were the quarterly attack rate estimates stratified by location ID (40 locations, 190 observations per location). We fit these spline correlograms to 100 samples from the posterior distribution of attack rates, resampling the 40 locations with replacement for each sample to generate posterior medians and 95% CrIs. This analysis was repeated using estimated attack rates since 1968, since 2009, and pre-2009. To provide a comparable null simulation where no significant spatial variation would be expected, we simulated 40 draws from a binomial distribution with n = 25 (i.e., 1,000 individuals across 40 locations) and success probability drawn from a uniform distribution between 0 and 1, and repeated this process 10,000 times to calculate the mean and 95% quantiles of the resulting coefficients of variation.
Relationship between titre and probability of infection
For the analyses investigating the relationship between titre at time of infection and probability of infection, we drew samples from the estimated posterior distribution of antibody kinetics parameters and infection histories, and calculated model-predicted preinfection latent antibody titres against the circulating strain for each individual at each possible infection time. Predicted titres were converted to integers, with titres ≥8 assumed to be 8 to match the observation process. We then calculated the proportion of time periods where infection occurred (Zi,j = 1) (the overall probability of infection) stratified by log titre at the time of infection relative to the overall probability of infection with a log titre of 0. As discussed above, some of the inferred infection histories had runs of infections in consecutive time periods (e.g., Zi = [0, 0, 0, 1, 1, 1, 0, 0]). We removed these consecutive infections from the probability of infection statistics (e.g., [0, 0, 0, 1, 1, 1, 0, 0] becomes [0, 0, 0, 1, 0, 0, 0, 0]). We repeated this process for 1,000 posterior samples to generate median and 95% CrI estimates on the relationship between titre and relative risk of infection.
Validation using self-reported vaccination
We validated our inference approach by comparing the model-predicted probability of infection to self-reported influenza vaccination. Vaccination coverage was low in this cohort (S1 Table), and self-reported vaccination status is not a perfect reflection of true vaccination status. Furthermore, vaccination status was reported only within a window of time rather than on a specific date: individuals were asked if they had ever received an influenza vaccination and could give one of 6 answers: never vaccinated; in the same calendar year; in the preceding calendar year; 2 to 5 years ago; 5+ years ago; or unsure. Individuals were also asked if they had been vaccinated since their last study visit. Where an individual reported vaccination in one of these time periods, we define the covered time period as the “vaccination window.”
To assess the accuracy of our inference approach, we defined sensitivity as the proportion of vaccination windows +/− 3 months which contained at least 1 inferred infection with more than 25% posterior probability (i.e., we expect vaccination windows to contain inferred infections if our inference is accurate). However, a complication is that these windows are wide—an individual reporting vaccination 2 to 5 years ago has a 3-year wide vaccination window, and thus the individual is likely to have been infected during that time regardless of vaccination. Therefore to provide a comparable null simulation, we took each of the windows of time with reported vaccination and reassigned them to randomly chosen individuals at random times (e.g., if individual 1 reported vaccination between Q1-2013 and Q1-2015; the null simulation reassignment might move the window to individual 2 in Q2-1980 to Q2-1982, regardless of individual 2’s vaccination status). We repeated this process 100 times to generate 95% uncertainty intervals for the null simulation. We then compared the proportion of vaccination windows which contained inferred infections based on reported vaccination to the null simulations. If the upper 95% uncertainty interval of the null simulation was lower than the proportion of vaccination windows with inferred infections using the real data, we took this as evidence that our model was more likely to infer antibody boosting events that coincided with self-reported vaccination than in randomly selected time periods.
Supporting information
S1 Fig. Distribution of serum sampling times from the Fluscape cohort.
First and second visits refer to an individual’s serum sample order, which may differ from the sample collection round of the overall study. The data underlying this figure can be found at https://doi.org/10.5281/zenodo.12795911.
https://doi.org/10.1371/journal.pbio.3002864.s001
(TIF)
S2 Fig. Distribution of log HI titres by study location at first serum sample.
Each cell represents the log HI titre for 1 individual measured against 1 strain, shown on the x-axis. Locations were grouped into quintiles based on increasing distance from Guangzhou city center (bottom panels). Individuals were grouped by age and plotted with increasing age. Colours to the left of each subplot show age group. The data underlying this figure can be found at https://doi.org/10.5281/zenodo.12795911.
https://doi.org/10.1371/journal.pbio.3002864.s002
(TIF)
S3 Fig. Distribution of log HI titres by study location at second serum sample.
Each cell represents the log HI titre for 1 individual measured against 1 strain, shown on the x-axis. Locations were grouped into quintiles based on increasing distance from Guangzhou city center (bottom panels). Individuals were grouped by age and plotted with increasing age. Colours to the left of each subplot show age group. The data underlying this figure can be found at https://doi.org/10.5281/zenodo.12795911.
https://doi.org/10.1371/journal.pbio.3002864.s003
(TIF)
S4 Fig. Example inferred latent antibody titres and infection histories.
(A) Model-predicted titres compared to observed HI titres at each sampling time for 5 randomly selected individuals. Rows represent individuals. Subplots show antibody titres based on serum samples taken at that time. X-axis represents a position along the antigenic summary path. Black diamonds show observed titres. Black line and green shaded regions show posterior median and 95% credible intervals (CrIs) on model-predicted latent titres (dark green) and 95% prediction intervals (light green). Orange bars show posterior probability of infection in that 3-month window. Grey rectangles denote the limit of detection of the HI assay. Purple rectangles show time periods before birth. (B) Posterior median and 95% CrI for the cumulative number of infections over time from birth (purple dashed line). The data underlying this figure can be found at https://doi.org/10.5281/zenodo.12795911.
https://doi.org/10.1371/journal.pbio.3002864.s004
(TIF)
S5 Fig. Model fit to data from Ha Nam, Viet Nam.
(A) Model-predicted titres compared to observed HI titres at each sampling time for 5 randomly selected individuals, as in S4 Fig. Diamonds show titre measurements; green shaded region shows 95% CrI and 95% prediction intervals; dashed line shows posterior median; orange bars show posterior probability of infection in a given time window. (B) Posterior median and 95% credible intervals (CrI) for the cumulative number of infections over time from birth (orange). Note that date of birth information was not available for these individuals. The data underlying this figure can be found at https://doi.org/10.5281/zenodo.12795911.
https://doi.org/10.1371/journal.pbio.3002864.s005
(TIF)
S6 Fig. Comparison of annual attack rates using data from Ha Nam, Viet Nam and the Fluscape study in Guangzhou, China.
Annual attack rates were defined here as the proportion of individuals who experienced at least 1 infection per year. Yellow shaded regions show time periods where >95% or <5% of posterior samples suggested a greater attack rate in the Fluscape cohort than Ha Nam, whereas green shaded regions show time periods where between 25% and 75% of posterior draws suggested a greater attack rate in the Fluscape cohort. Some time periods showed high uncertainty for the Ha Nam data set, as few individuals in the sample were alive during that time (e.g., 1969–1980). Attack rate estimates were estimated to be higher in the Fluscape cohort from 2000 to 2003 inclusive with more than 95% posterior probability. This might reflect a genuine different in A/H3N2 epidemiology during that time, but may also be partially driven by systematic biases in titre measurements to strains isolated during that time period—the fits to the Fluscape data include a positive offset term for titres against A/Fujian/2002, which leads to lower attack rate estimates in that time period, whereas fits to the Ha Nam data do not. The time period from 2010 to 2012 also shows higher attack rates in the Fluscape cohort but with similar relative patterns. The data underlying this figure can be found at https://doi.org/10.5281/zenodo.12795911.
https://doi.org/10.1371/journal.pbio.3002864.s006
(TIF)
S7 Fig. Quarterly attack rates across the 40 Fluscape study locations.
Each row represents 1 study location ordered by increasing distance from Guangzhou city center. Each column represents a 3-month period. Cells are shaded by (A) the posterior median inferred attack rate or (B) the coefficient of variation of the posterior quarterly attack rate estimate for each location. The data underlying this figure can be found at https://doi.org/10.5281/zenodo.12795911.
https://doi.org/10.1371/journal.pbio.3002864.s007
(TIF)
S8 Fig. Distribution of quarterly attack rates by location over time.
Each panel is one frame from a full animation available in S1 Video. Each coloured point shows the inferred attack rate in each of the 40 locations, with size and shading reflecting the posterior median attack rate. Underlying the plot is a map of the study area, with each grid cell shaded by its log10 population density. The data underlying this figure can be found at https://doi.org/10.5281/zenodo.12795911.
https://doi.org/10.1371/journal.pbio.3002864.s008
(TIF)
S9 Fig. Spatial variation in annual attack rate estimates over time and correlation between nearby locations.
(A) Coefficient of variation, (B) overall mean, and (C) standard deviation of posterior median annual attack rate estimates from the 40 study locations. Solid lines and shaded regions show posterior medians and 95% credible intervals. In (A), the solid horizontal line shows the overall mean coefficient of variation across all time. Dashed horizontal lines show 95% quantiles of simulated coefficients of variation under the assumption that attack rates are the same across space. (D) Fitted spline correlograms showing spatiotemporal correlation in attack rates and proportion seroconverted with increasing distance. The first 3 plots show the spline correlogram calculated using the Scnf function from the ncf R-package for each of 100 posterior samples for the 40 location-specific attack rates. Solid lines and shaded regions show median and 95% quantiles of the predicted covariance function for these 100 samples, coloured by the level of aggregation used to calculate the attack rates. Each subplot shows the same calculation using either attack rates from all times, prior to Q1-2009 or Q1-2009 onwards. For the final plot (“Seroconversion”), we calculated the spatial correlation in the proportion seroconverted in each of the 40 study locations, treating strain isolation time as the time variable. Solid lines and shaded regions show median and 95% quantiles of 1,000 bootstrapped observations. We repeated the analysis using either seroconversion to all strains, or only A/Victoria/2009, A/Perth/2009, A/Texas/2012, and A/HongKong/2014. The data underlying this figure can be found at https://doi.org/10.5281/zenodo.12795911.
https://doi.org/10.1371/journal.pbio.3002864.s009
(TIF)
S10 Fig. Quarterly attack rates by birth cohort.
Model predicted per-capita incidence per quarter stratified into 5 birth cohorts. Attack rates were estimated by dividing the number of inferred infections by the number alive in each birth cohort in each 3-month period. Solid lines show the posterior median estimate from 1,000 posterior samples. Shaded regions show 95% credible intervals from 1,000 posterior samples. Grey shaded box shows duration of the Fluscape study. The data underlying this figure can be found at https://doi.org/10.5281/zenodo.12795911.
https://doi.org/10.1371/journal.pbio.3002864.s010
(TIF)
S11 Fig. Posterior median number of infections per 10-year period stratified by age group at the time of infection, including either all time periods (blue), time periods during the Fluscape study period (Q4-2009 onward; orange) or only time periods prior to the Fluscape study (prior to Q4-2009; grey).
Excludes infection states for individuals who spent less than 2 years in that age group. We present infection rate estimates using only infections from time periods prior to the first serum sample in Q4-2009 in Fig 3C. This is because there are many individuals representing the oldest age group at time of infection for time periods post Q4-2009, but relatively few from pre Q4-2009 (as individuals who were very old in historical time periods are no longer alive). In contrast, younger age groups are better represented across historical time periods (as those individuals are still alive at the time of sampling). Combining this biased representation of older individuals with much higher estimated incidence rates in recent time periods weighs the infection rate estimates for the older age groups much higher simply because most of their infections come from this time period. Therefore, we present age-stratified infection rate estimates using only pre Q4-2009 infections in the main text. The data underlying this figure can be found at https://doi.org/10.5281/zenodo.12795911.
https://doi.org/10.1371/journal.pbio.3002864.s011
(TIF)
S12 Fig. Age-specific patterns of infection under an alternative infection history model using estimates from the model version described in S1 Text.
Results shown are identical to those in Fig 3, but assuming that (i) individuals can only be infected once per year (i.e., annual resolution infection histories rather than quarterly); (ii) the infection history model is placed upon an individual’s total number of lifetime infections and not their per-time probability of infection (see [35] for further detail on implications of different prior assumptions); (iii) we did not remove runs of continuous infections from the posteriors. (A) Pointrange plot shows median and 95% CrI on the total number of lifetime infections for each individual, ordered by increasing age. (B) Distribution of the total number of infections across all individuals based on the posterior median total number of infections. (C) Posterior median number of infections per 10-year period stratified by age group at the time of infection, excluding individuals who spent less than 2 years in that age group, and including only time periods prior to Q4-2009. Text shows sample size within each age group—note this does not sum to the number of individuals in the sample, as individuals contribute to multiple age groups during their lifetime. (D) Posterior median number of infections per 10 years alive across all individuals. Under this prior, we estimated that individuals are infected 2.08 times per 10-year period (posterior median; 95% CrI: 1.04–7.62). The data underlying this figure can be found at https://doi.org/10.5281/zenodo.12795911.
https://doi.org/10.1371/journal.pbio.3002864.s012
(TIF)
S13 Fig. Schematic of the full serosolver model representing a single individual infected with 2 strains, B and H, over a 10-year time period.
(A) Example, randomly generated population-level infection probabilities. At the population level, the model describes a per-time-period probability of infection applied to the whole population. These probabilities are used to simulate a vector of latent binary infection states, Zi, for each individual as a series of independent Bernoulli trials (shown as a vector of 1s and 0s). (B) The antigenic relatedness of A/H3N2 strains is given by an antigenic map, where the degree of cross reactivity between any 2 strains is given by their Euclidean distance on the map. (C) Antibody levels against the infecting strain (strain B) are boosted and wane, given by the summation of transient short-term boosting and persistent long-term boosting. (D) Infection with strain B also induces cross-reactive antibodies against all other strains, here showing antibody levels to strain H. The degree of cross-reactivity is proportional to the antigenic distance between the infecting and measured strain. Later on, the individual is infected again, this time with strain H, inducing further antibody boosting and waning. An antigenic seniority parameter, τ, reduces each successive boost as a function of the number of previous infections, N. Snapshots of these underlying antibody kinetics are observed through serum samples (blood vials; (E) and (F)) distributed according to a truncated, discretized normal distribution with standard deviation parameter ε.
https://doi.org/10.1371/journal.pbio.3002864.s013
(TIF)
S14 Fig. Simulated antibody landscapes and infection histories over time for one individual using the antibody kinetics model.
Each subplot shows the antibody landscape for that time period. The blue region gives the antibody landscape in that time period, whereas the red region gives the antibody landscape in the preceding time period. The x-axis of each subplot gives the identity of the strain assumed to be circulating in that year. The grey region shows the time period and thus strains that circulated before the individual was born. Vertical dashed lines give the timing/strains the individual was infected with.
https://doi.org/10.1371/journal.pbio.3002864.s014
(TIF)
S15 Fig. Antigenic coordinates of measured strains and strains assumed to be circulating in each time period.
Each coloured point shows the location of the labelled strain on the antigenic map given in [37]. Strains which are further apart are less antigenically similar, and therefore exhibit less cross-reactivity following a seroresponse. The smoothing spline shows the inferred coordinates of each strain j assumed to have circulated in each 3-month time period, where each black point shows the assumed location in successive time periods. First, a cubic smoothing spline was fitted to the locations of the measured strain with smoothing parameter 0.3. Second, a linear model was fitted to predict the x-coordinate as a function of the strain isolation time. Finally, we generated predicted x-coordinates for each possible j given the circulation time from the linear model, and then used the predicted x-coordinate to predict the y-coordinate from the fitted smoothing spline. The antigenic distance between each pair of strains k and j was then calculated based on their Euclidean distance. The data underlying this figure can be found at https://doi.org/10.5281/zenodo.12795911.
https://doi.org/10.1371/journal.pbio.3002864.s015
(TIF)
S16 Fig. Quarterly incidence and individual infection histories from the Fluscape data set without strain-specific measurement offsets.
Identical to Fig 2, but without the inclusion of strain-specific measurement offsets in the observation model. (A) Model predicted per-capita incidence per quarter. Attack rates were estimated by dividing the number of inferred infections by the number alive in each 3-month period. Red line shows the posterior median estimate from 1,000 posterior samples. Dark and light red shaded regions show 50% and 95% credible intervals respectively from 1,000 posterior samples. Grey shaded box shows duration of the Fluscape study. Asterisks mark times from which a sample circulating strain was tested. (B) Inferred infection histories for each individual. Each row represents an individual ordered by increasing age in years. Each column represents the time of a potential infection. Cells are shaded based on the number of the posterior samples with an infection at that time divided by the total number of posterior samples for that infection state. The data underlying this figure can be found at https://doi.org/10.5281/zenodo.12795911.
https://doi.org/10.1371/journal.pbio.3002864.s016
(TIF)
S17 Fig. Distribution of antibody titre prediction errors (observed—model predicted) when fitting the serosolver model ignoring strain-specific measurement offsets.
(A) Distribution of titre prediction errors stratified by tested A/H3N2 strain. (B) Overall distribution of titre prediction errors across all measured viruses. Vertical red line shows x = 0; buckets to the right of the red line suggest underestimation of titres; histograms to the left of the red line buckets suggest overestimation of titres. The data underlying this figure can be found at https://doi.org/10.5281/zenodo.12795911.
https://doi.org/10.1371/journal.pbio.3002864.s017
(TIF)
S18 Fig. Runs of repeated infections are disproportionately more likely to occur early in life.
(A) Proportion of infection episodes which are estimated as runs of consecutive infections by age at start of infection run, using either all infection episodes or only those from individuals born after 1985. (B) Number of inferred infection episodes which are estimated as runs of consecutive infections stratified by the infection order in each individual’s infection history. For example, if the run is the first infection an individual has experienced, this is given an infection number of one. Of the 10,558 distinct infection episodes, 757 (posterior median; 95% CrI: 676–861) were runs of 2 consecutive infections, 79 (posterior median; 95% CrI: 62–97) were runs of 3 consecutive infections and 19 (posterior median; 95% CrI: 13–27) were runs of 4 or more. The data underlying this figure can be found at https://doi.org/10.5281/zenodo.12795911.
https://doi.org/10.1371/journal.pbio.3002864.s018
(TIF)
S19 Fig. Model fits demonstrating typical profiles where consecutive infection runs were imputed.
Model-predicted titres compared to observed HI titres at each sampling time for 2 individuals, as in S4 Fig. Diamonds show titre measurements; green shaded region shows 95% CrI and 95% prediction intervals; dashed line shows posterior median; orange bars show posterior probability of infection in a given time window. Purple rectangles show time periods prior to birth. (A) Example of an individual estimated to have experienced multiple consecutive infections immediately following birth to explain high titres. (B) Example of an individual estimated to have experienced multiple consecutive infections between the 2 serum sampling times to explain the drastic increase in titres to recently circulating strains. The data underlying this figure can be found at https://doi.org/10.5281/zenodo.12795911.
https://doi.org/10.1371/journal.pbio.3002864.s019
(TIF)
S20 Fig. Observed antibody profiles for all individuals in the Fluscape cohort.
Each subplot shows antibody levels measured against each of the 20 H3N2 strains. The x-axis shows the isolation year of the measured strain. The areas are shaded by sample number, showing titre measurements from the first (blue) and second (red) samples. Grey rectangles mark strains which circulated before that individual was born. The vertical coloured lines show the timing of the serum samples relative to the strain isolation times. Plots where the red region extends above the blue region reflect antibody boosting between the first and second serum sample. Where multiple titres were measured against the same strain from the same serum sample, we plotted the mean of the log titres. The data underlying this figure can be found at https://doi.org/10.5281/zenodo.12795911.
https://doi.org/10.1371/journal.pbio.3002864.s020
(PDF)
S21 Fig. Observed changes in antibody titre for all individuals in the Fluscape cohort.
As in S20 Fig, but showing fold-change in titre against each strain between samples. Each subplot shows the change in antibody levels measured against each of the 20 H3N2 strains. The x-axis gives the isolation year of the measured strain. The vertical grey line shows the timing of birth or 1968, whichever was later. Bars are shaded orange to denote antibody boosting and green to denote antibody waning. Horizontal dash lines indicate 2-fold boosting or waning. The data underlying this figure can be found at https://doi.org/10.5281/zenodo.12795911.
https://doi.org/10.1371/journal.pbio.3002864.s021
(PDF)
S22 Fig. Comparison of observed antibody profiles, model-predicted infection states, and self-reported vaccination status for accurately detected vaccinations.
Left-hand column shows raw data as in S20 Fig. Right-hand column shows the model-estimated posterior probability of infection (higher grey area suggested higher probability of infection) compared to self-reported vaccination states. Orange regions show time periods in which individuals reported having been vaccinated for influenza. Purple regions show time periods in which individuals reported no vaccination for influenza. Vertical dashed lines show the time of serum sample collection. Individuals were included in this plot (rather than S23 Fig) if the posterior probability of infection during one of the orange time windows was >25%. The data underlying this figure can be found at https://doi.org/10.5281/zenodo.12795911.
https://doi.org/10.1371/journal.pbio.3002864.s022
(PDF)
S23 Fig. Comparison of observed antibody profiles, model-predicted infection states, and self-reported vaccination status for missed vaccinations.
Left-hand column shows raw data as in S20 Fig. Right-hand column shows the model-estimated posterior probability of infection (higher grey area suggested higher probability of infection) compared to self-reported vaccination states. Orange regions show time periods in which individuals reported having been vaccinated for influenza. Purple regions show time periods in which individuals reported no vaccination for influenza. Vertical dashed lines show the time of serum sample collection. Individuals were included in this plot (rather than S22 Fig) if the posterior probability of infection during one of the orange time windows was <25%. The data underlying this figure can be found at https://doi.org/10.5281/zenodo.12795911.
https://doi.org/10.1371/journal.pbio.3002864.s023
(PDF)
S24 Fig. Comparison of estimated antibody landscapes from the same model fit with and without strain-specific measurement offsets.
Rows represent individuals. Subplots show antibody titres based on serum samples taken at that time. X-axis represents a position along the antigenic summary path. Black diamonds show observed titres. Black line and blue or green shaded regions show posterior median and 95% credible intervals (CrI) on model-predicted latent titres (dark blue/green) and 95% prediction intervals (light blue/green). Orange bars show posterior probability of infection in that 3-month window. Grey rectangles denote the limit of detection of the HI assay. Purple rectangles show time periods prior to birth. (A) Model-predicted titres compared to observed HI titres at each sampling time for 3 randomly selected individuals, as in S5 Fig, but from fitting the model without strain-specific measurement offsets. (B) as in (A), but with the estimated strain-specific measurement offsets included. The data underlying this figure can be found at https://doi.org/10.5281/zenodo.12795911.
https://doi.org/10.1371/journal.pbio.3002864.s024
(TIF)
S25 Fig. Estimated posterior distributions for antibody kinetics parameters (green shaded region) using the fitted model described in S1 Text compared to true values used in the simulation (blue line).
The data underlying this figure can be found at https://doi.org/10.5281/zenodo.12795911.
https://doi.org/10.1371/journal.pbio.3002864.s025
(TIF)
S26 Fig. Estimated posterior distributions for strain-specific measurement offsets (green shaded region) using the fitted model described in S1 Text compared to true values used in the simulation (blue line).
The data underlying this figure can be found at https://doi.org/10.5281/zenodo.12795911.
https://doi.org/10.1371/journal.pbio.3002864.s026
(TIF)
S27 Fig. Assessment of model fitting accuracy based on simulated data.
Results shown are from fitting the full model to simulated infection histories and antibody titres with known parameters. (A) Model-predicted titres compared to observed HI titres at each sampling time for three individuals. (B) Posterior median and 95% credible intervals (CrI) for the cumulative number of infections over time from birth (orange). Blue solid line shows the true, known cumulative number of infections. Purple dashed line shows the time of birth. (C) Posterior estimated per-capita per-3-month attack rates. Red line and shaded region shows posterior median and 95% CrI. Grey line shows the true values used for the simulation. (D) Shaded regions show posterior distributions of estimated antibody kinetics parameters. Dashed lines show the true value used for simulation. Note the x-axis range is small relative to the prior ranges in S5 Table. The data underlying this figure can be found at https://doi.org/10.5281/zenodo.12795911.
https://doi.org/10.1371/journal.pbio.3002864.s027
(TIF)
S28 Fig. Assessment of model fitting accuracy based on simulated data when strain-specific measurement offsets are ignored.
Results shown are from fitting the full model to simulated infection histories and antibody titres with known parameters where strain-specific measurement offsets are used in the simulation, but ignored in the fitted model. (A) Model-predicted titres compared to observed HI titres at each sampling time for 3 individuals. (B) Posterior median and 95% credible intervals (CrI) for the cumulative number of infections over time from birth (orange). Blue solid line shows the true, known cumulative number of infections. Purple dashed line shows the time of birth. (C) Posterior estimated per-capita per-3-month attack rates. Red line and shaded region shows posterior median and 95% CrI. Grey line shows the true values used for the simulation. (D) Shaded regions show posterior distributions of estimated antibody kinetics parameters. Dashed lines show the true value used for simulation. Note the x-axis range is small relative to the prior ranges in S5 Table. The data underlying this figure can be found at https://doi.org/10.5281/zenodo.12795911.
https://doi.org/10.1371/journal.pbio.3002864.s028
(TIF)
S29 Fig.
Antigenic maps used for the simulation (A) compared to the map produced using data from Bedford and colleagues (B), and the map which assumes a punctuated path through antigenic space (C). Axes show arbitrary antigenic dimensions. Coloured points show the position of individual strains—all labels of the same colour correspond to the same antigenic cluster. Black line shows the fitted antigenic summary path for each map. Black dots show the antigenic position of the strain corresponding to each time period; (A) and (B) assume continuous evolution through antigenic space, whereas (C) assumes that the same position is used for all strains within a cluster. The data underlying this figure can be found at https://doi.org/10.5281/zenodo.12795911.
https://doi.org/10.1371/journal.pbio.3002864.s029
(TIF)
S30 Fig. Assessment of model fitting accuracy when the antigenic map used for model fitting does not match the one used for simulation.
Results shown are from fitting the full model to simulated infection histories and antibody titres with known parameters. (A) Model-predicted titres compared to observed HI titres at each sampling time for 3 individuals. (B) Posterior median and 95% credible intervals (CrI) for the cumulative number of infections over time from birth (orange). Blue solid line shows the true, known cumulative number of infections. Purple dashed line shows the time of birth. (C) Posterior estimated per-capita per-3-month attack rates. Red line and shaded region shows posterior median and 95% CrI. Grey line shows the true values used for the simulation. (D) Shaded regions show posterior distributions of estimated antibody kinetics parameters. Dashed lines show the true value used for simulation. Note the x-axis range is small relative to the prior ranges in S5 Table. The data underlying this figure can be found at https://doi.org/10.5281/zenodo.12795911.
https://doi.org/10.1371/journal.pbio.3002864.s030
(TIF)
S31 Fig. Assessment of model fitting accuracy based when fitting a model with a smoothed antigenic map to data simulated using a punctuated antigenic map.
(A) Model-predicted titres compared to observed HI titres at each sampling time for 3 individuals. (B) Posterior median and 95% credible intervals (CrI) for the cumulative number of infections over time from birth (orange). Blue solid line shows the true, known cumulative number of infections. Purple dashed line shows the time of birth. (C) Posterior estimated per-capita per-3-month attack rates. Red line and shaded region shows posterior median and 95% CrI. Grey line shows the true values used for the simulation. Purple dashed lines show cluster transitions. (D) Shaded regions show posterior distributions of estimated antibody kinetics parameters. Dashed lines show the true value used for simulation. Note the x-axis range is small relative to the prior ranges in S5 Table. The data underlying this figure can be found at https://doi.org/10.5281/zenodo.12795911.
https://doi.org/10.1371/journal.pbio.3002864.s031
(TIF)
S1 Text. Description of methods to estimate strain-specific measurement offsets used in the main model.
https://doi.org/10.1371/journal.pbio.3002864.s032
(DOCX)
S2 Text. Description of full simulation-recovery analyses using the serosolver model.
https://doi.org/10.1371/journal.pbio.3002864.s033
(DOCX)
S1 Video. Distribution of quarterly attack rates by location over time.
Each coloured point shows the inferred attack rate in each of the 40 locations, with size and shading reflecting the posterior median attack rate. Underlying the plot is a map of the study area, with each grid cell shaded by its log10 population density. The data underlying this figure can be found at https://doi.org/10.5281/zenodo.12795911.
https://doi.org/10.1371/journal.pbio.3002864.s034
(MP4)
S1 Table. Self-reported vaccination status at time of first study visit (top) and between study visits (bottom).
Percentages exclude individuals who declined to answer, were unsure, or had missing data. The data underlying this table can be found at https://doi.org/10.5281/zenodo.12795911.
https://doi.org/10.1371/journal.pbio.3002864.s035
(XLSX)
S2 Table. Estimated attack rates and infection patterns 2010–2014 prior to removing runs of consecutive infections.
Percentages shown are posterior median and 95% credible intervals. Attack rate was defined as the proportion of individuals who were infected at least once in that year. “Reinfected” gives the percentage of people that were infected more than once in a year. Bottom table shows the percentage of individuals that were infected at least once per year in 0, 1, 2, 3, 4, or 5 years between 2010 and 2014 inclusive. The data underlying this table can be found at https://doi.org/10.5281/zenodo.12795911.
https://doi.org/10.1371/journal.pbio.3002864.s036
(XLSX)
S3 Table. Strain-specific measurement offset terms used in the main model fits.
Values shown are maximum posterior probability estimates from a less flexible version of the model fit to the same data (described in S1 Text). Values shown to 3 significant figures.
https://doi.org/10.1371/journal.pbio.3002864.s037
(XLSX)
S4 Table. Estimated antibody kinetics parameters under the model without strain-specific measurement offsets.
The data underlying this table can be found at https://doi.org/10.5281/zenodo.12795911.
https://doi.org/10.1371/journal.pbio.3002864.s038
(XLSX)
S5 Table. Description of antibody kinetics parameter values used for the simulation.
Uniform priors were used for all parameters. The data underlying this table can be found at https://doi.org/10.5281/zenodo.12795911.
https://doi.org/10.1371/journal.pbio.3002864.s039
(XLSX)
References
- 1. Lessler J, Cummings D, et al. Location-specific patterns of exposure to recent pre-pandemic strains of influenza A in southern China. Nat Commun. 2011;2:423. pmid:21829185
- 2. Cauchemez S, Ferguson NM, Fox A, Mai LQ, Thanh LT, Thai PQ, et al. Determinants of Influenza Transmission in South East Asia: Insights from a Household Cohort Study in Vietnam. PLoS Pathog. 2014;10:e1004310. pmid:25144780
- 3. Smith DJ, Lapedes AS, de Jong JC, Bestebroer TM, Rimmelzwaan GF, Osterhaus ADME, et al. Mapping the antigenic and genetic evolution of influenza virus. Science. 2004;305:371–376. pmid:15218094
- 4. Tate MD, Job ER, Deng Y-M, Gunalan V, Maurer-Stroh S, Reading PC. Playing hide and seek: how glycosylation of the influenza virus hemagglutinin can modulate the immune response to infection. Viruses. 2014;6:1294–1316. pmid:24638204
- 5. Dugan HL, Guthmiller JJ, Arevalo P, Huang M, Chen Y-Q, Neu KE, et al. Preexisting immunity shapes distinct antibody landscapes after influenza virus infection and vaccination in humans. Sci Transl Med. 2020:12. pmid:33298562
- 6. Oidtman RJ, Arevalo P, Bi G, McGough L, Russon CJ, Cruz DV, et al. Influenza immune escape under heterogeneous host immune histories. Trends Microbiol. 2021. pmid:34218981
- 7. Kucharski AJ, Lessler J, Read JM, Zhu H, Jiang CQ, Guan Y, et al. Estimating the Life Course of Influenza A(H3N2) Antibody Responses from Cross-Sectional Data. PLoS Biol. 2015;13:1–16. pmid:25734701
- 8. Ranjeva S, Subramanian R, Fang VJ, Leung GM, Ip DKM, Perera RAPM, et al. Age-specific differences in the dynamics of protective immunity to influenza. Nat Commun. 2019;10:1660. pmid:30971703
- 9. Reich NG, Brooks LC, Fox SJ, Kandula S, McGowan CJ, Moore E, et al. A collaborative multiyear, multimodel assessment of seasonal influenza forecasting in the United States. Proc Natl Acad Sci U S A. 2019;116:3146–3154. pmid:30647115
- 10. Nelson MI, Edelman L, Spiro DJ, Boyne AR, Bera J, Halpin R, et al. Molecular Epidemiology of A/H3N2 and A/H1N1 Influenza Virus during a Single Epidemic Season in the United States. PLoS Pathog. 2008;4:e1000133. pmid:18725925
- 11. Mosterín Höpping A, McElhaney J, Fonville JM, Powers DC, Beyer WEP, Smith DJ, et al. The confounded effects of age and exposure history in response to influenza vaccination. Vaccine. 2016;34:540–546. pmid:26667611
- 12. Viboud C, Bjørnstad ON, Smith DL, Simonsen L, Miller MA, Grenfell BT. Synchrony, waves, and spatial hierarchies in the spread of influenza. Science. 2006;312:447–451. pmid:16574822
- 13. Brouwer AF, Balmaseda A, Gresh L, Patel M, Ojeda S, Schiller AJ, et al. Birth cohort relative to an influenza A virus’s antigenic cluster introduction drives patterns of children’s antibody titers. PLoS Pathog. 2022;18:e1010317. pmid:35192673
- 14. Arevalo P, McLean HQ, Belongia EA, Cobey S. Earliest infections predict the age distribution of seasonal influenza A cases. Elife. 2020:9. pmid:32633233
- 15. Gouma S, Kim K, Weirick M, Gumina ME, Branche A, Topham DJ, et al. Middle-aged individuals may be in a perpetual state of H3N2 influenza virus susceptibility. Nat Commun. 2020:11. pmid:32917903
- 16. Kucharski AJ, Kwok KO, Wei VWI, Cowling BJ, Read JM, Lessler J, et al. The Contribution of Social Behaviour to the Transmission of Influenza A in a Human Population. PLoS Pathog. 2014:10. pmid:24968312
- 17. Cauchemez S, Bhattarai A, Marchbanks TL, Fagan RP, Ostroff S, Ferguson NM, et al. Role of social networks in shaping disease transmission during a community outbreak of 2009 H1N1 pandemic influenza. Proc Natl Acad Sci U S A. 2011;108:2825–2830. pmid:21282645
- 18. Charu V, Zeger S, Gog J, Bjørnstad ON, Kissler S, Simonsen L, et al. Human mobility and the spatial transmission of influenza in the United States. Salathé M, editor. PLoS Comput Biol. 2017;13: e1005382. pmid:28187123
- 19. Shaman J, Kohn M. Absolute humidity modulates influenza survival, transmission, and seasonality. Proc Natl Acad Sci U S A. 2009;106:3243–3248. pmid:19204283
- 20. Shaman J, Pitzer VE, Viboud C, Grenfell BT, Lipsitch M. Absolute Humidity and the Seasonal Onset of Influenza in the Continental United States. Ferguson NM, editor. PLoS Biol. 2010;8:e1000316. pmid:20186267
- 21. Cauchemez S, Valleron A-J, Boëlle P-Y, Flahault A, Ferguson NM. Estimating the impact of school closure on influenza transmission from Sentinel data. Nature. 2008;452:750–754. pmid:18401408
- 22. Gog JR, Ballesteros S, Viboud C, Simonsen L, Bjornstad ON, Shaman J, et al. Spatial Transmission of 2009 Pandemic Influenza in the US. PLoS Comput Biol. 2014;10:e1003635. pmid:24921923
- 23. Grantz KH, Rane MS, Salje H, Glass GE, Schachterle SE, Cummings DAT. Disparities in influenza mortality and transmission related to sociodemographic factors within Chicago in the pandemic of 1918. Proc Natl Acad Sci U S A. 2016;113:13839–13844. pmid:27872284
- 24. Dalziel BD, Kissler S, Gog JR, Viboud C, Bjørnstad ON, Metcalf CJE, et al. Urbanization and humidity shape the intensity of influenza epidemics in U.S. cities. Science. 2018;362:75–79. pmid:30287659
- 25. House T, Keeling MJ. Household structure and infectious disease transmission. Epidemiol Infect. 2009;137:654–661. pmid:18840319
- 26. Tsang TK, Lau LLH, Cauchemez S, Cowling BJ. Household Transmission of Influenza Virus. Trends Microbiol. 2016;24:123–133. pmid:26612500
- 27. Ang LW, Tien WS, Lin RT-P, Cui L, Cutter J, James L, et al. Characterization of influenza activity based on virological surveillance of influenza-like illness in tropical Singapore, 2010–2014. J Med Virol. 2016;88:2069–2077. pmid:27152935
- 28. Layne SP. Human influenza surveillance: the demand to expand. Emerg Infect Dis. 2006;12:562–568. pmid:16704802
- 29. Laurie KL, Huston P, Riley S, Katz JM, Willison DJ, Tam JS, et al. Influenza serological studies to inform public health action: best practices to optimise timing, quality and reporting. Influenza Other Respi Viruses. 2013;7:211–224. pmid:22548725
- 30. Riley S, Kwok KO, Wu KM, Ning DY, Cowling BJ, Wu JT, et al. Epidemiological characteristics of 2009 (H1N1) pandemic influenza based on paired sera from a longitudinal community cohort study. PLoS Med. 2011;8:e1000442. pmid:21713000
- 31. Wu JT, Ho A, Ma ESK, Lee CK, Chu DKW, Ho P-L, et al. Estimating infection attack rates and severity in real time during an influenza pandemic: analysis of serial cross-sectional serologic surveillance data. PLoS Med. 2011;8:e1001103. pmid:21990967
- 32. Cobey S, Hensley SE. Immune history and influenza virus susceptibility. Curr Opin Virol. 2017. pmid:28088686
- 33. Quandelacy TM, Cummings DAT, Jiang CQ, Yang B, Kwok KO, Dai B, et al. Using serological measures to estimate influenza incidence in the presence of secular trends in exposure and immuno-modulation of antibody response. Influenza Other Respi Viruses. 2021;15:235–244. pmid:33108707
- 34. Tsang TK, Perera RAPM, Fang VJ, Wong JY, Shiu EY, So HC, et al. Reconstructing antibody dynamics to estimate the risk of influenza virus infection. Nat Commun. 2022;13:1557. pmid:35322048
- 35. Hay JA, Minter A, Ainslie KEC, Lessler J, Yang B, Cummings DAT, et al. An open source tool to infer epidemiological and immunological dynamics from serological data: Serosolver. PLoS Comput Biol. 2020;16:e1007840. pmid:32365062
- 36. Kucharski AJ, Lessler J, Cummings DAT, Riley S. Timescales of influenza A/H3N2 antibody dynamics. PLoS Biol. 2018;16:e2004974. pmid:30125272
- 37. Fonville JM, Wilks SH, James SL, Fox A, Ventresca M, Aban M, et al. Antibody landscapes after influenza virus infection or vaccination. Science. 2014;346:7–9. pmid:25414313
- 38. Jiang CQ, Lessler J, Kim L, Kwok KO, Read JM, Wang S, et al. Cohort Profile: A study of influenza immunity in the urban and rural Guangzhou region of China: the Fluscape Study. Int J Epidemiol. 2016;46:dyv353.
- 39. Du Y, Jin C, Jit M, Chantler T, Lin L, Larson HJ, et al. Influenza vaccine uptake among children and older adults in China: a secondary analysis of a quasi-experimental study. BMC Infect Dis. 2023;23:225. pmid:37055738
- 40. Wang Q, Yue N, Zheng M, Wang D, Duan C, Yu X, et al. Influenza vaccination coverage of population and the factors influencing influenza vaccination in mainland China: A meta-analysis. Vaccine. 2018;36:7262–7269. pmid:30340886
- 41. Yang B, Lessler J, Zhu H, Jiang CQ, Read JM, Hay JA, et al. Life course exposures continually shape antibody profiles and risk of seroconversion to influenza. Russell CA, editor. PLoS Pathog. 2020;16:e1008635. pmid:32702069
- 42. Liu S-S, Jiao X-Y, Wang S, Su W-Z, Jiang L-Z, Zhang X, et al. Susceptibility of influenza A(H1N1)/pdm2009, seasonal A(H3N2) and B viruses to Oseltamivir in Guangdong, China between 2009 and 2014. Sci Rep. 2017;7:8488. pmid:28814737
- 43. Bodewes R, de Mutsert G, van der Klis FRM, Ventresca M, Wilks S, Smith DJ, et al. Prevalence of antibodies against seasonal influenza A and B viruses in children in Netherlands. Clin Vaccine Immunol. 2011;18:469–476. pmid:21209157
- 44. Hobson D, Curry RL, Beare AS, Ward-Gardner A. The role of serum haemagglutination-inhibiting antibody in protection against challenge infection with influenza A2 and B viruses. J Hyg. 1972;70:767–777. pmid:4509641
- 45. Cohen C, Kleynhans J, Moyes J, McMorrow ML, Treurnicht FK, Hellferscee O, et al. Asymptomatic transmission and high community burden of seasonal influenza in an urban and a rural community in South Africa, 2017–18 (PHIRST): a population cohort study. Lancet Glob Health. 2021;9:e863–e874. pmid:34019838
- 46. Huang QS, Bandaranayake D, Wood T, Newbern EC, Seeds R, Ralston J, et al. Risk Factors and Attack Rates of Seasonal Influenza Infection: Results of the Southern Hemisphere Influenza and Vaccine Effectiveness Research and Surveillance (SHIVERS) Seroepidemiologic Cohort Study. J Infect Dis. 2018. pmid:30016464
- 47. Hayward AC, Fragaszy EB, Bermingham A, Wang L, Copas A, Edmunds WJ, et al. Comparative community burden and severity of seasonal and pandemic influenza: results of the Flu Watch cohort study. Lancet Respir Med. 2014;2:445–454. pmid:24717637
- 48. Horby P, Mai LQ, Fox A, Thai PQ, Thi Thu Yen N, Thanh LT, et al. The Epidemiology of Interpandemic and Pandemic Influenza in Vietnam, 2007–2010. Am J Epidemiol. 2012;175:1062–1074.
- 49. Ye C, Zhu W, Yu J, Li Z, Zhang Y, Wang Y, et al. Understanding the complex seasonality of seasonal influenza A and B virus transmission: Evidence from six years of surveillance data in Shanghai, China. Int J Infect Dis. 2019;81:57–65.
- 50. Wei VWI, Wong JYT, Perera RAPM, Kwok KO, Fang VJ, Barr IG, et al. Incidence of influenza A(H3N2) virus infections in Hong Kong in a longitudinal sero-epidemiological study, 2009–2015. PLoS ONE. 2018;13:e0197504. pmid:29795587
- 51. Kwok KO, Riley S, Perera RAPM, Wei VWI, Wu P, Wei L, et al. Relative incidence and individual-level severity of seasonal influenza A H3N2 compared with 2009 pandemic H1N1. BMC Infect Dis. 2017;17:337. pmid:28494805
- 52.
European Centre for Disease Prevention and Control. Influenza virus characterisation, summary Europe, May 2015. Stockholm: ECDC; 2015. Available from: http://ecdc.europa.eu/en/publications/Publications/influenza-virus-characterisation-may-2015.pdf.
- 53. Chambers BS, Parkhouse K, Ross TM, Alby K, Hensley SE. Identification of Hemagglutinin Residues Responsible for H3N2 Antigenic Drift during the 2014–2015 Influenza Season. Cell Rep. 2015;12:1–6. pmid:26119736
- 54. Shi W, Ke C, Fang S, Li J, Song H, Li X, et al. Co-circulation and persistence of multiple A/H3N2 influenza variants in China. Emerg Microbes Infect. 2019. pmid:31373538
- 55. Nguyen DNT, Mai LQ, Bryant JE, Hang NLK, Hoa LNM, Nadjm B, et al. Epidemiology and etiology of influenza-like-illness in households in Vietnam; it’s not all about the kids! J Clin Virol. 2016;82:126–132. pmid:27479176
- 56. Lin J, Kang M, Zhong H, Zhang X, Yang F, Ni H, et al. Influenza seasonality and predominant subtypes of influenza virus in Guangdong, China, 2004–2012. J Thorac Dis. 2013;5:S109. pmid:23977430
- 57. Tan J, Liang L, Huang P, Ibrahim AA, Huang Z, Zhao W, et al. Changes in Influenza Activities Impacted by NPI Based on 4-Year Surveillance in China: Epidemic Patterns and Trends. J Epidemiol Glob Health. 2023;13:539–546. pmid:37535238
- 58. Huang P, Ni H, Shen G, Zhou H, Peng G, Liu S. Analysis of the 1991–2000 influenza epidemic in Guangdong Province, China. Southeast Asian J Trop Med Public Health. 2001;32:787–790. pmid:12041555
- 59. Ng S, Gordon A. Influenza Burden and Transmission in the Tropics. Curr Epidemiol Rep. 2015;2:89. pmid:25938010
- 60. Tamerius JD, Shaman J, Alonso WJ, Bloom-Feshbach K, Uejio CK, Comrie A, et al. Environmental Predictors of Seasonal Influenza Epidemics across Temperate and Tropical Climates. PLoS Pathog. 2013;9:e1003194. pmid:23505366
- 61. Tamerius J, Nelson MI, Zhou SZ, Viboud C, Miller MA, Alonso WJ. Global Influenza Seasonality: Reconciling Patterns across Temperate and Tropical Regions. Environ Health Perspect. 2011. pmid:21097384
- 62. Chowell G, Viboud C, Simonsen L, Miller M, Alonso WJ. The reproduction number of seasonal influenza epidemics in Brazil, 1996–2006. Proc R Soc Lond B Biol Sci. 2010;277:1857. pmid:20150218
- 63. Monto AS, Sullivan KM. Acute respiratory illness in the community. Frequency of illness and the agents involved. Epidemiol Infect. 1993;110:145–160. pmid:8432318
- 64. Jayasundara K, Soobiah C, Thommes E, Tricco AC, Chit A. Natural attack rate of influenza in unvaccinated children and adults: a meta-regression analysis. BMC Infect Dis. 2014;14:670. pmid:25495228
- 65. Read JM, Lessler J, Riley S, Wang S, Tan LJ, Kwok KO, et al. Social mixing patterns in rural and urban areas of southern China. Proc R Soc Lond B Biol Sci. 2014;281:20140268. pmid:24789897
- 66. Kwok KO, Cowling BJ, Wei VWI, Wu KM, Read JM, Cummings D, et al. Social contacts and the locations in which they occur as risk factors for influenza infection. Proc R Soc B. 2014;281. pmid:25009062
- 67. Leung NHL, Xu C, Ip DKM, Cowling BJ. Review Article: The Fraction of Influenza Virus Infections That Are Asymptomatic: A Systematic Review and Meta-analysis. Epidemiology. 2015;26:862–872. pmid:26133025
- 68. Petrie JG, Ohmit SE, Johnson E, Cross RT, Monto AS. Efficacy studies of influenza vaccines: effect of end points used and characteristics of vaccine failures. J Infect Dis. 2011;203:1309–1315. pmid:21378375
- 69. Viboud C, Nelson MI, Tan Y, Holmes EC. Contrasting the epidemiological and evolutionary dynamics of influenza spatial transmission. Philos Trans R Soc Lond B Biol Sci. 2013;368:20120199. pmid:23382422
- 70. Morris SE, Freiesleben de Blasio B, Viboud C, Wesolowski A, Bjørnstad ON, Grenfell BT. Analysis of multi-level spatial data reveals strong synchrony in seasonal influenza epidemics across Norway, Sweden, and Denmark. PLoS ONE. 2018;13:e0197519. pmid:29771952
- 71. Coudeville L, Bailleux F, Riche B, Megas F, Andre P, Ecochard R. Relationship between haemagglutination-inhibiting antibody titres and clinical protection against influenza: development and application of a bayesian random-effects model. BMC Med Res Methodol. 2010;10:18. pmid:20210985
- 72. Fox A, Mai LQ, Thanh LT, Wolbers M, Le Khanh HN, Thai PQ, et al. Hemagglutination inhibiting antibodies and protection against seasonal and pandemic influenza infection. J Infect. 2015;70:187–196. pmid:25224643
- 73. Longini IM Jr, Koopman JS, Monto AS, Fox JP. Estimating household and community transmission parameters for influenza. Am J Epidemiol. 1982;115:736–751. pmid:7081204
- 74. Treanor J, Wright PF. Immune correlates of protection against influenza in the human challenge model. Dev Biol. 2003;115:97–104. pmid:15088780
- 75. Black S, Nicolay U, Vesikari T, Knuf M, Del Giudice G, Della Cioppa G, et al. Hemagglutination Inhibition Antibody Titers as a Correlate of Protection for Inactivated Influenza Vaccines in Children. Pediatr Infect Dis J. 2011;30:1081–1085. pmid:21983214
- 76. Tsang TK, Cauchemez S, Perera RAPM, Freeman G, Fang VJ, Ip DKM, et al. Association Between Antibody Titers and Protection Against Influenza Virus Infection Within Households. J Infect Dis. 2014;210:684–692. pmid:24676208
- 77. Wood JM, Major D, Heath A, Newman RW, Höschler K, Stephenson I, et al. Reproducibility of serology assays for pandemic influenza H1N1: Collaborative study to evaluate a candidate WHO International Standard. Vaccine. 2012;30:210–217. pmid:22100887
- 78. Wright PF, Sannella E, Shi JR, Zhu Y, Ikizler MR, Edwards KM. Antibody Responses After Inactivated Influenza Vaccine in Young Children. Pediatr Infect Dis J. 2008;27:1004–1008. pmid:18833023
- 79. Hsu JP, Zhao X, Chen MI-C, Cook AR, Lee V, Lim WY, et al. Rate of decline of antibody titers to pandemic influenza A (H1N1-2009) by hemagglutination inhibition and virus microneutralization assays in a cohort of seroconverting adults in Singapore. BMC Infect Dis. 2014;14:414. pmid:25066592
- 80. Young B, Zhao X, Cook AR, Parry CM, Wilder-Smith A, I-Cheng MC. Do antibody responses to the influenza vaccine persist year-round in the elderly? A systematic review and meta-analysis. Vaccine. 2017;35:212–221. pmid:27939013
- 81. Cauchemez S, Horby P, Fox A, Mai LQ, Thanh LT, Thai PQ, et al. Influenza Infection Rates, Measurement Errors and the Interpretation of Paired Serology. PLoS Pathog. 2012;8:e1003061. pmid:23271967
- 82. Tsang TK, Fang VJ, Perera RAPM, Ip DKM, Leung GM, Peiris JSM, et al. Interpreting Seroepidemiologic Studies of Influenza in a Context of Nonbracketing Sera. Epidemiology. 2016;27:152–158. pmid:26427725
- 83. Jacobson RM, Grill DE, Oberg AL, Tosh PK, Ovsyannikova IG, Poland GA. Profiles of influenza A/H1N1 vaccine response using hemagglutination-inhibition titers. Hum Vaccin Immunother. 2015;11:961–969. pmid:25835513
- 84. Li M, Wang W, Chen J, Zhan Z, Xu M, Liu N, et al. Transplacental transfer efficiency of maternal antibodies against influenza A(H1N1)pdm09 virus and dynamics of naturally acquired antibodies in Chinese children: a longitudinal, paired mother–neonate cohort study. Lancet Microbe. 2023;4:e893–e902. pmid:37827184
- 85. Hinojosa M, Shepard SS, Chung JR, King JP, McLean HQ, Flannery B, et al. Impact of Immune Priming, Vaccination, and Infection on Influenza A(H3N2) Antibody Landscapes in Children. J Infect Dis. 2021;224:469–480. pmid:33090202
- 86. Daulagala P, Mann BR, Leung K, Lau EHY, Yung L, Lei R, et al. Imprinted Anti-Hemagglutinin and Anti-Neuraminidase Antibody Responses after Childhood Infections of A(H1N1) and A(H1N1)pdm09 Influenza Viruses. MBio. 2023;14:e0008423. pmid:37070986
- 87. Li Z-N, Liu F, Gross FL, Kim L, Ferdinands J, Carney P, et al. Antibody Landscape Analysis following Influenza Vaccination and Natural Infection in Humans with a High-Throughput Multiplex Influenza Antibody Detection Assay. MBio. 2021:12. pmid:33531397
- 88. Auladell M, Phuong HVM, Mai LTQ, Tseng Y-Y, Carolan L, Wilks S, et al. Influenza virus infection history shapes antibody responses to influenza vaccination. Nat Med. 2022;28:363–372. pmid:35177857
- 89. Ertesvåg NU, Cox RJ, Lartey SL, Mohn KG-I, Brokstad KA, Trieu M-C. Seasonal influenza vaccination expands hemagglutinin-specific antibody breadth to older and future A/H3N2 viruses. NPJ Vaccines. 2022;7:67. pmid:35750781
- 90. Levy S, Abd Alhadi M, Azulay A, Kahana A, Bujanover N, Gazit R, et al. FLU-LISA (fluorescence-linked immunosorbent assay): high-throughput antibody profiling using antigen microarrays. Immunol Cell Biol. 2023;101:231–248. pmid:36567516
- 91. Pušnik J, Monzon-Posadas WO, Zorn J, Peters K, Baum M, Proksch H, et al. SARS-CoV-2 humoral and cellular immunity following different combinations of vaccination and breakthrough infection. Nat Commun. 2023;14:572. pmid:36732523
- 92. Sun K, Tempia S, Kleynhans J, von Gottberg A, McMorrow ML, Wolter N, et al. Rapidly shifting immunologic landscape and severity of SARS-CoV-2 in the Omicron era in South Africa. Nat Commun. 2023;14:246. pmid:36646700
- 93. Antigenic mapping of emerging SARS-CoV-2 omicron variants BM.1.1.1, BQ.1.1, and XBB.1. Lancet Microbe. 2023. pmid:36657480
- 94. Reed LJ, Muench H. A simple method of estimating fifty per cent endpoints. Am J Epidemiol. 1938;27:493–497.
- 95. Manu U. WHO manual on animal influenza diagnosis and surveillance. 2002. Available from: https://apps.who.int/iris/bitstream/handle/10665/68026/WHO_CDS?sequence=1.
- 96. Du X, Dong L, Lan Y, Peng Y, Wu A, Zhang Y, et al. Mapping of H3N2 influenza antigenic evolution in China reveals a strategy for vaccine strain recommendation. Nat Commun. 2012;3:709. pmid:22426230
- 97. Salje H, Cummings DAT, Rodriguez-Barraquer I, Katzelnick LC, Lessler J, Klungthong C, et al. Reconstruction of antibody dynamics and infection histories to evaluate dengue risk. Nature. 2018;1. pmid:29795354
- 98. Plummer M, Best N, Cowles K, Vines K. CODA: Convergence Diagnosis and Output Analysis for MCMC. R News. 2006;6:7–11.
- 99. Frank AL, Taber LH. Variation in frequency of natural reinfection with influenza A viruses. J Med Virol. 1983;12:17–23. pmid:6619811
- 100. Camacho A, Ballesteros S, Graham AL, Carrat F, Ratmann O, Cazelles B. Explaining rapid reinfections in multiple-wave influenza outbreaks: Tristan da Cunha 1971 epidemic as a case study. Proc R Soc Lond B Biol Sci. 2011;278:3635–3643.
- 101.
Bjornstad ON. Spatial Covariance Functions [R package ncf version 1.2–8]. Available from: https://cran.r-project.org/web/packages/ncf/index.html.