Despite evidence for avian influenza A virus (AIV) transmission between wild and domestic ecosystems, the roles of bird migration and poultry trade in the spread of viruses remain enigmatic. In this study, we integrate ecosystem interactions into a phylogeographic model to assess the contribution of wild and domestic hosts to AIV distribution and persistence. Analysis of globally sampled AIV datasets shows frequent two-way transmission between wild and domestic ecosystems. In general, viral flow from domestic to wild bird populations was restricted to within a geographic region. In contrast, spillover from wild to domestic populations occurred both within and between regions. Wild birds mediated long-distance dispersal at intercontinental scales whereas viral spread among poultry populations was a major driver of regional spread. Viral spread between poultry flocks frequently originated from persistent lineages circulating in regions of intensive poultry production. Our analysis of long-term surveillance data demonstrates that meaningful insights can be inferred from integrating ecosystem into phylogeographic reconstructions that may be consequential for pandemic preparedness and livestock protection.
It is assumed that AIV outbreaks in poultry are introduced from wild birds. To test this, we incorporated ecosystem and location of isolation into a comparative genetic analysis. We show high rates of viral transmission from domestic to wild birds within a region and, that wild birds could transmit AIV to poultry between regions. However, the highest rates of viral flow between regions was among domestic populations, indicating poultry trade may play a major role in spreading viral populations. We demonstrate that interactions between migratory birds and animal productions systems contribute to the emergence of AIV. The assumption of unidirectional viral flow from wild birds to domestic poultry provides an incomplete picture of influenza ecology and may confound control efforts.
Citation: Bahl J, Pham TT, Hill NJ, Hussein ITM, Ma EJ, Easterday BC, et al. (2016) Ecosystem Interactions Underlie the Spread of Avian Influenza A Viruses with Pandemic Potential. PLoS Pathog 12(5): e1005620. https://doi.org/10.1371/journal.ppat.1005620
Editor: Neil M. Ferguson, Imperial College London, UNITED KINGDOM
Received: September 8, 2015; Accepted: April 18, 2016; Published: May 11, 2016
Copyright: © 2016 Bahl et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the paper and its Supporting Information files except the BEAST XML’s can be accessed at http://dx.doi.org/10.5061/dryad.601fd
Funding: This study was partially funded by the National Institutes of Health (NIH) Centers for Excellence in Influenza Research and Surveillance (CEIRS, contract # HHSN266200700010C and HHSN272201400008C for JAR and HHSN272201400006C for JB). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Intensive agriculture has allowed AIV circulating in wild bird populations and multi-host poultry systems (domestic food birds including chicken, duck, goose, pigeon) to interact, shaping the diversity of subtypes with pandemic potential . The recently emerged H7N9 viruses containing H9N2-origin internal genes highlight that co-circulation of subtypes concealed within poultry systems can enhance the pandemic threat of influenza . Such interactions are not unique. Over the last decade, H9N2 viruses have donated gene segments to several virus subtypes infecting poultry and humans, including the highly pathogenic avian influenza (HPAI) H5N1 panzootic that emerged in 2003 and persists to the present day [3–5]. Transmissions across the wild-domestic bird interface and genomic reshuffling within poultry have contributed to the emergence, spread and persistence of novel H5, H6, H7, and H9 AIV genotypes, which have caused human infection [4, 6, 7]. Despite the importance of viral transmission between natural and domestic systems, the role of these interactions in determining viral diversity and distribution has not been adequately studied.
AIV transmission between wild reservoirs and domestic animals takes place where natural and agricultural ecosystems overlap, a scenario that occurs worldwide. For example, transmission between wild and domestic birds led to H6 outbreaks in Californian poultry ; low pathogenic (LPAI) H5 viruses from wild birds in Italy were linked to poultry disease in Asia [7, 9]; and wild bird-origin H9 viruses circulating in Korea later emerged in domestic flocks . Most recently, H5 viruses detected in East Asia have spread to European and North American poultry, consistent with intercontinental migration of wild birds . Although often detected in domestic birds, viral communication between populations is not one-way. In 2005, HPAI H5N1 jumped from domestic birds to infect bar-headed geese (Anser indicus) at Qinghai Lake, China (12). This triggered large-scale wildlife die-offs, contributed to the global spread of HPAI H5N1 virus and placed considerable burdens on government resources to mitigate global spread [12, 13].
H9 viruses are endemic in terrestrial poultry in China, the Middle East, and Europe and occur globally in a diversity of wild bird taxa . Although H9 in poultry is not a notifiable disease, it is listed by the World Health Organization as a candidate for the next global pandemic along with H5 and H7 . Gene segment exchange with H9 has facilitated generation of novel reassortants, including the 1997 H5N1 strain that caused human infections and fatalities in Hong Kong . Furthermore, periodic human infections with H9 subtype viruses have led to intensive epidemiological surveillance of wild and domestic birds to identify the infection source [3, 6, 16, 17]. These events have prompted regular collection and sequencing of H9 subtype viruses providing a globally sampled genetic dataset where the contributions of wild and domestic animals to the global spread can be modelled using phylogeographic approaches.
Despite an expanding AIV host range, the role of ecosystem interactions (i.e. transmission between wild and domestic animals) in generating and spreading novel influenza viruses over large distances remain poorly characterized. In this study we integrate the geographical and ecological context of transmission into a statistical phylogenetic model. Using H9, H3 and H6 sequence data, including newly acquired sequences from North American wild birds, we characterize the contribution of wild and domestic birds to the global distribution of AIV and estimate the role of inter-ecosystem dynamics on viral diversity and persistence. We hypothesize that viruses circulating among wild migratory birds may connect a global network of influenza outbreaks that occur locally in domestic birds. To test this, we incorporated ecosystem and geographic location of isolation into a comparative genetic analysis of H9 hemagglutinin (HA) sequence data (see S1 Text and S1 Table). H9 subtype viruses are commonly isolated from domestic poultry and wild birds [3, 7, 18–24]. Periodic human infections have resulted in intensive surveillance of domestic birds in order to identify the sources of infection [16, 17]. As a result, a robust dataset of H9 subtype viruses is available from both domestic and wild birds throughout the world. Since poultry and wild birds infected with H9 viruses do not often manifest major disease symptoms, vaccination or active control efforts are limited for this subtype. We utilize the H9 virus surveillance and sequence reporting, enhanced with novel sequences acquired from North American wild bird surveillance collected between 1974 and 2013. We build on previous phylogeographic methods  to integrate wild and domestic ecosystems in order to 1) estimate an asymmetric spatial transition matrix; 2) model viral spread among wild or domestic populations; and 3) assess the relative risk of virus emergence from discrete locations and ecosystems. We extend our analysis to other avian influenza A virus subtypes (H3 and H6) in order to assess if inferences made from comparative genetic analysis of H9 gene sequences could be generalized.
Poultry production, live animal trade and disease surveillance
Poultry production density and disease surveillance effort vary considerably between countries. We therefore incorporated this information in defining our discrete geographic units for our migration model. Mapping all available poultry and wild bird virus isolates allows for a qualitative assessment showing that global sampling sites correspond well with the intensity and distribution of poultry production systems, particularly in China and the Middle East (Fig 1A–1C). Samples from wild and domestic avian hosts spatially overlap. It is possible that surveillance in wild birds was in response to AIV detection in domestic birds. There are few long-term virological surveillance programs and most detection was based on opportunistic sampling [2,4,6–8,10,11]. As a result, there are many more isolates from domestic birds than wild birds, despite the importance of these hosts in the ecology, evolution and emergence of AIV . Sampled sites similarly overlap with areas of intense duck production, although domestic duck rearing is far less common than chickens worldwide (S1 Fig). While we assume that the virus behaves the same in all poultry hosts, experimental data indicates this may be a poor assumption . Unfortunately, limited information on virus prevalence or epidemiology in various domestic host species between countries makes it difficult to treat individual species separately, thereby necessitating the grouping used here.
(A) Map showing global chicken density (millions of chickens/km2) (B) Reported trade intensity of domestic poultry 1995 to 2011 between regions (C) Locations of available viral sequences within discrete regions defined by sampling effort, poultry production and sequence data.
We assume that if AIV is transmitted between domestic populations it is likely correlated with live poultry trade between countries (Fig 1B). We use this framework to interpret our phylogeographic reconstruction results where, we believe that the movement of viruses between discrete states is limited to the movement of live animals, whether through trade or wild animal migration. It is possible that transmission between regions could be linked to trade of animal product or other cryptic means, but for this study we have not considered other mechanisms of viral spread. According to official trade data available from the Food and Agriculture Organization of the United Nations (UN-FAO), intercontinental movement of live poultry primarily involves exports from Europe and North America into the eastern hemisphere: Middle East, China and Southeast Asia (Fig 1B). Regional flow of animals throughout Europe showed that few countries (i.e. Germany, Netherlands) were the main importers of live animals from other European nations and were also exporters to the Middle East and Southeast Asia. The majority of poultry production in China is for domestic consumption, although trade with Southeast and East Asia (Japan/South Korea) accounted for the bulk of live animal exports (Fig 1B). After the emergence and spread of HPAI H5N1, there was a marked decline in poultry exports with the majority of official trade restricted to supplying Hong Kong and Macau. Detailed trade flow and animal population data led us to subdivide China into 3 distinct regions (East, Central and West China) based on production and consumption, and to combine Japan and South Korea into a discrete isolated (particularly after 2004) geo-region relative to China (Fig 1B and 1C). Based on the distribution of available AIV sequences from poultry and global patterns of live poultry trade, we therefore identified 9 discrete regions for which viral migration patterns could be estimated (Fig 1C, S2 Fig and S2 Table).
Global distribution of wild bird isolates
Wild bird surveillance is often conducted to identify potential virus sources following outbreaks in poultry . Consequently, large overlap between wild bird and poultry surveillance sites exists (Fig 1C and S2 Table). Notable exceptions include long-term influenza surveillance studies of migratory birds in North America (Delaware Bay, Alberta) and Europe (Ottenby, Sweden) . With the exception of waterfowl, most other wild bird species are opportunistically sampled rather than targeted . Sampling inconsistencies across different species mean that sample sizes of virus isolates are not appropriate to infer rates of interspecies transmission among wild bird taxa (i.e. mallard, Anas platyrhynchos to northern pintail, A. acuta). Nevertheless, the data from wild birds was well suited for estimating viral flow between wild and domestic systems. In our analysis we assumed that the virus behaves similarly in all wild species and classified isolates as either ‘wild’ or ‘domestic’ to estimate viral transmission rate between populations.
Phylogenetic reconstruction and source/sink dynamics
Phylogenetic reconstruction of the H9-HA gene showed a mixture of North American and Eurasian wild bird isolates and two recently diverged poultry lineages (G1 and Ck/Bei lineages; Fig 2A and S3 Fig), consistent with previous studies [3, 29–31]. Our analysis demonstrated that the H9 lineage was younger and less geographically structured than other HA subtypes [8, 32, 33]. The estimated tree root age from sampled H9 strains revealed an origin between 1964 and 1975, suggesting a recent selective sweep removed genetic signals of long-term geographic structuring from the population. Periodic sweeps have been implicated in lasting changes in the viral population of multiple subtypes [33, 34].
(A) H9-HA phylogenetic tree for global isolates where branches are coloured according to discrete geographic region and thick and thin lines indicate ancestral transitions between natural and agricultural ecosystems respectively (B) Graph showing the proportional Markov jump counts between ecosystems over time (C) Heat maps showing mean H9 migrations estimated to and from East China per year. Heat maps for other regions are shown in S4 Fig.
We estimated the ancestral locations of isolates collected from the 9 discrete geographic regions defined above (Fig 2A) and stratified the observations as viruses collected from either wild or domestic birds. We found frequent viral flow among regions and identified East/Central China (Ck/Bei lineage) and the Middle East (G1 lineage) as the primary sources for H9 virus spread among poultry (Fig 2B and Table 1). We determined the number of transitions emerging from each ecosystem state from 1998 through 2013, and show that the viral population was primarily emerging from domestic populations with most transitions into other domestic populations (Fig 2B). Approximately 90% of all transitions emerged from the domestic populations. The mean waiting time in each state as a proportion of the total phylogenetic tree time show that the populations was primarily circulating among poultry located in East/Central China and the Middle East (Table 1). Date estimates suggested these two regional populations diverged around 1988 (95% Bayesian Credible Interval: BCI 1985–1990), coinciding with industrialization of poultry production in Asia . Our results also suggest that both CK/Bei and G1 lineages emerged from independent wild bird introductions. Over the last decade these regions have acted as two distinct and persistent gene pools (Fig 2B and Table 1), reflecting the establishment of East Asia and the Middle East as independent centers of poultry production .
In our analysis we used a non-reversible model such that the direction of migration could be inferred to assess whether a region acted as a source or a sink from 1998 through 2013 [25, 36, 37]. For each location, we averaged the number of state changes observed/year for trees sampled from the posterior distribution. Even though East China was a persistent source population, the transmission was primarily limited to within China. East China primarily receives viruses from Central China and vice versa (Fig 2C and S4 Fig). Prior to 2003 there was some exchange with Japan. The rapid disruption in viral flow between these regions corresponded with restrictions on live poultry exports after the emergence and regional spread of highly pathogenic avian influenza H5N1 . The Middle East region was primarily a sink for G1 lineage viruses originating in South Asia and Europe (S4 Fig). Transmission from the Middle East to South Asia and Europe, although rare, was evident in other lineages (S4 Fig).
Our results showed enhanced risk of viral emergence from East China, whereas Western China and Southeast Asia represent regions with enhanced risk of receiving H9 viruses (Table 2). By evaluating the relative risk for regional source/sink transmission from domestic or wild birds, we see enhanced risk for transmission originating from wild birds in Japan/South Korea and Europe. In contrast, wild birds from the Middle East, South Asia, East and Central China were more likely to receive viruses. There was enhanced risk for domestic birds in East China, South Asia and the Middle East to be a source for transmission of H9 viruses. Domestic birds in Japan/S Korea, Western China, Southeast Asia and Europe were more likely to be sinks (Table 2).
Migration patterns and ecosystem interactions
Domestic-to-wild virus communications were restricted to within regions, whereas wild-to-domestic transmissions occurred both within and between regions (Table 3). Two-way viral flow between ecosystems occurred within China and the Middle East, emphasizing the importance of these regions for ecosystem interactions (Table 3). Domestic-to-domestic transmission was between adjacent regions with no significant long distance migrations (Fig 3A). In contrast, supported wild-to-wild transmissions occurred over long and short distances (Fig 3A). Our results suggest that poultry systems have provided a persistent source for regional H9 spread, whereas wild bird-mediated dispersal provided a mechanism for both intercontinental (Japan/S Korea-North America and Japan/S Korea-Europe) and regional spread (East Asia-Southeast Asia) along known waterbird migratory routes [38, 39] (Fig 3A). Despite strong statistical support for these migration events, the importance of Japan and South Korea for long distance wild bird carriage of viruses is difficult to assess due to sparseness of wild bird sampling in this region (Fig 2A). We also found strong support for wild-bird mediated viral migration between North America and the Middle East (Table 3). In both locations, viruses were isolated from gulls and shorebirds. In our reconstruction there was more than 10 years of under-sampled diversity between viruses in gulls from the Republic of Georgia and the putative North American ancestor. It is unlikely that this was a direct transmission between populations. The most parsimonious explanation is that this virus transmitted among gulls during this period, implying that persistence in wild bird populations may contribute to the long-distance spread of H9 viruses.
Statistically supported interactions are shown in bold. Italicized text indicates interactions between ecosystems.
(A) Map showing statistically supported transitions between geographic regions by ecosystem. Line thickness corresponds to viral flow rates shown in Table 2 (thinnest<0.5; ≥0.5<1; ≥1<2;≥ 2 thickest). (B) Density distribution of statistically supported mean transition rates between ecosystems. *Domestic-to-domestic rates are significantly faster than domestic-to-wild (BF>100), wild-to-domestic (BF = 62.3), and wild-to-wild (BF = 39.4). (C) Statistically supported mean migration rates per MCMC step of wild-to-wild avian transitions versus domestic-to-domestic avian transitions.
Integrating ecosystem and region into our migration model provided the most comprehensive description of global H9 distribution and diversity. Domestic-to-domestic transitions among regions, especially between East China and Central China, were significantly faster than all other estimated migration patterns, suggesting poultry trade was likely responsible for spreading H9 subtype virus (Fig 3B and 3C). This trend was also reflected at the global scale, whereby migration rates among domestic birds was significantly higher than among wild birds and between ecosystems (wild-to-domestic or domestic-to-wild) (Fig 3).
We chose to extend our model with analysis of AIV subtypes H3 and H6 HA gene sequences. Similar diffusion patterns (i.e. ecosystem transition patterns or rates) between subtypes may suggest similar underlying processes. Analysis of H3 subtype HA gene sequences showed similar results to those described above for H9 (S5 Fig, S6 Fig and S3 Table). In contrast, the role of virus ecosystem interactions in determining viral distribution of H6 subtype viruses was less clear, even though similar transmission patterns were observed (S7 Fig, S4 Table). Despite evidence for two-way transmission of H6 viruses between wild or domestic populations (S8 Fig and S4 Table), our analysis shows no support for either ecosystem playing a larger role in the distribution of these viruses (see S1 Text).
Our model for ancestral state reconstruction was restricted to locations and ecosystems where viruses were observed, which results in an inherent bias in the reconstruction of migration or ecosystem transitions. For example, in our analysis of H9 subtype virus, wild bird isolates were not observed from western China and therefore, this state was not represented in our ancestral state reconstruction. While it is possible that the virus population did spend time circulating among wild birds in western China, the lack of contemporary observations imposes limitations on the ancestral state reconstruction approach taken in this study. Similarly, North American H9 isolates from domestic birds were not present in our reconstruction of migration patterns and ecosystem interactions. To investigate if our ancestral reconstruction was sensitive to sampling bias, we randomized the location assignments at the tips throughout the MCMC procedure to determine if the posterior transition rate estimate and root state probability converged on the expected prior under the sampling scenario we used, as well as randomly generated alternatives . For each subtype analyzed the posterior empirical frequency converged to the prior root state probability for all sampling scenarios (S9 Fig). In addition, the ecosystem transition rate estimates converged on the prior expectation where all rates were approximately equal (S10, S11 and S12 Figs). Despite uneven sampling of domestic and wild populations these analyses suggest that signals of ecosystem/spatial structure in the data inform our estimates and were not biased by the sampling scheme.
Our findings highlight that transmission among domestic flocks drove the majority of H9 dispersal to adjacent regions. Generally, wild bird migrations provide opportunities for the widespread movement of viruses with pandemic potential, but this did not play a greater role in viral spread than transmission among poultry populations, despite frequent transmission between wild and domestic birds. The most significant contribution from wild birds to emerging AIV ecology is the redistribution of gene segments over large distances, thereby increasing biodiversity and creating opportunities for novel variants to emerge. Failure to control H9 outbreaks in domestic populations can therefore contribute to the emergence and spread of new influenza variants . Our results suggest that viral flow between wild and domestic systems contributes to the persistence, and spread of AIV.
The continued circulation of AIV in domestic populations poses a public health risk . In this study, we capitalized on the robust surveillance of H9 viruses among avian populations to elucidate the role of viral transmission between wild and domestic ecosystems in the global spread and persistence of AIV. Parallels exist between the mechanisms of H9N2 global dispersal uncovered here and other subtypes. Most notably, transmissions of HPAI H5N1, and recently HPAI H5N8, have been perpetuated and spread by both wild bird migration and domestic poultry trade [13, 40, 41]. The complex interactions characterizing these systems have contributed to the genetic diversity and widespread diffusion of these viruses throughout Asia, Africa, Europe, and most recently, North America [4, 11].
Similar models may be extended to understand the mechanisms of spread and emergence of other influenza subtypes, although the paucity of surveillance data may be a limiting factor. While analysis of H3 subtype virus HA gene sequences supported our findings that transitions among domestic populations may be driving the spread of influenza viruses, our analysis of H6 viruses was less clear. H6 subtype viruses were prevalent in domestic ducks in southern China . While 75% of the global duck production (including reared wild ducks) occurs in China  production is primarily for domestic consumption, with Hong Kong as the largest importer of duck meat . Even though the large-scale transmission patterns were similar across subtypes, the binning of discrete geographic states used in our analysis of H6 could not capture the majority of domestic trade within China. It is likely that alternative sampling strategies are necessary to investigate the role of ecosystems interactions and poultry trade in maintaining H6 subtype populations.
Production systems that promote the two-way transmission of viruses between wild and domestic avian hosts facilitate the generation of potentially pandemic AIV and may lead to widespread outbreaks that are difficult to contain. Surveillance programs focused on detecting highly pathogenic subtypes in symptomatic poultry falls short of identifying the mechanisms of emergence, spread and genomic reshuffling. Active systematic surveillance for AIV in both wild and domestic populations allow for the continued development of models needed to test the role of various species or populations in viral persistence. A limitation of this study is that reconstruction of viral movement patterns was limited to transmission among the populations surveyed. It is likely that unsampled populations play a role in the spread and persistence of AIV. Systematic surveillance programs are critical to assess the risk of disease emergence and spread of AIV by wild migratory birds. Analysis of long-term surveillance data enables meaningful insights necessary to develop appropriately informed predictive models. Inferences from such models are consequential for pandemic preparedness and livestock protection.
Materials and Methods
Poultry trade relationships
Using available census data for chicken and ducks we produced heat maps showing poultry production intensity. Mapping all available H9-HA sequence data on top of the heat maps allows us to visually assess the distribution of sampling and production regions. This assessment was used to determine appropriate geographic regions to incorporate into our model and identify regions with few samples. Data on the international trade of live poultry from 1995–2011 (the most current year of data available) was downloaded from the United Nations, Food & Agriculture Organization (UN-FAO: faostat3.fao.org, accessed June 11, 2015). The quantity of chickens and ducks traded was chosen as the metric to assess trade relationships. All years were summed to generate long-term estimates of international trade and countries were aggregated into 7 regions: Japan/South Korea, China, South East Asia, South Asia, Middle East, Europe and North America, broadly consistent with georegions used for the phylogeographic model. The quantity of exports and imports was compared for each region and only the maximum trade quantity linking two regions was recorded due to inconsistencies in data reported.
Distribution of isolates and dataset design
All available H9 influenza A HA gene sequences were downloaded from the Influenza Virus Resource database (http://www.ncbi.nlm.nih.gov/genomes/FLU/FLU.html) on March 30, 2014. Accession numbers of newly sequenced viruses are presented in S1 Table. The data analyzed, including isolation dates, latitude, longitude and accession numbers are presented in the attached S1 Dataset. Sequences included in the dataset were subject to the following criteria: a) sequences had known location, host, and isolation date; b) for sequences with the same location, date of isolation and 100% similarity a single representative was retained; c) vaccine, derivative, and recombinant sequences were excluded; and d) sequences less than 1000 nucleotides in length were excluded. Locations with fewer than 10 taxa, as well as taxa collected prior to 1970 were excluded.
The remaining taxa were coded by both geographic region and ecosystem (‘wild’ or ‘domestic’). The wild classification included migratory birds (i.e. Anseriformes, Charadriiformes, etc.). The agricultural ecosystem included domestic birds raised for consumption (Galliformes including chicken, quail and pheasant; and Anseriformes including domestic duck and goose). See S1 Text for detailed descriptions of data stratification and subsampling. The final dataset consisted of 955 taxa, which were coded into 9 geographic regions: Japan/South Korea (n = 116), China–East (n = 147), China–Central (n = 179), China–West (n = 94), Southeast Asia (n = 18), South Asia (n = 93), Middle East (n = 210), Europe (n = 36), and North America (n = 62). 178 taxa were isolated from wild birds and the remaining from domesticated poultry. S2 Table presents detailed stratification of the dataset by region and ecosystem and S2 Fig shows the H9-HA sequence/location/year before and after subsampling.
Two additional datasets were assembled to investigate if model inferences from analysis of the H9 dataset could be generalized to other influenza A virus subtypes. Low pathogenic avian influenza A H3 and H6 subtype viruses have been sampled from both ecosystems and were chosen as comparison datasets. Even though AIV has a global distribution, surveillance and reporting is inconsistent and data availability for both wild and domestic birds can be limited. The AIV H3 and H6 subtype spatial distribution of wild and domestic birds sampled were similar to those of H9, but not identical. For the H3 subtype, sequence data was available from North Asia, including Russia and Mongolia, but none were available from western China, South Asia or the Middle East. For the H6 subtype no sequence data was available from South Asia. Limiting our ancestral state reconstruction to location states that overlapped between datasets would result in the exclusion of substantial data. Variation in data availability, locations sampled, and dataset design is discussed in the S1 Text. All available H3 and H6 subtype HA gene sequence data was downloaded from the Influenza Virus Resource database and screened based on the criteria described above (S2 Table).
Bayesian phylogenetic and coalescent analysis
For each of the gene segments analyzed, Bayesian phylogenetic trees were estimated using BEAST v.1.8  with an uncorrelated lognormal relaxed molecular clock  that allows for rate variation across lineages. The general time-reversible model of nucleotide substitution (+ gamma + invariant sites) was used along with a Bayesian skyline coalescent tree prior. A minimum of three independent runs of 150 million generations were performed and combined after removal of burn-in to achieve an Effective Sample Size of >200 as diagnosed in Tracer v1.6.
Ancestral state reconstruction of host ecology and location of isolation
Phylogenies record the history of viral exchange between ecosystems and the gene flow between spatially sampled populations [4, 6, 7, 32, 45–48]. By integrating both ecosystem and geography into the phylogenetic model, we can estimate the relative contribution of each to the global distribution and diversity of viruses in circulation. We used a non-reversible continuous-time Markov chain model to estimate the migration rates between geographical regions and the general patterns of H9 virus circulation in different avian populations . Here we estimate the network linking the discrete wild and domestic populations distributed across regions. By defining the geographically and ecologically discrete characters in our model we were able to distinguish whether inter-regional migration was between domestic populations or wild animals. In addition, we estimated the rate of viral transmission between wild and domestic flocks and where the ecosystem interface was porous. A limitation of this approach is that realistic measurements of bird density and disease prevalence were not accounted for within our model.
A Bayesian stochastic search variable selection (BSSVS) was employed to reduce the number of parameters to those with significantly non-zero transition rates . The BSSVS explores and efficiently reduces the state space by employing a binary indicator (I). A Bayes factor (BF) can be computed to assess the support for individual transitions between discrete states. We identify a transition as important when P(I = 1|data) >0.5. This analysis was conducted with a Poisson prior on the number of non-zero rates with a mean equal to the minimum number of rates required to connect the discrete ecological/geographical region states. We applied this to our analysis of the H9 dataset and determined the critical BF >14 (Poisson prior mean = 15). The same criterion was applied to our analysis of H3 (Poisson prior mean = 11) and H6 (Poisson prior mean = 14) datasets and determined a critical Bayes factor >10 and > 12 respectively. Strength of statistical supports were interpreted as follows; 10≤ BF <30 indicating strong support, 30≤ BF <100 indicates very strong support and BF >100 indicating decisive support [8, 25, 32].
We assessed statistical support of rate differences (wild > domestic and domestic < wild) by computing Bayes factors. The Bayes factors for differences in migration rates (r) were estimated by the ratio of posterior odds (P(r1 > r2 | Data)/P(r2 > r1 | Data)) versus prior odds P(r1 > r2)/P(r2 > r1), where the prior odds ratio was approximately 1 [8, 32].
Model sensitivity to sampling bias
Domestic populations were sampled much more intensively than wild populations (S1 Text). To investigate if our reconstruction was sensitive to data heterogeneity, we consider the prior expectation for the root discrete state frequencies and mean ecosystem transition rates for the sampling scheme used. If the discrete state distribution at the root is correlated with the location frequencies at the tips, we can expect that ancestral reconstruction throughout the entire phylogeny will be influenced by this tip-location sampling frequency. We randomized the location assignments at the tips throughout the MCMC procedure to investigate this possibility . Similarly, if uneven sampling influences ecosystem transition rates then we can expect the mean rates to deviate from the prior. We further tested the model sensitivity to alternative sampling procedures where the number of sequences sampled was randomly pruned from a maximum of 955 sequences to a minimum of 73 sequences. Analysis of each sampling scheme was repeated three times. This sensitivity analysis was carried out for the final H3 and H6 subtype datasets. Our results quickly converged on the prior root location probability for all subtypes (equal state frequencies; S5 Fig) and ecosystem transition rate probability (equal mean rates; S6 Fig) indicating that the sampling frequencies had little impact on the model inferences a posteriori.
Source/sink dynamics and relative risk of viral emergence
To assess the contribution of each region as a viral source or sink in the migration network, state jumps at the tree nodes, representing a state transition event (i.e. migration or ecosystem interaction) were counted . We used a non-reversible model and therefore the direction of gene flow between states can be determined to assess which region was either source or sink. Heat maps representing the average number of jumps per year estimated from the last 1000 posterior sampled trees were generated. Due to the increase in surveillance efforts post-1997’s Hong Kong H5N1 outbreak and recent intensification in poultry production, our analysis only considered migration events between 1998 and 2013.
Even though the explicit migration events are not observed, the waiting times between state changes can be tracked on the phylogeny. The duration that a particular state is observed before transitioning to another state (Markov reward) was recorded on a branch-by-branch basis from the posterior sampling of phylogenetic trees . These Markov rewards were calculated for each tree and ecosystem/location state in our model.
We further assessed the relative risk of each location as a viral source or sink population using 2x2 contingency tables . The total jump counts in to or out of each discrete state were obtained for each step of the Bayesian MCMC. Contingency tables containing four cells (A, B, C, and D) were populated for each combination of regions. For example, to calculate the relative proportion of times viruses emerged from Japan and migrated to North America, the columns of the contingency table denote events in which Japan was the geographic source (left column) and events in which Japan was not the source (right column). Likewise, the rows of the contingency table can be denoted as events in which North America was the geographic sink of a transition (top row) and events in which was not the sink (bottom row). Therefore, cell A in this example includes the total number of estimated events in which viruses from Japan were introduced into North America; cell B includes the total introductions from other regions (not Japan) into North America; cell C includes the total number of introductions from Japan into other regions (not North America), and cell D includes the total number of mutually exclusive events in which Japan was not the geographic source and North America was not the geographic sink. The relative proportion of introductions from Japan into North America will be calculated as [A/(A+B)]/[C/(C+D)]. The proportion of total times where North America was the sink when Japan was the source was represented by [A/(A+B)]. Likewise, [C/(C+D)] represents the proportion of total times when viruses entered other regions (not North America) where Japan was the source. The ratio of these proportions represent the relative risk of Japan as the source when North America was the sink compared to when other regions were the geographic sinks. These ratios were calculated per MCMC step and averaged across all steps for each in order to incorporate phylogenetic uncertainty.
All studies involving the collection of samples from wild and domestic animal species are conducted in compliance with the policies of the National Institutes of Health and the Animal Welfare Act, and with the approval of the St. Jude Children's Research Hospital Institutional Animal Care and Use Committee (Protocol Number 546-100324-10/14, approved July 20, 2015) and Massachusetts Institute for Technology (Protocol Number 0515-046-18, approved May 3, 2015).
S1 Text. Dataset descriptions and model extensions.
This file contains details of the sampling procedures used and descriptions of the final datasets presented in this manuscript.
S1 Table. Accession numbers of newly sequenced viruses.
S2 Table. Dataset summary before and after subsampling.
S3 Table. Mean migration rates of H3 subtype viruses between avian ecosystems across sampling locations.
Statistically supported interactions are shown in bold.
S4 Table. Mean migration rates of H6 subtype viruses between avian ecosystems across sampling locations.
Statistically supported interactions are shown in bold.
S1 Dataset. Ecotype, Latitude and Longitude of globally distributed samples
S1 Fig. Map showing global duck production density.
(A) Histogram of H9 avian influenza isolates included in the full dataset per year by regions. (B) Histogram of H9 avian influenza isolates following subsampling included in the final dataset per year by regions.
S3 Fig. Bayesian relaxed clock phylogenetic MCC tree of global H9 HA gene sequences with taxon names.
Purple bars on nodes indicate 95% Bayesian credibility intervals of divergence time estimates.
S4 Fig. Heat map showing source-sink dynamics/location/year.
S5 Fig. Inferred migration rates and patterns for H3 subtype viruses sampled from wild and domestic animals.
(A) Map showing statistically supported transitions between geographic regions by ecosystem. Line thickness corresponds to viral flow rates shown in S3 Table (thinnest <0.5; 0.5 to <1; 1 to <2; ≥2 thickest). (B) Density distribution of statistically supported mean transition rates between ecosystems. (C) Statistically supported mean migration rates per MCMC step of wild-to-wild avian transitions versus domestic-to-domestic avian transitions.
S6 Fig. Bayesian relaxed clock phylogenetic MCC tree of global H3 HA gene sequences with taxon names.
Purple bars on nodes indicate 95% Bayesian credibility intervals of divergence time estimates.
S7 Fig. Inferred migration rates and patterns for H6 subtype viruses sampled from wild and domestic animals.
(A) Map showing statistically supported transitions between geographic regions by ecosystem. Line thickness corresponds to viral flow rates shown in S4 Table (thinnest <0.5; 0.5 to <1; 1 to <2; ≥2 thickest). (B) Density distribution of statistically supported mean transition rates between ecosystems. (C) Statistically supported mean migration rates per MCMC step of wild-to-wild avian transitions versus domestic-to-domestic avian transitions.
S8 Fig. Bayesian relaxed clock phylogenetic MCC tree of global H6 HA gene sequences with taxon names.
Purple bars on nodes indicate 95% Bayesian credibility intervals of divergence time estimates.
S9 Fig. Root state probabilities as a function of number of taxa sampled from a particular state estimated during tip-state randomization procedure.
The final dataset is shown in green and alternative state sampling procedures indicated in blue. Grey line indicates the prior expectation for the root location probability. Shaded area indicates empirical posterior probability under conditions of over-parametization (i.e. more parameters estimated than data points observed).
S10 Fig. Mean state transition rates within and between ecosystems using tip-state randomization for the full dataset and alternative state sampling procedures (A-G).
The number of sequenced used in each analysis were A) n = 73; B) n = 143; C) n = 260; D) n = 424; E) n = 564; F) n = 686; G) n = 791.
S11 Fig. A) Mean state transition rates within and between ecosystems and B) root state probability using tip-state randomization for the H3 dataset.
S12 Fig. A) Mean state transition rates within and between ecosystems and B) root state probability using tip-state randomization for the H6 dataset.
ED Breckenridge and AR Heri are thanked for critical comments during the drafting of this manuscript. The data for this manuscript was generated while DEW was employed at JCVI. The opinions expressed in this article are the authors’ own and do not reflect the views of the Centers for Disease Control, the Department of Health and Human Services, or the United States government.
Conceived and designed the experiments: JB TTP NJH ITMH JAR. Performed the experiments: JB TTP NJH ITMH. Analyzed the data: JB TTP NJH ITMH EJM GK RGW RJW MDS GJDS JAR. Contributed reagents/materials/analysis tools: BCE RAH TBS DEW SK SSC. Wrote the paper: JB TTP NJH ITMH JAR. Designed and supervised the study and revised the manuscript: JB JAR.
- 1. Perry BD, Grace D, Sones K. Current drivers and future directions of global livestock disease dynamics. Proc Natl Acad Sci USA. 2013;110(52):20871–20877. pmid:21576468
- 2. Gao R, Cao B, Hu Y, Feng Z, Wang D, Hu W, et al. Human infection with a novel avian-origin influenza A (H7N9) virus. N Engl J Med. 2013;368(20):1888–1897. pmid:23577628
- 3. Guan Y, Shortridge KF, Krauss S, Webster RG. Molecular characterization of H9N2 influenza viruses: were they the donors of the "internal" genes of H5N1 viruses in Hong Kong? Proc Natl Acad Sci USA. 1999;96(16):9363–9367. pmid:10430948
- 4. Vijaykrishna D, Bahl J, Riley S, Duan L, Zhang JX, Chen H, et al. Evolutionary dynamics and emergence of panzootic H5N1 influenza viruses. PLoS Pathog. 2008;4(9):e1000161. pmid:18818732
- 5. Duan L, Bahl J, Smith GJ, Wang J, Vijaykrishna D, Zhang LJ, et al. The development and genetic diversity of H5N1 influenza virus in China, 1996–2006. Virology. 2008;380(2):243–254. pmid:18774155
- 6. Lam TT, Wang J, Shen Y, Zhou B, Duan L, Cheung CL, et al. The genesis and source of the H7N9 influenza viruses causing human infections in China. Nature. 2013;502(7470):241–244. pmid:23965623
- 7. Duan L, Campitelli L, Fan XH, Leung YH, Vijaykrishna D, Zhang JX, et al. Characterization of low-pathogenic H5 subtype influenza viruses from Eurasia: implications for the origin of highly pathogenic H5N1 viruses. J Virol. 2007;81(14):7529–7539. pmid:17507485
- 8. Bahl J, Vijaykrishna D, Holmes EC, Smith GJ, Guan Y. Gene flow and competitive exclusion of avian influenza A virus in natural reservoir hosts. Virology. 2009;390(2):289–297. pmid:19501380
- 9. Capua I, Alexander DJ. Avian influenza and human health. Acta Trop. 2002;83(1):1–6. pmid:12062786
- 10. Moon HJ, Song MS, Cruz DJ, Park KJ, Pascua PN, Lee JH, et al. Active reassortment of H9 influenza viruses between wild birds and live-poultry markets in Korea. Arch Virol. 2010;155(2):229–241. pmid:20033463
- 11. Verhagen JH, Herfst S, Fouchier RAM. How a virus travels the world. Science. 2015;347(6222):616–617. pmid:25657235
- 12. Chen H, Smith GJ, Li KS, Wang J, Fan XH, Rayner JM, et al. Establishment of multiple sublineages of H5N1 influenza virus in Asia: implications for pandemic control. Proc Natl Acad Sci USA. 2006;103(8):2845–2850. pmid:16473931
- 13. Chen H, Smith GJ, Zhang SY, Qin K, Wang J, Li KS, et al. Avian flu: H5N1 virus outbreak in migratory waterfowl. Nature. 2005;436(7048):191–192. pmid:16007072
- 14. Fusaro A, Monne I, Salviato A, Valastro V, Schivo A, Amarin NM, et al. Phylogeography and evolutionary history of reassortant H9N2 viruses with potential human health implications. J Virol. 2011;85(16):8413–8421. pmid:21680519
- 15. W.H.O. Avian influenza: assessing the pandemic threat. http://apps.who.int/iris/handle/10665/68985. World Health Organization Communicable Diseases Cluster; 2005.
- 16. Lin YP, Shaw M, Gregory V, Cameron K, Lim W, Klimov A, et al. Avian-to-human transmission of H9N2 subtype influenza A viruses: relationship between H9N2 and H5N1 human isolates. Proc Natl Acad Sci USA. 2000;97(17):9654–9658. pmid:10920197
- 17. Butt KM, Smith GJ, Chen H, Zhang LJ, Leung YH, Xu KM, et al. Human infection with an avian H9N2 influenza A virus in Hong Kong in 2003. J Clin Microbiol. 2005;43(11):5760–5767. pmid:16272514
- 18. Baranovich T. Assessing the fitness of distinct clades of influenza A (H9N2) viruses. Emerg Microbes Infect. 2013;2:e75. pmid:26038443
- 19. Qi W, Zhou X, Shi W, Huang L, Xia W, Liu D, et al. Genesis of the novel human-infecting influenza A(H10N8) virus and potential genetic diversity of the virus in poultry, China. Euro Surveill. 2014;19(25).
- 20. Li X, Shi J, Guo J, Deng G, Zhang Q, Wang J, et al. Genetics, receptor binding property, and transmissibility in mammals of naturally isolated H9N2 Avian Influenza viruses. PLoS Pathog. 2014;10(11):e1004508. pmid:25411973
- 21. Kandeil A, El-Shesheny R, Maatouq AM, Moatasim Y, Shehata MM, Bagato O, et al. Genetic and antigenic evolution of H9N2 avian influenza viruses circulating in Egypt between 2011 and 2013. Arch Virol. 2014;159(11):2861–2876. pmid:24990416
- 22. Xu KM, Smith GJ, Bahl J, Duan L, Tai H, Vijaykrishna D, et al. The genesis and evolution of H9N2 influenza viruses in poultry from southern China, 2000 to 2005. J Virol. 2007;81(19):10389–10401. pmid:17652402
- 23. Cheung CL, Vijaykrishna D, Smith GJ, Fan XH, Zhang JX, Bahl J, et al. Establishment of influenza A virus (H6N1) in minor poultry species in southern China. J Virol. 2007;81(19):10402–10412. pmid:17652385
- 24. Huang K, Zhu H, Fan X, Wang J, Cheung CL, Duan L, et al. Establishment and lineage replacement of H6 influenza viruses in domestic ducks in southern China. J Virol. 2012;86(11):6075–6083. pmid:22438558
- 25. Lemey P, Rambaut A, Drummond AJ, Suchard MA. Bayesian phylogeography finds its roots. PLoS Comput Biol. 2009;5(9):e1000520. pmid:19779555
- 26. Peiris JSM, Poon LLM, Guan Y. Surveillance of animal influenza for pandemic preparedness. Science. 2012;335(6073):1173–1174. pmid:22345402
- 27. Munster VJ, Baas C, Lexmond P, Waldenstrom J, Wallensten A, Fransson T, et al. Spatial, temporal, and species variation in prevalence of influenza A viruses in wild migratory birds. PLoS Pathog. 2007;3(5):e61. pmid:17500589
- 28. Hoye BJ, Munster VJ, Nishiura H, Klaassen M, Fouchier RA. Surveillance of wild birds for avian influenza virus. Emerg Infect Dis. 2010;16(12):1827–1834. pmid:21122209
- 29. Xu KM, Li KS, Smith GJ, Li JW, Tai H, Zhang JX, et al. Evolution and molecular epidemiology of H9N2 influenza A viruses from quail in southern China, 2000 to 2005. J Virol. 2007;81(6):2635–2645. pmid:17192315
- 30. Dong G, Luo J, Zhang H, Wang C, Duan M, Deliberto TJ, et al. Phylogenetic diversity and genotypical complexity of H9N2 influenza A viruses revealed by genomic sequence analysis. PLoS One. 2011;6(2):e17212. pmid:21386964
- 31. Jiang W, Liu S, Hou G, Li J, Zhuang Q, Wang S, et al. Chinese and global distribution of H9 subtype avian influenza viruses. PLoS One. 2012;7(12):e52671. pmid:23285143
- 32. Bahl J, Krauss S, Kuehnert D, Fourment M, Raven G, Pryor SP, et al. Influenza A virus migration and persistence in North American wild birds. PLoS Pathog. 2013;9(8):e1003570. pmid:24009503
- 33. Chen RB, Holmes EC. Hitchhiking and the population genetic structure of avian influenza virus. Journal of Molecular Evolution. 2010;70(1):98–105. pmid:20041240
- 34. Worobey M, Han GZ, Rambaut A. A synchronized global sweep of the internal genes of modern avian influenza virus. Nature. 2014;508(7495):254–257. pmid:24531761
- 35. Pi C, Rou Z, Horowitz S. Fair or Fowl? Industrialization of Poultry Production in China. Institute for Agriculture and Trade Policy: 2014.
- 36. Minin VN, Suchard MA. Counting labeled transitions in continuous-time Markov models of evolution. J Math Biol. 2008;56(3):391–412. pmid:17874105
- 37. Edwards Ceiridwen J, Suchard Marc A, Lemey P, Welch John J, Barnes I, Fulton Tara L, et al. Ancient hybridization and an Irish origin for the modern polar bear matriline. Current Biology. 2011;21(15):1251–1258. pmid:21737280
- 38. Brown IH. Summary of avian influenza activity in Europe, Asia and Africa 2006–2009. Avian Dis. 2010;54:187–193. pmid:20521631
- 39. Winker K, Gibson DD. The Asia-to-America influx of avian influenza wild bird hosts is large. Avian Dis. 2010;54(1 Suppl):477–482. pmid:20521682
- 40. Pasick J, Berhane Y, Joseph T, Bowes V, Hisanaga T, Handel K, et al. Reassortant highly pathogenic Influenza A H5N2 virus containing gene segments related to Eurasian H5N8 in British Columbia, Canada, 2014. Sci Rep. 2015;5.
- 41. Prosser DJ, Cui P, Takekawa JY, Tang M, Hou Y, Collins BM, et al. Wild bird migration across the Qinghai-Tibetan Plateau: a transmission route for highly pathogenic H5N1. PLoS ONE. 2011;6(3):e17622. pmid:21408010
- 42. Van Boeckel TP, Prosser D, Franceschini G, Biradar C, Wint W, Robinson T, et al. Modelling the distribution of domestic ducks in Monsoon Asia. Agriculture, ecosystems & environment. 2011;141(3–4):373–380.
- 43. Drummond AJ, Suchard MA, Xie D, Rambaut A. Bayesian phylogenetics with BEAUti and the BEAST 1.7. Mol Biol Evol. 2012;29(8):1969–1973. pmid:22367748
- 44. Drummond AJ, Rambaut A. BEAST: Bayesian evolutionary analysis by sampling trees. BMC Evolutionary Biology. 2007;7:214. pmid:17996036
- 45. Smith GJ, Vijaykrishna D, Bahl J, Lycett SJ, Worobey M, Pybus OG, et al. Origins and evolutionary genomics of the 2009 swine-origin H1N1 influenza A epidemic. Nature. 2009;459(7250):1122–1125. pmid:19516283
- 46. Smith GJ, Bahl J, Vijaykrishna D, Zhang J, Poon LL, Chen H, et al. Dating the emergence of pandemic influenza viruses. Proc Natl Acad Sci USA. 2009;106(28):11709–11712. pmid:19597152
- 47. Bahl J, Nelson MI, Chan KH, Chen R, Vijaykrishna D, Halpin RA, et al. Temporally structured metapopulation dynamics and persistence of influenza A H3N2 virus in humans. Proc Natl Acad Sci USA. 2011;108(48):19359–19364. pmid:22084096
- 48. Li KS, Guan Y, Wang J, Smith GJ, Xu KM, Duan L, et al. Genesis of a highly pathogenic and potentially pandemic H5N1 influenza virus in eastern Asia. Nature. 2004;430(6996):209–213. pmid:15241415
- 49. Lemey P, Rambaut A, Bedford T, Faria N, Bielejec F, Baele G, et al. Unifying Viral Genetics and Human Transportation Data to Predict the Global Transmission Dynamics of Human Influenza H3N2. PLoS Pathog. 2014;10(2).
- 50. Everitt BS. The Analysis of Contingency Tables. Second edition ed. Boca Raton, FL: CRC Press; 1992.
- 51. Bahl J, Pham TT, Hill NJ, Hussein ITM, Ma EJ, Easterday BC, Halpin RA, Stockwell TB, Wentworth DE, Kayali G, Krauss S, Shultz-Cherry S, Webster RG, Webby RJ, Swartz MD, Smith GJD, Runstadler RA (2016) Data from: Ecosystem interactions underlie the spread of avian influenza A viruses with pandemic potential. Dryad Digital Repository.