The Basic Reproduction Number as a Predictor for Epidemic Outbreaks in Temporal Networks

Petter Holme; Naoki Masuda

doi:10.1371/journal.pone.0120567

Abstract

The basic reproduction number R₀—the number of individuals directly infected by an infectious person in an otherwise susceptible population—is arguably the most widely used estimator of how severe an epidemic outbreak can be. This severity can be more directly measured as the fraction of people infected once the outbreak is over, Ω. In traditional mathematical epidemiology and common formulations of static network epidemiology, there is a deterministic relationship between R₀ and Ω. However, if one considers disease spreading on a temporal contact network—where one knows when contacts happen, not only between whom—then larger R₀ does not necessarily imply larger Ω. In this paper, we numerically investigate the relationship between R₀ and Ω for a set of empirical temporal networks of human contacts. Among 31 explanatory descriptors of temporal network structure, we identify those that make R₀ an imperfect predictor of Ω. We find that descriptors related to both temporal and topological aspects affect the relationship between R₀ and Ω, but in different ways.

Citation: Holme P, Masuda N (2015) The Basic Reproduction Number as a Predictor for Epidemic Outbreaks in Temporal Networks. PLoS ONE 10(3): e0120567. https://doi.org/10.1371/journal.pone.0120567

Academic Editor: Gui-Quan Sun, Shanxi University, CHINA

Received: July 24, 2014; Accepted: February 3, 2015; Published: March 20, 2015

Copyright: © 2015 Holme, Masuda. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited

Data Availability: All the data sets used in this paper for Dating, Internet community, E-mail 1, E-mail 2, Forum are from previously published studies referenced in the paper and are available from the author of those studies. The contact details of the author is: Petter Holme petter.holme@physics.umu.se.

Funding: PH was supported by Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education (2013R1A1A2011947) and the Swedish Research Council. NM acknowledges support from JST, CREST, and JST, ERATO, Kawarabayashi Large Graph Project. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Competing interests: Petter Holme and Naoki Masuda are academic editors of PLoS ONE. This does not alter the authors' adherence to all the PLoS ONE policies on sharing data and materials.

Introduction

The interaction between medical and theoretical epidemiology of infectious diseases is probably not as strong as it should. Many results in the respective fields fail to migrate to the other. There are of course exceptions. Perhaps the most important are the ideas of epidemic thresholds and the parameter R₀—the basic reproduction number—as a key predictor of the epidemiological severity of a disease [1,2]. R₀ is defined as the expected number of other individuals that an infected individual will infect if he or she enters a population entirely composed of susceptible individuals. It is thus a combined property of the process of contagion and the contact patterns of the population. In classic mathematical models of infectious disease spreading, R₀ = 1 marks an epidemic threshold. If R₀ < 1, the expected fraction of infected people in an outbreak, denoted by Ω, will not depend on the total population size N. If R₀ > 1, the expected value of Ω is proportional to N. In other words, in the limit of large populations, a finite fraction of the population can be infected. The focus on R₀ in the literature has sometimes been so strong that researchers rather calculate R₀ than quantities directly related to the outbreak, such as prevalence, incidence, and time to the peak prevalence.

The use of R₀ is not entirely unproblematic. First, it is hard to estimate both in models [3–5] and from outbreak data [6–8]. Second, the result that R₀ = 1 defines an epidemic threshold rests on very coarse assumptions [3,9,10]. For example, one needs to assume that every pair of individuals has the same chance of interacting at any given time. In fact, interaction rates depend on pairs of individuals—people living in the same city are more likely to interact than those living in different cities. The derivation of R₀ has been extended to the case in which information about contact networks (describing who can spread the disease to whom) is available [11–15]. In this case, the derivation is usually restricted to the case of regular networks, where all individuals have the same degree (number of neighbors in the contact network) [14,15]. Sometimes people use definitions of R₀ that differs from the original [11–13,16] in a strict sense (but typically captures some similar property relevant for the modeling framework in question). The assumption that a pair of individuals interacts at the same rate over time does not hold true in reality either. For example, interaction is more likely to take place when most people are awake. This point is a reason for the increasing interest in temporal networks (showing who is in contact with whom, at what time) as a representation for the interactions underlying epidemic spreading, which focus on time dependence of networks [17–19]. Another reason is the increasing availability of data sets of temporal networks—typically lists of anonymized id numbers of two individuals and the times when these two individuals have been in contact (close enough for a disease to spread). The temporal network literature has focused on spreading processes (not only epidemic spreading) and how these are affected by the structure. Structure, in this case, refers to the way the network differs from a random temporal network (where the contact can happen with any pair of nodes with equal probability, at any time). Studies of epidemic models on temporal networks have found that e.g. a broad distribution of interevent times slows down the spreading [18,19].

There have been a few attempts to examine R₀ for temporal networks. Ref. [16,20], for example, derives R₀ for a specific model of temporal networks. Ref. [21] measures R₀ in empirical temporal networks, but does not relate it to prevalence, final outbreak size or other direct measures of outbreak severity.

One possible approach in this line of research is to find more accurate estimators than R₀ of disease severity. However, R₀ is routinely estimated for different infectious diseases by public health organizations worldwide. These estimates constitute an important resource for monitoring and comparing disease outbreaks. Rather than discarding this data by proposing another quantity, we will investigate what R₀ really tells us about disease spreading in empirical temporal networks of human contacts. Including the temporal information can make a big impact on the outbreak dynamics compared to modeling epidemics on a static network, let alone a fully mixed model [17–19].

We use the Susceptible–Infectious–Recovered (SIR) model with constant disease duration [22]. This model has two control parameters—the probability of disease transmission (upon a contact between an infectious and susceptible individual), denoted by λ, and the duration of the infectious stage, denoted by δ. We numerically simulate the SIR model on various temporal networks. First, we observe that in this case Ω is not uniquely determined from an R₀ value. A combination of λ and δ can give a larger R₀ but a smaller Ω than another combination does. Then, we investigate how the structure of the temporal contact network explains the relationship between R₀ and Ω. Instead of building a theory that bridges the microscopic structure of temporal network data and the emergent properties of the outbreak, we screen many potentially interesting descriptors of the temporal network structure by identifying those that are strongly correlated with the descriptors of the shape of scatter plots of Ω vs. R₀.

Results

Empirical data

We analyze empirical sequences of contacts between people. These data sets can be divided into physical proximity and electronic communication data. The former type could be interesting for studying information and disease spreading mediated by human contacts. The latter type is primarily of interest in the context of information spreading (bearing in mind that information spreading not necessarily follows the same dynamics as infectious diseases). In all data sets, nodes are human individuals. We list some basic statistics of the data sets in Table 1.

Download:

Table 1. The basic statistics of the data sets.

https://doi.org/10.1371/journal.pone.0120567.t001

One data set belonging to the physical proximity class comes from the Reality mining study [23], where contacts between university students were recorded when their smartphones were within Bluetooth range (10–15 m). We use the same subset of this data as in Ref. [24]. Another class of proximity data was collected from groups of people wearing radio-frequency identification sensors. One such dataset comes from the attendees of a conference [25] (Conference), another from a school (School) [26], another from a hospital (Hospital) [27] and yet another from visitors to a gallery (Gallery) [25]. School and Gallery are collected for two and 69 days, respectively. We analyze the days separately and average the results over the days. In these data sets, a contact between people closer than 1~1.5m was recorded every 20 seconds. Finally, we use a data set of sexual contacts between sex sellers and buyers collected from a Brazilian web forum (Prostitution) [28].

The class of electronic communication data includes two e-mail networks. These data sets are described in detail in Refs. [29] (E-mail 1) and [30] (E-mail 2). E-mails have a natural direction from the sender to the recipient. However, to analyze all the data sets in the same way, we treat them as undirected temporal networks. We furthermore study two Internet communities: a dating community (Dating) [31] and a film community (Online community) [32]. The contacts in these data sets represent messages from person to person like e-mails do. In Dating there are also “flirts” with which one user expresses interest in another (but does not send text, images or other information). A slightly different form of online pair-wise interaction is posting to public web pages. We study one data set of posts to the home page (“wall”) of Facebook [33] and a data set from the aforementioned film community where a contact represents a reply to a post at a public forum (Forum) [32]. One contact in these data sets is thus a publically accessible message from one user to another.

Final outbreak size as a function of R₀

In Fig. 1, we show scatter plots of Ω vs. R₀ for our data sets (see S1 Fig. for the results for the Gallery data). One scatter plot corresponds to one data set. More precisely, we measure R₀ directly from the simulations according to the definition—the average number of others infected by the infection source. Ω is the fraction of recovered individuals once the outbreak has subsided, i.e., when there no longer are infectious individuals. A point in a scatter plot represents an average over 10⁶ runs for given parameter values (λ,δ). Each run starts with one infected node that is selected from all nodes with the equal probability. We assume the source of the infection is infected at the time of the first contact. In total, we sample 20×20 points in the (λ,δ) parameter space, where each parameter varies from 0.001 to 1 with exponentially increasing intervals. δ is defined as a fraction of the total sampling time.

Download:

Fig 1. The average outbreak size plotted against the basic reproduction number for 12 data sets (indicated in the Fig.) of human interaction.

Each point of the scatter plots corresponds to one pair (λ,δ), where λ is the infection probability and δ is the duration of infection. In the upper left corner there is a legend for the color-coding of these points. In the other panels, a data point is an average over 10⁴ runs of the SIR model as described in the Methods section. The vertical lines mark R₀ = 1—the epidemic threshold for the canonical, fully mixed SIR model.

https://doi.org/10.1371/journal.pone.0120567.g001

For all the data sets, there is a significant deviation from a deterministic relationship between R₀ and Ω. Here, a deterministic relationship is operationally defined as the situation in which the Ω value is uniquely determined by the value of R₀ (as it would be in most fully mixed and network models we are aware of). Interestingly, the way these scatter plots deviate from a deterministic relationship depends on data sets. For example, for the Hospital data red points are typically on top of the green ones—i.e. points with higher λ and lower δ give larger outbreaks than points with similar R₀ but lower λ and higher δ. For the Facebook data the situation is reversed.

Characterizing the shape of the Ω vs. R₀ point cloud

To explore the causes of the imperfectness of R₀ as a predictor of Ω, we define six so-called shape descriptors, which measure the shape of the point clouds shown in Fig. 1. The shape descriptors are listed in Table 2, their definitions are illustrated in Fig. 2, and their values for each data set are shown in S2 Fig.

Download:

Table 2. Shape descriptors for the point clouds shown in Fig. 1.

https://doi.org/10.1371/journal.pone.0120567.t002

Download:

Fig 2. Explanation of shape descriptors to characterize the point clouds shown in Fig. 1.

All examples come from the Conference data set. Panel A describes Kendall’s τ—a correlation coefficient based on the counting of discordant pairs (pairs of points connected by a line of negative slope). Panels B and C show the maximal separation of discordant pairs. In B, the measures focus on the pair with the largest separation in the R₀ direction. Δ_R0 denotes the maximum separation; ρ_R0 is the mean R₀ value for the maximally discordant pair. Panel C shows the similar quantities, Δ_Ω and ρ_Ω, defined along the Ω direction. Panels D, E, and F illustrate the measurement of λδ-balance via τ_αΩ. This descriptor captures the tendency of some data sets to have high-λ, low-δ points above high-δ, low-λ points, while for other data sets, the situation is reversed. Panel D illustrates how the R₀ axis is segmented into bins. Panel E shows how we assign a (λ,δ)-plane angle, α, to all points in the bin. Panel F shows how we measure the correlation between α and Ω, which is very weak in this particular case.

https://doi.org/10.1371/journal.pone.0120567.g002

The first shape descriptors is the Kendall’s τ (Fig. 2A), which captures how good R₀ is as a predictor of Ω. We chose Kendall’s τ because the Ω vs. R₀ curve is highly non-linear such that the Pearson’s correlation coefficient would underestimate how good a predictor R₀ is. Among non-linear correlation measures, Kendall’s τ, is the most principled and easiest to understand. It counts the number of point pairs that are connected by a line with a positive slope (concordant pairs) and a negative slope (discordant pairs). Kendall’s τ is then the number of concordant pairs minus the number of discordant pairs divided by the total number of pairs. In the context of measuring the R₀-Ω correlation, we denote Kendall’s τ by τ_R0Ω.

Next four shape descriptors focus on the region in the (R₀,Ω) space where the spread of the points is the largest (Fig. 2B, C). We look for the discordant (λ,δ) pair with the largest difference between the its R₀ values. This difference defines Δ_R0. Similarly, the largest difference in Ω among discordant pairs defines Δ_Ω. We also measure the average R₀ value, ρ_R0, of the two R₀ values derived from the discordant pair maximally separated in R₀. Similarly ρ_Ω is the average R₀ value of the discordant pair maximally separated in Ω. The shape descriptors ρ_R0 and ρ_Ω thus show the locations on the R₀ axis of the maximally separated discordant pairs. They may be related to the location of the epidemic threshold, where Ω takes off from zero in an infinite population.

As mentioned above, for some data sets, given a value of R₀, higher δ implies higher Ω (Hospital), whereas the relationship is reversed for other data sets. To quantify this observation, we define the sixth shape descriptor τ_αΩ that we call λδ-balance for short. To define τ_αΩ, we start by dividing the range of R₀ into ten equidistant bins between the smallest and largest observed values (Fig. 2D). Within a bin, the points have fairly similar R₀ values, but their λ and δ values can be diverse. To measure the effect of the balance between λ and δ on Ω, we calculate the angle α that a (λ,δ) pair relative to the origin makes to the diagonal in the (λ,δ)-plane, i.e., the λ = δ line (Fig. 2E). Then, we measure the correlation between α and Ω by Kendall’s τ (Fig. 2F). Finally, we average the values for the different bins. To avoid confusion, we denote the calculated Kendall’s τ by τ_αΩ.

Temporal and static network descriptors

To characterize the structure of the contact structures modeled as temporal networks, we use 31 different quantities, which we call network descriptors. They are listed in Table 3. We have chosen quantities that are relatively simple and intuitive.

Download:

Table 3. Descriptors of temporal network structure.

https://doi.org/10.1371/journal.pone.0120567.t003

Time evolution.

We calculate eight network descriptors that characterize the long-term behavior of the contact dynamics—basically, how the contacts process differs from a stationary process. The background is that some of these data sets (e.g. Prostitution, Dating, Forum and Online community) are growing throughout the sampling period. A fast-spreading outbreak would thus, effectively, spread in a larger population (defined as the set of individuals possible to be infected) in the end than in the beginning. The Gallery data is also special in that the individuals in the beginning of the sampling are not present in the end. Ref. [34] argues, in more general terms, that when the first and last contacts of a link (pairs of nodes that are in contact at least once) happen is important for the behavior of outbreaks.

The first such set of quantities focuses on the time when nodes and links appear for the first time. For example, Ref. [34] points at the growth of the Prostitution data set as a factor behind the observation [35] that the order of events speeds up disease spreading in this data. We use f to symbolize this class of network descriptors. We measure the fraction of links present at half the sampling time relative to the final number of links. Because several studies in temporal networks address the role of the order of events [35,36], rather than the time itself, we also measure the corresponding quantities if time is replaced by the contact index (the index of the contact number—1 for the first contact, 2 for the second, etc.). These have subscript ‘C’ as opposed to ‘T’ for time. Furthermore, the descriptors concerning nodes and links have the subscripts ‘N’ and ‘L’, respectively.

Another class of network descriptors, denoted F, focuses on persistent nodes or links. F is the fraction of nodes (subscript N) or links (subscript L) present in the first and last 5% of time (T) or contact index (C). Fig. 3 illustrates f and F.

Download:

Fig 3. Illustration of two descriptors of temporal network structure, f_LC and F_NT.

The measure illustrated in A and B, f_LC, uses the order of the contact to separate the contacts; the measure in C and D, F_NT, uses the real time. Panels A and C are time-line representations of a temporal network data set. Each horizontal line represents an individual. A contact between two individuals is indicated by a vertical arc. In A and B, we focus on the first contact between a pair of nodes. We measure the fraction of the number of node pairs that have been in direct contact when a fraction ν of the total number of contacts has been observed. This fraction is plotted against ν in B. The value at ν = 1/2 defines f_LC. In the timeline (A) we highlight the first half contacts, which contribute to the calculation of f_LC, in color and the first contact between each node pair by black contours. In panels C and D, we illustrate the calculation of F_NT, which looks at nodes (rather than links) present in both the first and last time interval of width ϕ (measured as a fraction of the sampling time), shown in color in the timeline (C). The fraction of such nodes as a function of ϕ is graphed in D. F_NT is defined as the value at ϕ = 0.05.

https://doi.org/10.1371/journal.pone.0120567.g003

These network descriptors calculated across the different data sets span a relatively wide range. For example, f_NT, the fraction of links present at half the sampling time, takes values from 0.17 (Facebook) to 0.98 (School).

Node and link activity.

The node activity descriptors relate to the bursty nature of human activity as characterized by intense periods of activity separated by long periods of quiescence [37]. To characterize burstiness, one usually starts from interevent times, i.e., the times between consecutive contacts for a node or link. For simplicity, we ignore correlations between consecutive interevent times and focus on the probability distribution of interevent times. The distribution is often right-skewed—a structure that has been shown to slow down epidemic spreading [38–41]. To characterize the distribution, we measure four descriptors, i.e., the mean μ, standard deviation σ, coefficient of variation c (i.e. the standard deviation divided by the mean) [37], and the sample skewness given by (1) where μ₂ and μ₃ are the second and third moments of the distribution, respectively.

Some studies have pointed out that the duration of presence of a node or link in the data can be more important for spreading dynamics than interevent times [34,42]. For this reason, we also study the distribution of node and link durations and use the same four descriptors. In sum, we use 16 network descriptors in this category—μ, σ, c and γ for interevent times and duration of activity, for both nodes and links.

Degree distribution.

In the following, we define static network descriptors, i.e., those for aggregate contact networks. Among them, the degree distribution is arguably the most important for disease spreading. A right-skewed degree distribution, which is observed in many empirical networks, is known to facilitate disease spreading [43]. For simplicity, we use the network of accumulated contacts (even though one may be able to find network representations of temporal network data that better captures the important structures for disease spreading [44]). To summarize the shape of the degree distribution, we use the same four descriptors as for the interevent time and duration distributions—μ, σ, c and γ.

Other static network descriptors.

We also measure other static network descriptors. First, we count the number of nodes, N. Because the number of links is equal to the half of the mean degree times N, we do not include it in the analysis.

We also measure the degree assortativity r (essentially, the Pearson correlation coefficient of the degrees at either side of a link). This network descriptor measures the tendency for assortative mixing by degree, i.e., whether high-degree nodes tend to connect to high-degree nodes and low-degree nodes to low-degree nodes. It has been shown that assortativity affects disease spreading (exactly how depends on the specific epidemic model and other structures of the contacts) [45–48].

Finally, we measure the clustering coefficient—the number of triangles in the network divided by the number of connected triples (not necessarily a full triangle) normalized to the interval [0,1]. Similar to assortativity, the relative number of triangles (clustering) is also a contact-structural factor influencing disease dynamics [46–51]. As an example, if we compare SI disease spreading on a clustered network with a random network with the same number of nodes and links, the early stage of the spreading would be faster in the less clustered network [49,50]. Intuitively, if a disease spreads from one individual to two neighbors, and the three individuals are connected as a triangle, then the third link of the triangle is useless for the spreading process. If the third link were connected elsewhere, the disease would spread faster.

Structural determinants of the Ω vs. R₀ point cloud

Ultimately, one would like to explain how the relations between R₀, Ω, λ and δ emerge from the contact structure. In this work, as mentioned, we take a different approach and look at the Pearson correlation coefficient between the shape descriptors (Table 2) and network descriptors (Table 3). In this way, we search for network descriptors that contribute to the deviation from a deterministic relationship between Ω and R₀. A temporal network data set defines a data point that is fed to the calculation of the correlation coefficient; there are 12 data points available for regression analysis. We decided to use the Pearson correlation coefficient and not multivariate regression methods because there are 31 dependent variables, i.e., network descriptors (and 6 independent variables, i.e., shape descriptors), whereas we have only 12 data points.

In Fig. 4, we plot the results from our correlation analysis. In each panel, we plot the coefficient of determination R² (square of the Pearson correlation coefficient) between a shape descriptor and each network descriptor. The network descriptors are grouped in accordance with the subsections of the previous section. Scatter plots of all pairs of network descriptor and shape descriptors are shown in S2 Fig.

Download:

Fig 4. The coefficient of determination R² between the shape descriptors of the R₀ vs. Ω point cloud and network descriptors.

The error bars are standard errors estimated by the jackknife resampling method. *: p < 0.05, **: p < 0.01, ***: p < 0.001.

https://doi.org/10.1371/journal.pone.0120567.g004

The predictability of R₀ with respect to Ω, as measured by τ_R0Ω (Fig. 4A), is to some extent (p < 0.05) explained by the coefficients of variation of the interevent time for the node and link interevent time distribution, c_Lt and c_Nt. This correlation is positive (see S2 Fig.), so broader interevent time distributions (burstier contact patterns) imply worse predictability. Furthermore, the α dependence of Ω is most strongly correlated with the burstiness of the nodes c_Nt. In this case the correlation is negative (S2 Fig.). This means that if we compare two points with the same R₀ value, where the first parameter set has a comparatively large transmission probability and short disease duration than the second, then the first parameter set tends to trigger a larger outbreak size than the second. These quantities are strongly affected by burstiness. The remaining four shape descriptors concern the location (in the (R₀,Ω) space) of the biggest deviation from a deterministic relationship and the size of the deviation.

Fig. 4C shows the correlation coefficient with the location along the R₀ axis of the mid-point of the discordant pair with the largest separation in R₀, i.e., ρ_R0. Also in this case, network descriptors derived from the interevent-time distributions are relatively strongly correlated with ρ_R0. The mean μ_NT and standard deviation σ_NT as well as the skewness γ_NT show strong correlations. Furthermore, the fraction of links present in both the first and last 5% of the contacts (F_LC) shows an R² = 0.4 correlation with ρ_R0 (p = 0.06). Furthermore, even though they do not reach the p < 0.05 significance criterion, other link-related quantities of the time evolution (μ_Lt, c_Lt, γ_Lt, μ_Lτ, σ_Lτ, c_Lτ and γ_Lτ) show R² values over 0.3. Fig. 4D indicates that the largest width of a discordant pair, Δ_R0, is strongly correlated with a number of temporal network descriptors. First, Δ_R0 is correlated with both those relating to the node and link activity when the real time, not the contact index, is used (μ_Lt, c_Lt, γ_Lt, μ_Lt, σ_Lt and γ_Lt). Second, Δ_R0 is correlated with the time evolution, especially with the F quantities—measuring the fraction of links and nodes present both in the beginning and end of the sampling period (f_NC, f_NT, F_NC, F_LC, F_NT, F_LT); p < 0.01). Fig. 4E shows the correlation with the R₀-location with the discordant pair with the largest separation in Ω, ρ_Ω. Just like ρ_R0 (Fig. 4C), much of the variance in ρ_Ω is explained by the time-related descriptors in real time (f_LC, f_LT, F_LC, F_LT and σ_Lt). More interestingly, the largest Ω-separation of discordant pairs, Δ_Ω (Fig. 4F) is strongly and positively correlated with some static network descriptors, i.e., the coefficient of variation and the skewness of the degree distribution (c_k and γ_k).

Discussion

In this work, we have shown that temporal network structure of human contacts can change the interpretation of the basic reproduction number R₀. We have found pairs of SIR parameter values (λ₁,δ₁) and (λ₂,δ₂) such that R₀ (λ₁,δ₁) < R₀ (λ₂,δ₂) and Ω(λ₁,δ₁) > Ω(λ₂,δ₂). In other words, the expected number of secondary infections of the outbreak’s source is smaller for (λ₁,δ₁) than (λ₂,δ₂), but the expected final fraction of individuals that had the infection is larger for (λ₁,δ₁) than (λ₂,δ₂). It is hard to give a succinct explanation for this phenomenon, and we do not attempt that in the present paper. It relates to many aspects of the contact patterns—static network structures, dynamic network structures, and the fact that empirical data is finite-sized, non-equilibrium and inhomogeneous [18,19,52]. On the other hand, it is easy to imagine scenarios where this happens. Assume, for simplicity, that λ₁≪ λ₂, δ₁ ≫ δ₂ and the nodes split in two halves—one half active throughout the sampling time, the other half entering after some time. Then, in the (λ₂,δ₂) scenario, the larger λ (i.e., λ₂) could cause a burnout outbreak that ends before the second group of nodes enters the system. Therefore, R₀ would be high, whereas Ω does not exceed 1/2. In the (λ₁,δ₁) scenario, R₀ would be smaller. However, the duration of infection would be long enough for the second half of the nodes to be infected, so Ω could be larger than 1/2. Therefore, a larger value of R₀ does not necessarily mean that the disease spreads more easily. At the same time, the correlation between R₀ and Ω is often strong, especially if one accepts a non-linear relationship. For most practical purposes, it probably suffices to assume that R₀ is a good predictor of Ω.

Looking closer at the deviation of the Ω vs. R₀ scatter plots from a deterministic relationship and structural correlates of the amount of the deviation, we notice that a combination of seemingly unrelated descriptors of temporal network structure often shows a significant correlation. This result suggests that—although a better achievement may be obtained through identification of microscopic factors contributing to these phenomena—such factors could be interdependent and hard to fully disentangle. Probably a fruitful path would be to vary the structure in models of contact patterns and look at responses in the Ω vs. R₀ plots. However, already based on the current numerical results, we can draw some conclusions. One of them is that the temporal network factors often seem important. In particular, the quantities relating to the interevent-time distributions are significant predictors of e.g. the overall correlation between Ω and R₀. This is a bit surprising in the light of Refs. [35] and [41] that have found that the birth and death of links and nodes influence (some other quantities relating to) spreading phenomena (probably also the importance of the “loyalty” metrics in Ref. [52]). Only one aspect of the Ω vs. R₀ plots—Δ_Ω (see Table 2 and Fig. 2C for definition)—is primarily explained by the static network properties, specifically the coefficient of variation and skewness of the degree distribution. This result is accompanied by the largest confidence level (p < 0.001) of the correlation. In contrast to Δ_Ω, a similar shape descriptor Δ_R0 (see Table 2 and Fig. 2B for definition) is strongly correlated with several of the temporal network properties and not with the static ones. Especially the former observation is interesting—even though temporal structure is needed to see any spread in Δ_Ω at all, it is the degree distribution that is the most strongly correlated with the actual value of Δ_Ω.

Needless to say, this work opens more questions than it answers. In particular, it calls for mechanistic modeling connecting R₀ and Ω. Another direction would be to develop improved estimators of disease severity.

Methods

In this section, we will go through technicalities of the methods that are not fully explained in the Results section.

SIR simulations

In this work we use the constant duration SIR model (that defines a Monte Carlo simulation of the SIR model). We initialize all individuals to susceptible and pick one random individual i to be the source of the infection. We assume that i becomes infected at the same time as its first appearance in the data. In a contact between an infectious and susceptible, the susceptible will (instantaneously) become infectious with a probability λ. Infectious individuals stay infectious for δ time steps after which they become recovered. If many contacts happen during the same time step, we go through them in a random order.

A more common version of the SIR model is to let infectious individuals recover with a constant rate. Qualitatively, both versions give the same results [21]. We use the constant duration version because it is a bit more realistic [53,54] and makes the code a bit faster than the exponentially distributed durations.

Measuring the λδ-balance

A combination of a large λ and small δ can give the same R₀ value as a combination of a small λ and large δ. At the same time, Ω may depend on one of these parameters more strongly than on the other. The result is a vertical trend in the colors of the points as seen in Fig. 1 (most clearly for the Forum, Dating and Online community data). We measure this tendency—the λδ-balance—as illustrated in Fig. 2D, E, and F. First, we segment the R₀ axis into ten bins. The number of bins is determined based on a trade-off between minimizing the spread of the points along the R₀ axis, and maximizing the number of points per bin. After the division into bins, we capture the λδ-balance via the angle α between the line from the origin to the parameter value (λ,δ) and the λ = δ line. Finally, we calculate Kendall’s τ for the relationship between α and Ω and average the τ values over all bins.

Data availability

The Conference, Gallery, Hospital and School data sets are available from http://www.sociopatterns.org/datasets/, the Prostitution data set is available as the Supporting Information of Ref. 35, and the Facebook data is available from http://konect.uni-koblenz.de/. Other data is available from the authors of the papers where they were first analyzed (as cited above).

Supporting Information

S1 Fig. R₀ vs. Ω plots for the Gallery data.

This file contains plots corresponding to Fig. 1 for all 69 days of data for the Gallery data set.

https://doi.org/10.1371/journal.pone.0120567.s001

(PDF)

S2 Fig. Scatterplots between the R₀-Ω cloud shape descriptors and network structural descriptors.

The numbers identify the data sets as follows: Conference (1), Dating (2), E-mail 1 (3), E-mail 2 (4), Facebook (5), Forum (6), Gallery (7), Hospital (8), Online community (9), Prostitution (10), Reality mining (11), School (12).

https://doi.org/10.1371/journal.pone.0120567.s002

(PDF)

Acknowledgments

We thank the SocioPatterns collaboration (www.sociopatterns.org) for privileged access to the School data set.

Author Contributions

Conceived and designed the experiments: PH NM. Performed the experiments: PH. Analyzed the data: PH. Contributed reagents/materials/analysis tools: PH. Wrote the paper: PH NM.

References

1. Heesterbeek JAP. A brief history of R0 and a recipe for its calculation. Acta Biotheoretica 2002;50: 189–204. pmid:12211331
- View Article
- PubMed/NCBI
- Google Scholar
2. Diekmann O, Heesterbeek H. Mathematical Epidemiology of Infectious Diseases: Model Building, Analysis and Interpretation. New York: Wiley; 2000. https://doi.org/10.1016/j.mbs.2008.02.005 pmid:18405926
3. Heffernan JM, Smith RJ, Wahl LM. Perspectives on the basic reproductive ratio. J Roy Soc Interface 2005;4: 281–293.
- View Article
- Google Scholar
4. Holland Jones J. Notes on R0. Unpublished, Standford University; 2007. Available http://web.stanford.edu/~jhj1/teachingdocs/Jones-on-R0.pdf Accessed 15 February 2015.
5. van den Driessche P, Watmough J. Further Notes on the basic reproduction number. Mathematical Epidemiology. Lecture Notes in Mathematics 2008;1945: 159–178.
- View Article
- Google Scholar
6. Dietz K. The estimation of the basic reproduction number for infectious diseases. Stat Methods Med Res 1993;2: 23–41. pmid:8261248
- View Article
- PubMed/NCBI
- Google Scholar
7. Heffernan J M, Wahl L M. Improving estimates of the basic reproductive ratio: Using both the mean and the dispersal of transition times. Theor Popul Biol 2006;70: 135–145. pmid:16712889
- View Article
- PubMed/NCBI
- Google Scholar
8. Massad E, Coutinho FAB, Burattini MN, Amaku M. Estimation of R0 from the initial phase of an outbreak of a vector-borne infection. Trop Med Int Health 2010;15: 120–126. pmid:19891761
- View Article
- PubMed/NCBI
- Google Scholar
9. Anderson RM, May RM. Infectious diseases of humans. Oxford UK: Oxford University Press; 1991.
10. Li J, Blakeley D, Smith RJ. The failure of R0. Comput Math Methods Med 2011;2011: 527610. pmid:21860658
- View Article
- PubMed/NCBI
- Google Scholar
11. Cross PC, Johnson PLF, Lloyd-Smith JO, Getz WM. Utility of R0 as a predictor of disease invasion in structured populations. J R Soc Interface 2007;4: 315–324. pmid:17251146
- View Article
- PubMed/NCBI
- Google Scholar
12. van den Driessche P, Watmough J. Reproduction numbers and sub-threshold endemic equilibria for compartmental models of disease transmission. Math Biosci 2002;180: 29–48. pmid:12387915
- View Article
- PubMed/NCBI
- Google Scholar
13. Colizza V, Vespignani A. Epidemic modeling in metapopulation systems with heterogeneous coupling pattern: theory and simulations. J Theor Biol 2008;251: 450–467. pmid:18222487
- View Article
- PubMed/NCBI
- Google Scholar
14. Keeling MJ, Grenbell BT. Individual-based perspectives on R0. J Theor Biol 2000;203: 51–61. pmid:10677276
- View Article
- PubMed/NCBI
- Google Scholar
15. Keeling MJ. The implications of network structure for epidemic dynamics. Theor Popul Biol 2005;67: 1–8. pmid:15649519
- View Article
- PubMed/NCBI
- Google Scholar
16. Volz E, Meyers LA. Epidemic thresholds in dynamic contact networks. J R Soc Interface 2009;6: 233–241. pmid:18664429
- View Article
- PubMed/NCBI
- Google Scholar
17. Bansal S, Read J, Pourbohloul B, Meyers LA. The dynamic nature of contact networks in infectious disease epidemiology. J Biol Dyn 2010;4: 478–489. pmid:22877143
- View Article
- PubMed/NCBI
- Google Scholar
18. Holme P, Saramäki J. Temporal networks. Phys Rep 2012;519: 97–125.
- View Article
- Google Scholar
19. Masuda N, Holme P. Predicting and controlling infectious disease epidemics using temporal networks. F1000Prime Rep 2013;5: 6. pmid:23513178
- View Article
- PubMed/NCBI
- Google Scholar
20. Taylor M, Taylor TJ, Kiss IZ. Epidemic threshold and control in a dynamic network. Phys Rev E 2012;85: 016103. pmid:22400621
- View Article
- PubMed/NCBI
- Google Scholar
21. Rocha LEC, Blondel VD. Bursts of vertex activation and epidemics in evolving networks. PLoS Comput Biol 2013;9: e1002974. pmid:23555211
- View Article
- PubMed/NCBI
- Google Scholar
22. Holme P. Model versions and fast algorithms for network epidemiology. Journal of Logistical Engineering University 2014;30: 1–7.
- View Article
- Google Scholar
23. Eagle N, Pentland A. Reality mining: Sensing complex social systems. Personal and Ubiquitous Computing 2006;10: 255–268.
- View Article
- Google Scholar
24. Pfitzner R, Scholtes I, Garas A, Tessone TJ, Schweitzer F. Betweenness preference: Quantifying correlations in the topological dynamics of temporal networks. Phys Rev Lett 2013;110: 198701. pmid:23705746
- View Article
- PubMed/NCBI
- Google Scholar
25. Isella L, Stehlé J, Barrat A, Cattuto C, Pinton JF, van den Broeck W. What’s in a crowd? Analysis of face-to-face behavioral networks. J Theor Biol 2011;271, 166–180. pmid:21130777
- View Article
- PubMed/NCBI
- Google Scholar
26. Stehlé J, Voirin N, Barrat A, Cattuto C, Isella L, Pinton JF, et al. High-resolution measurements of face-to-face contact patterns in a primary school. PLoS ONE 2011;6: e23176. pmid:21858018
- View Article
- PubMed/NCBI
- Google Scholar
27. Vanhems P, Barrat A, Cattuto C, Pinton J-F, Khanafer N, Régis C, et al. Estimating potential infection transmission routes in hospital wards using wearable proximity sensors. PLoS ONE 2013;8: e73970. pmid:24040129
- View Article
- PubMed/NCBI
- Google Scholar
28. Rocha LEC, Liljeros F, Holme P. Information dynamics shape the sexual networks of Internet-mediated prostitution. Proc Natl Acad Sci USA 2010;107: 5706–5711. pmid:20231480
- View Article
- PubMed/NCBI
- Google Scholar
29. Ebel H, Mielsch HI, Bornholdt S. Scale-free topology of e-mail networks. Phys Rev E 2002;66: 035103. pmid:12366171
- View Article
- PubMed/NCBI
- Google Scholar
30. Eckmann JP, Moses E, Sergi D. Entropy of dialogues creates coherent structures in e-mail traffic. Proc Natl Acad Sci USA 2004;101: 14333–14337. pmid:15448210
- View Article
- PubMed/NCBI
- Google Scholar
31. Holme P, Edling CR, Liljeros F. Structure and time-evolution of an Internet dating community. Soc Networks 2004;26: 155–174.
- View Article
- Google Scholar
32. Karimi F, Ramenzoni VC, Holme P. Structural differences between open and direct communication in an online community. Physica A 2014;414: 263–273.
- View Article
- Google Scholar
33. Viswanath B, Mislove A, Cha M, Gummadi KP. On the evolution of user interaction in Facebook. Proceedings of the 2nd ACM workshop on Online Social Networks; 2009.
34. Holme P, Liljeros F. Birth and death of links control disease spreading in empirical contact networks. Sci Rep 2014;4: 4999. pmid:24851942
- View Article
- PubMed/NCBI
- Google Scholar
35. Rocha LEC, Liljeros F, Holme P. Simulated epidemics in an empirical spatiotemporal network of 50,185 sexual contacts. PLoS Comp Biol 2011;7: e1001109.
- View Article
- Google Scholar
36. Karsai M, Kivelä M, Pan RK, Kaski K, Kertész J, Barabási AL, et al. Phys Rev E 2011;83: 025102. pmid:21405879
- View Article
- PubMed/NCBI
- Google Scholar
37. Goh KI, Barabási AL. Burstiness and memory in complex systems. EPL 2008;81: 48002.
- View Article
- Google Scholar
38. Vazquez A, Rácz B, Lukács A, Barabási AL. Impact of non-Poissonian activity patterns on spreading processes. Phys Rev Lett 2007;98: 158702. pmid:17501392
- View Article
- PubMed/NCBI
- Google Scholar
39. Min B, Goh KI, Vazquez A. Spreading dynamics following bursty human activity patterns. Phys Rev E 2011;83: 036102. pmid:21517553
- View Article
- PubMed/NCBI
- Google Scholar
40. Masuda N, Klemm K, Eguíluz VM. Temporal networks: slowing down diffusion by long lasting interactions Phys Rev Lett 2013;111: 188701. pmid:24237569
- View Article
- PubMed/NCBI
- Google Scholar
41. Liu SY, Baronchelli A, Perra N. Contagion dynamics in time-varying metapopulation networks. Phys Rev E 2013;87: 032805.
- View Article
- Google Scholar
42. Miritello G, Lara R, Cebrian M, Moro E. Limited communication capacity unveils strategies for human interaction. Sci Rep 2013;3: 1950. pmid:23739519
- View Article
- PubMed/NCBI
- Google Scholar
43. Newman MEJ. Networks: An introduction. Oxford University Press, Oxford UK; 2010.
44. Holme P. Epidemiologically optimal static networks from temporal network data. PLoS Comput Biol 2013;9: e1003142. pmid:23874184
- View Article
- PubMed/NCBI
- Google Scholar
45. Newman MEJ. Mixing patterns in networks. Phys Rev E 2003;67: 026126. pmid:12636767
- View Article
- PubMed/NCBI
- Google Scholar
46. Serrano MÁ, Boguñá M. Clustering in complex networks. II. Percolation properties. Phys Rev E 2006;74: 056115. pmid:17279976
- View Article
- PubMed/NCBI
- Google Scholar
47. Britton T, Deijfen M, Lagerås AN, Lindholm M. Epidemics on random graphs with tunable clustering. J Appl Prob 2008;45: 743–756.
- View Article
- Google Scholar
48. Eguíluz VM, Klemm K. Epidemic threshold in structured scale-free networks. Phys Rev Lett 2002;89: 108701. pmid:12225235
- View Article
- PubMed/NCBI
- Google Scholar
49. Szendroi B, Csanyi G. Polynomial epidemics and clustering in contact networks. Proc R Soc B 2004;271: S364–S366. pmid:15504019
- View Article
- PubMed/NCBI
- Google Scholar
50. Volz EM, Miller JC, Galvani A, Ancel Meyers L. Effects of heterogeneous and clustered contact patterns on infectious disease dynamics. PLoS Comput Biol 2011;7: e1002042. pmid:21673864
- View Article
- PubMed/NCBI
- Google Scholar
51. Newman MEJ. Properties of highly clustered networks. Phys Rev E 2003;68: 026121. pmid:14525063
- View Article
- PubMed/NCBI
- Google Scholar
52. Valdano E, Poletto C, Giovannini A, Palma D, Savini L, Colizza V. Predicting epidemic risk from past temporal contact data; 2014. Preprint. Available arXiv:1406.1449v1. Accessed 15 February 2015.
53. Lloyd AL. Destabilization of epidemic models with the inclusion of realistic distributions of infectious periods. Proc R Soc Lond Ser B 2001;268: 985–993.
- View Article
- Google Scholar
54. Lloyd AL. Realistic distributions of infectious periods in epidemic models: changing patterns of persistence and dynamics. Theor Popul Biol 2001;60: 59–71. pmid:11589638
- View Article
- PubMed/NCBI
- Google Scholar

[ref1] 1. Heesterbeek JAP. A brief history of R0 and a recipe for its calculation. Acta Biotheoretica 2002;50: 189–204. pmid:12211331
View Article
PubMed/NCBI
Google Scholar

[2] View Article

[3] PubMed/NCBI

[4] Google Scholar

[ref2] 2. Diekmann O, Heesterbeek H. Mathematical Epidemiology of Infectious Diseases: Model Building, Analysis and Interpretation. New York: Wiley; 2000. https://doi.org/10.1016/j.mbs.2008.02.005 pmid:18405926

[ref3] 3. Heffernan JM, Smith RJ, Wahl LM. Perspectives on the basic reproductive ratio. J Roy Soc Interface 2005;4: 281–293.
View Article
Google Scholar

[7] View Article

[8] Google Scholar

[ref4] 4. Holland Jones J. Notes on R0. Unpublished, Standford University; 2007. Available http://web.stanford.edu/~jhj1/teachingdocs/Jones-on-R0.pdf Accessed 15 February 2015.

[ref5] 5. van den Driessche P, Watmough J. Further Notes on the basic reproduction number. Mathematical Epidemiology. Lecture Notes in Mathematics 2008;1945: 159–178.
View Article
Google Scholar

[11] View Article

[12] Google Scholar

[ref6] 6. Dietz K. The estimation of the basic reproduction number for infectious diseases. Stat Methods Med Res 1993;2: 23–41. pmid:8261248
View Article
PubMed/NCBI
Google Scholar

[14] View Article

[15] PubMed/NCBI

[16] Google Scholar

[ref7] 7. Heffernan J M, Wahl L M. Improving estimates of the basic reproductive ratio: Using both the mean and the dispersal of transition times. Theor Popul Biol 2006;70: 135–145. pmid:16712889
View Article
PubMed/NCBI
Google Scholar

[18] View Article

[19] PubMed/NCBI

[20] Google Scholar

[ref8] 8. Massad E, Coutinho FAB, Burattini MN, Amaku M. Estimation of R0 from the initial phase of an outbreak of a vector-borne infection. Trop Med Int Health 2010;15: 120–126. pmid:19891761
View Article
PubMed/NCBI
Google Scholar

[22] View Article

[23] PubMed/NCBI

[24] Google Scholar

[ref9] 9. Anderson RM, May RM. Infectious diseases of humans. Oxford UK: Oxford University Press; 1991.

[ref10] 10. Li J, Blakeley D, Smith RJ. The failure of R0. Comput Math Methods Med 2011;2011: 527610. pmid:21860658
View Article
PubMed/NCBI
Google Scholar

[27] View Article

[28] PubMed/NCBI

[29] Google Scholar

[ref11] 11. Cross PC, Johnson PLF, Lloyd-Smith JO, Getz WM. Utility of R0 as a predictor of disease invasion in structured populations. J R Soc Interface 2007;4: 315–324. pmid:17251146
View Article
PubMed/NCBI
Google Scholar

[31] View Article

[32] PubMed/NCBI

[33] Google Scholar

[ref12] 12. van den Driessche P, Watmough J. Reproduction numbers and sub-threshold endemic equilibria for compartmental models of disease transmission. Math Biosci 2002;180: 29–48. pmid:12387915
View Article
PubMed/NCBI
Google Scholar

[35] View Article

[36] PubMed/NCBI

[37] Google Scholar

[ref13] 13. Colizza V, Vespignani A. Epidemic modeling in metapopulation systems with heterogeneous coupling pattern: theory and simulations. J Theor Biol 2008;251: 450–467. pmid:18222487
View Article
PubMed/NCBI
Google Scholar

[39] View Article

[40] PubMed/NCBI

[41] Google Scholar

[ref14] 14. Keeling MJ, Grenbell BT. Individual-based perspectives on R0. J Theor Biol 2000;203: 51–61. pmid:10677276
View Article
PubMed/NCBI
Google Scholar

[43] View Article

[44] PubMed/NCBI

[45] Google Scholar

[ref15] 15. Keeling MJ. The implications of network structure for epidemic dynamics. Theor Popul Biol 2005;67: 1–8. pmid:15649519
View Article
PubMed/NCBI
Google Scholar

[47] View Article

[48] PubMed/NCBI

[49] Google Scholar

[ref16] 16. Volz E, Meyers LA. Epidemic thresholds in dynamic contact networks. J R Soc Interface 2009;6: 233–241. pmid:18664429
View Article
PubMed/NCBI
Google Scholar

[51] View Article

[52] PubMed/NCBI

[53] Google Scholar

[ref17] 17. Bansal S, Read J, Pourbohloul B, Meyers LA. The dynamic nature of contact networks in infectious disease epidemiology. J Biol Dyn 2010;4: 478–489. pmid:22877143
View Article
PubMed/NCBI
Google Scholar

[55] View Article

[56] PubMed/NCBI

[57] Google Scholar

[ref18] 18. Holme P, Saramäki J. Temporal networks. Phys Rep 2012;519: 97–125.
View Article
Google Scholar

[59] View Article

[60] Google Scholar

[ref19] 19. Masuda N, Holme P. Predicting and controlling infectious disease epidemics using temporal networks. F1000Prime Rep 2013;5: 6. pmid:23513178
View Article
PubMed/NCBI
Google Scholar

[62] View Article

[63] PubMed/NCBI

[64] Google Scholar

[ref20] 20. Taylor M, Taylor TJ, Kiss IZ. Epidemic threshold and control in a dynamic network. Phys Rev E 2012;85: 016103. pmid:22400621
View Article
PubMed/NCBI
Google Scholar

[66] View Article

[67] PubMed/NCBI

[68] Google Scholar

[ref21] 21. Rocha LEC, Blondel VD. Bursts of vertex activation and epidemics in evolving networks. PLoS Comput Biol 2013;9: e1002974. pmid:23555211
View Article
PubMed/NCBI
Google Scholar

[70] View Article

[71] PubMed/NCBI

[72] Google Scholar

[ref22] 22. Holme P. Model versions and fast algorithms for network epidemiology. Journal of Logistical Engineering University 2014;30: 1–7.
View Article
Google Scholar

[74] View Article

[75] Google Scholar

[ref23] 23. Eagle N, Pentland A. Reality mining: Sensing complex social systems. Personal and Ubiquitous Computing 2006;10: 255–268.
View Article
Google Scholar

[77] View Article

[78] Google Scholar

[ref24] 24. Pfitzner R, Scholtes I, Garas A, Tessone TJ, Schweitzer F. Betweenness preference: Quantifying correlations in the topological dynamics of temporal networks. Phys Rev Lett 2013;110: 198701. pmid:23705746
View Article
PubMed/NCBI
Google Scholar

[80] View Article

[81] PubMed/NCBI

[82] Google Scholar

[ref25] 25. Isella L, Stehlé J, Barrat A, Cattuto C, Pinton JF, van den Broeck W. What’s in a crowd? Analysis of face-to-face behavioral networks. J Theor Biol 2011;271, 166–180. pmid:21130777
View Article
PubMed/NCBI
Google Scholar

[84] View Article

[85] PubMed/NCBI

[86] Google Scholar

[ref26] 26. Stehlé J, Voirin N, Barrat A, Cattuto C, Isella L, Pinton JF, et al. High-resolution measurements of face-to-face contact patterns in a primary school. PLoS ONE 2011;6: e23176. pmid:21858018
View Article
PubMed/NCBI
Google Scholar

[88] View Article

[89] PubMed/NCBI

[90] Google Scholar

[ref27] 27. Vanhems P, Barrat A, Cattuto C, Pinton J-F, Khanafer N, Régis C, et al. Estimating potential infection transmission routes in hospital wards using wearable proximity sensors. PLoS ONE 2013;8: e73970. pmid:24040129
View Article
PubMed/NCBI
Google Scholar

[92] View Article

[93] PubMed/NCBI

[94] Google Scholar

[ref28] 28. Rocha LEC, Liljeros F, Holme P. Information dynamics shape the sexual networks of Internet-mediated prostitution. Proc Natl Acad Sci USA 2010;107: 5706–5711. pmid:20231480
View Article
PubMed/NCBI
Google Scholar

[96] View Article

[97] PubMed/NCBI

[98] Google Scholar

[ref29] 29. Ebel H, Mielsch HI, Bornholdt S. Scale-free topology of e-mail networks. Phys Rev E 2002;66: 035103. pmid:12366171
View Article
PubMed/NCBI
Google Scholar

[100] View Article

[101] PubMed/NCBI

[102] Google Scholar

[ref30] 30. Eckmann JP, Moses E, Sergi D. Entropy of dialogues creates coherent structures in e-mail traffic. Proc Natl Acad Sci USA 2004;101: 14333–14337. pmid:15448210
View Article
PubMed/NCBI
Google Scholar

[104] View Article

[105] PubMed/NCBI

[106] Google Scholar

[ref31] 31. Holme P, Edling CR, Liljeros F. Structure and time-evolution of an Internet dating community. Soc Networks 2004;26: 155–174.
View Article
Google Scholar

[108] View Article

[109] Google Scholar

[ref32] 32. Karimi F, Ramenzoni VC, Holme P. Structural differences between open and direct communication in an online community. Physica A 2014;414: 263–273.
View Article
Google Scholar

[111] View Article

[112] Google Scholar

[ref33] 33. Viswanath B, Mislove A, Cha M, Gummadi KP. On the evolution of user interaction in Facebook. Proceedings of the 2nd ACM workshop on Online Social Networks; 2009.

[ref34] 34. Holme P, Liljeros F. Birth and death of links control disease spreading in empirical contact networks. Sci Rep 2014;4: 4999. pmid:24851942
View Article
PubMed/NCBI
Google Scholar

[115] View Article

[116] PubMed/NCBI

[117] Google Scholar

[ref35] 35. Rocha LEC, Liljeros F, Holme P. Simulated epidemics in an empirical spatiotemporal network of 50,185 sexual contacts. PLoS Comp Biol 2011;7: e1001109.
View Article
Google Scholar

[119] View Article

[120] Google Scholar

[ref36] 36. Karsai M, Kivelä M, Pan RK, Kaski K, Kertész J, Barabási AL, et al. Phys Rev E 2011;83: 025102. pmid:21405879
View Article
PubMed/NCBI
Google Scholar

[122] View Article

[123] PubMed/NCBI

[124] Google Scholar

[ref37] 37. Goh KI, Barabási AL. Burstiness and memory in complex systems. EPL 2008;81: 48002.
View Article
Google Scholar

[126] View Article

[127] Google Scholar

[ref38] 38. Vazquez A, Rácz B, Lukács A, Barabási AL. Impact of non-Poissonian activity patterns on spreading processes. Phys Rev Lett 2007;98: 158702. pmid:17501392
View Article
PubMed/NCBI
Google Scholar

[129] View Article

[130] PubMed/NCBI

[131] Google Scholar

[ref39] 39. Min B, Goh KI, Vazquez A. Spreading dynamics following bursty human activity patterns. Phys Rev E 2011;83: 036102. pmid:21517553
View Article
PubMed/NCBI
Google Scholar

[133] View Article

[134] PubMed/NCBI

[135] Google Scholar

[ref40] 40. Masuda N, Klemm K, Eguíluz VM. Temporal networks: slowing down diffusion by long lasting interactions Phys Rev Lett 2013;111: 188701. pmid:24237569
View Article
PubMed/NCBI
Google Scholar

[137] View Article

[138] PubMed/NCBI

[139] Google Scholar

[ref41] 41. Liu SY, Baronchelli A, Perra N. Contagion dynamics in time-varying metapopulation networks. Phys Rev E 2013;87: 032805.
View Article
Google Scholar

[141] View Article

[142] Google Scholar

[ref42] 42. Miritello G, Lara R, Cebrian M, Moro E. Limited communication capacity unveils strategies for human interaction. Sci Rep 2013;3: 1950. pmid:23739519
View Article
PubMed/NCBI
Google Scholar

[144] View Article

[145] PubMed/NCBI

[146] Google Scholar

[ref43] 43. Newman MEJ. Networks: An introduction. Oxford University Press, Oxford UK; 2010.

[ref44] 44. Holme P. Epidemiologically optimal static networks from temporal network data. PLoS Comput Biol 2013;9: e1003142. pmid:23874184
View Article
PubMed/NCBI
Google Scholar

[149] View Article

[150] PubMed/NCBI

[151] Google Scholar

[ref45] 45. Newman MEJ. Mixing patterns in networks. Phys Rev E 2003;67: 026126. pmid:12636767
View Article
PubMed/NCBI
Google Scholar

[153] View Article

[154] PubMed/NCBI

[155] Google Scholar

[ref46] 46. Serrano MÁ, Boguñá M. Clustering in complex networks. II. Percolation properties. Phys Rev E 2006;74: 056115. pmid:17279976
View Article
PubMed/NCBI
Google Scholar

[157] View Article

[158] PubMed/NCBI

[159] Google Scholar

[ref47] 47. Britton T, Deijfen M, Lagerås AN, Lindholm M. Epidemics on random graphs with tunable clustering. J Appl Prob 2008;45: 743–756.
View Article
Google Scholar

[161] View Article

[162] Google Scholar

[ref48] 48. Eguíluz VM, Klemm K. Epidemic threshold in structured scale-free networks. Phys Rev Lett 2002;89: 108701. pmid:12225235
View Article
PubMed/NCBI
Google Scholar

[164] View Article

[165] PubMed/NCBI

[166] Google Scholar

[ref49] 49. Szendroi B, Csanyi G. Polynomial epidemics and clustering in contact networks. Proc R Soc B 2004;271: S364–S366. pmid:15504019
View Article
PubMed/NCBI
Google Scholar

[168] View Article

[169] PubMed/NCBI

[170] Google Scholar

[ref50] 50. Volz EM, Miller JC, Galvani A, Ancel Meyers L. Effects of heterogeneous and clustered contact patterns on infectious disease dynamics. PLoS Comput Biol 2011;7: e1002042. pmid:21673864
View Article
PubMed/NCBI
Google Scholar

[172] View Article

[173] PubMed/NCBI

[174] Google Scholar

[ref51] 51. Newman MEJ. Properties of highly clustered networks. Phys Rev E 2003;68: 026121. pmid:14525063
View Article
PubMed/NCBI
Google Scholar

[176] View Article

[177] PubMed/NCBI

[178] Google Scholar

[ref52] 52. Valdano E, Poletto C, Giovannini A, Palma D, Savini L, Colizza V. Predicting epidemic risk from past temporal contact data; 2014. Preprint. Available arXiv:1406.1449v1. Accessed 15 February 2015.

[ref53] 53. Lloyd AL. Destabilization of epidemic models with the inclusion of realistic distributions of infectious periods. Proc R Soc Lond Ser B 2001;268: 985–993.
View Article
Google Scholar

[181] View Article

[182] Google Scholar

[ref54] 54. Lloyd AL. Realistic distributions of infectious periods in epidemic models: changing patterns of persistence and dynamics. Theor Popul Biol 2001;60: 59–71. pmid:11589638
View Article
PubMed/NCBI
Google Scholar

[184] View Article

[185] PubMed/NCBI

[186] Google Scholar

Figures

Abstract

Introduction

Results

Empirical data

Final outbreak size as a function of R0

Characterizing the shape of the Ω vs. R0 point cloud

Temporal and static network descriptors

Time evolution.

Node and link activity.

Degree distribution.

Other static network descriptors.

Structural determinants of the Ω vs. R0 point cloud

Discussion

Methods

SIR simulations

Measuring the λδ-balance

Data availability

Supporting Information

S1 Fig. R0 vs. Ω plots for the Gallery data.

S2 Fig. Scatterplots between the R0-Ω cloud shape descriptors and network structural descriptors.

Acknowledgments

Author Contributions

References

Final outbreak size as a function of R₀

Characterizing the shape of the Ω vs. R₀ point cloud

Structural determinants of the Ω vs. R₀ point cloud

S1 Fig. R₀ vs. Ω plots for the Gallery data.

S2 Fig. Scatterplots between the R₀-Ω cloud shape descriptors and network structural descriptors.