Social Network Sensors for Early Detection of Contagious Outbreaks

doi:10.1371/journal.pone.0012948

Figure 1.

Network Illustrating Structural Parameters.

This real network of 105 students shows variation in structural attributes and topological position. Each circle represents a person and each line represents a friendship tie. Nodes A and B have different “degree,” a measure that indicates the number of ties. Nodes with higher degree also tend to exhibit higher “centrality” (node A with six friends is more central than B and C who both only have four friends). If contagions infect people at random at the beginning of an epidemic, central individuals are likely to be infected sooner because they lie a shorter number of steps (on average) from all other individuals in the network. Finally, although nodes B and C have the same degree, they differ in “transitivity” (the probability that any two of one's friends are friends with each other). Node B exhibits high transitivity with many friends that know one another. In contrast, node C's friends are not connected to one another and therefore they offer more independent possibilities for becoming infected earlier in the epidemic.

More »

Expand

Figure 2.

Theoretical expectations of differences in contagion between central individuals and the population as a whole.

A contagious process passes through two phases, one in which the number of infected individuals exponentially increases as the contagion spreads, and one in which incidence exponentially decreases as susceptible individuals become increasingly scarce. These dynamics can be modeled by a logistic function. Central individuals lie on more paths in a network compared to the average person in a population and are therefore more likely to be infected early by a contagion that randomly infects some individuals and then spreads from person to person within the network. This shifts the S-shaped logistic cumulative incidence function forward in time for central individuals compared to the population as a whole (left panel). It also shifts the peak infection rate forward (right panel).

More »

Expand

Figure 3.

Empirical differences in flu contagion between “friend” group and randomly chosen individuals.

We compared two groups, one composed of individuals randomly selected from our population, and one composed of individuals who were nominated as a friend by members of the random group. The friend group was observed to have significantly higher measured in-degree and betweenness centrality than the random group (see Supporting Information Text S1). In the left panel, a nonparametric maximum likelihood estimate (NPMLE) of cumulative flu incidence (based on diagnoses by medical staff) shows that individuals in the friend group tended to get the flu earlier than individuals in the random group. Moreover, predicted daily incidence from a nonlinear least squares fit of the data to a logistic distribution function suggests that the peak incidence of flu is shifted forward in time for the friends group by 13.9 days (right panel). A significant (p<0.05) lead time for the friend group was first detected with data available up to Day 16. Raw data for daily flu cases in the friend group (blue) and random group (red) is shown in the inset box (right panel).

More »

Expand

Figure 4.

Progression of flu contagion in the friendship network over time.

Each frame shows the largest component of the network (714 people) for a specific date, with each line representing a friendship nomination and each node representing a person. Infected individuals are colored red, friends of infected individuals are colored yellow, and node size is proportional to the number of friends infected. All available information regarding infections is used here. Frames for all 122 days of the study are available in a movie of the epidemic posted in the Supporting Information (Video S1).

More »

Expand

Figure 5.

Estimated days of advance detection of a flu outbreak when following specific groups.

Here, degree, transitivity, centrality, and coreness are computed based on the mapping of the network. The high in-degree group is composed of individuals who have a higher-than-average number of other people in the network who name them as a friend. The low transitivity group is composed of individuals with below-average probability that any two of their friends are friends with one another. The high centrality group is composed of individuals with a higher-than-average betweenness, which is the number of shortest paths connecting all individuals in a network that pass through a given person. The high coreness group is composed of individuals with a higher-than-average coreness, which is the number of friends a person has once all individuals with fewer friends have been eliminated from the network. Analyses were conducted separately for data based on flu diagnoses by medical staff (blue bars) and data based on self-reported flu symptoms (green bars). Estimates and 95% confidence intervals are based on a nonlinear least squares fit of the flu data to a logistic distribution function (see Supporting Information Text S1). The results show that flu outbreaks occur up to two weeks earlier in each of these groups.

More »

Expand