## Figures

## Abstract

Collective behaviour is a widespread phenomenon in biology, cutting through a huge span of scales, from cell colonies up to bird flocks and fish schools. The most prominent trait of collective behaviour is the emergence of global order: individuals synchronize their states, giving the stunning impression that the group behaves as one. In many biological systems, though, it is unclear whether global order is present. A paradigmatic case is that of insect swarms, whose erratic movements seem to suggest that group formation is a mere epiphenomenon of the independent interaction of each individual with an external landmark. In these cases, whether or not the group behaves truly collectively is debated. Here, we experimentally study swarms of midges in the field and measure how much the change of direction of one midge affects that of other individuals. We discover that, despite the lack of collective order, swarms display very strong correlations, totally incompatible with models of non-interacting particles. We find that correlation increases sharply with the swarm's density, indicating that the interaction between midges is based on a metric perception mechanism. By means of numerical simulations we demonstrate that such growing correlation is typical of a system close to an ordering transition. Our findings suggest that correlation, rather than order, is the true hallmark of collective behaviour in biological systems.

## Author Summary

Our perception of collective behaviour in biological systems is closely associated to the emergence of order on a group scale. For example, birds within a flock align their directions of motion, giving the stunning impression that the group is just one organism. Large swarms of midges, mosquitoes and flies, however, look quite chaotic and do not exhibit any group ordering. It is therefore unclear whether these systems are true instances of collective behaviour. Here we perform the three dimensional tracking of large swarms of midges in the field and find that swarms display strong collective behaviour despite the absence of collective order. In fact, we discover that the capability of swarms to collectively respond to perturbations is surprisingly large, comparable to that of highly ordered groups of vertebrates.

**Citation: **Attanasi A, Cavagna A, Del Castello L, Giardina I, Melillo S, Parisi L, et al. (2014) Collective Behaviour without Collective Order in Wild Swarms of Midges. PLoS Comput Biol 10(7):
e1003697.
doi:10.1371/journal.pcbi.1003697

**Editor: **Tamás Vicsek, Eotvos Lorand University, Hungary

**Received: **February 24, 2014; **Accepted: **May 12, 2014; **Published: ** July 24, 2014

**Copyright: ** © 2014 Attanasi et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

**Funding: **This work was supported by grants IIT—Seed Artswarm, ERC—StG n.257126 and US-AFOSR - FA95501010250 (through the University of Maryland). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

**Competing interests: ** The authors have declared that no competing interests exist.

## Introduction

Intuition tells us that a system displays collective behaviour when all individuals spontaneously do the same thing, whatever this thing may be. We surely detect collective behaviour when all birds in a flock fly in the same direction and turn at the same time [1], as well as when all spins in a magnet align, giving rise to a macroscopic magnetization [2], [3]. On the other hand, we would not say that there is any collective behaviour going on in a gas, despite the large number of molecules. The concept of collective behaviour seems therefore closely linked to that of emergent collective order, or synchronization. Indeed, explaining how order spontaneously arises from local inter-individual interactions has been one of the major issues in the field [4]–[6].

The case of insect swarms is tricky in this respect. Several species in the vast taxonomic order Diptera (flies, mosquitoes, midges) form big swarms consisting largely of males, whose purpose is to attract females [7], [8]. Swarming therefore has a key reproductive function and, in some cases, relevant health implications, the obvious, but not unique, example being that of the malaria mosquito, *Anopheles gambiae* [9]–[11]. It is well-known that swarms form in proximity of some visual marker, like a water puddle, or a street lamp [7]. Swarming insects seem to fly independently around the marker, without paying much attention to each other (see Video S1). For this reason, the question of whether swarms behave as truly collective systems is debated [4], [12]. In fact, it has even been suggested that in Diptera there is no interaction between individuals within the swarm and therefore no collective behaviour at all [13], [14]. Although other studies observed local coordination between nearest neighbours [15], [16], it remains controversial whether and to what extent collective patterns emerge over the scale of the whole group. Clarifying this issue is a central goal in swarms containment [17], [18]. In absence of quantitative evidence telling the contrary, the hypothesis that external factors, as the marker, are the sole cause of swarming and that no genuine collective behaviour is present, is by far the simplest explanation.

We must, however, be careful in identifying collective behaviour with collective order. There are systems displaying important collective effects both in their ordered *and* in their disordered phase. An example is that of a ferromagnet near the critical temperature i.e. the temperature below which a spontaneous magnetization emerges: the collective response of the system to an external perturbation is as strong in the disordered phase slightly above as it is in the ordered phase slightly below In fact, once below the critical temperature, increasing the amount of order *lowers* the collective response [2], [3]. Similarly, in animal behaviour it is possible to conceive cases in which individuals coordinate their behavioural reactions to environmental stimuli, rather than the behaviours themselves; conversely we may expect that a group too heavily ordered, i.e. with a very large behavioural polarization, may respond poorly to perturbations, because of an excessive behavioural inertia. Hence, although certainly one of its most visible manifestations, emergent order is not necessarily the best probe of collective behaviour.

The crucial task for living groups is not simply to achieve an ordered state, but to respond collectively to the environmental stimuli. For this to happen, correlation must be large, namely individuals must be able to influence each other's behavioural changes on a group scale. The question then arises of whether correlation in biological systems is a consequence of collective order or whether it can be sustained even in absence of order. The question is relevant because the way individuals in a group synchronize their behavioural fluctuations (correlation) is possibly a more general mechanism than the synchronization of behaviour itself (order). All experimental studies performed up to now, however, concerned highly synchronized groups (as bird flocks, fish shoals and marching locusts [19]–[21]), which displayed both order and correlation. Hence, the question of whether or not order and correlations are two sides of the same phenomenon remained open until now. Here, we attempt to give an answer to this question by experimentally studying large swarms of insects in the field. As we will show, despite the lack of collective order, we do find strong correlations, indicating that in biological systems collective behaviour and group-level coordination do not require order to be sustained.

## Results

### Experiments and tracking

We perform an experimental study of swarms of wild midges in the field. Midges are small non-biting flies belonging to the order Diptera, suborder Nematocera (Diptera:Chironomidae and Diptera:Ceratopogonidae - see Methods). The body length of the species we study is in the range – Swarms are found at sunset, in the urban parks of Rome, typically near stagnant water. As noted before [7], we find that swarms form above natural or artificial landmarks. Moving the landmark leads to an overall displacement of the swarm. The swarms we studied range in size between and individuals (see Table S1 in Text S1).

To reconstruct the 3d trajectories of individual insects we use three synchronized cameras shooting at frames-per-seconds (trifocal technique – Fig. 1e and Methods). Our apparatus does not perturb the swarms in any way. The technique is similar to the one we used for starling flocks [22], with one notable difference. To reach the desired experimental accuracy we need to know the mutual geometric relations between the three cameras very accurately. In the case of flocks, this could be achieved only by an *a priori* alignment of the cameras. In the case of swarms, though, we proceed differently. After each swarm acquisition, we pin down the geometry of the camera system by taking multiple images of a calibrated target (Fig. 1f). This procedure is so accurate that the error in the 3d reconstruction is dominated by the image segmentation error due to the pixel resolution. If we assume this to be equal to pixel (typically it is smaller than that because midges occupy many pixels), we make an error of in the determination of the distance between two points apart from each other (a reference value for nearest neighbour distance). The absolute error is the same for more distant points, making the relative precision of our apparatus even higher. This accuracy makes the determination of the correlation functions we study here very reliable.

**a**: A natural swarm of midges (*Cladotanytarsus atridorsum*, Diptera:Chironomidae), in Villa Ada, Rome. The digital image of each midges is, on average, a pixels light object against a dark background. **b**: The trajectories reconstructed for the same swarm as in **a**. Individual trajectories are visualized for a short time (roughly frames sec), to avoid visual overcrowding (see also Video S1 and S2). **c**: A microscope image of an adult male of *Cladotanytarsus atridorsum*. **d**: A detailed view of the hypopygium, used for species identification (see Methods); the same midge as in **c**. **e**: Scheme of the experimental set-up. Three synchronized cameras recording at frames per second are used. Two cameras m apart are used as the stereoscopic pair for the three dimensional reconstruction. The third one is used to reduce tracking ambiguities and resolve optical occlusions. Three dimensional trajectories are reconstructed in the reference frame of the right stereoscopic camera. **f**: The mutual geometric positions and orientations of the cameras are retrieved by taking several pictures of a known target. The accuracy we achieve in the determination of the mutual camera orientation is of the order of radians.

The -tracking of each midge is performed by using the recursive global optimization method described in [23]. This recursive algorithm dramatically reduces the complexity of the tracking problem, effectively overcoming the limit of other tracking methods [24], [25], and allowing the reconstruction of large swarms, up to midges, for long time, up to frames. Sample reconstructions are shown in Fig. 1b and in Video S2. Compared to previous field [11], [15], [26] and lab [27]–[29] studies, data collected and analysed in the present work have the advantage to span among swarms of different sizes and densities.

### Lack of collective order

Swarms are in a disordered phase. The standard order parameter normally used in collective behaviour is the polarization, where is the number of midges in the swarm and is the velocity of insect The polarization measures the degree of alignment of the directions of motion; it is a positive quantity and its maximum value is The average polarization over all swarms is quite small, (see Fig. 2 and Table S1 in Text S1). As a reference, in starling flocks we find on average [19]. The probability distributions of the polarization fully confirms the swarms' lack of translational order and the stark difference with flocks (Fig. 2). Clearly, swarms are not in a polarized state. Translation is not the only possible collective mode, though. For example, it is well-known that fish schools can produce rotating (milling) configurations. Moreover, a group can expand/contract, giving rise to dilatational (or pulsive) collective modes. For this reason we have defined and measured also a rotational and a dilatational order parameter (see Methods). We find, however, that these quantities too have very small values (Fig. 2). The time series, on the other hand, show that the order parameters can have rare, but strong fluctuations, during which their value may become significantly larger than that of an uncorrelated system (Fig. 2). These large fluctuations are a first hint that non-trivial correlations are present.

Order parameters in a typical natural swarm. In all panels the grey band around zero is the expected amplitude of the fluctuations in a completely uncorrelated system. In the left panels we report the time series of the order parameters, in the right panels their probability distributions. **Top**: The alignment order parameter, known as polarization, In red we report the reference value of the polarization in a flock of starlings. **Middle**: Rotational order parameter, **Bottom**: Dilatational order parameter,

### Correlation

The connected correlation function measures to what extent the change in behaviour of individual is correlated to that of individual at distance Correlation is the most accessible sign of the presence of interaction between the members of a group. The absence of interaction implies the absence of correlation. Conversely, the presence of correlation implies the presence of an effective interaction (see Text S1, Section I). Correlation can be measured for different quantities, but in the case of midges, as with birds and other moving animals, the principal quantity of interest is the direction of motion. To compute the connected correlation we first need to introduce the velocity fluctuations, namely the individual velocity subtracted of the overall motion of the group, (see Methods for the detailed definition of and ). This fluctuation is a dimensional quantity, hence it is unsuitable to compare the correlation in natural vs numerical systems, as we shall do later on. We therefore introduce the dimensionless velocity fluctuation,(1)

The connected correlation function is then given by,(2)where if and zero otherwise, and is the space binning factor. The form of in natural swarms is reported in Fig. 3: at short distances there is strong positive correlation, indicating that midges tend to align their velocity fluctuations to that of their neighbours. After some negative correlation at intermediate distances, relaxes to no correlation for large distances. This qualitative form is quite typical of all species analysed (see Fig. 3). The smallest value of the distance where crosses zero is the correlation length, that is an estimate of the length scale over which the velocity fluctuations are correlated [19]. The average value of this correlation length over all analysed swarms is, This value is about times larger than the nearest neighbours distance, whose average over all swarms is (see Fig. 3 and Table S1 in Text S1). Previous works noticed the existence of pairing manoeuvres and flight-path coordination between nearest neighbours insects [4], [15], [16]. Our results, however, indicate that midges within a natural swarm influence each other's motion far beyond their nearest neighbours.

Black lines and symbols refer to natural swarms, red lines to simulations of ‘swarms’ of non-interacting particles (NHS). Each column refers to a different midge species. **Top**: Correlation function as a function of the distance at one instant of time. The dashed vertical line marks the average nearest neighbour distance, for that swarm. The correlation length, is the first zero of the correlation function. Red: correlation function in the NHS case. The value of for the NHS has been rescaled to appear on the same scale as natural distances. Each natural swarm is compared to a NHS with the same number of particles. **Middle**: Cumulative correlation, This function reaches a maximum The value of the integrated correlation at its maximum, is the susceptibility **Bottom**: Numerical values of the susceptibility in all analysed swarms. For each swarm the value of is a time average over the whole acquisition; error bars are standard deviations. Red: the average susceptibility in the non-interacting case.

### Susceptibility

The collective response of the swarm depends crucially on two factors: how distant in space the behavioural change of one insect affects that of another insect (spatial span of the correlation) and how strong this effect is (intensity of the correlation). To combine these two factors in one single observable we calculate the cumulative correlation up to scale ,(3)where is the Heaviside function,

It can be shown (see Text S1, Section II) that this dimensionless function is related to the space integral of the correlation function Hence, reaches a maximum where vanishes, i.e. for (see Fig. 3). This maximum, is a measure of the total amount of correlation present in the system. In statistical physics is exactly equal to the *susceptibility*, namely the response of the system to an external perturbation [30], [31]. In collective animal behaviour, we do not have a quantitative link between integrated correlation and response, so that calling susceptibility is not strictly correct. However, if the probability distribution of the velocities is stationary, we can follow a maximum entropy approach [32] and still find that the total amount of correlation in the system, is related to the way the group responds collectively to a perturbation (see Text S1, Section II). The value of for midge swarms is reported in Fig. 3.

### Non-interacting Swarm

In order to judge how significant is the correlation function and how large is the susceptibility in natural swarms, we need an effective zero for these quantities, i.e. some null hypothesis baseline. As we have seen in the Introduction, the minimal assumption is that all individuals in the swarm interact with an external landmark independently from each other. Following Okubo [4] (but see also [27] and [16]), we therefore simulate a ‘swarm’ of non-interacting particles performing a random walk in a three-dimensional harmonic potential (see Methods). Visually, the group behaviour of this Non-interacting Harmonic Swarm (NHS) looks remarkably similar to that of a real swarm (see Video S2 and S3): all ‘midges’ fly around the marker and the group lacks collective order.

This similarity, however, is deceptive. In the NHS, the correlation function simply fluctuates around zero, with no spatial span, nor structure (Fig. 3). Moreover, the susceptibility in the NHS is extremely small, whereas the susceptibility in natural swarms is up to times larger than this non-interacting benchmark (Fig. 3). We conclude that swarming behaviour is *not* the mere epiphenomenon of the independent response of each insect with the marker. Despite the lack of collective order, natural swarms are strongly correlated on large length scales. There are big clusters of midges that move coherently, contributing to the ‘dancing’ visual effect of the swarm. The only way this can happen is that midges interact. What kind of interaction is that?

### Metric interaction

To understand the nature of the interaction, we study the susceptibility across swarms of different densities. Interestingly, we find that increases when the average nearest neighbour distance, decreases (Fig. 4). Denser swarms are more correlated than sparser ones. This result indicates that midges interact through a metric perceptive apparatus: the strength of the perception decreases with the distance, so that when midges are further apart from each other (larger ) the interaction is weaker and the susceptibility is lower. This is at variance with what happens in starling flocks: starlings interact with a fixed number of neighbours, irrespective of their nearest neighbour distance [33]; such kind of topological interaction does not depend on the group density, hence the susceptibility does not depend on the nearest neighbour distance. Fig. 4, on the other hand, shows that midges interact metrically, namely with all neighbours within a fixed metric range, Hence, in swarms the number of interacting neighbours increases with decreasing (increasing density), and as a consequence of this increased amount interaction, the system becomes also more correlated.

: Susceptibility as a function of the rescaled nearest neighbour distance, where is the body length. Each point represents a single time frame of a swarming event, and all events are reported on the same plot (symbols are the same for all species). The solid line is the best fit to equation (4). **Right**: Logarithm of the average susceptibility as a function of *Dasyhelea flavifrons* - blue squares; *Corynoneura scutellata* - green circles; *Cladotanytarsus atridorsum* - red triangles. The solid line represents the best fit to equation (4). Each data point represents the time average over the entire acquisition of one swarming event. Error bars indicate standard deviations.

In a system ruled by metric interaction we expect all lengths to be measured in units of the perception range, This implies that the natural variable for the susceptibility is the rescaled nearest neighbour distance, The problem is that we are considering different species of midges, likely to have different metric perception ranges. The simplest hypothesis we can make is that is proportional to the insect body length (which we can measure), so that This hypothesis is confirmed by the data: the susceptibility is significantly more correlated to the variable (P-value ) than to (P-value - see Methods for the definition of P-value). The fact that the natural variable is is a further indication that the interaction in swarms is based on a metric mechanism.

The difference in the nature of the interaction between flocking birds and swarming midges (topological vs. metric) is possibly due to the significant differences between vertebrates and arthropods. Topological interaction, namely tracking a fixed number of neighbours irrespective of their distance, requires a level of cognitive elaboration of the information [33] more sophisticated than a metric interaction, where the decay of the effective force is merely ruled by the physical attenuation of the signal with increasing metric distance. In other words, within a metric mechanism the range of the interaction is fixed by a perceptive cut-off, rather than a cognitive one. Metric interaction is known to be more fragile than topological one against external perturbations [33], and indeed it is far more likely to observe the dispersion of a swarm in the field than that of a flock. This may be the reason why the presence of an external marker is crucial for the swarming behaviour of midges [13].

### Correlation without order

The experimental observations of a non-trivial connected correlation and of a large susceptibility indicate that midges are effectively interacting with each other by acting on their directions of motion. This does not exclude, of course, that other types of interaction are present. First of all, the empirical observation that the swarm uses a visual marker as a reference for maintaining its mean spatial position, strongly suggests that each individual interacts with the marker. Besides, it is certainly possible that effective positional attraction-repulsion forces between midges, as those described in [34], exist. However, the directional correlations indicate that insects are *also* effectively interacting by adjusting their velocities. Moreover, the fact that these correlations are positive for short distances means that midges tend to *align* their direction of motion. This fact may seem surprising, because alignment interactions typically lead to the formation of ordered (polarized) groups, which is clearly not the case for midges. Swarms are disordered, and yet interacting and highly correlated systems. Is this a paradox?

In fact, it is not. An alignment interaction does not *per se* lead to global order in the group. In all models where imitation of the neighbours is present, the onset of long-range order depends on the value of some key tuning parameter. In a ferromagnet, this parameter is the temperature namely the amount of noise affecting the interaction between the neighbouring spins. At high temperature the system is in a disordered state, whereas by lowering one reaches a critical temperature below which an ordering transition occurs. In models of active matter there is another parameter tuning the transition between disorder and order, that is density or, equivalently, nearest neighbour distance: the system gets ordered once the nearest neighbour distance falls below some transition value. The crucial point is that, in general, the correlation of the system tends to be very large *around* the transition point, irrespective of whether the system is in the ordered or in the disordered phase. Hence, even a disordered system can display large correlations, provided that it is not too far from an ordering transition. In what follows, we want to show that this is indeed what happens with midge swarms.

### Vicsek model

The simplest model based on alignment interaction that predicts an order-disorder transition on changing the density is the Vicsek model of collective motion [35]. In this model each individual tends to align its direction of motion to that of the neighbours within a *metric* perception range, The rescaled nearest neighbour distance, is the control parameter: for low noise, the model predicts a transition from a disordered phase (low polarization) at high values of (low density), to an ordered phase (large polarization) at low values of (high density) [35]–[37]. We numerically study the Vicsek model in three dimensions. As we have seen, real swarms hold their average position with respect to a marker; to reproduce this behavioural trait we introduce an harmonic attraction force that each individual experiences towards the origin (see Methods). Also in central potential the model displays an ordering transition: at large density, for the system is ordered and it has large polarization (Video S4). On the other hand, the polarization is low in the disordered phase, (Fig. 5). However, the correlation function is non-trivial when is sufficiently close to (Fig. 5), indicating the existence of large clusters of correlated individuals, which can be clearly detected in Video S5. We calculate the susceptibility in the same manner as we did for natural swarms, in the disordered phase, and find a clear increase of on lowering (Fig. 5).

Three-dimensional Vicsek model in a central potential. **Left**: Correlation function in the disordered phase, but close to the ordering transition. The dashed line is the nearest neighbour distance. Inset: polarization as a function of time. For this value of the system is disordered. **Right**: Logarithm of the susceptibility as a function of the rescaled nearest neighbour distance, where is the metric interaction range. The solid line represents the best fit to equation (4). Error bars are smaller than symbols' size.

This increase of the susceptibility is coherent with the existence of an ordering transition at It has been shown that, unless is much larger than the values analysed here, the transition in the Vicsek model is characterized by a clear second order phenomenology (the nature of the transition for is still debated - see [37]–[39]). As a consequence, the susceptibility is expected to become very large approaching and to follow the usual scaling relation of critical phenomena [38],(4)

A fit to equation (4) of the 3d-Vicsek data is reported in Fig. 5, giving and a transition point, The reason for the growth of approaching in the Vicsek model is quite intuitive. The model is metric, so that at large namely when the nearest neighbour distance is much larger than the interaction range very few individuals interact with each other, and coordination is small. The smaller becomes, the larger the number of particles within the mutual interaction range, thus promoting the correlation of larger and larger clusters of particles. For this reason the correlation length and the susceptibility grow when the nearest neighbour distance decreases. When approaches its critical value, the coordinated clusters become as large as the whole system, so that the groups orders below

The low order parameter, the non-trivial correlation function, and especially the increase of on decreasing the nearest neighbour distance, are phenomenological traits that the metric Vicsek model shares with natural swarms. We conclude that a system based solely on alignment can be in its disordered phase and yet display large correlations, as midge swarms do. It is interesting to note that by approaching the ordering transition a compound amplification of the correlation occurs: when the nearest neighbour distance, decreases, the spatial span of the correlation, increases, so that the effective perception range in units of nearest neighbour distance, is boosted. We emphasize that we are not quantitatively fitting Vicsek model to our data. Our only aim is to demonstrate a general concept: large correlation and lack of global order can coexist even in the simplest model of nearest neighbours alignment, provided that the system is sufficiently close to an ordering transition.

### Estimating the interaction range

The consistency between our experimental data and the Vicsek model suggests that an underlying ordering transition could be present in swarms as well. An ordering transition as a function of the density has been indeed observed in laboratory experiments on locusts [21], fish [40] and in observations of oceanic fish shoals [41]. In these cases, both sides (low and high density) of the ordering transition were explored. However, midge swarms in the field are always disordered, living in the low-density/high- side of the transition. Locating a transition point having data on just one side of it, is a risky business. The reason why we want to do this here is because it will allow us to give a rough estimate of the metric range of interaction in midges, which can be compared with other experiments.

If a Vicsek-like ordering transition exist, we can use equation (4) to fit the swarms data for (Fig. 4). As we already mentioned, we do not know the value of the metric perception range, in swarms. Therefore, we use as scaling variable where is the body length. Although the fit works reasonably well (Fig. 4), the scatter in the data is quite large; hence, given the non-linear nature of the fit, it would be unwise to pin down just one value for the parameters, and we rather report confidence intervals. The fit gives a transition point in the range, with an exponent in the range, (larger exponents correspond to lower transition points).

Interestingly, there is an alternative way to locate the ordering transition that does not rely on the fit of Let us establish a link between pairs of insects closer than the perception range and calculate the size of the biggest connected cluster in the network. Given a swarm with nearest neighbour distance the larger the larger this cluster. When exceeds the percolation threshold, a giant cluster of the same order as the group size appears [42]. We calculate the percolation threshold in swarms (Fig. 6 and Methods) and find The crucial point is that varying the perception range at fixed nearest neighbour distance is equivalent to varying at fixed Hence, at fixed there is an equivalent percolation threshold of the nearest neighbour distance, such that for a giant cluster appears. Clearly, It is reasonable to hypothesise that the critical nearest neighbour distance is close to the maximal distance compatible with a connected network, given A sparser network would cause the swarm to lose bulk connectivity. Therefore, given a certain perception range the ordering transition occurs at values of the nearest neighbour distance close to its percolation threshold,

Percolation threshold as a function of the nearest-neighbour distance in natural swarms. The linear fit (black line) gives, Inset: Fraction of midges belonging to the largest cluster as a function of the clustering threshold In correspondence of the percolation threshold there is the formation of a giant cluster. We define as the point where (red dashed line). Because of the sharp nature of the percolation transition, the value of does not depend greatly on the threshold used.

At this point we have two independent (and possibly equally unreliable) estimates of the transition point in natural swarms of midges: the first one in units of body-lengths, the second one in units of interaction range, Putting the two together we finally obtain an estimate of the metric interaction range in units of body-lengths, The body length of the species under consideration is in the range, This implies a perception range of a few centimetres, depending on the species. This crude estimate of the midge interaction range is compatible with the hypothesis that midges interact acoustically. In [43] the male-to-male auditory response in *Chironomus annularius* (Diptera:Chironomidae) was studied and it was found that the range of the response was about not too far from our estimate. Similar measurements in mosquitoes (Diptera:Culicidae) show that the auditory perception range is about [44], which is again compatible with our determination of the interaction range in midge swarms.

## Discussion

We have shown that natural swarms of midges lack collective order and yet display strong correlations. Such correlations extends spatially much beyond the inter-individual distance, indicating the presence of significant cluster of coordinated individuals. This phenomenology is incompatible with a system of non-interacting particles whose swarming behaviour is solely due to the attraction to an external landmark. We conclude that genuine collective behaviour is present in swarms. We stress that the existence of correlation, and therefore of inter-individual interaction, is not in contradiction with the fact that a swarm almost invariably forms in proximity of a marker. The effect of the marker (external force) is merely to keep the swarm at a stationary position with respect to the environment. However, as we have shown in the case of the non-interacting swarm, this stationarity (which superficially would seems the only visible trait of swarming), cannot by itself produce the observed strong correlations. By using Vicsek model as a simple conceptual framework, we have shown that this coexistence of disorder and correlation is a general feature of systems with alignment interaction close to their ordering transition.

We should be careful in interpreting our data as proof that explicit alignment is the main interaction at work in swarms. What we can say is that non-trivial alignment correlation implies *effective* alignment interaction. However, how this effective alignment interaction is achieved in terms of sensorimotor processes is hard to tell. In fact, as we have already remarked, it is possible that models purely based on repulsion/attraction positional forces, lead to correlations similar to the ones we reported here. Hence, as always when dealing with animal behaviour, it is important to keep in mind the intrinsically *effective* nature of any interaction. The Vicsek model provides the simplest and most compelling description of collective behaviour when effective alignment is present and this fact is not hindered by the real, non-effective nature of the interaction giving rise to the observed correlations.

Our results suggest that correlation, rather than order, is the most significant experimental signature of collective behaviour. Correlation is a measure of how much and how far the behavioural change of one individual affects that of other individuals not directly interacting with it. Our data show that in swarms correlations are so strong that the effective perception range of each midge is much larger than the actual interaction range. If the change of behaviour is due to some environmental perturbations, such large correlation guarantees that the stimulus is perceived at a collective level.

A notion of collective behaviour based on correlation is more general and unifying than one based on order. For example, bird flocks and insect swarms look like completely different systems as long as we stick to collective order. However, once we switch to correlation, we understand that this big difference may be deceptive: both flocks and swarms are strongly correlated systems, in which the effective perception range, or correlation length, is far larger than the interaction range [19]. In this perspective, the striking difference in emergent order between the two systems, namely the fact that flocks move around the sky, whereas swarms do not, may be related to different ecological factors, rather than to any fundamental qualitative difference in the way these systems interact. Strong correlations similar to those found in bird flocks and midge swarms have also been experimentally measured in neural assemblies [45]. This huge diversity - birds, insects, neurons - is bewildering but fascinating, and it suggests that correlation may be a universal condition for collective behaviour, bridging the gap between vastly different biological systems.

## Methods

### Experiments

Data were collected in the field (urban parks of Rome), between May and October, in and in We acquired video sequences using a multi-camera system of three synchronized cameras (IDT-M5) shooting at fps. Two cameras (the stereometric pair) were at a distance between and depending on the swarm and on the environmental constraints. A third camera, placed at a distance of from the first camera was used to solve tracking ambiguities. We used Schneider Xenoplan lenses. Typical exposure parameters: aperture , exposure time Recorded events have a time duration between and seconds. No artificial light was used. To reconstruct the 3d positions and velocities of individual midges we used the techniques developed in [23]. Wind speed was recorded. After each acquisition we captured several midges in the recorded swarm for lab analysis. A summary of all swarms data can be found in Table S1 in Text S1.

### Midge identification

We recorded swarms of midges belonging to the family Diptera:Ceratopogonidae (*Dasyhelea flavifrons*) and Diptera:Chironomidae (*Corynoneura scutellata* and *Cladotanytarsus atridorsum*). Midges belonging to the family Chironomidae were identified to species according to [46], the ones belonging to the family Ceratopogonidae were identified according to [47] and [48]. Specimens used for identification were captured with a hand net and fixed in alcohol, cleared and prepared according to [49]. Permanent slides were mounted in Canada Balsam and dissected according to [50]. Species identification was based on morphology of the adult male, considering different characters, as wing venation, antennal ratio (length of apical flagellomere divided by the combined length of the more basal flagellomeres) and genitalia, which in Diptera are named hypopygium (a modified ninth abdominal segment together with the copulatory apparatus - see Fig. 1).

### Velocity fluctuations

Let be the set of coordinates at time and at the next time step. To simplify the notation we set The velocity vector of insect is defined as, To compute the connected correlation function we need to subtract the contribution of all collective modes from the individual velocity. We identify three collective modes: translation, rotation and dilatation (expansion/contraction).

*Translation*: Let be the position of the centre of mass, and the position of the -th object in the centre of mass reference frame. By subtracting the centre of mass velocity, from the individual velocity, we obtain the translation-subtracted fluctuation,(5)

*Rotation*: The optimal rotation about the origin is defined [51] as the orthogonal matrix which minimizes the quantity By subtracting the overall translation and rotation, the velocity fluctuation is,(6)

*Dilatation*: The optimal dilatation is defined [51] as the scalar that minimizes the quantity After subtracting the optimal translation, rotation and dilatation, the velocity fluctuation is finally given by,(7)where with we have indicated the contribution to the velocity of of all three collective modes.

### Rotation and dilatation order parameters

The rotational order parameter is defined as,(8)where is the projection of on the plane orthogonal to the axis of rotation, the operator indicates the cross product, and is a unit vector in the direction of the axis of rotation. In (8), is the angular momentum of midge with respect to the axis In a perfectly coherent rotation, all individuals would have angular momenta parallel to the axis, so that In a non-coherent system, some of the projections of the angular momentum on would be positive and some negative, so Note that is the axis of rotation defined in the previous section, computed using Kabsch algorithm [51].

The dilatational order parameter is defined as,(9)

and it measures the degree of coherent expansion (positive ) and contraction (negative ) of the swarm. In a perfectly coherent expansion/contraction would be parallel to and so the scalar product in equation (9) will be for an expansion and for a contraction.

### Normalization of the correlation function

In the study of flocks [19], we normalized by its limiting value for which is equivalent dividing it by the value in the first bin. In that way the normalized correlation function tends to for so that its value is amplified. In the study of flocks we were only looking at the correlation length, which is not altered by such a normalization. However, here we will be interested in both the range and the intensity of the correlation, so we must not amplify artificially the correlation signal. Normalising the fluctuations as in (1) is equivalent normalising the correlation function by its value at *exactly* i.e. for which is different from its limit for

### Non-interacting Harmonic Swarm

The NHS is an elementary model of non-interacting particles performing a random walk in a three-dimensional harmonic potential. The dynamics of each particle is defined by the Langevin equation,(10)where is the position of the -th particle at time is the mass, the friction coefficient, the harmonic constant and is a random vector with zero mean and unit variance, with Clearly, in this model there is no interaction between particles. The parameter tunes the strength of the noise. The equation of motion is integrated with the Euler method [52]. We simulated the NHS in the critically damped regime (), which gives the best similarity to natural swarms. The number of particles is set equal to that of the natural swarm we want to compare it with. Parameters have been tuned to have a ratio between the distance travelled by a particle in one time step (frame) and the nearest neighbour distance comparable to natural swarms,

### Definition of P-value

Let us define a data set as a collection of pairs of variables, with (for example, the susceptibility as a function of the rescaled nearest neighbour distance - Fig. 4). The null hypothesis is that are independent variables. Let us call the Spearman's rank correlation coefficient for a set of data and the probability distribution of in the case of pairs of *independent* variables. Given the empirical data, we calculate the Spearman's rank correlation coefficient and get a certain value, The P-value is defined as the probability that the statistical test we are using (Spearman) gives a result at least as extreme as the one actually observed, *provided that the null hypothesis is true*. Hence, the P-value is given by,(11)

Basically, the P-value is telling us how likely it is that the degree of correlation that we observe is just the result of chance. In absence of an *a priori* model of the noise, we estimate by a permutation test [53], [54]: using the original paired data, we randomly redefine the pairs to create a new data set where the are a permutation of the set we calculate the Spearman's rank correlation coefficient of this new randomized data set; we iterate this permutation times; we compute the fraction of permutations that give This fraction is equal to the P-value of the data set under consideration [53].

### Vicsek model

We performed numerical simulations of the Vicsek model in 3d [35]–[38], [55]. The direction of particle at time is the average direction of all particles within a sphere of radius around (including itself). The parameter is the metric radius of interaction. The resulting direction of motion is then perturbed with a random rotation (noise). Natural swarms are known to form close to a marker and to keep a stationary position with respect to it [13]. To mimic this behaviour we modified the Vicsek model by adding an external harmonic force equal for all particles. This potential also grants cohesion, without the need to introduce an inter-individual attraction force [4], [16], [27].

The update equation for velocities is therefore given by,(12)where is the spherical neighbourhood of radius centred around is the normalization operator, and performs a random rotation uniformly distributed around the argument vector with maximum amplitude of The term is the harmonic force directed towards the origin. For we recover the standard Vicsek model. The update equation for the positions is, Thanks to the central force we can use open boundary conditions. All particles have fixed velocity modulus Each simulation has a duration of time steps, with initial conditions consisting in uniformly distributed positions in a sphere and uniformly distributed directions in the solid angle. After a transient of time steps, we saved 500 configurations at intervals of 1000 time steps in order to have configurations with velocity fluctuations uncorrelated in time. The control parameter of interest is where is the nearest neighbour distance, which is tuned by The model displays a transition to an ordered phase when We studied the susceptibility for different values of To observe the power-law behaviour of predicted by the model we performed standard finite-size scaling [38]: at each fixed value of the system' size we calculated and worked out the maximum of the susceptibility and its position we finally plotted vs. parametrically in to obtain the function in Fig. 5. The noise, affects the position of the transition point [35]–[37], but this is irrelevant for us, because we do not use any quantitative result from the model to infer the biological parameters of real swarms. The data reported in Fig. 5 have

### Percolation threshold

For each frame we run a clustering algorithm with scale [56]: two points are connected when their distance is lower than For each value of we compute the ratio between the number of objects in the largest cluster and the total number of objects in the swarm (Fig. 6). The percolation threshold, is defined as the point where a giant cluster, i.e. a cluster with size of the same order as the entire system, forms [42]. We define as the point where The percolation threshold scales with the nearest neighbour distance, (Fig. 6). Strictly speaking, the percolation argument only holds at equilibrium, because in a system where particles are self-propelled there may be order even at low density [36]. However, at low values of the noise, we still expect the percolation argument to give a reasonable, albeit crude, estimate of the perception range.

## Supporting Information

### Datafile S1.

Datafile S1 refers to a swarm of *Chironomidae, Cladotanytarsus atridorsum*. Data from this file were used to produce the panels in the first two rows of the left column of Fig. 3.

doi:10.1371/journal.pcbi.1003697.s001

(DAT)

### Datafile S2.

Datafile S2 refers to a swarm of *Chironomidae, Corynoneura scutellata*. Data from this file were used to produce the panels in the first two rows of the central column of Fig. 3.

doi:10.1371/journal.pcbi.1003697.s002

(DAT)

### Datafile S3.

Datafile S3 refers to a swarm of *Ceratopogonidae, Dasyhelea flavifrons*. Data from this file were used to produce the panels in the first two rows of the right column of Fig. 3.

doi:10.1371/journal.pcbi.1003697.s003

(DAT)

### Text S1.

In the text file, helpful details about the importance and the interpretation of the connected correlation function and of the susceptibility are reported together with a table summarizing the main properties of the analysed swarming events. The three supporting data files report the positions and the velocities used to compute the correlation functions for a single instant of time. The three files refer to the three different analysed species of midges. In each file, for each pair of midges in the swarm, the mutual distance and the scalar product between their dimensionless velocity fluctuations (see Eq.1) are reported: the distances in meters in the first column, the dimensionless scalar products in the second column.

doi:10.1371/journal.pcbi.1003697.s004

(PDF)

### Video S1.

Wild swarm of roughly midges in the field (Diptera:Ceratopogonidae). The swarm has been video recorded at 170 frames per second, with a resolution of 4Mpix, using an IDT-M5 camera.

doi:10.1371/journal.pcbi.1003697.s005

(AVI)

### Video S2.

Three dimensional visualization of the same natural swarm as in Video S1. This 3d reconstruction has been obtained by means of our dynamical tracking algorithm based on the trifocal experimental technique.

doi:10.1371/journal.pcbi.1003697.s006

(AVI)

### Video S3.

Three dimensional visualization of a numerically simulated swarm of non-interacting particles in a harmonic potential (NHS, Non-interacting Harmonic Swarm). The number of ‘midges’ in the NHS is the same as in Video S2.

doi:10.1371/journal.pcbi.1003697.s007

(AVI)

### Video S4.

Three dimensional visualization of a numerically simulated swarm obtained using Vicsek model with the addition of an harmonic attraction force towards the origin. The video refers to a swarm in the ordered phase, with the polarization equal to The number of “midges” in the swarm is the harmonic constant is equal to while the simulation noise is equal to

doi:10.1371/journal.pcbi.1003697.s008

(MP4)

### Video S5.

Three dimensional visualization of a numerically simulated swarm obtained using Vicsek model with the addition of an harmonic attraction force towards the origin. The video refers to a swarm in the disordered phase, with the polarization equal to The number of “midges” in the swarm is the harmonic constant is equal to while the simulation noise is equal to

doi:10.1371/journal.pcbi.1003697.s009

(MP4)

## Acknowledgments

We thank William Bialek, Enzo Branchini, Massimo Cencini, Francesco Ginelli and Dov Levine for discussions. We also acknowledge the help of Tomas S. Grigera in running numerical tests on the single sample susceptibility in the Ising model.

## Author Contributions

Conceived and designed the experiments: AC IG. Performed the experiments: LDC SM OP ESh. Analyzed the data: AA LDC IG SM LP MV. Wrote the paper: AC. Set up and calibrated the 3d system: AA AC LDC IG SM LP ESh MV. Performed the species recognition and monitored all entomological aspects of the study: BR. Performed the tracking and produced the 3d data: AC LDC SM LP ESh ESi MV. Performed the numerical simulations: ES.

## References

- 1.
Krause J, Ruxton GD (2002) Living in Groups. Oxford: Oxford University Press.
- 2.
Cardy J (1996) Scaling and Renormalization in Statistical Physics. Cambridge Lecture Notes in Physics. Cambridge: Cambridge University Press.
- 3.
Parisi G (1998) Statistical Field Theory. Westview Press.
- 4. Okubo A (1986) Dynamical aspects of animal grouping: Swarms, schools, flocks, and herds. Adv Biophys 22: 1–94.
- 5.
Camazine S, Deneubourg JL, Franks NR, Sneyd J, Theraulaz G, et al. (2003) Self-Organization in Biological Systems. Princeton University Press, Princeton.
- 6.
Sumpter DJT (2010) Collective Animal Behavior. Princeton: Princeton University Press.
- 7. Downes JA (1969) The swarming and mating flight of Diptera. Ann Rev Entomol 14: 271–298.
- 8. Sullivan RT (1981) Insect swarming and mating. Florida Entomol 64: 44–65.
- 9. Nielsen ET, Haeger JS, Bell RT (1960) Swarming and mating in mosquitoes. Entomol Soc Am 1: 72–95.
- 10. Charlwood JD, Jones MDR (1980) Mating in the mosquito Anopheles gambiae sl. Physiol Entomol 5: 315–320.
- 11. Manoukis NC, Diabate A, Abdoulaye A, Diallo M, Dao A, et al. (2009) Structure and dynamics of male swarms of Anopheles gambiae. J Med Entomol 46: 227–235.
- 12. Okubo A, Grunbaum D, Edelstein-Keshet L (2001) The dynamics of animal grouping. Interdiscip Appl Math 14: 197–237.
- 13. Downes JA (1955) Observations on the swarming flight and mating of Culicoides (Diptera: Ceratopogonidae) Trans R Entomol Soc London. 106: 213–236.
- 14. Blackwell A, Mordue (Luntz) AJ, Young MR, Mordue W (1992) The swarming behaviour of the Scottish biting midge, Culicoides impunctatus (Diptera: Ceratopogonidae). Ecol Entomol 17: 319–325.
- 15. Okubo A, Chiang HC (1974) An analysis of the kinematics of swarming of Anarete Prithcardi Kim (Diptera: Cecidomyiidae). Res Popul Ecol 16: 1–42.
- 16.
Butail S, Manoukis NC, Diallo M, Ribeiro JM, Paley DA (2013) The Dance of Male Anopheles
*gambiae*in Wild Mating Swarms. J Med Entomol 50: 552–559. - 17. Ali A (1980) Nuisance chironomids and their control: a review. Bulletin of the ESA 26: 3–16.
- 18. Rose RI (2001) Pesticides and public health: integrated methods of mosquito management. Emerg Infect Dis 7: 17–23.
- 19. Cavagna A, Cimarelli A, Giardina I, Parisi G, Santagati R, et al. (2010) Scale-free correlations in starling flocks. Proc Natl Acad Sci U S A 107: 11865–11870.
- 20. Tunstrøm K, Katz Y, Ioannou CC, Huepe C, Lutz MJ, et al. (2013) Collective states, multistability and transitional behavior in schooling fish. PLOS Comput Biol 9: 1–11.
- 21. Buhl J, Sumpter DJT, Couzin ID, Hale JJ, Despland E, et al. (2006) From disorder to order in marching locusts. Science 312: 1402–1406.
- 22. Cavagna A, Giardina I, Orlandi A, Parisi G, Procaccini A, et al. (2008) The starflag handbook on collective animal behaviour: Part I. empirical methods. Anim Behav 76: 217–236.
- 23.
Attanasi A, Cavagna A, Del Castello L, Giardina I, Jelic A, et al. (2013) Tracking in three dimensions via recursive multi-path branching. arXiv:1305.1495.
- 24.
Wu Z, Kunz TH, Betke M (2011) Efficient track linking methods for track graphs using network-flow and set-cover techniques. In proceeding of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR): 1185–1192
- 25.
Wu Z, Hristov NI, Kunz TH, Betke M (2009) Tracking-reconstruction or reconstruction-tracking? Comparison of two multiple hypothesis tracking approaches to interpret 3D object motion from several camera views. IEEE Workshop on Motion and Video Computing (WMVC): 1–8.
- 26. Shinn EA, Long GE (1986) Technique for 3-D analysis of Cheumatopsyche pettiti (Trichoptera: Hydropsychidae) swarms. Environ Entomol 15: 355–359.
- 27. Kelley DH, Ouellette NT (2013) Emergent dynamics of laboratory insect swarms. Scientific Reports 3: 1073.
- 28. Wu HS, Zhao Q, Zou D, Chen YQ (2011) Automated 3D trajectory measuring of large numbers of moving particles. Opt Express 19: 7646–7663.
- 29.
Zou D, Zhao Q, Wu HS, Chen YQ (2009) Reconstructing 3D motion trajectories of particle swarms by global correspondence selection. International Conference on Computer Vision (ICCV) IEEE: 1578–1585.
- 30.
Huang K (1990) Statistical mechanics. John Wiley & Sons, New York.
- 31.
Binney JJ, Dowrick NJ, Fisher AJ, Newman M (1992) The theory of critical phenomena: an introduction to the renormalization group. Oxford University Press, Oxford.
- 32. Bialek W, Cavagna A, Giardina I, Mora T, Silvestri E, Viale M, Walczak AM (2012) Statistical mechanics for natural flocks of birds. Proc Natl Acad Sci U S A 109: 4786–4791.
- 33. Ballerini M, Cabibbo N, Candelier R, Cavagna A, Cisbani E, et al. (2008) Interaction ruling animal collective behavior depends on topological rather than metric distance: Evidence from a field study. Proc Natl Acad Sci U S A 105: 1232–1237.
- 34. Couzin ID, Krause J, James R, Ruxton GD, Franks NR (2002) Collective memory and spatial sorting in animal groups. J Theor Biol 218: 1–11.
- 35. Vicsek T, Czirók A, Ben-Jacob E, Cohen I, Shochet O (1995) Novel type of phase transition in a system of self-driven particles. Phys Rev Lett 75: 1226–1229.
- 36. Gönci B, Nagy M, Vicsek T (2008) Phase transition in the scalar noise model of collective motion in three dimensions. Europ Phys J Special Topics 157: 53–59.
- 37. Chaté H, Ginelli F, Grégoire G, Raynaud F (2008) Collective motion of self-propelled particles interacting without cohesion. Phys Rev E 77: 046113.
- 38. Baglietto G, Albano EV (2008) Finite-size scaling analysis and dynamic study of the critical behavior of a model for the collective displacement of self-driven individuals. Phys Rev E 78: 021125.
- 39. Baglietto G, Albano EV, Candia J (2012) Criticality and the onset of ordering in the standard Vicsek model. Interface Focus 2: 708–714.
- 40. Becco C, Vandewalle N, Delcourt J, Poncin P (2006) Experimental evidences of a structural and dynamical transition in fish school. Physica A 367: 487–493.
- 41. Makris NC, Ratilal P, Jagannathan S, Gong Z, Andrews M, et al. (2009) Critical population density triggers rapid formation of vast oceanic fish shoals. Science 323: 1734–1737.
- 42.
Kertesz J, Stauffer D, Coniglio A (1983) Percolation structures and processes. Am Ist Phys, New York.
- 43. Fyodorova MV, Azovsky AI (2003) Interactions between swarming Chironomus annularius (Diptera: Chironomidae) males: role of acoustic behavior. J Insect Behav 16: 295–306.
- 44.
Pennetier C, Warren B, Dabiré KR, Russell IJ, Gibson G (2010) Singing on the wing as a mechanism for species recognition in the malarial mosquito
*Anopheles gambiae*. Current Biology 20: 131–136. - 45. Schneidman E, Berry MJ, Segev R, Bialek W (2006) Weak pairwise correlations imply strongly correlated network states in a neural population. Nature 440: 1007–1012.
- 46.
Langton PH, Pinder LCV (2007) Keys to the adult male Chironomidae of Britain and Ireland Vol I, Vol II, S P 64, The Freshwater Biological Association, Ambleside.
- 47.
Kieffer JJ (1925) Faune de France, Vol. 11: Diptères Nématocères Piqueurs: Chironomidae, Ceratopogoninae. Lechevalier, Paris.
- 48.
Dominiak P (2012) Biting midges of the genus
*Dasyhelea*Kieffer (Diptera: Ceratopogonidae). Poland Polish Journal of Entomology 81: 211–304. - 49. Wirth WW, Marston N (1968) A method for mounting small insects on microscope slides in Canada balsam. Annals of the Entomological Society of America 61: 783–784.
- 50.
Wiederholm T (1989) Chironomidae of the Holarctic Region. Keys and Diagnoses. Part III. Adult males. Entomologica Scadinavica Suppl 34: 1–532.
- 51. Kabsch W (1976) A solution for the best rotation to relate two sets of vectors. Acta Crystallographica Section A 32: 922–923.
- 52.
Butcher JC (2008) Numerical methods for ordinary differential equations. John Wiley & Sons, New York.
- 53. Ludbrook J, Hugh D (1998) Why permutation tests are superior to t and F tests in biomedical research. The American Statistician 52: 127–132.
- 54. Berry KJ, Johnston JE, Mielke PW (2011) Permutation methods. WIREs Comp Stat 3: 527–542.
- 55. Czirok A, Vicsek M, Vicsek T (1999) Collective motion of organisms in three dimensions. Physica A 264: 299–304.
- 56. Lu SY, Fu KS (1978) A sentence-to-sentence clustering procedure for pattern analysis. IEEE Trans Syst Man Cybern 8: 381–389.