## Figures

## Abstract

How organisms move and disperse is crucial to understand how population dynamics relates to the spatial heterogeneity of the environment. Random walk (RW) models are typical tools to describe movement patterns. Whether Lévy or alternative RW better describes forager movements is keenly debated. We get around this issue using the Generalized Pareto Distribution (GPD). GPD includes as specific cases Normal, exponential and power law distributions, which underlie Brownian, Poisson-like and Lévy walks respectively. Whereas previous studies typically confronted a limited set of candidate models, GPD lets the most likely RW model emerge from the data. We illustrate the wide applicability of the method using GPS-tracked seabird foraging movements and fishing vessel movements tracked by Vessel Monitoring System (VMS), both collected in the Peruvian pelagic ecosystem. The two parameters from the fitted GPD, a scale and a shape parameter, provide a synoptic characterization of the observed movement in terms of characteristic scale and diffusive property. They reveal and quantify the variability, among species and individuals, of the spatial strategies selected by predators foraging on a common prey field. The GPD parameters constitute relevant metrics for (1) providing a synthetic and pattern–oriented description of movement, (2) using top predators as ecosystem indicators and (3) studying the variability of spatial behaviour among species or among individuals with different personalities.

**Citation: **Bertrand S, Joo R, Fablet R (2015) Generalized Pareto for Pattern-Oriented Random Walk Modelling of Organisms’ Movements. PLoS ONE 10(7):
e0132231.
https://doi.org/10.1371/journal.pone.0132231

**Editor: **Franck Courchamp, Ecologie, Systématique & Evolution, FRANCE

**Received: **November 21, 2014; **Accepted: **June 12, 2015; **Published: ** July 14, 2015

**Copyright: ** © 2015 Bertrand et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited

**Data Availability: **Seabird data has been included in Supporting Information files, and that the data on Peruvian fishing vessels was obtained from the third party IMARPE. Data from IMARPE can be requested from Carla Aguilar, Scientific Director of IMARPE (imarpe@imarpe.gob.pe Instituto del Mar del Perú IMARPE Esquina Gamarra y General Valle S/N Chucuito Callao Telf: (051)208-8650).

**Funding: **The authors received no specific funding for this work.

**Competing interests: ** The authors have declared that no competing interests exist.

## Introduction

The characterization of foraging movements by random walk (RW) models (e.g. [1]) has received increased attention since GPS tracking made high-resolution records of animal and human displacements easily available (e.g. [2–4]). In such a probabilistic framework, movement is seen as a succession of elementary behavioural events called ‘moves’ [1] whose lengths are defined by random draws from a probability density function. While the dominant RW model in statistical physics has been the Brownian walk (characterized by a Gaussian behaviour of the tail of the move lengths distribution, and producing a normally diffusive behaviour), early studies in ecology suggested that a variety of foragers, from microzooplankton to humans (e.g. [5–10]), perform movements well described by Lévy walks. Lévy walks are characterized by the occurrence of rare and very large moves in the trajectory, producing heavy-tailed (power-law) move-length distributions, and revealing super diffusive spatial behaviour (e.g. [11, 12]). Regarding foraging behaviours, this super diffusive behaviour may emerge from the patchiness of the prey fields [13]. There has been a great deal of controversy regarding whether random walks should be phenomenological or mechanistic, and whether methods typically used to test the Lévy walk hypothesis are appropriate. [14–25].

One of the controversies originated in the modelling paradigm [2]: Should RWs provide a phenomenological, synthetic and pattern-oriented description of movement? This would justify the search for one global and parsimonious RW model for each study case. Or rather, should RWs provide a biomechanical, process-oriented description of movement? This would justify less parsimonious composite models possibly involving different RWs according to the activity in which the organisms are engaged at each move (e.g. state-space model approaches, [26] for seabirds, [27] for fishing vessels). Indeed, the two paradigms allow addressing different critical issues: while the process-oriented paradigm focuses on understanding the determinants of movement (e.g. bioenergetics or environmental constraints), the pattern–oriented paradigm may provide synthetic metrics of movement, highly valuable for studying the interactions between predators and their prey field (e.g. [28]), for studying some components of animal personalities (e.g. [29–31]) or for using top predator spatial behaviours as ecosystem indicators (e.g. [13, 32]).

A second controversy is linked to the methods that have been used to test the Lévy walk hypothesis from the power-tailed behaviour of the move length distribution. After graphical methods were proven inaccurate, the maximum likelihood criterion was recommended to perform an unbiased estimation of the parameter of the power-law distribution, and the comparison to alternative models according to the Akaike Information Criterion (AIC [33]) was encouraged [17,34]. Revisiting early works with this methodology and accepting/rejecting the AIC's best candidate model with a G-test for goodness-of-fit (GOF) test [35], Edwards *et al*. [16,24] and Edwards [23] stated that most of the original conclusions of Lévy walks in animal and human foraging strategies had to be overturned. A few cases could be described by the exponential distribution, a specific case of light-tailed distributions producing non-super diffusive spatial behaviours. Such exponentially distributed move lengths would correspond to purely random search strategies, i.e. Poisson distributed times for changes in search directions. In most cases, none of the tested models fitted the data, suggesting that the proposed candidate models were not appropriate.

Here, we argue that testing exponential vs. power-tailed distributions seems fairly restrictive to determine if super-diffusive or normally diffusive behaviours are dominant among foraging strategies. We propose Generalized Pareto distributions to model the movement of animal and human foragers as a random walk. This two-parameter class of distributions comprise as specific cases the Gaussian, exponential and power-law distributions, within a wider continuum of normally and super-diffusive RWs that allows getting around the endless debate opposing Lévy to other types of random walk models. Moreover, we consider that the G-test is not appropriate for evaluating the goodness of fit of such models because it requires binning continuous data, which is highly sensitive to the rules chosen for data quantization. We propose the use of the Cramér-von Mises test, exploiting directly the cumulative distribution of the original data, thus not requiring any binning. Finally, we address the problem of the determination of the start of the tail of the distribution, which is a critical issue for evaluating whether or not a Lévy walk hypothesis holds. Indeed, from a theoretical point of view, the super-diffusive behaviour of a Lévy walk [11, 12] is fully characterized by the asymptotic power-tailed behaviour of the distribution of the forager move lengths. Then, the tail of the distribution alone, and not its first percentiles, defines this super-diffusive characteristic. While explicitly addressed in the field of extreme event statistics [36], the determination of the start of the tail is an issue that has been ignored in the evaluation of the Lévy walk hypothesis in movement ecology. Power-tailed behaviour has typically been tested for a tail starting at the minimum value of the dataset or using histogram-based criteria that are very sensitive to the binning. To address this issue, we provide a pragmatic definition of the tail of the move length distribution.

Applying this methodology to the movements of two species of GPS-tracked seabirds and fishermen, we (1) illustrate the wide applicability of the proposed method on a variety of tracking data and (2) reveal inter-individual and interspecific variability in the spatial strategies selected by predators foraging on a common prey field.

## Method and Case Study Data

We consider the Generalized Pareto Distribution (GPD) to model the movements of living organisms, as described by the tails of their move-length distributions. A key feature of GPDs is to embed as special cases the Gaussian, exponential and power distributions, which underlie Brownian, Poisson-like and Lévy walks. The GPD is a two-parameter family of distributions [37, 38] with the following cumulative distribution function [39]:
(Eq 1)
and the probability density function:
(Eq 2)
where the *σ* > 0 and −∞ < *k* < ∞ are the scale and shape parameters; and the domain of x is [θ, ∞ [when *k* > 0 or [θ, *σ>*] when *k* ≤ 0; θ is the start of the tail of the distribution. The scale parameter, *σ*, characterizes the spatial range or characteristic scale of the movement patterns. The shape parameter *k* describes the asymptotic behaviour of the distribution; in other words, the relative contribution of long moves vs. short moves, i.e. the thickness of the tail that determines the diffusive property of the movement.

The thickness of the tail of a distribution is generally defined relative to that of a Normal distribution. A distribution that has a tendency to generate a higher proportion of extreme values than the Normal distribution is referred to as a heavy-tailed distribution. A distribution that has a tendency to generate a lower proportion of extreme values than the Normal distribution is referred to as a light-tailed distribution. The GPD comprises finite-tailed (*k*<0, e.g. Uniform and Beta distributions), light-tailed (*k* in [0; 0.5], e.g. exponential, gamma, or Gumbel distributions), heavy-tailed (*k* in] 0.5; ∞[, e.g. Lévy, Pareto, log-normal, Burr, Cauchy and log-gamma distributions) and ballistic (infinite *k* values) distributions. As illustrated in Fig 1, the exponential (*k* = 0) and Lévy walk (*k* > 0.5) hypotheses lie within this continuum of distributions. Parameter *k* is associated with the power-law (Lévy) exponent *μ* as *μ = 1+1/k*

We show typical trajectories emerging from random realizations of those different move distributions, including a Poisson-exponential motion (*k* = 0), a Brownian-Gaussian motion (*k* = 0.5) and a Lévy-power walk (*k*>0.5) with *k* = 1. For each case, we also show in the lower right inset the log-log plot of the corresponding move length probability density function.

Regarding the interpretation of forager movements in terms of diffusion, the reference point is the Brownian walk (*k* = 0.5), which distinguishes linear diffusive behaviour (*k*<0.5) from the super-diffusive behaviour exhibited by Lévy walks (*k*>0.5). The estimation of model parameters *k* and *σ* from move length observations is carried out using a maximum likelihood estimation. The log-likelihood of a sample of move lengths *l = (l*_{1},*l*_{2},*…l*_{n}*)* is given by [39]:
(Eq 3)

The analytical maximization of the log-likelihood for GPD is not possible; hence numerical optimizations are carried out. In the present work, the maximization of the log-likelihood functions is performed with the Matlab toolbox « Statistics », that proceeds using a gradient descent initialized with the method of moments.

From 200 simulations, carried out for values of *k* from 0.1 to 1.25, we analyse the estimation bias on model parameters that may emerge, as real foragers' datasets (1) cannot exhibit infinite move length values and (2) are samples of finite size. The effect of the finite move length values was explored with a sample of size 1000 and by setting maximum values of the move lengths between the 90^{th} and the 99.9^{th} percentile for the GPD (Fig 2, left panel). These simulations clearly point out a bias in the parameter estimation for the unbounded GPD when the maximal value of the sample is below the 99.9^{th} percentile. In contrast, the parameter estimation for the bounded GPD does not exhibit such a bias. As a consequence, the bounded version of the GPD (BGPD) should be preferred for examining real observation data. Then the effect of the samples of finite size was examined with a maximal value of the observed move length set below the 99.9^{th} percentile of the actual GPD. A finite sample size greater than 200 observations, although relatively small, leads to a significant reduction in the bias (Fig 2, right panel). Because the bias was linear with respect to the actual *k* value, a bias correction relationship was estimated. Simulations for other maximal values of the observed move length range led to similar conclusions. Linear bias corrections were then estimated as a function of the size of the sample set and of the percentile of the maximum move length value. These corrections were then applied when fitting bounded GPDs to seabirds and fishing vessel movement patterns.

In both cases, we report the zero-bias case (solid black line). The mean bias was estimated over 200 simulations.

The probability density function of the bounded version of the GPD (BGPD) is given by:
(Eq 4)
and the log-likelihood functions given a sample of move lengths *l = (l*_{1},*l*_{2},*…l*_{n}*)* is
(Eq 5)

Similar to the GPD, the maximization of the BGPD log-likelihood is performed numerically using a gradient descent scheme.

We use the Cramér-von Mises test [40], hereafter CVM, as a GOF-test to examine whether or not the observed data are consistent with the fitted model. The CVM statistic is defined as follows: (Eq 6)

Where *n* is the size of the sample and *U*_{1} *= F(X*_{1}*)*, …, *U*_{n} *= F(X*_{n}*)*. *X* is the random variable representing the move length and *F(X*_{i}*)* is the value of the cumulative distribution function corresponding to the *i*^{-th} rank statistic in the sample. The same statistic *ω*^{2} can be calculated for a number of random draws of size *n* from the fitted model. The null hypothesis of the GOF test (observations are consistent with the fitted model) is rejected with significance level *α* whenever , where is the upper *α*-quantile of the distribution of *ω*^{2} over the random draws from the fitted model. This test does not require any prior binning of the continuous move length data and appears more appropriate than the G-test used in [23], especially when small samples are considered [41]. In addition, the CVM test relies on rank statistics that compare the cumulative density functions over their entire domain, and not only their maximum difference, as the Kolmogorov-Smirnov test does. This results in a greater robustness to test for the relevance of the candidate model given some observed samples, particularly when dealing with heavy-tailed distributions.

As mentioned above, the definition of the start of the tail is a critical issue, generally overlooked in previous studies. Here, we formally define the start of the tail as the minimum data value for which the GOF test is significant. By coupling model estimation and the GOF measure in this way, we guarantee a consistent and adaptive definition of the tail, ensuring the statistical consistency of the fitted model.

As case studies, we consider two datasets collected in the Peruvian coastal upwelling ecosystem on anchovy (*Engraulis ringens*) main predators: seabirds and the purse seine fishery. In such pelagic ecosystems, water masses and fish schools are constantly moving [42] so that precise prey localization is unpredictable regardless of the predator, human or animal [8]. Foraging movements for all predators aim at the same goal: dealing with uncertainty in prey localization and maximizing encounters. Yet, they may adopt distinct forms due to differences in the cognitive, locomotory and motivational states of each predator. Synthetic metrics like the scale and shape parameters from the BGPD may be very useful for quantifying and comparing alternative spatial strategies.

The first dataset consisted of the movements deployed during four foraging trips by two seabird species, boobies (*Sula variegata*) and guanay cormorants (*Phalacrocorax bougainvilli*) from Isla Pescadores, a colony located off Lima, Peru (11°47’12” South, 77°14’25” West), where the two species breed in sympatry (authorizations for the fieldwork were delivered by SERNANP, Peru; no animals were handled directly for this study as data were collected from previous works). Their positions were recorded each second with miniaturized GPS [see 43 for protocol details; raw data available in Supporting Information S1, S2, S3 and S4 Dataset]. The original positions were resampled into moves (see Supporting Information S1 Protocol), where moves are defined as elementary behavioural events [1]. We fitted a BGPD, and alternative classical models for comparison, to the distribution of the move lengths observed for each foraging trip, each of them gathering more than 500 moves.

The second dataset consisted of the movements deployed by three industrial anchovy purse seiners, as monitored by the Peruvian Vessel Monitoring System (VMS [13, 8]; between 4 and 18° South and 72 and 82° West). VMS provides approximately one position per hour for each vessel. The original positions of fishing trips are re-sampled into moves ([1, 13, 8]; see the electronic supplementary material]. Since fishing trips in this fishery last on average 24h and VMS sampling is about 1 per hour, each trip is described by ~ 24 positions, producing less than 20 moves. Since model fits based on samples with less than 200 moves should not be considered, we fit a BGPD (and alternative models for comparison) to the yearly distributions of the move lengths of each vessel, considering that a given fisherman should exhibit some consistency in its spatial behaviour over time. These vessel data were already analysed in previous RW works [8, 23] and were selected for potential comparisons of the results.

## Results

### 3.1 Seabirds

The four seabird foraging trips studied are presented in Fig 3. They lasted between ~ 1 and 4 hours with covered distances between 11 and 140 km, and maximum distances from the colony (ranges) between 2 and 43 km. Table 1 compares for each trip, through AIC, the fit of the exponential, power, Generalized Pareto and Bounded Generalized Pareto distributions to the move length data. The estimated shape parameters (expressed as ) show that the four tracks are super diffusive and well described by Lévy walks ( or ). With the proposed adaptive tail definition, BGPD was the model that allowed gathering the largest number of moves in the tail (50%-70% of the moves, other model results are not shown here).

Trips were recorded with miniaturized GPS, providing one position every second. The orange and dark red tracks are from boobies (*Sula variegata*) and the dark and light green tracks are from cormorants (*Phalacrocorax bougainvillii*).

For each trip, four alternative models are considered: Exponential (*f*(*l*)∼*l*^{−μ}), Power (*f*(*l*)∼*e*^{−λl}), Generalized Pareto (GPD), Bounded Generalized Pareto (BGPD), see Edwards (2011) for the likelihood functions of the first two models. Model parameter estimates , and are given except for the exponential model and the likelihood of the models are compared with the Akaike information criterion (AIC). The last column provides the total number of moves obtained after resampling the original track positions (N) and the % of moves retained in the tail, using the adaptive criteria defining the start of the tail as the minimum data value for which the GOF test of estimated model parameters is significant (CMV p-value >0.05).

The four track examples (Fig 4) illustrate that a diffusion coefficient alone does not allow comparing the area that two foragers may visit during a trip: diffusion coefficients are directly comparable only if the scale parameters are similar. For instance, booby *a* and cormorant *d* have comparable number of moves (*n* equal to 1003 and 1035, respectively) and the former has a lower diffusion coefficient than the latter (*k* equal to 0.67 and 1.54, respectively). However, booby *a* travelled farther distances than cormorant *d* (ranges of 43km and 2 km, respectively). Regarding individuals within the same species, boobies *a* and *b* show great differences in covered distance and range. Since the diffusion parameter k takes similar values for both boobies, it is the scale parameter *σ* (5.5.10^{−2} and 1.10^{−2} for boobies *a* and *b*, respectively) that conditions the characteristic length of the trajectory and the scope of exploration of the seabirds. Cormorants *c* and *d* show smaller covered distances and range values, with the former one taking larger values than the latter. In this case, scale parameters are comparable (7.6.10^{−4} and 7.3.10^{−4}, respectively), and it is the difference in the diffusion parameter (1.85 and 1.54, respectively) that conditions the scope of exploration (ranges of 11 and 2 km, respectively).

Original tracks, log-log plots of the fits for the different candidate models and synthetic statistics characterizing the trips: Total distance covered during the foraging trip (km), maximum range from the colony (km), duration of the trip (h), global sinuosity of the trip (defined as distance/2*range), number of moves after resampling (cf. SI S3), scale parameter (*σ*) and shape parameter (*k*, describing diffusion) from the fitted BGPD.

### 3.2 Fishing vessels

The fishing trips from the three anchovy purse seiners are presented in Fig 5. Bertrand et al. [8], based on graphical estimation methods, concluded that the movements of the three vessels were well described by Lévy walks, vessel *a* being the most diffusive () and vessel *c* the least diffusive (). Edwards [23] on the same dataset concluded that although bounded Lévy was the most plausible model for vessels *a* and *b*, and exponential for vessel *c*, no model actually passed the G-GOF test. The conclusion was that these datasets did not conform to any candidate models. Table 2 compares for each vessel the fitting of the exponential, power, Generalized Pareto and Bounded Generalized Pareto distributions to the move length distributions. In the three cases, BGPD presented the minimum AIC, regardless of the rule chosen for defining the start of the tail of the move length distribution (i.e. either based on histograms as in previous works or using the proposed adaptive definition of the tail). Consistently with [8], vessels *a* () and *b* () are well described by a super diffusive Lévy walk behaviour ( or ); in contrast, vessel *c* () approaches a normally diffusive RW (), close to a Poisson-like (exponential tail) movement. The three vessels also differed in the characteristic length of the moves they deployed: vessel *a* exhibited the longest ( = 25.29), and vessel *c* the shortest ( = 3.31) moves. Since both the scale and shape parameters changed in the same direction, vessels *a* and *c* exhibited the widest and smallest scope of exploration respectively.

Black dots represent the global set of positions for fishing trips deployed during one year. A sample of 25 fishing trips is represented with colored lines. The inset gives the log-log plot of the total move length distribution together with the fits of the four candidate models (Generalized Pareto in dark blue, Bounded Generalized Pareto in red, exponential in pink and power in light blue).

For each vessel, four alternative models are considered: Exponential (*f*(*l*)∼*l*^{−μ}), Power (*f*(*l*)∼*e*^{−λl}), Generalized Pareto (GPD), Bounded Generalized Pareto (BGPD), see Edwards (2011) for the likelihood functions of the first two models. Model parameter estimates , and are given except for the exponential model and the likelihood of the models are compared with the Akaike information criterion (AIC). The last column provides the total number of moves obtained after the resampling of the original track positions (N) and the % of moves retained in the tail. Two protocols for the definition of the beginning of the distribution tail are compared: the empirical protocol used in previous works on the same data (Bertrand *et al*. 2005, 2007; Edwards 2011; the tail is defined based on the histogram of the move length built with Scott’s rule, excluding the first bin); the adaptive protocol proposed here defining the start of the tail as the minimum data value for which the GOF test of estimated model parameters is significant (CMV p-value >0.05). Note: AIC are not directly comparable when the tail definition is different because the observation sample is not the same.

## Discussion

The visual inspection of tracks is usually the first step that an ecologist undertakes when studying foraging movements. Considering the seabird tracks studied here, this step immediately reveals that booby *a* went much farther from the colony than booby *b* and cormorants *d* and *c*. Basic statistics (range, distance covered) confirm and quantify this global pattern. Fitting BGPDs to these tracks provided additional and non-redundant information that was not accessible to visual inspection and basic statistics.

First, the estimated shape parameter () revealed that all four tracks were well described by Lévy walks, suggesting that these four foraging movements were super diffusive. Anchovy is a pelagic gregarious forage fish, organized in hierarchical aggregations (schools, clusters of schools, cluster of clusters; see [44]), which are constantly moving (up to 26 km per day, [42]). Predators foraging on such prey fields constantly need to track this hierarchical organization, and this is certainly the reason why their foraging trajectories mix long moves (moves between anchovy aggregations) with short ones (moves within anchovy aggregations), producing super-diffusive patterns in their movements [13, 8].

Second, the fitted shape and scale parameters of the BGPD revealed that the studied boobies and cormorants resorted to different strategies while foraging the same prey field: boobies used moves with longer characteristic length (scale parameter) while cormorants used a more diffusive spatial strategy (larger shape parameter). The larger characteristic length of boobies’ moves may be linked to their morphology: *Sulids* are in general described to be optimal for fast and far flights [45]; *Sula variegata* in particular, when facing competition for food (including the fishery, [43]), possibly take advantage over cormorants from its greater foraging range [45]. The more diffusive patterns of cormorants emerge from a more contrasted contribution of short moves versus long moves, producing a skewed distribution of move lengths. Cormorants tend to rely more on collective foraging than boobies do. They form dense rafts when shoals are located, with large unbroken columns of flying individuals from the colony to the feeding ground [45, 46]. The importance of social information for cormorants may explain their more diffusive foraging movements: while involved in a column, cormorants fly straight to a fish aggregation producing long directed moves in contrast to much shorter moves while actively foraging on a fish aggregation. Such speculative interpretation obviously calls for complementary studies analyzing a much larger sample of tracks.

Here our main contribution is a BGPD-based methodology that (1) quantified differences among the studied tracks that were not trivial from visual inspection and basic statistics and (2) raised questions and hypotheses about differences in the foraging strategies adopted by two sympatric species foraging on the same prey field. The inference of BGPD parameters for a variety of scenarios of prey abundance and distribution should be explored to investigate how the two species deal with the great variability of the anchovy distribution (e.g. [47]), and may provide new insights on the behavioural niches of these two sympatric species [48].

The second case study, the movements of fishing vessels, provided a complementary illustration of the possibilities opened by the proposed methodology. Using the BGPD model, we characterized the different strategies exhibited by three fishermen. Similar to the seabirds, they all faced the need to track down the gregarious and constantly moving distribution of anchovy aggregations. Two of the vessels exhibited a super diffusive behaviour, well described by a Lévy walk, and the third one was best described by a normally diffusive behaviour. In that case study, the characteristic length of the observed moves was positively correlated to the diffusion coefficients, producing large differences in their scope of exploration.

Several hypotheses could explain such variability in the fishermen’s strategies, including a difference in their personalities. Allen & MacGlade [49] for instance showed that facing fluctuating fish conditions, several types of spatial behaviours may emerge among fishermen, namely, stochastic versus Cartesians, characterized respectively as hunters or risk takers, and followers or low risk takers. Such hypotheses should be evaluated in future works on larger samples of fishermen, allowing a thorough analysis of fishermen personalities in this fishery, and to evaluate the benefit, at the fishery scale, of mixing such personalities for discovering the location of new fish aggregations and optimally exploiting the prey field.

Overall, the proposed BGPD framework is applicable to any move length distribution, regardless of the forager trajectory under consideration. It provides a phenomenological, compact, and pattern-oriented description of movement. It quantifies informative and non-trivial components of foraging strategies: their characteristic size (*σ*, the scale parameter) and their diffusive behaviour (*k*, the shape parameter). It is parsimonious because it does not call for testing a battery of putative models, sometimes difficult to select among (e.g. [50]). In contrast, it lets the most likely random walk model emerge from the data. This model, along with the proposed adaptive tail definition, is a generic tool to analyse GPS-tracking data and foraging strategies. It allows quantitative and synoptic comparisons between foragers as well as detections of changes with time in the behaviour of a single forager. This methodological framework is a key step to test and validate hypotheses on the processes underlying the variability of foraging movements. As such, it is of critical importance in a variety of movement ecology issues, including for instance the use of the spatial behaviour of top predators as an ecosystem indicator (e.g. [32, 13, 51]), the comparison of the behavioural niche of different species foraging the same prey field (e.g. [48]) and the identification of different personalities within the same species (e.g. [29, 30, 31]).

## Supporting Information

### S1 Protocol. Resampling observed positions into moves.

https://doi.org/10.1371/journal.pone.0132231.s005

(DOCX)

## Acknowledgments

We thank A. Bertrand and A. Chaigneau for their comments on the manuscript and A. Johansen and B.H. Victor for their advice on English editing. We thank E. Seier for valuable discussions on goodness-of-fit tests. This work is a contribution to the cooperation agreement between the Instituto del Mar del Peru (IMARPE) and the Institut de Recherche pour le Développement (IRD). It was supported by and is a contribution to the ANR project TOPINEME and the International Joint Laboratory DISCOH.

## Author Contributions

Conceived and designed the experiments: SB RF. Performed the experiments: SB RJ RF. Analyzed the data: SB RJ RF. Contributed reagents/materials/analysis tools: SB RJ RF. Wrote the paper: SB RJ RF.

## References

- 1.
Turchin P (1998) Quantitative analysis of movement. Measuring and modeling population redistribution in animals and plants. Sunderland: Sinauer Associates, Inc.
- 2. Nathan R, Getz WM, Revillac E, Holyoak M, Kadmon R, Saltz D et al. (2008) A movement ecology paradigm for unifying organismal movement research. Proceedings of the National Academy of Sciences of the U. S. A. 105: 19052–19059.
- 3. Schick RS, Loarie SR, Colchero F, Best BD, Boustany A, Conde DA et al. (2008) Understanding movement data and movement processes: current and emerging directions. Ecology Letters 11: 1338–1350. pmid:19046362
- 4. Giugglioli L, Bartumeus F (2010) Animal movement, search strategies and behavioural ecology: a cross-disciplinary way forward. Journal of Animal Ecology 79: 906–909. pmid:20337757
- 5. Viswanathan GM, Afanasyev V, Buldyrev SV, Murphy EJ, Prince PA, Stanley HE (1996) Lévy flight search patterns of wandering albatrosses. Nature 381: 413–415.
- 6. Bartumeus F, Peters F, Pueyo S, Marrasé C, Catalan J (2003) Helical Lévy walks: adjsuting searching statistics to resource availability in microzooplankton. Proceedings of the National Academy of Sciences of the U. S. A. 100: 12771–12775.
- 7. Brockmann D, Hufnagel L, Geisel T (2006) The scaling laws of human travel. Nature 439:462–465. pmid:16437114
- 8. Bertrand S, Bertrand A, Guevara-Carrasco R, Gerlotto F (2007) Scale-invariant movements of fishermen: the same foraging strategy as natural predators.. Ecological Applications 17: 331–337. pmid:17489242
- 9. Brown CT, Liebovitch LS, Glendon R (2007) Lévy Flights in Dobe Ju/’hoansi Foraging Patterns. Human Ecology 35: 129–138.
- 10. Sims DW, Southall EJ, Humphries NE, Hays GC, Bradshaw CJA, Pitchford JW et al. (2008) Scaling laws of marine predator search behaviour. Nature 451: 1098–1103. pmid:18305542
- 11. Shlesinger MF, Zaslavsky GM, Klafter J (1993) Strange kinetics. Nature 363: 31–37.
- 12. Berkolaiko G, Havlin S, Larralde H, Weiss GH (1996) The expected number of distinct sites visited by N Lévy fllights on a one-dimensional lattice. Physical Review E 53: 5774–5778.
- 13.
Bertrand S, Burgos JM, Gerlotto F, Atiquipa J (2005) Lévy trajectories of Peruvian purse-seiners as an indicator of the spatial distribution of anchovy (
*Engraulis ringens*). ICES Journal of Marine Science 62: 477–482. - 14. Benhamou S (2007) How many animals really do the Lévy walk? Ecology 88: 1962–1969. pmid:17824427
- 15. Bradshaw CJA, Sims DW, Hays GC (2007) Measurment error causes scale-dependent treshold erosion of biological signals in animal movement data. Ecological Applications 17: 628–638. pmid:17489266
- 16. Edwards AM, Phillips RA, Watkins NW, Freeman MP, Murphy EJ, Afanasyev V et al. (2007) Revisiting Lévy flight search patterns of wandering albatrosses, bumblebees and deer. Nature 449: 1044–1049. pmid:17960243
- 17. Edwards AM (2008) Using likelihood to test for Lévy flight search patterns and for general power-law distributions in nature. Journal of Animal Ecology 77: 1212–1222. pmid:18631370
- 18. Gonzales MC, Hidalgo CA, Barabasi A-L (2008) Understanding individual human mobility patterns. Nature 453: 779–782. pmid:18528393
- 19. Reynolds A (2008) How many animals really do the Lévy walk? Comment. Ecology 89: 2347–2351.
- 20. Plank MJ, Codling EA (2009) Sampling rate and misidentification of Lévy and non-Lévy movement paths. Ecology 90: 3546–3553 pmid:20120821
- 21. Codling EA, Plank MJ (2010) Turn designation, sampling rate and the misidentification of power laws in movement path data using maximum likelihood estimates. Theoretical Ecology 4: 397–406.
- 22. Humphries N, Queiroz N, Dyer JRM, Pade1 NG, Musyl MK, Schaefer K M et al. (2010) Environmental context explains Lévy and Brownian movement patterns of marine predators. Nature 465: 1066–1069. pmid:20531470
- 23. Edwards AM (2011) Overturning conclusions of Lévy flight movement patterns by fishing boats and foraging animals. Ecology 92: 1247–1257. pmid:21797153
- 24. Edwards AM, Freeman MP, Breed GA, Jonsen ID (2012) Incorrect likelihood methods were used to infer scaling laws of marine predator search behaviour. PLoS ONE 7: e45174. pmid:23071508
- 25. Gautestad AO (2013) Animal space use: distinguishing a two-level superposition of scale-specific walks from scale-free Lévy walk. Oikos 122: 612–620.
- 26. Boyd C, Punt A, Weimerskirch H, Bertrand S (2014) Movement models provide insights into variation in the foraging effort of central place foragers. PLoS ONE 286: 13–25
- 27. Joo R, Bertrand S, Tam J, Fablet R (2013) Hidden Markov models: the best models for forager movements? PLoS ONE 8: e71246. pmid:24058400
- 28. Boyer D, Ramos-Fernández G, Miramontes O, Mateos JL, Cocho G, Larralde H et al. (2006) Scale-free foraging by primates emerges from their interaction with a complex environment. Proceedings of the Royal Society B 273: 1743–1750. pmid:16790406
- 29. Fraser DF, Gilliam JF, Daley MJ, Le AN, Skalski GT (2001) Explaining Leptokuric movement distributions: Intrapopulation variation on boldness and exploration. The American Naturalist 158: 124–135. pmid:18707341
- 30. Cote J, Clobert J, Brodin T, Fogarty S, Sih A (2010) Personality-dependent dispersal: characterization, ontogeny and consequences for spatially structured populations. Philosophical Transactions of the Royal Society B 365: 4065–4076.
- 31.
van Overveld T, Matthysen E (2010) Personality predicts spatial responses to food manipulations in free-ranging great tits (
*Parus major*). Biology Letters 6: 187–190. pmid:19906682 - 32.
Boyd I, Wanless S, Camphuysen CJ (2006) Top predators in marine ecosystems. Their role in monitoring and management. Cambridge: University Press.
- 33. Akaike H (1981) Likelihood of a model and information criteria. Journal of Econometrics 16: 3–14.
- 34. White EP, Enquist BJ, Green JL (2008) On estimating the exponent of power-law frequency distributions. Ecology 89: 905–912. pmid:18481513
- 35.
Sokal RR, Rohlf FJ (2001) Biometry—The principles and practice of statistics in biological research. New York: Freeman and Company.
- 36. Davison AC, Smith RL (1990) Models for Exceedances over High Thresholds. Journal of the Royal Statistical Society, Series B (Methodological) 52: 393–442.
- 37. Pickands J (1975) Statistical Inference Using Extreme Order Statistics. The Annals of Statistics 3: 119–131.
- 38. Grimshaw SD (1993) Computing Maximum Likelihood Estimates for the Generalized Pareto Distribution. Technometrics 35: 185–191.
- 39. Chaouche A, Bacro J-N (2006) Statistical Inference for the Generalized Pareto Distribution: Maximum Likelihood Revisited. Communications in Statistics—Theory and Methods 35: 785–802.
- 40. Bush J, Woodru B, Moore A, Dunne E (1983) Modified Cramér-von Mises and Anderson-Darling tests for weibull distributions with unknown location and scale parameters. Communications in Statistics—Theory and Methods, 12: 2465–2476.
- 41.
Scherrer B (2007). Biostatistique. Montréal: Gaëtan Morin.
- 42.
Peraltilla S, Bertrand S (2014)
*In situ*measurements of the speed of Peruvian anchovy schools. Fisheries Research 149: 92–94. - 43. Bertrand S, Joo R, Arbulu Smet C, Tremblay Y, Barbraud C, Weimerskirch H (2012) Competition for the same fish: Local depletion by the fishery can affect seabird foraging. Journal of Applied Ecology 49: 1168–1177.
- 44.
Fréon P, Misund OA (1999) Dynamics of pelagic fish distribution and behaviour: effects on fisheries and stock assesment. Oxford: Blackwell Science.
- 45.
Nelson JB (2005) Pelicans, Cormorants and their relatives
*Pelecanidae*,*Sulidae*,*Phalacrocoracidae*,*Anhingidae*,*Fregatidae*,*Phaethontidae*. Oxford: University Press. - 46.
Murphy RC (1936) Oceanic birds of South America. New York: The Macmillan company, The American museum of natural history.
- 47. Chavez F, Bertrand A, Guevara-Carrasco R, Soler P, Csirke J. (2008) The Northern Humboldt Current System: Brief history, present status and a view towards the future. Progress in Oceanography 79: 95–1051.
- 48. Weimerskirch H, Bertrand S, Silva J, Peraltilla S, Bost CA (2012) Comparative foraging behaviours of the two major guano producing seabirds, the Guanay cormorant and the Peruvian booby in the Humboldt Current System. Marine Ecology Progress Series 458:231–245.
- 49. Allen PM, McGlade JM (1986) Dynamics of discovery and exploitation: The case of the Scotian shelf groundfish fisheries. Canadian Journal of Fisheries and Aquatic Sciences 43: 1187–1200.
- 50. Benoit-Bird KJ, Battaile BC, Nordstrom CA, Trites AW (2013) Foraging behavior of northern fur seals closely matches the hierarchical patch scales of prey. Marine Ecology Progress Series 479: 283–302.
- 51.
Bertrand S, Díaz E, Lengaigne M (2008) Patterns in the spatial distribution of Peruvian anchovy (
*Engraulis ringens*) revealed by spatially explicit fishing data. Progress in Oceanography 79: 379–389.