Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Comparing marine distribution maps for seabirds during the breeding season derived from different survey and analysis methods

  • Alex Sansom,

    Roles Formal analysis, Methodology, Visualization, Writing – review & editing

    Affiliation Centre for Conservation Science, Royal Society for the Protection of Birds, Scottish Headquarters, Edinburgh, United Kingdom

  • Linda J. Wilson ,

    Roles Visualization, Writing – original draft, Writing – review & editing

    Linda.Wilson@rspb.org.uk

    Affiliation Centre for Conservation Science, Royal Society for the Protection of Birds, Inverness, United Kingdom

  • Richard W. G. Caldow,

    Roles Conceptualization, Funding acquisition, Methodology, Writing – review & editing

    Affiliation Natural England, Blandford Forum, Dorset, United Kingdom

  • Mark Bolton

    Roles Conceptualization, Funding acquisition, Methodology, Writing – review & editing

    Affiliation Centre for Conservation Science, Royal Society for the Protection of Birds, Sandy, United Kingdom

Abstract

Understanding how seabirds use the marine environment is key for marine spatial planning, and maps of their marine distributions derived from transect-based surveys and from tracking of individual bird’s movements are increasingly available for the same geographic areas. Although the value of integrating these different datasets is well recognised, few studies have undertaken quantitative comparisons of the resulting distributions. Here we take advantage of four existing distribution maps and conduct a quantitative comparison for four seabird species (black-legged kittiwake Rissa tridactyla; European shag Phalacrocorax aristotelis; common guillemot Uria aalge; and razorbill Alca torda). We quantify the amount of overlap and agreement in the location of high use areas identified from either tracking or transect samples and use Bhattacharyya’s Affinity to quantify levels of similarity in the general distribution patterns. Despite multiple differences in the properties of the datasets, there was a far greater degree of overlap than would be expected by chance, except when adopting the most constrained definition of high use. Distance to the nearest conspecific colony appeared to be an important driver of the degree of similarity. Agreed areas of highest use tended to occur close to colonies and, with increasing distance from colonies, similarity between datasets declined and/or there was similarity in respect of their being relatively low usage. Interpreting reasons for agreement between data sources in some areas and not others was limited by an inability to control for the multiple potential sources of differences from both the sampling and modelling processes of the underlying datasets. Nevertheless, our quantitative comparative approach provides a valuable tool to quantify the degree to which an area’s importance is corroborated across multiple datasets, and therefore confidence that an important area has been correctly identified. This can help prioritise where the implementation of conservation measures should be targeted and identify where greatest scrutiny is required of the potential adverse environmental effects of any planned anthropogenic activities.

Introduction

Seabird populations are undergoing declines at a faster rate than any other avian group [1] and face multiple pressures at sea, including many of anthropogenic origin. Marine spatial planning should focus conservation efforts in the places of greatest ecological importance to these birds and identify the sea areas in which they are most vulnerable to various anthropogenic activities in order to minimise, avoid or mitigate possible impacts on them. Such planning requires a good understanding of birds’ use of the marine environment, and knowledge of the exposure of the different components of the population to pressures in different places. Unfortunately, spatial distributions of species in the marine environment are particularly challenging to determine by survey. However, the range and sophistication of techniques available to gather spatial distributional information is constantly changing as technology advances. The at-sea distribution of marine birds has traditionally been sampled using boat-based and aerial transect visual sampling surveys, using techniques largely developed since the early 1980s [24]. Into the new millennium, these have been complemented with increasing use of digital aerial sampling (video or photographic stills) which takes advantage of advances in high resolution photographic equipment [57]. Moreover, increasing miniaturisation and technological improvement of bird-borne devices such as GPS loggers has proved to be transformational in the amount of information available on the at-sea movements of individual birds [8, 9].

Analytical approaches and software to map marine bird distribution data have also become more advanced and more accessible. A range of species distribution modelling techniques are now available for spatially referenced abundance and occurrence data [10, 11] and techniques have been developed to deal with the large quantities of movement data now resulting from tracking studies and the analytical challenges they present [9, 1215]. These techniques mean that population-level distributions can now be estimated based on the tracked movements of individuals e.g. [13, 16]. This leads to the relatively new situation where population-level seabird distributional information derived from the two contrasting sampling techniques of transects and tracking has become available for the same geographic area.

The properties of the data collected under different sampling methods will affect the interpretation of any derived spatial distribution maps. Transect data capture concurrent location information across all individuals observed at a fixed point in space [17, 18]. However, the individuals observed are of unknown provenance and usually of unknown breeding status. Furthermore, because the method is based on direct observation, sampling is limited to daylight hours and suitable weather conditions. Resulting distribution maps therefore potentially represent birds of all ages and breeding status and reflect space use during daylight hours and relatively good weather conditions. Tracking provides location information of individuals through time, during both day and night, and under all weather conditions. Data comes from specific individuals usually of known breeding status and provenance. However, the duration over which tracking data can be collected for an individual is limited by deployment constraints (e.g. bird welfare considerations, attachment and retrieval method, battery life), and financial or logistical constraints limit the numbers of individuals which can be tracked in any one year. Therefore, resulting distribution maps typically represent distributions of breeding adults under all light and weather conditions, but often during a concentrated period of the breeding season from a specific sample of colonies and individuals. The latter can lead to concerns regarding representativeness of the resultant sample of tracks and derived maps of space use [19].

Each method therefore comes with its own strengths and weaknesses (depending on the questions being asked) so there is value in the integration of different data sources and several approaches to do so exist. A robust statistical approach is to build hierarchical models applied to transect and tracking data collected systematically to fulfil a common goal, however few studies to date have used this method (but see [20]). An alternative is to build separate models for each dataset and use a model average or the best fitting model to make a single prediction across the study area but we know of no examples where this has been done. The most common approach has been to make a comparison (usually qualitative) of the estimated spatial distributions resulting from separate models for each dataset and use these in a complementary way e.g. [18, 2126]. This latter approach can offer a pragmatic and relatively simple way to maximise interpretation of different datasets for conservation managers, and is also useful in situations where the original raw data are unavailable or were not designed for more sophisticated statistical integration techniques.

Since 2010, four initiatives have resulted in the generation of distribution maps for seabirds within UK waters during the breeding season. Three of these used transect data (Kober et al. (2010) [27] which was UK-wide; ‘SeaMaST II’, an unpublished updated version of Bradbury et al. (2014) [28] which covered English Waters; and Bradbury et al. (2017) [29] which covered Scottish, Welsh and Northern Irish waters) and one used tracking data (Wakefield et al. (2017) [16], UK-wide). Here, we compare these marine distribution maps for the only four species of breeding seabird which were common across all four studies: common guillemot Uria aalge (a pelagic, deep diving auk), hereafter ‘guillemot’; razorbill Alca torda (a pelagic, shallow diving auk); black-legged kittiwake (a pelagic, surface feeder), hereafter ‘kittiwake’; and European shag (a coastal, benthic feeder), hereafter ‘shag’.

The main applications for these distribution maps have been to investigate species’ dispersion patterns to better understand their ecological requirements and to identify high density areas suitable for targeting conservation efforts (e.g. identifying protected sites and areas of greater vulnerability) (see Methods). So, as conservation practitioners, we wished to investigate how consistent the maps were in these respects. We recognised that comparisons could be influenced by multiple differences in the properties of the maps due to the sampling process (see above) as well as the modelling process used to derive each map. Given these multiple and potentially confounding differences, our comparison work did not seek to test a series of hypotheses regarding the manner in which the distribution maps would agree or disagree and the possible causes of those (dis-)agreements. Instead, our objectives were to quantitatively assess (i) how well the maps agreed in the location of high use areas, and (ii) the degree of similarity in the mapped distribution patterns.

Methods

Data sources and processing

The four maps used in our comparisons were derived from various sources and had already been used for a variety of purposes (Table 1). Kober et al. (2010) [27] used Poisson Kriging to model boat transect data collected between 1980–2004 and generated distribution maps for 26 seabird species across British Fisheries Limits, subsequently used to identify hotspots that might be considered for designation as marine Special Protection Areas (SPAs) [30]. The same boat survey data were supplemented with more recent boat transect data (up to 2014) and aerial transect data (collected between 2000–2013), and modelled for 32 seabird species using Generalised Additive Models for English waters (Bradbury et al. (2014), updated [28]) or Generalised Estimating Equations for the rest of UK waters (Bradbury et al. (2017) [29]). These were modelled as part of two separate assessments of the vulnerability of seabirds to wind farms and by-catch respectively (the latter was combined with the former to generate a single UK-wide map for the by-catch vulnerability assessment). Finally, Wakefield et al. (2017) [16] applied mixed-effects generalised linear models (GLMMs) to tracking data collected for four seabird species breeding at 12–20 UK colonies between 2010–2014 to model habitat use as a function of habitat availability, accessibility and proxies of intra-specific competition. These models were used to produce the first UK-wide at-sea distribution maps for seabirds derived from tracking data, to help highlight areas most warranting conservation measures. Thus, each map varied in several properties relating to survey method, spatial and temporal coverage, analytical technique and output format (Table 1 and S1 Table) but all reflected usage by the species’ population of a given grid cell relative to other grid cells. In all cases, data across years were pooled prior to analysis to produce a single map per species/season, and all of the transect-based maps had adjusted densities for the decline in detectability at increasing distance from the transect line. Hereafter we refer to these four outputs according to the underlying sampling method, namely: ‘boat-only transects’ [27]; ‘boat/aerial transects restricted to English waters’ ([28] updated, though note some Welsh waters are included); ‘boat/aerial transects restricted to Scottish, Welsh and Northern Irish waters’ [29]; and ‘tracking’ [16]. The maps derived from transect data were available for different seasons, while those derived from tracking data were only available for the breeding season. We only compared breeding season maps, noting that there was still variation in the months covered (Table 1).

thumbnail
Table 1. The properties of each data source used for the comparisons.

S1 Table provides detail on the covariates used in the models.

https://doi.org/10.1371/journal.pone.0201797.t001

To allow quantitative comparisons between each pair of maps, the outputs from the different datasets were resampled to share a common resolution (1km x 1km for all analyses except the Spearman’s Rank correlation, see later), processed into raster format (if not already) and projected to the same coordinate system (Lambert Azimuthal Equal Area in the ETRS_1989_LAEA coordinate system). The resampling meant that all datasets were downscaled, apart from the shag tracking dataset which was up-scaled. Our metrics of overlap and similarity (see below) were such that the replication of values and increased sample size caused by downscaling would not directly influence their outputs. Therefore downscaling allowed us to extract a high level of spatial information across a common grid size. For each comparison, datasets were first clipped to the commonly shared geographic extent. Most notably, this meant that any comparisons including the updated Bradbury et al. (2014) [28] output were restricted to a shared area within English waters, while those including the Bradbury et al. (2017) [29] output were restricted to a shared area within the rest of UK waters. It is important to note that the boat/aerial transect output restricted to English waters excluded usage values with associated coefficient of variation values exceeding 0.5 as these were considered unreliable [28], which further reduced the area for comparison, in particular for razorbill and shag. All processing and analysis of spatial data was conducted in ArcGIS 10.3.1 and R 3.2.1 (using the Raster package version 2.5–8 [32]).

Overlap and agreement in the location of high use areas.

Before making comparisons of the locations of high use areas, it is first necessary to define areas of high use. Different thresholds, based on the statistical distribution of the usage values, can be used to delineate areas of high use, so for each map we identified those grid cells in which use exceeded a range of percentile thresholds (50th, 75th, 90th, 95th and 99th percentiles), and then made comparisons between each pair of outputs at each threshold. Pair-wise comparisons were not undertaken where the threshold value in either dataset reached zero i.e. no distinction could be made in one or both rasters as to areas of ‘higher use’. For each pair-wise comparison, we calculated the degree of overlap, defined as the proportion of the locations exceeding the xth percentile which were shared between both data sources, such that a value of one would represent complete overlap in where higher use areas were located. This provided a quantitative measure of the amount of overlap between any two maps in the location of higher use areas, and how this varied depending on the threshold used. For comparison, we calculated this same metric in accordance to what we would expect if cells had been randomly assigned to be above or below the threshold, based on simple probabilities, by dividing the expected number of shared locations ((1-percentile)2 * size of study area) by the number of cells in the upper percentile of each output ((1-percentile) * size of study area).

In addition to these pair-wise comparisons we also mapped the degree of agreement in the locations of high use areas between all three outputs (although there were four outputs in total, the maximum number of outputs that could agree in any one area was three as the two boat/aerial transect outputs did not overlap geographically). For this purpose we clipped datasets to the three-way shared area in English and in Scottish/Welsh/Northern Irish waters separately, and then used the 95th percentile threshold for each as a nominal representation of the highest use areas. Each grid cell was then categorised according to the degree of agreement regarding whether it exceeded the 95th percentile threshold, i.e. agreement across all three outputs; agreement across each possible combination of two outputs; no agreement (only one output identified the grid cell as exceeding the 95th percentile threshold); and agreement across all three outputs that the grid cell did not exceed the 95th percentile threshold.

Similarity of distribution patterns within core areas of use.

To quantify the overall degree of similarity between pairs of estimated spatial distributions, we used Bhattacharyya’s Affinity (BA; [33]), a recommended statistic for providing a general measure of similarity between estimates of utilisation distributions, typically representing individuals or populations [34]. This provides a single measure of overall similarity between each pair-wise comparison and ranges from 0 (no similarity) to 1 (identical utilisation distributions) using the calculation: BA = ∑(√V1 x √V2); where V1 = value in 1st raster and V2 = value in 2nd raster.

The BA statistic was calculated using rasters containing only cells which exceeded the 50th percentile, often used to define the core area of use [19]. This allowed a reasonable size of area over which to calculate the similarity index (because the area of overlap reduces markedly with higher thresholds, see Results), while excluding areas of very low usage which are of less interest and may otherwise have complicated the interpretation of the calculation; BA is a useful tool to compare areas of usage, rather than areas of non-usage. Exceptions to using the 50th percentile had to be made for selected pair-wise comparisons for razorbill and shag where the threshold value had already reached zero. In these cases, the ‘core area of use’ was represented by the next highest percentile that ensured the threshold value exceeded zero (this was either the 75th or 90th percentile). To make values appropriate as inputs to the BA formula, which requires the values to sum to one, the data in ‘core use’ rasters were re-scaled (i.e. each grid cell value was divided by the sum of values in the grid).

As the BA statistic provides only a single value for each comparison, we assessed how similarity varied spatially by calculating the proportional contribution of each grid cell to the overall BA value for the area of overlap, where higher values indicated co-occurrence of high usage and lower values indicated either less similarity or co-occurrence of low usage. We used Spearman’s rank correlations to assess how the cell-by-cell proportional contribution to overall similarity in each pairwise comparison varied with distance to the nearest conspecific breeding colony (using minimum distance-by-sea, calculated as per Wakefield et al. (2017) [16]). Before doing this all inputs were resampled to share a common resolution defined by the raster in each pair which originally had the largest cell size, to avoid falsely inflating sample size for this statistical test. Non-parametric tests were used since the data were under-dispersed. We only considered the relative size and direction of the Spearman’s rank test values, not P-values, due to spatial autocorrelation. Spearman’s rank tests were conducted using cor.test in R. Regression lines were plotted using the geom._smooth() function in the R package ggplot2 (version 2.2.1) [35], specified using binomial GLM with an x~y relationship.

Results

Overlap and agreement in the location of high use areas

We made a total of 100 pair-wise comparisons (five for each of the four species, for each of the five thresholds), to assess how overlap varies depending on the threshold used to define high use (Fig 1). In all cases but two, the degree of overlap (the proportion of the locations exceeding the xth percentile which was shared between both data sources) increased as the threshold percentile defining high usage was reduced, with similar patterns of change across all species. Specifically, in every comparison, overlap was lowest when only cells exceeding the 99th percentile were considered (mean proportional overlap across all species and comparisons = 0.13). The degree of overlap increased markedly as the threshold for defining high use was reduced to the 95th or 90th percentile (mean proportional overlap across all species and comparisons = 0.50 at the 90th percentile), but the rate of increase in overlap slowed once thresholds below 90th percentile were considered. Crucially, the degree of overlap was consistently greater than would be expected by chance, except in some cases for the 99th percentile threshold.

thumbnail
Fig 1. Overlap in high use areas for pair-wise comparisons as the percentile threshold defining ‘high use’ decreases.

The black dotted line shows the proportional overlap expected by chance. A value of zero indicates no overlap and a value of one indicates complete overlap.

https://doi.org/10.1371/journal.pone.0201797.g001

Given our interest in areas of highest use and the relatively poor levels of overlap when using the 99th percentile threshold (Fig 1), we present a further investigation of patterns in the degree of overlap in the location of high use areas only for the 95th percentile threshold (28 pair-wise comparisons, seven for each of the four species, see Table 2). The degree of overlap varied among comparisons, with the proportional overlap of high use areas ranging from 0.22 to 0.51 (Table 2). However, there was little difference in the level of overlap within grouped comparisons for: species (mean proportional overlap = 0.38 (guillemot), 0.34 (razorbill), 0.35 (kittiwake) and 0.4 (shag)); regions (mean proportional overlap = 0.32 (English), rest of UK waters (0.39), UK as a whole (0.42)); or data type (mean proportional overlap = 0.38 (tracking / transect comparisons), 0.33 (transect/transect comparisons)).

thumbnail
Table 2. Extent of overlap of ‘high use’ areas (cells >95th percentile) in pair-wise comparisons of datasets.

https://doi.org/10.1371/journal.pone.0201797.t002

There were some general similarities among species in terms of broad regions of higher usage (>95th percentile) identified by all three datasets (Fig 2). For guillemot, razorbill and kittiwake, these were: within the Moray Firth (Area 1 in Fig 2A–2C); off the Grampian and Angus coast in east Scotland (Area 2 in Fig 2A–2C); and an area off the Yorkshire coast adjacent to Flamborough Head and the Filey Coast SPA (Area 3 in Fig 2A–2C). However, the latter area of agreement for guillemot and razorbill was coastal while for kittiwake, it was further offshore around the Dogger Bank area. For guillemot and kittiwake there was also agreement among all three datasets that there was a high use area off the Northumberland coast adjacent to the Farne Islands SPA (Area 4 in Fig 2A and 2C) held high densities, though this was more extensive for guillemot than for kittiwake. For razorbill, there was an additional high use area agreed across all three datasets at the narrowest part of the North Channel, between north-east Ireland and south-west Scotland, (Area 5 in Fig 2B). For shags, high use areas were much more restricted to the coastline than for the other species. The main region where there was shared agreement among all three datasets for this species was in scattered areas along the west coast of the Scottish mainland (Area 6 in Fig 2D). Other, much less extensive areas of agreement for shag included the Moray Firth (e.g. off the North Caithness Cliffs SPA) (Area 1 in Fig 2D) and along the Fife coast opposite the Isle of May, part of the Forth Islands SPA (Area 7 in Fig 2D).

thumbnail
Fig 2. Agreement in locations of ‘high use’ areas (cells > 95th percentile) in different distribution maps.

(a) guillemot, (b) razorbill, (c) kittiwake and (d) shag. Colour shading shows degree of agreement: green—all three maps agreed the area was ‘high use’; shades of blue—two maps agreed the area was ‘high use’; shades of red—only one map identified the area as ‘high use’, yellow—all maps agreed the area was not ‘high use’. Grey lines indicate the different regions within which comparisons were made (‘English waters’ and ‘Scottish, Welsh and Northern Irish waters’). Breeding colony SPA locations for which the relevant species is a designated feature are shown (provided by Joint Nature Conservation Committee under Open Government License). European coastline base map provided by the European Environment Agency under ODC-by license.

https://doi.org/10.1371/journal.pone.0201797.g002

Similarity of spatial distribution patterns within core areas of use

The overall similarity among datasets in the pattern of use across core areas (grid cells exceeding the 50th percentile) was typically high (BA>0.5) for all species, with the exception of shag where BA values for four of the seven pair-wise comparisons were <0.5 (Fig 3). Note however that all four of these were comparisons using data with a 90th, rather than 50th, percentile threshold so similarity is expected to be lower. The BA value reached a maximum of 0.87 for the comparison for kittiwakes between boat/aerial transects and tracking datasets restricted to Scottish, Welsh and Northern Irish waters. However, BA values varied little within grouped comparisons for regions (mean BA = 0.59 (English waters), 0.69 (rest of UK waters), 0.64 (UK as a whole) or data type (mean BA = 0.64 (tracking / transect comparisons), 0.65 (transect/transect comparisons)).

thumbnail
Fig 3. The degree of similarity between pair-wise comparisons of spatial distribution patterns.

Similarity was measured by Bhattacharyya’s Affinity which ranges in value from 0 (no similarity) to 1 (identical utilisation distributions). These were calculated for cells > 50th percentile, except for selected pair-wise comparisons for razorbill and shag (see labelled bars), which were calculated for cells > 75th or 90th percentile (see methods).

https://doi.org/10.1371/journal.pone.0201797.g003

We found that for almost all pair-wise comparisons, similarity was greatest closest to the nearest conspecific colony and declined with distance from it, with the rate of decline greatest closer to the colony (Table 3, Fig 4). The only two exceptions to this were the pair-wise comparisons that included the boat/aerial transect datasets for kittiwake within English waters; more generally, the strength of the relationship between similarity and distance from the colony was least pronounced in kittiwake.

thumbnail
Fig 4. The degree of similarity between datasets in relation to distance from the nearest conspecific colony.

Similarity was measured as the proportional contribution to the overall Bhattacharyya’s Affinity (BA) value. BA scores compare areas of usage rather than non-usage, so while higher values indicate co-occurrence of high usage, lower values can reflect low usage in one or both datasets.

https://doi.org/10.1371/journal.pone.0201797.g004

thumbnail
Table 3. Spearman’s rank correlations between the contribution of each grid cell to the overall BA similarity value for each pair-wise comparison and the distance of that cell from the nearest colony.

https://doi.org/10.1371/journal.pone.0201797.t003

Discussion

To our knowledge we have undertaken the first quantitative comparison of population-level seabird marine distribution maps derived from tracking and transect samples at a UK scale. This has allowed us to quantify overlap and agreement in the location of high use areas and the degree of similarity in the mapped distribution patterns. The majority of high use areas identified across all data sources were within inshore waters (<12nm from shore). While shag showed the most coastally restricted of these, some agreed high use areas for kittiwake extended into offshore waters, reflecting differences in their foraging ranges [36]. Four broad regions were consistently identified as high use areas across multiple species: the Moray Firth, and areas off the Grampian / Angus, Northumberland and Yorkshire coasts. In general, the locations of these areas make biological sense as they tend to be close to known breeding colonies, or in the vicinity of shallow sand banks (e.g. Dogger Bank, Wee Bankie, Marr Bank, Smith Bank) which are known to be important sandeel spawning areas [37, 38]. These areas are of particular conservation interest as they are subject to pressure from various human activities including an industrial sandeel fishery (Dogger Bank [39]), several operational or planned renewable energy developments [40], and have all been identified as being at relatively high risk to seabird bycatch from various types of fishing gear [29]. While our analysis was not designed to identify marine protected areas, some of the high use areas consistently identified across the datasets (parts of the Moray Firth, the Firth of Forth, and off the Northumberland coast) are being considered as marine Special Protection Areas, or have recently been designated as such [4144].

Almost all pair-wise comparisons reveal a far greater degree of overlap in used locations at all percentile thresholds examined than would be expected by chance. This indicates that all the datasets captured a common signal in the spatial dispersion of the four species. Proportional overlap in the location of high use areas increased as the percentile threshold defining high use was lowered indicating that the signal becomes stronger when progressing the search for overlap from locations of highest use towards those of core use. At the 99th percentile threshold the level of overlap reached a maximum of just 30% and in some cases overlap was no better than random. In contrast, for the 95th percentile threshold, overlap between high use areas was between 22% and 51%, compared to an expected degree of overlap of just 5%. Our analysis indicated that increasing the threshold of ‘high use’ above the 95th percentile severely risks the reliability of any positive corroboration results, which could potentially be no better than random and risks failure to corroborate truly important areas due to application of too stringent a definition of ‘high use’. The definition of ‘high use’ therefore requires very careful consideration particularly in the context of applying an integrated approach to site identification that relies on corroboration of ‘high use’ across independent datasets.

Within the broader sea areas defined by the 50th percentile i.e. ‘core’ areas, measures of similarity in spatial use by all species were typically high (BA > 0.5). Within these ‘core’ areas distance to the nearest conspecific colony appeared to be an important influence on how well the outputs compared.

For all species, the proportional contribution to overall similarity was greatest closest to the nearest conspecific colony and declined with distance from the nearest conspecific colony (or there was increasing similarity in respect of there being relatively low use further from colonies; low BA values indicate either scenario). This most likely reflects the fact that seabird ecology during the breeding season is heavily influenced by central place foraging constraints. The only exceptions to the general pattern were two pair-wise comparisons for kittiwakes in English waters where there was clear evidence of high use in large areas relatively far offshore from the nearest colony on the Yorkshire coast (Fig 2C).

Our comparisons focussed on overlap, agreement and similarity, rather than differences, between maps due to the difficulties associated with disentangling multiple ecological and statistical properties that might lead to differences. However, one of the most obvious differences in the biological information contained within the tracking dataset as opposed to the various transect datasets is that the former captures information on the spatial distribution of breeding individuals only. The locations of high use areas from outputs based on samples that potentially included non-breeding birds (i.e. transect datasets, particularly those which included surveys carried out later in the season during post-breeding dispersal offshore) might therefore be expected to be less concentrated around breeding colonies than the tracking datasets. However, although transect samples did identify high use areas further offshore that were not corroborated by tracking data (e.g. dark red/brown areas in Fig 2A–2C), there was no quantitative evidence that tracking- and transect-derived maps showed less overlap (in the location of high use areas), or were less similar to each other (in their core distribution patterns) than was the case for any two transect-derived maps, despite the more fundamental differences between the former data types. There are several potential reasons for this finding. Firstly, non-breeders may not have a great influence on the at-sea distributions derived from transect samples, either because they are to some extent also subject to central place foraging (e.g. they are prospecting breeding sites, e.g. [45] and their distributions, therefore, resemble those of breeding birds), or because they are so dispersed that transect-derived maps of distribution during the breeding season are overwhelmingly dominated by aggregations of breeding birds. In addition, non-breeders are thought to generally comprise a smaller proportion of the overall population than breeders [46, 47] (though notably one study estimates that the ratio of immature birds to breeding adults is 1.31 for shag [47]). Secondly, the differing sample periods of the datasets may play a role if distributions have changed over time. For example if breeding birds range more widely during times of food stress and, as indicated in [48], there has been increasing food stress in recent years, then the more recently sampled transect and tracking at-sea distributions may show dispersion patterns more closely resembling each other than either resemble those derived from less recent transect sampling. Thirdly, choice of modelling technique has been shown to influence the estimated spatial distribution and therefore which areas are identified as important [49]. While the boat-only transects were modelled based only on the inherent spatial autocorrelation in the data, models for the other three outputs all included covariates. None of these three possibilities are mutually-exclusive nor possible to discount or demonstrate easily.

Conservation efforts, in particular the designation of protected sites and assessment of the impacts to anthropogenic activities, rightly face high levels of scrutiny and it is important that the identification of important or vulnerable areas for marine birds is supported by as strong an empirical evidence base as possible. Corroboration by more than one independent data source is one way to maximise confidence that protective measures are put in the right place [21, 50]. Although the datasets we compared cannot be considered truly independent, locations where we have found total agreement that they are highly used regardless of the survey platform, age of data, population sampled and modelling applied, are areas in which there can be high confidence of their persistent importance during the breeding season for the species concerned. Areas such as these where there is shared agreement in their importance across differing data sources merit prioritisation for consideration of conservation measures and particular scrutiny when assessing potential impacts of any current or planned anthropogenic activities. Notwithstanding this, when interpreting the results from the comparisons, it is important to include consideration of the spatial distribution of uncertainty associated with each modelled output. There will be lower confidence that agreed areas of high use are truly important if there is high uncertainty in one or both of the underlying datasets compared to those where uncertainty is low in both. However, although error information is available for all of the datasets, the derivation and meaning of the metrics varied among them (see Table 1) so the relative magnitude of the errors may not be equivalent between data sets. Therefore, we were unable to test how levels of error affected the degree of agreement in our pair-wise comparisons.

This study has highlighted the value of considering multiple data sources to inform marine conservation and management decisions and developed a protocol to do so using quantitative comparisons that could be extended further to a wider range of species, locations and geographic scales. However, much progress could be made in the future by controlling for some of the differences that have hampered our ability to interpret our results. A priority would be to undertake concurrent sampling by both transects and tracking within the same area (controlling for geographic and temporal differences) with sampling and analysis designed specifically with the aim of using differences between the resulting outputs to distinguish breeder and non-breeder distributions. This would improve our understanding of potential asymmetries in how pressures might impact different components of the population and where protective measures for each are best targeted.

Supporting information

S1 Table. Covariates used in the models underlying each of the mapped outputs used in the comparisons.

https://doi.org/10.1371/journal.pone.0201797.s001

(DOCX)

Acknowledgments

This work relied on the availability of the source datasets. We would like to thank JNCC for providing the modelled boat density data, and Defra for providing the modelled boat/aerial density data for Scottish/Welsh/Northern Irish waters. RSPB own the tracking modelled data and Natural England own the boat/aerial modelled data for English waters. We would also like to thank those individuals most heavily involved in the original work that generated the models we have used in this project including in particular: Ellie Owen of RSPB, Ewan Wakefield of University of Glasgow (formerly of RSPB), Kerstin Kober of JNCC, Gareth Bradbury of WWT Consulting Ltd, Mark Trinder of MacArthur Green, Lindesay Scott-Hayward of University of St Andrews. We thank Ewan Wakefield, Gareth Bradbury, Ellie Owen and Matthew Carroll for their input through the project’s steering group and two anonymous referees who provided valuable comments and suggestions on an earlier draft.

References

  1. 1. Croxall JP, Butchart SHM, Lascelles B, Stattersfield AJ, Sullivan B, Symes A, et al. Seabird conservation status, threats and priority actions: a global assessment. Bird Conservation International. 2012;22(01):1–34.
  2. 2. Tasker ML, Jones PH, Dixon TJ, Blake BF. Counting seabirds at sea from ships: a review of methods employed and a suggestion for a standardized approach. Auk. 1984;101:567–77.
  3. 3. Komdeur J, Bertelsen J, Cracknell G. Manual for aeroplane and ship surveys of waterfowl and seabirds. Iwrb Special Publication. 1992;19.
  4. 4. Camphuysen KJ, Fox AD, Leopold MF, Petersen IK. Towards standardised seabirds at sea census techniques in connection with environmental impact assessments for offshore wind farms in the UK: a comparison of ship and aerial sampling methods for marine birds, and their applicability to offshore wind farm assessments. Texel: 2004.
  5. 5. Thaxter CB, Burton NHK. High definition imagery for surveying seabirds and marine mammals: A review of recent trials and development of protocols. 2009.
  6. 6. Buckland ST, Burt ML, Rexstad EA, Mellor M, Williams AE, Woodward R. Aerial surveys of seabirds: the advent of digital methods. Journal of Applied Ecology. 2012;49(4):960–7.
  7. 7. Goodship N, Caldow R, Clough S, Korda R, McGovern S, Rowlands N, et al. Surveys of red-throated divers in the Outer Thames Estuary SPA. British Birds. 2015;108:506–13.
  8. 8. Burger AE, Shaffer SA. Application of tracking and data-logging technology in research and conservation of seabirds. The Auk. 2008;125(2):253–64.
  9. 9. Wakefield ED, Phillips RA, Matthiopoulos J. Quantifying habitat use and preferences of pelagic seabirds using individual movement data: a review. Marine Ecology Progress Series. 2009;391:165–82.
  10. 10. Elith J, Leathwick JR. Species distribution models: ecological explanation and prediction across space and time. Annual Review of Ecology, Evolution and Systematics. 2009;40(1):677–97.
  11. 11. Miller DL, Burt ML, Rexstad EA, Thomas L, Gimenez O. Spatial models for distance sampling data: recent developments and future directions. Methods in Ecology and Evolution. 2013;4(11):1001–10.
  12. 12. Kie JG, Matthiopoulos J, Fieberg J, Powell RA, Cagnacci F, Mitchell MS, et al. The home-range concept: are traditional estimators still relevant with modern telemetry technology? Philosophical transactions of the Royal Society of London Series B, Biological sciences. 2010 Jul 27;365(1550):2221–31. pmid:20566499. Pubmed Central PMCID: 2894967.
  13. 13. Aarts G, MacKenzie M, McConnell B, Fedak M, Matthiopoulos J. Estimating space-use and habitat preference from wildlife telemetry data. Ecography. 2008;31(1):140–60.
  14. 14. Illian JB, Sørbye SH, Rue H, Hendrichsen DK. Using INLA to fit a complex point process model with temporally varying effects–a case study. Journal of Environmental Statistics. 2012;3:1–25.
  15. 15. Matthiopoulos J, Hebblewhite M, Aarts G, Fieberg J. Generalized functional responses for species distributions. Ecology. 2011;92:583–9. pmid:21608467
  16. 16. Wakefield ED, Owen E, Baer J, Carroll MJ, Daunt F, Dodd SG, et al. Breeding density, fine-scale tracking and large-scale modelling reveal the regional distribution of four seabird species. Ecological Applications. 2017;27:2074–91. pmid:28653410
  17. 17. Camphuysen KJ, Garthe S. Recording foraging seabirds at sea: Standardised recording and coding of foraging behaviour and multi-species foraging associations. Atlantic Seabirds. 2004;6(1).
  18. 18. Camphuysen K, Shamoun-Baranes J, Bouten W, Garthe S. Identifying ecologically important marine areas for seabirds using behavioural information in combination with distribution patterns. Biological Conservation. 2012;156:22–9.
  19. 19. Soanes LM, Arnould JPY, Dodd SG, Sumner MD, Green JA, Frederiksen M. How many seabirds do we need to track to define home-range area? Journal of Applied Ecology. 2013;50(3):671–9.
  20. 20. Yamamoto T, Watanuki Y, HE L., Nishizawa B, Sasaki H, Takahashi A. Statistical integration of tracking and vessel survey data to incorporate life history differences in habitat models. Ecological Applications. 2015;25(8):2394–406. pmid:26910963
  21. 21. Arcos JM, Bécares J, Villero D, Brotons L, Rodríguez B, Ruiz A. Assessing the location and stability of foraging hotspots for pelagic seabirds: An approach to identify marine Important Bird Areas (IBAs) in Spain. Biological Conservation. 2012;156:30–42.
  22. 22. Louzao M, Bécares J, Rodríguez B, Hyrenbach KD, Ruiz A, Arcos JM. Combining vessel-based surveys and tracking data to identify key marine areas for seabirds. Marine Ecology Progress Series. 2009;391:183–97.
  23. 23. Thiers L, Louzao M, Ridoux V, Le Corre M, Jaquemet S, Weimerskirch H. Combining methods to describe important marine habitats for top predators: Application to identify biological hotspots in tropical waters. PloS one. 2014;9(12).
  24. 24. Perrow MR, Harwood AJP, Skeate ER, Praca E, Eglington SM. Use of multiple data sources and analytical approaches to derive a marine protected area for a breeding seabird. Biological Conservation. 2015;191:729–38.
  25. 25. Péron C, Grémillet D, Prudor A, Pettex E, Saraux C, Soriano-Redondo A, et al. Importance of coastal Marine Protected Areas for the conservation of pelagic seabirds: The case of Vulnerable yelkouan shearwaters in the Mediterranean Sea. Biological Conservation. 2013;168:210–21.
  26. 26. Pérez-Ortega M, İsfendiyaroğlu S. Predicting foraging hotspots for Yelkouan Shearwater in the Black Sea. Deep Sea Research. 2017;part 2, 141:237–47.
  27. 27. Kober K, Webb A, Win I, O'Brien S, Wilson LJ, Reid JB. An analysis of the numbers and distribution of seabirds within the British Fishery Limit aimed at identifying areas that qualify as possible marine SPAs. JNCC Report No 431. 2010.
  28. 28. Bradbury G, Trinder M, Furness B, Banks AN, Caldow RW, Hume D. Mapping seabird sensitivity to offshore wind farms. PloS one. 2014;9(9):e106366. pmid:25210739. Pubmed Central PMCID: 4161323.
  29. 29. Bradbury G, Shackshaft M, Scott-Hayward L, Rexstad E, Miller D, Edwards D. Risk assessment of seabird bycatch in UK waters. Report to Defra under contract MB0126. 2017; Available from http://sciencesearch.defra.gov.uk
  30. 30. Kober K, Wilson LJ, Black J, O’Brien S, Allen S, Win I, et al. The identification of possible marine SPAs for seabirds in the UK: The application of Stage 1.1–1.4 of the SPA selection guidelines JNCC Report No 461. 2012.
  31. 31. Reid JB, Camphuysen CJ. The European Seabirds at Sea database. Biological Conservation Fauna. 1998;102:291.
  32. 32. Hijmans RJ. raster: Geographic Data Analysis and Modeling. R package version 2.5–8. https://CRANR-projectorg/package=raster. 2016.
  33. 33. Bhattacharyya . On a measure of divergence between two statistical populations defined by their probability distributions. Bulletin of the Calcutta Mathematical Society 1943;35:99–109.
  34. 34. Fieberg J, Kochanny CO. Quantifying home-range overlap: the importance of the utilization distribution. Journal of Wildlife Management. 2005;69:1346–59.
  35. 35. Wickham H. ggplot2: Elegant Graphics for Data Analysis: Springer-Verlag New York; 2009.
  36. 36. Thaxter CB, Lascelles B, Sugar K, Cook ASCP, Roos S, Bolton M, et al. Seabird foraging ranges as a preliminary tool for identifying candidate Marine Protected Areas. Biological Conservation. 2012;156:53–61.
  37. 37. Ellis JR, Milligan S, Readdy L, South A, Taylor N, Brown M. Mapping the spawning and nursery grounds of selected fish for spatial planning. Report to the Department of Environment, Food and Rural Affairs from Cefas Defra Contract No MB5301. 2010.
  38. 38. Proctor R, Wright PJ, Everitt A. Modelling the transport of larval sandeels on the north west European shelf. Fish Oceanogr. 1998;7:347–54.
  39. 39. Carroll MJ, Bolton M, Owen E, Anderson GQA, Mackley EK, Dunn EK, et al. Kittiwake breeding success in the southern North Sea correlates with prior sandeel fishing mortality. Aquatic Conservation: Marine and Freshwater Ecosystems. 2017;0:1–12.
  40. 40. Crown Estate. Offshore wind operational report January–December 2016. https://www.thecrownestate.co.uk/media/1050888/operationalwindreport2017_final.pdf 2017 [07/09/17].
  41. 41. Scottish Natural Heritage. Moray Firth Proposed Special Protection Area (pSPA) No. UK9020313. SPA Site Selection Document: Summary of the scientific case for site selection. 44pp. 2016;Available from: http://www.snh.gov.uk/docs/A2013180.pdf.
  42. 42. Scottish Natural Heritage. Outer Firth of Forth and St Andrews Bay Complex Proposed Special Protection Area (pSPA) No. UK9020316. SPA Site Selection Document: Summary of the scientific case for site selection. 65pp. 2016;Available from: http://www.snh.gov.uk/docs/A2013181.pdf.
  43. 43. Natural England. Departmental brief: Northumberland Marine potential Special Protection Area (pSPA). 77pp. 2015;Available from: https://www.gov.uk/government/uploads/system/uploads/attachment_data/file/492864/northumberland-departmental-brief.pdf.
  44. 44. Joint Nature Conservation Committee. JNCC MPA mapper webGIS. http://jncc.defra.gov.uk/page-5201. 2017 [7/9/17].
  45. 45. Votier S, Grecian W, Patrick S, Newton J. Inter-colony movements, at-sea behaviour and foraging in an immature seabird: results from GPS-PPT tracking, radio-tracking and stable isotope analysis. Marine Biology. 2011;158:355–62.
  46. 46. Barrett R, Chapdelaine G, Ankernilssen T, Mosbech A, Montevecchi W, Reid J, et al. Seabird numbers and prey consumption in the North Atlantic. ICES Journal of Marine Science. 2006.
  47. 47. Furness RW. Non-breeding season populations of seabirds in UK waters: Population sizes for Biologically Defined Minimum Population Scales (BDMPS). Natural England Commissioned Reports, Number 164. 2015.
  48. 48. Frederiksen M, Anker-Nilssen T, Beaugrand G, Wanless S. Climate, copepods and seabirds in the boreal Northeast Atlantic—current state and future outlook. Global change biology. 2013 Feb;19(2):364–72. pmid:23504776.
  49. 49. Oppel S, Meirinho A, Ramírez I, Gardner B, O’Connell AF, Miller PI, et al. Comparison of five modelling techniques to predict the spatial distribution and abundance of seabirds. Biological Conservation. 2012;156:94–104.
  50. 50. Birdlife International. Marine Important Bird Areas toolkit: standardised techniques for identifying priority sites for the conservation of seabirds at sea. Cambridge, UK: Birdlife International, 2011.