Global metrics of land cover and land use provide a fundamental basis to examine the spatial variability of human-induced impacts on freshwater ecosystems. However, microscale processes and site specific conditions related to bank vegetation, pollution sources, adjacent land use and water uses can have important influences on ecosystem conditions, in particular in smaller tributary rivers. Compared to larger order rivers, these low-order streams and rivers are more numerous, yet often under-monitored. The present study explored the relationship of nutrient concentrations in 150 streams in 57 hydrological basins in South, Central and North America (Buenos Aires, Curitiba, São Paulo, Rio de Janeiro, Mexico City and Vancouver) with macroscale information available from global datasets and microscale data acquired by trained citizen scientists. Average sub-basin phosphate (P-PO4) concentrations were found to be well correlated with sub-basin attributes on both macro and microscales, while the relationships between sub-basin attributes and nitrate (N-NO3) concentrations were limited. A phosphate threshold for eutrophic conditions (>0.1 mg L-1 P-PO4) was exceeded in basins where microscale point source discharge points (eg. residential, industrial, urban/road) were identified in more than 86% of stream reaches monitored by citizen scientists. The presence of bankside vegetation covaried (rho = –0.53) with lower phosphate concentrations in the ecosystems studied. Macroscale information on nutrient loading allowed for a strong separation between basins with and without eutrophic conditions. Most importantly, the combination of macroscale and microscale information acquired increased our ability to explain sub-basin variability of P-PO4 concentrations. The identification of microscale point sources and bank vegetation conditions by citizen scientists provided important information that local authorities could use to improve their management of lower order river ecosystems.
Citation: Loiselle SA, Gasparini Fernandes Cunha D, Shupe S, Valiente E, Rocha L, Heasley E, et al. (2016) Micro and Macroscale Drivers of Nutrient Concentrations in Urban Streams in South, Central and North America. PLoS ONE 11(9): e0162684. https://doi.org/10.1371/journal.pone.0162684
Editor: Michael E. Douglas, University of Arkansas Fayetteville, UNITED STATES
Received: May 21, 2016; Accepted: August 27, 2016; Published: September 23, 2016
Copyright: © 2016 Loiselle et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: Data are available on request to all interested readers by contacting FreshWater Watch (email@example.com) while all data are query-able without request using the FreshWater Watch data (https://freshwaterwatch.thewaterhub.org). Data are also available at Figshare (DOI: 10.6084/m9.figshare.3806505.v2).
Funding: HSBC Bank gave financial support for the FreshWater Watch, under the scope of the HSBC Water Programme. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: No competing interests exist, and the funding provided by the commercial source (HSBC Bank) does not alter the authors' adherence to all PLOS ONE policies on sharing data and materials.
Anthropogenic stressors endanger more than 65% of fluvial habitats globally . Increased nutrient loads and reduced ecosystem functioning have led to algal blooms and widespread artificial eutrophication in most freshwater ecosystems. This is evident in both periurban and rural ecosystems, where land management has a strong influence on nutrient fluxes, in comparison to the dominant climate influences in undisturbed areas [2, 3]. In urban and periurban areas, elevated impervious land cover modifies nutrient dynamics [4–6] and particulate inputs . In agriculturally dominated areas, increasingly industrial-scale activities utilise major inputs of mineral based nutrients which have basin-scale (and long term) impacts on the nutrient dynamics of rivers, river sediments and receiving waterbodies . The resulting eutrophication modifies macroinvertebrate and native fish populations, carbon sequestration and in-stream vegetation diversity (eg. favouring harmful algal blooms), effectively changing the basis of ecosystem functioning [9–12].
The use of satellite based estimates of land cover and land use provides a fundamental basis to understand the spatial variability of human-induced impacts on freshwater ecosystems . However, microscale processes and site specific conditions related to bank vegetation, pollution sources, adjacent land and water use have been shown to impact biological communities [14–16]. While macroscale information from Earth Observation (land cover/use) is increasingly available, microscale data require local data gathering. Acquisition of such high resolution field data is resource (cost, time) intensive. Most monitoring programmes focus on a limited number of typically large waterbodies (i.e., usually the most important tributaries of a given catchment area). This is particularly problematic as the majority of water bodies are small and therefore unmonitored .
Clearly, there is a need for new data acquisition approaches. One possible source of additional data is that acquired by trained citizen scientists–non-professional scientists or volunteers with basic training in data collection and ecosystem analysis. Citizen science is increasingly being relied on to improve the temporal and spatial resolution of local data acquisition, complementary to agency monitoring programmes [18–20]. This approach depends on appropriate training  and the presence of a local community willing to collaborate.
Combining macroscale and microscale information gathered through citizen scientists represents a novel opportunity to identify the conditions of freshwater ecosystems and the factors which influence their degradation. However, the relative importance of microscale data with respect to larger macroscale information for explaining ecosystem conditions remains unclear. This has important consequences as microscale conditions are more amenable to management actions (restoration, mitigation) than macroscale land use changes. The determination of threshold values for microscale conditions would allow more effective decision making.
In the present study, we explored the use of high resolution microscale data gathered by citizen scientists to improve the explanatory power of low resolution macroscale information on river stressors in stream basins in South, Central and North America. We hypothesised that nutrient concentrations are sensitive to potential drivers at both macro and microscales and that the latter are complementary to the former. To our knowledge, this is the first study to associate data acquired by citizen scientists with macroscale information for the analysis of freshwater ecosystems.
Our analysis was based on the hypothesis that information at two very different scales, macroscale data (globally available) and microscale data (obtained by local citizen scientists) would provide insights to different nutrient pathways and processes, allowing for a more robust analysis of sub-basin conditions [22, 23]. We focused on microscale data that would best describe point sources and processes (pollution sources, bankside vegetation, local land use/cover) and macroscale data for diffuse processes and sources (nutrient loadings, general land use/cover and population). Additional hydrological variables (stream order, sub-basin size and sampling day precipitation) were also expected to influence nutrient conditions .
Macroscale data acquisition
Drainage basin boundaries (HydroBasins) for each study area were extracted from the 15 arc-second resolution USGS HydroSHEDS database. HydroBasins are nested hierarchically into 12 levels following the Pfafstetter coding system . For this study, level 10 basins were used, which provided an appropriate scale for aggregating samples into similar sized basins with common geological and climate conditions. Stream order (Strahler classification) was determined using 15 arc-second resolution USGS HydroSHEDS flow accumulation and flow direction data, using a minimum accumulation condition of 100 cells. Population densities for 2010 were obtained from the Columbia University Center for International Earth Science Information Network [26, 27]. Daily precipitation data was obtained from the Global Precipitation Climatology Project . The Adjusted Human Water Security (AHWS) index was used as well as its component datasets (30' resolution latitude x longitude) for nutrient loading and land cover fractions for the year 2000 . The AHWS combines key global drivers regarding water resource development (human and agricultural), pollution (nutrient loading), watershed disturbances (cropland and livestock density) and biotic factors (fishing and invasive species).
Between September 2013 and September 2015, 1,000 trained citizen scientists, working in groups of 2 or 3, collected 2,097 datasets from 150 rivers and streams in urban and periurban areas in Buenos Aires, Curitiba, São Paulo, Rio de Janeiro, Mexico City and Vancouver as part of the FreshWater Watch programme. Measurements were repeated bimonthly or quarterly in the same sample sites, assigned by project scientists to cover urban, periurban and quasi-rural streams that were not being monitored by local authorities. Additional sites (13%) were self-selected by participants. The locations and measurements obtained at each sample are available at: freshwaterwatch.thewaterhub.org/content/data-map and the site locations are presented in Fig 1.
Basemap image reprinted from Esri under a CC BY license with permission from Esri and its licensors, original copyright June 2009.
Each dataset contained observations and measurements of ecosystem conditions, hydrology and water quality, collected using a consistent methodology and uploaded directly on to the online database available. General ecosystem conditions included observations of the land use/cover in the immediate surroundings of the sampling site, visible evidence of pollution sources (e.g. discharge pipes) and estimates of their potential sources (urban or road runoff/drainage, residential, industrial, other) and the presence of bankside vegetation at the sampling point. These observations were limited to the immediate area of the sampling site, in general less than 25 m in both directions. Hydrological conditions were assessed using categorical estimates of water flow. Menu-based observations of water colour, the presence of pollution features (oil, foam, litter) and algal blooms were also recorded for each site and supported by a photographic documentation .
Measurements of dissolved phosphate (P-PO4) and nitrate (N-NO3) concentrations were performed from unfiltered samples using colorimetric methods. The method allowed for in-situ estimates of dissolved nutrients with exposure to reagents occurring within closed sample tubes, a method appropriate for a mass citizen science programme. Total nutrient concentrations could not be measured in the field by citizen scientists due to digestion and laboratory analysis requirements. Phosphate concentrations were estimated using inosine enzymatic reactions in seven specific ranges from 0.02 mg L-1 to 1.0 mg L-1 P-PO4 [30, 31]. Nitrate concentrations were measured using N-(1-napthyl)-ethylenediamine  in seven specific ranges from 0.2 mg L-1 to 10 mg L-1 N-NO3.
Field methods were tested against laboratory methods  and calibrated sensors using standard solutions and natural water samples. Duplicate and triplicate measurements were made during training and quarterly quality control analysis. Variability between different citizen scientists in the same waterbodies (on the training days) was assessed. All data were cross-checked against specified criteria. If an inconsistent measurement was found, the citizen scientist who collected the dataset was notified and asked to confirm, delete or correct the measurement.
Datasets were time and geo-coded either using a dedicated smartphone app or online using measured geographic coordinates or Google maps. After uploaded to a common database, all data were checked by project scientists. All participants were trained to use consistent data acquisition methods by professional scientists in field-based training days and were required to pass an online training quiz before being able to upload data. Written instruction sheets were provided with each testing kit and a training video was used to remind participants of the appropriate methods.
Phosphate and nitrate concentrations were averaged within individual sub-basins (L10 HydroBasin) to determine a sub-basin average. Of the 97 original sub-basins, only those with more than 10 measurements were used for the analysis (n = 57). The average number of measurements was 34 per sub-basin, covering typically quarterly measurements of 3 streams per basin, with an average sub-basin area of 146 km2. The experimental unit of all further analysis is that of the sub-basins.
Microscale information on the number of pollution sources (eg. industrial, residential, road discharge) observed and recorded by the citizen scientists during each measurement was summed to create an index of point pollution sources for each sampling site. Observations of site-adjacent land use/cover were recorded individually and aggregated into three categories based on assumed potential impact (0-forest, 1-urban park, grassland/pasture 2-agricultural, industrial, and/or urban residential). Observations of site specific bank vegetation were divided into vegetated (1) and non-vegetated values (0). All values were averaged across sub-basins.
Macroscale and microscale sub-basin averages were compared to nutrient concentrations using non-parametric tests (Mann-Whitney U, Spearman’s rank correlation). Correlations above 0.6 were considered strong following Tukey’s guidelines and multiple hypotheses corrections (Bonferroni) for significance were utilised.
Receiver operating characteristic (ROC) analysis was used to identify possible thresholds for macroscale and microscale characteristics with respect to elevated nutrient concentrations [34, 35]. ROC analysis is commonly used to understand the performance of a binary classifier , in this case, sub-basins with eutrophic conditions based on P-PO4 concentrations. A single concentration limit for eutrophication is difficult to determine and will depend on the local geological, climate and groundwater conditions . Nevertheless, we used a P-PO4 concentration of 0.1 mg L-1 for rivers and streams to discriminate sub-basins with eutrophic conditions [38, 39]. We calculated area under the ROC curve to compare the explanatory characteristics (specificity and sensitivity) of individual microscale and macroscale variables using SPSS (version 21). Only those variables that were found to have statistically significant rho were used in the ROC analysis, considering a Bonferroni-corrected significance of 0.002 (0.05/24).
Nutrient concentrations were log transformed for multiple linear regression analysis (kurtosis & skewness outside the range of –1.0 to 1.0). Multiple linear regression (backward step regression, removal criteria for probability of F < 0.05.) with SPSS (version 21) was used to identify which microscale and macroscale variables, or combination of variables, contribute to elevated P-PO4 and N-NO3 concentrations. Those variables that were found to have statistically significant rho, corrected for multiple hypotheses, were used in the regression analysis. Multicollinearity of variables was identified using a Variance Inflation Factor above 2.5 and noted in the results. Partial correlations and homoscedasticity were also checked. Models were evaluated based on the highest adjusted R2.
Nutrient concentrations varied greatly across sub-basins and cities (Fig 2, S1 Table), with the highest concentrations (means and medians) in sub-basins in Mexico City, Rio de Janeiro and São Paulo, and the lowest in Curitiba and Vancouver.
Bars indicate 2 standard errors (SE).
Sub-basins had differences in land cover, from highly urban to rural with a very low population density. São Paulo had the largest coverage of impermeable surfaces and population density (S1 Table). AHWS was highest in Rio de Janeiro, São Paulo and Mexico. Nutrient loading was greatest in São Paulo and livestock density was most elevated in Curitiba.
Average stream order was 1.6 (±0.8) with sub-basins in São Paulo having the highest stream order. Average rainfall on the day of sampling was 3.5 (± 3.1) mm/day with sub-basins in Curitiba having the highest average daily rainfall. Sub-basin areas averaged 146 (± 77) km2 with the smallest basins in São Paulo and the largest in Rio de Janeiro.
On a microscale, the average number of observed site-specific pollution sources (discharges) was 0.8 per measurement site (S1 Table), with residential and urban/road sources being most often identified (Fig 3a). Vegetated stream banks were observed in 93% of the sampling sites, and microscale land cover was mostly urban residential followed by urban park (Fig 3b).
High correlations (rho > 0.6, n = 57) between basin averaged phosphate concentrations and nutrient loading (and organic matter) were observed (Table 1). Interestingly, the observed sum of microscale pollution sources was the best covariate of phosphate concentration, with a correlation coefficient of 0.70. Moderate correlations (rho from 0.4 to 0.6) of phosphate concentrations with macroscale characteristics of impervious land cover, AHWS and population density were found. Moderate correlations with microscale variables (bank vegetation cover and land use/cover) were also evident (Table 1).
*AHWS refers to the Adjusted Human Water Security, . **Significant p-values, considering multiple hypotheses are below 0.002 (Bonferroni correction).
Macro and microscale influences on nitrate concentrations were lower, with no high correlations and limited variables with moderate correlations. These were limited to two macroscale variables: cropland land cover fraction (moderate) and population density (low).
Receiver operating characteristic (ROC) analysis for a phosphate concentration limit of 0.1 mg L-1 suggested that macroscale characteristics (phosphorus loading) and microscale data (sum of pollution sources) provided significant estimates (p<0.01) with greater than 0.80 area under the ROC curve. The sum of pollution sources provided a slightly higher area under the curve (0.89) with respect to phosphorus loading (0.84), where a perfect classifier has an area of 1 (percentage of total area) and a poor classifier has an area of 0.50 . The determination of area was insensitive to the relative distribution of the two classes, eutrophic and non-eutrophic. Using a value of 0.75 for sensitivity (true positive rate) and 0.25 (1–0.75) for specificity (false positive rate), the threshold for phosphate loading was 0.975 (Fig 4). Using a value of 0.75 for sensitivity (true positive rate) and 0.15 (1–0.85) for specificity (false positive rate), the threshold for the sum of local pollution sources was 0.855.
The x-axis is denominated as 1 minus Specificity or the false positive rate.
The macro and microscale data, combined using multiple linear regression, did not show a strong relationship with nitrate (log10 transformed to reduce skewness). Phosphate concentrations (log10 transformed to reduce skewness) showed a stronger relationship with both macroscale and microscale data (Table 2). Considering the former, a combination of macroscale phosphate loading (Phosphate loading) estimates and AHWS provided the best model, with a moderate correlation in explaining the variability of phosphate concentrations in each sub-basin (adjusted R2 of 0.31, Table 2). The results of the model showed some collinearity as tolerance was 0.251 and the distribution of the standardised residuals was skewed towards higher predicted values (signs of heteroscedasticity). Partial correlations showed that the relationships between each variable and P-PO4 concentrations (log transformed) remained significant, controlling for the effects of the other variable. The use of only microscale observations (sum of pollution sources and bank vegetation) provided better explanatory power, reaching an adjusted R2 of 0.47, with a much higher tolerance of 0.847, a scatter plot of standardised residuals showing no trend and partial correlations remaining significant.
Combining both microscale and macroscale information allowed for a large improvement over the use of macroscale variables alone, and a small improvement over using microscale variables alone (Table 2). The resulting models showed low collinearity between parameters (tolerance between 0.8 and 0.9) and there was no serial correlation among the residuals (Durbin-Watson). Cooks distances were all below 0.2 and the scatterplot of standardised residuals showed no trends.
Nitrate concentrations were not well correlated to either macroscale or microscale land cover/use variables using the sub-basin averages in the six study areas. Previous studies show correlations between nitrate and agriculture land cover [40, 41] for large river networks. The weaker correlations in our study for nitrate compared to phosphate may have resulted from the more dynamic nature of nitrate cycling (with respect to phosphate) in small waterbodies. The study areas covered a range of climate conditions, with large differences in temperature, residence time, oxygen conditions, groundwater inputs and surface temperatures, all with important impacts on dissolved nitrate dynamics [42–44]. As these variables were not evaluated in the present study, key drivers of nitrate dynamics were left out of the present analysis.
Phosphate concentrations showed important correlations to both macroscale and microscale variables. The positive relationship between phosphate concentrations and macroscale descriptors, based on low resolution global land cover data, confirmed the usefulness of satellite based land cover data to study aquatic systems conditions. These globally available data allowed for a good estimate of the variability of phosphate concentrations across a range of river environment and climate conditions. These databases, and in particular AHWS, were developed to examine broad patterns of water quality for large river networks (stream order > 5, ). It is interesting that they were successful when focused on relatively low stream order systems. It is expected that higher resolution, more current land cover datasets would provide better results. Such information, if available on a global scale, would greatly improve our capacity to explore basin scale impacts on freshwater ecosystems across biomes. At present, most large scale studies are limited to temperate areas .
Average sub-basin phosphate concentrations ranged from 0.39 mg L-1 in one sub-basin in Rio de Janeiro to 0.018 mg L-1 in sub-basins in Curitiba and Vancouver, with an average of 0.15 mg L-1. This matches well with the average P-PO4 concentration for all river and streams in the 30 cities of FreshWater Watch: 0.15 mg L-1 from August 2013 to April 2016 (n = 7,646). It also matches well with the average P-PO4 concentration reported for all surface waters (including lakes) in the EPA NRSA/Storet database (n = 105,347, United States only, data from 1992 to 2009); 0.11 mg L-1. Therefore, the nutrient concentrations across our study streams spanned the range reported in existing international datasets, suggesting our findings are applicable to urban and periurban aquatic systems globally.
Considering phosphate as the main driver of eutrophication within the sub-basins, a macroscale phosphate loading threshold of 0.975 (standardised units) was shown by ROC analysis to provide a good separation of basins with more eutrophic conditions. This indicates that basins with a loading above 0.975 were correctly identified as eutrophic (exceeding 0.1 mg L-1 P-PO4) 75% of the time, and incorrectly identified as being below the P-PO4 limit only 25% of the time. Of the 57 sub-basins analysed, 29 (51%) had an average phosphate loading below this threshold. These were present in Buenos Aires, Curitiba and Vancouver. No threshold for the AHWS index could be identified that provided both acceptable specificity and sensitivity. Combining both phosphate loading and AHWS, regression analysis showed that 31% of the variability of the phosphate concentrations could be explained. Phosphorus loading was the most important variable, as seen by both the standardised coefficients and partial correlations. It should be noted that macroscale land use/cover data showed an elevated covariance (eg. partial correlations), a natural consequence of the gradients considered in the AHWS analysis and the link between anthropogenic and natural landscape gradients .
Microscale data significantly improved our capacity to explain the variance in phosphate concentrations across sub-basins, taken separately as well as in combination with macroscale data. As an individual microscale variable, the observed number of pollution sources provided the most explanatory power, while information on bankside vegetation was also found to provide moderate correlation. This supports studies regarding the importance of reducing residential discharges and fertiliser use in controlling stream nutrient conditions [46, 47, 41]. Using a phosphate concentration limit of 0.1 mg L-1, a microscale pollution source threshold (sum of pollution sources) of 0.855 allowed for a statistically significant separation of basins with more eutrophic conditions. This threshold was surpassed in 64%, 56%, 100%, 100%, 91% and 5%, of the sub-basins in Buenos Aires, Curitiba, Mexico City, Rio de Janeiro, São Paulo and Vancouver respectively. Effectively, this means that the observation of a pollution source near 86% of the sampling sites in a sub-basin was sufficient to accurately classify that sub-basin as eutrophic (> 0.1 P-PO4 mg L-1).
The resulting threshold indicates the importance of (typically) under-monitored and unidentified discharges in lower order rivers. Residential discharges (outfalls) and urban discharges were the most common in the study sub-basins (Fig 3a). There are few studies addressing the impact of residential land use near streams and rivers, and those that do (eg. ) are limited to modern residential developments where these discharges are less common. The identification of microscale point sources by trained local community members improves stakeholder capacity to explain the spatial variability of algal blooms and other impacts of eutrophication. Furthermore, this information provides stakeholders with opportunities to address local (and more manageable) drivers of ecosystem degradation. Experiments using trained community members to monitoring outfalls are underway in several areas in the UK (eg. Thames 21 ).
The negative relationship between bank vegetation and phosphate concentrations indicated that rivers and streams in the study areas with vegetated buffers had lower phosphate concentrations. These data did not allow for the determination of the buffering capacity of vegetated banks (no significant ROC threshold), but do lend weight to the role of vegetated buffer strips in reducing surface and subsurface inputs of nutrients into streams [50–52]. The importance of bank vegetation was less than that of pollution sources, from standardised coefficients and partial correlations, but still significant in explaining phosphate concentrations. The regression with both microscale variables explained nearly half of the variability in phosphate concentrations.
The observation of microscale land use/cover in the immediate sampling area provided limited interpretative power, indicating that microscale information is less important than macroscale land use/cover conditions in the study areas. This was demonstrated by a lower rho with respect to macroscale attributes (eg. impervious land cover fraction) and no statistically significant relationship between the microscale land use/cover parameter and nutrient concentrations using the ROC analysis. This result confirmed studies that show that macroscale land cover information provides better explanatory information in heavily modified areas with limited spatial diversity . In undisturbed sub-basins, we would expect that stream nutrient concentrations may be more sensitive to microscale land use/cover differences .
Adding microscale information improved our overall understanding of the variability of phosphate concentrations compared to using macroscale information alone, with an increase in the adjusted R2 from 0.34 to 0.51, with a 13% reduction in the standard error (Table 2). All variables appear to have a similar importance in the regression equation (based on standardized regression coefficients). The tolerance of the macroscale phosphorus loading and AHWS confirmed the expected correlation between these variables. Integrating information from these two sources lends weight to ongoing studies of microscale data to model river water quality . The improvement made by introducing macroscale variables to explain the variability in sub-basin phosphate conditions was limited (2% improvement in adjusted R2), and while topographic factors (not explored here) have been shown to be important on a microscale , we show that observational microscale variables provide important tools to identify nutrient conditions.
A number of macroscale variables underperformed with respect to their expected importance. Meteorology typically plays an important role in modifying nutrient concentrations. However, sampling day precipitation was not found to influence the variability of average phosphate concentrations. This may have resulted from the variable lag times for precipitation and runoff in the range of streams examined and the low resolution of the precipitation data (1.0° x 1.0°). It should also be noted that citizen acquired data may contain a bias towards sampling in non-rain conditions (for comfort and safety considerations). Interestingly, the average daily rainfall on sampling days was similar to the average daily rainfall for each study area (except for São Paulo), indicating that this bias was relatively low. However, a bias towards a reduced frequency sampling during heavy rain events is inherent in citizen based data acquisition in rivers and streams. It should be noted that for studies on nutrient dynamics, it would be advisable to use consistent lag times between rain events and data acquisition by citizen scientists with respect to whether first flush or base flow conditions are desired.
Interestingly, average stream order was not found to be an important driver of nutrient concentrations. This may be due to the similarity between the sub-basins examined, with mean stream order below 2, except Rio de Janeiro. Finally, sub-basin area did not significantly influence nutrient concentrations, contrary to studies which show the importance of basin area on nutrient concentrations [56, 22]. As most study streams were ungauged, it was not possible to normalise measurements using stream discharge or base flow conditions.
The present study focused on the use of repeated “spot” measurements of dissolved nutrient concentrations to explore the spatial variability of river basin conditions. We recognise that continuous or integrated measurements of nutrient concentrations or their impact would provide better information on nutrient dynamics. Biotic measurements and in-stream sensors provide more complete information, but may not always be appropriate for mass citizen science based measurements due to elevated cost (sensors) and time/training requirements (biological measurements) compared to grab samples or spot measurements . These measurement approaches (biotic, sensor and bio-optical/chemical) are complementary, allowing for a range of participation (and training requirements).
Eutrophication of surface waterbodies presents an important challenge to decision makers, which is compounded by insufficient information on ecosystem conditions and potential drivers. In the present study, we used information gathered by thousands of trained citizen scientists to explore the spatial variability of nutrient concentrations and microscale conditions of stream basins. Combining macroscale and microscale data increased our capacity to explain the variability of phosphate concentrations on a sub-basin scale. Integrating information acquired by trained citizen scientists with global datasets of land use/cover represents a new approach to explore factors that control water quality and is a key step towards managing the drivers of its degradation.
Macroscale national and international data help broadly define conditions across basins and can identify potential tipping points. In turn, microscale information is important for evaluating potential point sources of pollution and the presence of riparian buffer areas to mitigate non-point source runoff. The participation of active and informed citizens allows for a greater temporal frequency of data acquisition, while also allowing for rapid identification of changes in ecosystems before they expand into more widespread impacts. Thresholds for microscale point sources (eg., discharges) can be used in the design of early alert systems and long term monitoring processes. The identification of microscale point sources and the condition of bank vegetation provides stakeholders and local authorities with high resolution information to improve control of key drivers of ecosystem degradation.
The present study focused on predominantly smaller rivers and streams which were previously unmonitored. This is a consistent pattern internationally as smaller order streams, greater in number and in length  than larger rivers, are not regularly monitored. There is a clear need to increase data gathering in these spatially disperse ecosystems and the use of trained citizen scientists is one promising method to generate complementary data to government agency monitoring schemes. Microscale conditions are likely to have a larger influence on the conditions of smaller ecosystems, with respect to their larger counterparts [59, 60]. This is further justification of the integration of community based monitoring within sub-basin scale programmes. In this study, field observations and measurements made by citizen scientists were found to provide complementary information to coarser scale global data in showing patterns of water quality across a range of climate and ecological conditions.
S1 Table. Average and standard deviation of the study sub-basin characteristics by city (see Methods for data sources, AHWS refers to the Adjusted Human Water Security, values missing standard deviation indicate that all sub-basin values of land cover were equal).
We sincerely acknowledge the efforts of the citizen scientists in the HSBC Water Programme for providing enthusiasm and fundamental data gathering. We would like to thank the efforts of Luis Felipe Velasquez and Eva Pintado Castilla for their assistance in the development of the global information system and three anonymous reviewers for their constructive comments on the manuscript.
- Conceptualization: SL DC.
- Formal analysis: SL DC SS EV LR.
- Investigation: DC SS PB LR.
- Methodology: SL AB.
- Resources: AB EH.
- Validation: DC SS EV LR.
- Visualization: SS.
- Writing – original draft: SL DC LR.
- Writing – review & editing: LR SS PB AB EH.
- 1. World Water Assessment Program. The United Nations World Water Development Report. UNESCO. 2009. [pdf], Available at: http://webworld.unesco.org/water/wwap/wwdr/wwdr3/pdf/WWDR3 Water in a Changing World.pdf], Accessed 29 January 2016.
- 2. Anderson NJ, Dietz RD, Engstrom DR. Land-use change, not climate, controls organic carbon burial in lakes. Proceedings of the Royal Society B: Biological Sciences. 2013; 280:1769.
- 3. Gücker B, Silva RCS, Graeber D, Monteiro JAF, Brookshire ENJ, Chaves RC, et al. Dissolved nutrient exports from natural and human-impacted Neotropical catchments. Global Ecology and Biogeography. 2016;25:378–390. “In press”.
- 4. Fitzpatrick ML, Long DT, Pijanowski BC. Exploring the effects of urban and agricultural land use on surface water chemistry, across a regional watershed, using multivariate statistics. Applied Geochemistry. 2007;22:1825–1840.
- 5. Morgan RP, Kline KM, Cushman SF. Relationships among nutrients, chloride and biological indices in urban Maryland streams. Urban Ecosystems. 2007;10:153–166.
- 6. Pouyat RV, Pataki DE, Belt KT, Groffman PM, Hom J, Band LE. Effects of urban land-use change on biogeochemical cycles. In Terrestrial ecosystems in a Changing World. Springer Berlin Heidelberg; 2007:45–58.
- 7. Duan HL, Feng L, Ma R, Zhang Y, Loiselle SA. Variability of particulate organic carbon in inland waters observed from MODIS Aqua imagery. Environmental Research Letters. 2014;9:1–10.
- 8. Jarvie HP, Sharpley AN, Spears B, Buda AR, May L, Kleinman PJ. Water quality remediation faces unprecedented challenges from “legacy phosphorus”. Environmental science & technology. 2013;47(16):8997–8998.
- 9. Weijters MJ, Janse JH, Alkemade R, Verhoeven JTA. Quantifying the effect of catchment land use and water nutrient concentrations on freshwater river and stream biodiversity. Aquatic Conservation: Marine and Freshwater Ecosystems. 2009;19(1):104–112.
- 10. Evans-White MA, Dodds WK, Huggins DG, Baker DS. Thresholds in macroinvertebrate biodiversity and stoichiometry across water-quality gradients in Central Plains (USA) streams. Journal of the North American Benthological Society. 2009;28(4):855–868.
- 11. Steffen K, Becker T, Herr W, Leuschner C. Diversity loss in the macrophyte vegetation of northwest German streams and rivers between the 1950s and 2010. Hydrobiologia. 2013;713(1):1–17.
- 12. Rosemond AD, Benstead JP, Bumpers PM, Gulis V, Kominoski JS, Manning DW, et al. Experimental nutrient additions accelerate terrestrial carbon loss from stream ecosystems. Science. 2015;347(6226):1142–1145. pmid:25745171
- 13. Tu J. Spatial variations in the relationships between land use and water quality across an urbanization gradient in the watersheds of northern Georgia, USA. Environmental management. 2013;51:1–17. pmid:21858555
- 14. Mykra H, Heino J, Muotka T. Scale-related patterns in the spatial and environmental components of stream macroinvertebrate assemblage variation. Global Ecology and Biogeography. 2007;16:149–159.
- 15. Feld CK. Response of three lotic assemblages to riparian and catchment-scale land use: implications for designing catchment monitoring programmes. Freshwater Biology. 2012;58:715–729.
- 16. Liu S, Xie G, Wang L, Cottenie K, Liu D, Wang B. Different roles of environmental variables and spatial factors in structuring stream benthic diatom and macroinvertebrate in Yangtze River Delta, China. Ecological Indicators. 2016;61:602–611.
- 17. Downing JA, Prairie YT, Cole JJ, Duarte CM, Tranvik LJ, Striegl RG, et al. The global abundance and size distribution of lakes, ponds, and impoundments. Limnology and Oceanography. 2006;51:2388–2397.
- 18. Silvertown J. A new dawn for citizen science. Trends in ecology & evolution. 2009;24: 467–471.
- 19. Castilla EP, Cunha DGF, Lee FWF, Loiselle S, Ho KC, Hall C. Quantification of phytoplankton bloom dynamics by citizen scientists in urban and peri-urban environments. Environmental monitoring and assessment. 2015;187(11):1–11.
- 20. Busch J, Price I, Jeauson E, Zielinski O, Woerd HJvd. Citizens and satellites: Assessment of phytoplankton dynamics in a NW Mediterranean aquaculture zone. International Journal of Applied Earth Observation and Geoinformation. 2016;47:40–49;
- 21. Conrad CC, Hilchey KG. A review of citizen science and community-based environmental monitoring: issues and opportunities. Environmental Monitoring and Assessment. 2011;176:273–291. pmid:20640506
- 22. Peterson EE, Sheldon F, Darnell R, Bunn SE, Harch BD. A comparison of spatially explicit landscape representation methods and their relationship to stream condition. Freshwater Biology. 2011;56(3):590–610.
- 23. Yates AG, Brua RB, Corriveau J, Culp JM, Chambers PA. Seasonally Driven Variation in Spatial Relationships Between Agricultural Land Use and In-Stream Nutrient Concentrations. River Res. Applic., 2014; 30:476–493.
- 24. Zhang T, Yang J. Predicting Nitrogen Loading With Land-Cover Composition: How Can Watershed Size Affect Model Performance Environmental Management 2013;51: 96–107 pmid:22773114
- 25. Lehner B, Grill G. Global river hydrography and network routing: baseline data and new approaches to study the world’s large river systems. Hydrological Processes. 2013;27:2171–2186.
- 26. Potter P, Ramankutty N, Bennett EM, Donner SD. Global Fertilizer and Manure, Version 1: Phosphorus Fertilizer Application. Palisades, NY, NASA Socioeconomic Data and Applications Center (SEDAC), NASA. 2001. [Internet]. Available at: http://sedac.ciesin.columbia.edu/data/set/ferman-v1-phosphorus-fertilizer-application, Accessed 29th January, 2016.
- 27. Center for International Earth Science Information Network—CIESIN—Columbia University. Gridded Population of the World, Version 4 (GPWv4), Preliminary Release 2 (2010). Palisades, NY. 2014. Available at: http://www.ciesin.columbia.edu/data/gpw-v4. Accessed 26 March 2015.
- 28. Schneider U, Becker A, Finger P, Meyer-Christoffer A, Rudolf B, Ziese M. GPCC Full Data Reanalysis Version 6.0 at 1.0°: Monthly Land-Surface Precipitation from Rain-Gauges built on GTS-based and Historic Data. 2011. https://doi.org/10.5676/DWD_GPCC/FD_M_V7_100
- 29. Vörösmarty CJ, McIntyre PB, Gessner MO, Dudgeon D, Prusevich A, Green P, et al. Global threats to human water security and river biodiversity. Nature. 2010;467: 555–561. pmid:20882010
- 30. Strickland JD, Parsons TR. A Practical Handbook of Seawater Analysis, Bull. Fish. Res. Bd. Can; 1968.
- 31. Berti G, Fossati P, Tarenghi G, Musitelli C, Melzi d’Eril GV. Enzymatic colorimetric method for the determination of inorganic phosphorus in serum and urine. Clinical Chemistry and Laboratory Medicine, 1988;26(6):399–404.
- 32. Adeloju SB. Progress and recent advances in phosphate sensors: A review. Talanta 2013;114:191–203. pmid:23953460
- 33. American Public Health Association (APHA), American Water Works Association (AWWA), Water Environment Federation (WEF); 2012. Standard Methods for the Examination of Water and Wastewater, 22nd Ed., Washington, D.C.
- 34. Mann HB, Whitney RD. On a test of whether one of two random variables is stochastically larger than the other. The annals of mathematical statistics. 1947;18(1):50–60.
- 35. Greiner M, Pfeiffer D, Smith RD. Principles and practical application of the receiver-operating characteristic analysis for diagnostic tests. Preventive veterinary medicine 2000;45(1):23–41.
- 36. Morrison AM, Coughlin K, Shine JP, Coull BA, Rex AC. Receiver operating characteristic curve analysis of beach water quality indicator variables. Applied and environmental microbiology, 2003;69(11):6405–6411. pmid:14602593
- 37. Hinsby K, Markager S, Kronvang B, Windolf J, Sonnenborg TO, Thorling L. Threshold values and management options for nutrients in a catchment of a temperate estuary with poor ecological status. Hydrology and Earth System Sciences. 2012;16:2663–2683.
- 38. Hutchinson GE. A treatise on limnology. John Wiley and Sons: New York; 1957.
- 39. Mount DI, Hansen DJ, Gentile JR, Chapman GA, Brungs WA. Guidelines for deriving numerical national water quality criteria for the protection of aquatic organisms and their uses. Washington DC: United States Environmental Protection Agency, Office of Research and Development. 1985. [Internet]. Available at: http://water.epa.gov/scitech/swguidance/standards/criteria/aqlife/upload/85guidelines.pdfAccessed 29 January 2016.
- 40. Seitzinger SP, Styles RV, Boyer EW, Alexander RB, Billen G, Howarth RW, et al. Nitrogen retention in rivers: model development and application to watershed in the northeastern USA. Biogeochemistry. 2002;1:199–237.
- 41. Álvarez-Cabria M, Barquín J, Peñas FJ. Modelling the spatial and seasonal variability of water quality for entire river networks: Relationships with natural and anthropogenic factors. Science of The Total Environment. 2016; 545:152–162. pmid:26745301
- 42. Harrison JA, Maranger RJ, Alexander RB, Giblin AE, Jacinthe PA, Mayorga E, et al. The regional and global significance of nitrogen removal in lakes and reservoirs. Biogeochemistry. 2009;93:143–157.
- 43. Bailey RT, Ahmadi M. Spatial and temporal variability of in-stream water quality parameter influence on dissolved oxygen and nitrate within a regional stream network. Ecological Modelling. 2014;277:87–96.
- 44. Oliver AA, Dahlgren RA, Deas ML. The upside-down river: Reservoirs, algal blooms, and tributaries affect temporal and spatial patterns in nitrogen and phosphorus in the Klamath River, USA. Journal of Hydrology. 2014;519:164–176.
- 45. Allan JD. Landscapes and riverscapes: the influence of land use on stream ecosystems. Annual review of ecology, evolution, and systematics. 2004;257–284.
- 46. La Valle PD. Domestic sources of stream phosphates in urban streams. Water Res. 1975;9:913–915.
- 47. Waschbusch RJ, Selbig WR, Bannerman RT. Sources of phosphorus in stormwater and street dirt from two urban residential basins in Madison, Wisconsin, 1994–95. U.S. Geological Survey Water-Resources Investigations Report. 1999;51:99–4021.
- 48. Conine A, Porter-Goff E, Frost PC. Phosphorus export from a small forested stream: Are there effects from human residential development in the riparian zone?. Fundamental and Applied Limnology/Archiv für Hydrobiologie. 2015;187(1):55–62.
- 49. Thames21 NEW! Outfall monitoring training; Thames21 2016. [Internet]. Available at: http://www.thames21.org.uk/event/new-outfall-monitoring-training/ Accessed 29 January 2016.
- 50. Osborne LL, Kovacic DA. Riparian vegetated buffer strips in water-quality restoration and stream management. Freshwater biology. 1993;29:243–258.
- 51. Reddy KR, Kadlec RH, Flaig E, Gale PM. Phosphorus retention in streams and wetlands: a review. Critical Reviews in Environmental Science and Technology. 1999;29:83–146.
- 52. Pfeifer LR, Bennett EM. Environmental and social predictors of phosphorus in urban streams on the Island of Montréal, Québec. Urban Ecosystems. 2011;14:485–499.
- 53. Tudesque L, Tisseuil C, Lek S. Scale-dependent effects of land cover on water physico-chemistry and diatom-based metrics in a major river system, the Adour-Garonne basin (South Western France). Science of the Total Environment. 2014; 466:47–55. pmid:23892023
- 54. Amiri BJ, Nakane K. Modeling the linkage between river water quality and landscape metrics in the Chugoku district of Japan. Water resources management. 2009; 23(5):931–956.
- 55. Singh S, Chang H. Effects of Land Cover Change on Water Quality in Urban Streams at Two Spatial Scales. International Journal of Geospatial and Environmental Research. 2014;1(1):8.
- 56. King RS, Baker ME, Whigham DF, Weller DE, Jordan TE, Kazyak PF, et al. Spatial considerations for linking watershed land cover to ecological indicators in streams. Ecological applications. 2005;15(1):137–153.
- 57. Biggs J, Ewald N, Valentini A, Gaboriaud C, Dejean T, Griffiths RA, et al. Using eDNA to develop a national citizen science-based monitoring programme for the great crested newt (Triturus cristatus). Biological Conservation. 2015;183:19–28.
- 58. Strahler AN. Quantitative analysis of watershed geomorphology. Eos, Transactions American Geophysical Union, 1957; 38(6):913–920.
- 59. Biggs BJ, Tuchman NC, Lowe RL, Stevenson RJ. Resource stress alters hydrological disturbance effects in a stream periphyton community. Oikos. 1999;85:95–108.
- 60. Oertli B, Joye DA, Castella E, Juge R, Cambin D, Lachavanne JB. Does size matter? The relationship between pond area and biodiversity. Biological conservation. 2002;104:59–70.