Citizen science reveals unexpected solute patterns in semiarid river networks

Human modification of water and nutrient flows has resulted in widespread degradation of aquatic ecosystems. The resulting global water crisis causes millions of deaths and trillions of USD in economic damages annually. Semiarid regions have been disproportionately affected because of high relative water demand and pollution. Many proven water management strategies are not fully implemented, partially because of a lack of public engagement with freshwater ecosystems. In this context, we organized a large citizen science initiative to quantify nutrient status and cultivate connection in the semiarid watershed of Utah Lake (USA). Working with community members, we collected samples from ~200 locations throughout the 7,640 km2 watershed on a single day in the spring, summer, and fall of 2018. We calculated ecohydrological metrics for nutrients, major ions, and carbon. For most solutes, concentration and leverage (influence on flux) were highest in lowland reaches draining directly to the lake, coincident with urban and agricultural sources. Solute sources were relatively persistent through time for most parameters despite substantial hydrological variation. Carbon, nitrogen, and phosphorus species showed critical source area behavior, with 10–17% of the sites accounting for most of the flux. Unlike temperate watersheds, where spatial variability often decreases with watershed size, longitudinal variability showed an hourglass shape: high variability among headwaters, low variability in mid-order reaches, and high variability in tailwaters. This unexpected pattern was attributable to the distribution of human activity and hydrological complexity associated with return flows, losing river reaches, and diversions in the tailwaters. We conclude that participatory science has great potential to reveal ecohydrological patterns and rehabilitate individual and community relationships with local ecosystems. In this way, such projects represent an opportunity to both understand and improve water quality in diverse socioecological contexts.

Introduction Agriculture, wastewater, and fossil fuel use have exceeded global thresholds for nitrogen (N) and phosphorus (P) [1][2][3][4], resulting in eutrophication of two-thirds of freshwater ecosystems globally [5][6][7][8]. Excess nutrients and other water pollutants such as heavy metals and waste from humans and livestock degrade aquatic ecosystem integrity, leading to trillions of USD in economic damages and the premature death of approximately 2 million people annually [9][10][11][12]. Mitigating these global water crises will require improved monitoring and management, which themselves depend on public understanding and financial support [13][14][15][16]. Consequently, improving public understanding and interaction with aquatic ecosystems is a planetary priority [17][18][19].
Because of the high spatiotemporal variability typical of both surface and subsurface aquatic ecosystems [20][21][22], identifying pollutant sources requires repeated sampling of many locations in the watershed [23][24][25]. This type of synoptic sampling provides a high-resolution view of water chemistry throughout the network, potentially generating insights into hydrological and biogeochemical processes [26][27][28], as well as identifying the location and spatial extent of pollutant sources [29][30][31]. Pollutant sources range in size from discrete point sources, such as a wastewater discharge, to diffuse nonpoint sources, such as runoff from agricultural fields [32,33]. They also vary in duration, from persistent sources that are always active to intermittent sources that only deliver pollutants to the ground or surface water during certain ecohydrological conditions [34][35][36].
While measuring or modeling water chemistry continually everywhere in the stream network remains impossible, a suite of ecohydrological metrics have been developed to inform management based on repeated synoptic sampling [1,23,27,29,37]. For example, the relationship between spatial variability and watershed size can reveal the patch size of processes driving water chemistry, revealing the relative importance of delivery from terrestrial environments and processing in aquatic environments [26,29,38,39]. Likewise, the persistence of spatial patterns in water chemistry through time can reveal changes in pollutant sources or sinks, informing the necessary sampling frequency [23,27,40,41]. In practice, high spatial persistence of a solute in a watershed allows for a single synoptic sampling event to representatively characterize its dynamics [23,29,40]. While synoptic sampling requires substantial resources and coordination, combining such metrics with traditional analysis of watershed land use and land cover could improve effectiveness of monitoring and restoration measures. So far, these analyses have primarily been done in temperate and high-latitude ecosystems [23,27,38,40], leaving important unknowns about other ecosystem types.
In semiarid and arid regions, the degradation of aquatic ecosystems has been particularly extreme because of high water demand and low water availability for green, blue, and gray water use [17,34,[42][43][44]. This combination often results in intense hydrological and chemical disruption of surface and groundwater in semiarid and arid watersheds, which are often endorheic [17,42,43,45]. In addition to anthropogenic pressure, aquatic ecosystems in semiarid regions are naturally dynamic because of extreme hydrological and biogeochemical variability [46][47][48]. Changes in precipitation and evapotranspiration result in large expansions and contractions of the surface water network, and wetting and drying cycles create heterogeneous biogeochemical conditions [49][50][51]. This combination of human and natural variability could create nutrient source and sink patterns that are distinct from temperate regions, potentially complicating the identification and mitigation of nutrient sources in heavily impacted semiarid watersheds.
In this context, we organized a series of participatory synoptic sampling events in the Utah Lake watershed in the western US. We used a citizen science approach for two reasons. First, waterbodies in this region are experiencing eutrophication and water overallocation [43, [52][53][54], partly because of a lack of public connection with local aquatic ecosystems [55]. For example, this disconnect has led to a lack of public will to implement wastewater treatment measures that could decrease delivery of bioavailable nutrients to Utah Lake [56,57] and even the consideration of a radical reengineering of the lake, including large artificial islands [58,59]. Second, the Utah Lake watershed is nearly 8,000 km 2 , making traditional synoptic sampling impractical and expensive. By collaborating with local nonscientists, we were able to sample nearly 200 locations throughout the watershed within a few hours, reducing variability from temporal changes in water flow and chemistry [27,60]. We focused on three major questions: 1. What are the primary sources of carbon and nutrients in the Utah Lake Watershed, 2. How much solute retention or release is there in the surface water network, and 3. What are the general patterns of solute chemistry in these semiarid river networks. Though we did not collect quantitative data on the attitudinal effects of participating in the samplings, we hypothesized that learning about and spending time in the diverse tributaries to Utah Lake would improve public awareness and proclivity to address local environmental issues [16,18,61,62]. Regarding the ecohydrological issues surrounding Utah Lake, we hypothesized that the complex human footprint and variable hydrology in this semiarid watershed would create low spatial persistence of pollutant sources and high critical source area behavior with a few influential areas disproportionately influencing water quality [29,31,33]. To test these hypotheses, we analyzed volunteer-collected samples for a broad suite of physicochemical parameters, calculating the ecohydrological metrics we describe below.

The Utah Lake watershed
Utah Lake is one of the largest freshwater lakes in western North America, with a surface area of 375 km 2 and a natural watershed area of 7,640 km 2 [54]. As a part of the Great Salt Lake watershed, Utah Lake is a remnant of Lake Bonneville, which covered up to 51,000 km 2 of what is now Utah, Nevada, and Idaho until about 15,000 years ago [63][64][65]. The watershed ranges from 1,368 to 3,586 MASL and is characterized by relatively pristine high-elevation headwaters in the Wasatch Mountains-although mining, livestock grazing, rural subdivisions, and ski resorts are present. Low elevation valleys have a mix of urban and suburban development interspersed with irrigated agricultural land, with an overall human footprint of up to 93% in the valley bottom. Utah has one of the fastest growing populations in the US [66], and agricultural land is increasingly being converted for suburban development. Seven wastewater treatment plants, serving approximately 600,000 people in the valley region, discharge treated effluent into tributaries to the lake (S1 Fig). In the watershed headwaters, the dominant geology is limestone and quartz [67]. The vegetation consists of mixed aspen, conifer, and maple forests at high elevations, transitioning to scrub oak and sagebrush at lower elevations. The high-elevation hydrology is complex, including gaining reaches (i.e., net flow of subsurface water into surface water flows) and losing reaches (i.e., net flow into subsurface water), particularly in areas of karst conduits and colluvial materials [68]. At the base of the mountains, where rivers flow into Lake Bonneville sediment deposits at the outer perimeter of the prehistoric lake, reaches become primarily losing, and water transport occurs largely through shallow groundwater flowpaths [69]. Near the lake, streams once again become gaining, mixing with new streams generated by natural and artificial recharge that feeds many springs flowing into the lake [70,71].

Sampling design
We classified the subwatersheds into one of four categories based on land use and hydrologic condition (Table 1): Agricultural unregulated (Spanish Fork River), Mixed dammed (Provo River), Mountain urban (American Fork River and Hobble Creek), and Valley tributaries (Benjamin Slough, Goshen Valley, Mill Race, and others). The Agricultural unregulated subwatersheds have been slower to experience rapid population growth and still remain mostly under agricultural uses. One subwatershed (Diamond Fork) receives artificially diverted flow from the nearby Strawberry River, but for most of the Agricultural unregulated subwatersheds, the in-stream hydrologic modifications are minimal. All the categories have some degree of flow infrastructure (e.g., check dams, channelized reaches, dikes, etc.), but Mixed dammed subwatersheds include two large reservoirs (capacities of 0.18 and 0.39 km 3 , respectively) that drastically alter the downstream hydrology. Land use in Mixed dammed subwatersheds is a mixture of both agriculture and urban. Mountain urban subwatersheds include high contrast land use and degree of impact, with mostly pristine headwaters before entering highly developed land use and consequently modified stream channels at low elevations. Valley tributaries include the groundwater sourced tributaries that originate near the lake as described above. These subwatersheds also include return flow from agricultural water diversions, and effluent from wastewater treatment plants and other industrial facilities. The Utah Lake watershed also includes the West desert region (Cedar Valley, Tickville Gulch, and Lake Mountain), which has only small, remote ephemeral springs that are largely inaccessible. These subwatersheds were excluded from sampling because they are not connected to the lake via surface water. We initially selected 500 sampling sites from the Ambient Water Quality Monitoring System (AWQMS), a database curated by the Utah Division of Water Quality, and the Utah State University Water Quality Extension citizen science program, Utah Water Watch. We used a clustering technique by including sites just above and below a confluence to maximize watershed coverage and minimize travel distances. We consolidated the initial 500 sites to 270 by merging redundant locations and removing inaccessible sampling locations (e.g., difficult terrain or private ownership) and sites with no surface flow even during snowmelt. Nearly all remaining sites were publicly accessible, and for the few locations on private property, we obtained verbal permission from landowners. We used the application USGS StreamStats to delineate watersheds, calculate watershed areas (km 2 ), and estimate land use and land cover from the National Land Cover Database (NLCD) for 1992 and 2011. Land use was classified as forested, developed, impervious surface, or herbaceous upland for each watershed.

Participatory science
The practice of involving nonprofessionals in research (i.e., participatory or citizen science) has been extensively used in the natural and social sciences [16, 72,73]. While there are tradeoffs to participatory approaches, including less control over data acquisition and costs for participants, there can be substantial community and scientific benefits [73][74][75]. On the community side, citizen science can create educational opportunities and foster trust between researchers, regulators, policymakers, and the public [74,[76][77][78]. On the scientific side, citizen science can provide opportunities and value for researchers, including enabling novel experimental designs (e.g., the collection of hundreds of samples synchronously) and informing research priorities by improving researchers' awareness of local needs and policy priorities [10, 75,76,79].
We collaborated with an undergraduate course on watershed ecology to develop a multiyear participatory science program. This program included 2 public lectures, 7 community events (e.g., fairs, festivals, and educational evenings) where we made presentations or staffed interactive booths, and 6 synoptic samplings. From the beginning of the project, we partnered with existing water organizations, including the Provo River Watershed Council, Utah Water Watch, the Utah Lake Commission, the Utah Division of Water Quality, and the Utah Valley Earth Forum. These partnerships were pivotal in recruiting volunteers, designing the samplings, and disseminating the results. We additionally advertised through social media (Facebook, Twitter, and Instagram), email lists (including past participants), online videos, and paper fliers for approximately one month before each of the six sampling events. These advertisements targeted local university students, youth groups, environmental groups, and the broader community. At the community events, we presented a model watershed (EnviroScape, Herndon, VA) to demonstrate runoff and transport of pollutants represented by food coloring. In all our interactions with the public, we emphasized the historical context of human-water interactions in the watershed, the current ecological status of the lake system, and potential policy and personal actions to improve the health of the lake.

Sampling events
The synoptic sampling events were the central experiences for volunteers in the program. We carried out six sampling events in 2018 and 2019, but due to COVID-19 and other delays, the chemical analyses of the last three samplings have not yet been finalized. We report the results of the 2018 March (Spring), July (Summer), and October (Fall) samplings in this paper. Volunteers were invited to sign up in advance for the sites they wanted to sample on the project website (https://utahlakecollab.wixsite.com/utahlakecollab), choosing from an interactive online map of the 270 locations. This planning process encouraged volunteers to explore the entire watershed virtually before the event, looking at areas where they already had experiences and imagining locations they had not yet visited.
To reduce variability from sampling error, we used careful training, simplification of procedures, and replication of sampling (i.e., we asked multiple volunteers to collect samples from the same site so we could quantify sampling error). Other citizen science studies have found that when such precautions are taken, the benefit of using volunteers to collect large numbers of samples outweighs the tradeoffs of this kind of approach [80][81][82]. On the days of the samplings, we distributed informational flyers while volunteers waited to collect sampling kits and directions to sites. The flyers described the sampling procedure and provided context about water use and water pollution in the area (S2 Fig). Before distributing the sampling kits, we trained each volunteer individually, showing them how to collect water samples safely and reproducibly. We provided simple field sheets, where participants recorded site conditions and bottle numbers. After completing the sampling, participants returned their samples and datasheets to the distribution locations, where we performed quality control, noting samples with incomplete data or other irregularities (e.g., broken filters, partially filled bottles).
Because this research involved nonprofessional volunteers, we consulted the Institutional Review Board (IRB) at Brigham Young University (BYU), which oversees all research involving human subjects. We were informed that IRB approval was not needed because volunteers were not the subject of the research (i.e., we did not collect information about their identities or experiences). This limited our ability to quantitatively assess the community outcomes of the research but made participation less burdensome and improved inclusivity, particularly for members of the community unwilling to share personal information for philosophical or political reasons.

Laboratory analyses
Samples were filtered in the field with pre-rinsed 0.45 μm cellulose acetate filters into 60 ml high-density polyethylene bottles and immediately frozen or refrigerated until analysis (typically within 2 weeks of sampling

Ecohydrological metrics and statistical analyses
We calculated three recently developed ecohydrological metrics that indicate solute dynamics: spatial variance thresholds, subwatershed leverage, and spatial persistence [27,29]. Spatial variance thresholds indicate the predominant spatial scale where solute delivery or removal is occurring, analogous to the representative elemental area concept [84,85]. For each of the main tributaries (Table 1), we plotted the scaled (subtracted the mean and divided by the standard deviation) concentration for each solute against subwatershed area. Changes in variance with spatial scale can be caused by mixing of tributaries, in-stream processing, and changes in terrestrial-aquatic linkages. We used the pruned exact linear time (PELT) method to test for changes in variance, implemented in the changepoint package of R [86]. We hypothesized that the variance would decrease with spatial scale, following observations from other river networks [29, 38]. We calculated subwatershed leverage by multiplying the difference in subwatershed and watershed outlet nutrient concentrations with the ratio of subwatershed area to watershed outlet area [27,29]. Assuming similar specific discharge throughout the watershed, leverage indicates the amount of flux at the outlet that can be explained by the contribution of each subwatershed [29,38]. Positive leverage indicates the watershed is a net sources of the given solute and negative leverage indicates the watershed is a net sink. Very large leverage values can occur-for example, well over 100%-because some material is retained or removed as the water flows over and through the landscape before reaching the outlet. Thus, sites with large leverage values (i.e., >100%) can be considered highly influential contributors to solute concentrations at the outlet. Recently, mean leverage values across subwatersheds have been used to infer network-scale production or retention of solutes [27]. When the watershed leverage mean is positive, this implies there has been nutrient removal within the surface water network (i.e., there is more solute in the tributaries than can be accounted for at the outlet), whereas a mean negative leverage value indicates production within the surface water network. Valley tributaries subwatersheds were excluded from leverage analyses because they did not converge within a network to a single outlet (i.e., many discharged directly into Utah Lake).
To assess the consistency of observed spatial patterns, we calculated spatial persistence with Spearman's rank correlation [23,29]. This analysis calculates a correlation coefficient (ρ) of solute concentrations ranked from highest to lowest for each category and solute, across each pair of Seasons (i.e., Spring-Summer, Summer-Fall, and Spring-Fall). We graphed means and ranges of ρ values and tested for the effects of Season and land use category on solute persistence using ANOVA.
We also used parametric statistics to compare links between land use and land cover with water chemistry for the main tributaries. For each solute, we tested for differences among watersheds with Analysis of Variance (ANOVA). We tested for links between land use and catchments characteristics with water chemistry using generalized least squares models. The models used concentration of each select solute (PO We performed all statistical analyses in R, using the dplyr and ggplot2 packages [87][88][89]. We used ArcGIS Pro (ESRI) to map solute concentrations, using open-source base layers. Sample concentrations, including geographic information, and code used in analysis are available at https://doi.org/10.4211/hs.85f3584dccb54afba5698ac615ff949a.

Results
In 2018 and 2019, our outreach efforts reached approximately 6,500 community members directly. Across the first three sampling events (the ones reported in this paper), we had over 150 unique participants, with at least a third of participants attending more than one event. Most participants sampled in small groups although some worked alone. Detailed demographic data was not collected for participants in this phase of the project, however most attendees were estimated to be between 18 to 30 years old, with ages ranging from less than 1 over 80 years old. When informally asked about their experience, responses were overwhelmingly positive, including statements like, "I had no idea there were so many beautiful streams in Utah Valley," and, "this opened my eyes to how much we depend on this water. It actually comes from somewhere before my sink!" Most sampling kits were used correctly (e.g., bottles filled with filtered sample), with <10% of the samples rejected because of user error. Incomplete or illegible labeling was the most common error, though there were a few instances of complete communication breakdown (e.g., one bottle was returned filled with soil).

Solute concentrations
The spatial distribution of solute concentrations across the watershed is shown in Fig 1. This map includes data for samples collected from Utah Lake, which were excluded from other analyses. Point color corresponds with Utah's numeric water quality standards for N and P , and Clconcentrations were highest at sites near or on Utah Lake (Tukey, p-adj.<0.001).
Solute concentrations were different across the three sampling seasons (Spring, Summer, and Fall) through the year (ANOVA, F-stat = 5.896, p-value<0.01) and across categories (ANOVA, F-stat = 96.055, p-value<0.0001) for all solutes (Fig 2 and S1 Table). Pairwise analysis determined that solute concentrations in Spring were higher than Summer or Fall (Tukey, p-adj. = 0.002), but also depended on land use category (ANOVA, F-stat = 4.34, p-value<0.001). A pairwise analysis across all categories determined that DOC concentration was higher in Summer than Fall or Spring (Tukey, p-adj<0.001), and TDN concentration was higher in Fall than during Summer or Spring (Tukey, p-adj<0.05). All solute concentrations were higher at Valley tributaries sites than the other three categories (p-adj.<0.001), and Mountain urban were higher than Mixed dammed (p-adj = 0.005).
Regressions of solute concentration by Season and land use found that % impervious surface was positively correlated with higher concentrations for all solutes except DOC (Table 2). In regression models of TDN, DOC, and SO 4 2-, backward exclusion criteria found that only Season was significant. In regression models of SO 4 2and Cl -, backward exclusion criteria found that only % herbaceous upland was significant. Correlation coefficients (R 2 ) for the models were between 0.11 and 0.25.

Leverage and spatial persistence
In general, scaled concentrations and leverage by watershed area for the different solutes did not show a funnel shape that is typical of humid and temperate watersheds (Figs 3 and 4A) [38]. Instead, many of the solutes exhibited hourglass shapes, with higher scaled concentration and leverage at the largest subwatershed size. However, the funnel shape did occur for DIN concentrations in the valley tributaries, which had particularly high leverage in the smaller catchments (<100 km 2 ). Variance thresholds (km 2 ), calculated from the scaled concentration data using the PELT method, are listed in S3   Table). DOC, N, and P dynamics were much more concentrated (i.e., showing a critical source area behavior), with around two-thirds (59-69%) of watersheds having a moderate to low effect on outflow concentrations and the remaining third controlling flux. There were many more highly influential subwatersheds (i.e., >100% leverage) for TDN and PO

PLOS ONE
Citizen science reveals unexpected solute patterns in semiarid river networks Spatial persistence was specific to solute and land use (Fig 6 and S3

Novel spatial hydrochemical patterns
Our study emphasizes the unique hydrochemistry of semiarid and mixed natural-urban regions. In our results, PO 4 3was less spatially stable than other major ions, which was also the    6. Spatial persistence for solutes (DOC, PO43-, TDN, DIN, Cl -, and SO42-) for different categories of land use within the Utah Lake watershed. Spatial persistence is calculated as a pairwise Spearman rank correlation coefficient (ρ) among synoptic samplings from three seasons (Spring, Summer, and Fall). Points represent mean ρ for all pairwise comparisons within each category, and bars give the range of values. The horizontal grey line marks ρ = 0.7; indicating that the majority of the spatial pattern was retained between the two samplings.
https://doi.org/10.1371/journal.pone.0255411.g006 case in temperate agricultural watersheds [23,29] and in Arctic tundra watersheds [27]. The low spatial persistence of PO 4 3compared with other solutes like Clsuggests two non-exclusive factors: one, sources other than natural geologic weathering introduce variability and two, biological processes involving P uptake are variable. For example, changes in hydrological connectivity across seasons and storm events can alter the delivery and chemical availability of PO 4 3and other nutrients [90][91][92][93]. Likewise, P is often a limiting nutrient for some subwatersheds, introducing another source of variability as biological demand may exceed supply [36, 94,95].
Valley tributaries stood out as having particularly low spatial persistence across all solutes (Fig 6). We hypothesize that this is at least partially a function of the natural difference in solute concentrations between dilute snowmelt stream water and solute-rich groundwater, combined with changes in source based on hydrologic conditions [69]. When discharge is high, stream water diluted with snowmelt flows farther into the valleys before the change from losing to gaining occurs. During low flow, the switching point of losing to gaining moves upstream, creating large variability in solute concentrations observed at valley sites between the mouth of the canyons and the lake.
Our analysis did not return a single spatial threshold for variance collapse in solute concentration, unlike in other studies [50,[96][97][98]. variance may be also be impacted by the fact that the average subwatershed areas for the Valley tributaries and Mountain urban were much smaller (40 and 32 km 2 , respectively) than the Mixed dammed and Agriculture unregulated (200 and 281 km 2 , respectively). The lack of variance collapse in our study, like the low spatial persistence, may have been due to increased solute concentrations at groundwater-influenced sites near the lake [69]. Alternatively or additionally, we may have detected differences in spatial variance at finer scales because of the high-resolution sampling (i.e., such variability could exist in other areas but not have been detected because of coarser spatial sampling).

Urban sources affect all solutes except DOC
This study identified hot spots of solutes of concern (Fig 1), and determined that position within watershed is important in determining concentration dynamics [100]. PO 4 3-, TDN, DIN, Cl -, and SO 4 2concentrations were highest in Valley tributaries and other low-elevation reaches, and solute removal or dilution occurred at mid-elevation reaches, indicated by the map of concentrations (Fig 1) and narrowing of solute concentrations in mid-range subwatershed sizes (Fig 4A). Impervious surfaces contributed to overall higher solute concentrations ( Table 2), but solute concentrations were more variable in reaches with agricultural activity (Fig 6). This could be due to both the direct effect of impermeability on nutrient export (e.g., stormwater drainage) as well as a correlation between percent impermeability and the presence of wastewater effluent [34,101]. We note that our linear models had low predictive capability (up to 25%), suggesting the need for additional explanatory metrics (e.g., geology and updated land cover data). Decreases in solute concentrations at sites in the valley could be due to losses to groundwater [69] or sorption of P to Lake Bonneville sediments [54].
Valley tributaries were significant sources of N, a majority of which was DIN (Figs 2 and 4A). Multiple linear regression analyses showed that models of TDN and DIN had higher correlation with % impervious surface than any of the other solutes (Table 2), and both categories of N had lower persistence across Valley tributaries than the other subwatershed categories. Valley tributaries, which had the highest percentage of developed land use (Table 1), had the highest variability in N concentrations. Biological activity could also be responsible for high variability and seasonal differences in N concentrations, similar to what was observed in other mountainous western US watersheds [102].
This study confirms that urban point sources may disrupt spatial variability collapse. For example, in highly urbanized watersheds in New York, NO 3 − decreased in variability with increased watershed size, but not SRP or NH 4 + [94]. We found DIN increased below wastewater treatment plants in Mixed dammed and Mountain urban watersheds, although in Mixed dammed subwatersheds, DIN is subsequently diluted and/or removed at downstream sites (Fig 1). PO 4 3concentration exceeded State of Utah numeric criteria for nutrient pollution throughout all subwatershed categories and sizes, suggesting that point and non-point sources (e.g., stormwater, agricultural water, and natural geological deposits) contribute to elevated P in local streams.

Citizen science
In this study, we provided the opportunity for thousands of local citizens to learn more about point and non-point sources of water pollution in a deep and meaningful way [18]. This engagement has the possibility of creating public support for efforts to address water quality in the Utah Lake watershed [103]. Future directions of this work include using educational research tools to quantify the impact of participation on knowledge, attitude, and behavior. This study demonstrates that citizen scientists can help professional researchers accomplish study methodologies that are otherwise prohibitive. This has the dual benefits of extending capacity for scientific observation, and fundamentally changing public awareness and mentality. Both benefits subsequently influence how water resources are managed [61]. In this sense, participatory water quality monitoring is not only a means of increasing understanding of how water and nutrients propagate through watersheds; it is a mechanism to improve water quality itself and encourage sustainable stewardship [79,80].

Conclusion
Our results demonstrate the high spatial and temporal variability of PO 4 3- , decreased in concentration in mid-range subwatershed size. In addition to high solute concentrations, Valley tributaries had low spatial persistence, indicating temporally dynamic changes in sources and sinks of solutes. Even though there are inputs from natural (e. g, geology) and anthropogenic (e.g., mining and grazing) sources that contribute to variability in solute concentrations in the headwaters, the variability decreases at intermediate watershed sizes as these diverse headwaters mix. It is at the highly impacted, urban reaches (Valley tributaries and subwatershed sites >500 km 2 that normal solute behavior is affected to the degree that variability unexpectedly increases.
The Utah Lake watershed is fundamentally different in network-scale hydrochemistry than previously described temperate, urban, and Arctic watersheds, due to the unique hydrology and human impacts, especially in the lower reaches of the watershed. We encourage including more semiarid regions, specifically endorheic basins, in hydrologic studies because understanding their distinctive hydrologic characteristics is critical to preserving these unique ecosystems, many of which are threatened by climate change and human development [43].