PLoS ONEplosplosonePLOS ONE1932-6203Public Library of ScienceSan Francisco, CA USA10.1371/journal.pone.0219639PONE-D-19-17266Research ArticleEarth sciencesGeomorphologyTerrainEcology and environmental sciencesSoil scienceSoil ecologyEarth sciencesGeomorphologyTopographyComputer and information sciencesArtificial intelligenceMachine learningEngineering and technologyRemote sensingResearch and analysis methodsMathematical and statistical techniquesStatistical methodsMultivariate analysisPrincipal component analysisPhysical sciencesMathematicsStatisticsStatistical methodsMultivariate analysisPrincipal component analysisEarth sciencesGeomorphologyTopographyLandformsValleysEarth sciencesGeographyCartographyTopographic mapsDownscaling satellite soil moisture using geomorphometry and machine learningDownscaling satellite soil moisture using geomorphometry and machine learningGuevaraMarioConceptualizationData curationFormal analysisWriting – original drafthttp://orcid.org/0000-0001-6829-5333VargasRodrigoConceptualizationSupervisionWriting – review & editing*University of Delaware, Department of Plant and Soil Sciences, Newark, DEPoulterBenjaminEditorMontana State University Bozeman, UNITED STATES
The authors have declared that no competing interests exist.
* E-mail: rvargas@udel.edu24920192019149e021963918620199920192019Guevara, VargasThis is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Annual soil moisture estimates are useful to characterize trends in the climate system, in the capacity of soils to retain water and for predicting land and atmosphere interactions. The main source of soil moisture spatial information across large areas (e.g., continents) is satellite-based microwave remote sensing. However, satellite soil moisture datasets have coarse spatial resolution (e.g., 25–50 km grids); and large areas from regional-to-global scales have spatial information gaps. We provide an alternative approach to predict soil moisture spatial patterns (and associated uncertainty) with higher spatial resolution across areas where no information is otherwise available. This approach relies on geomorphometry derived terrain parameters and machine learning models to improve the statistical accuracy and the spatial resolution (from 27km to 1km grids) of satellite soil moisture information across the conterminous United States on an annual basis (1991–2016). We derived 15 primary and secondary terrain parameters from a digital elevation model. We trained a machine learning algorithm (i.e., kernel weighted nearest neighbors) for each year. Terrain parameters were used as predictors and annual satellite soil moisture estimates were used to train the models. The explained variance for all models-years was >70% (10-fold cross-validation). The 1km soil moisture grids (compared to the original satellite soil moisture estimates) had higher correlations (improving from r2 = 0.1 to r2 = 0.46) and lower bias (improving from 0.062 to 0.057 m3/m3) with field soil moisture observations from the North American Soil Moisture Database (n = 668 locations with available data between 1991–2013; 0-5cm depth). We conclude that the fusion of geomorphometry methods and satellite soil moisture estimates is useful to increase the spatial resolution and accuracy of satellite-derived soil moisture. This approach can be applied to other satellite-derived soil moisture estimates and regions across the world.
http://dx.doi.org/10.13039/100000083Directorate for Computer and Information Science and Engineering1724843http://orcid.org/0000-0001-6829-5333VargasRodrigoMG acknowledges a fellowship from CONACyT (382790). RV acknowledges support from the National Science Foundation CIF21 DIBBs (Grant #1724843).Data AvailabilityData is publicly available at the ESA-CCI website and is described in the main document. We also provide an R code in: Guevara M. Vargas R. Protocol for Downscaling Satellite Soil Moisture Estimates using Geomorphometry and Machine Learning, Protocols.io. protocols.io; 1970; https://doi:10.17504/protocols.io.6cahase, and model outputs are available at: Guevara M and Vargas R. 2019. Annual soil moisture predictions across conterminous United States using remote sensing and terrain analysis across 1 km grids (1991-2016), HydroShare, https://doi.org/10.4211/hs.b8f6eae9d89241cf8b5904033460af61.Introduction
Continuous national to continental scale soil moisture information is increasingly needed to characterize spatial and temporal trends of terrestrial productivity patterns (e.g., production of food, fiber and energy). This is because soil moisture is a key variable regulating hydrological and biogeochemical cycles, and thus studying its spatial-temporal dynamics is crucial for assessing the potential impact of climate change on water resources [1–4]. Currently, the most feasible way to obtain national to continental soil moisture information is using remote sensing. Microwave remote sensing devices deployed on multiple earth observation satellites are able to quantify the dielectric constant of soil surface and retrieve soil moisture estimates [5]. However, there are spatial gaps of satellite-based soil moisture information and its current spatial resolution (> 1km grids) limits its applicability at the ecosystem-to-landscape scales to address the ecological implications of soil moisture dynamics [5–8].
Satellite soil moisture records are an effective indicator for monitoring global soil conditions and forecasting climate impacts on terrestrial ecosystems, because soil moisture estimates are required for assessing feedbacks between water and biogeochemical cycles [9–12]. In addition, accurate soil moisture information is critical to predict terrestrial and atmospheric interactions such as water evapotranspiration or CO2 emissions from soils [3, 13–15]. However, soil moisture information at spatial resolution of 1x1km pixels or less is not yet available across large areas of the world and the coarse pixel size (>1km pixels) of available satellite soil moisture records is limited for spatial analysis (i.e., hydrological, ecological) at small regional levels (e.g., county- to state). In addition, satellite soil moisture estimates are representative only of the first few 0–5 to 10 cm of top-soil surface [16]. Therefore, comparing multiple sources for satellite soil moisture and field soil moisture estimates is constantly required for precise interpretations of soil moisture spatial patterns [17–19].
There is an opportunity for exploring statistical relationships across different sources of remote sensing information (e.g., topography and soil moisture) and developing alternative soil moisture spatial datasets (i.e., grids) to improve the continental-to-global spatial resolution of current satellite soil moisture estimates [7]. Spatially explicit soil moisture estimates can be obtained across large areas with a coarse spatial resolution (between 25–50 km grids) from radar-based microwave platforms deployed across different satellite soil moisture missions [20–21]. The availability of historical soil moisture records of these sources has increased during the last decade with unprecedented levels of temporal resolution (i.e., daily from years 1978-present) at the global scale. However, large areas constantly covered by snow, extremely dry regions or tropical rain forests (where there is a higher content of water above ground) lack of precise soil moisture satellite records due to sensor intrinsic limitations (e.g., saturation or noise) across these environmental conditions [22].
One valuable product that is affected by the aforementioned environmental conditions is the ESA-CCI (European Space Agency Climate Change Initiative) soil moisture product [20–21]. The ESA-CCI mission makes rapidly available long-term soil moisture estimates with daily temporal resolution from the 1978s to date, and it represents the state-of-the-art knowledge tool for assessing long term trends in the climate system. Modeling, validation and calibration frameworks are required for improving the spatial representation of this important dataset, and for predicting soil moisture patterns across areas where no satellite estimates are available.
Currently, there is an increasing availability of fine-gridded information sources and modeling approaches that could be used for increasing the spatial resolution (hereinafter downscaling) of the ESA-CCI satellite soil moisture estimates (e.g., soil moisture predictions across <1x1km grids). Downscaling (and subsequently gap-filling) satellite soil moisture estimates has been the objective of empirical modeling approaches based on sub-grids of soil moisture related information such as soil texture [23]. Other approaches followed environmental correlation methods and generated soil moisture predictions for satellite soil moisture estimates using both data-driven or hypothesis driven models and multiple sub-grids of ancillary information [24–26]. These sub-grids of information usually include vegetation related optical remote sensing imagery, gridded soil information, land cover classes and landforms [27–30]. Most of these approaches have been tested for specific study sites. Other studies have focused on applying a digital soil mapping approach (a reference framework for understanding the spatial distribution of soil variability [31]) and multiple upscaling methods for predicting soil moisture patterns at the continental scale [26, 32]. An overview of multiple approaches for downscaling satellite soil moisture (e.g., empirically based, physically based) has been previously discussed [33]. Here, we propose that digital terrain analysis (i.e., geomorphometry) can also be applied for empirically downscaling soil moisture satellite-based information across continental-to-global spatial scales.
Geomorphometry is an emergent discipline in earth sciences dedicated to the quantitative analysis of land surface characteristics and topography [34–35]. For geomorphometry, the analysis of topography includes the generation of a diversity of hydrologically meaningful terrain parameters (i.e., slope, aspect, curvature, valley depth index, topographic wetness index) that aim to represent how the landscape physically constrains water inputs (e.g., rainwater, irrigation, overland flow) that reaches the soil surface [35–36]. These terrain parameters are referred as "digital" because they are usually derived from digital elevation models using Geographic Information Systems (GIS). These digital terrain parameters are hydrologically meaningful because at the landscape scale, soil moisture is partially controlled by topography-related factors (i.e., slope, aspect, curvature) that physically constrain soil water inputs and soil hydraulic properties (e.g., soil texture, structure). Based on these geomorphometry principles [35–39], we propose that it is possible to determine which terrain parameters are the strongest predictors of the spatial variability of satellite soil moisture. Statistically coupling the spatial variability of satellite soil moisture with hydrologically meaningful terrain parameters could be an alternative way to improve the spatial resolution and accuracy of satellite soil moisture estimates across scales. This is possible because topography (represented by digital terrain parameters) directly affects: 1) the angle of the satellite microwave signal at the soil surface; and 2) the overall distribution of water in the landscape.
Topography is a major driver for soil moisture and topography surrogates (e.g., land form or elevation map) have been combined with other variables (e.g., climate, soils, vegetation and land use) for downscaling satellite soil moisture estimates [33]. Furthermore, previous studies have shown that realistic soil moisture patterns can be obtained using topographic information across site specific and catchment scales including physically based and empirical approaches [40–41], However, the exclusive use of geomorphometryderived products (i.e., digital terrain parameters) for downscaling satellite soil moisture estimates has not yet been explored in detail from national-to-continental scales. This approach is relevant to avoid statistical redundancies and potential spurious correlations when downscaled soil moisture is further used or analyzed with vegetation- or climate-related variables (when these aforementioned variables were used for downscaling of satellite derived soil moisture). In this study, we show the potential of a soil moisture prediction framework purely based on digital terrain parameters.
Our main objective is to generate a soil moisture prediction framework by coupling satellite soil moisture estimates with digital terrain parameters as prediction factors. Coupling the complexity of topographic gradients and the multi-temporal nature of satellite soil moisture requires an approach that should account for non-linear relationships. Machine learning approaches could account for non-linearity based on probability and the ability of computer systems to reproduce and ‘learn’ (i.e., decide the best solution after multiple model realizations) from multiple modeling outputs (i.e., varying model parameters of combinations of training and testing random samples) [42]. Furthermore, machine learning is now a common component of geoscientific research leading the discovery of new knowledge in the earth system [43] including mapping of soil organic carbon [44], soil greenhouse gas fluxes [8, 45] and soil moisture estimates [46].
We postulate that the data fusion between satellite soil moisture with hydrologically meaningful terrain parameters can enhance the spatial resolution, representativeness and quality (i.e., accuracy) of current coarse satellite soil moisture grids. We focus on the conterminous United States (CONUS) given the large availability of soil moisture records for validation purposes from the North American Soil Moisture Database (NASMD) [47]. The novelty of this research relies on proposing an alternative approach for obtaining soil moisture gridded estimates with no gaps and at high spatial resolution (i.e., 1km) determined by topographic prediction factors. This study is based on public sources of satellite information (derived from ESA-CCI soil moisture product) and a data-driven framework that could be reproduced and applied across the world.
Materials and methods
Our downscaling approach relied on a Digital Elevation Model (DEM), and satellite soil moisture records. Soil moisture information was acquired from the ESA-CCI [20–21]. The development and reliability (i.e., validation) of this remote sensing soil moisture product has been documented by previous studies [20–21, 48]. Our framework includes prediction factors for soil moisture from digital terrain analysis. These terrain predictors were derived across CONUS using 1km grids. Machine learning was used for generating soil moisture predictions (annual, 1991–2016) and the satellite soil moisture estimates provided by the ESA-CCI were used as training data. Field soil moisture observations from the North American Soil Moisture dataset were used for validating the soil moisture predictions based on digital terrain analysis (Fig 1).
10.1371/journal.pone.0219639.g001
Soil moisture prediction framework.
The folders are the inputs and outputs and the ovals are methods for data preparation (data bases harmonization), modeling (for prediction) and validation (for assessing the reliability of soil moisture maps). The field data from the North American Soil Moisture Database (NASMD) was only used for validation purposes (i.e., not for training the model).
Datasets and data preparation
The downscaled dataset were obtained from the ESA-CCI satellite soil moisture estimates between 1991 and 2016, and the validation dataset were field soil moisture measurements from the NASMD (S1 Fig). The downscaling framework is explained in the following sections. The ESA-CCI soil moisture product has a daily temporal coverage from 1978 to 2016 and a spatial resolution of ~27 km (S2 Fig). Among several remotely sensed soil moisture products [16, 49–53], we decided to use the ESA-CCI soil moisture product because it covers a larger period of time compared with other satellite soil moisture products (e.g., NASA SMAP). We highlight that satellite soil moisture information is used for training a machine learning model for each year, and independent field soil moisture records area only used for validating the downscaled soil moisture predictions.
For externally validating, we used the NASMD because it has been curated following a strict quality control calibrated for CONUS [44] (S1 Fig). This data collection effort consists of a harmonized and quality-controlled soil moisture dataset with contributions from over 2000 meteorological stations across CONUS described in previous studies [47]. The NASMD also includes records of soil moisture registered in the International Soil Moisture Network (ISMN) [47, 54]. The NASMD provides processed data from each station location in each network following a standardization framework focused in North America [47–19]. We used soil moisture records at 5 cm of depth (n = 5541 daily measurements) from 668 stations with available (from the aforementioned sources) soil moisture estimates at this depth because radar-based soil moisture estimates are representative for these first few centimeters of topsoil surface [16].
As prediction factors for soil moisture, we calculated hydrologically meaningful terrain parameters for CONUS (S1 Table) using information from a radar-based DEM [55–56]. These terrain parameters are quantitative spatial grids representing the topographic variability that directly influence the water distribution across the landscape [35], which supports the physical link between soil moisture and topography. These parameters were the basis for downscaling satellite soil moisture records to 1km grids. This spatial resolution captures the major variability of topographic features across CONUS and is commonly used on large-scale ecosystem studies and soil mapping efforts [56–57].
For the calculation of soil moisture prediction factors, we used automated digital terrain analysis using the System for Automated Geographical Analysis-Geographical Information System (SAGA-GIS) [36]. The automated implementation of SAGA-GIS for Geomorphometry (module for basic terrain analysis) includes a preprocessing stage to remove spurious sinks and reduce the presence of other artifacts in the elevation gridded surface (e.g., false pikes or flat areas). After preprocessing the DEM, fifteen hydrologically meaningful terrain parameters were generated for the CONUS from elevation data including primary (i.e., slope, aspect) and secondary parameters (i.e., cross-sectional curvature, longitudinal curvature, analytical hill-shading, convergence index, closed depressions, catchment area, topographic wetness index, length-slope factor, channel network base level, vertical distance to channel network, and valley depth index; Fig 2).
10.1371/journal.pone.0219639.g002
Elevation and hydrologically meaningful terrain parameters at 1x1km of spatial resolution derived using the standard SAGA-GIS basic terrain parameters module.
These maps were normalized (between 0–1) and then used as prediction factors to downscale soil moisture across CONUS.
Primary terrain parameters are direct descriptions of elevation data, for example slope and aspect, which are respectively the first and second derivative of elevation data. Secondary parameters are generated by combining a primary terrain parameter with a mathematical formulation to describe process controlled by topography that are directly related to soil moisture variability [34–35, 37], such as the overland flow accumulation. The topographic wetness index for example, indicates areas where water tend to accumulate by effect of topography and is a secondary index derived by the combination upslope area draining through a certain point per unit contour length and the local terrain slope [34–35]. The values of these terrain parameters (S1 Table) were extracted for each one of the ESA-CCI soil moisture pixels using as reference the central coordinates of each ESA-CCI pixel (Fig 1inputs).
Data exploration
We used a principal component analysis (PCA) prior to modeling for data exploration and description of general relationships between soil moisture values and topography (represented by the aforementioned terrain parameters). The purpose was to simplify the dimensionality of the data set to identify the main relationships (between soil moisture and topographic parameters) driving our downscaling framework (Fig 1methods). The PCA was implemented as in previous work [58], based on a reference value representing the 0.95-quantile of the variability obtained by randomly simulating 300 data tables of equivalent size on the basis of a normal distribution. This analysis was applied to the terrain parameters at the locations of the field stations in order to compare the relationship of the first PCA and the values of soil moisture from the ESA-CCI grids and from the field data.
Model building
For this study we built a model for each annual mean of satellite soil moisture grids. We used a machine learning kernel-based model (kernel weighted nearest neighbors, kknn) [59–60] to generate predictions and downscale satellite soil moisture grids (Fig 1methods). The training dataset for each model/year were the annual mean values of the ESA-CCI soil moisture product. The kknn model has two main model parameters: the optimum number of neighbors (k) and the optimal kernel function (okf). First, we defined k, which is the number of neighbors to be considered for the prediction. Second, we selected the okf, which is a reference (e.g., triangular, epanechnikov, Gaussian, optimal) for the probability density function of the variable to be predicted. The okf is used to convert distances (i.e., Minkowski distance) into weights used to calculate the k-weighted average. These kknn model parameters (k and okf) were selected by the means of 10-fold cross validation as previously recommended [61]. Cross-validation is a well-known re-sampling technique that divides data into 10 roughly equal subsets. For every possible parameter value (e.g., k from 1 to 50 and okf [triangular, epanechnikov, Gaussian, optimal]), 10 different models are generated, each using 90% of the data then being evaluated on the remaining 10%. To predict soil moisture information at 1 km of spatial resolution for each year (between 1991 and 2016), we selected the combination of optimal k and okf that lead to the highest correlation (between observed and predicted data) with the lowest root mean squared error (RMSE) after the cross-validation strategy. Thus, for each year we were able to predict soil moisture across 1x1 km grids (Fig 1outputs).
Validation using field observations across CONUS
Downscaled soil moisture grids were compared against field measurements and we computed the explained variance (r2) using a linear fit (observed vs predicted) for each field soil moisture location. Given the relatively low density and sparse spatial distribution of field data for validating (S1 Fig), we bootstrapped the independent validation using different sample sizes (from 10 to 100% of data with increments each 10%) to avoid systematic bias associated with the spatial distribution and density of field soil moisture information. We sampled (n = 1000) repeatedly the original and the downscaled soil moisture grids aiming to identify their correlation with the aforementioned validation dataset (i.e., observed vs predicted).
We also computed the spatial structure (spatial autocorrelation) of the explained variance (correlation between geographical distance and variance of r2 values) for estimating an r2 map using an interpolation technique known in geostatistics as Ordinary Kriging [62]. Ordinary Kriging is a well-known method for spatial interpolation based on the spatial structure or spatial autocorrelation of the variable of interest (the r2 values between the field observations and the predicted soil moisture values). The spatial autocorrelation is defined by the relationship between geographical distances and variance of values at a given distance, and it is commonly characterized using variograms. We followed an automated variogram parameterization (the optimal selection for the variogram parameters nugget, sill and range required to perform Ordinary Kriging) proposed in previous work [63].
As implemented in the automap package of R [63], the initial sill is estimated as the mean of the maximum and the median values of the semi-variance. The semi-variance is defined by the variance within multiple distance intervals. For modeling the spatial autocorrelation this algorithm iterates over multiple variogram model parameters selecting the model (e.g., spherical, exponential, Gaussian) that has the smallest residual sum of squares with the sample variogram. The initial range is defined as 0.10 times the diagonal of the bounding box of the data. The initial nugget is defined as the minimum value of the semi-variance. Thus, the parameters used for obtaining a continuous map showing spatial trends in the r2 were: a Gaussian (normal) model form, a nugget value of 0.06 m3 m-3, a sill of 0.08 m3 m-3 and an approximate range of 428.7 km. This map was generated because it could provide insights about overall sources of modeling errors (e.g., environmental similarities in multiple areas showing low or high explained variance) and their spatial distribution. All analyzes were performed in R [64] using public sources of data. Our protocol and R code used for generating the soil moisture predictions at 1km grids is available online (http://dx.doi.org/10.17504/protocols.io.6cahase) for reproducibility of this research [65].
Results
The exploratory PCA showed that the first two PCs explained 33% of the total dataset variability (S3A Fig), where the first PC explained 18% of total variability and at least five PCs were needed to explain 70% of total variability. The first PC was best correlated with elevation (r = 0.82) and with the vertical distance to channel network (r = 0.88). Elevation varied negatively with soil moisture, as well as other secondary terrain parameters such as the base level channel network elevation (distance from each pixel to the closer highest point), while the valley depth index varied positively with soil moisture (S3B Fig). The relative slope position (indicating the dominance of flat or complex terrain) and the topographic wetness index (which indicates areas where water tends to accumulate) were also correlated with soil moisture across the first 5 PCs. Thus, multiple terrain parameters varied positively and negatively with soil moisture values (S1 Appendix).
Our framework to predict soil moisture based on topography and remote sensing was able to explain, on average 79±0.1% of the variability of satellite soil moisture information as revealed by the cross-validation strategy. The root mean squared error (RMSE) derived from the cross-validation varied around 0.03 m3/m3, while the percentage of explained variance was in all cases above 70% (Table 1).
10.1371/journal.pone.0219639.t001
The cross-validation results for each year.
This table shows the correlation, root mean squared error (RMSE) in m3/m3, the number of training data available (n), the optimal kernel function (okf), and the optimal number of neighbors used for predicting to new data (k).
Model
Year
Correlation
RMSE
n
okf
k
1
1991
0.85
0.03
18058
triangular
18
2
1992
0.89
0.03
18429
triangular
16
3
1993
0.88
0.03
18107
triangular
18
4
1994
0.9
0.03
18367
triangular
16
5
1995
0.88
0.03
18385
triangular
18
6
1996
0.9
0.03
18454
triangular
15
7
1997
0.88
0.03
18428
triangular
15
8
1998
0.88
0.03
18540
triangular
16
9
1999
0.89
0.03
18542
triangular
15
10
2000
0.9
0.03
18547
triangular
15
11
2001
0.9
0.03
18523
triangular
15
12
2002
0.9
0.03
19170
triangular
16
13
2003
0.89
0.03
19132
triangular
16
14
2004
0.89
0.03
18934
triangular
16
15
2005
0.89
0.03
19132
triangular
16
16
2006
0.9
0.03
19131
triangular
16
17
2007
0.88
0.03
19142
triangular
16
18
2008
0.9
0.03
19136
triangular
16
19
2009
0.9
0.03
19142
triangular
16
20
2010
0.88
0.03
19245
triangular
18
21
2011
0.9
0.03
19255
triangular
18
22
2012
0.9
0.03
19252
triangular
16
23
2013
0.89
0.03
19226
triangular
16
24
2014
0.89
0.03
19227
triangular
16
25
2015
0.88
0.03
19231
triangular
16
26
2016
0.88
0.03
19225
triangular
16
By applying the model coefficients to the topographic prediction factors across CONUS, we generated 26 cross-validated maps (for years 1991–2016) of mean annual soil moisture estimates within 1x1km grids (Fig 3). The downscaled product shows a higher level of spatial variability due the increased spatial detail achieved by downscaling soil moisture to 1x1km grids (S4 Fig). Our predictions reveal a clear bimodal distribution of soil moisture values (e.g., from the east to the west, Fig 4) which is also evident in the original estimate (S5 Fig). The statistical comparison between the original product and the downscaled product shows a high level of agreement with an r2 value of 0.72.
10.1371/journal.pone.0219639.g003
Annual means of soil moisture (1991–2016) downscaled to 1x1km grids across CONUS using terrain parameters as prediction factors.
10.1371/journal.pone.0219639.g004
Comparison of the original (27km grids) and the downscaled (1km grids) soil moisture products.
Median (a, b) and standard deviation (c, d, sdev) values of satellite soil moisture and downscaled soil moisture values (1991–2016). Uncertainty reported by the ESA-CCI soil moisture (e) and the explained variance map (r2) between field data and downscaled soil moisture (f).
We provided a visual comparison between the original satellite estimate and the downscaled results including both median (Fig 4A and 4B) and standard deviation values (Fig 4C and 4D). We also show the uncertainty of the original soil moisture product as reported by its developers (Fig 4E) and the r2 map from the validation against field stations (Fig 4F). This r2 map (Fig 4F) is based on the spatial autocorrelation found in the variogram estimation (e.g., sill > nugget) applied to the r2 values of the data used for validating our approach (S1 Fig). This r2 map is indicating areas with similar soil moisture conditions regulating the spatial variability of soil moisture and affecting the performance of our models, around the sites of available field data for validating our approach. The r2 map shows the lowest values across the Central Plains of the US and the lower Mississippi basin. The lower values in the r2 map are consistent with the high uncertainty values of the original satellite estimate (Fig 4E).
The r2 map in Fig 4F provided insights about the relationship between soil moisture gridded surfaces and soil moisture field data. Higher r2 values were found across the east coast, the Northern Plains and water-limited environments across the western states. We found that our soil moisture downscaled output better correlates (nearly 25% improvement) with NASMD field observations when compared to the original soil moisture satellite estimates (Fig 5).
10.1371/journal.pone.0219639.g005
Validation of soil moisture gridded estimates (original 27 and 1km grids) against NASMD field observations.
Dashed line represents the relationship of field stations and soil moisture gridded estimates at 27x27km, while black line represents the relationship between field stations and the downscaled 1x1km soil moisture product. In all cases (all sample sizes), the 1x1km product showed higher r2 with the NASMD than the ESA-CCI soil moisture estimates.
This improvement was consistent after repeating it using random samples and different sample sizes (from 10 to 90% of available validation data) from the NASMD field observations (Fig 5). Consistently, we found a negative mean bias (surrogate of systematic error) in the ESA-CCI when compared against field stations of the NASMD (-0.051) that is slightly lower when comparing the downscaled soil moisture predictions against the NASMD (-0.048). The resulting RMSE of validating against field data was also slightly lower (0.057 m3/m3) for the downscaled estimate compared with the original product (0.062 m3/m3). However, there is a sparse distribution of validation data and large areas of CONUS lack of field information for validating/calibrating soil moisture predictions (Fig 6). Considering the quality-controlled records available from the NASMD across CONUS and the coarse scale of the ESA-CCI soil moisture product, our approach suggests an improvement in the spatial resolution (from 27 to 1km grids) of soil moisture estimates while maintaining the integrity of the original satellite values.
10.1371/journal.pone.0219639.g006
Explained variances computed for each meteorological station of the NASMD and the corresponding pixel of our soil moisture predictions based on geomorphometry.
The original satellite values, the downscaled product and the ISMN dataset showed a similar correlation with the terrain predictors. For example, the first PCA (represented by the distance to channel network and elevation), was negatively correlated with field soil moisture, the satellite original product and our soil moisture predictions. The correlation values were r = -0.17, r = -0.27, and r = -0.28 respectively. These relationships showed a similar pattern in the statistical space (Fig 7).
10.1371/journal.pone.0219639.g007
Relationships between the first PC of terrain parameters with soil moisture field data (a), with the ESA-CCI satellite product (b), and with the soil moisture predictions based on terrain parameters (c).
Discussion
Our soil moisture downscaling framework was able to improve the spatial detail of ESA-CCI satellite soil moisture product and its agreement with field soil moisture records from the NASMD. It is well known that topography has a direct influence on the overall water distribution across the landscape [38–39] and in the angle between satellite retrieval and the Earth’s surface. Thus, we demonstrated how a coarse scale satellite-based soil moisture product (27x27km of spatial resolution), in combination with hydrologically meaningful terrain parameters, can be coupled using machine learning algorithms to generate a fine-gridded and gap-free soil moisture product at the annual scale across CONUS.
We found a correlation between field soil moisture estimates and topography that is similar to the correlation between satellite estimates and topography (Fig 7), suggesting that topography can be an effective predictor for direct soil moisture measurements (i.e., from microwave remote sensing). Previous studies have confirmed this correlation between topographic variability and soil moisture conditions for downscaling soil moisture across multiple catchment scales and environmental conditions [40–41]. We recognize that our modeling approach based on digital terrain analysis does not directly account for local variations of evaporation, soil structure or vegetation (but indirectly). Our approach assumed that topography (described by the shape of multiple digital terrain parameters) is also capturing some variability associated to all factors affecting soil moisture. In contrast to previous downscaling efforts using vegetation and climate information [33, 66], we generated 26 annual soil moisture predictions (1991–2016, 1x1 km of spatial resolution) that are independent of ecological data (i.e., vegetation greenness) and climate information, (i.e., precipitation and temperature). This topography-based approach is able to maintain the original satellite soil moisture statistical distribution (S5 Fig) and has the advantage that our modelling output could be further related to independent datasets of ecological or climate variables [67–68] and avoid subsequent spurious relationships.
We showed that this topography-based approach is able to improve the correlation of the original estimate when compared against field data from the NASMD while maintaining the integrity of its values (i.e., same systematic error between the NASMD and the original or the downscaled product). Therefore, we provided a reliable (topography-based) approach to predict the satellite soil moisture patterns across finer spatial grids and in areas where no satellite soil moisture is available.
The downscaling process of satellite soil moisture from 27 to 1km grids across CONUS is supported on both internal (Table 1) and independent (Fig 5) validation frameworks to describe modeling performance. The accuracy of our modeling framework showed explained variances >70% and RMSE values considerably below (~0.03 m3 m-3) the satellite soil moisture mean of 0.22 m3 m-3, which is suitable for many applications [66], such as the detection of irrigation signals [69]. Similar results have been found recently for specific study sites [70]. Our results obtained by the cross-validation strategy and ground validation supports the application of a topography-based model to predict satellite soil moisture estimates (Fig 4).
Our results showed that higher soil moisture values could be found across lower elevations, areas with generally large and gentle slopes mainly across valley bottoms and across catchment areas where water tends to accumulate. This interpretation could explain the short distance in the multivariate analysis of satellite soil moisture estimates to elevation and derived terrain parameters such as the vertical distance (of each pixel) to the nearest channel network, the valley depth index and the topographic wetness index. The multivariate analysis also suggested some degree of statistical redundancy between the topographic prediction factors (S1 Appendix) as they were derived from the digital elevation model by the means of geomorphometry [34–39]. For example, we found that the topographic wetness index is highly correlated with the length-slope factor (>0.80%), and this is because they are two secondary parameters that depend on slope (primary terrain attribute) [35]. Elevation and slope are respectively required for calculating secondary terrain parameters such as the valley depth index and the topographic wetness index [36] and these terrain parameters varied closely with soil moisture in the multivariate space (S1 Appendix). Thus, understanding the main relationships between topographic prediction factors and soil moisture can be useful for reducing modeling complexity while increasing our capacity to interpret modeling results.
The spatial detail of soil moisture estimates using 1km grids across the continental scale of CONUS is consistent with the variability of soil moisture patterns between the western and eastern United States. While drought scenarios have been recently reported for the western states [71] evidence of precipitation increase has been reported recently in the eastern states [72]. Our soil moisture downscaled estimates (Fig 3) revealed a soil moisture gradient across the Central Plains of CONUS and a clear separation of two major soil moisture data populations (i.e., soil moisture values with a bimodality distribution) from the drier west, to the humid east (S5 Fig).
The original satellite soil moisture estimates also show this bimodal distribution but with a lesser extent (S2 Fig). The bimodal distribution of soil moisture could be explained by a negative soil moisture and precipitation feedback in the western CONUS and a positive soil moisture and precipitation feedback in the eastern CONUS [68]. Furthermore, areas with soil moisture bimodality have been recognized across global satellite observations and climate models [73]. We identified areas of low agreement between our soil moisture predictions and field stations (lower r2 values) across the transitional ecosystems (Fig 4) from drier to humid soil moisture environments (i.e., Central Plains and lower Mississippi basin). It is likely that these transitional areas drive changes in water availability in surface and subsurface hydrological systems [74]. The lower Mississippi basin, specifically the area across the surroundings of the Mississippi delta, is an example of a transitional area experiencing aquifer depletion [75] where both flooding events and droughts tend to occur within shorter distances that are not captured by the original satellite soil moisture information. These are the type areas where we found lower values of agreement (r2 values) between satellite and ground soil moisture observations. These low correlation values can be also explained by the use of multiple soil moisture networks with different types of sensors and measurement techniques [19]. Also, the imperfections of prediction factors used for soil moisture spatial variability models represent a potential source of uncertainty.
As any downscaling effort dependent on covariates (i.e., terrain parameters), our approach is vulnerable to data quality limitations such as the presence of systematic errors on these covariates. Other errors are derived from input data imperfections and difficulties meeting modeling assumptions. These errors in soil moisture modeling inputs increase the risk of bias and uncertainty propagation to subsequent soil moisture modeling outputs and soil mapping applications [76–78]. For example, elevation data surfaces derived from remote sensing data (such as the global DEM used here) could show artifacts (i.e., false pikes or spurious sinks) due to data saturation or signal noise that can be propagated to final soil moisture predictions [79]. We minimized this issue by using SAGA-GIS [36] as it has adopted methods for preprocessing and perform DEM quality checks [80] before deriving the topographic prediction factors used in this study. Because input covariates could not be fully free of errors, we advocate for reporting information on bias and r2 values to inform about accuracy (e.g., Table 1) as important components for interpreting soil moisture predictions.
Our results suggest that the original coarse scale soil moisture product and the values of soil moisture from the NASMD (Fig 5) are difficult to compare in terms of spatial variability, as is highlighted in previous studies [19]. This is because a satellite soil moisture pixel from the ESA-CCI product provides a value across a larger area (27x27km) than a field measurement at a specific sampling location (defined by geographical coordinates). This scale dependent effect (27x27km vs 1:1 field scale) is reduced (>25%) with soil moisture predictions across finer grids (1km). The downscaled soil moisture maps showed a higher agreement with field soil moisture records from the NASMD (Fig 5), supporting the applicability of this soil moisture product for applications that required higher spatial resolution.
Our soil moisture predictions across 1km grids suggest that topography can be effectively used to improve the spatial detail and accuracy of satellite soil moisture estimates. Several studies have highlighted differences in spatial representativeness between ground-based observations and satellite soil moisture products [77, 81]. Other studies have shown that the spatial representativeness of the ESA-CCI soil moisture compared with field observations is higher from regional-to-continental scales than from ecosystem-to-landscape scales [82–83]. Therefore, large uncertainties of soil moisture spatial patterns (below 1km grids) needs to be resolved for assessing and better understanding the local variability of soil moisture trends. We argue that currently there is an increasing availability of high-quality digital elevation data sources with high levels of spatial resolution (e.g., 1–2 to 30 to 90m grids) across large areas of the world [84–85] that can be used to derive reliable hydrologically meaningful terrain parameters for predicting soil moisture. The relationship of these digital terrain parameters and field soil moisture (i.e., meteorological stations) is similar to the relationship between terrain parameters and satellite soil moisture gridded estimates (Fig 7).
From a single information source (a remotely sensed DEM), we downscaled satellite records of soil moisture using a framework that theoretically is reproducible across multiple scales. The ultimate goal of reducing the multiple information sources for predicting soil moisture is to reduce the statistical redundancy in further modeling efforts (i.e., land carbon uptake models) and large-scale ecosystem studies (i.e., ecological niche modeling) that combine similar prediction factors for soil moisture (i.e., climate or vegetation indexes). These include models estimating water evapotranspiration trends [86] and process based global carbon models that could also benefit from more accurate and independent soil moisture inputs [78]. To improve the spatial representativeness of satellite soil moisture estimates, the number of studies developing new downscaling approaches based on prediction factors is rapidly expanding [26, 28, 66, 87]. There is a pressing need to solve the current uncertainty of soil moisture estimates to accurately understand how soil moisture is limiting the primary productivity of terrestrial ecosystems [6]. Previous studies have identified a topographic signal in satellite soil moisture [88] supporting the reliability of our modeling approach. Therefore, our results provide an alternative applicable to continental scales for downscaling satellite soil moisture estimates based on hydrologically meaningful terrain parameters.
The novelty of this approach is that it could be applicable to multiple temporal resolutions (e.g., monthly or daily) as it generates independent models for each period of interest and at multiple spatial scales as the availability of terrain parameters for modeling purposes has increased substantially (i.e., meters) in the last decade. Increasing the temporal resolution of downscaled maps (i.e., from annual to monthly predictions) is beyond the scope of this study, will increase computational costs, but are theoretically possible following this approach. While monthly or weekly (or even daily soil moisture datasets) are valuable sources for large scale earth system modeling, annual averages are also valuable for detecting long term trends in the climate-land system. Rather than focusing on temporal variability of soil moisture, our results provide insights for improving the spatial variability and consequently the spatial representation of soil moisture gridded surfaces derived from satellite information.
Conclusion
Recent studies highlight the necessity of detailed soil moisture products to account for soil moisture limitation in terrestrial ecosystems. We developed a geomorphometry-based framework to couple satellite soil moisture records with hydrologically meaningful terrain parameters. This approach is useful to avoid statistical redundancies when downscaled soil moisture is further used or analyzed with vegetation- or climate-related variables not included in the downscaling framework. We predicted (i.e., downscaled) soil moisture using 1x1km grids across CONUS at an annual scale from 1991 to 2016. This gap-free soil moisture product improved the spatial detail of the original satellite soil moisture grids and the overall agreement (increased by >20%) of these grids with the NASMD field soil moisture records. Our findings suggest that digital terrain analysis can be applied to elevation data sources to derive hydrologically meaningful terrain parameters and use these parameters predict soil moisture spatial patterns. Our framework is reproducible across the world because it is based on publicly available DEMs, ground and satellite soil moisture data. Our protocol and R code is available online (http://dx.doi.org/10.17504/protocols.io.6cahase) as well as input parameters and annual means of soil moisture at 1km grids (https://www.hydroshare.org/resource/b8f6eae9d89241cf8b5904033460af61/).
Supporting information
This file (AppendixS1.html) contains the results of the automated PCA report described in the methods section, for exploring relationships between soil moisture and topography.
(HTML)
Spatial distribution of the NASMD validation set with information for the first 0-5cm depth (n 668).
(TIF)
ESA-CCI satellite soil moisture across CONUS (~27km grids).
(TIF)
PCA map of the first and second principal components.
The individual point cloud across the plane between the first and second PCA (a). The orthogonal relationship of the variables with higher contribution to this plane (b). Soil moisture is represented by the dotted blue line. An interpretation of these can be found in S1 Appendix.
(TIF)
Mean soil moisture for the year 2016.
Comparison between the original satellite soil moisture (a) and soil moisture predicted at 1km grids (b).
(TIF)
Bimodal distribution of satellite soil moisture (black) and the downscaled soil moisture estimates (red).
(TIF)
This table (S1 Table) contains a description of primary and secondary terrain parameters used in this study for generating soil moisture predictions.
(DOCX)
ReferencesGreveP., GudmundssonL., SeneviratneS.I., 2018. Regional scaling of annual mean precipitation and water availability with global temperature change. . 9, 227–240. https://doi.org/10.5194/esd-9-227-2018SeneviratneS.I., CortiT., DavinE.L., HirschiM., JaegerE.B., Lehner, et al. 2010. Investigating soil moisture–climate interactions in a changing climate: A review. 99, 125–161. https://doi.org/10.1016/j.earscirev.2010.02.004SeneviratneS.I., WilhelmM., StanelleT., van den HurkB., HagemannS., Berg,et al. 2013. Impact of soil moisture-climate feedbacks on CMIP5 projections: First results from the GLACE-CMIP5 experiment: GLACE-CMIP5 EXPERIMENT. 40, 5212–5217. https://doi.org/10.1002/grl.50956WesternA.W., GraysonR.B., BlöschlG., WillgooseG.R., McMahonT.A., 1999. Observed spatial organization of soil moisture and its relation to terrain indices. . 35, 797–810. https://doi.org/10.1029/1998WR900065DorigoW., WagnerW., AlbergelC., AlbrechtF., BalsamoG., Brocca, et al. 2017. ESA CCI Soil Moisture for improved Earth system understanding: State-of-the art and future directions. 203, 185–215. https://doi.org/10.1016/j.rse.2017.07.001StockerB. D., ZscheischlerJ., KeenanT. F., PrenticeI. C., PeñuelasJ., & SeneviratneS. I.2018. Quantifying soil moisture impacts on light use efficiency across biomes. , 218(4), 1430–1449. doi: 10.1111/nph.1512329604221BroccaL., CiabattaL., MassariC., CamiciS., & TarpanelliA.2017. Soil Moisture for Hydrological Applications: Open Questions and New Opportunities. , 9(2), 140. doi: 10.3390/w9020140VargasR., Sánchez-CañeteP., Serrano-OrtizP., Curiel YusteJ., DomingoF., López- BallesterosA.et al. 2018. Hot-moments of soil CO2 efflux in a water-limited grassland. , 2(3), p.47.AsnerG.P., AlencarA., 2010. Drought impacts on the Amazon forest: the remote sensing perspective. .CookB.D., CorpL.A., NelsonR.F., MiddletonE.M., MortonD.C., McCorkelJ.T., et al. 2013. NASA Goddard’s LiDAR, Hyperspectral and Thermal (G-LiHT) Airborne Imager. 5, 4045–4066. https://doi.org/10.3390/rs5084045DaiA., 2011. Drought under global warming: a review. 2, 45–65. https://doi.org/10.1002/wcc.81SamaniegoL., ThoberS., KumarR., WandersN., RakovecO., Pan,et al. 2018. Anthropogenic warming exacerbates European soil moisture droughts. 8, 421–426. https://doi.org/10.1038/s41558-018-0138-5van der MolenM.K., DolmanA.J., CiaisP., EglinT., GobronN., LawB.E.et al. 2011. Drought and ecosystem carbon cycling. 151, 765–773. https://doi.org/10.1016/j.agrformet.2011.01.018LuoY., AhlströmA., AllisonS.D., BatjesN.H., BrovkinV., Carvalhais, et al., 2016. Toward more realistic projections of soil carbon dynamics by Earth system models. 30, 40–56. https://doi.org/10.1002/2015GB005239WalshB., CiaisP., JanssensI.A., PeñuelasJ., RiahiK., RydzakF., et al., 2017. Pathways for balancing CO2 emissions and sinks. 8, 14856. doi: 10.1038/ncomms1485628406154OweM., Van de GriendA.A., 1998. Comparison of soil moisture penetration depths for several bare soils at two microwave frequencies and implications for remote sensing. 34, 2319–2327. https://doi.org/10.1029/98WR01469EntekhabiD., YuehS., O’NeillP., KellogK., AllenA., et al., 2014. , Jet Propulsion Lab., California Inst. Technol., Pasadena, Calif.SinghR.S., ReagerJ.T., MillerN.L., FamigliettiJ.S., 2015. Toward hyper-resolution land-surface modeling: The effects of fine-scale topography and soil texture on CLM4.0 simulations over the Southwestern U.S.: Effects of fine-scale resolution on CLM4.0 in Southwest US. 51, 2648–2667. https://doi.org/10.1002/2014WR015686DirmeyerP., WuJ., NortonH., DorigoW., QuiringS., TrentonW., et al. 2016. “Confronting Weather and Climate Models with Observational Data from Soil Moisture Networks over the United States.” 17 (4): 1049–67. doi: 10.1175/JHM-D-15-0196.129645013LiuY.Y., DorigoW.A., ParinussaR.M., de JeuR.A.M., WagnerW., McCabe, et al. 2012. Trend-preserving blending of passive and active microwave soil moisture retrievals. 123, 280–297. https://doi.org/10.1016/j.rse.2012.03.014LiuY.Y., ParinussaR.M., DorigoW.A., De JeuR.A.M., WagnerW., van DijkA.I.J.M., et al. 2011. Developing an improved soil moisture dataset by blending passive and active microwave satellite-based retrievals. 15, 425–436. https://doi.org/10.5194/hess-15-425-2011McCollK.A., AlemohammadS.H., AkbarR., KoningsA.G., YuehS., EntekhabiD., 2017. The global distribution and dynamics of surface soil moisture. 10, 100–104. https://doi.org/10.1038/ngeo2868MontzkaC., RötzerK., BogenaH.R., and VereeckenH.2018. A new soil moisture downscaling approach for SMAP, SMOS and ASCAT by predicting sub-grid variability. . 10(3):427. doi: 10.3390/rs10030427AfsharM.H., YilmazM.T., 2017. The added utility of nonlinear methods compared to linear methods in rescaling soil moisture products. 196, 224–237. https://doi.org/10.1016/j.rse.2017.05.017JinY., GeY., WangJ., HeuvelinkG.B.M., WangL., 2018. Geographically Weighted Area-to-Point Regression Kriging for Spatial Downscaling in Remote Sensing. 10, 579. https://doi.org/10.3390/rs10040579KearneyM.R., MainoJ.L., 2018. Can next-generation soil data products improve soil moisture modelling at the continental scale? An assessment using a new microclimate package for the R programming environment. 561, 662–673. https://doi.org/10.1016/j.jhydrol.2018.04.040PilesM., CampsA., Vall-llosseraM., CorbellaI., PancieraR., RudigerC., KerrY.H., et al., 2011. Downscaling SMOS-Derived Soil Moisture Using MODIS Visible/Infrared Data. 49, 3156–3166. https://doi.org/10.1109/TGRS.2011.2120615RanneyK.J., NiemannJ.D., LehmanB.M., GreenT.R., JonesA.S., 2015. A method to downscale soil moisture to fine resolutions using topographic, vegetation, and soil data. 76, 81–96. https://doi.org/10.1016/j.advwatres.2014.12.003WangA., ZhangM., ShiJ., MuT., GongH., XieC., 2012. Space-time analysis on downscaled soil moisture data and parameters of plant growth. 28, 164–169.YuG., DiL., YangW., 2008. Downscaling of Global Soil Moisture using Auxiliary Data. , pp. III-230–III–233. https://doi.org/10.1109/IGARSS.2008.4779325McBratneyA., Mendonça SantosM., MinasnyB., 2003. On digital soil mapping. 117, 3–52. https://doi.org/10.1016/S0016-7061(03)00223-4Bauer-MarschallingerB., FreemanV., CaoS., PaulikC., SchauflerS., StachlT., et al. (2019). Toward Global Soil Moisture Monitoring With Sentinel-1: Harnessing Assets and Overcoming Obstacles. , 57(1), 520–539. doi: 10.1109/TGRS.2018.2858004MoreloB., MerlinO., MalbeteauY., Al BitarA., CabotF., StefanV., et al. 2016. SMOS disaggregated soil moisture product at 1km resolution: Processor overview and first validation results. 180, 361–376https://doi.org/10.1016/j.rse.2016.02.045PikeR.J., EvansI.S., T., 2009. Chapter 1 Geomorphometry: A Brief Guide, in: . Elsevier, pp. 3–30.WilsonJ. P., & GallantJ. C. (2000). Digital terrain analysis. , 6(12), 1–27.ConradO., BechtelB., BockM., DietrichH., FischerE., GerlitzL., et al. 2015. System for Automated Geoscientific Analyses (SAGA) v. 2.1.4. 8, 1991–2007. https://doi.org/10.5194/gmd-8-1991-2015Wilson, J.P., 2012. Digital terrain modeling. Geomorphology, Geospatial Technologies and Geomorphological Mapping Proceedings of the 41st Annual Binghamton Geomorphology Symposium 137, 107–121. https://doi.org/10.1016/j.geomorph.2011.03.012FlorinskyI.V., 2016. Chapter 9—Influence of Topography on Soil Properties, in: FlorinskyI.V. (Ed.), (Second Edition). Academic Press, pp. 265–270. https://doi.org/10.1016/B978-0-12-804632-6.00009-2FlorinskyI.V., 2012. The Dokuchaev hypothesis as a basis for predictive digital soil mapping (on the 125th anniversary of its publication). 45, 445–451. https://doi.org/10.1134/S1064229312040047PellenqJ, KalmaJ, BouletG, SaulnierG-M, WooldridgeS. et al. A disaggregation scheme for soil moisture based on topography and soil depth. . 2003;276: 112–127. doi: 10.1016/s0022-1694(03)00066-0BuschFA, NiemannJD, ColemanM. Evaluation of an empirical orthogonal function-based method to downscale soil moisture patterns based on topographical attributes. . 2011;26: 2696–2709. doi: 10.1002/hyp.8363HenglT., MacMillanR.A., 2019. . OpenGeoHub foundation, Wageningen, the Netherlands, 370 pages, www.soilmapper.org, ISBN: 978-0-359-30635-0.ReichsteinM., Camps-VallsG., StevensB., JungM., DenzlerJ., CarvalhaisN., Et al. 2019. Deep learning and process understanding for data-driven Earth system science. , 566(7743), 195. doi: 10.1038/s41586-019-0912-130760912GuevaraM, OlmedoGF, StellE, YiginiY, Aguilar DuarteY, Arellano HernándezC, et al. No silver bullet for digital soil mapping: country-specific soil organic carbon estimates across Latin America. . 2018;4: 173–193. doi: 10.5194/soil-4-173-2018WarnerD.L., GuevaraM., InamdarS. and VargasR., 2019. Upscaling soil-atmosphere CO2 and CH4 fluxes across a topographically complex forested landscape. , 264, pp.80–91.CoopersmithE. J., CoshM. H., BellJ. E., & BoylesR.2016. Using machine learning to produce near surface soil moisture estimates from deeper in situ records at U.S. Climate Reference Network (USCRN) locations: Analysis and applications to AMSR-E satellite validation. ., 98, 122–131. doi: 10.1016/j.advwatres.2016.10.007QuiringS.M., FordT.W., WangJ.K., KhongA., HarrisE., LindgrenT.,et al., 2016. The North American Soil Moisture Database: Development and Applications. 97, 1441–1459. https://doi.org/10.1175/BAMS-D-13-00263.1DorigoW., WagnerW., AlbergelC., AlbrechtF., BalsamoG., Brocca, et al. 2017. ESA CCI Soil Moisture for improved Earth system understanding: State-of-the art and future directions. 203, 185–215. https://doi.org/10.1016/j.rse.2017.07.001BindlishR., JacksonT., CoshM., Tianjie ZhaoO’NeillP., 2015. Global Soil Moisture From the Aquarius/SAC-D Satellite: Description and Initial Assessment. 12, 923–927. https://doi.org/10.1109/LGRS.2014.2364151EntekhabiD., NjokuE., O’NeillP., KelloggK., CrowW., EdelsteinW., et al. 2010. The Soil Moisture Active Passive (SMAP) Mission. 98, 704–716. https://doi.org/10.1109/JPROC.2010.2043918NaeimiV., PaulikC., BartschA., WagnerW., KiddR., Park, et al. ASCAT Surface State Flag (SSF): Extracting Information on Surface Freeze/Thaw Conditions From Backscatter Data Using an Empirical Threshold-Analysis Algorithm. 50, 2566–2582. https://doi.org/10.1109/TGRS.2011.2177667NaeimiV., ScipalK., BartalisZ., HasenauerS., WagnerW., 2009. An Improved Soil Moisture Retrieval Algorithm for ERS and METOP Scatterometer Observations. 47, 1999–2013. https://doi.org/10.1109/TGRS.2008.2011617WagnerW., LemoineG., RottH., 1999. A Method for Estimating Soil Moisture from ERS Scatterometer and Soil Data. 70, 191–207. https://doi.org/10.1016/S0034-4257(99)00036-XDorigoW.A., WagnerW., HohensinnR., HahnS., PaulikC., XaverA., et al. 2011. The International Soil Moisture Network: a data hosting facility for global in situ soil moisture measurements. 15, 1675–1698. https://doi.org/10.5194/hess-15-1675-2011BeckerJ.J., SandwellD.T., SmithW.H.F., BraudJ., BinderB., DepnerJ., et al. Global Bathymetry and Elevation Data at 30 Arc Seconds Resolution: SRTM30_PLUS. 32, 355–371. https://doi.org/10.1080/01490410903297766HenglT., de JesusJ., MacMillanR., BatjesN., HeuvelinkGBM., RibeiroE, et al. SoilGrids1km—Global Soil Information Based on Automated Mapping. Bond-LambertyB, editor. . 2014;9: e105992. doi: 10.1371/journal.pone.010599225171179TuanmuM.-N., & JetzW.A global 1-km consensus land-cover product for biodiversity and ecosystem modelling. , 23(9), 1031–1045. 2014. doi: 10.1111/geb.12182ThuleauS, and HussonF. 2018. FactoInvestigate: Automatic Description of Factorial Analysis. . https://CRAN.R-project.org/package=FactoInvestigateHechenbichler, K., Schliep, K., 2006. Weighted k-nearest-neighbor techniques and ordinal classification, in: Discussion Paper 399, SFB 386.Hechenbichler, K., Schliep, K., 2004. Weighted k-Nearest-Neighbor Techniques and Ordinal Classification [WWW Document]. URL https://epub.ub.uni-muenchen.de/1769/ (accessed 12.24.16).BorraS., Di CiaccioA., 2010. Measuring the prediction error. A comparison of cross-validation, bootstrap and covariance penalty methods. 54, 2976–2989. https://doi.org/10.1016/j.csda.2010.03.004OliverM. A., & WebsterR.2014. A tutorial guide to geostatistics: Computing and modelling variograms and kriging. , 113, 56–69. doi: 10.1016/j.catena.2013.09.006HiemstraP. H., PebesmaE. J., TwenhöfelC. J. W., & HeuvelinkG. B. M.2009. Real-time automatic interpolation of ambient gamma dose rates from the Dutch radioactivity monitoring network. , 35(8), 1711–1721. doi: 10.1016/j.cageo.2008.10.011R Core Team2018. . R Foundation for Statistical Computing, Vienna, Austria. URL https://www.R-project.org/.GuevaraM. VargasR. Protocol for Downscaling Satellite Soil Moisture Estimates using Geomorphometry and Machine Learning. ; 1970; doi: 10.17504/protocols.io.6cahaseCollianderA., FisherJ. B., HalversonG., MerlinO., MisraS., Bindlish, et al. 2017. Spatial Downscaling of SMAP Soil Moisture Using MODIS Land Surface Temperature and NDVI During SMAPVEX15. , 14(11), 2107–2111. doi: 10.1109/LGRS.2017.2753203HeL., ChenJ. M., LiuJ., BélairS., & LuoX.2017. Assessment of SMAP soil moisture for global simulation of gross primary production. ., 122(7), 1549–1563. doi: 10.1002/2016JG003603TuttleS. & SalvucciG.2016. Empirical evidence of contrasting soil moisture-precipitation feedbacks across the United States. 352, 825–828. doi: 10.1126/science.aaa718527174987LawstonP. M., SantanelloJ. A., & KumarS. V.2017. Irrigation Signals Detected From SMAP Soil Moisture Retrievals. , 44(23), 11,860–11,867. doi: 10.1002/2017GL075733CollianderA., JacksonT. J., BindlishR., ChanS., DasN., KimS. B.,et al. 2017. Validation of SMAP surface soil moisture products with core validation sites. ., 191, 215–231. doi: 10.1016/j.rse.2017.01.021DiffenbaughN. S., SwainD. L., & ToumaD.2015. Anthropogenic warming has increased drought risk in California. , 112(13), 3931–3936. doi: 10.1073/pnas.142238511225733875EasterlingD. R., KunkelK. E., ArnoldJ. R., KnutsonT., LeGrandeA. N., LeungL.et al. 2017. Precipitation change in the United States. In WuebblesD. J., FaheyD. W., HibbardK. A., DokkenD. J., StewartB. C., & MaycockT. K. (Eds.), (Vol. I, pp. 207–230). Washington, DC: U.S. Global Change Research ProgramVilasaL, MirallesDG, de JeuRAM, DolmanAJ. Global soil moisture bimodality in satellite observations and climate models. Journal of Geophysical Research: . 2017;122: 4299–4311. doi: 10.1002/2016jd026099DyerJ., & MercerA.2013. Assessment of Spatial Rainfall Variability over the Lower Mississippi River Alluvial Valley. , 14(6), 1826–1843. doi: 10.2307/24914344RebaM. L., MasseyJ. H., Adviento-BorbeM. A., LeslieD., YaegerM. A., AndersM., et al. 2017. Aquifer Depletion in the Lower Mississippi River Basin: Challenges and Solutions. , 162(1), 128–139. doi: 10.1111/j.1936-704X.2017.03264.xHeuvelinkG.B. M., MillwardA.A., 1999. Error propagation in environmental modelling with GIS. 36, 69.MirallesD.G., CrowW.T., CoshM.H., 2010. Estimating Spatial Sampling Errors in Coarse-Scale Soil Moisture Estimates Derived from Point-Scale Observations. 11, 1423–1429. https://doi.org/10.1175/2010JHM1285.1Munguia-FloresF., ArndtS., GanesanA. L., Murray-TortaroloG., & HornibrookE. R. C.2018. Soil Methanotrophy Model (MeMo v1.0): a process-based model to quantify global uptake of atmospheric methane by soil. ., 11(6), 2009–2032. doi: 10.5194/gmd-11-2009-2018LindsayJ.B, CreedI.F. Removal of artifact depressions from digital elevation models: towards a minimum impact approach. . 2005;19: 3113–3126. doi: 10.1002/hyp.5835PlanchonO. & DarbouxF. (2001): A fast, simple and versatile algorithm to fill the depressions of digital elevation models. 46: 159–176GruberA., DorigoW.A., ZwiebackS., XaverA., WagnerW., 2013. Characterizing Coarse-Scale Representativeness of in situ Soil Moisture Measurements from the International Soil Moisture Network. 12, 0. https://doi.org/10.2136/vzj2012.0170Nicolai-ShawN., HirschiM., MittelbachH., SeneviratneS.I., 2015. Spatial representativeness of soil moisture using in situ, remote sensing, and land reanalysis data: SPATIAL REPRESENTATIVENESS OF SOIL MOISTURE. 120, 9955–9964. https://doi.org/10.1002/2015JD023305VargasR., SonnentagO., AbramowitzG., CarraraA., ChenJ.M., CiaisP., et al. 2013. Drought influences the accuracy of simulated ecosystem fluxes: a model-data meta-analysis for Mediterranean oak woodlands. , 16(5), pp.749–764.NelsonA., ReuterH.I., GesslerP., 2009. Chapter 3 DEM Production Methods and Sources, in: . Elsevier, pp. 65–85.TadonoT., IshidaH., OdaF., NaitoS., MinakawaK., IwamotoH., 2014. Precise Global DEM Generation by ALOS PRISM. II–4, 71–76. https://doi.org/10.5194/isprsannals-II-4-71-2014SchwingshacklC., HirschiM., SeneviratneS. I., SchwingshacklC., HirschiM., & SeneviratneS. I.2017. Quantifying Spatiotemporal Variations of Soil Moisture Control on Surface Energy Balance and Near-Surface Air Temperature. https://journals.ametsoc.org/doi/full/10.1175/JCLI-D-16-0727.1JinY., GeY., WangJ., HeuvelinkG.B.M., WangL., 2018. Geographically Weighted Area-to-Point Regression Kriging for Spatial Downscaling in Remote Sensing. 10, 579. https://doi.org/10.3390/rs10040579MasonD. C., Garcia-PintadoJ., ClokeH. L. and DanceS. L.: Evidence of a topographic signal in surface soil moisture derived from ENVISAT ASAR wide swath data, , 45, 178–186, doi: 10.1016/j.jag.2015.02.004, 201610.1371/journal.pone.0219639.r001Decision Letter 0PoulterBenjaminAcademic Editor2019Benjamin PoulterThis is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.Submission Version0
2 Aug 2019
PONE-D-19-17266
Downscaling Satellite Soil Moisture using Geomorphometry and Machine Learning
PLOS ONE
Dear Dr. Vargas,
Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.
We would appreciate receiving your revised manuscript by Sep 16 2019 11:59PM. When you are ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.
If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter.
To enhance the reproducibility of your results, we recommend that if applicable you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols
Please include the following items when submitting your revised manuscript:
A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). This letter should be uploaded as separate file and labeled 'Response to Reviewers'.
A marked-up copy of your manuscript that highlights changes made to the original version. This file should be uploaded as separate file and labeled 'Revised Manuscript with Track Changes'.
An unmarked version of your revised paper without tracked changes. This file should be uploaded as separate file and labeled 'Manuscript'.
Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out.
We look forward to receiving your revised manuscript.
Kind regards,
Benjamin Poulter
Academic Editor
PLOS ONE
Journal Requirements:
When submitting your revision, we need you to address these additional requirements.
1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at
http://www.journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and http://www.journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf
2. Thank you for stating the following in the Acknowledgments Section of your manuscript:
"MG acknowledges a fellowship from CONACyT. RV acknowledges support from the National Science Foundation CIF21 DIBBs (Grant #1724843)."
We note that you have provided funding information that is not currently declared in your Funding Statement. However, funding information should not appear in the Acknowledgments section or other areas of your manuscript. We will only publish funding information present in the Funding Statement section of the online submission form.
Please remove any funding-related text from the manuscript and let us know how you would like to update your Funding Statement. Currently, your Funding Statement reads as follows:
"RV acknowledges support from the National Science Foundation CIF21 DIBBs (Grant #1724843). "
Please provide an amended Funding Statement that declares *all* the funding or sources of support received during this specific study (whether external or internal to your organization) as detailed online in our guide for authors at http://journals.plos.org/plosone/s/submit-now
Please state what role the funders took in the study. If any authors received a salary from any of your funders, please state which authors and which funder. If the funders had no role, please state: "The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript."
c. Please include your amended statements within your cover letter; we will change the online submission form on your behalf.
3. We note that you have stated that you will provide repository information for your data at acceptance. Should your manuscript be accepted for publication, we will hold it until you provide the relevant accession numbers or DOIs necessary to access your data. If you wish to make changes to your Data Availability statement, please describe these changes in your cover letter and we will update your Data Availability statement to reflect the information you provide.
4. Please include captions for your Supporting Information files at the end of your manuscript, and update any in-text citations to match accordingly. Please see our Supporting Information guidelines for more information: http://journals.plos.org/plosone/s/supporting-information
Additional Editor Comments:
Dear Authors,
I have received two reviews for your manuscript, both consider the work interesting and important for publication. One reviewer has several detailed questions that need to be addressed and will improve the manuscript's clarity and robustness. Consider the decision between and minor and major revision as the responses need to be addressed in detail.
Ben
[Note: HTML markup is below. Please do not edit.]
Reviewers' comments:
Reviewer's Responses to Questions
Comments to the Author
1. Is the manuscript technically sound, and do the data support the conclusions?
The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.
Reviewer #1: Yes
Reviewer #2: Partly
**********
2. Has the statistical analysis been performed appropriately and rigorously?
Reviewer #1: Yes
Reviewer #2: No
**********
3. Have the authors made all data underlying the findings in their manuscript fully available?
The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.
Reviewer #1: Yes
Reviewer #2: No
**********
4. Is the manuscript presented in an intelligible fashion and written in standard English?
PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.
Reviewer #1: Yes
Reviewer #2: Yes
**********
5. Review Comments to the Author
Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)
Reviewer #1: I enjoyed reading the paper by Guevara et al., which addresses the downscaling of soil moisture data using geo-morphometry. The work is useful and very well explained in the manuscript. I recommend to accept it with few minor comments:
1. Authors can add some references about different disaggregation methods applied on soil moisture datasets for downscaling, specifically related to topography or geomorphology, like:
Pellenq et al 2003, A disaggregation scheme for soil moisture based on topography and soil depth
2. Line 193: It will be nice to provide explicitly what are the primary hydrologically meaningful terrain parameters authors are referring to.
3. A little description of these terrain parameters are required to be discussed.
4. It will be great to add some discussion about the systematic error (Bias)
Reviewer #2: The manuscript by Guevara and Vargas, entitled 'Downscaling Satellite Soil Moisture using Geomorphometry and Machine Learning', covers an interesting topic appropriate for PLOS ONE. The authors developed an approach to downscale the annual soil moisture at 25 km resolution derived from ESA CCI satellite products to 1 km resolution using a machine learning approach. A set of geomorphological parameters were applied to train the model using a k-NN algorithm. I think the study is original in the sense that it provides an alternative approach for downscaling methodologies. However, I feel the downscaled results were not well justified and the writing are not clear and can be strengthened. I have several concerns as listed below:
- The title is a bit misleading as it's too generalized. The current title doesn't suggest for what temporal resolution (annual or monthly), nor for what regions this approach can be applied. Validating the results for the US continent doesn't necessarily prove that it can be applied in other regions that have different climate (e.g. high latitudes and tropics).
- The calculation of terrain parameters. It is surprised to see that some of the parameters in Figure 2 show fairly low values with almost none spatial heterogeneity. Is that a rendering problem or due to the normalization? This seems incorrect. For example, the wetness index is supposed to have significant spatial heterogeneity. See HYDRO1K or HydroSHEDS products for some of these parameters.
- The downscaled 1k soil moisture. The results don’t suggest that it maintain the original values of the satellite product. Why the downscaled results seem to have a systematical lower value than the original one? Is that because of the calculation of terrain parameters? Also, the spatial heterogeneity is not higher in the downscaled product as it’s supposed to be. Downscaled products are supposed to provide more details at a higher resolution. The legend of Figure 3 and Figure S2 are not consistent, which makes it hard to compare the downscaled with the original visually.
- The parameters used in the machine learning. This study uses a few terrain parameters to train their model - Here is an implicit assumption that the soil moisture is mainly controlled by geomorphological information. But soil moisture is proved to be strongly influenced by vegetation via its regulation of evapotranspiration.
Specific comments:
Paragraph Line58-68. It seems the first outline sentence doesn’t match the following content.
Line 164-165: need to be more specific on how you did the downscale.
Line 175: not a standard format. Is this a personal communication?
Line 177: why the ISMN is unlike the NASMD? Please elaborate
Line 181-182? Be specific on what parameters are used, or at least mention the list in Table S1.
Line 198: how to harmonize? Not clear
Line 199: where is section 2.3?
Line 214: what soil moisture map ? daily or annual? If it is daily, how did you do the aggregation?
Line 241-249. I don’t understand why a spatial interpolation for R2 map is needed. I don't think an interpolated R2 can be used for justification. Using an Ordinary Kriging doesn't make sense as Ordinary Kriging assumes your R2 points is spatially-correlated.
Supplement Figure 3A. There is no dotted blue line.
Table 1. The Unit of RMSE is needed.
**********
6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.
If you choose “no”, your identity will remain anonymous but your review may still be made public.
Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.
Reviewer #1: No
Reviewer #2: No
[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files to be viewed.]
While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org. Please note that Supporting Information files do not need this step.
10.1371/journal.pone.0219639.r002Author response to Decision Letter 0Submission Version1
6 Sep 2019
Response to reviewers: PONE-D-19-17266
Downscaling Satellite Soil Moisture using Geomorphometry and Machine Learning
General journal comments
Comment: Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.
Response: Thank you for the opportunity to submit a revised version of our work. We have revised the points raised by the reviewers and we believe that now this is a substantially improved manuscript.
Comment: To enhance the reproducibility of your results, we recommend that if applicable you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols
Response: We have now included the protocol to reproduce this research in “protocols.io”. The link is dx.doi.org/10.17504/protocols.io.6cahase and is cited in the manuscript in line 283.
Comment: Thank you for stating the following in the Acknowledgments Section of your manuscript: "MG acknowledges a fellowship from CONACyT. RV acknowledges support from the National Science Foundation CIF21 DIBBs (Grant #1724843)."
We note that you have provided funding information that is not currently declared in your Funding Statement. However, funding information should not appear in the Acknowledgments section or other areas of your manuscript. We will only publish funding information present in the Funding Statement section of the online submission form.
Please remove any funding-related text from the manuscript and let us know how you would like to update your Funding Statement. Currently, your Funding Statement reads as follows:
"RV acknowledges support from the National Science Foundation CIF21 DIBBs (Grant #1724843). "
a. Please provide an amended Funding Statement that declares *all* the funding or sources of support received during this specific study (whether external or internal to your organization) as detailed online in our guide for authors at http://journals.plos.org/plosone/s/submit-now
a. Please state what role the funders took in the study. If any authors received a salary from any of your funders, please state which authors and which funder. If the funders had no role, please state: "The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript."
c. Please include your amended statements within your cover letter; we will change the online submission form on your behalf.
Response: This section has been edited following the journal suggestions for acknowledgements.
Comment: We note that you have stated that you will provide repository information for your data at acceptance. Should your manuscript be accepted for publication, we will hold it until you provide the relevant accession numbers or DOIs necessary to access your data. If you wish to make changes to your Data Availability statement, please describe these changes in your cover letter and we will update your Data Availability statement to reflect the information you provide.
Response: We have included the data repository information and obtained a DOI .
Guevara, M., R. Vargas (2019). Annual soil moisture predictions across conterminous United States using remote sensing and terrain analysis across 1 km grids (1991-2016), HydroShare, https://doi.org/10.4211/hs.b8f6eae9d89241cf8b5904033460af61
Comment: Please include captions for your Supporting Information files at the end of your manuscript, and update any in-text citations to match accordingly. Please see our Supporting Information guidelines for more information: http://journals.plos.org/plosone/s/supporting-information
Response: We have included captions of the Supporting Information at the end of the revised manuscript.
Additional Editor Comments:
Comment: I have received two reviews for your manuscript, both consider the work interesting and important for publication. One reviewer has several detailed questions that need to be addressed and will improve the manuscript's clarity and robustness. Consider the decision between and minor and major revision as the responses need to be addressed in detail.
Response: We would like to thank the two referees and editor for reviewing our manuscript, for providing constructive suggestions and for the opportunity to resubmit a revised and more detailed version of the manuscript. We have carefully reviewed the comments and have improved the manuscript accordingly.
Reviewers' comments:
Reviewer #1:
Comment: I enjoyed reading the paper by Guevara et al., which addresses the downscaling of soil moisture data using geo-morphometry. The work is useful and very well explained in the manuscript. I recommend to accept it with few minor comments:
Response: We would like to thank Reviewer #1 for his/her valuable feedback to improve this work. We have revised the comments and improved the manuscript accordingly.
Comment: Authors can add some references about different disaggregation methods applied on soil moisture datasets for downscaling, specifically related to topography or geomorphology, like: Pellenq et al 2003, A disaggregation scheme for soil moisture based on topography and soil depth
Response: As suggested by the reviewer, we have included new references of soil moisture disaggregation methods based on topography in introduction and discussion. The added references are (lines 111:113):
Pellenq J, Kalma J, Boulet G, Saulnier G-M, Wooldridge S, Kerr Y, et al. A disaggregation scheme for soil moisture based on topography and soil depth. Journal of Hydrology. 2003;276: 112–127. doi:10.1016/s0022-1694(03)00066-0
Busch FA, Niemann JD, Coleman M. Evaluation of an empirical orthogonal function-based method to downscale soil moisture patterns based on topographical attributes. Hydrological Processes. 2011;26: 2696–2709. doi:10.1002/hyp.8363
Comment: 2. Line 193: It will be nice to provide explicitly what are the primary hydrologically meaningful terrain parameters authors are referring to.
Response: We have improved the definition of primary and secondary terrain parameters in the introduction (lines 97-100) and methods (lines 210-216) of the revised version of the manuscript.
Comment: 3. A little description of these terrain parameters are required to be discussed.
Response: We have improved our description of these terrain parameters in the introduction (lines 97-100) and methods (lines 210-216). We have briefly discussed the main implications of using only these parameters for downscaling soil moisture in this revised version (see lines 435-437).
Comment: 4. It will be great to add some discussion about the systematic error (Bias)
Response: We have included more information about systematic error (mean bias and RMSE) in our results section (lines 391-395, 500-511). Please note from Table 1 a consistent (cross-validated) error estimate (0.03 m3/m3) across all analyzed years. This error is larger when compared against the NASMD but similar between both products (0.057 for the downscaled estimate, and 0.062 for the original product). From our field validation, we also found a general negative mean bias in the original estimates (-0.051) that propagates to our downscaled product (-0.048). Note that systematic error (indicated by the mean bias and the RMSE is slightly lower than in the downscaled product), supporting the significant improvement in the explained variance (from 0.01 to 0.47) described in Figure 5 of the main text.
Reviewer #2:
Comment: The manuscript by Guevara and Vargas, entitled 'Downscaling Satellite Soil Moisture using Geomorphometry and Machine Learning', covers an interesting topic appropriate for PLOS ONE. The authors developed an approach to downscale the annual soil moisture at 25 km resolution derived from ESA CCI satellite products to 1 km resolution using a machine learning approach. A set of geomorphological parameters were applied to train the model using a k-NN algorithm. I think the study is original in the sense that it provides an alternative approach for downscaling methodologies. However, I feel the downscaled results were not well justified and the writing are not clear and can be strengthened. I have several concerns as listed below:
Response: We would like to thank Reviewer #2 for his/her comments. We have revised the concerns and clarified the text to show how our results support our main conclusion.
Comment: The title is a bit misleading as it's too generalized. The current title doesn't suggest for what temporal resolution (annual or monthly), nor for what regions this approach can be applied. Validating the results for the US continent doesn't necessarily prove that it can be applied in other regions that have different climate (e.g. high latitudes and tropics).
Response: This manuscript presents a prove of concept for the application of terrain parameters for downscaling satellite derived soil moisture. Although the region of interest does not represent all the variability encountered across the world, it does represent a large climatic and ecological gradient. Our results show that indeed the accuracy of the method will vary across areas with different data availability for training and validating models (e.g., areas of large gaps in the ESA-CCI soil moisture product). We focus our validation framework in the US given the relatively high availability of soil moisture field data (and relatively good coverage of the ESA-CCI soil moisture product), which is usually sparse or not existent across large areas in higher latitudes or in the tropics. The opportunity for validation is the main reason why we chose this region but we are working on a larger application of this method across the world (manuscript in preparation).
Comment: The calculation of terrain parameters. It is surprised to see that some of the parameters in Figure 2 show fairly low values with almost none spatial heterogeneity. Is that a rendering problem or due to the normalization? This seems incorrect. For example, the wetness index is supposed to have significant spatial heterogeneity. See HYDRO1K or HydroSHEDS products for some of these parameters.
Response: The reviewer’s concern is mainly an issue of visualization of Figure 2, but the high resolution products at 1km resolution are available at https://www.hydroshare.org/resource/b8f6eae9d89241cf8b5904033460af61/.
To show this we present in this response to the reviewers an example of the spatial detail in the topographic wetness index using 1 km grids. We also clarify that there are contrasting ranges of values in the terrain parameters calculated across CONUS (radians, meters, unitless). Consequently, we normalized these parameters by centering their means to 0 by a unit of variance. That is the reason why there are apparent low levels of variation in some parameters of Figure 2 in the main text. The figure below is showing a screenshot of the spatial heterogeneity (at the national level and across an elevation gradient of the Appalachian mountains) of the topographic wetness index at 1km pixel size, used in this study. Again, the final products/maps at 1km resolution are now available at https://www.hydroshare.org/resource/b8f6eae9d89241cf8b5904033460af61/.
FIGURE legend. Example of spatial detail of the topographic wetness index using 1km grids. a) topographic wetness index at 1km of spatial resolution across the conterminous United States (screenshot from SAGA-GIS), b) zoom in to the topographic wetness index across an elevation gradient in the Appalachian mountains of eastern CONUS.
Comment: The downscaled 1k soil moisture. The results don’t suggest that it maintain the original values of the satellite product. Why the downscaled results seem to have a systematic lower value than the original one? Is that because of the calculation of terrain parameters? Also, the spatial heterogeneity is not higher in the downscaled product as it’s supposed to be. Downscaled products are supposed to provide more details at a higher resolution. The legend of Figure 3 and Figure S2 are not consistent, which makes it hard to compare the downscaled with the original visually.
Response: We clarify that our results show that the statistical distribution of the downscaled product is similar to the original ESA-CCI (see figure below). We also show that within the 95% confidence intervals (CI) our results capture relatively well the original variability of the ESA-CCI (i.e., no statistical differences between PDFs). We clarify that representing extreme values (i.e., >95% CI) is challenging for any model and our model is relatively lower for values >95% CI that represent the minority of the dataset. Most importantly, our results show that the downscaled soil moisture product has a better agreement with the “ground truth information” from the NASMD than the original ESA-CCI (see Figure 5), thus this demonstrates a substantial improvement using our proposed approach.
FIGURE legend. Probability distribution functions (PDFs) of the original satellite estimate (black) and the downscaled product (red). The vertical lines show the limits of the 95% of the data distribution of both soil moisture products.
Comment: The parameters used in the machine learning. This study uses a few terrain parameters to train their model - Here is an implicit assumption that the soil moisture is mainly controlled by geomorphological information. But soil moisture is proved to be strongly influenced by vegetation via its regulation of evapotranspiration.
Response: The reviewer is correct to say that vegetation is also important to regulate soil moisture but we want to avoid including vegetation information into our model to have an independent estimate that can be used/compared with vegetation information in further analyzes. If vegetation information is included in model predictions then the resulting soil moisture could lead to spurious correlations in further soil moisture – vegetation analyses. This is why we kept an independent product that will avoid potential statistical redundancies by future users/analyzes.
Consequently, we focused our methods only on topographic parameters and our results show that there is an improvement in agreement with the NASMD compared with the original ESA-CCI product. Previous studies have also found a topographic signal on satellite soil moisture (Mason et al., 2016) with potential implications for downscaling satellite soil moisture estimates. Here, we show how combining (in a model) satellite soil moisture estimates and multiple terrain parameters is useful to generate a soil moisture product that showed higher agreement with field soil moisture measurements compared with the satellite soil moisture estimates only. Given the influence of topography on hydrological, climatological and ecological variability, our results support the reliability of the implicit assumption highlighted by the reviewer, when a) the input data is satellite soil moisture, b) the geomorphological information is represented by digital terrain parameters derived by the means of geomorphometry and c) the statistical model should be able to account for non linear relationships and it should be parameterized and validated with some resampling technique (such as repeated cross validation) or field soil moisture data (e.g., in the NASMD or the ISMN).
Reference
Mason, D. C., Garcia-Pintado, J., Cloke, H. L. and Dance, S. L.: Evidence of a topographic signal in surface soil moisture derived from ENVISAT ASAR wide swath data, International Journal of Applied Earth Observation and Geoinformation, 45, 178–186, doi:10.1016/j.jag.2015.02.004, 2016.
Specific comments:
Comment: Paragraph Line 58-68. It seems the first outline sentence doesn’t match the following content.
Response: The connection of ideas in this paragraph has been improved in the revised version of the manuscript.
Comment: Line 164-165: need to be more specific on how you did the downscale.
Response: We have improved the narrative of this sentence explaining the main components of our downscaling effort. We have also made available our protocol for downscaling satellite soil moisture using topography for improving the transparency and reproducibility of this research: dx.doi.org/10.17504/protocols.io.6cahase.
Comment: Line 175: not a standard format. Is this a personal communication?
Response: It is a research manuscript, we have edited the citation style.
Comment: Line 177: why the ISMN is unlike the NASMD? Please elaborate
Response: We meant to say that the standardization procedures are different between the global effort and the North American effort. The differences between the NASMD and the ISMN are in the quality control and are explained and discussed in previous studies (Dirmeyer et al., 2016). While the ISMN uses global reference values, the NASMD uses a quality control based on North America reference values and the NASMD includes the information contained in the ISMN for CONUS. In the ISMN ‘basic quality control is performed and records suspected to be out-of-range or otherwise untrustworthy are flagged, but no data is omitted’ (Dirmeyer et al., 2016). The NASMD provides a single (quality controlled) value for each station (location, i.e., latitude and longitude) measuring soil moisture, while the ISMN provides raw data from each soil moisture instrument on each station (location, i.e., latitude and longitude) measuring soil moisture. This the NASMD integrates a single value from multiple soil moisture instruments taking measurements simultaneously and this more convenient that using the raw values of each instrument for the purpose of comparing values with pixels of soil moisture that are representative of a larger area than the soil moisture field station.
Reference
Dirmeyer, P. A., Wu, J., Norton, H. E., Dorigo, W. A., Quiring, S. M., Ford, T. W., et al. (2016). Confronting Weather and Climate Models with Observational Data from Soil Moisture Networks over the United States. Journal of Hydrometeorology, 17(4), 1049–1067. https://doi.org/10.1175/jhm-d-15-0196.1
Comment: Line 181-182? Be specific on what parameters are used, or at least mention the list in Table S1.
Response: Now we clearly explain which parameters are used and we explicitly refer the reader to Table S1 (line 193 and line 217).
Comment: Line 198: how to harmonize? Not clear
Response: We explain that we extracted the values of terrain parameters at the locations of the central coordinates of pixels in the coarser gridded soil moisture product. (lines 217-219)
Comment: Line 199: where is section 2.3?
Response: This was a mistake, we have edited the sentence.
Comment: Line 214: what soil moisture map ? daily or annual? If it is daily, how did you do the aggregation?
Response: We meant to say that a model was built for each year of satellite soil moisture data. These satellite data was aggregated from its original temporal resolution to a yearly basis based on the median value. The time scale of our predictions are clearly explained in the abstract and throughout the manuscript.
Comment: Line 241-249. I don’t understand why a spatial interpolation for R2 map is needed. I don't think an interpolated R2 can be used for justification. Using an Ordinary Kriging doesn't make sense as Ordinary Kriging assumes your R2 points is spatially-correlated.
Response: We think that the R2 map provides information on the accuracy of our framework across areas where no field data for validation is available. We provide the R2 values (i.e., model results with the NASMD) in Figure 6. Then, we provide Figure 4f to show the spatial trend of the R2 values across CONUS. This map is supported by a variogram analysis (showing evidence of R2 points spatially-correlated) and we report parameter values (sill > nugget) in the methods section (lines 277-279). These parameters and the fitted variogram model are shown along the R2 Kriging prediction and its associated Kriging standard error in the figure below (see figure below).
FIGURE 3. Screenshot of the native plot of the fitted R2 Kriging model (in the R package automap). This screenshot includes the R2 Kriging prediction and its associated Kriging standard error. The variogram parameters and the fitted variogram model is included in this screenshot.
Comment: Supplement Figure 3A. There is no dotted blue line.
Response: This was an issue with the original resolution of the figure. The blue line indicates soil moisture in Supplementary Figure 3B. The Supplementary Figure S3A shows the cloud of points only, and the Figure S3B shows the variables. We have highlighted the soil moisture blue line to facilitate the interpretation of Supplementary Figure S3B.
Comment: Table 1. The Unit of RMSE is needed.
Response: We have included the units of the RMSE in Table 1.
Submitted filename: Response_to_reviewers.docx
10.1371/journal.pone.0219639.r003Decision Letter 1PoulterBenjaminAcademic Editor2019Benjamin PoulterThis is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.Submission Version1
10 Sep 2019
Downscaling Satellite Soil Moisture using Geomorphometry and Machine Learning
PONE-D-19-17266R1
Dear Dr. Vargas,
We are pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it complies with all outstanding technical requirements.
Within one week, you will receive an e-mail containing information on the amendments required prior to publication. When all required modifications have been addressed, you will receive a formal acceptance letter and your manuscript will proceed to our production department and be scheduled for publication.
Shortly after the formal acceptance letter is sent, an invoice for payment will follow. To ensure an efficient production and billing process, please log into Editorial Manager at https://www.editorialmanager.com/pone/, click the "Update My Information" link at the top of the page, and update your user information. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.
If your institution or institutions have a press office, please notify them about your upcoming paper to enable them to help maximize its impact. If they will be preparing press materials for this manuscript, you must inform our press team as soon as possible and no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.
With kind regards,
Benjamin Poulter
Academic Editor
PLOS ONE
Additional Editor Comments (optional):
Thank you for your re-submission to PLOS ONE and for addressing the reviewer comments in detail. I am pleased to accept this version for publication.
Reviewers' comments:
10.1371/journal.pone.0219639.r004Acceptance letterPoulterBenjaminAcademic Editor2019Benjamin PoulterThis is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
16 Sep 2019
PONE-D-19-17266R1
Downscaling Satellite Soil Moisture using Geomorphometry and Machine Learning
Dear Dr. Vargas:
I am pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.
If your institution or institutions have a press office, please notify them about your upcoming paper at this point, to enable them to help maximize its impact. If they will be preparing press materials for this manuscript, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.
For any other questions or concerns, please email plosone@plos.org.