Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Evaluation of Gridded Precipitation Data for Driving SWAT Model in Area Upstream of Three Gorges Reservoir

  • Yan Yang,

    Affiliations College of Water Sciences, Beijing Normal University, Beijing 100875, China, United Graduate School of Agricultural Science, Gifu University, 1-1 Yanagido, Gifu, 501-1193, Japan

  • Guoqiang Wang ,

    wanggq@bnu.edu.cn

    Affiliation College of Water Sciences, Beijing Normal University, Beijing 100875, China

  • Lijing Wang,

    Affiliation State Environmental Protection Key Laboratory of Drinking Water Source Protection, Chinese Research Academy of Environmental Sciences, Beijing 100012, China

  • Jingshan Yu,

    Affiliation College of Water Sciences, Beijing Normal University, Beijing 100875, China

  • Zongxue Xu

    Affiliation College of Water Sciences, Beijing Normal University, Beijing 100875, China

Abstract

Gridded precipitation data are becoming an important source for driving hydrologic models to achieve stable and valid simulation results in different regions. Thus, evaluating different sources of precipitation data is important for improving the applicability of gridded data. In this study, we used three gridded rainfall datasets: 1) National Centers for Environmental Prediction - Climate Forecast System Reanalysis (NCEP-CFSR); 2) Asian Precipitation - Highly-Resolved Observational Data Integration Towards Evaluation (APHRODITE); and 3) China trend - surface reanalysis (trend surface) data. These are compared with monitoring precipitation data for driving the Soil and Water Assessment Tool in two basins upstream of Three Gorges Reservoir (TGR) in China. The results of one test basin with significant topographic influence indicates that all the gridded data have poor abilities in reproducing hydrologic processes with the topographic influence on precipitation quantity and distribution. However, in a relatively flat test basin, the APHRODITE and trend surface data can give stable and desirable results. The results of this study suggest that precipitation data for future applications should be considered comprehensively in the TGR area, including the influence of data density and topography.

Introduction

Precipitation data are generally recognized as the most important driving data for hydrologic models. However, constrained by the sparse distribution of observation stations, the applicability of such models is limited. With development of modern observation and massive computing technologies, the estimation of precipitation based on combination of multi-source data (historical observed, radar and satellite) has become a feasible means for extending model applications. Various data are published for global or regional modeling, including climate change, global water cycle, and ecology modeling [1][3]. For example, Chappell [4] used TRMM data, station observed data and a kernel-based statistical blending algorithm to produce 5-km resolution gridded precipitation data for Australia. Additionally, Li et al. [5] and Huang et al. [6] used spline interpolation and the trend surface methods to generate 5-km resolution gridded precipitation data for New Zealand and China. With the expanded application requirements, these data have been widely used for hydrologic modeling in various studies and regions [7], [8], [9].

However, errors in the gridded precipitation data have a high probability for inducing errors and uncertainties in hydrologic simulations. Thus, validations and evaluations of these gridded data require attention. The gridded data are generally compared with observed data. For example, Wang and Zeng [10] evaluated six reanalysis climate data products (MERRA, NCEP/NCAR-1, NCEP-CFSR, ERA-40, ERA-Interim, and GLDAS) versus in situ measurements at 63 weather stations on the Tibetan Plateau, showing that the NCEP-CFSR (National Centers for Environmental Prediction Climate Forecast System Reanalysis) data had the best overall performance. Bao et al. [11] successfully evaluated four different data (NCEP-NCAR reanalysis, NCEP-CFSR, ERA-40, and ERA-Interim) products based on an enhanced observed network, and the results showed that the performance of NCEP-CFSR and ERA-Interim data were superior for the Tibetan Plateau. Areal accuracy and validity of gridded precipitation data are evaluated using a hydrologic model. For example, Fuka et al. [12] presented a method using the NCEP-CFSR global meteorological data to model five watersheds representing different hydro-climate regimes. The results proved that NCEP-CFSR precipitation can provide runoff simulations that are as good as or better than traditional weather gauging stations. Vu et al. [13] compared five different sources of rainfall data for the Vietem River basin using the Soil and Water Assessment Tool (SWAT) model, finding that all the gridded data could capture hydrologic processes in this tropical basin well. For model driving, most gridded precipitation data are developed for application to much larger basins, for example continental or global scale [1], [3], and their applicability is always validated in such large basins [12], [13], [14]. In small basins, it is commonly recognized that precipitation is mainly influenced by topographic, wind direction, hill aspect, and other factors, and the creation or reanalysis of precipitation data in such small basin usually required more detailed information compared with the data in large scale basins [15], [16], [17]. However, the applicability of various gridded data to such basins requires further investigation.

The Three Gorges Reservoir (TGR) in the middle reach of the Yangtze River is the largest hydropower project in the world. Although TGR has the benefit of flood control and electric generation, its influence on hydrologic processes, its alteration of ecological systems and the environment has attracted public attention. Hydrologic models have been applied to this region to analyze these influences at different spatiotemporal scales [18][21]. However, limited by sparse observation stations in the mountains and missing observations in immigration regions, the applicability of those models remains limited in this study area. Additionally, water resource analysis and pollution control in the TGR area are needed for local tributary basins, which have strong topographic variation, relatively small scales, and a lack of dense observed data. Although global or regional gridded precipitation data are probable sources for model simulation in such regions, the applicability of these data require further study.

Thus, the main objective of this study was to evaluate various gridded precipitation data by both station validation and model driving tests for the area upstream of TGR. In this area, two subbasins of the Pengxi River were selected as the test basin. NCEP-CFSR, Asian Precipitation - Highly-Resolved Observational Data Integration Towards Evaluation (APHRODITE), and trend surface data were used for the evaluation. These gridded data were compared with observed data. Then, simulation results of the data were appraised based on quantitative evaluation indices of model simulation and uncertainties. Spatial density and receptiveness of the topography for the various data are also discussed.

Study Area and Data

Study basins

We selected two test basins (of the Dong and Puli rivers) for study. These basins belong to the basin of the Pengxi River, one of the main tributaries on the north shore of TGR [21]. In this region, average annual precipitation is 1100–1500 mm. Daily discharge data were collected from gauging stations in the two basins (Figure 1; Table 1), Wenquan in the Dong River basin and Yujia in the Puli River basin. Wenquan is at the outlet of the Dong River basin, and Yujia is on the upper main channel of the Puli River basin. Drainage area of Wenquan station is 1098 km2, and 366 km2 for Yujia station (Table 1).

thumbnail
Table 1. Basic geo-information of Dong River and Puli River basins.

https://doi.org/10.1371/journal.pone.0112725.t001

Digital elevation model data at resolution 90 m (Shuttle Radar Topography Mission) were used to generate topographic information for the SWAT model. A land-use map extracted from Landsat Thematic Mapper data [21] and a soil map were used to generate hydrologic response units (HRUs), which were used as basic cells to calculate parameters in SWAT. Land-use types were reclassified into nine types [21]. Basin soil data were identified from the Chinese national 1∶1,000,000 scale soil map including five soil types [21], [22].

Daily climate inputs for SWAT included precipitation, minimum and maximum air temperature, and wind speed. The air temperature and wind speed data were collected from meteorological stations within the study area (Figure 1). Details of the various precipitation data are introduced in Section 2.2. Additional climate variables such as solar radiation and relative humidity were produced from a weather generator, using values from the nearest standard weather station [21], [22].

Various Precipitation datasets

Monitoring Precipitation data.

The monitoring precipitation data were acquired from a local observed agency. These were from four stations in the Dong River basin and three in the Puli River basin (Table 2). The period for all stations is 1 January 2000 to 31 December 2006.

thumbnail
Table 2. Basic geo-information of precipitation observation stations in the Dong River and the Puli River basins.

https://doi.org/10.1371/journal.pone.0112725.t002

NCEP-CFSR data.

The NCEP-CFSR data is available for 1979–2013. They were designed and implemented as a global, high-resolution coupled atmosphere-ocean-land surface-sea ice system to provide the best estimate of the state of these coupled domains in the period [1]. The optimum interpolation algorithm of Xie et al. [23] is employed to partially account for the orographic enhancements in precipitation [1]. Although the original NCEP-CFSR data were based on a four-times-per-day step, the official website of the SWAT model (http://globalweather.tamu.edu/) supplied daily assimilation data for global application, as with the APHRODITE data, and the period of study was also 1 January 2000 to 31 December 2006. Spatial resolution of the NCEP-CFSR data is 0.5 degrees. See File S1 for the detail NCEP-CFSR data of the test basins.

APHRODITE data.

The APHRODITE project created continuous daily gridded precipitation data for 1951–2007. These data cover monsoon Asia, the Middle East, and Russia. Version V1101R1 data of monsoon Asia were used in this study, with spatial resolution of 0.25 degrees [3]. The available data period was 1 January 1979 to 31 December 2007. The data cover the entire Pengxi River basin and the period 1 January 2000 to 31 December 2007 (Figure 1). The APHRODITE data combined the monitoring data in mountain areas and the Parameter-Elevation Regressions on Independent Slopes Model (PRISM) monthly precipitation climatology to correct the bias caused by orographic effects of Xie et al. [23]. File S2 list the APHRODITE data of the test basins in this study.

Trend surface data.

Data of the land surface model forcing field for mainland China during 1958–2010, was provided by Li et al. [5]. We used the precipitation data from this dataset, with temporal and spatial resolutions daily and 0.05 degrees, respectively. The period is the same as that of the other data. The trend surface dataset applied the thin smoothing spline method to construct the surface trend of the precipitation; however, limited to the available of observed precipitation data, the trend surface product of precipitation didn't include the correction of elevation [5]. File S3 list the trend surface data of the test basins in this study.

Methodology

Soil Water Assessment Tool

SWAT is a temporally continuous, semi-distributed hydrologic model that was used to represent hydrologic and water quality processes in various basins with different temporal and spatial scale [24]. SWAT subdivides a watershed into subbasins connected by a stream network, and further delineates HRUs with unique combinations of land cover and soils in each subbasin. The hydrologic routines within SWAT account for snowfall and melt, vadose zone processes (infiltration, evaporation, plant uptake, lateral flows, and percolation), and groundwater flows [25], [26]. A modified SCS curve number method is employed to simulate the river runoff in the SWAT model, the Evapotranspiration (ET) is mainly calculated by using the Penman-Monteith method [25]. Moreover, the river channel routing of SWAT model is calculated by using a variable storage method [26].

Model calibration settings

We used a daily simulation step for the SWAT model. The observed data is from 2002 to 2006 in Dong River basin, and it is from 2002 to 2004 in Puli River basin. Given a lack of river discharge observed data in the study area during the 2000s, we only conduct the validation in Dong River basin from 2005 to 2006. For the Dong River basin and Puli River basin, the calibration period is from 2002 to 2004.

We used the SWAT model calibration software SWAT-CUP 4.3.7 [27] to perform auto-calibration in the two study areas. SWAT-CUP is a public domain program, and thus can be copied and used freely. Among the model calibration methods available with SWAT-CUP, SUFI-2 has proven effective for calibrating and validating the SWAT model in different regions, with limited computational cost and high accuracy [27], [28]. In our study areas, applicability of the SUFI-2 method was also demonstrated by Wang et al. [21] and Yang et al. [22] for runoff and water quality modeling.

Results of hydrologic models are mainly controlled by two factors, input data and model structure, which is usually indicated by its parameterization. Thus, to conduct a “pure” evaluation for various precipitation data, we followed these rules:

  1. The initial sampling ranges of the parameters were the same for both basins (Table 3)
  2. Parameter sets generated by the Latin hypercube method [27] were the same for all precipitation data in each test basin
  3. Repetition of SWAT-CUP was fixed at 1000 times for different input rainfall data
thumbnail
Table 3. Parameters used for auto-calibration of the two test basins.

https://doi.org/10.1371/journal.pone.0112725.t003

Evaluation of Model Performance

Parameter sensitivity.

We used 18 parameters for model calibration (Table 3). The parameter sensitivity analysis method of SWAT-CUP is based on the Latin hypercube [29], [30] and multiple regression methods. The multiple regression equation is as followed:(1)

Here, is the value of evaluation index for the model simulations, is a constant in multiple linear regression equation is a coefficient of the regression equation, is a parameter generated by the Latin hypercube method, and n is the amount of parameters. The Latin hypercube method ensures that the full range of all parameters has been sampled. The multiple regression method that assigns changes of each model run output can be unambiguously attributed to the changed parameters [27], [31]. The t-test of this equation is applied to indicate the sensitivity of each parameter .

Evaluation of runoff simulation.

The Nash Sutcliffe Efficiency (NSE) [32] and coefficient of determination (R2) were chosen as indexes to evaluate the performance of various precipitation data for driving the hydrologic model. NSE and R2are calculated as(2)(3)

Here, is observed runoff on day i, is average observed runoff, is the simulated value on day i, and is average simulated runoff. The NSE is also used as the evaluation index () for the parameter sensitivity analysis in the Eq.(1).

Evaluation of model uncertainty.

In SWAT-CUP, output uncertainty is quantified by the 95% prediction uncertainty band (95PPU), calculated at the 2.5 and 97.5% levels of the cumulative distribution function of the output variables [27], [28]. Two indices, the r-factor and p-factor, are used for evaluating output uncertainty based on the 95PPU. The r-factor is calculated by(4)

Here, and represent the upper and lower boundaries of the 95PPU, is the observed data and symbolizes the standard deviation of the measured data [28]. The p-factor is the percentage of observed data bracketed by the 95PPU band, and the r-factor stands for thickness of the uncertainty band [28].

The threshold for distinguishing the behavioral and non-behavioral simulation was set to 0.5 for NSE. Optimal calibration and parameter uncertainty for both methods was measured on the basis of proximity of the p-factor to 100% and r-factor to 1 [28].

Results

Based on the input data, results of watershed delineation and weather station distribution are listed in Table 4. There are 37 subbasins of the Dong River basin and 13 subbasins in the Puli River basin. Based on nearest distance, which is calculated using station location and geographic centers of the subbasins, the SWAT model automatically assigns precipitation stations to each subbasin. The amounts of precipitation data in the two test basins are shown in Figure 1.

Station validation and spatial distribution of various precipitation data

Station validation.

We also used four stations in the Dong River basin to evaluate accuracy of the various precipitation data. Three different evaluation indices were determined: 1) correlation efficiency (R), 2) mean error (ME), and 3) mean absolute error (MAE). Equations for these indices are(5)(6)(7)

Here, is observed precipitation data on day i, is the average value of observed precipitation data, represents the gridded precipitation data on day i, is the average value of the gridded precipitation data, and n is the total number of observed values.

The observation station and nearest station were used for comparison. The results are presented in Table 5. The results of R for the Dong River basin show that at all four stations, NCEP-CFSR had a satisfactory performance compared with the other two datasets. The APHRODITE and trend surface data achieved reasonable R values compared with the NCEP-CFSR data in Puli River basin. Regarding ME, the APHRODITE and trend surface data produced at all stations, a negative difference compared with the monitoring precipitation data indicated that the first two datasets underestimated the precipitation data at those sites. Regarding the NCEP-CFSR data, positive ME values at DJ and GM stations indicate overestimation of the precipitation. In contrast, at WQ and YSH stations, NCEP-CFSR underestimated the precipitation. The NCEP-CFSR data had the largest MAE values at all validation stations compared with the APHRODITE and trend surface data. For the latter two datasets, MAE for all stations was 2.50–4.0 mm, which was a small range compared with that of the NCEP-CFSR data.

thumbnail
Table 5. Evaluation results of different gridded precipitation datasets in the Dong River basin.

https://doi.org/10.1371/journal.pone.0112725.t005

There were three validation stations in the Puli River basin (Table 3). Results are presented in Table 6. The results of R for that basin show that APHRODITE data gave the best performance compared with the other two datasets. ME values of APHRODITE and trend surface data were positive, which means that the precipitation data were overestimated at the three stations. For the NCEP-CFSR data, ME values were negative at QT and YJ stations and positive at HX. MAE had the same tendency as the results for the Dong River basin. The APHRODITE and trend surface data delivered more accurate results relative to the NCEP-CFSR data.

thumbnail
Table 6. Evaluation results of different gridded precipitation datasets in the Puli River basin.

https://doi.org/10.1371/journal.pone.0112725.t006

Spatial distribution of various precipitation data.

The areal distributions of different precipitation datasets are shown in Figure 2 and 3. The annual precipitation distribution of in Dong River basin (Figure 2, left) indicate that the results of monitoring data and the NCEP-CFSR data present more variation compared with the APRHODITE data and the trend surface data. The precipitation value of Dong River basin calculated by the monitoring data is varied from 800 mm to 1600 mm. The value of NCEP-CFSR data indicates a more large variation, which is from 800 mm to 1800 mm. However, the results of APHRODITE data and the trend surface data show that the annual precipitation in Dong River basin is from 1000 mm to 1200 mm. In the Puli River basin, the monitoring precipitation data indicate that the annual precipitation amount is from 800 mm to 1200 mm in Puli River basin (Figure 2, right). As same as in the Dong River basin, the APHRODITE data and the trend surface data also perform similar results in Puli River basin. The range of annual precipitation of these two datasets is from 1100 mm to 1300 mm. Though the NCEP-CFSR also shows the annual precipitation is from 1100 mm to 1300 mm, the spatial distribution clearly indicate that it present contrary results compared with other three datasets. In the downstream of Puli River basin, the annual precipitation calculated from the NCEP-CFSR data is lower than other three datasets. However, in the upstream, the annual precipitation of NCEP-CFSR data present relative higher values compared with other datasets.

thumbnail
Figure 2. Spatial distribution of annual precipitation value for various precipitation datasets for the Dong River (up) and the Puli River (down) basins, from left to right is Monitoring data, NCEP CFSR data, APHRODITE and trend surface data.

https://doi.org/10.1371/journal.pone.0112725.g002

thumbnail
Figure 3. Annual precipitation value of various precipitation datasets for the Dong River (left) and the Puli River (right) basins.

https://doi.org/10.1371/journal.pone.0112725.g003

The annual precipitation value in the Dong River basin (Figure 3, left) shows that in the north and northwest portions, which have high elevations, the NCEP-CFSR data had positive differences compared with other data. This means that the input precipitation values for the SWAT model subbasin were much higher than other data. In contrast, in the downstream area of the Dong River basin, the NCEP-CFSR data had negative differences relative to the other datasets. For the APHRODITE and trend surface data, their differences with the monitoring data were greater in the upstream (high elevation) area, but smaller in the downstream area. In the Puli River basin (Figure 3, right), the monitoring and NCEP-CFSR data had sensible differences with the APHRODITE and trend surface data. The NCEP-CFSR data had positive differences compared with the APRHODITE and trend surface data; however, the monitoring data had negative differences.

Spatial distributions of the various datasets indicate that the input precipitation data varied greatly across the test basin. Such differences may lead directly to variations of simulation results and produce uncertainties in the SWAT model. However, because of the lack of observed precipitation data in each subbasin, we cannot directly evaluate the quality of the various datasets there. This situation is also inadequate for evaluating performances of the precipitation datasets based only on in situ validation and spatial distributions. Therefore, model calibration must be done.

Results of daily runoff simulation

Results of parameter sensitivity analysis are shown in Table 7. It is clear that in both the Dong and Puli River basins, the most sensitive parameters were ALPHA_BNK, CH_K2, CH_N2, and CN2. However, from the average rank of each basin, the order of sensitive parameters is different. This order is ALPHA_BNK, CH_K2, CH_N2 and CN2 in the Dong River Basin. For the Puli River Basin, the average rank value clearly shows that CN2 was more sensitive than the other three parameters. There were no differences in sensitive parameters for the various precipitation datasets, and the other parameters were not sensitive to the hydrologic simulation. In the Dong River basin, the monitoring precipitation data and the gridded data obtain very similar results, except the 4th rank of sensitive parameter. However, the sensitive rank in the Puli river basin show that the gridded datasets presents more consistent results compared with the monitoring data, especially for the CH_K2 and CH_N2, the ranks are different in monitoring data and gridded data. Moreover, compared with the two test basins, the gridded precipitation data can obtain similar sensitive parameter ranks rather than the monitoring data.

thumbnail
Table 7. Ranking of sensitivity of various parameters for the Dong River and the Puli River basins, based on auto-calibration of SWAT-CUP.

https://doi.org/10.1371/journal.pone.0112725.t007

Daily runoff simulation results for the Dong River basin and Puli River basin are shown in Figures 4 and 5, respectively. For the Dong River Basin, during the calibration period, R2 is 0.89 and NSE is 0.87 of the monitoring precipitation data. For NCEP-CFSR data, R2 is 0.37 and NSE is 0.37, which show a significant decrease relative to the monitoring data, and are lower than the threshold of behavioral simulation (NSE = 0.5). R2 and NSE are 0.60 and 0.49 for APHRODITE data, respectively. Although these values are higher than for the NCEP-CFSR data, they are under the threshold for behavioral simulation assessment. The trend-surface data show similar results to the APHRODITE data, with a slight decrease. NSE of the trend surface data exceeded the behavioral threshold value (NSE = 0.50; R2 = 0.61). During the validation period, the R2 and NSE values of monitoring data presents relatively good results (R2 = 0.82; NSE = 0.79), for the NCEP-CFSR data, the performances are improved (R2 = 0.61; NSE = 0.60). In the results of APHRODITE data, the R2 is 0.65 which is slightly improved compared with the calibration period; however, the NSE slightly decreases to 0.46. For the trend surface data, the performances (R2 = 0.72; NSE = 0.53) of trend surface data during the validation period are also improved compared with the calibration period. The hydrograph for the Dong River basin (Figure 6, up, and Figure 7) demonstrated that only the monitoring precipitation data could reproduce the hydrograph well during flood periods, compared with the various gridded datasets.

thumbnail
Figure 4. R2 and NSE of various precipitation datasets for the Dong River and the Puli River basins during the calibration period.

https://doi.org/10.1371/journal.pone.0112725.g004

thumbnail
Figure 5. R2 and NSE of various precipitation datasets for the Dong River during the validation period.

https://doi.org/10.1371/journal.pone.0112725.g005

thumbnail
Figure 6. Simulation hydrographs of various precipitation datasets for the Dong River (up) and the Puli River (down) basins during the calibration period.

NCEP stands for NCEP-CFSR data and T-S for trend surface data.

https://doi.org/10.1371/journal.pone.0112725.g006

thumbnail
Figure 7. Simulation hydrograph of various precipitation datasets for the Dong River basin during the validation period.

NCEP stands for NCEP-CFSR data and T-S for trend surface data.

https://doi.org/10.1371/journal.pone.0112725.g007

For the Puli River basin (Figure 4), R2 is 0.83 and NSE is 0.80 for APHRODITE data, and are respectively 0.84 and 0.82 for the trend surface data. For the monitoring data, the indices are R2 = 0.50 and NSE = 0.41. R2 and NSE were 0.29 and 0.23, respectively for the NCEP-CFSR data, both of which represent significant decrements relative to the other two datasets. The hydrograph (Figure 6, down) shows that the peak flow and sequence in the Puli River basin matched well for the APHRODITE and trend surface data, compared with the monitoring data and NCEP-CFSR data.

Model uncertainties induced by various precipitation data

The p-factor and r-factor results are shown in Figure 8. Trends of the factors are the same as R2 and NSE. In the Dong River basin, the monitoring precipitation data gave the best results. Because there was no simulation result in excess of the behavioral threshold, p-factor and r-factor values were set to zero for both NCEP-CFSR and APHRODITE data. The results of both factors for the trend surface data show that they had greater uncertainties compared with the observed data.

thumbnail
Figure 8. p-factor and r-factor of different precipitation datasets for the Dong River and the Puli River basins.

NCEP stands for NCEP-CFSR data and T-S for trend surface data.

https://doi.org/10.1371/journal.pone.0112725.g008

In the Puli River basin (Figure 8), the APHRODITE and trend surface data yielded more preferable results relative to the monitoring and NCEP-CFSR data. Because there was no simulation result in excess of the behavioral threshold, p-factor and r-factor values were set to zero for the NCEP-CFSR data. For the monitoring data, although they obtained behavioral simulations, the p-factor and r-factor values still demonstrate that they led to more uncertainties relative to the APHRODITE and trend surface data.

Discussion

Performances of various precipitation data upstream of TGR

In situ validation performances of the various precipitation datasets for the Dong and Puli basin show that APHRODITE and trend surface data can reflect the basic quantity of precipitation, as determined by R, ME, and MAE. The NCEP-CFSR data always gave larger errors compared with the other two datasets. Additionally, the APHRODITE data were more accurate than the trend surface data; however, the differences were not significant. As presented, the monitoring data were accurate but could only represent precipitation in or near the station. Consequently, to evaluate the performance of the different datasets, we should also rely on the model results.

The daily runoff simulation results of the SWAT model proved that the APHRODITE and trend surface data could reproduce the hydrologic processes in both the Dong and Puli basin more effectively than NCEP-CFSR. The model uncertainty results also suggest that the latter dataset has greater uncertainties compared with the other two. Amongst all the gridded datasets, the trend surface data always achieved the best results, as determined by both monitoring data and hydrologic model testing. The spatial distribution results already demonstrated that the NCEP-CFSR data gave significant differences compared with the other two datasets and the monitoring data. Section 4.2 revealed that the significant spatial distribution differences of the datasets can lead to variable simulation results. Further, the model uncertainty results indicate that the spatial distribution of NCEP-CFSR data may directly cause a relatively weak performance and greater uncertainty in the SWAT model.

Compared with sparse and isolated observation stations, the gridded data have an advantage in considering effects of station surroundings for prediction at a fixed location [1], [24]. Thus, the consideration of impacts from surrounding stations in APHRODITE and trend surface data products may make them more valid and permit more robust data input to SWAT model runoff determination at TGR, compared with the NCEP-CFSR data. The data source for the latter dataset is global public observation stations [1]. Thus, compared with the APHRODITE dataset that covers Asia with over 40,000 stations [3], [24], the NCEP-CFSR data are too sparse to achieve accurate reanalysis results in our study area. Li et al. [5] explained that the trend surface data are also based on public global exchange stations, but the present simulation results suggest that the trend surface interpolation method and its products may be more applicable to our study area.

Nevertheless, we recognize that the performances of the various gridded precipitation data were weak relative to the monitoring data for the Dong River basin, where there are large elevation differences. As mentioned above, in micro-scale or meso-scale basins, precipitation can be influenced by various factors such as topography, wind speed, and hill aspects. Given this, the gridded precipitation data may lose the capacity to reflect the precipitation distribution accurately. Thus, we also addressed the applicability of various precipitation datasets by considering topographic influences and data density.

Applicability of different precipitation data: topography

If the topography strongly affects precipitation, strong linear relationships can be obtained by analyzing elevation and precipitation differences among stations [33], [34]. The “precipitation lapse rate” is used to evaluate this relationship. We used this rate to assess the precipitation influence in each basin, using(8)

Here, is the precipitation difference between a pair of stations (the station pair is determined by permutations of all available stations used in the SWAT model), is the elevation difference between the pair, is the first order or slope of the linear regression equation, and is the intercept of the linear equation. For both Dong River basin and Puli River basin, the linear regression analysis is conducted with the F-test in significant level α = 0.05 (Table 8). The results are presented in Figures 9 and 10.

thumbnail
Figure 9. Correlation between elevation difference and precipitation difference in the Dong River basin (significant level α = 0.05).

a) Monitoring data; b) NCEP-CFSR data; c) APHRODITE data; d) trend surface data.

https://doi.org/10.1371/journal.pone.0112725.g009

thumbnail
Figure 10. Correlation between elevation difference and precipitation difference in the Puli River basin (significant level α = 0.05).

a) Monitorin data; b) NCEP-CFSR data (only one point available, so linear regression analysis is missing); c) APHRODITE data; d) trend surface data.

https://doi.org/10.1371/journal.pone.0112725.g010

thumbnail
Table 8. F-test of linear regression for elevation difference and preciptiaiton difference of all datasets in Dong River basin and Puli River basin.

https://doi.org/10.1371/journal.pone.0112725.t008

In the Dong River basin, R2 of the linear regression equation for the monitoring data (Table 8, Figure 9a) was nearly 1, clearly indicating a strong correlation between elevation and precipitation differences. Conversely, judging by the F-test, there is no correlated relationship between elevation and precipitation for APHRDOITE and NCEP-CFSR data. Though the F-test value is larger than its value in the significant level (Table 8), the R2 of trend surface data indicate the linear regression relationships between elevation and precipitation is weak results in Dong River basin. (Figure 9 b), c) and d)). In the Puli River basin, the linear regression result at the significance level 0.05 ((Table 8, Figure 10) shows there is no significant linear relationship between elevation and precipitation data for all the precipitation datasets in this basin.

Moreover, basic geo-information of the two test basins (Table 1) confirms their different elevations. In the Dong River basin, the minimum elevation is 192 m and maximum 2569 m; its standard error is 560 m and the slope is 23.2 degrees. In the Puli River basin, the minimum elevation is 267 m and maximum 1326 m, with standard error 213 m and slope 12.1 degrees. Obviously, the topography has greater variation in the Dong River basin than in the Puli River basin.

Considering the simulation results of Section 4.2 in the Dong River basin, the results of simulation and uncertainty clearly demonstrate that the monitoring data gave results superior to those of the gridded data. Significant differences of precipitation lapse rate of monitoring precipitation data strongly suggest that for Dong River basin, topography is the main influence on the spatial distribution of precipitation data, which thereby affects SWAT model simulation results. However, in the Puli River basin, all data indicate that topography is not the main influence on precipitation variation. Uhlenbrook et al. [35] indicated that the topographic influence on the rainfall will also significantly influence the results of hydrological model. Weisse and Bois [36] also demonstrated that topographic characterization has significant influence on the rainfall, especially the heavy rainfall in the French Alps. Sanchez-Moreno et al. [37] also presented the topography can significantly influence the trend of monthly and seasonal precipitation distribution in Santiago Island, Cape Verde. In our study, the results of Dong River basin also clearly demonstrated that the gridded precipitation data cannot reflect the rainfall distribution which is greatly influenced by the topography. Contrarily, in Puli River basin, of which the precipitation receives less influence form the topographic, the simulation results indicate that the gridded data can reflect the value and distribution of precipitation more effectively compared with the sparse monitoring data.

As presented in the section 2.2, except the trend surface method, the NCEP-CFSR data and the APRHODITE data are all employed the optimum interpolation method for correction of Topographic. However, our results indicate that the effects of the correction datasets on local regions still cannot match the applicable of SWAT model. Moreover, Silva et al. [38] indicate that the NCEP CFSR data may overestimate the precipitation in the coastal mountain area in Brazil. Andermann et al. [39] shown that for APHRODITE data also it can present relatively good performances in yearly and monthly steps in mountain areas of Himalaya front, and its performances in daily step is still need to be improved. Ono et al. [40] also reported that for the heavy rainfall investigation in Mekong River, the local topography is still need to be considered as a correction factor for improving the accurate of APHRODITE data. The results of our research indicate that for their applications in local regions, the applicability, especially, topographic effects should be considered and discussed firstly.

Applicability of various precipitation data: spatial density

In addition to topography, input precipitation data density is an influence, and alters the hydrologic model results. Basic information on station density is given in Figure 1 and Table 4. In Dong River Basin, the NCEP-CFSR and APHRODITE data have the same spatial density as the monitoring data. The trend surface data, from more than 30 stations for the simulation, gave a poor result compared with monitoring data. In the Puli River Basin, those data were from 12 stations and the APHRODITE data only three stations, but the performance of these data showed no significant difference. Additionally, station density was similar for monitoring (three stations) and NCEP-CFSR (two stations) data. Compared with the APHRODITE data, however, those two datasets had less capacity to portray the hydrologic processes in the Puli River Basin.

The gridded data usually contain precipitation data of high density for simulation, relative to large basins. However, the performances of the hydrologic simulations in this study strongly suggest the spatial density has no major influence on model simulations of this study area. Data accuracy and representativeness is more important. Additionally, these gave auxiliary proof that topography is the dominant influence in the study basin.

Conclusions

The objective of the present study was to evaluate the capabilities of various gridded precipitation datasets (NCEP-CFSR, APHRODITE, and trend surface) for the area upstream of TGR, using both monitoring data and a hydrologic model.

Two test basins (Dong and Puli) were selected for data evaluation. Comparisons between the gridded and monitoring precipitation data suggest that APHRODITE data can explain the precipitation more accurately than the other two datasets at observation stations. For model testing in the Dong River basin where there is significant elevation variation, the simulation results suggest that all gridded precipitation data had reduced abilities to drive SWAT for achieving reasonable results. However, in the Puli River basin with relatively flat topography, the APHRODITE and trend surface data can give more accurate results than the NCEP-CFSR data, and even better than the monitoring data.

The results also prove that the APHRODITE and trend surface data are more stable and applicable than the NCEP-CFSR data in the study area. The results also suggest that in the application of the hydrologic or other models to the TGR using the gridded precipitation datasets, one must consider topographic effects on those data.

Supporting Information

File S1.

Extracted precipitation data of NCEP-CFSR data in the test basins for SWAT model.

https://doi.org/10.1371/journal.pone.0112725.s001

(ZIP)

File S2.

Extracted precipitation data from APHRODITE data in the test basins for SWAT model.

https://doi.org/10.1371/journal.pone.0112725.s002

(ZIP)

File S3.

Extracted precipitation data from trend surface data in the test basins for SWAT model.

https://doi.org/10.1371/journal.pone.0112725.s003

(ZIP)

Author Contributions

Conceived and designed the experiments: GW JY ZX. Performed the experiments: YY GW LW. Analyzed the data: YY GW LW JY. Contributed reagents/materials/analysis tools: YY GW ZX. Wrote the paper: YY GW.

References

  1. 1. Saha S, Moorthi S, Pan HL, Wu X, Wang J, et al. (2010) The NCEP climate forecast system reanalysis. Bull Am Meteorol Soc 91: 1015–1057.
  2. 2. Dee DP, Uppala SM, Simmons AJ, Berrisford P, Poli P, et al. (2011) The ERA-Interim reanalysis: Configuration and performance of the data assimilation system. Q J R Meteorol Soc 137: 553–597.
  3. 3. Yatagai A, Kamiguchi K, Arakawa O, Hamada A, Yasutomi N, et al. (2012) APHRODITE: Constructing a long-term daily gridded precipitation dataset for Asia based on a dense network of rain gauges. Bull Am Meteorol Soc 93: 1401–1415.
  4. 4. Chappell A, Renzullo LJ, Raupach TH, Haylock M (2013) Evaluating geostatistical methods of blending satellite and gauge data to estimate near real-time daily rainfall for Australia. J Hydrol 493: 105–114.
  5. 5. Li T, Zheng X, Dai Y, Yang C, Chen Z, et al.. (2014). Mapping Near-surface Air Temperature, Pressure, Relative Humidity and Wind Speed over Mainland China with High Spatiotemporal Resolution. Advancesin Atmos Sci 73 : In Press.
  6. 6. Huang C, Zheng X, Tait A, Dai Y, Yang C, et al. (2014) On using smoothing spline and residual correction to fuse rain gauge observations and remote sensing data. J Hydrol 508: 410–417.
  7. 7. Betts AK, Ball JH, Viterbo P (2003) Evaluation of the ERA-40 surface water budget and surface temperature for the Mackenzie River basin. J Hydrometeorol 4: 1194–1211.
  8. 8. Panday PK, Williams CA, FreyKE, Brown ME (2014) Application and evaluation of a snowmelt runoff model in the Tamor River basin, Eastern Himalaya using a Markov Chain Monte Carlo (MCMC) data assimilation approach. Hydrol Process 28: 5337–5353.
  9. 9. Tong K, Su F, Yang D, Zhang L, Hao Z (2014) Tibetan Plateau precipitation as depicted by gauge observations, reanalyses and satellite retrievals. Int J Climatol 34: 265–285.
  10. 10. Wang A, Zeng X (2012) Evaluation of multireanalysis products with in situ observations over the Tibetan Plateau. J Geophys Res Atmos (1984–2012) 117 D5.
  11. 11. Bao X, Zhang F (2013) Evaluation of NCEP–CFSR, NCEP–NCAR, ERA-Interim, and ERA-40 reanalysis datasets against independent sounding observations over the Tibetan Plateau. J Clim 26: 206–214.
  12. 12. Fuka DR, Walter MT, MacAlister C, Degaetano AT, Steenhuis TS, et al. (2013) Using the Climate Forecast System Reanalysis as weather input data for watershed models. Hydrol Process 28: 5613–5623.
  13. 13. Vu MT, Raghavan SV, Liong SY (2012) SWAT use of gridded observations for simulating runoff–a Vietnam river basin study. Hydrol Earth Syst Sci 16: 2801–2811.
  14. 14. Duethmann D, Zimmer J, Gafurov A, Güntner A, Merz B, et al. (2012) Evaluation of areal precipitation estimates based on downscaled reanalysis and station data by hydrological modelling. Hydrol Earth Syst Sci Disc 9: 10719–10773.
  15. 15. Bosilovich MG, Chen J, Robertson FR, Adler RF (2008) Evaluation of global precipitation in reanalysis. J Appl Meteorol Climatol 47: 2279–2299.
  16. 16. Daly C, Neilson RP, Phillips DL (1994) A statistical-topographic model for mapping climatological precipitation over mountainous terrain. J Appl Meteorol 33: 140–158.
  17. 17. Widmann M, Bretherton CS, Salathé Jr EP (2003) Statistical precipitation downscaling over the Northwestern United States Using numerically simulated precipitation as a predictor. J Clim 16: 799–816.
  18. 18. Shen ZY, Gong YW, Li YH, Hong Q, Xu L, et al. (2009) A comparison of WEPP and SWAT for modeling soil erosion of the Zhangjiachong Watershed in the Three Gorges Reservoir Area. Agric Water Manage 96: 1435–1442.
  19. 19. Shen ZY, Chen L, Chen T (2012) Analysis of parameter uncertainty in hydrological and sediment modeling using GLUE method: a case study of SWAT model applied to Three Gorges Reservoir Region, China. Hydrol Earth Syst Sci 16: 121–132.
  20. 20. Shi ZH, Ai L, Fang NF, Zhu HD (2012) Modeling the impacts of integrated small watershed management on soil erosion and sediment delivery: A case study in the Three Gorges Area, China. J Hydrol 438: 156–167.
  21. 21. Wang G, Yang H, Wang L, Xu Z, Xue B (2014) Using the SWAT model to assess impacts of land use changes on runoff generation in headwaters. Hydrol Process 28: 1032–1042.
  22. 22. Yang H, Wang G, Yang Y, Xue B, Wu B (2014) Assessment of the Impacts of Land Use Changes on Nonpoint Source Pollution Inputs Upstream of the Three Gorges Reservoir. ScientificWorldJournal 2014.
  23. 23. Xie P, Chen M, Yang S, Yatagai A, Hayasaka T, et al. (2007) A gauge-based analysis of daily precipitation over East Asia. J Hydrometeorol 8: 607–626.
  24. 24. Arnold JG, Srinivasan R, Muttiah RS, Williams JR (1998) Large area hydrologic modeling and assessment part I: Model development. J Am Water Resour Assoc 34: 73–89.
  25. 25. Arnold JG, Fohrer N (2005) SWAT2000: current capabilities and research opportunities in applied watershed modelling. Hydrol Process 19: 563–572.
  26. 26. Douglas-Mankin KR, Srinivasan R, Arnold JG (2010) Soil and Water Assessment Tool (SWAT) model: Current developments and applications. Trans ASABE 53: 1423–1431.
  27. 27. Abbaspour KC, Yang J, Maximov I, Siber R, Bogner K, et al. (2007) Modelling hydrology and water quality in the pre-alpine/alpine Thur watershed using SWAT. J Hydrol 333: 413–430.
  28. 28. Yang J, Reichert P, Abbaspour KC, Xia J, Yang H (2008) Comparing uncertainty analysis techniques for a SWAT application to the Chaohe Basin in China. J Hydrol 358: 1–23.
  29. 29. McKay MD, Beckman RJ, Conover WJ (1979) Comparison of three methods for selecting values of input variables in the analysis of output from a computer code. Technometrics 21: 239–245.
  30. 30. Iman RL, Helton JC (1988) An investigation of uncertainty and sensitivity analysis techniques for computer models. Risk Anal 8: 71–90.
  31. 31. Van Griensven A, Meixner T, Grunwald S, Bishop T, Diluzio M, et al. (2006) A global sensitivity analysis tool for the parameters of multi-variable catchment models. J Hydrol 324: 10–23.
  32. 32. Nash J, Sutcliffe JV (1970) River flow forecasting through conceptual models part I—A discussion of principles. J Hydrol 10: 282–290.
  33. 33. Fontaine TA, Cruickshank TS, Arnold JG, Hotchkiss RH (2002) Development of a snowfall–snowmelt routine for mountainous terrain for the soil water assessment tool (SWAT). J Hydrol 262: 209–223.
  34. 34. Hong Y, Nix HA, Hutchinson MF, Booth TH (2005) Spatial interpolation of monthly mean climate data for China. Int J Climatol 25: 1369–1379.
  35. 35. Uhlenbrook S, Roser S, Tilch N (2004) Hydrological process representation at the meso–scale: the potential of a distributed, conceptual catchment model. J Hydrol 291: 278–296.
  36. 36. Sanchez-Moreno JF, Mannaerts CM, Jetten V (2014) Influence of topography on rainfall variability in Santiago Island, Cape Verde. Int J Climatol 34: 1081–1097.
  37. 37. Weisse AK, Bois P (2001) Topographic effects on statistical characteristics of heavy rainfall and mapping in the French Alps. J Appl Meteorol 40: 720–740.
  38. 38. Silva VB, Kousky VE, Higgins RW (2011) Daily precipitation statistics for South America: An intercomparison between NCEP reanalyses and observations. J Hydrometeorol 12: 101–117.
  39. 39. Andermann C, Bonnet S, Gloaguen R (2011) Evaluation of precipitation data sets along the Himalayan front. Geochem Geophys Geosyst 12: Q07023.
  40. 40. Ono K, Kazama S, Gunawardhana LN, Kuraji K (2013) An investigation of extreme daily rainfall in the Mekong River Basin using a gridded precipitation dataset. Hydrol Res Lett 7: 66–72.