Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Actual evapotranspiration by machine learning and remote sensing without the thermal spectrum

  • Taiara Souza Costa,

    Roles Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Validation, Visualization, Writing – original draft, Writing – review & editing

    Affiliation Department of Agricultural Engineering, Center of Agricultural Sciences, Federal University of Viçosa, Viçosa-MG, Brazil

  • Roberto Filgueiras,

    Roles Conceptualization, Formal analysis, Methodology, Software, Supervision, Visualization, Writing – original draft

    Affiliation Department of Agricultural Engineering, Center of Agricultural Sciences, Federal University of Viçosa, Viçosa-MG, Brazil

  • Robson Argolo dos Santos,

    Roles Data curation, Formal analysis, Software, Validation, Writing – review & editing

    Affiliation Department of Agricultural Engineering, Center of Agricultural Sciences, Federal University of Viçosa, Viçosa-MG, Brazil

  • Fernando França da Cunha

    Roles Conceptualization, Funding acquisition, Methodology, Project administration, Supervision, Visualization, Writing – original draft, Writing – review & editing

    fernando.cunha@ufv.br

    Affiliation Department of Agricultural Engineering, Center of Agricultural Sciences, Federal University of Viçosa, Viçosa-MG, Brazil

Abstract

The objectives of this study were to use machine learning algorithms to establish a model for estimating the evapotranspiration fraction (ETf) using two data input scenarios from the spectral information of the Sentinel-2 constellation, and to analyze the temporal and spatial applicability of the models to estimate the actual evapotranspiration (ETr) in agricultural crops irrigated by center pivots. The spectral bands of Sentinel 2A and 2B satellite and vegetation indices formed the first scenario. The second scenario was formed by performing the normalized ratio procedure between bands (NRPB) and joining the variables applied in the first scenario. The models were generated to predict the ETf using six regression algorithms and then compared with ETf calculated by the Simple Algorithm For Evapotranspiration Retrieving (SAFER) algorithm, was considered as the standard. The results possible to select the best model, which in both scenarios was Cubist. Subsequently, ETf was estimated only for the center pivots present in the study area and the classification of land use and cover was accessed through the MapBiomas product. Land use was necessary to enable the calculation of ETr in each scenario, in the center pivots with sugarcane and soybean crops. ETr was estimated using two ETo approaches (EToBrazil and Hargreaves-Samani). It was found that the Hargreaves-Samani equation overestimated ETr with higher errors mainly for center pivots with sugarcane, where systematic error (MBE) ranged from 0.89 to 2.02 mm d-1. The EToBrazil product, on the other hand, presented statistical errors with MBE values ranging from 0.00 to 1.26 mm d-1 for both agricultural crops. Based on the results obtained, it is observed that the ETr can be monitored spatially and temporally without the use of the thermal band, which causes the estimation of this parameter to be performed with greater temporal frequency.

1. Introduction

With the advancement of technologies and with the aid of agrometeorological models aimed at the determination of evapotranspiration (ET), it becomes possible to estimate, spatially and accurately, the water demand of agricultural crops [1]. Such information is essential in irrigated agriculture. Thus, ET is a key parameter in the water balance and, in this context, the use of data from remote sensing stands out [24]. These estimates are generally based on the components of the energy balance equation and have shown a high degree of reliability and speed in ET estimation [5].

Among the agrometeorological models, it is possible to mention the Simple Algorithm For Evapotranspiration Retrieving (SAFER) presented by Teixeira [6]. SAFER is an energy balance algorithm developed based on the modeling of evapotranspiration fraction (ETf), of easy application and operability, calibrated in the field [7, 8], and has good results in studies with emphasis on irrigation management [4]. For Roerink et al. [9], ETf is the ratio between the energy released in the form of latent heat to the sum of latent heat and sensible heat.

To estimate ETf through the SAFER algorithm, images from the Landsat-8 satellite have been used [6]. This is possible because, the Landsat-8 satellite has an electromagnetic spectrum of the thermal infrared region. This electromagnetic spectrum makes it possible to estimate the surface temperature, being considered one of the main parameters directly responsible for determining actual evapotranspiration (ETr) [1012]. However, the temporal resolution of Landsat-8 (16 days) becomes an obstacle when the goal is to carry out irrigation management.

Given the above, the need to study other satellites with better temporal resolution is clear. However, there are no free satellites available that combine high spatial and temporal resolution, in addition to the ability to capture electromagnetic radiation emitted in the thermal infrared region, which could make irrigation management accessible by satellite images. On the other hand, Sentinel 2A and Sentinel 2B satellites have good spatial and temporal resolution, although they do not have the ability to capture electromagnetic radiation emitted in the thermal infrared region [13].

Considering this scenario, it is clear that there is a significant challenge for ET modeling with frequent images without the thermal infrared spectrum, which would help in the decisions related to management of water resources at the agricultural property level [14]. Therefore, it is desirable to develop techniques that make it possible to fill this gap by providing detailed and frequent information on ETr.

As an alternative and promising tool, there are machine learning algorithms that have increasingly been applied in ET modeling [15, 16]. These tools, in general, surpass the conventional equations for estimating ETr and/or reference evapotranspiration (ETo) [1719] and provide information that allows us to visualize how ET behaves in space and time [20, 21].

These machine learning algorithms are robust mathematical models with high predictive potential [22], and in addition, are able to capture complex relationships between data input and output, with interesting accuracy in the predicted values [3, 19]. It is noteworthy that machine learning algorithms are sensitive to input covariates. Therefore, it is important to use models capable of predicting ET with high performance, among them: Multiple Linear Regression (MLR), Linear Support Vector Machine (Linear SVM), Cubist, Bayesian-Regularized Neural Network (BRNN) and eXtreme Gradient Boosting (Xgb) by the Linear and Tree methods. More information on these methods can be found in the subtopic “Regression algorithms used in predicting ETf” in the material and methods item of this paper.

Although many studies have used algorithms to estimate ET, only Santos et al. [23] evaluated the impact of using machine learning algorithms in estimating ETf using only surface reflectance data. These authors used information from the OLI and TIR sensors (Landsat 8) to calculate the ETf in areas cultivated with sugarcane using the METRIC algorithm. The spectral bands of the MSI sensor, from the Sentinel-2 satellites, gave rise to the independent variables for input in the machine learning algorithms. With this, they concluded that algorithms were able to identify patterns in the MSI sensor data to predict the ETf when comparing with the standard model, with the XgbLinear model showing the best results. The study by Santos et al. [23] demonstrates that machine learning equations have the potential to predict ET in a simpler way, as it does not need the principles of mass and energy estimated by fully physical models and also do not need the energy balance calculated by the thermal band. For El-Hendawy et al. [24], spectral information has been used with great success to detect even the small biophysical and biochemical variations and modifications of the plant canopy.

Based on this need for further studies and on the availability of the Sentinel-2 constellation, this study aimed to estimate ETf through regression algorithms and spectral information from Sentinel 2A and Sentinel 2B satellites and to analyze the temporal and spatial applicability of ETf obtained to estimate the ETr in agricultural crops irrigated by center pivots.

2. Material and methods

2.1. Study area

The study was carried out in an area belonging to the hydrographic region of the Tocantins-Araguaia river (Fig 1), being located in the municipality of Itapaci, state of Goiás, delimited by the pairs of coordinates X1: 660185.23; Y1: 8356189.30 (upper left corner) and X2: 693932.13; Y2: 8332533.58 (lower right corner), DATUM WGS84 reference system—UTM plane coordinate system—Zone 22S.

thumbnail
Fig 1. Geographic location referring to the area used to model the evapotranspiration fraction (ETf).

The figure was elaborated using the open source software QGIS [25]—using an RGB image composite generated from the Landsat 8 satellite images that were downloaded from https://earthexplorer.usgs.gov/. Landsat-8 image courtesy of the U.S. Geological Survey according to [26].

https://doi.org/10.1371/journal.pone.0285535.g001

The study area, including exposed soil, has the following land uses: native forest, pasture, rivers, irrigated agriculture and rainfed. The area under study was selected due to the high density of center pivots. Therefore, to ensure water security, irrigation management is necessary. On the other hand, irrigation management also contributes to food security, since water is fundamental to obtain agricultural products in greater quantity and quality [27, 28].

Fig 2 shows the flowchart with the steps taken for the modeling of the target variable, which are later explained in the topics.

thumbnail
Fig 2. Methodology used to predict the evapotranspiration fraction (ETf).

https://doi.org/10.1371/journal.pone.0285535.g002

2.2. Regression algorithms used in predicting ETf

In an attempt to model ETf, six regression algorithms were used: Multiple Linear Regression (MLR), Linear Support Vector Machine (Linear SVM), Cubist, Bayesian-Regularized Neural Network (BRNN) and eXtreme Gradient Boosting (Xgb) by the Linear and Tree methods. All algorithms were implemented using the Caret package [29], in R software [30].

These models have a multivariate regression approach, in addition to a strong statistical basis, and have received increasing research interest in recent years [31]. However, the performance of each model may vary according to the characteristic of the variable to be modeled, which justifies the importance of its choices [31]. Regression algorithms are mathematical models that seek to find an equation that quantifies the relationship between a dependent variable (Y) and one or more independent variables (X1,…, Xp). That is, its analysis is a prediction of the values of one or more response variables to which a set of explanatory variables is used [32, 33].

The regression algorithms used in the study will be described in a simplified way. However, more information about the models used can be obtained in [31]. The Multiple Linear Regression (MLR) equation with p explanatory variables is represented by Eq 1.

(1)

Where yi represents the value of the response variable in the observation i, i = 1,…,n; xi1, xi2,…xip, i = 1,…,n are the values of the i-ésima observation of the p explanatory variables; B0, B1, B2,…,Bp are the regression parameters or coefficients; εi, i = 1,…,n correspond to random errors.

The Linear Support Vector Machine (Linear SVM) is obtained through Eq 2 and seeks to maximize the generalization ability and minimize structural risk. In addition, this model is not sensitive to outliers, that is, extreme values do not cause noise in the training of the data. The basic functioning of the SVM consists of adjusting the equation of a line, called a hyperplane, in such a way that the distance between it and points with different characteristics is maximized.

(2)

Where is a vector perpendicular to the points; is the vector of the set of points; b is an optional constant that can be used as a BIAS. When the result of this equation is equal to 1 or -1, it is one of the support vectors, with the result greater than 0 and less than 1 or less than 0 and greater than -1, it is a margin space.

The Cubist builds the model based on a set of unordered rules, where each rule is formed from conditions associated with a linear expression. The rules generated by Cubist have the following structure: (3)

Where R represents the number of rules; Ncob is the number of examples covered by the rule; Vm is the average (predicted) meta-attribute value of the covered examples; I = [Vmin, Vmax] is a range of predicted values (minimum and maximum value) for the attribute-meta class; Eest is the expected error estimate; complex are the rule conditions R and C is the linear model (mathematical function) for the target attribute.

Bayesian-Regularized Neural Networks (BRNN) are trained using supervised learning, with a training dataset of inputs and destinations D = {(x1, y1), (x2, y2), … (xN, yN)}. In the Bayesian structure, the weights of the network are considered random variables [34]. With the data set, the probability density function of a matrix w of network weights is: (4)

Where M is the particular neural network model used; f(w|,∝,M) is the prior density, which represents our knowledge of the weights before any data is collected; f(D|,w,β,M) is the likelihood function, which is the probability of the data occurring given the weights; f(D|∝,β,M) is a normalization factor, which guarantees that the total probability is 1.

Xgb is a new simple tree-based ensemble method that was developed by Chen et al. [35], which is an improved version of gradient boosting with higher computational efficiency and better ability to deal with overfitting problems [35]. Xgb is formed by a set of classification and regression trees K (CART) {T1 (x1, y1) … TK (xi, yi) where xi is the provided training set of descriptors associated with a molecule to predict the yi class label. Given that a CART assigns an actual score to each sheet (outcome or target), the prediction scores for individual CARTs are summed to obtain the final score and evaluated using K additive functions as shown in Eq 5.

(5)

Where fk represents an independent tree structure with leaf scores and F is the space of all CART.

2.3. Target variable: evapotranspiration fraction (ETf)

To model the evapotranspiration fraction (ETf) from regression models and readily available information, images of Sentinel 2A and 2B satellites were used as input variables in the models. However, the information from Landsat-8 was used as a reference for the calculation of ETf by Simple Algorithm For Evapotranspiration Retrieving [6], here called ETfSAFER.

The satellites used in the research were the target of study due to the potential in terms of spatial and temporal resolution, in addition to the existence of the band referring to the thermal spectrum present only in Landsat-8. Satellite images were acquired on the Earth Explorer platform (http://earthexplorer.usgs.gov) belonging to the United States Geological Survey [36]. After the download, a visual analysis was carried out to select images free of clouds and noises with a view to not compromising the analysis of the models. After this analysis, six pairs of images were selected (Landsat-8 and Sentinel-2) with the same dates, and they were recorded on 02/03/2016, 03/09/2017, 08/16/2017 and 01/23/2018, 08/06/2019 and 04/02/2020.

Then, a pre-processing was performed in order to leave the images ready to be used in the estimation of ETf. In this step, operational procedures were performed in QGis® software [25]. The Semi-Automatic Classification Plugin [37], was used for the conversion of digital numbers to physical values (Landsat-8 and Sentinel images) and for atmospheric correction (Landsat-8 and Sentinel images). The Dark Object Subtraction (DOS1) atmospheric correction methodology [38], was adopted to contain the additive effects caused by the dispersion of particles present in the atmosphere. Then, the coordinate system was reprojected to UTM, zone 22 South.

2.4. Methodologies: Resources adopted to model the target variable

For training regression algorithms, the four first images (02/03/2016, 03/09/2017, 08/16/2017 and 01/23/2018) of the six selected from the Sentinel-2 (A or B) and Landsat-8 satellites were used. ETfSAFER was used as dependent variable, calculated with Landsat-8 information from the reflectances of the bands (ρ2, ρ3, ρ4, ρ5, ρ6 and ρ7) and radiance ρ10 required to estimate biophysical parameters presented in the methodology of Teixeira [6]. On the website (https://eos.com/blog/band-combinations-for-landsat-8/) it is possible to access detailed information about each spectral band of the Landsat 8 satellite.

The independent variables were the reflectances of the multispectral bands from the Sentinel-2 constellation (ρ2, ρ3, ρ4, ρ8, ρ5, ρ6, ρ7, ρ8A, ρ11 and ρ12) and three different vegetation indices. The indices used were NDVI—normalized difference vegetation index [39], EVI—enhanced vegetation index [40], and NDWI—normalized difference water index [41], which were calculated according to Eqs 6, 7 and 8, respectively. NDVI and EVI are associated with vegetation growth stage, while NDWI is a moisture index.

(6)(7)(8)

Where ρ8, ρ4, ρ12, ρ2 are the reflectances of the near infrared (band 8), red (band 4), short wave infrared (band 12) and blue (band 2) bands of the sensor Multi Spectral Instrument (MSI); G is the adjusted gain factor; L is the soil adjustment factor; and C1 and C2 are coefficients for aerosol correction. For the coefficients G, C1, C2 and L, the values of 2.5, 6.0, 7.5 and 1.0 were assumed, respectively [40].

Sentinel-2 multispectral bands gave rise to other covariates, and, through the band_ratio Function of the Labgeo package [42], in R software [30], all the possibilities of ratio between the bands were calculated. Filgueiras [13] called this procedure a normalized ratio between bands (NRPB) (Eq 9). Through this procedure, 45 new covariates were originated, totaling 58 independent variables.

(9)

Where ρn are the surface reflectances relative to the bands of the MSI/Sentinel 2A and 2B sensor.

After the 58 covariates were obtained, predictors with high collinearity were excluded using a Pearson correlation limit of 0.98. Subsequently, the data set was filtered again to eliminate predictors that had low importance for the models using the Recursive Feature Elimination (RFE) function of the Caret package [29], in R software [30].

The variables under study have different spatial resolution, so it was necessary to convert all independent variables to 30 m. For this, the resample function of the Raster package [43] was applied in R software [30].

The data base of variables, for each pair of images, was obtained from random collection at georeferenced points. This procedure was performed using the sampleRandom function of the Raster package, present in R software [30]. To evaluate the performance of the models, the data set was separated into training and validation. Thus, the data sets were randomly divided into the ratio of 75% (110.442 pairs of points) for training with k-fold cross-validation (k = 10) and 25% (36.814 pairs of points) for the validation of ETf estimate. Althoff et al. [44] state that the 75/25% ratio is a good ratio for separate training and validation sets, respectively.

2.5. Study of two scenarios of input of independent variables to predict ETf considering the total area

The spectral bands of Sentinel 2A and 2B satellite and vegetation indices comprised a total of 11 variables and formed the first scenario (Table 1). In the second scenario, variables derived only from the spectral bands of Sentinel 2A and 2B satellite were combined through the NRPB. After performing the NRPB, the variables applied in the first scenario were joined, and subjected to the correlation analysis of 0.98 and the RFE Function, which made it possible to obtain 29 variables (Table 1).

thumbnail
Table 1. Combinations of input variables for each scenario.

https://doi.org/10.1371/journal.pone.0285535.t001

It is important to emphasize that both scenarios are formed by the same dates and area, delimited by the pairs of coordinates X1: 660185.23; Y1: 8356189.30 (upper left corner) and X2: 693932.13; Y2: 8332533.58 (lower right corner), DATUM WGS84 reference system—UTM plane coordinate system—Zone 22S. That is, the scenarios are different only as a function of the independent variables.

The variables of the second scenario were created with the expectation that they could increase the explanatory power of the machine learning methods. These methods have good ability to find patterns in data sets with significant amount of parameters, quality and quantity of input data [3, 45, 46].

2.6. Performance comparison criteria

The ETf values obtained by the models were considered as the predicted values, while the values from SAFER (ETfSAFER) were considered as observed data. Thus, the predicted values were compared with observed values using the following statistical metrics: root mean square error (RMSE, Eq 10), mean absolute error (MAE, Eq 11), systematic error (MBE, Eq 12) and coefficient of determination (r2, Eq 13).

(10)(11)(12)(13)

Where Pi is the value predicted by the model; P is the average value predicted by the model; Oi is the observed value; is the average observed value; n is the number of data pairs.

2.7. Case study

To evaluate whether Sentinel-2 is effective for obtaining the evapotranspiration fraction (ETf), the best model, in each scenario, was selected based on the statistical metrics obtained in the validation process. These models were applied to estimate ETf in sugarcane and soybean crops irrigated by center pivots present in the delimitation of the study area on 08/06/2019 which corresponds to the 218 days of the year (DOY) and 04/02/2020 (DOY093). The land use of the center pivots was determined by using images from the MapBiomas product (Collection 5) [47], relative to the year 2019, available at https://mapbiomas.org.

Normalized Difference Vegetation Index (NDVI) values at different dates for cultivated areas are provided in S1 File. In the worksheet, the maximum, average and minimum values of NDVI of the areas vegetated with soybean and sugarcane are informed. These crops are irrigated by various central pivot equipment and the coordinates of the center of the area are informed. With these NDVI values it is possible to have an idea of the vegetative stages of agricultural crops.

In the second moment of this research, the ETr calculated from the reference evapotranspiration (ETo) by the Penman-Monteith FAO 56 method (Eq 14) was used. This method is considered the standard for ETo calculation [48, 49]. Thus, ETr was used in this research to obtain the ETrSAFER when multiplied by the ETfSAFER to be compared with the ETr estimated with the ETf from the selected models. The empirical model of Hargreaves-Samani (HS) [50], (Eq 15) and the EToBrazil product [44], were used in each approach from the multiplication of ETfPredicted, hence forming the products ETrHS and ETrEToBrazil, respectively.

(14)(15)

Where ETo is reference evapotranspiration (mm d-1); Δ slope of the vapor pressure curve versus temperature (kPa °C-1); RG net radiation at the crop surface (MJ m-2 d-1); G is the soil heat flux density (MJ m-2 d-1); γ is the psychrometric constant (0.0677 kPa °C-1); Tmax is the maximum air temperature (°C-1); Tmin is the minimum air temperature (°C-1); Tmean is the mean air temperature (°C-1); U2 is the average wind speed at 2-m height (m s-1); es is the vapor saturation pressure (kPa); ea is the actual vapor pressure (kPa); ∝ is an empirical parameter (its original value of 0.0023 was used); β is an exponential empirical parameter (its original value of 0.5 was used); Ro is extraterrestrial solar radiation (MJ m-2 d-1).

ETo by the Penman-Monteith FAO 56 method was calculated from the BrazilMet package [51], in R statistical software [30]. The BrazilMet package used daily data from the automatic station of the National Institute of Meteorology (INMET) [52], installed in the municipality of Itapaci-GO (A015), located 20.5 km away from the center of the study area. The coordinates of station A015 are X:656990.30; Y:8343368.99, DATUM WGS84 reference system—UTM plane coordinate system—Zone 22S. EToHS was calculated using data from the aforementioned station, with the exception of extraterrestrial radiation, which was spatially calculated in R statistical software (Eq 16).

(16)

where: Ro is extraterrestrial solar radiation (MJ m-2 d-1); Sc is the solar constant that is equivalent to 0.0820 (MJ m-2 min-1); Dr is the relative distance between the earth and the sun in astronomical units; Ĥ is the hour angle at sunrise; ∅ is latitude; δ is solar declination.

According to Iqbal [53], the relative distance between the earth and the sun in astronomical units can be calculated by Eq 17.

(17)

where: J is the day in the Julian calendar.

The EToBrazil product covers Brazil as a whole and provides daily ETo information that was estimated spatially using data from 849 weather stations [44]. This product has a spatial resolution of approximately 10 km. It is worth pointing out that, for an effective application of the EToBrazil product, it was necessary to use the disaggregate function of the Raster package [43], present in R software [30].

3. Results and discussion

3.1 Validation of models for evapotranspiration fraction (ETf) estimation

To evaluate the possible improvement in the performance of prediction models, the approach of two different scenarios was chosen. The scenarios refer to different combinations of input variables to estimate ETf through regression algorithms, since the intention is to fit a simple and accurate model. This made it possible to know, in each scenario, the regression algorithm that had the highest capacity to predict ETf. The performance of the methods for predicting ETf for the different scenarios is shown in Fig 3.

thumbnail
Fig 3. Daily values of evapotranspiration fraction (ETf) predicted by different approaches (scenarios 1 and 2) compared with values obtained by SAFER from Landsat-8 images in the period from 2016 to 2018.

https://doi.org/10.1371/journal.pone.0285535.g003

For the authors Chandel et al. [54] and Zhang and Zhou [55] the vegetation indices that have a band with a short infrared or red wavelength show greater sensitivity to the water content in the leaves. The presence of the indices generated through the NRPB that were selected in the second scenario, which have a band in the short or red infrared, are seven. This may be one of the arguments that affirms the superior predictive capacity of the second scenario, not ruling out that the greater number of variables influenced the result.

By analyzing Fig 3, it can be noted that both scenarios promoted good performances, because the models showed accuracy (low values of MAE and RMSE) and small presence of outliers due to the proximity of RMSE and MAE. The SVM Linear model overestimated ETf and the other algorithms had MBE values equal to zero, thus showing that there was no overestimation. Therefore, in general, all models showed good accuracy (high values of r2). It can also be observed in each method that the line of the fitted linear model is similar to the ideal line (1:1).

The values obtained from the statistical indicators show that the predicted values (ETfPredicted) are positively correlated with the data derived from ETfSAFER, that is, they had the same linear trend in both scenarios. Although the first scenario contains fewer input variables in the models, the statistical results obtained indicate high predictability of the methods under these conditions.

It is also noticed that the models of the second scenario differed slightly from those of the first scenario, showing a reduction in RMSE values and positive increments in the values of r2, except for the BRNN and XgbLinear models, which had the same metrics in both circumstances. However, the cloud of points is different, being more concentrated around the 1:1 line in the second scenario. It is worth mentioning that, when the models of the second scenario were run, the time required for their training increased more than 70 times in comparison to the model with lowest machine cost, but it did not lead to major changes in the performance of the methods. Thus, computational time, which is highly dependent on the speed of the computer processor, should be considered as an important element of comparison to process large amounts of data in a short time [45, 46].

The MLR and SVM Linear methods showed similar statistical behaviors in the performance of ETf prediction in both scenarios and have lower explanation capacity when compared with the others. However, the SVM Linear method had the highest machine cost, taking up to 70 times longer to be trained, when compared to the MLR, which used the shortest computational time among all algorithms. Cubist, which has better statistical results, had a training time approximately 20 times longer than the time spent by the MLR. It is worth mentioning that in the present study a conventional computer was used to perform the training. Perhaps, if it was necessary to reduce the time of the procedures, it could use the cloud processing to adapt the time to the demand.

Given the analyses that the two scenarios obtained good results, it is necessary to verify the performance of the models that best fitted the ETfSAFER data on each occasion. Thus, the two best models in descending order are the Cubist and the XgbLinear, with slight superiority for the Cubist. XgbTree is the third best model in the second scenario, while in the first scenario it also promoted good performance, with statistical indices equal to those of BRNN. As the performance of the models is considered satisfactory in both strategies, the subsequent analyses and discussion were based on the best model of each scenario.

The selection of an algorithm with excellent accuracy is essential for the obtaining of estimates and later practical application for the prediction of ETr, which is an important parameter for irrigation management. Despite the similarity in the performance of the two best models, the Cubist was chosen because it had a slight advantage in all scenarios studied. Also in this context, the variations of RMSE and r2 values in the validated modeling of Cubist in the first and second scenarios were from 0.10 to 0.09 (-10%) and from 0.92 to 0.94 (2.13%), respectively. In addition, the magnitude of RMSE relative to the mean of ETfSAFER was equivalent to 18.86 and 16.67% for the first and second scenarios, respectively.

The ETfPredicted by the Cubist method, in the present study, is in accordance with the results obtained by Filgueiras et al. [51], who conducted a study with six regression algorithms to estimate ETr without thermal information, using 40% (NDVI and SR) of the independent variables from spectral information of the MODIS sensor on board the TERRA platform. The authors found RMSE and r2 with the help of the Cubist model of 0.40 and 0.92, respectively, which shows the robustness of the tool in ETr estimation. Thus, these authors indicated it as the best algorithm to predict ETr in that situation.

The Cubist model is based on the modified regression trees theory and its principle is to generate prediction models from rule-based systems. These rules are created using training data and each has a multivariate linear model, which are overlapped and arranged in ascending order of predicted mean values [56]. Cubist has been widely applied in academic studies in a variety of areas from satellite images. In this context, several authors conclude that this method provides great robustness and greater accuracy when estimating ET when compared to other traditional machine learning algorithms, such as multiple linear regression, principal component analysis, artificial neural networks, support vector machine, among others [44, 53, 5759].

When using the methodology applied in this study, the estimation of ETf through the MSI sensor becomes feasible and these results can be used in temporal studies containing information derived from Landsat-8, since the model was based on the information of this product. This fact enables the presence of a higher frequency of information and, therefore, it is possible to monitor the evapotranspiration of a crop with a substantially lower image periodicity, as mentioned by Filgueiras et al. [13].

3.2. Estimation of evapotranspiration fraction (ETf) in areas irrigated by center pivot

As the objective of the proposed ETf modeling was to apply the trained and validated model to estimate ETr in agricultural areas a posteriori, it is convenient to analyze the scatter plots of the applicability of the model only in areas irrigated by center pivots. This analysis is important to know the sensitivity of the values of ETfPredicted versus ETfSAFER, and to know the generalization capacity of the models in predictions for dates not used in the training (Fig 4)—which simulates the applicability of the model in practice. When analyzing the MBE, it can be observed that the Cubist model, in this application, overestimated the values in both scenarios and on the two days 08/06/2019 (DOY218) and 04/02/2020 (DOY093), with values ranging from 0.11 to 0.20. For Stone [60], the closer the MBE value is to zero, the better the performance of the model under study.

thumbnail
Fig 4. Application of the Cubist model for prediction of evapotranspiration fraction (ETf) in areas with center pivots compared with values obtained by SAFER with Landsat-8 images for the days 08/06/2019 (DOY218) and 04/02/2020 (DOY093).

https://doi.org/10.1371/journal.pone.0285535.g004

The data on the date corresponding to DOY218 showed poorer behavior compared to that of DOY093 in both scenarios. This is highly associated with land use on this day, showing low mean value of NDVI, ranging from 0.1 to 0.8, with an average of 0.4. For DOY093, NDVI values are within the range from 0.1 to 0.9, with an average of 0.7. NDVI is directly related to the photosynthetic components of the plant, and the presence of low NDVI values indicates that plants, in most center pivots, had a small leaf area or are at the physiological maturity stage [40, 61].

It is important to highlight that NDVI is one of the variables used for the calculation of ETfSAFER. In this scenario, low values of NDVI result in values closer to zero for ETfSAFER, and this occurred around 45% of the area under study for the day referring to DOY218, when ETfSAFER ranged from 0 to 0.40. ETfPredicted, on DOY218, showed values closer to one in both scenarios. Therefore, the errors were relatively higher and the accuracy was lower when compared to the image referring to DOY093, which had ETfSAFER values from 0.45 to 0.90 in more than 70% of the area.

Corroborating Carpintero et al. [62], the results found in the application of the model in areas irrigated by center pivots are possibly linked to the oscillations that occur in the dynamics of land use in agricultural areas, which ends up causing different spectral responses in the data captured by the sensors. Moreover, the relationships observed under different conditions of the study area may not be able to properly generalize, with the same performance, since the behavior of regression algorithms may be highly specific to the site [63, 64].

Dou and Yang [46] also evaluated the performance of models developed to estimate evapotranspiration in four different terrestrial ecosystems (forest, pasture, cropland and wetland). The authors verified that the predictive capacity of the four machine learning algorithms varied with the different ecosystems. These results are justified by the fact that evapotranspiration is influenced by land use and cover [65].

3.3. Estimation of actual evapotranspiration (ETr) using two approaches in ETo calculation in two agricultural crops

Evapotranspiration is an important factor for irrigation management, while land use and cover acts directly on this estimate. Thus, the land use and cover data for the center pivots, from MapBiomas, were accessed (Fig 5). It was verified that 85% of the center pivots were occupied with sugarcane, while the rest were occupied with soybean. From this information, ETr was calculated for each crop and compared using two approaches: ETrSAFER vs ETrEToHS and ETrSAFER vs ETrEToBrazil for the center pivots on the days DOY218 and DOY093 in both scenarios (Fig 6).

thumbnail
Fig 5. Land use and land cover classification from MapBiomas, collection 5.

The figure was elaborated using the open source software QGIS [25]—using an RGB image composite generated from the Landsat 8 satellite images that were downloaded from https://earthexplorer.usgs.gov/. Landsat-8 image courtesy of the U.S. Geological Survey according to [26].

https://doi.org/10.1371/journal.pone.0285535.g005

thumbnail
Fig 6. Application of the Cubist model for the prediction of actual evapotranspiration (ETr) in areas with center pivots using sugarcane and soybean crop, obtained through two approaches, (ETrEToHS and ETrEToBrazil), and compared with values obtained by (ETrSAFER) for the days 08/06/2019 (DOY218) and 04/02/2020 (DOY093).

https://doi.org/10.1371/journal.pone.0285535.g006

The study with both approaches makes it possible to know the best method of ETo to calculate ETr for the conditions and methodologies applied in the present study, which can make it possible to monitor the crop with a substantially higher image frequency, as well as making the observation of the spatial-temporal dynamics of evapotranspiration practical. When comparing the mean difference of the statistical metrics of the second strategy with those of the first one on the two days under study, in the approaches for any land use, it is verified that the first scenario has greater precision in ETr estimation, with some exceptions. The exceptions were for the comparisons ETrSAFER vs ETrEToBrazil for sugarcane and ETrSAFER vs ETrEToHS and ETrSAFER vs ETrEToBrazil on DOY093 in areas with soybean crop.

In this context, it is observed that the estimated values of ETr for sugarcane have higher errors and lower r2 when compared with those for soybean. This occurred mainly for ETrSAFER vs ETrEToHS, whereas, if compared with ETrSAFER vs ETrEToBrazil, the error indicators are relatively better in both crops. However, sugarcane still has lower metrics. This behavior may be explained by the fact that this agricultural crop does not have the same window dynamics as that of soybean, which is more rigid. This is because sugarcane is a semi-perennial crop, with a production cycle of six to seven years.

From the spectral reflectances in the areas irrigated by the center pivots, it is observed that there is a more heterogeneous behavior of sugarcane plants and this is reflected in the prediction of ETr, since the independent variables directly influence ETf. This makes it possible to highlight that the plants have greater spectral variation, when analyzing the center pivots with this crop, different from what is verified with the center pivots with soybean crop (in which the spectral behavior is more homogeneous). It is worth pointing out that the dominant factors that control leaf reflectance are pigments, cell structure and water content [6668]. These factors depend on the crop, physiological stage and management of the cultivation area.

Also in Fig 5, all approaches and land use in the areas irrigated by the center pivots showed an overestimation, except for the comparison ETrSAFER vs ETrEToBrazil for soybean in the first scenario, on the day corresponding to DOY218. In this period, the areas cultivated with sugarcane also stood out, with higher values of MBE (0.68 to 2.02 mm d-1), while for soybeans, the values are between 0.00 and 0.41 mm d-1. Santos et al. [4], when working with the SAFER algorithm and testing various ETo methods to predict ETr, found that MBE is directly related to the values estimated using ETo.

In this study, when analyzing the scatter plots for sugarcane, it was verified that, regardless of the approach used to estimate ETr, it is pointing to higher water demand compared with ETrSAFER, especially ETrEToHS, which would have a direct impact on irrigation management. This indicates that obtaining ETo the HS method is not an adequate strategy to map ETr for sugarcane under the studied conditions, so this is not a recommended methodology.

However, if the producer chooses to apply the average water depth, the ideal is to use the ETrEToBrazil approach. Nonetheless, some areas will be over- or under-irrigated, which will impact yield or result in waste of water resources and may even lead to an increased incidence of agricultural diseases [1, 4].

Santos et al. [4] estimated the ETr of sugarcane crop using SAFER and the ETo data calculated by the HS model. Thereby, verified an overestimation with this strategy, suggesting greater water demand compared with ETrSAFER, which was estimated by ETo calculated by Penman-Monteith FAO56. Therefore, the methodology that uses the HS model was not recommended by the authors. Other authors have also studied the HS method and verified overestimates of the ETo values, indicating improvements in the efficiency of the method only after calibration of the model [69, 70].

As observed in Fig 6, the different ETo estimation methodologies directly impacted the prediction of ETr in both scenarios. Therefore, it is clear that the Cubist model, for the first scenario, and the approach ETrEToBrazil respond better when one is working with soybean crops irrigated by center pivots. This approach has real potential to improve the ETr estimation in time and space and makes it possible to monitor water demand more frequently during the crop cycle.

4. Conclusions and recommendation

The study explored six regression algorithms (MLR, SVM Linear, Cubist, BRNN, XgbLinear and XgbTree) in two different data input scenarios. The first scenario was worked with 11 independent variables, while the second scenario, with 29. Regardless of these scenarios, the Cubist model should be recommended for predicting the evapotranspiration fraction, since it showed the best performance metrics, as well as the greater computational cost compensation required for validation.

The methodology applied in this study made it possible to analyze the actual evapotranspiration (ETr) using the ETrEToHS and ETrEToBrazil approaches in the areas under center pivots with sugarcane and soybean crops. The first scenario and the EToBrazil product comprise the recommended strategy to estimate ETr for both agricultural crops, but the best performances were obtained in soybean crops. In contrast, the Hargreaves-Samani equation overestimated ETr and led to greater errors, especially in irrigated sugarcane crops.

The findings found in this study are promising and of great importance for water management in irrigation, especially in center pivots with soybean crop. Thus, this study proved the potential of using machine learning approaches, considering their effectiveness and ease of implementation. However, it must be emphasized that this research is unprecedented and should be tested in other areas to verify whether the behaviors of the two agricultural crops are similar to the results found here.

Based on the findings, the subsequent recommendations are forwarded:

  • More studies should be carried out to confirm our results and reduce errors in the ETr estimation. In this sense, the introduction of meteorological data from satellites can be used in future studies. These data with daily temporal resolution can provide a better explanation of the phenomenon we are monitoring.
  • The authors also suggest further studies to assess whether the proposed methodology is good enough for ETr spatialization for precision agriculture. This is a point that was not addressed in the present research and that would require evaluations to understand whether the methodology could also be applied for this purpose.

Supporting information

S1 File. Maximum, average and minimum values of Normalized Difference Vegetation Index (NDVI) in areas cultivated with soybean and sugarcane and irrigated by center pivot at different dates.

https://doi.org/10.1371/journal.pone.0285535.s001

(XLSX)

Acknowledgments

We thank the Department of Agriculture Engineering (DEA) and Graduate Program in Agricultural Engineering (PPGEA) of the Federal University of Viçosa (UFV) for supporting the researchers.

References

  1. 1. Silva COF, Teixeira AHC, Manzione RL. Agriwater: An R package for spatial modelling of energy balance and actual evapotranspiration using satellite images and agrometeorological data. Environmental Modelling & Software. 2019; 120: 104497. https://doi.org/10.1016/j.envsoft.2019.104497
  2. 2. Ahmadpari H, Shokoohi ES, Lalabadi NM, Gerdini MS, Ebrahimi M. Assessment of potential evapotranspiration estimation methods in the fasa region. Specialty Journal of Agricultural Sciences. 2019; 5: 56–66.
  3. 3. Yamaç SS, Todorovic M. Estimation of daily potato crop evapotranspiration using three different machine learning algorithms and four scenarios of available meteorological data. Agricultural Water Management. 2020; 228: 105875. https://doi.org/10.1016/j.agwat.2019.105875
  4. 4. Santos JEO, Cunha FF, Filgueiras R, Silva GH, Teixeira AHC, Silva FCS, et al. Performance of SAFER evapotranspiration using missing meteorological data. Agricultural Water Management. 2020; 233: 106076. https://doi.org/10.1016/j.agwat.2020.106076
  5. 5. Senay GB, Friedrichs M, Singh RK, Velpuri NM. Evaluating Landsat 8 evapotranspiration for water use mapping in the Colorado River Basin. Remote Sensing of Environment. 2016; 185: 171–185. https://doi.org/10.1016/j.rse.2015.12.043
  6. 6. Teixeira AHC. Determining regional actual evapotranspiration of irrigated and natural vegetation in the São Francisco river basin (Brazil) using remote sensing and Penman-Monteith equation. Remote Sensing. 2010; 2: 1287–1319. https://doi.org/10.3390/rs0251287
  7. 7. Teixeira AHC, Hernandez FBT, Andrade RG, Leivas JF, Bolfe EL. Energy balance with Landsat images in irrigated central pivots with corn crop in the São Paulo State, Brazil. Remote Sensing for Agriculture, Ecosystems, and Hydrology. 2014; 16: 92390O. https://doi.org/10.1117/12.2067090
  8. 8. Teixeira AHC, Leivas JF, Andrade RG, Hernandez FBT. Water productivity assessments with Landsat 8 images in the Nilo Coelho irrigation scheme. Irriga. 2015; 1: 1–10. https://doi.org/10.15809/irriga.2015v1n2p01
  9. 9. Roerink GJ, Su Z, Menenti M. S-SEBI: A simple remote sensing algorithm to estimate the surface energy balance. Physics and Chemistry of the Earth. 2000; 25: 147–157. http://doi.org/10.1016/S1464-1909(99)00128-8
  10. 10. Petchprayoon P, Blanken PD, Hussein K, Abdalati W, Lawavirotwong S. A decade of changing surface energy balance components over a large water region. Remote Sensing of the Ocean. 2016; 2016: 99990B. https://doi.org/10.1117/12.2241516
  11. 11. Erlandsen HB, Haddeland I, Tallaksen LM, Kristiansen J. The sensitivity of the terrestrial surface energy and water balance estimates in the WRF model to lower surface boundary representations: a south Norway case study. Journal of Hydrometeoroly. 2017; 18: 265–284. https://doi.org/10.1175/JHM-D-15-0146.1
  12. 12. Anapalli SS, Green TR, Reddy KN, Gowda PH, Sui R, Fisher DK, et al. Application of an energy balance method for estimating evapotranspiration in cropping systems. Agricultural Water Management. 2018; 204: 107–117. https://doi.org/10.1016/j.agwat.2018.04.005
  13. 13. Filgueiras R, Mantovani EC, Dias SHB, Fernandes Filho EI, Cunha FF, Neale CMU. New approach to determining the surface temperature without thermal band of satellites. European Journal of Agronomy. 2019; 106: 12–22. https://doi.org/10.1016/j.eja.2019.03.001
  14. 14. Sobrino J, Del Frate F, Drusch M, Jimenez-Munoz JC, Manunta P, Regan A. Review of thermal infrared applications and requirements for future high-resolution sensors. IEEE Transactions on Geoscience and Remote Sensing. 2016; 54: 2963–2972. https://doi.org/10.1109/TGRS.2015.2509179
  15. 15. Granata F. Evapotranspiration evaluation models based on machine learning algorithms: A comparative study. Agricultural Water Management. 2019; 217: 303–315. https://doi.org/10.1016/j.agwat.2019.03.015
  16. 16. Tikhamarine Y, Malik A, Kumar A, Souag-Gamane D, Kisi O. Estimation of monthly reference evapotranspiration using novel hybrid machine learning approaches. Hydrological Sciences Journal. 2019; 64: 1824–1842. https://doi.org/10.1080/02626667.2019.1678750
  17. 17. Antonopoulos VZ, Antonopoulos AV. Daily reference evapotranspiration estimates by artificial neural networks technique and empirical equations using limited input climate. Computers and Electronics in Agriculture. 2017; 132: 86–96. https://doi.org/10.1016/j.compag.2016.11.011
  18. 18. Mattar MA. Using gene expression programming in monthly reference evapotranspiration modeling: a case study in Egypt. Agricultural Water Management. 2018; 198: 28–38. https://doi.org/10.1016/j.agwat.2017.12.017
  19. 19. Ferreira LB, Cunha FF. New approach to estimate daily reference evapotranspiration based on hourly temperature and relative humidity using machine learning and deep learning. Agricultural Water Management. 2020; 234: 106–113. https://doi.org/10.1016/j.agwat.2020.106113
  20. 20. Madugundu R, Al-Gaadi KA, Tola E, Hassaballa AA, Patil VC. Performance of the METRIC model in estimating evapotranspiration fluxes over an irrigated field in Saudi Arabia using Landsat-8 images. Hydrology and Earth System Sciences. 2017; 21: 6135–6151. https://doi.org/10.5194/hess-21-6135-2017
  21. 21. Tang D, Feng Y, Gong D, Hao W, Cui N. Evaluation of artificial intelligence models for actual crop evapotranspiration modeling in mulched and non-mulched maize croplands. Computers and Electronics in Agriculture. 2018; 152: 375–384. https://doi.org/10.1016/j.compag.2018.07.029
  22. 22. Deo RC, Samui P, Kim D. Estimation of monthly evaporative loss using relevance vector machine, extreme learning machine and multivariate adaptive regression spline models. Stochastic Environmental Research and Risk Assessment. 2016; 30: 1769–1784. https://doi.org/10.1007/s00477-015-1153-y
  23. 23. Santos RA, Mantovani EC, Fernandes-Filho EI, Filgueiras R, Lourenço RDS, Bufon VB.; et al. Modeling actual evapotranspiration with MSI-Sentinel images and machine learning algorithms. Atmosphere. 2022; 13: 1518. https://doi.org/10.3390/atmos13091518
  24. 24. El-Hendawy SE, Al-Suhaibani NA, Elsayed S, Hassan WM, Dewir YH, Refay Y, et al. Potential of the existing and novel spectral reflectance indices for estimating the leaf water status and grain yield of spring wheat exposed to different irrigation rates. Agricultural Water Management. 2019; 217: 356–373. https://doi.org/10.1016/j.agwat.2019.03.006
  25. 25. QGIS Development Team. QGIS Geographic Information System. Open Source Geospatial Foundation Project. 2016; 3.28.3. https://www.qgis.org
  26. 26. Earth Resources Observation and Science (EROS) Center. Data citation. https://www.usgs.gov/centers/eros/data-citation.
  27. 27. Elshaikh AE, Yang S, Jiao X, Elbashier MM. Impacts of legal and institutional changes on irrigation management performance: A case of the Gezira irrigation scheme, Sudan. Water. 2018; 10(11): 1579. https://doi.org/10.3390/w10111579
  28. 28. Kadiresan K, Khanal PR. Rethinking irrigation for global food security. Irrigation and Drainage. 2018; 67: 8–11. http://doi.org/10.1002/ird.2219
  29. 29. Kuhn M. Building predictive models in R using the caret package. Journal of Statistical Software. 2008; 28(5): 1–26. https://doi.org/10.18637/jss.v028.i05
  30. 30. R Core Team. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna. 2017. https://www.R-project.org
  31. 31. Kuhn M, Johnson K. Regression trees and rule-based models. Applied predictive modeling. New York:Springer. 2013. https://doi.org/10.1007/978-1-4614-6849-3
  32. 32. Maulud DH, Abdulazeez AM. A Review on Linear Regression Comprehensive in Machine Learning. Journal of Applied Science and Technology Trends. 2020. 1(4): 140–147. http://doi.org/10.38094/jastt1457
  33. 33. Zhang S, Lin G. Robust data-driven discovery of governing physical laws with error bars. Proceeding of the Royal Society. 2018; 474: 20180305. pmid:30333709
  34. 34. Foresee DF, Hagan MT. Gauss-Newton approximation to Bayesian learning. Proceedings of International Conference on Neural Networks (ICNN’97). 1997. http://doi.org/10.1109/icnn.1997.614194
  35. 35. Chen Y, He P, Chen W, Zhao F. A polynomial regression method based on Trans-dimensional Markov Chain Monte Carlo. Institute of Electrical and Electronics Engineers. 2018. 3: 1781–1786. http://doi.org/10.1109/iaeac.2018.8577769
  36. 36. Explorer Earth. Earth Explorer (Download Orbital Images) American Geological Institute. Access on: January 31, 2022. Available at:<https://earthexplorer.usgs.gov/>.
  37. 37. Congedo L. Semi-Automatic Classification Plugin Documentation. 2016; 6.1.1.1. http://doi.org/10.13140/RG.2.2.29474.02242/1
  38. 38. Chavez PS. Image-based atmospheric corrections: revisited and improved. Photogrammetric Engineering e Remote Sensing. 1996; 62: 1025–1036.
  39. 39. Rouse JW, Haas RH, Schell JA, Deering DW. Monitoring vegetation systems in the great plains with ERTS. Proceedings, Third Earth Resources Technology Satellite-1, Symposium, Greenbelt: NASA. 1974; 351: 3010–3017.
  40. 40. Huete A, Didan K, Miura T, Rodriguez EP, Gao X, Ferreira LG. Overview of the radiometric and biophysical performance of the MODIS vegetation indices. Remote Sensing of Environment. 2002; 83: 195–213. https://doi.org/10.1016/s0034-4257(02)00096-2
  41. 41. Gao X, Huete AR, Ni W, Miura T. Optical-biophysical relationships of vegetation spectra without background contamination. Remote Sensing of Environment. 2000; 74(3): 609–620. https://doi.org/10.1016/S0034-4257(00)00150-4
  42. 42. Fernandes Filho EI. Labgeo: Collection of functions to fit models with emphasis in land use and soil mapping. 2019. https://github.com/elpidiofilho/labgeo
  43. 43. Hijmans RJ, Van Etten J, Cheng J, Sumner M, Mattiuzzi M, Greenberg JA, et al. Raster: Geographic data analysis and modeling. 2018; 2: 8–4. Retrieved from https://CRAN.R-project.org/package=raster
  44. 44. Althoff D, Dias S, Filgueiras R, Rodrigues LN. EToBrazil: A daily gridded reference evapotranspiration data set for Brazil—Repository. Water Resources Research. 2020; 56(7): 1–16. http://doi.org/10.1029/2020wr027562
  45. 45. Althoff D, Bazame HC, Filgueiras R, Dias SHB. Heuristic methods applied in reference evapotranspiration modeling. Ciência e Agrotecnologia. 2018; 42: 314–324. https://doi.org/10.1590/1413-70542018423006818
  46. 46. Dou X, Yang Y. Evapotranspiration estimation using four different machine learning approaches in different terrestrial ecosystems. Computers and Electronics in Agriculture. 2018; 148: 95–106. https://doi.org/10.1016/j.compag.2018.03.010
  47. 47. Project MapBiomas-Collection 5 of the Annual Series of Land Cover and Land Use Maps in Brazil. 2020. https://mapbiomas.org/
  48. 48. Allen RG, Pereira LS, Raes D, Smith M. Crop evapotranspiration: Guidelines for computing crop water requirements, FAO irrigation and drainage paper. Food and Agriculture Organization of the United Nations, Rome. 1998. 300p.
  49. 49. Djaman K, Balde AB, Sow A, Muller B, Irmak S, N’Diaye MK, et al. Evaluation of sixteen reference evapotranspiration methods under sahelian conditions in the Senegal River Valley. Journal of Hydrology: Regional Studies. 2015; 3: 139–159.
  50. 50. Hargreaves GH, Samani ZA. Reference crop evapotranspiration from temperature. Applied Engineering in Agriculture. 1985; 1(2): 96–99. https://doi.org/10.13031/2013.26773
  51. 51. Filgueiras R, Almeida TS, Mantovani EC, Dias SHB, Fernandes-Filho EI, Cunha FF, et al. Soil water content and actual evapotranspiration predictions using regression algorithms and remote sensing data. Agricultural Water Management. 2020; 244: 106346. https://doi.org/10.1016/j.agwat.2020.106346
  52. 52. Instituto Nacional de Meteorologia (INMET). Clima. http://www.inmet.gov.br/portal/index.php?r=bdmep/bdmep.
  53. 53. Iqbal M. An introduction to solar radiation. Cambridge: Academic Press. 2012; 390p.
  54. 54. Chandel NS, Rajwade YA, Golhani K, Tiwari PS, Dubey K, Jat D. Canopy spectral reflectance for crop water stress assessment in wheat (Triticum aestivum L.). Irrigation Drainage. 2020; 70: 321–331. https://doi.org/10.1002/ird.2546
  55. 55. Zhang F, Zhou G. Estimation of vegetation water content using hyperspectral vegetation indices: A comparison of crop water indicators in response to water stress treatments for summer maize. BMC Ecology. 2019; 19: 18. pmid:31035986
  56. 56. Im J, Jensen JR, Coleman M, Nelson M. Hyperspectral remote sensing analysis of short rotation woody crops grown with controlled nutrient and irrigation treatments. Geocarto International. 2009; 24(4): 293–312. https://doi.org/10.1080/10106040802556207
  57. 57. Ke Y, Im J, Park S, Gong H. Downscaling of MODIS one kilometer evapotranspiration using Landsat-8 data and machine learning approaches. Remote Sensing. 2016; 8: 1–26. http://doi.org/10.3390/rs8030215
  58. 58. Noi P, Degener J, Kappas M. Comparison of multiple linear regression, cubist regression, and random forest algorithms to estimate daily air surface temperature from dynamic combinations of MODIS LST data. Remote Sensing. 2017; 9: 390–398. https://doi.org/10.3390/rs9050398
  59. 59. Dias SHB, Filgueiras R, Fernandes Filho EI, Arcanjo GS, Silva GH, Mantovani EC, et al. Reference evapotranspiration of Brazil modeled with machine learning techniques and remote sensing. Plos One. 2021; 16: 1–19. pmid:33561147
  60. 60. Stone RJ. Improved statistical procedure for the evaluation of solar radiation estimation models. Solar Energy. 1993; 51: 289–291. https://doi.org/10.1016/0038-092X(93)90124-7
  61. 61. Fontana DC, Pinto DG, Junges AH, Bremm C. Inferências sobre o calendário agrícola a partir de perfis temporais de NDVI/MODIS. Bragantia. 2015; 74: 350–358. https://doi.org/10.1590/1678-4499.0439
  62. 62. Carpintero E, Mateos L, Andreu A, González-Dugo MP. Effect of the differences in spectral response of Mediterranean tree canopies on the estimation of evapotranspiration using vegetation index-based crop coefficients. Agricultural Water Management. 2020; 238: 106201. https://doi.org/10.1016/j.agwat.2020.106201
  63. 63. Kiafar H, Babazadeh H, Marti P, Kisi O, Landeras G, Karimi S, et al. Evaluating the generalizability of GEP models for estimating reference evapotranspiration in distant humid and arid locations. Theoretical and Applied Climatology. 2017; 130: 377–389. https://doi.org/10.1007/s00704-016-1888-5
  64. 64. Ferreira LB, Cunha FF, Oliveira RA, Fernandes Filho EI. Estimation of reference evapotranspiration in Brazil with limited meteorological data using ANN and SVM—A new approach. Journal of Hydrology. 2019; 572: 556–570. https://doi.org/10.1016/j.jhydrol.2019.03.028
  65. 65. Li G, Zhang F, Jing Y, Liu Y, Sun G. Response of evapotranspiration to changes in land use and land cover and climate in China during 2001–2013. Science of the Total Environment. 2017; 596: 256–265. pmid:28433768
  66. 66. Moraes Novo EML. Sensoriamento remoto: Princípios e aplicações. São Paulo: Editora Blucher. 2010; 388p.
  67. 67. Jones HG, Vaughan RA. Remote sensing of vegetation: principles, techniques, and applications. Oxford: Oxford University Press. 2010; 384p.
  68. 68. Moghadam P, Ward D, Goan E, Jayawardena S, Sikka P, Hernandez E. Plant disease detection using hyperspectral imaging. International Conference on Digital Image Computing: Techniques and Applications (DICTA). 2017; 2017: 1–8. https://doi.org/10.1109/DICTA.2017.8227476
  69. 69. Carvalho DF, Rocha HS, Bonomo R, Souza AP. Estimativa da evapotranspiração de referência a partir de dados meteorológicos limitados. Pesquisa Agropecuária Brasileira. 2015; 50: 1–11. http://doi.org/10.1590/S0100-204X2015000100001
  70. 70. Zanetti SS, Dohler R, Cecílio RA, Pezzopane JEM, Xavier AC. Proposal for the use of daily thermal amplitude for the calibration of the Hargreaves-Samani equation. Journal of Hydrology. 2019; 571: 193–201. https://doi.org/10.1016/j.jhydrol.2019.01.049