Figures
Abstract
Coffee is a highly valued commodity and a widely consumed beverage, playing an important role in global trade. However, coffee farming landscapes are increasing transitioning into smaller-scale agricultural setups. This transformation highlights the critical need for accurate classification and mapping of coffee cropping systems (CS), especially in countries like Uganda, where dense vegetation and complex terrain present substantial challenges to traditional land survey methods. Moreover, understanding the spatial distribution of Robusta coffee (Coffea canephora) CS is essential for developing site-specific management strategies, guiding extension services, and informing evidence-based policy decisions. To address this gap, the present study aimed to enhance the discrimination and mapping capabilities of Robusta coffee CS at a sub-pixel scale using a two-step classification approach and multi-date Sentinel-2 (S2) data. In the first step, the random forest (RF) classification algorithm was used to map the major land use and land cover (LULC) classes in Google Earth Engine platform. Then, the Robusta coffee cropland class was masked, and a sub-pixel multiple endmember spectral mixture analysis (MESMA) was employed to discriminate Robusta coffee CS using three endmembers (EMs) obtained from in-situ hyperspectral data collected in 2023 from: (i) Robusta coffee with agroforestry, (ii) Robusta coffee with banana, and (iii) Robusta coffee under full sun. The result showed 93.5% overall accuracy for the major LULC and 89.9% for all Robusta coffee CS classes, disentangled as follows: 91.3% accuracy for Robusta coffee with agroforestry, 88.5% for Robusta coffee with banana, and 91.2% for Robusta coffee under full sun. Moreover, the MESMA sub-pixel algorithm demonstrates credible performance in discriminating the Robusta coffee CS within each S2 pixel at the heterogeneous landscape in the study area. The findings of this study can inform site-specific interventions (e.g., pest management, fertilizer application, etc.) tailored to the type of CS.
Citation: Kebede G, Mudereri BT, Mutanga O, Landmann T, Odindi J, Motisi N, et al. (2026) Mapping Robusta coffee (Coffea canephora) cropping systems in Uganda: A two-step pixel and sub-pixel based approach with Sentinel-2 data. PLoS One 21(1): e0338803. https://doi.org/10.1371/journal.pone.0338803
Editor: Tzen-Yuh Chiang, National Cheng Kung University, TAIWAN
Received: March 5, 2025; Accepted: November 26, 2025; Published: January 5, 2026
Copyright: © 2026 Kebede et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: The data used in this study are publicly available on the icipe data portal: https://dmmg.icipe.org/dataportal/dataset/mapping-robusta-coffee-coffea-canephora-cropping-systems-in-uganda.
Funding: Getachew Kebede was supported by the In-Region Postgraduate Scholarship from the German Academic Exchange Service (DAAD). The authors sincerely appreciate the financial support provided by the European Union (EU) for the project “Robusta coffee agroforestry to adapt and mitigate climate change in Uganda” (GCCA+-Global Climate Change Alliance & DESIRA, Project/Grant No. FOOD/2021/427-759). Additional funding was received from the Swedish International Development Cooperation Agency (Sida), the Swiss Agency for Development and Cooperation (SDC), the Australian Centre for International Agricultural Research (ACIAR), the Norwegian Agency for Development Cooperation (Norad), the Federal Democratic Republic of Ethiopia, and the Government of the Republic of Kenya. The project also benefited from funding by the Centre de Coopération Internationale en Recherche Agronomique pour le Développement (CIRAD). Internal support was also provided by the authors’ host institutions through logistical, technical, and administrative assistance. There was no additional external funding received for this study.
Competing interests: The authors have declared that no competing interests exist.
1. Introduction
Uganda is Africa’s second-largest coffee producer after Ethiopia, with over 1.5 million small-scale coffee farmers accounting for 10% of global coffee production [1]. These farmers cultivate both Arabica (Coffea arabica L. - 23% of the total production) and Robusta (Coffea canephora −77% of the total production) coffee on plots smaller than 0.5 ha, often intercropped with bananas and various agroforestry species [2]. In the recent past, Uganda has faced challenges arising from climate change, that potentially require the replanting or regenerating of older coffee trees. This may explain why Uganda’s coffee industry underperforms compared to the other major producers like Vietnam and Brazil. The Uganda coffee technology adoption, for instance, are hindered by so many factors as opposed to major producers [3]. To address these constraints, Okia & Buyinza [4] recommend on-farm participatory trials to evaluate shade tree compatibility and determine the best spatial and temporal configurations for coffee cropping systems (CS) in the country. Also, researchers promote implementing a cluster approach, tailored to the specific type of coffee CS, where farmer groups collaborate to seek government intervention or support. Furthermore, the Uganda Coffee Development Authority advocates for strategies such as promoting commercial cultivation, replanting, adapting to climate change, and accessing new markets.
Efforts to map coffee CS using remote sensing approaches in Uganda face significant challenges due to dense vegetation and varied terrain. These difficulties are particularly pronounced in coffee CS with dense shade trees [5]. Issues such as topographical variations, complex CS structures (e.g., coffee farm or patch layout, species heterogeneity within and between coffee farms), and the presence of diverse flora and shade trees further complicate accurate coffee CS mapping. Whereas a successful land use and land cover (LULC) classification has been achieved in full sun coffee CS, mapping various coffee CS at a finer landscape scale remain a challenge [6,7]. Hence, some studies have used various data sets, such as high-resolution optical images, spectral indices, and temperature data to enhance the accuracy of classifying coffee agroforestry systems [8,9].
Understanding the spatial distribution of Robusta coffee CS is essential for developing site-specific management strategies, guiding extension services, and informing policy decisions [10,11]. Accurate maps of coffee CS can support targeted pest and disease monitoring [11], facilitate climate adaptation planning [12], and help optimize resource allocation by identifying the most vulnerable or high-potential areas [10]. Despite this importance, few studies have focused on mapping coffee CS at fine scales in heterogeneous landscapes [13]. This study bridges that gap by applying advanced spectral unmixing techniques to classify and map diverse Robusta coffee CS using remote sensing data and advanced machine learning classification algorithms [14].
In the typical small-scale agroecological systems in Africa, inter and intra coffee CS spectral responses are often due to landscape heterogeneity, fragmentation, and intricate crop cycles [15]. Specifically, the subtle spectral characteristics of coffee CS might be uncovered using pixel-based classification methods and image data of medium or high spatial resolution, such as Landsat (30 m) and Sentinel-2 (S2) (10 m). Hence, studies suggest the use of sub-pixel classification methods, such as spectral mixture analysis (SMA) [14], which can effectively decompose mixed pixels with multiple classes and produce fairly acceptable mapping accuracy. Different methods of SMA are successfully used in diverse studies, including simple spectral mixture analysis: sSMA [14], Monte Carlo spectral mixture model: AutoMCU [16], Bayesian spectral mixture analysis: BSMA [17], linear mixture analysis: LAM [18], and multiple endmember spectral mixture analysis: MESMA [19]. These methodologies are implemented by: (1) determining the number of pure spectra known as endmembers (EMs) present in image data, (2) discerning the biophysical characteristics of each EM within a pixel, and (3) estimating the fractional abundances of each EM in a pixel [14,20].
The MESMA algorithm is the most popular sub-pixel classification method due to its low error rates [21]. Furthermore, unlike the spectral mixture analysis, which uses a fixed set of endmembers for all pixels, MESMA allows EM combinations to vary for each pixel, making it more adaptable to spatially heterogeneous surfaces [19]. While deep learning methods can also capture complex spectral–spatial relationships, they typically require large datasets and high computational resources. MESMA provides a more transparent, data-efficient, and interpretable approach for mapping mixed LULC types, especially in areas with limited training data or where physical understanding of spectral behavior is essential [14]. The MESMA has been used in, for instance, a range of fields that include the analysis of urban settings [22], the delineation of fire-affected areas [23,24], discrimination of plant species [19], mapping marshland [25], detection of weeds in maize crop [26] and mapping agro-ecosystems [27].
This study evaluates the performance of MESMA in mapping three distinct Robusta coffee CS using Sentinel-2 imagery in Uganda. We first classified the study landscape into five major LULC classes to distinguish Robusta coffee cropland from other major LULC types. After isolating the Robusta coffee cropland, we applied MESMA exclusively within the Robusta coffee cropland class to further discriminate and map the three Robusta coffee CS at a sub-pixel scale.
2. Materials and methods
2.1. Materials
2.1.1. Study site.
This study was conducted in the main Robusta coffee growing-regions in central, eastern, western, and southwestern Uganda, encompassing 94,076 km2. The study site spans an altitude range of 612–5239 meters above sea level, with coordinates at latitudes 1° 32’N to 1° 21’ S and longitudes 29° 31’ E to 34° 27’ E (Fig 1). The area experiences a tropical climate characterized by an annual bimodal rainfall pattern, with temperatures ranging from 18 to 22 °C. The annual precipitation varies between 750 and 1500 mm [1]. The study area is divided into 34 districts with the main Ugandan Robusta coffee growing-regions and mainly dominated by Robusta coffee cropland [1]. Agricultural practices in the study area are primarily subsistence and small-scale, featuring diverse agricultural systems such as agroforestry with timber and fruit trees. Additionally, intercropping with horticultural crops like banana, cassava, and yam is commonly integrated into the Robusta coffee CS. The main challenges to Robusta coffee production in this region include climate change, pests, and diseases.
2.1.2. Field data collection.
Field surveys were conducted between January 1st and February 28th, 2023, to collect ground polygon data on Robusta coffee cropland (n = 287). Field data were collected using an open data kit (ODK) application installed on a smartphone, which captured location information with an accuracy of ±3 m. Field data collection was carried out across the study sites, facilitated and permitted by the Uganda National Coffee Research Institute (NaCORI) and the National Forestry Resources Research Institute (NaFORRI), which are the national entities responsible for overseeing coffee and forest research in Uganda. These entities were also part of the Robusta project consortium, under which this study was conducted. To reduce the potential sampling bias during LULC Robusta coffee cropland reference data collection, we employed a stratified random sampling approach. Specifically, we divided the landscape into eight clusters based on several landscape and climate indicators, including elevation, slope, rainfall, temperature, and soil properties using a k-means cluster analysis (Table 1).
We then randomly collected ground-truth points for Robusta coffee cropland and major LULC as polygons (i.e., regions of interest: ROIs). The major LULC included, natural forests, mixed agricultural and grassland areas (which encompass commercial farms, subsistence farms, grazing lands, and areas with trees and shrubs), built-up areas, Robusta coffee cropland, and water bodies as described in [26,28]. Additionally, we collected ROI for each major LULC class on Google Earth Platform® with a Landsat 8 background image captured on the 15th of January, 2022. The image had spatial resolution of 30 m x 30 m [29]. The number of pixels for ROIs was calculated based on S2 spatial resolution (10 m x 10 m) using Orfeo ToolBox (OTB) in quantum geographic information system [30]. To ensure representative number of pixels used in the classification experiment, a scaling factor of 0.189 was applied to each major LULC ROI and Robusta coffee cropland sample. This approach allowed for proportionally balanced representation of pixel samples across ROIs of varying sizes. As a result, a total of 3,000 pixels were selected for all major LULC classes combined. A summary of the reference sample pixels used is provided in (Table 2). The representative pixel samples were then divided into 70% (2100) for training and 30% (900) for validating the classification model.
To address potential concerns regarding spatial autocorrelation between training and validation samples, we included a map (Fig 2) illustrating the distribution of polygon used for training and validating the random forest (RF) classification algorithm. These polygons are distributed randomly across the eight clusters. The digitized polygons area 0.27 to 50.7 hectares. The majority of polygons fall within the 0.27–12 ha range, with a mean size of 5.6 ha. Some polygons are close to each others, but since we randomly selected the training and validation samples, we expect minimum or no autocorrelation among these samples.
a), and b) zoomed-in view of the distribution and size of polygons within the highlighted red box.
The sample polygons were organized at a point pixel scale in QGIS [30], retrieved in the GEE, and their reflectance values were extracted at the S2 spectral (Blue, Red, Green, Red-edge1, Red-edge2, Red-edge3, near-infrared (NIR), and narrow-band near-infrared (NIRn)) bands of a 10-m spatial resolution.
2.2. Methods
The proposed methodology employed a two-step classification approach for mapping Robusta coffee CS. In the first step, a multi-date S2 composite was utilized to map the major LULC in the study area. This is achieved using the RF [31] classifier in a semi-automatic approach within the Google Earth Engine (GEE). The second step involved spectral unmixing analysis of the derived Robusta coffee into three CS using MESMA and a dry season S2 median composite image generated from all available images acquired between January 1st and February 28th, 2023 in R-software [32]. The overall workflow of the classification approach utilized in this study is illustrated in Fig 3, with detailed explanations of each step provided in the subsequent sub-sections. To ensure consistency, all spatial data layers and maps were processed and visualized using World Geodetic System (WGS) 84 coordinate system in decimal degrees.
2.2.1. Sentinel-2 (S2) data compositing and classification in the Google Earth Engine (GEE).
The study employed GEE to map the major LULC. The GEE offers several advantages, including ease of use, access to a comprehensive library of global remotely sensed datasets, powerful cloud-based computing power, and the ability to process large-scale imagery without being constrained by the user local computer memory. This is particularly beneficial when working with high-resolution or global-scale data. A multi-date S2 image composite covering the period from 01/01/2022–31/12/2022 was used in this study. All image processing and classification steps were carried out in the GEE platform. The reflectance data from eight S2 tiles, initially provided as top-of-atmosphere reflectance, were retrieved and used in the major LULC classification experiment. These images underwent a sequence of processing steps, including cloud filtering, atmospheric correction, and normalization for seasonal illumination effects. Median compositing was employed to reduce the influence of redundant or invalid pixels caused by clouds, shadows, or other atmospheric disturbances [33]. The resulting composite was resampled to a 10 m spatial resolution using bilinear interpolation. Removing contaminated pixels is essential to minimize spectral errors induced by atmospheric conditions and ensure more reliable estimation of biophysical parameters [34].
The median compositing method was selected for the classification of both major LULC classes and Robusta coffee CS due to its computational efficiency and superior technical performance compared to other compositing techniques, such as maximum ratio, annual greenness pixel, best pixel (based on the distance to the nearest cloud), and seasonal greenest pixel [33]. By applying the GEE ‘median()’ function, this method effectively filtered out cloud-contaminated (high-value) and shadow-contaminated (low-values) pixels across all bands, capturing seasonal variability while retaining the most representative pixel values over the entire year [33,35]. Additionally, spatial resampling was performed to standardize all S2 bands to a uniform 10-m spatial resolution.
Five vegetation indices (VIs) were calculated (Table 3) and integrated with S2 spectral bands to support the classification of major LULC in the study area. These indices included chlorophyll vegetation index (CVI), modified simple ratio for vegetation (MSR), normalized difference vegetation index (NDVI), a combination of the modified chlorophyll absorption in reflectance index (MCARI) and optimized soil-adjusted vegetation index (OSAVI), and the Beison Datt vegetation index (DATT). The description, significance and proven effectiveness of these indices for vegetation-based analysis were well documented in previous studies [29,39–42].
We used the RF classification algorithm [31] to map the major LULC classes in the study area. The RF algorithm is renowned for its effectiveness in various remote sensing studies, particularly in tasks such as cropland mapping [43]. The algorithm is an ensemble machine learning method that constructs multiple decision trees (ntree) for classification and regression. Each decision tree is developed using a bootstrap sample, known as “in-bag” data, which constitutes two-thirds of the original dataset. Random subsets of variables (mtry) are employed to split nodes in these decision trees, with the default mtry value calculated as the square root of the total number of variables [44]. Reducing the number of variables utilized in a split not only lowers the algorithm’s computing complexity but also reduces the correlation between the ntree. The RF uses a majority voting to label the different classes in the experiment and produce consistent and reliable predictions. For this study, the default settings of ntree (500 trees) and mtry [3] were applied for a pixel-based RF classification and mapping of major LULC classes. It is reported that the performance of RF classifier based on the default ntree and mtry is not significantly different when these two parameters are tuned and optimized [31,44]. The major LULC classes were removed, except the Robusta coffee cropland class, which was retained to generate a mask for further analysis.
2.2.2. Endmembers (EMs) selection and spectral data collection.
In this particular study, three spectra relevant to the Robusta coffee CS were identified and utilized in the MESMA experiment. According to Kebede et al. [45], the three EMs (Fig 4) were classified as follows: (1) Robusta coffee with agroforestry, where tall (18 m on average) fruit and timber trees were co-existing with the coffee crop. In this CS, coffee is an understory layer. The primary timber trees in this CS included Grevillea robusta, Albizia adianthifolia, Ficus natalensis, Maesopsis eminii, and Markhamia lutea whereas the fruit trees were Artocarpus heterophyllus, Mangifera indica, Persea americana, Carica papaya, and Citrus reticulata, This CS also included some shrubs such as Hibiscus syriacus, (2) Robusta coffee with banana. This CS was dominated by Robusta coffee and banana. Additionally, intercrops of short fruits and vegetables were also co-existing as understory layer. The main short fruits and intercrops in this category included Citrus sinensis, Theobroma cacao, Manihot esculenta, Dioscorea bulbifera, Zea mays, and Ipomoea batatas, and (3) Robusta coffee under full sun without shade trees layer. The spectral data for these EMs were collected using in-situ hyperspectral measurements that are fully described in [45]. A total of 60 purposefully selected field plots representing Robusta coffee CS were identified within the study area. From these plots, a total of 1,260 leaf spectra were collected using a FieldSpec HandHeld 2™ Spectroradiometer [46]. This non-imaging sensor measures electromagnetic radiation in the 325–1075 nm range with a 25-degree full conical field of view [46]. For each of the three target endmembers, 60 samples were obtained. Leaf-level spectral data were collected between January 1st and February 28th, 2023, from 20 plant species, with three leaf spectra measured per species, as well as five bare soil samples. Prior to measurement, the spectroradiometer was optimized and calibrated using a Spectralon white reference panel (~100% reflectance) to correct for solar illumination and atmospheric variability. Calibration was repeated every 20 readings to ensure consistent and reliable data [47]. To minimize measurement error, the spectroradiometer was held 5–7 cm above the adaxial (upper) leaf surface, depending on leaf size [48]. All readings were taken under clear, sunny conditions between 10:00 a.m. and 2:00 p.m. local time (GMT + 3). Each leaf spectrum was recorded three times to reduce handling error and improve accuracy, resulting in a total of 1,260 high-quality spectral measurements [48]. Subsequently, these samples were filtered using the ‘Noise Filtering’ function and smoothed with the ‘Savitzky–Golay’ filter utilizing the ‘hsdar’ package [49] in R software [32]. Spectral resampling to match the S2 sensor configuration was performed using the ‘spectralResampling function in the ‘hsdar’ package [49] in R software [32]. The resulting mean reflectance values for each band were then used as input EM spectra for the MESMA implemented in R software [32] via the ‘mesma()’ function from the RStoolbox package [50] which enable sub-pixel unmixing and model evaluation.
2.2.3. Robusta coffee cropping systems (CS) discrimination using multiple endmember spectral mixture analysis (MESMA).
A 3-month median S2 image composite (01/01/2023–28/02/2023), was pre-processed in GEE following the procedure detailed in Section 2.2.1. This image composite was chosen due to its temporal correspondence with the in situ spectral data collection period. A sub-pixel MESMA model, which is a linear mixture of pure spectral EMs, [19] was applied to classify the three Robusta coffee CS. This model allowed the calculation of the fractional abundance of the CS [51]. For each pixel, MESMA iteratively runs multiple candidate models, with the best model determined based on identified pixel fractions, their residuals, and the smallest root mean square error (RMSE) compared to the pixel’s spectral curve [21]. These models are then applied to the entire image on a per-pixel basis. In this study, the MESMA algorithm, integrated into the ‘RStoolbox’ package [52] in R software [32] was used. The ‘RStoolbox’ package for MESMA employs the non-negative least squares (NNLS) regression through a sequential coordinate-wise algorithm. The MESMA execution was performed using default settings, including the NNLS method, 400 iterations, and a tolerance of 1 × 10 ⁻9. The MESMA algorithm’s outputs consist of individual bands representing estimated per-pixel EM abundance fractions and the probability of occurrence for each tested EM, ranging from 0 to 1. We employed spectral mixture analysis (SMA) to generate the fractional abundance maps for three Robusta coffee CS, following the recommendations of Salih [53]. We also performed a threshold sensitivity to compute the percentage of pixels exceeding fractional abundance thresholds of 0.3, 0.4, and 0.5 for Robusta coffee with agroforestry, Robusta coffee with banana, and Robusta coffee under full sun, respectively. We then assessed the spectral separability among endmembers using the spectral angle mapper (SAM) approach as suggested by Salih [53]. The SAM computes the angle (in degrees) between spectral vectors in n-dimensional space, where smaller angles indicate higher similarity. Although SAM is not a classifier, it is widely used to quantify spectral similarity and assess the distinctiveness of LULC classes [54,55]. Based on established guidelines, spectral angles less than 10–15° indicate high similarity, angles between 15–30° suggest moderate separability, and angles greater than 30° reflect low similarity or high separability between classes [56].
2.2.4. Classification accuracy assessment.
Accuracy evaluation is crucial in generating thematic maps using remotely sensed data [57]. This process requires high-quality reference datasets sampled at appropriate spatial and temporal scales [58]. To assess the two-step classification approach for creating Robusta coffee CS maps, comparisons were made with ground-based verification point data.
The evaluation of classification outcomes from both stages, including major LULC and Robusta coffee cropland mapping, involved comparing random validation pixels for accuracy. For major LULC classification, 70% of the extracted pixels were used for training and 30% for validation. However, we acknowledge that the validation relied on polygons and pixels from the same period and region, which may have led to an overestimation of accuracy [59], as independent temporal or spatial datasets were not available for this study. In the absence of percentage coverage data for certain Robusta coffee CS classes, confusion matrix methods were used to validate MESMA results. Each pixel in the MESMA map was assigned to the land cover class with the highest fractional coverage, ensuring a robust accuracy assessment. For MESMA classification, only 228 independent ground-based points collected from the three Robusta coffee CS field plots were used for assessing the classification accuracy. Specifically, the validation of the classification involved the use of accuracy assessment metrics derived from confusion matrices. These metrics encompass overall accuracy (OA), user’s accuracy (UA), producer’s accuracy (PA) and Kappa statistics (K). The PA represents the accuracy of correctly classified samples within each category, while the user’s accuracy (UA) denotes the proportion of correctly classified samples for a specific class. The Kappa statistic (K) measures the discrepancy between the actual agreement of a classified map and the agreement that would be expected by chance, based on comparison with reference data [60]. The root mean square (RMSE) evaluates the average magnitude of prediction errors by measuring the square root of the average squared differences between predicted and observed values. Lower RMSE indicates better model accuracy [61]. Additionally, the overall accuracy (OA) reflects the accuracy of correctly classified samples among all samples of the Robusta coffee CS [23]. Area estimates for each class were adjusted using the reference sample to correct for map classification errors and were accompanied by 95% confidence intervals (CIs) to quantify the uncertainty in the estimated areas. All analyses were performed in R following recommended practices. This approach follows the accuracy assessment and area estimation best practices as recommended by Olofsson et al. [62], which emphasize the importance of integrating sampling design, response design, and consistent analysis for transparent land change validation.
3. Results
3.1. Major land use and land cover (LULC) mapping
The study area’s major LULC classes (five classes) were mapped with an OA of 93.5% (Fig 5 and Table 4). The study area was dominated by mixed agricultural and grassland class, which covered 75.1% of the total area. In contrast, the built-up areas account for only a small fraction (1.7%). Robusta coffee cropland accounts for 8.7% of the total area.
3.2. Robusta coffee cropping systems (CS) classification using multiple endmember spectral mixture analysis (MESMA)
Fig 6, illustrates the spectral responses of the three chosen EMs within the eight resampled S2 bands, aligning with the wavelength range of the spectroradiometer instrument employed in this study. Four S2 bands (red-edge2, red-edge3, NIR, and NIRn), noticeable inter-category variations in EMs reflectance values were observed. Furthermore, the correlation between bands Red-edge3, and NIRn had a p-value < 0.0008, indicating a statistically significant correlation between the three endmembers, illustrating spectral distinctiveness. Conversely, the visible bands (blue, green, and red) and red-edge1 did not exhibit any significant differences across the three EMs.
The eight bands correspond to the blue, green, red, red edge (RE1), RE2, RE3, near infrared (NIR), and narrow-band near-infrared (NIRn) waveband of S2.
Fig 7 depicts the distribution and frequency of pixel fractions for the three EMs used in this study based on the unmixing result on S2. In the study area, Robusta coffee with agroforestry occupied the majority of the pixels, followed by Robusta with banana and Robusta under full sun EMs. Robusta coffee with agroforestry occupied numerous pixels with relatively large fractions, however, fewer pixels contained Robusta coffee with banana or Robusta coffee under full sun with these fractions generally below 0.5. The high fractional abundance of Robusta coffee with agroforestry pixels ranged between 0.1 and 0.6-pixel fractions, with > 65% of pixels containing zero fractions of the two Robusta coffee CS.
The proportion of Robusta coffee under full sun EM was relatively low in comparison to the other two EMs, as evidenced by the few pixels representing the Robusta coffee under full sun fraction. The majority of pixels had a low RMSE (< 0.05), whereas the highest RMSE over all pixels was 0.078. The average RMSE (0.021) indicated a good model fit for MESMA and relatively small EM prediction errors across the entire study area. MESMA created four fraction images for the three EMs, along with their corresponding RMSEs.
Fig 8 depicts the S2-based classification results from the MESMA of the fraction images of the three EMs. The results indicate that the RMSE was typically low throughout the study area, except for the northeast and southwest areas. The fractional abundance distribution of Robusta coffee CS types in the study area varied, with Robusta coffee with banana pixels ranging between 0.5 and 0.6 dominating the southwest and Robusta coffee with agroforestry in the middle and northeastern parts of the major Robusta coffee growing-regions. The threshold sensitivity analysis (Fig 9a) showed that the percentage of pixels classified under each of the three Robusta coffee CS at thresholds of 0.3, 0.4, and 0.5. The Robusta coffee with banana CS maintained the highest pixel purity, followed by Robusta coffee under full sun, while Robusta coffee with agroforestry showed the steepest decline due to spectral mixing. The SAM analysis (Fig 9b) revealed very high spectral separability (50°) between Robusta coffee with agroforestry and banana CS, indicating clear spectral distinction. A moderate separability (30°) was observed between Robusta coffee with banana and Robusta coffee under full sun CS, while Robusta coffee with agroforestry and Robusta coffee under full sun showed low separability (20°), suggesting overlapping spectral characteristics.
The classification results of the three types of Robusta coffee CS presented in Table 5 and Fig 10, indicated 89.9% classification accuracy for all Robusta coffee CS classes and disentangled as follows: Robusta coffee with agroforestry covers 63.7% of the total coffee CS area and has a UA of 91.3.7%, Robusta coffee with banana covers 24.8% with a UA of 88.5%, and Robusta coffee under full sun covers 11.5% with a UA of 91.2%. Moreover, the corrected area estimates indicate that Robusta coffee with agroforestry is the most widespread CS, covering 468,377 hectares (95% CI: 431,243–505,511 ha), followed by Robusta with banana at 189,814 hectares (95% CI: 172,879–206,749 ha). Robusta coffee under full sun is the least extensive, with 79,658 hectares (95% CI: 69,394–89,922 ha).
4. Discussion
This study employed the RF algorithm, utilizing the high spatiotemporal resolution of S2 imagery integrated within the GEE platform to classify major LULC, including Robusta coffee cropland. To enhance the classification accuracy, five VIs were incorporated together with S2 bands. These indices, selected for their sensitivity to vegetation structure, chlorophyll content, and canopy moisture [63]. These plant morphological and physiological traits could considerably vary among our major LULC classes, and since they were mimicked by the VIs. Studies have shown that when VIs are combined with the spectral bands, the classification accuracy can be reasonably improved [64,65], particularly in such a complex landscape that is characterized by fragmented and small-scale crop fields (<0.5 ha). The use of multi-date and fine spatial resolution S2 imagery enabled the detection of seasonal vegetation dynamics, contributing to highly accurate major LULC classification. However, some classes, such as the Robusta coffee cropland, showed a relatively lower classification accuracy (PA = 59.4%) compared to other land covers. This is likely due to the spectral similarity of Robusta coffee cropland with surrounding vegetation and co-existing species. Also, Robusta coffee cropland is a mixture of shaded and unshaded CS that have different spectral characteristics, which could not have been distinguished at a pixel level.
Many recent studies have reported the successful application of remote sensing techniques for crop classification and mapping [59,66,67], including discriminating Robusta coffee CS [45]. The precision of cropland mapping using S2 imagery has been particularly emphasized. Similarly, Gomez et al. [68] in New Caledonia and Mosomtai et al. [7] in Kenya successfully mapped coffee crop among other associated LULC classes using S2 imagery [69]. Additionally, Escobar-López et al. [13] detected shaded and unshaded coffee crops in Mexico with an OA of 95% using S2 and other auxiliary climatic variables.
The classification of Robusta coffee CS demonstrated high reliability, with an OA of 89.9% and a Kappa coefficient of 83.0%, indicating strong agreement with reference data. UA and PA ranged from 88.5% to 91.3% and 83.8% to 92.4%, respectively. The corrected area estimates revealed that Robusta coffee with agroforestry is the dominant CS, followed by Robusta coffee with banana and Robusta coffee under full sun. These accuracy-adjusted estimates provide a more reliable assessment of the true spatial distribution of each Robusta coffee CS than uncorrected map outputs. Therefore, the corrected area estimates offer a more accurate and defensible basis for understanding the landscape-level prevalence of each Robusta coffee CS. The slight variation may likely be attributed to the complexity of the heterogeneous landscape and the diverse crop composition among smallholder farmers. This variability introduces significant spectral and textural differences, making major LULC classification more challenging. Importantly, mapping Robusta coffee CSs extends beyond simple land-cover classification. As noted by Cassamo et al. [70], detailed characterization of cropping systems forms the foundation for assessing ecosystem services and sustainability within smallholder agroecosystems. This highlights the broader policy and environmental relevance of our findings for coffee-producing regions, where spatially explicit insights can inform sustainable land management and climate-resilient agricultural planning. Additionally, the prevalence of mixed Robusta coffee CS on small fields (<0.5 ha) in regions like Uganda have a wide range of field sizes, orientations, and shapes, which might have also contributed to the relatively lower classification accuracy. Notably, the UA (commission error) for Robusta coffee cropland mapping was higher than the PA (omission error), indicating a trade-off between omission and commission errors in the classification process. The relatively higher omission errors in our major LULC classification experiment could also be due to the small field size and several other confounding landscape factors. This approach led to lower commission errors, where other major LULC areas were misclassified as Robusta coffee cropland. This challenge of balancing PA and UA aligns with findings from previous studies [43,71].
The resampled S2 red-edge and near infrared bands (5, 6, 7, and 8a) showed significant variations in the mean reflectance values of Robusta coffee CS EMs. In contrast, the visible bands (1, 2, and 3) and red-edge1 (4) did not display noteworthy differences across the three Robusta coffee CS EMs. This highlights the discernible spectral characteristics within specific bands and underscores the efficacy of these wavelengths in capturing variations among EMs, aligning with previous studies emphasizing the importance of selected S2 bands in species identification [72]. Furthermore, the observed differences among Robusta coffee CS EMs were attributed to the distinct spectral signatures of chlorophyll, nitrogen, phosphorus, and potassium in Robusta coffee and its co-existing plants for each EM group, which exhibit distinguishable characteristics in the red-edge region [73].
The results of the MESMA showed that most pixels exhibited low RMSE values, typically below 0.05, with the highest RMSE value of 0.078, encompassing nearly the entire study area. The mean RMSE of 0.021 suggests that MESMA provides a good model fit with minimal prediction errors across the study area; however, we acknowledge that a low RMSE does not necessarily equate to high classification accuracy. Moreover, MESMA accuracy assessments, using independent validation data, demonstrated promising results, with low omission errors. Although alternative sub-pixel classification approaches, such as support vector machine (SVM) [74] and deep learning [75], could have also been applied in this study; but a direct comparison between MESMA and these methods in mapping coffee CS was beyond this study scope. Future studies should explore the performance of different sub-pixel classifiers in mapping Robusta coffee CSs.
The two-step approach, combining RF for major LULC mapping and MESMA for mapping Robusta coffee CS, improved accuracy by minimizing spectral confusion and suppressing spatial constraints from other classes; however, this improvement was not empirically tested within this study. Like our approach, spectral sub-pixel approaches, commonly employed in mapping landscape dynamics involving moderate- and low-resolution imagery within intricate and diverse landscapes, aim to overcome the limitations associated with spatial resolution. These approaches aim to estimate the relative proportions of different constituents within a pixel. They consider a single pixel as a representation and identify pure sub-pixel classes known as “endmembers”. Schmitt-Harsh et al. [76] utilized spectral mixture analysis with aerial photographs, which resulted in varying accuracies when mapping coffee agroforestry systems. Similarly, Bispo et al. [77] employed spectral linear mixing but encountered limitations regarding plant heterogeneity, varying plant sizes, diverse management systems, and inconsistent spacing, achieving an OA of 67%. The strength of our approach is that we used a 2-step approach that aims to provide a detailed fractional coverage of three Robusta coffee CS. However, our study area represents a highly heterogeneous cropping system, where the structural and compositional diversity of Robusta coffee CSs influences both classification performance and ecological outcomes. This relationship aligns with broader agroecological evidence showing that variations in cropping system heterogeneity strongly affect remote sensing detectability as well as landscape-scale ecological functions [5,13]. However, while spectral sub-pixel approaches offer an improved capability to capture landscape heterogeneity compared to pixel-based methods, they still face constraints related to cloud cover and tend to be more time-intensive.
The fractional abundance distribution of Robusta coffee CS classes in the study area varied, with Robusta coffee with banana pixels ranging between 0.5 and 0.6 dominating the southwest and Robusta coffee with agroforestry pixels ranging between 0.5 and 1.0-pixel fractions, with > 65% of pixels in the middle and northeast.
In evaluating the classification robustness, the threshold sensitivity analysis revealed that Robusta coffee with banana consistently retained the highest pixel purity across all thresholds, followed by Robusta coffee under full sun, while Robusta coffee with agroforestry exhibited the steepest decline. This suggests that Robusta coffee with banana has more spectrally distinct and homogeneous coverage, whereas Robusta coffee with agroforestry is more spectrally mixed due to vegetation complexity. As the fractional threshold increased from 0.3 to 0.5, the decline in pixel percentages, particularly for Robusta coffee with agroforestry and Robusta coffee under full sun, indicates growing uncertainty in abundance estimates at stricter thresholds. Supporting these findings, the SAM analysis showed very high spectral separability (50°) between Robusta coffee with agroforestry and Robusta coffee with banana, indicating a considerable distinction. Moderate separability (30°) was observed between Robusta coffee with banana and Robusta coffee under full sun, while Robusta coffee with agroforestry and Robusta coffee under full sun displayed low separability (20°), suggesting greater spectral similarity. These results underscore the ability to discriminate Robusta coffee with agroforestry from Robusta coffee with banana, while distinguishing Robusta coffee with agroforestry from Robusta coffee under full sun remains more challenging due to spectral overlap.
However, caution is advised in interpreting these findings due to challenges in determining optimal endmembers that represent the heterogeneity of Robusta coffee CS. Spectral variations within the same class may be influenced by factors such as tree ages, abiotic stresses, and insect pests such as black coffee twig borer (BCTB) that damages the coffee leaves. Also, it is possible that some of the spectral variations in our EMs were obscured by the area sampled (scanned) by the spectroradiometer. This frequently results in uncertainty, especially when taking into account the temporal variance of the EM spectra recorded at various locations and times. Furthermore, socio-economic variables, such as farmer practices, crop management intensity, and yield differences, could also contribute to unexplained spectral variability in the EMs. While such datasets were currently not spatially available, integrating these information in future work could provide a more comprehensive understanding of Robusta coffee CS spectral heterogeneity for effective and accurate classification and mapping.
Specifically, integration of different Robusta coffee CS as an endmember contributed to accurate Robusta coffee CS discrimination. However, caution is warranted when applying this approach in different spatial or temporal settings due to potential variations in brightness effects. Also, this study was based on Robusta coffee CS EM data collected only during a dry season (January to February). Seasonal variations in vegetation phenology, pest damage, and other biotic or abiotic stressors can significantly alter the spectral characteristics of both the major LULC classes and the three Robusta coffee CS [73]. These changes reduce their spectral separability and hinder accurate mapping when relying on a single image or spectral sampling from only one season. Therefore, future studies should incorporate multi-seasonal imagery and EMs derived from both dry and wet seasons, as well as from varying stress conditions, to enhance the robustness and reliability of classification accuracy. Similarly, validation polygon samples were obtained from the same period and region to assess the performance of our classification approach. This approach, while practical, may lead to a slight overestimation of classification accuracy [59]. Future studies should employ independent validation datasets collected across different time periods and geographic locations to enhance the reliability and robustness of the classification accuracy assessment. While this study primarily relied on S2 imagery for mapping Robusta coffee CS, it is important to acknowledge the role of in situ spectral data. Although not explicitly compared here, in situ data provide high-resolution spectral detail at the plot level but are limited in spatial coverage and temporal frequency. In contrast, S2 offers consistent and scalable imagery, making it more suitable for operational mapping across large regions.
A key limitation of this study lies in the class imbalance within the training dataset, particularly concerning land cover types such as water bodies and built-up areas. These classes occupy relatively small spatial extents in the study area, leading to limited sample sizes. This imbalance may have adversely affected classification accuracy and constrained the model’s ability to generalize across diverse agroclimatic zones. Although the tree Robusta coffee CS were distinctly identified and classified using in situ hyperspectral data in Kebede et al. [45], spectral characteristics of other tree crops, which could be confused with Robusta coffee, and the possible presence of Arabic coffee could have compromised the purity of the ENM applied in this study. Another shortcoming of this study is the inclusion of agriculture and grassland in one major LULC class. This could have caused spectral confusion in this class as the two land cover types, i.e., agriculture and grassland, have distinct spectral characteristics. As long as the spatial resolution of the Google Earth background image (30 m) is concerned, and based on our field observations, a clear separation between agriculture and grasslands was not possible in our study area. Although representative pixels were carefully selected using field data and medium-resolution imagery, we recommend that future research should mitigate these limitations by increasing sample sizes for underrepresented classes and using higher-resolution imagery.
5. Conclusions
Our 2-step classification approach improves the accuracy of discriminating and mapping Robusta coffee CS in the main Robusta coffee growing-regions of Uganda. The study demonstrated the effectiveness of S2 composite image, the RF classifier, and MESMA in discriminating among the three Robusta coffee CS classes. Importantly, producing a Robusta coffee cropland mask before employing MESMA enabled a focused and spatially adjusted spectral unmixing process. Additionally, extracting EMs from in-situ spectrometric data provided a more realistic spectral array for the three Robusta coffee CS classes. It is recommended that future research should compare the effectiveness of using Robusta coffee CS EMs sourced from various image datasets employing different EM selection techniques, such as endmember average root mean squared error (EAR), minimum average spectral angle (MASA), and count-based endmember selection (CoB). Moreover, we were able to assess Robusta coffee CS fractional coverage at a sub-pixel scale for three different Robusta coffee CS. Future research should also evaluate the comparative performance of the above-mentioned EMs derived from different datasets and selection strategies in classifying different coffee CS. Furthermore, exploring seasonal or multi-temporal composites, incorporating hyperspectral imagery, or integrating drone-based observations could open new avenues for refining Robusta coffee CS maps that are important for informed decision making regarding coffee pest management, harvest monitoring, implementing sustainable and effective agronomic practices, and ecosystem services.
Acknowledgments
The authors sincerely appreciate the support and assistance provided by the Uganda National Coffee Research Institute (NaCORI) and the National Forestry Resources Research Institute (NaFORRI) for facilitating field data collection.
References
- 1. Bunn C, Lundy M, Läderach P, Fernández-Kolb P, Castro-Llanos F. Climate-smart coffee in Uganda. Feed Futur. 2019:1–24. Available from: https://cgspace.cgiar.org/handle/10568/101331
- 2. Bukomeko H, Jassogne L, Tumwebaze SB, Eilu G, Vaast P. Integrating local knowledge with tree diversity analyses to optimize on-farm tree species composition for ecosystem service delivery in coffee agroforestry systems of Uganda. Agroforest Syst. 2017;93(2):755–70.
- 3. Uganda Coffee Development Authority (UCDA). Robusta coffee handbook: A sustainable coffee industry with high stakeholder value for social economic transformation. 2019;2–148. Available from: https://greeninguganda.com/images/product/coffee/robusta.pdf
- 4.
Okia CA, Buyinza J. Review and Analysis of policies, strategies, attitudes, practices and institutions for strengthening coffee agroforestry in Uganda. 2022;(December).
- 5. Hunt DA, Tabor K, Hewson JH, Wood MA, Reymondin L, Koenig K, et al. Review of Remote Sensing Methods to Map Coffee Production Systems. Remote Sensing. 2020;12(12):2041.
- 6. Schmitt-Harsh M. Landscape change in Guatemala: Driving forces of forest and coffee agroforest expansion and contraction from 1990 to 2010. Applied Geography. 2013;40:40–50.
- 7. Mosomtai G, Azrag AGA, Babin R, Abdel-Rahman EM, Odindi J, Mutanga O. Functional land cover scale for three insect pests with contrasting dispersal strategies in a fragmented coffee-based landscape in Central Kenya. Agric Ecosyst Environ. 2021;319(February):1–10.
- 8. Kelley LC, Pitcher L, Bacon C. Using Google Earth Engine to Map Complex Shade-Grown Coffee Landscapes in Northern Nicaragua. Remote Sensing. 2018;10(6):952.
- 9. Hebbar R, Ravishankar HM, Trivedi S, Manjula VB, Kumar NM, Mukharib DS, et al. National level inventory of coffee plantations using high resolution satellite data. Int Arch Photogramm Remote Sens Spat Inf Sci - ISPRS Arch. 2019;42(3/W6):293–8.
- 10.
Closing the Potential-Performance Divide in Ugandan Agriculture. Closing Potential-Performance Divid Ugandan Agric. 2018.
- 11. Jaramillo J, Muchugu E, Vega FE, Davis A, Borgemeister C, Chabi-Olaye A. Some like it hot: the influence and implications of climate change on coffee berry borer (Hypothenemus hampei) and coffee production in East Africa. PLoS One. 2011;6(9):e24528. pmid:21935419
- 12. Ovalle-Rivera O, Läderach P, Bunn C, Obersteiner M, Schroth G. Projected shifts in Coffea arabica suitability among major global producing regions due to climate change. PLoS One. 2015;10(4):e0124155. pmid:25875230
- 13. Escobar-López A, Castillo-Santiago MÁ, Hernández-Stefanoni JL, Mas JF, López-Martínez JO. Identifying Coffee Agroforestry System Types Using Multitemporal Sentinel-2 Data and Auxiliary Information. Remote Sensing. 2022;14(16):3847.
- 14. Somers B, Asner GP, Tits L, Coppin P. Endmember variability in Spectral Mixture Analysis: A review. Remote Sens Environ. 2011;115(7):1603–16. Available from: http://dx.doi.org/10.1016/j.rse.2011.03.003
- 15. Xiong J, Thenkabail PS, Gumma MK, Teluguntla P, Poehnelt J, Congalton RG, et al. Automated cropland mapping of continental Africa using Google Earth Engine cloud computing. ISPRS Journal of Photogrammetry and Remote Sensing. 2017;126:225–44.
- 16. Asner GP, Lobell DB. A Biogeophysical Approach for Automated SWIR Unmixing of Soils and Vegetation. Remote Sensing of Environment. 2000;74(1):99–112.
- 17. Song C. Spectral mixture analysis for subpixel vegetation fractions in the urban environment: How to incorporate endmember variability? Remote Sensing of Environment. 2005;95(2):248–63.
- 18. Heinz DC, Chang CI. Fully constrained least squares linear spectral mixture analysis method for material quantification in hyperspectral imagery. IEEE Trans Geosci Remote Sens. 2001;39(3):529–45.
- 19. Roberts DA, Gardner M, Church R, Ustin S, Scheer G, Green RO. Mapping chaparral in the Santa Monica Mountains using multiple endmember spectral mixture models. Remote Sens Environ. 1998;65(3):267–79.
- 20. Correa Pabón RE, Souza Filho CRd, Oliveira WJd. Reflectance and imaging spectroscopy applied to detection of petroleum hydrocarbon pollution in bare soils. Sci Total Environ. 2019;649:1224–36. pmid:30308893
- 21. Degerickx J, Roberts DA, Somers B. Enhancing the performance of Multiple Endmember Spectral Mixture Analysis (MESMA) for urban land cover mapping using airborne lidar data and band selection. Remote Sensing of Environment. 2019;221:260–73.
- 22. Franke J, Roberts DA, Halligan K, Menz G. Hierarchical Multiple Endmember Spectral Mixture Analysis (MESMA) of hyperspectral imagery for urban environments. Remote Sensing of Environment. 2009;113(8):1712–23.
- 23. Quintano C, Fernández-Manso A, Roberts DA. Multiple Endmember Spectral Mixture Analysis (MESMA) to map burn severity levels from Landsat images in Mediterranean countries. Remote Sensing of Environment. 2013;136:76–88.
- 24. Landmann T. Characterizing sub-pixel Landsat ETM fire severity on experimental fires in the Kruger National Park, South Africa. S Afr J Sci. 2003;99(7–8):357–60.
- 25. Li L, Ustin SL, Lay M. Application of multiple endmember spectral mixture analysis (MESMA) to AVIRIS imagery for coastal salt marsh mapping: a case study in China Camp, CA, USA. International Journal of Remote Sensing. 2005;26(23):5193–207.
- 26. Mudereri BT, Abdel-Rahman EM, Dube T, Niassy S, Khan Z, Tonnang HEZ, et al. A two-step approach for detecting Striga in a complex agroecological system using Sentinel-2 data. Sci Total Environ. 2021;762:143151. pmid:33143922
- 27. Njenga GW. Multiple Endmember Spectral Mixture Analysis (MESMA) on multi-temporal VHR images for weed detection in smallholder farms. Thesis [Internet]. 2016;116. Available from: http://www.itc.nl/library/papers_2016/msc/gfm/njenga.pdf
- 28. Uganda Bureau of Statistics. Uganda Bureau of Statistics; 2013 Statistical Abstract. Uganda Bur Stat Stat. 2013;1:64. Available from: http://www.ubos.org/onlinefiles/uploads/ubos/pdfdocuments/abstracts/StatisticalAbstract2013.pdf
- 29. Chemura A, Mutanga O, Dube T. Separability of coffee leaf rust infection levels with machine learning methods at Sentinel-2 MSI spectral resolutions. Precis Agric. 2017;18(5):859–81.
- 30. QGis. QGIS Geographic Information System. Open Source Geospatial Foundation Project. Qgisorg. 2021;2021. http://qgis.osgeo.org
- 31. Breiman L. Random Forests. Machine Learning. 2001;45(1):5–32.
- 32. R Studio Team. A language and environment for statistical computing. R Found Stat Comput. 2021;3. Available from: http://www.r-project.org
- 33. Bey A, Jetimane J, Lisboa SN, Ribeiro N, Sitoe A, Meyfroidt P. Mapping smallholder and large-scale cropland dynamics with a flexible classification system and pixel-based composites in an emerging frontier of Mozambique. Remote Sensing of Environment. 2020;239:111611.
- 34. Wu W, Luo J, Hu X, Yang H, Yang Y. A Thin-Cloud Mask Method for Remote Sensing Images Based on Sparse Dark Pixel Region Detection. Remote Sensing. 2018;10(4):617.
- 35. Gumma MK, Thenkabail PS, Teluguntla PG, Oliphant A, Xiong J, Giri C, et al. Agricultural cropland extent and areas of South Asia derived using Landsat satellite 30-m time-series big-data using random forest machine learning algorithms on the Google Earth Engine cloud. GIScience & Remote Sensing. 2019;57(3):302–22.
- 36. Hunt E, Daughtry CST, Eitel JUH, Long DS. Remote sensing leaf chlorophyll content using a visible band index. Agron J. 2011;103(4):1090–9.
- 37. Sims DA, Gamon JA. Relationships between leaf pigment content and spectral reflectance across a wide range of species, leaf structures and developmental stages. Remote Sensing of Environment. 2002;81(2–3):337–54.
- 38. Rouse RWH, Haas JAW, Schell JA, Deering DW. Monitoring vegetation systems in the Great Plains with ERTS (Earth Resources Technology Satellites). Goddard Sp Flight Cent 3d ERTS-1. 1974;1(74):309–17.
- 39. Haboudane D, Miller JR, Tremblay N, Zarco-Tejada PJ, Dextraze L. Integrated narrow-band vegetation indices for prediction of crop chlorophyll content for application to precision agriculture. Remote Sensing of Environment. 2002;81(2–3):416–26.
- 40. Sibanda M, Mutanga O, Dube T, S Vundla T, L Mafongoya P. Estimating LAI and mapping canopy storage capacity for hydrological applications in wattle infested ecosystems using Sentinel-2 MSI derived red edge bands. GIScience & Remote Sensing. 2018;56(1):68–86.
- 41. Wu C, Niu Z, Tang Q, Huang W. Estimating chlorophyll content from hyperspectral vegetation indices: modeling and validation. Agric For Meteorol. 2008;148(8–9):1230–41.
- 42. Bannari A, Morin D, Bonn F, Huete AR. A review of vegetation indices. Remote Sensing Reviews. 1995;13(1–2):95–120.
- 43. Belgiu M, Csillik O. Sentinel-2 cropland mapping using pixel-based and object-based time-weighted dynamic time warping analysis. Remote Sensing of Environment. 2018;204:509–23.
- 44.
Liaw A, Wiener M. Classification and Regression by randomForest. 2002;2(December):18–22.
- 45.
Kebede G, Mudereri BT, Abdel-rahman EM. Discriminating Robusta coffee (Coffea canephora) cropping systems using leaf-level hyperspectral data. 2024;18(4):1–17.
- 46. ASD. FieldSpec® HandHeld 2 user manual. 1–140. 2010;(September). http://www.geo-informatie.nl/courses/grs60312/material2017/manuals/600860-dHH2Manual.pdf
- 47. Dai J, Roberts DA, Stow DA, An L, Hall SJ, Yabiku ST, et al. Mapping understory invasive plant species with field and remotely sensed data in Chitwan, Nepal. Remote Sensing of Environment. 2020;250:112037.
- 48. Iqbal IM, Balzter H, Firdaus-e-Bareen , Shabbir A. Identifying the Spectral Signatures of Invasive and Native Plant Species in Two Protected Areas of Pakistan through Field Spectroscopy. Remote Sensing. 2021;13(19):4009.
- 49.
Meyer H, Silva B, Regeling B, Thies B, Bendix J. Hyperspectral Data Analysis in R: The hsdar Package. 2019;89(12).
- 50.
Orcid JS. Package ‘ RStoolbox.’ 2025.
- 51. Thenkabail PS, Lyon JG, Huete A. Biophysical and biochemical characterization and plant species studies. Hyperspectral Remote Sensing of Vegetation. 2019.
- 52.
Leutner B, Horning N, Schwalb-willmann J. RStoolbox – Tools for Remote Sensing Data Analysis. 2019.
- 53. Salih A. Spectral mixture analysis (SMA) model for extracting urban fractions from Landsat and Sentinel-2A images in the Al-Ahsa oasis, eastern region of Saudi Arabia. Land. 2023;12(10).
- 54. Shivakumar BR. Performance Analysis of Spectral Angle Mapper and Spectral Information Divergence Performance Analysis of Spectral Angle Mapper and Spectral Information Divergence Classifiers. ResearchGate. 2017;6(August):5685–92.
- 55. Christovam LE, Pessoa GG, Shimabukuro MH, Galo MLBT. Land use and land cover classification using hyperspectral imagery: evaluating the performance of spectral angle mapper, support vector machine and random forest. Int Arch Photogramm Remote Sens Spat Inf Sci - ISPRS Arch. 2019;42(2/W13):1841–7.
- 56. van der Meer F. The effectiveness of spectral similarity measures for the analysis of hyperspectral imagery. International Journal of Applied Earth Observation and Geoinformation. 2006;8(1):3–17.
- 57. Dube T, Pandit S, Shoko C, Ramoelo A, Mazvimavi D, Dalu T. Numerical Assessments of Leaf Area Index in Tropical Savanna Rangelands, South Africa Using Landsat 8 OLI Derived Metrics and In-Situ Measurements. Remote Sensing. 2019;11(7):829.
- 58. Mupfiga U, Mutanga O, Dube T. Assessing drivers of vegetation fire occurrence in Zimbabwe - Insights from Maxent modelling and historical data analysis. Remote Sensing Applications: Society and Environment. 2025;37:101404.
- 59. Tariq A, Yan J, Gagnon AS, Riaz Khan M, Mumtaz F. Mapping of cropland, cropping patterns and crop types by combining optical remote sensing images with decision tree classifier and random forest. Geo-spatial Information Science. 2022;26(3):302–20.
- 60. Engelhart MD, Moughamian H. Book reviews: Book reviews. Educ Psychol Meas. 1968;28(3):951.
- 61. Richard K, Abdel-Rahman EM, Subramanian S, Nyasani JO, Thiel M, Jozani H, et al. Maize Cropping Systems Mapping Using RapidEye Observations in Agro-Ecological Landscapes in Kenya. Sensors (Basel). 2017;17(11):2537. pmid:29099780
- 62. Olofsson P, Foody GM, Herold M, Stehman SV, Woodcock CE, Wulder MA. Good practices for estimating area and assessing accuracy of land change. Remote Sensing of Environment. 2014;148:42–57.
- 63. Immitzer M, Vuolo F, Atzberger C. First Experience with Sentinel-2 Data for Crop and Tree Species Classifications in Central Europe. Remote Sensing. 2016;8(3):166.
- 64. Xie Y, Sha Z, Yu M. Remote sensing imagery in vegetation mapping: a review. Journal of Plant Ecology. 2008;1(1):9–23.
- 65.
Thenkabail PS, Gumma MK, Teluguntla P, Ahmed MI. Hyperspectral remote sensing of vegetation and agricultural crops. Photogrammetric Engineering and Remote Sensing. 2013;Vol. 79. p. 784.
- 66. Kussul N, Lavreniuk M, Skakun S, Shelestov A. Deep Learning Classification of Land Cover and Crop Types Using Remote Sensing Data. IEEE Geosci Remote Sensing Lett. 2017;14(5):778–82.
- 67. Ozdogan M, Yang Y, Allez G, Cervantes C. Remote Sensing of Irrigated Agriculture: Opportunities and Challenges. Remote Sensing. 2010;2(9):2274–304.
- 68. Gomez C, Mangeas M, Petit M, Corbane C, Hamon P, Hamon S, et al. Use of high-resolution satellite imagery in an integrated model to predict the distribution of shade coffee tree hybrid zones. Remote Sensing of Environment. 2010;114(11):2731–44.
- 69. Mudereri BT, Mukanga C, Mupfiga ET, Gwatirisa C, Kimathi E, Chitata T. Analysis of potentially suitable habitat within migration connections of an intra-African migrant-the Blue Swallow (Hirundo atrocaerulea). Ecol Inform. 2020;57(August 2019).
- 70. Mosomtai G, Odindi J, Abdel-Rahman EM, Babin R, Fabrice P, Mutanga O, et al. Landscape fragmentation in coffee agroecological subzones in central Kenya: a multiscale remote sensing approach. J Appl Rem Sens. 2020;14(04).
- 71. Oliphant AJ, Thenkabail PS, Teluguntla P, Xiong J, Gumma MK, Congalton RG, et al. Mapping cropland extent of Southeast and Northeast Asia using multi-year time-series Landsat 30-m data using a random forest classifier on the Google Earth Engine Cloud. International Journal of Applied Earth Observation and Geoinformation. 2019;81:110–24.
- 72. Adam E, Deng H, Odindi J, Abdel-Rahman EM, Mutanga O. Detecting the early stage of phaeosphaeria leaf spot infestations in maize crop using in situ hyperspectral data and guided regularized random forest algorithm. J Spectrosc. 2017;2017.
- 73. Mutanga O, Skidmore AK. Narrow band vegetation indices overcome the saturation problem in biomass estimation. International Journal of Remote Sensing. 2004;25(19):3999–4014.
- 74. Denize J, Hubert-Moy L, Betbeder J, Corgne S, Baudry J, Pottier E. Evaluation of using sentinel-1 and -2 time-series to identify winter land use in agricultural landscapes. Remote Sens. 2019;11(1).
- 75. Kussul N, Lavreniuk M, Skakun S, Shelestov A. Deep Learning Classification of Land Cover and Crop Types Using Remote Sensing Data. IEEE Geosci Remote Sensing Lett. 2017;14(5):778–82.
- 76. Schmitt-Harsh M, Sweeney SP, Evans TP. Classification of Coffee-Forest Landscapes Using Landsat TM Imagery and Spectral Mixture Analysis. photogramm eng remote sensing. 2013;79(5):457–68.
- 77. Bispo RC, Lamparelli RAC, Rocha JV. Using fraction images derived from modis data for coffee crop mapping. Eng Agric. 2014;34(1):102–11.