Figures
Abstract
Understanding global food production and productivity patterns is crucial for policy and investment decisions aimed at addressing poverty, food insecurity, and climate change. This paper develops comprehensive calorific-based production and yield indices for 144 crops, covering 98% of global agricultural land and food output. These indices provide standardized measures across various crops and varieties, facilitating comparison of agricultural productivity and consolidating country and regional contributions to global food production. Utilizing a Box-Cox transformation, we find that a linear model best approximates yield growth. Our findings reveal that, at an aggregate level, there has been no discernable slowdown in global yield growth over the past six decades. This translates into an average annual yield increase equivalent to nearly 33 kilograms of wheat per hectare. These results suggest that any observed deceleration in specific commodities, regions, or countries, has been offset by gains in others. While these findings are reassuring from a global food supply perspective, caution is warranted about the sustainability of production and the affordability of food. These concerns are particularly relevant as global food demand increases due to population and income growth, and as the pressures from climate change intensify. The study underscores the importance of adopting strategic and sustainable agricultural practices to ensure continued food security in the face of evolving global challenges.
Citation: Baffes J, Etienne X (2024) Yield growth patterns of food commodities: Insights and challenges. PLoS ONE 19(11): e0313088. https://doi.org/10.1371/journal.pone.0313088
Editor: Charles Odilichukwu R. Okpala, University of Georgia, UNITED STATES OF AMERICA
Received: February 14, 2024; Accepted: October 17, 2024; Published: November 27, 2024
Copyright: © 2024 Baffes, Etienne. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All data and code can be found here: https://github.com/xletienne/yield-growth No restrictions of data access.
Funding: The author(s) received no specific funding for this work.
Competing interests: The authors have declared that no competing interests exist.
1. Introduction
As the global population approaches 10 billion by the mid-century, agricultural productivity will become increasingly critical in feeding the world [1]. Over the past six decades, productivity improvements have accounted for much of the growth in food production. However, the rate of yield growth for food commodities, a key measure of productivity, has been perceived to have stagnated in recent decades [2–5]. This assessment has led to concerns regarding food availability, especially in low and middle-income countries where population growth rates are the highest, including Sub-Saharan Africa [6]. The slowdown has also been cited as a cause of commodity price spikes and price volatility [7].
Traditionally, yield growth patterns have been analyzed using single-commodity, weight-based data (e.g., kilograms or metric tons per unit of land) at country, regional, and global levels. While the bulk of the empirical studies finds widespread yield growth deceleration or stagnation [1, 4, 8], closer examination reveals considerable heterogeneity across crops and regions. Notably, high-income countries that experienced large yield gains in response to the Green Revolution earlier in the 20th century appeared to have experienced stagnation or even deceleration. Certain low-income countries, especially Sub-Saharan Africa, also faced similar problems due to limited access to high-yielding varieties and production inputs. However, contrary to the stagnation and deceleration narrative, yield growth acceleration has been observed for various commodities and regions [9].
This heterogeneity in yield growth patterns across crops and regions raises two crucial research questions: Firstly, has aggregate global crop yield experienced a slowdown or stagnation? Secondly, how similar or dissimilar are yield growth patterns across commodities, countries, or regions? Although single-commodity models are useful for identifying supply-driven issues such as the effects of weather patterns, climate change, or technological improvements for a specific crop, they do not provide a comprehensive assessment of aggregate yield dynamics across diverse crops and regions. Indeed, analyzing sustainability issues and addressing food security concerns requires a modeling framework that allows us to evaluate the aggregate growth patterns not only across the entire food crop spectrum, but also accounts for changing crop patterns that may be driven by input costs, domestic and trade policies, as well as demand-side considerations, including changes in tastes and preferences.
Against this background, the present paper addresses the two research questions by introducing an innovative calorific-based approach to tackle the limitations of the existing literature. Instead of traditional weight-based measures such as metric tons or kilograms, we convert annual crop production into calorific content and subsequently aggregate the production and yield of all food commodities into single metrics. This aggregation accounts for heterogenous yield patterns due to changes in production composition, the transition from low-yield to high-yield crops and varieties as well as shifts in country and regional significance. Additionally, this approach provides an easy-to-implement, yet standardized and universal framework for analyzing and comparing yield growth at any level, from individual crops to commodity groups to global aggregates.
Using these calorific-based indices, we apply statistical methods to select the most appropriate model for charting the yield paths of commodities at global, regional, and crop-group levels. Production and calorific content data for 144 major crops covering the 1961–2021 period from the Food and Agricultural Organization (FAO) are included in the analysis. These crops combined account for approximately 98 percent of the world’s agricultural land area. The evidence suggests that the aggregate global yield index has not been subjected to growth deceleration over the past six decades. Thus, slow growth in certain commodities, regions, or countries, documented in the literature has been offset by accelerated growth in others.
The remainder of the paper proceeds as follows. The next section summarizes the literature on yield growth. Section 3 discusses the aggregate yield index, the modeling framework for evaluating yield growth, and the data. Section 4 presents the results. The last section concludes and discusses avenues for further research.
2. A brief review of literature
The literature assessing yield growth performance can be broadly delineated into three principal strands. One strand delves into yield growth from experimental data, serving as a tool for plant scientists to discern performance nuances and facilitate the selection of crop varieties [10–12]. The second strand focuses on evaluating the statistical distribution of yields, with the explicit objective of assisting farmers to make informed selection of crop varieties, while also aiding the insurance industry in determining premia for potential crop losses. Pioneering contributions to this strand include an early study by Day [13], subsequently expanded upon by Just and Weninger [14], Atwood, Shaik and Watts [15], Norwood, Roberts and Lusk [16], and Sherrick et al. [17]. The third strand, particularly pertinent in the present context and the subject of this section, examines yield growth through the lens of sustainability and food security considerations.
Under this strand of literature, determining whether yield growth has encountered deceleration or stagnation has been a topic of substantial debate with most studies focuseing on grain and oilseed crops. At the global level, Alston, Beddow and Pardey [2] conducted a comprehensive study documenting a discernible slowdown in the growth of grain yields. Their findings underscored the potentially far-reaching implications for food price trends should this deceleration persist. Cassman [18] contributed to this discourse by highlighting that the rate of yield increase for cereal crops potentially fell significantly short of the anticipated rise in food demand. Examining the trends in yields for the Big-4 commodities (maize, soybeans, wheat, and rice), Ray et al [5] concluded that, while yields continue to ascend in many regions, a noteworthy proportion—ranging from 24% to 39% of the Big-4 growing areas—exhibits a concerning pattern of either negligible improvement, stagnation, or outright collapse. Subsequent research by Ray et al [1] revealed that the growth rates of yields in these commodities fall considerably short of the levels required to meet the projected demand for food commodities by 2050.
At the regional level, Van Ittersum et al [6] asserted that the stagnation of yields in Sub-Saharan Africa presents a formidable food security risk, particularly as the population of the region is anticipated to reach 2.1 billion by 2050 (from 1.4 billion in 2020). Using grid-level data, Iizumi et al. [19] found increased yield instability for the Big-4 commodities across a broad region of the Southern Hemisphere, corroborating previous findings of yield stagnation and collapses in these regions. Lin and Huybers [20] examined wheat yield data across 47 regions, finding that approximately half of the production within their sample continued with a linear growth trend, while the remainder exhibited yield stagnation. Michel and Makowski [21] estimated that wheat yield growth exceeded 0.06 tons per hectare per year in 1961–2010 in various countries across Europe, Asia, Africa, and America, but it experienced stagnation in many other countries. Focusing on 24 African countries from 1960 to 2012, Saito et al. [22] found that 15 countries experienced accelerating yield growth, while the rest saw stagnation or decline.
Numerous studies examined country-level crop yield growth as well. In a focused analysis of China’s yield performance spanning 1980–2010, Li et al [8] observed yield stagnation in 50% of rice-producing areas, 54% of maize-producing areas, and nearly 16% of wheat-producing areas. In the context of analyzing food security in China, Wei et al. [23] determined that rice yields face pronounced stagnation in 53.9% of the regions examined, followed by 42.4% in maize and 41.9% in wheat. Finger [9] found that while the yield growth of maize, barley, and rye remained at a linear trend in Switzerland from 1961 to 2006, other commodities, including wheat, experienced yield growth deceleration. Examining yields in 29 Indian states for the 1967–2017 period, Madhukar, Kumar and Dashora [24] found that 76%, 47%, and 18% of the harvested areas did not show yield improvement in the recent decade for wheat, rice, and maize, respectively.
Notwithstanding the prevailing, somewhat pessimistic, outlook in the studies reviewed above, it is essential to note that not all research aligns with such a perspective. In an influential work, Alexandratos [25] contended that global agricultural production is poised to meet, or potentially exceed, the demands of food requirements. Alexandratos also emphasized that the critical concern lies in the persistent challenges of poverty in low-income countries. Similarly, an editorial in the journal Nature [26] offered a nuanced perspective on productivity growth and food security, noting that depending on research advancements in the sector, it is feasible to meet the 2050 global food demand at an acceptable cost. Ausubel, Wernick and Waggoner [27] echoed similar views, highlighting the substantial increase in crop yields over the past 50 years, alongside a marked decline in caloric requirements relative to GDP.
To summarize, the existing literature provides mixed evidence on yield growth trends, with findings varying based on the specific commodities, countries, regions, and time periods analyzed. While some studies indicate a decline or stagnation for certain crops and regions, others report an acceleration or consistency with a linear growth trend. This raises a critical question: from a global perspective, has aggregate crop yield experienced stagnation or deceleration? Most existing studies have predominately employed weight-based yield to analyze productivity growth for single commodities. However, this approach does not adequately capture the rate of productivity change across all combined commodities, which is crucial for analyzing food security and sustainability challenges. In this study, we propose constructing global yield growth indices based on the calorific content of crops to assess aggregate yield growth patterns at the global, regional, and commodity levels. These patterns are discussed in the following section.
3. Methods and data
Indices have been used in various contexts within commodity markets, such as the aggregation of various commodity prices into a single index and the construction of aggregate agricultural productivity indices. A critical issue in the aggregation process is the selection of appropriate weights. For instance, the weights used in the World Bank’s commodity price indices are based on the export values of emerging markets and developing economies [28]. For agricultural productivity studies, weights are typically based on values derived from FAO international crop prices, measured in Geary-Khamis dollars per ton, also known as “international dollars” [29]. Numerous studies have employed FAO-based aggregation, including Adamopoulos and Restuccia [30] who studied cross-country agricultural productivity based on micro-plot level data; Gollin, Lagakos and Waugh [31] who examined agricultural productivity differences across countries for maize, rice, and wheat; Mekonnen et al. [32] who investigated technical efficiency in developing countries; and Nin-Pratt, Yu and Fan [33] who compared agricultural productivity growth in China and India.
However, using weights based on values derived from commodity prices for specific years can introduce bias for several reasons. Firstly, while there are established international price sequences for key agricultural commodities such as the Big-4, most other commodities, which are less frequently traded, lack benchmark prices. Second, in many low-income countries, geographical isolation and high transportation costs render many agricultural commodities non-tradable [34, 35]. Third, trade distortions often cause domestic prices to diverge from international benchmarks [36]. Additionally, many commodity prices are subject to long-term trends, medium-term cyclicality, and short-term variability, implying that selecting different sub-periods can yield varying results [37]. Therefore, yield aggregation based on value terms, even when adjusted for inflation or purchasing power, can suffer from rapidly changing weights due to price volatility. To address these issues, we employ calorific-based weights in our construction of global and regional production and yield indices.
Calorific-based indices have been widely used on the consumption side in various contexts, such as calculating food requirements for balanced diets and estimating budgetary needs to ensure healthy nutrition [38, 39]. However, their application on the production side has been less frequent, though they have been utilized in a range of contexts. Dating back to 1942, Williamson and Williamson [40] used calorific content of food commodities to discern patterns in food consumption. Roberts and Schlenker [41] extended this approach by converting the production of the Big-4 commodities to identify supply and demand elasticities in the context of the 2007–08 price spike. In a similar vein, Bobenrieth, Wright and Zeng [42] calculated stocks-to-use ratios for major grains and an index of total calories from these grains, serving as indicators of vulnerability to food shortages and price spikes. D’Odorico et al. [43] contributed to this line of inquiry by calculating the water intensity of internationally traded food commodities, employing a calorific aggregation. Additionally, Cassidy et al. [44] leveraged the calorific content of food commodities to assess area requirements for human food consumption.
Our study extends this strand of literature to evaluate aggregate crop yield growth. The remainder of this section details the construction of an aggregate yield index and discusses the key assumptions underlying common yield growth estimation approaches. It also examines the modeling framework used to select the appropriate specification for growth rate estimation, addressing how to manage structural breaks and nonlinearities. Additionally, it provides a brief overview of the data used in the analysis.
3.1 The aggregate yield index
The aggregate calorie-based yield index, yt, for a particular country/region is computed as: (1) where Qit denotes the total output of commodity i at year t in weight unit; wi represents the calorific content of commodity i per weight unit; and Lit is land allocated to commodity i at year t. Thus, yt represents the amount of calories produced per land area (in hectare). In addition to a given country, the index can be constructed for the entire world or any region or aggregate commodity group of interest.
Common calorific units used include cal (small calorie) and Cal (large calorie). A small calorie is defined as “…the amount of heat required to raise the temperature of 1 g of water by 1°C with a temperature change from 14.5 to 15.5°C. The current US Dietary Reference Intakes define 1 cal as 4.186 J [joules]” [45]. One large calorie is equivalent to 1,000 small calories, or 1 Cal = 1,000 cal. Often Cal is denoted as kcal, a notation used in food labeling. For notational convenience, the present paper uses KCal and MCal, which equal 1,000 large calories and 1,000,000 large calories, respectively. In other words, we define 1 KCal = 1,000 Cal = 1,000 kcal, and 1 MCal = 1,000 KCal = 1,000,000 Cal.
The benefits of the calorific-based approach over traditional weight-based methods can be illustrated with an example of two commodities that followed different production and yield paths. Millet experienced a modest increase in global production from 25.7 MMT (million metric tons) in 1961 to 30.1 MMT in 2021. The area allocated to millet during this period declined by almost a third, leading to an annual yield growth of 6.3 kg/ha. During this same period, maize production increased sixfold while the area doubled, giving an annual yield growth of 65.6 kg/ha. To assess the combined yield growth performance for these two crops, production is converted from metric tons to calories (3,400 kcal/kg for millet and 3,560 kcal/kg for maize) and aggregated into a single metric by adding their global calorific production. The annual calorific-based yield growth for the two commodities combined is 219 KCal/ha. This calorific approach allows us to assess the aggregate yield growth of two crops with different growth paths in a way that the traditional weight-based approach cannot.
3.2 Assumptions under traditional methods of estimating yield growth patterns
The growth rate between periods 1 and 2, denoted by ρ, typically reported as percent change, is calculated as ρ = (y1−y0)/y0, where y1 and y0 denote yield in the current and previous periods. The growth rate equation can also be written as y1 = (1+ρ)y0. In a multiperiod case, if yield grows at rate ρ, on average, while in each period it is subjected to a stochastic shock ηt, yield at time t can be rewritten as yt = (1+ρ)yt−1 ηt.
Taking a one-period lag results in , which upon substitution and working backward to period 0 results in . Taking natural logarithms of both sides and setting β0 = log(y0), β1 = log(1+ρ), and , gives: . Such logarithmic transformation is a frequently used regression where the parameters β0 and β1 are estimated with ordinary least squares and the growth rate is calculated as ρ = exp(β1)−1. The growth rate is often reported as the estimate of β1 rather than ρ, since for small growth rates β1 and ρ are approximately equal. Estimating growth rates by using logarithmic transformations rests on the assumption that yt grows at approximately the same rate (in percent terms) throughout the sample period, in order to render the error term white noise (ηt is log-normally distributed with mean 1). This model essentially assumes that yield follows an exponential growth pattern.
While the proportionality assumption may be a reasonable approximation for short periods (e.g., a decade) and relatively small growth rates, it may be unrealistic when long periods are considered, such as a 60-year sample in the current context, due to the change in the base. An alternative and, perhaps, more realistic, assumption could be that yt grows by a constant amount in each period, say μ, in which case the two-period yield growth would be y1 = μ+y0 or, in the general case, yt = μ+yt−1. For the multi-period case, backward substitution gives yt = μt+y0. Letting β0 = y0 and β1 = μ, and appending an additive error term gives yt = β0+β1t+εt. This linear growth specification implies that yield grows at an average rate of β1 units.
The fundamentally different nature of these specifications, i.e., exponential vs. linear growth models, has important implications for assessing whether growth has decelerated or accelerated. To illustrate, consider that maize yield grew at 2.6 and 1.7 percent per annum during 1961–71 and 2011–21, the first and the last decade of the sample. However, maize yields grew at 203 KCal and 324 KCal annually during the same periods. In other words, yield growth for maize has dropped by more than one-third (based on the logarithmic specification that assumes a constant percentage growth) but has increased by more than 50 percent (based on the linear specification that assumes a constant amount of growth).
3.3 Choosing the appropriate specification
The discussion above suggests that yield growth should be modeled under a more general framework. We begin wih the following specification: (2) where β0 is constant, β1 indicates the rate of yield growth, ε is iid (independent and identically distributed) error term, and λ is a transformation parameter, such that y(λ) is either log(y) (for logarithmic specification) or y (for linear specification). In this paper we employ the Box-Cox model to determine which specification best represents the data generation process. This approach selects the transformation of the dependent variable such that the residuals approximate a normal distribution and exhibit reduced heteroskedasticity [46]. Sakia [47] provides a comprehensive review of the Box-Cox transformation technique. Notable advancements to the Box-Cox transformation include Yeo and Johnson [48] transformation, which accommodates both positive and negative observations, and the extension by Atkinson, Riani and Corbellini [49], which allows for transformations on both sides of the equation.
The Box-Cox model relies on estimating the transformation parameter λ such that: (3)
All notations are the same as defined previously. Eq (3) embeds linear (λ = 1) and logarithmic (λ = 0) transformation of the dependent variable. Under the assumption that there exists a λ that makes the error term in the model approximately normal, Box and Cox (1964) derived the likelihood function for a set of observations {y1,y2,…,yt} and suggested using the maximum likelihood estimation (MLE) to determine λ. Further details on the MLE and alternative estimation procedures are discussed in Spitzer [50].
Regardless of the value of λ, Eq (2) assumes that the underlying parameter estimate of the growth rate is constant over time. However, yield paths may exhibit non-linearities, with growth rates gradually accelerating, decelerating, or even taking sharp turns. We consider two ways to account for such non-linearities. First, following earlier studies [e.g., 9], a squared time trend is added to Eq (4): (4) where β1 approximates the growth rate and β2 denotes the rate at which growth decelerates (when negative) or accelerates (when positive). Higher-order polynomials could be included to capture more complex non-linearities. In Eq (4), εt again is assumed to be iid and other notations are defined as previously.
A second way to account for non-linearities is to impose one or more structural breaks and estimate separate growth rates for each subperiod. In the case of one structural break at year τ, a piecewise linear regression model can be utilized to estimate the pre- and post-break growth rates, while ensuring continuity at the break date [51]: (5) where ia the estimated break year, D is a dummy variable taking the value of one for the years after the break and zero otherwise, εt is iid error, and other notations are defined as previously.
Since we have no a priori knowledge of the existence of a structural break, we use the Quandt Likelihood Ratio (QLR) procedure to determine if (and when) the data-generating process has been subjected to breaks [52]. Specifically, a sample Wald test statistic is computed for every possible break date (τ) to determine the stability of parameters before and after the break. The QLR statistic is defined as the supremum Wald test statistic, allowing the break date τ to vary during the sample period. For improved performance, we use a symmetric trimming of 15%, where the first and last 15% of the observations are excluded from the estimation [53]. Additionally, we use both average and exponential Wald tests, where the former is defined as the average of the sample test statistic and the latter is the natural log of the average of the exponential of the sample test [54]. The break date is defined as the date when the maximum test statistic is reached.
3.4 Estimation issues
As with many time series analyses, autocorrelation and the appropriate selection of goodness-of-fit statistics can pose challenges. In the presence of autocorrelation, although the Ordinary Least Squares (OLS) estimator remains unbiased and consistent, statistical inferences may be affected due to unreliable standard errors, thus compromising the efficiency of the estimators [55]. A common diagnostic tool for detecting autocorrelation is the Durbin-Watson [56, 57] test. To address autocorrelation, two primary strategies are employed in the literature. The first involves adjusting the standard errors of the estimated parameters using the Newey-West [58] variance estimator. Alternatively, feasible generalized least squares (FGLS) can be applied to obtain more efficient estimators, with the Prais-Winsten and Cochrane-Orcutt techniques being notable procedures. Further details on these two approaches as well as the Durbin-Watson test are provided in Appendix A in S1 Appendix.
An additional concern is the presence of autocorrelation when utilizing the Box-Cox transformation. As previously discussed, the primary goal of this transformation is to normalize regression residuals and mitigate heteroskedasticity. However, as Savin and White [59] highlight, neglecting autocorrelation in the context of a Box-Cox transformation could result in misleading conclusions due to the potential presence of functional form misspecification. To ensure robustness, we implement the method proposed by Savin and White [59], which was later extended by Seaks and Layson [60]. The full details of these procedures are outlined in Appendix B in S1 Appendix.
Another important consideration is the assessment of model goodness-of-fit. When applying the FGLS method to correct for autocorrelation, the R2 statistic—which measures the proportion of variance in the dependent variable explained by the regressors—may not be directly comparable across models, as the dependent variable is transformed based on the autocorrelation coefficient estimated with different sets of regressors [55]. Similarly, when using the OLS coefficients with Newey-West errors, the traditional R2 maybe biased, since the variance formula for the dependent variable is no longer valid in the presence of a trend. Instead of the R2, we report stationarity statistics to evaluate whether the residuals exhibit stable statistical properties over time—a crucial assumption for ensuring that the error terms are iid. For this purpose, the Phillips-Perron test is employed to evaluate stationarity. Additionally, we report the Alkaike Information Criteria, a model selection metric that estimates the relative amount of information loss by a given model, as described in Banks and Joyner [61] for OLS regressions.
3.5 Data
Data on area and production are collected for 144 food commodities, including grains, oilseeds, fruits, fibers, and other crops, for 1961–2021 from FAOSTAT. These commodities combined account for over 98 percent of the global total agricultural land use during the sample period. The calorific information of most commodities is obtained from the FAO, as reported in the Food Balance Sheets. Appendix C in S1 Appendix details the commodities considered along with their calorific content.
Although many different data sources exist for commodity production and their nutritional values, considerable variation exists in how these data are collected (e.g., dry vs. raw weight). For consistency, we draw all data used in the analysis from FAO. In addition to the crop-level data, we also collect the regional-level data for each crop. Important to note that the FAO Food Balance Sheets provide the calorific content of a specific commodity per 100 grams of edible portion in terms of the retail weight ("as purchased"). As such, it does not consider the calories from the non-edible portion of the crop that can indirectly affect the food supply. For instance, some of the non-edible components, upon further processing, may be used as feed for livestock, which is an important source of calories for human consumption. Meanwhile, the FAOSTAT production data report the total production quantity of the crop, which includes both edible and non-edible components of the crop. In the present study, we assume that the relative ratio of edible and non-edible components for a given crop remains similar over the years. As such, the indirect contribution of the calorific content from non-edible should not severely bias the results.
4. Results
4.1 Production and yield patterns
Fig 1 plots the global area, calorific production, and calorific yield across the 144 crops over the sample period, indexed to 100 in 1961. Global crop production nearly quadrupled over the past six decades on a calorific basis. Most of the increase reflects yield advancements, with cropland expansion experiencing much smaller growth. Fig 2 shows that the aggregate global yield has risen from approximately 4,330 KCal/ha in 1961 to almost 11,000 KCal/ha in 2021. Such growth, amounting to a 158% increase, closely mirrors the 152% increase in the global population over the same period.
Source: Authors’ calculations based on FAO data.
Source: Authors’ calculations based on FAO data. Notes: Big-4 refers to the sum of maize, wheat, rice, and soybeans.
Fig 2 further shows that much of the yield growth was driven by the Big-4 commodities (maize, wheat, rice, and soybeans), whose yields almost tripled from 4,826 KCal/ha in 1961 to 14,323 KCal/ha to 2021. In contrast, the yield of the “other” category (which includes the remaining 140 crops) grew by less than 100% over the sample period, with their 2021 calorific yield only half that of the Big-4.
Table 1 provides a more detailed decomposition of the area, yield, and production growth of the Big-4 versus other crops. We compute the average area, yield, and production for the first (1961–63, Panel A) and last (2019–21, Panel B) three years of the data, along with their respective shares. The middle panel shows their growth rates (calculated as logarithmic change) between the two periods, while the lower two panels present the contribution of area and yield to the total production growth. Fig 3 (left panel) summarizes the decomposition analysis based on Table 1.
Source: Authors’ calculations based on FAO data. Notes: Big-4 refers to the sum of maize, wheat, rice, and soybeans. Other includes all other commodities. ‘AE’ and ‘EMDE’ refer to Advanced Economies and Emerging Market and Development Economies, respectively.
Three key findings emerge from the decomposition analysis. Firstly, more than two-thirds of global food production growth over the past six decades came from yield increases, while the remainder reflects area expansion. Secondly, the Big-4 commodities contributed nearly 60 percent to production growth with the rest of the growth attributable to all other crops. Thirdly, about two-thirds of the 60 percent production growth of the Big-4 is associated with yield increases.
Fig 4 shows a similar decomposition based on income and regional classification. First, it separates the country sample into (i) AEs (Advanced Economies) consisting primarily of countries in North America, Northern Europe, Western Europe, and Southern Europe and (ii) EMDEs (Emerging Markets and Developing Economies). Second, the EMDEs group is decomposed into five regional aggregates: LAC (Latin America and the Caribbean), SEAO (South-East Asia and Oceania), EECA (Eastern Europe and Central Asia), MENA (Middle East and North Africa), SSA (Sub-Saharan Africa). EMDEs have consistently reported lower aggregate yield than AEs. For example, the aggregate yield of EMDEs for 2021 was 60 percent lower than the corresponding yield of AEs. However, EMDEs’ yield growth slightly outpaced that of AEs—the former increased by 167% while the latter increased by 150% during the sample period.
Source: Authors’ calculations based on FAO data. Notes: ‘AE’ and ‘EMDE’ refer to Advanced Economies and Emerging Market and Development Economies, respectively.
Table 2 presents a detailed decomposition analysis of yield, area, and production for regional aggregates, which is further summarized in Fig 3 (right panel). Several results emerge from the regional decomposition analysis. First, during the past six decades, EMDEs contributed more than 70 percent to global production growth. Second, SEAO contributed nearly 40 percent of production growth, likely due to the Green Revolution (mostly in rice and less so wheat) which significantly enhanced productivity. Additionally, yield growth in EMDEs contributed to over 50% of global calorific production growth. In contrast, AEs contributed approximately a quarter of global calorific production growth through yield advancements. It is important to note that the latter witnessed substantial yield growth earlier in the 20th century, particularly following the introduction of hybrid crop varieties in the United States during the early 1930s [62].
4.2 Yield growth patterns
Our empirical analysis unfolds in the following steps. First, we test for structural breaks in the aggregate calorific yield indexes, applying both linear and exponential growth models. We then employ the Box-Cox transformation to identify which specification—linear or logarithmic yield indexes—best captures the yield growth patterns, considering models with and without a structural break or quadratic term. Based on the preferred transformation parameter we estimate the aggregate, commodity- and region-specific yield trajectories using Eqs (2), (4), and (5).
Results of the structural break analysis for the aggregate yield index are reported in Table 3. Under the linear transformation, the aggregate yield index appears to be subjected to a structural break in 1993, as suggested by the supremum, average, and exponential Wald tests, all of which are significant at the 1% level. For the logarithmic transformation, although the supremum Wald test suggests the presence of a structural break in 1978, the other two tests fail to reject the null hypothesis of no parameter shifts.
The results also confirm that the optimal transformation parameter is close to one for linear model, with quadratic trend and break year of 1993. Indeed, statistics reported in Table 4 (last two columns) favor rejection of the null hypothesis that the optimal transformation parameter is equal to zero (logarithmic transformation) but fails to reject the null hypothesis that the parameter equals one (linear transformation). In Appendix B in S1 Appendix, we further consider the Box-Cox method in the presence of autocorrelation, finding that again a linear transformation is preferred over a logarithmic transformation for almost all models considered. Overall, Tables 3 and 4 suggest that a linear specification best captures global aggregate yield growth, which is consistent with some previous single-commodity studies [4, 63].
Based on the findings reported in Tables 3 and 4, we conduct regression analyses using the linear specification for the global aggregate as well as income and regional aggregates, with the results summarized in Tables 5–7. In each table, panel A reports parameter estimates of the base model, panel B shows the corresponding estimates which include a time-squared term, and panel C presents the results allowing for a structural break using piecewise linear regression. Before discussing the results, several key points should be noted. First, we test for residual autocorrelation in each regression using the Durbin-Watson test. Results in Table A1 in Appendix A in S1 Appendix suggest the presence of residual autocorrelation in almost all models. To address this, we use the Newey-West standard errors instead of the conventional standard errors in Tables 5–7.
Second, we re-estimated the models using the Prais-Winsten FGLS estimator, which directly accounts for autocorrelation, rather than simply adjusting the standard errors as in the Newey-West approach. The results, reported in Appendix A (Tables A2-A4 in S1 Appendix), indicate that most of the statistical significance remains consistent and the estimated coefficients are closely aligned with those in Tables 5–7. Third, as the Phillips-Perron stationarity statistics confirm, most of the residuals do not contain a unit root, suggesting that the statistical properties of the residuals generally do not change over time.
Lastly, the results from AIC vary depending on the yield indices analyzed. In Table 5, the base model with a linear trend exhibits the lowest AIC for the aggregate, Big-4, and AE yield indices. In contrast, for Other and EMDE yield indices, the models with a structural break and a quadratic trend, respectively, give lower AIC values. In Tables 6 and 7, the quadratic trend model is preferred for cereals & grains and other crop group, the linear trend model for MENA and SEAO, and the structural break model for all other commodity and regional groupings.
The first column of Table 5 focuses on the aggregate global yield index. As can be seen, the parameter estimate for the post-1993 term is non-significant. Similarly, the parameter estimate for the quadratic trend was positive but non-significant. Combined, these findings suggest that aggregate global calorific yield has exhibited steady growth at approximately 109 KCal/ha per year with no evidence of deceleration or acceleration.
At a more granular level, the second and third columns of Table 5 show that the yield growth of the Big-4 outpaced other crops by a considerable margin, with the gap between the two groups widening over time, consistent with Fig 2. The average yield of Big-4 commodities grew at 158 KCal per year during the sample period, with no evidence of yield growth acceleration or deceleration. By contrast, the yield of ‘Other’ grew only at 59 KCal per year over the period of analysis. While the parameter estimate of the quadratic term is non-significant, the model with structural break suggests that the yield of the “other” crop aggregates accelerated from 55 KCal before 1993 to 64 KCal per year after 1993.
The last two columns in Table 5 show the estimation results for AE and EMDE country groupings. On average, the aggregate yield in the AEs grew by 76 KCal/ha per year, while in EMDEs it grew by 95 and 109 KCal/ha before and after 1993, respectively. The acceleration in EMDEs was likely driven by productivity growth in Latin America (following the uptake of soybean production), efficiency gains in Eastern Europe and Central Asia (following the collapse of centrally planned economies), and Asia (following the adoption of the green revolution). In contrast, Sub-Saharan Africa did not contribute much to the acceleration due to its low crop intensity, inadequate irrigation systems, and limited use of commercial inputs during the sample period [6].
Further disaggregation at commodity and regional levels is reported in Tables 6 and 7. Specifically, Table 6 reports yield growth estimates for four commodity groups: cereals and grains, oil crops, fruits and vegetables, and other crops. Non-linearities are present in all four groups, as demonstrated by the significant quadratic and post-1993 coefficient estimates. Consistent with the studies on individual commodities discussed earlier, regression analyses by commodity groups gave a mixed picture: yield growth accelerated in cereals, oil crops, and fruits/vegetables, and decelerated in the remainder crop aggregates (the “others” category in the table). Cereals and grains present the highest yield growth during the sample period, as illustrated by the large magnitude of the coefficient associated with the linear trend. However, the magnitude of the quadratic term, as well as the estimated parameter after the structural break in 1993, are considerably larger for oil crops, suggesting that the yield growth for these crops may eventually catch up with cereals and grains.
Table 7 reports parameter estimates for regional aggregates. Results show that yield growth acceleration, although with a somewhat small magnitude, is present in EMDEs, mainly reflecting the growth in LAC and EECA. For SEAO, MENA, and SSA, neither the quadratic term nor the post-structural break parameter is statistically significant, suggesting linear yield growth in these regions. However, yield growth in SEAO has been the fastest across all EMDEs (although without acceleration) during the sample period.
5. Discussion and conclusion
Understanding yield growth patterns is crucial for the discussion of food security and sustainable development. However, the frameworks employed by existing studies are inadequate for assessing yield growth aggregated across all food commodities and countries. In this study, we introduce an innovative calorific-based approach to examine aggregate global crop yield growth. Our analyses indicate that, contrary to the widely accepted view, the global aggregate crop yield has grown at a consistent rate of 109 KCal/ha per year from 1961 to 2021 (equivalent to adding 32.6 kilograms of wheat every year in each hectare of land), with no evidence of deceleration or acceleration. This suggests that yield stagnation or growth deceleration for specific commodities, regions, or countriesdiscussed in the literature has been offset by acceleration elsewhere, resulting in a broadly stable aggregate yield growth pattern.
This calorific-based approach further allows us to examine the yield growth at various commodity groups or regional levels in a consistent manner. We show that, while the yield of the Big-4 commodities exhibited no acceleration or deceleration, the aggregates of the remaining crops showed a slight acceleration after 1993. Similarly, while yield growth in AEs has been fairly constant, EMDEs experienced some acceleration, driven primarily by three regions (LAC, EECA, and SSA). Combined, these findings highlight the importance of analyzing aggregate trends based on the single metrics developed in this paper alongside the performance of specific commodities, countries, or regions reported in the literature.
Yield growth during the past six decades has supported adequate food supplies at a global level. However, several challenges must be addressed to ensure that future yield growth patterns meet global food requirements, which are expected to grow by one-third over the next three decades due to population and income growth as well as changing consumption patterns [64, 65].
The first challenge is the increasing frequency and intensity of adverse weather patterns, exacerbated by the ongoing climate change, which is expected to alter the regional composition of commodity production and increase yield volatility [66]. Public investment in developing new cultivars resilient to temperature and precipitation variations will be crucial for maintaining historical yield growth trajectories. The significance of such cultivars will only grow, as most global food supplies rely heavily on a few crops, with the Big-4 accounting for nearly half of the global calorie supplies [67]. Additionally, climate change is likely to increase yield volatility. Indeed, yield growth in advanced economies has been more volatile compared to emerging markets and developing economies, primarily due to weather-induced factors such as the La Niña episodes of 2010–11 and 2011–12 [68]. Addressing these challenges will require substantial investment, research and development, and diffusion efforts, similar to those that led to the development and use of hybrid maize varieties in the United States and the grain varieties promoted by the Consultative Group on International Agricultural Research, which spearheaded the Green Revolution in East Asia [10, 69, 70].
Another challenge is removing distorting trade policies, which restrict food availability in regions experiencing food deficits [71–73]. Despite agricultural policy reforms undertaken in recent decades, global agricultural trade is still subjected to trade barriers and domestic policies [74]. Policies diverting food commodities to biofuels should balance environmental and energy security concerns and food availability [75]. Likewise, policies to reduce greenhouse gas emissions should not compromise productivity. This is especially important since agriculture accounts for a quarter of global emissions [76].
Finally, even if adequate food supplies can be ensured globally, access to food remains a concern, especially in low-income countries where food insecurity is associated with conflict and extreme weather events [77]. The number of people subjected to acute food insecurity worldwide has more than doubled during the past five years, exceeding an estimated 280 million in 2023 [78]. Achieving equitable access to food across income groups will entail financial assistance, food delivery through aid, as well as investments and policies aimed at increasing productivity but also reducing food waste at the production level [26].
The calorific-based indices along with the estimation procedure presented in this paper not only deepen our comprehension at both aggregate and regional levels but also pave the way for multifaceted avenues of future research. For instance, by incorporating additional factors like water usage or chemical input use, the aggregation of production and yields could facilitate an assessment of environmental strain. Moreover, exploring alternative aggregation methods could elucidate the susceptibility of various commodities and regions to changes in weather patterns and intensification of climatic phenomena like El Niño. Lastly, our proposed index methodology holds promise in scrutinizing patterns of production or yield volatility, thereby offering valuable insights into the resilience of food production systems.
Acknowledgments
The authors extend their gratitude to the reviewers and editor for their valuable comments and suggestions on an earlier version of the manuscript, which greatly enhanced the quality of this paper.
References
- 1. Ray D, Mueller N, West P, Foley J. Yield trends are insufficient to double global crop production by 2050. PloS One. 2013;8(6):e66428. pmid:23840465
- 2. Alston JM, Beddow JM, Pardey PG. Agricultural research, productivity, and food prices in the long run. Science. 2009;325(5945):1209–10.
- 3.
FAO. Global agriculture towards 2050. 2009.
- 4. Grassini P, Eskridge K, Cassman K. Distinguishing between yield advances and yield plateaus in historical crop production trends. Nature Communications. 2013;4:2918. pmid:24346131
- 5. Ray D, Ramankutty N, Mueller N, West P, Foley J. Recent patterns of crop yield growth and stagnation. Nature Communications. 2012;3:1293. pmid:23250423
- 6. Van Ittersum MK, Van Bussel LG, Wolf J, Grassini P, Van Wart J, Guilpart N, et al. Can sub-saharan africa feed itself? Proceedings of the National Academy of Sciences. 2016;113(52):14964–9.
- 7. Helbling T, Roache S. Rising prices on the menu: Higher food prices may be here to stay. Finance & Development. 2011;48(001):A009.
- 8. Li X, Liu N, You L, Ke X, Liu H, Huang M, et al. Patterns of cereal yield growth across china from 1980 to 2010 and their implications for food production and food security. PLoS One. 2016;11(7):e0159061. pmid:27404110
- 9. Finger R. Evidence of slowing yield growth–the example of swiss cereal yields. Food Policy. 2010;35(2):175–82.
- 10. Crow JF. 90 years ago: The beginning of hybrid maize. Genetics. 1998;148(3):923–8. pmid:9539413
- 11. Kiær LP, Skovgaard IM, Østergård H. Grain yield increase in cereal variety mixtures: A meta-analysis of field trials. Field Crops Research. 2009;114(3):361–73.
- 12. Reiss ER, Drinkwater LE. Cultivar mixtures: A meta‐analysis of the effect of intraspecific diversity on crop yield. Ecological applications. 2018;28(1):62–77. pmid:28940830
- 13. Day RH. Probability distributions of field crop yields. American journal of agricultural economics. 1965;47(3):713–41.
- 14. Just RE, Weninger Q. Are crop yields normally distributed? American Journal of Agricultural Economics. 1999;81(2):287–304.
- 15. Atwood J, Shaik S, Watts M. Are crop yields normally distributed? A reexamination. American Journal of Agricultural Economics. 2003;85(4):888–901.
- 16. Norwood B, Roberts MC, Lusk JL. Ranking crop yield models using out‐of‐sample likelihood functions. American Journal of Agricultural Economics. 2004;86(4):1032–43.
- 17. Sherrick BJ, Zanini FC, Schnitkey GD, Irwin SH. Crop insurance valuation under alternative yield distributions. American Journal of Agricultural Economics. 2004;86(2):406–19.
- 18. Cassman K. Ecological intensification of cereal production systems: Yield potential, soil quality, and precision agriculture. Proceedings of the National Academy of Sciences. 1999;96(11):5952–9. pmid:10339523
- 19. Iizumi T, Yokozawa M, Sakurai G, Travasso M, Romanenkov V, Oettli P, et al. Historical changes in global yields: Major cereal and legume crops from 1982 to 2006. Global Ecology and Biogeography. 2014;23(3):346–57.
- 20. Lin M, Huybers P. Reckoning wheat yield trends. Environmental Research Letters. 2012;7(2):024016.
- 21. Michel L, Makowski D. Comparison of statistical models for analyzing wheat yield time series. PLoS One. 2013;8(10):e78615. pmid:24205280
- 22. Saito K, Dieng I, Toure AA, Somado EA, Wopereis MC. Rice yield growth analysis for 24 african countries over 1960–2012. Global Food Security. 2015;5:62–9.
- 23. Wei X, Zhang Z, Shi P, Wang P, Chen Y, Song X, et al. Is yield increase sufficient to achieve food security in china? PloS One. 2015;10(2):e0116430. pmid:25680193
- 24. Madhukar A, Kumar V, Dashora K. Spatial and temporal trends in the yields of three major crops: Wheat, rice and maize in india. International Journal of Plant Production. 2020;14:187–207.
- 25. Alexandratos N. World food and agriculture: Outlook for the medium and longer term. Proceedings of the National Academy of Sciences. 1999;96(11):5908–14. pmid:10339517
- 26. Editorial Nature. How to feed a hungray world? Nature. 2020;466(7306):531–2.
- 27. Ausubel JH, Wernick IK, Waggoner PE. Peak farmland and the prospect for land sparing. Population and Development Review. 2013;38:221–42.
- 28.
World Bank. Commodity markets. https://www.worldbank.org/en/research/commodity-markets; 2024.
- 29. Prasada Rao D. Intercountry comparisons of agricultural output and productivity. 1993.
- 30. Adamopoulos T, Restuccia D. Geography and agricultural productivity: Cross-country evidence from micro plot-level data. The Review of Economic Studies. 2022;89(4):1629–53.
- 31. Gollin D, Lagakos D, Waugh ME. Agricultural productivity differences across countries. American Economic Review. 2014;104(5):165–70.
- 32. Mekonnen DK, Spielman DJ, Fonsah EG, Dorfman JH. Innovation systems and technical efficiency in developing‐country agriculture. Agricultural economics. 2015;46(5):689–702.
- 33. Nin-Pratt A, Yu B, Fan S. Comparisons of agricultural productivity growth in china and india. Journal of Productivity Analysis. 2010;33:209–23.
- 34. Delgado CL. Agricultural diversification and export promotion in sub-saharan africa. Food policy. 1995;20(3):225–43.
- 35. Delgado C, Minot N, Tiongco M. Evidence and implications of non-tradability of food staples in tanzania 1983–98. The Journal of Development Studies. 2005;41(3):376–93.
- 36. Baffes J, Gardner B. The transmission of world commodity prices to domestic markets under policy reforms in developing countries. Policy Reform. 2003;6(3):159–80.
- 37. Baffes J, Etienne XL. Analysing food price trends in the context of engel’s law and the prebisch-singer hypothesis. Oxford Economic Papers. 2016;68(3):688–713.
- 38. Bekaert G. Caloric consumption in industrializing belgium. The Journal of Economic History. 1991;51(3):633–55.
- 39. Sibhatu KT, Qaim M. Rural food security, subsistence agriculture, and seasonality. PloS one. 2017;12(10):e0186406. pmid:29049329
- 40. Williamson L, Williamson P. What we eat. Journal of Farm Economics. 1942;24(3):698–703.
- 41. Roberts M, Schlenker W. World supply and demand of food commodity calories. American Journal of Agricultural Economics. 2009;91(5):1235–42.
- 42. Bobenrieth E, Wright B, Zeng D. Stocks‐to‐use ratios and prices as indicators of vulnerability to spikes in global cereal markets. Agricultural Economics. 2013;44(s1):43–52.
- 43. D’Odorico P, Carr J, Laio F, Ridolfi L, Vandoni S. Feeding humanity through global food trade. Earth’s Future. 2014;2(9):458–69.
- 44. Cassidy E, West P, Gerber J, Foley J. Redefining agricultural yields: From tonnes to people nourished per hectare. Environmental Research Letters. 2013;8(3):034015.
- 45. Hargrove JL. Does the history of food energy units suggest a solution to" calorie confusion"? Nutrition Journal. 2007;6:44. pmid:18086303
- 46. Box G, Cox D. An analysis of transformations. Journal of the Royal Statistical Society: Series B (Methodological). 1964;26(2):211–43.
- 47. Sakia RM. The box-cox transformation technique: A review. Journal of the Royal Statistical Society Series D: The Statistician. 1992;41(2):169–78.
- 48. Yeo IK, Johnson RA. A new family of power transformations to improve normality or symmetry. Biometrika. 2000;87(4):954–9.
- 49. Atkinson AC, Riani M, Corbellini A. The box–cox transformation: Review and extensions. Statistical Science. 2021;36(2):239–55.
- 50. Spitzer JJ. A primer on box-cox estimation. The Review of Economics and Statistics. 1982:307–13.
- 51. Boyce JK. Kinked exponential models for growth rate estimation. Oxford Bulletin of Economics and Statistics. 1986;48(4):385–91.
- 52. Quandt R. Tests of the hypothesis that a linear regression system obeys two separate regimes. Journal of the American Statistical Association. 1960;55(290):324–30.
- 53. Andrews D. Tests for parameter instability and structural change with unknown change point. Econometrica. 1993:821–56.
- 54. Andrews D, Ploberger W. Optimal tests when a nuisance parameter is present only under the alternative. Econometrica. 1994:1383–414.
- 55.
Wooldridge J. Introductory econometrics: A modern approach 6ed. Mason, OH: South‐Western; 2016.
- 56. Durbin J, Watson G. Testing for serial correlation in least squares regression: I. Biometrika. 1950;37(3/4):409–28.
- 57. Durbin J, Watson G. Testing for serial correlation in least squares regression. Ii. Biometrika. 1951;38(1/2):159–77.
- 58. Newey WK, West KD. A simple, positive semi-definite, heteroskedasticity and autocorrelationconsistent covariance matrix. 1986.
- 59. Savin NE, White KJ. Estimation and testing for functional form and autocorrelation: A simultaneous approach. Journal of Econometrics. 1978;8(1):1–12.
- 60. Seaks TG, Layson SK. Box-cox estimation with standard econometric problems. The Review of Economics and Statistics. 1983:160–4.
- 61. Banks HT, Joyner ML. Aic under the framework of least squares estimation. Applied Mathematics Letters. 2017;74:33–45.
- 62. Griliches Z. Hybrid corn: An exploration in the economics of technological change. Econometrica. 1957;25(4):501–22.
- 63. Hafner S. Trends in maize, rice, and wheat yields for 188 nations over the past 40 years: A prevalence of linear growth. Agriculture, Ecosystems, and Environment. 2003;97(1):275–83.
- 64. Alexandratos N, Bruinsma J. World agriculture towards 2030/2050: The 2012 revision. 2012.
- 65. Nations United. World population prospects 2019. 2019.
- 66. Wing I, De Cian E, Mistry M. Global vulnerability of crop yields to climate change. Journal of Environmental Economics and Management. 2021:102462.
- 67. Zabel F, Delzeit R, Schneider J, Seppelt R, Mauser W, Václavík T. Global impacts of future cropland expansion and intensification on agricultural markets and biodiversity. Nature Communications. 2019;10(1):1–10.
- 68. Rippey B. The us drought of 2012. Weather and Climate Extremes. 2015;10:57–64.
- 69. Evenson RE, Gollin D. Assessing the impact of the green revolution, 1960 to 2000. Science. 2003;300(5620):758–62. pmid:12730592
- 70. Renkow M, Byerlee D. The impacts of cgiar research: A review of recent evidence. Food policy. 2010;35(5):391–402.
- 71.
Baffes J, Lewin B, Varangis P. Global agricultural trade and developing countries. The World Bank. 2005.
- 72. Rutten M, Shutes L, Meijerink G. Sit down at the ball game: How trade barriers make the world less food secure. Food Policy. 2013;38:1–10.
- 73. Smith VH, Glauber JW. Trade, policy, and food security. Agricultural Economics. 2020;51(1):159–71.
- 74.
OECD. Agricultural policy monitoring and evaluation 2021: Addressing the challenges facing food systems. Paris: OECD Publishing; 2021.
- 75. Graham-Rowe D. Agriculture: Beyond food versus fuel. Nature. 2011;474(7352):S6–S8. pmid:21697842
- 76. Laborde D, Mamun A, Martin W, Piñeiro V, Vos R. Agricultural subsidies and global greenhouse gas emissions. Nature Communications. 2021;12(1):2601. pmid:33972533
- 77.
FAO. Sofi 2021: Transforming food systems for food security, improved nutrition and affordable healthy diets for all. Rome; 2021.
- 78.
Food Security Information Network. Global report on food crises. 2024.