Figures
Abstract
Technological knowledge evolves not only through the generation of new ideas, but also through the reinterpretation of existing ones. Reinterpretations lead to changes in the classification of knowledge, that is, reclassification. This study investigates the relationship between reclassification and the rate of technological change both empirically and analytically. Drawing on patent data as a proxy for technological knowledge, I discuss two empirical patterns: (i) more recent patents are more likely to get reclassified and (ii) larger technological classes acquire proportionally more reclassified patents. These patterns are then used to formulate an analytical model of knowledge accumulation, which can be solved exactly to describe the relationship between reclassification and growth rates in detail and to make further predictions about related knowledge quantities. These predictions are supported across all major technology domains, implying a strong link between reclassification and the pace of technological advancement. In sum, the model allows for a better understanding of both the magnitude and shape of technological growth patterns. More generally, it connects various seemingly unrelated knowledge quantities, providing a basis for knowledge-intrinsic explanations of growth patterns.
Citation: Persoon P (2026) Can knowledge reclassification accelerate technological innovation? PLoS One 21(5): e0348804. https://doi.org/10.1371/journal.pone.0348804
Editor: Joshua L. Rosenbloom, Iowa State University, UNITED STATES OF AMERICA
Received: June 11, 2025; Accepted: April 21, 2026; Published: May 18, 2026
Copyright: © 2026 Peter Persoon. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: The raw data underlying the results presented in the study are available through purchase at the European Patent Office, https://www.epo.org/en/searching-for-patents/business/patstat. The patent office will need to be contacted for the older versions of Patstat. A processed dataset is available at https://doi.org/10.17026/SS/MEV4KE.
Funding: The author(s) received no specific funding for this work.
Competing interests: The authors have declared that no competing interests exist.
1. Introduction
Technology and science advance as new ideas are added to a cumulative structure of earlier ideas. Equally important, however, are the developments that restructure or reorganize earlier ideas, which occur for example when technologies find new applications or when scientific results are reinterpreted. Think about how solar panels, once used primarily in satellites, are now installed on rooftops around the world. Or think about how Einstein’s work on the photoelectric effect partly revived Newton’s corpuscular theory of light. While these changes may not directly introduce new words or inventions, they potentially alter the categorization or classification of existing knowledge, i.e., they reclassify existing knowledge. Where the societal relevance of generating new ideas is undisputed, the direct or indirect relevance of reclassification is less clear. This is mainly because it is poorly understood how substantial the effect is and what the underlying dynamics are.
However, studying the dynamics of reclassification is not trivial as it mixes two processes for a given class of knowledge: first, the inclusion of new ideas in a cumulative process of creation; and second, the inclusion/exclusion of older ideas as a result of ongoing reclassification; see Fig 1. It is likely that the two types strongly interact, but how exactly largely remains an open problem. Earlier contributions have focused on either the process of new idea generation [1–4], or the process of classification and reclassification [5–8]. In this contribution, I aim to integrate these dynamics in a single analytical model and subsequently derive how reclassification and growth relate to one another. Subsequently, I check the model predictions in an empirical analysis where patent classification data are used as a proxy for technological knowledge. A detailed analysis of the dynamics of reclassification advances our understanding of knowledge creation on various fronts.
The time development of a class of knowledge mixes two effects: the introduction of new inventions each year (triangles) and the inclusion/exclusion of older inventions as a result of reclassification (squares/crosses).
First, it provides a clue as to where the large variation in growth and improvement rates across different technologies comes from [9,10]. Given the close relationship between technological development, economic growth, and societal change, this continues to be one of the central issues in the innovation literature [11,12]. Some have pointed to conceptual links between technologies as predictors of innovation rates [13,14], while others have emphasized structural features such as technological complexity [9,15]. Complementing this literature, this contribution finds that technological classes that are often reclassified tend to grow faster. Although further research is needed to explore causality and the exact mechanism underlying this pattern, the results suggest there is a connection between a sensitivity to redefinition – possibly rooted in a wider scope or applicability of a technology – and the rate at which technological ideas multiply. This provides a relevant perspective on the growth rates of General Purpose Technologies, which by definition adhere to a wide range of applications [16,17], and more generally sheds new light on the debate about the cumulativeness of technological knowledge [18–20], i.e., the extent to which technologies build on earlier results.
Second, the findings in this research have implications to the sociology and philosophy of science and technology. In theories of ‘paradigm shifts’, reinterpretations are an indispensable part of scientific [21] and technological [22] revolutions. This contribution offers a quantitative, empirical perspective on reinterpretation and more generally reclassification, confirming these processes play a key role in technological developments. However, where in theories of paradigm change, reinterpretation is often presented as a short, revolutionizing event, in this contribution it is instead implemented as a routine, ongoing process.
Third, explicitly incorporating reclassification into models of technological growth provides a fresh explanation for the commonly observed “drop” in recent patent counts—often referred to as a truncation effect [23,24]. This phenomenon is not fully explained by administrative factors, such as the 18-month lag between patent filing and publication in the U.S. Especially when it is focused on a particular technology such as green technology, there is a risk in interpreting this drop as a real decline in patenting rates [25–28], even though several years later the drop seems to have disappeared [29]. This research shows that such declines can be a natural consequence of reclassification patterns and predicts the timing of these drops with high accuracy.
The paper proceeds in three parts. The first part is phenomenological, systematically observing patent reclassifications and discussing two empirical patterns. The second part is analytical, where the empirical patterns are used to develop a model relating reclassification to growth. The third part tests the model’s predictions against a range of empirical outcomes.
2. Phenomenology of reclassification
Patent classifications are expert-based categorizations of inventions and therefore provide an ideal empirical definition of individual technologies. There are also limitations to using patent data: not all technologies are patented and many patented inventions eventually find no practical application. In addition, patent classifications are primarily designed and regularly updated to assist patent examiners in their search for prior art. Despite these limitations, it is difficult to think of an alternative source of codified technological knowledge that could match the scope and detail of patent data. Patent classification systems are continuously updated by adding (or deleting) classes and classifications of patents in a process called ‘reclassification’, thereby accommodating changing societal perspectives and technological developments that do not fit the existing scheme [5,7]. Take for example US patent # 4671271 from 1987, which describes a type of protective facial mask: where it was initially only classified as a protective garment (CPC group A41D13), in a classification version of 2016, it is also classified as device for influencing the respiratory system (CPC group A61M16). This is therefore an example of a patent obtaining extra classifications, but also, an example where a group in the classification system obtains an extra patent (even though it is an old patent). There may be several reasons why this happened: perhaps a patent examiner familiar with the group came across the patent by chance, perhaps the definition of the group was modified such that it now includes this patent, or perhaps the classification algorithm was improved. Despite regular publications of patent offices about changes to the classification system, there is generally no clear-cut way to find the reason for individual cases of reclassification. However, by keeping old versions of the patent classification system and comparing it to newer systems, one can systematically study the general patterns of reclassification.
Previous studies have pointed out several regularities in the patent (re)classifications. Research on US patent classifications showed that, since the start of the 20th century, most patents have a combination of more than one classification, that the number of distinct classes increases as a power law with the number of patents, and that the number of used combinations increases proportionally with the number of patents [6,7]. Given these long-term dynamics, let me focus instead on the changes happening within a single reclassification moment. To count each invention only once, the unit of analysis in this research is a patent family. This is a collection of patent documents that all correspond to the same invention (but are filed, for example, at different patent offices). To be precise, I will work with docdb families as administered by the European Patent Office, in which all documents have the same priority patent application, i.e., the earliest application for a patent with this invention). I will consider patent families filed between 1970 and 2019, each including at least one US application and one filed in another jurisdiction. This comprises more than 5 million families, in the following also referred to simply as ‘patents’. Using data from 4 Patstat editions (2013, 2016, 2019, and 2023), there are three distinct reclassification moments: 2013/2016, 2016/2019 and 2019/2023. Reclassification happens yearly, so I aggregate the changes over approximately three years for each reclassification moment. I will use the Cooperative Patent Classification (CPC) on three levels: the section level (1 digit), subclass level (four digits), and main group level (four digits plus main group number) [30].
For a given reclassification moment and any technological class, I consider both the number of families added to that class (’positive reclassifications’) and those deleted (’negative reclassifications’). The positive and negative reclassifications from each CPC subclass are plotted for the size of those subclasses (in number of families) in Fig 2. This plot focuses on the reclassification moment 2019/2023. It is almost the same for the other reclassification moments, see also the appendix in [29] for a similar plot on CPC class level. The plot only considers families that were already there in 2019, thus making sure that any patent added is necessarily a reclassification and not a new patent. This plot shows that both the positive and negative reclassifications of a class are proportional to the size of that class. For the negative reclassifications, this is intuitive: if each classification is equally likely to be deleted, the larger classes lose proportionally more families. For the positive reclassifications, however, this is more puzzling: why would larger classes attract more reclassifications? One possible explanation is that the scope of a technological class somewhat widens with each new patent, and with a wider scope, more families can be classified into that class. The pattern in Fig 2 suggests that the positive reclassifications are generally more substantial than the negative reclassifications. This is confirmed by an additional analysis in the Supplementary Information (SI), where the net reclassifications are considered on the subclass level and where the role of negative reclassification is investigated in more detail.
This figure shows the number of families added and deleted for all subclasses for the reclassification moment 2019/2023. Note that the axes are log-transformed, hence the fits with slope 1.0 suggest the number of reclassifications is proportional to subclass size. The offset (the y-coordinate of the fit intersection with the y-axis) that maximizes R2 for the positive reclassifications is −1.31. Plotting the fit without transforming the axes thus results in a linear relation with a slope of 10−1.31 ≈ 0.05.
Next, I obtain the net reclassifications for each filing year by subtracting the total negative reclassifications from the total positive reclassifications of families with that filing year. In Fig 3 I plot the net reclassifications over the total number of classifications for each filing year, each panel corresponding to a different reclassification moment. The plot shows more reclassifications per classification for patents with filing years close to the reclassification moment, in other words, recent patents are more likely to get reclassified. The pattern is similar across the three reclassification moments, except that for the 2013/2016 moment, where the net reclassifications are negative for some years. The reason for this is unclear but might relate to the fact that the CPC was still young in those years and parts were still being reorganized. A log-log plot of the positive and negative reclassifications, see S2 Appendix, indicates an inversely proportional relation between the number of reclassifications per classification r and the time between cohort filing year c and time of reclassification t, i.e., . Fig 3 includes fits of the data based on this relation, introducing the ‘net reclassification rate’
. Because each reclassification moment in the data aggregates three years, I add
for three subsequent values of t to construct these fits, see S2 Appendix. A large, positive
indicates more classifications are being added, while a negative
suggests classifications are being deleted. While this relation fits the data rather well, I observe substantial variation across the three moments in
. Especially in the 2019/2023 reclassification moment, many extra classifications were added to the system. The values for
appear to vary across different CPC (sub)classes too: some receive relatively more reclassified families than others. This leads to the question whether some technologies are more prone to reclassification than others, i.e., having intrinsically higher
s. Again reasoning from the scope or applicability of a technological class, possible candidates for greater
could be general (or multi) purpose technologies [16,17]: wide scopes may be more likely to attract reclassifications from other classes, (thereby widening the scope again).However, measuring multi-applicability in patents is challenging and even if that could be done, then fitting and estimating
for smaller classes also brings in additional challenges. I therefore leave investigating the variability of
for later work.
For three reclassification moments (2013/2016, 2016/2019, and 2019/2023), I calculate the net reclassifications and divide this by the total number of classifications prior to reclassification. I plot this fraction for the earliest filing year of the corresponding patent families. As the filing years c get close to the moment in time of reclassification t, the net reclassifications per classification r(c,t) sharply increase. I include fits based on the relation for different values of
. For this plot, the classifications are aggregated on subclass level. Only the families occurring in the classifications systems before and after reclassification are included.
That recent patents are more likely to get reclassified makes sense for several reasons. First, when new classes are introduced, they likely consist mostly of recent patents: if they consist of mostly older patents, it would be puzzling why the class was not introduced earlier. Second, recent patents may not yet be well understood and new applications are still being discovered. Third, the language of recent patents may be more modern and the classification (search) algorithms still need some time to adjust to this language.
3. A model of reclassification and growth
In this section, I will explore analytically how reclassification and growth could interact. I will introduce a model that describes the growth of a technology over time incorporating two effects: (i) how inventions in that technology trigger new inventions and (ii) how that technology is reclassified so that it includes inventions from other technologies. Let me denote the number of patents in that technology with filing year c by nc: this group of patents will be referred to as a ‘cohort’. Generally, the number of patents in a cohort will change over time: with later reclassifications, new patents might be added and subtracted. I am therefore interested in describing how nc depends on time t, i.e., nc(t). To incorporate (i), I use the principle that knowledge develops cumulatively, i.e., that new knowledge builds on old knowledge. [18–20] This can be implemented by allowing any earlier result in a technology to at any time trigger a new result in that technology. The triggering of new patents, essentially creating the new cohort c, can therefore be written as , where
. Note however that this only counts for what will be the new cohort at time t + 1, i.e., only for c = t + 1. To incorporate (ii), I explicitly use the empirical patterns identified in Section 2. The first pattern suggests that the probability for a patent from cohort c to get reclassified at time t is inversely proportional to t − c. The second pattern suggests that the greater the class, the more families are reclassified into or out of that class. I will focus on the net-reclassification effect, hence subtracting the negative reclassification from the positive ones. The change in nc(t) as a result of reclassification at time t can then be written as
. Note that these changes apply to all cohorts except the newly introduced cohort, so for this expression, c will be smaller than t + 1, i.e., c < t + 1. Finally, one cannot reclassify or trigger patents into future cohorts, hence
for c > t + 1. The three dynamical relations and corresponding cohorts can therefore be summarized in the following set of equations:
where I introduce the technology-dependent triggering rate and net reclassification rate
. In the following, I will focus on the case where
but the model in principle also allows for negative reclassification rates. In the S1 Appendix I discuss in more detail what happens if
.
Let me focus on the simplest case, a technology with one patent at t = c = 0, i.e., nc(0) = 1 for c = 0 and nc(0) = 0 otherwise. As I show in more detail in the S1 Appendix, the third line of Equation 1 directly leads to the relation
which can be used to derive the exact solution
Summing over c ≥ 0 yields the total number of families
In Fig 4, I plot nc(t) for c, three values of t and fixed and
. It is clear that for large t, the number of patents grows exponentially (note the y-axis is logarithmic) before their numbers drop again in years close to t. To better understand how the base of the exponential relation depends on
and
, it is instructive to calculate the generating function G(z) of n(t), that is,
for some 0 ≤ z < 1. In the S1 Appendix, I demonstrate that this evaluates to
Following the solution in Equation 3, this figure shows the number of patents in each cohort nc(t) plotted for each filing year c and for three values of time t = 30,35,40. I choose the values and
. The number of patents increases exponentially until the apparent ‘decline’ when the filing year c of a cohort is close to t, that is, for recent cohorts.
Furthermore, following a theorem from generating functions theory ([31], see page 341), I show that this rational generating function allows for an asymptotic estimate of n(t). For real and
, it counts
where n0 is a constant depending on ,
, and the initial conditions. The base of the exponential relation g is a real number satisfying
While this implies one cannot obtain a closed expression of g in terms of and
, Equation 7 is enough to get a qualitative understanding of the effect of reclassification on growth. As I show in the appendix, g is always larger for larger
and larger
and
. When both
and
, i.e., for slow growth,
. The model therefore suggests that, in the regime of slow technological growth, there is a positive, non-linear relation between the rate of reclassification and the rate of growth. Note that the upper bound
implies that in a regime of fast technological growth, the relation between the rate of reclassification and growth rates is linear instead.
Using the model solutions, I can calculate several interesting quantities that allow for empirical validation of the model. In the following, I discuss how to interpret and calculate three of these quantities: the decline time, reclassification proportion and number of classifications per patent.
3.1. Decline time
Using the model solutions, I can calculate several interesting quantities that allow for empirical validation of the model in the next section. Let me first consider the year for which the number of patents peaks before they (seemingly) decline in recent years, again see Fig 4 for an illustration. As is clear from this figure, the peaks are not stationary but instead shift to more recent years as t increases: the decline is therefore only apparent. Let me study the typical ‘decline-time’ td = t − c between the filing year c where the peak is and the present time t. If the peak occurs at filing year c, then that is the earliest year such that , or, using Equation 2,
From the initial conditions and the second line of Equation 1, note that . Approximating
, substituting in Equation 8 and simplifying then gives
The c for which the left- and right-hand sides are equal is the peak year, hence the typical time-span between peak and present can be approximated by
3.2. Reclassification proportion
Second, I consider the total number of reclassified families in year t, denoted by v(t). The phenomenology in Section 2 requires that v(t) is proportional to the total number of families. Let me show the proportionality is indeed predicted by the model and at the same time derive an explicit expression for the constant of proportionality V, which I will refer to as the ‘reclassification proportion’. Equation 1 leads to . Rather than doing the summation, let me calculate the total number of families added in year t by summing Equation 1 over all c, obtaining
Dividing left and right by n(t), using the approximation for n(t) in Equation 6 and summing over t, this becomes
For the left- and right-hand side to agree for large t, I conclude that . As required, the number of reclassified families is proportional to the total number of families; furthermore, the proportion of reclassification is expected to be
.
3.3. Number of classification per patent
Finally, the current formulation of the model is focused on the development of one individual technological class, where nc(t) represents a set of unique patents with filing year c at time t. Note that if this set would be the collection of all patents, i.e., ‘all technology’, then because there are no external classes or patents to draw from. However, approaching this slightly differently, the model can still be sensibly applied to the union of all technologies: if nc(t) instead represents all unique classifications of patents with filing year c at time t (that is, counting patents with multiple classifications multiple times), then nc(t) does increase for c < t. In this interpretation, there is a relevant quantity W0(t) describing on average how many classifications each new patent has (upon introduction). Assuming W0 is constant (or changes very slowly), the number of unique patents is then given by
, allowing us to calculate the number of classifications per patent, denoted by W (for more details see S1 Appendix),
Another interpretation of this expression is that the growth rate g − 1 is proportional to W (the proportionality constant being ): in other words, technologies or technological classes with more classifications per patent are expected to grow proportionally faster.
4. Model validations
In this section, I explicitly check the main predictions made by the model using patent classification data. More specifically, based on the estimated values for the model parameters and
, I will compare the predicted values with the empirical values for four interesting quantities.
- the overall growth factor g,
- the decline-time td,
- the reclassification proportion V,
- the number of classifications per patent W.
Let me apply the model to ‘all technology’ (see Section 3.3). Based on the definitions in Equation 1, patent numbers between 2010–2015, and the fits in Fig 3, the values for and
can be directly estimated (for a detailed explanation, see the S2 Appendix). I find
and
.
As a first validation, using and
in Equation 7, I predict an overall growth-factor of 1.07 < g < 1.08. Measuring g using OLS fits for the log number of classifications between 1980 and 2015 using the 2023 dataset gives g ≈ 1.079, see Fig 5. For the other datasets and using the cumulative number of classifications instead, I find similar values for g between 1.075 and 1.08. All of these are very well fitted by an exponential relation (R2 ≈ 0.98). I conclude that Equation 7 agrees rather well with observation.
This figure shows the total number of classifications of patents by earliest filing year according to 4 different datasets. I include a fit of the 2023 data (R2 = 0.98), which indicates that classifications grow exponentially (the y-axis is logarithmic). Note the similarity with Fig 4, especially with the apparent declines in recent years. A closer examination points out that the 2013, 2016, 2019, and 2023 data peak respectively in 2006, 2012, 2013, and 2014.
As a second validation, I use g = 1.079 and in Equation 10 to predict the decline-time td ≈ 5.1. Fig 5 plots the total number of classifications of all patents by their earliest filing year and clearly demonstrates the apparent declines in recent years. Note also the similarity with Fig 4. A closer examination points out that classifications in the 2013, 2016, 2019, and 2023 data peak respectively in 2006, 2012, 2013, and 2014. This leads to decline-times in these datasets of 7,4,6 and 9 years. These values are close to the predicted value for td, the only exception being 2023. However, if I choose
for the 2023 data (see Fig 3), the prediction is that td ≈ 7.6. To demonstrate that the decline-time is also observed for each CPC section and a number of subclasses, I include plots of their time development for various versions of Patstat in S2 Appendix. For those cases too, I observe decline-times between 3 and 6 years.
As a third validation, I use the estimated (lower) values for g and in the expression
to predict a higher estimate for the reclassification proportion V ≈ 0.056. The easiest way to measure this value is to determine, for a given reclassification moment, the total net reclassifications and divide by the number of classifications before reclassification. Using reclassification moments 2016/2019 and 2019/2023, this leads to a reclassification proportions 0.023 and 0.053 respectively. Those values are reasonably close to the predicted higher estimate for V. Furthermore, as explained in Fig 2, the slope of the linear relation between the subclass reclassifications and size is of the order 0.05. While this value takes into account three reclassification years, the order of magnitude is in agreement with the predicted V. This suggests that the model applies reasonably well also to individual subclasses, that is, to individual technologies.
For the fourth and final validation, I will consider the classifications per patent first for all technology and then specifically for different technologies. To predict this quantity for all technology, I will use Equation 14 and the measured values for ,
, W0 = 1.25 for the CPC group level (see S2 Appendix) to derive that 3.7 < W < 4.1 classification/patent. Using W0 = 0.7 for the CPC subclass level, this is 2.0 < W < 2.13 instead. Measuring the average number of classifications per patent (using the 2023 data) gives 3.6 classifications per patent on the CPC group level and gives 2.14 classification/patent on the CPC subclass level. The measured values are therefore very close to the predicted intervals. Next, I will explicitly calculate the number of group classifications per patent wk for each CPC group k (considering classifications on the CPC group level). Equation 14 predicts that this is proportional to the growth rate gk − 1 of these groups. Testing if variation across the groups in wk correlates with variation in gk is therefore an indirect test of the effect of reclassification. I sub-select all groups with at least one patent each year between 1970 and 2015 and determine the growth factor gk using an OLS estimation on the log-transformed number of patents each year. The number of classifications per patent wk is determined by dividing the total number of group classifications of each patent in a group by the number of unique patents in that group. Each group belongs to exactly one CPC section. In Fig 6, I plot each group’s wk and gk − 1 separately for each section. For each section, I observe highly significant and moderately strong correlations between these quantities (see the values for the Pearson correlation coefficient R2). The only exception is Chemistry/metallurgy, however, as I show in more detail in the S3 Appendix, the correlations become much stronger when groups with wk > 9 classification/patent (i.e., very high wk) are excluded. Those groups belong to 4 subclasses only, a very small minority. In any case, the relation between gk − 1 and wk appears to be especially strong for wk < 9. The relations are still highly significant if I control for the total number of patents in each group and if I control for the number of patents in recent years. This indicates that the correlations are not an indirect result of group size or having many recent patents. In the S3 Appendix, I include the details of two more robustness checks. In the first, I reformulate wk to account for the bias of recent years in reclassifications. In the second, I reformulate gk to exclude double counting of classifications using a fractional attribution. Even with these reformulations, I observe a strong relation between classifications and growth. Finally, it is interesting to note that the slopes of the observed linear relations, which the model predicts correspond to
, appear to vary across different sections: the slopes are large for Human Necessities, Physics, and Electricity and small for Performing Operations and Chemistry. It is not directly clear where these technology-dependent variations come from. In line with prediction, note that the slopes are of the same order of magnitude as the measured
.
For each CPC group k, I plot the growth rate gk − 1 for the average classifications per patent wk. I plot the groups for each CPC section separately (each group belongs to one section). As predicted, most slopes are of the same order of magnitude as the estimated .
5. Discussion
This research develops an analytical model relating technological reclassification to the rate of development. Empirical tests using patent classification data are found to be consistent with the model predictions. Possibly, this means that further improvements to categorization and classification could speed up technological innovation, and that reclassified ideas can be considered renewed sources of future ideas. There are, however, also important limitations to this research that I should mention.
First, it is important to stress that the observed positive relationship between reclassification and growth rates is no direct evidence of causality. To explore this further, it is essential to better understand the mechanism of cumulative knowledge development proposed in the model, where earlier patents in a given class ‘trigger’ the development of later patents within the same class. Reclassification could then accelerate the process, as reclassified patents would trigger new results in multiple classes. Although the ‘triggering event’ is difficult to observe, earlier work finds a positive relationship between reclassification and increased citation rates [7], suggesting that reclassified patents may indeed have a greater triggering potential. However, the extent to which (patent) citations can be interpreted as knowledge flows is still a matter of debate [32,33]. Of course, another possibility is that there is only indirect causality or causality through other (unobserved) factors, in which case another model would apply.
Second, one should keep in mind the various facets of reclassification: some reclassifications reflect deep conceptual shifts and reinterpretations, while others merely facilitate searching within the patent system and involve less substantial conceptual change. Although the distinction is relevant to the implications of this research, it is difficult to separate these facets empirically.
Third, I should mention that, although the CPC classification is by now widely adopted, there are also other patent classifications systems in use across various countries, most notably the International Patent Classification (IPC). While validating these results with the IPC would be valuable, the CPC is largely derived from the IPC and offers more detailed classifications in many respects.
Finally, I explore the broader implications of this research. This research indicates that reinterpretation, here considered a part of reclassification, is closely linked to the speed of development. This finding aligns well with theories of paradigm change (e.g., [21,22]), which propose that innovation accelerates during paradigm shifts, with reinterpretation playing a central role. In contrast, this study treats reclassification as a continuous process, occurring at a relatively constant rate throughout a technology’s evolution and as such allowing for verifiable predictions. This raises the question: should reinterpretation be viewed as a singular event or as a continuous, everyday process?
The results also have some direct implications for research on technological change using patent classifications. The proposed model, based on only two parameters, can explain various seemingly unrelated quantities, such as growth rates, decline-times, and the number of classifications per patent. It is intuitive to think that these quantities are determined mainly by external factors, such as fluctuations in investments or resource price dynamics. For example the earlier mentioned green decline, which ran over several years, has been attributed to factors such as falling fossil fuel prices and the financial crisis (for an overview see [29]). Yet, as reclassification (i.e., ) for green technology is relatively strong [29], Equation 10 directly suggests that the decline-time for green technology is relatively large, offering an alternative explanation for the years of green decline. This demonstrates that it is not always necessary to bring in external factors; instead knowledge-intrinsic factors might provide a simpler explanation.
Supporting information
S3 Appendix. Robustness checks and OLS results.
https://doi.org/10.1371/journal.pone.0348804.s003
(PDF)
Acknowledgments
The author thanks François Lafond, two anonymous reviewers, and the editor for useful comments on the script. This project arose as a spin-off of a wider research project on patent reclassification together with Nicolo Barbieri and Kerstin Hotte. The author thanks Nicolo and Kerstin for joint developments of the data strategy and fruitful discussions.
References
- 1. Tria F, Loreto V, Servedio VDP, Strogatz SH. The dynamics of correlated novelties. Sci Rep. 2014;4:5890. pmid:25080941
- 2. Napolitano L, Evangelou E, Pugliese E, Zeppini P, Room G. Technology networks: the autocatalytic origins of innovation. R Soc Open Sci. 2018;5(6):172445. pmid:30110482
- 3. Lee ED, Kempes CP, West GB. Idea engines: unifying innovation & obsolescence from markets & genetic evolution to science. Proc Natl Acad Sci U S A. 2024;121(6):e2312468120. pmid:38306477
- 4. Tacchella A, Napoletano A, Pietronero L. The language of innovation. PLoS One. 2020;15(4):e0230107. pmid:32352986
- 5.
Bailey MF. History of Classification of Patents. Patent Office Society; 1946. Available from: https://books.google.nl/books?id=lIM0AQAAMAAJ
- 6. Youn H, Strumsky D, Bettencourt LMA, Lobo J. Invention as a combinatorial process: evidence from US patents. J R Soc Interface. 2015;12(106):20150272. pmid:25904530
- 7. Lafond F, Kim D. Long-run dynamics of the U.S. patent classification system. J Evol Econ. 2019;29(2):631–64.
- 8. Bergeaud A, Potiron Y, Raimbault J. Classifying patents based on their semantic content. PLoS One. 2017;12(4):e0176310. pmid:28445550
- 9. Fink TMA, Reeves M. How much can we influence the rate of innovation? Sci Adv. 2019;5(1):eaat6107.
- 10. Benson CL, Magee CL. Quantitative determination of technological improvement from patent data. PLoS One. 2015;10(4):e0121635. pmid:25874447
- 11. Nordhaus WD. An economic theory of technological change. Am Econ Rev. 1969;59(2):18–28.
- 12. Romer PM. Endogenous technological change. J Polit Econ. 1990;98(5, Part 2):S71–102.
- 13. Acemoglu D, Akcigit U, Kerr WR. Innovation network. Proc Natl Acad Sci U S A. 2016;113(41):11483–8. pmid:27681628
- 14.
Pichler A, Lafond F, Farmer JD. Technological Interdependencies Predict Innovation Dynamics. 2020. Available from: https://papers.ssrn.com/abstract=3547474
- 15. McNerney J, Farmer JD, Redner S, Trancik JE. Role of design complexity in technology improvement. Proc Natl Acad Sci U S A. 2011;108(22):9008–13. pmid:21576499
- 16. Bresnahan TF, Trajtenberg M. General purpose technologies ‘Engines of growth’? J Econ. 1995;65(1):83–108.
- 17.
Bresnahan TF. General purpose technologies. Handbook of the Economics of Innovation. 2010. https://doi.org/10.1016/S0169-7218(10)02002-2
- 18. Trajtenberg M, Henderson R, Jaffe A. University versus corporate patents: a window on the basicness of invention. Econ Innov New Technol. 1997;5(1):19–50.
- 19. Merges RP, Nelson RR. On limiting or encouraging rivalry in technical progress: the effect of patent scope decisions. J Econ Behav Organ. 1994;25(1):1–24.
- 20. Persoon PGJ, Bekkers RNA, Alkemade F. How cumulative is technological knowledge? Quant Sci Stud. 2021;3.
- 21.
Kuhn TS. The structure of scientific revolutions. vol. 111. Chicago: University of Chicago Press; 1970.
- 22. Dosi G. Technological paradigms and technological trajectories: A suggested interpretation of the determinants and directions of technical change. Res Policy. 1982;11(3):147–62.
- 23.
Hall BH, Jaffe AB, Trajtenberg M. The NBER patent citation data file: Lessons, insights and methodological tools. 2001.
- 24. Dass N, Nanda V, Xiao SC. Truncation bias corrections in patent data: implications for recent research on innovation. J Corp Fin. 2017;44:353–74.
- 25. Probst B, Touboul S, Glachant M, Dechezleprêtre A. Global trends in the invention and diffusion of climate change mitigation technologies. Nat Energy. 2021;6(11):1077–86.
- 26.
Acemoglu D, Aghion P, Barrage L, Hémous D. Climate Change, Directed Innovation, and Energy Transition: The Long-run Consequences of the Shale Gas Revolution. National Bureau of Economic Research; 2023. pp. 31657. http://www.nber.org/papers/w31657
- 27.
Aghion P, Boneva L, Breckenfelder J, Laeven L, Olovsson C, Popov AA. Financial markets and green innovation. European Central Bank; 2022.
- 28.
Martin R, Verhoeven D. The Private Value of Clean Energy Innovation. Author working paper. 2022. Available from: https://www.aeaweb.org/conference/2023/program/paper/e7R598BG
- 29. Barbieri N, Hotte K, Persoon P. The evolving boundary of green technology. arXiv e-prints. 2025.
- 30.
Cooperative Patent Classification. www.cooperativepatentclassification.org. 2025. Available from: https://www.cooperativepatentclassification.org/cpcSchemeAndDefinitions/table
- 31.
Graham RL, Knuth DE, Patashnik O. Concrete mathematics: a foundation for computer science. 9th ed. Addison-Wesley; 1993.
- 32. Alcácer J, Gittelman M. Patent citations as a measure of knowledge flows: the influence of examiner citations. Rev Econ Stat. 2006;88(4):774–9.
- 33. Sharma P, Tripathi RC. Patent citation: a technique for measuring the knowledge flow of information and innovation. World Patent Inf. 2017;51:31–42.