Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Research on technology prospect risk of high-tech projects based on patent analysis

  • Liwei Zhang ,

    Roles Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Resources, Writing – original draft, Writing – review & editing

    zhangliwei19810@126.com

    Affiliation School of Management and Engineering, Capital University of Economics Business, Beijing, China

  • Zhihui Liu

    Roles Data curation, Validation, Writing – review & editing

    Affiliation Research Center for Information Science Methodology, Institute of Scientific and Technical Information of China, Beijing, China

Abstract

The uncertainty of high technology has determined the high-risk character of high-tech projects. Thus, it is of great importance to effectively avoid the risk of high-tech projects by thoroughly analyzing projects’ methodologies and fully understanding the technology prospect risk of projects in the feasibility study phase. This study proposes a systematic research framework to identify and analyze the technology prospect risk of projects based on patent analysis. Thus, text mining technology and principal component analysis are used to improve the traditional patent map method and construct a technology prospect risk map. Moreover, patent value evaluation and correlation analysis methods are combined to identify technology potential areas and calculate the value of technology prospect risk. At the same time, an empirical study is conducted with project cases in the field of optical communications, and the technology prospect risk situations of these projects are ascertained through qualitative and quantitative methods. The study is innovative and practical and offers a better combination of analysis methods for current technology development and specific projects.

1. Introduction

With the rapid development of the social economy, high technology plays a crucial role in the development of enterprises, countries, and the entire society. As the carriers of high technology, high-tech projects are based on technological innovation, and technical uncertainty can lead directly to project risk. Government services attempt to select valuable high-tech projects that would be beneficial to fund. Because specific technologies applied in projects play a very important role, they must be analyzed and evaluated before the projects are carried out to ensure that the upcoming projects possess good research and development (R&D) prospects. Technology prospect risk is mainly used to identify whether the technology used in a project has R&D value, whether there is a risk of infringement, and so on. Since patents are useful sources of knowledge about technical progress and innovative activity [16] and constitute reliable intelligence that reflects advances in technological development [7], patent analysis has been considered to be a vital tool for the formulation of technology strategy formulation and R&D planning [8].

Previous studies have identified technological directions or areas for technological opportunities from a broad view, yet few studies address how to detect and evaluate distinctive technologies that can act as new technological opportunities at the individual technology level. Previous studies on technological opportunities based on patent analysis have proposed approaches and systems for identifying potential technological types by exploiting keyword-based morphology analysis [9], for identifying undeveloped technological areas by analyzing patent vacuums [10], for forecasting products by temporal analysis of patent maps [11], and for generating discontinuous ideas of product improvements by exploiting system evolution patterns [12]. These previous studies focus on exploring technological opportunities that have not been explored, but few studies combine patent analysis with a specific project to judge the project’s technology prospects.

To address the aforementioned problem, this paper proposes a method to evaluate the prospect risk of a specific technology based on patent analysis. The identification of technology prospect risk of a specific project can lead to the correct selection and increased investment of resources by the government, and speed up the overall research in specific fields. The analysis model outline is presented in Fig 1. The procedure consists of (1) collecting patent data, (2) determining keywords in a certain field, (3) transforming patent documents and the technological methods of a project into a keyword frequency matrix, (4) conducting principal component analysis, (5) constructing an initial technology prospect risk map of the project, (6) determining technology potential areas and technology barrier areas, and forming a technology prospect risk map of the project, (7) conducting correlation analysis and obtaining the technology prospect risk value of the project. By identifying the relationship and measuring the correlation degree between the technology procedure of a project and current patents in the same technology industry, the proposed method qualitatively and quantitatively analyzes the prospect risk of a project’s technology procedure.

thumbnail
Fig 1. Overall procedure for analyzing projects’ technology prospect risks.

https://doi.org/10.1371/journal.pone.0240050.g001

This paper is structured as follows. Section 2 addresses the underlying methodology for the proposed approach: patent analysis and patent map. Section 3 focuses on the overall research framework and details the processes for analyzing and evaluating the technology prospect risk of high-tech projects. A case study with an R&D project of automatic switching optical network node equipment, and optical communication technology-related patents is provided in Section 4, followed finally by the discussion and conclusion.

2. Related work

2.1 Patent analysis

Patent analysis is widely used in many countries, fields, industries, companies and technology sectors [13]. Patent literature presents an abundance of technical and market information, such as technical characteristics, ownership, and commercial worth [14, 15]. Approximately 80% of the world’s technology knowledge [16, 17] can be found in patents. Patent documents are sourced from national patent offices, ensuring the authority and accuracy of patent data. Moreover, national patent offices have perfected the structure of patent documents, further enhancing the usability of patent data. Thus, researchers, including technology developers and policymakers, can easily access and utilize patents as data sources [18].

Patent documents are composed of structured and unstructured data: structured data have a consistent and standardized format, such as patent number, filing date, IPC classification number, patent family, cited patent, and patentee or inventors [10, 18, 19]; the unstructured patent data comprise narrative text including the patent title, abstract, claims, and description [20, 21]. The methods and techniques for patent analysis are applied according to the characteristics of the structured and unstructured data. Therefore, patents can be analyzed from two viewpoints: quantitative and qualitative analysis. For the quantitative analysis method, patent analysis is applied to trend analysis, time-series analysis, and share analysis in accordance with patent type, technological classification, assignees, and so on. The result of quantitative analysis has been shown in the form of numerical trends or curves [22, 23]. A bibliometric technique is used to investigate dynamic trends in technology development, and statistical methods make structured data more informative and constructive [9]. Social network analysis is increasingly popular as an advanced method of structured data analysis for monitoring technological development and identifying technological competition, such as technology development path tracking [24, 25], the integration of interdisciplinary technologies [26], and patent infringement lawsuits [27]. To conduct qualitative analysis, the contents of patent documents are utilized. Since it is difficult to systematically extract valuable information from natural languages, such as texts or descriptions, unstructured data is transformed into structured data [13]. Thus, text mining is a widely used technique that logically and automatically derives keywords from collections of unstructured data [28]. The keywords extracted from patents usually include technology, products, components, and functions [13]. The visual output of the structured patent data is represented in the form of graphs and networks, whereas the results from the unstructured patent data are represented as patent maps [29].

The purpose of patent analysis is diverse in terms of technical and economic decision making: identifying technological competitors and partners [30, 31], analyzing technology trends [3235], identifying patent infringements [36, 37], conducting strategic technology planning [3840], identifying patent quality [4143], and determining novelty in patents [44, 45]. In fact, a technology strategy in firms specializing in technology acquisition, technology transfer, and even mergers and acquisitions can be formulated with the use of the methodological patent analysis described above [46].

2.2 Patent map

One patent analysis tool is a patent map that visualizes all the relationships among patents for a given technology. Patent maps showing competition trends in technological development can provide valuable input for decision support of R&D strategies [33]. A patent map is a visual representation of the relationship among patents and contains a variety of visual concepts and descriptions, such as charts, graphs, bar charts, and tables [47, 48], and significant amounts of technological information can be acquired in informative and easy ways. Thus, the patent map plays an important role in formulating strategies because it provides practical and intuitive information [49]. Because text mining technology can be applied to unstructured data processing to build patent maps, patent maps can effectively convey potentially valuable knowledge and explicit information from patents [21, 50]. Therefore, it is imperative to develop a patent map because it provides valuable information in a graphical manner, helping people to explore technological opportunities and avoid technological risks. Thus, research on patent maps has attracted the attention of interested people.

Because current technologies develop so rapidly and technology competitions become unusually fierce, it is imperative to avoid unnecessary investment and gain the seeds of technological development in the applicable fields. On the basis of such awareness, the Japan Patent Office has been producing and providing more than 50 types of expressions and more than 200 maps for several technology fields since 1997 [51, 52]. In addition, many other countries such as Korea, Italy [53, 54] and the USA [55] also provide many kinds of patent maps. Patent maps are categorized according to specific purposes: the technical patent map, the management patent map, and the claim patent map [9]. The technical map is used to understand core technology and identify potential technology. Management patent maps trace dynamic trends of specific technologies. Claim patent maps are useful for monitoring patent conflicts [13]. However, current patent maps have some drawbacks. According to Kim et al. [49], most of them are time-based, ranking-based or matrix maps that consider only structured or unstructured items of patent documents.

3. Methods

This research applies text mining to analyze the unstructured items in patent documents, such as the titles, abstracts and claims, as well as the technical proposal of the high-tech project. Thus, the patents and the technical proposal are combined together to construct a keyword frequency matrix. Then, principal component analysis [10] and correlation analysis are used to evaluate the technology prospect risk of the project. At the same time, a visualization method is applied to develop the technology prospect risk map of projects, which can show the R&D value of the adopted technology and the technology barriers that may be touched on. The construction process is shown in Fig 1.

3.1 Determination of technical keywords and acquisition of patent documents

The study selected the research objective, located it in a specific technical area, determined the technical keywords, defined retrieval conditions and acquired the relevant patent documents.

  1. Determination of technical keywords and definition of a patent retrieval query: The determination of keywords has a direct impact on the acquisition of patent documents. To improve the scientific nature and integrality of the keywords, keyword retrieval strategies in the relevant technology field are determined by three methods: terminology extraction [56], literature study and expert consultation. On the basis of these three means, the technical keywords of a specific industry are determined, and a patent retrieval query is defined.
  2. Retrieval and acquisition of patent documents: The primary source of patents employed in this study is the United States Patent and Trademark Office (USPTO) database. The USPTO database is one of the most representative systems because patents submitted in the US are often simultaneously submitted in other countries, and the US has the largest commercial market in the world [57, 58]. The patent retrieval query is input to obtain patents from the USPTO patent database. Then, the patent documents are preprocessed—which requires cleaning, integration and loading—and clean data are obtained for the following data analysis.

3.2 Constructing a frequency matrix of keywords

3.2.1 Feature representation of documents.

Feature representation refers to text information being represented by feature items. We only need to process the feature items to comprehend unstructured text processing. At present, there are a variety of feature representation models, such as the Boolean model, vector space model, and probabilistic model. In this paper, according to the characteristics of patent documents, the vector space model [59] is preferred.

In the vector space model, the patent documents are considered to be a set of vectors composed of feature items, and each document is expressed as a standardized eigenvector V(D) = (t1,w1; t2,w2; ……; tn,wn), where ti is a feature item, and wi is the weight of ti in the document D. In this way, all patent documents constitute a vector space. When the collection of patent documents is fixed, the position of ti is fixed as well; thus, the eigenvector can be simplified as V(D) = (w1, w2,…, wn).

The weight value depends on the selection of the weighted formula, which directly affects the performance of text categorization. There are two kinds of frequently used weight calculation methods: two-value representation, which means “if ti appears in the article, wi is 1, otherwise 0”, where ti is a feature item, and wi is the weight of ti; and frequency feature, which means that wi represents the appearance frequency of ti, such as 0, 1, 2. The first method is simpler, but much useful information is lost. The second one is more reasonable, as it retains the frequencies of feature items. To obtain more accurate results, the frequency feature is chosen to calculate the weight of the feature items in this study.

3.2.2 Forming a keyword frequency matrix.

In this section, we analyze the unstructured items in patent documents, including names, abstracts, claims and so on. At the same time, the technology adopted by the high-tech project must be expressed in the same format as the patent documents, such as names, abstracts and specifications, and studied together with the patent documents.

The research method is as follows: according to the identified keywords, the appearance frequency of the keywords in the names, abstracts and specifications are counted. The keyword frequency matrix is obtained and is defined as follows: where p is the number of documents (patents and projects), q is the number of keywords, and aij is the frequency of the j-th keyword appearing in the i-th document.

3.2.3 Initial map of projects’ technology prospect risk.

A given industry includes many technical keywords, and one document usually contains more than 10 keywords. Thus, it is difficult to grasp the characteristics of each document, let alone the technology layout of the entire industry. To address this issue, principal component analysis is used to analyze the keyword frequency matrix and obtain its principal components. Then, a coordinate system is built based on the principal components, and the patents and project information are mapped onto a multidimensional space. Finally, we construct an initial map of the projects’ technology prospect risk. The development process is shown in Fig 2.

thumbnail
Fig 2. Constructing process for initial map of projects’ technology prospect risk.

https://doi.org/10.1371/journal.pone.0240050.g002

3.2.4 Determination of technology potential areas.

Fig 2 shows the patent layout in one particular industry. The entire space represents one specific technical field, and there are many vacant areas that correspond to undiscovered technical areas. However, whether these areas are worth being developed remains to be analyzed.

  1. Setting the technology potential areas.
    First, one or more relatively large areas with lower patent density need to be determined.
    The rule for selecting potential areas is that the patent density is relatively sparse but not too low. If the patent density is too high, the area has been the focus of R&D and has many technology barriers, with little R&D potential. If the density is too low, few studies have been performed in the area. In this case, the risk would be very high if R&D is carried out in the area. Because the type of projects to be carried out in this paper focuses on the application of a technology’s R&D rather than basic theory research, an area with very low density is not suitable. First, because any application technology’s R&D is based on the basic theory research, it is necessary that the basic theory research has matured; second, even if the theoretical foundation has matured, it is still unknown whether the technology can be industrialized, as nobody is trying to do so and few experiences can be referred to. Thus, this is an extremely high-risk area. If a project in this area is conducted blindly, its R&D progress is full of risk.
    According to the principles above, the possible technology areas of potential are established. After developing the initial map of technology prospect risk and identifying the patent vacancy areas, the next step is to verify the validity of each vacancy area.
  2. Verifying technology potential areas.
    Some vacancy areas seem valuable, but due to the low potential value of surrounding patents, the vacancy areas are in fact barren. Therefore, it is indispensable to verify the effectiveness of the technology potential areas. The standards of identifying a valuable vacant area depend on the significance of its surrounding patents.
    The process of verifying the technology potential areas is as follows: First, the original information of all surrounding patents is collected and used to define vacancy areas. The range of patent information is broad and diverse, from basic information (patent numbers, patent names, etc.), to more complex information, such as abstracts, claims and so on.
    Secondly, the evaluation criteria of the technology potential area’s effectiveness should be set up. Through expert interviews and literature reviews, three kinds of indicators are established:
    1. Importance indicators:
      • Number of patent claims. The number of patent claims can indicate the scope or width of patents [60], and show the value of patents. A previous study [61] showed that a more valuable patent would invite more claims, cover a wider technology range, and encounter more infringement charges and litigations. This study uses the average value of claims requested by surrounding patents to represent the number of patent claims.
      • Patent family. Patent family is defined as “the set of patents (or applications) filed in several countries which are related to each other by one or several common priority filings” by the OECD (Organization for Economic Co-operation and Development) Patent Statistics Manual [62]. Given the territorial character of patent protection, when applicants want to protect their inventions in different countries, a patent application needs to be filed in each one of the patent offices where protection is sought. As a result, the first patent filing made to protect the invention (the priority filing) is followed by a series of subsequent filings and together they form a patent family [63]. Because the application fee and annual maintenance fee of patents are very high, a patent whose patent family includes more members is more valuable.
    2. Novelty of technology: this factor is reflected by the characteristic coefficient of the new technology [64].
      • Characteristic coefficient of the new technology n: , where v is the growth rate of the technology and α is the maturity coefficient of the technology. It is a comprehensive indicator to reflect the degree of emergence or aging of the technology. A larger n means that its new technology features are more obvious and that it possesses more development potential.
      • Growth rate of technology: v = a/A, where a is the application quantity (or approval quantity) of the inventions in one year, and A is the cumulative application quantity (or cumulative approval quantity) of the inventions dating back 5 years. Calculated for several years in a row, the value of v is increasing, showing that the technology is in the inception or growth stage.
      • Mature coefficient of technology: α = a/(a + b), where a is the application quantity (or approval quantity) of the inventions in one year, and b is the application quantity (or approval quantity) of the utility models in one year. Calculated for several years in a row, the value of α is decreasing, showing that the technology is becoming increasingly mature.
    3. Infringement risk: this risk is evaluated by the patent licensing rate.
      Based on the above indicators, the technology potential areas can finally be determined.

3.2.5 Determination of the technology barrier areas.

The determination of the technology barrier areas is easier than that of the technology potential areas. The technology barrier areas are selected based on the patent density, and the densest one is usually chosen as the technology barrier area. The technology barrier area could be identified according to the patent distribution on the initial map of technology prospect risk.

3.2.6 Technology prospect risk map of projects.

Technology potential areas are determined according to the validity test, while technology barrier areas are identified by referring to the patent density. Then, the technology prospect risk areas of the project are defined, and the technology prospect risk map is constructed. Finally, the technology prospect risk of the project can be qualitatively judged according to the position of the technology in the risk map.

3.2.7 Analysis of projects’ technology prospect risk.

The technologies adopted in high-tech projects, the patents surrounding the technology potential areas (PP) and the patents surrounding the technology barrier areas (PB) are separately expressed as keyword frequency matrixes TS, TP, and TB. Then, the correlation analysis between TS and TP, as well as the correlation analysis between TS and TB, are both carried out. Finally, the value of technology prospect risk can be obtained. The specific method is as follows:

In this paper, the similarity degree [59] is applied to measure the correlation between documents Dt and Dp. Dt represents the keyword frequency vector of the technology adopted by a high-tech project, and Dp represents the keyword frequency vector of a patent document. The similarity degree is defined as the distance between Dt and Dp and expressed is by the angle cosine formula: (1) where Dt = (wt1,wt2,…,wtn), Dp = (wp1,wp2,…,wpn), and w represents the frequency of a certain keyword appearing in a certain document.

The similarity degree between the technologies adopted by high-tech projects and patents in the same industry can be revealed by calculation. Suppose the similarity degree between the technologies and PP is Stp, and the similarity degree between the technologies and PB is Stb. According to the analysis above, the effective technology potential area represents the low-risk area, while the technology barrier area is in the high-risk area; thus, (2) where ⦿ represents the synthesis between Stp and Stb. Aiming at a specific research object, the specific synthesis method is adopted through many experiments.

4. Empirical study

With the continuous exponential growth of IP services, the reform of the telecommunications regulatory system, and the gradual and comprehensive opening of the telecommunications market, optical communications will gradually replace traditional switching, transmission, and access technologies, and ultimately achieve all-optical networking. In view of the obvious advantages and development prospects of optical transmission networking, many countries have invested a lot of manpower, material and financial resources in pre-research. The US Department of Defense Research Agency (DARPA) has funded a series of optical networking projects, such as "Optical Network Technology Cooperation Program (ONTC)" developed mainly by Bellcore,"All-Optical Communication Network" pre-research program developed mainly by Lucent, and so on. In Europe and Japan, similar optical transmission networking projects have also been carried out. At the same time, China has also arranged a cross-themed Chinese high-speed information demonstration network project in the national 863 high-tech plan. Obviously, optical transmission networking has become another new global optical communication development climax after SDH electrical networking.

An R&D project with automatically switched optical network (ASON) node equipment in the optical communication field——"ASON Automatic Switching Optical Network Node Equipment Development and System"is selected for the empirical study. The project is to finish the R&D tasks of ASON with functions including multiple types of business access and dynamic resource allocation. As the undertaking unit of the project, P Company relying on its leading technical strength and deep understanding of optical network requirements, is actively committed to the research and development of ASON equipment and how the existing traditional network can smoothly evolve to the next-generation intelligent optical network ASON. The paper analyzes the technology prospect risk of the project accordingly.

4.1 Data collection

At present, the five development directions in the field of optical communications are a new generation of optical fiber, optical access network technology, ultra-high-speed transmission systems, IP over Opticall, and optical transmission networking technology, which is shown in Fig 3.

Through expert consultation and literature review, the patent retrieval query for each sub-field of the Optical Communication field were determined (Table 1). For analysis, we applied the keywords in Table 1 and collected the patents issued in the USPTO from 2000 to 2020. The total number of patents is 21002, and the detail is displayed in the Table 1.

thumbnail
Table 1. Patent retrieval query of optical communication.

https://doi.org/10.1371/journal.pone.0240050.t001

4.2 Initial map formation of projects’ technology prospect risk

4.2.1 Principal component analysis.

According to the techological solutions of the project and the patent data obtained in the field of optical communications, combined with the keywords in Table 1, a keyword word frequency matrix is constructed. According to the method proposed in section 3.2.3, perform principal component analysis on the generated word frequency matrix to obtain the component matrix, as shown in Table 2. There are a total of five factors whose eigenvalues are shown in Table 3.

Through analysis and verification, only factor 1 and factor 3 are effective. Therefore, factor 1 and factor 3 are selected to construct the coordinate system.

4.2.2 Mapping patent documents.

Through the analysis in section 4.2.1, factor 1 and factor 3 act as the X-axis and Y-axis, respectively, to set up the coordinate system, and the patent documents are mapped onto two-dimensional space. The result is shown in Fig 4.

thumbnail
Fig 4. Initial map formation of projects’ technology prospect risk.

https://doi.org/10.1371/journal.pone.0240050.g004

4.3 Setting up the initial technology potential areas

In this section, the initial technology potential areas are set up. According to Fig 5, the marked areas may be the technology potential areas, but further validation work is needed.

4.4 Validating technology potential areas

Through the effectiveness analysis, the situations of the selected six areas can be seen in Table 4.

According to the research results in section 3.2.4, there are four rules that can identify the technology potential areas:

  1. More claims represent better patent quality. The areas surrounded by patents with more claims possess better development prospects.
  2. Lower authorized rates mean less infringement possibilities. The areas surrounded by patents with lower authorization rates possess larger development space.
  3. Larger characteristic coefficients of new technology indicate stronger new technical features and more development potential.
  4. Larger patent families reflect better patent quality. The areas surrounded by patents whose patent families include more members possess more development potential.

Therefore, considering these four rules, GROUP2 and GROUP4 are the first choices for R&D potential regions, as shown in Fig 6. Table 5 shows some patents surrounding the two regions.

thumbnail
Table 5. Surrounding patent table of effective technology potential areas.

https://doi.org/10.1371/journal.pone.0240050.t005

4.5 Technology prospect risk map of project

Based on the above analysis and combined with patent density analysis, the technology potential areas and the technology barrier areas are displayed in Fig 7.

4.6 Technology prospect risk of project

According to the optical communication industry to which the study objects belong, a large number of projects in the field are selected and verified, and the formula for calculating the technology prospect risk is obtained, as shown in formula (3): (3)

According to the correlation analysis among the high-tech project, PP and PB, and combined with the calculation of formula (3), the risk value of the technology PR is equal to 0.451, which means Stp >Stb. Thus, we know that the technology prospect risk of the project is less.

The validity of the experimental results can be verified by the actual implementation of the project. Through the implementation of the "ASON Automatic Switching Optical Network Node Equipment Development and System" project, P Company has successfully developed ASON node equipment with T-bit switching capacity, which has been applied to the National High-tech Plan Project "High-performance Broadband Information Network" and many other information networks. At the same time, P Company has also launched the converged intelligent optical network solution to provide a full range of support for the evolution of operatorsߣ existing transmission networks to the next generation of intelligent optical networks.

The project has achieved innovative results in the ASON node equipment frame structure, large cross-capacity non-blocking cross-over matrix construction and so on, which have been applied for and authorized more than 20 invention patents. Moreover, P Company has also participated in the formulation and revision of a number of ASON-related international standards. In addition, P Company has not been subject to patent infringement lawsuits for any of the final results.

5. Discussion and conclusion

At present, government services attempt to select some promising high-tech projects to fund, but high-tech projects commonly have high technology risk. Thus, it is imperative to evaluate the technology potentials and risks before the projects start. To address this problem, the paper proposed an approach to evaluate the technology prospect risk of high-tech projects based on systematic processes and scientific methods——to construct a technology prospect risk map and calculate the technology prospect risk value.

The proposed approach has several advantages over previous methods. Firstly, most previous studies focus on exploring technological opportunities that have not yet been explored, so their outputs provide a broad view of technological directions or technological areas that are promising and undeveloped [1]. However, few studies focus on a specific technology to explore whether it can act as a new technological opportunity at the individual technology level. The paper proposes a method for exploring the potential of a specific technology and evaluating the risk of that technology. According to the empirical research, this approach is significant and offers valuable practical guidance in selecting R&D projects, as it combines the information about technology development in a certain industry with a specific project application. Secondly, previous researches pay more attention to the identification of patent vacuums, while the evaluation of technology potential and risk takes into account not only technology potential areas, but also technology barrier areas. The research makes full use of the patent information around technology potential areas and technology barrier areas, and applies text mining, visualization methods and the correlation analysis method to construct a technology prospect risk map, as well as to calculate the value of the technology prospect risk, so as to synthetically evaluate the technology prospect risk. Finally, in order to make the research results more practical, we have developed a software system to implement our method more simply and efficiently, reducing the burden of manual work and therefore allowing even those who are unfamiliar with the complex algorithms to benefit from the research results. Then managers, researchers and engineers interested in new technology development can effectively evaluate project technology prospects and avoid technology risks before a project is approved and started. In addition, the analytical results can be updated easily with minimal involvement of experts, since the patent documents are totally reusable. As time goes on, new patent data and new technical terms will be separately supplemented into the original patent database and keywords set. When new projects need be evaluated, the projects will act as inputs, and their technology prospect risk will be outputs. Thus, this approach ensures the effectiveness and reusability of the software system.

Despite its contributions, this study exists only at the explorative stage and is subject to certain limitations as outlined below. Firstly, it is vital to elaborate the keyword extraction process, since keyword extraction plays a critical role in analyzing the technology prospect risk. Although the term extraction algorithm and expert judgment are employed to extract the keywords, covering both the quantitative and qualitative aspects of keyword extraction, other systematic methodologies should supplement these techniques in order to validate the extracted keywords. Secondly, to avoid identifying the technology potential areas subjectively, the study selects three kinds of indicators to evaluate the technology potential areas. Future research could integrate the existing body of literature on value and potential assessment of individual patents and technologies with the proposed approach to construct a systematic evaluation system of technology potential areas.

Nevertheless, the research opens up a new approach for the application of patent map by combining patent information with risk assessment of high-tech projects closely. At the same time, the development of a technology prospect risk map helps people to learn about the R&D potential of a project and technology barriers that may be encountered in the R&D process. We expect that the proposed method can be incorporated into the strategic technology planning process to assist technology experts in identifying project technology risks before a project is funded.

Acknowledgments

We are grateful for the support and technical assistance from Prof. Shao Shicai at Ministry of Science and Technology the People’s Republic of China.

References

  1. 1. Basberg BL. Patents and the measurement of technological change: A survey of literature. Research Policy. 1987; 16(2/4):131–141.
  2. 2. Ernst H. Patent information for strategic technology management. World Patent Information. 2003; 25(3): 233–242.
  3. 3. Grilliches Z. Patent statistics as economic indicators: A survey. Journal of Economic Literature. 1990; 28(4): 1661–1707.
  4. 4. Jaffe AB, Trajtenberg M. Forgarty MS. Knowledge spillovers and patents citations: Evidence from a survey from inventors. American Economic Review. 2000; 90(2): 215–218.
  5. 5. Li YR, Wang LH, Hong CF. Extracting the significant-rare keywords for patent analysis. Expert Systems with Applications. 2009; 36(3): 5200–5204.
  6. 6. Hwang I. The effect of collaborative innovation on ICT-based technological convergence: A patent-based analysis. PLoS ONE. 2020; 15(2): e0228616. pmid:32017795
  7. 7. Yoon J, Choi S, Kim K. Invention property-function network analysis of patents: A case of silicon-based thin film solar cells. Scientometrics. 2011; 86(3): 687–703.
  8. 8. Mogee M, Kolar R. International patent analysis as a tool for corporate technology analysis and planning. Technology Analysis & Strategic Management. 1994; 6(4): 485–504.
  9. 9. Yoon B. On the development of a technology intelligence tool for identifying technology opportunity. Expert Systems with Applications. 2008; 35(1–2): 124–135.
  10. 10. Lee S, Yoon B, Lee C, Park J. Business planning based on technological capabilities: Patent analysis for technology-driven roadmapping. Technological Forecasting and Social Change. 2009; 76(6): 769–786.
  11. 11. Gerken J, Moehrle M, Walter L. Patents as an information source for product forecasting: Insights from a longitudinal study in the automotive industry. Manchester, England: The R&D Management Conference. 2010.
  12. 12. Mann D. Better technology forecasting using systematic innovation methods. Technological Forecasting and Social Change. 2003; 70(8): 779–795.
  13. 13. Son Changho, Suh Yongyoon, Jeon Jeonghwan, Park Yongtae. Development of a GTM-based patent map for identifying patent vacuums. Expert Systems with Applications,2012,39:2489–2500.
  14. 14. Park Y, Yoon B, Lee S. The idiosyncrasy and dynamism of technological innovation across industries: Patent citation analysis. Technology in Society. 2005; 27(4): 471–485.
  15. 15. Soo VW, Lin SY, Yang SY, Lin SN, Cheng SL. A cooperative multi-agent platform for invention based on patent document analysis and ontology. Expert Systems with Applications. 2006; 31(4): 766–775.
  16. 16. Lerner J. The importance of patent scope: An empirical analysis. RAND Journal of Economics. 1994; 25(2): 319–333.
  17. 17. Ernst H. Patent applications and subsequent changes of performance: Evidence from time-series cross-section analyses on the firm level. Research Policy. 2001; 30 (1):143–157.
  18. 18. Jeong Yujin, Yoon Byungun. Development of patent roadmap based on technology roadmap by analyzing patterns of patent development. Technovation,2015, 39–40:37–52.
  19. 19. Verbeek A, Debackere K, Luwel M, Andries P, Zimmermann E, Deleus F. Linking science to technology: Using bibliographic references in patents to build linkage schemes. Scientometrics. 2002; 54(3): 399–420.
  20. 20. Liu SH, Liao HL, Pi SM, Hu JW. Development of a patent retrieval and analysis platform-a hybrid approach. Expert Systems with Applications. 2011; 38(6): 7864–7868.
  21. 21. Tseng YH, Wang YM, Lin YI, Lin CJ, Juang DW. Patent surrogate extraction and evaluation in the context of patent mapping. Journal of Information Science. 2007; 33(6): 718–736.
  22. 22. Curran CS, Leker J. Patent indicators for monitoring convergence examples from NFF and ICT. Technological Forecasting & Social Change. 2011;78(2): 256–273.
  23. 23. Young P. Technological growth curves. A competition of forecasting models. Technological Forecasting & Social Change. 1993; 44(4): 375–389.
  24. 24. Choi C, Park Y. Monitoring the organic structure of technology based on the patent development paths. Technological Forecasting and Social Change. 2009;76(7):769–786.
  25. 25. Meyer M. Patent citations in a novel field of technology—What can they tell about interactions between emerging communities of science and technology? Scientometrics. 2000; 48(2): 151–178.
  26. 26. No H, Park Y. Trajectory patterns of technology fusion: Trend analysis and taxonomical grouping in nanobiotechnology. Technological Forecasting and Social Change. 2010; 77(1): 63–75.
  27. 27. Kim Hyoungshick, Song JaeSeung. Social network analysis of patent infringement lawsuits. Technological Forecasting & Social Change,2013,80:944–955.
  28. 28. Kostoff R, Toothman D, Eberhart H, Humenik J. Text mining using database tomography and bibliometrics: A review. Technological Forecasting and Social Change. 2001; 68(3): 223–252.
  29. 29. Abbas A, Zhang LM, Khan S U. A literature review on the state-of-the-art in patent analysis. World Patent Information. 2014;37 (4):3–13.
  30. 30. Abraham BP, Moitra SD. Innovation assessment through patent analysis. Technovation. 2001; 21(4): 245–52.
  31. 31. Geum Y, Lee S, Yoon B, Park Y. Identifying and evaluating strategic partners for collaborative R&D: Index-based approach using patents and publications. Technovation. 2013. 33(6–7): 211–224.
  32. 32. Segev A, Kantola J. Identification of trends from patents using self-organizing maps. Expert Systems with Applications. 2012. 39(18):13235–13242.
  33. 33. Yoon J, Park H, Kim K. Identifying technological competition trends for R&D planning using dynamic patent maps: SAO-based content analysis. Scientometrics. 2013; 94(1):313–331.
  34. 34. Chang PL, Wu CC, Leu HJ. Using patent analyses to monitor the technological trends in an emerging field of technology: a case of carbon nanotube field emission display. Scientometrics. 2010;82(1):5–19.
  35. 35. Park H, Kim K, Choi S, Yoon J. A patent intelligence system for strategic technology planning. Expert Systems with Applications. 2013; 40(7): 2373–2390.
  36. 36. Lee C, Song B, Park Y. How to assess patent infringement risks: a semantic patent claim analysis using dependency relationships. Technology Analysis & Strategic Management. 2013; 25(1): 23–38.
  37. 37. Park H, Yoon J, Kim K. Identifying patent infringement using SAO based semantic technological similarities. Scientometrics. 2012; 90(2):515–529.
  38. 38. Choi S, Park H, Kang D, Lee JY, Kim K. An SAO-based text mining approach to building a technology tree for technology planning. Expert Systems with Applications. 2012; 39(13): 11443–11455.
  39. 39. Yoon J, Kim K. An analysis of property-function based patent networks for strategic R&D planning in fast-moving industries: The case of silicon-based thin film solar cells. Expert Systems with Applications. 2012; 39(9): 7709–7717.
  40. 40. Yoon J. Park Y, Kim M, Lee J, Lee D. Tracing evolving trends in printed electronics using patent information. Journal of Nanoparticle Research. 2014; 16(7): 2471.
  41. 41. Du R, Ai S. Cross-organizational knowledge acquisition through flexible Hiring and joint R&D: insights from a survey in China. Expert Systems with Applications. 2008; 35(1–2): 434–441.
  42. 42. Park H, Ree JJ, Kim K. Identification of promising patents for technology transfers using TRIZ evolution trends. Expert Systems with Applications. 2013; 40(2): 736–743.
  43. 43. Trappey AJC, Trappey CV, Wu CY, Fan CY, Lin YL. Intelligent patent recommendation system for innovative design collaboration. Journal of Network & Computer Applications. 2013; 36(6):1441–1450.
  44. 44. Bonino D, Ciaramella A, Corno F. Review of the state-of-the-art in patent information and forthcoming evolutions in intelligent patent informatics. World Patent Information. 2010; 32(1):30–38.
  45. 45. Gerken JM, Moehrle MG. A new instrument for technology monitoring: Novelty in patents measured by semantic patent analysis. Scientometrics. 2012; 91(3): 645–670.
  46. 46. Narin F, Noma E, Perry R. Patents as indicators of corporate technological strength. Research Policy. 1987; 16(2–4): 143–155.
  47. 47. Chen R. Design patent map visualization display. Expert Systems with Applications. 2009;36(10):12362–12374.
  48. 48. Liu S. A route to a strategic intelligence of industrial competitiveness (pp. 2–13). Taipei: Proceedings of the first Asia-Pacific conference on patent maps. 2003.
  49. 49. Kim YG, Suh JH, Park SC. Visualization of patent analysis for emerging technology. Expert Systems with Applications. 2008; 34(3): 1804–1812.
  50. 50. Tseng YH, Lin CJ, Lin YI. Text mining techniques for patent analysis. Information Processing & Management. 2007; 43(5): 1216–1247.
  51. 51. Japan Institute of Invention and Innovation. Guide book for practical use of patent map for each technology field. 2000.
  52. 52. Ryoo JH, im IG. Workshop H—What patent analysis can tell about companies in Korea, Far East Meets West in Vienna. 2005.
  53. 53. Camus C., & Brancaleon R. Intellectual assets management: From patents to knowledge. World Patent Information. 2003;25(2):155–159.
  54. 54. Fattori M, Pedrazzi G, Turra R. Text mining applied to patent mapping: A practical business case. World Patent Information. 2003; 25(4): 335–342.
  55. 55. Morris S, DeYong C, Wu Z, Salman S, Yemenu D. DIVA: A visualization system for exploring documents databases for technology forecasting. Computers & Industrial Engineering. 2002; 43(4); 841–862.
  56. 56. Zhang W, Yoshida T, Tang X. Using ontology to improve precision of terminology extraction from documents. Expert Systems with Applications. 2009; 36(5): 9333–9339.
  57. 57. Lee Changyong, Kang Bokyoung, Shin Juneseuk. Novelty-focused patent mapping for technology opportunity analysis. Technological Forecasting & Social Change,2015,90:355–365.
  58. 58. Lee C, Cho Y, Seol H, Park Y. A stochastic patent citation analysis approach to assess future technological impacts, Technological Forecasting & Social Change. 2012; 79 (1): 16–29.
  59. 59. Shi ZZ. Knowledge Discovery (Second Edition). Beijing: Tsinghua University Press. 2011.
  60. 60. Li CY, Shi R. The exploration on evaluation indicators of patent quality. Journal of Modern Information. 2008; 28(2): 146–149.
  61. 61. Lanjouw JO, Schankerman M. Stylized facts of patent litigation: Value, scope and ownership. NBER Working Papers. 2003; 52(52):75–78.
  62. 62. OECD., Patent Statistics Manual, OECD, Paris. 2009.
  63. 63. Martínez C. Patent families: when do different definitions really matter? Scientometrics. 2011; 86 (1):39–63.
  64. 64. Zhang LW, Shao SC, Wei HY, Zhu DH, Wang XF. The patent analysis indices of OECD. Information Science. 2009; 27(1): 124–127.