Figures
Abstract
Purpose
Due to the existence of information opacity, there is a common problem of adverse selection in the process of screening alternative technology start-ups (TSs) and determining investment targets by venture capital institutions, which does not reveal the true value of enterprises and makes the market inefficient. The aim of this paper is to design an evaluation and screening system help venture capital institutions to select the qualified TSs as their investment objective.
Design
A research framework of four dimensions that include conception, technical innovation, business model and team structure, was built based on previous studies. Based on the research framework, 15 second-level indicators and 33 third-level indicators were extracted with literature research method. This paper proposes an evaluation model with back propagation neural network (BPNN) optimized by genetic algorithm (GA) to improve the rate of selecting and investing in qualified start-ups.
Findings
The results show that the evaluation accuracy of the evaluation model for qualified and unqualified enterprises can reach 80.33% and 93.67% respectively, which has verified the effectiveness of the model and algorithm.
Originality/Value
This paper established an effective evaluation system based on PCA and GA-BPNN to help venture capital institutions preliminarily screen potential technology start-ups, which provides the possibility for venture capital institutions to greatly reduce the screening time and cost, improve the screening efficiency of TSs, and scientifically assess the risk of investee projects or investee enterprises to obtain sustainable and stable excess profits.
Citation: Li J, Meng M, Liu X, Lv Y, Yu J (2024) Evaluation and screening of technology start-ups based on PCA and GA-BPNN. PLoS ONE 19(2): e0289691. https://doi.org/10.1371/journal.pone.0289691
Editor: Mohamed Rafik N. Qureshi, King Khalid University, SAUDI ARABIA
Received: May 3, 2023; Accepted: December 12, 2023; Published: February 8, 2024
Copyright: © 2024 Li et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the paper and its Supporting Information files.
Funding: This work was supported by the National Key Research and Development Program of the Ministry of Science and Technology of China (No. 2017YFB1402000). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
1. Introduction
In today’s world, science and technology play a vital role in economic and social development. The development trend of TSs has become the focus that the governments and scholars have been paying attention to. Governments and scholars are increasingly recognizing that technology entrepreneurship (TE) is an important engine for developing emerging industries and promoting high-quality economic development. The vigorous development of TSs helps to restructure the value chain of the industry, being the key in the future in the process of accelerating the evolution under the international economic pattern. In September 2014, the then Premier Li Keqiang announced at the Summer Davos Forum in Tianjin that “Widespread Entrepreneurship and Innovation” would be a new engine for China to upgrade the economy and make it more efficient. Since then, Chinese start-ups, especially high-tech ones, have made great progress. Under the guidance of innovation and entrepreneurship policies, TSs in China are forming a trend, which is the source of power to improve the overall quality of the economy and international competitiveness.
As the engine of economic growth, venture capital can provide an effective source of capital for the transformation of scientific and technological achievements (STAs) for start-ups. However, the “bottleneck” of financing seriously restricts the vigorous development of start-ups, which urgently need the support of venture capital. It is a common phenomenon that start-ups, especially TSs, have a low success rate. The production of scientific and technological innovation achievements is highly difficult and risky, which requires long-term R&D investment. The incubated TSs, with higher risk of failure, are in more urgent need of getting venture capital to support the transformation of STAs, and then push them to the market quickly to gain competitive advantages. However, when faced with a large number of TSs, venture capital institutions still face difficulties in assessing and screening them quickly and accurately. To minimize the problem of adverse selection that is prevalent when investing in start-ups, it is crucial for venture capital institutions to build an algorithm to reduce the screening cost of investment targets. This difficulty is reflected in the fact that assessment and selection is highly subjective, and it is also possible that adverse selection problems may arise when identifying investment targets. How to evaluate and screen scientifically, effectively and accurately of TSs has become an important issue for venture capital institutions.
Designing a good system of evaluating and screening TSs has become one of the key aspects of the investment process for venture capital institutions. The establishment of an evaluation and screening system can provide a series of alternative potential partners for venture capital institutions, thus helping venture capitalists in a disadvantaged information position to make strategic investment decisions, reduce information opacity in the process of achieving cooperation, and overcome the problem of adverse selection. Therefore, this paper takes TSs as the research object and constructs an evaluation model based on PCA and GA-BPNN. This model is important for venture capital institutions to evaluate the value of TSs and improve the correctness of investment decisions, as well as to select the best investment target effectively. The remainder of the paper is structured as follows: Section 2 reviews the existing literature and constructs the index system. Section 3 performs principal component analysis. Section 4 investigates the BPNN model (BPNNM) and derives preliminary simulation results. In section 5, the GA is selected to optimize the BPNNM to produce simulation results with higher evaluation accuracy. Finally, section 6 presents the conclusions, and practical implications.
2. Literature review and index system construction
2.1 Review of related studies
TE is a special form of entrepreneurship. In the process of economic development, TE can promote technological innovation, adjust the industrial structure, drive the development of entrepreneurial venture capital, and guide the economic demand and supply-side structural reform. The era of production capacity upgrading and economic structure optimization starting from TSs has arrived, and TSs have made great contributions to job creation and economic development [1].
TE not only has the characteristics of general entrepreneurship, but also is unique. Nacu and Avasikai [2] found that all the definitions of TE are slightly different, but they all have one thing in common, that is, the spirit of adventure, and the new business philosophy in the field of technology. TE is generated based on academic entrepreneurship, technological entrepreneurship, university-derived enterprise-level scientist entrepreneurship and other concepts, all of which have one thing in common—technology-driven [3]. Whereas, the introduction of various supporting policies and measures has promoted the development of TE, but also spawned the phenomenon of “pseudo-TE”. In addition, there are some phenomena occurring frequently, such as serious homogeneity of innovation achievements, plagiarism and running away of makers [4]. This also reflects that it is particularly important to improve the efficiency and accuracy of “identifying” and “screening” TSs by “incubators”. However, at present, the academic research on TSs is more inclined to the study of single-factor or multi-factor interrelationship, and there are few research results on the value assessment and evaluation of TSs.
For instance, Santisteban & Mauricio [5] conducted a systematic literature review of key success factors for TSs, and identified them through empirical research. Santisteban et al. [1] identified key success factors for TSs based on the analysis of the information system theory by using human, social, and organizational behavior theory. Kim et al. [6] investigated experts of TSs by using the Analytic Hierarchy Process (AHP) and found that commercialization of ideas, continuous investment and market-oriented opportunity conversion are the most critical success factors for start-ups, and further found that continuous investment was the most important factor, followed by commercialization of ideas. Lin et al. [7] proposed a new hesitation-fuzzy language decision method to solve the evaluation problem of TSs.
Small businesses, such as TSs, are frequently characterized by information opacity. And the opacity results in a possible adverse selection for venture capital institutions on investment targets [8]. Venture capital institutions need to overcome the problem of adverse selection in the process of screening TSs and determining investment targets. It is important to understand that a large number of economic, social, technological, ecological, environmental and other indicators define the effectiveness of the investment process [9]. According to some of the indicators, some startups are suitable for being selected and invested, while they are not according to other indicators. Certain characteristics of the enterprises (e.g., their young age, small capitalization, and simplified accountancy requirements) make it hard for outside investors to verify their financial situation and investment decisions [8]. In order to seek better and more accurate evaluation methods and strengthen the construction of credit enhancement mechanism in venture capital, it is necessary to establish a perfect evaluation and screening system and increase the realization rate of venture capital.
From the current studies, although some scholars have summarized the factors influencing the development of TSs [1,10], there is a lack of systematic and perfect screening and assessment system studies. At present, the existing studies have the following two problems. First and foremost, TSs are greatly influenced by environmental policies, and the update and iteration speed of these policies are fast, so, the earlier research results lack a scientific and effective system for evaluating and screening TSs and are not applicable to the current TSs.
The field of machine learning is now expanding rapidly at the intersection of computer science and statistics, and forms the core of artificial intelligence and data science [11]. Recent advancements in machine learning have been driven by the development of new learning algorithms. For example, Zheng et al. [12] proposed an adaptive memetic differential evolution-back propagation-fuzzy neural network. Additionally, Xue et al. [13] combined differential evolution with Adam to create ensemble of differential evolution and Adam, which integrates a modern version of the differential evolution algorithm with Adam.
Machine learning approaches are widely adopted in various fields including science, technology, and industry. These approaches contribute to evidence-based decision-making in healthcare, biomedicine, manufacturing, education, financial modeling, data governance, policing, and marketing [14,15]. In specific application of machine learning, Du et al. [16] proposed a hybrid model that combines long short-term memory with discrete wavelet transform and principal component analysis (PCA) pre-processing techniques for water demand forecasting. They aimed to improve the accuracy of the forecasting process. In another study, Ozcalici and Bumin [17] used GA to optimize the parameters of filtering rules and stock selection for enhanced trading. They integrated artificial neural networks into their approach to achieve better trading performance. Furthermore, Zhu et al. [18] developed a hybrid variational mode decomposition and stacked gated recursive unit model that consisted of a prediction module and an environment module to predict trading strategies for a target stock’s closing price. These studies highlight the diverse application of machine learning in different domains, from water demand forecasting to stock trading, and the various techniques used to enhance its performance.
The existing studies lack more in-depth simulation operations. Whether it is for evaluating and screening TSs or general start-ups, scholars mostly used AHP to evaluate them, subjective factors account for a large proportion, and it is difficult to guarantee the objectivity of the evaluation. Therefore, it is of great significance and value to re-establish an evaluation system of TSs in the current environmental background. Combined with the latest progress and specific application of machine learning methods, this study chooses to use the BPNNM and GA to evaluate the TSs (project) from a new perspective of research ideas and methods.
2.2 Construction of the index system
2.2.1 Index arrangement.
In order to build a suitable index system for evaluating and screening TSs, adopts the literature research method was adopted this paper to select the first-level indicators by collecting the existing research results as far as possible to complete the indicators sorting, and then further screening and summarizing indicators. Therefore, this paper refers to many literatures related to the evaluation and selection of start-ups, and classifies the existing index system into two types: general start-ups and TSs. The indicators are summarized in Table 1
2.2.2 Selection of indicators.
Combined with the actual situation of TSs, based on the existing literatures analysis, and combined with the characteristics of TSs, to select the first-level indicators.
- We sort out the existing assessment and evaluation index system of (technology) start-ups/ projects respectively according to the two categories of technology start-ups/ projects and general start-ups/ projects. To a certain extent, the indicators of technology start-ups/ projects are derived and optimized based on the indicators of general start-ups / projects. Therefore, this paper refers to the index system of general start-ups/ projects. Meanwhile, although the index system of technology start-ups/ projects has a high reference value, it is necessary to establish an evaluation system suitable for the current TSs due to the characteristics of rapid technology iteration and high risk.
- The evaluation indicators of start-ups tend to be consistent in the selection of environment, entrepreneurial team, technology, product and economic indicators. Most of the research factors for TSs are the research on policy support for these enterprises, scientific and technological innovation ability, various characteristics of entrepreneurs (such as cognitive style, personality traits, etc.), business model, resources, financing, entrepreneurial team, and other aspects.
From above analysis, it can be basically summarized into the following five dimensions: conceptual architecture, technological innovation, business model, team structure, and operation status to evaluate a business. However, in order to evaluate and select a qualified TS located in the start-up period, the evaluation system with five dimensions should be changed.
While some of Chinese high-tech enterprises and network ones are still in the state of no profit or micro profit before or at present. Meanwhile, for the new company, the lack of historical data means that part of the business information is in a blank state [22]. Therefore, the business status dimension is eliminated. Thereby, four dimensions of conceptual architecture, scientific and technological innovation, business model and team structure are eventually selected as the first-level evaluation indicators of the evaluation system of TSs in this paper.
In addition, the selection process of second-level and third-level indicators is the same as above, and the research results of various dimensions are sorted out via four first-level indicators. In combination with the identification conditions of TSs, the actual situation of China’s TSs, the identification conditions of high-tech enterprises, and the data investigation reports of TSs in various regions or industrial parks, several key index factors are selected to initially obtain the evaluation system of TSs.
We employed the Delphi technique to further determine the indicators and their weights. Specifically, we selected six professors from universities and nine managers from related companies. All respondents were over 30 years old. Five professors from universities were all doctoral supervisors, and their research directions were all business management. The nine managers had more than five years of experience in technology-based small and medium-sized enterprises. We solicited opinions from the aforementioned experts through online meetings, distributed survey outlines and background resources in advance, and answered some experts’ questions. We summarized the experts’ initial discussion opinions on the indicator system and fed back the results to the experts for the second revision. After four rounds of anonymous discussion, the indicator system was finally determined, and the experts’ opinions were unified. In the subsequent fifth and sixth rounds of discussion, we asked the experts to provide importance evaluations for the three-level indicators of the determined indicator system, thus further determining the corresponding scores for each indicator. The final evaluation system and index scores for TSs include 4 first-level indicators, 15 second-level indicators, and 33 third-level indicators. As shown in Table 2.1
Prior to initiating the formal research of this paper, specifically prior to distributing the survey questionnaire, eight experts were requested to evaluate the validity of the questionnaire’s structure, content, and overall design. Based on the evaluation results from these eight experts (see Table 2.2), they provided positive feedback on the questionnaire structure, content, and overall design. Therefore, the survey questionnaire designed in this paper has a high level of validity.
2.2.3 Description of the index system.
The main contents are as follows:
- Conceptual architecture, including industrial demand, technological innovation, strategic support, conceptual value, and capital markets. Whether the main products and technologies of TSs are in line with the various categories of national support and the scope of relevant guidance catalogue, which is an important guarantee for determining whether they can continue to operate and enjoy the relevant national policies and treatment in the future. Starting from the concept of entrepreneurship, consider its value, demand, strategic support and whether they belong to scientific and technological innovation, and consider the capital factors, and show them from the financing situation. Therefore, it is the subdivision index of the conceptual architecture of TSs.
- Technological innovation, including team technology research and development strength, product, and technical strength. Technological innovation has changed the status of the various pillar industries, caused the replacement of leading industries, transformed traditional industries, and accelerated the decline of old industries. Technological innovation can also make some industries have explosive growth, thus become the new pillar industries. The core of TSs is technological innovation. The analysis of technological innovation for TSs will be measured from the two aspects of technology and product, so this paper is divided into technology research and development strength and product and technology strength. The sources of technology research and development indicators are mainly the identification conditions of TSs and high-tech enterprises, while the aspect of product and technology is more consider the patent or software copyright, the undertaking of scientific research projects, and the obtaining of authoritative awards.
- Business model, including its own business maturity, industrial chain maturity, industrial chain relationship and ecological relationship. In order to better evaluate the overall strength of TSs, this paper refers to the research results of Wu & Shi [30] and Santisteban & Mauricio [5] when selecting business model indicators, and does not include traditional enterprise profit model, key resource capability and other indicators into the index system. Instead, it pays more attention to the industrial chain, their own business, and the ecological relationship of TSs, and measures the development of their industry, the development of upstream and downstream industries, the quality of suppliers and customers they cooperate with, and whether they have the experience of government-enterprise cooperation.
- Team structure, including team strength, team quality, team stability, and shareholder strength. Industry selection, development mode, system and team are the three major elements of entrepreneurial success. The scientific and technological independent innovation ability of enterprises cannot be separated from scientific research organizations and scientific and technological talent teams. The team is not only the beginning of an enterprise but also the guarantee of its development. The strength of a team should be investigated from whether there are leading experts, professors, and associate professors and the like, to evaluate the availability of its scientific research projects. Part of the indicators of team quality come from the identification conditions of TSs. Meanwhile, the experience and background of team members is investigated. Furthermore, the condition of the turnover ratio of members in the issuance conditions of the Science and Technology Innovation Board is taken as an index to measure the team stability, and the strength of shareholders is also taken as a major evaluation index.
2.3 Data source
In order to ensure the authenticity and reliability of the data obtained, the sample enterprises selected in this paper are all listed enterprises. However, since all TSs are small and medium-sized enterprises, this paper only selects listed enterprises from National Equities Exchange and Quotations of China (commonly known as “new OTC market”). The selection criteria are as follows:
- The age of the listed TSs (LTSs) on the new OTC market was no more than 5 years after its establishment (excluding public service and professional consultation enterprises);
- Their average annual asset growth rate, operating revenue growth rate and employee number growth rate shall be no less than 15%, 15% and 5%, respectively, since they were listed;
- Since listed, their worst annual growth rate of assets, worst annual growth rate of operating revenue and annual growth number of employees shall be no less than -5%, -5% and -5%, respectively.
According to these above criteria, 240 LTSs are initially selected, they are involved in high-end equipment manufacturing industry, information technology industry, related service industry, energy conservation and environmental protection industry, new materials industry, information technology industry, medical industry, biological industry, and new energy industry.
In accordance with the principle of rationality and scientificity, the preliminary sample data collected are sorted out and screened. From the perspective of the financial status and scale growth of enterprises, we divide enterprises into qualified enterprises and unqualified enterprises. Qualified enterprises represent enterprises with normal financial status and growth, while unqualified enterprises represent enterprises that are lagging behind or stagnating. These LTSs are classified into two groups that are qualified and unqualified start-ups, according to whether the number of employees, total assets and total income have maintained a growth rate of more than 10% in 2017 and 2018 compared with those in the previous year. Finally, data of 200 sample start-ups are obtained, and among these samples, 100 start-ups are qualified enterprises, and 100 unqualified.
3. Principal component analysis (PCA)
PCA, is a common data analysis method [31,32]. By using linear algebra method to reduce the dimension of multiple variables of the original data, based on which fewer independent variables are obtained, and these variables obtained are the principal components [33]. The information represented by each principal component will not overlap and can describe the whole data set in a relatively comprehensive way [34]. As can be seen from Table 2, the number of third-level indicators are high, and it is difficult to determine whether every indicator is independent or multicollinear with other indicators. In order to make the results obtained more scientific and accurate for this evaluation system, PCA method is used for the final sample data to extract the principal components and eliminate the multicollinearity between indicators. In order to make the results obtained more scientific and accurate for this evaluation system, PCA method is used for the final sample data to extract the principal components, filter the input variables of the model from 33 to 22, and eliminate the multicollinearity between indicators.
3.1 Conditions of the principal component analysis
SPSS 26.0 software is applied in this paper to input conduct the statistical analysis. The KMO (Kaiser-Meyer-Olkin) and Bartlett’s test results are shown in Table 3.
From Table 3, the KMO value is greater than 0.5, and the probability of significance in bartlett’s test of sphericity is much less than 0.05. Thus, the sample data can meet the requirements of the PCA method, indicating that the data can be used for PCA.
3.2 Results of the principal component analysis
According to the principle of PCA and research experiences, the factors with the cumulative contribution rate of variance greater than 85% or eigenvalue λ of the software greater than 1 are generally selected as the principal components. The results of the eigenvalue, and cumulative contribution rates of variance of the sample data set are shown in Table 4.
3.3 Principal component extraction
The factor load matrix is shown in Table 5. According to the principle of PCA method, the relationship between the original indicator X1X2⋯X32X33 and the principal component Ff is:
(1)
where, the right subscript, f, of Lfi indicates the serial number of the principal component, and i indicates the load number of the principal component. All indicator data is then substituted into Formula (1), and a partial display of the results for each principal component (f = 1,2, …, 22) is provided below. The partial values of Ff obtained are presented in Table 6, and the complete version can be found in the S1 Table.
4. Prediction of TSs by BPNN
4.1 Three layers of BPNN
By reviewing the relevant studies on the evaluation index system, it can be found that most scholars used the AHP, Logistic regression, fuzzy evaluation method and other research methods to help to complete the process of establishing the evaluation system, and analyze the feasibility and accuracy. However, with the continuous development of artificial intelligence, artificial neural networks are used more and more widely in the field of evaluation and prediction, especially the BPNNM, applied in various industries. The advantage of BPNNM is that it can process lots of data and has objectivity. For instance, Sun & Lei [35] used BPNN to train and simulate the financial early warning model of listed mining companies, which proved the objectivity and practical value of this method.
Combined with the characteristics of BPNNM, and considering that the evaluation of TSs involves many and complex indicators, this paper finally selects BPNNM for evaluating TSs, and drew on the studies of related scholars. For example, Shu & Xu [36] combined multi-level dynamic fuzzy evaluation with BPNN, used multi-layer hidden layer neural network structure and BP algorithm to train the network, established an evaluation decision support system, and confirmed the feasibility of this model; Chen [37] used the AHP to calculate the index weights and used the BPNN method to simulate the samples, proving that the prediction accuracy was higher than that of the traditional prediction method.
BPNN is an artificial neural network obtained through a supervised learning (that is BP algorithm) [38]. Considering the problem of learning rate, the topology of BPNN generally has three layers, as shown in Fig 1, including input layer, hidden layer, and output layer. The output layer can be one or more [39].
It is a traditional three-layer BPNN structure.
The learning process of BPNN is mainly divided into two steps. The first step is to input the sample data and calculate the output value through the corresponding weight connection of each neuron between the input layer and the hidden layer. When the expected output is met, the process ends and the whole process is positive propagation. If the output layer does not get the expected output, it will enter the second step of BP, which is reverse propagation, send back the error signal, MSE, and correct the hidden layer to the output layer and the input layer to the hidden layer. Through repeated iterations, the process ends until the expected output value is obtained.
In this paper, three-layer BPNN is used, in which the input layer is the evaluation index. According to the above results of PCA, there are 22 principal components extracted. And there are 22 nodes in the input layer. The output layer is the result of evaluating TSs, that is, whether these start-ups are qualified, so the number of output layer node is 1. And MATLAB_R2017b software is used for programming to realize the training, simulation and evaluation for the model.
4.2 Confirmation of the number of nodes in the hidden layer
The setting of the number of nodes in the hidden layer of the BPNN is a critical part of this method, which has a great influence on the final results of calculation. There are 22 nodes in the input layer of BPNN, and there is 1 node in the output layer. According to the empirical formula, the number of nodes in the hidden layer ranges from 4 to 14. It is set that the maximum training times are 1,000, the required training precision is 0.1, and the learning rate is 0.1. The number of nodes in each hidden layer is simulated, and the results in the simulation are shown in Table 7.
It can be seen from Table 7 that when the number of nodes in the hidden layer is 10, the training accuracy is optimal and the number of iterations is minimum. Therefore, the number of nodes in the hidden layer in BPNN can be determined as 10.
(7)
where, y is the number of nodes in the hidden layer, z is the number of nodes in the input layer, l is the number of nodes in the output layer, and the value of β is 1,2… 9,10.
4.3 The evaluation model based on BPNN
The evaluation model based on BPNN is established in this paper, and there is a three-layer neural network structure with 22 input layers, 11 hidden layers, and 1 output layer, as shown in Fig 2. The output of the BPNNM includes two classifications for evaluating TSs (qualified or unqualified enterprises). The calculation allows to evaluate the correct rate of investment. In turn, after simulation, the correct judgment rate of investing in qualified enterprises can be derived. In the simulation process, the data of 140 enterprises are randomly selected as training samples, among which, 70 enterprises are qualified and 70 enterprises are unqualified, and the data for the remaining 60 enterprises are selected as test samples, among which, 30 enterprises are qualified and 30 enterprises are unqualified.
By specifying the number of nodes in the input layer, hidden layer and output layer and selecting the transfer function, the hierarchical structure of the network is determined.
4.4 Simulation results
After 10 times of simulation, the correct rate of evaluation based on BPNN is shown in Table 8.
It can be found that the correct rate of judging that the enterprises are qualified is between 23.33% and 90.00%, and the average correct rate is 58.33%. The correct rate of judging that the enterprises are unqualified ranges from 63.33% to 90.00%, and the average correct is 81.00%. It can be concluded that the results of evaluating the qualified enterprises are unstable. Although most of the evaluation results are greater than 60.00% and the optimal accuracy rate of evaluation can reach 90.00%, there are still results with assessment accuracy of only 23.33%, 30.00% or 46.67%. However, the accuracy rate of evaluating unqualified enterprises is mostly more than 80.00%, but there are also cases where the evaluation accuracy rate is only 63.33%. It shows that BPNN can evaluate TSs based on sample data and has certain evaluation ability. Whereas, due to the shortcomings of BPNN itself, especially from the results of evaluating qualified enterprises. It can be clearly seen that the evaluation results based on BPNNM are not stable, and the average accuracy of evaluating the two types of enterprises is not very ideal. The stability of evaluation results is low, it is necessary to further optimize the model.
5. Prediction of TSs by GA-BPNNM
5.1 Combination of GA and BPNN
5.1.1 Advantages of GA.
In order to make up for the shortcomings of the BPNNM, and improve the stability and accuracy of the simulation results, many scholars utilize the GA, gray wolf algorithm, particle swarm optimization algorithm to optimize the BPNNM, combining with the advantages of other methods to improve the performance of the neural network model. GA is an optimization algorithm inspired by biology, developing by referring to the theory of evolution and genetics [40], and following the survival of the fittest mechanism [41]. The whole process is a process of seeking a global optimal solution [42]. GA is an adaptive search method, which simulates several natural processes: selection, information, heredity, random mutation, and population dynamics [43,44]. GA is simple to calculate, using the encoding set of parameters rather than for the parameters themselves, and the optimization range is a point group rather than a single point, which has the advantages of good optimization effect and high optimization efficiency [45]. The general GA mainly has three operators: selection, crossover and mutation operator, which are also the core of GA. The basic execution process is as follows: initialization is carried out to generate initial population; then the individual evaluation is carried out to calculate the fitness of every individual and enter the link of population evolution [46]. According to the results of individual evaluation, some individuals are generally selected as mothers by roulette-wheel selection method, and chromosomes are respectively selected to cross according to the crossover probability Pc to generate offspring. The new offspring mutate the chromosome according to the mutation probability Pm to obtain a new generation of population. If the results meet the requirements, the calculation would be stopped; if not, the population evolution process would be carried out again. Experiencing continuous evolution, the optimal solution is obtained.
5.1.2 Building GA-BPNNM.
It is found that GA has been applied to the related studies of neural network [47,48]. In particular, some scholars have pointed out that GA and BPNNM can complement the advantages with each other, which has high feasibility and accuracy [4,49]. Ding et al. proposed an optimization method with BP algorithm based on GA to speed up the training speed of BP algorithm and overcome the disadvantage that BP algorithm is easy to fall into the local minimum [50]. Using the University of California Irvine (UCI) machine learning repository data sets for experimental analysis, the results show that it is not easy to fall into a local minimum to use the method of combining GA and BP to train neural network and has better generalization ability and stable performance. Yin et al. [45] proposed IGABP algorithm based on improved GA for public opinion prediction on Weibo platform (https://weibo.com/), a Chinese social media platform based on user relationships. The results proved that IGABP algorithm is feasible in public opinion prediction on Weibo platform and has better generalization ability and higher accuracy. Han et al. made use of the advantages of GA in optimization to optimize BPNN and build a GA-BP hybrid model, to effectively evaluate and screen scientific design schemes, which confirms that the GA-BP hybrid product modeling and designing evaluation system can quickly, conveniently, effectively, and scientifically evaluate design schemes [51]. Based on the advantages of GA in optimization, we try to select GA to optimize the BPNNM, establishing a GA-BPNN model (GA-BPNNM) and conduct relevant analysis.
GA-BPNNM is mainly based on both the advantages of BPNN such as fast search, and the advantages of GA to reach the optimal solution through global search to overcome the disadvantages of BPNN which is easy to fall into the local optimal. The combination of GA and BPNN can significantly improve the performance of neural network. The basic process is that the optimal solution of GA is decomposed as the initial weights and thresholds of BPNN, and the evaluation results of TSs are finally obtained through the learning and training process of BPNN. The construction of GA-BPNNM for evaluating TSs can reduce the subjective influence and improve the accuracy of the evaluation. The flow chart of the algorithm for the GA-BPNN is shown in Fig 3.
The work chart of BPNN model based on genetic algorithm optimization was constructed.
5.2 Parameter setting of GA-BPNNM and simulation
In the optimization process of BPNN by GA, selection, crossover and mutation operation are experienced. The model parameters are set as follows: the evolution algebra of gen is 100, the crossover probability of Pc is 0.6, and the mutation probability of Pm is 0.05.
In order to verify the optimization effect of GA on the BPNNM for the evaluation of TSs, the training sample and test sample methods are selected as shown above, and the optimized and unoptimized results are directly output.
As shown in Table 9, in the comparison of 10 simulation results, the average correct rate of evaluation for qualified enterprises is 80.33%, and the one of unqualified enterprises is 93.67%, and the average evaluation accuracy rate is increased by 22.00% and 12.67%, respectively. Even in the third simulation, when the evaluation accuracy rate of qualified enterprises in the BPNN is only 23.33%, the evaluation accuracy can still be improved to 70.00% after it is through GA optimization, and the average modeling time of GA-BPNNM is very little different from the average modeling time of BPNN. Therefore, GA can improve the accuracy of the evaluation results based on GA to a certain extent, but also make the stability of the evaluation results better, which confirms the feasibility of applying GA to optimize BPNN.
6. Conclusion
In summary, the following conclusions can be drawn:
- This paper takes TSs as the research object, obtains the evaluation index system for the preliminary evaluation and screening of TSs. In the case of reducing the data information as little as possible, the dimension of the index is reduced, and the PCA method is used to extract the original 33 indicators into 22 unrelated principal components, which improves the speed and accuracy of the later analysis.
- Through learning and simulating the BPNNM of the final data, from the simulation results for ten times, the highest evaluation accuracy rate of qualified enterprises can be 90.00%, the lowest is 23.33%, the range is 66.67%, and the average evaluation accuracy rate is 58.33%. The range of the evaluation accuracy rate of unqualified enterprises is 26.67%, and the average evaluation accuracy rate is 81.00%. The results obtained are not ideal, and there are problems with the unstable and low evaluation accuracy. The reason for this problem is that the weights and thresholds of the BPNNM are randomly generated, and the gradient descent algorithm is adopted, which leads to the result easily falling into the deficiency of the local optimal BP algorithm, so it is necessary to add related algorithms on this basis to optimize it and complement advantages with each other.
- In view of the shortcomings of BP algorithm, this paper adopts GA to optimize it, uses the advantages of GA for global optimization, to take its optimal solution as the initial weights and thresholds of BPNNM, and obtains the GA-BPNNM. Empirical analysis is used to compare the simulation results of BPNNM before and after the improvement. From the simulation results for ten times, it can be seen, the range of the latter’s evaluation accuracy of the two types of enterprises is significantly lower than that of the former, the lowest evaluation accuracy of the latter is significantly higher than that of the former, the maximum optimization accuracy is 50.00%, and the average difference of modeling time is only 0.04 seconds. It shows that the BPNN optimized by GA significantly improves the evaluation accuracy, making the evaluation results more stable and reliable. The GA-BPNNM can be used for the evaluation and screening of TSs.
- This paper attempts to use the artificial neural network model to input the data obtained from the evaluation system of TSs to get the simulation results, so that the evaluation results can be changed from subjective judgment to objective evaluation, and reduce the inevitable subjective factors in manual evaluation and screening. In addition, the GA-BPNNM established in this paper can be used to conduct preliminary evaluation of TSs. The relevant data of numerous enterprises can be input for simulation at one time, and the results can be output, which can greatly improve the efficiency and accuracy of preliminary screening work.
- The evaluation system of TSs is a complex one that needs to be adjusted in time according to the changes of national policies and social development, whose influencing factors are numerous. In view of some requirements, including that the evaluation system ought to be scientific and rational, the one of TSs needs to be optimized and enriched in stages.
The experimental results show that the BPNNM optimized by the GA is superior to the BPNNM in both stability and accuracy of evaluation, and the modeling speed is basically the same. Meanwhile, the simulation results of this paper show that the GA-BPNNM can be used as an effective tool for the evaluation and screening of TSs, and providing an effective method of initially evaluating and screening TSs for various venture capital institutions. However, the number of samples selected in this paper is only 200, which is relatively small and has a certain impact on the evaluation results. Therefore, the number of samples should be expanded in the follow-up research to improve the accuracy of the research results.
Supporting information
S1 Table. The complete version of the sample data of TSs.
https://doi.org/10.1371/journal.pone.0289691.s001
(XLSX)
References
- 1. Santisteban J, Mauricio D, Cachay O. Critical success factors for technology-based startups. Int J Entrep Small Bus. 2021; 42(4): 397–421. https://doi.org/10.1504/IJESB.2020.10035620.
- 2. Nacu CM, Avasilcăi S. Technological ecopreneurship: conceptual approaches. Procedia Soc Behav Sci. 2014; 124: 229–235. https://doi.org/10.1016/j.sbspro. 2014.02.481.
- 3.
Shane SA. Academic entrepreneurship: University spin-offs and wealth creation. Edward Elgar Publishing. 2004. https://doi.org/10.4337/9781843769828
- 4. Wu D, Shen Y. English feature recognition based on ga-bp neural network algorithm and data mining. Comput Intell Neurosci. 2021; 1890120. pmid:34504519
- 5. Santisteban J, Mauricio D. Systematic literature review of critical success factors of information technology startups. Acad Entrepreneurship J. 2017; 23(2): 1–23. https://doi.org/1-23.1528-2686-23-2-101.
- 6. Kim B, Kim H, Jeon Y. Critical success factors of a design startup business. Sustainability. 2018; 10(9): 2981. https://doi.org/10.3390/su10092981.
- 7. Lin M, Chen Z, Chen R, Fujita H. Evaluation of startup companies using multicriteria decision making based on hesitant fuzzy linguistic information envelopment analysis models. Int J Intell Syst. 2021; 36(5): 2292–2322. https://doi.org/10.1002/int.22379.
- 8. Tirelli M. On the optimal investment finance of small businesses. Small Bus Econ. 2021; 56(4): 1639–1665. https://doi.org/10.1007/s11187-019-00283-1.
- 9. Stankevičienė J, Žinytė S. Valuation model of new start-up companies: Lithuanian case. Entrep Theory Pract. 2021; 12(4): 379–389. http://dx.doi.org/10.3846/btp.2011.39.
- 10. Gupta G, Bose I. Digital transformation in entrepreneurial firms through information exchange with operating environment. Informat Manag. 2019; 103243. https://doi.org/10.1016/j.im.2019.103243.
- 11. Pugliese R., Regondi S., & Marini R. (2021). Machine learning-based approach: Global trends, research directions, and regulatory standpoints. Data Science and Management, 4, 19–29. https://doi.org/10.1016/j.dsm.2021.12.002.
- 12. Zheng K., Zhang Q., Peng L., & Zeng S. (2023). Adaptive memetic differential evolution-back propagation-fuzzy neural network algorithm for robot control. Information Sciences, 637, 118940. https://doi.org/10.1016/j.ins.2023.118940.
- 13. Xue Y., Tong Y., & Neri F. (2022). An ensemble of differential evolution and Adam for training feed-forward neural networks. Information Sciences, 608, 453–471. https://doi.org/10.1016/j.ins.2022.06.036.
- 14. Yang Y., & Sun S. (2021). Tourism demand forecasting and tourists’ search behavior: Evidence from segmented Baidu search volume. Data Science and Management, 4, 1–9. https://doi.org/10.1016/j.dsm.2021.10.002.
- 15. Wu B., Wang L., & Zeng Y. R. (2022). Interpretable wind speed prediction with multivariate time series and temporal fusion transformers. Energy, 252, 123990. https://doi.org/10.1016/j.energy.2022.123990.
- 16. Du B., Zhou Q., Guo J., Guo S., & Wang L. (2021). Deep learning with long short-term memory neural networks combining wavelet transform and principal component analysis for daily urban water demand forecasting. Expert Systems with Applications, 171, 114571. https://doi.org/10.1016/j.eswa.2021.114571.
- 17. Ozcalici M., & Bumin M. (2022). Optimizing filter rule parameters with genetic algorithm and stock selection with artificial neural networks for an improved trading: The case of Borsa Istanbul. Expert Systems with Applications, 208, 118120. https://doi.org/10.1016/j.eswa.2022.118120.
- 18. Zhu Q., Che J., Li Y., & Zuo R. (2022). A new prediction NN framework design for individual stock based on the industry environment. Data Science and Management, 5(4), 199–211. https://doi.org/10.1016/j.dsm.2022.09.001.
- 19. Yachmeneva V, Vol’s’ka G. Factors influencing the enterprise innovation. Econtechmod: an International Quarterly Journal. 2014; 1(1): 133–138. https://econtechmod.pl/wpcontent/ uploads/2014vol3_1/1_2014_019.pdf.
- 20. Skawińska E, Zalewski RI. Success factors of startups in the EU—A comparative study. Sustainability. 2021; 12(19): 8200. https://doi.org/10.3390/su12198200.
- 21. Dinesh KK, Sushil . Strategic innovation factors in startups: results of a cross-case analysis of Indian startups. J Glob Bus Adv. 2019; 12(3): 449–470. https://doi.org/10.1504/JGBA.2019.101387.
- 22. Yin ZH. A brief discussion on the value assessment of High-tech Enterprises. Enterp Econ. 2007; (04): 42–44. https://kns.cnki.net/kcms/detail/detail.aspx?FileName=QUIT200704012&DbName=CJFQ2007.
- 23. Rezende JF, Correia AA, Gomes BA. The intellectual capital and the creation of value in research units linked to the Brazilian Ministry of Science Technology and Innovation. RAI Revista de Administração e Inovação. 2017; 14(3): 199–215. https://doi.org/10.1016/j.rai. 2017.03. 010.
- 24. Wu Z, Yi L. Regulatory Effect of Government Quality Award on the Relationship between Quality Management Activities and Enterprise Quality Innovation Performance and Analysis of Employees’ Psychological Demand in Quality Management Activities. For Chem Rev. 2022; 793–809. http://forestchemicalsreview.com/index.php/JFCR/article/view/594.
- 25. Akgün AE, Lynn GS. Antecedents and consequences of team stability on new product development performance. J Eng Technol Manage. 2002; 19(3–4): 263–286. https://doi.org/10.2139/ssrn.2151898.
- 26. Artz KW, Norman PM, Hatfield DE, Cardinal LB. A longitudinal study of the impact of R&D, patents, and product innovation on firm performance. J Prod Innovat Manag. 2010; 27(5): 725–740. https://doi.org/10.1111/j.1540-5885.2010.00747.x.
- 27. Slotegraaf RJ, Atuahene-Gima K. Product development team stability and new product advantage: The role of decision-making processes. J Marketing. 2011; 75(1): 96–108.
- 28. Guo B, Wang J, Wei SX. R&D spending, strategic position and firm performance. Front Bus Res China. 2018; 12(1), 1–19. https://doi.org/10.1186/s11782-018-0037-7.
- 29. González-Benito J, González-Benito Ó. A study of determinant factors of stakeholder environmental pressure perceived by industrial companies. Bus Strateg Environ. 2010; 19(3): 164–181. https://doi.org/10.1002/bse.631.
- 30. Wu YW, Shi YC. Research on hard driving elements and incubation mode of small and micro science and technology enterprises. Sci Manag Res. 2019; 37 (01): 70–73. https://doi.org/10.19445/j.cnki.15-1103/g3.2019.01.019.
- 31. Pearson K. On lines and planes of closest fit to systems of points in space. Lond Edinb Dublin Philos Mag J Sci. 1901; 2(11): 559–572. https://doi.org/10.1080/14786440109462720.
- 32. Hotelling H. Analysis of a complex of statistical variables into principal components. J Educ Psychol. 1933, 24(7): 498–520. https://doi.org/10.1037/h0070888.
- 33. Jiang L, Li Y, Cai LP. Evaluation of enterprise economic performance based on principal component analysis. J Interdiscip Math. 2018; 21(5), 1309–1314. https://doi.org/10.1080/09720502.2018.1498004.
- 34. Rahimpour K, Shirouyehzad H, Asadpour M, Karbasian M. A PCA-DEA method for organizational performance evaluation based on intellectual capital and employee loyalty: A case study. J Model Manag. 2020; 15(4): 1479–1513. https://doi.org/10.1108/JM2-03-2019-0060.
- 35. Sun X, Lei Y. Research on financial early warning of mining listed companies based on BP neural network model. Resour Policy. 2021; 73, 102223. https://doi.org/10.1016/j.resourpol.2021.102223.
- 36. Shu Y, Xu GH. Multi-level dynamic fuzzy evaluation and BP neural network method for performance evaluation of Chinese private enterprises. Wireless Pers Commun. 2018; 102(4): 2715–2726. https://doi.org/10.1007/s11277-018-5298-0.
- 37. Chen WJ. BP neural network-based evaluation method for enterprise comprehensive performance. Mathematical Problems in Engineering. 2022; 11: 7308235. https://doi.org/10.1155/2022/7308235.
- 38. Liu L, Luo J. Research of PID control algorithm based on neural network. Energ Proced. 2011; 13: 6988–6993. https://doi.org/10.1016/j.egypro.2011.12.409.
- 39. Cai X, Qian Y, Bai Q, Liu W. Exploration on the financing risks of enterprise supply chain using back propagation neural network. J Comput Appl Math. 2019; 367: 112457. https://doi.org/10.1016/j.cam.2019.112457.
- 40. Smith DJ, Lapedes AS, De Jong JC, Bestebroer TM, Rimmelzwaan GF, Osterhaus AD, et al. Mapping the antigenic and genetic evolution of influenza virus. Science. 2004; 305(5682): 371–376. pmid:15218094
- 41. Yin X, Li J, Huang S. The improved genetic and BP hybrid algorithm and neural network economic early warning system. Neural Comput Appl. 2022; 34(5). https://doi.org/3365-3374.10.1007/s00521-021-05712-5.
- 42. Saha S, Bandyopadhyay S. A new multiobjective simulated annealing based clustering technique using symmetry. Pattern Recogn Lett. 2009; 30(15): 1392–1403. https://doi.org/10.1016/j.patrec.2009.07.015.
- 43. Metawa N, Hassan MK, Elhoseny M. Genetic algorithm based model for optimizing bank lending decisions. Expert Syst Appl. 2017; 80: 75–82.
- 44. Yuan X, Elhoseny M, El-Minir HK, Riad AM. A genetic algorithm-based, dynamic clustering method towards improved WSN longevity. J Netw Syst Manag. 2016; 25(1): 1–26.
- 45. Yin FM, Xu HH, Gao HH, Bian MJ. Research on weibo public opinion prediction using improved genetic algorithm based BP neural networks. J Comput Sci. 2019; 30(3), 82–101. https://doi.org/19911599-201906-201907030013-201907030013-82-101.
- 46. Lova A, Tormos P, Cervantes M, Barber F. An efficient hybrid genetic algorithm for scheduling projects with resource constraints and multiple execution modes. Int J Prod Econ. 2009; 117(2): 302–316. https://doi.org/10.1016/j.ijpe.2008.11.002.
- 47. Ardjmand E, Ghalehkhondabi I, Young WA, Sadeghi A, Weckman GR. A hybrid artificial neural network, GA and column generation heuristic for minimizing makespan in manual order picking operations. Expert Syst Appl. 2020; 113566. https://doi.org/10.1016/j.eswa.2020.113566.
- 48. Teerasoponpong S, Sopadang A. A simulation-optimization approach for adaptive manufacturing capacity planning in small and medium-sized enterprises. Expert Syst Appl. 2021; 168(4): 114451. https://doi.org/10.1016/j.eswa.2020.114451.
- 49. Chen Y. Business English translation model based on BP neural network optimized by genetic algorithm. Comput Intel Neurosc. 2021; 2837584. pmid:34413884
- 50. Ding S, Su C, Yu J. An optimizing BP neural network algorithm based on genetic algorithm. Artif Intell Rev. 2011; 36(2): 153–162. https://doi.org/10.1007/s10462-011-9208-z.
- 51. Han JX, Ma MY, Wang K. Product modeling design based on genetic algorithm and BP neural network. Neural Comput Appl. 2021; 33(9): 4111–4117. https://doi.org/10.1007/s00521-020-05604-0.