Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Research on China insurance demand forecasting: Based on mixed frequency data model

  • Cheng Wang,

    Roles Conceptualization, Data curation, Formal analysis, Funding acquisition, Methodology, Software, Validation, Visualization, Writing – original draft, Writing – review & editing

    Affiliations School of Medical Economics and Management, Anhui University of Chinese Medicine, Hefei, China, Key Laboratory of Data Science & Innovative Development of Traditional Chinese Medicine, Philosophy and Social Sciences of Anhui Province, Hefei, China

  • Mengnan Xu ,

    Roles Conceptualization, Formal analysis, Methodology, Software, Supervision, Validation, Writing – review & editing

    15058477369@163.com

    Affiliations School of Medical Economics and Management, Anhui University of Chinese Medicine, Hefei, China, Key Laboratory of Data Science & Innovative Development of Traditional Chinese Medicine, Philosophy and Social Sciences of Anhui Province, Hefei, China

  • Zheng Wang,

    Roles Project administration, Supervision, Validation

    Affiliations School of Medical Economics and Management, Anhui University of Chinese Medicine, Hefei, China, Key Laboratory of Data Science & Innovative Development of Traditional Chinese Medicine, Philosophy and Social Sciences of Anhui Province, Hefei, China

  • Wenjing Sun

    Roles Funding acquisition, Project administration

    Affiliations School of Medical Economics and Management, Anhui University of Chinese Medicine, Hefei, China, Key Laboratory of Data Science & Innovative Development of Traditional Chinese Medicine, Philosophy and Social Sciences of Anhui Province, Hefei, China

Abstract

In this paper, we introduce the mixed-frequency data model (MIDAS) to China’s insurance demand forecasting. We select the monthly indicators Consumer Confidence Index (CCI), China Economic Policy Uncertainty Index (EPU), Consumer Price Index (PPI), and quarterly indicator Depth of Insurance (TID) to construct a Mixed Data Sampling (MIDAS) regression model, which is used to study the impact and forecasting effect of CCI, EPU, and PPI on China’s insurance demand. To ensure forecasting accuracy, we investigate the forecasting effects of the MIDAS models with different weighting functions, forecasting windows, and a combination of forecasting methods, and use the selected optimal MIDAS models to forecast the short-term insurance demand in China. The experimental results show that the MIDAS model has good forecasting performance, especially in short-term forecasting. Rolling window and recursive identification prediction can improve the prediction accuracy, and the combination prediction makes the results more robust. Consumer confidence is the main factor influencing the demand for insurance during the COVID-19 period, and the demand for insurance is most sensitive to changes in consumer confidence. Shortly, China’s insurance demand is expected to return to the pre-COVID-19 level by 2023Q2, showing positive development. The findings of the study provide new ideas for China’s insurance policymaking.

1. Introduction

China’s insurance industry has maintained rapid growth for many years, and the scale, structure, and quality of insurance have been greatly improved, but compared with the developed countries in the world, China’s insurance industry is still at a relatively low level of development (see Table 1). As can be seen from Table 1, although the total premium income of China ranks the second level in the world, the insurance density and depth of insurance are far lower than that of developed countries, and even only half of the world’s average level, it can be said that the development potential of China’s insurance market is very large, and the demand for insurance is still strong, and the study of China’s insurance demand is of strong theoretical and practical significance.

thumbnail
Table 1. Comparison of insurance development levels in major countries worldwide in 2022.

https://doi.org/10.1371/journal.pone.0305523.t001

Recent years have seen a global economic recession, rising unemployment and increasing uncertainty due to the impact of public health emergencies and systemic financial risk events. However, China has made excellent achievements in economic recovery, stabilizing financial markets, and maintaining regional stability. In this process, the insurance industry has given full play to the role of social “stabilizer” and economic “booster”, and the role of insurance in risk transfer, loss compensation, financial intermediation, and social security has become increasingly prominent. Therefore, studying the factors influencing the development of China’s insurance industry and accurately grasping the demand for insurance is crucial to ensuring the high-quality development of China’s economy.

In this paper, we attempt to construct a MIDAS model to explore the macro-influencing factors of China’s insurance demand, and at the same time make full use of mixed-frequency data information to forecast China’s insurance demand in real time. Multiple factors influence the demand for insurance, among which macroeconomic fluctuations risk, as a type of independent background risk, is considered an important factor affecting insurance purchase decisions [1]. The index of uncertainty in policies and economic conditions is an important indicator for measuring macroeconomic fluctuations in a country. The Economic Policy Uncertainty (EPU) index constructed by Baker et al. measures the degree of uncertainty in China’s economic policies [2]. Liu et al. and Ju et al. find that insurance demand increases with the fluctuation of economic policy uncertainty [3, 4]. At the same time, consumer expectations and actual disposable income levels also play an important role in affecting insurance premium expenditures [1, 5]. Therefore, we intend to use the Consumer Confidence Index (CCI), EPU, and Consumer Price Index (CPI) to construct a MIDAS model to predict the demand for insurance in China. Among them, CCI, EPU, and CPI are monthly high-frequency indicators, and insurance demand is measured using the quarterly low-frequency indicator of insurance depth (TID, regional premium income/regional GDP).

The contributions of this paper are threefold. First, the application of the MIDAS model in insurance demand forecasting enriches the research on the MIDAS model in the financial domain. Although there is relatively abundant research on the application of the MIDAS model in fields such as GDP forecasting and market volatility prediction [68], its application in the insurance domain remains comparatively limited. Furthermore, factors such as CCI, EPU, and CPI are generally considered to have an impact on insurance demand, although there is limited research on their predictive capacity concerning insurance demand. Second, Previous research on the MIDAS model focused mainly on its innovation and application. For instance, Kuzin et al. compared the prediction of Eurozone GDP using a mixed-frequency vector autoregressive model (MF-VAR) [9]. Aruoba et al. propose a mixed-frequency dynamic factor model (MF-DFM) that combines state-space models [10]. Foroni et al. propose reverse MIDAS (RR-MIDAS) and reverse U-MIDAS (RU-MIDAS) [11]. However, the performance of the model’s internal design and optimization has been neglected. We address this gap by considering the impact of different weight functions on the model and investigating the effects of three different prediction windows and different combination prediction forms on the prediction performance. Our findings lay a solid foundation for predicting future insurance demand. Thirdly, the optimal MIDAS model constructed in this paper is used for short-term forecasting and nowcasting of insurance demand in China. Previous studies mainly demonstrated the superiority and feasibility of different MIDAS models in the prediction field, without actually predicting the future.

The remainder of the paper is organized as follows. Section 2 Literature review. Section 3 The data and methodology. Section 4 Empirical results. Section 5 Nowcasting and short-term forecasting. Section 6 Conclusion and policy implications.

2. Literature review

2.1 Insurance demand influencing factors

Existing studies on insurance demand are mostly based on household microdata, examining the impact of factors such as household life cycle, financial status, asset mix, and gender on insurance demand. For example, Lin and Grace point out that there is significant heterogeneity in the demand for commercial life insurance among households in different life cycles, with older households allocating a lower proportion of commercial life insurance compared to younger households [12]; Tian and Dong point out that financial status directly affects the consumption demand for insurance, and different asset portfolios can create heterogeneity in the demand for insurance [13]; Wu and Zheng find that married female household heads are more inclined to purchase commercial insurance [14].

The above studies are based on the micro perspective, and macroeconomic volatility risk, as a separate contextual risk, has been identified as an important factor influencing insurance purchase decisions [1]. The index of uncertainty in policies and economic conditions is an important indicator for measuring macroeconomic fluctuations in a country. The Economic Policy Uncertainty (EPU) index constructed by Baker et al. measures the degree of uncertainty in China’s economic policies [2]. Liu et al. and Ju et al. find that insurance demand increases with the fluctuation of economic policy uncertainty [3, 4]. At the same time, consumer expectations and actual disposable income levels also play an important role in affecting insurance premium expenditures [1, 5]. Therefore, we intend to use the Consumer Confidence Index (CCI), EPU, and Consumer Price Index (CPI) to construct a MIDAS model to predict the demand for insurance in China. Among them, CCI, EPU, and CPI are monthly high-frequency indicators, and insurance demand is measured using the quarterly low-frequency indicator of insurance depth (TID, regional premium income/regional GDP).

2.2 MIDAS and forecasting

The prevailing prediction models are based on the co-frequency data, such as the autoregressive (AR) model, autoregressive moving average (ARMA) model, generalized autoregressive conditional heteroscedasticity (GARCH) model, back-propagation neural network model, and the grey model (GM) (1, N). While relying on the co-frequency model, prediction is often challenged by the synchronicity of data updates, leading to the need to convert indicator data frequencies. However, this conversion process: 1) leads to an information loss of high-frequency data, reducing the accuracy and timeliness of prediction; 2) may distort the structure of the low-frequency data, resulting in distortion of the constructed high-frequency data.

The Mixed Data Sampling (MIDAS) regression model proposed by Ghysels et al. effectively addresses the limitations of traditional prediction models, it incorporates variables of different frequencies into a single model by assigning weight functions to high-frequency data, to analyze the impact of high-frequency data on low-frequency data [15]. This method fully exploits the useful information contained in high-frequency data and enables advanced forecasting. Subsequently, the form and estimation methods of the mixed-frequency data sampling models have been continuously refined. Ghysels et al. introduced several mixed-frequency data sampling models, including the Multivariate Mixed Data Sampling model (M-MIDAS), Unrestricted Mixed Data Sampling model (U-MIDAS), Mixed Data Sampling model with Autoregressive terms (AR-MIDAS), and the Autoregressive Distributed Lag Mixed Data Sampling model (ADL-MIDAS) [1619]. In the meantime, the MIDAS model has been widely applied in practical research. For example, Marcellino and Schumacher used dynamic and static estimation windows and the Factor-Augmented Mixed Data Sampling (FA-MIDAS) model to forecast Germany’s GDP [20]. Foroni et al. construct several MIDAS models and the U-MIDAS model using various weight functions to make real-time forecasts of US quarterly GDP [21]. Furthermore, the MIDAS model has also been used to predict inflation, energy consumption, price fluctuations, and others [2227]. The existing literature demonstrates that the MIDAS model exhibits advantages not only in terms of prediction accuracy over its counterparts in frequency-based forecasting models, but also in its ability to achieve nowcasting and short-term forecasting, thus improving the timeliness of predictions [9, 28].

3. The date and methodology

3.1 Data processing

The interval for the monthly indicators CCI, EPU, and CPI is 2006M1-2022M3, and the interval for the quarterly indicator TID is 2006Q1-2022Q1. where CCI and CPI are downloaded from the Wind database (www.wind.com.cn), and EPU is constructed using data from Baker et al. 2013, downloaded from www.policyuncertainty.com. To eliminate the seasonal periodicity of the time series data, we use the X-12-ARIMA method to seasonally adjust CCI, EPU, and CPI. Fig 1 shows the trend of TID with CCI, EPU, and CPI. Visually, both the explanatory and response variables exhibit a rising trend with oscillations, which suggests a certain correlation between TID and CCI, EPU, and CPI. Table 2 indicates that each explanatory variable is stable and can be applied directly to predict the MIDAS model. To further demonstrate the applicability of the selected explanatory variables, we conduct a mixed-frequency data impulse response experiment Fig 2 shows that the changes in CCI, EPU, and CPI have a significant impact on TID in the short term (generally within the first 10 periods), but maintain a stable relationship in the long term.

thumbnail
Fig 1. Row mixed frequency data.

(a), (b) and (c) present the trends between the macroeconomic variables CCI, EPU, CPI and insurance demand TID, respectively, which basically show the same growth trends.

https://doi.org/10.1371/journal.pone.0305523.g001

thumbnail
Fig 2. Response of TID to CCI, EPU& CPI.

(a) Presents the impulse response of insurance demand TID to the macroeconomic variable CCI. (b) Presents the impulse response of insurance demand TID to the macroeconomic variable EPU. (c) Presents the impulse response of insurance demand TID to the macroeconomic variable CPI. (d) Presents the impulse response of insurance demand TID to the insurance demand TID.

https://doi.org/10.1371/journal.pone.0305523.g002

3.2 The methodology

3.2.1 Model setup.

The MIDAS model integrates the concept of distributed lag models (DL) and is consistent with the construction mechanism of bridging models. By applying a weighting function, high-frequency data can be incorporated into a low-frequency regression model, thus achieving the desirable outcome of coexisting data of varying frequencies.

We set the univariate hstep ahead prediction as follows: (1) where yt is the low-frequency explanatory variable in period t, and is the high-frequency explanatory variable, is the weighted polynomial function, and , L1/m is the lag operator for high-frequency data, , εt is a random disturbance term, m denotes the frequency ratio between the high-frequency variable and the low-frequency variable, in this paper we set m = 3. K is the lag order of high-frequency data, we set the highest lag order K−1 to 30. h is the prediction horizon, it represents the mixed frequency data sampling of one step forward when h = 1, that is, the high-frequency data of the first two months of the current quarter are used to predict the low-frequency data of the current quarter; when h>3, out-of-sample prediction can be realized.

We construct the MIDAS model using high-frequency explanatory variables CCI, EPU, and CPI and explain variable TID and the lag effect of TID is considered. The following expression can be obtained: (2) when there is no weight function, it is the U-MIDAS model, which can be expressed as: (3) where q1, q2, q3 is the lag order of high-frequency data, and is a random disturbance term.

3.2.2 Weighting function.

The key to the MIDAS data structure is frequency alignment, it is sufficient to treat each high-frequency variable observation corresponding to a low-frequency variable yt as the same. It is easy to achieve for a small frequency ratio but will be cumbersome for a larger frequency ratio. e.g., if yt is monthly data and is daily data, then for each yt, there will be at least 21 observations corresponding to it. In this case, it is necessary to introduce a polynomial weight function ω(k;θ) to constrain the high-frequency variables, and the optimal effect of the MIDAS model can be achieved through the selection of parameter vector θ and lag order K. In other words, the information of high-frequency data can be retained and the number of parameters to be estimated can be reduced. In this way, the high-frequency data information can be retained, the number of parameters to be estimated can be reduced, and the data-driven automatic screening of the appropriate maximum lag order K avoids the problem of lag order selection in the case of no parameter constraints. Ghysels et al. find that the Beta weight function, Almon weight function, and Exp Almon weight function are more frequently used and generally have better prediction results [18]. In this paper, we examine the predicted results of each weighting function. The weighting function can be expressed as follows:

Beta weight function (4) where (5)

The Beta polynomial weight function is derived from the probability density function in the family of Beta distributions [29], The different values of θ1,θ2 enable the generation of incremental, decreasing, and then increasing forms of weight changes. Beta weighting functions are used more often in the prediction and analysis of financial market volatility [8].

Almon weight function (6)

Exp Almon weight function (7)

Almon weight function and Exp Almon weights are some of the most commonly used polynomial functional forms that allow the construction of a wide range of weight functions with a guaranteed positive number of weights, which gives the equations the nice property of having zero approximation error [17] Two parameter exponential Almon lagging polynomials are often used in macroeconomic studies. The θ1≤300, θ2<0 are generally carried out to satisfy the macroeconomic analyses and macroeconomic forecasting required by the weight forms [28].

3.2.3 Combination forecasting and weighting criteria.

We know that combined forecasting generally improves the accuracy and reliability of forecasting relative to single indicator forecasting, so we also investigated the estimation effects of univariate MIDAS models in different combined forms. The expression for combination forecasting, derived from multiple univariate predictors, is as follows: (8) where n represents the total number of explanatory variables based on univariate predictions, denotes the value of the forward hstep prediction in period t, ωj,t represents the weight of the combined prediction function, and the usual methods for determining the weights include the following:

EW weighting method: (9)

BIC weighting method: (10)

MSFE and DMSFE weighting method: (11) where T0 corresponds to the starting point of the in-sample prediction. When δ = 1, it corresponds to the MSFE weight, and when δ = 0.9, it is the DMSFE weight.

3.2.4 Evaluation criteria.

The prediction accuracy is measured by the root mean square error (RMSE). The expression of RMSE is: (12)

In general, the smaller the values of RMSE, the better performance of the model.

4. Empirical results

4.1 MIDAS model selection

The selection of the weighting function and its optimal lag order is crucial for the MIDAS model. We construct three MIDAS models using the weighting above functions, in addition to the U-MIDAS model, resulting in four models in total. The selection criteria for the weighting function are as follows: firstly, a mixed-frequency data regression model is constructed for the insurance demand variable TID and a single explanatory variable , to obtain the optimal lag order of the model’s weighting function for the high-frequency data, in which the lag order for the high-frequency variables is set from 1 to 30. The optimal model is determined based on the Akaike information criterion (AIC). Considering the possible autocorrelation of the explanatory variable TID, we included TID(-1) in the regression model. Secondly, considering that the COVID-19 event may have an impact on the model fitting effect, for this reason, we exclude the COVID-19 (from 2019Q3 to 2022Q1) period data for re-estimation. Finally, the COVID-19 period sample is selected to test the forecasting effect of the selected optimal model.

Fig 3 represents the optimal univariate MIDAS model based on the full sample. The optimal MIDAS models for the high-frequency explanatory variables CCI, EPU, and CPI are Exp almon-AR(1)-MIDAS(3), U-AR(1)-MIDAS(7), and Exp almon-AR(1)-MIDAS(24), respectively, which indicate that the optimal lag orders in the full-sample estimation are 3, 7, and 24 months for CCI, EPU, and CPI, respectively.

thumbnail
Fig 3. Optimal model selection (full sample).

(a1)-(a4) present the AIC values corresponding to the 1–30 period lag order of the macroeconomic variable CCI in the Almon-MIDAS model, ExpAlmon-MIDAS model, Beta-MIDAS model, and U-MIDAS model. (b1)-(b4) present the AIC values corresponding to the 1–30 period lag order of the macroeconomic variable EPU for the Almon-MIDAS model, ExpAlmon-MIDAS model, Beta-MIDAS model, and U-MIDAS model. (c1)-(c4) present the AIC values corresponding to the 1–30 period lag order of the macroeconomic variable CPI in the Almon-MIDAS model, ExpAlmon-MIDAS model, Beta-MIDAS model, and U-MIDAS model.

https://doi.org/10.1371/journal.pone.0305523.g003

Fig 4 represents the effect of fitting based on in-sample data. The optimal MIDAS models corresponding to the high-frequency explanatory variables CCI, EPU, and CPI in the in-sample estimation are Exp almon-AR(1)-MIDAS(6), U-AR(1)-MIDAS(24), and Exp almon-AR(1)-MIDAS(30), respectively, which indicate that the optimal lags of CCI, EPU, and CPI in the in-sample estimation are 6, 24, and 30 months, respectively. The optimal lags for the CCI, EPU, and CPI in the in-sample estimates are 6, 24, and 30 months, respectively. It is worth noting that the change in sample period does not change the type of optimal weight function, but increases the optimal lag order of each high-frequency explanatory variable, suggesting that the TID has become more sensitive to the effects of CCI, EPU, and CPI as a result of the COVID-19 crisis. In addition, these results suggest that the Exp Almon-MIDAS model and the U-MIDAS model outperform the other models.

thumbnail
Fig 4. Optimal model selection (in-sample).

(a1)-(a4) present the AIC values corresponding to the 1–30 period lag order of the macroeconomic variable CCI in the Almon-MIDAS model, ExpAlmon-MIDAS model, Beta-MIDAS model, and U-MIDAS model. (b1)-(b4) present the AIC values corresponding to the 1–30 period lag order of the macroeconomic variable EPU for the Almon-MIDAS model, ExpAlmon-MIDAS model, Beta-MIDAS model, and U-MIDAS model. (c1)-(c4) present the AIC values corresponding to the 1–30 period lag order of the macroeconomic variable CPI in the Almon-MIDAS model, ExpAlmon-MIDAS model, Beta-MIDAS model, and U-MIDAS model.

https://doi.org/10.1371/journal.pone.0305523.g004

4.2 Benchmark model comparison

In this section, we divide the whole sample period into the sample estimation period (from 2006Q1 to 2019Q2) and the short-term forecasting period (from 2019Q3 to 2022Q1), and analyze the forecasting performance of the selected optimal univariate MIDAS model and multivariate MIDAS model in the COVID-19 period in comparison with the traditional AR and ARDL models of the co-frequency. Table 3 presents the root mean square error values of the in-sample predictions of the TID explanatory variables under the optimal weighting function and its optimal lag order. The results show that the MIDAS model outperforms the co-frequency AR model and the ARDL model in terms of in-sample prediction accuracy. In addition, the MIDAS model outperforms the same-frequency model in terms of in-sample prediction, except for the univariate EPU. It is worth noting that the in-sample prediction accuracy is lower than the in-sample estimation accuracy due to the effect of COVID-19, especially for EPU, whose optimal lag order for in-sample prediction is shorter, which indicates that during COVID-19, the change of EPU has a relatively small effect on the TID, and the TID is less sensitive to the change of EPU. (As shown in Table 3, when the EPU indicator is excluded, the M(2)-AR(1)-MIDAS model is better than the covariate AR model and ARDL model, except for the univariate EPU. AR(1)-MIDAS function is better when the EPU indicator is excluded, as shown in Table 3). On the contrary, the in-sample prediction accuracy of CCI is better than the in-sample estimation accuracy, indicating that CCI is the main factor affecting the change of TID during COVID-19, and TID is more sensitive to the change of CCI.

Fig 5 shows the in-sample predictions of the co-frequency model and the MIDAS model. It is clear from Fig 5 that the MIDAS model outperforms the co-frequency AR model and the ARDL model in terms of prediction. In addition, the overall performance of the multivariate MIDAS model is better than that of the univariate MIDAS model, while the M(2)-AR(1)-MIDAS model without the EPU variable has the best fitting performance.

thumbnail
Fig 5. Comparison of in-sample predictions.

The figure presents the optimal co-frequency model, mixed-frequency model (univariate, multivariate) based forecasts compared to the true values for the period 2019Q3-2022Q1. Following the legend from top to bottom, the true values, AR model forecasts, ARDL model (CCI, EPU and CPI) forecasts, Eep Almon-MIDAS (CCI) model forecasts, U-MIDAS (EPU) model forecasts, Eep Almon-MIDAS (CPI) model forecasts, and M-MIDAS (CCI, EPU and CPI) model forecasts, and M-MIDAS (CCI and CPI) model forecasts. Of course these models are based on the results under the optimal lag order and optimal weight function. From the graphs, it is obvious that the MIDAS model predictions are generally better than the co-frequency model predictions. 4.3 Optimization of MIDAS estimation methods.

https://doi.org/10.1371/journal.pone.0305523.g005

For the same MIDAS model, different prediction windows can also affect the prediction performance. Based on the three optimal models selected above, we examined the sample in-sample prediction performance under three prediction windows: fixed, rolling, and recursive. Table 4 presents the RMSE values for the sample in-sample prediction of 1 to 9 periods ahead based on CCI, EPU, and CPI under different prediction windows. From Table 4, we can see that the overall prediction performance of the MIDAS model is good, with RMSE values kept below 1.6118, and the lowest value is 0.2226. Among them, the prediction performance of the CCI indicator is the best, with the smallest and most stable RMSE value. The prediction performance of the EPU indicator is the worst, with a relatively high RMSE value. At the same time, the accuracy of the prediction gradually decreases with increasing prediction period h, which indicates that the MIDAS model is more suitable for short-term prediction. Also, different prediction windows have different effects on prediction performance. Generally speaking, the prediction accuracy of the rolling window and recursive window is better than the accuracy fixed window, but there are differences for single-variable prediction. For CCI, the rolling window prediction performance is better in the short term, the fixed window prediction performance is better in the medium term, and the recursive identification is better in the long term. For the EPU, the fixed window prediction performance is better in the short term, and the recursive window performs better in the medium to long term. For the CPI, the prediction performance of the recursive window is the best.

thumbnail
Table 4. RMSE values of in-sample predictions under different prediction windows.

https://doi.org/10.1371/journal.pone.0305523.t004

Table 5 shows the RMSE values for the four combined forms of EW, BIC, MSFE, and DMSFE for periods 1 to 9. Based on the results presented in Table 5, it can be observed that the combined forecasting approach performs favorably, as evidenced by the stable and reasonable range of RMSE values, which are generally low. Specifically, the BIC-weighted approach appears to yield the most accurate forecasts, and when applied in conjunction with a short-term rolling window, it exhibits superior performance compared to long-term recursive prediction.

After validating the multiperiod in-sample predictions of insurance demand, the satisfactory predictive performance of the MIDAS model suggests that it can be employed to forecast future insurance demand. Adopting the MIDAS model would likely provide valuable insights for decision-makers in the insurance industry.

5. Nowcasting and forecasting

The preceding discussion has established the significant advantage of the MIDAS model in terms of forecast accuracy. In this section, we present evidence that the MIDAS model is capable of nowcasting and forecasting. Using estimates based on the full sample data, Table 6 presents nowcasting and forecasting of China’s insurance demand for the period between 2022Q2 and 2023Q1. The main findings are as follows:

  1. Overall, the model’s performance weakens gradually as the horizon increases. This is because the corresponding amount of recent information gradually decreases as the horizon increases, leading to a decrease in the accuracy of predictions. Nevertheless, the forecast values remain within a reasonable range.
  2. There is significant volatility in the results of the EPU forecast, which can carry a risk of distortion. The TID values of the CPI and CCI forecast remain relatively stable, and the combination and multivariate predictions outperform the univariate predictions.
  3. The TID value has increased to some extent compared to the level of 3.87 in 2022Q1 and is showing a favorable trend, with the expectation of recovering to the level before COVID-19 in 2023Q2. This is mainly due to the normalization of COVID-19, the orderly implementation of various stabilization policies by the Chinese government, the gradual stability of social indicators, the continuous increase in the people’s confidence index in the economy, and the continuous improvement in insurance demand.

6. Conclusion and policy implications

In this paper, we try to introduce the MIDAS model into the field of insurance demand forecasting. We select monthly high-frequency data indicators, i.e., CCI, EPU, and CPI, and build a basic MIDAS model by choosing the optimal weighting function and its lag order. In addition to this, we also investigated the impact of different forecast windows and combinations of forecast forms on the forecasting performance. To verify the superiority of the selected MIDAS model, we compare the forecasting performance of the co-frequency AR model and the ARDL model. Finally, the optimal model is used for present and short-term forecasting of future TID values in China.

The following main conclusions are drawn from this study. First, the MIDAS model exhibits higher prediction accuracy compared to the co-frequency AR and ARDL models. Specifically, rolling window and recursive prediction improved the prediction performance, while combined prediction stabilized the prediction results. In addition, the multivariate MIDAS model outperforms the univariate MIDAS model, and the forecasting performance of both models decreases with increasing forecasting horizon, suggesting that the MIDAS model is suitable for short-term forecasting. Secondly, CCI, EPU, and CPI have different explanatory power and predictive effects on TID. Among them, CCI has the best predictive effect and the strongest explanatory power for TID, especially when COVID-19-related uncertainty is included in the estimation interval. The optimal lag order of CCI becomes shorter, and the accuracy of the in-sample prediction improves, suggesting that CCI is the main factor influencing the level of TID during the COVID-19 period and that TID is more sensitive to changes in CCI. Third, from the perspective of nowcasting and short-term forecasting, China’s TID is expected to return to its pre-COVID-19 level in 2023Q1, suggesting that TID will show a positive trend shortly.

In conclusion, the within-sample forecast results show that the mixed-frequency data model has a comparative advantage over the same-frequency data model in the within-sample prediction of China’s insurance demand, and the estimation and prediction results of the model indicate that consumer confidence is an important factor affecting China’s insurance demand in the COVID-19 period; the real-time forecast results suggest that China’s insurance demand will maintain good growth in the post-epidemic era.

The above findings provide several policy insights. First, the Chinese government should refer to consumer confidence indicators in the process of formulating and implementing insurance policies; stabilizing consumer confidence is an important way to secure effective insurance demand. Second, policymakers should combine behavioral indicators with mixed-frequency models, which can provide a better grasp of insurance demand by using the effective information provided by mixed-frequency data. Finally, in the post-epidemic era, China’s insurance market has better prospects, and policymakers should lay out their plans in advance to optimize insurance demand-side and supply-side reforms to safeguard the high-quality development of China’s insurance industry.

This study also has some limitations. For example, this paper only judges whether there is a long-term relationship between CCI, EPU, PPI, and TID from the impulse response results, while it does not explore whether there is a bidirectional causality between the variables and whether there is a problem of covariance; at the same time, this paper does not discuss the limitations of the weighting functions of MIDAS and their impact on the prediction effect. Of course, these contents are not the focus of this paper, and will not affect the correctness of the conclusion.

References

  1. 1. Jeleva M. Background risk, demand for insurance, and Choquet expected utility preferences. The Geneva Papers on Risk and Insurance Theory, 2000; 25(1), 7–28.
  2. 2. Baker S. R., Bloom N., & Davis S. J. Measuring economic policy uncertainty. The quarterly journal of economics, 2016; 131(4), 1593–1636.
  3. 3. Liu W. & Xu J.Y. The impact of economic policy uncertainty on the demand for life insurance. Insurance Research, 2019; 371(3), 63–81(in Chinese).
  4. 4. Ju L. Xue C.Y Zhao T.T, & Cong Y.F. Macroeconomic policy expectations and insurance demand. Consumer Economics, 2021; 37(1), 27–38(in Chinese).
  5. 5. Beck T., & Webb I. Economic, demographic, and institutional determinants of life insurance consumption across countries. The World Bank Economic Review, 2003; 17(1), 51–88.
  6. 6. Diron M. Short-term forecasts of euro area real GDP growth: an assessment of real-time performance based on vintage data. Journal of Forecasting, 2008; 27(5), 371–390.
  7. 7. Jardet C., & Meunier B. Nowcasting world GDP growth with high-frequency data. Journal of Forecasting, 2022; 41,1181–1200.
  8. 8. Monteforte L., & Moretti G. Real‐time forecasts of inflation: The role of financial variables. Journal of Forecasting, 2013; 32(1), 51–61.
  9. 9. Kuzin V., Marcellino M., & Schumacher C. MIDAS vs. mixed-frequency VAR: Nowcasting GDP in the euro area. International Journal of Forecasting, 2011; 27(2), 529–542.
  10. 10. Aruoba S. B., Diebold F. X., & Scotti C. Real-time measurement of business conditions. Journal of Business & Economic Statistics, 2009; 27(4), 417–427.
  11. 11. Foroni C., Guérin P., & Marcellino M. Using low frequency information for predicting high frequency variables. International Journal of Forecasting, 2018; 34(4), 774–787.
  12. 12. Grace L M F. Household Life Cycle Protection: Life Insurance Holdings, Financial Vulnerability, and Portfolio Implications. Journal of Risk and Insurance, 2007; 74(1).
  13. 13. Tian L, Dong H S. Family Life Cycle, Asset Portfolio, and Commercial Health Insurance Demand in China. International Journal of Environmental Research and Public Health. 2022; 19(24):16795.
  14. 14. Wu A, Zheng X. Assortative matching and commercial insurance participation: Evidence from the China Household Finance Survey. Journal of Asian Economics, 2022; (80). 101459
  15. 15. Ghysels E., Santa-Clara P., & Valkanov R. The MIDAS touch: Mixed data sampling regression models. Cirano Working Papers, 2004; 5(1), 512–517
  16. 16. Ghysels E., Santa-Clara P., & Valkanov R. There is a risk-return trade-off after all. Journal of financial economics, 2005; 76(3), 509–548.
  17. 17. Ghysels E., Santa-Clara P., & Valkanov R. Predicting volatility: getting the most out of return data sampled at different frequencies. Journal of Econometrics, 2006; 131(1–2), 59–95.
  18. 18. Ghysels E., Sinko A., & Valkanov R. MIDAS regressions: Further results and new directions. Econometric Reviews, 2007; 26(1), 53–90.
  19. 19. Andreou E., Ghysels E., & Kourtellos A. Should macroeconomic forecasters use daily financial data and how? Journal of Business & Economic Statistics, 2013; 31(2), 240–2.
  20. 20. Marcellino M., & Schumacher C. Factor MIDAS for nowcasting and forecasting with ragged‐edge data: A model comparison for German GDP. Oxford Bulletin of Economics and Statistics, 2010; 72(4), 518–550.
  21. 21. Foroni C. Econometric models for mixed-frequency data (Doctoral dissertation). 2012.
  22. 22. Liu L., & Wang Y. Forecasting aggregate market volatility: The role of good and bad uncertainties. Journal of Forecasting, 2021; 40(1), 40–61.
  23. 23. Salisu A. A., Gupta R., Bouri E., & Ji Q. Mixed‐frequency forecasting of crude oil volatility based on the information content of global economic conditions. Journal of Forecasting, 2022; 41(1), 134–157.
  24. 24. Huber F., Pfarrhofer M., & Piribauer P. A multi‐country dynamic factor model with stochastic volatility for euro area business cycle analysis. Journal of Forecasting, 2020; 39(6), 911–926. pmid:32999523
  25. 25. Li X., Li D., Zhang X., Wei G., Bai L., & Wei Y. Forecasting regular and extreme gold price volatility: The roles of asymmetry, extreme event, and jump. Journal of Forecasting, 2021; 40(8), 1501–1523.
  26. 26. Bangwayo-Skeete P. F., & Skeete R. W. Can Google data improve the forecasting performance of tourist arrivals? Mixed-data sampling approach. Tourism Management, 2015; 46, 454–464.
  27. 27. Toker S., Özbay N., & Månsson K. Mixed data sampling regression: Parameter selection of smoothed least squares estimator. Journal of Forecasting, 2022; 41(4), 718–75.
  28. 28. Clements M. P., & Galvão A. B. Macroeconomic forecasting with mixed-frequency data: Forecasting output growth in the United States. Journal of Business & Economic Statistics, 2008; 26(4), 546–554.
  29. 29. Ghysels E, Santa-Clara P, Valkanov R. The MIDAS touch: Mixed data sampling regression models. Working Paper. 2002.