Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Dynamic financial tail risk networks: A backtesting-based conditional expected shortfall approach

  • Donghao Zhang,

    Roles Conceptualization, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Supervision, Writing – review & editing

    Affiliation Chinese Financial Research Institute, Southwestern University of Finance and Economics, Chengdu, PR China

  • Xiaodong Yan,

    Roles Conceptualization, Data curation, Formal analysis, Methodology, Software, Validation, Visualization, Writing – original draft

    Affiliation School of Finance, Southwestern University of Finance and Economics, Chengdu, PR China

  • Feng Shen

    Roles Conceptualization, Funding acquisition, Investigation, Methodology, Project administration, Supervision, Writing – review & editing

    shenfeng1213@gmail.com

    Affiliations School of Finance, Southwestern University of Finance and Economics, Chengdu, PR China, Engineering Research Center of Intelligent Finance, Southwestern University of Finance and Economics, Chengdu, PR China

Abstract

This paper develops a Factor-Copula methodology for constructing high-dimensional dynamic tail risk networks based on the conditional expected shortfall (CoES) in order to overcome the limitations of traditional quantile regression and copula models. We backtest the CoES using cumulative joint violations and conditional coverage tests. The proposed factor‑copula‑CoES model reduces the rejection rates in 10‑th order conditional backtests by 56.37%, 47.72%, and 1.96% relative to the static‑copula‑CoES, time‑varying‑copula‑CoES, and factor‑copula‑CoVaR models, respectively, with all reductions being statistically significant. The dynamic analysis of the tail risk network of Chinese listed financial institutions indicates that the network topology characteristics align with real market risk events. Finally, the quantitative analyses show different effects of institutional and market factors on network-measured risk spillovers and contagion among Chinese financial institutions.

1. Introduction

In 2008, the collapse of the U.S. subprime mortgage market triggered a global financial crisis, resulting in severe market fluctuations that undermined financial system stability and have since drawn widespread regulatory and academic attention to systemic risk. Since the early 1990s, regulators and institutions have relied on Value-at-Risk (VaR) to monitor tail risk and estimate maximum potential losses at a given confidence level. However, VaR fails to fully capture expected tail losses, lacks convexity and sub-additivity, and is neither coherent nor resistant to regulatory arbitrage [1]. Research has demonstrated that Expected Shortfall (ES) provides a superior measure by coherently assessing losses beyond a confidence threshold [2] and exhibiting enhanced tail sensitivity [3]. Because of the ES’s mathematical and financial robustness, the Basel Committee on Banking Supervision (BCBS) adopted ES in 2016 to replace VaR for capital adequacy calculations. However, both VaR and ES measure only individual institution risk, ignoring interconnections between institutions. To address this limitation, Adrian and Brunnermeier [4] proposed the Conditional Value-at-Risk (CoVaR) and Conditional Expected Shortfall (CoES) to measure tail risk spillover relationships between financial institutions. is defined as the VaR of institution i when institution j is in distress, while defines the ES of institution i under the same conditions. These metrics quantify the magnitude of risk spillover between financial institutions i and j, enabling the construction of tail risk spillover networks.

Network analysis is widely used in systemic financial risk research. However, conventional financial network construction requires detailed inter-institutional exposure data [5,6], which is rarely publicly available. Consequently, studies increasingly build tail-risk networks directly based on and , with research by Hautsch et al. [7] and Härdle et al. [8] based on the CoVaR concept, and work by Chen et al. [9] and Bonaccolto et al. [10] based on the CoES. However, a common limitation in these studies is the reliance on quantile regression models, which often fail to model nonlinear and dynamic dependencies. Girardi and Ergün [11] employed the DCC-GARCH model for dynamic CoVaR estimation; however, its binary dependency structure assumptions have limited its applicability. Copula models address this by enabling more flexible dependency structures [12,13], yet both copula models and DCC-GARCH framework suffer from the curse of dimensionality in high-dimensional settings. Oh and Patton [14] introduced a Factor Copula model based on the GAS algorithm to aptly capture the high-dimensional dynamic dependencies between institutions. Recent advances in factor copula models span methodological innovation, structural change detection, and portfolio risk optimization. Opschoor et al. [15] introduced closed-form multi-factor copulas with observation-driven dynamic factor loadings, enabling scalable and flexible high‑dimensional dependence modeling. Chang et al. [16] developed a change‑point detection framework for dynamic factor copulas that identifies both smooth and abrupt structural breaks in dependence. Chen et al. [17] applied a dynamic factor copula‑based mean‑ES model to optimize portfolio risk in China’s real industry, showing the approach effectively captures time‑varying dependence in high‑dimensions. However, few studies have applied factor copulas specifically to the construction of financial risk networks. To fill this gap, we propose a Factor Copula–based framework for constructing high‑dimensional dynamic tail‑risk networks.

Backtesting is a cornerstone of risk model validation. Although robust frameworks exist for individual risk measures like VaR and ES [18,19], comparable validation frameworks for CoVaR and CoES remain underdeveloped. Girardi and Ergün [11] and Karimalis and Nomikos [13] applied CoVaR backtesting to validate the DCC-GARCH-CoVaR model and the Copula-CoVaR model. However, CoES backtesting is scarcely explored. To bridge this gap, we develop a cumulative joint violation process for backtesting CoES. This process integrates the cumulative violation process of Du and Escanciano [1] for backtesting ES and the joint violation process of Banulescu-Radu et al. [20] for backtesting CoVaR. Thus, by backtesting each edge in a CoVaR/CoES-defined tail-risk network, we ensure the reliability of the network as a whole.

We empirically test our methodology using weekly returns of 55 A-share listed financial institutions in China from January 2011 to December 2022. The sample covers three systemic risk events in China: (1) the 2013 interbank liquidity crunch, (2) the 2015–2016 stock market crash, and (3) the COVID-19 pandemic period (2020–2022). We compare tail risk networks built with three approaches: (i) high-dimensional dynamic Factor Copula, (ii) bivariate static Copula, and (iii) bivariate dynamic Copula. Results show that networks constructed with the Factor Copula consistently yield lower rejection rates in conditional coverage tests than alternatives, and CoES-based networks achieve lower rejection rates than CoVaR-based ones. Consequently, we employ the Factor-Copula-CoES model to analyze China’s financial systemic risk evolution across crisis periods. Dynamic network metrics (e.g., centrality, density) correlate strongly with actual market turbulence and exhibit clear risk clustering during crises. Network visualizations reveal risk clusters within and across financial sectors over time, indicating a strong sectoral clustering effect among highly correlated, high-risk institutions.

Finally, we investigate the factors influencing systemic risk at both institutional and market levels, measuring systemic risk through tail risk network characteristics. Compared to traditional systemic risk measures based on or (where s denotes overall systemic risk), our network-based approach offers two distinct advantages. First, conventional -based studies typically approximate systemic returns using aggregate institutional returns, leading to potential information loss [14]. In contrast, our node centrality measures in tail risk networks preserve the complete risk spillover relationships among all institutions. Second, while -based measures only capture unidirectional risk spillovers from institutions to the system, our centrality metrics (through out-degree and in-degree) capture both outgoing and incoming risks, thus providing a more comprehensive perspective on systemic risk factors. To demonstrate the utility of our network-based approach, we employ it to examine key determinants of systemic risk at both institutional and market levels. Existing literature reports mixed findings on the link between institutional size and systemic risk. Some studies report a positive correlation due to moral hazard [21], while others suggest larger institutions may pose lower systemic risk due to conflict of interest effects [22]. Smaller institutions, conversely, may exhibit greater risk vulnerability [23]. Additional financial indicators like liquidity [24,25] and leverage [26] also influence systemic risk levels. Furthermore, following Adrian and Brunnermeier [4], we incorporate the spread of China’s local government debt as a market-wide risk factor.

Compared with previous studies, this research contributes three key innovations: First, we propose a Factor-Copula-CoES-based method for constructing high-dimensional dynamic tail-risk networks. This approach simultaneously addresses two limitations: (1) the inability of quantile regression models to capture dynamic nonlinear dependencies, and (2) the curse of dimensionality in conventional high-dimensional Copula models. Second, we introduce a comprehensive backtesting framework for CoVaR and CoES to ensure the validity of both the risk measures and the constructed network. Unlike the indirect CoVaR and CoES estimations in traditional quantile regression, the Factor Copula model enables direct estimation, which in turn lays the foundation for our backtesting framework. Specifically, we combine the cumulative violation process for ES backtesting with the joint violation process for CoVaR backtesting to construct a novel joint cumulative violation process for CoES backtesting. Third, using tail-risk network characteristics as systemic risk proxies, we identify the key factors affecting systemic risk in China’s financial system. We then separately examine their influences on risk spillover and risk contagion. Our empirical results reveal that institutional-level and market-level factors have distinct impacts on risk spillover and contagion across financial institutions.

The remainder of this paper is organized as follows. Section 2 introduces the research methods, including the definitions for the CoVaR and CoES tail risk measures, the dynamic high-dimensional Factor Copula model, the backtesting framework, and the construction of the tail risk networks. Section 3 details the data and the backtesting results, and presents the constructed tail risk network and the dynamic characteristics. Section 4 explores the factors affecting systemic risk in China’s financial system by measuring the systemic risk by network characteristics. Finally, Section 5 concludes the paper and discusses the implications. Appendix contains the mathematical proofs and Monte Carlo simulations of our proposed CoES backtesting framework.

2. Methodology

2.1. CoVaR and CoES definitions

Let the random variable denote the return of institution i at time t . The Value-at-Risk (VaR) of institution i at time t at the given confidence level α ∈ (0, 1) is denoted by and is defined as the α-quantile of the distribution:

(1)

Let denote the cumulative distribution function (CDF) of . Its inverse, the quantile function, is defined as . Thus, Equation (1) can be equivalently expressed as:

(2)

Extending the concept to the bivariate random vector , the Conditional Value-at-Risk CoVaR of institution i given that institution j is in distress, denoted , is originally defined by Adrian and Brunnermeier [4] in a linear quantile regression framework as:

(3)

The original definition in equation (3) uses a strict conditioning event , which is not conducive to backtesting. Following Girardi and Ergün [11], we adopt a revised version that broadens the conditioning event to , facilitating estimation and backtesting within models like DCC-GARCH and copulas. Thus, the CoVaR definition used in this study is:

(4)

Building on formula (4), the Conditional Expected Shortfall CoES is defined as the expected shortfall of institution i at level conditional on institution j being in distress:

(5)

Compared to CoVaR, which only provides a quantile threshold, CoES captures the average of losses beyond that threshold. Thus, CoES offers a more comprehensive measure of the expected tail losses of institution i conditional on the distress of institution j.

2.2. Factor copula CoVaR and CoES methodology

This section outlines the estimation of and using a high-dimensional dynamic factor copula model. Although estimating and for a single pair (i, j) requires only a bivariate dependency and can be done with a traditional bivariate copula, constructing a tail-risk network demands consistent estimation of these measures for all institution pairs (i, j). A factor copula model, by contrast, estimates the joint distribution of the entire return vector . This provides a coherent framework for extracting all pairwise dependencies from a single high-dimensional model, ensuring consistency across the network—an assurance that separate bivariate copula estimations cannot provide.

Let be an N-dimensional vector of returns at time t. Denote its joint distribution function by , the marginal distribution of by , and the copula function linking them by . According to Sklar’s theorem [27], the joint distribution function can be decomposed into its marginal distributions and the Copula function :

(6)

Following Sklar’s theorem, we adopt a two-step estimation procedure: first model the marginal distributions , then model the copula . For the return , it is conventionally assumed that the marginal distribution of the individual returns follows the AR(1)-GARCH(1,1)-Skewed-t model:

(7)(8)(9)

and equations (7)(9) represent the mean equation, variance equation, and error distribution assumption of the GARCH model, respectively. Equation (7) is an AR(1) process that describes the mean dynamics of the return series , where is the intercept, is the first-order autoregressive coefficient (), and is the residual at time t. The conditional distribution of the residual is given by , where is the conditional standard deviation (determined by equation 8) and is the standardized residual, which follows a standard skewed t distribution (specified by equation 9). Equation (8) is a GARCH(1,1) process that describes the dynamics of the conditional variance, where is the conditional variance (volatility) at time t, is the constant term (), is the ARCH term coefficient (), and is the GARCH term coefficient (), . Equation (9) specifies that the standardized residual follows a standardized skewed t distribution with degrees of freedom and skewness parameter .

Next, utilizing the high-dimensional Dynamic Factor Copula model, the Copula function is modeled following Oh and Patton [14], the joint standardized errors and the latent variable share the same copula function, with the links between being driven by the common factor :

(10)

where the common factor follows a skewed t-distribution with degrees of freedom and skewness. The idiosyncratic factor is distributed as an i.i.d. standard t-distribution with degrees of freedom. For any given i, . The factor loading, , is a potentially time-varying weight on the common factor, and we denote the parameters of the factor loading as . We employ the Generalized Autoregressive Score (GAS) model to allow for time variation in the factor loadings in the factor copula implied by equation (10).

As an ARCH-type model, the GAS framework offers a distinct advantage because it circumvents the need to integrate out the innovation terms driving latent processes—a feature particularly valuable in high-dimensional applications. Moreover, compared to alternative ARCH-type models, the GAS approach is better suited for contexts where parameters lack immediate intuitive interpretation, such as in high-dimensional copula models [14]. In the general GAS framework, consider a copula with a time-varying parameter vector , which is driven by a vector of fixed parameters , we have:

(11)

where denote the information set containing all available information up to and including time t,

, is the conditional probability integral transforms of , is the copula function. The relation implys that the conditional copula of is equal to the conditional distribution of . , , and are the parameters to be estimated, is a scaling matrix (e.g., the inverse Hessian or its square root).

To maintain model parsimony, we restrict the coefficient matrices B and A in formula (11) to be diagonal with a common scalar parameter along their respective diagonals. Furthermore, to circumvent the estimation of an N × N scaling matrix, we set . Under these specifications, the model reduces to the following log form:

(12)

where and . In this high-dimensional dynamic Factor Copula model, there are a total of N + 5 parameters: .

While the Copula function and its likelihood are not known in a closed form, this can be approximated using numerical integration. The approximated likelihood is then maximized via maximum likelihood estimation (MLE) to obtain estimates of all model parameters. When the high-dimensional dynamic copula model parameter estimations are completed, data can be easily simulated for the implicit dynamic copula function using Equation (10). Then an inverse probability integral transformation can be performed on , by inverting the transformations from Equations (9), (8), and (7), a set of simulations for are obtained. Through extensive simulations, it is then possible to acquire the simulated joint distribution for , which allows for the estimation to be made for the and measures based on the simulated joint distributions of .

2.3. CoVaR and CoES backtesting

This section explains the backtesting CoVaR and CoES frameworks. First, following the approach by Banulescu et al. (2020), the joint violation process for backtesting CoVaR was constructed:

(13)

where represents all unknown parameters within the CoVaR estimation model, and is an indicator function. If the CoVaR model is appropriate, then for any , the series should be a martingale difference sequence (MDS).

Noting that , to verify the correct specification for the CoES model, this study considers the integral of and constructs a cumulative joint violation process as:

(14)

Given Fubini’s theorem, we have:

(15)

it can be deduced that is a Bernoulli variable with mean , which implies that for any , the sequence should also be an MDS. Using that , can be computed using the following formula:

(16)

where denotes the cumulative distribution function for , and represents the cumulative distribution function for under the condition . Both sequences and are independent and identically distributed sequences following a uniform distribution. Notably, is independent of the distribution forms for and .

Then, the joint violation process is employed to construct the conditional coverage tests for CoVaR. If the CoVaR model is correctly specified, then the m-th order autocorrelation coefficients for the joint violation process are zero, that is :

(17)

Then, the following statistic for the conditional coverage test is derived:

(18)

where , . Under certain assumptions, converges to a chi-squared distribution with degrees of freedom.

The cumulative joint violation process is employed to construct the conditional coverage tests for CoES. See Appendix for detailed proofs and Monte Carlo simulations of the conditional coverage tests for CoES. If the CoES model is correctly specified, the m-th order autocorrelation coefficients for the cumulative joint violation process are zero, that is .

(19)

From this, the following conditional coverage test statistic is constructed:

(20)

where , . Under certain assumptions, converges to a chi-squared distribution with degrees of freedom.

The logic of the cumulative joint violation process is best understood by starting from the standard univariate VaR backtest and extending it along two dimensions. The first dimension is cumulative: moving from VaR to ES backtesting. In a VaR backtest, one simply counts violations—days when the realized loss exceeds the predicted VaR. ES backtesting goes further by asking whether, on violation days, the magnitude of the loss is consistent with the predicted ES. This requires tracking the cumulative sum of scaled violations over time, rather than merely their frequency. The second dimension is joint: moving from VaR to CoVaR backtesting. Instead of asking whether one institution experiences a violation, we ask whether, conditional on institution i being in distress, the loss of institution j exceeds the predicted CoVaR. This defines a joint violation—a co-exceedance of two thresholds across two institutions. Our CoES framework naturally combines these two extensions: we construct a cumulative joint violation process that tracks, over time, whether the co-exceedances between institution pairs are consistent with the predicted CoES, both in frequency and in serial dependence.

2.4. Network estimation

When constructing the tail risk networks, the most common approach in previous studies is employed, ΔCoVaR and ΔCoES, to depict the directed risk spillover relationships. Specifically, as and are directional, (or ) does not necessarily equal (or ) when i ≠ j. is defined as:

(21)

where represents the VaR for institution i when institution j is in a normal state. The normal state is defined as , where and respectively denote the 25% and 75% quantiles of the random variable . captures the net risk spillover from institution j to institution i.

Similarly, is defined as:

(22)

similarly characterizes the risk spillover from institution j to institution i. Using the high-dimensional dynamic Factor Copula model, and are estimated for each period t, where i ≠ j. To clarify the primary risk spillover relationships in the tail risk network, a thresholding method is employed to filter out smaller ΔCoVaR and ΔCoES values. Comparing the ΔCoVaR (or ΔCoES) from the same institution at different times t, we need to determine a percentile threshold to filter out smaller values, thereby establishing the threshold for each i|j relationship. Specifically, we establish a functional relationship between the percentile threshold and the global efficiency of the network, then and are selected by monitoring the knee point of the curve. Global efficiency of a network G with N nodes is defined as:

(23)

where denotes the shortest path length between nodes i and j, defined as the minimum number of edges that must be traversed to go from i to j.

Knee-point detection is a heuristic method for selecting a threshold in complex network construction. It identifies a point that balances network sparsity and topological integrity by locating a pronounced change in the slope of a network metric (e.g., global efficiency) plotted against threshold values. The primary advantage of using the knee point of global efficiency, rather than those of network density or information entropy, for threshold selection lies in its direct optimization of a core functional property—the network’s capacity for parallel information integration. Unlike density, which only reflects connection sparsity without topological insight, or entropy, which emphasizes structural heterogeneity rather than functional performance, global efficiency captures the cost–efficiency trade-off intrinsic to many real-world networks. Following this rationale, we apply the knee-point detection method to the global efficiency curve of our tail-risk network to determine a percentile threshold. This empirically derived threshold is then used to filter the smaller ΔCoVaR and ΔCoES values. For each pair (i, j) at time t, an adjacency matrix element is determined by setting it to 1 if the spillover value is greater than or equal to the threshold, and to 0 otherwise:

(24)

Filtering risk measures into 0 or 1 using the thresholding method allows for a comparison of the overall risk spillover (out-degree) and overall risk contagion (in-degree) of different institutions in the same period t. A higher network density indicates the retention of a greater number of significant risk interaction relationships during that period. Finally, through the adjacency matrix shown in formula 25, a directed and unweighted tail risk network is constructed based on the and risk spillover relationships.

(25)

3. Results

3.1. Data

This study uses weekly return data for 55 Chinese A-share listed financial institutions from January 1, 2011, to December 31, 2022, covering 580 trading weeks. The data are sourced from the China Stock Market & Accounting Research (CSMAR) database. Given the quasi-financial characteristics of the real estate sector in China, this study incorporates real estate companies into the analysis as financial institutions [28]. The real estate sector, a cornerstone of China’s economy for two decades, accounts for approximately 39% of total bank lending. Additionally, substantial capital from bond markets, equity investments, and trust products flows into this sector. Our sample represents approximately 80% of the total market capitalization across financial and real estate sectors, based on China Securities Regulatory Commission (CSRC) industry classifications. This comprehensive coverage includes five subsectors: (1) banking, (2) securities, (3) insurance, (4) other financial services, and (5) real estate (see Table 1).

thumbnail
Table 1. Abbreviations and full names for the 55 A-listed Chinese financial institutions.

https://doi.org/10.1371/journal.pone.0351966.t001

3.2. Network backtesting results

We backtest the tail risk networks to validate the proposed Factor-Copula-CoES model. Using the backtesting frameworks for CoVaR and CoES described in Section 2.3, we test each edge of the tail-risk network and aggregate the results. In assessing the network backtesting results, we employ the ‘rejection rate’ indicator, defined as the proportion of network edges that fail the backtest. A lower rejection rate suggests better model adequacy, implying that the underlying CoVaR/CoES estimation model is more appropriate for constructing the tail risk network.

For comparison, the backtesting results of tail risk networks constructed using the bivariate static Copula model and the bivariate dynamic Copula model are also examined. Specifically, nine bivariate static Copula models: Normal, Student t, Gumbel, Rotated Gumbel, Clayton, Rotated Clayton, Frank, Plackett, and SJC; and four bivariate dynamic Copula models: time-varying Normal, time-varying Student t, time-varying Rotated Gumbel, and time-varying SJC; are considered. AIC and BIC criteria are used to determine the optimal bivariate static and dynamic Copula models. For the bivariate (static and dynamic) copula models, CoVaR and CoES are estimated using standard parametric methods. For the high-dimensional factor copula model, these measures are estimated via simulation, with 10,000 repetitions per estimation. To ensure the reliability of our simulation-based estimates for the factor copula model, we verify the convergence of key dependence measures. Since accurate estimation of CoVaR and CoES depends on accurately capturing tail dependence, we conduct convergence diagnostics on the tail dependence coefficient (TDC) and tail correlation (TC) derived from the simulated data. TDC quantifies the asymptotic probability of simultaneous extreme events, whereas TC measures the strength of dependence in the distribution tails. For illustration, we randomly selected four institution pairs—(CITS, CJSC), (JKGF, CITS), (CBC, JDTZ), and (BL, HXXF)—and examine the convergence of TDC and TC for these pairs in the 200th trading week (see Fig 1). The fluctuations of TDC and TC were computed over the final 2,000 simulation runs. Following common practice in simulation studies, we consider TDC and TC to have converged when their variations over the final 2,000 runs fall below thresholds of 0.05 and 0.08, respectively. As illustrated in the figure, all four institutional pairs show clear convergence at 10,000 simulations, and the convergence patterns observed for other randomly sampled pairs are similar.

Tables 2 and 3 present the backtesting results for tail-risk networks constructed using the CoVaR and CoES measures, respectively. In each table, Panel A reports results for the network built with the Factor Copula model, Panel B for the bivariate dynamic copula model, and Panel C for the bivariate static copula model.

thumbnail
Table 2. Average backtest p-values and rejection rates for the CoVaR tail risk network.

https://doi.org/10.1371/journal.pone.0351966.t002

thumbnail
Table 3. Average backtest p-values and rejection rates for the CoES tail risk network.

https://doi.org/10.1371/journal.pone.0351966.t003

Panel A of Table 2 shows that the Factor-Copula-CoVaR network has conditional coverage rejection rates of 6.9% and 11.18% at autocorrelation orders 5 and 10, respectively. Notably, the rejection rates are higher for edges involving banking institutions. The results in Panels B and C suggest that both the bivariate dynamic and static Copula models achieve conditional coverage rejection rates exceeding 50%, which is inferior to the Factor-Copula-CoVaR model. The CoES conditional coverage test results in Table 3 corroborate these findings: the high-dimensional Factor Copula model consistently yields lower rejection rates than both bivariate dynamic and static copula models in constructing CoES-based tail-risk networks.

As shown in Panel A of Tables 2 and 3, the Factor-Copula-CoES model exhibits somewhat lower rejection rates in and tests than its CoVaR counterpart. To systematically evaluate the performance of the CoES-based models and verify whether the observed differences are statistically significant, we conduct McNemar tests against a set of benchmark models (including the Static-copula-CoES and Time-varying-copula-CoES). A significant result indicates that the two models differ significantly in their backtesting outcomes, and the model with a lower rejection rate can be considered superior. Table 4 reports the results. The findings show that the Factor-Copula-CoES model yields significantly lower rejection rates than both the Static-copula-CoES and Time-varying-copula-CoES models under the and metrics. Compared with the Factor‑Copula‑CoVaR model, however, the Factor‑Copula‑CoES model does not show a significant advantage in rejection rate under the metric, whereas it remains significantly superior under the metric. The proposed factor‑copula‑CoES model reduces the rejection rates by 56.37%, 47.72%, and 1.96% relative to the static‑copula‑CoES, time‑varying‑copula‑CoES, and factor‑copula‑CoVaR models, respectively, with all reductions being statistically significant. Given CoES’s superior ability to characterize tail loss correlations and its robust performance in backtests, we select the Factor-Copula-CoES model for constructing China’s financial system’s high-dimensional dynamic tail risk network. In addition, to assess whether our tail-risk network construction methodology is robust to the choice of data frequency, we replicate the entire analysis at a daily frequency. The results, detailed in Appendix D, confirm that the core network architecture is consistently captured across frequencies, while also revealing the effects of microstructure noise inherent in daily returns.

thumbnail
Table 4. McNemar test: model comparison of Factor-Copula-CoES and alternative models.

https://doi.org/10.1371/journal.pone.0351966.t004

3.3. Dynamic analysis of the tail risk network

This section constructs a high-dimensional dynamic tail-risk network for China’s financial system (comprising 55 A-share listed institutions) using the selected Factor-Copula-CoES model. To characterize the levels of risk spillover and contagion, we employ topological network analysis to extract key systemic risk features. These include overall network characteristics, node-specific metrics (with particular attention to out-degree and in-degree), and their time-varying trends.

First, the knee-point detection method is employed to determine the network threshold. Fig 2 plots network density, information entropy, and global efficiency of the tail-risk network as functions of the percentile threshold. Because the network examined in this paper is dynamic, all metrics are reported as weekly averages. In general, as the percentile threshold rises, network density, information entropy, and global efficiency all show a monotonic decreasing trend. Both the information‑entropy curve and the global‑efficiency curve exhibit mathematical knee points. The knee point of the information‑entropy curve corresponds to a percentile threshold of 93.5%, while that of the global‑efficiency curve corresponds to a percentile threshold of 88.5%. Given that the primary objective of our tail-risk network analysis is to capture systemic risk propagation—a process reliant on efficient information transmission—we prioritize the threshold identified by the global-efficiency curve (88.5%). This choice is functionally grounded. Global efficiency directly quantifies a network’s integration capacity, which is crucial for understanding risk propagation speed. In contrast, while information entropy provides insights into structural heterogeneity, it does not have a direct bearing on dynamic spillover processes.

thumbnail
Fig 2. Evolution of network global efficiency, information entropy, and density under different percentile thresholds.

https://doi.org/10.1371/journal.pone.0351966.g002

As outlined in Section 2.4, once the threshold is determined, we construct a directed, unweighted tail‑risk network based on the CoES measure. We examine the dynamic evolution of the tail-risk network’s overall characteristics to assess systemic risk across the financial system. Fig 3 displays the weekly trends of network density, transitivity, and reciprocity from 2011 to 2022. These three metrics demonstrate synchronized dynamics and comparable clustering patterns. Notably, all metrics show significant elevations during three crisis periods: (1) the 2013 interbank liquidity crisis, (2) the 2015–2016 stock market collapse, and (3) the 2020 COVID-19 pandemic – each corresponding to major financial turbulence in China. The June 2013 interbank market liquidity crunch triggered severe banking system stress. During 2015–2016, China’s stock market experienced unprecedented volatility, with frequent “limit-down” events (when stocks hit the maximum allowed daily decline) eroding nearly 50% of total market capitalization by June 2016. The 2020 pandemic then transmitted substantial macroeconomic shocks to the financial sector. The responsive fluctuations in our network metrics confirm that the high-dimensional dynamic tail-risk network effectively captures regime shifts in China’s systemic risk profile.

thumbnail
Fig 3. Network density, reciprocity, and transitivity trends from 2011 to 2022.

https://doi.org/10.1371/journal.pone.0351966.g003

We next analyze the dynamic evolution of nodal characteristics in the tail-risk network: all-degree (total connections), out-degree (risk spillover), and in-degree (risk contagion). Figs 4-6 present the weekly trends of these metrics, with industry averages shown for clarity. The main observations from Figs 46 are as follows. (1) Industry-average all-degree, out-degree, and in-degree all show pronounced peaks during the three crisis periods: 2013, 2015–2016, and 2020. (2) On average, out-degree exceeds in-degree across industries. While the evolution of in-degree is relatively consistent (homogeneous) across industries, out-degree trends display substantial heterogeneity. Consequently, the temporal pattern of all-degree closely mirrors that of out-degree, as the latter dominates the variation in total connections. (3) The temporal peaks of out-degree and in-degree (Figs 5 and 6) tend to concentrate in different industries at different times. This indicates that the epicenters of risk spillover and contagion shift across the financial sectors over the sample period.

thumbnail
Fig 4. Node all-degree trends for each sector from 2011 to 2022.

https://doi.org/10.1371/journal.pone.0351966.g004

thumbnail
Fig 5. Node out-degree trends in each sector from 2011 to 2022.

https://doi.org/10.1371/journal.pone.0351966.g005

thumbnail
Fig 6. Node in-degree trends in each sector from 2011 to 2022.

https://doi.org/10.1371/journal.pone.0351966.g006

To visualize the evolving dynamics of the tail-risk network, we divide the 12-year sample into four distinct periods (see Figs 710): 2011–2013, 2014–2016, 2017–2019, and 2020–2022. Our analysis reveals two salient temporal patterns. First, the tail-risk networks are markedly denser during the 2014–2016 and 2020–2022 sub-periods. This heightened connectivity coincides with two major systemic-risk events in China: the stock market crash (2015–2016) and the COVID-19 pandemic (2020–2022). Second, the core of the risk network shifts notably across periods. During 2014–2016, risk spillovers were centered predominantly in the banking and securities sectors. In contrast, the 2020–2022 period saw real estate and other financial sectors become the primary foci of risk.

thumbnail
Fig 7. Weighted adjacency matrix of tail risk network (2011-2013).

https://doi.org/10.1371/journal.pone.0351966.g007

thumbnail
Fig 8. Weighted adjacency matrix of tail risk network (2014-2016).

https://doi.org/10.1371/journal.pone.0351966.g008

thumbnail
Fig 9. Weighted adjacency matrix of tail risk network (2017-2019).

https://doi.org/10.1371/journal.pone.0351966.g009

thumbnail
Fig 10. Weighted adjacency matrix of tail risk network (2020-2022).

https://doi.org/10.1371/journal.pone.0351966.g010

To clearly illustrate the risk spillover and contagion across different financial institutions during China’s various risk events, the financial institutions are classified and ranked based on their cumulative out-degrees and in-degrees to identify the systemically important financial institutions (SIFIs). Three distinct risk periods are identified: June 2013 is characterized as a ‘cash crunch’, 2015−2016 as a ‘stock market crash’, and 2020−2022 as the ‘COVID-19 pandemic’. Tables 5–7 show the financial institution rankings in these three risk periods. Table 5 shows that during the ‘cash crunch’, the primary contributors to the risk spillovers are securities firms. Table 6 shows that during the ‘stock market crash’ period, the tail risk spillovers and contagion are mainly concentrated in the banking sectors. Notably, bank BNB ranks among the top three in both out-degree and in-degree, positioning it as a potentially critical nexus for risk transmission within the network during this period. Table 7 indicates that during the ‘COVID-19 pandemic’, the risk spillovers are mainly associated with some real estate institutions, and the risk contagion is associated with real estate and other financial institutions. Similarly, real estate firm HXXF appears as a key player, ranking highly in both measures, suggesting its central role in risk propagation during the pandemic.

thumbnail
Table 5. Top 10 financial institutions ranked based on in-degree and out-degree sums during the ‘cash crunch’ period.

https://doi.org/10.1371/journal.pone.0351966.t005

thumbnail
Table 6. Top 10 financial institutions ranked based on the in-degree and out-degree sums during the ‘stock market crash’ period.

https://doi.org/10.1371/journal.pone.0351966.t006

thumbnail
Table 7. Top 10 financial institutions ranked based on the in-degree and out-degree sums during the ‘COVID-19’ period.

https://doi.org/10.1371/journal.pone.0351966.t007

4. Network systemic risk determinants

The topological analysis of the tail‑risk network offers a valuable perspective for risk decomposition, enabling us to distinguish between two distinct channels of systemic risk: risk spillover and risk contagion. Unlike conventional systemic‑risk studies that rely on CoVaR (or CoES) metrics, this framework allows for a clear separation of spillover and contagion effects. In this section, we develop an econometric model to examine the determinants of tail systemic risk in Chinese financial institutions and to differentiate how various factors influence risk contagion and risk spillover. Specifically, we estimate an individual fixed‑effects panel regression in which institutional and market determinants of systemic risk serve as explanatory variables, and the total degree, out‑degree, and in‑degree centrality of financial institutions in the tail‑risk network serve as dependent variables:

where is the all-degree, out-degree, and in-degree of financial institution i in the tail risk network in week t; is the individual fixed effects; and k is the explanatory variable lag, which is set at 1, 2, 3, and 4 in subsequent discussions to examine the term structure of the influencing factors. Drawing from Adrian and Brunnermeier [4], as shown in Table 8, six institutional-level variables and seven market-level variables are selected as the explanatory variables. The data for the institutional and market-level explanatory variables are sourced from the Wind database, the institutional-level variables; size, leverage, and mismatch; are quarterly data and the remainder are weekly data. Table 8 shows the definitions and descriptive statistics for the explanatory variables.

thumbnail
Table 8. Definitions and summary statistics for the independent variables.

https://doi.org/10.1371/journal.pone.0351966.t008

Table 9 shows the regression results for the impact of the institutional and market variables on the financial institution’s node all-degree; columns 1–4 respectively lag the explanatory variables by 1, 2, 3, and 4 quarters. Table 9 indicates that the ‘size’ variable consistently exerted a significant positive influence on the institutional network all-degree in all model specifications and implies that the larger institutions tended to have higher systemic risk, which was consistent with existing research [4]. However, the ‘leverage’ variable negatively impacted the network all-degree, which indicated that institutions with higher leverage possibly had reduced systemic risk, which is a different finding from previous international studies [13]. The ‘mismatch’ variable, which indicates an asset-liability duration mismatch, was not found to significantly influence systemic risk, which was possibly because of the relatively low overall duration mismatch in Chinese financial institutions. The stock market price-based variables ‘beta’, ‘vol’, and ‘ES’ all significantly and positively affected institutional systemic risk, which was as expected. The influence of the ‘beta’ regression coefficient remained stable in the different lag quarters, the impact of ‘vol’ grew, and the impact of ‘ES’ decreased.

thumbnail
Table 9. Panel regression results for the dependent all-degree variable.

https://doi.org/10.1371/journal.pone.0351966.t009

At the market level, the variables representing market interest rate changes (‘dib’), corporate credit risk (‘dco’), and overall market volatility (‘vix’) all had a significantly positive influence on the network all-degree. The ‘dco’ variable impact intensified over the lag quarters, the ‘vix’ variable impact decreased, and the ‘dgo’ variable, which represented local government credit risk, had a significant negative impact on the network all-degree, which was not in line with theoretical expectations. As the ‘dgo’ variable indicates the spread between local government financing vehicle bonds and national bonds, a higher ‘dgo’ indicates higher local government bond interest rates, increased financing pressure, or higher local government debt risk. Theoretically, local government debt risk should amplify institutional systemic risk. However, the regression results suggest that the higher ‘dgo’ values correspond to reduced systemic risk. These counterintuitive patterns regarding the liability variable and the ‘dgo’ variable likely stem from the unique institutional structure of China’s financial system, which differs substantially from the mature markets that underpin standard theoretical predictions. We discuss these issues further in the “Future Research” section at the end of the article.

Table 10 shows the varying impacts of the institutional and market-level factors on the institutions’ risk spillovers and contagion. The first three columns show the outcomes when the network in-degree is taken as the dependent variable and the explanatory variables are lagged by 1–3 quarters, and the subsequent three columns show the results when the network out-degree is taken as the dependent variable and the explanatory variables are similarly lagged.

thumbnail
Table 10. Panel regression results for the dependent in-degree and out-degree variables.

https://doi.org/10.1371/journal.pone.0351966.t010

The comparison of the network out-degree and in-degree regression results in Table 10 reveals that the ‘size’ variable significantly influences both the out-degree and in-degree, with the impact on the out-degree being more pronounced. This indicates that an institution’s scale simultaneously amplifies its risk contagion and risk spillovers, especially the latter. As institution size increases, the resilience to risks also rises; therefore, the impact of size on risk contagion is lower than the impact on risk spillover.

The ‘leverage’ variable has a significant negative impact on the in-degree but does not significantly influence the out-degree at a 1% confidence level. This suggests that an increase in institutional leverage can reduce its risk contagion. The ‘beta’ variable significantly affects both the out-degree and in-degree, with a greater impact on the out-degree. Because ‘beta’ represents the undiversified risks in an institution’s portfolio, it has been traditionally seen as a systemic risk indicator. These results confirm that ‘beta’ significantly influences the tail risk levels of the institutions. While the institutional variables ‘vol’ and ‘ES’ both have positive, significant impacts on the individual institution’s out-degrees, their influence on the in-degree is inconsequential. This suggests that even though an institution’s return volatility (‘vol’) and individual tail risk can increase their tail risk spillover, they do not significantly affect the tail risk contagion.

When the market-level variables’ effects on the in-degree and out-degree are compared, the direction and significance of the regression coefficients remain consistent, but their magnitudes differ. The increase in corporate credit risk (‘dco’), the escalation in market volatility (‘vix’), and the reduction in the central government’s implicit guarantee expectations (‘dgo’) all result in significantly positive impacts on risk contagion and spillover, with the influence on risk contagion being more potent than on risk spillover.

In summary, the institutional and market-level factors are found to have different influences on the institutions’ in-degree and out-degree, which highlights the varying risk factor impacts on risk contagion and spillover in financial institutions, that is, institutional-level factors have a stronger influence on risk spillover, but market-level factors have a greater impact on risk contagion. This distinction aligns with the systemic risk literature, where spillover (out-degree) refers to shock propagation through fundamental linkages, and contagion (in-degree) denotes vulnerability to tail events transmitted from distressed peers—a phenomenon that intensifies during crises [4,29]. Our finding that institutional factors primarily drive spillover while market factors shape contagion complements existing evidence on the distinct determinants of systemic risk transmission and vulnerability.

5. Conclusion and future research

This study seeks to estimate the tail risk measures, CoVaR and CoES, in high-dimensional scenarios using the Factor Copula model and validate the efficacy of the tail risk measure estimations through backtesting. Based on the tail risk measure relationships, a dynamic high-dimensional tail risk network is constructed. Previous research on tail risk network construction, which has been limited to indirectly estimating tail risk measures using quantile regression models, has failed to effectively capture the intricate non-linear dependencies between institutions and was not conducive to backtesting. The approach taken in this study improves upon these deficits by incorporating the Factor Copula model and constructing a comprehensive framework for unconditional and conditional backtests for CoVaR and CoES. Based on the backtesting results, an optimal Factor-Copula-CoES model is employed to construct a tail risk network that comprises Chinese banking, real estate, securities, insurance and other financial industries. Using network topology analyses, rich dynamic network characteristics, such as network density and node degree, are extracted. The dynamic network analysis results align with significant Chinese financial market risk events. The quantitative analyses reveal that institutional and market-level factors have varying impacts on institutional in-degrees and out-degrees; they suggest that institutional-level factors have a stronger effect on risk spillover, whereas market-level factors are more influential in driving risk contagion. This study aids regulators and investors in better understanding the systemic finance risk evolution in China and the factors that can influence risk spillover and contagion in financial institutions.

Future research may follow three directions: specificity studies of China’s financial market, construction of multi‑layer tail‑risk networks, and the introduction of graph neural network (GNN)‑based artificial intelligence algorithms.

Two interesting anomalies emerge in the present study of tail‑risk determinants: higher leverage and higher dgo (local government financing vehicle bond spread) are associated with lower systemic risk levels of institutions. According to conventional financial theory, leverage typically represents an institution’s default risk, and the dgo signals local‑government default risk; both should be positively correlated with systemic risk. The anomalies, however, may be explained by the specific institutional context of China’s financial market. In China’s bank‑dominated indirect‑financing system, higher leverage may reflect stronger financing capacity, thereby reducing systemic risk. As for the dgo variable, because local government bonds carry an implicit central‑government guarantee, a rising spread may shift systemic risk toward the central fiscal authority, thus lowering the systemic risk borne by financial institutions. By identifying suitable instrumental variables for institutional financing capacity and the central government’s implicit guarantee, future studies could design mediation‑effect tests to formally examine the anomalies described above.

The tail‑risk network constructed in this paper is a lower‑tail network based on CoES(i|j) relationships. Future studies could build multi‑layer tail‑risk networks by altering the edge‑formation mechanism or by constructing upper‑tail risk networks. For instance, edge relationships could be defined based on the similarity of CoES measures across nodes, which may better capture risk resonance and diversification. Moreover, combining upper‑ and lower‑tail networks would provide a more comprehensive characterization of systemic tail risk. These ideas open new possibilities for constructing multi‑layer tail‑risk networks.

The dynamic tail‑risk network constructed here also creates opportunities for introducing graph‑based artificial intelligence algorithms such as graph neural networks (GNNs). In recent years, GNNs have advanced rapidly, owing to their strong ability to mine graph‑structured information and their high accuracy in classification and prediction tasks, and have been widely adopted in transportation, computer science, and social‑network analysis. However, their application in financial risk research faces two major challenges: correlation‑based models often suffer from the “curse of dimensionality,” and GNN‑type algorithms typically require large volumes of data. The high‑dimensional dynamic tail‑risk network proposed in this paper provides rich feature data for GNNs and allows them to fully utilize their network‑information mining capabilities. The integration of graph neural networks with tail‑risk networks shows considerable promise in financial domains such as systemic risk identification and prediction, as well as portfolio optimization.

Supporting information

S1 Appendix. Proofs and simulations of CoES backtesting.

https://doi.org/10.1371/journal.pone.0351966.s001

(DOCX)

S2 File. Data and code used in this article.

https://doi.org/10.1371/journal.pone.0351966.s002

(ZIP)

References

  1. 1. Du Z, Escanciano JC. Backtesting Expected Shortfall: Accounting for Tail Risk. Management Science. 2017;63(4):940–58.
  2. 2. Acerbi C, Tasche D. On the coherence of expected shortfall. Journal of Banking & Finance. 2002;26(7):1487–503.
  3. 3. Kratz M, Lok YH, McNeil AJ. Multinomial VaR backtests: A simple implicit approach to backtesting expected shortfall. Journal of Banking & Finance. 2018;88:393–407.
  4. 4. Adrian T, Brunnermeier MK. CoVaR. American Economic Review. 2016;106(7):1705–41.
  5. 5. Barnett WA, Wang X, Xu H-C, Zhou W-X. Hierarchical contagions in the interdependent financial network. Journal of Financial Stability. 2022;61:101037.
  6. 6. Brunetti C, Harris JH, Mankad S. Networks, interconnectedness, and interbank information asymmetry. Journal of Financial Stability. 2023;67:101163.
  7. 7. Hautsch N, Schaumburg J, Schienle M. Financial Network Systemic Risk Contributions. Review of Finance. 2014;19(2):685–738.
  8. 8. Härdle WK, Wang W, Yu L. TENET: Tail-Event driven NETwork risk. Journal of Econometrics. 2016;192(2):499–513.
  9. 9. Chen CY-H, Härdle WK, Okhrin Y. Tail event driven networks of SIFIs. Journal of Econometrics. 2019;208(1):282–98.
  10. 10. Bonaccolto G, Caporin M, Maillet BB. Dynamic large financial networks via conditional expected shortfalls. European Journal of Operational Research. 2022;298(1):322–36.
  11. 11. Girardi G, Tolga Ergün A. Systemic risk measurement: Multivariate GARCH estimation of CoVaR. Journal of Banking & Finance. 2013;37(8):3169–80.
  12. 12. Reboredo JC, Ugolini A. Systemic risk in European sovereign debt markets: A CoVaR-copula approach. Journal of International Money and Finance. 2015;51:214–44.
  13. 13. Karimalis EN, Nomikos NK. Measuring systemic risk in the European banking sector: a copula CoVaR approach. The European Journal of Finance. 2017;24(11):944–75.
  14. 14. Oh DH, Patton AJ. Time-Varying Systemic Risk: Evidence From a Dynamic Copula Model of CDS Spreads. Journal of Business & Economic Statistics. 2017;36(2):181–95.
  15. 15. Opschoor A, Lucas A, Barra I, van Dijk D. Closed-Form Multi-Factor Copula Models With Observation-Driven Dynamic Factor Loadings. Journal of Business & Economic Statistics. 2020;39(4):1066–79.
  16. 16. Chang J, Hao X, Chen Z. Research on structural breaks in dependence: dynamic factor Copula models. Applied Economics Letters. 2025;:1–8.
  17. 17. Chen Z, Zhou J, Hao X. Dynamic factor copula-based modeling for market risk optimization with an application to the real industry in China. Journal of Innovation & Knowledge. 2023;8(4):100453.
  18. 18. Escanciano JC, Pei P. Pitfalls in backtesting Historical Simulation VaR models. Journal of Banking & Finance. 2012;36(8):2233–44.
  19. 19. Du Z, Pei P. Backtesting portfolio value‐at‐risk with estimated portfolio weights. Journal Time Series Analysis. 2020;41(5):605–19.
  20. 20. Banulescu-Radu D, Hurlin C, Leymarie J, Scaillet O. Backtesting Marginal Expected Shortfall and Related Systemic Risk Measures. Management Science. 2021;67(9):5730–54.
  21. 21. van Oordt M, Zhou C. Systemic risk and bank business models. J of Applied Econometrics. 2018;34(3):365–84.
  22. 22. De Jonghe O, Diepstraten M, Schepens G. Banks’ size, scope and systemic risk: What role for conflicts of interest?. Journal of Banking & Finance. 2015;61:S3–13.
  23. 23. Tabak BM, Fazio DM, Cajueiro DO. Systemically important banks and financial stability: The case of Latin America. Journal of Banking & Finance. 2013;37(10):3855–66.
  24. 24. Brunnermeier MK, Oehmke M. The Maturity Rat Race. The Journal of Finance. 2013;68(2):483–521.
  25. 25. Silva AF. Strategic Liquidity Mismatch and Financial Sector Stability. The Review of Financial Studies. 2019;32(12):4696–733.
  26. 26. Beltratti A, Stulz RM. The credit crisis around the globe: Why did some banks perform better?. Journal of Financial Economics. 2012;105(1):1–17.
  27. 27. Sklar A. Fonctions de répartition à n dimensions et leurs marges. Publications del’Institut de Statistique de L’Université de Paris. 1959;8:229–31.
  28. 28. Bellavite Pellegrini C, Cincinelli P, Meoli M, Urga G. The contribution of (shadow) banks and real estate to systemic risk in China. Journal of Financial Stability. 2022;60:101018.
  29. 29. Forbes KJ, Rigobo R. No Contagion, Only Interdependence: Measuring Stock Market Comovements. The Journal of Finance. 2002;57:2223–61.