Figures
Abstract
With the ongoing evolution of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) and its increasing adaptation to humans, several variants of concern (VOCs) and variants of interest (VOIs) have been identified since late 2020. These include Alpha, Beta, Gamma, Delta, Omicron parent lineage, and other variants. These variants may show distinct levels of virulence, antigenicity, and infectivity, which require specific defense and control measures. In this study, we propose an infectious disease model to simulate the spread of SARS-CoV-2 variants among the human population. We combine the proposed epidemic model and reported infected data of variants with physical information neural networks (PINNs) to develop a novel mechanism called VOCs-informed neural network (VOCs-INN). In our experiments, we found that this algorithm can accurately fit the reported data of the British Columbia (BC) province and its five internal health agencies in Canada. Furthermore, it can simulate observed or unobserved dynamics, infer time-dependent parameters, and enable short-term predictions. The experimental results also reveal variations in the intensity of control strategies implemented across these regions. VOCs-INN performs well in fitting and forecasting when analyzing long-term or multi-wave data.
Author summary
Epidemiologists and mathematicians use epidemic models expressed by parameterized differential equations to describe the complex dynamics of COVID-19 transmission. The model’s parameters are possibly time-dependent and adjusted to fit the observed data. However, traditional parameter estimation methods usually assume the parameters are constant functions or piecewise constant functions, making it challenging to capture the time evolution of SARS-CoV-2 variants transmission among the population.
To address this issue, we propose an model to simulate the spread of SARS-CoV-2 variants among the human population, then use VOCs-INN to identify time-dependent parameters and learn unobserved dynamics both represented by neural networks. Even with limited observed data, we reconstruct the full-spectrum dynamics and forecast the pandemic of two VOIs in the BC province and its five internal health agencies. Our research findings reveal an interesting aspect of disease transmission dynamics, namely the diversity of control measures in different regions, which is reflected in the different shapes and values of removed rates in different areas within the same region. We also successfully reproduced dynamic features that were not observed in two variants of the virus, including their competition and coexistence relationships.
In addition, we conducted two interesting comparison experiments in the discussion section. One is the comparison of the proposed VOCs-INN method’s fitting and prediction performance with the nonlinear least squares on the daily confirmed cases from Italy. Another experiment explores the impact of physical information on the performance of the proposed VOCs-INN method, focusing on three key aspects: the influence of weights, contraction factors, and time interpolation points. Furthermore, the adaptability of the VOCs-INN framework enables its customization to diverse epidemiological scenarios and datasets, furnishing a robust instrument to tackle the intricacies and uncertainties inherent in epidemic challenges.
Citation: Li W, Chen X, Liu S, Zhang C, Liu G (2025) Using a multi-strain infectious disease model with physical information neural networks to study the time dependence of SARS-CoV-2 variants of concern. PLoS Comput Biol 21(2): e1012778. https://doi.org/10.1371/journal.pcbi.1012778
Editor: Roger Dimitri Kouyos, University of Zurich, SWITZERLAND
Received: August 26, 2024; Accepted: January 10, 2025; Published: February 14, 2025
Copyright: © 2025 Li et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: The epidemiological data of BC province and its five internal regions in the main text can be obtained through the following link: https://bccdc.shinyapps.io/genomic_surveillance/ The epidemiological data of COVID-19 in Italy under discussion available at the following repository: https://github.com/CSSEGISandData/COVID-19. The source code in Python is available at the following repository: https://github.com/jluorganization/VOCs-INN.
Funding: S.L. was supported by the National Natural Science Foundation of China (Grant Number: 12301627), the Science and Technology Research Projects of the Education Office of Jilin Province, China (Grant Numbers: JJKH20250046KJ, JJKH20241240KJ), and the Technology Development Program of Jilin Province, China (Grant Number: 20210508024RQ). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
The evolution and mutation of viruses are typical processes that allow them to adjust to various environments and hosts. SARS-CoV-2, the virus that causes COVID-19, continues to evolve and mutate. Since October 2020, several SARS-CoV-2 variants with significant mutations have emerged and been classified by the WHO as VOCs (variants of concern) or VOIs (variants of interest). These variants exhibit differences in transmissibility, disease severity, immune evasion capabilities, and responsiveness to treatments. According to the WHO’s updated definition in March 2023, compared to VOIs, VOCs represent major evolutionary advancements and necessitate substantial public health measures. To evaluate the risk, simulating the transmission mechanism within the population and determining the time-dependent characteristics of SARS-CoV-2 variants is crucial.
Epidemiological models, represented by parameterized differential equations, are essential tools for studying the patterns of COVID-19 transmission and developing prevention and control strategies [1–7]. The input parameters, which may vary over time, can impact the model’s predictability. Various methodologies can be used to fit the trend of disease progression and determine the values or ranges of parameters. These methods primarily depend on mathematical models and statistical principles, along with the analysis of actual epidemiological data, such as the least squares method [8], the Markov Chain Monte Carlo (MCMC) approach [9], and the iterative filtering method [10].
The methods mentioned above may face some limitations in practical applications. Furthermore, factors like intervention measures, human activities, and virus mutations could potentially alter the parameters of the epidemic model over time. However, most parameter estimation techniques assume these parameters remain constant over time. Therefore, it can be difficult to accurately estimate how these parameters change over time and capture the complex dynamics of infectious diseases in real-world situations using these methods. Recently, deep neural networks (DNNs) [11] have emerged as a strong approach for tackling complex problems in various fields. They have been utilized not only in areas like autonomous driving [12], image recognition [13], and natural language processing [14], but also in predicting and detecting the spread of epidemics.
During the COVID-19 pandemic, deep neural networks have proven to be useful in developing various simulation frameworks for predicting the transmission dynamics of the epidemic [15,16]. In these studies, Arora et al. employed different versions of Long Short-Term Memory (LSTM) within Recurrent Neural Networks (RNN) to estimate the number of infected cases in India [17]. Shahid et al. compared predictive models such as Autoregressive Integrated Moving Average (ARIMA), Support Vector Regression (SVR), LSTM, and Bi-directional LSTM (Bi-LSTM) to forecast confirmed cases, deaths, and recoveries in ten major countries affected by COVID-19 [18]. Similarly, Zhou et al. assessed predictive models including LSTM, Bi-LSTM, Generalized Regression Neural Network (GRU), and Dense-LSTM for predicting the trend of confirmed cases, deaths, and recoveries in the time series data of twelve major countries impacted by COVID-19 [19]. Other research has also adopted similar methodologies [20–22]. These studies demonstrate the varied applications of neural network architectures in epidemiological forecasting during the global health crisis. Although these models exhibit good data fitting and short-term prediction abilities, their failure to account for epidemiological transmission mechanisms complicates the accurate interpretation of disease spread patterns. To tackle this issue, several researchers have incorporated compartmental models into Physics-Informed Neural Networks (PINNs). Their work indicates that this approach enables the estimation of temporal changes in model parameters with reasonable accuracy and provides dependable predictive methods [23–25]. This, in turn, offers plausible explanations for the efficacy of certain control strategies and the fundamental dynamics of epidemic transmission. For example, Kharazmi et al. applied PINNs to analyze various variants using the classic SIR model, with the aim of determining time-dependent parameters [26]. Ning et al. combined compartmental models with DNN to create Epidemiological priors informed deep neural networks (Epi-DNN), which aids in simulating the complex dynamics of COVID-19. They tested the effectiveness of Epi-DNN using real COVID-19 data from the Shanghai Omicron outbreak [27]. He et al. introduced Transmission Dynamics Informed Neural Networks (TDINN), which encode two versions of the compartmental model into DNN to predict time-dependent parameters [28,29]. They validated these predictions using real COVID-19 data from various regions and provided thoughtful interpretations of epidemic-related interventions. Saikia et al. utilized PINNs to analyze the ratio of undetected to detected active infections during the first and second waves of COVID-19 in India, giving a realistic picture of the Indian pandemic [30]. These studies demonstrate that the use of PINNs can estimate time-varying parameters within compartmental models with accuracy and offer a fresh perspective on understanding epidemic spread mechanisms, providing useful insights for the development of effective prevention and control strategies.
In this study, we introduced a new VOCs-INN algorithm, which integrates epidemiological data, deep learning, and epidemiological models to estimate the transmission intensity of various COVID-19 strains during the pandemic. It should be noted that the VOCs-INN algorithm emphasizes learning underlying patterns in multivariate data to improve model optimization and achieve accurate fitting. This algorithm is capable of fitting multi-source epidemic data effectively, and it can also be applied in scenarios involving a single virus. In such cases, the multi-strain model can be simplified to the classical SIR model with time-varying parameters, as proposed in 1927. Additionally, we applied this algorithm to analyze data from two VOIs in British Columbia (BC) and five internal health institutions in Canada. The VOCs-INN algorithm also demonstrated good performance in simulating both observed and unobserved dynamics, facilitating time-dependent parameter inference and enabling short-term predictions. Furthermore, we compared the VOCs-INN algorithm with traditional fitting methods, showcasing its advantages in fitting accuracy and predictive performance. This analysis further supports the effectiveness and applicability of our proposed algorithm.
The remainder of this study is structured as outlined below. In the subsequent section, we will introduce the multi-strain infectious disease model and deep neural networks, establish the VOCs-INN algorithm, present the datasets utilized, and outline the experimental settings. Subsequently, we will share the key findings related to data fitting, parameter estimation, and predictions. We will also delve into the impact of time-varying transmission rates and the effective reproduction number on COVID-19 transmission, and validate the accuracy of the estimated parameters. Additionally, we will discuss comparisons between our proposed algorithm and traditional methods. Finally, in the discussion section, we will summarize our research findings.
Methods
Model formulation
Based on the fundamental SIR epidemiological model, we propose an epidemic model with VOCs. The transfer diagram is shown in Fig 1. Denote the number of VOCs by n, and the population is divided into susceptible individuals S(t), infectious individuals
infected by the i-th VOC (1 ≤ i ≤ n), and recovered individuals R(t). Additionally, we include auxiliary compartments,
, to keep track of the cumulative confirmed cases of infection by the i-th VOC. The parameters may change over time due to various factors, including government interventions, shifts in human behavior, virus mutations, and vaccination efforts. Therefore, to simulate the actual transmission dynamics of the pandemic accurately and effectively, we assume that the transmission rates and removal rates are functions of time, denoted by
and
. Table 1 describes the meaning of state variables and parameters.
Consequently, the corresponding model is expressed as follows:(1)
with intital conditions , and
. Here N denotes the total population and
Model (1) is both epidemiologically and mathematically well-posed.
We define the effective reproduction number for model (1) to be
Additionally, define the basic reproduction number by . The basic reproduction number
represents the average number of secondary infections caused by a single infected individual in a susceptible population without any interventions. On the other hand, the effective reproduction number
dynamically quantifies the secondary infections caused by an infected individual in a population made up of both susceptible and non-susceptible host, and
decreases as S ( t ) decreases. Therefore, both
and
are crucial indicators for policymakers in formulating prevention and control strategies.
helps in predicting the trajectory of an outbreak, while
provides real-time assessments of the effectiveness of implemented measures. A high value of
indicates the need for intensified interventions, whereas sustained low levels may justify a gradual relaxation of restrictions.
Physical information neural networks (PINNs)
Physics-informed neural networks [31] (PINNs) are data-driven neural network algorithms initially used to solve forward problems (approximate the solutions) and inverse problems (identify time-varying parameters) of partial differential equations (PDEs) [32–34]. In handling PDEs and time-varying parameters, PINNs have demonstrated unique advantages, making them highly promising for parameter inferences in epidemiological models. Recent studies [23–30] have shown that the PINNs framework can not only accurately fit data with compartmental models but also stably and effectively estimate time-varying parameters. Inspired by these studies, we propose a VOCs-INN method that can accurately fit data and infer complex time-varying parameters by encoding real VOCs infection data, epidemiological transmission mechanisms, and ODEs into neural networks.
Specifically, we designate and
as two independent deep neural networks with the time variable t as the input, and their weights and biases are parameterized by
and
, respectively. Let
where is used to fit data with the
model (1) and approximate the solutions of the model;
comprises 2n independent deep neural networks, each utilized to identify different transmission rates
or removal rates
.
is used to fit the state variables of the model (1) (represented by the pink shaded area), while
is employed to infer the time-varying parameters (represented by the yellow shaded area). Specifically,
and
are used to infer
and
, respectively.
denotes the automatic differentiation operator, and the LOSS consists of two components,
and
. By minimizing the LOSS, simultaneous data fitting and inference of time-varying parameters can be achieved.
Fig 2 presents the visualization of the two aforementioned networks. Additionally, Fig 2 illustrates the loss composition of VOCs-INN. The loss of VOCs-INN has two parts. The first part, , is the mean squared error between the compartmental data outputted by network
and the real data:
(2)
where represents the number of training data, and
represents the newly confirmed infected individuals. It should be noted that while includes calculations for data from different compartments, only some compartments have real data support. Therefore, it is necessary to adjust the calculation of
based on available data.
Additionally, the solutions outputted by and the time–varying parameters identified by network
should adhere to the ordinary differential equations defined by
model. Therefore, the second part of the loss is defined as
, serving as a penalty term to ensure that the neural network outputs conform to the ordinary differential equations. The form of
is as follows:
(3)
where represents the number of residual points, which are randomly selected from the entire computational domain and
(4)
By continuously minimizing the loss function
to train the deep neural network, we have determined the optimal fitting network parameters and
. With these optimal network parameters
and
, we have effectively achieved data fitting and identification of time-varying parameters.
Experiment settings
We collected COVID-19 datasets from the province of British Columbia (BC) Centre for Disease Control (CDC) in Canada [35]. BC and its five regions datasets from March 26, 2023, to January 7, 2024, are shown in Figs 3 and Supporting information, respectively. During this period, two main VOIs were XBB.1.16.* and XBB.1.5.*. They are both mutant variants of the XBB family and have inherited some of its characteristics [36]. The XBB. 1.5 and XBB. 1.5.* variants were first discovered in Australia, India, and Bangladesh, while the XBB. 1.16 and XBB. 1.16.* variants were first discovered in India [37]. They emerged independently from one another, and each rapidly became dominant, regionally or globally, outcompeting previous variants. They also show distinct levels of virulence, antigenicity, and infectivity. Our datasets are limited to the weekly new infection data ( and weekly cumulative infection data
) of XBB.1.16.* and XBB.1.5.* respectively.
The graph on the left shows weekly new confirmed and weekly cumulative infected data for XBB.1.16.* from 26 March 2023 to 7 January 2024. The graph on the right shows weekly new confirmed and weekly cumulative data for XBB.1.5.* from 26 March 2023 to 7 January 2024. Both graphs have the horizontal axis in days and the vertical axis in number of people.
To control the spread of the COVID-19 pandemic, the BC government implemented a series of measures. These measures were accompanied by the release of related reports, which can be organized into a timeline depicted in Fig 4. In Fig 4, the five nodes marked with large circles represent extremely important events or milestones. These nodes are the core of the analysis and have a significant impact on overall trends or results. In contrast, the other five nodes marked with small circles, while still important, have relatively low importance and may represent secondary events or transitional stages on the timeline. All reports and intervention measures are publicly accessible through official websites, providing a comprehensive record for analytical and research purposes.
When fitting the data with the model, there are four time-dependent parameters
and
need to infer. The initial values of each compartment and the prior recovery rate ranges are summarized in Table 2. It is worth noting that the initial value R ( 0 ) for the removal population is assumed to be 0, which is mainly based on the XBB variant’s high immune escape ability. In addition, the data period we fitted here is nearly nine months, far beyond the safe window period after infection. We predefine hyperparameters to effectively train the neural network and summarize them in Table 3. As shown in Table 3, we present the hyperparameter values of the deep neural networks
,
,
,
and
for different regions. For instance,
for the BC province indicates that the neural network has five hidden layers, each containing 32 neurons. During training, the Adam and L-BFGS optimizers optimize the neural network parameters to minimize the loss function LOSS. Firstly, the neural network is optimized using the Adam optimizer, and the number of iterations for the Adam optimizer is provided in Table 3, with a fixed learning rate of
. Subsequently, the L-BFGS optimizer is used to further optimize the model, with a maximum of
iterations, terminating early if the default conditions are met. Based on our datasets, a more precise VOCs-INN framework is shown in Supporting information. The hyperbolic tangent function Tanh [44] is also used as the activation function. The VOCs-INN algorithm utilizes the open-source library TensorFlow-GPU 1.14 [45] in Python for automatic differentiation and deep learning computations.
Results
By combining VOCs-INN with infectious disease models, we have been able to fit epidemic data accurately and estimate parameters effectively. This method shows the time-varying nature of the parameters and can also make short-term predictions, making it useful for real-time disease monitoring and control. VOCs-INN also demonstrates its strong ability to learn and capture the underlying patterns of the dynamics.
Data fitting and parameter inferences
We used the model and datasets from BC and its five regions to train our VOCs-INN. The fitting results for the weekly and cumulative reported cases for the two VOIs of BC are presented in Fig 5. Additionally, fitting results for the five inner regions are provided in Supporting information, and extrapolation results for the unobserved data are available in Supporting information. In Fig 5, we show the time-dependent parameter estimation results. The red dashed lines represent the fitted curves for the transmission rates
and
, while the blue dashed lines represent the fitted curves for the removal rates
and
.
(a) Two regional blocks represent newly confirmed cases and cumulative confirmed cases. The red and dark blue dashed lines represent the values of and
inferred using VOCs-INN, while the purple and light blue lines represent newly confirmed cases and cumulative confirmed cases fitted using VOCs-INN. (b) Similar to the above. Both graphs have the horizontal axis in days and the vertical axis in number of people.
To quantify the goodness-of-fit of our model, we use the coefficient of determination (), which is a statistical measure of how observed outcomes are replicated by the model, based on the proportion of total variation of outcomes [46]. The
is less than 1, and an
of 1 indicates that the model predictions perfectly fit the data. The
for our model can be expressed mathematically as follows:
(5)
Here, represents the observed values,
represents the fitted values, ȳ is the mean of the observed values, and n is the number of observations. Based on our fitting results in Fig 5, we calculate the
values of two fitting sets and obtain
of 0.9979 for XBB. 1.16.* and 0 . 9865 for XBB. 1.5.*. These indicate that our model predictions fit the data almost perfectly. Also, they confirm the reasonableness and creditability of our parameter estimations. When comparing the estimated parameters with the actual data, we find considerable synchronism between the transmission rates and the fluctuations in the data. It was observed that there is always a time lag between the peak of transmission rate and the peak of reported cases per day in [29]. However, our results do not demonstrate this phenomenon, primarily because we use weekly data. Furthermore, the removal rates
are relatively stable; this is because
are the average infectious periods of the two VOIs. We also observe that the curve of
is flatter than that of
, while the fluctuations of
are more noticeable, and the value of
at the end of time is more significant than that of
. This suggests that XBB.1.5.* was more dominant during this period, which is consistent with the findings of [8].
The two blue lines represent the basic reproduction numbers and effective reproduction numbers of XBB.1.16.* variant, the two yellow lines represent the basic reproduction numbers and effective reproduction numbers of XBB.1.5.* variant, and the red dashed line represents the value when the effective reproduction numbers of the two VOIs are the same.
Based on the estimated time-dependent functions of the parameters, we have calculated the basic reproduction numbers, and
, as well as the effective reproduction numbers,
and
, for two VOIs of BC. These results are presented in Fig 6. The corresponding findings for the other five internal regions are shown in Supporting information. The estimated values for the basic reproduction numbers are approximately
and
. Additionally, the effective reproduction numbers vary over time with
and
.
Inference comparison of BC province with its five internal regions
The blue dashed line represents the transmission rate of the XBB.1.16.*, while the purple dashed line depicts the transmission rate of the XBB.1.5.*.
The estimated time-varying transmission rates of two VOIs in BC and its five internal regions using VOCs-INN are depicted in Fig 7. We have the following observations:
- Three internal regions of the BC province, FH, VCH, and VIH, display a similar overall trend to the BC province. [47] provides a visualization of the geographical locations of BC province. This similarity is likely due to their close geographical proximity. FH, VCH, and VIH are located near the coast and distant from the inland, and their proximity to each other may result in higher similarities and consistency in terms of personnel mobility, economic activities, and the implementation of epidemic prevention measures. Consequently, their transmission rates and epidemic progression patterns tend to be similar.
- The trend observed in the IH region differs notably from those in the BC province and other regions. IH is located inland and shares a border with the province of Alberta, which has implemented strict prevention and control policies. This geographical and policy divergence may contribute to the differences in epidemic transmission and prevention measures between IH and other regions. Additionally, due to its proximity to Alberta, IH may be influenced by its prevention policies, resulting in a unique epidemic development trend.
- The trend of change in the NH region seems to be somewhat slower compared to other regions. This could be attributed to NH being the largest region in the BC province with a low population density. In areas with low population density, the speed and scope of epidemic transmission may be influenced by various factors, such as population density, personnel mobility, and medical resources. Therefore, NH may face more significant challenges in responding to the epidemic, leading to a relatively slower change in its transmission rate.
The transmission rates in the BC province are influenced by the interactions among its five internal regions. These findings indicate the complexity and variety of how infectious diseases are transmitted and spread.
Prediction
In this subsection, we evaluate the performance of our proposed VOCs-INN algorithm in forecasting the future trend of the COVID-19 pandemic. We assign the transmission rates and removal rates for the subsequent 16 weeks (from 7 January to 26 April 2024) to the values inferred by the VOCs-INN algorithm at the final time point (7 January 2024), as illustrated in Supporting information (For the transmission rate prediction maps of the five regions, please refer to Supporting information). Using these parameters, we predict the weekly reported cases and cumulative number of infections in BC over the next 16 weeks, as shown in Figs 8 and 9, respectively. The forecasted results for the five regions within BC are presented in Supporting information. Given the lack of information on the data for the confidence intervals, we incorporate uncertainty bounds of 10% and 20% to account for the impact of various control measures on the time-dependent parameters and
, and then propagate this uncertainty to the prediction component. By comparing the predictions with the unused data from the fitting process in Figs 8 and 9, we observe that the time-dependent
model calibrated by the VOCs-INN algorithm predicts the trend of future outbreaks quite well, with the shaded areas differing by a small number of people from the real data.
The red dashed lines represent the training data, the lavender dotted lines represent the validation data, and the purple dashed lines represent the predictions. The shaded areas indicate the uncertainty range obtained by adjusting the transmission rates up or down by 10% and 20%. The enlarged image of the shaded area can be found in Supporting information. The orange vertical dashed line represents the segmentation line between fitting and prediction.
The red dashed line represents the training data, the lavender dotted line represents the validation data, and the purple dashed lines represent the predictions. The shaded areas indicate the uncertainty range obtained by adjusting the transmission rates up or down by 10% and 20%. The enlarged image of the shaded area can be found in Supporting information. The orange vertical dashed line represents the segmentation line between training data and validation data.
Verification
To verify the effectiveness of training, fitting, and estimation results of our proposed VOCs-INN algorithm, we incorporate the time-dependent transmission rates and
estimated by the VOCs-INN algorithm, along with other constant parameters, into the
model. We then utilize the ODE solver in Python to solve the model. In Figs 10 and 11, using the numerical solutions obtained from our mathematical model, we present the weekly and cumulative reported cases for two VOIs. The simulation curves align well with the data. This indicates that the training results are consistent with the transmission mechanisms identified by the mathematical model (see Supporting information for images of the five regions).
The pink dashed line and blue curve represent the data on new cases reported per week and the corresponding numerical solution for new cases reported per week.
The pink dashed line and blue curve represent the data on cumulative reported cases per week and the corresponding numerical solution for cumulative reported cases per week.
Comparison with nonlinear least squares method
To validate the effectiveness of our proposed VOCs-INN, we selected the traditional nonlinear least squares method as a benchmark for comparison. The experimental setup is as follows. We chose the classical SIR model as epidemiological information, i,e. model (1) with a single strain. In the data preparation phase, we selected daily new COVID-19 case data from Italy between November 7, 2021, and June 28, 2022, as our fitting dataset, while data from June 28, 2022, to July 12, 2022, were used to assess prediction performance.
Initially, we attempted to estimate the transmission rate β and removal rate γ by fitting the SIR model to the dataset using the nonlinear least squares method, and the fitting results are shown in Fig 12(a). This figure shows that the real data has multiple peaks, but the least squares fitting result only captures one peak, suggesting that the fitting is not good. This issue is mainly because the transmission rate β is treated as a constant in the least squares method, making it difficult to simulate multiple peaks accurately.
To enhance the fitting effect, we divided the fitting data into three subsets for piecewise nonlinear least squares estimation based on the observed peak patterns in the dataset. The results of piecewise estimation are presented in Fig 12(b). From this figure, we can see that while the piecewise estimation method improves the fitting of peak times and overall trends, some deviations remain. Next, we used the best-fit parameter values of the third stage to solve the ODE model and predict the daily reported cases over the coming two weeks. The prediction results show that this method is able to capture the increasing trend of daily reported cases, but there is still room for improvement in prediction accuracy.
Although the piecewise estimation does improve the fitting effect to a certain degree, the dataset’s segmentation relies on manually identified peak trends, which introduces a significant subjective component. To address this issue, we used the VOCs-INN model for fitting and prediction, shown in Fig 12(c). To compare the fitting and prediction performance of the three methods comprehensively, we calculated their Mean Squared Error (MSE) and coefficient of determination (), with the specific results listed in Table 4. By examining the data in Table 4, it is evident that the VOCs-INN model performs better than the traditional methods in both fitting and prediction.
The impact of physical information
Furthermore, to gain a deeper understanding of the impact of physical information on model performance, we designed the following experiment. By gradually adjusting the weight ratio of relative to
, and carefully observing the changes in the goodness of fitting as the weight ratio increased from 0 to 1. Specifically, we conducted experiments with five weight values: 0 , 0 . 25 , 0 . 5 , 0 . 75, and 1. The results of these experiments are presented in Fig 13. A weight of 0 means that only neutral networks are used for fitting without any physical information. In this case, as illustrated in Fig 13(e), the data fitting seems perfect, however the solution to the ODE doesn’t match the training data, especially during the peak period. As the weight gradually increased from a non-zero value to 1, we observed that the ODE inversion effect gradually improved, while the fitting effect remained relatively stable. This observation underscores the significance of weight selection. To improve the overall performance and prediction accuracy of the algorithm, it is necessary to find a suitable weight ratio between
and
. However, it is important to note that this does not necessarily imply that a higher weight is always advantageous. Excessively high weights may overly emphasize physical information constraints while overlooking the inherent characteristics of the data, potentially resulting in degraded performance in data fitting. Therefore, finding this balance requires a thorough consideration of various factors, including data characteristics, the physical background of the model, and the requirements of practical applications. The precise determination of this balance point is essential for ensuring the validity and reliability of the model.
(a) fitting and prediction using least squares method. (b) Fitting and prediction using piecewise least squares method. (c) Fitting and prediction using VOCs-INN.
Discussion
In our research, we sought to address the complexities of understanding and modeling the transmission dynamics of Variants of Concern (VOCs) by integrating an model with neural networks. This innovative approach, termed VOCs-INN, allows the analysis of time-varying characteristics of VOCs, such as transmission rates, removal rates, and effective reproduction numbers.
The VOCs-INN algorithm was applied to study the spread of two VOCs in the province of BC and its five constituent regions. Our research results indicate that the fitting results of this algorithm align well with real-world data, enabling accurate short-term disease progression predictions. The parameters derived from the neural network are persuasive, and inverse simulations using an ODE solver further validate the accuracy of our estimations.
The alignment of our estimated effective reproduction number in BC province with figures reported in related research articles provides reassurance about the validity of our approach. For instance, Manathunga, S. S et al. [48] used a local regression (LOESS) model to estimate the time-varying reproduction number range of the Omicron variant as 1.21–1.95. At the end of 2022, Uriu, Keiya et al. [49] proposed that the effective reproduction number of XBB.1.5 was 1.2 times higher than that of XBB.1, indicating that XBB.1.5 is more infectious. In January 2023, Bloom Lab [50] reviewed previous reports and estimated the effective reproduction number of XBB.1.5 to be around 1.6. By the end of March of that year, Yamasoba, Daichi et al. [51] noted that XBB.1.16 performed better than other variants in India, with an effective reproduction number 1.13 times that of XBB.1.5, which is generally in line with our BC province estimates.
During this period, the estimated values of and
both showed continuous fluctuations and intersected at three different time points. Both values also demonstrated a general downward trend. The observed fluctuations in the effective reproduction numbers (
and
) can be attributed to two factors. Firstly, during this period, governments took several intervention measures, such as promoting vaccination, enforcing mandatory mask-wearing in hospitals in April, and lifting the compulsory vaccination requirement for health personnel in July [38–42]. These actions might temporarily reduce
, but upon their removal,
could potentially rise again. The observed fluctuations in
may be a result of this cycle of implementing and then lifting these measures. Secondly, in Fig 6(c), we notice that the values of
initially are higher than those of
. Subsequently, they intersect three times, and ultimately, the values of
become higher than those of
. These observations suggest that the XBB.1.16* variant initially had a greater presence, then coexisted and competed with the XBB.1.5* variant, which ultimately became more dominant. This aligns with the findings of [52].
Despite the promising results, it is essential to acknowledge the limitations of our study. In the training of neural networks, particularly in regression tasks, the application of a contraction factor is recognized as a fundamental preprocessing step. This step is vital for enhancing numerical stability and improving the overall effectiveness of the model. In our research, we empirically adjusted the contraction factor based on the total population of various regions, aiming to bolster numerical stability. Supporting information illustrates the impact of different contraction factors on the fitting performance and ODE inversion accuracy of our VOCs-INN algorithm. It can be clearly seen from the figure that the results significantly improve when the contraction factor is under this data.
Moreover, the selection of time interpolation points is another crucial factor that significantly influences the model’s fitting performance and the accuracy of parameter inference. We meticulously aimed to balance data length and model complexity when determining these points. As depicted in Supporting information, an increase in the number of time interpolation points generally enhances the fitting performance and ODE inversion accuracy of VOCs-INN. However, it is worth noting that this improvement is not linear, excessively large numbers of interpolation points can lead to increased computational time and complexity. Therefore, it is important to note that there is still room for improvement.
Another limitation is that the current multi-strain model does not consider factors such as the latency period, associated fatalities, and possible transitions. Furthermore, while the fixed parameters in the model can be adapted as time-varying parameters using VOCs-INN, this method necessitates a larger and more complex parameter network. These limitations provide avenues for future research to refine further and improve the VOCs-INN algorithm.
In conclusion, the integration of Physics-Informed Neural Networks (PINNs) with infectious disease models, particularly through the proposed VOCs-INN approach, represents a significant research contribution. VOCs-INN leverages available observational data and the underlying physical principles governing infectious disease models to capture the time-varying nature of parameters. This approach enhances the predictive accuracy of the models and provides policymakers with more reliable and scientific frameworks for decision-making. The flexibility of VOCs-INN allows it to be adapted to various epidemiological scenarios and datasets, offering a robust tool for addressing the complexities and uncertainties associated with epidemic challenges. As we continue to neural network into the potential uses of VOCs-INN in infectious disease modeling, we aim to integrate it with other advanced machine-learning methods to better fit and predict infectious disease data.
Supporting information
S1 Fig. Weekly confirmed data and weekly cumulative data for the five regions.
https://doi.org/10.1371/journal.pcbi.1012778.s001
(EPS)
S2 Fig. Schematic diagram of VOCs-INN for
model.
is used to fit the state variables of the model (represented by the pink shaded area), while
is employed to infer the time-varying parameters (represented by the yellow shaded area). Specifically,
and
(i=1,2), respectively.
denotes the automatic differentiation operator, and the LOSS consists of two components,
and
. By minimizing the LOSS, simultaneous data fitting and inference of time-varying parameters can be achieved.
https://doi.org/10.1371/journal.pcbi.1012778.s002
(EPS)
S3 Fig. Data fitting and time-dependent parameters inferences for five regions within BC using the VOCs-INN algorithm.
The bar charts display weekly new data and weekly cumulative data. The light blue dashed line represents the best result of the VOCs-INN fitting, while the red dashed line depicts the estimation results of the transmission rates ,
.
https://doi.org/10.1371/journal.pcbi.1012778.s003
(EPS)
S4 Fig. Comparison of the basic and effective reproduction numbers of two VOIs in five regions.
https://doi.org/10.1371/journal.pcbi.1012778.s004
(EPS)
S5 Fig. Prediction of transmission rates
,
for two VOIs in five regions using the VOCs-INN algorithm.
Blue and red are training and validation data, respectively. The shaded areas indicate the uncertainty ranges obtained by increasing or decreasing the transmission rate by 10% and 20%.
https://doi.org/10.1371/journal.pcbi.1012778.s005
(EPS)
S6 Fig. Forecasted transmission rates and removal rates utilizing the VOCs-INN approach.
During the model validation and performance assessment, the blue and red dashed lines represent the training data for the VOCs-INN, while the purple dashed line signifies the validation dataset used to simulate actual prediction scenarios. We calculated the predictive range by adjusting β and γ with an upward and downward fluctuation of 10% and 20%, and presented this range as shaded areas, which gives a probabilistic estimate of the prediction accuracy. The orange vertical dashed line represents the segmentation line between fitting and prediction.
https://doi.org/10.1371/journal.pcbi.1012778.s006
(EPS)
S7 Fig. Weekly reported cases and cumulative cases are predicted for outbreaks in five regions within BC in 16 weeks from 7 January 2024 (i.e., vertical lines) using a two-VOIs model learned with the VOCs-INN algorithm.
https://doi.org/10.1371/journal.pcbi.1012778.s007
(EPS)
S8 Fig. Results of extrapolating the state variable time series for the five regions using VOCs-INN.
https://doi.org/10.1371/journal.pcbi.1012778.s008
(EPS)
S9 Fig. Enlarged image of the predicted portion of daily new cases.
https://doi.org/10.1371/journal.pcbi.1012778.s009
(EPS)
S10 Fig. Enlarged image of the predicted portion of cumulative new cases.
https://doi.org/10.1371/journal.pcbi.1012778.s010
(EPS)
S11 Fig. Parameters inferred from VOCs-INN are used to solve the model using the ODE solver, and the pink dashed lines and blue curves represent the weekly data on new cases reported for the two VOIs in the five regions and the corresponding extrapolated results for the new cases reported each week.
https://doi.org/10.1371/journal.pcbi.1012778.s011
(EPS)
S12 Fig. Comparison of fitting performance and ODE solver accuracy of VOCs-INN across different contraction factor.
The gray line represents the actual data, the blue dashed line represents the VOCs-INN fitting line, and the red dashed line represents the ODE solver’s reverse calculation line. (a) Contraction factor = 1. (b) Contraction factor = 0.1. (c) Contraction factor = 0.01. (d) Contraction factor = 0.001. (e) Contraction factor = 0.0001.
https://doi.org/10.1371/journal.pcbi.1012778.s012
(EPS)
S13 Fig. Comparison of fitting performance and ODE solver accuracy of VOCs-INN across time interpolation points.
The gray line represents the actual data, the blue dashed line represents the VOCs-INN fitting line, and the red dashed line represents the ODE solver’s reverse calculation line. (a) Time interpolation points = 234. (b) Time interpolation points = 300. (c) Time interpolation points = 500. (d) Time interpolation points = 700. (e) Time interpolation points = 900.
https://doi.org/10.1371/journal.pcbi.1012778.s013
(EPS)
References
- 1. Feng Y-X, Li W-T, Ruan S, Yang F-Y. Dynamics and asymptotic profiles of a nonlocal dispersal SIS epidemic model with bilinear incidence and Neumann boundary conditions. J Diff Eq. 2022;335294–346.
- 2. Guo H, Li MY. Global stability of the endemic equilibrium of a tuberculosis model with immigration and treatment. Canad Appl Math Quart. 2011;19:1–18.
- 3. He S, Tang S, Wang W. A stochastic SIS model driven by random diffusion of air pollutants. Phys A: Statist Mech Appl. 2019;532121759.
- 4. Jawad M, Alyobi S, Yassen MF, Weera W. A higher order Galerkin time discretization scheme for the novel mathematical model of COVID-19. Math 2023;8(2):3763–90.
- 5. Paul S, Mahata A, Mukherjee S, Das M, Mali PC, Roy B, et al. Study of fractional order SIR model with MH type treatment rate and its stability analysis. Bullet Biomath. 2024;2(1):85–113.
- 6. Wang X, Wu H, Tang S. Assessing age-specific vaccination strategies and post-vaccination reopening policies for COVID-19 control using SEIR modeling approach. Bull Math Biol 2022;84(10):108. pmid:36029391
- 7. Chen X, Tian B, Xu X, Yang R, Zhong S. A stochastic SIS epidemic infectious diseases model with double stochastic perturbations. Int J Biomath. 2024:17(04);2350040.
- 8. Gao S, Shen M, Wang X, Wang J, Martcheva M, Rong L. A multi-strain model with asymptomatic transmission: application to COVID-19 in the US. J Theor Biol. 2023;565111468. pmid:36940811
- 9. Ciupeanu A-S, Varughese M, Roda WC, Han D, Cheng Q, Li MY. Mathematical modeling of the dynamics of COVID-19 variants of concern: asymptotic and finite-time perspectives. Infect Dis Model 2022;7(4):581–96. pmid:36097594
- 10. Song H, Fan G, Zhao S, Li H, Huang Q, He D. Forecast of the COVID-19 trend in India: a simple modelling approach. 2021.
- 11. Samek W, Montavon G, Lapuschkin S, Anders CJ, Muller K-R. Explaining deep neural networks and beyond: a review of methods and applications. Proc IEEE 2021;109(3):247–78.
- 12.
Bechtel MG, McEllhiney E, Kim M, Yun H. A low-cost deep neural network-based autonomous car. In: 2018 IEEE 24th international conference on embedded and real-time computing systems and applications (RTCSA). IEEE; 2018. .
- 13. Razzak MI, Naz S, Zaib A. Deep learning for medical image processing: overview, challenges and the future. In: Classification in BioApps: automation of decision making. 2018; p. 323–50.
- 14. Goldberg Y. A primer on neural network models for natural language processing. Jair. 2016;57345–420.
- 15. Shorten C, Khoshgoftaar TM, Furht B. Deep learning applications for COVID-19. J Big Data 2021;8(1):18. pmid:33457181
- 16. John CC, Ponnusamy V, Krishnan Chandrasekaran S, R N. A survey on mathematical, machine learning and deep learning models for COVID-19 transmission and diagnosis. IEEE Rev Biomed Eng. 2022;15325–40. pmid:33769936
- 17. Arora P, Kumar H, Panigrahi BK. Prediction and analysis of COVID-19 positive cases using deep learning models: a descriptive case study of India. Chaos Solitons Fractals. 2020;139110017. pmid:32572310
- 18. Shahid F, Zameer A, Muneeb M. Predictions for COVID-19 with deep learning models of LSTM, GRU and Bi-LSTM. Chaos Solitons Fract. 2020;140110212. pmid:32839642
- 19. Zhou L, Zhao C, Liu N, Yao X, Cheng Z. Improved LSTM-based deep learning model for COVID-19 prediction using optimized approach. Eng Appl Artif Intell. 2023;122106157. pmid:36968247
- 20. Wang P, Zheng X, Ai G, Liu D, Zhu B. Time series prediction for the epidemic trends of COVID-19 using the improved LSTM deep learning method: case studies in Russia, Peru and Iran. Chaos Solitons Fract. 2020;140110214. pmid:32839643
- 21. Chandra R, Jain A, Singh Chauhan D. Deep learning via LSTM models for COVID-19 infection forecasting in India. PLoS One 2022;17(1):e0262708. pmid:35089976
- 22. Zeroual A, Harrou F, Dairi A, Sun Y. Deep learning methods for forecasting COVID-19 time-series data: a comparative study. Chaos Solitons Fract. 2020;140110121. pmid:32834633
- 23. Long J, Khaliq A, Furati KM. Identification and prediction of time-varying parameters of COVID-19 model: a data-driven deep learning approach. Int J Comput Math. 2021;98(8):1617–32.
- 24. Nguyen L, Raissi M, Seshaiyer P. Modeling, Analysis and Physics Informed Neural Network approaches for studying the dynamics of COVID-19 involving human-human and human-pathogen interaction. Comput Math Biophys 2022;10(1):1–17.
- 25. Cai M, Em Karniadakis G, Li C. Fractional SEIR model and data-driven predictions of COVID-19 dynamics of Omicron variant. Chaos 2022;32(7):071101. pmid:35907723
- 26. Kharazmi E, Cai M, Zheng X, Zhang Z, Lin G, Karniadakis GE. Identifiability and predictability of integer- and fractional-order epidemiological models using physics-informed neural networks. Nat Comput Sci 2021;1(11):744–53. pmid:38217142
- 27. Ning X, Jia L, Wei Y, Li X-A, Chen F. Epi-DNNs: Epidemiological priors informed deep neural networks for modeling COVID-19 dynamics. Comput Biol Med. 2023;158106693. pmid:36996662
- 28. He M, Tang S, Xiao Y. Combining the dynamic model and deep neural networks to identify the intensity of interventions during COVID-19 pandemic. PLoS Comput Biol 2023;19(10):e1011535. pmid:37851640
- 29. He M, Tang B, Xiao Y, Tang S. Transmission dynamics informed neural network with application to COVID-19 infections. Comput Biol Med. 2023;165107431. pmid:37696183
- 30. Saikia D, Bora K, Bora MP. Counting the uncounted: estimating the unaccounted COVID-19 infections in India. Nonl Dyn. 2024; p. 1–15.
- 31. Raissi M, Perdikaris P, Karniadakis GE. Physics-informed neural networks: a deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations. J Comput Phys. 2019;378686–707.
- 32. Meng X, Li Z, Zhang D, Karniadakis GE. PPINN: Parareal physics-informed neural network for time-dependent PDEs. Comput Methods Appl Mech Eng. 2020;370113250.
- 33. Cai S, Mao Z, Wang Z, Yin M, Karniadakis GE. Physics-informed neural networks (PINNs) for fluid mechanics: a review. Acta Mech Sin 2021;37(12):1727–38.
- 34. Mishra S, Molinaro R. Estimates on the generalization error of physics-informed neural networks for approximating a class of inverse problems for PDEs. IMA J Numer Anal 2021;42(2):981–1022.
- 35. This page contains BC-specific information on COVID-19 genomic surveillance over time by age and by region, based on whole genome sequencing of positive samples sent to the BCCDC Public Health laboratory. 2024 [cited 2024 May 28]. Available from: https://bccdc.shinyapps.io/genomic_surveillance/
- 36. New research: new variant of the new crown, XBB.1.16.*with greater propagation capacity. . 2024 [cited 2024 May 28]. Available from: https://www.shanxi.gov.cn/ztjj/fyyqfk/fyyqyw/202304/t20230417_8370724.shtml.
- 37. VOI GRA(XBB) first detected in the world. 2024 [cited 2024 May 28]. Available from: https://gisaid.org/hcov19-variants/
- 38. Weekly B.C. COVID-19 data for March 30: number in hospital jumps to nearly 300. 2024 [cited 2024 May 28]. Available from: https://vancouversun.com/health/local-health/weekly-bc-covid-numbers-march-30
- 39. B.C. launches spring COVID-19 booster program, targeting vulnerable and unvaccinated. 2024 [cited 2024 May 28]. Available from: https://www.theglobeandmail.com/canada/british-columbia/
- 40. ’It is a concern’: 12 cases of XBB.1.5 COVID-19 variant found in B.C. 2024 [cited 2024 May 28]. Available from: https://bc.ctvnews.ca/mobile/it-is-a-concern-12-cases-of-xbb-1-5-covid-19-variant-found-in-b-c-1.6217823?cache=yes/5-things-to-know-for-tuesday-september-10-2019-1.4585806?contactForm=true.
- 41. COVID-19: there are 119 cases of Arcturus in B.C. Here’s what we know about this latest subvariant. 2024 [cited 2024 May 28]. Available from: https://vancouversun.com/news/local-news/covid-19-what-to-know-arcturus-subvariant-bc-canada
- 42. B.C. officials respond to WHO declaration that COVID-19 is no longer a global health emergency. 2024 [cited 2024 May 28]. Available from: https://bc.ctvnews.ca/b-c-officials-respond-to-who-declaration-that-covid-19-is-no-longer-a-global-6386701
- 43. Coronavirus Disease (COVID-19). 2024 [cited 2024 May 28]. Available from: https://www.who.int/news-room/fact-sheets/detail/coronavirus-disease-(covid-19).
- 44. Fan E. Extended tanh-function method and its applications to nonlinear equations. Phys Lett A. 2000;277(4–5):212–8.
- 45.
Dürr O, Sick B, Murina E. Probabilistic deep learning: with python, keras and tensorflow probability. Manning Publications; 2020.
- 46. Carpenter R. Principles and procedures of statistics, with special reference to the biological sciences. Eugen Rev. 1960;52(3):172. Epub 2025 Jan 17.
- 47. Map of British Columbia Province. 2024 [cited 2025 Jan 17]. https://www.worldatlas.com/maps/canada/british-columbia
- 48. Manathunga SS, Abeyagunawardena IA, Dharmaratne SD. A comparison of transmissibility of SARS-CoV-2 variants of concern. Virol J 2023;20(1):59. pmid:37009864
- 49. Uriu K, Ito J, Zahradnik J, Fujita S, Kosugi Y, Schreiber G, et al. Enhanced transmissibility, infectivity, and immune resistance of the SARS-CoV-2 omicron XBB.1.5 variant. Lancet Infect Dis. 2023;23(3):280–1. pmid:36736338
- 50. Reproductive number Rt at present across Pango lineages. 2024 [cited 2024 Dec 02]. Available from: https://x.com/jbloom_lab/status/1610002769888555010/photo/1
- 51. Yamasoba D, Uriu K, Plianchaisuk A, Kosugi Y, Pan L, Zahradnik J, et al. Virological characteristics of the SARS-CoV-2 omicron XBB.1.16 variant. Lancet Infect Dis. 2023;23(6):655–6. pmid:37148902
- 52. XBB.1.5 Rapid Risk Assessment. 2023. [cited 11 Jan 2023]. Available from: https://www.who.int/zh/mega-menu/health-topics/popular/covid-19