A Hybrid AI-Mathematical approach for epidemic threshold prediction in metapopulation networks: Integrating physics-guided neural networks with spectral graph theory

Etienne Kouokam

doi:10.1371/journal.pone.0344827

Abstract

Predicting the epidemic threshold in contact networks is a central challenge in computational epidemiology. Classical structural approaches based on spectral graph theory—most notably Quenched Mean-Field (QMF) and the recently proposed KSEL (K Spectral Energy of Laplacian) method—deliver fast but approximate predictions. We propose a hybrid AI-mathematical framework that integrates spectral graph features, epidemiological parameters, and epidemiologically motivated soft constraints derived from compartmental theory into a physics-guided neural network (PGNN). Rather than claiming state-of-the-art predictive performance, this work makes three complementary contributions: (i) a rigorous stochastic ground-truth estimation procedure with Monte Carlo uncertainty quantification (median , n = 775 networks); (ii) a systematic comparative evaluation of seven methods—including tree-based models (Random Forest, Gradient Boosting) trained on the same feature set—revealing the conditions under which deep learning surpasses and falls short of simpler baselines; and (iii) a full ablation study and SHAP interpretability analysis identifying the role of individual spectral features and physical constraints as structured regularisers. Evaluated on 775 synthetic networks spanning Erdős–Rényi, Barabási–Albert, Watts–Strogatz, and regular topologies, Gradient Boosting achieves the best predictive accuracy (R² = 0.908, RMSE = 0.0731), while the PGNN (R² = 0.093) offers complementary value through physical consistency and interpretability. These results are established on synthetic benchmarks; application to empirical contact networks (hospital, school, workplace settings) is a natural next step but requires dedicated validation beyond the scope of the present study. Ablation results show that the boundedness constraint is a beneficial regulariser while the stability constraint over-regularises in the low-data regime. Automatic gradient-based calibration of the KSEL coefficient yields topology-dependent optimal values (), substantially departing from the universal constant k = 0.3 of prior work.

Citation: Kouokam E (2026) A Hybrid AI-Mathematical approach for epidemic threshold prediction in metapopulation networks: Integrating physics-guided neural networks with spectral graph theory. PLoS One 21(6): e0344827. https://doi.org/10.1371/journal.pone.0344827

Editor: Americo Cunha Jr, LNCC: Laboratorio Nacional de Computacao Cientifica, BRAZIL

Received: February 25, 2026; Accepted: May 27, 2026; Published: June 18, 2026

Copyright: © 2026 Etienne Kouokam. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: The complete code and dataset are publicly available at: https://doi.org/10.5281/zenodo.19411519 (CC BY 4.0).

Funding: This work was supported by institutional funding from the University of Yaoundé I, Cameroon, and IRD-UMMISCO, Sorbonne Université, France. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Competing interests: The authors have declared that no competing interests exist.

1 Introduction

The propagation of infectious diseases over contact networks is a complex, multiscale phenomenon whose understanding demands multidisciplinary tools combining mathematical analysis, network science, and machine learning. Central to this endeavour is the concept of the epidemic threshold , defined as the critical ratio of infection rate to recovery rate beyond which a stochastic SIS or SIR epidemic transitions from extinction to persistence in a large population. This parameter is intimately related to the basic reproduction number R₀: the epidemic invades if and only if R₀ > 1, which corresponds to operating above .

The dependence of on network structure has been intensively studied over the past two decades. In homogeneous random graphs, coincides with the inverse of the mean degree. In scale-free networks, mean-field analysis by Pastor-Satorras and Vespignani [1] showed that in the thermodynamic limit, implying the absence of an epidemic threshold. The Quenched Mean-Field (QMF) approach of Wang et al. [2] refined this picture by expressing , where is the spectral radius of the adjacency matrix A. This compact formula captures the essential role of network topology yet may underestimate when degree correlations are present or when the network is highly modular.

More recently, Kanyou et al. [3] proposed the KSEL estimator, which exploits the Laplacian energy LE(G) as an additional structural descriptor capturing diffusive dynamics: . With an empirically calibrated constant , KSEL achieves correlation with QMF on 31 networks and outperforms QMF on correlated topologies. Nevertheless, both QMF and KSEL derive from closed-form approximations and can miss complex non-linear interactions between structural and epidemiological factors.

On the mathematical modelling side, the Ross–Macdonald model [4] for vector-borne diseases in patchy metapopulations provides a fully rigorous dynamical framework: equilibrium stability is governed by a spectral condition on a matrix encoding spatial migration, vector density, and transmission rates. While analytically exact, this framework requires solving large systems of ordinary differential equations and is therefore impractical for real-time prediction over thousands of dynamically changing networks.

The gap between fast-but-approximate structural estimators and exact-but-slow mathematical models motivates a hybrid approach. Recent progress in Physics-Guided Neural Networks (PINNs), pioneered by Raissi et al. [5], demonstrates that embedding physical laws directly into the neural network training objective can simultaneously enforce theoretical consistency and capture data patterns invisible to pure mathematical models.

Contributions.

This paper makes three principal contributions:

(i) Rigorous ground-truth estimation and comparative evaluation. We introduce a validated stochastic SIS simulation protocol with per-label Monte Carlo uncertainty quantification ( median = 0.0072, n = 775 networks). Gradient Boosting achieves the best predictive accuracy (R² = 0.908, RMSE = 0.0731), outperforming the PGNN (see the Experimental Methodology and Results sections).
(ii) Physics-guided neural network with ablation analysis. We design a multi-layer perceptron with three epidemiologically motivated soft constraints. Ablation: removing spectral features degrades by −19.9% RMSE; removing by −33.7%; removing L_bound worsens by +25.0% RMSE (see the Hybrid Theoretical Framework and Results sections).
(iii) Automatic KSEL calibration and SHAP interpretability. Gradient descent yields topology-dependent . SHAP identifies , , as dominant predictors (see the Results section).

Main contribution.

This work provides a systematic and reproducible evaluation of physics-guided neural models for epidemic threshold prediction on networks. Our results show that, in low-data regimes ( networks), physics-guided neural networks do not outperform classical tree-based methods (Random Forest, Gradient Boosting) in terms of predictive accuracy. However, they offer complementary advantages as structured regularisers, enabling the integration of domain knowledge and improving interpretability via SHAP attribution. This clarifies the role of such models as principled, constraint-aware approaches rather than performance-oriented alternatives in data-limited settings—a distinction essential for the correct use of hybrid models in network epidemiology. We note that all experiments are conducted on synthetic networks; the extent to which these findings transfer to real-world empirical contact networks (e.g., hospital wards, school settings, conference interactions) remains an open question that we identify as a priority for future work.

The remainder of the paper is organised as follows. The Related Work and Mathematical Foundations section reviews the mathematical foundations. The Hybrid Theoretical Framework section describes the theoretical hybrid framework and Physics-Guided Neural Network architecture. The Experimental Methodology section details the experimental methodology. The Results section presents numerical results. The Discussion section discusses implications and limitations. The Conclusion section concludes.

2 Related work and mathematical foundations

2.1 The ross–macdonald model in fragmented environments

The Ross–Macdonald model was originally developed for malaria dynamics and describes transmission between hosts and mosquito vectors. Auger et al. [4] generalised this model to n spatial patches, allowing host migration but not vector migration. Let x_i(t) denote the prevalence of infected hosts in patch i and y_i(t) the prevalence of infected vectors. The metapopulation dynamics read:

(1)

(2)

where m is the vector density vector, is the migration matrix of hosts (with ), and are the host-to-vector and vector-to-host transmission rates, c is the host recovery rate, and is the vector mortality rate. The basic reproduction number takes the form:

(3)

where restricted to the p patches carrying vectors, and denotes the spectral radius. The main theorem of [4] establishes:

if R₀ ≤ 1, the disease-free equilibrium (DFE) is globally asymptotically stable.
If R₀ > 1, there exists a unique endemic equilibrium that is globally asymptotically stable on the positive orthant.

This structural result reveals the central role of spatial migration encoded in D: increasing connectivity can either accelerate or impede epidemic spread depending on its interaction with of the migration-adjusted system. The spectral bridge between R₀ in (3) and of the contact network will be a key theoretical anchor for our hybrid model (see the Hybrid Theoretical Framework section).

2.2 Structural approaches to epidemic threshold prediction

Let G=(V,E) be an undirected contact network with n = |V| nodes and m = |E| edges. Let A be its adjacency matrix and its Laplacian, where is the degree matrix.

Mean-Field (MF).

Under the homogeneous mixing assumption: , where is the mean degree. MF ignores all topological heterogeneity beyond the mean degree.

Heterogeneous Mean-Field (HMF).

For uncorrelated networks with degree distribution p(k): . HMF captures degree variance but ignores correlations and clustering.

Quenched Mean-Field (QMF).

Introduced by Wang et al. [2], QMF uses the full adjacency spectrum:

(4)

QMF is known to be exact for regular graphs and a lower bound for scale-free networks in the thermodynamic limit [6]. It consistently outperforms MF and HMF, yet systematically underestimates when degree assortativity is positive.

KSEL.

Kanyou et al. [3] proposed the K Spectral Energy of Laplacian estimator using the Laplacian energy , where are eigenvalues of L:

(5)

with empirically calibrated constant . KSEL combines information from the diffusion operator L (via LE) and the epidemic contact matrix A (via ), giving it a complementary theoretical foundation compared with QMF.

3 Hybrid theoretical framework and physics-guided neural network model

3.1 Unified feature representation

Our hybrid approach rests on the observation that is a function of both the network structure and the intrinsic disease parameters:

(6)

where is a complex, potentially non-linear function that existing closed-form estimators approximate with different levels of fidelity. We propose to learn f from data while enforcing physical constraints derived from the Ross–Macdonald theory.

For each network G and parameter pair , we construct a feature vector :

(a) Spectral features: , (algebraic connectivity), LE(G);
(b) Size features: number of nodes n, number of edges m;
(c) Topological moments: , , mean clustering coefficient , network diameter ;
(d) Epidemiological parameters: , , ratio ;
(e) Auxiliary predictors: (used as a feature encoding the linearised spectral constraint).

Note on as a feature versus a benchmark. is included in as an input feature because it encodes the theoretically motivated mean-field baseline in a compact, interpretable form. The benchmarking comparison (see the Results section) evaluates the PGNN against used as a standalone predictor—a fundamentally different role. Concretely, the network receives as one of eleven inputs and learns to correct its systematic bias using additional structural and epidemiological information. This is analogous to residual learning in physics-informed machine learning: the model learns the deviation rather than from scratch, which explains the strong SHAP importance of (see the SHAP Interpretability Analysis section) without implying circular evaluation.

This representation encodes the information used by all existing estimators while enriching it with second-order topological moments and explicit epidemiological parameters.

Note on prediction task and target leakage.

The target is the empirical epidemic threshold estimated via stochastic SIS simulation—not the analytical ratio . A simple regression yields R² = 0.31 on our dataset, confirming that and individually carry dynamical information beyond their ratio. The feature vector therefore includes and separately but excludes their ratio to avoid partial target leakage.

3.2 Physics-guided neural network architecture

We model f as a multi-layer perceptron (MLP) with architecture illustrated in Fig 1:

Input layer: , batch-normalised.
Hidden layers: three fully-connected layers of widths 128, 64, 32 with ReLU activations, Dropout (p = 0.2), and Batch Normalisation between each pair.
Output layer: single neuron with Softplus activation to guarantee .

Download:

Fig 1. Physics-Guided Neural Network architecture.

Input features are batch-normalised and passed through three hidden layers (128–64–32 neurons, ReLU activation, Dropout p = 0.2). The Softplus output guarantees . Three epidemiologically motivated soft constraints are added to the MSE loss: (QMF lower bound), (monotonicity), and (SIS upper bound). The ablation study (Fig 2) reveals that is a beneficial regulariser while over-regularises in the low-data regime. All elements are original and created by the author.

https://doi.org/10.1371/journal.pone.0344827.g001

Hybrid loss function

The training objective combines a data-fidelity term and a physics-consistency penalty:

(7)

where is the mean squared error over training examples, and penalises three types of physical violations:

Stability constraint: For any network with , the epidemic should be sub-critical (consistency with the QMF lower-bound property of the DFE).
Monotonicity constraint: must decrease when increases for fixed , i.e., .
Boundedness constraint: over the training distribution.

Download:

Fig 2. Performance Breakdown by Network Family and PGNN Ablation Study Results.

(A) RMSE breakdown by network topology family for QMF, Random Forest, Gradient Boosting, and PGNN. Tree-based methods dominate across all families. The PGNN advantage is relatively largest on regular graphs (REG), consistent with less heterogeneity reducing the sample-complexity burden. (B) Ablation study: percentage change in RMSE relative to the full PGNN. Green bars indicate that removing the component improves performance (over-regularisation); red bars indicate degradation (component is beneficial). Key findings: is a genuine regulariser (+25.0% RMSE when removed); as an auxiliary feature provides the strongest structural prior (−33.7% RMSE when removed). All elements are original and created by the author.

https://doi.org/10.1371/journal.pone.0344827.g002

The hyperparameter is set by cross-validated grid search. We have verified that increasing beyond 0.5 degrades data-fidelity without improving physical consistency, while values below 0.01 result in constraint violations on out-of-domain networks.

Theoretical basis and scope of validity of the constraints. The three soft constraints are grounded in well-established properties of the SIS epidemic threshold under the QMF approximation [2,6]. Specifically: (i) the stability constraint () encodes the QMF lower bound, which is asymptotically exact for regular graphs and a provable lower bound in the thermodynamic limit for uncorrelated networks; (ii) the monotonicity constraint () follows directly from ; and (iii) the boundedness constraint () follows from the SIS persistence condition . These constraints are theoretically motivated within the regime where the QMF approximation is reliable, namely networks with moderate degree heterogeneity and weak finite-size effects. Outside this regime—and in particular for the ≈18% of networks in our dataset where the stochastic ground truth violates the QMF lower bound due to finite-size effects—the stability constraint acts primarily as a regulariser (preventing extreme predictions) rather than as a theoretically grounded bound. This distinction is consistent with the ablation results of the Results section: is genuinely beneficial while over-regularises precisely because the finite-size regime falls outside the validity of the QMF bound.

Automatic calibration of the KSEL coefficient.

A secondary objective is to demonstrate that gradient-based optimisation can automatically calibrate the parameter k in KSEL (5). We treat k as a trainable scalar and minimise over k alone via Adam, treating as a differentiable model. This replaces the manual grid search in [0.2, 0.5] used by [3] with a principled gradient descent procedure.

4 Experimental methodology

4.1 Ethics statement

This study does not involve human participants, animal experiments, or identifiable human data. No ethics approval was required.

4.2 Synthetic network dataset generation

We generated 775 undirected networks with nodes, divided among four families:

Erdős–Rényi (G(n,p)): , 194 instances.
Barabási–Albert (scale-free, preferential attachment with ): 194 instances.
Watts–Strogatz (small-world, ring with initial neighbours and rewiring probability ): 194 instances.
Regular and empirical (random d-regular graphs and contact traces from published datasets): 194 instances.

For each network G_i, we sample epidemiological parameters uniformly: , . The true epidemic threshold is estimated via stochastic SIS Monte Carlo simulation: we identify the critical at which the probability of epidemic outbreak transitions from <10% to >90% over 1,000 independent realisations. This produces an accurate continuous ground truth label for each (G_i, , ) triplet.

The dataset is partitioned into train / validation / test splits of 70% / 15% / 15% (542 / 116 / 116 networks), stratified by network family and size decile to ensure balanced coverage.

Uncertainty estimation.

To account for stochastic variability in epidemic simulations, each threshold estimate is computed over R = 200 independent Monte Carlo realisations. We report the per-label uncertainty (median = 0.0072, 95th percentile = 0.0219) derived from logistic curve propagation, confirming that the ground-truth labels are statistically reliable. Networks with are excluded from the dataset (1 of 776 attempts).

Robustness and sensitivity of the epidemic threshold estimator. The susceptibility-peak criterion is a widely used operational estimator for the epidemic threshold in computational epidemiology [2]. Its robustness depends on the sharpness of the phase transition, which varies across network topology and size. For the four families studied here (ER, BA, WS, REG), the transition is sufficiently sharp that the susceptibility peak is well-defined and stable across realisations, as evidenced by the low median uncertainty . However, this estimator can be sensitive to the choice of simulation parameters in specific regimes: networks with very low edge density (sparse ER, p < 0.02) exhibit a broader transition, leading to higher per-label uncertainty; such networks are flagged and excluded when . A multi-criteria approach combining the susceptibility peak with persistence-time and final-size estimators would provide a more robust reference for heterogeneous or strongly modular networks, and is identified as a direction for future work (see Section 6).

4.3 Training protocol

All models are implemented in PyTorch and optimised with Adam (, exponential decay factor 0.95 every 20 epochs) for a maximum of 200 epochs with early stopping (patience = 20 epochs). A 5-fold cross-validation on the training set guides hyperparameter selection: layer widths, , and Dropout rate .

We evaluate seven models:

(i) QMF: (closed-form).
(ii) KSEL: .
(iii) Linear regression on .
(iv) Random Forest (RF): 200 trees, max depth 10.
(v) Gradient Boosting (GB): 200 estimators, lr = 0.05.
(vi) Standard MLP: PGNN architecture, .
(vii) PGNN (ours): full model, .

To obtain robust performance estimates, we use extbf5-fold cross-validation repeated over 5 independent random seeds (25 (seed, fold) combinations total). Results are reported as mean ± standard deviation over these 25 runs. Statistical significance of pairwise differences is assessed by the Wilcoxon signed-rank test on the 25 paired RMSE values.

Statistical robustness.

All reported results are averaged over 5 independent runs with different random seeds, each combined with 5-fold cross-validation, yielding (seed, fold) evaluations per method. We report mean performance metrics alongside standard deviations to assess variability and robustness. Statistical significance of pairwise differences is assessed by the Wilcoxon signed-rank test on the 25 paired RMSE values.

5 Results

5.1 Predictive performance

Table 1 reports test-set performance for all methods. Fig 3 visualises predicted versus true values.

Download:

Fig 3. Comparative performance of seven methods on the 116-network test set.

(A) RMSE: lower is better. Gradient Boosting (RMSE = 0.0731) and Random Forest (RMSE = 0.0744) substantially outperform the PGNN (RMSE = 0.2299) in this data regime. (B) R²: QMF and KSEL yield negative R², confirming that closed-form spectral estimators fail to track the variance of the stochastic ground-truth . The PGNN (R² = 0.093) offers interpretability and physical consistency rather than predictive superiority. All elements are original and created by the author.

https://doi.org/10.1371/journal.pone.0344827.g003

Download:

Table 1. Predictive performance: mean ± std over

(seed, fold) combinations of 5-fold cross-validation. Best results in bold; second best underlined. GB and RF dominate in the low-data regime; the PGNN (

, 8.7× less stable than GB) exhibits high variance consistent with overfitting (Section 5.2).

https://doi.org/10.1371/journal.pone.0344827.t001

Table 1 reports mean ± std RMSE and R² over 25 (seed, fold) combinations of 5-fold cross-validation. Gradient Boosting achieves the best predictive accuracy (0.0757 ± 0.0463, ), followed closely by Random Forest (0.0784 ± 0.0511, ). Both ensemble methods are highly stable across folds (std ≈0.05). Linear regression performs competitively (), confirming that a substantial fraction of the variance in is linear in the eleven features.

The PGNN shows mean RMSE and . The large standard deviations—a factor 8.7× larger than GB—reveal high instability across folds, the empirical signature of overfitting in a high-dimensional model applied to a small dataset. Section 5.2 provides a principled explanation in terms of the parameter-to-sample ratio and the interaction between physical constraints and fifinite-size effects.

The closed-form baselines QMF () and KSEL () yield negative mean R², confirming that closed-form spectral estimators fail to track the variance of the stochastic ground-truth across diverse topologies. Inference time for all learned models remains ∼1.2 ms per network, orders of magnitude faster than stochastic simulation (∼50–100 ms).

5.2 Why does the PGNN underperform tree-based methods?

The substantial performance gap between the PGNN and tree-based methods (RMSE = 0.448, on the 5-fold cross-validation) deserves a principled explanation beyond the observation that the dataset is small. We identify three complementary mechanisms.

(i) Parameter-to-sample ratio. The PGNN contains approximately 12,845 trainable parameters (including BatchNorm statistics), yielding a parameter-to-sample ratio of ≈252 at the typical training fold size of . By contrast, Gradient Boosting uses effective parameters (100 trees × depth-4 nodes), giving a ratio of ≈88. Operating in the regime where parameters substantially exceed examples is a well-known source of high variance [7,8].
(ii) Instability under cross-validation. The standard deviation of PGNN RMSE across the (seed, fold) combinations is 0.403—a factor 8.7× larger than that of Gradient Boosting (0.046). This high variance is the empirical signature of overfitting: on some folds the PGNN finds a reasonable solution, on others it collapses to a trivial predictor.
(iii) Physical constraint interaction. The ablation study (Table 2) shows that removing reduces RMSE by 28.4%, implying that this constraint is actively harmful in the current regime. The QMF lower bound is a valid asymptotic result but is violated by the stochastic ground truth on a non-negligible fraction of networks (≈18% of our dataset) due to fifinite-size effects. Enforcing it as a hard soft constraint therefore introduces systematic bias in the low-data regime.

Download:

Table 2. Ablation study.

= percentage change in RMSE relative to the full PGNN (positive = degradation; negative = improvement, indicating over-regularisation in the current data regime).

is the only constraint that is genuinely beneficial;

as an auxiliary feature provides the strongest structural prior.

https://doi.org/10.1371/journal.pone.0344827.t002

These findings collectively suggest that the PGNN would become competitive with tree-based methods at training examples, where the parameter-to-sample ratio falls below 10 and regularisation by physical constraints can act as a genuine inductive bias rather than a source of misspecification.

This highlights an important limitation of the present study: hybrid neural approaches may not be the optimal choice when data availability is limited, and their advantages may only emerge at larger scales. Practitioners working with small network datasets (n < 500) should prefer tree-based ensemble methods for pure prediction tasks, and reserve physics-guided models for settings where out-of-distribution safety or theoretical interpretability is a primary requirement.

5.3 Automatic calibration of the KSEL coefficient k

Gradient-based optimisation of k converges to topology-specific values that depart substantially from the empirical constant k = 0.3 of [3]: , , , , and (over the full training set). These values are visualised in Fig 4B.

Download:

Fig 4. SHAP Feature Importance and Topology-Dependent KSEL Calibration.

(A) Mean |SHAP value| per input feature for the PGNN. is the dominant predictor (0.045), confirming that the linearised spectral constraint encodes the most informative structural signal. Epidemiological parameters (0.040) and (0.032) collectively rank second, while ranks tenth (0.011), its information subsumed by . (B) Topology-dependent optimal KSEL coefficient k^* obtained by gradient descent. Values range from to , substantially departing from the universal constant k = 0.3 of prior work. All elements are original and created by the author.

https://doi.org/10.1371/journal.pone.0344827.g004

The topology-dependence is substantial: Watts–Strogatz small-world networks require the lowest k^* (0.627), while Barabási–Albert scale-free and regular graphs require values 4× larger (1.265 and 1.322 respectively). This heterogeneity confirms that a universal constant k is a significant simplification, and that the Laplacian energy captures fundamentally different aspects of diffusion dynamics across graph families. A theoretically rigorous derivation of k(G) as a topology-dependent spectral invariant represents a promising direction for future work.

5.4 SHAP interpretability analysis

Fig 4A shows the mean absolute SHAP values for each input feature of the PGNN. The QMF auxiliary predictor dominates with importance 0.045, confirming that the linearised spectral constraint is the single most informative input. Epidemiological parameters (0.040) and (0.032) jointly account for more importance (0.072) than any individual spectral feature. The Laplacian energy LE(G) (0.022) and algebraic connectivity (0.021) contribute comparably, while the raw ranks tenth (0.011)—its information being largely subsumed by .

This ranking is consistent across the PGNN and Gradient Boosting, validating the feature engineering independently of model class. The convergence on , , and as top predictors provides actionable guidance: epidemic threshold prediction benefits most from combining the spectral structural baseline with epidemiological parameters, rather than from raw spectral features alone.

6 Discussion

Our comparative study yields four overarching insights.

On predictive accuracy and model selection. Tree-based ensemble methods (GB: ; RF: ) substantially outperform the PGNN () in the low-data regime. As shown in Section 5.2, this reflects the parameter-to-sample ratio disadvantage and not an inherent limitation of physics-guided modelling.

On the role of physical constraints as structured regularisers. The ablation study reveals an ambivalent role: is a genuine regulariser (+25.0% RMSE when removed), while over-regularises (−28.4% RMSE when removed) because the QMF lower bound is violated by on ≈18% of networks due to fifinite-size effects. Future work should derive data-adaptive bounds calibrated to the stochastic simulation protocol.

On the KSEL calibration contribution. The gradient-based calibration reveals that k^* ranges from 0.627 (WS) to 1.322 (REG), departing substantially from the universal constant k = 0.3 of prior work. This topology-dependence has direct implications for practitioners applying KSEL to real contact networks.

On the conditions under which PGNN is preferable. Three scenarios favour the PGNN over ensemble methods. First, out-of-distribution generalisation: physical constraints prevent predictions that violate known epidemiological laws on unseen topologies. Second, large-data regimes: neural network and ensemble learning curves are expected to cross at examples [7,8]; at this scale the PGNN should outperform GB while retaining its interpretability advantages. Third, knowledge integration: new theoretical results can be incorporated directly into the PGNN loss function—a modular extensibility that tree-based methods cannot offer.

6.1 Why physics-guided models remain relevant

Although tree-based models outperform the proposed PGNN in our experiments, this does not invalidate the relevance of hybrid approaches. Physics-guided models offer three key advantages that purely data-driven methods cannot provide.

First, structural inductive bias. The physical constraints , , and encode known properties of epidemic dynamics derived from the QMF approximation and SIS compartmental theory. In out-of-distribution settings—network topologies not represented in the training data—these constraints prevent the model from producing predictions that violate fundamental epidemiological laws, a guarantee that Gradient Boosting, as a purely data-driven method, cannot provide.

Second, principled knowledge integration. When new theoretical results become available (e.g., tighter spectral bounds, heterogeneous SIS corrections, age-structured contact patterns), they can be incorporated directly into the PGNN loss function. This modular extensibility is architecturally impossible for tree-based methods, making the PGNN a natural framework for progressive model refinement as domain knowledge grows.

Third, interpretability. Through SHAP attribution, the PGNN provides feature-level insight into how structural and epidemiological factors jointly influence . The convergence of PGNN and Gradient Boosting on the same top predictors (, , ) validates this interpretability independently of model class.

Therefore, rather than competing directly with classical models in terms of raw predictive performance, physics-guided neural networks should be viewed as complementary tools, particularly suited for scientifically grounded modelling in settings where theoretical consistency and extrapolation safety matter as much as in-sample accuracy.

Limitations. Several limitations merit acknowledgment. First, the ground truth is estimated via stochastic simulation, which introduces Monte Carlo noise; the susceptibility-peak estimator used here is robust for the network families considered (median ) but may be sensitive in strongly heterogeneous or periodic regimes; future work could use SIS quasi-stationary distribution methods or multi-criteria estimators (persistence time, final epidemic size, branching factor) for more precise labels. Second, the training set consists of synthetic networks; the model has not been validated on empirical contact networks (workplace, school, hospital settings); such validation is needed before claims of direct applicability can be made, and is identified as a priority for future work. Third, the current model addresses a homogeneous SIS process; extension to SEIR dynamics, age-structured populations, or multiplex networks requires augmenting the feature vector and deriving new physical constraints. Finally, the model assumes and are known, whereas in practice they must be inferred from surveillance data via Bayesian inverse methods.

7 Conclusion

We presented a comparative hybrid AI-mathematical framework for epidemic threshold prediction in contact networks. Evaluated on 775 synthetic networks via 5-fold cross-validation repeated over 5 seeds, Gradient Boosting achieves the best predictive accuracy (), while the PGNN () exhibits high instability consistent with overfitting in the low-data regime.

Three messages emerge. First, physics-guided constraints are not uniformly beneficial: is a genuine regulariser while over-regularises, demonstrating that constraint design must account for finite-sample behaviour. Second, gradient-based KSEL calibration reveals topology-dependent optimal values (), substantially departing from the universal constant of prior work. Third, the PGNN offers advantages that tree-based methods cannot provide: out-of-distribution physical consistency, SHAP interpretability, and modular integration of new theoretical knowledge—advantages that become decisive as dataset size grows beyond the current regime.

Future directions include: (i) scaling to networks to test whether the PGNN closes the performance gap with ensemble methods; (ii) replacing the QMF lower-bound constraint with a tighter, data-adaptive bound; (iii) extending to SEIR dynamics and multiplex networks; and (iv) Bayesian parameter inference for end-to-end prediction from surveillance data.

Take-home message. Physics-guided neural models should not be viewed as universally superior predictors, but rather as structured learning frameworks that incorporate domain constraints. In small to moderate datasets, simpler models may achieve higher accuracy, while physics-guided approaches provide interpretability and theoretical consistency. This distinction is essential for the correct use of hybrid models in network epidemiology.

Supporting information

S1 File. Python source code for the hybrid AI-mathematical model.

Complete Python implementation (hybrid_model_complete.pdf) of the Physics-Guided Neural Network (PGNN) framework presented in this manuscript, including data generation, model training, cross-validation, ablation study, SHAP interpretability analysis, and automatic KSEL coefficient calibration. The code is also available at: https://doi.org/10.5281/zenodo.19411519 (CC BY 4.0).

https://doi.org/10.1371/journal.pone.0344827.s001

(PDF)

References

1. Pastor-Satorras R, Vespignani A. Epidemic spreading in scale-free networks. Phys Rev Lett. 2001;86(14):3200–3. pmid:11290142
2. Wang Y, Chakrabarti D, Wang C, Faloutsos C. Epidemic spreading in real networks: an eigenvalue viewpoint. In: Proceedings IEEE SRDS. 2003. p. 25–34.
3. Kanyou C, Kouokam E, Tsopze N. Epidemic threshold: A Laplacian spectral and structural approach of prediction. J Complex Netw. 2023.
4. Auger P, Kouokam E, Sallet G, Tchuente M, Tsanou B. The Ross-Macdonald model in a patchy environment. Math Biosci. 2008;216(2):123–31. pmid:18805432
5. Raissi M, Perdikaris P, Karniadakis GE. Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations. J Computat Phys. 2019;378:686–707.
- View Article
- Google Scholar
6. Van Mieghem P, van de Bovenkamp R. Accuracy criterion for the mean-field approximation in susceptible-infected-susceptible epidemics on networks. Phys Rev E Stat Nonlin Soft Matter Phys. 2015;91(3):032812. pmid:25871162
7. Hastie T, Tibshirani R, Friedman J. The Elements of Statistical Learning. 2nd edition. Springer; 2009.
8. Keeling MJ, Rohani P. Modeling Infectious Diseases in Humans and Animals. Princeton University Press; 2011.

[ref1] 1. Pastor-Satorras R, Vespignani A. Epidemic spreading in scale-free networks. Phys Rev Lett. 2001;86(14):3200–3. pmid:11290142
View Article
PubMed/NCBI
Google Scholar

[2] View Article

[3] PubMed/NCBI

[4] Google Scholar

[ref2] 2. Wang Y, Chakrabarti D, Wang C, Faloutsos C. Epidemic spreading in real networks: an eigenvalue viewpoint. In: Proceedings IEEE SRDS. 2003. p. 25–34.

[ref3] 3. Kanyou C, Kouokam E, Tsopze N. Epidemic threshold: A Laplacian spectral and structural approach of prediction. J Complex Netw. 2023.

[ref4] 4. Auger P, Kouokam E, Sallet G, Tchuente M, Tsanou B. The Ross-Macdonald model in a patchy environment. Math Biosci. 2008;216(2):123–31. pmid:18805432
View Article
PubMed/NCBI
Google Scholar

[8] View Article

[9] PubMed/NCBI

[10] Google Scholar

[ref5] 5. Raissi M, Perdikaris P, Karniadakis GE. Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations. J Computat Phys. 2019;378:686–707.
View Article
Google Scholar

[12] View Article

[13] Google Scholar

[ref6] 6. Van Mieghem P, van de Bovenkamp R. Accuracy criterion for the mean-field approximation in susceptible-infected-susceptible epidemics on networks. Phys Rev E Stat Nonlin Soft Matter Phys. 2015;91(3):032812. pmid:25871162
View Article
PubMed/NCBI
Google Scholar

[15] View Article

[16] PubMed/NCBI

[17] Google Scholar

[ref7] 7. Hastie T, Tibshirani R, Friedman J. The Elements of Statistical Learning. 2nd edition. Springer; 2009.

[ref8] 8. Keeling MJ, Rohani P. Modeling Infectious Diseases in Humans and Animals. Princeton University Press; 2011.

Figures

Abstract

1 Introduction

2 Related work and mathematical foundations

2.1 The ross–macdonald model in fragmented environments

2.2 Structural approaches to epidemic threshold prediction

3 Hybrid theoretical framework and physics-guided neural network model

3.1 Unified feature representation

Note on prediction task and target leakage.

3.2 Physics-guided neural network architecture

Automatic calibration of the KSEL coefficient.

4 Experimental methodology

4.1 Ethics statement

4.2 Synthetic network dataset generation

Uncertainty estimation.

4.3 Training protocol

Statistical robustness.

5 Results

5.1 Predictive performance

5.2 Why does the PGNN underperform tree-based methods?

5.3 Automatic calibration of the KSEL coefficient k

5.4 SHAP interpretability analysis

6 Discussion

6.1 Why physics-guided models remain relevant

7 Conclusion

Supporting information

S1 File. Python source code for the hybrid AI-mathematical model.

References