Figures
Abstract
Mathematical models of viral infection have been developed, fitted to data, and provide insight into disease pathogenesis for multiple agents that cause chronic infection, including HIV, hepatitis C, and B virus. However, for agents that cause acute infections or during the acute stage of agents that cause chronic infections, viral load data are often collected after symptoms develop, usually around or after the peak viral load. Consequently, we frequently lack data in the initial phase of viral growth, i.e., when pre-symptomatic transmission events occur. Missing data may make estimating the time of infection, the infectious period, and parameters in viral dynamic models, such as the cell infection rate, difficult. However, having extra information, such as the average time to peak viral load, may improve the robustness of the estimation. Here, we evaluated the robustness of estimates of key model parameters when viral load data prior to the viral load peak is missing, when we know the values of some parameters and/or the time from infection to peak viral load. Although estimates of the time of infection are sensitive to the quality and amount of available data, particularly pre-peak, other parameters important in understanding disease pathogenesis, such as the loss rate of infected cells, are less sensitive. Viral infectivity and the viral production rate are key parameters affecting the robustness of data fits. Fixing their values to literature values can help estimate the remaining model parameters when pre-peak data is missing or limited. We find a lack of data in the pre-peak growth phase underestimates the time to peak viral load by several days, leading to a shorter predicted growth phase. On the other hand, knowing the time of infection (e.g., from epidemiological data) and fixing it results in good estimates of dynamical parameters even in the absence of early data. While we provide ways to approximate model parameters in the absence of early viral load data, our results also suggest that these data, when available, are needed to estimate model parameters more precisely.
Author summary
In a typical acute infection, the viral load initially surges rapidly, peaks, and subsequently declines. Data collection typically begins after symptoms appear, usually around or after the peak viral load, and thus missing the critical early phase of viral growth, when pre-symptomatic transmission occurs. In the present study, we evaluate the impact of missing pre-peak data on estimating key parameters in viral dynamic models. This lack of early data complicates estimating crucial parameters in viral dynamic models, such as the time of infection, leading to shorter predicted growth phases. We found that while model parameter estimates of the time of infection are sensitive to data quantity, parameters like the loss rate of infected cells are less affected. Conversely, knowing the time of infection and fixing it can improve the robust estimation of dynamical parameter estimates even without early data. However, incorporating additional information, such as the average time to peak viral load, can enhance parameter estimation robustness. Fixing certain parameters, such as viral infectivity and production rate, to literature values can improve parameter estimation robustness when pre-peak data is limited.
Citation: Zitzmann C, Ke R, Ribeiro RM, Perelson AS (2024) How robust are estimates of key parameters in standard viral dynamic models? PLoS Comput Biol 20(4): e1011437. https://doi.org/10.1371/journal.pcbi.1011437
Editor: Roger Dimitri Kouyos, University of Zurich, SWITZERLAND
Received: August 17, 2023; Accepted: April 1, 2024; Published: April 16, 2024
This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 public domain dedication.
Data Availability: All relevant data are within the manuscript and its supporting information files.
Funding: This work was performed under the auspices of the U.S. Department of Energy under contract 89233218CNA000001 and was supported by NIH grants R01-AI078881 (ASP), R01-AI152703 and U54-HL143541 (RK), Los Alamos Laboratory National Laboratory LDRD 20200743ER (RMR), and Los Alamos National Laboratory LDRD 20200695ER (ASP). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. ASP, RMR and RK received salary from NIH. ASP and RMR received salary from Los Alamos National Laboratory LDRD.
Competing interests: The authors have declared that no competing interests exist.
Introduction
In a typical acute infection, the viral load initially increases rapidly, reaches a peak, and then declines (Fig 1A). The same pattern is seen in infections, such as HIV or hepatitis C, which can progress from acute to chronic, where the decline does not necessarily lead to the elimination of the virus. The viral load frequently correlates with a person’s infectiousness and, thus, the probability of viral transmission [1–8]. Understanding the viral dynamics throughout the course of infection, including prior to the viral peak, is critical to understanding viral transmission [9–11]. However, viral load data is often obtained from settings where infected individuals are tested and identified days or weeks after symptoms develop [3,12–14]. In the case of influenza A virus, SARS-CoV-2, and respiratory syncytial virus, symptom onset is usually around the peak viral load and corresponds to the time when a person is highly contagious [1,2,15–20]. Consequently, as viral load measurements in infected individuals are often collected post-symptom onset, we typically lack data collected during the viral growth phase and, thus, pre-peak viral load (Fig 1B). Additionally, the exact time of pathogen exposure is often unknown, or estimates of infection times are often based on incomplete data. Knowing the time of infection and the time to peak viral load can be used to determine the duration of quarantine, help in contact tracing, and determine the period high-risk infected individuals should be monitored for symptoms. Furthermore, understanding the dynamics of viral infection may also help in the timing of antiviral treatment to clear the infection, prevent viral rebound or emerging resistance, and decrease shedding duration [21–25].
A) Typical course of an acute infection, e.g., influenza A virus or SARS-CoV-2, where the viral load initially increases (growth, up-slope), peaks, and decreases (clearance, down-slope). Viral load data is usually collected post-symptom onset (red dots). B) Example of SARS-CoV-2 data collection that starts 5 to 7 days post infection (from Wolfel et al. [3]). Filled red circles are viral load measurements above the limit of detection (horizontal dashed line), non-filled circles are below the limit of detection. The vertical dotted line represents the time of symptom onset as reported in [26]. [Created with BioRender.com].
Mathematical modeling has a long history in studying the dynamics of many viruses causing acute and chronic infections, including West Nile virus [27], dengue [28–32], Zika [33–35], respiratory syncytial virus [36,37], influenza [25,38–42], SARS-CoV-2 [1,12,16,22–24,43–50], hepatitis C virus [51–58], hepatitis B virus [59–66] and HIV [67–76]. Interestingly, the model structure to study these varied infections is very similar. Those models were used to understand the fundamentals of viral infections, study the role of the immune system, and antiviral treatment efficacy. Furthermore, mathematical modeling has been a valuable tool in aiding decision-making in the SARS-CoV-2 pandemic [77–81].
The accuracy of model predictions depends on model complexity, chosen initial conditions, parameter estimation algorithm, and, most importantly, the quantity and quality of experimental measurements and clinical and epidemiological data. The impact of these issues in parameter estimation of viral dynamic models has been analyzed in previous studies [82–86]. In addition, other studies also addressed issues of structural and practical identifiability of model parameters [86–88].
Here, we analyze the effect of having different amounts of data early on (as in previous studies [50,82,85,86,89,90]), the effect of knowing some parameters, and the effect of knowing the time of infection. We then quantify how this extra information impacts parameter estimates. We are not interested in any particular infection since the early behavior (in Fig 1) is similar across many different infections, and the models used also have very similar structure; our approach should be valid for different infections. However, to be concrete, we use, as an example, data from infection with SARS-CoV-2, since this has become one of the infections with the most data. In particular, we use a data set with frequent measurements throughout, before, and after the peak of viral load [13,14]. We found that the cell infection and virus production rate are crucial parameters in viral dynamic models needed to robustly estimate the dynamics of the viral growth phase. If data is missing in the first few days post-infection, knowing those parameters (e.g., from prior studies) led to similar predictions of viral load as having frequent viral load measurements in the pre-peak growth phase. Alternatively, knowledge of the time of infection (e.g., from epidemiological evidence) or assuming a given duration until peak viral load is attained (e.g., in SARS-CoV-2 assuming the median of 5 days [91–104]) represent promising alternatives to obtain consistent parameter estimates. Nevertheless, frequent viral load measurements at the early stages of infection are crucial for obtaining more precise estimates of model parameters and the overall viral trajectory within individuals.
Methods
Mathematical model
A mathematical model often used to study viral infections is the target cell limited model (TCLM) with an eclipse phase, which was introduced to study influenza infection [25]. This model has been used to study various acute infections, such as Zika, dengue, influenza A, West Nile virus, Ebola, and SARS-CoV-2 [1,25,27,35,43,58], due to its simplicity and small number of parameters.
The TCLM describes the dynamics of target cells, i.e., cells susceptible to infection, T, infected cells in the eclipse phase that are not yet virus-producing, E, virus-producing infected cells, I, and virus, V. The TCLM has also been augmented by including an innate immune response that has provided a better description of influenza and SARS-CoV-2 infection dynamics [1,25]. In this model, we include a population of cells that are refractory to infection, R, and call the model the refractory cell model (RCM). Refractory cells are in an antiviral state induced by the innate immune response mediated by type I and type III interferons [105–107]. The following system of ordinary differential equations (ODEs) gives the dynamics of the five populations of the RCM:
(1)
The equations of the TCLM are similar and to obtain them we remove the dR/dt equation, and the terms φIT and ρR in the dT/dt equation (see equation S1 in S1 Text). In the model, target cells, T, become infected by virus with rate constant β and then enter the eclipse phase, E, which lasts for an average duration 1/k during which time they produce no virus. At the end of the eclipse phase, cells become productively infected cells, I, produce virus with rate constant π and die with rate constant δ. Note that the average infected cell lifespan is . Finally, virus, V, is cleared with first-order rate constant c (Fig 2). In infections such as HIV, where virus and infected cells are sampled in blood, π represents the average rate of viral production by an infected cell. However, when the virus is isolated from swabs, such as in respiratory infections (e.g., SARS-CoV-2), π is the viral production rate multiplied by the fraction of virus sampled on a swab contained in 1 mL of viral transport fluid as described in the supporting information in Ke et al. [1].
A susceptible target cell, T, is infected by virus, V, with the infection rate constant β. Infected cells in the eclipse phase, E, become actively virus-producing cells, I, with the transition rate constant k. Infected cells, I, produce virus with production rate constant π or die with degradation rate δ. Virus is cleared with clearance rate c. In the refractory cell model, in addition we also account for the innate immune response, which turns susceptible cells into refractory cells,R, with constant rate φ, which are in an antiviral state and refractory to infection. However, refractory cells can become susceptible to infection with constant rate ρ. [Created with BioRender.com].
Infection induces the release of interferons that may establish an antiviral state in non-infected target cells. For simplicity, we do not explicitly include interferons but model their concentration as proportional to the number of infected cells (for more details, see supporting information in [1]). We model the effect of interferon by assuming target cells enter the refractory state with rate φIT, and leave it with a constant first-order rate constant ρ, making them susceptible to infection again (Fig 2).
Here, for concreteness, we will focus on SARS-CoV-2 infection. Consistent with our previous work [1], we fixed the initial target cell population at the time of infection (tinf) to T(tinf) = 8×107 cells. The initial target cell population is based on the estimated number of epithelial cells in the upper respiratory cell tract (4×108 cells), with 20% of them expressing angiotensin-converting enzyme 2 (ACE2), the entry receptor used by SARS-CoV-2 [25,108]. Note that in these models, the initial target cell population and the virus production rate, π, are correlated and only identifiable as a product. Thus, an increase or decrease in target cells decreases or increases the virus production proportionally [25,39,86,87,109,110].
Additionally, we assumed the initial refractory cell population R(tinf) = 0 cells. The virus concentration that initiates infection is hard to estimate. Thus, as has been done previously [111], we set V(tinf) = 0 and started the growth phase of the infection with one infected cell in the eclipse phase E(tinf) = 1 [1], which was the value that led to the best model fit according to our sensitivity analysis (Table A in S2 Text). Note that it is possible that there is a variable time of stochastic rounds of infection before the virus starts to increase, and we are not including that phenomenon in our model (see [89] for details of this process in HIV). In vitro experiments have shown that it usually takes 4–8 hours before an infected cell starts to produce SARS-CoV-2 [22,25,112], yielding a rate of transition out of the eclipse phase of k = 4 d−1. Further, we fixed the virus clearance rate c = 10 d−1 [22] (for more details, see supporting information in [1]).
For the above models, the basic reproductive number (R0) is given by
(2)
which corresponds to the average number of cells infected by one single infected cell at the start of infection.
Parameter estimation, model selection, and model analysis
Fitting the model to log10-transformed data was implemented using the non-linear mixed effects modeling framework in Monolix (lixoft.com) version 2023R1 (see S2 Text for the Monolix function used). In this approach, we assume that a model parameter ηi is drawn from a distribution with a fixed part θ, which is the median value of the parameter in the population, and a random term (for individual variability) ϕi, which is assumed to be normally distributed with zero mean and standard deviation σθ. Unless otherwise specified, we assumed that all parameters follow a lognormal distribution. Note that for fixed model parameters, random effects were not included. To minimize the negative log-likelihood, Monolix uses the SAEM (Stochastic Approximation Expectation-Maximization) algorithm with a Markov Chain Monte Carlo (MCMC) procedure to sample the conditional distribution of the parameters. We used all default settings and a constant error model. The 95% confidence intervals were calculated with the R package Rsmlx [113] using Monolix’s Fisher Information Matrix. We conducted at least 100 different parameter estimation rounds using Monolix’s R-functions with randomly chosen initial parameter values uniformly distributed within the following ranges: tinf∈[−8,−5] days, log10(β)∈[−8,−5] mL/RNA copies/days, δ∈[0.1,3] 1/day, π∈[1,100] RNA copies/mL/day, log10(φ)∈[−8,−4] 1/cell/day, and log10(ρ)∈[−3,−1] 1/ day. Note that these ranges do not correspond to restrictions on the parameter space; rather, each set corresponds to the ranges sampled from the initial parameters guesses for a full fitting run. Thus, the estimated parameters could lie inside or outside these ranges. Indeed, one would expect that for very simple fitting problems (simple model and large, well-behaved data sets, e.g., with a well-behaved global minimum), all of these fits would lead to the exact same parameter estimates. From the more than 100 different parameter estimation rounds (i.e., 100 randomly chosen sets of initial parameter values), the best model fit was selected for each data set by comparing the negative log-likelihood (-LL) and root-mean-square error (RMSE). We analyzed practical identifiability using the profile likelihood estimation (PLE) method for each fitted model parameter using Monolix’s R-functions (see S2 Text) [114–117]. A model parameter is identifiable if its likelihood profile is finite (crosses the confidence level) in a U-shape [117].
Data and subsets of data
We used published data from the National Basketball Association (NBA), where unvaccinated individuals were regularly tested during the NBA tournament in 2020 and 2021 [13,14]. The data is based on a combination of anterior nares and oropharyngeal swabs tested using RT-qPCR. The resulting Ct values for each individual were converted to viral RNA (genome equivalents per mL) as described by Kissler et al. [13,14].
We chose 25 unvaccinated individuals from this cohort (9 from [13] and 16 from [14]) with frequent viral load measurements, i.e., individuals with four or more viral load measurements above the limit of detection [LOD ≥ log10(2.658) RNA copies/mL], representing the entire course of infection (viral load up-slope, peak, and down-slope) and no viral load above detection more than 14 days pre- and post-peak. For more information about this selection process, see S2 Text. We excluded vaccinated individuals since breakthrough infections involve different immune mechanisms, such as the presence of neutralizing antibodies and immune memory that may lead to quicker responses against the infection as compared to in unvaccinated individuals.
On average (± standard deviation), the 25 selected individuals had 9.8 ± 3.8 viral load measurements above the LOD with 3.1 ± 1.6 data points obtained during the up-slope, one measurement representing the observed peak viral load, and 5.7 ± 3.0 measurements obtained during the post-peak down-slope. The time of peak viral load was defined in the original studies as t = 0 [13,14] and, thus, the time of infection, tinf, is negative and denotes the number of days before the viral peak that infection was estimated to have taken place.
We first fitted the viral dynamic models to all the viral load measurements and then compared estimated model parameters and model predictions to different subsets of the complete data set, where we included only measurements started 3, 5, and 7 days post-infection or included only post-peak viral loads (Figs C, D, E, and F in S2 Text). We are interested in how well these smaller data sets re-create the parameter estimates we obtained when fitting the full data set, which we take as the ground truth for our purposes.
To measure how well the smaller data sets allow parameter estimates that describe the full data set, we calculate the root-mean-square error (RMSE)
(3)
where yi are the log10 of actual measurements and
are log10 model predictions at time i using parameters estimated with the smaller data set but for the full course of infection with all n non-censored data points, i.e., viral load measurements above the detection limit (LOD). Note that we are not using the RMSE to fit the data, only to calculate how close the viral loads in the full data set are to those from a model simulation generated using parameters estimated with a subset of the data.
Results
SARS-CoV-2 dynamics: The course of infection
The RCM fits the entire course of infection of the 25 selected individuals and describes the initial viral growth and subsequent virus clearance (see Fig A in S2 Text for the best fit with peak viral load t = 0 and Fig 3 for the best fit with tinf shifted to zero).
Filled circles are measurement points; non-filled circles are censored and below the detection limit (dotted grey line). We fixed population parameters (without allowing random effects) for the initial target cell population, T0 = 8×107 cells, initial eclipse cell population, E0 = 1 cell, transition rate out of the eclipse phase, k = 4 /day, and the virus clearance rate, c = 10 /day (see Table 1 and Table B in S2 Text for individual parameters, S1 Data for viral load measurements and model predictions).
We estimated the time of infection to be a median of -6.4 days from the observed viral load peak, ranging individually from -9.8 to -5.3 days (Fig A and Table B in S2 Text). This is consistent with the findings in a human challenge study [118], where the viral load peak measured with nasal swabs occurred 6.2 days after infection and ranged between 3 and 9 days. We note that the model predicted time to the peak is not necessarily the same as the time to the observed peak viral load in the data. In some individuals, the model predicts an earlier peak viral load than the observed one, e.g., the predicted viral load of individual 2349 peaks four days before the measured one, probably due to the data not showing a well-defined peak (Fig 3 and Fig A in S2 Text). The model predicts a median time to peak of 5.7 days (Fig 4), i.e., 0.7 days before the observed peak, due to a rapid increase in viral load early post-infection in some individuals. The model also predicts another 9.1 days to clear the infection (from peak viral load to the LOD), with a predicted infection duration of around 15 days. Note that the time of exposure to SARS-CoV-2 was not reported in [13,14]. Nevertheless, the last negative (below the LOD) measurement (5.8 ± 2 days pre-peak viral load) and the first positive measurement above the LOD (4.1 ± 1.7 days pre-peak viral load) were reported. Thus, our model result for the predicted time of infection is consistent with these observations.
Predicted dynamics of virus, infected cells, refractory cells, and target cells using the population parameters (black line) and individual parameters (colored lines) throughout the course of infection predicted by the RCM using the best-fit population parameter estimates (see S1 Data for model predictions).
In combination with the eclipse phase duration, 1/k, the average lifespan of infected cells is . Using the estimated values of k and δ, we find the average lifespan of an infected cell is 0.64 days or 15 h. The within-host reproductive number R0 was, on average 5 (Table 1).
The infected cell population peaks at around the same time as the viral load (Fig 4). Target cells start to decline substantially 3 to 4 dpi until less than 3% of target cells are left at around 7 dpi when refractory cells reach their maximum of 81% of the initial susceptible target cells (Fig 4). To further explore these fits, we performed a correlation analysis of the population parameters obtained from fits with a negative log-likelihood (-LL) in the range of 2 units from the best fit [that is, min(-LL) to min(-LL) + 2]. We found more than 50 fits with a -LL in the defined range with several model parameters significantly correlated (Fig 5). For example, the cell infection rate constant (β) and the virus production rate (π) are negatively correlated, as has been seen before [40]. The transition rate of susceptible cells into refractory cells (φ) is positively correlated with π but negatively correlated with β. Thus, the faster the estimated rate cells transition into the refractory state, the lower the estimated cell infection rate and the higher the estimated virus production rate. Furthermore, the transition rate of refractory cells back into susceptible cells (ρ) is positively correlated with the loss rate of infected cells (δ). Note that when we included these correlations in the model fitting, there was an increase in the BICc (991 with correlations compared to 981 without). Thus, we did not include the correlations in further analyses. We further assessed model identifiability by performing a profile likelihood analysis (PLE) for every estimated model parameter. The PLE [117] showed that all estimated parameters were practically identifiable (Figs G, H, I, J, K, and L in S2 Text). Parameter identifiability for similar models was done in detail previously [82,86,87,90,112,119], and we do not explore it further here.
The sample size gives the number of fits that fit the model equally well in the range of min (-LL) to min(-LL)+2. Correlations that are crossed are non-significant (p-value > 0.05). The plot has been generated with ggstatsplot [doi:10.21105/joss.03167].
Missing data in the pre-peak growth phase
In most human studies, data is not collected starting at the time of infection but rather starting at the time of or later than the onset of symptoms [16]. To understand the effect of not having early data, we constructed different data sets, with varying numbers of viral load measurements during the viral up-slope, to study the robustness (i.e., consistency) of estimated model parameters to missing data by comparison with the results obtained by fitting the full data set presented above. Based on the estimate of tinf for each individual, data sets were constructed starting i) 3 days post-infection (dpi), yielding on average 2.1 ± 1.6 pre-peak measurements, ii) 5 dpi, yielding on average 0.7 ± 0.9 pre-peak measurements, and iii) 7 dpi yielding on average 0.1 ± 0.6 pre-peak measurements. In the 3, 5, and 7-dpi data sets, pre-peak viral load measurements were available from 21, 12, and 1 individual, respectively (for figures, see Figs C, D, E, and F in S2 Text). Note that the 7-dpi data set starts very close to the peak viral load (±1 day). Thus, only one individual has pre-peak data, 17 individuals lose the peak viral load, and five individuals have at least one missing data point after peak viral load (viral down-slope). Finally, we studied the robustness of parameter estimates using an additional data set with only the peak and post-peak measurements as a proxy for data collection starting around symptom onset and, consequently, the most common data set obtained in clinical practice.
An important issue with fitting acute infection data is that typically, we do not know the time of infection and, thus, the times relative to infection when data was collected. To study this issue, we considered three scenarios for each artificial data set created above. First, we assumed that we do not know the infection time and estimate it (tinf = est) from the data as we did above, but now we are using our reduced data sets. Here, the first measurement is assigned time 0, and we estimate infection before that. In the second case, we assume we know the time of infection, as estimated from the full data set, and thus, tinf = 0 is the actual infection time. We fit the model to the various data sets and estimate model parameters (with tinf = 0). Lastly, when data in the initial viral growth phase is missing, it is common practice to set the time of infection from the literature [22,23,48,120], e.g., by assuming the viral load peaks at the estimated median time of symptom onset, i.e., 5 dpi for SARS-CoV-2 [91–104]. Thus, for this third case, we only use the post-peak data set and assume the time of the observed peak viral load is day 5. While used in previous literature, this approach is crude as it ignores the individual variability in time to symptom onset. Here, we are testing the effect of fixing this time on the estimation of parameters and using a fixed value of 5 days biases against our objectives since we “know” from the full data set that the population estimate is 5.7 days.
We fitted the model in turn to these different data sets and assumptions of tinf, and then evaluated how well the model with these newly estimated parameters describes the full data set, calculating the RMSE for the full data, as we did for the full model fit. Thus, we are calculating RMSE on the same full data set. The lower the RMSE, the better a model fitted to a subset of the data agrees with the full viral load course. We found, as expected, that the closer the data starts to the time of infection (tinf), i.e., the more data available pre-peak viral load, the lower the RMSE and the better the predictions align with those made with the full data set (Fig 6) [86]. Collecting data 3 dpi, 5 dpi, or 7 dpi, and knowing the time of infection (tinf = 0) results in lower RMSEs than when estimating tinf (Fig 6A–6D). However, the more viral load measurements pre-peak that are missing and without knowledge about the time of infection, the higher the RMSEs, leading to more uncertainty in the model prediction. Furthermore, for the post-peak viral load data set, if we assume that the viral load peaks 5 dpi and fix this (Fig 6D, blue dots), we obtain lower RMSEs compared to re-estimating infection times for each individual (Fig 6D, orange triangles), which in addition are usually poorly estimated. The TCLM yields similar results, but interestingly, in some cases, that model also yields slightly lower average RMSEs than the RCM (S1 Text). Note that the scenario assuming 5 days from infection to peak viral load was only fitted with the post-peak data.
Infection times (tinf) are re-estimated (triangle), infection times are known and set to zero (square), or infection times are set to zero and the VL peaks 5 dpi (circle). RMSEs are shown for different subsets of the full data set: A) 3 dpi, B) 5 dpi, C) 7 dpi, and D) post-peak. For each case, all model parameters were re-estimated (NONE fixed on the x-axis), or the indicated model parameters were fixed to the values estimated from the full course of the infection data set and the remaining parameters estimated (see Table 1). The dashed line represents the RMSE calculated from the best model fit using the full course of infection (see S1 Data for RMSE values).
Fixing one or more model parameters is a common practice to reduce uncertainty in data fitting [114]. Therefore, we were interested in how many model parameters, in addition to c and k, had to be fixed to describe the entire course of infection well with the different data sets. The parameters c and k are typically fixed as they refer to processes that take place on a timescale of minutes to hours for which data is unavailable [22,25,38,45,108,111,121]. Thus, we systematically fixed every single or possible pair of remaining model parameters to the population value estimated from the fit to the entire data set representing the entire course of infection (Table 1). Adding knowledge to the model fitting by fixing one or two additional parameters improved the RMSEs. Knowing tinf yielded the lowest RMSE, followed by assuming the viral load peaks at 5 dpi, where both outperformed re-estimating tinf. By fixing the cell infection rate β and either the virus production rate π or the infected cell loss rate δ or the conversion rate of refractory cells into susceptible cells ρ, we generally observed the lowest RMSEs, which were close to those calculated from the whole course of infection data set (Fig 6A–6D). Especially β and π are crucial parameters of the initial growth phase. However, fixing only one of those model parameters (β or π) led to conflicting results. If infection times are not known and, thus, must be estimated, fixing only β yielded lower RMSEs than fixing only π when early data is missing (compare Fig 6A and 6B when early data is available with Fig 6C and 6D where only late data is available). However, if we assume the viral load peaks 5 dpi, fixing π yielded slightly lower RMSEs than fixing β (Fig 6D). Additionally, if infection times are re-estimated or if we assume the viral load peaks 5 dpi and fix π, the TCLM yielded lower RMSEs than the RCM (S1 Text).
How robust are estimated infection times and other model parameters?
To evaluate the robustness of parameter estimates to missing data, we analyzed the parameters estimated with the modified data sets, with and without fixing model parameters (beyond c and k) and compared them to those estimated from the full course of infection. Note that we do not evaluate the biological accuracy of the estimated parameter values, which needs experimental and clinical validation, but rather studied the robustness or consistency of model parameters estimated from the data sets with varying numbers of data points pre-peak viral load. That is, we assume that the fit to the full data set is the ground truth, and thus the parameters estimated in that fit are “correct”.
The estimated δ was close to the value estimated from the full course of infection if ρ was fixed due to their correlation (Fig 7A and Fig B in S2 Text). Furthermore, only by fixing β and π, were we able to estimate the time of infection robustly and almost exactly (Fig 7B). Estimating β was most robust if we assume the viral load peaks 5 dpi and fix one or more model parameters (Fig 7C). Again, π was mostly over or underestimated when fixing one or two model parameters. However, not fixing any parameters led to the most robust estimate of π for all three studied cases (Fig 7D). For both innate immune response model parameters φ and ρ, fixing β and π or β and δ performed best for all subsets of the full data set (Fig 7E and 7F).
The dotted line represents the population parameter estimated from the “full course of infection” data set. The gray shaded area represents 95% confidence intervals of the population parameter estimated from the “full course of infection” data set (Population parameters estimated from each data subset individually can be found in Fig B in S2 Text and S1 Data).
Effect of choosing different combinations of model parameters
Given that our results demonstrated the importance of knowing the cell infection rate (β) and the virus production rate (π), we were interested in the model performance when these are unknown but fixed at biologically plausible values. For that, we fixed β and π at various values and estimated the remaining model parameters, using the post-peak data set and assuming 5 days to reach peak viral load, as this is the most common study scenario. We then used the parameter estimates from these fits to simulate the full course of infection and calculate RMSE for the full course of infection. As shown in Fig 5, β and π correlate inversely, and because of this, we found that different combinations of β and π led to equally good fits, represented as dark blue tiles and, thus, low RMSEs (Fig 8). By visual inspection, we define fits with RMSEs within 20% of the best RMSE as equally good.
A) RMSEs for different combinations of literature values. B) RMSEs for fixed infection rates from literature and estimated virus production rates. C) RMSEs for fixed virus production rates from literature and estimated infection rates. ◊ = the population parameters we estimated, x = values for β and/or π found in literature. Parameter values can be found in Table C in S2 Text. Note that the underlying heatmap is the same for A), B), and C), and it was calculated from fits to the post-peak viral load data set and assumed 5 days to reach peak viral load for each pair of β and π values indicated in the x and y-axes, respectively (see S1 Data for RMSE values).
We include different values for β and π reported in the literature in these analyses and obtained using either the RCM or TCLM [1,12,21,24,44,120,122,123] (Table C in S2 Text, Fig 8). Compared to our estimate of β = 1×10−8 mL/RNA copies/day, the infection rate values found in the literature were mostly in the range between 1×10−9 to 5×10−8. These values and their corresponding virus production rates, which ranged between 50 and 1500 (our estimate π = 151 RNA copies/mL/day), led to equally good fits if β≥5×10−8 and π≥50 (Fig 8A). However, in the absence of pre-peak viral load data, we observed overall lower RMSEs, if fixing β and estimating π (Fig 8B, Table C in S2 Text) instead of fixing π and estimating β (Fig 8C, Table C in S2 Text). Thus, using infection rate values from the literature obtained using similar models and diverse data sets represents a good strategy to deal with missing data in the initial growth phase and missing information about infection times.
Discussion
Estimating parameter values in viral dynamic models with missing data is challenging. Especially in acute infection, where individuals generally only become aware of being infected when symptoms develop. Thus, information about the time of infection and viral load measurements prior to symptom onset is often not available. The uncertainty of model predictions with missing data early post-infection was recently studied by Ciupe et al. for an acute infection (SARS-CoV-2) [86]. But they focused only on two scenarios of data availability, early and late, and on the estimate of virus infectivity, β. They found that the availability of early data post-infection and, thus, pre-peak viral load data improved the practical identifiability of β. However, they did not use a mixed effects approach [86] as in the current study, which improves identifiability properties [124]. Indeed, another recent study [89] evaluated the same issue of sparse initial data, but for chronic HIV-1, using a similar model and also using the same population mixed-effects approach we do. They found that information was gained across individuals, even when some had data missing, although uncertainty rose with fewer data points close to infection [89]. In our study, in addition to analyzing the robustness of estimated viral dynamic model parameters in the absence of variable amounts of data in the pre-peak growth phase, we also studied the effect of having extra information, such as the time of infection and/or knowing the values of some parameters. We found that viral infection and virus production rates are key parameters in determining the pre-peak viral growth rate, and with a lack of early data, the time to peak viral load was often underestimated. However, fixing the time of infection based on epidemiological studies represented a good alternative to estimating infection times and resulted in good model fits.
Viral dynamics of the entire course of infection
The RCM describes the frequent viral load measurements of the 25 studied individuals well (Fig 3). Most estimated model parameters agreed with the estimates made in previous work, except for the transition rate of refractory cells back into susceptible cells, which we now estimate to be almost 3-fold higher [1]. Interestingly, we estimated that only 3% of the initial number of target cells were infected at the viral peak and 6% cumulatively from infection to peak viral load (Fig 4). In contrast, at the peak viral load, most cells were in a refractory state (81% of T0), only 12% of the initial number of target cells remained susceptible to infection, and the remainder of T0 were either infected cells or cells killed by the virus. This suggests that turning target cells into cells refractory to viral infection by establishing an antiviral state in uninfected cells may be a critical host defense mechanism early on in fighting a viral infection. And indeed, there are several reported mechanisms for a cell to accomplish this [105,107,125,126]. However, as far as we know, experimental measurements of the in vivo fraction of cells in an antiviral state during SARS-CoV-2 (or indeed any other) infection are not available and thus limit our ability to compare these predictions to data. Note that in the RCM, viral rebound occurs when refractory cells become susceptible again to infection. In vivo, the rebound is most likely prevented by adaptive immune responses that neutralize the free virus and kill infected cells. Due to a lack of data associated with the adaptive immune response, we did not model these mechanisms here.
The effect of missing data in the pre-peak viral growth phase
With missing data in the pre-peak viral growth phase, infection times are underestimated by 2 to 3 days, resulting in very fast initial growth rates. Here, we show that estimation of the infection time can be improved by adding knowledge to the model. Cell infection and virus production rates are crucial parameters for describing the pre-peak growth phase. Fixing both model parameters to our population values led to infection time estimates similar to those estimated from the entire course of infection data set.
We were further interested in the robustness of other model parameter estimates to missing data. By fitting the RCM to subsets of the full data set, thus mimicking different data collection scenarios, we found that knowing the infection time led to the lowest RMSEs (Fig 6). However, a low RMSE did not guarantee the correct estimation of population parameters due to the correlations in the model parameters, such as the correlation between the cell infection rate (β) and the virus-production rate (π). Furthermore, the infection time is often unknown, but if one has knowledge of that time (e.g., from epidemiological studies) and uses it by fixing the infection time in the fitting, we obtain more robust results, which are better than estimating infection times. Estimated infection times were underestimated by up to 3 days when peak viral load data was missing while fixing β and π to values estimated from the full data set led to the most robust infection time estimates and closest to the value estimated with the full data set. Nevertheless, fixing a time to peak viral load of 5 days (for SARS-CoV-2 [91–104]) represented a good alternative to estimating infection times and led to estimates of the population parameters close to those estimated from the full course of infection data set. However, while we offer methods for estimating model parameters even without early viral load data, carefully gathered viral load measurements at different stages of infection data can enhance the precision and reliability of the model’s parameter estimates and predictions of infection times, incubation periods, and infectiousness.
Interestingly, whether infection times are known, estimated, or assumed, the loss rate of infected cells (δ) represented the most robust model parameter, with more consistent estimation, due to frequent viral load measurements after symptom onset and thus after peak viral load.
Estimating β and π: What if β and π are unknown?
The cell infection rate β and virus production rate π are crucial parameters of the pre-peak viral growth phase. Fixing both model parameters may lead to robust predictions of infection times (Fig 7). However, both model parameters are often unknown and challenging to measure experimentally.
If infection times are unknown, assuming 5 days from the time of infection to peak viral load led to the most robust estimates of β. Nevertheless, estimates of π showed more variability, possibly due to lower sensitivity. It has been previously shown that the initial target cell population T(tinf) also correlates with the virus production rate π and only their product [T(tinf)⋅π] is identifiable [87,90,109,127].
Encouragingly, using estimates for the cell infection and virus production rates from other modeling studies that used similar RCM and TCLM models [1,12,21,24,44,120,122,123] led to equally good fits of the data, for example, β = 10−8 mL/RNA copies/day and π = 150 to 200 RNA copies/mL/ day or β = 10−9 mL/RNA copies/day and π = 1000 to 1500 RNA copies/mL/day (Fig 8 and Table C in S2 Text). Consequently, when there is missing data in the pre-peak growth phase, taking cell infection and virus production rates from other modeling studies in the literature that used similar models may allow robust predictions of the viral load dynamics during the growth phase.
Limitations and outlook
Our analysis was based on models of infection similar to those used for a variety of viruses, including West Nile virus [27], Zika [35], respiratory syncytial virus [36], influenza [25,38–41], and SARS-CoV-2 [1,12,22,24,43], as well as primary infection in HIV [89] and HBV [66], among others. Still, we only analyzed in detail two models, the TCLM and the RCM, and not all our conclusions may extend to variations of these models. We based our study on a specific data set for SARS-CoV-2 infection due to its availability and richness. However, we claim that our results are a property of the models and not of the specific data set, and we could have just used synthetic data (generated by model simulation) to obtain the same conclusions (as in [86]). A more important limitation is that we analyzed the effect of extra information, which may not be available for some infections, especially emergent infections. However, if nothing or very little is known, modeling is inherently difficult, and we show that at least “guessing” the time to the viral peak, and fixing it in the fitting procedure, will inject some robustness into the results. Another issue is that we specifically did not include an adaptive immune response in our models, which for example precluded us from using data from vaccinated individuals. The reason is that the immune response is often quite variable among individuals and may result in an additional level of uncertainty [12].
Overall, our results may be useful in guiding data collection and using that data to best estimate viral dynamic parameters such as the death rate of infected cells, which can inform about viral pathogenesis. Moreover, if a novel virus emerges in the future, we emphasize that one should strive to obtain the most informative data sets, i.e., frequent measurements throughout the course of infection, so that one can have the most confidence in any inference about the infection kinetics and the infectious period of an individual. Therefore, in the event of a novel virus emergence, prioritizing data collection with frequent viral load measurements beginning as early as possible and continuing throughout the course of infection can improve model predictions and, hence, our understanding of the virus’ behavior to respond effectively to the evolving situation.
In summary, the current study provides new insights into viral dynamic modeling in the absence of frequent viral load measurements, particularly in the early stages of infection. We evaluated the robustness of estimated model parameters to missing data and found that cell infection and virus production rates are key parameters of the viral growth phase that are difficult to estimate without early data. Furthermore, missing data before the viral load peaks leads to underestimates of the time to peak viral load and to inconsistent estimated model parameters. However, fixing infection times from epidemiological studies and/or model parameters that are important in determining the viral growth rate (β and π) represented a good alternative and led to robust model fits and model parameter estimates.
Supporting information
S1 Data. Experimental measurements and model predictions.
https://doi.org/10.1371/journal.pcbi.1011437.s003
(XLSX)
References
- 1. Ke R, Zitzmann C, Ho DD, Ribeiro RM, Perelson AS. In vivo kinetics of SARS-CoV-2 infection and its relationship with a person’s infectiousness. Proc Natl Acad Sci Royal. 2021;118.
- 2. Marc A, Kerioui M, Blanquart F, Bertrand J, Mitjà O, Corbacho-Monné M, et al. Quantifying the relationship between sars-cov-2 viral load and infectiousness. Elife. 2021;10. pmid:34569939
- 3. Wölfel R, Corman VM, Guggemos W, Seilmaier M, Zange S, Müller MA, et al. Virological assessment of hospitalized patients with COVID-2019. Nature 2020 581:7809. 2020;581: 465–469. pmid:32235945
- 4. Carrat F, Vergu E, Ferguson NM, Lemaitre M, Cauchemez S, Leach S, et al. Time lines of infection and disease in human influenza: A review of volunteer challenge studies. Am J Epidemiol. 2008;167: 775–785. pmid:18230677
- 5. Tsang TK, Cowling BJ, Fang VJ, Chan KH, Ip DKM, Leung GM, et al. Influenza A virus shedding and infectivity in households. J Infect Dis. 2015;212: 1420–1428. pmid:25883385
- 6. Asher J, Lemenuel-Diot Id A, Clay Id M, Durham DP, Mier-Y-Teran-Romeroid L, Arguello CJ, et al. Novel modelling approaches to predict the role of antivirals in reducing influenza transmission. PLoS Comput Biol. 2023;19: e1010797. pmid:36608108
- 7. Gonçalves A, Maisonnasse P, Donati F, Albert M, Behillil S, Contreras V, et al. SARS-CoV-2 viral dynamics in non-human primates. PLoS Comput Biol. 2021;17: e1008785. pmid:33730053
- 8. Heitzman-Breen N, Ciupe SM. Modeling within-host and aerosol dynamics of SARS-CoV-2: The relationship with infectiousness. PLoS Comput Biol. 2022;18: e1009997. pmid:35913988
- 9. Eisinger RW, Dieffenbach CW, Fauci AS. HIV viral load and transmissibility of HIV infection: Undetectable equals untransmittable. JAMA. 2019;321: 451–452. pmid:30629090
- 10. Fraser C, Lythgoe K, Leventhal GE, Shirreff G, Hollingsworth TD, Alizon S, et al. Virulence and pathogenesis of HIV-1 infection: an evolutionary perspective. Science. 2014;343. pmid:24653038
- 11. Wilson DP, Law MG, Grulich AE, Cooper DA, Kaldor JM. Relation between HIV viral load and infectiousness: a model-based analysis. The Lancet. 2008;372: 314–320. pmid:18657710
- 12. Ke R, Martinez PP, Smith RL, Gibson LL, Mirza A, Conte M, et al. Daily longitudinal sampling of SARS-CoV-2 infection reveals substantial heterogeneity in infectiousness. Nature Microbiology 2022 7:5. 2022;7: 640–652. pmid:35484231
- 13. Kissler SM, Fauver JR, Mack C, Olesen SW, Tai C, Shiue KY, et al. Viral dynamics of acute SARS-CoV-2 infection and applications to diagnostic and public health strategies. PLoS Biol. 2021;19: e3001333. pmid:34252080
- 14. Kissler SM, Fauver JR, Mack C, Tai CG, Breban MI, Watkins AE, et al. Viral dynamics of SARS-CoV-2 variants in vaccinated and unvaccinated persons. New England Journal of Medicine. 2021;385: 2489–2491. pmid:34941024
- 15. Stankiewicz Karita HC, Dong TQ, Johnston Christine, Neuzil KM, Paasche-Orlow MK, Kissinger PJ, et al. Trajectory of viral RNA load among persons with incident SARS-CoV-2 G614 infection (wuhan strain) in association with COVID-19 symptom onset and severity. pmid:35006245
- 16. Challenger JD, Foo CY, Wu Y, Yan AWC, Marjaneh MM, Liew F, et al. Modelling upper respiratory viral load dynamics of SARS-CoV-2. BMC Med. 2022;20. pmid:35022051
- 17. Ahmad A, Eze K, Noulin N, Horvathova V, Murray B, Baillet M, et al. EDP-938, a respiratory syncytial virus inhibitor, in a human virus challenge. N Engl J Med. 2022;386: 655–666. pmid:35172056
- 18.
CDC. Key Facts About Influenza (Flu). Available: https://www.cdc.gov/flu/about/keyfacts.htm
- 19. Handel A, Longini IM, Antia R. Neuraminidase inhibitor resistance in influenza: Assessing the danger of its generation and spread. PLoS Comput Biol. 2007;3: e240. pmid:18069885
- 20. Du Z, Nugent C, Galvani AP, Krug RM, Meyers LA. Modeling mitigation of influenza epidemics by baloxavir. Nat Commun. 2020;11. pmid:32487990
- 21. Perelson AS, Ribeiro RM, Phan T. An explanation for SARS-CoV-2 rebound after Paxlovid treatment. medRxiv. 2023; 2023.05.30.23290747. pmid:37398088
- 22. Gonçalves A, Bertrand J, Ke R, Comets E, Lamballerie X, Malvy D, et al. Timing of antiviral treatment initiation is critical to reduce SARS-CoV-2 viral load. CPT Pharmacometrics Syst Pharmacol. 2020;9: 509–514. pmid:32558354
- 23. Goyal A, Cardozo-Ojeda EF, Schiffer JT. Potency and timing of antiviral therapy as determinants of duration of SARS-CoV-2 shedding and intensity of inflammatory response. Sci Adv. 2020;6. pmid:33097472
- 24. Kim KS, Ejima K, Iwanami S, Fujita Y, Ohashi H, Koizumi Y, et al. A quantitative model used to compare within-host SARS-CoV-2, MERS-CoV, and SARS-CoV dynamics provides insights into the pathogenesis and treatment of SARS-CoV-2. PLoS Biol. 2021;19: e3001128. pmid:33750978
- 25. Baccam P, Beauchemin C, Macken CA, Hayden FG, Perelson AS. Kinetics of influenza A virus infection in humans. J Virol. 2006;80: 7590–9. pmid:16840338
- 26. Böhmer MM, Buchholz U, Corman VM, Hoch M, Katz K, Marosevic D V, et al. Investigation of a COVID-19 outbreak in Germany resulting from a single travel-associated primary case: a case series. 2020 [cited 10 Jan 2024]. pmid:32422201
- 27. Banerjee S, Guedj J, Ribeiro RM, Moses M, Perelson AS. Estimating biologically relevant parameters under uncertainty for experimental within-host murine West Nile virus infection. J R Soc Interface. 2016;13. pmid:27075003
- 28. Ben-Shachar R, Koelle K. Minimal within-host dengue models highlight the specific roles of the immune response in primary and secondary dengue infections. J R Soc Interface. 2014;12: 20140886–20140886. pmid:25519990
- 29. Ben-Shachar R, Schmidler S, Koelle K. Drivers of inter-individual variation in dengue viral load dynamics. Ferguson NM, editor. PLoS Comput Biol. 2016;12: e1005194. pmid:27855153
- 30. Ben-Shachar R, Koelle K. Transmission-clearance trade-offs indicate that dengue virulence evolution depends on epidemiological context. Nat Commun. 2018;9. pmid:29907741
- 31. Clapham HE, Tricou V, Van Vinh Chau N, Simmons CP, Ferguson NM. Within-host viral dynamics of dengue serotype 1 infection. J R Soc, Interface. 2014;11: 504–507. pmid:24829280
- 32. Clapham HE, Quyen TH, Kien DTH, Dorigatti I, Simmons CP, Ferguson NM, et al. Modelling virus and antibody dynamics during dengue virus infection suggests a role for antibody in virus clearance. Antia R, editor. PLoS Comput Biol. 2016;12: e1004951. pmid:27213681
- 33. Aid M, Abbink P, Larocca RA, Boyd M, Nityanandam R, Nanayakkara O, et al. Zika virus persistence in the central nervous system and lymph nodes of rhesus monkeys. Cell. 2017;169: 610–620.e14. pmid:28457610
- 34. Osuna CE, Lim S-Y, Deleage C, Griffin BD, Stein D, Schroeder LT, et al. Zika viral dynamics and shedding in rhesus and cynomolgus macaques. Nat Med. 2016;22: 1448–1455. pmid:27694931
- 35. Best K, Guedj J, Madelain V, de Lamballerie X, Lim S-Y, Osuna CE, et al. Zika plasma viral dynamics in nonhuman primates provides insights into early infection and antiviral strategies. Proc Natl Acad Sci U S A. 2017;114: 8847–8852. pmid:28765371
- 36. Patel K, Kirkpatrick CM, Nieforth KA, Chanda S, Zhang Q, McClure M, et al. Respiratory syncytial virus-A dynamics and the effects of lumicitabine, a nucleoside viral replication inhibitor, in experimentally infected humans. Journal of Antimicrobial Chemotherapy. 2019;74: 442–452. pmid:30376079
- 37. Wethington D, Harder O, Uppulury K, Stewart WCL, Chen P, King T, et al. Mathematical modelling identifies the role of adaptive immunity as a key controller of respiratory syncytial virus in cotton rats. J R Soc Interface. 2019;16. pmid:31771450
- 38. Pawelek KA, Huynh GT, Quinlivan M, Cullinane A, Rong L, Perelson AS. Modeling within-host dynamics of influenza virus infection including immune responses. Antia R, editor. PLoS Comput Biol. 2012;8: e1002588. pmid:22761567
- 39. Hadjichrysanthou C, Cauët E, Lawrence E, Vegvari C, de Wolf F, Anderson RM, et al. Understanding the within-host dynamics of influenza A virus: from theory to clinical implications. J R Soc Interface. 2016;13: 499–522. pmid:27278364
- 40. Smith AM. Host-pathogen kinetics during influenza infection and coinfection: insights from predictive modeling. Immunol Rev. 2018;285: 97–112. pmid:30129197
- 41. Smith AM, Perelson AS. Influenza A virus infection kinetics: Quantitative data and models. Wiley Interdiscip Rev Syst Biol Med. 2011;3: 429–445. pmid:21197654
- 42. Koelle K, Farrell AP, Brooke CB, Ke R. Within-host infectious disease models accommodating cellular coinfection, with an application to influenza †. Virus Evol. 2019;5: 18. pmid:31304043
- 43. Perelson AS, Ke R. Mechanistic modelling of SARS-CoV-2 and other infectious diseases and the effects of therapeutics. Clin Pharmacol Ther. 2021. pmid:33410134
- 44. Ejima K, Kim KS, Ludema C, Bento AI, Iwanami S, Fujita Y, et al. Estimation of the incubation period of COVID-19 using viral load data. Epidemics. 2021;35: 100454. pmid:33773195
- 45. Néant N, Lingas G, Le Hingrat Q, Ghosn J, Engelmann I, Lepiller Q, et al. Modeling SARS-CoV-2 viral kinetics and association with mortality in hospitalized patients from the French COVID cohort. Proc Natl Acad Sci U S A. 2021;118: e2017962118. pmid:33536313
- 46. Yang Y, Guo L, Yuan J, Xu Z, Gu Y, Zhang J, et al. Viral and antibody dynamics of acute infection with SARS-CoV-2 omicron variant (B.1.1.529): a prospective cohort study from Shenzhen, China. Lancet Microbe. 2023;4: e632–e641. pmid:37459867
- 47. He X, Lau EHY, Wu P, Deng X, Wang J, Hao X, et al. Temporal dynamics in viral shedding and transmissibility of COVID-19. Nat Med. 2020;26: 672–675. pmid:32296168
- 48. Sadria M, Layton AT. Modeling within-host SARS-CoV-2 infection dynamics and potential treatments. Viruses 2021, Vol 13, Page 1141. 2021;13: 1141. pmid:34198530
- 49. Wang S, Pan Y, Wang Q, Miao H, Brown AN, Rong L. Modeling the viral dynamics of SARS-CoV-2 infection. Math Biosci. 2020;328: 108438. pmid:32771304
- 50. Blanco-Rodríguez R, Du X, Hernández-Vargas E. Computational simulations to dissect the cell immune response dynamics for severe and critical cases of SARS-CoV-2 infection. Comput Methods Programs Biomed. 2021;211. pmid:34610492
- 51. Shudo E, Ribeiro RM, Perelson AS. Modeling HCV kinetics under therapy using PK and PD information. Expert Opin Drug Metab Toxicol. 2009;5: 321–332. pmid:19331594
- 52. Dahari H, Lo A, Ribeiro RM, Perelson AS. Modeling hepatitis C virus dynamics: Liver regeneration and critical drug efficacy. J Theor Biol. 2007;247: 371–381. pmid:17451750
- 53. Zitzmann C, Kaderali L, Perelson AS. Mathematical modeling of hepatitis C RNA replication, exosome secretion and virus release. PLoS Comput Biol. 2020;16: e1008421. pmid:33151933
- 54. Perelson A, Shudo E, Ribeiro R. Modeling hepatitis C virus kinetics under therapy using pharmacokinetic and pharmacodynamic information. Expert Opin Drug Metab Toxicol. 2009;5: 321–332.
- 55. Dahari H, Major M, Zhang X, Mihalik K, Rice CM, Perelson AS, et al. Mathematical modeling of primary hepatitis C infection: Noncytolytic clearance and early blockage of virion production. Gastroenterology. 2005;128: 1056–1066. pmid:15825086
- 56. Guedj J, Dahari H, Rong L, Sansone ND, Nettles RE, Cotler SJ, et al. Modeling shows that the NS5A inhibitor daclatasvir has two modes of action and yields a shorter estimate of the hepatitis C virus half-life. Proc Natl Acad Sci U S A. 2013;110: 3991–6. pmid:23431163
- 57. Rong L, Perelson AS. Treatment of hepatitis C virus infection with interferon and small molecule direct antivirals: viral kinetics and modeling. Crit Rev Immunol. 2010;30: 131–48. pmid:20370626
- 58. Zitzmann C, Kaderali L. Mathematical analysis of viral replication dynamics and antiviral treatment strategies: From basic models to age-based multi-scale modeling. Front Microbiol. Frontiers; 2018. p. 1546. pmid:30050523
- 59. Ribeiro RM, Lo A, Perelson AS. Dynamics of hepatitis B virus infection. Microbes Infect. 2002;4: 829–835. pmid:12270730
- 60. Hailegiorgis A, Ishida Y, Collier N, Imamura M, Shi Z, Reinharz V, et al. Modeling suggests that virion production cycles within individual cells is key to understanding acute hepatitis B virus infection kinetics. PLoS Comput Biol. 2023;19: e1011309. pmid:37535676
- 61. Nowak MA, Bonhoeffer S, Hill AM, Boehme R, Thomas HC, Mcdade H. Viral dynamics in hepatitis B virus infection. Proc Natl Acad Sci U S A. 1996;93: 4398. pmid:8633078
- 62. Goyal A, Ribeiro RM, Perelson AS. The role of infected cell proliferation in the clearance of acute HBV infection in humans. Viruses. 2017;9. pmid:29156567
- 63. Dahari H, Shudo E, Ribeiro RM, Perelson AS. Modeling complex decay profiles of hepatitis B virus during antiviral therapy. Hepatology. 2009;49: 32–38. pmid:19065674
- 64. Ciupe SM, Ribeiro RM, Perelson AS. Antibody responses during hepatitis B viral infection. PLoS Comput Biol. 2014;10: e1003730. pmid:25078553
- 65. Ciupe SM, Ribeiro RM, Nelson PW, Perelson AS. Modeling the mechanisms of acute hepatitis B virus infection. J Theor Biol. 2007;247: 23. pmid:17428501
- 66. Ciupe SM, Ribeiro RM, Nelson PW, Dusheiko G, Perelson AS. The role of cells refractory to productive infection in acute hepatitis B viral dynamics. Proc Natl Acad Sci U S A. 2007;104: 5050–5055. pmid:17360406
- 67. Perelson AS, Essunger P, Cao Y, Vesanen M, Hurley A, Saksela K, et al. Decay characteristics of HIV-1-infected compartments during combination therapy. Nature. 1997;387: 188–191. pmid:9144290
- 68. Kim H, Perelson AS. Viral and latent reservoir persistence in HIV-1-infected patients on therapy. PLoS Comput Biol. 2006;2: 1232–1247. pmid:17040122
- 69. Ho DD. HIV-1 dynamics in vivo. J Biol Regul Homeost Agents. 1995;9: 76–77. pmid:8782010
- 70. Dixit NM, Markowitz M, Ho DD, Perelson AS. Estimates of intracellular delay and average drug efficacy from viral load data of HIV-infected individuals under antiretroviral therapy. Antivir Ther. 2004;9: 237–246. pmid:15134186
- 71. Rong L, Perelson AS. Modeling latently infected cell activation: Viral and latent reservoir persistence, and viral blips in HIV-infected patients on potent therapy. Antia R, editor. PLoS Comput Biol. 2009;5: e1000533. pmid:19834532
- 72. Rong L, Feng Z, Perelson AS. Emergence of HIV-1 drug resistance during antiretroviral treatment. Bull Math Biol. 2007;69: 2027–2060. pmid:17450401
- 73. Ribeiro RM, Li H, Wang S, Stoddard MB, Learn GH, Korber BT, et al. Quantifying the Diversification of Hepatitis C Virus (HCV) during Primary Infection: Estimates of the In Vivo Mutation Rate. PLoS Pathog. 2012;8. pmid:22927817
- 74. Ribeiro RM, Perelson AS. The analysis of HIV dynamics using mathematical models. 2004.
- 75. Perelson AS, Ribeiro RM. Modeling the within-host dynamics of HIV infection. BMC Biol. 2013;11. pmid:24020860
- 76. Perelson AS, Ribeiro RM. Estimating drug efficacy and viral dynamic parameters: HIV and HCV. Stat Med. 2008;27: 4647–4657. pmid:17960579
- 77. Pagel C, Yates C, Yates CA. Role of mathematical modelling in future pandemic response policy. 2022 [cited 15 Dec 2023]. pmid:36109042
- 78. Smith DRM, Chervet S, Pinettes T, Shirreff G, Jijón S, Oodally A, et al. How have mathematical models contributed to understanding the transmission and control of SARS-CoV-2 in healthcare settings? A systematic search and review. Journal of Hospital Infection. 2023;141: 132–141. pmid:37734676
- 79. Panovska-Griffiths J, Kerr CC, Waites W, Stuart RM. Mathematical modeling as a tool for policy decision making: Applications to the COVID-19 pandemic. 2021 [cited 15 Dec 2023].
- 80. Wang J. Mathematical models for COVID-19: applications, limitations, and potentials. [cited 15 Dec 2023]. pmid:32724894
- 81. James LP, Salomon JA, Buckee CO, Menzies NA. The use and misuse of mathematical modeling for infectious disease policymaking: Lessons for the COVID-19 pandemic. Review Medical Decision Making. 2021;41: 379–385. pmid:33535889
- 82. Nguyen VK, Klawonn F, Mikolajczyk R, Esteban A, Centre H, Chi H. Analysis of practical identifiability of a viral infection model. PLoS One. 2016;11: 1–30. pmid:28036339
- 83. Roda WC, Varughese MB, Han D, Li MY. Why is it difficult to accurately predict the COVID-19 epidemic? Infect Dis Model. 2020;5: 271–281. pmid:32289100
- 84. Gallo L, Frasca M, Latora V, Russo G. Lack of practical identifiability may hamper reliable predictions in COVID-19 epidemic models. Sci Adv. 2022;8: 5234. pmid:35044820
- 85. Beauchemin CAA, Handel A. A review of mathematical models of influenza A infections within a host or cell culture: Lessons learned and challenges ahead. BMC Public Health. 2011;11: 1–15.
- 86. Ciupe SM, Tuncer N. Identifiability of parameters in mathematical models of SARS-CoV-2 infections in humans. Scientific Reports 2022 12:1. 2022;12: 1–14. pmid:36030320
- 87. Miao H, Xia X, Perelson AS, Wu H. On identifiability of nonlinear ODE models and applications in viral dynamics. SIAM Rev Soc Ind Appl Math. 2011;53: 3. pmid:21785515
- 88. Tuncer N, Martcheva M. Determining reliable parameter estimates for within-host and within-vector models of Zika virus. J Biol Dyn. 2021;15: 430–454. pmid:34463605
- 89. Reeves DB, Rolland M, Dearlove BL, Li Y, Robb ML, Schiffer JT, et al. Timing HIV infection with a simple and accurate population viral dynamics model. J R Soc Interface. 2021;18. pmid:34186015
- 90. Wu H, Zhu H, Miao H, Perelson AS. Parameter identifiability and estimation of HIV/AIDS dynamic models. Bull Math Biol. 2008;70: 785–799. pmid:18247096
- 91. Li Q, Guan X, Wu P, Wang X, Zhou L, Tong Y, et al. Early transmission dynamics in Wuhan, China, of novel coronavirus–infected pneumonia. New England Journal of Medicine. 2020;382: 1199–1207. pmid:31995857
- 92. Linton NM, Kobayashi T, Yang Y, Hayashi K, Akhmetzhanov AR, Jung SM, et al. Incubation period and other epidemiological characteristics of 2019 novel coronavirus infections with right truncation: A Statistical analysis of publicly available case data. Journal of Clinical Medicine 2020, Vol 9, Page 538. 2020;9: 538. pmid:32079150
- 93. Guan W, Ni Z, Hu Y, Liang W, Ou C, He J, et al. Clinical characteristics of coronavirus disease 2019 in China. N Engl J Med. 2020;382: 1708–1720. pmid:32109013
- 94. Backer Jantien A, Don K, Jacco W. Incubation period of 2019 novel coronavirus (2019-nCoV) infections among travellers from Wuhan, China. Euro Surveill. 2020;25: 20–28. pmid:32046819
- 95. Bi Q, Wu Y, Mei S, Ye C, Zou X, Zhang Z, et al. Epidemiology and transmission of COVID-19 in 391 cases and 1286 of their close contacts in Shenzhen, China: a retrospective cohort study. Lancet Infect Dis. 2020;20: 911–919. pmid:32353347
- 96. Lauer SA, Grantz KH, Bi Q, Jones FK, Zheng Q, Meredith HR, et al. The incubation period of coronavirus disease 2019 (CoVID-19) from publicly reported confirmed cases: Estimation and application. Ann Intern Med. 2020;172: 577–582. pmid:32150748
- 97. Curran-Sebastian J, Das R, Fyles M, Han Y, Lewkowicz H, Pang X, et al. Challenges in control of COVID-19: short doubling time and long delay to effect of interventions. 2021. pmid:34053267
- 98. Ki M, 2019-nCoV TF for. Epidemiologic characteristics of early cases with 2019 novel coronavirus (2019-nCoV) disease in Korea. Epidemiol Health. 2020;42. pmid:32035431
- 99. Sun K, Chen J, Viboud C. Early epidemiological analysis of the coronavirus disease 2019 outbreak based on crowdsourced data: a population-level observational study. Lancet Digit Health. 2020;2: e201. pmid:32309796
- 100. Kraemer MUG, Yang CH, Gutierrez B, Wu CH, Klein B, Pigott DM, et al. The effect of human mobility and control measures on the COVID-19 epidemic in China. Science. 2020;368: 493–497. pmid:32213647
- 101. Jiang X, Rayner S, Luo MH. Does SARS-CoV-2 has a longer incubation period than SARS and MERS? J Med Virol. 2020;92: 476. pmid:32056235
- 102. You C, Deng Y, Hu W, Sun J, Lin Q, Zhou F, et al. Estimation of the time-varying reproduction number of COVID-19 outbreak in China. Int J Hyg Environ Health. 2020;228: 113555. pmid:32460229
- 103. Qiu C, Deng Z, Xiao Q, Shu Y, Deng Y, Wang H, et al. Transmission and clinical characteristics of coronavirus disease 2019 in 104 outside-Wuhan patients, China. J Med Virol. 2020;92: 2027. pmid:32369217
- 104. Sanche S, Lin YT, Xu C, Romero-Severson E, Hengartner N, Ke R. High contagiousness and rapid spread of severe acute respiratory syndrome coronavirus 2. Emerg Infect Dis. 2020;26: 1470–1477. pmid:32255761
- 105.
Samuel CE. Antiviral actions of interferons. Clinical Microbiology Reviews. American Society for Microbiology (ASM); 2001. pp. 778–809. https://doi.org/10.1128/CMR.14.4.778–809.2001
- 106. Levy DE, García-Sastre A. The virus battles: IFN induction of the antiviral state and mechanisms of viral evasion. Cytokine Growth Factor Rev. 2001;12: 143–156. pmid:11325598
- 107. García-Sastre A, Biron CA. Type 1 interferons and the virus-host relationship: A Lesson in détente. Science (1979). 2006;312: 879–882. pmid:16690858
- 108. Hou YJ, Okuda K, Edwards CE, Martinez DR, Asakura T, Dinnon KH, et al. SARS-CoV-2 reverse genetics reveals a variable infection gradient in the respiratory tract. Cell. 2020;182: 429–446.e14. pmid:32526206
- 109. Sachak-Patwa R, Lafferty EI, Schmit CJ, Thompson RN, Byrne HM. A target-cell limited model can reproduce influenza infection dynamics in hosts with differing immune responses. J Theor Biol. 2023;567: 111491. pmid:37044357
- 110. Handel A, Longini IM, Antia R. Towards a quantitative understanding of the within-host dynamics of influenza A infections. J R Soc Interface. 2010;7: 35–47. pmid:19474085
- 111. Smith AP, Moquin DJ, Bernhauerova V, Smith AM. Influenza virus infection model with density dependence supports biphasic viral decay. Front Microbiol. 2018;9. pmid:30042759
- 112. Korosec CS, Betti MI, Dick DW, Ooi HK, Moyles IR, Wahl LM, et al. Multiple cohort study of hospitalized SARS-CoV-2 in-host infection dynamics: Parameter estimates, identifiability, sensitivity and the eclipse phase profile. J Theor Biol. 2023;564. pmid:36894132
- 113.
Package Rsmlx. [cited 13 Nov 2023]. Available: https://cran.r-project.org/web/packages/Rsmlx/index.html
- 114. Maiwald T, Hass H, Steiert B, Vanlier J, Engesser R, Raue A, et al. Driving the model to its limit: Profile likelihood based model reduction. 2016 [cited 3 Feb 2023]. pmid:27588423
- 115. Kreutz C, Raue A, Kaschek D, Timmer J. Profile likelihood in systems biology. FEBS Journal. 2013. pp. 2564–2571. pmid:23581573
- 116. Raue A, Kreutz C, Maiwald T, Bachmann J, Schilling M, Klingmüller U, et al. Structural and practical identifiability analysis of partially observed dynamical models by exploiting the profile likelihood. Bioinformatics. 2009;25: 1923–1929. pmid:19505944
- 117. Wieland FG, Hauber AL, Rosenblatt M, Tönsing C, Timmer J. On structural and practical identifiability. Curr Opin Syst Biol. 2021;25: 60–69.
- 118. Killingley B, Mann A✉. Safety, tolerability and viral kinetics during SARS-CoV-2 human challenge in young adults. [cited 10 Aug 2023]. pmid:35361992
- 119. Nguyen VK, Hernandez-Vargas EA. Identifiability Challenges in Mathematical Models of Viral Infectious Diseases. IFAC-PapersOnLine. 2015;48: 257–262.
- 120. Hernandez-Vargas EA, Velasco-Hernandez JX. In-host Mathematical Modelling of COVID-19 in Humans. Annu Rev Control. 2020;50: 448–456. pmid:33020692
- 121. Ogando NS, Dalebout TJ, Zevenhoven-Dobbe JC, Limpens RWAL, van der Meer Y, Caly L, et al. SARS-coronavirus-2 replication in Vero E6 cells: replication kinetics, rapid adaptation and cytopathology. J Gen Virol. 2020;101: 925. pmid:32568027
- 122. Goyal A, Reeves DB, Fabian Cardozo-Ojeda E, Schiffer JT, Mayer BT. Viral load and contact heterogeneity predict sars-cov-2 transmission and super-spreading events. Elife. 2021;10: 1–63. pmid:33620317
- 123. Padmanabhan P, Dixit N. Modeling suggests a mechanism of synergy between hepatitis C virus entry inhibitors and drugs of other classes. CPT Pharmacometrics Syst Pharmacol. 2015;4: 445–453. pmid:26380153
- 124. Castroid M, De Boerid RJ. Testing structural identifiability by a simple scaling method. 2020 [cited 23 Jan 2024]. pmid:33141821
- 125.
McNab F, Mayer-Barber K, Sher A, Wack A, O’Garra A. Type I interferons in infectious disease. Nature Reviews Immunology. Nature Publishing Group; 2015. pp. 87–103. https://doi.org/10.1038/nri3787 pmid:25614319
- 126. Fensterl V, Sen GC. Interferons and viral infections. BioFactors. 2009;35: 14–20. pmid:19319841
- 127. Stafford MA, Corey L, Cao Y, Daar ES, Ho DD, Perelson AS. Modeling plasma virus concentration during primary HIV infection. J Theor Biol. 2000;203: 285–301. pmid:10716909