Figures
Abstract
Eastern equine encephalitis virus (EEEV) is an arthropod-borne, positive-sense RNA alphavirus posing a substantial threat to public health. Unlike similar viruses such as SARS-CoV-2, EEEV replicates efficiently in neurons, producing progeny viral particles as soon as 3–4 hours post-infection. EEEV infection, which can cause severe encephalitis with a human mortality rate surpassing 30%, has no licensed, targeted therapies, leaving patients to rely on supportive care. Although the general characteristics of EEEV infection within the host cell are well-studied, it remains unclear how these interactions lead to rapid production of progeny viral particles, limiting development of antiviral therapies. Here, we present a novel rule-based model that describes attachment, entry, uncoating, replication, assembly, and export of both infectious virions and virus-like particles within mammalian cells. Additionally, it quantitatively characterizes host ribosome activity in EEEV replication via a model parameter defining ribosome density on viral RNA. To calibrate the model, we performed experiments to quantify viral RNA, protein, and infectious particle production during acute infection. We used Bayesian inference to calibrate the model, discovering in the process that an additional constraint was required to ensure consistency with previous experimental observations of a high ratio between the amounts of full-length positive-sense viral genome and negative-sense template strand. Overall, the model recapitulates the experimental data and predicts that EEEV rapidly concentrates host ribosomes densely on viral RNA. Dense packing of host ribosomes was determined to be critical to establishing the characteristic positive to negative RNA strand ratio because of its role in governing the kinetics of transcription. Sensitivity analysis identified viral transcription as the critical step for infectious particle production, making it a potential target for future therapeutic development.
Author summary
Eastern equine encephalitis virus (EEEV) is a positive-sense RNA virus transmitted via mosquitoes. It can cause lethal disease in humans with a high mortality rate, exceeding 30%. There are no licensed targeted treatments or vaccines currently available. We constructed a rule-based model that describes the mechanisms and the resulting dynamics of EEEV replication inside a mammalian cell. With a novel experimental dataset that measures the concentrations of EEEV RNA, proteins, and infectious viral particles over time in combination with a biological constraint based on known replication characteristics, we calibrated the model rate parameters with a Bayesian inference method that estimates parameter distributions and quantifies the confidence of model predictions. The resulting calibrated model captures key features of the experimental dataset. Model analyses identified a tight constraints in the RNA replication dynamics among the genome, the negative-sense template, and the subgenome, which is used for structural protein synthesis. The calibrated model demonstrates the potential for EEEV to rapidly recruit and densely pack host ribosomes on its viral RNA to accelerate replication. Sensitivity analysis found that parameters involving viral transcription, particularly of the genome and subgenome, are most critical for infectious viral particle production.
Citation: Larkin CI, Dunn MD, Shoemaker JE, Klimstra WB, Faeder JR (2025) A detailed kinetic model of Eastern equine encephalitis virus replication in a susceptible host cell. PLoS Comput Biol 21(6): e1013082. https://doi.org/10.1371/journal.pcbi.1013082
Editor: Rustom Antia, Emory University, UNITED STATES OF AMERICA
Received: December 25, 2024; Accepted: April 22, 2025; Published: June 4, 2025
Copyright: © 2025 Larkin et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: Experimental data and code are available on the following GitHub repository: https://github.com/LarkinIt/EEEV-Replication-Model. Data generated from calibration and used for downstream analysis is available on Zenodo at http://doi.org/10.5281/zenodo.15080538.
Funding: C.I.L. was supported by NSF GRFP grant 1747452. J.E.S. acknowledges support from NIH NIAID grant R21AI153882, NSF CAREER #1943777, NIH NIAID 1R21AI174080, NIH NIAID R21AI151418, and NIH NIAID R01AI180382. W.B.K. acknowledges support from NIH NIAID grants R01AI160188 and R01AI153209. J.R.F. acknowledges support from NIH grants P41GM10371 and R01GM115805. Experimental studies were performed in the University of Pittsburgh Regional Biocontainment Laboratory, which is supported by NIH NIAID UC7AI180311. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors of this manuscript have the following competing interests. W.B.K. is a co-founder of Advanced Virology. The Klimstra laboratory has received unrelated funding in support of sponsored research agreements from SAB Biotherapeutics. The remaining authors have declared no competing interest.
Introduction
Eastern equine encephalitis virus (EEEV) is an arthropod-borne, neurovirulent virus endemic to the eastern seaboard of the United States. When neuroinvasive, it can cause severe disease characterized by meningitis and encephalitis with a human mortality rate between 30-70% [1]. Further, the overwhelming majority of survivors live with permanent neurological damage [2,3]. Currently, treatment is limited to supportive care as there are no preventative or targeted therapies. Additionally, EEEV can also be aerosolized, which poses a significant bioterror threat [4–8]. As of December 2024, an outbreak of Eastern equine encephalitis (EEE) in the eastern United States has generated 19 human cases across 8 states [9,10]. A previous outbreak in 2019 produced 34 symptomatic cases and 12 deaths [11–13].
EEEV is a single-stranded positive-sense RNA virus belonging to the Alphavirus genus in the Togaviridae family. Among alphaviruses, EEEV and the related Western equine encephalitis virus are unique in avoiding infection of lymphoid tissues due to restricted replication in hematopoietic cells [14–17]. EEEV replicates instead in osteoblasts, fibroblasts, and neurons by leveraging the efficient binding to the ubiquitously-expressed host proteoglycan, heparan sulfate, as well as several protein receptors [18–21]. The replication cycle is rapid, with viral particles being produced as early as 3-4 hours post-infection (hpi) [22] and cytopathic effects evident by 12 hpi [23]. The details of the EEEV replication cycle require further study because most previous studies of alphavirus replication were performed with laboratory strains of relatively avirulent arthritogenic alphaviruses, which likely differ from EEEV in key aspects. In this work, we develop a computational model of the dynamic replication strategy employed by EEEV in order to develop a better understanding of the mechanisms that contribute to the rapid timing and amplitude of particle production. This knowledge could help to guide the development of future therapeutic strategies.
Previous computational models have described intracellular replication in other positive-sense RNA viruses including Hepatitis C virus [24–30], SARS-CoV-2 [31–33], Dengue virus [30,34,35], and Coxsackieviruses [30,36]. However, despite its relevance as a human pathogen with limited treatment options, no computational model has been developed that specifically addresses the replication cycle of EEEV or any other pathogenic alphavirus. Given experimental limitations, a computational model could be an invaluable tool to characterize the intracellular mechanistic underpinnings of EEEV replication.
Here, we present a detailed mechanistic model of the EEEV replication cycle that encodes alphavirus-specific mechanisms starting from particle attachment, culminating in the production and egress of both infectious and non-infectious progeny particles. This model contains detailed replication steps, including transcription, translation, and post-translational processing. Unlike previous models of viral replication in which ribosome density has been experimentally determined, we treat ribosome density as a variable parameter and use the length of RNA template strands to determine the polysome size (number of ribosomes in the translation complex) for translation. We estimate the value of this and other parameters by using Bayesian inference to calibrate the model based on experimental measurements of viral protein, RNA, and infectious particle dynamics during the early stages of infection. Using the calibrated model, we find that the recruitment and distribution of host ribosomes can play a critical role in the EEEV replication cycle onset and the resulting timing and magnitude of infectious particle production. Further analysis of the model suggests that the speed and regulation of RNA synthesis are the most critical determinants for the amount of infectious particles produced.
Results
A rule-based kinetic model of alphavirus replication
In order to obtain insights into alphavirus replication dynamics, we dissected the known replication steps into a series of discrete rate rules using the rule-based modeling language, BioNetGen [37] (Fig 1). Each biological process in the replication cycle is described by a rate rule based on the kinetic law of mass action. The remainder of this subsection presents an overview of the processes represented in the model. Additional details are provided in Materials and methods.
Entry (Steps 1–4). Infection begins with a viral particle binding to a host attachment factor to enter the cell. Subsequently, the viral envelope and capsid uncoat to release the viral genome (posRNA), which swiftly recruits host ribosomes to form the translation complex (TC:g). At the same time, the virus utilizes host lipid bilayers to form a replication spherule around replication machinery as shown with the orange-shaded region. Replication (Steps 5–24). Additional copies of viral RNA and proteins are synthesized. Host ribosomes synthesize the nonstructural polyproteins (P1234 and P123) from the genome and the structural polyprotein from the subgenome (sgRNA). To transcribe positive-sense RNA, a negative-sense template (negRNA) must first be synthesized by a competent replicase complex (P123-4). P123-4 is cleaved again into C1234, which catalyzes only positive-sense strand synthesis. Positive-sense RNA products are transcribed from either free or duplex forms of negative-sense strand template. There are two separate promoters on the negative-sense template strand that encode either posRNA or sgRNA, denoted as RC:pos or RC:sg respectively. sgRNA is bound by host ribosomes (TC:sg) to synthesize the structural polyprotein. Structural polyprotein processing (Steps 25–29). Once translated, the structural polyprotein undergoes a series of modifications through the secretory pathway yielding viral capsid and spike proteins. Assembly and Export (Steps 30–34). Before assembly, the capsid protein may bind to the genome. Both free and RNA-bound capsids assemble with spike proteins to form progeny particles. Plaque-forming units (PFUs) are infectious particles containing the infectious genome while virus-like particles (VLPs) are non-infectious due to the lack of a genome. Created with BioRender.com
The first three model steps describe viral entry: attachment of an infectious viral particle to a host cell attachment factor in the cell membrane (Step 1), entry into the host cell (Step 2), and uncoating of the nucleocapsid to release the viral genome, which is then bound by host ribosomes (Step 3). After uncoating, it is assumed that the unbound host attachment factor is recycled to the cell surface (Step 4).
A unique aspect of positive-sense RNA viruses is the ability to form replication spherules (RS) using host-derived lipid membranes [38–41]. There are many potential functions of RS including concentration of the viral replication machinery, molecular trafficking of viral RNA and protein, and shielding from host immunosurveillance [42–45]. We incorporated RS into the model by including separate species for viral RNA located in the RS versus cytoplasm starting at Step 3.
With alphaviruses, the genome, denoted as posRNA in the model, is translated to make one of two possible nonstructural polyproteins (nsPs): P123 (Step 5) and P1234 (Step 6). These are differentiated by the presence or absence of nsP4, the alphaviral RNA-dependent RNA-polymerase. nsPs 1–3 serve numerous independent regulatory, replicative, and host-antagonizing functions but also synergize with nsP4 to create the replicase complex [46]. To catalyze RNA synthesis, nsP4 must be cleaved from the nascent polyprotein, P1234, but persists in a complex with the remaining polyprotein, denoted as P123-4 in Fig 1 (Step 7). This replicase complex subsequently binds to posRNA to form the replication complex RC:pos_neg (Step 8), from which negative template (negRNA) is transcribed (Step 9).
A second proteolytic cleavage forms the late replicase complex, which is labeled C1234 (Step 10), which catalyzes only positive-sense strand synthesis. This irreversible step is critical for shifting transcription away from synthesis of the negative-sense RNA template and toward production of the positive-sense RNA products needed to produce progeny particles. Given their complementary nature, the posRNA and negRNA can pair to form double-stranded RNA (dsRNA) (Step 11). Both dsRNA and negRNA may be used as templates for positive-sense synthesis by the late replicase complex, C1234 (Steps 12–15). Depending on which promoter the replicase binds, these complexes may generate either progeny copies of the full-length genome (posRNA) (Steps 16-17) or the 26S subgenome (sgRNA) (Steps 18–19). The model assumes that viral RNA molecules synthesized in the RS translocate to the cytoplasm following first-order kinetics (Steps 21–23).
The requirement of host ribosomes to translate viral proteins has been established for a wide array of viruses [47–50]. Similar to eukaryotic translation, several copies of viral proteins may be translated simultaneously by multiple host ribosomes bound to a single strand of viral RNA, forming a complex called the polysome [48,50–53]. Alphaviruses such as Sindbis virus have been shown to employ polysomes for viral translation [54–56]. Other mechanistic models of positive-sense RNA virus replication have accounted for polysomes by fixing the composition based on virus-specific experimental observations [24,36]. Although polysome formation has not been studied with EEEV specifically, we assume that it does occur and incorporate polysome formation by the addition of a model parameter, ribosome_distance, that describes the average distance in nucleotides between ribosomes in a polysome on a single strand of RNA. The number of ribosomes in a polysome is then determined by the length of the RNA being translated divided by ribosome_distance. The polysome size thus differs by a factor of roughly three for translation of the genome and subgenome, which is approximately one third the length of the genome. We estimate ribosome_distance during model calibration (see next subsection) with bounds based on empirical measurements of eukaryotic translation [57]. The model assumes that the rate of translation scales linearly with the number of ribosomes in a polysome.
In the model, the fraction of host ribosomes available to EEEV is fixed at 20% (BNID: 102344, [58]), as assumed in existing models [36]. Host ribosomes can bind and form polysomes with free progeny copies of the viral genome (posRNA) to translate additional copies of the nsPs (Step 20) or move out of the RS (Step 21). Ribosomes also bind sgRNA to translate the structural polyprotein (Step 24–25). A nascent copy of the structural polyprotein undergoes several post-translational processing steps: auto-proteolytic cleavage of the capsid from the structural polyprotein (Step 26), translocation of the remaining polyprotein to the endoplasmic reticulum (Step 27), cleavage of the remaining polyprotein into individual proteins throughout the secretory pathway (Step 28), and cleavage, glycosylation, and palmitoylation of the spike precursors into mature spike proteins (Step 29) [59].
Functional viral particles, known as plaque-forming units (PFUs), form when capsid proteins bind first to cytoplasmic genome (Step 30), followed by assembly with the mature spike proteins into infectious particles (Step 31) that bud from the cell membrane (Step 33) (reviewed in [60,61]). The model assumption that a genome-bound capsid is required for PFU assembly is supported by studies with the Chikungunya alphavirus, which has shown that PFU production decreases when genome-binding sites on the capsid are mutated [62]. The model also allows capsid self-assembly to occur without prior binding of the genome (Step 32), resulting in formation of noninfectious virus-like particles (VLPs) that can then bud from the cell membrane (Step 34). Formation of VLPs via this mechanism is supported by experiments with Venezuelan equine encephalitis virus that have shown that noninfectious VLPs are produced without the genome binding to the capsid, yielding genome-free particles [63]. Further, studies developing VLP-based vaccines for Eastern, Western, and Venezuelan equine encephalitis viruses have used vectors encoding the structural polyproteins for VLP production [64,65].
Overall, the model contains 36 species whose dynamics are defined with 60 rate rules. There are 36 model parameters, which include both initial conditions and rate constants for the individual replication steps (S1 Table). Of the 36 model parameters, 10 are fixed based on experimental observations (references in S1 Table). The remaining parameters are largely unknown, posing a significant obstacle to the biological relevance and predictive capability of the model. To obtain a better basis for estimating model parameters, we performed additional quantitative experiments to measure the concentrations of viral RNA, proteins, and infectious particle production in immune-incompetent fibroblast cells over the first 13 hours post-infection (hpi) with EEEV (see Materials and methods). We then used this data to carry out Bayesian inference of these unknown model parameters, as we now describe.
Calibration of model parameters with Bayesian parameter estimation
Previous studies have shown that Bayesian methods can be effective in finding optimal model parameter sets that best fit experimental data from biological systems [66–68]. Unlike gradient-based optimization methods, which only return a single set of model parameters that fit the provided dataset, Bayesian methods return an ensemble of parameter set samples fitting the experimental data. Here, we used a recently-developed Bayesian method, preconditioned Monte Carlo (PMC), which combines Sequential Monte Carlo with Normalizing Flows (NFs) to improve the efficiency of moves in high-dimensional parameter spaces (see Materials and methods for details) [69]. We utilized PMC to estimate the 26 unknown model parameters of our EEEV replication model, as well as two experimental scaling factors, using the time course data of viral proteins, RNA, and progeny PFU production (Fig 2A).
(A) Time courses of (left) nonstructural (nsP) and structural (sP) polyproteins, (middle) RNA, and (right) infectious viral particles (PFU). The mean and 95% weighted credible interval of the model dynamics are plotted with the line and shaded regions, respectively. The mean and standard deviation of the experimental data are shown as symbols with error (n = 3 replicates). Experimental measurements of protein expression use polyprotein luminescence reported in relative light units (RLUs). (B) Histograms of the (left) experimental data error (a measure of the goodness-of-fit, defined in Materials and methods), (middle) cumulative RNA genome strand ratio (see text), and (right) cumulative particle-to-PFU ratio obtained (see text) from model calibration. The top row of panels A and B (labeled “No constraint”) presents calibration results using only experimental data, whereas the bottom row (labeled “With constraint”) shows results using both experimental data and the RNA genome strand ratio constraint. Model-predicted dynamics of the (C) RNA genome strand ratio and (D) particle-to-PFU ratio following calibration including the RNA genome strand ratio constraint. The red dashed line in C corresponds to the goal ratio used in the constraint. Histograms, means, and credible intervals are determined from a weighted ensemble of 2,500 parameter sets obtained from PMC.
When fit to the data without additional constraints, the model captures most features of the experimental dataset (Fig 2A, top row), including the timing and amplitude of the nonstructural (nsP) and structural (sP) polyprotein production (left), the synthesis of the genomic (posRNA) and subgenomic (sgRNA) RNA (middle), and PFU production (right). However, the predicted value of the RNA genome strand ratio (posRNA divided negRNA) is much less than one (Fig 2B, top row, middle panel), whereas some previous experiments with arthritogenic alphaviruses have found values in the range of 10–20 [70,71]. Using strand-specific RT-qPCR, we attempted to directly measure the kinetics of the negRNA, but this data was excluded due to a lack of sensitivity. We therefore sought to improve our model calibration strategy by incorporating an additional constraint to reflect a higher RNA genome strand ratio.
To constrain model behavior to be consistent with experimental observations, we implemented a penalty for parameter sets that produced a cumulative strand ratio below 20. Details of the implementation are provided in Materials and methods. Addition of the strand ratio constraint moderately increased the error in the fits (Fig 2B, left) while retaining most features of the experimentally-observed dynamics (Fig 2A and S2 Fig). The increased fit error arises largely from the model overshooting the observed posRNA (genome) concentration at 9 and 13 hpi (Fig 2A, middle). However, the constraint brings the predicted RNA genome strand ratio into the experimentally observed range as over 90% of the resulting ensemble has a cumulative ratio above 10 (Fig 2B). This shift in the strand ratio is achieved mostly through terminating negative-strand RNA synthesis by 5 hpi, when progeny particles are first produced, in conjunction with a modest increase in production of posRNA at the later times (Fig 2A, middle). The predicted RNA genome strand ratio is highly dynamic throughout infection, exhibiting three phases (Fig 2C). The first stage occurs within the first hour as the infecting particles are binding, entering the cell, and uncoating to release posRNA, which results in a high strand ratio since there is no negRNA. From 1 to 4.5 hpi the ratio steeply declines as the viral replicase proteins are produced and begin negative-strand synthesis, causing the RNA genome strand ratio to fall briefly below one. negRNA dominance is short-lived, however, because its synthesis is shut off at about 4.5 hpi while the rate of posRNA synthesis increases, leading to the final phase of increasing strand ratio.
We next examined how the addition of the RNA genome strand ratio constraint affected the infectivity of progeny particles, which is quantified by the particle-to-PFU ratio: the total number of progeny particles divided by the number of PFU. When calibrating solely to the experimental data, PFU production generally outpaces VLP production (Fig 2A), resulting in a low average cumulative particle-to-PFU ratio of about 1.2 (Fig 2B, top row, right panel). The introduction of the RNA genome strand ratio constraint increases the average cumulative particle-to-PFU ratio to 6.9 (Fig 2B, bottom row, right panel), which is low compared to other positive-sense RNA viruses [72–75]. However, the low particle-to-PFU ratio is consistent with studies using alphaviruses that have been adapted to use heparan sulfate (HS) for entry [76–78], likely reflecting natural HS binding by EEEV [19]. Inclusion of the RNA strand ratio constraint increases the particle-to-PFU ratio by increasing the VLP assembly rate constant (Step 32, r_vlp_assemble) and decreasing the PFU assembly rate constant (Step 31, r_assemble) (S2 Fig). Together, these changes decrease the amount of posRNA packaged into PFUs, which increases levels of free posRNA in order to satisfy the RNA genome strand ratio constraint. Finally, the model predicts that the particle-to-PFU ratio peaks early during the course of infection and then decreases (Fig 2D), which is consistent with the experimental finding that infectivity increases over the course of infection with Sindbis alphavirus [76,79,80].
A notable benefit to Bayesian inference methods is the ability to quantify the uncertainty of model parameter values, often referred to as parameter identifiability, and model predictions using the posterior distributions of the estimated parameters, as we have already shown in Fig 2. The narrower the posterior distribution of a model parameter, the more identifiable it is. S1 Fig shows the individual (marginal) posterior distributions of the estimated parameters (26 model parameters and 2 scaling factors) across ten independent runs when calibrating with or without the RNA genome strand ratio constraint. In both cases the posterior distributions are well-converged across the 10 runs (S3 Fig and S4 Fig). The posterior distributions demonstrate that including RNA genome strand ratio constraint improves parameter identifiability, which is seen in the narrowing of the posterior distributions for a substantial number of the model parameters. The overall number of parameters varying over three orders of magnitude or more drops from 19 (68%) without the constraint to 14 (50%) with the constraint. Moreover, with the introduction of the RNA genome strand ratio constraint, several biologically important and potentially observable parameters including r_endo, r_replicase_cleave, and ribosome_distance, become well-constrained. Additionally, the vast majority of model parameters are not significantly correlated when using the RNA genome strand ratio constraint, as shown in S5 Fig. Taken together, the RNA genome strand ratio constraint effectively holds model behavior to be consistent with experimental observations while simultaneously improving parameter identifiability. From the resulting (posterior) ensemble of 2,500 parameter sets, we are able to identify and analyze robust behaviors of the model and characterize quantitatively key relationships between model parameters that arise from specific biological mechanisms.
RNA strand synthesis is tightly regulated by the viral replicase
With a calibrated model that captures key characteristics of EEEV replication, we next focused on the kinetics of viral RNA synthesis, which is controlled by the viral replicases, P123-4 and C1234. Fig 3A shows that the calibrated model predicts replicase forms rise sharply between 3 and 4 hpi, but P123-4 (Fig 3A, red dashed line) declines steadily afterwards while C1234 reaches a high level and even continues to increase at a slower rate (Fig 3A, blue solid line). Except for a narrow window around 4–5 hpi, C1234 is by far the more prevalent replicase form, suggesting that the timing of negative strand synthesis is tightly regulated. The kinetics of the transcription fluxes (Fig 3B) reflect the dynamics of the replicase complexes. A surge in synthesis for all viral RNA occurs around 4 hpi, shortly after the jump in replicase concentrations. The production flux of negRNA briefly surpasses the posRNA production flux between 4–5 hpi but drops off rapidly afterwards. Moreover, within the same time window, the sgRNA becomes the dominant output of viral transcription (Fig 3B, light blue dashed line). Taken together, we see that negative-strand synthesis occurs within a narrow time window during which the intermediate P123-4 form of the replicase is prevalent. Afterwards, EEEV transcription switches to generation of positive-sense products in the form of posRNA, which can be directly packaged into viral particles, and sgRNA, which undergoes translation to produce structural proteins that make up viral particles.
(A) Model kinetics of the replicase complex at different functional stages: negative-sense competent (P123-4, dashed red) and positive-sense competent (C1234, solid blue). (B) RNA transcription fluxes for negRNA (red), posRNA (solid dark blue), and the sgRNA (dashed light blue). (C) Scatter plot of ensemble promoter affinities for the posRNA (ka_transcribe_4pos) and sgRNA (ka_transcribe_4sg) sites on the negative strand and (D) histogram of the ratio (subgenome/genome) of these affinities. (E) Kinetics of fraction of genome located in replication spherules (solid orange) and cytoplasm (dashed black). (F) Kinetics of fluxes associated with viral genome for association with host ribosome (Step 20) (dashed pink), association with P123-4 (Step 8) (dash-dot light blue), and translocation out of the RS (Step 21) (dotted dark green). (G) Histograms of the ratio of translocation flux to ribosome association flux at 0 hpi (black) and 24 hpi (light gray). (H) Scatter plot of ensemble rate constants for genome translocation out of the replication spherule to the cytoplasm (Step 21, r_rna_out) versus the rate constant for translation initiation (Step 20, ka_translate), respectively. Histograms, means, and credible intervals are determined from a weighted ensemble of 2,500 parameter sets obtained from PMC. The transparency of individual samples in scatter plots is proportional to their corresponding importance weights. In Panels C and E the r value displayed is the Pearson correlation coefficient between the quantities shown on the x and y axes, respectively.
We used the parameter set ensemble generated by PMC to determine whether the relative promoter strengths for production of genome (ka_transcribe_4pos) and subgenome (ka_transcribe_4sg) from negative-sense strand could be inferred from model calibration. The promoter affinities, which were independent parameters in the model, are not individually well constrained (S1 Fig), but their ratio is highly correlated (Fig 3C and S3 Fig, S5 Fig) with a mean value of about in favor of the subgenomic promoter (Fig 3D). This result suggests that the affinity of C1234 for the subgenomic promoter needs to be about five-fold higher than its affinity for the genomic promoter in order to generate the experimentally observed ratio of sgRNA to posRNA of about 2–10 (Fig 2A).
EEEV allocates its genome for efficient assembly of infectious virions
A challenge specific to positive-sense RNA viruses is to maintain a distribution of posRNA to various replication processes in order to generate infectious particles efficiently, which are comprised of both transcriptional and translational products. posRNA is allocated over the following reaction fluxes: (1) transcription to produce nascent negRNA; (2) translation to produce the nsPs, P123 and P1234, which are required for viral transcription; and (3) translocation out of RS into cytoplasm for assembly into progeny PFUs.
To elucidate how model parameter estimates influence the distribution of the genome to various replication processes, we first examined the subcellular localization dynamics of the total genome in the cell (Fig 3E). In particular, genomes located in the RS correspond to transcription and translation while genomes in the cytoplasm indicate translocation for assembly. Following viral entry, the majority of the genome is located in the cytoplasm until 2.5 hpi. As the RS forms and the replication machinery is initially translated, genome allocation switches briefly to the RS as progeny copies of genomes begin to be synthesized around 4 hpi, but by 5 hpi the majority of genome copies has moved into the cytoplasm. By 10 hpi almost no genome copies remain in the RS.
Next, we investigated the specific fluxes of transcription, translation, and translocation as shown in Fig 3F. Prior to 4 hpi, the posRNA is primarily allocated to translation (Fig 3G, pink dashed curve). An abrupt increase in posRNA synthesis starting around 2 hpi (Fig 3B) drives increases in all three allocation fluxes. At about 4.5 hpi, translation drops abruptly due to decline of P123-4 (Fig 3B) but translation and translocation continue to rise. Transcription briefly dominates before dropping off, while translocation continues to increase and becomes dominant. The change of the translocation to translation ratio from very low early on to very high later during infection (Fig 3G) further demonstrates the dramatic nature of the shift that occurs. The strong correlation between the rate constants governing translocation and translation (Fig 3H and S3 Fig, S5 Fig) demonstrates that the balance between these processes must be tightly constrained in order to generate the observed viral replication dynamics.
To summarize, as replication is initially established, the genome is primarily used to synthesize the nonstructural polyproteins, but this allocation abruptly shifts around 4 hpi, when the genome is primarily translocated to the cytoplasm for assembly. Overall, there are three phases evident in Fig 3E–3F describing the allocation of viral genome to replication processes: (1) translation-dominant; (2) transcription-dominant; (3) assembly-dominant. During phase (3) the sgRNA takes over the translation role for viral proteins, signaling a switch from nonstructural to structural protein production.
High ribosome density on the viral genome is critical for rapid negative-sense template strand synthesis
Calibration of our model yields a posterior distribution of the distance between ribosomes on viral RNA (ribosome_distance) that is narrowly distributed about a mean value of 212 nucleotides (Fig 4C and S1 Fig), which is near the lower bound measured in eukaryotic cells [57]. This suggests that high ribosome density (i.e., low ribosome_distance) is necessary to achieve the experimentally observed replication kinetics (Fig 2). The average polysome compositions are calculated by dividing the respective RNA length by ribosome_distance. Thus, the model predicts that there are an average of 55 and 19 ribosomes bound to a strand of the posRNA and sgRNA, respectively. In the first 4 hpi, the available host ribosome pool does not exhibit substantial depletion, and posRNA and sgRNA do not compete for ribosomes. From 4 hpi onward, however, ribosomes undergo rapid depletion and bind predominantly to sgRNA (Fig 4A). Interestingly, translation of posRNA spikes sharply at 4 hours, falling off rapidly thereafter and mirroring its short-lived period of high concentration in the RS (Fig 3E). On the other hand, synthesis of the structural polyprotein using the sgRNA sharply increases until 4 hpi and is sustained for the duration of infection. Ultimately, it is rate limited by the total amount of host ribosomes available to the virus. Taken together, EEEV can rapidly utilize host ribosomes by densely packing ribosomes on the sgRNA to drive synthesis of the structural polyprotein and switch off the production of nsPs.
(A) Free versus bound ribosome dynamics over 24 hpi. Ribosomes are in one of three possible states: free (dotted gray), genome-bound (posRNA, solid dark blue), or subgenome-bound (sgRNA, dashed light blue). Parameter scans of (B) the initial pool of ribosomes and (C) polysome composition as defined by ribosome_distance on infectious particle production (PFU, dashed black) and RNA genome strand ratio (dash-dot blue). PFU production is defined as the area under the respective species curve (AUC) after 24 hpi. For panel B, the vertical dotted red line represents the nominal value of the corresponding parameter (). For panel C, the estimated distribution of ribosome_distance after model calibration is shown in red. For all panels, the mean and 95% credible interval from the posterior samples are represented by the lines and shaded regions, respectively.
To further probe the relationship between host ribosomes and EEEV replication dynamics, we varied the parameters Ribo_0 (Fig 4B), which is the fixed, initial pool of available ribosomes, and ribosome_distance (Fig 4C) to assess their influence on PFU production. In order to quantify the uncertainty of our analysis, parameter scans of these two parameters were performed by varying the corresponding parameter over a range for each member of the parameter set ensemble. Fig 4B shows that the model predicts a sharp threshold of about 105 available ribosomes is required for significant PFU production, which is just slightly lower than the fixed value used in the model calibration.
On the other hand, total PFU production is insensitive to polysome composition (Fig 4C, black dashed line), which is unexpected given how well-constrained the posterior distribution is for the corresponding parameter, ribosome_distance, after model calibration (Fig 4C, red bars). A further investigation revealed that ribosome_distance is primarily constrained by the RNA genome strand ratio, which remains within the experimental range only at the smallest values of ribosome_distance allowed (Fig 4C, blue dashed dot line).
To identify the origins of this behavior, we decreased the ribosome density by increasing ribosome_distance five fold over the parameter set ensemble. Decreasing the ribosome density in polysomes delays the time to host ribosome depletion by roughly 2-3 hours compared to the original model prediction, but, perhaps counterintuitively, lowers the final steady state number of free ribosomes (Fig 5A). posRNA production exhibits a similar delay in onset (Fig 5B), but becomes similar at later times. In contrast, decreased ribosome density generates increased steady state nsP levels (Fig 5C) by a factor of approximately 3. The increase of nsP production in turn allows for about an order of magnitude more negRNA to be generated (Fig 5D) compared to the original ribosome density, which explains the decrease in RNA genome strand ratio seen in Fig 4C. Fig 5E shows that sgRNA binding to ribosomes is significantly delayed, which expands the window of opportunity for the genome to be bound and translated by host ribosomes to produce nsPs and accounts for the increased production of nsP seen in Fig 5C. Fig 5F shows that PFU production is more modestly affected compared with the RNA genome strand ratio. In summary, decreased ribosome density in viral translation delays the kinetics of viral replication in the model through the interplay of transcriptional and translational mechanisms. Slower translation to produce replicase introduces a delay in the kinetics of transcriptional products but also extends the window for translation of the genome to occur. This leads to generation of more nsPs and ultimately more negRNA, which accounts both for a moderate increase in the amount of PFU produced and a more dramatic decrease in the RNA genome strand ratio.
The predicted dynamics of (A) free host ribosomes, (B) total genome (posRNA), (C) nonstructural polyprotein luminescence (nsP RLU), (D) negative-sense template (negRNA), (E) subgenome-bound polysome (TC:sg), and (F) infectious progeny particles (PFU) after increasing the average distance between ribosomes in a polysome (ribosome_distance) 5-fold over the parameter set ensemble. For all panels, the mean of the original trajectories and after a 5-fold increase in ribosome_distance are represented with solid and dashed lines, respectively. Shaded regions represent the 95% credible interval from the parameter set ensemble ().
One of three post-translational modifications to the structural polyprotein is rate-limiting
A potential benefit of Bayesian estimation methods is the ability to capture multiple regimes of parameter combinations that fit the experimental data to identify distinct possible mechanisms that can account for the observed dynamics. To search for such mechanisms, we performed hierarchical clustering on model species trajectories (see Materials and methods) and identified three distinct clusters of trajectories that do not differ significantly in their PFU production (Fig 6A and B) but differ in their distributions of the parameters describing the processing steps of the structural polyproteins through the secretory pathway, Steps 27-29 (Fig 6C–6E). Each cluster is characterized by one of three rate parameters, r_pro_translocate, r_pro_process, and r_pro_cleave, being significantly lower in value and more tightly-constrained, while the remaining two parameters have higher values distributed over a broader range. The corresponding reactants can be distinguished by cluster (Fig 6F–6H). Specifically, Clusters A (dark blue), B (orange), and C (light green) correspond to the build-up of either the structural polyprotein without capsid (PE26KE1), the translocated polyprotein in the endoplasmic reticulum (PE26KE1-ER), or the precursor spike proteins (PE2-E1), respectively. For each cluster, the concentrations of one of these model species reaches a steady-state level significantly and uniquely higher compared to the other clusters’ respective species trajectories (Fig 6G–6I). Taken together, this suggests that either structural polyprotein translocation to the endoplasmic reticulum, polyprotein post-translational modifications in the secretory pathway, or cleavage of the spike proteins is rate-limiting and critical for fitting the experimental data.
(A) The estimated ensemble parameter sets (n=2,500) were hierarchically clustered using all model species trajectories over 24 hpi. (B) Principal components analysis of estimated parameter sets based on all model species trajectories. The shade of each sample corresponds to the PFU production (PFU AUC), which is defined as the area under the curve of the PFU trajectory. (C) Schematic of the corresponding model rate rules (Steps 27-29) from Fig 1. (D)–(F) Estimated ensemble distributions, by cluster, of model parameters involved in post-translational protein processing: r_pro_translocate (Step 27), r_pro_process (Step 28), and r_pro_cleave (Step 29). (G)–(I) Cluster trajectories of the associated protein product of the corresponding rate rules with a rate shown in panels (D)–(F). Mean (line) and 95% credible intervals (shaded) for each cluster are plotted. Panel C was created using Biorender.com.
Transcription speed and positive-sense promoter affinities are major factors driving infectious particle production
We next sought to identify critical model parameters that drive PFU production over the short infection period. For this, we employed a linear regression-based sensitivity analysis (LRSA) method (Fig 7A, see caption and Methods for additional details). Originally, we generated 100,000 parameter set samples using log-uniform Latin hypercube sampling of the model parameters from the model calibration bounds (see S1 Table). However, the vast majority of the sampled parameter sets (99.99%) resulted in PFU production levels below 100 particles, rendering this method impractical for further analysis (S6A Fig). As an alternative, we utilized the best-fitting parameter set from our parameter set ensemble as the reference point for sampling, varying each parameter by a factor of 10 in each direction. This approach results in the retention of 44% of the 10,000 generated samples after the filtering out samples that do not yield significant PFU production (Fig 7B and S6B Fig). We then performed linear regression to approximate the PFU production (y) as a linear function of the parameter values (
),
(A) An overview of LRSA in the context of EEEV replication. To understand the relationship between the model parameters and viral production, (1) 10,000 parameter sets are generated and (2) subsequently simulated. The viral production for each sample is calculated as the area under the curve of the infectious particle trajectory. (3) Parameter sets yielding significant PFU production are selected. (4) The remaining parameter sets are used as features in a linear regression to predict their corresponding PFU production as defined by the PFU area under the curve (AUC) after 24 hpi. (B) Linear regression performance when fitting globally-sampled parameters sets to their respective PFU AUC. (C) The sensitivity coefficients of each model parameter (n = 26). A positive (blue) sensitivity coefficient indicates a positive correlation with PFU production while a negative (red) coefficient indicates a negative relationship.
where each coefficient ci, estimated by linear regression, represents the linear sensitivity of the output y with respect to the feature xi, and is the error.
Overall, the linear regression model predicts PFU production with an R2 = 0.685 (Fig 7B). Several parameters involved in transcription emerge as strongly affecting progeny PFU production as indicated by their sensitivity coefficients (Fig 7C). The parameter with the highest, positive sensitivity coefficient is the rate constant for transcription, r_transcribe. Other critical parameters include the relative affinities of the promoters on the negative-sense template for posRNA (ka_transcribe_neg_4pos) and sgRNA (ka_transcribe_neg_4sg), whose sensitivities have opposite signs. Since the subgenome promoter affinity has a negative coefficient while the genome promoter affinity has a positive one, the sensitivity analysis suggests that a shift favoring posRNA synthesis relative to sgRNA will ultimately lead to increased PFU production. Based on the competition for ribosomes between the posRNA and sgRNA as shown in Figs 4 and 5, an increase in posRNA production relative to the sgRNA will increase nsP synthesis. By increasing the amount of nsPs produced, more negRNA is also synthesized before the replicases irreversibly shift to positive-sense RNA synthesis. Ultimately, this culminates in more posRNA synthesized from negRNA to be available to be packaged into progeny PFUs.
Another model parameter positively associated with PFU production is the processing rate of the structural polyprotein in the endoplasmic reticulum resulting in the precursor spike protein (rna_pro_cleave, Step 28). However, given the complex relationship with other structural polyprotein processing parameters, we sought to further probe the relationship between these correlated parameters with progeny PFU production. We performed LRSA for each of the three clusters identified in Fig 6 by globally sampling around the cluster’s best posterior sample. The sensitivity coefficients for all model parameters other than the rate-limiting parameter remain similar among all clusters (S6C Fig). For each cluster, the rate-limiting parameter for that cluster (r_pro_translocate, r_pro_cleave, or r_pro_process) had the highest positive sensitivity coefficient, while the other non-rate limiting parameters had no significant relationship to PFU production. In summary, the LRSA results support the finding from Fig 6 that post-translational processing of the structural polyprotein contains a rate-limiting step critical for PFU production.
Discussion
We constructed a mechanistic model of the Eastern equine encephalitis virus (EEEV) replication cycle within a mammalian host cell to better understand the replication dynamics resulting in the characteristic rapid production of progeny particles. We encoded the alphavirus-specific steps of intracellular replication into a series of interpretable rate rules starting with the attachment of a viral particle to a host attachment factor leading to the release of progeny particles differing in infectivity (Fig 1). Since the kinetic rate constants governing most of the processes considered in the model have not been directly measured, we utilized a Bayesian inference method to estimate them based on experimental measurements of EEEV infection kinetics during the first 13 hpi. To match experimental observations, we also incorporated a biological constraint into the model calibration process that favors synthesis of the positive-sense genome over the negative-sense template strand (i.e., a high RNA genome strand ratio). The resulting calibrated model captures key features of an experimental dataset generated from EEEV infection of mammalian fibroblasts. Further, the model suggests that the target RNA genome strand ratio of 20 is not reached until late in infection (Fig 2). In addition, the relative synthesis rates of the full-length genome, negative-sense template strand, and the subgenome are tightly-regulated by (1) a rapid transition of the replicase from its negative-sense to positive-sense transcribing forms, and (2) a constrained ratio between the genome and subgenome promoter affinities (Fig 3). Regression-based sensitivity analysis confirmed that the promoter affinities for the genome and subgenome strongly affect the amount of PFUs produced and also identified the importance of the overall transcription rate (Fig 7). Dynamics predicted by the model also reveal a constrained replication strategy that relies on allocating the majority of progeny copies of the genome to the cytoplasm for assembly. Finally, we found that the number of ribosomes in a single polysome is critical for establishing the RNA genome strand ratio, because a reduction of ribosome density leads to greater production of the negative-sense template strand and delayed production of infectious viral particles (Figs 4 and 5).
Initial calibration to the experimental dataset alone yielded an incorrect prediction, suggesting that the kinetics of the template strand followed those of subgenomic RNA, which reaches concentrations 2-fold or greater compared to the genome (Fig 2A). Constraining the model to match the experimentally-observed dominance of posRNA to negRNA only slightly decreased the quality of the fit to the experimental data (Fig 2A), while increasing the identifiability of a number of the model parameters, including ribosome_distance, which controls the number of ribosomes that simultaneously translate viral RNA. These results suggest the model is able to reproduce the observed kinetics of viral replication under the constraint of holding negative-strand RNA to low levels, which reflects empirical evidence and may be required for the virus to evade host immune responses [81]. The RNA genome strand ratio described by the model is highly dynamic as it sharply declines initially, briefly falling below 1 around 4 hours, and then rises rapidly thereafter (Fig 2C). Recent experiments measuring single-cell replication dynamics with the Sindbis alphavirus have similarly found that both positive and negative strands are synthesized at similar rates early in infection but shift towards positive-sense synthesis at later times [82].
Our model predicts that the termination of nonstructural polyprotein synthesis in combination with the rapid transition of the replicase form from negative-sense to positive-strand synthesis drives temporal variation of the RNA strand ratio dynamics. The model prediction suggesting that the replicase form responsible for negative-strand synthesis is short-lived is consistent with experimental observations in other alphaviruses [83]. Further, the model infers that negative-strand synthesis peaks at 4 hpi, which coincides with a rapid increase in the production of the positive-strand. The rapid increase in positive strand around 4 hpi has been recorded in the context of arthritogenic alphavirus infection [82–86]. Future experimental studies in combination with the incorporation of the innate immune response in the model, as described below, could help to further elucidate both the kinetics of and driving mechanisms for RNA strand asymmetry during alphavirus replication.
The model was constructed based on mechanisms applicable to any pathogenic alphavirus, and is thus easily adaptable through repeated model calibration. As a specific example, Semliki Forest Virus (SFV) lacks the opal termination codon between nsP3 and nsP4 [87]. As a result, SFV only produces the full-length nonstructural polyprotein (P1234), which can be modeled by setting a single model parameter () to zero. In a broader context, even though alphaviruses bind to many different attachment factors [88,89], the model structure would not need to change to accommodate these differences; only parametric changes including the initial amount of unbound receptors (init_EU) or the affinity of the virus particle to the respective host factor (r_bind) would be necessary. Thus, the model serves as a foundation to study the replication dynamics of any alphavirus that targets mammalian cells.
The model does, however, make a number of assumptions that might be productively addressed in future work. First, we describe transport of viral RNA from the replication spherule to the cytoplasm as a first-order kinetic process independent of any viral or host proteins. However, recent structural studies have shown that copies of nsP1 form a ring structure at the neck of replication spherules, suggesting a role for molecular trafficking of viral RNA [43,90,91].
Second, the model assumes that translation initiation is mechanistically and kinetically identical when using the genome or subgenome as the template. However, studies have shown that alphaviruses have developed mechanisms that accelerate translation initiation on the subgenome in order to circumvent host cell stress responses and viral shut-down of host translation (reviewed in [92]). Further, the intracellular location and timing of translation may differ between the genome and subgenome, which may also affect ribosome recruitment [70,92].
Third, the competition for ribosomes between the host and virus is incompletely modeled. Generally, alphaviruses siphon off ribosomes from host translation using a two-pronged approach: (1) shut down of host transcription and translation and (2) recruitment of ribosomes to viral RNA [56,93–101]. Experiments investigating the subcellular location and composition of viral translation complexes would allow validation of model predictions such as high ribosome density on viral RNA and allocation of host ribosomes by the subgenome. Importantly, experiments could be used to test the feasibility of possible therapeutic strategies based on diminishing translation of the viral subgenome. In addition to host ribosomes, many other host factors play important roles in the propagation of alphavirus infection, including viral entry [88,102,103], RNA replication [104–108], and protein processing [109]. In the future, their inclusion in the model could help to elucidate the complex interplay between the host and virus that determine the outcome of infection.
In particular, the model currently simplifies structural polyprotein processing by consolidating the cleavage, folding, and trafficking of the structural proteins (except capsid) into a single step that does not include host proteins (Step 28). Additionally, the model does not explicitly include frameshifting of the 6K protein to produce the TF protein, which has been shown to play a role in alphavirus assembly [110–112]. Combined with experimental studies examining the dynamics of structural post-translational processing, further refinement of the model to include the details of these individual processes would provide the opportunity to determine specifically which post-translational mechanism is rate-limiting for infectious particle production.
Given the speed of EEEV replication, the adaptive immune response is often too slow to prevent infection from having catastrophic consequences for the host. Instead, the major determinant of acute EEEV infection outcome is the timing and amplitude of the innate immune response through the induction of Type I interferons (IFNs) [81,113–116]. EEEV and other alphaviruses have developed several mechanisms to antagonize the IFN response through the suppression of host translation and transcription, as well as other strategies [93,113,117]. Although many of the mechanisms of interaction between the host and virus have been identified, the underlying dynamics and relationship to infection outcomes remain poorly understood, particularly in the context of EEEV infection. The model presented here serves as a starting point to probe host and viral antagonism through the future incorporation of the innate immune response.
Materials and methods
Mechanistic model of alphavirus replication
Model construction.
The model is written in the BioNetGen Language (see S1 Table and S1 File) [37] and converted into the Systems Biology Markup Language Format (see S2 File) [118] for simulation using the RoadRunner Simulation Engine, which allows efficient compilation and integration of the ordinary differential equations generated by the model rules [119,120]. The model is composed of 60 mass-action rate rules and 36 individual species. For each reaction defined by a mass action rate rule, the rate of the reaction is given by the product of a rate constant parameter and the concentration of each reactant species. Overall, there are 36 model parameters: 10 are fixed based on published literature and 26 are estimated (S1 Table). Structurally, the model was designed to include replication mechanisms common in alphaviruses. However, given that the model is calibrated to EEEV data, we describe the model mechanisms and corresponding parameters below in the context of EEEV infection. In the detailed description of the model processes below, the step numbers refer to those listed in the Rate Rules tab of S1 Table.
Attachment, entry, and uncoating.
Infection begins when a functional EEEV particle (nV) binds to an external unbound host attachment factor (R_EU) with rate constant r_bind (Step 1). We assume the initial number of viral particles is equal to the experimental multiplicity of infection (MOI), which is 5. EEEV particles predominantly bind to heparan sulfate, a ubiquitously-expressed proteoglycan [21], which we assume is in significant excess over viral particles by setting . The viral bound-host attachment factor (R_EB) enters the cell with a rate constant r_endo (Step 2) via a clathrin-mediated endocytic pathway, forming an intracellular endosome (R_IB) [89,121].
The next two rate rules encompasses multiple uncoating processes that serve to establish viral replication (Steps 3A and 3B). The four processes that occur during uncoating are: (A) endosomal compartment acidification, which leads to membrane fusion of the viral lipid envelope with the endosomal membrane and release of the nucleocapsid into the cytoplasm [122]; (B) nucleocapsid core disassembly to release the viral genome; (C) binding of viral genome by host ribosomes; and (D) formation of replication spherules (RS), which positive-sense RNA viruses employ to isolate and concentrate viral replication machinery through the rearrangement of host cell membranes [123]. Most alphaviruses including EEEV create these spherules using the lipid membrane from host endo-lysosomal membranes to form Type I-II cytopathic vacuoles [44,124,125]. The net reaction carried out by Steps 3A and 3B is the unbinding of the viral particle from host attachment factor to form unbound receptor (R_IU) and the genome translation complex (translate_complex, labeled TC:g in Fig 1). The process is split into two reactions in order to model the kinetics of polysome complexes that involve multiple ribosomes bound to a single viral genome, as described in more detail below. Although RS do not appear explicitly in the model species or rules, events described by rules 5–20 involve species that are confined to RS except for the host ribosomes and subgenomic RNA. In summary, processes A-D outlined above are combined into a single rate rule with a corresponding rate constant, r_uncoat. Lastly, unbound, intracellular heparan sulfate is recycled to the extracellular surface defined by the rate constant r_recycle (Step 4).
Translation and polysomes.
The EEEV genome (posRNA) and subgenome (sgRNA) are used as templates for translation of the nonstructural (P123 and P1234) and structural (s_poly) polyproteins, respectively [54]. We assume that there is a fixed pool of free host ribosomes (host_ribo) available to the virus upon initial infection (BNID: 106861, [58]). An important component of translation regulation is polysome formation, in which multiple ribosomes bind to a single RNA strand to amplify protein synthesis. The number of ribosomes in a polysome has been shown to increase with the length of RNA being translated [57]. To capture this effect, we used a parameter, ribosome_distance, which represents the average distance in nucleotides between ribosomes on an RNA strand, to determine the average number of ribosomes in polysome complexes.
The model assumes that polysomes form in a single step, which is equivalent to assuming that all translation occurs from the steady state polysome complex as previously described in [24]. The net reaction to form a polysome is:
In BioNetGen the kinetics of polysome formation are split into two rules, which in generic form are:
where TC represents the resulting translation complex, ka is the forward binding rate constant for ribosome–RNA binding, length_RNA is the RNA length in nucleotides, and ribosome_distance is the average distance in nucleotides between ribosomes bound to the viral RNA. Each rule generates differential equation terms affecting only species that undergo a net change in stoichiometry between reactants and products: RNA and TC in rule A and ribo in rule B. As a result, the differential equation terms arising from the these two rules are:
where square brackets indicate concentrations of the associated species. These rate rules and corresponding differential equations ensure that the translation complex species have the stoichiometries defined in Eq 2. Specific implementations of these generic polysome complex formation rules can be found in Steps 3, 20, and 24.
There are two nonstructural polyprotein products synthesized by host ribosomes using the genome: P1234 and P123, differing in the synthesis of the RNA-dependent RNA-polymerase (nsP4). The resulting polyprotein product is determined by the readthrough of the opal codon between nsP3 and nsP4 [126]. For most alphaviruses, readthrough of this codon to produce P1234 occurs 10-20% of the time [127]. This is reflected in the model using a fixed scaling factor, f_nsp_scale, which represents the frequency in which P123 is synthesized. In summary, the rate of translation for P123 (Step 5A) is defined as:
and the rate of P1234 synthesis (Step 6A) is:
In both functions, r_translate represents the individual ribosome translation rate (in amino acids * ), length_genome is the length of the genome in nucleotides, and length_P123(4) represents the length of the resulting protein product in amino acids.
Later in the replication cycle, the subgenomic RNA is used to synthesize a structural polyprotein (s_poly), which contains the individual proteins that comprise a progeny viral particle (Step 25). To maintain stoichiometric balance during particle assembly, the structural synthesis rate is scaled to reflect the number of structural proteins () required to make a single viral particle [128,129]. Thus, translation of the structural polyprotein is defined by the following function:
where length_sg is the length of the subgenome in nucleotides and length_s_poly is the length of the structural polyprotein in amino acids.
For all protein products, it is assumed that the rate of protein synthesis scales proportionally to the polysome composition and the polysome complex dissociates from the RNA template into individual ribosomes once synthesis is terminated. In BioNetGen, translation and turnover of polysomes is described by two rate rules, which in general form are:
where translate_fun is the translation rate function defined above (e.g., translate_P123(4) or translate_struct). Similar to translation initiation, these rate rules create corresponding stochiometrically balanced differential equations terms:
The specific implementations of translation are defined in Steps 5, 6, and 25. Overall, the model assumes viral translation is localized to the replication spherule.
Transcription.
In order for transcription to commence, the nsP4 must be cleaved off the polyprotein by the nsP2 with a rate constant, r_rdrp_cleave (Step 7) [130]. Although cleaved off, all 4 nsPs remain together comprising a replicase complex, as they all play regulatory roles in RNA synthesis [46,71,131–134]. The initial complex corresponding model species is denoted as P123_4 (labeled P123-4 in Fig 1). This form of the replicase binds to the genome (posRNA) with a promoter affinity, ka_transcribe_pos, to form the transcription complex, pos_replicase_4neg, which is denoted as RC:pos_neg in Fig 1 (Step 8). This complex subsequently synthesizes the negative-sense RNA template, negRNA (Step 9).
A notable characteristic of alphaviruses is the further cleavage of the replicase complex which signals a shift from negative-sense synthesis to positive-sense synthesis [86,135–137]. Biologically, this shift occurs when first, the nsP1 is cleaved from nsPs2-3 and second, the junction between nsP2 and nsP3 is cleaved, leading to 4 individual proteins. The model combines these steps into a single rule (Step 10) because the intermediate form, where only nsP1 and nsP4 are cleaved off, is exceptionally short-lived [46]. The rate of this process is defined by the rate constant r_replicase_cleave and yields the replicase form responsible for positive-sense synthesis, C1234.
It is important to note that the model includes the binding of the free genome (posRNA) with the free, complementary template strand (negRNA) in the replication spherule, forming a double-stranded RNA intermediate, dsRNA, governed by the rate constant, r_rna_bind (Step 11) [138]. Since the nsP2 in the replicase complex has helicase activity [139,140], we assume that the negative-strand in free or duplex forms can be used as a template to synthesize positive-sense RNA products. Specifically, C1234 can initiate transcription through four mechanisms based on the form of the negative-sense template (free negRNA vs in-duplex dsRNA) and which of the two promoters it binds to. These promoters on the negative-strand correspond to the full-length genome (Steps 12, 13) and the subgenome, sgRNA, (Steps 14, 15). As a result there are four model species representing the possible transcription complexes: two for the genomic promoter, neg_replicase_4pos and ds_replicase_4pos (both denoted as RC:pos in Fig 1), and two for the subgenomic promoter, neg_replicase_4sg, and ds_replicase_4sg (both denoted as RC:sg in Fig 1). Previous studies, as well as our EEEV dataset, shown that the relative subgenome and genome production is asymmetric in favor of subgenome as the respective promoters are bound distinct sites on the RNA-dependent RNA polymerase [132,141]. We account for this in the model through the inclusion of two parameters, ka_transcribe_neg_4pos and ka_transcribe_neg_4sg, defining the genome and subgenome promoter affinities. We assume that these promoter affinities are not dependent on whether the negative-sense strand is free or in-duplex.
In general, we assume that only a single replicase is synthesizing a single strand of RNA at a time. Similar to translation, it is assumed that the transcription rate is dependent on the length of the viral RNA product. For full-length copies of the viral RNA (either posRNA or negRNA), the total rate is used for Steps 9, 16, 17 and is defined as:
and the subgenome transcription rate in Steps 18-19 is:
where r_transcribe is the replicase transcription rate (in nucleotide *), and length_genome and length_sg are the lengths of the full-length RNA (posRNA,negRNA) and the subgenome in nucleotides. We assume that all transcription complexes (replicase + viral RNA) dissociate once the RNA product has been transcribed. Lastly, unbound forms of posRNA, negRNA, and dsRNA exit the replication spherule into the cytoplasm (labeled as cyto_posRNA, cyto_negRNA, and cyto_dsRNA) in a first-order rate rule governed by model parameter, r_rna_out, as shown in Steps 21-23.
Structural polyprotein synthesis and processing.
Once synthesized, host ribosomes bind to the subgenome to form a translation complex (sg_translate_complex, labeled TC:g in Fig 1) in Step 24 and synthesize the structural polyprotein s_poly (Step 25) (see Translation section).
The following series of steps describes the processing of the structural polyprotein into functional individual proteins required for assembly into mature progeny particles [142]. First, the capsid protein auto-proteolytically cleaves itself from the structural protein (s_poly) yielding capsid protein, the remaining polyprotein, PE26KE1, and the nanoLuciferase reporter used to measure structural polyprotein production, luciferase (Step 26) [143,144]. The rate of Step 26 is defined with the rate constant, r_sp_cleave. The remaining polyprotein (PE26KE1) contains a signal sequence leading to its translocation to the endoplasmic reticulum, (PE26KE1_ER), determined by the rate constant, r_pro_translocate, in Step 27 [145–147]. The polyprotein is transported through the secretory pathway towards the plasma membrane, undergoing a series of post-translational modifications which we model in a single rate rule governed by rate constant, r_pro_process, in Step 28 [70,122,147]. The resulting product is the spike precursor heterodimer, PE2_E1. pE2 on the heterodimer is subsequently cleaved by host protein furin to produce a fully mature spike protein [109,148]. We assume that the host protein furin is not rate-limiting, and thus assume this cleavage is a first-order reaction with a rate constant, r_pro_cleave, and yields the spike protein species, E2_E1 (Step 29).
Assembly and export of progeny viral particles.
The model proposes two mechanisms of assembly leading to two products: infectious, plaque-forming units (PFU) and non-infectious virus-like particles (VLP). In the model, the key difference between VLPs and PFUs is the presence of the viral genome. Mechanistically, the capsid protein preferentially binds to free genome in the cytoplasm (posRNA_cyto) yielding species, rna_capsid [62,149]. The corresponding rate is governed by the rate constant, r_capsid_rna_bind, as defined by Step 30. In the model, either free capsid or RNA-bound capsid will co-localize the spike proteins near the plasma membrane in preparation for the budding of progeny VLPs and PFUs, respectively. We assume this occurs at different rates for PFUs and VLPs as specified by r_assemble (Step 31) and r_vlp_assemble (Step 32), and ultimately yields the assembled model species assembled and assembledVLP. Lastly, the model assumes that both types of progeny particles bud out of the cell governed by the same rate constant r_export to produce progeny model species PFU and VLP, respectively (Steps 33, 34).
RNA, protein and progeny particle degradation.
All free viral RNA, regardless of form (i.e., single-stranded vs duplex) or location (i.e., RS vs cytoplasm), undergo first-order degradation (Steps 35–41 in S1 Table) governed by model parameter r_rna_degrade. Similarly, all free viral proteins have first-order degradation (Steps 42–52 in S1 Table) defined by parameter r_pro_degrade. Extracellular PFUs and VLPs are assumed to degrade with an identical rate constant, r_particle_deg (Step 53–54 in S1 Table).
Model calibration
Processing experimental data.
In order to apply the experimental data, which is described below in the Experimental methods section, we converted the raw values of individual replicates (n = 3 per assay) into particles/units/copy number per cell from particles/units/number per mL using a cell density of cells per mL. Additionally, we assume that since the MOI for all experimental data is 5, the number of virus particles available at t = 0 is 5. To account for the incubation period, we shift back all time points in the experimental dataset by one hour. Lastly, we exclude data collected after 12 hpi for model calibration. At 12 hpi, cytopathic events become evident [23], and since the model does not describe the effect of cell death on replication kinetics, the post-12 hpi data points are henceforth excluded. Additionally, we excluded the negative-sense EEEV RNA RT-qPCR data from model calibration due to a lack of sensitivity in the negative-sense RNA data from positive-sense RNA signal bleed over.
The model species, PFU, is directly fit to the plaque assay data. We assume that the genomic PCR dataset counts all forms of viral RNA with the genomic promoter. Thus we fit this dataset by summing the model species: translate_complex, posRNA, pos_replicase_4neg, dsRNA, ds_replicase_4pos, ds_replicase_4sg, cyto_dsRNA, cyto_posRNA,and rna_capsid. This is denoted as posRNAALL.To fit the subgenomic PCR data, the following model species are summed with posRNAALL:sg_RNA and sg_translate_complex.
For the luciferase assays measuring the viral proteins, we calibrate an additional 2 parameters, scale_nsp and scale_sp to scale protein quantities per cell to relative light units (RLU). Specifically, to fit nsP3 luminescence data, all forms of the nonstructural polyprotein, free or bound, are assumed to be included in the assay. Moreover, the following model species are summed and subsequently multiplied by the scaling factor, scale_nsp: P123, P1234, P123_4, C1234, pos_replicase_4neg, neg_replicase_4pos, neg_replicase_4sg, ds_replicase_4pos, and ds_replicase_4sg. For the structural protein luminescence, we included a model species, luciferase, that represents structural polyprotein production as it is inserted and subsequently cleaved off as shown in Fig 1 Step 26.
To calibrate the model to the experimental data, we define the goodness of fit as follows:
where n = 9 is the number of time points, .
is the mean experimental measurement and
is the standard deviation of species j at time i calculated from the triplicate data. y represents the model output of species j at time i.
Integrating biological knowledge during model calibration with an additional constraint.
Given the complexity of the model calibration (26 model parameters and 2 scaling factors to be estimated), we implemented a constraint to be used with the experimental data set during model calibration. Previous studies of other alphaviruses have demonstrated that the synthesis of the negative-sense template maximally reaches 5% of its positive-sense counterpart [70,71]. Thus, we assume the fold-difference between the genome to template is 20-fold or greater. To integrate this during parameter estimation, we calculate the total genome and template produced over 24 hpi as follows:
where and
represent the total amount of genome and template strands, respectively. For the negative-strand,
represents the sum of model species: negRNA, cyto_negRNA, neg_replicase_4pos, neg_replicase_4sg, dsRNA, cyto_dsRNA, ds_replicase_4pos, and ds_replicase_4sg.
Candidate parameter sets with a QRNA<20 are assigned the following penalty:
The RNA genome strand ratio constraint is integrated with model fit to the experimental data to form a likelihood function as follows:
where and represents the weight of the constraint error relative to the experimental data error. wRNA was manually-tuned from the following potential weights:
.
resulting in the best balanced QRNA and Cexp values.
Bounding parameters biologically.
All model parameters assume a log-uniform prior with bounds specified in S1 Table. To constrain parameters to stay biologically relevant during model calibration, we identified kinetic limits using a combination of experimental studies looking at binding kinetics as well as previously published mechanistic models of similar RNA viruses [36,57,150–155]. Numerous kinetic parameters, denoted as X, were provided in units, , and converted as follows:
which yields a conversion factor . We assume the average cell volume is
for BHK-21 cells [156,157]. The remaining unknown parameters are given wide bounds as specified in S1 Table. All calibrated parameters have log-uniform prior distributions based on the specified bounds.
Parameter estimation using preconditioned Monte Carlo.
For all downstream model calibration and analysis, we used RoadRunner to load and simulate the model [119]. To calibrate the model, we utilized the Python implementation of preconditioned Monte Carlo (PMC), pocoMC [69]. PMC is an iterative algorithm that combines Sequential Monte Carlo and Normalizing Flows that repeatedly samples and evaluates candidate parameter sets from the initial prior distribution to the target, true distribution using importance sampling with Markov chain Monte Carlo. NFs are generative models that learn a bijective mapping of a multivariate distribution to a target distribution [158]. In particular, PMC is defined by the repetition of the following three steps:
- (Re)weighing samples
- Resampling
- Mutation
PMC begins with a weighted ensemble of N samples drawn from the prior distribution and then applies importance sampling to adjust the weights of each sample based on their likelihood. The importance weight for each sample is computed as the ratio of the target posterior distribution to the proposal distribution from which the samples were drawn. It is updated iteratively to better approximate the true posterior density. The updated importance weights, along with the sample weights from the previous iteration, are used to calculate new weights for each sample. Once the weights have been recalculated, PMC proceeds by resampling the ensemble based on these updated weights. Next, the mutation step evolves the ensemble of samples independently with MCMC-based methods, like Metropolis-Hastings, to allow for local exploration of the parameter space. To improve sampling efficiency, the PMC algorithm employs NFs to map the estimated parameter distributions from the ensemble to a multivariate Gaussian distribution. It is in this learned, Gaussian space in which the mutation step is performed.
Overall, this three-step sequence is repeated iteratively. However, unlike other MCMC methods, the number of iterations is not fixed. Instead, PMC uses a metric, denoted , to assess convergence, and importantly, update the proposal distribution used for resampling. Specifically, the next
is numerically estimated from the importance weights to maintain a user-specified fraction,
, of independent samples from the ensemble (i.e., effective sample size) (see Karamanis et al., 2022 [69] for details). PMC is considered to have “converged" once
.
We use all default parameters when using pocoMC with the exception of the number of active particles and effective particles, which are both set to 2,500. The initial configurations of the particles is determined with Latin hypercube sampling of the model parameter bounds in log-uniform space. To assess convergence of PMC, the model was calibrated using 10 different initial configurations per calibration strategy (i.e., with or without the RNA genome strand ratio constraint).
Model analysis
Hierarchical clustering analysis.
We performed hierarchical clustering on posterior samples’ trajectories for every model species over 24 hpi using the scikit-learn Python library [159]. These are then used to cluster posterior samples using the Ward linkage criterion. The number of clusters was manually assessed based on the dendrogram generated by hierarchical clustering. To examine how individual clusters contribute to the sensitivity analysis results, we additionally performed the regression-based sensitivity analysis on each individual cluster based on the best parameter set which is defined by the highest likelihood as specified in Eq 11.
Linear regression-based sensitivity analysis.
To understand how the calibrated model parameters affect infectious particle (PFU) production, we employed global, linear regression sensitivity analysis. First, 10,000 parameter sets were sampled in log-uniform space using Latin hypercube sampling, yielding a matrix. We do not include the experimental luminescence scaling factors in this analysis. To sample the parameter space effectively to yield any viral production, the best parameter set in the posterior samples was used to define sampling bounds. Moreover, these bounds are defined by multiplying the nominal parameter values from the best posterior sample by a factor of 10 in each direction. Model behavior is simulated to 24 hpi for each sample and the PFU production is calculated as follows:
Samples are subsequently filtered, keeping parameter sets leading to significant viral production, which is defined as a maximum PFU concentration between 102 and 108 within 24 hpi. Linear regression is performed on remaining samples using log-transformed parameter values as features and log-transformed as output. The corresponding weights from the linear regression are the sensitivity coefficients (see Eq 1).
Experimental methods
Materials availability.
cDNA clones are the property of Dr. William Klimstra and may be available upon request. Primer sequences and assay reagents are listed in S2 Table.
Experimental model and reagents.
Cell line
BHK-21 cells
The Baby hamster kidney (BHK) cell line is highly permissive due in part to defective interferon production. The BHK C-13 (ATCC CCL-10) cells were cultured in RPMI (Corning) 10% DBS (Gibco), 10% TBP (Moltox), 1mM L-Glutamine (Corning), 1x penicillin/streptomycin (Corning). Cells were infected in PBS (Corning), 1% DBS (Gibco) in a humidified incubator with 5% CO2 at 37°C and overlaid with growth media + 0.05% immunodiffusion agarose (MP Bio).
Virus stock
EEEV FL93-939 nLuc TaV
The EEEV FL93-939, EEEV FL93-939 nLuc nsP3 fusion or EEEV FL93-939 nLuc TaV viruses were generated from a FL93-939 EEEV cDNA clone (provided by Scott Weaver, UTMB, Galveston) modified to express nanoLuciferase either as a non-cleaved fusion within the nonstructural protein 3 gene or as a cleavable fusion between the capsid and PE2 protein genes [160]. RNA genomes were synthesized in an in vitro transcription reaction with a T7 promoter RNA polymerase kit (Invitrogen) as previously described in [160]. Genomic RNA was electroporated into BHK-21 cells and viral supernatants were harvested and clarified 24 hours post electroporation followed by storage at -80°C. Viral titer was measured by BHK cell plaque assay (as in [160] and below).
RT-qPCR standards.
Full length sense and negative-sense EEEV FL93-939 genomic RNA was generated from a dual promoter cDNA clone [7]. Transcription from the SP6 (Invitrogen) promoter yielded negative-sense full-length genome while transcription from the T7 promoter (Invitrogen) yielded positive sense full length genome. RNA genomes were treated with DNase (Invitrogen) and recovered with RNeasy Cleanup Kit (Qiagen) per manufacturer protocols. Concentrations were measured with a Nano-drop spectrophotometer and RNA was made into ten-fold dilution series to serve as RT-qPCR standards. Copy numbers for RNA genomes were calculated where X = mass of amplicon and N = length of amplicon using
Virus growth curves.
BHK-21 cells were seeded into 24-well cell culture plates (Falcon) and allowed to grow overnight in standard growth media + 5% CO2 at 37 °C until they reached an average density of cells per well. Cells were infected with EEEV viruses (0.1 mL infection medium) (MOI=5) for 1 hour at 37 °C + 5%CO2and brought up to 1 mL total volume in growth media after infection. Cells were washed 3x with cell growth medium prior to lysate harvesting. Viral lysate (
cells) and supernatant RNA samples (0.25 mL) were collected into Trizol reagent (0.75 mL) (Invitrogen). For plaque assay, supernatant fractions were collected (0.1 mL) in growth medium. For nLuc assay, cell lysates (
cells) were made in passive lysis buffer (0.1 mL) (Promega). Cells were reacted with virus in 0.1 mL infection medium for one hour, then washed 3x with growth medium. Samples were collected immediately after infection (experimental t=0) and again every hour for the first six hours, followed by collections at 8, 12, 16, 20, 24 and 36 hours post-infection. All samples were collected in triplicate from separate wells.
Plaque assay.
BHK-21 cells were seeded into 6-well cell culture plates (Falcon) and allowed to grow overnight in standard growth media 37°C + 5% CO2until confluent. Cell monolayers were infected in duplicate with (0.2 mL) ten-fold dilutions of collected supernatant samples for one hour, 37°C + 5%CO2.Cell monolayers were overlaid with 2 mL growth media + 0.05% agarose (MP Bio) and incubated 48 hours 37°C + 5%CO2.Overlaid cells were stained with 2 mL neutral red solution (DPBS + 1% DBS, 1% Pen/Strep, 1x (Fisher) neutral red) overnight at 30°C + 5%CO2.Plaques were visualized and counted over a standard light source.
Protein nanoLuciferase assay.
NanoLuciferase protein expression was measured from triplicate collections of EEEV FL93 nLuc infected cell lysates by the manufacturer’s protocol (Promega) as previously described in [160] on a Biotek Synergy LX plate reader.
RT-qPCR assay.
Each 1 mL Trizol sample of infected cell supernatant or lysate was combined with 0.2 mL BCP (chloroform substitute) and 5 μL polyacryl carrier (MRC) and RNA extracted using a standard phenol/BCP (MRC) phase separation as previously described in [7] with an isopropanol precipitation and ethanol wash. cDNA synthesis was performed using the First-strand cDNA Synthesis M-MLV Reverse Transcriptase Kit (Invitrogen) protocol, including RNaseOUT (Invitrogen) [7].
To quantify the EEEV RNA genome, cDNA synthesis primers tagged with a T7 promoter ( bolded) (IDT) were used to target the nsP2 coding region on both the positive and negative strands of the viral RNA.
Positive: 5’- GCGTAATACGACTCACTATACACCGGCCAAAGTCTTCCATACTAT -3’.
Negative: 5’- GCGTAATACGACTCACTATAGCGCTACAAGGTCAATGAGAA-3’.
A similar strategy was used to quantify the EEEV RNA subgenome by targeting the capsid coding region on the positive strand with the following T7-tagged cDNA synthesis primer:
5’- GCGTAATACGACTCACTATAACAGCTCCATGATGCCAGTTA-3’.
Thermocycling conditions were performed per manufacturer’s suggested protocol (Invitrogen). cDNA was diluted 1:1 with nuclease free water (Hyclone) to 40 μL total volume and stored -80°C until further analysis. For quantification of genomic vRNA, qRT-PCR was performed using the 2x Fast TaqMan Universal PCR Master Mix, No AmpErase UNG (Applied Biosystems), following the manufacturer’s instructions. Forward primer and probe targeting EEEV positive strand nsP2 include:
forward primer 5’-GCGCTACAAGGTCAATGAGA-3’ and
probe 5’-ACGCACAGACATCTGAGCATGTGAA-3’.
Forward primer and probe targeting the negative strand include:
forward primer 5’-CACCGGCCAAAGTCTTCCATA and
probe 5’-TTCACATGCTCAGATGTCTGTGCGT-3’.
For quantification of the subgenomic vRNA, the following forward primer and probe were used to target the capsid gene:
forward 5’-GACCTTGAGTACGGTGATGTGC-3’ and
probe 5’-GTACACCAGTGACAAGCCTCCTGGC-3’.
The probes were labeled at the 5’ end with the reporter molecule 6-carboxyfluorescein (6-FAM) and quenched internally at a modified thymidine residue with Black Hole Quencher (BHQ1), with a modified 3’ end to prevent probe extension by Taq polymerase. The T7 promoter tag introduced during cDNA synthesis was used as the reverse primer for each qPCR target: 5’-GCGTAATACGACTCACTATA-3’.
Thermocycler parameters consisted of initial denaturing: 95°C for 20 seconds and cycling PCR amplification (40 cycles): 95°C for 3 seconds and 60°C for 20 seconds. Quantification of viral genomes or subgenomes was determined by comparing the cycle threshold (CT) values from unknown samples to CT values from the previously described 10-fold dilutions of in vitro transcribed sense and negative-sense EEEV RNA as described in [7].
Supporting information
S1 Table. Mechanistic model description including species, parameters, and rate rules
https://doi.org/10.1371/journal.pcbi.1013082.s001
(XLSX)
S1 Fig. Representative 1D marginal distributions for likelihood function (ℒ) and estimated parameters after calibrating the model with experimental data with and without the RNA genome strand ratio constraint.
Top left plot shows likelihood distribution as calculated by Eq 11. The remaining plots show 26 of the 36 model parameters in the BioNetGen model are calibrated as well as 2 scaling factors, nsp_scale and sp_scale, which are used to scale nonstructural and structural polyprotein concentrations to luminescence. These distributions are from the posterior ensemble of 2,500 parameter sets after calibrating the model given different set of initial particles. Specifically, the model was calibrated 10 times with the RNA genome strand constraint (blue) and 10 times without the constraint (orange). Vertical, dashed red lines indicate the parameter bounds provided during model calibration.
https://doi.org/10.1371/journal.pcbi.1013082.s003
(TIF)
S2 Fig. The model fits to the experimental data in linear space when performing parameter estimation with the RNA genome strand ratio constraint.
The posterior samples from model calibration (n=2500) trajectory weighted means (line) and 95th percentiles (shaded) are shown in all panels. The mean and standard deviation of experimental data (n = 3 replicates) are plotted from the protein luminescence assays, RT-qPCR, and plaque assay (see Materials and methods). Fits to protein luminescence data of nonstructural protein 3 (nsP) and structural protein (sP) in relative light units (RLUs) are shown in panels (A) and (B), respectively. (C) RNA dynamics of the positive-sense genome (posRNA, dark blue) and subgenomic RNA (sgRNA, light blue) are shown. (D) Model trajectories of viral particle production of plaque-forming, infectious particles (PFU).
https://doi.org/10.1371/journal.pcbi.1013082.s004
(TIF)
S3 Fig. The marginal posterior cumulative distributions of the likelihood and estimated parameters.
Model calibration to the RNA genome strand ratio constraint and experimental data was run 10 times given different initial configurations. The top left plot corresponds to the likelihood distributions (see Eq 11). The remaining 28 panels correspond to the 26 estimated model parameters and 2 scaling factors.
https://doi.org/10.1371/journal.pcbi.1013082.s005
(TIF)
S4 Fig. Pairwise Komolgorov-Smirnov statistic distributions for all model parameters across 10 independent runs of model calibration.
Using the resulting marginal cumulative distributions generated by each run of model calibration, as shown in S3 Fig, the two-way weighted Komolgorov-Smirnov statistic between each combination of runs (10 runs, 45 pairs for comparison) for the likelihood (labeled Total Error and defined in Eq 11), 26 model parameters, and 2 scaling factors. The black lines in each violin plot represent individual KS statistic for a pair of runs.
https://doi.org/10.1371/journal.pcbi.1013082.s006
(TIF)
S5 Fig. Pairwise calibrated parameter correlations.
The Pearson correlation coefficients between calibrated parameters (n = 28, 26 model parameters and 2 scaling factors) are calculated from the parameter set ensemble (n = 2,500).
https://doi.org/10.1371/journal.pcbi.1013082.s007
(TIF)
S6 Fig. Linear regression sensitivity analysis sampling approaches and results based on the clusters identified inFig 6.
(A) Sorted distribution of maximal PFU yield from parameter set samples generated from global sampling with calibration bounds specified in S1 Table. (B) Sorted distribution of maximal PFU yields of parameter set samples generated using the best posterior set sample as described in Materials and methods. The number of parameter set samples generated are 105 and 104 for panels A and B, respectively. Green shaded regions in panels A and B correspond to the filtered PFU count range for samples to be used for the linear regression. (C) Cluster-based linear regression sensitivity analysis reveals three modes of post-translational processing dynamics. The sensitivity coefficients when sampling around the best posterior parameter set from each individual cluster (n = 3) using the same method described in Fig 7A. As shown in Fig 6, the left plot corresponds to the cluster in which r_pro_translocate is the rate-limiting parameter. The center shows the resulting sensitivity coefficients when sampling parameters around the best sample cluster in which r_pro_process is rate-limiting. The right plot corresponding to the sensitivity coefficients for model parameters using the best parameter set from the cluster in which r_pro_cleave is rate-limiting.
https://doi.org/10.1371/journal.pcbi.1013082.s008
(TIF)
S1 File. The mechanistic model in the BioNetGen Language format.
Remove the .txt extension before running with BioNetGen.
https://doi.org/10.1371/journal.pcbi.1013082.s009
(TXT)
S2 File. The mechanistic model in the Systems Biology Markup Language format.
https://doi.org/10.1371/journal.pcbi.1013082.s010
(XML)
Acknowledgments
We thank the members of the Faeder research group as well as Rachel Gottschalk for useful feedback and discussions. Additionally, the first author would like to thank her canine companion, Clover, for her assistance throughout the course of this study.
References
- 1. Lindsey NP, Staples JE, Fischer M. Eastern equine encephalitis virus in the United States, 2003–2016. Am J Trop Med Hyg. 2018;98(5):1472–7. pmid:29557336
- 2. Deresiewicz RL, Thaler SJ, Hsu L, Zamani AA. Clinical and neuroradiographic manifestations of eastern equine encephalitis. N Engl J Med. 1997;336(26):1867–74. pmid:9197215
- 3. Ronca SE, Dineley KT, Paessler S. Neurological sequelae resulting from encephalitic alphavirus infection. Front Microbiol. 2016;7:959. pmid:27379085
- 4. Reed DS, Lackemeyer MG, Garza NL, Norris S, Gamble S, Sullivan LJ, et al. Severe encephalitis in cynomolgus macaques exposed to aerosolized Eastern equine encephalitis virus. J Infect Dis. 2007;196(3):441–50. pmid:17597459
- 5. Porter AI, Erwin-Cohen RA, Twenhafel N, Chance T, Yee SB, Kern SJ, et al. Characterization and pathogenesis of aerosolized eastern equine encephalitis in the common marmoset (Callithrix jacchus). Virol J. 2017;14(1):25. pmid:28173871
- 6. Phelps AL, O’Brien LM, Eastaugh LS, Davies C, Lever MS, Ennis J, et al. Aerosol infection of Balb/c mice with eastern equine encephalitis virus; susceptibility and lethality. Virol J. 2019;16(1):2. pmid:30611287
- 7. Albe JR, Ma H, Gilliland TH, McMillen CM, Gardner CL, Boyles DA, et al. Physiological and immunological changes in the brain associated with lethal eastern equine encephalitis virus in macaques. PLoS Pathog. 2021;17(2):e1009308. pmid:33534855
- 8. Morens DM, Folkers GK, Fauci AS. Eastern equine encephalitis virus - another emergent arbovirus in the United States. N Engl J Med. 2019;381(21):1989–92. pmid:31747726
- 9.
Centers for Disease Control and Prevention. ArboNET: National Arbovirus Surveillance System; 2024. https://www.cdc.gov/mosquitoes/php/arbonet/index.html
- 10.
CDC. Current Year Data. 2024. https://www.cdc.gov/eastern-equine-encephalitis/data-maps/current-year-data.html
- 11. Lindsey NP, Martin SW, Staples JE, Fischer M. Notes from the field: multistate outbreak of eastern equine encephalitis virus - United States, 2019. MMWR Morb Mortal Wkly Rep. 2020;69(2):50–1. pmid:31945032
- 12. Ladzinski AT, Tai A, Rumschlag MT, Smith CS, Mehta A, Boapimp P, et al. Clinical characteristics of the 2019 Eastern equine encephalitis outbreak in Michigan. Open Forum Infect Dis. 2023;10(5):ofad206. pmid:37180595
- 13. Hill V, Koch RT, Bialosuknia SM, Ngo K, Zink SD, Koetzner CA, et al. Dynamics of eastern equine encephalitis virus during the 2019 outbreak in the Northeast United States. Curr Biol. 2023;33(12):2515-2527.e6. pmid:37295427
- 14. Trobaugh DW, Klimstra WB. Alphaviruses suppress host immunity by preventing myeloid cell replication and antagonizing innate immune responses. Curr Opin Virol. 2017;23:30–4. pmid:28288385
- 15. Trobaugh DW, Gardner CL, Sun C, Haddow AD, Wang E, Chapnik E, et al. RNA viruses can hijack vertebrate microRNAs to suppress innate immunity. Nature. 2014;506(7487):245–8. pmid:24352241
- 16. Trobaugh D, Sun C, Klimstra W. Eastern equine encephalitis virus evades induction of the host immune response through miR-142-3p restriction of myeloid cell replication. J Immunol. 2018;200(Supplement_1):183.7-183.7.
- 17. Trobaugh DW, Sun C, Bhalla N, Gardner CL, Dunn MD, Klimstra WB. Cooperativity between the 3’ untranslated region microRNA binding sites is critical for the virulence of eastern equine encephalitis virus. PLoS Pathog. 2019;15(10):e1007867. pmid:31658290
- 18. Gardner CL, Burke CW, Tesfay MZ, Glass PJ, Klimstra WB, Ryman KD. Eastern and Venezuelan equine encephalitis viruses differ in their ability to infect dendritic cells and macrophages: impact of altered cell tropism on pathogenesis. J Virol. 2008;82(21):10634–46. pmid:18768986
- 19. Gardner CL, Ebel GD, Ryman KD, Klimstra WB. Heparan sulfate binding by natural eastern equine encephalitis viruses promotes neurovirulence. Proc Natl Acad Sci U S A. 2011;108(38):16026–31. pmid:21896745
- 20. Gardner CL, Choi-Nurvitadhi J, Sun C, Bayer A, Hritz J, Ryman KD, et al. Natural variation in the heparan sulfate binding domain of the eastern equine encephalitis virus E2 glycoprotein alters interactions with cell surfaces and virulence in mice. J Virol. 2013;87(15):8582–90. pmid:23720725
- 21. Chen C-L, Hasan SS, Klose T, Sun Y, Buda G, Sun C, et al. Cryo-EM structure of eastern equine encephalitis virus in complex with heparan sulfate analogues. Proc Natl Acad Sci U S A. 2020;117(16):8890–9. pmid:32245806
- 22. Aguilar PV, Leung LW, Wang E, Weaver SC, Basler CF. A five-amino-acid deletion of the eastern equine encephalitis virus capsid protein attenuates replication in mammalian systems but not in mosquito cells. J Virol. 2008;82(14):6972–83. pmid:18480443
- 23. Vogel P, Kell WM, Fritz DL, Parker MD, Schoepp RJ. Early events in the pathogenesis of eastern equine encephalitis virus in mice. Am J Pathol. 2005;166(1):159–71. pmid:15632009
- 24. Dahari H, Ribeiro RM, Rice CM, Perelson AS. Mathematical modeling of subgenomic hepatitis C virus replication in Huh-7 cells. J Virol. 2007;81(2):750–60. pmid:17035310
- 25. Binder M, Sulaimanov N, Clausznitzer D, Schulze M, Hüber CM, Lenz SM, et al. Replication vesicles are load- and choke-points in the hepatitis C virus lifecycle. PLoS Pathog. 2013;9(8):e1003561. pmid:23990783
- 26. Guedj J, Dahari H, Rong L, Sansone ND, Nettles RE, Cotler SJ, et al. Modeling shows that the NS5A inhibitor daclatasvir has two modes of action and yields a shorter estimate of the hepatitis C virus half-life. Proc Natl Acad Sci U S A. 2013;110(10):3991–6. pmid:23431163
- 27. Clausznitzer D, Harnisch J, Kaderali L. Multi-scale model for hepatitis C viral load kinetics under treatment with direct acting antivirals. Virus Res. 2016;218:96–101. pmid:26409026
- 28. Aunins TR, Marsh KA, Subramanya G, Uprichard SL, Perelson AS, Chatterjee A. Intracellular hepatitis C virus modeling predicts infection dynamics and viral protein mechanisms. J Virol. 2018;92(11):e02098-17. pmid:29563295
- 29. Zitzmann C, Kaderali L, Perelson AS. Mathematical modeling of hepatitis C RNA replication, exosome secretion and virus release. PLoS Comput Biol. 2020;16(11):e1008421. pmid:33151933
- 30. Zitzmann C, Dächert C, Schmid B, van der Schaar H, van Hemert M, Perelson AS, et al. Mathematical modeling of plus-strand RNA virus replication to identify broad-spectrum antiviral treatment strategies. PLoS Comput Biol. 2023;19(4):e1010423. pmid:37014904
- 31. Grebennikov D, Kholodareva E, Sazonov I, Karsonova A, Meyerhans A, Bocharov G. Intracellular life cycle kinetics of SARS-CoV-2 predicted using mathematical modelling. Viruses. 2021;13(9):1735. pmid:34578317
- 32. Getz M, Wang Y, An G, Asthana M, Becker A, Cockrell C. Iterative community-driven development of a SARS-CoV-2 tissue simulator. 2021. https://www.biorxiv.org/content/10.1101/2020.04.02.019075v5
- 33. Castle BT, Dock C, Hemmat M, Kline S, Tignanelli C, Rajasingham R, et al. Biophysical modeling of the SARS-CoV-2 viral cycle reveals ideal antiviral targets. 2020. https://www.biorxiv.org/content/10.1101/2020.05.22.111237v2
- 34. Castaño-Arcila M, Aguilera LU, Rodríguez-González J. Modeling the intracellular dynamics of the dengue viral infection and the innate immune response. J Theor Biol. 2021;509:110529. pmid:33129952
- 35. Zitzmann C, Schmid B, Ruggieri A, Perelson AS, Binder M, Bartenschlager R, et al. A coupled mathematical model of the intracellular replication of dengue virus and the host cell immune response to infection. Front Microbiol. 2020;11:725. pmid:32411105
- 36. Lopacinski AB, Sweatt AJ, Smolko CM, Gray-Gaillard E, Borgman CA, Shah M, et al. Modeling the complete kinetics of coxsackievirus B3 reveals human determinants of host-cell feedback. Cell Syst. 2021;12(4):304-323.e13. pmid:33740397
- 37. Harris LA, Hogg JS, Tapia J-J, Sekar JAP, Gupta S, Korsunsky I, et al. BioNetGen 2.2: advances in rule-based modeling. Bioinformatics. 2016;32(21):3366–8. pmid:27402907
- 38. Romero-Brey I, Bartenschlager R. Membranous replication factories induced by plus-strand RNA viruses. Viruses. 2014;6(7):2826–57. pmid:25054883
- 39. Nagy PD, Strating JRPM, van Kuppeveld FJM. Building viral replication organelles: close encounters of the membrane types. PLoS Pathog. 2016;12(10):e1005912. pmid:27788266
- 40. Wolff G, Melia CE, Snijder EJ, Bárcena M. Double-membrane vesicles as platforms for viral replication. Trends Microbiol. 2020;28(12):1022–33. pmid:32536523
- 41. den Boon JA, Nishikiori M, Zhan H, Ahlquist P. Positive-strand RNA virus genome replication organelles: structure, assembly, control. Trends Genet. 2024;40(8):681–93. pmid:38724328
- 42. Frolova EI, Gorchakov R, Pereboeva L, Atasheva S, Frolov I. Functional Sindbis virus replicative complexes are formed at the plasma membrane. J Virol. 2010;84(22):11679–95. pmid:20826696
- 43. Jones R, Bragagnolo G, Arranz R, Reguera J. Capping pores of alphavirus nsP1 gate membranous viral replication factories. Nature. 2021;589(7843):615–9. pmid:33328629
- 44. Elmasri Z, Nasal BL, Jose J. Alphavirus-induced membrane rearrangements during replication, assembly, and budding. Pathogens. 2021;10(8):984. pmid:34451448
- 45. Scutigliani EM, Kikkert M. Interaction of the innate immune system with positive-strand RNA virus replication organelles. Cytokine Growth Factor Rev. 2017;37:17–27. pmid:28709747
- 46. Rupp JC, Sokoloski KJ, Gebhart NN, Hardy RW. Alphavirus RNA synthesis and non-structural protein functions. J Gen Virol. 2015;96(9):2483–500. pmid:26219641
- 47. Baltimore D, Girard M, Darnell JE. Aspects of the synthesis of poliovirus RNA and the formation of virus particles. Virology. 1966;29(2):179–89. pmid:4287327
- 48. Bodo G, Palese P, Horak I, Hilfenhaus J, Siegert W, Jungwirth C. Interferon action: effect on the formation of poxvirus specific polysomes and viral RNA. Z Naturforsch B. 1970;25(10):1164–70. pmid:4394875
- 49. Lee AS-Y, Burdeinick-Kerr R, Whelan SPJ. A ribosome-specialized translation initiation pathway is required for cap-dependent translation of vesicular stomatitis virus mRNAs. Proc Natl Acad Sci U S A. 2013;110(1):324–9. pmid:23169626
- 50. Neidermyer WJ Jr, Whelan SPJ. Global analysis of polysome-associated mRNA in vesicular stomatitis virus infected cells. PLoS Pathog. 2019;15(6):e1007875. pmid:31226162
- 51. Sydiskis RJ, Roizman B. Polysomes and protein synthesis in cells infected with a DNA virus. Science. 1966;153(3731):76–8. pmid:4287165
- 52. Kiho Y. Polysomes containing infecting viral genome in tobacco leaves infected with tobacco mosaic virus. Jpn J Microbiol. 1970;14(4):291–302. pmid:5311544
- 53. Kempf BJ, Barton DJ. Poliovirus 2A(Pro) increases viral mRNA and polysome stability coordinately in time with cleavage of eIF4G. J Virol. 2008;82(12):5847–59. pmid:18400852
- 54. Mowshowitz D. Identification of polysomal RNA in BHK cells infected by sindbis virus. J Virol. 1973;11(4):535–43. pmid:4633684
- 55. Tesfay MZ, Yin J, Gardner CL, Khoretonenko MV, Korneeva NL, Rhoads RE, et al. Alpha/beta interferon inhibits cap-dependent translation of viral but not cellular mRNA by a PKR-independent mechanism. J Virol. 2008;82(6):2620–30. pmid:18160435
- 56. Götte B, Panas MD, Hellström K, Liu L, Samreen B, Larsson O, et al. Separate domains of G3BP promote efficient clustering of alphavirus replication complexes and recruitment of the translation initiation machinery. PLoS Pathog. 2019;15(6):e1007842. pmid:31199850
- 57. Morisaki T, Lyon K, DeLuca KF, DeLuca JG, English BP, Zhang Z, et al. Real-time quantification of single RNA translation dynamics in living cells. Science. 2016;352(6292):1425–9. pmid:27313040
- 58. Milo R, Jorgensen P, Moran U, Weber G, Springer M. BioNumbers--the database of key numbers in molecular and cell biology. Nucleic Acids Res. 2010;38(Database issue):D750-3. pmid:19854939
- 59. Strauss JH, Strauss EG. The alphaviruses: gene expression, replication, and evolution. Microbiol Rev. 1994;58(3):491–562. pmid:7968923
- 60. Mendes A, Kuhn RJ. Alphavirus nucleocapsid packaging and assembly. Viruses. 2018;10(3):138. pmid:29558394
- 61. Button JM, Qazi SA, Wang JC-Y, Mukhopadhyay S. Revisiting an old friend: new findings in alphavirus structure and assembly. Curr Opin Virol. 2020;45:25–33. pmid:32683295
- 62. Brown RS, Anastasakis DG, Hafner M, Kielian M. Multiple capsid protein binding sites mediate selective packaging of the alphavirus genomic RNA. Nat Commun. 2020;11(1):4693. pmid:32943634
- 63. Lulla V, Kim DY, Frolova EI, Frolov I. The amino-terminal domain of alphavirus capsid protein is dispensable for viral particle assembly but regulates RNA encapsidation through cooperative functions of its subdomains. J Virol. 2013;87(22):12003–19. pmid:24006447
- 64. Hu W-G, Steigerwald R, Kalla M, Volkmann A, Noll D, Nagata LP. Protective efficacy of monovalent and trivalent recombinant MVA-based vaccines against three encephalitic alphaviruses. Vaccine. 2018;36(34):5194–203. pmid:30037666
- 65. Ko S-Y, Akahata W, Yang ES, Kong W-P, Burke CW, Honnold SP, et al. A virus-like particle vaccine prevents equine encephalitis virus infection in nonhuman primates. Sci Transl Med. 2019;11(492):eaav3113. pmid:31092692
- 66. Liepe J, Kirk P, Filippi S, Toni T, Barnes CP, Stumpf MPH. A framework for parameter estimation and model selection from experimental data in systems biology using approximate Bayesian computation. Nat Protoc. 2014;9(2):439–56. pmid:24457334
- 67. Ballnus B, Hug S, Hatz K, Görlitz L, Hasenauer J, Theis FJ. Comprehensive benchmarking of Markov chain Monte Carlo methods for dynamical systems. BMC Syst Biol. 2017;11(1):63. pmid:28646868
- 68. Gupta S, Hainsworth L, Hogg JS, Lee REC, Faeder JR. Evaluation of parallel tempering to accelerate bayesian parameter estimation in systems biology. Proc Euromicro Int Conf Parallel Distrib Netw Based Process. 2018;2018:690–7. pmid:30175326
- 69. Karamanis M, Beutler F, Peacock JA, Nabergoj D, Seljak U. Accelerating astronomical and cosmological inference with preconditioned Monte Carlo. Monthly Notices Roy Astron Soc. 2022;516(2):1644–53.
- 70.
Howley P, Knipe D, Whelan S, Freed E. Fields virology. 1st ed. Philadelphia: Wolters Kluwer Health. 2022.
- 71. Wang YF, Sawicki SG, Sawicki DL. Sindbis virus nsP1 functions in negative-strand RNA synthesis. J Virol. 1991;65(2):985–8. pmid:1824787
- 72. Artinger M. Meeting report VLPNPV: Session 10: rapid total particle quantification. Hum Vaccin Immunother. 2014;10(10):3083–6. pmid:25511481
- 73. Klasse PJ. Molecular determinants of the ratio of inert to infectious virus particles. Prog Mol Biol Transl Sci. 2015;129:285–326. pmid:25595808
- 74. Jones JE, Le Sage V, Lakdawala SS. Viral and host heterogeneity and their effects on the viral life cycle. Nat Rev Microbiol. 2021;19(4):272–82. pmid:33024309
- 75. McCormick W, Mermel LA. The basic reproductive number and particle-to-plaque ratio: comparison of these two parameters of viral infectivity. Virol J. 2021;18(1):92. pmid:33931090
- 76. Sokoloski KJ, Snyder AJ, Liu NH, Hayes CA, Mukhopadhyay S, Hardy RW. Encapsidation of host-derived factors correlates with enhanced infectivity of Sindbis virus. J Virol. 2013;87(22):12216–26. pmid:24006438
- 77. Hernandez R, Brown DT, Paredes A. Structural differences observed in arboviruses of the alphavirus and flavivirus genera. Adv Virol. 2014;2014:259382. pmid:25309597
- 78. Hernandez R, Glaros T, Rizzo G, Ferreira DF. Purification and proteomic analysis of alphavirus particles from sindbis virus grown in mammalian and insect cells. Bio Protoc. 2019;9(10):e3239. pmid:33654768
- 79. Sokoloski KJ, Hayes CA, Dunn MP, Balke JL, Hardy RW, Mukhopadhyay S. Sindbis virus infectivity improves during the course of infection in both mammalian and mosquito cells. Virus Res. 2012;167(1):26–33. pmid:22484152
- 80. Jose J, Taylor AB, Kuhn RJ. Spatial and temporal analysis of alphavirus replication and assembly in mammalian and mosquito cells. mBio. 2017;8(1):e02294-16. pmid:28196962
- 81. Carpentier KS, Morrison TE. Innate immune control of alphavirus infection. Curr Opin Virol. 2018;28:53–60. pmid:29175515
- 82. Singer ZS, Ambrose PM, Danino T, Rice CM. Quantitative measurements of early alphaviral replication dynamics in single cells reveals the basis for superinfection exclusion. Cell Syst. 2021;12(3):210-219.e3. pmid:33515490
- 83. Sawicki DL, Sawicki SG. Short-lived minus-strand polymerase for Semliki Forest virus. J Virol. 1980;34(1):108–18. pmid:6768898
- 84. Sawicki DL, Sawicki SG, Keränen S, Kääriäinen L. Specific Sindbis virus-coded function for minus-strand RNA synthesis. J Virol. 1981;39(2):348–58. pmid:7277580
- 85. Sawicki SG, Sawicki DL. The effect of loss of regulation of minus-strand RNA synthesis on Sindbis virus replication. Virology. 1986;151(2):339–49. pmid:3705466
- 86. Lemm JA, Rümenapf T, Strauss EG, Strauss JH, Rice CM. Polypeptide requirements for assembly of functional Sindbis virus replication complexes: a model for the temporal regulation of minus- and plus-strand RNA synthesis. EMBO J. 1994;13(12):2925–34. pmid:7517863
- 87. Kääriäinen L, Ahola T. Functions of alphavirus nonstructural proteins in RNA replication. Prog Nucleic Acid Res Mol Biol. 2002;71:187–222. pmid:12102555
- 88. Schnierle BS. Cellular attachment and entry factors for chikungunya virus. Viruses. 2019;11(11):1078. pmid:31752346
- 89. Holmes AC, Basore K, Fremont DH, Diamond MS. A molecular understanding of alphavirus entry. PLoS Pathog. 2020;16(10):e1008876. pmid:33091085
- 90. Zhang K, Law Y-S, Law MCY, Tan YB, Wirawan M, Luo D. Structural insights into viral RNA capping and plasma membrane targeting by Chikungunya virus nonstructural protein 1. Cell Host Microbe. 2021;29(5):757-764.e3. pmid:33730549
- 91. Jones R, Hons M, Rabah N, Zamarreño N, Arranz R, Reguera J. Structural basis and dynamics of Chikungunya alphavirus RNA capping by nsP1 capping pores. Proc Natl Acad Sci U S A. 2023;120(12):e2213934120. pmid:36913573
- 92. Carrasco L, Sanz MA, González-Almela E. The regulation of translation in alphavirus-infected cells. Viruses. 2018;10(2):70. pmid:29419763
- 93. Aguilar PV, Weaver SC, Basler CF. Capsid protein of eastern equine encephalitis virus inhibits host cell gene expression. J Virol. 2007;81(8):3866–76. pmid:17267491
- 94. Bhalla N, Sun C, Metthew Lam LK, Gardner CL, Ryman KD, Klimstra WB. Host translation shutoff mediated by non-structural protein 2 is a critical factor in the antiviral state resistance of Venezuelan equine encephalitis virus. Virology. 2016;496:147–65. pmid:27318152
- 95. Gorchakov R, Frolova E, Frolov I. Inhibition of transcription and translation in Sindbis virus-infected cells. J Virol. 2005;79(15):9397–409. pmid:16014903
- 96. Patel RK, Burnham AJ, Gebhart NN, Sokoloski KJ, Hardy RW. Role for subgenomic mRNA in host translation inhibition during Sindbis virus infection of mammalian cells. Virology. 2013;441(2):171–81. pmid:23601784
- 97. Toribio R, Díaz-López I, Boskovic J, Ventoso I. An RNA trapping mechanism in Alphavirus mRNA promotes ribosome stalling and translation initiation. Nucleic Acids Res. 2016;44(9):4368–80. pmid:26984530
- 98. Frolova EI, Fayzulin RZ, Cook SH, Griffin DE, Rice CM, Frolov I. Roles of nonstructural protein nsP2 and Alpha/Beta interferons in determining the outcome of Sindbis virus infection. J Virol. 2002;76(22):11254–64. pmid:12388685
- 99. Frolov I, Schlesinger S. Translation of Sindbis virus mRNA: analysis of sequences downstream of the initiating AUG codon that enhance translation. J Virol. 1996;70(2):1182–90. pmid:8551579
- 100. McInerney GM, Kedersha NL, Kaufman RJ, Anderson P, Liljeström P. Importance of eIF2α phosphorylation and stress granule assembly in alphavirus translation regulation. MBoC. 2005;16(8):3753–63.
- 101. Frolova EI, Palchevska O, Dominguez F, Frolov I. Alphavirus-induced transcriptional and translational shutoffs play major roles in blocking the formation of stress granules. J Virol. 2023;97(11):e0097923. pmid:37902397
- 102. Kielian M, Chanel-Vos C, Liao M. Alphavirus entry and membrane fusion. Viruses. 2010;2(4):796–825. pmid:21546978
- 103. De Caluwé L, Ariën KK, Bartholomeeusen K. Host factors and pathways involved in the entry of mosquito-borne alphaviruses. Trends Microbiol. 2021;29(7):634–47. pmid:33208275
- 104. Meshram CD, Agback P, Shiliaev N, Urakova N, Mobley JA, Agback T, et al. Multiple Host factors interact with the hypervariable domain of chikungunya virus nsP3 and determine viral replication in cell-specific mode. J Virol. 2018;92(16):e00838-18. pmid:29899097
- 105. Cristea IM, Rozjabek H, Molloy KR, Karki S, White LL, Rice CM, et al. Host factors associated with the Sindbis virus RNA-dependent RNA polymerase: role for G3BP1 and G3BP2 in virus replication. J Virol. 2010;84(13):6720–32. pmid:20392851
- 106. Clewley JP, Kennedy SI. Purification and polypeptide composition of Semliki Forest virus RNA polymerase. J Gen Virol. 1976;32(3):395–411. pmid:965947
- 107. Barton DJ, Sawicki SG, Sawicki DL. Solubilization and immunoprecipitation of alphavirus replication complexes. J Virol. 1991;65(3):1496–506. pmid:1847467
- 108. Frolov I, Kim DY, Akhrymuk M, Mobley JA, Frolova EI. Hypervariable domain of eastern equine encephalitis Virus nsP3 redundantly utilizes multiple cellular proteins for replication complex assembly. J Virol. 2017;91(14):e00371-17. pmid:28468889
- 109. Zhang X, Fugère M, Day R, Kielian M. Furin processing and proteolytic activation of Semliki Forest virus. J Virol. 2003;77(5):2981–9. pmid:12584323
- 110. Firth AE, Chung BY, Fleeton MN, Atkins JF. Discovery of frameshifting in Alphavirus 6K resolves a 20-year enigma. Virol J. 2008;5:108. pmid:18822126
- 111. Chung BY-W, Firth AE, Atkins JF. Frameshifting in alphaviruses: a diversity of 3’ stimulatory structures. J Mol Biol. 2010;397(2):448–56. pmid:20114053
- 112. Snyder JE, Kulcsar KA, Schultz KLW, Riley CP, Neary JT, Marr S, et al. Functional characterization of the alphavirus TF protein. J Virol. 2013;87(15):8511–23. pmid:23720714
- 113. Gardner CL, Yin J, Burke CW, Klimstra WB, Ryman KD. Type I interferon induction is correlated with attenuation of a South American eastern equine encephalitis virus strain in mice. Virology. 2009;390(2):338–47. pmid:19539968
- 114. Ryman KD, Klimstra WB. Host responses to alphavirus infection. Immunol Rev. 2008;225:27–45. pmid:18837774
- 115. Aguilar PV, Paessler S, Carrara A-S, Baron S, Poast J, Wang E, et al. Variation in interferon sensitivity and induction among strains of eastern equine encephalitis virus. J Virol. 2005;79(17):11300–10. pmid:16103182
- 116. Frolov I, Akhrymuk M, Akhrymuk I, Atasheva S, Frolova EI. Early events in alphavirus replication determine the outcome of infection. J Virol. 2012;86(9):5055–66. pmid:22345447
- 117. Fros JJ, Pijlman GP. Alphavirus infection: host cell shut-off and inhibition of antiviral responses. Viruses. 2016;8(6):166. pmid:27294951
- 118. Hucka M, Finney A, Sauro HM, Bolouri H, Doyle JC, Kitano H, et al. The systems biology markup language (SBML): a medium for representation and exchange of biochemical network models. Bioinformatics. 2003;19(4):524–31. pmid:12611808
- 119. Somogyi ET, Bouteiller J-M, Glazier JA, König M, Medley JK, Swat MH, et al. libRoadRunner: a high performance SBML simulation and analysis library. Bioinformatics. 2015;31(20):3315–21. pmid:26085503
- 120. Welsh C, Xu J, Smith L, König M, Choi K, Sauro HM. libRoadRunner 2.0: a high performance SBML simulation and analysis library. Bioinformatics. 2023;39(1):btac770. pmid:36478036
- 121. Leung JY-S, Ng MM-L, Chu JJH. Replication of alphaviruses: a review on the entry process of alphaviruses into cells. Adv Virol. 2011;2011:249640. pmid:22312336
- 122. Sánchez-San Martín C, Liu CY, Kielian M. Dealing with low pH: entry and exit of alphaviruses and flaviviruses. Trends Microbiol. 2009;17(11):514–21. pmid:19796949
- 123.
Wileman T, Netherton CL, Powell PP. Virus factories and mini-organelles generated for virus replication. Encyclopedia of Cell Biology. Elsevier. 2016. p. 819–27. https://doi.org/10.1016/b978-0-12-394447-4.20102-8
- 124. Grimley PM, Berezesky IK, Friedman RM. Cytoplasmic structures associated with an arbovirus infection: loci of viral ribonucleic acid synthesis. J Virol. 1968;2(11):1326–38. pmid:5750316
- 125. Friedman RM, Levin JG, Grimley PM, Berezesky IK. Membrane-associated replication complex in arbovirus infection. J Virol. 1972;10(3):504–15. pmid:4342056
- 126. Strauss EG, Rice CM, Strauss JH. Sequence coding for the alphavirus nonstructural proteins is interrupted by an opal termination codon. Proc Natl Acad Sci U S A. 1983;80(17):5271–5. pmid:6577423
- 127. Li G, Rice CM. The signal for translational readthrough of a UGA codon in Sindbis virus RNA involves a single cytidine residue immediately downstream of the termination codon. J Virol. 1993;67(8):5062–7. pmid:8331741
- 128. Hasan SS, Sun C, Kim AS, Watanabe Y, Chen C-L, Klose T, et al. Cryo-EM structures of eastern equine encephalitis virus reveal mechanisms of virus disassembly and antibody neutralization. Cell Rep. 2018;25(11):3136-3147.e5. pmid:30540945
- 129. Chen C-L, Klose T, Sun C, Kim AS, Buda G, Rossmann MG, et al. Cryo-EM structures of alphavirus conformational intermediates in low pH-triggered prefusion states. Proc Natl Acad Sci U S A. 2022;119(30):e2114119119. pmid:35867819
- 130. de Groot RJ, Hardy WR, Shirako Y, Strauss JH. Cleavage-site preferences of Sindbis virus polyproteins containing the non-structural proteinase. Evidence for temporal regulation of polyprotein processing in vivo. EMBO J. 1990;9(8):2631–8. pmid:2142454
- 131. Wang YF, Sawicki SG, Sawicki DL. Alphavirus nsP3 functions to form replication complexes transcribing negative-strand RNA. J Virol. 1994;68(10):6466–75. pmid:8083984
- 132. Li M-L, Stollar V. Identification of the amino acid sequence in Sindbis virus nsP4 that binds to the promoter for the synthesis of the subgenomic RNA. Proc Natl Acad Sci U S A. 2004;101(25):9429–34. pmid:15197279
- 133. Suopanki J, Sawicki DL, Sawicki SG, Kääriäinen L. Regulation of alphavirus 26S mRNA transcription by replicase component nsP2. J Gen Virol. 1998;79 ( Pt 2):309–19. pmid:9472615
- 134. Spuul P, Salonen A, Merits A, Jokitalo E, Kääriäinen L, Ahola T. Role of the amphipathic peptide of Semliki forest virus replicase protein nsP1 in membrane association and virus replication. J Virol. 2007;81(2):872–83. pmid:17093195
- 135. Lemm JA, Bergqvist A, Read CM, Rice CM. Template-dependent initiation of Sindbis virus RNA replication in vitro. J Virol. 1998;72(8):6546–53. pmid:9658098
- 136. Kim KH, Rümenapf T, Strauss EG, Strauss JH. Regulation of Semliki Forest virus RNA replication: a model for the control of alphavirus pathogenesis in invertebrate hosts. Virology. 2004;323(1):153–63. pmid:15165827
- 137. Shirako Y, Strauss JH. Regulation of Sindbis virus RNA replication: uncleaved P123 and nsP4 function in minus-strand RNA synthesis, whereas cleaved products from P123 are required for efficient plus-strand RNA synthesis. J Virol. 1994;68(3):1874–85. pmid:8107248
- 138. Simmons DT, Strauss JH. Replication of Sindbis virus. II. Multiple forms of double-stranded RNA isolated from infected cells. J Mol Biol. 1972;71(3):615–31. pmid:4675143
- 139. Gomez de Cedrón M, Ehsani N, Mikkola ML, García JA, Kääriäinen L. RNA helicase activity of Semliki Forest virus replicase protein NSP2. FEBS Lett. 1999;448(1):19–22. pmid:10217401
- 140. Das PK, Merits A, Lulla A. Functional cross-talk between distant domains of chikungunya virus non-structural protein 2 is decisive for its RNA-modulating activity. J Biol Chem. 2014;289(9):5635–53. pmid:24407286
- 141. Li M-L, Stollar V. Distinct sites on the Sindbis virus RNA-dependent RNA polymerase for binding to the promoters for the synthesis of genomic and subgenomic RNA. J Virol. 2007;81(8):4371–3. pmid:17287268
- 142. Yap ML, Klose T, Urakami A, Hasan SS, Akahata W, Rossmann MG. Structural studies of Chikungunya virus maturation. Proc Natl Acad Sci U S A. 2017;114(52):13703–7. pmid:29203665
- 143. Aliperti G, Schlesinger MJ. Evidence for an autoprotease activity of sindbis virus capsid protein. Virology. 1978;90(2):366–9. pmid:726255
- 144. Hahn CS, Strauss EG, Strauss JH. Sequence analysis of three Sindbis virus mutants temperature-sensitive in the capsid protein autoprotease. Proc Natl Acad Sci U S A. 1985;82(14):4648–52. pmid:3895223
- 145. Hashimoto K, Erdei S, Keränen S, Saraste J, Kääriäinen L. Evidence for a separate signal sequence for the carboxy-terminal envelope glycoprotein E1 of Semliki forest virus. J Virol. 1981;38(1):34–40. pmid:7241658
- 146. Garoff H, Huylebroeck D, Robinson A, Tillman U, Liljeström P. The signal sequence of the p62 protein of Semliki Forest virus is involved in initiation but not in completing chain translocation. J Cell Biol. 1990;111(3):867–76. pmid:2391367
- 147. Brown RS, Wan JJ, Kielian M. The alphavirus exit pathway: what we know and what we wish we knew. Viruses. 2018;10(2):89. pmid:29470397
- 148. de Curtis I, Simons K. Dissection of Semliki Forest virus glycoprotein delivery from the trans-Golgi network to the cell surface in permeabilized BHK cells. Proc Natl Acad Sci U S A. 1988;85(21):8052–6. pmid:3186706
- 149. Tellinghuisen TL, Hamburger AE, Fisher BR, Ostendorp R, Kuhn RJ. In vitro assembly of alphavirus cores by using nucleocapsid protein expressed in Escherichia coli. J Virol. 1999;73(7):5309–19. pmid:10364277
- 150. Gleitsman KR, Sengupta RN, Herschlag D. Slow molecular recognition by RNA. RNA. 2017;23(12):1745–53. pmid:28971853
- 151. Yan X, Hoek TA, Vale RD, Tanenbaum ME. Dynamics of translation of single mRNA molecules in vivo. Cell. 2016;165(4):976–89.
- 152. Oh JW, Ito T, Lai MM. A recombinant hepatitis C virus RNA-dependent RNA polymerase capable of copying the full-length viral RNA. J Virol. 1999;73(9):7694–702. pmid:10438859
- 153. Castro C, Smidansky E, Maksimchuk KR, Arnold JJ, Korneeva VS, Götte M, et al. Two proton transfers in the transition state for nucleotidyl transfer catalyzed by RNA- and DNA-dependent RNA and DNA polymerases. Proc Natl Acad Sci U S A. 2007;104(11):4267–72. pmid:17360513
- 154. Kim H, Ellis VD 3rd, Woodman A, Zhao Y, Arnold JJ, Cameron CE. RNA-dependent RNA polymerase speed and fidelity are not the only determinants of the mechanism or efficiency of recombination. Genes (Basel). 2019;10(12):968. pmid:31775299
- 155. Chhajer H, Rizvi VA, Roy R. Life cycle process dependencies of positive-sense RNA viruses suggest strategies for inhibiting productive cellular infection. J R Soc Interface. 2021;18(184):20210401. pmid:34753308
- 156. Griffiths G, Fuller SD, Back R, Hollinshead M, Pfeiffer S, Simons K. The dynamic nature of the Golgi complex. J Cell Biol. 1989;108(2):277–97. pmid:2537312
- 157. Griffiths G, Back R, Marsh M. A quantitative analysis of the endocytic pathway in baby hamster kidney cells. J Cell Biol. 1989;109(6 Pt 1):2703–20. pmid:2592402
- 158. Papamakarios G, Nalisnick E, Rezende D, Mohamed S, Lakshminarayanan B. Normalizing flows for probabilistic modeling and inference. J Mach Learn Res. 2021;22:1–64.
- 159. Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O. Scikit-learn: machine learning in python. J Mach Learn Res. 2011;12(85):2825–30.
- 160. Sun C, Gardner CL, Watson AM, Ryman KD, Klimstra WB. Stable, high-level expression of reporter proteins from improved alphavirus expression vectors to track replication and dissemination during encephalitic and arthritogenic disease. J Virol. 2014;88(4):2035–46. pmid:24307590