This is an uncorrected proof.
Figures
Abstract
The accumulation of viral structural proteins along the endoplasmic reticulum–Golgi intermediate compartment (ERGIC) membrane drives SARS-CoV-2 self-assembly and budding through interactions among proteins, RNA, and the host membrane. The membrane (M) protein, the most abundant structural component, is thought to interact with other proteins and form clusters that induce membrane curvature and initiate virion formation. However, the relative roles of direct and membrane-mediated interactions between M proteins in this clustering process remain unclear. Here, we combine all-atom molecular dynamics (MD) simulations, continuum modeling, and experiments to demonstrate that M–M interactions alone are sufficient to drive clustering in ERGIC-like lipid bilayers, even in the absence of other proteins or RNA. From MD simulations, we quantify the membrane thinning induced by M proteins and the resulting membrane-mediated interaction energy. Integrating these results into a continuum model that describes the evolution of M protein density on a planar membrane, we identify a critical effective interaction energy required for cluster formation at a given protein density. Comparison with atomic force microscopy (AFM) measurements of M protein clusters enables quantitative estimation of the direct and membrane-mediated interaction energies, revealing that direct M–M interactions dominate through an effective oligomerization energy. Together, these findings establish that M protein interactions are sufficient to drive clustering and provide a quantitative framework for understanding the interplay of direct and membrane-mediated forces in coronavirus assembly and budding.
Author summary
Coronaviruses such as SARS-CoV-2 must assemble new viral particles within infected cells before they can spread. This process begins when viral structural proteins accumulate on a specific cellular membrane known as the endoplasmic reticulum–Golgi intermediate compartment (ERGIC). Among these proteins, the membrane (M) protein plays a central role in shaping the virus, yet how M proteins organize into clusters that drive assembly has remained poorly understood. In this study, we combined molecular simulations, theoretical modeling, and experiments to investigate how M proteins interact with each other and the surrounding membrane. We found that M proteins can spontaneously form clusters even in the absence of other viral components, and that direct M–M interactions are strong enough to drive this process. By comparing our model predictions with experimental images, we quantified the strength of these interactions and showed that they outweigh the indirect effects mediated by the membrane. Together, our findings provide a quantitative picture of how M proteins self-organize during coronavirus assembly and reveal a key physical mechanism underlying virion formation.
Citation: McTiernan J, Zhang Y, Li S, Kuhlman TE, Mohideen U, Colvin ME, et al. (2026) Clustering of SARS-CoV-2 membrane proteins in lipid bilayer membranes. PLoS Comput Biol 22(4): e1014229. https://doi.org/10.1371/journal.pcbi.1014229
Editor: Arne Elofsson, Stockholm University: Stockholms Universitet, SWEDEN
Received: November 11, 2025; Accepted: April 10, 2026; Published: April 27, 2026
Copyright: © 2026 McTiernan et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All data and code used in this paper are available through the Zenodo repository at: https://doi.org/10.5281/zenodo.19614750.
Funding: This work was supported by University of California Office of the President UC Multicampus Research Programs and Initiatives, grant M21PR3267 (T.E.K, U.M. M.E.C., R.Z, and A.G.); National Science Foundation RAPID, grant 2034794 (U.M., T.E.K. and R.Z.); National Science Foundation, NSF DMR-2131963 (R.Z. and S.L.); National Science Foundation, NSF-CREST: Center for Cellular and Biomolecular Machines at UC Merced, NSF-HRD-1547848 and NSF-HRD-2112675 (A.G.); National Institutes of Health, NIH G-RISE, T32GM141862 (J.M. and A.G.); National Science Foundation, Center for Engineering Mechanobiology, grant CMMI-1548571 (A.G.); and National Science Foundation, Pinnacles Computing Cluster, NSF-ACI-2019144 (J.M.). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
The assembly of viral components into a complete, infectious particle is one of the most critical steps in the viral life cycle. This process determines the number of virions produced, their structural resilience, and their ability to persist and evolve in the host environment [1,2]. For many RNA viruses, assembly is a highly orchestrated event driven by interactions among structural proteins, the viral genome, and the host cell membrane [3–5]. In some viruses, the genome itself plays an active role in the assembly process, and the structure and stability of the resulting protein shell depend sensitively on the size of the encapsulated cargo [6–8]. In coronaviruses, the unusually large single-stranded RNA genome (≈30 kb) imposes additional constraints on packaging, influencing both the organization of the nucleocapsid and the mechanical stability of the virion. Like many RNA viruses, coronaviruses rely on coordinated interactions among structural proteins, the genome, and a host membrane, but uniquely assemble and bud at the membranes of the endoplasmic reticulum–Golgi intermediate compartment (ERGIC), where their structural proteins and genomic RNA come together to form new virions. The four main structural proteins—spike (S), membrane (M), envelope (E), and nucleocapsid (N)—each play distinct but interdependent roles in this process. The S protein is responsible for binding the virion to the host cell, enabling its entry [9–11]. The M protein defines virion shape, provides the scaffold for assembly, and contributes to maintaining the stability of the surrounding membrane [12–14]. The small E protein acts as an ion channel embedded within the membrane and participates in curvature induction [15,16], while the N protein recruits viral RNA and remains inside the virion, bound to the genome [17–19].
A fully formed coronavirus virion consists of S, M, and E proteins arranged along the viral envelope surrounding the RNA–N complex [9,10]. Outside of the S protein [20], the other three structural proteins do not play a direct role in receptor binding or entry, but are essential for efficient and complete virion formation. Depending on the coronavirus, its replication through assembly and budding along the ERGIC requires M protein and either N protein bound to vRNA, E protein, or both [21–24]. However, in every case, M proteins and their interactions with each other [25,26] are required for complete virion formation [9,14]. Furthermore, M protein is the most prominent structural protein in the virion, and is thought to be responsible for guiding the S [27,14], N/RNA [28–30], and E protein [14,31] throughout the assembly and budding process. After accumulation in the ERGIC, M proteins interact with S and E proteins along the membrane surface, possibly leading to the induction of membrane curvature through protein clusters. Within the surrounding cytoplasm, N protein recruits viral RNA, which in turn binds to these M protein clusters along the surface of the membrane, believed to introduce additional curvature. After sufficient curvature generation, the M/E/S populated ERGIC membrane curls around the virus’ genetic material, leading to the budding of the ∼100 nm virion into the ERGIC [32–34]. Here, we focus on the initial formation of SARS-CoV-2 M protein clusters vital for viral assembly.
Recent cryo-EM experiments have revealed that the SARS-CoV-2 M protein forms a homodimer that exists in two distinct membrane embedded conformations: a compact or “short” form and an elongated or “long” form [28,35]. Both forms consist of an N-terminal domain oriented such that it slightly protrudes from a formed virion, an embedded transmembrane domain, and an intravirion C-terminal domain. Cryo-ET and cryo-EM studies of full coronavirus virions showed that the short form is localized to thinner regions of the membrane with lower curvature, whereas the long form is found in regions of higher curvature [13]. It remains to be seen if M protein is the driving component of virion-like curvature induction. Furthermore, using an improved method for synthesizing the M protein, we previously confirmed the thinning effect of the short form by combining all-atom molecular dynamics (MD) simulations with atomic force microscopy (AFM) measurements [36]. We also found that M protein clusters can form within ERGIC-like membranes depending on the local protein density. However, only the short form was observed throughout the study, consistent with previous findings suggesting that the long form requires additional structural protein interactions for stability [13,35,36].
Here, we seek a quantitative understanding of how interactions between M proteins—and their surface density—govern the formation and size of self-assembled clusters. A clearer grasp of how these M protein clusters form is essential for elucidating the mechanisms underlying viral assembly and budding. For example, it remains uncertain whether the membrane-thinning behavior of the M protein facilitates membrane scission during budding or instead promotes lateral assembly [37–39]. Additionally, disrupting the M protein has been shown to inhibit virion production [40,41], potentially due to reduced M protein oligomerization [41]. The relative magnitudes of membrane-mediated and direct M–M interactions, and their respective contributions to clustering, are also unknown. Capturing both the influence of a single protein on the local membrane and the collective behavior of hundreds to thousands of proteins requires a multiscale approach. In this work, we therefore combine all-atom MD simulations, continuum modeling, and experimental measurements to address these open questions.
Initially, using an extended 2 all-atom MD simulation, we measured the membrane-thinning profile in the vicinity of the M protein and computed a corresponding line tension of approximately 0.10 kBT/nm ± 0.04 kBT/nm. The obtained profile is consistent with our earlier 1
simulations [36] and with observations of the SARS-CoV M protein [13].
Next, we characterized the early stages of M protein assembly within flat supported membranes using atomic force microscopy (AFM). Following the methods described in [36], we generated AFM images of M proteins embedded in 2.25 m × 2.25
m supported lipid bilayers that mimic the physiological composition of the ERGIC membrane. From these images, we identified the existence of a critical protein density above which cluster formation occurs and analyzed the distribution of inter-cluster distances.
To understand the respective contributions of membrane-thinning-induced line tension and direct M–M interactions in clustering during the onset of assembly, we turned to analytical modeling. We employed a Cahn–Hilliard framework [42,43] on a flat membrane - a well-established approach for describing two-component phase separation [44]. Using this model, we analytically and numerically determined how M protein clusters form and evolve as a function of protein area coverage and effective interaction energy. We identified the existence of a critical effective interaction energy for each density, where the effective interaction encompasses all nearest-neighbor forces experienced by an individual protein. By directly comparing model predictions with AFM scans and the line tension obtained from our MD-based membrane-thinning profile, we estimated the effective interaction energy of M proteins in the absence of curvature effects as kBT, 9.6 kBT] and the effective oligomerization energy as
kBT, 8.9 kBT]. These results suggest that membrane thinning does not play a dominant role in M protein cluster formation, but may instead be more relevant at later stages of assembly and budding. Furthermore, the density fraction required for cluster formation on a flat membrane,
, together with the existence of a critical effective interaction energy, points to strategies for inhibiting cluster formation - as seen in [40,41]. Such inhibition would suppress the initial step of the assembly and budding process, thereby limiting SARS-CoV-2 replication.
Materials and methods
All-atom molecular dynamics
The all-atom MD simulation of the short form embedded in a lipid bilayer was performed using the CHARMM36m force field with the MD package GROMACS, version 2022.3 [45,46]. The CHARMM-GUI input generator was used to set up the simulated system with periodic boundary conditions, and supplied the six steps used for equilibration [47–55]. After equilibration, the system was simulated for 2 with a timestep of 2 femtoseconds in the NPT ensemble. System temperature was maintained at 303.15 K using the Nose-Hoover thermostat [56,57], with the pressure maintained semi-isotropically at 1 bar in the x-y dimensions and separately in the z-dimension using the Parrinello-Rahman barostat [58,59]. The coordinates were saved once every 50 thousand timesteps, or every 0.1 ns, for a total of 20 thousand frames.
The protein was inserted into the membrane with an orientation and depth that matched other studies [28,35]. This insertion can be seen in Fig 1a and 1b. Only residues 9–204 are accounted for in the short form structure (PDB: 7vgs), with the first eight and last 18 residues excluded [28]. The membrane was composed of Chol 15%; DOPC 45%; DOPE 20%; DOPS 7%; POPI 13% in both leaflets, with the solvent consisting of NaCl at a concentration of 0.15 M and TIP3P water. Initially, the simulation box consisted of a 24.7 nm x 24.7 nm membrane in the x-y plane with at least 5 nm of solvent above and below the protruding protein, yielding a total unit cell thickness of 17.1 nm in the z dimension. However, by the end of the simulation, the membrane size changed to 24 nm x 24 nm, with a z-axis box size of 18.1 nm. Atom counts and box size evolution can be seen in S1 Fig. The final trajectory was reoriented frame-by-frame such that the protein was centered in the box for all frames. While each trajectory was fitted to eliminate protein translation, this was not the case for the rotation of the protein. All-atom simulations were visualized using ChimeraX [60], with the python library MDAnalysis used to analyze the processed trajectory [61,62].
Final simulation frames of the short form M protein (purple) embedded in a 25 nm x 25 nm lipid bilayer physiologically similar to the ERGIC from above (a) and as an x-axis cross-section (b). (c) Cartoon defining membrane thickness () as the difference between the upper and lower leaflets for a given radial distance r. The two chains in the M protein are distinguished with red or blue, where the C-terminal of the protein pointing downwards is within the virion. Created with BioRender.com (https://biorender.com/se022wy). (d) Average membrane thickness is shown along the x-y plane from 500 ns to 2000 ns, where red represents thicker regions of membrane and blue thinner. Gold signifies the cumulative cross-section of the protein. (e) Radial thickness profile relative to center of protein, obtained from averaging (d) over all angles, is shown with blue representing individual bins. A 95% confidence interval for a line of best fit is shown in orange, with the black trend line representing a radial binning of the blue points. The purple curve represents the analytic solution to the thickness profile given with
nm,
nm, and
nm.
Membrane thickness, defined in Fig 1c, is computationally determined by creating an 18 x 18 square grid in the x-y direction with edges defined by the minimum and maximum phospholipid head position in either direction. At each point in time, these heads are binned, and the average height of the lower and upper leaflets are determined. In the case a bin does not have any head atoms, the corresponding upper and lower leaflets are ignored for that position at the given time. From here, the difference between average height for each leaflet is calculated at every point in time beyond 500 ns, and averaged over time to get the figure shown in Fig 1d.
Thinning induced line tension
Assuming a symmetric deformation, the membrane thickness profile can be considered the combination of two equivalent monolayers. Adapting the model developed in [63] for determining monolayer height in a transition region between higher and lower membrane thickness due to lipid rafts leads to Eq. 1. In this case, the protein is considered a raft with very high elastic moduli which deforms the surrounding membrane to match its hydrophobic thickness. This expression provides membrane thickness as a function of distance from the protein (r), where is the thickness of the unperturbed monolayer,
is the average between low (
) and high thickness monolayers,
is the difference between these regions (
),
and
. The factor of two in the equation converts thickness to that of a bilayer, and the contribution from induced curvature is taken to be negligible.
In [36] we showed, using AFM methods, that regions with protein are much stiffer than the surrounding membrane. With Bs,r and Ks,r the bending and tilt moduli of the surrounding membrane and the raft/protein respectively, and
. Furthermore, Young’s modulus measurements of a similar M protein populated supported lipid bilayer system in [36] gives Bs ∼ 3.0 kBT ± 1.0 kBT, leading to KS ∼ 3.0
± 1.0
,
nm, and
. Error in Bs and Ks are overestimates obtained from standard deviation in Young’s modulus measurements from [36].
With this thickness profile, the line tension generated from the cost of bending and tilt can be determined as described in [63]. Without the negligible contributions from induced curvature, Eq. 2 shows the line tension . Where
and
allows for the corresponding approximation, and the factor of two accounts for the bilayer nature of the membrane.
Protein assembly continuum model
To properly represent the assembly of a large number of M proteins along a flat membrane, we utilize a continuum model. We use a Cahn-Hilliard model [42,43] to describe protein density evolution on a flat plane, an approach commonly used to describe two component phase separation [44]. It is to be noted that such approaches have also been extended to describe the motion of curvature inducing transmembrane proteins [64–68], though here we consider only the planar lipid bilayer case and do not consider density dependent membrane surface tension nor membrane viscosity [65,66,69–71]. Our analysis is restricted to a flat membrane for direct comparison with supported lipid bilayer AFM profiles, which show no height variation beyond protrusions from M protein C-terminals. The total free energy of the system consists of both an enthalpic component, arising from protein-protein and protein-membrane interactions, and an entropic part,
The entropic portion of the free energy is a typical entropy of mixing which can be expressed as,
where is the protein density fraction defined as a two dimensional scalar field along a flat infinitesimally thin surface, T is the temperature, and a represents the nearest neighbor distance between proteins [44]. Additionally, a defines the M protein saturation density
[65,68]. We take this distance to be the approximate width of a single M protein since AFM images show tight packing.
The interaction portion of the free energy, , involves three different terms,
where is an effective interaction energy accounting for all nearest neighbor interactions a single M protein would experience (refer to Effective interaction energy for details) [44]. An effective interaction energy greater than zero implies attraction. This has the form of a two-component regular solution or Flory-Huggins model [42,44,72,73], with volume exclusion, protein attraction, and interfacial energy terms.
Following a standard approach [65,67], the dynamics of can now be expressed using Model B dynamics to account for the conservation of protein density, leading to
The full expression for the variational derivative and density evolution can be seen in S1 Appendix.
Linear stability analysis
To quantitatively understand the conditions and parameter regimes required for clustering, we perform linear stability analysis. This is done by first applying a small perturbation to a homogeneous density state of the form,
Here, the initial protein density fraction is , and the perturbation
(set by
). Each mode is defined by its wavevector q and growth rate
. Throughout this work, q is nondimensionalized according to
. A positive growth rate for any q implies an unstable mode with that wavevector and indicates clustering with an average distance between cluster formations (d) set by
. A schematic characterizing protein assembly in an unstable regime can be seen in Fig 3.
Using Eq. 7 in Eq. 6 and ignoring all nonlinear terms leads to the protein evolution equation,
where tildes signify nondimensionalized quantities ( and
). Applying the form of the perturbation in Eq. 8 to Eq. 9 allows us to obtain the growth rate as a function of wavevector,
From this dispersion relation, the wavevector and growth rate corresponding to the fastest growing modes are determined.
Effective interaction energy
The effective interaction energy () (or Flory Huggins interaction parameter) is determined by considering a lattice where lattice sites can be occupied by membrane or a protein, and only nearest neighbor and spatially independent interactions are considered. As such,
can then be expressed in terms of membrane and protein interactions as [44],
where z defines the number of nearest neighbors each site has, and is the interaction between two sites occupied by entities of type i and j a distance a apart where
. Membrane sites are notated with mem, while protein ones are indicated with m.
Each site-site interaction term can be determined using smaller scale MD simulations, or from elasticity models given the thickness profile [37,38]. However, in doing so, numerous unknown parameters and membrane characteristics are introduced to the system, such as lipid tilt or membrane tension. Note that since we neglect curvature its contribution to is not considered. Furthermore, the scale and nature of AFM data prevents the analysis of these quantities. We therefore treat
as an effective parameter to be estimated.
We note that the energy difference between a discrete lattice with two proteins spread apart and one with them adjacent to each other is equivalent to Eq. 12. This energy difference can also be considered the energetic cost resulting from line tension and direct protein-protein interactions. These direct protein-protein interactions do not correspond to a defined structural interface, but rather a set of them, and is considered an effective oligomerization energy (). As a result, the effective interaction energy can be expressed as,
where is the thinning-induced line tension and
is the energy in which these proteins are bound together. A more detailed derivation of this concept is shown in S2 Appendix.
Finite difference method
To numerically solve the protein evolution equation shown in Eq. 6 and compare with the analytical cluster formation predictions, a finite difference method is utilized. The python code developed for this finite difference method can be found in [74]. Spatial derivatives are determined through fourth order central difference, while Fourth Order Runge Kutta is used for time evolution. Membrane (M) protein is initially distributed normally with an average equal to the initial protein density fraction () and a standard deviation of 5 × 10−5, over a 250 x 250 grid. Each grid point is 0.4a in length, where a is the approximate width of an M protein and is assumed to be a = 5 nm, while corresponding time steps depend on the effective interaction energy within the range of [1 × 10−9, 5 × 10−9] s. High spatial and temporal resolution is required due to the lack of a more sophisticated finite element method. The temperature used throughout every simulation is T = 303.15 K, with the diffusion constant A = 5 × 10−13 m2/s.
To compare with the analytically determined wavevector and growth rate, a radial power spectrum of the density is calculated for every 100 time steps within linearity, defined as . The maximum wavevector (qmax) is determined by fitting a Gaussian to the radial power spectrum, and equating it to the point in which there is a maximum. From here, the square root of this Gaussian at the maxima for each measured time step is fit to exponential growth,
Example power spectra and fits are shown in S3 Fig. Outside of linearity, the nearest neighbor distance is used to classify cluster formation periodicity. Clusters are defined according to a density threshold such that the protein area fraction is equivalent to the initial protein density (). This process can be seen in S4 Fig for each of the images shown in Fig 4b. The evolution of nearest neighbor distance is shown in S5 Fig, and is measured every 10,000 time steps after
. Time cut-offs for calculating nearest neighbor distance to minimize the impact of Ostwald ripening were found by determining the flattest region of the nearest neighbor distance evolution plots, with the cut-offs shown in S5 Fig.
AFM sample preparation and imaging
The methods in this section are similar to those from [36]. First, monodisperse LUVs around 120 nm in diameter were prepared using a vesicle extruder, with lipid composition corresponding to that of the endoplasmic reticulum-Golgi intermediate compartment (ERGIC). The molar ratio of different lipids (Avanti Polar Lipids, Inc, Alabaster, AL) used are 1-palmitoyl-2-oleoyl-glycero-3-phosphocholine (POPC): 1-palmitoyl-2-oleoyl-sn-glycero-3-phosphoethanolamine (POPE): 1-palmitoyl-2-oleoyl-sn-glycero-3-phosphoinositol (POPI): 1-palmitoyl-2-oleoyl-sn-glycero-3-phospho-L-serine (POPS): Cholesterol = 0.45: 0.2: 0.13: 0.07: 0.15 [75]. The 5 mg/ml lipid solution in chloroform was dried in a glass vial with N2 gas stream, then vacuumed overnight at -30 in Hg. The dried lipid mixture was hydrated with buffer (150 mM NaCl, 20 mM HEPES, pH = 7.2) and 30 s vortex, prior to ten freeze-thaw cycles with dry ice and 37 °C bath. After the final thawing step, the aqueous solution was passed 11 times through a polycarbonate membrane with 100 nm pores (Nuclepore Track-Etch membrane, Whatman, Chicago, IL).
To reconstitute the M proteins into LUVs, concentrated stock solution (∼ 400 mM) of n-dodecyl--D-maltoside (DDM Avanti Polar Lipids, Inc, Alabaster, AL) were added into 5 mg/mL of freshly extruded LUVs solution to reach a final concentration of 100 mM. M-protein stabilized by Triton X-100 (stock solution: 1 wt.% Triton X-100 per every 2 mg/mL M protein) was added to the LUVs solution at mass ratio of M/lipid = 1/100, 1/67, and 1/50 after 10 min of incubation. The solution was allowed another 10 min of incubation and the detergent was removed using 80 mg of wet BioBeads (Bio-Rad, Hercules, CA) per mL of LUVs. After 3 additions of BioBeads (once per 2 hours), the M-reconstituted LUVs were separated from the solution using the centrifuge.
For the preparation of supported bilayer samples for AFM imaging, 75 L of the solution containing LUVs collected from the bottom of a microcentrifuge tube was deposited onto freshly cleaved pristine mica and incubated for 1 hour. Next, the sample was then gently rinsed with 5 mL of buffer. The samples were kept submerged by buffer during the sample preparation and imaging process. Imaging was performed within 1 hour of sample preparation.
An AFM fluid cell filled with buffer with an MSNL cantilever (Bruker, Camarillo, CA) in tapping mode was used. The spring constant was calibrated to be 0.30 N/m using the thermal oscillation spectrum. 512 x 512 pixel images with dimensions 2.25 m x 2.25
m were taken, with a vertical resolution of 0.01 nm and horizontal resolution of 4.4 nm. The cantilever tip size and image resolution was calibrated using 10 nm gold spheres (Ted Pella, Redding, CA) using our previously reported [36,76].
Atomic force microscopy cluster determination
To define regions with and without protein from the AFM height images shown in Fig 2, a total variation denoising filter was used (TV Bregman from the python skimage library). The impact of this filter can be seen in S6 Fig. The TV Bregman filter allows for the maintainment of sharp height changes as the probe transitions from membrane to protein, while also reducing noise within protein clusters or large regions without protein [77,78]. After filtering the data for the case of Fig 2, regions with or without protein are decided based off a thresholding value defined using Otsu’s method [79]. Anything above this value is a pixel with protein, while values below represent membrane. However, for the lowest area fraction, Otsu’s method is not sufficient for protein determination, and thus the threshold value for the middle density fraction is used. Refer to S7 Fig for the dependence of threshold on area fraction.
(a) Cartoon showing nearby M protein attracting as a result of minimizing membrane-thinning-induced line tension. Longer range protein-protein interactions could also help this process. The different shades of blue represent the monomers within the dimer. (b) After attracting an adjacent protein, they will bind together as a result of direct M-M interactions and to further minimize line tension. (c) With sufficient protein density, enough of these complexes will form to create a periodic arrangement of clusters, defined by an average distance between them (d). Panels a-c were created with BioRender.com (https://biorender.com/ogth2uc). (d) AFM height images of a 2.25 m × 2.25
m supported membrane for three different M protein mass to lipid mass ratios. The corresponding protein density area coverage is shown for each ratio, increasing from left to right, where lighter regions represent greater heights.
Upon thresholding the filtered data, a cluster is defined as more than four adjacent pixels with protein. This restriction represents anything more than a single protein, since each pixel is approximately 4.4 nm across. After this restriction, the center of ’height’ for each cluster is determined. This additional weighting accounts for proteins being off-center, or the pixel only showing the edge of the protein. Lastly, nearest neighbor distance is calculated such that each centroid is supplied a minimum distance representing the closest cluster using the sklearn python library NearestNeighbor function. For two centroids that are both closest to each other, only a single value is accounted for. To determine whether the centroid density is roughly constant throughout the images, kernel density estimation from the sklearn python library is used (KernelDensity function). See S8 Fig for centroid density images at each protein area fraction.
Results
M protein induced thinning profile and line tension
To determine the short form’s impact on membrane thickness we perform all-atom MD simulations of it embedded within a 25 nm x 25 nm lipid bilayer physiologically similar to the ERGIC. Final simulation frames, after 2 , can be seen in Fig 1a and Fig 1b, where the thickness is defined as the difference between the height of the upper and lower leaflets (Fig 1c). Averaged over the last 1.5
of the simulation, the membrane thickness is shown from above in Fig 1d, with the cumulative cross-section of the protein in gold. While not completely axisymmetric, the membrane is noticeably thinner near the protein. Furthermore, taking a radial profile of membrane thickness by averaging over every angle with respect to the center of the protein leads to Fig 1e. Every bin is represented as a blue triangle, with the black trend line corresponding to averaging bins over a radial range. Additionally, the linear line of best fit and the corresponding 95% confidence interval in orange shows the significance of the short form’s membrane thinning. Similar to the shorter MD results and AFM profile displayed in [36], the short form thins the membrane by 0.5 nm starting 12 nm from the center of the protein. Confirmation of the protein’s stability can be seen in S9 Fig.
We can compare the membrane thickness data to Eq. 1, which represents an analytic prediction of the membrane thickness as a function of distance from the center of the protein (purple line in Fig 1e). In our case, the thickness profile is parametrized by average monolayer thickness nm, unperturbed monolayer thickness
nm, and total thinning
nm. With the symmetric behavior of the membrane, each of these quantities are half of the full bilayer values. Integrating the elastic energy cost of this membrane thinning profile provides an estimate of the line tension, as described in Thinning induced line tension. Using elastic constants for a membrane from a similar system setup (Ks ∼ 3.0 kBT/nm ± 1.0 kBT/nm and Bs ∼ 3.0 kBT ± 1.0 kBT) [36], the line tension from bending and tilt is approximately
kBT/nm ± 0.04 kBT/nm. We note that this line tension could serve as a possible membrane-mediated interaction to create clusters.
M protein assembly depends on initial density fraction
Next, we use AFM to directly observe M protein cluster formation on a supported lipid membrane. First, shown in Fig 2a’s cartoon, two nearby M proteins can attract solely in an attempt to minimize the elastic deformation of the membrane from thinning. As these proteins attract each other, Fig 2b displays a cartoon of the minimized line tension and role of direct M-M interactions in binding them together. Through these membrane-mediated interactions, protein clusters can form depending on the protein density with some characteristic distance between them (cartoon shown in Fig 2c).
Using three different protein-lipid mass ratios provides three different protein area coverage values, shown as AFM images in Fig 2d for a 2.25 m × 2.25
m membrane with protein spread throughout. Area coverage is considered the percentage of pixels occupied with a protein, where protein occupation is defined based on a threshold AFM height value dependent on mass ratio. A mass ratio of 0.01
leads to the lowest area coverage, shown in the first panel of Fig 2d, ranging from 0.0013 to 0.0017 when accounting for the 0.01 nm vertical resolution of the AFM scan. In this case, M proteins are found individually or as small oligomers without any cluster formation. As this ratio is increased to 0.015
in Fig 2d panel 2, the area coverage increases to 0.020 ± 0.001. Furthermore, larger clusters begin to form while individual proteins or small oligomers remain isolated, leading to a non-isotropic protein density. When compared to the highest mass ratio of 0.02
, the protein area coverage jumps to 0.161 ± 0.005, shown in Fig 2d panel 3. With this area coverage, protein clusters form roughly isotropically, where individual proteins are very rare, allowing for the characterization of a typical distance between clusters (d). Thus, there exists a critical M protein density between the area coverages shown in Fig 2d panels 2 and 3, where clusters begin forming consistently with a characteristic distance. Position dependent centroid density for each of the three AFM images is shown in S8 Fig.
Effective interaction energy defines onset of cluster formation
To quantitatively understand the cluster formations in our planar AFM experiments and to better characterize M protein assembly overall, we utilize a Cahn-Hilliard model accounting for protein interactions through two-component regular solution theory described in Protein assembly continuum model. Linear stability analysis is used to identify the onset of cluster formation, providing parameter regimes in which these formations are possible (refer to Linear stability analysis).
Briefly, we derive a dispersion relation between the wavevector q and corresponding growth rate of a perturbation. Unstable modes will have large positive values for
and stable modes will have negative values. An example unstable relation is shown in Fig 3a, where there exists a maximum growth rate
at wavevector qmax. This is considered the fastest growing mode and will dominate the others numerically and experimentally, leading to cluster formation with a spacing defined by the maximum wavevector, as seen in the upper right panel of Fig 3a. Additionally, a stable mode will decay back to the initial condition, or the proteins will be spread randomly (leftmost panel of Fig 2d and the bottom right panel of Fig 3a).
(a) Typical unstable dispersion relation, where each mode for the perturbation is defined by its wavevector q and its growth rate
. A growth rate greater than zero represents an unstable mode, where the fastest growing mode is described by
. Within this mode, M protein clustering will occur for an average distance between clusters d, as seen in the upper right panel. A stable mode is shown in the bottom right panel, where proteins remain isolated. Created with BioRender.com (https://biorender.com/geblf4d, https://biorender.com/90ou0jv). Phase diagrams for maximum wavevector (b) and maximum growth rate (c) with respect to the effective interaction energy (
) and the initial protein density fraction (
). Each dotted grey line represents a corresponding contour in the provided legend.
The effective interaction energy and initial protein density fraction define the onset of cluster formation. Initial M protein density fraction is equivalent to the AFM area coverage, ranging from 0 - 1. Fig 3b shows these parameters’ impact on the maximum wavevector, where white represents a pairing without any cluster formation. As effective interaction energy is increased so is the maximum wavevector. Additionally, Fig 3c shows the maximum growth rate as a function of
and
, which is asymmetric about
as opposed to the maximum wavevector.
Thus, for any , there exists a range of
where clusters will start to form. Furthermore, for any initial density fraction there exists a critical effective interaction energy at which the onset of assembly begins. This is in agreement with Fig 2d, where there exists some critical density at which consistent clusters begin to form.
Inverting the curves displayed in Fig 3b leads to a relation defining the effective interaction energy as a function of the initial protein density fraction and the maximum wavevector,
This can also be used to express the expected distance between clusters for a given interaction strength,
As a result, effective interaction energy can be estimated from our experimental AFM images shown in Fig 2d, assuming the relation holds beyond linearity. See S1 Appendix for the expanded version of maximum growth rate and maximum wavevector. Solely from the ability to form clusters at , a rough lower bound for the effective interaction energy can be identified:
kBT. However, direct comparison with AFM images using Eq. 15 is required for a more accurate estimation. While the onset of cluster formation is needed for further assembly, it is unknown whether these formations will survive beyond linearity. Thus, we numerically simulate the system within and outside of linearity to confirm Eq. 15 and determine whether these formations survive.
Numerical simulations of M protein density fraction evolution were performed with a finite difference method along a uniform grid (details provided in Finite difference method). Fig 4a shows the final frame within linearity for three different simulations at different effective interaction energies, with Fig 4b displaying a later time image prior to Ostwald ripening dominating the system for the same simulations. While density variations are smoothed out in Fig 4a, extrema appear roughly periodically, increasing in frequency as increases, agreeing with Fig 3b. Furthermore, the time needed to reach linearity decreases with increasing effective interaction energy. These trends are confirmed by calculating qmax and
for each simulation, as described in Finite difference method and shown in S3 Fig. Fig 4c shows these quantities within linearity, where individual simulations for each interaction energy are shown with an empty circle and simulations shown in Fig 4a are designated with a full circle. With the agreement between simulations and the analytic prediction (colored according to
, Eq. 25 in S1 Appendix), we confirm that our linear stability analysis holds in linearity.
Simulations for three different interaction energies are shown for , with (a) portraying each at the final frame of linearity and (b) displaying later frames of a stable state before Ostwald ripening begins to dominate. The time of the frame is shown above the image, with brighter regions representing higher density, where the scale of the colorbar changes between (a) and (b). (c) Plot of the numerically determined maximum growth rate and maximum wavevector for the shown simulations and additional replicates within linearity. Each interaction energy is identified with a different color, where replicates are shown as hollow circles, those shown in (a) and (b) are filled in circles, and the analytic prediction for a given interaction energy is a red star. The analytic prediction for each interaction energy is shown with the line between the higher/lower density regions, which changes color according to
in the corresponding colorbar. (d) Comparison between average nearest neighbor distance at stable states of numerical simulations to an analytic prediction for different interaction energies. Simulations shown in (a) and (b) are filled in orange, while extra replicates are filled with black. Representing the analytic prediction, the blue curve is determined using the highlighted equation. For (c) and (d), the striped red region represents the direction the respective prediction would move if increasing density, while the striped blue region gives the reverse.
High density regions survive nonlinear transitions
The numerically determined evolution of M protein density from linearity to nonlinearity involves a sharp transition to a regime of diffusion limited growth where clusters gradually deplete the unclaimed surrounding protein. After depleting the reservoir of isolated protein, these clusters very slowly remodel, where they grow and shrink similarly to Ostwald ripening, typical of Model B dynamics [42]. Furthermore, through comparison between Fig 4a and 4b, it is clear that maxima within linearity survive cluster formation. This behavior can be seen for a variety of effective interaction energies in S1-S5 Videos, and for the shown systems in S10 Fig. Note that the small subset of clusters that form during the initial transition and quickly dissipate coincide to weaker maxima within the linear regime. With AFM formations seemingly within the diffusion limited growth regime (Fig 2d), analysis is only performed on simulations prior to Ostwald ripening dominating the system through a cut-off time. The cut-off time chosen for each simulation is shown above its x-y density plot, as seen in Fig 4b, with more information in Finite difference method and exact cut-off times shown in S5 Fig. Similar to linear stability analysis and Fig 4a, larger effective interaction energies decrease the time it takes for the system to reach the end of the diffusion limited growth dominated regime.
To quantify the survival of high density regions beyond linearity, the nearest neighbor distance between cluster formations within diffusion limited growth is used in combination with the maximum wavevector determined within linearity. Average nearest neighbor distance (d) is shown for each set of effective interaction energies in Fig 4d, where orange points correspond to simulations shown in Fig 4a and 4b and black to extra replicates. The analytic prediction, Eq. 16, from linear stability analysis is shown in blue, where the corresponding equation is highlighted. Spread between replicates for a given effective interaction energy are a result of the non-infinite size of the simulated box and the discrete number of clusters. Additionally, the slight overestimate of d, particularly for kBT, stems from the initial stage of Ostwald ripening. Refer to S4 Fig for the process of calculating average nearest neighbor distance (d) with the images in Fig 4b. The evolution of this quantity, in addition to the number of clusters and density variance, for each effective interaction energy is shown in S5 Fig, where more information on nearest neighbor distance can be found in Finite difference method.
With the agreement in average cluster nearest neighbor distance for linear stability analysis and numerical simulations beyond linearity at a variety of effective interaction energies, Eq. 15 holds beyond linearity. Thus, it can be used to estimate effective interaction energy from AFM cluster formations. In Fig 4c and 4d, every has five replicates, where some points are so similar to the point of obscuring each other.
Effective interaction energy from AFM images
Protein clusters are identified following the filtering and thresholding of the raw AFM data displayed in Fig 2d, as described in Atomic force microscopy cluster determination. Fig 5a shows the image with the highest area coverage, since clusters are not isotropically distributed otherwise, where red dots represent the centroid of a cluster. A representation of this cluster density for every area coverage is shown in S8 Fig. Binned nearest neighbor distances for each cluster are shown in Fig 5b, where the red curve represents a kernel density estimate of the histogram and the dotted red line signifies the corresponding maxima, equated to d = 77.2 nm. A comparable average nearest neighbor distance for a simulation with kBT is shown through the black dotted line as d = 79.6 nm. This simulation is shown on the leftmost panel in Fig 5c and 5d.
(a) Filtered and thresholded AFM image at the highest protein area coverage, . Protein clusters larger than four pixels are shown in yellow with their corresponding centroid in red, while other regions are in purple. (b) A normalized histogram of nearest neighbor distance between protein clusters is shown in blue, where the kernel density estimate is shown in red. The average nearest neighbor distance is taken as the point corresponding to the maximum in the kernel density estimate, and is shown as a red dotted line. A dotted black line displays a comparable d for a simulation with
kBT. Simulation frames are shown at (c) the final frame of linearity and (d) t = 0.0969 s, with time at the top of each image. This time corresponds to the stable state before Ostwald ripening takes over for
. Initial density fraction increases from left to right. Note that the colorbar in (c) changes for each figure. The simulated value shown in (b) is in leftmost column of (c) and (d).
With this nearest neighbor distance, the effective interaction energy can be calculated using Eq. 15. Accounting for the spread in the nearest neighbor distance 77.2 nm ± 1.2 nm, error in calculating area coverage 0.161 ± 0.005 (refer to S7 Fig), and the size of the protein nm, gives an effective interaction energy in the range of
kBT, 9.6 kBT]. This relationship holds because AFM clustering appears to be within the diffusion limited growth regime.
Next, Fig 5c and 5d indicate the impact of higher initial protein density in cluster formation within our model. While linearity is still in agreement with our analytic predictions for , as seen in Fig 5c and S11 Fig, this is not the case for higher densities beyond linearity. For an initial density fraction of
, shortly after transitioning beyond linearity, clusters begin to merge together. This leads to a much larger distance between clusters than the predicted quantity. In the case of
, upon leaving linearity, lines of higher density are formed, with the typical cluster formations only appearing at much later time. In both cases, after sufficient time for stabilization, maxima within the linear regime do not coincide with clusters post transition. The transition beyond linearity for the middle and rightmost panels of Fig 5d involves coarsening and coalescence beyond diffusion limited growth. Clusters initially form and then merge together, most likely until there is only a single cluster left. These observations can be seen in S3, S6, and S7 Videos.
Discussion
In this paper, we examined M protein cluster formation—an essential early step in the assembly and budding of SARS-CoV-2—across multiple length and time scales. As discussed in [36], the long-form M protein was not observed in our AFM experiments, consistent with the substantial structural rearrangements seen during long-form simulations [36] and the requirement for binding with a Fab complex for structural determination [28,35]. Whether the absence of the long form stems from the lack of additional viral structural proteins or its occurrence only within large, mature clusters remains unknown. Another possibility is that the conformational transition from the short to the long form involves a substantial energy barrier, making the long form inaccessible under the conditions of our simulations and AFM experiments. Thus, the cluster formation analyzed here is most representative of early stage assembly, involving only the short form, and does not include other aspects of viral assembly such as membrane curvature induction or interactions with additional structural proteins. Nucleation and energy barriers play an important role in viral assembly in general [80,81], and the emergence of short-form M clusters marks a key nucleation event that overcomes the initial energy barrier for assembly and sets the stage for subsequent curvature generation and budding. These clusters therefore represent not only the onset of viral assembly but also a crucial control point whose energetics could define bottlenecks in the budding process.
Utilizing all-atom MD of an individual short-form M protein embedded in an ERGIC-like membrane, we quantified the protein-induced membrane thinning on the scale of 0.5 nm near the protein, decaying over roughly 12 nm. The thinning appears purely elastic, without any noticeable correlation between location and lipid composition. As discussed in [36], this elastic deformation likely arises from the mismatch between the transmembrane domain (≈4 nm) and the average bilayer thickness. The thinning profile closely resembles that obtained in shorter MD runs of the same structure and AFM images of individual full-length M proteins [36], while also agreeing with previous observations of the SARS-CoV M protein [13]. Furthermore, the thinning-induced line tension derived from this deformation can be estimated as described in Thinning induced line tension. This line tension acts as a membrane-mediated attraction between neighboring M proteins with a magnitude of approximately 0.10 kBT/nm ± 0.04 kBT/nm.
To characterize M protein clustering, we analyzed AFM images of M proteins embedded in planar supported lipid bilayers. AFM images were acquired within a 2.25 m × 2.25
m membrane patch at three different protein area coverages (density fractions). Our results indicated a critical average density fraction above which isotropic clusters appeared, with formation observed at
but not at
. While some clusters were observed for
, their small sizes and heterogeneity suggest they likely result from local density variations or aggregation prior to vesicle insertion, rather than true phase-separated clusters at equilibrium. We attribute the lack of any observed Ostwald ripening at the highest imaged area fraction to the proteins being kinetically trapped, potentially due to interactions with the M protein’s N-terminal and the bilayer supporting mica surface.
To quantitatively predict M protein cluster formation, we employed a continuum model describing density evolution on a flat membrane. Linear stability analysis revealed a critical density for cluster formation that depends on the effective interaction energy. Moreover, for each initial density fraction, a critical interaction energy exists above which clustering occurs. Based on the AFM data, we estimated a lower bound for the effective interaction energy of kBT. Linear stability analysis also allowed us to estimate this interaction energy directly from the average distance between clusters and the protein area coverage.
We then confirmed these predictions using finite-difference simulations within the density range observed in AFM data. In the linear regime, or for small density evolution, protein density appears as a set of smoothed extrema that gradually sharpen into high-density clusters located at the initial maxima for low-density fractions. This behavior is consistent with numerical solutions of similar models in the low-density regime [65]. Within this regime, linear stability analysis remains valid well beyond the linear phase, until late-time Ostwald ripening becomes dominant. For densities beyond that of our AFM data, where density fractions are two to three times higher, linear stability holds in the early regime but breaks down after clusters merge, leading to coalescence and coarsening not observed in AFM images.
Having established agreement between our linear stability analysis and numerical solutions beyond linearity, we compared theoretical predictions with AFM measurements to estimate effective interaction energies. This comparison only holds for low densities, where coalescence plays little to no role, and under the assumption that AFM measurements are kinetically trapped before Ostwald ripening dominates. From the AFM-determined average nearest-neighbor distance of 77.2 ± 1.2 nm between clusters, we estimated kBT, 9.6 kBT]. Using the MD-derived thinning-induced line tension of 0.10 kBT/nm ± 0.04 kBT/nm, we estimate thinning’s contribution to the effective interaction energy as only [0.4 kBT, 1.5 kBT] dependent on the protein’s variable width (see Effective interaction energy for details). Thus, membrane thinning alone does not play a dominant role in cluster formation, likely acting more prominently during later curvature generation and budding. The remaining portion of the effective energy, attributed to direct M–M interactions and considered an effective oligomerization energy, lies in the range
kBT, 8.9 kBT]. These energies are comparable to those of a model transmembrane protein observed with high-speed AFM, with membrane thinning introducing an attraction of
kBT to a total interaction of ∼6 kBT [39].
The higher-order oligomerization of M proteins has been consistently identified [13,28,36,41], providing support for the estimated effective oligomerization energy range. Adjacent dimers are also believed to predominantly contact each other through their C-terminal domains [13,28], as needed to achieve our detected tightly packed AFM clusters. While the structural interface for M protein higher-order oligomerization has not been elucidated, cryo-EM studies have extensively classified monomer-monomer contacts within the M protein dimerization interface for SARS-CoV-2 in its natural state [28,35], and when disrupted by small-molecules [40,41]. This interface remains comparable for other coronaviruses [82,83], with the C-terminal involved [28], and the transmembrane domain potentially dominating [84,85]. While our estimated effective oligomerization energy only considers interactions between dimers, the interface between monomers could play a role in the set of potential dimer-dimer contacts. Additionally, MD simulations of SARS-CoV-2 M protein structural predictions previously identified a binding energy between two monomers similar in scale to our oligomerization energy [84], yet no studies have explored the dimer-dimer interface. Although it is unknown how these energies differ with M protein conformation, the act of altering conformations [25,26,40,41] is capable of inhibiting assembly, highlighting the importance of M–M interactions throughout the assembly and budding process [86].
Confirmation of linear stability predictions beyond the linear regime, together with the estimated interaction energies, yields a density fraction range of [0.118, 0.304] required for assembly-like cluster formation. To the best of our knowledge, M protein density along the ERGIC during budding has not been measured. However, by comparing the number of buds forming in a given compartment to the compartment’s surface area, we can roughly estimate the minimal M protein density. Studies of coronavirus infected cells reveal the number of virions produced in an individual compartment at a given time varies widely [41,87–93], with smaller compartments (r ∼ 100 nm) holding a single virion [89] and larger ones (r ∼ 300 nm) capable of creating ten or more [87]. Given ∼1000 M proteins with a radius of 2.5 nm in a single virion [13], and assuming a spherically shaped compartment, leads to an M protein area fraction lower bound of ∼0.17 independent of compartment size, well within our predicted density range.
Although smaller apparent clusters are visible in AFM images at lower densities, these are unlikely to represent stable assembly intermediates. At higher densities, cluster coalescence may dominate, potentially disrupting the controlled assembly required for budding. However, given typical conditions inside cells, membranes are free to move and populated with numerous viral and host components. As such, curvature induction serves as a potential way to prevent this coalescence, while also introducing a membrane-mediated attraction between proteins capable of increasing , and lowering the critical density and interaction range thresholds needed for assembly along a flat membrane [68,94,95]. Furthermore, additional viral components are also capable of lowering these critical thresholds, with E and M proteins known to colocalize [15,31,96]. Similarly, interactions between M and the RNA–N protein complex could prompt local M protein enrichment along the ERGIC surface, potentially lowering both thresholds [13,28,30,83]. While the overall clustering behavior remains robust for early budding stages, changes in membrane composition, tension, or temperature are also capable of modulating the critical density and interaction energy thresholds. Beyond shifting critical values, these behaviors play an even more important role at later budding stages.
Earlier models of SARS-CoV-2 assembly did not distinguish between the short and long forms of the M protein. Our results demonstrate that the short form alone can self-associate into clusters without other structural proteins. A plausible scenario is that assembly and budding begins with short-form cluster formation, followed by conformational transitions to the long form. These conformational transitions could be prompted by the oligomerization energy driving assembly, interactions with other structural proteins, or lipids [13,28,97]. Remaining short-form M proteins could localize near the bud neck, where their thinning effect facilitates membrane scission and virion release. The presence of such conformational and clustering-related energy barriers may also serve as kinetic control points, regulating the rate of assembly and budding. By modulating how quickly M proteins transition between conformations or coalesce into stable clusters, these barriers could fine-tune the timescale of virion formation, ensuring coordinated recruitment of other structural components.
Further work across multiple scales is needed to refine these conclusions. The estimated interaction energies—and particularly the contribution from direct M–M binding—can be validated using all-atom MD of multiple M proteins embedded in membranes, along with potential of mean force calculations. Such simulations would also allow for conformation-dependent interaction energies to be determined. Incorporating these results into continuum models that include curvature-dependent terms [64–68] would provide a more complete picture of how M proteins coordinate with other structural components to drive budding.
Our findings highlight that the short-form M protein alone can overcome the energetic bottleneck for initiating assembly and generate stable clusters through strong direct interactions. These insights identify quantitative thresholds—both in protein density and binding energy—that define the onset of viral assembly. From a therapeutic perspective, targeting these parameters could either provide new strategies for hindering SARS-CoV-2 replication, or explain previous ones [40,41]. For example, reducing M–M binding energy through mutation or chemical interference below the critical threshold, or lowering the local M density below the required range, could effectively suppress assembly. Ultimately, a deeper understanding of M protein clustering and its energetic landscape provides a foundation for exploring novel antiviral strategies and offers broader implications for other enveloped viruses.
Supporting information
S1 Appendix. Nonlinear form of continuum model and conversion to linearity.
https://doi.org/10.1371/journal.pcbi.1014229.s001
(PDF)
S2 Appendix. Approximating the contribution of line tension in effective interaction energy from a discrete protein lattice to a continuum.
https://doi.org/10.1371/journal.pcbi.1014229.s002
(PDF)
S1 Fig. All-atom molecular dynamics atom counts (a) and simulation box size (b).
(a) Since the leaflets are symmetric, the number of molecules for a corresponding lipid type in a leaflet is half the system-wide value, adding to 1000 lipid molecules per leaflet. (b) After a quick change in the length along each axis within the first few nanoseconds, the box remains stable throughout the simulation.
https://doi.org/10.1371/journal.pcbi.1014229.s003
(TIF)
S2 Fig. Schematic of a discrete protein lattice with M proteins (a) far apart and (b) within range of nearest neighbor interactions.
The discrete protein lattice, with distance between lattice sites equivalent to the approximate width of the protein a, shows the two prominent types of site-site interactions when two proteins are not nearest neighbors: and
. (b) As these proteins become nearest neighbors,
encompasses direct interactions between proteins. Converting this representation for a system of randomly distributed proteins to a continuum results in the described continuum model. Created with BioRender.com (https://biorender.com/gxjhs1k, https://biorender.com/bkbr3e1).
https://doi.org/10.1371/journal.pcbi.1014229.s004
(TIF)
S3 Fig. Process for calculating maximum wavevector and growth rate from numerical simulations.
(a) Power spectra are shown for the three images displayed in Fig 4a, where the colorbar displays power spectrum density (PSD) dependent on wavelength per pixel. The radius of the prominent ring is the maximum wavevector. (b) After radially binning from the center of the spectra and averaging the PSD of non-unique radii, the radial profile as a function of the wavevector is shown for each effective interaction energy (wavevector is times wavelength per pixel). Blue dots represent the average PSD for every distance from the center, while the black curve shows the corresponding gaussian fit with the maximum wavevector shown as the dotted orange line. (c) The square root of the PSD fit maxima for each measured time is shown with the dotted blue line, while the exponential growth fit is shown in red. The growth rate used for each of these plots is the corresponding maximum growth rate.
https://doi.org/10.1371/journal.pcbi.1014229.s005
(TIF)
S4 Fig. Thresholding and finding nearest neighbor distance for simulated cluster formation.
(a) Plot of the thresholded simulations at the cut-off time shown in Fig 4b, where regions with protein are shown in yellow and cluster centroids are shown in red. (b) Kernel density estimate for each of the three images in black ( kBT), purple (
kBT), and green (
kBT), where the dotted line displays the nearest neighbor distance in which there is a maximum (d). Each of these distances are shown as orange dots in Fig 4d.
https://doi.org/10.1371/journal.pcbi.1014229.s006
(TIF)
S5 Fig. Evolution of (a) simulated nearest neighbor distance, (b) number of clusters, (c) and variance for every simulation performed at
.
Each replicate is shown as a dotted line of variable color, where the corresponding average of all replicates is shown as a black line for (a) and (b). Vertical cyan lines represent the cut-off time calculated by finding the time of minimal slope for the average nearest neighbor evolution curve. This time changes for each interaction energy, where t8 = 0.193 s, t8.25 = 0.13265 s, t9 = .0969 s, t9.25 = .09668 s, and t10.25 = .01863 s. The average value in (a) and (b) at the cut-off time is displayed with an orange dot. For (a), the horizontal cyan line represents the analytically predicted nearest neighbor distance. In (c), the slight fluctuations or bumps in density variance are a result of the gradually decreasing number of clusters stemming from Ostwald ripening shown in (b) for each interaction energy. None of these quantities were gathered until .
https://doi.org/10.1371/journal.pcbi.1014229.s007
(TIF)
S6 Fig. Using a total variation filter to reduce noise of raw AFM data while maintaining large protein clusters.
(a) The raw AFM data at the three different protein area coverages shown in Fig 2d. (b)-(c) This AFM data was filtered with total variation denoising using split-Bregman optimization, with a denoising weght of one and a tolerance of 1 × 10−5. Height values for the cross-section shown in (a) and (b) as a dotted blue line are displayed in (c) for comparison between the raw and filtered data.
https://doi.org/10.1371/journal.pcbi.1014229.s008
(TIF)
S7 Fig. Thresholded and filtered AFM images for the three different protein area coverages.
Height thresholds for the two highest area coverages were found using Otsu’s method, where the threshold can be found above each image in (a). Due to the low protein density in the leftmost image, its threshold was chosen to match that of the middle panel. (b) Protein area coverage is shown as a function of threshold height, where the used value is shown as a dotted vertical blue line. As the filter is applied, the curve gets closer to a step function, as expected for a transmembrane protein. The dotted grey line represents the higher bound on area coverage when considering AFM vertical resolution, while the dotted red line signifies the reverse, leading to the error bars in (a). Insets highlighting the change in area coverage near the chosen threshold heights are displayed for each system. Lastly, (c) displays a histogram of filtered image heights, with the threshold as a dotted black line. Note that Otsu’s method loses effectiveness as the data becomes more Gaussian.
https://doi.org/10.1371/journal.pcbi.1014229.s009
(TIF)
S8 Fig. Cluster centroid density for each protein area coverage.
The top panel represents filtered and thresholded AFM data for each area coverage with cluster centroids shown as red dots. A cluster is defined as five or more adjacent pixels with height values greater than the threshold. Two-dimensional kernel density estimates of centroids are shown in the panel below, where higher density is shown in pink and lower is shown in blue. Of the three area coverages, only the highest has a close to constant density, showing isotropic cluster distribution.
https://doi.org/10.1371/journal.pcbi.1014229.s010
(TIF)
S9 Fig. The short form remains stable while embedded in an ERGIC-like membrane throughout the 2
all-atom MD simulation.
(a) Root mean square deviation (RMSD) defines how much a protein structure has changed relative to its initial position. It is defined as , where N is the total number of
atoms,
is the position for the i’th
atom at time t, and ri,0 is the initial position of the corresponding atom. (b) Root mean square fluctuation (RMSF) is shown for both short form chains, where the N-terminal starts at residue 9 and the C-terminal ends at residue 204. RMSF is the averaged RMSD over time for each residue. (c) Total radius of gyration (Rg) of short form
atoms is shown in red. With little change throughout the simulation in (a) and (c), and reasonable RMSF for each residue in (b), the protein remains stable throughout the simulation. All of these quantities were calculated using the appropriate GROMACS command.
https://doi.org/10.1371/journal.pcbi.1014229.s011
(TIF)
S10 Fig. Later time Ostwald ripening shown through density evolution from (a) linearity to (b) end of diffusion limited growth dominated regime to (c) final simulation frame.
(a) Frames at the end of linearity from Fig 4a. (b) Frames before Ostwald ripening dominates from Fig 4b. (c) Final simulation frames for corresponding interaction energies. Slight dissipation of clusters can be seen in each final frame, where smallest clusters in (b) or weaker maxima in (a) dissipate. Times are shown at the top of each image.
https://doi.org/10.1371/journal.pcbi.1014229.s012
(TIF)
S11 Fig. Maximum wavevectors and maximum growth rates during linearity agree with predictions for higher density simulations.
Each simulation was performed at kBT, with five replicates for each of the three different initial protein densities:
(blue),
(black), and
(green). Filled in circles correspond to replicates shown in Fig 5. Analytical curves from Eq. 25 in S1 Appendix, where the color gradient represents different effective interaction energies, are shown for each initial density fraction. Red stars are the expected analytic value for the maximum wavevector and maximum growth rate dependent on
. The lowest curve corresponds to
at a variety of effective interaction energies, while the middle applies to
, and the upper
.
https://doi.org/10.1371/journal.pcbi.1014229.s013
(TIF)
S1 Video. Protein density evolution for
and
kBT.
https://doi.org/10.1371/journal.pcbi.1014229.s014
(MP4)
S2 Video. Protein density evolution for
and
kBT.
https://doi.org/10.1371/journal.pcbi.1014229.s015
(MP4)
S3 Video. Protein density evolution for
and
kBT.
https://doi.org/10.1371/journal.pcbi.1014229.s016
(MP4)
S4 Video. Protein density evolution for
and
kBT.
https://doi.org/10.1371/journal.pcbi.1014229.s017
(MP4)
S5 Video. Protein density evolution for
and
kBT.
https://doi.org/10.1371/journal.pcbi.1014229.s018
(MP4)
S6 Video. Protein density evolution for
and
kBT.
https://doi.org/10.1371/journal.pcbi.1014229.s019
(MP4)
S7 Video. Protein density evolution for
and
kBT.
https://doi.org/10.1371/journal.pcbi.1014229.s020
(MP4)
References
- 1. Bruinsma RF, Wuite GJL, Roos WH. Physics of viral dynamics. Nat Rev Phys. 2021;3(2):76–91. pmid:33728406
- 2. Zandi R, Dragnea B, Travesset A, Podgornik R. On virus growth and form. Physics Reports. 2020;847:1–102.
- 3. Elrad OM, Hagan MF. Mechanisms of size control and polymorphism in viral capsid assembly. Nano Lett. 2008;8(11):3850–7. pmid:18950240
- 4. Perlmutter JD, Perkett MR, Hagan MF. Pathways for virus assembly around nucleic acids. J Mol Biol. 2014;426(18):3148–65. pmid:25036288
- 5. Li S, Tresset G, Zandi R. From disorder to icosahedral symmetry: How conformation-switching subunits enable RNA virus assembly. Sci Adv. 2025;11(39):eady7241. pmid:40991695
- 6. Li S, Erdemci-Tandogan G, van der Schoot P, Zandi R. The effect of RNA stiffness on the self-assembly of virus particles. J Phys Condens Matter. 2018;30(4):044002. pmid:29235442
- 7. He L, Porterfield Z, van der Schoot P, Zlotnick A, Dragnea B. Hepatitis virus capsid polymorph stability depends on encapsulated cargo size. ACS Nano. 2013;7(10):8447–54. pmid:24010404
- 8. Sun J, DuFort C, Daniel M-C, Murali A, Chen C, Gopinath K, et al. Core-controlled polymorphism in virus-like particles. Proc Natl Acad Sci U S A. 2007;104(4):1354–9. pmid:17227841
- 9.
Cadena-López D, Villalba-Nieto M, Campos-Melendez F, Rosales-Mendoza S, Comas-Garcia M. Assembly of Coronaviruses and CoV-Like-Particles. Physical Virology: From the State-of-the-Art Research to the Future of Applied Virology. Springer. 2023. 141–60.
- 10. V’kovski P, Kratzel A, Steiner S, Stalder H, Thiel V. Coronavirus biology and replication: implications for SARS-CoV-2. Nat Rev Microbiol. 2021;19(3):155–70. pmid:33116300
- 11. Lim YX, Ng YL, Tam JP, Liu DX. Human Coronaviruses: A Review of Virus-Host Interactions. Diseases. 2016;4(3):26. pmid:28933406
- 12. Neuman BW, Adair BD, Yoshioka C, Quispe JD, Orca G, Kuhn P, et al. Supramolecular architecture of severe acute respiratory syndrome coronavirus revealed by electron cryomicroscopy. J Virol. 2006;80(16):7918–28. pmid:16873249
- 13. Neuman BW, Kiss G, Kunding AH, Bhella D, Baksh MF, Connelly S, et al. A structural analysis of M protein in coronavirus assembly and morphology. J Struct Biol. 2011;174(1):11–22. pmid:21130884
- 14. Neuman BW, Buchmeier MJ. Supramolecular Architecture of the Coronavirus Particle. Adv Virus Res. 2016;96:1–27. pmid:27712621
- 15. Schoeman D, Fielding BC. Coronavirus envelope protein: current knowledge. Virol J. 2019;16(1):69. pmid:31133031
- 16. Sarkar M, Saha S. Structural insight into the role of novel SARS-CoV-2 E protein: A potential target for vaccine development and other therapeutic strategies. PLoS One. 2020;15(8):e0237300. pmid:32785274
- 17. Chang C, Hou M-H, Chang C-F, Hsiao C-D, Huang T. The SARS coronavirus nucleocapsid protein--forms and functions. Antiviral Res. 2014;103:39–50. pmid:24418573
- 18. Forsythe HM, Rodriguez Galvan J, Yu Z, Pinckney S, Reardon P, Cooley RB, et al. Multivalent binding of the partially disordered SARS-CoV-2 nucleocapsid phosphoprotein dimer to RNA. Biophys J. 2021;120(14):2890–901. pmid:33794152
- 19. Iserman C, Roden CA, Boerneke MA, Sealfon RSG, McLaughlin GA, Jungreis I. Genomic RNA elements drive phase separation of the SARS-CoV-2 nucleocapsid. Molecular Cell. 2020;80:1078-91.e6.
- 20. Adhikari P, Li N, Shin M, Steinmetz NF, Twarock R, Podgornik R, et al. Intra- and intermolecular atomic-scale interactions in the receptor binding domain of SARS-CoV-2 spike protein: implication for ACE2 receptor binding. Phys Chem Chem Phys. 2020;22(33):18272–83. pmid:32756685
- 21. Boson B, Legros V, Zhou B, Siret E, Mathieu C, Cosset F-L, et al. The SARS-CoV-2 envelope and membrane proteins modulate maturation and retention of the spike protein, allowing assembly of virus-like particles. J Biol Chem. 2021;296:100111. pmid:33229438
- 22. Siu YL, Teoh KT, Lo J, Chan CM, Kien F, Escriou N, et al. The M, E, and N structural proteins of the severe acute respiratory syndrome coronavirus are required for efficient assembly, trafficking, and release of virus-like particles. J Virol. 2008;82(22):11318–30. pmid:18753196
- 23. Xu R, Shi M, Li J, Song P, Li N. Construction of SARS-CoV-2 Virus-Like Particles by Mammalian Expression System. Front Bioeng Biotechnol. 2020;8:862. pmid:32850726
- 24. Plescia CB, David EA, Patra D, Sengupta R, Amiar S, Su Y, et al. SARS-CoV-2 viral budding and entry can be modeled using BSL-2 level virus-like particles. J Biol Chem. 2021;296:100103. pmid:33214224
- 25. de Haan CA, Vennema H, Rottier PJ. Assembly of the coronavirus envelope: homotypic interactions between the M proteins. J Virol. 2000;74(11):4967–78. pmid:10799570
- 26. Arndt AL, Larson BJ, Hogue BG. A conserved domain in the coronavirus membrane protein tail is important for virus assembly. J Virol. 2010;84(21):11418–28. pmid:20719948
- 27. de Haan CA, Smeets M, Vernooij F, Vennema H, Rottier PJ. Mapping of the coronavirus membrane protein domains involved in interaction with the spike protein. J Virol. 1999;73(9):7441–52. pmid:10438834
- 28. Zhang Z, Nomura N, Muramoto Y, Ekimoto T, Uemura T, Liu K, et al. Structure of SARS-CoV-2 membrane protein essential for virus assembly. Nat Commun. 2022;13(1):4399. pmid:35931673
- 29. Cubuk J, Alston JJ, Incicco JJ, Singh S, Stuchell-Brereton MD, Ward MD, et al. The SARS-CoV-2 nucleocapsid protein is dynamic, disordered, and phase separates with RNA. Nat Commun. 2021;12(1):1936. pmid:33782395
- 30. Lu S, Ye Q, Singh D, Cao Y, Diedrich JK, Yates JR 3rd, et al. The SARS-CoV-2 nucleocapsid phosphoprotein forms mutually exclusive condensates with RNA and the membrane-associated M protein. Nat Commun. 2021;12(1):502. pmid:33479198
- 31. Lim KP, Liu DX. The missing link in coronavirus assembly. Retention of the avian coronavirus infectious bronchitis virus envelope protein in the pre-Golgi compartments and physical interaction between the envelope and membrane proteins. J Biol Chem. 2001;276(20):17515–23. pmid:11278557
- 32. Saraste J, Prydz K. Assembly and Cellular Exit of Coronaviruses: Hijacking an Unconventional Secretory Pathway from the Pre-Golgi Intermediate Compartment via the Golgi Ribbon to the Extracellular Space. Cells. 2021;10(3):503. pmid:33652973
- 33. Klein S, Cortese M, Winter SL, Wachsmuth-Melm M, Neufeldt CJ, Cerikan B, et al. SARS-CoV-2 structure and replication characterized by in situ cryo-electron tomography. Nat Commun. 2020;11(1):5885. pmid:33208793
- 34. Calder LJ, Calcraft T, Hussain S, Harvey R, Rosenthal PB. Electron cryotomography of SARS-CoV-2 virions reveals cylinder-shaped particles with a double layer RNP assembly. Commun Biol. 2022;5(1):1210. pmid:36357779
- 35. Dolan KA, Dutta M, Kern DM, Kotecha A, Voth GA, Brohawn SG. Structure of SARS-CoV-2 M protein in lipid nanodiscs. Elife. 2022;11:e81702. pmid:36264056
- 36. Zhang Y, Anbir S, McTiernan J, Li S, Worcester M, Mishra P, et al. Synthesis, insertion, and characterization of SARS-CoV-2 membrane protein within lipid bilayers. Sci Adv. 2024;10(9):eadm7030. pmid:38416838
- 37. Kahraman O, Koch PD, Klug WS, Haselwandter CA. Bilayer-thickness-mediated interactions between integral membrane proteins. Phys Rev E. 2016;93:042410. pmid:27176332
- 38. Haselwandter CA, Wingreen NS. The role of membrane-mediated interactions in the assembly and architecture of chemoreceptor lattices. PLoS Comput Biol. 2014;10(12):e1003932. pmid:25503274
- 39. Jiang Y, Thienpont B, Sapuru V, Hite RK, Dittman JS, Sturgis JN, et al. Membrane-mediated protein interactions drive membrane protein organization. Nat Commun. 2022;13(1):7373. pmid:36450733
- 40. Van Damme E, Abeywickrema P, Yin Y, Xie J, Jacobs S, Mann MK, et al. A small-molecule SARS-CoV-2 inhibitor targeting the membrane protein. Nature. 2025;640(8058):506–13. pmid:40140563
- 41. Laporte M, Jochmans D, Bardiot D, Desmarets L, Debski-Antoniak OJ, Mizzon G, et al. A coronavirus assembly inhibitor that targets the viral membrane protein. Nature. 2025;640(8058):514–23. pmid:40140569
- 42. Berry J, Brangwynne CP, Haataja M. Physical principles of intracellular organization via active and passive phase transitions. Rep Prog Phys. 2018;81(4):046601. pmid:29313527
- 43. Mao S, Kuldinow D, Haataja MP, Košmrlj A. Phase behavior and morphology of multicomponent liquid mixtures. Soft Matter. 2019;15(6):1297–311. pmid:30506078
- 44.
Safran S. Statistical thermodynamics of surfaces, interfaces, and membranes. CRC Press. 2018.
- 45.
Bauer P, Hess B, Lindahl E. GROMACS 2022.3 source code. Geneva, Switzerland: Zenodo. 2022.
- 46. Huang J, Rauscher S, Nawrocki G, Ran T, Feig M, de Groot BL, et al. CHARMM36m: an improved force field for folded and intrinsically disordered proteins. Nat Methods. 2017;14(1):71–3. pmid:27819658
- 47. Jo S, Kim T, Im W. Automated builder and database of protein/membrane complexes for molecular dynamics simulations. PLoS One. 2007;2(9):e880. pmid:17849009
- 48. Jo S, Kim T, Iyer VG, Im W. CHARMM-GUI: a web-based graphical user interface for CHARMM. J Comput Chem. 2008;29(11):1859–65. pmid:18351591
- 49. Brooks BR, Brooks CL 3rd, Mackerell AD Jr, Nilsson L, Petrella RJ, Roux B, et al. CHARMM: the biomolecular simulation program. J Comput Chem. 2009;30(10):1545–614. pmid:19444816
- 50. Jo S, Lim JB, Klauda JB, Im W. CHARMM-GUI Membrane Builder for mixed bilayers and its application to yeast membranes. Biophys J. 2009;97(1):50–8. pmid:19580743
- 51. Wu EL, Cheng X, Jo S, Rui H, Song KC, Dávila-Contreras EM, et al. CHARMM-GUI Membrane Builder toward realistic biological membrane simulations. J Comput Chem. 2014;35(27):1997–2004. pmid:25130509
- 52. Lee J, Cheng X, Swails JM, Yeom MS, Eastman PK, Lemkul JA, et al. CHARMM-GUI Input Generator for NAMD, GROMACS, AMBER, OpenMM, and CHARMM/OpenMM Simulations Using the CHARMM36 Additive Force Field. J Chem Theory Comput. 2016;12(1):405–13. pmid:26631602
- 53. Lee J, Patel DS, Ståhle J, Park S-J, Kern NR, Kim S, et al. CHARMM-GUI Membrane Builder for Complex Biological Membrane Simulations with Glycolipids and Lipoglycans. J Chem Theory Comput. 2019;15(1):775–86. pmid:30525595
- 54. Lee J, Hitzenberger M, Rieger M, Kern NR, Zacharias M, Im W. CHARMM-GUI supports the Amber force fields. J Chem Phys. 2020;153(3):035103. pmid:32716185
- 55. Park S, Choi YK, Kim S, Lee J, Im W. CHARMM-GUI Membrane Builder for Lipid Nanoparticles with Ionizable Cationic Lipids and PEGylated Lipids. J Chem Inf Model. 2021;61(10):5192–202. pmid:34546048
- 56. Hoover W. Canonical dynamics: Equilibrium phase-space distributions. Phys Rev A Gen Phys. 1985;31(3):1695–7. pmid:9895674
- 57. Nosé S. A molecular dynamics method for simulations in the canonical ensemble. Molecular Physics. 1984;52(2):255–68.
- 58. Nosé S, Klein ML. Constant pressure molecular dynamics for molecular systems. Molecular Physics. 1983;50(5):1055–76.
- 59. Parrinello M, Rahman A. Polymorphic transitions in single crystals: A new molecular dynamics method. Journal of Applied Physics. 1981;52(12):7182–90.
- 60. Meng EC, Goddard TD, Pettersen EF, Couch GS, Pearson ZJ, Morris JH. UCSF ChimeraX: Tools for structure building and analysis. Protein Science. 2023;32(11):e4792.
- 61.
Gowers RJ, Linke M, Barnoud J, Reddy TJE, Melo MN, Seyler SL. MDAnalysis: a Python package for the rapid analysis of molecular dynamics simulations. Los Alamos, NM (United States): Los Alamos National lab. 2019.
- 62. Michaud-Agrawal N, Denning EJ, Woolf TB, Beckstein O. MDAnalysis: a toolkit for the analysis of molecular dynamics simulations. J Comput Chem. 2011;32(10):2319–27. pmid:21500218
- 63. Kuzmin PI, Akimov SA, Chizmadzhev YA, Zimmerberg J, Cohen FS. Line tension and interaction energies of membrane rafts calculated from lipid splay and tilt. Biophys J. 2005;88(2):1120–33. pmid:15542550
- 64. Gov NS. Guided by curvature: shaping cells by coupling curved membrane proteins and cytoskeletal forces. Philos Trans R Soc Lond B Biol Sci. 2018;373(1747):20170115. pmid:29632267
- 65. Mahapatra A, Saintillan D, Rangamani P. Curvature-driven feedback on aggregation-diffusion of proteins in lipid bilayers. Soft Matter. 2021;17(36):8373–86. pmid:34550131
- 66. Tozzi C, Walani N, Arroyo M. Out-of-equilibrium mechanochemistry and self-organization of fluid membranes interacting with curved proteins. New J Phys. 2019;21(9):093004.
- 67. Veksler A, Gov NS. Phase transitions of the coupled membrane-cytoskeleton modify cellular shape. Biophys J. 2007;93(11):3798–810. pmid:17704150
- 68. Winter A, Liu Y, Ziepke A, Dadunashvili G, Frey E. Phase separation on deformable membranes: Interplay of mechanical coupling and dynamic surface geometry. Phys Rev E. 2025;111(4–1):044405. pmid:40411015
- 69. Rangamani P, Mandadap KK, Oster G. Protein-induced membrane curvature alters local membrane tension. Biophys J. 2014;107(3):751–62. pmid:25099814
- 70. Gupta S, Ashkar R. The dynamic face of lipid membranes. Soft Matter. 2021;17(29):6910–28. pmid:34235519
- 71. Kumarage T, Morris NB, Ashkar R. The effects of molecular and nanoscopic additives on phospholipid membranes. Front Phys. 2023;11.
- 72. Flory PJ. Thermodynamics of High Polymer Solutions. The Journal of Chemical Physics. 1942;10(1):51–61.
- 73. Huggins ML. Some Properties of Solutions of Long-chain Compounds. J Phys Chem. 1942;46(1):151–8.
- 74. McTiernan J, Zhang Y, Li S, Kuhlman TE, Mohideen U, Colvin ME, et al. Clustering of SARS-CoV-2 membrane proteins in lipid bilayer membranes. Zenodo; 2026. Available from:
- 75. Mandala VS, McKay MJ, Shcherbakov AA, Dregni AJ, Kolocouris A, Hong M. Structure and drug binding of the SARS-CoV-2 envelope protein transmembrane domain in lipid bilayers. Nat Struct Mol Biol. 2020;27(12):1202–8. pmid:33177698
- 76. Chen S, Xu J, Liu M, Rao ALN, Zandi R, Gill SS, et al. Investigation of HIV-1 Gag binding with RNAs and lipids using Atomic Force Microscopy. PLoS One. 2020;15(2):e0228036. pmid:32015565
- 77. Luo Y, Andersson SB. A comparison of reconstruction methods for undersampled atomic force microscopy images. Nanotechnology. 2015;26(50):505703. pmid:26585418
- 78.
Chen A, Bertozzi AL, Ashby PD, Getreuer P, Lou Y. Enhancement and Recovery in Atomic Force Microscopy Images. Applied and Numerical Harmonic Analysis. Birkhäuser Boston. 2012. 311–32. https://doi.org/10.1007/978-0-8176-8379-5_16
- 79. Otsu N. A threshold selection method from gray-level histograms. Automatica. 1975;11(285–296):23–7.
- 80. Moerman P, van der Schoot P, Kegel W. Kinetics versus Thermodynamics in Virus Capsid Polymorphism. J Phys Chem B. 2016;120(26):6003–9. pmid:27027925
- 81. Timmermans SBPE, Ramezani A, Montalvo T, Nguyen M, van der Schoot P, van Hest JCM, et al. The Dynamics of Viruslike Capsid Assembly and Disassembly. J Am Chem Soc. 2022;144(28):12608–12. pmid:35792573
- 82. Mann MK, Yin Y, Marsili S, Xie J, Doijen J, Miller R, et al. Structural insights into MERS and SARS coronavirus membrane proteins. Commun Biol. 2025;8(1):1651. pmid:41286109
- 83. Wang X, Yang Y, Sun Z, Zhou X. Crystal structure of the membrane (M) protein from a bat betacoronavirus. PNAS Nexus. 2023;2(2):pgad021. pmid:36874273
- 84. Cao Y, Yang R, Wang W, Jiang S, Yang C, Liu N, et al. Probing the formation, structure and free energy relationships of M protein dimers of SARS-CoV-2. Comput Struct Biotechnol J. 2022;20:573–82. pmid:35047128
- 85. Ortiz Mateu J, Pearson GJ, Rius-Salvador M, Sedighian S, Pavlova A, Alonso-Romero J. SARS-CoV-2 membrane protein biogenesis. bioRxiv. 2026;2026:2026–01.
- 86. Li S, Zandi R. Biophysical Modeling of SARS-CoV-2 Assembly: Genome Condensation and Budding. Viruses. 2022;14(10):2089. pmid:36298645
- 87. Goldsmith CS, Tatti KM, Ksiazek TG, Rollin PE, Comer JA, Lee WW, et al. Ultrastructural characterization of SARS coronavirus. Emerg Infect Dis. 2004;10(2):320–6. pmid:15030705
- 88. Mendonça L, Howe A, Gilchrist JB, Sheng Y, Sun D, Knight ML, et al. Correlative multi-scale cryo-imaging unveils SARS-CoV-2 assembly and egress. Nat Commun. 2021;12(1):4629. pmid:34330917
- 89. Bergner T, Zech F, Hirschenberger M, Stenger S, Sparrer KMJ, Kirchhoff F, et al. Near-Native Visualization of SARS-CoV-2 Induced Membrane Remodeling and Virion Morphogenesis. Viruses. 2022;14(12):2786. pmid:36560790
- 90. Ogando NS, Dalebout TJ, Zevenhoven-Dobbe JC, Limpens RWAL, van der Meer Y, Caly L, et al. SARS-coronavirus-2 replication in Vero E6 cells: replication kinetics, rapid adaptation and cytopathology. J Gen Virol. 2020;101(9):925–40. pmid:32568027
- 91. Peiris JSM, Lai ST, Poon LLM, Guan Y, Yam LYC, Lim W, et al. Coronavirus as a possible cause of severe acute respiratory syndrome. Lancet. 2003;361(9366):1319–25. pmid:12711465
- 92. Zhou P, Yang X-L, Wang X-G, Hu B, Zhang L, Zhang W, et al. A pneumonia outbreak associated with a new coronavirus of probable bat origin. Nature. 2020;579(7798):270–3. pmid:32015507
- 93. Kim JM, Chung YS, Jo HJ, Lee NJ, Kim MS, Woo SH. Identification of coronavirus isolated from a patient in Korea with COVID-19. Osong Public Health and Research Perspectives. 2020;11(1):3.
- 94. Weikl TR. Membrane-Mediated Cooperativity of Proteins. Annu Rev Phys Chem. 2018;69:521–39. pmid:29490203
- 95. McMahon HT, Boucrot E. Membrane curvature at a glance. J Cell Sci. 2015;128(6):1065–70. pmid:25774051
- 96. Corse E, Machamer CE. The cytoplasmic tails of infectious bronchitis virus E and M proteins mediate their interaction. Virology. 2003;312(1):25–34. pmid:12890618
- 97.
Dutta M, Dolan KA, Amiar S, Bass EJ, Sultana R, Voth GA. Direct lipid interactions control SARS-CoV-2 M protein conformational dynamics and virus assembly. bioRxiv. 2024.