Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Computational design of functional random heteropolymers through atomistic simulations

  • Tianyi Jin,

    Roles Data curation, Formal analysis, Funding acquisition, Supervision, Writing – original draft, Writing – review & editing

    Affiliations Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America, Department of Materials Science and Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America

  • Collin S. Lung,

    Roles Data curation, Formal analysis, Writing – original draft, Writing – review & editing

    Affiliation Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America

  • Ting Xu,

    Roles Writing – review & editing

    Affiliations Department of Materials Science and Engineering, University of California, Berkeley, Berkeley, California, United States of America, Department of Chemistry, University of California, Berkeley, Berkeley, California, United States of America, Materials Science Division, Lawrence Berkeley National Laboratory, Berkeley, California, United States of America, Kavli Energy NanoScience Institute, University of California, Berkeley, California, United States of America

  • Connor W. Coley,

    Roles Supervision, Writing – original draft, Writing – review & editing

    Affiliations Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America, Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America

  • Alfredo Alexander-Katz

    Roles Funding acquisition, Supervision, Writing – original draft, Writing – review & editing

    aalexand@mit.edu

    Affiliation Department of Materials Science and Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America

Abstract

Random heteropolymers (RHPs) are emerging single-chain nanoparticles with great potential in protein mimicry, yet a systematic understanding of how chemical composition and monomer structures govern their structure, dynamics, and hydration remains limited. Using atomistic molecular dynamics simulations, we examine how various design parameters, including chain length, backbone architecture, charged monomer concentration, chain-level composition, and side-chain micropolarity influence RHP assembly and hydration behavior. As chain length increases, methacrylate-based RHPs transition from rod-like to random-walk statistics and ultimately collapse into compact globules stabilized by hydrophobic collapse and methacrylate-poly(ethylene glycol) (PEG) interactions. Positively charged monomers follow the Hofmeister series in their hydration. Interestingly, the dimerization results from hydrophobic and PEG-positively charged-monomer interactions, and not from opposite charge interactions. Alternative backbones such as acrylate and (meth)acrylamide display sequence-dependent compactness and dynamics, reflecting greater chemical sensitivity. PEG side-chain length strongly affects solubility and hydration, with shorter side chains making the overall chain more hydrophobic. Also, we show that branching-induced micropolarity modulates local hydration patterns of hydrophobic residues. Overall, these results establish general molecular design principles for tuning the assembly and dynamics of RHPs through compositional and chemical control, providing a foundation for engineering synthetic polymers that mimic the compactness, hydration, and functional adaptability of proteins.

Introduction

Proteins are the functional workhorses of living organisms. By modifying their amino acid sequences, proteins can perform a wide array of tasks, facilitated by natural evolution and selection [1]. The precision and selectivity of proteins are rooted in the well-established sequence-structure-function paradigm [2,3]. To address these challenges, researchers have turned to synthetic alternatives that offer greater resilience, cost-effectiveness, and scalability. Compared to natural proteins, these synthetic alternatives can be designed and manufactured more efficiently while maintaining functional versatility, positioning them as practical platforms for applications where proteins face constraints in stability and large-scale production. The vast diversity of synthetic monomers, exceeding the 20 canonical amino acids, provides opportunities to develop functionalities that surpass those found in nature. A promising synthetic approach involves single-chain nanoparticles (SCNPs), which mimic versatile protein functions including catalysis and sensing [410]. SCNPs are synthesized using intramolecular cross-linking of polymer precursors with reactive pendant groups, enabling them to fold into nanosized, well-defined structures with specific functions. Examples of reactive pendant groups include moieties for metal binding [11,12] and organocatalyst [1315]. Upon cross-linking, polymer scaffolds are accessible to substrates, yielding SCNPs capable of specialized enzyme-mimetic tasks.

Recently, methacrylate-based (MMA-based) RHPs have gained significant interest as a versatile family of SCNPs capable of mimicking multiple protein-like functions. RHPs, often referred to as random or statistical copolymers, consist of two or more monomers randomly distributed along a polymer chain. The hydrophobic MMA backbone, complemented by diverse side chains or pendant groups, collapses into molten globules with complex internal structures [16, 17]. We would like to highlight that in our design, the reactive groups responsible for specific functionalities are not pre-attached as pendant groups to the linear polymer precursors. Instead, the RHPs are engineered to create a tailored microenvironment that can subsequently independently recruit reactive groups, such as heme, after polymerization [18]. This modular approach enhances the versatility of RHPs, enabling the incorporation of diverse functional components without the constraints of predefined pendant group chemistries. Furthermore, using a similar design approach (without the recruitment of reactive groups), MMA-RHPs have demonstrated multiple protein-like functions beyond catalysis [19]. These include binding to proteins and preserving their functions in non-native environments [2023], as well as transporting protons rapidly and selectively across lipid membranes [24,25]. Other functions of MMA-RHPs include the formation of membraneless organelles capable of encapsulating small organic molecules, providing a versatile platform for molecular encapsulation and compartmentalization [26,27], and serving as adhesive hydrogels [28]. The ability of MMA-based RHPs to interact with biomacromolecules, including DNA [23] and proteins [20,23], highlights their potential as delivery vehicles (e.g., copolymer polyplexes [2931]), as polyclonal antibody [32] and for enhancing enzyme functionality in industrial processes such as composting [21,33,34] and other harsh or non-native environments.

Unlike natural proteins, which rely on precise sequences and well-defined three-dimensional structures, RHPs employ a sequence ensemble and thus a structure ensemble to achieve designated functions through specific combinations of monomers and compositions. While RHPs may not achieve the same precision and selectivity as proteins, their collective behavior within an ensemble provides a population-based alternative that serves as a functionally versatile platform [23]. These functionalities are not only designable but also evolvable through the optimization of the physicochemical properties of the polymer, including chain length, monomer chemistry, and composition [23,35,36]. Additionally, functionality can be further refined through the optimization of RHP blend formulations, enabling synergistic interactions and enhanced performance [37].

We have developed a computational protocol to study the assembly of RHPs using atomistic molecular dynamics (MD) simulations in an explicit solvent [38]. This protocol includes an initial collapse in an implicit solvent followed by an annealing cycle in an explicit solvent, making it adaptable to RHPs with varying monomer identities and compositions. Using this approach, we have investigated the molecular fundamentals underlying protein-mimicry properties. MMA-based RHPs, formulated as reported in 2018 [20], collapse into compact globules with limited backbone mobility and highly mobile side chains [38,39]. These characteristics resemble the molten globule state of proteins [40,41]. The globules exhibit soft glass transitions and heterogeneous dynamics [39]. This glassy nature hinders chain reconfiguration in water [42] and at solid-liquid interfaces [43], while the globules can remodel in organic solvents such as DMSO, THF, and hexane [44] and at liquid-liquid interfaces [45]. Our simulations also reveal that these globules possess chemically heterogeneous surfaces, featuring both hydrophobic and hydrophilic groups [38] with a rugged energy landscape[46]. The molecular origin of this heterogeneous surface lies in hydration frustration, wherein hydrophobic groups are hydrated, and polar or hydrophilic groups are dehydrated. This phenomenon mirrors a key feature of protein functionality, including binding and catalysis [47,48]. The hydration frustration arises from the negative Flory-Huggins interaction parameter (χ) between the PMMA backbone and polar/hydrophilic polyethylene glycol (PEG) side chains [49]. Specifically, the PMMA backbone folds into a globule to minimize surface interactions, while the PEG pendant groups partition between the backbone globule and the aqueous solvent, leveraging their favorable interactions with both PMMA and water. Hydrophobic ethylhexyl groups, expected to form a core, are excluded from the center of the globule due to spatial occupancy by PEG moieties. This leads to the unexpected exposure of hydrophobic groups to water. The polarity of these groups plays a critical role in determining their hydration patterns and overall surface behavior. Notably, some protein-mimetic properties of RHPs, such as compactness, hydration, and chaperoning, are sequence-insensitive. This lack of dependence on precise sequence control suggests that one-pot synthesis is sufficient to achieve convergent functionalities in these RHPs [50]. These RHPs can laterally bind to β-barrel membrane proteins and stabilize them by interacting with their loop regions and reducing structural fluctuations [51]. This finding underscores the practicality and scalability of RHPs as synthetic platforms for mimicking protein-like behaviors [50].

In this work, we extend our previous studies to examine how subtle chemical variations in backbone architecture and side-chain chemistry influence the assembly and dynamics of RHPs in water. We specifically focus on the effects of chain length, the incorporation of positively charged monomers, and the dimerization behavior of MMA-based RHPs on hydration patterns that are critical to their protein-like properties. In addition, we investigate how backbone architecture and chain-level composition contribute to sequence-sensitive behaviors. Finally, we explore the role of side-chain micropolarity arising from variations in poly(ethylene glycol) length and the degree of branching, which strongly affects hydration through nuanced differences in partial charge distribution. We find that MMA-based RHPs exhibit distinct scaling and morphological behaviors as chain length increases. The incorporation of positively charged monomers in single-chain RHPs and RHP dimers has negligible electrostatic effects due to the glassy nature of the PMMA backbone. Hydration of these monomers follows the Hofmeister series, with even hydrophobic charges remaining well-hydrated, underscoring the difficulty of burying charges in these systems. MMA-based RHPs exhibit sequence-insensitive compactness and form frustrated globules, while peptoid- and peptide-based RHPs also exhibit sequence-insensitive compactness but form core-shell globules. In contrast, MA-, MAn-, and MMAn-based RHPs display sequence-sensitive compactness and form core-shell globules. RHP dynamics are similarly sensitive to sequence. In MAn-based RHPs, through a two-step chain-level composition evolution, hydration patterns are determined by the synergistic effect of chain-level composition and sequence identity, whereas compactness is dictated solely by chain-level composition. Furthermore, the length of the PEG pendant group strongly influences solubility, with shorter PEG chains exhibiting reduced hydration and increased hydrophobicity. The micropolarity difference in the side chains [e.g., linear vs. branched alkyl chain, and poly(propylene glycol) vs. poly(ethylene glycol)] is key to their hydration. Together, these findings highlight the tunable nature of RHPs through adjustments in monomer chemistry and composition. MMA-based backbones emerge as a robust platform for designing sequence-insensitive, compact globules, offering significant potential for applications as protein mimetics and beyond.

Materials and methods

System setup

Sequence ensembles with a degree of polymerization (DP) of 100 or 200 are generated using the Composition Drift Program [52,53], targeting a composition of 50% methyl methacrylate (MMA), 25% oligo(ethylene glycol) methacrylate (OEGMA) with nine PEG repeating units terminated with a methyl group, 20% 2-ethylhexyl methacrylate (EHMA), and 5% 3-sulfopropyl methacrylate (SPMA). SPMA is fully ionized as a strong acid in the experimentally relevant pH conditions that are close to physiological levels. From the generated sequence ensemble, ten sequences are randomly selected. The selected set of sequences has been shown to adequately represent the sequence space [50]. The sequences of 20mer and 50mer are selected from our previous work [39]. All sequences are shown in S1 Fig in S1 File. In systems containing positively charged monomers, 5% of the OEGMA monomers are randomly replaced with these positively charged monomers. Due to the stochastic nature of sequence generation, the overall composition of ten sequences and the chain-level compositions of individual sequence may slightly differ from the targeted composition. Chirality is assigned randomly when required to create a racemic mixture.

Two-component copolymer sequences are randomly generated. Two sequences, one with the smallest (SEQ1) and the other with the largest (SEQ2) radius of gyration (Rg) of backbone atoms, are chosen as the starting points for chain-level composition evolution. These sequences are subsequently mutated in two steps by random substitution to match the chain-level composition of the other sequence.

The backbone architectures include poly(methyl methacrylate) (PMMA), poly(methyl acrylate) (PMA), poly(methyl acrylamide) (PMAn), poly(methyl methacrylamide) (PMMAn), peptide, and peptoid. The selected backbones span a broad range of hydrophobicity, solubility, and flexibility [54]. Peptide and peptoid backbones are included due to their established relevance in biomimetic design, while MA and MMA provide chemically distinct, experimentally accessible synthetic analogs that allow us to explore the influence of backbone chemistry on polymer behavior.

In summary, each system is defined by N = #sequences×#conformations. For 20mer, 50mer, and 100mer four-component MMA-based RHPs, N = 10×10 = 100 simulations are performed, while the 200mer system uses N = 10×1 = 10 simulations. Systems containing positively charged monomers and their dimers are simulated with N = 10×1 = 10. For MA-, MAn-, MMAn-, and peptide-based RHPs, N = 10×10 = 100, and for the peptoid-based RHP, N = 10×1 = 10. To examine the effects of sequence, two representative sequences are selected (SEQ1 and SEQ2) from MA-, MMA-, MAn, MMAn, and peptide-based RHPs, resulting in N = 10×2 = 20 simulations. For the two-component copolymer systems, N = 1×1 = 1 simulation is performed. The mutated sequence sets include N = 10×10 = 100 simulations. Simulations exploring PEG side-chain length variations use N = 10×10 = 100 for MMAn-based RHPs, while systems with octyl or Jeffamine-like side chains use N = 10×1 = 10. The error bars are calculated as the standard deviation across all N simulations.

Molecular dynamics simulations

The General Amber Force Field (GAFF) is used [55]. Partial charges for all monomers are assigned using the restrained electrostatic potential (RESP) method [56], calculated with Gaussian 16 Revision C.01 [57]. This parameterization does not include explicit polarization and relies on Hartree-Fock–level calculations, but it has been extensively validated for homopolymer and copolymer systems in previous studies [38,4245,4951,58,59]. These results support the suitability of this parameterization for accurately capturing the structural and thermodynamic properties of the polymers studied.

Polymer parameterization and monomer assembly into extended chains are performed using AmberTools19 from Amber 18 [60]. Langevin thermostat with a collision frequency of 2 ps-1 is used for temperature control in all simulations [61]. Bond lengths involving hydrogen atoms are constrained using SHAKE algorithm [62]. Isotropic Berendsen barostat with a time constant of 1.0 ps is used for pressure control [63]. The simulations are performed with a 2 fs time step, using the molecular dynamics leapfrog integrator. Electrostatic interactions are calculated using the Particle Mesh Ewald (PME) method with a real-space cutoff of 8.0 Å [64]. The van der Waals interaction cutoff is also set to 8.0 Å.

The simulation protocol follows our previous work [38] and generally consists of two stages: (1) collapse of an initially extended linear chain in an implicit solvent, and (2) subsequent annealing and equilibration in an explicit solvent. The polymer system, prepared using AmberTools 19, first undergoes a brief energy minimization using the steepest descent method, transitioning to the conjugate gradient descent method after 1000 cycles. It is then minimized using the generalized Born/surface area implicit solvent model [65]. The system is then heated to 500 K over 18 ps, maintained at this temperature for 2 ps, and equilibrated at 500 K for 20 ns. The annealing protocol involves cooling the system from 500 K to 300 K over 6 ns, followed by a 74-ns hold at 300 K. No periodic boundary conditions are applied during the implicit solvent simulations. The resulting conformation obtained is explicitly solvated using the SPC/E water model, with potassium or chloride counter ions added to neutralize the system, using monovalent ion parameters from [66]. The choice of the SPC/E water model provides the highest accuracy in reproducing bulk water dynamics and structure compared with other common models such as TIP3P and SPC [67].

The solvated system undergoes a second minimization step, again using the steepest descent method transitioning to the conjugate gradient descent method after 1000 cycles. It is then heated to 300 K over 18 ps and maintained at this temperature for 2 ps in an NVT ensemble. Following this, the system undergoes a 2-ns equilibration at 300 K and 1.0 bar. An NVT annealing cycle is then conducted, consisting of 20 ns at 650 K, 40 ns of cooling from 650 K to 300 K, and a final 20-ns equilibration at 300 K. This annealing process facilitates relaxation of the polymer structure in the explicit solvent. The system is subsequently maintained at 300 K and 1.0 bar for 40 ns, which serves as the production simulation used for analysis. In our previous work, we have shown that the structures are metastable due to the glassy nature of the system, while physicochemical properties such as size and monomer hydration equilibrate within 40 ns [38,49,50]. Periodic boundary conditions are applied in all directions.

For each five-component RHP, a dimer system is examined, consisting of two identical RHP conformations extracted from the final frame of the production run, using PACKMOL [68]. In the initial structures, one pair of oppositely charged monomers from the two chains is intentionally positioned in close proximity to examine specific charge-pair interactions. The same simulation protocol as the single-chain RHP system is applied, except that no annealing runs are performed.

Trajectory and data analysis are performed using AmberTools19 and Python packages (numpy and pytraj), with latter interfacing with Amber’s cpptraj program [69,70]. Data are represented as all data points, the median, the lower and upper quartiles, and whiskers extending within 1.5 times the interquartile range (IQR).

The spatial distance between monomers [R(s)] are calculated for Cα atoms, and the backbone radius of gyration (Rg) are determined based on all backbone atoms. The asphericity is calculated using the formula:

(1)

where I1, I2, and I3 are the principal moments of inertia of the structure. The value of asphericity ranges from 0 (spherical) to 1 (rod-like).

Density is calculated using the mass of the sequence and the estimated volume derived from Rg. The population of water molecules in the first solvation shell (FSS) is determined separately for each monomer and normalized by their maximum values to obtain the normalized first solvation shell (NFSS). The atomic root-mean-square fluctuations (RMSF) of backbone atoms and the standard deviation of the backbone dihedral angles (SD dihedral angle) are calculated over the last 5 ns of the production runs. For example, a 20mer MMA-based RHP contains 20 backbone atoms and 37 backbone dihedral angles.

The sequence sensitivity is assessed based on convex hull analysis described in our previous work [50]. For each conformation, the dimension of monomer hydration, represented by NFSS values, is 73, corresponding to 42 methyl groups, 14 ethylhexyl groups, 14 ethylene glycol groups, and 3 sulfopropyl groups, which are common across all the sequences. The convex hull in the space pf principal components 1 and 2 (PC1 and PC2) is defined as the smallest convex set enclosing those, where each point represents one conformation of the same sequence.

Results and discussions

Effect of chain length on four-component MMA-based RHPs

The starting point of this work builds on the design of four-component biomimetic MMA-based RHPs, as reported in 2018 [20]. These polymers consist of 50% MMA (short or backbone monomer), 25% OEGMA (polar monomer), 20% EHMA (hydrophobic monomer), and 5% SPMA (negatively charged monomer) (Fig 1A). Here, we examine the assembly of RHPs with degrees of polymerization (DP) of 20, 50, 100, and 200, with sequence schematics shown in S1 Fig in S1 File. Previous works have demonstrated that 20mer, 50mer, and 100mer RHPs collapse with distinct metastable states [38,39,42]. The RHP structures are described using the average spatial distance between two backbone monomers, i and j (Fig 1B). Usually, this metric is denoted as R(s), where s is the number of monomers separating monomers, i and j, or s=. In equilibrium melts or globules, chains typically follow R(s) ∝ s1/2 (random walk statistics) [71], and single rod-like segments follow R(s) ∝ s. On a small scale, regardless of DP, all RHPs exhibit rod-like chain conformations (R ∝ s) due to the high characteristic ratio () of PMMA. On a larger scale, 200mer RHPs first plateau and then transition to random walk statistics (R ∝ s1/2). However, this transition from rod-like to random walk at is not observed in 100mer RHPs. The asphericity indicates that 100mer RHPs are nearly isotropic spheres compared to 200mer RHPs (Fig 1C). The plateau at R(s) ≈ 20 Å for 100mer RHPs suggests that these spherical globules have a Rg around 20 Å (by analyzing the snapshots directly, Rg = Å). In contrast, a hypothetical spherical 200mers would have a lower surface area-to-volume ratio (∝ 1/r, where r is the radius of the sphere) and may not sufficiently expose and hydrate all negatively charged SPMA monomers on the globule surface, which have a kosmotropic nature [72]. This limitation promotes an ellipsoidal morphology that increaes surface area, facilitating the hydration of kosmotropic SPMA monomers (Fig 1E, SPMA).

thumbnail
Fig 1. Effect of chain length on four-component MMA-based RHPs.

(A) The chemical structure of MMA-based RHPs, with monomers color-coded as follows: MMA in magenta, OEGMA in blue, EHMA in gray, and SPMA in yellow. Side chains are abbreviated as “M”, “OEG”, “2-EH”, and “3-SP”, respectively. (B) The spatial distance between monomer i and monomer j [R(s), s=] for RHPs with degrees of polymerization (DP) of 20 (N = 100, pink), 50 (N = 100, orange), 100 (N = 100, purple), and 200 (N = 20, navy). Insets show the slopes for the low-s and high-s regimes. Error bars indicate standard deviations across all RHP conformations. (C) Asphericity and (D) density of RHPs across four selected chain lengths. (E) NFSS for each monomer type in RHPs across four chain lengths. Data are represented as all data points, the median, the lower and upper quartiles, and whiskers extending within 1.5 times the interquartile range (IQR).

https://doi.org/10.1371/journal.pone.0343799.g001

For shorter chains (DP = 20 and 50), the sizes are comparable to or smaller than the Kuhn length of PMMA (≈ 15.3 Å) [73], which prevents them from fully collapsing into perfect spherical globules, unlike 100mer RHPs (Fig 1B). The density of RHPs (approximated as polymer mass divided by the volume derived from Rg) initially increases with chain length in the short-chain regime, but decreases at higher DPs. This trend reflects increased monomer-monomer contacts and monomer dehydration in compactified globules (Fig 1E). Monomer-water contacts are replaced by monomer-monomer contacts, primarily between MMA and PEG moiety, due to their favorable Flory-Huggins interaction parameters () [74]. This trend is consistent with our previous work on MMA-based RHPs with less OEGMA and more SPMA contents [42]. For longer chains, density decreases. The plateau of 200mer RHPs near s≈ 50, observed between R ∝ s1 and R ∝ s1/2 regimes, suggests that the morphology of 200mer RHPs is hierarchical. The large-scale ideal chain (R ∝ s1/2) consists of effective “monomers” that correspond approximately to rod-like shortmers (close to 100mer, R ∝ s1). The decrease in density can be attributed to the asphericity of the 200-mer RHPs, which are unable to hydrate all charged residues in a spherical morphology as discussed before. Therefore, the sphericity and compactness of MMA-based RHPs are optimized at a chain length of approximately 100 monomers.

Hydration frustration, characterized by hydrated hydrophobic groups and dehydrated polar/hydrophilic groups, is universally observed in MMA-based 100mer RHPs [49]. The degree of core hydration can be experimentally determined through solvatochromism [16], which evaluates the polarity of the local microenvironment. Small-angle neutron scattering (SANS) can provide the radial density distribution of selectively deuterated monomers and, in principle, resolve the spatial distribution of all monomer types within a polymer globule, thereby offering insights into hydration-related structural organization. Morphologically, these RHPs exhibit heterogeneous, patchy surfaces [38]. Monomer hydration is quantified using the normalized number of water molecules in the first solvation shell (NFSS). Defined as the relative population of water molecules within the first solvation shell (3.4 Åradius) compared to a fully solvated monomer of the same type, NFSS enables direct comparison between chemically diverse building blocks [75]. NFSS values range from 1 (fully hydrated) to 0 (completely dehydrated). This calculation includes all atoms, including hydrogen. On average, MMA, EHMA, and OEGMA monomers become increasingly dehydrated as the chain length increases. SPMA hydration remains consistent but is statistically more dehydrated in 200mer RHPs than in 100mer RHPs (p-value ). The enhanced MMA hydration in 200mer RHPs offsets the dehydration of other monomers types, contributing to increased relative volume and reduced density. Short chains (e.g., 20mer RHPs) remain well-hydrated across all monomer types, particularly EHMA and OEGMA, because they cannot fully compactify into globules. This highlights a critical distinction in terms of monomer hydration frustration: 20mer RHPs, which exhibit semi-frustrated hydrophobic groups, differ fundamentally from longer RHPs that display full hydration frustration [49].

Effect of positively charged monomer identity on five-component MMA-based single-chain RHPs and RHP dimers

To extend the potential functionality of four-component MMA-based RHPs in an experimental setting, we introduce a fifth monomer with a positive charge (R+) (Fig 2A). For example, the monomer 2-(dimethylamino)ethyl methacrylate (DMAEMA), with a side chain, has been incorporated to enable temperature-dependent intermolecular interactions for potential use in liquid-liquid phase separation (LLPS) [76] in an experimental setting, as demonstrated in previous work [23]. The degree of methylation decreases from - to -, resulting in progressively more chaotropic monomers [72]. Other amino acid-based side groups, including guanidinium (-Gdm+, analogous to arginine), imidazolium (-Idm+, analogous to histidine), and ammonium (- or -Amm+, analogous to lysine), are also incorporated along with a relatively hydrophobic group, -. R+ is fully ionized to isolate the effect of its chemical structure rather than its degree of protonation. Some R+ groups, such as imidazole and -, are weak bases; their partially protonated states are not considered in this work. The composition of the system is adjusted from the original four-component systems to include 50% MMA, 20% EHMA, 20% OEGMA, 5% SPMA, and 5% R+ by replacing 5% OEGMA to R+, thereby converting the RHPs into polyampholytes. The sequences and overall compositions are shown in Fig 2A.

thumbnail
Fig 2. Effect of positively charged monomer identity on five-component MMA-based single-chain RHPs and RHP dimers.

(A) The chemical structures of the positively charged monomers (R+) in MMA-based RHPs, along with their corresponding sequences and overall five-component composition. Monomers are color-coded as follows: MMA in magenta, OEGMA in blue, EHMA in gray, SPMA in yellow, and R+ in green. N = 10 simulations are performed for each R+. (B) The spatial distance between monomers i and j [R(s), s=], represented as follows: all monomer pairs (gray), oppositely charged monomers (blue), and like-charged monomers (magenta). Error bars indicate standard deviations across all RHP conformations. NFSS for (C) SPMA and (D) R+ in single-chain RHPs (yellow and green symbols) and RHP dimers (brown and olive symbols). The first entry in (C) shows the NFSS of SPMA in the four-component RHPs presented in Fig 1.

https://doi.org/10.1371/journal.pone.0343799.g002

On a small scale, the scaling behavior of the average spatial distance R(s) between two backbone monomers, i and j, remains consistent with that of four-component RHPs, following R(s) ∝ s for rod-like chain conformations (Fig 2B, gray). The plateau in R(s) occurs around 20 Å, which is similar to that observed in the four-component system. Further analysis of the spatial distance between oppositely charged monomers (blue) and like-charged monomers (magenta) reveals minimal influence of electrostatic attraction or repulsion on either the proximity of oppositely charged monomers or the separation of like-charged monomers. The average spatial distance between charged monomers is unaffected by charge identity, remaining similar to distances between other monomer pairs. This behavior is attributed to the glassy nature of the PMMA backbone [39,58], which hinders chain rearrangement driven by intrachain electrostatic interactions [42,43].

The hydration of SPMA monomers remains consistent across all RHPs, regardless of the chemical structures of R+ (Fig 2C, yellow symbols). This consistency highlights the limited influence of intrachain electrostatic interactions on chain rearrangement. Except for Idm+ and Gdm+, the hydration of R+ monomers follows the Hofmeister series: more kosmotropic charges (“salting out”, such as -) exhibit higher hydration levels than less kosmotropic charges (“salting in”, such as -) [72] (Fig 2D, green symbols). Interestingly, even the relatively hydrophobic charges (-) remain well-hydrated, indicating that burying a charged group through brutal thermodynamic forces alone is challenging due to the low dielectric constant of MMA-based RHPs and their alkyl side chains. Additionally, relatively dehydrated charges tend to form complexes with PEG side chains, analogous to crown ether-type interactions [49,77].

To examine oppositely charged-pair interactions, single-chain systems (yellow and green symbols) are extended to dimer systems (brown and olive symbols), each initialized with two identical single-chain conformations (Fig 2C-D). In these setups, one randomly selected pair of oppositely charged monomers from the two chains is intentionally positioned close to each other to examine charge pair interactions. The hydration of SPMA remains the same in the single-chain and dimer systems, while the hydration of R+ decreases. This suggests that dimerization is not driven by charge pairs between SPMA and R+. The hydration of EHMA and OEGMA monomers (S2 Fig in S1 File) also decreases alongside the hydration of R+, which suggests the dimerization results primarily from hydrophobic contacts as well as PEG-R+ interaction. This observation aligns with unpublished data on the dimerization of four-component RHPs, which indicate that interfacial interactions are primarily dominated by hydrophobic effects (EHMA-EHMA contacts) [78]. Side-chain rearrangement governs the remodeling of individual chains in the dimer system, consistent with findings with RHP adsorption at liquid-solid interface [43] rather than liquid-liquid interface [45].

Effect of backbone architecture and chain-level compositions on RHP compactification, dynamics and hydrogen bond formation

One-pot synthesis through RAFT copolymerization produces an ensemble of pseudorandom sequences influenced by monomer reactivity ratios [79]. While MMA-based RHPs exhibit sequence insensitivity in terms of compactness, monomer hydration, and protein stabilization potential [50], this property does not extend to other backbone architectures such as acrylate (MA), acrylamide (MAn), or methacrylamide (MMAn). Nature, in contrast, has achieved precise sequence-structure-function relationships in biopolymers like proteins, DNA, and RNA. Efforts to replicate such precise sequence control in synthetic polymers include peptoids (N-substituted glycine backbones) [80], among others sequence-defined systems [81,82]. In this study, to examine backbone effects on the assembly, we investigate peptoid and peptide architectures using the same sequence ensemble (Fig 3). Both architectures collapse into compact globules, with backbone Rg of 14.8 ± 0.7 Åfor peptoid- and 14.5 ± 1.1 for peptide-based RHPs. Monomer NFSS indicates a core-shell morphology in both systems, driven by the absence of a negative-χ parameter between PEG side groups and the corresponding backbones. These findings demonstrate that without sophisticated sequence design, peptoid- and peptide-based RHPs collapse into compact, though not hydration-frustrated, globules.

thumbnail
Fig 3. Monomer hydration and backbone Rg of peptoid and peptide backbone architecture.

The chemical structure of (A) peptoid (N = 10, lignt violet) and (B) peptide (N = 100, olive) backbone architectures. NFSS indicates that both backbone architectures adopt a core-shell morphology.

https://doi.org/10.1371/journal.pone.0343799.g003

We expand the scope of backbone architectures to include soluble variants (Fig 4A). Two four-component sequences with distinct chain-level compositions are selected: SEQ1, a relatively more hydrophobic sequence with 44 MMAs (−6), 27 EHMAs (+7), 26 OEGMAs (+1), and 3 SPMAs (−2), and SEQ2, a relatively more charged sequence with 51 MMAs (+1), 14 EHMAs (−7), 27 OEGMAs (+2), and 8 SPMAs (+3). The numbers in parentheses represent the differences between the actual composition and the targeted composition. The sequence schematics are shown in Fig 4B and corresponding chain-level compositions are shown in Fig 7A.

thumbnail
Fig 4. Effect of backbone architecture and chain-level compositions on RHP compactification.

(A) The chemical structures of different backbone architectures in four-component RHPs, with backbones color-coded as follows: acrylate (“MA”) in dark red, methacrylate (“MMA”) in magenta, acrylamide (“MAn”) in purple, methacrylamide (“MMAn”) in blue, and peptide in green. (B) Two selected sequences with different chain-level compositions from the same sequence ensemble: one with the smallest (SEQ1, filled symbols) and the other with the largest (SEQ2, open symbols) Rg of backbone atoms. Side chains are color-coded following Fig 1A as follows: M in magenta, OEG in blue, 2-EH in gray, and 3-SP in yellow. (C) Backbone Rg for the two selected sequences across different backbone architectures (N = 10 for each category). The left entry corresponds to SEQ1, and the right entry corresponds to SEQ2. (D) Backbone Rg of random copolymers with two types of monomers: OEG and 2-EH (left panel), and OEG and M (right panel), plotted against the corresponding composition of non-methyl side chains.

https://doi.org/10.1371/journal.pone.0343799.g004

In MA-, MAn-, and MMAn-based RHPs, SEQ2 (open symbols) adopts a less compact morphology compared to SEQ1 (filled symbols) (Fig 4C). SEQ1 forms compact globules (Rg < 20 Å), characterized by a core-shell morphology. In these structures, hydrophobic groups form a core, while polar and charged groups extend into water, a phenomenon known as hydrophobic collapse [49]. In contrast, SEQ2 forms extended coil morphology in these backbone architectures. However, both SEQ1 and SEQ2 exhibit similar compactness in MMA- and peptide-based RHPs, which both feature insoluble backbones. Peptide-based RHPs, like MMA-RHPs, demonstrate sequence insensitivity in globule compactness (see also Fig 3B). The key distinction between these systems lies in the degree of hydration frustration.

The influence of chain-level composition and hydrophobic collapse is also observed in random copolymers with soluble backbone architectures, such as poly(OEGX-r-EHX) and poly(OEGX-r-MX), where X = A (acrylate), An (acrylamide), or MAn (methacrylamide), with MA (methacrylate) serving as a reference (Fig 4B). MMA-based copolymers consistently exhibit compact structures, a property not observed in other backbone architectures. This suggests that MMA-based polymers offer a broad compositional design space for achieving compact and hydration-frustrated globules. In poly(OEGX-r-EHX)s with soluble backbone architectures, extended coil morphologies are observed up to 20% 2-EH side chains. Beyond this threshold, increasing 2-EH content induces hydrophobic core formation, compactifying the chains. Conversely, poly(OEGX-r-MX)s remain extended coils regardless of composition, lacking the driving forces required for collapse.

The dynamics of RHPs are influenced by both backbone architecture and chain-level composition. To quantify their effects, we examine the backbone atomic root-mean-square fluctuation (RMSF) and the standard deviation of backbone dihedral angles (SD dihedral) (Fig 5). SEQ2 exhibits greater mobility than SEQ1 in both metrics, attributable to the extended coil morphology of SEQ2, which increases polymer-water contacts and enhances mobility compared to the globule morphology of SEQ1. Backbones with soluble or peptide architectures are more dynamic than those with MMA backbone architectures. This disparity arises from the glassy nature of PMMA and the additional stabilizing effect of favorable MMA-PEG contacts, the latter of which is absent in peptide-based RHPs due to unfavorable peptide-PEG interactions (). The dynamics of MMA-based globules are comparable to those of the same polymer in the melt or vacuum state [39]. Backbone dihedral angle fluctuations are strongly influenced by the degree of backbone methylation. Architectures without methyl groups (e.g., acrylate and acrylamide) exhibit greater mobility than methylated architectures (e.g., methacrylate and methacrylamide) due to steric hindrance introduced by methyl groups [59]. Among all backbone architectures, peptide backbones exhibit the least increase in dihedral angle fluctuations with rising temperature.

thumbnail
Fig 5. Effect of backbone architectures and chain-level compositions on RHP dynamics.

(A) Root-mean-square fluctuation (RMSF) of backbone Cα atoms and (B) standard deviation of backbone dihedral angles (SD dihedral angle) for two selected sequences across different backbone architectures. For RMSF, results are averaged over ten conformations. The left entry corresponds to SEQ1, and the right entry corresponds to SEQ2. For SD dihedral angles, values are averaged over 5 ns of the annealing process (filled symbols represent SEQ1, and open symbols represent SEQ2).

https://doi.org/10.1371/journal.pone.0343799.g005

The differences in compactness and dynamics between SEQ1 and SEQ2, despite both originating from the same sequence ensemble accessible via one-pot synthesis, highlight the capability of MMA-based RHPs to achieve convergent properties. However, this sequence-insensitive behavior is not observed in other backbone architectures. Experimentally, purification processes such as chromatography can shift the overall composition of MMA-based RHPs [79], and MA-based RHPs have been shown to exhibit distinct subpopulations [83], which are not observed in MMA-based RHPs [84]. Our computational results confirm that the properties of MMA-based RHPs are more homogeneous and less sequence-sensitive, than those of MA-based RHPs [50], reflecting the coexistence of coil and globule conformations.

This discrepancy in dynamics across backbone identities can be attributed to the backbone’s ability to form hydrogen bonds, particularly intrachain hydrogen bonds between amine and carbonyl groups, as well as the hydrogen bonds between electronegative backbone atoms and water molecules. Peptide-based RHPs form significantly more intrachain hydrogen bonds (amine-carbonyl interactions) compared to MAn- and MMAn-based RHPs (Fig 6A), which act as a major driving force for compactification in peptide-based RHPs, similar to the behavior observed in proteins. MMA-based RHPs lack such intramolecular hydrogen bonds; their compactification is instead driven by hydrophobic interactions of the backbone and favorable negative-χ parameter between MMA and PEG. This highlights the distinct mechanism behind MMA-based RHP compactification, as opposed to peptide-based RHPs that are dominated by hydrogen bond.

thumbnail
Fig 6. Effect of backbone architectures on hydrogen bond formation.

The number of hydrogen bonds in a single RHP chain (DP = 100, with N = 100) is analyzed as follows: (A) between carbonyl and amide groups, (B) between carbonyl groups and water molecules, and (C) between ether/amine groups and water molecules. Schematics are shown, and the results are averaged over 20 conformations from two selected sequences. Error bars indicate standard deviations across all RHP conformations. The color coding of backbone architectures follows Fig 4.

https://doi.org/10.1371/journal.pone.0343799.g006

Hydrogen bonding also influences backbone-water interactions. The number of hydrogen bonds formed between the backbone (carbonyl/ether/amine groups) and water molecules (Fig 6B-C) correlates well with the backbone fluctuation observed in Fig 5. Increased backbone solubilization through intermolecular hydrogen bonding with water enhances backbone mobility. The significance of hydrogen bonding underscores that the sequence sensitivity in aqueous polymer globules cannot be predicted solely from the glass transition temperature (Tg) of the homopolymer melts (e.g., polyacrylate, polymethacrylate, polyacrylamide, and polymethacrylamide) (S3 Fig, data adapted from [59]). Melt systems fail to capture polymer-water interactions, while aqueous globule systems place less emphasis on the role of intermonomer hydrogen bonding. Both factors are essential for accurately predicting the behavior of RHPs in aqueous environments.

Evolution of chain-level compositions in SEQ1 and SEQ2 of MAn-based RHPs, and sequence sensitivity assessment

To examine the effect of chain-level composition on compactness and monomer hydration patterns, we employ a two-step evolution process in MAn-based RHPs. Compactness is quantified by backbone Rg, and monomer hydration patterns are characterized using the hydration array introduced in our recent work [50]. This hydration array is a 73-dimensional array, representing the NFSS values for four monomer types (42 M, 14 2-EH, 14 OEG, and 3 3-SP) that are shared across all sequences. The similarity between systems is assessed by comparing the overlap of convex hulls formed by dimensionally-reduced hydration arrays using PCA (See Methods for details).

The chain-level composition evolution of SEQ1 and SEQ2 proceeds in two steps, passing through a shared intermediate composition that represents the average composition of SEQ1 and SEQ2. The compositions of SEQ1, SEQ2, and the intermediate are shown in Fig 7A. In each step, the selected sequence undergoes random point mutations to generate ten new sequences that match the targeted composition. For example, in SEQ1, seven EHAn monomers are randomly mutated into one OEGAn, four MAn, and two SPAn monomers, producing ten sequences collectively referred to as SEQ1mut1, which correspond to the intermediate composition. Subsequently, six EHAn monomers in SEQ1mut1 are further mutated into three MAn and three SPAn, generating ten sequences labeled SEQ1mut2, which match the chain-level composition of SEQ2. Conversely, SEQ2 is first mutated to generate SEQ2mut1, matching the intermediate composition, and then further mutated to generate SEQ2mut2, which aligns with the chain-level composition of SEQ1. Full sequence schematics for these steps are shown in S4 Fig in S1 File. As a brief note, SEQ1 and SEQ2mut2, SEQ1mut1 and SEQ2mut1, and SEQ2 and SEQ1mut2 share the same chain-level composition. Additionally, SEQ1, SEQ1mut1, and SEQ1mut2 exhibit high sequence similarity, as do SEQ2, SEQ2mut1, and SEQ2mut2.

thumbnail
Fig 7. Evolution of chain-level compositions in SEQ1 and SEQ2 of MAn-based RHPs, and sequence sensitivity assessment.

(A) Two-step chain-level composition evolution. The evolution process involves an intermediate composition, illustrated using pie charts. Monomers are color-coded as follows: MAn in magenta, OEGAn in blue, EHAn in gray, and SPAn in yellow. In each step, monomers in the selected sequence are randomly mutated to generate ten offspring sequences through point mutations that progressively match the target composition. (B) Backbone Rg values of six systems. Statistical significance is evaluated using a two-sample t-test, with p-values denoted as follows: ns (p ≥ 0.05) and **** (p 0.0005). (C) Convex hulls of six systems in monomer hydration patterns via principal components analysis. Convex hulls illustrate monomer hydration patterns of the same system, with hulls of the same color indicating high sequence similarity arising from to point mutation evolution and hulls with the same patterns representing identical chain-level compositions. The variance explained by the first and second principal components is 53.6% and 9.6%.

https://doi.org/10.1371/journal.pone.0343799.g007

From SEQ1 to SEQ1mut2, the backbone Rg increases, indicating decompactification. Conversely, from SEQ2 to SEQ2mut2, the backbone Rg decreases, reflecting compactification. Moreover, systems with the same chain-level compositions (SEQ1 and SEQ2mut2, SEQ1mut1 and SEQ2mut1, and SEQ2 and SEQ1mut2) exhibit nearly identical backbone Rg distributions. These findings demonstrate that compactness in MAn-based RHPs is primarily dictated by chain-level composition, rather than specific sequence details.

The convex hulls formed by SEQ1 (solid magenta, left) and SEQ2 (dotted blue, right) do not overlap. During the evolutionary process, the convex hull for SEQ1 shifts to the right as it evolves into SEQ1mut2, while the convex hull for SEQ2 shifts to the left as it transitions into SEQ2mut2. Convex hulls corresponding to systems with the same chain-level composition but low sequence similarity overlap. Notably, convex hulls for systems with different chain-level compositions and sequences (e.g., SEQ1mut2 and SEQ2mut2) also show significant overlap. These observations suggest that monomer hydration patterns are governed by a combined influence between sequence identity and chain-level composition, rather than by either factor alone in MAn-based RHPs. In contrast, MMA-based RHPs generally exhibit more robust properties with respect to sequence and chain-level composition, even when generated from the same sequence ensemble obtained through stochastic synthesis.

Effect of the length of PEG pendant group and side-chain micropolarity in MMAn-based RHPs

The water solubility of poly(OEGnMA) is heavily dependent on the length of its oligo(ethylene glycol) (PEG) side chains units, particularly in low-n regime [85,86]. For example, poly(OEGnMA) is insoluble when n = 1, while reversible cloud points occur at 26 °C and 52 °C for n = 2 and n = 3, respectively, demonstrating thermoresponsive behavior. In this study, we examine the solubility of OEGnMAn monomers within MMAn-based RHPs (Fig 8). The hydration of the monomers, quantified by NFSS, decreases significantly as the PEG pendant group length shortens from 9 to 4–2 repeat units, with hydration values dropping from 0.77 to 0.60. Atomic partial charges calculated via RESP reveal that the terminal and penultimate oxygens in the polyether chain (−0.466 and −0.593, respectively) carry smaller magnitudes of partial charge compared to the other oxygens (−0.682). This difference in charge distribution increases the hydrophobicity of the monomer as the PEG repeat unit number decreases. Such reductions in hydration and increased hydrophobicity with shorter PEG side chains are consistent with trends observed in other polyethers, where solubility is strongly linked to chain length and partial charge distributions [87]. These findings emphasize the critical role of PEG chain length in determining the solubility and hydration behavior of MMAn-based RHPs.

thumbnail
Fig 8. Effect of the length of PEG pendant group of OEGMAn in MMAn-based RHPs.

The chemical structures of OEGMAn with nine (OEG9MAn), four (OEG4MAn), and two (OEG2MAn) PEG repeating units, and their corresponding NFSS values. Each system is simulated for N = 100 simulations.

https://doi.org/10.1371/journal.pone.0343799.g008

The polarity of alkyl side chains, including isobutyl, 2-ethylhexyl, and octyl, has been shown to influence their hydration patterns, ranging from frustrated isobutyl and 2-ethylhexyl, to non-frustrated octyl [49]. This difference in micropolarity, such as between valine, alanine, and glycine, is known to govern the organization of elastin-like polypeptide (ELP) condensates [88]. Here, we further investigate the hydration patterns between 2-ethylhexyl and octyl, as well as between poly(ethylene glycol) (PEG) and poly(propylene glycol-co-ethylene glycol) (PG-EG) side chains (Fig 9). The latter polyether has been used in SCNP to modulate both water solubility [89,90] and compactness [91]. The branched 2-ethylhexyl side chain and polyether (PG-EG) exhibit greater hydration than their linear counterparts, octyl and PEG. The higher degree of branching promotes hydration, which can be attributed to the polar carbon atoms in the branching. For example, the average atomic partial charges of the terminal atoms in polypropylene glycol is −0.404, compared to −0.303 in polyethylene glycol. This increased hydration of the polar group in EHMA-(PG-EG)MA-RHP and OMA-(PG-EG)MA-RHP results in hydration patterns characterized as semi-frustrated (frustrated hydrophobic group but non-frustrated polar group) and core-shell (non-frustrated hydrophobic and polar groups), in comparison with the fully-frustrated hydration pattern observed in EHMA-OEGMA-RHP.

thumbnail
Fig 9. Effect of the side chain micropolarity in MMAn- and MAn-based RHPs.

The chemical structure of RHPs with linear (O-) and branched (EH-) alkyl side chains, and Jeffamine-like (PG-EG), and their corresponding NFSS values. Each system is simulated for N = 10 simulations. In both panels, the left entry corresponds to the EH- side chain, and the right entry corresponds to the O- side chain for each backbone architecture.

https://doi.org/10.1371/journal.pone.0343799.g009

Conclusion

This study offers a detailed exploration of the factors that govern the compactness, dynamics, hydration, and solubility of RHPs, focusing on variations in chain length, monomer identity, backbone architecture, and chain-level composition. By systematically examining four- and five-component MMA-based RHPs alongside alternative backbone types and side chains with distinct micropolarities, we uncover molecular principles underlying their assembly and hydration behavior. Together, these findings underscore the tunable nature of RHPs and demonstrate the expansive design space through MMA-based RHPs. Such polymers hold significant promise as synthetic analogs of proteins, enabling the rational design of protein-mimetic materials with customizable structure and function.

We would like to highlight that for MMA-based RHPs, which have demonstrated versatile protein-like functions experimentally, a chain length of approximately 100 monomers appears to be optimal for achieving both compactness and balanced hydration. The negative χ parameter between MMA and PEG gives rise to a distinct form of hydration frustration that sets MMA-based RHPs apart from other backbone architectures and imparts sequence insensitivity to many of their physicochemical properties.

Future studies can expand the applicability of RHPs by exploring the effects of multivalent ions and cofactor binding, which could facilitate molecular encapsulation and open new opportunities for catalytic applications and delivery vehicles. Temperature- and pH-dependent transitions will further enable the use of RHPs as responsive materials. Coupled with autonomous laboratory [28,36,37], advanced experimental and computational techniques will support the rational design and engineering of RHPs with protein-like functions, ultimately unlocking their full potential for diverse applications [19].

Supporting information

S1 File. Supplementary information (S1–S4 Fig).

https://doi.org/10.1371/journal.pone.0343799.s001

(PDF)

Acknowledgments

The authors acknowledge the MIT SuperCloud and Lincoln Laboratory Supercomputing Center for providing computing resources.

References

  1. 1. Arnold FH. Directed Evolution: Bringing New Chemistry to Life. Angew Chem Int Ed Engl. 2018;57(16):4143–8. pmid:29064156
  2. 2. Anfinsen CB. Principles that govern the folding of protein chains. Science. 1973;181(4096):223–30. pmid:4124164
  3. 3. Dill KA, Ozkan SB, Shell MS, Weikl TR. The protein folding problem. Annu Rev Biophys. 2008;37:289–316. pmid:18573083
  4. 4. Lyon CK, Prasher A, Hanlon AM, Tuten BT, Tooley CA, Frank PG, et al. A brief user’s guide to single-chain nanoparticles. Polym Chem. 2015;6(2):181–97.
  5. 5. Hanlon AM, Lyon CK, Berda EB. What Is Next in Single-Chain Nanoparticles?. Macromolecules. 2015;49(1):2–14.
  6. 6. ter Huurne GM, Palmans ARA, Meijer EW. Supramolecular Single-Chain Polymeric Nanoparticles. CCS Chem. 2019;1(1):64–82.
  7. 7. Chen R, Berda EB. 100th Anniversary of Macromolecular Science Viewpoint: Re-examining Single-Chain Nanoparticles. ACS Macro Lett. 2020;9(12):1836–43. pmid:35653673
  8. 8. Barbee MH, Wright ZM, Allen BP, Taylor HF, Patteson EF, Knight AS. Protein-Mimetic Self-Assembly with Synthetic Macromolecules. Macromolecules. 2021;54(8):3585–612.
  9. 9. Wijker S, Palmans ARA. Protein-Inspired Control over Synthetic Polymer Folding for Structured Functional Nanoparticles in Water. Chempluschem. 2023;88(7):e202300260. pmid:37417828
  10. 10. Mundsinger K, Izuagbe A, Tuten BT, Roesky PW, Barner-Kowollik C. Single Chain Nanoparticles in Catalysis. Angew Chem Int Ed Engl. 2024;63(7):e202311734. pmid:37852937
  11. 11. Terashima T, Mes T, De Greef TFA, Gillissen MAJ, Besenius P, Palmans ARA, et al. Single-chain folding of polymers for catalytic systems in water. J Am Chem Soc. 2011;133(13):4742–5. pmid:21405022
  12. 12. Sanders MA, Chittari SS, Foley JR, Swofford WM, Elder BM, Knight AS. Leveraging Triphenylphosphine-Containing Polymers to Explore Design Principles for Protein-Mimetic Catalysts. J Am Chem Soc. 2024;146(25):17404–13. pmid:38863219
  13. 13. Huerta E, Stals PJM, Meijer EW, Palmans ARA. Consequences of folding a water-soluble polymer around an organocatalyst. Angew Chem Int Ed Engl. 2013;52(10):2906–10. pmid:23233410
  14. 14. Liang Y, Bergbreiter DE. Recyclable polyisobutylene (PIB)-bound organic photoredox catalyst catalyzed polymerization reactions. Polym Chem. 2016;7(12):2161–5.
  15. 15. Eisenreich F, Meijer EW, Palmans ARA. Amphiphilic Polymeric Nanoparticles for Photoredox Catalysis in Water. Chemistry. 2020;26(45):10355–61. pmid:32428312
  16. 16. Terashima T, Sugita T, Fukae K, Sawamoto M. Synthesis and Single-Chain Folding of Amphiphilic Random Copolymers in Water. Macromolecules. 2014;47(2):589–600.
  17. 17. Upadhya R, Tamasi MJ, Di Mare E, Murthy S, Gormley AJ. Data-driven design of protein-like single-chain polymer nanoparticles. ChemRxiv. 2022.
  18. 18. Yu H, Eres M, Hilburg SL, Kang P, Jin T, Grigoropoulos A, et al. Random heteropolymers as enzyme mimics. Nature. 2025;649(8095):83–90.
  19. 19. Wu G, Jin T, Zhou H, Coley CW, Alexander-Katz A, Lu H. Seeking Precise Protein-like Functions from Random Heteropolymer Ensemble and through Dimensionality Reduction. ACS Cent Sci. 2025;11(11):2053–62. pmid:41341047
  20. 20. Panganiban B, Qiao B, Jiang T, DelRe C, Obadia MM, Nguyen TD, et al. Random heteropolymers preserve protein function in foreign environments. Science. 2018;359(6381):1239–43. pmid:29590071
  21. 21. DelRe C, Jiang Y, Kang P, Kwon J, Hall A, Jayapurna I, et al. Near-complete depolymerization of polyesters with nano-dispersed enzymes. Nature. 2021;592(7855):558–63. pmid:33883730
  22. 22. Waltmann C, Mills CE, Wang J, Qiao B, Torkelson JM, Tullman-Ercek D, et al. Functional enzyme-polymer complexes. Proc Natl Acad Sci U S A. 2022;119(13):e2119509119. pmid:35312375
  23. 23. Ruan Z, Li S, Grigoropoulos A, Amiri H, Hilburg SL, Chen H, et al. Population-based heteropolymer design to mimic protein mixtures. Nature. 2023;615(7951):251–8. pmid:36890370
  24. 24. Jiang T, Hall A, Eres M, Hemmatian Z, Qiao B, Zhou Y, et al. Single-chain heteropolymers transport protons selectively and rapidly. Nature. 2020;577(7789):216–20. pmid:31915399
  25. 25. Yan T, Liu S, Li C, Xu J, Yu S, Wang T, et al. Flexible Single-Chain-Heteropolymer-Derived Transmembrane Ion Channels with High K+ Selectivity and Tunable pH-Gated Characteristics. Angew Chem Int Ed Engl. 2022;61(42):e202210214. pmid:36039469
  26. 26. Wang J, Waltmann C, Umana-Kossio H, Olvera de la Cruz M, Torkelson JM. Heterogeneous Charged Complexes of Random Copolymers for the Segregation of Organic Molecules. ACS Cent Sci. 2021;7(5):882–91. pmid:34079903
  27. 27. Wang J, Waltmann C, Harms C, Hu S, Hegarty J, Shindel B, et al. Tailoring Interactions of Random Copolymer Polyelectrolyte Complexes to Remove Nanoplastic Contaminants from Water. Langmuir. 2023;39(21):7514–23. pmid:37196238
  28. 28. Liao H, Hu S, Yang H, Wang L, Tanaka S, Takigawa I, et al. Data-driven de novo design of super-adhesive hydrogels. Nature. 2025;644(8075):89–95. pmid:40770436
  29. 29. Leyden MC, Oviedo F, Saxena S, Kumar R, Le N, Reineke TM. Synergistic Polymer Blending Informs Efficient Terpolymer Design and Machine Learning Discerns Performance Trends for pDNA Delivery. Bioconjug Chem. 2024;35(7):897–911. pmid:38924453
  30. 30. Dalal RJ, Oviedo F, Leyden MC, Reineke TM. Polymer design via SHAP and Bayesian machine learning optimizes pDNA and CRISPR ribonucleoprotein delivery. Chem Sci. 2024;15(19):7219–28. pmid:38756796
  31. 31. Kumar R, Le N, Oviedo F, Brown ME, Reineke TM. Combinatorial Polycation Synthesis and Causal Machine Learning Reveal Divergent Polymer Design Rules for Effective pDNA and Ribonucleoprotein Delivery. JACS Au. 2022;2(2):428–42. pmid:35252992
  32. 32. Zhou H, Wu G, Zhang Z, Zhu Z, Zhao T, Zhou Q, et al. Data-Driven Design of Random Heteropolypeptides as Synthetic Polyclonal Antibodies. J Am Chem Soc. 2025;147(24):21077–88. pmid:40481783
  33. 33. DelRe C, Chang B, Jayapurna I, Hall A, Wang A, Zolkin K, et al. Synergistic Enzyme Mixtures to Realize Near-Complete Depolymerization in Biodegradable Polymer/Additive Blends. Adv Mater. 2021;33(49):e2105707. pmid:34623716
  34. 34. Kwon J, DelRe C, Kang P, Hall A, Arnold D, Jayapurna I, et al. Conductive Ink with Circular Life Cycle for Printed Electronics. Adv Mater. 2022;34(30):e2202177. pmid:35580071
  35. 35. Kosuri S, Borca CH, Mugnier H, Tamasi M, Patel RA, Perez I. Machine-assisted discovery of chondroitinase abc complexes toward sustained neural regeneration. Advanced Healthcare Materials. 2022;:2102101.
  36. 36. Tamasi MJ, Patel RA, Borca CH, Kosuri S, Mugnier H, Upadhya R, et al. Machine Learning on a Robotic Platform for the Design of Polymer-Protein Hybrids. Adv Mater. 2022;34(30):e2201809. pmid:35593444
  37. 37. Wu G, Jin T, Alexander-Katz A, Coley CW. Autonomous Discovery of Functional Random Heteropolymer Blends through Evolutionary Formulation Optimization. Matter. 2025;:102336. pmid:40881158
  38. 38. Hilburg SL, Ruan Z, Xu T, Alexander-Katz A. Behavior of Protein-Inspired Synthetic Random Heteropolymers. Macromolecules. 2020;53(21):9187–99.
  39. 39. Jin T, Hilburg SL, Alexander-Katz A. Glass transition of random heteropolymers: A molecular dynamics simulation study in melt, in water, and in vacuum. Polymer. 2023;265:125503.
  40. 40. Shakhnovich EI, Gutin AM. Formation of unique structure in polypeptide chains. Theoretical investigation with the aid of a replica approach. Biophys Chem. 1989;34(3):187–99. pmid:2611345
  41. 41. Pande VS, Rokhsar DS. Is the molten globule a third phase of proteins?. Proc Natl Acad Sci U S A. 1998;95(4):1490–4. pmid:9465042
  42. 42. Han Z, Hilburg SL, Alexander-Katz A. Forced Unfolding of Protein-Inspired Single-Chain Random Heteropolymers. Macromolecules. 2022;55(4):1295–309.
  43. 43. Jin T, Coley CW, Alexander-Katz A. Adsorption of Biomimetic Amphiphilic Heteropolymers onto Graphene and Its Derivatives. Macromolecules. 2023;56(5):1798–809.
  44. 44. Hilburg SL, Alexander-Katz A. Solvent Remodeling in Single-Chain Amphiphilic Heteropolymer Systems. Macromol Rapid Commun. 2022;43(17):e2200142. pmid:35298063
  45. 45. Hilburg SL, Jin T, Alexander-Katz A. Dynamic transformation of bio-inspired single-chain nanoparticles at interfaces. J Chem Phys. 2023;159(11):114902. pmid:37712796
  46. 46. Jin T, Coley CW, Alexander-Katz A. Energy Landscape and Stability in Random Heteropolymers: Somewhere Between Protein Folding and Plastic Miscibility. Macromolecules. 2026;59(4):2315–27.
  47. 47. Warshel A. Energetics of enzyme catalysis. Proc Natl Acad Sci U S A. 1978;75(11):5250–4. pmid:281676
  48. 48. Jones S, Thornton JM. Analysis of protein-protein interaction sites using surface patches. J Mol Biol. 1997;272(1):121–32. pmid:9299342
  49. 49. Jin T, Coley CW, Alexander-Katz A. Designing single-polymer-chain nanoparticles to mimic biomolecular hydration frustration. Nat Chem. 2025;17(7):997–1004. pmid:40074826
  50. 50. Jin T, Coley CW, Alexander-Katz A. Sequence-Sensitivity in Functional Synthetic Polymer Properties. Angew Chem Int Ed Engl. 2025;64(2):e202415047. pmid:39378183
  51. 51. Jin T, Dagadu A, Coley CW, Alexander-Katz A. Random Heteropolymers Enable Nonspecific Protein Binding and Loop-Mediated Stabilization. ACS Nano. 2025;19(45):39259–71. pmid:41208466
  52. 52. Smith AAA, Hall A, Wu V, Xu T. Practical Prediction of Heteropolymer Composition and Drift. ACS Macro Lett. 2019;8(1):36–40. pmid:35619408
  53. 53. Jayapurna I, Ruan Z, Eres M, Jalagam P, Jenkins S, Xu T. Sequence Design of Random Heteropolymers as Protein Mimics. Biomacromolecules. 2023;24(2):652–60. pmid:36638823
  54. 54. Stals PJM, Gillissen MAJ, Nicolaÿ R, Palmans ARA, Meijer EW. The balance between intramolecular hydrogen bonding, polymer solubility and rigidity in single-chain polymeric nanoparticles. Polym Chem. 2013;4(8):2584.
  55. 55. Wang J, Wolf RM, Caldwell JW, Kollman PA, Case DA. Development and testing of a general amber force field. J Comput Chem. 2004;25(9):1157–74. pmid:15116359
  56. 56. Bayly CI, Cieplak P, Cornell W, Kollman PA. A well-behaved electrostatic potential based method using charge restraints for deriving atomic charges: the RESP model. J Phys Chem. 1993;97(40):10269–80.
  57. 57. Frisch M, Trucks G, Schlegel H, Scuseria G, Robb M, Cheeseman J. Gaussian 16. Wallingford, CT, USA: Gaussian, Inc. 2016.
  58. 58. Jin T, Coley CW, Alexander-Katz A. Molecular signatures of the glass transition in polymers. Phys Rev E. 2022;106(1–1):014506. pmid:35974655
  59. 59. Jin T, Coley CW, Alexander-Katz A. A Computationally Informed Unified View on the Effect of Polarity and Sterics on the Glass Transition in Vinyl-based Polymer Melts. ACS Macro Lett. 2023;12(11):1517–22. pmid:37889173
  60. 60. Case DA, Ben-Shalom IY, Brozell SR, Cerutti DS, Cheatham TE, Cruzeiro VWD. AMBER 2019. 2019.
  61. 61. Loncharich RJ, Brooks BR, Pastor RW. Langevin dynamics of peptides: the frictional dependence of isomerization rates of N-acetylalanyl-N’-methylamide. Biopolymers. 1992;32(5):523–35. pmid:1515543
  62. 62. Ryckaert J-P, Ciccotti G, Berendsen HJC. Numerical integration of the cartesian equations of motion of a system with constraints: molecular dynamics of n-alkanes. Journal of Computational Physics. 1977;23(3):327–41.
  63. 63. Berendsen HJC, Postma JPM, van Gunsteren WF, DiNola A, Haak JR. Molecular dynamics with coupling to an external bath. The Journal of Chemical Physics. 1984;81(8):3684–90.
  64. 64. Essmann U, Perera L, Berkowitz ML, Darden T, Lee H, Pedersen LG. A smooth particle mesh Ewald method. The Journal of Chemical Physics. 1995;103(19):8577–93.
  65. 65. Tsui V, Case DA. Theory and applications of the generalized Born solvation model in macromolecular simulations. Biopolymers. 2000;56(4):275–91. pmid:11754341
  66. 66. Joung IS, Cheatham TE III. Determination of alkali and halide monovalent ion parameters for use in explicitly solvated biomolecular simulations. J Phys Chem B. 2008;112(30):9020–41. pmid:18593145
  67. 67. Mark P, Nilsson L. Structure and Dynamics of the TIP3P, SPC, and SPC/E Water Models at 298 K. J Phys Chem A. 2001;105(43):9954–60.
  68. 68. Martínez L, Andrade R, Birgin EG, Martínez JM. PACKMOL: a package for building initial configurations for molecular dynamics simulations. J Comput Chem. 2009;30(13):2157–64. pmid:19229944
  69. 69. van der Walt S, Colbert SC, Varoquaux G. The NumPy Array: A Structure for Efficient Numerical Computation. Comput Sci Eng. 2011;13(2):22–30.
  70. 70. Roe DR, Cheatham TE III. PTRAJ and CPPTRAJ: Software for Processing and Analysis of Molecular Dynamics Trajectory Data. J Chem Theory Comput. 2013;9(7):3084–95. pmid:26583988
  71. 71. Grosberg A, Rabin Y, Havlin S, Neer A. Crumpled Globule Model of the Three-Dimensional Structure of DNA. Europhys Lett. 1993;23(5):373–8.
  72. 72. Hofmeister F. Zur Lehre von der Wirkung der Salze. Archiv f experiment Pathol u Pharmakol. 1888;25(1):1–30.
  73. 73. Fetters LJ, Lohse DJ, Colby RH. Chain Dimensions and Entanglement Spacings. Physical Properties of Polymers Handbook. Springer New York. 2007. p. 447–54. https://doi.org/10.1007/978-0-387-69002-5_25
  74. 74. Ito H, Russell TP, Wignall GD. Interactions in mixtures of poly(ethylene oxide) and poly(methyl methacrylate). Macromolecules. 1987;20(9):2213–20.
  75. 75. Chothia C. The nature of the accessible and buried surfaces in proteins. J Mol Biol. 1976;105(1):1–12. pmid:994183
  76. 76. Fournier D, Hoogenboom R, Thijs HML, Paulus RM, Schubert US. Tunable pH- and Temperature-Sensitive Copolymer Libraries by Reversible Addition−Fragmentation Chain Transfer Copolymerizations of Methacrylates. Macromolecules. 2007;40(4):915–20.
  77. 77. Rüdiger V, Schneider H-J, Solov’ev VP, Kazachenko VP, Raevsky OA. Crown Ether–Ammonium Complexes: Binding Mechanisms and Solvent Effects. Eur J Org Chem. 1999;1999(8):1847–56.
  78. 78. Israelachvili J, Pashley R. The hydrophobic interaction is long range, decaying exponentially with distance. Nature. 1982;300(5890):341–2. pmid:7144887
  79. 79. Yu H, Liu L, Yin R, Jayapurna I, Wang R, Xu T. Mapping composition evolution through synthesis, purification, and depolymerization of random heteropolymers. Journal of the American Chemical Society. 2024;146(9):6178–88.
  80. 80. Zuckermann RN. Peptoid origins. Biopolymers. 2011;96(5):545–55. pmid:21184486
  81. 81. Lutz J-F, Ouchi M, Liu DR, Sawamoto M. Sequence-controlled polymers. Science. 2013;341(6146):1238149. pmid:23929982
  82. 82. DeStefano AJ, Segalman RA, Davidson EC. Where Biology and Traditional Polymers Meet: The Potential of Associating Sequence-Defined Polymers for Materials Science. JACS Au. 2021;1(10):1556–71. pmid:34723259
  83. 83. Gomez DC, Seth S, Mondal R, Koehler SJ, Baker JG, Plate C. Polyacrylates with protein recognition and functional modulation. 2025.
  84. 84. Yu H, Eres ME, Hilburg S, Kang P, Li Z, Jin T. Tiered Segmental Design of Random Heteropolymers as Folded Protein Mimics. Nature.
  85. 85. Han S, Hagiwara M, Ishizone T. Synthesis of Thermally Sensitive Water-Soluble Polymethacrylates by Living Anionic Polymerizations of Oligo(ethylene glycol) Methyl Ether Methacrylates. Macromolecules. 2003;36(22):8312–9.
  86. 86. Lutz J-F, Hoth A. Preparation of Ideal PEG Analogues with a Tunable Thermosensitivity by Controlled Radical Copolymerization of 2-(2-Methoxyethoxy)ethyl Methacrylate and Oligo(ethylene glycol) Methacrylate. Macromolecules. 2005;39(2):893–6.
  87. 87. Ensing B, Tiwari A, Tros M, Hunger J, Domingos SR, Pérez C, et al. On the origin of the extremely different solubilities of polyethers in water. Nat Commun. 2019;10(1):2893. pmid:31253797
  88. 88. Ye S, Latham AP, Tang Y, Hsiung C-H, Chen J, Luo F, et al. Micropolarity governs the structural organization of biomolecular condensates. Nat Chem Biol. 2024;20(4):443–51. pmid:37973891
  89. 89. ter Huurne GM, Gillissen MAJ, Palmans ARA, Voets IK, Meijer EW. The Coil-to-Globule Transition of Single-Chain Polymeric Nanoparticles with a Chiral Internal Secondary Structure. Macromolecules. 2015;48(12):3949–56.
  90. 90. Liu Y, Pauloehrl T, Presolski SI, Albertazzi L, Palmans ARA, Meijer EW. Modular Synthetic Platform for the Construction of Functional Single-Chain Polymeric Nanoparticles: From Aqueous Catalysis to Photosensitization. J Am Chem Soc. 2015;137(40):13096–105. pmid:26388188
  91. 91. Wijker S, Dellemme D, Deng L, Fehér B, Voets IK, Surin M, et al. Revealing the Folding of Single-Chain Polymeric Nanoparticles at the Atomistic Scale by Combining Computational Modeling and X-ray Scattering. ACS Macro Lett. 2025;14(4):428–33. pmid:40101120