Figures
Abstract
RiPPs (ribosomally-synthesized and post-translationally modified peptides) are a class of pharmaceutically-relevant natural products expressed as precursor peptides before being enzymatically processed into their final functional forms. Bioinformatic methods have illuminated hundreds of thousands of RiPP enzymes in sequence databases and the number of characterized chemical modifications is growing rapidly; however, it remains difficult to functionally express them in a heterologous host. One challenge is peptide stability, which we addressed by designing a RiPP stabilization tag (RST) based on a small ubiquitin-like modifier (SUMO) domain that can be fused to the N- or C-terminus of the precursor peptide and proteolytically removed after modification. This is demonstrated to stabilize expression of eight RiPPs representative of diverse phyla. Further, using Escherichia coli for heterologous expression, we identify a common set of media and growth conditions where 24 modifying enzymes, representative of diverse chemistries, are functional. The high success rate and broad applicability of this system facilitates: (i) RiPP discovery through high-throughput “mining” and (ii) artificial combination of enzymes from different pathways to create a desired peptide.
Citation: Glassey E, King AM, Anderson DA, Zhang Z, Voigt CA (2022) Functional expression of diverse post-translational peptide-modifying enzymes in Escherichia coli under uniform expression and purification conditions. PLoS ONE 17(9): e0266488. https://doi.org/10.1371/journal.pone.0266488
Editor: Yu-Hsuan Tsai, Shenzhen Bay Laboratory, CHINA
Received: October 22, 2021; Accepted: March 22, 2022; Published: September 19, 2022
Copyright: © 2022 Glassey et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: Code for MS/MS analysis (msms_structure_annot) and LC-MS analysis (ripp-analysis) are available on GitHub (https://github.com/dantheand/msms_structure_annot and https://github.com/VoigtLab/ripp-analysis). Raw data is available on Zenodo (https://doi.org/10.5281/zenodo.6333545). Plasmid sequences are available in S8 Table and deposited with the NCBI (accession OM858691-OM858812). All physical strains and plasmids are available upon request.
Funding: This research was funded by the US Defense Advanced Research Projects Agency (DARPA) Living Foundries program award (HR0011-15-C-0084) and 1KM award (HR0011-15-C-0084) and a research award from Novartis Institute for BioMedical Research (Cambridge, USA). Additionally, A.M.K. was supported by Banting Fellowships Program and D.A.A. was supported by NSF GRFP. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Metagenomics has led to a deluge of microbial genomes, leading to high-throughput efforts to “mine” the molecules made by organisms by rebuilding pathways and screening for functions-of-interest [1–3]. Because they are gleaned from sequence databases, the organism or genomic DNA may not be available, thus necessitating the use of DNA synthesis and a heterologous host to obtain the chemical product [4–6]. RiPPs (ribosomally-synthesized and post-translationally modified peptides) are a potentially rich source of functional diversity that are encoded in gene clusters as a precursor peptide that is enzymatically modified before being proteolytically released [7–14]. Because the peptidic product is made by the ribosome, as opposed to a large megasynthase, the probability of successful heterologous expression is high. However, expressed peptides are often unstable in vivo and post-translational modifying enzymes may not function in new contexts [15–17]. As a result, only a small fraction of the thousands of known RiPP pathways have been explored [13].
RiPPs are classified by the chemical modifications made to the peptide. Some are defined by cyclization chemistry, including lanthipeptides (lanthionine macrocyclizations), thiopeptides ([4+2] cycloaddition of dehydrated serine/threonine), lasso peptides (N-terminal macrocyclization with asp/glu), graspetides (lactone/lactam macrocyclizations), bottromycin (macrolactamidine macrocyclization), ranthipeptides (Non-Cα thioether macrocyclizations), pantocins (glutamate crosslink), and sactipeptides (sactionine macrocyclizations) [7, 14]. Others by individual modifications present, such as glycocins (side chain glycosylation), microcin C (aminoacyl adenylation or cytidylation), comX (indole cyclization and prenylation), sulfatyrotide (tyrosine sulfation), spliceotide (β-amino acids from backbone splicing), and cyanobactins (N-terminal proteolysis). Precursor peptide organization varies between RiPP classes. Modifying enzymes can either bind to a leader/follower sequence in the precursor peptide or directly modify the core. The core consists of 2 to over 50 amino acids and there can be multiple cores in one precursor peptide [17–20]. Leader peptides range from 7 to over 80 amino acids and can recruit multiple modifying enzymes that can have overlapping binding sequences [21–23]. The diversity in chemistry and genetic encoding complicates the creation of general engineering tools that can be systematically used for mining efforts across RiPP classes.
There are examples from most RiPP classes have been successfully expressed in E. coli [16, 17, 20–22, 24–29]. Tools have been developed to aid heterologous production, including multi-plasmid inducible systems and exploration of E. coli, various Streptomyces strains and Microvirgula aerodentrificans as expression hosts [17, 30–33]. In vitro methods have also been used to engineer production of new molecules or study biosynthesis [34–37]. Gene cluster regulation may not function properly in a new host. To overcome this, the precursor peptide and modifying enzymes can be cloned and expressed separately [17, 33]. However, precursor peptides have been observed to often be unstable due to host proteases, thus necessitating the use of stabilization tags [15, 16, 24]. Large tags need to be removed before peptide modifications can be observed by mass spectrometry, as is the case for maltose binding protein (MBP, 45 kD), green fluorescent protein (GFP, 27 kD) or glutathione-S-transferase (GST, 26 kD) [38]. In contrast, the small ubiquitin-like modifier tag (SUMO, 12 kD) is smaller, thus allowing modifications to be observed prior to its removal. Further, it can be removed using SUMO protease immediately after purification without desalting [39], which simplifies its use in high-throughput formats. SUMO has been used for expression of both eukaryotic and prokaryotic antimicrobial peptides in E. coli [40–42] as well as a post-translationally modified lanthipeptide from Lactococcus [43] and a xenorceptide from Xenorhabdus [44].
Here, we develop a method for the expression, modification and purification of diverse RiPPs under uniform conditions in order to inform high-throughput mining efforts. We use a SUMO tag and add affinity tags, cleavage sites and linkers, which we refer to as “RiPP Stabilization Tag” (RST). This SUMO-based tag is demonstrated to work with diverse RiPP classes and modifying enzymes. Inducible systems from E. coli Marionette [45] are used to express the tagged precursor peptide and modifying enzymes from separate plasmids. These enzymes are all expressed in the same heterologous host (E. coli) under uniform culture conditions and induction times. We tested 50 precursor peptides with 46 modifying enzymes and identify 39 peptides that express as RST fusions, of which 24 were able to be modified with the SUMO-based tag attached. This provides a method that can be used for high-throughput RiPP discovery out of metagenomics data and for selecting enzymes that can be combined to build pathways to new-to-nature RiPPs.
Results
Expression system for modified peptides
Two versions of the RST were designed so that it can be fused to either the N- or C-terminus (RSTN or RSTC) of a precursor peptide (Fig 1A). Typically, the N-terminal version was used, but having a C-terminal version is convenient when either there is a modification at the N-terminus or the leader peptide is removed during modification. For purification, a HIS6 tag was included at the terminus of the RST. Small Ubiquitin-like Modifier (SUMO) was used as the stabilizing domain, with a linker sequence between SUMO and the precursor peptide. The linker was adapted from a recombinant protein expression system [46] and includes protease cleavage sites: TEV (N-terminal) or Thrombin (C-terminal). After purification, the RST can be removed using either these proteases or SUMO protease (for RSTN). Treatment with TEV leaves a GC scar at the N-terminus, where the cysteine is included for SAMDI (self-assembled monolayers on gold for matrix-assisted laser desorption/ionization) mass spectroscopy [47, 48].
a Schematic of the RSTs. Sequences for the backbones are in S8 Table, part sequences in S7 Table. b The two-plasmid system used to express the RiPP peptide and modifying enzyme. The genetic part sequences are provided in S7 Table. c The extracted peptides were analyzed using LC-MS to observe the mass shift associated with the modification. The tags used are shown above the spectra. The observed masses were calculated from 5 consecutive m/z peaks in the spectrum (Methods and S1 Table) and the expected masses are the molar mass of the peptide, calculated from the primary sequences. The initially designed RSTN variant with Link-1 was used in panel c. The spectra in c were collected from single experiments.
A two-plasmid system was used to separately express the precursor peptide and modifying enzyme, thus enabling combinations to be tested rapidly through co-transformations (Figs 1B and S1). The inducible system for the precursor peptide was selected to maximize its expression. To this end, we used the IPTG-inducible PT5LacO promoter [45] and a strong ribosome binding site (RBS) designed using the RBS Calculator [49, 50] (Methods). For the modifying enzyme, we used the cumate-inducible PCymR* or ahl-inducible PLuxB because of their high dynamic range (low off and high on) [45]. A different RBS is calculated for each modifying enzyme to maximize the probability of successful expression and bias toward similar expression levels.
Expression and purification protocols were first developed for low-throughput growth in 250 ml flasks in LB media (Methods). The tagged precursor peptide and modifying enzyme were induced simultaneously. After induction with 1 mM IPTG and 200 μM cumate (for PCymR*) or 10 μM 3OC6-AHL (for PLuxB), cultures were grown at 18°C for 20 hours with shaking. Then, the peptide was purified using immobilized metal affinity chromatography (IMAC) and analyzed using LC-MS.
An example of the production of a modified peptide in flasks is shown in Fig 1C using a variant of the trunkamide precursor peptide (TruE*) and cognate modifying enzyme TruD. Two samples were prepared: 1. the TruE* peptide fused to an initial version of RSTN that used Link-1 (instead of Link-2) co-transformed with the plasmid containing PLuxB-controlled truD (pEG1128), 2. the TruE* peptide expressed as an MBP fusion also co-transformed with pEG1128. From the LC-MS spectra, we calculated the observed mass for each of the peptides, and the expected error given the resolution of the mass spectrometer (Methods). TruD catalyzes the formation of a thiazoline from cysteine, causing a loss of water and a corresponding mass shift of -18 Da. The larger MBP obfuscates the observation of this expected mass shift because the resolution of the mass spectrometer is too low to reliably resolve modified versus unmodified MBP-tagged peptide and the isotope distributions between modified and unmodified MBP-tagged peptides overlap. In contrast, the expected mass spectrometer resolution of the RSTN fusion is 6 Da (standard deviation), and the expected and observed masses match (Fig 1C). Therefore, we conclude that we can observe the mass shift that occurs due to post-translational modification without removing the RSTN, even using a low-resolution quadrupole mass spectrometer.
Next, we tested RSTN stabilization of diverse precursor peptides across RiPP classes (Fig 2). The following examples from each class were selected: microviridin L from graspetides, bottromycin from bottromycins, streptide from streptides, PQQ from pyrroloquinoline quinones, subtilosin A from sactipeptides, trifolitoxin from linear azole peptides, prochlorosin from lanthipeptides, thiomuracin from thiopeptides, and pheganomycin from guanidinotides [14, 20, 29, 51–57]. This set encompasses a wide range of lengths, amino acid compositions, number of modifying enzyme binding sites, N- and C-terminal leaders/followers, and pheganomycin has two cores.
The peptide schematics show precursor peptide structural elements (leader/follower, white box) and core (grey box). From top to bottom, the gene names (species) are: mdnA, bmbC, strA, pqqA, sboA, tfxA, procA1.7, tbtA, pgm2. The LC-MS total ion chromatograms are shown (Methods). Peaks boxed in red match peptide expected mass, blue match the expected mass of a peptide cleavage product. Masses are provided in S1 Table. In +HIS6 column, top-to-bottom, extracts are 192, 193, 194, 195, 196, 198, 199, 200, and 202. In +RSTN column, top-to-bottom, extracts are 216, 220, 219, 218, 217, 221, 222, 223, and 225. For these experiments, the initially designed RSTN variant with Link-1 was used. The chromatograms are representative of two replicates that yielded similar results (the second performed with the high-throughput expression/purification protocol, Methods).
We tested the ability for RSTN to stabilize the unmodified peptides. Expression was measured in the absence of modifying enzymes to abrogate any stabilization affect that arises from peptide modification. Expression and purification were performed at the 250 mL flask scale, as described above (Methods). First, we evaluated precursor peptide expression when fused only to a N-terminal HIS6 tag [16]. This tag led to only three of nine peptides being detected by LC-MS (Fig 2). In contrast, when the precursor peptides were fused to an initial version of RSTN (using Link-1), large peaks appeared for all of the peptides. In summary, two of the nine peptides expressed as HIS6 fusions and seven of the nine peptides expressed as RST fusions were successfully expressed and purified as full-length peptides with the expected mass (S1 Table). One of the nine peptides (trifolitoxin) expressed as a HIS6 fusion and two of the nine (trifolitoxin and subtilosin A) expressed as RST fusions were expressed and purified, but the masses observed matched peptides cleaved by site-specific E. coli proteases at internal cleavage sites (S1 Table).
Production of active haloduracin
We then validated the production of a biologically-active product using our expression system. Modifications are directed at an RST-fused peptide, after which the tag is cleaved and activity tested. We selected haloduracin, which is a two-component lanthipeptide that had been previously expressed and purified from E. coli and shown to have antibiotic activity [34]. We synthesized the genes for the haloduracin A1 and haloduracin A2 peptides (fused to the final RSTN version with Link-2) and corresponding HalM1 and HalM2 modifying enzymes from Bacillus subtilis (Fig 3). An additional TEV protease cleavage site was added between the leader and core regions of the precursor peptide (Fig 3A) so that the core could be cleaved and recovered as the active product (Fig 3B). This leaves a single N-terminal glycine on the released core sequence. The peptide and enzyme genes were cloned into the two-plasmid system (Fig 1B) and co-transformed into E. coli NEB Express (Methods).
a Schematic of peptides from the haloduracin cluster, engineered to have TEV cleavage sites in between the leader and core peptides. b The expected structures after TEV cleavage are shown, with the glycine residue left by TEV colored red. c LC-MS TICs of haloduracins A1 and A2 after affinity purification, with peak corresponding to SUMO-tagged peptides boxed in red. d LC-MS TICs of haloduracins A1 and A2 after cleavage with TEV protease and purification, with peaks corresponding to peptides boxed in red. e LC-MS/MS fragmentation spectra of cleaved haloduracins A1 and A2, with masses that match fragments of structure in b, highlighted in blue (full annotation is in S2–S5 Notes). f Halo-assay of haloduracin A1, A2, and A1+A2 antimicrobial activity, using B. subtilis as the reporter strain (solvent negative control in S2 Fig). Data in panels c/d has been collected twice with similar results (once via “haloduracin production and purification” method and once via “peptide purification for MS/MS data analysis” method). MS/MS data in panel e represents a single replicate and halo-assay data in panel f is a representative example of data collected in triplicate (S2 Fig).
We developed a high-throughput 96-well system for expression and purification, which we tested using haloduracin (Methods). Cultures were grown in 2 mL of TB media in deep well plates (two 1 mL wells for each peptide), where they were each induced with 1 mM IPTG/200 μM cumate for 20 hours at 30°C with shaking. The cells were lysed, affinity-purified, and desalted using solid phase extraction, all in 96-well format. Then, the samples were treated with TEV protease to remove RSTN and the leader peptide and desalted again to concentrate the core peptide (Fig 3D) (Methods). The presence of the cleaved cores was verified by LC-MS (Fig 3D) and LC-MS/MS to investigate if SUMO disrupted or altered the lanthionine macrocyclizations present in both molecules (Fig 3E and S2–S3 Notes) (Methods). For both, the predicted structures were in close agreement with previously reported structural data [34, 58, 59]. For HalA2, seven of eight Ser/Thr residues were dehydrated and LC-MS/MS-based assignment of the single unmodified residue has been previously localized to Thr18, Thr22, or Ser23 [34, 58, 59]. We observed the correct number of dehydrations, and fragmentation that supported dehydration of Thr18, with Thr22 or Ser23 left unmodified. A previous report showed that the mutation S23A did not change the number of dehydrations [58], which agrees with our data, noting that we were not able to fully verify this structure through LC-MS/MS.
For the antimicrobial assay, the cleaved and desalted core peptides were resuspended in 50 μL 1:1 methanol:water. Bacillus subtilis PY79 was used as indicator strain and was spread on a LB-agar surface, on which 5 μL of either or 2.5 μL of both haloduracins (Figs 3F and S2) or a solvent control was added (S2 Fig) (Methods). Individually, they showed limited activity, but combined they form a clear halo of growth inhibition, suggesting that both peptides were properly modified and cyclized.
High-throughput assay of diverse modifying enzymes
We collated a set of 46 modifying enzymes and their cognate 50 precursor peptides from the literature. The complete list of pathways and enzymes is provided in S2–S3 Tables whereas the subset we ultimately found to be active are in Fig 4. These are representative of 13 bacterial RiPP classes from diverse genera and catalyze 22 different chemical transformations, including glycosylation, radical carbon-carbon bond formation and cysteine heterocyclization (S3 Table). The precursor peptide and modifying enzyme genes were codon optimized for E. coli and synthesized, or amplified when the source DNA was available, and cloned into the two-plasmid system (Methods). The precursor peptides were tagged with RSTN, except for macrocyclization of lasso peptides, which were fused to RSTC. The plasmids containing the modifying enzymes and precursor peptides were co-transformed into E. coli NEB Express.
RiPP class, source, gene names, and previously published structure for successfully observed modifications. Modified peptides analyzed with MS/MS are demarcated with an “X” in the MS/MS column. Potential structural ambiguity is listed in the Notes column: Blank—the collected LC-MS or LC-MS/MS data confirms the published structure or is consistent with published data for the structure, a–multiple cysteines present could be glycosylated and cannot be differentiated by LC-MS, b–cross-linking pattern was not validated and could differ from wild-type structure shown, c–MS/MS shows four distinct intra-modified cores, but does not validate cross-linking pattern within each core. Mean percent modified is listed for each modification in LB and TB medias, with individual replicates plotted in S4 Fig. Fraction modified for ThcoA/K is the sum of mono- and di-phosphorylated peptide. Percentages marked with an asterisk (*) have a modification with a small mass shift–percent modified may be inaccurate due to isotope overlap between unmodified/modified peptides (SboA/AlbA cleaved spectrum validation in S5 Fig; PapA/PapB in S5 Note). References are listed for modifications that have previously been shown in E. coli.
The cultures were grown following the high-throughput protocol in 96-well plates (Methods). Both TB media and LB media have been used to functionally express RiPPs in E. coli. The choice of media can impact the function of an enzyme; for example, radical S-adenosyl-L-methionine (rSAM) enzymes are more active in TB than LB, the latter requiring a reduction in shake speed and/or increased iron-sulfur cluster biosynthesis [22, 60, 61]. For applications requiring the high-throughput mining or the artificial combination of RiPP enzymes, it is desirable to have a single set of culture conditions. To this end, we evaluated the ability for the enzymes to modify their precursor peptides following the same culture conditions either in LB or TB (Fig 4). Other conditions, such as temperature, dissolved O2 and incubation time are known to affect enzyme activity and specificity, but we sought here to survey enzyme activity across a common set of conditions for recombinant expression.
All of the enzymes and precursor peptides were expressed in 1 mL of media in deep-well plates with shaking (Methods). Induction by 1 mM IPTG and 200 μM cumate was performed for 20 hours at 30°C, after which the modified peptide was purified. In all cases, the modification could be observed with LC-MS without cleaving RSTN. In total, 24/46 (52%) of the enzymes tested were active against at least one peptide in one of the medias tested, two of which had not previously been reported expressed in E. coli. The “% modified” values shown in Fig 4 were calculated from the extracted compound chromatograms (ECCs) based on the expected charge state m/z’s for unmodified, partially modified (if relevant) and modified peptide molar masses (S3 Fig). The ECC peaks were fit with a Gaussian and the areas under the curve calculated. The “% modified” was calculated by dividing the area of the peak associated with the modification by the total areas of modified and unmodified (and partially modified, if relevant) (Methods). More enzymes (24) had activity in TB media than LB media (20) and, on average, the % modified is higher. As expected, rSAM enzymes (AlbA, PapB, PlpXY) are more active in TB media and several only have activity in this media. Similarly, RaxST is a sulfur-requiring enzyme that is more active in TB media. Previous expression of this enzyme in LB has relied on co-expression of RaxP and RaxQ to biosynthesize the sulfur donor 3’-phosphoadenosine 5’-phosphosulfate (PAPS) [62].
The 25 modified peptides shown in Fig 4 showed the exact mass change that is expected to result from the modification. The amino acid specificities of some of the enzymes restrict the mass change to only one possible modified peptide product; there are 14 in this category (S4 Table) [21, 25, 28, 63–69]. However, some modifications could occur at different positions than the wild-type modification, leading to a different peptide with the same mass [19, 22, 29, 55, 70–75]. When multiple products are possible, the addition of an RST could change where the modification occurs (Fig 4). To test for this outcome, we selected several modifications from different classes to evaluate by LC-MS/MS (Methods). In addition to HalA1/HalA2 investigated above, the following were selected for structural annotation: PsnA2 macrolactonization by PsnB and PapA sactionine macrocyclization by PapB. The precursor peptides were modified to contain a TEV cleavage site between the leader and core peptides. The modifying enzymes and precursor peptides were expressed following the high-throughput protocol, the RSTN and leader peptide removed using TEV protease, and the modified core analyzed with LC-MS/MS (Methods). Fragmentation of PsnA2 was observed between the core repeats, with each core repeat fragment mass corresponding to two lactone macrocyclizations per repeat, in agreement with previously published results [19]. Similarly, fragmentation of PapA was observed between CxxxD motifs in the core, indicating that the correct CxxxD cycles have formed rather than other cyclization patterns.
Of the enzymes tested, 20 of the 46 did not modify a peptide at all and 3 of the 46 modified the peptide incorrectly or generated a complex mixture of modification products. This includes 14 clusters for which there are examples of successful expression in E. coli using a variety of expression methods (S2 Table). We looked for patterns based on the phylogeny from which the pathway was sourced, noting that they span cyanobacteria, actinobacteria, proteobacteria, and firmicutes (S6 Fig). There are successful examples from each of these phyla. The least successful phylum, Actinobacteria, yielded 2/7 functional pathways. We do not observe a relationship between similarity to E. coli and the likelihood of success. Most classes of chemical transformation had at least one example that was functional (Fig 5). However, both prenyl transferases (ComQ and KgpF) and all three P450 oxidases (MibO, CinX, and PbtO) were not functional. Other possible explanations for an enzyme to not modify include disruption of the modification by the SUMO tag, the necessity of other enzymes or genes from the native pathway to support modification, non-functional RBSs, co-factor availability, toxicity of the modified peptide and incompatible expression conditions.
All enzyme modification chemistries used in this publication. Example structures are shown in each box, with the modified residues/structures highlighted in red. Amino acids involved are listed in the lower left of each box (if amino acids are chemically restrictive). Green boxes are those for which we observed functional enzymes in our expression system. Black boxes were tested, but no tested enzymes were functional.
Discussion
While the number of characterized RiPP enzymes is growing rapidly in the literature, the conditions under which each enzyme is characterized vary across studies. This poses a challenge for high-throughput screening efforts if the conditions have to be re-optimized for each pathway. This manuscript presents a side-by-side survey of recombinant RiPP enzymes in E. coli, using the same growth and induction methods. Further, protocols are developed for every step to be performed in 96-well plate format under conditions that are consistent with high-throughput screening platforms [2, 76–78]. The RSTs address the problem of precursor peptide stability, for which degradation and solubility are likely the dominant causes of unobservable substrate. Their use increases the probability that a pathway will be successfully expressed in a new host; in other words, they increase the “hit rate” of screening efforts. The SUMO-based RSTs are compatible with many modifying enzymes, facilitate high-throughput purification and do not need to be removed prior to LC-MS analysis of modifications. Software was developed to rapidly analyze LC-MS data (Methods). Collectively, this presents a suite of tools that enable the high-throughput screening of RiPP pathways mined from sequence databases [13, 79, 80]. In this manuscript, we only look at the action of a single enzyme. To mine complete RiPP-encoding gene clusters, additional enzyme genes can either be assembled as operons or placed under the control of different inducible promoters (e.g., E. coli Marionette) [45].
The fraction of enzymes found to be functional in E. coli under common conditions is surprisingly high, especially considering the diversity in the source genera and chemistries. In our experience, it is much higher than the successful transfer of other natural products genes, such as non-ribosomal peptide synthases, which also produce peptidic products. These results imply that RiPP enzymes can be combined from different sources to create synthetic pathways from which all the enzymes are functionally expressed. Indeed, several examples have been published demonstrating the artificial combination of RiPP enzymes from different source species and pathways to make products not observed in nature [30, 81, 82]. Knowing that roughly half of RiPP enzymes are functionally compatible with E. coli expands the potential peptide chemical space that can be explored through the artificial mixing-and-matching of these enzymes. Fully enabling this requires a better understanding of the rules for designing precursor peptides that can be acted on by multiple modifying enzymes. Collectively, these tools for the mining and de novo design of RiPPs will allow the exploration of the vast universe of modified peptides for novel antibiotics, intercellular communication channels, and signaling molecules that influence animal and plant physiology.
Materials and methods
Strains, plasmids, media, and chemicals
E. coli NEB 10-beta (C3019I, New England BioLabs, Ipswich, MA, USA) was used for all routine cloning. E. coli BL21 (C2530H, New England BioLabs, Ipswich, MA, USA) was used to characterize RSTs and linker variants in low-throughput (flask) cultures. E. coli NEB Express (C2523I, New England BioLabs, Ipswich, MA, USA) was used to express all other experiments. All plasmids containing RST-fused precursor peptide genes use a pSC101 origin variant (var 2) with ampicillin resistance (S1 Fig and S7 Table) [83]. All plasmids carrying modifying enzyme genes contain p15A origins of replication and kanamycin resistance. LB-Miller media (B244620, BD, Franklin Lakes, NJ, USA) or TB media (T0311, Teknova, Hollister, CA, USA) supplemented with 0.4% glycerol (BDH1172-4LP, VWR, OH, USA) were used for peptide expression and modification. 2xYT liquid media (B244020, BD, Franklin Lakes, NJ, USA) and 2xYT + 2% agar (B214010, BD, Franklin Lakes, NJ, USA) plates were used for routine cloning and strain maintenance. SOB liquid media (S0210, Teknova, Hollister, CA, USA) was used for making competent cells. SOC liquid media (B9020S, New England BioLabs, Iwsich, MA, USA) was used for outgrowth. Cells were induced with the following chemicals: cumate (cuminic acid) ≥98% purity from Millipore Sigma (268402, Millipore Sigma, Saint Louis, MO, USA) added as 1000X stock (200 mM) in EtOH or DMSO; isopropyl β-D-1-thiogalactopyranoside (IPTG) ≥99% purity (I2481C, Gold Biotechnology, Saint Louis, MO, USA) added as 1000X stock (1 M) in water or DMSO; 3OC6-AHL from Millipore Sigma (K3007, Millipore Sigma, Saint Louis, MO, USA) added as a 1000X stock (10 mM) in DMF. Cells were selected with the following antibiotics: 50 μg/ml kanamycin (K-120-10, Gold Biotechnology, Saint Louis, MO, USA); 100 μg/ml carbenicillin (C-103-5, Gold Biotechnology, Saint Louis, MO, USA); 30 μg/ml chloramphenicol (B20841, Alfa Aesar, Ward Hill, MA, USA). Liquid chromatography was performed with Optima Acetonitrile (A996-4, Thermo Fisher Scientific, MA, USA) and water (Milli-Q Advantage A10, Millipore Sigma, Saint Louis, MO, USA) supplemented with LC-MS Grade Formic Acid (85178, Thermo Fisher Scientific). The following solvents/chemicals were used: Ethanol (V1001, Decon Labs, King of Prussia, PA, USA), Methanol (3016–16, Avantor, Center Valley, PA, USA), dimethyl sulfoxide (DMSO) (32434, Alfa Aesar, Ward Hill, MA, USA), Imidazole (IX0005, Millipore Sigma, Saint Louis, MO, USA), sodium chloride (X190, VWR, OH, USA), sodium phosphate monobasic monohydrate (20233, USB Corporation, Cleveland, OH, USA), sodium phosphate dibasic anhydrous (204855000, Acros, NJ, USA), guanidine hydrochloride (50950, Millipore Sigma, Saint Louis, MO, USA), tris (75825, Affymetrix, Cleveland, OH, USA), TCEP (51805-45-9, Gold Biotechnology, Saint Louis, MO, USA), and EDTA (0.5M stock, 15694, USB Corporation, Cleveland, OH, USA). DNA oligos and gblocks were ordered from Integrated DNA Technologies (San Francisco, CA, USA).
Gene design
A list of plasmids and corresponding plasmid maps are provided in S5 Table and S1 Fig. Amino acid sequences of all modifying enzymes and peptides are provided in S6 Table. Sequences of genetic parts and full plasmids are in S7–S8 Tables. Unless stated otherwise, modifying enzymes and peptides were codon optimized for E. coli, filtered for Type IIs restriction sites (SapI, BsaI, BbsI, BsmBI), and synthesized and cloned by Twist Biosciences (San Francisco, CA, USA) into custom backbones. The following genes were obtained from other sources: bmbC, truE, pgm2, strA, and paaP were codon optimized for E. coli and synthesized/cloned using DNA oligos; sboA and albA were amplified from the genome of Bacillus subtilus subsp. spizizenii (ATCC 6633); mdnA and mdnC were amplified from pARW071 [29] and mdnA* is a leader truncation of mdnA; pqqA was amplified from the genome of Klebsiella variicola str. 342; tfxA was amplified from the genome of Rhizobium leguminosarum Jordan (ATCC 53912); mcbCD and mcbA were synthesized and cloned by Twist without codon optimization (sequences were sourced from E. coli); truD was amplified from plasmid Topo-E1 [17]; paaA was codon optimized for E. coli and synthesized as a gblock by IDT. Ribosome binding sequences for each modifying enzyme were designed using the RBS calculator [49, 50] with a target translation initiation rate of 20,000–30,000. The RBS for peptide expression was optimized with the RBS Calculator to maximize expression rate. The HIS6 tag for peptide expression (ATag-1, S1 Fig and S7 Table) is taken from the pTEV5 construct reported [46] for expression without SUMO (plasmids pEG3044 –pEG3055). With SUMO the first 10 amino acids of the pTEV5 construct were used as a HIS6 tag to limit the effect of the gene sequence on RBS translation rate. The SUMO sequence and the 4 amino acids N-terminal to SUMO were synthesized as a gblock with sequence from pE-SUMO vector (1001K, LifeSensors, Malvern, PA, USA). For initial testing of the RSTN construct in Figs 1C and 2, the linker between SUMO and the precursor peptide was adapted from the pTEV5 tag to generate Link-1 (plasmids pEG3058 –pEG3067). For future experiments with RSTN, the repeated HIS6 sequence was removed, a TEV cleavage site was added, a cysteine after the SUMO cut site and after the TEV cut site was added, and several other amino acids were added to increase the length of the linker to generate Link-2 (plasmids pEG3121 –pEG3132 and pEG3248). A later iteration also added BsaI sites around the operon for subcloning (plasmids pEG2192 –pEG2575, pEG3157 –pEG3197, pEG3283 –pEG3286, and pEG3563 –pEG3905). When using RSTC, the N-terminal affinity tag is maintained, but the four amino acids between HIS6 and SUMO were removed (ATag-3). For the linker (Link-3), the TEV cleavage site was removed from the C-terminus of the linker and a thrombin site was added to the N-terminus. An affinity tag was also added to the C-terminus of SUMO (ATag-4) in case the leader (and tag) was removed during modification (plasmids pEG3212 –pEG3215 and pEG3553 –pEG3562). Modifying enzymes and their individual RBSs are expressed using pLux in pEG1128 or pCym in pEG7034–7171 (flanking SapI sites around RBS+CDS), except pEG7172 and pEG7173 which lacked flanking SapI sites.
Peptide expression/modification from flasks and purification
Plasmids were transformed into E. coli BL21, struck out on 2xYT agar with carbenicillin (or chloramphenicol for pEG3017) and kanamycin (if co-transforming modifying enzyme) and incubated (30°C, overnight). Individual colonies were used to inoculate 3 mL of LB media in a culture tube (352059, Corning, NY, USA) and incubated overnight (30°C, 250 r.p.m.) in an Innova44 (Eppendorf, NY, USA). Aliquots (500 μl) were taken from the overnight cultures and subcultured into 50 mL of LB media in a 250 mL Erlenmeyer flask. After 3 hours incubation (Innova44, 30°C, 250 r.p.m.), IPTG and 3OC6-AHL (if inducing modifying enzyme) was added to final concentrations of 1 mM and 10 μM and cultures were incubated for 20 hours (Innova44, 18°C, 250 r.p.m.) (note: IPTG was not added for pEG3017, where the MBP-tagged peptide is constitutively expressed). The 50 mL cultures were transferred to a falcon tube (352070, Corning, NY, USA), centrifuged (4,500g, 4°C, 20 min) in a Sorvall Legend XFR Centrifuge (Thermo Fisher Scientific, MA, USA), pellets were resuspended in 600 μl lysis buffer (5 M guanidinium hydrochloride, 300 mM NaCl, 50 mM sodium phosphate, pH 7.5), and freeze-thawed twice (frozen in -80°C freezer; thawed in innova44 incubator at 30°C, 250 r.p.m). Cell lysates were centrifuged (Eppendorf 5424, 21,130g, room temperature, 15 min) in an Eppendorf 5424 Centrifuge (Eppendorf, NY, USA) and the peptides affinity purified using His SpinTrap TALON columns (29-0005-93, GE Life Sciences (now Cytiva), Marlborough, MA, USA), following manufacturer instructions, using 600 μL lysis buffer twice for column equilibration, loading 600 μL clarified lysate, two washes with 600 μL wash buffer (300 mM NaCl, 50mM sodium phosphate, 5 mM imidazole, pH 7.5), and 200 μL elution buffer (300mM NaCl, 50mM sodium phosphate, 200 mM imidazole, pH 7.5) for elution. Purifications used an Eppendorf 5424 centrifuge.
Liquid chromatography/mass spectrometry
All chromatography was performed using mobile phases ACN (acetonitrile supplemented with 0.1% formic acid and 0.1% water) and water (supplemented with 0.1% formic acid). LC-MS was performed on one of two mass spectrometers: “QQQ” is an Agilent 1260 Infinity liquid chromatograph with binary pump configured in low-dwell volume mode, high-performance autosampler chilled to 18°C, and column oven, coupled to an Agilent 6420 QQQ mass spectrometer equipped with an Agilent electrospray ionization (ESI) source; nitrogen gas is supplied by a Parker Nitroflowlab and ESI source parameters are 350°C gas temp at 12 L/min flow rate, 15 psi nebulizer voltage, 4000 V capillary voltage, 135 V fragmentor voltage, and 7 V cell accelerator voltage. “QTOF” is an Agilent 1260 Infinity II liquid chromatograph with binary pump configured in low-dwell volume mode and column oven set to 40°C, coupled to an Agilent 6545 QTOF mass spectrometer equipped with an Agilent electrospray ionization (ESI) source; nitrogen gas is building supplied and ESI source parameters are 350°C gas temperature, 12 L/min gas flow, 30 psig nebulizer pressure, 350°C sheath gas temperature, 8 L/min sheath gas flow, 3000 V capillary voltage, 1000 V nozzle voltage, 135 V fragmentor voltage, 15 V skimmer voltage, 600 V Oct 1 RF Vpp; the mass spectrometer was run in MS mode with reference mass enabled and tuned in positive mode with standard mass range (3200 m/z) and 2 GHz extended dynamic range. A list of peptide extracts analyzed is provided in S4 Table, which details the machine used for each extract. For extracts 192–202: analysis was done with a Phenomenex Aeris Widepore XB-C18 3.6 μm 150 mm x 2.1 mm column with column oven set to 30°C. Flow rate was 0.5 ml/min. Gradient was 3% ACN for 3 min, 3% to 54% ACN over 34 min, 54% to 97% ACN over 3 min, 97% ACN for 5 min, with 5 min re-equilibration. The mass spectrometer was run in positive mode 100–2250 m/z range with a 200 ms scan time. Injections were 20 μL. For extracts 216–225: analysis was done with a Phenomenex Aeris Widepore XB-C18 3.6 μm 150 mm x 2.1 mm column with column oven set to 30°C. Flow rate was 0.5 mL/min. The gradient was 10% ACN for 1 min, 10% to 15% ACN over 1.5 min, 15% to 50% ACN over 52.5 min, 50% to 97% ACN over 3 min, 97% ACN for 2 min, with 5 min re-equilibration. The mass spectrometer was run in positive mode 100–2250 m/z range with a 200 ms scan time. Injections were 20 μl. For all other extracts analyzed with “QQQ”: analysis was done with a Phenomenex Aeris PEPTIDE XB-C18 2.6 μm 50 mm x 2.1 mm column with column oven set to 40°C. Flow rate was 0.6 mL/min. Gradient was 20% ACN for 0.5 min, 20% to 47% ACN over 4.5 min, 47% to 90% ACN over 0.5 min, 90% ACN for 1 min, with 0.8 min re-equilibration. The mass spectrometer was run in positive mode, 500–2000 m/z range with a 300 ms scan time. Injections were 10 μL for extracts from LB media and 5 μL for extracts from TB media (as a starting point, injection volumes were occasionally adjusted depending on the yield of the 96-well prep). For all extracts analyzed with “QTOF”: analysis was done with a Phenomenex Aeris PEPTIDE XB-C18 2.6 μm 50 mm x 2.1 mm column. Flow rate was set at 0.5 mL/min. The gradient used was 20% ACN for 0.5 min, 20% to 55% ACN over 5.5 min, 55% to 90% ACN over 0.5 minutes, 90% ACN for 1.5 min, with 0.8 min re-equilibration. Injections were 5 μL for extracts from LB media and 2 μL for extracts from TB media (as a starting point, injection volumes were occasionally adjusted depending on the yield of the 96-well prep).
Calculation of peptide molar masses
For large peptides/proteins, mass was calculated as described for ESIprot [84]: five consecutively charged m/z’s (m1, m2, m3, m4, m5) were taken from the spectra and used to calculate the charge states (z1, z2, z3, z4, z5) for each of the peaks. For peaks m1 and m2, which have charge states, z1 and z2, where z2 = z1−1 (peak 1 has one proton more than peak 2): z1 = (m2-1)/(m2-m1). Charges z1, z2, z3, and z4 were calculated using each of the four pairs of consecutively charged masses (m1 and m2, m2 and m3, m3 and m4, m4 and m5), subtracted by the number of protons the peak has compared to m5, and averaged together and rounded to the nearest integer to calculate the lowest charge (z5). Charges z1-4 are recalculated based on charge z5 (z1 = z5 + 4, z2 = z5 + 3, etc.), uncharged masses are calculated from each of the five m/z’s: uncharged mass = (zx • observed m/z)–zx. The mean and standard deviation was calculated from each observed m/z to give the average observed mass and mass standard deviation. The final reported standard deviation is the greater of either the mass standard deviation or the standard deviation calculated from the manufacturer-specified resolution of the mass spectrometer. For the QQQ mass spectrometer: peak full width half max (FWHM) = 0.7, σ = 0.3 is scaled by the largest charge state observed to give the standard deviation < 0.3 z1. For the QTOF, FWHM = 0.06, σ = 0.026, standard deviation < 0.026 z1.
Peptide expression in 96-well plates
Plasmids were transformed into E. coli NEB Express using 15–20 μL of competent cells and 1 μL of each plasmid being transformed in a 96-well PCR plate (1402–9596, USA Scientific, FL, USA or 951020401, Eppendorf, NY, USA). Transformations were incubated on ice (20–30 min), heat shocked (42°C, 30 sec), and incubated on ice again (5 min). Cells were then transferred to a deep well 96-well plate (1896–2000, USA Scientific, FL, USA) with 100 μL of SOC media. After outgrowth (Multitron Pro, 1 hr, 37°C) in an Infors HT Multitron Pro (Infors USA, MD, USA), 400 μL LB media was added with appropriate antibiotics (100 μg/ml carbenicillin and 50 μg/ml kanamycin) and incubated (Multitron Pro, 30°C, 900 r.p.m.) until all wells reached stationary phase (cultures were visibly saturated, 12–30 hours). Overnight cultures were diluted 1:100 into 1 mL LB or TB media with antibiotics in deep well plates. After a 3 hour incubation (Multitron Pro, 30°C, 900 r.p.m.), appropriate inducer was added (1 mM IPTG and 200 μM cumate) and cultures were incubated for 20 hours (Multitron Pro, 30°C, 900 r.p.m.). The 96-well plates were centrifuged (Legend XFR, 4,500 g, 4°C, 20 min) and media discarded. Pellets were stored at -20°C until purification.
Haloduracin production and purification
Haloduracin was produced following the 96-well expression protocol described above, with each sample being produced in two wells of 1 mL TB media to double the amount of product produced. Culture pellets were resuspended in 800 μL lysis buffer, freeze-thawed (frozen at -80°C; thawed in Multitron Pro at 37°C, 900 r.p.m), and centrifuged (Legend XFR, 4,500 g, 4°C, 30 min). Peptides were affinity purified using HIS MultiTrap TALON plates, using 500 μL water and two 500 μL lysis buffer washes for column equilibration (Legend XFR, 500 g, 4°C, 2 min), loading 600 μL of both matching sample’s clarified lysates iteratively (load one, then centrifuge, then load the second, then centrifuge) (Legend XFR, 100 g, 4°C, 5 min), washing twice with 500 μL wash buffer, and eluting three times with 200 μL elution buffer to maximize titer. Purification was followed by solid-phase extraction (SPE) using Strata-XL microtiter plates (8E-S043-TGB, Phenomenex, CA, USA). Plates were conditioned with 1 mL methanol wash followed by 1 mL water wash. All 600 μL of TALON eluent was loaded, washed twice with 1 mL water, and then eluted twice with 500 μl 1:1 acetonitrile:water (supplemented with 0.1% formic acid). Plates with eluent were dried down at room temperature in a Savant Speedvac SPD2010 (Thermo Fisher Scientific, MA, USA), samples resuspended in 40 μL TE buffer (10 mM tris, 1 mM EDTA) with 20 μL 2 mg/mL TEV protease, and then incubated (stationary, 30°C, 8 hr). Cut fractions were desalted using a Strata-X SPE plate (8E-S100-TGB, Phenomenex, CA, USA) with same condition/wash/elution/drying steps as above. Dried down samples were resuspended in 50 μL 1:1 methanol:water.
LC-MS/MS data acquisition
Chromatography was performed using the same ACN/water mobile phases as for LC-MS. MS/MS data were acquired on an Agilent 1260 Infinity II liquid chromatograph with binary pump configured in low-dwell volume mode and column oven set to 40°C, coupled to an Agilent 6545 QTOF mass spectrometer equipped with an Agilent electrospray ionization (ESI) source. Nitrogen gas is building-supplied and ESI source parameters are 350°C gas temperature, 12 L/min gas flow, 30 psig nebulizer pressure, 350°C sheath gas temperature, 8 L/min sheath gas flow, 3000 V capillary voltage, 1000 V nozzle voltage, 135 V fragmentor voltage, 15 V skimmer voltage, 600 V Oct 1 RF Vpp; the mass spectrometer was run in MS mode with reference mass enabled and tuned in positive mode with standard mass range (3200 m/z) and 2 GHz extended dynamic range. For the targeted MS/MS, the parent ion was selected, and 4 spectra/s were sampled with fixed collision energies of 30, 45, 60, and 75 V. A narrow isolation width (1.3 m/z) and observed monoisotopic mass (exact masses found in S1–S5 Notes) was used for fragmentation of each peptide. Sample analysis was performed with a Phenomenex Aeris PEPTIDE XB-C18 2.6 μm 50 mm x 2.1 mm column. The flow rate was set at 0.5 mL/min and 5 μl sample was injected. The gradient used was 10% ACN for 1.0 min, 10% to 70% ACN over 5.0 min, 70% to 90% ACN over 0.5 minutes, 90% ACN for 1.0 min, with 1.0 min re-equilibration. Accurate mass predictions of peptides were generated using the online resource, ChemCalc [85]
MS/MS data analysis and spectra annotation
A published algorithm [59] was adapted for Python from R-package. For each spectrum, all ion abundances were capped at an upper limit/ceiling calculated as the user-specified upper limit multiplied by the average peak abundance in the spectrum. Background peaks below a threshold were then removed in mass window slices (number of slices is user-specified), where the threshold was calculated as the average peak abundance in each slice and its flanking slices, multiplied by a user-specified signal-to-noise threshold. Hypothetical ions were then matched to the filtered MS/MS signals within a user-specified m/z tolerance. Hypothetical ion m/z’s were determined by first calculating the parent ion mass (H+ adduct of full-length peptide with modification). The peptide was iteratively truncated along its backbone from its N-terminus (for y ions) or its C-terminus (for b ions). The masses of these truncated peptides were calculated and made up the hypothetical ion masses. Only monoisotopic masses were considered. To avoid spectrum deconvolution, we enabled the algorithm to work with multiply charged masses. To do this, we calculated the expected m/z values for [M+H]1+, [M+2H]2+, and [M+3H]3+ charge states of all hypothetical ions for matching to observed m/z values.
Haloduracin antimicrobial assay
Bacillus subtilis PY79 was used as indicator strain for purified haloduracins. Overnight cultures were resuspended in LB (OD600 = 0.1) in a Cary 60 UV-Vis spectrometer (Agilent Technologies, CA, USA) and then a cotton-tipped applicator (24-106-2S, McKesson, TX, USA) was used to spread on an LB-agar surface. After drying, 5 μL of purified haloduracin or solvent control was added to plate. For combined haloduracin A1 + A2, 2.5 μL of each purified peptide was added to the same spot. Strains were incubated (30°C, overnight) and zones of inhibition visualized.
Proteolytic cleavage and removal of SUMO
For purification of haloduracin for antimicrobial assays, TEV protease was purified as described previously [86] [Addgene #8827, concentrated to 2 mg/mL in TEV buffer (25 mM Tris-HCl, pH 8.0, 50 mM NaCl, 1 mM TCEP, 50% glycerol)]. For MS/MS analysis, TEV protease was prepared as a 50 mg/mL solution of 10% (w/w) TEV lyophilizate (Gene and Cell Technologies, CA, USA) in TEV Buffer.
Peptide purification in 96-well plates
Culture pellets were resuspended in 800 μl lysis buffer, and freeze-thawed (frozen in liquid nitrogen at -196°C; thawed in Multitron Pro at 37°C, 900 r.p.m). Cell lysates were centrifuged (Legend XFR, 4,500 g, 4°C, 30 min) and peptides affinity-purified using His MultiTrap TALON plates (29-0005-96, GE Life Sciences (now Cytiva), Marlborough, MA, USA), following manufacturer instructions with a centrifuge used for each wash/flow-through step, using 500 μL water and two 500 μL lysis buffer washes for column equilibration (Legend XFR, 500 g, 4°C, 2 min), loading 600 μL clarified lysate (Legend XFR, 100 g, 4°C, 5 min), washing twice with 500 μL wash buffer (Legend XFR, 500 g, 4°C, 2 min), and eluting with 200 μL elution buffer (Legend XFR, 500 g, 4°C, 2 min).
LCMS data analysis and peak integration
LC-MS datafiles were converted to mzXML format using MassHunter (Agilent Technologies, Santa Clara, CA, USA). Data were exported in centroid format with no deconvolution and a minimum signal intensity of 1,000. mzXML files were parsed and imported into python to a long-form pandas dataframe and filtered for signals between 1–6 min and 500–2,500 Da. For each extract, the expected molecular weight of unmodified, modified, and partially modified (if applicable) peptides were calculated based on the peptide sequence and the molar weights of each amino acid. Partial modification is only considered if the modification mass shift of individual modifications are ≥15 Da (S3 Table), since smaller shifts are not clearly resolved. For each modification state molecular weight, all charge state [M+xH]x+ (x is number of protons/charges) masses were calculated and extracted as an EIC with a mass window of +/- 2/x Da for extracts analyzed with “QQQ” and 1/x Da for extracts analyzed with “QTOF”. Charge state EIC intensities were summed together at each timepoint to generate an extracted compound chromatogram (ECC). If present, an ECC peak is fit with a skewed gaussian with parameters peak area, retention time, peak width, peak skew, and peak baseline. Peaks are considered real/trustworthy based on the following criteria: greater than eight charge states present/observed at the same retention time (+/- 0.2 min) with at least four being consecutive charge states, only one “large” peak in the ECC (i.e., there are no peaks greater than 80% of the largest peak height in the chromatogram), and not more than two “small” peaks (i.e., <3 peaks are greater than 40% of the largest peak height), peak skew between 0 and 1.5, peak width less than or equal to 0.25. Within an extract, “total peptide” is defined as the sum of the peak areas of unmodified, modified, and partially modified (if applicable). Fraction modified is defined as the modified peptide peak area divided by the “total peptide”. Peak integrations and masses for each extract are listed in the peak list file available on GitHub. Chromatograms, peak fits, and spectra for a representative replicate are shown in S5 Fig. All analyses were performed in Python 3.5 using pandas, scipy, numpy, and matplotlib libraries.
Peptide purification for MS/MS analysis
Peptide/enzyme pairs presented with LC-MS/MS data in S2–S5 Notes were produced and purified following the protocol described for 96-well affinity expression and purification, except samples were eluted 3x with 200 μL elution buffer. After elution, 6 μL of TEV protease (Gene and Cell Technologies, USA) (50 mg of TEV lyophilizate/mL in TE buffer, lyophilizate is 10% TEV by weight) and 20 μL of TCEP (20 mM in water) were added to the eluent and incubated overnight at room temperature. A 20 μL aliquot of the sample was taken from this reaction and diluted 1:2 in TE buffer, centrifuged to remove precipitate, and analyzed with LC-MS/MS.
Supporting information
S1 Fig. Plasmid maps used in this study.
a. Architectures of precursor peptide plasmid with MBP. b. Architectures of precursor peptide plasmids without SUMO (with ATag-1). c. Architectures of precursor peptide plasmids with the initially used RSTN (with Link-1). d. Architectures of plasmids with RSTN (with ATag-2 and Link-2). e. Architectures of plasmids with RSTN, with flanking BsaI sites added around the operon for optional subcloning. f. Architectures of plasmids with RSTC. g. Architecture of modifying enzyme plasmid with pLux promoter. h. Architectures of modifying enzyme plasmids with pCym promoter. i. Architecture of modifying enzyme plasmids with pCym promoter and SapI sites around RBS+gene for optional subcloning.
https://doi.org/10.1371/journal.pone.0266488.s001
(PDF)
S2 Fig. Haloduracin zones of inhibition.
Each plate represents an individual replicate of Haloduracin A1 and A2 expression, purification, cleavage, and assaying for zone of inhibition. Quadrants are Haloduracin A1 (top right), Haloduracin A2 (top left), Haloduracin A1 and A2 (bottom right), and 50% methanol in water %vol/vol solvent control (bottom left).
https://doi.org/10.1371/journal.pone.0266488.s002
(PDF)
S3 Fig. Chromatograms of high-throughput peptide expression/modification samples.
Spectra of the modified peaks are shown on the right. Chromatograms are grouped by peptide plasmid number and modifying enzyme plasmid number (numbers are shown without the preceeding “pEG”). Both expression medias are shown, with one replicate chosen for each and the extract number listed for the chosen replicate. Y-axis for all plots is Intensity in arbitrary units (x107 for extracts analyzed with QQQ and x105 for extracts analyzed with QTOF). From left to right, plots represent: TIC (black trace), ECC of unmodified peptide (green trace), ECC of partially modified peptide(s) (blue if passes peak thresholds or yellow otherwise), ECC of properly modified peptide (red trace), and mass spectrum. In ECC plots, peak fit is drawn as a black line. If a peak passes all thresholds, the peak area (x105 for extracts analyzed with QQQ and x103 for extracts analyzed with QTOF) is provided as a colored number in the top right corner of the plot. Partially modified peaks can have multiple numbers listed, one for each modification state (these are ordered ascending by magnitude top-to-bottom, left-to-right). All extracts are analyzed with QQQ unless there is an asterisk (*) next to the extract number, which denotes analyzed with QTOF. Peak lists and raw data are available for more detailed analysis on Zenodo.
https://doi.org/10.1371/journal.pone.0266488.s003
(PDF)
S4 Fig. Bar chart of successful peptide/modifying enzyme combinations.
Peptide plasmid number and gene name, modifying enzyme plasmid number and gene name, and replicate extract numbers are listed alongside fraction modified in TB (dark grey) and LB (light grey) medias. Dashed line demarcates 50% modification (half of peptide modified). Three replicates are shown for each peptide/enzyme/media combination and represent independent expressions that were purified and assayed. Bar is shown at the replicate mean.
https://doi.org/10.1371/journal.pone.0266488.s004
(PDF)
S5 Fig. SboA+AlbA post-cleavage mass spectrum.
Spectra of small mass-shift modification catalyzed by AlbA. Ion distribution for [M+4H]4+ is shown, with unmodified, modified, and observed m/z’s listed for the monoisotopic mass, which is labeled with an arrow. Data for cleaved peptide was collected once.
https://doi.org/10.1371/journal.pone.0266488.s005
(PDF)
S6 Fig. Phylogenetic tree species from which we mine enzymes.
Tree was generated using the producing organisms listed in S2 Table and the NCBI Common Tree application. Species from which functional enzymes were sourced are black, nonfunctional are red.
https://doi.org/10.1371/journal.pone.0266488.s006
(PDF)
S1 Table. Flask-expressed peptide mass spectra masses.
https://doi.org/10.1371/journal.pone.0266488.s012
(PDF)
S2 Table. Pathways investigated in this study.
https://doi.org/10.1371/journal.pone.0266488.s013
(PDF)
S3 Table. Chemical modification types studied.
https://doi.org/10.1371/journal.pone.0266488.s014
(PDF)
S4 Table. Structural validation of modified peptides.
https://doi.org/10.1371/journal.pone.0266488.s015
(PDF)
S6 Table. Enzyme and peptide amino acid sequences.
https://doi.org/10.1371/journal.pone.0266488.s017
(PDF)
S1 File. Supplementary references.
References from Supporting information figures/tables/notes.
https://doi.org/10.1371/journal.pone.0266488.s020
(PDF)
Acknowledgments
We thank Alexander Cristofaro (MIT-Broad Foundry) for help with bioinformatics, Prof. Mohammad Seyedsayamdost (Princeton University) for providing plasmid encoding StrA/B, Prof. Eric Schmidt (Univeristy of Utah) for providing plasmid encoding trunkamide/patellins, and Prof. Elke Dittman (University of Potsdam) for providing plasmid encoding microviridin.
References
- 1. Zhang MM, Qiao Y, Ang EL, Zhao H. Using natural products for drug discovery: the impact of the genomics era. Expert Opinion on Drug Discovery. 2017;12(5):475–87. pmid:28277838
- 2. Smanski MJ, Zhou H, Claesen J, Shen B, Fischbach MA, Voigt CA. Synthetic biology to access and expand nature’s chemical diversity. Nature Reviews Microbiology. 2016;14(3):135–49. pmid:26876034
- 3. Medema MH, Fischbach MA. Computational approaches to natural product discovery. Nature Chemical Biology. 2015;11(9):639–48. pmid:26284671
- 4. Bayer TS, Widmaier DM, Temme K, Mirsky EA, Santi DV, Voigt CA. Synthesis of Methyl Halides from Biomass Using Engineered Microbes. Journal of the American Chemical Society. 2009;131(18):6508–15. pmid:19378995
- 5. Freeman MF, Gurgui C, Helf MJ, Morinaka BI, Uria AR, Oldham NJ, et al. Metagenome Mining Reveals Polytheonamides as Posttranslationally Modified Ribosomal Peptides. Science. 2012;338(6105):387–90. pmid:22983711
- 6. Guo C-J, Chang F-Y, Wyche TP, Backus KM, Acker TM, Funabashi M, et al. Discovery of Reactive Microbiota-Derived Metabolites that Inhibit Host Proteases. Cell. 2017;168(3):517–26.e18. pmid:28111075
- 7. Arnison PG, Bibb MJ, Bierbaum G, Bowers AA, Bugni TS, Bulaj G, et al. Ribosomally synthesized and post-translationally modified peptide natural products: overview and recommendations for a universal nomenclature. Natural Product Reports. 2013;30(1):108–60. pmid:23165928
- 8. Lin PF, Samanta H, Bechtold CM, Deminie CA, Patick AK, Alam M, et al. Characterization of siamycin I, a human immunodeficiency virus fusion inhibitor. Antimicrobial Agents and Chemotherapy. 1996;40(1):133–8. pmid:8787894
- 9. Jin M, Liu L, Wright SAI, Beer SV, Clardy J. Structural and Functional Analysis of Pantocin A: An Antibiotic from Pantoea agglomerans Discovered by Heterologous Expression of Cloned Genes. Angewandte Chemie International Edition. 2003;42(25):2898–901. pmid:12833353
- 10. Potterat O, Wagner K, Gemmecker G, Mack J, Puder C, Vettermann R, et al. BI-32169, a Bicyclic 19-Peptide with Strong Glucagon Receptor Antagonist Activity fromStreptomycessp. Journal of Natural Products. 2004;67(9):1528–31. pmid:15387654
- 11. Yano K, Yamasaki M, Yoshida M, Matsuda Y, Yamaguchi K. RES-701-2, a Novel and Selective Endothelin Type B Receptor Antagonist Produced by Streptomyces sp. II. Determination of the Primary Structure. The Journal of Antibiotics. 1995;48(11):1368–70. pmid:8557586
- 12. Vogt E, Künzler M. Discovery of novel fungal RiPP biosynthetic pathways and their application for the development of peptide therapeutics. Applied Microbiology and Biotechnology. 2019;103(14):5567–81. pmid:31147756
- 13. Skinnider MA, Johnston CW, Edgar RE, Dejong CA, Merwin NJ, Rees PN, et al. Genomic charting of ribosomally synthesized natural product chemical space facilitates targeted mining. Proceedings of the National Academy of Sciences. 2016;113(42):E6343–E51. pmid:27698135
- 14. Montalban-Lopez M, Scott TA, Ramesh S, Rahman IR, van Heel AJ, Viel JH, et al. New developments in RiPP discovery, enzymology and engineering. Nat Prod Rep. 2021;38(1):130–239. Epub 20200916. pmid:32935693; PubMed Central PMCID: PMC7864896.
- 15. Li Y. Recombinant production of antimicrobial peptides in Escherichia coli: A review. Protein Expression and Purification. 2011;80(2):260–7. pmid:21843642
- 16. Shi Y, Yang X, Garg N, Van Der Donk WA. Production of Lantipeptides inEscherichia coli. Journal of the American Chemical Society. 2011;133(8):2338–41. pmid:21114289
- 17. Donia MS, Ravel J, Schmidt EW. A global assembly line for cyanobactins. Nature Chemical Biology. 2008;4(6):341–3. pmid:18425112
- 18. Zhang Y, Li K, Yang G, Mcbride JL, Bruner SD, Ding Y. A distributive peptide cyclase processes multiple microviridin core peptides within a single polypeptide substrate. Nature Communications. 2018;9(1). pmid:29725007
- 19. Lee H, Park Y, Kim S. Enzymatic Cross-Linking of Side Chains Generates a Modified Peptide with Four Hairpin-like Bicyclic Repeats. Biochemistry. 2017;56(37):4927–30. pmid:28841794
- 20. Meulenberg J. Cloning of Klebsiella pneumoniae pqq genes and PQQ biosynthesis in Escherichia coli. 1990;71(3):337–43.
- 21. Morinaka BI, Lakis E, Verest M, Helf MJ, Scalvenzi T, Vagstad AL, et al. Natural noncanonical protein splicing yields products with diverse β-amino acid residues. Science. 2018;359(6377):779–82. pmid:29449488
- 22. Himes PM, Allen SE, Hwang S, Bowers AA. Production of Sactipeptides in Escherichia coli: Probing the Substrate Promiscuity of Subtilosin A Biosynthesis. ACS Chemical Biology. 2016;11(6):1737–44. pmid:27019323
- 23. Zhang Z, Hudson GA, Mahanta N, Tietz JI, Van Der Donk WA, Mitchell DA. Biosynthetic Timing and Substrate Specificity for the Thiopeptide Thiomuracin. Journal of the American Chemical Society. 2016;138(48):15511–4. pmid:27700071
- 24. Van Staden ADP, Faure LM, Vermeulen RR, Dicks LMT, Smith C. Functional Expression of GFP-Fused Class I Lanthipeptides inEscherichia coli. ACS Synthetic Biology. 2019;8(10):2220–7. pmid:31553571
- 25. Zhu S, Hegemann JD, Fage CD, Zimmermann M, Xie X, Linne U, et al. Insights into the Unique Phosphorylation of the Lasso Peptide Paeninodin. Journal of Biological Chemistry. 2016;291(26):13662–78. Epub 2016/05/05. pmid:27151214; PubMed Central PMCID: PMC4919450.
- 26. Hegemann JD, Zimmermann M, Xie X, Marahiel MA. Caulosegnins I–III: A Highly Diverse Group of Lasso Peptides Derived from a Single Biosynthetic Gene Cluster. Journal of the American Chemical Society. 2013;135(1):210–22. pmid:23214991
- 27. Ren H, Biswas S, Ho S, Van Der Donk WA, Zhao H. Rapid Discovery of Glycocins through Pathway Refactoring in Escherichia coli. ACS Chemical Biology. 2018;13(10):2966–72. pmid:30183259
- 28. Serebryakova M, Tsibulskaya D, Mokina O, Kulikovsky A, Nautiyal M, Van Aerschot A, et al. A Trojan-Horse Peptide-Carboxymethyl-Cytidine Antibiotic from Bacillus amyloliquefaciens. Journal of the American Chemical Society. 2016;138(48):15690–8. pmid:27934031
- 29. Weiz R, Annika , Ishida K, Makower K, Ziemert N, Hertweck C, Dittmann E. Leader Peptide and a Membrane Protein Scaffold Guide the Biosynthesis of the Tricyclic Peptide Microviridin. Chemistry & Biology. 2011;18(11):1413–21. pmid:22118675
- 30. Burkhart BJ, Kakkar N, Hudson GA, Van Der Donk WA, Mitchell DA. Chimeric Leader Peptides for the Generation of Non-Natural Hybrid RiPP Products. ACS Central Science. 2017;3(6):629–38. pmid:28691075
- 31. Bhushan A, Egli PJ, Peters EE, Freeman MF, Piel J. Genome mining- and synthetic biology-enabled production of hypermodified peptides. Nature Chemistry. 2019;11(10):931–9. pmid:31501509
- 32. Young S, Travis , Dorrestein C, Pieter , Walsh T, Christopher . Codon Randomization for Rapid Exploration of Chemical Space in Thiopeptide Antibiotic Variants. Chemistry & Biology. 2012;19(12):1600–10. pmid:23261603
- 33. Knappe TA, Linne U, Zirah SV, Rebuffat S, Xie X, Marahiel MA. Isolation and Structural Characterization of Capistruin, a Lasso Peptide Predicted from the Genome Sequence of Burkholderia thailandensis E264. Journal of the American Chemical Society. 2008;130(34):11446–54. pmid:18671394
- 34. Mcclerren AL, Cooper LE, Quan C, Thomas PM, Kelleher NL, Van Der Donk WA. Discovery and in vitro biosynthesis of haloduracin, a two-component lantibiotic. Proceedings of the National Academy of Sciences. 2006;103(46):17243–8. pmid:17085596
- 35. Hudson GA, Zhang Z, Tietz JI, Mitchell DA, Van Der Donk WA. In Vitro Biosynthesis of the Core Scaffold of the Thiopeptide Thiomuracin. Journal of the American Chemical Society. 2015;137(51):16012–5. pmid:26675417
- 36. Reyna‐González E, Schmid B, Petras D, Süssmuth RD, Dittmann E. Leader Peptide‐Free In Vitro Reconstitution of Microviridin Biosynthesis Enables Design of Synthetic Protease‐Targeted Libraries. Angewandte Chemie International Edition. 2016;55(32):9398–401. pmid:27336908
- 37. Vinogradov AA, Shimomura M, Goto Y, Ozaki T, Asamizu S, Sugai Y, et al. Minimal lactazole scaffold for in vitro thiopeptide bioengineering. Nature Communications. 2020;11(1). pmid:32385237
- 38. Rosano GNL, Ceccarelli EA. Recombinant protein expression in Escherichia coli: advances and challenges. Frontiers in Microbiology. 2014;5(172). pmid:24860555
- 39.
Panavas T, Sanders C, Butt TR. SUMO Fusion Technology for Enhanced Protein Production in Prokaryotic and Eukaryotic Expression Systems. Methods in Molecular Biology: Humana Press; 2009. p. 303–17.
- 40. Gaglione R, Pane K, Dell’Olmo E, Cafaro V, Pizzo E, Olivieri G, et al. Cost-effective production of recombinant peptides in Escherichia coli. New Biotechnology. 2019;51:39–48. pmid:30790718
- 41. Satakarni M, Curtis R. Production of recombinant peptides as fusions with SUMO. Protein Expression and Purification. 2011;78(2):113–9. pmid:21586326
- 42. He J, Luo X, Jin D, Wang Y, Zhang T. Identification, Recombinant Expression, and Characterization of LGH2, a Novel Antimicrobial Peptide of Lactobacillus casei HZ1. Molecules. 2018;23(9):2246. pmid:30177656
- 43. Ma Q, Yu Z, Han B, Wang Q, Zhang R. Expression and purification of lacticin Q by small ubiquitin-related modifier fusion in Escherichia coli. The Journal of Microbiology. 2012;50(2):326–31. pmid:22538663
- 44. Nguyen TQN, Tooh YW, Sugiyama R, Nguyen TPD, Purushothaman M, Leow LC, et al. Post-translational formation of strained cyclophanes in bacteria. Nature Chemistry. 2020;12(11):1042–53. pmid:32807886
- 45. Meyer AJ, Segall-Shapiro TH, Glassey E, Zhang J, Voigt CA. Escherichia coli “Marionette” strains with 12 highly optimized small-molecule sensors. Nature Chemical Biology. 2019;15(2):196–204. pmid:30478458
- 46. Rocco CJ, Dennison KL, Klenchin VA, Rayment I, Escalante-Semerena JC. Construction and use of new cloning vectors for the rapid isolation of recombinant proteins from Escherichia coli. Plasmid. 2008;59(3):231–7. pmid:18295882
- 47. Mrksich M. Mass Spectrometry of Self-Assembled Monolayers: A New Tool for Molecular Surface Science. ACS Nano. 2008;2(1):7–18. pmid:19206542
- 48. Huang C-F, Mrksich M. Profiling Protein Tyrosine Phosphatase Specificity with Self-Assembled Monolayers for Matrix-Assisted Laser Desorption/Ionization Mass Spectrometry and Peptide Arrays. ACS Combinatorial Science. 2019;21(11):760–9. pmid:31553163
- 49. Espah Borujeni A, Channarasappa AS, Salis HM. Translation rate is controlled by coupled trade-offs between site accessibility, selective RNA unfolding and sliding at upstream standby sites. Nucleic Acids Research. 2014;42(4):2646–59. pmid:24234441
- 50. Salis HM, Mirsky EA, Voigt CA. Automated design of synthetic ribosome binding sites to control protein expression. Nature Biotechnology. 2009;27(10):946–50. pmid:19801975
- 51. Hou Y, Tianero MDB, Kwan JC, Wyche TP, Michel CR, Ellis GA, et al. Structure and Biosynthesis of the Antibiotic Bottromycin D. Organic Letters. 2012;14(19):5050–3. pmid:22984777
- 52. Schramma KR, Bushin LB, Seyedsayamdost MR. Structure and biosynthesis of a macrocyclic peptide containing an unprecedented lysine-to-tryptophan crosslink. Nature Chemistry. 2015;7(5):431–7. pmid:25901822
- 53. Babasaki K, Takao T, Shimonishi Y, Kurahashi K. Subtilosin A, a New Antibiotic Peptide Produced by Bacillus subtilis 168: Isolation, Structural Analysis, and Biogenesis1. The Journal of Biochemistry. 1985;98(3):585–603. pmid:3936839
- 54. Breil B, Borneman J, Triplett EW. A newly discovered gene, tfuA, involved in the production of the ribosomally synthesized peptide antibiotic trifolitoxin. Journal of bacteriology. 1996;178(14):4150–6. pmid:8763943
- 55. Li B, Sher D, Kelly L, Shi Y, Huang K, Knerr PJ, et al. Catalytic promiscuity in the biosynthesis of cyclic peptide secondary metabolites in planktonic marine cyanobacteria. Proceedings of the National Academy of Sciences. 2010;107(23):10430–5. pmid:20479271
- 56. Morris RP, Leeds JA, Naegeli HU, Oberer L, Memmert K, Weber E, et al. Ribosomally Synthesized Thiopeptide Antibiotics Targeting Elongation Factor Tu. Journal of the American Chemical Society. 2009;131(16):5946–55. pmid:19338336
- 57. Noike M, Matsui T, Ooya K, Sasaki I, Ohtaki S, Hamano Y, et al. A peptide ligase and the ribosome cooperate to synthesize the peptide pheganomycin. Nature Chemical Biology. 2015;11(1):71–6. pmid:25402768
- 58. Cooper LE, Mcclerren AL, Chary A, Van Der Donk WA. Structure-Activity Relationship Studies of the Two-Component Lantibiotic Haloduracin. Chemistry & Biology. 2008;15(10):1035–45. pmid:18940665
- 59. Zhang Q, Ortega M, Shi Y, Wang H, Melby JO, Tang W, et al. Structural investigation of ribosomally synthesized natural products by hypothetical structure enumeration and evaluation using tandem MS. Proceedings of the National Academy of Sciences. 2014;111(33):12031–6. pmid:25092299
- 60. Caruso A, Martinie RJ, Bushin LB, Seyedsayamdost MR. Macrocyclization via an Arginine-Tyrosine Crosslink Broadens the Reaction Scope of Radical S-Adenosylmethionine Enzymes. Journal of the American Chemical Society. 2019;141(42):16610–4. pmid:31596076
- 61. Hudson GA, Burkhart BJ, Dicaprio AJ, Schwalen CJ, Kille B, Pogorelov TV, et al. Bioinformatic Mapping of Radical S-Adenosylmethionine-Dependent Ribosomally Synthesized and Post-Translationally Modified Peptides Identifies New Cα, Cβ, and Cγ-Linked Thioether-Containing Peptides. Journal of the American Chemical Society. 2019;141(20):8228–38. pmid:31059252
- 62. Shuguo H, Wei Z, Chao Z, Daoji W. One-Step Expression and Tyrosine O-Sulfonation of Ax21 in Escherichia coli. Applied Biochemistry and Biotechnology. 2012;166(5):1368–79. pmid:22249854
- 63. Zimmermann M, Hegemann JD, Xie X, Marahiel MA. Characterization of caulonodin lasso peptides revealed unprecedented N-terminal residues and a precursor motif essential for peptide maturation. Chemical Science. 2014;5(10):4032–43.
- 64. Zhu S, Fage CD, Hegemann JD, Yan D, Marahiel MA. Dual substrate-controlled kinase activity leads to polyphosphorylated lasso peptides. FEBS Letters. 2016;590(19):3323–34. pmid:27585551
- 65. Gavrish E, Sit S, Clarissa, Cao S, Kandror O, Spoering A, Peoples A, et al. Lassomycin, a Ribosomally Synthesized Cyclic Peptide, Kills Mycobacterium tuberculosis by Targeting the ATP-Dependent Protease ClpC1P1P2. Chemistry & Biology. 2014;21(4):509–18. pmid:24684906
- 66. Ghodge SV, Biernat KA, Bassett SJ, Redinbo MR, Bowers AA. Post-translational Claisen Condensation and Decarboxylation en Route to the Bicyclic Core of Pantocin A. Journal of the American Chemical Society. 2016;138(17):5487–90. pmid:27088303
- 67. Luu DD, Joe A, Chen Y, Parys K, Bahar O, Pruitt R, et al. Biosynthesis and secretion of the microbial sulfated peptide RaxX and binding to the rice XA21 immune receptor. Proceedings of the National Academy of Sciences. 2019;116(17):8525–34. pmid:30948631
- 68. Kupke T, Kempter C, Jung G, Götz F. Oxidative Decarboxylation of Peptides Catalyzed by Flavoprotein EpiD. Journal of Biological Chemistry. 1995;270(19):11282–9. pmid:7744764
- 69. Sardar D, Pierce E, Mcintosh JA, Schmidt EW. Recognition Sequences and Substrate Evolution in Cyanobactin Biosynthesis. ACS Synthetic Biology. 2015;4(2):167–76. pmid:24625112
- 70. Kaunietis A, Buivydas A, Čitavičius DJ, Kuipers OP. Heterologous biosynthesis and characterization of a glycocin from a thermophilic bacterium. Nature Communications. 2019;10(1). pmid:30846700
- 71. Hegemann JD, Van Der Donk WA. Investigation of Substrate Recognition and Biosynthesis in Class IV Lanthipeptide Systems. Journal of the American Chemical Society. 2018;140(17):5743–54. pmid:29633842
- 72. Mu D, Montalbán-López M, Deng J, Kuipers OP. Lantibiotic Reductase LtnJ Substrate Selectivity Assessed with a Collection of Nisin Derivatives as Substrates. Applied and Environmental Microbiology. 2015;81(11):3679–87. pmid:25795677
- 73. Caetano T, Barbosa J, Möesker E, Süssmuth RD, Mendo S. Bioengineering of lanthipeptides in Escherichia coli: assessing the specificity of lichenicidin and haloduracin biosynthetic machinery. Research in Microbiology. 2014;165(7):600–4. pmid:25058408
- 74. Roh H, Han Y, Lee H, Kim S. A Topologically Distinct Modified Peptide with Multiple Bicyclic Core Motifs Expands the Diversity of Microviridin‐Like Peptides. ChemBioChem. 2019;20(8):1051–9. pmid:30576039
- 75. Precord TW, Mahanta N, Mitchell DA. Reconstitution and Substrate Specificity of the Thioether-Forming RadicalS-Adenosylmethionine Enzyme in Freyrasin Biosynthesis. ACS Chemical Biology. 2019;14(9):1981–9. pmid:31449382
- 76. Casini A, Chang F-Y, Eluere R, King AM, Young EM, Dudley QM, et al. A Pressure Test to Make 10 Molecules in 90 Days: External Evaluation of Methods to Engineer Biology. Journal of the American Chemical Society. 2018;140(12):4302–16. pmid:29480720
- 77. Hillson N, Caddick M, Cai Y, Carrasco JA, Chang MW, Curach NC, et al. Building a global alliance of biofoundries. Nature Communications. 2019;10(1). pmid:31068573
- 78. Voigt CA. Synthetic biology 2020–2030: six commercially-available products that are changing our world. Nature Communications. 2020;11(1). pmid:33311504
- 79. Kloosterman AM, Shelton KE, Van Wezel GP, Medema MH, Mitchell DA. RRE-Finder: a Genome-Mining Tool for Class-Independent RiPP Discovery. mSystems. 2020;5(5). pmid:32873609
- 80. Tietz JI, Schwalen CJ, Patel PS, Maxson T, Blair PM, Tai H-C, et al. A new genome-mining tool redefines the lasso peptide biosynthetic landscape. Nature Chemical Biology. 2017;13(5):470–8. pmid:28244986
- 81. Sardar D, Lin Z, Schmidt W, Eric . Modularity of RiPP Enzymes Enables Designed Synthesis of Decorated Peptides. Chemistry & Biology. 2015;22(7):907–16. pmid:26165156
- 82. Van Heel AJ, Mu D, Montalbán-López M, Hendriks D, Kuipers OP. Designing and Producing Modified, New-to-Nature Peptides with Antimicrobial Activity by Use of a Combination of Various Lantibiotic Modification Enzymes. ACS Synthetic Biology. 2013;2(7):397–404. pmid:23654279
- 83. Segall-Shapiro TH, Sontag ED, Voigt CA. Engineered promoters enable constant gene expression at any copy number in bacteria. Nature Biotechnology. 2018;36(4):352–8. pmid:29553576
- 84. Winkler R. ESIprot: a universal tool for charge state determination and molecular weight calculation of proteins from electrospray ionization mass spectrometry data. Rapid Communications in Mass Spectrometry. 2010;24(3):285–94. pmid:20049890
- 85. Patiny L, Borel A. ChemCalc: A Building Block for Tomorrow’s Chemical Infrastructure. Journal of Chemical Information and Modeling. 2013;53(5):1223–8. pmid:23480664
- 86.
Tropea JE, Cherry S, Waugh DS. Expression and Purification of Soluble His6-Tagged TEV Protease. Methods in Molecular Biology: Humana Press; 2009. p. 297–307.