Moorea producens JHB, a Jamaican strain of tropical filamentous marine cyanobacteria, has been extensively studied by traditional natural products techniques. These previous bioassay and structure guided isolations led to the discovery of two exciting classes of natural products, hectochlorin (1) and jamaicamides A (2) and B (3). In the current study, mass spectrometry-based ‘molecular networking’ was used to visualize the metabolome of Moorea producens JHB, and both guided and enhanced the isolation workflow, revealing additional metabolites in these compound classes. Further, we developed additional insight into the metabolic capabilities of this strain by genome sequencing analysis, which subsequently led to the isolation of a compound unrelated to the jamaicamide and hectochlorin families. Another approach involved stimulation of the biosynthesis of a minor jamaicamide metabolite by cultivation in modified media, and provided insights about the underlying biosynthetic machinery as well as preliminary structure-activity information within this structure class. This study demonstrated that these orthogonal approaches are complementary and enrich secondary metabolomic coverage even in an extensively studied bacterial strain.
Citation: Boudreau PD, Monroe EA, Mehrotra S, Desfor S, Korobeynikov A, Sherman DH, et al. (2015) Expanding the Described Metabolome of the Marine Cyanobacterium Moorea producens JHB through Orthogonal Natural Products Workflows. PLoS ONE 10(7): e0133297. https://doi.org/10.1371/journal.pone.0133297
Editor: Franck Chauvat, CEA-Saclay, FRANCE
Received: March 2, 2015; Accepted: June 25, 2015; Published: July 29, 2015
Copyright: © 2015 Boudreau et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited
Data Availability: NMR and MS data for structural elucidation are found within the paper and its Supporting Information files. All gene sequence data are available from GenBank with the accession numbers KP860346-48. All mass spectrometry data used in forming the Molecular Networks are available from Dryad (datadryad.org) under DOI: 10.5061/dryad.2k6nr.
Funding: This work was supported by National Institutes of Health grant NS053398 (to WHG and TFM), National Institutes of Health grant CA100851 (to WHG), National Institutes of Health grant CA108874 (to WHG, LG, and DHS), National Institutes of Health grant GM107550 (to WHG, LG, and PCD), and Russian Science Foundation grant 14-50-069 (to AK).
Competing interests: The authors have declared that no competing interests exist.
Moorea producens JHB is a strain of tropical filamentous marine cyanobacterium that has been in culture in our laboratory for nearly two decades since its collection from Hector’s Bay, Jamaica in 1996. We have extensively studied its natural products by traditional isolation techniques [1, 2]. An NMR-guided process led to the isolation and structure elucidation of 1, a highly potent cytotoxin which enhances actin polymerization . In separate work, sodium channel blocking and fish toxicity assays guided the isolation and discovery of 2 and 3 . Since then, this strain has been extensively studied (under its name before reclassification Lyngbya majuscula JHB), with cultures from our laboratory being the subject of five more publications [1–7]. In this work we present the application of new methods for natural product discovery being applied to study the metabolome of M. producens JHB, from which several novel compounds, not observed in previous studies of this strain, were discovered (Fig 1).
A. The metabolome of M. producens JHB from prior studies; and B. the expanded metabolome of M. producens JHB from approaches described in this study.
Recent developments in the field of natural products have both improved the traditional activity/structure-based isolation workflows, and provided other orthogonal techniques to profile the metabolomes of organisms of interest. Traditional activity/structure-based isolation procedures have been supplemented by the use of solvent extractors , improved HPLC systems and columns to improve compound yields , new high-throughput screening techniques ; or analytical techniques that make it possible to determine the structures of ever smaller quantities of compounds, such as more sensitive cryo-probe NMR systems and new NMR pulse sequences [11, 12]. These improvements have enabled nanomole-scale structure elucidations such as were used to characterize the sanguinamides from the marine nudibranch Hexabranchus sanguineus . Concurrently, the field of natural products is exploring novel methods to assess secondary metabolomes that are distinct from traditional isolation workflows, chief among them is genome mining . As genome sequencing has become more rapid and less expensive, and the pipeline for assembling and annotating secondary metabolite pathways from gene sequence information, using programs such as antiSMASH and NaPDos, has become more efficient [15–19]. Consequently, applications of genome mining to the field of natural products have become increasingly diversified [14, 20, 21]. Genome mining has also shown that many strains possess far more biosynthetic gene clusters than previously expected, and indeed, these ‘silent’ pathways in some cases constitute the preponderance of a strain’s biosynthetic capacity [20–22]. Work with the daptomycin producing strain of Streptomyces roseosporus showed that it has the capacity to produce three other Non-Ribosomal Peptide Synthetase (NRPS) products, namely arylomycin, napsamycin, and stenothricin, all previously unreported from this organism .
Even with these impressive enhancements to our isolation methodologies, significant challenges remain to fully characterize the metabolic potential of an organism. Any single workflow has drawbacks; bioassay-guided isolation methods are inherently biased and overlook metabolites with differing activity. Structure-guided isolation schemes can miss compounds that are produced at low concentration and thus escape detection by normal methods, or if they have unremarkable spectroscopic properties. In addition, it is never known whether an organism is expressing all of its secondary metabolite biosynthetic pathways under a given set of environmental or culture conditions . While a successful genome mining effort might fully characterize the biosynthetic pathways within an organism, connecting and coordinating this with an isolation workflow can be challenging . Without heterologous expression or knockout experiments, links between a biosynthetic pathway and a structure are often only tentative.
In our decision to reexamine the metabolome of Moorea producens JHB we were mindful of these challenges, but also of the many examples in the literature where novel techniques found previously unreported compounds from productive well studied strains. Since the first report from M. producens JHB, our laboratory has gained access to new assays, such as modulation of cathepsin L activity , and new facilities on the UCSD campus for high-field NMR and High-Resolution Mass Spectrometry (HRMS) . However, because 1 and 2 are the major secondary metabolites from this strain, we knew that any additional compounds would be minor metabolites, and hence, alternative techniques that could target these compounds were needed.
Mass spectrometry based-molecular networking is well suited to serve as an alternative metabolic profiling platform because it is highly sensitive, amenable to use with compound or culture libraries, and blind to traditional guideposts for isolation projects such as chromatographic retention time or halogen isotope patterns [27, 28]. Mass spectrometry is highly sensitive, and while no ionization technique is universal, electrospray ionization (ESI) effectively ionizes a wide range of structural classes providing good coverage of the metabolome .
Molecular networking utilizes MS2 data to sort parent ions based on their structural similarity. Secondary ion mass fragmentation data relate directly to molecular structure because chemical bonds break on the basis of bond strength, strain within a molecule, and ability of a fragment to stabilize charge . Fragmentation patterns are thus intimately related to molecular structure, but independent of other bases for assessing compound similarity, such as the parent ion mass, halogen isotope pattern, or LCMS retention time. The Spectral Networking algorithm normalizes the intensity of fragment ions, uses each as an independent axis to construct a multidimensional vector for each spectrum, and finally compares the similarity of these vectors using a cosine function . This cosine score is then used to plot the relationships between different parent ion masses with the open source software Cytoscape (Fig 2) .
The process of forming a molecular network begins with collection of MS1 spectra (Step 1), from which parent ions are selected for MS2 fragmentation (Step 2). After the data is acquired it is processed by the Spectral Networking algorithm which converts MS2 data to vectors. The minimum number of peaks required to construct a vector is six; in this figure it is represented as only three so that the vector could be presented visually. It is important to note that when the multi-dimensional vectors are compared against all others by cosine function the vectors can have hundreds of dimensions (Step 3). This output is then visualized in Cytoscape with each node representing a MS2 spectrum labeled and colored based on its parent mass (Step 4).
For M. producens JHB, a crude extract of the cyanobacterium was profiled by LCMS and data from these runs were constructed into a molecular network as described above. This network identified an array of previously unreported compounds that were related to the major metabolites 1 and 2, likely produced as intermediates along the biosynthetic pathway, or by promiscuity in the substrate binding sites of these modular-type biosynthetic enzymes. These analogs provide new insights into the biosynthetic machinery and a more thorough understanding of the true secondary metabolic capacity of M. producens JHB than accomplished in previous work.
Additionally, further insight into the biosynthesis of 2 was gained by growing M. producens JHB in seawater media supplemented with sodium iodide. Under these conditions a minor metabolite of the jamaicamide family was produced in large quantities, and allowed the isolation and characterization of the iodinated analog of 2. This also provided insight into the halogenation enzyme involved in this biosynthetic step as well as yielding sufficient compound for biological assays.
The two approaches described above revealed new compounds within the jamaicamide and hectochlorin families, products of the two biosynthetic gene clusters previously identified in this organism [2, 4, 5]. To assess whether Moorea producens JHB possessed a broader biosynthetic capacity, we complemented the molecular networking approach with an in-depth bioinformatics interrogation of the sequenced genome. This led to the detection of numerous short NRPS pathways that were predicted by anti-SMASH to encode for small polar peptides, as well as one somewhat longer pathway for a predicted water soluble compound, the structure of which will be presented in a separate publication. An effort to isolate the products of these shorter NRPS pathways led to the isolation and characterization of a new compound, hectoramide (4), containing two N-methyl valine residues and a modified tyrosine residue. The structure of this new peptide was determined by 2D-NMR and HRMS analysis, and configuration of the residues by hydrolysis and chiral chromatographic analysis.
Results and Discussion
Molecular networking of M. producens JHB
The MS data from the crude extract LCMS chromatograms of M. producens JHB were analyzed using the Spectral Networking tool to create a molecular network and visualized in Cytoscape (Fig A in S1 Text) [27, 30]. These data were also uploaded into the Global Natural Products Social Molecular Networking database where they are publicly available (Massive ID MSV000078990) .
The network was interrogated for the [M+H]+ ions from the known major metabolites from this cyanobacterial strain, 1 and 2. In both cases, this revealed a cluster of parent masses from related analogs in the same family of compounds (Figs 3 and 4). One important caveat of the molecular networking process is that a single compound can give rise to multiple nodes. For example ions deriving from different adducts, from differing isotope composition, as well as source fragments, are all observed in the molecular network when present in sufficient intensity for the spectrometer to collect their MS2 spectra. The hectochlorin cluster is clearly an illustration of a single molecule producing multiple nodes from different halogen isotopes. The presence of heavy chlorine atoms in some of the hectochlorin molecules results in three nodes with mass differences of two Daltons within the hectochlorin cluster (Fig 3).
The presence of 35Cl or 37Cl within specific hectochlorin molecules yielded different parent masses and fragment spectra. The species with both 35Cl atoms has an m/z of 665, the species with one 35Cl and one 37Cl atom has and m/z of 667, and the species with 37Cl atoms has an m/z of 669. Because of this the fragment spectra share only masses from those fragments without chlorine atoms, and those fragments bearing the chlorine atoms show the same mass differences as their parent masses.
As with the hectochlorins, shown in Fig 3, the jamaicamides are present in the network as multiple parent masses from the presence of heavy halogen atoms.
The hectochlorin cluster was also interrogated for new hectochlorin analogs; many of the masses that clustered with the nodes from compound 1 were consistent with hypothetical analogs. The LCMS trace showed these species had distinct retention times from that of 1 (i.e. they were not source fragments), as well as isotope patterns that matched their variable levels of halogenation. Using the structure of 1 as a template, a series of hypothetical compounds could be proposed which were consistent with these masses (Table 1).
The network was also examined for new analogs within the jamaicamide cluster. Both 2 and 3 were present as a pair of [M+H]+ ions separated by 2 Da, again due to incorporation of heavy halogen atoms (Cl, Br). Another pair of nodes, with parent masses also separated by 2 Da, clustered with high cosine scores to the known jamaicamides. The LCMS1 chromatogram showed that this pair of nodes possessed a distinct retention time, and that it had an isotopic composition consistent with a singly brominated but non-chlorinated species. Thus, a dechloro-analog of 2, jamaicamide D (8), was hypothesized to explain this pair of connected nodes and was supported by HRMS data [m/z 533.1983, retention time 29.9 min, C27H38BrN2O4 [M+H]+ calculated 533.2009, -2.6 mamu].
The biosynthetic gene cluster and many of the individual biosynthetic steps for 2 are known, and previous work in our laboratory has suggested that 3 is the precursor for 2 with bromination occurring as the final step in the pathway [2, 3, 5, 6]. In this scenario, 8 would be formed by bromination of a non-chlorinated precursor (Fig AD in S1 Text). Further inspection of the network revealed a mass consistent with such a non-halogenated species, termed jamaicamide E (9), also clustered in the jamaicamide family (Fig 4). The LCMS1 trace again showed a distinctive retention time for 9, and an HRMS formula supportive of this structure [m/z 455.2877, retention time 26.5 min, C27H39N2O4 [M+H]+ calculated 455.2904, -2.7 mamu].
Purification of analogs from the jamaicamide and hectochlorin families
The LCMS analysis of this cyanobacterial extract used a serial set of five scans; one high-resolution, one low-resolution, and then three LTQ-MS2 scans of the three most intense ions from the previous scan using dynamic exclusion. Low-resolution LTQ scans were used to generate the MS2 data for molecular networking because of the higher sensitivity of this mode. This analysis generated MS1 and MS2 data consistent with the proposed structures for the jamaicamide and hectochlorin analogs. To rigorously characterize these new compounds, an extract that had been preparatively fractionated by Vacuum Liquid Chromatography (VLC) was interrogated for analogs . By LCMS analysis, fractions eluting with 80% EtOAc/hexanes and 100% EtOAc contained hectochlorin and the jamaicamide metabolites. These two VLC fractions were combined and further purified by Reverse-Phase Solid-Phase Extraction (RP-SPE) and RP-HPLC. The two VLC fractions eluting in 25% methanol to ethyl acetate, and in 100% methanol, were found to contain additional hectochlorin analog masses by LCMS, and were combined and purified in the same fashion as the less polar fractions.
These HPLC purifications afforded pure 1, 2, 3, and several of the minor analogs proposed by molecular networking analysis of the crude extracts. The major metabolites 1 and 2, along with the new derivatives 8 and 5, were isolated in sufficient quantity to fully characterize by NMR spectroscopy. However, not all of the species identified from the network were present in isolable amounts, as in the case of 9. To better characterize these other more minor analogs, they were subjected to HRMS2 fragmentation analysis, and then assembled de novo using the previously reported cyclic peptide dereplication tool .
The observation that 5, the deacetyl-analog, was the most abundant analog of compound 1, is consistent with the conjecture that this is an intermediate in the biosynthesis of 1 . The gene cluster encoding for hectochlorin biosynthesis shows no evidence of an acylation enzyme in the NRPS domain responsible for constructing the cyclic core of the molecule . Indeed, in the original report on the biosynthetic gene cluster, it was noted that the hct gene cluster lacked an acylation enzyme; it was proposed that acylation occurs as a post-NRPS modification . Moreover, 5 was previously reported along with 1 from the sea hare Bursatella leachii. In that report the authors suggested that accumulation of 5 in B. leachii could be the result of metabolism of 1 by the sea hare . With direct isolation of 5 from the cyanobacterial source, however, it is clear that direct accumulation from its diet is also a possibility.
At the time of its isolation, 5 was found to be a more potent cytotoxin than 1 against the human carcinoma of the nasopharynx (KB) and human small cell lung cancer (NCI-H187) cell lines (LD50’s of 0.31 μM and 0.32 μM for 5, compared to 0.86 μM and 1.20 μM for 1) . Molecular networking groups compounds based on their similar structural frameworks irrespective of their abundance, polarity, or other characteristics such as halogenation pattern or biological activity. Therefore, metabolites with significantly different polarities within an extract can still be easily recognized as structural analogs by molecular networking. The majority of compound 5 was found in the last two and most polar VLC fractions, which are often complex mixtures with poorly soluble nuisance compounds such as salts and glycolipids, potentially masking compounds of interest. By MS-based profiling and then targeting the isolation process towards the isolation of a suite of structurally similar products, rather than a single compound, information about structure activity relationships can be developed, as well as insights gained into the biosynthetic process.
Besides 5, the other analogs of 1 detected in this investigation are likely biosynthetic shunt products. They may be produced by errors in loading of the biosynthetic substrates or in the reactions the biosynthetic enzymes catalyze. For example, the monochlorinated analog of 1, hectochlorin C (6), is likely produced by a process wherein the chlorination enzyme fails to catalyze the addition of a second chlorine atom to the substrate, and this modification is transparent to subsequent downstream biosynthetic steps. However, the mono-chloro group in 6 is curious because formation of the trichloromethyl group of barbamide (10) in a different strain of M. producens, which is catalyzed by a highly homologous enzyme [35–37], occurs without going through a discrete monochlorinated intermediate. Rather, in the case of barbamide, the biosynthesis proceeds through an initial dichlorination step catalyzed by the BarB2 halogenase protein, followed by a monochlorination step catalyzed by the BarB1 halogenase protein to yield the final trichlorinated product. The isolation of 6 reveals that the halogenation enzyme in hectochlorin biosynthesis is different from the barbamide halogenases, as neither barbamide enzyme was observed to catalyze the formation of a discrete monochlorinated species in a detectable yield, and as the barbamide proteins act on the less reactive terminal methyl species [35–37].
Genomic Insights into M. producens JHB Natural Products
Genome sequence analysis of M. producens JHB revealed the known biosynthetic pathways of hectochlorin and the jamaicamides [2, 4, 5]. Other numerous short biosynthetic pathways were annotated by antiSMASH, and many of the predicted products of these pathways were dipeptides (Fig 5). Based on their predicted structures, we posited that if produced these compounds would be present in the more polar VLC fractions. A careful search of the metabolome contained in the polar fractions by LCMS analysis showed the dominant compound in the most polar eluting fractions was the hectochlorin analog 8. However, during purification by RP-SPE followed by RP-HPLC, not only was 8 isolated, but also a novel compound, named here as hectoramide (4), which was thoroughly characterized by NMR, HRMS and chemical degradation to components that could be compared with standards.
Analysis of the TOtal Correlation SpectroscopY (TOCSY) spectrum for compound 4 revealed four spin systems. Two of these were valine residues, seen by correlations of two doublet methyl peaks to their respective β-proton multiplets, and from there to their α-protons. Both of these residues were shown to be N-methyl valine residues by reciprocal Heteronuclear Multiple Bond Correlation (HMBC) correlations between the N-methyl singlets and the α-carbons. The presence of a para-substituted aromatic ring was revealed by a COSY correlation between two doublet 2H aromatic peaks; one of the substituents was revealed to be an O-methyl by HMBC signals between the singlet methyl protons at δ 3.79 and an aromatic ring carbon (C-19). The final COSY spin system connected an α-proton and a β-methylene group; the latter showed HMBC correlations to several of the aromatic ring carbons, and thus explained the other ring substituent. The downfield shift of the α-carbon (δ 69.9) and α-proton (δ 4.52) indicated this position was substituted with a hydroxy group rather than an amino group, revealing the final residue to be a 3-(4-methoxyphenyl)lactic acid (Mpla).
The linkage of these residues was determined by 2D-NMR and MS2 fragmentation analysis. The N-methyl group of the first valine showed a correlation to the carbonyl of the second valine residue. In turn, this latter valine showed a correlation from its N-methyl to the carbonyl of the adjacent Mpla residue. As such, the initial structure proposed for this compound was (HOOC-N-Me-Val)-(N-Me-Val)-(Mpla-OH) (11); however, this was inconsistent with data from HRMS analysis. While the structure 11 would have a predicted molecular formula of C22H34N2O6, the MS1 [M+H]+ peak of this compound was at m/z 422.2648 Da, consistent with the molecular formula of C22H35N3O5. This was resolved by comparing predicted 13C-NMR shifts with various candidate structures, and indicated that a terminal primary amide and hydroxy group were fully consistent with the C-1 and C-14 13C-NMR shifts (Fig AT in S1 Text). In addition, MS2 fragmentation showed fragment ions diagnostic for this structure (Fig 6), explicitly (H2NOC-N-Me-Val)-(N-Me-Val)-(Mpla-OH).
The absolute configuration of 4 was elucidated by traditional hydrolysis and derivatization methods. The two N-methyl-valine residues were characterized by Marfey’s analysis [38, 39], and the Mpla was derivatized with (2S)-2-octanol for chiral GCMS analysis . These analyses established the configuration of both valine residues as L, and the Mpla moiety as S.
Based on the structure of 4, its biosynthetic pathway would be expected to have three adenylation domains, coding for one tyrosine and two valine residues, and three methylation domains, two for amide N-methylations and one for the phenol O-methylation. In this regard, the planar structure of 4 is different from any of the predicted antiSMASH molecules for the pathways observed in the M. producens JHB genome (Fig 5). To refine the predicted structures of these pathways, the adenylation domains were submitted to NRPSpredictor2 [41–43]. This improved our confidence in the predictions from antiSMASH, but also strongly suggested that these pathways were not responsible for the production of 4. The genome was further interrogated specifically for valine and tyrosine adenylation domains and methyltransferases. Searching the genome for these targets found several hits; however, each was inconsistent with the expected biosynthetic pathway of 4. The bacteriocin pathways annotated by AntiSMASH were also inconsistent with the biosynthesis of 4 because the necessary sequence “YVV” was not found in the leader peptide sequences, and because there were no methylation domains in these clusters.
If 4 was produced by a heterotrophic bacterial strain living in association with M. producens JHB, this might explain why the pathway was not found in the Moorea producens JHB genome. Because Moorea producens JHB is not axenic, the raw genome data is actually a rarefied metagenome of DNA from both the cyanobacteria and associated heterotrophic bacterial strains. After the DNA sequence data were assembled, it was binned into groups of high (generally heterotrophic) or low (generally cyanobacterial) G+C content. To continue the search for a 4 biosynthetic pathway, the high-G+C scaffolds were also evaluated with antiSMASH, and revealed three additional biosynthetic gene clusters. However, none of the latter pathways were NRPS pathways; and an additional bacteriocin pathway contained neither a leader sequence consistent with 4, nor methylation domains. As none of the pathways found in the M. producens JHB metagenomic data could convincingly be attributed to 4 biosynthesis, this leaves the origin of this compound uncertain; possibilities include that the compound is produced by a fragmented pathway, a pathway that was not annotated by AntiSMASH, or by a heterotrophic bacterium in the culture but for which the genetic material was not captured by this genome sequencing work.
Molecular networking analysis of the M. producens JHB extract revealed that the pathways for jamaicamide and hectochlorin had a greater capacity for analog production than was previously appreciated. To further probe this, specifically the process of halogenation in the jamaicamides, filaments of Moorea producens JHB were grown in SW-BG11 media that was supplemented with sodium bromide and sodium iodide in equimolar concentrations (10 mM each) above the natural abundance of bromide (0.84 mM) or iodide (0.2–0.5 μM, average of 0.4 μM) in seawater . Profiling of the crude extract of this small scale experiment by LCMS with molecular networking analysis indicated an additional mass and fragmentation pattern consistent with an iodinated analog of 2, named here as jamaicamide F (12) (Figs H and I in S1 Text).
To confirm the structure of compound 12, multiple larger scale cultures of M. producens JHB were grown in media supplemented solely with sodium iodide (10 mM). After growth for 8–9 weeks the cultures were extracted and the crude extract purified by sequential RP-SPE and RP-HPLC to afford 12. Its structure was confirmed by NMR analysis, and generally showed a dataset very similar to that of 2. All carbon atoms in 12 were within 0.0–0.1 ppm of those in 2 except for those four closest to the terminal alkyne, suggestive of conservation of structure and configuration between these two analogs (Fig 7). The iodo-alkyne functionality possessed distinctive chemical shifts of -6.6 and 94.2 ppm for C-1 and C-2, respectively (Table AI and Fig AK in S1 Text). The original crude extract chromatogram of M. producens JHB grown in normal SW-BG11 media was subsequently inspected for the presence of 12. Indeed 12 could be observed in the MS1 extracted ion chromatogram at m/z 615 with a distinct retention time (29.4 min), and is thus to our knowledge the first report of a naturally occurring iodo-alkyne. However, it was of too low intensity to be selected for fragmentation and observed by the MS2-based molecular network. Despite the fact that iodide salts are not specifically included in SW-BG11 media, they are present as trace constituents in sodium chloride (bromide averaging 56 ppm and iodide averaging 0.24 ppm in Instant Ocean brand sea salt ), thus explaining the source of iodine to produce compound 12 in cultures prepared in SW-BG11 media.
It was interesting that when M. producens JHB was grown in media with an equal molar abundance of bromide and iodide ions, the two compounds 2 and 12 were observed in the LCMS chromatogram with equal intensity. While ion intensity can be difficult to quantitate, in this case because the difference in structure is replacement of a bromine atom with an iodine atom, it is likely that compounds 2 and 12 ionize comparably. The production of 2 and 12 in equal amounts in this experiment suggests that the enzyme is not selective for bromine or iodine. Halogenation enzymes can derive their selectivity from differences of halide electronegativity, abundance in nature, ionic radius and reactivity [46, 47]. Fluoride’s paramount electronegativity results in the fact that enzymes incorporating this halogen are both rare, and unusually for halogenases, non-oxidative . In seawater fluorine, bromine, and iodine are relatively rare with F/Cl/Br/I molar ratios of 2,000:1,000,000:2,000:1 . Thus, it appears that the enzyme responsible for bromination of jamaicamide excludes iodine on the basis of its relative scarcity in seawater, yielding 2 as the major product in nature.
Previous work on the jamaicamide biosynthetic pathway showed that hexanoic acid, 5-hexenoic acid, and 5-hexynoic acid but not 6-bromo-5-hexynoic acid were suitable substrates for JamA . By in vivo MALDI-TOF MS analysis following 15N labelling, it was found that production of 3 is light dependent, whereas production of 2 continues in the dark, and that 2 can be produced without fresh production of 3 . These data strongly support the proposal that the biosynthesis proceeds through a final bromination step on a pool of free 3 to afford 2.
Three mechanisms of bromination of 2 are conceivable using Br- by a nucleophilic halogenase, Br⦁ by a non-heme iron O2-dependent halogenase, or ‘Br+’ by a haloperoxidase or an FADH2-dependent halogenase (Fig 8) [47, 48]. Assuming bromination occurs on the terminal alkyne, as discussed above, then halogenation by a nucleophilic halogenase Br- is unlikely because this would putatively proceed through a high energy sp hybridized alkynyl carbocation species, which is highly disfavored over the resonance stabilized propargyl cation . Bromination through the alkynyl radical is a conceivable mechanism, but the enzyme activity profile of reacting with both bromine and iodine and being unable to react with abundant chlorine is different from the reactivity profile of the known radical halogenases. Known non-heme iron O2-dependent halogenase can catalyze both chlorination and bromination (with chlorination being highly favored), and no reported iodination activity ; this may be due to the steric constraint of the larger iodide or because of a difference in fundamental energetics, radical iodination with elemental iodine is an endothermic process . Bromination through a hypohalite ‘Br+’ species is compatible with the observation of an iodinated species and no chlorinated species because bromoperoxidases are capable of oxidizing the less electronegative iodine but not the more electronegative fluorine or chlorine . Flavin-dependent halogenases have not been observed to iodinate, hypothesized as a problem of sterics for the larger iodide or hydrolysis of the bound iodine intermediate, so they are a poorer candidate for producing the reactive “X+” equivalent .
Walsh and coworkers suggested an alternative mechanism for the bromination of jamaicamide via radical bromination of a saturated precursor methyl terminus ; however, this seems flawed for two possible reasons. First, as noted above, this terminal halogenation appears to occur with the alkyne rather than the alkane. Additionally, as stated above, with the discovery of 12, the reactivity profile of acting on bromine and iodine but not chlorine does not match to any known non-heme iron and O2-depenedent halogenase. Therefore, electrophilic addition by a haloperoxidase remains as the most likely mechanism for the bromination/iodination event in jamaicamide biosynthesis.
Ion Channel Pharmacology of the Jamaicamides
The ability of compounds 2, 3, and 12 to antagonize veratridine-stimulated Ca2+ influx was studied in neocortical neurons. All three jamaicamides tested (2, 3, and 12) produced concentration-dependent antagonism for the increase in neuronal [Ca2+]i induced by veratridine (Fig AP in S1 Text). The concentration-response curves were fit by a three-parameter logistic equation yielding IC50 values of 1.82 μM (95% CI = 0.9–3.3 μM), 6.88 μM (95% CI = 3.0–15.6 μM) and 4.3 μM (95% CI = 2.2–8.4 μM), respectively for 2, 3, and 12. The results suggest that 2 is approximately 2–3 times more potent than the other two jamaicamides tested.
Given that 2 and 3 have previously been reported to have sodium channel blocking activity at a concentration of 5 μM in Neuro-2a mouse neuroblastoma cell lines , we assessed their ability to block veratridine-stimulated sodium influx. We used primary cultures of mammalian neurons as a model system that is more relevant than transformed cell lines to mammalian neurotoxicology . We determined the ability of these compounds to antagonize veratridine-stimulated Na+ influx in neocortical neurons. All three jamaicamides (A, B and F) produced concentration-dependent antagonism of the increase in neuronal [Na+]i induced by veratridine (Fig AQ in S1 Text). The concentration-response curves were best fit by a three-parameter logistic equation yielding IC50 values of 1.1 μM (95% CI = 0.5–2.5 μM), 3.6 μM (95% CI = 1.5–8.5 μM) and 2.3 μM (95% CI = 1.0–5.0 μM), respectively for 2, 3, and 12. Again, our results suggest that 2 is approximately 2-3-times more potent than the other two jamaicamides as a sodium channel blocker in neocortical neurons. Collectively, these data indicate that the structural differences between 2, 3, and 12 have only minor roles in the interaction of these compounds with voltage-gated sodium channels.
The results of using molecular networking, genome analysis, and directed biosynthetic feeding studies with M. producens JHB illustrated that each approach has the potential to enhance the traditional structure or bioassay-guided natural products workflows. Just as importantly, it revealed that previous studies have overlooked metabolic diversity in the hectochlorin and the jamaicamide structural families. This is not because the previous studies were poorly designed or executed, but because these previous methods were inherently less comprehensive and thus do not fully explore the secondary metabolome of a given organism.
The current study demonstrated the power of molecular networking to guide the isolation of new analogs in desired natural product families; however, the greater lesson is that no single method is comprehensive, and that to truly appreciate the secondary metabolome of an organism, it is necessary to employ orthogonal methods (Fig 9). Such an approach reveals minor or overlooked metabolites, which may be intermediates or byproducts of the biosynthetic machinery responsible for the secondary metabolome, and is applicable to complex samples, such as field collections from the environment or, as in this case, non-axenic cultures. Compounds produced in only trace quantities, however, can still be extremely valuable for understanding the biosynthetic process, for providing analogs to gain an initial appreciation of structure-activity relationships, and to provide a holistic view of the secondary metabolome of an organism.
Materials and Methods
General Experimental Procedures
NMR spectra were collected on a Varian Unity 500 MHz (500 MHz and 125 MHz for the 1H and 13C nuclei respectively), a Varian VX 500 MHz with 13C-optimized cryoprobe, and a Bruker 600 MHz (600 MHz and 150 MHz for the 1H and 13C nuclei respectively) with 1.7 mm inverse cryo-probe. NMR experiments were conducted using CDCl3 from Cambridge Isotope Laboratories, Inc. 99.8% D, containing 0.03% or 1.0% v/v trimethylsilane (referencing δH 0.0 as the internal standard from trimethylsilane, and δC 0.0 or δC 77.16 as internal standards using trimethylsilane and CDCl3, respectively). Microwave-heated reactions were run in a Biotage Initiator microwave synthesizer. LR-LCMS data were collected on a Thermo Finnigan Surveyor Autosampler/LC-Pump-Plus/PDA-Puls with a Thermo Finnigan Advantage Max mass spectrometer. HRMS data was collected on a Finnigan LTQ-FTICR-MS instrument (Thermo-Electron Corporation, San Jose, CA) fitted with either an Ion-Max ESI source for LCMS runs, or a Biversa Nanomate (Advion Biosystems, Ithaca, NY) electrospray source. HPLC purification was carried out with a Waters 515 HPLC Pump with a Waters 996 Photodiode Array Detector using Empower Pro software. All solvents were HPLC grade except for H2O which was purified with a Millipore Milli-Q system before use, CH3CN which was LCMS grade from J.T. Baker, and acetone which was distilled before use.
Crude Extract and LCMS
M. producens JHB samples were prepared from material cultured in separate technical replicates in artificial seawater media using a 16–8 h standard light-dark cycle at 28°C. The cultures were collected by rapid vacuum filtration from the media at room temperature, combined, and extracted iteratively with 2:1 dichloromethane/methanol to afford 0.81 g crude extract. LC-HRMS data for the creation of the molecular network were collected using samples of the crude cyanobacterial extract prepared from stock solutions of the extract at 100 mg/mL in 1:1 EtOH/isooctane. Of the stock solution, 10 μL (1 mg crude extract) was diluted to 1 mg/mL in MeOH or CH3CN, and insoluble solids were filtered with a Celltreat PTFE syringe filter before use. Samples and a blank of CH3CN were run on a LC-LTQ-FTICR-MS (Liquid-column Chromatography Linear Triple Quadrupole Fourier Transform Ion Cyclotron Resonance Mass Spectrometry with a Phenomenex Synergi 4 μm Fusion-RP 80 Å 100 x 2.00 mm column with a Security Guard. The elution used a gradient that began with 70% H2O (acidified 1% v/v with HCOOH)/CH3CN for 5 min, then ramped to 1% H2O/CH3CN at 40 min, held there for 5 min, brought back 70% at 47 min, and re-equilibrated for 3 min. The flow was diverted to waste for the first 5.25 min. The MS detector used a serial set of five scans; one high-resolution FT-MS1 scan was followed by one low-resolution LTQ-MS1 scan, and then three LTQ-MS2 scans of the three most intense ions from the LTQ-MS1 scan.
The VLC fractions containing hectochlorin (80% EtOAc/hexanes and 100% EtOAc) were combined and fractionated with a Grace Pure C18-Max 1 g/6 mL reverse phase solid phase extraction column (RP-SPE). The column was prepared with three column volumes of CH3CN and then run with a 30 mL elution each of CH3CN, MeOH, and CH2Cl2. The latter VLC fractions which contained the hectochlorin analogs (25% MeOH/EtOAc and 100% MeOH) were combined and fractionated in the same fashion. From each the CH3CN fraction was purified by HPLC, using a Synergi 4μ RP-80 250 x 10.00 mm column with an isocratic elution at 3.0 mL/min. The first fraction was eluted with 1.2:1.8 H2O/CH3CN for 70 min, the second with 1.25:1.75 H2O/CH3CN for 30 min. 1 was collected as 6.6 mg of a yellow-green oil, 2 as 8.9 mg of a light green oil, and 3 as 6.7 mg of a yellow-green oil; each of these matched literature reports for the NMR and HRMS spectra of these compounds [1, 2]. Also collected was 4 as 1.3 mg of a yellow oil, 6 as 0.2 mg of a yellow oil, 7 as 2.6 mg of a yellow-brown oil, and 8 as 0.5 mg of a light-green solids. Samples of 5 from the two HPLC purifications were combined and re-purified with a Jupiter 5μ C-18 300 Å 250 x 4.60 mm column eluted with 2:3 CH3CN/H2O at 1.5 mL/min and the purified 5 was collected as 1.1 mg of colorless oil.
The crude extract from the sodium iodide enrichment experiment was purified similarly with the same RP-SPE column but using an elution of 1:1 CH3CN/H2O, CH3CN, MeOH, and CH2Cl2. The CH3CN eluting fraction was purified by HPLC using a Phenomenex Kinetex 5μ C-18 100 Å 100 x 4.60 mm column, eluted with an isocratic elution of 52% CH3CN/H2O at 1.0 mL/min. The 3 was collected as 1.2 mg of a yellow oil and 12 as 1.5 mg of a faint-yellow oil.
A yellow oil; [α]25D -11.1 (CH2Cl2, c = 3.5 mM); UV-Vis (MeOH) λmaxima (log ε) 204 (2.(2)) nm, 262 (1.(7)) nm; IR (neat) νmax 3396.4, 2961.8, 2929.7, 1681.1, 1629.4, 1466.9, 1249.5, 1073.9, 1036.7 cm-1; see Table 2 for NMR data; HR-ESI-FT-MS [M + H]+ m/z 422.2648 (calculated for C22H36N3O5 422.2649, -0.1 mamu).
Hectochlorin B (5).
A faint-yellow oil; [α]25D -0.325 (MeOH, c = 47.2 mM); UV-Vis (MeOH) λmax (log ε) 237 (4.(0)) nm; IR (neat) νmax 3394.2, 2976.5, 2932.2, 1715.4, 1379.8, 1323.6, 1244.5, 1150.8, 1090.5, 741.1 cm-1; see Table R in S1 Text for NMR data; HR-ESI-FT-MS [M + H]+ m/z 623.1050 (calculated for C25H33O8Cl2N2S2 623.1050, 0 mamu).
Hectochlorin C (6).
A yellow oil; [α]25D 4.35 (MeOH); UV-Vis (MeOH) λmax (log ε) 232 (3.(9)) nm; IR (neat) νmax 2926.6, 2857.7, 1715.6, 1243.7, 1211.7, 1151.1, 1085.1, 1048.3 cm-1; HR-ESI-FT-MS [M + H]+ m/z 631.1545 (calculated for C27H36O9ClN2S2 631.1545, 0 mamu).
Hectochlorin D (7).
A yellow-brown oil; [α]24D -1.53 (MeOH); UV-Vis (MeOH) λmax (log ε) 233 (3.(4)) nm; IR (neat) νmax 3302.8, 2928.1, 2861.1, 1712.2, 1449.0, 1242.5, 971.0 cm-1; HR-ESI-FT-MS [M + H]+ m/z 679.1306 (calculated for C28H37O9Cl2N2S2 679.1312, -0.6 mamu).
Jamaicamide D (8).
A faint-green, amorphous solids; see Table AE in S1 Text for NMR data; HR-ESI-FT-MS [M + H]+ m/z 533.2012 (calculated for C27H38O4BrN2 533.2009, 0.3 mamu).
Jamaicamide F (12).
A faint-yellow oil; [α]25D 83.8 (CH2Cl2, c = 2.4 mM); UV-Vis (MeOH) λmax (log ε) 266 (4.(1)) nm; IR (neat) νmax 3318, 3116, 2933, 2863, 1716, 1656, 1599, 1545, 1439, 1396, 1335, 1296, 1204, 1171, 1137, 1082, 973, 823, and 754 cm-1; see Table AI in S1 Text for NMR data; HR-ESI-FT-MS [M + H]+ m/z 615.1471 (calculated for C27H37O4ClIN2 615.1481, -1.0 mamu).
Pure Compound HR MS/MS
Fractions from the HPLC purification were prepared for HRMS. Samples were diluted 1:10 with 50% H2O/MeOH acidified 1% v/v with formic acid and subjected to electrospray ionization with a Nanomate nano-spray source (pressure: 0.3–0.4 psi, spray voltage: 1.3–1.4 kV), and then fragmented and analyzed on a LTQ-FT-MS. The instrument was first auto-tuned to a standard of 100 μM cytochrome C. The [M + H]+ ion of each compound was isolated in the linear ion trap and fragmented by collision induced dissociation. Sets of consecutive, high-resolution MS/MS scans were acquired in profile mode and averaged using QualBrowser software by Thermo.
MS spectra were converted to Mascot generic format (.mgf) files using MSCovert (from Proteowizard ) and then networked with the Spectral Networking algorithm using a minimum cosine of 0.7; the data were then viewed in Cytoscape .
Preparation of Mpla Ester Standards.
Samples of l and D O-methyl-tyrosine, 19.6 mg and 19.7 mg respectively, were charged into 8.0 mL vials and magnetic stir bars were added. These amino acids were dissolved in 1.00 mL of 0.9999(7) M NaNO2 (aq) and cooled to 0°C with stirring and 5.0 mL of 0.200(6) M perchloric acid (aq) was added. The reaction mixture was warmed to room temperature slowly over 30 min then sealed and heated to reflux by oil bath for 5 min. Afterwards, it was cooled to room temperature and diluted with 10 mL of 5% citric acid (aq) and extracted with 3 x 10 mL of CH2Cl2. The crude reaction mixture was concentrated to afford a colorless oil. This was placed in a fresh vial with a stir bar, then dissolved in 300 μL of (2S)-2-octanol while stirring under Ar. The acyl chloride (150 μL) was added dropwise, the vial sealed and heated to 115°C in an oil bath. After 1.5 h the vials were cooled to room temperature and quenched with 1.0 mL of water and 2.0 mL of dichloromethane. The organic layer was passed over ca. 2 cm of MgSO4 in a glass pipette, concentrated to afford 4.8 and 4.1 mg of the two esters as colorless oils for the l and D case, respectively.
Stereochemical Analysis of Hectoramide (4)
A 300. μL aliquot of a 1 mg/mL solution of 4 in CH2Cl2 was transferred to a 0.5–2 mL microwave reaction tube with a stir bar, dried under N2 (g), and taken up in 600 μL of 6 M HCl (aq). This mixture was reacted in a microwave reactor at 110°C for 5 min, then dried again under N2 (g). The sample was redissolved in 600 μL of CH2Cl2, split into two equal portions, dried again under N2 (g), charged with a magnetic stir bar, and put under an inert Ar atmosphere.
The first sample was dissolved in 150 μL of 1 M NaHCO3 and 300.0 μL of D-FDAA solution (1.00 mg/mL in acetone) was added dropwise with stirring. This reaction mixture was warmed to 45–50°C for 1 h, then cooled to room temperature and neutralized with 75.0 μL of 2 M HCl (aq). This material was transferred to a fresh vial with 3 x 0.250 mL of CH3CN, concentrated to dryness under N2 (g). The sample was transferred to an LCMS vial through a syringe filter with 3 x 0.5 mL of CH3CN, re-concentrated under N2 (g) and brought to a final volume of 150 μL CH3CN. A 25.0 μL aliquot of the sample was injected on the LR-LCMS and run in comparison with previously prepared authentic standards . The samples were analyzed by LR-LCMS with a Phenomenex Kinetex 5 μ C18 100 Å 100 x 4.60 mm column, and a 75 min gradient elution beginning at 5% CH3CN/95% H2O acidified with 0.1% v/v with HCOOH (Acros) for 5 min, and then ramped up to 50% CH3CN/50% H2O acidified with 0.1% v/v with formic acid over 65 min, and then brought back to the starting condition over 1 min and re-equilibrated for 4 min.
The second portion of the hydrolysate was dissolved in 200 μL of (2S)-2-octanol and 100 μL of acyl chloride was added dropwise with stirring. The reaction vial was sealed and warmed to 110–120°C and kept for 1 h, then cooled to room temperature and quenched with 0.50 mL of H2O. The reaction was extracted with 2.0 mL of CH2Cl2 and the organic layer was separated and passed through MgSO4. The organic layer was dried and reconstituted in a GCMS vial with 3 x 50 μL of CH2Cl2 and then analyzed by GCMS alone and in comparison with similarly derivatized D and l standards by co-injection.
Genomic DNA from cultured biomass of M. producens JHB was extracted using a standard phenol:CHCl3:isoamyl alcohol extraction protocol. The genome was sequenced at the University of Michigan Microarray Core Facility (http://www.umich.edu/~caparray/), using Illumina HiSeq sequencing. Raw reads were corrected via BayesHammer . Assembly was performed using the SPAdes Genome Assembler version 3.1.1 , followed by scaffolding with Opera and binning to obtain cyanobacterial-specific scaffolds , with an average G+C content of 44%, which is comparable to the related Moorea producens 3L genome G+C content of 44% . The sequences for the clusters detailed in Fig 5 were uploaded to GenBank with the accession numbers KP860346-48.
Cultures of Moorea producens JHB were grown as previously described , in 2 L Erlyenmeyer flasks with 1 L of SWBG-11 media supplanted with equimolar amounts of NaI and NaBr at a concentration of 10 mM (1.5 g and 1.0 g into 1 L SWBG-11 media, respectively), with a control sample grown in unmodified SWBG-11 media. The inoculation was made with only a few short filaments, roughly 2–3 centimeters long. After 7 weeks of growth, the cultures were extracted and analyzed by LCMS as described above.
To provide compound 12 on a larger scale, Moorea producens JHB was grown in SWBG-11 media containing 10 mM NaI (15.0 g into 10 L SWBG-11). Multiple cultures were grown in shallow pans containing 5 L of media each and split into new pans with fresh media after 8 or 17 weeks of growth. A total of four pans were grown and the cyanobacteria were extracted and compounds purified as described above.
Neocortical Neuron Culture.
Primary cultures of neocortical neurons were obtained from embryonic day 16 Swiss-Webster mice as described elsewhere [57, 58]. Briefly, pregnant mice were euthanized by CO2 asphyxiation and embryos were removed under sterile conditions. Neocortices were collected, stripped of meninges, minced by trituration with a Pasteur pipette, and treated with trypsin for 25 min at 37°C. The cells were then dissociated by two successive trituration and sedimentation steps in soybean trypsin inhibitor and DNase containing isolation buffer, centrifuged, and resuspended in Eagle’s minimal essential medium with Earle’s salt (MEM) and supplemented with 1 mM l-glutamine, 10% fetal bovine serum, 10% horse serum, 100 IU/mL penicillin, and 0.10 mg/mL streptomycin (pH 7.4). Cells were plated onto poly-l-lysine-coated 96-well (9 mm), clear-bottomed, black-well culture plates (MidSci, St. Louis, USA) at a density of 1.5 × 105 cells/well. Cells were then incubated at 37°C in a 5% CO2 and 95% humidity atmosphere. The culture media was changed every other day, starting from day 5 in-vitro using a serum-free growth medium containing Neurobasal Medium supplemented with B-27, 100 IU/mL penicillin, 0.10 mg/mL streptomycin, and 0.2 mM l-glutamine. Neocortical cultures were used in experiments between 10–11 days in-vitro. All animal use protocols were approved by the Institutional Animal Care and Use Committee (IACUC) of Creighton University.
Intracellular Ca2+ Concentration Measurement.
Cells grown in 96-well plates were used for determination of intracellular Ca2+ concentration ([Ca2+]i). Briefly, the growth medium was removed and replaced with dye-loading medium (100 μL per well) containing 4 μM fluo-3 AM and 0.04% pluronic acid in Locke's buffer. After 1 h of incubation in dye-loading medium, the cells were washed four times in fresh Locke's buffer (8.6 mM HEPES, 5.6 mM KCl, 154 mM NaCl, 5.6 mM glucose, 1.0 mM MgCl2, 2.3 mM CaCl2, 0.1 mM glycine, pH 7.4) using an automated microplate washer (Bio-Tek Instruments Inc., VT, USA). Various concentrations of 2, 3, and 12 were then added to the cells at a rate of 26 μL/s, yielding a final volume of 200 μL/well. The cells were incubated in the presence or absence of jamaicamides for 5–7 minutes at 37°C in a 5% CO2 and 95% humidity atmosphere prior to transfer to a FLEX Station II (Molecular Devices) benchtop scanning fluorescence chamber. The fluorescence measurements were performed at 37°C. The cells were excited at 488 nm and Ca2+-bound fluo-3 emission was recorded at 538 nm at 2 s intervals. After recording baseline fluorescence for 60 s, either vehicle or veratridine was added to each well at a rate of 26 μL/s; the fluorescence was then monitored for an additional 200 s.
Intracellular Na+ Concentration Measurement.
[Na+]i measurement and full in-situ calibration of sodium-binding benzofuran isophthalate (SBFI) fluorescence ratio were performed as described previously . Cells grown in 96-well plates were washed four times with Locke's buffer using an automated microplate washer (BioTek Instruments, Winooski, VT). After measuring the background fluorescence of each well, cells were incubated for 1 h at 37°C with dye-loading buffer (100 μL/well) containing 10 μM SBFI-AM (Invitrogen) and 0.02% Pluronic F-127 (Invitrogen). Cells were washed five times with Locke's buffer, leaving a final volume of 150 μL in each well. Jamaicamide addition and preincubation was as described above for Ca2+ concentration determination. Plates were then placed in a FlexStation II (Molecular Devices, Sunnyvale, CA) chamber to detect Na+-bound SBFI emission at 505 nm (excitation wavelength was 340 and 380 nm). Fluorescence readings were taken once every 5 s for 60 s to establish the baseline, then either vehicle (control) or veratridine was added and fluorescence was monitored for an additional 260 s. After correcting for background fluorescence, SBFI fluorescence ratios (340/380) and concentration-response graphs were generated.
Bioassay Data Analysis.
The Fluo-3 and SBFI raw fluorescence emission data were exported to an Excel work sheet. The Fluo-3 fluorescence or SBFI fluorescence ratios (340/380) versus time were analyzed and concentration-response graphs generated using GraphPad Prism software (GraphPad Software Inc., San Diego, CA). The IC50 values of 2, 3, and 12 for antagonism of veratridine-stimulated calcium or sodium influx were determined by nonlinear regression analysis using a three-parameter logistic equation.
This work was supported by NIH grants NSO53398 (to W. H. G. and T. F. M.), CA100851 (to W.H.G.), CA108874 (to W. H. G., L. G., D. H. S.) and GM107550 (W. H. G., L. G., and P. C. D.), and the Russian Science Foundation grant 14-50-069 (A. K.). We thank the government of Jamaica for permission to make the original collection of M. producens JHB, T. Byrum for help in the culturing of the cyanobacterium and the photo used to create Fig 1, B. Duggan for help with the acquisition of the 600 MHz data sets, and J. Watrous, M. Meehan, C. Rath, and D. Nguyen for help with the LTQ-FTICR-MS and construction of the molecular networks, R. C. Coates for allowing us use of his Moorea producens JHB extracts and VLC fractions, and P. Fardella for help with secondary metabolite sequence analyses.
Conceived and designed the experiments: PDB TFM LG PCD WHG. Performed the experiments: PDB EAM SM SD AK LG. Analyzed the data: PDB EAM SM AK LG WHG. Contributed reagents/materials/analysis tools: AK DHS TFM LG PCD WHG. Wrote the paper: PDB EAM TFM LG WHG.
- 1. Marquez BL, Watts KS, Yokochi A, Roberts MA, Verdier-Pinard P, Jimenez JI, et al. (2002) Structure and absolute stereochemistry of hectochlorin, a potent stimulator of actin assembly. J Nat Prod. 65: 866–871. pmid:12088429
- 2. Edwards DJ, Marquez BL, Nogle LM, McPhail K, Goeger DE, Roberts MA, et al. (2004) Structure and biosynthesis of the jamaicamides, new mixed polyketide-peptide neurotoxins from the marine cyanobacterium Lyngbya majuscula. Chem Biol. 11: 817–833. pmid:15217615
- 3. Dorrestein PC, Blackhall J, Straight PD, Fischbach MA, Garneau-Tsodikova S, Edwards DJ, et al. (2006) Activity screening of carrier domains within nonribosomal peptide synthetases using complex substrate mixtures and large molecule mass spectrometry. Biochemistry. 45: 1537–1546. pmid:16460000
- 4. Ramaswamy AV, Sorrels CM, Gerwick WH (2007) Cloning and biochemical characterization of the hectochlorin biosynthetic gene cluster from the marine cyanobacterium Lyngbya majuscyla. J Nat Prod. 70: 1977–1986. pmid:18001088
- 5. Gu L, Wang B, Kulkarni A, Geders TW, Grindberg RV, Gerwick L, et al. (2009) Metamorphic enzyme assembly in polyketide diversification. Nature. 459: 731–735. pmid:19494914
- 6. Esquenazi E, Jones AC, Byrum T, Dorrestein PC, Gerwick WH (2011) Temporal dynamics of natural product biosynthesis in marine cyanobacteria. P Natl Acad Sci USA. 108: 5226–5231.
- 7. Engene N, Rottacker EC, Kaštovský J, Byrum T, Choi H, Ellisman MH, et al. (2012) Moorea producta gen. nov., sp. nov. and Moorea bouillonii, comb. nov., tropical marine cyanobacteria rich in bioactive secondary metabolites. Int J Syst Evol Microbiol. 62: 1171–1178. pmid:21724952
- 8. Johnson TA, Morgan MVC, Aratow NA, Estee SA, Sashidhara KV, Loveridge ST, et al. (2010) Assessing pressurized liquid extraction for the high-throughput extraction of marine-sponge-derived natural products. J Nat Prod. 73: 359–364. pmid:20030364
- 9. Hayes R, Ahmed A, Edge T, Zhang H (2014) Core-shell particles: preparation, fundamentals and applications in high performance liquid chromatography. J Chromatography A. 1357: 36–52.
- 10. Hou Y, Braun DR, Michel CR, Klassen JL, Adnani N (2012) Microbial strain prioritization using metabolomics tools for the discovery of natural products. Anal Chem. 84: 4277–4283. pmid:22519562
- 11. Kupče E, Freeman R (2003) Two-dimensional Hadamard spectroscopy. J Magn Reson. 162: 300–310. pmid:12810013
- 12. Molinski TF (2010) NMR of natural products at the ‘nanomole-scale’. Nat Prod Rep. 27: 321–329. pmid:20179874
- 13. Dalisay DS, Rogers EW, Edison AS, Molinski TF (2009) Structure elucidation at the nanomole scale. 1. Trisoxazole macrolides and thiazole-containing cyclic peptides from the nudibranch Hexabranchus sanguineus. J Nat Prod. 72: 732–738. pmid:19254038
- 14. Challis GL (2008) Mining microbial genomes for new natural products and biosynthetic pathways. Microbiology. 154: 1555–1569. pmid:18524911
- 15. Antibiotics & Secondary Metabolite Analysis Shell. Available: http://antismash.secondarymetabolites.org/. Accessed 9 January 2015.
- 16. Medema MH, Blin K, Cimermancic P, de Jager V, Zakrzewski P, Fischbach MA, et al. (2011) antiSMASH: rapid identification, annotation and analysis of secondary metabolite biosynthesis gene clusters in bacterial and fungal genome sequences. Nucleic Acids Res. 39: W339–W346. pmid:21672958
- 17. Blin K, Medema MH, Kazempour D, Fischbach MA, Breitling R, Takano E, et al. (2013) antiSMASH 2.0—a versatile platform for genome mining of secondary metabolite producers. Nucleic Acids Res. 41: W204–W212. pmid:23737449
- 18. Gross H, Stockwell VO, Henkels MD, Nowak-Thompson B, Loper JE, Gerwick WH (2007) The genomeisotopic approach: a systematic method to isolate products of orphan biosynthetic gene clusters. Chem Biol. 14: 53–63. pmid:17254952
- 19. Ziemert N, Podell S, Penn K, Badger JH, Allen E, Jensen PR (2012) The natural product domain seeker NaPDos: A phylogeny based tool to classify secondary metabolite gene diversity. PLoS ONE. 7: e34064. pmid:22479523
- 20. Jensen PR, Chavarria KL, Fenical W, Moore BS, Ziemert N (2014) Challenges and triumphs to genomics-based natural product discovery. J Ind Microbiol Biotechnol. 41: 203–209. pmid:24104399
- 21. Udwary DW, Zeigler L, Asolkar RN, Singan V, Lapidus A, Fenical W, et al. (2007) Genome sequencing reveals complex secondary metabolome in the marine acrinomycete Salinispora tropica. Proc Natl Acad Sci USA. 104: 10376–10381. pmid:17563368
- 22. Liu WT, Lamsa A, Wong WR, Boudreau PD, Kersten R, Peng Y, et al. (2013) MS/MS-based networking and peptidogenomics guided genome mining revealed the stenothricin gene cluster in Streptomyces roseosporus. J Antibiot. 67: 99–104. pmid:24149839
- 23. Jones AC, Monroe EA, Podell S, Hess WR, Klages S, Esquenazi E, et al. (2011) Genomic insights into the physiology and ecology of the marine filamentous cyanobacterium Lyngbya majuscula. Proc Natl Acad Sci USA. 108: 8815–8820. pmid:21555588
- 24. Terpe K (2006) Overview of bacterial expression systems for protein production: from molecular and biochemical fundamentals to commercial systems. Appl Microbiol Biotechnol. 72: 211–222. pmid:16791589
- 25. Miller B, Friedman AJ, Choi H, Hogan J, McCammon JA, Hook V, et al. (2014) The marine cyanobacterial metabolite gallinamide A is a potent and selective inhibitor of human Cathepsin L. J Nat Prod. 77: 92–99. pmid:24364476
- 26. UC San Diego Skaggs School of Pharmacy and Pharmaceutical Sciences NMR Facility. Available: http://sopnmr.ucsd.edu/. Accessed 25 July 2014.
- 27. Watrous J, Roach P, Alexandrov T, Heath BS, Yang JY, Kersten RD, et al. (2012) Mass spectral molecular networking of living microbial colonies. P Nat Acad Sci USA. 109: E1743–E1752.
- 28. Yang JY, Sanchez LM, Rath CM, Liu X, Boudreau PD, Bruns N, et al. (2013) Molecular networking as a dereplication strategy. J Nat Prod. 76: 1686–1689. pmid:24025162
- 29. Fenn JB (2003) Electrospray Wings for Molecular Elephants (Nobel Lecture). Angew Chem Int Ed. 42: 3871–3894.
- 30. Cytoscape. Available: http://www.cytoscape.org/. Accessed 25 July 2014.
- 31. GnPS: Global Natural Products Social Molecular Networking. Available: http://gnps.ucsd.edu/. Accessed 3 January 2015.
- 32. Standard system for VLC in our laboratory uses fixed volumes of progressively more polar mixtures of hexanes—EtOAc—MeOH proportional to the size of the extraction. (nine fractions, 100% hexanes, 10% EtOAc/hexanes, 20% EtOAc/hexanes, then 20% increments to 100% EtOAc, 25% MeOH/EtOAc, and 100% MeOH).
- 33. Ng J, Bandeira N, Liu W, Ghassemian M, Simmons TL, Gerwick WH, et al. (2009) Dereplication and de novo sequencing of nonribosomal peptides. Nat Methods 6: 596–600. pmid:19597502
- 34. Suntornchashwej S, Chaichit N, Isobe M, Suwanborirux K (2005) Hectochlorin and morpholine derivatives from the the Thai sea hare, Bursatella leachii. J Nat Prod. 68: 951–955. pmid:15974628
- 35. Galonić DP, Vaillancourt FH, Walsh CT (2006) Barbamide, a chlorinated metabolite with molluscicidal activity from the Caribbean cyanobacterium Lyngbya majuscula. J Am Chem Soc. 128: 3900–3901. pmid:16551084
- 36. Chang Z, Flatt P, Gerwick WH, Nguyen VA, Willis CL, Sherman DH (2002) The barbamide biosynthetic gene cluster: a novel marine cyanobacterial system of mixed polyketide synthase (PKS)-non-ribosomal peptide synthetase (NRPS) origin involving an unusual trichloroleucyl starter unit. Gene. 296: 235–247. pmid:12383521
- 37. Flatt PM, O’Connell SJ, McPhail KL, Zeller G, Willis CL, Sherman DH, et al. (2006) Characterization of the initial enzymatic steps of barbamide biosynthesis. J Nat Prod. 6:938–944.
- 38. Marfey P (1984) Determination of D-Amino Acids. II. Use of a bifunctional reagent, 1,5-difluoro-2,4-dinitrobenzene. Carlsberg Res Commun. 49: 591–596.
- 39. Bhushan R, Brückner H (2004) Marfey’s reagent for chiral amino acid analysis: A review. Amino Acids. 27: 231–247. pmid:15503232
- 40. Nunnery JK, Engene N, Byrum T, Cao Z, Jabba SV, Pereira AR, et al. (2012) Biosynthetically intriguing chlorinated lipophilic metabolites from geographically distant tropical marine cyanobacteria. J Org Chem. 77: 4198–4208. pmid:22489775
- 41. NRPS predictor 2. Available: http://nrps.informatik.uni-tuebingen.de/. Accessed 25 July 2014.
- 42. Rausch C, Weber T, Kohlbacher O, Wohlleben W, Huson DH (2005) Specificity prediction of sentlation domains in nonribosomal peptide synthetases (NRPS) using transductive support vector machines (TSVMs). Nucleic Acids Res. 33: 5799–5808. pmid:16221976
- 43. Röttig M, Medema MH, Blin K, Weber T, Rausch C, Kohlbacher O (2011) NRPSpredictor2—a web server for predicting NRPS adentlation domain specificity. Nucleic Acids Res. 39: W362–W367. pmid:21558170
- 44. Bruland KW. Chapter 45 Trace Elements in Sea-Water. In: Riley JP, Chester R, editors. Chemical oceanography. New York: Academic Press. 1983. pp. 172–173.
- 45. Link B. Customer Relations Lead Aquatics and Firstrax Divisions, Instant Ocean, Spectrum Brands Inc. In: Personal e-mail. 2015 Jan 13.
- 46. Butler A, Walker JV (1993) Marine haloperoxidases. Chem Rev. 93: 1937–1944.
- 47. Fujimori DG, Walsh CT (2007) What’s new in enzymatic halogenations. Curr Opin Chem Biol. 11: 553–560. pmid:17881282
- 48. Blasiak LC, Drennan CL (2009) Structural Perspective on Enzymatic Halogenation. Acc Chem Res. 42: 147–155. pmid:18774824
- 49. Carey FA, Sundberg RJ (2007) Advanced organic chemistry 5th edition, Part A: Structure and mechanisms. NewYork: Springer. 300, 1018–1019, and 1022 p.
- 50. Neumann CA, Fujimori DG, Walsh CT (2008) Halogenation strategies in natrual product biosynthesis. Chem Biol. 15: 99–109. pmid:18291314
- 51. Cao Z, Shafer TJ, Murray TF (2011) Mechanisms of pyrthroid insecticide-induced stimulation of calcium influx in neocortical neurons. J Pharmacol Exp Ther. 336: 197–205. pmid:20881019
- 52. ProteoWizard. Available: http://proteowizard.sourceforge.net/#. Accessed 25 July 2014.
- 53. Mevers E, Liu WT, Engene N, Mohimani H, Byrum T, Pevzner PA, et al. (2011) Cytotoxic veraguamides, alkynyl bromide-containing cyclic depsipeptides from the marine cyanobacterium cf. Oscillatoria margaritifera. J Nat Prod. 5: 928–936.
- 54. Nikolenko S, Korobeynikov A, Alekseyev M (2013) BayesHammer: Bayesian clustering for error correction in single-cell sequencing. BCM Genomics 14: S7.
- 55. Nurk S, Bankevich A, Antipov D, Gurevich A, Korobeynikov A, Lapidus A, et al. (2013) Assembling single-cell genomes and mini-metagenomes from chimeric MDA products. J Comput Biol. 20: 714–737. pmid:24093227
- 56. Gao S, Sung WK, Nagaraian N (2011) Opera: reconstructing optimal genomic scaffolds with high-throughput pair-end sequences. J Comput Biol. 18: 1681–1691. pmid:21929371
- 57. Cao Z, George J, Gerwick WH, Baden DG, Rainier JD, Murray TF (2008) Influence of lipid-soluble gating modifier toxins on sodium influx in neocortical neurons. J Pharmacol Exp Ther. 326: 604–613. pmid:18448863
- 58. Jabba SV, Prakash A, Dravid SM, Gerwick WH, Murray TF (2010) Antillatoxin, a novel lipopeptide, enhances neurite outgrowth in immature cerebrocortical neurons through activation of voltage-gated sodium channels. J Pharmacol Exp Ther. 332: 698–709. pmid:20026674