We present the solution-state NMR structures and preliminary functional characterizations of three venom peptides identified from the spitting spider Scytodes thoracica. Despite little sequence identity to other venom peptides, structural characterization reveals that these peptides contain an inhibitor cystine knot motif common to many venom peptides. These are the first structures for any peptide or protein from spiders of the Scytodidae family. Many venom peptides target neuronal ion channels or receptors. However, we have not been able to determine the target of these Scytodes peptides so we can only state with certainty the channels and receptors that they do not target.
Citation: Ariki NK, Muñoz LE, Armitage EL, Goodstein FR, George KG, Smith VL, et al. (2016) Characterization of Three Venom Peptides from the Spitting Spider Scytodes thoracica. PLoS ONE 11(5): e0156291. https://doi.org/10.1371/journal.pone.0156291
Editor: Israel Silman, Weizmann Institute of Science, ISRAEL
Received: March 18, 2016; Accepted: May 11, 2016; Published: May 26, 2016
Copyright: © 2016 Ariki et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: Sequences are available from UniProt (accession numbers A0A0A0V662, A0A0A0V712, and A0A0A0V633 for U3-Sth1a, U3-Sth1h, and U5-Sth1a, respectively). Chemical shifts and restraints are available at the BioMagResBank (accession numbers 26002, 26003, and 26004 for U3-Sth1a, U3-Sth1h, and U5-Sth1a, respectively). Atomic coordinates are available from the Worldwide Protein Data Bank (accession codes 5FZV, 5FZW, and 5FZX for U3-Sth1a, U3-Sth1h, and U5-Sth1a, respectively).
Funding: N.M.L. was supported by Award Number R15GM085733 from the National Institute of General Medical Sciences. G.F.K. is supported by a Principal Research Fellowship from the Australian National Health & Medical Research Council and Discovery Grant DP130103813 from the Australian Research Council. N.K.A., L.E.M., E.L.A., F.R.G, and K.G.G. acknowledge support from the John S. Rogers Science Research Program at Lewis & Clark College. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Spider venoms are cocktails of peptides, proteins, and small organic molecules [1,2], with peptides being the most abundant compound. Each species of spider can produce hundreds or thousands of different venom peptides, which suggests that the number of unique venom peptides could be upwards of 12 million . However, to date, researchers have characterized only a small fraction of these peptides; some 1403 at last count . A handful of those characterized are potent modulators of neuronal ion channels and receptors [5–7]. While spiders primarily prey on insects, these venom peptides can target a wide range of invertebrate and vertebrate ion channels, including those found in humans and other mammals .
Disulfide-rich peptides are the dominant components in most spider venoms and they are often the key contributors to the activity and potency of the venom . While these disulfide-rich peptides can adopt a number of different structural motifs, the motif known as the inhibitor cystine knot (ICK) is the most widely observed. ICK peptides contain an antiparallel β-sheet stabilized by three or more disulfide bonds, creating a knot in the core of the peptide . The consensus sequence for the ICK motif  is: where “C” represents one of the six cysteines in the motif and “X” represents a stretch of other amino acid residues. The number of amino acid residues between cysteines varies, but is typically within the ranges given in the subscripts. The cystine knot is comprised of a ring formed between two disulfide bonds (CI–CIV, CII–CV) and the peptide backbone, with a third disulfide bond (CIII–CVI) penetrating the ring. This pseudo-knot confers these venom peptides with remarkable chemical and thermal stability; they are resistant to extremes of temperature and pH and have shown resistance to proteolytic degradation , even within gastric environments .
The repeated use of the ICK motif in spider venom presumably reflects the effectiveness of this structural class for generating large assortments of functionally diverse and stable peptides that can target a wide array of molecular targets. Gene duplication and diversification of the peptide sequence surrounding the knotted core has allowed this ICK structure to act as an adaptable framework for a wide range of peptide sequences  that can target neuronal ion channels with relative selectivity and potent paralytic or lethal function . Due to their robustness and target specificity, ICK peptides are promising candidates for use in preventing agricultural crop loss due to insect pests [8,14], and for use as therapeutic modulators of ion channels in humans .
In this paper, we present the structures of three venom peptides from the spider Scytodes thoracica. Scytodes are known as “spitting spiders” due to their unusual hunting method; they first restrain their prey with gluey spit before approaching and injecting venom to further immobilize their prey . The three structures that we determined are, to date, the only peptide or protein structures from Scytodidae, a family that includes five genera and 232 species . In addition to these structures, we present work towards the functional characterization of these peptides. Unfortunately, despite performing injections and topical applications of recombinantly-produced peptides into insects, fluorescent assays with ion channels, and radioligand screening against central nervous system receptors, we have not yet been able to determine the targets of these peptides.
mRNA Identification and Sequence Analysis
The sequences that we studied were first identified by Binford and colleagues using cDNA libraries generated from venom gland mRNA . The resulting data set contained more than 50 sequences that had a high likelihood of being venom toxins based on sequence homology and the pattern of cysteine residues.
From this set of putative venom toxin sequences, we selected three for further study based on the amount of peptide produced using our bacterial expression system. These three peptides, which had been given the names U3-scytotoxin-Sth1a, U3-scytotoxin-Sth1h, and U5-scytotoxin-Sth1a following the proposed unified nomenclature for spider toxins , will be for purposes of brevity referred to as U3-Sth1a, U3-Sth1h, and U5-Sth1a, respectively, in the following text. The full sequences for U3-Sth1a, U3-Sth1h, and U5-Sth1a are shown in Fig 1 and the UniProt accession numbers are A0A0A0V662, A0A0A0V712, and A0A0A0V633, respectively (see Table 1).
Alignment of Scytodes venom peptides illustrating the signal sequence, propeptide, and mature toxin regions. The signal sequence predicted by SignalP  is two residues shorter for U5-Sth1a than the other two peptides. The arrow after the processing quadruplet motif (PQM)  indicates the predicted cleavage sites for the mature toxins. The experimentally-determined disulfide-bond connectivity is shown below the alignment. Sequences were aligned using ClustalX 2.1  and visualized using JalView 2.8.1 . The coloring makes use of the default ClustalX color scheme, which is a function of sequence identity and amino acid type.
Venom peptides are typically expressed as prepropeptides containing a conserved N-terminal hydrophobic α-helical signaling sequence, a mature toxin sequence at the C-terminal end, and a propeptide region connecting the two. The N-terminal signaling sequence directs the translation of the prepropeptide into the lumen of the endoplasmic reticulum [1,18]. Subsequent proteolytic cleavage steps remove the N-terminal signaling sequence and propeptide, leaving only the mature toxin sequence. This mature toxin sequence then usually folds without any further post-translational modifications into an active ICK peptide [19,20].
For the three Scytodes peptide sequences, we predicted the location of the signal sequences indicated in Fig 1 using SignalP 4.0 . As is typical, the signal sequences are rich in hydrophobic amino acids (shown colored blue). The location of the cleavage site that results in the mature toxin is shown by an arrow in Fig 1. This cleavage site was predicted using the consensus sequence for the processing quadruplet motif (PQM) . In the consensus sequence for the PQM: where Y is the first amino acid of the mature toxin sequence, the cleavage site for the mature toxin is immediately after the arginine (R), and at least one of the three amino acids prior to the arginine (labeled X) is a glutamate (E). For U5-Sth1a, there is another match to the PQM consensus sequence immediately after the signal sequence. However, sequence alignment with other known full-length spider venom toxin sequences indicates that cleavage at this site would result in an uncharacteristically short linker sequence. In addition, proteomics work by Binford and colleagues using mass spectrometry  confirmed the presence of U5-Sth1a with the cleavage pattern indicated in Fig 1 in crude Scytodes thoracica venom.
Vector Design for Recombinant Expression
Codon-optimized genes encoding each of the mature toxins were synthesized and inserted into a variant of the pLic-MBP expression vector  by GeneArt (Invitrogen). The expression product from this system is a fusion protein with a signal sequence that directs transport of the fusion protein to the periplasm, a His6 tag, and maltose binding domain tag at the N-terminus of the toxin peptide. To allow for cleavage of the mature peptide from the rest of the fusion protein, we included a cleavage site for tobacco etch virus (TEV) protease immediately before the toxin sequence. As the mature toxin sequence for U5-Sth1a begins with an aspartate, it was possible to engineer the TEV protease cleavage site for the U5-Sth1a construct without introducing a non-native amino acid to the mature toxin sequence. For U3-Sth1a and U3-Sth1h, however, providing a TEV protease cleavage site necessitated the inclusion of a non-native glycine residue at the N-terminus of the mature toxin sequences. Consequently, the numbering of the amino acid sequences for the structures of U3-Sth1a and U3-Sth1h begins at zero rather than at one.
U3-Sth1a and U3-Sth1h Peptide Expression and Purification
U3-Sth1a and U3-Sth1h samples were generated using a modified high-density expression protocol . First, pLICC vector containing either the U3-Sth1a or the U3-Sth1h insert was used to transform Escherichia coli BL21 (DE3) cells. The transformed cells were incubated overnight at 37°C in Luria-Bertani (LB) medium with ampicillin (100 μg/ml) and used to generate a glycerol stock that was stored at –70°C. A scraping of the resulting glycerol stock was used to inoculate a 1 mL starter culture of LB medium containing ampicillin (100 μg/ml), and grown overnight at 37°C. This starter culture was used to inoculate a 1 L baffled flask containing 250 mL of ZYP-5052 media  containing carbenicillin (60 μg/ml). The ZYP-5052 media was adjusted to pH 8 prior to inoculation. The cell culture was allowed to grow at 37°C for approximately 12 h at which point the culture reached an OD600 of 5–7.
The cells were then spun down at 2500 rpm for 15 min and resuspended in 50% of the initial volume in a minimal media  that had been adjusted to pH 8 and contained carbenicillin (60 μg/ml). This medium contained 15N NH4Cl as the nitrogen source and either unlabeled glucose or 13C6-glucose as the carbon source. After 1 h of growth, the cells were induced by adding enough isopropyl β-d-1-thiogalactopyranoside (IPTG) to reach a concentration of 40 μM in the culture. The cells were incubated at 22°C for 12 h after induction and then harvested by centrifugation at 4000 rpm for 30 min.
The pellets were resuspended in a lysis buffer containing 20 mM phosphate buffer (pH 7.4), 20 mM imidazole, and 500 mM sodium chloride. After lysis by ultrasonication, the sample was centrifuged at 18,000 rpm at 4°C for 30 min. The resulting supernatant was purified using immobilized metal ion affinity chromatography (IMAC) with a HisTrap FF 5 mL column (GE Life Sciences) on an ÄKTAprime plus chromatography system (GE Life Sciences). After eluting the fusion protein from the column and buffer exchange using an Amicon centrifugal filter unit (Millipore), we cleaved the purified fusion protein overnight at room temperature using TEV protease in a redox buffer of 0.6 mM reduced glutathione (GSH)/0.4 mM oxidized glutathione (GSSG).
The cleavage reaction mixture was then fractionated using reverse-phase liquid chromatography with a Jupiter C18 column (Phenomenex) and a water/acetonitrile gradient. Fractions containing U3-Sth1a and U3-Sth1h eluted at 40% acetonitrile. Chromatography fractions were dried under vacuum and then rehydrated them in a buffer appropriate for NMR spectroscopy (95% H2O/5% D2O/20 mM sodium phosphate pH 6.5/30 mM sodium chloride). Yields of fusion protein were as high as 150 mg per liter of culture, resulting in 2–3 mg of venom peptide after the final purification step.
U5-Sth1a Peptide Expression and Purification
The expression of U5-Sth1a followed a procedure that we previously described . Briefly, the sample was prepared as follows: pLICC vector containing the U5-Sth1a insert was used to transform E. coli BL21 (DE3) cells, which were incubated at 22°C for 2–3 days using either a 15N-labeled or a 13C,15N-labeled autoinducing minimal medium . The cells were then centrifuged and the resulting pellet was lysed using ultrasonication. After ultracentrifugation, the resulting supernatant was purified using IMAC. The fraction containing the fusion protein was buffer exchanged, cleaved with TEV protease at room temperature overnight in a redox buffer containing 0.6 mM GSH and 0.4 mM GSSG. This cleavage reaction mixture was then fractionated using reverse-phase liquid chromatography with a water/acetonitrile gradient. Fractions containing U5-Sth1a eluted at 31% acetonitrile. Chromatography fractions containing U5-Sth1a were dried under vacuum and then rehydrated them in a buffer appropriate for NMR spectroscopy (95% H2O/5% D2O/20 mM sodium phosphate pH 6.5/30 mM sodium chloride). Typical yields of U5-Sth1a fusion protein were 100–150 mg for a 0.5 L culture, resulting in 1–2 mg of venom peptide after the final purification step.
Mass spectra (Fig 2) were acquired on a Thermo Scientific Velos ion trap mass spectrometer with electrospray ionization. The sample used for U3-Sth1a was unlabeled whereas the samples used for U3-Sth1h and U5-Sth1a were 15N-labeled. The mass-to-charge ratios are consistent with fully-oxidized peptides (i.e., formation of all three disulfide bonds).
Electrospray ionization mass spectra of U3-Sth1a (top), U3-Sth1h (middle), and U5-Sth1a (bottom). Inset spectra are high-resolution zoom scans for the indicated peaks. The monoisotopic masses (in amu) and calculated mass-to-charge ratios (m/z) for the reduced and oxidized peptides are provided on the right. The sample used for U5-Sth1a was unlabeled whereas the samples used for U3-Sth1h and U5-Sth1a were 15N-labeled. The mass-to-charge ratios are consistent with fully-oxidized peptides (i.e., formation of all three disulfide bonds).
NMR spectra were acquired at 600 MHz for 1H on a Bruker NMR spectrometer with a room temperature probe. The sample temperature for all NMR experiments was 298 K. Band-selective excitation short transient [27,28] (BEST) variants of the standard triple resonance sequences (HNCO, HN(CA)CO, HNCACB, HN(CO)CACB) were used for backbone assignment. 15N TOCSY-HSQC and HCCH-TOCSY spectra were used for side chain assignments. 2D NOESY spectra and 3D NOESY-HSQC spectra with simultaneous evolution of 13C and 15N chemical shifts were acquired for generating distance constraints. We processed the spectroscopic data using TopSpin 2.1 (Bruker Biospin) and interpreted it using Analysis 2.4.2  (Collaborative Computing Project for NMR). Representative 15N HMQC spectra for the three peptides studied are shown in Fig 3.
15N-HMQC spectra of U3-Sth1a (top left), U3-Sth1h (bottom left), and U5-Sth1a (top right). Sequence-specific residue assignments are indicated. Peaks from arginine and lysine side chains that were folded into the spectrum are shown in orange along with their 15N chemical shifts (in parentheses). The U5-Sth1a spectrum includes several peaks from a minor conformation; assignments for these peaks are shown with red labels.
We used chemical shift-matched peak lists from the NOESY spectra along with torsion angle restraints derived using TALOS-N  as input for ARIA 2.3 . Disulfide-bond constraints were used in later rounds of structure calculations once the cysteine connectivities were unambiguously determined from previous iterations. We deviated from the standard ARIA protocol by using a log-harmonic potential , by increasing the number of cooling steps from 5000 to 15,000 for the first simulated annealing period (cool1) and from 4000 to 12,000 for the second (cool2), and by calculating 30 structures (rather than 20) for each iteration. In the final iteration, 100 structures were calculated. The 20 structures with the lowest energy were selected for water refinement, and then used to generate the structural ensembles shown in Fig 4.
Ensemble (top) and cartoon (bottom) representations of U3-Sth1a (left), U3-Sth1h (center), and U5-Sth1a (right). The N and C termini are labeled in both the ensemble (top) and cartoon (bottom) representations of the structures of U3-Sth1a (left), U3-Sth1h (center), and U5-Sth1a (right). The three disulfide bonds in the cartoon representations are also labeled. The same orientation is used for the top and bottom representations. The ensembles are comprised of the 20 structures with the lowest total energy out of 100 calculated structures. The cartoon representations show the lowest energy structure from each ensemble. The N-terminal region shows some disorder for all three structures, but is especially apparent for residues 1–6 of U5-Sth1a.
The number and type of restraints used for the calculations, as well as the statistics for the structures, are provided in Table 2. As noted in Table 1, we deposited chemical shifts and restraints at the BioMagResBank (accession numbers 26002, 26003, and 26004 for U3-Sth1a, U3-Sth1h, and U5-Sth1a, respectively) and atomic coordinates in the Worldwide Protein Data Bank (accession codes 5FZV, 5FZW, and 5FZX for U3-Sth1a, U3-Sth1h, and U5-Sth1a, respectively).
Functional Analysis of Recombinant Toxins
Crickets (Acheta domestica) were injected with up to 2 μL of recombinant peptide solution with a concentration of 1 mg/mL (U3-Sth1a and U3-Sth1h) or 2 mg/mL (U5-Sth1a). Crickets were then observed every 10 min for 1 h (and once again after 24 h) to determine if there was a change in how quickly the crickets were able to right themselves after being flipped over. Although a change in the righting response was noted for some crickets, the number of affected crickets was not significantly different from crickets in a control group that were injected with insect saline (data not shown).
A single high dose of each peptide dissolved in 3.4 μL of water was injected laterally into the thorax of adult blowflies (Lucilia cuprina; weight of 26.0–27.9 mg) as previously described . The doses used for these injections were 350 nmol of U3-Sth1a per gram of body weight, 290 nmol/g for U3-Sth1h, and 310 nmol/g for U5-Sth1a. All flies were then kept individually in 2 mL tubes and observed at 0.5, 1 and 24 h post-treatment for signs of paralysis or lethality.
Fluorescent Ca2+ assays in neuroblastoma cells were performed using a fluorescent imaging plate reader (FLIPRTetra, Molecular Devices, Sunnyvale, CA) as previously described . SH-SY5Y cells plated on black-walled 384-well imaging plates (Corning) were loaded with Calcium 4 no-wash dye (Molecular Devices) for 30 min to assess inhibition of responses mediated by Nav1.7 voltage-gated sodium channels (stimulation by veratridine (4 μM) in the presence of OD1 (30 nM)), CaV1.3 voltage-gated calcium channels (stimulation by KCl (90 mM)/CaCl2 (5 mM)), homomeric α7 (stimulation by choline (30 μM) in the presence of PNU120596 (10 μM)), and heteromeric α3-containing nicotinic acetylcholine receptors (stimulation by nicotine (30 μM)). The assays were performed in PSS (physiological salt solution, pH 7.4) containing 140 mM NaCl, 11.5 mM glucose, 5.9 mM KCl, 1.4 mM MgCl2, 1.2 mM NaH2PO4, 5 mM NaHCO3, 1.8 mM CaCl2, and 10 mM 4-(2-hydroxyethyl)-1-piperazineethanesulfonic acid (HEPES). The peptides (final concentration 10 μM) caused a small, but probably not functionally relevant, effect on all four channels as summarized in Table 3.
Samples of all three venom peptides were sent to the National Institute of Mental Health (NIMH) Psychoactive Drug Screening Program (PDSP)  for activity testing. The purpose of the PDSP is to screen potentially psychoactive compounds by testing them for activity on human and rodent central nervous system (CNS) receptors and transporters. This screen used radioligand binding assays to test for activity (i.e., inhibition or activation) against the 45 CNS targets listed in Table 4 including 11 serotonin receptors, nine adrenegic receptors, five dopamine receptors, five muscarinic acetylcholine receptors, four histamine receptors, three neurotransmitter transporters, three opioid receptors, and two sigma receptors. The PDSP tested U3-Sth1a, U3-Sth1h, and U5-Sth1a in quadruplicate against each of these targets, but all three failed to elicit a significant level of inhibition or activation of any of the targets.
Results and Discussion
The full prepropeptide sequences for U3-Sth1a, U3-Sth1h, and U5-Sth1a derived from a Scytodes thoracica venom-gland cDNA library are shown in Fig 1. The full sequences match the pattern expected for small peptides from spider venoms, with a hydrophobic signal sequence and a short propeptide sequence with a PQM motif preceding the mature toxin sequence. For both U3-Sth1a and U3-Sth1h, the mature toxin sequences match the consensus sequence for the ICK motif provided in the introduction. In the case of U5-Sth1a, the mature toxin sequence nearly fits the ICK consensus sequence albeit with one additional amino acid between the second and third cysteines. In addition, all three sequences follow the CX3GX2C motif between the first and second cysteines that is commonly found in venom peptide toxins from theraphosid and ctenid spiders. Consistent with the propeptide cleavage site we predicted based on the PQM motif, previous work  using mass spectrometry confirmed that U5-Sth1a (as well as a paralog of U3-Sth1a and U3-Sth1h) is present in crude Scytodes thoracica venom.
We used BLASTp  to search for mature toxin sequences in the ArachnoServer database  that align closely with the mature toxin sequences for U3-Sth1a, U3-Sth1h, and U5-Sth1a. Except for other sequences from Scytodes thoracica, no sequences matched U3-Sth1a or U3-Sth1h with expect values less than 10−5 (a commonly used threshold for sequence homology; ). For example, the closest match for U3-Sth1a from another species was U4-ctenitoxin-Pr1a, a peptide from the venom of the spider Phoneutria reidyi that moderately inhibits L-type voltage-gated calcium channels (CaV1/CACNA1)  matched with an expect value of 0.009. For U3-Sth1h, the closest match from another species was U21-ctenitoxin-Co1a from the venom of the spider Ctenus ornatus which has an unknown molecular target , with an expect value of 0.016. In both cases, the main residues that align are the cysteines and a few other residues, as shown in Fig 5.
Alignment of the mature toxin sequences for U3-Sth1a and U3-Sth1h (top) of U5-Sth1a (bottom) with their closest matches to venom peptides from different species found in the ArachnoServer toxin peptide database using BLASTp. Sequences were aligned using ClustalX 2.1 and visualized using JalView 2.8.1. The coloring makes use of the default ClustalX color scheme, which is a function of sequence identity and amino acid type.
There was one significant match for U5-Sth1a from another species; the peptide U1-sicaritoxin-Sdo1a  from the venom of Sicarius dolichocephalous has an unknown molecular function and matched with an expect value of 2×10−8. Potentially, this match may stem from the evolutionary relationship between Sicarius dolichocephalous and Scytodes thoracica as it has been proposed that their families (Sicariidae and Scytodidae, respectively) are sister taxa  that diverged at least 100 million years ago .
The structures for U3-Sth1a, U3-Sth1h, and U5-Sth1a determined using NMR spectroscopy are shown in Fig 4 as an ensemble of 20 structures (top) and a cartoon representation of the lowest energy structure (bottom). As predicated based on their sequences and origin, all three structures were found to contain an ICK motif, with disulfide bonds 1, 2, and 3 formed by cysteines I and IV, II and V, and III and VI, respectively. For all three peptides, the β-hairpin between cysteines V and VI is unusually truncated and, in the case of U5-Sth1a, the region between cysteines II and III is longer than that found in most other venom toxins.
When expressing proteins and peptides with multiple disulfide bonds there is the potential for these bonds to not form or to form with the incorrect topology. This is particularly important in cases, such as this one, where the peptides have not been functionally characterized as we do not have assays to determine whether the structures that we observed are functionally relevant. Nevertheless, there are several reasons why we think our structures represent the native folds. First, it has been shown repeatedly that spider venom toxins are able to fold correctly in vitro [41–43]. Second, the connectivity of the cysteines in our structures follow the pattern that we expected based on comparison with known spider venom toxins. Third, the chemical shifts for the cysteine α and β carbons  in our NMR spectra are consistent with cysteines in disulfide bonds (i.e., completely oxidized) and inconsistent with reduced cysteines. Finally, testing of our peptide samples with Ellman's Reagent (5,5’-dithiobis(2-nitrobenzoic acid), DTNB)  indicated that there are no reduced cysteines present. Taken together, these data indicate that the cysteines in U3-Sth1a, U3-Sth1h, and U5-Sth1a are fully oxidized and connected with the expected topology, leading us to believe that these peptides are correctly folded.
The 15N HMQC spectrum for U5-Sth1a (Fig 3, lower spectrum) reveals the presence of a minor conformation of this peptide. The minor conformation must have very similar surface properties to the major conformation as we could not separate the two conformations using liquid chromatography. In addition, the presence of a minor component is probably not due to proteolysis as we did not observe a secondary peak in electrospray ionization mass spectra (Fig 2). As the chemical shifts of the minor conformation in the 15N HMQC spectrum differ only slightly from those of the major conformation U5-Sth1a, it must adopt a very similar fold. One possibility is that the minor conformation corresponds to U5-Sth1a with incomplete disulfide bond formation. However, the chemical shifts for the cysteine α and β carbons of the minor conformation are consistent with all of the cysteines being oxidized (and therefore in disulfide bonds). Another possibility is that the minor conformation contains disulfide bonds in a different topology (i.e., between cysteines II and VI, and III and V). This could happen with very little distortion of the overall structure as these cysteines are close to one another. To test this hypothesis, we mapped the amide chemical shift differences between the major and minor conformations onto the structure. We found that the largest changes are in the part of the structure with the short α-helical region (residues 16–23), whereas the amide chemical shifts for the cysteines that would have non-native connections (14C, 24C, 30C 35C) change only modestly. This leads us to believe that both conformations have the expected disulfide bonding pattern, and that the difference is due to conformational exchange in the region of residues 16–23 that is slow on the NMR timescale.
The conserved pattern of cysteines, the presence of the ICK structural motif, and the venom gland source of U3-Sth1a, U3-Sth1h, and U5-Sth1a leads us to believe that these peptides are likely to be neurotoxins. Evolution has fine-tuned a majority of ICK peptides from spider venoms to target insect ion channels. While some ICK peptides evolved to target mammalian channels , a large majority have not, so in many cases any interaction in mammals is due to structural homology of the ion channel. Unfortunately, our attempts so far at functional characterization have not allowed us to determine the targets of the three Scytodes peptides. That the peptides failed to elicit a significant level of activation or inhibition of 45 different human and rodent CNS targets, as tested by the PDSP, is not surprising, as this screen did not involve targets from insects and the targets were mainly receptors rather than ion channels.
Injection of high doses of U3-Sth1a (350 nmol/g) and U5-Sth1a (310 nmol/g) into sheep blowflies did not cause any paralytic or lethal activity. Likewise, injection of doses of up to 6 nmol/g (U3-Sth1a and U3-Sth1h) and 12 nmol/g (U5-Sth1a) into crickets did not cause any paralytic or lethal activity. The lack of response when injected into crickets and blowflies is significant, but a likely explanation is that the particular peptides in this study have a high specificity for targets in prey species that are evolutionarily distant from the ones that we have studied so far.
A more remote possibility is that the peptides that we have studied are no longer functionally relevant. Spider venoms are cocktails that contain hundreds of components, the majority of which are ICK peptides. The large number of ICK peptides in the venom from individual species has evolved due to a combination of gene duplication events and focal hypermutation of the intercysteine regions [12,47, 48]. The peptides from the venom of a single species of spider vary considerably in target and species-specificity. The complexity of the venom cocktail allows spiders to paralyze a wide variety of prey species, but it also means that loss-of-function mutations of individual peptides might be tolerated. However, the genes for non-functional peptides would no longer be under selective pressure and would eventually mutate to sequences that would not be expressed or that would not be recognizable as spider venom peptides. As the sequences and structures for U3-Sth1a, U3-Sth1h, and U5-Sth1a follow the patterns that we expect for ICK venom peptides from spiders, and evidence exists that U5-Sth1a and at least one paralog of U3-Sth1a/U3-Sth1h are present in crude venom , we feel that the most parsimonious explanation for our inability to observe any biological of activity for these peptides is that they are functional but their molecular targets remain elusive.
We have presented the first three-dimensional structures for spider-venom peptides from the Scytodidae family. These peptides are structurally similar to other venom peptides in that they contain an ICK structural motif, but show little sequence identity to peptides from other species of spiders. Although the targets of these peptides remain elusive, we hope that further study will help determine their functions.
We thank Dr. Greta Binford and Dr. Pamela Zobel-Thropp (Department of Biology, Lewis & Clark College, Portland, Oregon, United States) for their work in discovering the venom peptide sequences characterized in this paper, and Dr. Geoff Brown (Department of Agriculture, Fisheries and Forestry, Brisbane, Australia) for supplying the blowflies used for the injection studies. Screening for binding to vertebrate receptors was generously provided by the National Institute of Mental Health's Psychoactive Drug Screening Program, Contract # HHSN-271-2013-00017-C (NIMH PDSP). The NIMH PDSP is Directed by Bryan L. Roth MD, PhD at the University of North Carolina at Chapel Hill and Project Officer Jamie Driscoll at NIMH, Bethesda MD, USA.
Conceived and designed the experiments: NML GFK VH IV. Performed the experiments: NKA LEM ELA FRG KGG VLS IV VH NML. Analyzed the data: NML NKA IV VH. Contributed reagents/materials/analysis tools: NML GFK. Wrote the paper: NML NKA IV VH GFK.
- 1. Saez NJ, Senff S, Jensen JE, Er SY, Herzig V, Rash LD, et al. Spider-Venom Peptides as Therapeutics. Toxins 2010;2: 2851–2871. pmid:22069579
- 2. Kuhn-Nentwig L, Stoecklin R, Nentwig W. Venom composition and strategies in spiders: is everything possible? Adv. Insect Physiol 2011;40: 1–86.
- 3. Escoubas P, King GF. Venomics as a drug discovery platform. Expert Rev Proteomics 2009;6: 221–224. pmid:19489692
- 4. Herzig V, Wood DLA, Newell F, Chaumeil P-A, Kaas Q, Binford GJ, et al. ArachnoServer 2.0, an updated online resource for spider toxin sequences and structures. Nucleic Acids Res 2011;
- 5. King GF. Modulation of insect Ca(v) channels by peptidic spider toxins. Toxicon 2007;49: 513–530. pmid:17197008
- 6. Nicholson G.M. Insect-selective spider toxins targeting voltage-gated sodium channels. Toxicon 2007;49: 490–512. pmid:17223149
- 7. Klint JK, Smith JJ, Vetter I, Rupasinghe DB, Er SY, Senff S, et al. Seven novel modulators of the analgesic target Na 1.7 uncovered using a high-throughput venom-based discovery approach. Br J Pharmacol 2015;172: 2445–58. pmid:25754331
- 8. King GF, Hardy MC. Spider-venom peptides: structure, pharmacology, and potential for control of insect pests. Annu Rev Entomol. 2013;58: 475–496. pmid:23020618
- 9. Norton RS, Pallaghy PK. The cystine knot structure of ion channel toxins and related polypeptides. Toxicon 1998;36: 1573–1583. pmid:9792173
- 10. Nicholson GM. Spider Venom Peptides. In: Kastin A, editor. The Handbook of Biologically Active Peptides. San Diego: Elsevier; 2006. pp. 369–379.
- 11. Herzig V, King GF. The Cystine Knot Is Responsible for the Exceptional Stability of the Insecticidal Spider Toxin ω-Hexatoxin-Hv1a. Toxins 2015;7: 4366–4380. pmid:26516914
- 12. Sollod BL, Wilson D, Zhaxybayeva O, Gogarten JP, Drinkwater R, King GF. Were arachnids the first to use combinatorial peptide libraries? Peptides 2005;26: 131–139. pmid:15626513
- 13. Daly NL, Craik DJ. Bioactive cystine knot proteins. Curr Opin Chem Biol 2011;15: 362–368. pmid:21362584
- 14. Windley MJ, Herzig V, Dziemborowicz SA, Hardy MC, King GF, Nicholson GM. Spider-Venom Peptides as Bioinsecticides. Toxins 2012;4: 191–227. pmid:22741062
- 15. Zobel-Thropp PA, Correa SM, Garb JE, Binford GJ. Spit and Venom from Scytodes Spiders: A Diverse and Distinct Cocktail. J. Proteome Res. 2014;13: 817–835. pmid:24303891
- 16. Natural History Museum Bern. World Spider Catalog version 17.0; c2016 [updated 2016 Mar 17; cited 2016 Mar 17]. Available: http://wsc.nmbe.ch.
- 17. King GF, Gentz MC, Escoubas P, Nicholson GM. A rational nomenclature for naming peptide toxins from spiders and other venomous animals. Toxicon 2008;52: 264–276. pmid:18619481
- 18. Buczek O, Bulaj G, Olivera BM. Conotoxins and the posttranslational modification of secreted gene products. Cell Mol Life Sci 2005;62: 3067–3079. pmid:16314929
- 19. Liang S. An overview of peptide toxins from the venom of the Chinese bird spider Selenocosmia huwena. Toxicon 2004;43: 575–585. pmid:15066414
- 20. Mouhat S, Adreotti N, Jouriou B, Sabatier JM. Animal toxins acting on voltage-gated potassium channels. Curr Pharm Des 2008;14: 2503–2518. pmid:18781998
- 21. Petersen TN, Brunak S, von Heijne G, Nielsen H. SignalP 4.0: discriminating signal peptides from transmembrane regions. Nat Methods 2011;8: 785–786. pmid:21959131
- 22. Kozlov S, Malyavka A, McCutchen B, Lu A, Schepers E, Herrmann R, et al. A Novel Strategy for the Identification of Toxin-like Structures in Spider Venom. Proteins 2005;59: 131–140. pmid:15688451
- 23. Klint JK, Senff S, Saez NJ, Seshadri R, Lau HY, Bende NS, et al. Production of Recombinant Disulfide-Rich Venom Peptides for Structural and Functional Analysis via Expression in the Periplasm of E. coli. PLoS ONE 2013;8(5): e63865. pmid:23667680
- 24. Sivashanmugam A, Murray V, Cui C, Zhang Y, Wang J, Li Q. Practical Protocols for Production of Very High Yields of Recombinant Proteins using Escherichia coli. Protein Science 2009;18: 936–94. pmid:19384993
- 25. Studier FW Protein Production by Auto-Induction in High-Density Shaking Cultures. Protein Expr and Purif 2005;41: 207–234.
- 26. Loening NM, Wilson ZN, Zobel-Thropp PA, Binford GJ. Solution structures of two homologous venom peptides from Sicarius dolichocephalus. PLoS ONE 2013;8(1): e54401. pmid:23342149
- 27. Schanda P, Van Melckebeke H, Brutscher B. Speeding up three-dimensional protein NMR experiments to a few minutes. J Am Chem Soc 2006;128: 9042–9043. pmid:16834371
- 28. Lescop E, Schanda P, Brutscher B. A set of BEST triple-resonance experiments for time-optimized protein resonance assignment. J Magn Reson 2007;187: 163–169. pmid:17468025
- 29. Vranken WF, Boucher W, Stevens TJ, Fogh RH, Pajon A, et al. The CCPN data model for NMR spectroscopy: development of a software pipeline. Proteins 2005;59: 687–696. pmid:15815974
- 30. Shen Y, Bax A. Protein backbone and sidechain torsion angles predicted from NMR chemical shifts using artificial neural networks. J Biomol NMR 2013;56: 227–241. pmid:23728592
- 31. Rieping W, Habeck M, Bardiaux B, Bernard A, Malliavin TE, Nilges M. ARIA2: automated NOE assignment and data integration in NMR structure calculation. Bioinformatics 2007;23: 381–382. pmid:17121777
- 32. Nilges M, Bernard A, Bardiaux B, Malliavin TE, Habeck M, et al. Accurate NMR Structures Through Minimization of an Extended Hybrid Energy. Structure 2008;16: 1305–1312. pmid:18786394
- 33. Bende NS, Dziemborowicz S, Herzig V, Ramanujam V, Brown GW, Bosmans F, et al. The insecticidal spider toxin SFI1 is a knottin peptide that blocks the pore of insect voltage-gated sodium channels via a large β-hairpin loop. Febs J 2015;282: 904–920. pmid:25559770
- 34. Dutertre S, Jin AH, Vetter I, Hamilton B, Sunagar K, Lavergne V, et al. Evolution of separate predation- and defence-evoked venoms in carnivorous cone snails. Nat Commun 2014;5:3521. pmid:24662800
- 35. Besnard J, Ruda GF, Setola V, Abecassis K, Rodriguiz RM, Huang XP, et al. Automated design of ligands to polypharmacological profiles. Nature 2012; 492:215–220. pmid:23235874
- 36. Altschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, Miller W, et al. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 1997;25: 3389–3402. pmid:9254694
- 37. Zhang Y, Chen J, Tang X, Wang F, Jiang L, Xiong X, et al. Transcriptome analysis of the venom glands of the Chinese wolf spider Lycosa singoriensis. Zoology 2010;113: 10–18. pmid:19875276
- 38. Lúcio AD, Campos FV, Richardson M, Cordeiro MN, Mazzoni MS, de Lima ME, et al. A new family of small (4kDa) neurotoxins from the venoms of spiders of the genus Phoneutria. Protein Pept Lett 2008;15: 700–708. pmid:18782065
- 39. Platnick NI, Coddington JA, Forster RR, Griswold CE. Spinneret morphology and the phylogeny of Haplogyne spiders (Araneae, Araneomorphae). Am. Mus. Novitat. 1991;3016: 1–73.
- 40. Binford GJ, Callahan MS, Bodner MR, Rynerson MR, Berea Núñez P, Ellison CE, et al. Phylogenetic relationships of Loxosceles and Sicarius spiders are consistent with Western Gondwanan vicariance. Mol Phylogenet Evol 2008;49: 538–553. pmid:18755282
- 41. Liang S, Shu Q, Wang X, Zong X. Oxidative Folding of Reduced and Denatured Huwentoxin-I. J Protein Chem 1999;18: 619–625. pmid:10609637
- 42. Ostrow KL, Mammoser A, Suchyna T, Sachs F, Oswald R, Kubo S, et al. cDNA sequence and in vitro folding of GsMTx4, a specific peptide inhibitor of mechanosensitive channels. Toxicon 2003;42: 263–274. pmid:14559077
- 43. Jensen JE, Durek T, Alewood PF, Adams DJ, King GF, Rash LD. Chemical synthesis and folding of APETx2, a potent and selective inhibitor of acid sensing ion channel 3. Toxicon 2009;54: 56–61. pmid:19306891
- 44. Sharma D, Rajarathnam K. 13C NMR chemical shifts can predict disulfide bond formation. J Biomol NMR 2000;18: 165–171. pmid:11101221
- 45. Riddles PW, Blakely RL, Zerner B. Reassessment of Ellman’s Reagent. Methods in Enzymology 1983;91: 49–60. pmid:6855597
- 46. Bohlen CJ, Priel A, Zhou S, King D, Siemens J, and Julius D. A bivalent tarantula toxin activates the capsaicin receptor, TRPV1, by targeting the outer pore domain. Cell 2010;141: 834–845. pmid:20510930
- 47. Escoubas P, Rash LD. Tarantulas: eight-legged pharmacists and combinatorial chemists. Toxicon 2004;43: 555–574. pmid:15066413
- 48. Pineda SS, Sollod BL, Wilson D, Darling A, Sunagar K, Undheim EAB, et al. Diversification of a single ancestral gene into a successful toxin superfamily in highly venomous Australian funnel-web spiders. BMC Genomics 2014;15: 177. pmid:24593665
- 49. Laskowski RA, Rullmann JAC, MacArthur MW, Kaptein R, Thornton JM. AQUA and PROCHECK-NMR: programs for checking the quality of protein structures solved by NMR. J Biomol NMR 1996;8: 477–486. pmid:9008363
- 50. Chen VB, Arendall WB 3rd, Headd JJ, Keedy DA, Immormino RM, Kapral GJ, et al. MolProbity: all-atom structure validation for macromolecular crystallography. Acta Crystallogr D Biol Crystallogr 2010;66: 12–21. pmid:20057044
- 51. Larkin MA, Blackshields G, Brown NP, Chenna R, McGettigan PA, McWilliam H, et al. Clustal W and Clustal X version 2.0. Bioinformatics 2007;23: 2947–2948. pmid:17846036
- 52. Waterhouse AM, Procter JB, Martin DMA, Clamp M, Barton GJ. Jalview Version 2—a multiple sequence alignment editor and analysis workbench. Bioinformatics 2009;25: 1189–1191. pmid:19151095