Animal venoms represent a vast library of bioactive peptides and proteins with proven potential, not only as research tools but also as drug leads and therapeutics. This is illustrated clearly by marine cone snails (genus Conus), whose venoms consist of mixtures of hundreds of peptides (conotoxins) with a diverse array of molecular targets, including voltage- and ligand-gated ion channels, G-protein coupled receptors and neurotransmitter transporters. Several conotoxins have found applications as research tools, with some being used or developed as therapeutics. The primary objective of this study was the large-scale discovery of conotoxin sequences from the venom gland of an Australian cone snail species, Conus victoriae. Using cDNA library normalization, high-throughput 454 sequencing, de novo transcriptome assembly and annotation with BLASTX and profile hidden Markov models, we discovered over 100 unique conotoxin sequences from 20 gene superfamilies, the highest diversity of conotoxins so far reported in a single study. Many of the sequences identified are new members of known conotoxin superfamilies, some help to redefine these superfamilies and others represent altogether new classes of conotoxins. In addition, we have demonstrated an efficient combination of methods to mine an animal venom gland and generate a library of sequences encoding bioactive peptides.
Citation: Robinson SD, Safavi-Hemami H, McIntosh LD, Purcell AW, Norton RS, Papenfuss AT (2014) Diversity of Conotoxin Gene Superfamilies in the Venomous Snail, Conus victoriae. PLoS ONE 9(2): e87648. doi:10.1371/journal.pone.0087648
Editor: Mande Holford, The City University of New York-Graduate Center, United States of America
Received: September 5, 2013; Accepted: December 28, 2013; Published: February 5, 2014
Copyright: © 2014 Robinson et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: RSN and AWP acknowledge fellowship support from the Australian National Health and Medical Research Council. HSH is supported by a Marie Curie Fellowship of the European Union. ATP was supported by an NHMRC Career Development Fellowship. The work was partially supported by the Victorian State Government Operational Infrastructure Support, Australian Government NHMRC IRIISS and the Australian Research Council. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Animal venoms represent a vast library of bioactive peptides and proteins. This is illustrated elegantly in cone snails (genus Conus), a group of carnivorous mollusks that exhibits a remarkable strategy for prey capture. A cone snail injects venom into its victim using a modified radula tooth, whereby components of the venom act potently and selectively at a range of molecular targets in the victim’s nervous system to achieve incapacitation. Cone snail venoms are remarkably complex, containing hundreds of unique bioactive peptides termed conotoxins (or conopeptides).
Molecular targets of individual conotoxins are diverse and include a range of voltage-gated ion channels, ligand-gated ion channels, G-protein coupled receptors and neurotransmitter transporters. As such, Conus venoms are an excellent source of pharmacological tools crucial to fundamental neuroscience research. Moreover, conotoxins have found use as therapeutics. An example is Ziconotide (Prialt®), the synthetic equivalent of ω-MVIIA from the venom of Conus magus, which is being used to treat chronic pain in cancer and AIDS patients. Several others also show potential and are currently undergoing development for the treatment of pathologies including postoperative and neuropathic pain, epilepsy, myocardial infarction and hypertension.
The epithelial cells lining the duct of a cone snail’s venom gland are rich in messenger RNAs (mRNAs) encoding conotoxins. These mRNAs are translated initially as inactive precursor peptides that require post-translational processing prior to secretion from the cell as the bioactive mature peptides. Conotoxin precursors exhibit a generally recognizable primary structure: a hydrophobic signal peptide (prepeptide) sequence, followed by a propeptide region and commonly a cysteine-rich mature peptide region. The signal sequence of a precursor peptide is responsible for targeting it to the cellular secretory pathway, but is removed prior to secretion of the mature peptide. Conotoxins can be classified into gene superfamilies according to this signal peptide sequence. Members of a conotoxin superfamily share a high percentage of sequence identity in their signal peptide sequence but less so in their propeptide sequence, and can be highly variable in their mature peptide sequence (often with the exception of the cysteine framework). A conotoxin’s cysteine framework refers to the characteristic arrangement of cysteine residues in its primary structure and is independent of disulfide connectivity (to date, approximately 25 distinct cysteine frameworks have been described in conotoxins). While there is no interdependence between gene superfamily and biological function, a conotoxin’s gene superfamily (and cysteine framework) remains a useful predictor of biological function.
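The notion of a cysteine framework described above can be made concrete with a short sketch. The function name and the toy sequences below are hypothetical illustrations and are not taken from the study; the sketch only derives the arrangement of cysteines from a primary sequence and, like the framework itself, says nothing about disulfide connectivity.

```python
def cysteine_pattern(mature_peptide: str) -> str:
    """Return the cysteine arrangement of a peptide, e.g. 'C-C-CC-C-C'.

    Adjacent cysteines are written together ('CC'); cysteines separated by
    one or more other residues are joined with '-'. A peptide without
    cysteines yields an empty string.
    """
    positions = [i for i, aa in enumerate(mature_peptide.upper()) if aa == "C"]
    if not positions:
        return ""
    pattern = "C"
    for prev, cur in zip(positions, positions[1:]):
        pattern += "C" if cur == prev + 1 else "-C"
    return pattern


# Hypothetical toy sequences (assumptions for illustration only).
toy_six_cys = "CAAGCSTTCCNDKCRGC"
toy_cys_free = "GAWSTPLKR"

print(cysteine_pattern(toy_six_cys))
print(repr(cysteine_pattern(toy_cys_free)))
```

Mapping a pattern such as this onto a numbered framework (I, V, VI/VII and so on) would additionally require a lookup table of the published framework definitions, which is deliberately left out here.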
The primary objective of this study was the large-scale discovery of novel conotoxin sequences from the venom gland of C. victoriae. The focus of this study, C. victoriae (Reeve, L.A., 1843) is a molluscivorous species of cone snail endemic to the coastline of north-western Australia. To date, it is best known as the source of α-conotoxin Vc1.1, a conotoxin with considerable potential for development as an analgesic drug. Other than Vc1.1, 23 unique conotoxin sequences from only a few gene superfamilies (A, O1, O2, T) are known from this species. Here we report the discovery of over 100 unique conotoxin sequences from 20 gene superfamilies. Many of the sequences identified are new members of known superfamilies and some will help to redefine these superfamilies. Other sequences represent altogether new classes of conotoxins. This work paints a comprehensive portrait of the molecular diversity present in Conus venom.
Sequencing, Assembly & Annotation
RNA was extracted from the venom gland of C. victoriae. A normalized cDNA library was generated and sequenced using the Roche 454 platform. Sequencing yielded (following clipping to remove 454 adapter sequences) a total of 701,536 reads (265,403,303 nucleotides (nt), minimum length: 2 nt, average length: 378 nt, median length: 419 nt, maximum length 920 nt).
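Summary statistics of the kind quoted above can be computed from a plain list of read lengths. The following is a minimal sketch with a hypothetical helper name and toy input; it is not the authors' pipeline. The last line simply checks that the reported total nucleotide count and read count agree with the reported average length.

```python
from statistics import mean, median


def summarize_read_lengths(lengths):
    """Return count, total nt, min, mean, median and max of read lengths."""
    return {
        "reads": len(lengths),
        "total_nt": sum(lengths),
        "min": min(lengths),
        "mean": mean(lengths),
        "median": median(lengths),
        "max": max(lengths),
    }


# Hypothetical toy read lengths in nt (not data from the study).
toy_lengths = [2, 150, 380, 419, 450, 920]
summary = summarize_read_lengths(toy_lengths)
print(summary)

# Consistency check on the figures reported in the text:
# total nucleotides divided by number of reads.
reported_mean = 265_403_303 / 701_536
print(round(reported_mean))
```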
Assembly with MIRA produced 40,513 contigs (from 463,701 reads longer than 30 nt) with an average length of 588 nt (median: 528 nt), a maximum of 7,406 nt and minimum of 30 nt (user-defined). A general annotation of the transcriptome using BLASTX revealed 7,818 contigs with significant similarity to sequences in the reference databases (UniProt/SwissProt and ConoServer).
While BLASTX was used for a general annotation of the transcriptome, profile hidden Markov models (pHMMs) were used (independently of BLAST) to annotate conotoxins. pHMMs were built based on known conotoxin superfamilies (as described in methods) and used to search the C. victoriae venom gland transcriptome. Briefly, 2,048 contigs (26%) were identified (using pHMM searches) as conotoxin-encoding (combined total from all superfamilies). In terms of sequencing reads, of those that were assembled, 100,846 (22%) corresponded to conotoxins. A total of 113 conotoxins was identified from 20 superfamilies, which are described in detail below.
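The proportions quoted above can be checked with simple arithmetic. The text does not state the denominator behind the 26% figure; the sketch below, which uses only numbers reported in this section, suggests it corresponds to the 7,818 BLASTX-annotated contigs rather than all 40,513 assembled contigs. That reading is an inference from the arithmetic, not a statement made by the authors.

```python
def percent(part: int, whole: int) -> float:
    """Return part as a percentage of whole."""
    return 100.0 * part / whole


total_contigs = 40_513       # contigs produced by the assembly
annotated_contigs = 7_818    # contigs with significant BLASTX similarity
conotoxin_contigs = 2_048    # contigs identified by pHMM searches
assembled_reads = 463_701    # reads longer than 30 nt used in the assembly
conotoxin_reads = 100_846    # assembled reads corresponding to conotoxins

pct_of_annotated = percent(conotoxin_contigs, annotated_contigs)
pct_of_all = percent(conotoxin_contigs, total_contigs)
pct_reads = percent(conotoxin_reads, assembled_reads)

print(f"{pct_of_annotated:.1f}% of annotated contigs")
print(f"{pct_of_all:.1f}% of all contigs")
print(f"{pct_reads:.1f}% of assembled reads")
```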
The C. victoriae cDNA library was subjected to normalization in an effort to enhance the diversity of transcripts observed. Normalization refers to a process by which distinct cDNAs are equalized and is useful to identify genes transcribed at a relatively low level (in a cellular transcriptome the number of mRNA copies per gene may differ by several orders of magnitude). Normalization has the effect of “dampening down” highly abundant transcripts and consequently increasing the proportion of reads encoding rare transcripts. We opted to utilize normalization as the goal of this study was to maximize the number of unique conotoxin transcripts identified. One consequence of normalization is that the number of sequencing reads no longer directly reflects transcript expression level. However, it is not expected to alter the rank order of gene expression, such that a highly abundant transcript will still be represented by the highest number of reads while a low abundance transcript will be represented by few. With this in mind, we investigated those contigs that were generated from the highest number of sequencing reads. Conotoxins made up the majority of high-ranking contigs (45 of the top 50 annotated contigs). The 10 contigs with highest read coverage included the four conotoxins Vc5.1, Vc1.1, Vc5.3 and T_Vc5.9 (described in detail below), as well as two contigs with significant similarity to each of the cytochrome c oxidase subunits 1 and 2 [UniProt: Q34941, P00409] and a contig with significant similarity to the human mucin-6 protein [UniProt: Q6W4X9], a secreted protein that plays an important role in the protection of epithelial tissues. Most of the other high-ranking non-toxin contigs were associated with the processing and transport of secreted proteins. These included several potential chaperones of the heat shock protein family [UniProt: P08712, Q16956, Q05557, Q71U34, P19120, Q9Y3Q3, P41827], protein disulfide isomerases [UniProt: P09103, P05307] and a neuroendocrine convertase [UniProt: P63240]. Two contigs with significant similarity to proteins of the transposase 5 family were present [UniProt: P35072, P03934]. Also present was a contig sharing significant sequence similarity with the angiotensin-converting enzyme (ACE) [UniProt: Q50JE5]. ACE converts angiotensin I to angiotensin II, with a resultant increase in vasoconstrictor activity. Its presence here raises the possibility of a role in envenomation.
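The argument that normalization compresses abundance differences without changing rank order can be illustrated with a toy model. Library normalization is a biochemical procedure; the logarithmic transform used here is only an assumed stand-in for any strictly increasing dampening function, and the transcript names and copy numbers are hypothetical.

```python
import math


def dampen(copies: float) -> float:
    """Toy stand-in for normalization: compress abundance on a log scale."""
    return math.log10(copies + 1)


def rank_order(abundance: dict) -> list:
    """Transcript names sorted from most to least abundant."""
    return sorted(abundance, key=abundance.get, reverse=True)


def share(abundance: dict, name: str) -> float:
    """Fraction of the total abundance contributed by one transcript."""
    return abundance[name] / sum(abundance.values())


# Hypothetical copy numbers spanning several orders of magnitude.
before = {"tx_high": 100_000, "tx_mid": 1_000, "tx_low": 10, "tx_rare": 1}
after = {name: dampen(n) for name, n in before.items()}

print(rank_order(before) == rank_order(after))          # rank order is kept
print(share(before, "tx_rare"), share(after, "tx_rare"))  # rare share grows
```

Because the transform is strictly increasing, the most abundant transcript remains the most abundant, while the rare transcript accounts for a much larger fraction of the total, which is the property the study relies on when ranking contigs by read count.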
Conotoxin Gene Superfamilies
A pHMM was built based on the sequences of known A-superfamily conotoxins and used to search the C. victoriae venom gland transcriptome. This enabled the identification of a cDNA sequence encoding the peptide precursor of a novel A-superfamily conotoxin (Figure 1). This precursor shared obvious homology with other A-superfamily conotoxins, at least in its signal peptide sequence, although the sequence encoding the mature peptide is clearly novel. A_Vc22.1 is the first A-superfamily peptide to exhibit the type XXII cysteine framework (i.e. 8 cysteine residues separated by 7 loops: C-C-C-C-C-C-C-C). Several conotoxin precursor sequences with this framework have been identified in Conus californicus, although they share very little sequence similarity with A_Vc22.1, and do not belong to the A-superfamily. No conotoxin with framework XXII has been characterized to date and A_Vc22.1 offers an exciting prospect as a functionally novel conotoxin.
*, Vc1.2 precursor shown for comparison is in grey; Cys, yellow. Predicted signal peptides are underlined in purple and the predicted mature peptides are underlined in black, while that of Vc1.2 is underlined in grey. This color scheme is used in all subsequent figures.
Other A-superfamily peptide precursor sequences identified in the venom gland transcriptome of C. victoriae were those of Vc1.1 and Vc1.3 (Figure 1). Vc1.1 is a potent analgesic in neuropathic pain models and targets both the α9α10 nAChR and the γ-aminobutyric acid (GABA)B receptor, while Vc1.3, which was identified previously in embryonic C. victoriae, had little effect at either the nAChR subtypes tested or at the GABAB receptor. Vc1.1 is, to date, the only conotoxin from the venom of C. victoriae with a defined molecular target. The naming of conotoxin precursors is described in the Discussion.
Six unique I1-superfamily conotoxins were identified in the venom gland transcriptome of C. victoriae (Figure 2A). I1-superfamily conotoxins characterized so far display excitatory activity, some through subtype-specific modulation of voltage-gated Na+ channels. The predicted mature peptide sequence of I1_Vc11.5 shares 89% identity with an I1-superfamily conotoxin from Conus marmoreus (M11.2), while that of I1_Vc11.6 shares 82% identity with an I1-superfamily conotoxin from Conus episcopatus (Ep11.1). The remaining sequences I1_Vc11.1–4 do not show any notable similarity, other than their cysteine framework, to known sequences.
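Percent identity values such as those quoted above depend on how the two sequences are aligned and on which denominator is used. This section does not say how the authors computed them, so the sketch below shows one common convention (identical columns divided by aligned columns) under the assumption that the inputs are already aligned to equal length. The function name and toy sequences are hypothetical.

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percent of aligned columns holding identical residues.

    Both inputs must already be aligned to the same length, with '-'
    marking gaps. Columns in which both sequences have a gap are ignored.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    columns = [(a, b) for a, b in zip(seq_a, seq_b)
               if not (a == "-" and b == "-")]
    if not columns:
        raise ValueError("no aligned columns to compare")
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


# Hypothetical pre-aligned toy peptides differing at one of ten positions.
toy_a = "CKGAGCSTWC"
toy_b = "CKGAGCSSWC"
print(percent_identity(toy_a, toy_b))
```

Other conventions divide by the length of the shorter sequence or exclude all gapped columns, and these can give different values for the same pair.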
Four unique I2-superfamily conotoxins were identified (Figure 2B). They displayed the same precursor structure as those identified previously with a C-terminal propeptide region and a mature peptide region characterized by cysteine framework XI (C-C-CC-CC-C-C). All I2-superfamily conotoxins characterized so far (BtX, ViTx and sr11a) are K+ channel modulators. Of the sequences identified here, there is little similarity in the mature peptide regions to known sequences. One can only speculate that, like their counterparts, these peptides would share the ability to modulate K+ channels, although the lack of similarity presented in their mature peptide sequences makes it quite possible, as observed with other conotoxin superfamilies, that they display altered selectivity.
Four unique J-superfamily conotoxins were identified in the venom gland transcriptome of C. victoriae (Figure 3A). These sequences displayed only superficial similarity to known J-superfamily sequences (specifically cysteine framework). The only J-superfamily conotoxin characterized as yet, pl14a, was observed to have a potent inhibitory effect at both nicotinic acetylcholine receptors (α3β4-neuronal, α1β1εδ-neuromuscular) and a voltage-gated K+ channel subtype (Kv1.6). Given the low similarity between pl14a and the sequences identified here one can only speculate as to their activity. However, we note that the J-superfamily makes up a large proportion of the conotoxin mRNA transcripts observed in the venom gland of C. victoriae.
Several conotoxin sequences from each of the M1, M2 and conomarphin subgroups of the M-superfamily were identified (Figure 3B and C). M4 and M5 conotoxins are believed to be absent from mollusc-hunting Conus , and indeed were not identified in C. victoriae.
Almost all of the M-superfamily sequences identified in C. victoriae (M_Vc3.1–2, 4–10) were very similar if not identical to previously reported M-superfamily sequences. While the M4/5 branch of conotoxins is well characterized, there are limited published data describing the M1 and M2 branches. Of the M1/M2 conotoxins tested so far, the majority elicited excitatory symptoms upon intracranial (IC) injection in mice, while LtIIIA enhanced tetrodotoxin-sensitive Na+ currents in a whole-cell patch-clamp assay.
The M_conomarpin_Vc1 and M_conomarpin_Vc2 sequences clearly belong to the cysteine-free conomarphin class of conotoxins, although the predicted mature peptides of each differ substantially from previously identified conomarphins. M_Vc3, along with a sequence recently identified in C. marmoreus (Mr038) , presumably constitutes a new class of single disulfide-containing conotoxins.
The O1-superfamily of conopeptides consists of δ- (which block inactivation of voltage-gated Na+ channels), μ- (voltage-gated Na+ channel blockers), κ- (voltage-gated K+ channel blockers) and ω-conopeptides (voltage-gated Ca2+ channel blockers), all of which share a type VI/VII cysteine framework (C-C-CC-C-C).
Several O1-superfamily sequences have been identified previously in C. victoriae , . Surprisingly, while many O1-superfamily sequences were identified here (Figure 4), none matched exactly those identified previously. Minor variants of Vc6.1, Vc6.4 and Vc6.6 were present that displayed up to three differences each in their prepropeptide regions. As there was no change in the mature peptide sequence we have denoted these sequences as variants e.g. O1_Vc6.1ii. A sequence clearly similar to Vc6.2 was also evident (with minor variation); because some of this variation occurred in the predicted mature peptide region, however, this sequence was designated as unique (O1_Vc6.41). Three unique variants of Vc6.3 were present, none of which corresponded exactly to the original Vc6.3. Again the variation occurred in the prepropeptide region and the predicted mature peptide region remained unchanged.
The remaining O1-superfamily sequences identified were completely novel, although some showed similarity to known ω-, δ-, and μ-conotoxins. Notably, the predicted mature peptide sequence of O1_Vc6.31 was 90% identical to μ-MrVIB, an O1-superfamily conotoxin from C. marmoreus that is an inhibitor of the NaV1.8 subtype of voltage-gated Na+ channels with analgesic properties .
A single cysteine-free sequence (O1_Vc1) from the O1-superfamily may constitute a new class of conotoxin. Close inspection of the sequencing reads encoding this transcript (taking into account contig coverage and read quality) indicated that this unusual sequence was not simply the result of a frameshift due to sequencing error.
Eleven O2 conotoxin precursors were identified previously by cDNA sequencing of the C. victoriae venom gland and designated Vc6.7–17.
A pHMM was built based on the sequences of all known O2/contryphan-superfamily conotoxins and used to search the C. victoriae venom gland transcriptome. 18 unique O2-superfamily (cysteine framework VI/VII) and two contryphan conotoxins were identified (Figure 5A and B). Of the 16 O2-superfamily conotoxins identified with cysteine framework VI/VII, eight had been identified previously. A minor variant of Vc6.16 was also evident, with a single difference in the predicted mature peptide region (this sequence was therefore designated O2_Vc6.25). The predicted mature peptide sequence of O2_Vc6.22 was 81% identical to TxVIIA, a modulator of molluscan pacemaker channels (γ-conotoxin) .
Contryphans are short single disulfide-containing conotoxins that display a diversity of function but could generally be described as Ca2+ channel modulators , . Both of the contryphans identified share obvious homology, at least in their signal peptide sequence, to other contryphans, although the sequences encoding the mature peptides are clearly novel. Contryphan_Vc1 is the first contryphan peptide identified that exhibits an intercystine loop length other than five residues. Indeed, this peptide is remarkably different in its entire primary structure from any conotoxin previously characterized.
All contryphans identified so far have either Pro/Hyp followed by D-Trp or Val followed by D-Leu at positions one and two of the intercystine loop. Hyp (or Pro) at position 1 of the disulfide loop appears to be necessary for slow conformational interconversion observed in these peptides . The precursor cDNA sequence of contryphan_Vc2 indicates that this peptide has a Trp at position two (presumably D-Trp ) but is unique among contryphans in that it exhibits the positively-charged amino acid Arg at position one. Its sequence also differs from other known contryphans at positions 3 and 5 (Thr and Val, respectively). Further characterization of this peptide is likely to offer important information on the structure-activity relationship of contryphans.
Other than its propeptide sequence and single pair of cysteines, contryphan_Vc1 shares no obvious sequence similarity to contryphan_Vc2, or indeed any other contryphans.
One O3 superfamily precursor was identified in C. victoriae (Figure 6A). The signal peptide sequence indicated that this sequence was related to the O3-superfamily, although the pro- and mature peptide regions differed markedly from known O3-superfamily sequences, most notably in that it was devoid of cysteines, in contrast to all O3-superfamily conotoxins identified to date, which are cysteine-rich with framework VI/VII, e.g. the bromosleeper peptide .
Three P-superfamily precursor sequences, P_Vc9.1, P_Vc9.2 and P_Vc14.5, were identified in the venom gland transcriptome of C. victoriae (Figure 6B). While P_Vc9.1 and P_Vc9.2 display the type IX cysteine framework (C-C-C-C-C-C) consistent with previously identified P-superfamily conotoxins , , P_Vc14.5 displays a type XIV cysteine framework (C-C-C-C). Alignment of this sequence with the two type IX peptides indicates that the equivalent II–V and III–VI cysteine pairs are still present but the I–IV cysteine pair is absent.
The predicted mature peptide sequence of P_Vc9.2 is 96% identical to GmIXA, a conotoxin from the venom of Conus gloriamaris that induces hyperactivity and spasticity in mice following IC injection . Like the J-superfamily, the relatively uncharacterized P-superfamily appears to constitute a large proportion of conotoxin mRNA transcripts in the venom gland of C. victoriae.
The two S-superfamily conotoxins to have undergone pharmacological characterization displayed different activity: GVIIIA competitively inhibited the 5-HT3 serotonin receptor , while αS-RVIIIA inhibited nAChRs . A single S-superfamily precursor sequence, S_Vc8.1 was identified in the venom gland transcriptome of C. victoriae (Figure 6C). The peptide shared the same cysteine framework as previously identified S-superfamily conotoxins. The predicted mature peptide sequence of S_Vc8.1 shares 93% identity with that of tx8.1 from Conus textile .
The precursor sequences of 27 unique T-superfamily conotoxins were identified (Figure 7), making it not only the most abundant superfamily in C. victoriae, but also the most diverse. Three different cysteine frameworks (V, X and XIII) were identified.
Three of the 27 sequences had been identified previously in C. victoriae venom duct mRNA, while the predicted mature peptide sequences of two others, T_Vc5.7 and T_Vc13.1, had been identified previously in the venom of C. textile. The predicted mature peptide sequence of T_Vc13.1 was identical to TxXIIIA, a unique T-superfamily conotoxin identified in C. textile . This peptide is similar to the Type V framework (CC-CC) conotoxins, but contains an extra Cys (CC-CCC), and is found in the venom as a homodimer. The predicted mature peptide sequence of T_Vc5.7 was identical to TxVA, one of the most highly modified conotoxins, with γ-carboxyglutamate, hydroxyproline, bromotryptophan and glycosylation , . This conotoxin induces hyperactivity and spasticity in mice following IC injection, and may target a pre-synaptic Ca2+ channel or GPCR. One T-superfamily sequence identified in C. victoriae venom gland mRNA in a previous study , Vc5.4 (Vc5c), was not identified here, although a very similar sequence (T_Vc5.12) was present. T_Vc10.1 shares obvious homology with known χ-conotoxins (inhibitors of the noradrenaline transporter), in both its T-superfamily signal peptide and mature peptide sequences.
Despite evidence that the T-superfamily is abundant, not only in C. victoriae but in other species of Conus as well, remarkably little is known about this group of conotoxins.
A pHMM was constructed based on the sequences of known conantokin precursors and was used to search the C. victoriae venom gland transcriptome. This search yielded a single conantokin transcript (Figure 8A). An almost identical sequence (only three changes in the predicted prepropeptide region) has been reported in another molluscivorous species, C. gloriamaris (Con-Gm) . The mature form of Con-Gm is reportedly 19 amino acids in length, with residues Glu4, Glu10 and Glu14 being modified to γ-carboxyglutamate and the C-terminus being amidated.
*, Con-Gm, G56, con-ikot-ikot, p21a, Conodipine-M and B2-superfamily sequences from C. litteratus and C. consors are shown for comparison. The conodipine catalytic His-Asp dyad is boxed in red.
The original con-ikot-ikot was identified and characterized from the venom of Conus striatus. Uniquely among conotoxins, it displayed an effect on α-amino-3-hydroxy-5-methyl-4-isoxazolepropionic acid (AMPA) receptors, inhibiting channel desensitization. Con-ikot-ikot is a relatively large conotoxin with 13 cysteine residues, where the active form is a dimer of covalent dimers.
A recently discovered conotoxin isolated from the venom of Conus purpurascens, p21a, showed 48% homology with con-ikot-ikot . p21a defined a new 10-cysteine, 7-loop framework (XXI), a similar cysteine arrangement to con-ikot-ikot. Unlike con-ikot-ikot, however, this conotoxin has been proposed to form a non-covalent dimer. Multiple con-ikot-ikot precursor sequences were also recently identified in the venom gland transcriptome of Conus geographus , three of which shared framework XXI with p21a, and two displayed the original con-ikot-ikot framework.
Here we show that con-ikot-ikots are not limited to the fish-hunting species described above. A con-ikot-ikot precursor sequence was identified in C. victoriae (Figure 8B). This sequence displayed the same cysteine framework (XXI) as p21a.
Secretory phospholipase-A2s (sPLA2s) have been reported in a wide variety of animal venoms, as well as mammalian tissues and bacteria. They catalyze the hydrolysis of the ester bond at the sn-2 position of 1,2-diacyl-sn-phosphoglycerides. In addition to enzymatic activity some of these venom PLA2s display potent neurotoxicity.
Conodipine-M, a 13.6 kDa component of the venom of C. magus , was until now the only phospholipase characterized from Conus venom, although various conodipine isoforms are reportedly present in the venom gland transcriptome of Conus consors . Its sequence was partially characterized and differed from most other conotoxins in that it was present as a heterodimer of two polypeptide chains, an α- and a β-chain. Conodipine-M displayed sPLA2 activity and like other sPLA2s, required Ca2+ as a cofactor . Its sequence, while retaining key catalytic motifs present in other sPLA2s, shared little sequence identity with other sPLA2s and therefore defined a new group (IX) of enzymes.
Here we show that conodipines, like other sPLA2s, are encoded by a single precursor consisting of a signal peptide sequence followed by the α-chain, a propeptide linker and finally the β-chain (Figure 8C).
Two of the precursors identified display remarkable similarity in their predicted mature peptide region to conodipine-M, including their cysteine framework and catalytic His-Asp dyad. The remaining sequence retains the general precursor structure of conodipine_Vc1 and 2 and the predicted catalytic dyad, but displays not only a unique signal peptide sequence but also a unique cysteine framework. Given its unique signal peptide sequence, this conotoxin could be considered the first member of a new superfamily.
New or Recently Identified Conotoxin Superfamilies
In a previous study, several linear peptides identified in the venom proteome of C. consors were matched to a sequence in the transcriptome that did not correspond to a known conotoxin superfamily . Interestingly, a similar sequence (UniProt Q2HZ30) had been identified at high frequency in a Conus litteratus venom gland cDNA library . Although the function of the peptide products of these sequences remains unknown, the authors proposed that these sequences may constitute an as yet undescribed conotoxin superfamily. Recently, a similar sequence was identified in the venom gland transcriptome of C. marmoreus and subsequently designated as the B2-superfamily .
Based on alignment of two known B2-superfamily precursor sequences from C. litteratus and C. consors, a pHMM was built and used to search the transcriptome of C. victoriae, as well as the transcriptomes of Conus bullatus  and C. geographus . Each species yielded a single B2-superfamily precursor sequence displaying remarkable similarity to those from C. consors and C. litteratus (Figure 8D). As observed in C. litteratus, B2_Vc1 is observed at high frequency in the venom gland transcriptome of C. victoriae.
E- and F-superfamilies.
The E- and F-superfamilies of conotoxins were recently described from the venom gland transcriptome of C. marmoreus , with each superfamily consisting at present of a single sequence. The peptide product of the only E-superfamily precursor so far identified (Mr104), is 26 amino acids in length, with four cysteines (two disulfide bonds) and a bromotryptophan. A peptide product was also identified for the F-superfamily precursor (Mr105). This short linear peptide was derived from the predicted propeptide sequence.
pHMMs were constructed based on each of the known precursor sequences and used to search the C. victoriae venom gland transcriptome for E- and F-superfamily conotoxins. As with C. marmoreus, single transcripts for each of the E- and F-superfamilies were present in C. victoriae (Figure 9A and B), which showed remarkable similarity to those present in C. marmoreus (Mr104 and Mr105). The venom gland transcriptomes of C. bullatus and C. geographus were also searched, using the same method, for E- and F-superfamily conotoxins, although none were identified in these species.
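The idea of building a model from known precursor sequences and using it to score candidates can be sketched with a position-specific scoring matrix. This is a deliberately simplified relative of a profile HMM: it has no insert or delete states and assumes ungapped sequences of equal length, so it is not the method used in the study. The alignment, query sequences and function names are hypothetical.

```python
import math
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def build_profile(aligned_seqs, pseudocount=1.0):
    """Per-column log2-odds scores against a uniform background."""
    length = len(aligned_seqs[0])
    if any(len(s) != length for s in aligned_seqs):
        raise ValueError("sequences must share one length")
    background = 1.0 / len(AMINO_ACIDS)
    total = len(aligned_seqs) + pseudocount * len(AMINO_ACIDS)
    profile = []
    for col in range(length):
        counts = Counter(seq[col] for seq in aligned_seqs)
        profile.append({
            aa: math.log2(((counts[aa] + pseudocount) / total) / background)
            for aa in AMINO_ACIDS
        })
    return profile


def best_hit(profile, protein):
    """Slide the profile along a protein; return (best score, start index)."""
    width = len(profile)
    scores = [
        (sum(col.get(aa, 0.0)
             for col, aa in zip(profile, protein[i:i + width])), i)
        for i in range(len(protein) - width + 1)
    ]
    return max(scores) if scores else (float("-inf"), -1)


# Hypothetical toy "signal peptide" alignment and query sequences.
toy_family = ["MKLTCVLIV", "MKLTCMLIV", "MKLTCVVIV"]
profile = build_profile(toy_family)
member_score, member_start = best_hit(profile, "MKLTCVLIVAAGR")
other_score, _ = best_hit(profile, "GSDEQNPARHGSD")
print(member_score, member_start, other_score)
```

A sequence resembling the family scores well above zero at the matching position, while an unrelated sequence scores below zero. A genuine pHMM additionally models insertions and deletions and reports statistical significance, which matters when signal peptides vary in length.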
The precursor sequences of several novel conotoxins clearly belonged to the recently discovered H-superfamily of conotoxins from C. marmoreus  (Figure 9C). Superficially, the cysteine pattern observed in H_Vc7.1 and H_Vc7.2 is identical to that of the O1- and O2-superfamilies. However, closer comparison reveals that there is little similarity in either the intercysteine loop composition or length . The hitherto uncharacterized H-superfamily constitutes a large proportion of conotoxin mRNA transcripts in the venom gland of C. victoriae.
A single H-superfamily sequence encoding a cysteine-free predicted mature peptide region was also encountered (H_Vc1), indicating that, like other superfamilies, the H-superfamily is not limited to a single cysteine framework. This unusual sequence probably constitutes a new class of conotoxin. As described above for O1_Vc1, a close inspection of the sequencing reads was performed to confirm that this unusual sequence was not simply the result of a frameshift due to sequencing error.
A recently described third I-superfamily (I3) (Figure 2D) was searched for but not identified in the venom gland of C. victoriae. However, during the process of designing and building each I-superfamily pHMM, it became apparent that a fourth, unrecognized, superfamily of conotoxins was presently grouped into the I2-superfamily. These sequences included Gla-TxX from C. textile and Gla-MrII from C. marmoreus, the mature peptides of which are 47 and 50 residues, respectively, each with 5 γ-carboxyglutamate modifications. Not only do these conotoxins have a clearly distinct signal peptide sequence but they also exhibit a distinct cysteine framework, XII (C-C-C-C-CC-C-C), compared to other I-superfamily conotoxins. This disparity has been noted previously, and it was proposed that this group of peptides be redefined as ‘E-conotoxins’. As an E-superfamily has since been described, and given the similarity of these conotoxins to other I-superfamilies, we propose a new I4-superfamily, which would include, among others, Gla-TxX, Gla-MrII and the sequence identified in C. victoriae described below.
Construction of a pHMM based on these sequences enabled the identification of a single I4-superfamily member in the venom gland transcriptome of C. victoriae (Figure 2C). The predicted mature peptide sequence of this peptide was 92% identical to Gla-TxX. I4_Vc12.1 shares the glutamate sites of Gla-TxX, so is probably present in the venom in a similarly modified form.
Annotation of the C. victoriae venom gland transcriptome with BLAST+ identified two sequences with homology to the “textile convulsant peptide” isolated two decades ago from the venom of C. textile (Figure 9D). The textile convulsant peptide, on IC injection in mice, induces symptoms characterized by “sudden jumping activity followed by convulsions, stretching of limbs and jerking behavior”. The authors noted that this peptide was unique and predicted that it belonged to a new undefined class of conotoxins. In this study we have identified the precursor sequence of two similar conotoxins from C. victoriae, and shown that they are indeed members of a previously undefined conotoxin superfamily, which we have designated the U-superfamily.
Although the pre- and propeptide sequences clearly differ from known conotoxin superfamilies, the U-superfamily peptides share the cysteine framework (VI/VII) of most members of the O1-, O2- and O3-superfamilies, as well as the H-superfamily. However, on comparison with these superfamilies it is apparent that there is little similarity either in the intercysteine loop composition or length. For instance, loop 1 of the U-superfamily peptides is relatively short at two residues, compared with six in the O-superfamily conotoxins.
Discovery of the signal peptide sequence for this superfamily should allow the rapid identification of U-superfamily conopeptides in other Conus species. With this in mind, we searched transcriptome databases of both C. geographus and C. bullatus. This search did not yield any hits, suggesting that this superfamily is not present (at least in high abundance) in the fish-hunting cone snails C. geographus and C. bullatus.
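The loop-length comparison above can be made concrete with a short sketch that counts the residues between consecutive cysteines of a mature peptide. The function name is hypothetical, and the two sequences are invented framework VI/VII (C-C-CC-C-C) examples used purely for illustration; they are not sequences from this study.

```python
def loop_lengths(mature_peptide: str) -> list[int]:
    """Number of residues between each pair of consecutive cysteines."""
    positions = [i for i, residue in enumerate(mature_peptide.upper())
                 if residue == "C"]
    return [later - earlier - 1
            for earlier, later in zip(positions, positions[1:])]


# Invented toy peptides with a C-C-CC-C-C arrangement (not real conotoxins).
short_loop1 = "CAACGGGCCSSCTTTTC"      # two residues in loop 1
long_loop1 = "CAAAAAACGGGCCSSCTTTTC"   # six residues in loop 1

print(loop_lengths(short_loop1))
print(loop_lengths(long_loop1))
```

The zero in the third position of each result marks the adjacent CC pair, so two peptides can share a framework while differing in every non-zero loop length.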
Given the sequence similarity in the mature peptide sequences of U_Vc7.3 and 7.4 to the textile convulsant peptide, it is likely that they share similar biological activity. Despite its potent biological activity, the molecular target of the textile convulsant peptide has not been identified.
While the venoms of Conus species have been rigorously investigated, those of other venomous snails remain largely unstudied. A recent investigation of the venomous Auger snail Hastula hectica revealed several venom peptides (termed augerpeptides) similar to those found in Conus venom as well as various venom gland transcripts apparently encoding other venom peptides. Of the few augerpeptides identified, no overlap with conotoxins has so far been reported.
Annotation of the venom gland transcriptome of C. victoriae with BLAST facilitated the identification of a contig with significant similarity to the augerpeptide hhe53 (Figure 10), a 38-residue peptide with two disulfide bonds, predicted from cDNA sequencing of the venom gland of the Auger snail Hastula hectica. In fact, the reported amino acid sequence of hhe53 was 100% identical to a translated region in an open-reading frame of the C. victoriae transcript. Investigation of the C. victoriae transcript revealed a stop codon in the expected position following the predicted mature peptide region as well as an Arg residue immediately 5′ to the predicted mature peptide region, indicating a possible cleavage site. However, neither an obvious signal peptide nor translation initiation codon was evident in the same open-reading frame (frame 1). The assembled contig did not suffer from low coverage (69 reads), implying that the absence of a signal peptide was not the result of a simple frameshift caused by sequencing error. We did observe, however, the presence of a possible partial signal peptide with an initiation codon in a separate reading frame (frame 2), immediately 5′ to the predicted mature peptide. We have observed elsewhere in other conotoxin sequences a naturally occurring missing propeptide region (presumably a separate exon) causing the obvious signal peptide and mature peptide regions to appear in different reading frames when translated (unpublished observation). Without a reference precursor sequence, however, it is not possible to confirm that this is the explanation for the result observed here. It remains a possibility that this presumably inactive sequence results from a polymorphism in the individual from which the mRNA was collected and that in other individuals this transcript may encode the functional peptide. The functional relevance of this sequence in C. victoriae therefore remains open to speculation, but the observation of an overlapping sequence in venom gland transcripts between H. hectica and C. victoriae does seem a striking coincidence.
Figure 10. Possible initiator codon in frame 2 is underlined in purple and the sequence encoding the predicted mature peptide in frame 1 is underlined in black.
To give a general indication of the relative expression levels of each conotoxin superfamily in the venom gland of C. victoriae, reads encoding each conotoxin superfamily are presented in Figure 11. It is important to keep in mind that, owing to normalization, transcripts of high abundance may be under-represented and this chart should only be used as a general indicator.
Figure 11. High abundance reads may be under-represented as a result of cDNA library normalization.
Known superfamilies searched for but not identified in the venom gland transcriptome of C. victoriae included the C, D, G, I3, K, L, N, V, Y and conopressin superfamilies. Most of these superfamilies are described from a single species or narrow range of species and it is therefore not surprising that they were not identified here in C. victoriae. One exception is the conopressin superfamily, identified in a number of species including the closely related C. textile, but not identified here.
The traditional approach for venom peptide identification has been assay-directed fractionation, followed by isolation and peptide sequencing. This approach is labour-intensive and requires a large amount of venom, which is not always available. The use of targeted PCR amplification of venom duct cDNA increased the speed at which venom peptides could be identified and also reduced the amount of starting material required. Similarly, large-scale cloning of cDNA libraries and Sanger sequencing has also been performed and has successfully generated a large number of novel peptide sequences, but is relatively expensive. The recent advent of high-throughput ‘next generation’ sequencing technologies has facilitated larger, more rapid and cost-effective identification of novel venom peptides and proteins through the sequencing of venom gland transcriptomes. The potential of this approach has been recognized and applied recently to the venom gland transcriptomes of several species of Conus. Of the next generation sequencing platforms available, our use of 454 sequencing technology was motivated by the current superior read length generated compared to other technologies.
One trade-off, however, with this technology is the higher error rate in homopolymer runs (compared with other sequencing platforms). Such errors can result in insertions or deletions, which can introduce frameshifts or amino acid changes in the resulting sequences. For this reason reporting of 454 reads prior to assembly is risky. Higher sequence coverage provided by the assembly process works to reduce sequencing errors, producing more reliable sequences and reducing the likelihood of reporting minor variants and unusual sequences that are simply the result of sequencing error. De novo transcriptome assembly, however, can be a challenging task. In the assembly of the C. victoriae venom gland transcriptome there was evidence, particularly for the more abundant conotoxin superfamilies, that multiple contigs encoding the same transcript were generated by the assembler. In some cases this was caused by a substitution error, while others were the result of frameshifts (usually in regions of low coverage). This was also reported for the assembly of the C. geographus venom gland transcriptome. Clustering of contigs could potentially reduce this problem, but we deemed that it was not appropriate here. A high frequency of minor variations occurs naturally in the genes encoding conotoxins (and indeed venom peptides in general) and the process of clustering is likely to mask any naturally occurring minor variations. Indeed, even without clustering, some contigs in this study were the product of two clearly unique minor variants that had been clustered by the assembler. It was necessary to perform a thorough manual examination of the contigs corresponding to each precursor sequence presented here. This was especially important for some of the minor variants and more unusual reported sequences to ensure that these were not the result of sequencing error.
Researchers employing the methods described herein need to be aware of the complications associated with read error and transcriptome assembly and therefore be rigorous in their examination of, and conservative in their reporting of, unusual sequences or minor sequence variants.
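One simple aid to the manual examination described above is to flag homopolymer runs in a contig, since these are the positions where 454 insertion or deletion errors are most likely to introduce a frameshift. The sketch below is illustrative only: the function name and the default threshold of four bases are assumptions, not parameters used in this study.

```python
import re


def homopolymer_runs(sequence: str, min_length: int = 4):
    """Return (start, base, run_length) for each run of identical bases.

    Only runs of at least min_length bases are reported; the default
    threshold is an arbitrary illustrative choice.
    """
    # One captured base followed by at least (min_length - 1) repeats of it.
    pattern = re.compile(r"([ACGT])\1{%d,}" % (min_length - 1))
    return [(match.start(), match.group(1), len(match.group(0)))
            for match in pattern.finditer(sequence.upper())]


# Toy nucleotide sequence (hypothetical), with one run of five T residues.
print(homopolymer_runs("ACGTTTTTAGGG"))
```

Positions reported in this way indicate only where closer inspection of the underlying reads is warranted; they do not by themselves show that an error has occurred.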
Recently, it was demonstrated that pHMMs can be used to classify conotoxins and proposed that the use of pHMMs was a highly suitable approach for identifying conotoxin sequences in large datasets (e.g. transcriptomes). Here we employed pHMM searches for a more detailed investigation of the conotoxin gene superfamilies present in the venom gland transcriptome of C. victoriae and describe the highest diversity of conotoxins so far reported in a single study. While a number of variables could potentially contribute to this result, a comparison with a recent study performed in a similar manner but with a non-normalized cDNA library suggests that our cDNA library normalization has played a major part. Hu et al. investigated the venom gland transcriptome of C. geographus, reporting the identification of 63 unique conotoxin sequences from a dataset of 791,971 sequencing reads. From a similar dataset, in terms of total read number and average length, we report almost twice as many unique conotoxin sequences. Conotoxin sequences dominated the C. geographus dataset, constituting 88% of the total sequencing reads with over 250,000 of these reads encoding just three conotoxins. In our study, only 22% of the total sequencing reads encoded conotoxins, with the most abundant conotoxin, Vc5.1, comprising only 3,405 sequencing reads. In sacrificing coverage of some of our more abundant conotoxins we improved our ability to identify rarer conotoxins. Indeed, several conotoxin contigs were assembled from as few as two reads, and without a normalized cDNA library these would not have been identified. Thus, cDNA library normalization appears to be an effective strategy to maximize the identification of unique venom components.
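The degree to which a few transcripts dominate a non-normalized library can be worked through from the C. geographus figures quoted above. The short calculation below uses only those figures; because the three most abundant conotoxins are reported as "over 250,000" reads, the resulting percentages should be read as lower bounds. The variable names are illustrative.

```python
# Figures quoted in the text for the C. geographus dataset.
total_reads = 791_971
conotoxin_fraction = 0.88     # share of all reads encoding conotoxins
top_three_reads = 250_000     # "over 250,000", so treated as a lower bound

conotoxin_reads = total_reads * conotoxin_fraction
share_of_all = 100 * top_three_reads / total_reads
share_of_conotoxin = 100 * top_three_reads / conotoxin_reads

print(f"Top three conotoxins: at least {share_of_all:.1f}% of all reads")
print(f"Top three conotoxins: at least {share_of_conotoxin:.1f}% of conotoxin reads")
```

On these figures, roughly a third of the sequencing effort in the non-normalized library was spent on three transcripts, which is consistent with the argument that normalization frees reads for rarer conotoxins.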
Most of the conotoxins identified here display little amino acid sequence similarity to conotoxins with a defined molecular target. Moreover, several sequences define new classes of conotoxins and seem likely to display novel activity profiles. While each of the conotoxin precursor sequences described here is unique, several appear to encode mature peptides that are similar, if not identical, to known conotoxins (Table 1). Even subtle differences, however, in a conotoxin’s primary structure can have a dramatic effect on its function, and in most cases this is likely to be reflected in different functionality (possibly subtype selectivity or even molecular target). There seems little doubt that this library of conotoxin sequences holds a diversity of as yet undescribed functions.
The naming of conotoxin precursors identified in this study was undertaken according to the conventional conotoxin nomenclature (where species is represented by one or two letters, cysteine framework by an Arabic numeral and, following a decimal, order of discovery by a second numeral), with slight modifications. For previously identified conotoxin precursors the names were not altered in any way. For novel sequences we have chosen to include the superfamily as a prefix. cDNA sequencing is now the primary method for conotoxin identification, and without information on a conotoxin’s function (or even cysteine framework) the gene superfamily is becoming increasingly important for conotoxin classification. Moreover, we have made no distinction between ‘cysteine-poor’ and ‘cysteine-rich’ sequences, as this division is now considered to be largely redundant. In the O1-superfamily several precursors were identified that differed in their prepropeptide but not in their mature predicted peptide regions, such that there would presumably be no difference in the peptide products of these precursors. These sequences were given the same name but a small Roman numeral was added as a suffix to denote the minor variations. We suggest that the slight modifications applied here to the conventional conotoxin naming scheme should assist in the naming of new sequences identified by transcriptomic studies.
Two of the conotoxins identified here (A_Vc22.1 and P_Vc14.5) displayed cysteine frameworks not previously associated with their particular superfamily. In the case of P_Vc14.5, comparison with the primary structures of framework IX P-superfamily conotoxins suggests that this change may only be subtle. However, A_Vc22.1 is not at all similar to other A-superfamily conotoxins and could therefore be expected to display a unique activity profile. Cysteine-poor conotoxins were identified in several of the traditionally cysteine-rich superfamilies (M, O1, O2, O3, and H). Other than the conomarphins and contryphans, these sequences probably represent new conotoxin classes. A con-ikot-ikot conotoxin, previously limited to piscivorous species of Conus, was identified here in C. victoriae. Additionally, a conantokin sequence was identified, providing more evidence that this superfamily is also not limited to piscivorous species of Conus.
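The naming scheme described above lends itself to mechanical parsing, which may be useful when handling large sequence libraries. The following is a minimal sketch; the regular expression, the function name and the exact form of the Roman numeral suffix (assumed here to be lowercase letters appended directly to the name) are assumptions made for illustration, and the suffixed example name is hypothetical.

```python
import re

# Optional superfamily prefix, species letters, framework, order, optional suffix.
NAME_PATTERN = re.compile(
    r"^(?:(?P<superfamily>[A-Z][A-Za-z0-9]*)_)?"  # e.g. "I4_" (novel sequences)
    r"(?P<species>[A-Z][a-z]?)"                   # one or two letters, e.g. "Vc"
    r"(?P<framework>\d+)\."                       # cysteine framework numeral
    r"(?P<order>\d+)"                             # order of discovery
    r"(?P<variant>[ivx]+)?$"                      # assumed lowercase Roman suffix
)


def parse_name(name: str) -> dict:
    """Split a conotoxin precursor name into its components."""
    match = NAME_PATTERN.match(name)
    if match is None:
        raise ValueError(f"unrecognized conotoxin name: {name!r}")
    fields = match.groupdict()
    fields["framework"] = int(fields["framework"])
    fields["order"] = int(fields["order"])
    return fields


print(parse_name("I4_Vc12.1"))   # novel sequence with superfamily prefix
print(parse_name("Vc5.1"))       # previously identified, name unaltered
print(parse_name("O1_Vc6.1ii"))  # hypothetical name with a variant suffix
```

Names without a prefix parse with the superfamily field set to None, which mirrors the decision to leave previously identified precursors unaltered.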
Several of the relatively uncharacterized conotoxin superfamilies were observed at high abundance in the venom gland transcriptome of C. victoriae (H, J, P and B2). This suggests that they are key components of the venom repertoire of this species and thus warrant further investigation of their functional properties.
The goal of future studies utilizing the information presented here will be the functional characterization of the peptide products of new conotoxin sequences. The first step will be to determine the mature peptide(s) corresponding to each precursor sequence. While many mature peptide sequences and post-translational modifications can be predicted directly from a precursor sequence, some will require a more thorough examination of the venom of C. victoriae by tandem mass spectrometry (MS/MS) matching. To this end, the library generated here can be used as a query database for MS/MS matching against the venom of C. victoriae, as demonstrated recently in other Conus species. MS/MS matching will confirm mature peptide sequences and the presence of post-translational modifications. The prediction of disulfide connectivity from conotoxin precursor sequences is notoriously difficult, and in most cases requires experimental determination. The improvement of methods for the rapid and efficient determination of a peptide’s (or protein’s) disulfide connectivity remains an active area of research.
Given the history of the small number of conotoxins so far characterized, we predict that components discovered in this work have the potential to become valuable research tools, if not drug leads or therapeutics. This study illustrates the arsenal of molecular weapons present in the venom gland of a single species of cone snail. Furthermore, it highlights the wonderful molecular resource that is animal venom.
Materials and Methods
Specimen Collection and RNA Extraction
Specimens of C. victoriae were collected from Broome, Western Australia. Whole venom glands of live specimens were dissected, snap-frozen in liquid nitrogen and stored at -80°C. Frozen venom glands were pulverized and homogenized using an MM 400 mixer mill (Retsch). Total RNA was extracted with Trizol (Invitrogen, Life Technologies). Total RNA integrity, quantity and purity were determined by capillary electrophoresis using a Bioanalyzer 2100 with the RNA 6000 Nano assay kit (Agilent Technologies).
cDNA Library Preparation and Sequencing
cDNA library preparation, normalization and sequencing were performed by Eurofins, MWG Operon (Budendorf, GER). From the total RNA sample, poly(A)+ RNA was isolated and used for cDNA synthesis. An N6 randomized primer was used for first strand cDNA synthesis. 454 adapters A and B were then ligated to the 5′ and 3′ ends of the cDNA, respectively. The cDNA was finally amplified by PCR (11 cycles).
Normalization was carried out by one cycle of denaturation and re-association of the cDNA. Re-associated double-stranded cDNA was separated from the remaining single stranded-cDNA (normalized cDNA) by passing the mixture over a hydroxylapatite column. After hydroxylapatite chromatography, the single-stranded cDNA was PCR amplified (8 cycles). cDNA in the size range of 500–1100 nt was eluted from a preparative agarose gel for sequencing. 454 sequencing was performed using GS FLX+ chemistry.
During the assembly process, single reads are aligned with each other to form contigs (contiguous consensus sequences). All reads were initially trimmed to remove primer and barcode sequences. Reads were then cleaned using prinseq-lite-0.17.1. De novo transcriptome assembly was performed using the following settings in MIRA3: mira -job=denovo,est,accurate,454 454_SETTINGS -CO:fnicpst COMMON_SETTINGS -GE:not=6 -AS:nop=4:sep=1 -CL:ascdc=1 454_SETTINGS -LR:lsd=1:ft=fastq -AS:mrl=30 -CL:cpat=1. Based on a recent comparison of 454 assembly methods, MIRA and Newbler were identified as the leading de novo transcriptome assemblers, with MIRA being more conservative about merging reads into contigs. To avoid over-assembly in the first instance, in order to identify as many alleles and paralogues as possible, we selected MIRA as our assembler. A database of open reading frames longer than 40 amino acids was generated from the transcriptome assembly. This database was used for subsequent pHMM searches.
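The generation of an open reading frame database from assembled contigs can be sketched as a six-frame translation followed by a length filter. The code below is an illustration under stated assumptions rather than the procedure used in this study: it treats every stop-to-stop segment as an open reading frame (no initiation codon is required), uses the standard genetic code, and all function names are hypothetical.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(sequence: str) -> str:
    return sequence.translate(COMPLEMENT)[::-1]


def translate(sequence: str) -> str:
    """Translate complete codons; ambiguous codons become 'X'."""
    return "".join(CODON_TABLE.get(sequence[i:i + 3], "X")
                   for i in range(0, len(sequence) - 2, 3))


def find_orfs(contig: str, min_length: int = 40) -> list[str]:
    """Stop-to-stop segments longer than min_length residues, all six frames."""
    contig = contig.upper()
    orfs = []
    for strand in (contig, reverse_complement(contig)):
        for frame in range(3):
            protein = translate(strand[frame:])
            orfs.extend(segment for segment in protein.split("*")
                        if len(segment) > min_length)
    return orfs


# Toy contig (hypothetical): 45 alanine codons followed by a stop codon.
toy_contig = "GCT" * 45 + "TAA"
print(len(find_orfs(toy_contig)))
```

Not requiring an initiation codon retains partial transcripts whose 5′ end is missing from the contig, at the cost of also retaining segments that are never translated; those are expected to be discarded when they fail to match any pHMM.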
Transcriptome Annotation with BLAST+
For a general annotation of the transcriptome we utilized BLAST+ (version 2.2.27+). Reference databases were constructed from the current UniProt/Swiss-Prot database (release 2012_09) and the non-redundant ConoServer database. Each contig from the assembled transcriptome was aligned to the two databases using BLASTX (E-value cutoff: 10⁻³) and the combined best hit used. Ties were resolved by taking the ConoServer hit preferentially.
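The selection of a combined best hit with preferential tie-breaking can be expressed compactly. The sketch below assumes that "best" means the lowest E-value and that the cutoff is inclusive; neither detail is stated above. The Hit type, the function name and the example accessions are hypothetical.

```python
from typing import NamedTuple, Optional


class Hit(NamedTuple):
    database: str   # "ConoServer" or "UniProt"
    subject: str    # identifier of the matched reference sequence
    evalue: float


def best_hit(hits: list[Hit], evalue_cutoff: float = 1e-3,
             preferred: str = "ConoServer") -> Optional[Hit]:
    """Lowest E-value hit across both databases; ties go to `preferred`."""
    passing = [hit for hit in hits if hit.evalue <= evalue_cutoff]
    if not passing:
        return None
    # False sorts before True, so a preferred-database hit wins a tie.
    return min(passing, key=lambda hit: (hit.evalue, hit.database != preferred))


# Hypothetical hits for a single contig, with equal E-values.
hits = [Hit("UniProt", "P00001", 1e-10), Hit("ConoServer", "C00001", 1e-10)]
print(best_hit(hits))
```

Ranking by bit score instead of E-value would only require changing the sort key; the tie-breaking logic stays the same.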
Conotoxin Gene Superfamily Annotation with pHMMs
All conotoxin sequences available from ConoServer were downloaded and grouped according to superfamily (classification provided by ConoServer). Any identical sequences were removed. Full-length precursor sequences were used where available, but for superfamilies with less sequence information all available sequences were used.
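The grouping and de-duplication step can be illustrated as follows. This sketch assumes that identical sequences are removed within each superfamily and that sequences are compared as exact, case-insensitive strings; the function name and the toy records are hypothetical.

```python
def group_unique(records):
    """Group (superfamily, sequence) pairs, dropping exact duplicates.

    Dictionary keys preserve first-seen order, so the output order is stable.
    """
    groups = {}
    for superfamily, sequence in records:
        groups.setdefault(superfamily, {})[sequence.upper()] = None
    return {superfamily: list(sequences)
            for superfamily, sequences in groups.items()}


# Toy records (hypothetical sequences, not taken from ConoServer).
records = [
    ("A", "MGMRMMFTVFLLVVLATTVVS"),
    ("A", "MGMRMMFTVFLLVVLATTVVS"),   # exact duplicate, removed
    ("A", "MGMRMMFTVFLLVVLATTVVP"),
    ("T", "MRCLPVFVILLLLIASAPS"),
]
print(group_unique(records))
```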
Using the hmmbuild tool from the HMMER 3.0 package, a single pHMM was built for each superfamily. The hmmsearch tool was then applied to the C. victoriae venom gland transcriptome database of open reading frames.
All sequence alignments were performed with MAFFT version 7 using the L-INS-i method. Signal peptide sequences were determined using the SignalP 4.1 server. Mature peptide regions were predicted based on similarity to related conotoxin sequences.
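The build-then-search workflow for a single superfamily might be scripted along the following lines. This is a sketch only: the file names are hypothetical, the use of the --tblout option to save a tabular hit list is an illustrative choice rather than a setting reported here, and the commands are constructed but not executed so that the example does not depend on HMMER being installed.

```python
import shlex


def hmmer_commands(superfamily: str, alignment_file: str, orf_fasta: str):
    """Build hmmbuild and hmmsearch command lines for one superfamily.

    hmmbuild takes <hmmfile_out> <msafile>; hmmsearch takes <hmmfile> <seqdb>.
    """
    hmm_file = f"{superfamily}.hmm"        # hypothetical output name
    table_file = f"{superfamily}.tbl"      # hypothetical per-sequence hit table
    build = ["hmmbuild", hmm_file, alignment_file]
    search = ["hmmsearch", "--tblout", table_file, hmm_file, orf_fasta]
    return build, search


build, search = hmmer_commands("I4", "I4_alignment.fasta", "Vc_orfs.fasta")
print(shlex.join(build))
print(shlex.join(search))
# To run them: subprocess.run(build, check=True), then the same for search.
```

Repeating this for each superfamily alignment yields one hit table per pHMM, which can then be examined manually as described above.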
Availability of Supporting Data
Conotoxin prepropeptide sequences from this Transcriptome Shotgun Assembly project have been deposited at DDBJ/EMBL/GenBank [accession: GAIH00000000]. The version described in this paper is the first version, GAIH01000000. Raw sequencing data has been deposited in the NCBI sequence read archive [SRA accession: SRR833564].
Specimens of Conus victoriae were collected specifically for research use, under a commercial fishing license of the Western Australian Specimen Shell Managed Fishery (license number 2577). Ethics approval was not required, in Australia, for taking samples from Conus.
We thank Johan Pas for specimen collection and Dr Shayne Bellingham for technical assistance with RNA quantification.
Conceived and designed the experiments: SDR HSH AWP RSN. Performed the experiments: SDR HSH LDM ATP. Analyzed the data: SDR. Contributed reagents/materials/analysis tools: HSH AWP RSN ATP. Wrote the paper: SDR.
- 1. Norton RS, Olivera BM (2006) Conotoxins down under. Toxicon 48: 780–798. doi: 10.1016/j.toxicon.2006.07.022
- 2. Lewis RJ, Dutertre S, Vetter I, Christie MJ (2012) Conus venom peptide pharmacology. Pharmacological Reviews 64: 259–298. doi: 10.1124/pr.111.005322
- 3. Miljanich GP (2004) Ziconotide: neuronal calcium channel blocker for treating severe chronic pain. Current Medicinal Chemistry 11: 3029–3040. doi: 10.2174/0929867043363884
- 4. King GF (2011) Venoms as a platform for human drugs: translating toxins into therapeutics. Expert Opinion on Biological Therapy 11: 1469–1484. doi: 10.1517/14712598.2011.621940
- 5. Hu H, Bandyopadhyay P, Olivera B, Yandell M (2012) Elucidation of the molecular envenomation strategy of the cone snail Conus geographus through transcriptome sequencing of its venom duct. BMC Genomics 13: 284. doi: 10.1186/1471-2164-13-284
- 6. Woodward SR, Cruz LJ, Olivera BM, Hillyard DR (1990) Constant and hypervariable regions in conotoxin propeptides. The EMBO Journal 9: 1015–1020.
- 7. Kaas Q, Westermann JC, Craik DJ (2010) Conopeptide characterization and classifications: An analysis using ConoServer. Toxicon 55: 1491–1509. doi: 10.1016/j.toxicon.2010.03.002
- 8. Olivera BM, Walker C, Cartier GE, Hooper D, Santos AD, et al. (1999) Speciation of cone snails and interspecific hyperdivergence of their venom peptides: potential evolutionary significance of introns. Annals of the New York Academy of Sciences 870: 223–237. doi: 10.1111/j.1749-6632.1999.tb08883.x
- 9. Satkunanathan N, Livett B, Gayler K, Sandall D, Down J, et al. (2005) Alpha-conotoxin Vc1.1 alleviates neuropathic pain and accelerates functional recovery of injured neurones. Brain Research 1059: 149–158. doi: 10.1016/j.brainres.2005.08.009
- 10. Safavi-Hemami H, Siero WA, Kuang Z, Williamson NA, Karas JA, et al. (2011) Embryonic toxin expression in the cone snail Conus victoriae: Primed to kill or divergent function? Journal of Biological Chemistry 286: 22546–22557. doi: 10.1074/jbc.m110.217703
- 11. Jakubowski JA, Keays DA, Kelley WP, Sandall DW, Bingham JP, et al. (2004) Determining sequences and post-translational modifications of novel conotoxins in Conus victoriae using cDNA sequencing and mass spectrometry. Journal of Mass Spectrometry 39: 548–557. doi: 10.1002/jms.624
- 12. Jakubowski JA, Kelley WP, Sweedler JV (2006) Screening for post-translational modifications in conotoxins using liquid chromatography/mass spectrometry: an important component of conotoxin discovery. Toxicon 47: 688–699. doi: 10.1016/j.toxicon.2006.01.021
- 13. Jakubowski JA, Sweedler JV (2004) Sequencing and mass profiling highly modified conotoxins using global reduction/alkylation followed by mass spectrometry. Analytical Chemistry 76: 6541–6547. doi: 10.1021/ac0494376
- 14. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ (1990) Basic local alignment search tool. Journal of Molecular Biology 215: 403–410. doi: 10.1016/s0022-2836(05)80360-2
- 15. Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, et al. (2009) BLAST+: architecture and applications. BMC Bioinformatics 10: 421. doi: 10.1186/1471-2105-10-421
- 16. Alberts B, Johnson A, Lewis J, Raff M, Roberts K, et al. (2002) Molecular biology of the cell, 4th edition. New York: Garland Science.
- 17. Soares MB, Bonaldo MF, Jelene P, Su L, Lawton L, et al. (1994) Construction and characterization of a normalized cDNA library. Proceedings of the National Academy of Sciences 91: 9228–9232. doi: 10.1073/pnas.91.20.9228
- 18. Biggs JS, Watkins M, Puillandre N, Ownby J-P, Lopez-Vera E, et al. (2010) Evolution of Conus peptide toxins: Analysis of Conus californicus Reeve, 1844. Molecular Phylogenetics and Evolution 56: 1–12. doi: 10.1016/j.ympev.2010.03.029
- 19. Sandall DW, Satkunanathan N, Keays DA, Polidano MA, Liping X, et al. (2003) A novel α-conotoxin identified by gene sequencing is active in suppressing the vascular response to selective stimulation of sensory nerves in vivo. Biochemistry 42: 6904–6911. doi: 10.1021/bi034043e
- 20. Callaghan B, Haythornthwaite A, Berecki G, Clark RJ, Craik DJ, et al. (2008) Analgesic α-conotoxins Vc1.1 and RgIA inhibit N-type calcium channels in rat sensory neurons via GABAB receptor activation. Journal of Neuroscience 28: 10943–10951. doi: 10.1523/jneurosci.3594-08.2008
- 21. Jimenez EC, Shetty RP, Lirazan M, Rivier J, Walker C, et al. (2003) Novel excitatory Conus peptides define a new conotoxin superfamily. Journal of Neurochemistry 85: 610–621. doi: 10.1046/j.1471-4159.2003.01685.x
- 22. Buczek O, Wei D, Babon JJ, Yang X, Fiedler B, et al. (2007) Structure and sodium channel activity of an excitatory I1-superfamily conotoxin. Biochemistry 46: 9929–9940. doi: 10.1021/bi700797f
- 23. Fiedler B, Zhang M-M, Buczek O, Azam L, Bulaj G, et al. (2008) Specificity, affinity and efficacy of iota-conotoxin RXIA, an agonist of voltage-gated sodium channels NaV1.2, 1.6 and 1.7. Biochemical Pharmacology 75: 2334–2344. doi: 10.1016/j.bcp.2008.03.019
- 24. Buczek O, Jimenez EC, Yoshikami D, Imperial JS, Watkins M, et al. (2008) I1-superfamily conotoxins and prediction of single d-amino acid occurrence. Toxicon 51: 218–229. doi: 10.1016/j.toxicon.2007.09.006
- 25. Buczek O, Yoshikami D, Watkins M, Bulaj G, Jimenez EC, et al. (2005) Characterization of D-amino-acid-containing excitatory conotoxins and redefinition of the I-conotoxin superfamily. FEBS Journal 272: 4178–4188. doi: 10.1111/j.1742-4658.2005.04830.x
- 26. Kauferstein S, Huys I, Lamthanh H, Stöcklin R, Sotto F, et al. (2003) A novel conotoxin inhibiting vertebrate voltage-sensitive potassium channels. Toxicon 42: 43–52. doi: 10.1016/s0041-0101(03)00099-0
- 27. Aguilar MB, Pérez-Reyes LI, López Z, de la Cotera EPH, Falcón A, et al. (2010) Peptide sr11a from Conus spurius is a novel peptide blocker for Kv1 potassium channels. Peptides 31: 1287–1291. doi: 10.1016/j.peptides.2010.04.007
- 28. Fan C-X, Chen X-K, Zhang C, Wang L-X, Duan K-L, et al. (2003) A novel conotoxin from Conus betulinus, κ-BtX, unique in cysteine pattern and in function as a specific BK channel modulator. Journal of Biological Chemistry 278: 12624–12633. doi: 10.1074/jbc.m210200200
- 29. Imperial JS, Bansal PS, Alewood PF, Daly NL, Craik DJ, et al. (2006) A novel conotoxin inhibitor of Kv1.6 channel and nAChR subtypes defines a new superfamily of conotoxins. Biochemistry 45: 8331–8340. doi: 10.1021/bi060263r
- 30. Jacob RB, McDougal OM (2010) The M-superfamily of conotoxins: A review. Cellular and Molecular Life Sciences 67: 17–27. doi: 10.1007/s00018-009-0125-0
- 31. McDougal OM, Turner MW, Ormond AJ, Poulter CD (2008) Three-dimensional structure of conotoxin tx3a: An M-1 branch peptide of the M-superfamily. Biochemistry 47: 2826–2832. doi: 10.1021/bi702388b
- 32. Corpuz GP, Jacobsen RB, Jimenez EC, Watkins M, Walker C, et al. (2005) Definition of the M-conotoxin superfamily: Characterization of novel peptides from molluscivorous Conus venoms. Biochemistry 44: 8176–8186. doi: 10.1021/bi047541b
- 33. Wang L, Liu J, Pi C, Zeng X, Zhou M, et al. (2009) Identification of a novel M-superfamily conotoxin with the ability to enhance tetrodotoxin sensitive sodium currents. Archives of Toxicology 83: 925–932. doi: 10.1007/s00204-009-0453-8
- 34. Dutertre S, Jin A-h, Kaas Q, Jones A, Alewood PF, et al. (2012) Deep venomics reveals the mechanism for expanded peptide diversity in cone snail venom. Molecular and Cellular Proteomics.
- 35. Wilson MJ, Zhang M-M, Azam L, Olivera BM, Bulaj G, et al. (2011) NaVβ Subunits Modulate the Inhibition of NaV1.8 by the Analgesic Gating Modifier µO-Conotoxin MrVIB. Journal of Pharmacology and Experimental Therapeutics 338: 687–693. doi: 10.1124/jpet.110.178343
- 36. Fainzilber M, Gordon D, Hasson A, Spira ME, Zlotkin E (1991) Mollusc-specific toxins from the venom of Conus textile neovicarius. European Journal of Biochemistry 202: 589–595. doi: 10.1111/j.1432-1033.1991.tb16412.x
- 37. Hansson K, Ma X, Eliasson L, Czerwiec E, Furie B, et al. (2004) The first γ-carboxyglutamic acid-containing contryphan. Journal of Biological Chemistry 279: 32453–32463. doi: 10.1074/jbc.m313825200
- 38. Sabareesh V, Gowd KH, Ramasamy P, Sudarslal S, Krishnan KS, et al. (2006) Characterization of contryphans from Conus loroisii and Conus amadis that target calcium channels. Peptides 27: 2647–2654. doi: 10.1016/j.peptides.2006.07.009
- 39. Pallaghy PK, He W, Jimenez EC, Olivera BM, Norton RS (2000) Structures of the contryphan family of cyclic peptides. Role of electrostatic interactions in cis−trans isomerism. Biochemistry 39: 12845–12852. doi: 10.1021/bi0010930
- 40. Jacobsen R, Jimenez EC, Grilley M, Watkins M, Hillyard D, et al. (1998) The contryphans, a d-tryptophan-containing family of Conus peptides: interconversion between conformers. Journal of Peptide Research 51: 173–179. doi: 10.1111/j.1399-3011.1998.tb01213.x
- 41. Craig AG, Jimenez EC, Dykert J, Nielsen DB, Gulyas J, et al. (1997) A novel post-translational modification involving bromination of tryptophan. Journal of Biological Chemistry 272: 4689–4698. doi: 10.1074/jbc.272.8.4689
- 42. Lirazan MB, Hooper D, Corpuz GP, Ramilo CA, Bandyopadhyay P, et al. (2000) The Spasmodic Peptide Defines a New Conotoxin Superfamily. Biochemistry 39: 1583–1588. doi: 10.1021/bi9923712
- 43. Miles LA, Dy CY, Nielsen J, Barnham KJ, Hinds MG, et al. (2002) Structure of a novel P-superfamily spasmodic conotoxin reveals an inhibitory cystine knot motif. Journal of Biological Chemistry 277: 43033–43040. doi: 10.1074/jbc.m206690200
- 44. England LJ, Imperial J, Jacobsen R, Craig AG, Gulyas J, et al. (1998) Inactivation of a serotonin-gated ion channel by a polypeptide toxin from marine snails. Science 281: 575–578. doi: 10.1126/science.281.5376.575
- 45. Teichert RW, Jimenez EC, Olivera BM (2005) αS-Conotoxin RVIIIA: A structurally unique conotoxin that broadly targets nicotinic acetylcholine receptors. Biochemistry 44: 7897–7902. doi: 10.1021/bi047274+
- 46. Liu L, Wu X, Yuan D, Chi C, Wang C (2008) Identification of a novel S-superfamily conotoxin from vermivorous Conus caracteristicus. Toxicon 51: 1331–1337. doi: 10.1016/j.toxicon.2008.03.001
- 47. Quinton L, Gilles N, De Pauw E (2009) TxXIIIA, an atypical homodimeric conotoxin found in the Conus textile venom. Journal of Proteomics 72: 219–226. doi: 10.1016/j.jprot.2009.01.021
- 48. Rigby AC, Lucas-Meunier E, Kalume DE, Czerwiec E, Hambe B, et al. (1999) A conotoxin from Conus textile with unusual posttranslational modifications reduces presynaptic Ca2+ influx. Proceedings of the National Academy of Sciences 96: 5758–5763. doi: 10.1073/pnas.96.10.5758
- 49. Walker CS, Steel D, Jacobsen RB, Lirazan MB, Cruz LJ, et al. (1999) The T-superfamily of conotoxins. Journal of Biological Chemistry 274: 30664–30671. doi: 10.1074/jbc.274.43.30664
- 50. Petrel C, Hocking HG, Reynaud M, Upert G, Favreau P, et al. (2013) Identification, structural and pharmacological characterization of τ-CnVA, a conopeptide that selectively interacts with somatostatin sst3 receptor. Biochemical Pharmacology.
- 51. Abogadie FC, Cruz LJ, Olivera BM, Walker C, Colledge C, et al. (2003) Conantokins. United States Patent: University of Utah Research Foundation, Salt Lake City, UT (US); Cognetix, Inc., Salt Lake City, UT (US); Salk Institute, LaJolla, CA (US).
- 52. Walker CS, Jensen S, Ellison M, Matta JA, Lee WY, et al. (2009) A novel Conus snail polypeptide causes excitotoxicity by blocking desensitization of AMPA receptors. Current Biology 19: 900–908. doi: 10.1016/j.cub.2009.05.017
- 53. Möller C, Marí F (2011) 9.3 KDa components of the injected venom of Conus purpurascens define a new five-disulfide conotoxin framework. Biopolymers 96: 158–165. doi: 10.1002/bip.21406
- 54. McIntosh JM, Ghomashchi F, Gelb MH, Dooley DJ, Stoehr SJ, et al. (1995) Conodipine-M, a novel phospholipase A isolated from the venom of the marine snail Conus magus. Journal of Biological Chemistry 270: 3518–3526. doi: 10.1074/jbc.270.8.3518
- 55. Terrat Y, Biass D, Dutertre S, Favreau P, Remm M, et al. (2012) High-resolution picture of a venom gland transcriptome: Case study with the marine snail Conus consors. Toxicon 59: 34–46. doi: 10.1016/j.toxicon.2011.10.001
- 56. Violette A, Biass D, Dutertre S, Koua D, Piquemal D, et al. (2012) Large-scale discovery of conopeptides and conoproteins in the injectable venom of a fish-hunting cone snail using a combined proteomic and transcriptomic approach. Journal of Proteomics.
- 57. Pi C, Liu J, Peng C, Liu Y, Jiang X, et al. (2006) Diversity and evolution of conotoxins based on gene expression profiling of Conus litteratus. Genomics 88: 809–819. doi: 10.1016/j.ygeno.2006.06.014
- 58. Hu H, Bandyopadhyay P, Olivera B, Yandell M (2011) Characterization of the Conus bullatus genome and its venom-duct transcriptome. BMC Genomics 12: 60. doi: 10.1186/1471-2164-12-60
- 59. Heinemann SH, Leipold E (2007) Conotoxins of the O-superfamily affecting voltage-gated sodium channels. Cellular and Molecular Life Sciences 64: 1329–1340. doi: 10.1007/s00018-007-6565-5
- 60. Yuan D-D, Liu L, Shao X-X, Peng C, Chi C-W, et al. (2009) New conotoxins define the novel I3-superfamily. Peptides 30: 861–865. doi: 10.1016/j.peptides.2009.01.012
- 61. Hansson K, Furie B, Furie BC, Stenflo J (2004) Isolation and characterization of three novel Gla-containing Conus marmoreus venom peptides, one with a novel cysteine pattern. Biochemical and Biophysical Research Communications 319: 1081–1087. doi: 10.1016/j.bbrc.2004.05.088
- 62. Liu Z, Yu Z, Liu N, Zhao C, Hu J, et al. (2010) cDNA cloning of conotoxins with framework XII from several Conus species. Acta Biochimica et Biophysica Sinica 42: 656–661.
- 63. Cruz LJ, Ramilo CA, Corpuz GP, Olivera BM (1992) Conus peptides: Phylogenetic range of biological activity. The Biological Bulletin 183: 159–164. doi: 10.2307/1542418
- 64. Imperial JS, Kantor Y, Watkins M, Heralde FM, Stevenson B, et al. (2007) Venomous auger snail Hastula (Impages) hectica (Linnaeus, 1758): molecular phylogeny, foregut anatomy and comparative toxinology. Journal of Experimental Zoology 308B: 744–756. doi: 10.1002/jez.b.21195
- 65. Pi C, Liu Y, Peng C, Jiang X, Liu J, et al. (2006) Analysis of expressed sequence tags from the venom ducts of Conus striatus: focusing on the expression profile of conotoxins. Biochimie 88: 131–140. doi: 10.1016/j.biochi.2005.08.001
- 66. Lluisma AO, Milash BA, Moore B, Olivera BM, Bandyopadhyay PK (2012) Novel venom peptides from the cone snail Conus pulicarius discovered through next-generation sequencing of its venom duct transcriptome. Marine Genomics: 43–51.
- 67. Laht S, Koua D, Kaplinski L, Lisacek F, Stöcklin R, et al. (2012) Identification and classification of conopeptides using profile Hidden Markov Models. Biochimica et Biophysica Acta 1824: 488–492. doi: 10.1016/j.bbapap.2011.12.004
- 68. Puillandre N, Koua D, Favreau P, Olivera B, Stöcklin R (2012) Molecular phylogeny, classification and evolution of conopeptides. Journal of Molecular Evolution 74: 297–309. doi: 10.1007/s00239-012-9507-2
- 69. Akcan M, Cao Y, Chongxu F, Craik DJ (2013) The three-dimensional solution structure of mini-M conotoxin BtIIIA reveals a disconnection between disulfide connectivity and peptide fold. Bioorganic & Medicinal Chemistry.
- 70. Poppe L, Hui JO, Ligutti J, Murray JK, Schnier PD (2011) PADLOC: A powerful tool to assign disulfide bond connectivities in peptides and proteins by NMR spectroscopy. Analytical Chemistry 84: 262–266. doi: 10.1021/ac203078x
- 71. Bhattacharyya M, Gupta K, Gowd KH, Balaram P (2013) Rapid mass spectrometric determination of disulfide connectivity in peptides and proteins. Molecular BioSystems.
- 72. Schmieder R, Edwards R (2011) Quality control and preprocessing of metagenomic datasets. Bioinformatics 27: 863–864. doi: 10.1093/bioinformatics/btr026
- 73. Chevreux B, Pfisterer T, Drescher B, Driesel AJ, Müller WEG, et al. (2004) Using the miraEST Assembler for Reliable and Automated mRNA Transcript Assembly and SNP Detection in Sequenced ESTs. Genome Research 14: 1147–1159. doi: 10.1101/gr.1917404
- 74. Mundry M, Bornberg-Bauer E, Sammeth M, Feulner PGD (2012) Evaluating characteristics of de novo assembly software on 454 transcriptome data: A simulation approach. PLoS ONE 7: e31410. doi: 10.1371/journal.pone.0031410
- 75. Katoh K, Standley DM (2013) MAFFT multiple sequence alignment software version 7: Improvements in performance and usability. Molecular Biology and Evolution 30: 772–780. doi: 10.1093/molbev/mst010
- 76. Petersen TN, Brunak S, von Heijne G, Nielsen H (2011) SignalP 4.0: discriminating signal peptides from transmembrane regions. Nature Methods 8: 785–786. doi: 10.1038/nmeth.1701
- 77. Brown MA, Begley GS, Czerwiec E, Stenberg LM, Jacobs M, et al. (2005) Precursors of novel Gla-containing conotoxins contain a carboxy-terminal recognition site that directs γ-carboxylation. Biochemistry 44: 9150–9159. doi: 10.1021/bi0503293
- 78. Conticello SG, Gilad Y, Avidan N, Ben-Asher E, Levy Z, et al. (2001) Mechanisms for evolving hypervariability: The case of conopeptides. Molecular Biology and Evolution 18: 120–131. doi: 10.1093/oxfordjournals.molbev.a003786
- 79. Han Y, Huang F, Jiang H, Liu L, Wang Q, et al. (2008) Purification and structural characterization of a d-amino acid-containing conopeptide, conomarphin, from Conus marmoreus. FEBS Journal 275: 1976–1987. doi: 10.1111/j.1742-4658.2008.06352.x
- 80. McIntosh JM, Hasson A, Spira ME, Gray WR, Li W, et al. (1995) A new family of conotoxins that blocks voltage-gated sodium channels. Journal of Biological Chemistry 270: 16796–16802. doi: 10.1074/jbc.270.28.16796
- 81. Nakamura T, Yu Z, Fainzilber M, Burlingame AL (1996) Mass spectrometric-based revision of the structure of a cysteine-rich peptide toxin with γ-carboxyglutamic acid, TxVIIA, from the sea snail, Conus textile. Protein Science 5: 524–530. doi: 10.1002/pro.5560050315
- 82. Jimenez EC, Watkins M, Juszczak LJ, Cruz LJ, Olivera BM (2001) Contryphans from Conus textile venom ducts. Toxicon 39: 803–808. doi: 10.1016/s0041-0101(00)00210-5
- 83. McIntosh JM, Corpuz GO, Layer RT, Garrett JE, Wagstaff JD, et al. (2000) Isolation and characterization of a novel Conus peptide with apparent antinociceptive activity. Journal of Biological Chemistry 275: 32391–32397. doi: 10.1074/jbc.m003619200
- 84. Azam L, McIntosh JM (2009) Alpha-conotoxins as pharmacological probes of nicotinic acetylcholine receptors. Acta Pharmacologica Sinica 30: 771–783. doi: 10.1038/aps.2009.47
- 85. Sharpe IA, Gehrmann J, Loughnan ML, Thomas L, Adams DA, et al. (2001) Two new classes of conopeptides inhibit the α1-adrenoceptor and noradrenaline transporter. Nature Neuroscience 4: 902–907. doi: 10.1038/nn0901-902
- 86. Bulaj G, Zhang M-M, Green BR, Fiedler B, Layer RT, et al. (2006) Synthetic µO-conotoxin MrVIB blocks TTX-resistant sodium channel NaV1.8 and has a long-lasting analgesic activity. Biochemistry 45: 7404–7414. doi: 10.1021/bi060159+
- 87. Shon K-J, Grilley MM, Marsh M, Yoshikami D, Hall AR, et al. (1995) Purification, characterization, synthesis, and cloning of the lockjaw peptide from Conus purpurascens venom. Biochemistry 34: 4913–4918. doi: 10.1021/bi00015a002
- 88. Terlau H, Stocker M, Shon KJ, McIntosh JM, Olivera BM (1996) µO-conotoxin MrVIA inhibits mammalian sodium channels, but not through site I. Journal of Neurophysiology 76: 1423–1429.
- 89. Olivera B, Gray W, Zeikus R, McIntosh J, Varga J, et al. (1985) Peptide neurotoxins from fish-hunting cone snails. Science 230: 1338–1343. doi: 10.1126/science.4071055
- 90. Massilia GR, Eliseo T, Grolleau F, Lapied B, Barbier J, et al. (2003) Contryphan-Vn: a modulator of Ca2+-dependent K+ channels. Biochemical and Biophysical Research Communications 303: 238–246. doi: 10.1016/s0006-291x(03)00331-0
- 91. Liu J, Wu Q, Pi C, Zhao Y, Zhou M, et al. (2007) Isolation and characterization of a T-superfamily conotoxin from Conus litteratus with targeting tetrodotoxin-sensitive sodium channels. Peptides 28: 2313–2319. doi: 10.1016/j.peptides.2007.09.006
- 92. Mena EE, Gullak MF, Pagnozzi MJ, Richter KE, Rivier J, et al. (1990) Conantokin-G: A novel peptide antagonist to the N-methyl-d-aspartic acid (NMDA) receptor. Neuroscience Letters 118: 241–244. doi: 10.1016/0304-3940(90)90637-o