Fasciola gigantica (Digenea) is an important foodborne trematode that causes liver fluke disease (fascioliasis) in mammals, including ungulates and humans, mainly in tropical climatic zones of the world. Despite its socioeconomic impact, almost nothing is known about the molecular biology of this parasite, its interplay with its hosts, and the pathogenesis of fascioliasis. Modern genomic technologies now provide unique opportunities to rapidly tackle these exciting areas. The present study reports the first transcriptome representing the adult stage of F. gigantica (of bovid origin), defined using a massively parallel sequencing-coupled bioinformatic approach. From >20 million raw sequence reads, >30,000 contiguous sequences were assembled, of which most were novel. Relative levels of transcription were determined for individual molecules, which were also characterized (at the inferred amino acid level) based on homology, gene ontology, and/or pathway mapping. Comparisons of the transcriptome of F. gigantica with those of other trematodes, including F. hepatica, revealed similarities in transcription for molecules inferred to have key roles in parasite-host interactions. Overall, the present dataset should provide a solid foundation for future fundamental genomic, proteomic, and metabolomic explorations of F. gigantica, as well as a basis for applied outcomes such as the development of novel methods of intervention against this neglected parasite.
Fasciola gigantica (Digenea) is a socioeconomically important liver fluke of humans and other mammals. It is the predominant cause of fascioliasis in the tropics and has a serious impact on the lives of tens of millions of people and other animals; yet, very little is known about this parasite and its relationship with its hosts at the molecular level. Here, advanced sequencing and bioinformatic technologies were employed to explore the genes transcribed in the adult stage of F. gigantica. From >20 million raw reads, >30,000 contiguous sequences were assembled. Relative levels of transcription were estimated; and molecules were characterized based on homology, gene ontology, and/or pathway mapping. Comparisons of the transcriptome of F. gigantica with those of other trematodes, including F. hepatica, showed similarities in transcription for molecules predicted to play roles in parasite-host interactions. The findings of the present study provide a foundation for a wide range of fundamental molecular studies of this neglected parasite, as well as research focused on developing new methods for the treatment, diagnosis, and control of fascioliasis.
Citation: Young ND, Jex AR, Cantacessi C, Hall RS, Campbell BE, et al. (2011) A Portrait of the Transcriptome of the Neglected Trematode, Fasciola gigantica—Biological and Biotechnological Implications. PLoS Negl Trop Dis 5(2): e1004. doi:10.1371/journal.pntd.0001004
Editor: Elodie Ghedin, University of Pittsburgh, United States
Received: October 7, 2010; Accepted: November 23, 2010; Published: February 1, 2011
Copyright: © 2011 Young et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This research was supported by the Australian Research Council (RBG), an Endeavour Fellowship (NDY), Charles Sturt University (TWS), and the Victorian Life Sciences Computation Initiative (VLSCI). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Liver flukes are socio-economically important parasitic flatworms (Platyhelminthes: Trematoda: Digenea) affecting humans and livestock in a wide range of countries. Two key representatives are Fasciola gigantica and F. hepatica. These parasites are the main cause of fascioliasis, a significant disease in ungulates – and humans, which is usually contracted via the ingestion of contaminated aquatic plants . Fascioliasis due to F. gigantica is recognized as a neglected tropical disease and is estimated to affect millions of people, mainly in parts of Africa, the Middle East and South-East Asia , –.
Fasciola gigantica and F. hepatica share common morphological, phylogenetic and biological characteristics, most clearly inferred by the evidence of sustained F. gigantica x F. hepatica (i.e. hybrid or introgressed) populations –. Fasciola spp. have di-heteroxenous life cycles ,  which involve (freshwater) lymnaeid snails as intermediate hosts and mammalian definitive hosts. The pathogenesis of fascioliasis in the definitive host is characterized by two main phases: (i) the acute/subacute phase begins with the ingestion of the metacercarial stage on herbage and is characterized by tissue damage, caused by the migration of immature worms through the duodenal wall, and then the liver capsule and parenchyma (usually 2–6 weeks) . Clinical signs can include abdominal pain, fever, anaemia, hepatomegaly and weight loss; (ii) the chronic phase commences when adult worms have established in the biliary ducts (~7–8 weeks after infection) . In addition to hepatic fibrosis (following acute/subacute infection) and anaemia, the chronic phase is characterized by progressive cholangitis, hyperplasia of the duct epithelium and periductal fibrosis, which can result in cholestatic hepatitis , . The onset of clinical signs can be variable, slow and typically include anaemia, jaundice, inappetence, oedema/ascites and/or diarrhoea , . Fascioliasis can also sometimes be associated with complications, such as co-infections with anaerobic bacteria , .
Despite their substantial morphological and biological similarities, differences in host specificity between F. gigantica and F. hepatica appear to define the aetiology and clinical manifestation of disease in the definitive host . A well-characterized difference between these parasites is their adaptation to different intermediate snail hosts. Fasciola gigantica usually prefers snail species (e.g., Radix natalensis and R. rubiginosa) that live in warm climates, whereas F. hepatica often utilizes snails (e.g., Lymnaea tomentosa and Galba truncatula) that are widespread in cool climates . This difference in intermediate host-preference appears to affect the distribution of the parasites, with F. gigantica being the most common cause of fascioliasis in the tropics and F. hepatica being more common in temperate regions. In sub-tropical regions, where both species of Fasciola can co-exist, fascioliasis is reported to be associated with F. gigantica, F. hepatica and/or F. gigantica x F. hepatica hybrid populations , . The clinical manifestation of fascioliasis in definitive hosts can also depend on parasite factors (e.g., species/strain of worm, infective dose and/or intensity of infection) and host factors (e.g., species of host, immune response and phase/duration of the infection) –, –. Some studies seem to suggest that F. gigantica may be better adapted to parasitize cattle, with higher levels of resistance being observed in sheep and goats , , . In contrast, most breeds of sheep are highly susceptible to fascioliasis caused by F. hepatica . Current evidence , , ,  suggests differences in biology between F. gigantica and F. hepatica as well as the disease(s) that these parasites cause; yet, our understanding of the molecular biology of these parasites and of fascioliasis, particularly in humans, is in its infancy , .
Recent developments in high-throughput sequencing – and bioinformatics  are now providing researchers with the much-needed tools to explore the fundamental biology of digeneans , . To date, molecular biological research of socioeconomically important trematodes has been dominated by a focus on Schistosoma mansoni and S. japonicum, culminating, recently, in the sequencing of their nuclear genomes , . These two genome sequences provide an invaluable resource to support fundamental explorations of the biology and evolution of flukes as well as their interactions with their hosts . However, the biology of schistosomes, which live en copula (i.e. as male/female pairs) in the blood stream of mammalian hosts, is distinct from that of hermaphroditic liver flukes, such as F. gigantica and F. hepatica. Recently, the transcriptomes of several foodborne liver flukes, including F. hepatica, Clonorchis sinensis and Opisthorchis viverrini, were determined , . Although this progress has improved our understanding of the molecular biology of these worms and has paved the way toward the discovery of new intervention targets, almost nothing is known about F. gigantica. This paucity of knowledge is clearly illustrated by the comparison of >60,000 transcripts currently available for F. hepatica , ,  with a total of 39 for F. gigantica in public databases (National Center for Biotechnology Information, NCBI).
In the present study, we characterized the transcriptome of the adult stage of F. gigantica and provide an essential resource for future explorations of this socioeconomically important parasite. We used massively parallel nucleotide sequencing of a non-normalized cDNA library to provide a deep insight into this transcriptome as well as relative transcription levels in this developmental stage. In addition, comparative analyses of the dataset predicted a range of proteins that are conserved among trematodes, providing an invaluable resource to underpin future efforts toward developing new approaches for the intervention against and control of fascioliasis.
Materials and Methods
Collection of adult F. gigantica
Adults of F. gigantica were collected (at an abattoir in Khon Kaen, Thailand), from the large bile ducts of a liver from a water buffalo (Bubalus bubalis) with a naturally acquired infection. All work was conducted in accordance with protocols approved by the animal ethics committee of the Department of Anatomy, Faculty of Veterinary Medicine, Khon Kaen University, Thailand. Adult worms were washed extensively in physiological saline and then transferred to and maintained in culture in vitro for 2 h  to allow the worms to regurgitate caecal contents. Subsequently, all worms were washed extensively in physiological saline, snap-frozen in liquid nitrogen and then stored at −80°C. The specific identity of each individual worm was verified by isolating genomic DNA  and conducting PCR-coupled, bidirectional sequencing (ABI 3730xl DNA analyzer, Applied Biosystems, California, USA) of the second internal transcribed spacer (ITS-2) of nuclear ribosomal DNA . In addition, the reproductive state and ploidy of each of three adult worms used for transcriptomic sequencing were examined histologically ; the presence of mature eggs and sperm confirmed that all three worms represented F. gigantica and not F. gigantica x F. hepatica hybrids (see ).
Library construction and sequencing
A full poly(A)-selected transcriptome sequencing approach (RNA-seq) was employed. DNase I-treated total RNA was extracted from three adult worms of F. gigantica using the TriPure isolation reagent (Roche), according to manufacturer's protocol. The amounts of total RNA were determined spectrophotometerically, and RNA integrity was verified by agarose gel electrophoresis and using a 2100 BioAnalyzer (Agilent). Polyadenylated (polyA+) RNA was purified from 10 µg of total RNA using Sera-Mag oligo(dT) beads, fragmented to a length of 100–500 nucleotides, reverse transcribed using random hexamers, end-repaired and adaptor-ligated, according to the manufacturer's protocol (Illumina). Ligated products of ~200 base pairs (bp) were excised from agarose and PCR-amplified (15 cycles). Products were cleaned using a MinElute column (Qiagen) and sequenced on a Genome Analyzer II (Illumina), according to the manufacturers' instructions.
Assembly and remapping of short-insert Illumina reads
The short-insert, single reads, generated from the adult F. gigantica cDNA library, were assembled using the computer program SOAPdenovo v1.04 . Briefly, short-insert, single-end reads filtered for adapter sequences and suboptimal read quality (i.e. with PHRED quality scores of <28) were used to construct and store a De Bruijn-graph using a k-mer value of 29 bp. Sequence reads were trimmed, and links with low coverage were removed before contig sequence k-mers were conjoined in an unambiguous path. To reduce apparent redundancy, sequences of >200 nucleotides were clustered using the contig assembly program (CAP3) , employing a minimum overlap length of 40 nucleotides and an identity threshold of 95%. Using BLASTn and then BLASTx analyses, all nucleotide sequences (n = 12) with significantly higher identity (based on the E-value) to those of any potential contaminants (including bacteria, fungi and/or the bovid host) than to digeneans or any other eukaryotes (for which sequence data are currently available) were removed.
The raw sequence reads derived from the non-normalized adult F. gigantica cDNA library were then mapped to the non-redundant transcriptomic data using the program SOAP2 . Briefly, raw sequence reads were aligned to the non-redundant transcriptomic data, such that each raw sequence read was uniquely mapped (i.e. to a unique transcript). Reads that mapped to more than one transcript (designated “multi-reads”) were randomly allocated to a unique transcript, such that they were recorded only once. To provide a relative assessment of transcript abundance, the number of raw reads that mapped to each sequence was normalized for length (i.e. reads per kilobase per million reads, RPKM) .
The non-redundant transcriptomic dataset for adult F. gigantica was annotated (April 2010) based on BLASTx for protein sequence homology at permissive (E-value: <1E−05), moderate (<1E−15) and/or stringent (<1E−30) search strategies against sequences in available databases, including: (i) NCBI non-redundant sequence database (GenBank, http://www.ncbi.nlm.nih.gov/); (ii) non-redundant genome-wide sequence databases for eukaryotic organisms [ENSEMBL (http://www.ensembl.org/), SchistoDB for S. mansoni (http://schistodb.net/schistodb20/)  and the Chinese Human Genome Center at Shanghai database for S. japonicum (http://lifecenter.sgst.cn/schistosoma) ; (iii) transcriptomic datasets available for F. hepatica, Clonorchis sinensis and Opisthorchis viverrini , ; and (iv) manually curated information resources for peptidases (MEROPS database)  and kinases (European Molecular Biology Laboratory kinase database, http://www.sarfari.org/kinasesarfari/).
Proteins were conceptually translated from the predicted coding domains of individual nucleotide sequences. Protein-coding sequences were classified functionally using the program InterProScan , employing the default search parameters. Based on their homology to conserved domains and protein families, predicted proteins of F. gigantica were assigned gene ontology (GO) categories and parental (i.e. level 2) terms (http://www.geneontology.org/). Inferred proteins with homologues/orthologues in other organisms were mapped to conserved biological pathways utilizing the Kyoto encyclopedia of genes and genomes (KEGG) orthology-based annotation system (KOBAS) . Orthologues in KEGG (i.e. metabolic) pathways were displayed using the tool iPath2 (http://pathways.embl.de/ipath2) . Signal peptides were also predicted using the program SignalP 3.0, employing both the neural network and hidden Markov models , and transmembrane domains using TMHMM , a membrane topology prediction program. Proteins inferred to be classically excreted and/or secreted from F. gigantica, based on the presence of a signal peptide, absence of any transmembrane domain(s) as well as sequence homology to one or more known excretory/secretory (ES) proteins listed in databases for eukaryotes , F. hepatica , S. mansoni  and the nematode Brugia malayi ,  were identified and collated.
Characterization of the transcriptome of F. gigantica
More than 20 million, short-insert Illumina reads were generated for the adult stage of F. gigantica (Table 1). Raw sequence data were deposited in the sequence read archive (SRA) database of NCBI (http://www.ncbi.nlm.nih.gov/sra) under accession number SRA024257. BLASTn searches (E-value: 1E−05) revealed that all 39 expressed sequence tags (ESTs) available in public databases for this parasite were contained within the present, assembled sequence dataset (available via http://gasser-research.vet.unimelb.edu.au/; contact corresponding authors); thus, only the sequence data from the present study were assembled (see Table 1). Short reads clustered into 30,525 unique sequences with a mean length of 524 nucleotides (range: 201–18,098) and with a G+C content of 46.0±4.2%. More than 25% of the raw reads were re-mapped (sequence length of ≥200 nucleotides) to the transcriptomic data, with a mean depth of coverage of 188±469 reads per sequence.
Sequence homology between F. gigantica and key eukaryotes
The transcriptomic dataset was used to interrogate genomic/transcriptomic databases (i.e. F. hepatica, C. sinensis, O. viverrini, S. mansoni, S. japonicum and NCBI non-redundant sequence databases) using BLASTx. The majority of F. gigantica sequences (27,755 of 30,513 sequence matches, equating to 91.0%) matched previously identified molecules at an E-value threshold of 1E−05 (Table 1). Proteins inferred from the transcriptome of F. gigantica were compared with those predicted from transcriptomic data for the adult stages of F. hepatica, C. sinensis and O. viverrini ,  and complete proteomic datasets for selected organisms, including Saccharomyces cerevisiae (yeast), S. mansoni and S. japonicum (trematodes), Caenorhabditis elegans (‘elegant worm’), Drosophila melanogaster (vinegar fly); Danio rerio (zebra fish), Gallus gallus (chicken), Xenopus tropicalis (frog); Bos taurus (cattle), Homo sapiens and Mus musculus (mouse) (Table 2). As expected, proteins predicted for F. gigantica (n = 30,513) had the highest sequence homology to F. hepatica using permissive (27,354 sequences; 89.6%), moderate (25,390 sequences; 83.2%) and stringent (20,798 sequences; 68.2%) search strategies. Amino acid sequences inferred for F. gigantica had the greatest similarity to those of other members of the class Trematoda included herein, resulting in 10,752 to 27,354 sequence matches (35.2–89.6%) or a total of 27,745 sequences matches (90.9%) at an E-value of 1E−05. In agreement with findings for other trematodes –, proteins inferred for F. gigantica had a higher sequence similarity to those of mammals (30.1–30.2%) than C. elegans (23.8%).
Comparative protein sequence analysis was carried out between or among key members of the Trematoda (Table 3). Despite significant differences in biology and life history, representatives of the family Fasciolidae (i.e. F. gigantica and F. hepatica) shared greater protein sequence homology (38.3%; E-value: 1E−05) with sequences encoded in the genomes of S. japonicum and S. mansoni (blood flukes; family Schistosomatidae) than to those encoded by transcripts from the adult stages of C. sinensis and O. viverrini (liver flukes; family Opisthorchiidae; 26.8%; E-value: 1E−05). Only a small number of proteins predicted for F. gigantica (i.e. 253 and 705 sequences at an E-value of 1E−30 and 1E−05, respectively) were homologous among the representatives of the families Fasciolidae, Schistosomatidae and Opisthorchiidae, but absent (based on a similar level of sequence homology) from the other eukaryotic organisms included in the present study (see Table S1). These molecules included proteases (mastin and leucine amino peptidase), membrane transporter proteins (aquaporin 3, multidrug resistance-associated protein-type ATP-binding cassette transporter and oxalate:formate antiporter) and proteins involved in cellular signalling (i.e. calcium binding proteins and an epidermal growth factor-like peptide).
Proteins inferred from the transcriptome of F. gigantica were predicted to contain signal peptide domains (1,543 sequences) and/or transmembrane domains (3,599 sequences) (Table 1). Based on the presence of signal peptide domains in and absence of transmembrane motifs from the predicted proteins as well as the presence of one or more homologues in current ES protein databases, 255 putative ES proteins, including cysteine proteases, cathepsins B and L, legumain and cystatin (a cysteine protease inhibitor) were inferred (Table S2).
Predicted proteins were also categorized according to their inferred molecular function, cellular localization and association with biological pathways, and compared with those encoded in the transcriptomes of the adult stages of other liver flukes, including F. hepatica (Table 1 and Table S3). A significant proportion (30.6%) of the transcriptome of F. gigantica was inferred to encode 3,535 conserved protein domains or family signatures. Based on this annotation, 1,124 GO terms were inferred. The transcriptome of F. gigantica contained most of the parental (i.e. level 2) terms assigned previously to F. hepatica (87%) , C. sinensis and O. viverrini (80%) , based on analyses of sequence data generated previously from normalized cDNA libraries representing adult worms. Predicted proteins assigned to the term ‘biological process’ (3,461 sequences; 401 GO terms) were associated predominantly with: (i) cellular processes (3,322 sequences; 64.1%), such as protein amino acid phosphorylation and transmembrane transport; (ii) metabolic processes (2,686 sequences; 51.8%), such as protein amino acid phosphorylation and translation; and (iii) localization (863 sequences; 16.7%), such as the directed movement of substances within or between cells including the transport of solutes across a membrane. Proteins assigned to the term ‘molecular function’ were mainly linked to: (i) binding (3,362 sequences; 70.1%), such as the binding of ATP, zinc ion and protein; (ii) catalytic activities (2,736 sequences; 52.8%) of enzymes, including protein kinases; and (iii) transporter activity (342; 6.6%), including ATPase activity, coupled to the transport of molecules through membranes. Predicted proteins for F. gigantica were also linked to cellular components, such as membranes, nucleus, protein complexes or ribosomes (Table S3).
Significant similarity (E-value: 1E−05) between protein sequences predicted for F. gigantica and those in the KOBAS database allowed 4,466 sequences to be assigned to 1,981 KO terms and 225 standardized KEGG pathway terms (Table 1). A significant proportion of amino acid sequences were associated with: (i) metabolic pathways (1,259 sequences; 549 KO terms), including carbohydrate, amino acid and lipid metabolism; (ii) cellular processes (919 sequences; 324 KO terms), including those linked to cell communication as well as the endocrine and/or immune systems; (iii) environmental information-processing pathways (738 sequences; 278 KO terms), including signal transduction, membrane transport and signaling molecules; (iv) genetic information processing pathways (661 sequences; 355 KO terms), including folding, sorting and degradation, translation and replication and repair; and (v) pathways linked to human diseases (341 sequences; 165 KO terms), including cancers, neurodegenerative disorders and infectious diseases (Table 4). Inferred proteins of F. gigantica (2097 sequences; 892 KO terms) were mapped to conserved, orthologous KEGG metabolic pathway terms, with a high degree of confidence based on protein sequence homology, employing moderate (785 KO terms; 88.0%; E-value, 1E−15) and stringent (589 KO terms, 66.0%; E-value, 1E−30) search strategies (Figure S1). Proteins predicted for F. gigantica that shared highest homology to conserved metabolic enzymes of eukaryotes (listed in the KEGG database) were associated predominantly with carbohydrate, lipid and/or energy metabolism. A high degree of similarity in metabolic pathways was evident between F. gigantica and F. hepatica  (Figure S2), regardless of whether the data were derived from a non-normalized cDNA library sequenced by Illumina (F. gigantica) [present study] or a normalized library sequenced using 454 technology (F. hepatica) . Interestingly, in F. gigantica, there was no evidence of any transcripts encoding 3-oxoacyl-[acyl-carrier-protein] synthase II [EC:184.108.40.206], which, in eukaryotes, is usually linked to the fatty acid biosynthesis pathway (KEGG pathway map00061). Although this molecule was encoded in F. hepatica (Figure S2) and S. mansoni , it is the only enzyme representing this particular pathway in these organisms. Current evidence (cf. , ) indicates that digeneans lack the repertoire of enzymes required for the de novo synthesis of fatty acids and that they are highly dependent on complex fatty acid precursors from their host(s).
Most abundantly transcribed genes (as assessed based on RPKM) in adult F. gigantica were those linked to reproductive processes, antioxidant molecules (thioredoxin, peroxiredoxin and fatty acid-binding proteins), molecular chaperones (heat shock proteins 70 and 90), proteins involved in the glycolytic pathway (fructose-bisphosphate aldolase, fructose-16-bisphosphatase-related protein, glutamate dehydrogenase and glyceraldehyde phosphate dehydrogenase), translation (elongation factor-1 alpha, RNA-binding protein 9 and cytosolic 80S ribosomal protein L39), cytoskeletal proteins (alpha-tubulin and dynein) and cysteine (calpain, cathepsin B, legumain-1 and legumain-2) and metallo (prolyl carboxypeptidase) proteases (Table S4). A detailed examination of the data revealed that a full complement of proteins required to degrade carbohydrates to phosphoenolpyruvate via the glycolytic pathway  was present (Figure S3).
Proteins predicted for F. gigantica were assigned to major families (2,214 sequences; 998 terms) based on homology to annotated proteins in the KEGG protein family database. Sequences encoded in the transcriptome were almost equally subdivided into three major categories: ‘genetic information processing’ (704 sequences; 31.8%), ‘cellular signaling’ (704 sequences; 31.8%) and ‘metabolism’ (676 sequences; 30.5%) (Figure 1A). Putative proteins were further categorized into various sub-categories, including: (i) protein kinases (364 sequences; 16.4%); (ii) cytoskeleton proteins (338 sequences; 15.3%); (iii) ubiquitin enzymes (224 sequences; 10.1%); and (iv) proteases (214 sequences; 9.7%) (Figure 1B). A further in silico analysis assigned most of the protein kinases (308 sequences) to eight structurally-related classes, inferred to be crucial for normal cellular processes (Figure 1C and Table S5), such as: (i) CMGC (66 sequences; 21.4%) cyclin-dependent (CDKs), mitogen-activated protein (MAP kinases), glycogen synthase (GSK) and CDK-like serine/threonine kinases; (ii) CAMK (55 sequences; 17.9%), Ca2+/calmodulin-dependent serine/threonine kinases; (iii) AGC (44 sequences; 14.3%), cAMP-dependent, cGMP-dependent and protein kinase C serine/threonine kinases; (iv) STE (35 sequences; 11.4%), serine/threonine protein kinases associated with the mitogen-activated protein kinase cascade; and (v) tyrosine kinases (34 sequences; 11.0%). Kinases that were abundantly transcribed included cAMP-dependent protein kinases (AGC) and casein kinases (CK1), essential for cell signal transduction; Ca2+/calmodulin-dependent serine/threonine kinases (CAMK) involved in calcium signalling ; and dual-specificity tyrosine-(Y)-phosphorylation regulated kinase 2 (CMGC) involved in the regulation of cellular growth and/or development .
Absolute numbers and percentages are given in parentheses. Category 1 (A) and category 2 (B) proteins were inferred based on homology to proteins in the Kyoto encyclopedia of genes and genomes (KEGG) database. Protein kinases (C) and proteases (D) were inferred based on homology to proteins in the EMBL Sarfari kinase and/or MEROPS databases. Kinase-like molecules were grouped within the: cyclin-dependent, mitogen-activated protein, glycogen synthase and CDK-like serine/threonine kinases (CMGC); Ca2+/calmodulin-dependent serine/threonine kinases (CAMK); cAMP-dependent, cGMP-dependent and protein kinase C serine/threonine kinases (AGC); serine/threonine protein kinases associated with the mitogen-activated protein kinase cascade (STE); tyrosine kinase (TK); tyrosine kinase-like (TKL); and other unclassified kinases (Other).
Similarly, further of the F. gigantica dataset inferred 304 proteases (linked to 247 MEROPS terms) and 137 protease inhibitors (122 MEROPS terms), including representatives of five of the seven protease catalytic types defined within the MEROPS database  (Figure 1D and Table S6). The ratio (aspartic:cysteine:metallo:serine:threonine)of catalytic types of proteases represented in the MEROPS database  and present in the of transcriptome of F. gigantica was 5:34:34:22:5, which was comparable with those inferred from the genomes of S. japonicum (4:32:35:21:8) and S. mansoni (6:29:35:23:7) , . In F. gigantica, genes encoding the metalloproteases (82 MEROPS terms; 33.5%), leucyl aminopeptidases, cytosolic exopeptidases, which cleave N-terminal residues from proteins and peptides, were abundantly transcribed. Cysteine proteases (82 MEROPS terms; 33.5%) inferred included those involved in the digestion of host proteins (legumain/asparaginyl endopeptidase and cathepsins) and calcium-induced modulation of cellular processes (calpain) (Table S4). Like all eukaryotes, F. gigantica was inferred to possess a rich diversity of serine proteases (55 MEROPS terms; 22.4%), including an abundantly transcribed serine carboxypeptidase, which are presumably important for fundamental cellular processes. Threonine proteases (13 MEROPS terms; 5.3%) which were abundantly represented included enzymes required for the assembly and activation of the proteasome complex . Aspartic proteases encoded (13 MEROPS terms; 5.3%) included cathepsin D, an aspartyl lysosomal peptidase which, in trematodes, is suggested to play a role in the degradation of host tissues .
Cathepsins representing families B and L were inferred (Table S7) from the present dataset by annotating and re-mapping sequences of ≥200 nucleotides (‘stringent conditions’). Inspection of the annotated data identified 18 and two sequences with homology to cathepsin B (including clades B1 and B2) and cathepsin L (clades 1 and 2), respectively. As cathepsin L is reported to be a dominant family of proteins of F. gigantica and F. hepatica , –, the relative levels of transcription of genes encoding members of cathepsins B and L were explored. The re-mapping of raw sequences (Illumina) to previously published transcripts (n = 15) encoding cathepsins from F. gigantica (see Table S8) – revealed high (RPKM of 2,543–214,634) and low (RPKMs of 14–21) levels of transcription for 10 and two representatives, respectively, of 12 distinct members of the cathepsin L family, and low and moderate (RPKMs of 0.9 and 300) levels for two of the three representatives of cathepsin B, respectively (Table S8).
A number of trematodes are of major socioeconomic importance; yet, they cause some of the most neglected diseases of humans and livestock worldwide. Until recently, there has been a reliance on data and information available for schistosomes (blood flukes) ,  to infer aspects of the molecular biology of key trematodes. The recent characterization of the transcriptomes of the liver flukes F. hepatica, C. sinensis and O. viverrini ,  has provided the first insights into the molecular biology of these foodborne trematodes. Extending this work, the present study provides a deep exploration of the transcriptome of the adult stage of F. gigantica. With only 39 transcripts previously available in public databases, the >30,000 sequences characterized here are novel for this species and constitute a significant contribution to current databases , , – and an invaluable resource to advance our understanding of the fundamental biology of F. gigantica, its interplay with its hosts and the disease that this parasite causes. Importantly, the present transcriptomic data set will also be an essential resource for the future assembly of the nuclear genome of F. gigantica, assisting in the determination of gene structures, prediction of alternative transcript splicing and the characterization of regulatory elements.
The present transcriptomic dataset should, in the future, assist significantly in identifying genes linked specifically to parasitism and also to our understanding of the evolution of trematodes . Based on current similarity searches, 80% (BLASTx, E-value 1E−15) to 90% (BLASTx, E-value 1E−05) of the predicted protein sequences of F. gigantica and F. hepatica were inferred to be homologues, reflecting their close biological and phylogenetic relationships . More broadly, 253 protein sequences inferred for F. gigantica were homologous (BLASTx, E-value <1E−30) to proteins identified in other trematodes but divergent from those predicted for a range of other eukaryotes, including human, mouse, cattle, zebrafish, vinegar fly, ‘elegant worm’ and/or yeast. Although there is a paucity of data on the function of the majority of such molecules, their characterization could lead to the discovery of new targets for the design of safe trematocidal drugs and/or vaccines.
Massively parallel nucleotide sequencing from a non-normalized cDNA library and the subsequent assembly of sequence data have produced a high quality draft of the transcriptome of adult F. gigantica and provided invaluable insights into the relative abundance of transcripts. The assignment of molecules encoded in the transcriptome to molecular functions and biological pathways has revealed a substantial diversity of terms, comparable with those predicted for other liver flukes, including F. hepatica , C. sinensis and O. viverrini , and the blood fluke S. mansoni (http://amigo.geneontology.org/; http://schistodb.net/schistodb20/). Proteins known to be expressed in adult F. hepatica , ,  were compared with those inferred from the transcriptome of F. gigantica. Molecules well represented in the adult transcriptomes of both F. gigantica [the present study] and F. hepatica  included antioxidants, heat shock proteins and cysteine proteases. Antioxidants have been suggested to play a role in host immune modulation and shown to be highly expressed throughout the life history of F. hepatica , including peroxiredoxin, thioredoxin and glutathione transferases, whose expression has been suggested to protect fasciolids from harmful, host-derived reactive oxygen species –. A similar protective role has also been reported for protein chaperones, such as heat shock protein-70, which have been inferred to play an important role in relation to protein folding and whose expression is proposed to be induced by one or more host immune responses to F. gigantica or F. hepatica . Therefore, within the definitive host, adult stages of F. gigantica and F. hepatica appear to express repertoires of molecules that are directed toward the protection of cellular processes from the host response to liver fluke infection, including the protection from reactive oxygen species (ROS) . Protection from damage caused by ROS is important, since juveniles of F. gigantica are susceptible (in vitro) to antibody-dependent cell-mediated cytotoxicity involving ROS .
A diverse array of proteases were abundantly represented in the transcriptome of the adult stage of F. gigantica, as expected based on previous proteomic studies , . Cysteine proteases constituted a significant proportion of catalytic enzymes encoded in this species (Figure 1; Table S6), which appears to reflect their crucial roles in parasite feeding and/or immuno-modulation in the definitive host , . A cathepsin B-like molecule (B1) was also well represented in the present transcriptome (Table S7 and Table S8). Evidence of abundant transcription of one or more homologues in the tegument and/or digestive and reproductive tracts  and their absence from ES products , , ,  suggests one or more key functions for cathepsin Bs within the tissues of this parasite. A detailed analysis also revealed that transcripts encoding cathepsin Ls (including members of clades 1, 2 and 5; ) were abundant in the present dataset (Table S8), consistent with their dominance in ES products from adult F. hepatica , , .
The complexity of the cathepsins and the close relatedness of some of them were reflected in a technical challenge in the assembly of (short-read) Illumina sequence data. The abundance of many related and, apparently, paralogous and/or alternatively spliced transcripts encoding cathepsin Ls (cf.) prevents accurate assemblies from short transcripts, even under stringent conditions (as used herein). This point emphasizes a limitation of the de novo-assembly of single-end sequences produced using short-read sequencing platforms, such as Illumina  and SOLiD , in the absence of a reference genome sequence. This limitation should be overcome in the future through the combined assembly and annotation of paired-end sequence data with medium to long sequences (e.g., of 350–1000 nucleotides) produced using alternative sequencing technology, such as 454 (Roche) . Such an integrated sequencing approach, preferably in conjunction with proteomic analyses, could be used to quantitatively study transcription/expression profiles in key developmental stages and distinct phenotypes (or hybrids) of F. gigantica , , . Although the transcriptome of the adult stage of F. gigantica has been defined here, there is no information on differential transcription among miracidial, sporocyst, redial, cercarial, juvenile and adult stages of this parasite. Clearly, exploring transcription among and also within all developmental stages of this parasite will have important implications for understanding development, reproduction, parasite-host interactions as well as fascioliasis at the biochemical, immunological, molecular and pathophysiological levels. Detailed knowledge of the transcriptome of F. gigantica will also assist in the study of developmental processes and metabolic pathways through functional genomics. Gene perturbation assays are available for S. mansoni and F. hepatica –, suggesting that they could be adapted to F. gigantica for functional genomic explorations. The integration of data from comparative and functional analyses could pave the way for the development of new intervention methods against F. gigantica, built on the identification and of essential genes or gene products linked to key biological or biochemical pathways. For instance, phosphofructokinase (a glycolytic enzyme) is a known metabolic “choke-point” in S. mansoni , because trivalent, organic antimony compounds can inhibit worm growth in vitro . The genes encoding phosphofructokinase and other key enzymes in the glycolysis pathway were abundantly transcribed in adult F. gigantica (Figure S3). Also a thioredoxin-glutathione reductase (a multifunctional detoxifying enzyme) might represent a novel drug target in F. gigantica, because a gene encoding a homologue of this enzyme in S. mansoni has been shown to be essential for life, based on functional genomic analyses –. Clearly, future structural and functional explorations of molecules (including kinases, proteases and their inhibitors, neuropeptides and selected structural proteins), which are recognized to be conserved among fasciolids and schistosomes and/or predicted to be essential and druggable , –, should assist in the design and development of entirely new classes of potent trematocidal compounds.
A summary of metabolic pathways predicted for amino acid sequences inferred from transcriptomic data for the adult stage of Fasciola gigantica. Mapping was conducted based on homology to annotated proteins in the Kyoto encyclopedia of genes and genomes (KEGG) pathways database. Results were displayed using iPath2 (http://pathways.embl.de/ipath2/). The colours represent sequence homology (BLASTx) to orthologous proteins at permissive (yellow path; E-value <1E−05), moderate (orange; E-value <1E−15) and stringent (red; E-value, <1E−30) search strategies.
A summary of metabolic pathways predicted for amino acid sequences inferred from the transcriptome of the adult stage of Fasciola gigantica and Fasciola hepatica  based on homology mapping to annotated proteins in the Kyoto encyclopedia of genes and genomes (KEGG) biological pathways database. Results were displayed using iPath2 (http://pathways.embl.de/ipath2/). Shared pathways (green) between F. gigantica (yellow) and F. hepatica (blue) are indicated.
The glycolysis pathway predicted for proteins inferred to be encoded in the transcriptome of the adult stage of Fasciola gigantica based on homology mapping to annotated proteins in the Kyoto encyclopedia of genes and genomes (KEGG) biological pathways database. Levels of transcription are inferred from sequencing depth and are represented by the number of reads per kilobase per million reads (RPKM). Transcription was ranked as high (red, RPKM >500), moderate (orange, RPKM 250–500) or low (yellow, RPKM <250). The present image was modified from that in the KEGG database (http://www.genome.jp/kegg/).
Adult Fasciola gigantica transcripts homologous to predicted proteins from other trematodes.
Putative adult Fasciola gigantica proteins with homology to classically secreted/excreted molecules.
Predicted function of adult Fasciola gigantica transcripts based on gene ontology (GO).
Putative adult Fasciola gigantica proteins with homology to proteins submitted to the NCBI non-redundant database.
Putative adult Fasciola gigantica proteins with homology to kinases within the European Molecular Biology Laboratory kinase database.
Putative adult Fasciola gigantica proteins with homology to proteases and protease inhibitors within the MEROPS enzyme database.
Putative adult Fasciola gigantica proteins with homology to the cathepsin family of cysteine proteases.
Raw sequence reads generated from adult Fasciola gigantica re-mapped to known F. gigantica cathepsins.
This research was supported by the Department of Education, Employment, and Workplace Relations (Endeavour Fellowship Program) and by the Victorian Life Sciences Computation Initiative (VLSCI).
Conceived and designed the experiments: NDY RBG. Performed the experiments: NDY ST PT TL. Analyzed the data: NDY ARJ CC TWS. Contributed reagents/materials/analysis tools: RSH BEC. Wrote the paper: NDY RBG.
- 1. Boray JC (1969) Experimental fascioliasis in Australia. Adv Parasitol 7: 95–210.
- 2. Spithill T, Smooker PM, Copeman B (1999) Fasciola gigantica: Epidemiology, control, immunology and molecular biology. In: Dalton JP, editor. Fasciolosis. Oxon, UK: CABI publishing. pp. 465–525.
- 3. Torgerson P, Claxton J (1999) Epidemiology and Control. In: Dalton JP, editor. Fasciolosis. Oxon, UK: CABI publishing. pp. 113–149.
- 4. Mas-Coma S, Bargues MD, Valero MA (2005) Fascioliasis and other plant-borne trematode zoonoses. Int J Parasitol 35: 1255–1278.
- 5. Ashrafi K, Massoud J, Holakouei K, Joafshani MA, Valero MA, et al. (2004) Evidence suggesting that Fasciola gigantica may be the most prevalent causal agent of fascioliasis in northern Iran. Iran J Public Health 33: 31–37.
- 6. Keiser J, Utzinger J (2005) Emerging foodborne trematodiasis. Emerg Infect Dis 11: 1507–1514.
- 7. Keiser J, Utzinger J (2009) Food-borne trematodiases. Clin Microbiol Rev 22: 466–483.
- 8. Le TH, De NV, Agatsuma T, Blair D, Vercruysse J, et al. (2007) Molecular confirmation that Fasciola gigantica can undertake aberrant migrations in human hosts. J Clin Microbiol 45: 648–650.
- 9. Mas-Coma MS (1999) Human fasciolosis. In: Dalton JP, editor. Fasciolosis. Oxon, UK: CABI publishing. pp. 411–434.
- 10. World-Health-Organization (2006) Report of the WHO Informal Meeting on Use of Triclabendazole in Fasciolosis Control. Geneva, Switzerland: WHO headquarters. pp. 1–33.
- 11. Itagaki T, Sakaguchi K, Terasaki K, Sasaki O, Yoshihara S, et al. (2009) Occurrence of spermic diploid and aspermic triploid forms of Fasciola in Vietnam and their molecular characterization based on nuclear and mitochondrial DNA. Parasitol Int 58: 81–85.
- 12. Peng M, Ichinomiya M, Ohtori M, Ichikawa M, Shibahara T, et al. (2009) Molecular characterization of Fasciola hepatica, Fasciola gigantica, and aspermic Fasciola sp. in China based on nuclear and mitochondrial DNA. Parasitol Res 105: 9–15.
- 13. Le TH, Van De N, Agatsuma T, Nguyen TGT, Nguyen QD, et al. (2008) Human fascioliasis and the presence of hybrid/introgressed forms of Fasciola hepatica and Fasciola gigantica in Vietnam. Int J Parasitol 38: 725–730.
- 14. Andrews SJ (1999) The Life Cycle of Fasciola hepatica. In: Dalton JP, editor. Fasciolosis. Oxon, UK: CABI publishing. pp. 1–30.
- 15. Behm CA, Sangster NC (1999) Pathology, Pathophysiology and Clinical Aspects. In: Dalton JP, editor. Fasciolosis. Oxon, UK: CABI publishing. pp. 185–224.
- 16. Marcos LA, Terashima A, Gotuzzo E (2008) Update on hepatobiliary flukes: fascioliasis, opisthorchiasis and clonorchiasis. Curr Opin Infect Dis 21: 523–530.
- 17. Stemmermann GN (1953) Human infestation with Fasciola gigantica. Am J Pathol 29: 731–759.
- 18. Marcos LA, Tagle M, Terashima A, Bussalleu A, Ramirez C, et al. (2008) Natural history, clinicoradiologic correlates, and response to triclabendazole in acute massive fascioliasis. Am J Trop Med Hyg 78: 222–227.
- 19. Mas-Coma S, Valero MA, Bargues MD, Rollinson D, Hay SI (2009) Chapter 2: Fasciola, lymnaeids and human fascioliasis, with a global overview on disease transmission, epidemiology, evolutionary genetics, molecular epidemiology and control. Adv Parasitol 69: 41–146.
- 20. Haroun ETM, Hillyer GV (1986) Resistance to fascioliasis — A review. Vet Parasitol 20: 63–93.
- 21. Piedrafita D, Raadsma HW, Prowse R, Spithill TW (2004) Immunology of the host-parasite relationship in fasciolosis (Fasciola hepatica and Fasciola gigantica). Can J Zool-Rev Can Zool 82: 233–250.
- 22. Raadsma HW, Kingsford NM, Suharyanta , Spithill TW, Piedrafita D (2007) Host responses during experimental infection with Fasciola gigantica or Fasciola hepatica in Merino sheep - I. Comparative immunological and plasma biochemical changes during early infection. Vet Parasitol 143: 275–286.
- 23. Raadsma HW, Kingsford NM, Suharyanta , Spithill TW, Piedrafita D (2008) Host responses during experimental infection with Fasciola gigantica and Fasciola hepatica in Merino sheep II. Development of a predictive index for Fasciola gigantica worm burden. Vet Parasitol 154: 250–261.
- 24. Roberts JA, Estuningsih E, Widjayanti S, Wiedosari E, Partoutomo S, et al. (1997) Resistance of Indonesian thin tail sheep against Fasciola gigantica and F. hepatica. Vet Parasitol 68: 69–78.
- 25. Periago MV, Valero MA, Panova M, Mas-Coma S (2006) Phenotypic comparison of allopatric populations of Fasciola hepatica and Fasciola gigantica from European and African bovines using a computer image analysis system (CIAS). Parasitol Res 99: 368–378.
- 26. Mas-Coma S, Bargues MD, Valero MA (2007) Plant-Borne Tremotode Zoonoses: Fascioliasis and Fasciolopsiasis. In: Murrell KD, Fried B, editors. World class parasites, vol 11 Food-borne parasitic zoonoses: fish and plant-borne parasites: Springer, New York. pp. 293–334.
- 27. Bentley DR, Balasubramanian S, Swerdlow HP, Smith GP, Milton J, et al. (2008) Accurate whole human genome sequencing using reversible terminator chemistry. Nature 456: 53–59.
- 28. Eid J, Fehr A, Gray J, Luong K, Lyle J, et al. (2009) Real-time DNA sequencing from single polymerase molecules. Science 323: 133–138.
- 29. Harris TD, Buzby PR, Babcock H, Beer E, Bowers J, et al. (2008) Single-molecule DNA sequencing of a viral genome. Science 320: 106–109.
- 30. Pandey V, Nutter RC, Prediger E (2008) Applied Biosystems SOLiD™ System: Ligation-Based Sequencing. In: Jantz M, editor. Next Generation Genome Sequencing: Towards Personalized Medicine: WIley. pp. 29–41.
- 31. Cantacessi C, Jex AR, Hall RS, Young ND, Campbell BE, et al. (2010) A practical, bioinformatic workflow system for large data sets generated by next-generation sequencing. Nucleic Acids Res 38: e171.
- 32. Brindley PJ, Pearce EJ (2007) Genetic manipulation of schistosomes. Int J Parasitol 37: 465–473.
- 33. Brindley PJ, Mitreva M, Ghedin E, Lustigman S (2009) Helminth genomics: The implications for human health. PLoS Negl Trop Dis 3: e538. doi:10.1371/journal.pntd.0000538.
- 34. Berriman M, Haas BJ, LoVerde PT, Wilson RA, Dillon GP, et al. (2009) The genome of the blood fluke Schistosoma mansoni. Nature 460: 352–358.
- 35. Liu F, Zhou Y, Wang ZQ, Lu G, Zheng H, et al. (2009) The Schistosoma japonicum genome reveals features of host-parasite interplay. Nature 460: 345–351.
- 36. Young ND, Hall RS, Jex AJ, Cantacessi C, Gasser RB (2010) Elucidating the transcriptome of Fasciola hepatica - a key to fundamental and biotechnological discoveries for a neglected parasite. Biotechnol Adv 28: 222–231.
- 37. Young ND, Campbell BE, Hall RS, Jex AJ, Cantacessi C, et al. (2010) Unlocking the transcriptomes of two carcinogenic parasites, Clonorchis sinensis and Opisthorchis viverrini. PLoS Negl Trop Dis 4: e719. doi:10.1371/journal.pntd.0000719.
- 38. Cancela M, Ruetalo N, Dell'Oca N, da Silva E, Smircich P, et al. (2010) Survey of transcripts expressed by the invasive juvenile stage of the liver fluke Fasciola hepatica. BMC Genomics 11: 227.
- 39. Robinson MW, Menon R, Donnelly SM, Dalton JP, Ranganathan S (2009) An integrated transcriptomics and proteomics analysis of the secretome of the helminth pathogen Fasciola hepatica: proteins associated with invasion and infection of the mammalian host. Mol Cell Proteomics 8: 1891–1907.
- 40. Gasser RB, Hu M, Chilton NB, Campbell BE, Jex AJ, et al. (2006) Single-strand conformation polymorphism (SSCP) for the analysis of genetic variation. Nat Protoc 1: 3121–3128.
- 41. Fletcher HL, Hoey EM, Orr N, Trudgett A, Fairweather I, et al. (2004) The occurrence and significance of triploidy in the liver fluke, Fasciola hepatica. Parasitology 128: 69–72.
- 42. Li R, Zhu H, Ruan J, Qian W, Fang X, et al. (2010) De novo assembly of human genomes with massively parallel short read sequencing. Genome Res 20: 265–272.
- 43. Huang X, Madan A (1999) CAP3: A DNA sequence assembly program. Genome Res 9: 868–877.
- 44. Li R, Yu C, Li Y, Lam TW, Yiu SM, et al. (2009) SOAP2: an improved ultrafast tool for short read alignment. Bioinformatics 25: 1966–1967.
- 45. Mortazavi A, Williams BA, McCue K, Schaeffer L, Wold B (2008) Mapping and quantifying mammalian transcriptomes by RNA-Seq. Nat Methods 5: 621–628.
- 46. Zerlotini A, Heiges M, Wang H, Moraes RL, Dominitini AJ, et al. (2009) SchistoDB: a Schistosoma mansoni genome resource. Nucleic Acids Res 37: D579–582.
- 47. Liu F, Chen P, Cui SJ, Wang ZQ, Han ZG (2008) SjTPdb: integrated transcriptome and proteome database and analysis platform for Schistosoma japonicum. BMC Genomics 9: 304.
- 48. Rawlings ND, Barrett AJ, Bateman A (2010) MEROPS: the peptidase database. Nucleic Acids Res 38: D227–233.
- 49. Zdobnov EM, Apweiler R (2001) InterProScan—an integration platform for the signature-recognition methods in InterPro. Bioinformatics 17: 847–848.
- 50. Wu J, Mao X, Cai T, Luo J, Wei L (2006) KOBAS server: a web-based platform for automated annotation and pathway identification. Nucl Acids Res 34: W720–724.
- 51. Letunic I, Yamada T, Kanehisa M, Bork P (2008) iPath: interactive exploration of biochemical pathways and networks. Trends Biochem Sci 33: 101–103.
- 52. Bendtsen JD, Nielsen H, von Heijne G, Brunak S (2004) Improved prediction of signal peptides: SignalP 3.0. J Mol Biol 340: 783–795.
- 53. Krogh A, Larsson B, von Heijne G, Sonnhammer EL (2001) Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. J Mol Biol 305: 567–580.
- 54. Chen Y, Zhang Y, Yin Y, Gao G, Li S, et al. (2005) SPD—a web-based secreted protein database. Nucl Acids Res 33: D169–173.
- 55. Cass CL, Johnson JR, Califf LL, Xu T, Hernandez HJ, et al. (2007) Proteomic analysis of Schistosoma mansoni egg secretions. Mol Biochem Parasitol 155: 84–93.
- 56. Bennuru S, Semnani R, Meng Z, Ribeiro JM, Veenstra TD, et al. (2009) Brugia malayi excreted/secreted proteins at the host/parasite interface: stage- and gender-specific proteomic profiling. PLoS Negl Trop Dis 3: e410. doi:10.1371/journal.pntd.0000410.
- 57. Hewitson JP, Harcus YM, Curwen RS, Dowle AA, Atmadja AK, et al. (2008) The secretome of the filarial parasite, Brugia malayi: proteomic profile of adult excretory-secretory products. Mol Biochem Parasitol 160: 8–21.
- 58. Tielens AG (1999) Metabolism. In: Dalton JP, editor. Fasciolosis. Oxon, UK: CABI publishing. pp. 277–305.
- 59. Brouwers JF, Smeenk IM, van Golde LM, Tielens AG (1997) The incorporation, modification and turnover of fatty acids in adult Schistosoma mansoni. Mol Biochem Parasitol 88: 175–185.
- 60. Manning G, Whyte DB, Martinez R, Hunter T, Sudarsanam S (2002) The protein kinase complement of the human genome. Science 298: 1912–1934.
- 61. Maddika S, Chen J (2009) Protein kinase DYRK2 is a scaffold that facilitates assembly of an E3 ligase. Nat Cell Biol 11: 409–419.
- 62. Gallastegui N, Groll M (2010) The 26S proteasome: assembly and function of a destructive machine. Trends Biochem Sci, in press.
- 63. Caffrey CR, McKerrow JH, Salter JP, Sajid M (2004) Blood 'n' guts: an update on schistosome digestive peptidases. Trends Parasitol 20: 241–248.
- 64. Cancela M, Acosta D, Rinaldi G, Silva E, Duran R, et al. (2008) A distinctive repertoire of cathepsins is expressed by juvenile invasive Fasciola hepatica. Biochimie 90: 1461–1475.
- 65. Robinson MW, Dalton JP, Donnelly S (2008) Helminth pathogen cathepsin proteases: it's a family affair. Trends BiochemSci 33: 601–608.
- 66. Robinson MW, Tort JF, Lowther J, Donnelly SM, Wong E, et al. (2008) Proteomics and phylogenetic analysis of the cathepsin L protease family of the helminth pathogen Fasciola hepatica. Mol Cell Proteomics 7: 1111–1123.
- 67. Grams R, Vichasri-Grams S, Sobhon P, Upatham ES, Viyanant V (2001) Molecular cloning and characterization of cathepsin L encoding genes from Fasciola gigantica. Parasitol Int 50: 105–114.
- 68. Meemon K, Grams R, Vichasri-Grams S, Hofmann A, Korge G, et al. (2004) Molecular cloning and analysis of stage and tissue-specific expression of cathepsin B encoding genes from Fasciola gigantica. Mol Biochem Parasitol 136: 1–10.
- 69. Cho PY, Lee MJ, Kim TI, Kang SY, Hong SJ (2006) Expressed sequence tag analysis of adult Clonorchis sinensis, the Chinese liver fluke. Parasitol Res 99: 602–608.
- 70. Laha T, Pinlaor P, Mulvenna J, Sripa B, Sripa M, et al. (2007) Gene discovery for the carcinogenic human liver fluke, Opisthorchis viverrini. BMC Genomics 8: 189.
- 71. Cho PY, Kim TI, Whang SM, Hong SJ (2008) Gene expression profile of Clonorchis sinensis metacercariae. Parasitol Res 102: 277–282.
- 72. Ju JW, Joo HN, Lee MR, Cho SH, Cheun HI, et al. (2009) Identification of a serodiagnostic antigen, legumain, by immunoproteomic analysis of excretory-secretory products of Clonorchis sinensis adult worms. Proteomics 9: 3066–3078.
- 73. Lee JS, Lee J, Park SJ, Yong TS (2003) Analysis of the genes expressed in Clonorchis sinensis adults using the expressed sequence tag approach. Parasitol Res 91: 283–289.
- 74. Park JK, Kim KH, Kang S, Kim W, Eom KS, et al. (2007) A common origin of complex life cycles in parasitic flatworms: evidence from the complete mitochondrial genome of Microcotyle sebastis (Monogenea: Platyhelminthes). BMC Evol Biol 7: 11.
- 75. Lotfy WM, Brant SV, DeJong RJ, Le TH, Demiaszkiewicz A, et al. (2008) Evolutionary origins, diversification, and biogeography of liver flukes (Digenea, Fasciolidae). Am J Trop Med Hyg 79: 248–255.
- 76. Hernandez-Gonzalez A, Valero ML, del Pino MS, Oleaga A, Siles-Lucas M (2010) Proteomic analysis of in vitro newly excysted juveniles from Fasciola hepatica. Mol Biochem Parasitol 172: 121–128.
- 77. Morphew RM, Wright HA, LaCourse EJ, Woods DJ, Brophy PM (2007) Comparative proteomics of excretory-secretory proteins released by the liver fluke Fasciola hepatica in sheep host bile and during in vitro culture ex host. Mol Cell Proteomics 6: 963–972.
- 78. Cervi L, Rossi G, Masih DT (1999) Potential role for excretory-secretory forms of glutathione-S-transferase (GST) in Fasciola hepatica. Parasitology 119: 627–633.
- 79. Salazar-Calderon M, Martin-Alonso JM, Ruiz de Eguino AD, Parra F (2001) Heterologous expression and functional characterization of thioredoxin from Fasciola hepatica. Parasitol Res 87: 390–395.
- 80. Sekiya M, Mulcahy G, Irwin JA, Stack CM, Donnelly SM, et al. (2006) Biochemical characterisation of the recombinant peroxiredoxin (FhePrx) of the liver fluke, Fasciola hepatica. FEBS Lett 580: 5016–5022.
- 81. Smith RE, Spithill TW, Pike RN, Meeusen ENT, Piedrafita D (2008) Fasciola hepatica and Fasciola gigantica: Cloning and characterisation of 70 kDa heat-shock proteins reveals variation in HSP70 gene expression between parasite species recovered from sheep. Exp Parasitol 118: 536–542.
- 82. Hewitson JP, Grainger JR, Maizels RM (2009) Helminth immunoregulation: the role of parasite secreted proteins in modulating host immunity. Mol Biochem Parasitol 167: 1–11.
- 83. Piedrafita D, Estuningsih E, Pleasance J, Prowse R, Raadsma HW, et al. (2007) Peritoneal lavage cells of Indonesian thin-tail sheep mediate antibody-dependent superoxide radical cytotoxicity in vitro against newly excysted juvenile Fasciola gigantica but not juvenile Fasciola hepatica. Infect Immun 75: 1954–1963.
- 84. Kasˇny` M, Mikeš L, Hampl V, Dvořák J, Caffrey CR, et al. (2009) Chapter 4. Peptidases of trematodes. Adv Parasitol 69: 205–297.
- 85. Smooker PM, Jayaraj R, Pike RN, Spithill TW (2010) Cathepsin B proteases of flukes: the key to facilitating parasite control? Trends Parasitol 26: 506–514.
- 86. Dalton JP, Neill SO, Stack C, Collins P, Walshe A, et al. (2003) Fasciola hepatica cathepsin L-like proteases: biology, function, and potential in the development of first generation liver fluke vaccines. Int J Parasitol 33: 1173–1181.
- 87. Jefferies JR, Campbell AM, van Rossum AJ, Barrett J, Brophy PM (2001) Proteomic analysis of Fasciola hepatica excretory-secretory products. Proteomics 1: 1128–1132.
- 88. Smith AM, Dowd AJ, McGonigle S, Keegan PS, Brennan G, et al. (1993) Purification of a cathepsin L-like proteinase secreted by adult Fasciola hepatica. Mol Biochem Parasitol 62: 1–8.
- 89. Margulies M, Egholm M, Altman WE, Attiya S, Bader JS, et al. (2005) Genome sequencing in microfabricated high-density picolitre reactors. Nature 437: 376–380.
- 90. Itagaki T, Kikawa M, Terasaki K, Shibahara T, Fukuda K (2005) Molecular characterization of parthenogenic Fasciola sp. in Korea on the basis of DNA sequences of ribosomal ITS1 and mitochondrial NDI gene. J Vet Med Sci 67: 1115–1118.
- 91. Geldhof P, Visser A, Clark D, Saunders G, Britton C, et al. (2007) RNA interference in parasitic helminths: current situation, potential pitfalls and future prospects. Parasitology 134: 609–619.
- 92. Kalinna BH, Brindley PJ (2007) Manipulating the manipulators: advances in parasitic helminth transgenesis and RNAi. Trends Parasitol 23: 197–204.
- 93. McGonigle L, Mousley A, Marks NJ, Brennan GP, Dalton JP, et al. (2008) The silencing of cysteine proteases in Fasciola hepatica newly excysted juveniles using RNA interference reduces gut penetration. Int J Parasitol 38: 149–155.
- 94. Rinaldi G, Morales ME, Alrefaei YN, Cancela M, Castillo E, et al. (2009) RNA interference targeting leucine aminopeptidase blocks hatching of Schistosoma mansoni eggs. Mol Biochem Parasitol 167: 118–126.
- 95. Rinaldi G, Morales ME, Cancela M, Castillo E, Brindley PJ, et al. (2008) Development of functional genomic tools in Trematodes: RNA interference and luciferase reporter gene activity in Fasciola hepatica. PLoS Negl Trop Dis 2: e260. doi:10.1371/journal.pntd.0000260.
- 96. Ding J, Su JG, Mansour TE (1994) Cloning and characterization of a cDNA encoding phosphofructokinase from Schistosoma mansoni. Mol Biochem Parasitol 66: 105–110.
- 97. Su JG, Mansour JM, Mansour TE (1996) Purification, kinetics and inhibition by antimonials of recombinant phosphofructokinase from Schistosoma mansoni. Mol Biochem Parasitol 81: 171–178.
- 98. Kuntz AN, Davioud-Charvet E, Sayed AA, Califf LL, Dessolin J, et al. (2007) Thioredoxin glutathione reductase from Schistosoma mansoni: an essential parasite enzyme and a key drug target. PLoS Med 4: e206. doi:10.1371/journal.pmed.0040206.
- 99. Sayed AA, Simeonov A, Thomas CJ, Inglese J, Austin CP, et al. (2008) Identification of oxadiazoles as new drug leads for the control of schistosomiasis. Nat Med 14: 407–412.
- 100. Simeonov A, Jadhav A, Sayed AA, Wang Y, Nelson ME, et al. (2008) Quantitative high-throughput screen identifies inhibitors of the Schistosoma mansoni redox cascade. PLoS Negl Trop Dis 2: e127. doi:10.1371/journal.pntd.0000127.
- 101. Tran MH, Pearson MS, Bethony JM, Smyth DJ, Jones MK, et al. (2006) Tetraspanins on the surface of Schistosoma mansoni are protective antigens against schistosomiasis. Nat Med 12: 835–840.
- 102. Caffrey CR, Rohwer A, Oellien F, Marhöfer RJ, Braschi S, et al. (2009) A comparative chemogenomics strategy to predict potential drug targets in the metazoan pathogen, Schistosoma mansoni. PLoS ONE 4: e4413. doi:10.1371/journal.pone.0004413.
- 103. Crowther GJ, Shanmugam D, Carmona SJ, Doyle MA, Hertz-Fowler C, et al. (2010) Identification of attractive drug targets in neglected-disease pathogens using an in silico approach. PLoS Negl Trop Dis 4: e804. doi:10.1371/journal.pntd.0000804.
- 104. Verjovski-Almeida S, DeMarco R, Martins EA, Guimaraes PE, Ojopi EP, et al. (2003) Transcriptome analysis of the acoelomate human parasite Schistosoma mansoni. Nat Genet 35: 148–157.