This is an uncorrected proof.
Figures
Abstract
Hookworms are intestinal parasitic nematodes that chronically infect ~500 million people. How hookworms successfully overcome host protective mechanisms is unclear, but it may involve hookworm proteins that digest host tissues, or counteract the host’s immune system, or both. To find such proteins in the zoonotic hookworm Ancylostoma ceylanicum, we used mass spectrometry to identify 565 genes encoding excreted-secreted (ES) proteins from adults, and used RNA-seq to identify genes expressed both in young adults (12 days post-infection) and in intestinal and non-intestinal tissues dissected from mature adults (19 days post-infection), infecting hamster hosts that either had normal immune systems or were immunosuppressed by dexamethasone. In adult A. ceylanicum, we observed 1,670 and 1,196 genes with intestine- and non-intestine-biased expression, respectively. Comparing hookworm gene activity in normal versus immunosuppressed hosts, we observed almost no changes of gene activity in 12-day young adults or non-intestinal 19-day adult tissues. However, in intestinal 19-day adult tissues, we observed 1,951 positively immunoregulated genes, and 137 genes that were negatively immunoregulated. Thus, immunoregulation was observed primarily in mature adult hookworm intestine directly exposed to host blood. Of positively immunoregulated intestinal genes, 50.1% (5.3-fold over background) also had male-biased expression, suggesting that male and female A. ceylanicum have different responses to the host immune system. We observed 153 ES genes showing positive immunoregulation in 19-day adult intestine, which disproportionately encoded CAP, ASPR, astacin, TIMP, TIL, ShK, and SCVP proteins, and that were enriched for ES gene orthologs in the dog hookworm Ancylostoma caninum, the human hookworm Necator americanus, or the related sheep parasite Haemonchus contortus. Such a mixture of rapidly evolving and conserved genes could comprise virulence factors enabling infection, provide new targets for vaccines against hookworm, and aid in developing therapies for immune-mediated diseases.
Author summary
Hookworms chronically infect half a billion humans. They do this by partially suppressing their hosts’ immune systems with excreted or secreted (ES) proteins, which makes it difficult to protect against hookworm infections with vaccination or to cure them permanently with drugs. The hookworm Ancylostoma ceylanicum is a good laboratory model for this problem because it naturally infects both humans and other mammals. We used three approaches to define ES proteins of A. ceylanicum that might be crucial for host immunomodulation: we identified A. ceylanicum genes encoding ES proteins; we identified RNA expression levels of A. ceylanicum genes from intestines and non-intestinal tissues of adult hookworms; and we compared gene expression levels of hookworms infecting normal hamster hosts to those infecting hamsters immunosuppressed with dexamethasone. We found 153 genes in A. ceylanicum that encode ES proteins, are expressed in the intestine, and have stronger expression in normal hosts than in immunosuppressed hosts. These genes may be part of a feedback loop, where a hookworm dynamically responds to its host’s immune systems by upregulating these genes, excreting their protein products into their hosts’ bloodstreams, and immunomodulating their hosts. Some have relatives in other parasitic nematodes and may be an evolutionarily conserved set of virulence genes.
Citation: Schwarz EM, Noon JB, Chicca JD, Garceau C, Li H, Antoshechkin I, et al. (2026) Hookworm genes encoding intestinal excreted-secreted proteins are transcriptionally upregulated in response to the host’s immune system. PLoS Negl Trop Dis 20(3): e0014106. https://doi.org/10.1371/journal.pntd.0014106
Editor: Alessandra Morassutti, University of Passo Fundo: Universidade de Passo Fundo, BRAZIL
Received: March 19, 2025; Accepted: March 2, 2026; Published: March 17, 2026
Copyright: © 2026 Schwarz et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: A. ceylanicum transcriptomic data have been archived in the Sequence Read Archive with NCBI BioProject accession PRJNA1045065, and with BioSample accessions SAMN38429478, SAMN38440155, SAMN38440168, SAMN38440169, SAMN38440219, and SAMN38440220. Mass spectrometry proteomics data have been deposited to the ProteomeXchange Consortium via the PRIDE partner repository with the dataset identifiers PXD047871 and PXD047879. Protein-coding gene predictions for A. ceylanicum have been archived at the Open Science Framework (https://osf.io/dxfsb) and at WormBase ParaSite (https://parasite.wormbase.org/Ancylostoma_ceylanicum_prjna231479/Info/Index).
Funding: This work was supported by the National Institutes of Health (NIH)’s National Institute of Allergy and Infectious Diseases (NIAID; https://www.niaid.nih.gov) grants 1R21-AI111173 and R01-AI056189 to R.V.A., by NIH’s Eunice Kennedy Shriver National Institute of Child Health and Human Development (NICHD; https://www.nichd.nih.gov) grant 1R01-HD099072 to R.V.A., by a Fulbright Foundation (https://fulbright.gov.cz/en) fellowship PS00299111 to V.I., and by Cornell University (https://cals.cornell.edu/molecular-biology-genetics) startup funds to E.M.S. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
The hookworms Necator americanus, Ancylostoma duodenale, and Ancylostoma ceylanicum are parasitic nematodes that infect ~500 million human beings, sickening them and lowering their economic productivity [1–3]. Hookworms are remarkably long-lived parasites: an adult hookworm can feed off its human host for up to 18 years [4,5], and infections lasting 1–5 years are common [6]. Hookworms achieve this partly by feeding on the host with enzymes optimized to digest host proteins [7], and partly by suppressing the immune systems of their hosts, which would otherwise kill or expel them quickly [8]. Only one drug, albendazole, is commonly used and effective enough to be useful in mass drug administration against hookworm infections [9,10]. However, human hookworms may be developing genetic resistance to this drug [11–13], as has already happened with dog hookworms [14]. Moreover, even effective use of albendazole does not prevent endemic hookworms from reinfecting people [15]. An anti-hookworm vaccine would be an ideal way to suppress hookworm infections [16], but no such vaccine exists, in part because we do not know which gene products mediating host-parasite interactions should be targeted as antigens [17].
It is not yet certain how hookworms (or any other parasitic nematodes) dampen the host immune system, but many possible effectors of immunomodulation exist: secreted proteases [7,18,19]; secreted protease inhibitors [20–22]; large gene families encoding diverse secreted proteins as antigenic decoys [23,24]; mimics of mammalian immune proteins [25,26]; lipid immunomodulators [27–29]; and secreted exosomes with anti-host miRNAs [30,31]. These mechanisms are by no means exclusive; some or all of them could be working at once. Such complexity may have evolved over 350 million years, when vertebrates first colonized the land and became vulnerable to parasitic nematodes [32–34]. Ignorance of how hookworms immunosuppress their hosts makes it challenging to devise vaccines against hookworm disease. It also hinders the possible use of hookworms as sources of new biological reagents against immune-mediated diseases [35,36], which appear to become more frequent with lower rates of helminth infection [37].
Hookworms and other parasitic helminths have long been known to excrete or secrete proteins into their hosts. These excreted-secreted (ES) proteins have been hypothesized, and in some cases demonstrated, to enable immunosuppression and virulence [8,38–41]. ES proteins can be exported from cephalic/pharyngeal glands, intestine, or cuticle [39,42,43]. Genes encoding ES proteins have been identified in the major human hookworm N. americanus [44], the dog hookworm Ancylostoma caninum [45], and five strongylid parasitic nematodes closely related to hookworm: Angiostrongylus vasorum [46], Haemonchus contortus [47], Heligmosomoides bakeri [48–50], Nippostrongylus brasiliensis [51,52], and Ostertagia ostertagi [53] (Fig 1). Until recently [54,55], ES proteins had not been identified for the zoonotic human hookworm A. ceylanicum, which is an important zoonotic pathogen of humans and companion animals and which can complete its lifecycle in hamsters, making it a key hookworm species for laboratory studies [2,56–58].
(A) Female of A. ceylanicum by 12 days after infection of a hamster host; scale bar, 500 μm. At this stage of development, the hookworms are young adults that have only just begun blood feeding, with mature males and a few gravid females, and with little or no egg laying. (B) Female of A. ceylanicum by 19 days after infection of a hamster host; scale bar, 1 mm. At this stage of development, the hookworms are fully mature adults that have been blood-feeding for at least a week, have mated, and have begun extensive egg laying that can last for weeks in a hamster host. Both photographs are reproduced from Schwarz et al. [59]. (C) Evolutionary relationships of the hookworm A. ceylanicum to some other nematodes discussed in this paper. This phylogeny links A. ceylanicum to other Ancylostoma and Necator hookworms, to more distantly related strongylid parasitic nematodes, and to two well-studied free-living rhabditid nematodes (C. elegans and Pristionchus pacificus). Phylogenetic data are taken from Coghlan et al., van Megen et al., and Xie et al. [185,246,247]. A. ceylanicum is highlighted in gold; three parasitic species (A. caninum, N. americanus, and H. contortus) whose ES genes were compared to those of A. ceylanicum in this paper are highlighted in red. Hookworms form a single clade (marked with a yellow bar); they are a subset of a larger clade of strongylid nematodes (marked with a green bar) which, like hookworms, are parasitic. Strongylids are, in turn, a subset of rhabditid nematodes (marked with a blue bar); this clade encompasses both parasitic nematodes and the free-living nematodes C. elegans and P. pacificus. Notably, A. ceylanicum and other strongylid parasites are more closely related to C. elegans than to P. pacificus despite having highly divergent parasitic life cycles.
To find A. ceylanicum genes whose products may enable parasitism, we have identified A. ceylanicum ES proteins, while also using RNA-seq to identify A. ceylanicum genes whose products interact with the host either through intestine-biased expression or through upregulation in response to a functioning host immune system. For the latter set of genes, we hypothesized that the parasite transcriptionally activates genes in response to the host immune system in order to neutralize the host immune response. Previously, we found that changes of A. ceylanicum gene activity during infection in vivo are much more extensive than changes seen during simulated infection in vitro [59]. Here we build on that observation by correlating ES genes with intestinal and immunoregulated genes (Fig 2). We identify genes encoding immunoregulated intestinal ES proteins that may be important for virulence or immunosuppression, that are new targets for vaccines against hookworm, and that may aid in the development of therapies against immune-mediated diseases.
(A) Gene prediction. Because optimal analysis of RNA-seq and proteomic data in A. ceylanicum requires gene predictions with the highest possible accuracy, we used published A. ceylanicum genomic sequence and RNA-seq data to repredict a protein-coding gene set (v2.0). Because we observed that v2.0 omitted some genes for which we had ES proteomic data, we merged non-redundant members of v1.0 to v2.0 to produce a final set of protein-coding gene predictions (v2.1). We then annotated the v2.1 genes with Phobius-predicted signal and transmembrane sequences, Pfam protein domains, orthologies to genes in other parasitic nematodes (e.g., ES protein-coding genes) and other domains/terms so that functions overrepresented by ES protein-coding or differentially expressed genes could be identified. (B) RNA-seq analysis. From either normal or dexamethasone-treated (immunosuppressed) hamster hosts, we isolated A. ceylanicum larvae 12 days after infection and A. ceylanicum adults 19 days after infection, with three biological replicates per condition (hookworm life stage and hamster status). For 19-day adults, we dissected intestines from non-intestinal tissues (12-day larvae being too small to dissect). We did RNA-seq analysis on all replicates, mapping RNA-seq reads to our v2.1 gene set. In addition, we did RNA-seq analysis on key previously published A. ceylanicum data (adult males versus adult females) to allow male-biased and female-biased genes to be identified in our results. We heatmapped all RNA-seq data, identified one pair of replicates (adult intestine, from normal versus dexamethasone-treated hamsters) that was likely to have been label-swapped, and removed them before statistical analysis of RNA-seq gene expression differences. (C) ES protein identification. We grew two sets of A. ceylanicum in normal hamster hosts to adulthood (20 days after infection), removed them from their hosts, and collected two sets of excreted/secreted (ES) proteins. We performed mass spectrometry on the two ES collections, mapped the spectra to v2.1 genes, and identified ES protein-coding genes. (D) Analysis of integrated ES and RNA-seq results. We identified statistically significant overlaps between gene categories (ES genes, differentially expressed genes, genes encoding protein domains) and biologically interpreted these results.
Results
Improved A. ceylanicum gene predictions
Analyzing gene function depends crucially on the quality of gene predictions [60,61]. We thus repredicted A. ceylanicum protein-coding genes with BRAKER2 [62], merging these repredictions (“version 2.0” or “v2.0”) with our original gene predictions (“version 1.0” or “v1.0”) to produce a hybrid v2.1 set of 33,190 protein-coding genes. Recently, Uzoechi et al. have also repredicted A. ceylanicum genes using methods similar to ours [54]; their new gene set is also substantially better than earlier predictions (94.5% completeness), though less complete than our v2.0 or v2.1 gene sets (Table 1). Having predicted the v2.1 gene set, we annotated its predicted protein products with predicted N-terminal signal sequences [63], conserved protein domains [64,65], orthologies to genes in related nematode species such as N. americanus [66], and Gene Ontology (GO) terms describing biological and molecular functions [67,68]; annotations are listed in S1 Table.
Identifying ES proteins
To collect ES proteins from adult A. ceylanicum hookworms, we isolated hookworms from hamster hosts 20 days after infection in two independent experiments, incubated the hookworms in protein-free culture medium for three days, concentrated and purified their supernatants, subjected their proteins to mass spectrometry, and mapped the resulting protein spectra to a nonredundant set of protein sequences. By this means, we identified 565 genes encoding ES proteins observed in at least one ES collection (1.7% of all genes), and 350 genes encoding ES proteins in both independent collections (1.05% of all genes; S1 and S2 Tables). Uzoechi et al. and Wong et al. have also identified genes encoding ES proteins from adult A. ceylanicum [54,55]; mapping their ES genes onto our v2.1 gene set, we find 955 ES genes from their analysis (2.9% of all genes), of which 444 are identical to ES genes from ours (1.3% of all genes; 79% of our ES genes). This overlap is 27-fold greater than chance (two-tailed Fisher test, p-value = 0; S3 Table), and shows high reproduciblity of ES genes in A. ceylanicum. Conversely, each ES gene set has unique members; they collectively have 1,076 genes (S2 Table).
Out of the 565 ES genes identified, we consider two subsets for detailed analysis here: either categories of ES genes for which we can show statistically significantly enhanced overlaps with other gene categories of interest (S3 Table), or individual instances of ES genes for which we have found specific published information in the literature which indicates that an individual ES gene may have a biologically informative function or effect.
A. ceylanicum ES genes encode possible host-parasite interaction proteins
Two-thirds of our ES genes (380) were predicted to encode classically secreted proteins with N-terminal signal sequences, 4.6-fold above genome-wide background (q = 4.5•10-181); the remaining one-third of ES genes (185) encode proteins that are presumably non-classically secreted (Tables 2 and S3) [69]. This mix of predominant but not universal N-terminal signals also exists in ES proteins from A. caninum, N. americanus, and H. contortus (S4-S6 Tables) [44,45,47], and may reflect nonclassical secretion of ES proteins through extracellular vesicles [70,71]. For ES genes that appear to encode non-classically secreted proteins, an alternative possibility remains that these genes have 5’ exons that encode signal sequences but that have been overlooked. We have no way to completely exclude this possibility without extensive manual curation of their gene structures [72,73]. However, there are two lines of evidence that incline us to think that many, if not all, predictions of non-classical ES protein secretion are correct. First, the gene predictions we used in this study were based on extensive RNA-seq data sets [59,74,75] which drove BUSCO completeness from 87.8% to 95.1% (Table 1), and thus are likely to have detected most 5’ exons. Second, noticeable fractions of ES protein-coding genes without known N-terminal signal sequences are observable not merely in our own data for A. ceylanicum (33% of 565 ES genes without a predicted signal sequence), but also for ES protein-coding genes in three other strongylids: A. caninum (63% of 315 ES genes), N. americanus (53% of 197 ES genes), and H. contortus (40% of 844 ES genes; all three ES data sets are analyzed in S5 Table). So, if the apparent non-classical secretion of ES proteins is an artifact due to exon mispredictions, this artifact is pervasive not only for our own protein-coding gene predictions of A. ceylanicum, but also for gene predictions in three other parasitic nematode genomes.
Compared to the whole A. ceylanicum proteome, the 565 ES genes disproportionately encoded several protein families with plausible roles in host-parasite interactions (Tables 2 and S3 and Fig 3) [40]. These included five types of proteases (aspartyl, astacin, cysteine, serine, and metallopeptidase) that may digest host tissues and blood [7,18,76–78], along with three types of protease inhibitors (Kunitz, TIL, and TIMP) that may protect A. ceylanicum from native or host proteases; both proteases and protease inhibitors may also counteract host immune responses [79,80]. Five ES genes encoded glutathione S-transferase, an enzyme thought to detoxify free heme generated during hemoglobin proteolysis and other toxins [40,81]. Sixteen ES genes encoded Stichodactyla helianthus toxin (ShK)-related proteins, which might suppress host T cells [82,83]; and nine ES genes encoded C-type lectin proteins, which might mimic mammalian immune proteins (enabling immunosuppression) or dissociate cells (enabling tissue invasion) [25,26,59,84,85].
This UpSet plot [240] shows, for the set of 565 ES genes, occurrences of 16 protein domains from the Pfam database that are overrrepresented in that set (Table 2). The x-axis lists individual protein domains, with the numbers of ES genes that encode them; abbreviations for protein domains are given in Table 2. The y-axis lists categories of ES genes that encode a particular set of overrepresented protein domains, with the numbers of ES genes in each category. Dots in the matrix show each category’s encoded domain or domains. Although most ES genes only encode one overrepresented protein domain, instances encoding two domains do exist, and are shown by connecting two dots with a line. Not shown are an additional 242 ES genes that lacked any of these 16 overrepresented Pfam domains. Most ES genes encode only a single protein domain, and all domains are unique to some ES genes; for instance, 109 ES genes encode only CAP domains while 22 ES genes encode only ASPR domains. A minority of ES genes encode two domains at once: for instance, 29 ES genes encode both CAP and ASPR domains, and four ES genes encode both ShK and either Astacin or CAP domains.
One quarter of the 565 ES genes (140) encoded secreted venom-like allergen proteins with CAP or SCP/TAPS domains [24]; in hookworms, these are called Ancylostoma secreted proteins, activation-associated secreted proteins, or ASPs [24,86,87]. One ASP, neutrophil inhibitory factor in A. caninum (Acan-NIF), has been experimentally shown to block neutrophil adhesion [88,89]; another ASP, hookworm platelet inhibitor in A. caninum (Acan-HPI), has been shown to prevent blood clotting [90]. We observed ES genes encoding orthologs of both these ASPs (Acey-NIF-B and Acey-HPI) which may themselves be immunosuppressant and antithrombotic proteins (S1 Table). However, this leaves the other 138 ES ASP genes with only conjectural functions.
ES genes also encode three other multigene families of secreted proteins that are suspected to be involved in infection, although their molecular roles remain largely uncharacterized: ASP-related (ASPR), transthyretin-like (TTL), and secreted clade V proteins (SCVP). ASPRs were first observed in A. ceylanicum as a divergent subfamily of ASPs transcriptionally upregulated immediately after host infection [59], and were later observed in ES proteins from N. americanus [44]. TTL proteins were previously observed as a major component of ES proteins from H. contortus [47] and as multigene families in several parasitic nematodes [91–93]; one TTL protein of H. contortus (HcTTR) antagonizes goat IL-4 in vitro, and thus may be immunosuppressive in vivo [94]. SCVPs were observed in A. ceylanicum as a novel family of secreted proteins upregulated in young hookworm adults that is conserved in C. elegans and expanded in hookworms and other strongylids [59].
Although we have focused on analyzing our own ES gene set, we have also identified statistically enriched protein domains for the 955 adult ES genes identified by Uzoechi et al. and Wong et al. (both involving Washington University, and thus collectively labeled “WashU”) and for their 511 ES genes not found in our data set (“WashU-only”; S3 Table). Domains preferentially enriched in WashU-only ES genes include aminopeptidases, fatty acid- and retinol-binding (FAR) proteins, SXP/RAL-2 proteins, and C-type lectins (Table 3). Aminopeptidases may cooperate with the five other protease types noted above to enable digestion and immunosuppression. FAR proteins are a nematode-specific protein family that may suppress immune responses in plant or animal hosts by sequestering lipid signaling molecules [95–97]. The SXP/RAL-2 protein family includes homologs of an immunodominant hypodermal antigen found in several parasitic nematodes: Ac16, in the dog hookworm A. caninum [98]; both As16 and As14, in the roundworm Ascaris suum [99–101]; Ani s 5 in the ascaridoid Anisakis simplex [102]; and Ov-RAL-2 in the filarial parasite Onchocerca volvulus [103]. Among the C-type lectins enriched in WashU-only ES genes, we observed two genes encoding structural mimics of mammalian mannose receptor (Table 4) [59]; in mammals, this receptor binds soluble secreted molecules from the whipworm Trichuris suis and from A. suum, and these A. ceylanicum mimics might thus be immunomodulatory [104]. Conversely, domains preferentially enriched in our ES genes included CAP, ASPR, cysteine protease, astacin, Kunitz protease inhibitor, ShK, TTL, and SCVP domains (Table 3).
Other ES genes we observed were potentially relevant to infection despite not being from overrepresented families (Table 4). One encodes a macin homolog that may protect hookworms from being infected by bacteria, and thus enable their long-term residence in the human gut [105,106]. Three ES homologs of A. caninum anticoagulant protein [107] could act with Acey-HPI to prevent clotting during blood feeding. Seven other ES genes encode possible immunosuppressants: two ES apyrases and one ES adenosine deaminase could suppress both blood clotting and inflammation by hydrolyzing extracellular ATP [108–110]; phospholipases A2 and D, like FAR proteins, could suppress lipid signals in immune responses [111]; the ES macrophage migration inhibitory factor Acey-MIF-1, previously observed to bind the human MIF receptor CD74 and promote monocyte migration, could alter cellular immune responses [112,113]; and an ES deoxyribonuclease II could hydrolyze extracellular DNA nets secreted by neutrophils to trap and kill large pathogens [114].
Because ES proteins are exposed to the human host’s immune system and are likely to promote infection, ES proteins are promising targets for an anti-hookworm vaccine. Twelve ES genes encoded proteins that have already been shown to give partial protection as vaccines in laboratory mammals, or to be homologous to such proteins (Table 4): the cysteine protease Acey-CP-1 [115]; the M13 peptidases Acey-MEP-6 and Acey-MEP-7 [116,117]; the glutathione S-transferase Acey-GST-1, homologous to A. caninum’s Acan-GST-1 [118] and N. americanus’ Na-GST-1 [119]; a homolog of the immunodominant hypodermal Ac-16 antigen of A. caninum [98]; two homologs of apyrases in Teledorsagia circumcincta and H. bakeri [120,121]; two homologs of fatty acid- and retinol-binding proteins in A. ceylanicum [122] and Onchocerca volvulus [123]; one homolog of Brugia malayi calreticulin [124]; one homolog of Onchocerca volvulus fructose-1,6-bisphosphate aldolase [125]; and one homolog to A. suum enolase [126]. In addition to these strict orthologs, three instances of astacin, ASP, and TTL proteins (all overrepresented among ES proteins) have given partial protection as vaccines: Ac-MTP-1 of A. caninum, Acan-ASP-2 of A. caninum, and HcTTR of H. contortus [127–129]. Finally, two WashU-only ES genes encoded homologs of an extracellular Argonaute (exWAGO) protein in H. bakeri that binds small RNAs, is imported by mouse cells during infection with possible immunomodulatory effects, and confers partial protection as a vaccine in mice [130]; a third WashU-only ES gene encoded the aspartic protease Acey-APR-1, homologous to the vaccine subunit Na-APR-1 from N. americanus [131]. These results show the vaccine potential of ES proteins, with hundreds of other ES proteins remaining untested as vaccine subunits.
Over half of A. ceylanicum ES genes are functionally conserved in related blood-feeding parasitic nematodes: they do not merely have orthologs, but ES gene orthologs (Tables 5, S1, and S7). Out of 565 A. ceylanicum ES genes, 269 (47.6%) had ES gene orthologs in the closely related hookworm A. caninum [45], 21-fold more than expected by chance (p = 6.82•10-301). For the more distantly related hookworm N. americanus [44], 141/565 ES genes (25.0%) had ES gene orthologs (18-fold over chance; p = 1.78•10-137); for the still more distantly related blood-feeding H. contortus [47], 180/565 ES genes (31.9%) had ES gene orthologs (8.9-fold over chance, p = 4.10•10-120). Considering all three species at once, 350/565 ES genes (61.9%) had some ES ortholog (12-fold over chance, p = 0). We found similarly high conservation for the 860 ES genes described by Uzoechi et al. (S7 Table), and overrepresented ES gene orthologies between A. ceylanicum and N. americanus were also observed by Uzoechi et al. [54]. These orthologies of A. ceylanicum ES genes in other species are predominantly to ES genes themselves (i.e., genes experimentally shown to encode ES proteins in these other species). When ES genes in other species are set aside, the remaining orthologies of A. ceylanicum ES genes fall to, or even below, background frequencies. For A. ceylanicum ES genes without known ES gene orthologs, orthology to A. caninum dropped to 1.6-fold over chance (p = 1.31•10-29); in N. americanus, to 1.1-fold below chance (p = 0.22); and in H. contortus, to 2.4-fold below chance (p = 2.05•10-30). Thus, the A. ceylanicum ES gene set predominantly encodes ES proteins not only in A. ceylanicum itself but also in related parasitic nematodes.
Identification of intestinal and immunoregulated genes
Because the hookworm intestine is an important source of ES proteins, we set out to characterize intestine- and non-intestine-biased expression of hookworm genes. We infected hamsters with A. ceylanicum hookworms, allowed infections to proceed for either 12 or 19 days, collected hookworms from hamster small intestines, extracted hookworm RNAs, and performed RNA-seq with 36.9 to 55.8 million reads per biological replicate (S8 Table). Hookworms at the 12-day infection stage were young adults that were too small for us to dissect further, so RNA-seq was done on whole 12-day hookworms. By 19 days, hookworms had grown into mature adults large enough to dissect, so we separated them into intestinal and non-intestinal tissues before RNA-seq. In addition, we hypothesized that hookworms would respond to the mammalian host’s immune system by increasing their expression of proteins that counteract the mammalian immune system. To detect hookworm genes regulated by the state of the host’s immune system, 12-day and 19-day infections were performed both in normal hamster hosts (nonDEX) and in hamsters whose immune systems had been suppressed by dexamethasone (DEX). We analyzed levels and changes of gene expression both for our RNA-seq data and for published A. ceylanicum RNA-seq data from adult male intestine, male adults, and female adults (Tables 6 and S1 and Figs 4-5). We defined a gene as significantly changing its activity between two biological conditions (e.g., intestinal versus non-intestinal 19-day tissues, or intestinal nonDEX versus intestinal DEX) if the gene changed its expression by at least 2-fold (log2FC ≤ -1 or ≥ 1) with a false discovery rate (p-value corrected for multiple hypothesis testing) of no more than 0.01 (FDR ≤ 0.01). For immunoregulated genes, it is important to note that upregulation of genes in a normal host is equivalent to downregulation of genes in a dexamethasone-immunosuppressed host; we have defined such genes as positively immunoregulated (and their reverse as negatively immunoregulated), but their regulation was observed by virtue of downregulation with dexamethasone.
Gene activity is shown for 8,130 A. ceylanicum genes with significant differences in expression between biological conditions, with biological replicates used for differential gene expression analysis on the x-axis and individual genes on the y-axis. Expression levels are in TPM (log10). Genes sharing similar patterns of expression are split into 10 clusters. Biological replicates are: young 12-day hookworm adults from normal hosts (YA_12dpi_noDEX); young 12-day adults from immunosuppressed hosts (YA_12dpi_DEX); dissected 19-day intestines from mature hookworm adults in normal hosts (intestines_19dpi_noDEX); dissected 19-day intestines in immunosuppressed hosts (intestines_19dpi_DEX); dissected 19-day non-intestinal tissues in normal hosts (non.intestines_19dpi_noDEX); dissected 19-day non-intestinal tissues in immunosuppressed hosts (non.intestines_19dpi_DEX); adult male intestine (WashU_male_intestine); adult males (WashU_adult_male); and adult females (WashU_adult_fem). On the x-axis, biological replicates are labeled as ‘condition_X’ where X is ‘01’ through ‘04’. The adult male intestine, adult male, and adult female data in Fig 4 is a reanalysis of A. ceylanicum RNA-seq data previously published by Wei et al. [74] and Bernot et al. [75]. Additional heatmaps of gene expression for all 33,190 genes and all replicates are given in S1 Fig.
This UpSet plot shows the 12 most abundant overlaps of differential gene expression types (Table 6). The x-axis lists types of differential gene expression, and the numbers of genes showing them. Types are abbreviated as follows: “Female”, female-biased, determined from previously published data; “Intest”, intestine-biased, 19-day; “Male”, male-biased, determined from previously published data; “N_12d_imm”, 12-day negatively immunoregulated; “N_int_imm”, 19-day negatively immunoregulated intestinal; “N_nin_imm”, 19-day negatively immunoregulated non-intestinal; “Non_int”, non-intestine-biased, 19-day; “P_int_imm”, 19-day positively immunoregulated intestinal; and “P_nin_imm”, 19-day positively immunoregulated non-intestinal. The y-axis lists categories of genes that have a particular combination of differential gene activities, with their gene numbers; dots in the matrix show each category’s type or types of expression patterns. Although many genes have only one type of differential expression, instances with two different types do exist and are shown by connecting two dots with a line. Not shown are an additional 25,302 genes that did not show any types of differential gene expression. The most common instance of genes with two differential expression patterns is a set of 957 genes with both male-biased and positively immunoregulated intestinal expression. The next instance is a set of 225 genes with both positively immunoregulated intestinal and non-intestine-biased expression; although they are immunologically upregulated in the intestine, they are more strongly expressed in non-intestinal tissues.
By these criteria, in 19-day adults we observed 1,670 genes with intestine-biased expression and 1,196 with non-intestine-biased expression, the remaining 30,324 genes being intermediate (Table 6 and Figs 4 and 6). Of the intestine-biased genes, 723 (43.3%) were orthologous to H. contortus genes with intestine-biased expression (4.4-fold above background; p = 8.18•10-306; S9 Table). For comparison, when we reanalyzed published RNA-seq data for male intestine versus whole male adults [74,75] as part of the same computation, we identified 635 genes with intestine-biased expression, of which only 180 were also found in our intestine-biased gene set (S9 Table). By chance alone, one would expect to observe 32 genes shared by both intestinal sets; the observed overlap was 5.6-fold above background (p = 9.93•10-85) but limited (only 28% of the 635 genes were shared). The observation of over 700 genes with intestine-biased expression conserved between strongylids supports our larger gene set. The disagreement of these intestine-biased gene sets could have one or more causes: different protocols for dissection and RNA extraction; different background tissues for comparison to intestinal RNA-seq (non-intestinal here, versus whole adults in the previous data); different sexes (mixed-sex here, versus males previously); and different strains of A. ceylanicum (HY135 here, versus Indian previously), which could cause different rates of RNA-seq read mapping onto our HY135 reference genome (S10 Table) [132–134].
This volcano plot [242] compares the expression of A. ceylanicum genes (shown as dots) in 19-day (adult) intestine versus 19-day (adult) non-intestine, with gene expression being assayed by RNA-seq of dissected intestines versus dissected non-intestinal tissues. For each gene, the x-axis shows the fold change (i.e., ratio) of its expression levels in intestine versus non-intestine; intestine-biased genes are positive and non-intestine-biased genes are negative. The y-axis shows the statistical significance (corrected for multiple hypothesis testing as a false discovery ratio, or FDR) of each gene’s fold change. Both axes have logarithmic scales (log2 for fold changes on the x-axis, -log10 for FDRs on the y-axis). In our analysis, a gene is considered to be significantly intestine-biased if its fold change is ≥ 2 with an FDR of ≤0.01, and significantly non-intestinally biased if its ratio is ≤ 0.5 with an FDR of ≤0.01; dashed lines are shown for these two criteria. Genes encoding ES proteins are plotted as red dots. Other non-ES genes that are intestine-biased are plotted as gold dots; non-intestine-biased ones are plotted as green dots; and non-biased ones are plotted as dark gray dots. ES genes are found in all expression categories, including genes with non-intestinally biased expression; this is consistent with previous observations that ES proteins can be secreted not only by intestine, but also by cephalic/pharyngeal glands or cuticle [39,42,43].
When comparing A. ceylanicum in normal (nonDEX) versus immunosuppressed (DEX) hosts, we detected substantial changes of gene expression in 19-day hookworm intestinal tissues, but almost no changes of gene expression either in 12-day young adults or in 19-day non-intestinal tissues. We observed 1,951 genes upregulated in 19-day nonDEX intestine versus 19-day DEX intestine (5.9% of all genes), along with 137 downregulated genes (0.41%; Table 6 and Fig 7). In contrast, we observed only 32 genes positively or negatively immunoregulated either in 12-day young adults or in 19-day non-intestinal tissues (Table 6 and Fig 8). One explanation for this pattern is that only 19-day intestinal tissues are directly exposed to the host’s immune system through blood feeding. Although 12-day young adult hookworms inhabit the small intestine, they have only just started feeding on blood; up to that point, they instead probably eat mucosal cells [135,136]. By 19 days, hookworm adults have been feeding on blood for up to a week; however, most of their bodily contact with this blood (and, thus, with the host’s immune system) is through the lumen of their intestine (although cephalic or pharyngeal glands may interact with their host as well).
This volcano plot compares the expression of A. ceylanicum genes assayed by RNA-seq of 19-day (adult) intestines from hookworms in normal hamster hosts (with normal immune systems) versus hookworms grown in dexamethasone-treated (immunosuppressed) hamster hosts. Aside from different conditions being compared, the x-axis, y-axis, and criteria for significance are as in Fig 6. Genes encoding ES proteins are plotted as red dots. Other non-ES genes that are positively immunoregulated in A. ceylanicum intestines are plotted as gold dots; negatively immunoregulated intestinal ones are plotted as green dots; and non-biased ones are plotted as dark gray dots. ES genes are prominently visible among genes with positively immunoregulated intestinal expression.
This volcano plot compares the expression of A. ceylanicum genes assayed by RNA-seq of 19-day (adult) non-intestinal tissues from hookworms in normal hamster hosts (with normal immune systems) versus hookworms grown in dexamethasone-treated (immunosuppressed) hamster hosts. Aside from different conditions being compared, the x-axis, y-axis, and criteria for significance are as in Fig 6. Genes encoding ES proteins are plotted as red dots. Other non-ES genes that are positively immunoregulated in A. ceylanicum non-intestinal tissues are plotted as gold dots; negatively immunoregulated non-intestinal ones are plotted as green dots; and non-biased ones are plotted as dark gray dots. Although their numbers are much smaller, a visible subset of ES genes also have positively immunoregulated non-intestinal expression.
Because nematodes lack corticosteroid receptor homologs [137], our analysis assumes that dexamethasone acts solely on the host immune system, with no direct effects on A. ceylanicum. However, when free-living larvae of the intermittently parasitic nematode Strongyloides stercoralis were treated in culture with dexamethasone by Rodpai et al., 433 S. stercoralis genes showed altered RNA expression [138]. Moreover, it has been speculated that dexamethasone administered to S. stercoralis-infected mammals may act directly on the nematodes to make them hyperinfective [139,140], perhaps through binding the S. stercoralis ecdysone receptor homolog [141]. Thus, it is possible that dexamethasone causes at least part of its transcriptional effects in A. ceylanicum by acting directly on the hookworm. That being said, some differences between S. stercoralis and A. ceylanicum may make this possibility less likely. First, the drug dosage applied to S. stercoralis by Rodpai et al. was ~ 3.6 mg of dexamethasone per 25 mg of agar, whereas the drug dosage in our experiments was 48-fold lower (3.0 mg of dexamethasone/ 1,000 mg of hamster). Second, dexamethasone has been shown through heterologous glucocorticoid-responsive transgenes to reach and affect all parts of the nematode body when fed to C. elegans [142]; in contrast, the transcriptional response of A. ceylanicum to dexamethasone treatment is almost entirely confined to its intestine. Thus, although we cannot rule out the possibility of direct transcriptional regulation by dexamethasone, we think the most likely mechanism of dexamethasone action on hookworms is by altering the immune system of their hosts.
Intestine-biased genes encode likely components of food digestion, detoxification, and absorption
Whereas ES genes overwhelmingly encoded proteins with N-terminal signal sequences alone, intestine-biased genes were modestly but significantly enriched for several predicted types of secreted or transmembrane proteins (Tables 7 and S9). These included secreted proteins, proteins predicted to have one transmembrane anchor sequence after the N-terminal signal peptide, and multi-transmembrane proteins. Intestinal genes disproportionately encoded protein families relevant to digesting, detoxifying, and absorbing food (Table 7 and Fig 9): aspartyl, cysteine, aminopeptidase, and metallopeptidase proteases [7,18,76–78]; UDP-glucoronosyl or UDP-glucosyl transferases, ecdysteroid kinase-like enzymes, and ABC transporters [81,143,144]; and major facilitator and sugar transporters [145,146]. Intestinal genes also disproportionately encoded histones; this raises the question of whether mitosis persists in the adult hookworm intestine, as has been observed or inferred for A. suum and H. bakeri [147–149]. In addition to these well-studied protein families, another notably overrepresented family was Strongylid L4 Proteins (SL4Ps), first observed in A. ceylanicum as a novel family of non-classically secreted proteins upregulated in fourth-stage (L4) larvae [59]. Analyzing previously published H. contortus intestinal RNA-seq data (S11 Table), we observed that SL4P genes were also disproportionately represented among intestine-biased H. contortus genes, along with several of the better-characterized protein families implicated in digestion, detoxification, or absorption (e.g., cysteine proteases, UDP-glucoronosyl/glucosyl transferases, and major facilitator transporters; S12 Table). Moreover, the C. elegans genes numr-1 and numr-2 encode SL4P proteins that are intestinally expressed [150]. We conclude that SL4Ps have conserved intestinal expression in hookworms and other nematodes, and that (as in C. elegans) their function might be to counteract toxic environmental stresses [151,152].
This UpSet plot shows, for the set of 1,670 intestine-biased genes, occurrences of 17 Pfam domains overrrepresented in that set (Table 7). The x-axis, y-axis, and matrix are as in Fig 3, except that the genes being considered are intestine-biased rather than ES protein-coding. Abbreviations for Pfam domains are given in Table 7. Not shown are an additional 1,451 intestine-biased genes that lacked any of these 17 overrepresented Pfam domains. Although most domains are solitary, three domains that may be associated with detoxification (APH, DUF1679, and EcKL) are encoded in 13 intestinally biased genes. Another 13 intestinally biased genes encode two or three histone-related domains (Histone, H2A_C, and CBFD), which might be associated with mitotic cells in the intestine of adult parasitic nematodes [147–149].
Immunoregulated genes encode signal transduction, male-associated, host-parasite, and ES proteins
The 1,951 positively immunoregulated intestinal genes were modestly, but significantly, enriched for encoding secreted proteins, proteins with a single transmembrane anchor, or both: in other words, possible secreted or cell-surface proteins (Table 8). Overrepresented protein families included possible signal transduction components: protein kinases, protein phosphatases, and SH2 scaffold proteins [153–155]. Other overrepresented families included homologs of nematode motile sperm proteins (MSPs) [156], along with two other classes of proteins associated with MSPs: MSP fiber 2 proteins (MFP2s) [157] and DUF236 proteins [158]. Although MSPs are indeed hallmark proteins of nematode sperm cells and are usually assumed to be entirely specific to sperm, there are in fact MSPs expressed in somatic cells and required for their mobility. Two different MSP homologs in Caenorhabditis elegans have cell and axonal migration RNAi phenotypes in male linker cells and hermaphroditic neurons, with one of them being expressed in linker cells [159,160]. Moreover, msp genes are expressed in C. elegans ADL chemosensory neurons [161], and are evolutionarily retained even in parthenogenetic nematode species lacking sperm [162]. Thus, nematodes express some msp genes somatically, which fits our observation here of msp, mfp2, and duf236 gene expression in dissected intestines of adult A. ceylanicum. In addition, positively immunoregulated intestinal genes disproportionately encoded protein families with possible roles in host-parasite interactions. These families included: astacin, leishmanolysin, metallopeptidase, and trypsin-like proteases; TIMP protease inhibitors; ShK-related proteins; ASPs and ASPRs; SCVPs; and immunoglobulin domain-containing proteins (Table 8). While most of these families were also enriched among ES genes, leishmanolysins, trypsins, and immunoglobulin domains were uniquely enriched here. Despite being expressed intestinally, only one of these immunoregulated genes had intestine-biased expression; instead, 258 were non-intestine-biased (3.67-fold over chance; p = 5.51•10-78; S13 Table).
Genes with types of immunoregulation other than positive intestinal had fewer notable protein families (S13 Table). The 137 negatively immunoregulated intestinal genes disproportionately encoded cysteine proteases and saposins, with saposins being uniquely enriched in this gene set (36-fold over chance; q = 9.94•10-3). At least four C. elegans saposins have antimicrobial activity in vitro [163–166], as does a saposin from the strongylid parasite T. circumcincta [167]; conversely, one saposin in A. ceylanicum has no antibacterial activity but does lyse blood cells in vitro [168]. Negatively immunoregulated hookworm saposins might have either or both functions. Of the 26 positively immunoregulated non-intestinal genes, 15 encoded ASP proteins with CAP domains (46-fold over chance; q = 7.61•10-19); 10 had non-intestine-biased expression (10.7-fold over chance; p = 1.12•10-8). Both gene sets disproportionately encoded secreted proteins; neither set had prominently sex-biased expression.
Immunoregulated genes have significantly male-biased expression
Multicellular parasites such as hookworms are generally studied to understand mechanisms of infection common to both sexes, and indeed we collected both ES proteins and RNA from mixed-sex populations. However, males of the African tick Rhipicephalus appendiculatus excrete immunoglobulin-binding proteins that the males themselves do not seem to need, and that are instead required for coinfecting female ticks to feed on host blood efficiently [169]. This instance of sexual cooperation in a parasite made us wonder whether there existed male-biased or female-biased genes among our immunoregulated gene set. Using published RNA-seq data from A. ceylanicum adult males and females [75], we observed that 3,135 and 1,547 of our 33,190 predicted A. ceylanicum genes had male- or female-biased expression, respectively (Table 6 and Fig 5). Going on to check for overlaps of these gene sets with our immunoregulated gene set, we found that 50.1% of our positively immunoregulated intestinal genes (977/1,951; 5.3-fold over background; p = 0) were also male-biased, while only 11.6% of them (227/1,951; 2.5-fold over background; p = 3.34•10-38) were female-biased (Table 9 and S13 and Fig 10).
This UpSet plot shows, for 2,122 genes that in some way are immunoregulated in adult (19-day) A. ceylanicum, their 16 most abundant overlaps with ES genes or other gene expression sets (Table 9). Note that because only adult immunoregulated genes are considered here, the numbers of other gene categories (e.g., 1,003 male-biased genes in this plot) are lower than for the entire A. ceylanicum genome (3,135 genes; Table 6). Types are abbreviated as follows: “ES_genes”, encoding ES proteins; “Female”, female-biased, determined from previously published data; “Intest”, intestine-biased, 19-day; “Male”, male-biased, determined from previously published data; “N_int_imm”, 19-day negatively immunoregulated intestinal; “N_nin_imm”, 19-day negatively immunoregulated non-intestinal; “Non_int”, non-intestine-biased, 19-day; “P_int_imm”, 19-day positively immunoregulated intestinal; and “P_nin_imm”, 19-day positively immunoregulated non-intestinal. The y-axis lists categories of genes that belong to a particular combination of gene types, with their gene numbers; dots in the matrix show each category’s type or types. The largest overlap has 950 genes with both male-biased and positively immunoregulated intestinal expression; the largest overlap involving ES genes has 107 genes that also show positively immunoregulated intestinal expression.
Such predominantly male-biased expression, along with the enrichment of three male-associated gene families (MSP, MFP2, and DUF236) in this gene set, raises the question of whether the changes in hookworm intestinal gene expression we observed with different host immunological backgrounds were actually due to our having harvested a higher proportion of male hookworms from normal than immunosuppressed hosts, leading to a male skew in putatively immunoregulated genes. We see two reasons why such bias is not sufficient to explain our observations. First, we only observed extensive changes of gene activity in dissected intestines from 19-day adults, while observing much smaller changes in non-intestinal tissues from those same adults (Table 6 and Figs 5, 7, and 8). If apparent immunoregulation of genes had actually been due to overlooked male bias in the hookworms we collected from normal hosts, one would expect to see at least as much (if not more) changed gene expression in non-intestinal tissues (which contained most, if not all, of the gonadal tissue from adults) as we saw from dissected intestines. The absence of significant immunoregulation in non-intestinal tissues (or in our 12-day whole animals) is inconsistent with such an artifact. Second, positively immunoregulated intestinal genes are not a simple subset of male-biased genes, as would be expected if male selection accounted for immunological changes of gene expression: 747 of these 1,951 immunoregulated genes had no sex bias, and 227 of them had female-biased expression. We conclude that the transcriptional immunoregulation we describe here, though dominated by male-biased genes, is nevertheless real. This suggests that male and female A. ceylanicum have different responses to the host immune system, and that such differences might account for the observed male-bias from our mixed-sex A. ceylanicum 19-day intestinal RNA-seq data.
The host immune system affects ES gene expression
Protein products of ES genes are thought to affect the host immune system, but whether the host immune system affects ES genes has been unclear. Comparing the ES gene set with immunomodulated gene sets, we observed significant overlaps (Table 9 and Fig 10). Out of 1,951 positively immunoregulated intestinal genes in 19-day hookworms, 153 also encoded ES proteins (27.1% of ES genes; 4.6-fold over chance; p = 1.19•10-59). Of 26 positively immunoregulated non-intestinal genes, 13 were also ES genes (29-fold above chance; p = 7.47•10-17); such expression might reflect synthesis and secretion of ES proteins by cephalic/pharyngeal glands [42].
The 153 positively immunoregulated intestinal ES genes disproportionately encoded astacin proteases, TIMP and TIL protease inhibitors, ShK-like proteins, ASPs, ASPRs, and SCVPs (Table 10 and Fig 11). Of these genes, 69 had ES gene orthologs in A. caninum (20-fold enrichment; p = 6.00•10-72), 24 in N. americanus (11-fold enrichment; p = 2.35•10-18), and 24 in H. contortus (4.4-fold enrichment; p = 1.23•10-9; S14 Table). As noted above, all of the above protein families may affect immunomodulation in hookworm hosts; in addition, astacin has been shown to enable tissue invasion in vitro by A. caninum [77]. Other immunoregulated ES genes also encoded proteins with possible functions in immunoregulation, antithrombosis, or digesting host tissue (S1 Table): the ASPs Acey-NIF-B [88,89] and Acey-HPI [90], two apyrases [109,110], deoxyribonuclease II [114], hyaluronidase [170,171], and superoxide dismutase [172,173].
This UpSet plot shows, for the set of 153 positively immunoregulated intestinal ES genes, occurrences of seven Pfam domains overrrepresented in that set (Table 10). The x-axis, y-axis, and matrix are as in Fig 3, except that the genes being considered are not only ES protein-coding but also positively immunoregulated in the 19-day intestine. Abbreviations for Pfam domains are given in Table 10. Not shown are an additional 53 positively immunoregulated intestinal ES genes that lacked any of these seven overrepresented Pfam domains. The domain patterns are a subset of those observed for ES genes alone (Fig 3), and represent possible virulence factors in the ES gene set.
Discussion
The ability of hookworms to suppress or evade their hosts’ immune systems and feed on their hosts’ blood and other tissues, for years, has motivated identifying hookworm genes whose products interact with the host, either by encoding ES proteins or by directly contacting host blood and tissues in the hookworm intestine [40,74]. Here we have identified ES proteins, intestine-biased genes, and immunoregulated genes in the hookworm A. ceylanicum, the last of which is unique to this study. Indeed, we hypothesized that hookworms regulate genes in response to the host immune system in order to suppress it, which motivated our immunosuppression/transcriptomic study here. Strongylid parasitic nematodes interact with the immune system despite generally not being blood-feeding [83,135,174], so blood feeding is not necessary for such interactions. However, in the case of hookworms, our findings imply that interaction with the host immune system primarily occurs through contact of the hookworm intestinal lumen with host blood.
Zoonotic A. ceylanicum productively infects both humans and other mammals, making it important to humans while also enabling laboratory studies of a true hookworm that is clinically relevant [2,58]. We have analyzed both our new gene sets and previously identified gene sets [54,74,75], to identify traits encoded either by ES genes, intestinal genes, immuoregulated genes, or all three. To enable this work, we repredicted protein-coding genes in A. ceylanicum to the highest completeness so far achieved. We have greatly expanded the number of intestine-biased genes in A. ceylanicum and found them to be extensively homologous to intestine-biased genes in H. contortus [175]. We identified a mixture of functionally suggestive and poorly understood protein families in A. ceylanicum ES proteins and intestine-biased genes, observed genes that are immunoregulated in adult intestines (where they are exposed to host blood and circulating immune factors) but not other tissues, detected an unexpected male bias in positively immunoregulated intestinal genes, and defined a positively immunoregulated subset of ES genes that have a mosaic of conservation and species-specificity.
On examination, it was possible to infer biologically coherent functions from what might seem to be a jumble of hookworm ES proteins. Immunosuppressors have been long suspected to be part of the ES protein repertoire, and we observed several protein types that could have this role. We also observed a pattern of diverse multigene families encoding ES proteins (ASPs, ASPRs, TTLs, and SCVPs); in other parasitic nematodes, such a pattern has been repeatedly seen both in ES proteins and in genetic diversity between strains or sibling species [40,93,176]. It is not clear why this pattern exists; perhaps hookworms and other parasitic nematodes use diverse ES multigene families to survive unpredictably varying host immune systems by confusing their hosts with complex and varying mixtures of protein antigens, which leads parasitic nematodes to accumulate multigene families through balancing selection [177–179]. Another possible function that consistently emerged when examining ES proteins was inhibition of blood clotting. Despite being less often discussed and less obviously virulent, antithrombotics are at least as important for parasitism as immunosuppressants. Hookworm anticoagulants were first observed in 1904 [180], and are likely to be necessary for sustained blood-feeding [110,181,182].
Although we found that ES and immunoregulated genes encode proteins already known or suspected to modulate the host immune response, we also observed two gene families overrepresented in ES and immunoregulated genes that (to our knowledge) have no previously proposed immunomodulatory functions: ASP-related (ASPR) and secreted clade V proteins (SCVP; Tables 2, 8, and 10). ASPR domains (Pfam family PF17641) are related to but distinct from the CAP domains that originally defined ASP proteins (Pfam family PF00188) [24,183]. In A. ceylanicum, ASPR genes are transcriptionally upregulated at the start of infection [59], and have been observed in N. americanus ES proteins [44]. In the most recent release (WBPS19; https://parasite.wormbase.org) of ParaSite-Wormbase [184], ASPR-encoding genes are found in eight additional strongylid species: A. caninum, A. duodenale, Angiostrongylus costaricensis, A. vasorum, Cylicocyclus nassatus, Dictyocaulus viviparus, H. bakeri, N. brasiliensis, Oesophagostomum dentatum, and Strongylus vulgaris. SCVPs (Pfam family PF17619) were first observed as a novel family of secreted proteins upregulated in young A. ceylanicum adults, with no previous related protein domains; although one to a few SCVP genes are found in free-living clade V nematodes such as in C. elegans and Pristionchus pacificus, SCVP gene families are expanded in hookworms and H. contortus [59]. The most recent release (38.0; https://www.ebi.ac.uk/interpro/entry/pfam) of Pfam [183] also shows expansions of SCVP genes in the strongylids A. caninum, C. nassatus, Haemonchus placei, O. dentatum, S. vulgaris, and Teladorsagia circumcincta. Given that they observably span multiple subdivisions of strongylids [185], ASPR and SCVP genes might be newly identified immunomodulatory factors in many or all strongylid parasites.
The strong overlap that we observed between immunoregulation and male-biased gene expression raises the question of whether hookworms have sex-specialized repertoires of genes that they express during infections [186]. There has been one striking instance (in the African tick R. appendiculatus) where males and females interact differently with their hosts and the males act to promote female success during parasitism [169]. In vitro, male and female fourth-stage larvae of H. bakeri release different ES proteins when cultured together than they do when cultured separately as male-only or female-only populations, and mixed-sex ES products from H. bakeri show more immunomodulation of dendritic cells than single-sex products [187,188]. Our current data do not indicate whether such cooperation happens during hookworm infections, let alone whether the male-biased immunoregulated genes we describe are relevant to such hypothetical cooperation. However, such a bias might ensure that both sexes are present in a successful infection, which is required for reproduction. More extensive analysis to test for sexual cooperation during infections or sex-biased gene immunoregulation should be pursued for hookworms and other parasitic nematodes.
Immunoregulated genes of A. ceylanicum have orthologs in A. caninum, N. americanus, and H. contortus. The orthologs of immunoregulated ES genes also encode ES proteins, suggesting that other parasitic nematodes may also have immunoregulated genes, and some findings indicate that they actually do. For the strongylid T. circumcincta, being embedded in host mucosa upregulates ES genes and putative immunomodulatory genes [83,174]. For the strongylid H. bakeri, infecting mouse hosts with colitis induces different ES proteins than normal hosts [189]. Finally, for the strongylid N. brasiliensis, recent transcriptomic and proteomic analyses shows genes that are positively or negatively regulated during infections of normal versus immunocompromised stat6 mutant mouse hosts, a subset of which encode ES proteins [52,190]. Thus, the immunoregulation we observe here for A. ceylanicum may be a general phenomenon found in many other parasitic nematodes.
Materials and methods
Ethics statement
All animal experiments were carried out under protocols approved by the University of Massachusetts Chan Medical School. All housing and care of laboratory animals used in this study conformed to the NIH Guide for the Care and Use of Laboratory Animals in Research (18-F22) and all requirements and all regulations issued by the USDA, including regulations implementing the Animal Welfare Act (P.L. 89–544) as amended (18-F23).
Sample procurement, preparation and storage
Cultures, infections, and collections of A. ceylanicum followed published methods [59,194]. For each A. ceylanicum infection in support of RNA-seq, we purchased Syrian golden hamsters (Mesocricetus auratus) of the HsdHan:AURA strain at 3–4 weeks of age. Hamsters were provided with food and water ad libitum. For immunosuppression experiments, we immunosuppressed half of each set of hamsters by injecting them with dexamethasone (3 mg/kg) twice per week throughout the duration of the experiment [195]; the other half were given mock injections. After the first two injections (one week), both immunocompetent and immunocompromised hamsters were infected at approximately four to five weeks of age with A. ceylanicum. Infections were allowed to progress for 12 or 19 days, after which we euthanized the hamsters and collected hookworms from small intestines dissected from the hamsters. Dissected hamster intestines were put into Hank’s Balanced Salt Solution (pre-warmed to 37°C); worms were picked quickly from the dissected tissues by hand. Collected worms were snap-frozen in liquid nitrogen and stored at −80°C until use. For A. ceylanicum infections in support of ES protein mass spectrometry, we followed a similar protocol but without injections, with two collections of A. ceylanicum at 20 days after infection, and without snap-freezing of dissected hookworms. Stages of A. ceylanicum selected for RNA-seq or proteomics are based on previously described stages of growth in golden hamsters [196]. For hookworm intestinal-specific studies, from each triplicate set of 19-day post-infection hookworms isolated from hamsters as above, we dissected both hookworm intestinal and hookworm non-intestinal tissue, and extracted RNA from the dissected tissues; for the triplicates of smaller 12-day post-infection young adult worms, such dissection was not possible, so we extracted RNA from whole 12-day young adult worms.
RNA harvesting and sequencing
Total A. ceylanicum RNA was extracted from either whole worms or from dissected tissues as in Romeo and Lemaitre [59,197]. RNA-seq was done largely as in Srinavasan et al. [198]. RNA-seq libraries were built with Illumina’s TruSeq RNA Sample Prep Kit v2 executed according to manufacturer’s instructions, using 1 µg of total RNA for each sample. RNA-seq libraries were sequenced in single-end mode with read lengths of 50 nt (for all 19-day data) or 100 nt (for all 12-day data). Newly generated A. ceylanicum RNA-seq libraries are listed in S8 Table.
Protein harvesting
After 20 days of infection, A. ceylanicum hookworms were isolated from their hamster hosts and cultured in liquid culture medium for up to 3 days at 37°C in 5% CO2. Liquid culture medium consisted of RPMI Medium 1640 (Gibco, Cat#11835–030) supplemented with 25 mM pH 7.2 HEPES buffer, 10 µg/ml amphotericin B (Gibco, Cat# 15290–026), and 100 U/ml penicillin/streptomycin (Gibco, Cat# 1570–063), with medium sterilized with a 0.22 µm filter. Fetal bovine serum was omitted from the medium to eliminate any added protein when quantifying ES proteins by BCA assay and to eliminate contamination for proteomics. After the first 24 hours, proteins excreted or secreted from the hookworms were collected by aspirating media only. The aspirated media were centrifuged to remove debris (1500 gs for 20 minutes, at 4°C); the resulting supernatant was concentrated 10-fold using a 3kD Amicon ultra-centrifugal filter (4000 gs for 45 minutes, at 4°C). After this first 10-fold concentration, PBS was added to match the initial volume, after which the media were reconcentrated by again being centrifuged (4000 gs for 45 minutes, at 4°C); we repeated this twice, for a total of three PBS rinses and reconcentrations. After the third PBS rinse/reconcentration, total protein was quantified using a Pierce BCA Protein Assay kit. Purified ES proteins were then frozen and stored at -80°C.
General computation
Where possible, we used mamba to install and run version-controlled software environments from bioconda [199]. For reformatting or parsing of computational results, we used Perl scripts either developed for general use or custom-coded for a given analysis. All such Perl scripts (named below with italics and the suffix “.pl”) were archived on GitHub (https://github.com/schwarzem/ems_perl). Internet sources (URLs) for other software are listed in S15 Table.
Nematode genomes, transcriptomes, coding sequences, and proteomes
For transcriptomic or proteomic analyses, published genome sequences, coding sequences, proteomes, and gene annotations of relevant nematodes were downloaded from WormBase [200] or WormBase ParaSite [184,193] (S16 Table). Published RNA-seq data of A. ceylanicum [59,74,75] and H. contortus [175] were downloaded from the European Nucleotide Archive (S11 Table). Alternative gene predictions for A. ceylanicum recently published by Uzoechi et al. [54] were obtained as a Generic Feature Format Version 3 (GFF3) annotation file (https://github.com/The-Sequence-Ontology/Specifications/blob/master/gff3.md) from Young-Jun Choi <choi.y@wustl.edu> that we have archived at https://osf.io/dxfsb. We extracted protein sequences from this GFF3 via gffread 0.12.7 [201] with the arguments ‘-g [input genome sequence FASTA file] -o/dev/null -C --sort-alpha --keep-genes -P -V -H -l 93 -y [output protein FASTA file] [input gene prediction GFF3 file]’.
Heligmosomoides species nomenclature
Recent genomic analysis has shown that the gastrointestinal parasitic nematode Heligmosomoides has two distinct species: H. bakeri and H. polygyrus [93]. This reinforced previous molecular phylogenetic observations indicating that H. bakeri is a terminal sister clade to H. polygyrus [202], and that both the morphology and the the host-specificity of H. bakeri are distinct from those of H. polygyrus [203–207]. Although laboratory strains of Heligmosomoides have often been described as H. polygyrus [208], it now appears likely that many or all of these strains have actually been H. bakeri. Thus, following previous suggestions for revised nomenclature [209], we refer exclusively to H. bakeri even when citing published work that was nominally done with H. polygyrus.
RNA-seq subsampling
We found that published A. ceylanicum RNA-seq data (S11 Table) were too extensive for gene predictions by BRAKER2 because of memory limitations. To make these data usable by BRAKER2, we selected a representative subset of them with khmer [210,211]. Because khmer requires “#/1” and “#/2” suffixes for paired-end reads, we retrofitted paired-end RNA-seq reads lacking such suffixes with retroname_fastq_reads.pl. We ran normalize-by-median.py from khmer on all data (paired and unpaired) twice, with the arguments ‘-k 31 -C 100 -M 100G’, and ‘-k 31 -C 30 -M 100G’; we then ran filter-abund.py from khmer on paired-end data with the arguments ‘--variable-coverage -C 2 [k-mer hash] [paired-end reads]’. We sorted khmer-filtered data into interleaved paired- and unpaired-end read files with paired_vs_unp_fastq.or.a.pl with the arguments ‘--r1 “#0\/1” --r2 “#0\/2”’.
Reprediction of protein-coding genes
To repredict protein-coding genes in A. ceylanicum, we ran braker.pl in BRAKER2 2.1.6 [62] on our published repeat-softmasked A. ceylanicum genome assembly [59], with the arguments ‘--genome [genome assembly FASTA] --prot_seq [collected proteomes FASTA] --bam [sorted mapped khmer-filtered paired-end read BAM alignment] --etpmode --softmasking --cores 48 --gff3’. BRAKER2 requires an input genome sequence to have its FASTA header lines previously stripped of comments; we did this with uncomment_FASTA_headers.pl. Running BRAKER2 also required us to install: AUGUSTUS 3.4.0 [212]; BamTools 2.5.2 [213]; cdbfasta 0.99 [214]; DIAMOND 2.0.9 [215]; GeneMark-ES/ET 4.68 [216]; and SAMtools 1.12 [217]. To guide BRAKER2 gene predictions, we used our khmer-subsampled paired-end subset of published A. ceylanicum RNA-seq data, along with predicted proteomes from the related nematodes Caenorhabditis elegans [200], H. contortus [218], and N. americanus [44]. After running BRAKER2, we renamed gene, transcript, and exon names in the GFF3 prediction file with a modified version of updateBRAKERGff.py (https://github.com/Gaius-Augustus/BRAKER/issues/416) and Perl one-line commands. To extract coding DNAs (CDS DNAs) and protein sequences from the renamed GFF3, we used gffread 0.12.7 [201] as above, with the additional argument ‘-x [output CDS DNA file]’.
This gave us a new gene set (“v2.0”) that was generally superior (in completeness as assayed by BUSCO, and nonfragmentation as assayed by count_fasta_residues.pl) to our original 2015 gene set (“v1.0”). However, we observed that some v1.0 genes encoded ES proteins (as determined by mass spectrophotometric mapping) but had no genomic overlap with v2.0 genes. To ensure no valid ES genes would be overlooked, we used BEDtools 2.30.0 [219] to identify v1.0 genes whose protein-coding exon sequences (CDSes) had no genomic overlap with v2.0 CDSes, as follows. We used get_gff3_gene_subset.pl to remove GFF3 v2.0 annotations that could not be translated by gffread, and then used Perl to extract CDS annotation lines from the GFF3 files of published v1.0 and gffread-translatable v2.0 gene predictions. We reformatted the CDS-subset files from GFF3 to BED with gff2bed in BEDOPS 2.4.41 [220]. We identified overlapping coordinates of the CDS-only BED annotation files for v1.0 and v2.0 genes with intersect from BEDtools 2.30.0 [219], using the argument ‘-loj’, and used Perl to extract a list of v1.0 genes having no CDS overlaps with v2.0 genes. Given this list of non-overlapping v1.0 genes, we used extract_fasta_subset.pl to extract their encoded CDS DNAs and proteins as subsets from the full published v1.0 CDS DNA and protein sets, and added these CDS DNA and protein subsets to our previous generated v2.0 CDS DNA and protein sequences. This gave us our final hybrid prediction set (“v2.1”) of CDS DNAs and proteins on which all further analyses in this paper were performed. The v2.1 CDS DNAs and proteome have been archived at https://osf.io/dxfsb.
We generated a v2.1 GFF3 as follows. From the published v1.0 GFF3, we extracted GFF3 annotations for these non-overlapping v1.0 genes first by running extract_parasite_GFF3_subset.pl and then by selecting ‘WormBase_imported’ annotation lines with Perl. We reformatted the resulting v1.0 non-overlapping GFF3 with AGAT 1.1.0 [221]. Likewise, we reformatted the gffread-translatable v2.0 GFF3 with AGAT. We then merged the two AGAT-reformatted GFF3s to produce a single v2.1 GFF3 with uniform formatting, which we have archived at https://osf.io/dxfsb.
Assessing quality of protein-coding genes
We determined general statistics of protein-coding gene products with count_fasta_residues.pl using the arguments ‘-e -t prot’; we obtained gene counts from maximum-isoform proteome subsets generated with get_largest_isoforms.pl using the arguments ‘-t parasite’ or ‘-t maker’. We determined and compared the completeness of predicted A. ceylanicum protein-coding gene sets with BUSCO 5.2.2 using the arguments ‘--lineage_dataset nematoda_odb10 --mode proteins’ [222], which tested a given proteome against 3,131 highly conserved single-gene orthologs in nematodes.
Gene reannotation
For the protein products of our v2.1 A. ceylanicum gene set, we predicted both N-terminal signal sequences and transmembrane alpha-helical anchors with Phobius 1.01 [63], reformatting results with tabulate_phobius_hits.pl. We predicted coiled-coil domains with Ncoils 2002.08.22 [223], reformatting results with tabulate_ncoils_x.fa.pl. We predicted low-complexity regions with PSEG 1999.06.10 [224] using the argument ‘-l’ and reformatting results with summarize_psegs.pl. We identified protein domains with Pfam 35.0 [183] database with hmmscan in HMMER 3.3.2 [225,226], using the arguments ‘--cut_ga’ to impose family-specific significance thresholds, ‘-o/dev/null’ to discard text outputs, and ‘--tblout’ to export tabular outputs; Pfam results were reformatted with pfam_hmmscan2annot.pl. We also identified protein domains with interproscan.sh in InterProScan 5.57-90.0 [65] using the arguments ‘-dp -iprlookup -goterms’, and reformatting results with tabulate_iprscan_tsv.pl. We generated Gene Ontology (GO) terms [67,68], EggNOG descriptions [227], and KEGG codes [228] with EnTAP 0.10.7-beta [229] using the argument ‘--runP’, and using selected UniProt [230] and RefSeq [231] proteome databases from highly GO-annotated model organisms (S16 Table); proteome databases were generated with makedb from DIAMOND 0.9.9 [215], itself bundled with EnTAP; annotations from EnTAP were reformatted from protein to gene annotation tables with cds2gene_EnTAP_annot.pl and cds2gene_annot.pl. We identified genes encoding possible antimicrobial peptides (AMP) by mapping predictions by Irvine et al. [106] from v1.0 to v2.1 of our gene predictions. We identified orthologies between A. ceylanicum and other nematodes with OrthoFinder 2.5.4 [66] using the arguments ‘-S diamond_ultra_sens -og’; results were reformatted with prot2gene_ofind.pl and genes2omcls.pl. For all analyses except OrthoFinder, full proteomes were used; for OrthoFinder, we used maximum-isoform proteome subsets generated with get_largest_isoforms.pl. An overall annotation table was constructed from both these and RNA-seq annotations (below) with add_tab_annots.pl.
Gene annotation for related parasitic nematodes
To make consistent comparisons of ES genes (or other categories of genes) between A. ceylanicum and other species, we annotated the protein-coding genes for three related parasites (A. caninum, H. contortus, and N. americanus) using the same methods as for A. ceylanicum above; the analyzed proteomes are listed in S16 Table.
Selection of ES gene sets from related parasitic nematodes for comparative analysis
We selected three published ES gene sets from A. caninum, H. contortus, and N. americanus for comparative analysis by the following criteria. They had to have either gene identification numbers (IDs) from public genome assemblies, or, if they did not have gene IDs from valid genomes, they needed to at least have expressed sequence tag (EST) IDs for which the original sequence data were publicly available so that they could be mapped onto genomes (and thus to modern genes with proper IDs). This requirement disqualified previously published ES gene sets for H. bakeri and N. brasiliensis. Second, the gene IDs needed to be correctly mapped from protein mass spectrometric data of one species onto genes of the same species. Remarkably, this requirement disqualified the ES genes from Angiostrongylus, which were generated from A. vasorum protein mass spectrometry data but were mapped onto genes of A. cantonensis and A. costaricensis. Third, the ES genes needed to be from parasitic nematodes that had evolved parasitism in common with A. ceylanicum hookworms. This criterion disqualified ES genes from the giant roundworm Ascaris suum or the whipworm Trichuris muris, which evolved parasitism independently of hookworms and other strongylids.
RNA-seq data
For A. ceylanicum, we generated biologically triplicated RNA-seq data for each biological condition. We also analyzed published RNA-seq data for A. ceylanicum [59,74,75] and H. contortus [175], listed in S11 Table. Before analysis, we Chastity-filtered new A. ceylanicum RNA-seq reads with quality_trim_fastq.pl using the arguments ‘-q 33 -m 50’. We then quality-filtered and adaptor-trimmed new A. ceylanicum RNA-seq reads with fastp 0.20.0 [232] using the arguments ‘--dont_overwrite --detect_adapter_for_pe --n_base_limit 0 --length_required X’, with X = 50 for new A. ceylanicum data. We quality-filtered and adaptor-trimmed previously published A. ceylanicum and H. contortus RNA-seq reads with fastp 0.20.0 using the arguments ‘--dont_overwrite --n_base_limit 0 --length_required X’ with X = 50 for A. ceylanicum.
RNA-seq expression values and significances
We quantified gene expression from RNA-seq data sets with Salmon 1.9.0, generating expression values in transcripts per million (TPM) and estimating mapped read counts per gene [233]. To prevent spurious mappings of RNA-seq reads, we used full selective alignment to a “gentrome” (a CDS DNA set, treated as a target for real mappings, combined with its genome, treated as a decoy for spurious mappings), followed by quantification using Salmon in non-alignment mode [234,235]. For Salmon’s index program, we used the arguments ‘--no-version-check --keepDuplicates -t [gentrome sequence] -d [decoy list]’; for Salmon’s quant program, we used the arguments ‘--no-version-check --seqBias --gcBias --posBias --libType A --geneMap [transcript-to-gene table]’, with ‘--unmatedReads’ used for single-end data. Results from quant.genes.sf output files were reformatted with make_salmon_TPM_slices.pl.
Heatmapping of RNA-seq data
To visualize and cluster gene expression values for RNA-seq replicates, we converted expression values of 0 TPM to empirical pseudozeros with assort_tpms.pl, with each replicate’s empirical pseudozero being defined as the lowest non-zero TPM expression value observed for all genes within that replicate. This allowed logarithmic transformation of zero expression values while defining these values in a replicate-specific way. We then converted the expression values of replicates to log10(TPM) scores with log10_tsv_numbers.pl. Finally, we heatmapped and dendrogram-clustered all replicates with ComplexHeatmap 2.18.0 [236] using default parameters. Heatmapping showed individual non-dexamethasone (nonDEX) and dexamethasone (DEX) intestinal RNA-seq replicates that anomalously clustered with two DEX and nonDEX intestinal RNA-seq replicates, respectively (S1 Fig); these anomalous replicates were thus not used for differential gene expression analysis.
Differential gene expression analysis
Having generated RNA-seq readcounts, we used the exactTest function of edgeR 3.36.0 [237] to compute log2 fold-changes (log2FC) and false discovery rate (FDR) significance values [238], to identify statistically significant changes of gene expression between pairs of biological conditions in our RNA-seq data. Dispersions were computed with edgeR’s estimateDisp function using the argument ‘robust=TRUE’. For this analysis, we used all non-anomalous replicates of our A. ceylanicum RNA-seq data (S8 Table) along with published A. ceylanicum RNA-seq data from whole fourth-stage larval (L4) males, whole adult males, whole adult females, and adult male intestine (S11 Table). To increase the statistical signal for differentially expressed genes, we used filter_minimum_readcounts.pl with the argument ‘10’ to remove genes which failed to achieve 10 mapped reads in any biological replicate before submitting readcounts to edgeR. An edgeR R script (S1 File) was generated with make_edgeR_classic_scripts.pl, and run in batch mode with R 4.1.3. In summarizing edgeR results, we defined a gene as being differentially expressed (e.g., with tissue-, sex-, or condition-biased expression) if edgeR scored it as being ≥2-fold more strongly or weakly expressed in one condition than another (either log2FC ≥ 1, or log2FC ≤ -1) with an FDR of ≤ 0.01. All other genes with intermediate expression ratios between two conditions (-1 < log2FC < 1), or with expression ratios that were determined with lower significance (FDR > 0.01) were not classified as having differential gene expression. We used edgeR’s exactTest with these criteria because it is well-adapted for analyzing small numbers of biological replicates [239].
UpSet plotting of gene set overlaps
To visualize overlaps between categories of genes, we used UpSet 2 [240,241] at the Visualization Design Lab at the University of Utah (https://vdl.sci.utah.edu/upset2), uploading our data to UpSet 2 via Multinet.app (https://upset.multinet.app).
Volcano plotting of RNA-seq data
To visualize changes of gene expression between two biological conditions in our RNA-seq data, we generated volcano plots [242] with EnhancedVolcano 1.24.0 (https://github.com/kevinblighe/EnhancedVolcano). R scripts (S2 File) were run in batch mode with R 4.4.3.
RNA-seq and differential gene expression analysis for H. contortus
These were performed with published RNA-seq data for dissected adult female intestine and whole adult female bodies (S11 Table) [175] and with predicted genes from the published chromosomal reference genome (S16 Table) [218], by the same methods as used for A. ceylanicum.
Constructing a protein set for mass spectrophotometric analysis
To combine our newly predicted A. ceylanicum proteins with alternative predictions by Coghlan et al. [185], we ran cd-hit-2d from CD-HIT 4.8.1 [243] with the arguments ‘-i [v2.1 predicted proteome] -i2 [alternative published proteome by Coghlan et al.] -o [distinct Coghlan protein sequences] -d 100 -c 1.0 -M 0 -T 1 -l 5 -s 0.0 -aL 0.0 -aS 1.0’. We then joined the v2.1 proteome with distinct Coghlan sequences to yield a nonredundant A. ceylanicum protein set for LC-MS/MS analysis.
Liquid chromatography-tandem mass spectrometry (LC-MS/MS) analysis
Peptides were analyzed with an Orbitrap Exploris 480 hybrid quadrupole-Orbitrap mass spectrometer coupled to an UltiMate 3000 RSLCnano liquid chromatography system (ThermoFisher Scientific). Peptides were loaded onto an Acclaim PepMap RSLC column (75 µm × 15 cm, 2 µm particle size, 100 Å pore size, ThermoFisher Scientific) over 15 min in 97% mobile phase A (0.1% formic acid) and 3% mobile phase B (0.1% formic acid, 80% acetonitrile) at 0.5 µL/min. Peptides were eluted at 0.3 µL/min using a linear gradient from 3% mobile phase B to 50% mobile phase B over 120 min. Peptides were electrosprayed through a nanospray emitter tip connected to the column by applying 2000 V through the ion source’s DirectJunction adapter. Full MS scans were performed at a resolution of 60,000 at 200 m/z over a range of 300–1,200 m/z with an AGC target of 300% and the maximum injection time set to ‘auto’. The top 20 most abundant precursors with a charge state of 2–6 were selected for MS/MS analysis with an isolation window of 1.4 m/z and a precursor intensity threshold of 5 × 103. A dynamic exclusion window of 20 s with a precursor mass tolerance of ± 10 ppm was used. MS/MS scans were performed using HCD fragmentation using a normalized collision energy of 30% and a resolution of 15,000 with a fixed first mass of 110 m/z. The AGC target was set to ‘standard’ with a maximum injection time of 22 ms.
Mass spectrometry data analysis
Thermo RAW files were searched against our nonredundant A. ceylanicum protein set using the SEQUEST algorithm in Proteome Discoverer 2.4 (Thermo). Data were searched for peptides with tryptic specificity with a maximum of two missed cleavages. The precursor mass tolerance was set at 10 ppm and the fragment mass tolerance was set at 0.02 Da. Search parameters included carbamidomethylation at cysteine (+57.021 Da) as a constant modification and the following dynamic modifications: oxidation at Met (+15.995 Da), acetylation at protein N-termini (+42.011 Da), Met loss at protein N-termini (-131.040 Da), Met loss+acetylation at protein N-termini (-89.030 Da). The Percolator node of Proteome Discoverer was used for PSM validation at a false discovery rate of 1%. Final results for both LC-MS/MS analyses are given in S17 Table.
Mapping ES mass spectrometry hits onto gene predictions
Our protein database for analyzing ES data included both our new v2.1 A. ceylanicum predictions and any predictions by Coghlan et al. that differed by even one amino acid residue from ours. We selected ES protein hits in S17 Table whose experimental combined q-value was ≤ 0.01; for each ES sample, we extracted their gene names as separate v2.1 and Coghlan genes, yielding four ES gene lists in all.
Because gene prediction in complex eukaryotes remains challenging [60] and because the Coghlan predictions were made on a different strain of A. ceylanicum (Indian strain, US National Parasite Collection Number 102954) than our laboratory strain used for genome sequencing and gene prediction (HY135), ES proteins whose mass spectrophotometic data fit a Coghlan prediction exclusively might do so only through slight differences in predicted gene structures or through strain-specific amino acid polymorphisms. To identify such cases where a Coghlan gene was equivalent to one of our v2.1 genes, we mapped the protein-coding exons (CDSes) of Coghlan genes onto our v2.1 gene set, as follows. We downloaded Coghlan gene annotations in GFF3 format and their corresponding A. ceylanicum genome assembly from WormBase ParaSite (S16 Table). We mapped the coordinates of Coghlan gene annotations onto our A. ceylanicum genome assembly sequence (i.e., lifted them over) with Liftoff 1.6.3 [244] using the arguments ‘-copies -polish -cds’. We extracted CDS coordinate lines from the resulting liftover GFF3 file with Perl, and did likewise for the CDS coordinates from the GFF3 of our v2.1 gene prediction set. We converted Coghlan and v2.1 CDS coordinate files from GFF3 to BED format with gff2bed in BEDOPS 2.4.41 [220]. We identified overlapping Coghlan and v2.1 CDSes with intersect in BEDtools 2.30.0 [219], using the argument ‘-loj’. We extracted columns 10 and 20 from the resulting intersection file with cut (https://www.gnu.org/software/coreutils/cut) using the argument ‘-f 10,20’, and processed the results with washu.cds_loj_umass.cds_genemap.pl using the positional arguments ‘[CDS-to-gene table] [BEDtools intersect columns 10 and 20 table]’; the CDS-to-gene map file was extracted from the FASTA headers of Coghlan and v2.1 proteomes via Perl.
We then used annot_es_v01.pl with our Coghlan-to-v2.1 gene map and our four ES gene lists to produce a unified list of ES-encoding v2.1 genes.
Mapping ES genes of Uzoechi et al. onto our gene predictions
Uzoechi et al. have recently identified ES-encoding genes in A. ceylanicum, using their own reprediction of our original 2015 gene set [54]. To compare their results to ours, we first mapped their results from ES transcripts to ES genes with map_tx_to_genes.pl. We then mapped their predicted gene set onto ours using methods similar to those for the Coghlan gene set. Because their repredictions were performed on our A. ceylanicum HY135 genome sequence rather than their A. ceylanicum Indian genome (as Coghlan predictions had been), we did not need to use Liftoff to map coordinates from one genome to the other. Subsequent mapping steps (CDS extraction with Perl; GFF to BED conversion with BEDOPS gff2bed; overlap determination with BEDtools intersect; extraction and summarizing of ES gene results with annot_es_v02.pl) were as above.
Statistical significance of overlapping gene sets
To identify significant overlaps between sets of A. ceylanicum genes, we used the Perl script motif_group_fisher.pl, which computed p-values from two-tailed Fisher tests with the Perl module Text::NSP::Measures::2D::Fisher::twotailed; for multiple hypothesis testing (e.g., comparisons to sets of protein domains) this Perl script also computed q-values via the qvalue program of the MEME 5.4.1 software suite [245].
Supporting information
S1 Table. Annotations, RNA-seq expression, and differential gene expression for the A. ceylanicum v2.1 protein-coding gene set.
These annotations are provided both for all 33,190 protein coding genes (in the sheet labeled ‘Acey_v2.1 annots’), and for the following subsets of genes: encoding ES proteins (‘ES genes’, 565 genes); encoding ES proteins that were detected twice (‘ES genes detected 2x’, 350 genes); intestine-biased expression, 19-day (‘Intestine-biased’, 1,670 genes); non-intestine-biased expression, 19-day (‘Non-intestine-biased’, 1,196 genes); 19-day positively immunoregulated intestinal (‘Pos.intest.immunoreg’, 1,951 genes); 19-day positively immunoregulated intestinal and also encoding ES proteins (‘Pos.intest.immunoreg + ES’, 153 genes); 19-day negatively immunoregulated intestinal (‘Neg.intest.immunoreg’, 137 genes); 19-day positively immunoregulated non-intestinal (‘Pos.non-intest.immunoreg’, 26 genes); 19-day negatively immunoregulated non-intestinal (‘Neg.non-intest.immunoreg’, 4 genes); and 12-day negatively immunoregulated (‘Neg.12d.immunoreg’, 2 genes). In practice, users of this data will have many other different subsets of annotations that they want to extract. Thus, both here and in subsequent supplementary tables (S2-S17 Tables), the data are provided as an Excel file that the reader can filter in order to select particular subsets of data. To do this in an open Excel file, first select the top row (which will contain values that will be filtered); then select the AutoFilter option in the Data menu. For each data column, drop-down arrows should appear. These are options for filtering data in each column. For a data type/column of interest, click the drop-down arrow to open its filter menu, pick criteria for the data in that column (e.g., “Contains” a gene name or “Greater than or equal to” a given number), and click the “Apply Filter” button to get only the subset of data meeting those criteria. Multiple criteria for multiple data types can be selected (and, if they are too stringent or mutually inconsistent, may end up yielding no data). Criteria can be cleared from a given data column with the “Clear Filter” button. The data columns for S1 Table are as follows. Gene identifications and equivalencies Gene: A given predicted protein-coding gene in the A. ceylanicum genome assembly. All further data columns are pertinent to that particular gene. Gene_name: A human-readable gene name, where it exists (e.g., “Acey-CP-1” instead of simply “Acey_s0154.v2.g3234”). These names are used in the main text to discuss genes of particular interest. Mapped_v1.0_gene: Any gene or genes from our previous v1.0 predictions which overlap the v2.1 gene here by least 1 nt of coding exon sequence in the A. ceylanicum genome. Although this criterion errs on the side of sensitivity and we have made no effort to filter out short overlaps, we expect that in practice this will identify extensive exon overlaps between an earlier v1.0 gene and its reprediction in v2.1. Mapped_Coghlan_gene: Any gene or genes from the previous A. ceylanicum gene predictions by Coghlan et al. [185] which, when lifted over onto our A. ceylanicum genome assembly, overlap the v2.1 gene here by least 1 nt of coding exon sequence. Mapped_Uzoechi_gene: Any gene or genes from the recent A. ceylanicum gene repredictions by Uzoechi et al. [54] which overlap the v2.1 gene here by least 1 nt of coding exon sequence. General traits of protein products. Max_prot_size: The size of the largest predicted protein product. Prot_size: This shows the full range of sizes for all protein products from a gene’s predicted isoforms. Phobius: This denotes predictions of signal and transmembrane sequences made with Phobius 1.01 [63]. ‘SigP’ indicates a predicted signal sequence, and ‘TM’ indicates one or more transmembrane-spanning helices, with N helices indicated with ‘(Nx)’. Varying predictions from different isoforms are listed. NCoils: This shows coiled-coil domains, predicted by Ncoils 2002.08.22 [223]. Both the proportion of such sequence (ranging from 0.01 to 1.00) and the exact ratio of coiled residues to total residues are given. Proteins with no predicted coiled residues are blank. Psegs: This shows what fraction of a protein is low-complexity sequence, as detected by PSEG 1999.06.10 [224]. As with Ncoils, relative and absolute fractions of low-complexity residues are shown. Pfam: Predicted protein domains from Pfam 35.0 [183], with family-specific significance thresholds. InterPro: Predicted protein domains from InterProScan 5.57-90.0 [65]. AMP: An antimicrobial peptide (AMP) gene annotation for v1.0 of our A. ceylanicum gene predictions, predicted by Irvine et al. [106], and mapped onto our v2.1 gene predictions here. GO_Biological: Annotations from the biological subset of Gene Ontology (GO) terms [67,68], generated with EnTAP 0.10.7-beta [229]. GO_Molecular: Annotations from the molecular subset of Gene Ontology (GO) terms [67,68], generated with EnTAP 0.10.7-beta [229]. GO_Cellular: Annotations from the cellular subset of Gene Ontology (GO) terms [67,68], generated with EnTAP 0.10.7-beta [229]. EggNOG_description: EggNOG descriptions [227], generated with EnTAP 0.10.7-beta [229]. EggNOG_KEGG: KEGG codes [228], generated with EnTAP 0.10.7-beta [229]. Orthologies of protein products. OFind_Summary and OFind_Full: The results for our OrthoFinder analysis [66] of orthologies between A. ceylanicum and the related nematodes A. caninum, C. elegans, H. contortus, H. bakeri, N. americanus, N. brasiliensis, Pristionchus pacificus, and T. circumcincta. For one of these species (N. americanus) two different proteomes were included: the original predicted proteome by Tang et al. [248] labeled ‘necator_orig’, and the repredicted proteome by Logan et al. [44] labeled ‘necator_rev’. Two different views of these results are given: the summary lists taxa and gene counts, while the full results give individual gene names. ES- and gene-expression-related traits. Intest_haemonchus: Orthologous H. contortus genes (taken from OFind_Full) that have intestine-biased gene expression, as computed by our analysis of previously published H. contortus RNA-seq data (S11 Table). Non-intest_haemonchus: Orthologous H. contortus genes (taken from OFind_Full) that have non-intestine-biased gene expression, as computed by our analysis of previously published H. contortus RNA-seq data (S11 Table). ES_component: A gene whose protein product was detected in at least one of our two ES mass spectrometry experiments, E20201108-05 and E20201108-07. Note that this was used as a binary (Boolean) classification for statistical analyses of gene set overlaps; see below for other such binary classifications. ES_observations: Observations of a gene’s protein product being present in either E20201108-05, or E20201108-07, or both. Coghlan-spec_ES: In our mass spectrometry analysis, our observation of a peptide mapping specifically to a Coghlan gene-encoded protein that was distinct from our v2.1 proteins by at least one amino acid residue, but whose Coghlan gene’s coding exons could then be lifted over to the coding exons of the v2.1 gene here (Mapped_Coghlan_gene). For each such mapping, the ES observations behind it are noted (either E20201108-05, or E20201108-07, or both). Uzoechi_ES: An ES-encoding gene detected by Uzoechi et al. [54] whose coding exons mapped onto the coding exons of this v2.1 gene (in Mapped_Uzoechi_gene). Note that Uzoechi et al. separately observed both male and female ES proteins, and those specific observations are given here. Wong_ES: An ES-encoding gene detected by Wong et al. [55] whose coding exons mapped onto the coding exons of this v2.1 gene (in Mapped_Uzoechi_gene). Note that Wong et al. separately observed both L3i and adult ES proteins, and those specific observations are given here. ES_a_caninum: Orthology to an A. caninum gene encoding an A. caninum ES protein observed by Morante et al. [45], extracted from OFind_Full. ES_necator_rev: Orthology to an N. americanus gene encoding an N. americanus ES protein observed either by Logan et al. [44] or by Wong et al. [55], extracted from OFind_Full. ES_haemonchus: Orthology to an H. contortus gene encoding an H. contortus ES protein observed by Wang et al. [47], extracted from OFind_Full. Binary (Boolean) classifications of genes. Male-biased: Annotation here indicates a gene with male-biased gene expression, as defined by ≥2-fold higher expression in males versus females, with a false discovery rate (FDR) of ≤ 0.01. Female-biased: Annotation here indicates a gene with female-biased gene expression, as defined by ≥2-fold higher expression in females versus males, with a false discovery rate (FDR) of ≤ 0.01. Intestine-biased: Annotation here indicates a gene with intestine-biased gene expression, as defined by ≥2-fold higher expression in 19-day intestinal tissue from normal hosts versus 19-day non-intestinal tissue from normal hosts, with a false discovery rate (FDR) of ≤ 0.01. Non-intestine-biased: Annotation here indicates a gene with non-intestine-biased gene expression, as defined by ≥2-fold higher expression in 19-day non-intestinal tissue from normal hosts versus 19-day intestinal tissue from normal hosts, with a false discovery rate (FDR) of ≤ 0.01. Any.19d.immunoreg: Annotation here indicates a gene with either intestinal or non-intestinal and either positively or negatively immunoregulated gene expression, as defined by ≥2-fold higher or lower expression in 19-day intestinal or non-intestinal tissue from normal hosts versus 19-day corresponding tissue from dexamethasone-immunosuppressed hosts, with a false discovery rate (FDR) of ≤ 0.01. Any.19d.pos.immunoreg: Annotation here indicates a gene with either intestinal or non-intestinal positively immunoregulated gene expression, as defined by ≥2-fold higher in 19-day intestinal or non-intestinal tissue from normal hosts versus 19-day corresponding tissue from dexamethasone-immunosuppressed hosts, with a false discovery rate (FDR) of ≤ 0.01. Pos.intest.immunoreg: Annotation here indicates a gene with intestinal positively immunoregulated gene expression, as defined by ≥2-fold higher expression in 19-day intestinal tissue from normal hosts versus 19-day intestinal tissue from dexamethasone-immunosuppressed hosts, with a false discovery rate (FDR) of ≤ 0.01. Neg.intest.immunoreg: Annotation here indicates a gene with intestinal negatively immunoregulated gene expression, as defined by ≥2-fold higher expression in 19-day intestinal tissue from dexamethasone-immunosuppressed hosts versus 19-day intestinal tissue from normal hosts, with a false discovery rate (FDR) of ≤ 0.01. Pos.non-intest.immunoreg: Annotation here indicates a gene with non-intestinal positively immunoregulated gene expression, as defined by ≥2-fold higher expression in 19-day non-intestinal tissue from normal hosts versus 19-day non-intestinal tissue from dexamethasone-immunosuppressed hosts, with a false discovery rate (FDR) of ≤ 0.01. Neg.non-intest.immunoreg: Annotation here indicates a gene with non-intestinal negatively immunoregulated gene expression, as defined by ≥2-fold higher expression in 19-day non-intestinal tissue from dexamethasone-immunosuppressed hosts versus 19-day non-intestinal tissue from normal hosts, with a false discovery rate (FDR) of ≤ 0.01. Neg.12d.immunoreg: Annotation here indicates a gene with negatively immunoregulated gene expression in 12-day young adults, as defined by ≥2-fold higher expression in 12-day young adults from dexamethasone-immunosuppressed hosts versus 12-day young adults from normal hosts, with a false discovery rate (FDR) of ≤ 0.01. (Note that no genes were observed with positively immunoregulated gene expression in 12-day young adults, so no ‘Pos.12d.immunoreg’ data column was needed). WashU_any_adult_ES: Annotation here indicates a gene which was detected as encoding some sort of A. ceylanicum ES protein either by Uzoechi et al. [54] or by Wong et al. [55]. Uzoechi_male.only_ES: Annotation here indicates a gene which was detected as encoding a male-specific ES protein by Uzoechi et al. [54], annotated as solely ‘Uzoechi_male_ES’ in Uzoechi_ES. Uzoechi_female.only_ES: Annotation here indicates a gene which was detected as encoding a female-specific ES protein by Uzoechi et al. [54], annotated as solely ‘Uzoechi_female_ES’ in Uzoechi_ES. Uzoechi_both_ES: Annotation here indicates a gene which was detected as encoding an ES protein in both males and females by Uzoechi et al. [54], annotated as both ‘Uzoechi_male_ES’ and ‘Uzoechi_female_ES’ in Uzoechi_ES. Gene expression. [X].TPM: For each individual RNA-seq data set (with ‘X’ denoting the data set’s abbreviation), this gives gene expression levels in TPM, computed by Salmon 1.9.0 [233]. Keys to all abbreviations are given in S8 and S11 Tables. Biological replicates of RNA-seq samples are denoted by suffixes such as ‘_1’, ‘_2’, or ‘_3’. [X].reads: For each individual RNA-seq data set (with ‘X’ denoting the data set’s abbreviation), this gives numbers of mapped RNA-seq reads per gene, computed for individual RNA-seq data sets by Salmon 1.9.0 [233], with fractional values rounded down to integers. Differential gene expression between biological conditions. [X].vs.[Y].logFC: The fold-changes of gene expression between biological condition X and biological condition Y, expressed as log2 values, and with positive values representing greater expression in condition X. The values listed here are only those were computed to be significant using edgeR 3.36.0 [237], with multiple biological RNA-seq replicates for most conditions being compared, with all biological replicates being analyzed in a single edgeR run, and with significant results annotated for individual genes. Biological conditions of RNA-seq samples are abbreviated as shown in [X].TPM or [X].reads but without the replicate suffixes. [X].vs.[Y].FDR: The false discovery rate (FDR) for gene expression changes between biological condition X and biological condition Y, annotated for individual genes. The FDR for a given set of positive results is defined as that significance threshold which, if accepted, will lead to the entire set of positives having a collective false-positive rate no greater than the FDR; it therefore provides a way to correct for testing multiple hypotheses without rejecting excessive numbers of true positives. As with [X].vs.[Y].logFC, only changes that were computed to be significant by edgeR are listed.
https://doi.org/10.1371/journal.pntd.0014106.s001
(XLSX)
S2 Table. Numbers of genes encoding adult ES proteins of A. ceylanicum.
Numbers are given for our observations, for recently published observations by Uzoechi et al. [54] and Wong et al. [55], and for their subsets and overlaps. Uzoechi et al. independently observed ES proteins from males and females, so a gene in their set could encode ES proteins from either or both sexes. ES genes by Uzoechi et al. or Wong et al. were mapped from their predicted gene set (Table 1) to ours before analysis. Of our observed ES genes, 14 had no corresponding gene in the Uzoechi gene set, and thus might have actually been present in the Uzoechi or Wong ES mass spectrometry data yet not mapped onto a gene.
https://doi.org/10.1371/journal.pntd.0014106.s002
(XLSX)
S3 Table. Statistical analyses of the overlaps between A. ceylanicum genes encoding protein domains (or other traits) and A. ceylanicum genes encoding ES proteins.
Each category of genes (e.g., genes encoding a particular protein domain or having a particular trait) was compared for its degree of overlap to a set of ES genes, and statistically analyzed for the non-randomness of this overlap by a two-tailed Fisher test. In situtations where many categories were compared at once (for instance, an entire set of protein domains from Pfam, InterPro, or Phobius), p-values initially generated by Fisher testing were used to compute q-value significance scores which corrected for multiple hypothesis testing. Statistics are provided for the set of 565 ES genes described in this paper (labeled “UMass_ES”), for the set of 955 adult ES genes described by Uzoechi et al. [54] or Wong et al. (labeled “WashU_ES”), and for the set of 511 adult ES genes described only by Uzoechi et al. or Wong et al. but not by data in this paper (labeled “WashU-only ES”). For each ES set, spreadsheets are given for analyses of overlaps with the following traits; signal or transmembrane domain organizations predicted by Phobius; protein domains in Pfam; protein domains in InterPro; and various binary comparisons with other gene traits (“Various”). In addition, two spreadsheets are given for comparisons of the statistical significances (q-values) for overrepresented Pfam and InterPro domains, with the comparisons being between our set of 565 ES genes and the WashU-only set of 511 adult ES genes. In these comparisons, only domains with a q-value ≤ 0.1 for at least one gene set were considered, and the q-values were compared by computing their ratios for each domain. For a given domain, a q-value ratio very far from 1 indicated that that domain was much more significantly overrepresented in one gene set than in the other. Each analysis of overrepresentation in gene sets provides the following data: “Motif” denotes the specific protein domain (or other gene category) for which genes encoding it are being tested for nonrandomly high (or low) overlap with the ES gene set; “All_genes” gives the total number of v2.1 protein-coding genes within which overlaps were tested; “Motif_genes” gives the number of genes annotated with the protein domain (or other trait) being tested for overlap; “Class_genes” gives the number of ES genes being tested for overlap; “Motif.Class_overlap” gives the observed number of genes falling into both categories; “Exp_rand_overlap” gives the number of genes that would be expected to overlap purely randomly; “Enrichment” gives the ratio of observed versus random overlaps (note that this ratio can be lower than 1, and in fact can be as low as 0); “p-value” gives an initial stastistical significance for the observed overlap, computed by a two-tailed Fisher test; “q-value” gives, for cases of many comparisons at once (e.g., testing for all Pfam domains simultaneously), a significance score that conservatively corrects for multiple hypothesis testing [238,245]. Note that q-values were not computed for simple binary comparisons of gene traits listed in “Various”, which are annotated for A. ceylanicum genes in S1 Table, and defined in its table legend: Intestine-biased; Non-intestine-biased; Pos.intest.immunoreg; Neg.intest.immunoreg; Pos.non-intest.immunoreg; Neg.non-intest.immunoreg; Male-biased; Female-biased; Uzoechi_any_ES; Uzoechi_male.only_ES; Uzoechi_female.only_ES; and Uzoechi_both_ES. Also, again note that highly significant overlaps can be either higher or lower than the randomly expected genome-wide background rate.
https://doi.org/10.1371/journal.pntd.0014106.s003
(XLSX)
S4 Table. Annotations, RNA-seq expression, and differential gene expresssion for the H. contortus protein-coding gene set.
We annotated predicted protein products with N-terminal signal sequences, conserved protein domains, orthologies to genes in related nematode species, and Gene Ontology (GO) terms describing biological and molecular functions. Protein-coding genes were predicted by Doyle et al. [218]. Since the annotation methods we used for this H. contortus proteome were identical to those we used for our A. ceylanicum v2.1 proteome, almost all the data columns used here are equivalent to those in S1 Table. One data column unique to this table is “Hco_ES”, which denotes H. contortus genes previously demonstrated by Wang et al. [47] to encode ES proteins.
https://doi.org/10.1371/journal.pntd.0014106.s004
(XLSX)
S5 Table. Annotations for protein-coding gene sets of A. caninum and N. americanus.
We annotated predicted protein products with N-terminal signal sequences, conserved protein domains, orthologies to genes in related nematode species, and Gene Ontology (GO) terms describing biological and molecular functions. Protein-coding genes were predicted for A. caninum by Coghlan et al. [185] and for N. americanus by Logan et al. [44]. Since the annotation methods we used for these proteomes were identical to those we used for our A. ceylanicum v2.1 proteome, almost all the data columns used here are equivalent to those in S1 Table. Two data columns unique to these tables are Acan_ES and Necator_ES, which respectively denote A. caninum or N. americanus genes previously demonstrated by Morante et al. [45] or Logan et al. [44] to encode ES proteins.
https://doi.org/10.1371/journal.pntd.0014106.s005
(XLSX)
S6 Table. Statistical analyses of the overlaps between A. caninum, N. americanus, or H. contortus genes encoding protein domains (or other traits) and A. caninum, N. americanus, or H. contortus genes encoding ES proteins.
ES gene annotations are taken from S4 and S5 Tables. Statistical significances of overlaps between domain/trait genes and ES genes were computed and described as in S3 Table.
https://doi.org/10.1371/journal.pntd.0014106.s006
(XLSX)
S7 Table. Statistical analyses of overlaps between pairs of A. ceylanicum gene sets, with A. ceylanicum genes encoding ES proteins versus A. ceylanicum genes having various orthologies to either ES genes or any genes in A. caninum, H. contortus, or N. americanus.
“ES_a_caninum”, “ES_haemonchus”, and “ES_necator_rev” denote sets of A. ceylanicum genes with orthology to ES genes in A. caninum, H. contortus, or N. americanus; “ES_Acan.or.Hco.or.L2020” denotes a set of A. ceylanicum genes with orthology to ES genes in any of these three species. “Acan_homology”, “Haemonchus_homology”, and “Necator_homology” denote sets of A. ceylanicum genes with orthology to any genes (ES protein-encoding, or not) in A. caninum, H. contortus, or N. americanus; “Acan.Hco.Nec_homology” denotes a set of A. ceylanicum genes with orthology to any genes in any of these three species. The ES gene sets are either from this study (“all_UMass_ES”) or from Uzoechi et al. [54] (“all_WashU_ES”). Subsets of these ES gene sets that lack orthologies to known ES genes in related parasites are denoted with “non_AcanES_homol_[ES]” (for A. ceylanicum ES genes lacking orthologies to A. caninum ES genes), “non_HcoES_homol_[ES]” (for A. ceylanicum ES genes lacking orthologies to H. contortus ES genes), “non_NecES_homol_[ES]” (for A. ceylanicum ES genes lacking orthologies to N. americanus ES genes), or “non_anyES_homol_[ES]” (for A. ceylanicum ES genes lacking orthologies to ES genes in any of the three other species). Statistical significances of overlaps between orthologous genes and ES genes were computed and described as in S3 Table.
https://doi.org/10.1371/journal.pntd.0014106.s007
(XLSX)
S8 Table. A. ceylanicum RNA-seq libraries newly generated in this study.
“RNA-seq library ID” provides a short abbreviation for each library; these abbreviations have been used for expression levels (TPMs), mapped read counts (reads), and differential gene expression analyses in later supplementary data tables (S9-S17 Tables). For each library, “SRA accession”, “BioProject accession” and “BioSample accession” provide accession numbers; “Reads”, “Total nt”, and “Read length in nt” give the number of reads, total sequence in nt, and read length; and “RNA-seq description” summarizes biological contents.
https://doi.org/10.1371/journal.pntd.0014106.s008
(XLSX)
S9 Table. Statistical analyses of the overlaps between A. ceylanicum genes encoding protein domains (or other traits) and A. ceylanicum genes with either intestine-biased or non-intestine-biased gene expression.
Statistical significances of overlaps between domain/trait genes and intestine-biased or non-intestine-biased genes were computed and described as in S3 Table. In addition, pairwise overlaps between intestine- or non-intestine-biased gene sets and other gene categories are provided in the ‘Various’ spreadsheet; since these do not involve multiple comparisons of entire domain sets, only p-values rather than q-values are computed for these overlaps. A. ceylanicum gene categories in “Various” are: “Intest_haemonchus”, genes with orthology to H. contortus genes with intestine-biased expression; “Non-intest_haemonchus”, genes with orthology to H. contortus genes with non-intestine-biased expression; “WashU-intestine-biased”, genes with intestine-biased expression computed from previously published A. ceylanicum RNA-seq data; “WashU-non-intestine-biased”, genes with non-intestine-biased expression computed from previously published A. ceylanicum RNA-seq data (S11 Table) [74,75].
https://doi.org/10.1371/journal.pntd.0014106.s009
(XLSX)
S10 Table. The frequencies with which RNA-seq reads (either from our new RNA-seq libraries listed in S8 Table, or from previously published RNA-seq libraries listed in S11 Table were mapped to A. ceylanicum v2.1 genes by Salmon 1.9.0 [233].
https://doi.org/10.1371/journal.pntd.0014106.s010
(XLSX)
S11 Table. Published RNA-seq data of A. ceylanicum [59,74,75] and H. contortus [175] used in this study.
All RNA-seq files were downloaded from the European Nucleotide Archive (ENA). “Species” gives the file’s origin species. “Abbreviation” provides a short abbreviation for each library; these abbreviations have been used for expression levels (TPMs), mapped read counts (reads), and differential gene expression analyses in later supplementary data tables (S12-S17 Tables). “Biological condition (sex, developmental stage, tissue) and replicate number” describes biological content and replication. “Notes” describes any ambiguities that had to be resolved during analysis of the data. “Database” is uniformly ‘ENA’, but is included for completeness in the table. “Accession” and “URL” give SRA accession numbers and ENA Web sources for each file.
https://doi.org/10.1371/journal.pntd.0014106.s011
(XLSX)
S12 Table. Statistical analyses of the overlaps between H. contortus genes encoding protein domains (or other traits) and H. contortus genes with either intestine-biased or non-intestine-biased gene expression.
Statistical significances of overlaps between domain/trait genes and intestine-biased or non-intestine-biased genes were computed and described as in S3 Table.
https://doi.org/10.1371/journal.pntd.0014106.s012
(XLSX)
S13 Table. Statistical analyses of the overlaps between A. ceylanicum genes encoding protein domains (or other traits) and A. ceylanicum genes with positively or negatively immunoregulated intestinal or non-intestinal gene expression.
Statistical significances of overlaps between domain/trait genes and positively immunoregulated intestinal genes were computed and described as in S3 Table. In addition, pairwise overlaps between immunoregulated gene sets and other categories (such as “Male-biased”) are provided in the “Various” spreadsheet; since these do not involve multiple comparisons of entire domain sets, only p-values rather than q-values are computed for these overlaps.
https://doi.org/10.1371/journal.pntd.0014106.s013
(XLSX)
S14 Table. Statistical analyses of the overlaps between A. ceylanicum genes encoding protein domains (or other traits) and A. ceylanicum genes encoding ES proteins that also have positively immunoregulated gene expression in 19-day intestines.
Statistical significances of overlaps between domain/trait genes and positively immunoregulated ES genes were computed and described as in S3 Table. In addition, pairwise overlaps between the positively immunoregulated intestinal ES gene set and other categories (such as “Male-biased”) are provided in the “Various” spreadsheet; since these do not involve multiple comparisons of entire domain sets, only p-values rather than q-values are computed for these overlaps.
https://doi.org/10.1371/journal.pntd.0014106.s014
(XLSX)
S15 Table. Software used in this study.
“Software” provides the name of each computer program or suite of computer programs; “Purpose” describes why this software was used here; “Main web site (URL) or code location” gives the primary Web site for this software; “Bioconda source (if used)” gives the bioconda web site for programs that were installed as bioconda environments; “Online documentation” gives the web site for detailed online manuals, if a given program has one. Publications and arguments for each program are cited and described in Methods.
https://doi.org/10.1371/journal.pntd.0014106.s015
(XLSX)
S16 Table. Published genomic data used in this study.
Genome sequences, coding sequences, proteomes, and gene annotations of relevant nematodes were downloaded from WormBase [200] or WormBase ParaSite [184,193] and used for transcriptomic or proteomic analyses; they include Coghlan gene predictions and their corresponding A. ceylanicum genome assembly, along with UniProt [230] and RefSeq [231] proteome databases from highly GO-annotated model organisms. Four separate spreadsheets are given for data files of genomes, proteomes, CDS DNA sets, and gene annotations in GFF format. For each data file, “Species” gives the biological species described by the file, “Comments” describes the particular use(s) to which the data file was put in this study, and “URL” gives the Web source of the file.
https://doi.org/10.1371/journal.pntd.0014106.s016
(XLSX)
S17 Table. LC-MS/MS analyses of two A. ceylanicum ES protein sets, E20201108-05 and E20201108-07.
Data columns were generated by Proteome Discoverer 2.4; other details of the analysis are given in Methods.
https://doi.org/10.1371/journal.pntd.0014106.s017
(XLSX)
S1 Fig. Gene expression in A. ceylanicum.
Gene activity is shown for all 33,190 A. ceylanicum genes, with biological replicates on the x-axis and individual genes on the y-axis. Expression levels are in TPM (log10). In S1A Fig, replicates are ordered as in Fig 4; in S1B Fig, replicates are ordered by their similarity of expression (and a dendrogram for their similiaries is shown along the top x-axis). Genes sharing similar patterns of expression are split into 10 clusters. Biological replicates are: young 12-day hookworm adults from normal hosts (YA_12dpi_noDEX); young 12-day adults from immunosuppressed hosts (YA_12dpi_DEX); dissected 19-day intestines from mature hookworm adults in normal hosts (intestines_19dpi_noDEX); dissected 19-day intestines in immunosuppressed hosts (intestines_19dpi_DEX); dissected 19-day non-intestinal tissues in normal hosts (non.intestines_19dpi_noDEX); dissected 19-day non-intestinal tissues in immunosuppressed hosts (non.intestines_19dpi_DEX); previously published adult male intestine (WashU_male_intestine); published adult males (WashU_adult_male); and published adult females (WashU_adult_fem). These heatmaps include two biological replicates (19-day nonDEX intestine replicate 2 and 19-day DEX intestine replicate 2) that clustered anomalously with replicates 1 and 3 of intestinal RNA-seq samples with opposite nonDEX versus DEX status. We interpret this to mean that labeling for the nonDEX/DEX status of these samples was accidentally reversed between RNA harvesting and RNA-seq. We thus omitted these two samples from differential gene expression analysis.
https://doi.org/10.1371/journal.pntd.0014106.s018
(PDF)
S1 File. An R scrsipt used in batch mode for differential gene expression with edgeR 3.36.0 and R 4.1.3.
https://doi.org/10.1371/journal.pntd.0014106.s019
(TXT)
S2 File. Three R scripts used in batch mode for visualizing differential gene expression in volcano plots with EnhancedVolcano 1.24.0 and R 4.4.3.
https://doi.org/10.1371/journal.pntd.0014106.s020
(TXT)
Acknowledgments
We thank Titus Brown and the Michigan State University High-Performance Computing Center (supported by U.S. Department of Agriculture grant 2010-65205-20361 and NIFA-National Science Foundation (NSF) grant IOS-0923812) for computational support; additional computing was enabled by start-up and research allocations from NSF XSEDE (TG-MCB180039 and TG-MCB190010). We also thank the Millard and Muriel Jacobs Genetics and Genomics Laboratory at the California Institute of Technology for sequencing and computational support.
Disclaimer: This manuscript is the result of funding in whole or in part by the National Institutes of Health (NIH). It is subject to the NIH Public Access Policy. Through acceptance of this federal funding, NIH has been given a right to make this manuscript publicly available in PubMed Central upon the Official Date of Publication, as defined by NIH.
References
- 1. Bethony J, Brooker S, Albonico M, Geiger SM, Loukas A, Diemert D, et al. Soil-transmitted helminth infections: ascariasis, trichuriasis, and hookworm. Lancet. 2006;367(9521):1521–32. pmid:16679166
- 2. Traub RJ. Ancylostoma ceylanicum, a re-emerging but neglected parasitic zoonosis. Int J Parasitol. 2013;43(12–13):1009–15. pmid:23968813
- 3. Pullan RL, Smith JL, Jasrasaria R, Brooker SJ. Global numbers of infection and disease burden of soil transmitted helminth infections in 2010. Parasit Vectors. 2014;7:37. pmid:24447578
- 4. Beaver PC. Light, long-lasting necator infection in a volunteer. The American Journal of Tropical Medicine and Hygiene. 1988;39(4):369–72. pmid:3189697
- 5. Gems D. Longevity and ageing in parasitic and free-living nematodes. Biogerontology. 2000;1(4):289–307. pmid:11708211
- 6. Hoagland KE, Schad GA. Necator americanus and Ancylostoma duodenale: life history parameters and epidemiological implications of two sympatric hookworms of humans. Exp Parasitol. 1978;44(1):36–49. pmid:627275
- 7. Ranjit N, Zhan B, Hamilton B, Stenzel D, Lowther J, Pearson M, et al. Proteolytic degradation of hemoglobin in the intestine of the human hookworm Necator americanus. J Infect Dis. 2009;199(6):904–12. pmid:19434933
- 8. Maizels RM, Smits HH, McSorley HJ. Modulation of Host Immunity by Helminths: The Expanding Repertoire of Parasite Effector Molecules. Immunity. 2018;49(5):801–18. pmid:30462997
- 9. Loukas A, Hotez PJ, Diemert D, Yazdanbakhsh M, McCarthy JS, Correa-Oliveira R, et al. Hookworm infection. Nat Rev Dis Primers. 2016;2:16088. pmid:27929101
- 10. Keiser J, Utzinger J. Efficacy of current drugs against soil-transmitted helminth infections: systematic review and meta-analysis. JAMA. 2008;299(16):1937–48. pmid:18430913
- 11. Orr AR, Quagraine JE, Suwondo P, George S, Harrison LM, Dornas FP. Genetic markers of benzimidazole resistance among human hookworms (Necator americanus) in Kintampo North Municipality, Ghana. The American Journal of Tropical Medicine and Hygiene. 2019;100(2):351–6. pmid:30734697
- 12. Vlaminck J, Cools P, Albonico M, Ame S, Ayana M, Cringoli G, et al. Therapeutic efficacy of albendazole against soil-transmitted helminthiasis in children measured by five diagnostic methods. PLoS Negl Trop Dis. 2019;13(8):e0007471. pmid:31369562
- 13. Walker M, Cools P, Albonico M, Ame SM, Ayana M, Dana D, et al. Individual responses to a single oral dose of albendazole indicate reduced efficacy against soil-transmitted helminths in an area with high drug pressure. PLoS Negl Trop Dis. 2021;15(10):e0009888. pmid:34665810
- 14. Venkatesan A, Jimenez Castro PD, Morosetti A, Horvath H, Chen R, Redman E, et al. Molecular evidence of widespread benzimidazole drug resistance in Ancylostoma caninum from domestic dogs throughout the USA and discovery of a novel β-tubulin benzimidazole resistance mutation. PLoS Pathog. 2023;19(3):e1011146. pmid:36862759
- 15. Jia T-W, Melville S, Utzinger J, King CH, Zhou X-N. Soil-transmitted helminth reinfection after drug treatment: a systematic review and meta-analysis. PLoS Negl Trop Dis. 2012;6(5):e1621. pmid:22590656
- 16. Cohen J. Unfilled Vials. Science. 2016;351(6268):16–9. pmid:26721985
- 17. Zawawi A, Else KJ. Soil-Transmitted Helminth Vaccines: Are We Getting Closer?. Front Immunol. 2020;11:576748. pmid:33133094
- 18.
Knox D. Proteases as Vaccines Against Gastrointestinal Nematode Parasites of Sheep and Cattle. Parasitic Helminths. Wiley. 2012. 399–420. https://doi.org/10.1002/9783527652969.ch24
- 19. Pearson MS, Tribolet L, Cantacessi C, Periago MV, Valero MA, Jariwala AR, et al. Molecular mechanisms of hookworm disease: stealth, virulence, and vaccines. J Allergy Clin Immunol. 2012;130(1):13–21. pmid:22742835
- 20. Manoury B, Gregory WF, Maizels RM, Watts C. Bm-CPI-2, a cystatin homolog secreted by the filarial parasite Brugia malayi, inhibits class II MHC-restricted antigen processing. Curr Biol. 2001;11(6):447–51. pmid:11301256
- 21. Hartmann S, Lucius R. Modulation of host immune responses by nematode cystatins. Int J Parasitol. 2003;33(11):1291–302. pmid:13678644
- 22. Klotz C, Ziegler T, Figueiredo AS, Rausch S, Hepworth MR, Obsivac N, et al. A helminth immunomodulator exploits host signaling events to regulate cytokine production in macrophages. PLoS Pathog. 2011;7(1):e1001248. pmid:21253577
- 23. Tribolet L, Cantacessi C, Pickering DA, Navarro S, Doolan DL, Trieu A, et al. Probing of a human proteome microarray with a recombinant pathogen protein reveals a novel mechanism by which hookworms suppress B-cell receptor signaling. J Infect Dis. 2015;211(3):416–25. pmid:25139017
- 24. Wilbers RHP, Schneiter R, Holterman MHM, Drurey C, Smant G, Asojo OA, et al. Secreted venom allergen-like proteins of helminths: Conserved modulators of host responses in animals and plants. PLoS Pathog. 2018;14(10):e1007300. pmid:30335852
- 25. Loukas A, Doedens A, Hintz M, Maizels RM. Identification of a new C-type lectin, TES-70, secreted by infective larvae of Toxocara canis, which binds to host ligands. Parasitology. 2000;121 Pt 5:545–54. pmid:11128806
- 26. Yoshida A, Nagayasu E, Horii Y, Maruyama H. A novel C-type lectin identified by EST analysis in tissue migratory larvae of Ascaris suum. Parasitol Res. 2012;110(4):1583–6. pmid:22006188
- 27. Harnett MM, Melendez AJ, Harnett W. The therapeutic potential of the filarial nematode-derived immunodulator, ES-62 in inflammatory disease. Clin Exp Immunol. 2010;159(3):256–67. pmid:19968663
- 28. Jex AR, Nejsum P, Schwarz EM, Hu L, Young ND, Hall RS, et al. Genome and transcriptome of the porcine whipworm Trichuris suis. Nat Genet. 2014;46(7):701–6. pmid:24929829
- 29. Shinoda K, Choe A, Hirahara K, Kiuchi M, Kokubo K, Ichikawa T, et al. Nematode ascarosides attenuate mammalian type 2 inflammatory responses. Proc Natl Acad Sci U S A. 2022;119(9):e2108686119. pmid:35210367
- 30. Buck AH, Coakley G, Simbari F, McSorley HJ, Quintana JF, Le Bihan T, et al. Exosomes secreted by nematode parasites transfer small RNAs to mammalian cells and modulate innate immunity. Nat Commun. 2014;5:5488. pmid:25421927
- 31. Coakley G, McCaskill JL, Borger JG, Simbari F, Robertson E, Millar M, et al. Extracellular Vesicles from a Helminth Parasite Suppress Macrophage Activation and Constitute an Effective Vaccine for Protective Immunity. Cell Rep. 2017;19(8):1545–57. pmid:28538175
- 32. Durette-Desset MC, Beveridge I, Spratt DM. The origins and evolutionary expansion of the Strongylida (Nematoda). Int J Parasitol. 1994;24(8):1139–65. pmid:7729974
- 33. Clark WC. Origins of the parasitic habit in the nematoda. Int J Parasitol. 1994;24(8):1117–29. pmid:7729972
- 34. Anderson RC. The origins of zooparasitic nematodes. Can J Zool. 1984;62(3):317–28.
- 35. Ryan SM, Ruscher R, Johnston WA, Pickering DA, Kennedy MW, Smith BO, et al. Novel antiinflammatory biologics shaped by parasite-host coevolution. Proc Natl Acad Sci U S A. 2022;119(36):e2202795119. pmid:36037362
- 36. Smallwood TB, Navarro S, Cristofori-Armstrong B, Watkins TS, Tungatt K, Ryan RYM, et al. Synthetic hookworm-derived peptides are potent modulators of primary human immune cell function that protect against experimental colitis in vivo. J Biol Chem. 2021;297(1):100834. pmid:34051231
- 37. Maizels RM. Regulation of immunity and allergy by helminth parasites. Allergy. 2020;75(3):524–34. pmid:31187881
- 38. Lightowlers MW, Rickard MD. Excretory-secretory products of helminth parasites: effects on host immune responses. Parasitology. 1988;96 Suppl:S123-66. pmid:3287288
- 39. Pritchard DI. Antigens of gastrointestinal nematodes. Trans R Soc Trop Med Hyg. 1986;80(5):728–34. pmid:3299889
- 40. Abuzeid AMI, Zhou X, Huang Y, Li G. Twenty-five-year research progress in hookworm excretory/secretory products. Parasit Vectors. 2020;13(1):136. pmid:32171305
- 41. Stirewalt MA. Chemical biology of secretions of larval helminths. Ann N Y Acad Sci. 1963;113:36–53. pmid:14088704
- 42. Huang Y, Abuzeid AMI, Liu Y, He L, Zhao Q, Yan X, et al. Identification and localization of hookworm platelet inhibitor in Ancylostoma ceylanicum. Infect Genet Evol. 2020;77:104102. pmid:31689543
- 43. Zhan B, Liu Y, Badamchian M, Williamson A, Feng J, Loukas A, et al. Molecular characterisation of the Ancylostoma-secreted protein family from the adult stage of Ancylostoma caninum. Int J Parasitol. 2003;33(9):897–907. pmid:12906874
- 44. Logan J, Pearson MS, Manda SS, Choi YJ, Field M, Eichenberger RM, et al. Comprehensive analysis of the secreted proteome of adult Necator americanus hookworms. PLoS Negl Trop Dis. 2020;14(5):e0008237. pmid:32453752
- 45. Morante T, Shepherd C, Constantinoiu C, Loukas A, Sotillo J. Revisiting the Ancylostoma Caninum Secretome Provides New Information on Hookworm-Host Interactions. Proteomics. 2017;17(23–24):10.1002/pmic.201700186. pmid:29052354
- 46. Gillis-Germitsch N, Kockmann T, Asmis LM, Tritten L, Schnyder M. The Angiostrongylus vasorum Excretory/Secretory and Surface Proteome Contains Putative Modulators of the Host Coagulation. Front Cell Infect Microbiol. 2021;11:753320. pmid:34796127
- 47. Wang T, Ma G, Ang C-S, Korhonen PK, Koehler AV, Young ND, et al. High throughput LC-MS/MS-based proteomic analysis of excretory-secretory products from short-term in vitro culture of Haemonchus contortus. J Proteomics. 2019;204:103375. pmid:31071474
- 48. Hewitson JP, Harcus Y, Murray J, van Agtmaal M, Filbey KJ, Grainger JR, et al. Proteomic analysis of secretory products from the model gastrointestinal nematode Heligmosomoides polygyrus reveals dominance of venom allergen-like (VAL) proteins. J Proteomics. 2011;74(9):1573–94. pmid:21722761
- 49. Moreno Y, Gros P-P, Tam M, Segura M, Valanparambil R, Geary TG, et al. Proteomic analysis of excretory-secretory products of Heligmosomoides polygyrus assessed with next-generation sequencing transcriptomic information. PLoS Negl Trop Dis. 2011;5(10):e1370. pmid:22039562
- 50. Hewitson JP, Ivens AC, Harcus Y, Filbey KJ, McSorley HJ, Murray J, et al. Secretion of protective antigens by tissue-stage nematode larvae revealed by proteomic analysis and vaccination-induced sterile immunity. PLoS Pathog. 2013;9(8):e1003492. pmid:23966853
- 51. Sotillo J, Sanchez-Flores A, Cantacessi C, Harcus Y, Pickering D, Bouchery T, et al. Secreted proteomes of different developmental stages of the gastrointestinal nematode Nippostrongylus brasiliensis. Mol Cell Proteomics. 2014;13(10):2736–51. pmid:24994561
- 52. Ferguson AA, Rossi HL, Herbert DR. The Secretome of Adult Murine Hookworms Is Shaped by Host Expression of STAT6. Parasite Immunol. 2024;46(7):e13056. pmid:39073185
- 53. Price DRG, Steele P, Frew D, McLean K, Androscuk D, Geldhof P, et al. Characterisation of protective vaccine antigens from the thiol-containing components of excretory/secretory material of Ostertagia ostertagi. Vet Parasitol. 2024;328:110154. pmid:38490160
- 54. Uzoechi SC, Rosa BA, Singh KS, Choi Y-J, Bracken BK, Brindley PJ, et al. Excretory/Secretory Proteome of Females and Males of the Hookworm Ancylostoma ceylanicum. Pathogens. 2023;12(1):95. pmid:36678443
- 55. Wong Y, Rosa BA, Becker L, Camberis M, LeGros G, Zhan B, et al. Proteomic characterization and comparison of the infective and adult life stage secretomes from Necator americanus and Ancylostoma ceylanicum. PLoS Negl Trop Dis. 2025;19(1):e0012780. pmid:39832284
- 56. Stracke K, Jex AR, Traub RJ. Zoonotic Ancylostomiasis: An Update of a Continually Neglected Zoonosis. Am J Trop Med Hyg. 2020;103(1):64–8. pmid:32342850
- 57. Garside P, Behnke JM. Ancylostoma ceylanicum in the hamster: observations on the host-parasite relationship during primary infection. Parasitology. 1989;98 Pt 2:283–9. pmid:2762039
- 58. Colella V, Bradbury R, Traub R. Ancylostoma ceylanicum. Trends Parasitol. 2021;37(9):844–5. pmid:34049804
- 59. Schwarz EM, Hu Y, Antoshechkin I, Miller MM, Sternberg PW, Aroian RV. The genome and transcriptome of the zoonotic hookworm Ancylostoma ceylanicum identify infection-specific gene families. Nat Genet. 2015;47(4):416–22. pmid:25730766
- 60. Hatje K, Mühlhausen S, Simm D, Kollmar M. The Protein-Coding Human Genome: Annotating High-Hanging Fruits. Bioessays. 2019;41(11):e1900066. pmid:31544971
- 61. Guigó R. Genome annotation: From human genetics to biodiversity genomics. Cell Genom. 2023;3(8):100375. pmid:37601977
- 62. Brůna T, Hoff KJ, Lomsadze A, Stanke M, Borodovsky M. BRAKER2: automatic eukaryotic genome annotation with GeneMark-EP+ and AUGUSTUS supported by a protein database. NAR Genom Bioinform. 2021;3(1):lqaa108. pmid:33575650
- 63. Käll L, Krogh A, Sonnhammer ELL. A combined transmembrane topology and signal peptide prediction method. J Mol Biol. 2004;338(5):1027–36. pmid:15111065
- 64. Mistry J, Chuguransky S, Williams L, Qureshi M, Salazar GA, Sonnhammer ELL, et al. Pfam: The protein families database in 2021. Nucleic Acids Res. 2021;49(D1):D412–9. pmid:33125078
- 65. Paysan-Lafosse T, Blum M, Chuguransky S, Grego T, Pinto BL, Salazar GA. InterPro in 2022. Nucleic Acids Research. 2023;51(D1):D418–27. pmid:36350672
- 66. Emms DM, Kelly S. OrthoFinder: phylogenetic orthology inference for comparative genomics. Genome Biol. 2019;20(1):238. pmid:31727128
- 67. Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM. Gene ontology: tool for the unification of biology. Nat Genet. 2000;25(1):25–9. pmid:10802651
- 68. Gene Ontology Consortium. The Gene Ontology resource: enriching a GOld mine. Nucleic Acids Res. 2021;49(D1):D325–34. pmid:33290552
- 69. Dimou E, Nickel W. Unconventional mechanisms of eukaryotic protein secretion. Curr Biol. 2018;28(8):R406–10. pmid:29689224
- 70. Wang J, Barr MM, Wehman AM. Extracellular vesicles. Genetics. 2024;227(4):iyae088. pmid:38884207
- 71. Balmer EA, Faso C. The Road Less Traveled? Unconventional Protein Secretion at Parasite-Host Interfaces. Front Cell Dev Biol. 2021;9:662711. pmid:34109175
- 72. Athanasouli M, Witte H, Weiler C, Loschko T, Eberhardt G, Sommer RJ, et al. Comparative genomics and community curation further improve gene annotations in the nematode Pristionchus pacificus. BMC Genomics. 2020;21(1):708. pmid:33045985
- 73. Moya ND, Stevens L, Miller IR, Sokol CE, Galindo JL, Bardas AD, et al. Novel and improved Caenorhabditis briggsae gene models generated by community curation. BMC Genomics. 2023;24(1):486. pmid:37626289
- 74. Wei J, Damania A, Gao X, Liu Z, Mejia R, Mitreva M, et al. The hookworm Ancylostoma ceylanicum intestinal transcriptome provides a platform for selecting drug and vaccine candidates. Parasit Vectors. 2016;9(1):518. pmid:27677574
- 75. Bernot JP, Rudy G, Erickson PT, Ratnappan R, Haile M, Rosa BA, et al. Transcriptomic analysis of hookworm Ancylostoma ceylanicum life cycle stages reveals changes in G-protein coupled receptor diversity associated with the onset of parasitism. Int J Parasitol. 2020;50(8):603–10. pmid:32592811
- 76. Caffrey CR, Goupil L, Rebello KM, Dalton JP, Smith D. Cysteine proteases as digestive enzymes in parasitic helminths. PLoS Negl Trop Dis. 2018;12(8):e0005840. pmid:30138310
- 77. Williamson AL, Lustigman S, Oksov Y, Deumic V, Plieskatt J, Mendez S, et al. Ancylostoma caninum MTP-1, an astacin-like metalloprotease secreted by infective hookworm larvae, is involved in tissue migration. Infect Immun. 2006;74(2):961–7. pmid:16428741
- 78. Yang Y, Wen Y jun, Cai YN, Vallée I, Boireau P, Liu MY, et al. Serine proteases of parasitic helminths. Korean J Parasitol. 2015;53(1):1–11. pmid:25748703
- 79. Knox DP. Proteinase inhibitors and helminth parasite infection. Parasite Immunol. 2007;29(2):57–71. pmid:17241394
- 80. Chu D, Bungiro RD, Ibanez M, Harrison LM, Campodonico E, Jones BF, et al. Molecular characterization of Ancylostoma ceylanicum Kunitz-type serine protease inhibitor: evidence for a role in hookworm-associated growth delay. Infect Immun. 2004;72(4):2214–21. pmid:15039345
- 81. Matoušková P, Vokřál I, Lamka J, Skálová L. The Role of Xenobiotic-Metabolizing Enzymes in Anthelmintic Deactivation and Resistance in Helminths. Trends Parasitol. 2016;32(6):481–91. pmid:26968642
- 82. Chhabra S, Chang SC, Nguyen HM, Huq R, Tanner MR, Londono LM. Kv1.3 channel-blocking immunomodulatory peptides from parasitic worms: implications for autoimmune diseases. FASEB J. 2014. pmid:24891519
- 83. McNeilly TN, Frew D, Burgess STG, Wright H, Bartley DJ, Bartley Y, et al. Niche-specific gene expression in a parasitic nematode; increased expression of immunomodulators in Teladorsagia circumcincta larvae derived from host mucosa. Sci Rep. 2017;7(1):7214. pmid:28775251
- 84. Harcus Y, Nicoll G, Murray J, Filbey K, Gomez-Escobar N, Maizels RM. C-type lectins from the nematode parasites Heligmosomoides polygyrus and Nippostrongylus brasiliensis. Parasitol Int. 2009;58(4):461–70. pmid:19751847
- 85. Wang Z, Lu QQ, Weng MM, Li YL, Han LL, Song YY, et al. Binding of Trichinella spiralis C-type lectin with syndecan-1 on intestinal epithelial cells mediates larval invasion of intestinal epithelium. Vet Res. 2023;54(1):86. pmid:37784173
- 86. Yatsuda AP, Eysker M, Vieira-Bressan MCR, De Vries E. A family of activation associated secreted protein (ASP) homologues of Cooperia punctata. Res Vet Sci. 2002;73(3):297–306. pmid:12443689
- 87. Hawdon JM, Jones BF, Hoffman DR, Hotez PJ. Cloning and characterization of Ancylostoma-secreted protein. A novel protein associated with the transition to parasitism by infective hookworm larvae. J Biol Chem. 1996;271(12):6672–8. pmid:8636085
- 88. Lo SK, Rahman A, Xu N, Zhou MY, Nagpala P, Jaffe HA, et al. Neutrophil inhibitory factor abrogates neutrophil adhesion by blockade of CD11a and CD11b beta(2) integrins. Mol Pharmacol. 1999;56(5):926–32. pmid:10531396
- 89. Moyle M, Foster DL, McGrath DE, Brown SM, Laroche Y, De Meutter J, et al. A hookworm glycoprotein that inhibits neutrophil function is a ligand of the integrin CD11b/CD18. J Biol Chem. 1994;269(13):10008–15. pmid:7908286
- 90. Del Valle A, Jones BF, Harrison LM, Chadderdon RC, Cappello M. Isolation and molecular cloning of a secreted hookworm platelet inhibitor from adult Ancylostoma caninum. Mol Biochem Parasitol. 2003;129(2):167–77. pmid:12850261
- 91. Hunt VL, Tsai IJ, Selkirk ME, Viney M. The genome of Strongyloides spp. gives insights into protein families with a putative role in nematode parasitism. Parasitology. 2017;144(3):343–58. pmid:27618747
- 92. Tritten L, Ballesteros C, Beech R, Geary TG, Moreno Y. Mining nematode protein secretomes to explain lifestyle and host specificity. PLoS Negl Trop Dis. 2021;15(9):e0009828. pmid:34587193
- 93. Stevens L, Martínez-Ugalde I, King E, Wagah M, Absolon D, Bancroft R, et al. Ancient diversity in host-parasite interaction genes in a model parasitic nematode. Nat Commun. 2023;14(1):7776. pmid:38012132
- 94. Tian X, Lu M, Wang W, Jia C, Muhammad E, Yan R, et al. HcTTR: a novel antagonist against goat interleukin 4 derived from the excretory and secretory products of Haemonchus contortus. Vet Res. 2019;50(1):42. pmid:31164173
- 95. Parks SC, Nguyen S, Nasrolahi S, Bhat C, Juncaj D, Lu D, et al. Parasitic nematode fatty acid- and retinol-binding proteins compromise host immunity by interfering with host lipid signaling pathways. PLoS Pathog. 2021;17(10):e1010027. pmid:34714893
- 96. Parks SC, Nguyen S, Boulanger MJ, Dillman AR. The FAR protein family of parasitic nematodes. PLoS Pathog. 2022;18(4):e1010424. pmid:35446920
- 97. Azizpor P, Montoya J, Eyabi F, Ramirez J, Hill T, Pena R. A secreted fatty acid- and retinol- binding protein from Heligmosomoides polygyrus suppresses host macrophage polarization. bioRxiv. 2025. pmid:40501701
- 98. Fujiwara RT, Zhan B, Mendez S, Loukas A, Bueno LL, Wang Y, et al. Reduction of worm fecundity and canine host blood loss mediates protection against hookworm infection elicited by vaccination with recombinant Ac-16. Clin Vaccine Immunol. 2007;14(3):281–7. pmid:17267592
- 99. Tsuji N, Miyoshi T, Islam MK, Isobe T, Yoshihara S, Arakawa T, et al. Recombinant Ascaris 16-Kilodalton protein-induced protection against Ascaris suum larval migration after intranasal vaccination in pigs. J Infect Dis. 2004;190(10):1812–20. pmid:15499538
- 100. Wei J, Versteeg L, Liu Z, Keegan B, Gazzinelli-Guimarães AC, Fujiwara RT, et al. Yeast-expressed recombinant As16 protects mice against Ascaris suum infection through induction of a Th2-skewed immune response. PLoS Negl Trop Dis. 2017;11(7):e0005769. pmid:28708895
- 101. Tsuji N, Suzuki K, Kasuga-Aoki H, Matsumoto Y, Arakawa T, Ishiwata K, et al. Intranasal immunization with recombinant Ascaris suum 14-kilodalton antigen coupled with cholera toxin B subunit induces protective immunity to A. suum infection in mice. Infect Immun. 2001;69(12):7285–92. pmid:11705899
- 102. García-Mayoral MF, Treviño MA, Pérez-Piñar T, Caballero ML, Knaute T, Umpierrez A, et al. Relationships between IgE/IgG4 epitopes, structure and function in Anisakis simplex Ani s 5, a member of the SXP/RAL-2 protein family. PLoS Negl Trop Dis. 2014;8(3):e2735. pmid:24603892
- 103. Zhan B, Bottazzi ME, Hotez PJ, Lustigman S. Advancing a Human Onchocerciasis Vaccine From Antigen Discovery to Efficacy Studies Against Natural Infection of Cattle With Onchocerca ochengi. Front Cell Infect Microbiol. 2022;12:869039. pmid:35444961
- 104. van Die I, Cummings RD. The Mannose Receptor in Regulation of Helminth-Mediated Host Immunity. Front Immunol. 2017;8:1677. pmid:29238348
- 105. Jung S, Sönnichsen FD, Hung C-W, Tholey A, Boidin-Wichlacz C, Haeusgen W, et al. Macin family of antimicrobial proteins combines antimicrobial and nerve repair activities. J Biol Chem. 2012;287(17):14246–58. pmid:22396551
- 106. Irvine A, Huws SA, Atkinson LE, Mousley A. Exploring the antimicrobial peptidome of nematodes through phylum-spanning in silico analyses highlights novel opportunities for pathogen control. PLoS Negl Trop Dis. 2023;17(9):e0011618. pmid:37672536
- 107. Stassens P, Bergum PW, Gansemans Y, Jespers L, Laroche Y, Huang S, et al. Anticoagulant repertoire of the hookworm Ancylostoma caninum. Proc Natl Acad Sci U S A. 1996;93(5):2149–54. pmid:8700900
- 108. Gounaris K, Selkirk ME. Parasite nucleotide-metabolizing enzymes and host purinergic signalling. Trends Parasitol. 2005;21(1):17–21. pmid:15639736
- 109. Guiguet A, Dubreuil G, Harris MO, Appel HM, Schultz JC, Pereira MH, et al. Shared weapons of blood- and plant-feeding insects: Surprising commonalities for manipulating hosts. J Insect Physiol. 2016;84:4–21. pmid:26705897
- 110. Pala ZR, Alves ESTL, Minai M, Crews B, Patino-Martinez E, Carmona-Rivera C, et al. Anopheles salivary apyrase regulates blood meal hemostasis and drives malaria parasite transmission. bioRxiv. 2023. pmid:37292610
- 111. Parks SC, Okakpu OK, Azizpor P, Nguyen S, Martinez-Beltran S, Claudio I, et al. Parasitic nematode secreted phospholipase A2 suppresses cellular and humoral immunity by targeting hemocytes in Drosophila melanogaster. Front Immunol. 2023;14:1122451. pmid:37006283
- 112. Cho Y, Jones BF, Vermeire JJ, Leng L, DiFedele L, Harrison LM, et al. Structural and functional characterization of a secreted hookworm Macrophage Migration Inhibitory Factor (MIF) that interacts with the human MIF receptor CD74. J Biol Chem. 2007;282(32):23447–56. pmid:17567581
- 113. Sparkes A, De Baetselier P, Roelants K, De Trez C, Magez S, Van Ginderachter JA, et al. The non-mammalian MIF superfamily. Immunobiology. 2017;222(3):473–82. pmid:27780588
- 114. Bouchery T, Moyat M, Sotillo J, Silverstein S, Volpe B, Coakley G, et al. Hookworms Evade Host Immunity by Secreting a Deoxyribonuclease to Degrade Neutrophil Extracellular Traps. Cell Host Microbe. 2020;27(2):277-289.e6. pmid:32053791
- 115. Noon JB, Schwarz EM, Ostroff GR, Aroian RV. A highly expressed intestinal cysteine protease of Ancylostoma ceylanicum protects vaccinated hamsters from hookworm infection. PLoS Negl Trop Dis. 2019;13(4):e0007345. pmid:31009474
- 116. Wiśniewski M, Jaros S, Bąska P, Cappello M, Wędrychowicz H. Ancylostoma ceylanicum metalloprotease 6 DNA vaccination induces partial protection against hookworm challenge infection. Acta Parasitol. 2013;58(3):376–83. pmid:23990436
- 117. Wiśniewski M, Jaros S, Bąska P, Cappello M, Długosz E, Wędrychowicz H. Hamsters vaccinated with Ace-mep-7 DNA vaccine produced protective immunity against Ancylostoma ceylanicum infection. Exp Parasitol. 2016;163:1–7. pmid:26795262
- 118. Xiao S, Zhan B, Xue J, Goud GN, Loukas A, Liu Y, et al. The evaluation of recombinant hookworm antigens as vaccines in hamsters (Mesocricetus auratus) challenged with human hookworm, Necator americanus. Exp Parasitol. 2008;118(1):32–40. pmid:17645877
- 119. Zhan B, Perally S, Brophy PM, Xue J, Goud G, Liu S, et al. Molecular cloning, biochemical characterization, and partial protective immunity of the heme-binding glutathione S-transferases from the human hookworm Necator americanus. Infect Immun. 2010;78(4):1552–63. pmid:20145100
- 120. Berkachy R, Smyth DJ, Schnoeller C, Harcus Y, Maizels RM, Selkirk ME, et al. Characterisation of the secreted apyrase family of Heligmosomoides polygyrus. Int J Parasitol. 2021;51(1):39–48. pmid:32931780
- 121. Nisbet AJ, McNeilly TN, Price DRG, Oliver EM, Bartley Y, Mitchell M, et al. The rational simplification of a recombinant cocktail vaccine to control the parasitic nematode Teladorsagia circumcincta. Int J Parasitol. 2019;49(3–4):257–65. pmid:30690091
- 122. Fairfax KC, Vermeire JJ, Harrison LM, Bungiro RD, Grant W, Husain SZ, et al. Characterisation of a fatty acid and retinol binding protein orthologue from the hookworm Ancylostoma ceylanicum. Int J Parasitol. 2009;39(14):1561–71. pmid:19591834
- 123. Jenkins RE, Taylor MJ, Gilvary N, Bianco AE. Characterization of a secreted antigen of Onchocerca volvulus with host-protective potential. Parasite Immunol. 1996;18(1):29–42. pmid:9223154
- 124. Yadav S, Sharma P, Sharma A, Ganga L, Saxena JK, Srivastava M. Immunization with Brugia malayi Calreticulin Protein Generates Robust Antiparasitic Immunity and Offers Protection during Experimental Lymphatic Filariasis. ACS Infect Dis. 2021;7(4):790–9. pmid:33667079
- 125. McCarthy JS, Wieseman M, Tropea J, Kaslow D, Abraham D, Lustigman S. Onchocerca volvulus glycolytic enzyme fructose-1,6-bisphosphate aldolase as a target for a protective immune response in humans. Infect Immun. 2002;70(2):851–8. pmid:11796620
- 126. Chen N, Yuan Z-G, Xu M-J, Zhou D-H, Zhang X-X, Zhang Y-Z, et al. Ascaris suum enolase is a potential vaccine candidate against ascariasis. Vaccine. 2012;30(23):3478–82. pmid:22465737
- 127. Tian X, Lu M, Jia C, Bu Y, Aimulajiang K, Zhang Y, et al. Haemonchus contortus transthyretin domain - containing protein (HcTTR): A promising vaccine candidate against Haemonchus contortus infection. Vet Parasitol. 2020;279:109045. pmid:32045836
- 128. Bethony J, Loukas A, Smout M, Brooker S, Mendez S, Plieskatt J, et al. Antibodies against a secreted protein from hookworm larvae reduce the intensity of hookworm infection in humans and vaccinated laboratory animals. FASEB J. 2005;19(12):1743–5. pmid:16037096
- 129. Zhan B, Hotez PJ, Wang Y, Hawdon JM. A developmentally regulated metalloprotease secreted by host-stimulated Ancylostoma caninum third-stage infective larvae is a member of the astacin family of proteases. Mol Biochem Parasitol. 2002;120(2):291–6. pmid:11897134
- 130. Neophytou K, Martinez-Ugalde I, Fenton T, Robertson E, Strachan LJ, Harcus Y. A non-vesicular Argonaute protein is transmitted from nematode to mouse and is important for parasite survival. bioRxiv. 2025. pmid:40236219
- 131. Pearson MS, Bethony JM, Pickering DA, de Oliveira LM, Jariwala A, Santiago H, et al. An enzymatically inactivated hemoglobinase from Necator americanus induces neutralizing antibodies against multiple hookworm species and protects dogs against heterologous hookworm infection. FASEB J. 2009;23(9):3007–19. pmid:19380510
- 132. Degner JF, Marioni JC, Pai AA, Pickrell JK, Nkadori E, Gilad Y, et al. Effect of read-mapping biases on detecting allele-specific expression from RNA-sequencing data. Bioinformatics. 2009;25(24):3207–12. pmid:19808877
- 133. Bell AD, Chou HT, Valencia F, Paaby AB. Beyond the reference: gene expression variation and transcriptional response to RNA interference in Caenorhabditis elegans. G3 (Bethesda). 2023;13(8):jkad112. pmid:37221008
- 134. Rezansoff AM, Laing R, Martinelli A, Stasiuk S, Redman E, Bartley D, et al. The confounding effects of high genetic diversity on the determination and interpretation of differential gene expression analysis in the parasitic nematode Haemonchus contortus. Int J Parasitol. 2019;49(11):847–58. pmid:31525371
- 135. Bansemir AD, Sukhdeo MV. The food resource of adult Heligmosomoides polygyrus in the small intestine. J Parasitol. 1994;80(1):24–8. pmid:8308654
- 136. Bansemir AD, Sukhdeo MV. Intestinal distribution of worms and host ingesta in Nippostrongylus brasiliensis. J Parasitol. 2001;87(6):1470–2. pmid:11780840
- 137. Thornton JW, Need E, Crews D. Resurrecting the ancestral steroid receptor: ancient origin of estrogen signaling. Science. 2003;301(5640):1714–7. pmid:14500980
- 138. Rodpai R, Sanpool O, Thanchomnang T, Laoraksawong P, Sadaow L, Boonroumkaew P, et al. Exposure to dexamethasone modifies transcriptomic responses of free-living stages of Strongyloides stercoralis. PLoS One. 2021;16(6):e0253701. pmid:34181669
- 139. Genta RM. Dysregulation of strongyloidiasis: a new hypothesis. Clin Microbiol Rev. 1992;5(4):345–55. pmid:1423214
- 140. Herbert DR, Stoltzfus JDC, Rossi HL, Abraham D. Is Strongyloides stercoralis hyperinfection induced by glucocorticoids a result of both suppressed host immunity and altered parasite genetics?. Mol Biochem Parasitol. 2022;251:111511. pmid:36007683
- 141. Gonzalez Akimori D, Dalessandro EJ, Nolan TJ, Stieha CR, Lok JB, Stoltzfus JDC. Transcriptional profiles in Strongyloides stercoralis males reveal deviations from the Caenorhabditis sex determination model. Sci Rep. 2021;11(1):8254. pmid:33859232
- 142. Monsalve GC, Yamamoto KR, Ward JD. A New Tool for Inducible Gene Expression in Caenorhabditis elegans. Genetics. 2019;211(2):419–30. pmid:30504365
- 143. Scanlan JL, Battlay P, Robin C. Ecdysteroid kinase-like (EcKL) paralogs confer developmental tolerance to caffeine in Drosophila melanogaster. Curr Res Insect Sci. 2022;2:100030. pmid:36003262
- 144. Raza A, Williams AR, Abeer MM. Importance of ABC Transporters in the Survival of Parasitic Nematodes and the Prospect for the Development of Novel Control Strategies. Pathogens. 2023;12(6):755. pmid:37375445
- 145. Drew D, North RA, Nagarathinam K, Tanabe M. Structures and General Transport Mechanisms by the Major Facilitator Superfamily (MFS). Chem Rev. 2021;121(9):5289–335. pmid:33886296
- 146. Chen L-Q, Cheung LS, Feng L, Tanner W, Frommer WB. Transport of sugars. Annu Rev Biochem. 2015;84:865–94. pmid:25747398
- 147. Pollo SMJ, Liu H, Coria AL, Rosin N, Labit E, Biernaskie J. A single-cell transcriptomics atlas for the parasitic nematode Heligmosomoides bakeri: Extrapolating model organism information to non-model systems. bioRxiv. 2024.
- 148. Anisimov AP, Tokmakova NP. Proliferation and growth of intestinal epithelium in Ascaris suum (Nematoda) during postnatal ontogeny. I. Mitotic activity in the intestinal epithelium. Sov J Dev Biol. 1974;4(4):341–8. pmid:4428217
- 149. Anisimov AP, Usheva LN. Proliferation and growth of intestinal epithelium in Ascaris suum (Nematoda) during postnatal ontogeny. II. Increase in the number and size of cells as a growth factor. Sov J Dev Biol. 1974;4(4):379–83. pmid:4471355
- 150. Tvermoes BE, Boyd WA, Freedman JH. Molecular characterization of numr-1 and numr-2: genes that increase both resistance to metal-induced stress and lifespan in Caenorhabditis elegans. J Cell Sci. 2010;123(Pt 12):2124–34. pmid:20501697
- 151. Wu C-W, Wimberly K, Pietras A, Dodd W, Atlas MB, Choe KP. RNA processing errors triggered by cadmium and integrator complex disruption are signals for environmental stress. BMC Biol. 2019;17(1):56. pmid:31311534
- 152. Hong M, Zhou X, Zeng C, Xu D, Xu T, Liao S, et al. Nucleolar stress induces nucleolar stress body formation via the NOSR-1/NUMR-1 axis in Caenorhabditis elegans. Nat Commun. 2024;15(1):7256. pmid:39179648
- 153. Taylor SS, Kornev AP. Protein kinases: evolution of dynamic regulatory proteins. Trends Biochem Sci. 2011;36(2):65–77. pmid:20971646
- 154. Kokot T, Köhn M. Emerging insights into serine/threonine-specific phosphoprotein phosphatase function and selectivity. J Cell Sci. 2022;135(19):jcs259618. pmid:36205606
- 155. Diop A, Santorelli D, Malagrinò F, Nardella C, Pennacchietti V, Pagano L, et al. SH2 Domains: Folding, Binding and Therapeutical Approaches. Int J Mol Sci. 2022;23(24):15944. pmid:36555586
- 156. Smith HE. Nematode sperm motility. WormBook. 2014;:1–15. pmid:24715710
- 157. Grant RP, Buttery SM, Ekman GC, Roberts TM, Stewart M. Structure of MFP2 and its function in enhancing MSP polymerization in Ascaris sperm amoeboid motility. J Mol Biol. 2005;347(3):583–95. pmid:15755452
- 158. Rödelsperger C, Ebbing A, Sharma DR, Okumura M, Sommer RJ, Korswagen HC. Spatial Transcriptomics of Nematodes Identifies Sperm Cells as a Source of Genomic Novelty and Rapid Evolution. Mol Biol Evol. 2021;38(1):229–43. pmid:32785688
- 159. Schmitz C, Kinge P, Hutter H. Axon guidance genes identified in a large-scale RNAi screen using the RNAi-hypersensitive Caenorhabditis elegans strain nre-1(hd20) lin-15b(hd126). Proc Natl Acad Sci U S A. 2007;104(3):834–9. pmid:17213328
- 160. Schwarz EM, Kato M, Sternberg PW. Functional transcriptomics of a migrating cell in Caenorhabditis elegans. Proc Natl Acad Sci U S A. 2012;109(40):16246–51. pmid:22991463
- 161. Ow MC, Nishiguchi MA, Dar AR, Butcher RA, Hall SE. RNAi-dependent expression of sperm genes in ADL chemosensory neurons is required for olfactory responses in Caenorhabditis elegans. Front Mol Biosci. 2024;11:1396587. pmid:39055986
- 162. Heger P, Kroiher M, Ndifon N, Schierenberg E. Conservation of MAP kinase activity and MSP genes in parthenogenetic nematodes. BMC Dev Biol. 2010;10:51. pmid:20478028
- 163. Bányai L, Patthy L. Amoebapore homologs of Caenorhabditis elegans. Biochim Biophys Acta. 1998;1429(1):259–64. pmid:9920402
- 164. Hoeckendorf A, Leippe M. SPP-3, a saposin-like protein of Caenorhabditis elegans, displays antimicrobial and pore-forming activity and is located in the intestine and in one head neuron. Dev Comp Immunol. 2012;38(1):181–6. pmid:22677064
- 165. Roeder T, Stanisak M, Gelhaus C, Bruchhaus I, Grötzinger J, Leippe M. Caenopores are antimicrobial peptides in the nematode Caenorhabditis elegans instrumental in nutrition and immunity. Dev Comp Immunol. 2010;34(2):203–9. pmid:19818806
- 166. Hoeckendorf A, Stanisak M, Leippe M. The saposin-like protein SPP-12 is an antimicrobial polypeptide in the pharyngeal neurons of Caenorhabditis elegans and participates in defence against a natural bacterial pathogen. Biochem J. 2012;445(2):205–12. pmid:22519640
- 167. Rooney J, Rivera-de-Torre E, Li R, Mclean K, Price DRG, Nisbet AJ, et al. Structural and functional analyses of nematode-derived antimicrobial peptides support the occurrence of direct mechanisms of worm-microbiota interactions. Comput Struct Biotechnol J. 2024;23:1522–33. pmid:38633385
- 168. He L, Abuzeid AMI, Zhuang T, Zhao Q, Zhu S, Chen X, et al. Expression and biological functions of Ancylostoma ceylanicum saposin-like protein. Parasitol Res. 2021;120(11):3805–13. pmid:34546437
- 169. Wang H, Paesen GC, Nuttall PA, Barbour AG. Male ticks help their mates to feed. Nature. 1998;391(6669):753–4. pmid:9486641
- 170. Hotez PJ, Narasimhan S, Haggerty J, Milstone L, Bhopale V, Schad GA, et al. Hyaluronidase from infective Ancylostoma hookworm larvae and its possible function as a virulence factor in tissue invasion and in cutaneous larva migrans. Infect Immun. 1992;60(3):1018–23. pmid:1541516
- 171. Yang X, Khan S, Zhao X, Zhang J, Nisar A, Feng X. Suppression of hyaluronidase reduces invasion and establishment of Haemonchus contortus larvae in sheep. Vet Res. 2020;51(1):106. pmid:32854758
- 172. Brophy PM, Patterson LH, Pritchard DL. Secretory nematode SOD--offensive or defensive?. Int J Parasitol. 1995;25(7):865–6. pmid:7558576
- 173. Knox DP, Jones DG. A comparison of superoxide dismutase (SOD, EC:1.15.1.1) distribution in gastro-intestinal nematodes. Int J Parasitol. 1992;22(2):209–14. pmid:1587685
- 174. Price DRG, Nisbet AJ, Frew D, Bartley Y, Oliver EM, McLean K, et al. Characterisation of a niche-specific excretory-secretory peroxiredoxin from the parasitic nematode Teladorsagia circumcincta. Parasit Vectors. 2019;12(1):339. pmid:31292008
- 175. Laing R, Kikuchi T, Martinelli A, Tsai IJ, Beech RN, Redman E, et al. The genome and transcriptome of Haemonchus contortus, a key model parasite for drug and vaccine discovery. Genome Biol. 2013;14(8):R88. pmid:23985316
- 176. Cole R, Holroyd N, Tracey A, Berriman M, Viney M. The parasitic nematode Strongyloides ratti exists predominantly as populations of long-lived asexual lineages. Nat Commun. 2023;14(1):6427. pmid:37833369
- 177. Thomas JH, Robertson HM. The Caenorhabditis chemoreceptor gene families. BMC Biol. 2008;6:42. pmid:18837995
- 178. Tellier A, Moreno-Gámez S, Stephan W. Speed of adaptation and genomic footprints of host-parasite coevolution under arms race and trench warfare dynamics. Evolution. 2014;68(8):2211–24. pmid:24749791
- 179. Ebert D, Fields PD. Host-parasite co-evolution and its genomic signature. Nat Rev Genet. 2020;21(12):754–68. pmid:32860017
- 180. Loeb L, Smith AJ. The presence of a substance inhibiting the coagulation of the blood in Anchylostoma. Proceedings of the Pathological Society of Philadelphia. 1904;7(6):173–8.
- 181.
Oliveira PL, Genta FA. Blood Digestion in Triatomine Insects. Entomology in Focus. Springer International Publishing. 2021. 265–84. https://doi.org/10.1007/978-3-030-64548-9_12
- 182. Kvist S, Manzano-Marín A, de Carle D, Trontelj P, Siddall ME. Draft genome of the European medicinal leech Hirudo medicinalis (Annelida, Clitellata, Hirudiniformes) with emphasis on anticoagulants. Sci Rep. 2020;10(1):9885. pmid:32555498
- 183. Paysan-Lafosse T, Andreeva A, Blum M, Chuguransky SR, Grego T, Pinto BL, et al. The Pfam protein families database: embracing AI/ML. Nucleic Acids Research. 2025;53(D1):D523–34. pmid:39540428
- 184. Howe KL, Bolt BJ, Shafie M, Kersey P, Berriman M. WormBase ParaSite - a comprehensive resource for helminth genomics. Molecular and Biochemical Parasitology. 2017;215:2–10. pmid:27899279
- 185. International Helminth Genomes Consortium. Comparative genomics of the major parasitic worms. Nat Genet. 2019;51(1):163–74. pmid:30397333
- 186. Costa AFDV, Gasser RB, Dias SRC, Rabelo EML. Male-enriched transcription of genes encoding ASPs and Kunitz-type protease inhibitors in Ancylostoma species. Mol Cell Probes. 2009;23(6):298–303. pmid:19646525
- 187. Maruszewska-Cheruiyot M, Szewczak L, Krawczak-Wójcik K, Głaczyńska M, Donskow-Łysoniewska K. The production of excretory-secretory molecules from Heligmosomoides polygyrus bakeri fourth stage larvae varies between mixed and single sex cultures. Parasit Vectors. 2021;14(1):106. pmid:33557937
- 188. Maruszewska-Cheruiyot M, Donskow-Łysoniewska K, Piechna K, Krawczak K, Doligalska M. L4 stage Heligmosomoides polygyrus prevents the maturation of dendritic JAWS II cells. Exp Parasitol. 2019;196:12–21. pmid:30465732
- 189. Maruszewska-Cheruiyot M, Szewczak L, Krawczak-Wójcik K, Kierasińska M, Stear M, Donskow-Łysoniewska K. The Impact of Intestinal Inflammation on Nematode’s Excretory-Secretory Proteome. Int J Mol Sci. 2023;24(18):14127. pmid:37762428
- 190. Ferguson AA, Inclan-Rico JM, Lu D, Bobardt SD, Hung L, Gouil Q, et al. Hookworms dynamically respond to loss of Type 2 immune pressure. PLoS Pathog. 2023;19(12):e1011797. pmid:38079450
- 191. Katz K, Shutov O, Lapoint R, Kimelman M, Brister JR, O’Sullivan C. The Sequence Read Archive: a decade more of explosive growth. Nucleic Acids Res. 2022;50(D1):D387–90. pmid:34850094
- 192. Perez-Riverol Y, Bai J, Bandla C, García-Seisdedos D, Hewapathirana S, Kamatchinathan S, et al. The PRIDE database resources in 2022: a hub for mass spectrometry-based proteomics evidences. Nucleic Acids Res. 2022;50(D1):D543–52. pmid:34723319
- 193. Lee RYN, Howe KL, Harris TW, Arnaboldi V, Cain S, Chan J, et al. WormBase 2017: molting into a new stage. Nucleic Acids Research. 2017;46(D1):D869–74. pmid:29069413
- 194. Hu Y, Zhan B, Keegan B, Yiu YY, Miller MM, Jones K, et al. Mechanistic and single-dose in vivo therapeutic studies of Cry5B anthelmintic action against hookworms. PLoS Negl Trop Dis. 2012;6(11):e1900. pmid:23145203
- 195. Tritten L, Nwosu U, Vargas M, Keiser J. In vitro and in vivo efficacy of tribendimidine and its metabolites alone and in combination against the hookworms Heligmosomoides bakeri and Ancylostoma ceylanicum. Acta Trop. 2012;122(1):101–7. pmid:22210439
- 196. Ray DK, Bhopale KK, Shrivastava VB. Migration and growth of Ancylostoma ceylanicum in golden hamsters Mesocricetus auratus. J Helminthol. 1972;46(4):357–62. pmid:4641405
- 197. Romeo Y, Lemaitre B. Drosophila immunity: methods for monitoring the activity of Toll and Imd signaling pathways. Methods Mol Biol. 2008;415:379–94. pmid:18370166
- 198. Srinivasan J, Dillman AR, Macchietto MG, Heikkinen L, Lakso M, Fracchia KM, et al. The draft genome and transcriptome of Panagrellus redivivus are shaped by the harsh demands of a free-living lifestyle. Genetics. 2013;193(4):1279–95. pmid:23410827
- 199. Grüning B, Dale R, Sjödin A, Chapman BA, Rowe J, Tomkins-Tinch CH, et al. Bioconda: sustainable and comprehensive software distribution for the life sciences. Nat Methods. 2018;15(7):475–6. pmid:29967506
- 200. Davis P, Zarowiecki M, Arnaboldi V, Becerra A, Cain S, Chan J, et al. WormBase in 2022-data, processes, and tools for analyzing Caenorhabditis elegans. Genetics. 2022;220(4):iyac003. pmid:35134929
- 201. Pertea G, Pertea M. GFF Utilities: GffRead and GffCompare. F1000Res. 2020;9:ISCB Comm J-304. pmid:32489650
- 202. Zaleśny G, Hildebrand J, Paziewska-Harris A, Behnke JM, Harris PD. Heligmosomoides neopolygyrus Asakawa & Ohbayashi, 1986, a cryptic Asian nematode infecting the striped field mouse Apodemus agrarius in Central Europe. Parasit Vectors. 2014;7:457. pmid:25303901
- 203. Nieberding C, Morand S, Libois R, Michaux JR. Parasites and the island syndrome: the colonization of the western Mediterranean islands by Heligmosomoides polygyrus (Dujardin, 1845). Journal of Biogeography. 2006;33(7):1212–22.
- 204. Durette-Desset MC, Kinsella JM, Forrester DJ. Arguments in favor of a double origin for nearctic nematodes of the genus Heligmosomoides Hall, 1916. Ann Parasitol Hum Comp. 1972;47(3):365–82. pmid:4674340
- 205. Behnke JM, Keymer AE, Lewis JW. Heligmosomoides polygyrus or Nematospiroides dubius?. Parasitol Today. 1991;7(7):177–9. pmid:15463487
- 206. Quinnell RJ, Behnke JM, Keymer AE. Host specificity of and cross-immunity between two strains of Heligmosomoides polygyrus. Parasitology. 1991;102 Pt 3:419–27. pmid:1866189
- 207. Musah-Eroje M, Burton L, Kerr N, Behnke JM. Quantitative assessment of spicule length in Heligmosomoides spp. (Nematoda, Heligmosomidae): distinction between H. bakeri, H. polygyrus and H. glareoli. Parasitology. 2023;150(11):1022–30. pmid:37705292
- 208. Maizels RM, Hewitson JP, Gause WC. Heligmosomoides polygyrus: one species still. Trends Parasitol. 2011;27(3):100–1. pmid:21159557
- 209. Behnke J, Harris PD. Heligmosomoides bakeri: a new name for an old worm?. Trends Parasitol. 2010;26(11):524–9. pmid:20729145
- 210. Crusoe MR, Alameldin HF, Awad S, Boucher E, Caldwell A, Cartwright R, et al. The khmer software package: enabling efficient nucleotide sequence analysis. F1000Res. 2015;4:900. pmid:26535114
- 211. Zhang Q, Pell J, Canino-Koning R, Howe AC, Brown CT. These are not the k-mers you are looking for: efficient online k-mer counting using a probabilistic data structure. PLoS One. 2014;9(7):e101271. pmid:25062443
- 212. Stanke M, Diekhans M, Baertsch R, Haussler D. Using native and syntenically mapped cDNA alignments to improve de novo gene finding. Bioinformatics. 2008;24(5):637–44. pmid:18218656
- 213. Barnett DW, Garrison EK, Quinlan AR, Strömberg MP, Marth GT. BamTools: a C++ API and toolkit for analyzing and managing BAM files. Bioinformatics. 2011;27(12):1691–2. pmid:21493652
- 214. Pertea G, Huang X, Liang F, Antonescu V, Sultana R, Karamycheva S, et al. TIGR Gene Indices clustering tools (TGICL): a software system for fast clustering of large EST datasets. Bioinformatics. 2003;19(5):651–2. pmid:12651724
- 215. Buchfink B, Reuter K, Drost H-G. Sensitive protein alignments at tree-of-life scale using DIAMOND. Nat Methods. 2021;18(4):366–8. pmid:33828273
- 216. Lomsadze A, Burns PD, Borodovsky M. Integration of mapped RNA-Seq reads into automatic training of eukaryotic gene finding algorithm. Nucleic Acids Res. 2014;42(15):e119. pmid:24990371
- 217. Danecek P, Bonfield JK, Liddle J, Marshall J, Ohan V, Pollard MO, et al. Twelve years of SAMtools and BCFtools. Gigascience. 2021;10(2):giab008. pmid:33590861
- 218. Doyle SR, Tracey A, Laing R, Holroyd N, Bartley D, Bazant W, et al. Genomic and transcriptomic variation defines the chromosome-scale assembly of Haemonchus contortus, a model gastrointestinal worm. Commun Biol. 2020;3(1):656. pmid:33168940
- 219. Quinlan AR, Hall IM. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics. 2010;26(6):841–2. pmid:20110278
- 220. Neph S, Kuehn MS, Reynolds AP, Haugen E, Thurman RE, Johnson AK, et al. BEDOPS: high-performance genomic feature operations. Bioinformatics. 2012;28(14):1919–20. pmid:22576172
- 221.
Dainat J. AGAT: Another Gff analysis toolkit to handle annotations in any GTF/GFF format. 1.1.0 ed. Zenodo. 2023.
- 222. Waterhouse RM, Seppey M, Simão FA, Manni M, Ioannidis P, Klioutchnikov G, et al. BUSCO Applications from Quality Assessments to Gene Prediction and Phylogenomics. Mol Biol Evol. 2018;35(3):543–8. pmid:29220515
- 223. Lupas A. Prediction and analysis of coiled-coil structures. Methods Enzymol. 1996;266:513–25. pmid:8743703
- 224. Wootton JC. Non-globular domains in protein sequences: automated segmentation using complexity measures. Comput Chem. 1994;18(3):269–85. pmid:7952898
- 225. Eddy SR. A new generation of homology search tools based on probabilistic inference. Genome Inform. 2009;23(1):205–11. pmid:20180275
- 226. Finn RD, Coggill P, Eberhardt RY, Eddy SR, Mistry J, Mitchell AL, et al. The Pfam protein families database: towards a more sustainable future. Nucleic Acids Res. 2016;44(D1):D279-85. pmid:26673716
- 227. Hernandez-Plaza A, Szklarczyk D, Botas J, Cantalapiedra CP, Giner-Lamia J, Mende DR, et al. eggNOG 6.0: enabling comparative genomics across 12 535 organisms. Nucleic Acids Res. 2023;51(D1):D389–D94. pmid:36399505
- 228. Kanehisa M, Furumichi M, Sato Y, Kawashima M, Ishiguro-Watanabe M. KEGG for taxonomy-based analysis of pathways and genomes. Nucleic Acids Res. 2023;51(D1):D587–92. pmid:36300620
- 229. Hart AJ, Ginzburg S, Xu MS, Fisher CR, Rahmatpour N, Mitton JB, et al. EnTAP: Bringing faster and smarter functional annotation to non-model eukaryotic transcriptomes. Mol Ecol Resour. 2020;20(2):591–604. pmid:31628884
- 230. UniProt Consortium. UniProt: the Universal Protein Knowledgebase in 2023. Nucleic Acids Res. 2023;51(D1):D523–31. pmid:36408920
- 231. O’Leary NA, Wright MW, Brister JR, Ciufo S, Haddad D, McVeigh R, et al. Reference sequence (RefSeq) database at NCBI: current status, taxonomic expansion, and functional annotation. Nucleic Acids Res. 2016;44(D1):D733-45. pmid:26553804
- 232. Chen S, Zhou Y, Chen Y, Gu J. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics. 2018;34(17):i884–90. pmid:30423086
- 233. Patro R, Duggal G, Love MI, Irizarry RA, Kingsford C. Salmon provides fast and bias-aware quantification of transcript expression. Nat Methods. 2017;14(4):417–9. pmid:28263959
- 234. Srivastava A, Malik L, Sarkar H, Zakeri M, Almodaresi F, Soneson C, et al. Alignment and mapping methodology influence transcript abundance estimation. Genome Biol. 2020;21(1):239. pmid:32894187
- 235. Srivastava A, Zakeri M, Sarkar H, Soneson C, Kingsford C, Patro R. Accounting for fragments of unexpected origin improves transcript quantification in RNA-seq simulations focused on increased realism. bioRxiv. 2021.
- 236. Gu Z. Complex heatmap visualization. Imeta. 2022;1(3):e43. pmid:38868715
- 237. Robinson MD, McCarthy DJ, Smyth GK. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010;26(1):139–40. pmid:19910308
- 238. Noble WS. How does multiple testing correction work?. Nat Biotechnol. 2009;27(12):1135–7. pmid:20010596
- 239. Schurch NJ, Schofield P, Gierliński M, Cole C, Sherstnev A, Singh V, et al. How many biological replicates are needed in an RNA-seq experiment and which differential expression tool should you use?. RNA. 2016;22(6):839–51. pmid:27022035
- 240. Lex A, Gehlenborg N, Strobelt H, Vuillemot R, Pfister H. UpSet: Visualization of Intersecting Sets. IEEE Trans Vis Comput Graph. 2014;20(12):1983–92. pmid:26356912
- 241.
Gadhave K, Strobelt H, Gehlenborg N, Lex A. UpSet 2: from prototype to tool. In: Proceedings of the IEEE Information Visualization Conference, Vancouver, BC, Canada, 2019.
- 242. Cui X, Churchill GA. Statistical tests for differential expression in cDNA microarray experiments. Genome Biol. 2003;4(4):210. pmid:12702200
- 243. Li W, Godzik A. Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics. 2006;22(13):1658–9. pmid:16731699
- 244. Shumate A, Salzberg SL. Liftoff: accurate mapping of gene annotations. Bioinformatics. 2021;37(12):1639–43. pmid:33320174
- 245. Bailey TL, Boden M, Buske FA, Frith M, Grant CE, Clementi L, et al. MEME SUITE: tools for motif discovery and searching. Nucleic Acids Res. 2009;37(Web Server issue):W202-8. pmid:19458158
- 246. Xie Y, Hoberg EP, Yang Z, Urban JF Jr, Yang G. Ancylostoma ailuropodae n. sp. (Nematoda: Ancylostomatidae), a new hookworm parasite isolated from wild giant pandas in Southwest China. Parasit Vectors. 2017;10(1):277. pmid:28576124
- 247. van Megen H, van den Elsen S, Holterman M, Karssen G, Mooyman P, Bongers T, et al. A phylogenetic tree of nematodes based on about 1200 full-length small subunit ribosomal DNA sequences. Nematol. 2009;11(6):927–50.
- 248. Tang YT, Gao X, Rosa BA, Abubucker S, Hallsworth-Pepin K, Martin J, et al. Genome of the human hookworm Necator americanus. Nat Genet. 2014;46(3):261–9. pmid:24441737