Figures
Abstract
The mosquito midgut functions as a key interface between pathogen and vector. However, studies of midgut physiology and virus infection dynamics are scarce, and in Culex tarsalis—an extremely efficient vector of West Nile virus (WNV)—nonexistent. We performed single-cell RNA sequencing on Cx. tarsalis midguts, defined multiple cell types, and determined whether specific cell types are more permissive to WNV infection. We identified 20 cell states comprising 8 distinct cell types, consistent with existing descriptions of Drosophila and Aedes aegypti midgut physiology. Most midgut cell populations were permissive to WNV infection. However, there were higher levels of WNV RNA (vRNA) in enteroendocrine cells (EE), suggesting enhanced replication in this population. In contrast, proliferating intestinal stem cells (ISC) had the lowest levels of vRNA, a finding consistent with studies suggesting ISC proliferation in the midgut is involved in infection control. ISCs were also found to have a strong transcriptional response to WNV infection; genes involved in ribosome structure and biogenesis, and translation were significantly downregulated in WNV-infected ISC populations. Notably, we did not detect significant WNV-infection induced upregulation of canonical mosquito antiviral immune genes (e.g., AGO2, R2D2, etc.) at the whole-midgut level. Rather, we observed a significant positive correlation between immune gene expression levels and vRNA load in individual cells, suggesting that within midgut cells, high levels of vRNA may trigger antiviral responses. Our findings establish a Cx. tarsalis midgut cell atlas, and provide insight into midgut infection dynamics of WNV by characterizing cell-type specific enhancement/restriction of, and immune response to, infection at the single-cell level.
Author summary
West Nile virus is the leading cause of mosquito-borne disease in N. America. Cx. tarsalis is a highly competent vector of WNV that plays a central role in the transmission and maintenance of WNV in nature. It is hypothesized that the permissibility of mosquito midgut cells contributes to the midgut infection barrier and thus impacts the ability of pathogens to establish infection in a mosquito. Additionally, it is postulated that the midgut is the most important organ with respect to determining vector competence. The recent publication of the full Cx. tarsalis genome, in conjunction with the growing body of work demonstrating the successful application of single-cell RNA sequencing methodologies in insect models made it possible for us to examine the cellular composition of the Cx. tarsalis midgut, and WNV infection dynamics therein, at single-cell resolution. We found cell-type-specific differences in viral RNA levels suggesting variability in WNV replication efficiency in specific cell types, identified patterns of differential expression associated with WNV infection in specific cell populations, and characterized aspects of the innate immune response to WNV infection at the tissue and cellular level.
Citation: Fitzmeyer EA, Dutt TS, Pinaud S, Graham B, Gallichotte EN, Hill JL, et al. (2025) A single-cell atlas of the Culex tarsalis midgut during West Nile virus infection. PLoS Pathog 21(1): e1012855. https://doi.org/10.1371/journal.ppat.1012855
Editor: Lark L. Coffey, University of California Davis, UNITED STATES OF AMERICA
Received: July 24, 2024; Accepted: December 20, 2024; Published: January 27, 2025
Copyright: © 2025 Fitzmeyer et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All sequencing data files included in the described analyses are available on NCBI’s Sequence Read Archive (PRJNA1209375). All scripts used for data processing, analysis, and visualization are available on GitHub: https://github.com/fitz-meyer/scRNA_seq_fitz. RDS files for all samples included in the described analyses are available on GitHub: https://github.com/fitz-meyer/scRNA_seq_rdsFiles.git.
Funding: The research reported in this publication was supported by National Institutes of Health grant R01-AI067380 (GDE), National Institute of Allergy and Infectious Diseases of the National Institutes of Health under Award Number T32-AI162691 (EAF), Colorado State University’s Office of the Vice President for Research’s “Accelerating Innovations in Pandemic Disease” initiative, made possible through support from The Anschutz Foundation (EAF), Verena (viralemergence.org) from the U.S. National Science Foundation, including NSF BII 2021909 and NSF BII 2213854 (ENG), the Wellcome Trust (grant 206194/Z/17/Z) (MKNL, SP, VH) and an MRC Career Development Award G1100339 (MKNL). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. The content is solely the responsibility of the authors.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Arthropod-borne viruses represent a severe and ever-growing public health threat [1]. Mosquito-borne viruses alone are estimated to cause over 400 million infections globally each year [2]. For transmission of a mosquito-borne virus to occur, a mosquito must first become infected with a virus via ingestion of an infectious bloodmeal after feeding on a viremic host [3,4]. The virus must establish infection in the mosquito midgut before it disseminates into the hemocoel, and eventually enters the salivary glands and saliva—where transmission occurs [3,4]. The mosquito midgut is a complex organ composed of a variety of cell types with distinct functions including digestion, nutrient absorption, endocrine signaling, and innate immune activity [5,6]. The midgut is also the site of infection and escape barriers that strongly influence virus population dynamics [4,5]. Previous studies have demonstrated that successful infection of the midgut epithelium, and replication and immune evasion therein, is essential for establishing disseminated infection in an arthropod vector [3,7]. In these ways, for hematophagous disease vectors like mosquitoes, the midgut serves as a critical interface between vector and pathogen and the establishment of infection.
Cx. tarsalis is a major vector of West Nile virus (WNV) in much of North America [8–10]. WNV is the most epidemiologically important arbovirus in N. America, causing ~2,700 deaths from 1999 to 2022 [8,9,11]. Despite the importance of Cx. tarsalis as a vector of WNV and other important human viruses, studies examining the cellular composition of its midgut, and WNV infection dynamics therein, are nonexistent. The recent publication of the full Cx. tarsalis genome, in conjunction with the growing body of work demonstrating the successful application of single-cell RNA sequencing methodologies in insect models has made it possible to address this significant knowledge gap [12–20]. Therefore, we performed single-cell RNA sequencing (scRNA-seq) on dissociated midgut cells from both mock and WNV-infected Cx. tarsalis mosquitoes to gain a better understanding of how the midgut functions as the interface between vector and WNV.
We utilized a scRNA-seq approach previously demonstrated to be flavivirus RNA inclusive, which allowed us to detect WNV viral RNA (vRNA) in addition to host transcripts [21]. Through this approach we identified distinct midgut populations corresponding with midgut cell types previously described in Drosophila and Aedes aegypti midguts—enterocyte (nutrient absorption cells), enteroendocrine (secretory cells), cardia (peritrophic matrix secreting cells), intestinal stem cell/enteroblast (undifferentiated progenitor cells), proliferating intestinal stem cell/enteroblast, visceral muscle cells, and hemocytes (immune cells)—and characterized the infection and replication dynamics of WNV within each population [5,6,14,16,17]. We found that WNV infects most midgut cell types, with evidence suggesting enhanced replication in enteroendocrine cells and reduced viral replication in proliferating intestinal stem cells/enteroblasts. Additionally, we characterized the Cx. tarsalis immune response to WNV infection at both the whole-midgut and single-cell level. This study has bolstered our understanding of WNV midgut infection in a highly competent vector, and elucidated the midgut biology of Cx. tarsalis.
Results
Single-cell RNA sequencing of female Cx. tarsalis midguts identified 20 distinct cell populations
Using the 10X Genomics platform we performed scRNA-seq on pools of 10 dissociated Cx. tarsalis midguts at 4 and 12 days post-infection (dpi) with either an infectious bloodmeal containing WNV, or a mock uninfected bloodmeal. Replicate infected and mock pools were collected and sequenced at each timepoint (four replicates at 4dpi and two replicates at 12dpi for each condition). We recovered an average of 2,416 cells per pool with an average coverage of 255,000 reads per cell, which were mapped to the Cx. tarsalis genome (S1 File). Following quality control (QC) filtering, we retained data for 12,886 cells at 4dpi (7,386 WNV-infected, 5,500 mock), and 9,301 cells at 12dpi (4,609 WNV-infected, 4,692 mock) for downstream analyses (S1 File). Cells retained after QC contained an average of 597 (611 WNV-infected, 580 mock) and 407 (448 WNV-infected, 367 mock) unique genes per cell at 4 and 12dpi respectively.
Guided clustering in Seurat (v4.3.0.1) generated 20 (4dpi) and 17 (12dpi) distinct clusters of cells (Fig 1A and 1B). The cell types of 15 cell clusters were identified using canonical gene markers and gene enrichment patterns previously identified in Drosophila and Ae. aegypti midguts (Fig 1A-1C) [14,16,18,20]. All cell-type identifications, with indicated exceptions, were based on cluster markers that were conserved between mock and WNV infection (S2 and S3 Files). We identified enterocytes (EC) by significant expression of POU2F1 (nubbin), PLA2G6 (phospholipase A2), and AGBL5 (zinc carboxypeptidase), and enteroendocrine cells (EE) by expression of PROX1 (prospero) (Figs 2C and 3A) [14,17,22,23]. High expression of Mlc2 (myosin light chain), Mhc (myosin heavy chain), and ACTB (actin), allowed us to identify visceral muscle cells (VM); VM-1, VM-2 (Fig 1C) [14,17,22,24]. We identified a population of hemocytes (HC) based on expression of NIMB2 and SPARC (Fig 1C) [16,18]. EC-like cells (EC-like-1, EC-like-2, EC-like-3) were identified as such based on enrichment for several serine protease and alpha amylase genes (Fig 1C, S2 File) [14,17]. One cardia population (cardia-1) was identified by enrichment for sugar transport and chitin-binding genes, as well as several serine protease genes, and a second cardia population (cardia-2) was identified by expression of C-type lysozyme and sugar transporter genes (Fig 1C) [14,17]. Intestinal stem cells/enteroblasts (ISC/EB) were identified by visualizing klumpfuss (klu) expression specific to these clusters via violin plot (S3B Fig). One of the ISC/EB clusters was significantly enriched for PCNA—a marker for cell proliferation—and therefore named ISC/EB-prol to reflect this (Figs 1C and S6) [25,26]. A cluster that shared identical conserved markers with cardia-1 and was also significantly enriched for PCNA was identified as cardia-prol (Fig 1C) [25,26]. A cluster of fat body cells (FBC) was identified by high expression of apoliphorin-III [23]. A cluster of Malpighian tubule cells (MT) that was only present in one out of 12 samples (mg5c) (S2C Fig) was identified by significant enrichment for an inward rectifier potassium channel gene (irk-2) as well as several glutathione and vacuolar ATPase genes (Fig 1C, S2 File) [27,28]. This indicates that Malpighian tubule tissue was inadvertently retained upon midgut collection for sample mg5c. Clusters without identifying markers are subsequently referred to by cluster number (e.g., cluster 4). Importantly, HCs, FBCs, and MTs are not midgut cells, but are considered associated with the midgut, while EC, EE, cardia, ISC/EB, and VM cell populations comprise the midgut, and EC, EE, cardia, and ISC/EB comprise the midgut epithelium (Fig 2A and 2B). We compared the proportion of each cluster between mock and WNV-infected replicates and found no significant differences (S2A and S2B Fig). The percent of the total population composed by each cluster/cell-type can be found in S1 Table. Finally, we visualized the expression of S and G2/M phase cell cycle markers and found no evidence of cell cycle-driven clustering, with the exception of proliferating cell populations (S8 and S9 Figs).
Uniform Manifold Approximation and Projection (UMAP) reduction was used to visualize midgut cell populations. (A) 20 distinct clusters were identified at 4dpi and (B) 17 distinct clusters were identified at 12dpi. (C) Expression of canonical marker genes that were conserved between infection condition (mock and WNV-infected) were used to determine cell type. Dot size denotes percent of cells expressing each gene, color denotes scaled gene expression. Panel C was derived from the total population of cells for each subtype.
Expression of characterizing genes visualized for EEs (A), HCs (B), and ISC/EBs (A). UMAP limits were set to include only clusters of interest, orientation of clusters in the total population can be found in panel C. (D) Expression of canonical enteroendocrine/neuroendocrine, neuropeptide, and secretory genes. (E) Expression of canonical HC, and granulocyte and oenocytoid genes (F) Expression of canonical ISC and EB genes as well as gene markers for proliferation and mitosis. Panels A and B were created in BioRender. Gallichotte, E. (2024) https://BioRender.com/f66n058. Gene counts for the total population (timepoints and conditions combined) were converted to binary (blue = 1, grey = 0) and visualized via UMAP.
(A) Levels of WNV vRNA across midgut cell populations at 4 and 12dpi. Purple to grey gradient denotes high to low levels of vRNA. (B) percent of cells in the total population containing WNV vRNA (percent expressing) and average WNV vRNA level in the total population (average expression) calculated for each WNV-infected replicate and compared between timepoints. Significance was determined by unpaired t-test, * = p<0.05. (C) WNV vRNA level in individual cells of distinct clusters. Black asterisks denote significant enrichment of vRNA as a cluster marker, pink asterisks denote significant absence of vRNA as a cluster marker. The percentage of cells containing WNV vRNA in each epithelial cell population at (D) 4dpi and (E) 12dpi. Average WNV vRNA level in epithelial cell populations at (F) 4dpi and (G) 12dpi. Significance determined by one-way ANOVA, * = p<0.05, ** = p<0.005, *** = p<0.0005, **** = p<0.0001. Points = replicate values, bars = mean of replicate values, error bars = SD. (E) Trajectory inference for ISC/EB and ISC/EB-prol populations identified two lineages. (F) vRNA levels visualized along lineage progression. Green = lineage 1 (EC), blue = lineage 2 (EE). Panels B-G were derived from only WNV-infected samples. Panels D-G exclude values derived from clusters of ≤5 cells. Panels A, and H-I were derived from the total population.
Characterization of Cx. tarsalis midgut secretory, immune, and progenitor cells
Enteroendocrine cells (EE) are the secretory cells of the midgut (Fig 2A) that, through the secretion of neuropeptides, regulate digestion and behavioral responses associated with feeding, satiety, stress, etc. [14,17,29]. These cells, and the neuropeptides they secrete, have been previously characterized in Ae. aegypti and Drosophila, but never Cx. tarsalis [30,31]. We identified Cx. tarsalis orthologs for previously described insect gut hormones found in EE cells - short neuropeptide (sNPF), bursicon (Burs), ion transport peptide (ITP), and tachykinin (Tk) receptor [14,17,20,30,32]. However, these neuropeptides/neuropeptide receptors were found to have very low and nonspecific expression in the EE population (S3A Fig). The EE population was significantly enriched for canonical neuroendocrine genes (IA2, and SCG5) [33,34] and Syt1, and showed low and nonspecific expression of Syt4, Syt6, Syx1A and nSyb (genes associated with vesicle docking and secretion) (Figs 2D and S3A) [14,35]. Interestingly, the EE population showed strong enrichment for NEUROD6, an ortholog for neurogenic differentiation factor 4 in Ae. aegypti (AAEL004252) and for a neuronal differentiation gene known to be involved in behavioral reinforcement in mammals (Figs 2D and S3A) [36].
Hemocytes (HC) are immune cells that circulate in the hemolymph (Fig 2B) and play various roles in the mosquito immune response that are HC class-dependent [15,16,18,37,38]. Much like EEs, hemocytes have not been characterized in Cx. tarsalis. The HC population was identified as mature granulocytes by expression of SCRASP1, c-type lectin, defensin, and cecropin genes (Figs 2E and S3C) [16]. We saw no expression of SCRB3—an oenocytoid marker—in our HC population (Figs 2E and S3C) providing further support for the observation that the HCs detected were granulocytes [18].
Intestinal stem cells (ISCs) are the progenitor cells of the midgut that differentiate into either enterocyte progenitors (enteroblasts, EBs) or enteroendocrine progenitors (EEPs) (Fig 2A) [39,40]. We identified a Culex ortholog for Delta (Delta-like protein – CPIJ019429), a canonical ISC marker in Ae. aeygpti and Drosophila midguts, however, it was not enriched in our ISC populations (Fig 2F). EBs and ISCs are often indistinguishable by UMAP, and thus we were able to use klumpfuss (Klu), an EB marker, to identify ISC populations [14,17,20]. The ISC/EB-prol population was enriched for aurora kinase A and B (mitotic markers) as well as PCNA (proliferation marker) further confirming the proliferative state of this cluster (Fig 2F) [25,26,41,42].
COG profiles demonstrate homogeneity between midgut cell populations despite differences in conserved markers
We next examined the transcriptional profiles of each cluster to understand the function of unidentified clusters and compare the transcriptomes of distinct cell populations. We used cluster of orthologous genes (COG) categories, and a two-pronged approach to visualization—COG profiles of all genes expressed in >75% of cells in a cluster (termed “base genes”) and COG profiles of all significant (p<0.05), conserved cluster markers with positive log2 fold-changes (log2FC) relative to the other clusters (S1A and S1B Fig). Base gene and cluster marker gene profiles were derived from the total population at each timepoint. Despite the varying cell types, we noted homogeneity across base genes for each cluster, with the plurality of each profile for most clusters comprising genes involved in translation and ribosomal biogenesis (J), and energy production and conversion (C) (S1A Fig). However, ISC/EB-prol, cardia-2, and cardia-prol all possessed fewer ‘J’ COGs than other clusters. The COG profiles of EC-like populations show variability between their transcriptomes and ECs (S1A and S1B Fig). As expected, VM populations contained the highest proportions of cytoskeletal genes (Z) in both base gene and cluster marker profiles compared to other cell types (S1A and S1B Fig).
WNV vRNA is detected at varying levels in the majority of midgut cell populations
In addition to characterizing the cellular heterogeneity of Cx. tarsalis midguts, we also sought to examine WNV infection dynamics at the single-cell level. The five-prime bias of the scRNA-seq chemistry captured and allowed us to detect the WNV 5’ UTR as a feature in our data. Visualization of the sequence alignment map files generated by the Cell Ranger pipeline confirmed that WNV reads mapped with Cell Ranger solely mapped to the 5’ UTR. Importantly, WNV viral RNA (vRNA) was only detected in our WNV-infected samples, and was broadly detected across most cell populations at both time points (Fig 3A). Within WNV-infected replicates we compared the percent of cells with detectable vRNA and the average vRNA level in the total population for each timepoint (Fig 3B). We saw no significant difference in the total percent of WNV-infected cells between timepoints, but a significant increase in average total vRNA level by 12dpi (Fig 3B). Within clusters (replicates within time points combined), cells contained variable levels of vRNA, however some clusters (cluster 17, cardia-prol, etc.) were either not present or comprised ≤5 cells in the WNV-infected condition (Fig 3C). Examining cluster markers in the WNV-infected condition revealed that vRNA was a significant cluster marker for several clusters at each timepoint (Fig 3C). At both timepoints, cluster 4 contained both the highest average level of vRNA and was significantly enriched for vRNA relative to the other clusters (Figs 3C and 4, S2 Table). Cluster 4 lacked canonical markers; however, it was significantly enriched for mitochondrial genes and mitochondrial tRNAs, suggesting these cells are either apoptotic or in states of increased energy demand or stress (S4 Fig). However, there was minimal expression of pro-apoptotic genes across all clusters, and no notable enrichment of pro-apoptotic genes in cluster 4 (S7 Fig). At both timepoints the ISC/EB-prol populations, and at 12dpi the ISC/EB population, were distinguished by a significant absence of vRNA relative to the other clusters (Fig 3C, S2 Table). vRNA in the 4dpi ISC/EB-prol population had a log2FC value of -2.2, and vRNA in the 12dpi ISC/EB-prol and ISC/EB populations had log2FC values of -3.6 and -2.5 respectively; greater absolute values than any other cell population/cluster (S2 Table).
(A) Number of significant (adjusted p-value<0.05) differentially expressed genes (DEGs) associated with WNV infection in each midgut cell type. Green = 12dpi, purple = 4dpi, black = upregulated, pink = downregulated. (B) Total significant DEGs identified in midgut cell populations. Green = 12dpi, purple = 4dpi, black = upregulated, pink = downregulated. (C) Visualizing WNV-infection associated DEGs in cell populations of interest (ISC/EBs, EEs). Negative log2FC indicates downregulation and positive log2FC indicates upregulation associated with WNV infection. In the interest of data density, the bar representing the log2FC of the WNV 5’ UTR is not shown. WNV 5’ UTR log2FC values can be found in S4 Table. (D) Gene ontology enrichment analysis (GOEA) for all downregulated DEGs (timepoints and cell populations combined). (E) GOEA for all upregulated DEGs (timepoints and cell populations combined). Color denotes -log10(FDR). FDR cutoff<0.05 was used for GOEA.
WNV vRNA levels differ between epithelial cell populations
Next, we compared the presence and level of vRNA in epithelial cell populations; EC and EC-like, EE, cardia, ISC/EB, and ISC/EB-prol. Average expression and percent expression values derived from clusters composed of ≤5 cells in a given replicate were excluded from this comparison. At 4dpi the EC-like-2 population had the highest percentage of cells containing vRNA, significantly more than EC-like-1, EC, EE, ISC/EB-prol, and cardia populations (Fig 3D). Interestingly, the other EC-like population at 4dpi (EC-like-1) had significantly lower percentages of cells containing vRNA compared to other populations (Fig 3D). There were no significant differences in the percent of cells containing vRNA between any epithelial cell population at 12dpi (Fig 3E). At both time points, EC-like-2 and EE populations the highest average levels of vRNA (Fig 3F and 3G). At 12dpi EE populations had significantly higher levels of vRNA than all other epithelial cell populations (Fig 3G).
To further explore the dynamics of epithelial cell populations we used slingshot (v2.10.0) to perform a trajectory inference and identify cell lineages. We identified two lineages: (1) ISC/EB → ISB/EB-prol → EE, and (2) ISC/EB → ISC/EB-prol → EC-like → EC (Fig 3H). As expected, we observed decreases in Klu and PCNA expression in both lineages with increasing pseudotime, and saw an increase in expression of EC cell marker nubbin and EE cell marker prospero corresponding with the differentiation of lineages 1 and 2 respectively (S11 Fig). Plotting levels of WNV vRNA along the progression of each lineage confirmed our finding that vRNA levels are lowest in the ISC/EB-prol population compared to fully differentiated EE and EC cells (Fig 3I). Importantly, visualizing vRNA along a lineage progression is a useful way to further examine vRNA level variability between cell populations, it is not a timeline of viral infection, i.e., it does not suggest that WNV initiates infection in the ISC/EB or ISC/EB-prol population.
WNV infection is associated with downregulation of ribosomal and translation related genes in progenitor cells
To further examine the impact of WNV infection on specific cell populations, we performed pseudo-bulk differential expression (DE) analysis between mock and WNV-infected conditions for all midgut cell populations (EC, EC-like, EE, ISC/EB, cardia, VM). We identified 111 significant infection-associated DEGs in these populations with ISC/EB, ISC/EB-prol, and EC-like-1 populations comprising the greatest percentage of the total with 31, 25, and 24 DEGs respectively (Fig 4A, S3 Table). The majority (104/111) of cell-type specific, infection-associated DEGs were identified in populations from 4dpi, while only 7 DEGs were identified in populations from 12dpi (Fig 4B). The majority (89/111) of DEGs were downregulated in response to infection, with only 22 upregulated (Fig 4B). Interestingly, predominantly ribosomal and translation related genes were downregulated in infected ISC/EB and ISC/EB-prol populations at 4dpi (Fig 4C, S3 Table), and two innate immune genes were significantly downregulated in infected EE populations at 4dpi (Fig 4C, S3 Table). To get a broader overview of the biological processes and molecular functions associated with WNV infection, we performed gene ontology (GO) enrichment analysis of all DEGs associated with infection (Fig 4D and 4E and S3 Table). GO categories for genes with functions related to ribosomal structure and translation were enriched among DEGs downregulated in WNV-infected cell populations—likely driven by the large number of such genes downregulated in ISC/EB populations (Fig 4D). GO categories associated with amino acid salvage, and biological processes involving L-methionine were enriched among DEGs upregulated in WNV-infected cell populations (Fig 4E). However, due to there being fewer upregulated DEGs, the false discovery rates were much larger for the results of this GO enrichment analysis (Fig 4E).
Identification of genes associated with WNV infection at the whole-tissue, and single-cell level
Bulk-RNA sequencing comparing WNV-infected to uninfected Cx. tarsalis midguts has not yet been described, so we performed a pseudo-bulk DE analysis to identify differentially expressed genes (DEGs) associated with mock and WNV-infected midguts at the whole-tissue level. We identified six significant DEGs at 4dpi; homocysteine S-methyltransferase, DMAS1 (aldo-keto reductase), and GBE1 (deltamethrin resistance-associated gene) were upregulated in response to WNV infection while BCAN (c-type lectin), uncharacterized gene11056, and a serine protease gene were downregulated (Fig 5A, S4 Table). At 12dpi we identified 10 significant DEGs; an ML (MD-2 related lipid recognition) domain-containing gene, four CRYAB (HSP20 heat shock protein) genes, and a chitin-binding domain-containing gene were upregulated in response to WNV infection, and three uncharacterized genes (gene13447, gene11056, gene9296) and fibrinogen/fibronectin were downregulated (Fig 5B, S4 Table). DEGs differed for each timepoint and, as such, we next examined DEGs in the WNV-infected condition between timepoints. We found many significant DEGs between timepoints; several leucine rich repeat containing genes were upregulated at 4dpi, and host immune gene LYSC4 was upregulated at 12dpi (Fig 5C).
DEGs associated with WNV-infected condition when compared to the mock condition at 4dpi (A) and 12dpi (B) were identified with DESeq2. Negative log2FC indicates downregulation and positive log2FC indicates upregulation associated with WNV infection. (C) Genes upregulated during WNV infection at 4dpi (negative log2FC) and 12dpi (positive log2FC). (D) Characterized genes that are significantly correlated with WNV vRNA at 4dpi (purple) and 12dpi (teal), with correlation values of >0.65. Only significant (p<0.05) relationships shown. (E) Correlation of enteroendocrine specific genes and WNV vRNA at 4dpi (purple) and 12dpi (teal). Solid line represents the mean correlation of WNV vRNA with 1000 randomly selected genes at the specified timepoint (dotted lines represent the upper and lower 95% confidence intervals for this value). Labeled dotted lines denote vRNA correlation with RPL8 and RPL32 housekeeping reference genes. Panels C-E were derived from only WNV-infected replicates.
To further examine genes associated with WNV infection, we performed a gene correlation on normalized counts for the top 500 variable genes (genes that have variation in expression across all cells) for each timepoint, determined significance via bootstrapping, then extracted and visualized characterized genes correlated (>0.65) with vRNA (Fig 5D, S5 Table). Transcription regulator ATRX, a cytochrome p450 gene and several uncharacterized genes were strongly correlated with vRNA at 4dpi (Fig 5D, S5 Table). At 12dpi, GSTE4, HAO1, METTLE20, prospero, UROS, BCAN, DHDH, and CHKov1 were strongly correlated with vRNA along with serine protease, AMP dependent ligase, cytochrome p450, mitochondrial ribosomal S26, glutathione S-transferase, and aldo/keto reductase family genes (Fig 5D, S5 Table). Many of these genes have no documented roles in flavivirus infection. However, ATRX has been implicated in the cellular response to DNA damage, and chromatin remodeling—processes which many viruses exploit during infection [43–45]. Further, cytochrome p450 enzymes and serine proteases have been purported to play a role in the mosquito response to viral infections [46–49].
Upon observing that prospero, the canonical marker for EE cells, was significantly positively correlated with vRNA at 12dpi, we examined the correlation between vRNA and several previously described neuroendocrine genes (Figs 2D and 5E). Two previously described housekeeping genes, RPL8 and RPL32 [50], were validated as having broad expression throughout the total population and used to both confirm that the high prevalence of vRNA in these populations was not confounding the results and provide a visual reference for a biologically insignificant correlation value (Fig 5E). Additionally, for each timepoint we determined the correlation between WNV vRNA and 1,000 random genes from the dataset (unlabeled solid line denotes the average of this calculation with 95% confidence intervals) (Fig 5E). At 4dpi prospero, and at 12dpi prospero, Syt1, and IA2 had positive correlations with vRNA (Fig 5E).
Characterization of the midgut immune response to WNV infection at the whole-tissue and single-cell level
While previous work demonstrated an increase in hemocyte proliferation upon bloodmeal ingestion and infection, there were no significant increases in the proportion of hemocyte populations associated with infection at either time point (Fig 6A and 6B) [51,38]. Upon observing that no mosquito immune genes were identified as significantly upregulated in response to WNV infection by pseudo-bulk DEG and correlation analyses, we manually compared percent of cells expressing and expression levels of key immune genes that have been implicated in viral control/infection response [19,37,52–55]. We identified orthologs in the Cx. tarsalis genome for mosquito immune genes; DOME, NANOS1, MYD88, IMD, AGO2, R2D2, STAT, Cactus, PIAS1, SUMO2, LYSC4, MARCH8, PIWI, and REL (S6 Table) and found no significant differences in the percent of cells expressing or average expression of these genes at either time point (Fig 6C and 6D). Next, we examined the relationships between expression of these immune genes and vRNA at the single-cell level in the WNV-infected population (Fig 6E). Interestingly, almost all genes were significantly positively correlated with vRNA at both timepoints (Fig 6E). To further confirm these findings, we visualized the relationship of the four most highly correlated immune genes (IMD, PIWI, PIAS1, and DOME) with vRNA in individual cells, and compared the expression level of each immune gene in both mock and WNV-infected conditions (timepoints combined) (Figs 6F, 6G and S10). These genes and vRNA were correlated, despite comparable expression levels of each gene across infection conditions, confirming that while vRNA load can be correlated with specific immune genes at the individual cell level, it does not induce significant immune gene enrichment at the total population level (Figs 6F, 6G and S10).
(A-B) Percent of the total population of each replicate comprised by hemocytes compared between mock and WNV-infected conditions at each timepoint. Significance was determined by unpaired t-test. Points = replicate values, bars = mean of replicate values, error bars = SD. (C-D) Average expression of, and percent of cells expressing, mosquito innate immune genes compared between mock and WNV-infected replicates for each timepoint. Significance was determined by multiple unpaired t-tests. Points = replicate values, bars = mean of replicate values, error bars = SD. (E) Correlation of immune genes and WNV vRNA at 4dpi (purple) and 12dpi (teal). Solid line represents the mean correlation of WNV vRNA with 1000 randomly selected genes at the specified timepoint (dotted lines represent the upper and lower 95% confidence intervals for this value). Labeled dotted lines denote vRNA correlation with RPL8 and RPL32 housekeeping reference genes. (F) Feature scatter of IMD vs. WNV 5’ UTR for both timepoints combined. (G) Expression level of IMD in mock and WNV-infected conditions for both timepoints combined. Panels E-F were derived from only WNV-infected replicates.
Discussion
In this study, we sought to generate a midgut cell atlas (i.e., map of cell type and function) for Cx. tarsalis and characterize WNV infection of the midgut at single-cell resolution by performing scRNA-seq on mock and WNV-infected midguts, collected at days 4 and 12 post infection. We identified and described nutrient absorptive (enterocyte), secretory (enteroendocrine), peritrophic matrix secreting (cardia), undifferentiated progenitor (intestinal stem cell/enteroblast), visceral muscle, immune (hemocyte), and fat body cell populations [5,17,18]. The distribution and proportion of each cell-type in the total population varied between timepoints, however we identified at least one cluster comprised of each cell-type at each timepoint. Several clusters were precluded from identification due to either lack of canonical markers/enrichment patterns, or origination from a single replicate. Nonetheless, we have demonstrated that single-cell sequencing of Cx. tarsalis midguts is feasible and that distinct cell populations can be identified and characterized using previously described canonical cell-type markers and enrichment patterns [14,16,18,20].
There were several differences between our identification of populations in the Cx. tarsalis midgut and previously described cell typing efforts in the midguts of Ae. aegypti and Drosophila [14,17,22,23]. Most notably, Delta, the canonical marker gene for intestinal stem cells (ISCs) in both Ae. aegypti and Drosophila, was not enriched in Cx. tarsalis ISCs which were, instead, identified by their colocalization with enteroblasts (EBs – enterocyte progenitors) within the UMAP projection [14,17]. While klumpfuss, the EB marker gene, was specifically expressed in ISC/EB populations, we saw only nonspecific expression of piezo, the marker for enteroendocrine progenitors in Drosophila (S12 Fig) [14]. Further, while there is reference to ISC differentiation markers and proliferative capabilities, existing midgut cell atlases for Ae. aegypti and Drosophila have not identified a distinct subset of proliferating ISCs—a cell population that, in Cx. tarsalis, is highly transcriptionally active and appears important in the context of virus infection (S6 Fig) [14,17,22,23]. Nubbin, the canonical marker gene for enterocytes (ECs), was reported to be a marker for both EC and EC-like populations in Ae. aegypti [17]. In our dataset nubbin was present in ECs and EC-like cells but was solely identified as a marker in the EC population, while EC-like populations were identified by high expression of a variety of digestive enzyme genes. Additionally, gambicin and pgant4, previously described cardia cell markers, were not enriched in Cx. tarsalis cardia cell populations [14,17,22,23]. No gambicin ortholog was identified in the Cx. tarsalis gene-set despite the existence of a gambicin ortholog in the Cx. quinquefasciatus genome (CPIJ016084) and, while several pgant4 orthologs were identified in this gene-set, they were not specifically expressed in any cell population. Thus, identification of cardia populations in our system was based on enrichment for chitin-binding genes, sugar transport genes, and digestive enzyme genes similar to those found in EC-like populations. Our enteroendocrine (EE) cell population was identified by expression of the canonical marker prospero, which aligns with descriptions of EE identification in other systems. However, our EE population showed little to no expression of the previously described mosquito and Drosophila neuropeptides/neuropeptide receptors [14,17,20,30,32]. This could be due to the known bias of scRNA-seq towards highly expressed genes, or due to Cx. tarsalis EE cell secretion of yet uncharacterized neuropeptides. Finally, NEUROD6—an ortholog for a neurogenic differentiation factor frequently found in neurons involved in behavioral reinforcement—was highly expressed in our EE population [36]. Similarly, a gene commonly expressed in neurons (CNMa) has been identified in a subset of EE cells within the Drosophila midgut [14].
We detected WNV RNA (vRNA) in the majority of midgut cells at both timepoints. While vRNA significantly increased in the total midgut by 12dpi, the percent of infected cells did not, demonstrating that the majority of midgut cells that will become infected are infected by 4dpi, while vRNA load increases as infection progresses. The high percentage of WNV-infected cells and permissibility of most cell populations to infection supports previous work demonstrating the extreme vector competence of Cx. tarsalis for WNV [9,56–58].
Interestingly, while WNV infected almost all midgut cell populations, cluster 4 was significantly enriched for vRNA at both timepoints. This high WNV-expressing cluster was associated with very few (<15) defining cluster markers, precluding us from identifying its cell-type. The cluster markers associated with this cell state are composed entirely of mitochondrial genes and mitochondrial tRNAs, suggesting a heightened state of cell stress and/or energy production. Importantly, this cluster was present and enriched for the same mitochondrial genes in the mock condition, indicating that viral replication did not induce significant stress and/or energy production responses [59,60]. Indeed, the presence of cluster 4 in both mock and WNV-infected conditions suggests that this cell state may be associated with blood meal consumption [61,62]. Several previous studies have demonstrated that positive sense-single stranded RNA viruses like dengue virus (DENV), and SARS-CoV-2 modulate mitochondrial dynamics to facilitate replication and/or immune evasion [60,63,64], suggesting potential beneficial interactions between WNV and the mitochondria. Additionally, a study in Lepidoptera (moths and butterflies) purported that enrichment of mitochondrial genes is associated with insect stress resistance [59] and we demonstrated that several heat shock genes (known to be protective against cell stress) were significantly upregulated in WNV-infected midguts, suggesting possible interplay between WNV infection and the stress response [65–67]. However, these heat shock genes were mainly enriched in cluster 8, an un-typed cluster that does not contain the high level of vRNA seen in cluster 4. Additionally, the absence of apoptotic markers in cluster 4 does not discount the possibility that mitochondrial genes are heightened in these cells because they are in the process of dying. If these are dying cells, increased membrane permeability is to be expected and thus high levels of vRNA could be an artifact of the cell state, not a result of it [62].
At 4dpi the EE population is trending toward having more vRNA than other epithelial cell populations and, by 12dpi, has significantly higher levels of vRNA than other populations. Prospero (the canonical marker for EE cells) and select neuroendocrine and vesicle docking genes present in EE cells were positively correlated with vRNA by 12dpi. Additionally, two innate immune system genes were significantly downregulated in the WNV-infected EE population compared to mock at 4dpi. These findings led us to hypothesize that that EE cells serve as sites of enhanced WNV replication during midgut infection. This hypothesis is further supported by previous studies that suggest arboviruses preferentially replicate in highly polarized cell types, such as EE cells [68,69], and previous work with Sindbis virus (SINV) - a mosquito-borne alphavirus—that identified EE cells as a site of infection initiation in Ae. aegypti [70]. Further, a recently published scRNA-seq study of Zika virus infection of the Ae. aegypti midgut implicated EE cells as potential key players in infection [23].
In contrast, the levels of vRNA in ISC/EB and ISC/EB-prol populations, relative to the total population, were strikingly and significantly low. We identified the highest number of DEGs associated with WNV infection in ISC/EB and ISC/EB-prol populations, indicating that they have a greater transcriptional response to WNV infection than other midgut cell types. The majority of these DEGs were downregulated, which is consistent with previous scRNA-seq studies in Aedes aegypti [23]. Upon visualizing the DEGs identified in ISC/EB and ISC/EB-prol populations we observed that predominantly ribosomal and translation related genes were downregulated in infected populations. GO enrichment analysis of all DEGs (cell populations/timepoints combined) that were significantly downregulated in response to infection confirmed that the most highly enriched GO terms denote involvement in ribosomal structure and translation.
Progenitor cells proliferate in response to midgut stress in order to replenish midgut cells impacted by the stressor [71]. While the low levels of vRNA found in ISC/EB and ISC/EB-prol populations could be due to the relative “newness” of the cells comprising this population, our findings suggest that the transcriptional state of ISC/EB populations impedes WNV replication. This is further supported by our cluster of orthologous genes (COG) analysis, which revealed that the base transcriptome of the ISC/EB-prol population—which contains less vRNA than the ISC/EB population and the least vRNA of any epithelial cell population—is composed of fewer COGs that denote translation/ribosome structure and biogenesis than non-proliferating populations. Further, the downregulation of these genes in the WNV-infected condition compared to the mock condition implies that this unfavorable transcriptional state may be a direct response of ISC/EBs to WNV infection. We are not the first to find evidence that proliferating ISCs play a role in infection control in the midgut. Previous work examining Plasmodium infection in the Anopheles midgut revealed that decreased oocyst survival was associated with enhanced proliferation of progenitor cells [72]. Another study characterizing ISC dynamics in response to DENV in the Ae. aegypti midgut found that ISC proliferation increased refractoriness to infection, suggesting that cell turnover is an important part of the midgut immune response, and further implicating ISC/EB populations in infection control within the midgut [73].
While the presence of vRNA alone does not signify active replication, average vRNA levels increased in all epithelial cell populations between timepoints, apart from the ISC/EB and ISC/EB-prol populations (S5 Fig), suggesting that the high vRNA levels in EEs, and low vRNA levels in ISC/EBs, are the result of enhanced and restricted replication respectively. The lack of significant differences associated with the rate of ISC/EB and EE infection at both timepoints further supports that varying levels of vRNA in EE and ISC/EB-prol cells is due to permissivity to replication and not susceptibility to infection. The factors that determine cell-type specific enhancement or suppression of WNV replication are not currently known, but could include more efficient evasion of antiviral pathways, more efficient mechanisms of midgut escape/dissemination, abundance of pro/anti-viral genes or, in the case of ISC/EBs, scarcity of specific genes related ribosome structure and translation.
We captured a population of sessile hemocytes (hemocytes attached to tissue) that we identified as mature granulocytes—a class of hemocytes known to play an important role in the phagocytic and lytic components of the mosquito immune response—at both timepoints, allowing us to characterize a cellular immune component of mosquito midguts [5,15]. The classification of this population as granulocytes was unsurprising, given estimates that granulocytes comprise more than 85% of hemocyte populations in the adult mosquito [51]. There was no evidence of increased immune cell proliferation and attachment to the midgut, or immune gene upregulation in the total infected population compared to mock—suggesting little to no immune activation in the midgut upon WNV infection. This was surprising given the importance of the midgut as a site of innate immune activation [5,6,57,58]. Although scRNA-seq of Cx. tarsalis after WNV infection has not been described, several previous studies in Ae. aegypti and Cx. pipiens have noted significant upregulation of IMD and Toll pathway genes in response to viral infection and highlighted that the innate immune response in mosquitoes is a strong determinant of vector competence [65,74,75]. The absence of a notable immune response to WNV infection in Cx. tarsalis could be a determinant of the vector’s extreme susceptibility and competence [9,56,58]. However, in individual cells, most immune genes had some degree of significant positive correlation with vRNA suggesting that, while WNV infection does not cause significant enrichment of these genes in the total population, WNV infection and replication can influence the expression of these genes at the single-cell level. This finding highlights that scRNA-seq is a powerful tool for characterizing infection dynamics that are not apparent when looking at the population average.
Limitations and future directions
Our inability to detect certain genes (i.e., neuropeptide genes and additional canonical markers we would expect to see) could be due to the low percentage (~30%) of reads mapped to the Cx. tarsalis genome (likely due to either the presence of reads that were too short to map, or the current state of the Cx. tarsalis genome annotation), or the absence of those genes in the existing annotation file. Future scRNA-seq studies in Culex mosquitoes could potentially benefit from adjusting the fragmentation time recommended by 10X Genomics. Further, improvement of the existing Cx. tarsalis genome annotation would facilitate studies of gene expression in this species. A multitude of genes detected in our dataset remain uncharacterized due to a lack of appropriate orthologs, which could be explained by the evolutionary divergence between Cx. tarsalis and the species from which most gene orthologs were derived; Cx. quinquefasciatus and Ae. Aegypti, which diverged 15–22 million years ago (MYA) and 148-216 MYA respectively [12]. High levels of vRNA in specific cell types imply that replication is occurring/has occurred but it is important to note that the presence of vRNA is not analogous to active viral replication (e.g., the presence of vRNA could be the result of phagocytosis of an infected cell). Future studies could use qRT-PCR, focusing on our top genes of interest, to measure expression kinetics and levels following midgut infection in Cx. tarsalis and other relevant vectors. Finally, early WNV infection in the Cx. tarsalis midgut will be further studied in our lab via immunofluorescence and flow cytometry assays using cell-type and cell-state specific antibodies in whole midguts, putting the findings described here into a spatial context.
Conclusion
The work presented here demonstrates that WNV is capable of infecting most midgut cell types in Cx. tarsalis. Moreover, while most cells within the midgut are susceptible to WNV infection, we observed modest differences in virus replication efficiency that appeared to occur in a cell-type specific manner, with EE cells being the most permissive and proliferating ISC/EB cells being the most refractory. Further, our findings demonstrate that ISC/EBs downregulate genes associated with ribosome structure and translation during WNV infection, and suggest that the resulting transcriptional state of ISC/EBs is unfavorable for WNV replication. We observed minimal to no upregulation of key mosquito immune genes in the midgut as a whole, however, we show that immune gene expression can be correlated with WNV vRNA level within individual cells. Additionally, we have generated a midgut cell atlas for Cx. tarsalis and, in doing so, improved the field’s understanding of how WNV establishes infection in this highly competent vector.
Methods
Virus
All infections were performed with a recombinant barcoded WNV (bcWNV) passage 2 stock (epidemic lineage I strain, 3356) grown on Vero cells. Titer for the stock was determined by standard Vero cell plaque assay [76].
Mosquito infection
Mosquito studies were conducted using laboratory colony-derived Cx. tarsalis mosquitoes (>50 passages). WNV infections in mosquitoes were performed under A-BSL3 conditions. Larvae were raised on a diet of powdered fish food. Mosquitoes were maintained at 26°C with a 16:8 light:dark cycle and maintained at 70–80% relative humidity, with water and sucrose provided ad libitum. Cx. tarsalis mosquitoes were transferred to A-BSL3 conditions 48 hours prior to blood feeding, and dry starved 20–24 hours before blood feeding. Seven days after pupation (6–7 days after emergence) mosquitoes were exposed to an infectious bloodmeal containing a 1:1 dilution of defibrinated calf’s blood and bcWNV stock diluted in infection media (Dulbecco’s Modified Eagle’s Medium, 5% penicillin-streptomycin, 2% amphotericin B, and 1% fetal bovine serum (FBS)) for a final concentration of 3-6e7 PFU/mL, or a mock bloodmeal containing a 1:1 dilution of defibrinated calf’s blood and infection media. All bloodmeals were provided in a hog’s gut glass membrane feeder, warmed by circulating 37°C water. Following 50–60 minutes of feeding, mosquitoes were cold-anesthetized, and engorged females were separated into cartons and maintained on sucrose.
Collection of mosquito tissues
At indicated time points, mosquitoes were cold-anesthetized and transferred to a dish containing Sf900III insect cell culture media (Gibco) with 5% FBS. Midguts were dissected, and transferred to tubes containing 500uL Sf900III media + 5% FBS and kept on ice for the duration of dissections. Ten pooled midguts per sample were collected for dissociation and sequencing.
Midgut dissociation and single-cell suspension preparation
A dissociation buffer containing Bacillus licheniformis protease (10mg/mL) and DNAse I (25U/mL) was prepared in Sf900III media (Gibco). Pools of 10 midguts were resuspended in dissociation media, transferred to a 96-well culture dish, and triturated with a p1000 pipet at 15–20 minute intervals for 105 minutes. At each interval 100–125μL containing dissociated single-cells was collected (with replacement) from the top of the dissociation reaction and transferred to 25mL of Sf900III + 5% FBS on ice. Dissociation reactions were kept covered at 4°C between trituration. Upon complete tissue dissociation the entire remaining volume of each reaction was transferred to Sf900III + 5% FBS on ice. Collection tubes with dissociated cells were centrifuged at 700xg for 10 minutes at 4°C, resuspended in 500μL of Sf900III + 5% FBS, and passed through a 40μm small volume filter (PluriSelect). Immediately prior to loading on the Chromium Controller, cell suspensions were spun down at 700xg for 10 minutes and washed twice in 1mL PBS + 0.04% bovine serum albumin (BSA), and resuspended in 50μL of PBS + 0.04% BSA. Cell concentration and viability was determined using the Countess II Automated Cell Counter (Thermo Fisher Scientific). Appropriate cell suspension volume (target recovery of 10,000 live cells) was loaded on the Chromium controller. Further details on this protocol can be found here-dx.doi.org/10.17504/protocols.io.j8nlke246l5r/v1.
Gel bead-in emulsions (GEM) generation and cDNA synthesis
GEM generation and cDNA synthesis were performed using the Next GEM Single Cell 5’ GEM kit v2 (PN-1000266) and Next GEM Chip K Single Cell Kit (PN-1000286) (10X Genomics). Reactions for GEM generation were prepared according to the Chromium Next GEM Single Cell 5’ Reagent Kit v2 (dual index) user guide with one alteration; a primer specific to the WNV envelope region of the genome (5′-AAGCAGTGGTATCAACGCAGAGTAC-GGGTCAGCACGTTTGTCATTG-3′) was added to all samples at a concentration of 10nM (7.5μl – displacing 7.5μl of the total H2O added to each reaction) [21,77]. GEMs were generated using the 10X Chromium controller X series and cDNA was synthesized according to the Chromium Next GEM Single Cell 5’ Reagent Kit v2 (dual index) user guide.
Library preparation and sequencing
Four libraries were prepared using the Next GEM single cell 5’ v2 library construction kit and Dual Index Kit TT Set A (10X Genomics, PN-1000190 and PN-1000215 respectively). Library construction was carried out according to the Chromium Next GEM Single Cell 5’ Reagent Kit v2 (dual index) user guide. Library concentration was determined by KAPA Library Quantification Kit (Roche). Libraries were then diluted to 15nM, pooled by volume, and sequenced at the CU Anschutz Genomics and Microarray core on the NovaSeq 6000 (150 × 10 × 10 × 150) (Illumina) at a target coverage of 4.0e8 read pairs per sample, equating to 40,000 read pairs per cell. Average sample coverage and cell recovery per sample was 5.3e8 read pairs and 2417.9 cells respectively (S1 File).
Reference generation and sample processing with cell ranger
The gene feature files associated with the Cx. tarsalis and WNV genomes were converted to gene transfer format (gtf) using AGAT (v1.0.0) prior to being filtered with the CellRanger (v7.0.1) mkgtf function [78]. An ‘MT-‘prefix was manually added to all non-tRNA features located in the mitochondrial chromosome of the Cx. tarsalis genome. The contents of the filtered WNV genome feature file, and fasta file, were appended to the Cx. tarsalis feature and fasta files respectively, and run through CellRanger::mkref. All sequencing data were processed and mapped to the aforementioned Cx. tarsalis/WNV combined reference genome using CellRanger::count with the following parameters: --include-introns=true \ --expect-cells=10000.
Quality control and Seurat workflow
Cell Ranger output files were individually read into RStudio (RStudio - v2023.09.0+463, R – v4.3.2) as SingleCellExperiment objects using the singleCellTK package (v2.12.0). Doublet identification and ambient RNA estimation were performed with singleCellTK::runCellQC using the algorithms “scDblFinder” and “DecontX” respectively. Samples were filtered for doublets and ambient RNA contamination by keeping cells with the following metrics: decontX_contamination<0.6, scDblFinder_doublet_score<0.9. Samples were then converted to Seurat objects, log normalized individually, and merged into one Seurat object, with columns pertaining to sample of origin, infection condition, and timepoint added to the object metadata prior to processing as described in the Seurat (v4.3.0.1) guided clustering tutorial [79]. Briefly, mitochondrial gene percentage for each cell was calculated, and cells with the following metrics were retained: nFeature_RNA>100, nFeature_RNA<2500, percent_mt<25 [14]. Cell retention metrics were informed by a previous study of the Drosophila midgut [14]. Percent of cells retained after QC for each sample can be found in S1 File. Features were log normalized, variable features were identified, the top 2000 variable features were scaled, a principal component (PC) analysis dimensionality reduction was run, and the number of PCs needed to adequately capture variation in the data was determined via elbow plot. Nearest neighbors were computed, and appropriate clustering granularity was determined with Clustree (v0.5.0). Uniform Manifold Approximation and Projection (UMAP) dimensional reduction was performed, and clusters were visualized with UMAP reduction. Cluster markers were identified with Seurat::FindConservedMarkers using default parameters and infection condition as the grouping variable. WNV vRNA as a cluster marker was identified by splitting the dataset by infection condition and timepoint and using Seurat::FindAllMarkers on the WNV-infected samples with a logfc threshold of 0.25. In all cases where calculations were performed on individual replicates or individual conditions, the merged Seurat object was split by sample or condition using Seurat::SplitObject so that calculations performed on subsets of the data or individual replicates were derived from a dataset that had been normalized as one. Percent expression and average expression were calculated with scCustomize::Percent_Expressing (v1.1.3) and Seurat::AverageExpression respectively. Feature expression levels were visualized with Seurat::FeaturePlot and Seurat::VlnPlot. Solely mitochondrial genes and tRNAs were identified as markers for cluster 4, suggesting that this cluster was composed of dying cells. While it is common practice to remove dying cells from scRNAseq datasets further examination of cluster 4 revealed 12.27% of features were mitochondrial, 0.97% were viral, and 33.91% were tRNA, demonstrating that the percentage of mitochondrial genes was well below our QC threshold and was not artificially deflated by high vRNA levels. As we could neither confirm cell death nor rule out the possibility that cluster 4 represents a highly metabolically active cell state we did not see fit to remove this population from our dataset.
Trajectory inference with Slingshot
Lineage structure and pseudotime inference was performed using the Slingshot (v2.10.0) and tradeSeq (v1.16.0) functions getLineages, getCurves, and fitGAM successively on the dataset containing epithelial cell populations (timepoints combined). ISC/EB and ISC/EB-prol populations were specified as the start state and EC and EE populations were specified as the end state for lineage determination.
Pseudo-bulk differential expression analysis with DESeq2
Pseudo-bulk dataset was generated and differential expression analysis performed as described by Khushbu Patel’s (aka bioinformagician) pseudo-bulk analysis for single-cell RNA-Seq data workflow tutorial [80]. Briefly, raw counts were aggregated at the sample level using Seurat::AggregateExpression, and the aggregated counts matrix was extracted and used to create a DESeq2 object (dds). The dds object was filtered to retain genes with counts >=10 prior to running DESeq2 and extracting results for the appropriate contrast. For total population DEGs specific to timepoints we subset the dataset by timepoint. For DEGs specific to midgut cell populations we subset the dataset by ident/cluster/population prior to performing the DESeq2 workflow. We identified infection associated DEGs between timepoints by performing a pseudo-bulk DE analysis between 4dpi and 12dpi WNV-infected samples and mock samples separately. We then filtered out DEGs associated only with bloodmeal consumption (genes that came up as significantly differentially expressed between timepoints in our mock condition) leaving only DEGs associated with infection. DESeq2 results for the total population were visualized via EnhancedVolcano (v1.20.0). DESeq2 results for specific midgut cell populations were visualized in Prism (10.0.3).
Gene ontology enrichment analysis
GOEA was performed by generating a list of Culex quinquefasciatus ortholog accession numbers corresponding to all up or downregulated DEGs identified in specific cell populations. In instances where accession numbers were available for orthologs in multiple species the Culex quinquefasciatus ortholog was selected. Accession number lists were submitted to ShinyGO 0.80 (web version - http://bioinformatics.sdstate.edu/go/) with the Cx. quinquefasciatus CpipJ2 assembly set as a reference, FDR cutoff set to 0.05, and default settings for all other parameters [81]. GOEA plots were generated using the “chart” tab, and downloaded directly from ShinyGO 0.80 [81].
Gene correlation with vRNA level
Raw gene counts for each timepoint were extracted and normalized using scLink::sclink_norm (v1.0.1) [82]. Correlation matrices were then generated for the top 500 variable genes (identified during the Seurat guided clustering workflow) using scLink::sclink_corr [82]. For specific neuroendocrine and immune gene correlations a vector of specific gene names was supplied in lieu of the top 500 variable genes. P-values associated with correlation values were determined via bootstrapping. Average correlation of 1000 random genes was determined by generating a vector of all Seurat object rownames containing the string “gene”, randomly sampling 999 genes from this vector, appending the WNV 5’ UTR feature ID to the subsample, and normalizing and determining correlations as described above. Mean correlation value and 95% confidence intervals were calculated in Prism (10.0.3)
Ortholog identification with eggNOG mapper
Coding sequence (CDS) genome coordinates were extracted from the Cx. tarsalis gene transfer format file (gtf) and corresponding genome sequences were extracted from the Cx. tarsalis genome fasta file using Bedtools (v2.26.0). CDS were then assigned orthologs via eggNOG-mapper v2 (web version - http://eggnog-mapper.embl.de) using default parameters (S4 File) [83,84]. Gene IDs from the gtf file… were merged with the eggNOG output file using custom R scripts. Gene ID, ortholog seed, preferred gene name, COG category notation, PFAM information, and gene description were retained in a gene name and ID file within which information pertaining to identical gene IDs was aggregated using a custom R script. This aggregated gene name and ID file (S5 File) was used to assign gene names to the marker files used for cell-typing, DESeq2 results files, scLink correlation matrices, etc.
Statistical analyses
Statistical analyses were performed in GraphPad Prism version 10.0.3. Differences in vRNA level in known clusters were measured by one-way ANOVA with Tukey’s multiple comparisons test. Differences in average expression and percent expression of mosquito immune genes between mock and WNV-infected conditions were measured by multiple unpaired t-tests. Differences in hemocyte population proportion were measured by unpaired t-test.
All scripts used for data processing, analysis, and visualization are available on GitHub: https://github.com/fitz-meyer/scRNA_seq_fitz
RDS files for all samples included in the described analyses are available on GitHub: https://github.com/fitz-meyer/scRNA_seq_rdsFiles.git
All sequencing data files included in the described analyses are available on NCBI.
Supporting information
S1 Fig. COG profiles of midgut cell populations.
Cluster of orthologous gene (COG) profiles for (A) genes expressed in ≥75% of cells in each cluster and (B) cluster marker genes were visualized as percentage of total for each cluster/population. Colors represent COG notation A-Z, NA and DIV as shown in the notation key embedded in the figure. Where applicable, marker gene COG profiles were derived from cluster markers that are conserved between infection conditions.
https://doi.org/10.1371/journal.ppat.1012855.s001
(TIF)
S2 Fig. Cluster proportion and grouping by condition and replicate.
Proportion of the total population comprised by each cluster compared between mock and WNV-infected conditions at 4dpi (A) and 12dpi (B). Only significant comparisons shown. Significance determined by multiple unpaired t-tests. Bar = mean, error bars = SD. (C) Cluster grouping and composition by sample—mock and infected samples both plotted. Different colors denote different samples. (D) Cluster grouping and composition by infection condition. Salmon = mock, blue = WNV-infected.
https://doi.org/10.1371/journal.ppat.1012855.s002
(TIF)
S3 Fig. Characterizing EE, ISC/EB, and HC cell-types.
Visualizing expression of (A) neuroendocrine and secretory genes, (B) proliferation and mitotic marker genes, and (C) HC class marker genes across the total population to determine specificity of expression. Violin plots show expression level in individual cells grouped by cluster. (D) Gene IDs and accession numbers for all genes described in this figure.
https://doi.org/10.1371/journal.ppat.1012855.s003
(PPTX)
S4 Fig. Enrichment for identical sets of mitochondrial genes and tRNAs specific to cluster 4.
Dot size denotes percent of cells expressing each gene, color denotes scaled gene expression. “MT-“ prefix was added to tRNA gene names in this figure for clarity. Mitochondrial tRNAs were not included in mitochondrial gene estimation for QC filtering. Plots were derived from only WNV-infected samples.
https://doi.org/10.1371/journal.ppat.1012855.s004
(TIF)
S5 Fig. Side by side comparison of vRNA levels in midgut cell populations at each timepoint.
Purple = 4dpi, green = 12dpi. Points within bars denote replicate values, bar denotes mean of replicate values. Error bars = SD.
https://doi.org/10.1371/journal.ppat.1012855.s005
(TIF)
S6 Fig. Examining transcriptional activity of clusters.
Feature counts and RNA counts for individual cells within clusters.
https://doi.org/10.1371/journal.ppat.1012855.s006
(TIF)
S7 Fig. Confirming that cell death does not drive clustering.
Expression of apoptotic and anti-apoptotic genes visualized in the total population via UMAP feature plot. Color in feature map denotes expression level.
https://doi.org/10.1371/journal.ppat.1012855.s007
(TIF)
S8 Fig. S phase markers.
Expression of S phase gene markers visualized in the total population via UMAP feature plot. Color in feature map denotes expression level.
https://doi.org/10.1371/journal.ppat.1012855.s008
(TIF)
S9 Fig. G2/M phase markers.
Expression of G2/M phase gene markers visualized in the total population via UMAP feature plot. Color in feature map denotes expression level.
https://doi.org/10.1371/journal.ppat.1012855.s009
(TIF)
S10 Fig. Visually confirming vRNA level is correlated with select immune gene expression without significantly increasing expression in the total population.
Correlation between vRNA and 3 of the most highly correlated immune genes (as determined by scLink) confirmed by feature scatter. Equivalent expression levels of immune genes between mock and WNV-infected conditions confirmed by violin plot. Scatter plots derived from only WNV-infected replicates.
https://doi.org/10.1371/journal.ppat.1012855.s010
(TIF)
S11 Fig. Visualizing cell-type marker expression along epithelial cell lineages identified by trajectory analysis.
Lineage 1 (green–EC lineage) and 2 (blue–EE lineage) displayed expression of klumpfuss (Klu), the canonical marker for enteroblasts, prior to differentiation. PCNA expression was associated with the progression of each lineage into the proliferating ISC/EB population. The canonical marker for ECs (POU2F1/nubbin) was enriched upon differentiation of lineage 1 into ECs. The canonical marker for EEs (PROX1/prospero) was enriched upon differentiation of lineage 2 into EEs.
https://doi.org/10.1371/journal.ppat.1012855.s011
(TIF)
S12 Fig. Nonspecific expression of piezo.
Expression of the EEP gene piezo in individual cells grouped by cluster.
https://doi.org/10.1371/journal.ppat.1012855.s012
(TIF)
S1 Table. Cluster proportion.
Percent of the total midgut cell population each distinct cluster composes.
https://doi.org/10.1371/journal.ppat.1012855.s013
(XLSX)
S2 Table. WNV vRNA as a cluster marker.
Log2FC values of the WNV 5’ UTR (nbis-gene-2-utr) in clusters where vRNA was identified as a significant (p-adj<0.0.5) cluster marker in the WNV-infected condition. Log2FC values denote vRNA enrichment in a cluster relative to the total remaining population.
https://doi.org/10.1371/journal.ppat.1012855.s014
(XLSX)
S3 Table. All significant DEGs identified as associated with WNV-infection in midgut cell populations.
Significant (p-adj<0.05) differentially expressed genes (DEGs) grouped by population and timepoint. Log2FC values denote enrichment of a gene in a specific cell population in the WNV-infected condition relative to that specific cell population in the mock condition.
https://doi.org/10.1371/journal.ppat.1012855.s015
(XLSX)
S4 Table. Log2FC values and accession numbers for total population pseudo-bulk differential expression analysis.
Gene ID, gene name, Log2FC value, ortholog accession number, adjusted p-value, and description for all genes significantly (p-adj<0.05) differentially expressed between infected and mock conditions.
https://doi.org/10.1371/journal.ppat.1012855.s016
(XLSX)
S5 Table. Complete list of genes identified as significantly positively correlated with WNV vRNA.
Gene name, ortholog accession number, gene ID, correlation value, p-value, and description for all genes significantly (p<0.05) positively correlated (>0.65) with vRNA. Significance was determined by bootstrapping.
https://doi.org/10.1371/journal.ppat.1012855.s017
(XLSX)
S6 Table. Innate immune gene identifiers.
Gene ID, gene name, and accession number for mosquito innate immune genes compared between mock and WNV-infected conditions.
https://doi.org/10.1371/journal.ppat.1012855.s018
(XLSX)
S1 File. scRNA-seq run summaries
Provides technical details for all samples processed as part of this study: cells recovered, median genes per cell, mean reads per cell, percent of reads mapped to genome, percent of cells retained after initial QC, percent of cells infected, coverage, %Q30, pooling method, cell viability, prep details/alterations, and justification for exclusion of samples.
https://doi.org/10.1371/journal.ppat.1012855.s019
(XLSX)
S2 File. Midgut cell markers.
Gene ID, accession number, name/description, and expression information for all genes used in identifying or supporting the identification of each cell population.
https://doi.org/10.1371/journal.ppat.1012855.s020
(XLSX)
S3 File. Top 50 conserved markers.
The top 50 (arranged by positive log2FC in the mock condition) conserved cluster markers identified via Seurat’s FindConservedMarkers function for all 20 clusters.
https://doi.org/10.1371/journal.ppat.1012855.s021
(XLSX)
S4 File. Original eggNOG mapper output file.
Unaltered eggNOG mapper ortholog identification results for all coding sequences identified in the Cx. tarsalis genome feature file.
https://doi.org/10.1371/journal.ppat.1012855.s022
(XLSX)
S5 File. Aggregated gene ID ortholog list with accession number/ortholog seed.
eggNOG mapper ortholog identification output merged with gene IDs from the Cx. tarsalis genome feature file with gene ID repeats collapsed so that each gene ID corresponds to one ortholog/gene name entry.
https://doi.org/10.1371/journal.ppat.1012855.s023
(CSV)
Acknowledgments
Next Generation Sequencing was performed at the Genomics Shared Resource (RRID: SCR_021984) at the University of Colorado. This resource is supported by Cancer Center Support Grant (P30CA046934).
References
- 1. Girard M, Nelson CB, Picot V, Gubler DJ. Arboviruses: a global public health threat. Vaccine. 2020 May;38(24):3989–94. pmid:32336601
- 2. Pierson TC, Diamond MS. The continued threat of emerging flaviviruses. Nat Microbiol. 2020 May 4;5(6):796–812. pmid:32367055
- 3. Carpenter A, Bryant WB, Santos SR, Clem RJ. Infection of Aedes aegypti mosquitoes with midgut-attenuated Sindbis virus reduces, but does not eliminate, disseminated infection. J Virol. 2021 Jun 10;95(13).
- 4. Weaver SC, Forrester NL, Liu J, Vasilakis N. Population bottlenecks and founder effects: implications for mosquito-borne arboviral emergence. Nat Rev Microbiol. 2021;19(3):184–95. pmid:33432235
- 5. Hixson B, Taracena ML, Buchon N. Midgut epithelial dynamics are central to mosquitoes’ physiology and fitness, and to the transmission of vector-borne disease. Front Cell Infect Microbiol. 2021 Mar 25;11.
- 6. Caccia S, Casartelli M, Tettamanti G. The amazing complexity of insect midgut cells: types, peculiarities, and functions. Cell Tissue Res. 2019 Sep 29;377(3):505–25.
- 7. Lan H, Chen H, Liu Y, Jiang C, Mao Q, Jia D. Small interfering RNA pathway modulates initial viral infection in midgut epithelium of insect after ingestion of virus. J Virol. 2016 Jan 15;90(2):917–29.
- 8.
Center for Disease Control and Prevention, West Nile virus. 2023. Available from: https://www.cdc.gov/westnile/statsmaps/data-and-maps.html [Internet].
- 9. Dunphy BM, Kovach KB, Gehrke EJ, Field EN, Rowley WA, Bartholomay LC, et al. Long-term surveillance defines spatial and temporal patterns implicating Culex tarsalis as the primary vector of West Nile virus. Sci Rep. 2019 Apr 29;9(1):6637. pmid:31036953
- 10. Turell MJ, O’Guinn ML, Dohm DJ, Webb JP Jr, Sardelis MR. Vector competence of Culex tarsalis from Orange County, California, for West Nile virus. Vector Borne Zoon Dis. 2002 Sep;2(3):193–6. pmid:12737548
- 11. Artsob H, Gubler DJ, Enria DA, Morales MA, Pupo M, Bunning ML. West Nile virus in the New World: trends in the spread and proliferation of West Nile virus in the Western Hemisphere. Zoon Public Health. 2009 Aug 9;56(6–7):357–69.
- 12. Main BJ, Marcantonio M, Johnston JS, Rasgon JL, Brown CT, Barker CM. Whole-genome assembly of Culex tarsalis. G3 Genes|Genomes|Genetics. 2021 Apr 12;11(2).
- 13. Merkling SH, Lambrechts L. Taking insect immunity to the single-cell level. Trends Immunol. 2020 Mar;41(3):190–9. pmid:32035764
- 14. Hung R-J, Hu Y, Kirchner R, Liu Y, Xu C, Comjean A, et al. A cell atlas of the adult Drosophila midgut. Proc Natl Acad Sci U S A. 2020 Jan 21;117(3):1514–23. pmid:31915294
- 15. Severo MS, Landry JJM, Lindquist RL, Goosmann C, Brinkmann V, Collier P, et al. Unbiased classification of mosquito blood cells by single-cell genomics and high-content imaging. Proc Natl Acad Sci. 2018 Aug 7;115(32).
- 16. Raddi G, Barletta ABF, Efremova M, Ramirez JL, Cantera R, Teichmann SA, et al. Mosquito cellular immunity at single-cell resolution. Science (1979). 2020 Aug 28;369(6507):1128–32.
- 17. Cui Y, Franz AWE. Heterogeneity of midgut cells and their differential responses to blood meal ingestion by the mosquito, Aedes aegypti. Insect Biochem Mol Biol. 2020 Dec;127:103496. pmid:33188922
- 18. Kwon H, Mohammed M, Franzén O, Ankarklev J, Smith RC. Single-cell analysis of mosquito hemocytes identifies signatures of immune cell subtypes and cell differentiation. Elife. 2021 Jul 28;10.
- 19. Lee W-S, Webster JA, Madzokere ET, Stephenson EB, Herrero LJ. Mosquito antiviral defense mechanisms: a delicate balance between innate immunity and persistent viral infection. Parasit Vectors. 2019 Dec 11;12(1):165. pmid:30975197
- 20. Hung R-J, Li JSS, Liu Y, Perrimon N. Defining cell types and lineage in the Drosophila midgut using single cell transcriptomics. Curr Opin Insect Sci. 2021 Oct;4712–7. pmid:33609768
- 21. Sanborn M, Li T, Victor K, Siegfried H, Fung C, Rothman A. Analysis of cell-associated DENV RNA by oligo(dT) primed 5’ capture scRNAseq. Sci Rep. 2020 Jun 3;10(1):9047.
- 22. Wang S, Huang Y, Wang F, Han Q, Ren N, Wang X, et al. A cell atlas of the adult female Aedes aegypti midgut revealed by single-cell RNA sequencing. Sci Data. 2024 Jun 5;11(1):587. pmid:38839790
- 23. Chen T-Y, Raduwan H, Marín-López A, Cui Y, Fikrig E. Zika virus exists in enterocytes and enteroendocrine cells of the Aedes aegypti midgut. iScience. 2024 Jul;27(7):110353. pmid:39055935
- 24. Hess NK, Singer PA, Trinh K, Nikkhoy M, Bernstein SI. Transcriptional regulation of the Drosophila melanogaster muscle myosin heavy-chain gene. Gene Expr Patterns. 2007 Feb;7(4):413–22. pmid:17194628
- 25. Dietrich DR. Toxicological and pathological applications of proliferating cell nuclear antigen (PCNA), a novel endogenous marker for cell proliferation.. Crit Rev Toxicol. 1993 Jan 25;23(1):77–109.
- 26. Ng L, Prelich G, Anderson CW, Stillman B, Fisher PA. Drosophila proliferating cell nuclear antigen. Structural and functional homology with its mammalian counterpart. J Biol Chem. 1990 Jul;265(20):11948–54. pmid:1973166
- 27. Piermarini PM, Esquivel CJ, Denton JS. Malpighian tubules as novel targets for mosquito control. Int J Environ Res Public Health. 2017 Jan 24;14(2):111. pmid:28125032
- 28. Li Y, Piermarini P, Esquivel C, Drumm H, Schilkey F, Hansen I. RNA-Seq comparison of larval and adult Malpighian tubules of the yellow fever mosquito Aedes aegypti reveals life stage-specific changes in renal function. Front Physiol. 2017 May 9;8.
- 29. Schoofs L, De Loof A, Van Hiel M. Neuropeptides as regulators of behavior in insects. Annu Rev Entomol. 2017 Jan 31;62(1):35–52.
- 30. Predel R, Neupert S, Garczynski S, Crim J, Brown M, Russell W. Neuropeptidomics of the Mosquito Aedes aegypti. J Proteome Res. 2010 Apr 5;9(4):2006–15.
- 31. Chen J, Kim S-M, Kwon JY. A Systematic analysis of drosophila regulatory peptide expression in enteroendocrine cells. Mol Cells. 2016 Apr;39(4):358–66. pmid:27025390
- 32. C .Christ P, Hill SR, Schachtner J, Hauser F, Ignell R. Functional characterization of mosquito short neuropeptide F receptors. Peptides. 2018 May;103:31–9. pmid:29550617
- 33. Torii S. Expression and function of IA-2 family proteins, unique neuroendocrine-specific protein-tyrosine phosphatases. Endocr J. 2009;56(5):639–48. pmid:19550073
- 34. Mbikay M, Seidah NG, Chrétien M. Neuroendocrine secretory protein 7B2: structure, expression and functions. Biochem J. 2001 Jul 15;357(Pt 2):329–42. pmid:11439082
- 35. Shao X, Li C, Fernandez I, Zhang X, Südhof TC, Rizo J. Synaptotagmin-syntaxin interaction: the C2 domain as a Ca2+-dependent electrostatic switch. Neuron. 1997 Jan;18(1):133–42. pmid:9010211
- 36. Bimpisidis Z, König N, Stagkourakis S, Zell V, Vlcek B, Dumas S, et al. The NeuroD6 Subtype of VTA neurons contributes to psychostimulant sensitization and behavioral reinforcement. eNeuro. 2019;6(3):ENEURO.0066-19.2019. pmid:31097625
- 37. Kumar A, Srivastava P, Sirisena P, Dubey S, Kumar R, Shrinet J, et al. Mosquito innate immunity. Insects. 2018 Aug 8;9(3):95.
- 38. Cardoso-Jaime V, Tikhe CV, Dong S, Dimopoulos G. The role of mosquito hemocytes in viral infections. Viruses. 2022 Sep 20;14(10):2088.
- 39. Jneid R, Loudhaief R, Zucchini-Pascal N, Nawrot-Esposito MP, Fichant A, Rousset R. Bacillus thuringiensis toxins divert progenitor cells toward enteroendocrine fate by decreasing cell adhesion with intestinal stem cells in Drosophila.. Elife. 2023 Feb 27;12.
- 40. Baulies A, Angelis N, Li V. Hallmarks of intestinal stem cells. Development. 2020 Aug 1;147(15).
- 41. Abdelbaki A, Akman HB, Poteau M, Grant R, Gavet O, Guarguaglini G. AURKA destruction is decoupled from its activity at mitotic exit but is essential to suppress interphase activity. J Cell Sci. 2020 Jun 15;133(12).
- 42. Portella G, Passaro C, Chieffi P. Aurora B: a new prognostic marker and therapeutic target in cancer. Curr Med Chem. 2011 Feb 1;18(4):482–96. pmid:21143115
- 43. Aguilera P, López-Contreras AJ. ATRX, a guardian of chromatin. Trends Genet. 2023 Jun;39(6):505–19. pmid:36894374
- 44. Lilley CE, Chaurushiya MS, Weitzman MD. Chromatin at the intersection of viral infection and DNA damage. Biochim Biophys Acta. 2010 Mar;1799(3–4):319–27. pmid:19616655
- 45. Tsai K, Thikmyanova N, Wojcechowskyj JA, Delecluse H-J, Lieberman PM. EBV tegument protein BNRF1 disrupts DAXX-ATRX to activate viral early gene transcription. PLoS Pathog. 2011 Nov 10;7(11):e1002376. pmid:22102817
- 46. Gorman MJ, Paskewitz SM. Serine proteases as mediators of mosquito immune responses. Insect Biochem Mol Biol. 2001;31(3):257–62. pmid:11167095
- 47. Cao X, Gulati M, Jiang H. Serine protease-related proteins in the malaria mosquito, Anopheles gambiae. Insect Biochem Mol Biol. 2017 Sep;88:48–62. pmid:28780069
- 48. Brackney DE, Foy BD, Olson KE. The effects of midgut serine proteases on dengue virus type 2 infectivity of Aedes aegypti. Am J Trop Med Hyg. 2008 Aug;79(2):267–74. pmid:18689635
- 49. Skorokhod O, Vostokova E, Gilardi G. The role of P450 enzymes in malaria and other vector-borne infectious diseases. BioFactors. 2024 Jan 9;50(1):16–32.
- 50. Dzaki N, Ramli KN, Azlan A, Ishak IH, Azzam G. Evaluation of reference genes at different developmental stages for quantitative real-time PCR in Aedes aegypti. Sci Rep. 2017 Mar 16;7(1):43618.
- 51. Hillyer JF, Strand MR. Mosquito hemocyte-mediated immune responses. Curr Opin Insect Sci. 2014 Sep;3:14–21. pmid:25309850
- 52. Cai D, Liu L, Tian B, Fu X, Yang Q, Chen J. Dual-role ubiquitination regulation shuttling the entire life cycle of the Flaviviridae. Front Microbiol. 2022 May 6;13.
- 53.
Pujhari S, Hughes GL, Pakpour N, Suzuki Y, Rasgon JL. Wolbachia-induced inhibition of O’nyong nyong virus in Anopheles mosquitoes is mediated by Toll signaling and modulated by cholesterol. bioRxiv. 2023 Jun 1.
- 54. Stokes S, Almire F, Tatham MH, McFarlane S, Mertens P, Pondeville E, et al. The SUMOylation pathway suppresses arbovirus replication in Aedes aegypti cells. PLoS Pathog. 2020 Dec 22;16(12):e1009134. pmid:33351855
- 55. Souza-Neto JA, Sim S, Dimopoulos G. An evolutionary conserved function of the JAK-STAT pathway in anti-dengue defense. Proc Natl Acad Sci U S A. 2009 Oct 20;106(42):17841–6. pmid:19805194
- 56. Fikrig E, Main AJ, Cheng G, Anderson JF, Ferrandino FJ. Horizontal and vertical transmission of West Nile virus genotype NY99 by Culex salinarius and genotypes NY99 and WN02 by Culex tarsalis. Am J Trop Med Hyg. 2012 Jan 1;86(1):134–9.
- 57. Grubaugh ND, Weger-Lucarelli J, Murrieta RA, Fauver JR, Garcia-Luna SM, Prasad AN, et al. Genetic drift during systemic arbovirus infection of mosquito vectors leads to decreased relative fitness during host switching. Cell Host Microbe. 2016 Apr 13;19(4):481–92. pmid:27049584
- 58. Fitzmeyer EA, Gallichotte EN, Weger-Lucarelli J, Kapuscinski ML, Abdo Z, Pyron K, et al. Loss of West Nile virus genetic diversity during mosquito infection due to species-dependent population bottlenecks. iScience. 2023 Oct;26(10):107711. pmid:37701570
- 59. Pandey A, Suman S, Chandna S. Predictive role of mitochondrial genome in the stress resistance of insects and nematodes. Bioinformation. 2010 Jun 24;5(1):21–7.
- 60. Gatti P, Ilamathi H, Todkar K, Germain M. Mitochondria targeted viral replication and survival strategies—prospective on SARS-CoV-2. Front Pharmacol. 2020 Aug 28;11.
- 61. Okuda K, de Almeida F, Mortara RA, Krieger H, Marinotti O, Bijovsky AT. Cell death and regeneration in the midgut of the mosquito, Culex quinquefasciatus. J Insect Physiol. 2007 Dec;53(12):1307–15. pmid:17716685
- 62. Zhang Y, Chen X, Gueydan C, Han J. Plasma membrane changes during programmed cell deaths. Cell Res. 2018 Jan 27;28(1):9–21.
- 63. Barbier V, Lang D, Valois S, Rothman AL, Medin CL. Dengue virus induces mitochondrial elongation through impairment of Drp1-triggered mitochondrial fission. Virology. 2017 Jan;500:149–60. pmid:27816895
- 64. Chatel-Chaix L, Cortese M, Romero-Brey I, Bender S, Neufeldt CJ, Fischl W, et al. Dengue virus perturbs mitochondrial morphodynamics to dampen innate immune responses. Cell Host Microbe. 2016 Sep;20(3):342–56. pmid:27545046
- 65. Evans C, Chang L, Gestwicki J. Heat shock protein 70 (Hsp70) as an emerging drug target. J Med Chem. 2010 Jun 24;53(12):4585–602.
- 66. Li D-C, Yang F, Lu B, Chen D-F, Yang W-J. Thermotolerance and molecular chaperone function of the small heat shock protein HSP20 from hyperthermophilic archaeon, Sulfolobus solfataricus P2. Cell Stress Chaperones. 2012 Jan;17(1):103–8. pmid:21853411
- 67. Qian J, Vafiadaki E, Florea SM, Singh VP, Song W, Lam CK. Small heat shock protein 20 interacts with protein phosphatase-1 and enhances sarcoplasmic reticulum calcium cycling. Circ Res. 2011 Jun 10;108(12):1429–38.
- 68. Tamhankar M, Patterson JL. Directional entry and release of Zika virus from polarized epithelial cells. Virol J. 2019 Dec 8;16(1):99.
- 69. Ramirez L, Betanzos A, Raya-Sandino A, González-Mariscal L, Del Angel RM. Dengue virus enters and exits epithelial cells through both apical and basolateral surfaces and perturbs the apical junctional complex. Virus Res. 2018 Oct;258:39–49. pmid:30278191
- 70. Ahearn YP, Saredy JJ, Bowers DF. The alphavirus Sindbis Infects Enteroendocrine cells in the Midgut of Aedes aegypti. Viruses. 2020 Aug 4;12(8):848. pmid:32759668
- 71.
Zeng X, Chauhan C, Hou SX. Stem Cells in the Drosophila Digestive System. 2013. p. 63–78.
- 72. Barletta A-BF, Smith JC, Burkart E, Bondarenko S, Sharakhov IV, Criscione F, et al. Mosquito midgut stem cell cellular defense response limits Plasmodium parasite infection. Nat Commun. 2024;15(1):1422. pmid:38365823
- 73. Taracena ML, Bottino-Rojas V, Talyuli OAC, Walter-Nuno AB, Oliveira JHM, Angleró-Rodriguez YI, et al. Regulation of midgut cell proliferation impacts Aedes aegypti susceptibility to dengue virus. PLoS Negl Trop Dis. 2018 May 21;12(5):e0006498. pmid:29782512
- 74. Yang C, Wang F, Huang D, Ma H, Zhao L, Zhang G, et al. Vector competence and immune response of Aedes aegypti for Ebinur Lake virus, a newly classified mosquito-borne orthobunyavirus. PLoS Negl Trop Dis. 2022 Jul 18;16(7):e0010642. pmid:35849620
- 75. Chen T-Y, Bozic J, Mathias D, Smartt CT. Immune-related transcripts, microbiota and vector competence differ in dengue-2 virus-infected geographically distinct Aedes aegypti populations. Parasit Vectors. 2023 May 19;16(1):166. pmid:37208697
- 76. Sexton NR, Bellis ED, Murrieta RA, Spangler MC, Cline PJ, Weger-Lucarelli J, et al. Genome Number and Size Polymorphism in Zika Virus Infectious Units [Internet]. 2021. Available from: http://jvi.asm.org/.
- 77. O’Neal J, Upadhyay A, Wolabaugh A, Patel N, Bosinger S, Suthar M. West Nile Virus-inclusive single-cell RNA sequencing reveals heterogeneity in the type I interferon response within single cells. J Virol. 2019 Mar 5;93(6).
- 78.
Dainat J. AGAT: Another Gff Analysis Toolkit to handle annotations in any GTF/GFF format.
- 79.
Satija R. Seurat - Guided Clustering Tutorial. 2023.
- 80. Patel K. DESeq2 workflow tutorial | Differential Gene Expression Analysis | Bioinformatics. 2022;101. Available from: https://www.youtube.com/watch?v=04gB2owLKus
- 81. Ge S, Jung D, Yao R. ShinyGO: a graphical gene-set enrichment tool for animals and plants. Bioinformatics. 2020;36(8):2628–9.
- 82. Vivian Li W, Li Y. scLink: inferring sparse gene co-expression networks from single-cell expression data. Genom Proteom Bioinform. 2021 Jun;19(3):475–92. pmid:34252628
- 83.
Cantalapiedra CP, Hernandez-Plaza A, Letunic I, Bork P, Huerta-Cepas J. eggNOG-mapper v2: functional annotation, orthology assignments, and domain prediction at the metagenomic scale. 2021.
- 84.
Huerta-Cepas J, Szklarczyk D, Heller D, Hernández-Plaza A, Forslund SK, Cook H. eggNOG 5.0: a hierarchical, functionally and phylogenetically annotated orthology resource based on 5090 organisms and 2502 viruses. 2019.