Rickettsia peacockii, also known as the East Side Agent, is a non-pathogenic obligate intracellular bacterium found as an endosymbiont in Dermacentor andersoni ticks in the western USA and Canada. Its presence in ticks is correlated with reduced prevalence of Rickettsia rickettsii, the agent of Rocky Mountain Spotted Fever. It has been proposed that a virulent SFG rickettsia underwent changes to become the East Side Agent. We determined the genome sequence of R. peacockii and provide a comparison to a closely related virulent R. rickettsii. The presence of 42 chromosomal copies of the ISRpe1 transposon in the genome of R. peacockii is associated with a lack of synteny with the genome of R. rickettsii and numerous deletions via recombination between transposon copies. The plasmid contains a number of genes from distantly related organisms, such as part of the glycosylation island of Pseudomonas aeruginosa. Genes deleted or mutated in R. peacockii which may relate to loss of virulence include those coding for an ankyrin repeat containing protein, DsbA, RickA, protease II, OmpA, ScaI, and a putative phosphoethanolamine transferase. The gene coding for the ankyrin repeat containing protein is especially implicated as it is mutated in R. rickettsii strain Iowa, which has attenuated virulence. Presence of numerous copies of the ISRpe1 transposon, likely acquired by lateral transfer from a Cardinium species, are associated with extensive genomic reorganization and deletions. The deletion and mutation of genes possibly involved in loss of virulence have been identified by this genomic comparison. It also illustrates that the introduction of a transposon into the genome can have varied effects; either correlating with an increase in pathogenicity as in Francisella tularensis or a loss of pathogenicity as in R. peacockii and the recombination enabled by multiple transposon copies can cause significant deletions in some genomes while not in others.
Citation: Felsheim RF, Kurtti TJ, Munderloh UG (2009) Genome Sequence of the Endosymbiont Rickettsia peacockii and Comparison with Virulent Rickettsia rickettsii: Identification of Virulence Factors. PLoS ONE 4(12): e8361. doi:10.1371/journal.pone.0008361
Editor: Niyaz Ahmed, University of Hyderabad, India
Received: October 28, 2009; Accepted: November 20, 2009; Published: December 21, 2009
Copyright: © 2009 Felsheim et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This research was supported by National Institutes of Health (NIH) grant 2RO1 AI49424-06A2 to UGM. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Rickettsia peacockii is an obligate intracellular bacterium identified in Rocky Mountain wood ticks (Dermacentor andersoni) from Montana, USA . It is of interest to rickettsiologists due to its co-localization on the eastern side of the Bitterroot Valley with a much reduced prevalence of D. andersoni infected with Rickettsia rickettsii, while spotted fever ravaged the west side of the valley  . Thus began the study of a phenomenon considered as evidence for interference, where the presence of R. peacockii in D. andersoni ticks may prevent the transovarial transmission of R. rickettsii and therefore limit its spread in the tick population. It is not clear whether this interference is an active process or simply a case in which ticks carrying R. peacockii have a reproductive advantage because they do not suffer the reduced fecundity associated with R. rickettsii infection . Surveys of rickettsiae in tick populations around the western US and Canada have shown that R. peacockii is widespread in Dermacentor ticks and R. rickettsii is relatively rare , . While R. peacockii is closely related to R. rickettsii, it is not a pathogen of mammals and not deleterious to ticks.
The purpose of this work was to determine the sequence of the R. peacockii genome and compare it to the genome of it's nearest pathogen relative, R. rickettsii. Although the R. peacockii genome is similar in size (1.29Mb) to those of other spotted fever group (SFG) rickettsiae and there is high homology between many of their genes, the genome of R. peacockii has several gene deletions and mutations that may account for its lack of pathogenicity. In addition to the genome sequence of the Iowa strain of R. rickettsii  it represents a valuable set of data for comparison with the genomes of related pathogenic rickettsiae. The most dramatic difference between the genomes of R. peacockii and R. rickettsii is the presence of the ISRpe1 transposon and the effects multiple transposon copies have had as a point of homology for recombination, resulting in numerous deletions and genome shuffling (this manuscript). In contrast, genome comparisons of virulent and non-virulent Francisella tularensis showed that transposon mediated recombination and shuffling of gene order occurred in the pathogenic strains rather than the non-pathogenic strain . R. peacockii is also among the growing list of rickettsiae harboring plasmids which are apparently lacking in R. rickettsii.
Results and Discussion
R. peacockii accession numbers, chromosome GenBank:CP001227,plasmid GenBank:CP001228.
The genome sequence of R. peacockii strain Rustic was determined; the size of the circular chromosome (RPR) is 1,288,492 bp and the size of the circular plasmid (pRPR) is 26,406 bp. The gene sequences of R. peacockii were found to be most similar to those of virulent R. rickettsii Sheila Smith (SS) and avirulent R. rickettsii Iowa. The genome of the non-pathogen R. peacockii was compared to the genome of its closest pathogenic neighbor R. rickettsii SS in order to identify differences that may relate to pathogenicity. Presence of laterally transferred DNA in the genome of R. peacockii is the most striking difference between them; including a plasmid, the ISRpe1 transposon and three chromosomal regions of Rickettsia bellii-like DNA containing Tra genes. The locations of chromosomal DNA sequences present in R. peacockii and lacking in R. rickettsii SS are shown in Text S1.
Impact of ISRpe1 Transposons on the R. peacockii Genome
There are 40 copies of the transposon and 2 transposon fragments on the chromosome and 2 copies of the transposon on the plasmid. Most ISRpe1 transposons contain an intact transposase coding sequence (31 of 42) while 11 contain frameshift mutations or internal stop codons. There are no other types of transposases annotated as genes on the chromosome and only three other types of transposon pseudogenes are found on the chromosome, two of which are also found in R. rickettsii and one within a fragment of the tra cluster.
Recombination between the transposons has resulted in a dramatic shuffling of gene order between the R. peacockii and R. rickettsii genomes (Figure 1). A dot plot comparison of the two genomes is shown in Figure S1. By comparing the two genomes using Mauve  we found that an ISRpe1 transposon in R. peacockii co-localized to 31 of 37 junctions between syntenic blocks. Numerous deletions co-localized to copies of the transposon as well, suggesting the deletions occurred during this recombination. To determine the extent of correlation between deletions and the presence of transposons, the backbone file from the Mauve comparison, Artemis  and blastn was used to locate copies of ISRpe1 within 5 bp of the deletion junctions. All deletions over 100 bp in size were examined. There are transposons not associated with gdeletions or changes to synteny, transposons that locate to changes in synteny with or without deletion, and transposons that locate to a point of deletion but do not affect synteny. Deletions of the latter variety likely occurred when two transposons integrated near each other followed by recombination between them and deletion of the intervening DNA. Text S2 shows that for all deletions greater than 100 bp in size, 71.4% of deletions (25 out of 35) are flanked by one or two ISRpe1 transposons. There are also 3 smaller deletions flanked by transposons. It is possible that some of the small deletions resulting in frameshift mutations or split genes are the result of inexact DNA repair following excision of the transposon as it moved to a new location. In contrast to this study, genome comparisons of virulent and non-virulent Francisella tularensis showed that transposon mediated recombination and shuffling of gene order occurred in the pathogenic strains rather than the non-pathogenic strain , also they did not find deletions associated with transposon mediated recombination as seen in R. peacockii.
The alignment of the Rickettsia rickettsii SS and Rickettsia peacockii genomes using progressive Mauve with default parameters shows the lack of synteny between the genomes of these closely related organisms. The breakpoints of the syntenic blocks in R. peacockii are largely associated (31 of 37) with the ISRpe1 transposon, indicated with black arrows. The genome on top is that of R. rickettsii SS (reference genome) and that below is R. peacockii.
The transposon ISRpe1, originally identified in R. peacockii,  is also found twice in the R. massiliae genome along with a gene fragment (E = 0.0; RMA_0538 and RMA_0748) and a frameshifted mutant copy is found on the R. felis plasmid (E = 4e−154; RF_p48). The genomic locations for the copies of ISRpe1 in R. massiliae are different from those found in R. peacockii, indicating the transposition events occurred independently rather than in a common ancestor. Homologs of ISRpe1 are found three times (E = 5e−173; Aasi_0934, Aasi_0956 and Aasi_0884) in the genome of Candidatus Amoebophilus asiaticus, a member of the phylum Cytophaga-Flavobacterium-Bacteroides (CFB) and endosymbiont of Acanthamoeba . The phylum also contains Candidatus Cardinium endosymbionts of arthropods. C. Cardinium spp. are related to C. A. asiaticus and closely related to one another, yet exist in a wide range of arthropod species . This is indicative of horizontal transmission and could put these bacteria in contact with various species of rickettsiae. A phylogenetic tree, Figure 2, shows the close relationship between ISRpe1 and the Candidatus A. asiaticus transposon, but given the presence of this transposon in few rickettsiae, we suggest the transposon was transferred to these few rickettsiae in the recent past rather than from C. A. asiaticus. A possible source of the transposon is a C. Cardinium species from D. andersoni ticks that is similar to an endosymbiont cultured from Ixodes scapularis ticks . To explore this link, genomic DNA from the cultured C. Cardinium spp. was used as template in a PCR reaction with ISRpe1 primers not previously used in the lab and the sequence of the product was determined. The DNA sequence of the 931 bp C. Cardinium PCR product shares 98% identity with the most homologous ISRpe1 copy in R. peacockii. The derived amino acid sequence was added to the phylogenetic tree shown in Figure 2. These results support the hypothesis that the transposon was transferred from a C. Cardinium species to a recent ancestor of R. peacockii.
Neighbor joining (NJ) and maximum parsimony (MP) analyses included 14 taxa. Exclusion of gaps left 309 amino acids for the analyses; 105 amino acids were constant, 36 of the variable amino acids were parsimony uninformative and 168 of the variable amino acids were parsimony informative. Bootstrap analysis involved 2,000 replicates: top number is NJ bootstrap value and bottom number the MP bootstrap value. Genbank references for the proteins found in Text S7.
Tra Gene Cluster of R. peacockii
The three regions of R. bellii-like DNA in the chromosome are located at nucleotides 142335–149261 (one end of the tra cluster with genes TraB, TraE, leucine-rich protein gene and a TraV fragment), 806589–813116 (with degraded genes for a permease, TraA and TraD) and 497519–499744 (other end of the tra cluster with U gene). The phenomenon of lateral transfer of the tra cluster was first observed in the R. massiliae genome . An ISRpe1 transposon is present at four of the six junctions of these three regions and the remaining two junctions are the tRNAVal gene and a chimeric tRNA gene. It was difficult to determine the junction at 149261 (the chimeric tRNA gene) as this area has been lost in R. rickettsii. This junction border was chosen due to blastn comparison with R. conorii that shows the highest homology (98%) upstream from nucleotide 149262 and no homology downstream. This correlates well with events of integration at tRNA genes by integrons such as the tra cluster . The leucine-rich protein gene (RPR_00830) in the TraBE region is not found in R. massiliae and only shows homology (97%) to RBE_0439 of R. bellii which is near TraE, suggesting either strong selection for this gene sequence uniquely in R. peacockii and R. bellii or independent introduction of the tra cluster within the rickettsiae. Also, the permease pseudogene in the TraAD region of R. peacockii is not found in R. massiliae, but found in the tra cluster of R. canadensis (A1E_02610 and A1E_02615). In R. peacockii it appears the tra cluster integration preceded the arrival of the ISRpe1 transposon which then transposed into the tra cluster and split it into the three remaining regions by recombination and presumable deletion of the bulk of the tra cluster. The genome of R. rickettsii SS does not contain this tra cluster but may have a remnant 227 bp fragment located at nucleotides 7243640–724132 near the tRNAVal gene . When this 227 bp region from R. peacockii is compared with other rickettsiae using blastn, R. rickettsii shows 68% identity (E = 6e−14) while R. bellii shows 88% identity (E = 1e−73) and R. massiliae shows 85% identity (E = 4e−67). If the 227 bp region of R. rickettsii shared ancestry with the tra cluster of closely related R. peacockii, one would expect the percent identity to R. rickettsii to be higher than that of R. bellii and R. massiliae.
Features of R. peacockii Plasmid
The 26 kb plasmid of R. peacockii (pRPR) contains 20 putative genes (Table 1), two of which are involved in plasmid maintenance and replication, ParA and DnaA. The ParA gene RPR_p01 and the two neighboring genes RPR_p02 and RPR_p03 are most closely related to RF_p23, RF_p22 and RF_p21 of R. felis and flanked by 56 bp inverted repeats, whereas the R. massiliae plasmid has an unrelated ParA gene. A phylogenetic tree was made comparing rickettsial plasmid borne parA proteins with their closest blast hits (Figure 3). It is interesting that the ParA genes on rickettsial plasmids fall into diverse groups suggesting foreign plasmids have periodically entered rickettsiae. ParA is the likely determinant of compatibility so entrance of a new parA gene enables a second plasmid to be maintained. The C-terminal domain of DnaA is similar to the plasmid-borne DnaA-like proteins of R. massiliae RMA_p01, R. felis RF_p05 and also the smaller version in R. felis RF_p19. The N-terminal domain is similar to the DnaA-like protein of R. monacensis. The plasmid contains five genes, RPR_p06 – RPR_p10, most closely related to orfs B, C, D, F, G found in a region termed the glycosylation island in Pseudomonas aeruginosa, shown to be involved (orfs A, N and E) in flagellar glycosylation . The gene order is mostly maintained in this gene cluster on pRPR with only orf E deleted and orfB flipped between the repeats shown in the annotation. The GC content of this region is 48.4% vs. 34.7% for the entire plasmid and 32.6% for the R. peacockii chromosome, another indication beyond the homology for lateral transfer. The function of these five genes in R. peacockii is unknown but by homology they appear to be involved in phospholipid biosynthesis and may be maintained to increase the flow of glycerol-3-phosphate into the phospholipid biosynthesis pathway given that R. peacockii has a frameshift mutation in the glycerol-3-phosphate dehydrogenase gene (Table 2, location 96554..97530). This gene is necessary to make glycerol-3-phosphate from dihydroxyacetone phosphate (DHAP) in rickettsiae. Due to the absence of the glycolytic pathway, rickettsiae are unable to synthesize DHAP from fructose-1,6-diphosphate and must import it from the host cell . Rickettsiae commonly have a glycerol-3-phosphate transporter as well as a DHAP transporter so can obtain glycerol-3-phosphate from the host cell directly or indirectly, while only R. peacockii has a mutant copy of the glycerol-3-phosphate dehydrogenase gene, so is dependent on import of glycerol-3-phosphate alone. This mutation in R. peacockii may limit the amount of glycerol-3-phosphate available for phospholipid biosynthesis and the presence of these 5 genes on the plasmid may alleviate this problem.
Exclusion of gaps left 166 amino acids for the analysis; 8 amino acids were constant, 2 of the variable amino acids were parsimony uninformative and 156 of the variable amino acids were parsimony informative. Bootstrap analysis involved 1,000 replicates: numbers at selected branches are the NJ (MP/NJ) bootstrap values that were ≥50%. The top two or three blastP hits to rickettsial plasmid parA's with E = >1e−30 were chosen for the analysis. The parA from the Trichoplax adhaerens genome project is likely from a bacterium associated with this simplest of eukaryotes as several contigs have homology to Rickettsiales. The parA proteins from the rickettsial endosymbiont of Ixodes scapularis (REIS) were added to the analysis following PCR and sequencing to confirm the presence of the genes in our REIS isolate (Baldridge et. al., in preparation). Maximum parsimony and neighbor joining (not shown) phylograms were congruent. Genbank references for the proteins found in Text S7.
Also found on the plasmid are two small heat shock genes, one (RPR_p13) appears to be a common feature present on rickettsial plasmids . RPR_p13 homologs are not represented on the chromosomes of other rickettsiae except R. felis (RF_1004) but this is an unusual case in that the other R. felis chromosomal copy (RF_1005) has a frameshift mutation and RF_1004 may have arisen via recombination with the plasmid copy. RPR_p12 is more similar to small heat shock proteins found on the chromosome of all rickettsiae but phylogenetic analysis shows this plasmid copy falls into a third group of rickettsial small heat shock proteins (Text S3). A comparison of the three small heat shock proteins of R. peacockii (including chromosomal copy RPR_2300) using Kyte-Doolittle plots shows the degree of N-terminal hydrophobicity varies from high to low between the three proteins with the two plasmid copies having the greatest difference (Text S4). In yeast the strength of this N-terminal hydrophobicity determines the strength of interaction of these chaperones with their target proteins , . A family of these chaperones with a range of strengths of interaction may well help rickettsiae survive changing environmental temperatures during their life cycles in arthropods. It is also possible that individual target proteins or membranes benefit from a specialized chaperone. The plasmid contains genes for two transporters; RPR_p11 is an ABC type with ATPase and permease domains with strong homology (E = 0.0) to Aasi_0982 from Candidatus A. asiaticus and no homology to known rickettsial genes. The second transporter RPR_p17 is an SMR-type multi-drug efflux transporter and the only other rickettsia to have a homolog of RPR_p17 is R. bellii, while next closest relatives are in CFB group of bacteria. Other genes on the plasmid code for various transposases, a putative lipoprotein, a TPR repeat-containing protein and an apparent chimeric protein. All the plasmid genes with rickettsial chromosomal homologs have far lower homology to those of R. peacockii or R. rickettsii and higher homology to other more distantly related rickettsiae, which is an indication of horizontal gene transfer to the plasmid, as seen in pRF of R. felis . The exceptions are the two ISRpe1 transposon copies on the plasmid.
Deletions and Mutations in R. peacockii vs. R. rickettsii Strain Sheila Smith
Deletions in R. peacockii vs R. rickettsii SS are shown in Table 3. Deletions greater than 100 bases were examined as well as smaller deletions that disrupted genes or were within 5 bases of ISRpe1 transposons. Nonsense mutations resulting from premature stop codons and small deletions or insertions causing frameshifts in R. peacockii vs R. rickettsii SS are shown in Table 2. It appears that some deletions and mutations in R. peacockii may be responsible for its lack of pathogenicity and are focused upon in this section. Possible candidate genes include those coding for an ankyrin repeat containing protein, DsbA, RickA, Protease II, OmpA, Sca1, and a putative phosphoethanolamine transferase. The deletion located at SS coordinates 869412..871928 (Table 3) was likely deleted in R. peacockii during recombination between ISRpe1 transposons and contains a gene coding for one of the two larger ankyrin repeat containing proteins in R. rickettsii SS (A1G_05165). Ankyrin repeat proteins have been shown to be effector proteins or virulence factors in several pathogens . In another member of the order Rickettsiales, AnkA is rapidly translocated to the host cell and phosphorylated by host cell kinases upon Anaplasma phagocytophilum binding to the host cell . AnkA also binds DNA and alters transcription of defense related genes in HL-60 cells infected with A. phagocytophilum or transfected with an AnkA expression plasmid . Transcript levels for ankA (APH_0740) were shown to be 2.3 to 3 fold higher in A. phagocytophilum grown in mammalian cells versus tick cells  and in R. rickettsii transcript levels of A1G_05165 were 3.2 fold higher in mammalian cells versus tick cells as well as being one of few differentially expressed genes detected . Moreover, this gene that is deleted in R. peacockii is mutated in R. rickettsii Iowa, which has attenuated virulence compared to R. rickettsii SS . The deletion in the Iowa gene (RrIowa_1113) removes three of the four ankyrin repeats from the protein. Strengthening the case for this as a virulence factor is the observation that this gene has been deleted in Rickettsia monacensis (R. Felsheim, unpublished) and is not found in R. bellii, both non-pathogenic for humans. Recently the putative genome sequence from the non-pathogenic rickettsial endosymbiont of Ixodes scapularis (REIS) has been released and this ank gene also appears to have been deleted in a similar manner to that seen in R. monacensis. A remnant of the gene found at nucleotides 53520–53589 of REIS contig ACLC01000066.1 are homologous to the 3′ end of the ank gene. There is about a 3.7 kb deletion in this region of the REIS genome compared to R. rickettsii. All of the rickettsial pathogens for which genome sequence is available have this gene, except Rickettsia akari.
While most bacteria have a single DsbA gene, R. rickettsii SS and other rickettsiae have two DsbA genes, one (A1G_03355) is deleted in R. peacockii due to recombination between two ISRpe1 transposons (Table 3, location 588183..589234). DsbA codes for a protein-disulfide oxidoreductase which catalyzes disulfide-bond formation in the periplasm during the folding of secreted proteins. Both rickettsial DsbA proteins are predicted to be anchored into the membrane, one via a transmembrane domain and one as a lipoprotein. In pathogenic Vibrio cholerae, the DsbA homolog (TcpG) is responsible for the folding, maturation and secretion of virulence factors . The importance of DsbA for virulence has been demonstrated in a variety of organisms , , , , , , . Compensation for this deletion by the other copy of DsbA is likely for some functions, but in Neisseria meningitides which has three DsbA homologs, they vary in functional activity in complementation assays .
RickA was previously shown to be truncated by the ISRpe1 transposon in R. peacockii and interaction of R. peacockii with actin was found to be lacking . We show that most of the RickA gene has been deleted along with the neighboring succinyl-CoA:3-ketoacid-coenzyme A transferase genes during recombination between transposons (Table 3, location 841975..846686). Actin based motility is thought to mediate intracellular and cell-to-cell movement of rickettsiae. Time-lapse photography of our GFP expressing R. peacockii shows them to be non-motile compared to other rickettsiae observed (unpublished data). The protease II gene in R. peacockii (Table 3, location 381886..381961) has a 74 bp deletion causing a frameshift in the middle of the gene. Proteases have been shown to be important for binding and entry in a wide variety of organisms and protease II is an S9A type protease, the type shown to be necessary for entry of Trypanosoma cruzi into host cells . The three frameshift mutations in the OmpA gene of R. peacockii have previously been discussed . In the avirulent strain R. rickettsia Iowa, OmpA is also truncated as a result of a frameshift mutation  that differs from those in the R. peacockii OmpA gene. Sca1, which is in the same superfamily of autotransported surface proteins as OmpA, is deleted in R. peacockii via transposon insertion upstream of the Sca1 gene and also near the stop codon (Table 3, location 19653..25701), followed by recombination and deletion of a 6kb DNA fragment containing the Sca1 gene. Sca1 is present in all other Rickettsia spp. and the type of selection pressure on the N-terminal passenger domain implicates the N-terminal domain of this autotransported protein in interactions with the host cell . Outer membrane proteins implicated in binding of rickettsiae to host cells, include OmpA , OmpB , RP828, RC1281, and the autotransporter domain of OmpB . Of these only OmpA is defective in R. peacockii. It is possible that rickettsiae use different members of this Omp family to bind to different host cell types or cells of different species.
Since the N-terminal domain of OmpA-B family members is likely extended from the membrane through the slime layer, it is possible that proper configuration of the surface lipooligosaccharides is important for proper arrangement of these surface proteins. While the slime layer is characteristic of the SFG rickettsiae, in R. peacockii the slime layer is thin and not always discernable . The increase in the thickness of the slime layer in R. rickettsii upon tick feeding correlates with the restoration of virulence . Genes for a sugar reductase and sugar epimerases including CapD, predicted to be involved in slime layer biosynthesis  are located very close to a mutant putative phosphoethanolamine transferase gene in R. peacockii. The putative phosphoethanolamine transferase gene is found between nucleotides 1176591..1178159 (Table 2) with two frameshift mutations close together that introduce a stop codon truncating the transferase domain. This enzyme is required for the correct structure of surface lipooligosaccharides of Neisseria meningitides and mutation of phosphoethanolamine transferase decreases bacterial binding to endothelial cells 10 fold , . Our experience with R. peacockii in culture is that it binds very poorly to host cells and extracellular R. peacockii are observed more abundantly in vitro than other rickettsial species maintained in our laboratory . The protein sequences of phosphoethanolamine transferases are not well conserved among bacteria except around the transferase domain, which is where significant homology exists to this rickettsial protein. This putative phosphoethanolamine transferase shares the same 5-transmembrane structure with others as well (Text S5). The locus tag in R. rickettsii SS is A1G_02570 and closely related genes are found in all other Rickettsia spp.
Another deletion via recombination between transposons removes A1G_04605 (YhbC) and A1G_04620 (transcriptional regulator of the RirA / Rrf-2 superfamily). YhbC was picked up in a mutant screen for virulence factors of Salmonella enteritidis due to its effect on the growth rate of the bacteria, making the mutant a potential live vaccine candidate . The growth rate of R. peacockii in culture is slower than other rickettsiae grown in our lab. Transcription factors of the rirA type are repressors containing an iron-sulfur cluster, and thus can sense iron concentrations as well as nitric oxide which dissociates the cluster and alters DNA binding. The lack of iron or presence of nitric oxide leads to derepression of genes regulated by rirA, so lack of rirA protein results in an increase in expression of these regulated genes ,  . In the rirA mutant Sinorhizobium meliloti a toxic amount of iron builds up and leads to a hypersensitivity to H2O2 . R. peacockii are not found in hemocytes while R. rickettsii are commonly found in hemocytes and this deletion of rirA may contribute to this observation, given that it has been shown that reactive oxygen species are produced in cattle tick hemocytes  and presumably in other tick species as well.
R. peacockii also has other deletions and mutations, notably genes that are conserved in other rickettsiae like the methyltransferase A1G_03950 (Table 2) and hypothetical gene A1G_03530 (Table 3). Nonsense mutations in R. rickettsii SS vs R. peacockii are shown in Text S6. DNA sequence found in R. peacockii and not in R. rickettsii SS is shown in Text S1 and includes mainly the ISRpe1 transposons, the three fragments of the tra cluster, the 10.5 kb fragment present in R. rickettsii Iowa vs. R. rickettsii SS  and a tandem gene duplication of A1G_02330 (RPR_04375 and RPR_04376).
Our results support the speculation that in the past, a virulent SFG rickettsia underwent changes to become the East Side Agent (Rickettsia peacockii) . Gene reduction in rickettsiae and some other bacteria correlates with an increase in virulence , ,  but our analysis of gene loss in R. peacockii suggests that transposon mediated gene reduction is responsible for avirulence in this case.
R. peacockii has a dynamic genome that has been and is likely still being shaped by ISRpe1 activity. The changes have resulted in a dramatic lack of synteny with R. rickettsii and likely contributed to rendering it non-pathogenic for vertebrates, restricting it to the tick host. R. peacockii has lost several genes that appear important in the transmission of pathogenic rickettsiae to a vertebrate host. At the same time it has retained a gene repertoire that enables it to survive and grow in the tick and to be transmitted transovarially to the tick's progeny. The extensive remodeling of the genome makes reversion to pathogenicity unlikely unless new virulence genes are imported. Ticks encounter and interact with many bacteria during their life cycle, some of which can invade the ovaries and cohabit the same cell (e.g. the Francisella-like D. andersoni symbiont and C. Cardinium spp.). We propose that symbionts such as R. peacockii could conceivably acquire novel genes via lateral gene transfer through their interactions with a range of bacteria including pathogens acquired by the tick during its blood meal . Ticks have mechanisms for excluding foreign bacteria during the internalization of the blood meal but feeding on a heavily infected mammal may provide a challenge to this system. The acquisition of DNA from P. aeruginosa onto the R. peacockii plasmid may relate to the fact that ticks absorb cells and large macromolecules intracellularly for digestion  and P. aeruginosa is known to secrete large amounts of genomic DNA . R. peacockii and its acquisition of mobile DNA is a good example of the ‘intracellular arena’ hypothesis at work, in that obligate intracellular bacteria more readily share genetic material if they cohabit the same cells . Obligate intracellular bacteria like rickettsiae that live in arthropods which feed on mammals also increase their rate of exposure to novel gene pools . We see the plasmid as the only recent recipient of foreign DNA, other than the ISRpe1 transposon, in the genome of R. peacockii.
Materials and Methods
Rickettsia peacockii Rustic  was grown in Ixodes scapularis cell line ISE6 ,  for eight in vitro passages. Genomic DNA was prepared from rickettsiae released from infected cells by forcing suspended cells five times through a 25 G needle attached to a 5 ml syringe. The resulting lysate was centrifuged at 270 rcf for 5 min to remove whole cells and the supernatant filtered through a 1.2 µm syringe filter (Whatman Puradisc FP30; Sigma-Aldrich St. Louis, MO). Rickettsiae were recovered from the filtrate by centrifugation (18,400 rcf, 5 min 4°C), resuspended in Dulbecco's Phosphate Buffered Saline (PBS) containing calcium and magnesium (Mediatech, Inc. Herndon, VA) and DNase I (15 µg/ml; from bovine pancreas Type II-S, Sigma-Aldrich), and incubated at room temperature for 30 min. After DNase I treatment to remove contaminating Ixodes DNA rickettsiae were centrifuged again (18,400 rcf, 5 min, 4°C) and genomic DNA was prepared using the Puregene kit (Gentra Systems, Minneapolis, MN) following the protocol for Gram negative bacteria. The C. Cardinium spp. isolate  was grown and DNA isolated in the same manner as above.
DNA was pyrosequenced on a 454FLX machine (454-Roche, Branford, CT) (226,040 reads, >30X coverage) at the BioMedical Genomics Center, U of M, St. Paul, MN and assembled using Newbler (454-Roche) requiring 99% homology; 56 contigs 500 bases or larger were obtained. To determine if transposons occupied the gaps, 50–100 basepairs from each end of the ISRpe1 transposon were used to recover 454 traces using Blastn and assembled using Sequencher (Gene Codes, Ann Arbor, MI) requiring 100% homology. These contigs were then assembled onto the ends of the original 454 generated contigs. The ISRpe1 transposons are too similar to one another to be assembled from 454 traces and mapping the contigs to the R. rickettsii genome followed by PCR across the gaps yielded artifactual results, again due to the similarity between individual transposons and their interaction during PCR. Therefore, contigs were extended using GenomeWalker (Clontech, Mountain View, CA) ligation mediated PCR using a single gene specific primer for each contig end. (nested PCR was unnecessary). Four GenomeWalker libraries were made using EcoRV, HaeIII, PvuII and HpaI which do not cut in the transposon. The last gap could not be filled this way, nor with standard PCR and it appeared that two transposons were present here. All contigs from the 454 assembly were then assembled onto the linear genome contig, and with the exception of contigs from Ixodes scapularis mitochondrial and genome sequence, only one remained unassembled. This contig contained the junction of a transposon and a transposon fragment. This junction sequence was used to retrieve 454 traces using blastn that were then assembled using Sequencher, requiring 100% homology. Primers were designed to bind to unique sequence within this junction for use in PCR with gene specific primers from the ends of the large contig. A primer was designed from a region of the transposon not found in the transposon fragment to confirm the sequence at the junction. The GenomeWalker PCR products were sequenced with transposon specific primers. AccuTaq DNA polymerase (Sigma-Aldrich) was used throughout. The coverage layout along the contigs was calculated with 454 de novo assembler software (version 2.0.00.20) using the derived file 454AlignmentInfo.tsv in 100 nucleotide scale and visually scanned for anomalies. One contig had twice the normal number of traces per unit contig length and this region was investigated with PCR and was found to contain a gene duplication (RPR_04375 and RPR_04376) that was originally assembled into one gene. The results are a circular chromosome of 1,288,492 bp and a circular plasmid of 26,406 bp.
Annotation and Analysis
The genome was annotated using PGAAP at NCBI (http://www.ncbi.nlm.nih.gov/genomes/static/Pipeline.html). Apparent frameshifts were examined manually by recovering 454 traces from each area using Blastn and assembling them using Sequencher requiring 90% homology to determine the validity of the sequence. In ten of the areas PCR and sequencing was carried out to validate the sequence. The sequence was subjected to manual annotation by viewing the .gbf file using Artemis  and editing of the .sqn file. Gene fragments from apparent gene reduction auto-annotated as orfs by PGAAP were extended by blast analysis and re-annotated as misc_features (152) or removed.
Artemis Comparison Tool (ACT) (The Sanger Institute, Cambridge, UK) was used to compare the R. peacockii genome to that of R. rickettsii SS. Unique DNA sequence of each was extracted as a text file and examined using blast analysis. The level of synteny (or lack thereof) between the two genomes was examined by using Mauve  (Figure 1), ACT and a dotplot comparison (Figure S1). Mauve and Artemis were used to determine where the presence of a transposon coincided with a change in synteny between R. peacockii and R. rickettsii SS. Blastn and the backbone file from Mauve were used to examine all deletions in R. peacockii over 100 bp in size, to find which deletion junctions in R. peacockii were found within 5 bp of an ISRpe1 transposon (Text S2).
A dot plot comparison of the R. peacockii and R. rickettsii genomes.
(0.01 MB PNG)
DNA sequence found in R. peacockii and not in R. rickettsii SS.
(0.07 MB DOC)
Deletions in R. peacockii and their association with ISRpe1 transposons.
(0.26 MB DOC)
Phylogenetic analysis of the rickettsial small hsp proteins.
(0.15 MB DOC)
Kyte-Doolittle plots of hydrophobicity for the three small hsp/chaperone proteins from R. peacockii.
(0.04 MB DOC)
Support for R. rickettsii A1G_02570 as phosphoethanolamine transferase.
(0.05 MB DOC)
Nonsense mutations in R. rickettsii Sheila Smith relative to R. peacockii.
(0.04 MB DOC)
The authors would like to thank Zheng Jin Tu at the University of Minnesota Supercomputing Institute for help with UNIX.
Conceived and designed the experiments: RFF TJK UGM. Performed the experiments: RFF TJK. Analyzed the data: RFF. Wrote the paper: RFF TJK UGM.
- 1. Niebylski ML, Schrumpf ME, Burgdorfer W, Fischer ER, Gage KL, et al. (1997) Rickettsia peacockii sp. nov., a new species infecting wood ticks, Dermacentor andersoni, in western Montana. Int J Syst Bacteriol 47: 446–452.
- 2. Burgdorfer W, Hayes SF, Mavros AJ (1981) Nonpathogenic rickettsiae in Dermacentor andersoni: a limiting factor for the distribution of Rickettsia rickettsii. In: Burgdorfer W, Anacker RL, editors. Rickettsiae and rickettsial diseases. New York, N.Y.: Academic Press. pp. 585–594.
- 3. Philip RN, Casper EA (1981) Serotypes of spotted fever group rickettsiae isolated from Dermacentor andersoni (Stiles) ticks in western Montana. Am J Trop Med Hyg 30: 230–238.
- 4. Niebylski ML, Peacock MG, Schwan TG (1999) Lethal effect of Rickettsia rickettsii on its tick vector (Dermacentor andersoni). Appl Environ Microbiol 65: 773–778.
- 5. Dergousoff SJ, Gajadhar AJ, Chilton NB (2009) Prevalence of Rickettsia in Canadian populations of the ticks Dermacentor andersoni and D. variabilis. Appl Environ Microbiol.
- 6. Ellison DW, Clark TR, Sturdevant DE, Virtaneva K, Porcella SF, et al. (2008) Genomic comparison of virulent Rickettsia rickettsii Sheila Smith and avirulent Rickettsia rickettsii Iowa. Infect Immun 76: 542–550.
- 7. Rohmer L, Fong C, Abmayr S, Wasnick M, Larson Freeman TJ, et al. (2007) Comparison of Francisella tularensis genomes reveals evolutionary events associated with the emergence of human pathogenic strains. Genome Biol 8: R102.
- 8. Darling AC, Mau B, Blattner FR, Perna NT (2004) Mauve: multiple alignment of conserved genomic sequence with rearrangements. Genome Res 14: 1394–1403.
- 9. Rutherford K, Parkhill J, Crook J, Horsnell T, Rice P, et al. (2000) Artemis: sequence visualization and annotation. Bioinformatics 16: 944–945.
- 10. Simser JA, Rahman MS, Dreher-Lesnick SM, Azad AF (2005) A novel and naturally occurring transposon, ISRpe1 in the Rickettsia peacockii genome disrupting the rickA gene involved in actin-based motility. Mol Microbiol 58: 71–79.
- 11. Horn M, Harzenetter MD, Linner T, Schmid EN, Muller KD, et al. (2001) Members of the Cytophaga-Flavobacterium-Bacteroides phylum as intracellular bacteria of acanthamoebae: proposal of ‘Candidatus Amoebophilus asiaticus’. Environ Microbiol 3: 440–449.
- 12. Zchori-Fein E, Perlman SJ (2004) Distribution of the bacterial symbiont Cardinium in arthropods. Mol Ecol 13: 2009–2016.
- 13. Kurtti TJ, Munderloh UG, Andreadis TG, Magnarelli LA, Mather TN (1996) Tick cell culture isolation of an intracellular prokaryote from the tick Ixodes scapularis. J Invertebr Pathol 67: 318–321.
- 14. Blanc G, Ogata H, Robert C, Audic S, Claverie JM, et al. (2007) Lateral gene transfer between obligate intracellular bacteria: evidence from the Rickettsia massiliae genome. Genome Res 17: 1657–1664.
- 15. Williams KP (2002) Integration sites for genetic elements in prokaryotic tRNA and tmRNA genes: sublocation preference of integrase subfamilies. Nucleic Acids Res 30: 866–875.
- 16. Arora SK, Bangera M, Lory S, Ramphal R (2001) A genomic island in Pseudomonas aeruginosa carries the determinants of flagellin glycosylation. Proc Natl Acad Sci U S A 98: 9342–9347.
- 17. Fuxelius HH, Darby A, Min CK, Cho NH, Andersson SG (2007) The genomic and metabolic diversity of Rickettsia. Res Microbiol 158: 745–753.
- 18. Baldridge GD, Burkhardt NY, Felsheim RF, Kurtti TJ, Munderloh UG (2008) Plasmids of the pRM/pRF family occur in diverse Rickettsia species. Appl Environ Microbiol 74: 645–652.
- 19. Haslbeck M, Ignatiou A, Saibil H, Helmich S, Frenzl E, et al. (2004) A domain in the N-terminal part of Hsp26 is essential for chaperone function and oligomerization. J Mol Biol 343: 445–455.
- 20. Stromer T, Fischer E, Richter K, Haslbeck M, Buchner J (2004) Analysis of the regulation of the molecular chaperone Hsp26 by temperature-induced dissociation: the N-terminal domail is important for oligomer assembly and the binding of unfolding proteins. J Biol Chem 279: 11222–11228.
- 21. Gillespie JJ, Beier MS, Rahman MS, Ammerman NC, Shallom JM, et al. (2007) Plasmids and rickettsial evolution: insight from Rickettsia felis. PLoS ONE 2: e266.
- 22. Pan X, Luhrmann A, Satoh A, Laskowski-Arce MA, Roy CR (2008) Ankyrin repeat proteins comprise a diverse family of bacterial type IV effectors. Science 320: 1651–1654.
- 23. IJdo JW, Carlson AC, Kennedy EL (2007) Anaplasma phagocytophilum AnkA is tyrosine-phosphorylated at EPIYA motifs and recruits SHP-1 during early infection. Cell Microbiol 9: 1284–1296.
- 24. Garcia-Garcia JC, Rennoll-Bankert KE, Pelly S, Milstone AM, Dumler JS (2009) Silencing of Host Cell CYBB Gene Expression by the Nuclear Effector AnkA of the Intracellular Pathogen Anaplasma phagocytophilum. Infect Immun.
- 25. Nelson CM, Herron MJ, Felsheim RF, Schloeder BR, Grindle SM, et al. (2008) Whole genome transcription profiling of Anaplasma phagocytophilum in human and tick host cells by tiling array analysis. BMC Genomics 9: 364.
- 26. Ellison DW, Clark TR, Sturdevant DE, Virtaneva K, Hackstadt T (2009) Limited transcriptional responses of Rickettsia rickettsii exposed to environmental stimuli. PLoS One 4: e5612.
- 27. Peek JA, Taylor RK (1992) Characterization of a periplasmic thiol:disulfide interchange protein required for the functional maturation of secreted virulence factors of Vibrio cholerae. Proc Natl Acad Sci U S A 89: 6210–6214.
- 28. Coulthurst SJ, Lilley KS, Hedley PE, Liu H, Toth IK, et al. (2008) DsbA plays a critical and multifaceted role in the production of secreted virulence factors by the phytopathogen Erwinia carotovora subsp. atroseptica. J Biol Chem 283: 23739–23753.
- 29. Ha UH, Wang Y, Jin S (2003) DsbA of Pseudomonas aeruginosa is essential for multiple virulence factors. Infect Immun 71: 1590–1595.
- 30. Lee Y, Kim Y, Yeom S, Kim S, Park S, et al. (2008) The role of disulfide bond isomerase A (DsbA) of Escherichia coli O157:H7 in biofilm formation and virulence. FEMS Microbiol Lett 278: 213–222.
- 31. Rosadini CV, Wong SM, Akerley BJ (2008) The periplasmic disulfide oxidoreductase DsbA contributes to Haemophilus influenzae pathogenesis. Infect Immun 76: 1498–1508.
- 32. Yu J (1998) Inactivation of DsbA, but not DsbC and DsbD, affects the intracellular survival and virulence of Shigella flexneri. Infect Immun 66: 3909–3917.
- 33. Yu J, Kroll JS (1999) DsbA: a protein-folding catalyst contributing to bacterial virulence. Microbes Infect 1: 1221–1228.
- 34. Yu J, Oragui EE, Stephens A, Kroll JS, Venkatesan MM (2001) Inactivation of DsbA alters the behaviour of Shigella flexneri towards murine and human-derived macrophage-like cells. FEMS Microbiol Lett 204: 81–88.
- 35. Sinha S, Langford PR, Kroll JS (2004) Functional diversity of three different DsbA proteins from Neisseria meningitidis. Microbiology 150: 2993–3000.
- 36. Bastos IM, Grellier P, Martins NF, Cadavid-Restrepo G, de Souza-Ault MR, et al. (2005) Molecular, functional and structural properties of the prolyl oligopeptidase of Trypanosoma cruzi (POP Tc80), which is required for parasite entry into mammalian cells. Biochem J 388: 29–38.
- 37. Baldridge GD, Burkhardt NY, Simser JA, Kurtti TJ, Munderloh UG (2004) Sequence and expression analysis of the ompA gene of Rickettsia peacockii, an endosymbiont of the Rocky Mountain wood tick, Dermacentor andersoni. Appl Environ Microbiol 70: 6628–6636.
- 38. Ngwamidiba M, Blanc G, Raoult D, Fournier PE (2006) Sca1, a previously undescribed paralog from autotransporter protein-encoding genes in Rickettsia species. BMC Microbiol 6: 12.
- 39. Li H, Walker DH (1998) rOmpA is a critical protein for the adhesion of Rickettsia rickettsii to host cells. Microb Pathog 24: 289–298.
- 40. Uchiyama T, Kawano H, Kusuhara Y (2006) The major outer membrane protein rOmpB of spotted fever group rickettsiae functions in the rickettsial adherence to and invasion of Vero cells. Microbes Infect 8: 801–809.
- 41. Renesto P, Samson L, Ogata H, Azza S, Fourquet P, et al. (2006) Identification of two putative rickettsial adhesins by proteomic analysis. Res Microbiol 157: 605–612.
- 42. Simser JA, Palmer AT, Munderloh UG, Kurtti TJ (2001) Isolation of a spotted fever group Rickettsia, Rickettsia peacockii, in a Rocky Mountain wood tick, Dermacentor andersoni, cell line. Appl Environ Microbiol 67: 546–552.
- 43. Hayes SF, Burgdorfer W (1982) Reactivation of Rickettsia rickettsii in Dermacentor andersoni ticks: an ultrastructural analysis. Infect Immun 37: 779–785.
- 44. Santhanagopalan V, Coker C, Radulovic S (2006) Characterization of RP 333, a gene encoding CapD of Rickettsia prowazekii with UDP-glucose 4-epimerase activity. Gene 369: 119–125.
- 45. Takahashi H, Carlson RW, Muszynski A, Choudhury B, Kim KS, et al. (2008) Modification of lipooligosaccharide with phosphoethanolamine by LptA in Neisseria meningitidis enhances meningococcal adhesion to human endothelial and epithelial cells. Infect Immun 76: 5777–5789.
- 46. Cox AD, Wright JC, Li J, Hood DW, Moxon ER, et al. (2003) Phosphorylation of the lipid A region of meningococcal lipopolysaccharide: identification of a family of transferases that add phosphoethanolamine to lipopolysaccharide. J Bacteriol 185: 3270–3277.
- 47. Kurtti TJ, Simser JA, Baldridge GD, Palmer AT, Munderloh UG (2005) Factors influencing in vitro infectivity and growth of Rickettsia peacockii (Rickettsiales: Rickettsiaceae), an endosymbiont of the Rocky Mountain wood tick, Dermacentor andersoni (Acari, Ixodidae). J Invertebr Pathol 90: 177–186.
- 48. Chang J, Pang E, He H, Kwang J (2008) Identification of novel attenuated Salmonella Enteritidis mutants. FEMS Immunol Med Microbiol 53: 26–34.
- 49. Heier K, Thomson MJ, Aziz N, Moir JW (2008) The nitric oxide (NO)-sensing repressor NsrR of Neisseria meningitidis has a compact regulon of genes involved in NO synthesis and detoxification. J Bacteriol 190: 2488–2495.
- 50. Isabella VM, Lapek JD Jr, Kennedy EM, Clark VL (2009) Functional analysis of NsrR, a nitric oxide-sensing Rrf2 repressor in Neisseria gonorrhoeae. Mol Microbiol 71: 227–239.
- 51. Ngok-Ngam P, Ruangkiattikul N, Mahavihakanont A, Virgem SS, Sukchawalit R, et al. (2009) Roles of Agrobacterium tumefaciens RirA in iron regulation, oxidative stress response and virulence. J Bacteriol.
- 52. Chao TC, Buhrmester J, Hansmeier N, Puhler A, Weidner S (2005) Role of the regulatory gene rirA in the transcriptional response of Sinorhizobium meliloti to iron limitation. Appl Environ Microbiol 71: 5969–5982.
- 53. Pereira LS, Oliveira PL, Barja-Fidalgo C, Daffre S (2001) Production of reactive oxygen species by hemocytes from the cattle tick Boophilus microplus. Exp Parasitol 99: 66–72.
- 54. Pallen MJ, Wren BW (2007) Bacterial pathogenomics. Nature 449: 835–842.
- 55. Darby AC, Cho NH, Fuxelius HH, Westberg J, Andersson SG (2007) Intracellular pathogens go extreme: genome evolution in the Rickettsiales. Trends Genet 23: 511–520.
- 56. Fournier PE, El Karkouri K, Leroy Q, Robert C, Giumelli B, et al. (2009) Analysis of the Rickettsia africae genome reveals that virulence acquisition in Rickettsia species may be explained by genome reduction. BMC Genomics 10: 166.
- 57. Munderloh UG, Jauron SD, Kurtti TJ (2005) The Tick: a Different Kind of Host for Human Pathogens. In: Goodman JL, Dennis DT, Sonenshine DE, editors. Tick-Borne Diseases of Humans. Washington, D.C.: ASM Press. pp. 37–64.
- 58. Allesen-Holm M, Barken KB, Yang L, Klausen M, Webb JS, et al. (2006) A characterization of DNA release in Pseudomonas aeruginosa cultures and biofilms. Mol Microbiol 59: 1114–1128.
- 59. Bordenstein SR, Wernegreen JJ (2004) Bacteriophage flux in endosymbionts (Wolbachia): infection frequency, lateral transfer, and recombination rates. Mol Biol Evol 21: 1981–1991.
- 60. Bordenstein SR, Reznikoff WS (2005) Mobile DNA in obligate intracellular bacteria. Nat Rev Microbiol 3: 688–699.
- 61. Munderloh UG, Jauron SD, Fingerle V, Leitritz L, Hayes SF, et al. (1999) Invasion and intracellular development of the human granulocytic ehrlichiosis agent in tick cell culture. J Clin Microbiol 37: 2518–2524.