Browse Subject Areas

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Molecular Characterization of msp2/p44 of Anaplasma phagocytophilum Isolated from Infected Patients and Haemaphysalis longicornis in Laizhou Bay, Shandong Province, China

  • Yong Wang,

    Affiliations College of Animal Science & Technology, Shihezi University, Shihezi, Xinjiang Province, People’s Republic of China, Department of Rickettsiology, National Institute for Communicable Disease Control and Prevention, China CDC, Beijing, People’s Republic of China

  • Chuangfu Chen , (LJZ); (CFC)

    Affiliation College of Animal Science & Technology, Shihezi University, Shihezi, Xinjiang Province, People’s Republic of China

  • Lijuan Zhang (LJZ); (CFC)

    Affiliation Department of Rickettsiology, National Institute for Communicable Disease Control and Prevention, China CDC, Beijing, People’s Republic of China

Molecular Characterization of msp2/p44 of Anaplasma phagocytophilum Isolated from Infected Patients and Haemaphysalis longicornis in Laizhou Bay, Shandong Province, China

  • Yong Wang, 
  • Chuangfu Chen, 
  • Lijuan Zhang


Molecular characterization of the MSP2/P44 protein of Anaplasma phagocytophilum may determine not only if the bacterium is capable of invading hosts but also whether it generates antigenic variation for the purpose of escaping the host immune response, resulting in various pathologic injuries and serious clinical outcomes. Chinese anaplasmosis patients usually present with serious manifestations, and the fatality rate is as high as 26.5%. In this study, we amplified, cloned and sequenced the msp2/p44 genes of three Chinese A. phagocytophilum isolates from Laizhou Bay, Shandong Province, where human granulocytic anaplasmosis (HGA) patients present severe clinical manifestations, and analyzed their genetic characterization and structural features. We also compared them with the HZ and Webster A. phagocytophilum strains. The sequences for both strains are available in GenBank. Analyses indicated that Chinese A. phagocytophilum isolates were significantly different from the HZ and Webster strains in terms of nucleotide sequences, amino acid sequences and protein secondary and tertiary structures. Moreover, the number of immunologic B-cell epitopes (19) of the MSP2 protein of the Chinese isolates was higher than that of the A. phagocytophilum strains HZ (16) and Webster (9). This genetic diversity of the MSP2/P44 protein of Chinese A. phagocytophilum isolates might be relevant and might have serious clinical outcomes. This observation could provide a clue to further understand the pathogenesis of Chinese A. phagocytophilum.


Anaplasma (A) phagocytophilum (APH) is a Gram-negative and obligate intracellular pathogen that infects mammal hosts worldwide and is transmitted by ticks [13]. A. phagocytophilum is an important zoonotic pathogen in that it infects not only humans but also some domestic animals, including horses, dogs, cattle and sheep [4,5]. The life cycle of APH is related to its natural hosts, such as rodents and ruminants, as well as its transmission vectors, which include some members of the genera Ixodes and Haemaphysalis [6-12]. Humans are a dead-end host for A. phagocytophilum and are thus not part of the life cycle of the bacteria [13]. The transmission of A. phagocytophilum into mammals mainly relies on infected tick vectors. However, in 2006, the nosocomial transmission of A. phagocytophilum was proven in the Anhui Province in China, suggesting that the pathogen was transmitted through contact with blood or respiratory aerosols from infected patients [14]. The manifestations of human granulocytic anaplasmosis (HGA) infection include fever, chills, headache, myalgia, leukopenia and thrombocytopenia, as well as elevated levels of liver aminotransferase [15]. The number of HGA cases has increased annually because of certain natural and social factors, such as global warming, increases in outdoor activities, globalization of the economy and worldwide trade. The number of A. phagocytophilum infection cases reached 1,161 in the United States in 2009, and the HGA case-fatality rate in the Midwestern United States is 0.6%- 0.7%. This rate may be on the rise, however, due to misdiagnosis [13]. Additionally, tick-borne ruminant fever (for example, cattle and sheep) caused by A. phagocytophilum infection is common in Europe [16]. In Asian countries, including China, Japan and South Korea, HGA cases and HGA agents have been continuously discovered and detected in the last few years [7-12,17-20].

A growing number of medical reports indicate that the clinical manifestations of Chinese HGA patients are significantly different than those of patients from Western countries. The HGA occurring in China is usually accompanied by several life-threatening complications, including systemic inflammation response syndrome (SIRS) and multiple organ dysfunction syndrome (MODS). Moreover, the fatality rate of Chinese HGA patients has been reported to be as high as 26.5% [21]. Therefore, studies that genetically characterize virulence factors and examine the pathogenesis of native Chinese A. phagocytophilum isolates have major clinical and public health significance in China. The members of the outer membrane protein OMP1/MSP2/P44 superfamily are regarded as important virulence factors of A. phagocytophilum pathogens. Genetic variation of the MSP2/P44 protein may not only determine if the bacterium is capable of invading the host but also whether it can generate antigenic variation to allow for escape from the host immune response, resulting in various pathologic injuries and serious clinical outcomes [21-24]. Therefore, A. phagocytophilum pathogenesis, which is related to the genetic characteristics of MSP2/P44, has recently become a topic of growing interest. Given the severe clinical manifestations of HGA in China, we focused on the analysis of the genetic variation of the msp2/p44 genes of three native Chinese A. phagocytophilum isolates from Laizhou Bay, Shandong Province, where 100% of patients had severe clinical manifestations.

Materials and Methods

Ethics statement

The use of pathogenic DNA isolated from patients was approved by the ethics committee of the Chinese CDC (No. 201103), and all samples were anonymized.

Bacteria strains

Three native Chinese A. phagocytophilum isolates, including two human isolates (LZ-HGA-agent-3 and LZ-HGA-agent-4) from HGA patients and one tick isolate (named LZ-HGA-agent-T1) from infected Haemaphysalis (H) longicornis, were isolated at Laizhou Bay in Shandong Province in 2009 - 2010. All three pathogenic isolates were cultured and conserved in HL-60 cell lines in our laboratory. The two human pathogens were isolated from patients with severe clinical manifestations; the ank A genes from the samples had 100% identity with each other and were 100% homologous to the tick isolate (LZ-HGA-agent-T1) [25].

PCR primer design

The msp2/p44 genes of A. phagocytophilum usually contain two open reading frames (ORFs) with msp2 and p44 [26,27]. To obtain msp2/p44, PCR primers were initially designed with the bio-software Primer Premier 5.0, according to the Webster strain sequence (accession number AY164491) of A. phagocytophilum, published in the GenBank database. The specificity of the PCR primers was also assessed using an online program ( The names and relative sites of the PCR primers and the predicted size of the PCR products are shown in Figure 1.

Figure 1. Sequences and positions of the msp2/p44 PCR primers and the predicted size of the PCR product.

The PCR product and its size are shown in the larger box. The two smaller boxes inside the larger box indicate the p44 ORF and msp2 ORF, respectively. Inter-genic sequences (ITS) are shown using dotted lines under letters. Ellipses inside the PCR product indicate the omission of some letters. The PCR products of the primers msp2-F and msp2-R are indicated using green letters under the green arrowhead. SC: start codon; TC: terminal codon.

PCR amplification analyses and sequencing

Genomic DNA (gDNA) was separately prepared from three Chinese native A. phagocytophilum isolates (LZ-HGA-agent-3, LZ-HGA-agent-4 and LZ-HGA-agent-T1) using a DNeasy® Blood & Tissue Kit (QIAGEN, Cat No. 69506) and was then used as a PCR template. The PCR primers were as follows: msp2-F 5’-ACTTATGGTGTTCGGGAGTCTTC-3’ and msp2-R 5’-AATAATAGGAACGGTCACGGAG-3’, and the predicted size of the PCR product was 2,486 bp. Briefly, 3.0 μL of gDNA was used as a template in a 25-μL reaction mixture system containing 2.5 μL 10×Taq Buffer (SDS Genetech Co., Ltd, China, Cat# ET-500), 1.0 μL of each primer: msp2-F and msp2-R (0.4 μM final concentrations of each primer), 0.5 μL of deoxynucleoside triphosphates (dNTPs, 10 mM), 2.5 μL of dye, 0.5 μL of Taq DNA polymerase (5 U/μL, SBS Genetech Co., Ltd, China, Lot#042512) and 14 μL of ddH2O. PCR was performed using a SensoQuest LabCycler standard plus (SensoQuest GmbH, Goettingen, Germany) with a pre-denaturation at 94°C for 5 min, followed by 35 cycles of a denaturation step at 94°C for 40 seconds, an annealing step at 57°C for 40 seconds and an extension step at 72°C for 3 minutes. There was a final extension at 72°C for 10 minutes. The PCR amplification products were analyzed using 1.0% agarose gel electrophoresis. To obtain the entire sequences of the msp2/p44 genes and to avoid the loss of some sequence information at the ends of both primers, we cloned the PCR products as follows: the PCR product was purified using a multi-function DNA purification kit (BioTeke Corporation, Cat#DP1502). Purified msp2/p44 was cloned into a pEASY-T1 Cloning vector (Beijing TransGen Biotech Co., Ltd., Lot#G30716), and the recombinant plasmid was designated pEASY-msp2/p44. The recombinant plasmid pEASY-msp2/p44 was transformed into E. coli DH5a competent cells. Positive clones were screened by PCR using the primers msp2-F and msp2-R. The recombinant plasmid was extracted from positive clones using a high-purification plasmid mini-preparation kit (BioTeke Corporation,Cat#DP1002) and then directly sequenced by two separate commercial sequencing companies in China: Beijing Tsingke BioTech Co., Ltd. and Sangon BioTech (Shanghai) Co., Ltd.

Data analysis

The sequencing was performed with universal primers from the pEASY-T1 Cloning Kit (Beijing TransGen Biotech Co., Ltd., Lot#G30716) and using the Sanger sequencing method. The sequences of msp2/p44 were processed through manual splicing and proofreading and were also analyzed using the nucleotide blast program ( For the analysis of the msp2/p44 sequences, the DNASTAR package (Lasergene, Madison, WI) was used. The msp2/p44 nucleotide sequences and their coded amino acid sequences were edited with the EditSeq program of the package. The msp2/p44 nucleotide sequences and their coded amino acid sequences were then aligned with the MegAlign program of the package by comparison with the corresponding sequences from the A. phagocytophilum HZ and Webster strains. For the purpose of delineating genetic evolution information of the native Chinese A. phagocytophilum isolates, a phylogenetic tree was constructed with 10 sequences, including Chinese A. phagocytophilum isolates and another 9 varying Anaplasma strains (Table 1), which were identified in different hosts from different geographic regions. The msp2 sequences for A. phagocytophilum HZ (CP000235) and Webster (AY164491) were used for outgroup comparisons. The phylogenetic analysis of the msp2/p44 gene sequences was conducted using the program MEGA 5.05 (Arizona State University), as previously described [6]. In general, the sequences were aligned using CLUSTAL W of MEGA 5.05, with the application of the IUB matrix for nucleotide sequences and the Gonnet matrix for protein sequences. Tree construction was achieved using the neighbor-joining method with the complete deletion option, using the Jukes-Cantor matrix for nucleotide sequences and the Dayhoff matrix for protein sequences. Bootstrap analysis was conducted with 1,000 replicates.

Accession No.Bacteria strainsHostsGeographic originRemark
KC128828/KC430333/KC430334LZ-HGA-AgenthumanChina p44+msp2
CP000235APH-HZhumanUSA Omp-1N+msp2
AY164491APH-Webster-var AhumanUSA p44ESup1+msp2
AY164492APH- HGE2-var II1humanUSA p44ESup1+msp2
AY164493APHhumanUSA p44ESup1+msp2
AY137510APH-NY-37humanUSA p44+omp-1
FJ600595APH-Tick-176-5ES-Iwate-IpIxodes persulcatusJapan p44+omp-1
FJ600601APH-Tick-176-5ES-Iwate-IpIxodes persulcatusJapan p44+omp-1
DQ519565APH-NORSHESsheepNorway p44ESup1+msp2
DQ519566APH-SWDOGESdogSweden p44ESup1+msp2

Table 1. Selected msp2/p44(p44ESup1/omp-1) gene sequences for the phylogenetic analysis.

Note: omp-1 may stand for omp-1N, omp-1X or both; APH: A. phagocytophilum
Download CSV

Bioinformatics analysis of the MSP2/P44 protein

The structural information for the MSP2/P44 protein was predicted and delineated using online software and/or programs. In particular the ProtParam tool ( was used for the primary structure of the protein and the Predict Secondary Structure (PSIPRED v3.0) ( for the secondary structures. TMHMM Server v.2.0 ( was used for the transmembrane domains, BepiPred 1.0 Server ( was used for B-cell epitope-bearing regions, and Galaxy TBM ( was used for the protein tertiary structure prediction from sequences obtained with template-based modeling.

Accession numbers of the full-length msp2/p44 gene sequences of LZ-HGA-Agent-3, LZ-HGA-Agent-4 and LZ-HGA-Agent-T1 have been deposited in the GenBank database under the accession numbers KC128828, KC430333 and KC430334, respectively.


PCR amplification and msp2/p44 gene sequencing

PCR amplification revealed that the predicted 2.5-kb fragments of the msp2/p44 genes were successfully amplified, using the msp2-F/msp2-R primer pair and the genomic DNA of three native Chinese A. phagocytophilum isolates, namely LZ-HGA-Agent-3 (KC128828), LZ-HGA-Agent-4 (KC430333) and LZ-HGA-Agent-T1 (KC430334), as templates. The sequencing analysis indicated that the sequences of msp2/p44 in all of the isolates were 100% identical to each other at the nucleotide level and were mainly composed of one p44 (825 bp) open reading frame (ORF), one msp2 (1323 bp) ORF and a few intergenic sequences (ITS) (Figure 1). Therefore, the three native Chinese A. phagocytophilum isolates mentioned above were designated LZ-HGA-Agent (KC128828/KC430333/KC430334) for simplifying the description in the study, where the name collectively stands for LZ-HGA-Agent-3, LZ-HGA-Agent-4 and LZ-HGA-Agent-T1.

The p44 ORF sequence of LZ-HGA-Agent is 100% and 99.6% identical to the p44ESup1 sequence of the Webster strain of A. phagocytophilum and the omp-1N sequence of the HZ strain, respectively. There is a 3-bp difference in the p44 ORF sequence of LZ-HGA-Agent compared with the omp-1N sequence of the HZ strain at the nucleotide level. The details of the differences are as follows: A to G nucleotide sequence change at positions 270 and 720 and G to A at position 484 in the HZ strain sequence (data not shown). In contrast, the msp2 ORF sequence of LZ-HGA-Agent displays 87.7% and 45.1% identity to that of the msp2 sequences of both the A. phagocytophilum Webster and HZ strains, respectively. The LZ-HGA-Agent msp2 ORF sequence displays relatively little homology to the msp2 sequences of the A. phagocytophilum HZ strain (45.1%) when contrasted with the A. phagocytophilum Webster strain (87.7%) because of the occurrence of large-scale diversity at the nucleotide level (see Figure S1).

The P44 amino acid sequence, which is encoded by the p44 ORF in the LZ-HGA-Agent, is 100% and 99.6% similar to the p44ESup1-coded product sequence of the Webster strain and the omp-1N-coded product sequence of the HZ strain at the amino acid level, respectively. The LZ-HGA-Agent and the HZ strains have identical P44 sequences, except for a difference in the amino acid at position 162, namely V to M in the HZ strain sequence (data not shown). The MSP2 amino acid sequence, which is encoded by msp2 ORF in LZ-HGA-Agent, displays 84.6% and 27.9% homology to the MSP2 amino acid sequences of the A. phagocytophilum Webster and HZ strain, respectively. It is of note that, for the msp2 ORFs and the coding amino acid sequences that were analyzed in this work, the identities and similarities between the different msp2 sequences in various strains demonstrate that coding amino acid similarities are lower than nucleotide identities, suggesting that the msp2 nucleotide exchanges of LZ-HGA-Agent were extremely nonsynonymous substitutions (see Figure S1 and Figure S2). Thus, we conclude that extreme differences in the genetic variation of the msp2 ORF sequence and its amino acid sequence in LZ-HGA-Agent exist, but such extreme differences do not exist with the p44 ORF sequences.

In this study, we also compared our results with another Chinese sequence of msp2/p44 (EU 008082) identified in rodents in the southeast of China [28], and the results indicated that the identity of the msp2 nucleotide sequence (from nt 1475 to 2352) and the amino acid sequences of both the A. phagocytophilum LZ-HGA-Agent strain and the rodent (EU008082) strain were 48.4% and 27.7%, respectively.

Phylogenetic analysis

To assess the relationship between LZ-HGA-Agent and other strains of A. phagocytophilum investigated in this study, another 9 sequences identified in different host species from different geographic regions were used to construct a phylogenetic tree (Table 1). Specifically, we constructed a neighbor-joining (NJ) tree. As shown in Figure 2a, all major branches referring to the gene sequences used in the work were supported by bootstrap values >60%. Using amino acid sequences to construct the tree, similar results were obtained (Figure 2b). From Figure 2a and b, we noticed that the A. phagocytophilum Chinese isolate LZ-HGA-Agent (KC128828/KC430333/KC430334) was very closely related to the human A. phagocytophilum Webster strain (AY164491) from the United States, human strain NY-37 (AY137510) from the United States, sheep NORSHES strain (DQ519565) from Norway, canine NORSHES strain (DQ519566) from Sweden and tick strain Tick-176-5ES-Iwate-Ip (FJ600595 and FJ600601) from Japan, but was less related to the human HZ strain (CP000235) from the United States (Figure 2a and 2b).

Figure 2. Phylogenetic tree based on the msp2/p44 nucleotide sequences (a) and MSP2/P44 amino acid sequences (b) generated using the neighbor-joining method.

a. Bootstrap values (>60%) are shown next to the nodes of the tree, and the scale bar indicates the number of nucleotide substitutions per site; b. The MSP2/P44 amino acid sequences were obtained by msp2/p44 gene sequence translation, and the bootstrap values (>50%) are shown next to the nodes of the tree. The scale bar indicates the number of amino acid substitutions per site. The Chinese isolate LZ-HGA-Agent (red) and the international reference strains for APH-Webster and APH-HZ (blue) are highlighted. APH: Anaplasma phagocytophilum.

Bioinformatics analyses of the MSP2 protein

Because the p44 ORF sequence of LZ-HGA-Agent displayed a higher homology to the p44ESup1 sequence (100%) of A. phagocytophilum Webster (APH-Webster) and the omp-1N sequence (99.6%) of A. phagocytophilum HZ (APH-HZ), their bioinformatics analyses were not conducted in detail in this study. In contrast, we focused on the analyses of the MSP2 proteins of the different strains.

MSP2 amino acid composition

The amino acid composition of the MSP2 proteins belonging to LZ-HGA-Agent, APH-HZ and APH-Webster were analyzed using the ProtParam tool (, and the results are shown in Table 2. The more abundant amino acids in MSP2 from LZ-HGA-Agent include Gly (12.5%), Ala (9.5%) and Val (9.5%), and the percent of the Gly content (12.5%) of LZ-HGA-Agent-MSP2 was higher than that of APH-HZ-MSP2 (10.1%) but lower than that of APH-Webster-MSP2 (13.8%). The Val content (9.5%) of LZ-HGA-Agent-MSP2 was nearly equal to that of APH-HZ-MSP2 (8.8%) and APH-Webster-MSP2 (9.7%). The percentage of the Ala content (9.5%) of LZ-HGA-Agent-MSP2 was roughly equal to that of APH-Webster-MSP2 (9.2%) but was obviously higher than that of APH-HZ-MSP2 (5.8%). The MSP2 isoelectric point analysis (pI) indicated that the pI (5.87) of LZ-HGA-Agent-MSP2 from China was approximately equal to the pI (5.59) of human APH-Webster-MSP2 from the United States but was different than the pI (9.20) of human APH-HZ-MSP2 from the United States.

Amino acidNumberPercentageNumberPercentageNumberPercentage
Ala (A)429.50%215.80%409.20%
Arg (R)163.60%226.00%163.70%
Asn (N)184.10%174.70%173.90%
Asp (D)317.00%133.60%327.40%
Cys (C)40.90%20.50%40.90%
Gln (Q)61.40%92.50%71.60%
Glu (E)194.30%226.00%235.30%
Gly (G)5512.50%3710.10%6013.80%
His (H)71.60%92.50%51.10%
Ile (I)173.90%215.80%173.90%
Leu (L)317.00%4211.50%306.90%
Lys (K)296.60%205.50%337.60%
Met (M)102.30%92.50%102.30%
Phe (F)163.60%195.20%153.40%
Pro (P)163.60%92.50%122.80%
Ser (S)327.30%277.40%276.20%
Thr (T)296.60%174.70%255.70%
Trp (W)20.50%20.50%20.50%
Tyr (Y)184.10%154.10%184.10%
Val (V)429.50%328.80%429.70%

Table 2. Amino acid composition of MSP2 proteins from LZ-HGA-Agent, APH-HZ and APH-Webster.

Download CSV

MSP2 secondary structure.

The secondary structures of each MSP2 protein were predicted using the program Predict Secondary Structure (PSIPRED v3.0) (, and the results are shown in Figure S3, Figure S4 and Figure S5. As shown in these figures, all MSP2 protein structures from LZ-HGA-Agent, A. phagocytophilum Webster and A. phagocytophilum HZ included a main random coil structure, a β-strand structure dispersed to the two ends of the protein and a few α-helices. However, there was a greater number of α-helices in the LZ-HGA-Agent-MSP2 protein than in APH-HZ-MSP2 and APH-Webster-MSP2. In particular, there were six α-helices in the LZ-HGA-Agent-MSP2 from China but only three α-helices in the APH-HZ-MSP2 and five α-helices in APH-Webster-MSP2 from the United States. In contrast, there were 17 β-strands in the LZ-HGA-Agent-MSP2 protein but 20 in APH-HZ-MSP2 and 17 in APH-Webster-MSP2, suggesting that the secondary structure of the LZ-HGA-Agent-MSP2 protein from the Chinese isolate was obviously distinct from the structures of APH-HZ-MSP2 from the United States but not from APH-Webster-MSP2 from the Unites States.

MSP2 transmembrane domains

The transmembrane domains of each MSP2 protein were predicted using TMHMM Server v.2.0 ( (Figure 3). The predicted results indicated that the proteins were hardly different in terms of the location of the transmembrane domains of MSP2, when comparing the LZ-HGA-Agent isolate, APH-HZ and APH-Webster strains. The transmembrane domains of MSP2 were basically located in the 5th to 24th amino acid sequence region.

Figure 3. Predicted transmembrane domain for LZ-HGA-Agent-MSP2 (A), APH-HZ-MSP2 (B) and APH-Webster-MSP2 (C).

The red legend (transmembrane —), blue legend (inside —) and pink legend (outside —) in panels A, B and C indicate the transmembrane domain, interior domain and exterior domain, respectively, for the MSP2 protein as predicted by the TMHMM program. The number on the horizontal abscissa in panels A, B and C indicates the amino acid (AA) residual site and size.

B-cell epitope analysis of MSP2

B-cell epitope-bearing regions of each protein were predicted using BepiPred 1.0 Server (, and the results are shown in Table 3. Nineteen B-cell epitopes were predicted (the number of amino acids ≥3 at an epitope) in the MSP2 protein of the LZ-HGA-Agent isolate, which was higher than that of the MSP2 proteins from both the APH-HZ and APH-Webster strains, which only had 16 and 9 predicted B-cell epitopes (the number of amino acids ≥3), respectively.

No.Start/End positionOligo peptide for epitopePeptide length
21/30( APH-Webster)TSAHADNDKS10
248/50( LZ-HGA-Agent)SPA3
50/56( APH-Webster)IDDGGET7
360/68( LZ-HGA-Agent)RESNGETKA9
77/83( APH-Webster)YWGPEVA7
473/76( LZ-HGA-Agent)LKDG4
92/98( APH-Webster)NTTFGGS7
578/80( LZ-HGA-Agent)SVK3
125/132( APH-Webster)HKGRKGGG8
686/96( LZ-HGA-Agent)FDWNTPDPRIG11
191/193 ( APH-Webster)LKR3
7109/115( LZ-HGA-Agent)VGYGIGG7
205/209( APH-Webster)PRNRS5
8132/144( LZ-HGA-Agent)IRDSGSKEDGADT13
273/276( APH-Webster)CAGI4
9158/164( LZ-HGA-Agent)TGQTDNL7
335/345( APH-Webster)DISPTNSVREK11
10170/175( LZ-HGA-Agent)KTSGKD6
11185/193( LZ-HGA-Agent)VSHPTIDGK9
13214/226( LZ-HGA-Agent)AVESDVKTGNNNN13
14233/246( LZ-HGA-Agent)AGSTDGTGSSSPQV14
16295/302( LZ-HGA-Agent)LTPEEKTI8
17312/315( LZ-HGA-Agent)EGGE4
18398/404( LZ-HGA-Agent)VYDDLPA7
19408/421( LZ-HGA-Agent)VDDTSPAGRTKDTA14

Table 3. Predicted epitopes for LZ-HGA-Agent-MSP2, APH-Webster-MSP2, and APH-HZ-MSP2.

APH-HZ: A. phagocytophilum HZ strain, APH-Webster: A. phagocytophilum Webster strain, LZ-HGA-Agent: Chinese A. phagocytophilum isolate.
Download CSV

MSP2 tertiary structures.

Protein tertiary structure prediction was performed using the program GalaxyTBM (, and the predicted results are shown in Figure 4. The predicted results indicated that the MSP2 protein tertiary structures of the LZ-HGA-Agent isolate, the APH-HZ and the APH-Webster strains are very different (Figure 4).

Figure 4. Predicted tertiary structures of MSP2 proteins based on the Nspa (PDB: 1P4T-A) template.

The proteins sequences were copied into GalaxyTBM ( to compute their tertiary structures. The β-sheet (green), β-turn (blue) and random coil (gray) are highlighted.


The A. phagocytophilum HZ strain was first isolated from a patient with human immunodeficiency disease in New York, USA, in 1995 [29]. This strain caused typical clinical manifestations of HGA and could be cultured by HL60 cells and formed morulas in the cytoplasm of the culture cells. Moreover, this strain is highly cross-reactive with other HGA agents and E. chaffeensis. The genomic size of the A. phagocytophilum HZ strain is approximately 1.47 Mbps, consisting of 1,369 ORFs, over 100 p44 (msp2) genes, type IV secretion (T4S) and numerous repeats [13,30]. The human HZ strain infects granulocytes by subverting its powerful innate antimicrobial defenses, which also makes infected humans and animals more susceptible to opportunistic infection and causes the resulting endothelial cell adhesion, transmigration, motility, degranulation, respiratory burst and phagocytosis [13]. The changes in these functions are to increase bacterial dissemination into the neutrophil.

The A. phagocytophilum Webster (Wisconsin) strain was isolated from a patient in northwestern Wisconsin in 1996 [31], where the seroprevalence of HGA among permanent residents is as high as 14.9% [32], the prevalence of A. phagocytophilum in deer was 8.9%-11.5%, and the infection rate in ticks was 5.6%-26% [33]. It is noteworthy that serological cross-reaction assays indicated that there was a striking antigen difference between the A. phagocytophilum strains Webster and HZ [34].

The LZ-HGA-Agent isolates were isolated from patients and tick-vectors from Laizhou Bay, Shandong Province, during 2009-2010 [25], which is the largest wetland in northern China and a famous migratory bird post across Asia and the West Pacific. As seen in a recent clinical report [21], these two cases were characterized by severe clinical manifestations, including systemic inflammatory response syndrome (SIRS) and multiple organ dysfunction syndrome (MODS). In addition, nearly 100% of Chinese HGA patients in these areas had severe clinical features including SIRS and MODS, significantly lower WBC counts and PLT counts, as well as significantly elevated levels of LDH, CK, BUN, ALT and AST.

The pathogenesis of A. phagocytophilum is an issue of worldwide significance because of the effects of the bacteria on human public health. Currently, the members of the major outer membrane protein superfamily OMP-1/MSP2/P44, belonging to A. phagocytophilum, are known to be important genetic determinants of pathogenesis and can allow A. phagocytophilum to not only adhere to the host cells but also to avoid the host immune surveillance, thus contributing to colonization of the host intracellular environment [4,13,26,35]. The analysis based on comparative genomics demonstrated that the expansion of the msp2/p44 family is a common feature in A. phagocytophilum strains [30]. The diversity of paralogous p44 genes is related to their geographic origin, host-specificity and the mechanisms of functional divergence [3638]. As a typical example, p44-1 was found in all human isolates from New York State but not in isolates from Minnesota, whereas p44-18 was found in isolates from both regions [36]. The antigenic variability of msp2/p44 is due to differential expression of major immunodominant outer membrane proteins encoded by members of a multigene family [39].

The homologous recombination of p44/msp2 occurs by the use of one of the most important gene conversion mechanisms, the RecFOR recombination pathway, because A. phagocytophilum lacks the RecBCD pathway [13,29,35,40,41]. Consequently, the antigenic variation of the P44/MSP2 proteins of A. phagocytophilum is most likely an intrinsic property, contributing to the bacterial survival by subverting the host immune system and the persistence within the host intracellular environment [13,29,35,42,43].

In this work, the msp2 sequences of the Chinese native LZ-HGA-Agent showed a striking difference, both at the nucleotide and amino acid levels (Figure S1 and Figure S2). At the same time, bioinformatics analyses indicated that the Chinese isolates possessed unique protein secondary structures, as the number of α-helices in this strain was greater than that of the proteins in the A. phagocytophilum HZ and Webster strains from the United States. However, the MSP2 protein of the LZ-HGA-Agent isolate had fewer β-strands than the HZ strain while maintaining the same number as the Webster strain (see Figures S3, S4 and S5). In addition, a major difference in tertiary structures of the MSP2 protein was observed between the Chinese LZ-HGA-Agent isolate and both the APH-HZ and APH-Webster strains from the USA. The B-cell epitopes in the MSP2 protein of the LZ-HGA-Agent isolate were clearly more prevalent than those in the HZ and Webster strains.

The second and tertiary structures of the MSP2 protein may directly influence its spatial conformations/structures and may change its biological function in terms of host adaptation, bacterial adhesion and bacterial membrane structural integrity.

B-cell epitopes (also called antigenic determinants) are specific regions that are recognized by and/or interact with immunoreceptors and/or antibody molecules of the B-lymphocyte surface during pathogen-host cell interaction. Multi-antigenic epitopes may exist on one single protein, and various antigenic epitopes may play a different role in bacterial infection. Furthermore, the amount, relative position, stability and conformation of the antigenic epitope are closely related to the protein behavior. The differences in the B-cell epitopes of MSP2 of the LZ-HGA-Agent isolate may directly influence the recognition of immunological B cells by the bacteria, resulting in host cell invasion and, subsequently, the severe clinical symptoms observed following infection with the pathogen isolate in China.

Although we recognize that three isolates of A. phagocytophilum is limited, the LZ-HGA-Agent isolate data obtained in this work may further help us to enhance our basic genetic knowledge of the pathogenesis and biology of the Chinese A. phagocytophilum pathogenic strains. The next study will continue to gather more isolates of Chinese A. phagocytophilum pathogens and will focus on determining how antigenic variation of the MSP2/P44 protein family contributes to the biology and/or pathogenesis of the human isolate LZ-HGA-Agent and which antigenic variations of the same protein family are involved in the severe clinical symptoms of HGA patients in China.

Supporting Information

Figure S1.

msp2 linear alignment of LZ-HGA-Agent, A. phagocytophilum HZ and A. phagocytophilum Webster at the nucleotide level. The alignment report was performed using the MegAlign program of the DNASTAR package. The nucleotide sequence names are indicated to the left, and the nucleotide numbers are shown to the right. The solid, deep red letters differ from the consensus, whereas all others match the consensus. APH-Webster: A. phagocytophilum Webster strain, APH-HZ: A. phagocytophilum HZ strain.


Figure S2.

MSP2 linear alignment of LZ-HGA-Agent, A. phagocytophilum HZ and A. phagocytophilum Webster at the amino acid level. The alignment report was performed using the MegAlign program of the DNASTAR package. Amino acid sequence names are indicated to the left, and the amino acid numbers are shown to the right. The amino acid residues colored a solid, deep red differ from the consensus sequence, and all others match the consensus. AA: amino acid residue. APH-Webster: A. phagocytophilum Webster strain, APH-HZ: A. phagocytophilum HZ strain.


Figure S3.

Putative secondary structure of LZ-HGA-Agent MSP2 determined using the Predict Secondary Structure (PSIPRED v3.0) program (


Figure S4.

Putative secondary structure of A. phagocytophilum HZ (APH-HZ) MSP2 determined using the Predict Secondary Structure (PSIPRED v3.0) program (


Figure S5.

Putative secondary structure of A. phagocytophilum Webster (APH-Webster) MSP2 determined using the Predict Secondary Structure (PSIPRED v3.0) program (



We thank Didier Raoult for providing the serological assays and Rickettsia strains for this study. We also thank J Robert Massung for providing the Ehrlichia chaffeensis antigen and JS Dumler for providing the Anaplasma phagocytophilum antigen and training us in the culturing of the A. phagocytophilum pathogen.

Author Contributions

Conceived and designed the experiments: LJZ. Performed the experiments: YW. Analyzed the data: YW LJZ. Contributed reagents/materials/analysis tools: CFC. Wrote the manuscript: YW LJZ.


  1. 1. Walker DH, Dumler JS (1994) Emerging and reemerging rickettsial diseases.N Engl J Med 331: 1651-1652. doi:10.1056/NEJM199412153312410. PubMed: 7969347.
  2. 2. Chapman AS, Bakken JS, Folk SM, Paddock CD, Bloch KC et al. (2006) Diagnosis and management of tick borne rickettsial diseases: Rocky Mountain spotted fever, ehrlichioses, and anaplasmosis-United States. MMWR Recomm Rep 55Volumes RR-4: 1-27.
  3. 3. Dumler JS, Choi KS, Garcia-Garcia JC, Barat NS, Scorpio DG et al. (2005) Human granulocytic anaplasmosis and Anaplasma phagocytophilum. Emerg Infect Dis11: 1828-1834. doi:10.3201/eid1112.050898. PubMed: 16485466.
  4. 4. Dumler JS (2012) The biological basis of severe outcomes in Anaplasma phagocytophilum infection. FEMS Immunol Med Microbiol 64: 13-20. doi:10.1111/j.1574-695X.2011.00909.x. PubMed: 22098465.
  5. 5. Aguero-Rosenfeld ME, Dumler JS (2003) Ehrlichia, Anaplasma, Neorickettsia, and Aegyptianella. In: PR Murray. Manual of Clinical Microbiology. 8th ed. Washington DC, USA: ASM Press. pp. 1015-1029.
  6. 6. Scharf W, Schauer S, Freyburger F, Petrovec M, Schaarschmidt-Kiener D et al. (2011) Distinct host species correlate with Anaplasma phagocytophilum ankA gene clusters. J Clin Microbiol 49: 790-796. doi:10.1128/JCM.02051-10. PubMed: 21177886.
  7. 7. Zhang L, Liu H, Xu B, Lu Q, Li L et al. (2012) Anaplasma phagocytophilum infection in domestic animals in ten provinces/cities of China. Am J Trop Med Hyg 87: 185-189. doi:10.4269/ajtmh.2012.12-0005. PubMed: 22764312.
  8. 8. Zhan L, Cao WC, Jiang JF, Zhang XA, Wu XM et al. (2010) Anaplasma phagocytophilum in livestock and small rodents. Vet Microbiol 144: 405-408. doi:10.1016/j.vetmic.2010.02.018. PubMed: 20558015.
  9. 9. Yoshimoto K, Matsuyama Y, Matsuda H, Sakamoto L, Matsumoto K et al. (2010) Detection of Anaplasma bovis and Anaplasma phagocytophilum DNA from Haemaphysalis megaspinosa in Hokkaido, Japan. Vet Parasitol 168: 170-172. doi:10.1016/j.vetpar.2009.10.008. PubMed: 19897306.
  10. 10. Oh JY, Moon B-C, Bae BK, Shin E-H, Ko YH et al. (2009) Genetic identification and phylogenetic analysis of anaplasma and Ehrlichia species in Haemaphysalis longicornis collected from Jeju Island, Korea. J Bacteriol Virol 39: 257-267. doi:10.4167/jbv.2009.39.4.257.
  11. 11. Sun J, Liu Q, Lu L, Ding G, Guo J et al. (2008) Coinfection with four genera of bacteria (Borrelia, Bartonella, Anaplasma, and Ehrlichia) in Haemaphysalis longicornis and Ixodes sinensis ticks from China. Vector Borne Zoonotic Dis 8: 791-795. doi:10.1089/vbz.2008.0005. PubMed: 18637722.
  12. 12. Kim CM, Kim MS, Park MS, Park JH, Chae JS (2003) Identification of Ehrlichia chaffeensis, Anaplasma phagocytophilum, and A. bovis in Haemaphysalis longicornis and Ixodes persulcatus ticks from Korea. Vector Borne Zoonotic Dis 3: 17-26. doi:10.1089/153036603765627424. PubMed: 12804377.
  13. 13. Rikihisa Y (2011) Mechanisms of obligatory intracellular infection with Anaplasma phagocytophilum. Clin Microbiol Rev 24: 469-489. doi:10.1128/CMR.00064-10. PubMed: 21734244.
  14. 14. Zhang L, Liu Y, Ni D, Li Q, Yu Y et al. (2008) Nosocomial transmission of human granulocytic anaplasmosis in China. JAMA 300: 2263-2270. doi:10.1001/jama.2008.626. PubMed: 19017912.
  15. 15. Dumler JS, Madigan JE, Pusterla N, Bakken JS (2007) Ehrlichioses in humans: epidemiology, clinical presentation, diagnosis, and treatment. Clin Infect Dis 45: S45-S51. doi:10.1086/518146. PubMed: 17582569.
  16. 16. Woldehiwet Z (2006) Anaplasma phagocytophilum in ruminants in Europe. Ann N Y Acad Sci 1078: 446-460. doi:10.1196/annals.1374.084. PubMed: 17114753.
  17. 17. Zhan L, Cao WC, Jiang JF, Zhang XA, Liu YX et al. (2010) Anaplasma phagocytophilum from Rodents and Sheep, China. Emerg Infect Dis 16: 764-768. doi:10.3201/eid1605.021293. PubMed: 20409364.
  18. 18. Zhan L, Cao WC, Chu CY, Jiang BG, Zhang F et al. (2009) Tick-borne agents in rodents, China, 2004-2006. Emerg Infect Dis 15: 1904-1908. doi:10.3201/eid1512.081141. PubMed: 19961668.
  19. 19. Zhang L, Shan A, Mathew B, Yin J, Fu X et al. (2008) Rickettsial Seroepidemiology among farm workers, Tianjin, People's Republic of China. Emerg Infect Dis 14: 938-940. doi:10.3201/eid1406.071502. PubMed: 18507907.
  20. 20. Ohashi N, Inayoshi M, Kitamura K, Kawamori F, Kawaguchi D et al. (2005) Anaplasma phagocytophilum-infected ticks, Japan. Emerg Infect Dis 11: 1780-1783. doi:10.3201/eid1111.050407. PubMed: 16318739.
  21. 21. Li H, Zhou Y, Wang W, Guo D, Huang S et al. (2011) The clinical characteristics and outcomes of patients with human granulocytic anaplasmosis in China. Int J Infect Dis 15: e859-e866. doi:10.1016/j.ijid.2011.09.008. PubMed: 22015246.
  22. 22. Yu Q, Chen CF, Chen Q, Zhang LJ (2012) Expression and immunogenicity of recombinant immunoreactive surface protein 2 of Anaplasma phagocytophilum. Clin Vaccine Immunol 19: 919-923. doi:10.1128/CVI.05709-11. PubMed: 22539470.
  23. 23. Wuritu , Ozawa Y, Gaowa , Kawamori F, Masuda T et al. (2009) Structural analysis of a p44/msp2 expression site of Anaplasma phagocytophilum in naturally infected ticks in Japan. J Med Microbiol 58: 1638-1644. doi:10.1099/jmm.0.011775-0. PubMed: 19713360.
  24. 24. Troese MJ, Sarkar M, Galloway NL, Thomas RJ, Kearns SA et al. (2009) Differential expression and glycosylation of Anaplasma phagocytophilum major surface protein 2 paralogs during cultivation in sialyl Lewis x-deficient host cells. Infect Immun 77: 1746-1756. doi:10.1128/IAI.01530-08. PubMed: 19223475.
  25. 25. Zhang L, Wang G, Liu Q, Chen C, Li J et al. (2013) Molecular analysis of Anaplasma phagocytophilum isolated from patients with febrile diseases of unknown etiology in China. PLOS ONE;8(2): e57155. doi:10.1371/journal.pone.0057155. PubMed: 23451170.
  26. 26. Scorpio DG, Leutenegger C, Berger J, Barat N, Madigan JE et al. (2008) Sequential analysis of Anaplasma phagocytophilum msp2 transcription in murine and equine models of human granulocytic anaplasmosis. Clin Vaccine Immunol 15: 418-424. doi:10.1128/CVI.00417-07. PubMed: 18094110.
  27. 27. Lin M, Kikuchi T, Brewer HM, Norbeck AD, Rikihisa Y (2011) Global proteomic analysis of two tick-borne emerging zoonotic agents: Anaplasma phagocytophilum and Ehrlichia chaffeensis. Front Microbiol 2: 24. PubMed: 21687416.
  28. 28. Zhan L, Cao WC, de Vlas S, Xie SY, Zhang PH et al. (2008) A newly discovered Anaplasma phagocytophilum variant in rodents from southeastern China. Vector Borne Zoonotic Dis 8: 369-380. doi:10.1089/vbz.2007.0211. PubMed: 18471056.
  29. 29. Rikihisa Y, Zhi N, Wormser GP, Wen B, Horowitz HW et al. (1997) Ultrastructural and antigenic characterization of a granulocytic ehrlichiosis agent directly isolated and stably cultivated from a patient in New York State. J Infect Dis 175: 210–213. doi:10.1093/infdis/175.1.210. PubMed: 8985223.
  30. 30. Dunning Hotopp JC, Lin MQ, Madupu R, Crabtree J, Angiuoli SV, et al. (2006) Comparative genomics of emerging human ehrlichiosis agents. PLOS Genet 2: 208-223. doi:10.1371/journal.pgen.0020208. PubMed: 16482227.
  31. 31. Bakken JS,Goellner P, Van Etten M, Boyle DZ, Swonger OL et al. (1998) Seroprevalence of human granuloctytic ehrlichiosis among permanent residents of northwestern Wisconsin.J Infect Dis27: 1491-1496.
  32. 32. Inokuma H, Brouqui P, Dumler JS, Raoult D (2003) Serotyping isolates of Anaplasma phagocytophilum by using monoclonal antibodies. Clin Diagn Lab Immunol 10: 969-972. PubMed: 12965936.
  33. 33. Michalski M, Rosenfield C, Erickson M, Selle R, Bates K et al. (2006) Anaplasma phagocytophilum in central and western Wisconsin: a molecular survey. Parasitol Res 99:694-699.
  34. 34. Asanovich Kristin M, Johan S, Bakken et al John E. (1997) Antigenic diversity of granulocytic Ehrlichia isolates from humans in Wisconsin and New York and a horse in California. J Infect Dis 176:1029–1034. doi:10.1086/516529. PubMed: 9333162.
  35. 35. Rikihisa Y (2010) Anaplasma phagocytophilum and Ehrlichia chaffeensis: subversive manipulators of host cells. Nat Rev Microbiol 8:328-339. doi:10.1038/nrmicro2318. PubMed: 20372158.
  36. 36. Lin Q, Rikihisa Y, Massung RF, Woldehiwet Z, Falco RC (2004) Polymorphism and transcription at the p44-1/p44-18 genomic locus in Anaplasma phagocytophilum strains from diverse geographic regions. Infect Immun 72:5574-5581. doi:10.1128/IAI.72.10.5574-5581.2004. PubMed: 15385454.
  37. 37. Foley J, Nieto NC, Madigan J, Sykes J (2008) Possible differential host tropism in Anaplasma phagocytophilum strains in the Western United States. Ann N Y Acad Sci 1149:94-97. doi:10.1196/annals.1428.066. PubMed: 19120182.
  38. 38. Al-Khedery B, Lundgren AM, Stuen S, Granquist EG, Munderloh UG et al. (2012) Structure of the type IV secretion system in different strains of Anaplasma phagocytophilum .BMC Genomics 29:13:678. PubMed: 23190684.
  39. 39. Park Jin-Ho, KC,Patil S, Dumler JS (2002,) Genetic Variability and Stability of Anaplasma phagocytophila msp2 (p44). Infect Immun March 70:1230–1234. doi:10.1128/IAI.70.3.1230-1234.2002. PubMed: 11854205.
  40. 40. Foley JE, Nieto NC, Barbet A, Foley P (2009) Antigen diversity in the parasitic bacterium Anaplasma phagocytophilum arises from selectively-represented, spatially clustered functional pseudogenes. PLOS ONE 4:e8265. doi:10.1371/journal.pone.0008265. PubMed: 20016821.
  41. 41. Lin Q, Zhang C, Rikihisa Y (2006) Analysis of involvement of the RecF pathway in p44 recombination in Anaplasma phagocytophilum and in Escherichia coli by using a plasmid carrying the p44 expression and p44 donor loci. Infect Immun 74:2052-2062. doi:10.1128/IAI.74.4.2052-2062.2006. PubMed: 16552034.
  42. 42. Lin Q, Rikihisa Y (2005) Establishment of cloned Anaplasma phagocytophilum and analysis of p44 gene conversion within an infected horse and infected SCID mice. Infect Immun 73:5106-5114. doi:10.1128/IAI.73.8.5106-5114.2005. PubMed: 16041027.
  43. 43. Carlyon JA, Fikrig E (2003) Invasion and survival strategies of Anaplasma phagocytophilum. Cell Microbiol 5:743-754. doi:10.1046/j.1462-5822.2003.00323.x. PubMed: 14531890.