Unlike in HIV-1 infection, the majority of HIV-2 patients produce broadly reactive neutralizing antibodies, control viral replication and survive as elite controllers. The identification of the molecular, structural and evolutionary footprints underlying these very distinct immunological and clinical outcomes may lead to the development of new strategies for the prevention and treatment of HIV infection.
We performed a side-by-side molecular, evolutionary and structural comparison of the C2, V3 and C3 envelope regions from HIV-1 and HIV-2. These regions contain major antigenic targets and are important for receptor binding. In HIV-2, these regions also have immune modulatory properties. We found that these regions are significantly more variable in HIV-1 than in HIV-2. Within each virus, C3 is the most entropic region followed by either C2 (HIV-2) or V3 (HIV-1). The C3 region is well exposed in the HIV-2 envelope and is under strong diversifying selection suggesting that, like in HIV-1, it may harbour neutralizing epitopes. Notably, however, extreme diversification of C2 and C3 seems to be deleterious for HIV-2 and prevent its transmission. Computer modelling simulations showed that in HIV-2 the V3 loop is much less exposed than C2 and C3 and has a retractile conformation due to a physical interaction with both C2 and C3. The concealed and conserved nature of V3 in the HIV-2 is consistent with its lack of immunodominancy in vivo and with its role in preventing immune activation. In contrast, HIV-1 had an extended and accessible V3 loop that is consistent with its immunodominant and neutralizing nature.
We identify significant structural and functional constrains to the diversification and evolution of C2, V3 and C3 in the HIV-2 envelope but not in HIV-1. These studies highlight fundamental differences in the biology and infection of HIV-1 and HIV-2 and in their mode of interaction with the human immune system and may inform new vaccine and therapeutic interventions against these viruses.
Citation: Barroso H, Borrego P, Bártolo I, Marcelino JM, Família C, Quintas A, et al. (2011) Evolutionary and Structural Features of the C2, V3 and C3 Envelope Regions Underlying the Differences in HIV-1 and HIV-2 Biology and Infection. PLoS ONE 6(1): e14548. https://doi.org/10.1371/journal.pone.0014548
Editor: John J. Rossi, City of Hope, United States of America
Received: July 1, 2010; Accepted: December 10, 2010; Published: January 20, 2011
Copyright: © 2011 Barroso et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This work was supported by projects PTDCSAU-FCF6767/2006 from Fundo para a Ciencia e Tecnologia, Portugal and by Collaborative HIV and Anti-HIV Drug Resistance Network (CHAIN), from the European Union. PB and IB are supported by PhD grants from Fundo para a Ciencia e Tecnologia. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Human Immunodeficiency Virus type 1 (HIV-1) infection affects more than 40 million individuals throughout the world. It is caused mainly by isolates belonging to group M. Within this group there are nine different subtypes named A to H, six subsubtypes (F1, F2, A1–A4) and at least thirty six recombinant forms named CRF01 up to CRF36 . In contrast to the HIV-1 pandemic, HIV-2 is only prevalent in West Africa where it seems to have been present since the 1940s . In Europe infection with HIV-2 remains rare (2–3% of all AIDS cases), being observed mainly in France and Portugal , , . Eight different HIV-2 groups named A through H have been reported but only groups A and B cause human epidemics , , , . Isolates from group A are, however, responsible for the vast majority of HIV-2 infections worldwide .
For reasons that are still not clear, HIV-1 and HIV-2 infections lead to very different immunological and clinical outcomes. In contrast to HIV-1 infected patients, the majority of HIV-2-infected individuals have reduced general immune activation, normal CD4+ T cell counts, low or absent viremia and absence of clinical disease , , , . This may be related with a more effective immune response produced against HIV-2. In fact, most HIV-2 infected individuals have strong cytotoxic responses to Env and Gag proteins and raise autologous and heterologous neutralizing antibodies , , , , . The attenuated course of HIV-2 infection compared to HIV-1 has also been associated to a lower state of immune activation, which may be related to the immunosuppressive activity of the C2-V3-C3 envelope region , , . Similar immunosuppressive activity has not been found in the homologous C2-V3-C3 region in the HIV-1 envelope . Finally, the transmission rate of HIV-2 is also significantly lower than that of HIV-1 and this has been associated with the low or absent viremia found in most HIV-2 patients , .
The HIV-1 Env glycoprotein is a trimer on the virion surface with extensive N-linked glycosylation that effectively shields many conserved epitopes from antibody recognition . It is composed of trimers of a surface (SU) glycoprotein with a molecular weight of 120–125 kDa (gp120–125) that is bound to a transmembrane (TM) glycoprotein with 36–41 kDa (gp36–41). SU can be divided into five hipervariable regions, named V1 to V5, bordered by five conserved regions, named C1 to C5. The C2 and C3 regions associate to form the CD4 binding site such that mutations in amino acid at positions 267Q in C2 and 368R in C3 abrogate gp120 binding to CD4 , . In HIV-1, V3 is one of the most important determinants of viral tropism and co-receptor usage , . This region also contains major antigenic and neutralizing epitopes in HIV-1 which are well exposed upon CD4-binding , , , , , , . Although still debatable, the V3 region in HIV-2 may also contain broadly neutralizing epitopes , , , , , , . However, in contrast to HIV-1, the V3 and flanking C2 and C3 regions are not immunodominant in HIV-2 infected patients , , , . Moreover, it remains to be determined whether these regions are exposed or concealed in the envelope complex of primary isolates of HIV-2.
In HIV-1 infection escape from antibody neutralization occurs frequently and is the major driving force of the molecular evolution of the envelope glycoproteins , . Not surprisingly, codons under diversifying selection (positive selection) seem to be clustered mostly in the hypervariable V1/V2 and V3 regions that contain important and accessible neutralizing targets , . The impact of the neutralizing antibody response in the in vivo evolution of the HIV-2 Env is currently unknown.
The present study was designed to identify molecular and evolutionary features of the C2, V3 and C3 regions in HIV-1 and HIV-2 infected patients that could be related with their different immunological and clinical outcomes. We describe some potentially important differences in the genetic constitution, molecular evolution and conformation of the C2, V3 and C3 regions in HIV-1 and HIV-2 that provide new insights into their function and may inform the design of HIV vaccines.
HIV-1 is significantly more variable in the envelope C2, V3 and C3 regions than HIV-2
We compared the inter-patient genetic diversity of HIV-1 and HIV-2 in two different datasets: HIV-1 group M (all subtypes) and HIV-2 group A sequences from all over the world (Control dataset composed of reference sequences) and newly derived HIV-1 and HIV-2 sequences obtained from Portuguese (PT) patients. Phylogenetic analysis showed that HIV-1 sequences circulating in Portugal belong to different subtypes and recombinant forms (Figure S1A). Forty five sequences were subtype B and six belonged to the recombinant form CRF14_BG. Subtypes G (4 sequences) and C (2), sub-subtype F1 (2), and CRF02_AG (1) were also found. Regarding HIV-2, all sequences from Portugal clustered together within group A (Figure S1B). Collectively, these results are consistent with previous studies showing a highly complex HIV epidemics in Portugal caused exclusively by HIV-2 group A and different subtypes of HIV-1 group M , , , . Nucleotide diversity between HIV-1 viruses found in Portugal was significantly higher compared to HIV-2 (mean number of substitutions per site, 0.336, 95%CI [0.329; 0.342] vs 0.239, [0.236; 0.243], P<0.0001). Similar results were found for the HIV-1 and HIV-2 Control datasets (Table S1). Hence, we conclude that HIV-1 is genetically more diverse than HIV-2 in the envelope region comprising C2, V3 and C3.
Amino acid diversity in the C2, V3 and C3 regions of HIV-1 and HIV-2 were compared by calculating Shannon's entropy . Mean entropy values for the three regions were significantly higher in HIV-1 than in HIV-2 both in PT (0.794 vs 0.409, P<0.0001) and Control datasets (0.702 vs 0.353, P<0.0001) confirming that these regions are more variable in HIV-1 than in HIV-2. Entropy was also significantly higher in HIV-1 than in HIV-2 in each separate region (C2, P<0.05; V3, P<0.005; C3, P<0.0005) of PT sequences. The region with higher mean entropy was C3 in both viruses (1.031, 95%CI [0.845, 1.217] for HIV-1 vs 0.534, 95%CI [0.378, 0.689] for HIV-2, P<0.0005) followed by V3 (0.674, [0.506, 0.841]) and C2 (0.574, [0.427, 0.721]) in HIV-1 and C2 (0.326, [0.175, 0.477]) and V3 (0.304, [0.176, 0.433]) in HIV-2 (Figure 1). Comparable results were obtained for the Control datasets but in this case V3 was the least entropic region both in HIV-1 and HIV-2 (Table S1 and Figure S2). Not surprisingly, amino acids with higher entropy (values above 1) were primarily located in the C3 region of both viruses and there were more highly entropic amino acids in C3 in HIV-1 than in HIV-2 both in the PT and Control datasets (PT dataset: 51.9% in HIV-1 vs 24.5% in HIV-2; Control dataset: 35.3% in HIV-1 vs 20.8% in HIV-2). Notably, the amino acids in V3 that are related with co-receptor usage, positions 11/25 in HIV-1 (codons 306/320) ,  and possibly positions 18/19/27 in HIV-2 (codons 319/320/328) , , had a high entropy score in both viruses.
(A) HIV-1 alignment (PT dataset), sites were numbered according to codon env position of HIV-1 HXB2 reference strain; (B) HIV-2 alignment (PT dataset), sites were numbered according to codon env position of HIV-2 ALI reference strain.
The mean number of potential N-linked glycosylation sites both in HIV-1 and HIV-2 sequences from Portugal was 7 (range: 4–9 in HIV-1; 5–9 in HIV-2). The most conserved glycosylation sites were located in C2 in both viruses (Figure 2). Nonetheless, in this region, there were four highly conserved glycosylation sites in HIV-2 (present in ≥80% of strains) and only two such sites in HIV-1. With the exception of the highly conserved site located in the beginning of C3 in HIV-1, glycosylation sites found in C3 varied from strain to strain in number and location, this being more evident in HIV-1 than in HIV-2. In V3 there were two highly conserved glycosylation sites in both viruses. Similar observations were made for HIV-1 and HIV-2 sequences in the Control datasets (Table S1 and Figure S3).
(A) HIV-1 alignment (PT dataset), sites were numbered according to codon env position of HIV-1 HXB2 reference strain; (B) HIV-2 alignment (PT dataset), sites were numbered according to codon env position of HIV-2 ALI reference strain.
Selective pressures act differently in HIV-1 and HIV-2
We have recently found that HIV-2 displays a faster evolutionary rate in the envelope gp125 and C2-V3-C3 region than HIV-1 in patients with chronic and advanced disease , . The faster evolutionary rate in HIV-2 was more pronounced in synonymous sites than in non-synonymous sites suggesting a weaker positive selection in HIV-2 than in HIV-1. To investigate this possibility, we analysed diversifying selection in the C2-V3-C3 region of both viruses using codon-based models of molecular evolution. Firstly, we estimated the ratio of non-synonymous and synonymous substitution rates (dN/dS ratio) averaged over all sites. For HIV-1 sequences from Portugal dN/dS ratio was 0.703, 95%CI [0.668, 0.740]; for HIV-2 it was 0.451, [0.419, 0.484]. Similar values were obtained for the Control alignments (Table S1). These results are consistent with the higher degree of genetic conservation of the C2, V3 and C3 regions in HIV-2.
Site-by-site analysis revealed that diversifying selection is unevenly distributed along the studied region between the two viruses (PT, P<0.001; Controls, P<0.001) (Figures 3 and S4). For HIV-2 sequences from the PT dataset, there were between 7 and 9 positively selected (PS) sites depending on the method that was used (SLAC/FEL/REL) while for HIV-1 the number of sites ranged from 7 to 17 (Table 1). Taking into account only sites that were selected by at least two methods, HIV-2 had a total of 7 PS sites whereas in HIV-1 there were 9 sites. The sites were distributed as follows: in C2 there were 3 sites in HIV-2 and 2 in HIV-1; in V3 there were 2 sites in HIV-1, and no sites in HIV-2; in C3 there were 4 sites in HIV-2 and 5 in HIV-1, including one codon within the CD4 binding site (codon 378 in HIV-1) and two in the α2-helix (codons 343 and 346) . In Control data sets the number of PS sites was slightly lower but they were similarly distributed, with the exception of the V3: 1 PS site in HIV-2, but no sites in HIV-1 (Tables S1 and S2). Importantly, we found that when compared to HIV-1, positive selection was stronger in HIV-2 in most sites (Tables 1 and S2).
dN-dS values were estimated by FEL and scaled by the total codon tree length. (A) HIV-1 alignment (PT dataset), sites were numbered according to codon env position of HIV-1 HXB2 reference strain; (B) HIV-2 alignment (PT dataset), sites were numbered according to codon env position of HIV-2 ALI reference strain.
The comparison of diversifying selection between terminal and internal branches of the phylogenetic trees revealed two distinct profiles for HIV-1 and HIV-2. Firstly, non-synonymous substitution rates were significantly different between the internal nodes and the tips of the tree in all datasets: PT, P = 0.002 for HIV-2 and P = 0.011 for HIV-1; Controls, P<0.001 and P = 0.004 (data not shown). Stronger selection was in general found at codons selected simultaneously at the tips and the external branches of the HIV-1 and HIV-2 trees. Importantly, however, only 2 of the 7 sites (29%) detected in terminal branches of PT HIV-2 tree were also under positive selection along the internal branches (codons 267 and 270 in C2). In contrast, for HIV-1 most positively selected sites (6/9, 67%) were present both in the internal and the terminal branches. In Control datasets these percentages were 43% for HIV-2 and 71% for HIV-1 (Table S2). These results suggest that natural selection affects less the transmission fitness of HIV-1 than HIV-2.
Structure and solvent accessibility of V3 differ in HIV-1 and HIV-2
A model of the structure of the C2-V3-C3 region was built for HIV-1 and HIV-2 based on the atomic coordinates of the HIV-1 gp120 and SIV gp120 using consensus sequences from both the PT and Control HIV-1 and HIV-2 alignments. For HIV-1, the structures of PT and Control sequences were almost identical having only a slight difference in V3, which presents less regular secondary structure in the PT sequence (Figure S5). For HIV-2, the structures of PT and Control sequences were identical. The structure of the C2-V3-C3 region was however markedly different between HIV-1 and HIV-2, the most striking differences being the significant retraction of the V3 loop in HIV-2 and its potential interaction both with C2 and C3 (Figure 4A). Identical results were obtained when comparing the HIV-1 and HIV-2 control sequences (Figure S6). The predicted non-covalent interaction between V3, C3 and C3 in HIV-2 involves residues Tyr296 and His301 in C2 binding, respectively, to Arg331 and Trp334 in V3, and Phe337 in C3 binding to Phe321 in V3 (Figure 4B).
The conformational structure of consensus amino acid sequences derived from the PT datasets was obtained by homology modeling as indicated in Material and Methods. In the schematics, C2 is shown in red, V3 in yellow and C3 in blue. (A) Balls represent the amino acids under positive selection. The red balls represent codons selected simultaneously by SLAC, FEL and REL methods, while green balls stand for codons selected by at least two of these methods; (B) Model structure showing the predicted interactions between V3, C2 and C3 in HIV-2 gp125. The non-covalent interaction involves residues Tyr296 and His301 in C2 binding, respectively, to Arg331 and Trp334 in V3, and Phe337 in C3 binding to Phe321 in V3.
The solvent accessibilities of amino acid residues were also calculated for both models (Figure 5). As expected, both in HIV-1 and HIV-2 most PS sites and N-glycans had at least 50% surface exposure. In HIV-2, 8 out of 37 (22%) amino acids in C2, 8/34 (24%) in V3 and 19/53 (36%) in C3 were highly exposed (≥70% solvent accessibility) whereas in HIV-1 these were 9/37 (24%), 15/35 (43%) and 10/52 (19%), respectively. Consistent with the high exposure of the V3 region in HIV-1, the two amino acids at positions 306 and 320 involved in binding to co-receptors were well exposed (≥50% solvent accessibility). In contrast, in HIV-2, among amino acids 319/320 and 328 in V3 loop potentially involved in co-receptor binding, only 319 was relatively well exposed. Despite the potential interaction between V3 and C3 (Figure 4B), the overall exposition of C3 was higher in HIV-2 than in HIV-1. Thus, for instance, 42% (5/12) of the residues in C3 that may contribute for the formation of the CD4-binding site (positions 377–388) in HIV-2 showed high solvent accessibility. In HIV-1 only 3 out of 16 (19%) amino acids with similar function (positions 367–382) were highly exposed. Similar results were obtained when comparing the HIV-1 and HIV-2 control sequences (Figure S7).
(A) HIV-1 alignment (PT dataset), sites were numbered according to codon env position of HIV-1 HXB2 reference strain; (B) HIV-2 alignment (PT dataset), sites were numbered according to codon env position of HIV-2 ALI reference strain. Coloured bars represent the amino acids under positive selection and have the same colours (red and green) as the corresponding positions (balls) highlighted in Figure 4A. The dark blue stars over the bars correspond to potential N-glycosylation sites conserved along the alignment (present in ≥50% of strains), whereas the light blue stars represent sites only present in less than 50% of sequences.
To investigate the molecular and structural features underlying the differences in HIV-1 and HIV-2 biology and human infection, we have analysed the C2-V3-C3 envelope region from a significant number of HIV-1 and HIV-2 infected patients living in Portugal and worldwide. HIV-2 sequences from Portugal belonged to group A and the majority of HIV-1 sequences belonged to subtype B (75%) followed by subtypes G, C and F, CRF02_AG and CRF14_BG. Collectively, these results are consistent with previous studies showing a highly complex HIV epidemic in Portugal caused by HIV-2 group A and different subtypes of HIV-1 group M , , , , , , .
Genetic distances and amino acid diversity between HIV-1 viruses were significantly higher compared to HIV-2. This was surprising since at the individual level HIV-2 displays a similar  or even faster evolutionary rate than HIV-1 in the C2-V3-C3 region . The more pronounced evolutionary rate in synonymous sites than in non-synonymous sites in HIV-2  together with the rare escape of this virus from autologous neutralizing antibodies , suggested that the lower amino acid diversity in HIV-2 could be related with a weaker positive selection or even with negative selection . This was not the case however since most sites in C2 and C3 were under stronger positive selection in HIV-2 than in HIV-1. The C3 region of HIV-1 is antibody accessible  and is subject to diversifying selection because it is a major neutralizing target , , , . Therefore, the high level of positive selection detected in C3 together with its high solvent exposure strongly suggests that this region is also antibody accessible in HIV-2 and might be a major neutralizing domain.
Strength of selection was significantly different between internal and external branches of the HIV-1 and HIV-2 phylogenetic trees. This is expected in populations of highly variable RNA viruses and implies that non-synonymous substitutions can be highly deleterious , . In HIV-1, most of the codons selected in the tips of the tree were also under selection along the internal branches, indicating that adaptation in these sites is occurring at the host and population levels . In contrast, most adaptive mutations in HIV-2 were only found in the tips of the tree indicating that they are recent maladaptive substitutions that are transitory at the population level , . Thus, in contrast to HIV-1, diversification of C2 and C3 in HIV-2 seems to have a dominant negative effect on viral fitness and transmission. This data suggests that one possible consequence of the unexpectedly high evolutionary rate of HIV-2 at the patient level can be the frequent accumulation of deleterious mutations and production of defective viruses , , . A high frequency of defective viruses in HIV-2 infected individuals could explain the poor replication of this virus in vivo as well as its very low transmissibility.
Unlike in HIV-1, the V3 loop in HIV-2 always presented the lower amino acid diversity. This result might be a consequence of significant structural and conformational constraints due to its role in preventing chronic and disruptive immune activation  and in co-receptor binding . On the other hand, these results imply that the V3 loop is not well exposed in the HIV-2 envelope complex in vivo. Indeed, by computer modelling simulations we show that in HIV-2 the V3 loop is much less exposed than C2 and C3 and likely has a retractile conformation due to non-covalent interaction both with C2 and C3. In contrast, HIV-1 had, as previously found, an extended and highly accessible V3 loop , , . Such conformation is entirely consistent with its immunodominant and neutralizing nature and with its crucial role in HIV-1 co-receptor binding and tropism , , , , , . Conversely, the concealed nature of V3 in the HIV-2 envelope complex implies that this region may not be immunodominant in HIV-2 infection. Indeed, a significant number of HIV-2 patients do not raise antibodies against the V3 loop  or against a polypeptide comprising the C2, V3 and C3 regions . Thus, the occlusion of V3 in the HIV-2 envelope complex may prevent it from over immune recognition and associated sequence changes thereby preserving its crucial functions in viral entry. It has been shown that removal or antigenic dampening of the HIV-1 V3 loop redirects the neutralizing immune response to other epitopes of the Env protein that otherwise would be non-neutralizing or non-antibody responsive , , , . In this context, the occluded nature of the V3 region in the HIV-2 envelope complex might favour a more effective production of broadly neutralizing antibodies targeting other regions in gp125 such as the C2, V1, V2, V4 and C5 regions , , , .
In conclusion, the C2 and C3 regions are well exposed in the HIV-2 envelope complex and are under strong diversifying selection suggesting that, like in HIV-1, they may harbour neutralizing epitopes. However, extreme diversification of C2 and C3 in HIV-2 seems to be deleterious for the virus and prevent its transmission. On the other hand, V3 is highly conserved in HIV-2 and is concealed within the envelope complex, possibly due to a physical interaction with C2 and C3. In contrast, V3 is highly exposed and variable in HIV-1 which is consistent with its immunodominant and neutralizing properties. Collectively, we identify significant structural and functional constrains to the diversification and evolution of C2, V3 and C3 in the HIV-2 envelope but not in HIV-1. These studies highlight fundamental differences in the biology and infection of HIV-1 and HIV-2 and in their mode of interaction with the human immune system and may inform new vaccine and therapeutic interventions against these viruses.
Materials and Methods
Amplification, cloning and sequencing of HIV-1 and HIV-2 viruses from Portugal
Portuguese (PT) samples were collected from HIV infected patients, followed in hospitals in the North and South of Portugal and presenting different clinical stages of infection and CD4+ T-cell counts. HIV-2 samples were collected between 1997 and 2005 from 49 patients, some of whom were infected in late-1970s , . HIV-1 samples were collected from 60 patients between 1993 and 1998.
Proviral DNA was extracted from uncultured PBMCs, or viral genomic RNA was extracted from plasma and reverse transcribed. A nested PCR technique was used to amplify a 373 bp HIV-2 C2-V3-C3 env gene region and a 409 pb HIV-1 C2-V3-C3 env region as described elsewhere , . PCR products were sequenced using the BigDye Terminator Cycle sequencing kit (Applied Biosystems) and an automated capillary sequencer (ABI PRISM 310, Applied Biosystems). Newly derived HIV-1 sequences from Portugal have been assigned GenBank accession numbers: EU335962 - EU335903. Newly derived HIV-2 sequences from Portugal have been assigned GenBank accession numbers: AY913773-AY913794, AY649545-AY649554 and GU591163.
Additionally, 16 HIV-2 consensus sequences from a previous publication  were also included in this study. The samples used to obtain these consensus sequences were: 03PTHCC1, 03PTHCC2, 03PTHCC4, 03PTHCC5, 03PTHCC7, 03PTHCC8, 03PTHCC12, 05PTHCC13, 03PTHCC14, 03PTHCC17, 03PTHCC19, 03PTHSM2, 05PTHSM3, 03PTHSM7, 03PTHSM9 and 03PTHSM10.
As Control datasets to this study, HIV-1 group M (all subtypes) reference sequence alignment (94 sequences) was obtained from the Los Alamos HIV database (http://www.hiv.lanl.gov/). HIV-2 group A reference sequence alignment was also obtained from the Los Alamos HIV database. Additional C2-V3-C3 sequences derived from group A primary isolates were retrieved from the Los Alamos Database adding to a total of 59 HIV-2 Control sequences. Both control alignments are available as supplementary information (Alignment S1 and S2).
Molecular and phylogenetic analysis
Nucleotide sequences were aligned using ClustalX 1.8 . Maximum likelihood analyses were performed using the best-fit models of molecular evolution estimated by Modeltest . These were GTR+G+I  for the PT HIV-2 dataset and TVM+G+I for PT HIV-1 and for HIV-1 and HIV-2 Control datasets .
Evolutionary distances were estimated under these models using PAUP version 4.0 . Tree searches were also conducted in PAUP version 4.0 using either nearest-neighbor interchange (NNI) or subtree pruning-regrafting (SPR) heuristic strategies, with bootstrap resampling. All positions containing gaps and missing data were eliminated from the dataset. In the final datasets there were a total of 369 nucleotide positions in PT HIV-2 and 372 positions in PT HIV-1 alignments, and 369 positions in HIV-2 and HIV-1 Control alignments. Both alignments were tested for recombination with the Single Breakpoint Recombination (SBP) tool  in the DATAMONKEY web-server ; evidence for recombination, inferred by the small sample AIC score, was only found for HIV-1 Control dataset. Thus, when appropriate, a multiple partition dataset was used for HIV-1 Control analysis. Detection of N-linked glycosylation sites was performed with Glycosite . The entropy at each position in protein alignment was measured with Shannon's entropy .
Tests for codon selection
Selection pressures over the HIV-1 and HIV-2 C2-V3-C3 regions were examined with the HYPHY software package  and the DATAMONKEY web-server . All estimations were performed using the MG94 codon substitution model  crossed with the nucleotide substitution model previously selected with Modeltest, GTR for PT HIV-2 and TVM for PT HIV-1 and Control alignments. To understand if selection pressure within a host is different from selection for transmission among hosts, non-synonymous substitutions were compared between terminal and internal branches of the phylogenetic tree, with the TestBranchDNDS.bf batch file in HyPHy, as described elsewhere .
Four different approaches were used to identify codons under selection: single-likelihood ancestor counting (SLAC), fixed-effects likelihood (FEL), internal fixed effects likelihood (IFEL) and relaxed-effects likelihood (REL) methods , . While SLAC, FEL and REL detect sites under selection at the external branches of the phylogenetic three, IFEL identifies such sites only along the internal branches. To classify a site as positively or negatively selected the cut-off P-value was 10% for SLAC, FEL and IFEL. For REL, codons under selection were detected with a cut-off value for the Bayes factor of 50. Since SLAC, FEL and IFEL can estimate site-specific ratios of non-synonymous and synonymous substitutions rates (dN/dS ratios) as undefined or infinite due to dS = 0, we reported dN-dS values instead, which were scaled by the total codon tree length to allow a better comparison between the two datasets. A multiple partition dataset was used for the identification of codons under selection in HIV-1 Control analysis. Site-by-site variation of synonymous substitution rates can bias estimations of codon's diversifying selection . Although all four methods described above model for this variation, variation of synonymous rates from codon to codon in each dataset was tested with the dNdSRateAnalysis.bf batch file in HyPHy, as described elsewhere . Finally, comparison of the dN/dS distribution rates and the strength of selection between the HIV-1 and HIV-2 alignments, was performed with dNdSDistributionComparison.bf batch file also in HyPHy, as described elsewhere .
Molecular modelling and calculation of solvent accessible surfaces
Consensus amino acid sequences were derived for the different HIV-1 and HIV-2 datasets. Structural models of HIV-1 and HIV-2 C2-V3-C3 were produced with SWISS-MODEL homology modelling server in project mode resorting to Swiss-Pdb Viewer (DeepView) version 4.0, using PDB file 2B4C (from HIV-1 JR-FL gp120) for HIV-1, and PDB file 2BF1 (from SIV gp120) for HIV-2 as templates , , , . Accelrys Discovery Studio Visualizer 2.5  was used to produce three dimensional images of the models obtained. Solvent accessible surface area in Å2 was calculated by Gerstein's calc-surface software on UCSF Chimera ,  with a probe size of 1.4 Å. All atoms in the input PDB file were included in the calculation. The solvent accessible surface data was normalized dividing each amino acid residue solvent accessible surface value added by the solvent accessible surface value of the corresponding amino acid residue (X) in the tripeptide Gly-X-Gly. The inter-chain H-Bonds formed by HIV-2 V3 with C2 and C3 were calculated with H-Bond Finder software on UCSF Chimera ,  with a probe size of 1.4 Å. All atoms in the input PDB file were included in the calculation.
Statistical analyses were performed using GraphPad Prism version 4.0c for Macintosh (GraphPad Software, 2005, San Diego, California, USA, www.graphpad.com) with a level of significance of 5%. Non-parametric Mann-Whitney U test was used to compare Shannon's entropy values and nucleotide distances.
Genotyping HIV-1(A) and HIV-2 (B) by maximum-likelihood phylogenetic analysis. The phylogenetic trees were constructed using the SPR heuristic search strategy and 1000 bootstrap replications, with reference sequences from HIV-1, under the TVM+G+I evolutionary model (A) and with reference sequences from HIV-2, under the GTR+G+I evolutionary model (B). The bootstrap values (above 50%) supporting the internal branches are shown. The scale bar represents evolutionary distances in substitutions per site.
(0.21 MB PDF)
Shannon's entropy of individual amino acids in the C2, V3 and C3 envelope regions in HIV-1 and HIV-2. (A) HIV-1 alignment (Control dataset), sites were numbered according to codon env position of HIV-1 HXB2 reference strain; (B) HIV-2 alignment (Control dataset), sites were numbered according to codon env position of HIV-2 ALI reference strain.
(0.96 MB TIF)
Frequency of N-glycosylation sites in the C2, V3 and C3 envelope regions in HIV-1 and HIV-2. (A) HIV-1 alignment (Control dataset). Sites were numbered according to codon env position of HIV-1 HXB2 reference strain. (B) HIV-2 alignment (Control dataset). Sites were numbered according to codon env position of HIV-2 ALI reference strain.
(0.58 MB TIF)
Positive selection in the C2, V3 and C3 envelope regions in HIV-1 and HIV-2. dN-dS values were estimated by FEL and scaled by the total codon tree length. (A) HIV-1 alignment (Control dataset). Sites were numbered according to codon env position of HIV-1 HXB2 reference strain. (B) HIV-2 alignment (Control dataset). Sites were numbered according to codon env position of HIV-2 ALI reference strain.
(0.53 MB TIF)
Superimposition of the conformational structures generated by homology modelling of Portuguese and Control C2, V3 and C3 regions of HIV-1 and HIV-2. In the schematics, Portuguese structures are represented in red, and Control structures are in blue.
(0.78 MB TIF)
Conformational structure of C2, V3 and C3 envelope regions in HIV-1 and HIV-2. The conformational structure of consensus amino acid sequences derived from the Control datasets was obtained by homology modeling as indicated in “Materials and Methods.” In the schematics, C2 is shown in red, V3 in yellow, and C3 in blue. Balls represent the amino acids under positive selection. (A) The red balls represent codons selected simultaneously by SLAC, FEL and REL methods, while green balls stand for codons selected by at least two of these methods. (B) Model structure showing the predicted interactions between V3, C2 and C3 in HIV-2 gp125. The non-covalent interaction involves residues Tyr296 and His301 in C2 binding, respectively, to Arg331 and Trp334 in V3, and Phe337 in C3 binding to Phe321 in V3.
(0.88 MB TIF)
Solvent accessible surface area, positive selection and potential N-glycosylation sites in C2-V3-C3 region. (A) HIV-1 alignment (Control dataset). Sites were numbered according to codon env position of HIV-1 HXB2 reference strain. (B) HIV-2 alignment (Control dataset). Sites were numbered according to codon env position of HIV-2 ALI reference strain. Coloured bars represent the amino acids under positive selection and have the same colours (red and green) as the corresponding positions (balls) highlighted in Figure S6. The dark blue stars over the bars correspond to potential N-glycosylation sites conserved along the alignment (present in ≥50% of strains), whereas the light blue stars represent sites only present in less than 50% of sequences.
(1.60 MB TIF)
Summary of results for phylogenetic, codon selection and solvent accessibility analysis for C2, V3 and C3 regions of HIV-1 and HIV-2 Control datasets.
(0.04 MB DOC)
Positively selected sites detected by SLAC, FEL, REL and/or IFEL in Control HIV-1 and HIV-2 env C2, V3 and C3 regions.
(0.10 MB DOC)
Alignment of HIV-1 reference sequences used as a Control for the Portuguese HIV-1 dataset. Each sequence is identified by the corresponding GenBank accession number.
(0.04 MB TXT)
Conceived and designed the experiments: NT. Performed the experiments: HB PB IB JM. Analyzed the data: HB PB CF AQ NT. Wrote the paper: HB PB NT.
- 1. Zhang M, Foley B, Schultz AK, Macke JP, Bulla I, et al. (2010) The role of recombination in the emergence of a complex and dynamic HIV epidemic. Retrovirology 7: 25.
- 2. Lemey P, Pybus OG, Wang B, Saksena NK, Salemi M, et al. (2003) Tracing the origin and history of the HIV-2 epidemic. Proc Natl Acad Sci U S A 100: 6588–6592.
- 3. de Silva TI, Cotten M, Rowland-Jones SL (2008) HIV-2: the forgotten AIDS virus. Trends Microbiol 16: 588–595.
- 4. Semaille C, Barin F, Cazein F, Pillonel J, Lot F, et al. (2007) Monitoring the dynamics of the HIV epidemic using assays for recent infection and serotyping among new HIV diagnoses: experience after 2 years in France. J Infect Dis 196: 377–383.
- 5. Valadas E, Franca L, Sousa S, Antunes F (2009) 20 years of HIV-2 infection in Portugal: trends and changes in epidemiology. Clin Infect Dis 48: 1166–1167.
- 6. Gao F, Yue L, Robertson DL, Hill SC, Hui H, et al. (1994) Genetic diversity of human immunodeficiency virus type 2: evidence for distinct sequence subtypes with differences in virus biology. J Virol 68: 7433–7447.
- 7. Chen Z, Luckay A, Sodora DL, Telfer P, Reed P, et al. (1997) Human immunodeficiency virus type 2 (HIV-2) seroprevalence and characterization of a distinct HIV-2 genetic subtype from the natural range of simian immunodeficiency virus-infected sooty mangabeys. J Virol 71: 3953–3960.
- 8. Yamaguchi J, Devare SG, Brennan CA (2000) Identification of a new HIV-2 subtype based on phylogenetic analysis of full-length genomic sequence. AIDS Res Hum Retroviruses 16: 925–930.
- 9. Damond F, Worobey M, Campa P, Farfara I, Colin G, et al. (2004) Identification of a highly divergent HIV type 2 and proposal for a change in HIV type 2 classification. AIDS Res Hum Retroviruses 20: 666–672.
- 10. Rowland-Jones S (2006) Protective immunity against HIV infection: lessons from HIV-2 infection. Future Microbiol 1: 427–433.
- 11. Berry N, Ariyoshi K, Jaffar S, Sabally S, Corrah T, et al. (1998) Low peripheral blood viral HIV-2 RNA in individuals with high CD4 percentage differentiates HIV-2 from HIV-1 infection. J Hum Virol 1: 457–468.
- 12. Soares R, Foxall R, Albuquerque A, Cortesao C, Garcia M, et al. (2006) Increased frequency of circulating CCR5+ CD4+ T cells in human immunodeficiency virus type 2 infection. J Virol 80: 12425–12429.
- 13. Marlink R, Kanki P, Thior I, Travers K, Eisen G, et al. (1994) Reduced rate of disease development after HIV-2 infection as compared to HIV-1. Science 265: 1587–1590.
- 14. Drylewicz J, Matheron S, Lazaro E, Damond F, Bonnet F, et al. (2008) Comparison of viro-immunological marker changes between HIV-1 and HIV-2-infected patients in France. AIDS 22: 457–468.
- 15. Rodriguez SK, Sarr AD, MacNeil A, Thakore-Meloni S, Gueye-Ndiaye A, et al. (2007) Comparison of heterologous neutralizing antibody responses of human immunodeficiency virus type 1 (HIV-1)- and HIV-2-infected Senegalese patients: distinct patterns of breadth and magnitude distinguish HIV-1 and HIV-2 infections. J Virol 81: 5331–5338.
- 16. Lizeng Q, Skott P, Sourial S, Nilsson C, Andersson SS, et al. (2003) Serum immunoglobulin A (IgA)-mediated immunity in human immunodeficiency virus type 2 (HIV-2) infection. Virology 308: 225–232.
- 17. Shi Y, Brandin E, Vincic E, Jansson M, Blaxhult A, et al. (2005) Evolution of human immunodeficiency virus type 2 coreceptor usage, autologous neutralization, envelope sequence and glycosylation. J Gen Virol 86: 3385–3396.
- 18. Bjorling E, Scarlatti G, von Gegerfelt A, Albert J, Biberfeld G, et al. (1993) Autologous neutralizing antibodies prevail in HIV-2 but not in HIV-1 infection. Virology 193: 528–530.
- 19. Cavaleiro R, Sousa AE, Loureiro A, Victorino RM (2000) Marked immunosuppressive effects of the HIV-2 envelope protein in spite of the lower HIV-2 pathogenicity. AIDS 14: 2679–2686.
- 20. Cavaleiro R, Brunn GJ, Albuquerque AS, Victorino RM, Platt JL, et al. (2007) Monocyte-mediated T cell suppression by HIV-2 envelope proteins. Eur J Immunol 37: 3435–3444.
- 21. Grossman Z, Meier-Schellersheim M, Sousa AE, Victorino RM, Paul WE (2002) CD4+ T-cell depletion in HIV infection: are we closer to understanding the cause? Nat Med 8: 319–323.
- 22. Blaak H, van der Ende ME, Boers PH, Schuitemaker H, Osterhaus AD (2006) In vitro replication capacity of HIV-2 variants from long-term aviremic individuals. Virology 353: 144–154.
- 23. MacNeil A, Sarr AD, Sankale JL, Meloni ST, Mboup S, et al. (2007) Direct evidence of lower viral replication rates in vivo in human immunodeficiency virus type 2 (HIV-2) infection than in HIV-1 infection. J Virol 81: 5325–5330.
- 24. Wyatt R, Kwong PD, Desjardins E, Sweet RW, Robinson J, et al. (1998) The antigenic structure of the HIV gp120 envelope glycoprotein. Nature 393: 705–711.
- 25. Douagi I, Forsell MN, Sundling C, O'Dell S, Feng Y, et al. (2009) Influence of Novel CD4 binding-defective HIV-1 Envelope Glycoprotein Immunogens on Neutralizing Antibody and T Cell Responses in Non-human Primates. J Virol.
- 26. Moore JP, Willey RL, Lewis GK, Robinson J, Sodroski J (1994) Immunological evidence for interactions between the first, second, and fifth conserved domains of the gp120 surface glycoprotein of human immunodeficiency virus type 1. J Virol 68: 6836–6847.
- 27. Hwang SS, Boyle TJ, Lyerly HK, Cullen BR (1991) Identification of the envelope V3 loop as the primary determinant of cell tropism in HIV-1. Science 253: 71–74.
- 28. Rizzuto CD, Wyatt R, Hernandez-Ramos N, Sun Y, Kwong PD, et al. (1998) A conserved HIV gp120 glycoprotein structure involved in chemokine receptor binding. Science 280: 1949–1953.
- 29. Javaherian K, Langlois AJ, McDanal C, Ross KL, Eckler LI, et al. (1989) Principal neutralizing domain of the human immunodeficiency virus type 1 envelope protein. Proc Natl Acad Sci U S A 86: 6768–6772.
- 30. Javaherian K, Langlois AJ, LaRosa GJ, Profy AT, Bolognesi DP, et al. (1990) Broadly neutralizing antibodies elicited by the hypervariable neutralizing determinant of HIV-1. Science 250: 1590–1593.
- 31. Krachmarov CP, Honnen WJ, Kayman SC, Gorny MK, Zolla-Pazner S, et al. (2006) Factors determining the breadth and potency of neutralization by V3-specific human monoclonal antibodies derived from subjects infected with clade A or clade B strains of human immunodeficiency virus type 1. Journal of Virology 80: 7127–7135.
- 32. Davis KL, Bibollet-Ruche F, Li H, Decker JM, Kutsch O, et al. (2009) Human immunodeficiency virus type 2 (HIV-2)/HIV-1 envelope chimeras detect high titers of broadly reactive HIV-1 V3-specific antibodies in human plasma. J Virol 83: 1240–1259.
- 33. Forsell MN, Schief WR, Wyatt RT (2009) Immunogenicity of HIV-1 envelope glycoprotein oligomers. Curr Opin HIV AIDS 4: 380–387.
- 34. Scheid JF, Mouquet H, Feldhahn N, Seaman MS, Velinzon K, et al. (2009) Broad diversity of neutralizing antibodies isolated from memory B cells in HIV-infected individuals. Nature 458: 636–640.
- 35. Walker LM, Phogat SK, Chan-Hui PY, Wagner D, Phung P, et al. (2009) Broad and potent neutralizing antibodies from an African donor reveal a new HIV-1 vaccine target. Science 326: 285–289.
- 36. Traincard F, Rey-Cuille MA, Huon I, Dartevelle S, Mazie JC, et al. (1994) Characterization of monoclonal antibodies to human immunodeficiency virus type 2 envelope glycoproteins. AIDS Res Hum Retroviruses 10: 1659–1667.
- 37. Bjorling E, Broliden K, Bernardi D, Utter G, Thorstensson R, et al. (1991) Hyperimmune antisera against synthetic peptides representing the glycoprotein of human immunodeficiency virus type 2 can mediate neutralization and antibody-dependent cytotoxic activity. Proc Natl Acad Sci U S A 88: 6082–6086.
- 38. Bjorling E, Chiodi F, Utter G, Norrby E (1994) Two neutralizing domains in the V3 region in the envelope glycoprotein gp125 of HIV type 2. J Immunol 152: 1952–1959.
- 39. McKnight A, Shotton C, Cordell J, Jones I, Simmons G, et al. (1996) Location, exposure, and conservation of neutralizing and nonneutralizing epitopes on human immunodeficiency virus type 2 SU glycoprotein. J Virol 70: 4598–4606.
- 40. Morner A, Achour A, Norin M, Thorstensson R, Bjorling E (1999) Fine characterization of a V3-region neutralizing epitope in human immunodeficiency virus type 2. Virus Res 59: 49–60.
- 41. Matsushita S, Matsumi S, Yoshimura K, Morikita T, Murakami T, et al. (1995) Neutralizing monoclonal antibodies against human immunodeficiency virus type 2 gp120. J Virol 69: 3333–3340.
- 42. Marcelino JM, Borrego P, Rocha C, Barroso H, Quintas A, et al. (2010) Potent and broadly reactive HIV-2 neutralizing antibodies elicited by a vaccinia virus vector prime-C2V3C3 polypeptide boost immunization strategy. J Virol 84: 12429–12436.
- 43. Plantier JC, Damond F, Souquieres S, Brun-Vezinet F, Simon F, et al. (2001) V3 serological subtyping of human immunodeficiency virus type 2 infection is not relevant. J Clin Microbiol 39: 3803–3807.
- 44. Marcelino JM, Nilsson C, Barroso H, Gomes P, Borrego P, et al. (2008) Envelope-specific antibody response in HIV-2 infection: C2V3C3-specific IgG response is associated with disease progression. AIDS 22: 2257–2265.
- 45. Marcelino JM, Barroso H, Goncalves F, Silva SM, Novo C, et al. (2006) Use of a new dual-antigen enzyme-linked immunosorbent assay to detect and characterize the human antibody response to the human immunodeficiency virus type 2 envelope gp125 and gp36 glycoproteins. J Clin Microbiol 44: 607–611.
- 46. Huang ML, Essex M, Lee TH (1991) Localization of immunogenic domains in the human immunodeficiency virus type 2 envelope. J Virol 65: 5073–5079.
- 47. Frost SD, Wrin T, Smith DM, Kosakovsky Pond SL, Liu Y, et al. (2005) Neutralizing antibody responses drive the evolution of human immunodeficiency virus type 1 envelope during recent HIV infection. Proc Natl Acad Sci U S A 102: 18514–18519.
- 48. Lemey P, Rambaut A, Pybus OG (2006) HIV evolutionary dynamics within and among hosts. AIDS Rev 8: 125–140.
- 49. Liang B, Luo M, Ball TB, Plummer FA (2007) QUASI analysis of the HIV-1 envelope sequences in the Los Alamos National Laboratory HIV sequence database: pattern and distribution of positive selection sites and their frequencies over years. Biochem Cell Biol 85: 259–264.
- 50. Canducci F, Marinozzi MC, Sampaolo M, Berre S, Bagnarelli P, et al. (2009) Dynamic features of the selective pressure on the human immunodeficiency virus type 1 (HIV-1) gp120 CD4-binding site in a group of long term non progressor (LTNP) subjects. Retrovirology 6: 4.
- 51. Antunes R, Figueiredo S, Bartolo I, Pinheiro M, Rosado L, et al. (2003) Evaluation of the clinical sensitivities of three viral load assays with plasma samples from a pediatric population predominantly infected with human immunodeficiency virus type 1 subtype G and BG recombinant forms. J Clin Microbiol 41: 3361–3367.
- 52. Borrego P, Marcelino JM, Rocha C, Doroana M, Antunes F, et al. (2008) The role of the humoral immune response in the molecular evolution of the envelope C2, V3 and C3 regions in chronically HIV-2 infected patients. Retrovirology 5: 78.
- 53. Esteves A, Parreira R, Venenno T, Franco M, Piedade J, et al. (2002) Molecular epidemiology of HIV type 1 infection in Portugal: high prevalence of non-B subtypes. AIDS Res Hum Retroviruses 18: 313–325.
- 54. Palma AC, Araujo F, Duque V, Borges F, Paixao MT, et al. (2007) Molecular epidemiology and prevalence of drug resistance-associated mutations in newly diagnosed HIV-1 patients in Portugal. Infect Genet Evol 7: 391–398.
- 55. Korber BT, Kunstman KJ, Patterson BK, Furtado M, McEvilly MM, et al. (1994) Genetic differences between blood- and brain-derived viral sequences from human immunodeficiency virus type 1-infected patients: evidence of conserved elements in the V3 region of the envelope protein of brain-derived sequences. J Virol 68: 7467–7481.
- 56. De Jong JJ, De Ronde A, Keulen W, Tersmette M, Goudsmit J (1992) Minimal requirements for the human immunodeficiency virus type 1 V3 domain to support the syncytium-inducing phenotype: analysis by single amino acid substitution. J Virol 66: 6777–6780.
- 57. Resch W, Hoffman N, Swanstrom R (2001) Improved success of phenotype prediction of the human immunodeficiency virus type 1 from envelope variable loop 3 sequence using neural networks. Virology 288: 51–62.
- 58. Isaka Y, Sato A, Miki S, Kawauchi S, Sakaida H, et al. (1999) Small amino acid changes in the V3 loop of human immunodeficiency virus type 2 determines the coreceptor usage for CXCR4 and CCR5. Virology 264: 237–243.
- 59. Skar H, Borrego P, Wallstrom TC, Mild M, Marcelino JM, et al. (2010) HIV-2 genetic evolution in patients with advanced disease is faster than in matched HIV-1 patients. J Virol 84: 7412–7415.
- 60. Kwong PD, Wyatt R, Robinson J, Sweet RW, Sodroski J, et al. (1998) Structure of an HIV gp120 envelope glycoprotein in complex with the CD4 receptor and a neutralizing human antibody. Nature 393: 648–659.
- 61. Soriano V, Gomes P, Heneine W, Holguin A, Doruana M, et al. (2000) Human immunodeficiency virus type 2 (HIV-2) in Portugal: clinical spectrum, circulating subtypes, virus isolation, and plasma viral load. J Med Virol 61: 111–116.
- 62. Barroso H, Taveira N (2005) Evidence for negative selective pressure in HIV-2 evolution in vivo. Infect Genet Evol 5: 239–246.
- 63. Zhou T, Xu L, Dey B, Hessell AJ, Van Ryk D, et al. (2007) Structural definition of a conserved neutralization epitope on HIV-1 gp120. Nature 445: 732–737.
- 64. Moore PL, Gray ES, Choge IA, Ranchobe N, Mlisana K, et al. (2008) The c3-v4 region is a major target of autologous neutralizing antibodies in human immunodeficiency virus type 1 subtype C infection. J Virol 82: 1860–1869.
- 65. Rong R, Gnanakaran S, Decker JM, Bibollet-Ruche F, Taylor J, et al. (2007) Unique mutational patterns in the envelope alpha 2 amphipathic helix and acquisition of length in gp120 hypervariable domains are associated with resistance to autologous neutralization of subtype C human immunodeficiency virus type 1. J Virol 81: 5658–5668.
- 66. Gaschen B, Taylor J, Yusim K, Foley B, Gao F, et al. (2002) Diversity considerations in HIV-1 vaccine selection. Science 296: 2354–2360.
- 67. Choisy M, Woelk CH, Guegan JF, Robertson DL (2004) Comparative study of adaptive molecular evolution in different human immunodeficiency virus groups and subtypes. J Virol 78: 1962–1970.
- 68. Pond SL, Frost SD, Grossman Z, Gravenor MB, Richman DD, et al. (2006) Adaptation to different human populations by HIV-1 revealed by codon-based analyses. PLoS Comput Biol 2: e62.
- 69. Pybus OG, Rambaut A, Belshaw R, Freckleton RP, Drummond AJ, et al. (2007) Phylogenetic evidence for deleterious mutation load in RNA viruses and its contribution to viral evolution. Mol Biol Evol 24: 845–852.
- 70. Derdeyn CA, Decker JM, Bibollet-Ruche F, Mokili JL, Muldoon M, et al. (2004) Envelope-constrained neutralization-sensitive HIV-1 after heterosexual transmission. Science 303: 2019–2022.
- 71. Lemey P, Kosakovsky Pond SL, Drummond AJ, Pybus OG, Shapiro B, et al. (2007) Synonymous substitution rates predict HIV disease progression as a result of underlying replication dynamics. PLoS Comput Biol 3: e29.
- 72. Huang CC, Tang M, Zhang MY, Majeed S, Montabana E, et al. (2005) Structure of a V3-containing HIV-1 gp120 core. Science 310: 1025–1028.
- 73. Pinter A (2007) Roles of HIV-1 Env variable regions in viral neutralization and vaccine development. Curr HIV Res 5: 542–553.
- 74. Binley J (2009) Specificities of broadly neutralizing anti-HIV-1 sera. Curr Opin HIV AIDS 4: 364–372.
- 75. Tomaras GD, Haynes BF (2009) HIV-1-specific antibody responses during acute and chronic HIV-1 infection. Curr Opin HIV AIDS 4: 373–379.
- 76. Selvarajah S, Puffer BA, Lee FH, Zhu P, Li Y, et al. (2008) Focused dampening of antibody response to the immunodominant variable loops by engineered soluble gp140. AIDS Res Hum Retroviruses 24: 301–314.
- 77. Garrity RR, Rimmelzwaan G, Minassian A, Tsai WP, Lin G, et al. (1997) Refocusing neutralizing antibody response by targeted dampening of an immunodominant epitope. J Immunol 159: 279–289.
- 78. Kim YB, Han DP, Cao C, Cho MW (2003) Immunogenicity and ability of variable loop-deleted human immunodeficiency virus type 1 envelope glycoproteins to elicit neutralizing antibodies. Virology 305: 124–137.
- 79. Skott P, Achour A, Norin M, Thorstensson R, Bjorling E (1999) Characterization of neutralizing sites in the second variable and fourth variable region in gp125 and a conserved region in gp36 of human immunodeficiency virus type 2. Viral Immunol 12: 79–88.
- 80. Mota-Miranda A, Gomes H, Lima-Alves C, Araujo F, Cunha-Ribeiro LM, et al. (2001) Perinatally acquired HIV-2 infection diagnosed at 15 and 24 years of age. AIDS 15: 2460–2461.
- 81. Leitner T, Escanilla D, Marquina S, Wahlberg J, Brostrom C, et al. (1995) Biological and molecular characterization of subtype D, G, and A/D recombinant HIV-1 transmissions in Sweden. Virology 209: 136–146.
- 82. Thompson JD, Gibson TJ, Plewniak F, Jeanmougin F, Higgins DG (1997) The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Res 25: 4876–4882.
- 83. Posada D, Crandall KA (1998) MODELTEST: testing the model of DNA substitution. Bioinformatics 14: 817–818.
- 84. Rodriguez F, Oliver JL, Marin A, Medina JR (1990) The general stochastic model of nucleotide substitution. J Theor Biol 142: 485–501.
- 85. Posada D (2003) Using Modeltest and PAUP* to select a model of nucleotide substitution. In: A. D. Baxevanis DBD, R. D. M. Page et al, editors. Current Protocols in Bioinformatics. Chichester, UK: John Wiley & Sons, Inc. pp. 6.5.1–6.5.14.
- 86. Swofford DL (1998) PAUP*. Phylogenetic Analysis using Parsimony (*and other Methods). Version 4. Version 4 ed. Sunderland, MA: Sinauer-Associates.
- 87. Kosakovsky Pond SL, Posada D, Gravenor MB, Woelk CH, Frost SD (2006) Automated phylogenetic detection of recombination using a genetic algorithm. Mol Biol Evol 23: 1891–1901.
- 88. Pond SL, Frost SD (2005) Datamonkey: rapid detection of selective pressure on individual sites of codon alignments. Bioinformatics 21: 2531–2533.
- 89. Zhang M, Gaschen B, Blay W, Foley B, Haigwood N, et al. (2004) Tracking global patterns of N-linked glycosylation site variation in highly variable viral glycoproteins: HIV, SIV, and HCV envelopes and influenza hemagglutinin. Glycobiology 14: 1229–1246.
- 90. Pond SL, Frost SD, Muse SV (2005) HyPhy: hypothesis testing using phylogenies. Bioinformatics 21: 676–679.
- 91. Muse SV, Gaut BS (1994) A likelihood approach for comparing synonymous and nonsynonymous nucleotide substitution rates, with application to the chloroplast genome. Mol Biol Evol 11: 715–724.
- 92. Kosakovsky Pond SL, Poon AWF, Frost SDW (2009) Estimating selection pressures on aligments of coding sequences: practice. In: Lemey P, Salemi M, Vandamme A, editors. The Phylogenetic Handbook: A Paratical Approach to Phylogenetic Analysis and Hypothesis Testing, 2nd ed. Cambridge, UK: Cambridge University Press. pp. 452–490.
- 93. Kosakovsky Pond SL, Frost SD (2005) Not so different after all: a comparison of methods for detecting amino acid sites under selection. Mol Biol Evol 22: 1208–1222.
- 94. Pond SK, Muse SV (2005) Site-to-site variation of synonymous substitution rates. Mol Biol Evol 22: 2375–2385.
- 95. Arnold K, Bordoli L, Kopp J, Schwede T (2006) The SWISS-MODEL workspace: a web-based environment for protein structure homology modelling. Bioinformatics 22: 195–201.
- 96. Kiefer F, Arnold K, Kunzli M, Bordoli L, Schwede T (2009) The SWISS-MODEL Repository and associated resources. Nucleic Acids Res 37: D387–392.
- 97. Peitsch MC (1995) Protein Modeling by E-mail. Nature Biotechnology 13: 658–660.
- 98. Guex N, Peitsch MC (1997) SWISS-MODEL and the Swiss-PdbViewer: an environment for comparative protein modeling. Electrophoresis 18: 2714–2723.
- 99. Accelrys Inc (2008) Discovery Studio 2.1. San Diego, CA: Accelrys Inc..
- 100. Pettersen EF, Goddard TD, Huang CC, Couch GS, Greenblatt DM, et al. (2004) UCSF Chimera, a visualization system for exploratory research and analysis. J Comput Chem 25: 1605–1612.
- 101. Gerstein M (1992) A Resolution-Sensitive Procedure for Comparing Protein Surfaces and its Application to the Comparison of Antigen-Combining Sites. Acta Cryst A48: 271–276.