We analyzed genetic diversity and phylogenetic relationships among 124 HIV-1 and 19 HIV-2 strains in sera collected in 1986 from patients of the state hospital in Ouagadougou, Burkina Faso. Phylogenetic analysis of the HIV-1 env gp41 region of 65 sequences characterized 37 (56.9%) as CRF06_cpx strains, 25 (38.5%) as CRF02_AG, 2 (3.1%) as CRF09_cpx, and 1 (1.5%) as subtype A. Similarly, phylogenetic analysis of the protease (PR) gene region of 73 sequences identified 52 (71.2%) as CRF06_cpx, 15 (20.5%) as CRF02_AG, 5 (6.8%) as subtype A, and 1 (1.4%) was a unique strain that clustered along the B/D lineage but basal to the node connecting the two lineages. HIV-2 PR or integrase (INT) groups A (n = 17 [89.5%]) and B (n = 2 [10.5%]) were found in both monotypic (n = 11) and heterotypic HIV-1/HIV-2 (n = 8) infections, with few HIV-2 group B infections. Based on limited available sampling, evidence suggests two recombinant viruses, CRF06_cpx and CRF02_AG, appear to have driven the beginning of the mid-1980s HIV-1 epidemic in Burkina Faso.
Citation: Fonjungo PN, Kalish ML, Schaefer A, Rayfield M, Mika J, Rose LE, et al. (2014) Recombinant Viruses Initiated the Early HIV-1 Epidemic in Burkina Faso. PLoS ONE 9(3): e92423. https://doi.org/10.1371/journal.pone.0092423
Editor: Rongge Yang, Chinese Academy of Sciences, Wuhan Institute of Virology, China
Received: October 1, 2013; Accepted: February 22, 2014; Published: March 19, 2014
This is an open-access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 public domain dedication.
Funding: The work was funded by the Centers for Disease Control and Prevention. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Acquired immunodeficiency syndrome (AIDS) is caused by two genetically distinct types of human immunodeficiency virus (HIV), HIV type 1 (HIV-1) and HIV type 2 (HIV-2). HIV-1 consists of four major groups, M, N, O and P, each resulting from separate cross-species transmissions from chimpanzees or gorillas to humans –. Within the Group M viruses, phylogenetic analysis has identified 12 subtypes and sub-subtypes (A1, A2, A3, B, C, D, F1, F2, G, H, J and K), at least 57 circulating recombinant forms (CRFs), and innumerable unique recombinant forms, which encompass the majority of HIV-1 infections worldwide . The level of infections with group O viruses has remained very low and they are found mostly in West-Central Africa , . Currently, very few cases of HIV-1 groups N and P have been identified, all in persons from Cameroon , –.
Similarly, HIV-2, which is the result of 8 separate cross-species infections from sooty mangabey monkeys to humans, has been classified into genetically distinct groups designated A to H and are found primarily in West African countries, such as Guinea-Bissau, Senegal, Ivory Coast, Sierra Leone, Ghana, Mali, and The Gambia –, but has spread to other non-African countries , . HIV-2 group A is most prevalent, followed by group B. HIV-2 groups C-H are rare , .
Although HIV-1 and HIV-2 seroreactivity was first reported in sub-Saharan African countries in the mid-1980s –, little genetic characterization has been done on those early HIV strains. Phylogenetic characterization of HIV sequences remains a powerful tool for tracking the evolution and distribution of HIV worldwide. Of critical importance to our understanding the evolution of the HIV pandemic is having access to samples from early in the epidemic, especially from West and Central Africa, which are the epicenters of the HIV-2 and HIV-1 epidemics, respectively , , . The earliest known HIV-1 sequence (ZR59) came from a 1959 HIV-1 seropositive plasma sample from Zaire, now called the Democratic Republic of Congo (DRC). Phylogenetic analysis of gag, pol, and env genes confirmed it as an HIV-1 group M virus and placed it basal to the node connecting the subtype B and D lineages . The second earliest known HIV sequence was found in a 1960 lymph node biopsy specimen obtained from an adult female in Kinshasa, Zaire (DRC60), and was phylogenetically characterized as HIV-1 group M, subtype A in the gag, pol and env gene regions . Finding viruses from two different HIV-1 subtypes, suggested that significant viral diversification had already taken place by 1960 . Next, 56 samples from Kinshasa, Zaire, collected in the mid-1980s, demonstrated the presence of all the HIV-1 group M subtypes, except B, along with circulating recombinant form (CRF) 01 (CRF01_AE), and many unique recombinant strains .
Between 1985-1987 the HIV-2 strains ROD, BEN and ST, isolated from patients with AIDS from Cape Verde, Mali and Senegal, respectively, phylogenetically clustered with HIV-2 group A reference strains -, while the HIV-2 strains GH-2, D205 and UC1 (1986–1988) recovered from AIDS patients in Ghana and Ivory Coast, respectively, clustered with HIV-2 group B , –. To date, no HIV-2 subtypes have been described for any of the HIV-2 groups.
Early studies on serum specimens from Burkina Faso, collected between 1985–1987, showed an HIV seroprevalence of 1.7%, 1.8%, 4.5% and 14.6% among pregnant women, prisoners, hospital patients and prostitutes, respectively . A study from 1994 showed the HIV prevalence in female sex workers to be as high as 58.2% . The first HIV-1 genetic subtype described from Burkina Faso was in a specimen collected in 1996 , . Phylogenetic analysis of the full genome sequence classified the HIV isolate, BFP90, as a complex recombinant, CRF06_cpx involving recombination between at least 4 HIV-1 subtypes: A, G, J and K , , . Thereafter, additional studies on specimens collected in 1996, 2000, 2001, 2003, and 2004 have confirmed CRF06_cpx to be the predominant strain in Burkina Faso, followed by CRF02_AG, and the less prevalent subtypes A, A3, G, F1, H and CRF09-cpx –. In other West and West-Central African countries, such as in Senegal, Ivory Coast, Nigeria, Gabon, Cameroon, where multiple subtypes and CRFs co-circulate, more recent samplings have reported their predominant HIV-1 strain as CRF02_AG –.
In this study we have sequenced and performed phylogenetic analysis on HIV strains from specimens collected in 1986 in the West African city of Ouagadougou, Burkina Faso, in order to characterize the viruses that may have initiated the early epidemic and to understand how the distribution of those strains have evolved.
Materials and Methods
We received 849 remnant and anonymized serum samples collected in 1986 from patients attending the state hospital in Ouagadougou, the capital of Burkina Faso in West Africa. The amount of the serum samples we received was very small, ranging from 100 μl to about 250 μl. Since both HIV-1 and HIV-2 are found in West Africa, all sera were tested by two separate whole viral lysate EIA assays to detect antibodies to HIV-1 and HIV-2 (Genetic Systems, Redmond, WA, USA). To confirm HIV type, EIA reactive specimens were tested with INNO-LIA HIV-1/2 (INNOGENETICS, Ghent, Belgium), which has synthetic peptides recognizing antibodies to HIV-1 p17, p24, p31, gp41, and gp120 proteins, and to HIV-2 gp36 and gp105 proteins. Western blot was performed on some samples, however, the discrimination between HIV-1 and HIV-2 was based on INNO-LIA HIV-1/2 results as previously described , .
The initial HIV prevalence survey samples were consented to at the time of blood draw, but there was no formal IRB in Burkina Faso in 1986 and no written informed consent retained. Permission for the survey and specimen collection was granted by the Ministry of Health of Burkina Faso. For purposes of the genetic studies reported here, the archived specimens were received unlinked and anonymized and oversight of their analysis fell under the Global Monitoring for Variants Strains of HIV protocol #1367 approved by the CDC Institutional Review Board.
RNA Extraction, Amplification and Sequencing
Extraction of RNA from all INNO-LIA positive sera was done using the NucliSens nucleic acid manual or automated protocols (Organon Teknika, Boxtel, the Netherlands). Conditions for RT-PCR and secondary DNA PCR amplifications (RT-PCR/PCR) and the sets of primers for HIV env gp41 region, PR, and INT amplifications have been described previously: HIV-1 env gp41 (outside: gp40F1/gp41R1 and nested: gp46F2/gp47R2 or gp48R2) ; HIV-1 PR (outside: DP10F/DP11R and nested: DP16F/DP17R) , , HIV-2 PR (outside: DP20F/DP21R and nested: DP26aF/DP27R) ; and HIV-2/SIV INT (outside: INT-F1/INT-R1 and nested: INT-F2/INT-R2) . PCR-amplified products were purified with the QIAquick PCR Purification Kit (Qiagen, Valencia, CA USA) and directly sequenced with both forward and reverse nested PCR primers on an automated DNA sequencer (ABI model 377; Applied Biosystems Inc., Foster City, CA).
Characterization of Specimens
HIV sequences from INNO-LIA positive sera were subjected to RT-PCR/PCR amplification of different regions of the HIV genome. First, HIV-1 seropositive specimens were amplified using env gp41 primers that allow amplification of HIV-1 groups M, N, and O and SIVcpz within a 366-bp fragment . Samples were further screened for HIV-1 using PR primers known to amplify the entire 297-bp PR gene of group M subtypes A-K in field samples with a high efficiency , . HIV-2 seropositive specimens were amplified with HIV-2 type-specific PR primers that allow identification of at least the two major groups, A and B . The HIV-2 PCR samples from this collection were further screened for HIV-2 with pol-INT primers.
Specimens showing INNO-LIA HIV-1/HIV-2 heterotypic seroreactivity were submitted to separate amplifications with HIV-1 and HIV-2 PR primers, which are type specific and permit selective PCR amplification of HIV-1 and HIV-2 strains from mixed HIV-1/HIV-2 infections . These samples were additionally screened for the presence of HIV-1 with gp41 primers and for HIV-2 with pol-INT primers . Sequencing of two different gene regions, especially from opposite ends of the genome, might increase the likelihood of detecting potential HIV-1 intersubtype recombinants. Since our serum samples were more than 20 years old and the HIV RNA was likely degraded, we selected both the HIV-1 gp41 and PR gene regions and the HIV-2 PR and INT genes for initial amplification attempts because they are reasonably conserved genes and fairly short gene regions.
The derived HIV-1 and HIV-2 DNA sequences were aligned using CLUSTAL W1.83 multiple-sequence alignment program included in the GeneStudio package, which provides the interface to phylogenetic programs on a PC workstation . Alignments included known HIV-1 or HIV-2 sequences extracted from the Los Alamos Laboratory HIV Sequence database , representing the different genetic subtypes and the CRFs documented in West Africa. Phylogenetic analysis was performed by the neighbor joining method, with the nucleotide distance calculated by Kimura’s two-parameter model included in the PHYLIP package (version 3.5c)  with and without bootstrapping. SIVcpz sequences were used as outgroups. It was further verified with Maximum Likelihood method using FastDNAml program (http://www.genestudio.com).
Genetic Distance Analysis
Subtype and CRF designations were determined from phylogenetic analysis, and sequences were input into the MEGA program, version 2.0  to calculate the means of the pair-wise genetic differences among each subtype and CRF. Values for the genetic differences between sequences were evaluated based on the distance index that was created from the nucleotide alignment 
Seroreactivity and RT-PCR Amplification
Of the 849 1986 Burkina Faso specimens from hospitalized patients, 172 (20.3%) were confirmed to be HIV-1 and/or HIV-2 seropositive by the INNO-LIA HIV Line assay. From the 172 seropositive samples, 124 (72.1%) were only HIV-1 RT-PCR/PCR amplifiable; 51 (41.1%) were amplified by gp41 primers, 59 (47.6%) by PR primers and 14 (11.3%) were amplified by both gp41 and PR primers (Table 1). Monotypic HIV-2 was amplified from 11 seropositive samples, 10 (90.9%) in the PR region, and 1 (9.1%) additional sample was amplified in the HIV-2 INT region. Both HIV-1 and HIV-2 were amplified from 8 samples: 7 HIV-1 PR genes and 8 HIV-1 gp41 genes were amplified and all 8 HIV-2 strains were amplified by HIV-2 PR primers.
Phylogenetic Analysis of Early Sequences
Phylogenetic analysis of the 65 HIV-1 gp41 sequences from monotypic and heterotypic infections demonstrated that 37 (56.9%) were subtype G in that region, but further stongly subclustered with CRF06_cpx reference sequences; likewise, 25 (38.5%) clustered with subtype A sequences yet further subclustered with CRF02_AG reference strains (Figure 1A). Two (3.1%) were also subtype A but further subclustered with CRF09_cpx reference strains, and 1 (1.5%) clustered with subtype A reference sequences without any further subclustering. Similarly, phylogenetic analysis of the PR gene region of 73 sequences revealed 52 (71.2%) strongly subclustered with CRF06_cpx reference sequences, 15 (20.5%) subclustered with CRF02_AG, 5 (6.8%) closely related sequences clustered with subtype A sequences and 1 (1.4%) was a unique strain that clustered within the B/D lineage but basal to the node connecting the two (Figure 1B). Parallel analysis of HIV-1 PR and env gp41 gene regions, for the 14 strains with sequences available from both genes, revealed CRF concordance in 12 (85.7%) viruses, with 3 CRF02_AG(PR)/CRF02_AG(gp41) and 9 CRF06_cpx(PR)/CRF06_cpx(gp41). Of those with discordance between the two gene regions, 2 had mosaic patterns of CRF06_cpx(PR)/CRF02_AG(gp41) (Table 1), demonstrating further recombination occurring between the two prevalent recombinant viruses. The distribution of HIV-1 subtypes, recombinants, and dual infections is shown in Table 1. One limitation of our study is that we were only able to amplify fairly short gene regions due to small serum volumes and degradation to the HIV RNA since collection in the mid-1980s. While we tried to amplify both HIV-1 gp41 and PR for HIV-1 serologically determined homotypic infections, HIV-2 PR and INT for HIV-2 homotypic infections, and both for HIV-1/HIV-2 heterotypic infections, for most samples we only had success with one of the two gene regions.
The Burkino Faso sequences are identified by the prefix, BF. Sequences from mixed HIV-1/HIV-2 infections are highlighted in gray. The reference HIV-1 group M and CRF sequences, respectively, have prefixes A, B, C, D, F1, F2, G, H, J, K, and potential L to denote the subtype or sub-subtype, or number indicating which CRF they represent, i.e., 1 for CRF01, 2 for CRF02, etc. The position of the outgroup (SIV-cpz sequence) is not shown. The numbers on nodal branches represent the bootstrap values (out of 100 replicates); only values 68% or greater are shown. The scale bar indicates an evolutionary distance of 0.1 nucleotides per site. Vertical distances are for clarity only.
Phylogenetic analysis of 19 HIV-2 PR sequences from monotypic and heterotypic infections classified 17 sequences as group A (89.5%) and 2 as group B (10.5%) (Figure 2), with one of the group B being an INT sequence (Table 1). HIV-2 group A sequences were identified in all 10 HIV-2 monotypic infections. From the 8 HIV-1/HIV-2 heterotypic infections, HIV-1 strains were represented by 2 CRF06_cpx(PR) (25%), 1 CRF02_AG(PR) (12.5%), 1 CRF06_cpx(gp41) (12.5%), 3 CRF06_cpx(PR)/CRF06_cpx(gp41) (37.5%) and 1 subtype A in PR (12.5%) (Table 1). From those 8 heterotypic HIV-1/HIV-2 infections, HIV-2 group A was present in 7 (87.5%) and group B in 1 (12.5%).
For comparison, 7 1980s sequences from Ivory Coast were included in the tree and contain the prefixes IC. Highlighted sequences in gray were from mixed HIV-1/HIV-2 infections; the remainders were from monotypic HIV-2 infections; groups #1 and #2 within subtype A represent distinct subclusters of closely related sequences. Viruses from sooty mangabeys are shown (SIVsm). The position of the outgroup (SIV-cpz sequence) is not shown. The number on the node indicates the bootstrap value; only values 68% or greater are shown. The scale bar indicates an evolutionary distance of 0.1 nucleotides per site. Vertical distances are for clarity only.
Genetic Distance Analysis
We found the mean nucleotide distance among all 52 Burkina Faso HIV-1 CRF06_cpx(PR) sequences was 1.1% (range 0.1–4.1%) (Table 2). Similarly, there was a limited nucleotide divergence with a mean of 1.8% (range 0.3–3.8%) for the 15 CRF02_AG(PR) sequences (Table 2). Also, low genetic distances were observed for the 37 CRF06_cpx(gp41) sequences with a mean of 2.0% (range 0.3–6.3%), 25 CRF02_AG(gp41) with a mean of 3.5% (range 0.8–7.7%), and 2 CRF09_cpx (mean 0.3%) (Table 2).
Genetic distance analysis of the 17 group A HIV-2 PR sequences revealed a broad diversity (mean 6.2%, 0.3–12.8%) (Table 2). Eleven of the 17 HIV-2 PR group A sequences phylogenetically clustered into 2 distinct subgroups supported by high bootstrap value of 93% and 99%, respectively (Figure 2). Pairwise nucleotide divergence was small among sequences within these 2 groups: mean 0.8%, range 0.3–2.3% for group #1 and mean 1.1%, range 0.3–3.0% for group #2 (Table 2). Pairwise analysis of the remaining 6 unrelated HIV-2 sequences showed a relatively high range (mean 6.1%, range 2.0–12.8%) of nucleotide diversity due to either a long existence of HIV-2 strains in Burkina Faso or the separate introduction of these variants into the country.
A unique set of historical serum samples collected in 1986 from Burkina Faso provided us the opportunity to determine the types and distribution of HIV that were present in the early years of the epidemic in that country. Based on two gene regions, PR and env gp41 of HIV-1 and PR or INT of HIV-2, our results showed that recombinant viruses were co-circulating early in the epidemic with the predominant strain being CRF06_cpx, a complex recombinant virus consisting of genomic regions from subtypes A, G, J, and K. CRF02_AG was the next major strain, followed by subtype A and CRF09_cpx, another complex recombinant virus, as minor strains. In addition, HIV-1 and HIV-2 heterotypic and HIV-2 monotypic infections were also present, with HIV-2 group A present in the majority of these infections.
Phylogenetic analysis of nucleotide sequences showed that the early 1986 HIV-1 epidemic in Burkina Faso was driven by recombinant viruses, with CRF06_cpx and CRF02_AG causing the majority of infections. Although these samples came from one hospital in Quagadougou, almost thirty years later the same CRFs are still predominant and co-circulating today and causing infections in Burkina Faso, with no significant change in proportion between CRF06_cpx and CRF02_AG , ; thus, strongly suggesting the same strains that initiated the early epidemic. Similarly, Kalish et al  found the distribution of HIV-1 strains in Kinshasa, Zaire from 1984 through 1986 were composed of CRF01_AE, unique recombinant viruses, unclassifiable strains, and all the group M subtypes but B. Other researchers found almost 20 years later, that the type and proportion of the Kinshasa subtypes and recombinant viruses were strikingly similar , .
Some of the earliest known samples of HIV infection (ZR59 and DRC60) were traced back to Kinshasa, Zaire , . Although Central Africa appears to be the epicenter of the HIV-1 pandemic, CRF02_AG and CRF06 were not reported in Kinshasa until 2004. Following the first report of CRF06_cpx in a Burkina Faso specimen (BFP90) in 1996 ,  and later in Mali in 1999 , CRF06_cpx viruses have since been documented in other West African countries, including Senegal, Mali, Ivory Coast, Niger and Nigeria , . It is unclear where CRF06_cpx arose. Some have speculated that it originated in Burkina Faso because of the early predominance of CRF06_cpx in the country . However, CRF06_cpx is a complex recombinant with portions of HIV-1 subtypes A, G, J, and K. Since subtypes J and K have not been reported in West Africa, especially so early in the epidemic, it does not seem likely that it could have arisen from multiple recombination events in Burkina Faso. A more plausible explanation is that it may have evolved in West-Central Africa, the epicenter of the HIV pandemic, and radiated to the west. Alternately, the first recombination events could have taken place in West-Central Africa, for instance between viruses from subtypes J and K, and additional recombination events took place when people with these infections traveled to the west where subtypes A and G were commonly reported. Since we only had PR or gp41 sequences, and both of these gene regions for CRF06-cpx are subtype G, another interesting possibility might be that these viruses could represent parental subtype G strains from which the CRF06 arose. However, as we state above, we do not believe that CRF06 originated in West Africa, since viruses of subtypes J and K were only found in Central Africa in the mid-1980s  and are still unreported in West Africa. Furthermore, recombination had to occur multiple times to generate the CRF06 virus. The very high genetic relatedness of all our early CRF06 viruses in Burkina Faso, and their close evolutionary relationship to CRF06 reference sequences, supports the conclusion that CRF06 was most likely introduced into Burkina Faso a couple years before our sampling in 1986. Finally, CRF06 and CRF02 are currently co-circulating in Burkina Faso in roughly the same proportions as we found in 1986 , .
CRF02-AG was first identified in West Africa between 1989 and 1991 , , and soon became the prominent strain throughout much of West and West Central African countries, including Nigeria, Cameroon, Senegal, and Cote d'Ivoire , –. While some investigators have suggested that recombinant viruses have the potential to be more pathogenic, more transmissible, or more biologically fit –, it still remains to be proven that recombinant viruses have a selective advantage . West Africa in the early 1980s was not yet experiencing an HIV-1 epidemic, so it was easy for newly introduced HIV-1 strains to become established within high-risk groups and then radiate out into the rest of the population. It is interesting, though, that the two strains that appear to have driven the 1986 HIV-1 epidemic in Burkina Faso were the recombinant viruses CRF06_cpx and CRF02_AG.
While the range of intra-subtype diversity was high in Kinshasa, Zaire in 1984-1986 , the intra-CRF diversity was low among 1986 HIV-1 PR and gp41gene sequences from Burkina Faso, suggesting that this was a much newer epidemic. In Zaire, during the same time period, the mean range of intra-CRF and intra-subtype diversity spanned 9.6% through 19.7% in the env C2V3C3 gene region, yet the env gp41 genetic distances for CRF06 and CRF02 showed only an average nucleotide difference of 3.5% and 2.0%, respectively. While the gp41 region is more conserved than the C2V3C3 region of the envelope, it does not explain the huge differences seen between the Kinshasa and Ouagadougou divergence among sequences from each city. The small amount of genetic differences among the viruses within CRF06_cpx and CRF02_AG suggest that they were recently introduced into Burkina Faso as founder viruses, not long before their sampling in 1986. Since the genetic divergence is larger for CRF06_cpx in both gene regions analyzed, it suggests that CRF02_AG may have entered Burkina Faso as a founder virus shortly after CRF06_cpx. A somewhat similar finding of low diversity among two founder viruses, CRF01_AE and subtype B’ was observed from specimens collected shortly after their introduction into Thailand . Thai national serosurveys allowed researchers to conclude that there were two separate introductions of subtype B’ and CRF01_AE by parenteral and sexual risk factors, respectively, approximately one year apart. Unfortunately, we do not have any data about risk factors that might have helped us explain how these recombinant viruses entered Burkina Faso, but they do appear to be sequential introductions occurring within a couple years of each other.
Although there are at least 8 HIV-2 groups –, only A and B have become endemic throughout West Africa, with group A being the dominant strain, except in Cote d’Ivoire, where HIV-2 group B is prevalent –. In our study of HIV strains in Burkina Faso, we found monotypic HIV-1 or HIV-2 infections as well as heterotypic HIV-1 and HIV-2 infections. Similar to what has been observed previously in West Africa, HIV-2 group A was predominant in Burkina Faso whether in HIV-2 monotypic or HIV-1/HIV-2 heterotypic infections.
We found the mean PR genetic distance among Burkina Faso HIV-2 group A viruses was 6.2%, which is considerably more divergent than HIV-1 PR for CRF02_AG and CRF06_cpx (1.8% and 1.0%, respectively). Since HIV-1 replicates faster and diverges more quickly than HIV-2, this greater divergence of HIV-2 viruses in Burkina Faso suggests that the introduction of HIV-2 could have predated the introduction of HIV-1, which is consistent with observations in other West African countries , . However, another explanation for the higher diversity of HIV-2 group A could be the separate introduction of divergent strains into Burkina Faso, carried in from other West African countries where HIV-2 was already endemic.
Analysis of HIV-2 phylogenetic trees from Burkina Faso showed two strongly related sequence subclusters within group A. Each subcluster defines groups of closely related sequences with very short branch lengths and small nucleotide differences among them. These clusters represent two founder group A viruses that entered Burkina Faso and subsequently spread rapidly, most likely due to the high-risk groups they entered. These types of transmission networks contributed significantly to establishing HIV-2 infections in Burkina Faso and throughout West Africa. In addition, phylogenetic analysis demonstrated the sequences in group #2 formed a strong monophyletic cluster (99% bootstrap support) with the reference sequence NIHZ, which was originally isolated in 1986 from Guinea Bissau . Such a relationship suggests that some of the HIV-2 strains found in Burkina Faso might have their origins in Guinea Bissau, a country in West Africa with one of the highest prevalences of HIV-2 infections  or that NIHZ-like strains originated in Burkina Faso and helped establish the HIV-2 epidemic in Guinea Bissau.
In summary, our results provide new and unique insights into the HIV epidemic in West Africa during this early period in the global pandemic. Based on sequences from only 138 HIV strains, it appears that CRF02_AG and CRF06_cpx were founder viruses that initiated the early HIV-1 epidemic in Burkina Faso. We also provide characterization of early HIV-2 sequences and document that while group A was predominant, both groups A and B variants were found in both HIV-2 monotypic and mixed HIV-1/HIV-2 infections. This information is critical for understanding the newly emerging HIV epidemic in Burkina Faso and for monitoring the dissemination and evolution of HIV-1 and HIV-2 strains in Burkina Faso, as well as throughout West Africa.
The authors would like to thank the support and participation of the Ministry of Health of Burkina Faso. We also thank Dr. Dennis Ellenberger for critical review of the manuscript.
Conceived and designed the experiments: PNF MLK MR DP. Performed the experiments: AS JM LER OH. Analyzed the data: PNF MLK DP. Contributed reagents/materials/analysis tools: MR RS. Wrote the paper: PNF MLK DP. Read, revised and approved the paper: PNF MLK AS MR JM LER OH RS DP.
- 1. De Leys R, Vanderborght B, Vanden Haesevelde M, Heyndrickx L, van Geel A, et al. (1990) Isolation and partial characterization of an unusual human immunodeficiency retrovirus from two persons of west-central African origin. J. Virol 64: 1207–1216.
- 2. Charneau P, Borman AM, Quillent C, Guétard D, Chamaret S, et al. (1994) Isolation and Envelope Sequence of a Highly Divergent HIV-1 Isolate: Definition of a New HIV-1 Group. Virology 15: 247–253.
- 3. Simon F, Mauclère P, Roques P, Loussert-Ajaka I, Müller-Trutwin MC, et al. (1998) Identification of a new human immunodeficiency virus type 1 distinct from group M and group O. Nat Med. 4: 1032–7.
- 4. Plantier JC, Leoz M, Dickerson JE, De Oliveira F, Cordonnier F, et al. (2009) A new human immunodeficiency virus derived from gorillas. Nat Med 15: 871–2.
- 5. Sharp PM, Hahn BH (2011) Origins of HIV and the AIDS Pandemic. Cold Spring Harb Perspect Med 1: : a006841.
- 6. Los Alamos Database. HIV Circulating Recombinant Forms (CRFs). Available: http://www.hiv.lanl.gov/content/sequence/HIV/CRFs/CRFs.html. Accessed 2013 August 1.
- 7. Ayouba A, Mauclère P, Martin PM, Cunin P, Mfoupouendoun J, et al. (2001) HIV-1 group O infection in Cameroon, 1986 to 1998. Emerg Infect Dis 7: 466–7.
- 8. Barin F, Cazein F, Lot F, Pillonel J, Brunet S, et al. (2007) Prevalence of HIV-2 and HIV-1 group O infections among new HIV diagnoses in France: 2003–2006. AIDS 21: 2351–3.
- 9. Ayouba A, Souquières S, Njinku B, Martin PM, Müller-Trutwin MC, et al. (2000) HIV-1 group N among HIV-1-seropositive individuals in Cameroon. AIDS 14: 2623–5.
- 10. Roques P, Robertson DL, Souquière S, Apetrei C, Nerrienet E, et al. (2004) Phylogenetic characteristics of three new HIV-1 N strains and implications for the origin of group N. AIDS. 18: 1371–81.
- 11. Yamaguchi J, McArthur CP, Vallari A, Coffey R, Bodelle P, et al. (2006) HIV-1 Group N: evidence of ongoing transmission in Cameroon. AIDS Res Hum Retroviruses 22: 453–7.
- 12. Yamaguchi J, Coffey R, Vallari A, Ngansop C, Mbanya D, et al. (2006) Identification of HIV type 1 group N infections in a husband and wife in Cameroon: viral genome sequences provide evidence for horizontal transmission. AIDS Res Hum Retroviruses 22: 83–92.
- 13. Vallari A, Bodelle P, Ngansop C, Makamche F, Ndembi N, et al. (2010) Four new HIV-1 group N isolates from Cameroon: Prevalence continues to be low. AIDS Res Hum Retroviruses 26: 109–15.
- 14. Vallari A, Holzmayer V, Harris B, Yamaguchi J, Ngansop C, et al. (2011) Confirmation of putative HIV-1 group P in Cameroon. J Virol 85: 1403–7.
- 15. Schim van der Loeff MF, Aaby P (1999) Towards a better understanding of the epidemiology of HIV-2. AIDS 13 Suppl A: S69–84.
- 16. Yamaguchi J, Devare SG, Brennan CA (2000) Identification of a new HIV-2 subtype based on phylogenetic analysis of full-length genomic sequence. AIDS Res Hum Retroviruses 16: 925–30.
- 17. Damond F, Worobey M, Campa P, Farfara I, Colin G, et al. (2004) Identification of a highly divergent HIV type 2 and proposal for a change in HIV type 2 classification. AIDS Res Hum Retroviruses 20: 666–72.
- 18. Campbell-Yesufu OT, Gandhi RT (2011) Update on human immunodeficiency virus (HIV)-2 infection. Clin Infect Dis 52: 780–7.
- 19. Machuca A, Soriano V, Gutiérrez M, Holguín A, Aguilera A, et al. (1999) Human immunodeficiency virus type 2 infection in Spain. The HIV-2 Spanish Study Group. Intervirology 42: 37–42.
- 20. Soriano V, Gomes P, Heneine W, Holguín A, Doruana M, et al. (2000) Human immunodeficiency virus type 2 (HIV-2) in Portugal: clinical spectrum, circulating subtypes, virus isolation, and plasma viral load. J Med Virol 61: 111–6.
- 21. Gao F, Yue L, White AT, Pappas PG, Barchue J, et al. (1992) Human infection by genetically diverse SIVSM-related HIV-2 in west Africa. Nature 358: 495–9.
- 22. Brennan CA, Yamaguchi J, Vallari AS, Hickman RK, Devare SG (1997) Genetic variation in human immunodeficiency virus type 2: identification of a unique variant from human plasma. AIDS Res Hum Retroviruses 13: 401–4.
- 23. Cheingsong-Popov R, Lister S, Callow D, Kaleebu P, Beddows S, et al. (1994) Serotyping HIV type 1 by antibody binding to the V3 loop: relationship to viral genotype. AIDS Res Hum Retroviruses 11: 1379–86.
- 24. Barin F, Lahbabi Y, Buzelay L, Lejeune B, Baillou-Beaufils A, et al. (1996) Diversity of antibody binding to V3 peptides representing consensus sequences of HIV type 1 genotypes A to E: an approach for HIV type 1 serological subtyping. AIDS Res Hum Retroviruses 12: 1279–89.
- 25. Plantier JC, Damond F, Lasky M, Sankalé JL, Apetrei C, et al. (1999) V3 Serotyping of HIV-1 Infection: Correlation With Genotyping and Limitations. J Acquir Immune Defic Syndr Hum Retrovirol 20: 432–441.
- 26. Vidal N, Peeters M, Mulanga-Kabeya C, Nzilambi N, Robertson D, et al. (2000) Unprecedented degree of human immunodeficiency virus type 1 (HIV-1) group M genetic diversity in the Democratic Republic of Congo suggests that the HIV-1 pandemic originated in Central Africa. J Virol 74: 10498–507.
- 27. Kalish ML, Robbins KE, Pieniazek D, Schaefer A, Nzilambi N, et al. (2004) Recombinant viruses and early global HIV-1 epidemic. Emerg Infect Dis 10: 1227–34.
- 28. Zhu T, Korber BT, Nahmias AJ, Hooper E, Sharp PM, et al. (1998) An African HIV-1 sequence from 1959 and implications for the origin of the epidemic. Nature 391: 594–7.
- 29. Worobey M, Gemmel M, Teuwen DE, Haselkorn T, Kunstman K, et al. (2008) Direct evidence of extensive diversity of HIV-1 in Kinshasa by 1960. Nature 455: 661–4.
- 30. Kong LI, Lee SW, Kappes JC, Parkin JS, Decker D, et al. (1998) West African HIV-2-related human retrovirus with attenuated cytopathicity. Science 240: 1525–9.
- 31. Kirchhoff F, Jentsch KD, Stuke A, Mous J, Hunsmann G (1990) Genomic divergence of an HIV-2 from a German AIDS patient probably infected in Mali. AIDS 4: 847–57.
- 32. Gao F, Yue L, Robertson DL, Hill SC, Hui H, et al. (1994) Genetic diversity of human immunodeficiency virus type 2: evidence for distinct sequence subtypes with differences in virus biology. J Virol 68: 7433–47.
- 33. Dietrich U, Adamski M, Kreutz R, Seipp A, Kühnel H, et al. (1989) A highly divergent HIV-2-related isolate. Nature 342: 948–50.
- 34. Barnett SW, Quiroga M, Werner A, Dina D, Levy JA (1993) Distinguishing features of an infectious molecular clone of the highly divergent and noncytopathic human immunodeficiency virus type 2 UC1 strain. J Virol 67: 1006–14.
- 35. Miura T, Sakuragi J, Kawamura M, Fukasawa M, Moriyama EN, et al. (1990) Establishment of a phylogenetic survey system for AIDS-related lentiviruses and demonstration of a new HIV-2 subgroup. AIDS 4: 1257–61.
- 36. Kanki PJ, M'Boup S, Ricard D, Barin F, Denis F, et al. (1987) Human T-lymphotropic virus type 4 and the human immunodeficiency virus in West Africa. Science 236: 827–31.
- 37. Lankoandé S, Meda N, Sangaré L, Compaoré IP, Catraye J, et al. (1998) Prevalence and risk of HIV infection among female sex workers in Burkina Faso. Int J STD AIDS 9: 146–50.
- 38. Oelrichs RB, Workman C, Laukkanen T, McCutchan FE, Deacon NJ (1998) A novel subtype A/G/J recombinant full-length HIV type 1 genome from Burkina Faso. AIDS Res Hum Retroviruses 14: 1495–500.
- 39. Montavon C, Bibollet-Ruche F, Robertson D, Koumare B, Mulanga C, et al. (1999) The identification of a complex A/G/I/J recombinant HIV type 1 virus in various West African countries. AIDS Res Hum Retroviruses 15: 1707–12.
- 40. Montavon C, Toure-Kane C, Nkengasong JN, Vergne L, Hertogs K, et al. (2002) CRF06-cpx: a new circulating recombinant form of HIV-1 in West Africa involving subtypes A, G, K, and J. J Acquir Immune Defic Syndr. 29: 522–30.
- 41. Ouédraogo-Traoré R, Montavon C, Sanou T, Vidal N, Sangaré L, et al. (2003) CRF06-cpx is the predominant HIV-1 variant in AIDS patients from Ouagadougou, the capital city of Burkina Faso. AIDS 17: 441–2.
- 42. Manigart O, Courgnaud V, Sanou O, Valéa D, Nagot N, et al. (2004) HIV-1 superinfections in a cohort of commercial sex workers in Burkina Faso as assessed by an autologous heteroduplex mobility procedure. AIDS 18: 1645–51.
- 43. Tebit DM, Ganame J, Sathiandee K, Nagabila Y, Coulibaly B, et al. (2006) Diversity of HIV in rural Burkina Faso. J Acquir Immune Defic Syndr 43: 144–52.
- 44. Nkengasong JN, Luo CC, Abouya L, Pieniazek D, Maurice C, et al. (2000) Distribution of HIV-1 subtypes among HIV-seropositive patients in the interior of Côte d'Ivoire. J Acquir Immune Defic Syndr 23: 430–6.
- 45. Montavon C, Toure-Kane C, Liegeois F, Mpoudi E, Bourgeois A, et al. (2000) Most env and gag subtype A HIV-1 viruses circulating in West and West Central Africa are similar to the prototype AG recombinant virus IBNG. J Acquir Immune Defic Syndr 23: 363–74.
- 46. Hemelaar J, Gouws E, Ghys PD, Osmanov S, WHO-UNAIDS Network for HIV Isolation (2011) Characterisation (2011) Global trends in molecular epidemiology of HIV-1 during 2000–2007. AIDS 25: 679–89.
- 47. Esu-Williams E, Mulanga-Kabeya C, Takena H, Zwandor A, Aminu K, et al. (1997) Seroprevalence of HIV-1, HIV-2, and HIV-1 group O in Nigeria: evidence for a growing increase of HIV infection. J Acquir Immune Defic Syndr Hum Retrovirol 16: 204–210.
- 48. Wiktor SZ, Nkengasong JN, Ekpini ER, Adjorlolo-Johnson GT, Ghys PD, et al. (1999) Lack of protection against HIV-1 infection among women with HIV-2 infection. AIDS 13: 695–699.
- 49. Yang C, Dash BC, Simon F, van der Groen G, Pieniazek D, et al. (2000) Detection of diverse variants of human immunodeficiency virus-1 groups M, N, and O and simian immunodeficiency viruses from chimpanzees by using generic pol and env primer pairs. J Infect Dis 181: 1791–1795.
- 50. Janini LM, Pieniazek D, Peralta JM, Schechter M, Tanuri A, et al. (1996) Identification of single and dual infections with distinct subtypes of human immunodeficiency virus type 1 by using restriction fragment length polymorphism analysis. Virus Genes 13: 69–81.
- 51. Ramos A, Tanuri A, Schechter M, Rayfield MA, Hu DJ, et al. (1999) Dual and recombinant infections: an integral part of the HIV-1 epidemic in Brazil. Emerg Infect Dis 5: 65–74.
- 52. Pieniazek D, Ellenberger D, Janini LM, Ramos AC, Nkengasong J, et al. (1999) Predominance of human immunodeficiency virus type 2 subtype B in Abidjan, Ivory Coast. AIDS Res Hum Retroviruses 15: 603–608.
- 53. Masciotra S, Yang C, Pieniazek D, Thomas C, Owen SM, et al. (2002) Detection of simian immunodeficiency virus in diverse species and of human immunodeficiency virus Type 2 by using consensus primers within the pol region. J Clin Microbiol 40: 3167–3171.
- 54. Masciotra S, Livellara B, Belloso W, Clara L, Tanuri A, et al. (2000) Evidence of a high frequency of HIV-1 subtype F infections in a heterosexual population in Buenos Aires, Argentina. AIDS Res Hum Retroviruses 16: 1007–1014.
- 55. Fonjungo PN, Mpoudi EN, Torimiro JN, Alemnji GA, Eno LT, et al. (2002) Human immunodeficiency virus type 1 group m protease in cameroon: genetic diversity and protease inhibitor mutational features. J Clin Microbiol 40: 837–845.
- 56. Pieniazek D, Peralta JM, Ferreira JA, Krebs JW, Owen SM, et al. (1991) Identification of mixed HIV-1/HIV-2 infections in Brazil by polymerase chain reaction. AIDS 5: 1293–1299.
- 57. Gene studio. Available: http://www.genestudio.com/. Accessed 2013 August 5.
- 58. Los Alamos Database. HIV Sequence Database. Available: http://www.hiv.lanl.gov/content/sequence/HIV/mainpage.html. Accessed 2013 August 5.
- 59. Felsenstein J (1989) PHYLIP-Phylogeny interference package (version 3.2). Cladistics 5: 164–166.
- 60. MEGA (version 2.0). Molecular Evolutionary Genetics Analysis. Available: http://www.megasoftware.net/. Accessed 2013 August 5.
- 61. Smith TF, Waterman MS, Fitch WM (1981) Comparative biosequence metrics. J Mol Evol 18: 38–46.
- 62. Tebit DM, Sangaré L, Makamtse A, Yameogo S, Somlare H, et al. (2008) HIV drug resistance pattern among HAART-exposed patients with suboptimal virological response in Ouagadougou, Burkina Faso. J Acquir Immune Defic Syndr 49: 17–25.
- 63. Tebit DM, Sangaré L, Tiba F, Saydou Y, Makamtse A, et al. (2009) Analysis of the diversity of the HIV-1 pol gene and drug resistance associated changes among drug-naïve patients in Burkina Faso. J Med Virol 81: 1691–701.
- 64. Vidal N, Mulanga C, Bazepeo SE, Mwamba JK, Tshimpaka JW, et al. (2005) Distribution of HIV-1 variants in the Democratic Republic of Congo suggests increase of subtype C in Kinshasa between 1997 and 2002. J Acquir Immune Defic Syndr 40: 456–62.
- 65. Imamichi H, Koita O, Dabitao D, Dao S, Ibrah M, et al. (2009) Identification and characterization of CRF02_AG, CRF06_cpx, and CRF09_cpx recombinant subtypes in Mali, West Africa. AIDS Res Hum Retroviruses 25: 45–55.
- 66. Howard TM, Rasheed S (1996) Genomic structure and nucleotide sequence analysis of a new HIV type 1 subtype A strain from Nigeria. AIDS Res Hum Retroviruses 12: 1413–25.
- 67. Carr JK, Salminen MO, Albert J, Sanders-Buell E, Gotte D, et al. (1998) Full genome sequences of human immunodeficiency virus type 1 subtypes G and A/G intersubtype recombinants. Virology 247: 22–31.
- 68. McCutchan FE, Carr JK, Bajani M, Sanders-Buell E, Harry TO, et al. (1999) Subtype G and multiple forms of A/G intersubtype recombinant human immunodeficiency virus type 1 in Nigeria. Virology 254: 226–34.
- 69. Peeters M, Esu-Williams E, Vergne L, Montavon C, Mulanga-Kabeya C, et al. (2000) Predominance of subtype A and G HIV type 1 in Nigeria, with geographical differences in their distribution. AIDS Res Hum Retroviruses 2000 16: 315–25.
- 70. Montavon C, Toure-Kane C, Liegeois F, Mpoudi E, Bourgeois A, et al. (2000) Most env and gag subtype A HIV-1 viruses circulating in West and West Central Africa are similar to the prototype AG recombinant virus IBNG. J Acquir Immune Defic Syndr 23: 363–74.
- 71. Nkengasong JN, Luo CC, Abouya L, Pieniazek D, Maurice C, et al. (2000) Distribution of HIV-1 subtypes among HIV-seropositive patients in the interior of Côte d'Ivoire. J Acquir Immune Defic Syndr 23: 430–6.
- 72. Fonjungo PN, Mpoudi EN, Torimiro JN, Alemnji GA, Eno LT, et al. (2000) Presence of diverse human immunodeficiency virus type 1 viral variants in Cameroon. AIDS Res Hum Retroviruses 16: 1319–24.
- 73. Peeters M, Liegeois F, Torimiro N, Bourgeois A, Mpoudi E, et al. (1999) Characterization of a highly replicative intergroup M/O human immunodeficiency virus type 1 recombinant isolated from a Cameroonian patient. J Virol 73: 7368–75.
- 74. Hu DJ, Vanichseni S, Mastro TD, Raktham S, Young NL, et al. (2001) Viral load differences in early infection with two HIV-1 subtypes. AIDS 15: 683–91.
- 75. Njai HF, Gali Y, Vanham G, Clybergh C, Jennes W, et al. (2006) The predominance of Human Immunodeficiency Virus type 1 (HIV-1) circulating recombinant form 02 (CRF02_AG) in West Central Africa may be related to its replicative fitness. Retrovirology 3: 40.
- 76. Montano MA, Novitsky VA, Blackard JT, Cho NL, Katzenstein DA, et al. (1997) Divergent transcriptional regulation among expanding human immunodeficiency virus type 1 subtypes. J Virol 71: 8657–65.
- 77. Montano MA, Nixon CP, Ndung'u T, Bussmann H, Novitsky VA, et al. (2000) Elevated tumor necrosis factor-alpha activation of human immunodeficiency virus type 1 subtype C in Southern Africa is associated with an NF-kappaB enhancer gain-of-function. J Infect Dis 181: 76–81.
- 78. Ndung'u T, Sepako E, McLane MF, Chand F, Bedi K, et al. (2006) HIV-1 subtype C in vitro growth and coreceptor utilization. Virology 347: 247–60.
- 79. Tàpia N, Franco S, Puig-Basagoiti F, Menéndez C, Alonso PL, et al. (2003) Influence of human immunodeficiency virus type 1 subtype on mother-to-child transmission. J Gen Virol 84: 607–13.
- 80. Ou CY, Takebe Y, Weniger BG, Luo CC, Kalish ML, et al. (1993) Independent introduction of two major HIV-1 genotypes into distinct high-risk populations in Thailand. Lancet 341: 1171–4.
- 81. Ishikawa K, Janssens W, Banor JS, Shinno T, Piedade J, et al. (2001) Genetic analysis of HIV type 2 from Ghana and Guinea-Bissau, West Africa. AIDS Res Hum Retroviruses 17: 1661–3.
- 82. McCutchan FE (2006) Global epidemiology of HIV. J Med Virol 78 Suppl 1S7–S12.
- 83. Ruelle J, Sanou M, Liu HF, Vandenbroucke AT, Duquenne A, et al. (2007) Genetic polymorphisms and resistance mutations of HIV type 2 in antiretroviral-naive patients in Burkina Faso. AIDS Res Hum Retroviruses 23: 955–64.
- 84. Hamel DJ, Sankalé JL, Eisen G, Meloni ST, Mullins C, et al. (2007) Twenty years of prospective molecular epidemiology in Senegal: changes in HIV diversity. AIDS Res Hum Retroviruses 23: 1189–96.
- 85. Poulsen AG, Aaby P, Gottschau A, Kvinesdal BB, Dias F, et al. (1993) HIV-2 infection in Bissau, West Africa, 1987-1989: incidence, prevalences, and routes of transmission. J Acquir Immune Defic Syndr 6: 941–8.
- 86. Naucler A, Albino P, Da Silva AP, Andreasson PA, Andersson S, et al. (1991) HIV-2 infection in hospitalized patients in Bissau, Guinea-Bissau. AIDS 5: 301–304.
- 87. Piedade J, Venenno T, Prieto E, Albuquerque R, Esteves A, et al. (2000) Longstanding presence of HIV-2 infection in Guinea-Bissau (West Africa). Acta Trop 76: 119–124.