17 Jan 2012: Lingappa JR, Petrovski S, Kahle E, Fellay J, Shianna K, et al. (2012) Correction: Genomewide Association Study for Determinants of HIV-1 Acquisition and Viral Set Point in HIV-1 Serodiscordant Couples with Quantified Virus Exposure. PLoS ONE 7(1): 10.1371/annotation/1dcd8ee4-bcf4-4cf6-97a3-2e26f0f77060. doi: 10.1371/annotation/1dcd8ee4-bcf4-4cf6-97a3-2e26f0f77060 View correction
Host genetic factors may be important determinants of HIV-1 sexual acquisition. We performed a genome-wide association study (GWAS) for host genetic variants modifying HIV-1 acquisition and viral control in the context of a cohort of African HIV-1 serodiscordant heterosexual couples. To minimize misclassification of HIV-1 risk, we quantified HIV-1 exposure, using data including plasma HIV-1 concentrations, gender, and condom use.
We matched couples without HIV-1 seroconversion to those with seroconversion by quantified HIV-1 exposure risk. Logistic regression of single nucleotide polymorphisms (SNPs) for 798 samples from 496 HIV-1 infected and 302 HIV-1 exposed, uninfected individuals was performed to identify factors associated with HIV-1 acquisition. In addition, a linear regression analysis was performed using SNP data from a subset (n = 403) of HIV-1 infected individuals to identify factors predicting plasma HIV-1 concentrations.
After correcting for multiple comparisons, no SNPs were significantly associated with HIV-1 infection status or plasma HIV-1 concentrations.
This GWAS controlling for HIV-1 exposure did not identify common host genotypes influencing HIV-1 acquisition. Alternative strategies, such as large-scale sequencing to identify low frequency variation, should be considered for identifying novel host genetic predictors of HIV-1 acquisition.
Citation: Lingappa JR, Petrovski S, Kahle E, Fellay J, Shianna K, McElrath MJ, et al. (2011) Genomewide Association Study for Determinants of HIV-1 Acquisition and Viral Set Point in HIV-1 Serodiscordant Couples with Quantified Virus Exposure. PLoS ONE 6(12): e28632. doi:10.1371/journal.pone.0028632
Editor: Roberto F. Speck, University Hospital Zurich, Switzerland
Received: June 8, 2011; Accepted: November 11, 2011; Published: December 12, 2011
Copyright: © 2011 Lingappa et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: Funding for this work was through the Bill and Melinda Gates Foundation grant #26469 and NIH/NIAID grants AI27757, AI073115 and Center for HIV-1/AIDS Vaccine Immunology grant AI067854. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
HIV-1 interacts with many host factors during the process of infection and replication. However, there is only one confirmed example of a host factor variant that modifies HIV-1 infection outcomes: CCR5- Δ32, a variant in the co-receptor for HIV-1 cellular entry has been shown to increase host resistance to HIV-1 infection and HIV-1 disease progression –. While, CCR5- Δ32 is relatively infrequent or absent in many populations, its existence supports the need to broadly evaluate the human genome for other host genetic variants that influence HIV-1 acquisition.
To date, broad evaluations for host genetic factors have been most successful with identifying predictors of HIV-1 control in infected individuals with recent genome-wide association studies (GWAS) – identifying single nucleotide polymorphisms (SNPs) in the Human Leukocyte Antigen (HLA) complex associated with plasma HIV-1 set point, and disease non-progression , as well as genes outside HLA associated with disease progression .
Studies looking for host genetic variation associated with HIV-1 acquisition have been more challenging. Candidate variation has been evaluated in HLA, chemokines/chemokine receptors, mediators of the innate, and adaptive immune responses, and factors thought to underlie intracellular viral restriction in diverse epidemiologic contexts (reviewed in reference ). To date, no specific gene or variant has been found to consistently influence HIV-1 acquisition/resistance across these diverse studies. Recently, a GWAS evaluated common SNPs across the human genome comparing HIV-1 seropositive to seronegative Malawians and found no common SNPs associated with HIV-1 infection .
However, interpretation of studies of HIV-1 acquisition is complicated by the fact that levels of HIV-1 exposure are difficult to quantify, and yet modify risk of HIV-1 sexual transmission by up to 300-fold . Lack of HIV-1 exposure quantification could therefore result in reduced power to detect relevant common variants due to misclassification in assigning HIV-1 acquisition phenotypes (e.g., HIV-1 susceptible individuals with low HIV-1 exposure and therefore at low risk of HIV-1 acquisition who, without quantitative assessment of exposure, are misclassified as HIV-1 resistant).
The principal determinant of HIV-1 sexual transmission risk, and therefore primary HIV-1 exposure factor, is the plasma HIV-1 RNA level in the transmitting partner , . Other epidemiologic, biologic and behavioral factors (e.g., circumcision status of male uninfected partners, and frequency of unprotected sex between partners) also contribute to this risk , –. Thus, accurate quantification of the level of HIV-1 exposure and associated HIV-1 sexual transmission risk requires data from both sexual partners.
Studies of HIV-1 serodiscordant couples (one partner HIV-1 infected and the other HIV-1 uninfected) offer unique advantages for identifying factors associated with HIV-1 acquisition. In particular, prospective collection of specimens and data from both sexual partners facilitates quantification of HIV-1 exposure risk, and confirmation of HIV-1 transmission linkage between partners. Therefore, this study design allows HIV-1 uninfected individuals with little to no HIV-1 exposure to be excluded from the analysis. To date, no GWAS for host genetic factors underlying HIV-1 acquisition has been performed in a cohort of HIV-1 serodiscordant couples. Here we report use of specimens and data from African heterosexual HIV-1 serodiscordant couples in a GWAS for host genetic predictors of HIV-1 acquisition and set point plasma RNA levels.
Study participants were selected from two cohorts of African HIV-1 serodiscordant heterosexual couples:
- The Partners in Prevention HSV/HIV Transmission Study enrolled 3408 African HIV-1 serodiscordant couples at 14 sites in East and Southern Africa, and followed them quarterly for up to 24 months to evaluate the efficacy of herpes simplex virus type-2 (HSV-2) suppression to reduce HIV-1 transmission to their heterosexual HIV-1 uninfected partners , . HIV-1 infected partners in this trial were required to be dually-infected with HSV-2 with a CD4 count ≥250 cells/mm3; there was no eligibility criterion related to HSV-2 serostatus of the HIV-1 uninfected partner. The primary analysis for this trial found acyclovir suppression reduced plasma HIV-1 level of the HIV-1 infected partners by a mean of 0.25 log10 copies/ml, but did not reduce the risk of HIV-1 transmission .
- The Couples Observational Study (COS) used a similar recruitment process to enroll 485 HIV-1 serodiscordant couples from Soweto, South Africa and Kampala, Uganda without restriction on CD4 count or HSV-2 serostatus of the HIV-1 infected partner; these couples were followed quarterly for up to 12 months.
In both cohorts, HIV-1 serostatus in the initially HIV-1 uninfected partner was assessed by dual HIV-1 rapid assays and HIV-1 seroconversions confirmed by ELISA, and Western blot or RT-PCR . Plasma HIV-1 env and gag sequencing of both partners were compared with those consistent with transmission linkage within the partnership classified as “linked” , . Seroconverting partners were also followed after seroconversion to document plasma HIV-1 RNA set point and CD4 counts.
Among all participants recruited at both COS study sites and at the 10 Partners in Prevention HSV/HIV Transmission Study sites at which consent for host genetic studies had been obtained, a total of 863 individuals were selected for genotyping. Procedures used to identify these individuals are described below.
In order to identify HIV-1 non-seroconverting and seroconverting partners with similar ranges of HIV-1 exposure we identified epidemiologic factors predicting HIV-1 transmission. Baseline data from linked transmitting and non-transmitting couples identified in the Partners in Prevention HSV/HIV Transmission Study was used to develop a Cox proportional hazards model identifying HIV-1 exposure factors associated with HIV-1 transmission: gender, age, male circumcision, HIV-1 infected partner plasma RNA level, and unprotected sex. Seroconverting couples in either cohort were matched to two non-seroconverting couples based on baseline status for each HIV-1 exposure factor. To augment power to detect genotypes associated with host resistance to HIV-1 we also included additional HIV-1 uninfected individuals with all HIV-1 exposure factors in high-risk strata.
To facilitate comparisons of HIV-1 exposure levels we used the regression coefficients of the Cox prediction model to develop an exposure score that ranged from 0 (lowest exposure) to 7 (highest exposure) (Table 1). The cumulative risk of HIV-1 infection for individuals with an exposure risk score ≥2 was 31-fold greater than for those with an exposure risk <2 (4.99% vs 0.16%).
Among the 863 individuals identified for genotyping through this process, 512 (59%) were HIV-1 infected (384 [75%] prevalent and 127 [25%] incident HIV-1 infections) and 352 (41%) remained HIV-1 uninfected despite documented HIV-1 exposure. Table 2 provides a breakdown of the sample selection by HIV-1 infected and uninfected status.
Genomic DNA was extracted from 1 ml archived whole blood. All samples were genotyped using Illumina HumanHap 1M-Duo (np135) Bead Chips , which feature more than 1 million SNPs including 21 directly genotyped variants that have been identified in previous studies as associated with HIV-1 susceptibility . SNPs with a call frequency of <99%, with minor allele frequency <1% or with >5% missing results were excluded leaving 990,115 SNPs for association analysis. Bonferroni correction for multiple testing used a P value cutoff of 5.1×10−8 for genome-wide significance.
Candidate SNP subset
As a subanalysis, we evaluated 21 candidate SNPs previously implicated in HIV-1 infection that were present on the 1M-Duo chip platform. We report uncorrected p-values for these 21 SNPs.
A total of 25 samples were excluded based on genotyping quality control steps (twelve samples failed genotyping, eleven had genotype inconsistent with epidemiologically assigned gender, and two failed cryptic relatedness requirements). Population structure was evaluated using a modified EIGENSTRAT method , the first principal component (eigenvector) discriminated individuals based on whether they were from Southern African (South Africa and Botswana) or Eastern African (Kenya, Uganda and Tanzania) study sites (Figure S1); at this step, eight samples were removed as population outliers.
Finally, in order to capture all HIV-1 infected individuals for genotyping, the initial matching for HIV-1 exposure included all seroconverting couples. Thus, some HIV-1 uninfected partners were selected for genotyping by matching to HIV-1 exposure scores of unlinked seroconverting couples; many of these HIV-1 uninfected partners had HIV-1 exposure risk scores<2. However, since all couples with linked transmission had HIV-1 exposure scores ≥2, we took this as an HIV-1 exposure cutoff and excluded from analysis 32 HIV-1 uninfected individuals with exposure score <2.
Analysis of specific HIV-1 phenotypes.
1) HIV-1 susceptibility analysis: We evaluated genotypes for all HIV-1 seropositive individuals, including prevalent HIV-1 infections (partners who were HIV-1 infected at enrollment), and incident infections (partners HIV-1 seronegative at enrollment who became infected during follow-up). These were compared to genotypes for all HIV-1 exposed, seronegative individuals (HIV-1 exposures scores ≥2). We also compared genotypes of HIV-1 seropositive individuals to the subset of all HIV-1 uninfected individuals with all baseline HIV-1 exposure characteristics in the highest risk strata. For both of these analyses, we performed standard logistic regression, additive genetic model, in PLINK (version 1.07) , , using gender, age and the individual coordinates of six EIGENSTRAT eigenvectors as covariates.
2) Plasma RNA set point analysis: Similar to previous analyses  we defined the plasma HIV-1 RNA set point among individuals with prevalent infection (partner who was HIV-1 infected at enrollment) as the average log10 plasma RNA level after excluding RNA measurements taken at or after the initiation of antiretroviral therapy (ART) or when CD4 count was <200 cells/mm3. We required RNA measurements to be stable, with measurements for each individual visually inspected for notable discrepancies (e.g., no plasma HIV-1 RNA measurements from each individual differing by >1 log copies/ml); we required a minimum of 2 reliable and consistent measurements per individual. For individuals with incident infection (e.g., HIV-1 seroconversion during follow-up) an estimated date of HIV-1 infection was established based on a combination of HIV-1 serology and plasma HIV-1 RNA PCR results, with HIV-1 set point calculated as the average of all log10 plasma HIV-1 RNA measurements taken 4 months or more after the estimated date of infection . For all analyses, plasma HIV-1 RNA levels below the limit of detection (240 copies/mL) were set to 120 copies/mL. For this analysis, we performed linear regression for set point using age, gender, acyclovir randomization, seroprevalent vs. seroconverter status and five EIGENSTRAT eigenvectors as covariates.
All individuals whose samples were evaluated through this genotyping provided informed consent for storage of samples for future research including genetic studies. Human subject review and approval for this analysis was obtained at the University of Washington and at local and affiliated institutional review boards for study sites where participants were enrolled. The Partners in Prevention HSV/HIV Transmission Study was registered with ClinicalTrials.gov (#NCT00194519).
Genome-wide common variation associated with HIV-1 acquisition
After quality control, 798 samples remained for analysis (496 from HIV-1 infected and 302 from HIV-1 uninfected individuals). Table 3 shows epidemiological and clinical characteristics of these individuals.
In the multivariate regression analysis, no single SNP reached genome-wide significance of p<5.1×10−8 (Figure 1). Furthermore, a meta-analysis based on the two separate analyses for Eastern and Southern African recruited individuals was consistent with the results from the pooled analysis. An annotated list of all SNPs with p<1×10−5 based on the pooled analysis of Eastern and Southern recruited Africans is provided (Table S1). Among the 21 SNPs available on the 1M-Duo chip platform that have previously been implicated with HIV-1 acquisition, none were GWAS significant; only two were significant at a p<0.05 threshold with both of these having effects in the opposite direction from the original findings: for rs2070729-IRF1 the G allele was linked to increased susceptibility (p = 0.01) in contrast to the original study, , and for rs1800451-MBL2 the A allele was linked to reduced susceptibility (p = 0.02) in contrast to prior studies – (Table S2).
-log10(p) is plotted for all SNPs against physical location of each SNP in the genome (listed by chromosome number 1 through 22, and X and XY). The threshold for genome-wide significance (P = 5.1×10−8) is indicated.
Comparison of the 496 HIV-1 infected individuals to 90 HIV-1 uninfected individuals having the higher HIV-1 exposure risk scores (>5 risk score) did not identify SNPs reaching genome-wide significance.
Genome-wide common variation associated with HIV-1 set point
Among the 496 HIV-1 infected individuals in our cohort, 403 (81%) met the requirements for stable HIV-1 set point including 293 (73%) prevalent and 110 (27%) incident infections (Table 4). Comparison of log10 plasma HIV-1 levels of prevalent and incident infections showed no statistically significant difference between them (p = 0.14), so both groups were combined for subsequent analyses. The overall median plasma HIV-1 level for individuals included in this plasma HIV-1 set point analysis was 4.53 log10 copies/ml. The median plasma HIV-1 level of males (n = 180) was 4.57 log10 copies/ml compared to females (n = 223) 4.51 log10 copies/ml (p = 0.57). Linear regression performed on these 403 HIV-1 infected individuals for the 990,115 SNPs that passed quality control found no single SNP reaching genome-wide significance (Figure 2). An annotated list of all markers obtaining a P-value less than 1×10−5 was generated using WGAviewer software  (Table S3).
-log10(p) is plotted for all SNPs against physical location of each SNP in the genome (listed by chromosome number 1 through 22, and X and XY). The threshold for genome-wide significance is indicated.
We found no common SNPs associated with HIV-1 acquisition at genome-wide significance. This result is consistent with a recent GWAS of Africans recruited from a high-risk setting . Our study is the first to select participants based on HIV-1 exposure levels ensuring that HIV-1 uninfected individuals had documented risk for HIV-1 acquisition. Furthermore, we also found that, among the subset of African HIV-1 infected participants with stable plasma HIV-1 level (combining individuals with incident and chronic HIV-1 infection), no SNPs on the 1M-Duo chip were associated at genome-wide significance with plasma HIV-1 set point. This is also similar to recent findings from a GWAS for determinants of plasma HIV-1 set point in an African-American cohort .
Three important limitations to this analysis must be considered in interpreting our findings. First, our overall sample size (n = 798 individuals passing quality control criteria) had sufficient power to detect host genetic variants with large effect sizes: e.g., variants with minor allele frequencies of 5% or 20% having a genotype relative risk greater than 3.2 and 2.1, respectively. A larger cohort would be needed to evaluate for host genetic factors associated with smaller genotype relative risks. However, it is possible that lower levels of linkage disequilibrium, particularly within the MHC region in African populations, reduced our ability to identify MHC genetic variants potentially associated to plasma HIV-1 set point in this cohort. This was apparent in a previous analysis for host genetic determinants of set point viral load in an African-American cohort . Although two of the top four SNPs (rs10484434 and rs11755492 –Table S3) in our analysis for determinants of set point are located physically close to the MHC region, there is no evidence that these SNPs tag causative variants within the MHC for our cohort. Additional studies and meta-analyses of these SNPs may provide further information on whether these SNPs may have a weak association with set point that we were not powered to detect. Finally, in addition to reducing our power to detect host genetic association with set point, the lower levels of linkage disequilibrium in African populations might have reduced our power to detect weaker host genetic associations with HIV-1 acquisition.
Second, the set of common variants evaluated for association with HIV-1 outcomes is based on HapMap data derived from common variation in West African (Yoruban) populations and likely does not capture detailed host variation across diverse subSaharan African populations –. Our analysis of population structure did provide clear discrimination of persons of Southern African and Eastern African origin (Figure S1a and S1b), However, improved databases of common host genetic variation in East and Southern African populations are becoming available through the recently completed 1000 genomes project . Nevertheless, the capacity to indirectly capture overall host variation through linkage disequilibrium will still be lower in African populations due to lower levels of linkage disequilibrium present in African populations , .
Finally, recent studies suggest that low frequency or rare host variation that cannot be readily captured through GWAS analysis is an important source of factors contributing to human disease causation , . Such causal rare variation can only be captured through large-scale genome sequencing efforts.
A unique component to our analysis was our use of clinical and behavioral factors from HIV-1 serodiscordant couples to quantify overall HIV-1 exposure risk. This study design provides unique advantages for controlling for epidemiological modifiers of HIV-1 acquisition – both explicitly for factors that are known to influence HIV-1 acquisition (which we have done through the exposure matching), and implicitly for any unidentified exposure factors shared within the partnership. While consanguinity of partners is a potential source of bias in this approach, across all samples, our analysis found only 2 pairs of samples with cryptic relatedness, and one sample from each cryptically related pair was excluded from the analysis. Consistent with previous epidemiologic analyses of HIV-1 transmission risk performed in this and other cohorts , , , , plasma HIV-1 RNA in the HIV-1 infected partner has the greatest impact on estimated HIV-1 exposure level. Although we also included additional factors from the HIV-1 uninfected partner (e.g., history of any unprotected sex, circumcision status of male HIV-1 uninfected partners, and age<35 years for HIV-1 uninfected female partners) in our exposure risk quantification, these factors contribute to a much smaller degree to HIV-1 transmission risk. We also did not adjust our findings for the direction of transmission (male-to-female versus female-to-male) since a recent per contact analysis for HIV-1 transmission risk in this cohort found that, after adjusting for plasma HIV-1 levels, the relative risk for male-to-female versus female-to-male transmission was 1.03 (p = 0.93) . Our HIV-1 exposure risk score correlated well with overall proportion acquiring HIV-1 infection, with those having exposure scores ≥2 having a 31-fold increased risk of HIV-1 infection. However, our overall analysis included individuals with a range of HIV-1 exposure levels with limited power to evaluate only those HIV-1 uninfected individuals at the highest HIV-1 exposure levels. Finally, we also did not account for longitudinal changes in exposure risk (e.g., HIV-1 infected partners initiating antiretroviral therapy resulting in reduced plasma HIV-1 levels, or behavioral changes related to frequency of sex or use of condoms). Thus, it remains an open question whether the search for genomic factors underlying HIV-1 acquisition might benefit from identifying individuals with extreme transmission phenotypes, e.g., those who remain HIV-1 seronegative despite persistently high HIV-1 exposures.
In summary, our GWAS comparing HIV-1 infected individuals to HIV-1 uninfected individuals with documented HIV-1 exposure risk did not identify host genetic factors strongly modifying risk of HIV-1 acquisition. Future studies of HIV-1 acquisition and set point determination may benefit from use of larger sample sizes, identification of extreme transmission phenotypes, and large-scale sequencing technologies to capture rare and previously uncharacterized common variants in these African cohorts.
Plot of PC1 versus PC2 population substructure after removal of outliers. After removing the eight outlier samples, EIGENSTRAT was re-run to obtain the eigenvectors for use as covariates in association analysis. Graphical plots are by A) Region, with black indicating individuals recruited from study sites in Southern African countries (South Africa and Botswana), and red indicating individuals recruited from study sites in East African countries (Kenya, Uganda and Tanzania), and B) HIV-1 status, with black indicating individuals who remained HIV-1 seronegative, and red indicating HIV-1 seropositive partners and individuals who seroconverted.
SNPs associated with p<10−5 for HIV-1 susceptibility/resistance. SNP rs identifier, uncorrected p-value, Chromosome number and basepair position (build 36.3, hg18), a description of the relative position of the SNP in the closest gene, Minor Allele Frequency (MAF) in HIV-1-negative and HIV-1-positive populations, and name and distance to closest gene are as indicated.
List of tested variants previously reported to have an association with HIV-1 susceptibility/resistance. SNPs listed are those present on the Illumina HumanHap 1M-Duo (np135) Bead Chips that have been previously implicated in candidate gene studies as having impact on HIV-1 acquisition. Characteristics of the studies that reported those previous associations are described.
Variants with p<10−5 in HIV-1 set point analysis. SNP rs identifier, uncorrected p-value, chromosome number and basepair position (build 36.3, hg18), and name and distance to closest gene are as indicated.
We thank the Partners in Prevention HSV/HIV Transmission Study participants for their participation, the study teams at each local study site for their efforts, and the operations and data management staff at the University of Washington for their dedication and perseverance. Dr. Lingappa had full access to all of the data in the study and takes responsibility for the integrity of the data and the accuracy of the data analysis.
The Partners in Prevention HSV/HIV Transmission Study Team (sites permitting host genomic studies)
University of Washington Coordinating Center and Central Laboratories, Seattle, USA: Connie Celum (principal investigator), Anna Wald (protocol co-chair), Jairam R. Lingappa (medical director), Jared M. Baeten, Mary S. Campbell, Lawrence Corey, Robert W. Coombs, James P. Hughes, Amalia Magaret, M. Juliana McElrath, Rhoda Morrow, James I. Mullins
Study site principal investigators and study coordinators:
Cape Town, South Africa (University of Cape Town): David Coetzee; Eldoret, Kenya (Moi University, Indiana University): Kenneth Fife, Edwin Were; Gaborone, Botswana (Botswana Harvard Partnership): Max Essex, Joseph Makhema; Kampala, Uganda (Infectious Disease Institute, Makerere University): Elly Katabira, Allan Ronald; Kisumu, Kenya (Kenya Medical Research Institute, University of California San Francisco): Elizabeth Bukusi, Craig Cohen; Moshi, Tanzania (Kilimanjaro Christian Medical College, Harvard University): Saidi Kapiga, Rachel Manongi; Nairobi, Kenya (University of Nairobi, University of Washington): Carey Farquhar, Grace John-Stewart, James Kiarie; Orange Farm, South Africa (Reproductive Health Research Unit, University of the Witwatersrand): Sinead Delany-Moretlwe, Helen Rees; Soweto, South Africa (Perinatal HIV Research Unit, University of the Witwatersrand): Guy de Bruyn, Glenda Gray, James McIntyre; Thika, Kenya (University of Nairobi, University of Washington): Nelly Rwamba Mugo.
Conceived and designed the experiments: JRL SP JF MJM JMB CC AW JIM DD BH DG. Performed the experiments: JRL SP KS CC AW GdB JIM EN-J CF ME JK DG. Analyzed the data: JRL SP EK KKT. Wrote the paper: JRL SP EK JF KS MJM KKT JMB CC AW GdB JIM EN-J CF ME DD JK DG.
- 1. Dean M, Carrington M, Winkler C, Huttley GA, Smith MW, et al. (1996) Genetic restriction of HIV-1 infection and progression to AIDS by a deletion allele of the CKR5 structural gene. Science 273: 1856–1862.
- 2. Liu R, Paxton WA, Choe S, Ceradini D, Martin SR, et al. (1996) Homozygous defect in HIV-1 coreceptor accounts for resistance of some multiply-exposed individuals to HIV-1 infection. Cell 86: 367–377.
- 3. Samson M, Libert F, Doranz BJ, Rucker J, Liesnard C, et al. (1996) Resistance to HIV-1 infection in caucasian individuals bearing mutant alleles of the CCR-5 chemokine receptor gene. Nature 382: 722–725.
- 4. Fellay J, Ge D, Shianna KV, Colombo S, Ledergerber B, et al. (2009) Common genetic variation and the control of HIV-1 in humans. PLoS Genet 5: e1000791.
- 5. Dalmasso C, Carpentier W, Meyer L, Rouzioux C, Goujard C, et al. (2008) Distinct genetic loci control plasma HIV-RNA and cellular HIV-DNA levels in HIV-1 infection: the ANRS Genome Wide Association 01 study. PLoS ONE 3: e3907.
- 6. Pelak K, Goldstein DB, Walley NM, Fellay J, Ge D, et al. (2010) Host determinants of HIV-1 control in African Americans. J Infect Dis 201: 1141–1149.
- 7. Limou S, Le Clerc S, Coulonges C, Carpentier W, Dina C, et al. (2009) Genomewide association study of an AIDS-nonprogression cohort emphasizes the role played by HLA genes (ANRS Genomewide Association Study 02). J Infect Dis 199: 419–426.
- 8. Limou S, Coulonges C, Herbeck JT, van Manen D, An P, et al. (2010) Multiple-cohort genetic association study reveals CXCR6 as a new chemokine receptor involved in long-term nonprogression to AIDS. The Journal of infectious diseases 202: 908–915.
- 9. Restrepo C, Rallon NI, Carrillo J, Soriano V, Blanco J, et al. (2011) Host Factors Involved in Low Susceptibility to HIV Infection. AIDS Rev 13: 30–40.
- 10. Petrovski S, Fellay J, Shianna KV, Carpenetti N, Kumwenda J, et al. (2011) Common human genetic variants and HIV-1 susceptibility: a genome-wide survey in a homogeneous African population. AIDS 25: 513–18.
- 11. Powers KA, Poole C, Pettifor AE, Cohen MS (2008) Rethinking the heterosexual infectivity of HIV-1: a systematic review and meta-analysis. Lancet Infect Dis 8: 553–563.
- 12. Quinn TC, Wawer MJ, Sewankambo N, Serwadda D, Li C, et al. (2000) Viral load and heterosexual transmission of human immunodeficiency virus type 1. Rakai Project Study Group. N Engl J Med 342: 921–929.
- 13. Lingappa JR, Hughes JP, Wang RS, Baeten JM, Celum C, et al. (2010) Estimating the impact of plasma HIV-1 RNA reductions on heterosexual HIV-1 transmission risk. PLoS One 5: e12598.
- 14. Boily MC, Baggaley RF, Wang L, Masse B, White RG, et al. (2009) Heterosexual risk of HIV-1 infection per sexual act: systematic review and meta-analysis of observational studies. Lancet Infect Dis 9: 118–129.
- 15. Lingappa JR, Baeten JM, Thomas K, Donnell D, De Bruyn G, et al. (2010) Exposure Risk Score to Identify HIV-1 Exposed Seronegative Individuals; AIDS Vaccine 2010, Sept 28–Oct 1; Atlanta, USA.
- 16. Mackelprang R, Baeten J, Donnell D, Celum C, Farquhar C, et al. (2011) Longitudinal HIV-1 Exposure Profiles among HIV-discordant Couples. 18th Congress on Retroviruses and Opportunistic Infections. Feb 28–Mar 2; Boston, USA.
- 17. Lingappa J, Kahle E, Mugo N, Mujugira A, Magaret A, et al. (2009) Characteristics of HIV-1 Discordant Couples Enrolled in a Trial of HSV-2 Suppression to Reduce HIV-1 Transmission: The Partners Study. PLoS ONE 4: e5272.
- 18. Celum C, Wald A, Lingappa JR, Magaret AS, Wang RS, et al. (2010) Acyclovir and Transmission of HIV-1 from Persons Infected with HIV-1 and HSV-2. N Engl J Med 362: 427–439. PMCID: PMC2838503.
- 19. Campbell M, Mullins J, Hughes J, Celum C, Wong K, et al. (2011) Identifying the transmitted virus: viral linkage in HIV-1 seroconverters and their partners in an HIV-1 prevention clinical trial. PLoS One 6: e16986.
- 20. Illumina Inc (2010) Human1M-Duo DNA Analysis BeadChip Kits.
- 21. Price AL, Patterson NJ, Plenge RM, Weinblatt ME, Shadick NA, et al. (2006) Principal components analysis corrects for stratification in genome-wide association studies. Nat Genet 38: 904–909.
- 22. Purcell AW PLINK: Whole genome association analysis toolset. http://pngu.mgh.harvard.edu/~purcell/plink/Harvard University webpage. Accessed 17 Nov. 2011.
- 23. Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, et al. (2007) PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet 81: 559–575.
- 24. Fellay J, Shianna KV, Ge D, Colombo S, Ledergerber B, et al. (2007) A whole-genome association study of major determinants for host control of HIV-1. Science 317: 944–947.
- 25. Lingappa JR, Thomas K, Hughes J, Baeten J, Fife K, et al. (2011) Infected Partner's Plasma HIV-1 RNA Level and the HIV-1 Set Point of Their Heterosexual Seroconverting Partners. 18th Congress on Retroviruses and Opportunistic Infections. Feb 28–Mar 2; Boston, USA.
- 26. Ball TB, Ji H, Kimani J, McLaren P, Marlin C, et al. (2007) Polymorphisms in IRF-1 associated with resistance to HIV-1 infection in highly exposed uninfected Kenyan sex workers. AIDS 21: 1091–1101.
- 27. Boniotto M, Braida L, Pirulli D, Arraes L, Amoroso A, et al. (2003) MBL2 polymorphisms are involved in HIV-1 infection in Brazilian perinatally infected children. AIDS 17: 779–780.
- 28. Garred P, Madsen HO, Balslev U, Hofmann B, Pedersen C, et al. (1997) Susceptibility to HIV infection and progression of AIDS in relation to variant alleles of mannose-binding lectin. Lancet 349: 236–240.
- 29. Malik S, Arias M, Di Flumeri C, Garcia LF, Schurr E (2003) Absence of association between mannose-binding lectin gene polymorphisms and HIV-1 infection in a Colombian population. Immunogenetics 55: 49–52.
- 30. Pastinen T, Liitsola K, Niini P, Salminen M, Syvanen AC (1998) Contribution of the CCR5 and MBL genes to susceptibility to HIV type 1 infection in the Finnish population. AIDS Res Hum Retroviruses 14: 695–698.
- 31. Vallinoto AC, Menezes-Costa MR, Alves AE, Machado LF, de Azevedo VN, et al. (2006) Mannose-binding lectin gene polymorphism and its impact on human immunodeficiency virus 1 infection. Mol Immunol 43: 1358–1362.
- 32. Mombo LE, Lu CY, Ossari S, Bedjabaga I, Sica L, et al. (2003) Mannose-binding lectin alleles in sub-Saharan Africans and relation with susceptibility to infections. Genes Immun 4: 362–367.
- 33. Frazer KA, Ballinger DG, Cox DR, Hinds DA, Stuve LL, et al. (2007) A second generation human haplotype map of over 3.1 million SNPs. Nature 449: 851–861.
- 34. International HapMap Consortium (2005) A haplotype map of the human genome. Nature 437: 1299–1320.
- 35. Teo YY, Small KS, Kwiatkowski DP (2010) Methodological challenges of genome-wide association analysis in Africa. Nat Rev Genet 11: 149–160.
- 36. Durbin RM, Abecasis GR, Altshuler DL, Auton A, Brooks LD, et al. (2010) A map of human genome variation from population-scale sequencing. Nature 467: 1061–1073.
- 37. Bhangale TR, Rieder MJ, Nickerson DA (2008) Estimating coverage and power for genetic association studies using near-complete variation data. Nat Genet 40: 841–843.
- 38. Manolio TA, Collins FS, Cox NJ, Goldstein DB, Hindorff LA, et al. (2009) Finding the missing heritability of complex diseases. Nature 461: 747–753.
- 39. Fellay J, Shianna KV, Telenti A, Goldstein DB (2010) Host genetics and HIV-1: the final phase? PLoS Pathog 6: e1001033.
- 40. Fideli US, Allen SA, Musonda R, Trask S, Hahn BH, et al. (2001) Virologic and immunologic determinants of heterosexual transmission of human immunodeficiency virus type 1 in Africa. AIDS Res Hum Retroviruses 17: 901–910.
- 41. Hughes JP, Baeten JM, Lingappa JR, Magaret AS, Wald A, et al. (2011) Determinants of per coital act HIV-1 infectivity among African HIV-1 serodiscordant couples. Journal of Infectious Diseases. In Press.