Skip to main content
Advertisement
  • Loading metrics

Cell-free RNA reveals host and microbial correlates of broadly neutralizing antibody development against HIV

  • Mark Kowarsky,

    Roles Data curation, Formal analysis, Methodology, Investigation, Visualization, Writing – original draft, Writing – review & editing

    Affiliation Department of Physics, Stanford University, Stanford, California, United States of America

  • Mercedes Dalman,

    Roles Data curation, Formal analysis, Investigation, Validation, Visualization, Writing – review & editing

    Affiliations Department of Medical Biochemistry and Cell Biology, Institute of Biomedicine, University of Gothenburg, Gothenburg, Sweden, Wallenberg Centre for Molecular and Translational Medicine, Sahlgrenska Academy, University of Gothenburg, Gothenburg, Sweden

  • Mira N. Moufarrej,

    Roles Investigation, Methodology, Writing – review & editing

    Affiliation Department of Bioengineering, Stanford University, Stanford, California, United States of America

  • Jennifer Okamoto,

    Roles Data curation, Investigation, Writing – review & editing

    Affiliation Chan Zuckerberg Biohub, San Francisco, California, United States of America

  • Yike Xie,

    Roles Formal analysis, Investigation, Validation, Writing – review & editing

    Affiliations Department of Medical Biochemistry and Cell Biology, Institute of Biomedicine, University of Gothenburg, Gothenburg, Sweden, Wallenberg Centre for Molecular and Translational Medicine, Sahlgrenska Academy, University of Gothenburg, Gothenburg, Sweden

  • Norma F. Neff,

    Roles Methodology, Resources, Writing – review & editing

    Affiliation Chan Zuckerberg Biohub, San Francisco, California, United States of America

  • Salim S. Abdool Karim,

    Roles Resources, Writing – review & editing

    Affiliations Centre for the AIDS Programme of Research in South Africa (CAPRISA), University of KwaZulu Natal, Durban, South Africa, Department of Epidemiology, Columbia University, New York, New York, United States of America

  • Nigel Garrett,

    Roles Resources, Writing – review & editing

    Affiliations Centre for the AIDS Programme of Research in South Africa (CAPRISA), University of KwaZulu Natal, Durban, South Africa, Desmond Tutu HIV Centre, University of Cape Town, Cape Town, South Africa, Discipline of Public Health Medicine, School of Nursing and Public Health, University of KwaZulu-Natal, Durban, South Africa

  • Penny L. Moore ,

    Roles Conceptualization, Funding acquisition, Supervision, Resources, Writing – review & editing

    steve@quake-lab.org (SRQ); pennym@nicd.ac.za (PLM); joan.camunas@gu.se (JC-S)

    Affiliations Centre for the AIDS Programme of Research in South Africa (CAPRISA), University of KwaZulu Natal, Durban, South Africa, SA MRC Antibody Immunity Research Unit, Faculty of Health Sciences, University of the Witwatersrand, Johannesburg, South Africa, Centre for HIV and STIs, National Institute for Communicable Diseases (NICD) of the National Health Laboratory Service (NHLS), Johannesburg, South Africa, Wits Infectious Diseases and Oncology Research Institute, Faculty of Health Sciences, University of the Witwatersrand, Johannesburg, South Africa

  • Joan Camunas-Soler ,

    Roles Conceptualization, Formal analysis, Funding acquisition, Project administration, Supervision, Investigation, Writing – original draft, Writing – review & editing

    steve@quake-lab.org (SRQ); pennym@nicd.ac.za (PLM); joan.camunas@gu.se (JC-S)

    Affiliations Department of Medical Biochemistry and Cell Biology, Institute of Biomedicine, University of Gothenburg, Gothenburg, Sweden, Wallenberg Centre for Molecular and Translational Medicine, Sahlgrenska Academy, University of Gothenburg, Gothenburg, Sweden, Science for Life Laboratory, Institute of Biomedicine, University of Gothenburg, Gothenburg, Sweden

  • Stephen R. Quake

    Roles Conceptualization, Funding acquisition, Methodology, Supervision, Writing – review & editing

    steve@quake-lab.org (SRQ); pennym@nicd.ac.za (PLM); joan.camunas@gu.se (JC-S)

    Affiliations Department of Bioengineering, Stanford University, Stanford, California, United States of America, Chan Zuckerberg Biohub, San Francisco, California, United States of America

Abstract

A small number of people living with HIV (PLWH) develop broadly neutralizing antibodies (bNAbs) targeting multiple HIV strains. Although several viral and immune factors contribute to bNAb development, the genetic and environmental factors driving this response remain largely unknown. We performed combined cell-free DNA (cfDNA) and cell-free RNA (cfRNA) sequencing in 42 plasma samples from a longitudinal cohort of 14 PLWH (7 who develop bNAbs and 7 matched controls). This approach enabled us to non-invasively monitor the host transcriptome, viral genetic variation, and microbiome composition during HIV infection, and to identify molecular correlates of bNAb development. We find that development of bNAbs is associated with a transcriptomic signature of early immune activation characterized by elevated levels of MHC class I antigen presentation genes. This signature is independent of viral load or CD4 count and declines over time. In addition to host features, we recovered sufficient viral reads to reconstruct HIV consensus sequences, supporting the utility of cfRNA for viral genotyping. Finally, we also identified an enrichment of several microbial taxa in bNAb producers and increased levels of GB virus C (GBV-C), a non-pathogenic lymphotropic virus. Our findings suggest a distinct early immune activation profile in PLWH who develop bNAbs. More broadly, we show that combined cfDNA/cfRNA sequencing can reveal relationships between a protective immunogenic response to HIV infection, the host immune system, and microbiome, highlighting its potential for biomarker discovery in future vaccine and therapeutic studies.

Author summary

A subset of people living with HIV develop broadly neutralizing antibodies (bNAbs), a phenomenon that remains poorly understood but relevant to vaccine development. We applied combined cell-free RNA and DNA sequencing to 42 longitudinal plasma samples (7 bNAb producers, 7 controls) to monitor the host transcriptome, viral variation, and microbiome composition during bNAb development. Early in infection, bNAb producers showed an MHC class I-linked immune activation signature that declined over time, alongside changes in commensal microbial sequences and increased levels of GB virus C in cfRNA. These findings support the use of cfDNA/cfRNA sequencing for biomarker discovery in HIV research.

Introduction

HIV has infected over 36 million people worldwide, reaching epidemic proportions in Eastern and Southern Africa with an estimated adult prevalence close to 7% [1]. Antiretroviral therapy (ART) has greatly improved outcomes, allowing HIV to be a manageable condition throughout life. However, global access to ART is difficult to achieve and also requires lifelong daily adherence to a pill taking regime. Moreover, despite effective treatment, transmission of HIV remains common in vulnerable populations, and people living with HIV (PLWH) are at increased risk of aging-related and chronic diseases [2].

Although virtually all PLWH develop strain-specific autologous responses [36], few develop broadly neutralizing antibodies (bNAbs), which can neutralize multiple strains of the virus [7], and when administered passively can prevent acquisition of susceptible HIV strains and suppress viral replication in PLWH [8]. There is great interest in understanding the drivers of bNAbs, as this could inform the development of an effective vaccine. Efforts to design an effective HIV vaccine have increasingly focused on strategies to induce bNAbs by guiding B-cell affinity maturation, with studies demonstrating promising immunization approaches through the rational design of immunogens [911]. Despite these advances, which specific cellular processes give rise to bNAbs during infection remains incompletely described. So far, there is evidence that viral and host genetic factors might play a role [1216], and that a combination of high viral titres and reduced CD4 counts early in infection is associated with the development of bNAbs in some individuals [17,18]. Additionally, changes in frequency of certain immune subpopulations, such as T follicular helper cells (Tfh) might also influence affinity maturation towards bNAb production [1923]. Moreover, there is some evidence that bNAbs might arise from pre-existing memory B cells initially stimulated by bacterial antigens -likely from the gut microbiome- which later cross-react with the HIV glycan shield and develop neutralizing capabilities [24,25]. Although microbial antigen cross-reactivity has also been shown to lead to non-protective immune responses in HIV vaccine trials [26], a better understanding of the interplay between the host immune response, microbiome and a protective bNAb response could aid in vaccine development.

Sequencing of plasma cell-free DNA (cfDNA) has become an established tool in clinical medicine, with applications ranging from non-invasive prenatal testing and oncology to organ transplant monitoring [27,28]. In contrast, plasma cell-free RNA (cfRNA) provides an emerging complementary approach to measure dynamic changes in the host transcriptome [29]. Because transcripts released into circulation can originate from many different cell types and tissues [30], cfRNA enables non-invasive monitoring of systemic physiological and immune states [31,32]. To date, cfRNA sequencing has been used in diverse biomedical settings, including the monitoring of placental and pregnancy complications for maternal-fetal health [3336], to follow bone marrow reconstitution after stem cell transplantation [37], and to characterize host immune responses to tuberculosis infection [38]. However, integrative analyses that capture host, viral, and microbial nucleic acids simultaneously remain uncommon, and cfRNA sequencing approaches have been rarely applied to monitor retroviral infections such as HIV. Here, we conducted a pilot study to investigate whether combined cfDNA and cfRNA sequencing of plasma could reveal molecular features associated with the development of bNAbs in a longitudinal cohort of PLWH.

Results

Study design, sequencing overview and viral analysis

We analyzed 42 plasma samples from 14 PLWH from the CAPRISA studies, of whom 7 developed bNAbs and 7 did not. For each participant, plasma was collected at approximately 6 months, 1 year, and 3 years post-HIV acquisition and prior to ART (Fig 1A). At the 6-month time point, both groups had comparable CD4 counts and viral loads (S1 Fig). The extent of bNAb development was assessed by measuring plasma neutralization breadth against a panel of 18 Env-pseudotyped HIV viruses, summarized as percentage breadth, a continuous measure of bNAb development (Fig 1B and S1 Table and Methods).

thumbnail
Fig 1. Study design and development of neutralization breadth.

A) Longitudinal trajectories of neutralization breadth in bNAb producers (red) and controls (blue). Time zero corresponds to the estimated HIV infection point, with breadth defined as the ability of plasma antibodies to neutralize a cross-clade panel of 18 env-pseudotyped viruses (S2 Table). B) Heatmap of serum neutralization titers (IC50) against the 18-virus panel, grouped by individual, with time points sorted from first (top) to last (bottom). Colors represent log10-transformed IC50 neutralization titers, with higher values (yellow) indicating stronger neutralization potency and black indicating values below the limit of detection.

https://doi.org/10.1371/journal.ppat.1014066.g001

We performed combined cfDNA and cfRNA sequencing in each of the 42 samples (Methods). After quality control and filtering, we obtained enough reads to sample the host transcriptome (80% of total reads for cfRNA) and microbiome (0.2% and 20% reads of cfDNA and cfRNA respectively) (S2 Fig). Sequencing depth before and after sample QC processing (cleaning) is similar for both cfDNA and cfRNA, with comparable metrics between bNAb producers and non-producers (S3 Fig). Similarly, cell-type deconvolution of the host transcriptome showed comparable contributions of the most abundant cell types (e.g., red blood cells, immune cells, platelets) across both groups (S4 Fig). We detected HIV reads in the cfRNA assay for all samples, with coverage sufficient to genotype the majority strain in 38 out of 42 samples (S5A and S5B Fig). For high coverage samples, the nucleotide identity of HIV clustered within individuals based on previous single-genome amplified envelope sequences (Fig 2A) except for one mislabeled sample that was excluded from further analyses (Methods).

thumbnail
Fig 2. HIV consensus sequences from cfRNA.

A) Genomic similarity matrix of consensus HIV sequences obtained from cfRNA, showing clustering within individuals. B) Maximum-likelihood phylogeny of cfRNA-derived HIV consensus sequences (colored labels) with previously published envelope sequences (bold, CAPRISA). Shaded boxes highlight individual-level clustering. With the exception of one sample, cfRNA consensus sequences consistently cluster with the corresponding envelope sequences.

https://doi.org/10.1371/journal.ppat.1014066.g002

We validated the accuracy of cfRNA-derived consensus sequences by comparing them to previously published envelope sequences from the same CAPRISA participants (S2 Table), observing consistent clustering and strain concordance across time points and sequencing methods (Figs 2B and S5C). By comparing differences in consensus sequences within individuals (S5D Fig), we found that most mutations occur in the env gene, which encodes the surface protein responsible for host cell binding. The Env protein is the sole target of bNAbs and is known to be the most rapidly evolving part of the HIV genome [3941]. Using cfRNA we estimated that the dominant viral strain in each individual accumulates approximately 10 mutations per year (S5E Fig), in line with other estimates from virion or PBMC-derived studies [42,43]. Altogether, these results show that cfRNA is a quantitative tool to perform genotype analysis of viral pathogens and to track strain-specific mutational dynamics over time.

Analysis of read lengths and mapping start and end positions indicates that HIV-derived reads in plasma cfRNA (cfHIV) predominantly consist of short RNA fragments (200–500 bases) and do not show systematic positional biases along the HIV genome (S6 Fig). This is consistent with cfHIV being largely derived from fragmented cell-free viral RNA rather than libraries dominated by intact full-length viral genomes. In addition, to assess the quality of HIV genome recovery from cfRNA, we performed a subsampling analysis showing that approximately 50,000 cfHIV reads are sufficient to recover >90% of the genome at ≥50X depth (S7 Fig). In this cohort, this level of coverage was achieved for most samples (83%).

Host transcriptome correlates of cfHIV and clinical parameters

To characterize host responses to HIV infection, we measured the correlation between the host transcriptome and the total abundance of cfHIV, as well as with clinical parameters such as viral load and CD4 counts. Spearman correlation of cfHIV, viral load and CD4 counts to the human transcriptome revealed several genes associated with these measurements (Figs 3A and S8A and S8B). Analysis of genes significantly correlated to cfHIV revealed 19 genes typically associated with host immune responses to viral infection (FDR < 0.05), whereas only 1 for each of viral load and CD4 count, respectively (Fig 3B). CD4 count levels showed an association with IL4, a cytokine inducing differentiation of naive T helper cells. Although the relation between cytokine profile and HIV progression is controversial, increased secretion of IL4 in relation to IFN-gamma is a known hallmark of switching from Th1 to Th2 lymphokine secretion in PLWH [44]. Our observation of IL4 being increased in low CD4 samples is compatible with an increase of IL-4 levels upon disease progression and T helper cell decrease. Top correlates of cfHIV include: CCL5, also known as RANTES, an HIV suppression factor released by CD8+ T cells, TFRC (CD71), a marker of T-cell activation, and CCL2, a chemokine that attracts immune cells to sites of infection (Figs 3C and S8C). These genes do not show a differential behaviour between bNAb producers and controls, indicating that they represent shared responses to HIV infection. This shows that cfRNA sequencing captures shared host transcriptional responses associated with peripheral viral burden during HIV infection.

thumbnail
Fig 3. Correlation between cfHIV levels and host transcriptome.

A) Number of genes significantly correlated with cfHIV counts, viral load (VL) and CD4 counts (Spearman’s rank correlation, see Methods). B) Subset of genes from panel A previously described as involved in host immune responses to viral infection. C) Example correlations between cfHIV counts and three HIV-associated genes. Spearman correlation coefficients indicated. No partitioning is observed between bNAb producers (red) and controls (blue).

https://doi.org/10.1371/journal.ppat.1014066.g003

Development of bNAbs is associated with a transcriptional phenotype of immune activation

We next compared the transcriptome of bNAb producers and non-producers. We performed differential gene abundance analysis finding 256 genes elevated in bNAbs producers (FDR < 0.05, fold-change>1, Fig 4A, Methods). This variation could not be attributed to environmental factors such as site of collection, that showed little to no correlation to these transcriptomic signatures. Remarkably, 52% (n = 131) of elevated genes in bNAb producers are immune-related (primarily associated with adaptive immunity), compared with only 14% (n = 39) of the 281 decreased genes. Among the elevated genes, 25 of them are associated with production of RANTES (CCL5), a chemokine involved in homing and migration of effector and memory T cells. Although CCL5 itself is not enriched in bNAb producers in our dataset, elevated RANTES, along with other inflammatory markers, have previously been linked to the development of bNAbs in spontaneous controllers of HIV [45].

thumbnail
Fig 4. Gene level differences in cfRNA between bNAb producers and controls.

A) Volcano plot of differentially abundant genes. Genes with a fold change > 2 and FDR < 0.05 are highlighted (red = increased, blue = decreased in bNAb producers). Labelled points are genes involved in host immune responses to viral infection. B) Violin plots of the distribution of elevated genes in bNAb producers, stratified by group and neutralization breadth (number of viral strains neutralized), used here as a proxy for progression along bNAb development. C) Genes in the MHC class I antigen presentation pathway. Most show negative correlations with breadth, and some are also differentially abundant between bNAb producers and controls. Correlations with breadth were assessed using Spearman’s rank correlation (Spearman’s ρ indicated next to each gene, see Methods). * correlated with breadth at FDR < 0.05, † differentially abundant between bNAb producers and controls at FDR < 0.05, ‡ correlated with cfHIV at FDR < 0.05.

https://doi.org/10.1371/journal.ppat.1014066.g004

A pathway analysis confirmed these results and further revealed an enrichment of molecular functions related to an immunologic response in bNAb producers (S9A Fig). Terms such as receptor activity, IgG and peptide antigen binding, cytokine signaling and NF-kB signaling were enriched in bNAb positive samples. Analyses of the genes decreased in the bNAb producers showed lower enrichment scores and did not point to the existence of other host immunoregulatory effects (S9B Fig). Overall, host cfRNA sequencing provides evidence for an increased immune activation in participants who developed bNAbs, despite having similar CD4 counts and viral loads to non-bNAb producers.

To investigate this immune activation in more detail, we performed pseudotime analysis on the two groups, using plasma neutralization breadth to define the temporal progression (Fig 4B, Methods). Pseudotime is a trajectory-inference approach that orders samples by relative biological state rather than chronological time, allowing comparison across individuals at different stages of bNAb development. We found that the immunologic gene signature is elevated only early in breadth development in bNAb producers (when only a limited number of viral strains are neutralized) and later converges to levels comparable to non-bNAb producers as neutralization breadth increases. This aligns with previous studies showing that factors such as CD4 levels inversely correlate with bNAb development only early in infection [4]. However, in our study, both groups had similar CD4 counts at the first time point. Focusing on the genes significantly correlated with breadth in both groups (S9C and S9D Fig, FDR < 0.05) revealed almost the entirety of the MHC class I antigen presentation pathway was anticorrelated with the development of breadth (i.e., pseudotime) and sometimes differentially altered or correlated with the level of cfHIV measured (Fig 4C). Hence, MHC class I antigen genes are highly elevated early in infection in participants who go on to develop bNAbs, and their abundance decrease as breadth develops, becoming comparable to that of participants who do not develop bNAbs. MHC class I antigen presentation is present in almost all cells of the body, presents antigens to be killed by cytotoxic T cells (CD8) but is not generally associated with antigen presenting cells or the production of antibodies by B cells [4]. Taken together, these results indicate that early immune activation associated with MHC class I antigen presentation, independent of viral load or CD4 count, is a relevant factor in the development of bNAbs.

Microbiome associated with bNAbs and the development of breadth

Development of bNAbs has been hypothesized to be triggered by antigen cross-reactivity between HIV and bacterial components present in the gut microbiome [24,25,46,47]. To test whether a differentiated microbiota could be observed in participants who developed bNAbs, we mapped all non-human reads (both cfDNA and cfRNA) to a database of bacterial and viral genomes. After discarding reads that mapped to potential reagent contaminants (S10 Fig, Methods) [48], we identified 249 taxa in cfDNA and 4366 taxa in cfRNA that could be confidently attributed to the participants microbiome. We combined data from all participants to assemble a phylogenetic tree of the human microbiome during HIV infection as measured from circulating nucleic acids (S11 Fig). This accounted for ~70% of non-human reads in both cfDNA and cfRNA, and spanned several areas of the tree of life including archaea, bacteria and both DNA and RNA viruses. Among other features, we identified the presence of several DNA and RNA viruses such as Anelloviruses, Flaviviruses, Herpesviruses indicative of concomitant viral infections for several participants. In cfRNA, we found that ribosomal reads encompass 37% of the nonhost microbiome reads, and we also identified a moderate fraction of reads (~20%) that could not be attributed to specific species and mapped to uncultured bacteria.

We then compared the microbiome of both groups (bNAb producers/controls) finding 19 genera that were differentially abundant between both groups (FDR < 0.05, |log2FC| > 1, Fig 5A). Among genera enriched in bNAb producers, we identified several taxa that include opportunistic pathogens and are commonly associated with mucosal surfaces and the gastrointestinal tract, including Enterobacter, Serratia and Leclercia (S12 Fig). By performing a temporal analysis, we found that the detection of these pathogens does not differ across timepoints and does not precede development of breadth (S13 Fig). Overall, and despite the relatively small sample size of this study, our data shows the existence of differences in the microbiota compositions in participants who develop bNAbs from those who do not.

thumbnail
Fig 5. Microbial differences and GBV-C enrichment in bNAb producers.

A) Volcano plot of differentially abundant microbial genera between bNAb producers and non-producers, colored by phylum (FDR < 0.05, Benjamini-Hochberg). Positive log2 fold change indicates enrichment in bNAb producers. B) Scatterplots showing cfGBV-C abundance in samples partitioned by time and study group (left, bNAbs producers, right, controls). The dashed line indicates the threshold used to define elevated cfGBV-C levels. Using a logistic mixed-effects model with participant as a random effect, bNAb producers showed higher odds of elevated GBV-C levels (odds ratio [OR]: 6.4, 95% CI: 2.3-17.9). In some participants, GBV-C read counts exceeded those of HIV up to 100-fold.

https://doi.org/10.1371/journal.ppat.1014066.g005

Additionally, metagenomic analysis of cfRNA revealed the presence of GB virus C (GBV-C, previously known as Hepatitis G) in several participants (Fig 5B). GBV-C is a flavivirus that infects lymphocytes, has no known pathology, and has been associated with improved survival in people with HIV infection [49,50]. Given the phylogenetic relationship between GBV-C and hepatitis C virus (HCV), we also examined reads mapping to HCV. However, we did not find evidence of HCV signal, whereas reads mapping to GBV-C showed broad coverage across the viral genome (S13 Fig). In our study, all bNAb producers had non-zero levels of GBV-C (p < 0.05, one-sided Fisher exact test) and showed higher odds of elevated GBV-C levels (Fig 5B). Differential gene abundance analysis on samples with high levels of GBV-C (defined as >1/2000 nonhost reads) revealed 42 host genes that are also elevated in these participants. Among top candidates we identified several genes of the MHC class II antigen presentation complex (HLA-DQA1, HLA-DQB1, HLA-DRB1), an HIV suppressive factor involved in inflammatory signaling (CCL3), several proinflammatory cytokines (IL6, EBI3), and immunoglobulin receptors (KIR3DL2). Notably, increased expression of inhibitory KIRs, including KIR3DL2, has been associated with bNAb development, suggesting a potential link between natural killer cell maturation and the regulation of antibody production [51]. These findings suggest that high levels of GBV-C might be associated with an increased activity of antigen presenting cells.

Discussion

In this work, we demonstrate simultaneous monitoring of viral infection and host response through sequencing of circulating nucleic acids (DNA and RNA) in plasma samples of PLWH prior to ART. We show that cfRNA can be used to characterize circulating HIV viral strains and, at the same time, to profile the host transcriptome, allowing characterization of immune activity during HIV infection. Analysis of fragment length and genome coverage suggests that HIV-derived cfRNA likely reflects fragmented viral RNA circulating in plasma rather than being dominated by intact full-length viral genomes. Subsampling analyses further shows that near-complete genome coverages (>90%) at moderate depths (≥50X) can be recovered from cfRNA, supporting the feasibility of viral genome analyses.

As cfRNA contains transcripts released from diverse tissues and cell types across the body [30], it offers a systemic view of immune processes that may be otherwise difficult to capture. We find that overall cfHIV levels—defined as HIV-derived fragments in cfRNA—correlate with abundance of host genes involved in immune responses to viral infections, including CCL5, TFRC, and CCL2, consistent with responses to peripheral viral burden. For instance, CCL5 encodes RANTES, a chemokine with HIV-suppressive activity that can block CCR5-mediated viral entry, while TFRC (CD71) and CCL2 (MCP-1) are involved in pathways commonly perturbed during infection [5254]. However, when comparing participants who develop bNAbs with those who did not, we observed a distinct and orthogonal transcriptomic signature suggestive of an enhanced immune activation early in infection that was specific to bNAb producers. This response is independent of viral load and CD4 counts, and is enriched in MHC class I antigen presentation genes, RANTES-associated pathways (though not CCL5 itself), and other immune activation pathways.

Previous studies have highlighted the central role of lymph nodes in shaping humoral responses during HIV infection, yet their direct study remains challenging [23,55,56]. Cell-free RNA may provide a non-invasive window into immune activity occurring in tissue compartments, including lymphoid tissues where Tfh-B cell interactions drive affinity maturation. Hence, it is plausible that the immune-related cfRNA transcripts we report here reflects immune activity within lymph nodes, consistent with previous evidence that cfRNA can capture tissue-resident hematopoietic activity such as bone marrow progenitor transcripts [37]. However, we emphasize that cfRNA represents an integrated signal from multiple sources, and we cannot exclude the possibility that increased cfRNA levels are due to contributions of circulating immune cells or other tissues.

Analysis of non-human reads further allowed us to assess the microbiome and virome, revealing a reduced set of taxa whose presence or absence is associated with the development of bNAbs. Microbial sequences detected in plasma cfRNA/cfDNA most likely reflect leakage of microbial products into circulation due to barrier dysfunction and dysbiosis, a process well described in chronic HIV infection [5760]. While gut microbiota has been speculated to increase antigen cross-reactivity, our data cannot establish a direct role in bNAb development. Surprisingly, we found that participants who developed bNAbs consistently had higher levels of GBV-C. GBV-C is a non-pathogenic lymphotropic flavivirus that has been associated with improved outcomes in PLWH [49], with some studies suggesting this may reflect an indirect rather than a causal association [50]. Mechanistically, co-infection with GBV-C has been reported to inhibit HIV replication and modulate cytokine release, co-receptor expression, and T-cell homeostasis [50,61,62]. In our study, GBV-C abundance correlated with increased abundance of proinflammatory cytokines and adaptative immune genes, raising the possibility that co-infection influences the host immune milieu in a way that could facilitate bNAb development. While causality cannot be inferred from our dataset, understanding such modulatory viral co-infections early after HIV acquisition could inform strategies to mimic beneficial immune environments in the context of HIV vaccine design.

Our study represents a small pilot investigation (42 samples, 14 PLWH) in a well-characterized cohort of young women from South Africa [63], followed from prior to HIV acquisition into acute infection and before initiation of ART. Biobanked samples from such cohorts are an invaluable resource for studies of factors associated with the development of breadth, as earlier ARV treatment regimens are now standard of care. Although this provides a relatively homogeneous and well-controlled population for an initial discovery study, the small sample size limits generalizability and statistical power. For example, associations between viral load or CD4 counts and bNAb induction reported in other studies may not be detectable here due to the limit size of this pilot cohort. Expanded studies within the CAPRISA cohort or in other settings, and with finer temporal resolution, will be important to validate the findings reported here and their relationship to bNAb development. Despite these limitations, this work illustrates the potential of cfRNA/cfDNA sequencing as a non-invasive approach to interrogate host, viral, and microbial processes simultaneously from a single plasma sample. This approach could be of use to support biomarker discovery in vaccine studies and help identify biological contexts that favor the induction of bNAbs.

Materials and methods

Ethics statement

The CAPRISA 002 Acute Infection study was reviewed and approved by the research ethics committees of the University of KwaZulu-Natal (E013/04), the University of Cape Town (025/2004), and the University of the Witwatersrand (MM040202). All participants provided written informed consent for screening, enrolment and specimen storage.

Participants

The study utilized biobanked plasma specimen of PLWH with known neutralization status enrolled in the CAPRISA cohorts. The CAPRISA 002 Acute Infection cohort was established in 2004 and has been following women from the early/acute stage of infection [63]. We included participants from two CAPRISA clinical sites, Ethekwini in Durban (site 1) and Vulindlela (site 2), and accounted for potential site-specific effects in all downstream analyses.

Neutralization measurements

Neutralization breadth was measured using a standardized pseudovirus assay that included 18 Env-pseudotyped HIV viruses (S1 Table) [64,65]. Briefly, HIV pseudoviruses were produced by co-transfecting HEK293T cells with each HIV-1 Env plasmid and a pSG3 backbone. Serial dilutions of heat-inactivated plasma were preincubated with each pseudovirus, followed by the addition of TZM-bl cells. After 48h, infection was quantified by luciferase read-out, and titers reported as the reciprocal dilution where 50% of the virus is neutralized (IC50). Breadth was defined as the percentage of pseudoviruses neutralized at an IC50 ≥ 1:45.

Participants were classified as bNAb producers if they achieved ≥20% breadth within year of infection or ≥40% breadth against the 18-pseudovirus panel at any three years post-infection.

Isolation of cfDNA and cfRNA and library preparation

Blood samples were collected in sodium citrate tubes and plasma extracted according to the standard clinical centrifugation protocol of the HIV/AIDS Network Coordination, and stored at -80 C. Each plasma sample was thawed once and split into two aliquots for cfDNA/cfRNA extraction (0.4-1 mL per aliquot). To minimize confounding effects from hemolysis and platelet-derived RNA, plasma samples were visually inspected prior to processing for signs of hemolysis (e.g., pink discoloration), which was not visible for any sample. Immediately before nucleic acid extraction, plasma samples were subjected to a final centrifugation step (1 min at 16,000 rpm) to remove any residual debris. cfDNA was extracted using the Qiagen Circulating Nucleic Acid kit. cfRNA was extracted using the Plasma/Serum Circulating RNA and Exosomal Purification kit (Norgen, cat 42800), followed by DNA digestion using Baseline-ZERO DNase (Epicentre) and then cleaned up using the RNA Clean and Concentrator-5 kit (Zymo) [32,66]. cfDNA/cfRNA libraries were prepared using the automated Ovation SP Ultralow Library Systems (Mondrian ST) and the SMARTer Stranded Total RNA Pico Input Mammalian kit (Clontech) respectively [32,66]. A negative control (1 mL PBS, Gibco) was processed in parallel for every batch to control for potential contaminants [67].

Sequencing methods

Paired end Illumina sequencing (2 × 75 bp) was performed on the Nextseq 500 for cfDNA and on the Novaseq 6000 for cfRNA. This resulted in a median of 69 million (cfDNA) and 109 million (cfRNA) reads per sample (S3 Fig).

Sequencing analysis

Sequencing reads from cfRNA and cfDNA were processed using a shared initial pipeline. First, low quality bases and adapter sequences were trimmed by Trimmomatic v0.38 [68], and overlapping read pairs were merged using FLASH v1.2.11 [69]. Merged reads were then aligned to the UniVec database using bowtie2 to remove reads mapping to common vectors, linkers and control sequences [70]. These processing QC steps removed only a fraction of reads in cfRNA and cfDNA libraries from most plasma samples (S2 and S3 Figs). Higher cleaning rates were observed in cfDNA negative controls (~50%) and in three cfRNA plasma samples (~25%), that may reflect lower sample input or quality. Overall, no differences in QC processing were observed between bNAb producers and non-producers.

The remaining reads were first aligned to the human genome (GRCh38, Ensembl) using bowtie2, retaining unmapped reads, which were subsequently passed through a second homology filtering by BLASTN against the NCBI nt database restricted to taxid 9606, to obtain non-human reads for microbial and viral analysis. Taxonomic classification of non-human reads was performed by BLASTN (NCBI BLAST+ v2.8) against the NCBI nt database, using curated taxid lists limited to viral, bacterial and fungal references [66,67]. BLAST hits were filtered by requiring ≥80% nucleotide identity and ≥80% per-read query coverage, also de-duplicating counts to the best-supported accession to avoid counting the same read twice across hits. Subsequently, per-accession counts and coverage profiles were generated and aggregated at taxonomic ranks for downstream analysis.

For cfRNA, reads were additionally aligned to the human genome using STAR v.25.4a [71] and duplicate-filtered using Picard v2.18.2,and gene abundance was quantified using HTSeq [72]. Downstream analysis of gene abundance data is detailed in the following section.

For HIV-specific analysis, non-human cfRNA reads were re-aligned to the HIV-1 group M consensus reference (CON-C.fa) using bowtie2. Consensus sequences were generated with samtools mpileup and bcftools consensus [73], followed by multiple sequence alignment using MAFFT v7 [74]. A maximum-likelihood phylogenetic tree was constructed using IQ-TREE [75], and visualized with Bio.Phylo [76], with metadata-based label coloring and shaded backgrounds highlighting predefined sample clusters. Previously published HIV reference sequences from the same CAPRISA participants were obtained from the Los Alamos National Laboratory (LANL) HIV Sequence Database (S2 Table). Sample identity was primarily validated by assessing HIV sequence consistency across all the three time-points per participant (Figs 2 and S3). As an orthogonal check, we performed HLA-B typing on cfRNA using seq2HLA (v2.3, 4-digit resolution) [77], which confirmed exclusion of one sample (CAP200, time point 1), where a mismatch was detected in both the HIV sequence analysis and HLA-B typing.

To control for potential environmental contamination in both cfDNA and cfRNA data, six negative controls were used to create a database of background environmental taxa that were blacklisted in downstream analysis. This filtering preserved 72% of microbial reads in cfRNA and 17% in cfDNA. In addition, non-template control barcodes (n = 8) were spiked into the cfRNA libraries to estimate sample cross-contamination, which was found to be < 0.5% and compatible with index-hopping during sequencing. This threshold was used to define the detection limit for each individual microbial taxon.

Differential gene abundance analysis

Differential Gene Abundance Analysis was performed using edgeR with a generalized linear model and quasi-likelihood F-tests. Site of sample collection was included as a covariate in the model to account for potential confounding effects. All p-values were corrected for multiple hypothesis testing using the Benjamini-Hochberg method, and results are reported as false discovery rate (FDR). For downstream analysis, we selected genes with FDR < 0.05 and absolute fold change > 1. Associations between gene abundance and clinical metadata (e.g., viral load, CD4 + counts) were measured using nonparametric Spearman correlation unless otherwise stated (SciPy package, Python). Differential abundance of microbial taxa was also assessed in edgeR using generalized linear models, with the same thresholds and covariate adjustments. Overall, no significant site-associated differences were observed in either the transcriptomic or microbial enrichment analyses.

Pseudotime analysis

Pseudotime analysis was used as an alternative approach to investigate factors associated with the development of neutralization breadth, given the limited number of longitudinal samples and heterogeneity in bNAb development across individuals. Samples were ordered according to the number of pseudoviruses neutralized by each plasma sample, which was used as a proxy for progression along the bNAb development continuum. This trajectory-inference approach allows us to account for differences in the pace of bNAb development across individuals. Spearman’s rank correlation analyses were then performed against this pseudotime variable to evaluate changes both between bNAb producers and non-producers and along the continuous trajectory of breadth progression.

GBV-C analysis

GBV-C reads were normalized to the total number of non-human reads per sample and scaled by a factor of 1,000 (GBV-C reads per 1,000 non-human reads), analogous to the normalization used for cfHIV. Differences in GBV-C abundance between bNAb producers and non-producers were assessed using a logistic mixed-effects model (statsmodels package, Python), where we model GBV-C abundance as a binary outcome and included participant as a random effect to account for repeated sampling across time points.

Supporting information

S1 Fig. Clinical information for blood samples.

Longitudinal trajectories for CD4 (orange) and viral load (purple). Diamonds indicate timepoints sequenced in this study. Grey bars indicate values for CD4 count < 200.

https://doi.org/10.1371/journal.ppat.1014066.s001

(TIF)

S2 Fig. Sequencing quality control metrics per sample.

A) Number of reads (2x75 bp) sequenced in each sample and negative controls. Two cfDNA samples failed extraction/library preparation. B) Percentage of reads cleaned in QC steps (low quality or that align to the UniVec Core database). Percentages are generally very low, except in cfDNA negative controls, which had a high proportion of primer/library-derived sequences. C) Percentage of reads mapping to the human genome. D) Total number of non-human reads, typically more than a million for the cfRNA samples and around 10 thousand for the cfDNA samples. Samples are ordered by time within each participant.

https://doi.org/10.1371/journal.ppat.1014066.s002

(TIF)

S3 Fig. Sequencing quality control metrics summarized by study group.

A) Total sequencing depth per sample (raw read counts). B) Read counts retained after QC cleaning steps (see Methods) C) Percentage of reads removed during cleaning D) Percentage of human-mapped reads after cleaning. Metrics are shown for cfRNA (left) and cfDNA (right) libraries across bNAb producers, non-producers, and negative controls. Data are shown as mean ± SEM.

https://doi.org/10.1371/journal.ppat.1014066.s003

(TIF)

S4 Fig. Cell-type deconvolution analysis of cfRNA.

(a) Relative cell-type contributions to the cfRNA composition for each sample. Samples are grouped by participant and study group (bNAb producer or control), and less abundant cell types are aggregated into ‘Other’. (b) Box plots showing the relative abundance of each cell type in bNAb producers and controls. Statistical significance was assessed using a Mann–Whitney U test, with p-values adjusted for multiple hypothesis testing using Benjamini–Hochberg.

https://doi.org/10.1371/journal.ppat.1014066.s004

(TIF)

S5 Fig. HIV genotypes, coverage and mutations.

A) Heatmap of coverage across the genome in each sample. The four low abundance samples are marked with black squares on the left hand panel. B) Total number of read fragments aligning to HIV in each sample. C) Phylogenetic tree of HIV consensus genotypes for each sample obtained in this study together with 200 other genotypes of HIV-1 group M, subtype C from South Africa. D) Locations of SNVs local to each participant across the three time points. Most cluster within the env gene. E) Mutation rate observed in the samples using differences in the consensus genotypes within participants.

https://doi.org/10.1371/journal.ppat.1014066.s005

(TIF)

S6 Fig. Fragment length and coverage of cfHIV reads.

A) Distribution of paired-end genomic spans for reads mapping to the HIV genome, showing a predominance of short fragments (200–600 bp). B) Mean paired-end genomic span as a function of genomic start position, showing no systematic trends that could indicate priming from long RNA templates. (c) Genomic coverage profile across the HIV genome, shown alongside the reference organization of the HIV genome.

https://doi.org/10.1371/journal.ppat.1014066.s006

(TIF)

S7 Fig. Subsampling analysis of HIV genome coverage from cfRNA.

A) Fraction of the HIV genome covered above increasing minimum depth thresholds (1X-50X) at different subsampling levels of sample CAP261. B) Number of HIV-mapped reads versus mean coverage depth across the HIV genome, showing a linear relationship. C) Genome coverage achieved for each sample in the cohort at ≥10X (left) and ≥50X (right) depth thresholds.

https://doi.org/10.1371/journal.ppat.1014066.s007

(TIF)

S8 Fig. Correlations of genes with different disease measurements.

A) Distributions of Spearman correlations of all genes with either cfHIV counts (top, blue), CD4 counts (middle, orange), or viral load (bottom, purple). Gray bands indicate regions of significant correlation as determined by the Spearman test on the data. The small ticks (rugplot) at the bottom of each plot indicate known HIV genes. B) Tables of the number of genes strongly correlated/anticorrelated between genes and measurements (as in A). C) Scatter plots of cfHIV abundance vs gene expression (z-score) for the 14 significantly correlated known HIV-associated genes. Besides HLA-A, none of these show strong differences between the two study groups.

https://doi.org/10.1371/journal.ppat.1014066.s008

(TIF)

S9 Fig. Enriched gene pathways in bNAb producers.

A) Top gene ontologies (molecular function, biological process) and pathways enriched in genes that are elevated in bNAb producers. Numbers indicate the number of selected genes compared with the total number assigned to that category. B) Similar to A, but for the genes with decreased abundance in bNAb producers. Note that scores are much lower for decreased genes than for elevated genes and no clear enriched groups of genes are present in decreased genes. C) Distribution of genes correlated with breadth in bNAb producers (upper, red) and controls (lower, blue). D) Tables of the number of genes meeting correlation thresholds in (C).

https://doi.org/10.1371/journal.ppat.1014066.s009

(TIF)

S10 Fig. Background microbial sequences in negative controls.

Volcano plot showing differential abundance of microbial genera between plasma samples and negative controls (NC) (FDR < 0.05, Benjamini–Hochberg). Negative log2 fold-change values indicate genera enriched in NC, which were used to define background levels. Several NC-enriched taxa showed large effect sizes (|log2FC| > 4, FDR < 10-8), primarily comprising environmental taxa previously reported in DNA extraction kits and reagents in low-biomass sequencing.

https://doi.org/10.1371/journal.ppat.1014066.s010

(TIF)

S11 Fig. Phylogenetic tree of microbial sequences from circulating nucleic acids in HIV infection (all participants).

A) Phylogenetic tree of cfDNA-derived microbiome reads pooled across all participants (PLWH who do and do not develop bNAbs). The main branches represent archaea, bacteria and viruses (from left to right); representative viruses including Torque Teno Viruses (TTV), Epstein-Barr virus (EBV) and cytomegalovirus (CMV) are highlighted. B) Phylogenetic tree of cfRNA-derived microbiome reads pooled across all participants. The main branches represent archaea, bacteria and viruses (from left to right). Within the viral branch, HIV and GB virus C are the main contributors to the virome.

https://doi.org/10.1371/journal.ppat.1014066.s011

(TIF)

S12 Fig. Subset of microbial genera enriched in bNAb producers.

Boxplots showing normalized abundances of selected microbial genera across negative controls (NC), bNAb non-producers, and bNAb producers. Values are shown as log2-transformed abundances relative to the median abundance in negative controls.

https://doi.org/10.1371/journal.ppat.1014066.s012

(TIF)

S13 Fig. Pseudo-temporal dynamics in differentially abundant genera between bNAb producers and non-producers.

Scatter plots of the abundance (z-score) vs breadth production (%) for the 22 genera found to be differentially abundant between case (bNAbs) and control (no bNAbs). No apparent time differences found.

https://doi.org/10.1371/journal.ppat.1014066.s013

(TIF)

S14 Fig. Coverage profiles of HCV and GBV-C genomes across samples.

A) Read coverage across the HCV genome. B) Read coverage across the GBV-C genome. Coverage for individual samples is shown in red (HCV) and blue (GBV-C); mean coverage is shown in black.

https://doi.org/10.1371/journal.ppat.1014066.s014

(TIFF)

S1 Table. Neutralization data for the panel of 18 Env-pseudotyped HIV viruses.

https://doi.org/10.1371/journal.ppat.1014066.s015

(XLSX)

S2 Table. Participant codes and corresponding accession numbers in the HIV database (https://www.hiv.lanl.gov/content/index) for the reference FASTA sequences included in this study.

https://doi.org/10.1371/journal.ppat.1014066.s016

(XLSX)

Acknowledgments

Finally, we are indebted to the study participants for their generous support of scientific research.

References

  1. 1. UNAIDS Report 2017. Ending AIDS. Progress towards the 90-90-90 targets. (Global Aids Update 2017). Joint United Nations Programme on HIV/AIDS (UNAIDS); 2017.
  2. 2. Deeks SG, Overbaugh J, Phillips A, Buchbinder S. HIV infection. Nat Rev Dis Primers. 2015;1:15035. pmid:27188527
  3. 3. Moore PL, Gray ES, Choge IA, Ranchobe N, Mlisana K, Abdool Karim SS, et al. The c3-v4 region is a major target of autologous neutralizing antibodies in human immunodeficiency virus type 1 subtype C infection. J Virol. 2008;82(4):1860–9. pmid:18057243
  4. 4. Gray ES, Moore PL, Choge IA, Decker JM, Bibollet-Ruche F, Li H, et al. Neutralizing antibody responses in acute human immunodeficiency virus type 1 subtype C infection. J Virol. 2007;81(12):6187–96. pmid:17409164
  5. 5. Li B, Decker JM, Johnson RW, Bibollet-Ruche F, Wei X, Mulenga J, et al. Evidence for potent autologous neutralizing antibody titers and compact envelopes in early infection with subtype C human immunodeficiency virus type 1. J Virol. 2006;80(11):5211–8. pmid:16699001
  6. 6. Richman DD, Wrin T, Little SJ, Petropoulos CJ. Rapid evolution of the neutralizing antibody response to HIV type 1 infection. Proc Natl Acad Sci U S A. 2003;100(7):4144–9. pmid:12644702
  7. 7. Landais E, Huang X, Havenar-Daughton C, Murrell B, Price MA, Wickramasinghe L, et al. Broadly neutralizing antibody responses in a large longitudinal sub-Saharan HIV primary infection cohort. PLoS Pathog. 2016;12(1):e1005369. pmid:26766578
  8. 8. Corey L, Gilbert PB, Juraska M, Montefiori DC, Morris L, Karuna ST, et al. Two randomized trials of neutralizing antibodies to prevent HIV-1 acquisition. N Engl J Med. 2021;384:1003–14.
  9. 9. Caniels TG, Prabhakaran M, Ozorowski G, MacPhee KJ, Wu W, van der Straten K, et al. Precise targeting of HIV broadly neutralizing antibody precursors in humans. Science. 2025;389(6759):eadv5572. pmid:40373114
  10. 10. Willis JR, Prabhakaran M, Muthui M, Naidoo A, Sincomb T, Wu W, et al. Vaccination with mRNA-encoded nanoparticles drives early maturation of HIV bnAb precursors in humans. Science. 2025;389(6759):eadr8382. pmid:40373112
  11. 11. Leggat DJ, Cohen KW, Willis JR, Fulp WJ, deCamp AC, Kalyuzhniy O, et al. Vaccination induces HIV broadly neutralizing antibody precursors in humans. Science. 2022;378(6623):eadd6502. pmid:36454825
  12. 12. Rusert P, Kouyos RD, Kadelka C, Ebner H, Schanz M, The Swiss HIV Cohort Study. Determinants of HIV-1 broadly neutralizing antibody induction. Nat Med. 2016;22:1260–7.
  13. 13. Moody MA, Pedroza-Pacheco I, Vandergrift NA, Chui C, Lloyd KE, Parks R, et al. Immune perturbations in HIV-1-infected individuals who make broadly neutralizing antibodies. Sci Immunol. 2016;1(1):aag0851. pmid:28783677
  14. 14. Haynes BF, Shaw GM, Korber B, Kelsoe G, Sodroski J, Hahn BH, et al. HIV-host interactions: implications for vaccine design. Cell Host Microbe. 2016;19(3):292–303. pmid:26922989
  15. 15. Roskin KM, Jackson KJL, Lee J-Y, Hoh RA, Joshi SA, Hwang K-K, et al. Aberrant B cell repertoire selection associated with HIV neutralizing antibody breadth. Nat Immunol. 2020;21(2):199–209. pmid:31959979
  16. 16. Landais E, Moore PL. Development of broadly neutralizing antibodies in HIV-1 infected elite neutralizers. Retrovirology. 2018;15(1):61. pmid:30185183
  17. 17. Euler Z, van Gils MJ, Bunnik EM, Phung P, Schweighardt B, Wrin T, et al. Cross-reactive neutralizing humoral immunity does not protect from HIV type 1 disease progression. J Infect Dis. 2010;201(7):1045–53. pmid:20170371
  18. 18. Gray ES, Madiga MC, Hermanus T, Moore PL, Wibmer CK, Tumba NL, et al. The neutralization breadth of HIV-1 develops incrementally over four years and is associated with CD4+ T cell decline and high viral load during acute infection. J Virol. 2011;85(10):4828–40. pmid:21389135
  19. 19. Cottrell CA, Hu X, Lee JH, Skog P, Luo S, Flynn CT, et al. Heterologous prime-boost vaccination drives early maturation of HIV broadly neutralizing antibody precursors in humanized mice. Sci Transl Med. 2024;16(748):eadn0223. pmid:38753806
  20. 20. Cohen K, Altfeld M, Alter G, Stamatatos L. Early preservation of CXCR5+ PD-1+ helper T cells and B cell activation predict the breadth of neutralizing antibody responses in chronic HIV-1 infection. J Virol. 2014;88(22):13310–21. pmid:25210168
  21. 21. Locci M, Havenar-Daughton C, Landais E, Wu J, Kroenke MA, Arlehamn CL. Human circulating PD-1 CXCR3−CXCR5 memory tfh cells are highly functional and correlate with broadly neutralizing HIV antibody responses. Immunity. 2013;39:758–69.
  22. 22. Perreau M, Savoye A-L, De Crignis E, Corpataux J-M, Cubas R, Haddad EK, et al. Follicular helper T cells serve as the major CD4 T cell compartment for HIV-1 infection, replication, and production. J Exp Med. 2013;210(1):143–56. pmid:23254284
  23. 23. Crotty S. T follicular helper cell biology: a decade of discovery and diseases. Immunity. 2019;50:1132–48.
  24. 24. Liao H-X, Chen X, Munshaw S, Zhang R, Marshall DJ, Vandergrift N, et al. Initial antibodies binding to HIV-1 gp41 in acutely infected subjects are polyreactive and highly mutated. J Exp Med. 2011;208(11):2237–49. pmid:21987658
  25. 25. Trama AM, Moody MA, Alam SM, Jaeger FH, Lockwood B, Parks R, et al. HIV-1 envelope gp41 antibodies can originate from terminal ileum B cells that share cross-reactivity with commensal bacteria. Cell Host Microbe. 2014;16(2):215–26. pmid:25121750
  26. 26. Williams WB, Liao H-X, Moody MA, Kepler TB, Alam SM, Gao F, et al. HIV-1 VACCINES. Diversion of HIV-1 vaccine-induced immunity by gp41-microbiota cross-reactive antibodies. Science. 2015;349(6249):aab1253. pmid:26229114
  27. 27. Loy C, Ahmann L, De Vlaminck I, Gu W. Liquid biopsy based on cell-free DNA and RNA. Annu Rev Biomed Eng. 2024;26(1):169–95. pmid:38346275
  28. 28. Moufarrej MN, Bianchi DW, Shaw GM, Stevenson DK, Quake SR. Noninvasive prenatal testing using circulating DNA and RNA: advances, challenges, and possibilities. Annu Rev Biomed Data Sci. 2023;6:397–418. pmid:37196360
  29. 29. Koh W, Pan W, Gawad C, Fan HC, Kerchner GA, Wyss-Coray T, et al. Noninvasive in vivo monitoring of tissue-specific global gene expression in humans. Proc Natl Acad Sci U S A. 2014;111(20):7361–6. pmid:24799715
  30. 30. Vorperian SK, Moufarrej MN, Tabula Sapiens Consortium, Quake SR. Cell types of origin of the cell-free transcriptome. Nat Biotechnol. 2022;40(6):855–61. pmid:35132263
  31. 31. Loy CJ, Servellita V, Sotomayor-Gonzalez A, Bliss A, Lenz JS, Belcher E, et al. Plasma cell-free RNA signatures of inflammatory syndromes in children. Proc Natl Acad Sci U S A. 2024;121(37):e2403897121. pmid:39240972
  32. 32. Pan W, Ngo TTM, Camunas-Soler J, Song C-X, Kowarsky M, Blumenfeld YJ, et al. Simultaneously monitoring immune response and microbial infections during pregnancy through plasma cfRNA sequencing. Clin Chem. 2017;63(11):1695–704. pmid:28904056
  33. 33. Camunas-Soler J, Gee EPS, Reddy M, Mi JD, Thao M, Brundage T, et al. Predictive RNA profiles for early and very early spontaneous preterm birth. Am J Obstet Gynecol. 2022;227(1):72.e1-72.e16. pmid:35398029
  34. 34. Ngo TTM, Moufarrej MN, Rasmussen M-LH, Camunas-Soler J, Pan W, Okamoto J, et al. Noninvasive blood tests for fetal development predict gestational age and preterm delivery. Science. 2018;360(6393):1133–6. pmid:29880692
  35. 35. Rasmussen M, Reddy M, Nolan R, Camunas-Soler J, Khodursky A, Scheller NM, et al. RNA profiles reveal signatures of future health and disease in pregnancy. Nature. 2022;601(7893):422–7. pmid:34987224
  36. 36. Moufarrej MN, Vorperian SK, Wong RJ, Campos AA, Quaintance CC, Sit RV, et al. Early prediction of preeclampsia in pregnancy with cell-free RNA. Nature. 2022;602(7898):689–94. pmid:35140405
  37. 37. Ibarra A, Zhuang J, Zhao Y, Salathia NS, Huang V, Acosta AD, et al. Non-invasive characterization of human bone marrow stimulation and reconstitution by cell-free messenger RNA sequencing. Nat Commun. 2020;11(1):400. pmid:31964864
  38. 38. Chang A, Loy CJ, Eweis-LaBolle D, Lenz JS, Steadman A, Andgrama A, et al. Circulating cell-free RNA in blood as a host response biomarker for detection of tuberculosis. Nat Commun. 2024;15(1):4949. pmid:38858368
  39. 39. Burton DR, Stanfield RL, Wilson IA. Antibody vs. HIV in a clash of evolutionary titans. Proc Natl Acad Sci U S A. 2005;102(42):14943–8. pmid:16219699
  40. 40. Haddox HK, Dingens AS, Bloom JD. Experimental estimation of the effects of all amino-acid mutations to HIV’s envelope protein on viral replication in cell culture. PLoS Pathog. 2016;12(12):e1006114. pmid:27959955
  41. 41. Burton DR, Mascola JR. Antibody responses to envelope glycoproteins in HIV-1 infection. Nat Immunol. 2015;16(6):571–6. pmid:25988889
  42. 42. Cuevas JM, Geller R, Garijo R, López-Aldeguer J, Sanjuán R. Extremely high mutation rate of HIV-1 in vivo. PLoS Biol. 2015;13(9):e1002251. pmid:26375597
  43. 43. Zanini F, Puller V, Brodin J, Albert J, Neher RA. In vivo mutation rates and the landscape of fitness costs of HIV-1. Virus Evol. 2017;3.
  44. 44. Meyaard L, Hovenkamp E, Keet IP, Hooibrink B, de Jong IH, Otto SA, et al. Single cell analysis of IL-4 and IFN-gamma production by T cells from HIV-infected individuals: decreased IFN-gamma in the presence of preserved IL-4 production. J Immunol. 1996;157(6):2712–8. pmid:8805678
  45. 45. Dugast A-S, Arnold K, Lofano G, Moore S, Hoffner M, Simek M, et al. Virus-driven inflammation is associated with the development of bNAbs in spontaneous controllers of HIV. Clin Infect Dis. 2017;64(8):1098–104. pmid:28158448
  46. 46. Mascola JR, Haynes BF. HIV-1 neutralizing antibodies: understanding nature’s pathways. Immunol Rev. 2013;254(1):225–44. pmid:23772623
  47. 47. Moir S, Fauci AS. B-cell responses to HIV infection. Immunol Rev. 2017;275(1):33–48. pmid:28133792
  48. 48. Salter SJ, Cox MJ, Turek EM, Calus ST, Cookson WO, Moffatt MF, et al. Reagent and laboratory contamination can critically impact sequence-based microbiome analyses. BMC Biol. 2014;12:87. pmid:25387460
  49. 49. Xiang J, Wünschmann S, Diekema DJ, Klinzman D, Patrick KD, George SL, et al. Effect of coinfection with GB virus C on survival among patients with HIV infection. N Engl J Med. 2001;345(10):707–14. pmid:11547739
  50. 50. Schwarze-Zander C, Blackard JT, Rockstroh JK. Role of GB virus C in modulating HIV disease. Expert Rev Anti Infect Ther. 2012;10(5):563–72. pmid:22702320
  51. 51. Bradley T, Peppa D, Pedroza-Pacheco I, Li D, Cain DW, Henao R, et al. RAB11FIP5 expression and altered natural killer cell function are associated with induction of HIV broadly neutralizing antibody responses. Cell. 2018;175(2):387-399.e17. pmid:30270043
  52. 52. Mengozzi M, De Filippi C, Transidico P, Biswas P, Cota M, Ghezzi S, et al. Human immunodeficiency virus replication induces monocyte chemotactic protein-1 in human macrophages and U937 promonocytic cells. Blood. 1999;93(6):1851–7. pmid:10068657
  53. 53. Packard TA, Schwarzer R, Herzig E, Rao D, Luo X, Egedal JH, et al. CCL2: a chemokine potentially promoting early seeding of the latent HIV reservoir. mBio. 2022;13(5):e0189122. pmid:36073812
  54. 54. Madrid R, Janvier K, Hitchin D, Day J, Coleman S, Noviello C, et al. Nef-induced alteration of the early/recycling endosomal compartment correlates with enhancement of HIV-1 infectivity. J Biol Chem. 2005;280(6):5032–44. pmid:15569681
  55. 55. Cirelli KM, Carnathan DG, Nogal B, Martin JT, Rodriguez OL, Upadhyay AA, et al. Slow delivery immunization enhances HIV neutralizing antibody and germinal center responses via modulation of immunodominance. Cell. 2019;177(5):1153-1171.e28. pmid:31080066
  56. 56. Lindqvist M, van Lunzen J, Soghoian DZ, Kuhl BD, Ranasinghe S, Kranias G, et al. Expansion of HIV-specific T follicular helper cells in chronic HIV infection. J Clin Invest. 2012;122(9):3271–80. pmid:22922259
  57. 57. Alzahrani J, Hussain T, Simar D, Palchaudhuri R, Abdel-Mohsen M, Crowe SM, et al. Inflammatory and immunometabolic consequences of gut dysfunction in HIV: parallels with IBD and implications for reservoir persistence and non-AIDS comorbidities. EBioMedicine. 2019;46:522–31. pmid:31327693
  58. 58. Das Adhikari U, Froehle LM, Pipkin AN, Baharlou H, Linder AH, Shah P, et al. Immunometabolic defects of CD8+ T cells disrupt gut barrier integrity in people with HIV. Cell. 2025;188(20):5666-5679.e19. pmid:40939591
  59. 59. Brenchley JM, Price DA, Schacker TW, Asher TE, Silvestri G, Rao S, et al. Microbial translocation is a cause of systemic immune activation in chronic HIV infection. Nat Med. 2006;12(12):1365–71. pmid:17115046
  60. 60. Nazli A, Chan O, Dobson-Belaire WN, Ouellet M, Tremblay MJ, Gray-Owen SD, et al. Exposure to HIV-1 directly impairs mucosal epithelial barrier integrity allowing microbial translocation. PLoS Pathog. 2010;6(4):e1000852. pmid:20386714
  61. 61. Stapleton JT. Human pegivirus type 1: a common human virus that is beneficial in immune-mediated disease? Front Immunol. 2022;13:887760. pmid:35707535
  62. 62. Acchioni C, Sandini S, Acchioni M, Sgarbanti M. Co-infections and superinfections between HIV-1 and other human viruses at the cellular level. Pathogens. 2024;13:349.
  63. 63. van Loggerenberg F, Mlisana K, Williamson C, Auld SC, Morris L, Gray CM, et al. Establishing a cohort at high risk of HIV infection in South Africa: challenges and experiences of the CAPRISA 002 acute infection study. PLoS One. 2008;3(4):e1954. pmid:18414658
  64. 64. Montefiori DC. Evaluating neutralizing antibodies against HIV, SIV, and SHIV in luciferase reporter gene assays. In: CP in immunology, vol. 64; 2005. p. 12.11.1-12.11.17.
  65. 65. Moyo-Gwete T, Scheepers C, Makhado Z, Kgagudi P, Mzindle NB, Ziki R, et al. Enhanced neutralization potency of an identical HIV neutralizing antibody expressed as different isotypes is achieved through genetically distinct mechanisms. Sci Rep. 2022;12(1):16473. pmid:36182959
  66. 66. De Vlaminck I, Khush KK, Strehl C, Kohli B, Luikart H, Neff NF, et al. Temporal response of the human virome to immunosuppression and antiviral therapy. Cell. 2013;155(5):1178–87. pmid:24267896
  67. 67. Kowarsky M, Camunas-Soler J, Kertesz M, De Vlaminck I, Koh W, Pan W, et al. Numerous uncharacterized and highly divergent microbes which colonize humans are revealed by circulating cell-free DNA. Proc Natl Acad Sci U S A. 2017;114(36):9623–8. pmid:28830999
  68. 68. Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30(15):2114–20. pmid:24695404
  69. 69. Magoč T, Salzberg SL. FLASH: fast length adjustment of short reads to improve genome assemblies. Bioinformatics. 2011;27(21):2957–63. pmid:21903629
  70. 70. Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat Methods. 2012;9(4):357–9. pmid:22388286
  71. 71. Dobin A, Davis CA, Schlesinger F, Drenkow J, Zaleski C, Jha S, et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics. 2013;29(1):15–21. pmid:23104886
  72. 72. Putri GH, Anders S, Pyl PT, Pimanda JE, Zanini F. Analysing high-throughput sequencing data in Python with HTSeq 2.0. Bioinformatics. 2022;38(10):2943–5. pmid:35561197
  73. 73. Danecek P, Bonfield JK, Liddle J, Marshall J, Ohan V, Pollard MO, et al. Twelve years of SAMtools and BCFtools. Gigascience. 2021;10(2):giab008. pmid:33590861
  74. 74. Katoh K, Standley DM. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol. 2013;30(4):772–80. pmid:23329690
  75. 75. Minh BQ, Schmidt HA, Chernomor O, Schrempf D, Woodhams MD, von Haeseler A, et al. Corrigendum to: IQ-TREE 2: new models and efficient methods for phylogenetic inference in the genomic era. Mol Biol Evol. 2020;37(8):2461. pmid:32556291
  76. 76. Talevich E, Invergo BM, Cock PJA, Chapman BA. Bio.Phylo: a unified toolkit for processing, analyzing and visualizing phylogenetic trees in Biopython. BMC Bioinform. 2012;13:209. pmid:22909249
  77. 77. Boegel S, Löwer M, Schäfer M, Bukur T, de Graaf J, Boisguérin V, et al. HLA typing from RNA-Seq sequence reads. Genome Med. 2012;4(12):102. pmid:23259685