This is an uncorrected proof.
Figures
Abstract
Yellow fever virus (YFV), a mosquito-borne orthoflavivirus that causes severe hemorrhagic disease, is endemic in parts of South America and Africa, yet genomic data from Sierra Leone is lacking despite ongoing case-based surveillance. Using hybrid-capture metagenomic sequencing, we generated a complete 10,611 nt YFV genome (98% coverage) from an adult male patient who reported to the Kailahun Government Hospital with fever and muscle pain. Phylogenetic analysis assigned the genome to the West African II genotype via the YFV Nextstrain build. The Sierra Leone genome showed 57 substitutions, three of which were non-synonymous (NS2B: N79S, NS3: V515I, and NS5 (RdRp domain): A643V), relative to its most recent common ancestor with other genomes from Senegal and the Netherlands. Bayesian phylogenetics estimated the time to the most recent common ancestor with these genomes as January 14, 2001 (95% HPD: December 17, 1987 - April 28, 2009), potentially indicative of long-standing transmission within West Africa that has not been genomically characterized, rather than specific localization to Sierra Leone. Together, these findings underscore the need for expanded genomic surveillance to monitor YFV spread and evolution.
Author summary
Yellow fever is a viral disease spread by mosquitos which commonly causes fever and muscle aches, and can cause severe disease. Despite previously reported cases of yellow fever in Sierra Leone, there has not been any genomic information available about the virus in this population. In this work, we describe the first genome sequence from a yellow fever virus infection detected in Sierra Leone, and using phylogenetic approaches, we show that it is most genetically related to yellow fever virus sampled from Senegal and a Dutch individual that had travelled to Senegal and The Gambia. Our work fills a critical gap in the understanding of yellow fever virus diversity in West Africa, and highlights the important role that genomic surveillance plays in understanding circulating viral diseases, especially in regions such as Sierra Leone.
Citation: Sandi JD, Brock-Fisher TM, Kallon TMPS, Paye MF, Fofanah IU, Nosamiefan D, et al. (2026) Characterization of the first complete genome sequence of yellow fever virus (YFV) in Sierra Leone: Implications for public health. PLoS Negl Trop Dis 20(5): e0014354. https://doi.org/10.1371/journal.pntd.0014354
Editor: William B. Messer, Oregon Health and Science University, UNITED STATES OF AMERICA
Received: December 11, 2025; Accepted: May 8, 2026; Published: May 18, 2026
Copyright: © 2026 Sandi et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All previously available sequences used are included in S1 Table. The Sierra Leone sequence has been submitted to NCBI with accession PX571057. Code used for analysis is available from: https://github.com/broadinstitute/kgh-yellowfever.
Funding: This work is made possible by support from Flu Lab and a cohort of generous donors through TED’s Audacious Project, including the ELMA Foundation, MacKenzie Scott, the Skoll Foundation, and Open Philanthropy. J.E.P. acknowledges the support of The Rockefeller Foundation (PC-2022-POP-005). IGH also received support from The Rockefeller Foundation (Grant #2021 HTH 017). T.B.F acknowledges financial support by the Fulbright U.S. Student Program, which is sponsored by the U.S. Department of State. P.C.S. acknowledges support from the National Institute of Allergy and Infectious Diseases (U19AI110818). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Yellow fever virus (YFV) is the causative agent of yellow fever (YF), an acute mosquito-borne viral haemorrhagic fever disease endemic in several countries across South America, Central America, and Africa [1]. YFV is a member of the Flaviviridae family within the orthoflavivirus genus, and carries a single-stranded, positive-sense RNA genome approximately 10,862 nucleotides in length [1]. The virus is genetically diverse, with seven phylogenetically and geographically distinct YFV genotypes reported to date [2]. Within Africa, there are five circulating genotypes: two in West Africa, and three in East and Central Africa [3].
Despite the availability of a safe and effective YFV vaccine [4], YFV remains a re-emerging vector-borne viral pathogen of major global concern, with multiple outbreaks reported each year in Africa and South America [4,5]. A recent global estimate by Wang and colleagues suggests an annual incidence of more than 86,000 cases, most occurring in sub-Saharan Africa [1]. Most confirmed YFV cases in Africa occur among unvaccinated individuals, highlighting the continued impact of low routine vaccination coverage in high-risk regions [6]. In Sierra Leone, YFV is an epidemic-prone pathogen that is under case-based surveillance, and based on serological testing triggered by clinical presentation, five confirmed YF cases have been reported across the country over the past 20 years: one case most recently in 2023, two cases from an outbreak in 2011, and two cases from an outbreak in 2008 [7–9]. However, the absence of genomic data from Sierra Leone limits our understanding of relative transmission dynamics and how locally circulating viruses relate to the YFV clades circulating elsewhere in Africa.
Through the Sentinel program, a national pathogen-surveillance effort integrating clinical sample collection with standard and multiplex CRISPR-based detection and sequencing, we received a total of 2,057 clinical excess samples from individuals who presented to the Bo, Kailahun, and Kenema Government Hospitals, with fever from July 2024 to March 2025. We tested these samples for circulating epidemic-prone pathogens using the high-throughput Combinatorial Arrayed Reactions for Multiplexed Evaluation of Nucleic acids (CARMEN), a CRISPR/Cas13-based assay [10]. We then sequenced samples that tested positive by CARMEN for further molecular analysis. Here, we describe the molecular features of YFV detected in one sample among this testing, which produced the first complete YFV genome sequence obtained in Sierra Leone.
Results
Sierra Leone’s YFV genome is a member of the West Africa II genotype
We detected YFV in a single excess clinical sample of blood plasma from an adult male who presented to Kailahun Government Hospital in November 2024, with fever, joint pain, and muscle pain. The sample tested positive on the CARMEN CRISPR-based diagnostic platform and was selected for genomic analysis. Because this was a retrospective study, only limited clinical information was available, and follow-up was not possible once the patient left the collection area. Therefore, we were unable to determine the timing of the patient’s symptoms relative to sampling, their vaccination status, any recent travel history, or the outcome of their infection.
Based on a positive CARMEN result for YFV, we performed hybrid-capture metagenomic sequencing and generated a complete, 10,611 nt (98% coverage) YFV genome with an average read depth of 33.6 (range: 7–99; Fig 1A). Using the YFV Nextstrain build, the genome was assigned to the West Africa II genotype. We assembled a contextual dataset of 73 high-quality (>90% complete) YFV genomes from the West Africa I and II genotypes retrieved from NCBI and used these sequences to infer a maximum-likelihood phylogeny (S1 Table; Fig 1B).
(B) Maximum likelihood tree showing the Sierra Leone YFV genome (red) in relation to a contextual dataset of 73 YFV genomes from the West Africa I genotype (grey) and West Africa II genotype (black). The Sierra Leone sequence clusters within the West Africa II genotype. (C) Expanded view on the clade containing the Sierra Leone sequence, highlighting its position as a sister lineage to genomes from Senegal and a travel-associated case detected in the Netherlands. Numbers on branches represent the number of mutations inferred along each branch. (D) Gene map highlighting the locations of the two non-synonymous polymorphisms consistent with reversion (NS2B: N79S and NS5 (RdRp domain): A643V) and one substitution (NS3: V515I) unique to the Sierra Leone YFV genome. The blue arrow in the panel also marks the 6-nucleotide region in 3’UTR that is deleted in the Dutch and Senegalese genomes but retained in the Sierra Leone and Nigerian genomes.
Phylogenetic analysis revealed that the Sierra Leone genome formed a well supported (bootstrap support = 100) sister lineage to viruses recently detected in Senegal and a travel-associated case in the Netherlands who had reported travel in Senegal and The Gambia [11]. This combined cluster was most related to Nigeria samples collected between 2018 and 2020 (Fig 1C). Maximum-likelihood ancestral-state reconstruction in Nextstrain [12] indicated that these sequences likely descended from a Senegalese population (confidence = 86.7%), although the majority of genomes were collected in Senegal (S1 Fig; Fig 1B) which could potentially bias this estimation. The branch leading to the Sierra Leone YFV genome had 54 synonymous and 3 non-synonymous mutations, and the stem branch of its sister lineage had 33 synonymous and 5 non-synonymous mutations. Comparisons of the Nigerian, Dutch, and Sierra Leone YFV genomes revealed two non-synonymous polymorphisms consistent with reversion (NS2B: N79S and NS5 (RdRp domain): A643V) and one non-synonymous substitutions unique to the Sierra Leone genome (NS3: V515I). The Sierra Leone genome also retained the same nucleotide sequence at positions 10368–10373 of the 3’UTR as the Nigerian sample, rather than the 6-nucleotide deletion observed in the Dutch and Senegalese genomes (Fig 1D, blue arrow).
Sierra Leone’s YFV genome started to diverge from the West African population around mid-January 2001
Bayesian phylogenetic analysis in BEAST X [13] estimated that the Sierra Leone genome diverged from its closest related Senegalese and Dutch sequences around mid-January 2001. The median inferred time to the most recent common ancestor (tMRCA) was January 14, 2001 with a 95 percent highest posterior density (HPD) interval from December 17, 1987 to April 28, 2009 (Fig 2). The estimated evolutionary rate across the full tree was 2.49e-4 substitutions per site per year (95% HPD:1.31e-4 - 3.63e-4), consistent with previous reports for sequences from the West Africa II genotype [14]. These results are consistent with an established lineage diverging from a larger West African population and persisting regionally, although they cannot rule out an introduction from another undersampled region. This uncertainty is further compounded by the lack of travel history for the patient and the overrepresentation of genomes from Senegal in the dataset.
The time to the most recent common ancestor (tMRCA) is shown with the posterior height distribution, 95% HPD shaded in blue.
Discussion
Our genomic analysis supports sporadic serological data indicating YFV has been present in Sierra Leone for at least the past 20 years, although the limited availability of regional genomes constrains what can be concluded, and in particular we cannot rule out recent travel history of the patient described in this work. The placement of the Sierra Leone genome as a sister lineage to viruses from Senegal, together with a tMRCA in January 2001, is consistent with long-standing circulation within the wider West African region, although the specific geographic origin of this lineage cannot be determined from available data. Because YFV genomic data from Sierra Leone and neighboring countries are sparse, we cannot determine where this lineage circulated during the intervening decades or whether Sierra Leone was its primary reservoir. YFV ecology in West Africa also involves both sylvatic and zoonotic cycles, and cryptic circulation could reflect maintenance in wild non-human primates (NHP) or vectors rather than sustained human transmission. There have not been serological studies of YFV infection among NHP in the region, but modeling has suggested that conditions in Sierra Leone are favourable for serious transmission risk between NHP populations and human populations, and YFV antibodies have been detected elsewhere in Africa among NHP species that are endemic to Sierra Leone [15,16].
Our analysis identified two non-synonymous polymorphisms consistent with reversion to the genotype of the shared most recent common ancestor, as well as one substitution not observed in the next most closely related West African genomes, along with the retention of a 3′UTR region deleted in the Senegalese and Dutch sequences. At present, there is currently no evidence that these differences affect viral phenotype, and the functional significance, if any, of these differences remains unknown. Interpreting the impact of individual mutations is challenging without functional data, and future work will be needed to assess whether these differences influence viral fitness, vector interactions, or host range. Importantly, we did not detect mutations that have been experimentally linked to reduced vaccine neutralizing ability in mouse models [17], and there is no evidence from this genome to suggest altered vaccine sensitivity. This is consistent with epidemiological observations that most confirmed YFV cases in Africa occur among unvaccinated individuals and that current vaccines remain effective against circulating genotypes.
This study is subject to several limitations inherent to its retrospective design. We were unable to determine the patient’s vaccination status, clinical course, or specific risk factors for infection, as only limited metadata were available from this clinical excess sample. Although yellow fever vaccination has been routinely available in Sierra Leone since 2003, with reported national coverage exceeding 75% over the past two decades [7,18], we cannot assess whether vaccination status contributed to this individual case. Likewise, while the patient presented to a hospital setting, suggesting moderate to severe illness, we cannot characterize symptom progression or disease outcome. Although all available high-quality sequences were included in our contextual dataset, the dataset is unevenly sampled and enriched for sequences from Senegal, which may influence ancestral reconstruction. Finally, although we identified non-synonymous substitutions relative to closely related genomes, the functional impact of these mutations, if any, cannot be inferred without experimental data. Future work will be required to determine whether these substitutions influence viral fitness, vector competence, or host interactions.
Despite its limitations, the results of this study highlight the importance of strengthening surveillance for YFV in Sierra Leone and across West Africa, and in particular highlight the potential value of surveillance testing and sequencing after case detection. Routine testing for YFV in Sierra Leone is largely serological and based on those who present to clinics and hospitals. However, due to the similarity of symptoms between YF and other common infections, such as malaria or hepatitis B, testing may be under-utilized in clinical settings [7]. Further, while serological testing is useful for detecting exposure, it is insufficient for tracking viral diversity, origins, and movement. Integrating genomic sequencing into these clinical systems would provide critical complementary information to identify introductions, monitor lineage turnover, and evaluate potential evolutionary changes with relevance to diagnostics or vaccines. Additionally, testing among insect and NHP populations, along with increased surveillance testing of non-symptomatic individuals, would provide much better estimates on the broader prevalence of the YFV in the country and help inform public health measures to reduce spillover risks. Given the re-emergence of YFV across the region, expanding genomic surveillance, paired with ecological and epidemiological data, will be essential for enabling earlier detection, improving outbreak preparedness, and informing targeted public health responses.
Methods
Ethics statement
This work is part of a larger project, Surveillancing Circulating Pathogens in Sierra Leone, for which ethics approval was obtained from the Sierra Leone Ethics and Scientific Review Committee (SLESRC) (SLESRC No: 002/05/2024). No consent was obtained as the sample was clinical excess used only for research purposes and no identifying information about the participant was reported to the authors.
Sampling technique/method
Clinical excess plasma samples collected from individuals who presented with fever at the Bo, Kailahun, and Kenema Government Hospitals were sent to the Kenema Government Hospital (KGH) viral hemorrhagic fever (VHF) research lab for a multi-viral pathogen screening as part of an ongoing diagnostic surveillance study. All samples were de-identified before transfer to the KGH VHF laboratory and were transported under cold chain conditions (2–8°C) using the standard triple packaging systems. Upon arrival at the KGH VHF laboratory, specimens were stored at −20 °C until analysis.
Pathogen detection
Laboratory diagnosis of pathogens of public health concern was performed using a multiplexed polymerase chain reaction (PCR) approach combined with a Combinatorial Arrayed Reactions for Multiplexed Evaluation of Nucleic acids (CARMEN) CRISPR-based detection platform, as previously described [10]. Briefly, nucleic acids were extracted from 200 µL plasma samples using the Applied Biosystems MagMAX Pathogen RNA/DNA Kit (Cat #: 4462359) and subjected to a multiplex PCR amplification using in-house-designed pathogen-specific primer sets targeting the RNA-dependent RNA polymerase domain of the NS5 gene. Primer sequences and more details are available from [19]. Amplified amplicons were detected using CRISPR-Cas13, which allows for simultaneous detection of several targets in each sample on a Standard Biotools Biomark X instrument.
Genome sequencing
Plasma samples in which at least one pathogen was detected by CARMEN were sequenced as previously described [20]. Briefly, we used the Illumina RNA Prep with Enrichment (L) Kit and the Viral Surveillance Panel (VSP) 2.0 to generate virus-enriched libraries following the manufacturer’s protocol. The libraries were then sequenced pair-end on the Illumina MiSeq platform.
Sierra leone genome assembly
We first demultiplexed the samples using the Broad Institute demux-deplete pipeline (github.com/broadinstitute/viral-pipelines/demux_deplete), followed by genome assembly using the Broad Institute assemble-denovo-metagenomic (github.com/broadinstitute/viral-pipelines/assemble_denovo_metagenomic) in Terra.bio using default settings.
Dataset curation and maximum likelihood phylogenetics
We used Nextstrain [12] to search NCBI for YFV genomes that were at least 90% complete (length ≥ 9,775 nt). The resulting genomes were downloaded and Nextclade was then used to select only Clade IV and V sequences which correspond to West Africa I and II genotypes respectively, resulting in 73 contextual genomes (S1 Table). Complete genome sequences were then aligned to the vaccine strain reference (NC.002031) using augur align. A maximum likelihood phylogeny of these samples was generated with IQ-TREE v3.0.1 [21], using a GTR substitution model and 1000 bootstrap replicates. Ancestral state reconstruction and mutation annotation were completed with augur traits and augur ancestral respectively in Nextstrain. Trees were annotated and plotted using ggtree in R [22].
Bayesian phylogenetics
From the complete genomic dataset described above, a subset of samples from the West Africa II genotype was selected to generate a time-scaled phylogeny using BEAST X [13]. We ran one chain of 100 million generations, subsampling every 10,000 steps to continuous parameter log and tree files, using a codon based substitution model (SRD06; HKY + gamma + 2) [23,24], an uncorrelated relaxed lognormal clock, and a flexible coalescent prior (a Bayesian skyride model, allowing for changes in population size through time [25]). Convergence was assessed using Tracer [26] and all ESS values were confirmed to be over 200. 10% burnin was discarded and trees were combined to an MCC tree using TreeAnnotater (https://beast.community/treeannotator) and visualized using ggtree in R.
Supporting information
S1 Table. Table of included contextual sequences including accession number, collection date, collection location, host name, genotype, and whether or not the sample is included in Fig 1C expanded view tree.
https://doi.org/10.1371/journal.pntd.0014354.s001
(CSV)
S1 Fig. Barplot showing the number of included genomes, per genotype, from each country.
https://doi.org/10.1371/journal.pntd.0014354.s002
(TIF)
References
- 1. Wang X, Li B, He B, Yan X, Huang L, Li J, et al. The incidence and trends of yellow fever from 1990 to 2021 in major endemic regions: A systematic analysis based on the 2021 global burden of disease study. Pathogens. 2025;14(6):594. pmid:40559602
- 2. Bryant JE, Holmes EC, Barrett ADT. Out of Africa: A molecular perspective on the introduction of yellow fever virus into the Americas. PLoS Pathog. 2007;3(5):e75. pmid:17511518
- 3. Mutebi JP, Wang H, Li L, Bryant JE, Barrett AD. Phylogenetic and evolutionary relationships among yellow fever virus isolates in Africa. J Virol. 2001;75(15):6999–7008. pmid:11435580
- 4. Collins ND, Barrett ADT. Live attenuated yellow fever 17D vaccine: A legacy vaccine still controlling outbreaks in modern day. Curr Infect Dis Rep. 2017;19(3):14. pmid:28275932
- 5. Possas C, Lourenço-de-Oliveira R, Tauil PL, Pinheiro P, Pissinatti A, Cunha RV, et al. Yellow fever outbreak in Brazil: the puzzle of rapid viral spread and challenges for immunisation. Mem Inst Oswaldo Cruz. 2018;113(10):e180278. pmid:30427974
- 6. Ali MS, Mekonen EG. Yellow fever vaccine coverage and associated factors among under-five children in Kenya: Data from Kenyan Demographic and Health Survey 2022. Hum Vaccin Immunother. 2024;20(1):2391596. pmid:39165035
- 7. Sesay U, Elduma AH, Barrie HUI, Bangura H, Bah MS, Gebru GN. Diagnostic challenges and implications of a yellow fever co-infected with malaria case in Freetown, Sierra Leone, September 2023: An outbreak investigation. Journal of Interventional Epidemiology and Public Health. 2025;8.
- 8. World Health Organization WHO. Disease Outbreak News – “2011_03_11-en”. https://www.who.int/emergencies/disease-outbreak-news/item/2011_03_11-en. 2011.
- 9. World Health Organization WHO. Disease Outbreak News – “2009_01_06a-en”. https://www.who.int/emergencies/disease-outbreak-news/item/2009_01_06a-en. 2009. Accessed 2025 November 18.
- 10. Welch NL, Zhu M, Hua C, Weller J, Mirhashemi ME, Nguyen TG, et al. Multiplexed CRISPR-based microfluidic platform for clinical testing of respiratory viruses and identification of SARS-CoV-2 variants. Nat Med. 2022;28(5):1083–94. pmid:35130561
- 11. Phan MVT, Murad SD, van der Eijk AA, Metselaar HJ, Hartog H, Harinck F. Genomic sequence of yellow fever virus from a Dutch traveller returning from the Gambia-Senegal region, the Netherlands, November 2018. Eurosurveillance. 2019;24:1800684.
- 12. Hadfield J, Megill C, Bell SM, Huddleston J, Potter B, Callender C, et al. Nextstrain: Real-time tracking of pathogen evolution. Bioinformatics. 2018;34(23):4121–3. pmid:29790939
- 13. Baele G, Ji X, Hassler GW, McCrone JT, Shao Y, Zhang Z, et al. BEAST X for Bayesian phylogenetic, phylogeographic and phylodynamic inference. Nat Methods. 2025;22(8):1653–6. pmid:40624354
- 14. Sall AA, Faye O, Diallo M, Firth C, Kitchen A, Holmes EC. Yellow fever virus exhibits slower evolutionary dynamics than dengue virus. J Virol. 2010;84(2):765–72. pmid:19889759
- 15. Aliaga-Samanez A, Real R, Segura M, Marfil-Daza C, Olivero J. Yellow fever surveillance suggests zoonotic and anthroponotic emergent potential. Nature Communications Biology. 2022;5(1):1–12.
- 16. Oyono MG, Kenmoe S, Abanda NN, Takuissu GR, Ebogo-Belobo JT, Kenfack-Momo R, et al. Epidemiology of yellow fever virus in humans, arthropods, and non-human primates in sub-Saharan Africa: A systematic review and meta-analysis. PLoS Negl Trop Dis. 2022;16(7):e0010610. pmid:35867659
- 17. Lou Y-N, Sun M-X, Li K, Xiong X-C, Zhou C, Cao T-S, et al. A single residue in domain II of envelope protein of yellow fever virus is critical for neutralization sensitivity. J Virol. 2025;99(4):e0177024. pmid:40019254
- 18.
WHO Immunization Data portal - Detail Page. Immunization Data. World Health Organization; 2024. https://immunizationdata.who.int/global/wiise-detail-page/yellow-fever-(yf)-vaccination-coverage?CODE=SLE&YEAR=
- 19. Kamariza M, McMahon K, Kim L, Welch NL, Stenson L, Allan-Blitz LT, et al. Multiplexed detection of febrile infections using CARMEN. Nat Commun. 2025;17(1):32. pmid:41365864
- 20. Campbell AKO, Sandi JD, Omah IF, Faye M, Parker E, Brock-Fisher T. Genomic epidemiology uncovers the origin of the mpox epidemic in Sierra Leone. MedRxiv. 2025.
- 21. Wong TKF, Ly-Trong N, Ren H, Banos H, Roger AJ, Susko E, et al. IQ-TREE 3: Phylogenomic Inference Software using Complex Evolutionary Models. EcoevoRxiv. 2025. https://ecoevorxiv.org/repository/view/8916/
- 22. Yu G, Smith DK, Zhu H, Guan Y, Lam TT. ggtree: An r package for visualization and annotation of phylogenetic trees with their covariates and other associated data. Methods Ecol Evol. 2016;8(1):28–36.
- 23. Beck A, Guzman H, Li L, Ellis B, Tesh RB, Barrett ADT. Phylogeographic reconstruction of African yellow fever virus isolates indicates recent simultaneous dispersal into east and west Africa. PLoS Negl Trop Dis. 2013;7(3):e1910. pmid:23516640
- 24. Shapiro B, Rambaut A, Drummond AJ. Choosing appropriate substitution models for the phylogenetic analysis of protein-coding sequences. Mol Biol Evol. 2006;23(1):7–9. pmid:16177232
- 25. Minin VN, Bloomquist EW, Suchard MA. Smooth skyride through a rough skyline: Bayesian coalescent-based inference of population dynamics. Mol Biol Evol. 2008;25(7):1459–71. pmid:18408232
- 26. Rambaut A, Drummond AJ, Xie D, Baele G, Suchard MA. Posterior Summarization in Bayesian Phylogenetics Using Tracer 1.7. Syst Biol. 2018;67(5):901–4. pmid:29718447