Skip to main content
Advertisement
  • Loading metrics

Detection and genetic characterization of alphacoronaviruses in co-roosting bat species, southeastern Kenya

  • Joseph G. Ogola ,

    Roles Conceptualization, Formal analysis, Investigation, Writing – review & editing

    ogolajoseph2000@gmail.com

    Affiliation Department of Medical Microbiology and KAVI Institute of Clinical Research, University of Nairobi, Nairobi, Kenya

  • Hussein Alburkat,

    Roles Formal analysis, Investigation, Writing – review & editing

    Affiliations Departments of Virology, University of Helsinki, Helsinki, Finland, Department of Microbiology, Faculty of Medicine Al-Muthanna University, Samawah, Iraq

  • Teemu Smura,

    Roles Formal analysis, Investigation, Writing – review & editing

    Affiliation Departments of Virology, University of Helsinki, Helsinki, Finland

  • Lauri Kareinen,

    Roles Conceptualization, Investigation, Writing – review & editing

    Affiliation Departments of Virology, University of Helsinki, Helsinki, Finland

  • Ravi Kant,

    Roles Formal analysis, Writing – review & editing

    Affiliations Departments of Virology, University of Helsinki, Helsinki, Finland, Department of Veterinary Biosciences, University of Helsinki, Helsinki, Finland, Department of Tropical Parasitology, Institute of Maritime and Tropical Medicine, Medical University of Gdansk, Gdynia, Poland

  • Essi M. Korhonen,

    Roles Writing – review & editing

    Affiliations Departments of Virology, University of Helsinki, Helsinki, Finland, Department of Veterinary Biosciences, University of Helsinki, Helsinki, Finland

  • Tamika J. Lunn,

    Roles Writing – review & editing

    Affiliation Odum School of Ecology, University of Georgia, Athens, GeorgiaUnited States of America

  • Moses Masika,

    Roles Writing – review & editing

    Affiliation Department of Medical Microbiology and KAVI Institute of Clinical Research, University of Nairobi, Nairobi, Kenya

  • Paul W. Webala,

    Roles Conceptualization, Investigation, Writing – review & editing

    Affiliation Department of Forestry and Wildlife Management, Maasai Mara University, Narok, Kenya

  • Philip Nyaga,

    Roles Conceptualization, Investigation, Writing – review & editing

    Affiliation Department of Veterinary Pathology, Microbiology and Parasitology, University of Nairobi, Nairobi, Kenya

  • Omu Anzala,

    Roles Conceptualization, Writing – review & editing

    Affiliation Department of Medical Microbiology and KAVI Institute of Clinical Research, University of Nairobi, Nairobi, Kenya

  • Olli Vapalahti,

    Roles Conceptualization, Writing – review & editing

    Affiliations Departments of Virology, University of Helsinki, Helsinki, Finland, Department of Veterinary Biosciences, University of Helsinki, Helsinki, Finland, HUS Diagnostic Center, Virology and Immunology, Helsinki University Hospital, Helsinki, Finland

  • Kristian M. Forbes,

    Roles Conceptualization, Investigation, Writing – review & editing

    Affiliation Department of Biological Sciences, University of Arkansas, Fayetteville, Arkansas, United States of America

  • Tarja A. Sironen

    Roles Conceptualization, Formal analysis, Investigation, Writing – review & editing

    Affiliations Departments of Virology, University of Helsinki, Helsinki, Finland, Department of Veterinary Biosciences, University of Helsinki, Helsinki, Finland

Abstract

Bats are associated with some of the most significant and virulent emerging zoonoses globally, yet research and surveillance of bat pathogens remains limited across parts of the world. We surveyed the prevalence and genetic diversity of coronaviruses from bats in Taita Hills, southeastern Kenya, as part of ongoing surveillance efforts in this remote part of eastern Africa. We collected fecal and intestinal samples in May 2018 and March 2019 from 16 bat species. We detected one genus of coronavirus (alphacoronavirus), with an overall RNA prevalence of 6.5% (30/463). The prevalence of coronavirus RNA was 3.8% (9/235) and 11.6% (21/181) for the two most captured free-tailed bat species, Mops condylurus and M. pumilus respectively, with no detections from other bat species (0/90). Phylogenetic analyses based on the partial RNA-dependent RNA polymerase gene and whole genome sequences revealed that the sequences clustered together and were closely related to alphacoronavirus detected in free tailed bats in Eswatini, Nigeria and Rhinolophus simulator bats in South Africa. The sequences were more distantly related to alphacoronavirus isolated from Chaerophon plicatus bat species in Yunnan province, China and Ozimops species from southwestern Australia. These findings highlight coronavirus transmission among bats that share habitats with humans and livestock, posing a potential risk of exposure. Future research should investigate whether coronaviruses detected in these bats have the potential to spillover to other hosts.

Author summary

Bats are known to carry several zoonotic pathogens with potential to cause serious illnesses and death in humans. Yet, surveillance on the pathogens they carry remains limited in much of the world. We studied the prevalence and diversity of coronaviruses from bats in Taita Hills, southeastern Kenya to better understand the circulation of these viruses and inform disease preparedness. We detected alphacoronaviruses in urban Mops condylurus and M. pumilus bat species. The bat alpha coronaviruses we detected were closely related to alphacoronaviruses that have been previously detected in bats elsewhere in Africa and distantly related to alphacoronavirus detected from Chaerophon plicatus bat species in Yunnan province, China and Ozimops species from southwestern Australia. This work demonstrates coronavirus circulation among bats that share habitats with people and livestock providing conditions that can lead to spillover. Identifying whether coronaviruses detected in these bats have the potential to infect other hosts is critical for developing countermeasures and mitigating potential outbreaks.

1 Introduction

Coronaviruses (CoVs) infect a wide range of hosts and are distributed throughout the world [1]. They can cause a number of diseases in humans, with variable severity ranging from asymptomatic to severe respiratory, gastrointestinal, liver, and neurologic diseases [2]. Coronaviruses have the potential to cause localized epidemics and global pandemics, as witnessed by three recent outbreaks: Severe Acute Respiratory Syndrome coronavirus (SARS-CoV), which emerged in a market in Guangdong province, China [3,4], Middle east respiratory syndrome (MERS) caused by MERS-CoV in the Arabian Peninsula [5,6], and the most recent global pandemic (COVID-19) caused by Severe Acute Respiratory Syndrome coronavirus 2 (SARS-CoV-2) [7]. The subfamily Coronavirinae is classified into four genera. Two of these, alpha- and beta-CoVs, can infect and sometimes cause disease in mammals, including humans. The remaining genera, gamma- and delta-CoVs, are mainly associated with birds [8,9].

All CoVs found in humans are likely zoonotic in origin, including those that cause mild but common respiratory diseases such as HCoV-HKU1 and HCoV-OC43 [10,11]. CoVs have a large genomic size ranging between 27 and 32 kb. As RNA viruses, they replicate by use of virus-encoded RNA polymerases, that are prone to high mutation rates. Phylogenetic studies indicate that cross-species transmission has occurred frequently during coronavirus evolution [12], facilitated by their high mutation rate, genetic recombination and ecological/anthropogenic factors. The occurrence of zoonotic spillover events may be linked to these viral features, together with the global distribution of CoVs in wildlife, giving CoVs opportunities to adapt to novel hosts [12,13].

Due to their high propensity for cross-species transmission and potential for disease emergence in humans, CoVs have become a focus for wildlife monitoring. Bats (Mammalia: Chiroptera), in particular, have played a major role as the gene source for the evolution of alpha-CoVs and beta-CoVs (8) and have come to the forefront of coronavirus surveillance efforts. More than 4,000 coronavirus sequences from 14 bat families have been identified so far [14], yet the true diversity of bat coronaviruses is probably much higher. Bats are present in all six human-inhabited continents [15], though their species richness is highest in areas close to the equator [16]. This has correlated with the emergence and re-emergence of bat-borne viral pathogens in countries near the equator, including identified disease hotspots in Africa, the Americas, and Asia [17].

Hotspots along the equator are often also areas of high anthropogenic activities increasing contact between bats, livestock and humans, and escalating the risk of pathogen spillover. Bats continue to move closer to human settlements and sometimes live in the same houses used by people or abandoned buildings, further increasing the potential for spillover of the pathogens they host [1820]. For example, serological evidence of human exposure to bat CoVs in rural China showed that spillover from bats might occur relatively commonly [21,22]. There is currently minimal surveillance of CoVs in eastern Africa despite Kenya being known for high diversity of bat fauna globally [23] and the risk CoVs pose to people. As part of ongoing wildlife surveillance for preparedness and prevention of zoonotic disease emergence, the purpose of this study was to survey the diversity and prevalence of CoVs from bats in Taita Hills, southeastern Kenya, a biodiversity hotspot, and characterize the genomes of identified viruses.

2 Methods

2.1 Ethics statement

Bat trapping and sample collections were conducted under permits from the Kenyan National Commission for Science, Technology and Innovation (permit no. NACOSTI/P/18/76501/22243) and the Kenya Wildlife Service (permit no. KWS/BRM/500), University of Nairobi Faculty of Veterinary Medicine; Biosafety, Animal use and Ethics committee (REF: FVM BAUEC/2018/180) and University of Arkansas Institutional Review Board (protocol #22012). Sample import to Finland was approved by the Finnish Food Safety Authority (EVIRA; 4250/0460/2016 and 2809/0460/2018).

2.2 Study area and sampling procedures

The study was conducted in the Taita Hills area, Taita Taveta County, southeast Kenya (Fig 1), as part of an ongoing virus surveillance project. Bats were captured in May 2018 and March 2019 using hand nets, and single- and triple-high mist nets at roosts and natural flyways. Captured bats were identified to species in the field using existing keys for bats of Kenya [23]. Non-conservation priority bat species (classified as least concern by the IUCN) were placed in clean individual cloth bags and retained for processing. For processing, bats were euthanized using an overdose of 4–5% isoflurane gas and immediately dissected for the collection of intestinal samples. Before terminal samples were collected, bat demographic information (sex, age and reproductive status) was recorded alongside standard measurements (weight and forearm length). Bats were considered reproductive if female bats were gestating or lactating at the time of capture. Gestation was observed by gentle palpation of the abdomen for the presence of uterine bulge while lactating bats were identified by the presence of engorged nipples either with or without bare patches of skin [28]. Male bats were considered reproductive if they were scrotal and not juvenile. Bats were classified as non-reproductive if there was no detectable pregnancy/lactation or scrotum among female and males respectively. Fecal samples were collected from the individual cloth bags when available. Both intestinal tissues and fecal samples were placed into separately marked tubes with RNAlater (Qiagen, Hilden, Germany), stored at -20°C, and later shipped on dry ice to Helsinki, Finland for laboratory testing.

thumbnail
Fig 1. Map of Taita Hills showing the spatial location of sites where bat species were captured in May 2018 and March 2019.

The white large dots and red dots indicate the bat trapping sites and positive CoV sites respectively. The map was created using ArcMap (version 10.8). Shape files for country boundaries, rivers, and lakes, as well as the 30-meter resolution SRTM Kenya Digital Elevation Model (DEM), were obtained from various open data sources [2426]. True color background data of Taita Hills was derived from Sentinel-2 MSI Level-1C imagery [27] (ESA/Copernicus, tile T37MCR, 09 September 2025).

https://doi.org/10.1371/journal.pntd.0012805.g001

2.3 Sample processing and detection of coronaviruses

Bat fecal and intestinal samples were treated with Tripure (Roche, http://www.roche.com), according to the manufacturer’s instructions, to inactivate any potentially biohazardous agents before RNA extractions. Extracted RNAs were eluted in 50 μL of RNase-free water, aliquoted into 15 μL, quantified by NanoDrop spectrophotometer (Thermo Fisher Scientific), and stored at –80°C for downstream analysis. Samples were then screened with qScript One-Step SYBR Green qRT-PCR Kit, Low ROX (Quanta Biosciences, Beverly, MA) using primers targeting the CoV RdRP-gene (11-FW 5′ TGATGATGSNGTTGTNTGYTAYAA 3′ and 13-RV GCATWGTRTGYTGNGARCARAATTC 3′ which are able to detect all four genera of coronaviruses currently known [29]). Positive samples were transcribed to cDNA using SuperScript IV One-Step RT-PCR Kit (ThermoFisher Scientific, Invitrogen, USA) with only the first reaction of the nested RT-PCR protocol used [30] and PCR amplicons were sequenced using Sanger sequencing.

A total of 12 positive coronavirus RNA samples from the two species were further subjected to next-generation sequencing (NGS). For this, the CoV positive RNA samples were reverse transcribed and amplified using the WTA2 kit (Sigma). Products were then purified based on the manufacturer’s instructions with the GeneJET PCR purification kit (Thermo Scientific). NGS libraries were prepared with the Nextera XT DNA Sample Preparation and the Nextera XT Index Kit for 24 Indexes (Illumina), and sequencing was performed with MiSeq (Illumina), using the MiSeq Reagent Kit v2-150.

Since NGS resulted in fragmentary CoV genomes, specific primer pairs were designed in the genomic regions flanking gaps and these regions were amplified with PCR using 10.0μl of 2x Phusion Flash PCR Master Mix (Thermo Scientific), 2.5μl of 10μM primers and 2.5μl of water. The cycling conditions were as follows: initial denaturation at 9oC for 10s, followed by 30 cycles of the following steps of denaturation at 98oC for 1s, annealing at 56oC for 5s and extension at 72oC for 7s, and final extension at 72oC for 1 min. PCR products were run on a 1.5% agarose gel, purified as described earlier, and sequenced with Sanger sequencing using PCR primers. If the gap-filling PCR was unsuccessful, Sanger sequencing was performed with the original complete genome PCR product as a template.

2.4 Statistical analysis

Field and laboratory data were stored in Microsoft Excel files. They were imported into SPSS software for descriptive and statistical analysis. Dependent variables that were analyzed included bat species, age, sex and reproductive status and location of bat trapping. We explored the association between the prevalence of CoV RNA and the dependent variables using univariate logistic regression.

2.5 Sequence analysis

The raw sequence reads were quality filtered, trimmed, de-novo assembled, and annotated using fastp v.1.0.1 [31], MEGAHIT v.1.2.8 [32] and SANSparallel [33], respectively, implemented in Lazypipe pipeline [34]. Thereafter, for the samples in which alphacoronavirus sequences were detected, the quality filtered reads (quality score threshold of 30, maximum allowed percentage of low-quality bases 40, minimum read length: 25, sliding window size: 20 mean with a mean quality threshold of 30; and base correction for paired-end reads enabled) were remapped against the near-complete genome sequence from sample X167 using BWA-MEM [35] implemented in HAVoC pipeline [36]. Thereafter, sequence reads were filtered by mapping quality (MAPQ) using a threshold of 30, followed by consensus calling with SAMtools [37] and BCFtools [38]. A summary of sequencing data quality and alignment metrics is provided in S1 Table.

For phylogenetic analysis, all available complete or nearly complete alphacoronavirus genomes were downloaded from NCBI GenBank and aligned using MAFFT [39]. The alignment was further subsampled to include only one representative of sequence groups with less than 5% pairwise amino acid sequence divergence. The complete genome alignment was divided into 5 alignments, representing each gene (coding for ORF1ab, spike, NS3, envelope and membrane proteins) in the alphacoronavirus core genome (i.e., genes present in all the members of the alphacoronavirus genus). Phylogenetic trees were constructed using maximum likelihood (ML) method implemented in IQ-TREE2 v.2 [40], employing ModelFinder [41] algorithm to determine the optimal protein substitution model, and the ultrafast bootstrap UFBoot2 [42] algorithm to compute 1000 bootstrap pseudo replicates. The final trees were visualized with iTOL v5 [43].

3 Results

A total of 510 bats were captured during the study period, comprising mainly Mops condylurus (n = 237) and M. pumilus (n = 183), which were the focus species of our field trapping due to their close association with humans as species that commonly roost in anthropogenic buildings. In addition, 90 other bats from 14 species were captured and screened (Table 1). The overall prevalence for CoV RNA was 6.5% (30/463) [95% CI 4.4-9.1%] and varied markedly among sites (Table 2), sometimes skewed by a small number of captured bats.

thumbnail
Table 1. Species of bats captured, number sampled and the prevalence of CoV RNA based on bat sex and reproductive status. Bats were classified as reproductive if they were reproductively active at the time of capture (females: pregnant or lactating; males: scrotal) or as non-reproductive if they were not.

https://doi.org/10.1371/journal.pntd.0012805.t001

thumbnail
Table 2. The prevalence of coronavirus RNA in Mops condylurus and M. pumilus trapped from different sites at Taita hills, Kenya.

https://doi.org/10.1371/journal.pntd.0012805.t002

The two species, Mops condylurus and Mops pumilus, had significantly different CoV prevalence: prevalence was lower in M. condylurus bats 3.8% (9/235) [95% CI 1.8-7.2%] as compared to M. pumilus bats 11.6% (21/181) [95% CI 7.3-17.2%], with odds ratio of 0.303 (0.135 -0.680), and p-value of 0.004 (Table 3). CoV prevalence varied slightly also based on reproductive status; however, the difference was not significant.

thumbnail
Table 3. Univariate logistic regression for the presence of bat coronaviruses and the species, sex and reproductive status of captured Mops condylurus and M. pumilus bats.

https://doi.org/10.1371/journal.pntd.0012805.t003

3.1 Phylogenetic analysis of partial RdRp coding sequences

Thirty partial RdRp nucleotide sequences were obtained from coronavirus screening PCR amplicons. The initial annotation using blastn algorithm indicated that they had 96–99% nucleotide identity against alphacoronaviruses sequenced from genus Chaerophon bats from Kenya, Eswatini, and Nigeria [4446] as well as Rhinolophus simulator bats from South Africa [47]. Three short sequences were excluded from further analysis due to the short length of nucleotide bases and the presence of ambiguous bases and the remaining sequences were submitted to the NCBI Genbank with accession numbers (Table 4). The raw NGS reads were deposited in the European Nucleotide Archive (ENA) at EMBL-EBI under accession number PRJEB96286. In the phylogenetic analysis, these sequences clustered together and the sequences from Kenya interspersed with those from the other African countries (Fig 2). Notably, all the sequences from each sampling location do not cluster together, but form distinct clusters interspersed throughout the tree. On the other hand, some sequences sampled from two different locations were identical to each other. For example, three sequences from Maktau and 10 sequences from Voi (Fig 2) were identical, although Maktau and Voi are located 51.7 km apart. Further, the cluster containing the viruses sequenced in this study was more distantly related to alphacoronaviruses sequenced from Otomops harrisoni, Mount Suswa, Kajiado county, Kenya [48], Otomops martienssini, Rwanda, Chaerophon sp. bat from Kenya [45] as well as those sequenced from Hipposideros armiger and Hipposideros pomona bats in China [49].

thumbnail
Table 4. Complete coding sequences and partial RdRp sequences from M. condylurus and M. pumilus submitted to the NCBI GenBank with accession numbers.

https://doi.org/10.1371/journal.pntd.0012805.t004

thumbnail
Fig 2. Phylogenetic tree based on a 378 nucleotides long alignment of partial RdRp sequences encoding the conserved polymerase motif B.

The tree was constructed using maximum likelihood method implemented in IQTree2 software with TN + F + R3 substitution model and 1000 ultrafast bootstrap replicates. The tree was rooted to mid-point, the nodes with bootstrap support less than 70 were polytomized and the clades with sequences from only one country in eastern Africa, or 2-3 countries from western or southern Africa were collapsed for the sake of clarity.

https://doi.org/10.1371/journal.pntd.0012805.g002

3.2 Genomic characterization of complete coding sequences

Two nearly complete alphacoronavirus genomes were obtained using NGS. Of these, complete coding regions were obtained from the strain X167, whereas strain X152 contained two gaps (amino acids 1–42 and 71–114), which we were not able to close with specific PCRs due to the lack of sample availability. The genome organization of these strains was canonical to subgenus Decacovirus (Fig 3), where ORF1ab with (ribosomal slippage site) is followed by spike (S), non-structural protein (NS3), envelope protein (E), matrix protein (M) and nucleoprotein (N) coding regions, as well as ORFx. The spike protein contains the S1/S2 cleavage region, but they do not contain a polybasic cleavage site. Furthermore, Hidden Markov Model search suggested that the coded proteins contained conserved functional domains typical for alphacoronaviruses (S2 Table).

thumbnail
Fig 3. Gene composition of alphacoronavirus.

This figure illustrates the genome organization of an alphacoronavirus, highlighting the nucleotide lengths of individual genes along the genome map. Arrows represent the relative positions and sizes of each gene. The figure was created using BioRender.

https://doi.org/10.1371/journal.pntd.0012805.g003

3.3 Phylogenetic analysis based on complete coding regions

Phylogenies based on the amino acid sequences of core genes (ORF1ab, spike, NS3, E, M, N) indicated that our sequences clustered together on the basis of all analyzed proteins (Fig 4). The closest relatives for our sequences were alphacoronaviruses sequenced from Chaerephon pumilus fecal samples from Eswatini, southern Africa [44] and oral/rectal swabs of Mops condylurus bats from Nigeria, western Africa [46] (Fig 4). These African bat-associated viruses formed a larger cluster with more distantly related alphacoronaviruses Yunnan/CpYN11/2019 and WA3607 (subgenus Decacovirus) sequenced from Chaerephon plicatus bat fecal sample from Yunnan province, China [50] and Ozimops sp from southwestern Australia [51], respectively (S1A-E Fig). Notably, while these sequences formed a monophyletic group based on spike amino acid sequences, three other alphacoronavirus sequences were also included in this cluster. Alphacoronavirus isolate Tb2 from Tadarida brasiliensis bat fecal sample from Santa Fe province, Argentina [52], grouped together with WA3607 strain, and the Miniopterus sp alphacoronavirus strains HKU8 from Hong Kong [53] and BtMf-AlphaCoV/HuB2013 from Hubei province, China [54] formed their own node within this cluster (S1A-E Fig).

thumbnail
Fig 4. Phylogenetic trees based on (a) ORF1a, (b) ORF1b and (c) spike coding regions of the complete genome sequences from this study, as well as closely related sequences, of which nearly complete genomes were available.

The trees were constructed with maximum likelihood method implemented in IQTree2 software with best fit models GTR + F + I + G4 for ORF1a and ORF1b, and TIM3 + F + R3 for spike coding region and 1000 ultrafast bootstrap replicates. The tree was rooted to mid-point.

https://doi.org/10.1371/journal.pntd.0012805.g004

To further study the evolutionary relationship between our sequences and the related sequences, we included partial alphacoronavirus sequences sampled from Otomops harrisoni, Mount Suswa, Kajiado county, Kenya (partial ORF1a and complete ORF1b available) [48] and Chaerophon sp. bat from Kenya (complete ORF1b and onwards available) [45] to the analysis (Fig 4). Based on both ORF1a and ORF1b regions, the sequences from O. harrisoni form a sister clade to our sequences and sequences from Eswatini and Nigeria. In ORF1a and spike regions, our sequences cluster together with those from Eswatini. This clustering pattern remains similar throughout NS3, E, M and N coding regions. However, based on ORF1b, our sequences, along with those from Eswatini, Nigeria and KY22 strain from Kenya each form their separate lineage with unresolved deeper phylogenetic relationships.

4 Discussion

Predicting future coronavirus outbreaks requires enhanced surveillance to investigate coronavirus prevalence and distribution among bats alongside characterizing their genetic features. We identified coronaviruses in two bat species in the Taita Hills of southeastern Kenya and characterized the virus genomes. This information helps fill an important gap in knowledge about coronavirus diversity in underexplored and high-risk parts of the world.

The prevalence of CoVs (6.5%) in our study area was consistent with previous studies on coronavirus prevalence in bats. For example, Cappelle et al. reported coronavirus prevalence of 4.2% (24/573) in Cambodian bats from Kampot and 4.75% (22/463) in flying foxes from Kandal [55]. Studies by Xu et al. reported coronavirus in 5.3% (50/951) of bats in the Tibet autonomous Region of China [56]. Notably, coronavirus prevalence varies greatly between studies, with some of the highest coronavirus prevalence’s reported by Ge et al. 50% (138/276) in China [57], Balboni et al. 42% (19/45) in Italy [58] and by Tsuda et al. 29.6% (53/179) in the Philippines [59].

CoV prevalence in our study varied across the species and the capture sites, with prevalence being significantly higher in M. pumilus than in M. condylurus. The variability in host prevalence potentially highlights the complexity of exposure, infection, and immune dynamics over space and time. Although we only detected alphacoronaviruses in M. pumilus and M. condylurus, alphacoronaviruses and other novel coronaviruses have previously been found in some other bats in Kenya [48]. For example, Tong et al (2006) detected coronaviruses in different species of bats including Miniopterus inflatus, Rousettus aegytiacus and Eidolon helvum and others [60]. In addition, coronaviruses have also been previously detected in Eidolon helvum urban roost in Tanzania, frugivorous bats in Madagascar and different species of bats from Mozambique and Madagascar [6163].

We detected coronaviruses in bats trapped from houses occupied by humans and also water points which are used by both humans and livestock for drinking. This poses a significant zoonotic risk for the spillover of bat borne zoonotic viruses, particularly coronaviruses which have been documented by studies across Africa [6465]. In most rural communities in Africa, molossid bats, in particular M. pumilus and M. condylurus often roost in human occupied buildings, including roofs, ceilings and wall cavities, increasing contact with humans and livestock. Livestock in particular may act as an intermediate or amplifying host. Such close interaction increases the likelihood of viral exposure through contamination with bat excreta, saliva, or urine. In addition to the low public awareness of zoonotic risks, the risk of exposure is amplified in rural communities due to limited resources and housing conditions that may be inadequate to prevent wildlife invasion.

The partial coronavirus RdRp sequences obtained in our study showed a clustering pattern with Kenyan sequences, interspersed among bat sequences from other African countries. We observed two large clusters of nearly identical sequences; one cluster included bats trapped from two different colonies in two different locations (Voi and Maktau, 51.7 kms apart) while the second cluster included bats roosting in the same location (Voi, 3.5 kms apart). While Mops condylurus and Mops pumilus bats are thought to travel short distances due to their feeding behavior [66], detection of identical sequences in two different locations in Taita, as well as the intermixing of sequences from different countries in the phylogenetic tree, demonstrates likely longer distance movements of molossid bats.

In the past decade, knowledge on bat coronavirus ecology and epidemiology has significantly increased. Alphacoronaviruses have been reported in several bat populations and other mammalian hosts, with several lineages in bat species that often roost near human settlements and agricultural environments [1819]. Severe acute diarrhea syndrome coronavirus (SADS-CoV), an emerging virus responsible for severe, acute diarrhea in piglets in China is one of the most recent bat-derived alphacoronavirus [67]. Studies by Antony et al. estimated approximately 3,204 bat coronaviruses worldwide [68]. Similar to most emerging zoonoses, coronavirus spillover and emergence may be linked to high mutation rates, potential for recombination and anthropogenic changes such as deforestation, agricultural intensification and urbanization [17]. Further studies on bat coronavirus could focus on extensive surveillance of coronavirus in different bat species and also animals with close bat contact for coronavirus spillover and emergence in rural communities. The recent COVID-19 pandemic underscores the need for an increased assessment of CoV diversity and spillover risk at local and regional levels.

Supporting information

S1 Fig.

A. Phylogenetic trees based on amino acid sequences of the representatives of all available alphacoronavirus species. The trees were constructed with maximum likelihood method implemented in IQTree2 software with best fit substitution models LG + F + R10, WAG + F + R7, LG + F + I + G4, LG + F + I + G4 and LG + F + R5 for ORF1ab (A), spike (B), ORF3 (C), M (D) and N (E) respectively. The clusters are coloured on the basis of ORF1ab clustering pattern.

https://doi.org/10.1371/journal.pntd.0012805.s001

(PPTX)

S1 Table. Summary of sequencing quality and alignment metrics.

https://doi.org/10.1371/journal.pntd.0012805.s002

(DOCX)

S2 Table. The results of Hidden Markov Model (HMMER v3.4) search against Pfam database.

https://doi.org/10.1371/journal.pntd.0012805.s003

(DOCX)

Acknowledgments

We wish to acknowledge the training and support from the University of Nairobi’s Building Capacity for Writing Scientific Manuscripts (UANDISHI) Program at the Faculty of Health Sciences. The training was funded in part through the ADVANCE program at IAVI. This work is made possible by the support of the American People through the U.S. President’s Emergency Plan for AIDS Relief (PEPFAR) through United States Agency for International Development (USAID). The contents of this study are the sole responsibility of the authors and do not necessarily reflect the views of PEPFAR, USAID, or the United States Government. We also thank Ruut Uusitalo for preparing the map image.

References

  1. 1. Monchatre-Leroy E, Boué F, Boucher JM, Renault C, Moutou F, Gouilh MA. Identification of Alpha and Beta Coronavirus in Wildlife Species in France: Bats, Rodents, Rabbits, and Hedgehogs. Viruses. 2017;9(12).
  2. 2. Lai MMC, Holmes KV. Fields virology. In: Lippincott Williams DMK. Philadelphia PA: Lippincott Williams & Wilkins. 2001. 1163–85.
  3. 3. Zhong NS, Zheng BJ, Li YM, Poon, Xie ZH, Chan KH, et al. Epidemiology and cause of severe acute respiratory syndrome (SARS) in Guangdong, People’s Republic of China, in February, 2003. Lancet. 2003;362(9393):1353–8. pmid:14585636
  4. 4. Cui J, Li F, Shi Z-L. Origin and evolution of pathogenic coronaviruses. Nat Rev Microbiol. 2019;17(3):181–92. pmid:30531947
  5. 5. Bawazir A, Al-Mazroo E, Jradi H, Ahmed A, Badri M. MERS-CoV infection: Mind the public knowledge gap. J Infect Public Health. 2018;11(1):89–93. pmid:28647126
  6. 6. Zaki AM, van Boheemen S, Bestebroer TM, Osterhaus ADME, Fouchier RAM. Isolation of a novel coronavirus from a man with pneumonia in Saudi Arabia. N Engl J Med. 2012;367(19):1814–20.
  7. 7. Khan S, Nabi G, Han G, Siddique R, Lian S, Shi H. Novel coronavirus: how things are in Wuhan. Clin Microbiol Infect. 2020;26(4):399–400.
  8. 8. Virus taxonomy: classification and nomenclature of viruses. Ninth report of the International Committee on Taxonomy of Viruses. 2012
  9. 9. Woo PCY, Lau SKP, Lam CSF, Lau CCY, Tsang AKL, Lau JHN, et al. Discovery of seven novel Mammalian and avian coronaviruses in the genus deltacoronavirus supports bat coronaviruses as the gene source of alphacoronavirus and betacoronavirus and avian coronaviruses as the gene source of gammacoronavirus and deltacoronavirus. J Virol. 2012;86(7):3995–4008. pmid:22278237
  10. 10. Corman VM, Baldwin HJ, Tateno AF, Zerbinati RM, Annan A, Owusu M. Evidence for an Ancestral Association of Human Coronavirus 229E with Bats. J Virol. 2015;89(23):11858–70.
  11. 11. Corman VM, Eckerle I, Memish ZA, Liljander AM, Dijkman R, Jonsdottir H, et al. Link of a ubiquitous human coronavirus to dromedary camels. Proc Natl Acad Sci U S A. 2016;113(35):9864–9. pmid:27528677
  12. 12. Graham RL, Baric RS. Recombination, reservoirs, and the modular spike: mechanisms of coronavirus cross-species transmission. J Virol. 2010;84(7):3134–46.
  13. 13. Hulswit RJG, de Haan CAM, Bosch B-J. Coronavirus Spike Protein and Tropism Changes. Adv Virus Res. 2016;96:29–57. pmid:27712627
  14. 14. Ruiz-Aravena M, McKee C, Gamble A, Lunn T, Morris A, Snedden CE, et al. Ecology, evolution and spillover of coronaviruses from bats. Nat Rev Microbiol. 2022;20(5):299–314. pmid:34799704
  15. 15. Simmons NB, Bat ALC. Bat species of the world: A taxonomic and geographic database. 2018.
  16. 16. Herkt KMB, Barnikel G, Skidmore AK, Fahr J. A high-resolution model of bat diversity and endemism for continental Africa. Ecol Modell. 2016;320:9–28.
  17. 17. Allen T, Murray KA, Zambrana-Torrelio C, Morse SS, Rondinini C, Di Marco M, et al. Global hotspots and correlates of emerging zoonotic diseases. Nat Commun. 2017;8(1):1124. pmid:29066781
  18. 18. Jackson RT, Webala PW, Ogola JG, Lunn TJ, Forbes KM. Roost selection by synanthropic bats in rural Kenya: implications for human-wildlife conflict and zoonotic pathogen spillover. R Soc Open Sci. 2023;10(9):230578. pmid:37711150
  19. 19. Lunn TJ, Jackson RT, Webala PW, Ogola JG, Forbes KM. Modern building structures are a landscape-level driver of bat–human exposure risk in Kenya. Front Ecol Environ. 2024;e2795.
  20. 20. Jackson RT, Lunn TJ, DeAnglis IK, Ogola JG, Webala PW, Forbes KM. Frequent and intense human-bat interactions occur in buildings of rural Kenya. PLoS Negl Trop Dis. 2024;18(2):e0011988. pmid:38412171
  21. 21. Wang N, Li SY, Yang XL o u, Huang HM, Zhang YJ, Guo H. Serological evidence of bat SARS-related coronavirus infection in humans, China. Virol Sin. 2018;33(1):104–7.
  22. 22. Li H, Mendelsohn E, Zong C, Zhang W, Hagan E, Wang N, et al. Human-animal interactions and bat coronavirus spillover potential among rural residents in Southern China. Biosaf Health. 2019;1(2):84–90. pmid:32501444
  23. 23. Patterson B, Webala P. Keys to the bats (Mammalia: Chiroptera) of East Africa. Fieldiana Life Earth Sci. 2012;12(6).
  24. 24. RCMRD. Kenya SRTM DEM 30 meters. RCMRD GMES & Africa Geoportal. 2025. https://gmesgeoportal.rcmrd.org/datasets/9a461a538cf848299b996a28cdc42ef6/explore
  25. 25. Natural Earth. Downloads – Free vector and raster map data at 1:10m, 1:50m, and 1:110m scales. n.d. https://www.naturalearthdata.com/downloads/
  26. 26. G A D M. Database of Global Administrative Areas. n.d. https://gadm.org/download_country.html
  27. 27. European Space Agency (ESA). Sentinel-2 MSI Level-1C. Copernicus Open Access Hub. 2023. https://scihub.copernicus.eu/
  28. 28. Happold DC, Happold M. Reproduction of Angola free-tailed bats (Tadarida condylura) and little free-tailed bats (Tadarida pumila) in Malawi (Central Africa) and elsewhere in Africa. J Reprod Fertil. 1989;85(1):133–49. pmid:2915350
  29. 29. Escutenaire S, Mohamed N, Isaksson M, Thorén P, Klingeborn B, Belák S, et al. SYBR Green real-time reverse transcription-polymerase chain reaction assay for the generic detection of coronaviruses. Arch Virol. 2007;152(1):41–58. pmid:16941059
  30. 30. De Souza Luna LK, Heiser V, Regamey N, Panning M, Drexler JF, Mulangu S. Generic detection of coronaviruses and differentiation at the prototype strain level by reverse transcription-PCR and nonfluorescent low-density microarray. J Clin Microbiol. 2007;45(3):1049–52.
  31. 31. Chen S, Zhou Y, Chen Y, Gu J. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics. 2018;34(17):i884–90. pmid:30423086
  32. 32. Li D, Luo R, Liu CM, Leung CM, Ting HF, Sadakane K. MEGAHIT v1.0: A fast and scalable metagenome assembler driven by advanced methodologies and community practices. Methods. 2023;102:3–11.
  33. 33. Somervuo P, Holm L. SANSparallel: interactive homology search against Uniprot. Nucleic Acids Res. 2015;43(W1):W24-9. pmid:25855811
  34. 34. Plyusnin I, Kant R, Jääskeläinen AJ, Sironen T, Holm L, Vapalahti O, et al. Novel NGS pipeline for virus discovery from a wide spectrum of hosts and sample types. Virus Evol. 2020;6(2):veaa091. pmid:33408878
  35. 35. Li H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. arXiv. 2013.
  36. 36. Truong Nguyen PT, Plyusnin I, Sironen T, Vapalahti O, Kant R, Smura T. HAVoC, a bioinformatic pipeline for reference-based consensus assembly and lineage assignment for SARS-CoV-2 sequences. BMC Bioinformatics. 2021;22(1):373. pmid:34273961
  37. 37. Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009;25(16):2078–9. pmid:19505943
  38. 38. Li H. A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data. Bioinformatics. 2011;27(21):2987–93. pmid:21903627
  39. 39. Katoh K, Standley DM. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol. 2013;30(4):772–80.
  40. 40. Quang B, Schmidt HA, Chernomor O, Schrempf D, Woodhams MD, Von Haeseler A. IQ-TREE 2: New Models and Efficient Methods for Phylogenetic Inference in the Genomic Era. Molecular Biology and Evolution. 2020;37(5):1530–4.
  41. 41. Kalyaanamoorthy S, Minh BQ, Wong TKF, Von Haeseler A, Jermiin LS. ModelFinder: Fast Model Selection for Accurate Phylogenetic Estimates. Nat Methods. 2017;14(6):587.
  42. 42. Hoang DT, Chernomor O, Von Haeseler A, Minh BQ, Vinh LS. UFBoot2: Improving the Ultrafast Bootstrap Approximation. Molecular Biology and Evolution. 2018;35(2):518–22.
  43. 43. Letunic I, Bork P. Interactive tree of life (iTOL) v5: an online tool for phylogenetic tree display and annotation. Nucleic Acids Research. 2021;49(W1):W293-6.
  44. 44. Shapiro JT, Mollerup S, Jensen RH, Olofsson JK, Nguyen NP, Hansen TA. Metagenomic analysis reveals previously undescribed bat coronavirus strains in Eswatini. EcoHealth. 2021;18(4):421–8.
  45. 45. Tao Y, Tang K, Shi M, Conrardy C, Li KSM, Lau SKP, et al. Genomic characterization of seven distinct bat coronaviruses in Kenya. Virus Res. 2012;167(1):67–73. pmid:22561208
  46. 46. George U, George O, Oguzie J, Osasona O, Motayo B, Kamani J. Genomic characterization of Alphacoronavirus from Mops condylurus bats in Nigeria. Virus Research. 2023;334.
  47. 47. Anthony SJ, Johnson CK, Greig DJ, Kramer S, Che X, Wells H, et al. Global patterns in coronavirus diversity. Virus Evol. 2017;3(1):vex012. pmid:28630747
  48. 48. Kamau J, Ergunay K, Webala PW, Justi SA, Bourke BP, Kamau MW, et al. A Novel Coronavirus and a Broad Range of Viruses in Kenyan Cave Bats. Viruses. 2022;14(12):2820. pmid:36560824
  49. 49. Latinne A, Hu B, Olival KJ, Zhu G, Zhang L, Li H. Origin and cross-species transmission of bat coronaviruses in China. Nat Commun. 2020;11(1).
  50. 50. Zhou H, Ji J, Chen X, Bi Y, Li J, Wang Q, et al. Identification of novel bat coronaviruses sheds light on the evolutionary origins of SARS-CoV-2 and related viruses. Cell. 2021;184(17):4380-4391.e14. pmid:34147139
  51. 51. Prada D, Boyd V, Baker ML, O’Dea M, Jackson B. Viral Diversity of Microbats within the South West Botanical Province of Western Australia. Viruses. 2019;11(12):1157. pmid:31847282
  52. 52. Cerri A, Bolatti EM, Zorec TM, Montani ME, Rimondi A, Hosnjak L. Identification and characterization of novel alphacoronaviruses in Tadarida brasiliensis (Chiroptera, Molossidae) from Argentina: insights into recombination as a mechanism favoring bat coronavirus cross-species transmission. Microbiol Spectr. 2023;11(5).
  53. 53. Chu DKW, Peiris JSM, Chen H, Guan Y, Poon LLM. Genomic characterizations of bat coronaviruses (1A, 1B and HKU8) and evidence for co-infections in Miniopterus bats. J Gen Virol. 2008;89(Pt 5):1282–7. pmid:18420807
  54. 54. Du J, Yang L, Ren X, Zhang J, Dong J, Sun L. Genetic diversity of coronaviruses in Miniopterus fuliginosus bats. Sci China Life Sci. 2016;59(6):604–14.
  55. 55. Cappelle J, Furey N, Hoem T, Ou TP, Lim T, Hul V, et al. Longitudinal monitoring in Cambodia suggests higher circulation of alpha and betacoronaviruses in juvenile and immature bats of three species. Sci Rep. 2021;11(1):24145. pmid:34921180
  56. 56. Xu L, Zhang F, Yang W, Jiang T, Lu G, He B, et al. Detection and characterization of diverse alpha- and betacoronaviruses from bats in China. Virol Sin. 2016;31(1):69–77. pmid:26847648
  57. 57. Ge X-Y, Wang N, Zhang W, Hu B, Li B, Zhang Y-Z, et al. Coexistence of multiple coronaviruses in several bat colonies in an abandoned mineshaft. Virol Sin. 2016;31(1):31–40. pmid:26920708
  58. 58. Balboni A, Gallina L, Palladini A, Prosperi S, Battilani M. A real-time PCR assay for bat SARS-like coronavirus detection and its application to Italian greater horseshoe bat faecal sample surveys. ScientificWorldJournal. 2012;2012:989514. pmid:22654650
  59. 59. Tsuda S, Watanabe S, Masangkay JS, Mizutani T, Alviola P, Ueda N. Genomic and serological detection of bat coronavirus from bats in the Philippines. Arch Virol. 2012;157(12):2349–55.
  60. 60. Tong S, Conrardy C, Ruone S, Kuzmin IV, Guo X, Tao Y, et al. Detection of novel SARS-like and other coronaviruses in bats from Kenya. Emerg Infect Dis. 2009;15(3):482–5. pmid:19239771
  61. 61. Hoarau AOG, Goodman SM, Al Halabi D, Ramasindrazana B, Lagadec E, Le Minter G. Investigation of astrovirus, coronavirus and paramyxovirus co-infections in bats in the western Indian Ocean. Virology Journal. 2021;18(1).
  62. 62. Razanajatovo NH, Nomenjanahary LA, Wilkinson DA, Razafimanahaka JH, Goodman SM, Jenkins RK. Detection of new genetic variants of Betacoronaviruses in endemic frugivorous bats of Madagascar. Virol J. 2015;12(1).
  63. 63. Montecino-Latorre D, Goldstein T, Kelly TR, Wolking DJ, Kindunda A, Kongo G, et al. Seasonal shedding of coronavirus by straw-colored fruit bats at urban roosts in Africa. PLoS One. 2022;17(9):e0274490. pmid:36107832
  64. 64. Kumakamba C, Niama FR, Muyembe F, Mombouli J-V, Kingebeni PM, Nina RA, et al. Coronavirus surveillance in wildlife from two Congo basin countries detects RNA of multiple species circulating in bats and rodents. PLoS One. 2021;16(6):e0236971. pmid:34106949
  65. 65. Gibb R, Redding DW, Chin KQ, Donnelly CA, Blackburn TM, Newbold T. Zoonotic host diversity increases in human-dominated ecosystems. Nature. 2020;584(7821):398–402.
  66. 66. Noer CL, Dabelsteen T, Bohmann K, Monadjem A. Molossid bats in an African agro-ecosystem select sugarcane fields as foraging habitat. Afr Zool. 2012;47(1):1–11.
  67. 67. Zhou P, Fan H, Lan T, Yang XL, Shi WF, Zhang W. Fatal swine acute diarrhoea syndrome caused by an HKU2-related coronavirus of bat origin. Nature. 2018;556(7700):255–9.
  68. 68. Anthony SJ, Johnson CK, Greig DJ, Kramer S, Che X, Wells H. Global patterns in coronavirus diversity. Virus Evolution. 2017;3(1).