Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Banana bunchy top virus genetic diversity in Pakistan and association of diversity with recombination in its genomes

  • Sana Bashir,

    Roles Data curation, Formal analysis, Methodology, Software, Writing – original draft

    Affiliation Department of Biosciences, COMSATS University Islamabad, Islamabad, Pakistan

  • Syed Muhammad Saqlan Naqvi,

    Roles Conceptualization

    Affiliation Department of Biochemistry, PMAS-Arid Agriculture University, Rawalpindi, Pakistan

  • Aish Muhammad,

    Roles Resources

    Affiliation National Institute for Genomics and Advanced Biotechnology, National Agriculture Research Centre, Islamabad, Pakistan

  • Iqbal Hussain,

    Roles Validation

    Affiliation National Institute for Genomics and Advanced Biotechnology, National Agriculture Research Centre, Islamabad, Pakistan

  • Kazim Ali,

    Roles Validation

    Affiliation National Institute for Genomics and Advanced Biotechnology, National Agriculture Research Centre, Islamabad, Pakistan

  • Muhammad Ramzan Khan,

    Roles Visualization

    Affiliation National Institute for Genomics and Advanced Biotechnology, National Agriculture Research Centre, Islamabad, Pakistan

  • Sumaira Farrakh,

    Roles Investigation, Validation

    Affiliation Department of Biosciences, COMSATS University Islamabad, Islamabad, Pakistan

  • Tayyaba Yasmin,

    Roles Writing – review & editing

    Affiliation Department of Biosciences, COMSATS University Islamabad, Islamabad, Pakistan

  • Muhammad Zeeshan Hyder

    Roles Conceptualization, Funding acquisition, Project administration, Supervision, Writing – review & editing

    zeeshan_hyder@comsats.edu.pk

    Affiliation Department of Biosciences, COMSATS University Islamabad, Islamabad, Pakistan

Abstract

Banana Bunchy top virus (BBTV) is a multipartite circular single strand DNA virus that belongs to genus Babuvirus and family Nanoviridae. It causes significant crop losses worldwide and also in Pakistan. BBTV is present in Pakistan since 1988 however, till now only few (about twenty only) sequence of genomic components have been reported from the country. To have insights into current genetic diversity in Pakistan fifty-seven genomic components including five complete genomes (comprises of DNA-R, -U3, -S, -M, -C and -N components) were sequenced in this study. The genetic diversity analysis of populations from Pakistan showed that DNA-R is highly conserved followed by DNA-N, whereas DNA-U3 is highly diverse with the most diverse Common Region Stem-loop (CR-SL) in BBTV genome, a functional region, which previously been reported to have undergone recombination in Pakistani population. A Maximum Likelihood (ML) phylogenetic analysis of entire genomes of isolates by using sequence of all the components concatenated together with the reported genomes around the world revealed deeper insights about the origin of the disease in Pakistan. A comparison of the genetic diversity of Pakistani and entire BBTV populations around the world indicates that there exists a correlation between genetic diversity and recombination. Population genetics analysis indicated that the degree of selection pressure differs depending on the area and genomic component. A detailed analysis of recombination across various components and functional regions suggested that recombination is closely associated with the functional parts of BBTV genome showing high genetic diversity. Both genetic diversity and recombination analyses suggest that the CR-SL is a recombination hotspot in all BBTV genomes and among the six components DNA-U3 is the only recombined component that has extensively undergone inter and intragenomic recombination. Diversity analysis of recombinant regions results on average one and half fold increase and, in some cases up to four-fold increase due to recombination. These results suggest that recombination is significantly contributing to the genetic diversity of BBTV populations around the world.

Introduction

Banana bunchy top disease (BBTD) is the most common and devastating viral disease of Banana, predominantly found in Pacific and Asian regions. It has been considered one of the most important plant viral diseases around the world [1]. In Pakistan, BBTD was first observed in 1988 in Sindh province. Later, based on symptomology it was identified in 1991 [2]. BBTD is caused by Banana Bunchy top Virus (BBTV), which belongs to genus Babuvirus in the family Nanoviridae [3]. BBTV is transmitted persistently by the aphid Pentalonia nigronervosa Coq, its sole known vector, and infects only the members of Musaceae [4].

Banana bunchy top virus (BBTV) is a multipartite circular single stranded (css) DNA virus comprising of six css DNA components each of about 1Kb in size [5, 6]. BBTV genome consists of six integral ssDNA components, including DNA-R, C, M, S and -N which encode master replication initiation protein (M-Rep), cell cycle link protein (Clink) movement protein (MP), capsid protein (CP) and nuclear-shuttle protein (NSP), respectively while DNA-U3 encodes a protein for which no function has been assigned so far [5, 710]. Two conserved notable functional regions exist in all six DNA components of BBTV, which include the common region stem-loop (CR-SL), which is about 70 nucleotides long and contains a 31 nucleotides origin of replication (ori) for proposed rolling-circle replication [11, 12]. Second region which is more than 70% identical in six components, and about 90 nucleotide long, is the common region major (CR-M) [5, 13] to which small ssDNA primers bound and initiate complementary-strand DNA synthesis after entering a host cell [14] and is under the course of concerted type evolution [15, 16].

Occurrence of genetic variations due to recombination is a well-established phenomenon in BBTV [17]. This phenomenon is of particular significance in the evolution of viruses as it provides the possibility of natural recombination events, which leads to extensive viral diversity [17]. In geminivirus evolution, the importance of recombination is well recognized [1820] as it is the most probable mechanism responsible for the genetic diversification of agriculturally important begomovirus species [21, 22]. Whilst generating the descendants with increased fitness, recombination is also a source for increased genetic diversity in begomoviruses [17]. In viruses, recombination breakpoints identification is a useful way to detect circulating recombinant forms and to infer the underlying recombination mechanisms such as intra and intergenomic recombination [23, 24]. Although sequence analysis of individual BBTV components [9, 10, 15, 16, 2530] and complete BBTV genomes [3136] revealed evidence of intra and intergenomic recombination, but no study related to the understanding the contribution of these recombination mechanisms in genetic diversity has been reported so far for BBTV. Previous molecular analysis of BBTV from Pakistan has been very limited with only one partial genome [37] and a few DNA components [16, 25, 37] have been sequenced from different districts of Sindh province. Thus, earlier phylogenetic analyses within the country performed on BBTV have used sequences of individual BBTV components rather than full genomes.

In the current study, we reported the sequence of complete genomes of five BBTV isolates originated from different districts of Sindh, Pakistan. The diversity analysis and putative recombination events were studied for Pakistani isolates and then for sequences available in the public database in GenBank. Also, the contribution of genetic diversity by recombination, recombination hotspots and population genetics are studied for BBTV genomes to better understand the evolution of this virus in various geographic regions around the globe.

Material and methods

DNA extraction, PCR amplification, cloning and sequencing

BBTV infected plant material showing typical BBTD symptom was collected from district Tandojam, Sindh province in 2006 for P.TJ1 isolate (for which DNA-R was previously reported by Hyder and colleagues (2007)), and in 2007, for P.BS1 & 2, P.GH1 & 2, P.HD1 & 2, P. JS1 (the DNA-U3 for P.GH1, P.JS1 and P.HD1 were previously reported by Hyder and colleagues (2011)), P.KP1 & 2, P.MT1 & 2, P.NS1, P.TA1 & 2 from Tandojam, Bhitshah, Ghotki, Hyderabad, Jamshoro, Khairpur, Matiari, Nawabshah and Tandoadam districts of the Sindh respectively. The P.TJ3, P.NARC and P.Sakrand & P.TJ4 were isolated during 2011, 2017 and 2018 respectively from Tandojam district. The total genomic DNA extraction was performed using the CTAB method as described by Hyder et al., (2007). PCR amplification and DNA sequencing were performed with two pairs of adjacent outwardly extending abutting primer each specific for a different location in each genomic component. Using this technique multiple reads in sequencing every component was generated and the sites where one set of primer binds, were sequenced using 2nd pair of primer and vice versa. for each genomic component of BBTV (Table 1).

thumbnail
Table 1. Details of primers used to sequence BBTV genomic components from Pakistan.

https://doi.org/10.1371/journal.pone.0263875.t001

The PCR amplifications were carried out either using GoTaq® PCR Core System I (Promega Corp. Madison, WI, USA) or using Taq DNA Polymerase (recombinant) (Fermentas UAB Lithuania), according to manufacturer’s instructions. A typical PCR reaction contained about 50 ng DNA template, respective Taq buffer, 1.5 mM MgCl2, 200 μM of each dNTPs, 2.5 units of Taq DNA polymerase and 50 pM of each primer. The thermal profile for both primer sets included pre-PCR denaturation at 94°C for 3 minutes followed by 35 cycles of denaturing at 94°C for 30 seconds, annealing at 52°C for 30 seconds and extension at 72°C for 45 seconds, and a final extension at 72°C for 20 minutes. The PCR products of P.TJ1 were ligated into pTZ57R (InsTA Cloning Kit, Fermentas UAB Lithuania) according to the manufacturer’s instructions and transformed into Escherichia coli DH5α (supE44, ΔlacU169 (Φ80lacZΔM15), hsdR17, recA1, endA1, gyrA96, thi-1, relA1) cells by electroporation. While for those isolates sampled during 2007, the PCR products were ligated directly into the pGEM®-T Easy Vector (Promega, Madison, WI) and cloned in E. coli DH5α cell. The clones were selected using 50 μg/ml ampicillin and screened for white colonies generated by insertional inactivation of functional beta galactosidase gene whose expression was induced by IPTG (Isopropyl β- d-1-thiogalactopyranoside) converting chromogenic X-GAL (5-bromo-4-chloro-3-indolyl-β-D-galactopyranoside) into blue color in non-recombinant bacterial cells. The plasmid DNA was extracted using a minipreparation protocol according to Sambrook and Russell [38], and confirmed by digestion with appropriate restriction endonucleases. For every component of an isolate, both the strands of two individual clones originated from two different primer sets of the respective component were sequenced. The genomic components of P.TJ1 isolate was sequenced using the CEQ Dideoxy Dye Terminator cycle sequencing Kit (Beckman Coulter, USA) and the CEQ 8000 Genetic Analysis System (Beckman Coulter, USA) by universal M13 and insert-specific primers to obtain full length sequence of each strand. The isolates sampled during 2007 were processed in a similar way and sequenced at sequencing facility of Iowa State University, U.S.A. Both the strands of the PCR products of two different primer sets specific for the same genomic component, from each isolate, sampled in 2011, 2017 and 2018 were directly sequenced using commercial DNA Sequencing Facility of Macrogen (Macrogen, Inc. South Korea). The sequence data of both the strand of a clone/PCR product was assembled first using DNA Dragon Contig Sequence Assembly Software (version 1.5.0) SequentiX—Digital DNA Processing (Germany) (https://www.sequentix.de/software_dnadragon.php). Then the consensus of sequencing, originating from two different primer sets, was developed for each component of an isolate. The consensus sequence of each sequenced component from an isolate was submitted in GenBank. The accession numbers were (MK140625, MK140626, MK140627, FJ859727, FJ859728, FJ859722, FJ859733, FJ859734, FJ859732, FJ859723, FJ859724, FJ859729, FJ859730, FJ859731, FJ859725, FJ859726, JX170762 for DNA-R), (MK140628, MK140629, MK140630, JX1700764 for DNA-U3), (MK140619, MK140620, MK140621, EF593169, FJ859740, FJ859741, FJ859735, FJ859746, FJ859747, FJ859745, FJ859736, FJ859737, FJ859742, FJ859743, FJ859744, FJ859738, FJ859739, JX170763 for DNA-S), (MK140616, MK140617, MK140618, EU095948, JX467685, JX467686, JX46768, JX170760 for DNA-M), (MK140613, MK140614, MK140615, EF520722, JX170759 for DNA-C), (MK140624, MK140625, MK140626, EF529519, JX170761 for DNA-N) respectively.

Sequence retrieval and genetic diversity

The full-length sequences of all components of BBTV genome were obtained from NCBI GenBank using Taxonomy Browser till April 05, 2020 and given abbreviated names for convenience (S1 Table). The coding and regulatory regions, including CR-SL and CR-M were identified as described by Burns and colleagues (1995), and the sequences were started from the start of major ORF identified by Herrera-Valencia and colleagues (2007) [39] in each component for alignment through MAFFT version 6.864 [40]. The genetic diversity values of each BBTV component and subgenomic regions were determined on the aligned sequences first for Pakistani isolates and then worldwide. The nucleotide diversity range analysis was performed using MatGAT [41], while Pair-wise average nucleotide diversity per 100 sites π (Pi) and Watterson estimator θw (Theta-w) for population mutation rates per 100 sites were determined using DnaSP version 6.12.03 [42].

To study the relationship BBTV isolates of Pakistan with BBTV isolates from other parts of the world, a Maximum Likelihood (ML) phylogenetic genomic tree based on the concatenated nucleotide sequences of entire genomic components (i.e. DNA-R, U3, S, M, C and N concatenated together in single sequence for an isolate) of those isolates for which full genomic sequences were available in the GenBank, was constructed with 1000 bootstrap replicates. In addition, individual component-based ML phylogenetic analyses of full-length sequences of each component separately was also performed in the Molecular Evolutionary Genetics Analysis program (MEGA), version X [43] with 1000 bootstrap replicates. The phylogenetic trees were visualized using FigTree version 1.4.3 [44] (http://tree.bio.ed.ac.uk/software/).

Selection pressure analysis

The selection pressure was estimated by dN/dS ratio, where dN represents the average number of nonsynonymous substitutions per nonsynonymous site and dS is the average number of synonymous substitutions per synonymous site. In MEGA X [43]. dN and dS values were estimated separately by using Nei-Gojobori method (Jukes-Cantor). The gene is under neutral selection when dN/dS ratio = 1, positive (or diversifying) selection when the dN/dS ratio is > 1 and negative (or purifying) selection when dN/dS ratio < 1.

Neutrality test

DnaSP version 6.12.03 [42] was used for testing Tajima D [45], Fu and Li’s D*and F* [46] number of haplotypes (H), haplotype diversity (Hd) and nucleotide diversity (π). Tajima’s D test in a genomic region measures the departure from neutrality for all mutations. Tajima’s D test is based on the differences between the number of segregating sites and the average number of nucleotide differences. Fu and Li’s D*test is based on the differences between the total number of mutations and number of singletons (mutations appearing only once among the sequence). Fu and Li’s F*test is based on the differences between the number of singletons and the average number of nucleotide differences between every pair of sequences. Haplotype diversity refers to the frequency and number of haplotypes in the population while nucleotide diversity estimates the average pairwise differences among sequences.

Recombination analysis

The recombination analysis was performed using Recombination Detection Program (RDP) version 4 Beta14 [47]. The intergenomic (intracomponent) recombination was determined by using MAFFT aligned sequences separately for each component, while intragenomic (intercomponent) recombination was determined using MAFFT aligned sequences of all the component together. The recombination analysis was performed using maximum χ2 (MaxChi) [48], bootscan [49], GENECONV [50], LARD [51], Distance Plot (SimPlot) [52], sister scanning (SiSscan) [53], TOPAL [54], chimaera [55] and reticulate (compatibility matrix) [56] implemented in RDP with the default values (P. value = 0.05, Multiple Comparison Correction = Bonferroni Correction, Number of permutations = 0) except that sequence type was set to circular.

The contribution of genetic diversity by recombinants in their respective populations was determined by selecting the recombined genomic regions for each recombination event in alignments of the respective component as identified by RDP and then selecting recombination population in the ML trees from the respective geographic location, aligning only the recombinant regions using MAFFT software and then calculating the genetic diversity range, π (Pi) and Watterson estimator θw (Theta-w) values as described earlier for the respective recombinant population with and without recombinant sequences.

Results

Genetic diversity of BBTV population

Total fifty-seven genomic components including five complete genomes of BBTV were sequenced from Pakistan during 2006 to 2018 (accession numbers given in Table 2). The genetic diversity analysis of these components along with other reported isolates of BBTV from Pakistan [16, 25, 37], showed that DNA-R is highly conserved followed by DNA-N, whereas DNA-U3 is highly diverse followed by DNA-C. Similarly, for coding regions, DNA-R is highly conserved followed by DNA-M, while, DNA-C followed by DNA-S are highly diverse (Table 3). However, when different functional regions of BBTV genome such as CR-M and CR-SL are compared with the full-length sequences and coding regions, it is revealed that between BBTV components CR-M is highly conserved while CR-SL is highly diverse in DNA-U3. Contrary to sequences of full-length components and coding regions, CR-M in DNA-R is highly diverse followed by DNA-C. In a similar contradiction, DNA-U3 bears the most conserved CR-M region. The sequence diversity in the CR-SL region of Pakistani BBTV population reveals very interesting findings, the DNA-R, -S, -M and -C have identical CR-SL and there exists no divergence, however, DNA-U3 and N showed diverse CR-SL region. While in DNA-N the genetic diversity was lower than DNA-U3 in this region (Table 3).

thumbnail
Table 2. Description of Banana bunchy top virus isolates sequenced from Pakistan for this study.

https://doi.org/10.1371/journal.pone.0263875.t002

thumbnail
Table 3. Genetic diversity of Banana bunchy top virus population in Pakistan.

https://doi.org/10.1371/journal.pone.0263875.t003

The BBTV population in Pakistan is considered to have a monophyletic origin and supposed to have been originated from a single introduction of BBTV in the country [25, 37] that took place before 1988 when BBTV was first observed in the Sindh [2]. The current diversity analysis (Table 3) suggests that BBTV components in Pakistan, are not evolving at a similar rate as some components are quite conserved i.e. DNA-R and DNA-N, while some have more diversification (i.e. DNA-U3). Interestingly, within a component, the divergence is not uniform, the most conserved component i.e. DNA-R has the most diverse CR-M, while DNA-U3 has CR-SL that is the most diverse functional region in the entire genome of BBTV.

Once having these insights about the genetic diversity of BBTV population in Pakistan, the genetic diversity of entire BBTV populations around the world was also calculated in similar way. The full-length sequences (1425 full-length components) from various BBTV isolates were obtained from GenBank using Taxonomy browser (S1 Table) and their diversity was calculated for two major groups of BBTV population i.e., the South Pacific and Asian groups [26] now referred to as Pacific Indian Ocean (PIO) and the Southeast Asian (SEA) group respectively [57] along with the entire world population. The analysis (Table 4) revealed that PIO group is more conserved compared to the SEA group, a finding that is consistent with the previous studies [7, 9, 26, 33, 58]. Among components DNA-R is the most conserved, while DNA-U3 is the most diverse component (Table 4) and the coding region analysis also follows the same trend. However, contradictory to the diversity analysis of BBTV-Pakistan, DNA-U3 has the most diverse CR-M in the entire BBTV population, while DNA-C has the most conserved CR-M region. The DNA-N has the most conserved CR-SL region in the entire population, while, like Pakistani population, DNA-U3 also has the most diverse CR-SL region. These analyses clearly suggest that different components and their functional part in BBTV genome are harboring different extent of genetic diversity and this diversity is related to different broader geographic regions such as PIO and SEA or to a country such as Pakistan.

thumbnail
Table 4. Genetic diversity of Banana bunchy top virus population in the world.

https://doi.org/10.1371/journal.pone.0263875.t004

Phylogenetic analysis

The phylogenetic relationship of BBTV isolates from different banana growing regions was analyzed to determine the relationships of isolates from Pakistan with those from worldwide. For this purpose, a concatenated genome-based ML phylogenetic tree was generated. The tree includes a total of 132 BBTV genomes out of which 107 were from PIO and 25 were from SEA group. The phylogenetic analysis showed close homology of Pakistan isolates with each other making a single clade with close relationships with isolates from Congo, Sri Lanka, Rwanda, Malawi, Burundi and India with a bootstrap support of 78% (Fig 1) in PIO. In addition, the multiple sequence alignments of a total 345 full-length DNA-R sequences (266 from PIO and 79 from the SEA), 208 full-length sequences of DNA-U3 (166 from PIO and 42 from SEA), 274 full-length sequences of DNA-S (214 from PIO and 60 from SEA), for DNA-M, 206 full-length sequences (164 from PIO and 42 from SEA), 194 full-length sequences of DNA-C (156 from PIO and 38 from SEA) and for DNA-N a total of 198 full-length sequences (154 from PIO and 44 from SEA) were conducted by MAFFT for each component separately and phylogenetic relationships were inferred using Maximum Likelihood (ML) method with 1000 bootstrap replications using MEGA X and Newick format files were also generated and annotated in FigTree version 1.4.3 [44] (Fig 2, S1S5 Figs. This analysis revealed a clear-cut partitioning of BBTV isolates into two major clusters (PIO and SEA group) [57]. Interestingly, though Pakistani BBTV population phylogenetically belonged to PIO based on all the genetic components however, their genetic diversity pattern (Table 3) is quite distinct compared to the PIO group (Table 4). Notably, the most conserved component in BBTV Pakistan is DNA-R, while in PIO it is DNA-C, while most conserved functional region in BBTV Pakistan is CR-SL (identical in DNA-R, -S, -M and -C) while in PIO it is CR-M of DNA-C.

thumbnail
Fig 1. Maximum Likelihood phylogenetic analysis of Banana bunchy top virus entire genomes, showing the relationship of newly characterized BBTV genomes from Pakistan.

Phylogenetic Maximum Likelihood analysis based on the concatenated nucleotide sequences of BBTV genomic components (R, U3, S, M, C, N together) indicates the evolutionary analyses of Pakistani isolates with the isolates from other parts of the world. The analysis involved 132 complete genome sequences. Each component is labelled with letters representing its geographic origin: Aus Australia, B. Burundi, C. China, Co. Congo, E. Egypt, I. India, M. Malawi, U. USA, S. Samoa, RW. Rwanda, Si. Sri Lanka, P. Pakistan, Ph. Philippines, T. Taiwan, To. Tonga, In. Indonesia, TH. Thailand. Pakistan isolates were shown in bold. The analysis showed two geographic regions in BBTV i.e Pacific Indian Ocean and Southeast Asia. Within the PIO Pakistan isolates showed homology with Indian, Samoa, Tonga Egypt, Australia and USA isolates. While SEA group includes isolates from China, Taiwan, Philippine, Thailand, Indonesia and a reassorted isolate of India.

https://doi.org/10.1371/journal.pone.0263875.g001

thumbnail
Fig 2. Phylogenetic analysis of BBTV DNA-U3 component illustrating intra and intergenomic recombination events at different nodes.

Phylogenetic analysis of DNA-U3 based on nucleotide sequences of BBTV isolates from Pakistan along with previously reported PIO and SEA isolates of BBTV. The evolutionary history was inferred using the Maximum likelihood (ML) method. The percentage of replicate trees in which the associated taxa clustered together in the bootstrap test (1000 replicates). Each sequence is labelled with the GenBank accession number followed by origin and isolate name. The phylogenetic trees were further annotated in FigTree software v1.4.3 (Rambaut, 2010). Light green colour in the circle represents SEA population, while yellow colour represents PIO population. Similar analysis of DNA-R, -S, -M, -C and -N were performed and provided as supplementary figures (S1S5 Figs) Intergenomic recombination events were marked with pink arc, while Intragenomic recombination events marked with blue arc. Isolates involved in both events were coloured brown. E represents the number of events and a,b,c,d etc. shows different populations in one event detected in RDP. Asterisk sign (*) shows only one isolate was involved in recombination at the given node. Arrows on both sides indicates that all the isolates were involved in respective recombination event. Blue arrow on both sides indicates that all the isolates of PIO group were involved in respective recombination event. The recombinant events described in S2 and S3 Tables are marked. Due to recombination different isolates showed intra and intergenomic events at different nodes.

https://doi.org/10.1371/journal.pone.0263875.g002

Population genetics and selection pressure

To understand the evolutionary selection pressure on various BBTV genomic components, the nonsynonymous to synonymous substitutions rate per site were calculated (Table 5) for coding regions of each component. For all the components (DNA-R, -S, -M, -C, -N and -U3 in many geographic regions), the dN values (ranging from 0.001–0.062) for the coding region were smaller than dS (ranging from 0.002–0.352) and the dN/dS ratios (ranging from 0.031–0.612) were <1, indicating negative/purifying selection, except for DNA-U3 where dN/dS ratio was different between different geographic populations (Table 5). In Pakistani, Taiwanese, Egyptian and Tongan population DNA-U3 showed negative/purifying selection (dN/dS ratio <1), while for Australian, Congo and USA populations showed positive/diversifying selection (dN/dS ratio >1) while population from India and Samoa showed neutral (dN/dS ratio = 1) selection pressure. The data of selection pressure indicated that the coding regions of all the components (except DNA-U3) of BBTV genome in various geographic populations are under purifying selection pressure. The purifying selection pressure is known to preserve the biological function and help in removal of deleterious mutations [59, 60] indicating that proteins encoded by BBTV genomic components are essential and any deleterious mutations occurring in their genes are eliminated from these geographic populations. However, DNA-U3 is interesting exception to this general observation noted for BBTV genome. In Congo it is under high positive selection with dN/dS value of 3.828. While it is under moderate positive selection in USA and Australia (dN/dS value of 1.773 and 1.496 respectively) and under neutral selection pressure in India and Samoa (with dN/dS value of 1.070 and 1.038 respectively) (Table 5).

thumbnail
Table 5. Component wise population genetic parameters of Banana bunchy top virus.

https://doi.org/10.1371/journal.pone.0263875.t005

The DNA-U3 encodes U3 protein for which any function is not yet been determined, however, it is an integral [5] and essential component for infection as demonstrated by infectivity assay for nanoviruses [61, 62]. Despite being essential component, the higher dN/dS values noted for DNA-U3 in some countries where rest of the components are under purifying selection, indicates that some significant factors are at play differently for DNA-U3. Interestingly, three out of four DNA-U3 components analyzed in this study were previously reported to have undergone recombination in Burundi [31], similarly, all of the DNA-U3 components of India and most of the components from Samoa, USA and Australia have also been reported for recombination [31] thus pointing toward the possible role of recombination in observed positive selection seen in DNA-U3 in some countries.

Similarly, the Indian subcontinent is noted as a major hub [31] of long-distance banana and BBTV movements. Stainton and colleagues (2015) indicated it as both the major donor location (for BBTV dispersal events to other parts of the world) and the major recipient location (of virus introductions in it). In general, BBTV isolates from the PIO group are found throughout the natural geographical range of Musa balbisiana, whereas isolates from the SEA group are found throughout the ranges of M. balbisiana and M. acuminata [63]. However, the banana germplasm of some Indian regions primarily comprised of hybrids between M. balbisiana from the Indian subcontinent and M. acuminata from Southeast Asia [63, 64]. This might explain the neutral pressure seen in DNA-U3 in India isolates. Different selection pressure was also observed among different banana genome types [30]. Later, Chiaki and colleagues (2015) also noted that selection pressure was higher in viruses infecting banana varieties with the AAB or ABB genotypes than those infecting with AA or AAA genotypes. The data of selection pressure in present study, suggests that BBTV genome is under negative selection pressure, however, the coding regions of DNA-U3 in some geographic regions around the word are under neutral and positive selection which might be due to mixing of isolates from PIO and SEA groups, and/or due to virus propagation in hosts of different genetic backgrounds and/or due to recombination.

Haplotype diversity values were high for BBTV components ranging from 0.4–1.0 (Table 6). Tajima’s D values were calculated to determine the impact of demographic expansion and contraction in different BBTV DNA component populations. Negative values with statistical significance (P<0.02 or P<0.01) were obtained for DNA-R and DNA-C (Tonga population) and in the Australian population of DNA-S. These results were further confirmed by Fu and Li’s D and F statistical test values (Table 6). Which suggests that these populations may be under expansion phase. On the contrary, no statistically significant positive or negative values were found in the remaining populations of BBTV components suggesting that these populations may be undergoing a neutral or contraction period.

thumbnail
Table 6. Neutrality test, haplotype and nucleotide diversity of each BBTV population.

https://doi.org/10.1371/journal.pone.0263875.t006

Recombination analysis

A detailed analysis on full-length sequences of BBTV genomic components using various methods implemented in the Recombination Detection Program (RDP), reveals many recombination events of intergenomic (homologous recombination between the same components) (S2 Table) and intragenomic (heterologous recombination between the two different components) (S3 Table) in BBTV genome. There are about fifty-four (66%) intergenomic recombination events out of total eighty-two events, while there are twenty-eight (34%) intragenomic recombinant events. Component wise, DNA-U3 is involved in the majority (thirty-five events, about 43%) of detected recombination events, followed by DNA-M (thirteen events, about 16%) while DNA-R and -N are involved in only nine (11%) events each. The data of recombination occurrence for each component revealed that DNA-S which encodes coat protein of BBTV has the least recombination occurrence of 1.8% (5 recombination events from 274 components) followed by DNA-R with 2.6% (9 recombination events detected in 345 components) (Table 7). It is worthy to note that DNA-U3 showed the highest 16.8% (35 recombination events occurred in 208 component) occurrence of recombination in BBTV genome which is about 9 times higher than the least recombined component of DNA-S. Interestingly, the DNA-U3 which was found to have the greatest genetic diversity (Table 4) shows the highest involvement in the recombination events occurring in BBTV genome, while DNA-R that is the most conserved component, is also among the least recombined components in BBTV genome.

thumbnail
Table 7. Occurrence of recombination in Banana bunchy top virus genome.

https://doi.org/10.1371/journal.pone.0263875.t007

To understand the contribution of recombination in BBTV genetic diversity, the recombined regions identified by RDP for the respective events were selected and the diversity analysis of these geographic populations with and without recombinants was performed. The analysis (Table 8) revealed that recombination is responsible to increase the genetic diversity of many of the geographic populations which harbor recombinant isolates. In some instances (recombination event (1), Table 8), it is responsible to increase the diversity of about 4 times due to the recombinants in those populations. In some cases (recombination events (3), (15), (16), (17), (18), (33), (38), (39), (40), (48), (50), (53), (67), (72), Table 8) the diversity of recombinant population is less than their non-recombinant population, however, the average of the entire dataset (for which recombinant and non-recombinant population of certain geographic region was available) showed a net contribution of about 1.4 times suggesting a significant contribution by recombination in BBTV genetic diversity.

thumbnail
Table 8. Contribution of genetic diversity by recombinant isolates in their population.

https://doi.org/10.1371/journal.pone.0263875.t008

The frequency of recombined regions in BBTV genome was analyzed by plotting them on various components (Fig 3) using TJ1 as a reference for nucleotides coordinates. The data show that DNA-U3 is highly involved both in inter and intragenomic recombination and DNA-R and DNA-S are least recombined components. In the BBTV genome, CR-SL, a functional regulatory region is identified as the recombinant hotspot for both intragenomic and intergenomic recombination.

thumbnail
Fig 3. Recombination map of Banana bunchy top virus genome.

The genomic components of BBTV are represented with inner circle having CR-SL, CR-M and ORF in black, pink and blue colours respectively. The nucleotide coordinates are corresponding to P.TJ1 isolates for reference. The red circles represent the intergenomic recombination, while the blue circle represent the intragenomic recombination. The intensity of colours was used to depict the frequency of recombination at a particular region using the key above. The recombinant events described in S2 and S3 Tables are marked. The figures were not drawn on scale.

https://doi.org/10.1371/journal.pone.0263875.g003

Discussion

Understanding the genetic structure of virus populations and their evolutionary mechanisms is an important aspect of managing viral diseases [65]. In Pakistan, information on the molecular analysis of BBTV was very limited with only a partial genome [37] and few full-length DNA components sequenced and deposited within the public database in GenBank [16, 25, 37]. However, inferences based on full-length BBTV genomes suggest that genetic exchange by recombination and reassortment might have played an important role in BBTV evolution around the world [31]. Therefore, fifty-seven genomic components including five complete genomes were sequenced in this study from Pakistan and their genetic diversity and recombination analyses was performed.

In the present study to understand existing variability in BBTV population, a detailed analysis of genetic diversity of its all component was performed first for the Pakistani population and later for the entire population of BBTV in the world. The genetic diversity data of Pakistani population (Table 3) suggested that BBTV full-length components diversity (ℼ) ranged from 0.26 to 0.69 pair-wise average nucleotide diversity per 100 sites, with Watterson estimator (θw) for population mutations rate of 0.36 to 0.99 per 100 sites in the country. The analysis showed that DNA-R a highly conserved component has the most diverse Common Region-Major (CR-M) while DNA-U3, a highly diverse component with the most conserved Common Region-Major (CR-M) and most diverse Common Region Stem-loop (CR-SL) (Table 3). An earlier study of recombination in CR-SL of DNA-U3 from Pakistan has also verified the diverse nature of this subgenomic regulatory region [16]. The analysis also indicated the heterogeneity of genetic diversity values associated with each component and their different parts, the overall diversity remained relatively small compared to the previously reported values for the Pacific Indian Ocean group [4, 26, 58]. Which has been split from the total population of BBTV probably a much larger time than the first introduction of BBTV in 1988 in Pakistan.

The genetic diversity of entire BBTV population around the world (Table 4) was also calculated and compared with the diversity analysis of Pakistani population. In contrary to the existing BBTV diversity, diversity analysis of Pakistani BBTV population showed that the CR-M of DNA-U3, is the most diverse CR-M in the entire BBTV population. The analysis also confirmed the heterogeneity of genetic diversity values associated with each component and their different parts, through the diversity values (ℼ) for full-length sequences ranged between 4.42 to 8.54 pair-wise average nucleotide diversity per 100 sites, with Watterson estimator (θw) for population mutations rate of 6.61 to 11.07 per 100 sites, greater of an order of 10 to 20 times than the Pakistani population, suggesting that other factors might be at play for this observed diversity. Multiple sequence alignment of full-length DNA-R component worldwide shares 92–100% and 89.8–100% sequence identity (Table 4) with members of PIO and SEA group respectively. Which shows on average about 3% and 7% increase in PIO and SEA group as compared to earlier studies [15, 26, 33, 58] while in the case of CR-M and CR-SL on average increase was approximately 38% and 18.8% respectively in SEA population. For PIO population increase was about 12.3% and 1.85% in CR-M and CR-SL respectively. Notably multiple sequence alignment of full-length components of Pakistan population showed DNA-R, the most conserved component with 99.1–100% sequence identity while CR-SL was the conserved functional region with 100% sequence identity in all components except DNA-U3 and DNA-N with sequence identities of 90.2–100% and 98.8–100% respectively (Table 3). While multiple sequence analysis of full-length components in PIO showed DNA-C as most conserved component (92.4–100%) and CR-M of DNA-C as conserved functional region (92.1–100%) (Table 4). These differences of genetic diversity in various components of Pakistani BBTV population compared to PIO group might be due to various factors such as differential evolutionary pressure on important conserved proteins encoded by respective components or due to reassorted genomic components introduced in the country belonging to different regions. However, the phylogenetic analysis of individual genomic components (Fig 2, S1S5 Figs and full entire genomic ML phylogenetic analysis (Fig 1) indicate that all Pakistani isolates clustered together as a clade of BBTV indicating that they have a common origin. This observation argues strongly against any possible reassortment. Therefore, the observed difference in the genetic diversity values of Pakistan isolates compared to PIO group, is not due to different origin rather it might be due to different evolutionary pressure on each component and/or possible recombination. The phylogenetic analysis of full-length DNA-R component (S1 Fig) showed that Pakistan isolates have closed homology with Egyptian and Indian isolates. Which is similar to previous studies [25, 37, 66] based on DNA-R full-length where Pakistan isolates have close homology with Egypt, India and Australia isolates. Similar analysis for other full-length BBTV components (DNA-U3, -S, -M and -N) indicated close homology of Pakistan isolates with India (Fig 2, S3S5 Figs). While in DNA- C (S4 Fig) the order of homology of Pakistan isolates with other PIO isolates was Egypt, Sri Lanka and India respectively. Notably the phylogenetic analysis of concatenated sequences of BBTV (Fig 1) showed clustering of Pakistan isolates within PIO which shows its homology with members of this region. However, within the PIO, Pakistan isolates have close homology with isolates from India, Congo, Sri Lanka, Rwanda, Malawi, and Burundi. Based on entire genome wise and individual component wise (DNA-U3, -S, -M and -N) ML analyses it is evident that Pakistani isolates have very close relationships with the Indian isolates (except for DNA-R and DNA-C component where they are closer to Egyptian isolates than Indian). The previous phylogenetic analyses from Pakistan based on Neighbor-Joining phylogenetic analysis of DNA-R only however, indicated that Pakistani isolates were closer to Egyptian isolates [25, 37]. Based on more rigorous Maximum Likelihood phylogenetic analyses of entire as well as other genomic components (i.e. DNA-U3, -S, -M and -N) it is evident that Pakistani isolates are closely related to Indian isolates than rest of any country in the PIO group.

The tendency of BBTV to induce genetic diversity is perhaps relevant for their ecological fitness. Selection pressure is an important estimator of evolutionary constraints imposed on coding regions [67]. The dN/dS ratio for coding regions (Table 5) of BBTV components showed that the population of DNA-R, DNA-S, DNA-M, DNA-C, and DNA-N components have strong purifying/negative selection while DNA-U3 have either positive or negative selection, which corroborates with the results of earlier studies for component R and S [30, 64]. In a comparative susceptibility study between two banana cultivars (Dwarf Brazilian AAB and Williams AAA) to BBTV, Dwarf Brazilian showed lower percentage of BBTV infection (39%) as compared to Williams (79%) in field experiments [68]. Based on their observation, Hooks and colleagues (2009) hypothesize that one or more morphological differences between the two cultivars might impact P. nigronervosa ability to inoculate BBTV. Banana pseudostem is waxy and banana aphid prefers to feed in pesudostem so the differences in wax content or composition between the two cultivars may lead to observed differences in virus transmission in their studied varieties. Furuya and colleagues (2012) [69] in a susceptibility study between Dwarf Cavendish (AAA) and Itobasho (BB) also observed reduced virus transmission in itobasho variety. Our analysis exhibited an excess of synonymous over nonsynonymous substitutions, indicating strong purifying (negative) selection as an additional mechanism constraining genetic variation [70]. BBTV is disseminated by aphid in a persistent manner, hence it is admissible that the respective constraint is inflicted on the CP gene to circumvent the accumulation of deleterious mutations which might be able to impede the virus-vector complex interactions [64, 71]. Majority of substitutions in DNA S were synonymous. As the sequence of DNA-S is conserved among all areas, the role of CP is crucial for the endurance of virus in the host plant [30]. Another possible sign of nucleotide encoding protein sequence having an impact on recombination patterns in BBTV is that the DNA-U3 component, which has no confirmed protein coding function, has a higher concentration of detectable recombination breakpoints (S2 and S3 Tables, Fig 3) than those of the known protein coding genes of other components. Remarkably, BBTV DNA-U3 seems to be most frequently exchanged by reassortment [31]. The presence of higher frequencies in the respective components depicts that it is substantially evolving neutrally without any risk that the recombinants might express defective chimeric proteins [72, 80]. Therefore, there is little conservation of coevolved epistatic interactions within the component. In comparison to the full-length sequences, low genetic diversity was observed in the coding regions (Table 6) delineating that recombination is not only confined to coding regions but it also proliferates to CR-M and CR-SL (Fig 3) in BBTV.

To evaluate natural selection at the population level significant negative values of neutrality test in different BBTV components (Table 6) suggest population expansion that is consistent with the previous studies for BBTV components R and S [30]. This suggests that whenever a population is increasing the number of segregating sites will increase more rapidly than that of nucleotide diversity leading to a negative test value. Positive values in few populations of BBTV components indicate that during balancing selection, alleles are kept at intermediate frequencies because there will be more pairwise differences than segregating sites [73]. This is due to considerable sequence heterogeneity that imparts a reservoir of virus variants in the population. It enables a significant adaptation to the changing environmental conditions. Therefore, the gene flow provided by the recombination exploit the mechanism to ameliorate their evolutionary tendencies and local adaptation.

Recombination is a major contributor to genetic diversity [18, 74] and genetic variations [75] in ssDNA viruses. Recombination analysis of BBTV components based on nucleotide sequence showed that DNA-U3 exhibits complex intra- and inter recombination patterns (Fig 3) with a recombination occurrence (Table 7) of about 16.8% which is about 9 times more than the least recombined DNA-S component (1.8%). While the recombination occurrence for other BBTV components ranged from 2.6%-7.92%. These observations suggest that recombination in components such as DNA-U3 may be selectively more favorable than recombination in components such as DNA-R, -S, -M, -C and -N [31]. DNA-R and DNA-S showed more frequent intergenomic recombination than intragenomic with a percentage of 1.73 and 1.45 respectively. Few incidences of recombination in DNA-R and DNA-S suggest that because of conserved nature and core functions [16, 64, 76] they are more prone to intergenomic rather than intragenomic recombination.

The contribution of genetic diversity by recombination (Table 8) revealed by analyzing the recombined and without recombination regions for each recombination event detected in RDP. The analysis showed that overall, on average there was about a 56.25% (27 populations out of 48 populations) increase in genetic diversity due to recombination. While within the BBTV components DNA-N (population of Tonga) and DNA-U3 (population of Australia, Tonga, China, Philippines and Congo) showed on average 100% and 90% increased diversity of recombinant populations. which indicates that there is a possibility that with component reassortment, recombination is also a significant evolutionary process driving the diversification of BBTV [30, 64, 7678].

The frequency of recombined regions in BBTV genome undergoing recombination (Fig 3) involved as many as seven times in intergenomic, while as many as four times in intragenomic recombination in various isolates of BBTV. DNA-U3 showed higher frequencies of recombined regions than other components of BBTV and overall recombined regions reside mostly in the CR-SL region. This region is a recombination hot spot in members of the geminiviridae [12, 20, 72] due to the production of a nick in the stem-loop by Rep which serves as the initiation site for rolling circle replication [79] through host DNA polymerase. Since BBTV also replicates by a rolling-circle mechanism [14], the CR-SL of DNA-U3 may similarly be a region subject to high levels of recombination.

Conclusively, five full genomes have been sequenced from BBTV infected banana plants from Pakistan. Deep insight into the genetic diversity analysis of Pakistani and entire BBTV populations around the world showed some interesting contradictions in the diversity of functional regions which might be attributable to recombination. This study highlights the benefits of characterizing complete BBTV genomes rather than focusing on individual components from Pakistan. This analysis also complements the facts that different BBTV genomic components are diversifying at different rates and within one component, different parts have different levels of divergence. These differences suggest that geographic factors are also playing role in shaping the diversity and evolution of BBTV. Recombination analysis showed DNA-U3 a recombination hot component while CR-SL a recombination hotspot in BBTV genome. This in certain cases leads towards about four-fold increase in genetic diversity in the recombined population.

Supporting information

S1 Table. Abbreviations, accession numbers and geographical origin of Banana bunchy top virus isolates and their components used in the study.

https://doi.org/10.1371/journal.pone.0263875.s001

(DOCX)

S2 Table. Intergenomic recombination in Banana bunchy top virus genomes.

https://doi.org/10.1371/journal.pone.0263875.s002

(DOCX)

S3 Table. Intragenomic recombination in Banana bunchy top virus genomes.

https://doi.org/10.1371/journal.pone.0263875.s003

(DOCX)

S1 Fig. Phylogenetic analysis of BBTV DNA-R illustrating intra and intergenomic recombination events at different nodes.

https://doi.org/10.1371/journal.pone.0263875.s004

(TIF)

S2 Fig. Phylogenetic analysis of BBTV DNA-S illustrating intra and intergenomic recombination events at different nodes.

https://doi.org/10.1371/journal.pone.0263875.s005

(TIF)

S3 Fig. Phylogenetic analysis of BBTV DNA-M illustrating intra and intergenomic recombination events at different nodes.

https://doi.org/10.1371/journal.pone.0263875.s006

(TIF)

S4 Fig. Phylogenetic analysis of BBTV DNA-C illustrating intra and intergenomic recombination events at different nodes.

https://doi.org/10.1371/journal.pone.0263875.s007

(TIF)

S5 Fig. Phylogenetic analysis of BBTV DNA-N illustrating intra and intergenomic recombination events at different nodes.

https://doi.org/10.1371/journal.pone.0263875.s008

(TIF)

References

  1. 1. Dale JL. Banana bunchy top: an economically important tropical plant virus disease.1987 Advances in Virus Research 33:301–325. pmid:3296696
  2. 2. Khalid S, Soomro MH. Banana bunchy top disease in Pakistan. Plant pathology. 1993 Dec;42(6):923–6. https://doi.org/10.1111/j.1365-3059.1993.tb02678.x
  3. 3. Fauquet CM, Mayo MA, Maniloff J, Desselberger U, Ball LA, editors. Virus taxonomy: VIIIth report of the International Committee on Taxonomy of Viruses. Academic Press; 2005 Jul 15.
  4. 4. Harding RM, Burns TM, Dale JL. Virus-like particles associated with banana bunchy top disease contain small single-stranded DNA. Journal of General Virology. 1991 Feb 1;72(2):225–30. pmid:1993864
  5. 5. Burns TM, Harding RM, Dale JL. The genome organization of banana bunchy top virus: analysis of six ssDNA components. Journal of General Virology. 1995 Jun 1;76(6):1471–82. pmid:7782775
  6. 6. Burns TM, Harding RM, Dale JL. Evidence that banana bunchy top virus has a multiple component genome. Archives of virology. 1994 Sep;137(3):371–80. https://doi.org/10.1007/bf01309482
  7. 7. Harding RM, Burns TM, Hafner G, Dietzgen RG, Dale JL. Nucleotide sequence of one component of the banana bunchy top virus genome contains a putative replicase gene. Journal of General Virology. 1993 Mar 1;74(3):323–8. pmid:8445361
  8. 8. Horser CL, Harding RM, Dale JL. Banana bunchy top nanovirus DNA-1 encodes the ‘master’replication initiation protein. Journal of General Virology. 2001 Feb 1;82(2):459–64. pmid:11161286
  9. 9. Wanitchakorn R, Hafner GJ, Harding RM, Dale JL. Functional analysis of proteins encoded by banana bunchy top virus DNA-4 to-6. Microbiology. 2000 Jan 1;81(1):299–306. pmid:10640570
  10. 10. Timchenko T, Katul L, Sano Y, de Kouchkovsky F, Vetten HJ, Gronenborn B. The master rep concept in nanovirus replication: identification of missing genome components and potential for natural genetic reassortment. Virology. 2000 Aug 15;274(1):189–95. pmid:10936099
  11. 11. Argüello-Astorga G, Herrera-Estrella L, Rivera-Bustamante R. Experimental and theoretical definition of geminivirus origin of replication. Plant molecular biology. 1994a Oct;26(2):553–6.https://doi.org/10.1007/bf00013742
  12. 12. Argüello-Astorga GR, Guevara-Gonzalez RG, Herrera-Estrella LR, Rivera-Bustamante RF. Geminivirus replication origins have a group-specific organization of iterative elements: a model for replication. Virology. 1994b Aug 15;203(1):90–100. https://doi.org/10.1006/viro.1994.1458
  13. 13. Herrera-Valencia VA, Dugdale B, Harding RM, Dale JL. An iterated sequence in the genome of Banana bunchy top virus is essential for efficient replication. Journal of general virology. 2006 Nov 1;87(11):3409–12. pmid:17030877
  14. 14. Hafner GJ, Harding RM, Dale JL. A DNA primer associated with banana bunchy top virus. Journal of General Virology. 1997 Feb 1;78(2):479–86. pmid:9018072
  15. 15. Hu JM, Fu HC, Lin CH, Su HJ, Yeh HH. Reassortment and concerted evolution in banana bunchy top virus genomes. Journal of Virology. 2007 Feb 15;81(4):1746–61. pmid:17135318
  16. 16. Hyder MZ, Shah SH, Hameed S, Naqvi SM. Evidence of recombination in the Banana bunchy top virus genome. Infection, Genetics and Evolution. 2011 Aug 1;11(6):1293–300. pmid:21539936
  17. 17. Albuquerque LC, Inoue-Nagata AK, Pinheiro B, Resende RO, Moriones E, Navas-Castillo J. Genetic diversity and recombination analysis of sweepoviruses from Brazil. Virology journal. 2012 Dec;9(1):1–4. pmid:23082767
  18. 18. Padidam M, Sawyer S, Fauquet CM. Possible emergence of new geminiviruses by frequent recombination. Virology. 1999 Dec 20;265(2):218–25. pmid:10600594
  19. 19. Umaharan P, Padidam M, Phelps RH, Beachy RN, Fauquet CM. Distribution and diversity of geminiviruses in Trinidad and Tobago. Phytopathology. 1998 Dec;88(12):1262–8. pmid:18944827
  20. 20. Lefeuvre P, Martin DP, Hoareau M, Naze F, Delatte H, Thierry M, et al. Begomovirus ‘melting pot’in the south-west Indian Ocean islands: molecular diversity and evolution through recombination. Journal of General Virology. 2007 Dec 1;88(12):3458–68. pmid:18024917
  21. 21. Monci F, Sánchez-Campos S, Navas-Castillo J, Moriones E. A natural recombinant between the geminiviruses Tomato yellow leaf curl Sardinia virus and Tomato yellow leaf curl virus exhibits a novel pathogenic phenotype and is becoming prevalent in Spanish populations. Virology. 2002 Nov 25;303(2):317–26. pmid:12490393
  22. 22. Owor BE, Martin DP, Shepherd DN, Edema R, Monjane AL, Rybicki EP, et al. Genetic analysis of maize streak virus isolates from Uganda reveals widespread distribution of a recombinant variant. Journal of general virology. 2007 Nov 1;88(11):3154–65. pmid:17947543
  23. 23. Pérez-Losada M, Arenas M, Galán JC, Palero F, González-Candelas F. Recombination in viruses: mechanisms, methods of study, and evolutionary consequences. Infection, Genetics and Evolution. 2015 Mar 1;30:296–307. pmid:25541518
  24. 24. Lefeuvre P, Lett JM, Varsani A, Martin DP (2009) Widely conserved recombination patterns among single-stranded DNA viruses. J Virol 83(6):2697–2707. pmid:19116260
  25. 25. Hyder MZ, Raza SQ, Naqvi SMS, Hameed S, Khalid S. Phylogenetic relationship of TJ1 isolate of Banana bunchy top virus from Pakistan by DNA-R sequence analysis. Canadian journal of plant pathology. 2007 Mar 1;29(1):63–8. https://doi.org/10.1080/07060660709507438
  26. 26. Karan M, Harding RM, Dale JL. Evidence for two groups of banana bunchy top virus isolates. Journal of General Virology. 1994 Dec 1;75(12):3541–6. pmid:7996145
  27. 27. Karan M, Harding RM, Dale JL. Association of banana bunchy top virus DNA components 1 to 6 with all infections. Molecular Plant Pathology On-Line,http://www.bspp.org.uk/mppol/1997/0624karan. 1997. http://www.bspp.org.uk/mppol/citation.php
  28. 28. Sharman M, Thomas JE, Skabo S, Holton TA. Abacá bunchy top virus, a new member of the genus Babuvirus (family Nanoviridae). Archives of virology. 2008 Jan;153(1):135–47. pmid:17978886
  29. 29. Xie WS, Hu JS. Molecular cloning, sequence analysis, and detection of banana bunchy top virus in Hawaii. Phytopathology. 1995 Mar 1;85(3):339–47. https://doi.org/10.1094/phyto-85-339
  30. 30. Chiaki Y, Nasir N, Herwina H, Sonoda A, Fukumoto T, Nakamura M, et al. Genetic structure and diversity of the Banana bunchy top virus population on Sumatra Island, Indonesia. European journal of plant pathology. 2015 Sep;143(1):113–22. https://doi.org/10.1007/s10658-015-0683-y
  31. 31. Stainton D, Martin DP, Muhire BM, Lolohea S, Halafihi MI, Lepoint P, et al. The global distribution of Banana bunchy top virus reveals little evidence for frequent recent, human-mediated long distance dispersal events. Virus evolution. 2015 Mar 1;1(1). pmid:27774281
  32. 32. Fu HC, Hu JM, Hung TH, Su HJ, Yeh HH. Unusual events involved in Banana bunchy top virus strain evolution. Phytopathology. 2009 Jul;99(7):812–22. pmid:19522579
  33. 33. Islam MN, Naqvi AR, Jan AT, Haq QM. Genetic diversity and possible evidence of recombination among Banana Bunchy Top Virus (BBTV) isolates. International Research Journal of Microbiology. 2010;1(1):1–2.
  34. 34. Hyder MZ. Sequencing and genetic characterization of major DNA components of Banana bunchy top virus. 2009 (Doctoral dissertation, Arid Agriculture University, Rawalpindi).
  35. 35. Banerjee A, Roy S, Behere GT, Roy SS, Dutta SK, Ngachan SV. Identification and characterization of a distinct banana bunchy top virus isolate of Pacific-Indian Oceans group from North-East India. Virus Research. 2014 Apr 21;183:41–9. pmid:24468493
  36. 36. Baldodiya GM, Baruah G, Borah BK, Modi MK, Nath PD. Molecular characterization and sequence analyses of Banana bunchy top virus infecting banana cultivar Jahaji (Dwarf Cavendish) in Assam, India. 3 Biotech. 2019 Mar;9(3):1–0. https://doi.org/10.1007/s13205-019-1636-5
  37. 37. Amin I, Qazi J, Mansoor S, Ilyas M, Briddon RW. Molecular characterisation of Banana bunchy top virus (BBTV) from Pakistan. Virus genes. 2008 Feb 1;36(1):191–8. pmid:17990091
  38. 38. Sambrook J, Russell DW. Preparation of plasmid DNA by alkaline lysis with SDS: minipreparation. Cold Spring Harbor Protocols. 2006 Jun 1;2006(1):pdb-rot4084. pmid:22485489
  39. 39. Herrera-Valencia VA, Dugdale B, Harding RM, Dale JL. Mapping the 5′ ends of banana bunchy top virus gene transcripts. Archives of virology. 2007 Mar;152(3):615–20. pmid:17187296
  40. 40. Katoh K, Misawa K, Kuma KI, Miyata T. MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic acids research. 2002 Jul 15;30(14):3059–66. pmid:12136088
  41. 41. Campanella JJ, Bitincka L, Smalley J. MatGAT: an application that generates similarity/identity matrices using protein or DNA sequences. BMC bioinformatics. 2003 Dec;4(1):1–4. pmid:12854978
  42. 42. Rozas J, Sánchez-DelBarrio JC, Messeguer X, Rozas R. DnaSP, DNA polymorphism analyses by the coalescent and other methods. Bioinformatics. 2003 Dec 12;19(18):2496–7. pmid:14668244
  43. 43. Kumar S, Stecher G, Li M, Knyaz C, Tamura K. MEGA X: molecular evolutionary genetics analysis across computing platforms. Molecular biology and evolution. 2018 Jun;35(6):1547. pmid:29722887
  44. 44. Rambaut A. (2010). FigTree, a graphical viewer of phylogenetic trees. Available from: http.ac.uk/software/figtree.
  45. 45. Tajima F. Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. Genetics. 1989 Nov 1;123(3):585–95. pmid:2513255
  46. 46. Fu YX, Li WH. Statistical tests of neutrality of mutations. Genetics. 1993 Mar 1;133(3):693–709. pmid:8454210
  47. 47. Martin DP, Murrell B, Golden M, Khoosal A, Muhire B. RDP4: Detection and analysis of recombination patterns in virus genomes. Virus evolution. 2015 Mar 1;1(1). pmid:27774277
  48. 48. Smith JM. Analyzing the mosaic structure of genes. Journal of molecular evolution. 1992 Feb;34(2):126–9. pmid:1556748
  49. 49. Salminen MO, Carr JK, Burke DS, Mccutchan FE. Identification of breakpoints in intergenotypic recombinants of HIV type 1 by bootscanning. AIDS research and human retroviruses. 1995 Nov;11(11):1423–5. pmid:8573403
  50. 50. Rodríguez-Nevado C, Montes N, Pagán I. Ecological factors affecting infection risk and population genetic diversity of a novel potyvirus in its native wild ecosystem. Frontiers in plant science. 2017 Nov 14;8:1958. pmid:29184567
  51. 51. Holmes EC, Worobey M, Rambaut A. Phylogenetic evidence for recombination in dengue virus. Molecular biology and evolution. 1999 Mar 1;16(3):405–9. pmid:10331266
  52. 52. Lole KS, Bollinger RC, Paranjape RS, Gadkari D, Kulkarni SS, Novak NG, et al. Full-length human immunodeficiency virus type 1 genomes from subtype C-infected seroconverters in India, with evidence of intersubtype recombination. Journal of virology. 1999 Jan 1;73(1):152–60. pmid:9847317
  53. 53. Gibbs MJ, Armstrong JS, Gibbs AJ. Sister-scanning: a Monte Carlo procedure for assessing signals in recombinant sequences. Bioinformatics. 2000 Jul 1;16(7):573–82. pmid:11038328
  54. 54. McGuire G, Wright F. TOPAL 2.0: improved detection of mosaic sequences within multiple alignments. Bioinformatics. 2000 Feb 1;16(2):130–4. pmid:10842734
  55. 55. Posada D, Crandall KA. Evaluation of methods for detecting recombination from DNA sequences: computer simulations. Proceedings of the National Academy of Sciences. 2001 Nov 20;98(24):13757–62. pmid:11717435
  56. 56. Jakobsen IB, Easteal S. A program for calculating and displaying compatibility matrices as an aid in determining reticulate evolution in molecular sequences. Bioinformatics. 1996 Aug 1;12(4):291–5. pmid:8902355
  57. 57. Yu NT, Zhang YL, Feng TC, Wang JH, Kulye M, Yang WJ, et al. Cloning and sequence analysis of two banana bunchy top virus genomes in Hainan. Virus Genes. 2012 Jun 1;44(3):488–94. pmid:22286609
  58. 58. Furuya N, Kawano S, Natsuaki KT. Characterization and genetic status of Banana bunchy top virus isolated from Okinawa, Japan. Journal of General Plant Pathology. 2005 Feb;71(1):68–73.https://doi.org/10.1007/s10327-004-0162-3
  59. 59. Cvijović I, Good BH, Desai MM. The effect of strong purifying selection on genetic diversity. Genetics. 2018 Aug 1;209(4):1235–78. pmid:29844134
  60. 60. Hughes AL, Hughes MA. More effective purifying selection on RNA viruses than in DNA viruses. Gene. 2007 Dec 1;404(1–2):117–25. pmid:17928171
  61. 61. Timchenko T, Katul L, Aronson M, Vega-Arreguin JC, Ramirez BC, Vetten HJ, et al. Infectivity of nanovirus DNAs: induction of disease by cloned genome components of Faba bean necrotic yellows virus. Journal of General Virology. 2006 Jun 1;87(6):1735–43. pmid:16690940
  62. 62. Grigoras I, Timchenko T, Katul L, Grande-Pérez A, Vetten HJ, Gronenborn B. Reconstitution of authentic nanovirus from multiple cloned DNAs. Journal of virology. 2009 Oct 15;83(20):10778–87. pmid:19656882
  63. 63. Molina AB , Kudagamage C. The international network for the improvement of banana and plantain (INIBAP): PGR activities in South Asia. In: South Asia network on plant genetic resources (SANPGR) meeting held on December 9–11 at Plant Genetic Resources Center (PGRC), Peradeniya, Sri Lanka; 2002. pp. 1–7
  64. 64. Das T, Banerjee A. Distribution, molecular characterization and diversity of banana bunchy top virus in Tripura, India. Virusdisease. 2018 Jun;29(2):157–66. pmid:29911148
  65. 65. Kim MK, Seo JK, Kwak HR, Kim JS, Kim KH, Cha BJ, et al. Molecular genetic analysis of Cucumber mosaic virus populations infecting pepper suggests unique patterns of evolution in Korea. Phytopathology. 2014 Sep;104(9):993–1000. pmid:25116642
  66. 66. Yasmin TA, Naqvi SS, Shah HU, Khalid SA. RFLP-based relationship of Pakistani isolate of Banana bunchy top virus with South Pacific virus group. Pakistan Journal of Botany. 2005 Jun 1;37(2):399.
  67. 67. Farzadfar S, Pourrahim R. Positive selection and recombination shaped the large genetic differentiation of Beet black scorch virus population. PloS one. 2019 Apr 25;14(4):e0215574. pmid:31022219
  68. 68. Hooks C. R. R., Manandhar R., Perez E. P., Wang K-H., and R. Almeida P. P.. Comparative susceptibility of two banana cultivars to Banana bunchy top virus under laboratory and field environments." Journal of economic entomology 102, no. 3 (2009): 897–904. https://doi.org/10.1603/029.102.0306
  69. 69. Furuya N., Kawano S., and Natsuaki K. T. Evaluation of resistance of Japanese fibre banana cv. Itobasho (Musa balbisiana var. liukiuensis) to Banana bunchy top virus. Japanese Journal of Phytopathology78, no. 2 (2012): 108–110. https://doi.org/10.3186/jjphytopath.78.108
  70. 70. Sun BJ, Sun LY, Tugume AK, Adams MJ, Yang J, Xie LH, et al. Selection pressure and founder effects constrain genetic variation in differentiated populations of soilborne bymovirus Wheat yellow mosaic virus (Potyviridae) in China. Phytopathology. 2013 Sep;103(9):949–59. pmid:23550972
  71. 71. Kumar PL, Hanna R, Alabi OJ, Soko MM, Oben TT, Vangu GH, et al. Banana bunchy top virus in sub-Saharan Africa: investigations on virus distribution and diversity. Virus Research. 2011 Aug 1;159(2):171–82. pmid:21549775
  72. 72. Lefeuvre P, Lett JM, Varsani A, Martin DP (2009) Widely conserved recombination patterns among single-stranded DNA viruses. J Virol 83(6):2697–2707. pmid:19116260
  73. 73. Saunders K, Bedford ID, Stanley J. Pathogenicity of a natural recombinant associated with ageratum yellow vein disease: implications for geminivirus evolution and disease aetiology. Virology. 2001 Mar 30;282(1):38–47. pmid:11259188
  74. 74. Duffy S, Holmes EC. Phylogenetic evidence for rapid rates of molecular evolution in the single-stranded DNA begomovirus tomato yellow leaf curl virus. Journal of virology. 2008 Jan 15;82(2):957–65. pmid:17977971
  75. 75. Mukwa LF, Gillis A, Vanhese V, Romay G, Galzi S, Laboureau N, et al. Low genetic diversity of Banana bunchy top virus, with a sub-regional pattern of variation, in Democratic Republic of Congo. Virus genes. 2016 Dec;52(6):900–5. pmid:27550369
  76. 76. Timchenko T, De Kouchkovsky F, Katul L, David C, Vetten HJ, Gronenborn B. A single rep protein initiates replication of multiple genome components of faba bean necrotic yellows virus, a single-stranded DNA virus of plants. Journal of virology. 1999 Dec 1;73(12):10173–82. pmid:10559333
  77. 77. Martin DP, Lefeuvre P, Varsani A, Hoareau M, Semegni JY, Dijoux B, et al. Complex recombination patterns arising during geminivirus coinfections preserve and demarcate biologically important intra-genome interaction networks. PLoS Pathog. 2011 Sep 15;7(9):e1002203. pmid:21949649
  78. 78. van der Walt E, Rybicki EP, Varsani A, Polston JE, Billharz R, Donaldson L, et al. Rapid host adaptation by extensive recombination. The Journal of general virology. 2009 Mar;90(Pt 3):734. pmid:19218220
  79. 79. Fontes EP, Gladfelter HJ, Schaffer RL, Petty IT, Hanley-Bowdoin L. Geminivirus replication origins have a modular organization. The Plant Cell. 1994 Mar 1;6(3):405–16. pmid:8180499
  80. 80. Moradi Z, Nazifi E, Mehrvar M. Occurrence and evolutionary analysis of coat protein gene sequences of Iranian isolates of Sugarcane mosaic virus. The plant pathology journal. 2017 Jun;33(3):296. pmid:28592948