Figures
Abstract
Delta SARS-CoV-2 triggered a devastating wave of COVID-19 in India, infecting approximately 26% of the population (~357 million people) within four months, with ~0.4 million new cases per day, and around 250,000 reported deaths. In contrast, Pakistan experienced a much milder Delta wave, recording only 320,333 infections and 6,287 deaths. Against this epidemiological background, the current study aimed to identify genomic mutations in the delta-variant isolates reported from India and Pakistan and to compare their mutational profiles and phylogenomic patterns, without inferring direct clinical causality, in these neighboring countries. We analyzed 1,312 Pakistani and 3,140 Indian delta-variant genomes from the GISAID database to characterize their mutational spectrum and phylogenomic relationships in a global context. A total of 3,412 mutations were identified in Pakistani isolates compared with 6,856 mutations in Indian samples, reflecting differences in sample size, with NSP3 and spike protein emerging as the most frequently mutated regions. After normalization on a per-genome basis, Pakistani Delta genomes showed a slightly higher mutation density across most ORFs. Several globally common mutations, including ORF8: S84L, ORF1b: P1000L, and S: D157–158 deletions, were absent in both countries. Indian delta-genomes exhibited distinctive variants such as an M1M mutation in ORF7a and multiple stop-gain mutations in ORF3a, ORF7a, ORF8, ORF10, and NSP4, which may reflect population-specific evolutionary patterns rather than definitive effects on pathogenicity or transmissibility. In contrast, Pakistani isolates carried three novel missense mutations, NSP12b: Q348H, ORF6: K42E, and ORF3a: Y211H, at low prevalence. Phylogenomic analysis revealed that Pakistani isolates clustered primarily with Middle Eastern lineages, particularly from Saudi Arabia and Oman, suggesting international travel–linked introductions, rather than implying exclusive transmission routes. These findings highlight distinct regional evolutionary trajectories of delta-variant and demonstrate that differences in total mutation counts do not directly translate into per-genome mutational burden or disease severity. Our results emphasize the need for continued genomic surveillance to monitor region-specific viral adaptations and inform public health strategies.
Citation: Fiaz N, Basheer A, Zahoor I, Naveed S, Yaqub T (2026) Distinct mutational landscapes and phylogenomic insights of the SARS-CoV-2 delta variant in Pakistan and India: Regional evolution, novel mutations, and epidemiological implications. PLoS One 21(2): e0340704. https://doi.org/10.1371/journal.pone.0340704
Editor: Mohamed Samy Abousenna, Central Laboratory for Evaluation of Veterinary Biologics, Agricultural Research Center, EGYPT
Received: August 30, 2025; Accepted: December 25, 2025; Published: February 27, 2026
Copyright: © 2026 Fiaz et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All the data are publicly available at https://gisaid.org/.
Funding: The author(s) received no specific funding for this work.
Competing interests: No authors have competing interests.
Introduction
The delta-variant (B.1.617.2) of SARS-CoV-2 was first detected in India on October 5, 2020 [1]. However, this variant quickly gained global attention because it was 40–60% more transmissible than alpha-variant [2], and 97% more transmissible and twice as infectious as the original Wuhan strain [3]. Clinical observations also indicated that patients suffering with delta-variant infections had 1.8-times higher risk of hospitalization compared with the alpha-variant infections [4]. According to some Chinese studies viral loads in delta-variant infections were approximately 1000-times higher than the viral loads in other variants infections [5,6]. Additionally, delta-variant showed evidence of escape from the immune response and antibody-based treatments [7,8]. Owing to these features, World Health Organization (WHO) declared delta as a variant-of-concern (VOC) on May 11, 2021 [1], and, shortly thereafter in June 2021, described it as “the fastest and fittest” SARS-CoV-2 variant due to its high transmissibility and disproportionate impact on the peoples suffering with chronic diseases like diabetes [9–11]. Moreover, it also contributed to the third wave of pandemic in many countries in Africa, Asia, Europe, North & South Americas [12] and became globally dominant variant as it continued to evolve and mutate [13,14]. However, in Pakistan the first confirmed patient of this variant was reported on May 16, 2021 and this lineage drove the country’s 4th wave of COVID-19, beginning in mid-July 2021 [15].
Like other SARS-CoV-2 variants, Delta accumulated a distinct constellation of mutations across the Spike and non-structural proteins [16,17] that enhance its ACE2 affinity, viral entry, and immune evasion [18,19]. Notably, delta-variant harbor ten spike mutations, out of which L452R, T478K, and P681R were the characteristic Delta-mutations associated with enhanced transmissibility and reduced antibody neutralization [20–23]. The T478K mutation strengthen the RBD-ACE2 interaction and thus promote the cell entry [20,21], while P681R, located near the furin cleavage site, facilitate efficient membrane fusion and cell entry [24–26].
The Delta-variant drove India’s deadly second wave of COVID-19 beginning in February 2021 [27] during which daily cases averaged about 0.4 million [28] and about 26% (~357 million people) of the Indian population was infected over a four month period, and resulting in ~250,000 recorded deaths [12,29]. By contrast, in Pakistan’s delta-driven 4th wave, total number of cases and deaths were just 320,333 and 6,287, respectively [30]. To investigate potential genomic factors underlying these contrasting outcomes, we undertook a detailed mutational analysis of delta-variant populations of Pakistan and India by retrieving 1312 and 3140 high-quality, whole-genome sequences, respectively, from GISAID [31]. Hence, by tracking the delta-variant genomes, present study provides deep insight into the differences in the numbers and prevalence of mutations in Indian and Pakistani delta-populations, compared with its global pattern, and highlights the potential reasons why delta proved particularly severe in India. Finally, we performed phylogenomic analysis of delta-variant isolates from 14 different countries with substantial Pakistani diaspora and frequent travel links, which helped infer the probable routes of introduction of this variant to Pakistan.
Materials and methods
Genome retrieval and initial metadata processing
From the all publicly available SARS-CoV-2 Delta variant genomes (B.1.617.2) deposited from Pakistan and India on the GISAID EpiCoV™ database, a set of 1312 and 3140 whole genome sequences were retrieved along with their associated metadata for Pakistan and India respectively. Metadata included sample collection date, submission date, sequencing laboratory, and GISAID accession IDs. Notably, all genomes were collected at the end of the Delta wave, when both countries had completed submission of their Delta-lineage sequences to GISAID database. The Wuhan-Hu-1 genome (NC_045512.2) was used as the reference for genomic coordinate numbering and mutation identification throughout the analysis.
Because the majority of genomes from both Pakistan and India were submitted toward the end of the Delta wave, evenly distributed early-, mid-, and late-phase datasets were not available. Consequently, month-wise temporal trend analysis of mutation emergence was not performed, as such stratification would have introduced sampling bias.
Quality control, filtering, and duplicate removal
To ensure reliability and comparability between Pakistani and Indian datasets, stringent and uniform quality-control (QC) criteria were applied to all genomes. Only sequences with a minimum length of 29,000 bp and fewer than 1% ambiguous bases (Ns) were retained. Genomes were required to be classified as “complete” and “high-coverage” according to GISAID standards. Sequences flagged for incomplete coverage, sequencing artefacts, or biologically implausible features such as frameshifts or premature stop codons (likely resulting from technical errors) were removed. Duplicate genomes were eliminated by comparing EPI_ISL accession identifiers as well as by computing SHA-256 sequence checksums to identify identical FASTA entries. Genomes lacking essential metadata, including sampling date, geographic location, or submitting laboratory, were excluded. After applying these QC filters uniformly to both countries, the final dataset consisted of 1,312 high-confidence Pakistani genomes and 3,140 high-confidence Indian genomes, all meeting consistent analytical standards.
Rationale for the final sample size
To determine the final dataset used for comparative genomic analysis, all 1,312 Pakistani genomes that passed QC filtering were retained, as the total number of high-quality sequences available from Pakistan was relatively limited. In contrast, because over 100,000 Delta genomes were available from India, a sampling strategy was needed to avoid overrepresentation of particular states, laboratories, or outbreak clusters. Therefore, a time-stratified and region-stratified random sampling design was applied: all high-quality Indian genomes were first grouped by month of collection and by state/union territory, and proportionate random sampling without replacement was conducted within each stratum. This approach ensured that the reduced Indian dataset (n = 3,140) remained geographically and temporally representative, preserved lineage diversity, and minimized sampling bias, while maintaining sufficient statistical power to robustly compare mutation prevalence and evolutionary patterns between the two countries.
Multiple sequence alignment
All quality-filtered SARS-CoV-2 genomes were aligned using MAFFT v7.480 [32,33], employing the high-accuracy L-INS-i algorithm optimized for full-length viral genomes. The alignment was performed using a gap-opening penalty of 1.53, a gap-extension penalty of 0.123 [34], and nucleotide mode settings, with the Wuhan-Hu-1 reference genome (NC_045512.2) included to standardize positional numbering across sequences. Following automated alignment, all FASTA files were visually inspected in MEGA-X [35] and BioEdit (v7.2.5) [36] to verify correct placement of indels, assess regions of high divergence, and confirm overall alignment quality prior to downstream mutation analysis.
Mutation detection and annotation
Mutation profiling for each genome was performed relative to the Wuhan-Hu-1 reference sequence using the Coronapp web application [37], which identifies single-nucleotide substitutions, insertions, deletions, frameshift events, in-frame indels, and stop-gain mutations. To ensure analytical accuracy, Coronapp outputs were cross-validated using an in-house Python-based comparison of aligned FASTA files, and all discordant calls were manually reviewed in BioEdit and MEGA-X. Global prevalence estimates for key mutations were retrieved from Outbreak.info, enabling contextual comparison of Pakistani and Indian mutation patterns with worldwide Delta-variant trends.
Statistical comparison of mutation prevalence
For each detected mutation, prevalence was calculated independently for Pakistani, Indian, and global Delta-genomes. Statistical differences between Pakistan and India were assessed using two-sided chi-square (χ²) tests with a significance threshold of p < 0.05. This approach allowed evaluation of whether specific mutations were significantly enriched in one population relative to the other, thereby enabling identification of country-specific mutation patterns with potential epidemiological relevance.
Functional and protein-level annotation
All mutations were classified according to genomic location, including structural, nonstructural (NSP), and accessory ORFs and functional consequence, such as missense, synonymous, stop-gain, frameshift, or in-frame insertion/deletion. A heatmap summarizing mutation prevalence across Pakistan, India, and global datasets was generated to visualize regional differences in variant composition. Functional interpretation focused particularly on known Delta-defining spike mutations (e.g., L452R, T478K, P681R) as well as novel or low-frequency mutations unique to either country, integrating published SARS-CoV-2 protein annotations to infer potential biological impacts.
To account for differences in the number of genomes between countries, all mutation counts were normalized on a per-genome basis by dividing total mutations in each ORF by the number of high-quality genomes included per country (Pakistan: n = 1,312; India: n = 3,140). Normalized values are provided in Supplementary S1 Table.
Global phylogenomic tree construction
To reconstruct robust evolutionary relationships among global SARS-CoV-2 Delta genomes, we assembled a geographically balanced dataset of 1,328 high-quality genomes obtained through stratified subsampling across all countries represented in the study. For countries with large genome availability, we selected 100 representative sequences from each of Australia, Canada, England, France, Germany, India, Italy, Norway, Oman, Pakistan, Saudi Arabia, the USA, and South Africa, and the, only available, 28 sequences from the United Arab Emirates. All genomes were aligned using MAFFT v7.505 (FFT-NS-2 strategy) with Wuhan-Hu-1 (NC_045512.2) as the reference. Maximum-likelihood phylogenetic inference was performed using IQ-TREE v2.1.4 under the GTR + G substitution model, with full model-parameter optimization and 1,000 ultrafast bootstrap replicates to assess branch support. The resulting ML tree was visualized in ITOL (https://itol.embl.de/).
Results
A total of 3,412 mutations were identified in the Delta-variant genomes from Pakistan. These included 1,943 amino acid–changing mutations, 1,248 silent mutations, 123 non-coding region mutations, 2 in-frame deletions, 36 frameshift deletions, 1 in-frame insertion, 31 frameshift insertions, and 28 stop-gain mutations (Table 1; Fig 1A).
In comparison, 6,856 mutations were detected in Delta genomes from India, nearly double the number observed in Pakistan. These consisted of 3,925 amino acid–changing mutations, 2,349 silent mutations, 229 non-coding mutations, 26 in-frame deletions, 122 frameshift deletions, 16 in-frame insertions, 42 frameshift insertions, 75 stop-gain mutations, and 72 M1M mutations (Table 2; Fig 1B). A bootstrap analysis (parametric Poisson model, 200,000 iterations) yielded a mutation-burden ratio of 2.01 (95% CI: 1.93–2.09), supporting the descriptive “nearly double” difference between the two datasets.
However, because the Indian dataset comprised a larger number of genomes, mutation counts were normalized on a per-genome basis. This analysis revealed that Pakistani Delta-genomes carried a slightly higher mutation density per genome across most ORFs compared with Indian genomes (Supplementary S1 Table).
Protein-wise mutation distribution
To enable fair comparison between countries with unequal sample sizes, protein-wise mutation patterns are interpreted primarily using per-genome normalized mutation density (Supplementary S1 Table), while total mutation counts are reported descriptively.
Among the total 3,412 mutations detected in Pakistani genomes, 2,099 occurred within ORF1ab, which encodes the 16 non-structural proteins (NSPs). NSP3 contained the largest number of mutations (399 missense, 219 silent), followed by NSP2 (162 missense, 78 silent). The RNA-dependent RNA polymerase (RdRp) harbored 224 mutations, and ORF3a contained 174. ORF6, ORF7a, ORF7b, ORF8, and ORF10 showed 32, 70, 18, 74, and 23 mutations, respectively (Table 1).
Across structural proteins, the Spike (S) protein carried 465 mutations, including 286 missense mutations, 161 silent mutations, 1 in-frame insertion, 10 frameshift deletions, 5 frameshift insertions, and 2 stop-gain mutations. The nucleocapsid (N) protein contained 232 mutations, while the membrane (M) and envelope (E) proteins contained 63 and 39 mutations, respectively. The 5′UTR and 3′UTR regions harbored 44 and 79 mutations (Table 1).
Prevalent mutations in Pakistan and India
Comparisons in this section are based on mutation prevalence (%) rather than absolute mutation counts, thereby minimizing the effect of unequal sample sizes between countries.
Several mutations such as NSP3: F106F, S: D614G, S: P681R, M: I82T, and ORF3a: S26L showed >99% prevalence in Pakistani samples, consistent with global frequencies (98–99%; Table 3). Some globally common missense mutations, including ORF1b: P1000L, ORF1b: ORF8: A1918V, ORF8: S84L, S: D157–158 deletions, were not detected in Pakistani sequences.
The M1M mutation in ORF7a was observed at 72.55% prevalence in India but was absent in Pakistan and the global dataset. Similarly, the S: E156G mutation showed extremely low prevalence in Pakistan (0.19%) compared with 86.7% globally. India also exhibited multiple stop-gain mutations (e.g., E41* in ORF7a; G44* in ORF3a; Y335* in NSP4; G8* in ORF8) with low but notable prevalence. These were not observed in Pakistani or global datasets (Table 3).
Additional mutations, such as F120L, L85F, W128L, P46L, W27L, and others were present at moderate frequencies in Indian samples (5–33%) but absent in Pakistani sequences. Conversely, the D119–120 deletion in ORF8, common globally (84.1%), showed low prevalence in both datasets (0.096% in Pakistan; 31.56% in India). Several mutations including NSP3: P822L, NSP4: A446V, NSP6: V149A, S: A222V, and others appeared at substantially higher prevalence in Indian sequences, whereas G49V, N72I, T221K, S1285F and S1370F had higher prevalence in Pakistan than globally, with limited representation in Indian data.
Mutation counts were normalized per genome. Pakistan showed 2.60 mutations/genome, whereas India showed 2.18 mutations/genome (Supplementary S1 Table). Per-protein normalization demonstrated that Pakistan had slightly higher per-genome mutation density across most ORFs (e.g., NSP3: 0.49 vs 0.41; Spike: 0.35 vs 0.30). Thus, the higher total number of mutations detected in India primarily reflects the larger number of genomes analyzed rather than a consistently higher per-genome mutational burden.
Overall, the heatmap (Fig 2) summarizes shared versus population-specific mutations and contrasts their prevalence against global frequencies.
Phylogenomic analysis
The results of phylogenomic analysis (Fig 3) showed that Pakistani samples were grouped in many clusters showing their relationships with samples submitted from different countries. However, the closest relationship of Pakistani samples was observed with samples submitted from Saudi Arabia, followed by Oman, France, Norway, England, Australia, South Africa, India, and the United States. Interestingly, no relationship of Pakistani samples was found with the samples originated from Canada, Germany, and Italy, though a large number of Pakistani diasporas reside there.
Discussion
This study provides a comparative framework for understanding how the SARS-CoV-2 Delta variant diversified across two closely connected yet epidemiologically distinct regions (Pakistan and India). Rather than merely documenting mutational differences, our analysis highlights how region-specific evolutionary trajectories, founder effects, and transmission networks may have contributed to the divergent genomic landscapes observed in the two countries.
A key finding of our work is the substantially higher total number of detected mutations in Indian Delta-genomes compared with Pakistani genomes. However, because total mutation counts are influenced by sample size, comparisons were further evaluated using per-genome normalized mutation density. Although mutation counts alone do not infer functional impact, the greater genomic diversity observed in India at a population level is consistent with reports that large, rapidly expanding epidemics promote accelerated intrahost evolution and the emergence of additional sublineages [38–40]. Such patterns have been documented globally in settings of high transmission intensity, where increased opportunities for replication increase the likelihood of new substitutions and indels [26,41,42]. In contrast, Pakistan’s smaller epidemic size may have constrained evolutionary space, resulting in fewer accumulated changes at the aggregate level [30]. This interpretation aligns with prior genomic surveillance studies from South Asia showing heterogeneous rates of Delta diversification shaped by demographic and epidemiological conditions [2,10,43].
Across both datasets, NSP3 and Spike remained the principal mutational hotspots, a hallmark of Delta evolution globally [40,44]. However, country-specific differences in NSP3, NSP4, and NSP6 substitutions suggest that distinct selective pressures or founder events may have acted within each population. For example, India showed elevated frequencies of NSP3: P822L, NSP4: A446V, and NSP6:V149A, all of which have been associated, experimentally or computationally, with altered membrane remodeling or replication dynamics [45–47]. Pakistan, on the other hand, displayed higher prevalence of variants such as T492I and V167L in NSP4, which align more closely with globally predominant patterns [48]. Importantly, these contrasts reflect differences in mutation prevalence rather than per-genome mutation rates, underscoring that even shared variants of concern can accumulate geographically structured accessory mutations, a pattern widely reported for Delta-sublineages in Europe, Asia, and South America [23,39,49–52]. Recent genomic surveillance studies published after 2023 have further shown that Delta diversification frequently involved region-specific circulation of AY sublineages, including AY.4 and AY.122, highlighting continued heterogeneity in Delta evolutionary trajectories across geographic settings [39–41,51].
Spike N-terminal domain differences further illustrate this divergence. Mutations such as E156G and the S155 insertion, frequently detected in India, were rare or absent in Pakistan, echoing earlier findings that these NTD changes emerged preferentially in regions with intense transmission and immune pressure [53,54]. Meanwhile, the absence of globally dominant D157–158 deletions in both countries suggests that South Asian Delta genomes followed trajectories distinct from those dominant in Europe and the Americas during the same period [1,13].
Differences also extended to accessory genes. India exhibited multiple ORF3a, ORF7a/b, and ORF8 stop-gain or disruptive mutations, whereas Pakistan did not. Previous studies have suggested these truncations may modulate innate immune antagonism or apoptosis pathways, though their exact phenotypic effects remain variable and context-dependent [26,55,56]. Importantly, our results do not infer clinical or epidemiological impact; rather, these patterns reflect distinct evolutionary experiments occurring across viral populations exposed to different epidemiological and immunological environments.
Non-coding region variation also differed substantially. The 3′UTR 28273A> variant was common in India but absent in Pakistan. This variant and similar 3′UTR polymorphisms has been associated with changes in RNA stability and replication efficiency in recent structural studies [18,57]. These findings emphasize the need to incorporate regulatory-region variation into future functional analyses, which often focus disproportionately on Spike.
Taken together, these results demonstrate that although Pakistan and India share the canonical Delta genomic backbone, they harbor distinct sets of accessory and non-structural mutations. Such divergence reflects the broader principle that variants of concern do not evolve uniformly across regions; instead, they branch into geographically structured sublineages influenced by population immunity, demographic conditions, movement patterns, and stochastic founder events [10,53,58].
Phylogenomic analysis further contextualizes these findings. Pakistani Delta genomes clustered more closely with isolates from Saudi Arabia, Oman, and several European countries than with those from India, despite geographic proximity. This pattern suggests, but does not definitively establish, diaspora-driven mobility supersedes regional adjacency in shaping introduction pathways [59]. The lack of strong linkage to Indian sequences suggests that Pakistan’s Delta outbreaks were seeded primarily through Middle Eastern travel corridors, highlighting the importance of integrating genomic surveillance with international mobility data [15,60,61]. Such insights carry practical implications for border-screening and travel-policy design.
Overall, our findings underscore the complexity of SARS-CoV-2 evolution in densely connected regions and emphasize the need for sustained genomic monitoring. While sequence-based analyses cannot establish causality regarding transmission dynamics or disease severity, they reveal evolutionary signatures that, when combined with clinical and demographic data, can enhance our understanding of variant behavior.
Because the analyzed genomes were predominantly submitted toward the end of the Delta wave, this study could not evaluate month-wise temporal trends in mutation emergence, which should be addressed in future analyses using longitudinally balanced datasets. In addition, this study did not perform site-specific recombination or selection pressure analyses (e.g., FUBAR, MEME, or FEL), as the primary focus was on comparative mutational profiling and phylogenomic structure; such approaches represent important avenues for future work aimed at resolving adaptive and selective mechanisms.
Future work should incorporate functional assays, immunological profiling, and travel-network modeling to clarify how accessory and non-structural mutations contribute to regional viral adaptation and to better anticipate the trajectories of emerging lineages.
Conclusion
Taken together, our findings underscore the profound impact of regional mutational diversity on epidemic outcomes. In India, a higher mutational burden, the enrichment of immune-evading spike alterations, and the presence of multiple stop-gain mutations across accessory proteins likely enhanced transmissibility, immune escape, and pathogenicity. By contrast, the Pakistani delta population, characterized by fewer mutations, the absence of disruptive variants, and introduction via Middle Eastern rather than Indian routes, might have led to comparatively milder epidemic outcomes. The identification of novel Pakistani mutations further emphasizes that even regions experiencing milder epidemics can harbor unique variants with the potential to expand under selective pressure. This reinforces the critical need for continuous genomic surveillance, not only to monitor global variants of concern but also to detect emerging region-specific mutations that may alter the trajectory of future waves.
Supporting information
S1 Table. Normalized mutation burden per genome.
https://doi.org/10.1371/journal.pone.0340704.s001
(DOCX)
References
- 1. Dhawan M, Sharma A, Priyanka, Thakur N, Rajkhowa TK, Choudhary OP. Delta variant (B.1.617.2) of SARS-CoV-2: Mutations, impact, challenges and possible solutions. Hum Vaccin Immunother. 2022;18(5):2068883. pmid:35507895
- 2. Bhattacharya M, Chatterjee S, Sharma AR, Lee S-S, Chakraborty C. Delta variant (B.1.617.2) of SARS-CoV-2: current understanding of infection, transmission, immune escape, and mutational landscape. Folia Microbiol (Praha). 2023;68(1):17–28. pmid:35962276
- 3. Campbell F, Archer B, Laurenson-Schafer H, Jinnai Y, Konings F, Batra N, et al. Increased transmissibility and global spread of SARS-CoV-2 variants of concern as at June 2021. Euro Surveill. 2021;26(24):2100509. pmid:34142653
- 4. Kumar A, Asghar A, Raza K, Narayan RK, Jha RK, Satyam A. Demographic characteristics of SARS-CoV-2 B.1.617.2 (Delta) variant infections in Indian population. Cold Spring Harbor Laboratory. 2021.
- 5. Sheikh A, McMenamin J, Taylor B, Robertson C, Public Health Scotland and the EAVE II Collaborators. SARS-CoV-2 Delta VOC in Scotland: demographics, risk of hospital admission, and vaccine effectiveness. Lancet. 2021;397(10293):2461–2. pmid:34139198
- 6. Fisman DN, Bogoch I, Lapointe-Shaw L, McCready J, Tuite AR. Risk Factors Associated With Mortality Among Residents With Coronavirus Disease 2019 (COVID-19) in Long-term Care Facilities in Ontario, Canada. JAMA Netw Open. 2020;3(7):e2015957. pmid:32697325
- 7. Callaway E. Delta coronavirus variant: scientists brace for impact. Nature. 2021;595(7865):17–8. pmid:34158664
- 8. Challen R, Dyson L, Overton CE, Guzman-Rincon LM, Hill EM, Stage HB, et al. Early epidemiological signatures of novel SARS-CoV-2 variants: establishment of B. 1.617. 2 in England. MedRxiv. 2021.
- 9. Li X, Zhang L, Chen S, Ji W, Li C, Ren L. Recent progress on the mutations of SARS-CoV-2 spike protein and suggestions for prevention and controlling of the pandemic. Infect Genet Evol. 2021;93:104971. pmid:34146731
- 10. Deepanshi, Budhiraja I, Garg D, Kumar N, Sharma R. A comprehensive review on variants of SARS-CoVs-2: Challenges, solutions and open issues. Comput Commun. 2023;197:34–51. pmid:36313592
- 11. Mahesh S, Singhla J, Gupta J, Jain M, Baisoya N, Shanta R. Variants of Concern of SARS-CoV-2: A Review. Indian Journal of Health & Wellbeing. 2022;13(4).
- 12.
WHO. Responding to Community Spread of COVID-19. 2020.
- 13. Nasir A, Aamir UB, Kanji A, Samreen A, Ansar Z, Ghanchi NK. SARS-CoV-2 Variants of Concern (VOC) Alpha, Beta, Gamma, Delta, and Omicron Coincident with Consecutive Pandemic Waves in Pakistan. Cold Spring Harbor Laboratory. 2022.
- 14.
WHO. Director-General’s remarks at the media briefing on 2019-nCoV on 11 February 2020. 2020. http://www.who.int/dg/speeches/detail/who-director-general-s-remarks-at-the-media-briefing-on-2019-ncov-on-11-february-2020
- 15. Umair M, Ikram A, Salman M, Haider SA, Badar N, Rehman Z, et al. Genomic surveillance reveals the detection of SARS-CoV-2 delta, beta, and gamma VOCs during the third wave in Pakistan. J Med Virol. 2022;94(3):1115–29. pmid:34726786
- 16. Plante JA, Liu Y, Liu J, Xia H, Johnson BA, Lokugamage KG, et al. Spike mutation D614G alters SARS-CoV-2 fitness. Nature. 2021;592(7852):116–21. pmid:33106671
- 17. Elbe S, Buckland-Merrett G. Data, disease and diplomacy: GISAID’s innovative contribution to global health. Glob Chall. 2017;1(1):33–46. pmid:31565258
- 18. Lan J, Ge J, Yu J, Shan S, Zhou H, Fan S, et al. Structure of the SARS-CoV-2 spike receptor-binding domain bound to the ACE2 receptor. Nature. 2020;581(7807):215–20. pmid:32225176
- 19. Volz E, Hill V, McCrone JT, Price A, Jorgensen D, O’Toole Á, et al. Evaluating the Effects of SARS-CoV-2 Spike Mutation D614G on Transmissibility and Pathogenicity. Cell. 2021;184(1):64-75.e11. pmid:33275900
- 20. Di Giacomo S, Mercatelli D, Rakhimov A, Giorgi FM. Preliminary report on severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) Spike mutation T478K. J Med Virol. 2021;93(9):5638–43. pmid:33951211
- 21. Sariol CA, Serrano-Collazo C, Ortiz EJ, Pantoja P, Cruz L, Arana T, et al. Limited Impact of Delta Variant’s Mutations on the Effectiveness of Neutralization Conferred by Natural Infection or COVID-19 Vaccines in a Latino Population. Viruses. 2021;13(12):2405. pmid:34960674
- 22. Hussein HAM, Thabet AA, Wardany AA, El-Adly AM, Ali M, Hassan MEA, et al. SARS-CoV-2 outbreak: role of viral proteins and genomic diversity in virus infection and COVID-19 progression. Virol J. 2024;21(1):75. pmid:38539202
- 23. Filippi M, Demoliner M, Gularte JS, de Abreu Goes Pereira VM, da Silva MS, Girardi V, et al. Relative frequency of genomic mutations in SARS-CoV-2 recovered from southern Brazilian cases of COVID-19 through the Gamma, Delta and Omicron waves. Infect Genet Evol. 2024;120:105590. pmid:38574833
- 24. Haseltine WA. An Indian SARS-CoV-2 Variant Lands In California. More Danger Ahead?. Forbes. 2021;12.
- 25. Bertram S, Dijkman R, Habjan M, Heurich A, Gierer S, Glowacka I, et al. TMPRSS2 activates the human coronavirus 229E for cathepsin-independent host cell entry and is expressed in viral target cells in the respiratory epithelium. J Virol. 2013;87(11):6150–60. pmid:23536651
- 26. Abduljaleel Z. Decoding SARS-CoV-2 variants: Mutations, viral stability, and breakthroughs in vaccines and therapies. Biophys Chem. 2025;320–321:107413. pmid:39987705
- 27. Ranjan R, Sharma A, Verma MK. Characterization of the second wave of COVID-19 in India. MedRxiv. 2021.
- 28.
Ram VS, Babu GR, Prabhakaran D. COVID-19 pandemic in India. Oxford University Press. 2020.
- 29. Yang W, Shaman J. COVID-19 pandemic dynamics in India, the SARS-CoV-2 Delta variant and implications for vaccination. J R Soc Interface. 2022;19(191):20210900. pmid:35670221
- 30. Ahmad T, Abdullah M, Mueed A, Sultan F, Khan A, Khan AA. COVID-19 in Pakistan: A national analysis of five pandemic waves. PLoS One. 2023;18(12):e0281326. pmid:38157382
- 31. Shu Y, McCauley J. GISAID: Global initiative on sharing all influenza data - from vision to reality. Euro Surveill. 2017;22(13):30494. pmid:28382917
- 32. Katoh K, Misawa K, Kuma K, Miyata T. MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res. 2002;30(14):3059–66. pmid:12136088
- 33. Katoh K, Rozewicki J, Yamada KD. MAFFT online service: multiple sequence alignment, interactive sequence choice and visualization. Brief Bioinform. 2019;20(4):1160–6. pmid:28968734
- 34. Fiaz N, Zahoor I, Saima S, Basheer A. Genomic landscape of alpha-variant of SARS-CoV-2 circulated in Pakistan. PLoS One. 2022;17(12):e0276171. pmid:36512569
- 35. Kumar S, Stecher G, Li M, Knyaz C, Tamura K. MEGA X: Molecular Evolutionary Genetics Analysis across Computing Platforms. Mol Biol Evol. 2018;35(6):1547–9. pmid:29722887
- 36.
Hall TA. BioEdit: a user-friendly biological sequence alignment editor and analysis program for Windows 95/98/NT. London: Information Retrieval Ltd. 1999.
- 37. Urhan A, Abeel T. Emergence of novel SARS-CoV-2 variants in the Netherlands. Sci Rep. 2021;11(1):6625. pmid:33758205
- 38. Mlcochova P, Kemp SA, Dhar MS, Papa G, Meng B, Ferreira IATM, et al. SARS-CoV-2 B.1.617.2 Delta variant replication and immune evasion. Nature. 2021;599(7883):114–9. pmid:34488225
- 39. Miklasińska-Majdanik M, Morawiec E, Bratosiewicz-Wąsik J, Serwin K, Pudełko A, Czerwiński M, et al. From Delta to Omicron-Genetic Epidemiology of SARS-CoV-2 (hCoV-19) in Southern Poland. Pathogens. 2025;14(7):708. pmid:40732754
- 40. Chekol MT, Teklu DS, Tayachew A, Shura W, Agune A, Hailemariam A, et al. Genomic Characterization of Predominant Delta Variant (B.1.617.2 and AY.120 Sub-Lineages) SARS-CoV-2 Detected from AFI Patients in Ethiopia During 2021-2022. Genes (Basel). 2025;16(11):1366. pmid:41300818
- 41. Gupta S, Gupta D, Bhatnagar S. Analysis of SARS-CoV-2 genome evolutionary patterns. Microbiol Spectr. 2024;12(2):e0265423. pmid:38197644
- 42. Ziesel A, Jabbari H. Structural impact of synonymous mutations in six SARS-CoV-2 variants of concern. Cold Spring Harbor Laboratory. 2024.
- 43. Anwar MZ, Lodhi MS, Khan MT, Khan MI, Sharif S. Coronavirus Genomes and Unique Mutations in Structural and Non-Structural Proteins in Pakistani SARS-CoV-2 Delta Variants during the Fourth Wave of the Pandemic. Genes (Basel). 2022;13(3):552. pmid:35328105
- 44. Suratekar R, Ghosh P, Niesen MJM, Donadio G, Anand P, Soundararajan V. High diversity in Delta variant across countries revealed via genome-wide analysis of SARS-CoV-2 beyond the Spike protein. Cold Spring Harbor Laboratory. 2021.
- 45. Ghosh N, Nandi S, Saha I. Phylogenetic analysis of 17271 Indian SARS-CoV-2 genomes to identify temporal and spatial hotspot mutations. PLoS One. 2022;17(3):e0265579. pmid:35344550
- 46. Muttineni R, Binitha RN, Putty K, Marapakala K, Sandra KP, Panyam J, et al. SARS-CoV-2 variants and spike mutations involved in second wave of COVID-19 pandemic in India. Transbound Emerg Dis. 2022;69(5):e1721–33. pmid:35266305
- 47. Azzeri A, Mohamed NA, Wan Rosli SH, Abdul Samat MN, Rashid ZZ, Mohamad Jamali MA, et al. Unravelling the link between SARS-CoV-2 mutation frequencies, patient comorbidities, and structural dynamics. PLoS One. 2024;19(3):e0291892. pmid:38483913
- 48. Shah MW, Qureshi MI, Nasib Khan I, Zakria M, Ahmad S, Rahim T, et al. SARS-CoV-2 variants and genomic surveillance in the low-resource north-west region of Pakistan. Biotechnology & Biotechnological Equipment. 2024;38(1).
- 49. Bouddahab O, Aqillouch S, Charoute H, Noureddine R, El Hamouchi A, Laazaazia O, et al. Genomic surveillance in Morocco tracks SARS-CoV-2 variant shift from Alpha to Omicron sublineage JN1. Npj Viruses. 2025;3(1):63. pmid:40830238
- 50. Ullah MF, Alnour TM, Ahmed Abakur EH, Elssaig EH, Saeedi NH. In-depth genomic characterisation of SARS-CoV-2 variants from Egypt exhibited heterogeneous lineages. East Mediterr Health J. 2024;30(7):481–91. pmid:39548837
- 51. Lee Y-U, Lee K, Lee H, Park JW, Cho S-J, Park J-S, et al. Genomic Analysis and Tracking of SARS-CoV-2 Variants in Gwangju, South Korea, From 2020 to 2022. Influenza Other Respir Viruses. 2024;18(6):e13350. pmid:38923353
- 52. Jabeen M, Shoukat S, Shireen H, Bao Y, Khan A, Abbasi AA. Unraveling the genetic variations underlying virulence disparities among SARS-CoV-2 strains across global regions: insights from Pakistan. Virol J. 2024;21(1):55. pmid:38449001
- 53. Chaudhari AM, Joshi M, Kumar D, Patel A, Lokhande KB, Krishnan A, et al. Evaluation of immune evasion in SARS-CoV-2 Delta and Omicron variants. Comput Struct Biotechnol J. 2022;20:4501–16. pmid:35965661
- 54. Planas D, Veyer D, Baidaliuk A, Staropoli I, Guivel-Benhassine F, Rajah MM, et al. Reduced sensitivity of SARS-CoV-2 variant Delta to antibody neutralization. Nature. 2021;596(7871):276–80. pmid:34237773
- 55. Salvatore M, Purkayastha S, Ganapathi L, Bhattacharyya R, Kundu R, Zimmermann L, et al. Lessons from SARS-CoV-2 in India: A data-driven framework for pandemic resilience. Sci Adv. 2022;8(24):eabp8621. pmid:35714183
- 56. Issa E, Merhi G, Panossian B, Salloum T, Tokajian S. SARS-CoV-2 and ORF3a: Nonsynonymous Mutations, Functional Domains, and Viral Pathogenesis. mSystems. 2020;5(3):e00266-20. pmid:32371472
- 57. Zhao J, Qiu J, Aryal S, Hackett JL, Wang J. The RNA Architecture of the SARS-CoV-2 3’-Untranslated Region. Viruses. 2020;12(12):1473. pmid:33371200
- 58. Kumar S, Thambiraja TS, Karuppanan K, Subramaniam G. Omicron and Delta Variant of SARS-CoV-2: A Comparative Computational Study of Spike Protein. Cold Spring Harbor Laboratory. 2021.
- 59. Gautret P, Benkouiten S, Al-Tawfiq JA, Memish ZA. The spectrum of respiratory pathogens among returning Hajj pilgrims: myths and reality. Int J Infect Dis. 2016;47:83–5. pmid:26845442
- 60. Tegally H, San JE, Cotten M, Moir M, Tegomoh B, Mboowa G, et al. The evolving SARS-CoV-2 epidemic in Africa: Insights from rapidly expanding genomic surveillance. Science. 2022;378(6615):eabq5358. pmid:36108049
- 61. Ajel M, Jazayeri SM, Behboudi E, Poorebrahim M, Ahangar Oskouee M, Bannazadeh Baghi H, et al. Investigation of the Mutations in the SARS-CoV-2 Envelope Protein and Its Interaction with the PALS1 by Molecular Docking. Rep Biochem Mol Biol. 2024;13(1):124–36. pmid:39582830