Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Genetic diversity, population structure, and combined detection of selection signatures in Iranian versus Afghan Baluchi sheep

  • Sadegh Taheri,

    Roles Formal analysis, Software, Writing – original draft

    Affiliation Department of Animal Science, Faculty of Agriculture, Ferdowsi University of Mashhad, Mashhad, Iran

  • Mohammad Osman Karimi,

    Roles Data curation, Resources

    Affiliation Department of Animal Science, Faculty of Agriculture, Herat University, Herat, Afghanistan

  • Naghmeh Saedi,

    Roles Visualization, Writing – original draft

    Affiliation Center for Quantitative Genetics and Genomics, Aarhus University, Tjele, Denmark

  • Saeed Zerehdaran,

    Roles Data curation, Methodology, Writing – review & editing

    Affiliation Department of Animal Science, Faculty of Agriculture, Ferdowsi University of Mashhad, Mashhad, Iran

  • Mohammad Mahdi Shariati,

    Roles Software, Validation

    Affiliation Department of Animal Science, Faculty of Agriculture, Ferdowsi University of Mashhad, Mashhad, Iran

  • Mohsen Gholizadeh,

    Roles Data curation, Resources

    Affiliation Department of Animal Science, Sari Agriculture and Natural Resources University, Sari, Iran

  • Ali Javadmanesh

    Roles Conceptualization, Project administration, Supervision, Writing – review & editing

    javadmanesh@um.ac.ir

    Affiliation Department of Animal Science, Faculty of Agriculture, Ferdowsi University of Mashhad, Mashhad, Iran

Abstract

Selection to increase the frequency of useful mutations has left marks on animal genomes, genetic diversity, and population structure within populations. The study and investigation of these genomic regions can lead to the identification of genes related to economic traits or competence and adaptability. This study aimed to recognize genetic diversity, population structure, and selection signatures in Iranian (IB) and Afghan (AB) Baluchi sheep populations. In this study, 86 Iranian Baluchi and 15 Afghan Baluchi sheep were genotyped using Illumine Ovine SNP50K Beadchip arrays. Note that the sample size imbalance (IB n = 86 vs. AB n = 15) may reduce statistical power and potentially bias population structure and selection scan results. Additionally, use of the Ovine 50K array may introduce ascertainment bias; analyses were based on 38,193 shared SNPs, potentially missing population-specific variants. Generally, moderate genetic diversity was observed in both the Afghan Baluchi (AB) and Iranian Baluchi (IB) sheep populations, using various assessment methods. However, the IB population showed the lowest level of genetic diversity and the highest rate of linkage disequilibrium decay, despite having a better effective population size in recent generations. The ADMIXTURE analysis indicated that the optimal number of genetic clusters was K = 2, which was determined based on the lowest cross-entropy error of 0.603 observed during cross-validation. At K = 2 and NJ tree analysis, a clear genetic distinction between the AB and IB populations was evident. Additionally, the IB population demonstrated significant genetic uniformity when compared to the AB population in terms of genetic distance. Also, FST and XP-EHH were used to identify selection signatures. Some putative candidate genes for FST, including HDAC9, CSMD3, DAB1, FGF12, and PCDH9 were associated with important economic traits such as body weight, hot carcass weight, muscle weight in carcass, reproductive seasonality, and carcass fat percentage, respectively. Also, XP-EHH putative candidate genes were KCNIP4, FGF11, CNTROB, and ROBO2 in AB population, which were related to body weight, hot carcass weight, milk yield, and muscle weight in carcass. Moreover, XP-EHH putative candidate genes in IB population were GRIK3, NCOA1, and FGD3, that related to muscle weight in carcass, staple length, and milk fat percentage. Selection signals were identified using top 1% FST thresholds and XP-EHH without genome-wide multiple-testing correction; results require experimental validation. We observed very similar outcomes in terms of similar signatures related to economic traits in both FST and XP-EHH methods, indicating the robustness of analysis in this study. It can be concluded that selection has made a major distinction between Afghan and Iranian Baluchi sheep populations for reproduction, milk production, and growth traits. These could be due to the managed breeding programme in Iranian Baluchi sheep. Utilizing validated QTLs as described in this study could be applied to reveal the direction of breeding plans in livestock species.

Introduction

One of the primary grazing animals domesticated was sheep [1], due to its manageable size and the ability to adapt to various climates and poor nutrition diets. Artificial selection created a wide variety of breeds with respect to coat colour, distinct morphology, or particular production traits, among others. Natural selection (such as immunocompetence) has also put selection pressure on sheep. The Baluchi population, a major sheep breed in Iran regarding population size, is well-adapted to harsh climate conditions in central and eastern Iran with low-quality pastures. This breed is native to eastern and central regions of Iran, southwest of Pakistan, and southern Afghanistan. The fleece in Afghan Baluchi (AB) and Iranian Baluchi (IB) populations are coarse and white with black pigmentations at distal body parts (S1 Fig). Baluchi sheep is a fat-tailed breed, and it is very well adapted to a wide range of harsh environments and poor pasture quality in eastern Iran, a subtropical arid climate. Iranian Baluchi body size varies between 35 and 40 kg in adult ewes, milk yield between 40 and 50 kg in 125 days of milking, and annual greasy fleece weight of 1.3–1.8 kg (https://breeds.okstate.edu/sheep/baluchi-sheep.html). This breed has been under selection for important economic traits such as growth, milk production, etc in Abbas Abad Animal Breeding Centre, Mashhad, Iran, for more than four decades [2]. Also, the breeding of sheep in Afghanistan dates back to about 9000 years ago, when these valuable livestock species were domesticated [3]. As it is clear, around 2/3 of the population in Afghanistan is engaged in agriculture, particularly in the field of livestock [4]. Despite the importance of sheep production in Afghanistan, the lack of phenotypic records and accurate pedigree is the main issues that restrict the application of modern animal breeding programs. Afghan Baluchi fleece weight is around 1.3–2 Kg and the average body weight is 34–36 Kg (categorized as a meat-wool type breed). The average milk yield during the four months of a lactation period is about 40 Kg. It produces good-quality wool, suitable for carpet [4]. However, there is no evidence of breeding programs for Baluchi population in Afghanistan. It might be interesting to compare the impact of programmed and traditional selection methods on genomes of both populations, since it is believed that Iranian and Afghan Baluchi sheep populations has a common ancestor [5]. Both natural and artificial selection may have differential markers in specific genomic regions in domesticated animals. Therefore, the elucidation of certain genomic regions that show selection signatures may help to understand the process of biological selection and recognize potential candidate genes of interest [6].

Although the cost of genome sequencing is steadily decreasing, genotyping all individuals in a large population is still too expensive, which impedes the use of this technology. The SNP chip, which is a powerful tool for use in genetic studies, is a less expensive genotyping technology. A commercial 50K SNP chip for sheep genotyping became available in 2009 (http://www.illumina.com), providing a versatile, proficient, and cost-effective tool to detect positive selection signatures in the sheep genome [6,7].

Genetic diversity is a crucial aspect of overall biodiversity, alongside ecosystems and species, and is vital for the evolution of populations and species in the face of environmental and climatic changes. It also enhances the adaptability of animals to their habitats [8,9]. To preserve genetic diversity, it is essential to manage inbreeding and common ancestry, as noted by Gebreselase et al. [10]. However, in recent decades, traditional local breeding practices have declined due to the widespread use of commercial breeds specialized for production and heavy reliance on a limited number of rams, along with modern selection techniques [11]. Despite numerous studies focusing on genetic diversity and selection in sheep, Gorgol et al. point out a lack of genomic-level data on the genetic background and diversity of local sheep populations [12]. The presence of many local populations exhibiting distinct phenotypic differences warrants further research into their genetic differentiation. According to Meermans et al., local sheep breeds play an integral role in cultural heritage, are essential to local economies, possess unique traits adapted to regional climates, and are critical for sustainable and resilient agricultural systems [13].

The fixation index (FST) and cross-population extended haplotype heterozygosity test (XP-EHH) statistics are used to identify selection signatures within individual samples from a given population. XP-EHH was proposed to identify selection signals between populations [14]. The XP-EHH statistic was designed by comparing haplotypes from two populations in order to recognize ongoing or almost fixed selection signatures [14]. Sabeti et al. (2002) reported XP-EHH as an extension of the EHH [15]. FST, initially defined to measure the genetic divergence across populations [16], is an alternative statistic to recognize selection signatures among populations [17].

To classify genome regions under selection, FST has been formerly applied to different sheep populations by several studies [18,19]. Several genomic regions related to body size, reproductive performance, morphological traits, and skeletal morphology have been identified in these studies, which have been targeted via both artificial and natural selection during domestication [20].

Recently, QTL databases have been well developed due to advances in livestock genomics research and high-throughput genomics technologies. They could be linked to the most recent version of reference genomes such as sheep, cow, pig, fish, etc. [21]. These fine-mapped QTLs could provide accurate coordinates of genes and genomic regions related to known traits in livestock species. Therefore, researchers could pinpoint selection signatures to desired traits in livestock species [22].

This study aimed to assess genetic diversity, population structure, and identify regions under selection in each population and different regions between the populations using the Illumina OvineSNP50K Genotyping BeadChip array, and putative candidate genes were associated with these regions in Iranian and Afghan Baluchi populations of sheep.

Materials and methods

Sampling and genotyping

All experimental protocols were approved by the Biomedical Ethics Committee at Ferdowsi University of Mashhad with approval number 2/58110. Genotypes were obtained from previously published studies where all procedures were conducted in strict accordance with relevant guidelines and regulations, including the ARRIVE guidelines (Animal Research: Reporting of In Vivo Experiments). Our re-analysis of these public datasets followed standard bioinformatics practices. In this study, the genotype data of 101 unrelated animals, including 86 Iranian Baluchi and 15 Afghan Baluchi sheep, were used to identify selection signatures. Iranian Baluchi samples were collected from Abbas Abad Breeding Centre, Mashhad, Iran, and Afghan Baluchi samples were collected from different areas of Herat province. The IB flock was specifically selected as the most reliable source of purebred IB genetics, given the lack of documented purity information for Baluchi populations scattered across eastern and southern Iran, where field animals are typically maintained as crossbreeds or grade flocks. While the IB sample (n = 86) adequately represents this managed breeding population, it may not fully capture the broader geographic diversity of Iranian Baluchi sheep. The limited AB sample size reflects substantial logistical challenges in Afghanistan, where no dedicated Baluchi breeding centers exist, making identifying additional purebred animals infeasible. Also, due to extensive crossbreeding among Afghan sheep populations, finding pure Afghan Baluchi was very difficult; therefore, our sample size was limited. Iranian Baluchi genotype data were obtained from a study by Gholizadeh et al. (2015) (Supporting information IB) [23], and Afghan Baluchi data were obtained from Karimi et al. (2016) (Supporting information AB) [24]. Blood samples were taken in EDTA-containing tubes and kept in −20 °C until further use. DNA was extracted using GenElute™ Blood Genomic DNA Kit. Genotyping was performed by Ovine SNP50K Beadchip array (Illumina Oar_v4.0 assembly) in both groups, with all SNP coordinates mapped to this reference genome.

Quality control and population structure

PLINK V1.9 software was used to control the quality of genotype data of IB and AB populations. Animals with genotyping call rate lower than 99% and SNPs with genotyping call rate below 99% and minor allele frequency lower than 5% were removed. SNPs with a large deviation from Hardy-Weinberg equilibrium (P < 106) were removed as well [25]. Also, to control for inbreeding, examined pairs with a very high PI_HAT value (above 0.5) using the IBD test in PLINK version 1.9. In the Iranian Baluchi population, found only one pair that exhibited first-degree consanguinity. Therefore, one individual from this pair was excluded and the impact of this exclusion through a sensitivity analysis was evaluated. The principal component analysis was performed to screen population structure between and within breeds. Principal component analysis (PCA) was performed on SNPs using PLINK V1.9 to identify genetic relationships among breeds [26]. The graphical representation of PCA was depicted using the ggfortify v0.4.19 R package, with 95% confidence ellipses plotted around each population. After conducting principal component analysis, the population structure was determined using the ADMIXTURE v1.3.0 program [27]. Evaluated ancestral populations for K values ranging from 1 to 10, and the optimal number of ancestral populations was determined using the cross-validation criterion. ADMIXTURE v1.3.0 was run with 10-fold cross-validation and the default random seed. Cross-validation was enabled by simply adding the --cv flag to the ADMIXTURE command line. A good value of K will exhibit a low cross-validation error compared to other K values. Additionally, genetic distances were calculated by PLINK V1.9, with --distance 1-IBS flat-missing square to generate a pairwise identity-by-state (IBS) distance matrix. A heatmap was generated using the pheatmap v1.0.13 package in the R v4.5.0 software with Euclidean distance metric and Ward.D2 clustering method [28]. The Neighbor-Joining (NJ) tree was constructed from the IBS distance matrix using R, visualized, and enhanced with the iTOL website [29].

Genetic diversity parameters

To assess genetic diversity within and between two sheep populations, calculated several metrics: nucleotide diversity (π), inbreeding coefficient (FIS), observed heterozygosity (HO), expected heterozygosity (HE), minor allele frequency (MAF), and average pairwise genetic distance (D). The π was determined using VCFtools v0.1.17 with a 100-kb sliding window and a step size of 50 kb across the genome. For the calculations of MAF, HO, HE, and FIS, PLINK v1.9 utilised [26]. Additionally, the average proportion of alleles shared between two individuals within the same breed (DST) using PLINK v1.9, from which the genetic distance between individuals in a sheep population was calculated as D = 1 – DST [30].

Effective population size and linkage disequilibrium decay

The effective population size (Ne) was estimated using SNeP v1.1 [31] through the analysis of linkage disequilibrium (LD) patterns between SNPs. To exclude low-frequency alleles, a MAF threshold of 0.05 was applied [32]. Ne trajectories were obtained using the default LD-based settings of SNeP (mindist = 50 kb, maxdist = 4 Mb, alpha = 1, recombination rate 1 × 10 ⁻ ⁸ per bp, sample-size–corrected r² and unphased genotypes), following Barbato et al. (2015) [31], and 95% confidence intervals were derived from a block-bootstrap over LD bins (weighted by the number of SNP pairs per bin). Linkage disequilibrium (r²) among SNPs was estimated separately within each sheep population using PLINK v1.9 with parameters --ld-window-r2: 0, ld-window: 100, and ld-window-kb: 1000 [26]. Pairwise r² values were grouped into consecutive 10‑kb non-overlapping physical distance bins from 0 to 1 Mb (0–10 kb, 10–20 kb, …, 990–1000 kb). For each distance bin, we calculated the mean r², its standard error, and the corresponding 95% confidence interval across all SNP pairs; the mean r² values were plotted against the midpoint of each distance bin, with shaded 95% confidence ribbons, to visualize LD decay patterns in each population [33].

Detection of selection signatures

In this study, FST and XP-EHH statistical methods were used to identify selection signatures in IB and AB samples and to examine the demographic differences. Two populations were first merged based on common SNPs. FST identified genomic regions selected differentially in IB and AB. The FST statistic was calculated using PLINK v1.9 software, based on the unbiased estimator suggested by Weir and Cockerham (1984). Considering population differences and sampling error, this method has an advantage [34]. After calculating FST value, the average numerical values of the five adjacent SNPs were used as the Win5FST value to increase the chance of identifying selection signatures instead of the numerical value of each SNP [35]. The Win5FST approach indicates overlap windows by 5 adjacent SNPs rather than a fixed BP window, with one SNP moving forward in the sequence. R v.4.5.0 software was used to calculate numerical values for a high percentage of each chromosome, and regions with high values for all adjacent SNPs were considered as selection signatures [36]. XP-EHH is based on linkage disequilibrium and haplotype length. In this method, selection signatures were identified by alleles with high EHH [15]. Furthermore, XP-EHH compares EHH integrals between two populations with the same number of SNPs, taking into account linkage disequilibrium, haplotype length, as well as the frequency and distance between SNPs. The rehh v3.2.2 R package was used to calculate XP-EHH [37]. Before XP-EHH analysis, genomic VCF files for each population were filtered for high-quality SNPs (MAF > 0.01) and confirmed to contain phased haplotypes [14]. No additional imputation or phasing was performed, as the input VCFs were derived from pre-phased sequencing data. For each chromosome, haplotypes were converted to haplohh objects using data2haplohh () from rehh v3.2.2 with: min_maf = 0.01, allele_coding = “01”, polarize_vcf = FALSE, remove_multiple_markers = TRUE [37]. XP-EHH is based on linkage disequilibrium and haplotype length, identifying selection signatures by comparing EHH integrals between populations [15]. EHH scans were performed using scan_hh() (limhaplo = 5, limehh = 0.05, limehhs = 0.05, phased = TRUE) and XP-EHH scores calculated per-chromosome with ies2xpehh(standardize = TRUE) which performs Z-score normalization within each chromosome (mean = 0, SD = 1) [37]. Scans were conducted per chromosome and combined genome-wide. The top 1% outliers in FST and XP-EHH values (|XP-EHH| ≥ 3) were selected following standard empirical thresholds without multiple testing correction due to marker LD and the exploratory nature of selection scans [3840]. Finally, after identifying selection signatures using the FST and XP-EHH methods, related putative candidate genes were extracted with PLINK V1.9, utilizing a gene list generated by the Illumina Company [41].

Gene ontology and QTL report

DAVID (version 6.8) was used to detect important metabolic KEGG pathways and gene ontology analysis. DAVID is a database for visualisation, annotation, and integrated discovery [42]. After applying Bonferroni correction for multiple testing, pathways with p-values less than 0.05 were considered statistically significant. The AnimalQTL database (www.animalgenome.org/cgi-bin/QTLdb/OA/index) was used to identify positions of previously reported sheep QTLs [21]. Then, the identified genes from important FST and XP-EHH regions were matched with those QTLs to find any traits under selection in Baluchi sheep from Iran and Afghanistan.

All analyses were performed on a Linux (Ubuntu 20.04) high-performance computing cluster using R v4.5.0, plink v1.9, and VCFtools v0.1.17. Full command flags and random seed details are provided in the S1 File.

Results

Quality control of data and population structure

After quality control, 44791 SNPs and 42305 SNPs remained in AB and IB, respectively. Quality control levels are shown in S1 Table. Additionally, we utilized shared SNPs between the two populations to estimate genetic diversity, population structure, FST, and XP-EHH (38193 SNPs), thereby obtaining more accurate results. In general, IB and AB populations most likely have originated from a common founder population. However, the PCA result indicated that the IB and AB populations were distinct (Fig 1). The results of the principal component analysis of the IB and AB populations indicate that PC1 and PC2 account for 18.07% and 17.58% of the total variance, respectively.

thumbnail
Fig 1. Population structures analysis.

Results of Principal component analysis of Iranian (blue, n = 86) and Afghan Baluchi (red, n = 15) sheep based on SNPs. Eigenvector x (x-axis is PC1) versus Eigenvector y (y-axis is PC2). PC1 and PC2 account for 18.07% and 17.58% of the total variance, respectively. 95% confidence ellipses shown per population. SNPs, single-nucleotide polymorphisms.

https://doi.org/10.1371/journal.pone.0350262.g001

A heatmap was generated to visualize pairwise genetic distances among 101 individuals from the AB and IB sheep populations. The IB individuals formed a tight genetic cluster characterized by predominantly low pairwise distances (pale blue), indicating notable genetic uniformity within this population. In contrast, the AB population displayed substantially higher genetic distances (deep blue), highlighting the higher genetic distinctness between individuals in this population. The diagonal red band of low distances corresponds to self-comparisons, while the off-diagonal patterns reveal the hierarchical genetic structure differentiating the two populations (Fig 2A). The NJ tree analysis revealed that although both populations had a common ancestor in the relatively distant past, in recent times the genome of AB population had become different from that of IB population (Fig 2B). Also, ADMIXTURE analysis was conducted to investigate the population structure (Fig 2C), evaluating K values from 1 to 10. The optimal number of genetic clusters was determined to be K = 2, based on the lowest cross-entropy error of 0.603 observed during cross-validation (Fig 2D). At K = 2, a clear genetic distinction was evident between the AB and IB populations. Higher K values, such as K = 3 and K = 4, revealed further sub-structuring within the populations, with unique ancestry components identified in several individuals, as shown in S2 Fig.

thumbnail
Fig 2. Genetic relationships and population structure among AB (Afghan Baluchi) and IB (Iranian Baluchi) populations.

(A) Heatmap of pairwise 1-IBS genetic distances between the AB and IB populations. Euclidean distance; Ward.D2 hierarchical clustering. red (low similarity) to blue (high similarity). (B) Neighbour-joining tree based on 1-IBS genetic distances of IB and AB sheep. (C) ADMIXTURE analysis results for K = 2 displaying ancestry proportions for individuals in the AB (individual 1-15) and IB populations (individual 16-101). (D) Cross-entropy error values from the ADMIXTURE analysis suggest that K = 2 is the best number of genetic clusters‌‌.

https://doi.org/10.1371/journal.pone.0350262.g002

Genetic diversity parameters

Genetic diversity parameters, including MAF, nucleotide diversity (π), HO, HE, D, and FIS, were calculated for AB and IB populations (Table 1). The MAF values were nearly identical across both populations, with AB having a mean of 0.291 and IB slightly lower at 0.283. Nucleotide diversity showed minimal variation, ranging from 0.000037 in IB to 0.000039 in AB. HO and HE values were similar between the two populations, with AB showing both metrics at 0.38, while IB had HO at 0.38 and HE at 0.37. The IB population had the lowest HO and HE. The FIS values were low in both populations, with AB at −0.0177 and IB at −0.0194, indicating lower deviation from the Hardy-Weinberg equilibrium. The D values were also slightly lower in the IB population (0.296) than in the AB population (0.315).

thumbnail
Table 1. Different parameters of genetic diversity in AB and IB populations.

https://doi.org/10.1371/journal.pone.0350262.t001

Effective population size and linkage disequilibrium decay

The effective population size (Ne) over the past 1000 generations was estimated using LD-based methods, revealing a steady decline in Ne across both populations, indicative of diminishing genetic diversity over time. Between the two populations, IB showed higher Ne values in recent generations compared to AB. Approximately 400 years ago, Ne was estimated at around 1,700 for AB and 1,537 for IB, with AB initially having the larger Ne. However, this pattern shifted in more recent generations, with IB maintaining the higher Ne, suggesting differing historical population dynamics between the two populations. Shaded 95% confidence intervals around the Ne trajectories, obtained by block bootstrap over LD bins, illustrate the uncertainty associated with these recent Ne estimates (Fig 3A). Linkage disequilibrium (LD) was evaluated for AB and IB populations by calculating pairwise r² values over SNP distances up to 1 Mb. Both populations exhibited a decreasing trend in average LD (r²) with increasing physical distance between SNPs. Shaded 95% confidence ribbons around the mean r² curves illustrate the uncertainty of these estimates across distance bins. Notably, AB displayed the most rapid LD decay, consistently showing lower r² values across all distances, indicative of reduced genetic linkage within this population (Fig 3B); the number of pairwise SNP comparisons contributing to each distance bin is provided in S2 Table.

thumbnail
Fig 3. Effective population size (Ne) and linkage disequilibrium (LD) decay for the AB and IB populations.

(A) Changes in effective population size (Ne) over the last 1000 generations based on genome‑wide LD (r²) binned by physical distance. Shaded areas represent 95% confidence intervals obtained by block bootstrap over LD bins. (B) Decline in LD, shown as r², with increasing distances between SNPs. Shaded areas represent 95% confidence intervals of mean r² based on the standard error within each bin.

https://doi.org/10.1371/journal.pone.0350262.g003

Detection of selection signatures

In this study, the results of FST showed that several regions under selection were located on different chromosomes. A Manhattan plot of FST statistics is shown in Fig 4. In this plot, regions with high win5FST values represent the distinction between IB and AB. The complete information is presented in S3 Table.

thumbnail
Fig 4. Distribution of win5FST values in the genome between Iranian and Afghan Baluchi sheep.

X-axis represents the chromosome and Y-axis represents the distribution of Win5FST. Chromosomes are separated based on color. In this graph, regions with win5FST values above the line (0.18 < Win5FST) are introduced as significant regions, which are spread over 21 different chromosomes.

https://doi.org/10.1371/journal.pone.0350262.g004

XP-EHH was performed to evaluate the genomic pattern of positive selection in the Baluchi populations. In this study, XP-EHH was used as a complement to FST, with more reliable results in case of identification of similar genomic regions. The 38193 SNPs utilised in the current study covered 2869.91 Mb of the sheep genome, with an average distance of 77.13 kb between neighbouring SNPs. The Manhattan plot was drawn to identify selection signatures in IB and AB (Fig 5). According to this plot, 87 significant XP-EHH signals were detected in IB sheep, whereas only 27 were found in AB sheep.

thumbnail
Fig 5. The distribution of XP-EHH values in the genome between Iranian and Afghan Baluchi sheep showed negative values related to IB and positive values related to AB.

Chromosomes are separated based on color. X-axis represents chromosome, and Y-axis represents XP-EHH values. Significant regions in the IB population are markers whose XP-EHH distribution is less than −3, and significant regions in the AB population are markers whose distribution is higher than 3.

https://doi.org/10.1371/journal.pone.0350262.g005

Gene ontology and KEGG pathway analysis

The Gene Ontology analysis showed that significant genes in terms of FST were enriched in female pregnancy and regulation of glutamate receptor activity. KEGG pathway analysis and Gene Ontology of XP-EHH in IB population putative candidate genes were enriched in pathways, including intracellular signal transduction and regulation of microtubule cytoskeleton organization. In AB population, putative candidate genes were enriched in pathways including cancer and MAPK signalling pathway, and neuromuscular junction development (Table 2).

thumbnail
Table 2. KEGG pathway and Gene Ontology analysis of FST and XP-EHH putative candidate genes were related to Iranian and Afghan Baluchi sheep.

https://doi.org/10.1371/journal.pone.0350262.t002

Finding QTLs

Significant genes of FST were associated with some important traits in sheep, including body weight and hot carcass weight. Moreover, XP-EHH putative candidate genes in IB population were related to muscle weight in carcass, staple length, and milk fat percentage. Also, XP-EHH putative candidate genes in AB population were related to body weight, hot carcass weight, and milk yield traits, which are important in sheep (Table 3). The complete list of significant genes related to these traits has been presented in S4S6 Tables (S4 Table).

thumbnail
Table 3. Significant genes of FST and XP-EHH were associated with some QTL in sheep.

https://doi.org/10.1371/journal.pone.0350262.t003

Discussion

The study of genetic diversity between and within animal populations offers valuable insights into the structure and relationships of populations. This knowledge is crucial for the conservation of these populations, enabling them to withstand future environmental challenges and respond effectively to long-term selection, whether natural or artificial, for traits that are economically and culturally significant [10]. Indigenous breeds hold significant cultural value in every country, but their lower productivity often leads to their replacement by commercial breeds. To prevent losses in productivity and to enhance the economics of farming, urgent measures are needed to ensure the survival and protection of these indigenous breeds. Consequently, the genetic structure of indigenous sheep may be impacted by demographic events, such as gene flow between different breeds and subsequent genetic mixing among them [43]. Additionally, identifying genes associated with important economic traits is vital for developing effective breeding programs for both valuable animal species and genetic resources. The main objective of this study was to characterize the genetic diversity, population structure, and detect signals of selection in the indigenous AB and IB sheep populations using 50K beadchip genotyping data.

In this study, the Iranian Baluchi sheep population was sampled using 86 heads from a high-purity flock at a breeding station, a flock considered the most reliable source for pure genetic representation of the breed due to the lack of documented information on the purity of scattered populations in eastern and southern Iran. In contrast, only 15 Afghan Baluchi sheep were sampled from traditional and scattered flocks in various areas of Herat Province, reflecting the serious challenge of identifying and collecting more high-purity animals in this country. This imbalance in sample size, especially for the Afghan population, can lead to increased variance in allele frequency (AB) estimates, reduced statistical power in ADMIXTURE ancestry proportions, FST pairwise differentiation, XP-EHH selection scans, and a higher risk of false negatives in detecting population differentiation and selection signals. Therefore, the findings related to AB are more exploratory and should be interpreted with caution. To mitigate these limitations, we applied empirical genome-wide thresholds (top 1% outliers) rather than stringent p-value cutoffs and cross-validated results across complementary methods (FST + XP-EHH + LD decay). Despite these limitations, the combination of using a purebred management herd in Iran and selected field samples from Afghanistan provides a valuable picture of genetic differences and similarities between these two management and geographical contexts and can serve as a basis for designing larger studies in the future.

Analysis of population structure and genetic differentiation, including PCA, Admixture, and NJ tree, revealed a relatively high level of genetic differentiation between the AB and IB sheep populations, despite their common ancestry. This significant genetic divergence may be attributed to differing selection goals, adaptations to their respective environments, and distinct breeding practices for these two populations. A study conducted in 2020 by Eydivand et al. examined 14 native sheep breeds from the Middle East and South Asia. The findings revealed that while Iranian and Afghan Baluchi sheep are more similar to each other than to other breeds and descended from a common lineage, they are classified into two distinct populations [5]. Also, in a study conducted by Barani et al. (2023) that compared three Iranian sheep breeds, the results indicated that Baluchi sheep are genetically distinct from the other breeds. This distinction is largely due to the breeding practices at the Breeding Center and the isolation of this population [44]. A study conducted on three Afghan sheep breeds revealed that the Baluchi breed is genetically close to the other breeds but is classified in its own separate group. This distinction can be attributed to the traditional breeding practices associated with the Baluchi breed and the traditional interbreeding within this breed population [24].

Genetic diversity can be examined using various parameters in studies. In this study, genetic diversity between the two populations, AB and IB, revealed that various parameters, such as HO, HE, MAF, π, D, FIS, and LD, indicate that the genetic diversity in the IB population is quite similar to, but slightly lower than, that of the AB population. But in terms of effective population size (Ne), the IB population has fared better in recent generations compared to the AB population. The observed pattern of lower genetic diversity but higher recent Ne in IB compared to AB can be explained by three interacting factors. First, the IB samples from Abbas Abad Breeding Centre represent a managed flock with systematic mating strategies designed to maintain genetic diversity and avoid inbreeding, artificially inflating recent Ne estimates despite potentially lower standing variation compared to more diverse field populations. Second, LD-based Ne is highly sensitive to sample size, with smaller samples producing downward-biased recent Ne estimates due to increased variance in allele frequency sampling and stochastic LD inflation [45,46]. Third, AB samples from traditional Herat flocks likely experienced more severe historical bottlenecks and higher recent relatedness, reducing effective Ne, while the station-managed IB flock benefits from a larger census size and controlled mating. The study by Barani estimated the levels of HO, HE, and MAF in Iranian Baluchi sheep to be 0.37, 0.38, and 0.28, which aligns with our results [44]. In another study, the average HO statistic was estimated to be between 0.34 and 0.39 for Iranian sheep, including Afshari, Moghani, Qezel, Zel, and Lori-Bakhtiari, and between 0.37 and 0.38 for Afghan sheep, including Baluchi, Gedik, and Arab breeds [5]. In a study conducted by Shi on two Tibetan sheep breeds, the results indicated the following values: for the OuLa breed, HO was 0.226, HE was 0.277, MAF was 0.223, and π was 0.0027. For the PanOu breed, the values were 0.221 for HO, 0.267 for HE, 0.228 for MAF, and 0.0026 for π [8]. In a study of Chinese breed sheep conducted by Cheng, HE ranged from 0.226–0.316, while HO varied from 0.238–0.240 [47]. A study by Demissie et al. (2025) on various sheep breeds revealed that the average values of genetic diversity parameters, including HO, HE, FIS, and MAF, were 0.352, 0.344, −0.023, and 0.261, respectively [48]. In a study conducted on nine goat breeds, the results showed that the HO was 0.374 ± 0.021, HE was 0.0369 ± 0.023, and D ranged from 0.263 to 0.332 [49]. In a study examining genetic diversity among three goat breeds, the average values for the parameters MAF, HO, HE, π, and FIS were found to be 0.32, 0.304, 0.306, 0.28, and 0.009, respectively [30]. The results of the linkage disequilibrium (LD) study indicated that the LD rate decreases as the distance between markers increases, which aligns with findings from previous studies conducted on different sheep breeds [10,30,50]. Additionally, the average LD rate was higher in the IB population compared to the AB population. A higher LD rate is associated with lower genetic diversity [33]. The results regarding effective population size (NE) indicate a significant decrease in this parameter in new generations compared to earlier ones [46,51]. The primary factors contributing to this decline include selection pressure, climate change, artificial selection, flawed mating programs, and the use of a limited number of superior rams. Specifically, in the Iranian Baluchi, the effective population size has dropped from 2,840 over 1000 previous generations to just 119 over the last 13 generations. Similarly, in the Afghan Baluchi sheep, the effective population size has decreased from 2,890 over 680 generations to only 81 in the past 13 generations. The NE results are consistent with findings from previous studies conducted on different racial groups [9,52,53].

The identified selection signatures represent candidate regions based on empirical top 1% (FST) and |XP-EHH| ≥ 3 thresholds, which are heuristic cutoffs commonly used in livestock [5456] genomic scans but likely include false positives, particularly given the limited AB sample size. Functional validation through targeted sequencing or association studies will be required to confirm these preliminary signals. The selection signature aims to identify areas of genome differentiation between populations based on SNP information. This method is more effective when working on related populations [18]. Therefore, we analysed two populations of Iranian and Afghan Baluchi sheep that have common ancestors [5]. In this study, selection signatures were studied using FST and XP-EHH statistics. The results showed that several regions of the genomes of these two populations were selected, whose genes were associated with important economic traits. The selection signature was more intense in Iranian Baluchi sheep because they were from an animal breeding center. Therefore, it could be concluded that breeding programs caused significant differences in both studied populations at the genome level.

Regions under selection found by FST statistics showed that HDAC9, CSMD3, DAB1, FGF12, and PCDH9 genes were enriched in body weight, hot carcass weight, muscle weight in carcass, reproductive seasonality, and carcass fat percentage traits, respectively. The HDAC9 gene was involved in the muscle structure development pathway in the study of Cheng et al. (2020), which was performed on Chaka sheep using whole genome sequencing [47]. Moreover, this gene is related to muscle in Garut sheep breed from Indonesia, which is used in ram fights [57]. Additionally, HDAC9 inhibited muscle differentiation transcriptional circuitry through its negative-feedback loop by suppressing MEF2 activity [58]. Several studies performed on livestock showed that HDAC9 gene is associated with skeletal muscle development [59], carcass, and meat traits [60]. In a study of 14 indigenous sheep breeds from the Middle East and South Asia, the results showed that HDAC9 gene is related to economic traits and milk traits [5]. In another study on South African Merino and Afrino sheep populations, HDAC9 gene was associated with reproductive traits in the Merino population [61]. Also, HDAC9 was reported as a selection signature in a study on worldwide sheep populations [57].

In identification of some traits in Assaf and Churra dairy sheep breeds found that the CSMD3 was related to ILCY (individual laboratory cheese yield) trait in the Churra breed [62]. Also, CSMD3, which encodes a transmembrane protein [63], was linked to body size and stature in cattle similar to our finding in sheep [64]. In the current study, FGF12 was identified as a putative candidate gene relating to reproductive traits. It has been reported in other studies related to the same reproductive traits in cattle and goat [65]. Functional annotation of differentially expressed mRNAs in hair follicle tissue showed that FGF12 was enriched in MAPK, PI3K-Akt, and RAS signalling pathways, which had a certain influence on hair follicle growth and development [66] and reproductive traits [65]. Gene ontology enrichment analysis in cashmere goats revealed that FGF12 gene was enriched in several biological pathways that were involved in hair follicle development [67]. Protocadherins are thought to be involved in different aspects of neuronal functions and development. PCDH9 is involved in synaptic cell adhesion [68]. In a study by Mastrangelo et al. (2019), it was shown that the PCDH9 gene was associated with the signal of fat deposition pathway in domestic sheep breeds from Africa and Eurasia [69]. In research on local adaptation of Mediterranean sheep and goats, selection signatures involving the PCDH9 gene, was identified in both species and therefore, could play a significant adaptive role [70]. In a study by Bakhtiarizadeh, PCDH9 in Zel sheep breed was found to be associated with fat deposition. The PCDH9 with nine SNPs was located within a sheep QTL region for carcass fat percentage. These findings indicated the role of this gene as an important putative candidate for development of fat-tail in sheep [71]. Our results showed that CTNNA3 gene is associated with staple length. This gene was significantly related to body weight, height, length, and chest circumference, which can be used as an important marker in improving growth traits in sheep breeding [72]. Multiple studies pointed out that CTNNA3 gene is important for the formation of a stable complex with the other catenins and cadherins, playing a role in solid cell–cell adhesion [73]. In biological pathways involving adherens junctions and CAMs, CTNNA3 gene was closely related to cell adhesion mechanisms [74]. In a study by Chen et al. (2021), results showed that CTNNA3 gene was related to reproduction and production traits [75].

Our QTL results based on positive XP-EHH, which is related to the Afghan Baluchi sheep, showed that ROBO2 gene was related to lean meat yield percentage, muscle weight in carcass, and carcass fat percentage and other growth-related traits. Our results were highly consistent with the results of Montiel et al. (2020) study performed on Pelibuey Sheep by Genome-Wide Association Study; Also, ROBO2 gene was related to litter size [76]. During the development of the central nervous system (CNS), this gene is crucial for axon guidance across the midline [77]. In this study, the relationship between this gene and axon guidance was identified, too. Moreover, ROBO2 gene is crucial during the early stages of follicle development in sheep, as well as during ovary development as a factor determining follicle maturation [78], resulting in an altered expression pattern that may be affected by additional factors in the ovary and steroid hormones in other reproductive tissues [79]. In a study by Mohammadi et al. ROBO2 was found to be associated with milk fat and protein yield [80]. Also, according to previous studies, ROBO2 was associated with fat metabolism, particularly in the fatty acid profile [81]. According to our results, KCNIP4 gene was associated with body weight, muscle weight in carcass, hot carcass weight, and fat weight in carcass traits in AB population. KCNIP4 was related to economic traits in genomic scans for selective sweeps in sheep breeds from South Asia and Middle East. This gene plays a critical role in heart health, as well as skeletal muscle development, body weight, and immune response [5]. A genome-wide association study of Baluchi sheep showed that KCNIP4 is also directly involved in muscle growth and fat storage in addition to indirectly modulating temperature [82]. Furthermore, KCNIP4 was identified as a gene that was related to body weight and hot carcass weight traits in a genome-wide association study in Zandi sheep, which is in agreement with the results of the current study [83]. WWOX gene was also reported as a putative candidate gene associated with bone weight in carcass and total bone traits in AB population in this study. According to Mohammadi et al. (2020), the WWOX gene was associated with the regulation of postnatal growth, skeletal muscle differentiation, and bone growth in sheep [83]. The FGF11 gene was also reported as a putative candidate gene that was associated with body weight, hot carcass weight, and milk yield traits in this study. This gene was related to body size in Chinese Merino [84], and it was reported as a selection signature in Swiss sheep breeds [85]. Moreover, in some sheep genetic resistance to gastrointestinal helminth infection results show that FGF11 gene being related to organ development [86]. Another identified gene is the CHL1 gene, which is related to axon guidance. A study conducted on Tibetan sheep; results showed that CHL1 gene is related to gas transport. Also, CHL1 is associated with homeostatic adaptation during hypoxia, in comparison to wild-type individuals; CHL1 augmented ventilatory responses were also recorded in Tibetan breed of sheep [87] and Tibetans [88] at high altitudes.

Results of QTL based on negative XP-EHH relating to the Iranian Baluchi sheep showed that FGD3 gene is related to the Milk fat percentage trait. Moreover, FGD3 gene, related to metabolism and growth, was found under selection in the Welsh sheep breed [89]. In this study, NCOA1 is another significant gene relating to the staple length trait. To our knowledge, NCOA1 was related to litter size in Hu and Small-tailed Han sheep and Icelandic sheep [90]. The other significant gene, GRIK3 gene, was identified in the IB population, which is associated with lean meat yield percentage and muscle weight in carcass traits in this study. This gene encodes a glutamate inotropic receptor which plays a role in multiple biological pathways and can be found in diverse animal species, including cattle, pigs, chickens, and horses. Research has shown that GRIK3 plays a role in nervous system processes, membrane potential regulation, synaptic transmissions, and also has an essential role in the glutamate receptor signalling pathway. Moreover, GRIK3 was related to reproduction and fertility traits in a genome-wide association study in South African sheep breed [91].

It should be noted that the genes highlighted here are putative candidates, and that our interpretations are largely based on positional overlap with previously reported QTL and association signals. Because many livestock QTL are broad and sometimes pleiotropic, QTL overlap alone cannot prove causality, and some of the reported functional links may not be directly relevant to the specific environmental and management context of Baluchi sheep.

Conclusion

This study analysed the population structure, genetic diversity, and selection signatures between Iranian and Afghan Baluchi sheep populations. Although the Baluchi sheep population of Iran and Afghanistan shared a common ancestor, the population structure results indicated that the two populations are genetically distinct from one another. This difference may be attributed to varying selection goals, geographical factors, and environmental adaptations, and/or genetic drift due to demographic history. Overall, moderate genetic diversity was observed in both the Afghan Baluchi (AB) and Iranian Baluchi (IB) sheep populations. However, the IB population exhibited the lowest level of genetic diversity and the highest rate of linkage disequilibrium decay, while showing a better condition in terms of effective population size. Putative candidate genes identified in putative selection regions are associated with reproduction, milk production, and growth traits, consistent with previous studies. However, some signals may reflect genetic drift or demographic history rather than adaptive selection, particularly given the small Afghan sample size (n = 15). These gene-trait associations should be interpreted cautiously and framed as hypotheses requiring functional validation. According to the results of this research, it can be stated that the traditional selections (phenotypic) and modern breeding programs for economically important traits such as growth, reproduction, immune system, etc., might be in the same direction. These results should be interpreted as reflecting differences between a managed breeding population and a traditional field population rather than national level divergence. Further investigation with a larger sample size is required to compare the performance of these populations to assess the impact of identified selection signatures on economic traits. Utilizing validated QTLs as described in this study could be applied to reveal the direction of breeding plans in other livestock species such as cattle, goats, horses, chickens, and sheep.

Supporting information

S1 Fig. The Baluchi sheep, right: Iranian, left: Afghan.

https://doi.org/10.1371/journal.pone.0350262.s001

(PDF)

S1 Table. Description of quality control steps in Iranian and Afghan Baluchi sheep.

https://doi.org/10.1371/journal.pone.0350262.s002

(PDF)

S1 File. Full command flags and random seed details are provided.

https://doi.org/10.1371/journal.pone.0350262.s003

(PDF)

S2 Fig. ADMIXTURE analysis results for K = 3 and K = 4, for individuals in the AB and IB populations.

https://doi.org/10.1371/journal.pone.0350262.s004

(PDF)

S2 Table. Number of pairwise SNP comparisons contributing to each distance bin.

https://doi.org/10.1371/journal.pone.0350262.s005

(PDF)

S3 Table. Significant selection signatures via WIN5FST Method.

https://doi.org/10.1371/journal.pone.0350262.s006

(PDF)

S4 Table. Significant genes of FST were associated with some QTL in sheep.

https://doi.org/10.1371/journal.pone.0350262.s007

(PDF)

S5 Table. Significant genes of positive XP-EHH were associated with some QTL in sheep‌‌.

https://doi.org/10.1371/journal.pone.0350262.s008

(PDF)

S6 Table. Significant genes of negative XP-EHH were associated with some QTL in sheep.

https://doi.org/10.1371/journal.pone.0350262.s009

(PDF)

Acknowledgments

Authors would like to thank Professor Nicolò P. P. Macciota and Dr. Giustino Gaspa, Sassari University, Italy, for their assistance.

References

  1. 1. Zeder MA. Domestication and early agriculture in the Mediterranean Basin: Origins, diffusion, and impact. Proc Natl Acad Sci USA. 2008;105(33):11597–604. pmid:18697943
  2. 2. Yazdi MH, Johansson K, Gates P, Näsholm A, Jorjani H, Liljedahl LE. Bayesian analysis of birth weight and litter size in Baluchi sheep using Gibbs sampling. J Anim Sci. 1999;77(3):533–40. pmid:10229348
  3. 3. Rashiq MH. Building carbohydrates from a livestock nutrition perspective. Kabul Univ J. 1995;1:15–24.
  4. 4. Karimi MO. Investigation of genetic diversity and genomic selection signature in some Afghani sheep breeds. Ph.D. Thesis, The Ferdowsi University of Mashhad Faculty of Agriculture Department of Animal science. 2016. Available from: https://elmnet.ir/doc/10863769-2012
  5. 5. Eydivandi S, Roudbar MA, Karimi MO, Sahana G. Genomic scans for selective sweeps through haplotype homozygosity and allelic fixation in 14 indigenous sheep breeds from Middle East and South Asia. Sci Rep. 2021;11(1):2834. pmid:33531649
  6. 6. Zhao FP, Wei CH, Zhang L, Liu JS, Wang GK, Tao ZE. A genome scan of recent positive selection signatures in three sheep populations. J Integr Agric. 2016;15(1):162–74.
  7. 7. Gershoni M, Shirak A, Raz R, Seroussi E. Comparing BeadChip and WGS Genotyping: Non-Technical Failed Calling Is Attributable to Additional Variation within the Probe Target Sequence. Genes (Basel). 2022;13(3):485. pmid:35328039
  8. 8. Javadmanesh A, Ghovvati Rodsari S, Soltani M, Nassiry M. Genetic diversity of urial population in Northeast of Iran. IJAB. 2022;18(2):185–93.
  9. 9. Wanjala G, Astuti PK, Bagi Z, Kichamu N, Strausz P, Kusza S. Assessing the Genomics Structure of Dorper and White Dorper Variants, and Dorper Populations in South Africa and Hungary. Biology (Basel). 2023;12(3):386. pmid:36979078
  10. 10. Gebreselase HB, Nigussie H, Wang C, Luo C. Genetic Diversity, Population Structure and Selection Signature in Begait Goats Revealed by Whole-Genome Sequencing. Animals (Basel). 2024;14(2):307. pmid:38254476
  11. 11. Machová K, Marina H, Arranz JJ, Pelayo R, Rychtářová J, Milerski M, et al. Genetic diversity of two native sheep breeds by genome-wide analysis of single nucleotide polymorphisms. Animal. 2023;17(1):100690. pmid:36566708
  12. 12. Gurgul A, Jasielczuk I, Miksza-Cybulska A, Kawęcka A, Szmatoła T, Krupiński J. Evaluation of genetic differentiation and genome-wide selection signatures in Polish local sheep breeds. Livest Sci. 2021;251:104635.
  13. 13. Meyermans R, Gorssen W, Aerts N, Hooyberghs K, Chakkingal Bhaskaran B, Chapard L, et al. Genomic characterisation and diversity assessment of eight endangered Belgian sheep breeds. Animal. 2024;18(10):101315. pmid:39276394
  14. 14. Sabeti PC, Varilly P, Fry B, Lohmueller J, Hostetter E, Cotsapas C, et al. Genome-wide detection and characterization of positive selection in human populations. Nature. 2007;449(7164):913–8. pmid:17943131
  15. 15. Sabeti PC, Reich DE, Higgins JM, Levine HZP, Richter DJ, Schaffner SF, et al. Detecting recent positive selection in the human genome from haplotype structure. Nature. 2002;419(6909):832–7. pmid:12397357
  16. 16. Wright S. The genetical structure of populations. Ann Eugen. 1951;15(4):323–54. pmid:24540312
  17. 17. Akey JM, Zhang G, Zhang K, Jin L, Shriver MD. Interrogating a high-density SNP map for signatures of natural selection. Genome Res. 2002;12(12):1805–14. pmid:12466284
  18. 18. Kijas JW, Lenstra JA, Hayes B, Boitard S, Porto Neto LR, San Cristobal M, et al. Genome-wide analysis of the world’s sheep breeds reveals high levels of historic mixture and strong recent selection. PLoS Biol. 2012;10(2):e1001258. pmid:22346734
  19. 19. Kizilaslan M, Arzik Y, Behrem S, White SN, Cinar MU. Comparative genomic characterization of indigenous fat‐tailed Akkaraman sheep with local and transboundary sheep breeds. Food and Energy Security. 2023;13(1).
  20. 20. Pritchard JK, Pickrell JK, Coop G. The genetics of human adaptation: hard sweeps, soft sweeps, and polygenic adaptation. Curr Biol. 2010;20(4):R208-15. pmid:20178769
  21. 21. Hu Z-L, Park CA, Reecy JM. Bringing the Animal QTLdb and CorrDB into the future: meeting new challenges and providing updated services. Nucleic Acids Res. 2022;50(D1):D956–61. pmid:34850103
  22. 22. Taheri S, Zerehdaran S, Javadmanesh A. Investigation of critical genes and quantitative trait loci related to economic traits in broiler chicken genome using protein-protein interaction network. Poult Sci J. 2025;13(1):29–38.
  23. 23. Gholizadeh M, Rahimi-Mianji G, Nejati-Javaremi A. Genomewide association study of body weight traits in Baluchi sheep. J Genet. 2015;94(1):143–6. pmid:25846889
  24. 24. Karimi MO, Shariati MM, Zerehdaran S, Moradi MH, Javadmanesh A. Study of genetic diversity of sheep breeds in Afghanistan. Biosci Biotechnol Res Asia. 2016;13(1).
  25. 25. Taheri S, Saedi N, Zerehdaran S, Javadmanesh A. Identification of selection signatures in Capra hircus and Capra aegagrus in Iran. Anim Sci J. 2023;94(1):e13864. pmid:37560768
  26. 26. Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MAR, Bender D, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007;81(3):559–75. pmid:17701901
  27. 27. Alexander DH, Novembre J, Lange K. Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 2009;19(9):1655–64. pmid:19648217
  28. 28. Kolde R, Kolde MR. Package ‘pheatmap’. R package. 2015;1(7):790.
  29. 29. Letunic I, Bork P. Interactive Tree Of Life (iTOL) v5: an online tool for phylogenetic tree display and annotation. Nucleic Acids Res. 2021;49(W1):W293–6. pmid:33885785
  30. 30. Bilginer U, Demir E, Argun Karsli B, Doğru H, Kaya S, Meydan H, et al. Unveiling genomic diversity and population structure in three Anatolian goats at genome-wide level. Small Ruminant Research. 2025;251:107570.
  31. 31. Barbato M, Orozco-terWengel P, Tapio M, Bruford MW. SNeP: a tool to estimate trends in recent effective population size trajectories using genome-wide SNP data. Front Genet. 2015;6:109. pmid:25852748
  32. 32. Sved JA, McRae AF, Visscher PM. Divergence between human populations estimated from linkage disequilibrium. Am J Hum Genet. 2008;83(6):737–43. pmid:19012875
  33. 33. Taheri S, Zerehdaran S, Javadmanesh A. Genetic diversity in some domestic and wild sheep and goats in Iran. Small Ruminant Research. 2022;209:106641.
  34. 34. Weir BS, Cockerham CC. Estimating f-statistics for the analysis of population structure. Evolution. 1984;38(6):1358–70. pmid:28563791
  35. 35. Ebrahimi F, Gholizadeh M, Sahebalam H. Genome-wide study for signatures of selection identifies genomic regions and candidate genes associated with milk traits in sheep. Mamm Genome. 2025;36(1):140–50. pmid:39904907
  36. 36. Bonhomme M, Chevalet C, Servin B, Boitard S, Abdallah J, Blott S, et al. Detecting selection in population trees: the Lewontin and Krakauer test extended. Genetics. 2010;186(1):241–62. pmid:20855576
  37. 37. Gautier M, Vitalis R. rehh: an R package to detect footprints of selection in genome-wide SNP data from haplotype structure. Bioinformatics. 2012;28(8):1176–7. pmid:22402612
  38. 38. Vitti JJ, Grossman SR, Sabeti PC. Detecting natural selection in genomic data. Annu Rev Genet. 2013;47:97–120. pmid:24274750
  39. 39. Lei Z, Sun W, Guo T, Li J, Zhu S, Lu Z, et al. Genome-Wide Selective Signatures Reveal Candidate Genes Associated with Hair Follicle Development and Wool Shedding in Sheep. Genes (Basel). 2021;12(12):1924. pmid:34946875
  40. 40. Zhang W, Jin M, Li T, Lu Z, Wang H, Yuan Z, et al. Whole-Genome Resequencing Reveals Selection Signal Related to Sheep Wool Fineness. Animals (Basel). 2023;13(18):2944. pmid:37760343
  41. 41. Taheri S, Javadmanesh A, Zerehdaran S. Identification of selective sweep and associated QTL traits in Iranian Ovis aries and Ovis orientalis populations. Front Genet. 2024;15:1414717. pmid:39748948
  42. 42. Huang DW, Sherman BT, Tan Q, Collins JR, Alvord WG, Roayaei J, et al. The DAVID Gene Functional Classification Tool: a novel biological module-centric algorithm to functionally analyze large gene lists. Genome Biol. 2007;8(9):R183. pmid:17784955
  43. 43. Odjakova T, Todorov P, Kalaydzhiev G, Salkova D, Dundarova H, Radoslavov G, et al. A study on the genetic diversity and subpopulation structure of three Bulgarian mountainous sheep breeds, based on genotyping of microsatellite markers. Small Ruminant Research. 2023;226:107034.
  44. 44. Barani S, Nejati-Javaremi A, Moradi MH, Moradi-Sharbabak M, Gholizadeh M, Esfandyari H. Genome-wide study of linkage disequilibrium, population structure, and inbreeding in Iranian indigenous sheep breeds. PLoS One. 2023;18(6):e0286463. pmid:37267244
  45. 45. Wang J. Estimating current effective sizes of large populations from a single sample of genomic marker data: A comparison of estimators by simulations. Popul Ecol. 2025;67(2):96–108.
  46. 46. Waples RS, Do C. Linkage disequilibrium estimates of contemporary N e using highly variable genetic markers: a largely untapped resource for applied conservation and evolution. Evol Appl. 2010;3(3):244–62. pmid:25567922
  47. 47. Cheng J, Zhao H, Chen N, Cao X, Hanif Q, Pi L, et al. Population structure, genetic diversity, and selective signature of Chaka sheep revealed by whole genome sequencing. BMC Genomics. 2020;21(1):520. pmid:32727368
  48. 48. Demissie BE, Tarekegn GM, Dadi H, Edea Z, Woldesemayat AA, Kim KS. Genome-wide analysis of population structure and genetic diversity in two Ethiopian native sheep populations. Reprod Breed. 2025;5(3):110–8.
  49. 49. Brito LF, Kijas JW, Ventura RV, Sargolzaei M, Porto-Neto LR, Cánovas A, et al. Genetic diversity and signatures of selection in various goat breeds revealed by genome-wide SNP markers. BMC Genomics. 2017;18(1):229. pmid:28288562
  50. 50. Yang C, Wang J, Bi L, Fang D, Xiang X, Khamili A, et al. Genetic Structure and Selection Signals for Extreme Environment Adaptation in Lop Sheep of Xinjiang. Biology (Basel). 2025;14(4):337. pmid:40282202
  51. 51. Yousefi Z, Moradi MH, Beige-Nasiri MT, Shirali M, Abdollahi-Arpanahi R. Genomic insights into runs of homozygosity, effective population size and selection signatures in Iranian meat and dairy sheep breeds. PLoS One. 2025;20(6):e0323328. pmid:40498690
  52. 52. Benedetti del Rio E, Mancin E, Orsi M, Mantovani R, Sturaro E. Genomic characterisation of local sheep breeds of the Eastern Alps. Ital J Anim Sci. 2025;24(1):1528–41.
  53. 53. Argun Karsli B, Demir E, Bilginer U, Balcioğlu MS, Karsli T. Re-visiting genetic background of some native Turkish sheep populations: bottleneck and migration. Trop Anim Health Prod. 2025;57(6):273. pmid:40524105
  54. 54. Demissie BE, Tarekegn GM, Dadi H, Edea Z, Woldesemayat AA, Kim KS, et al. Genomic signatures of positive selection and local adaptation in Ethiopian sheep populations. Trop Anim Sci J. 2025;48(3):179–88.
  55. 55. Abied A, Bagadi A, Bordbar F, Pu Y, Augustino SMA, Xue X, et al. Genomic Diversity, Population Structure, and Signature of Selection in Five Chinese Native Sheep Breeds Adapted to Extreme Environments. Genes (Basel). 2020;11(5):494. pmid:32365888
  56. 56. Zhang C-L, Liu C, Zhang J, Zheng L, Chang Q, Cui Z, et al. Analysis on the desert adaptability of indigenous sheep in the southern edge of Taklimakan Desert. Sci Rep. 2022;12(1):12264. pmid:35851076
  57. 57. Fariello M-I, Servin B, Tosser-Klopp G, Rupp R, Moreno C, International Sheep Genomics Consortium, et al. Selection signatures in worldwide sheep populations. PLoS One. 2014;9(8):e103813. pmid:25126940
  58. 58. Haberland M, Arnold MA, McAnally J, Phan D, Kim Y, Olson EN. Regulation of HDAC9 gene expression by MEF2 establishes a negative-feedback loop in the transcriptional circuitry of muscle differentiation. Mol Cell Biol. 2007;27(2):518–25. pmid:17101791
  59. 59. Mei C, Wang H, Liao Q, Khan R, Raza SHA, Zhao C, et al. Genome-wide analysis reveals the effects of artificial selection on production and meat quality traits in Qinchuan cattle. Genomics. 2019;111(6):1201–8. pmid:30300672
  60. 60. Hagen IJ, Zadissa A, McEwan JC, Veenvliet BA, Hickey SM, Cullen NG, et al. Molecular and bioinformatic strategies for gene discovery for meat traits: a reverse genetics approach. Aust J Exp Agric. 2005;45(8):801–7.
  61. 61. Snyman M, Süllwald S, Olivier W, Visser C. Identification of genes targeted in South African Merino and Afrino sheep populations under long-term selection for reproduction and body weight. Res Sq. 2020.
  62. 62. Marina H, Pelayo R, Suárez-Vega A, Gutiérrez-Gil B, Esteban-Blanco C, Arranz JJ. Genome-wide association studies (GWAS) and post-GWAS analyses for technological traits in Assaf and Churra dairy breeds. J Dairy Sci. 2021;104(11):11850–66. pmid:34454756
  63. 63. Shimizu A, Asakawa S, Sasaki T, Yamazaki S, Yamagata H, Kudoh J, et al. A novel giant gene CSMD3 encoding a protein with CUB and sushi multiple domains: a candidate gene for benign adult familial myoclonic epilepsy on human chromosome 8q23.3-q24.1. Biochem Biophys Res Commun. 2003;309(1):143–54. pmid:12943675
  64. 64. Ghoreishifar SM, Eriksson S, Johansson AM, Khansefid M, Moghaddaszadeh-Ahrabi S, Parna N, et al. Signatures of selection reveal candidate genes involved in economic traits and cold acclimation in five Swedish cattle breeds. Genet Sel Evol. 2020;52(1):52. pmid:32887549
  65. 65. An X, Ma H, Han P, Zhu C, Cao B, Bai Y. Genome-wide differences in DNA methylation changes in caprine ovaries between oestrous and dioestrous phases. J Anim Sci Biotechnol. 2018;9:85. pmid:30524725
  66. 66. Lv X, Chen W, Sun W, Hussain Z, Wang S, Wang J. Analysis of lncRNAs Expression Profiles in Hair Follicle of Hu Sheep Lambskin. Animals (Basel). 2020;10(6):1035. pmid:32549352
  67. 67. Wang FH, Zhang L, Gong G, Yan XC, Zhang LT, Zhang FT, et al. Genome-wide association study of fleece traits in Inner Mongolia Cashmere goats. Anim Genet. 2021;52(3):375–9. pmid:33778967
  68. 68. Hayashi S, Takeichi M. Emerging roles of protocadherins: from self-avoidance to enhancement of motility. J Cell Sci. 2015;128(8):1455–64. pmid:25749861
  69. 69. Mastrangelo S, Bahbahani H, Moioli B, Ahbara A, Al Abri M, Almathen F, et al. Novel and known signals of selection for fat deposition in domestic sheep breeds from Africa and Eurasia. PLoS One. 2019;14(6):e0209632. pmid:31199810
  70. 70. Serranito B, Cavalazzi M, Vidal P, Taurisson-Mouret D, Ciani E, Bal M, et al. Local adaptations of Mediterranean sheep and goats through an integrative approach. Sci Rep. 2021;11(1):21363. pmid:34725398
  71. 71. Bakhtiarizadeh MR, Alamouti AA. RNA-Seq based genetic variant discovery provides new insights into controlling fat deposition in the tail of sheep. Sci Rep. 2020;10(1):13525. pmid:32782325
  72. 72. Zhao L, Li F, Yuan L, Zhang X, Zhang D, Li X, et al. Expression of ovine CTNNA3 and CAP2 genes and their association with growth traits. Gene. 2022;807:145949. pmid:34481004
  73. 73. Janssens B, Goossens S, Staes K, Gilbert B, van Hengel J, Colpaert C, et al. alphaT-catenin: a novel tissue-specific beta-catenin-binding protein mediating strong cell-cell adhesion. J Cell Sci. 2001;114(Pt 17):3177–88. pmid:11590244
  74. 74. Li X, Wu Q, Zhang X, Li C, Zhang D, Li G, et al. Whole-Genome Resequencing to Study Brucellosis Susceptibility in Sheep. Front Genet. 2021;12:653927. pmid:34306007
  75. 75. Chen Z-H, Xu Y-X, Xie X-L, Wang D-F, Aguilar-Gómez D, Liu G-J, et al. Whole-genome sequence analysis unveils different origins of European and Asiatic mouflon and domestication-related genes in sheep. Commun Biol. 2021;4(1):1307. pmid:34795381
  76. 76. Hernández-Montiel W, Martínez-Núñez MA, Ramón-Ugalde JP, Román-Ponce SI, Calderón-Chagoya R, Zamora-Bustillos R. Genome-Wide Association Study Reveals Candidate Genes for Litter Size Traits in Pelibuey Sheep. Animals (Basel). 2020;10(3):434. pmid:32143402
  77. 77. Sundaresan V, Mambetisaeva E, Andrews W, Annan A, Knöll B, Tear G, et al. Dynamic expression patterns of Robo (Robo1 and Robo2) in the developing murine central nervous system. J Comp Neurol. 2004;468(4):467–81. pmid:14689480
  78. 78. Dickinson RE, Duncan WC. The SLIT-ROBO pathway: a regulator of cell function with implications for the reproductive system. Reproduction. 2010;139(4):697–704. pmid:20100881
  79. 79. Dickinson RE, Hryhorskyj L, Tremewan H, Hogg K, Thomson AA, McNeilly AS, et al. Involvement of the SLIT/ROBO pathway in follicle development in the fetal ovary. Reproduction. 2010;139(2):395–407. pmid:19900988
  80. 80. Mohammadi H, Farahani AHK, Moradi MH, Mastrangelo S, Di Gerlando R, Sardina MT, et al. Weighted Single-Step Genome-Wide Association Study Uncovers Known and Novel Candidate Genomic Regions for Milk Production Traits and Somatic Cell Score in Valle del Belice Dairy Sheep. Animals (Basel). 2022;12(9):1155. pmid:35565582
  81. 81. Gao G, Gao N, Li S, Kuang W, Zhu L, Jiang W, et al. Genome-Wide Association Study of Meat Quality Traits in a Three-Way Crossbred Commercial Pig Population. Front Genet. 2021;12:614087. pmid:33815461
  82. 82. Pasandideh M, Rahimi-Mianji G, Gholizadeh M. A genome scan for quantitative trait loci affecting average daily gain and Kleiber ratio in Baluchi Sheep. J Genet. 2018;97(2):493–503. pmid:29932070
  83. 83. Mohammadi H, Rafat SA, Moradi Shahrbabak H, Shodja J, Moradi MH. Genome-wide association study and gene ontology for growth and wool characteristics in Zandi sheep. J Livestock Sci Technologies. 2020;8(2):45–55.
  84. 84. He S, Di J, Han B, Chen L, Liu M, Li W. Genome-Wide Scan for Runs of Homozygosity Identifies Candidate Genes Related to Economically Important Traits in Chinese Merino. Animals (Basel). 2020;10(3):524. pmid:32245132
  85. 85. Signer-Hasler H, Burren A, Ammann P, Drögemüller C, Flury C. Runs of homozygosity and signatures of selection: a comparison among eight local Swiss sheep breeds. Anim Genet. 2019;50(5):512–25. pmid:31365135
  86. 86. Hassan SU, Chua EG, Paz EA, Kaur P, Tay CY, Greeff JC, et al. Investigating the development of diarrhoea through gene expression analysis in sheep genetically resistant to gastrointestinal helminth infection. Sci Rep. 2022;12(1):2207. pmid:35140270
  87. 87. Wang G, He Y, Luo Y. Expression of OPA1 and Mic60 genes and their association with mitochondrial cristae morphology in Tibetan sheep. Cell Tissue Res. 2019;376(2):273–9. pmid:30612186
  88. 88. Bigham AW, Lee FS. Human high-altitude adaptation: forward genetics meets the HIF pathway. Genes Dev. 2014;28(20):2189–204. pmid:25319824
  89. 89. Zhao P, Zhao F, Hu J, Wang J, Liu X, Zhao Z, et al. Physiology and Transcriptomics Analysis Reveal the Contribution of Lungs on High-Altitude Hypoxia Adaptation in Tibetan Sheep. Front Physiol. 2022;13:885444. pmid:35634140
  90. 90. Yuan Z, Zhang J, Li W, Wang W, Li F, Yue X. Association of Polymorphisms in Candidate Genes with the Litter Size in Two Sheep Breeds. Animals (Basel). 2019;9(11):958. pmid:31726757
  91. 91. Süllwald S. A genome-wide association study of body weight and reproduction traits in two South African sheep breeds. M.Sc. Thesis, The University of Pretoria. 2020. Available from: https://www.proquest.com/openview/b78600f07e1c483f99bf0f110ba21f2d/1?pq-origsite=gscholar&cbl=2026366&diss=y