Novel and known signals of selection for fat deposition in domestic sheep breeds from Africa and Eurasia

Genomic regions subjected to selection frequently show signatures such as within-population reduced nucleotide diversity and outlier values of differentiation among differentially selected populations. In this study, we analyzed 50K SNP genotype data of 373 animals belonging to 23 sheep breeds of different geographic origins using the Rsb (extended haplotype homozygosity) and FST statistical approaches, to identify loci associated with the fat-tail phenotype. We also checked if these putative selection signatures overlapped with regions of high-homozygosity (ROH). The analyses identified novel signals and confirmed the presence of selection signature in genomic regions that harbor candidate genes known to affect fat deposition. Several genomic regions that frequently appeared in ROH were also identified within each breed, but only two ROH islands overlapped with the putative selection signatures. The results reported herein provide the most complete genome-wide study of selection signatures for fat-tail in African and Eurasian sheep breeds; they also contribute insights into the genetic basis for the fat tail phenotype in sheep, and confirm the great complexity of the mechanisms that underlie quantitative traits, such as the fat-tail.

Introduction Natural selection plays an important role in determining the individuals that are best adapted to novel and existing environmental conditions. Besides natural selection, artificial selection has been widely applied to livestock species to achieve more desirable/profitable phenotypes [1]. For instance, sheep (Ovis aries) have been selected since domestication, approximately 9,000 years ago [2]. This process of selection resulted in divergent sheep breeds, reared in different geographic regions due to their different adaptability. Among these, fat-tail are an important class of sheep breeds and represent about 25% of the world's sheep population [3] mainly distributed in the Middle East, North and East Africa and Central Asia. According to Xu et al. [4] fat tails represent the energy reserve necessary to survive critical conditions such as drought seasons and food shortage. This statement was emphasized by Mwacharo et al. [5] who confirmed that the fat-tails are the predominant sheep across the deserts of northern Africa, and in the highlands, semi-arid and arid environments of eastern and southern Africa while the thin-tails occur in Sudan and in the sub-humid and humid regions of West Africa.
The unique genetic patterns inscribed in the genome of individuals by natural and/or artificial selection are defined as signatures of selection, which are usually regions of the genome that harbor functionally important sequence variants [6]. Although human consumption of animal fat has dramatically reduced in preference of leaner meat, the investigation of the potential candidate genes involved in the fat-tail might contribute to exploring the genetics of fat deposition, energy storage and adaptation to climate changes [7][8][9]. With the aim to identify candidate genes with a potential role in these traits, several authors performed studies targeting the fat-tail phenotype contrasted with the thin-tail one. All authors used, for their comparisons, sheep of the same geographic regions to prevent referring to the fat-tail differentiation signals arising from different origins or isolation by distance. These studies included indigenous Chinese [4,8,10,11], Mediterranean-North African [9,12], Iranian [3] and Ethiopian breeds [13]. Recently, some studies have proposed the combined use of diverse datasets to gain greater power. These are based on the concept of meta-analysis, a tool for aggregating information from multiple independent studies to identify associations with genomic regions not revealed in the individual studies [14]. Therefore, in this study, the genomes of fat-tail sheep from different regions of Africa and Eurasia were contrasted with the genomes of thintail sheep of the same geographical area in a meta-analysis to identify new genomic regions underlying variation of this phenotype. In order to improve the specificity of signal detection, we combined two complimentary approaches (Rsb and F ST ); moreover, as distinguishing false positive genes from candidate genes is not straightforward, selection signatures of fat-tail were considered only when shared by two or more fat-tail breeds of different geographic origin, and substantiated by verifying whether the candidate genes in proximity of the anonymous markers of differentiation have a known, or assumed, role in fat deposition and adipogenesis in mammals.

Samples, genotyping and quality control
A total of 373 animals belonging to 23 sheep breeds from different geographic regions were selected (Table 1). In particular, genotyped animals from Ethiopia (11 breeds) and Arabian peninsula (4 breeds) fat-tail breeds, together with two thin-tail breeds from Sudan were available from a recent study [13]; genotypic data of Barbaresca and Libyan Barbary were provided by a previous study [9], whereas genotypic data for Laticauda and the two Italian thin-tail breeds (Sardinian and Comisana) were available from a genome-wide analysis of genetic diversity for these breeds [15]. Moreover, a total of 39 blood samples of the Sidaoun breed were collected from different flocks in Algeria. These samples were genotyped using the Illumina OvineSNP50 BeadChip array.
Genotype data of all the considered animals, were combined in a single working dataset for the analyses. Chromosomal coordinates for each SNP were obtained from the latest release of the ovine genome sequence assembly Oar_v4.0. The combined dataset was filtered to remove animals with more than 10% missing genotypes, SNPs with a call rate lower than 95% and with a minor allele frequency (MAF) lower than 1%, and to exclude non-autosomal and unassigned markers.

Genetic relationships amongst breeds
Pair-wise genetic relationships were estimated using-mds-plot and-cluster options in PLINK 1.7 [16] and graphically represented by multidimensional scaling (MDS) analysis.

Signatures of selection analysis
To analyze genome-wide selection signatures, the MDS results were used to categorize the 23 breeds into contrasting genetic groups for comparative analysis. Table 1 summarizes the description of the breeds used in the pair-wise comparisons. The contrasting groups were as follow: 1. Ethiopian fat-tail breeds (11 breeds) vs. two thin-tail breeds from Sudan (Hammari and Kabashi); Inter-population analyses of the six fat-vs. thin-tail groups (Table 1) were performed using the Extended Haplotype Homozygosity (EHH)-derived statistic Rsb [17], as in Bahbahani et al. [18][19]. In particular, site-specific EHH (EHHs) for each SNP in each population was calculated by taking the average EHH for the two alleles of the SNP weighted by their squared allele frequencies. Integrated EHHs (iES) value for each SNP in each population was obtained upon integrating the area under curve of EHHs against distance describing the decay of EHHs for each SNP in each population. Rsb values are the natural log ratio of iES values between two populations. To identify statistically significant SNPs under selection in each of the six pairwise comparisons (positive Rsb value), one-sided P-values (fat-vs. thin-tail group) were derived as −log10(1-F(Rsb)), where F(Rsb) represents the Gaussian cumulative distribution function. Inter-population genome-wide F ST and χ 2 analysis were also performed to corroborate the results obtained with the Rsb analysis. The following constraints were introduced to define the fat-tail selection signatures: 1) −log10 (P-value) � 3.3, equivalent to a P-value of 0.0005, was used as a threshold to define significant Rsb; 2) candidate regions were retained if at least two SNPs, separated by �200 Kb, passed this threshold; 3) the candidate region was present in two or more pair-wise comparisons.

Runs of homozygosity
Runs of homozygosity (ROH) were estimated using PLINK 1.7 [16]. The minimum length that constituted the ROH was set to one Mb. The following criteria were also used: (i) one missing SNP was allowed in the ROH, and up to one possible heterozygous genotype; (ii) the minimum number of SNPs that constituted the ROH was set to 30; (iii) the minimum SNP density per ROH was set to one SNP every 100 kb; (iv) maximum gap between consecutive homozygous SNPs of 1000 Kb [20]. This analysis was carried out within the same group or breed of the pair-wise comparisons above reported for the selection signature detection, with a total of six groups/breeds for fat tail and three groups/breeds for thin tail, respectively. The percentage of SNP residing within an ROH for a given breed was estimated by counting the number of times that each SNP appeared in a ROH and by dividing that number by the number of animals in each group/breed, allowing us to obtain a locus homozygosity range (from 0 to 1). To identify the genomic regions of "high homozygosity", also called ROH islands, the top 0.9999 SNPs of the percentile distribution of the locus homozygosity range within each group/ breed were selected.

Gene annotation
Candidate regions identified by different approaches were used to annotate genes, that were either entirely or partially included within each selected region, using the NCBI Genome Data Viewer (https://www.ncbi.nlm.nih.gov/genome/gdv/browser/?context=gene&acc= 101104604). Finally the biological functions of each annotated gene within the selection signatures was investigated via a comprehensive search of literature.

Results
After quality control, 43,224 SNPs and 349 animals (6 to 39 per breed) were retained for the analyses (Table 1). To examine and visualize the genetic relationships among the 23 sheep breeds, we used a MDS plot of the pairwise identity-by-state distance. The results showed that most sheep breeds formed non-overlapping clusters and were clearly separate populations ( Fig  1). The first dimension (C1) separated the Italian breeds from the Arabian Peninsula and African ones, likely reflecting different breeding histories. The second dimension (C2) distinguished Barbaresca from the other breeds. Therefore, with the exception of Barbaresca, the MDS grossly separated the breeds according to their genetic origin and/or to geographical proximity between their breeding areas. The Ethiopian breeds were separated into two groups: one including the three fat-rumped breeds and one short fat-tail (Molale), and one group including the long fat-tail breeds and the other short fat-tail breed (Gafera). Noteworthy, the MDS plot did not separate the breeds on the basis of the different tail phenotypes. Different geographic origin and genetic drift, most likely, with adaptation to different eco-climates, as well as ethnic, cultural and religious practices, that can impede gene flow, may have shaped this genetic sub-structuring [13].
In this study, selection signatures for fat-tail were identified using the Rsb approach, after corroboration with F ST approach. The markers with a significant inter-population Rsb are shown in S1 Table (from sheet "a" to sheet "f"), corresponding to the six comparisons (from 1 to 6) described in the material and methods. Likewise, in S2 Table (from sheet "a" to sheet "f"), genome-wide F ST and χ 2 values, with the corresponding Bonferroni corrected χ 2 P-values, are shown. A summary of the number of significant SNPs obtained with the different statistical methods in the six pair-wise comparisons is reported in Table 2.
Significant signals of differentiation between the fat-tail breed/group and the thin-tail of the corresponding region, that were shared by two or more fat-tail breeds/groups, are reported in S3 Table. In particular, the positions of the significant SNPs, with their corresponding probability values, are reported for each fat-tail breed/group. F ST values and Bonferroni corrected χ 2 P-values are reported only when achieved by significant SNPs located in the candidate region, or at distance � 0.2 Mb up-and down-stream of the candidate region boundaries. A total of 14 candidate genomic regions on ten different chromosomes have been identified in two or more pair-wise comparisons ( Table 3). The majority of shared fat-tail signals were observed for the two groups of Ethiopian breeds on chromosomes (OAR) 5, 6, 10, 18 and 19 and by the two breeds of Barbary sheep origin (Laticauda and Libyan Barbary) on OAR 3, 10, 12, 13. A total of 84 genes were found spanning the detected selection signature regions. Some of these identified candidate genes, that may be associated with fat deposition or related phenotypes, include: ANAPC1 and MSRB3 (OAR3), CXXC5 and PSD2 (OAR5), SLIT2 and EPHA5 (OAR6), CHP1 and OIP5 (OAR7), PCDH9 (OAR10), CDS2 and BMP2 (OAR13) and OAS2 (OAR17) ( Table 1).
Remarkably, all the 14 candidate regions registered the presence of at least one marker attaining highly significant Rsb values, i.e. > 4, equivalent to a P-value of 0.0001. Moreover, in all the regions, except the one on OAR12 that is shared between Laticauda and Libyan Barbary, at least one significant χ2 value was also registered in either one or both of the fat-tail breed/ groups.
Several genomic regions that frequently appeared in a ROH were identified within each breed/group (S1 and S2 Figs, for fat-and thin-tail sheep breeds, respectively). Table 4 provides the chromosome position, and the start and end of the ROH islands. The top 0.9999 SNPs of the percentile distribution of locus homozygosity values led to the use of different thresholds for each breed/group (from 0.11 to 0.83), and a total of 15 genomic regions of high-homozygosity across breeds/groups were identified. Although the distribution of the ROH was relatively balanced and the signals were moderate in height, we found a few outstanding peaks with a high occurrence of ROH, especially in the Barbaresca breed (S1 Fig).

Discussion
In studies aiming to detect genomic signals for specific traits, for instance signals directly associated with fat deposition and adipogenesis, the major drawback is to detect strong differences (i.e. between fat-tail and thin-tail breeds) that are due either to different origins or to reproductive isolation, and not obviously involved in the trait (fat deposition). In this work, the fat-tail breeds from Ethiopia, Algeria, Arabian peninsula and southern Italy were pair-wise compared with their thin-tail counterparts from the closest geographical region. Therefore, the results reported herein provide the most complete genome-wide study of selection signatures for fattail in African and Eurasian sheep breeds. The signals that are detected between any two or more simultaneous pair-wise comparisons might consequently be considered more reliable, also because they are shown by geographically distant sheep breeds. While the fat-tail sheep of Algeria, Arabian peninsula and southern Italy were all long or semi-long fat-tail breeds, the Ethiopian fat-tail breeds included long-tail, short-tail and fatrumped sheep breeds (Table 1). Therefore, for the Ethiopian sheep, two different pair-wise comparisons were performed, the first including all the 11 fat-tail breeds, and the second including only the 6 long fat-tail breeds. The assumption here was that the first comparison might elucidate the genes involved with adipogenesis, irrespective of the shape of the tail, while the second comparison would reveal findings that are more comparable to those observed in the breeds from the other regions, and are therefore possibly more likely to be linked to the fat-tail phenotype.
We used two different, but complementary, statistical approaches to identify putative selection signatures across the phenotypically different breeds. The F ST index of differentiation is among the most widely used statistic to detect signals of selection in differentially-selected populations, where usually a locus is putatively considered under differential selection if its pair-wise F ST has a rank percentile value of 0.01 or less. Because the level of genetic differentiation between each pair of breeds/groups of the six comparisons highly varied, the decision "a priori" of a rank percentile to accept significant F ST may disfavor the pairs presenting the highest genetic diversity. Therefore, following Moioli et al. [12] who showed that F ST and χ2 are highly correlated, in order to confirm with a statistical test which markers were significant, we calculated inter-population locus-specific χ2 values and considered significant the markers reaching a Bonferroni-adjusted χ2 P-values � 0.05. Doing so, the different numbers of significant markers in each pair-wise comparison allows to appreciate their global genetic difference. However, since F ST and χ 2 values are based on allele frequencies and might represent an isolated event occurring by chance, and not necessarily associated with fat-tail signals, in this case, the extended haplotype homozygosity derived statistic, Rsb, was preferentially used. The Rsb statistics has high power towards strong selection sweeps that have undergone fixation [17]. Indeed, the Rsb metrics is able to show the presence of selective sweeps resulting in complete fixation of some alleles in one population relative to the other. This statistic in fact Table 4. Run of homozygosity (ROH) islands identified within each breed/group. The chromosome (OAR), the number of single nucleotide polymorphisms (SNPs) within each ROH island and the positions of the genomic regions (in base pairs, bp) are reported.

Breeds/groups OAR N of SNPs Start (bp) End (bp)
Ethiopian fat-tail ( considers the whole haplotype region around one marker, or group of markers; the larger is the region in which the homozygous haplotype was maintained in the first breed in contrast with the second breed, the more reliable is the probability of carrying a fat-tail signal. The number of significant SNPs for inter-population Rsb is smaller than the number of significant SNPs for χ 2 ( Table 2) confirming that the first method is more stringent. In most cases, the number of SNPs that are significant with one method showed some correlation with the number of SNPs that were obtained with the second method: the higher was the number of the significant Rsb values, the higher was also the number of significant F ST values. The highest number of significant signals obtained with both methods was observed in the Barbaresca breed-1.5% for inter-population Rsb and 5.6% for F ST / χ 2 , while the lowest number was in the Arabian peninsula breeds: 0.15 and 0.25% with the two methods, respectively. In accordance with our results, Yuan et al. [11], in a selection signature analysis for tail type in sheep, identified seven and twenty-six regions using the extended haplotype homozygosity and F ST approaches, respectively, and only six small regions using both approaches. Bahbahani et al. [21] showed that the two common approaches (inter-population Rsb and F ST ), used to identify signatures of positive selection in East African Shorthorn Zebu, did not produce overlapping signals. Although the authors interpreted the observed absence of overlaps between Rsb and F ST analyses as a possible consequence of the selection time-scale, with Rsb being considered more suitable for detecting signatures of recent selection, it may also be hypothesized that false positives may occur when using both Rsb and F ST . As the aim of this study was to identify loci most likely associated with fat-tail, we established that the candidate region(s) should be present in two or more pair-wise comparisons (Table 3). Therefore only the signatures satisfying these criteria and encompassing annotated genes will be subsequently discussed.
Another method for detecting signatures of positive selection based on intra-population analysis is the identification of high-homozygosity regions [6]. Since ROHs are normally abundant in regions under positive selection, their accumulation at specific loci, or islands, has been used to identify genomic regions that reflect directional selection in cattle [22], sheep [20,23], horse [24] and goat [25]. We therefore checked if such regions of high-homozygosity overlapped with putative selection signatures in the sheep breeds considered in this study.
The region in OAR3:104.2-105.9 Mb was identified in two comparisons: fat-tail sheep of the Arabian peninsula vs. Sudanese thin-tail sheep, and Libyan Barbary vs. Algerian Sidaoun. Out of the ten genes found in this region, only ANAPC1 has been associated with obesity-related traits by Comuzzie et al. [26] in a Genome-Wide Association Study (GWAS) on Hispanic children. The signature on OAR3:154.0-155.6 Mb, detected in the Laticauda and the Libyan Barbary breeds, had already been reported for the Barbaresca, Laticauda and Chios breeds [9]. Yuan et al. [11], in a GWAS on seven indigenous Chinese sheep, by contrasting fat-tail versus thin-tail phenotypes, detected a signature in this region encompassing the MSRB3, that has been identified as a candidate gene associated with adaptation [27]. In a study on world sheep breeds, MSRB3 was highlighted to have experienced high selection pressure [28]. Moreover, MSRB3 is located within a genomic region identified as ovine quantitative trait locus (QTL) (QTL#127000) for tail fat deposition [11]. A large region on OAR5:47.0-49.0 Mb turned out here to encode putative fat deposition genes in the two groups of Ethiopian sheep: the one composed only by the long fat-tail, and the one including all the eleven Ethiopian fat-tail breeds. Although Fariello et al. [29] reported that this region encoded a signature differentiating prolific and non-prolific Asian sheep, the involvement of a signature in this region in the fat-tail phenotype had been previously reported [9,12]. This involvement is corroborated by genetic studies of body mass index in humans, describing a role played in obesity by the CXXC5 gene in Americans [30] and the PSD2 gene in the Japanese population [31]. Moreover, this selection signature overlapped with the ROH island identified in the Ethiopian fat-tail breeds, which include 17 and 29 homozygous markers respectively for the two groups of breeds (the eleven Ethiopian fattail, and the six Ethiopian long fat-tail). The signal on OAR6:38.1-39.6 Mb was detected in Barbaresca and Laticauda and was previously reported as selection signature for fat-tail in these two breeds, also when compared with 13 Italian thin-tail breeds [9]. It includes the SLIT2 gene, a potential candidate for internal organ weights in Simmental beef cattle [32] and therefore possibly connected with fat deposition. This signal is worth investigating further, because it encompasses a large region (1.5 Mb) where the Barbaresca showed 27 SNP markers with allele frequency patterns that are highly differentiated from the Italian thin-tail breeds, 15 of which exceeded the significant threshold of Rsb P-value < 0.0001. Moreover, this genomic region partially overlapped with the ROH island on OAR6 detected in Barbaresca and shared by more than 80% of the individuals of this breed. The Laticauda showed less significant signals in the same region, with the exception of a highly significant one (-log10(P-value) = 7.68) at position 38,345,613 bp. Moreover, several markers attained high F ST values and significant χ2 values in the pair-wise comparisons of both breeds. However, because the two breeds have Barbary origin, it is also possible that this signature encodes loci inherited from North-African breeds. The selection signature on OAR6:55.6-55.7 Mb was shown in this study to be present in the two groups of Ethiopian breeds and it has never been reported before as a fat-tail signal. On the same chromosome, at position 76.5-77.5 Mb, we identified another signal in the Ethiopian group (eleven breeds) and the Laticauda, which has been previously described in the Barbaresca but not in the Laticauda [9]. Another candidate region identified on OAR6:80.6-81.0 Mb, shown by the Ethiopian group (eleven breeds) and the Libyan Barbary and including the EPHA5 gene was detected. This gene is involved in feed conversion ratio in Nellore cattle [33]. The signal on OAR7:33.5-33.9 Mb was detected in the Barbaresca and Laticauda, as well as in a previous study [9]. While these authors could not assign an obvious role of any of the genes located in this region, Lirangi et al. [34] suggested that the CHP1 gene is involved in cellular fat storage. Inoue et al. [35] revealed that OIP5 promotes proliferation of pre-and mature-adipocytes and contributes adipose hyperplasia; moreover, an increase of OIP5 may associate with development of obesity. A common candidate region located on OAR10:40.2-45.0 Mb observed in the two groups of Ethiopian sheep, Libyan Barbary and Laticauda, includes the PCDH9 gene. The same signature was detected by Kim et al. [36] who contrasted the Egyptian Barki sheep with British breeds, and referred to it as signal of adaptation to arid environments. However, the Barki is a fat-tail sheep while the British are all thin-tail breeds. We would then propose the idea that this is a signal of fat-deposition, this being corroborated by the presence of PCDH9, reported by Wang et al. [37] as a candidate gene for obesity in humans. The signal on OAR12:43.0-43.3 Mb, shared by the Laticauda and the Libyan Barbary has not been reported previously as a fat-tail signal and the two genes (RERE and SLC45A1) included in the region do not appear to have any evident connection with fat deposition. On OAR13:45.5-48.4 Mb, a selection signature shared by the Laticauda and the Libyan Barbary was identified. The region was also reported as a fat-tail signature in several studies [9][10][11]. The strong linkage disequilibrium between the SNPs in this OAR13 region with a missense mutation in exon 1 of the BMP2 gene (OAR13:48,552,093-48,897,111 bp) was demonstrated by Moioli et al. [12] in the Laticauda fat-tail and Altamurana thin-tail sheep. Yuan et al. [11] emphasized that BMP2 may play important roles in fat tail formation. However, here BMP2 does not appear to be the only candidate gene. This signature spans a size > 3Mb, and Laticauda showed a very high Rsb value (-log10-Pvalue = 7.96) at position 46,582,744 bp. A transcriptome profile analysis of adipose tissues from fat-and short-tailed Chinese sheep identified CDS2 among the differentially expressed genes [38]. This gene which spans 46,560,029 to 46,605,239 bp of OAR13, encompasses the significant marker mentioned above. Animal QTLdb identified a QTL (QTL#127011) for tail fat deposition [11] that overlapped this selection signature. Another candidate region on OAR17:61.0-61.6 Mb, shared by the Ethiopian group (eleven breeds) and the Libyan Barbary has not been detected previously as a fat-tail signal in sheep. However, Fox et al. [39] associated the OAS2 gene found in this region to body fat distribution. Finally, the candidate regions on OAR18:1.8-2.2 Mb, and OAR19:51.6-51.8 Mb, shared by the two groups of Ethiopian breeds have not been reported previously as fat-tail signals and the five genes included in the two regions (ATP10A on OAR18, and CDC25A, MAP4, DHX30, SMARCC1 on OAR19) do not appear to have any obvious connection with fat deposition. It must be emphasized that the six Ethiopian long fat-tail breeds used in the comparison 2 were a subset of the eleven breeds analyzed in the comparison 1. Therefore, signals identified in comparison 2 may reflect those found in comparison 1. Anyhow, other studies also reported putatively selected genes mapped in the same two regions, with no obvious roles in sheep tail type or fat deposition [3,11]. Moreover, the complexity of the fat-tail phenotype [10] may partly justify the high number of signals detected in the pair-wise comparisons.
One limitation of this study may be the different sample sizes used in the contrasting groups, and in particular in the comparison 1 (140 vs 15 animals). However, several studies have been carried out to identify selection signatures contrasting groups based on distinct phenotypes with considerably different numbers of individuals [1,25,40,41]. In our study, several genomic regions potentially under selection in at least two comparisons corroborate with previous studies, highlighting the importance of these regions and also supporting these results, as well as the design of the comparisons used to identify the genomic regions. As reported above, only two ROH islands identified in fat-tail breeds overlapped with the selection signatures. One reason for the little overlap is that ROH might detect selection related to any trait, while contrasting thin and fat tail breeds is more likely to detect signal related to this trait. However, Purfield et al. [23] reported a significant moderate correlation between the occurrence of SNPs in a ROH and two different statistical approaches (F ST and HapFLK) for identifying putative selection signatures, and showed two ROH islands that overlapped with selection signatures. Moreover, the ROH island on OAR5 identified in the Ethiopian breeds was also identified in the Lori Bakhtiari fat-tail breed [3]. The authors reported that this increase in homozygosity would be consistent with selection for mutations affecting fat-tail size several thousand years following domestication. Therefore, although the existence of ROH islands has been attributed to several factors (recombination events, demography) [20,42], our results corroborate the hypothesis that regions of high-homozygosity may indeed harbor targets of positive selection, as also observed in previous studies [22,23,25,43].
In this study, we report so far the most complete genome-wide study of selection signatures for fat-tail in sheep. We identified novel signals and confirmed the presence of selection signatures in the genomic regions that harbor candidate genes that are known to affect fat deposition. These findings also confirm the great complexity of the mechanisms underlying quantitative traits, such as the fat-tail, and further confirm the hypothesis that many different genes are involved in the phenotype. However, it is important to highlight that among the candidate genomic regions, false positives may still be a possibility. Therefore, further studies using different populations and the new ovine high-density SNP chip will be required to confirm and refine our results and investigate the role of specific genes. Notwithstanding, the selection signatures reported here provide comprehensive insights into the genetic basis underlining the fat tail phenotype in sheep.
Supporting information S1 Table. Significant markers for Rsb. a) Ethiopian fat-tail breeds (eleven breeds) contrasted with two thin-tail breeds from Sudan (Hammari and Kabashi); b) Ethiopian long fat-tail breeds (six breeds) contrasted with two thin-tail breeds from Sudan; c) Arabian Peninsula fattail breeds (Naimi, Najdi, Omani and Huri) contrasted with two thin-tail breeds from Sudan; d) Barbaresca fat-tail breed contrasted with two Italian thin-tail breeds (Sardinian and Comisana); e) Laticauda fat-tail breed contrasted with two Italian thin-tail breeds; f) Libyan Barbary fat-tail breed contrasted with the Sidaoun thin-tail breed. (XLS) S2 Table. Significant markers for F ST / χ2. a) Ethiopian fat-tail breeds (eleven breeds) contrasted with two thin-tail breeds from Sudan (Hammari and Kabashi); b) Ethiopian long fattail breeds (six breeds) contrasted with two thin-tail breeds from Sudan; c) Arabian Peninsula fat-tail breeds (Naimi, Najdi, Omani and Huri) contrasted with two thin-tail breeds from Sudan; d) Barbaresca fat-tail breed contrasted with two Italian thin-tail breeds (Sardinian and Comisana); e) Laticauda fat-tail breed contrasted with two Italian thin-tail breeds; f) Libyan Barbary fat-tail breed contrasted with the Sidaoun thin-tail breed. (XLS) S3 Table. Significant signals of differentiation between the fat-tail breed/group and the thin-tail of the corresponding region, that were shared by two or more fat-tail breedsgroups.