The persistent horns are an important trait of speciation for the family Bovidae with complex morphogenesis taking place briefly after birth. The polledness is highly favourable in modern cattle breeding systems but serious animal welfare issues urge for a solution in the production of hornless cattle other than dehorning. Although the dominant inhibition of horn morphogenesis was discovered more than 70 years ago, and the causative mutation was mapped almost 20 years ago, its molecular nature remained unknown. Here, we report allelic heterogeneity of the POLLED locus. First, we mapped the POLLED locus to a ∼381-kb interval in a multi-breed case-control design. Targeted re-sequencing of an enlarged candidate interval (547 kb) in 16 sires with known POLLED genotype did not detect a common allele associated with polled status. In eight sires of Alpine and Scottish origin (four polled versus four horned), we identified a single candidate mutation, a complex 202 bp insertion-deletion event that showed perfect association to the polled phenotype in various European cattle breeds, except Holstein-Friesian. The analysis of the same candidate interval in eight Holsteins identified five candidate variants which segregate as a 260 kb haplotype also perfectly associated with the POLLED gene without recombination or interference with the 202 bp insertion-deletion. We further identified bulls which are progeny tested as homozygous polled but bearing both, 202 bp insertion-deletion and Friesian haplotype. The distribution of genotypes of the two putative POLLED alleles in large semi-random sample (1,261 animals) supports the hypothesis of two independent mutations.
Citation: Medugorac I, Seichter D, Graf A, Russ I, Blum H, Göpel KH, et al. (2012) Bovine Polledness – An Autosomal Dominant Trait with Allelic Heterogeneity. PLoS ONE 7(6): e39477. doi:10.1371/journal.pone.0039477
Editor: Shuhong Zhao, Huazhong Agricultural University, China
Received: February 21, 2012; Accepted: May 25, 2012; Published: June 21, 2012
Copyright: © 2012 Medugorac et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This work was funded by the German Research Foundation (DFG; Germany, ME 3404/2-1) and a grant from the German agriculture network PHÄNOMICS (funded by Federal Ministry of Education and Research funding reference number 0315536F) as well as by substantial budget funds from the Chair of Animal Genetics and Husbandry of the Ludwig-Maximilians-University Munich (TierALL, LMU Munich, Germany), the Laboratory for Functional Genome Analysis (LAFUGA, Gene Center, LMU Munich, Germany) and the Tierzuchtforschung e.V. München (TZF Grub, Germany). The authors IM, SR, MF, SK, AG, HB, DS and IR are employed by TierALL, LAFUGA and TZF which supported this study by substantial budget funds but only the authors and not the funders had a role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have read the journal’s policy and declare the following conflicts: Karl Heinrich Göpel, one of the co-authors, is employed by Göpel Genetik GmbH, a commercial company that kindly provided a significant amount of samples and phenotypic information. However no funding was provided by Göpel Genetik GmbH or other cattle breeding companies, or cattle breeders. In addition, neither cattle breeding companies nor cattle breeders were involved in the study design, analysis and interpretation of data, writing of the paper, or decision to submit for publication. This does not alter the authors’ adherence to all the PLoS ONE policies on sharing data and materials.
Since the beginning of animal husbandry, humans have tended to accumulate particular variations in domesticated animals. Thereby, the variants of practical interest for agriculture were selected as well as phenotypes that visibly distinguished particular animals. This practice was more frequent in pets (e.g. cats and dogs) but led to the fixation of some obvious breed characteristics in cattle, too . The absence of horns (polled phenotype) as well as horn shape diversity represents such evident traits in cattle.
The persistent horn of the Bovidae consists of a pneumatised osseous core, which is fused with the frontal bone and covered by a cornified epithelium that grows outward from the skin at the base of the horn, thereby forming the cavernous visible horn. The development of horns is dependent on a number of different tissues and their interaction , . Before the domestication of cattle, horns were important for the survival of the wild species. Even after domestication, horns were a desired trait (e.g. fixation and use as draught animals) in most cattle breeding areas until recently. Exceptions to that rule are found in some regions (e.g. Scotland and the Nordic countries) where polledness was a desired trait much earlier probably due to dense housing of animals during the long winter. Nowadays, commercial dairy or beef herds are mainly confined to barns or fenced-in enclosures such as pastures or corrals. Under these conditions horns are not only of little value but can lead to considerable economic loss due to a higher risk of injuries and the possible consequences (infection, carcass deterioration etc.). Therefore in modern cattle husbandry removing horns at an early age has become an accepted management practice. However, all used methods are debatable not least because of animal welfare implications . Hence, breeding polled cattle may constitute a non-invasive option to replace the common practice by means of genetic selection.
In cattle, as in other Bovidae, horn development and morphology are characterised by a substantial degree of polymorphism. Even if only European cattle breeds (Bos taurus) are considered, there are completely hornless cattle breeds like Angus, and breeds with very short horns like Buša as well as breeds with very long and lyra-shaped horns like the Pannonian Podolian cattle . The polled phenotype has been considered to be caused by a dominant inhibitor  that is characterised by the complete absence of corneous appendices . In addition to diverse forms of cattle horns, abnormal types of horns called scurs frequently occur, i.e. incompletely developed horns that are not fused to the frontal bone . Scurs can be considered as the intermediate phenotype between polled and horned because the horn core is not an outgrowth of the skull but originates from a separate ossification centre in the tissues located above the periosteum with subsequent fusion to the skull , .
While the inheritance pattern and mapping position of the POLLED locus on Bos Taurus Autosome 1 (BTA1) , ,  is beyond controversy, the scurs locus displays heterogeneity ’ ’ . To avoid complications due to possible interference between POLLED locus and type 1  or type 2  scurs syndrome we exclusively focussed on polled and horned animals of 31 cattle breeds of European origin. This material exploited the full capacity of the population structure of European cattle and in combination with high-density SNP genotyping and high-throughput sequencing, was used to identify the genetic variation that causes the polled phenotype in Bos taurus.
The POLLED Locus Maps to Chromosome 1 in Divergent European Cattle Breeds
We collected DNA samples from twelve polled cattle breeds (Table S1). The applied mapping approach and major part (62%) of the samples has been presented in our previous study  and was now completed by additional 61 polled animals as well as by three additional polled breeds: Norwegian Red, Fjall cattle and Braunvieh. Case (homozygous polled, PP) and control (homozygous horned, pp) group each consisted of equal numbers of animals (162) from 18 breeds in total. As some breeds are consolidated either for the polled or horned phenotype uneven phenotype distribution was inevitable within some breeds. An improved case-control design accounting for this uneven distribution did not refine the mapping (Fig. 1). We mapped the POLLED locus in the same 381 kb interval (1.668 Mb - 2.049 Mb; UMD3.1 genome build) we already found in our previous study . The genome-wide significance (P≤0.0002,) was determined by 50,000 random permutations of markers along chromosomes. The applied mapping procedure  performs searches for shared segments of homozygosity without the need of an identity by state and thus revealed two haplotypes. The most common haplotype (AGACAAGGA) was found in all but one case animals. Two copies of an alternative haplotype, with a variant allele at the sixth SNP (AGACAGGGA) were found in a homozygous polled Braunvieh bull (Fig. S1) that was one of the animals from the current addition to the mapping design. Genotyping of three related bulls revealed the same alternative haplotype associated with the polled phenotype in Braunvieh (Fig. S1). As the origin of polledness in Braunvieh is unknown and recombination within the homozygosity block of the available Braunvieh pedigrees could not be detected, the Braunvieh data did not refine the mapping.
(a) Genome-wide association mapping of the POLLED mutation to the proximal end of bovine chromosome 1 (BTA1) with 162 affected animals and 162 controls (asshom method, Charlier et al. 2008). Nominal asshom statistic is presented chromosome by chromosome for all 29 bovine autosomes. Evidence for association (y axis) is measured as nominal asshom statistic with genome-wide significance (P = 0.0002, grey line) at 3.315, being determined by 50,000 nominal asshom statistics along chromosomes with randomly permutated markers. (b) Association mapping details for the region from 0 to 4 Mb on BTA1. Nine markers homozygous in all 162 cases covered a region from 1,760,113 to 1,983,902. Two informative SNP markers flanking this core (i.e. variable sites in the case group) are ARS-BFGL-NGS-39992 (1,668,494 bp) and ARS-BFGL-NGS-29653 (2,049,400 bp). (c) Candidate region of 547 kb (1.543–2.090 Mb (UMD3.1 genome build)) chosen for high-throughput sequencing and most likely encompassing the POLLED mutation. This interval is larger than homozygosity bracket detected by multi-breed design and nested five genes (IL10RB, IFNAR2, OLIG1, C1H21orf62 and GCFC1), one pseudo gene (OLIG2-like) and some not further annotated ESTs (e.g. BC122836, EV693397 and DT837875).
In order to confirm and possibly improve the mapping results we embedded 89 heterozygous polled animals (POLLED carriers; Pp) and 62 horned relatives into the analysis. These 151 animals were genotyped with the same Illumina BovineSNP50 assay  and have not been used in the case-control design. These 89 Pp bulls comprise ten breeds and are ancestors or descendents of PP bulls included in the case group, except for carriers from the Witrug breed (WTG) (Table S1). The haplotypes were manually traced through the pedigree charts. The result clearly confirmed the mapping result (Fig. S1, S2, S3 and S4), but the absence of any observable recombination within the candidate region hindered further improvement of fine-mapping by linkage analysis.
Sequencing Eight Polled and Eight Horned Sires did not Detected a Common Allele Associated with Polled Status
We selected seven PP, one Pp and eight pp bulls and performed high-throughput sequencing of a 547 kb interval (1.543–2.090 Mb (UMD3.1 genome build)) that nested the most likely location for the POLLED mutation . The polled group was represented by four Holstein-Friesian (HF), two Galloway (GLW), one German Angus (DAN) and one Fleckvieh (FV) bull. One HF bull was declared as Pp while remaining seven bulls of polled group were progeny tested as PP. The horned group was represented by four HF, two FV, one Gelbvieh (FGV) and one Murnau-Werdenfelser (MWF) bull.
Paired-end sequencing of the libraries enriched for a 547 kb region on BTA1 resulted in average sequence depth of non-repetitive sequences of ∼40× per individual, with 98.9% of the non repeat-masked target region being covered. From these data, we identified 451 putative DNA sequence variants (DSVs), or an average nucleotide diversity of ∼0.15%. Further analysis of the sequencing data (on a locally installed instance of galaxy ) revealed almost complete absence of sequence variability in a region of 125 kb (1.697 to 1.822 Mb) between all 16 sequenced animals. Under the assumption that the POLLED locus is biallelic with dominant inheritance ,  and because the reference sequence (RefSeq) is based on a horned Hereford cow , the causative variant has to be homozygous in the seven PP sires, heterozygous in one Pp sire and not present in RefSeq or in the eight pp sires. Applying this filter to the 451 DSVs did not detect a common allele associated with polled status. Also a visual inspection in a genome browser did not yield DSV concordant with the POLLED genotypes. These results suggested at least two plausible explanations: (i) the targeted 547 kb interval that was covered to 98.9% for the non repeat region harbors the functional mutation(s) in the remaining 1.1% (ii) some of DSVs are causal but there is allelic heterogeneity at the POLLED locus and different alleles have been selected in different geographic regions or breeds.
Dividing Sequenced Animals into Two Groups Suggests Allelic Heterogeneity of the POLLED Phenotype
Eight of 16 sequenced sires are Holsteins and in the case of the allelic heterogeneity these could most probably harbour the same POLLED mutation. The average sequence depth of non-repetitive sequences in eight HF bulls was ∼29× per individual, covering 98.5% of the non repeat-masked target region. From these data, we identified 312 putative DSVs, or an average nucleotide diversity of ∼0.11% (Fig. 2). Applying the same consecutive filtering as above reduced the number of potential causal variants to seven. On the proximal end of the region homozygous in polled HF (1.648–2.027 Mb; Fig. 2) we detected a complex InDel event, replacing 7 bp (cgcatca; RefSeq: 1,649,163–1,649,169) by 12 bp (ttctcagaatag) and thus resulting in a 5 bp longer sequence in polled HF animals (allele P5ID). On the distal end of the region of homozygosity in polled HF we detected a duplication of an 80,128 bp (1,909,352–1,989,480 bp) sequence (Fig. 2). This large sequence is seamlessly duplicated in the same direction and only differs from the RefSeq by one T→A transversion at the third position after the beginning of the duplicated sequence (PT1909354A) and by a two-base pair (TG) deletion at the 45th position in the duplicated sequence. This 2 bp deletion corresponds to position 1,909,396 in the original sequence and will be denoted as P1909396D2. Both variants close to the junction between the original and the duplicated sequence, PT1909354A and P1909396D2, allow for a PCR design for the detection of this 80 kb InDel (allele P80kbID). In addition to P5ID and P80kbID, both flanking the homozygosity region in polled HF, we detected five point mutations at the positions 1,654,405 (G→A), 1,655,463 (C→T), 1,671,849 (T→G), 1,680,646 (T→C) and 1,768,587 (C→A), respectively. In these five candidate SNPs (PG1654405A, PC1655463T, PT1671849G, PT1680646C, PC1768587A) mutant alleles determine the sequence in the polled HF animals. These seven candidate mutations do not include any known coding sequence, or splice site, or intronic region, or any known regulatory elements (Fig. 2). Furthermore, none of these variants were previously reported in dbSNP (http://www.ncbi.nlm.nih.gov/projects/SNP/).
(a) Positions of DNA sequence variants (DSV) detected in the eight HF animals are presented by bars for SNPs and inverted triangles for InDels, with red symbols for heterozygous and blue symbols for homozygous differences from the reference sequence. Homozygous polled bulls are PP-HF1, PP-HF2 and PP-HF3. Pp-HF4 is declared as heterozygous polled and pp-HF5, pp-HF6, pp-HF7 and pp-HF8 are horned. Annotated genes and ESTs lying in the re-sequenced region are shown in the green shaded region. The five candidate mutations for polledness are superimposed: three SNPs were outlined by red crosses, 5 bp InDel and the duplicated region P80kbID were highlighted by inverted triangle. The red triangle area above P80kbID marks the duplicated sequence. (b) Within-species conservation. The nucleotide diversity in the re-sequenced region is shown by a density plot of bovine variants from dbSNP per kb. (c) Copy number variations. The ratio of mapped sequence reads between polled and horned animals is plotted with blue dots. The red line represents the result of segmentation analysis, showing the average ratio in the determined bins. (d) Across-species conservation. For each candidate mutation the surrounding base conservation for species without horn, bovid species and all aligned species was determined as PhastCons score calculated from multiz alignments. As an example, here we display the base conservation for the candidate mutation PC1768587A. (e) The underlying multi-species alignment for the across-species conservation calculation for the candidate region is shown in plot d. The candidate mutation PC1768587A is highlighted within a red frame and stars in red and black indicate identity among bovid and all animals, respectively.
The remaining eight non HF sires (four PP and four pp) belong to spatial and genetic quite differentiated breeds: Angus, Galloway, Fleckvieh, Gelbvieh and Murnau-Werdenfelser. Nevertheless, both PP and pp sires were homozygous for the common haplotype that is always homozygous in PP animals . Considering these eight sires separately paired-end sequencing identified 248 putative DSVs, or an average nucleotide diversity of ∼0.08%. Applying the same filter to the 248 DSVs yielded only one complex insertion-deletion (InDel) event as the putative causative variant (Fig. 3), which was confirmed by visual inspection in a genome browser to be the only DSV concordant with the pp and PP genotypes. The sequence of 212 bp (1,705,834–1,706,045 bp) is duplicated and replaces a sequence of 10 bp (1,706,051–1,706,060 bp). This InDel (P202ID) is between the genes IFNAR2 and OLIG1 (Fig. 3). The InDel P202ID again does not disrupt any known coding sequence or a splice site, or an intronic region, or any known regulatory regions. All candidate DSVs detected in HF subset were homozygous for RefSeq allele in all eight non HF sires, both in PP as well as in pp.
(a) SNP genotypes. The genomic position of DNA sequence variants (DSV) called by high-throughput sequencing of the target region are displayed as bars (SNPs) or triangles (InDels), in blue (homozygote) and red (heterozygote). Homozygous polled bulls are PP-DAN1, PP-GLW1, PP-GLW2 and PP-DFV1 and horned bulls are pp-FGV, pp-DFV2, pp-DFV3 and pp-MWF1. The candidate duplication P202ID is highlighted by an inverted triangle. The annotated genes and ESTs in this region are displayed in the green area. (b) Within-species conservation. The bovine variant density from dbSNP is shown in the re-sequenced region per kb. (c) Multi-species alignment. The genomic multi-species alignment around the candidate duplication P202ID was built with multiz. Sequence identities among bovid animals are displayed with red stars whereas black stars denote the identity among all animals. The duplicated region is highlighted by the red shaded area and replaces the blue shaded nucleotides in polled animals.
Target Genotyping of Candidate DSVs in Polled Animals, Random Samples and Diversity Panel Supports Allelic Heterogeneity of the POLLED Phenotype
In order to further investigate the presumed allelic heterogeneity of POLLED, we chose sperm samples of polled bulls of all of the most important cattle breeds with European origin. This comprised 89 Pp bulls of ten breeds (Table S1). Moreover, we complemented the sample of the homozygous case group by additional PP bulls of Holstein-Friesian (HF, including Red-Holsteins (RH)), German Fleckvieh (FV) and Jersey (JY).
The P202ID allele was in complete LD to the POLLED allele in all breeds but HF, JY and WTG which turned out to carry the P202ID allele only sporadically. Polled HF, JY and WTG animals declared as Pp but missing the P202ID allele were carriers of the haplotype block of seven candidates detected by sequencing of HF animals (P5ID, PG1654405A, PC1655463T, PT1671849G, PT1680646C, PC1768587A and P80kbID). Furthermore, HF and JY animals declared as PP and missing P202ID were carriers of two copies of the haplotype detected by sequencing of polled HF animals.
To further proof the hypothesis of allelic heterogeneity of POLLED, we randomly sampled 400 animals in South-Germany complemented by random samples from target breeds. This semi-random sample comprised sets of 238 random HF/RH, 293 HF-FV crosses, 52 random JY, 50 random PNZ, and 211 animals from our cattle diversity panel (Table S1). These 1,262 DNA samples as well as the case and the carrier group were chosen for genotyping for the candidate variants: P202ID, P5ID, P80kbID PG1654405A, PC1655463T, PT1671849G, PT1680646C, and PC1768587A. Two SNPs PT1671849G and PT1680646C were excluded due to sporadic and solitary occurrence of the candidate mutation on RefSeq background in horned animals. So far, we do not observe recombination within PF haplotype block (from P5ID to P80kbID) and consequently do not exclude any candidate mutations by recombination. As soon as one candidate was excluded by confirmed genotyping we cancelled further genotyping of the respective marker. The breeding records, including polled certificates of the genotyped animals and their ancestors as well as their breed affiliation, were used to confirm the congruence between the obtained genotyping results at six remaining candidate mutations and polled status. The results are presented in Fig. 4 and summarised below. The single candidate variant P202ID detected by the sequencing of eight non HF sires and the block of five candidates detected by sequencing of eight HF animals (P5ID, PG1654405A, PC1655463T, PC1768587A and P80kbID) segregate independently without evidence for any interference or recombination. As all animals carrying one or two copies of the P202ID allele are polled (Pp or PP, respectively) and belong to breeds originating from geographical areas with Celtic culture  we hereafter figuratively and provisory called this allele PC, indicating polledness of Celtic origin (see Fig. 4). Appropriately, the haplotype block (260 kb long) detected by sequencing of HF animals is hereafter called PF, indicating polledness of Friesian origin. Both PC and PF, independently and in combination, were in perfect association with POLLED locus. There were two HF and one JY bull declared as PP which we found to be heterogeneous polled at candidate level, i.e. PC/PF. These three bulls were progeny-tested and mated to a large number of horned cows, permitting an official declaration as PP by the respective breeder associations. The pedigrees chart and genotype analyses of the PC/PF bulls and their sampled relatives (Fig. S3 and S4) clearly support allelic heterogeneity at the POLLED locus. Our diversity panel completed the semi-random sample (Table S1) and both together cover the European cattle population from the domestication centre, over the most likely dispersal routes for this species to the western- and northernmost parts of Europe (Fig. 4). This spatial well distributed panel comprises breeds with very small horns as well as extremely large horns  and with high genetic diversity between as well as within breeds . In these horned breeds, only wild-type alleles (prs) were detected for all six candidate mutations, hindering further sieving of causative variants from passenger variants in this design.
The largest proportion (shown in the row marked %) of genotyped animals (78.03%) were homozygous for the wild-type allele at all six candidate variants of the POLLED locus, i.e. they inherited two copies of the RefSeq haplotype prs presented by vertical gray bar. Plausibility analyses suggested all these 1261 (shown in the row marked with No) prs/prs animals as being horned, therefore pp at the POLLED locus. Five candidate mutations detected by sequencing of polled and horned Holstein-Friesian animals form a haplotype block (P5IDPG1654405APC1655463TPC1768587AP80kbID) consisting of three SNPs flanked by two InDels and called PF signifying polledness of Friesian origin. The five candidate mutations of Friesian polledness were superimposed on RefSeq background (vertical gray bar): three SNPs and a 5 bp InDel were outlined by red horizontal bars and the duplicated region P80kbID by the red area. The PF block is in perfect association with POLLED genotype and segregate only in cattle breeds originating from north-western coast of continental Europe (HF, RH, JY and WTG). The candidate mutation P202ID is represented by black horizontal bar on RefSeq background. All animals carrying one (86) or two copies (192) of the P202ID are polled (Pp or PP, respectively) and belong to breeds originating from Scandinavia, Great Britain, France and South-Germany, hereafter figuratively called polledness of Celtic origin, PC. Three bulls with PP genotype determined by extensive progeny testing were found to be heterogeneous polled at candidate gene level, i.e. PC/PF. The genotyping of their sampled relatives as well as entire experimental design provide no evidence for recombination within PF haplotype block. The geographic origin of the sampled breeds is outlined by dots on Europe map. Breeds lacking polled samples are represented by gray dots. The breeds with P202ID as only causal variant or variant in perfect association with POLLED locus are marked by black dots. The breeds with PF as predominant variant in perfect association with POLLED locus are marked by red dots. The approximate distribution area of Celtic and Friesian polledness are highlighted as gray and red shading respectively.
Sequence Conservation Among Horned Ruminants and Other Mammals Allows Scoring of Candidate Causal Mutations
The DSVs PG1654405A and P5ID are part of repeat-masked regions (MLT1B and LTR13BT), hence no conservation score (PhastCons scores) could be computed. Local blat searches against the selected species revealed only short multiple alignments to various genomic locations; in consequence no local alignments could be built. This however makes it unlikely that these two DSVs are linked to horn development as the flanking sequence is not conserved among horned ruminants. Moreover, these DSVs are found in a region with a high density of SNPs (Fig. 2B), indicating little selective pressure and making it less likely to encode for a functional element important for horn development.
The sequence flanking PC1655463T has a low conservation score, even among ruminants, and the local alignment (Fig. S5) shows that the corresponding sequence in the goat genome harbours a T, the same base as the variant allele found in polled Holstein cows. Thus, PC1655463T is less likely being causative for the polled phenotype.
The RefSeq nucleotide C at SNP PC1768587A is conserved only among bovid ruminants, where the variant allele is at the central position of a 24 nucleotide block that is identical in all four available Bovidae genomes. Also, the surrounding sequence is highly conserved among Bovidae but far less among other mammals (Fig. 2D and E). The SNP density of the whole region (Fig. 3B and 3C) including PC shows a contiguous area with high sequence conservation within cattle, making it a likely candidate for the presence of an un-annotated functional element.
Both PT1909354A, and P1909396D2, nested within the 80 kb duplication, affect elements that are conserved only among the bovid ruminants (Fig. S6). However, the 80 kb duplication can be causative itself, and sequence conservation of single elements within these 80 kb does not necessarily reflect the consequence this duplication may induce.
Sequence conservation across and within species thus allows ranking of the candidate mutations within PF haplotype, resulting in the P80kbID duplication (including the two variations PT1909354A and P1909396D2) and the SNP PC1768587A as two most plausible candidates. The candidate PC1655463T is considered as least likely as it coincides with sequence variation within different members of the Bovidae family suggesting a lack of functional constraint at this position.
The polled phenotype in cattle has been a target of selection and research since long. Large economic value of the phenotype , the simple dominant inheritance known since 1936  and the mapping position known since 1993  provided hope for an early and easy detection of the causal variant. This, however, was not the case due to special features of the phenotype, underlying causal mutations and livestock genetics: (i) the identification of functional candidate genes by the usual comparative trajectory with humans or mouse is hardly possible for this phenotype specific to Bovidae, (ii) the lack of functional and even positional candidate genes in large fractions of candidate region hindered identification of mutations conferring the polled phenotype and (iii) possible allelic heterogeneity of the phenotype investigated in a highly mobile species (world-wide and across-breeds use of some founders by artificial insemination) can reduce the ability to exploit haplotype diversity between breeds.
In this study, we used approaches beyond LD-based fine mapping to reduce the list of candidate causative genes without relying on any a priori assumptions of gene function. According to the contemporary trajectory of fine-mapping by a relatively dense SNP marker panel, high-throughput re-sequencing and association proven in a large panel of individuals sampled across the European continent we detected a single complex insertion-deletion event (P202ID) perfectly associated with the POLLED gene in most European cattle breeds. The absence of any other congruent candidate variant in the entire chromosome segment detected by fine-mapping, as well as perfect association with the phenotype, high sequence conservation of the candidate fragment among horned ruminants and absence of the appropriate conservation in un-horned mammals (Fig. 3C), clearly suggest P202ID as most probably being the causal mutation for polledness in most Bos taurus breeds. The fact that InDel P202ID resides in a region without known function is neither a clear argument against nor in favour of the candidate but is rather a proof of current rudimentary knowledge about gene function or presumed effects of candidate causal variants. The plausible causal mutation must fall within the limits of the dominant mode of inheritance. One possibility is haploinsufficiency which unlike in type 2 scurs  should not result in an enhancement of the effect in homozygotes. The next possibility is a gain-of-function mutation which would be characterized by a complete inhibition of the separate ossification centre in the tissues above the periosteum . Such new regulatory elements with abnormal function may fall within the class of non-coding RNA and micro RNA (miRNA) or promoter sequences. However, their de novo prediction from un-annotated sequence of a non-model organism is highly error-prone and was not used for speculation about the mechanisms causing polledness.
The biology of horn development has not been studied in detail, partly also due to the obvious lack of model organisms. The development of a new ossification centre should probably engage the well characterized network of bone development with the RUNX2 master transcription factor and its known regulators including twist proteins  and several miRNAs . Indeed, mapping of the type 2 scurs locus has identified a TWIST1 mutation as likely cause  and expression analysis of skin biopsies of the horn bud area in newborn polled and normal calves has shown evidence of endothelial-mesenchymal transition , a process involving bone morphogenetic protein signalling and twist protein activity. Genetic polledness is also known in goat and sheep, but the underlying mutations are mapped to chromosomes without synteny to the bovine polled locus investigated, here. Moreover, the polled/horned phenotype shows sexual dimorphism in sheep  and in goat is even linked with impaired sexual development , so that it is assumed that polledness in these species is caused by different mechanisms.
Including the cattle breeds originating from the North See area –HF, JY and WTG– led (Fig. 4) to the detection of a single haplotype block composed of three SNPs flanked by two InDels, all possible candidate causal variants for an independent mutation causing polledness. This Friesian haplotype, PF, segregates independently from the previously mentioned Celtic mutation, PC. Both are complementary and there is no evidence for any interference or recombination. Two Friesian and one Jersey bull were heterogenic at the POLLED locus (PF/PC) and declared as homozygous polled by progeny testing, i.e. all progeny from mating with horned dams are polled. All three bulls were intensively used for artificial insemination without any particular features and bred hundreds of exclusively polled offspring. Recombinant offspring of these bulls can supply important information for a formal test of allelic heterogeneity of the POLLED phenotype.
Not only economical reasons but also increasing serious animal welfare issues urge for a solution in the production of hornless cattle other than by dehorning. Even though the positive effects of genetic dehorning predominate, possible negative aspects should not be dismissed. In spite of a strong selection and serious attempts to improve breeding value of the polled sires these still lag behind in performance. For example, the best progeny tested polled (Pp) Fleckvieh bull currently ranks at position 1,724 of selection index, more than 1.3 standard deviations worse than the top 100 bulls. We find a similar situation in the Holstein breed. The inferior dairy breeding values of polled bulls might be caused simply by long-standing neglect of polled animals within sophisticated breeding programs or by multiple pleiotropic effects. It is known that some domestication traits result from mutations that cause large phenotypic effects but include deleterious pleiotropic effects . Short-termed agricultural interests can obviously overcome the negative pleiotropic effects of some large phenotypic effects like in myostatin null mutations in cattle . The recently mapped type 2 scurs syndrome locus with its underlying most probably causal mutation in the TWIST1 gene  represents another breeder-selected trait that is negatively correlated with fitness due to embryonic lethality for the homozygous state. To avoid possible negative long-term impacts, possible deleterious pleiotropic effects should be closely investigated before the massive amplification of the POLLED gene in large cattle populations.
In conclusion, we describe the mapping as well as the perfect association of one InDel and one short haplotype causing polledness in cattle and suggest the conservation score as prioritising criterion of the most probable causal mutations. The distribution of the two putative POLLED alleles across the European continent supports the hypothesis of two independent mutations. Traits with allelic heterogeneity are no exception but tedious to decipher. We presented research strategies which could be more widely applicable for deciphering the molecular mechanisms of phenotypes with allelic heterogeneity.
Materials and Methods
Collection of blood samples was conducted exclusively by local veterinarians during regular health inspection and quality control of breeding records on the farms, so that randomness of the sample was assured and no ethical approval was required for this study. The regular health inspection includes annual IBR diagnostics (BGBl. I Nr. 74 S. 3520 ff) and ruminant metabolisms control at Lehr- und Versuchsgut, Oberschleißheim, Germany (Dec. 55.2-1-54-2531.3-80-10). The regular quality control of breeding records includes paternity testing organised by the respective breeding associations. Paternity testing involves blood, semen and hair root samples. Blood sampling by veterinarians with state examination avoids unnecessary pain, suffering and damage and is in accordance with the German Animal Welfare Act. Semen samples were collected by approved commercial artificial insemination stations as part of their regular breeding and reproduction measures in cattle industry. Hair roots samples were collected by breeders themselves (hair plucking from pinna or tail-tassel) as part of regular sampling for parentage control. As acknowledged below and specified in Table S2 we used these regularly sampled blood, semen and hair roots samples for DNA preparation necessary for this study.
All animals sampled and genotyped in this study are presented in Table S1 according to their breed affiliations and purpose in the experimental design. These 1,675 animals originate from 31 European cattle breeds belonging to the Bos taurus subspecies. The DNA samples of all 162 cases, 162 controls and 89 carriers were genotyped genome-wide with the Illumina BovineSNP50 BeadChip . All 162 cases, 89 carriers and 1,262 semi random animals were genotyped for the revealed candidate mutations.
Phenotypes and Declaration of the Underlying POLLED Genotype
Polledness or complete absence of horns is a visible phenotype that can be identified at relatively young animals (four to six months). Because the growth of scurs occurs later in life than horns, phenotyping of some scurred animals will not be possible until nine to eighteen months of age . Polled animals, especially breeding bulls and important dams, are declared as polled (name suffix P) in their pedigree certificate and/or other records. One P designates polled animals with the PP or Pp genotype at underlying POLLED locus. Breeding animals with 12 to 15 (depending on breeding organisation) consecutively polled offspring originating from horned mates are declared as homozygous polled and get a name suffix PP. One horned offspring with confirmed paternity is sufficient to declare a polled animal as Pp.
DNA samples of 162 case and 162 controls were genotyped with the Illumina BovineSNP50 BeadChip . Marker order was based on release UMD3.1 of the Bos taurus genome (http://www.cbcb.umd.edu/research/bos_taurus_assembly.shtml). The SNP haplotypes were inferred and missing genotypes imputed using hidden Markov models (software package Beagle ). Three cohorts, namely trios (two parents, one offspring), pairs (one parent, one offspring) and unrelated animals were formed, including those animals that turned out not to be relevant for this study (2,721 animals in total). Genome-wide homozygosity mapping in polled animals and controls was performed using the asshom procedure . Case and control genotypes served as input data after completion of haplotype-inference and imputation by the Beagle package. To determine the statistical significance of each summary score, we permutated (50,000 permutations) the complete markers along the chromosomes  and estimated the summary score as harmonic mean across all cases. Thus, the corresponding P values were corrected for multiple testing and accounted for the level of inbreeding within the cases . The statistical significance based on 50,000 permutations is comparable to the number of markers used.
Genotyping of the Candidate Causal Mutations
In general, 30–60 ng of genomic DNA were used for genotyping of the candidate mutations. InDel variants P202ID and P5ID were PCR amplified (94°C 30 sec, 58°C 60 sec, 72°C 60 sec for 31 or 35 cycles, respectively) using primer binding sites flanking the InDel events (5′-TCAAGAAGGCGGCACTATCT-3′ and 5′-TGATAAACTGACCCTCTGCCTATA-3′ for P202ID and 5′-FAM-CCTTGTCACGTTAGATGTATGTCC-3′ and 5′-TCAATCTCTAATAAGGAACAGAAGAAA-3′ for P5ID). PCR products were size-separated and visualized by 2% ethidium-bromide stained agarose gel electrophoresis (P202ID) or analysed on an ABI Prism® 3130XL DNA sequencer (P5ID).
Genotyping of the P80kbID was performed by use of two primers flanking the variable site P1909396D2 (5′-GAAGTCGGTGGTCTGAAAGG-3′ and 5′-TGTTCTGTGTGGGTTTGAGG-3′). PCR amplification (32 cycles 94°C 30 sec, 59°C 60 sec, 72°C 60 sec) resulted in a RefSeq related product (prs) which was obtained in all animals, whether horned or polled. Additionally, a second PF related product, containing the two-base pair (TG) deletion was observed in all animals bearing one or two copies of the PF haplotype. Discrimination of the two products differing 2 bp in size was performed on an ABI Prism® 3130XL DNA sequencer. Animals bearing PF/PF were distinguished from PF/prs by quantitative evaluation of the obtained signals. PF/PF animals yielded signals of similar peak heights for the PF and prs products, while a heterozygous constellation (PF/prs) resulted in signal intensities of approximately double height for the prs product when compared to the PF specific product.
SNP variants PG1654405A and PC1655463T were genotyped by competitive allele-specific PCR using commercially available kits (KASPar®, KBioscience). PCR was performed as recommended by the manufacturer, while subsequent discrimination of the obtained fluorescent-labelled (FAM and CAL Fluor Orange 560) allele-specific products was performed on an ABI Prism® 3130XL DNA sequencer.
SNP variant PC1768587A was analysed by PCR-RFLP. Genomic DNA was amplified (32 cycles 94°C 30 s, 58°C 60 s, 72°C 60 s; 5′-CTGGAACCACGGATTACACAG-3′ and 5′-ACAGTTATGGTCAGGAGGCAAA-3′). Subsequently, three µl of the PCR product were treated with three units of TspRI (65°C for a minimum of 3.5 h). Obtained fragments were size-separated and visualized by 2% ethidium-bromide stained agarose gel electrophoresis.
One µg of Genomic DNA was randomly sheared by sonication (Bioruptor, Diagenode, Liege, Belgium) for 25 cycles (30 sec on/off, “low” intensity). Sheared DNA with a median size of 200–300 bp was rendered blunt-ended and 5′-phosphorylated (NEBnext end repair module, New England Biolabs Inc, Ipswich, USA). After addition of a single non-templated 3′-A (NEBnext A-tailing module) fragments were ligated to Illumina compatible adapters that carried a sample-specific 3 nt barcode and a 3′-T overhang. The ligated library was size-selected on a 2% agarose gel and amplified by PCR with Illumina PE1 and PE2 primers. Equimolar amounts of the eight samples were pooled for array-capture (Agilent 244 k capture Array, Agilent, Santa Clara, USA; custom designed by e-array, repeat-masked, 3 bp tiling). Briefly, the libraries were hybridised for 65 h at 65°C, washed and eluted with nuclease-free water for 10 min at 95°C. The eluted DNA was concentrated in a vacuum centrifuge, amplified by PCR (10 cycles 98°C 15 s, 65°C 30 s, 72°C 30 s) and purified with Ampure XP beads.
Mapping and Variant Calling
Sequence reads from the Illumina Genome Analyzer IIx were aligned to the bovine reference genome (UMD3.1) using BWA . The mapped reads were filtered for PCR duplicates with SAMtools  and only uniquely mapping reads were retained. A pileup of the mapped reads was created for each animal using SAMtools and variants were detected with VarScan (v.2.2.7)  at a minimal coverage of 20 and a minimal variant frequency of 0.01.
The applied filters for the detection of potentially causative variants were: Homozygosity in the PP sires with a variant frequency range of 95 to 100%. For the heterozygote carrier Pp they must fall within a range of 40 to 60%, while the resulting candidate variants must be undetected in the pp sires. The putatively causative variants must be different from the UMD3.1 genome sequence, which was derived from a horned Hereford dam and therefore cannot carry the polled allele.
Copy Number Variation
For the identification of large insertion-deletion events between polled and horned animals we calculated the coverage ratio between the two groups in dynamical bin sizes. Each bin size was calculated by iterating over the mapped reads in the reference group until a determined number of reads (400) was reached and the start position of the last read was taken as bin size . This step was repeated until the whole target region was processed. Then, for the target group the number of reads falling in each bin were counted and used for the calculation of the log2 ratios between reference and target group. The log2 ratios were plotted using R  and the average ratios were segmented using the circular binary segmentation implementation in the DNAcopy package (v1.14.0) from Bioconductor .
Multi-species Alignment and Conservation Scores
For the re-sequenced target region (chr1:1,543,412-2,089,648) we created multiple alignments for bovid ruminants (cattle, sheep and water buffalo), for unhorned mammals (horse, dog, human, mouse, pig, elephant, alpaca and dolphin) and for all selected species. The repeat-masked bovine genome sequence of the target region was obtained from UCSC genome browser  and used as reference for each of the pair-wise lastz genome alignments . The resulting alignments were chained and netted to obtain best matching hits  and multi-species alignments were generated with multiz  projected to the bovine reference. Additionally, short read sequences from the goat genome project (SRA Accession number: SRX016522) were added to the multi-species alignments only in the neighborhood of candidate mutations. To do so we aligned the short read to the bovine reference using blat  and manually selected the best alignments to the bovine reference.
The reference genomes of sheep (Ovis aries 1.0), human (hg19), mouse (mm9), dog (canFam2), horse (EquCab2) and pig (Sscrofa9.1) were obtained from the FTP-site at UCSC. The draft genome sequences from alpaca, elephant and dolphin were obtained from the Broad Institute  and the water buffalo genome sequence was taken from the Indian Buffalo Genome Project (http://188.8.131.52).
From the multiple species alignments we calculated the base conservation among horned animals, unhorned mammals and for all species using the phast package. For that purpose, we estimated a phylogenetic tree with PhyloFit from which the conservation score for each base was calculated with PhastCons.
The pedigree chart of most polled Braunvieh. All declared polled Braunvieh bulls are descendents of the well-known American Brown-Swiss bull BS1 which is founder of polledness in Brown-Swiss/Braunvieh cattle population. The case individuals (PP) are represented by solid circles (females) and squares (males); declared carriers by half-filled symbols; not sampled individuals are marked with a diagonal line. The haplotype associated with polledness (red letters) of the four genome-wide genotyped bulls can be traced back to the same carrier bull BS1. This pedigree includes also four polled Braunvieh animals (*) genotyped only for candidate mutations. All sampled PP animals were genotyped as PC/PC and all Pp as PC/prs.
The pedigree chart of all sampled Fleckvieh bulls composing case and carrier group in Table S1. The case individuals (PP) are represented by solid circles (females) and squares (males); declared carriers by half-filled symbols; not sampled individuals are marked with a diagonal line. To reduce complexity of the pedigree not all relationships were presented. At two positions (A and B) there are relationships to important sires indicated. The inbreeding of the Fleckvieh bull (PP-DFV1) chosen for re-sequencing is superimposed by red lines. All sampled Fleckvieh PP animals bear two copies of the common haplotype (AGACAAGGA) and were genotyped as PC/PC. All Pp bear one copy of the common haplotype and were genotyped as PC/prs. No recombination was detected in the common haplotype.
The pedigree chart of all sampled Holstein bulls composing case and carrier group in Table S1. The case individuals (PP) are represented by solid circles (females) and squares (males); declared carriers by half-filled symbols; not sampled individuals are marked with a diagonal line. To reduce complexity of the pedigree not all relationships were presented. At positions A, B and C there are relationships to important sires indicated. The founder of the Celtic polledness is indicated by D. The carriers and cases of Friesian and Celtic polledness are filled with gray (genotyped as PF/prs and PF/PF) and black color (genotyped as PC/prs and PC/PC), respectively. Two heterogeneous polled bulls were genotyped as PC/PF. At the SNP-Chip level all sampled Holstein PP and Pp animals bear two or one copy of the common haplotype (AGACAAGGA) and there was no recombination detected.
The pedigree chart of all sampled case and carrier Jersey bulls. The case individuals (PP) are represented by solid circles (females) and squares (males); declared carriers (Pp) by half-filled symbols; not sampled individuals are marked with a diagonal line. The founder of the Friesian polledness in Jersey breed is indicated by F. The carriers and cases of polledness with Friesian and Celtic origin are filled with gray (genotyped as PF/prs and PF/PF) and black color (genotyped as PC/prs and PC/PC), respectively. Two heterogeneous polled bulls were genotyped as PC/PF. At the SNP-Chip level all sampled Jersey PP and Pp animals bear two or one copy of the common haplotype (AGACAAGGA) and there was no recombination detected.
Across-species conservation for variant PC1655463T. (a) The sequence conservation around the position of the candidate mutation PC1655463T among animals without horn, horn-bearing and among all animals are represented as PhastCons scores, calculated from the underlying multi-species alignment in plot b. (b) Sequence identity among horn-bearing animals is denoted with red stars and black stars are used for identity in all aligned species. The position of the candidate DNA sequence variant is outlined by a dashed black line and is highlighted in the multi-species alignment by a red frame.
Across-species conservation for variants PT1909354A and P1909396D2. (a) The PhastCons conservation scores of the sequence around the two DNA sequence variants PT1909354A and P1909396D2 are shown. Both variants are outlined with black dotted lines. (b) Sequence identity is marked in the multi-species alignment by a red and black star among horn-bearing and all aligned species, respectively.
Breed origin of case-control design, carriers and semi-random samples. Breed names, abbreviations, geographic origin and numbers of genotyped samples are listed for each group within breed. The four groups are homozygous polled cases (PP), heterozygous polled carriers (Pp), horned controls (pp) and the semi-random sample (R) adjusted by the diversity panel.
Specification of the samples origin. Name of the samples supplier, affiliation, sampled breed(s), sampled tissue and primary reason for tissue sampling. This table is part of Acknowledgments and Ethics statement.
We are grateful for the support of the respective numerous cattle breeders and cattle breeding associations and insemination centres that generously provided the most samples free of charge. In particular, we thank K.H. Göpel (Göpel Genetik GmbH, Germany), Dr. G. Röhrmoser (Arbeitsgemeinschaft Süddt. Rinderzucht-u. Besamungsorganisationen e.V., München, Germany), Dr. A. Scholz (Lehr- und Versuchsgut, Oberschleißheim, Germany), Dr. J. Eder (Hochschule Weihenstephan-Triesdorf, Triesdorf, Germany), Dr. A. Medugorac (Institute for Animal Breeding, TU Munich), Dr. J. Ramljak (Department of Animal Science, University of Zagreb, Croatia), Dr. A. Ivankovic (Department of Animal Science, University of Zagreb, Croatia), Dr. K. Kume (NC of FAnGR, Tirana, Albania), Dr. H.P. Grünenfelder (SAVE Foundation, St. Gallen, Switzerland), Dr. V. Cadavez (Departament of Animal Science, CIMO, Braganca, Portugal), Dr. E.D. Hegemann (Tierarzt Praxis Hegemann, Soest, Germany), Dr. J. Potthast (Rinder-Union West eG, Münster, Germany), Dr. B. Weber (Masterrind GmbH, Verden, Germany), Mrs. B. Brentrup (WWS Germany GmbH, Altenberge, Germany), Mr. M. Schricker (Fleischrinderverband Bayern, Ansbach, Germany), Mr. H. Zuchtriegel (Zweckverband für künstliche Besamung der Haustiere, Greifenberg, Germany), Mr. E. Rosenberger (The Bavarian Gene Reserves, LfL, Grub, Germany), Mr. L. Eule (AgrarCenter Erzgebirge GmbH, Germany), Mr. N. Hartmann (Klöck Hartmann GbR, Bidingen, Germany), Mr. M. Wild (Family Wild, Prittriching, Germany), Mr. S. Rist (AHG Kempten, Germany), Mr. M. Kelz (Rinderbesamungsgenossenschaft Memmingen, Germany) and Mr. P. Lobet (Belgian Blue Group, Ciney). We thank Dr. C.E. Veit (Chair of Animal Genetics and Husbandry, LMU, Munich, Germany) for comments on English grammar. Detailed specification of the samples origin is given in the Supplementary Table S2.
Conceived and designed the experiments: IM SR MF HB. Performed the experiments: IM SK DS. Analyzed the data: IM SR SK AG DS IR. Contributed reagents/materials/analysis tools: KHG HB MF. Wrote the paper: IM SR SK DS IR.
- 1. Felius M (1995) Cattle breeds : an encyclopedia. Doetinchem: Misset. 799p. p.
- 2. Capitan A, Grohs C, Weiss B, Rossignol MN, Reverse P, et al. (2011) A newly described bovine type 2 scurs syndrome segregates with a frame-shift mutation in TWIST1. PLoS One 6: e22242.
- 3. Dove WF (1935) The physiology of horn growth: a study of the morphogenesis, the interaction of tissues and the evolutionary processes of a Mendelian recessive character by means of transplantation of tissues. J Exp Zool 69: 347–405.
- 4. Graf B, Senn M (1999) Behavioural and physiological responses of calves to dehorning by heat cauterization with or without local anaesthesia. Applied Animal Behaviour Science 62: 153–171.
- 5. Ramljak J, Ivankovic A, Veit-Kensch CE, Forster M, Medugorac I (2011) Analysis of genetic and cultural conservation value of three indigenous Croatian cattle breeds in a local and global context. J Anim Breed Genet 128: 73–84.
- 6. White W, Ibsen H (1936) Horn inheritance in Galloway-Holstein cattle crosses. Journal of Genetics 32: 33–49.
- 7. Long CR, Gregory KE (1978) Inheritance of the horned, scurred, and polled condition in cattle. Journal of Heredity 69: 395–400.
- 8. Drogemuller C, Wohlke A, Momke S, Distl O (2005) Fine mapping of the polled locus to a 1-Mb region on bovine chromosome 1q12. Mamm Genome 16: 613–620.
- 9. Georges M, Drinkwater R, King T, Mishra A, Moore SS, et al. (1993) Microsatellite mapping of a gene affecting horn development in Bos taurus. Nat Genet 4: 206–210.
- 10. Seichter D, Russ I, Rothammer S, Eder J, Förster M, et al. (2012) SNP-based Association mapping of the polled gene in divergent cattle breeds. Anim Genet. (in press).
- 11. Asai M, Berryere TG, Schmutz SM (2004) The scurs locus in cattle maps to bovine chromosome 19. Anim Genet 35: 34–39.
- 12. Capitan A, Grohs C, Gautier M, Eggen A (2009) The scurs inheritance: new insights from the French Charolais breed. BMC Genet 10: 33.
- 13. Charlier C, Coppieters W, Rollin F, Desmecht D, Agerholm JS, et al. (2008) Highly effective SNP-based association mapping and management of recessive defects in livestock. Nat Genet 40: 449–454.
- 14. Matukumalli LK, Lawley CT, Schnabel RD, Taylor JF, Allan MF, et al. (2009) Development and characterization of a high density SNP genotyping assay for cattle. PLoS One 4: e5350.
- 15. Goecks J, Nekrutenko A, Taylor J (2010) Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences. Genome Biol 11: R86.
- 16. Gibbs RA, Taylor JF, Van Tassell CP, Barendse W, Eversole KA, et al. (2009) Genome-wide survey of SNP variation uncovers the genetic structure of cattle breeds. Science 324: 528–532.
- 17. McEvoy B, Richards M, Forster P, Bradley DG (2004) The Longue Duree of genetic ancestry: multiple genetic marker systems and Celtic origins on the Atlantic facade of Europe. Am J Hum Genet 75: 693–702.
- 18. Medugorac I, Veit-Kensch CE, Ramljak J, Brka M, Marković B, et al. (2011) Conservation priorities of genetic diversity in domesticated metapopulations: a study in taurine cattle breeds. Ecology and Evolution 1: 408–420.
- 19. Prayaga KC (2007) Genetic options to replace dehorning in beef cattle–a review*. Australian Journal of Agricultural Research 58: 1–8.
- 20. Bialek P, Kern B, Yang X, Schrock M, Sosic D, et al. (2004) A twist code determines the onset of osteoblast differentiation. Dev Cell 6: 423–435.
- 21. Zhang Y, Xie RL, Croce CM, Stein JL, Lian JB, et al. (2011) A program of microRNAs controls osteogenic lineage progression by targeting transcription factor Runx2. Proc Natl Acad Sci U S A 108: 9863–9868.
- 22. Mariasegaram M, Reverter A, Barris W, Lehnert SA, Dalrymple B, et al. (2010) Transcription profiling provides insights into gene pathways involved in horn and scurs development in cattle. BMC Genomics 11: 370.
- 23. Johnston SE, McEwan JC, Pickering NK, Kijas JW, Beraldi D, et al. (2011) Genome-wide association mapping identifies the genetic basis of discrete and quantitative variation in sexual weaponry in a wild sheep population. Mol Ecol 20: 2555–2566.
- 24. Pailhoux E, Vigier B, Chaffaux S, Servel N, Taourit S, et al. (2001) A 11.7-kb deletion triggers intersexuality and polledness in goats. Nat Genet 29: 453–458.
- 25. Stern DL, Orgogozo V (2009) Is genetic evolution predictable? Science 323: 746–751.
- 26. Grobet L, Martin LJ, Poncelet D, Pirottin D, Brouwers B, et al. (1997) A deletion in the bovine myostatin gene causes the double-muscled phenotype in cattle. Nat Genet 17: 71–74.
- 27. Browning SR, Browning BL (2007) Rapid and accurate haplotype phasing and missing-data inference for whole-genome association studies by use of localized haplotype clustering. Am J Hum Genet 81: 1084–1097.
- 28. Li H, Durbin R (2009) Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25: 1754–1760.
- 29. Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, et al. (2009) The Sequence Alignment/Map format and SAMtools. Bioinformatics 25: 2078–2079.
- 30. Koboldt DC, Chen K, Wylie T, Larson DE, McLellan MD, et al. (2009) VarScan: variant detection in massively parallel sequencing of individual and pooled samples. Bioinformatics 25: 2283–2285.
- 31. Wood HM, Belvedere O, Conway C, Daly C, Chalkley R, et al. (2010) Using next-generation sequencing for high resolution multiplex analysis of copy number variation from nanogram quantities of DNA from formalin-fixed paraffin-embedded specimens. Nucleic Acids Res 38: e151.
- 32. Ihaka R, Gentleman R (1996) R: A Language for Data Analysis and Graphics. J Comput Graph Stat 5: 299–314.
- 33. Gentleman RC, Carey VJ, Bates DM, Bolstad B, Dettling M, et al. (2004) Bioconductor: open software development for computational biology and bioinformatics. Genome Biol 5: R80.
- 34. Kent WJ, Sugnet CW, Furey TS, Roskin KM, Pringle TH, et al. (2002) The human genome browser at UCSC. Genome Res 12: 996–1006.
- 35. Harris RS (2007) Improved pairwise alignment of genomic DNA: The Pennsylvania State University.
- 36. Kent WJ, Baertsch R, Hinrichs A, Miller W, Haussler D (2003) Evolution’s cauldron: duplication, deletion, and rearrangement in the mouse and human genomes. Proc Natl Acad Sci U S A 100: 11484–11489.
- 37. Blanchette M, Kent WJ, Riemer C, Elnitski L, Smit AF, et al. (2004) Aligning multiple genomic sequences with the threaded blockset aligner. Genome Res 14: 708–715.
- 38. Kent WJ (2002) BLAT–the BLAST-like alignment tool. Genome Res 12: 656–664.
- 39. Lindblad-Toh K, Garber M, Zuk O, Lin MF, Parker BJ, et al. (2011) A high-resolution map of human evolutionary constraint using 29 mammals. Nature 478: 476–482.