Body size is an important characteristic for horses of various breeds and essential for the classification of ponies concerning the limit value of 148 cm (58.27 inches) height at the withers. Genome-wide association analyses revealed the highest associated quantitative trait locus for height at the withers on horse chromosome (ECA) 3 upstream of the candidate gene LCORL. Using 214 Hanoverian horses genotyped on the Illumina equine SNP50 BeadChip and 42 different horse breeds across all size ranges, we confirmed the highly associated single nucleotide polymorphism BIEC2-808543 (−log10P = 8.3) and the adjacent gene LCORL as the most promising candidate for body size. We investigated the relative expression levels of LCORL and its two neighbouring genes NCAPG and DCAF16 using quantitative real-time PCR (RT-qPCR). We could demonstrate a significant association of the relative LCORL expression levels with the size of the horses and the BIEC2-808543 genotypes within and across horse breeds. In heterozygous C/T-horses expression levels of LCORL were significantly decreased by 40% and in homozygous C/C-horses by 56% relative to the smaller T/T-horses. Bioinformatic analyses indicated that this SNP T>C mutation is disrupting a putative binding site of the transcription factor TFIID which is important for the transcription process of genes involved in skeletal bone development. Thus, our findings suggest that expression levels of LCORL play a key role for body size within and across horse breeds and regulation of the expression of LCORL is associated with genetic variants of BIEC2-808543. This is the first functional study for a body size regulating polymorphism in horses and a further step to unravel the mechanisms for understanding the genetic regulation of body size in horses.
Citation: Metzger J, Schrimpf R, Philipp U, Distl O (2013) Expression Levels of LCORL Are Associated with Body Size in Horses. PLoS ONE 8(2): e56497. doi:10.1371/journal.pone.0056497
Editor: Elissa Z. Cameron, University of Tasmania, Australia
Received: August 26, 2012; Accepted: January 10, 2013; Published: February 13, 2013
Copyright: © 2013 Metzger et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This study was supported by the Mehl-Mülhens Stiftung, Köln (DI-MM/1-1). The funder had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Body size is an important model trait for studying genetic influences on quantitative traits and has been intensely investigated in human and also in domestic animals –. In human, adult height is described to be a complex trait influenced by many genes and environmental factors . Several genetic variants affecting the adult height have been identified using association analyses , .
In horses, body size is an important criterion for the evaluation of different breeds concerning appearance and function and is crucial for the classification of horses. According to the Fédération Equestre Internationale (FEI) veterinary regulations ponies taking part in any FEI competition have to be measured at the highest point of the withers. The limit height is in accordance with the definition of a pony 148 cm (centimetres) (58.27 inches) without shoes or with competition shoes 149 cm (58.66 inches). If this height is exceeded the animal is then classified as a horse . In some breeds, the limit values are even lower. The American Miniature Horse Association requires a limit height at the last hairs of the mane of 86.4 cm (34 inches). For breeders, body size of horses is an essential parameter to improve marketability, function and performance. The important effect of size for competitive jumping ability in ponies was suggested . Generally, larger animals within each height class possess competitive advantage and performances are evaluated correspondingly , . Due to the selection for specific functions, the domestic horse has been modified within breeds into diverse skeletal morphologic types. The heritability of height at the withers was estimated to be medium to high in pony breeds. Particularly, in Haflinger and Shetland ponies high heritabilities at 0.79–0.89 were found – while Icelandic and Hanoverian warmblood showed medium values at 0.5–0.6 , . The first attempt to identify patterns of skeletal size and shape variation among domestic horses has been made by principal component analyses . Overall body size was used as a principal component including thirty measurements all over horse's body like head length, height at withers, height at croup, chest width and neck length. It grouped small ponies together with low scores and large draft breeds with high median scores. Light horses showed mid-values . Several genome-wide association studies (GWAS) have been performed for height at withers in horses , , , . The involvement of LCORL (ligand-dependent nuclear receptor compressor-like protein) in height at withers has been primarily shown in a GWAS in Hanoverian stallions . A highly significant QTL on horse chromosome (ECA) 3 in the region of LCORL was detected for conformation traits like head, neck, frame and development . In Franches-Montagnes horses, quantitative trait loci (QTL) on ECA3 and 9 were significantly associated with withers height. Both associated SNPs were located in large intergenic regions . In thoroughbred horses, the same locus on ECA3 was found as highly associated with body size. A scan of 48 horses from 16 different breeds revealed that four loci on ECA3, 6, 9 and 11 explain 83% of the variance for size. The highest associated SNP was located near the candidate gene LCORL and this finding was in line with the other analyses for withers height in horses , , . In human, the candidate gene LCORL has been discussed to be involved in trunk and hip axis length . GWAS in cattle for growth traits such as birth weight, body length, carcass weight and longissimus muscle area, revealed an association in the region of NCAPG (non-SMC condensing I complex subunit) and LCORL. Coding regions of the candidate genes were sequenced and revealed further associated markers . In Angus cattle, polymorphisms in the candidate genes adiponectin (ADIPOQ) and somatostatin (SST) were tested for growth and carcass traits. The strongest statistical effect was detected for the SNP ADIPOQ:g.1596G>A, possibly influencing the anchoring of the transcription pre-initiation transcription factor IID (TFIID) complex and therefore affecting the stability of the initiation complex . Analyses of TFIID in mice resulted in dwarf phenotypes with an about 50% reduced body weight. According to its multiple molecular functions, TFIID is considered as a central component of the transcription apparatus . Furthermore, analyses showed a correlation between body weight, withers height and further size measurements .
In thoroughbred horses, a mass to height at the withers ratio was used to test the influence of the MSTN (myostatin) gene g.66493737C>T genotypes on body mass. Sprinters are proposed to be generally shorter animals with greater muscle mass , .
The objective of our study was to investigate the role of the candidate gene LCORL and its flanking polymorphisms with the development of body size in horses. First, we validated the association of the genomic region around LCORL in Hanoverian warmblood horses as well as substantiated the association of this locus with body size in a large number of horse breeds with extreme size and then we sequenced genomic and copy DNA of LCORL for polymorphism detection. In order to show a functional relationship of LCORL with body size and the associated SNP upstream of LCORL, we employed expression analyses for LCORL and its two adjacent genes for horses with differing height at withers within and across breeds.
GWAS confirmed the association of BIEC2-808543 (Broad institute nomenclature; EquCab2_105547002T>C) on ECA3 adjacent to the candidate gene LCORL for the Hanoverian warmblood population (Figure 1A). The observed −log10P-values were plotted against the expected −log10P-values. The quantile-quantile-plot (Q-Q) indicated that the population stratification was eliminated through the identity-by-state (IBS) kinship matrix as far as possible (Figure 1B). MLM analysis revealed the highly associated SNP BIEC2-808543 at 105.55 Mb (−log10P = 8.3, after accounting for multiple testing using a Bonferroni-correction −log10P = 3.6). The next highest associated SNP (BIEC2-808466) was located in the same region at 105.16 Mb (−log10P = 7.6, after accounting for multiple testing using a Bonferroni-correction −log10P = 2.95). The highly associated SNP BIEC2-808543 showed a minor allele frequency (MAF) of 0.45 and was distributed almost evenly among the Hanoverian warmblood horses (Table 1). In total, 35 horses were homozygous T/T and 56 horses homozygous C/C, 123 horses showed the heterozygous genotype (C/T). The distribution of the breeding values showed that BIEC2-808543 had a highly significant additive effect on the breeding value for body size, while the dominant effect approached zero.
(A) Manhattan-plot of the −log10P-values from genome-wide association analysis (MLM) of body size in Hanoverian warmblood horses. The highest peak is located at 105 Mb on ECA3. (B) Q-Q plot of observed versus expected −log10P-values from a genome- wide association study (GWAS) in Hanoverian warmblood horses. The expected distribution (solid line) and the observed −log10P-values plotted against the expected −log10P-values (black dots) are shown. The peak value (BIEC2-808543) is located on horse chromosome 3 at 105.55 Mb.
After validating the highly associated SNP BIEC2-808543 in the Hanoverian warmblood population, we performed a large scan for the genotypic distribution in 1851 horses of extreme size. Genotyping of the BIEC2-808543 showed that the genotype T/T was nearly perfectly associated with all pony breeds up to the limit value of 148 cm for the height at the withers. They showed a significantly higher allele frequency for T in contrast to the larger horses (Figure 2). The small horses varying between 130 cm (Dülmener) and 160 cm in the height at withers, showed homozygous T/T and heterozygous genotypes while larger and heavier horses predominantly showed the genotype C/C (Table S1). Heterozygous horses in pony breeds showed relatively high withers height values and those stallions served breeders to pass on larger height at the wither to the offspring. The results show that this SNP proved to be a highly predictive marker of genetic potential for body size.
The results show that the allele frequency corresponds to the average height.
Candidate region analysis
Bioinformatic sequence analysis of the genomic region of the highly associated SNP (BIEC2-808543) using Patch 1.0 and SIGNAL SCAN showed that the polymorphism was located in a putative DNA consensus sequence element, the transcription factor binding site of TFIID, which influences the transcription by RNA polymerase II , . This putative TATA box element (3′-ATAAA-5′) is modified due to the mutated allele C and prediction software suggests that the putative binding site disappears because of this BIEC2-808543 mutation.
TFIID has been supposed to play a role in influencing genes responsible for skeletal development. According to its function and its close proximity to the associated SNP, we sequenced the candidate gene LCORL, located on ECA3, for polymorphisms influencing body size. Comparison of the annotations of LCORL using NCBI (National Center for Biotechnology Information) and Ensembl resulted in two different gene models showing two transcripts with six exons with differences in the sequence of exon 6 (Ensembl) and one transcript with seven exons (NCBI). Sequence analyses of the cDNA revealed two transcripts with seven exons each (Figure 3) and showed an exon 1 sequence different to that predicted in the reference sequence. The experimental protein sequence 1 (BankIt1561108 Seq1 JX515275) showed a similarity of 97% to the human isoform 1. For the experimental protein sequence 2 (BankIt1561108 Seq2 JX515276), a similarity of 95% could be shown.
Sequence analyses of LCORL revealed two transcripts with seven exons each. The scale is in base pairs (bp). Black boxes indicate consecutively numbered exons with given sizes in base pairs (bp). Dotted and solid lines below the gene models represent the coverage of the complementary DNA (cDNA) primers. Polymorphisms are shown above and the highly associated single nucleotide polymorphism (SNP) BIEC2-808543 is printed in bold type.
Comparing the experimental cDNA sequences of two horses with the BIEC2-808543 genotype T/T and C/C revealed a twelve base pair deletion (BankIt1561108 Seq1 JX515275c.7-19Del) (Table S2) in the T/T horse. Further genotyping of 71 horses of different heights and genotypes showed only eight horses with this 12-bp-deletion and did not confirm an association with body size. All experimental sequences showed various differences to the reference sequence of exon 1. These new sequences were useful to complete the annotation of the protein structure of LCORL. Sequence analyses of the genomic DNA (gDNA) revealed two SNPs (NC_009146.3g.118421+29708C>T; NC_009146.3g.118421+29710T>C) and one insertion (NC_009146.3g.118421+29840InsT) in intron 6 in comparison with the reference sequence (Table S2) but these polymorphisms were not associated with body size.
The quantitative real-time PCR (RT-qPCR) analysis for LCORL revealed a significant decrease of relative LCORL expression in medium sized horses (Hanoverian 166–168 cm and Dülmener >133 cm) with the genotype C/T and an even more decreased relative expression in larger sized horses with the SNP genotype C/C (Hanoverian >168 cm and Rhenish German Draught). Across all horses analysed, a significant decrease of 40% could be shown in heterozygous (C/T) (P = 0.016) and 56% in homozygous C/C horses (P<0.001) in comparison to T/T horses (Figure 4). In the Hanoverian warmblood population, the relative expression of the heterozygous C/T-horses was decreased by 44% and in homozygous C/C-horses by 54% (P = 0.024) when T/T-horses were used as reference (Figure 5). General linear model (GLM) analysis revealed a significant effect for the T>C genotype but neither a significant influence of the sex, breed or breed by genotype on the expression results. Furthermore, the age and time of sampling had also no effect on the different expression levels. Relative expression analyses of the adjacent genes NCAPG and DCAF16 (damage-specific DNA-binding protein and cullin-4 associated factor 16) revealed no significant effects associated with body size or the T>C genotypes using GLM analyses (Figures S1, S2). Semi-quantitative expression analyses of testis, hair, brain, kidney, muscle and liver tissues revealed that LCORL, NCAPG and DCAF16 are almost equally expressed in these tissues and hair roots (Figure S3). EST profiles confirmed that these genes are almost ubiquitously expressed (http://www.ncbi.nlm.nih.gov/UniGene).
In comparison with the T/T genotypes, the expression of horses with the C/T genotype (P = 0.016) is decreased by 44% and for the genotype C/C (P<0.001) by 54%. The expression differences were accounted for using the ΔΔCT method.
In this study, we could show that the T>C mutation of the SNP BIEC2-808543 is located within a potential transcription factor binding site and this mutation is significantly associated with relative expression levels of LCORL and simultaneously with body size within and across horse breeds. GWAS within Hanoverian warmblood horses and across a large number of horses from 42 different breeds confirmed this SNP to be a highly predictive marker for body size.
The body size associated SNP is located within a binding site for the general transcription factor TFIID working as a TATA box-binding factor. While the homozygous genotype T/T of BIEC2-808543 possibly enables the anchoring of TFIID, the SNP allele C could presumably modify the binding site function and therefore influence the TFIID recognition of core promoter elements, the first step of the initiation of mRNA transcription . This initiation is a key stage in the regulation of gene expression and plays an important role in the regulation of the AP-1, activator protein-1 transcription factor complex, in bone cell development , . Skeletal bones are developed by the three cell types, chrondrocytes, osteoblasts and osteoclasts. The differentiation and function of theses cells is regulated by several factors influencing specific gene expression. Analyses in human bone have shown that members of the AP-1 family strongly affect these cells , . In vivo studies in mice with TFIID inactivated component resulted in a reduced size of different organs including an attenuated growth, resulting in dwarfism with an about 50% reduced body weight, and showed that a lacking function of the TFIID mechanism is able to have a strong influence on body size .
Modifications in transcriptional regulation are proposed to influence several biological processes determining body size. Possible targets discussed in adult height analysis were genes controlling intracellular signalling, cell division, DNA replication and skeletal development , , , . Therefore, the C allele of BIEC2-808543 is presumably the reason for the reduced expression of LCORL in larger sized horses. LCORL, also known as the Mblk1-related protein, shows characteristic motifs of transcription factors and analyses with mouse tissues indicate that it is able to activate transcription. Transcription factors are key proteins in various biologic processes. To achieve their different roles they have to change or specialize their functions or target genes . We assume that this specific function of LCORL could possibly be involved in such a complex trait.
In human genome-wide scans for adult stature evidences LCORL to be associated with trunk length and hip axis length . According to the conserved synteny of 53% of the equine chromosomes to a single human chromosome  horses' size could possibly be influenced by similar genomic regions. In cattle, a GWAS revealed two SNPs in LCORL highly associated with feed intake and body weight gain phenotypes. It was supposed that SNPs affecting the transcription or translation of LCORL may result in an increased or decreased regulation of genes involved in growth .
We assume that similar effects might also explain the variation in body sizes in horses. GWAS for body size in horses in previous studies revealed several QTL for different breeds , , , . Depending on breed and population different candidate genes were discussed. Nevertheless, all studies had the QTL on ECA3 near LCORL in common , , , . The present analysis of the Hanoverian warmblood horses and extreme size horse breeds confirmed the associated SNP adjacent to LCORL and emphasized the assumption that LCORL is strongly involved in the development of body size within and across breeds. This finding was the basis for the gene expression analysis.
The expression analysis was performed using hair root samples due to the ubiquitous expression profile and the non invasive sampling of these tissues. Body size is assumed to be a result of different gene interactions in human that are not yet investigated , . Nevertheless, the correlation between growth of body and hair could be shown in various studies. The Rothmund-Thomson syndrome in human for example is characterized by severe dwarfism combined with an abnormal hair growth . Studies in mice revealed growth retardation in hair length and a retarded rate of body growth caused by the supply of high concentrations of the epidermal growth factor (EGF) .
Our results of the expression analyses showed that the relative expression levels of LCORL decreased considerably in larger sized and heavy horses with the C/C genotype. Horses heterozygous for the SNP BIEC2-808543 showed relative expression levels in-between the two homozygous genotypes. They represent the medium sized horses and producers of larger sized ponies. The mechanism how BIEC2-808543 effects LCORL is not yet known but the mutated transcription factor binding site might cause significant changes of the relative expression levels.
The distribution of body sizes and expression levels indicated that LCORL might act as a main regulator for body size. Nevertheless, as studies in human height suggest, body size is a complex trait and further regulatory elements must be involved to create such huge differences in skeletal morphologic types . The candidate genes NCAPG and DCAF16 could be eliminated as candidates, whereas HMGA2 (high mobility group AT-hook 2, ECA6), ZFAT (zinc finger and AT hook domain containing, ECA9), LASP1 (LIM and SH3 protein 1, ECA11) could also possibly involved.
The results demonstrate that we have identified a functional polymorphism differentiating ponies from medium and large and heavy horse breeds and accounting for body size variation within horse breeds. This T>C polymorphism is located within the binding site of the transcription factor TFIID and was significantly associated with body size and relative expression levels of LCORL. This is the first functional study for a body size regulating polymorphism in horses and further steps are necessary to unravel the mechanisms for understanding the complex genetic regulation of body size in horses.
Materials and Methods
All animal work has been conducted according to the national and international guidelines for animal welfare. The EDTA-blood and hair root sampling was approved by the Lower Saxony state veterinary office, Niedersächsisches Landesamt für Verbraucherschutz und Lebensmittelsicherheit, Oldenburg, Germany (registration numbers 02A-138 and 07A-482).
Blood samples were collected from 214 Hanoverian warmblood horses including 150 stallions of the National State Stud of Lower Saxony and 64 broodmares. For these horses, breeding values, conformation and pedigree data were made available by the Hanoverian Studbook Society (HSS) through the national unified animal ownership database (Vereinigte Informationssysteme Tierhaltung w.V., VIT, Verden/Aller, Germany). We used the latest breeding values (BVs) for height at withers (WH) (edited in November 2011) provided by HSS. BVs for WH were estimated based on results of studbook inspection (SBI) since 1979 including 85,598 Hanoverian warmblood horses. BVs are estimated yearly through the VIT for WH employing a BLUP (best linear unbiased prediction) animal model .with yijk = height at withers, μ = model constant, TESTi = fixed effect of the individual test representing the interaction between the place, year and season of performance evaluation, aj = random additive genetic effect of the individual horse and eijk = random residual.
All BVs were standardized to a mean value of 100 points and a standard deviation of 20 points using the horses of the birth cohorts from 2000 and 2002 as reference.
For the horses studied here, the mean BV for WH was 105±25 (range 24–164). The mean reliabilities of the BVs were at 0.9. The distribution of the BVs for WH was analysed using the UNIVARIATE procedure of SAS (Statistical Analysis System, version 9.3, SAS Institute, Cary, NC, USA, 2011). The additive genetic effect was estimated as half of the difference of the least square means among the two homozygous genotypes. The dominance effect was calculated as the deviation of the least square mean of the heterozygotes from the average of the two homozygous genotypes.
For each horse the proportions of genes of Hanoverian (HAN), Thoroughbred (TB), Trakehner (TRAK) and Holsteiner (HOL) were calculated using all available pedigree information. Details are described elsewhere . Mean (median) proportions of genes in the stallions were 0.54 (0.63) for HAN, 0.28 (0.19) for TB, 0.05 (0.03) for TRAK, and 0.06 (0) for HOL.
Across-breed analysis was performed in 1851 horses of fourty-two different breeds including the pony breeds American Miniature Horse, Dartmoor, Exmoor, German Classic Pony, German Riding Pony, Haflinger, Icelandic, Lewitzer Pony, Miniature Shetland pony, Norwegian Fjord Horse, Shetland pony, Terceira Pony, Welsh Section A and the horse breeds with average withers height values below 148 cm (58.27 inches) (Dülmener, Arabian, Przewalski, Sorraia) or a range of variation around 148 cm (58.27 inches) (Anglo-Arabian, American Paint Horse, Appaloosa, Black Forest Horse, Lusitano, Peruvian Paso, Quarter Horse, Tinker, Thoroughbred), lighter coldblood horses (Black Forest Horse), heavy draft horse breeds (Altmark Coldblood, Mecklenburg Coldblood, Noriker horse, Rhenish German Draught, Saxon Thuringian Coldblood, Schleswig Draught, Shire Horse, South German Coldblood) and warmblood horse breeds (Hanoverian, Oldenburg, Westphalian, Rhinelander horse, Trakehner, Zweibrücker, Holsteiner, Selle Francais). Size ranges were detected for every breed and results were averaged.
Genomic DNA was isolated using 500 µl EDTA blood by standard ethanol fraction. Precipitation was achieved by 6 M NaCl, 70% ethanol, and 100% ethanol (Carl Roth) in consecutive steps according to standard protocols. Genotyping was performed with the Illumina equine SNP50 BeadChip (Illumina, San Diego, CA, USA) including 54,602 SNPs using standard procedures as recommended by the manufacturer. Data were analyzed and file clusters were generated using the genotyping module version 3.2.32 of the BeadStudio program (Illumina).
Across breed analysis required an additional genotyping of BIEC2-808543 by the use of restriction fragment length polymorphism (RFLP). We used the restriction enzyme BsrI according to NEBcutter V2.0 recommendations (http://tools.neb.com/NEBcutter2/) and primers were designed using Primer3 (http://frodo.wi.mit.edu/primer3/) (Table S3). The reaction was assembled in 30 µl total volume containing 2 µl DNA, 17.6 µl H2O, 6.4 µl enhancer solution P (Peqlab Biotechnologie), 3.2 µl incubation mix with MgCl2, 0.5 µl dNTP mix, 0.3 µl 1000 U Taq Polymerase (Taq Core Kit 10 (1000 U), MP Biomedicals, LLC, Germany), 1 µl forward and 1 µl reverse primers. The reaction was performed on a PTC 200™ thermocycler (MJ Research, Inc., Waltham, USA) setting 30 seconds denaturation at 94°C, 34 cycles of 94°C for 30 seconds, 60°C annealing temperature, 72°C for 40 seconds and finally 4°C for 10 minutes. The incubation of PCR (polymerase chain reaction) amplificates (10 µl) was performed with 16.75 µl H2O and 3 µl NEB buffer 3 (New England Biolabs, Ipswich, USA) at 65°C for 12 hours. Products were separated by gel electrophoresis using 3% agarose gels (peqGold MoSieve Agarose MS 500, Peqlab Biotechnologie). Genotypes were determined by visual examination under UV illumination (BioDocAnalyze, Biometra, Göttingen, Germany).
The polymorphism BankIt1561108 Seq1 JX515275c.7-19del was genotyped in 71 horses including one Arabian, one Dartmoor, twenty-four German Classic Ponies, eight Haflinger, ten Hanoverian, one Icelandic, three Miniature Shetland ponies, one Noriker horse, one Norwegian Fjord Horse, one Oldenburg horse, eight Rhenish German Draught, seven Shetland ponies and five Welsh Section A. Fluorescence labelled primer 5′-AGGGCTCCGGCACTGAGCAG-3′ and unmodified primer 5′-CAGAGGGAAGGTAGTGACACG-3′ were used for PCR-amplification with an annealing temperature of 60°C according to standard protocols. PCR products were size-fractioned by gel electrophoresis on 6% polyacrylamide denaturing gels (RotiphoreseGel 40, Carl Roth) using an automated capillary sequencer (LI-COR 4200/S-2, LI-COR 4300, LI-COR Biotechnology, Bad Homburg, Germany).
Candidate region analysis
Bioinformatic analysis of the region of BIEC2-808543 was performed using Patch 1.0 (http://www.gene-regulation.com/cgi-bin/pub/programs/pmatch/bin/p-match.cgi) on the public database TRANSFAC (version 7.0, Public 2005) which contains data on transcription factors, their experimentally-proven binding sites, and regulated genes. Further verification was done by SIGNAL SCAN (http://www.gene-regulation.com/cgi-bin/pub/programs/sigscan/sigscan.cgi). The model of the candidate gene LCORL was build using Spidey (http://www.ncbi.nlm.nih.gov/spidey/index.html), a tool to align expressed sequences to their parent genomic sequences . For BLAST searches, the resources of the National Center for Biotechnology Information (NCBI) were used (http://www.ncbi.nlm.nih.gov/blast/Blast.cgi) . The open reading frame (ORF) Finder (http://www.ncbi.nlm.nih.gov/projects/gorf/) and Windows 32 EditSeq 4.03, graphical analysis tools which find all open reading frames of a selectable minimum size in a user's sequence, were also used for analysis.
Equine complementary DNA (cDNA) of the candidate gene LCORL (NCBI Gene ID: 100068577; Ensembl ENSECAG00000000648) was sequenced in three German warmblood horses and one Arabian thoroughbred using hair root, kidney and testicular tissues. RNeasy Lipid Tissue Mini Kit (Qiagen, Hilden, Germany) was used for purification of about 50 µg total RNA from stabilized tissues according to manufacturer's protocol. It was transcribed into cDNA by Maxima First Strand cDNA Synthesis Kit for RT-qPCR (Fermentas Life Sciences, St. Leon-Rot, Germany). Quality control was performed with primers (5′CAAAAACAACAGACAGCCTTATGC-3′ and 5′-GCTCTGCCAGTACCCCAAGA-3′) of the RPL4, ribosomal protein L4, gene (ECA1) spanning two exons and a short intron in between. The product size differed clearly in cDNA (80 bp) and gDNA (268 bp) products. Primers were designed using Primer3 (http://frodo.wi.mit.edu/primer3/) (Table S4) and PCR was performed in 20 µl total volume. We used 2 µl cDNA, 12.1 µl H2O, 4.2 µl enhancer solution P (Peqlab Biotechnologie), 2.1 µl incubation mix with MgCl2, 0.3 µl dNTP mix, 0.3 µl 1000 U Taq Polymerase (Taq Core Kit 10 (1000 U), MP Biomedicals, LLC, Germany) and 0.5 µl forward and 0.5 µl reverse primers. The reaction was performed on a PTC 200™ thermocycler (MJ Research): 4 minutes denaturation at 94°C, followed by 34 cycles of 94°C and primer adapted annealing temperature for 30 seconds, 72°C for 40 seconds and finally 4°C for 10 minutes. Furthermore, except for the first exon, the exons and exon/intron boundaries of LCORL were sequenced on gDNA in ten horses to identify polymorphisms. PCR conditions were according to the cDNA analysis. Three horses showed the BIEC2-808543 genotype C/C, one horse was homozygous for T and seven horses had a heterozygous genotype (C/T). In exon 1 no appropriate PCR product could be amplified while nine of ten sequences of exon two didn't have sufficient quality for the evaluation. Sequencing was performed by the automated sequencer Genetic Analyzer 3500 (Applied Biosystems by Life Technologies).
For the analysis of the LCORL, NCAPG and DCAF16 expression, RNA was isolated from hair root samples of eleven Arabian and three Welsh Section A with the genotype T/T, eight Rhenish German Draught with the genotype C/C, twelve Dülmener showing the genotypes T/T and C/T and 13 Hanoverian of all three genotypes by RNeasy Lipid Tissue Mini Kit (Qiagen) according to manufacturer's protocol (Table S5). Furthermore we isolated testis, brain, kidney, muscle and liver tissues as controls for semi-quantitative expression analysis. It was transcribed into cDNA by Maxima First Strand cDNA Synthesis Kit for RT-qPCR (Fermentas Life Sciences). Glyceraldehyde-3-phosphate dehydrogenase (GAPDH) was used as a housekeeping gene. The reactions were run on an optical 96-well reaction plate (Applied Biosystems) in a final volume of 11.5 µl containing 1.5 µl cDNA, 1.0 µl reverse and 1.0 µl forward primes of the candidate gene and the GAPDH (Table S6), 0.25 µl VIC-labeled TaqMan probe for the candidate gene and FAM-labeled TaqMan probe for GAPDH (Applied Biosystems), 1.68 µl nuclease free water and 6 µl Maxima Probe qPCR master mix 2× supplemented by 0.07 µl ROX Solution (Fermentas Life Sciences). The quantitative real-time (qRT)-PCR was performed using an ABI7300 sequence detection system (Applied Biosystems) under the following conditions: 10 min at 95°C followed by 40 cycles at 95°C for 15 sec and 60°C for 1 min. All samples were analysed in duplicates. The identified quantities of gene expression were normalised by the GAPDH expression level (ΔCT). For calibration the average ΔCT of all horses with the genotype T/T was used and the relative expression level was calculated by the ΔΔCT method . The average ΔCT of medium sized horses (C/T) and of larger horses (C/C) was subtracted from the standard ΔCT (T/T) and the potency was calculated from these values. The deviations from the standard ( = 1) were given as a percentage. The same procedure was applied for the Hanoverians.
We performed a GLM analysis using SAS/Genetics, version 9.3 (Statistical Analysis System, 2012) for testing the effects of genotype, sex, breed, breed by genotype as well as age and time of sampling. Semi-quantitative expression analysis of different tissues was performed by PCR, using the primer pairs of qPCR analysis. The reaction was assembled in 20 µ, including 2 µl cDNA, 12.1 µl H2O, 4.2 µl enhancer solution P (Peqlab Biotechnologie), 2.1 µl incubation mix with MgCl2, 0.3 µl dNTP mix, 0.3 µl 1000 U Taq Polymerase (Taq Core Kit 10 (1000 U), MP Biomedicals) and 0.5 µl forward and 0.5 µl reverse primers. The reaction was performed on a PTC 200™ thermocycler (MJ Research): 4 minutes denaturation at 94°C, followed by 30 cycles of 94°C and 60°C for 30 seconds, 72°C for 40 seconds and finally 4°C for 10 minutes.
Analysis was performed in 44,496 SNPs with a minor allele frequency (MAF) >0.05 and a call rate of >90%. The data quality control was done using PLINK, version 1.07 (http://pngu.mgh.harvard.edu/purcell/plink/)  and SAS/Genetics, version 9.3 (2012). For the GWAS, a mixed linear model (MLM) was employed in order to control data for stratification. A marker based identity-by-state (IBS) kinship matrix among all horses (K-matrix) was employed for parameterization of a random polygenic effect. Fixed effects were the sex of the animal and gene proportions of Hanoverian, Thoroughbred, Holsteiner and Trakehner using TASSEL (Trait Analysis by Association, Evolution and Linkage) version 3.0, a software package for association mapping of complex traits in diverse samples . Quantile-quantile (Q-Q) plots for observed versus expected −log10P-values were constructed to control for population stratification. Significance thresholds were determined using a Bonferroni correction and the MULTITEST procedure of SAS.
Relative expression level of NCAPG in relation to the BIEC2-808543 genotype across five different breeds (A) and within-breed in 13 Hanoverian horses (B). No significant differences between the expression levels of horses of different sizes and genotypes could be seen.
Relative expression level of DCAF16 in relation to the BIEC2-808543 genotype across five different breeds (A) and within-breed in 13 Hanoverian horses (B). No significant differences between the expression levels of horses of different sizes and genotypes could be seen.
Semi-quantitative PCR analysis of investigated genes in different tissues. The expression of testis, hair roots, brain, kidney, muscle and liver is shown for LCORL (A), NCAPG (B) and DCAF16 (C).
Number of animals genotyped for the SNP BIEC-808543 on horse chromosome 3, average height at withers and genotypic distribution per breed.
Polymorphisms and their position, type, base change and source identified in the sequence analysis of LCORL. No associations for different body sizes could be detected.
Single nucleotide polymorphism on equine chromosome (ECA) 3, its primer sequence and product size used for genotyping by restriction fragment length polymorphism (RFLP).
Primer sequences and their position, product size and annealing temperature (AT) for sequencing the equine genomic and cDNA of LCORL .
Samples used for expression analysis of LCORL, NCAPG and DCAF16. The breed, height at the withers, genotype, sex, age at the time of sampling and the time of sampling are shown.
Primer sequences, their product sizes, annealing temperatures (AT) and TaqMan probes used for real-time quantitative PCR (RT-qPCR) for LCORL , NCAPG and DCAF16 using GAPDH as reference gene.
We thank the Hanoverian Studbook Society for providing data and samples of Hanoverian warmblood horses. We are grateful to all horse breeders donating us samples from their horses and providing the data.
Conceived and designed the experiments: OD JM UP. Performed the experiments: JM UP OD RS. Analyzed the data: JM OD UP. Contributed reagents/materials/analysis tools: OD JM. Wrote the paper: JM OD.
- 1. Signer-Hasler H, Flury C, Haase B, Burger D, Simianer H, et al. (2012) A genome-wide association study reveals loci influencing height and other conformation traits in horses. PLoS One 7: e37282. doi: 10.1371/journal.pone.0037282
- 2. Makvandi-Nejad S, Hoffman GE, Allen JJ, Chu E, Gu E, et al. (2012) Four Loci explain 83% of size variation in the horse. PLoS One 7: e39929. doi: 10.1371/journal.pone.0039929
- 3. Gudbjartsson DF, Walters GB, Thorleifsson G, Stefansson H, Halldorsson BV, et al. (2008) Many sequence variants affecting diversity of adult human height. Nat Genet 40: 609–615. doi: 10.1038/ng.122
- 4. Lindholm-Perry AK, Sexten AK, Kuehn LA, Smith TP, King DA, et al. (2011) Association, effects and validation of polymorphisms within the NCAPG - LCORL locus located on BTA6 with feed intake, gain, meat and carcass traits in beef cattle. BMC Genet 12: 103. doi: 10.1186/1471-2156-12-103
- 5. Shriner D, Adeyemo A, Gerry NP, Herbert A, Chen G, et al. (2009) Transferability and fine-mapping of genome-wide associated loci for adult height across human populations. PLoS One 4: e8398. doi: 10.1371/journal.pone.0008398
- 6. Soranzo N, Rivadeneira F, Chinappen-Horsley U, Malkina I, Richards JB, et al. (2009) Meta-analysis of genome-wide scans for human adult stature identifies novel loci and associations with measures of skeletal frame size. PLoS Genet 5: e1000445. doi: 10.1371/journal.pgen.1000445
- 7. van de Pol C, van Oldruitenborgh-Oosterbaan MMS (2007) Measuring the height of ponies at the withers: Influence of time of day, water and feed withdrawal, weight-carrying, exercise and sedation. Vet J 174: 69–76. doi: 10.1016/j.tvjl.2006.10.023
- 8. Ricard A (2004) Heritability of jumping ability and height of pony breeds in France. Livest Prod Sci 89: 243–251. doi: 10.1016/j.livprodsci.2004.01.006
- 9. Van Bergen H, Van Arendonk J (1993) Genetic parameters for linear type traits in Shetland Ponies. Livest Prod Sci 36: 273–284. doi: 10.1016/0301-6226(93)90058-p
- 10. Miglior F, Pagnacco G, Samore AB (1998) A total merit index for the Italian Haflinger horse using breeding values predicted by a multi-trait animal model. Proc 6th WCGALP 24: 416–419.
- 11. Stock KF, Distl O (2006) Genetic correlations between conformation traits and radiographic findings in the limbs of German Warmblood riding horses. Genet Sel Evol 38: 657–671. doi: 10.1051/gse:2006027
- 12. Arnason T (1984) Genetic studies on conformation and performance of Icelandic toelter horses.1. Estimation of non-genetic effects and genetic parameters. Acta Agriculturae Scandinavica 34: 409–427. doi: 10.1080/00015128409435410
- 13. Curtis GC, Grove-White D, Ellis RN, Argo CM (2010) Height measurement in horses and ponies: optimising standard protocols. Vet Rec 167: 127–133. doi: 10.1136/vr.c3722
- 14. Sadek MH, Al-Aboud AZ, Ashmawy AA (2006) Factor analysis of body measurements in Arabian horses. J Anim Breed Genet 123: 369–377. doi: 10.1111/j.1439-0388.2006.00618.x
- 15. Brooks SA, Makvandi-Nejad S, Chu E, Allen JJ, Streeter C, et al. (2010) Morphological variation in the horse: defining complex traits of body size and shape. Anim Genet 41 Suppl 2: 159–165. doi: 10.1111/j.1365-2052.2010.02127.x
- 16. Distl O, Schröder W, Dierks C, Klostermann A (2011) Genome-wide association studies for performance and conformation traits in Hanoverian warmblood horses. 9th Dorothy Russel Havemeyer Foundation, International Equine Genome Mapping Workshop. Oak Ridge Conference Center, Chaska, Minnesota: Distl, O. pp. 12.
- 17. Schröder W (2010) Athletic performance and conformation in Hanoverian warmblood horses - population genetic and genome-wide association analyses [cumulative thesis]. Hannover: University of Veterinary Medicine.
- 18. Morsci NS, Schnabel RD, Taylor JF (2006) Association analysis of adiponectin and somatostatin polymorphisms on BTA1 with growth and carcass traits in Angus cattle. Anim Genet 37: 554–562. doi: 10.1111/j.1365-2052.2006.01528.x
- 19. Tatarakis A, Margaritis T, Martinez-Jimenez CP, Kouskouti A, Mohan WS 2nd, et al. (2008) Dominant and redundant functions of TFIID involved in the regulation of hepatic genes. Mol Cell 31: 531–543. doi: 10.1016/j.molcel.2008.07.013
- 20. Saastamoinen A (1990) Heritabilities for Body Size and Growth Rate and Phenotypic Correlations among Measurements in Young Horses. Acta Agricult Scand 40: 377–389. doi: 10.1080/00015129009438573
- 21. Hill EW, McGivney BA, Gu J, Whiston R, Machugh DE (2010) A genome-wide SNP-association study confirms a sequence variant (g.66493737C>T) in the equine myostatin (MSTN) gene as the most powerful predictor of optimum racing distance for Thoroughbred racehorses. BMC Genomics 11: 552. doi: 10.1186/1471-2164-11-552
- 22. Hill EW, Gu J, Eivers SS, Fonseca RG, McGivney BA, et al. (2010) A sequence polymorphism in MSTN predicts sprinting ability and racing stamina in thoroughbred horses. PLoS One 5: e8645. doi: 10.1371/journal.pone.0008645
- 23. Sawadogo M, Roeder RG (1985) Interaction of a gene-specific transcription factor with the adenovirus major late promoter upstream of the TATA box region. Cell 43: 165–175. doi: 10.1016/0092-8674(85)90021-2
- 24. Orphanides G, Lagrange T, Reinberg D (1996) The general transcription factors of RNA polymerase II. Genes Dev 10: 2657–2683. doi: 10.1101/gad.10.21.2657
- 25. Yang MQ, Laflamme K, Gotea V, Joiner CH, Seidel NE, et al. (2011) Genome-wide detection of a TFIID localization element from an initial human disease mutation. Nucleic Acids Res 39: 2175–2187. doi: 10.1093/nar/gkq1035
- 26. Nakatani Y, Horikoshi M, Brenner M, Yamamoto T, Besnard F, et al. (1990) A downstream initiation element required for efficient TATA box binding and in vitro function of TFIID. Nature 348: 86–88. doi: 10.1038/348086a0
- 27. Rodan GA, Harada S (1997) The missing bone. Cell 89: 677–680. doi: 10.1016/s0092-8674(00)80249-4
- 28. St-Arnaud R, Quelo I (1998) Transcriptional coactivators potentiating AP-1 function in bone. Front Biosci 3: d838–848.
- 29. Weedon MN, Lango H, Lindgren CM, Wallace C, Evans DM, et al. (2008) Genome-wide association analysis identifies 20 loci that influence adult height. Nat Genet 40: 575–583. doi: 10.1038/ng.121
- 30. Lettre G, Jackson AU, Gieger C, Schumacher FR, Berndt SI, et al. (2008) Identification of ten loci associated with height highlights new biological pathways in human growth. Nat Genet 40: 584–591. doi: 10.1038/ng.125
- 31. Kunieda T, Park JM, Takeuchi H, Kubo T (2003) Identification and characterization of Mlr1,2: two mouse homologues of Mblk-1, a transcription factor from the honeybee brain (1). FEBS Lett 535: 61–65. doi: 10.1016/s0014-5793(02)03858-9
- 32. Wade CM, Giulotto E, Sigurdsson S, Zoli M, Gnerre S, et al. (2009) Genome sequence, comparative analysis, and population genetics of the domestic horse. Science 326: 865–867. doi: 10.1126/science.1178158
- 33. Hall JG, Pagon RA, Wilson KM (1980) Rothmund-Thomson syndrome with severe dwarfism. Am J Dis Child 134: 165–169. doi: 10.1001/archpedi.1980.02130140039013
- 34. Moore GP, Panaretto BA, Robertson D (1981) Effects of epidermal growth factor on hair growth in the mouse. J Endocrinol 88: 293–299. doi: 10.1677/joe.0.0880293
- 35. Christmann L (1996) Zuchtwertschätzung für Merkmale der Stutbuchaufnahme und der Stutenleistungsprüfung im Zuchtgebiet Hannover. Dissertation, Georg-August Universität Göttingen.
- 36. Hamann H, Distl O (2008) Genetic variability in Hanoverian warmblood horses using pedigree analysis. J Anim Sci 86: 1503–1513. doi: 10.2527/jas.2007-0382
- 37. Wheelan SJ, Church DM, Ostell JM (2001) Spidey: a tool for mRNA-to-genomic alignments. Genome Res 11: 1952–1957.
- 38. Johnson M, Zaretskaya I, Raytselis Y, Merezhuk Y, McGinnis S, et al. (2008) NCBI BLAST: a better web interface. Nucleic Acids Res 36: W5–9. doi: 10.1093/nar/gkn201
- 39. Livak KJ, Schmittgen TD (2001) Analysis of relative gene expression data using real-time quantitative PCR and the 2(-Delta Delta C(T)) Method. Methods 25: 402–408. doi: 10.1006/meth.2001.1262
- 40. Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MAR, et al. (2007) PLINK: a toolset for whole-genome association and population-based linkage analysis. Am J Hum Genet 81: 559–575. doi: 10.1086/519795
- 41. Bradbury PJ, Zhang Z, Kroon DE, Casstevens TM, Ramdoss Y, et al. (2007) TASSEL: software for association mapping of complex traits in diverse samples. Bioinformatics 23: 2633–2635. doi: 10.1093/bioinformatics/btm308