The eye color of birds, generally referring to the color of the iris, results from both pigmentation and structural coloration. Avian iris colors exhibit striking interspecific and intraspecific variations that correspond to unique evolutionary and ecological histories. Here, we identified the genetic basis of pearl (white) iris color in domestic pigeons (Columba livia) to explore the largely unknown genetic mechanism underlying the evolution of avian iris coloration. Using a genome-wide association study (GWAS) approach in 92 pigeons, we mapped the pearl iris trait to a 9 kb region containing the facilitative glucose transporter gene SLC2A11B. A nonsense mutation (W49X) leading to a premature stop codon in SLC2A11B was identified as the causal variant. Transcriptome analysis suggested that SLC2A11B loss of function may downregulate the xanthophore-differentiation gene CSF1R and the key pteridine biosynthesis gene GCH1, thus resulting in the pearl iris phenotype. Coalescence and phylogenetic analyses indicated that the mutation originated approximately 5,400 years ago, coinciding with the onset of pigeon domestication, while positive selection was likely associated with artificial breeding. Within Aves, potentially impaired SLC2A11B was found in six species from six distinct lineages, four of which associated with their signature brown or blue eyes and lack of pteridine. Analysis of vertebrate SLC2A11B orthologs revealed relaxed selection in the avian clade, consistent with the scenario that during and after avian divergence from the reptilian ancestor, the SLC2A11B-involved development of dermal chromatophores likely degenerated in the presence of feather coverage. Our findings provide new insight into the mechanism of avian iris color variations and the evolution of pigmentation in vertebrates.
Birds exhibit striking eye color variations, providing a unique angle for understanding avian evolution. Here we identified the genetic basis of the pearl (white) iris color in domestic pigeons (Columba livia) to a nonsense mutation W49X in SLC2A11B via whole genome sequencing and genome-wide association study (GWAS) approaches. SLC2A11B is a gene with known roles in fish pigment cells differentiation and transcriptome analysis indicated that SLC2A11B loss of function may downregulate the xanthophore-differentiation gene CSF1R and the key pteridine biosynthesis gene GCH1, resulting in the pigeon’s pearl iris phenotype. The SLC2A11B variant was estimated to have originated at approximately 5,400 years ago coinciding with the onset of pigeon domestication and was then under positive selection likely associated with artificial breeding. Potentially impaired SLC2A11B was also found in six species from six distinct avian lineages. Analysis of vertebrate SLC2A11B orthologs revealed relaxed selection in the avian clade, consistent with the scenario that the SLC2A11B-involved development of dermal pigment cells likely degenerated in the presence of feather coverage. Our study sheds new light on the largely unknown genetic mechanism underlying the evolution of avian iris color variations.
Citation: Si S, Xu X, Zhuang Y, Gao X, Zhang H, Zou Z, et al. (2021) The genetics and evolution of eye color in domestic pigeons (Columba livia). PLoS Genet 17(8): e1009770. https://doi.org/10.1371/journal.pgen.1009770
Editor: Kelly A. Dyer, University of Georgia, UNITED STATES
Received: February 16, 2021; Accepted: August 10, 2021; Published: August 30, 2021
Copyright: © 2021 Si et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: The sequencing data for domestic pigeon accessions from this study have been deposited in the NCBI Sequence Read Archive (BioProject ID: PRJNA682513).
Funding: The project was supported by the National Natural Science Foundation of China (http://www.nsfc.gov.cn) (31970537 to X. X., and 32070598 to S. -J. L.), the National Key Research and Development Program of China (2017YFF0210303 to S. -J. L.), and the Peking-Tsinghua Center for Life Sciences (http://www.cls.edu.cn) (to S. -J. L.). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Integumentary pigmentation plays essential roles in camouflage, sexual selection, communication, and thermoregulation in vertebrates [1–3]. The dynamic color change and diverse pigmentation of poikilothermic vertebrates are mostly attributed to neural crest-derived dermal chromatophores, which are generally divided into three main categories: xanthophores/erythrophores (yellow to red), iridophores/leucophores (iridescent color or white), and melanophores (black) [4–6].
The eye color of a bird, usually referring to the color of the iris, is derived from both pigmentation and structural coloration, that is, the presence of materials that diffract light. In homeothermic birds, with outer feather coverage masking skin pigmentation, dermal chromatophores may have undergone relaxed selection and become subject to evolutionary demise . However, the avian iris maintains the potential for complete development of all types of pigment cells that are comparable to the chromatophores in poikilothermic vertebrates, probably due to its external, exposed location where chromatophores are under constant selective pressure, thus remaining as a “pigment cell refugium” during avian evolution .
The pigment cells are located in the eye stroma and the anterior layer of the iris consists of loose vascular connective tissue. Eye color in birds varies by the presence or absence of pigment cells in the iris and the content of pigments in those cells and exhibits striking interspecific variations, ranging from black and dark brown to brilliant colors that cover nearly the full spectrum of the rainbow [7–9]. Intraspecific eye color variation is also common and often associated with age and sex in wild birds [8,10–12]. Although the evolutionary drive shaping avian eye color remains largely unknown, recent studies have shed light on the possible coevolution of eye color and behavior or activity rhythm in birds [13–16]. Iris color variation may reflect unique evolutionary histories and ecological adaptions and thus provides a unique angle for understanding avian radiation, as well as pigmentation evolution across vertebrates.
The domestic pigeon (Columba livia) was derived from its synonymic wild ancestor, the rock pigeon, and its initial domestication is believed to have occurred approximately 5,000 years ago in the Mediterranean region [17–19]. After the onset of domestication, pigeons have been subject to intense selective breeding to have produced a wide array of phenotypic diversity within the species [17,18,20]. Recent studies have revealed the molecular basis of some intraspecific variations in domestic pigeons, such as plumage pigmentation, feather ornamentation, epidermal appendages, and navigation behavior, and highlighted the genes that might play important roles in avian evolution with respect to these features [21–29].
The domestic pigeon exhibits three major types of iris color: yellow to orange “gravel” (wild type, Fig 1A), white “pearl” (Fig 1B), and black “bull” eyes [7–9]. The gravel and pearl irises in pigeons contain bright pigment cells with birefringent crystals in the anterior stromal tissue, while the bull eye results from a complete absence of stromal pigment cells . Crystalline guanine was identified as the major pigment in both gravel and pearl irises, and at least two yellow fluorescing pteridine materials were identified leading to the yellow color tone in the gravel iris . Therefore, the stromal pigment cell in the gravel iris is also referred to as a “reflecting xanthophore” . By contrast, the white color of a pearl eye is the result of the presence of guanine only without the manufacture of yellow pigments (pteridines) in the pigment cells. All red colors in gravel and pearl eyes are from the blood vessels in the outer iris (Fig 1). The irises of all newly hatched pigeons are bull-eyed dark, and those with gravel or pearl eyes gradually develop a brighter color after two to three months. Empirical breeding records indicate that the pearl eye color is an autosomal recessive trait relative to the gravel eye, denoted as the Tr locus, and that the bull eye trait is associated with white feathers . Similar stromal pigment cells and/or developmental color changes were found in the irises of other avian species [8,12,32, 33], suggesting that the domestic pigeon is an ideal model to study the evolution of avian pigmentation variations.
(A) A pigeon with gravel eyes (wild type) exhibits a bright orange iris color; (B) A pigeon with pearl eyes exhibits a grayish-white iris color. The red background in both gravel and pearl irises is from capillaries. Photo credit: J. -O. Gao.
Here, we investigated the genetic basis of pearl irises in domestic pigeons. We identified the genetic mutation in SLC2A11B responsible for the iris color change from gravel to pearl and found that the pearl iris trait likely originated approximately 5,400 years ago as a domesticated trait under artificial selection. Evolutionary analyses revealed a conserved role for SLC2A11B in avian eye color and associated its mutation with a lack of yellow iris pigmentation in various lineages of birds. Analysis of vertebrate SLC2A11B orthologs further revealed a relaxation of selection in the avian clade after its divergence from the reptilian ancestor, shedding new light on the origin and evolution of avian coloration.
The Tr locus is mapped to a 9 kb genomic region
A case-control genome-wide association study (GWAS) based on whole-genome resequencing was performed in domestic pigeons (racing homers) to identify the genomic region responsible for pearl eyes. Whole-genome sequencing data were generated from 49 gravel-eyed and 43 pearl-eyed individuals at approximately 10× genome coverage each (S1 Table). Of the 28,628,324 SNPs initially identified, 2,490,056 passed genotype filtering and quality control and were used in subsequent analysis (S1 Fig). Significant signals of association (P < 2.01×10−8, after Bonferroni correction) were observed in 160 consecutive SNPs from the pigeon genome (Cliv_2.1, GenBank accession: GCA_000337935.2)  scaffold AKCR02000030.1 (hereafter referred to as scaffold 30), which spanned a single 341 kb region associated with the pearl-eyed phenotype (minimum P = 1.07×10−16) (Figs 2A and S2). No SNPs from other scaffolds showed a significant signal of association with the iris phenotypes (Figs 2A and S2). To fine-map the pearl eye Tr locus, haplotypes of the mapped region on scaffold 30 were built for each individual based on whole-genome sequencing data. While wild-type individuals exhibited multiple haplotypes in this region, a single 9 kb haplotype (scaffold 30: 1894006–1903342) was shared by all 43 pearl-eyed pigeons, highlighting this region as a strong candidate for the Tr locus (Fig 2B).
(A) The Manhattan plot of the GWAS analysis for the Tr locus. The gray line indicates the Bonferroni-corrected critical value (P = 2.01×10−8). The scaffold with the strongest association signal is marked. (B) The genotypes of 92 pigeons across the mapped region associated with the pearl eye. Each row represents one individual, and each column represents an SNP/indel. Blue indicates homozygosity for the major allele in pearl-eyed pigeons, yellow indicates homozygosity for the minor allele in pearl-eyed pigeons, green indicates heterozygosity, and white indicates missing data. A shared haplotype (scaffold 30: 1,894,006–1,903,342) was identified. Genes located in and around the shared region are labeled.
A premature stop codon in SLC2A11B causes the pearl eye color in pigeon
The mapped 9 kb interval of the Tr locus contained only one protein-coding gene according to the gene annotation of the reference genome (Cliv_2.1): A306_00005299 (Fig 2B). A306_00005299 was annotated as a 12-transmembrane helix transporter gene, solute carrier family 2 member 11-like (S3A and S3B Fig), and was further identified as SLC2A11B (solute carrier family 2, facilitative glucose transporter, member 11b) based on gene homology and synteny in vertebrates (avian, reptile, and fish). The full-length pigeon SLC2A11B transcript was validated by RT-PCR from the iris tissue RNA of a gravel-eyed pigeon. SLC2A11B is known to play an essential role in the differentiation of leucophores and xanthophores in medaka fish  and hence is a strong candidate for pearl eye color in domestic pigeons. To identify the causative mutation, we screened variations in the pigeon SLC2A11B coding region and identified a G-to-A transition in exon 3 that introduced a premature stop codon (W49X) to truncate the 448 downstream amino acids (Fig 3A). SLC2A11B loss of function in medaka fish prevented the development of xanthophores and eliminated the yellow pigment in leucophores at embryonic/larval stages . SLC2A11B-knockout zebrafish also exhibited reduced yellow pigment due to defects in xanthophore differentiation (https://zmp.buschlab.org/gene/ENSDARG00000093395). The pigmentation change of the iris stromal pigment cells in pearl-eyed pigeons resembles the leucophore color switch from yellow to white in medaka fish larva with SLC2A11B loss-of-function , thus supporting W49X of SLC2A11B as the causal mutation responsible for the pearl eye color. This nonsense mutation was further validated in 146 gravel-eyed and 146 pearl-eyed individuals by PCR and Sanger sequencing. All gravel-eyed pigeons carried at least one wild-type allele, while 141 out of the 146 pearl-eyed individuals were homozygous for the mutant allele, consistent with its recessive mode of inheritance. The only exception included five pearl-eyed pigeons that were heterozygous for the SLC2A11B W49X variant (Fig 3B) and had no other mutations found in the exons of SLC2A11B. Such phenotype-genotype incompatibility in these five outliers could have been due to mis-phenotyping of young pigeons or observer error.
Genes with known functions in xanthophore/leucophore pigmentation are downregulated in SLC2A11B W49X pigeon iris
The expression pattern of SLC2A11B was profiled in multiple tissues from nine wild-type domestic pigeons. SLC2A11B expression was equal and extremely low in both skin and feather buds (P = 0.736) (Fig 4A), consistent with the absence of the non-melanocyte pigment cells in avian skin. On the contrary, SLC2A11B was highly expressed in iris, exhibiting an 8.88-fold increase (P < 0.001) relative to that of the skin tissue (Fig 4A). Using the expression in skin as a baseline, higher level of expression of SLC2A11B was also observed in several other tissues, including the muscle (10.85-fold change, P < 0.001), retina (8.35-fold change, P < 0.001), brain (4.27-fold change, P < 0.001), heart (2.44-fold change, P < 0.001), and liver (1.69-fold change, P = 0.022) (Fig 4A). These patterns jointly suggest the role of SLC2A11B in the pigmentation process of pigeons may be restricted to iris only among the epithelial tissues, while exerting pleiotropic effects on some internal organs.
(A) Normalized expression of SLC2A11B in various tissues from wild-type pigeons as determined by qPCR analysis. Boxes span the first to third quartiles, bars extend to the minimum and maximum observed values, black lines indicate the medians, and circles represent the data points. (B) Normalized comparison of SLC2A11B expression between the pearl and gravel irises by transcriptome sequencing. Differentially expressed genes involved in xanthophore differentiation or pteridine synthesis are highlighted. (C) qPCR validation of differentially expressed genes (CSF1R, GCH1, and SLC2A11B) between pearl and gravel irises. *P < 0.05, **P < 0.01, ***P < 0.001.
Transcriptome analysis of iris tissues from four gravel-eyed and five pearl-eyed pigeons (S2 Table) identified 337 differentially expressed genes (DEGs) (FDR < 0.05; Figs 4B and S4 and S5, S3 Table). SLC2A11B was not significantly differentially expressed, but a downregulation trend of SLC2A11B, likely caused by nonsense-mediated RNA decay, was evident in pearl-eyed individuals (Fig 4B), which was further confirmed by qPCR (Fig 4C). Among these DEGs, we examined the genes involved in pteridine biosynthesis or xanthophore/leucophore differentiation. We identified that GTP cyclohydrolase 1 (GCH1), the gene encoding a rate-limiting enzyme in the pteridine pathway, was significantly downregulated (3.6-fold) in the iris tissues of pearl-eyed pigeons (Fig 4B), consistent with the absence of yellow pigment. We also found that colony-stimulating factor 1 receptor (CSF1R), a gene encoding a receptor tyrosine kinase required for the differentiation of the pteridine-containing xanthophore [36–38], was significantly downregulated (1.8-fold) in pearl irises (Fig 4B). The reduced expression of GCH1 and CSF1R was further confirmed by qPCR (Fig 4C). Overall, these differential transcriptome profiles between gravel and pearl eyes in the pigeon suggested that the loss-of-function variant of SLC2A11B may affect the differentiation of xanthophore-like stromal pigment cells and pteridine biosynthesis machinery, resulting in the reduction of yellow pigment and hence the pearl iris color in pigeon.
In addition, located in the same linkage group with SLC2A11B we found six DEGs (IGLL1, GGT1, TBX3, KNTC1, PRC1, and A306_00008692; S3 Table), among which GGT1 and TBX3 are involved in the regulation of melanocyte in mammals [39,40]. Although we cannot completely rule out the possibility that potential regulatory variants within the GWAS candidate interval may cause pearl eyes, this alternative hypothesis is highly unlikely, as none of the six DEGs is known to have any function associated with xanthophore or leucophore, and hence could not explain the pigmentation change in pearl-eyed pigeons.
Pearl eye is a domesticated trait under artificial selection in pigeons
We surveyed the SLC2A11B mutation in 45 pigeons that represented 35 breeds worldwide  to investigate the evolution and origin of the pearl-eyed trait in pigeons (S4 Table). The variant was found in 29 of the 45 pigeon genomes, or 20 of the 35 breeds (S4 Table). Such widespread presence of SLC2A11B W49X allele is consistent with the phenotypic prevalence in pigeons, in that at least 17 of the 35 breeds are independently documented to carry the pearl-eyed trait (S4 Table).
To further investigate the origin of the SLC2A11B W49X mutation, a neighbor-joining phylogenetic tree was reconstructed based upon the 9 kb nonrecombining Tr region from 139 domestic pigeons (35 fancy pigeons, 2 feral pigeons, and 102 racing homers) and one C. rupestris that is considered sister taxa to all domestic pigeons (Fig 5A). All haplotypes containing the W49X mutation (trmut) formed a monophyletic group nested within the wild-type (Tr+) clade (Figs 5A and S6), indicating a derived state of the SLC2A11B variants originating from the wild type via one single mutation event. The derived status of trmut in relation to Tr+ was also evident in the median-joining haplotype network (S6 Fig). In addition, the fact that all mutant haplotypes cluster together in the gene tree but spread across different breed sources and geographic regions (Fig 5A) supports that the mutation is old and likely occurred prior to the establishment of modern pigeon breeds. We further estimated the time to the most recent common ancestor (TMRCA) for all SLC2A11B haplotypes bearing the W49X variant at approximately 5,400 years ago (95% CI: 4,700–5800 years ago) (Fig 5B), a time period coinciding with pigeon domestication that was estimated at more than 5,000 years ago.
(A) Neighbor-joining (NJ) trees of the 9 kb nonrecombining Tr region based on 81 haplotypes from 140 unrelated individuals comprising 139 domestic pigeons (35 fancy pigeons, two feral pigeons, and 102 racing pigeons) and one Columba rupestris (outgroup). All nodes had more than 60% support from 1,000 bootstrap replicates. Orange branches indicate trmut haplotypes, blue branches indicate Tr+ haplotypes, and the black branch indicates the outgroup. Different circle colors represent the seven traditional breeds of domestic pigeons and the outgroup. The red star indicates the time of origin of the Tr mutation. (B) Violin plots of the posterior distribution of the estimated TMRCA for the Tr locus in domestic pigeons. The generation time was set at one year. Two independent simulations were performed using the estimated recombination rates in 10 kb and 50 kb windows. In every simulation, 10 replicate MCMCs were plotted with transparency.
Signals of selection for the pearl-eyed trait in pigeons were tested based on 139 genome datasets through evaluation of integrated haplotype score (iHS), nucleotide diversity (π), Tajima’s D, and extended haplotype homozygosity (EHH). We first applied iHS analysis to scaffold 30, where the Tr locus is located, and identified continuous signals of positive selection at the Tr locus and its adjacent region (scaffold 30: 1.4–2.0 Mb) (Figs 6A and S7A and S7B). Decreased nucleotide diversity and negative Tajima’s D were evident across the SLC2A11B haplotypes with the W49X mutant (Fig 6C and 6D), consistent with a selective sweep scenario. The mutant haplotypes exhibited longer homozygosity than the wild-type haplotypes in the EHH analysis (Figs 6B and S7C and S7D), in support of the occurrence of selection for the variant. These evidences jointly illustrated that the prevalence of the pearl iris trait in pigeons was likely a result of artificial selection during pigeon domestication.
(A) The distribution of integrated haplotype score (iHS) values on scaffold AKCR02000030.1. The gray lines indicate the significance of the absolute iHS scores of 3 or greater. (B) Extended haplotype homozygosity (EHH) decay across the Tr locus region. Nucleotide diversity (π, Pi) (C) and Tajima’s D (D) were calculated in 10 kb windows with a 1 kb step size for trmut and Tr+ haplotypes.
SLC2A11B is associated with avian iris color variation
Given the important role of SLC2A11B in determining the iris color variations in domestic pigeons, it is plausible that defects in SLC2A11B in other bird species may also be responsible for the lack of pteridine and hence contributing to the avian iris color diversity. We screened sequence variations across the SLC2A11B coding region in 34 avian species, whose iris pigment information [9,41] and genomes data are both available (S5 Table), and identified 335 non-synonymous variations. Four amino acid changes, including one nonsense (Q39X), one reading frameshift (T484Tfs3), and two missense (L129F and L229M) mutations (Figs 3A and S8 and S5 Tables), are predicted to impair SLC2A11B function in six bird species.
A nonsense mutation Q39X is identified in the house sparrow (Passer domesticus), whose eyes are sepia brown without pteridines in the iris tissue. Q39X is predicted to cause SLC2A11B loss-of-function by truncating the downstream 458 amino acid residues, and is consistent with the lack of yellow pigmentation in the iris of house sparrows.
T484Tfs3 is found in the American anhinga (Anhinga anhinga), double-crested cormorant (Phalacrocorax auritus), and greater rhea (Rhea americana). The reading frameshift mutation truncates the SLC2A11B C-terminus by 12 amino acids, most of which are evolutionarily conserved sites. The deleterious impact of T484Tfs3 on SLC2A11B is consistent with the pteridine-free iris of the double-crested cormorant (blue eye)  and greater rhea (dark brown eye) , though not in the American anhinga whose pale yellow iris contains pteridines .
The missense mutations L129F and L229M are detected in the grey parrot (Psittacus erithacus, white eye) and the Muscovy duck (Cairina moschata, brown eye) respectively. Both variants occur at the evolutionarily conserved sites (S8 Fig) and are predicted to affect SLC2A11B function (with a deleterious SIFT score less than 0.05). L229M is likely associated with the brown iris color or absence of pteridine in the Muscovy duck , whereas a direct link is not apparent for L129F, as the grey parrot shows colorless pteridines in its iris . Nevertheless, given the multifactorial nature of pigmentation process, the genotype-phenotype association across evolutionarily divergent lineages of birds supports that SLC2A11B is involved in avian iris color diversity.
SLC2A11B was under relaxed selection during or after avian divergence from its reptilian ancestor
With the development of full feather coverage in birds and/or their feathered non-avian dinosaur ancestors, the functional constraints on skin coloration may have been gradually lifted, followed by the degeneration of dermal chromatophores, leaving only those epidermal melanocytes responsible for feather pigmentation. It is likely that genes specialized in the development of xanthophores or iridophores, in this case SLC2A11B, could have experienced relaxation in natural selection during or after avian divergence from the ancestral reptile lineage.
To test this hypothesis, we analyzed SLC2A11B orthologs from 41 species, including five major vertebrate clades: Aves (birds), Crocodilia (crocodiles), Testudines (turtles), Squamata (scaled reptiles), and Teleostei (ray-finned fishes). RELAX analysis  was conducted, in which a statistic k is calculated denoting the strength of selection on a “Test” set of branches normalized to that on a “Reference” branch set. As the test involves modeling and inference of multiple mutational and selection parameters, information provided by sequence states of a single gene (636 codon sites for SLC2A11B and 572 codon sites for TYR) on a few branches may bear limitation. To avoid spurious significance and to strengthen data inference, we set three different schemes of test/reference branch designation, involving only basal (“basal”)/all internal (“internal”)/all (“clade”) branches of each group (Fig 7). For each scheme, relaxation of selection is tested for the branch(es) of each species group against the corresponding branches of the other four groups. We expected that birds show relaxation (k < 1) in the test branches for SLC2A11B, while no such significance was expected for TYR. Consistent results across three different schemes, or, the same conclusions based on information from different branches, would be most convincing.
Three different schemes of test/reference branch designations were illustrated by color-labeled branches (red/dark blue for “basal”, red and orange/dark and intermediate blue for “internal”, all warm colors/all cool colors for “clade”) in the phylogeny and were tested separately (see Materials and Methods). The results with birds (warm-colored branches) as test branches are shown in the table below (see S6 Table for more results). The k values significantly smaller than 1, which indicate relaxation of selection, are highlighted in bold green font. The P-values shown are after Bonferroni correction for multiple testing.
Relaxed selection in SLC2A11B was consistently detected along the avian clade (k < 1) under all three schemes of test/reference branch designations, but not in turtles or teleosts under any setting (Fig 7 and S6 Table). Only a weak relaxation signal was partially detected in crocodilians under the “basal” scheme and in squamates under the “clade” scheme, yet neither was consistent across all schemes (S6 Table). In contrast, no relaxed selection was consistently supported for any clade by more than one schemes for TYR, a key melanogenesis gene that is expected to be evolutionarily conserved and under similar selective pressure across all vertebrates (Fig 7 and S6 Table). Specifically, test for avian TYR is only significant under the “clade” scheme but not significant under either “basal” or “internal” scheme. This indicates that the relaxation of selection in TYR, if any, did not arise until the relatively recent diversification of birds and hence is not likely related to the initial avian divergence from reptile ancestors. Overall, relaxation of selection in SLC2A11B in the avian clade is supported, consistent with the scenario that SLC2A11B-involved development of dermal chromatophores in modern birds likely went through a process of degeneration following the emergence of feather coverage.
Through whole-genome sequencing and GWAS, we identified a nonsense mutation W49X in SLC2A11B as most likely responsible for the pigmentation change in the iris of pearl-eyed domestic pigeons. Most of the pearl-eyed individuals tested (141 out of 146) were homozygous for this mutation, consistent with its recessive mode of inheritance. One possible explanation for the exceptions of five pearl-eyed heterozygous pigeons might be additional variation(s) in SLC2A11B that give rise to the same phenotype. Since no other mutation was identified from SLC2A11B coding regions, the additional mutation(s) involved in pearl iris color might be located in a different gene or in the regulatory elements of SLC2A11B. Alternatively, the phenotype-genotype discordance in these five pearl-eyed individuals carrying one wild-type SLC2A11B allele might be due to mis-phenotyping. The iris of newly hatched domestic pigeons is black and gradually switches to brighter colors after two to three months. Phenotypes of all the pigeons sampled in this study were recorded at approximately five months old, when the iris color was usually stable. However, it is possible that the five heterozygous individuals might be younger than the others or that there could be individual variance in the developmental stage of pigeons at which iris pigmentation change may still be in progress. Unfortunately, we were unable to trace back to these five discordant pigeons to confirm the potential phenotyping error, as they were lost during the homing championship in the same year.
The W49X mutation leads to the truncation of approximately 90% of the amino acids of SLC2A11B and is predicted to cause a total loss of function in the protein. Besides iris, high expression of SLC2A11B was evident in brain, muscle, and retinal tissues, suggesting potentially multiple roles of this protein in addition to its involvement in pigmentation pathways. Therefore, it would be reasonable to expect consequences of the mutation other than iris pigmentation change. However, it seems that the influence of the SLC2A11B mutation in pigeons was restricted to iris pigmentation, as no other abnormality was apparent in pearl-eyed individuals. This is an observation consistent with the report that the loss of function of SLC2A11B in fishes affected only pigmentation cells . It is likely that the roles of SLC2A11B in non-pigmented cells of pearl-eyed pigeons (as well as other SLC2A11B mutants) could be compensated by the expression of other genes, most likely the other family members of solute carrier family 2.
Although the stromal pigment cells in the pigeon with gravel eyes are generally considered xanthophore-like, they show a reflecting effect that is normally absent in typical xanthophores. The stromal pigment cells in pigeons resemble the leucophores in medaka fish, in which the orange pigment is present at the larval stage but diminishes in the SLC2A11B-defect mutant . Therefore, it might be more appropriate to refer to pigeon iris stromal pigment cells as leucophore-like. Significantly reduced expression of CSF1R, a gene involved in xanthophore differentiation [36–38], was detected in the pearl iris (SLC2A11B mutant), which is consistent with the scenario that CSF1R is downstream of SLC2A11B in the regulatory network of xanthophore/leucophore differentiation. In addition, GCH1, the gene encoding the first and rate-limiting enzyme in the pteridine biosynthesis pathway [43,44], was also downregulated in the pearl iris. Since GCH1 directly participates in pteridine synthesis, it is plausible that GCH1 acts at the far end of the regulatory pathway of xanthophore/leucophore differentiation and triggers pteridine production upon receiving the upstream signal.
While the precise timing of pigeon domestication is unclear, it is commonly accepted that the pigeon was first domesticated at least 5,000 years ago in the Fertile Crescent [17–19] and probably served as a food resource for humans as early as 10,000 years ago [17,19,45]. The estimated time of the origin of the SLC2A11B W49X mutation at approximately 5,400 years ago is in line with the beginning of pigeon domestication, supporting the idea that pearl iris in domestic pigeons was a derived trait closely associated with the domestication process. Strong signals of positive selection were also detected, providing further evidence that the pearl iris trait has undergone artificial selection.
Even after thousands of years of domestication, little morphological difference exists today between a wild-type domestic pigeon and its conspecific rock pigeon; thus, it is reasonable to postulate that the pearl iris could serve as a marker for distinguishing a domesticated individual from its wild counterpart during the early stage of domestication. In modern breeds, the reason for the selection of the pearl iris is more complicated. As the eye is one of the classic judging criteria for pigeons, the pearl iris has been preferred in many show breeds, such as Show Homer, Runt, and Trumpeter. It is interesting that a very high frequency of pearl irises is observed in performance breeds as well, such as Tumbler, Roller, and Highflier. In addition to a random founder effect, breeders tend to hold an unfounded belief that pearl eyes in a pigeon are somewhat associated with intelligence and hence better performance.
We examined evolutionarily diverging bird species, whose iris pigmentation phenotype and genome data are both available, to explore the role of SLC2A11B in eye color determination. We found four mutations that might affect the SLC2A11B function in six bird species, within which Q39X in the house sparrow and L229M in the Muscovy duck well correspond to the lack of pteridines in their irises. The frameshift mutation T484Tfs3 is predicted to impair the function of SLC2A11B and is associated with the pteridine-absent iris color of the double-crested cormorant and greater rhea, though not with the American anhinga that carries the same mutation but has pale yellow eyes. This contradiction indicates that there could be other factors involved in the SLC2A11B pathway leading to pteridine production. Another inconsistency is L129F, which is also predicted to cause SLC2A11B deficiency, is present in the white-eyed grey parrot with pteridines in its iris tissue. It is interesting though the pteridines in the iris of grey parrot are colorless, other than ordinary yellow pigments. Whether such types of pteridines in the grey parrot are the results of SLC2A11B defect by L129F would be worth further investigation. Overall, despite the incongruence, association is evident between loss-of-function SLC2A11B variants and the absence of pteridines from iris in various avian taxa, thus supporting that the SLC2A11B pathway may serve as a common mechanism underlying the eye color diversity in Aves.
One landmark evolutionary transition setting the stage for the origin of birds was the development of feathery coverage that concealed their skin pigmentation in their feathered, non-avian dinosaur ancestors. The presence of fossilized non-avian dinosaurs with filaments or feathers suggested a complex integumentary covering formed prior to the avian divergence [46–49]. Therefore, the dermal chromatophores underneath feather might already become functionally redundant and likely subject to evolutionary demise before the onset of birds. In modern birds, the epidermal melanocyte is the only pigment cell distributed throughout the body and plays an important role in integument pigmentation. One exception lies in the iris, which seems to maintain the full developmental potential for all types of chromatophores and therefore has been proposed to be a pigment cell “refugium” .
In this context, our findings provide new insights into the mechanism underlying the evolutionary changes in pigment cells in Aves. First, SLC2A11B, a gene first found to be involved in the differentiation of xanthophores and leucophores in fish, is absent in mammals but intact in birds. Among the pigeon epithelial tissues, the expression of SLC2A11B is specific to the iris. Evolutionary analysis of SLC2A11B orthologs in vertebrates supported that, relative to other reptiles and fishes, SLC2A11B was under relaxed selection in the avian clade. We proposed that during and after avian divergence from the reptilian ancestor, with the newly evolved feather coverage lifting the selection pressure on the dermal coloration beneath, the expression of SLC2A11B and other specialized pigmentation genes may have gradually switched down or off in the dermis, thus resulting in the degeneration of the dermal chromatophores in the avian ancestor. In the iris, however, the expression of SLC2A11B and other pigmentation genes remained functional, and the pigment cells were sustained.
The roles of iris pigmentation genes could have gone through a dynamic and complicated process during avian evolution. At the early stage of avian radiation, it is likely that the iris color of the ancient birds exhibited limited diversity, as the evolutionary constraints on the genes involved in the development of the non-melanophore chromatophores were just being released. Subsequently, mutations accumulated in the dermal pigmentation genes, and iris color diversity in birds likely emerged, some of which might be of adaptive significance and subject to selection along different avian lineages. Further studies on SLC2A11B, as well as other pigmentation genes in birds, promise to illuminate the adaptative, functional, and evolutionary processes involved in avian coloration.
Materials and methods
All handling of animals and experimental protocols were approved by the Institutional Animal Care and Use Committee of Peking University (IACUC# LSC-LuoSJ-2) and the methods were performed in accordance with the relevant guidelines.
Feather samples of domestic pigeons were collected at the Xiangguan Pigeon Racing Club, Panjin City, Liaoning Province, China, in 2017. This pigeon racing club housed over 7,000 newly hatched pigeons recruited from hundreds of breeders in the spring and raised in a uniform manner for approximately half a year until the championship in the fall. All pigeons selected in the sample set had the same wild-type feather color (blue) to exclude the potential influence of other pigmentation genes on the iris color. No more than two pigeons from the same breeder were used to ensure unrelatedness among individuals. Pigeons were sampled at approximately five to six months old, when the iris color became stable. The iris color phenotype of each pigeon was examined visually during sampling and photographed. Four to six feathers with follicles were plucked from each individual. A total of 292 pigeons (146 with gravel eyes and 146 pearl eyes) from 237 breeders were gathered for the study. In addition, tissues of the brain, iris, retina, muscle, heart, liver, feather bud, and skin were collected from 14 domestic pigeons (nine with gravel eyes and five with pearl eyes). Tissue samples were immediately submerged in RNAlater reagent (Qiagen, Germany), stored in RNase-free 5 ml Eppendorf tubes at 4°C overnight, and then transferred to -80°C for long-term storage.
Genomic DNA from feather samples was extracted using a DNeasy Blood and Tissue Kit (QIAGEN, Valencia, California, USA) following the manufacturer’s instructions. DNA quantity and quality were examined using agarose gel electrophoresis, a NanoDrop spectrophotometer (Thermo Fisher Scientific, USA), and a Qubit fluorometer.
RNA was extracted from the brain, iris, retina, muscle, heart, liver, skin, and feather buds of nine gravel-eyed pigeons and the irises of five pearl-eyed pigeons. The tissues were carefully removed from RNAlater and homogenized in TRIzol Reagent (Invitrogen, USA). RNA was isolated following the manufacturer’s instructions and stored at -80°C for further use. The absence of RNA degradation and possible contamination was confirmed on 1% agarose gels.
Whole-genome resequencing, read mapping, and SNP calling
A total of 92 pigeons (49 with gravel eyes and 43 with pearl eyes) from 88 breeders were selected for whole-genome sequencing (S1 Table). Whole-genome resequencing was conducted at Mega Genomics Corporation, Beijing. For each genomic DNA extract, multiplex library preparation with a unique 6-bp sequence index tag was performed following the standard Illumina library construction protocol (Illumina, San Diego, California, USA). The libraries with an average insert size of 250–300 bp were sequenced using an Illumina NovaSeq sequencer, which generated 150 bp paired-end reads, reached an average sequencing depth of 16-fold coverage, and produced an average of 16 Gb of raw sequencing data per individual (S1 Table).
The adaptor sequences at both ends of the reads and bases with Phred quality <30 were trimmed with Cutadapt v1.16 . The processed reads were subsequently aligned to the domestic pigeon reference genome (Cliv_2.1 pigeon genome assembly) with Burrows-Wheeler Aligner v0.7.17 with the default options and parameters .
Sequence Alignment/Map (SAM) format files were imported to SAMtools v1.7 for binary format conversion (SAM to BAM) and sorted by coordinates using the default options and parameters [52,53]. We then masked and removed optical or PCR duplicate reads, QC failure reads, unmapped reads, supplementary alignment reads, and nonprimary aligned reads using SAMtools v1.7 such that only the unique mapped reads were retained (S1 Table) [52,53].
SNP and small indel calling was performed in GATK v126.96.36.199 according to the GATK best practices manual with the default parameters . Variant calling was performed with hard filters in GATK v188.8.131.52 and BCFtools v1.3.1 based on these filterExpression parameters in the VariantFiltration algorithm: FisherStrand (FS) >0.3, StrandOddsRatio (SOR) >2.0, RMSMappingQuality (MQ) <50.0, ReadPosRankSumTest (ReadPosRankSum) < -0.05 [52,54].
Linkage disequilibrium (LD) decay was measured by correlation coefficients (r2) in PopLDdecay v3.40 (http://github.com/BGI-shenzhen/PopLDdecay, accessed 21 Dec. 2018) with the following parameters: -MAF 0.1, -MaxDist 600, -Miss 0.6. The LD decay was plotted as pairwise LD versus pairwise distance between SNPs with a maximum distance of 50 kb using PopLDdecay .
Gene mapping by genome-wide association study (GWAS)
SNPs and indels were filtered with an overall quality score (QUAL) greater than 20, a minor allele frequency (maf) greater than 0.05, maximum missing genotype rates per variant (geno) greater than 0.1, and maximum missing genotype rates per sample (mind) greater than 0.1. The resulting 2,490,056 SNPs from 92 pigeons were used for a genome-wide association analysis (GWAS) with PLINK v1.9 , including 49 wild-type individuals set as the control group and 43 pearl-eyed individuals as the case group. The chi-square test was applied for differences between the case and control allele frequency distributions, and the level of significance cutoff was set at 2.01×10−8 after Bonferroni correction. The Manhattan plot and QQ plot were plotted using the qqman package in R .
Identification of causative mutation
The genotypes of all SNPs with an MAF greater than 0.1 and a P-value less than 2.01×10−8 in the genomic regions with significant GWAS signals were examined. A region with continuous homozygous genotypes shared by pearl-iris pigeons was considered a candidate region, and the genes within the region were considered candidate genes. SNPs and indels from the candidate region were screened for putative mutations associated with the gravel/pearl iris phenotype. After excluding the SNPs and indels in the noncoding region, SNPs and indels leading to amino acid changes were examined for evolutionary constraints at each affected residue site. The nonsynonymous substitutions at conserved sites among reptile and avian species, or indels causing reading frame shifting or affecting conserved amino acid residues, were considered putative mutations.
Causal mutation validation
The putative causal SLC2A11B mutation was validated in an extended collection of unrelated domestic pigeons with confirmed iris color phenotypes. The sample set consisted of 146 gravel-iris and 146 pearl-iris pigeons from China, including the 92 abovementioned individuals used in GWAS analysis. Full coding exons of SLC2A11B were further sequenced in five pearl-iris individuals. The primer sets used to amplify SLC2A11B exons (S7 Table) were designed on the basis of the domestic pigeon genome assembly (Cliv_2.1). PCR, subsequent Sanger sequencing, and sequence analysis were performed following previously described procedures .
Transmembrane model prediction
A 2D transmembrane model of SLC2A11B was constructed according to a schematic representation of the GLUT family of proteins from a previous study . The transmembrane regions and orientation of SLC2A11B were predicted by TMpred (https://embnet.vital-it.ch/software/TMPRED_form.html) and TMHMM Server v.2.0 (http://www.cbs.dtu.dk/services/TMHMM/) [60–62].
Transcriptome sequencing of RNA extracts from the iris tissues of four gravel- and five pearl-eyed pigeons was conducted at Novogene Corporation, Beijing, China. RNA quality and purity were evaluated using a NanoPhotometer spectrophotometer (IMPLEN, CA, USA). RNA concentration was measured using a Qubit RNA Assay Kit in a Qubit 2.0 Fluorometer (Life Technologies, CA, USA). The integrity of the RNA was assessed with an RNA Nano 6000 Assay Kit on the Agilent Bioanalyzer 2100 system (Agilent Technologies, CA, USA).
A total of 1.5 μg RNA per sample was used for RNA library preparation. Sequencing libraries were generated using the NEBNext Ultra RNA Library Prep Kit for Illumina (NEB, USA) following the manufacturer’s recommendations, and index codes were added to attribute sequences to each sample. Library quality was assessed on the Agilent Bioanalyzer 2100 system. The libraries were sequenced on an Illumina HiSeqXten platform and 150 bp paired-end reads were generated, producing approximately 10 Gb clean data per sample.
RNA-seq bioinformatics analysis
RNA-seq data were processed in one batch including case (five pearl irises) and control (four gravel irises) individuals. The reads were mapped to the pigeon reference genome assembly (Cliv_2.1) using HISAT v2.1.0 and counted against the predicted gene models using HTSeq-count [63,64]. The total number of aligned reads was normalized by gene length and sequencing depth for an accurate estimation of the expression level. These normalized read counts (TPM and FPKM) were used to represent the expression level of each gene, and differentially expressed genes were determined by DESeq2 . The genes were sorted according to their log2-transformed fold-change values in DESeq2, and a hierarchical clustering algorithm in the pheatmap R package was applied to generate the expression profiles of differentially expressed genes (DEGs) . The absolute log2(fold change) of 1 and padj of 0.05 were set as the threshold for significant DEGs. Analyses of Gene Ontology and KEGG pathway enrichment for 337 DEGs were performed in DAVID v6.8 (S8 Table) [67,68]. The linkage group information for each DEG was extracted from a previous study .
Quantitative real-time PCR (qPCR) validation
Quantitative PCR was performed to determine the SLC2A11B expression profiles in the brain, iris, retina, muscle, heart, liver, feather bud, and skin tissues from nine wild-type (gravel eye) pigeons and to validate the differential expression of SLC2A11B, CSF1R, and GCH1 between five pearl and four gravel iris tissues. RNA was reverse-transcribed to cDNA using a High-Capacity cDNA Reverse Transcription Kit (Thermo Fisher Scientific, USA) according to the manufacturer’s protocol. cDNA was amplified using intron-spanning primers (S7 Table) for each target by quantitative real-time PCR and PowerUp SYBR Green Master Mix (Applied Biosystems, USA) on a QuantStudio 3 Real-Time PCR instrument (Applied Biosystems, USA). Three replicates from each sample were performed to determine the mean value. Beta-actin (Actb) was chosen as the internal reference for gene normalization. Experimental data were manually analyzed in a normalized expression comparative Ct (2-ΔΔCT) model . The Wilcoxon rank sum test was used to compare the results and differences in expression levels were considered statistically significant if P < 0.05.
We downloaded published whole-genome sequencing data for 35 fancy pigeons, two feral pigeons, 10 racing pigeons, and one hill pigeon (Columba rupestris) (S4 Table) from NCBI and combined them with our genome data for 92 pigeons to fine map the peal iris causal genes and mutations. VCF files containing the genotypes of scaffold AKCR02000030.1 for all 140 individuals were phased using SHAPEIT v2, with the following parameters:—burn 10—prune 10—main 20—states 200—window 0.1—rho 0.001—effective-size 20000—thread 70 . The phased haplotypes of the Tr locus were divided into two clusters: the mutant haplogroup trmut containing the nonsense mutation (W49X) and the wild-type haplogroup Tr+. The phased file was converted to fasta format and then used for summary statistics and phylogenetic analysis.
Genetic diversity and selection analysis
The integrated haplotype score (iHS) analysis was applied to the haplotypes spanning the entire AKCR02000030.1 scaffold, to evaluate the extent of excess homozygosity around the ancestral or derived allele . Nucleotide diversity around the Tr locus (π, the average pairwise differences) and Tajima’s D (a measure of the skew in the site frequency spectrum) were calculated in 10 kb windows with a 1 kb step size , with different combinations of sequence type, population, and Tr+ or trmut haplotypes. The extended haplotype homozygosity (EHH) score was implemented to validate whether a partial selective sweep had occurred at the Tr+ and trmut haplotypes. EHH measures the relationship between the frequency of an allele of interest and the amount of LD in the surrounding region and provides the probability that two randomly chosen chromosomes out of a population are homozygous between the core haplotype and the increasingly distant SNP . Once a focal marker was given, the trmut mutation in this case, the LD decay from the core haplotype was measured for increasingly distant SNPs. The genetic diversity and Tajima’s D calculations were performed in TASSEL v5.0 , and the significance of differences was tested using the Wilcoxon rank sum test with continuity correction in R. EHH and iHS tests were implemented with the rehh package in R .
Phylogenetic trees were built based on trmut and Tr+ haplotypes across the SLC2A11B genic region. Individuals with GQ values less than 5 in the Tr mutation site were removed, and a sample set of 140 pigeons consisting of 102 racing pigeons, 35 fancy pigeons, two feral pigeons, and one hill pigeon (Columba rupestris) was retained. A 9-kb nonrecombinant region was selected after visual examination, and 24 trmut and 57 Tr+ haplotypes were identified from the dataset. A minimum evolution (ME) phylogenetic tree was constructed from the 81 haplotypes using the Kimura 2-parameter model and neighbor-joining (NJ) approach as implemented in PAUP v4.0a . The reliability of the nodes in the NJ tree was assessed by 1,000 bootstrap iterations. C. rupestris (NCBI: SAMN01057534) was selected as the outgroup. Phylogenetic trees were illustrated with the FIGTREE v1.3.1 package and modified manually.
A median-joining haplotype network was built based on SLC2A11B haplotypes from 102 racing pigeons, 35 fancy pigeons, and two feral pigeons. The haplotypes were reformatted using DnaSP v6.12.03  and constructed into a network using the network approach in PopART software .
TMRCA estimation of the trmut haplotypes
To trace the origin of the pearl iris mutation in the domestic pigeon, the most recent common ancestor (TMRCA) for all trmut alleles was estimated using startmrca . which leverages both the recombination rates and the accumulation of new mutations of the targeted allele’s ancestral haplotype. Relative to other approximate Bayesian computation methods, this approach is based on a hidden Markov model and the assumption that the focal allele is subject to positive selection. Individuals homozygous for the Tr+ allele were set as the reference panel. The selected and reference panels were set at 100 and 40 for each run. To obtain selection-onset time, independent Monte Carlo Markov (MCMC) chains were run 10 times, each with 200,000 iterations (first 6,000 iterations discarded as burn-in). The result with the highest posterior probability was considered the TMRCA estimate. To obtain confidence intervals, we took the 2.5th and 97.5th quantiles of each resulting distribution and calculated the recombination rates of the scaffold AKCR02000030.1 using FastEPRR with nonoverlapping 10 kb and 50 kb window sizes . We used a mutation rate of 1.42×10−9 per site per generation  and a generation time of one year.
Identification of SLC2A11B variation in Aves
We downloaded genome data for 34 avian species whose iris pigments were documented from NCBI (S5 Table) and extracted SLC2A11B coding sequences with blastn v2.7.1 . SLC2A11B orthologs from two Crocodilia (Crocodylus porosus, Alligator mississippiensis), six Testudines (Pelodiscus sinensis, Chelonoidis abingdonii, Chrysemys picta bellii, Terrapene Carolina triunguis, Pelusios castaneus, Gopherus agassizii), and nine Squamata (Anolis carolinensis, Varanus komodoensis, Pogona vitticeps, Salvator merianae, Naja naja, Pseudonaja textilis, Pantherophis guttatus, Laticauda laticaudata, Notechis scutatus) were downloaded from the Ensembl database (http://asia.ensembl.org/index.html, Release 101). These 51 coding sequences were aligned using MUSCLE (codon) in MEGA v10.1.8  for variant detection. The potential impacts of missense variations at evolutionarily conserved amino acid sites on protein function were predicted with SIFT .
Detection of relaxed selection in the avian SLC2A11B
To test for relaxed or intensified selection of SLC2A11B, we selected 41 orthologs of the zebrafish SLC2A11B and TYR genes in the Ensembl database (http://asia.ensembl.org/index.html, Release 101) and aligned the CDSs according to the corresponding amino acid sequence with the L-INS-i algorithm in MAFFT v7.471 . The CDSs of two cormorant species were truncated so that only the 1,434 bp before the nonsense substitution were retained. To obtain the tree topology of 41 vertebrate species, the initial tree was generated by http://timetree.org/and modified according to the species tree topology reported in various phylogenetic studies [85–88]. TYR, the key melanogenesis gene that is expected to be evolutionarily conserved and under consistent selective pressure across vertebrates, was used as a control in the analysis.
The RELAX test implemented in the HYPHY package  takes some branches in the tree as test branches and some others as reference branches (there can be unclassified branches) and infers a parameter k, which is the selection strength of positive or negative selection (i.e., deviation of omega from 1) on the test branches divided by that on the reference branches. Hence, if k is significantly smaller than 1, the selection on test branches is relaxed relative to that on reference branches; if k is significantly larger than 1, intensified selection is suggested. The level of significance of k is tested by a likelihood ratio test (LRT). Three different schemes of test/reference branch designation (basal, internal and clade) were applied: (1) the basal scheme assigns only the basal branches of each monophyletic group as test/reference branches; (2) the internal scheme assigns all internal branches of a monophyletic group as test/reference branches; and (3) the clade scheme assigns all (internal and external) branches within a clade as test/reference branches.
The 41 taxa involved in this analysis belong to five major monophyletic groups: Aves (birds), Crocodilia (crocodiles), Testudines (turtles), Squamata (scaled reptiles), and Teleostei (ray-finned fishes). For each scheme, we conducted the RELAX test separately for each group, with branches of each group used as test branches against corresponding branches from the other four groups as reference branches. We performed identical tests for the CDS of SLC2A11B and the negative control, TYR.
S1 Fig. The linkage disequilibrium (LD) decay in the pigeon genome.
The LD decay curve is based on the mean correlation coefficient (r2) between common SNPs (minor allele frequency ≥ 0.1). The threshold for “useful LD” is set with r2 < 0.2 at distances beyond 0.9 Kb. Under this scenario, the pigeon genome shows a rapid LD decay suggesting that the genome-wide SNPs generated from the sample set are nearly or completely independent from each other, and hence are sufficient for association mapping in pigeons.
S2 Fig. Quantile-Quantile (QQ) plot for GWAS.
The observed versus expected quantiles of the genome-wide association P-value shown in Fig 2A.
S3 Fig. Prediction of transmembrane regions and orientation of SLC2A11B protein based on the whole SLC2A11B sequence of 496 amino acids.
(A) TMHMM posterior probabilities of inside/outside/TM helix. The N-best prediction is displayed at the top where transmembrane regions are shown in red boxes. (B) Result output from TMpred server. The predicted transmembrane helices with scores above 500 are considered significant. The solid and dashed line indicates inside-to-outside and outside-to-inside transmembrane helices, respectively.
S4 Fig. Gene expression profile between pearl and gravel irises by RNA-seq.
The heatmap, hierarchically clustered into two groups of genes, recapitulates a total of 337 differentially expressed genes (DEGs), among which 295 and 42 genes were specifically upregulated in gravel (N = 4) and pearl (N = 5) irises, respectively. Columns are individual samples and rows indicate individual genes. The level of expression is color-coded from DESeq normalized counts, with red representing the higher level of expression, blue the lower level.
S5 Fig. Functional analysis of differentially expressed genes (DEGs) based on RNA-seq data.
(A) GO enrichment of 295 DEGs upregulated in gravel iris. (B) GO enrichment of 42 DEGs upregulated in pearl iris. The bubble diagrams show the degree of enrichment of Gene Ontology (GO) terms in three categories. The orange, blue, and black represent molecular function (MF), cellular component (CC), and biology process (BP) categories, respectively. Each bubble indicates a GO term, and the size of bubbles is proportional to the number of genes annotated to the GO term. P-value is represented by the color map.
S6 Fig. Haplotype network of 55 Tr+ and 24 trmut haplotypes.
The haplotype network was generated from 9 Kb nonrecombining Tr region from 139 domestic pigeons (35 fancy pigeons, 2 feral pigeons, and 102 racing pigeons). The mutations are shown by hatch marks. The orange and blue circles represent trmut and Tr+ haplotypes, respectively.
S7 Fig. Selection analysis of haplotypes containing W49X mutation (trmut) and wild-type haplotypes (Tr+) surrounding the Tr locus.
The integrated haplotype score (iHS) was calculated for scaffold AKCR02000030.1 in fancy pigeons (A) and racing pigeons (B). The gray lines represent the significance level of absolute iHS scores of 3 or greater. The extended haplotype homozygosity (EHH) decay across the Tr locus region is showed for fancy pigeons (C) and racing pigeons (D).
S8 Fig. The nonsense, missense and frame-shifting mutations of SLC2A11B in avian species.
A total of 52 coding sequences of 35 Aves, 2 Crocodilia, 6 Testudines, and 9 Squamata were aligned. The partial alignment of amino acid sequences of avian SLC2A11B with conserved species-specific mutations is shown. The conserved species-specific nonsense, missense and frame-shifting mutations are labeled on the top of the table, and the deleterious mutations are marked in red.
S1 Table. Sample information and resequencing data statistics.
S2 Table. Sample information and RNA-seq data statistics.
S3 Table. Differentially expressed genes from RNA-seq.
S4 Table. Sample information and resequencing data statistics from NCBI SRA library involved in this study.
S5 Table. Thirty-four avian genomes used for alignment of SLC2A11B coding sequence.
S6 Table. RELAX test results for SLC2A11B and TYR.
S7 Table. Primer sequences used in this study.
We thank J. -O. Gao and his team for coordinating the sampling logistics, H. Meng, X. Sun, H. Yu, Y. -C. Liu, Y. -T. Xing for collecting samples, C. Xie for helpful discussion, K. -W. Jiang for providing avian samples for validation, and all the pigeon owners for donating samples.
- 1. Hoekstra HE. Genetics, development and evolution of adaptive pigmentation in vertebrates. Heredity. 2006; 97(3):222–234. pmid:16823403
- 2. Hofreiter M, Schoneberg T. The genetic and evolutionary basis of colour variation in vertebrates. Cell Mol Life Sci. 2010; 67(15):2591–2603. pmid:20229234
- 3. Hubbard JK, Uy JA, Hauber ME, Hoekstra HE, Safran RJ. Vertebrate pigmentation: from underlying genes to adaptive function. Trends Genet. 2010; 26(5):231–239. pmid:20381892
- 4. Grether GF, Kolluru GR, Nersissian K. Individual colour patches as multicomponent signals. Biol Rev. 2004; 79(3):583–610. pmid:15366764
- 5. Kelsh RN. Genetics and evolution of pigment patterns in fish. Pigm Cell Res. 2004; 17(4):326–336. pmid:15250934
- 6. Olsson M, Stuart-Fox D, Ballen C. Genetics and evolution of colour patterns in reptiles. Semin Cell Dev Biol. 2013; 24(6–7):529–541. pmid:23578866
- 7. Oliphant LW, Hudon J, Bagnara JT. Pigment cell refugia in homeotherms—the unique evolutionary position of the iris. Pigm Cell Res. 1992; 5(6):367–371. pmid:1492070
- 8. Bond CJ. On certain factors concerned in the production of eye colour in birds. J Genet. 1919; 9(1):69–81. https://doi.org/10.1007/Bf02983518
- 9. Oliphant LW. Pteridines and purines as major pigments of the avian iris. Pigm Cell Res. 1987; 1(2):129–131. pmid:3507666
- 10. Snyder NFR, Snyder HA. Function of eye coloration in North American accipiters. Condor. 1974; 76(2):219–222. https://doi.org/10.2307/1366740
- 11. Scholten C. Iris colour of Humboldt penguins Spheniscus humboldti. Mar Ornithol. 1999; 27:187–194.
- 12. Bortolotti GR, Smits JE, Bird DM. Iris colour of American kestrels varies with age, sex, and exposure to PCBs. Physiol Biochem Zool. 2003; 76(1):99–104. pmid:12695990
- 13. Craig AJFK Hulley PE. Iris colour in passerine birds: why be bright-eyed? S Afr J Sci. 2004; 100(11–12):584–588.
- 14. Davidson GL, Clayton NS, Thornton A. Salient eyes deter conspecific nest intruders in wild jackdaws (Corvus monedula). Biol Lett. 2014; 10(2):20131077. pmid:24501271
- 15. Davidson GL, Thornton A, Clayton NS. Evolution of iris colour in relation to cavity nesting and parental care in passerine birds. Biol Lett. 2017; 13(1):20160783. pmid:28077686
- 16. Passarotto A, Parejo D, Cruz-Miralles A, Aviles JM. The evolution of iris colour in relation to nocturnality in owls. J Avian Biol. 2018; 49(12). https://doi.org/10.1111/jav.01908
- 17. Darwin . The variation of animals and plants under domestication, vol. 1. London: John Murray; 1868.
- 18. Price TD. Domesticated birds as a model for the genetics of speciation by sexual selection. Genetica. 2002; 116(2–3):311–327. https://doi.org/10.1023/A:1021248913179 pmid:12555787
- 19. Driscoll CA, Macdonald DW, O’Brien SJ. From wild animals to domestic pets, an evolutionary view of domestication. Proc Natl Acad Sci U S A. 2009; 106:9971–9978. pmid:19528637
- 20. Domyan ET, Shapiro MD. Pigeonetics takes flight: evolution, development, and genetics of intraspecific variation. Dev Biol. 2017; 427(2):241–250. pmid:27847323
- 21. Shapiro MD, Kronenberg Z, Li C, Domyan ET, Pan H, Campbell M, et al. Genomic diversity and evolution of the head crest in the rock pigeon. Science. 2013; 339(6123):1063–1067. pmid:23371554
- 22. Domyan ET, Guernsey MW, Kronenberg Z, Krishnan S, Boissy RE, Vickrey AI, et al. Epistatic and combinatorial effects of pigmentary gene mutations in the domestic pigeon. Curr Biol. 2014; 24(4):459–464. pmid:24508169
- 23. Domyan ET, Kronenberg Z, Infante CR, Vickrey AI, Stringham SA, Bruders R, et al. Molecular shifts in limb identity underlie development of feathered feet in two domestic avian species. Elife. 2016; 5:e12115. pmid:26977633
- 24. Vickrey AI, Domyan ET, Horvath MP, Shapiro MD. Convergent evolution of head crests in two domesticated columbids is associated with different missense mutations in EphB2. Mol Biol Evol. 2015; 32(10):2657–2664. pmid:26104009
- 25. Vickrey AI, Bruders R, Kronenberg Z, Mackey E, Bohlender RJ, Maclary ET, et al. Introgression of regulatory alleles and a missense coding mutation drive plumage pattern diversity in the rock pigeon. Elife. 2018; 7:e34803. pmid:30014848
- 26. Gazda MA, Andrade P, Afonso S, Dilyte J, Archer JP, Lopes RJ, et al. Signatures of selection on standing genetic variation underlie athletic and navigational performance in racing pigeons. Mol Biol Evol. 2018; 35(5):1176–1189. pmid:29547891
- 27. Boer EF, Van Hollebeke HF, Park S, Infante CR, Menke DB, Shapiro MD. Pigeon foot feathering reveals conserved limb identity networks. Dev Biol. 2019; 454(2):128–144. pmid:31247188
- 28. Shao Y, Tian HY, Zhang JJ, Kharrati-Koopaee H, Guo X, Zhuang XL, et al. Genomic and phenotypic analyses reveal mechanisms underlying homing ability in pigeon. Mol Biol Evol. 2020; 37(1):134–148. pmid:31501895
- 29. Bruders R, Van Hollebeke H, Osborne EJ, Kronenberg Z, Maclary E, Yandell M, et al. A copy number variant is associated with a spectrum of pigmentation patterns in the rock pigeon (Columba livia). PLoS Genet. 2020; 16(5):e1008274. pmid:32433666
- 30. Oliphant LW. Observations on the pigmentation of the pigeon iris. Pigm Cell Res. 1987; 1(3):202–208. pmid:3508278
- 31. Hollander W, Owen R. Iris pigmentation in domestic pigeons. Genetica. 1939; 21(5–6):408–419. https://doi.org/10.1007/BF01508127
- 32. Oliphant LW. Crystalline pteridines in the stromal pigment cells of the iris of the great horned owl. Cell Tissue Res. 1981; 217(2):387–395. pmid:7237534
- 33. Sweijd N, Craig AJFK. Histological basis of age-related changes in iris color in the African Pied Starling (Spreo bicolor). Auk. 1991; 108(1):53–59. https://doi.org/10.1093/auk/108.1.53
- 34. Holt C, Campbell M, Keays DA, Edelman N, Kapusta A, Maclary E, et al. Improved genome assembly and annotation for the rock pigeon (Columba livia). G3: Genes Genom Genet. 2018; 8(5):1391–1398. pmid:29519939
- 35. Kimura T, Nagao Y, Hashimoto H, Yamamoto-Shiraishi Y, Yamamoto S, Yabe T, et al. Leucophores are similar to xanthophores in their specification and differentiation processes in medaka. Proc Natl Acad Sci U S A. 2014; 111(20):7343–7348. pmid:24803434
- 36. Kelsh RN, Harris ML, Colanesi S, Erickson CA. Stripes and belly-spots—a review of pigment cell morphogenesis in vertebrates. Semin Cell Dev Biol. 2009; 20(1):90–104. pmid:18977309
- 37. Patterson LB, Bain EJ, Parichy DM. Pigment cell interactions and differential xanthophore recruitment underlying zebrafish stripe reiteration and Danio pattern evolution. Nat Commun. 2014; 5(1):1–9. pmid:25374113
- 38. Singh AP, Nusslein-Volhard C. Zebrafish stripes as a model for vertebrate colour pattern formation. Curr Biol. 2015; 25(2):R81–R92. pmid:25602311
- 39. Ding D, Jiang H, Chen GD, Longo-Guess C, Muthaiah VP, Tian C, et al. N-acetyl-cysteine prevents age-related hearing loss and the progressive loss of inner hair cells in gamma-glutamyl transferase 1 deficient mice. Aging (Albany N Y). 2016; 8(4):730–750. pmid:26977590
- 40. Imsland F, McGowan K, Rubin CJ, Henegar C, Sundstrom E, Berglund J, et al. Regulatory mutations in TBX3 disrupt asymmetric hair pigmentation that underlies Dun camouflage color in horses. Nat Genet. 2016; 48(2):152–158. pmid:26691985
- 41. Oehme H. Vergleichende Untersuchungen über die Färbung der Vogeliris. Biol Zbl. 1969; 88:3–35.
- 42. Wertheim JO, Murrell B, Smith MD, Pond SLK, Scheffler K. RELAX: detecting relaxed selection in a phylogenetic framework. Mol Biol Evol. 2015; 32(3):820–832. pmid:25540451
- 43. Ziegler I. The pteridine pathway in zebrafish: regulation and specification during the determination of neural crest cell-fate. Pigm Cell Res. 2003; 16(3):172–182. pmid:12753383
- 44. Braasch I, Schartl M, Volff JN. Evolution of pigment synthesis pathways by gene and genome duplication in fish. BMC Evol Biol. 2007; 7(1):1–18. pmid:17498288
- 45. Shapiro MD, Domyan ET. Domestic pigeons. Curr Biol. 2013; 23(8):R302–303. pmid:23618660
- 46. Rauhut OW, Foth C, Tischlinger H, Norell MA. Exceptionally preserved juvenile megalosauroid theropod dinosaur with filamentous integument from the Late Jurassic of Germany. Proc Natl Acad Sci U S A. 2012; 109(29):11746–11751. pmid:22753486
- 47. Zelenitsky DK, Therrien F, Erickson GM, DeBuhr CL, Kobayashi Y, Eberth DA, et al. Feathered non-avian dinosaurs from North America provide insight into wing origins. Science. 2012; 338(6106):510–514. pmid:23112330
- 48. Sullivan C, Wang Y, Hone DWE, Wang YQ, Xu X, Zhang FC. The vertebrates of the Jurassic Daohugou Biota of northeastern China. J Vertebr Paleontol. 2014; 34(2):243–280. https://doi.org/10.1080/02724634.2013.787316
- 49. Norell MA, Xu X. Feathered dinosaurs. Annu Rev Earth Planet Sci. 2005; 33:277–299. https://doi.org/10.1146/annurev.earth.33.092203.122511
- 50. Martin M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet J. 2011; 17(1):10–12. pmid:28715235
- 51. Li H, Durbin R. Fast and accurate long-read alignment with Burrows-Wheeler transform. Bioinformatics. 2010; 26(5):589–595. pmid:20080505
- 52. Li H. A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data. Bioinformatics. 2011; 27(21):2987–2993. pmid:21903627
- 53. Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, et al. The sequence alignment/map format and SAMtools. Bioinformatics. 2009; 25(16):2078–2079. pmid:19505943
- 54. McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A, et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010; 20(9):1297–1303. pmid:20644199
- 55. Zhang C, Dong SS, Xu JY, He WM, Yang TL. PopLDdecay: a fast and effective tool for linkage disequilibrium decay analysis based on variant call format files. Bioinformatics. 2019; 35(10):1786–1788. pmid:30321304
- 56. Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MAR, Bender D, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007; 81(3):559–575. pmid:17701901
- 57. Turner SD. qqman: an R package for visualizing GWAS results using QQ and manhattan plots. J Open Source Softw. 2018; 3(25):731. https://doi.org/10.1101/005165
- 58. Xu X, Dong GX, Hu XS, Miao L, Zhang XL, Zhang DL, et al. The genetic basis of white tigers. Curr Biol. 2013; 23(11):1031–1035. pmid:23707431
- 59. Bryant NJ, Govers R, James DE. Regulated transport of the glucose transporter GLUT4. Nat Rev Mol Cell Biol. 2002; 3(4):267–277. pmid:11994746
- 60. Hofmann K. TMbase—A database of membrane spanning proteins segments. Biol Chem Hoppe-Seyler. 1993; 374:166.
- 61. Krogh A, Larsson B, von Heijne G, Sonnhammer ELL. Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. J Mol Biol. 2001; 305(3):567–580. pmid:11152613
- 62. Sonnhammer EL, von Heijne G, Krogh A. A hidden Markov model for predicting transmembrane helices in protein sequences. Proc Int Conf Intell Syst Mol Biol. 1998; 6:175–182. pmid:9783223
- 63. Anders S, Pyl PT, Huber W. HTSeq—a Python framework to work with high-throughput sequencing data. Bioinformatics. 2015; 31(2):166–169. pmid:25260700
- 64. Kim D, Langmead B, Salzberg SL. HISAT: a fast spliced aligner with low memory requirements. Nat Methods. 2015; 12(4):357–360. pmid:25751142
- 65. Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014; 15(12):1–21. pmid:25516281
- 66. Kolde R. Pheatmap: pretty heatmaps. R package version 1.0. 8. 2015.
- 67. Huang DW, Sherman BT, Lempicki RA. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc. 2009; 4(1):44–57. pmid:19131956
- 68. Huang DW, Sherman BT, Lempicki RA. Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists. Nucleic Acids Res. 2009; 37(1):1–13. pmid:19033363
- 69. Livak KJ, Schmittgen TD. Analysis of relative gene expression data using real-time quantitative PCR and the 2(T)(-Delta Delta C) method. Methods. 2001; 25(4):402–408. pmid:11846609
- 70. O’Connell J, Gurdasani D, Delaneau O, Pirastu N, Ulivi S, Cocca M, et al. A general approach for haplotype phasing across the full spectrum of relatedness. PLoS Genet. 2014; 10(4):e1004234. pmid:24743097
- 71. Voight BF, Kudaravalli S, Wen X, Pritchard JK. A map of recent positive selection in the human genome. PLoS Biol. 2006; 4(3):e72. pmid:16494531
- 72. Tajima F. Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. Genetics. 1989; 123(3):585–595. pmid:2513255
- 73. Sabeti PC, Reich DE, Higgins JM, Levine HZP, Richter DJ, Schaffner SF, et al. Detecting recent positive selection in the human genome from haplotype structure. Nature. 2002; 419(6909):832–837. pmid:12397357
- 74. Bradbury PJ, Zhang Z, Kroon DE, Casstevens TM, Ramdoss Y, Buckler ES. TASSEL: software for association mapping of complex traits in diverse samples. Bioinformatics. 2007; 23(19):2633–2635. pmid:17586829
- 75. Gautier M, Vitalis R. rehh: an R package to detect footprints of selection in genome-wide SNP data from haplotype structure. Bioinformatics. 2012; 28(8):1176–1177. pmid:22402612
- 76. Wilgenbusch JC, Swofford D. Inferring evolutionary trees with PAUP*. Curr Protoc Bioinformatics. 2003; (1):6.4. 1–6.4. 28. pmid:18428704
- 77. Rozas J, Ferrer-Mata A, Sanchez-DelBarrio JC, Guirao-Rico S, Librado P, Ramos-Onsins SE, et al. DnaSP 6: DNA sequence polymorphism analysis of large data sets. Mol Biol Evol. 2017; 34(12):3299–3302. pmid:29029172
- 78. Leigh JW, Bryant D. POPART: full-feature software for haplotype network construction. Methods Ecol Evol. 2015; 6(9):1110–1116. https://doi.org/10.1111/2041-210x.12410
- 79. Smith J, Coop G, Stephens M, Novembre J. Estimating time to the common ancestor for a beneficial allele. Mol Biol Evol. 2018; 35(4):1003–1017. pmid:29361025
- 80. Gao F, Ming C, Hu WJ, Li HP. New software for the fast estimation of population recombination rates (FastEPRR) in the genomic era. G3: Genes Genom Genet. 2016; 6(6):1563–1571. pmid:27172192
- 81. Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, Bealer K, et al. BLAST+: architecture and applications. BMC Bioinformatics. 2009; 10(1):1–9. pmid:20003500
- 82. Kumar S, Stecher G, Li M, Knyaz C, Tamura K. MEGA X: molecular evolutionary genetics analysis across computing platforms. Mol Biol Evol. 2018; 35(6):1547–1549. pmid:29722887
- 83. Sim NL, Kumar P, Hu J, Henikoff S, Schneider G, Ng PC. SIFT web server: predicting effects of amino acid substitutions on proteins. Nucleic Acids Res. 2012; 40(W1):W452–W457. pmid:22689647
- 84. Katoh K, Standley DM. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol. 2013; 30(4):772–780. pmid:23329690
- 85. Near TJ, Eytan RI, Dornburg A, Kuhn KL, Moore JA, Davis MP, et al. Resolution of ray-finned fish phylogeny and timing of diversification. Proc Natl Acad Sci U S A. 2012; 109(34):13698–13703. pmid:22869754
- 86. Prum RO, Berv JS, Dornburg A, Field DJ, Townsend JP, Lemmon EM, et al. A comprehensive phylogeny of birds (Aves) using targeted next-generation DNA sequencing. Nature. 2015; 526(7574):569–573. pmid:26444237
- 87. Reeder TW, Townsend TM, Mulcahy DG, Noonan BP, Wood PL, Sites JW, et al. Integrated analyses resolve conflicts over squamate reptile phylogeny and reveal unexpected placements for fossil taxa. PLoS One. 2015; 10(3):e0118199. pmid:25803280
- 88. Hedges SB. Amniote phylogeny and the position of turtles. BMC Biology. 2012; 10(1):1–2. pmid:22839753