DNA methylation plays an important role for mammalian development. However, it is unclear whether the DNA methylation pattern is evolutionarily conserved. The Y chromosome serves as a powerful tool for the study of human evolution because it is transferred between males. In this study, based on deep-rooted pedigrees and the latest Y chromosome phylogenetic tree, we performed epigenetic pattern analysis of the Y chromosome from 72 donors. By comparing their respective DNA methylation level, we found that the DNA methylation pattern on the Y chromosome was stable among family members and haplogroups. Interestingly, two haplogroup-specific methylation sites were found, which were both genotype-dependent. Moreover, the African and Asian samples also had similar DNA methylation pattern with a remote divergence time. Our findings indicated that the DNA methylation pattern on the Y chromosome was conservative during human male history.
Citation: Zhang M, Wang C-C, Yang C, Meng H, Agbagwa IO, Wang L-X, et al. (2016) Epigenetic Pattern on the Human Y Chromosome Is Evolutionarily Conserved. PLoS ONE 11(1): e0146402. doi:10.1371/journal.pone.0146402
Editor: Yan Zhang, Harbin Medical University, CHINA
Received: July 2, 2015; Accepted: November 2, 2015; Published: January 13, 2016
Copyright: © 2016 Zhang et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the paper and its Supporting Information files. The GEO accession number for the DNA methylation data reported in this paper is GSE73412.
Funding: This work was supported by Strategic Priority Research Program of the Chinese Academy of Sciences (grant no. XDA01040407), National Natural Science Foundation of China (grant no. 91019024, grant no. 31100558, and grant no. 91131002), National Basic Research Program of China (973 Program, grant no. 2012CB518302, and grant no. 2013CB911001), National Excellent Youth Science Foundation of China (grant no. 31222030), Shanghai Rising-Star Program (grant no. 12QA1400300), China Ministry of Education Scientific Research Major Project (grant no. 311016 and 113022A), MOE University Doctoral Research Supervisor’s Funds (grant no. 20120071110021), and Shanghai Professional Development Funding (grant no. 2010001).
Competing interests: The authors have declared that no competing interests exist.
DNA methylation, which refers to as the covalent addition of a methyl group to the fifth carbon of cytosine (resulting in the production of 5-methylcytosine at CpG sites), is called the “fifth base” of the DNA code . As an important type of epigenetic modification, DNA methylation plays essential roles in many biological processes, including gene regulation, mammalian development, X chromosome inactivation, and genomic imprinting [2–7]. Moreover, abnormal methylation modifications represent an important link to disease susceptibility, such as in Rett syndrome, monogenic disease, and cancer [8–11]. Previous studies showed that double knockout of the DNA methyltransferases DNMT1 and DNMT3a/3b in mice could result in defects in embryogenesis [12, 13]. Recently, a lot of research focused on the study of DNA methylation during mammalian development, reprogramming, and inheritance [14–16]. Several studies showed that the genome-wide DNA methylation underwent methylation reprogramming during early embryonic development [17–20].
However, whether DNA methylation can be stably passed from generation to generation like the genetic code is an open question remaining to be elucidated. Unfortunately, there are many obstacles to address this question in humans. First, DNA methylation pattern varies from tissue to tissue, which makes it difficult to select a standard for comparison . Second, although quite a few population-specific DNA methylation patterns have been found, it is difficult to make a connection between these variations and certain populations or haplogroups . Furthermore, epigenetic variations after recombination between sister chromatids also make epigenetic analysis difficult and complicated.
Recently, researchers found that more than 2000 differential methylation regions (DMRs) existed between ancient and modern human population . Moreover, Radford et al. found that the germline DNA methylation of adult mice could be altered by the in utero nutritional environment of embryos, while the next generation would not be affected . However, it is difficult to study epigenetic changes in more than 4 generations in one family pedigree.
Interestingly, the Y chromosome, unlike other chromosomes, is transferred between males and is only inherited from father to son. A lot of research work revealed important and unappreciated role of the Y chromosome of human [25–28]. Most regions on the Y chromosome remains comparable stability and purity after inheritance through multiple generations. Therefore, studying the methylation pattern on the Y chromosome could help overcome the above obstacles.
Single-nucleotide polymorphisms (SNPs) on the Y chromosome have been successfully utilized to reconstruct the Y chromosome phylogeny and to identify the divergence time of different lineages [29–34]. Through the detection of phylogenetically relevant SNPs within non-recombinant regions of the Y chromosome (NRY), we reconstructed the Y chromosome phylogeny of 72 donors [35–38]. The divergence time of these samples ranges from approximately 50 years to several thousand years ago (kya) . Further, we compared their DNA methylation pattern on the Y chromosome.
Based on the above powerful tools and analysis, we found that DNA methylation pattern on the Y chromosome was conserved within family trees and haplogroups despite of a few variations within individual sample. We also revealed that there were some genotype-related polymorphic DNA methylation variations on the Y chromosome. The conserved methylation pattern within each haplogroup and the genotype-dependent polymorphic variations shared by all individuals in specific haplogroups indicated that the methylation pattern on the Y chromosome was stable during human history.
Materials and Methods
Sample collection and DNA extraction
To study the methylation pattern of different haplogroups, we collected whole blood samples from 72 male donors. Our study was approved by the Ethics Committee of Beijing Institute of Genomics, Chinese Academy of Sciences, School of Life Sciences, Fudan University, and the University of Port Harcourt. All individuals were adequately informed and signed an informed consent form before participating in the study. These samples were all collected based on deep-rooted pedigrees and the Y chromosome phylogenetic tree described in our previously published work. DNA was extracted from the whole blood using QIAamp DNA Mini Kit (QIAGEN, Hilden, Germany).
In addition to the previously listed primers of the latest Y chromosome phylogenetic tree, we also designed several primers for newly typed SNPs [40, 41]. Overall, we tested more than 100 Y chromosome SNPs to establish the Y chromosome haplotype of 72 donors.
DNA samples were bisulfate-converted using a ZymoEZ DNA Methylation Kit (Zymo Research). After being amplified and enzymatically fragmented, the resulted fragments were purified by isopropanol precipitation and hybridized to the Infinium Human Methylation 450 BeadChip array (Illumina Inc., San Diego, CA, USA). After hybridization for 18 hours, extension, staining, and washing were performed successively. Finally, the BeadChip was imaged using the iScan system (Illumina, Inc.). In total, over 485,000 methylation sites were analyzed per sample, and the entire assay was performed according to the manufacturer’s instructions.
The raw files were processed by quality control and normalization using the software Genome Studio (Illumina). Then the methylation values of the tested sites were obtained as β-values, which were calculated as the methylation signal intensity divided by the sum of both methylation and unmethylation signals with background subtraction. β-values range from 0 to 1, representing completely unmethylated and fully methylated sites, respectively. Furthermore, sites with a detection p-value > 0.05 and sites containing missing values across all 72 samples were excluded.
Statistical analysis of methylation level of functional regions
The Illumina Methylation Analyzer (IMA) package in the R language was used to calculate the methylation level of each of the following functional regions: TSS1500 (-1500 bp from the nearest TSS), TSS200 (-200 bp from the nearest TSS), 5’UTR, EXON1 (1st exon of genes), 3’UTR, Gene Body, CpG islands, NSHORE (-2 kb region flanking the CpG island), SSHORE (+2 kb region flanking the CpG island), NSHELF (-4 to -2 kb region flanking the CpG island), and SSHELF (+2 to +4 kb region flanking the CpG island). The methylation level of each region was represented by the mean methylation level of all tested sites within each region.
Software for data visualization
Hierarchical clustering was performed in R using the “heatmap” package. Moreover, the methylation pattern of the Y-chromosome was loaded into IGV for visualization.
Analysis of haplogroups, sample selection, and preparation
The inheritance of the Y chromosome is more stable than other chromosomes (Fig 1A). Deep-rooting pedigrees have been utilized to estimate the mutation rate of the Y chromosome specific microsatellites or base substitutions (Fig 1B) [42, 43]. Moreover, with single nucleotide polymorphism (SNP) analysis of the human Y-chromosome, researchers successfully found 13 haplotypes in a Japanese population . We established several haplogroups with 72 samples using single nucleotide primer extension and SNP analysis based on the same method (Fig 1C and 1D).
A) The abridged general view of stable inheritance of the Y chromosome among male generations. Recombination between homologous chromosomes are existed in other chromosomes. B) Illustration diagram of the SNP inheritance on the Y chromosome. Each color star represents a new SNP, and it can be inherited to the next generation. C) The abbreviated form of the Y chromosome phylogenetic tree. Haplogroup E and O were selected for this study. D) The tree of haplogroup O and its subgroups in Han Chinese population. The methylation patterns of bold labeled subclades are studied.
Further, the DNA methylation level of all samples was detected using the Illumina 450K methylation microarray (Illumina, Inc., San Diego, CA, USA). We set up to determine whether the human epigenome was conserved by comparing the DNA methylation level of each haplogroup or subgroup. In this study, we focused on analyzing the methylation level of the Y chromosome. We tested the DNA methylation level of two technical replicates of two samples in advance. For all sites on the Y chromosome, the correlation of methylation profiles between two technical replicates was high (r > 0.99) (S1 Fig), indicating that this technology was robust for detecting the methylation pattern of the Y chromosome.
DNA methylation pattern on the Y chromosome was similar in three families of haplogroup O2*
First, we selected 11 samples from Rushan, Shandong, China. They came from three families (families A, B, and C). They were all the descendants of Emperor CAO Cao (155AD–220AD), one of the most famous Emperors in China, according to the deep-rooting pedigrees [33, 34]. As shown in Fig 2A, two trios were included. They all belonged to haplogroup O2*, based on the Y chromosome haplogroup analysis (Fig 2A, S1 Table) .
A) Pedigree showing the relationship of family members. B) Heat map showing the methylation level of the Y chromosome in each sample. Each row line represents a single tested site, with each vertical line showing the β-value obtained in each individual sample. DNA methylation level is colored from orange to red to indicate low to high. C) The DNA methylation level is shown around the annotated genes and CpG Islands of the Y chromosome. D) Bar graphs showing the distribution of hyper-, intermediate-, and hypo-methylation sites on the Y chromosome in each sample respectively. Red indicates the hyper-methylation sites (β-value > 0.8), blue indicates intermediate-methylation sites (β-value: 0.2 to 0.8), and green indicates hypo-methylation sites (β-value < 0.2).
404 sites on the Y chromosome were obtained for further analysis after filtering 12 CpG sites containing missing values or with detection p-values greater than 0.05. The data showed that there was no remarkable variation of the methylation pattern on the Y chromosome among family members or among different families (Fig 2B and 2C). Moreover, the distribution of hyper-methylation sites (β-value > 0.8), intermediate-methylation sites (β-value: 0.2–0.8), and hypo-methylation sites (β-value < 0.2) on the Y chromosome were similar for all individuals (Fig 2D). Specifically, the hyper-methylation sites and hypo-methylation sites were highly consistent within family members (S2 Fig).
To further investigate whether methylation pattern on the Y chromosome is correlated with relations, the mean β-value of each site was calculated. We did not find any family A-specific methylation sites (Wilcoxon Rank-sum Test, FDR-adjusted p < 0.05, and |beta_difference| ≥ 0.2) by comparing the mean β-value of family A with the mean β-value of the other two families (family B and family C). There was also no family-specific methylation pattern between family B and family C.
DNA methylation pattern on the Y chromosome was conservative among different geographic subgroups of haplogroup O2*
Given the above results, we speculated whether the methylation level of the Y chromosome was also conserved among subgroups of haplogroup O2*. To test this hypothesis, whole blood samples were collected from 17 healthy male individuals of haplogroup O2* from different regions (Donggang, Yancheng, Shucheng, Yaopu, Wangchuan, and Xuwen) (Fig 3A, S1 Table) . The result showed that there hardly no variation among geographic groups regarding the overall methylation pattern (Fig 3B and 3C). The distribution of hyper-methylation, intermediate-methylation, and hypo-methylation sites were similar among all individuals (Fig 3D). Further, no any geography-specific methylation sites were found in haplogroup O2* (Wilcoxon Rank-sum Test, FDR-adjusted p < 0.05, and |beta_difference| ≥ 0.2).
A) Geographic distribution of 28 haplogroup O2* individuals. B) Heat map showing the methylation level of the Y chromosome in each sample. Each row line represents a single site, with each vertical line showing the β-value obtained in each individual sample. DNA methylation level is colored from orange to red to indicate low to high. C) The DNA methylation level is shown around the annotated genes and CpG Islands of the Y chromosome. D) Bar graphs showing the distribution of hyper-, intermediate-, and hypo-methylation sites on the Y chromosome in each haplogroup O2* sample.
The common hyper-methylation sites (mean β-value > 0.8) and hypo-methylation sites (mean β-value < 0.2) among all above samples were selected for further investigation. The values of standard deviation (SD) of the common hyper-methylation sites were less than 0.1 (S3A Fig), indicating that DNA methylation pattern was relatively conserved. However, there was a few variation of DNA methylation in the hypo-methylation sites (the values of SD in several sites were larger than 0.1) (S3A Fig). Further, we assessed the methylation level of these common hyper- and hypo-methylation sites of each sample. We found that the methylation level of common hyper-methylation sites were similar among all O2* samples, showing no correlation with the family relation or geographic factors (S3B Fig). However, we found quite a few remarkably different methylation sites in Sample 243 within the common hypo-methylation sites (S3C Fig). Because we analyzed a total of 28 samples from haplogroup O2*, the variation of Sample 243 was not haplogroup-specific. One possibility was that the differential methylation pattern of Sample 243 was individualized unique.
All these results suggested that the DNA methylation pattern on the Y chromosome was conserved within haplogroup O2* samples.
Specific methylation sites were found in haplogroup O3a2b samples
We collected whole blood samples from 27 healthy male donors to analyze their DNA methylation patterns on the Y chromosome according to the most recent Y chromosome phylogenetic tree. These samples came from different haplogroups (O1a1-P203, O3a1c*-002611, O3a1c1*-F449, O3a1c1a-F238, O3a1c2a*-F11/F197/F526, O3a1c2a3-F196, O3a2b-M7, O3a2c*-P164, O3a2c1*-M134, and O3a2c1a-M117) (Fig 4A, S1 Table) [41, 45]. Sample RS2-1, RS4, TA4, and XC1 were randomly selected to be representative of haplogroup O2*. The divergence time of these haplogroups was approximately 24.7 thousand years ago (kya) .
A) The information of Y haplotype and coalescence time for samples. The left bar graph indicates the coalescence time, from 27 thousand years ago (kya) to now. Numbers in red indicate the coalescence time (in years) and 95% confidence interval of the node. B) Heat map showing the Pearson correlation coefficients among different haplogroups. Correlation coefficients are colored yellow to red to indicate low to high, respectively. C) Box plots showing the haplogroup O3a2b-specific methylation sites. **P < 0.01. D) The methylation level of cg07765982 and cg13365400 within all samples. Each data point represents the β-value of each sample.
To determine whether the DNA methylation pattern on the Y chromosome was also conservative among different haplogroups, Pearson correlation coefficients between each haplogroup were calculated. The result showed that the correlation of most haplogroups was high (Fig 4B). However, relatively low correlation between haplogroup O3a2b (JF1, JF2, XA3, XA7) and other haplogroups was found. Moreover, the sample LACB0902 in haplogroup O3a2c1a also showed weak correlation with other samples (Fig 4B).
We found two specific methylation sites in haplogroup O3a2b samples by comparing mean β-values between different haplogroups (Target ID: cg07765982 and cg13365400) (Fig 4C). To confirm the existence of these two haplogroup-specific sites, we collected another 8 male samples of haplogroup O3a2b. These 8 samples were collected from 6 distant provinces of China to exclude the influence of geographic factor on the methylation pattern. Overall, all of 12 samples from haplogroup O3a2b shared one haplogroup-specific methylation site (cg07765982) (S4A Fig, S2 Table).
The methylation level of cg07765982 was low in all haplogroup O3a2b samples (Fig 4D). The analysis began with four haplogroup O3a2b samples (XA3, XA7, XB0, XB2) sharing the hypomethylation level at cg13365400 site (Fig 4D). These four samples were all collected from Shexian, Anhui, China (S4B Fig). Further, 2 other samples (XA1, XA2) that also came from Shexian, Anhui, China were included, which belonged to haplogroup O1a1 (O-P203). By comparing the methylation pattern of cg13365400 among the above 6 samples (one geographical position, two different haplogroups), we found that the difference was haplogroup-specific rather than geography-specific (S4C Fig). One explanation for the above data was that the methylation variation of cg07765982 existed in the common ancestor of haplogroup O3a2b and was inherited by all the descendants.
We further tested 4 additional haplogroup O3a2c1a samples to assess whether the methylation pattern of sample LACB0902 was haplogroup O3a2c1a-specfic. 8 variant sites were selected with a cut-off SD > 0.1 after calculating the standard deviation of the β-value of each site among all 7 haplogroup O3a2c1a samples. However, these sites were only differentially methylated in sample LACB0902 (S4D Fig), indicating that the difference of the methylation pattern in sample LACB0902 was individual rather than haplogroup-specific.
Haplogroup O3a2b-specific methylation sites were located in gene body region on the Y chromosome
According to the human reference sequence (hg19), the tested sites on the Y chromosome were distributed on 11 regions: TSS1500 (-1500 bp from the nearest TSS), TSS200 (-200 bp from the nearest TSS), 5’UTR, EXON1 (1st exon of genes), 3’UTR, Gene Body, CpG islands, NSHORE (-2 kb region flanking the CpG island), SSHORE (+2 kb region flanking the CpG island), NSHELF (-4 to -2 kb region flanking the CpG island), and SSHELF (+2 to +4 kb region flanking the CpG island) (S3 Table). The mean methylation level of all tested sites within each region was taken as this region’s methylation index.
We found that the variation in gene body region was greater than in other regions by calculating the standard deviation of each region among all samples (Fig 5A). Further, we assessed the overall methylation pattern of 53 tested genes. Result showed that the methylation pattern of two genes was haplogroup O3a2b-specific (LOC100101116, TTTY1) (Fig 5C). However, we did not find such a haplogroup-specific variation on the other 10 functional regions (Fig 5B and 5D, S5 Fig).
A) Box plots showing the standard deviation of methylation level within each region. The median line indicates the average methylation level, the edges represent the 25th/75th percentile, and the whiskers represent the 2.5th/97.5th percentile. B−D). Heat map showing the methylation levels of 38 detected TSS1500 regions (B), 53 gene body regions (C), and 55 CpG island regions (D).
One haplogroup E-specific methylation site was found with a remote divergence time
To further test our theory, we collected 5 samples from Nigeria, Africa. These male donors came from haplogroup E1b1a1 (E-M2), and they had diverged from haplogroup O over 54.1 thousand years ago (Fig 6A). After filtering sites with detection p-values > 0.05 and with missing values, the remaining sites on the Y chromosome were used for further investigation.
A) The Y chromosome phylogenetic tree showing the coalescence time between haplogroup E and haplogroup O. Numbers in red indicate the coalescence time (in years) and 95% confidence interval of the node. B) Box plots showing the methylation level of haplogroup E1b1a1-specific methylation site. **P < 0.01. C) The methylation level of cg05782707 among all samples. Each data point represents the β-value of each sample. D) The genomic location of cg05782707 site.
As shown in Fig 6B, we found one haplogroup E1b1a1-specific methylation site (cg05782707) (Fig 6B, S4 Table), which was relatively hypo-methylated in all 5 haplogroup E samples (Fig 6C). We found that this site located in the gene body region of TTTY22 after mapping it to the human reference sequence (hg19) (Fig 6D).
The haplogroup-specific methylation sites identified on the Y chromosome were genotype-dependent
Quite a few studies revealed that DNA mutations adjacent to the CpG site can affect its methylation level [46, 47]. To determine whether the haplogroup O3a2b-specific methylation site was affected by SNPs, we performed mutation detection analysis. Result suggested that the haplogroup O3a2b-specific methylation was accompanied with genetic mutation at the corresponding CpG site (S6 Fig). We further found that the genomic deletion around haplogroup E-specific methylation site contributed to its specific methylation level. Taken together, the above two haplogroup-specific methylation sites were both genotype-dependent.
DNA methylation on other chromosomes were not as conserved as on the Y chromosome
Further, we selected the other two chromosomes to analyze their variance among all samples. We found that the degree of variation on the other two chromosomes was greater than that on the Y chromosome by assessing the standard deviation of each detected site across all samples (S7A Fig). We performed principal component analysis and found that the methylation pattern on the other two chromosomes were variable within the same haplogroup using chromosome 12 and the X chromosome as examples (S7B Fig). Therefore, the DNA methylation pattern on other chromosomes was not conserved during human history. This complicated pattern maybe caused by frequent spontaneous recombination between sister chromatids [48–50]. Further, we found there were family-specific methylation variations through investigating the methylation pattern on all chromosomes between 3 families of haplogroup O2* (S8A Fig). Moreover, we found more haplogroup-specific methylation sites between haplogroup O2* and haplogroup O3 (S8B Fig).
It is well known that SNPs can be inherited and accumulated during human male evolution (Fig 7A). In this study, we also found that the methylation pattern on the Y chromosome could also be stably inherited during human male history (Fig 7B). Our result demonstrated an interesting fact that the DNA methylation pattern on the Y chromosome was relatively stable during evolution.
A) Illustration model of the SNP inheritance on the Y chromosome. Each color star represents a new SNP, and it can be inherited to the next generation. B) The inheritance schematic model of DNA methylation pattern on the Y chromosome. Red star indicates a haplogroup-specific methylation site. This model shows the stable methylation pattern on the Y chromosome within each haplogroup. White circles represent unmethylated sites, black circles represent the methylated sites. Numbers in red indicate the coalescence time (in years) and the 95% confidence interval of the node.
It is well acknowledged that Y chromosome is a powerful tool for human evolution study, since it is transferred between males and remains relative stability after multi-generation inheritance. Based on this powerful tool, we analyzed a pedigree from haplogroup O2*. By analyzing their DNA methylation level, we found that the DNA methylation pattern on the Y chromosome was conserved within haplogroup O2*, even though the divergence time between these samples was approximately 1800 years ago. Further, we investigated the DNA methylation pattern among different haplogroups. We detected two haplogroup O3a2b-specific methylation sites (cg07765982 and cg13365400). The variant methylation site cg13365400 existed in 4 samples (Shexian, Anhui, China), and this variation was haplogroup-specific rather than geography-specific. Interestingly, after performing DNA mutation analysis, we found that these sites were accompanied by SNPs. Moreover, we found a haplogroup E-specific methylation site (cg05782707) with a remote divergence time, and this site was also associated with genetic mutation.
Several studies showed that the genome-wide DNA methylation underwent methylation reprogramming during early embryonic development [17–20]. In particular, the methylation of CpG Islands on the Y chromosome also underwent the above process (S9 Fig). However, one key question in this field is whether epigenetic modification, such as DNA methylation, can be stably passed from generation to generation like the DNA sequence. In our study, based on deep-rooted pedigrees and the Y chromosome phylogenetic tree, we found that the methylation pattern on the Y chromosome could also be stably inherited during human male history.
It is well known that Y chromosome had lost more than 1,393 genes since its existence, giving a rate of genetic loss of 4.6 genes per million years . However, human Y chromosome had not lost any genes since humans had diverged from chimpanzees, dating back to 6–7 million years ago . It indicates that gens on the Y chromosome have important roles in human development. Our work indicated that the basic DNA methylation pattern on the Y chromosome could be stably passed from generation to generation like the DNA sequence for almost 50 thousand years, indicating that DNA methylation may also play an important role in the development and evolution of human males.
Interestingly, we also found that all haplogroup-specific DNA methylation variations were genetically related. One possible explanation is that although basic DNA methylation pattern is stable, environmentally caused DNA methylation variation could not be inherited between generations unless it is genetically SNPs related. The fact that these individual variations of the DNA methylation could not be passed between generations may partially be resulted from DNA methylation reprogramming, which erases and resets whole genome DNA methylation during development.
Our study was conducted with DNA from peripheral blood cells, which do not express Y chromosome genes in any significant manners. Accordingly, the relative stable DNA methylation pattern on the Y chromosome in other tissues or cells will need further study. Additionally, Daniel et al. found that the surviving genes with X homologues on the Y chromosome were dosage-sensitive, which function as regulators of transcription, translation and protein stability . Further, we detected the DNA methylation pattern of promoter regions of these genes to study the potential effects of DNA methylation on these genes. Results showed that the methylation level of these genes was conserved among different haplogroup samples. Meanwhile, most of these genes were hyper-methylated within their promoter regions, indicating their relatively inactive state in peripheral blood cells.
The DNA methylation on the Y chromosome is relatively stable. This indicates that the Y chromosome is not as fragile as previously suggested [51, 53–56]. The function and phenotypes of human males have been protected by a stable DNA methylation pattern for tens of thousands of years. Our work also provided a unique research platform to study epigenetic-related diseases in the future. With the aid of deep-rooted pedigrees and the Y chromosome phylogenetic tree, we can perform novel family pedigree associated epigenetic studies for certain diseases.
S1 Fig. Scatterplots showing Pearson correlation coefficients between two repeated analyses of two samples.
S2 Fig. The common hyper- and hypo-methylation sites among each family samples.
A) Venn diagram showing the overlap of hyper-methylation sites among samples of family A, family B, and family C, respectively. B) Venn diagram showing the overlap of hypo-methylation sites among samples of family A, family B, and family C, respectively.
S3 Fig. The DNA methylation level of the Y chromosome within haplogroup O2* samples.
A) Box plots showing the distribution of standard deviation among the methylation levels of common hyper- and hypo-methylation sites in haplogroup O2* samples. B) Box plots illustrating the methylation level of common hyper-methylation sites in each sample. C) Box plots illustrating the methylation level of common hypo-methylation sites in each sample. The median line indicates the average methylation level, the edges represent the 25th/75th percentile, and the whiskers represent the 2.5th/97.5th percentile.
S4 Fig. The DNA methylation pattern on the Y chromosome among different haplogroup samples.
A) The methylation level of cg07765982 and cg13365400 between different haplogroups. **P < 0.01. B) The methylation level of cg13365400 within all 12 haplogroup O3a2b samples. C) The methylation level of cg13365400 within 6 samples (one geographical position, two different haplogroups). D) DNA methylation level of 8 LACB0902 unique methylation sites in all haplogroup O3a2c1a samples. Each data point represents the β-value obtained in each sample.
S5 Fig. The methylation pattern of another 8 functional regions.
A−G). Heat map showing the average methylation levels of TSS200 region (A), 5’UTR region (B), EXON1 region (C), 3’UTR region (D), NSHELF region (E), NSHORE region (F), SHELF region (G) and SHORE region (H).
S6 Fig. The genotype analysis of the haplogroup O3a2b-specific methylation site.
Sanger sequencing showing a nucleotide mutation within the haplogroup O3a2b samples.
S7 Fig. Stable DNA methylation pattern on the Y chromosome.
A) Box plots showing the distribution of standard deviation of the methylation levels on each chromosome. The median line indicates the average methylation level, the edges represent the 25th/75th percentile, and the whiskers represent the 2.5th/97.5th percentile. B) Principal component analysis of the methylation pattern on chromosome 12, the X chromosome, and the Y chromosome in all samples. Each data point represents an individual sample.
S8 Fig. Whole genome DNA methylation analysis of three haplogroup O2* families and different haplogroups.
A) Heat map showing the family-specific DNA methylation sites on whole genome. B) Heat map showing the haplogroup O2* and haplogroup O3-specific DNA methylation sites on whole genome. Each vertical line represents a single site, with each row showing the β-value obtained in each individual tested.
S9 Fig. The DNA methylation reprogramming process during early human embryonic development.
Published methylation data showing a de-methylation and then re-methylation process during early human embryonic development. Each data point represents the mean β-value of each stage.
S1 Table. Sample information.
S2 Table. Haplogroup O3a2b-specific methylation site.
S3 Table. Eleven regional categories on the Y chromosome.
S4 Table. Haplogroup E1b1a1-specific methylation site.
We thank all the volunteers and some organizers of the Cao clans, including Xiaohu Cao and Xinwen Cao, for sample collection.
Conceived and designed the experiments: YLS. Performed the experiments: HL MJZ CYY HM. Analyzed the data: YLS MJZ CYY. Contributed reagents/materials/analysis tools: CCW MJZ CYY SY HM IOA. Wrote the paper: YLS MJZ CYY. Performed the bioinformatics analyses: MJZ. Provided the bioinformatics data analysis and interpretation: LJ. Carried out Y chromosome haplogroup genotyping: CCW LXW YZW. Provided critical technical assistance and expertise: XL JL GP SCR YHS. Revised the manuscript: CCW HL.
- 1. Piccolo FM, Fisher AG. Getting rid of DNA methylation. Trends in cell biology. 2014;24(2):136–43. doi: 10.1016/j.tcb.2013.09.001 pmid:24119665.
- 2. Laurent L, Wong E, Li G, Huynh T, Tsirigos A, Ong CT, et al. Dynamic changes in the human methylome during differentiation. Genome research. 2010;20(3):320–31. doi: 10.1101/gr.101907.109 pmid:20133333; PubMed Central PMCID: PMC2840979.
- 3. Jones PA, Takai D. The role of DNA methylation in mammalian epigenetics. Science. 2001;293(5532):1068–70. doi: 10.1126/science.1063852 pmid:11498573.
- 4. Irizarry RA, Ladd-Acosta C, Wen B, Wu ZJ, Montano C, Onyango P, et al. The human colon cancer methylome shows similar hypo- and hypermethylation at conserved tissue-specific CpG island shores. Nature genetics. 2009;41(2):178–86. doi: 10.1038/Ng.298 pmid:WOS:000263091300017.
- 5. Reik W. Stability and flexibility of epigenetic gene regulation in mammalian development. Nature. 2007;447(7143):425–32. doi: 10.1038/nature05918 pmid:17522676.
- 6. Sado T, Fenner MH, Tan SS, Tam P, Shioda T, Li E. X inactivation in the mouse embryo deficient for Dnmt1: distinct effect of hypomethylation on imprinted and random X inactivation. Developmental biology. 2000;225(2):294–303. doi: 10.1006/dbio.2000.9823 pmid:10985851.
- 7. Sharp AJ, Stathaki E, Migliavacca E, Brahmachary M, Montgomery SB, Dupre Y, et al. DNA methylation profiles of human active and inactive X chromosomes. Genome research. 2011;21(10):1592–600. doi: 10.1101/gr.112680.110 pmid:21862626; PubMed Central PMCID: PMC3202277.
- 8. Horike S, Cai S, Miyano M, Cheng JF, Kohwi-Shigematsu T. Loss of silent-chromatin looping and impaired imprinting of DLX5 in Rett syndrome. Nature genetics. 2005;37(1):31–40. doi: 10.1038/ng1491 pmid:15608638.
- 9. Amir RE, Van den Veyver IB, Wan M, Tran CQ, Francke U, Zoghbi HY. Rett syndrome is caused by mutations in X-linked MECP2, encoding methyl-CpG-binding protein 2. Nature genetics. 1999;23(2):185–8. pmid:WOS:000082827500017. doi: 10.1016/s0387-7604(01)00376-x
- 10. Kriaucionis S, Bird A. DNA methylation and Rett syndrome. Hum Mol Genet. 2003;12:R221–R7. doi: 10.1093/Hmg/Ddg2863 pmid:WOS:000185896800013.
- 11. Baylin SB, Jones PA. A decade of exploring the cancer epigenome—biological and translational implications. Nature Reviews Cancer. 2011;11(10):726–34. doi: 10.1038/Nrc3130 pmid:WOS:000295155600012.
- 12. Li E, Bestor TH, Jaenisch R. Targeted Mutation of the DNA Methyltransferase Gene Results in Embryonic Lethality. Cell. 1992;69(6):915–26. doi: 10.1016/0092-8674(92)90611-F pmid:WOS:A1992HY79200004.
- 13. Okano M, Bell DW, Haber DA, Li E. DNA methyltransferases Dnmt3a and Dnmt3b are essential for de novo methylation and mammalian development. Cell. 1999;99(3):247–57. doi: 10.1016/S0092-8674(00)81656-6 pmid:WOS:000083440600003.
- 14. Smith ZD, Meissner A. DNA methylation: roles in mammalian development. Nat Rev Genet. 2013;14(3):204–20. doi: 10.1038/Nrg3354 pmid:WOS:000315613100012.
- 15. Walsh CP, Bestor TH. Cytosine methylation and mammalian development. Genes & development. 1999;13(1):26–34. doi: 10.1101/Gad.13.1.26 pmid:WOS:000078121400004.
- 16. Geiman TM, Muegge K. DNA Methylation in Early Development. Mol Reprod Dev. 2010;77(2):105–13. doi: 10.1002/Mrd.21118 pmid:WOS:000273518200004.
- 17. Jiang L, Zhang J, Wang JJ, Wang L, Zhang L, Li GQ, et al. Sperm, but Not Oocyte, DNA Methylome Is Inherited by Zebrafish Early Embryos. Cell. 2013;153(4):773–84. doi: 10.1016/j.cell.2013.04.041 pmid:WOS:000318844000008.
- 18. Wang L, Zhang J, Duan J, Gao X, Zhu W, Lu X, et al. Programming and inheritance of parental DNA methylomes in mammals. Cell. 2014;157(4):979–91. doi: 10.1016/j.cell.2014.04.017 pmid:24813617; PubMed Central PMCID: PMC4096154.
- 19. Guo H, Zhu P, Yan L, Li R, Hu B, Lian Y, et al. The DNA methylation landscape of human early embryos. Nature. 2014;511(7511):606–10. doi: 10.1038/nature13544 pmid:25079557.
- 20. Smith ZD, Chan MM, Humm KC, Karnik R, Mekhoubad S, Regev A, et al. DNA methylation dynamics of the human preimplantation embryo. Nature. 2014;511(7511):611–+. doi: 10.1038/nature13581 pmid:WOS:000339566300039.
- 21. Ziller MJ, Gu H, Muller F, Donaghey J, Tsai LT, Kohlbacher O, et al. Charting a dynamic DNA methylation landscape of the human genome. Nature. 2013;500(7463):477–81. doi: 10.1038/nature12433 pmid:23925113; PubMed Central PMCID: PMC3821869.
- 22. Fraser HB, Lam LL, Neumann SM, Kobor MS. Population-specificity of human DNA methylation. Genome biology. 2012;13(2):R8. doi: 10.1186/gb-2012-13-2-r8 pmid:22322129; PubMed Central PMCID: PMC3334571.
- 23. Gokhman D, Lavi E, Prufer K, Fraga MF, Riancho JA, Kelso J, et al. Reconstructing the DNA methylation maps of the Neandertal and the Denisovan. Science. 2014;344(6183):523–7. doi: 10.1126/science.1250368 pmid:24786081.
- 24. Radford EJ, Ito M, Shi H, Corish JA, Yamazawa K, Isganaitis E, et al. In utero undernourishment perturbs the adult sperm methylome and intergenerational metabolism. Science. 2014. doi: 10.1126/science.1255903 pmid:25011554.
- 25. Forsberg LA, Rasi C, Malmqvist N, Davies H, Pasupulati S, Pakalapati G, et al. Mosaic loss of chromosome Y in peripheral blood is associated with shorter survival and higher risk of cancer. Nature genetics. 2014;46(6):624–8. doi: 10.1038/Ng.2966 pmid:WOS:000336870700021.
- 26. Dumanski JP, Rasi C, Lonn M, Davies H, Ingelsson M, Giedraitis V, et al. Smoking is associated with mosaic loss of chromosome Y. Science. 2015;347(6217):81–3. doi: 10.1126/science.1262092 pmid:WOS:000347102300053.
- 27. Bellott DW, Hughes JF, Skaletsky H, Brown LG, Pyntikova T, Cho TJ, et al. Mammalian Y chromosomes retain widely expressed dosage-sensitive regulators. Nature. 2014;508(7497):494–9. doi: 10.1038/nature13206 pmid:24759411; PubMed Central PMCID: PMC4139287.
- 28. Vijayakumar S, Garcia D, Hensel CH, Banerjee M, Bracht T, Xiang R, et al. The human Y chromosome suppresses the tumorigenicity of PC-3, a human prostate cancer cell line, in athymic nude mice. Genes, chromosomes & cancer. 2005;44(4):365–72. doi: 10.1002/gcc.20250 pmid:16080199.
- 29. Francalacci P, Morelli L, Angius A, Berutti R, Reinier F, Atzeni R, et al. Low-pass DNA sequencing of 1200 Sardinians reconstructs European Y-chromosome phylogeny. Science. 2013;341(6145):565–9. doi: 10.1126/science.1237947 pmid:23908240.
- 30. Poznik GD, Henn BM, Yee MC, Sliwerska E, Euskirchen GM, Lin AA, et al. Sequencing Y chromosomes resolves discrepancy in time to common ancestor of males versus females. Science. 2013;341(6145):562–5. doi: 10.1126/science.1237619 pmid:23908239; PubMed Central PMCID: PMC4032117.
- 31. Ke YH, Su B, Song XF, Lu DR, Chen LF, Li HY, et al. African origin of modern humans in East Asia: A tale of 12,000 Y chromosomes. Science. 2001;292(5519):1151–3. doi: 10.1126/science.1060011 pmid:WOS:000168609200056.
- 32. Balaresque P, Poulet N, Cussat-Blanc S, Gerard P, Quintana-Murci L, Heyer E, et al. Y-chromosome descent clusters and male differential reproductive success: young lineage expansions dominate Asian pastoral nomadic populations. European journal of human genetics: EJHG. 2015. doi: 10.1038/ejhg.2014.285 pmid:25585703.
- 33. Wang CC, Yan S, Yao C, Huang XY, Ao X, Wang ZF, et al. Ancient DNA of Emperor CAO Cao's granduncle matches those of his present descendants: a commentary on present Y chromosomes reveal the ancestry of Emperor CAO Cao of 1800 years ago. Journal of human genetics. 2013;58(4):238–9. doi: 10.1038/Jhg.2013.5 pmid:WOS:000318012700013.
- 34. Wang CC, Yan S, Hou Z, Fu WQ, Xiong MM, Han S, et al. Present Y chromosomes reveal the ancestry of Emperor CAO Cao of 1800 years ago. Journal of human genetics. 2012;57(3):216–8. doi: 10.1038/Jhg.2011.147 pmid:WOS:000302126700012.
- 35. Jobling MA, Tyler-Smith C. The human Y chromosome: an evolutionary marker comes of age. Nat Rev Genet. 2003;4(8):598–612. doi: 10.1038/nrg1124 pmid:12897772.
- 36. Hammer MF. A recent common ancestry for human Y chromosomes. Nature. 1995;378(6555):376–8. doi: 10.1038/378376a0 pmid:7477371.
- 37. Thomson R, Pritchard JK, Shen P, Oefner PJ, Feldman MW. Recent common ancestry of human Y chromosomes: evidence from DNA sequence data. Proceedings of the National Academy of Sciences of the United States of America. 2000;97(13):7360–5. pmid:10861004; PubMed Central PMCID: PMC16550. doi: 10.1073/pnas.97.13.7360
- 38. Underhill PA, Shen P, Lin AA, Jin L, Passarino G, Yang WH, et al. Y chromosome sequence variation and the history of human populations. Nature genetics. 2000;26(3):358–61. doi: 10.1038/81685 pmid:11062480.
- 39. Yan S, Wang CC, Zheng HX, Wang W, Qin ZD, Wei LH, et al. Y chromosomes of 40% Chinese descend from three Neolithic super-grandfathers. PloS one. 2014;9(8):e105691. doi: 10.1371/journal.pone.0105691 pmid:25170956; PubMed Central PMCID: PMC4149484.
- 40. Karafet TM, Mendez FL, Meilerman MB, Underhill PA, Zegura SL, Hammer MF. New binary polymorphisms reshape and increase resolution of the human Y chromosomal haplogroup tree. Genome research. 2008;18(5):830–8. doi: 10.1101/gr.7172008 pmid:18385274; PubMed Central PMCID: PMC2336805.
- 41. Yan S, Wang CC, Li H, Li SL, Jin L, Genographic C. An updated tree of Y-chromosome Haplogroup O and revised phylogenetic positions of mutations P164 and PK4. European journal of human genetics: EJHG. 2011;19(9):1013–5. doi: 10.1038/ejhg.2011.64 pmid:21505448; PubMed Central PMCID: PMC3179364.
- 42. Heyer E, Puymirat J, Dieltjes P, Bakker E, de Knijff P. Estimating Y chromosome specific microsatellite mutation frequencies using deep rooting pedigrees. Hum Mol Genet. 1997;6(5):799–803. pmid:9158156. doi: 10.1093/hmg/6.5.799
- 43. Xue Y, Wang Q, Long Q, Ng BL, Swerdlow H, Burton J, et al. Human Y chromosome base-substitution mutation rate measured by direct sequencing in a deep-rooting pedigree. Curr Biol. 2009;19(17):1453–7. doi: 10.1016/j.cub.2009.07.032 pmid:19716302; PubMed Central PMCID: PMC2748900.
- 44. Inagaki S, Yamamoto Y, Doi Y, Takata T, Ishikawa T, Yoshitome K, et al. Typing of Y chromosome single nucleotide polymorphisms in a Japanese population by a multiplexed single nucleotide primer extension reaction. Legal medicine. 2002;4(3):202–6. pmid:12935671. doi: 10.1016/s1344-6223(02)00032-9
- 45. Shi H, Dong YL, Wen B, Xiao CJ, Underhill PA, Shen PD, et al. Y-chromosome evidence of southern origin of the East Asian-specific haplogroup O3-M122. American journal of human genetics. 2005;77(3):408–19. doi: 10.1086/444436 pmid:16080116; PubMed Central PMCID: PMC1226206.
- 46. Bell JT, Pai AA, Pickrell JK, Gaffney DJ, Pique-Regi R, Degner JF, et al. DNA methylation patterns associate with genetic and gene expression variation in HapMap cell lines. Genome biology. 2011;12(1). Artn R1010.1186/Gb-2011-12-1-R10. pmid:WOS:000289437000005. doi: 10.1186/gb-2011-12-1-r10
- 47. Gertz J, Varley KE, Reddy TE, Bowling KM, Pauli F, Parker SL, et al. Analysis of DNA Methylation in a Three-Generation Family Reveals Widespread Genetic Influence on Epigenetic Regulation. Plos Genet. 2011;7(8). ARTN e100222810.1371/journal.pgen.1002228. pmid:WOS:000294297000024. doi: 10.1371/journal.pgen.1002228
- 48. Van Heemst D, Heyting C. Sister chromatid cohesion and recombination in meiosis. Chromosoma. 2000;109(1–2):10–26. doi: 10.1007/s004120050408 pmid:WOS:000087061000003.
- 49. Sears DD, Hegemann JH, Shero JH, Hieter P. Cis-Acting Determinants Affecting Centromere Function, Sister-Chromatid Cohesion and Reciprocal Recombination during Meiosis in Saccharomyces-Cerevisiae. Genetics. 1995;139(3):1159–73. pmid:WOS:A1995QJ96800005.
- 50. McRae AF, Powell JE, Henders AK, Bowdler L, Hemani G, Shah S, et al. Contribution of genetic variation to transgenerational inheritance of DNA methylation. Genome biology. 2014;15(5). Artn R73 doi: 10.1186/Gb-2014-15-5-R73 pmid:WOS:000338981700007.
- 51. Graves JAM. The degenerate Y chromosome—can conversion save it? Reprod Fert Develop. 2004;16(5):527–34. doi: 10.1071/RD03096 pmid:WOS:000222811600002.
- 52. Hughes JF, Skaletsky H, Pyntikova T, Minx PJ, Graves T, Rozen S, et al. Conservation of Y-linked genes during human evolution revealed by comparative sequencing in chimpanzee. Nature. 2005;437(7055):101–4. doi: 10.1038/nature04101 pmid:WOS:000231560400048.
- 53. Stahl PR, Kilgue A, Tennstedt P, Minner S, Krohn A, Simon R, et al. Y chromosome losses are exceedingly rare in prostate cancer and unrelated to patient age. Prostate. 2012;72(8):898–903. doi: 10.1002/pros.21492 pmid:WOS:000303197500009.
- 54. Takahashi S, Alcaraz A, Brown JA, Borell TJ, Herath JF, Bergstralh EJ, et al. Aneusomies of chromosomes 8 and Y detected by fluorescence in situ hybridization are prognostic markers for pathological stage C (pt3N0M0) prostate carcinoma. Clinical cancer research: an official journal of the American Association for Cancer Research. 1996;2(1):137–45. pmid:9816100.
- 55. Tricoli JV. Y chromosome enumeration in touch preparations from 42 prostate tumors by interphase fluorescence in situ hybridization analysis. Cancer Genet Cytogen. 1999;111(1):1–6. doi: 10.1016/S0165-4608(98)00212-X pmid:WOS:000080043400001.
- 56. Jin Y, Mertens F, Mandahl N, Heim S, Olegard C, Wennerberg J, et al. Chromosome abnormalities in eighty-three head and neck squamous cell carcinomas: influence of culture conditions on karyotypic pattern. Cancer Res. 1993;53(9):2140–6. pmid:8481917.