Individuals with a higher education are more likely to migrate, increasing the chance of meeting a spouse with a different ancestral background. In this context, the presence of strong educational assortment can result in greater ancestry differences within more educated spouse pairs, while less educated individuals are more likely to mate with someone with whom they share more ancestry. We examined the association between educational attainment and Froh (= the proportion of the genome consisting of runs of homozygosity [ROHs]) in ~2,000 subjects of Dutch ancestry. The subjects’ own educational attainment showed a nominally significant negative association with Froh (p = .045), while the contribution of parental education to offspring Froh was highly significant (father: p < 10-5; mother: p = 9×10-5), with more educated parents having offspring with fewer ROHs. This association was significantly and fully mediated by the physical distance between parental birthplaces (paternal education: pmediation = 2.4 × 10-4; maternal education: pmediation = 2.3 × 10-4), which itself was also significantly associated with Froh (p = 9 × 10-5). Ancestry-informative principal components from the offspring showed a significantly decreasing association with geography as parental education increased, consistent with the significantly higher migration rates among more educated parents. Parental education also showed a high spouse correlation (Spearman’s ρ = .66, p = 3 × 10-262). We show that less educated parents are less likely to mate with the more mobile parents with a higher education, creating systematic differences in homozygosity due to ancestry differences not directly captured by ancestry-informative principal components (PCs). Understanding how behaviors influence the genomic structure of a population is highly valuable for studies on the genetic etiology of behavioral, cognitive, and social traits.
Citation: Abdellaoui A, Hottenga J-J, Willemsen G, Bartels M, van Beijsterveldt T, Ehli EA, et al. (2015) Educational Attainment Influences Levels of Homozygosity through Migration and Assortative Mating. PLoS ONE10(3): e0118935. https://doi.org/10.1371/journal.pone.0118935
Academic Editor: Brion Maher, Johns Hopkins Bloomberg School of Public Health, UNITED STATES
Received: July 22, 2014; Accepted: January 7, 2015; Published: March 3, 2015
Copyright: © 2015 Abdellaoui et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited
Data Availability: The Affymetrix Human Genome-Wide SNP 6.0 Array genotypes from the Dutch subjects in this study are available on dbGaP with the title Integration of Genomics & Transcriptomics in Normal Twins & Major Depression (#phs 000486.v1.p1) and Genomics of Developmental Trajectories in Twins (http://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000486.v1.p1), together with phenotypic data on educational attainment.
Funding: This work was supported by the Netherlands Organization for Scientific Research (NWO: MagW/ZonMW grants 904-61-090, 985-10-002,904-61-193,480-04-004, 400-05-717, Addiction-31160008 Middelgroot-911-09-032, Spinozapremie 56-464-14192, Geestkracht program grant 10-000-1002), Center for Medical Systems Biology (CMSB, NWO Genomics; http://www.cmsb.nl/), NBIC/BioAssist/RK(2008.024), Biobanking and Biomolecular Resources Research Infrastructure (BBMRI -NL, 184.021.007), the VU University’s Institute for Health and Care Research (EMGO+, http://www.emgo.nl) and Neuroscience Campus Amsterdam (NCA, http://www.neurosciencecampus-amsterdam.nl), the European Science Foundation (ESF, EU/QLRT-2001-01254), the European Community's Seventh Framework Program (FP7/2007-2013), ENGAGE (HEALTH-F4-2007-201413); the European Science Council (ERC Advanced, 230374), Rutgers University Cell and DNA Repository (NIMH U24 MH068457-06), the Avera Institute for Human Genetics, Sioux Falls, South Dakota (USA) and the National Institutes of Health (NIH, R01D0042157-01A). Part of the genotyping was funded by the Genetic Association Information Network (GAIN) of the Foundation for the US National Institutes of Health (NIMH, MH081802) and by the Grand Opportunity grants 1RC2MH089951-01 and 1RC2 MH089995-01 from the NIMH. AA was supported by CSMB (http://www.cmsb.nl/). Most statistical analyses were carried out on the Genetic Cluster Computer (http://www.geneticcluster.org), which is financially supported by the Netherlands Scientific Organization (NWO 480-05-003), the Dutch Brain Foundation, and the department of Psychology and Education of the VU University Amsterdam. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Non-random mating can create systematic differences in parental relatedness, which can have a direct and detectable impact on genome-wide homozygosity in subsequent generations. Non-random mating in human populations can be driven by heritable behavioral traits. It is important to understand how behavior has influenced our genetic variation in order to successfully conduct and interpret studies that aim to understand the reverse, namely how genetic variation influences behavior. In the Netherlands for example, the consequences of continuous religious assortment during the last ~400 years and the relatively recent secularization are detectable through homozygosity differences between religious and non-religious groups. Such systematic differences can cause spurious associations between homozygosity and traits related to religiosity .
Educational attainment (EA) is another complex trait that may induce systematic differences in parental relatedness. Education shows considerable levels of assortment [2,3,4,5,6]. In addition, individuals with a higher education are more likely to have moved away from their birthplace, making the physical distance between them and their family members two to three times greater than for individuals with a lower education . When ancestry shows high correlations with geography, like in the Netherlands [8,9], these behaviors may increase the chance for higher educated individuals to mate with someone with a different ancestral background, making their offspring more outbred, while less educated spouse pairs are more likely to share more ancestry.
EA and its etiology have been widely studied. EA is heritable in populations in which it has been studied, with estimates ranging from ~20% to ~80% and increasing over time [10,11]. EA is associated with many other traits, such as psychiatric disorders [12,13], personality , life expectancy , overall health , and is especially deeply related to IQ [17,18]. IQ is predictive for EA, and is a heritable complex trait [19,20] of which the underlying genetic etiology is largely unknown. This makes EA itself an appealing trait for genetic association studies since it is more feasible to measure on a large scale than IQ [21,22]. Higher cognitive function has recently been associated with increased homozygosity levels in a representative UK sample , which is in the opposite direction of what one would expect assuming that individuals with a higher education are more likely to mate with someone with different ancestry. Assortative mating on cognitive function was posed as a potential explanation for this finding, where assortment among individuals with higher cognitive ability may have induced increased homozygosity for loci that contribute to higher cognitive ability.
The current study examines how migration, ancestral background, and the proportion of the offspring genome consisting of runs of homozygosity (ROHs: multiple contiguous homozygous single nucleotide polymorphisms [SNPs]), vary systematically between different levels of own and parental EA. The proportion of the genome consisting of ROHs is quantified by Froh, which has demonstrated to be a powerful measure for shared ancestry of genetic haplotypes, and is generally used to study the deleterious effects of inbreeding in humans and other animals . If parents with a higher education have higher migration rates and tend to select mates with different ancestral backgrounds through assortative mating, we expect their offspring to show lower Froh levels as well as weaker associations between ancestry-informative PCs and geography.
Data from a population cohort of ~2,000 unrelated subjects of Dutch ancestry included the EA of the participants and their parents, genome-wide SNPs, ancestry informative principal components (PCs), current living address, birthplace, parental birthplace, and religious affiliation.
Materials and Methods
Genotyped subjects were registered at the Netherlands Twin Register (NTR , N = 6,685; 2,678 males and 4,007 females). The NTR subjects were randomly sampled from twin families across the Netherlands. Analyses were done on unrelated individuals only. Unrelated individuals were chosen using GCTA , by excluding one of each pair of individuals with an estimated genetic relationship of >0.025 (i.e., more related than third or fourth cousin). Only individuals with Dutch ancestry were included. Individuals with a non-Dutch ancestry were identified by projecting PCs from 1000 Genomes populations on the dataset, and with additional help of the birth country of the parents. This procedure is described in more detail elsewhere .
This study was approved by the Central Ethics Committee on Research Involving Human Subjects of the VU University Medical Centre, Amsterdam, an Institutional Review Board certified by the US Office of Human Research Protections (IRB number IRB-2991 under Federal-wide Assurance-3703; IRB/institute codes, NTR 03–180). All subjects provided written informed consent.
EA was measured longitudinally with the question “What is the highest educational level that you have finished?”, “What is the highest educational level that your father has finished?”, and “What is the highest educational level that your mother has finished?”. The answer categories varied per survey, but could all be recoded into the following four categories: 1) Primary Education; 2) Secondary Education (VMBO, LBO, MAVO, lower secondary); 3) Higher Secondary Education (MBO, HAVO/VWO, higher secondary); 4) Tertiary Education (HBO, university, PhD). EA was available for 2,089 unrelated genotyped Dutch subjects, paternal EA was measured for 2,067 unrelated genotyped Dutch subjects, and maternal EA was available for 2,075 unrelated Dutch subjects (largely in the same subjects: 2,026 individuals had their own, their paternal, and maternal EA available).
Information on birthplace was available from survey and from city council register data for 1,892 unrelated genotyped Dutch subjects, paternal birthplace for 1,465 subjects, and maternal birthplace for 1,618 unrelated Dutch subjects; 1,371 individuals had both their own and paternal birthplace available, 1,513 individuals had their own and maternal birthplace available, 1,312 had both parental birthplaces available, and 1,227 individuals had their own, their paternal, and maternal birthplace available. Distance between birthplaces was computed with a purpose written perl script using the algorithm available on http://www.geodatasource.com/developers/perl. Data on parental birthplace were extracted from city council registers and were available for parents that were alive after 1994. Parental birthplace distance and distance between own and parental birthplace were analyzed in the sections “Migration distance and EA” and “Migration distance and Froh”, i.e., the analyses from these two sections have mainly been run on more recent generations that are on average more highly educated. The effects of parental migration however can also be deducted from the analyses in the section “Association between geography and ancestry per parental educational level“, where parental EA in combination with own birthplace and ancestry-informative PCs were analyzed; the results from this section were similar after splitting up the sample into subjects with and subjects without data on parental birthplace.
The assessment of religion and city size are described in detail elsewhere . Sample sizes of individuals that had EA, religion, and city size available (i.e., were included in the statistical analyses) are given in the Results section and the Tables for each analysis.
Genotyping, QC, and ancestry-informative PCs
Genotyping was performed on the Affymetrix Human Genome-Wide SNP 6.0 Array according to the manufacturer’s protocol. Methods for blood and buccal cell collection, genomic DNA extraction, genotyping, and QC have been described previously [1,27,28]. Only autosomal SNPs were analyzed. After QC, 498,592 SNPs remained.
Ancestry-informative PCs were computed with EIGENSTRAT  on 5,166 unrelated subjects with Dutch ancestry, which also included subjects from the Netherlands Study of Depression and Anxiety (NESDA) . The ancestry-informative PCs and their computation are described in detail elsewhere .
ROHs and Froh
ROHs were called using Plink . A recent study comparing several software packages designed for this goal concluded that Plink predicts autozygous stretches optimally , using simulated data based on the Affymetrix 6.0 chip, making their density of SNPs in linkage disequilibrium (LD) close to ours We followed the recommendations from this study in calling ROHs: (1) SNPs were pruned for LD (window size = 50, number of SNPs to shift after each step = 5, based on a variance inflation factor [VIF] of 2), resulting in 131,325 SNPs; (2) an ROH was defined as ≥65 consecutive homozygous SNPs with no heterozygote calls allowed. Froh is an overall measure of the proportion of the autosome in ROHs, which is calculated as the total length of ROHs summed for each individual, and then divided by the total SNP-mappable autosomal distance (2.77 × 109 bases).
Migration distance and EA.
The relation between birthplace distances and EA was investigated firstly with a one-way ANOVA in IBM SPSS Statistics 20, with birthplace distance as the dependent variable and EA as the independent variable. Four tests were performed: 1) distance between own and paternal birthplace as the dependent variable and paternal EA as independent variable; 2) distance between own and maternal birthplace as the dependent variable and maternal EA as independent variable; 3) distance between parental birthplaces as the dependent variable and paternal EA as independent variable; 4) distance between parental birthplaces as the dependent variable and maternal EA as independent variable. Post-hoc tests were then conducted with t-tests comparing birthplace distances between each two consecutive educational levels, computed in IBM SPSS Statistics 20.
Educational assortment and parent-offspring correlations.
The chi-squared test and Spearman’s ρ were computed in IBM SPSS Statistics 20 to test for the assortment on EA.
EA and Froh.
The R2 change (= difference in explained variation of Froh) was computed between multiple regressions on Froh with and without EA as a predictor (i.e., own or parental EA). The regressions included as predictors: the three PCs reflecting ancestry (correlated significantly with geography: PC1 = North-South PC, PC2 = East-West PC, PC3 = middle-band PC) , city size (dichotomous, i.e., living in a city with population size >100k), and religion. To evaluate the presence of a birth cohort effect, the analyses were repeated including year of birth as an additional predictor. To correct for the non-normal distribution of Froh, Froh was permuted 100,000 times (i.e., Froh was randomly shuffled across subjects 100,000 times and the R2 change was re-computed in all these random “null” datasets, after which an empirical p-value was computed for the R2 change by dividing the rank of the p-value of the real dataset among the “null” datasets by 100,000; all reported empirical p-values were almost identical to the p-value of the real dataset). These analyses were done in a purpose written perl script, using the PDL::Stats::GLM perl module (see http://search.cpan.org/~maggiexyz/PDL-Stats-0.6.2/GLM/glm.pp).
Migration distance and Froh.
The R2 change (= difference in explained variation of Froh) was computed between multiple regressions on Froh with and without the distance between birthplaces as a predictor. The regressions included as predictors: the three PCs reflecting ancestry, city size, religion, and EA. Froh was permuted 100,000 times. These analyses were done in a purpose written perl script, using the PDL::Stats::GLM perl module. In addition, a Sobel test was conducted in LISREL, where the significance of the mediation effect of parental birthplace distance on the association between parental EA and Froh was tested. The null hypothesis was αβ = 0, where α represents the relationship between parental EA and parental birthplace distance, and β represents the relationship between birthplace distance and Froh (see Fig. 1); τ’ (which represents the relationship between parental EA and Froh) was freely estimated in this model. We also conducted a full mediation test, where we tested whether τ’ = 0 when including α and β in the model, i.e., whether the association between parental EA and Froh remained significant after including parental birthplace distance as a mediator. Both the Sobel test and the full mediation test included as covariates: the three PCs reflecting ancestry, city size, and religion.
Association between geography and ancestry per parental educational level.
The influence of parental educational level on correlations between ancestry-informative PCs of the offspring and geographic location was tested using full information maximum likelihood estimation in OpenMx , separately for maternal and paternal educational levels and the two ancestry-informative PCs. We approximated the effect of parental EA on the correlations between PC-values and geographic location with the following linear model: ρ = ρ0 + education*ρ1, where ρ is the correlation between PC and latitude/longitude. The null hypothesis ρ1 = 0 was tested by mean of the likelihood ratio test.
Migration distance and EA
The distance between parental birthplace and own birthplace was significantly associated with EA (distance between own and paternal birthplace: p = 8.9 × 10-30, N = 1,349; distance between own and maternal birthplace: p = 1.2 × 10-26, N = 1,483). Post-hoc tests showed that this association is mainly driven by a significantly increasing migration distance as the educational level exceeds the Secondary Education (see Table 1 and Fig. 2), with parents with a Tertiary Educational level having moved more than twice the distance than parents with Primary or Secondary Educational levels. The same effect was observed for the distance between paternal and maternal birthplace for both paternal (p = 2.8 × 10-13; N = 1,294) and maternal (p = 1.7 × 10-16; N = 1,291) educational levels (see Table 1), showing that more highly educated individuals are more likely to mate with a partner from a different geographic region.
The average distance the colors are based on are: father: 28.47 km (SD = 44.45); mother: 30.16 km (SD = 44.45). The difference between the moving distance of fathers with a Secondary Education and fathers with a Tertiary Education is best suited to visualize the effect because of the almost equal sample sizes with respect to individuals plotted (i.e., moved) and the significant increase of moving distance (see Table 1); also note that fathers with Secondary Education have >25% measurements in total, which is another indicator of the difference in migration levels.
Educational assortment and parent-offspring correlations
Parental educational levels showed a high spouse correlation (Spearman’s ρ = .66, p = 3 × 10-262, N = 2,058; see Table 2). The majority of the parents (58.5%) shared the same educational level. The only other spouse pair combinations showing higher observed frequencies than expected are fathers with a Higher Secondary Education and mothers with a Secondary Education, or fathers with a Tertiary Education and mothers with a Higher Secondary Education (Table 2), a gender-asymmetrical pattern known as hypergamy . The correlation between parental EA and offspring EA was considerably lower than the spouse correlation, but still highly significant (paternal EA and offspring EA: Spearman’s ρ = .40, p = 1.4 × 10-78, N = 2,035; maternal EA and offspring EA: Spearman’s ρ = .38, p = 3.6 × 10-71, N = 2,041).
EA and Froh
The subjects’ own EA showed a nominally significant negative association with Froh (p = .045, N = 2,007). The association between offspring Froh and parental EA was highly significant (father: p < 10-5, N = 1,989; mother: p = 9×10-5, N = 1,995), with more highly educated parents having offspring with lower Froh levels (see Tables 3 and 4). Multiple confounders were accounted for in all regressions: we included ancestry-informative PCs, city size (i.e., living in a city with population size >100k), and religion (see reference  for more details on the relationship between these variables and Froh). Religion significantly contributed to Froh variation after including parental EA to the regression, and this significance diminished only slightly after the inclusion of EA in the regression, indicating a (partly) independent effect of religion and EA on Froh (see Table 4).
To evaluate whether the age difference between the genotyped subjects and their parents contributed to the difference between the effects of the subjects’ own education and the parental education, the analyses were repeated only including individuals that were at an age where they were more likely to have completed their education. The analyses were run once only including subjects with age > 25, and once including only ages > 30. Both these analyses gave a non-significant result for the subjects’ own EA (age>25: p = .065, N = 1,641; age>30: p = .075, N = 1,401), while parental EA remained significant (father: age>25: p = 9.9 × 10-4, N = 1,610; age>30: p = 1.6 × 10-3, N = 1,371; mother: age>25: p = 3.8 × 10-3, N = 1,616; age>30: p = .046, N = 1,376). Accounting for year of birth in order to evaluate the presence of a cohort effect also still results in a non-significant association between own EA and Froh with own year of birth added as a predictor (p = .181, N = 1,984), and a significant association between parental EA and Froh with the parental year of birth added as an additional predictor (father: p = 3.5 × 10-3, N = 1,401; mother: p = 7.2 × 10-3, N = 1,534).
Migration distance and Froh
A larger distance between paternal and maternal birthplace was significantly associated with lower Froh (p = 9 × 10-5, N = 1,263). A larger distance between own birthplace and parental birthplace also resulted in significantly lower Froh levels (paternal and own birthplace distance: p = 5.3 × 10-3, N = 1,317; maternal and own birthplace distance: p = 9.5 × 10-3, N = 1,445). After including the distance between the paternal and maternal birthplaces as a predictor, parental EA was no longer significantly associated with Froh (paternal EA: p = .077, N = 1,246; maternal EA: p = .134, N = 1,242), while the birthplace distance still contributed significantly to Froh variation (in regression including paternal EA: p = 4.6 × 10-5, in regression including maternal EA: p = 3.1 × 10-5).
To test the significance of the mediation effect, we performed a Sobel test (Fig. 1), including the same covariates as the regression analyses. For both parents, parental birthplace distance significantly mediated the association between parental EA and Froh (paternal EA: p = 2.4 × 10-4; maternal EA: p = 2.3 × 10-4). In addition, we conducted a full mediation test, i.e., we tested whether there is still an association left between parental EA and Froh after mediation by parental birthplace distance. After including parental birthplace distance as a mediator in the model, parental EA was no longer associated with Froh (paternal EA: p = .06; maternal EA: p = .12), indicating a full mediation effect of parental birthplace distance. These results show that the association between Froh and parental EA is explained by parents with a higher education tending to have more different ancestries than less educated parents because of higher migration levels.
Association between geography and ancestry per parental educational level
PCs from genome-wide single-nucleotide polymorphisms (SNPs) capture ancestral background, and show high correlations with geography within the Netherlands and other countries [8,35,36,37]. In the current dataset, the first PC correlates. 74 with the North-South gradient based on birthplace, and the second PC correlates. 46 with the East-West gradient (N = 1,892). The correlations between PCs and geographic location significantly differed between educational groups (PC1 for paternal EA: p = 2.2 × 10-12; PC1 for maternal EA: p = 8.2 × 10-12; PC2 for paternal EA: p = 2.5 × 10-4; PC2 for maternal EA: p = 3.5 × 10-4). Fig. 3 shows a decreasing association between the PCs and geography as the parental education increases. We approximated this decrease with a linear trend (ρ = ρ0 + EA*ρ1, where ρ is the correlation between PC and latitude/longitude, and EA is coded by 0, 1, 2, and 3), which gave us significant negative parameter estimates for ρ1: PC1 for paternal EA: ρ1 = -.06, p = 1.7 × 10-11; PC1 for maternal EA: ρ1 = -.07, p = 8.9 × 10-12; PC2 for paternal EA: ρ1 = -.06, p = 1.7 × 10-4; PC2 for maternal EA: ρ1 = -.05, p = 6.5 × 10-3. This would be expected if parents of subjects with higher EA tended to either live in a different geographic area than their ancestors, or that their partners live in a different geographic area than their ancestors. The effect is still visible after splitting up the sample into a religious and non-religious group (Fig. 3). The non-religious group shows an overall weaker association between geography and the PCs, consistent with previously observed lower Froh levels in the non-religious group in the Netherlands , and suggesting migration may have also played a role in the homozygosity differences between religious and secular groups.
A—Left: geographic distribution of PC1 (N = ~5,000 unrelated Dutch subjects), where the mean PC1 value per postal code (current living address) was computed, divided into 10 percentiles, and plotted. Right: two plots showing the explained variance (R2) of the offspring’s PC1 by the North-South gradient based on the offspring’s birthplace, per parental educational group. B—Left: geographic distribution of PC2. Right: two plots showing R2 between offspring PC2 and the East-West gradient based on offspring’s birth place.
The proportion of the autosomal genome in ROHs (Froh) shows a nominally significant negative association with EA. In the absence of data on parental EA, geographic mobility, and ancestry, this observation could have been interpreted as the result of deleterious effects of inbreeding on cognitive ability, which would fit the existing hypotheses [38,39,40,41]. The effect was considerably more significant however when associating Froh with paternal or maternal EA. We investigated whether this could be explained by a combination of migration and educational assortment. Ancestry correlates highly with geography in the Netherlands due to relatively low levels of within-country migrations in recent history . Individuals with higher EA were significantly more likely to have migrated away from their birthplace and to mate with a partner from a different geographic region. In this context, educational assortment increases the chance for more highly educated individuals to mate with genetically more dissimilar partners, lowering the number of homozygous alleles transmitted to their offspring, while less educated individuals would have been more likely to mate closer to their ancestry. The association between Froh and parental EA disappears after correcting for the distance between the paternal and maternal birthplaces, which itself was also significantly associated with Froh. Additional mediation tests showed that the association between Froh and parental EA was indeed fully mediated by the parental birthplace distance. This is in line with the declining correlation between ancestry-informative PCs and the geographic location of the birthplace in subjects with more highly educated parents (Fig. 3). The same trend is visible after splitting up the sample in a religious and non-religious group, with the non-religious group showing consistently lower correlations between PCs and geography, suggesting migration may have also played a role in the Froh differences between the religious and secular groups previously observed in this sample .
It has been suggested that outbreeding may explain the large increase in IQ from one generation to the next since the start of the 20th century in many parts of the world (also known as the “Flynn effect”) . If outbreeding is indeed associated with increased cognitive abilities, which would likely require much larger sample sizes to detect , the increased heterozygosity in subsequent generations caused by the non-random migration and mating patterns described in this study may have led (or could lead) to a feedback loop, which may contribute to an increase in IQ in each following generation.
A study in a UK sample found a nominally significant association between cognitive ability, which is predictive for EA, and Froh in the opposite direction, with increased Froh levels in individuals with higher cognitive ability . Considering the high correlation between IQ and EA [17,18], the significant association between genetic variation and geography in the UK [43,44], and higher migration rates for more highly educated individuals within the UK [45,46], we would have expected an association between cognitive ability and Froh in the same direction as EA shows in the Dutch population. The authors hypothesized that the ROHs causing this association harbor causal variants that have become homozygous through assortative mating on cognitive function. This difference in results is reminiscent of the difference in direction between populations for the association between Froh and major depressive disorder (MDD) for which the UK and the Netherlands also showed an opposite direction of effect [1,47]. The MDD-Froh association disappeared in the Dutch sample after correcting for systematic differences in parental relatedness between religious and non-religious groups. These phenomena illustrate the importance of the impact of complex social, demographic, and historical processes on the genomic structure of populations. The fact that the offspring education was much less significantly associated with offspring Froh than the parents’ education and that the association disappeared after correcting for the distance between parental birthplaces strongly suggests that the effects we observed in the Dutch population do not reflect systematic differences in the frequency of causal genetic variants. Further analyses in a more deeply phenotyped and representative UK sample (preferably with own and parental EA & birthplace measured) are necessary in order to investigate the discrepancy in direction of effects between the UK and Dutch population, and the role of causal variants therein.
Non-random mating in human populations can be driven by heritable social traits like religion and EA through migration and assortment. The impact of these mating behaviors on the genomic structure of a population is not always directly captured by traditional measures for population stratification, such as ancestry-informative PCs. These findings are relevant for genetic association studies, since these behaviors can be associated with additional traits of interest, like psychiatric disorders with religiosity [1,48], or IQ with EA . Deleterious effects of inbreeding studied by associating Froh and the trait of interest usually require much larger sample sizes for detection than that of the current dataset (~12,000–65,000) . We suspect that ancestral behavior may have influenced genetic variation more systematically than genetic variation influenced the current measurable behavior in our dataset. This additional confounding and non-causal “noise” may have contributed to the difficulty of finding consistent genetic association signals for many behavioral traits, especially if the nature, effect size, and/or direction of such confounding effects would differ per population. We recommend that cohorts contributing to meta-analyses of genetic association studies on behavioral, cognitive, and social traits search for patterns of variation caused by the social/historical context of their population, so these can be accounted for accordingly in their analyses and interpretations.
We would like to thank all the twins and family members for their participation. We thank Professor CV Dolan for providing scripts and feedback for the OpenMx and LISREL analyses.
Conceived and designed the experiments: AA GED PFS BWJHP EJG DIB. Performed the experiments: EAE AB. Analyzed the data: AA. Contributed reagents/materials/analysis tools: GW MB TVB EAE. Wrote the paper: AA JJH BWJHP EJG DIB.
- 1. Abdellaoui A, Hottenga J-J, Xiao X, Scheet P, Ehli EA, Brooks A, et al. Association between Autozygosity and Major Depression: Stratification due to Religious Assortment. Behav Genet. 2013; 43: 455–467. pmid:23978897
- 2. Blossfeld H-P. Educational assortative marriage in comparative perspective. Annu Rev Sociol. 2009; 35: 513–530.
- 3. Mare RD. Five decades of educational assortative mating. Am Sociol Rev. 1991: 15–32.
- 4. Schwartz CR. Trends and Variation in Assortative Mating: Causes and Consequences. Annu Rev Sociol. 2013; 39: 451–470.
- 5. Vandenberg SG. Assortative mating, or who marries whom? Behav Genet. 1972; 2: 127–157. pmid:4664207
- 6. Domingue BW, Fletcher J, Conley D, Boardman JD. Genetic and educational assortative mating among US adults. Proc Natl Acad Sci USA. 2014: 201321426.
- 7. Mulder C, Kalmijn M. Even bij oma langs: NKPS laat zien hoe ver familieden van elkaar wonen. Demos. 2004; 20: 78–80.
- 8. Abdellaoui A, Hottenga J-J, de Knijff P, Nivard MG, Xiao X, Scheet P, et al. Population structure, migration, and diversifying selection in the Netherlands. Eur J Hum Genet. 2013; 21: 1277–1285. pmid:23531865
- 9. Boomsma DI, Wijmenga C, Slagboom EP, Swertz MA, Karssen LC, Abdellaoui A, et al. The Genome of the Netherlands: design, and project goals. Eur J Hum Genet. 2014; 22: 221–227. pmid:23714750
- 10. Branigan AR, McCallum KJ, Freese J. Variation in the heritability of educational attainment: An international meta-analysis. Northwestern University Institute for Policy Research Working Paper. 2013; 13.
- 11. Heath AC, Berg K, Eaves LJ, Solaas MH, Corey LA, Sundet J, et al. Education policy and the heritability of educational attainment. Nature. 1985; 314: 734–736. pmid:4039415
- 12. Breslau J, Lane M, Sampson N, Kessler RC. Mental disorders and subsequent educational attainment in a US national sample. J Psychiatr Res. 2008; 42: 708–716. pmid:18331741
- 13. Kessler RC, Foster CL, Saunders WB, Stang PE. Social consequences of psychiatric disorders, I: Educational attainment. Am J Psychiatry. 1995; 152: 1026–1032. pmid:7793438
- 14. De Raad B, Schouwenburg HC. Personality in learning and education: A review. Eur J Personality. 1996; 10: 303–336.
- 15. Meara ER, Richards S, Cutler DM. The gap gets bigger: changes in mortality and life expectancy, by education, 1981–2000. Health Aff (Millwood). 2008; 27: 350–360. pmid:18332489
- 16. Mackenbach JP, Stirbu I, Roskam A-JR, Schaap MM, Menvielle G, Leinsalu M, et al. Socioeconomic inequalities in health in 22 European countries. N Engl J Med. 2008; 358: 2468–2481. pmid:18525043
- 17. Deary IJ, Strand S, Smith P, Fernandes C. Intelligence and educational achievement. Intelligence. 2007; 35: 13–21.
- 18. Kaufman AS, Kaufman JC, Liu X, Johnson CK. How do educational attainment and gender relate to fluid intelligence, crystallized intelligence, and academic skills at ages 22–90 years? Arch Clin Neuropsychol. 2009; 24: 153–163. pmid:19185449
- 19. Bouchard TJ, Lykken DT, McGue M, Segal NL, Tellegen A. Sources of human psychological differences: The Minnesota study of twins reared apart. Science. 1990; 250: 223–228. pmid:2218526
- 20. Haworth C, Wright M, Luciano M, Martin N, De Geus E, Van Beijsterveldt C, et al. The heritability of general cognitive ability increases linearly from childhood to young adulthood. Mol Psychiatry. 2009; 15: 1112–1120. pmid:19488046
- 21. Martin NW, Medland SE, Verweij KJ, Lee SH, Nyholt DR, Madden PA, et al. Educational attainment: a genome wide association study in 9538 Australians. PLoS One. 2011; 6: e20128. pmid:21694764
- 22. Rietveld CA, Medland SE, Derringer J, Yang J, Esko T, Martin NW, et al. GWAS of 126,559 Individuals Identifies Genetic Variants Associated with Educational Attainment. Science. 2013; 340: 1467–1471. pmid:23722424
- 23. Power RA, Nagoshi C, DeFries JC, Donnelly P, Barroso I, Blackwell JM, et al. Genome-wide estimates of inbreeding in unrelated individuals and their association with cognitive ability. Eur J Hum Genet. 2014; 22: 386–390. pmid:23860046
- 24. Keller MC, Visscher PM, Goddard ME. Quantification of inbreeding due to distant ancestors and its detection using dense single nucleotide polymorphism data. Genetics. 2011; 189: 237–249. pmid:21705750
- 25. Boomsma DI, de Geus EJ, Vink JM, Stubbe JH, Distel MA, Hottenga J-J, et al. Netherlands Twin Register: from twins to twin families. Twin Res Hum Genet. 2006; 9: 849–857. pmid:17254420
- 26. Yang J, Lee SH, Goddard ME, Visscher PM. GCTA: a tool for genome-wide complex trait analysis. Am J Hum Genet. 2011; 88: 76. pmid:21167468
- 27. Boomsma DI, Willemsen G, Sullivan PF, Heutink P, Meijer P, Sondervan D, et al. Genome-wide association of major depression: description of samples for the GAIN Major Depressive Disorder Study: NTR and NESDA biobank projects. Eur J Hum Genet. 2008; 16: 335–342. pmid:18197199
- 28. Willemsen G, de Geus EJC, Bartels M, van Beijsterveldt CT, Brooks AI, Estourgie-van Burk GF, et al. The Netherlands Twin Register biobank: a resource for genetic epidemiological studies. Twin Res Hum Genet. 2010; 13: 231–245. pmid:20477721
- 29. Price AL, Patterson NJ, Plenge RM, Weinblatt ME, Shadick NA, Reich D. Principal components analysis corrects for stratification in genome-wide association studies. Nat Genet. 2006; 38: 904–909. pmid:16862161
- 30. Penninx BW, Beekman AT, Smit JH, Zitman FG, Nolen WA, Spinhoven P, et al. The Netherlands Study of Depression and Anxiety (NESDA): rationale, objectives and methods. Int J Methods Psychiatr Res. 2008; 17: 121–140. pmid:18763692
- 31. Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, Bender D, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007; 81: 559–575. pmid:17701901
- 32. Howrigan D, Simonson M, Keller M. Detecting autozygosity through runs of homozygosity: A comparison of three autozygosity detection algorithms. BMC Genomics. 2011; 12: 460. pmid:21943305
- 33. Boker S, Neale M, Maes H, Wilde M, Spiegel M, Brick T, et al. OpenMx: an open source extended structural equation modeling framework. Psychometrika. 2011; 76: 306–317. pmid:23258944
- 34. Esteve A, García‐Román J, Permanyer I. The Gender‐Gap Reversal in Education and Its Effect on Union Formation: The End of Hypergamy? Popul Dev Rev. 2012; 38: 535–546.
- 35. Chen J, Zheng H, Bei J-X, Sun L, Jia W-h, Li T, et al. Genetic structure of the Han Chinese population revealed by genome-wide SNP variation. Am J Hum Genet. 2009; 85: 775–785. pmid:19944401
- 36. McEvoy BP, Montgomery GW, McRae AF, Ripatti S, Perola M, Spector TD, et al. Geographical structure and differential natural selection among North European populations. Genome Res. 2009; 19: 804–814. pmid:19265028
- 37. Novembre J, Johnson T, Bryc K, Kutalik Z, Boyko AR, Auton A, et al. Genes mirror geography within Europe. Nature. 2008; 456: 98–101. pmid:18758442
- 38. Jensen AR. Effects of inbreeding on mental-ability factors. Pers Individ Dif. 1983; 4: 71–87.
- 39. Morton N. Effect of inbreeding on IQ and mental retardation. Proc Natl Acad Sci USA. 1978; 75: 3906–3908. pmid:279005
- 40. Najmabadi H, Hu H, Garshasbi M, Zemojtel T, Abedini SS, Chen W, et al. Deep sequencing reveals 50 novel genes for recessive cognitive disorders. Nature. 2011; 478: 57–63. pmid:21937992
- 41. Woodley MA. Inbreeding depression and IQ in a study of 72 countries. Intelligence. 2009; 37: 268–276.
- 42. Mingroni MA. The secular rise in IQ: Giving heterosis a closer look. Intelligence. 2004; 32: 65–83. pmid:15042465
- 43. Tyler-Smith C, Xue Y. A British approach to sampling. Eur J Hum Genet. 2011; 20: 129–130. pmid:21829226
- 44. Winney B, Boumertit A, Day T, Davison D, Echeta C, Evseeva I, et al. People of the British Isles: preliminary analysis of genotypes and surnames in a UK-control population. Eur J Hum Genet. 2011; 20: 203–210. pmid:21829225
- 45. Faggian A, McCann P, Sheppard S. An analysis of ethnic differences in UK graduate migration behaviour. Ann Reg Sci. 2006; 40: 461–471.
- 46. Faggian A, McCann P, Sheppard S. Human capital, higher education and graduate migration: an analysis of Scottish and Welsh students. Urban Studies. 2007; 44: 2511–2528.
- 47. Power RA, Keller MC, Ripke S, Abdellaoui A, Wray NR, Sullivan PF, et al. A Recessive Genetic Model and Runs of Homozygosity in Major Depressive Disorder. Am J Med Genet B Neuropsychiatr Genet. 2014; 165: 157–166. pmid:24482242
- 48. Koenig HG. Research on religion, spirituality, and mental health: A review. Can J Psychiatry. 2009; 54: 283–291. pmid:19497160