Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

A guide to using a multiple-matrix animal model to disentangle genetic and nongenetic causes of phenotypic variance

  • Caroline E. Thomson ,

    Roles Conceptualization, Formal analysis, Methodology, Project administration, Visualization, Writing – original draft, Writing – review & editing

    cethomson2@gmail.com

    Affiliation Laboratoire Evolution & Diversité Biologique (EDB UMR 5174), Université Fédérale Toulouse, Midi-Pyrénées, CNRS, ENSFEA, IRD, UPS, France

  • Isabel S. Winney,

    Roles Conceptualization, Writing – original draft, Writing – review & editing

    Affiliation Laboratoire Evolution & Diversité Biologique (EDB UMR 5174), Université Fédérale Toulouse, Midi-Pyrénées, CNRS, ENSFEA, IRD, UPS, France

  • Océane C. Salles,

    Roles Conceptualization, Visualization, Writing – original draft, Writing – review & editing

    Affiliation Laboratoire Evolution & Diversité Biologique (EDB UMR 5174), Université Fédérale Toulouse, Midi-Pyrénées, CNRS, ENSFEA, IRD, UPS, France

  • Benoit Pujol

    Roles Conceptualization, Funding acquisition, Supervision, Writing – original draft, Writing – review & editing

    Affiliations Laboratoire Evolution & Diversité Biologique (EDB UMR 5174), Université Fédérale Toulouse, Midi-Pyrénées, CNRS, ENSFEA, IRD, UPS, France, Laboratoire d’Excellence “CORAIL”, Perpignan, France

Abstract

Non-genetic influences on phenotypic traits can affect our interpretation of genetic variance and the evolutionary potential of populations to respond to selection, with consequences for our ability to predict the outcomes of selection. Long-term population surveys and experiments have shown that quantitative genetic estimates are influenced by nongenetic effects, including shared environmental effects, epigenetic effects, and social interactions. Recent developments to the “animal model” of quantitative genetics can now allow us to calculate precise individual-based measures of non-genetic phenotypic variance. These models can be applied to a much broader range of contexts and data types than used previously, with the potential to greatly expand our understanding of nongenetic effects on evolutionary potential. Here, we provide the first practical guide for researchers interested in distinguishing between genetic and nongenetic causes of phenotypic variation in the animal model. The methods use matrices describing individual similarity in nongenetic effects, analogous to the additive genetic relatedness matrix. In a simulation of various phenotypic traits, accounting for environmental, epigenetic, or cultural resemblance between individuals reduced estimates of additive genetic variance, changing the interpretation of evolutionary potential. These variances were estimable for both direct and parental nongenetic variances. Our tutorial outlines an easy way to account for these effects in both wild and experimental populations. These models have the potential to add to our understanding of the effects of genetic and nongenetic effects on evolutionary potential. This should be of interest both to those studying heritability, and those who wish to understand nongenetic variance.

Introduction

Genetic and nongenetic sources of phenotypic variation and similarity between individuals are often studied in isolation, with different research aims. Laboratory experiments have demonstrated genetic and nongenetic sources of similarity [1], but the use of fixed genetic and/or environmental backgrounds [2,3] can limit their relevance in nature. Studies in wild populations are more biologically realistic, but cannot control for all sources of variation. For example, such studies may investigate quantitative genetic [4,5], social [6,7], or environmental sources of variation [8]. Yet, these mechanisms are likely to act simultaneously on phenotypes, which makes them challenging to disentangle. Whilst there are attempts to consider genetic and nongenetic aspects of phenotypic variation simultaneously [9,10], these methods are challenging to apply with current empirical data. Thus there is a need to distinguish multiple sources of variation within a single model framework that is simple to use with realistic data. This will be valuable for addressing questions about how phenotypes are shaped, both in wild and experimental settings, and the evolutionary consequences of these causes of variance.

Quantitative genetic components of phenotypic variance

Quantitative genetics assesses the extent to which related individuals show similar phenotypic traits, making the assumption that this is because of shared genetic effects [4]. This assumes that continuous phenotypic traits are determined by a large number of genes with small additive effects [4,5], and environmental influences. The ‘animal model’ is a powerful tool in quantitative genetics, used to disentangle the origins of phenotypic variation in natural systems [11]. Adopted from animal breeding studies, this mixed-effects model associates variation in phenotypic traits with relatedness information to give an estimate of the (additive) genetic variation underlying the trait [12]. From this, researchers primarily calculate narrow-sense heritability, the portion of population phenotypic variance attributable to additive genetic variation, which provides predictions about a population’s expected response to selection [13,14]. Therefore, the animal model provides a link between phenotypic traits and their evolutionary potential.

The animal model is mostly used to estimate additive genetic variance (and heritability), but phenotypic variance can sometimes be affected by other factors that reduce the precision of these estimates. Genetic variance can include dominance and epistatic variance [4], in addition to the additive genetic variance, meaning that additive genetic variance may be overestimated. Dominance variance can be partitioned by comparing full- and half-siblings, or by including a dominance matrix in the animal model. Nevertheless, this is rarely done in wild studies [15] as good estimates require a large dataset containing specific types of relatives, and inaccurate estimates can often be confounded with maternal effects [16].

Nongenetic factors can also cause misestimation of additive genetic variance in some systems. The main assumptions underlying the animal model are that relatives share genes, and therefore the genetic effects that shape their genotype, but that any effects shaping unexplained variance are independent between individuals [5]. Shared genes are not the only sources of similarity between individuals, however, which violates these assumptions [17]. These sources of variation may need to be accounted for to accurately estimate heritability and evolutionary potential, and can be of interest in their own right. In this paper we focus principally on three nongenetic sources of variation: environmental, epigenetic, and social sources of variation, although others are known to exist [18]. Whilst environmental sources of variation, including the environment caused by social partners, have previously been included in the animal model in various ways, variation attributable to epigenetics or social networks have not. We begin, therefore, by briefly reviewing how environmental sources of variance can be included in quantitative genetic models.

Including the environment in the animal model

Shared environments can increase the phenotypic resemblance between relatives and alter heritability estimates. Shared rearing environments in particular can increase sibling similarity. To address this problem, the common environmental factor causing individuals to resemble one another can be included as a random effect in the animal model. For example, models may include the year of measurement [19] or year of birth [20] to account for broad-scale environmental variance. Shared rearing effects can also be accounted for in this way [21,22], particularly when combined with cross-fostering experiments [17,23].

In many cases, some portion of the environment that an individual experiences can be attributed to other individuals. For example, shared rearing environments can include environmental parameters generated by one or both parents [24,25]. Parental effects can be accounted for in simple cases by including the identity of one or both parents (usually the mother) as an additional random effect [2628]. This accounts for trait variance that can be attributed to the parent(s) [29,30], rather than attempting to identify and account for all parental phenotypic traits that cause the focal trait variance (although see [27]).

More generally, the variance in a focal individual’s phenotype can be affected by indirect genetic effects [3133]. These arise when a focal individual’s environment is altered by a conspecific (whether parental or not), as determined by the conspecific’s own genes. When IGEs occur, the residual variance in a trait is partially heritable and can affect evolutionary trajectories [34]. Animal models that incorporate IGEs have been developed [3539], and implemented within the fields of animal and plant breeding, allowing better estimation of the total heritabilities of traits. Additionally, experimental populations have been used to estimate IGEs on aggression [40,41]. In wild populations, there have been some estimates of IGEs, particularly where only one or two individuals have indirect genetic effects on traits. Phenotypic variance due to parental effects can be estimated as parental genetic variance, wherein the covariance between the effects of mothers (or fathers) is specified by the additive genetic relatedness matrix ([42]; [5]; [11]; for examples see: [43]; [28]). Additionally, wild studies have estimated IGEs of males on female reproductive timing [44,45], and those of helpers at the nest in long-tailed tits [46].

Accounting for spatial similarity in environments

Environments are often heterogeneous over both space and time [47,48], and may show spatial (and temporal) autocorrelation. Factorial random effects such as birth year may not capture the variance due to continuous and spatially autocorrelated environmental effects. This can lead to the misestimation of variance components, particularly if relatives share environments due to limited dispersal. Within quantitative genetics, spatial autocorrelation of phenotypes can be accounted for in the animal model. Autoregressive (AR) models account for the similarity between individuals that are close in space, using a two dimensional row-by-column model, first incorporated into animal models for agricultural trials [49,50]. This method has been widely used in agriculture, including in forest genetic trials [51,52], although the AR term has not always accounted for all environmental variance [53]. In one recent paper, [54] included both IGEs of nearest neighbours and spatial effects in models of leaf disease and growth in Eucalyptus to show that these processes may act simultaneously and interact.

The effect of spatial autocorrelation on heritabilities in the wild has received relatively little attention, although some studies examining its effects are emerging. [55] used parent-offspring correlations to show that including spatial autocorrelation reduced estimates of laying date heritability in great tits (Parus major). Subsequently, three studies have included spatial autocorrelation in animal models for wild populations [5658]. Whilst [56] found relatively large variance components associated with spatial autocorrelation, particularly for home range size traits, such large variance components were not found by [57] for morphological or phenological traits, or by [58] for breeding time.

An alternative approach to measuring environmental similarity

Three of the papers above [5658] demonstrate an additional way to account for variance caused by the environment, which we build on in this paper. This technique was first applied by [56], who included a matrix of home-range overlap (the “S” matrix) in animal models. Spatial observations were used to estimate an individual’s home range, and the overlap of individual’s home ranges were estimated and used to generate the S matrix. The S matrix was then incorporated into the animal model to estimate trait variance due to shared space. This model had the expectation that if the environment affected trait values, individuals that shared space would be similar in their trait values. The S matrix would therefore be used to account for the variance associated with environmental effects. The environmental variance estimated by including the S matrix was generally comparable to the magnitude of spatial autocorrelation in both red deer [56] and Soay sheep [57]. [58] used a similar method to estimate shared environmental effects on breeding date in song sparrows, using overlap in buffer zones around nest boxes. As with spatial autocorrelation, there was no significant effect of nest area overlap on breeding date heritability estimates.

Broader applications of multiple matrices

The S matrix will allow the animal model to estimate the variance due to environmental effects when it appropriately captures environmental heterogeneity. However, individuals that do not share space may still experience similar environments, whereas those that do share space may differ in the environments that they experience. We propose that similar matrices could be built using multiple measures of the environment itself, to describe the similarity in environment that individuals experience, and be incorporated in the animal model to account for environmental variance. The formation and use of such a matrix is demonstrated in the tutorial section below and in the supplement (S1 File).

We show two additional variance parameters that can be estimated using these matrices, beyond environmental variance. Firstly, phenotypic variation can also be caused by epigenetic variation [59,60]. This can be induced by environmental conditions [61], but has also been found in the absence of genotypic and environmental variation [62,63]. Estimates of additive genetic variation are made from statistical patterns that can include additive epigenetic variation, particularly if epigenetic marks are inherited [64,65]. To separate additive genetic and epigenetic variation, inclusion of matrices describing epigenetic “relatedness”, or similarity, between individuals should allow estimation of epigenetic variance.

Secondly, the social environment can affect phenotypic traits. An individual can learn behaviours from others with whom it interacts [66,67]. When individuals are embedded within a social network, information can be transmitted more strongly amongst frequently interacting individuals [68,69]. If relatives are social partners [70,71], this can potentially inflate estimates of genetic variance in an animal model. Conversely, to understand social effects on trait values, researchers should take relatedness and additive genetic variation into account. We propose using a matrix to account for links between individuals in a social network, and thereby estimate the variance associated with social structures in an animal model. Inclusion of social information into an animal model has previously been attempted for bottlenose dolphins [72], although this study had methodological issues, and inclusion of both genetic and social information within a single model was not attempted.

The purpose of this paper is to outline the tools available to disentangle causes of genetic and nongenetic resemblance between individuals, within the animal model framework. We take advantage of the versatility of this model to demonstrate the use of the multiple matrix method in evolutionary biology, as proposed by [18], and first implemented for nongenetic data in the wild by [56]. Matrices that account for nongenetic causes of similarity between individuals are included in a similar way to the matrix of genetic relatedness, and can be derived from multiple sources of similarity, including those described above. Using simulated data we provide a complete tutorial, with code provided in the supplementary materials (S1 File), to demonstrate how to carry out such analyses with R packages [73]. We show how phenotypic variance can be partitioned into genetic and nongenetic components, including parental effect variance, in three different phenotypic traits. We also give a qualitative overview of the data sizes needed to run these models. This extension of the animal model has the potential to bridge multiple areas of research on genetic and nongenetic sources of phenotypic variation.

Tutorial

The animal model is a mixed-effects model (i.e. a linear regression containing fixed and random terms), that partitions the total phenotypic trait variance in a population (Vp) between various sources, including additive genetic variation (Va). The animal model is a powerful tool for disentangling multiple sources of variation, both genetic and nongenetic. For readers who are new to the animal model, we recommend [5], [11], and [12] as an introduction. For those seeking more information, [16] gives a more extensive and in-depth overview of the models covered in this paper (Chapters 19, 20, 22).

Our aim here is to demonstrate the creation and use of nongenetic matrices in animal models. The animal model then becomes a mixed-effects model with multiple vectors of random effects. To illustrate this, we simulated three phenotypic traits, and ran animal models on them, as detailed below. Animal models were run using the R package ASReml-R [74]. Full details of the simulations are given in the Supplementary Information (S2 File), with a brief overview in the section below. The subsequent sections contain the theory behind the variance components included, followed by the results of models that used matrices of environmental similarity. We show how matrices derived from environmental, epigenetic and social network information can be included to estimate variance. A step-by-step guide with code (for ASReml-R) is given in the Supplementary Information (S1 File), with additional information on how to carry out these analyses in MCMCglmm [75], a Bayesian R package. Fitting some or all of these models may be possible in other software and R packages, including standalone ASReml [76], Wombat [77] and INLA [78]. Finally, to provide an idea of the data requirements of these models, we show the effects of data removal on the model in the Supplementary Information (S3 File).

Data simulation

To demonstrate how to use multiple-matrix animal models, we provide a simulated dataset of a population of merpeople (mermaids and mermen). This imaginary population contains ten semelparous generations, of between 150 and 250 individuals per generation (total population size 1806). Half of each generation were selected and assigned to breeding pairs at random (the other half were non-breeding), and offspring were then randomly assigned to these pairs. To simulate extra-pair mating, 10% of offspring had one parent reselected from the breeding population.

To simulate environments and their effects, each individual was assigned a location on a 50x50 grid, with the first generation positioned on the grid at random. Subsequent generations dispersed from their mothers, following a lognormal distribution function (μ = 1, σ2 = 1, see S2 File). Five continuous environmental variables (temperature, salinity, pH, coral coverage, depth) were simulated, with spatial autocorrelation across the 50x50 grid (the covariance between locations was determined by the matrix e−ϕD, where D is a matrix of distances and ϕ = 0.15). The five environmental variables were independently simulated and had negligible covariances. Individuals were then assigned a set of environmental variables according to their location on the grid.

We also simulated a set of epialleles, to show how an epigenetic similarity matrix can be used in an animal model. We simulated 230 CpG islands [79], wherein the number of methylated CpG sites within each island was allowed to vary between 0 and 20. For 100 of these, the methylation was affected by the environmental variables an individual experienced itself. For 30, methylation was affected by either the maternal environment, or was reset to the value determined by the individual’s own environment (with 50% probability of reset), such that epigenetic marks could be transmitted across generations. The final 100 islands were uninherited and unaffected by the environment, so each individual had a randomly generated value between 0 and 20 for each island.

To demonstrate how social effect variance can be accounted for, we simulated a social network of within- and between-generation interactions, based on the spatial locations of individuals. Within generations, the probability of connection in the social network declined with distance. Between adjacent generations, there was a weaker probability of connections, although the probability remained higher for mother-offspring pairs. Non-adjacent generations had no direct links in the social network.

Trait values

We simulated three continuous phenotypic traits, composed of different sources of variance (Fig 1). Each individual variance component was simulated with a mean of zero and variance of 1. The total variance of each trait therefore depended on the number of components used to generate it. All trait values included additive genetic and residual effects. Each environmental variable (kl, where l indexes environments) had a linear effect on the trait (βl), such that the total effect of the environment on the trait for individual i is , with variance . Direct effects of epigenetic marks or the social network were not simulated, but were expected to account for environmental variance. No covariances were simulated between variance components.

thumbnail
Fig 1. The potential inputs that could affect trait values in phenotypic traits in merpeople.

Boxed parameters are the true parameters included in at least one of the traits. Unboxed parameters are included in models as potential sources of variance, but did not affect the simulated traits. a represents additive genetic effects, and Ma the maternal genetic effect. n is the total direct environmental effect, and Mn the maternal environmental effect. epi and Mepi are direct and maternal epigenetic effects, and c and Mc are direct and maternal social network/cultural effects. Paternal effects were not included in the simulations or models, but are shown here as a potential effect on trait values. F0 and F1 indicate generations.

https://doi.org/10.1371/journal.pone.0197720.g001

The first trait was tail-fin colour. We use this trait to illustrate how the additive genetic and environmental variance can be confounded in an animal model, and how environmental variance can be accounted for. Individual trait values for tail-fin colour (yi,1) were: (1) where ai is the additive genetic effect, ni is the effect of the five environments, and ri is a residual effect. Trail-fin colour has a raw trait variance = 3.17, and an expected heritability of ~0.33.

Trait two was body size (y2). This trait is used to illustrate maternal variance, particularly how it can be estimated with an environmental matrix. No paternal effects were simulated or modelled. Trait values were:. (2) where maternal environmental variance (where j indicates the mother of individual i). The effect a mother had on her offspring’s trait was determined by the environment she experienced, akin to direct environmental effects. Total raw trait variance was = 3.61, with an expected heritability of ~0.25.

The third trait, swimming speed (y3), was used to illustrate the partitioning of different sources of maternal variance. This trait had all the components above, and also contained maternal genetic variance (Maj,i), wherein the effect of a mother was also affected by her own genes: (3)

Total raw variance was = 4.28, with an expected heritability of ~0.2.

Tail-fin colour—estimating additive genetic and environmental variance

Additive genetic variance.

In a basic animal model, an individual (i) has a trait value (yi) that is is composed of at least the three following elements: the population mean (μ), the breeding value of the individual (ai, or the effect of the genotype relative to the population mean) and a residual (ri), such that the total trait is yi = μ + ai + ri. In a more general form, this can be rewritten as: (4) where y is a vector of trait values in all individuals, X is the design matrix relating fixed effects to individuals, and β is a vector of fixed effects. In the case of Eq 4, which contains the intercept and no fixed effects, X is a vector of ones, and β = μ. Z is the incidence matrix relating random effects to each individual, and u is a vector of random effects.

When only one random effect (the additive genetic effect) is included, because there is only one phenotypic observation per individual Z becomes an identity matrix (I) and u is a vector of additive genetic effects. Both u and and r are assumed to be normally distributed with means of zero, and variances of Va and Vr, respectively. Residual errors are assumed to be independent between individuals, so that the variance-covariance matrix for r is R = I * Vr and therefore r ~ N(0,R). Taking G to be the variance-covariance matrix for u, this can be derived from the expected covariance between relatives in additive genetic effects G = A * Va, and thus u ~ N(0,G). A is the additive genetic relatedness matrix (Fig 2A), which is derived from the pedigree or genomic relatedness data [12,80]. This matrix contains the elements Ai,j = 2θi,j, where θi,j is the coefficient of coancestry (the probability that an allele in individual i is identical by descent to one of the alleles at the same locus in individual j). Thus for any pair of individuals, the additive genetic covariance between them is Gi,j = 2θi,j + Va.

thumbnail
Fig 2. Reduced versions of the matrix information included in the animal models shown in this paper.

(a) The genetic relatedness matrix, derived from a pedigree, which contains the probabilities that individuals i and j share alleles that are identical by descent. (b) The environmental similarity matrix, which contains elements Sn(i,j) (written in the figure as S, as the same equation is used for elements of Sepi) measuring the Euclidean distance between the environment individuals experience, p and q being the scaled environmental vectors for individuals i and j across l environments. (c) Epiallelic similarity, which contains elements Sepi(i,j), measuring the Euclidean distance between individuals’ epigenetic profiles, where p and q are the scaled epigenetic vectors for individuals i and j, across l epigenetic islands. (d) The geodesic distance matrix (social network connectedness) that shows the proximity of individuals in the social network, based on minimum path lengths (gi,j).

https://doi.org/10.1371/journal.pone.0197720.g002

Using the additive genetic relatedness matrix, the animal model relates trait covariance between individuals to their relatedness, and thus we are able to separate the additive genetic variance (Va, variance in ai) contributing to a trait from the residual variance (Vr, variance in ri). The phenotypic variance (Vp) is then the sum of the additive genetic and residual variances: (5)

Environmental variance.

Nongenetic factors can also cause individuals to resemble one another. To account for this, we can include information on nongenetic similarity between individuals in the model.

Shared environments can induce similar trait values between individuals, and thus create patterns of phenotypic variation that we may want to estimate. If related individuals share the same environment, neglecting shared environmental effects can inflate the estimated additive genetic variance (Va). Environmental causes of resemblance can be included as fixed or random effects in the model [17], such as a discrete environment classification (nest box, year of birth etc.). In this paper, however, we are more interested in how multiple continuous environmental parameters could be used to account for phenotypic variation. We do this by including a matrix that describes environmental similarity between individuals. To separate the shared environment variance from additive genetic variance in the model, the data requires that there are related individuals found across different environments [81]. However, if relatives all live in different environments, environmental effects will be difficult to distinguish from residual variation. Thus, these models also require that several group members are found in the same environment [17].

We took a measure of the similarity of the environment shared by each pair of individuals (across all generations), which generates the variance-covariance matrix associated with an environmental random effect that can be included in the animal model. We call this the Sn matrix, sensu [56], with subscript n indicating environmental effects. Similarity was calculated from the five continuous environmental measures that were simulated. These environmental measures were first centered and scaled, so that all had a mean of zero and variance of one. For each pair of individuals, the Euclidean distance between environmental measures was then calculated. This is equal to the straight-line distance between the two vectors of environmental measures in multivariate space, and so describes the similarity in environmental measures between individuals (, where p and q are the vectors of centered and scaled environmental measures for individuals i and j). This was scaled so that the Sn matrix contains values between 0 (dissimilar environments) and 1 (identical environments), with 1’s on the diagonal because individuals have an identical environment to themselves (Fig 2B). Thus, each element of the matrix is: (6)

Direct environmental variance (Vn) can then be estimated within the animal model in a similar way to additive genetic variance. Building on Eq 4 the model becomes: (7) where n is the vector of direct environmental effects, and Zn is the incidence matric that links each data point to an environmental value. As eah individual in the simulation has only one observation in the matrix, Zn becomes an identity matrix in the simulated example. Environmental effects (n) are assumed to be normally distributed with mean of zero, and a variance Vn, such that n ~ N(0,SnVn). Thus, the Sn matrix is included as the variance-covariance matrix for the environmental random effect. This gives: (8)

Variance estimates for tail-fin colour.

Applying a basic animal model of the form in Eq 6 for tail-fin colour gave a significant estimate of additive genetic variance that was larger than the simulated variance of 1 (Va = 1.714, SE = 0.19, p<0.001). Similarly, the residual variance was larger than the simulated value of 1 (Vr = 1.714, SE = 0.11). The model therefore gave an overestimate of trait heritability h2 = 0538 (SE = 0.042; Fig 3A). This overestimation was due to the fact that there were also direct environmental effects on the trait value that were not accounted for.

thumbnail
Fig 3. Variance components of tail-fin colour (trait 1).

This trait was simulated to contain additive genetic, environmental, and residual variance, each with mean = 0 and variance = 1. For each of the models, the pair of plots show the absolute values (left plot) and the proportion (right plot) of total trait variance explained. a) Model containing only additive genetic (a) and residual variance (r). b) Model including additive genetic (a), environmental (n), and residual variance (r) (the true model). c) Model including additive genetic (a), epigenetic (epi), and residual variance (r). d) Model containing additive genetic (a), social (c), and residual variance (r). The vertical dashed line shows simulated value for variance components (1).

https://doi.org/10.1371/journal.pone.0197720.g003

The environmental similarity matrix was subsequently included in the animal model for tail-fin colour, and produced accurate variance estimates (Fig 3B). As this trait was composed of additive genetic and environmental effect, this represents the complete (and correct) model of phenotypic variance. All three components were simulated to have variances around 1, and heritability around 0.33. The values estimated in this model were close to these values, confirming the efficacy of this method; Va = 1.1 (SE = 0.12, p<0.001), Vn = 0.986 (SE = 0.22, p<0.001), Vr = 0.894 (SE = 0.07), with h2 = 0.369 (SE = 0.042).

Body size—Estimating maternal environmental variance

Including parental effects.

Parents can form a portion of the environmental causes of trait values (i.e. an external input to the focal trait). Thus, we can think of parental effects as the combined influence of one or more (unmeasured) parental traits that affect the offspring’s phenotype (e.g. feeding rate), influenced by the parents own genotypes and/or environments. Parental effects can inflate the similarity between relatives, as siblings generally experience the same parental effects.

We can account for parental effects in the animal model. In the simplest case, this is done by including parental identity as an additional random effect. This requires that the parents have more than one offspring in the data, and that half-siblings are also present. Maternal half-siblings share both additive genetic and maternal effects, whereas paternal half-siblings share additive genetic and paternal effects. Information also comes from individuals that are related but with different mothers. We show maternal effects here, but the equivalent can be done for paternal effects. With the inclusion of maternal variance (VM), the total variance estimated in the model becomes: (9)

This model assumes that there is no covariance between mothers in the effects that they have on their offspring. However, mothers have not been simulated to be independent in their effects on offspring body size. This arises due to shared environments—mothers with similar environmental values are expected to express similar maternal effects on their offspring. For example, mothers that share environments may access similar types and quantities of food, and therefore show similar provisioning. We can account for this non-independence by estimating maternal environmental variance (VMn), using the same matrix of environmental similarity (Sn) as was used for the direct environmental effect. The total phenotypic variance is then estimated as: (10)

Whilst the Sn matrix included here is the same as for direct environmental effects, the incidence matrix (ZMn) for maternal environmental variance is not an identity matrix. Here it relates each set of phenotypic observations to a particular mother in the dataset.

Variance estimates for body size.

Body size was simulated to contain the same variance components as tail-fin colour, but with the addition of maternal environmental variance (VMn; Eq 2). In an animal model estimating only additive genetic and residual variances, both variances were overestimated, Va = 1.73 (SE = 0.19, p<0.001), Vr = 1.833 (SE = 0.12; Fig 4A). Accordingly, the heritability estimate was also upwardly biased, h2 = 0.486 (SE = 0.04).

thumbnail
Fig 4. Variance components of body size (trait 2).

This trait was simulated to contain additive genetic, environmental, maternal environemental, and residual variance, each with mean = 0 and variance = 1. For each of the models, the pair of plots show the absolute values (left plot) and the proportion (right plot) of total trait variance explained. a) Model containing only additive genetic (a) and residual variance (r). b) Model including additive genetic (a), environmental (n), and residual variance (r). c) Model including additive genetic (a), environmental (n), and residual variance (r), and maternal variance estimated by including maternal identity as a random effect (M). d) Model including additive genetic (a), environmental (n), maternal environmental (Mn), and residual variance (r) (the true model). e) Model including additive genetic (a), epigenetic (epi), maternal epigenetic (Mepi), and residual variance (r). f) Model containing additive genetic (a), social (c), maternal social (Mc), and residual variance (r).

https://doi.org/10.1371/journal.pone.0197720.g004

Including the environmental matrix in the animal model, we estimated a significant amount of (direct) environmental variance in the trait (Fig 4B), Vn = 1.108 (SE = 0.29, p<0.001). This somewhat reduced the estimate of additive genetic variance Va = 1.503 (SE = 0.15, p<0.001), residual variance Vr = 1.259 (SE = 0.09) and heritability h2 = 0.388 (SE = 0.041). Nevertheless, some variance estimates remained upwardly biased because maternal effects were not accounted for.

Including a simple maternal effect in the animal model (as in Eq 9), we found significant maternal effect variance VM = 0.653 (SE = 0.09, p<0.001), although it was underestimated relative to the true simulated variance. This reduced the estimate of the additive genetic (Va = 1.314, SE = 0.19, p<0.001), and residual variance (Vr = 0.855, SE = 0.1), although these remain overestimated, as was heritability (h2 = 0.337, SE = 0.046; Fig 4C). Environmental variance was unchanged compared to the previous model (Vn = 1.079 SE = 0.27, p<0.001).

The misestimation of the variance components in the model above was due to the non-independence of mothers in the effects they had on offspring body size. Therefore, we ran a model including maternal environmental variance (as in Eq 10). We found a significant maternal environmental effect, VMn = 0.97 (SE = 0.23, p<0.001), similar to the simulated variance of one. Inclusion of the maternal environmental effect led to variance estimates close to 1 for all components (Fig 4D): Va = 1.059 (SE = 0.14, p<0.001), Vn = 1.052 (SE = 0.24, p<0.001), and Vr = 0.892 (SE = 0.08). The heritability of the trait was h2 = 0267(SE = 0.038), similar to the expected heritability of 0.25. Thus, the model that matched the components of the simulated trait estimated the variance components.

Swimming speed—including maternal genetic variance

The final simulated trait, swim speed, was composed of the same components as body size, with the addition of maternal genetic effects (Eq 3). We use this trait to demonstrate the difficulty of disentangling maternal genetic and environmental effects.

As with maternal environmental variance, maternal genetic variance causes mothers to covary in their maternal effects. To estimate maternal (or paternal) genetic effects using relatedness derived from a pedigree, the data needs to contain at least three generations of individuals [12]. To account for maternal genetic effects in the animal model an additional random effect is included, with a variance-covariance matrix determined by the additive genetic relatedness matrix, as is done for additive genetic variance [5,11,42]. The incidence matrix for a maternal genetic random effect relates phenotypic observations to the mother. Thus, this estimates whether offspring of related mothers share similar trait values (whilst accounting for additive genetic effects), and thereby assigns a portion of the variance to maternal genetic effects. Thus with maternal genetic (VMa) and environmental (VMn) variance, the total variance is: (11)

It should be noted that the maternal genetic effect was simulated to have no covariance with any direct effect, and so this is also assumed for the model. However, any genetic covariance between direct and maternal effects will give a different error structure in the model [16].

Variance estimates for swimming speed.

Using a model with only additive genetic and residual variance, both components were overestimated Va = 2.067 (SE = 0.22, p<0.001), Vr = 2.082 (SE = 0.13, Fig 5A) and as a consequence a large heritability was found h2 = 0.498 (SE = 0.038).

thumbnail
Fig 5. Variance components of swimming speed (trait 3).

This trait was simulated to contain additive genetic, maternal genetic, environmental, maternal environmental, and residual variance, each with mean = 0 and variance = 1. For each of the models, the pair of plots show the absolute values (left plot) and the proportion (right plot) of total trait variance explained. a) Model containing only additive genetic (a) and residual variance (r). b) Model including additive genetic (a), environmental (n), and residual variance (r), and maternal variance estimated by including maternal identity as a random effect (M). c) Model including additive genetic (a), maternal genetic (Ma), environmental (n), maternal environmental (Mn), and residual variance (r)(the true model).

https://doi.org/10.1371/journal.pone.0197720.g005

A model that included direct environmental variance, and maternal variance (using maternal identity as a random effect) improved estimation (Fig 5B), but Va remained overestimated (Va = 1.459, SE = 0.22, p<0.001). A large maternal effect was found (VM = 1.302, SE = 0.15, p<0.001), although it underestimated the total maternal variance, which was expected to be ~2. Direct environmental variance was estimated correctly (Vn = 1.094, SE = 0.27, p<0.001). Overestimation of components in this model arose because maternal variance was simulated to be both environmentally and genetically derived, and so the complete model must estimate both maternal genetic variance, and maternal environmental variance.

The full animal model included additive genetic, maternal genetic, direct environmental, and maternal environmental variance (Eq 11). This model revealed a significant maternal genetic variance VMa = 0.792 (SE = 0.14, p<0.001), although this was slightly underestimated, and maternal environmental variance VMn = 1.345 (SE = 0.41, p<0.001), which was slightly overestimated. Other components were well estimated, Va = 1.119 (SE = 0.19, p<0.001), Vn = 1.118 (SE = 0.26, p<0.001), Vr = 0.922 (SE = 0.11), with heritability h2 = 0.211 (SE = 0.038).

This model follows the simulation of this trait. Whilst the direct genetic and environmental effects were well estimated, the maternal variance components were not as well estimated (Fig 5B). The maternal environmental variance was overestimated, whereas the maternal genetic variance was underestimated. This was due to the difficulty in distinguishing between the two maternal effects. These are challenging to disentangle, because related mothers are also likely to have similar environments. Whilst this is difficult to address with wild populations, increased data sizes (see Data Requirements) may aid in estimation. Experiments may also be of use, such as experimentally increasing dispersal between environments.

Epiallele similarity matrix

Using the first two traits (tail-fin colour and body size), we now show how variance in a trait could be partitioned using matrices derived from other possible sources of variance. We do not consider swimming speed (trait 3) from this point onward because of the difficulties estimating maternal environmental effects demonstrated above. Epigenetic information was converted into an epigenetic similarity matrix (Sepi, Fig 2C). As with the environmental information, the value for each island was first scaled so that it had a mean of zero and variance of one. From this scaled epigenetic information, the Euclidean distances between all pairs of individuals were calculated [82], and then scaled to be between 0 and 1. Thus, the elements of the matrix are: (12) where pl and ql are the vectors of scaled epigenetic measures for individuals i and j, akin to the elements of the Sn matrix (Eq 7). We expect that this matrix would capture similar information to the environmental similarity matrix above—individuals that experience similar environments will be more similar in their epigenome than those in dissimilar environments. To distinguish between genetic and epigenetic variances, there need to be epigenetic differences between relatives. If all epigenetic changes are obligatory [83], epigenetic variance is simply nested within the genetic variance.

Using the epigenetic similarity matrix, we can partition phenotypic variance into genetic and epigenetic variance parameters (Vepi): (13)

For traits including maternal epigenetic variance (Vmepi): (14)

Thus, these models estimate whether individuals that have similar epigenetic profiles have similar trait values. For tail-fin colour, this model gave variance estimates of Va = 1.097 (SE = 0.12, p<0.001), Vepi = 0.998 (SE = 0.18. p<0.001), and Vr = 0.684 (SE = 0.09), with a heritability of h2 = 0.395. Thus, the epigenetic similarity matrix accurately recapitulates the environmental effects on the trait (Fig 3C).

For body size, this model gave variance estimates of Va = 1.091 (SE = 0.16, p<0.001), Vepi = 1.085 (SE = 0.2, p<0.001), VMepi = 0.699 (SE = 0.12, p<0.001), and Vr = 0.614 (SE = 0.1; Fig 4E). Heritability was h2 = 0.313 (SE = 0.042). Thus, there was some underestimation of the maternal variance using the epigenetic similarity matrix, which is likely due to the fact that the trait was not simulated directly on the epigenetic effects.

Currently, it is challenging to obtain this sort of information for wild populations. However, we hope that the feasibility of obtaining this data will improve over time, in the same way that genotyping has. Therefore, the technique we present here will contribute to a growing area of research. Using matrices of epigenetic similarities could also be useful in laboratory or experimental situations. In cases where there is no genetic variation between individuals (e.g. using clones), data requirements will be lower, as Va = 0 by definition.

It should also be noted that in this simulation, using these alternative matrices technically produces type 1 errors (false positives), as tail-fin colour and body size were not simulated to be directly affected by either epigenetic or social effects. However, these models work as a proof of concept.

Social network matrix

To include social network information in a model, we used non-weighted connections between individuals to generate the Sc matrix of geodesic distances between individuals (i.e. the shortest path gi,j between individuals i and j; Fig 2D). Matrix diagonals must be one, so we adjusted the geodesic distances to have a path length of one to themselves, and added one to all other distances. We then took the reciprocal of each element, so that closely connected pairs had values close to one. Thus the elements of the Sc matrix are: (15)

A matrix that describes individuals’ proximity or connectedness in a social network can therefore be passed to the model to account for this: (16) where Vc is variance in the direct effect of the social network similarity on trait values. This model therefore estimates how much of the trait can be attributed to proximity of individuals in their social network.

Parental social network variance can also be estimated using the Sc matrix: (17) where VMc is the variance associated with the maternal effect due to the mother’s social interactions.

For tail-fin colour, we found a small but significant amount of variance due to social connections (Vc = 0.491, SE = 0.08, p<0.001). This was an underestimate of the variance due to nongenetic causes, and as a result the additive genetic variance Va = 1.501 (SE = 0.18, p<0.001), residual variance Vr = 1.226 (SE = 0.11), and heritability h2 = 0.467 (SE = 0.044) were overestimated (Fig 3D).

For body size, we found Vc = 0.265 (SE = 0.09, p<0.001) and VMc = 0.622 (SE = 0.13, p<0.001), showing that there was significant variance estimated by the social similarity matrix. Again, these values were underestimated relative to the true variance, and so the additive genetic Va = 1.213 (SE = 0.2, p<0.001) and residual variance Vr = 1.492 (SE = 0.13) were larger than one (Fig 4F).

To partition variance attributable to the social network, there must be sufficient contrast between the genetic and social similarity matrices. For example, in a social network in which connections only occur between parents and offspring, it will be difficult to disentangle social and additive genetic variance. If there are both full- and half-siblings within the data, and differences in the transmission of the trait from mothers and fathers, the social effect can be estimated as part of the maternal or paternal effects. More generally, social information can be incorporated if related individuals are present in different social groups, and differ in their connections in the network.

The matrix of social connectedness between individuals estimated some, but not all, of the environmental variance in the data. This was due to two main reasons. Firstly, the trait was simulated over the continuous environmental measures, not the social network, which was simulated using the spatial information for individuals. It is therefore unsurprising that it fails to capture all of the variance. Secondly, connectivity decreased with generational distance but environmental (and epigenetic) similarity did not. A trait that was truly inherited through a social network would be likely to show a more precise estimate of social variance.

Discussion

In this paper, we outlined how to simultaneously decompose genetic and nongenetic (environmental, epigenetic, and social) sources of similarity between individuals in an animal model. We also provided a supplementary tutorial (S1 File), using existing software, to encourage the use of these methods. This should facilitate wider adoption of these methods by researchers investigating phenotypic variation in wild or laboratory populations.

With biologically plausible simulated data, we estimated significant genetic, environmental, epigenetic, and social variation in all three traits. As has been shown previously, Va estimates of these traits changed significantly when variance due to nongenetic effects was estimated, leading to more accurate estimates of additive genetic variance and heritability [17,27]. For studies of wild populations, the actual effects of including nongenetic similarity matrices on estimates of variance will depend on the trait in question—as seen in the various effects of including spatial similarity in [56], [57], and [58].

The relative effect of including of nongenetic similarity on Va, Vr, and h2, will depend on whether the causes of similarity increase the resemblance between relatives or non-relatives. Nongenetic resemblance amongst relatives artificially inflates a trait’s estimated additive genetic variance when not accounted for. Conversely, increased resemblance between unrelated individuals will affect estimates of the residual variance, but heritability should be unbiased. In the merpeople simulation, relatives were similar in their continuous environmental measures, so that genetic variance was inflated by environmental effects when they were not accounted for.

For body size and swim speed, maternal effects were also included in the models, showing that it is important to account for maternal traits when they affect the trait of interest. For traits where maternal effects don’t exist, or have no effect on the estimation of additive genetic variance, the inclusion of maternal effect variance is unlikely to change estimates of additive genetic variance. Maternal environmental and genetic variance proved challenging to partition completely, but neglecting maternal environmental variance might cause over-estimation of maternal genetic, as well as additive genetic, variance.

Challenges with nongenetic matrices

Inclusion of such matrices with real-world data should be hypothesis driven, such that researchers need to have reason to expect that nongenetic similarity may account for some portion of phenotypic variance. These matrices are also non-sparse, and the models run more slowly and require more computer memory than an animal model containing only the additive genetic relatedness matrix. In addition, caution must be taken with considering whether other variance components may be confounded with the nongenetic variance. Dominance variance, for example, can contribute to phenotypic variance [4]. However, dominant genetic effects would mainly be confounded with environmental variance when there are large numbers of full-siblings in the data that also share environments.

Multiple nongenetic parameters themselves can have genetic influences (e.g. habitat selection, dispersal, social behaviours). For example, if there is a genetic basis to social behaviour [84], the social network and genetic variance could interact in forming the trait of interest. Including a behavioural trait or network metric (e.g. dominance) as a second response variable in a bivariate model could help account for this interaction. In addition, there may be interactions amongst sources of (nongenetic) variance. For example, [64] showed that a genetically determined maternal effect (maternal care) created a maternal environment that led to the methylation of rat pup DNA. Therefore, the maternal genetic and direct epigenetic effects are covarying and confounded, and the estimation of the epigenetic effect may depend on whether a maternal genetic effect is included, and vice versa [85,86].

The additive genetic variance and heritability of traits, can change over ontogeny [87] or senescence [88], but the additive genetic relatedness between individuals does not generally change over time. This is not necessarily the case for nongenetic factors. These may change over a lifetime in both the variance explained (as with additive genetic variance), and changes in the similarity matrix itself. The environment [89,90], epigenetic marks [91] and social networks [92] can all change over time, potentially altering the similarity between individuals. Furthermore nongenetic similarity during development may explain more variance than nongenetic similarity measured at the same time as the phenotype (or vice versa). For example, [93] showed the importance of early- rather than later-life environment on traits, and [57] used the home range overlap of mothers to investigate offspring early-life traits. The relative importance of similarity at different times could be investigated using multiple nongenetic similarity measures to test age-specific effects. Finally, whilst we have demostrated the use of these models using a semelparous population, researchers with iteroparous populations must consider how to deal with such life histories. Iteroparous populations may provide repeated measures of both the trait of interest and the environment (or other nongenetic factors). In such cases, researchers may be able to include the repeated measures of the environment separately within the matrix, matching these to the equivalent phenotypic observations. This would lead to some pseudoreplication is the environment is very similar between observations of an individual, so it might be important to include the average environment across the lifetime of each individual, and to include the individual identity as another random factor.

Carrying out these analyses may also be made more challenging by the presence of missing data for the nongenetic components. [56] included individuals with missing data in the model, allowing a similarity of 0 to all other individuals in the spatial overlap matrix (indicating no overlap). Our data has missing data (environmental, epigenetic, and social) for the parents of the first generation of individuals, and so these parents are not included in the nongenetic similarity matrices (including the parents, but with similarities of 0 to all individuals does not qualitatively change the variance components in the model). In cases where some (but not all) environmental (or epigenetic) data is missing for an individual, it may be possible to impute the values based on the mean environmental value, either at a dataset-wide or smaller spatial scale. Finally, caution must be taken about the potential effect of errors in nongenetic data. The effect of errors will be dependent on their frequency and whether there are random or systematic errors. Random errors should have relatively little effect, but systematic errors and those that correlate with an underlying trait may cause biased estimates [94].

Environmental similarity

Environmentally induced resemblance between relatives will depend upon the heterogeneity of the environment, and the rate and distance of dispersal within and between environment types. For example, in cases where dispersal away from the mother is low relative to the rate of environmental change, full- and maternal half-siblings are likely to share both genes and their environment, increasing their resemblance. In the extreme case, this may lead to the two variance components being completely confounded. Cross-fostering [22,81,95], transplantation [96,97], or laboratory experiments that fix certain sources of variance can help separate these effects from genetic variance.

Environmental measures can have non-linear and interacting effects on phenotypes. Fitting separate environmental measures as fixed effects makes assumptions about the linearity or interactions of such effects. These assumptions are not made with the environmental similarity matrix, though it does assume that its component environmental measures have equal weight. Thus the choice of how to include environmental measures in an animal model may depend on the questions researchers want to address. For example, tail-fin colour in merpeople could be strongly affected by salinity and less strongly affected by other environmental measures, which could potentially water down the strength of the covariance between the environment and the phenotype. If we were interested specifically in how salinity affects tail-fin colour we could choose to fit salinity as a fixed effect, rather than incorporating it into the matrix with the other nine environmental measures. For factorial environmental measures, these could be fitted as an additional random effect. Conversely, if we wished to estimate environmental variance dependent on all the measures, then we would retain it within the matrix.

Epigenetic similarity

The simulation that we show here is an oversimplified view of how epigenetics could be measured, but is used here to illustrate how such effects could be accounted for. We use a simulation of methylation at CpG islands [79,98], but there are many more forms and states of epigenetic variation that could be measured, such as methylation states at individual loci, histone modification (acetylation, methylation, phosphorylation, and ubiquitination), and RNAis [99,100]. Additionally, only one type of epigenetic measure is included—it assumes all cells are the same in epigenetic marks, or that we have only measured epigenetic marks in cells relevant to the trait. Alternatively, data used from Epigenome-Wide Association Studies (EWAS) could be used, wherein the matrix could contain information on the similarity between individuals in their percentage methylation at multiple loci across multiple cell types. Furthermore, the magnitude of the epigenetic effect in our simulation could depend upon the rate of resetting or reconstruction between generations. Thus, establishing the rate of reset or reconstruction is important in understanding how resetting affects quantitative genetic estimates and evolutionary processes [100102].

Additionally, the genomic architecture of epigenetic marks may be complex. For example, epigenetic marks could be in linkage disequilibrium with genetic variance [103], particularly if the genotype of an individual determines the probability of certain epigenetic states [83]. This makes drawing apart genetic and epigenetic variance challenging, and makes studies that examine epigenetic variation in a fixed genetic background valuable [13]. Such studies could make use of the models outlined here, excluding relatedness data, to estimate variance due to different nongenetic parameters (e.g. direct vs. maternal epigenetic effects).

Evolutionary implications

Nongenetic effects may directly affect evolutionary potential [18], and may be involved in the missing response to selection of wild populations [104]. The response of heritable nongenetic variation to environmental change can increase the rate of both phenotypic and genetic responses [105], and may therefore affect population persistence. However, the concept of nongenetic inheritance remains controversial and poorly understood. Theoretical progress has been made on this topic [[106]; [9]; [10]; Walsh.2018 Chapter 15], though empirical evidence is lacking in wild systems. The evolutionary importance of nongenetic inheritance relies on several prerequisites that need to be empirically evaluated. Firstly, nongenetic changes must be transmissible. Secondly, these changes must be (partially) stable over multiple transmission steps. Finally, nongenetic inheritance must affect evolutionary potential.

Multiple matrix models can provide novel insights for these problems: first, they allow us to demonstrate consistent nongenetic effects on the phenotype, and quantify the proportion of variance explained by these nongenetic effects. Second, where there is separate evidence for the stable transmission of the nongenetic effect (e.g. epigenetic inheritance), the model will show the relative effect of these nongenetic mechanisms compared to genetic mechanisms. For the traits simulated in this paper, a portion of the epigenome is maternally inherited, which in turn implies that a portion of Vepi is inherited. This animal model containing epigenetics does not distinguish between the portion of variance that is due to inherited (directly or reconstructed) or non-inherited epialleles, but it does imply that nongenetic inheritance may be having a significant effect on trait variation. If heritable epialleles are known a priori, these could be included as a separate matrix, allowing distinction between heritable and non-heritable epigenetic variation. In addition, comparison of trait covariances between different types of relatives may reveal the rate of epigenetic inheritance and reset [101].

Furthermore, in a population where individuals show close affinity to parental environments, such as inheriting a territory [107], niche transmission may form a part of the estimate of environmental variance [108]. Niche transmission cannot be implied directly from the animal model containing the environmental similarity matrix, but complementary external evidence may be used to support such a hypotheses in some populations. Similarly, estimates of variance due to social network connectivity may be used in conjunction with other information to make conclusions about social inheritance or cultural transmission of traits. For example, if teaching is known to happen between mothers and offspring, or within peer groups, then some portion of Vc is likely to represent the inheritance of the trait, rather than just that attributable to similarity.

Conclusion

A major advantage of the multiple matrix approach is its flexibility in where and how it can be applied. Nongenetic similarity matrices are not restricted to those demonstrated in this tutorial, but could also include a range of other measures of similarity, for example gene expression [109], histone states [103] or gut flora communities [110]. Consequently, this approach has the potential to bring together disconnected fields, such as ecology and molecular biology, which would connect information on broad-scale variation in natural systems to mechanistic information gained from model systems [111].

The multiple matrix extension to the animal model has potential applications to both wild and laboratory data. Wider adoption of multiple-matrix models has substantial promise for improving our understanding of ecological and evolutionary processes. The complex interaction of genetic and nongenetic sources of variance with dispersion, the rate of loss or reset of nongenetic information, and population structure may help to explain some of the differences between populations in estimates of genetic variance [112]. Therefore, the magnitude of nongenetic effects on evolutionary potential has not been quantified, but has the potential to be substantial.

Supporting information

S1 File. Tutorial.

Detailed tutorial for carrying out the analyses shown in this paper.

https://doi.org/10.1371/journal.pone.0197720.s001

(PDF)

S2 File. Data simulation.

Thorough detailing on the simulation used to create the data in this paper.

https://doi.org/10.1371/journal.pone.0197720.s002

(HTML)

S3 File. Data requirements.

Supplementary analyses using reduced data sizes to show the effect on estimation of variance components.

https://doi.org/10.1371/journal.pone.0197720.s003

(DOCX)

Acknowledgments

We thank Laura Gervais, Delphine Gourcilleau, Mathieu Latutrie, Pascal Marrot, Mathilde Mousset, and Charlotte Regan for useful feedback. We are also grateful to Matthew Silk for aid with social network simulation and analysis. We thank Bruce Walsh, Matthew Wolak, and two anonymous reviewers for their helpful comments. This work is part of a project that received funding from the European Research Council (ERC) under the European Union’s horizon 2020 research and innovation program (grant agreement ERC-CoG-2015-681484-ANGI to BP). This work was also supported by the French “Agence Nationale de la Recherche” (ANR-13-JSV7-002 “CAPA” to BP). CET, ISW, OCS and BP were also supported by the ANR funded French Laboratory of Excellence projects “LABEX TULIP” and “LABEX CEBA” (ANR-10-LABX-41, ANR-10-LABX-25-01) and ANR funded Toulouse Initiative of Excellence “IDEX UNITI” (ANR11-IDEX-0002-02).

References

  1. 1. Cubas P, Vincent C, Coen E. An epigenetic mutation responsible for natural variation in floral symmetry. Nature. 1999;401: 157–161. pmid:10490023
  2. 2. Latzel V, Zhang Y, Karlsson Moritz K, Fischer M, Bossdorf O. Epigenetic variation in plant responses to defence hormones. Annals of Botany. 2012;110: 1423–1428. pmid:22543179
  3. 3. Cortijo S, Wardenaar R, Colomé-Tatché M, Gilly A, Etcheverry M, Labadie K, et al. Mapping the epigenetic basis of complex traits. Science. 2014;343: 1145–1148. pmid:24505129
  4. 4. Falconer DS, Mackay TFC. Introduction to Quantitative genetics. Fourth. Essex, UK: Longman Group; 1996.
  5. 5. Lynch M, Walsh JB. Genetics and Analysis of Quantitative Traits. Sinauer , editor. Sunderland, MA; 1998.
  6. 6. Otto SP, Berkeley UC, Ca B. Genetic and cultural inheritance. Morrison Institute for Population and Resource Studies Paper Number 0064. 1994;
  7. 7. Grant BR, Grant PR. Cultural Inheritance of Song and Its Role in the Evolution of Darwin’s Finches. Evolution. 1996;50: 2471–2487. pmid:28565664
  8. 8. Legendre P. Spatial autocorralation: trouble or new paradigm? Ecology. 1993;74: 1659–1673.
  9. 9. Day T, Bonduriansky R. A unified approach to the evolutionary consequences of genetic and nongenetic inheritance. American Naturalist. 2011;178: E18–36. pmid:21750377
  10. 10. Townley S, Ezard THG. A G matrix analogue to capture the cumulative effects of nongenetic inheritance. Journal of Evolutionary Biology. 2013;26: 1234–1243. pmid:23656464
  11. 11. Kruuk LEB. Estimating genetic parameters in natural populations using the “animal model”. Philosophical Transactions of the Royal Society of London Series B-Biological Sciences. 2004;359: 873–890.
  12. 12. Wilson AJ, Réale D, Clements MN, Morrissey MM, Postma E, Walling CA, et al. An ecologist’s guide to the animal model. Journal of Animal Ecology. 2010;79: 13–26. pmid:20409158
  13. 13. Lush JL. Animal breeding plans. Des Moine, Iowa, USA: Iowa State University Press; 1937. p. 350.
  14. 14. Lande R. Quantitative genetic analysis of multivariate evolution, applied to the brain:body size allometry. Evolution. 1979;33: 402–416. pmid:28568194
  15. 15. Wolak ME, Keller LF. Dominance genetic variance and inbreeding in natural populations. Quantitative genetics in the wild. 2014. pp. 104–127.
  16. 16. Walsh JB, Lynch M. Evolution and Selection of Quantitative Traits. Press OU, editor. Oxford, UK; 2018.
  17. 17. Kruuk LEB, Hadfield JD. How to separate genetic and environmental causes of similarity between relatives. Journal of Evolutionary Biology. 2007;20: 1890–1903. pmid:17714306
  18. 18. Danchin E, Charmantier A, Champagne FA, Mesoudi A, Pujol B, Blanchet S. Beyond DNA: integrating inclusive inheritance into an extended theory of evolution. Nature Reviews Genetics. Nature Publishing Group; 2011;12: 475–486. pmid:21681209
  19. 19. Reale D, McAdam AG, Boutin S, Berteaux D. Genetic and plastic responses of a northern mammal to climate change. Proceedings of the Royal Society B: Biological Sciences. 2003;270: 591–596. pmid:12769458
  20. 20. Charmantier A, Perrins C, McCleery RH, Sheldon BC. Age-dependent genetic variance in a life-history trait in the mute swan. Proceedings of the Royal Society B: Biological Sciences. 2006;273: 225–232. pmid:16555791
  21. 21. Kruuk LEB, Merila J, Sheldon BC. Phenotypic selection on a heritable size trait revisited. American Naturalist. 2001;158: 557–571. pmid:18707351
  22. 22. Merilä J, Sheldon B. Avian quantitative genetics. New York, NY, USA: Springer US; 2001. pp. 177–255. https://doi.org/10.1007/978-1-4615-1211-0
  23. 23. Thomson CE, Bayer F, Cassinello M, Crouch N, Heap E, Mittell E, et al. Selection on parental performance opposes selection for larger body size in a wild population of blue tits. Evolution. 2017; 1–17. pmid:28106259
  24. 24. Mousseau TA, Fox CW. The adaptive significance of maternal effects. Trends in Ecology & Evolution. 1998;13: 403–407.
  25. 25. Green BS. Maternal effects in fish populations. Advances in marine biology. Cambridge, MA, USA: Academic Press; 2008. pp. 1–105. https://doi.org/10.1016/S0065-2881(08)00001-1 pmid:18929063
  26. 26. McAdam AG, Boutin S, Réale D, Berteaux D. Maternal effects and the potential for evolution in a natural population of animals. Evolution. 2002;56: 846–851. pmid:12038543
  27. 27. Noble DWA, McFarlane SE, Keogh JS, Whiting MJ. Maternal and additive genetic effects contribute to variation in offspring traits in a lizard. Behavioral Ecology. 2014;25: 633–640.
  28. 28. McFarlane SE, Gorrell JC, Coltman DW, Humphries MM, Boutin S, McAdam AG. The nature of nurture in a wild mammal’s fitness. Proceedings of the Royal Society B: Biological Sciences. 2015;282: 20142422–20142422. pmid:25833849
  29. 29. Willham RL. The covariance between relatives for characters composed of components contributed by related individuals. Biometrics. 1963;19: 18–27.
  30. 30. Willham RL. The role of maternal effects in animal breeding: III. Biometrical aspects of maternal effects in animals. Journal of Animal Science. 1972;35: 1288–1293. pmid:4567216
  31. 31. Griffing B. Selection in reference to biological groups. VI. Use of extreme forms of nonrandom groups to increase selection efficiency. Genetics. 1976;82: 723–731. pmid:1269914
  32. 32. Moore AJ, Brodie ED III, Wolf JB. Interacting phenotypes and the evolutionary process: I. Direct and indirect genetic effects of social interactions. Evolution. 1997;51: 1352–1362. pmid:28568644
  33. 33. Wolf JB, Brodie ED III, Cheverud JM, Moore AJ, Wade MJ. Evolutionary consequences of indirect genetic effects. Trends in Ecology & Evolution. 1998;13: 64–69.
  34. 34. Bijma P. The quantitative genetics of indirect genetic effects: a selective review of modelling issues. Heredity. Nature Publishing Group; 2014;112: 61–69. pmid:23512010
  35. 35. Muir W, Schinckel A. Incorporation of competitive effects in forest tree or animal breeding programs. 7th world congress on genetics applied to livestock production. Montpellier, France; 2002. p. 1407. https://doi.org/10.1534/genetics.104.035956
  36. 36. Arango J, Misztal I, Tsuruta S, Culbertson M, Herring W. Estimation of variance components including competitive effects of Large White growing gilts. Journal of Animal Science. 2005;83: 1241–1246. pmid:15890801
  37. 37. Muir WM. Incorporation of competitive effects in forest tree or animal breeding programs. Genetics. 2005;170: 1247–1259. pmid:15911590
  38. 38. Van Vleck LD, Cassady JP. Unexpected estimates of variance components with a true model containing genetic competition effects. Journal of Animal Science. 2005;83: 68–74. 83/1/68 [pii] pmid:15583044
  39. 39. Bijma P, Muir WM, Van Arendonk JA. Multilevel selection 1: Quantitative genetics of inheritance and response to selection. Genetics. 2007;175: 277–288. pmid:17110494
  40. 40. Wilson AJ, Gelin U, Perron M-C, Reale D. Indirect genetic effects and the evolution of aggression in a vertebrate system. Proceedings of the Royal Society B: Biological Sciences. 2009;276: 533–541. pmid:18842544
  41. 41. Santostefano F, Wilson AJ, Araya-Ajoy YG, Dingemanse NJ. Interacting with the enemy: Indirect effects of personality on conspecific aggression in crickets. Behavioral Ecology. 2016;27: 1235–1246.
  42. 42. Mrode RA, Thompson R. Linear models for the prediction of animal breeding values. Wallingford: CAB International; 1996.
  43. 43. Wilson AJ, Coltman DW, Pemberton JM, Overall ADJ, Byrne KA, Kruuk LEB. Maternal genetic effects set the potential for evolution in a free-living vertebrate population. Journal of Evolutionary Biology. 2005;18: 405–414. pmid:15715846
  44. 44. Brommer JE, Rattiste K. “Hidden” reproductive conflict between mates in a wild bird population. Evolution. 2008;62: 2326–2333. pmid:18616571
  45. 45. Liedvogel M, Cornwallis CK, Sheldon BC. Integrating candidate gene and quantitative genetic approaches to understand variation in timing of breeding in wild tit populations. Journal of Evolutionary Biology. 2012;25: 813–823. pmid:22409177
  46. 46. Adams MJ, Robinson MR, Mannarelli M-E, Hatchwell BJ. Social genetic and social environment effects on parental and helper care in a cooperatively breeding bird. Proceedings of the Royal Society B: Biological Sciences. 2015;282: 20150689. pmid:26063846
  47. 47. Koenig WD. Spatial autocorrelation of ecological phenomena. Trends in Ecology and Evolution. 1999;14: 22–26. pmid:10234243
  48. 48. Dormann CF, McPherson JM, Araújo MB, Bivand R, Bolliger J, Carl G, et al. Methods to account for spatial autocorrelation in the analysis of species distributional data: A review. Ecography. 2007;30: 609–628.
  49. 49. Gleeson AC, Cullis BR. Residual Maximum Likelihood (REML) Estimation of a neighbour model for field experiments. Biometrics. 1987;43: 277–287.
  50. 50. Cullis BR, Gleeson AC. Spatial analysis in field experiments—an extension to two dimensions. Biometrics. 1991;47: 1449–1460.
  51. 51. Magnussen S. Bias in genetic variance estimates due to spatial autocorrelation. Theoretical and Applied Genetics. 1993;86: 349–355. pmid:24193482
  52. 52. Dutkowski GW, Silva JC e, Gilmour AR, Lopez GA. Spatial analysis methods for forest genetic trials. Canadian Journal of Forestry Research. 2002;32: 2201–2214.
  53. 53. Dutkowski GW, Costa e Silva J, Gilmour AR, Wellendorf H, Aguiar A. Spatial analysis enhances modelling of a wide variety of traits in forest genetic trials. Canadian Journal of Forest Research. 2006;36: 1851–1870.
  54. 54. Costa e Silva J, Potts BM, Gilmour AR, Kerr RJ. Genetic-based interactions among tree neighbors: identification of the most influential neighbors, and estimation of correlations among direct and indirect genetic effects for leaf disease and growth in Eucalyptus globulus. Heredity. Nature Publishing Group; 2017;119: 1–11.
  55. 55. Van Der Jeugd HP, McCleery R. Effects of spatial autocorrelation, natal philopatry and phenotypic plasticity on the heritability of laying date. Journal of Evolutionary Biology. 2002;15: 380–387.
  56. 56. Stopher KV, Walling CA, Morris A, Guinness FE, Clutton-Brock TH, Pemberton JM, et al. Shared spatial effects on quantitative genetic parameters: Accounting for spatial autocorrelation and home range overlap reduces estimates of heritability in wild red deer. Evolution. 2012;66: 2411–2426. pmid:22834741
  57. 57. Regan CE, Pilkington JG, Berenos C, Pemberton JM, Smiseth PT, Wilson AJ. Accounting for female space sharing in St. Kilda Soay sheep (Ovis aries) results in little change in heritability estimates. Journal of Evolutionary Biology. 2017;30: 96–111. pmid:27747954
  58. 58. Germain RR, Wolak ME, Arcese P, Losdat S, Reid JM. Direct and indirect genetic and fine-scale location effects on breeding date in song sparrows. Journal of Animal Ecology. 2016;85: 1613–1624. pmid:27448623
  59. 59. Jablonka E, Raz G. Transgenerational epigenetic inheritance: prevalence, mechanisms, and implications for the study of heredity and evolution. The Quarterly Review of Biology. 2009;84: 131–176. pmid:19606595
  60. 60. Lira-Medeiros CF, Parisod C, Fernandes RA, Mata CS, Cardoso MA, Ferreira PCG. Epigenetic variation in mangrove plants occurring in contrasting natural environment. PLoS ONE. 2010;5: 1–8. pmid:20436669
  61. 61. Foust CM, Preite V, Schrey AW, Alvarez M, Robertson MH, Verhoeven KJF, et al. Genetic and epigenetic differences associated with environmental gradients in replicate populations of two salt marsh perennials. Molecular Ecology. 2016;25: 1639–1652. pmid:26880043
  62. 62. Wong AH, Gottesman II, Petronis A. Phenotypic differences in genetically identical organisms: The epigenetic perspective. Human Molecular Genetics. 2005;14: 11–18. pmid:15809262
  63. 63. Peaston AE, Whitelaw E. Epigenetics and phenotypic variation in mammals. Mammalian Genome. 2006;17: 365–374. pmid:16688527
  64. 64. Champagne FA, Meaney MJ. Stress during gestation alters postpartum maternal care and the development of the offspring in a rodent model. Biological Psychiatry. 2006;59: 1227–1235. pmid:16457784
  65. 65. Paszkowski J, Grossniklaus U. Selected aspects of transgenerational epigenetic inheritance and resetting in plants. Current Opinion in Plant Biology. Elsevier Ltd; 2011;14: 195–203. pmid:21333585
  66. 66. Waal E van de, Borgeaud C, Whiten A, Kawamura S, Hinde RA, Fisher J, et al. Potent social learning and conformity shape a wild primate’s foraging decisions. Science. 2013;340: 483–5. pmid:23620053
  67. 67. Aplin LM, Farine DR, Morand-Ferron J, Cockburn A, Thornton A, Sheldon BC. Experimentally induced innovations lead to persistent culture via conformity in wild birds. Nature. Nature Publishing Group; 2015;518: 538–541. pmid:25470065
  68. 68. Franz M, Nunn CL. Network-based diffusion analysis: a new method for detecting social learning. Proceedings of the Royal Society B. 2009;276: 1829–1836. pmid:19324789
  69. 69. Allen J, Weinrich M, Hoppitt W, Rendell L. Lobtail feeding in humpback whales. Science. 2013;340: 485–488. pmid:23620054
  70. 70. Hirsch BT, Stanton MA, Maldonado JE. Kinship shapes affiliative social networks but not aggression in ring-tailed coatis. PLoS ONE. 2012;7: e37301. pmid:22624010
  71. 71. Holekamp KE, Smith JE, Strelioff CC, Van Horn RC, Watts HE. Society, demography and genetic structure in the spotted hyena. Molecular Ecology. 2012;21: 613–632. pmid:21880088
  72. 72. Frère CH, Krützen M, Mann J, Connor RC, Bejder L, Sherwin WB. Social and genetic interactions drive fitness variation in a free-living dolphin population. Proceedings of the National Academy of Sciences of the United States of America. 2010;107: 19949–54. pmid:21041638
  73. 73. R Core Team. A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing; 2016; Available: http://www.r-project.org.
  74. 74. Butler D. asreml: asreml() fits the linear mixed model. R package version 3.0 [Internet]. 2009. Available: www.vsni.co.uk
  75. 75. Hadfield JD. MCMC methods for multi-response generalized linear mixed models: the MCMCglmm R package. Journal of Statistical Software. 2010;33: 1–22.
  76. 76. Gilmour AR, Gogel BJ, Cullis BR, Thompson R. ASReml User Guide 4.1. 2009. p. 398.
  77. 77. Meyer K. WOMBAT—A tool for mixed model analyses in quantitative genetics by restricted maximum likelihood (REML). Journal of Zhejiang University Science B. 2007;8: 815–821. pmid:17973343
  78. 78. Holand AM, Martino S. AnimalINLA: Bayesian Animal Models. R package version 1.4. [Internet]. 2016. Available: http://www.r-inla.org/related-projects/animalinla{\%}0A
  79. 79. Gardiner-Garden M, Frommer M. CpG Islands in vertebrate genomes. Journal of Molecular Biology. 1987;196: 261–282. pmid:3656447
  80. 80. Bérénos C, Ellis PA, Pilkington JG, Pemberton JM. Estimating quantitative genetic parameters in wild populations: A comparison of pedigree and genomic approaches. Molecular Ecology. 2014;23: 3434–3451. pmid:24917482
  81. 81. Danchin E, Pujol B, Wagner RH. The double pedigree: A method for studying culturally and genetically inherited behavior in tandem. PLoS ONE. 2013;8: 1–7. pmid:23700404
  82. 82. Johannes F, Porcher E, Teixeira FK, Saliba-Colombani V, Simon M, Agier N, et al. Assessing the impact of transgenerational epigenetic variation on complex traits. PLoS Genetics. 2009;5: e1000530. pmid:19557164
  83. 83. Richards EJ. Revisiting soft inheritance. Nature Reviews Genetics. 2006;7: 395–402. pmid:16534512
  84. 84. Fowler JH, Dawes CT, Christakis N a. Model of genetic variation in human social networks. Proceedings of the National Academy of Sciences. 2009;106: 1720–1724. pmid:19171900
  85. 85. Morrissey MB. Selection and evolution of causally covarying traits. Evolution. 2014;68: 1748–1761. pmid:24611949
  86. 86. Hadfield JD, Thomson CE. Interpreting selection when individuals interact. Methods in Ecology and Evolution. 2017;8: 688–699.
  87. 87. Wilson AJ, Kruuk LEB, Coltman DW. Ontogenetic patterns in heritable variation for body size: using random regression models in a wild ungulate population. The American Naturalist. 2005;166: E177–E192. pmid:16475080
  88. 88. Pujol B, Marrot P, Pannell JR. A quantitative genetic signature of senescence in a short-lived perennial plant. Current Biology. Elsevier Ltd; 2014;24: 744–747. pmid:24631239
  89. 89. Charmantier A, McCleery RH, Cole LR, Perrins C, Kruuk LEB, Sheldon BC. Adaptive phenotypic plasticity in response to climate change in a wild bird population. Science. 2008;320: 800–804. pmid:18467590
  90. 90. Charmantier A, Gienapp P. Climate change and timing of avian breeding and migration: Evolutionary versus plastic changes. Evolutionary Applications. 2014;7: 15–28. pmid:24454545
  91. 91. Polanowski AM, Robbins J, Chandler D, Jarman SN. Epigenetic estimation of age in humpback whales. Molecular Ecology Resources. 2014;14: 976–987. pmid:24606053
  92. 92. Wey TW, Blumstein DT. Social cohesion in yellow-bellied marmots is established through age and kin structuring. Animal Behaviour. Elsevier Ltd; 2010;79: 1343–1352.
  93. 93. Pigeon G, Festa-Bianchet M, Pelletier F. Long-term fitness consequences of early environment in a long-lived ungulate. Proceedings of the Royal Society B. 2017; pmid:28424347
  94. 94. Firth JA, Hadfield JD, Santure AW, Slate J, Sheldon BC. The influence of nonrandom extra-pair paternity on heritability estimates derived from wild pedigrees. Evolution. 2015;69: 1336–1344. pmid:25800997
  95. 95. Winney I, Nakagawa S, Hsu Y-HYH, Burke T, Schroeder J. Troubleshooting the potential pitfalls of cross-fostering. Methods in Ecology and Evolution. 2015;6: 584–592.
  96. 96. Kawecki TJ, Ebert D. Conceptual issues in local adaptation. Ecology Letters. 2004;7: 1225–1241.
  97. 97. Fang W, Taub DR, Fox GA, Landis RM, Natali S, Gurevitch J. Sources of Variation in Growth, Form, and Survival in Dwarf and Normal—Stature Pitch Pines (Pinus rigida, Pinaceae) in long-term transplant experiments. American Journal of Botany. 2006;93: 1125–1133. pmid:21642178
  98. 98. Jones PA, Takai D. The role of DNA methylation in mammalian epigenetics. Science. 2001;293: 1068–1070. pmid:11498573
  99. 99. Bossdorf O, Richards CL, Pigliucci M. Epigenetics for ecologists. Ecology Letters. 2008;11: 106–115. pmid:18021243
  100. 100. Kovalchuk I. Resisting transgenerational epigenetic reprogramming–examples and mechanisms of non-genetic inheritance. Non-Genetic Inheritance. 2013;1: 51–61.
  101. 101. Tal O, Kisdi E, Jablonka E. Epigenetic contribution to covariance between relatives. Genetics. 2010;184: 1037–1050. pmid:20100941
  102. 102. Kronholm I, Collins S. Epigenetic mutations can both help and hinder adaptive evolution. Molecular Ecology. 2016;25: 1856–1868. pmid:26139359
  103. 103. Taudt A, Colomé-Tatché M, Johannes F. Genetic sources of population epigenomic variation. Nature Reviews Genetics. Nature Publishing Group; 2016;17: 319–332. pmid:27156976
  104. 104. Pujol B, Blanchet S, Charmantier A, Danchin E, Facon B, Marrot P, et al. The Missing Response to Selection in the Wild. Trends in Ecology & Evolution. Elsevier Ltd; 2018;xx: 1–10.
  105. 105. Bonduriansky R, Crean AJ, Day T. The implications of nongenetic inheritance for evolution in changing environments. Evolutionary Applications. 2012;5: 192–201. pmid:25568041
  106. 106. Bonduriansky R, Day T. Nongenetic inheritance and its evolutionary implications. Annual Review of Ecology Evolution and Systematics. 2009;40: 103–125.
  107. 107. Berteaux D, Boutin S. Breeding dispersal in female North American red squirrels. Ecology. 2000;81: 1311–1326.
  108. 108. Odling-Smee J, Laland KN. Ecological inheritance and cultural inheritance: What are they and how do they differ? Biological Theory. 2011;6: 220–230.
  109. 109. Kim K, Zhang S, Jiang K, Cai L, Lee I-B, Feldman LJ, et al. Measuring similarities between gene expression profiles through new data transformations. BMC Bioinformatics. 2007;8: 29. pmid:17257435
  110. 110. Vries EJ de, Jacobs G, Breeuwer JA. Growth and transmission of gut bacteria in the western flower thrips, Frankliniella occidentalis. Journal of Invertebrate Pathology. 2001;77: 129–137. pmid:11273693
  111. 111. Richards CL, Alonso C, Becker C, Bossdorf O, Bucher E, Colome-Tatche M, et al. Ecological plant epigenetics: Evidence from model and non-model species, and the way forward. bioRxiv. 2017;5905:
  112. 112. Teplitsky C, Tarka M, Møller AP, Nakagawa S, Balbontin J, Burke TA, et al. Assessing multivariate constraints to evolution across ten long-term avian studies. PLoS ONE. 2014;9. pmid:24608111