Genetics of Callous-Unemotional Behavior in Children

  • Essi Viding,

    Contributed equally to this work with: Essi Viding, Thomas S. Price

    Affiliations King’s College London, Medical Research Council Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, De Crespigny Park, London, United Kingdom, Division of Psychology and Language Sciences, University College London, Gower St, London, United Kingdom

  • Thomas S. Price,

    Contributed equally to this work with: Essi Viding, Thomas S. Price

    Affiliations King’s College London, Medical Research Council Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, De Crespigny Park, London, United Kingdom, Institute for Translational Medicine and Therapeutics, University of Pennsylvania School of Medicine, Philadelphia, Pennsylvania, United States of America

  • Sara R. Jaffee,

    Affiliations King’s College London, Medical Research Council Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, De Crespigny Park, London, United Kingdom, Department of Psychology, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America

  • Maciej Trzaskowski,

    Affiliation King’s College London, Medical Research Council Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, De Crespigny Park, London, United Kingdom

  • Oliver S. P. Davis,

    Affiliation King’s College London, Medical Research Council Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, De Crespigny Park, London, United Kingdom

  • Emma L. Meaburn,

    Affiliation Department of Psychological Sciences, Birkbeck, University of London, Malet Street, London, United Kingdom

  • Claire M. A. Haworth,

    Affiliation King’s College London, Medical Research Council Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, De Crespigny Park, London, United Kingdom

  • Robert Plomin

    Affiliation King’s College London, Medical Research Council Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, De Crespigny Park, London, United Kingdom


16 Jul 2013: Viding E, Price TS, Jaffee SR, Trzaskowski M, Davis OSP, et al. (2013) Correction: Genetics of Callous-Unemotional Behavior in Children. PLOS ONE 8(7): 10.1371/annotation/0b16418f-ceb5-41b2-be2a-a20f0c56f9a6.

Abstract

Callous-unemotional behavior (CU) is currently under consideration as a subtyping index for conduct disorder diagnosis. Twin studies routinely estimate the heritability of CU as greater than 50%. It is now possible to estimate genetic influence using DNA alone from samples of unrelated individuals, not relying on the assumptions of the twin method. Here we use this new DNA method (implemented in a software package called Genome-wide Complex Trait Analysis, GCTA) for the first time to estimate genetic influence on CU. We also report the first genome-wide association (GWA) study of CU as a quantitative trait. We compare these DNA results to those from twin analyses using the same measure and the same community sample of 2,930 children rated by their teachers at ages 7, 9 and 12. GCTA estimates of heritability were near zero, even though twin analysis of CU in this sample confirmed the high heritability of CU reported in the literature, and even though GCTA estimates of heritability were substantial for cognitive and anthropometric traits in this sample. No significant associations were found in GWA analysis, which, like GCTA, only detects additive effects of common DNA variants. The phrase ‘missing heritability’ was coined to refer to the gap between variance associated with DNA variants identified in GWA studies versus twin study heritability. However, GCTA heritability, not twin study heritability, is the ceiling for GWA studies because both GCTA and GWA are limited to the overall additive effects of common DNA variants, whereas twin studies are not. This GCTA ceiling is very low for CU in our study, despite its high twin study heritability estimate. The gap between GCTA and twin study heritabilities will make it challenging to identify genes responsible for the heritability of CU.

Introduction

Callous-unemotional behavior (CU) – defined by low levels of empathy, absence of guilt and emotional unresponsiveness – is currently under consideration as a subtyping index for conduct disorder in DSM-V [1] and may have independent diagnostic value, even in the absence of a conduct disorder diagnosis [2],[3],[4],[5]. CU often occurs in the presence of conduct problems (see e.g. [6],[7]) and predicts vulnerability to psychopathy in adulthood [8]. Several longitudinal studies of large community samples now suggest that CU can also occur in the absence of clinical levels of conduct problems (e.g. [2],[3],[5],[7]). In the cases where current levels of conduct problems do not reach clinical levels in children with CU, sub-clinical levels of or later developing conduct problems are typically observed [2],[3],[5]. In addition, and perhaps more interestingly, individuals with CU and non-clinical levels of conduct problems commonly show elevated levels of other types of impairment, including poor peer relationships, low pro-sociality, and increased hyperactivity [2],[3],[5],[7]. CU therefore has the potential to serve as a useful clinical indicator for psychiatric vulnerability and psychosocial maladjustment, in addition to its utility in subtyping children with conduct disorder.

Individual differences in CU are estimated to be moderately to strongly heritable using the twin design that compares resemblance in monozygotic (MZ) twins and dizygotic (DZ) twins in community samples of children and adolescents (heritability estimates from .45–.67; see [9] for a recent review). Having elevated levels of CU is strongly heritable in childhood regardless of whether CU traits are accompanied by conduct problems or not [10]. Twin studies suggest that there is considerable overlap in the genes that influence CU and conduct/externalizing problems, but that there are also unique genetic influences on CU [11],[12],[13]; consistent with the finding that high levels of CU have been observed in the absence of clinical levels of conduct problems [3]. CU is moderately to strongly stable during childhood [14] and twin studies suggest that stability in CU/psychopathic behavior is driven by genetic influences [15],[16]. This finding of genetic stability led us to conduct a new twin analysis focused on a composite measure of CU across ages 7, 9 and 12 in an attempt to create a genetically enriched measure of CU.

The high heritability of CU has led to the first attempts to identify some of the genes responsible for its heritability. Only a handful of published candidate gene association studies to date have focused on CU in children or adolescents [17],[18],[19],[20],[21],[22],[23],[24],[25]. However, the sample sizes have been smaller than 200, fewer than half a dozen candidate genes have been investigated, and the results of these studies have been mixed and contradictory. Even adequately powered candidate gene studies have a poor record for replication [26],[27]. The poor track record for candidate gene studies has been one reason why the field has moved towards systematic genome-wide association (GWA) studies [28]. GWA studies were made possible by the development of commercially available DNA arrays that can genotype hundreds of thousands of single-nucleotide polymorphisms (SNPs) inexpensively [29]. The SNPs on DNA arrays are distributed across the 23 pairs of chromosomes in order to tag all common DNA sequence variation in the genome. Each SNP is correlated with the target trait as in a candidate gene study but GWA systematically scans the entire genome for associations and, crucially, corrects significance levels for multiple testing so that the accepted level of significance for GWA studies is p<.00000005.

Genome-wide association (GWA) studies for psychiatric phenotypes have shown that genome-wide “hits” are often in genes that were not previously hypothesized to influence the phenotype or not in traditional genes at all [28],[29],[30]. To our knowledge, only one GWA study incorporating CU has been published to date [31], which was from our group and focused on children with a combination of high CU and conduct problems, using allele frequencies estimated from DNA pooled across children in the high group and a control group rather than genotyping each child individually. Although our DNA pooling GWA study of the extremes had power to detect genes of large effect size, none of our associations reached genome-wide significance. The present report is the first standard GWA study of individual differences in CU assessed as a quantitative trait; our previous GWA study investigated the extremes of co-occurring CU and conduct problems using DNA pooling.

After more than a thousand published GWA studies across the life sciences [32], we now know that the largest effect sizes for GWA associations are likely to be very small, accounting for less than 1% of the variance of quantitative traits [33]. Although it is nonetheless useful to exclude the possibility of large effect sizes – which were found for example in the first GWA studies on macular degeneration [34] – it now seems unlikely that our GWA sample of about 3000 children would have the power to identify genome-wide significant associations of the expected small effect size. Much has been written about ‘missing heritability’ [35], the gap between GWA-identified associations and heritability as estimated in twin studies, with rare variants and non-additive effects as the most likely culprits [36]. GWA studies have been limited to the common SNPs used on commercially available DNA arrays and to additive effects of SNPs considered individually rather than multiply as they interact in their effect on the phenotype.

Another reason for the missing heritability gap could be that twin studies have overestimated heritability. A new method, implemented in a software package called Genome-wide Complex Trait Analysis (GCTA), uses DNA alone to estimate genetic influence from samples of unrelated individuals, not relying on the assumptions of the twin method [37],[38]. GCTA does not identify specific genes associated with traits. Instead, it uses chance similarity across hundreds of thousands of SNPs to predict phenotypic similarity pair by pair in a large sample of unrelated individuals. The essence of GCTA is to estimate genetic influence on a trait by predicting phenotypic similarity for each pair of individuals in the sample from their total SNP similarity. In contrast to the twin method, which estimates heritability by comparing phenotypic similarity of identical and fraternal twin pairs, whose genetic similarity is roughly 1.00 and .50, respectively, GCTA relies on comparisons of pairs of individuals whose genetic similarity varies from .00 to .02. GCTA extracts this tiny genetic signal from the noise of hundreds of thousands of SNPs using the massive information available from a matrix of thousands of individuals, each compared pair by pair with every other individual in the sample; for example, the 3,000-plus individuals in the present sample provided nearly 5 million pairwise comparisons.
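The scale of the pairwise comparison can be checked with a short calculation. The sketch below simply counts unordered pairs, n(n−1)/2; the two sample sizes are taken from the Methods (3,152 genotyped children after quality control, 2,930 of whom had CU data), and the function name is ours, chosen for illustration.

```python
from math import comb


def n_pairs(n_individuals: int) -> int:
    """Number of distinct unordered pairs among n individuals: n(n-1)/2."""
    return comb(n_individuals, 2)


# 3,152 = genotyped sample after quality control (see Methods)
# 2,930 = subset of that sample with CU data
pairs_genotyped = n_pairs(3152)
pairs_with_cu = n_pairs(2930)

print(f"{pairs_genotyped:,} pairs among 3,152 individuals")
print(f"{pairs_with_cu:,} pairs among 2,930 individuals")
```

The full genotyped sample gives just under 5 million pairs, which is the order of magnitude quoted above; the subset with CU data gives somewhat fewer.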

GCTA genetic similarity is not limited to the genotyped SNPs themselves, but also includes unknown causal variants to the extent that they are correlated with the SNPs. Mendel’s second law of inheritance is that genes (as they are now called) are inherited independently (a phenomenon now called linkage equilibrium), but Mendel did not know that genes can be on the same chromosome, in which case they are not inherited independently (linkage disequilibrium). This violation of Mendel’s second law is complicated by the fact that during meiosis, on average each pair of chromosomes – one from the mother and one from the father – crosses over (recombines) once; in the population, genes on the same chromosome are separated by this process of recombination to the extent that they are not close together on the chromosome. GCTA provides a lower-limit estimate of heritability because it misses genetic influence due to causal variants that are not highly correlated with the common SNPs on genotyping arrays.

A difference between GCTA estimates and twin-study estimates of heritability is that GCTA only estimates additive genetic effects, whereas the twin method captures nonadditive as well as additive genetic effects. Additive genetic effects are caused by the independent effects of alleles, which add up in their effect on a trait; nonadditive genetic effects are those that interact. Because GCTA adds up the effect of each SNP, it does not include gene-gene interaction effects; the twin method captures nonadditive as well as additive genetic effects because the DNA sequence of identical twins is virtually identical and thus they share all genetic effects, including nonadditive ones (see [39] for details). GCTA has been used to estimate heritability as captured by genotyping arrays for height [37], weight [40], psychiatric and other medical disorders [41],[42],[43], and personality [44]. We have used GCTA to estimate heritability for cognitive abilities using DNA alone and to compare these results to twin study heritability estimates from the same sample using the same measures at the same ages [45].

However, GCTA offers far more than a check on twin study heritability estimates – it provides important clues about missing heritability. Because GCTA estimates of genetic influence are limited in the same way as GWA studies to the additive effects of common DNA variants, GCTA will underestimate twin study heritability to the extent that nonadditive effects or rare variants are influential. Moreover, for this same reason, GCTA estimates of heritability rather than twin study estimates of heritability create a ceiling for GWA attempts to identify associations. Here we report the first GCTA estimate of genetic influence and compare it to a twin study heritability estimate using the same measure in the same sample in order to increase the precision of the comparison between them.

In summary, the overall aim of this research was to compare twin study heritability to GCTA heritability and to the results of GWA for CU assessed as a quantitative trait. The comparison of these three components of genetic influence has important implications for finding missing heritability.

Materials and Methods

Ethics Statement

Ethical authorization, including authorization to work with children, was given by The Joint South London and Maudsley and the Institute of Psychiatry Research Ethics Committee (05/Q0706/228). Parents were given a letter describing the general purpose of the study and written parental consent was required. It was made clear that participation was voluntary and participants could withdraw from the study whenever they wished.


The sample was drawn from the Twins Early Development Study (TEDS), a multivariate longitudinal study which recruited over 11,000 twin pairs born in England and Wales in 1994, 1995 and 1996 [46], whose families are representative of the UK population [47]. Twins with severe medical problems or severe birth complications or whose zygosity could not be determined were excluded from the sample. To decrease heterogeneity of ancestry, the sample was restricted to families who identified themselves as white and whose first language was English.

In order to make our twin sample as comparable as possible to our GCTA and GWA samples, we selected those twin pairs for whom one member of the twin pair was chosen for the GCTA and GWA analyses. For GCTA and the discovery sample of the GWA analysis, we included unrelated individuals by selecting only one member of each twin pair for whom GWA genotyping and CU data were available. For the GCTA analysis, we verified that the unrelated individuals were less genetically related than fourth-degree relatives (pairwise genetic relatedness below .025); relatedness above .025 is the standard GCTA exclusion criterion.

Based on these selection criteria, our twin analyses included 1,099 MZ pairs and 1,787 DZ pairs. Our GCTA and GWA discovery sample included 2,930 children; the slightly smaller number of twin pairs was caused by twin pairs for whom the co-twin did not have CU data.

Genotyping Protocol

DNA was extracted from buccal cheek swabs and sent to the Wellcome Trust Sanger Institute, Hinxton, UK for genotyping as part of the Wellcome Trust Case Control Consortium 2. A total of 3,747 DNA samples from unrelated children in TEDS were sent for genome-wide DNA array genotyping used in our GCTA and GWA analyses. In total, 3,665 samples were successfully hybridized to Affymetrix GeneChip 6.0 SNP genotyping arrays using experimental protocols recommended by the manufacturer (Affymetrix Inc., Santa Clara, CA). The raw image data from the arrays were normalized and pre-processed according to the manufacturer’s guidelines.

Genotypes for the Affymetrix arrays were called using CHIAMO. Where there was a sufficient quantity of DNA, samples were also re-genotyped on a panel of 30 SNPs (including 26 autosomal SNPs present on the Affymetrix array, and 4 SNPs on the X chromosome to verify gender) using the Sequenom iPlex Gold assay (Sequenom Inc., San Diego, CA).

Quality Control: Samples

We identified and removed samples whose genome-wide patterns of diversity differed from those of the collection at large, interpreting these differences as possibly due to biases or artifacts. Outlying individuals were identified on the basis of call rate, heterozygosity, relatedness and ancestry using a Bayesian clustering approach [48].

To obtain a set of putatively unrelated individuals we used a hidden Markov model (HMM) to infer identity by descent along the genome between pairs of individuals. Among pairs of closely related individuals, we excluded the member of the pair with the lowest call rate, iteratively repeating this procedure to obtain a set of individuals with pairwise identity by descent less than 5% [48]. Of the individuals genotyped, samples were excluded because of low call rate or heterozygosity outliers (377), unusual hybridization intensity (9), atypical population ancestry (59), sample duplication or relatedness to other sample members (83), and gender mismatches (13). In addition, 54 samples were excluded because fewer than 90% of genotypes were called identically on the genome-wide array and Sequenom panel. The remaining samples were consistent with previous genotyping. In total, 513 samples were excluded by these quality control criteria. The remaining sample of 3,152 individuals included 1,446 males and 1,706 females. Of this sample, 2,930 children had valid data for CU at age 7, 9, or 12, and 2,140 had data at two or more ages.
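The iterative exclusion of related individuals can be illustrated with a small sketch. This is not the pipeline used for the study: the input format, the function name, and the choice to process the most closely related remaining pair first are all assumptions made for illustration. Only the 5% threshold and the rule of dropping the member with the lower call rate come from the description above.

```python
def prune_related(ibd_pairs, call_rate, threshold=0.05):
    """Greedy sketch of relatedness pruning.

    ibd_pairs: list of (sample_a, sample_b, ibd_sharing) tuples (hypothetical format)
    call_rate: dict mapping sample id -> genotype call rate
    Returns the set of samples kept once no remaining pair shares >= threshold.
    """
    kept = set(call_rate)
    while True:
        # Pairs that are still both present and still too closely related
        related = [(ibd, a, b) for a, b, ibd in ibd_pairs
                   if ibd >= threshold and a in kept and b in kept]
        if not related:
            return kept
        # Assumption: handle the most closely related pair first
        _, a, b = max(related)
        # Drop the member with the lower call rate (ties broken by sample id)
        drop = min((a, b), key=lambda s: (call_rate[s], s))
        kept.discard(drop)


# Toy data, invented for illustration only
toy_pairs = [("A", "B", 0.50), ("B", "C", 0.25), ("C", "D", 0.01)]
toy_call_rate = {"A": 0.990, "B": 0.970, "C": 0.980, "D": 0.995}
unrelated = prune_related(toy_pairs, toy_call_rate)
print(sorted(unrelated))
```

In the toy data, removing B resolves both related pairs at once, which is why an iterative procedure can retain more samples than dropping one member of every flagged pair independently.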

Quality Control: SNPs

An index of information (Fisher) for the allele frequency at each of 932,533 called SNPs was calculated using SNPTEST version 2.1.1 [49]. Autosomal SNPs were excluded if this information index was below 0.975, if the minor allele frequency was less than 1%, if greater than 2% of genotype data were missing, or if the Hardy Weinberg p-value was lower than 10⁻²⁰. Association between the SNP and the plate on which samples were genotyped was calculated and SNPs with a plate effect p-value less than 10⁻⁶ were also excluded. In addition, SNPs were manually filtered for call quality by visual inspection of the hybridization intensity plots using EVOKER software. The above filters removed 22.7% of the SNPs, leaving 699,388 autosomal SNPs for further analysis.
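The automated SNP filters can be summarized as a single predicate. The sketch below is illustrative only: the record type and field names are hypothetical, the thresholds are those stated above, and the manual inspection of intensity plots is not represented.

```python
from dataclasses import dataclass


@dataclass
class SnpStats:
    """Per-SNP summary statistics (hypothetical field names)."""
    info: float     # information index for the allele frequency
    maf: float      # minor allele frequency
    missing: float  # fraction of genotypes missing
    hwe_p: float    # Hardy-Weinberg p-value
    plate_p: float  # p-value for association with genotyping plate


def passes_qc(s: SnpStats) -> bool:
    """True if the SNP survives every automated exclusion criterion."""
    return (
        s.info >= 0.975        # excluded if information index below 0.975
        and s.maf >= 0.01      # excluded if minor allele frequency below 1%
        and s.missing <= 0.02  # excluded if more than 2% of genotypes missing
        and s.hwe_p >= 1e-20   # excluded if Hardy-Weinberg p below 10^-20
        and s.plate_p >= 1e-6  # excluded if plate-effect p below 10^-6
    )


# Toy SNPs, invented for illustration only
good_snp = SnpStats(info=0.99, maf=0.20, missing=0.005, hwe_p=0.3, plate_p=0.5)
rare_snp = SnpStats(info=0.99, maf=0.004, missing=0.005, hwe_p=0.3, plate_p=0.5)
print(passes_qc(good_snp), passes_qc(rare_snp))
```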

SNP Imputation

In order to increase the number of SNPs used in our GCTA and GWA analyses, imputation was carried out using the IMPUTE version 2 software [50] on the genotype data after application of quality control procedures, using a two-stage approach with both a haploid reference panel and a diploid reference panel. For the haploid reference panel we used HapMap phase II and III SNP data on the 120 unrelated CEU trios. 5,175 WTCCC2 controls were genotyped on both Affymetrix 6.0 and Illumina Human1.2M-Duo arrays (Illumina Inc., La Jolla, CA), and these were used for the diploid reference panel. Imputed SNPs were retained for analysis if they were genotyped using the Affymetrix 6.0 array, if they were genotyped using the Illumina Human1.2M-Duo array and obtained an information score ≥0.90, or if they were imputed and obtained an information score ≥0.98. Using these criteria, 1,024,929 imputed SNPs were retained for the GCTA and GWA analyses, in addition to the 699,388 measured SNPs described above.

CU Trait Measures

CU at 7, 9, and 12 years of age was assessed by each child’s school teacher using a paper (or at 12, online) questionnaire. Teacher ratings were obtained towards the end of the academic year when the class teacher had known the child for most of the academic year. In the U.K. there are no systematic differences between monozygotic and dizygotic twins in whether they are placed in the same or different classes. The percentage of twins rated by the same teacher is 65% at age 7, 58% at age 9, and 33% at age 12.

Teachers are familiar with a broad range of children and have expertise regarding normative child development. Teacher ratings have been found to show higher internal consistency and stability than parent ratings [51], and twin analyses indicate that teacher ratings are free of rater bias typically found in parent ratings [52]. In line with this, teacher ratings for CU show better internal consistency (e.g. α = .74 at age 7), indicating reliable detection of the latent construct of interest; parent ratings of CU show much poorer levels of internal consistency (e.g. α = .45 at age 7). Finally, the means and variances for the CU scale are typically lower for parents than for teachers, indicating that parents are poorer at discriminating children high in CU [53]. These problems with the parent rating scales led us to focus on the teacher ratings. The CU score was calculated as the total for seven items used in previous heritability analyses of CU (e.g. [10],[13]). These were original Antisocial Process Screening Device [54] CU items (e.g. ‘Does not show feelings or emotions’) or Strengths and Difficulties Questionnaire [55] items selected to reflect CU (e.g. ‘Considerate of other people’s feelings’ (reverse scored)). The sampling frame for CU at 7 and 12 included all children in TEDS. The sampling frame for CU at 9 included only children born between January 1994 and August 1995. For the purposes of twin and GCTA analysis, we calculated a composite variable from the mean of available teacher reports of CU at 7, 9, and 12 years. This composite required that at least one measurement be non-missing.
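The composite rule described above (mean of whichever teacher reports are available, with at least one required) can be written compactly. The function name and the use of `None` to mark a missing report are assumptions made for this sketch.

```python
from statistics import mean
from typing import Optional, Sequence


def cu_composite(scores: Sequence[Optional[float]]) -> Optional[float]:
    """Mean of the available CU scores at ages 7, 9 and 12.

    Missing reports are passed as None. Returns None when every
    report is missing, since the composite requires at least one.
    """
    available = [s for s in scores if s is not None]
    return mean(available) if available else None


# Toy scores, invented for illustration only
print(cu_composite([3, None, 5]))           # two of three reports available
print(cu_composite([2.0, None, None]))      # a single report is sufficient
print(cu_composite([None, None, None]))     # no reports: no composite
```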

Statistical Analysis: Twin

MZ and DZ twin intraclass correlations were calculated and standard twin model-fitting was used to estimate additive genetic (A), common or shared environment (C), and residual or non-shared environment (E) [39]. Although twin model-fitting is usually referred to as ACE models, in fact the twin design – unlike GCTA, discussed in the next section – can include non-additive as well as additive genetic effects. In quantitative genetics, estimates of heritability that include non-additive as well as additive genetic effects are called broad heritability, in contrast to narrow heritability, which is limited to additive genetic effects. In twin analysis, the additive genetic model assumes that DZ twins are half as similar as MZ twins because the genetic relatedness of DZ twins is 50% for additive genetic effects, whereas the relatedness of MZ twins is 100%. This twofold greater genetic resemblance of MZ as compared to DZ twins is the reason why heritability is often estimated by doubling the difference between MZ and DZ correlations. (For example, MZ and DZ correlations of 0.80 and 0.40, respectively, imply 80% heritability).

In contrast, the hallmark of non-additive genetic effects is that the DZ correlation is less than half the MZ correlation because epistatic (inter-locus) gene-gene interactions scarcely contribute to DZ similarity but are shared entirely by MZ twins. If non-additive genetic effects are important, the twin method will detect these effects, although its ability to estimate these effects is limited. For example, if MZ and DZ twin correlations are 0.80 vs. 0.20, respectively, simply doubling the difference between MZ and DZ correlations – an additive genetic model which would be inappropriate given the non-additive pattern of twin correlations – would yield a heritability estimate of 120%. However, heritability cannot exceed the MZ twin correlation, so that the heritability estimate in this example would be constrained to be 80%. Model-fitting would show that the ACE model does not fit the data in this example. An allowance is made for non-additivity in a twin model called ADE, in which ‘D’ refers to dominance (intra-locus allele-allele interaction). In the ADE model, dominance discounts DZ resemblance from 50% for the A parameter to 25% for the D parameter. This adjustment does not cover the extreme epistatic case in which the DZ correlation could be zero despite a high MZ correlation. Even in this extreme case, however (for example, an MZ correlation of 0.80 and a DZ correlation of 0.00), twin model-fitting would detect genetic influence and would cap the heritability estimate at 80%, as suggested by the MZ correlation of 0.80.
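The worked examples in the two preceding paragraphs can be reproduced with a few lines. This sketch uses only the doubling rule and the cap at the MZ correlation described in the text; it is not a substitute for the maximum-likelihood model-fitting actually used, and the function name is ours.

```python
def doubled_difference(r_mz: float, r_dz: float) -> tuple[float, float]:
    """Return (raw, capped) heritability from twin correlations.

    raw    = 2 * (r_MZ - r_DZ), the additive doubling rule
    capped = raw limited to r_MZ, since heritability cannot exceed
             the MZ twin correlation
    """
    raw = 2 * (r_mz - r_dz)
    capped = min(raw, r_mz)
    return raw, capped


# The three cases discussed in the text
for r_mz, r_dz in [(0.80, 0.40), (0.80, 0.20), (0.80, 0.00)]:
    raw, capped = doubled_difference(r_mz, r_dz)
    print(f"rMZ={r_mz:.2f} rDZ={r_dz:.2f} raw={raw:.2f} capped={capped:.2f}")
```

Only the first case, in which the DZ correlation is exactly half the MZ correlation, gives the same raw and capped values; in the other two the raw figure exceeds 100% and the cap at 0.80 applies.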

In summary, the twin design can detect the presence of non-additive genetic effects, although it is limited in its ability to distinguish additive and non-additive genetic effects. Greater detail about distinguishing additive and non-additive genetic variance in twin designs is available (e.g. [39]). As is usual in twin analyses, residualized scores were used that were independent of age and sex because age and sex are perfectly correlated across pairs, which would be misinterpreted as C in twin analyses. The OpenMx package for R was used for twin maximum-likelihood model-fitting using full-information matrices [56].

Statistical Analysis: GCTA

We used the software package Genome-wide Complex Trait Analysis (GCTA; [38]) to estimate genetic influence from pair-by-pair similarity across all of the SNPs on the DNA array. We applied GCTA analysis to a composite variable from the mean of available teacher reports of CU at 7, 9, and 12 years, the same variable used in the twin analysis. This univariate phenotype was submitted to GCTA [38] in order to estimate by restricted maximum likelihood the proportion of variance explained by the genome-wide panel of SNPs. Both genotyped and imputed SNPs were included in the analysis. Individuals were deleted from the analysis if their estimated relatedness with another member of the dataset exceeded 0.025. Sex, birth year/school year cohort, and 8 principal components of the genotype data were included as covariates in the GCTA analysis.
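To make "pair-by-pair similarity across all of the SNPs" concrete, the sketch below computes an off-diagonal genetic relatedness value for each pair using the form commonly given in descriptions of GCTA: the average over SNPs of (x_j − 2p)(x_k − 2p) / (2p(1 − p)), where x is an allele count (0, 1 or 2) and p the allele frequency. It is a toy illustration rather than the GCTA implementation; allele frequencies are estimated from the toy sample itself, monomorphic SNPs are skipped, and all names are ours.

```python
def pairwise_relatedness(genotypes):
    """Off-diagonal genetic relatedness for every pair of individuals.

    genotypes: list of per-individual lists of allele counts (0, 1 or 2),
               one entry per SNP.
    Returns a dict mapping (j, k) with j < k to the relatedness estimate.
    """
    n_ind = len(genotypes)
    n_snp = len(genotypes[0])
    # Allele frequency per SNP, estimated from this sample (an assumption)
    freqs = [sum(g[i] for g in genotypes) / (2 * n_ind) for i in range(n_snp)]
    # Monomorphic SNPs carry no information and would divide by zero
    usable = [i for i, p in enumerate(freqs) if 0 < p < 1]

    relatedness = {}
    for j in range(n_ind):
        for k in range(j + 1, n_ind):
            total = sum(
                (genotypes[j][i] - 2 * freqs[i]) * (genotypes[k][i] - 2 * freqs[i])
                / (2 * freqs[i] * (1 - freqs[i]))
                for i in usable
            )
            relatedness[(j, k)] = total / len(usable)
    return relatedness


# Toy genotypes (4 individuals x 4 SNPs), invented for illustration only
toy = [[0, 1, 2, 1], [0, 1, 2, 1], [2, 1, 0, 1], [1, 1, 1, 1]]
rel = pairwise_relatedness(toy)
# Pairs exceeding the 0.025 cutoff used in the analysis
too_related = [pair for pair, a in rel.items() if a > 0.025]
print(rel[(0, 1)], too_related)
```

With only four SNPs the values are far noisier than anything seen in real data; the point is the structure of the calculation and of the relatedness cutoff, not the magnitudes.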

Statistical Analysis: GWA

Genome-wide association (GWA) analysis was conducted using a linear regression approach implemented in SNPTEST v2.0 [57] under an additive model. This approach uses a frequentist method to account for uncertainty of genotype information [49]. Because even small differences in allelic frequency within subgroups in the population can generate false-positive results, eight principal components representing population ancestry were used to control for population stratification. Sex and DNA sample plate number were also included as covariates. Results were visualized using Manhattan plots, quantile-quantile (Q-Q) plots, and genotype-phenotype plots, generated in R [58]; a regional association plot was created using LocusZoom [59].

Following SNP quality control and SNP imputations, described earlier, we performed several preliminary analyses prior to GWA analysis. First, principal component analysis (PCA) was used to attenuate GWA biases due to population structure. PCA was conducted on a subset of 105,556 autosomal SNPs post QC, selected after pruning to remove SNPs in high linkage disequilibrium (r² > 0.2) and to exclude high linkage disequilibrium genomic regions so as to ensure that only genome-wide effects were detected [60]. Application of the Tracy-Widom test indicated that eight principal components were significant using a threshold of p<0.05. We caution that the inclusion of principal components as covariates may not be sufficient to remove biases in estimation due to population structure [61]. Our second preliminary analysis involved normalizing CU trait scores by transforming the ranked data to the quantiles of a standard Normal distribution using the van der Waerden transformation [62], and taking the residuals after regressing the resulting score on age at measurement.
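The van der Waerden transformation replaces each score with the standard Normal quantile of rank/(n + 1). A minimal sketch follows; giving tied scores their average rank is an assumption of this sketch (the handling of ties is not described above), and the subsequent regression on age at measurement is not shown.

```python
from statistics import NormalDist


def van_der_waerden(values):
    """Map scores to standard Normal quantiles of rank / (n + 1).

    Tied values receive the average of the ranks they span
    (an assumption of this sketch).
    """
    n = len(values)
    order = sorted(range(n), key=lambda i: values[i])
    ranks = [0.0] * n
    i = 0
    while i < n:
        j = i
        # Extend j across a run of tied values in sorted order
        while j + 1 < n and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg_rank = (i + j) / 2 + 1  # mean of the 1-based ranks i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg_rank
        i = j + 1
    normal = NormalDist()
    return [normal.inv_cdf(r / (n + 1)) for r in ranks]


# Toy scores, invented for illustration only
scores = van_der_waerden([10, 20, 30])
tied = van_der_waerden([1, 1, 5])
print(scores, tied)
```

Because rank/(n + 1) never reaches 0 or 1, the transformed scores are always finite, which is one practical reason for preferring this divisor to rank/n.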

For the GWA analysis, each autosomal SNP was tested for association with CU at 7, 9, and 12 using a multivariate method that was similar in its essentials to the test of a longitudinal composite but required slightly less restrictive statistical assumptions. Using a linear regression framework, we calculated score statistics to test a hypothesis that the SNP had an equal effect on CU at each age. The test evaluated a single parameter and hence had 1 degree of freedom. Sex, birth year/school year cohort, and the first eight principal components of the genotype data were included as covariates in the regression model. The statistical framework was able to account for missingness in the outcome variables as a function of these covariates, assuming that data for these covariates were missing at random [63]. The test was implemented as a custom library for R.

Probability values were adjusted by genomic control [64] separately for genotyped SNPs, SNPs that were genotyped in the WTCCC2 controls and imputed in the TEDS sample, and SNPs that were imputed in both WTCCC2 controls and the TEDS sample.
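Genomic control rescales the test statistics by an inflation factor λ, usually taken as the median observed 1-degree-of-freedom chi-square statistic divided by its expected median under the null (about 0.455). The sketch below illustrates that calculation for one set of p-values; in the analysis it would be applied separately to each of the three SNP classes. Leaving statistics unchanged when λ is below 1 is a common convention adopted here as an assumption, and all function names are ours.

```python
from statistics import NormalDist, median

_NORMAL = NormalDist()
# Median of a chi-square distribution with 1 df: the square of the
# 75th percentile of the standard Normal (approximately 0.455)
CHI2_1DF_MEDIAN = _NORMAL.inv_cdf(0.75) ** 2


def p_to_chi2(p: float) -> float:
    """1-df chi-square statistic for a two-sided p-value (0 < p <= 1)."""
    return _NORMAL.inv_cdf(p / 2) ** 2


def chi2_to_p(x: float) -> float:
    """Two-sided p-value for a 1-df chi-square statistic."""
    return 2 * _NORMAL.cdf(-(x ** 0.5))


def genomic_control(p_values):
    """Return (lambda, adjusted p-values) for one set of tests."""
    chi2 = [p_to_chi2(p) for p in p_values]
    lam = median(chi2) / CHI2_1DF_MEDIAN
    scale = max(lam, 1.0)  # assumption: never deflate when lambda < 1
    return lam, [chi2_to_p(x / scale) for x in chi2]


# Toy p-values: evenly spaced (null-like), then artificially inflated by 20%
null_p = [(i + 0.5) / 1000 for i in range(1000)]
inflated_p = [chi2_to_p(1.2 * p_to_chi2(p)) for p in null_p]
lam_null, _ = genomic_control(null_p)
lam_inflated, adjusted = genomic_control(inflated_p)
print(round(lam_null, 3), round(lam_inflated, 3))
```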

Results

Twin Analysis

Tables 1–3 present twin correlations and the results of model-fitting analyses for the composite CU trait. The difference in the MZ and DZ twin correlations suggests substantial genetic influence. The DZ correlation is about half the MZ correlation, suggesting no influence of shared environment or non-additive genetic variance, unless these two factors mask each other’s effect. Model-fitting results confirm the results gleaned from the MZ and DZ twin correlations. A model that includes only A and E, excluding C, fits the data best. Model-fitting parameter estimates for the full ACE model indicate substantial heritability (0.64 ± 0.03) and negligible shared environmental influence (0.00 ± 0.02).

Table 1. Twin correlations and model-fitting results for a callous unemotional (CU) trait longitudinal composite.

Table 2. Model-fitting estimates for a callous unemotional (CU) trait longitudinal composite.

Table 3. Fit statistics for structural equation modeling.


The GCTA estimate of genetic variance was 0.07, which was not significant given our sample size. The standard error of 0.12 suggests that the proportion of variance explained by the common SNPs tagged by our genome-wide genotypes is highly likely to be less than 20%, which suggests a wide gap with our twin study heritability estimate of 64%. Unlike twin analysis, GCTA does not discriminate C and E because each individual is from a different family. In GCTA, E is a residual term that refers to all variance (including error of measurement) that cannot be attributed to additive genetic effects of the common SNPs included on the DNA array.

GWA Analysis

We tested 699,388 genotyped autosomal SNPs and 1,024,929 imputed autosomal SNPs that passed quality control thresholds. The analysis included 2,930 children with admissible data for both genotype and CU. The quantile-quantile (QQ) plot presented in Figure 1 illustrates the distribution of probability values from the genomewide test of association with CU. The line indicates the null hypothesis for the relationship between the observed distribution of probability values and results expected by chance alone. The shaded area indicates the 95% confidence interval around the null values. The resemblance of the distribution of observed test statistics to the null distribution indicates that the results are consistent with chance. We observed very little inflation of the test statistics (λ<1.01).

Figure 1. Quantile-quantile plot illustrating the distribution of probability values from the genomewide test of association with CU.

X axis: expected quantile of minus log probability values under the null hypothesis. Y axis: observed quantile of minus log probability values for association after adjustment by genomic control. The straight line at x = y represents the null distribution and the gray area surrounding the line indicates a 95% confidence band around the null.
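The coordinates of a plot like this can be sketched as follows. The plotting position `(rank - 0.5) / n` for the expected p values is an assumption (other conventions exist), the genomic control adjustment is omitted, and the input p values are invented for illustration.

```python
import math

def qq_points(p_values):
    """Return (expected, observed) pairs of -log10(p) for a quantile-quantile plot.

    Expected p values use the plotting position (rank - 0.5) / n, which is
    one common convention and an assumption here.
    """
    observed = sorted(p_values)
    n = len(observed)
    points = []
    for rank, p_obs in enumerate(observed, start=1):
        p_exp = (rank - 0.5) / n
        points.append((-math.log10(p_exp), -math.log10(p_obs)))
    return points

# Hypothetical p values, not taken from the study.
demo = qq_points([0.5, 0.01, 0.2, 0.9])
for expected, observed in demo:
    print(f"expected = {expected:.3f}, observed = {observed:.3f}")
```

Points that fall along the line x = y indicate agreement between the observed p values and those expected by chance.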

No single SNP achieved genomewide significance using a conventional significance threshold of p<5×10⁻⁸ [65]. The results for three SNPs achieving suggestive significance at the less stringent threshold of p<5×10⁻⁶ are summarized in Table 4.
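A minimal sketch of how the two thresholds partition results is given below. The `classify` helper and the example p values are hypothetical. The naive Bonferroni figure is shown only for comparison and assumes all tested SNPs are independent, which they are not because of linkage disequilibrium.

```python
GENOMEWIDE = 5e-8   # conventional genomewide threshold cited in the text
SUGGESTIVE = 5e-6   # less stringent threshold used for Table 4

def classify(p: float) -> str:
    """Label a p value against the two thresholds used in the text."""
    if p < GENOMEWIDE:
        return "genomewide"
    if p < SUGGESTIVE:
        return "suggestive"
    return "not significant"

# SNP counts reported in the GWA Analysis section.
n_genotyped = 699_388
n_imputed = 1_024_929
n_tests = n_genotyped + n_imputed

# Naive Bonferroni threshold at alpha = 0.05, assuming independent tests.
bonferroni = 0.05 / n_tests

print(n_tests, f"{bonferroni:.2e}")
print(classify(1e-9), classify(1e-6), classify(1e-3))
```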

Table 4. SNPs associated with CU exceeding a threshold for suggestive significance of p<5×10⁻⁶.

The sample size of 2,930 would, in a univariate analysis, provide 49% power to detect a quantitative trait locus (QTL) explaining 1.0% of the variance, or 77% power to detect a QTL explaining 1.3% of the variance [66]. Individual SNP variants with lower frequencies, being rarer in the population, are also unlikely to explain such large proportions of variance. The balance of probabilities therefore suggests that there are no autosomal SNPs with large effects on CU (>1% of variance).
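The order of magnitude of these power figures can be checked with a rough sketch. It is not the Genetic Power Calculator [66]; it assumes a 1-df test at α = 5×10⁻⁸ and a non-centrality parameter of N·q/(1−q), where q is the proportion of variance explained. Both the formula for the non-centrality parameter and the function name `qtl_power` are assumptions.

```python
from statistics import NormalDist

def qtl_power(n: int, var_explained: float, alpha: float = 5e-8) -> float:
    """Approximate power of a 1-df association test for a quantitative trait.

    Assumes the test statistic is a squared normal deviate with
    non-centrality parameter n * q / (1 - q); illustrative only.
    """
    norm = NormalDist()
    z_crit = -norm.inv_cdf(alpha / 2.0)            # two-sided critical value
    ncp = n * var_explained / (1.0 - var_explained)
    shift = ncp ** 0.5
    # Probability that the shifted normal deviate exceeds either critical bound.
    return norm.cdf(shift - z_crit) + norm.cdf(-shift - z_crit)

power_1_0 = qtl_power(2930, 0.010)
power_1_3 = qtl_power(2930, 0.013)
print(f"power at 1.0% of variance: {power_1_0:.2f}")
print(f"power at 1.3% of variance: {power_1_3:.2f}")
```

Under these assumptions the results fall roughly in line with the reported figures, and they show how steeply power rises with small increases in effect size at this sample size.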

Discussion

Our twin study heritability estimate of 64% for CU is consistent with previously published results [9]. In contrast, this first GCTA estimate of CU heritability based on DNA alone was only 7% – much lower than expected given that it was calculated using an identical measure in the same sample.

Although our sample size of nearly 3000 children entails a large standard error (0.12) for our GCTA heritability estimate (0.07), the 95% confidence interval suggests that the true GCTA heritability is less than 20%. In other words, even if the true GCTA heritability estimate for CU were at the top of this confidence interval, it would imply a wide gap between the GCTA and twin heritability estimates. This gap between GCTA heritability and twin study heritability could be called ‘missing GCTA heritability’. The missing GCTA heritability gap for CU, if true, would greatly increase the difficulty of identifying GWA associations because GWA, like GCTA, is limited to the additive effects tagged by the common SNPs on our DNA array.

Why is missing GCTA heritability so much greater for CU than for some other traits in the same study? Although it is possible that the twin study overestimated heritability for CU, converging evidence from several sources suggests that twin estimates are valid [39]. Another possibility is that our low GCTA heritability estimate for CU is in error. This seems unlikely for two reasons. First, in the same study using the same methods, we have found GCTA heritability estimates that were more than half the twin study heritability estimate for cognitive and anthropometric traits in the same sample [45]. For example, heritability estimates for height and intelligence were 0.80 and 0.46, respectively, in twin analyses and 0.42 and 0.35 in GCTA. Second, although our GCTA heritability estimate for CU is the first such estimate, our low estimate is similar to the low GCTA heritability estimate reported in the only published GCTA study of personality [44].
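The comparison across traits can be made concrete by dividing each GCTA estimate by the corresponding twin estimate. The sketch below uses only the point estimates quoted in the text; the function and variable names are hypothetical.

```python
# (twin heritability, GCTA heritability) point estimates quoted in the text.
estimates = {
    "height": (0.80, 0.42),
    "intelligence": (0.46, 0.35),
    "CU": (0.64, 0.07),
}

def gcta_fraction(twin_h2: float, gcta_h2: float) -> float:
    """Fraction of the twin heritability estimate captured by the GCTA estimate."""
    return gcta_h2 / twin_h2

fractions = {trait: gcta_fraction(*pair) for trait, pair in estimates.items()}
for trait, fraction in fractions.items():
    print(f"{trait}: {fraction:.2f}")
```

For height and intelligence the GCTA estimate exceeds half of the twin estimate, whereas for CU it is closer to one tenth.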

What we do know for certain is that GCTA is limited to additive effects of common DNA variants that are tagged by commercially available DNA arrays, whereas the twin design is not. Because identical twins are identical in terms of all inherited DNA sequence variants, they share all genetic effects – small or large, nonadditive or additive, rare or common. For this reason, as compared to twin studies, GCTA underestimates heritability to the extent that heritability is caused by nonadditive genetic effects and the effects of rare variants [67].

The first hypothesis – that nonadditive genetic effects led to the low GCTA estimate of heritability for CU – is not supported by our twin results. As mentioned in the Methods section, the hallmark of nonadditive gene-gene (epistatic) interactions is that the DZ twin correlation is less than half the MZ twin correlation. However, in our twin analysis of CU, the DZ correlation (0.31) is almost exactly half the MZ correlation (0.63), providing no support for the hypothesis of nonadditive genetic influence.
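The relationship between these two correlations and the variance components can be illustrated with Falconer's formulas, a textbook approximation. This is a swapped-in technique for illustration, not the structural equation model-fitting reported in Tables 1–3, and the function name `falconer` is hypothetical.

```python
def falconer(r_mz: float, r_dz: float) -> tuple:
    """Rough ACE decomposition from twin correlations (Falconer's formulas).

    a2: additive genetic variance, c2: shared environment,
    e2: non-shared environment plus measurement error.
    """
    a2 = 2.0 * (r_mz - r_dz)
    c2 = 2.0 * r_dz - r_mz
    e2 = 1.0 - r_mz
    return a2, c2, e2

# Twin correlations quoted in the text: MZ = 0.63, DZ = 0.31.
a2, c2, e2 = falconer(0.63, 0.31)
print(f"a2 = {a2:.2f}, c2 = {c2:.2f}, e2 = {e2:.2f}")
```

The additive estimate from this approximation agrees with the model-fitting heritability of 0.64, and the shared environment term is essentially zero (a slightly negative value is normally treated as zero).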

Our twin model-fitting also found that the additive AE model best fit the data. Indirect support for the view that nonadditive genetic variance is not a major cause of missing GCTA heritability for CU comes from the general conclusion from quantitative genetic research that most genetic variance is additive [39],[68]. There are also evolutionary reasons to expect that most genetic variance is additive [69]. We hope that nonadditive genetic variance is not a major factor because, if heritability is substantially due to nonadditive genetic effects, these effects will be extremely difficult to identify: power is greatly diminished with each gene added to the interaction.

The second hypothesis – that less common DNA variants contribute to the GCTA heritability gap for CU – seems certain to be part of the general explanation for missing heritability [70]. Common SNPs on currently available commercial DNA arrays have frequencies greater than one percent in the population. Many more SNPs are rarer, with frequencies that go down to ‘private mutations’ unique to an individual. More than 10 million SNPs have been validated in populations around the world; only about 2 million have frequencies greater than one percent in the population studied. However, we can offer no speculation as to why rare variants would be so especially important for CU. In order to explain a large proportion of phenotypic variation, rare variants would need to have large effect sizes or be highly numerous. Large effect sizes could occur for a trait under negative selection pressure, in which a de novo mutation has a large effect on the individual who harbors the mutation, but whose fertility is lowered so that the mutation does not spread in the population. Yet schizophrenia, a disorder that is known to be associated with low fecundity and therefore presumably under negative selection pressure, is nevertheless largely influenced by common polygenic variation [41],[71].
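The statement that rare variants would need large effect sizes to explain much phenotypic variation can be illustrated with the standard expression for the variance explained by a single biallelic variant, 2p(1−p)β². The sketch assumes an additive model, Hardy-Weinberg equilibrium, and a phenotype standardized to unit variance, so β is in standard deviation units per allele. The function names and allele frequencies are hypothetical choices for illustration.

```python
import math

def variance_explained(maf: float, beta: float) -> float:
    """Proportion of variance in a standardized trait explained by one variant.

    Assumes an additive model and Hardy-Weinberg equilibrium.
    """
    return 2.0 * maf * (1.0 - maf) * beta ** 2

def beta_needed(maf: float, target: float) -> float:
    """Per-allele effect (in SD units) required to explain `target` of the variance."""
    return math.sqrt(target / (2.0 * maf * (1.0 - maf)))

# Effect size needed to explain 1% of the variance at different allele frequencies.
for maf in (0.30, 0.01, 0.001):
    print(f"MAF = {maf}: beta = {beta_needed(maf, 0.01):.2f} SD per allele")
```

Under these assumptions, the per-allele effect required to explain 1% of the variance grows several-fold as the allele frequency drops from 30% to 1% and below.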

Because CU seems unlikely to be under strong negative selection, rare variants of large effect seem an unlikely hypothesis as to why the GCTA heritability gap is so much greater for CU than for other traits.

Whatever its cause, the low GCTA heritability estimate, if true, implies that identifying DNA variants responsible for the heritability of CU will be even more difficult than it is for most complex traits. Because GCTA heritability, not twin study heritability, sets the ceiling for GWA analysis, the low GCTA estimate for CU doomed our GWA from the outset. This limitation comes on top of our relatively small sample size, which restricted us to detecting genome-wide significant additive effects, tagged by common variants, that account for more than 1% of the total variance. This first genome-wide association study of childhood CU in a community sample of 2,930 individuals found no associations that met stringent genomewide correction for multiple testing (p<5×10⁻⁸). Three SNPs achieved suggestive significance at a less stringent threshold (p<5×10⁻⁶), none of which are close to the coding regions of well-documented candidate genes implicated in previous genetic association studies of CU, or to any of the suggestive hits identified by the previous pooling study of extreme CU that co-occurred with conduct problems [31].

These results suggest that, for CU in particular, not only will much larger samples be needed to detect associations of very small effect size, but methods to identify gene-gene interactions and whole-genome sequencing to detect rare variants may also be needed in order to detect the DNA variants that are responsible for the heritability of CU.

Acknowledgments

We gratefully acknowledge the ongoing contribution of the parents and children in the Twins Early Development Study (TEDS).

Author Contributions

Conceived and designed the experiments: EV TSP OSPD ELM CMAH RP. Performed the experiments: EV TSP SRJ MT OSPD ELM CMAH RP. Analyzed the data: EV TSP SRJ MT OSPD ELM. Wrote the paper: EV TSP RP.

References

  1. Frick PJ, Moffitt TE (2010) A proposal to the DSM–V Childhood Disorders and the ADHD and Disruptive Behavior Disorders Work Groups to include a specifier to the diagnosis of conduct disorder based on the presence of callous-unemotional traits. Arlington, VA: American Psychiatric Association.
  2. Barker ED, Oliver BR, Viding E, Salekin RT, Maughan B (2011) The impact of prenatal maternal risk, fearless temperament and early parenting on adolescent callous-unemotional traits: A 14-year longitudinal investigation. J Child Psychol Psychiatry 52: 878–888.
  3. Frick PJ, Cornell AH, Bodin SD, Dane HA, Barry CT, et al. (2003) Callous-unemotional traits and developmental pathways to severe aggressive and antisocial behavior. Dev Psychol 39: 246–260.
  4. Kumsta R, Sonuga-Barke E, Rutter M (2012) Adolescent callous-unemotional traits and conduct disorder in adoptees exposed to severe early deprivation. Br J Psychiatry 200: 197–201.
  5. Rowe R, Maughan B, Moran P, Ford T, Briskman J, et al. (2010) The role of callous and unemotional traits in the diagnosis of conduct disorder. J Child Psychol Psychiatry 51: 688–695.
  6. Hart SD, Hare RD (1997) Psychopathy: Assessment and association with criminal conduct. In: Stoff DM, Breiling J, editors. Handbook of Antisocial Behaviour. New York: John Wiley & Sons Inc.
  7. Fontaine NMG, McCrory EJP, Boivin M, Moffitt TE, Viding E (2011) Predictors and outcomes of joint trajectories of callous-unemotional traits and conduct problems in childhood. J Abnorm Psychol 120: 730–742.
  8. Lynam DR, Caspi A, Moffitt TE, Loeber R, Stouthamer-Loeber M (2007) Longitudinal evidence that psychopathy scores in early adolescence predict adult psychopathy. J Abnorm Psychol 116: 155–165.
  9. Viding E, McCrory EJ (2012) Genetic and neurocognitive contributions to the development of psychopathy. Dev Psychopathol 24: 969–983.
  10. Larsson H, Viding E, Plomin R (2008) Callous unemotional traits and antisocial behavior: Genetic, environmental, and early parenting characteristics. Crim Justice Behav 35: 197–211.
  11. Bezdjian S, Tuvblad C, Raine A, Baker LA (2011) The genetic and environmental covariation among psychopathic personality traits, and reactive and proactive aggression in childhood. Child Dev 82: 1267–1281.
  12. Forsman M, Larsson H, Andershed H, Lichtenstein P (2007) The association between persistent disruptive childhood behaviour and the psychopathic personality constellation in adolescence: A twin study. Br J Dev Psychol 25: 383–398.
  13. Viding E, Frick PJ, Plomin R (2007) Aetiology of the relationship between callous-unemotional traits and conduct problems in childhood. Br J Psychiatry 190: s33–s38.
  14. Frick PJ, Viding E (2009) Antisocial behavior from a developmental psychopathology perspective. Dev Psychopathol 21: 1111–1131.
  15. Fontaine NMG, Rijsdijk FV, McCrory EJP, Viding E (2010) Etiology of different developmental trajectories of callous-unemotional traits. J Am Acad Child Adolesc Psychiatry 49: 656–664.
  16. Forsman M, Lichtenstein P, Andershed H, Larsson H (2008) Genetic effects explain the stability of psychopathic personality from mid- to late adolescence. J Abnorm Psychol 117: 606–617.
  17. Fowler T, Langley K, Rice F, van den Bree MBM, Ross K, et al. (2009) Psychopathy trait scores in adolescents with childhood ADHD: the contribution of genotypes affecting MAOA, 5HTT and COMT activity. Psychiatr Genet 19: 312–319.
  18. Meyer-Lindenberg A, Buckholtz JW, Kolachana B, Hariri AR, Pezawas L, et al. (2006) Neural mechanisms of genetic risk for impulsivity and violence in humans. Proc Nat Acad Sci USA 103: 6269–6274.
  19. Munafo MR, Brown SM, Hariri AR (2008) Serotonin transporter (5-HTTLPR) genotype and amygdala activation: A meta-analysis. Biol Psychiatry 63: 852–857.
  20. Jones AP, Laurens KR, Herba CM, Barker GJ, Viding E (2009) Amygdala hypoactivity to fearful faces in boys with conduct problems and callous-unemotional traits. Am J Psychiatry 166: 95–102.
  21. Marsh AA, Finger EC, Mitchell DGV, Reid ME, Sims C, et al. (2008) Reduced amygdala response to fearful expressions in children and adolescents with callous-unemotional traits and disruptive behavior disorders. Am J Psychiatry 165: 712–720.
  22. Sebastian CL, McCrory EJP, Cecil CAM, Lockwood PL, De Brito SA, et al. (2012) Neural responses to affective and cognitive Theory of Mind in children with conduct problems and varying levels of callous-unemotional traits. Arch Gen Psychiatry 69: 814–822.
  23. Sadeh N, Javdani S, Jackson JJ, Reynolds EK, Potenza MN, et al. (2010) Serotonin transporter gene associations with psychopathic traits in youth vary as a function of socioeconomic resources. J Abnorm Psychol 119: 604–609.
  24. Beitchman JH, Zai CC, Muir K, Berall L, Nowrouzi B, et al. (2012) Childhood aggression, callous-unemotional traits and oxytocin genes. Eur Child Adolesc Psychiatry 21: 125–132.
  25. Malik AI, Zai CC, Abu Z, Nowrouzi B, Beitchman JH (2012) The role of oxytocin and oxytocin receptor gene variants in childhood-onset aggression. Genes Brain Behav 11: 545–551.
  26. Chabris CF, Hebert BM, Benjamin DJ, Beauchamp J, Cesarini D, et al. (2012) Most reported genetic associations with general intelligence are probably false positives. Psychol Sci 23: 1314–1323.
  27. Siontis KCM, Patsopoulos NA, Ioannidis JPA (2010) Replication of past candidate loci for common diseases and phenotypes in 100 genome-wide association studies. Eur J Hum Genet 18: 832–837.
  28. Visscher PM, Brown MA, McCarthy MI, Yang J (2012) Five years of GWAS discovery. Am J Hum Genet 90: 7–24.
  29. Hirschhorn JN, Daly MJ (2005) Genome-wide association studies for common diseases and complex traits. Nat Rev Genet 6: 95–108.
  30. McCarthy MI, Abecasis GR, Cardon LR, Goldstein DB, Little J, et al. (2008) Genome-wide association studies for complex traits: Consensus, uncertainty and challenges. Nat Rev Genet 9: 356–369.
  31. Viding E, Hanscombe KB, Curtis CJC, Davis OSP, Meaburn EL, et al. (2010) In search of genes associated with risk for psychopathic tendencies in children: A two-stage genome-wide association study of pooled DNA. J Child Psychol Psychiatry 51: 780–788.
  32. Hindorff LA, MacArthur J (European Bioinformatics Institute), Morales J (European Bioinformatics Institute), Junkins HA, Hall PN, et al. A catalog of published genome-wide association studies. Available:
  33. Park JH, Gail MH, Weinberg CR, Carroll RJ, Chung CC, et al. (2011) Distribution of allele frequencies and effect sizes and their interrelationships for common genetic susceptibility variants. Proc Nat Acad Sci USA 108: 18026–18031.
  34. Klein RJ, Zeiss C, Chew EY, Tsai JY, Sackler RS, et al. (2005) Complement factor H polymorphism in age-related macular degeneration. Science 308: 385–389.
  35. Maher B (2008) Personal genomes: The case of the missing heritability. Nature 456: 18–21.
  36. Plomin R (2013) Child development and molecular genetics: 14 years later. Child Dev 84: 104–120.
  37. Yang J, Benyamin B, McEvoy BP, Gordon S, Henders AK, et al. (2010) Common SNPs explain a large proportion of the heritability for human height. Nat Genet 42: 565–569.
  38. Yang JA, Lee SH, Goddard ME, Visscher PM (2011) GCTA: A tool for genome-wide complex trait analysis. Am J Hum Genet 88: 76–82.
  39. Plomin R, DeFries JC, Knopik VS, Neiderhiser JM (2013) Behavioral genetics. New York: Worth Publishers.
  40. Yang J, Manolio TA, Pasquale LR, Boerwinkle E, Caporaso N, et al. (2011) Genome partitioning of genetic variation for complex traits using common SNPs. Nat Genet 43: 519–525.
  41. Lee SH, DeCandia TR, Ripke S, Yang J, Sullivan PF, et al. (2012) Estimating the proportion of variation in susceptibility to schizophrenia captured by common SNPs. Nat Genet 44: 247–250.
  42. Lee SH, Wray NR, Goddard ME, Visscher PM (2011) Estimating missing heritability for disease from genome-wide association studies. Am J Hum Genet 88: 294–305.
  43. Lubke GH, Hottenga JJ, Walters R, Laurin C, de Geus EJ, et al. (2012) Estimating the genetic variance of major depressive disorder due to all single nucleotide polymorphisms. Biol Psychiatry 72: 707–709.
  44. Vinkhuyzen AAE, Pedersen NL, Yang J, Lee SH, Magnusson PKE, et al. (2012) Common SNPs explain some of the variation in the personality dimensions of neuroticism and extraversion. Transl Psychiatry 2: e102.
  45. Plomin R, Haworth CMA, Meaburn EL, Price T, Wellcome Trust Case Control Consortium 2, et al. (2013) Common DNA markers can account for more than half of the genetic influence on cognitive abilities. Psychol Sci 24: 562–568.
  46. Oliver BR, Plomin R (2007) Twins Early Development Study (TEDS): A multivariate, longitudinal genetic investigation of language, cognition and behaviour problems from childhood through adolescence. Twin Res Hum Genet 10: 96–105.
  47. Kovas Y, Haworth CMA, Dale PS, Plomin R (2007) The genetic and environmental origins of learning abilities and disabilities in the early school years. Monogr Soc Res Child Dev 72: 1–144.
  48. Barrett JC, Lee JC, Lees CW, Prescott NJ, Anderson CA, et al. (2009) Genome-wide association study of ulcerative colitis identifies three new susceptibility loci, including the HNF4A region. Nat Genet 41: 1330–1334.
  49. Marchini J, Howie B, Myers S, McVean G, Donnelly P (2007) A new multipoint method for genome-wide association studies by imputation of genotypes. Nat Genet 39: 906–913.
  50. Howie BN, Donnelly P, Marchini J (2009) A flexible and accurate genotype imputation method for the next generation of genome-wide association studies. PLoS Genet 5: e1000529.
  51. Gomez R, Harvey J, Quick C, Scharer I, Harris G (1999) DSM-IV AD/HD: Confirmatory factor models, prevalence, and gender and age differences based on parent and teacher ratings of Australian primary school children. J Child Psychol Psychiatry 40: 265–274.
  52. Nadder TS, Silberg JL, Rutter M, Maes HH, Eaves LJ (2001) Comparison of multiple measures of ADHD symptomatology: A multivariate genetic analysis. J Child Psychol Psychiatry 42: 475–486.
  53. Viding E, Blair RJR, Moffitt TE, Plomin R (2005) Evidence for substantial genetic risk for psychopathy in 7-year-olds. J Child Psychol Psychiatry 46: 592–597.
  54. Frick PJ, Hare RD (2001) The Antisocial Process Screening Device (APSD). Toronto, Canada: Multi-Health Systems.
  55. Goodman R (1997) The Strengths and Difficulties Questionnaire: A research note. J Child Psychol Psychiatry 38: 581–586.
  56. Boker S, Neale M, Maes H, Wilde M, Spiegel M, et al. (2011) OpenMx: An open source extended structural equation modeling framework. Psychometrika 76: 306–317.
  57. Wellcome Trust Case Control Consortium (2007) Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls. Nature 447: 661–678.
  58. R Development Core Team (2011) R: A language and environment for statistical computing (Version 2.13.1). Vienna, Austria: R Foundation for Statistical Computing.
  59. Pruim RJ, Welch RP, Sanna S, Teslovich TM, Chines PS, et al. (2010) LocusZoom: Regional visualization of genome-wide association scan results. Bioinformatics 26: 2336–2337.
  60. Fellay J, Shianna KV, Ge D, Colombo S, Ledergerber B, et al. (2007) A whole-genome association study of major determinants for host control of HIV-1. Science 317: 944–947.
  61. Browning SR, Browning BL (2011) Population structure can inflate SNP-based heritability estimates. Am J Hum Genet 89: 191–193.
  62. van der Waerden BL (1952) Order tests for the two-sample problem and their power. Indagationes Mathematicae 14: 453–458.
  63. Little RJA, Rubin DB (2002) Statistical analysis with missing data. Hoboken, NJ: John Wiley & Sons.
  64. Devlin B, Roeder K (1999) Genomic control for association studies. Biometrics 55: 997–1004.
  65. Dudbridge F, Gusnanto A (2008) Estimation of significance thresholds for genomewide association scans. Genet Epidemiol 32: 227–234.
  66. Purcell S, Cherny SS, Sham PC (2003) Genetic Power Calculator: Design of linkage and association genetic mapping studies of complex traits. Bioinformatics 19: 149–150.
  67. Zaitlen N, Kraft P (2012) Heritability in the genome-wide association era. Hum Genet 131: 1655–1664.
  68. Flint J, DeFries JC, Henderson ND (2004) Little epistasis for anxiety-related measures in the DeFries strains of laboratory mice. Mamm Genome 15: 77–82.
  69. Hill WG, Goddard ME, Visscher PM (2008) Data and theory point to mainly additive genetic variance for complex traits. PLoS Genet 4: e1000008.
  70. Gibson G (2012) Rare and common variants: Twenty arguments. Nat Rev Genet 13: 135–145.
  71. Purcell SM, Wray NR, Stone JL, Visscher PM, O'Donovan MC, et al. (2009) Common polygenic variation contributes to risk of schizophrenia and bipolar disorder. Nature 460: 748–752.