In recent years, DISC1 has emerged as one of the most credible and best supported candidate genes for schizophrenia and related neuropsychiatric disorders. Furthermore, increasing evidence – both genetic and functional – indicates that many of its protein interaction partners are also involved in the development of these diseases. In this study, we applied a pooled sample 454 sequencing strategy, to explore the contribution of genetic variation in DISC1 and 10 of its interaction partners (ATF5, Grb2, FEZ1, LIS-1, PDE4B, NDE1, NDEL1, TRAF3IP1, YWHAE, and ZNF365) to schizophrenia susceptibility in an isolated northern Swedish population. Mutation burden analysis of the identified variants in a population of 486 SZ patients and 514 control individuals, revealed that non-synonymous rare variants with a MAF<0.01 were significantly more present in patients compared to controls (8.64% versus 4.7%, P = 0.018), providing further evidence for the involvement of DISC1 and some of its interaction partners in psychiatric disorders. This increased burden of rare missense variants was even more striking in a subgroup of early onset patients (12.9% versus 4.7%, P = 0.0004), highlighting the importance of studying subgroups of patients and identifying endophenotypes. Upon investigation of the potential functional effects associated with the identified missense variants, we found that ~90% of these variants reside in intrinsically disordered protein regions. The observed increase in mutation burden in patients provides further support for the role of the DISC1 pathway in schizophrenia. Furthermore, this study presents the first evidence supporting the involvement of mutations within intrinsically disordered protein regions in the pathogenesis of psychiatric disorders. As many important biological functions depend directly on the disordered state, alteration of this disorder in key pathways may represent an intriguing new disease mechanism for schizophrenia and related neuropsychiatric diseases. Further research into this unexplored domain will be required to elucidate the role of the identified variants in schizophrenia etiology.
Citation: Moens LN, De Rijk P, Reumers J, Van Den Bossche MJA, Glassee W, De Zutter S, et al. (2011) Sequencing of DISC1 Pathway Genes Reveals Increased Burden of Rare Missense Variants in Schizophrenia Patients from a Northern Swedish Population. PLoS ONE 6(8): e23450. doi:10.1371/journal.pone.0023450
Editor: Peter Heutink, VU University Medical Center and Center for Neurogenomics and Cognitive Research, Netherlands
Received: March 17, 2011; Accepted: July 18, 2011; Published: August 11, 2011
Copyright: © 2011 Moens et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This research was supported by the Special Research Fund of the University of Antwerp, the Fund for Scientific Research Flanders (FWO-F), the Institute for Science and Technology–Flanders (IWT-F), the Swedish Research Council (grants 2006-4472 and 2009-5269), the Medical Faculty, Umeå University and the County Councils of Västerbotten and Norrbotten, Sweden. LNM was supported by a postdoctoral fellowship from IWT-F, and MVDB holds a PhD fellowship of the IWT-F. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Schizophrenia (SZ) is a severe psychiatric disorder affecting ~1% of the population worldwide. The disease is characterized by positive symptoms, including hallucinations, delusions and disturbances in thoughts, as well as negative symptoms such as lack of motivation and attention, asocial behavior and cognitive dysfunction. With a heritability of ~60–80% , SZ has a clear genetic component, but despite major efforts during the last decades to identify genetic risk factors, only a handful of candidate genes could be replicated independently – and even a smaller number demonstrated clear biological support.
One recent and promising exception is Disrupted in Schizophrenia 1 (DISC1), which was originally identified via a balanced t(1;11) chromosomal translocation segregating with a wide spectrum of psychiatric disorders in a large Scottish pedigree . Since its discovery, several independent linkage and association studies in diverse populations and various phenotype models – including SZ, bipolar disorder, major depression, as well as various neurophysiological, cognitive and structural traits – have confirmed the original findings, further supporting a central role for DISC1 genetic variation in conferring susceptibility to psychiatric illness – .
Nevertheless, except for the translocation no specific causal variant has yet been identified, , and most compelling evidence so far has come from functional genomic and cell biological analyses, suggesting an essential role of DISC1 in neuronal development, including adult neurogenesis and signaling –.
Research into DISC1's multiple interaction partners has proved very valuable in elucidating its biological functions, further implicating it in essential processes of brain development and adult neuronal function , , –. With over 15 in vivo confirmed protein interactors , –, and many more potential interactions (>60, identified by yeast two-hybrid screens) , –, DISC1 is considered a central ‘hub’ protein, connecting numerous functional systems, including neuronal migration, neurite outgrowth, cytoskeletal modulation and cAMP signaling, within the brain , . The observation that several of these interaction partners have been identified as independent (genetic) susceptibility factors for neuropsychiatric diseases , and the potential crosstalk with known schizophrenia risk factors such as dysbindin and neuregulin, which share multiple putative binding partners with DISC1 , indicate that not just DISC1, but a multidimensional ‘DISC1 pathway’ is involved in the etiology of psychiatric diseases.
At present, however, little is known on the role of genetic variation in the individual DISC1 interactors, or combinations of the different interaction partners, in the development of these disorders. Therefore, detailed studies of the various components of the DISC1 pathway will be essential to fully understand the role of this complex molecular network in conferring susceptibility to psychiatric diseases. Studying a set of candidate genes in convergent molecular risk networks – like the DISC1 pathway – is an attractive strategy, as it allows to investigate the cumulative effects of (moderately deleterious) mutations that, because of their molecular connection, could cause genetic impairment of pathway activity and thereby lead to phenotypic effects. Hereby, it is of particular importance to search for rare mutations (with a low population frequency (<1%), and a relatively high penetrance), as it has been suggested that these variants, or combinations thereof, could explain a substantial fraction of common disorders like SZ –.
Of course, identifying rare variants requires genotyping large populations of individuals, which is costly and time-consuming, especially when one seeks to study large candidate gene sets typically involved in complex diseases. An interesting approach to minimize cost and time, is through sample pooling that in combination with massively parallel sequencing allows the identification and simultaneous quantification of (rare) variants in multiple individuals –.
In this study, we have used multiplex PCR combined with next generation sequencing of pooled DNA, to explore the mutation burden in DISC1 and 10 of its interaction partners (ATF5, GRB2, FEZ1, LIS-1 (encoded by PAFAH1B1), PDE4B, NDE1, NDEL1, TRAF3IP1, YWHAE, and ZNF365) in schizophrenia patients versus control individuals. The candidate genes were first selected based on their convincingly reported interaction with DISC1. Next, both some more established/studied interactors – having (suggestive) evidence for involvement in psychiatric illness – and some new, potentially interesting candidates were included (Table S1) , .
454 sequencing validation and variant discovery
454 sequence analysis was performed on 4 DNA pools and on 1 individual patient DNA sample. The average number of mapped reads per amplicon was comparable between patient and control pools (1356 versus 1368 reads/amplicon, respectively (~34 reads/individual)), justifying a comparison between the patient and control sequencing data.
The number of reads was homogeneously distributed over the different amplicon pools, except for multiplex reaction 12, where a lower number of reads was obtained (Figure S1 and Text S1, Results & Discussion).
To evaluate the performance of our pooling approach we compared the minor allele frequencies of the variants in the pooled samples (as determined by GS-FLX sequencing) with their actual frequencies for each of the validated variants. The observed and predicted frequencies correlated very well (R2 = 0.98) across a wide range of frequencies, demonstrating the accuracy of our DNA sample pooling (Figure S2). In addition, the incidence of false negatives was estimated by Sanger sequencing of a representative subset of amplicons (Table S3). Results indicated that the occurrence of type II errors was negligible as long as sequence coverage was sufficient (≥500 reads/amplicon). More details and additional performance measures of our experimental platform are provided in Supporting Text S1 (Results section).
Manual curation of the variants using NovoSNP 4.0, resulted in a total of 110 potential variants with a frequency ≥0.8% (in the discovery sample), of which 61 were located in the coding and untranslated regions of the target genes. These 61 variants were further validated by SNP genotyping in the complete association sample, (comprising 486 unrelated SZ patients and 514 unrelated control individuals) (57 variants), or Sanger sequencing in the original subject population (80 patient and 80 control samples) (4 variants), resulting in a final set of 50 confirmed coding and UTR variants (Table 1).
Variant frequency analyses
Variant frequencies of the 50 confirmed variants are listed in Table 1.
The genotype distribution of variant PAFAH1B1 r.1250C>T (rs6628) was not in HWE, and was omitted from further statistical analyses.
Main effect analysis.
At the allelic level, only 1 variant, NDE1 p.Y279Y ( = rs17283846), showed a significant difference between patients and control individuals, exhibiting a higher frequency in the latter (Table 1; allelic OR = 0.60 (95% CI: 0.39–0.92), p = 0.016). At the genotypic level, 6 statistically significant effects were found, the strongest of which was observed for TRAF3IP1 p.T23T ( = rs13398676), showing a higher proportion of MAF homozygotes in patients versus controls (Table 1; OR = 2.07 (95% CI: 1.2–3.5), p = 0.008). However, none of these effects passed Bonferroni correction.
Interestingly, it was observed that large part of the identified coding variants represented rare mutations, with over 35% of the variants (18/50) having a MAF below 1%, and 50% (25/50) having a MAF smaller than 2%. Some of these variants were uniquely present in patients (DISC1 p.E834E, = rs41271517) or in controls (NDEL1 p.P324S; ZNF365 p.L318L).
Mutation burden analysis.
The overall variation burden (i.e. comprising all 49 variants in HWE) did not differ between the 486 patients and 514 control individuals genotyped, neither in the complete set of candidate genes (p = 0.75), nor on an individual gene basis (smallest p = 0.42). However, when the variants were stratified based on MAF and variant type, several interesting effects were observed (Table 2 and 3). It was found that the rare mutations (MAF <0.01) were more common in the patient population compared to the controls (1.24-fold increase, P = 0.246). Though not statistically significant for the assembled rare mutations (including missense, silent and UTR variants), this increased burden became significant when only the missense mutations were considered (1.85-fold increase at MAF<0.01, P = 0.018). This effect gradually diminished when variants with a higher MAF were included (1.28-fold increase at MAF<0.02, P = 0.112; 1.10-fold increase at MAF<0.05, P = 0.479). The observed result, however, did not remain statistically significant after Bonferroni correction (16 tests, corrected P = 0.29).
Yet, upon further stratification of the data according to disease onset age, we found that the observed increase in rare missense mutation burden may not be a general effect in our SZ sample, but seems to be related to the age at disease onset, being most pronounced in patients with a young onset age.
More specifically, the patient population was stratified into three groups: early-onset (≤20) (N = 163), medium-onset (21 - <35) (N = 266) and late-onset (35 - <60) (N = 57) patients, and the burden of rare missense mutations (MAF <1%) was investigated in these groups. It was found that the early-onset patients had a particularly high burden of these rare missense variants: in this subset of patients, we found 12.9 mutant alleles per 100 individuals, versus 8.6 in the complete SZ sample and 4.7 in controls (SZ/co ratio = 2.75; P = 0.0004). This burden decreased to 7.9 mutant alleles per 100 individuals in the medium onset group (SZ/co ratio = 1.69; P = 0.076), and was zero in the late-onset group (0 mutant alleles/100 individuals; P = 0.15).
The observed increased burden of rare missense mutations in early onset patients also remains significant after Bonferroni correction (19 tests, corrected P = 0.0076).
In silico functional analyses
For all variants, the degree of nucleotide conservation was assessed using the GERP (Genome Evolutionary Rate Profiling) score. Missense and silent mutations were further examined for potential splicing defects. Several potential effects on splicing were predicted, with varying consistencies across the different matrices (Table 4 and 5). These include the disruption of several potential ESE sites, and the creation of two potential new splicing motifs in DISC1 (one donor and one acceptor site), which were predicted by both of the splice-site prediction algorithms used. Neither of these two alternative splice variants corresponded to the splice isoforms described by Nakata and co-workers. . The strongest evidence – i.e., corresponding to the highest number of predictions, across multiple matrices – was found for TRAF3IP K295N ( = rs12464423) (a common variant; OR = 1.05, 95% CI: 0.87–1.28), and FEZ1 E358Q, a rare variant which is slightly more present in control individuals compared to patients (OR = 0.64, 95% CI:0.30–1.40).
The variants causing amino acid substitutions were further evaluated by analyzing evolutionary amino acid conservation using SIFT, PolyPhen and Panther. The resulting conservation scores indicated that the majority of identified amino acid substitutions in our data set had either little or no functional effect (scored as neutral by at least two these algorithms), or required more homologous sequence data in order to make reliable predictions. Two variants however, were concordantly predicted to be potentially damaging: ATF5 p.R167C (SIFT, damaging; PolyPhen, probably damaging; Panther, possibly damaging) and DISC1 p.S704C ( = rs821616) (all three algorithms: possibly damaging). In addition, one variant (DISC1 p.L607F) ( = rs6675281) was predicted to be possibly damaging by SIFT and Panther, but scored as benign by PolyPhen (Table 4).
The missense mutations were also examined for putative disruptions of specific structural and functional properties using various predictors in the SNPeffect toolsuite (Reumers et al., 2008). Single amino acid replacements can affect the protein's structure and dynamics, they can disrupt functional sites or affect the cellular processing of the protein. None of the missense variants detected in this study caused significant changes in any of the properties examined. The lack of three dimensional structures of the proteins in the DISC1 pathway, or even structural models of close homologs, hindered a detailed analysis of the influence of the mutations on protein stability.
However, this scarcity of structural models is probably due to a high occurrence of intrinsically disordered regions in the proteins under study. Indeed, DisProt analysis showed that all proteins except LIS1 (encoded by PAFAH1B1), GRB2 and YWHAE have ≥40% disordered residues (Figure 1) (Peng et al., 2006). The missense variants were overrepresented in proteins with a high content of unstructured regions, with 19 of the 22 identified missense mutations (86%) residing in an unstructured region of the protein, and only 3 mutations (DISC1 L607F ( = rs6675281), DISC1 S704C ( = rs821616) and ZNF365 A26T ( = rs7076156)) lying outside such a region (Figure S4). Notably, none of the 3 more ‘structured’ proteins contained missense mutations.
(A) Disorder content of the individual proteins of the DISC1 pathway (i.e. percentage of disordered residues/ total number of amino acids). (B) Disorder content of the DISC1 pathway proteins compared to human proteome (i.e. 20320 SwissProt proteins), a set of brain proteins (7160 sequences from GeneAtlas) and a set of schizophrenia candidate proteins (670 sequences from the Schizophrenia Gene Resource database). Overall, the DISC1 pathway proteins exhibit a higher abundance of intrinsically disordered residues, compared to the human proteome.
Finally, all UTR variants were analyzed for interference with predicted transcription factor binding sites, miRNA target sites. We did not find evidence for mutations affecting transcription factor binding sites, but we did identify three potential miRNA target site mutations, in the 3′ UTR of ATF5 and DISC1 (Table 6).
To explore the role of genetic variation in DISC1 and 10 of its interaction partners in the etiology of schizophrenia, we sequenced the coding exons and splice junctions of the genes using massively parallel 454 sequencing in pooled samples. A selection of 80 early onset SZ patients and 80 control individuals was used as a discovery sample, resulting in the identification of 50 validated variants (Table 1). These 50 variants were subsequently genotyped in the complete association samples comprising 486 SZ patients and 514 control individuals recruited from an isolated northern Swedish population.
Six variants were found with a statistically significant frequency difference between patients and controls. These include two synonymous mutations, NDE1 Y279Y ( = rs17283846) (p = 0.025) and TRAF3IP1 T23T ( = rs13398676) (p = 0.006). Although the functional consequences of silent mutations may not be obvious, recent studies have shown that these variants might modify protein abundance, structure and/or activity via alterations in mRNA stability , splicing , or translation kinetics. The other variants showing a significant effect in two or more of the inheritance models all involve UTR or splice site mutations: ATF5 r.871G>A, DISC1 g.169488T>C ( = rs2273890) and NDE1 r.1041A>C ( = rs2075511). Interestingly, ATF5 r.871G>A is located in a potential miRNA target site, and may therefore interfere with ATF5 expression and function. However, as none of these effects survived multiple testing correction, further independent replication of these findings will be required.
The observed scarcity of statistically significant main effects in our data set does not necessarily rule out the involvement of the DISC1 pathway in the susceptibility for SZ in our population, but may (at least partly) be attributed to the relatively high occurrence of rare variants, having frequencies too low to be able to run adequate statistical comparisons. Indeed, over 35% of all identified coding variants (18/50) have a MAF below 1%, and 50% (25/50) were present at a MAF smaller than 2%. Though the frequencies of these rare variants were not significantly different between patients and controls, they often have odds ratios higher than 1.5 (resp. lower than 0.67), and several are unique in one or the other group (Tables 4, 5, 6). As for DISC1, none of rare variants identified here overlapped with the 5 ultra-rare cohort-specific variants previously described by Song et al. . Of the two rare variants we identified in this gene, one (W160L) was completely novel, and the other (E751Q) was also reported as rare by Song and colleagues (MAF<1%). Interestingly, this variant also had an OR of ~2 in their population, analogous with our results (Table 4).
To understand the potential role of the identified rare variants in SZ etiology, we evaluated the mutation burden (defined as the average number of mutations per person) in patients versus controls. Under a model in which rare mutations increase risk, we would expect to observe a greater burden in patients compared to control individuals. There was no difference in overall mutation burden between the genotyped patients and control individuals. Yet, when mutation burden was defined not simply as the total number of variants – including neutral polymorphisms – but evaluated as subgroups of variants (based on MAF and variant type), we found that schizophrenia patients were 1.85 times as likely as controls to harbor rare variants (MAF<0.01) causing amino acid substitutions (including frameshifts) (empirical P = 0.018, Table 2), indicating a role for these variants in SZ etiology in at least the northern Swedish population. Though this effect was no longer significant after stringent Bonferroni correction, it pointed us towards an even stronger effect in a subgroup of patients. Indeed, the observed increased burden of rare missense mutations seems to be related to the age at disease onset, being most pronounced in patients with a young onset age, having a 2.75-fold higher burden of rare missens variants compared to controls (empirical P = 0.0004, Bonferroni corrected P = 0.0076) (Table 3). This observation is in line with previous clinical, cognitive, genetic and imaging studies, implicating that early onset SZ is associated with greater genetic loading –, overabundance of rare CNVs impacting on known genes  and increased neurodevelopmental deviance , amongst others. These data emphasize the importance of studying subgroups of patients and identifying endophenotypes.
In addition, our findings support the hypothesis that multiple, individually rare mutations contribute to SZ risk  and, given the distribution of the variants across different genes, also explain the allelic and locus heterogeneity typically observed in SZ. Replication of our findings in larger sample sets will however be required to further substantiate the observed effects.
Detailed analysis of the variants contributing to the increased burden, showed that 8 out of 9 identified rare non-synonymous mutations in this study had an increased abundance in patients versus controls. These 8 mutations are located in DISC1 (2 mutations), PDE4B (1 mutation), ATF5 (1 mutation), TRAF3IP1 (3 mutations) and ZNF365 (1 mutation) (Table S4). Though not statistically significant on a single gene level, each of these genes causes an individual increase in mutation burden with a factor ~2 in patients versus controls (average fold increase 2.24±0.51). Taking into account the number of coding bases in these genes, DISC1 and ATF5 were found to have highest mutation burden per base (Table S4), and may thus be considered the strongest candidates for further detailed mutation analysis in a larger sample. Indeed, only a subset of our patient population (80 individuals out of 486) was sequenced in this study, enabling the detection of merely a fraction of all rare variants present in this population (see Supporting Text S1, Discussion section). Follow-up sequencing of the candidate genes in the complete patient sample may therefore uncover other rare (non-synonymous) mutations, possibly further contributing to the observed differences in mutation burden.
In order to estimate the potential risk associated with the identified missense variants, a range of protein structural and functional properties was investigated. Rather unexpectedly, we found that none of the 22 identified missense variants caused any significant effect on the various properties examined. This absence led us to the observation that 8 of 11 proteins under study showed a remarkably high occurrence of intrinsically disordered regions (IDRs). Indeed, all proteins except LIS1, GRB2 and YWHAE were found to have ≥40% of disordered residues by DisProt analysis (Figure 1, panel A). Furthermore, we observed that ~90% of the identified missense variants were located in these IDRs (Figure S4), while neither PAFAH1B1, GRB2, nor YWHAE contained a single missense variant.
IDRs are segments of proteins that do not definitively fold and remain flexible and unordered. These proteins take up different structures upon binding to different targets, and thereby exhibit functional flexibility –. Disordered regions of proteins have been shown to have important physiological roles, including molecular recognition, cell regulation and signal transduction . It is therefore not surprising that protein disorder turns out to be very common in human diseases – being significantly enriched in a wide variety of disease-associated proteins, including neurodegenerative disease, cancer, cardiovascular disease and diabetes –. Furthermore, it has been shown that IDRs are particularly prevalent in hub proteins and interaction networks, where their conformational flexibility is required to accommodate binding between the different interaction partners –. Interestingly, our analyses revealed that this is also the case here, with the DISC1 pathway proteins clearly exhibiting a higher abundance of intrinsically disordered residues, compared to the human proteome , as well as a set of brain and schizophrenia-related protein sequences (p = 0.018; 0.013 and 0.0098, respectively) (Figure 1, panel B). This is an exciting new insight, which – to our knowledge – has never been reported in the literature, and may provide a new boost to the complex research field of psychiatric genetics. While alterations of disordered regions may not directly cause changes in protein structure, they are very well capable of interfering with the function of proteins, e.g. by affecting the affinity for interaction with other proteins, or altering the coupled binding-folding mechanism of (one of) the binding partners. Importantly, it has been shown that intrinsically disorder is very sensitive to changes in amino acid sequence; as recently described –, maintaining disordered regions through evolution (or sequence changes) appears very difficult, whereas helices and strands are maintained more easily. Neutral mutations with respect to disorder are therefore very unlikely –. Certainly in a complex network, such as the DISC1 pathway, it is very well conceivable that mutations and/or changes in one of the proteins or its environment could reduce its ability to recognize appropriate binding partners and lead to partial or complete collapse of the protein network.
In this study, ~90% of all identified missense variants (including the rare mutations underlying the increased burden in patients versus controls) are located in an IDR. Interestingly, some of the (rare) variants indentified in this study fall into known binding regions on one or more of the interactors (Figure S3). E.g., ATF5 R167C is located in the DISC1 binding region of this protein; DISC E751Q resides in the binding sites for ATF5, LIS1 and PDE4B; and TRAF3IP1 E260K is located in the DISC1 binding region of this protein. Although these observations are certainly very intriguing, they should be regarded with some caution, as the reported binding regions between the different interactors are often quite large, hence no clear conclusions can be drawn from them. Moreover, as not all of the binding regions for the different interactions have been described in literature, it is impossible to give a complete picture of this. The question whether one (or more) of these mutations might influence protein (or even pathway) function, by interfering with any of the key features associated with IDRs, will be one of the major challenges for future work. A first clue about potential effects of some of the variants may be provided by their amino acid conservation (Tables 4, 5, 6). Based on evolutionary conservation scores generated by 3 different algorithms, we found that three variants were predicted to be possibly damaging: ATF5 R167C, DISC1 L607F ( = rs6675281) and S704C ( = rs821616). Interestingly, two of these variants (DISC1 S704C and L607F) were recently reported to have an actual functional effect –. The fact that the predicted outcome for DISC1 L607F and S704C corresponds to already known biological consequences greatly underlines the value of our in silico predictions, also for other, unknown variants. This is especially interesting as to ATF5 R167C, which was also predicted to be damaging, but not previously reported. This variant corresponds to a novel, rare mutation, having an odds ratio of 2.6 (95% CI: 0.50–13.46). Further studies of this variant are warranted to clarify its relation to disease.
To our knowledge, this study is the first describing a comprehensive resequencing analysis of the DISC1 pathway in schizophrenia. Our results provide support for a model of SZ pathogenesis that includes the effects of multiple rare variants, residing in different vulnerable genes, which may in turn be functionally linked into pathways and networks. This model is consistent with the theory presented by Eyre-Walker , stating that rare alleles should explain most of the variance in complex traits if there is natural selection for the trait. Based on these findings, and as also suggested by McClellan and co-workers , we argue that rare risk alleles may be revealed by research strategies including extensive resequencing of genes previously shown to be informative (e.g. based on a chromosomal translocation, such as DISC1) and, importantly, these genes' functional network.
Assigning potential functional significance to identified variants is a major challenge in genomics research. In this work, a wide array of functional properties was examined to predict possible deleterious effects of the variants. Using these tools, we were able to predict several potential effects on splicing and miRNA target motifs. Yet, alterations of protein structure or function were hard to track down using standard in silico prediction programs, as a majority of the proteins encoded by our candidate genes contain large regions of intrinsically disordered residues. Though amino acid conservation analysis may provide a first hint of potential functional effects, it does not tell the whole story, as disorder-based signaling is a complex process, depending on multiple factors including alterations in protein context, alternative splicing and post-translational modifications , . However, in our opinion, this high prevalence of IDRs in the DISC1 pathway is a very fascinating finding in se, hopefully encouraging further research into this complex area, and providing new clues to our understanding of the complex etiology of SZ and other (psychiatric) disorders. Indeed, as an increasing amount of evidence is beginning to emerge that many important biological functions depend directly on the disordered state, alteration of this disorder may play a crucial role in the pathogenicity of many complex diseases (including SZ), thereby adding another level of complexity to the study of their molecular mechanism, and providing exciting new perspectives for future research.
Materials and Methods
In a first phase of the study we used DNA of 80 SZ patients (41 females, 39 males) and 80 control individuals (40 females, 40 males) for 454 sequencing based variant discovery. These individuals were selected on the basis of their early age at disease-onset (18.55±3.36), from a larger association sample consisting of 486 unrelated SZ patients (180 females, 306 males) and 514 unrelated control individuals (275 females, 239 males). All originated from a geographically isolated population living in the county of Västerbotten in Northern Sweden. They were all Caucasians and none were of Finnish, Norwegian or Lappish descent. All patients fulfilled the DSM-IV criteria for SZ . The mean age at disease-onset in the complete SZ sample was 24.8 (±7.3) years and the mean age at inclusion 53.1 (±15.1) years (see Supporting Text S1 (Materials and Methods section) for additional information regarding the ascertainment and assessment procedure of the patients).
The control population had a mean age of 58.0 (±13.0) years at inclusion. They originated from the same geographical area as the patients and were randomly selected from the Betula study, described in detail elsewhere (http://www.betula.su.se/en/) –. None of the controls were reported to have a diagnosis of schizophrenia based on studies of psychiatric records and/or an interview.
All participants gave written informed consent, and the study was approved by the regional Medical Ethical Committees of the universities of Umeå and Antwerp.
The patient-control sample was controlled for population stratification by the genotyping of 37 microsatellite (STR) markers via the use of standard genotyping and scoring methods. Statistical tests for population stratification were performed using the program STRUCTURE (http://pritch.bsd.uchicago.edu/structure.html). No population substructure was observed in the association sample (data not shown).
DNA samples and pooling
Genomic DNA was extracted from peripheral blood using standard methods.
4 DNA pools were prepared (2 ‘patient pools’ and 2 ‘control pools’), each comprising 40 DNA samples. Hereto, an equal amount of each sample (100 ng per individual) was combined, and the resulting DNA pool was adjusted to a final concentration of 10 ng/µL.
To control the efficiency of DNA pooling, the relative abundance of 3 SNP alleles was measured by pyrosequencing and compared to the allele frequencies of the individual samples constituting the pools (Supporting Text S1 (Materials and Methods section) and Table S2).
Multiplex PCR reactions and 454 sequencing
Multiplex PCR assays were designed to amplify all coding exons and splice junctions of the 11 selected genes (totaling ~16 kb target sequence). The target sequence was covered by 155 amplicons with an average length of ~221 bp, resulting in ~34 kb of sequence. The 155 amplicons were amplified in 12 multiplex PCR reactions. Simplex PCR reactions of the amplicons showed that all except two of the primer pairs (both in ATF5) amplified the correct fragment (conversion rate = 98.7%). The two failed ATF5 amplicons were omitted from further experiments. Multiplex PCR reactions were performed for each DNA pool and 1 individual patient DNA sample, also contained in one of the patient pools (Supporting Text S1 (Materials and Methods section).
Each multiplex PCR reaction was purified on a QIAquick PCR Purification column (Qiagen GmbH, Hilden, Germany), and the concentration of the eluates measured using a Nanodrop spectrophotometer (NanoDrop Technologies, Wilmington, DE). Finally, for each DNA pool (and the individual sample), the 12 purified multiplex PCR products were mixed, taking into account the concentration of each multiplex reaction and its number of constituent amplicons, to obtain an equal representation of every amplicon in the final PCR mixtures.
The final mix of 155 amplicons of each sample was sequenced using the standard amplicon sequencing protocol on a 454 GS-FLX genome sequencer (Roche Applied Science) according to the manufacturer's instructions. For each of the 4 pools, 1 lane of a 2-lane Bead Loading gasket on a 70×75 mm PicoTiterPlate was loaded, and sequenced from both directions. The individual DNA sample was sequenced using 1 lane of a 16-lane Bead Loading gasket.
Variant detection and validation
The generated standard 454 flow files were analyzed using NovoSNP 4.0 (beta), an in-house developed software program for the identification of variants in resequencing experiments. In short, NovoSNP 4.0 uses the quality and height of the flow at the variation position and the neighboring flows for SNP identification. Further, it takes into account the number and ratio of reads showing the variant – thereby allowing for the analysis of pooled sequencing data – and favors variants seen in both directions. Finally, the program creates a database of all identified variants, for which the flows can be visually inspected (De Rijk P., personal communication).
All variants with a frequency ≥0.8% were examined, allowing for a secure cutoff level to detect singleton variants, which theoretically have a frequency of 1.25% in a pool of 40 samples.
Finally, all potential variants (except 4) were validated using iPLEX SNP genotyping in the complete association sample (486 patients and 514 control individuals). For technical reasons, 4 variants were genotyped by traditional Sanger-based sequencing in the original subject population (80 patient and 80 control samples) (Supporting Text S1 (Materials and Methods)).
gPlink version 2.050  (http://pngu.mgh.harvard.edu/purcell/plink/) was used to calculate genotype deviation from Hardy-Weinberg equilibrium (HWE), by an exact test , and to compare individual allele and genotype frequencies between patients and controls, by a standard χ2 test for independence.
Differences in mutation burden (defined as the average number of variant alleles per individual) between patients and controls were assessed by two-sided t-tests, using SPSS version 16.0.2 (Brussels, Belgium). The data were thereby stratified according to type (missense, silent and UTR variants) and MAF (<0.01, <0.02 and <0.05, respectively). Empirical p-values were generated using the max(T) permutation approach, based on 100000 replicates. The level of significance for all statistical tests was 0.05. When correcting for multiple testing, Bonferroni corrective measures were taken to control false positive rates. All association analyses were performed on the complete sample (i.e. including the discovery samples). Contrary to the use of patient samples only for variant discovery, inclusion of an equally large control sample in the discovery phase, does not lead to an inflation of type I errors  (see Supporting Text S1, Discussion section).
In silico functional analyses
To investigate potential deleterious effects associated with the identified variants, each variant was subjected to a battery of in silico analyses, including assessment of nucleotide and amino acid conservation, effects on potential splice sites and cis-acting elements, potential disruption of miRNA target sites and predicted transcription factor binding sites, and alterations of functional and structural properties of the proteins. Finally, sequences where also examined for intrinsically disordered regions using DisProt (http://www.ist.temple.edu/disprot/Predictors.html). A detailed description of the applied methods is provided in Supporting Text S1.
Boxplot showing the distribution of the number of reads per amplicon in each multiplex PCR reaction, for control and patient pools. The observed read count is uniformly distributed across the different amplicon pools (with an average of 1362 reads/amplicon), except for multiplex reaction 12. °: outliers (values between 1.5 and 3x the interquartile range from either end of the box) *: extreme outliers (values more than 3x the interquartile range from either end of the box).
Allele frequencies estimated from pooled DNA samples (as determined by GS-FLX sequencing) versus the actual frequencies (as determined by genotyping the individual samples) in the different pools. °: outlier (corresponding to rs13398676, in SZ pool 1).
Overview of the known interaction domains between the different proteins investigated, along with the positions of the variants identified in this study. Protein lengths are given between brackets. Binding sites between two proteins are indicated along the line connecting them, with the binding site(s) on a certain protein closest to that protein (orange: binding sites on DISC1, blue: binding site on other proteins). The identified missense mutations are shown in a white area within the proteins' oval. Rare missense mutations (MAF ≤1%) are underlined, and mutations located in one of the binding sites are shown in italic. Note that the positions of many of the binding sites were not described/found in literature (indicated with ‘?’).
Schematic representation of the overall domain architecture of each of the proteins investigated, highlighting the regions of predicted disorder (orange), along with regions having known homologous domains (blue). On each protein, the identified missense mutations are indicated. Mutations lying outside a disordered region are marked with an extra line.
DISC1 interaction partners included in the study, and evidence for their involvement in psychiatric disease.
Allele frequencies of pooled DNA and individual samples, as determined by pyrosequencing.
Properties of amplicon subset selected for false negative rate estimation using Sanger sequencing.
Mutation burden of identified rare nonsynonymous mutations (MAF<1%), stratified by gene.
Supplementary Materials and Methods, Results and Discussion.
We are grateful to the patients and family members for their kind cooperation in this study and to the personnel of the Flanders Institute for Biotechnology (VIB) Genetic Service Facility (www.vibgeneticservicefacility.be). Research nurse Eva Lundberg is thankfully acknowledged for her help and expertise. Dr. Del-Favero takes responsibility for the integrity of the data and the accuracy of the data analysis and declares that all authors had full access to all the data in the study.
Conceived and designed the experiments: LNM DG JDF. Performed the experiments: LNM SDZ ASL. Analyzed the data: LNM JR IMC KVS JDF. Contributed reagents/materials/analysis tools: PDR MVDB WG AN LGN KFN RA. Wrote the paper: LNM PDR JR KVS JDF. Designed the software used in analysis: PDR.
- 1. Shih RA, Belmonte PL, Zandi PP (2004) A review of the evidence from family, twin and adoption studies for a genetic contribution to adult psychiatric disorders. Int Rev Psychiatry 16: 260–283.
- 2. Williams HJ, Owen MJ, O'Donovan MC (2009) New findings from genetic association studies of schizophrenia. J Hum Genet 54: 9–14.
- 3. Sanders AR, Duan J, Levinson DF, Shi J, He D, et al. (2008) No significant association of 14 candidate genes with schizophrenia in a large European ancestry sample: implications for psychiatric genetics. Am J Psychiatry 165: 497–506.
- 4. Betcheva ET, Mushiroda T, Takahashi A, Kubo M, Karachanak SK, et al. (2009) Case-control association study of 59 candidate genes reveals the DRD2 SNP rs6277 (C957T) as the only susceptibility factor for schizophrenia in the Bulgarian population. J Hum Genet 54: 98–107.
- 5. St Clair D, Blackwood D, Muir W, Carothers A, Walker M, et al. (1990) Association within a family of a balanced autosomal translocation with major mental illness. Lancet 336: 13–16.
- 6. Callicott JH, Straub RE, Pezawas L, Egan MF, Mattay VS, et al. (2005) Variation in DISC1 affects hippocampal structure and function and increases risk for schizophrenia. Proc Natl Acad Sci U S A 102: 8627–8632.
- 7. Cannon TD, Hennah W, van Erp TG, Thompson PM, Lonnqvist J, et al. (2005) Association of DISC1/TRAX haplotypes with schizophrenia, reduced prefrontal gray matter, and impaired short- and long-term memory. Arch Gen Psychiatry 62: 1205–1213.
- 8. Curtis D, Kalsi G, Brynjolfsson J, McInnis M, O'Neill J, et al. (2003) Genome scan of pedigrees multiply affected with bipolar disorder provides further support for the presence of a susceptibility locus on chromosome 12q23-q24, and suggests the presence of additional loci on 1p and 1q. Psychiatr Genet 13: 77–84.
- 9. Ekelund J, Hennah W, Hiekkalinna T, Parker A, Meyer J, et al. (2004) Replication of 1q42 linkage in Finnish schizophrenia pedigrees. Mol Psychiatry 9: 1037–1041.
- 10. Ekelund J, Hovatta I, Parker A, Paunio T, Varilo T, et al. (2001) Chromosome 1 loci in Finnish schizophrenia families. Hum Mol Genet 10: 1611–1617.
- 11. Hamshere ML, Bennett P, Williams N, Segurado R, Cardno A, et al. (2005) Genomewide linkage scan in schizoaffective disorder: significant evidence for linkage at 1q42 close to DISC1, and suggestive evidence at 22q11 and 19p13. Arch Gen Psychiatry 62: 1081–1088.
- 12. Hennah W, Thomson P, McQuillin A, Bass N, Loukola A, et al. (2009) DISC1 association, heterogeneity and interplay in schizophrenia and bipolar disorder. Mol Psychiatry 14: 865–873.
- 13. Hwu HG, Liu CM, Fann CS, Ou-Yang WC, Lee SF (2003) Linkage of schizophrenia with chromosome 1q loci in Taiwanese families. Mol Psychiatry 8: 445–452.
- 14. Macgregor S, Visscher PM, Knott SA, Thomson P, Porteous DJ, et al. (2004) A genome scan and follow-up study identify a bipolar disorder susceptibility locus on chromosome 1q42. Mol Psychiatry 9: 1083–1090.
- 15. Thomson PA, Harris SE, Starr JM, Whalley LJ, Porteous DJ, et al. (2005) Association between genotype at an exonic SNP in DISC1 and normal cognitive aging. Neurosci Lett 389: 41–45.
- 16. Chen QY, Chen Q, Feng GY, Lindpaintner K, Wang LJ, et al. (2007) Case-control association study of Disrupted-in-Schizophrenia-1 (DISC1) gene and schizophrenia in the Chinese population. J Psychiatr Res 41: 428–434.
- 17. Maeda K, Nwulia E, Chang J, Balkissoon R, Ishizuka K, et al. (2006) Differential expression of disrupted-in-schizophrenia (DISC1) in bipolar disorder. Biol Psychiatry 60: 929–935.
- 18. Palo OM, Antila M, Silander K, Hennah W, Kilpinen H, et al. (2007) Association of distinct allelic haplotypes of DISC1 with psychotic and bipolar spectrum disorders and with underlying cognitive impairments. Hum Mol Genet 16: 2517–2528.
- 19. Qu M, Tang F, Yue W, Ruan Y, Lu T, et al. (2007) Positive association of the Disrupted-in-Schizophrenia-1 gene (DISC1) with schizophrenia in the Chinese Han population. Am J Med Genet B Neuropsychiatr Genet 144B: 266–270.
- 20. Hennah W, Varilo T, Kestila M, Paunio T, Arajarvi R, et al. (2003) Haplotype transmission analysis provides evidence of association for DISC1 to schizophrenia and suggests sex-dependent effects. Hum Mol Genet 12: 3151–3159.
- 21. Perlis RH, Purcell S, Fagerness J, Kirby A, Petryshen TL, et al. (2008) Family-based association study of lithium-related and other candidate genes in bipolar disorder. Arch Gen Psychiatry 65: 53–61.
- 22. Saetre P, Agartz I, De Franciscis A, Lundmark P, Djurovic S, et al. (2008) Association between a disrupted-in-schizophrenia 1 (DISC1) single nucleotide polymorphism and schizophrenia in a combined Scandinavian case-control sample. Schizophr Res 106: 237–241.
- 23. Kilpinen H, Ylisaukko-Oja T, Hennah W, Palo OM, Varilo T, et al. (2008) Association of DISC1 with autism and Asperger syndrome. Mol Psychiatry 13: 187–196.
- 24. Hashimoto R, Numakawa T, Ohnishi T, Kumamaru E, Yagasaki Y, et al. (2006) Impact of the DISC1 Ser704Cys polymorphism on risk for major depression, brain morphology and ERK signaling. Hum Mol Genet 15: 3024–3033.
- 25. Burdick KE, Hodgkinson CA, Szeszko PR, Lencz T, Ekholm JM, et al. (2005) DISC1 and neurocognitive function in schizophrenia. Neuroreport 16: 1399–1402.
- 26. Hennah W, Tuulio-Henriksson A, Paunio T, Ekelund J, Varilo T, et al. (2005) A haplotype within the DISC1 gene is associated with visual memory functions in families with a high density of schizophrenia. Mol Psychiatry 10: 1097–1103.
- 27. Li W, Zhou Y, Jentsch JD, Brown RA, Tian X, et al. (2007) Specific developmental disruption of disrupted-in-schizophrenia-1 function results in schizophrenia-related phenotypes in mice. Proc Natl Acad Sci U S A 104: 18280–18285.
- 28. Austin CP, Ky B, Ma L, Morris JA, Shughrue PJ (2004) Expression of Disrupted-In-Schizophrenia-1, a schizophrenia-associated gene, is prominent in the mouse hippocampus throughout brain development. Neuroscience 124: 3–10.
- 29. Duan X, Chang JH, Ge S, Faulkner RL, Kim JY, et al. (2007) Disrupted-In-Schizophrenia 1 regulates integration of newly generated neurons in the adult brain. Cell 130: 1146–1158.
- 30. Harrison PJ (2004) The hippocampus in schizophrenia: a review of the neuropathological evidence and its pathophysiological implications. Psychopharmacology (Berl) 174: 151–162.
- 31. James R, Adams RR, Christie S, Buchanan SR, Porteous DJ, et al. (2004) Disrupted in Schizophrenia 1 (DISC1) is a multicompartmentalized protein that predominantly localizes to mitochondria. Mol Cell Neurosci 26: 112–122.
- 32. Lipska BK, Peters T, Hyde TM, Halim N, Horowitz C, et al. (2006) Expression of DISC1 binding partners is reduced in schizophrenia and associated with DISC1 SNPs. Hum Mol Genet 15: 1245–1258.
- 33. Ozeki Y, Tomoda T, Kleiderlein J, Kamiya A, Bord L, et al. (2003) Disrupted-in-Schizophrenia-1 (DISC-1): mutant truncation prevents binding to NudE-like (NUDEL) and inhibits neurite outgrowth. Proc Natl Acad Sci U S A 100: 289–294.
- 34. Schurov IL, Handford EJ, Brandon NJ, Whiting PJ (2004) Expression of disrupted in schizophrenia 1 (DISC1) protein in the adult and developing mouse brain indicates its role in neurodevelopment. Mol Psychiatry 9: 1100–1110.
- 35. Porteous DJ, Millar JK (2006) Disrupted in schizophrenia 1: building brains and memories. Trends Mol Med 12: 255–261.
- 36. Chubb JE, Bradshaw NJ, Soares DC, Porteous DJ, Millar JK (2008) The DISC locus in psychiatric illness. Mol Psychiatry 13: 36–64.
- 37. Kamiya A, Kubo K, Tomoda T, Takaki M, Youn R, et al. (2005) A schizophrenia-associated mutation of DISC1 perturbs cerebral cortex development. Nat Cell Biol 7: 1167–1178.
- 38. Miyoshi K, Asanuma M, Miyazaki I, Diaz-Corrales FJ, Katayama T, et al. (2004) DISC1 localizes to the centrosome by binding to kendrin. Biochem Biophys Res Commun 317: 1195–1199.
- 39. Miyoshi K, Honda A, Baba K, Taniguchi M, Oono K, et al. (2003) Disrupted-In-Schizophrenia 1, a candidate gene for schizophrenia, participates in neurite outgrowth. Mol Psychiatry 8: 685–694.
- 40. Brandon NJ, Handford EJ, Schurov I, Rain JC, Pelling M, et al. (2004) Disrupted in Schizophrenia 1 and Nudel form a neurodevelopmentally regulated protein complex: implications for schizophrenia and other major neurological disorders. Mol Cell Neurosci 25: 42–55.
- 41. Kamiya A, Tan PL, Kubo K, Engelhard C, Ishizuka K, et al. (2008) Recruitment of PCM1 to the centrosome by the cooperative action of DISC1 and BBS4: a candidate for psychiatric illnesses. Arch Gen Psychiatry 65: 996–1006.
- 42. Millar JK, Pickard BS, Mackie S, James R, Christie S, et al. (2005) DISC1 and PDE4B are interacting genetic factors in schizophrenia that regulate cAMP signaling. Science 310: 1187–1191.
- 43. Morris JA, Kandpal G, Ma L, Austin CP (2003) DISC1 (Disrupted-In-Schizophrenia 1) is a centrosome-associated protein that interacts with MAP1A, MIPT3, ATF4/5 and NUDEL: regulation and loss of interaction with mutation. Hum Mol Genet 12: 1591–1608.
- 44. Ogawa F, Kasai M, Akiyama T (2005) A functional link between Disrupted-In-Schizophrenia 1 and the eukaryotic translation initiation factor 3. Biochem Biophys Res Commun 338: 771–776.
- 45. Shinoda T, Taya S, Tsuboi D, Hikita T, Matsuzawa R, et al. (2007) DISC1 regulates neurotrophin-induced axon elongation via interaction with Grb2. J Neurosci 27: 4–14.
- 46. Taya S, Shinoda T, Tsuboi D, Asaki J, Nagai K, et al. (2007) DISC1 regulates the transport of the NUDEL/LIS1/14-3-3epsilon complex through kinesin-1. J Neurosci 27: 15–26.
- 47. Camargo LM, Collura V, Rain JC, Mizuguchi K, Hermjakob H, et al. (2007) Disrupted in Schizophrenia 1 Interactome: evidence for the close connectivity of risk genes and a potential synaptic basis for schizophrenia. Mol Psychiatry 12: 74–86.
- 48. Millar JK, Christie S, Porteous DJ (2003) Yeast two-hybrid screens implicate DISC1 in brain development and function. Biochem Biophys Res Commun 311: 1019–1025.
- 49. Ross CA, Margolis RL, Reading SA, Pletnikov M, Coyle JT (2006) Neurobiology of schizophrenia. Neuron 52: 139–153.
- 50. McClellan JM, Susser E, King MC (2007) Schizophrenia: a common disease caused by multiple rare alleles. Br J Psychiatry 190: 194–199.
- 51. Bodmer W, Bonilla C (2008) Common and rare variants in multifactorial susceptibility to common diseases. Nat Genet 40: 695–701.
- 52. Ingman M, Gyllensten U (2009) SNP frequency estimation using massively parallel sequencing of pooled DNA. Eur J Hum Genet 17: 383–386.
- 53. Druley TE, Vallania FL, Wegner DJ, Varley KE, Knowles OL, et al. (2009) Quantification of rare allelic variants from pooled genomic DNA. Nat Methods 6: 263–265.
- 54. Hennah W, Thomson P, Peltonen L, Porteous D (2006) Genes and schizophrenia: beyond schizophrenia: the role of DISC1 in major mental illness. Schizophr Bull 32: 409–416.
- 55. Nakata K, Lipska BK, Hyde TM, Ye T, Newburn EN, et al. (2009) DISC1 splice variants are upregulated in schizophrenia and associated with risk polymorphisms. Proc Natl Acad Sci U S A 106: 15873–15878.
- 56. Nackley AG, Shabalina SA, Tchivileva IE, Satterfield K, Korchynskyi O, et al. (2006) Human catechol-O-methyltransferase haplotypes modulate protein expression by altering mRNA secondary structure. Science 314: 1930–1933.
- 57. D'Souza I, Poorkaj P, Hong M, Nochlin D, Lee VM, et al. (1999) Missense and silent tau gene mutations cause frontotemporal dementia with parkinsonism-chromosome 17 type, by affecting multiple alternative RNA splicing regulatory elements. Proc Natl Acad Sci U S A 96: 5598–5603.
- 58. Kimchi-Sarfaty C, Oh JM, Kim IW, Sauna ZE, Calcagno AM, et al. (2007) A “silent” polymorphism in the MDR1 gene changes substrate specificity. Science 315: 525–528.
- 59. Song W, Li W, Feng J, Heston LL, Scaringe WA, et al. (2008) Identification of high risk DISC1 structural variants with a 2% attributable risk for schizophrenia. Biochem Biophys Res Commun 367: 700–706.
- 60. Childs B, Scriver CR (1986) Age at onset and causes of disease. Perspect Biol Med 29: 437–460.
- 61. Addington AM, Gornick M, Duckworth J, Sporn A, Gogtay N, et al. (2005) GAD1 (2q31.1), which encodes glutamic acid decarboxylase (GAD67), is associated with childhood-onset schizophrenia and cortical gray matter volume loss. Mol Psychiatry 10: 581–588.
- 62. Addington AM, Gornick M, Sporn AL, Gogtay N, Greenstein D, et al. (2004) Polymorphisms in the 13q33.2 gene G72/G30 are associated with childhood-onset schizophrenia and psychosis not otherwise specified. Biol Psychiatry 55: 976–980.
- 63. Addington AM, Gornick MC, Shaw P, Seal J, Gogtay N, et al. (2007) Neuregulin 1 (8p12) and childhood-onset schizophrenia: susceptibility haplotypes for diagnosis and brain developmental trajectories. Mol Psychiatry 12: 195–205.
- 64. Gornick MC, Addington AM, Sporn A, Gogtay N, Greenstein D, et al. (2005) Dysbindin (DTNBP1, 6p22.3) is associated with childhood-onset psychosis and endophenotypes measured by the Premorbid Adjustment Scale (PAS). J Autism Dev Disord 35: 831–838.
- 65. Vyas NS, Patel NH, Puri BK (2011) Neurobiology and phenotypic expression in early onset schizophrenia. Early Interv Psychiatry 5: 3–14.
- 66. Walsh T, McClellan JM, McCarthy SE, Addington AM, Pierce SB, et al. (2008) Rare structural variants disrupt multiple genes in neurodevelopmental pathways in schizophrenia. Science 320: 539–543.
- 67. Dunker AK, Brown CJ, Lawson JD, Iakoucheva LM, Obradovic Z (2002) Intrinsic disorder and protein function. Biochemistry 41: 6573–6582.
- 68. Dunker AK, Obradovic Z (2001) The protein trinity--linking function and disorder. Nat Biotechnol 19: 805–806.
- 69. Dyson HJ, Wright PE (2005) Intrinsically unstructured proteins and their functions. Nat Rev Mol Cell Biol 6: 197–208.
- 70. Cheng Y, LeGall T, Oldfield CJ, Dunker AK, Uversky VN (2006) Abundance of intrinsic disorder in protein associated with cardiovascular disease. Biochemistry 45: 10448–10460.
- 71. Iakoucheva LM, Brown CJ, Lawson JD, Obradovic Z, Dunker AK (2002) Intrinsic disorder in cell-signaling and cancer-associated proteins. J Mol Biol 323: 573–584.
- 72. Midic U, Oldfield CJ, Dunker AK, Obradovic Z, Uversky VN (2009) Protein disorder in the human diseasome: unfoldomics of human genetic diseases. BMC Genomics 10: Suppl 1S12.
- 73. Raychaudhuri S, Dey S, Bhattacharyya NP, Mukhopadhyay D (2009) The role of intrinsically unstructured proteins in neurodegenerative diseases. PLoS One 4: e5566.
- 74. Uversky VN, Oldfield CJ, Dunker AK (2008) Intrinsically disordered proteins in human diseases: introducing the D2 concept. Annu Rev Biophys 37: 215–246.
- 75. Uversky VN, Oldfield CJ, Midic U, Xie H, Xue B, et al. (2009) Unfoldomics of human diseases: linking protein intrinsic disorder with diseases. BMC Genomics 10: Suppl 1S7.
- 76. Dunker AK, Cortese MS, Romero P, Iakoucheva LM, Uversky VN (2005) Flexible nets. The roles of intrinsic disorder in protein interaction networks. FEBS J 272: 5129–5148.
- 77. Dunker AK, Oldfield CJ, Meng J, Romero P, Yang JY, et al. (2008) The unfoldomics decade: an update on intrinsically disordered proteins. BMC Genomics 9: Suppl 2S1.
- 78. Kim PM, Sboner A, Xia Y, Gerstein M (2008) The role of disorder in interaction networks: a structural analysis. Mol Syst Biol 4: 179.
- 79. Romero PR, Zaidi S, Fang YY, Uversky VN, Radivojac P, et al. (2006) Alternative splicing in concert with protein intrinsic disorder enables increased functional diversity in multicellular organisms. Proc Natl Acad Sci U S A 103: 8390–8395.
- 80. Mohan A, Uversky VN, Radivojac P (2009) Influence of sequence changes and environment on intrinsically disordered proteins. PLoS Comput Biol 5: e1000497.
- 81. Schaefer C, Schlessinger A, Rost B (2010) Protein secondary structure appears to be robust under in silico evolution while protein disorder appears not to be. Bioinformatics 26: 625–631.
- 82. Di Giorgio A, Blasi G, Sambataro F, Rampino A, Papazacharias A, et al. (2008) Association of the SerCys DISC1 polymorphism with human hippocampal formation gray matter and function during memory encoding. Eur J Neurosci 28: 2129–2136.
- 83. Leliveld SR, Hendriks P, Michel M, Sajnani G, Bader V, et al. (2009) Oligomer assembly of the C-terminal DISC1 domain (640-854) is controlled by self-association motifs and disease-associated polymorphism S704C. Biochemistry 48: 7746–7755.
- 84. Eastwood SL, Hodgkinson CA, Harrison PJ (2009) DISC-1 Leu607Phe alleles differentially affect centrosomal PCM1 localization and neurotransmitter release. Mol Psychiatry 14: 556–557.
- 85. Eyre-Walker A (2010) Evolution in health and medicine Sackler colloquium: Genetic architecture of a complex trait and its implications for fitness and genome-wide association studies. Proc Natl Acad Sci U S A 107: Suppl 11752–1756.
- 86. Mittag T, Kay LE, Forman-Kay JD (2010) Protein dynamics and conformational disorder in molecular recognition. J Mol Recognit 23: 105–116.
- 87. American Psychiatry Association (1994) Diagnostic and Statistical Manual of Mental Disorders. Washington, DC: American Psychiatric Press.
- 88. Nilsson L-G, Bäckman L, Erngrund K, Nyberg L, Adolfsson R, et al. (1997) The Betula prospective cohort study: memory, health and aging. Aging Neuropsychol Cognition 4: 1–32.
- 89. Nilsson L-G, Adolfsson R, Bäckman L, de Frias CM, Molander B, et al. (2004) Betula: A prospective cohort study on memory, health and agin. Aging Neuropsychol Cognition 11: 134–148.
- 90. Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, et al. (2007) PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet 81: 559–575.
- 91. Wigginton JE, Cutler DJ, Abecasis GR (2005) A note on exact tests of Hardy-Weinberg equilibrium. Am J Hum Genet 76: 887–893.
- 92. Li B, Leal SM (2009) Discovery of rare variants via sequencing: implications for the design of complex trait association studies. PLoS Genet 5: e1000481.