Structural genetic changes, especially copy number variants (CNVs), represent a major source of genetic variation contributing to human disease. Tetralogy of Fallot (TOF) is the most common form of cyanotic congenital heart disease, but to date little is known about the role of CNVs in the etiology of TOF. Using high-resolution genome-wide microarrays and stringent calling methods, we investigated rare CNVs in a prospectively recruited cohort of 433 unrelated adults with TOF and/or pulmonary atresia at a single centre. We excluded those with recognized syndromes, including 22q11.2 deletion syndrome. We identified candidate genes for TOF based on converging evidence between rare CNVs that overlapped the same gene in unrelated individuals and from pathway analyses comparing rare CNVs in TOF cases to those in epidemiologic controls. Even after excluding the 53 (10.7%) subjects with 22q11.2 deletions, we found that adults with TOF had a greater burden of large rare genic CNVs compared to controls (8.82% vs. 4.33%, p = 0.0117). Six loci showed evidence for recurrence in TOF or related congenital heart disease, including typical 1q21.1 duplications in four (1.18%) of 340 Caucasian probands. The rare CNVs implicated novel candidate genes of interest for TOF, including PLXNA2, a gene involved in semaphorin signaling. Independent pathway analyses highlighted developmental processes as potential contributors to the pathogenesis of TOF. These results indicate that individually rare CNVs are collectively significant contributors to the genetic burden of TOF. Further, the data provide new evidence for dosage sensitive genes in PLXNA2-semaphorin signaling and related developmental processes in human cardiovascular development, consistent with previous animal models.
Congenital heart disease affects nearly 1% of all live births. Tetralogy of Fallot (TOF) is the most common form of cyanotic congenital heart disease. This condition is associated with hemizygous deletions of chromosome 22q11.2 and chromosomal trisomies, but little else is known about the genetic heterogeneity of this complex disease. We used high-resolution microarrays and stringent methods to study structural (copy number) variants in a systematically phenotyped cohort of unrelated adults with TOF. We found that individually rare genic copy number variants (CNVs) were collectively significant contributors to the genetic burden in TOF. Among CNVs that implicated candidate genes of interest were loss CNVs overlapping the PLXNA2 gene that codes for plexin A2. This is the first study to show a role for this semaphorin receptor in human congenital heart disease, consistent with a Plxna2 mouse knockout phenotype. Pathway analyses comparing rare exonic loss CNVs in the TOF sample to controls implicated other novel gene sets that suggest new pathogenetic mechanisms.
Citation: Silversides CK, Lionel AC, Costain G, Merico D, Migita O, Liu B, et al. (2012) Rare Copy Number Variations in Adults with Tetralogy of Fallot Implicate Novel Risk Gene Pathways. PLoS Genetics 8(8): e1002843. https://doi.org/10.1371/journal.pgen.1002843
Editor: Dianna M. Milewicz, University of Texas Medical School, United States of America
Received: February 21, 2012; Accepted: May 29, 2012; Published: August 9, 2012
Copyright: © Silversides et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This work was supported by Canadian Institutes of Health Research grants (MOP-89066 and MOP-111238), a Canada Research Chair in Schizophrenia Genetics and Genomic Disorders (ASB), and a McLaughlin Centre Accelerator Grant. SWS is supported by grants from the University of Toronto McLaughlin Centre, NeuroDevNet, Genome Canada and the Ontario Genomics Institute, the Canadian Institutes for Health Research (CIHR), the Canadian Institute for Advanced Research, the Canada Foundation for Innovation, the government of Ontario, Autism Speaks, and The Hospital for Sick Children Foundation. SWS holds the GlaxoSmithKline-CIHR Chair in Genome Sciences at the University of Toronto and The Hospital for Sick Children. ACL holds a NeuroDevNet doctoral fellowship. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Tetralogy of Fallot (TOF) is the most common form of cyanotic congenital cardiac disease in humans. With surgical advances and increased longevity, attention has shifted from immediate outcomes to understanding causation. However, for most patients with TOF, the genetic basis for the disease remains unknown. Recently, there has been a focus on unbalanced structural genomic changes, or copy number variants (CNVs), and disease. Copy number variation contributes to the genetic heterogeneity of many complex human diseases. Investigation of CNVs that overlap genes has led to the discovery of novel etiologies and disease pathways, especially for developmental disorders. Our current understanding of the role of CNVs in the etiology of TOF, however, is limited. Early reports of CNVs in subjects with various types of congenital cardiac conditions, using low resolution methods, suggested that CNVs may be important, but there is just one report of genome-wide CNVs in 111 TOF patients using a high resolution microarray. We used a high resolution genome-wide microarray and proven methods to: a) investigate the burden of rare CNVs in TOF compared to controls, b) identify putative candidate genes associated with rare and recurrent CNVs and c) assess, using a pathway analysis, whether the exonic CNVs found in TOF could identify functional gene sets relevant to cardiac development.
Of the 495 unrelated adults with TOF recruited, 53 (10.7%; including 49 of European ancestry) had 22q11.2 deletions associated with 22q11.2 deletion syndrome, four had chromosomal anomalies detectable on karyotype (two with XXY, one with XXX and one with a 16 Mb 18q22 deletion) and five had previously diagnosed genetic conditions for which clinical genetic testing is in progress (three with Holt-Oram syndrome, one with CHARGE syndrome and one with VACTERL association). The remaining 433 adults [239 (55.2%) male] formed the CNV discovery sample [mean age 32.56 (SD 12.29) years]; 45 (10.39%) had pulmonary atresia and 57 (13.16%) were in the syndromic group.
Using a strict CNV analysis strategy, we detected 63 CNVs on average per genome in TOF cases with a median size of 18,020 (range 397–5,997,249) bp, similar to results for the controls (Tables 1, 2 and 3 in Supporting Information S1). To minimize false positives, we focused on rare CNVs using a conservative definition (<0.1% in population-based controls), and employed identical methods for both cases and the independent Ontario Population Genomics Platform (OPGP) controls used for case-control analyses (see Methods). The main quantitative analyses involved only those subjects of European ancestry. We compared rare CNVs in the 340 TOF cases and 416 OPGP controls of European ancestry. To assess the experimental reproducibility of rare CNVs after in silico detection, we tested 68 CNVs across different size ranges using quantitative PCR (qPCR). We observed a high true positive validation rate of 65/68 (95.6%), consistent with our previous studies.
Rare CNV burden in TOF
We first compared the CNV burden of large (>500 kb) rare CNVs in the TOF cases and the OPGP controls. Consistent with our hypothesis, a significantly greater proportion of cases harbored large rare CNVs compared to controls (OR 1.89, 95% CI 1.06–3.35, p = 0.0278) (Table 1). This was most notable for differences in large gains that overlapped exons (OR 2.54, 95% CI 1.17–5.50, p = 0.0148). However, if the 49 individuals with TOF and 22q11.2 deletions of European ancestry had been included, the odds ratio for large rare exonic losses would also have been significantly higher compared to controls (OR 10.87, 95% CI 4.80–24.08, p<0.0001). In contrast, the overall quantitative burden of rare CNVs of any size was similar between the TOF group and OPGP controls; most TOF and OPGP control subjects had one or more rare CNVs (Table 1). When CNV burden for individuals was defined as having two or more rare CNVs, there was no significant difference between cases and controls (data not shown).
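As an illustration of the kind of case-control burden comparison described above, the following minimal sketch computes an odds ratio with a Wald 95% confidence interval and a two-sided Fisher exact p-value from a 2×2 table. The counts (30 of 340 cases, 18 of 416 controls) are an assumption back-calculated from the 8.82% and 4.33% reported in the abstract for large rare genic CNVs; they are not taken from Table 1, and the statistical test actually used in the study is not stated in this text.

```python
import math

def odds_ratio_with_ci(a, b, c, d, z=1.96):
    """Odds ratio and Wald CI for a 2x2 table.

    a = cases with CNV, b = cases without,
    c = controls with CNV, d = controls without.
    """
    odds_ratio = (a * d) / (b * c)
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper

def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher exact p-value: sum of all table probabilities
    no larger than that of the observed table (margins held fixed)."""
    total = a + b + c + d
    n_cases = a + b
    n_carriers = a + c
    denom = math.comb(total, n_carriers)

    def pmf(k):
        # Hypergeometric probability of k carriers among the cases.
        return (math.comb(n_cases, k)
                * math.comb(total - n_cases, n_carriers - k) / denom)

    p_observed = pmf(a)
    k_min = max(0, n_carriers - (total - n_cases))
    k_max = min(n_cases, n_carriers)
    p = sum(pmf(k) for k in range(k_min, k_max + 1)
            if pmf(k) <= p_observed * (1 + 1e-9))
    return min(1.0, p)

# Assumed counts, reconstructed from the reported percentages.
cases_with, cases_total = 30, 340
controls_with, controls_total = 18, 416

or_value, ci_low, ci_high = odds_ratio_with_ci(
    cases_with, cases_total - cases_with,
    controls_with, controls_total - controls_with)
p_value = fisher_exact_two_sided(
    cases_with, cases_total - cases_with,
    controls_with, controls_total - controls_with)
print(f"OR {or_value:.2f}, 95% CI {ci_low:.2f}-{ci_high:.2f}, p = {p_value:.4f}")
```

A Wald interval is used here only because it is simple to reproduce by hand; exact or profile-likelihood intervals may differ slightly for small cell counts.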
Rare CNV burden in TOF subgroups
For those subjects with TOF, large rare CNVs were enriched in the syndromic subgroup; however, this difference reached statistical significance only for subjects with large rare exonic losses (OR 9.53, 95% CI 2.89–31.41, p = 0.0004) (Table 1). These results would have been even more significant if individuals with 22q11.2 deletions had been included (data not shown). When individuals with one or more rare exonic loss CNVs of any size were considered, results were still significant but with a smaller OR (OR 2.69, 95% CI 1.35–4.60, p = 0.0013). A further TOF subgroup analysis comparing those with and without pulmonary atresia showed no significant enrichment of individuals with rare exonic loss CNVs in those with pulmonary atresia [48% (16 of 33) vs. 37% (115 of 307), p = 0.2162].
Large rare CNVs in TOF
Table 2 shows the 47 large (>500 kb) rare CNVs found in 43 of the 433 adults with TOF in the discovery sample. Most (39/47) were very rare, i.e., not found in any of 2,773 controls (2,357 population controls and 416 OPGP controls), and all but three overlapped genes. Several of these loci showed evidence for recurrence in TOF. The most compelling were 1q21.1 duplications (Figure 2 in Supporting Information S1) (OMIM #612475) identified in four (1.18%) of 340 subjects of European ancestry. None met our syndromic criteria; however, detailed examination of the phenotype revealed macrocephaly in two and tall stature in another of these subjects.
There were two other subjects in the non-syndromic subgroup with genomic disorders at loci previously associated with congenital cardiac disease: one proband with a previously undetected 22q11.21 duplication (OMIM #608363) and another with a typical 16p11.2 duplication (OMIM #611913).
Amongst other large rare CNVs of note, one proband with syndromic features had a novel tandem duplication-deletion in the 18q22.3-q23 region that was transmitted to her daughter. Both had TOF, learning difficulties, short stature, obesity, and thyroid disease. This complex CNV overlapped the region involved in the 18q22 deletion syndrome, e.g., the 16 Mb deletion of a subject excluded from our TOF cohort (Figure 3 in Supporting Information S1). There are three candidate genes in the distal end of the 3.5 Mb 18q23 deletion that may have an impact on cardiac development and/or are implicated by a relevant family of genes (Table 2, Figure 3 in Supporting Information S1): NFATC1, PARD6G and SALL3. Another proband with syndromic features had a 1q41 deletion that may overlap the region of a translocation reported in a patient with TOF and possibly the 1q41 deletion region (OMIM #612530) associated with holoprosencephaly 10.
Smaller very rare CNVs identifying genes of interest for TOF
The first section of Table 3 shows the smaller (<500 kb) very rare CNVs in the TOF sample that implicate specific candidate genes of interest at loci associated with TOF, including: GJA5 in the 1q21.1 duplication region (Figure 2 in Supporting Information S1), CDH19 at 18q22.1 (Figure 3 in Supporting Information S1), NBEA at 13q13.3 and ANGPT2 at 8p23.1. Other candidate genes highlighted through overlap with results from other studies include: CECR5 in the cat eye syndrome region, RAF1 involved in Noonan syndrome and PPM1K.
Table 3 also shows novel very rare (not found in 2,773 controls) CNVs overlapping genes with evidence for cardiovascular involvement. Two unrelated probands had 1q32.2 loss CNVs overlapping the PLXNA2 gene (Figure 1), which were confirmed by qPCR and sequencing across the junction breakpoints. Plexins play an important role in cardiac development, including cardiac neural crest cell migration and outflow tract morphogenesis. We therefore resequenced PLXNA2 exons and splice sites in a subset (n = 192) of the TOF cases of European ancestry. This yielded nine missense variants but no additional nonsense or frame-shift mutations that would lead to haploinsufficiency of the gene (Table 6 in Supporting Information S1). No point mutations were detected in the two individuals with PLXNA2 deletions and in silico inspection of the intronic CNV revealed no conclusive evidence of regulatory region disruption. Two other loss CNVs involved adjacent semaphorin genes at 7q21.11 with previous evidence for structural cardiac phenotypes (Table 3). One overlapped three exons of the SEMA3D gene coding for semaphorin 3D and the other overlapped the first intron of the SEMA3E gene, previously associated with CHARGE syndrome.
Figure 1. Solid and open bars represent gains and losses, respectively; genomic parameters from NCBI Build 36.
We also identified a group of four subjects with novel small rare CNVs containing genes associated with ciliary dysmotility: DNAH11 (n = 2), BBS9 (n = 1) and SNX8 (n = 1). Primary ciliary dyskinesia has several genetic causes, including mutations in DNAH11, a gene coding for a dynein heavy chain component of the axoneme, the inner cytoskeletal core of cilia (OMIM #6033). Similarly, BBS9 is one of 14 genes known to be responsible for Bardet-Biedl syndrome, a multisystem disorder. Loss of ciliary function results in a multisystem disease and loss of function during embryogenesis can lead to congenital cardiac lesions, typically abnormalities of cardiac situs (heterotaxy), and less commonly TOF.
FGF10 was another plausible candidate gene for human congenital cardiac disease, implicated by a very rare exonic loss CNV. FGF10 codes for fibroblast growth factor 10, a protein with dosage sensitive expression in several aspects of early murine cardiovascular development. Notably, the loss CNV involved the entire gene, and thus would encompass an evolutionarily conserved cis-regulatory module in intron 1 of the FGF10 gene recently reported to be functional during human cardiac development. The proband with this CNV did not meet criteria for lacrimoauriculodentodigital (LADD) syndrome (OMIM #149730) or autosomal dominant aplasia of lacrimal and salivary glands (ALSG; OMIM #180920), conditions associated with point mutations in FGF10 coding regions that may have different expression from that of an intronic point mutation or a structural variant alone.
In pathway analyses testing of case and control subjects with rare CNVs that overlapped 6 or fewer genes, only exonic losses led to significant results for the gene-set test (permutation FDR ≤ 27.5%, nominal p-value ≤ 0.05) (Table 7 in Supporting Information S1), in line with previous findings for autism. Nineteen gene-sets passed the significance thresholds (Table 8 in Supporting Information S1) and were selected for visualization (Figure 2). The gene-sets identified belonged to five overlapping functional clusters, representing both those expected and more novel (Figure 2): vasculature development (p = 0.0351), chromosome organization (p = 0.0224), cell motility (p = 0.0224), chemotaxis (p = 0.0440) and neuron projection and development (p = 0.0440). We also selected the three top-scoring previously reported TOF disease genes (GATA4, NKX2-5 and TBX5) and identified as potential disease candidates their high-confidence functional neighbors affected by a rare exonic CNV in two or more cases and none in controls (Table 10 and Table 11 in Supporting Information S1). PLXNA2-semaphorins was the only gene-set found exclusively in our systematic CNV review.
Figure 2. Diagram of results of pathway analyses comparing rare CNVs in cases and controls. Five overlapping functional clusters involved 19 gene-sets; functional neighbors of three known candidate genes identified another cluster (circle size indicates relative number of cases involved).
Integrating the results of pathway analysis and systematic CNV review, we identified potentially important convergences (Figure 2). GJA5 was found in all disease gene neighborhoods. ANGPT2 and FGF10 were found in vasculature development, cell motility, chemotaxis and in association with at least one of the disease genes. PLXNA2 was found in cell motility, chemotaxis and neuron projection and development. In contrast, HDAC9 was a novel gene identified by gene-set association (vasculature development and other clusters) and disease gene neighbor analysis (NKX2-5), but not in our systematic CNV review. Figure 4 in Supporting Information S1 presents results of a manual review of further lines of evidence to reconstruct putative regulatory relations between the candidate genes in a potential disease pathway.
Copy number changes appear to be important genetic variants contributing to the etiology of TOF, with rare exonic losses occurring more frequently in patients with TOF than in controls. Many CNVs associated with TOF appear to disrupt gene pathways that control cell migration and vasculature development, both potentially important in cardiac development. Notably, several plausible candidate genes for TOF were implicated in humans for the first time, including the PLXNA2 gene and related pathways. Our findings suggest that individually rare structural genomic changes are important contributors to the collective genetic burden of TOF.
Based on recommendations from the International Standard Cytogenomic Array (ISCA) Consortium, it is now suggested that chromosomal microarrays be used as the first-tier diagnostic test for patients with multiple congenital anomalies and/or unexplained developmental delay. This recommendation is based on the higher diagnostic yield of genetic testing, specifically as it relates to the high sensitivity of detecting submicroscopic deletions and duplications. Many patients seen in adult congenital cardiac clinics, including those with TOF, will meet these criteria. Notably, our data suggest that clinical screening for syndromic features will likely be insufficient to identify patients with large, pathogenic gains. In contrast, large rare losses may more often be associated with complex phenotypes. In the next decade, many more CNVs associated with congenital heart disease will likely be discovered. The results of the current study will contribute to the strategies used to assess the pathogenicity of a CNV for TOF.
After 22q11.2 deletions, the most common large rare CNV in our TOF cohort was the recurrent 1q21.1 duplication. The 1.18% prevalence of the 1q21.1 duplication is consistent with results using a targeted assay in two previous studies of TOF and with early reports of CNVs at the 1q21.1 locus where variable expression included TOF and neuropsychiatric conditions. Congenital cardiac defects that have been reported to be associated with 1q21.1 duplications include TOF, ventricular septal defect, univentricular heart and unspecified complex congenital cardiac disease. Results of the current study, including pathway results and a very rare CNV overlapping GJA5 in this 1q21.1 CNV region (Figure 2 in Supporting Information S1, Table 3), add to previous studies that implicated GJA5 as a promising candidate gene for TOF. GJA5 codes for connexin40, a gap junction protein in a protein family known to be important in cardiac development and shown to be associated with TOF in mice. Point mutations in GJA5 have also been reported in patients with arrhythmias.
This is the first study to report 1q32.2 deletions at the PLXNA2 locus in patients with congenital heart disease. The PLXNA2 gene codes for a transmembrane protein, plexin A2. Plexin A2 is a receptor for semaphorin 3C, which acts as a guidance molecule and is necessary for neural crest influx and endothelial cell function during outflow tract septation. In PLXNA2 knockout mice, congenital cardiac defects, including TOF, have been described. Interactions with other plexins and transcription factors that control neural crest cell migration may also be important in the development of congenital cardiac lesions.
There are multiple semaphorin-plexin pathways. We identified two subjects with loss CNVs involving semaphorin genes: a 7q21.11 deletion overlapping three exons of semaphorin 3D (SEMA3D) and a 7q21.11 deletion intronic to semaphorin 3E (SEMA3E) (Table 3). Semaphorin 3D has been shown to be expressed in the cardiac cushions of chick heart and ventricular trabeculae. Semaphorin 3E is involved in modulating the NOTCH signaling pathway via a VEGF feedback mechanism. Although most commonly due to mutations involving the chromodomain helicase DNA-binding protein-7, CHARGE syndrome can be caused by mutations in the SEMA3E gene.
These CNV-related results direct attention to novel genes potentially involved in cardiac development in humans, and extend data from previous animal and human studies. For example, the migration of cardiac neural crest cells into the outflow tract is a process orchestrated, in part, by PLXNA2 signaling. PLXNA2-semaphorin signaling is also implicated in guidance of both blood vessels and nerves. Placed in this context, gene-set clusters labeled “Neuron projection and development” are compelling candidates for importance in cardiac development. Our findings implicating ciliary genes are also consistent with involvement of processes such as migration of cardiac neural crest into the outflow tract and parallel guidance of blood vessels and nerves in development. A further novel finding, from the pathway analysis, implicated HDAC9, a gene previously linked to muscle development and cardiac hypertrophy in mouse and human.
There is a known association between ciliopathies and congenital heart disease. In the current study, we found four cases with rare CNVs overlapping genes responsible for ciliary motility disorders: primary ciliary dyskinesia, Bardet-Biedl syndrome and 7p22 deletion syndrome. Two subjects had CNVs overlapping the DNAH11 gene, one of the genes responsible for primary ciliary dyskinesia. The cardiac lesion in this condition is believed to be due to an abnormality of nodal ciliary motility during development. In addition to abnormalities of cardiac situs (left and right sided heterotaxy), other congenital lesions including TOF have been reported in humans and mice. Another individual with TOF had a very rare loss CNV that overlaps exons 10–21 of BBS9. Although the exact function of the BBS9 gene has not yet been determined, mutations in this gene are known to cause Bardet-Biedl syndrome, another classic ciliopathy that can affect multiple systems and has a highly variable phenotype. Deletion of exons 5–20 in the BBS9 gene was recently reported to have a severe phenotype but this did not include a congenital cardiac defect. The SNX8 gene, coding for sorting nexin 8, lies in the region of overlap of two previously reported 7p22.2 deletions with associated cardiac malformations, including one with TOF. Although the role of SNX8 in development is unknown, sorting nexins have recently been implicated in ciliogenesis. Because of the variable expressivity of the phenotype in ciliopathies, it is possible that patients with congenital cardiac lesions, including TOF, who have these conditions may be undiagnosed.
Most of these CNV findings remain in the research realm, with functional studies essential to determine the true role of such variants and candidate genes in cardiac maldevelopment. Even the possible role for 1q21.1 duplications in the genetic burden of TOF, detectable by clinical genome-wide and targeted microarrays, requires more data to delineate the associated breadth and penetrance of cardiac and extracardiac expression. There remains a large cohort of adults with TOF of, as yet, unknown etiology. The candidate genes and pathways identified in our study should help to inform subsequent genetic, including sequencing-based, studies of TOF. Most other variants identified will be rarer and of as yet uncertain clinical significance. Nevertheless, these results contribute to our understanding of pathogenesis in TOF, a crucial step towards future clinical applicability of genetic investigations.
Advantages and limitations
This is the first study of genome-wide CNVs in TOF to use a well-characterized cohort of adult patients, stringent molecular methods, and multiple converging analyses. We have identified novel candidate genes for TOF, in addition to providing replication of previous findings, including some from a smaller genome-wide study of CNVs in TOF. There are, however, limitations to this novel study. The conservative laboratory and CNV analytic methods used, including the restricted focus on rare CNVs at the <0.1% level, may have resulted in missing some rare variants of interest. However, the fact that we used the same approach and adjudication control set to determine rarity meant that our a priori decision to minimize false positives, at the expense of such false negatives, would be expected to affect both cases and controls equally. Although our results overlapped certain previously described CNVs, further replication studies will be important to help define the significance and relative prevalence of the novel rare CNVs identified in this study. Large, multicentre studies may be useful, provided that comparable phenotyping and stringent quality control methods, as highlighted in this study, are maintained. Meta-analyses could clarify if the lack of evidence for two or more rare CNVs per subject, as previously found for 22q11.2 deletions, is due to insufficient power. Family studies are also needed to delineate inherited or de novo status and segregation patterns of CNVs. These data will be essential to determine the true penetrance and variable expression of individual CNVs. Examining CNVs in patients with other forms of conotruncal defects or other forms of congenital heart disease may also be informative, and could reveal a genetically-related spectrum of clinically-distinct cardiac maldevelopment as is increasingly appreciated for, e.g., neuropsychiatric disorders.
Other study designs, e.g., using whole genome sequencing, will be needed to fully delineate the genetic architecture of TOF, including detection of relevant sequence-based mutations, such as those in non-coding regulatory regions that may be important for cardiac development. Pathway analyses were restricted to subjects with rare CNVs overlapping 6 or fewer genes, and insufficient numbers precluded separate analyses involving only large rare CNVs. However, pathway results were similar when subjects with multigenic CNVs overlapping >6 genes were included (data not shown). Lastly, proving causality of specific genetic variants is beyond the scope of this study and more evidence, including replication of association in independent cohorts, will be needed to corroborate our putative candidate genes for tetralogy of Fallot. Fortunately, the functional significance of several key candidate genes implicated by our CNV results has already been validated in model organisms such as mice and zebrafish.
In addition to well known 22q11.2 deletions, other structural genomic changes appear to be important contributors to the genetic heterogeneity of TOF. In particular, these include 1q21.1 duplications and other rare copy number changes that disrupt genes involved in cell migration and vasculature development pathways including PLXNA2-semaphorin signaling and perhaps ciliary motility. Further studies will help to improve our understanding of the complex etiology and pathogenesis of TOF and of congenital heart disease in general.
The study was approved by institutional research ethics boards at the University Health Network and the Centre for Addiction and Mental Health.
We prospectively recruited 495 unrelated adults (≥18 years) with TOF including a subset with TOF-pulmonary atresia or pulmonary atresia and ventricular septal defect (collectively termed “TOF” in this study), without autosomal trisomies, from a single clinic (Toronto Congenital Cardiac Centre for Adults). Patients with pulmonary atresia in the setting of more complex cardiac lesions, such as single ventricle lesions or transposition complexes, were not included. We excluded 62 subjects with documented syndromes, including 53 with 22q11.2 deletion syndrome associated with 1.5 to 3.0 Mb 22q11.2 deletions and genome-wide CNV data reported elsewhere. The remaining 433 subjects formed a CNV discovery sample for this study.
TOF diagnosis was confirmed using echocardiogram and/or cardiac catheterization together with other imaging and surgical data reviewed using lifetime medical records. All subjects underwent direct clinical screening for potential syndromic features; available medical records were also reviewed. Subjects were stratified into syndromic and non-syndromic subgroups using criteria previously validated for identifying 22q11.2 deletion syndrome in adults. Individuals with at least two of three features (history of learning difficulties, global dysmorphic facial features, hypernasal voice) were placed in the syndromic subgroup. All phenotyping was done blind to genotype. Further details regarding cardiac and extracardiac phenotypes are provided elsewhere. We were underpowered to perform subgroup analyses involving individuals with specific congenital cardiac outcomes (e.g., heart failure).
Control sample for formal analyses
To optimize our analyses, we used an independent Canadian control sample from the Ontario Population Genomics Platform (OPGP) genetic epidemiological project that comprised adults of European ancestry [208 (50.0%) male; mean age 44.96 (SD 12.05) years]. To maximize quality control and minimize artefactual/laboratory-related findings, all OPGP control samples were handled and experiments performed by the same laboratory using identical array methods and protocols, including CNV analyses and rarity assignation using separate large control cohorts, as for the TOF cases (see below).
High quality genomic DNA was genotyped using the high resolution Affymetrix Genome-Wide Human SNP Array 6.0. CNV analysis and adjudication for all TOF case and OPGP control samples were performed at The Centre for Applied Genomics (Toronto, Canada). Arrays meeting Affymetrix-recommended quality control guidelines of contrast QC>0.4 were used for further analysis as outlined below and in Figure 1 in Supporting Information S1.
To accurately estimate ancestry, in addition to self-reported ethnicities, genotypes of the TOF cases from 1,120 genome-wide unlinked SNPs were clustered by the program STRUCTURE together with those from 270 HapMap samples, which were used as references of known ancestry during clustering. Ancestries were assigned with a threshold of coefficient of ancestry >0.9. Of the 433 TOF subjects, there were 340 of European, 61 of Admixed, 27 of East Asian and 5 of African ancestry.
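The thresholding step can be sketched as follows. This is a minimal illustration, assuming each subject has one coefficient of ancestry per reference population and that a subject with no coefficient above 0.9 is labeled Admixed; the function name and input layout are hypothetical and do not reflect STRUCTURE's actual output format.

```python
def assign_ancestry(coefficients, threshold=0.9):
    """Assign an ancestry label from per-population coefficients.

    coefficients: dict mapping reference population name -> coefficient
                  of ancestry (hypothetical layout, values in [0, 1]).
    Returns the population whose coefficient exceeds the threshold,
    otherwise "Admixed" (assumed handling of the remaining subjects).
    """
    population, value = max(coefficients.items(), key=lambda item: item[1])
    return population if value > threshold else "Admixed"

# Illustrative values only, not study data.
print(assign_ancestry({"European": 0.97, "East Asian": 0.02, "African": 0.01}))
print(assign_ancestry({"European": 0.60, "East Asian": 0.38, "African": 0.02}))
```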
CNV determination, adjudication, and prioritization
Genome-wide CNVs were determined using a multiple-algorithm approach to maximize sensitivity and specificity of CNV calling, as described previously. Briefly, for each subject we defined “stringent” CNV calls as those detected by at least two of three different CNV calling algorithms: Birdsuite, iPattern and Affymetrix Genotyping Console, and spanning 10 kb in length and five or more consecutive array probes. In this dataset, the mean number of calls per sample was 51, 50 and 32 for Birdsuite, iPattern and Genotyping Console, respectively. Overlapping calls at the sample level from Birdsuite and iPattern were merged with the outside probe boundaries. Singleton calls from iPattern or Birdsuite were also included in the stringent CNV set if they overlapped with a Genotyping Console call from the same sample. On average, 59% of CNVs in a sample were stringent. All subsequent analyses focused on the stringent CNVs, which in our experience have very high positive validation rates by independent methods such as quantitative PCR. Merging CNV calls on a sample level across different algorithms has the additional advantage of correcting for the tendency of individual algorithms to segment single CNV events into multiple calls.
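The merging logic for one sample can be sketched roughly as below. This is a simplified illustration, not the pipeline used in the study: the `Call` record, the requirement that merged calls share the same CNV type, and the use of base-pair rather than probe coordinates are all assumptions, and the size and probe-count filters are omitted.

```python
from typing import List, NamedTuple

class Call(NamedTuple):
    chrom: str
    start: int
    end: int
    kind: str  # "gain" or "loss"

def overlaps(a: Call, b: Call) -> bool:
    """True if two calls of the same type share at least one position."""
    return (a.chrom == b.chrom and a.kind == b.kind
            and a.start <= b.end and b.start <= a.end)

def stringent_calls(birdsuite: List[Call], ipattern: List[Call],
                    console: List[Call]) -> List[Call]:
    """Keep calls supported by at least two of the three algorithms."""
    tagged = ([(c, "birdsuite") for c in birdsuite]
              + [(c, "ipattern") for c in ipattern])
    tagged.sort(key=lambda t: (t[0].chrom, t[0].kind, t[0].start))

    # Merge overlapping Birdsuite/iPattern calls using the outer boundaries.
    clusters = []  # list of (merged call, set of supporting algorithms)
    for call, source in tagged:
        if clusters and overlaps(clusters[-1][0], call):
            merged, sources = clusters[-1]
            merged = merged._replace(end=max(merged.end, call.end))
            clusters[-1] = (merged, sources | {source})
        else:
            clusters.append((call, {source}))

    kept = []
    for merged, sources in clusters:
        # Either both algorithms agree, or a singleton call is backed
        # by an overlapping Genotyping Console call.
        if len(sources) == 2 or any(overlaps(merged, g) for g in console):
            kept.append(merged)
    return kept

# Illustrative coordinates only, not study data.
bs = [Call("1", 100, 200, "loss"), Call("2", 500, 900, "gain")]
ip = [Call("1", 150, 260, "loss"), Call("3", 10, 50, "loss")]
gc = [Call("2", 600, 700, "gain")]
print(stringent_calls(bs, ip, gc))
```

In this toy input the chromosome 1 loss is kept with merged outer boundaries, the chromosome 2 gain is kept because a Genotyping Console call supports it, and the chromosome 3 singleton is dropped.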
Each stringently defined CNV identified in the TOF case and OPGP control samples was then adjudicated for rarity by comparison to those CNVs identified in two large population-based control cohorts comprising 2,357 individuals of European ancestry from Ontario and Germany, which had already been assessed using an identical microarray platform and CNV analysis strategy (i.e., as above). We adopted a conservative definition of rare CNVs, retaining only those CNVs present in <0.1% of these 2,357 population controls. Further details of the comprehensive adjudication methods, including assessment of segmental duplications and Database of Genomic Variants (http://projects.tcag.ca/variation/) CNVs, may be found elsewhere.
CNVs>6.5 Mb in size, likely to be detectable by karyotype and/or potentially indicating artefactual results, were excluded. To ensure consistency of data, for major analyses we used only autosomal CNVs>10 kb in size in individuals of European ancestry (Figure 1 in Supporting Information S1).
Large CNVs were defined as those >500 kb in size. We prioritized smaller CNVs (<500 kb) meeting the following criteria for more detailed examination: a) very rare (i.e., not present in any control sample using a 50% reciprocal overlap criterion) and b) recurrent in unrelated TOF subjects, including those reported in the literature, and/or c) overlapping ‘interesting’ gene(s) possibly involved in TOF. When available, immediate relatives were studied using the same methods as for the proband to determine if a CNV was de novo or inherited.
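The rarity adjudication can be illustrated with a short sketch of the 50% reciprocal overlap test and the two rarity tiers (rare: <0.1% of population controls; very rare: no control carrier). It assumes inclusive coordinates, ignores CNV type for brevity, and applies the same reciprocal overlap rule to both tiers, which the text states explicitly only for the very rare tier. All names are hypothetical.

```python
from typing import List, Tuple

Interval = Tuple[str, int, int]  # (chromosome, start, end), inclusive


def reciprocal_overlap(a: Interval, b: Interval, min_fraction: float = 0.5) -> bool:
    """True if the shared segment covers at least min_fraction of BOTH intervals."""
    if a[0] != b[0]:
        return False
    shared = min(a[2], b[2]) - max(a[1], b[1]) + 1
    if shared <= 0:
        return False
    len_a = a[2] - a[1] + 1
    len_b = b[2] - b[1] + 1
    return shared >= min_fraction * len_a and shared >= min_fraction * len_b


def classify_rarity(cnv: Interval, control_cnvs: List[List[Interval]],
                    max_freq: float = 0.001) -> str:
    """Classify a CNV by the fraction of controls carrying a matching CNV.

    control_cnvs holds one list of CNV intervals per control individual.
    """
    carriers = sum(
        any(reciprocal_overlap(cnv, c) for c in person) for person in control_cnvs
    )
    if carriers == 0:
        return "very rare"
    if carriers / len(control_cnvs) < max_freq:
        return "rare"
    return "not rare"
```

With 2,357 controls, a frequency below 0.1% corresponds to at most two carriers, since 3/2,357 is already about 0.13%.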
Experimental validation of CNVs
Confirmatory studies of possible TOF-associated CNVs used Stratagene SYBR Green based quantitative-PCR (qPCR). Each qPCR assay was performed in triplicate, for both the target region and for a control region at the FOXP2 locus on chromosome 7. Where available, molecular cytogenetic or microarray results from clinical laboratories also confirmed CNVs.
Sequencing and mutation screening
For candidate gene discovery in TOF, we restricted further sequencing to a single gene, selected on the basis of our CNV results and previous animal model studies as the most likely to be involved in cardiac development. We performed mutation screening of the PLXNA2 coding sequence (spanning 5,682 nucleotides) using standard PCR-based Sanger sequencing. The PLXNA2 gene contains 31 coding exons (67 to 1,268 bp) that were fully sequenced with 32 amplicons. The program Primer 3 (http://frodo.wi.mit.edu/primer3/) was used to design primers. The amplified products were sequenced with the Big Dye Terminator kit using the ABI 3730XL capillary sequencer (Applied Biosystems) and analyzed for sequence variants using Sequencher (Gene Codes, Ann Arbor, MI, USA). Putative sequence variants of interest were confirmed by sequencing in the reverse direction. SIFT and POLYPHEN were used for in-silico prediction of the effect of missense variants on protein function.
Statistical analysis was performed using SAS software (version 9.3, SAS Institute Inc., Cary, NC, USA). The main analyses compared rare CNVs in the 340 TOF cases of European ancestry with those in the 416 OPGP controls, and included within-group comparisons of syndromic versus non-syndromic TOF subjects. Chi-square or Fisher's exact tests were used to compare categorical variables and Student's t tests for continuous variables, as appropriate. All tests were two-sided, with statistical significance defined as p<0.05.
For pathway analyses, we first assessed if pre-defined gene-sets (corresponding to biological functions and pathways) displayed a higher rare CNV load in TOF cases than in OPGP controls. Gene-sets were derived from Gene Ontology annotations (downloaded from NCBI in April 2011 and up-propagated according to ontological relations), pathway databases (KEGG, Reactome, BioCarta, NCI; March 2011) and protein domains (PFAM; March 2011). Only gene-sets with a number of member genes between 25 and 750 were tested: 2,456 total, with 1,939 from GO, 414 from pathways and 103 from PFAM domains. Gene-sets with fewer than 25 genes decrease the statistical power of the analysis, whereas those with more than 750 genes tend to have a very broad biological scope (e.g. GO “regulation of biopolymer catabolism”) and hinder the visualization of results. Subjects with rare CNVs overlapping more than 6 genes were not considered for the gene-set analysis, as these may have a more promiscuous set of gene functions perturbed by the rare variant. For exonic losses this led to the exclusion of 14 TOF cases and 11 OPGP controls.
For each gene-set, we built a contingency matrix with subjects of European ancestry as sampling units. Subjects were categorized as (a) TOF cases or OPGP controls and (b) having at least one gene-set gene harboring a rare CNV or not. On the basis of this contingency table, a one-tailed Fisher's Exact Test was used to test higher prevalence of rare CNVs in TOF probands versus OPGP controls. This test can be regarded as an extension of a single-gene or single-variant association test; however, testing association for groups of genes, unlike single genes or single variants, provides sufficient power to detect significant association even when considering only rare variants. To map CNVs to genes we used a stringent method, and restricted to CNVs overlapping exons. We tested all types of variants as well as losses-only and gains-only; only losses produced significant results (see analysis method below), in line with our previous findings for autism.
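As a minimal sketch of the gene-set test, the code below builds the 2×2 table from per-subject gene lists and computes a one-tailed Fisher's exact p-value from the hypergeometric distribution. It assumes each subject is represented by the set of genes overlapped by that subject's rare exonic CNVs, and applies the exclusion of subjects with more than 6 affected genes described earlier. The function names and toy gene labels are hypothetical.

```python
from math import comb
from typing import List, Set


def fisher_one_tailed(case_hit: int, case_total: int,
                      control_hit: int, control_total: int) -> float:
    """P(at least case_hit affected cases) under the hypergeometric null."""
    hits = case_hit + control_hit
    total = case_total + control_total
    upper = min(hits, case_total)
    numerator = sum(
        comb(hits, k) * comb(total - hits, case_total - k)
        for k in range(case_hit, upper + 1)
    )
    return numerator / comb(total, case_total)


def gene_set_pvalue(cases: List[Set[str]], controls: List[Set[str]],
                    gene_set: Set[str], max_genes: int = 6) -> float:
    """Test whether more cases than controls carry a rare CNV in the gene-set."""
    # Drop subjects whose rare CNVs overlap more than max_genes genes.
    cases = [genes for genes in cases if len(genes) <= max_genes]
    controls = [genes for genes in controls if len(genes) <= max_genes]
    case_hit = sum(1 for genes in cases if genes & gene_set)
    control_hit = sum(1 for genes in controls if genes & gene_set)
    return fisher_one_tailed(case_hit, len(cases), control_hit, len(controls))


# Toy data with placeholder gene labels (not study results).
toy_cases = [{"GENE_A"}, {"GENE_B"}, {"GENE_A", "GENE_C"}]
toy_controls = [set(), {"GENE_Z"}, set()]
p = gene_set_pvalue(toy_cases, toy_controls, {"GENE_A", "GENE_B"})
```

Exact integer arithmetic with `math.comb` avoids overflow in the intermediate binomial coefficients at the sample sizes used here.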
The Fisher's Exact Test nominal p-value was corrected for multiple tests using a case/control class permutation procedure to estimate an empirical false discovery rate. We favored a permutation strategy over classical Benjamini-Hochberg false discovery rate owing to the highly complex dependency structure among gene-sets and overly conservative nature of this test. Case and control labels were permuted 2,000 times, and for each permutation gene-sets were tested following exactly the same procedure. Real nominal p-values were ranked from lowest (most significant) to highest (least significant) and, for each real p-value, the empirical false discovery rate was computed as the average number of gene-sets with equal or smaller p-value over permutations. Therefore, the empirical false discovery rate can be interpreted as an estimate of the fraction of gene-sets that would be significant under the null hypothesis of no association at the chosen nominal significance level. We selected 27.5% as the empirical false discovery rate significance threshold for final results; we additionally required the nominal p-value to be <0.05.
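The permutation-based correction can be sketched as follows, taking as input the real nominal p-values and, for each label permutation, the p-values obtained by repeating the same gene-set tests. To express the result as a fraction, this sketch divides the average permutation count by the number of real gene-sets at or below the same p-value; that normalization, and the use of a less-than-or-equal comparison at the 27.5% threshold, are assumptions not spelled out in the text. Function names are hypothetical.

```python
from typing import List, Sequence


def empirical_fdr(real_p: Sequence[float],
                  permuted_p: Sequence[Sequence[float]]) -> List[float]:
    """Empirical FDR for each real p-value from label-permutation p-values."""
    n_perm = len(permuted_p)
    fdrs = []
    for p in real_p:
        # Gene-sets at least this significant in the real data (always >= 1).
        observed = sum(1 for q in real_p if q <= p)
        # Average number at least this significant across permutations.
        expected = sum(
            sum(1 for q in perm if q <= p) for perm in permuted_p
        ) / n_perm
        fdrs.append(min(1.0, expected / observed))
    return fdrs


def select_gene_sets(real_p: Sequence[float], fdrs: Sequence[float],
                     fdr_max: float = 0.275, p_max: float = 0.05) -> List[int]:
    """Indices of gene-sets passing both the FDR and nominal p-value filters."""
    return [i for i, (p, f) in enumerate(zip(real_p, fdrs))
            if f <= fdr_max and p < p_max]
```

Because the permutations reuse the real CNV data with shuffled labels, the correlation between overlapping gene-sets is preserved in the null distribution, which is the stated motivation for preferring this approach.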
Previously known TOF disease genes (Table 9 in Supporting Information S1) were scored for association following a similar strategy, but using functional neighbors instead of functional gene-sets. For each known disease gene, we scored TOF case and OPGP control subjects of European ancestry. The score was defined as the highest functional weight between (a) the known disease gene being tested and (b) the CNV-harboring genes in the subject being scored. The functional weight was obtained from STRING, a publicly available resource that predicts the probability of two genes participating in the same pathways based on physical interaction, pathway membership, co-expression and PubMed co-citation. For each TOF disease gene, we tested if functional neighborhood scores were higher in TOF cases compared to OPGP controls by logistic regression analysis. All exonic CNVs (gains and losses) were used; unlike the gene-set association test, restricting to losses did not improve significance. We finally selected the three top-scoring known disease genes (GATA4, NKX2-5, TBX5).
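The per-subject neighborhood score can be illustrated with a small sketch. It assumes that pairwise functional weights are available as a lookup keyed by unordered gene pairs and scaled to 0–1, and that pairs absent from the lookup (including the disease gene paired with itself) score 0; the subsequent logistic regression is not shown. The function name, placeholder genes and weights are hypothetical.

```python
from typing import Dict, FrozenSet, Iterable


def neighborhood_score(disease_gene: str, subject_genes: Iterable[str],
                       weights: Dict[FrozenSet[str], float]) -> float:
    """Highest functional weight between the disease gene and any
    CNV-harboring gene in the subject (0.0 if there is no known link)."""
    return max(
        (weights.get(frozenset((disease_gene, gene)), 0.0) for gene in subject_genes),
        default=0.0,
    )


# Hypothetical weights for illustration only (not values from STRING).
toy_weights = {
    frozenset(("GATA4", "GENE_X")): 0.92,
    frozenset(("GATA4", "GENE_Y")): 0.40,
}
score = neighborhood_score("GATA4", ["GENE_Y", "GENE_X", "GENE_Z"], toy_weights)
```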
For visualization, we integrated results from gene-set association, disease gene neighborhood analysis and systematic CNV review as a gene-set overlap network using the Cytoscape plugin Enrichment Map. Gene-sets significant after the gene-set association test were restricted to genes with higher prevalence in TOF cases than in OPGP controls, whereas functional neighborhood gene-sets included the known TOF disease gene as well as its neighbors that had high interaction confidence according to STRING (score>700, equivalent to interaction probability >70%) and harbored CNVs in 2 or more TOF case subjects but no OPGP control (Table 10 in Supporting Information S1). The combined Jaccard-overlap index was used to generate the gene-set network, setting a threshold of 0.2. Clusters of overlapping gene-sets were manually identified and colored.
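The network construction step can be sketched as below. The exact formula for the combined index is not given in the text; this sketch assumes an equal-weighted average of the Jaccard coefficient and the overlap coefficient, with an edge drawn between two gene-sets when the combined value reaches the 0.2 threshold. Function names are hypothetical.

```python
from itertools import combinations
from typing import Dict, List, Set, Tuple


def jaccard(a: Set[str], b: Set[str]) -> float:
    """Shared genes divided by the size of the union."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def overlap_coefficient(a: Set[str], b: Set[str]) -> float:
    """Shared genes divided by the size of the smaller set."""
    return len(a & b) / min(len(a), len(b)) if a and b else 0.0


def combined_index(a: Set[str], b: Set[str], k: float = 0.5) -> float:
    """Weighted mix of the two coefficients (k = 0.5 is an assumption)."""
    return k * overlap_coefficient(a, b) + (1 - k) * jaccard(a, b)


def network_edges(gene_sets: Dict[str, Set[str]],
                  threshold: float = 0.2) -> List[Tuple[str, str]]:
    """Pairs of gene-sets similar enough to be connected in the network."""
    return [
        (x, y)
        for x, y in combinations(sorted(gene_sets), 2)
        if combined_index(gene_sets[x], gene_sets[y]) >= threshold
    ]
```

Mixing the two coefficients keeps a small gene-set nested inside a large one connected (high overlap coefficient) while still penalizing pairs that share only a small part of their union.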
- Table 1: List of rare CNVs in 340 TOF and/or pulmonary atresia cases of European ancestry.
- Table 2: List of rare CNVs in 416 OPGP control individuals.
- Table 3: Summary of Affymetrix 6.0 microarray CNV data TOF sample (N = 340).
- Table 4: Rare large CNVs (>500 kb) in 43 of 433 unrelated adults with tetralogy of Fallot.
- Table 5: Very rare CNVs overlapping 26 candidate genes for tetralogy of Fallot.
- Table 6: PLXNA2 sequence variants detected in 192 unrelated TOF cases of European ancestry.
- Table 7: Gene-set association results for all gene-sets tested, rare CNVs restricted to exonic losses.
- Table 8: Additional gene-set information for the 19 gene-sets selected for final results.
- Table 9: Known TOF disease genes used for the disease gene neighborhood analysis.
- Table 10: Test results on disease gene neighborhoods for all disease genes, using the STRING network.
- Table 11: Neighbor gene details for the three top disease genes.
- Figure 1: Overview of study design and CNV analysis workflow.
- Figure 2: Rare CNVs at chromosome region 1q21.1 in TOF cases.
- Figure 3: Rare CNVs at chromosome region 18q22.3-q23 in TOF cases.
- Figure 4: Integrated TOF pathway and candidate gene connectivity.
- Supplementary References.
The authors thank the patients and their families for their participation, research assistants and staff at the Toronto Congenital Cardiac Centre for Adults (TCCCA), staff at The Centre for Applied Genomics (TCAG), and fellows and students who assisted in the collection and analysis of data. The authors thank Gladys Wong and Monica Torsan for their help preparing the manuscript.
Conceived and designed the experiments: ASB CKS CRM SWS. Performed the experiments: OM ACL JR BT. Analyzed the data: ASB GC ACL DM BL TY OM CKS. Wrote the paper: ASB CKS ACL GC CRM DM OM.