Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Rare genomic copy number variants implicate new candidate genes for bicuspid aortic valve

Abstract

Bicuspid aortic valve (BAV), the most common congenital heart defect, is a major cause of aortic valve disease requiring valve interventions and thoracic aortic aneurysms predisposing to acute aortic dissections. The spectrum of BAV ranges from early onset valve and aortic complications (EBAV) to sporadic late onset disease. Rare genomic copy number variants (CNVs) have previously been implicated in the development of BAV and thoracic aortic aneurysms. We determined the frequency and gene content of rare CNVs in EBAV probands (n = 272) using genome-wide SNP microarray analysis and three complementary CNV detection algorithms (cnvPartition, PennCNV, and QuantiSNP). Unselected control genotypes from the Database of Genotypes and Phenotypes were analyzed using identical methods. We filtered the data to select large genic CNVs that were detected by multiple algorithms. Findings were replicated in a BAV cohort with late onset sporadic disease (n = 5040). We identified 3 large and rare (< 1,1000 in controls) CNVs in EBAV probands. The burden of CNVs intersecting with genes known to cause BAV when mutated was increased in case-control analysis. CNVs intersecting with GATA4 and DSCAM were enriched in cases, recurrent in other datasets, and segregated with disease in families. In total, we identified potentially pathogenic CNVs in 9% of EBAV cases, implicating alterations of candidate genes at these loci in the pathogenesis of BAV.

Introduction

Copy number variants (CNVs) have been implicated as causes or modifiers of many human diseases [1]. Specifically, large genomic CNVs are significantly enriched in cohorts with developmental delay or congenital abnormalities, and the severity of phenotypes has been correlated with the burden of rare CNVs [2]. These observations show that large, rare, de novo CNVs are likely to be pathogenic and can exert clinically relevant effects on disease pathogenesis [3, 4].

The worldwide prevalence of congenital heart disease (CHD) is 8.2 per 1000 live births [5]. CNVs have been implicated in both syndromic and non-syndromic forms of CHD [610]. The pathogenicity and penetrance of CNVs was initially established for clinical syndromes such as velocardiofacial syndrome, Turner syndrome, or Williams–Beuren syndrome, which involve chromosomal or megabase scale duplications or deletions, but has since been expanded to include additional CHD subtypes [10]. CNVs contribute to 10% of all CHD cases and up to 25% of cases with extracardiac anomalies or other syndromic features [11]. The role of pathogenic CNVs affecting genes that are known to cause CHD when mutated, such as ELN and TBX1, has been established [12]. Furthermore, population-level analysis has consistently demonstrated an increased burden of CHD in carriers of CNVs at specific genomic hotspots compared to controls, displaying the pathogenic potential of rare or de novo CNVs [1214].

Bicuspid aortic valve (BAV) is the most common congenital heart malformation with a population prevalence of 0.5–2% [15]. BAV predisposes to aortic valve stenosis and thoracic aortic aneurysms and is associated with other left ventricular outflow tract lesions such as mitral valve disease and coarctation [16]. The high heritability of BAV was demonstrated in first- and second-degree relatives, who are more than ten times more likely to be diagnosed with BAV compared to matched controls [17]. BAV can occur as an isolated congenital lesion or as part of a clinical syndrome. For example, the prevalence of BAV is increased in Velocardiofacial, Loeys-Dietz, Kabuki, and Turner syndromes. Pathogenic variants of several genes are implicated in familial non-syndromic BAV, which is typically inherited as an autosomal dominant trait with reduced penetrance and variable expressivity. There is strong cumulative evidence that pathogenic variants in GATA4, GATA6, MIB1, NOTCH1, ROBO4, SMAD4, and SMAD6 each contribute to a small percentage of non-syndromic BAV cases [18, 19]. Phenotypic expression of BAV disease ranges from incidental discovery in late adulthood to neonatal or childhood onset with severe manifestations requiring valve or aortic interventions. In comparison to patients with later disease onset, younger BAV cohorts tend to present with syndromic features or complex congenital malformations that are more likely to have a genetic cause, thereby increasing the power of association studies to discover clinically relevant CNVs [20]. Recently, we identified recurrent rare CNVs that were enriched for cardiac developmental genes in a young cohort with early-onset thoracic aortic aneurysms or acute aortic dissections [21].

We hypothesize that large rare genomic CNVs contribute to early onset complications of BAV. Consistent with previous observations, we predict that the burden and penetrance of rare CNVs will be increased in individuals with early onset disease when compared to elderly sporadic BAV cases and population controls. Identification of novel pathogenic CNVs can provide new insights into the genetic complexity of BAV and may be useful for personalized risk stratification or clinical guidance based on the specific recurrent CNV (Fig 1) [22]. Therefore, we set out to describe the burden and penetrance of rare CNVs in a young cohort with early onset complications of BAV disease (EBAV).

thumbnail
Fig 1. Specific rare genomic copy number variants may influence BAV disease severity.

Aortic events, thoracic aortic aneurysm, thoracic aortic dissection, or aortic valve stenosis or regurgitation requiring aortic valve repair or replacement.

https://doi.org/10.1371/journal.pone.0304514.g001

Materials and methods

The study protocol was approved by the Committee for the Protection of Human Subjects at the University of Texas Health Science Center at Houston (HSC-MS-11-0185). Study recruitment began on July 1, 2017, and concluded on March 30, 2022. After written informed consent, we enrolled probands with early onset BAV disease (EBAV), which we defined as individuals with BAV who were under the age of 30 at the time of first clinical event. Clinical events were defined as aortic replacement, aortic valve surgery, aortic dissection, moderate or severe aortic stenosis or aortic regurgitation, large aneurysm (Z > 4.5), or intervention for BAV-related conditions. Those with hypoplastic left heart, known genetic mutations, genetic syndromes, or complex congenital heart disease were excluded. Samples were collected and genotyped as previously reported [23]. For comparison, we analyzed a cohort of older individuals of European ancestry with sporadic BAV disease selected from the International BAV Consortium (BAVGWAS) [24].

Phenotypes were derived from record review with confirmation of image data whenever possible [25, 26]. The computational pipeline for CNV analysis of Illumina single nucleotide polymorphism (SNP) array data included three independent CNV detection algorithms (Fig 2, S1 Appendix).

thumbnail
Fig 2. Overview of pipeline for CNV identification and validation.

SNP, single nucleotide polymorphism; QC, Quality control; CNV, copy number variant. Illumina B-allele frequency and signal intensity data was trimmed and exported using GenomeStudio. Three different algorithms (PennCNV [27], cnvPartition, and QuantiSNP [28]) were used to generate initial CNV calls and sample-level statistics. Sample-level quality control analysis was performed using PennCNV. PLINK [29] was used to define CNV regions for subsequent burden, enrichment, and replication tests. Raw CNV calls were individually screened for CNVs intersecting with genes implicated in BAV and enriched in case-control tests. CNVs were validated by examining the raw data in GenomeStudio.

https://doi.org/10.1371/journal.pone.0304514.g002

GenomeStudio was used to exclude samples with indeterminate sex or more than 5% missing genotypes, and single nucleotide polymorphisms (SNPs) with GenTrain = 0. Principal component analysis was used to remove outliers that did not cluster with European ancestry. Prior to CNV analysis, each dataset was trimmed by selecting a common set of 650,000 SNPs that were genotyped on each of the microarrays used in this study.

Three independent algorithms (PennCNV, cnvPartition, and QuantiSNP) were used to generate CNV calls and sample-level quality statistics from SNP intensity data. PennCNV and QuantiSNP were run on Unix clusters and cnvPartition data were exported from GenomeStudio. The analysis was run using default configurations.

PennCNV was used to generate QC data and remove CNV calls that intersect with polymorphic genomic regions. Samples that met any of the following criteria were excluded, standard deviation of the LogR ratio (obtained from PennCNV) > 0.35 or number of CNVs > 2 standard deviations above the mean for each data set. CNV calls less than 20 Kilobases in length and/or spanned by fewer than 6 probes were excluded. The overlap function for rare CNVs in PLINK was used to construct CNV regions (CNVRs) after adjacent regions were merged using PennCNV.

LogR ratio (LRR) and B allele frequency (BAF) data at CNVRs and calls of interest were visualized in GenomeStudio for validation. For segregation analysis, GenomeStudio was used to determine the presence of CNVs in relatives.

A total of 22,014 unselected control Illumina Genotypes obtained from the Database of Genotypes and Phenotypes were analyzed using identical methods (S1 Table). The Wisconsin Longitudinal Study (WLS) includes data on a cohort of 10,300 individuals who graduated from Wisconsin high schools in 1957. The Health and Retirement Study (HRS) includes data on 37,000 individuals aged 50 above from 23,000 households across the United States. Principal component analysis was used to select European ancestry genotypes from these datasets for analysis. Datasets were paired for case-control analysis based on the concordance of log-transformed sample-level quality control statistics (number of CNV calls and standard deviation of logR ratios). Chi-squared or Fisher exact tests were used to compare CNV frequencies in cases and controls.

Rare CNV functions in PLINK (v1.7) were used to perform permutation-based burden tests or gene set-based enrichment tests as previously described [29, 30]. Case control burden tests were restricted to CNVs that were longer than 110 Kb and less than 0.1% in frequency. CNV overlap functions in PLINK were used to identify rare CNVs that intersect between datasets or involve specific BAV or CHD genes (S2 Table). The list of candidate genes included 190 CHD genes that have strong cumulative evidence to cause BAV or related congenital malformations from human or animal model data [3133]. Genome Reference Consortium Human Build 37 was used for CNV annotation [34].

Results

The EBAV cohort included a total of 544 samples, 272 EBAV probands, 21 relatives with BAV, and 251 apparently unaffected family members (26 trios and 15 multiplex families). The BAVGWAS sample contained 5,040 genotypes with associated demographic and clinical data. After exclusions due to data quality control or missing phenotypic data, 499 EBAV genotypes and 4,216 BAVGWAS were included in the final analysis. In phenotypic comparisons, EBAV probands were significantly younger at diagnosis, had more frequent co-existing congenital heart and vascular lesions, and underwent more frequent valve or aortic interventions (Table 1).

In comparisons between EBAV and WLS data, the rate of large CNVs was increased in EBAV cases compared to WLS controls, driven primarily by enrichment of large rare genomic deletions (P<0.001, Table 2, S3 Table).

In BAVGWAS cases, there were no significant differences in CNV rates or the proportions of individuals with large rare CNVs in comparison to HRS controls (Table 3).

We discovered 32 large (>250 kb) and rare (<1:1000) CNVs in EBAV probands, and 26 of these CNVs included protein-coding genes (S4 Table). The overall burden of large rare genic CNVs was not different between EBAV cases and WLS controls. However, the burden of large rare genic CNVs intersecting with genes known to cause BAV when mutated or implicated in syndromic BAV was significantly increased in EBAV cases (Table 4). EBAV probands with large rare CNVs were more likely to have concomitant congenital heart lesions and have other family members who were diagnosed with BAV (S5 Table).

In region-based association tests, 55 genes spanned by 26 CNVs were enriched in the EBAV cohort compared to WLS controls (S6 and S7 Tables). The largest CNVs involved DSCAM in 21q22 and GATA4 in 8p23. Large duplications involving the Velocardiofacial (VCFS) region in 22q11.2 and a recurrent CHD-associated CNV region in 1q21.1 were also enriched in EBAV probands. In BAVGWAS cases, common CNVs involving NANOG in 12p13.31 and rare CNVs involving NIBPL in 5p13.2, which are essential for early heart development, are enriched in comparison to HRS controls (S8 and S9 Tables). NIBPL mutations cause Cornelia-de Lange syndrome with a spectrum of congenital heart malformations including BAV.

We also scrutinized candidate genomic regions that are implicated in CHD by analyzing data from individual CNV algorithms to detect copy number alterations that may have been filtered out due to strict quality control criteria prior to enrichment studies. We identified additional rare EBAV CNVs that intersect with CHD candidate genes CELSR1, GJA5, RAF1, LTBP1, KIF1A, MYH11, TTN, and the VCFS region in 22q11.2. We detected additional GATA4 and DSCAM CNVs in multiplex families. These CNVs were enriched in EBAV cases compared to WLS controls (Table 5, S10 Table). In total, 9% of EBAV probands had CNVs that are likely to contribute to the development of BAV (S11 Table).

thumbnail
Table 5. CNVs affecting congenital heart disease genes in EBAV.

https://doi.org/10.1371/journal.pone.0304514.t005

We similarly scrutinized BAVGWAS data for additional CNVs that intersect with CHD genes. We found that large duplications involving SOX7 and GATA4 in 8p23 and the VCFS region in 22q11.2 were also significantly enriched in BAVGWAS cases compared to HRS controls (Table 6, S12 Table).

thumbnail
Table 6. CNVs affecting congenital heart disease genes in BAVGWAS.

https://doi.org/10.1371/journal.pone.0304514.t006

Next, we attempted to replicate our observations by identifying CNVs in the BAVGWAS dataset that overlapped with EBAV CNVs. Large rare CNVs intersecting with GATA4, DSCAM, CELSR1, GJA5, MYH11, KIF1A, and TBX1 overlapped between the EBAV and BAVGWAS datasets (S13 Table). However, only CNVs intersecting with GATA4 and DSCAM were significantly enriched in both datasets (Fig 3).

thumbnail
Fig 3. UCSC genome browser plots of GATA4 and DSCAM variants.

Each bar represents a copy number variant (CNV). Blue, EBAV CNVs; Red: CNVs BAVGWAS CNVs. (a) Ideogram of Chromosome 8 with CNV region highlighted red; (b) Plot of GATA4 CNVs; (c) Ideogram of Chromosome 21 with CNV region highlighted red; (d) Plot of DSCAM CNVs. Figures were constructed using the UCSC Genome Browser (http://genome.ucsc.edu) [35].

https://doi.org/10.1371/journal.pone.0304514.g003

We also identified 21 very large genomic CNVs > 5 Mb in length in the BAVGWAS dataset. Analysis of GenomeStudio data showed that most of these were mosaic loss of heterozygosity regions or duplications. Nine were large germline chromosome-scale aberrations, including two cases of trisomy 21 (S14 Table). We did not identify any large X chromosome copy variants that may be consistent with Turner syndrome. There were no megabase-scale copy number variants in the EBAV dataset.

Pedigree analysis showed that CNVs involving CELSR1, LTBP1, KIF1A, GATA4, and DSCAM segregate with BAV in EBAV families (S1 Fig). Most of these CNVs occurred de novo in probands and were not found in unaffected family members. CNV carriers tended to present due to moderate or severe aortic regurgitation requiring valvular surgery. One proband had aortic coarctation. The age at presentation or sex of individuals with rare CNVs was not significantly different from the rest of the EBAV cohort.

Discussion

We identified large, rare, and likely pathogenic CNVs in almost 10% of EBAV probands that are enriched in genes that cause BAV when mutated. The percentage of EBAV cases with likely pathogenic CNVs is similar to our previous observations in a cohort with early onset TAD [36]. Enrichment of CNVs involving GATA4 and DSCAM in EBAV cases replicated in two additional BAV datasets and thousands of unselected control genotypes. This analysis provides compelling evidence that rare CNVs collectively contribute to more BAV cases than any single mutated gene.

GATA-binding protein 4 (GATA4) is a transcription factor that is required for cardiac and neuronal differentiation during embryogenesis [37]. Mutations of GATA4 and its homologs GATA5 and GATA6 cause congenital heart lesions [38]. Mutations in the GATA4 gene have been linked to a range of congenital heart diseases in humans, such as cardiac septal defects, tetralogy of Fallot, and patent ductus arteriosus [39]. Patients with BAV who have rare functional variants in the GATA family exhibit varying degrees of aortopathy expression, including aortic aneurysm, dissection, and/or aortic stenosis. Alonso‐Montes et al. described 4 predicted deleterious GATA4 mutations in 122 non-syndromic BAV probands who did not have affected relatives [40]. Rare GATA4 deletions and putative loss of function mutations are also implicated in CHD with distinctive features, underlining the importance of GATA4 dosage to cardiac development [41, 42]. Glessner et al. discovered large de novo duplications involving GATA4 in CHD trios with conotruncal defects or left ventricular outflow tract obstructive lesions [43]. Some duplications were inherited from apparently unaffected parents. Zogopoulos and Yu described similar genomic duplications in unaffected individuals and in unselected control genotypes [44, 45].

These observations are consistent with low-penetrance CHD in GATA4 duplication carriers. Similar to other complex and multifactorial disorders, CHD pathogenesis is likely caused by the cumulative impact of multiple CNVs or mutations, each exerting small to moderate effects to collectively disrupt cardiac development. For example, the frequency of congenital heart lesions is increased in individuals who have both 22q11.2 deletions and a common 12p13.31 duplication involving the SLC2A3 gene. The SLC2A3 CNV likely functions as a modifier of the cardiac phenotype associated with 22q11 deletion syndrome, exemplifying a “two-hit” model [46].

More than half of patients with Down syndrome have congenital heart malformations due to the interaction of multiple dosage-sensitive CHD genes on chromosome 21 [4749]. Down syndrome cell adhesion molecule (DSCAM), previously shown to play a critical role in neurogenesis, has also been implicated in the pathophysiology of CHD [50]. Analysis of rare segmental trisomies of chromosome 21 suggested that duplication of DSCAM and the contiguous COL6A1 and COL6A2 genes may cause septal abnormalities and other Down Syndrome-related CHD lesions, including BAV. Overexpression of DSCAM and COL6A2 causes cardiac malformations in mice [51]. Our findings suggest that rare CNVs involving DSCAM may contribute to some non-syndromic BAV cases.

GATA4 and DSCAM CNVs segregated with disease in multiple families but are not fully penetrant and were detected in some unaffected relatives. Intriguingly, large 22q11.2, GATA4 and DSCAM CNVs were more highly enriched in EBAV than in BAVGWAS cases, suggesting that these CNVs may drive early onset BAV disease. These results are consistent with our observation that pathogenic CNVs involving candidate BAV genes are also enriched in EBAV compared to BAVGWAS cases. Our data suggests that pathogenic CNVs at these loci may predict accelerated disease onset or more severe complications.

We also identified recurrent rare CNVs involving specific dosage-sensitive cardiac developmental genes that are implicated in non-syndromic CHD. Recurrent 1q21.1 distal deletions encompassing GJA5, the gene encoding Connexin-40, are associated with CHD lesions including BAV. Enrichment of small genomic duplications spanning the GJA5 gene in cohorts with tetralogy of Fallot and cardiac abnormalities in mice with a targeted GJA5 deletion imply that dosage variations of GJA5 contribute to CHD [52]. CELSR1, a cadherin superfamily member, is mutated in families with BAV and hypoplastic left heart syndrome [53]. LTBP1 encodes an extracellular matrix protein that regulates TGF-β and fibrillin and has been implicated in congenital heart lesions [54]. KIF1A, encoding a kinesin microtubule transporter, was implicated in a dominant multisystem syndromic disorder with valvular and cardiac defects [55]. Mutation of MYH11 causes familial thoracic aortic aneurysms and dissections with an increased prevalence of BAV [56]. TTN mutations cause dilated cardiomyopathy and are associated with other left-sided congenital lesions [57]. Mutations or copy number changes involving these genes all cause a wide spectrum of penetrance and phenotypic severity, consistent with sensitivity to genetic or clinical modifiers.

Our combinatorial analysis method eliminated many CNVs that were detected by single algorithms or did not meet quality control benchmarks. Therefore, our analysis likely underestimated the contribution of rare pathogenic CNVs to BAV. We also recognize that cardiac development involves the complex interaction of many genes. We selectively validated individual CNVs at loci of interest but may have underrepresented CNVs that had no a priori relationship with CHD. The apparent penetrance of some CNVs may be less than expected due to missing phenotypic information. The available clinical data was not sufficiently detailed to permit genotype-phenotype correlations with specific CHD clinical features.

In conclusion, we identified large rare CNVs in a significant proportion of BAV cases, including a subset of CNVs that may predict early onset complications of BAV disease. These observations add to the evidence that rare CNVs may eventually have clinical utility for risk stratification and personalized disease management (Fig 1).

Supporting information

S1 Appendix. Computational pipeline for CNV analysis.

https://doi.org/10.1371/journal.pone.0304514.s001

(DOCX)

S1 Table. Summary of control cohorts.

WLS, Wisconsin Longitudinal Study on Aging; HRS, Health and Retirement Study; Accession, accession number in the Database of Genotypes Phenotypes. WLS includes data on a cohort of 10,300 individuals who graduated from Wisconsin high schools in 1957. HRS includes data on 37,000 individuals aged 50 above from 23,000 households across the United States.

https://doi.org/10.1371/journal.pone.0304514.s002

(DOCX)

S2 Table. List of genes implicated in BAV or CHD.

Used for intersection studies.

https://doi.org/10.1371/journal.pone.0304514.s003

(DOCX)

S3 Table. Summary of CNV data.

EBAV, early onset bicuspid aortic valve cohort; BAVGWAS, Genome-wide Association Study from the International BAV Consortium; WLS, Wisconsin Longitudinal Study on Aging; HRS, Health Retirement Study; PennCNV, number of CNV calls detected by PennCNV algorithm after quality control; cnvPartition, number of CNV calls detected by cnvPartition algorithm after quality control; QuantiSNP, number of CNVs detected by QuantiSNP algorithm after quality control; Merged, number of CNV regions after merging adjacent calls; >5 MB, number of CNV regions that are larger than 5 megabases; Rare, number of CNVs that occur in less than 1 in 1000 samples of the combined datasets; Rare Deletions, number of large (> 250 Kb) rare deletions.

https://doi.org/10.1371/journal.pone.0304514.s004

(DOCX)

S4 Table. Large rare copy number variants in the EBAV probands.

Gene(s), genes intersected by CNV; Chr, chromosome; Start BP, start base pair of CNV; Stop BP, stop base pair of CNV; DUP, duplication; DEL, deletion. All CNVs were validated by direct inspection in GenomeStudio.

https://doi.org/10.1371/journal.pone.0304514.s005

(DOCX)

S5 Table. Phenotype comparison of EBAV samples with without large, rare CNVs.

CNV, samples with large rare CNVs; n, number of samples; TAA, thoracic aortic aneurysm; AR, aortic regurgitation; AS, aortic stenosis; Other lesions, other congenital heart malformations; >1 Affected, number of families with more than one affected individual. Percentages are in parentheses. *Significantly increased.

https://doi.org/10.1371/journal.pone.0304514.s006

(DOCX)

S6 Table. Top hits in region-based association tests of EBAV CNVs compared to WLS controls.

Chr, Chromosome; EMP1, permutation-based empiric P-value; EMP2, after genome-wide correction. *Top candidate genes.

https://doi.org/10.1371/journal.pone.0304514.s007

(DOCX)

S7 Table. Rare CNVs enriched in EBAV cohort.

Gene(s), genes intersected by CNV; Chr, chromosome; Start, start base pair of CNV; Stop, stop base pair of CNV; DUP, duplication; DEL, deletion. *Call in apparently unaffected family member. **Call in affected family member belonging to multiplex family.

https://doi.org/10.1371/journal.pone.0304514.s008

(DOCX)

S8 Table. Top hits in region-based association tests of BAVGWAS CNVs compared to HRS controls.

Chr, Chromosome; EMP1, permutation-based empiric P-value; EMP2, after genome-wide correction. *Top candidate genes.

https://doi.org/10.1371/journal.pone.0304514.s009

(DOCX)

S9 Table. Rare CNVs enriched in BAVGWAS cohort.

Gene(s), genes intersected by CNV; Chr, chromosome; Start, start base pair of CNV; Stop, stop base pair of CNV; DUP, duplication; DEL, deletion.

https://doi.org/10.1371/journal.pone.0304514.s010

(DOCX)

S10 Table. EBAV CNVs intersecting with congenital heart disease genes.

Chr, chromosome; Start, start base pair of CNV; Stop, stop base pair of CNV; DUP, duplication; DEL, deletion. *Call in unaffected family member, **Call in affected family member from a multiplex family.

https://doi.org/10.1371/journal.pone.0304514.s011

(DOCX)

S11 Table. Phenotype information of EBAV probands with candidate rare CNVs.

Gene, principal gene/region intersected by CNV; TAA, thoracic aortic aneurysm, TAD, thoracic aortic dissection; AS, aortic stenosis; AR, aortic regurgitation; AVR, aortic valve replacement; Aortic Repair, open or endovascular aortic procedure.

https://doi.org/10.1371/journal.pone.0304514.s012

(DOCX)

S12 Table. BAVGWAS CNVs intersecting with congenital heart disease genes.

Chr, chromosome; Start, start base pair of CNV; Stop, stop base pair of CNV; DUP, duplication; DEL, deletion.

https://doi.org/10.1371/journal.pone.0304514.s013

(DOCX)

S13 Table. CNV overlaps between EBAV and BAVGWAS cohorts.

Chr, chromosome; Start, start base pair of CNV; Stop, stop base pair of CNV; DUP, duplication; DEL, deletion. Some GATA4 CNVs were not identified in the overlap analysis because they exceeded the size threshold (5 Mb).

https://doi.org/10.1371/journal.pone.0304514.s014

(DOCX)

S14 Table. Large genomic events in the BAVGWAS dataset.

Chr, chromosome; Start, start base pair of CNV; Stop, stop base pair of CNV; DUP, duplication; DEL, deletion; LOH, loss of heterozygosity.

https://doi.org/10.1371/journal.pone.0304514.s015

(DOCX)

S1 Fig. Segregation of candidate BAV CNVs.

(A) GATA4 CNV; (B) DSCAM CNV; (C) CELSR1 CNV; (D, E) KIF1A CNVs; (F) LTBP1 CNVs. Dot, apparently unaffected CNV carrier; Shaded, affected CNV carrier; U, no genotype was available.

https://doi.org/10.1371/journal.pone.0304514.s016

(TIFF)

Acknowledgments

We thank Joana Castillo and Jacqueline Jennings for sample preparation, William J. Allen for computational support, and Gladys Zapata, Nitesh Mehta, and the Laboratory for Translational Genomics at Baylor College of Medicine for microarray genotyping. Figs 1 and 2 were created using BioRender.com. S1 Fig was created using CeGaT Pedigree Chart Designer. The Texas Advanced Computing Center (TACC) at The University of Texas at Austin (http://www.tacc.utexas.edu) provided high-performance computing resources for data analysis.

The EBAV Investigators are: Siddharth K. Prakash, Dianna M. Milewicz, Shaine A. Morris, Rita Milewski, Giuseppe Limongelli, Allesandro Della Corte, Laura Perrone, Yuli Y. Kim, Hector I. Michelena, Maria G. Andreassi, Arturo Evangelista, Denver Sallee, Angela Yetman, Kim McBride, Eduardo Bossone, Rodolfo Citro, Dawn S. Hui, Malenka M. Bissell, Andrea Ballotti, Ilenia Foffa, Margot De Marco, Anthony Caffarelli, Rita Weise, Julie DeBacker, Laura Muino Mosquera, Robbin Cohen, Laura Dos Subira, Justin T. Tretter, Anna Sabe Rotes, Martina Caiazza, Lamia Ait Ali, Francesca Pluchinotta, Simon C. Body.

Lead author: Siddharth K. Prakash

Siddharth.K.Prakash@uth.tmc.edu

The BAVCon Investigators are: Simon C. Body, Alessandro Della Corte, Rodolfo Citro, Yohan Bossé, Alessandro Frigiola, Andrea Ballotta, Arturo Evangelista, Evaldas Girdsaukas, Betti Giusti, Bo Yang, Carlo de Vincentiis, Dan Gilon, Thoralf M. Sundt, David Messika Zeitoun, Dianna M. Milewicz, Siddharth K. Prakash, Eduardo Bossone, Eric Eisselbacher, Vicenza Stefano Nistri, Francesca R. Pluchinotta, Giuseppe Limongelli, Gordon S. Huggins, Joshua C. Denny, Patrick M. McCarthy, S. Chris Malaisrie, Aakash Bavishi, Hector I. Michelena, J. Daniel Muehlschelgel, Kim Eagle, Lasse Folkersen, Malenka M. Bissell, Patrick Mathieu, Per Eriksson, Peter Lichtner, Ronen Durst, Sébastien Thériault.

Lead author: Simon C. Body

scbody@bu.edu

References

  1. 1. Zhang F, Gu W, Hurles ME, Lupski JR. Copy number variation in human health, disease, and evolution. Annual review of genomics and human genetics. 2009;10:451. pmid:19715442
  2. 2. Cooper GM, Coe BP, Girirajan S, Rosenfeld JA, Vu TH, Baker C, et al. A copy number variation morbidity map of developmental delay. Nature genetics. 2011 Sep;43(9):838–46. pmid:21841781
  3. 3. Girirajan S, Rosenfeld JA, Coe BP, Parikh S, Friedman N, Goldstein A, et al. Phenotypic heterogeneity of genomic disorders and rare copy-number variants. New England Journal of Medicine. 2012 Oct 4;367(14):1321–31. pmid:22970919
  4. 4. Kaufman L, Ayub M, Vincent JB. The genetic basis of non-syndromic intellectual disability: a review. Journal of neurodevelopmental disorders. 2010 Dec;2(4):182–209. pmid:21124998
  5. 5. Liu Y., Chen S., Zühlke L., Black G., Choy M. K., Li N., et al. (2019). Global birth prevalence of congenital heart defects 1970–2017: Updated systematic review and meta-analysis of 260 studies. International Journal of Epidemiology, 48(42), 455–463. pmid:30783674
  6. 6. Hitz MP, Lemieux-Perreault LP, Marshall C, Feroz-Zada Y, Davies R, Yang SW, et al. Rare copy number variants contribute to congenital left-sided heart disease.
  7. 7. Warburton D, Ronemus M, Kline J, Jobanputra V, Williams I, Anyane-Yeboa K, et al. The contribution of de novo and rare inherited copy number changes to congenital heart disease in an unselected sample of children with conotruncal defects or hypoplastic left heart disease. Human genetics. 2014 Jan;133:11–27. pmid:23979609
  8. 8. Silversides CK, Lionel AC, Costain G, Merico D, Migita O, Liu B, et al. Rare copy number variations in adults with tetralogy of Fallot implicate novel risk gene pathways.
  9. 9. Ware SM, Jefferies JL. New genetic insights into congenital heart disease. Journal of clinical & experimental cardiology. 2012 Jun 6. pmid:22822471
  10. 10. Sørensen KM, El‐Segaier M, Fernlund E, Errami A, Bouvagnet P, Nehme N, et al. Screening of congenital heart disease patients using multiplex ligation‐dependent probe amplification: Early diagnosis of syndromic patients. American journal of medical genetics Part A. 2012 Apr;158(4):720–5. pmid:22383218
  11. 11. Lander J, Ware SM. Copy number variation in congenital heart defects. Current Genetic Medicine Reports. 2014 Sep;2:168–78.
  12. 12. Tomita-Mitchell A, Mahnke DK, Struble CA, Tuffnell ME, Stamm KD, Hidestrand M, et al. Human gene copy number spectra analysis in congenital heart malformations. Physiological genomics. 2012 May 1;44(9):518–41. pmid:22318994
  13. 13. Kim DS, Kim JH, Burt AA, Crosslin DR, Burnham N, Kim CE, et al. Burden of potentially pathologic copy number variants is higher in children with isolated congenital heart disease and significantly impairs covariate-adjusted transplant-free survival. The Journal of thoracic and cardiovascular surgery. 2016 Apr 1;151(4):1147–51. pmid:26704054
  14. 14. Soemedi R, Wilson IJ, Bentham J, Darlay R, Töpf A, Zelenika D, et al. Contribution of global rare copy-number variants to the risk of sporadic congenital heart disease. The American Journal of Human Genetics. 2012 Sep 7;91(3):489–501. pmid:22939634
  15. 15. Chandra S, Lang RM, Nicolarsen J, Gayat E, Spencer KT, Mor-Avi V, et al. Bicuspid aortic valve: inter-racial difference in frequency and aortic dimensions. JACC: Cardiovascular Imaging. 2012 Oct;5(10):981–9. pmid:23058064
  16. 16. Michelena HI, Prakash SK, Della Corte A, Bissell MM, Anavekar N, Mathieu P, et al. Bicuspid aortic valve: identifying knowledge gaps and rising to the challenge from the International Bicuspid Aortic Valve Consortium (BAVCon). Circulation. 2014 Jun 24;129(25):2691–704. pmid:24958752
  17. 17. Glotzbach JP, Hanson HA, Tonna JE, Horns JJ, McCarty Allen C, Presson AP, et al. Familial Associations of Prevalence and Cause-Specific Mortality for Thoracic Aortic Disease and Bicuspid Aortic Valve in a Large-Population Database. Circulation. 2023 Jun 15.
  18. 18. Musfee FI, Guo D, Pinard AC, Hostetler EM, Blue EE, Nickerson DA, University of Washington Center for Mendelian Genomics (UW‐CMG), Bamshad MJ, Milewicz DM, Prakash SK. Rare deleterious variants of NOTCH1, GATA4, SMAD6, and ROBO4 are enriched in BAV with early onset complications but not in BAV with heritable thoracic aortic disease. Molecular Genetics & Genomic Medicine. 2020 Oct;8(10):e1406.
  19. 19. Tessler I, Albuisson J, Piñeiro-Sabarís R, Verstraeten A, Kaya HE, Siguero-Álvarez M, et al. Novel association of the NOTCH pathway regulator MIB1 gene with the development of bicuspid aortic valve. JAMA cardiology. 2023 Aug 1;8(8):721–31. pmid:37405741
  20. 20. Prakash SK, Yetman A, Bissell MM, Kim YY, Michelena H, Hui DS, et al. Recurrent genomic copy number variants implicate new candidate genes for early onset bicuspid aortic valve disease. Journal of the American College of Cardiology. 2019 Mar 12;73(9S1):620-.
  21. 21. Prakash S, Kuang SQ, GenTAC Registry Investigators, Regalado E, Guo D, Milewicz D. Recurrent rare genomic copy number variants and bicuspid aortic valve are enriched in early onset thoracic aortic aneurysms and dissections. PloS one. 2016 Apr 19;11(4):e0153543.
  22. 22. Balistreri CR, Cavarretta E, Sciarretta S, Frati G. Light on the molecular and cellular mechanisms of bicuspid aortic valve to unveil phenotypic heterogeneity. Journal of Molecular and Cellular Cardiology. 2019;133: 113–114. pmid:31199951
  23. 23. Prakash S.K., LeMaire S.A., Guo D.C., Russell L., Regalado E.S., Golabbakhsh H., et al., 2010. Rare copy number variants disrupt genes regulating vascular smooth muscle cell adhesion and contractility in sporadic thoracic aortic aneurysms and dissections. The American Journal of Human Genetics, 87(6), pp.743–756. pmid:21092924
  24. 24. Prakash SK, Bossé Y, Muehlschlegel JD, Michelena HI, Limongelli G, Della Corte A, et al. A roadmap to investigate the genetic basis of bicuspid aortic valve and its complications: insights from the International BAVCon (Bicuspid Aortic Valve Consortium). Journal of the American College of Cardiology. 2014 Aug 26;64(8):832–9. pmid:25145529
  25. 25. Harris PA, Taylor R, Thielke R, Payne J, Gonzalez N, Conde JG. Research electronic data capture (REDCap)–A metadata-driven methodology and workflow process for providing translational research informatics support. J Biomed Inform. 2009 Apr;42(2):377–81. pmid:18929686
  26. 26. Harris PA, Taylor R, Minor BL, Elliott V, Fernandez M, O’Neal L, et al. Building an international community of software partners. J Biomed Inform. 2019 May 9 pmid:31078660
  27. 27. Wang K, Li M, Hadley D, Liu R, Glessner J, Grant SF, et al. PennCNV: an integrated hidden Markov model designed for high-resolution copy number variation detection in whole-genome SNP genotyping data. Genome research. 2007 Nov 1;17(11):1665–74. pmid:17921354
  28. 28. Colella S, Yau C, Taylor JM, Mirza G, Butler H, Clouston P, et al. QuantiSNP: an Objective Bayes Hidden-Markov Model to detect and accurately map copy number variation using SNP genotyping data. Nucleic acids research. 2007 Mar 1;35(6):2013–25. pmid:17341461
  29. 29. Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, Bender D, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. The American journal of human genetics. 2007 Sep 1;81(3):559–75. pmid:17701901
  30. 30. Raychaudhuri S, Korn JM, McCarroll SA, International Schizophrenia Consortium, Altshuler D, Sklar P, et al. Accurately assessing the risk of schizophrenia conferred by rare copy-number variation affecting genes with brain function. PLoS genetics. 2010 Sep 9;6(9):e1001097. pmid:20838587
  31. 31. Zhu W., and Lo C.W. (2023). Insights into the genetic architecture of congenital heart disease from animal modeling. Zool. Res. 44, 577–590. pmid:37147909
  32. 32. Xie Y, Jiang W, Dong W, Li H, Jin SC, Brueckner M, et al. Network assisted analysis of de novo variants using protein-protein interaction information identified 46 candidate genes for congenital heart disease. PLoS genetics. 2022 Jun 7;18(6):e1010252. pmid:35671298
  33. 33. Sevim Bayrak C, Zhang P, Tristani-Firouzi M, Gelb BD, Itan Y. De novo variants in exomes of congenital heart disease patients identify risk genes and pathways. Genome medicine. 2020 Dec;12:1–8. pmid:31941532
  34. 34. Church DM, Schneider VA, Graves T, Auger K, Cunningham F, Bouk N, et al. Modernizing reference genome assemblies. PLoS biology. 2011 Jul 5;9(7):e1001091. pmid:21750661
  35. 35. Kent WJ, Sugnet CW, Furey TS, Roskin KM, Pringle TH, Zahler AM, et al. The human genome browser at UCSC. Genome research. 2002 Jun 1;12(6):996–1006. pmid:12045153
  36. 36. Prakash S, Kuang SQ, GenTAC Registry Investigators, Regalado E, Guo D, Milewicz D. Recurrent rare genomic copy number variants and bicuspid aortic valve are enriched in early onset thoracic aortic aneurysms and dissections. PloS one. 2016 Apr 19;11(4):e0153543. pmid:27092555
  37. 37. Durocher D., Charron F., Warren R., Schwartz R. J., Nemer M. The cardiac transcription factors Nkx2-5 and GATA-4 are mutual cofactors. EMBO J. 16: 5687–5696, 1997. pmid:9312027
  38. 38. Tremblay M, Sanchez-Ferras O, Bouchard M. GATA transcription factors in development and disease. Development. 2018 Oct 15;145(20):dev164384. pmid:30348673
  39. 39. McCulley DJ, Black BL. Transcription factor pathways and congenital heart disease. Current topics in developmental biology. 2012 Jan 1;100:253–77. pmid:22449847
  40. 40. Alonso‐Montes C, Martín M, Martínez‐Arias L, Coto E, Naves‐Díaz M, Morís C, et al. Variants in cardiac GATA genes associated with bicuspid aortic valve. European journal of clinical investigation. 2018 Dec;48(12):e13027. pmid:30229885
  41. 41. Pehlivan T, Pober BR, Brueckner M, Garrett S, Slaugh R, Van Rheeden R, et al. GATA4 haploinsufficiency in patients with interstitial deletion of chromosome region 8p23. 1 and congenital heart disease. American journal of medical genetics. 1999 Mar 19;83(3):201–6. pmid:10096597
  42. 42. Li RG, Xu YJ, Wang J, Liu XY, Yuan F, Huang RT, et al. GATA4 loss-of-function mutation and the congenitally bicuspid aortic valve. The American journal of cardiology. 2018 Feb. pmid:29325903
  43. 43. Glessner JT, Bick AG, Ito K, Homsy JG, Rodriguez-Murillo L, Fromer M, et al. Increased frequency of de novo copy number variants in congenital heart disease by integrative analysis of single nucleotide polymorphism array and exome sequence data. Circulation research. 2014 Oct 24;115(10):884–96. pmid:25205790
  44. 44. Zogopoulos G, Ha KC, Naqib F, Moore S, Kim H, Montpetit A, et al. Germ-line DNA copy number variation frequencies in a large North American population. Human genetics. 2007 Nov;122:345–53. pmid:17638019
  45. 45. Yu S, Zhou XG, Fiedler SD, Brawner SJ, Joyce JM, Liu HY. Cardiac defects are infrequent findings in individuals with 8p23. 1 genomic duplications containing GATA4. Circulation: Cardiovascular Genetics. 2011 Dec;4(6):620–5. pmid:21933911
  46. 46. Mlynarski EE, Sheridan MB, Xie M, Guo T, Racedo SE, McDonald-McGinn DM, et al. Copy-number variation of the glucose transporter gene SLC2A3 and congenital heart defects in the 22q11. 2 deletion syndrome. The American Journal of Human Genetics. 2015 May 7;96(5):753–64. pmid:25892112
  47. 47. Freeman SB, Taft LF, Dooley KJ, Allran K, Sherman SL, Hassold TJ, et al. Population‐based study of congenital heart defects in Down syndrome. American journal of medical genetics. 1998 Nov 16;80(3):213–7. pmid:9843040
  48. 48. Paladini D, Tartaglione A, Agangi A, Teodoro A, Forleo F, Borghese A, et al. The association between congenital heart disease and Down syndrome in prenatal life. Ultrasound in Obstetrics and Gynecology. 2000 Feb;15(2):104–8. pmid:10775990
  49. 49. Laursen HB. Congenital heart disease in Down’s syndrome. Heart. 1976 Jan 1;38(1):32–8. pmid:1252293
  50. 50. Kosaki R, Kosaki K, Matsushima K, Mitsui N, Matsumoto N, Ohashi H. Refining chromosomal region critical for Down syndrome‐related heart defects with a case of cryptic 21q22. 2 duplication. Congenital anomalies. 2005 Jun;45(2):62–4. pmid:15904434
  51. 51. Grossman TR, Gamliel A, Wessells RJ, Taghli-Lamallem O, Jepsen K, Ocorr K, et al. Over-expression of DSCAM and COL6A2 cooperatively generates congenital heart defects. PLoS genetics. 2011 Nov 3;7(11):e1002344. pmid:22072978
  52. 52. Gu H, Smith FC, Taffet SM, Delmar M. High incidence of cardiac malformations in connexin40-deficient mice. Circulation research. 2003 Aug 8;93(3):201–6. pmid:12842919
  53. 53. Theis JL, Niaz T, Sundsbak RS, Fogarty ZC, Bamlet WR, Hagler DJ, et al. CELSR1 Risk Alleles in Familial Bicuspid Aortic Valve and Hypoplastic Left Heart Syndrome. Circ: Genomic and Precision Medicine. 2022;15. pmid:35133174
  54. 54. Pottie L, Adamo CS, Beyens A, Lütke S, Tapaneeyaphan P, De Clercq A, et al. Bi-allelic premature truncating variants in LTBP1 cause cutis laxa syndrome. The American Journal of Human Genetics. 2021;108: 1095–1114. pmid:33991472
  55. 55. Akasaka T, Ocorr K, Lin L, Vogler G, Bodmer R, Grossfeld P. Overexpression of Kif1A in the Developing Drosophila Heart Causes Valvar and Contractility Defects: Implications for Human Congenital Heart Disease. JCDD. 2020;7: 22. pmid:32498427
  56. 56. Pannu H, Tran-Fadulu V, Papke CL, Scherer S, Liu Y, Presley C, et al. MYH11 mutations result in a distinct vascular pathology driven by insulin-like growth factor 1 and angiotensin II. Human Molecular Genetics. 2007;16: 2453–2462. pmid:17666408
  57. 57. Herman DS, Lam L, Taylor MRG, Wang L, Teekakirikul P, Christodoulou D, et.al. Truncations of titin causing dilated cardiomyopathy. New England Journal of Medicine. 2012;366: 619–628. pmid:22335739