Genome-Wide Association Studies of the PR Interval in African Americans

The PR interval on the electrocardiogram reflects atrial and atrioventricular nodal conduction time. The PR interval is heritable, provides important information about arrhythmia risk, and has been suggested to differ among human races. Genome-wide association (GWA) studies have identified common genetic determinants of the PR interval in individuals of European and Asian ancestry, but there is a general paucity of GWA studies in individuals of African ancestry. We performed GWA studies in African American individuals from four cohorts (n = 6,247) to identify genetic variants associated with PR interval duration. Genotyping was performed using the Affymetrix 6.0 microarray. Imputation was performed for 2.8 million single nucleotide polymorphisms (SNPs) using combined YRI and CEU HapMap phase II panels. We observed a strong signal (rs3922844) within the gene encoding the cardiac sodium channel (SCN5A) with genome-wide significant association (p<2.5×10−8) in two of the four cohorts and in the meta-analysis. The signal explained 2% of PR interval variability in African Americans (beta  = 5.1 msec per minor allele, 95% CI  = 4.1–6.1, p = 3×10−23). This SNP was also associated with PR interval (beta = 2.4 msec per minor allele, 95% CI = 1.8–3.0, p = 3×10−16) in individuals of European ancestry (n = 14,042), but with a smaller effect size (p for heterogeneity <0.001) and variability explained (0.5%). Further meta-analysis of the four cohorts identified genome-wide significant associations with SNPs in SCN10A (rs6798015), MEIS1 (rs10865355), and TBX5 (rs7312625) that were highly correlated with SNPs identified in European and Asian GWA studies. African ancestry was associated with increased PR duration (13.3 msec, p = 0.009) in one but not the other three cohorts. Our findings demonstrate the relevance of common variants to African Americans at four loci previously associated with PR interval in European and Asian samples and identify an association signal at one of these loci that is more strongly associated with PR interval in African Americans than in Europeans.


Introduction
The electrocardiogram is an important clinical tool that provides a graphical representation of the electrical activity of the heart as captured by skin surface electrodes. The electrocardiographic PR interval represents conduction through the atria and AV node, and impaired conduction through these tissues is a central pathogenic mechanism in many cardiac arrhythmias. Prolongation of the PR interval is a risk factor for long-term atrial fibrillation, heart block, and all-cause mortality [1]. A substantial proportion of the variability of the PR interval is explained by genetic factors with heritability estimates ranging between 30 and 50% in populations of European and Asian ancestry [2][3][4][5][6][7]. Identification of the specific alleles underlying the heritability of the PR interval might provide novel insights into molecular electrophysiology, lead to novel drug targets for arrhythmia treatment, and facilitate genetic prediction of arrhythmia risk and response to antiarrhythmics. Genome-wide association (GWA) studies have identified nine loci associated with PR interval duration in populations of European [6,8] or South Asian ancestry [9]. Four of these associations have been replicated [6,9]. However, there is a general paucity of GWA studies performed in populations of African ancestry [10] and none have been performed for PR interval duration which has been suggested to be longer in individuals of African than European ancestry [11]. Samples of African ancestry typically have shorter-range linkage disequilibrium (LD) than samples of European or Asian ancestry, potentially reducing genomic coverage in GWA studies using fixed arrays. However, samples of African ancestry offer greater genetic diversity; European and Asian populations have experienced a loss of allelic diversity since the out-of-Africa founding events [12,13]. Thus, unique alleles influencing complex traits may be present in samples of African ancestry and shorter-range LD can facilitate narrowing of association signals. We studied the association of genetic ancestry with PR interval and performed a GWA study in 6,247 African Americans from four cohorts as part of the Candidate-gene Association Resource (CARe) consortium including the Atherosclerosis Risk in Communities (ARIC) study, the Cleveland Family Study (CFS), the Jackson Heart Study (JHS) and the Multi-Ethnic Study of Atherosclerosis (MESA). Replication of genome-wide significant associations was attempted by direct genotyping in 2,022 African Americans from the Health, Aging, and Body Composition Study (Health ABC) and the Healthy Aging in Neighborhoods of Diversity across the Life Span Study (HANDLS).

Phenotype modeling
Baseline characteristics of the study populations are shown in Table 1. After adjustment for covariates, residual standard deviations for PR interval ranged from 22.1 to 26.9 msec in the four cohorts. The proportion of variation in phenotype explained by covariates (r 2 ) was modest, ranging from 0.04 to 0.15, as shown in Table 1.

Genetic ancestry and PR interval
The distribution of European ancestry relative to African ancestry was similar across cohorts; ARIC (median 0. 15 African ancestry was associated with longer PR interval than European ancestry in ARIC (13.3 msec longer for complete African compared to complete European ancestry, p = 0.009) but not in CFS (213.2 msec, p = 0.44), JHS (1.0 msec, p = 0.89) or MESA (7.2 msec, p = 0.11).

Meta-analysis of genome-wide association studies
Genomic inflation was minimal in ARIC (l GC = 1.02) and MESA (l GC = 1.01) and modest in the family-based CFS (l GC = 1.10) and JHS (l GC = 1.08) studies, likely due to relatedness of some participants. Quantile-quantile plots of p-values were consistent with null distributions at all but the lowest p-values ( Figure S1). Genomic control was applied to test statistics in all cohorts. Genomic inflation in the meta-analysis was modest (l GC = 1.04); genomic control was applied again (Figure 1). The distribution of association results across the genome is shown in Figure 2. Twenty-four single nucleotide polymorphisms (SNPs) at four genomic loci reached genome-wide significance as shown in Table S1. We clustered these SNPs by pairwise correlation in the HapMap phase II YRI panel and observed five weakly correlated linkage disequilibrium bins (all pairwise r 2 were ,0.3). Results for the SNP with the lowest p-value in each cluster are shown in Table 2 and the correlation of these SNPs to SNPs reported in previous GWA studies of European and Asian ancestry are shown in Table 3.
The top association result was for rs3922844 at SCN5A (5.1 msec per minor allele, 95% CI = 4.1-6.1, p = 3610 223 ), explaining 2% of variation in PR interval. The SNP (rs3922844) reached genome-wide significance (p,2.5610 28 ) in the larger cohorts, ARIC and JHS, considered individually. The SNP had a large effect estimate (5.1-6.1 msec or 0.19-0.24 standard deviations per C-allele) consistently across ARIC, JHS and MESA as shown in Table 2. The effect estimate was smaller in CFS, but CFS had a smaller sample of African Americans and hence wider confidence intervals than was observed in other larger samples. We observed no evidence of heterogeneity across the four African American cohorts (p for heterogeneity = 0.41). The SNP was well imputed in all cohorts as shown in Table S2. It was not correlated in HapMap CEU or YRI panels with the SNPs at this locus that have been identified in previous GWA studies of European or Asian ancestry (r 2 ,0.10, Table 3).
We observed an additional association signal in SCN5A (rs6599222, p = 2610 28 ), for which the strongest SNP was in LD with the SNP identified in a previous European GWA study (rs11708996, r 2 = 0.61) and with an effect size (+3. 4

Author Summary
We performed genome-wide association studies in African American participants from four population-based cohorts to identify genetic variation that correlates with variation in PR interval duration, an electrocardiographic measure of conduction through the atria and atrioventricular node. We observed a strong signal within the gene encoding the cardiac sodium channel, SCN5A, with genome-wide significant association (p,2.5610 28 ) in two cohorts and in a meta-analysis of four cohorts with African Americans. We replicated this association in two additional cohorts of African Americans and in Europeans (p = 3610 216 ). The signal explains 2% of PR duration variability in African Americans and 0.5% in Europeans. In further meta-analysis, we observed genome-wide significant associations for single nucleotide polymorphisms in SCN10A, MEIS1, TBX5, corresponding to signals observed in people of European and Asian descent. We found an association of genetic ancestry and PR interval in one but not the other three cohorts. Our findings provide the first demonstration of the relevance of these loci to individuals of African ancestry and identify an association signal from SCN5A that is more strongly associated with PR interval in African Americans.
study [8]. However, when analyzed conditional on rs3922844, the SNP rs6599222 was only borderline significantly associated with PR interval as shown in Table S3. The SNPs in SCN5A previously reported to be associated with PR interval in a cohort of Micronesians [5] were not associated with PR interval in African Americans (p = 0.57). The association results, correlations and locations of all studied SNPs in SCN5A are plotted against recombination rates in Figure 3. We further observed significant associations for a cluster of ten highly correlated SNPs in SCN10A that included the missense SNP (rs6795970) reported previously in GWA studies of individuals of European and Asian ancestry [6,8,9]. The missense SNP did not have the lowest p-value in the cluster but had the highest effect estimate, the lowest minor allele frequency (Table 2) and a somewhat lower imputation quality than other SNPs. Significant associations also were seen with SNPs inTBX5 and MEIS1. These correlated with the strongest reported SNPs from the GWA study performed in individuals of European ancestry at the loci shown in Table 3. The SNPs with the lowest p-values in each cluster were all well imputed as shown in Table S2. We observed no evidence for heterogeneity of effect across cohorts for any SNP reported in Table 2 (p.0.05).
As shown in Table 4, additional adjustment for local ancestry estimates did not impact any of the genome-wide significant associations.

Replication in African Americans
The strongest SNPs at loci reaching genome-wide significance were genotyped in Health ABC and HANDLS. As shown in Table 4, we observed evidence of replication for the top SNPs in SCN5A, SCN10A and MEIS1 and borderline significant replication for TBX5.

Association study in individuals of European ancestry
We examined the result for rs3922844 in a large, European GWA study (CHARGE-PR) [8], where it was imputed in four cohorts (n = 14,042). We observed a significant association with the PR interval (beta 2.40 msec per major allele, 95% CI = 1.83-2.97, p = 3610 216 ). As in HapMap reference panels, allele frequencies were reversed in Europeans compared to African Americans, with a minor allele frequency of 0.30 for the T-allele. The effect size was significantly different in CHARGE-PR as compared to CARe (p for heterogeneity ,0.001, I 2 = 0.99).

Discussion
We performed a meta-analysis of GWA studies of the PR interval duration in individuals of African ancestry from four cohorts. We used a stringent significance threshold of 2.5610 28 , as previous genetic analyses have suggested twice as many independent SNPs in samples of African ancestry compared to European ancestry [14].
In two cohorts and in the meta-analysis we observed a genomewide significant association with the SNP rs3922844 in SCN5A that explains approximately 2% of variability in PR interval duration. The SNP was not correlated with rs11708996, another SNP in SCN5A associated with PR interval duration in prior GWA studies of subjects of European ancestry [8], and had a larger effect estimate than rs11708996. We, therefore, examined the result for rs3922844 in a previously reported GWA study of PR interval in subjects of European ancestry [8] and observed genome-wide significant association but with a smaller effect estimate as compared to our study (5.1 msec, 95% CI = 4.1-6.1 versus 2.4 msec, 95% CI = 1.8-3.0, p for heterogeneity ,0.001).Thus, we have evaluated genetic variants associated with PR interval duration in subjects of African American ancestry and identified an association signal from SCN5A with a larger effect size in individuals of African American compared to European ancestry.
We consider direct prolongation of the PR interval by rs3922844 unlikely given the differing effect sizes in individuals of African American and European ancestry. No SNP catalogue, including HapMap, used as reference panel for imputation in our study, includes all common sequence variants. We consider it more likely that rs3922844 is correlated with one or several causal variants and that the varying effect sizes result from differences in LD patterns across populations. For example, two variants with opposing effects could exist at the locus, with overlapping signals in Europeans but with isolation of individual signals in African Americans due to breakdown of linkage disequilibrium. The SNP rs3922844 was not directly genotyped in our samples but was imputed with high imputation quality as measured by the average ratio of observed variance to that expected by the Hardy-Weinberg equilibrium. We note that no SNPs included on our genotyping platform had an r 2 .0.2 with rs3922844 in the African (YRI) HapMap panel. A small cluster of genotyped SNPs were well correlated with rs3922844 in the European (CEU) HapMap panel (r 2 = 0.6-0.8) as shown in Figure 3 but none of these reached a p-value less than 10 24 . These observations suggest that the association was detected due to correlation of rs3922844 with multiple, weakly correlated, genotyped SNPs. Alternatively, a strong gene-environment interaction for rs3922844 could result in large differences in effect estimates across populations subject to differential environmental exposures or other genetic effects. Large-scale resequencing or imputation from large-scale resequencing efforts such as the 1000 Genomes Project, and functional experiments will ultimately be necessary to identify the causal variants.
Although allelic heterogeneity between families is well established for monogenic diseases, little is known about common variants across different populations. Recently identified findings include different association signals at the locus containing KCNQ1 with type 2 diabetes in European and Asian cohorts [10], at MYH9 with focal segmental glomerulosclerosis [15] and a MYBPC3 variant that causes cardiomyopathy and is only polymorphic in Asian populations [16]. Allelic heterogeneity is conceptually important and highlights the value of genetic association studies spanning diverse racial and ethnic populations. Studies across multiple populations may identify novel disease loci and genetic factors underlying differences in risk by race. Our results do not provide support for allelic heterogeneity across populations for PR interval, although limited power may have failed to detect such effects. Genetic admixture studies have estimated that, on average, modern African American genomes have approximately 80-90% African and 10-20% European ancestry [17], which is well in line with our global ancestry analyses. It is important to note that there is great genetic diversity across the African continent. Our study included African Americans, who predominantly have an ancestral origin in Central and West Africa. It is possible that GWA studies in other samples of African ancestry might identify additional association signals.
The clinical impact of our findings merit further exploration. For example, a missense variant in SCN5A, S1103Y (rs7626962), which is polymorphic in African Americans but very rare in Europeans, has been associated with a markedly increased risk of ventricular arrhythmias [18]. This variant was not correlated with rs3922844 in the YRI panel of HapMap phase III (r2 = 0.04). Furthermore, African Americans have been shown to have a lower incidence of atrial fibrillation than Europeans. Longer PR interval duration (difference in means 11.8 msec) has also been reported in African Americans compared to Europeans in the ARIC study [11]. We were able to confirm this observation in ARIC using genetic ancestry, with a similar magnitude (13.3 msec longer with complete African compared to complete European ancestry, p = 0.009). However, we did not observe differences by genetic ancestry in CFS, JHS or MESA. This discrepancy could be due to differences in trans-African ancestry or other factors. Our results further indicate that common variants are unlikely to explain a large part of any ancestry-related variation in atrial and atrioventricular nodal conduction. The strongest variant observed here explained 2% of PR interval variability. It seems more likely that any ancestry-related differences in atrial conduction are due to rare variants, but this awaits further testing. Additional studies in populations with atrial fibrillation or sudden cardiac death will be necessary to determine the impact of the SNPs identified in our study on clinical outcomes.
In our meta-analysis, we also observed genome-wide significant association signals of SNPs in SCN10A, TBX5 and MEIS1, which are correlated with SNPs reported in previous GWA studies in individuals of European ancestry. The lowest p-value in the SCN10A cluster of SNPs was not observed in the previously described missense variant (rs6795970) [6,9] but in an intronic SNP (rs6798015). The effect estimate was highest for the missense SNP but it also had the lowest minor allele frequency in the cluster, a lower allele frequency than in European (CEU) and  Asian (JPT and CHB) HapMap panels. No evidence that the missense variant explains the association with PR interval has been published yet. Although it is possible that another SNP mediates the association, we find it more likely that the weaker p-value for the intronic SNP in our meta-analysis reflects differences in imputation quality or chance difference in the strength of the observed association. To ultimately identify the causal variants in each population, large-scale resequencing and functional experiments will be necessary. Four genome-wide association studies for PR interval duration have been published previously. The first study identified an association with SNPs in SCN5A in a Micronesian founder population, but no SNP reached genome-wide significance [5]. A large study in individuals of European ancestry identified nine loci of genome-wide significance [8], including ARHGAP24, CAV1/ CAV2, MEIS1, NKX2-5, SCN5A, SCN10A, SOX5, TBX5/TBX3 and WNT11. A third study in Icelanders identified and replicated significant associations with four of these loci [6]; ARHGAP24, CAV1, SCN10A and TBX5/TBX3. Finally, a study in individuals of Asian ancestry identified and replicated association with SNPs in SCN10A [9]. Here, we have reported genome-wide significant associations in African Americans with SNPs in strong LD with the previously reported SNPs at four of these loci; MEIS1, SCN5A, SCN10A and TBX5/TBX3. These findings demonstrate the relevance of these common alleles to individuals of African ancestry.
Our study has a number of limitations. All GWA studies suffer risk of false positive associations due to multiple testing, which is higher in samples of African ancestry where two million independent SNPs have been described [14]. We therefore applied a stringent significance threshold (2.5610 28 ) to account for the approximately two million common variant tests estimated to exist across the genome. Large sample sizes are required to achieve such stringent significance levels. Our significant findings are thus unlikely to be false positives, especially as association signals at the same loci have been observed in European studies. However, even in the meta-analysis our study was underpowered to detect both common variants of small effect and rare variants. Measurement error in PR duration and heterogeneity between cohorts may further contribute to loss of power. Another potential problem encountered in genetic association studies is population stratification or confounding by ancestry. We sought to address this issue by filtering out individuals with cryptic relatedness or unlikely to be of African ancestry, adjusting for principal components of genome-wide data that describe within-population genetic variation, using family-based methods where appropriate and applying genomic control.
In conclusion, our results suggest that genetic loci underlying complex traits may be in large part the same across ancestral populations. However, association signals may show different effects across ethnic and racial populations and multiple association signals may exist at these loci. We anticipate that large-scale resequencing and functional experiments will ultimately identify the causal variants specific to each population.

Ethics statement
The study was approved by the Institutional Review Board at all participating institutions. Only individuals who provided informed consent to genetic testing were included.

Study samples
Individuals of self-reported African American ancestry from four cohort studies were genotyped as part of the Candidate-gene Association Resource (CARe) [19] (Lettre G et al, submitted). Detailed descriptions of these cohorts have been published previously [20][21][22][23]. Additional details are shown in Table S4. Details on replication samples (Health ABC, HANDLS) are shown in Table S5.
ARIC. The Atherosclerosis Risk in Communities (ARIC) study is a prospective population-based study of atherosclerosis and cardiovascular disease that included 15,792 men and women between 45 and 64 years from four US communities, 4,314 of whom were self-reported black Americans from two of the four communities (Jackson, MS and Forsyth County, NC) [20]. Electrocardiographic recordings used in the present study were performed at baseline examinations between 1987-1989. After exclusions, data on 2,391 black individuals with genotypes, information on all covariates and informed consent remained for analyses.
CFS. The Cleveland Family Study (CFS) is a family-based, longitudinal study designed to study the risk factors for sleep apnea [21]. The 632 African Americans with available DNA were genotyped as part of CARe. Electrocardiographic recordings used for the present study were performed at the final exam cycle conducted in a Clinical Research Unit between 2001-2006. After exclusions, 267 individuals remained for analyses.
JHS. The Jackson Heart Study (JHS) is a prospective, community-based study of the causes of the high prevalence of cardiovascular disease in African Americans that includes 5,301 self-reported African Americans recruited between 2000-2004 [22]. The cohort comprises residents of a tri-county area near Jackson, MS (Hinds County, Rankin County and Madison County). The study includes a subsample of unrelated participants (35-84 years) and a nested family-based subcohort ($21 years). Electrocardiographic recordings used in the present study and blood collection for DNA extraction were performed at baseline examinations between 2000-2004. Genotype data were available in 3,030 individuals, including 885 who were also included in the ARIC study. After exclusions, 1962 individuals with ECG data and not included in ARIC were included in the present analysis.
MESA. The Multi-Ethnic Study of Atherosclerosis (MESA) is a population-based study of the characteristics of subclinical cardiovascular disease that included 6,814 individuals (28% African Americans) free from known cardiovascular disease between 45-84 years recruited from 6 field centers in the US [23]. Electrocardiographic recordings used in the present study and blood sampling for DNA extraction were performed at baseline visits between 2000-2002. African Americans constituted 28% of the total sample, of whom 1627 with genotype and phenotype data remained after exclusions.

ECG recordings
12-lead electrocardiograms with standard lead placements were recorded during ten seconds in all cohorts using Marquette MAC PC, MAC6 or MAC1200 machines (GE Healthcare, Milwaukee, WI, USA). PR interval duration was measured electronically using either the Marquette 12SL algorithm or the MC MEANS algorithm (Table S4).

Genotyping and quality control
All genotyping was performed at the Broad Institute of Harvard and MIT using the Affymetrix Genome-Wide Human SNP Array 6.0, which interrogates 906,600 SNPs, according to the manufacturers recommendations. Sample processing and quality control has been described previously [19] (Lettre G et al, submitted). Briefly, 1 ug of genomic DNA was equally interleaved on 96-well master plates. Quantity of double stranded DNA was ascertained using PicoGreen (Molecular Probes, Oregon, USA). Restriction enzyme digestion and PCR amplification using universal primers was performed to generate fragments of between 200-1,100 base pairs in length which were further fragmented to 25-50 bp and labeled with biotinylated nucleotides. Labeled fragments were hybridized to the microarray, washed and detected. Genotypes were called using Birdseed v1.33 and quality control steps were performed in PLINK, EIGENSTRAT and PREST-Plus [24][25][26][27][28]. To confirm sample identity, genotype concordance between 24 SNPs genotyped in all samples both on the microarrays and using Sequenom iPlex were monitored. DNA samples with genotyping success rate ,95% were excluded. Autosomal heterozygosity rates were estimated to identify and exclude samples with poor DNA quality or contamination. Duplicated or contaminated samples were identified from identity by descent (IBD) estimates and excluded. Samples that were present in both JHS and ARIC were excluded based on IBD concordance. We also excluded samples unlikely to be African American (ARIC = 8), based on principal component analyses incorporating carefully curated reference populations of 1,178 European Americans and 756 Nigerians in EIGENSTRAT as previously described (Lettre G et al, submitted).
We excluded SNPs with genotyping success rate ,90%, minor allele frequency ,1%, SNPs that map to multiple locations, SNPs where missingness could be predicted from surrounding haplotypes and SNPs associated with chemistry plates (i. e. differential genotyping by plate). SNPs with unusually high numbers of Mendelian errors were excluded from family-based samples. No exclusions were performed based on Hardy-Weinberg equilibrium given admixture of the sample. In total, about 800,000 SNPs passed quality control in all cohorts.

Imputation and quality control
Imputation of additional ungenotyped SNPs was performed using hidden Markov models as implemented in MACH v1.16 [29] for a total of 2,801,419 million SNPs. Phased reference haplotypes from combined HapMap phase II CEU and YRI panels were used with equal proportions. Imputation was performed in two steps: in the first step, related individuals (as identified by proportion of genomic variation shared IBS/IBD (pi hat).0.05) were filtered and error and recombination rates were estimated per sample in MACH using 30 iterations. In the second step, estimated rates were used to estimate allele dosage for all samples in one iteration. The average ratio of observed variance over that expected under Hardy-Weinberg equilibrium per SNP was also estimated as quality metric (R 2 ). All samples were genotyped for 50,000 SNPs using the ITMAT-Broad-CARe (IBC) array [30] and the concordance rate (1-1/26 [imputed dosagechip dosage]) of imputed genotypes with IBC genotypes was high (,95%) in all samples. We excluded poorly imputed SNPs (ratio of observed over expected variance ,0.3), SNPs with minor allele frequency,1% and non-autosomal SNPs. Sample outliers from missingness clustering analyses in PLINK were also removed.

Phenotype modeling
We excluded from all analyses individuals with missing covariates, ,18 years, pacemaker implant, Wolf-Parkinson-White pattern, extreme PR values (,80 ms and .320 ms), prevalent atrial fibrillation, heart failure, myocardial infarction, or 2 nd or 3 rd degree AV block. PR interval duration was regressed linearly on age, sex, RR interval, BMI, height, systolic blood pressure and study site where relevant using SAS 9.2 (SAS Institute, Cary, North Carolina). Residuals were used as phenotype for association analyses.

Genome-wide association analyses and meta-analysis
For all cohorts except CFS, GWA analysis was performed in PLINK, using a linear regression model with an additive genetic model. The family-based CFS study was analyzed using linear mixed-effects models as implemented in the GWAF package for R [31]. Although JHS contains a family-based subcohort, little effect on inflation of using family-based methods were observed in preliminary analyses for a set of traits. Pedigrees for CFS were confirmed using identity by state (IBS) or IBD estimates from PREST-Plus. In all association analyses, we adjusted for population stratification by including the first ten principal components in addition to genome-wide SNP data. Cohortspecific GWA results were combined using fixed effects metaanalysis with inverse variance weights. Genomic inflation factors were evaluated in each cohort prior to meta-analysis and in the combined results [32]. We prespecified a genome-wide significance threshold of 2.5610 28 as suggested for populations of African ancestry by Pe'er et al [14], assuming 2 million independent common variant tests in genomes of individuals of African American ancestry with p,0.05 genome-wide. Heterogeneity across samples was assessed by Cochran's x 2 test of heterogeneity with 4 degrees of freedom. To identify independent signals at loci we clustered all genome-wide significant hits by pairwise correlations $0.5 in the YRI HapMap panel. All analyses were performed using PLINK or R version 2.10.1 (R Foundation for Statistical Computing, Vienna, Austria). We examined the results for any SNPs of genome-wide significance that were uncorrelated with signals previously reported in other ancestral populations in a large published GWA study of PR interval in Europeans ancestry [8].

Genetic ancestry analyses
Individual estimates of global ancestry (proportion of European ancestors relative to African ancestors) and local ancestry were calculated using ANCESTRYMAP [33] and HAPMIX [34], as previously described (Lettre G et al, submitted). To study the association of genetic ancestry and PR interval, global ancestry estimates were regressed linearly on PR residuals. To assess the impact of local ancestry on SNP associations, all associations with genome-wide significance were adjusted for local ancestry. Figure S1 Quantile-quantile plots for genome-wide association studies of PR interval duration in African Americans. Found at: doi:10.1371/journal.pgen.1001304.s001 (0.34 MB TIF) Table S1 All SNPs reaching genome-wide significance (p,2.5610 28 ) in a meta-analysis of four genome-wide association studies of PR interval duration in African Americans. Positions refer to NCBI build 36. Genomic control has been applied to standard errors and p-values. SNP clusters as defined by r2 greater than or equal to 0.5 are indicated. Neff refers to the effective sample size, defined as N6R2, and reflects the loss of power with poor imputation compared to the total sample sizes shown in Table 1. Table S2 Imputation quality and effective sample size for the SNPs with the lowest p-values in each linkage equilibrium bin of genome-wide significant SNPs. R2 refers to the average ratio of the observed variance over that expected under Hardy-Weinberg equilibrium and indicates quality of imputation as poorly imputed SNPs show less variability than expected based on allele frequency.

Supporting Information
Neff refers to the effective sample size, defined as N6R2, and reflects the loss of power with poor imputation compared to the total sample sizes shown in Table 1