Gene-Centric Meta-Analysis of Lipid Traits in African, East Asian and Hispanic Populations

Meta-analyses of European populations has successfully identified genetic variants in over 100 loci associated with lipid levels, but our knowledge in other ethnicities remains limited. To address this, we performed dense genotyping of ∼2,000 candidate genes in 7,657 African Americans, 1,315 Hispanics and 841 East Asians, using the IBC array, a custom ∼50,000 SNP genotyping array. Meta-analyses confirmed 16 lipid loci previously established in European populations at genome-wide significance level, and found multiple independent association signals within these lipid loci. Initial discovery and in silico follow-up in 7,000 additional African American samples, confirmed two novel loci: rs5030359 within ICAM1 is associated with total cholesterol (TC) and low-density lipoprotein cholesterol (LDL-C) (p = 8.8×10−7 and p = 1.5×10−6 respectively) and a nonsense mutation rs3211938 within CD36 is associated with high-density lipoprotein cholesterol (HDL-C) levels (p = 13.5×10−12). The rs3211938-G allele, which is nearly absent in European and Asian populations, has been previously found to be associated with CD36 deficiency and shows a signature of selection in Africans and African Americans. Finally, we have evaluated the effect of SNPs established in European populations on lipid levels in multi-ethnic populations and show that most known lipid association signals span across ethnicities. However, differences between populations, especially differences in allele frequency, can be leveraged to identify novel signals, as shown by the discovery of ICAM1 and CD36 in the current report.


Introduction
Plasma levels of circulating total cholesterol (TC), low-density lipoprotein (LDL-C), high-density lipoprotein (HDL-C) and triglycerides (TG) are associated with coronary artery disease (CAD) and are targets for therapeutic intervention [1]. Multiple environmental and genetic factors influence these plasma lipid levels, with heritability estimated to range from 0.28 to 0.78 in twin and family studies [2]. To date, .100 lipid-associated loci have been described, using studies mainly based on individuals of European ancestry [3]. Together, known variants affecting plasma lipid levels explain 10-12% of the total variance and 25-30% of the genetic variance [3] indicating that other loci and independent signals in established loci are likely to additionally contribute to the trait.
Lipid levels have been demonstrated to vary between ethnic groups [4]. Africans and East Asians have higher levels of HDL-C and lower levels of TG compared to Europeans [5] though the underlying mechanisms of these ethnic differences remain unknown. Genetic contributors to lipid concentrations are less well understood in non-European populations partly due to less well-powered genetic studies being attempted to date and most genotyping platforms are designed to have optimal coverage in European studies. An important first step towards understanding genetic risk across populations is to establish whether plasma lipid associated loci, identified in Europeans, span across multiple ethnicities or are population-specific. In a recent analysis, most of these known lipid loci had the same direction of association in different ethnic groups as in Europeans, despite presumed differences in linkage disequilibrium (LD) between marker and causal variants in each population [6]. Using regional LD in different ethnicities can help to refine association signals and to distinguish causal variants from correlated markers [7]. Furthermore, independent association signals in established lipid loci in one ethnicity may be useful to highlight causal signal(s) in other ethnicities.
The ITMAT-Broad-CARe (IBC) array (also referred to as the CardioChip or HumanCVD Beadchip [Illumina]) was specifically designed to densely tag ,2000 genes with known or potential roles in lipid and cardiovascular traits using ,50,000 single nucleotide polymorphisms (SNPs) [8]. Sequencing data from European, African American and Yoruba individuals was included for SNP selection in IBC array development. The IBC array drew upon knowledge of lipid metabolism and cardiovascular physiology, as well as early GWAS and sequencing studies to target efforts towards regions with higher a priori evidence of association, reducing cost per sample, and improving efficiency of replication studies. The IBC array has been successfully used for multiple cardiovascular-related phenotypes [9,10,11,12]. Results are reported elsewhere for the association of lipid phenotypes in European-derived cohorts with variants on the IBC array [13].
In this study we set out to discover novel lipid loci, fine map signals to identify causal genes at implicated loci, and gain a greater understanding of the genetic architecture of lipid traits across ethnicities. Here, we have used the IBC array to examine association results for TC, LDL-C, HDL-C and TG across seven non-European study populations, including African Americans (n = 7,657), Hispanics (n = 1,315) and East Asians (n = 841). Using conditional analyses, we sought to identify independent signals from within associated loci. Finally, we assessed the direction of effect in non-Europeans of new and established loci found in European-derived populations, and tested a composite risk score of known loci across ethnicities.

Ethics statement
All participants in each of the cohorts gave informed written consent. The Institutional Review Boards (IRBs) of each CARe cohort (i.e., the IRBs for each cohort's field centers, coordinating center, and laboratory center) have reviewed and approved the cohort's interaction with CARe. The study described in this manuscript was approved by the Committee on the Use of Humans as Experimental Subjects (COUHES) of the Massachusetts Institute of Technology.

Participating studies
Data from African-American, Hispanic and East Asian participants from seven cohorts were included for this study (Figure 1). Participants were $21 years of age. All seven studies contributed individual-level genotypes and phenotypes. Features of the included cohorts are presented in Table S1 and summary statistics are listed in Table S2. Six replication studies were used comprising African American individuals.

Phenotype definitions
Lipid phenotypes were taken from baseline or first measurements for all fasting individuals. All measurements were converted to mmol/L, with TC and HDL-C measurements converted from mg/dL by dividing by 38.67, and TG measurements converted from mg/dL by dividing by 88.57. TG values were log(10)transformed as TG values were not normally distributed. LDL-C was calculated according to Friedewald's formula L,C -H -kT where C is total cholesterol, H is HDL-C, L is LDL-C, T is TG and k is 0.45 for mmol/L (or 0.20 if measured in mg/dl) [14]. If TG values were .4.51 mmol/L (.400 mg/dL), then LDL-C was treated as a missing value.

Genotyping and quality control
Genotyping in each participating cohort was performed using the IBC array [8]. SNPs were clustered into genotypes using the Illumina Genomestudio software and were subjected to quality control filters at the sample and SNP level, separately within each cohort. Samples were excluded for individual call rates ,90%, gender mismatch, and duplicate discordance. SNPs were removed for call rates ,95% or Hardy-Weinberg equilibrium (HWE) p,10 27 . Due to low frequency SNPs included in the design, and the aim to capture low frequency variants of large effect across the combined dataset, we filtered only on minor allele frequency (MAF),0.005.

Statistical analyses
Evaluation of population stratification. Self-reported ethnicity was verified by multidimensional scaling analysis of identityby-state distances as implemented in PLINK [15], including HapMap panels as reference standards. After pruning of SNPs in linkage disequilibrium (r 2 .0.3), Eigenstrat was used to compute principal components within each ethnic group separately for use as covariates in the regression analyses [16].
Association testing. Association analysis was performed in each study using an additive genetic model with one degree of freedom. Gender stratified analyses were performed using three multivariate models: Model 1, including 10 principal components (PCs); Model 2, including 10 PCs, age, and lipid medication; and Model 3, including 10 PCs, age, lipid medication, type 2 diabetes (T2D), smoking and BMI. The genomic control inflation factor, lambda, was calculated for each cohort and used for within-study correction before meta-analysis. Genomic control inflation factors (l) ranged from 1.00 to 1.054.
Meta-analyses within each ethnic group were performed by two independent analysts using a fixed-effect inverse-variance approach in two different software packages: MANTEL (www. broadinstitute.org/,debakker/mantel.html) and METAL [17]. Results were highly concordant, reflecting a robust data analyses pipeline. Additionally, the directions of effect of lead SNPs from previously identified loci from the European IBC array metaanalysis [13] were evaluated for consistency in African Americans, Hispanics and Asians. To gauge an appropriate significance threshold, data from the Candidate gene Association Resource (CARe) IBC array studies [18] which is available on dbGAP (www.ncbi.nlm.nih.gov/gap) were employed and it was determined that after accounting for LD, the effective number of independent tests was ,26,500 for African Americans, ,23,500 for Hispanics, and ,15,500 for East Asians. This produces experimental or 'array-wide' statistical thresholds of p = 1.9610 26 , p = 2.1610 26 and p = 3.2610 26 , respectively, to maintain a false positive rate of 5% for each of the three ethnic groups. While we have adopted these 'array-wide' statistical thresholds for this study, we also highlight loci associated at a more conventional genomewide significance threshold of p,5.0610 28 .
Additionally, the I 2 statistic was calculated to quantify the proportion of total variation due to heterogeneity, as described previously [19].
Conditional Analyses. Loci harboring evidence for association of P,1610 25 in African Americans were examined for the presence of multiple, independent signals via conditional analyses in PLINK [15]. A term was added to the regression model including the lead SNP as a covariate, and SNPs within a +/ 2500 kb region were evaluated for significance. A locus-specific Bonferroni correction, as employed in previous IBC studies [20], was applied to determine significance of independent signals within candidate genes genotyped at each locus. On average, the windows contained 195.2 (6107.0) variants with a range between 12 for ACADL and 359 for PCSK9. Because of limited power due to low sample size, we did not perform conditional analyses in Hispanics and East Asians.
Genetic Risk Score Analyses and direction of effect. Within each ethnic group, we generated a genetic risk score using 28 SNPs for TC, 20 SNPs for LDL-C, 24 SNPs for HDL-C, and 21 SNPs for TG that had been found to be arraywide significant (p = 2.6610 26 ) in the European-ancestry IBC meta-analysis [21] (Table S3), weighted by the beta as described previously [22,23]. To account for missing data we adjusted the values for the number of genotyped risk alleles per individual. We evaluated for each ethnic group the contribution of the weighted genetic risk score to TC, HDL-C, LDL-C and TG in linear regression models adjusting for 10 PCs. Additionally, we compared the relative betas across quartiles of risk by linear regression. These loci were additionally investigated to study direction of effect across ethnicities.

Replication
In order to confirm putative novel loci, we replicated previously undetected lipid signals (p,1.0610 25 ) in 7,000 African American individuals from six replication cohorts and in 61,636 samples from the European-ancestry IBC meta-analysis [21]. Recent power analyses suggest that large-scale multi-ethnic association studies may have greater statistical power to detect causal alleles because of random genetic drift elevating global risk variants to higher allele frequency in some populations [24]. All but one replication studie provided summary results of SNPs that were genotyped on platforms other than the IBC array, or imputed using 1000 Genomes data. Features of the replication datasets included in this meta-analysis are described in Table S1.

Meta-analyses of African, Hispanic and East Asian populations
Meta-analyses of IBC array association results for plasma TC, LDL-C, HDL-C and TG levels in five African American studies (n = 7,657), two Hispanic studies (n = 1,315) and three East Asian studies (n = 841) were performed independently. Results of different association models did not differ substantially. Therefore, results of model 1, an additive model with 10 PCs as covariates, are presented in the main text ( Table 1) and results of other models are presented in the supplements (Table S4). After fixed-effect inverse-variance meta-analysis, we found that 23, five and two loci in African Americans, Hispanics and East Asian samples respectively, were significantly associated with a lipid trait at their respective array-wide significance thresholds, with twelve, three and one loci respectively surpassing the traditional genome-wide significance threshold (see Table 1; Figure 1). Two of these loci, intercellular adhesion molecule 1 (ICAM1) and CD36 molecule thrombospondin receptor (CD36), have not previously been reported to be associated with a lipid trait in a large-scale genomic study ( Figure 2).
Hispanics. Genome-wide significant association with HDL-C was observed for two SNPs in previously identified loci within CETP (rs3764261, p = 3.4610 211 ) and LIPC (rs8034802, p = 1.8610 28 ). For TG, we identified one genome-wide signal within the previously reported APOA5 locus (rs10750097, p = 2.1610 212 ). Genome-wide significant association for TC and LDL-C was not observed in our Hispanic populations.
East Asians. In East Asians, the rs662799 variant within ZNF259/APOA5 was significantly associated with TG (p = 1.6610 213 ). The opposite allele of the same SNP was studywide significantly associated with HDL-C. Genome-wide or studywide significant genetic association was not observed for LDL-C or TC in our East Asian populations.

Independent signals within single genetic loci in African Americans
The current investigation using the IBC array included rare SNPs at candidate loci collected in sequencing data from Europeans and Africans and dense genotyping, which can potentially be used to identify independent signals for lipids within genes at known or novel loci. We repeated association studies conditioning on the lead SNP in 23 loci with P,1.0610 25 . After Bonferroni correction for the number of SNPs at each candidate gene locus, we found independent lipids signals at the LDLR, APOE, PCSK9 and APOB loci for TC, at the APOE, PCSK9, LDLR, and APOB loci for LDL-C, at the APOC1/APOE, and LPL loci for TG and at the CETP, LPL, CD36 and the TRADD/LCAT for HDL-C (Table 2).
Three loci harbored two independent signals at genome-wide significance. The alleles rs6511720-G (risk allele frequency [RAF] = 0.86) and rs17242787-T (RAF = 0.98) within the LDLR gene showed association with TC with a p-value of 1.04610 213 and 4.7610 29 respectively in the original analyses. After conditioning on rs6511720-G, the p value for rs17242787-T remained significant (p = 2.4610 210 ). Also for LDL-C, we found two independent genome-wide significant signals within the APOE locus: rs389261-A (RAF = 0.25) and rs283813-T (RAF = 0.67). Furthermore, the SNPs rs17231520-A (RAF = 0.07) and rs4783961-A (RAF = 0.44) within the CETP gene were both strongly associated with HDL-C and after conditioning on the lead signal, the secondary signal remained significant with p = 2.8610 220 . Interestingly, the newly identified CD36 locus also harbored two independent signals, with the second signal showing association with locus-wide significance. The r 2 between the two SNPs in HapMap-YRI was 0.118.

Replication
In order to confirm putative novel signals, we carried out in silico follow-up of ten SNPs within novel loci and previously unreported SNPs within known lipid-associated loci (P,1.0610 25 ) in six African American studies, comprising together 7,000 samples. Only HeartSCORE was genotyped using the IBC array and provided association results for all SNPs. All other replication studies contributed association results for up to seven genotyped and imputed SNPs. Imputed SNPs were only included in the study when passing the 95% confidence threshold. Combined metaanalysis of the discovery and replication studies led to genomewide significant signals at the CD36 locus (p = 13.5610 212 ; Table 3) for association with HDL-C. A signal within ACADL was not significant after meta-analysis of the discovery and replication studies. However, the direction of effect was consistent with our discovery dataset in three of six studies, so it is possible that the signal has a weak effect and the locus is undetectable due to limited statistical power. Also, previously unidentified signals in known lipid loci showed genome-wide significant association in the combined discovery and replication meta-analysis: rs11806638 within PCSK9 was found to be associated with TC; rs389261 within APOE was associated with LDL-C levels; rs17231520 within the CETP locus and rs35673026 within the LCAT locus were found to be associated with HDL-C; and rs12721054 within APOE was associated with TG levels ( Table 3).

Comparison of lipid loci in African Americans to Europeans
Utilizing the results of each of the meta-analyses from the three available ethnicities, we sought to refine localization of known lipid signals or reveal novel independent signals within known loci based upon differential LD (see Table 1). The dense genotyping within each locus on the IBC array enabled detailed comparisons of loci that harbored array-wide significant SNPs in Africans Americans, Hispanics and East Asians as well as in the IBC metaanalysis of up to 61,636 individuals of Europeans ancestry [21] (see Table 1 and Table S3).
The strongest signal for HDL-C in African Americans is rs17231520 within CETP (p = 2.0610 246 ; Table 1). This SNP is associated with HDL-C in the same direction in Europeans with p = 3.3610 24 . However, in Europeans there is less power to detect this signal at array-wide significance, as the MAF in Europeans is only 0.2% (versus 7% in African Americans) and was screened out in many European studies for the IBC meta-analysis. Furthermore, rarer variants are often not correctly clustered optimally during QC, making them less likely to pass the standard quality control (including genotyping threshold or HWE check). This is also observed for the most strongly associated SNPs within CD36 (rs3211938) and LCAT (rs35673026) for HDL-C in African-Americans, as they show the same direction of effect in Europeans, but do not reach significance, given low MAF and absence in the majority of European studies for IBC meta-analysis. For two loci, LIPC and LPL, the strongest associated SNP in African Americans for HDL-C was the same or among the most highly associated SNPs in Europeans. Also, for the LDL-C-associated loci CELSR2, APOB, APOH and LDLR, the strongest signals in African Americans did overlap or represented similar signals that were highly associated with LDL-C in Europeans. The newly identified SNP for LDL-C, rs5030359 within ICAM1, has an observed MAF of 0.8% in African Americans and 0.2% in Europeans. In Europeans, this SNP is not associated with LDL-C (p = 0.3231), but the SNP is only present in very few European studies that are included in the IBC meta-analysis. The most associated signals within PCSK9 and APOE in African Americans are different, independent signals compared to the most associated SNPs within these loci in Europeans. Again, both signals are common in African Americans and have very low frequencies in Europeans: MAF for SNPs in PCSK9 and APOE are 17% and 25% in African Americans and 0.5% and 0.1% in Europeans respectively.
Among the array-wide statistically significant loci that were associated with TG in African Americans, three SNPs within GCKR, LPL and APOA5 were the same as or amongst the most highly associated SNPs in Europeans. SNP rs12721054 in APOE appeared to be a novel independent signal for TG in African Americans. This SNP showed an opposite effect in Europeanderived cohorts, although it was observed rarely in the metaanalysis of European populations (MAF = 0.2%) [13].
For TC, we observed the same pattern as for other lipid traits. The strongest associated SNPs within loci associated with TC overlapped with the same signals in Europeans (SNPs within CELSR2, APOB, LDLR and APOE), or were independent signals in African Americans that could not be replicated in Europeans because of low frequency (PCSK9, ACADL and ICAM1).

Direction of effect concordance with lead SNPs identified in European populations
Direction of effect across different ethnicities was studied for 28 previously established TC risk loci, 20 LDL-C loci, 24 HDL-C loci, and 21 TG associated loci. Not all SNPs passed the initial quality control, so number of investigated SNPs differed by trait and ethnicity (Table S3) (Table S3).

Genetic risk score analysis
To study whether we could find elevated lipid levels in multiethnic samples with cumulative numbers of risk alleles that were previously found to be associated in Europeans, we evaluated the contribution of the weighted genetic risk score for lipids in linear regression models adjusting for 10 PCs and compared the relative beta's ratios across quartiles of risk. We demonstrated a significant per quartile risk effect in African-Americans (ranging from p,10 210 for TG to p,10 233 for HDL-C), Hispanics (ranging Table 1. Cont. from p,10 2l for LDL-C to p,10 223 for TC) and East Asians (ranging from p,0.02 for HDL-C to p,10 26 for TG) (see Table 4). Quartiles based on weighted risk alleles and lipid level distribution for each ethnicity is shown in Figure S1.

Discussion
The current study reports a meta-analysis of lipid association studies in African Americans, Hispanics and East Asians using the IBC array, and has identified two novel loci associated with TC and LDL-C levels (rs5030359 in ICAM1) and HDL-C levels (rs3211938 in CD36) in African Americans. Additionally, we have uncovered multiple independent association signals within estab-lished lipid loci, demonstrating the value of dense SNP genotyping to uncover genetic variation associated with lipid levels. Furthermore, we have evaluated the impact of established SNPs, previously associated with lipids in Europeans populations, on lipid levels in three additional populations, showing that many known association signals for lipids span across ethnicities.

CD36
This study shows association between the nonsense coding variant rs3211938-G in CD36 and HDL-C levels at conventional genome-wide significance for African Americans (p,5610 29 ). This SNP has previously been reported to be associated with  Table 3. Replication results of nine signals in 7,000 African Americans.  increased HDL-C levels (p = 0.00018), decreased TG levels (p = 0.0059) and protection against metabolic syndrome (p = 0.0012) in a candidate gene study including 2,020 African Americans that did not overlap with samples in our meta-analyses [25]. Also, a variant within CD36 was associated with LDL levels in two small studies [26,27]. The CD36 finding is present in an accompanying paper [52] from the wider NHLBI CARe lipid studies which essentially uses the same discovery cohorts for African Americans that we present here although our analysis differs in that (a) it screened out related individuals (b) it takes additional covariates into account through the use of the three multivariate models and (c) our analysis filtered more stringently on I 2 and (d) we replicated these findings in additional studies. CD36, which is present on gustatory, olfactory and intestinal epithelial cells, is involved in the orosensory perception of fatty acids [28,29]. Also, lipid ingestion affects lingual CD36 expression in mice [30]. Therefore, CD36 may influence fat intake, and hence, serum lipid levels. SNPs within CD36, other than the one we found in this study, were linked to obesity in a case-control study [31]. However, this finding could not be replicated in a larger cohort [32]. In mouse models, CD36 deficiency impairs intestinal lipid secretion and results in hypertriglyceridemia [33]  and others show that CD36 deficiency rescues lipotoxic cardiomyopathy [34]. CD36 is an integral membrane protein found on the surface of many cell types and binds many ligands including oxidized lipid proteins [35,36], long-chain fatty acids [37] and erythrocytes that are parasitized with the malaria parasite Plasmodium falciparum [38]. The rs3211938-G variant is nearly absent in Europeans and Asians and shows a signature of selection in African Americans and some African populations [39,40]. Additionally, rs3211938-G has been shown in previous studies to be associated with CD36 deficiency and with susceptibility to malaria, although this has not been confirmed in other studies [41,42].

ICAM1
The rs5030359 variant in ICAM1, is observed in this study to be associated with TC and LDL-C at conventional genome-wide significance. ICAM1 encodes a cell surface glycoprotein that is typically expressed on endothelial cells and cells of the immune system [43]. However, rs5030359 maps to a gene-dense region (Figure 2b), so it cannot be excluded that there is another gene underlying the signal. The rs5030359 variant is ,800 kb downstream of a previously identified lipids signal within the LDLR region, but conditional analyses showed that the two loci are independent. Using fine-mapping in non-African populations to point to the most likely gene underlying the signal, is not possible as the SNP is very rare in Europeans, with a MAF of 0.002, and absent in our Hispanic and East Asian populations. Previously, common variants within ICAM1 were found to be associated with soluble ICAM1 (sICAM1) concentrations in Europeans [44,45]. sICAM1 has been associated with several common diseases such as diabetes, heart disease, stroke, and malaria [46,47]. sICAM1 levels were associated with progression of carotid intima media thickness in young adults [48,49] and in asymptomatic dyslipidaemia subjects [50]. Additionally, sICAM1 levels were found to be higher in Europeans than in Africans [49].

Differences in signals within lipid loci in multiple ethnicities
We were able to use the dense SNP genotyping in loci on the IBC array to analyze and compare lipid-associated loci, particularly between African Americans and Europeans. Our analyses showed multiple examples of signals that were associated with lipid levels in one ethnicity but not another (Table 1).
First, some of the strongest associated SNPs in one ethnicity may be rare or absent in other ethnicities. This is a well-established phenomenon, e.g., truncation mutations in PCSK9 that are of low frequency in African Americans and absent in individuals of European origin, that result in a robust reduction in LDL-C levels and coronary heart disease risk [51,52]. In this study we find that the majority of the observed discrepancies across ethnicities in association of SNPs with lipid traits can be attributed to differences in allele frequency. For example, rs3211938 in CD36 is much more highly associated with HDL-C in African Americans (p = 1.8610 211 ) than in Europeans (p = 0.08) with a large discrepancy in RAFs (7% vs. 0.2%).
In other loci, the strongest associated polymorphisms varied across populations, for example in the BUD13/ZNF259/APOA5 region (Table S3, Figure S2). In theory these regions could be excellent candidates for fine-mapping, but our efforts and association results could not narrow down the loci. When conducting meta-analyses across multiple ethnicities we observed that the stronger p-value association typically tracked with the higher heterogeneity I 2 values ( Figure S3). This high I 2 suggests high heterogeneity, but it could also be the effect of low sample sizes of the combined cohorts (especially for Hispanics and East Asians).
One limitation of this study is the sample size available particularly the Hispanic and the East Asian available samples and this obviously limited our ability to find new signals in these populations and to replicate many previously established lipid signals. Also, not all previously described signals for lipids were present on the IBC array, as the array was designed to densely cover genes regions, rather than the whole genome. However, using this approach we did find signals for lipids that remained uncovered using the genome-wide association approach, as both rs5030359 within ICAM1 and rs3211938 within CD36 were not present on conventional genome-wide arrays.
In conclusion, we performed dense genotyping of ,2,000 candidate genes in 7,657 African Americans, 1,315 Hispanics and 841 East Asians using IBC 50K SNP genotyping array and we found and confirmed two novel signals for lipids by replication in 7,000 African Americans. Additionally we evaluated the effect of SNPs established in European populations on lipid levels in multiethnic populations and show that most known lipid association signals span across ethnicities. However, differences between populations, especially differences in allele frequency, can be leveraged to identify novel signals. Figure S1 Quartiles based on the number of weighted risk alleles and lipid level distribution. (TIF) Figure S2 Association results of the BUD13/ZNF259/APOA5 regions with TG in multiple ethnicities. Beta's of SNPs are shown in the BUD13/ZN259/APOA5 region from association results in each ethnicity separately. 2logP-value and I 2 are from multiethnic meta-analyses. Squares mark the strongest association signals per ethnicity. The three independent signals in Europeans are depicted in green, the top signal in African Americans is shown in blue and Hispanics and East Asian meta-analyses results are in red and yellow respectively. (PDF) Figure S3 Correlation between 2logP-value and I 2 in the BUD13/ZN259/APOA5 region for TG association results. (TIF)