Abstract
Background
Taller adult height is associated with lower risks of ischemic heart disease in mendelian randomization (MR) studies, but little is known about the causal relevance of height for different subtypes of ischemic stroke. The present study examined the causal relevance of height for different subtypes of ischemic stroke.
Methods and findings
Height-associated genetic variants (up to 2,337) from previous genome-wide association studies (GWASs) were used to construct genetic instruments in different ancestral populations. Two-sample MR approaches were used to examine the associations of genetically determined height with ischemic stroke and its subtypes (cardioembolic stroke, large-artery stroke, and small-vessel stroke) in multiple ancestries (the MEGASTROKE consortium, which included genome-wide studies of stroke and stroke subtypes: 60,341 ischemic stroke cases) supported by additional cases in individuals of white British ancestry (UK Biobank [UKB]: 4,055 cases) and Chinese ancestry (China Kadoorie Biobank [CKB]: 10,297 cases). The associations of genetically determined height with established cardiovascular and other risk factors were examined in 336,750 participants from UKB and 58,277 participants from CKB. In MEGASTROKE, genetically determined height was associated with a 4% lower risk (odds ratio [OR] 0.96; 95% confidence interval [CI] 0.94, 0.99; p = 0.007) of ischemic stroke per 1 standard deviation (SD) taller height, but this masked a much stronger positive association of height with cardioembolic stroke (13% higher risk, OR 1.13 [95% CI 1.07, 1.19], p < 0.001) and stronger inverse associations with large-artery stroke (11% lower risk, OR 0.89 [0.84, 0.95], p < 0.001) and small-vessel stroke (13% lower risk, OR 0.87 [0.83, 0.92], p < 0.001). The findings in both UKB and CKB were directionally concordant with those observed in MEGASTROKE, but did not reach statistical significance: For presumed cardioembolic stroke, the ORs were 1.08 (95% CI 0.86, 1.35; p = 0.53) in UKB and 1.20 (0.77, 1.85; p = 0.43) in CKB; for other subtypes of ischemic stroke in UKB, the OR was 0.97 (95% CI 0.90, 1.05; p = 0.49); and for other nonlacunar stroke and lacunar stroke in CKB, the ORs were 0.89 (0.80, 1.00; p = 0.06) and 0.99 (0.88, 1.12; p = 0.85), respectively. 
In addition, genetically determined height was positively associated with atrial fibrillation (available only in UKB), lean body mass, and lung function, and inversely associated with low-density lipoprotein (LDL) cholesterol in both British and Chinese ancestries. Limitations of this study include potential bias from assortative mating or pleiotropic effects of genetic variants and incomplete generalizability of genetic instruments to different populations.
Conclusions
The findings provide support for a causal association of taller adult height with higher risk of cardioembolic stroke and lower risk of other ischemic stroke subtypes in diverse ancestries. Further research is needed to understand the shared biological and physical pathways underlying the associations between height and stroke risks, which could identify potential targets for treatments to prevent stroke.
Author summary
Why was this study done?
- Taller people have lower risks of ischemic stroke and heart disease, but higher risks of atrial fibrillation. However, little is known about the effects of height on the risks of different subtypes of ischemic stroke (cardioembolic stroke, large-artery stroke, and small-vessel stroke).
- Understanding the shared biological and physical pathways underlying the associations between height and stroke risks could identify potential targets for treatments to prevent stroke.
- Mean height and the rates of different stroke subtypes vary considerably across different income and ancestry populations, and, therefore, investigation across diverse ancestries is important.
What did the researchers do and find?
- We used a mendelian randomization (MR) approach to study the association between genetic variants for height and risk of ischemic stroke subtypes in populations with different ancestries.
- Genetic variants associated with taller height were associated with higher risks of cardioembolic stroke and lower risks of large-artery and small-vessel stroke.
- The findings were consistent across populations of different genetic ancestries and across different analytical methods.
What do these findings mean?
- The findings support a causal association of taller adult height with higher risks of atrial fibrillation and cardioembolic stroke and lower risks of other ischemic stroke subtypes.
- Further research is needed to clarify the biological and physical pathways underlying the associations of height with ischemic stroke subtypes, which could identify novel targets for treatments to prevent stroke.
Citation: Linden AB, Clarke R, Hammami I, Hopewell JC, Guo Y, Whiteley WN, et al. (2022) Genetic associations of adult height with risk of cardioembolic and other subtypes of ischemic stroke: A mendelian randomization study in multiple ancestries. PLoS Med 19(4): e1003967. https://doi.org/10.1371/journal.pmed.1003967
Academic Editor: Joshua Z. Willey, Columbia University, UNITED STATES
Received: January 5, 2021; Accepted: March 16, 2022; Published: April 22, 2022
Copyright: © 2022 Linden et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: MEGASTROKE data are publicly available for download at https://www.megastroke.org/download.html, on condition that researchers adhere to the rules for use of MEGASTROKE data for publications, available at https://www.megastroke.org/index.html. UK Biobank data are available on application to bona fide researchers for health-related research in the public interest. Details regarding registration and access to the UK Biobank data, including access conditions and fees, are available at https://www.ukbiobank.ac.uk/enable-your-research/register. China Kadoorie Biobank data sharing is subject to the Data Access Policy for the Nuffield Department of Population Health, University of Oxford, available at https://www.ndph.ox.ac.uk/files/about/ndph-data-access-policy-1.pdf. Data from baseline, first and second resurveys, and disease follow-up are available under the China Kadoorie Biobank Open Access Data Policy to bona fide researchers. Sharing of genotyping data is constrained by the Administrative Regulations on Human Genetic Resources of the People’s Republic of China. Access to these and certain other data is available through collaboration with China Kadoorie Biobank researchers. Full details of the China Kadoorie Biobank Data Sharing Policy are available at www.ckbiobank.org. The GIANT Consortium genome-wide association study summary statistics for height are publicly available for download at https://portals.broadinstitute.org/collaboration/giant/index.php/GIANT_consortium_data_files. Biobank Japan genome-wide association study summary statistics for height are publicly available for download at http://jenger.riken.jp/en/result.
Funding: The China Kadoorie Biobank study was supported by the Kadoorie Charitable Foundation (https://www.kadooriecharitablefoundation.com; ZC), the UK Wellcome Trust (https://wellcome.org; ZC grant numbers: 202922/Z/16/Z, 104085/Z/14/Z, 088158/Z/09/Z), the National Natural Science Foundation of China (https://www.nsfc.gov.cn/english/site_1/index.html; ZC grant numbers: 81390540, 81390541, 81390544), and the National Key Research and Development Program of China (http://en.most.gov.cn/programmes1/200610/t20061009_36224.htm; ZC grant numbers: 2016YFC0900500, 2016YFC0900501, 2016YFC0900504, 2016YFC1303904). The Clinical Trial Service Unit, University of Oxford, also acknowledges support from the UK Medical Research Council (https://mrc.ukri.org; ZC grant number: MC_UU_00017/1; SP grant number: MC_UU_00017/5), the British Heart Foundation (https://www.bhf.org.uk; RC grant number: CH/1996001/9454; JCH grant number: FS/14/55/30806), the British Heart Foundation Oxford Centre for Research Excellence (https://www.cardioscience.ox.ac.uk/bhf-centre-of-research-excellence; RC, SP and JCH grant number: RE/18/3/34214), and Cancer Research UK (https://www.cancerresearchuk.org; ZC grant number: C500/A16896). ABL was supported by the Clarendon Fund (https://www.ox.ac.uk/clarendon) and by a Nuffield Department of Population Health Early Career Research Fellowship (https://www.ndph.ox.ac.uk). The MEGASTROKE project received funding from sources specified at https://www.megastroke.org/acknowledgements.html. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: I have read the journal’s policy and the authors of this manuscript have the following competing interests: WNW was supported by the Chief Scientist Office during the conduct of the study, and by the Alzheimer’s Society, the British Heart Foundation, and the UK Stroke Association outside the submitted work.
Abbreviations: BMI, body mass index; CI, confidence interval; CKB, China Kadoorie Biobank; FEV1, forced expiratory volume in 1 second; FVC, forced vital capacity; GIANT, Genetic Investigation of Anthropometric Traits; GWAS, genome-wide association study; HDL, high-density lipoprotein; HR, hazard ratio; LD, linkage disequilibrium; LDL, low-density lipoprotein; MR, mendelian randomization; MR–PRESSO, Mendelian Randomization Pleiotropy RESidual Sum and Outlier; OR, odds ratio; SD, standard deviation; SNP, single nucleotide polymorphism; STROBE-MR, Strengthening the Reporting of Observational Studies in Epidemiology using Mendelian Randomization; TOAST, Trial of ORG 10 172 in Acute Stroke Treatment; UKB, UK Biobank
Introduction
Taller people have lower risks of atherosclerotic diseases, ischemic stroke, and heart disease, but higher risks of atrial fibrillation and venous thromboembolism [1–3]. The associations of height with ischemic stroke subtypes have not been reported, but it would be of interest to know whether these vary between atherosclerotic and cardioembolic stroke subtypes. In observational studies, any such associations could reflect confounding by socioeconomic status or other known or unknown correlates of height that are risk factors for cardiovascular diseases. Alternatively, the associations could be causal and could possibly be mediated through physical effects of height on body structure (including lean body mass or lung function) [4–7].
Increasingly, mendelian randomization (MR) analyses have been used to assess the causal relevance of risk factors for diseases by using genetic variants associated with risk factors of interest as instrumental variables [8]. The allocation of genetic variants to gametes (and hence offspring) is determined randomly at meiosis. Therefore, the random distribution of variants for a trait, such as height, between individuals can be used to minimize the effects of confounding by risk factors and provide support for the causal relevance of the trait for disease outcomes. Previous MR studies have reported that genetically determined differences in adult height were inversely associated with ischemic heart disease [4] and hypertension [2], but positively associated with atrial fibrillation [2,3], venous thromboembolism [2], and vasculitis [2]. However, the associations of genetically determined height with ischemic stroke and ischemic stroke subtypes have not been reliably established as previous studies have focused analyses on total stroke rather than on individual stroke pathological types and their main subtypes [2,9].
The present study examined the observational and genetic associations (using MR approaches) of height with (i) ischemic stroke and subtypes of ischemic stroke in the MEGASTROKE consortium (an international collaboration on the genetics of stroke) and in 2 large prospective studies conducted in the United Kingdom and China [10,11]; and (ii) established cardiovascular risk factors and anthropometric traits in the 2 large prospective studies.
Methods
This study is reported using the Strengthening the Reporting of Observational Studies in Epidemiology using Mendelian Randomization (STROBE-MR) [12] guideline (S1 Checklist). The study did not have a prospective protocol or published analysis plan. Analyses were planned prior to study initiation, but some were subsequently revised to reflect availability of new data or in response to reviewer comments (S1 Methods).
MEGASTROKE
MEGASTROKE consortium data included 29 genome-wide studies of stroke and stroke subtypes [13]. Ischemic stroke cases were defined using standard diagnostic criteria based on clinical and imaging findings and were further classified into subtypes using the Trial of ORG 10 172 in Acute Stroke Treatment (TOAST) criteria [13,14]. Analyses were conducted using meta-analyzed, heterogeneity-filtered summary results from multiple ancestries (60,341 ischemic stroke cases—including 9,006 cardioembolic stroke, 6,688 large-artery atherosclerotic stroke, and 11,710 small-vessel stroke subtypes—and up to 454,450 controls) and separately for the subset of Europeans (34,217 ischemic stroke cases) [13]. Summary results for separate non-European ancestries were not made available by the consortium.
UK Biobank
The UK Biobank (UKB) is a prospective study of 502,506 men and women, aged 40 to 69 years living in the UK, who were enrolled between 2006 and 2010 [10,15]. All participants provided written informed consent to participate in a study defined by a protocol approved by the North West Multi-centre Research Ethics Committee on May 10, 2016 (reference: 16/NW/0274). Details of the study methods and baseline characteristics have been previously reported (S2 Methods) [10,15]. Participants were followed up for a mean of 8 years through linkage to death registries and hospital admission records. Criteria for diagnosis of ischemic stroke cases (ICD-10: I63) were prespecified and included both cases recorded prior to enrollment and incident cases recorded during follow-up (S2 Methods). Ischemic stroke cases with a history of atrial fibrillation, based on either a self-reported diagnosis at baseline or an admission to hospital (ICD-10: I48) prior to onset of the stroke, were classified as having presumed cardioembolic stroke (S2 Methods). The remaining noncardioembolic ischemic stroke cases were classified as other subtypes of ischemic stroke. Genotyping using Affymetrix arrays with imputation into multiple reference panels was available for 483,420 participants passing quality control (S2 Methods). After exclusions for non-white British ancestry (n = 78,674) and relatedness (n = 67,201; kinship coefficient ≥0.125), a total of 336,750 UKB participants were included in the present genetic analyses (S1 Fig).
China Kadoorie Biobank
The China Kadoorie Biobank (CKB) is a prospective study of 513,214 men and women, aged 30 to 79 years, who were enrolled from 10 (5 urban and 5 rural) geographically defined regions of China between 2004 and 2008 [11]. All participants provided written informed consent to participate in a study defined by a protocol that was approved by the Oxford Tropical Research Ethics Committee on February 3, 2005 (reference: 025–04) and by the Ethics Review Committee of the Chinese Center for Disease Control and Prevention on July 8, 2004 (approval notice: 005/2004). Details of the study methods and baseline characteristics have been previously reported (S3 Methods) [11]. Compared to participants in UKB, those in CKB were on average 5 years younger (mean age 51.6 [standard deviation (SD) 10.6] versus 56.4 [8.1] years) and were less highly educated (S1 Table). Participants were followed up for a mean of 10 years through linkages to death and stroke registries and health insurance claims records. Adjudication of stroke was undertaken by review of clinical findings from medical records and brain imaging reports (available for >92% of stroke cases with retrieved records) by specialist clinicians using a defined protocol (S3 Methods). Presumed cardioembolic strokes were identified from confirmed ischemic stroke cases based on the Causative Classification System criteria [16]. Other confirmed ischemic stroke cases were further classified by brain infarct size into lacunar and other nonlacunar stroke subtypes. Data on atrial fibrillation were not systematically recorded at baseline or during follow-up in CKB, but electrocardiographic evidence of atrial fibrillation and other major and minor sources of cardioembolism were recorded by adjudicating physicians. 
Genotyping using Affymetrix arrays with imputation into the 1000 Genomes reference panel (S3 Methods) was available for 100,706 participants passing quality control, comprising a sample of 76,020 participants selected to be representative of the CKB population [17] and an additional 24,686 selected for nested case–control studies of incident cardiovascular or respiratory disease (S3 Methods). After relatedness exclusions (n = 28,233; kinship coefficient >0.05), the present genetic analyses involved 58,277 CKB participants (53,346 from the population-based subset and 4,931 additional ischemic stroke cases included only in analyses of ischemic stroke outcomes; S2 Fig).
Height
Participants in CKB were on average shorter (10 cm in men, 8 cm in women; S1 Table) than those in UKB, and the SDs of directly measured height in UKB and CKB, respectively, were 6.8 cm and 6.5 cm in men, and 6.3 cm and 6.0 cm in women. Separately in UKB and CKB, following the methodology used in the Genetic Investigation of Anthropometric Traits (GIANT) consortium, a measured height phenotype was constructed: Within strata by sex (and by region in CKB), directly measured height (S2 and S3 Methods) was adjusted for age and age², and the residuals were transformed using an inverse normal transformation, yielding a measured height phenotype in study- and sex-specific SD units. This transformed height phenotype (referred to as “height” or “measured height”) was used for all analyses (unless “directly measured” is explicitly stated).
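The transformation step described above can be illustrated with a minimal sketch. It assumes the age and age² adjustment has already been carried out within one sex (and region) stratum, so that only residuals remain; the function name, the example residuals, and the Blom rank offset of 3/8 are illustrative assumptions, not details taken from the study.

```python
from statistics import NormalDist


def inverse_normal_transform(residuals, offset=0.375):
    """Rank-based inverse normal transformation within one stratum.

    `offset` = 3/8 is the Blom constant (an assumption; the study does
    not state which rank offset was used). Ties share their mean rank.
    """
    n = len(residuals)
    order = sorted(range(n), key=lambda i: residuals[i])
    ranks = [0.0] * n
    i = 0
    while i < n:
        j = i
        # Extend j over any run of tied values.
        while j + 1 < n and residuals[order[j + 1]] == residuals[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1  # 1-based average rank of the run
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    std_normal = NormalDist()
    # Convert ranks to quantiles, then to standard normal scores (SD units).
    return [std_normal.inv_cdf((r - offset) / (n - 2 * offset + 1)) for r in ranks]


# Hypothetical height residuals (cm) for five people in one stratum.
residuals_cm = [-3.2, 0.4, 7.1, -0.9, 1.5]
scores = inverse_normal_transform(residuals_cm)
```

Because the scores depend only on ranks within the stratum, the resulting phenotype is in SD units regardless of the spread of directly measured height in that stratum.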
Blood pressure, blood lipids, and other anthropometric traits
Systolic and diastolic blood pressure were measured using standard instruments and protocols. Blood lipids (low-density lipoprotein [LDL] cholesterol, high-density lipoprotein [HDL] cholesterol, triglycerides, and apolipoprotein B) [17] were available in a subset of CKB participants (S2 Fig). Fat body mass was estimated as weight multiplied by percentage body fat measured by bio-impedance (S2 and S3 Methods). Lean body mass was estimated as weight minus fat body mass. Lung function measures (forced vital capacity [FVC] and forced expiratory volume in 1 second [FEV1]) were restricted to those with reliable values (S2 and S3 Methods, S2 Fig). Compared with UKB participants, those in CKB had lower mean levels of systolic blood pressure (7.3 mm Hg), diastolic blood pressure (4.7 mm Hg), LDL cholesterol (1.2 mmol/L), HDL cholesterol (0.3 mmol/L), apolipoprotein B (0.2 g/L), body mass index (BMI; 4 kg/m² in men and 3 kg/m² in women), and lean body mass (14 kg in men and 7 kg in women), but higher mean levels of triglycerides (0.3 mmol/L; S1 Table).
Instruments for genetically determined height
Genetic instruments for a 2-sample MR approach were constructed separately for MEGASTROKE, UKB, and CKB, due to differences in ancestry and overlap in participants in genome-wide association studies (GWASs) of height. For MEGASTROKE, height-associated single nucleotide polymorphisms (SNPs) from the GIANT GWAS report in 2018 [18] (which also included data from the whole of UKB) were used for both multiple and European ancestry analyses (S2 Table). For UKB, the genetic instrument was constructed from height-associated SNPs obtained from an earlier (2014) GIANT study that was independent of UKB [19]. For CKB, both the European ancestry–based GIANT GWAS (2018) [18] and a smaller GWAS from Biobank Japan [20], involving participants of East Asian ancestry, were used to optimize the genetic instrument for height by benefitting from a larger discovery population and a more proximal genetic ancestry [21,22].
The SNPs selected from these GWASs (together with their published single-variant effect sizes on height) were those associated with height at genome-wide significance and also available in MEGASTROKE, UKB, or CKB (S4 Methods). The SNPs from each GWAS were linkage disequilibrium (LD) pruned (r² < 0.05) using LD estimates from UKB for GIANT and from CKB for Biobank Japan (i.e., where r² between SNPs was ≥0.05, the SNP with the lowest p-value for association with height in the GWAS was retained). Palindromic SNPs were validated by comparing allele frequencies for individual participant data (UKB and CKB). For MEGASTROKE, palindromic SNPs were replaced with high LD proxies (r² > 0.9).
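The pruning rule described above (retain the SNP with the lowest p-value whenever r² between SNPs is ≥0.05) can be sketched as a greedy pass over SNPs ordered by p-value. This is an illustration under stated assumptions, not the authors' pipeline: the SNP identifiers, p-values, and r² values are hypothetical, and SNP pairs missing from the LD table are assumed to be unlinked.

```python
def ld_prune(p_values, r2, threshold=0.05):
    """Greedy LD pruning.

    p_values: dict mapping SNP id -> GWAS p-value for height.
    r2: dict mapping frozenset({snp_a, snp_b}) -> pairwise r-squared.
    A SNP is kept only if its r-squared with every SNP already kept is
    below `threshold`; pairs absent from `r2` are treated as r2 = 0.
    """
    kept = []
    for snp in sorted(p_values, key=p_values.get):  # lowest p-value first
        if all(r2.get(frozenset((snp, other)), 0.0) < threshold for other in kept):
            kept.append(snp)
    return kept


# Hypothetical genome-wide significant SNPs and pairwise LD estimates.
p_values = {"snpA": 1e-30, "snpB": 1e-12, "snpC": 1e-9, "snpD": 3e-8}
r2 = {
    frozenset(("snpA", "snpB")): 0.40,
    frozenset(("snpB", "snpC")): 0.20,
    frozenset(("snpC", "snpD")): 0.01,
}
pruned = ld_prune(p_values, r2)
```

In this toy input, snpB is dropped because of its LD with the more strongly associated snpA, while snpC survives because the only SNP it was in LD with has already been removed.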
After LD pruning, 641 height-associated SNPs from GIANT were available for analysis in UKB (S4 Methods). Likewise, 2,337 height-associated SNPs from GIANT (European ancestry) and 517 SNPs from Biobank Japan (East Asian ancestry) were available for analysis in CKB. In MEGASTROKE, after LD pruning (at r² < 0.05) and replacing palindromic SNPs with proxies, the number of height-associated SNPs from GIANT remaining for analysis in each of the multiple ancestry summary data sets was 2,265 for ischemic stroke, 2,270 for cardioembolic and large-artery stroke, and 2,084 for small-vessel stroke. The SNPs used in MEGASTROKE, UKB, and CKB are listed in S1–S3 Data Tables.
For UKB and CKB, genetic risk scores for each individual were constructed as the sum of the numbers of height-associated effect alleles, each weighted by its published single-variant effect size on height (S4 Methods, S2 and S3 Data Tables). For CKB, the genetic risk score was the simple average of weighted genetic risk scores constructed from 2,337 GIANT (2018) [18] and 517 Biobank Japan [20] height-associated SNPs (other percentages of the 2 genetic risk scores, including either score alone, were assessed in sensitivity analyses but had less explanatory power; S3 Table). The effects of SNPs on height in UKB and CKB, estimated separately for each SNP using linear regression adjusted for age, age², sex, region (in CKB only), genomic principal components (40 in UKB and 14 in CKB), and genotyping array type, were also compared with the published effect sizes on height.
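As a minimal sketch of the score construction described above, the weighted genetic risk score for one individual is the sum over SNPs of effect-allele count times published effect size, and the CKB score is an equal-weight average of two such scores. The SNP identifiers, allele counts, effect sizes, and the second score value below are hypothetical, and the function names are illustrative only.

```python
def weighted_score(allele_counts, weights):
    """Sum of effect-allele counts (0 to 2, or imputed dosages), each
    multiplied by the published per-allele effect size on height."""
    return sum(weights[snp] * allele_counts[snp] for snp in weights)


def combined_score(score_a, score_b, share_a=0.5):
    """Weighted mix of two scores; share_a = 0.5 gives the simple average."""
    return share_a * score_a + (1.0 - share_a) * score_b


# Hypothetical effect sizes (SD of height per effect allele) and genotypes.
weights = {"rs_a": 0.03, "rs_b": 0.05, "rs_c": 0.02}
person = {"rs_a": 2, "rs_b": 1, "rs_c": 0}

score_1 = weighted_score(person, weights)  # 2*0.03 + 1*0.05 + 0*0.02
score_2 = 0.31                             # hypothetical second score
average = combined_score(score_1, score_2)
```

Changing `share_a` away from 0.5 corresponds to the other percentages of the 2 scores that the text says were assessed in sensitivity analyses.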
The genetic risk score in UKB explained 17.0% of the variance of height (S4 Methods, S3 Table), and the effect sizes of the SNPs in UKB were highly correlated with the effect sizes in the source GWAS [19] (r = 0.96; Fig 1). In CKB, the genetic risk scores from GIANT, Biobank Japan, and the average genetic risk score, respectively, explained 11.4%, 11.0%, and 15.2% of the variance of height (S3 Table). SNP effect sizes in CKB were less strongly correlated with effect sizes in GIANT (r = 0.65) [18], but were more strongly correlated with effect sizes in Biobank Japan (r = 0.90; Fig 1). One unit of the respective genetic risk score was associated with 0.91 SD of measured height in UKB and 1.05 SD in CKB.
For UKB (336,750 participants), the effects on height were estimated for 641 SNPs from GIANT (2014) [19]. For CKB (53,346 participants), the effects on height were estimated for 2,189/2,337 SNPs from GIANT (2018) [18] and 499/517 SNPs from Biobank Japan [20] with minor allele frequency ≥0.005 in CKB. The effect sizes on height were adjusted for age, age², sex, region (in CKB only), genomic principal components, and genotyping array type. SNPs with minor allele frequencies of <0.005 are not shown. In UKB, the genetic risk score explained 17.0% of the variance of height and, in CKB, the genetic risk scores from GIANT (2018) [18], Biobank Japan [20], and the average genetic risk score, respectively, explained 11.4%, 11.0%, and 15.2% of the variance of height. CKB, China Kadoorie Biobank; GIANT, Genetic Investigation of Anthropometric Traits; SD, standard deviation; SNP, single nucleotide polymorphism; UKB, UK Biobank.
Genetic analyses
Since only GWAS summary results on stroke were available from MEGASTROKE [13] (and not individual participant data), causal effects were estimated by inverse-variance–weighted random-effects SNP-level meta-analysis [23] (S5 Methods, S3 and S4 Figs, S4 Data Table). For UKB and CKB, individual participant data were used to construct genetic risk scores for each individual, and the ratio method for single instruments was applied to estimate the genetically instrumented causal effects on outcomes per 1 SD of measured height. When using the ratio method, the second-order variance term that is formally used in an instrumental variable estimate was ignored because the contribution from this term would be negligible given the strength (large F-statistics) of the instruments [23,24]. Specifically, logistic regression was used to assess associations of each genetic risk score with the stroke outcomes (after adjustment for age, age², sex, region in CKB, genomic principal components, and genotyping array type). Subsequently, the coefficients from these regressions were divided by the regression coefficient of measured height on the genetic risk score (0.91 SD of measured height in UKB and 1.05 SD in CKB) to estimate the causal effects [23]. The genetic instruments used in the different populations were all strongly associated with height (F-statistic of 69,096 for UKB and 9,589 for CKB and an average F-statistic of 109 per genetic variant in MEGASTROKE). All effects presented as associations of genetically determined height are the instrumented effects per 1 SD higher measured level of height (S3 Fig).
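The two estimators described above can be sketched as follows. The ratio estimate divides the log odds ratio per unit of genetic risk score by the score's effect on measured height, ignoring the second-order variance term as in the text. The inverse-variance–weighted estimate is written here as a weighted regression through the origin with a multiplicative random-effects correction, which is one common formulation and an assumption about the exact variant used. All numerical inputs other than the 0.91 divisor are hypothetical.

```python
import math


def ratio_estimate(beta_outcome, se_outcome, beta_exposure):
    """First-order ratio estimate and standard error (log OR per 1 SD)."""
    return beta_outcome / beta_exposure, se_outcome / abs(beta_exposure)


def ivw_random_effects(beta_x, beta_y, se_y):
    """IVW slope through the origin; the SE is inflated by the residual
    heterogeneity factor when that factor exceeds 1."""
    weights = [1.0 / s ** 2 for s in se_y]
    sxx = sum(w * bx * bx for w, bx in zip(weights, beta_x))
    sxy = sum(w * bx * by for w, bx, by in zip(weights, beta_x, beta_y))
    slope = sxy / sxx
    q = sum(w * (by - slope * bx) ** 2
            for w, bx, by in zip(weights, beta_x, beta_y))
    phi = max(1.0, q / (len(beta_x) - 1))
    return slope, math.sqrt(phi / sxx)


def to_odds_ratio(log_or, se, z=1.96):
    """Return (OR, lower 95% limit, upper 95% limit)."""
    return tuple(math.exp(v) for v in (log_or, log_or - z * se, log_or + z * se))


# Hypothetical score-outcome association, divided by 0.91 SD per score unit.
log_or, se = ratio_estimate(-0.05, 0.03, 0.91)
or_per_sd = to_odds_ratio(log_or, se)

# Hypothetical per-SNP effects on height (SD) and on stroke (log OR).
slope, slope_se = ivw_random_effects(
    [0.02, 0.03, 0.05], [0.004, 0.006, 0.010], [0.01, 0.01, 0.02]
)
```

Dropping the second-order term means the standard error is simply rescaled by the same divisor as the estimate, which is why very strong instruments are needed for the approximation to hold.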
To investigate the potential for factors to contribute to pleiotropy, cross-sectional associations of genetically determined height with established cardiovascular risk factors and anthropometric traits were assessed in UKB and CKB using linear or logistic regression as appropriate, with adjustment for age, sex, region in CKB, genomic principal components, and genotyping array type. For these cross-sectional associations, anthropometric traits and lung function were standardized (by dividing by their SD within each sex) in the UKB and CKB populations. The ratio method was then applied to regression results and, as for the disease outcomes, the genetically instrumented effects were presented. As t-statistics closely approximate z-statistics in large samples, they are referred to as z-statistics in this report. These were used to assess the strength and direction of the associations of height with cardiovascular and anthropometric factors because they permit comparisons up to about ±500, which is beyond the convenient range for p-values (z-statistics of ±1.96 and of ±37 correspond to 2p = 0.05 and 2p ≈ 1 × 10⁻³⁰⁰, respectively).
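The correspondence between z-statistics and two-sided p-values mentioned above can be checked with a short sketch. For moderate z the exact complementary error function is used; for very large z the p-value is computed on the log10 scale with a leading-order normal tail approximation, since values near z = 500 are far below what a floating-point number can hold. Both function names are illustrative.

```python
import math


def two_sided_p(z):
    """Exact two-sided normal p-value; suitable for moderate |z|."""
    return math.erfc(abs(z) / math.sqrt(2.0))


def log10_two_sided_p(z):
    """log10 of the two-sided p-value for large |z|.

    Uses the leading-order tail approximation P(Z > z) ~ pdf(z) / z,
    which is an approximation and only accurate when |z| is large.
    """
    z = abs(z)
    ln_p = math.log(2.0) - 0.5 * z * z - math.log(z) - 0.5 * math.log(2.0 * math.pi)
    return ln_p / math.log(10.0)


p_at_196 = two_sided_p(1.96)           # close to 0.05
log10_p_at_37 = log10_two_sided_p(37)  # close to -300 on the log10 scale
log10_p_at_500 = log10_two_sided_p(500)
```

Working with z-statistics directly avoids this underflow problem altogether, which is the practical reason for reporting them in place of p-values.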
Sensitivity analyses
As MR inference relies on various assumptions (including instrumental variable assumptions) [24], additional sensitivity analyses in MEGASTROKE included weighted median analyses, MR–Egger analyses to assess any possible pleiotropic effects of height on other factors, and Mendelian Randomization Pleiotropy RESidual Sum and Outlier (MR–PRESSO) analyses to correct for pleiotropy, if any, by removal of outliers (S6 Methods) [25]. As there is some overlap of the populations in MEGASTROKE with those in GIANT (2018) [18] (S6 Methods) but not with UKB, the sensitivity analyses were repeated using effect sizes on height estimated in UKB. A further sensitivity analysis excluded SNPs that were associated at p < 0.001 in the large pan-ancestry UKB GWAS analyses [26] with age at completion of education, diabetes, atrial fibrillation, hypertension, systolic blood pressure, diastolic blood pressure, LDL cholesterol, HDL cholesterol, triglycerides, or apolipoprotein B (S6 Methods). An additional sensitivity analysis in MEGASTROKE used more stringent pruning criteria (r² < 0.001) for SNP inclusion to provide greater comparability with recent literature. In CKB, the analyses of genetically determined height with ischemic stroke subtypes were repeated using separate genetic instruments constructed from GIANT (2018) [18] SNPs and from Biobank Japan [20] SNPs.
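To make the MR–Egger idea concrete, the sketch below fits a weighted regression of per-SNP outcome effects on per-SNP height effects with a free intercept; an intercept away from zero is the usual signal of directional pleiotropy. It returns point estimates only (no standard errors or tests), follows the common convention of orienting each SNP to its height-increasing allele, and uses hypothetical inputs, so it should be read as an illustration and not as the procedure in S6 Methods.

```python
def mr_egger(beta_x, beta_y, se_y):
    """Weighted least squares of beta_y on beta_x with an intercept.

    Weights are 1 / se_y**2. Each SNP is first oriented so that its
    effect on the exposure is positive. Returns (intercept, slope).
    """
    xs = [abs(bx) for bx in beta_x]
    ys = [by if bx >= 0 else -by for bx, by in zip(beta_x, beta_y)]
    w = [1.0 / s ** 2 for s in se_y]
    sw = sum(w)
    mean_x = sum(wi * x for wi, x in zip(w, xs)) / sw
    mean_y = sum(wi * y for wi, y in zip(w, ys)) / sw
    sxx = sum(wi * (x - mean_x) ** 2 for wi, x in zip(w, xs))
    sxy = sum(wi * (x - mean_x) * (y - mean_y) for wi, x, y in zip(w, xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope


# Hypothetical SNP effects built to have intercept 0.01 and slope 0.2;
# the third SNP is reported for its height-lowering allele.
beta_x = [0.02, 0.05, -0.04, 0.08]
beta_y = [0.014, 0.020, -0.018, 0.026]
se_y = [0.01, 0.02, 0.01, 0.015]
intercept, slope = mr_egger(beta_x, beta_y, se_y)
```

Forcing the intercept to zero in this regression recovers the inverse-variance–weighted slope, so comparing the two fits shows how much an estimate depends on the no-directional-pleiotropy assumption.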
Observational analyses
Observational analyses were restricted to participants with no prior history of ischemic heart disease or stroke in UKB (S1 Fig) and CKB (S2 Fig, S7 Methods). Hazard ratios (HRs) for the associations of measured height (grouped and as a linear term) with incident ischemic stroke and ischemic stroke subtypes postrecruitment were estimated by Cox regressions stratified by age at risk (in 5-year groups), sex, and region (10 regions in CKB), with adjustment for possible baseline confounders (S7 Methods). Cross-sectional associations of measured height with cardiovascular and anthropometric factors at baseline were assessed using linear or logistic regression as appropriate and adjusted for age (in 5-year groups), sex, year of birth, and region in CKB. All statistical analyses were conducted in SAS (version 9.4) and R (version 3.3.3) and are available upon request.
Results
Genetically determined height was inversely associated with ischemic stroke in MEGASTROKE in both multiple ancestries (odds ratio [OR] per 1 SD taller height: 0.96; 95% confidence interval [CI]: 0.94, 0.99; p = 0.007; n = 60,341 cases) and the European ancestry subset (0.96 [0.93, 0.99]; p = 0.02; n = 34,217; Fig 2). The genetic associations with ischemic stroke in UKB (OR: 0.98 [95% CI 0.91, 1.06]; p = 0.66; n = 4,055) and CKB (0.94 [0.88, 1.00]; p = 0.05; n = 10,297) were also consistent with the results in MEGASTROKE (Fig 2). However, the results for overall ischemic stroke masked directionally opposing associations with different subtypes of ischemic stroke.
The numbers of events reported for MEGASTROKE were the maximum number of cases available in the genetic summary data. In MEGASTROKE and CKB, “All ischemic stroke” includes additional unsubtyped ischemic strokes. For UKB and CKB, respectively, the SDs of directly measured height were 6.8 cm versus 6.5 cm for men and 6.3 cm versus 6.0 cm for women. Genetic associations in UKB and CKB were adjusted for age, age², sex, region (in CKB only), genomic principal components, and genotyping array type, and observational associations were stratified by age at risk (in 5-year groups), sex, and region (in CKB only) and adjusted for additional potential confounders (S6 Methods). CI, confidence interval; CKB, China Kadoorie Biobank; SD, standard deviation; UKB, UK Biobank.
In MEGASTROKE, genetically determined height was positively associated with cardioembolic stroke (OR per 1 SD taller height: 1.13 [95% CI 1.07, 1.19]; p < 0.001; n = 9,006), but was inversely associated with large-artery stroke (0.89 [0.84, 0.95]; p < 0.001; n = 6,688) and small-vessel stroke (0.87 [0.83, 0.92]; p < 0.001; n = 11,710) in multiple ancestries, and these associations were similar in the European ancestry subset (Fig 2). The findings in both UKB and CKB were directionally concordant with the associations observed in MEGASTROKE, but did not reach statistical significance: For presumed cardioembolic stroke, the ORs were 1.08 (95% CI 0.86, 1.35; p = 0.53; n = 454 cases) in UKB and 1.20 (0.77, 1.85; p = 0.43; n = 133 cases) in CKB; for other subtypes of ischemic stroke, the corresponding OR was 0.97 (95% CI 0.90, 1.05; p = 0.49; n = 3,601) in UKB, while in CKB, the ORs were 0.89 (0.80, 1.00; p = 0.06; n = 2,205) for other nonlacunar stroke and 0.99 (0.88, 1.12; p = 0.85; n = 2,138) for lacunar stroke (Fig 2, S4 Table).
Sensitivity analyses in MEGASTROKE also yielded concordant estimates irrespective of the methodology used for estimation, which included the weighted median method, MR–Egger, and MR–PRESSO (S5 Table). Importantly, there was no evidence of directional pleiotropy for ischemic stroke or its subtypes (p > 0.08 for nonzero MR–Egger intercepts). The MR–PRESSO analyses identified only a few outlying SNPs (n ≤ 4), and their exclusion had no impact on the causal estimates. MR results remained similar when a restricted genetic instrument was used that consisted of the 1,515 SNPs (67%) not associated at p < 0.001 with potentially pleiotropic risk factors for stroke (S6 Table). There was no evidence of bias due to sample overlap as the causal estimates based on UKB effect sizes on height were largely unchanged. In addition, the application of a stricter level of LD pruning (r² < 0.001) had little impact on the causal estimates (S6 Table). In CKB, sensitivity analyses of the component genetic instruments for height yielded similar results to the combined instrument in the main analyses (S7 Table).
Taller measured height was inversely and log-linearly associated with risk of ischemic stroke in both UKB (HR per 1 SD taller measured height: 0.98 [95% CI 0.95, 1.02]; p = 0.33; n = 3,698) and CKB (0.96 [0.95, 0.97]; p < 0.001; n = 37,947), although the association was not statistically significant in UKB (Fig 3). The associations of measured height with ischemic stroke subtypes in UKB and CKB were statistically significant (except for presumed cardioembolic stroke in CKB) and similar to the genetic associations in MEGASTROKE in terms of direction: For presumed cardioembolic stroke, the HRs were 1.17 (95% CI 1.07, 1.28; p < 0.001; n = 495 cases) in UKB and 1.09 (0.99, 1.28; p = 0.09; n = 410 cases) in CKB; for other subtypes of ischemic stroke in UKB, the HR was 0.96 (95% CI 0.92, 0.99; p = 0.02; n = 3,203); and for other nonlacunar stroke and lacunar stroke in CKB, they were 0.93 (0.91, 0.96; p < 0.001; n = 7,503) and 0.96 (0.94, 0.99; p = 0.002; n = 6,840), respectively (Fig 2, S8 Table).
In UKB, the category “Other ischemic stroke subtypes” includes all ischemic strokes not classified as “Presumed cardioembolic stroke,” whereas in CKB, the category includes all subtyped ischemic strokes not classified as “Presumed cardioembolic stroke.” For UKB (482,928 participants) and CKB (490,067 participants), respectively, the SDs of directly measured height were 6.8 cm versus 6.5 cm for men and 6.3 cm versus 6.0 cm for women. HRs were stratified by age at risk (in 5-year groups), sex, and region (in CKB only) and adjusted for additional potential confounders (S6 Methods). Tenths of measured height were used to examine the shape of the associations of height with ischemic stroke subtypes, except for presumed cardioembolic stroke where thirds were used due to the lower number of cases. When tenths of height were plotted, consecutive pairs of the middle 6 tenths were combined (to give 7 groups). HRs were presented as floating absolute risks relative to the middle height category (whereby standard errors were assigned approximately independently to each category to avoid restricting comparisons to any arbitrary reference groups). CI, confidence interval; CKB, China Kadoorie Biobank; HR, hazard ratio; SD, standard deviation; UKB, UK Biobank.
The associations of genetically determined and measured height with established cardiovascular risk factors, anthropometric traits, and education are shown in Tables 1 and 2 and S9 Table. Almost all of the associations between genetically determined height and risk factors were directionally concordant and broadly consistent between UKB and CKB, the exceptions being the following: diabetes, where the CIs were wide and overlapped; smoking, which was not associated in either population; and tertiary education, which was positively associated with genetically determined height in UKB but not associated in CKB (Table 1, S9 Table; the generally lower z-statistics in the genetic comparisons in CKB reflect the smaller number of participants studied). Both genetically determined and measured height were strongly associated with lean body mass (in UKB, 0.5 to 0.6 SD higher lean body mass per 1 SD taller genetically determined height, z = 98 [p < 0.001] in men, z = 87 [p < 0.001] in women) and with lung function (0.3 to 0.4 SD higher FEV1 or FVC, z = 50 to 65 [p < 0.001]).
Genetically determined taller height was also associated with lower levels of LDL cholesterol, HDL cholesterol, and blood pressure in UKB and with lower, but not statistically significant, levels in CKB; however, the estimated effect sizes on blood pressure were greater in UKB than in CKB and the CIs of the estimates did not overlap (−1.13 mm Hg [95% CI −1.27, −0.98; p < 0.001] versus −0.14 mm Hg [95% CI −0.57, 0.29; p = 0.55]). In UKB, the findings for measured and genetically determined height with systolic blood pressure were highly consistent (Tables 1 and 2), but in CKB, measured height was positively, rather than inversely, associated with systolic blood pressure, suggesting that this association might reflect confounding in CKB. Both genetically determined and measured height were strongly positively associated with atrial fibrillation at baseline (available only in UKB) with ORs per 1 SD taller height of 1.33 (95% CI 1.25, 1.42; p < 0.001) and 1.31 (1.28, 1.34; p < 0.001), respectively (Tables 1 and 2).
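The difference between the UKB and CKB blood pressure estimates quoted above can be checked with a simple calculation. The sketch below recovers each standard error from its 95% CI and forms a z-statistic for the difference. It assumes normally distributed estimates, symmetric CIs, and independent samples (reasonable here, since UKB and CKB are separate cohorts). The helper names are hypothetical and this calculation is not part of the reported analyses.

```python
from math import sqrt
from statistics import NormalDist

Z975 = NormalDist().inv_cdf(0.975)  # approximately 1.96


def se_from_ci(lower, upper):
    """Standard error implied by a symmetric 95% confidence interval."""
    return (upper - lower) / (2 * Z975)


def difference_z(est1, ci1, est2, ci2):
    """z-statistic and two-sided p value for est1 - est2,
    assuming the two estimates are independent."""
    se1 = se_from_ci(*ci1)
    se2 = se_from_ci(*ci2)
    z = (est1 - est2) / sqrt(se1 ** 2 + se2 ** 2)
    p = 2 * NormalDist().cdf(-abs(z))
    return z, p


# Blood pressure difference (mm Hg) per 1 SD taller genetically determined height.
z, p = difference_z(-1.13, (-1.27, -0.98), -0.14, (-0.57, 0.29))
print(f"z = {z:.2f}, two-sided p = {p:.2g}")
```

A large absolute z-statistic here agrees with the observation that the two CIs do not overlap.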
Discussion
In this large MR study of height and ischemic stroke, there were modest inverse associations of both genetically determined and measured height with overall ischemic stroke in populations from multiple ancestries. However, these masked much stronger directionally opposing associations of height with cardioembolic versus other ischemic stroke subtypes. In MEGASTROKE (multiple ancestries), a 1 SD genetically determined taller height was associated with 13% higher risk (OR 1.13 [95% CI 1.07, 1.19]; p < 0.001) of cardioembolic stroke, but with 11% lower (OR 0.89 [0.84, 0.95]; p < 0.001) and 13% lower (OR 0.87 [0.83, 0.92]; p < 0.001) risks of large-artery stroke and small-vessel stroke, respectively. In UKB and CKB, the different associations of measured height with ischemic stroke subtypes were concordant with those in MEGASTROKE. However, the genetic associations in UKB and CKB, although consistent, had less power to reliably demonstrate differences between the different ischemic stroke subtypes. Nevertheless, the similar findings from observational and MR approaches across 3 different populations provide support for height being causally related to ischemic stroke subtypes.
To the best of our knowledge, this is the first large genetic study to examine the associations of height with ischemic stroke subtypes, and it included populations of multiple ancestries. A previous study reported an OR of 0.88 (95% CI 0.82, 0.95) per 1 SD taller genetically determined height with ischemic heart disease [4], which is similar to the association with large-artery stroke in the present study and could reflect a shared underlying process affecting height and atherosclerosis. The present study used MR approaches that minimize the residual confounding and reverse causality that can bias observational studies. Furthermore, in a range of MR sensitivity analyses, the findings remained consistent irrespective of the methodology used for estimation, and there was no evidence to support any major influence of horizontal pleiotropy. For example, the associations of genetically determined height with the stroke subtypes remained similar when SNPs most strongly associated (at p < 0.001) with length of education, LDL cholesterol, blood pressure, and other cardiovascular risk factors were excluded from the genetic instrument.
The modest impact of excluding SNPs most strongly associated with cardiovascular risk factors suggests that any mediating effect of such traits is likely to be low. However, LDL cholesterol has previously been shown to be causally associated with increased risk of ischemic stroke in populations of both European and Chinese ancestries [21], with the strongest association observed with large-artery stroke and little association seen with cardioembolic stroke [27]. Thus, the inverse association of genetically determined height with LDL cholesterol levels in both UKB and CKB could explain some of the inverse associations of height with large-artery stroke and, to a lesser extent, with small-vessel stroke, although the mechanism by which height might cause this is unclear. Genetically determined taller height was also associated with lower mean levels of blood pressure in both studies (about 1 mm Hg lower in UKB, but only 0.1 mm Hg in CKB; Table 1); based on the UKB effect, this would be expected to translate to about 3% proportional lower risk of ischemic stroke and 2% to 5% proportional lower risk of each ischemic stroke subtype [28]. By contrast with the consistency of the genetic associations, the observational associations were not as consistent between UKB and CKB, possibly reflecting differences in residual confounding in the observational analyses (e.g., by socioeconomic factors, as blood pressure and height are positively correlated with income in China [29]) or reverse causality (e.g., due to LDL-lowering medication), illustrating the advantage of MR analyses.
The associations of height with ischemic stroke subtypes may reflect a direct causal effect of body dimensions on stroke subtypes or the effects of some other correlated anthropometric trait (such as lean body mass) on the diseases. Previous MR studies have suggested that greater lung function may act as a possible mediator of the protective effect of height on ischemic heart disease [5]. In both UKB and CKB, taller height was associated with higher lung function and so lung function could account for some of the protective effects of height [5].
This study provides novel support for the causal relevance of height for cardioembolic stroke, the most disabling consequence of atrial fibrillation. Previous studies have supported the causal relevance of height and lean body mass for atrial fibrillation [6,7] and suggested that greater lean body mass is the chief anthropometric risk factor (stronger than height) for atrial fibrillation [7]. Larger left atrial diameter, present in taller people, has also been associated with higher risks of atrial fibrillation and embolism from cardiac sources [30], but whether these associations are mediated by lean body mass or some other physical aspect of body dimensions has not been previously studied. Higher levels of lean body mass have also been positively associated with other physical measures, such as carotid intima-media thickness, left ventricular mass, and cardiac wall thickness, but not with atherosclerosis [31].
The opposing associations of height with cardioembolic and other ischemic stroke subtypes highlight the importance of considering ischemic stroke subtypes as distinct diseases. Studies examining the associations of risk factors with overall ischemic stroke may incorrectly estimate medically relevant associations of some risk factors with individual ischemic stroke subtypes. Many studies (e.g., UKB, with follow-up based on electronic health records) and cardiovascular trials do not currently have detailed and reliable ischemic stroke subtyping, limiting their use for causal inference. Subtyping is also important in clinical practice for prevention of stroke recurrence, where the impact of treatments, such as statins or anticoagulants, may vary in patients at particular risk for different ischemic stroke subtypes [27].
Men and women in CKB were 10 and 8 cm shorter (about 1.5 SD), respectively, than their counterparts in UKB (S1 Table). If the MR associations in Fig 2 are assumed to be causal, this would translate to adults in China having a higher risk of some ischemic stroke subtypes (particularly large-artery stroke and small-vessel stroke) and a lower risk of cardioembolic stroke compared with Europeans. In CKB, genetically determined height was associated with a modestly lower, albeit not statistically significant, OR for all ischemic stroke subtypes.
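As a rough illustration of this translation, a per-SD OR can be rescaled to a different height difference on the log scale. The sketch below applies the MEGASTROKE ORs per 1 SD taller height to a population that is about 1.5 SD shorter. It assumes that the associations are causal, log-linear in height, and transferable between ancestries, none of which is established here. The function name is hypothetical.

```python
from math import exp, log


def scale_or(or_per_sd, sd_difference):
    """Rescale an OR per 1 SD to a height difference of `sd_difference` SDs,
    assuming a log-linear association."""
    return exp(log(or_per_sd) * sd_difference)


# ORs per 1 SD taller genetically determined height in MEGASTROKE.
megastroke_or = {"cardioembolic": 1.13, "large-artery": 0.89, "small-vessel": 0.87}

# A negative SD difference corresponds to a shorter population.
for subtype, or_sd in megastroke_or.items():
    print(subtype, round(scale_or(or_sd, -1.5), 2))
```

Values above 1 for large-artery and small-vessel stroke, and below 1 for cardioembolic stroke, correspond to the direction of the differences described above.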
The present study also had several limitations. Genotypes associated with height, education, blood pressure, and several chronic diseases have been shown to be correlated within spouse pairs (i.e., indicative of assortative mating), which can lead to indirect effects of genotypes in offspring, in violation of MR assumptions [32]. Family-based studies have reported that such indirect genetic effects of nontransmitted alleles could explain about 12% of the genetic effect on height [33]. As desirable traits such as higher income, taller height, and healthy traits tend to cluster in mates, assortative mating could explain some of the protective associations of taller height, but is unlikely to explain the adverse associations of height with atrial fibrillation and cardioembolic stroke.
A further limitation is that studies differed in the methodology used to classify ischemic stroke subtypes, and reliable subtyping was not available in all of the populations studied. As cardioembolic stroke has been reported to account for 22% of ischemic stroke cases in a global meta-analysis [34] and over half of cases in a Canadian registry study [35], the relatively low number of presumed cardioembolic stroke cases observed in both UKB and CKB may be an underestimate of the true incidence of cardioembolic strokes.
While height has been estimated to have a SNP-based heritability of about 50% in both Europeans [19] and East Asians [20], genetic instruments derived in European populations may not perform as well in populations of other ancestries, due to differences in allele frequencies and LD structure, but can still provide valid causal inferences [21,22]. The genetic risk score for height used in UKB (based on an independent, largely European ancestry-based GWAS) explained 19.7% of the variance in height in UKB, but the genetic risk score used in CKB (based on a large GWAS of height in a European population [18] and a smaller GWAS of height in a Japanese population [20]) explained only 15.2% of the variance in height in CKB. The present multiple ancestry analysis in MEGASTROKE may therefore have underestimated the causal effects of height if the (European ancestry derived) genetic risk score used was associated with smaller differences in height in the non-European ancestry populations.
The findings in the present study highlight important differences in the causal pathways between stroke subtypes and the need to distinguish such subtypes not only in clinical practice, but also in cardiovascular trials, electronic health records, and population studies. Although height is not a modifiable risk factor, recognition that taller individuals have increased risk of cardioembolic stroke may guide clinicians to screen for atrial fibrillation or other risk factors for cardioembolic stroke when managing an individual’s overall risk [3]. Further research is needed to understand the shared biological and physical pathways underlying the associations of height with stroke subtypes. The strong association of genetically determined height with physical measurements such as lean body mass and lung function and with atrial fibrillation suggests that these may be mediators of some of the associations with height. Further study, such as multivariable MR with robust instruments (probably sex specific, because of the substantial differences in anthropometric measures by sex), could yield further insight into the direct and indirect effects of height through other factors on the risks of ischemic stroke subtypes.
In conclusion, the present genetic studies provide novel and reliable findings that support a causal association of taller adult height with higher risks of atrial fibrillation and cardioembolic stroke and lower risks of other ischemic stroke subtypes. These findings raise two questions for further investigation: whether including height as a risk factor in risk prediction tools would improve screening and primary prevention of cardioembolic stroke, and whether understanding the shared biological and physical pathways involved in height may offer novel targets for treatment to prevent cardioembolic stroke.
Supporting information
S1 Text. Members of the CKB Collaborative Group.
CKB, China Kadoorie Biobank.
https://doi.org/10.1371/journal.pmed.1003967.s001
(DOCX)
S2 Text. Members of the MEGASTROKE consortium.
https://doi.org/10.1371/journal.pmed.1003967.s002
(DOCX)
S1 Checklist. STROBE-MR checklist of recommended items to address in reports of MR studies.
This checklist is copyrighted by the Equator Network under the Creative Commons Attribution 3.0 Unported (CC BY 3.0) license. MR, mendelian randomization; NA, not applicable; STROBE-MR, Strengthening the Reporting of Observational Studies in Epidemiology using Mendelian Randomization.
https://doi.org/10.1371/journal.pmed.1003967.s003
(DOCX)
S1 Methods. Revisions after study initiation.
https://doi.org/10.1371/journal.pmed.1003967.s004
(DOCX)
S2 Methods. Additional methods for UKB.
UKB, UK Biobank.
https://doi.org/10.1371/journal.pmed.1003967.s005
(DOCX)
S3 Methods. Additional methods for CKB.
CKB, China Kadoorie Biobank.
https://doi.org/10.1371/journal.pmed.1003967.s006
(DOCX)
S4 Methods. Further details of instruments for genetically determined height.
https://doi.org/10.1371/journal.pmed.1003967.s007
(DOCX)
S6 Methods. Sensitivity analyses in MEGASTROKE.
https://doi.org/10.1371/journal.pmed.1003967.s009
(DOCX)
S7 Methods. Further details of observational analyses in UKB and CKB.
CKB, China Kadoorie Biobank; UKB, UK Biobank.
https://doi.org/10.1371/journal.pmed.1003967.s010
(DOCX)
S1 Table. Baseline characteristics of included UKB and CKB participants without prior major cardiovascular disease.
Data are n (%) or mean (SD) unless otherwise stated. In UKB, blood lipids measurements were available in 85% to 93% of participants, lung function in 71%, and anthropometric traits ≥98% (S1 Fig). In CKB, blood lipids measurements were available in 4% of participants and lung function in 87% (S2 Fig). BMI, body mass index; CKB, China Kadoorie Biobank; FEV1, forced expiratory volume in 1 second; FVC, forced vital capacity; HDL, high-density lipoprotein; IQR, interquartile range; LDL, low-density lipoprotein; NA, not available; UKB, UK Biobank.
https://doi.org/10.1371/journal.pmed.1003967.s011
(DOCX)
S2 Table. Ancestry composition of stroke cases in MEGASTROKE.
Ancestry was predominantly self-reported in the 29 genome-wide studies comprising the MEGASTROKE consortium data. *Other ancestry included Latin American and mixed Asian ancestry.
https://doi.org/10.1371/journal.pmed.1003967.s012
(DOCX)
S3 Table. Percentage of height variance explained (R2) by genetic instruments for height in UKB and CKB.
*Each individual genetic instrument for height, based on GIANT or Biobank Japan SNPs, was linkage disequilibrium pruned (r2 < 0.05). †The number of linkage disequilibrium pruned SNPs available in UKB or CKB. ‡Beta estimate of measured height regressed on the genetic risk score for height, adjusted for age, age2, sex, region (in CKB only), genomic principal components, and genotyping array type. Biobank Japan, Biobank Japan genome-wide association study (2019) [20]; CKB, China Kadoorie Biobank; GIANT (2014), Genetic Investigation of Anthropometric Traits (2014) [19]; GIANT (2018), Genetic Investigation of Anthropometric Traits (2018) [18]; R2, the proportion of the residual variance of height explained by the genetic risk score for height (the coefficient of determination); SNP, single nucleotide polymorphism; UKB, UK Biobank.
https://doi.org/10.1371/journal.pmed.1003967.s013
(DOCX)
S4 Table. Associations of the genetic risk score for height with ischemic stroke and its subtypes in UKB and CKB.
CKB, China Kadoorie Biobank; UKB, UK Biobank.
https://doi.org/10.1371/journal.pmed.1003967.s014
(DOCX)
S5 Table. Weighted median, MR–Egger, and MR–PRESSO sensitivity analyses of the associations of genetically determined height with ischemic stroke and its subtypes in MEGASTROKE.
*OR per 1 SD genetically determined taller height. The numbers of events reported for MEGASTROKE were the maximum number of cases available in the genetic summary data. GIANT (2018), Genetic Investigation of Anthropometric Traits (2018) [18]; IVW, inverse variance weighted; MR, mendelian randomization; MR–PRESSO, Mendelian Randomization Pleiotropy RESidual Sum and Outlier; OR, odds ratio; SD, standard deviation.
https://doi.org/10.1371/journal.pmed.1003967.s015
(DOCX)
S6 Table. Additional sensitivity analyses of the associations of genetically determined height with ischemic stroke and its subtypes in MEGASTROKE using (A) a restricted genetic instrument excluding SNPs associated with age at completion of full-time education, diabetes, atrial fibrillation, hypertension, systolic blood pressure, diastolic blood pressure, LDL cholesterol, HDL cholesterol, triglycerides, or apolipoprotein B and (B) a genetic instrument for height with a stricter level of LD pruning (r2 < 0.001).
*The number of SNPs available for each stroke subtype varied from 2,084 to 2,277 in the main genetic instrument for height, from 1,377 to 1,514 in the restricted genetic instrument in (A) and from 1,114 to 1,180 in the genetic instrument in (B). See S4 Methods (further details of instruments for genetically determined height). †In the pan-ancestry genetic analysis of the UKB (Pan-UKBB) based on 294,072 to 421,391 participants [26]. Percentages of SNPs associated (at p < 0.001) with each risk factor were age at completion of full-time education (2.8%), diabetes (2.2%), atrial fibrillation (1.4%), hypertension (7.1%), systolic blood pressure (10.3%), diastolic blood pressure (8.9%), LDL cholesterol (5.7%), HDL cholesterol (10.7%), triglycerides (10.5%), and apolipoprotein B (6.5%). HDL, high-density lipoprotein; LD, linkage disequilibrium; LDL, low-density lipoprotein; OR, odds ratio; SNP, single nucleotide polymorphism; UKB, UK Biobank.
https://doi.org/10.1371/journal.pmed.1003967.s016
(DOCX)
S7 Table. Associations of genetically determined height with ischemic stroke and its subtypes in CKB shown for different genetic instruments.
Each individual genetic instrument for height, based on GIANT or Biobank Japan SNPs, was linkage disequilibrium pruned (r2 < 0.05). The category “All ischemic stroke” includes additional unsubtyped ischemic strokes. Genetic associations in CKB were adjusted for age, age2, sex, region, genomic principal components, and genotyping array type. Biobank Japan, Biobank Japan genome-wide association study (2019) [20]; CKB, China Kadoorie Biobank; GIANT (2018), Genetic Investigation of Anthropometric Traits (2018) [18]; OR, odds ratio; R2, the proportion of the residual variance of height explained by the genetic risk score for height (the coefficient of determination); SNP, single nucleotide polymorphism.
https://doi.org/10.1371/journal.pmed.1003967.s017
(DOCX)
S8 Table. Associations of measured height with ischemic stroke and its subtypes in UKB and CKB.
For UKB and CKB, respectively, the SDs of directly measured height were 6.8 cm versus 6.5 cm for men and 6.3 cm versus 6.0 cm for women. *Associations were stratified by age at risk (in 5-year groups), sex, and region (in CKB only) and adjusted for year of birth. †Additional potential confounders included year of birth, smoking status, number of cigarettes smoked, systolic blood pressure, diastolic blood pressure, diagnosed hypertension, diagnosed diabetes, self-rated walking pace (UKB only), and level of education (S6 Methods). CKB, China Kadoorie Biobank; HR, hazard ratio; SD, standard deviation; UKB, UK Biobank.
https://doi.org/10.1371/journal.pmed.1003967.s018
(DOCX)
S9 Table. Associations of genetically determined height with other cardiovascular risk factors—Smoking status and education.
*Effects are the ORs per 1 SD genetically determined taller height, adjusted for age, age2, sex, region (in CKB only), genomic principal components, and genotyping array type. For UKB and CKB, respectively, the SDs of directly measured height were 6.8 cm versus 6.5 cm for men and 6.3 cm versus 6.0 cm for women. †Each pair of signs indicates the direction of the estimated effect for UKB (first sign) and CKB (second sign). CKB, China Kadoorie Biobank; OR, odds ratio; SD, standard deviation; UKB, UK Biobank.
https://doi.org/10.1371/journal.pmed.1003967.s019
(DOCX)
S1 Fig. Data flow diagram for UKB.
UKB, UK Biobank.
https://doi.org/10.1371/journal.pmed.1003967.s020
(TIF)
S2 Fig. Data flow diagram for CKB.
CKB, China Kadoorie Biobank.
https://doi.org/10.1371/journal.pmed.1003967.s021
(TIF)
S3 Fig. MR framework for MEGASTROKE, UKB, and CKB.
Biobank Japan, Biobank Japan genome-wide association study (2019) [20]; CKB, China Kadoorie Biobank; GC, genetic consortia (which differs between studies); GIANT (2014), Genetic Investigation of Anthropometric Traits (2014) [19]; GIANT (2018), Genetic Investigation of Anthropometric Traits (2018) [18]; GWAS, genome-wide association study; HR, hazard ratio; MR, mendelian randomization; OR, odds ratio; SNP, single nucleotide polymorphism; UKB, UK Biobank.
https://doi.org/10.1371/journal.pmed.1003967.s022
(TIF)
S4 Fig. Effects of height-associated SNPs on ischemic stroke and its subtypes in MEGASTROKE (multiple ancestry) in relation to their effects on height in GIANT (2018).
For MEGASTROKE (multiple ancestry), 2,265 height-associated SNPs were available for ischemic stroke cases, 2,270 for cardioembolic and large-artery stroke cases, and 2,084 for small-vessel stroke cases. GIANT (2018), Genetic Investigation of Anthropometric Traits (2018) [18]; SD, standard deviation; SNP, single nucleotide polymorphism.
https://doi.org/10.1371/journal.pmed.1003967.s023
(TIF)
S1 Data Table. SNPs used to construct the genetic instrument for height in MEGASTROKE.
*GIANT (2018), Genetic Investigation of Anthropometric Traits (2018) [18]. SNP, single nucleotide polymorphism.
https://doi.org/10.1371/journal.pmed.1003967.s024
(XLSX)
S2 Data Table. SNPs used to construct the genetic instrument for height in UKB.
*GIANT (2014), Genetic Investigation of Anthropometric Traits (2014) [19]. SNP, single nucleotide polymorphism; UKB, UK Biobank.
https://doi.org/10.1371/journal.pmed.1003967.s025
(XLSX)
S3 Data Table. SNPs used to construct the genetic instrument for height in CKB.
*GIANT (2018), Genetic Investigation of Anthropometric Traits (2018) [18]. †Biobank Japan genome-wide association study (2019) [20]. CKB, China Kadoorie Biobank; SNP, single nucleotide polymorphism.
https://doi.org/10.1371/journal.pmed.1003967.s026
(XLSX)
S4 Data Table. Associations of SNPs used to construct the genetic instrument for height in MEGASTROKE with ischemic stroke and its subtypes.
SNP, single nucleotide polymorphism.
https://doi.org/10.1371/journal.pmed.1003967.s027
(XLSX)
Acknowledgments
The China Kadoorie Biobank study is jointly coordinated by the University of Oxford and the Chinese Academy of Medical Sciences. Members of the MEGASTROKE consortium are listed in S2 Text. The research included in the present report used data obtained from the UKB resource under application 10 061. Genotyping data were exported from China to the Oxford CKB International Coordinating Centre under Data Export Approvals 2014–13 and 2015–39 from the Office of Chinese Human Genetic Resource Administration.
References
- 1. The Emerging Risk Factors Collaboration. Adult height and the risk of cause-specific death and vascular morbidity in 1 million people: individual participant meta-analysis. Int J Epidemiol. 2012;41:1419–33. pmid:22825588
- 2. Lai FY, Nath M, Hamby SE, Thompson JR, Nelson CP, Samani NJ. Adult height and risk of 50 diseases: a combined epidemiological and genetic analysis. BMC Med. 2018;16:187. pmid:30355295
- 3. Levin MG, Judy R, Gill D, Vujkovic M, Verma SS, Bradford Y, et al. Genetics of height and risk of atrial fibrillation: A Mendelian randomization study. PLoS Med. 2020;17:e1003288. pmid:33031386
- 4. Nelson CP, Hamby SE, Saleheen D, Hopewell JC, Zeng L, Assimes TL, et al. Genetically determined height and coronary artery disease. N Engl J Med. 2015;372:1608–18. pmid:25853659
- 5. Marouli E, Del Greco MF, Astley CM, Yang J, Ahmad S, Berndt SI, et al. Mendelian randomisation analyses find pulmonary factors mediate the effect of height on coronary artery disease. Commun Biol. 2019;2:119. pmid:30937401
- 6. Tikkanen E, Gustafsson S, Knowles JW, Perez M, Burgess S, Ingelsson E. Body composition and atrial fibrillation: a Mendelian randomization study. Eur Heart J. 2019;40:1277–82. pmid:30721963
- 7. Fenger-Grøn M, Overvad K, Tjønneland A, Frost L. Lean Body Mass Is the Predominant Anthropometric Risk Factor for Atrial Fibrillation. J Am Coll Cardiol. 2017;69:2488–97. pmid:28521886
- 8. Lawlor DA, Tilling K, Davey Smith G. Triangulation in aetiological epidemiology. Int J Epidemiol. 2016;45:1866–86. pmid:28108528
- 9. Nüesch E, Dale C, Palmer TM, White J, Keating BJ, van Iperen EP, et al. Adult height, coronary heart disease and stroke: a multi-locus Mendelian randomization meta-analysis. Int J Epidemiol. 2016;45:1927–37. pmid:25979724
- 10. Bycroft C, Freeman C, Petkova D, Band G, Elliott LT, Sharp K, et al. The UK Biobank resource with deep phenotyping and genomic data. Nature. 2018;562:203–9. pmid:30305743
- 11. Chen Z, Chen J, Collins R, Guo Y, Peto R, Wu F, et al. China Kadoorie Biobank of 0.5 million people: survey methods, baseline characteristics and long-term follow-up. Int J Epidemiol. 2011;40:1652–66. pmid:22158673
- 12. Skrivankova VW, Richmond RC, Woolf BAR, Yarmolinsky J, Davies NM, Swanson SA, et al. Strengthening the Reporting of Observational Studies in Epidemiology Using Mendelian Randomization: The STROBE-MR Statement. JAMA. 2021;326:1614–21. pmid:34698778
- 13. Malik R, Chauhan G, Traylor M, Sargurupremraj M, Okada Y, Mishra A, et al. Multiancestry genome-wide association study of 520,000 subjects identifies 32 loci associated with stroke and stroke subtypes. Nat Genet. 2018;50:524–37. pmid:29531354
- 14. Adams HP, Bendixen BH, Kappelle LJ, Biller J, Love BB, Gordon DL, et al. Classification of subtype of acute ischemic stroke. Definitions for use in a multicenter clinical trial. TOAST. Trial of Org 10172 in Acute Stroke Treatment. Stroke. 1993;24:35–41. pmid:7678184
- 15. Allen NE, Arnold M, Parish S, Hill M, Sheard S, Callen H, et al. Approaches to minimising the epidemiological impact of sources of systematic and random variation that may affect biochemistry assay data in UK Biobank [version 2; peer review: 2 approved]. Wellcome Open Research. 2021. pmid:33364437
- 16. Ay H, Benner T, Murat AE, Furie KL, Singhal AB, Jensen MB, et al. A Computerized Algorithm for Etiologic Classification of Ischemic Stroke. Stroke. 2007;38:2979–84. pmid:17901381
- 17. Millwood IY, Walters RG. Collection, Processing, and Management of Biological Samples in Biobank Studies. In: Chen Z, editor. Population Biobank Studies: A Practical Guide. Singapore: Springer; 2020. pp. 77–97.
- 18. Yengo L, Sidorenko J, Kemper KE, Zheng Z, Wood AR, Weedon MN, et al. Meta-analysis of genome-wide association studies for height and body mass index in ∼700000 individuals of European ancestry. Hum Mol Genet. 2018;27:3641–9. pmid:30124842
- 19. Wood AR, Esko T, Yang J, Vedantam S, Pers TH, Gustafsson S, et al. Defining the role of common variation in the genomic and biological architecture of adult human height. Nat Genet. 2014;46:1173–86. pmid:25282103
- 20. Akiyama M, Ishigaki K, Sakaue S, Momozawa Y, Horikoshi M, Hirata M, et al. Characterizing rare and low-frequency height-associated variants in the Japanese population. Nat Commun. 2019;10:4393. pmid:31562340
- 21. Sun L, Clarke R, Bennett D, Guo Y, Walters RG, Hill M, et al. Causal associations of blood lipids with risk of ischemic stroke and intracerebral hemorrhage in Chinese adults. Nat Med. 2019;25:569–74. pmid:30858617
- 22. Martin AR, Kanai M, Kamatani Y, Okada Y, Neale BM, Daly MJ. Clinical use of current polygenic risk scores may exacerbate health disparities. Nat Genet. 2019;51:584–91. pmid:30926966
- 23. Burgess S, Small DS, Thompson SG. A review of instrumental variable estimators for Mendelian randomization. Stat Methods Med Res. 2017;26:2333–55. pmid:26282889
- 24. Burgess S, Thompson SG. Mendelian Randomization: Methods for Using Genetic Variants in Causal Estimation. 1st ed. Boca Raton: CRC Press; 2015.
- 25. Verbanck M, Chen C-Y, Neale B, Do R. Detection of widespread horizontal pleiotropy in causal relationships inferred from Mendelian randomization between complex traits and diseases. Nat Genet. 2018;50:693–8. pmid:29686387
- 26. Pan-ancestry UK Biobank (Pan-UKBB). [cited 16 Dec 2021]. https://pan.ukbb.broadinstitute.org/.
- 27. Hindy G, Engström G, Larsson SC, Traylor M, Markus HS, Melander O, et al. Role of Blood Lipids in the Development of Ischemic Stroke and its Subtypes: A Mendelian Randomization Study. Stroke. 2018;49:820–7. pmid:29535274
- 28. Georgakis MK, Gill D, Webb AJS, Evangelou E, Elliott P, Sudlow CLM, et al. Genetically determined blood pressure, antihypertensive drug classes, and risk of stroke subtypes. Neurology. 2020. pmid:32611631
- 29. Yang F, Qian D, Liu X. for the Healthy Aging and Development Study Group in Nanjing Medical University, for the Data Mining Group of Biomedical Big Data in Nanjing Medical University. Socioeconomic disparities in prevalence, awareness, treatment, and control of hypertension over the life course in China. Int J Equity Health. 2017;16:100. pmid:28610576
- 30. Tsang TSM, Barnes ME, Bailey KR, Leibson CL, Montgomery SC, Takemoto Y, et al. Left atrial volume: important risk marker of incident atrial fibrillation in 1655 older men and women. Mayo Clin Proc. 2001;76:467–75. pmid:11357793
- 31. Arnold M, Linden A, Clarke R, Guo Y, Du H, Bian Z, et al. Carotid Intima-Media Thickness but Not Carotid Artery Plaque in Healthy Individuals Is Linked to Lean Body Mass. J Am Heart Assoc. 2019;8:e011919. pmid:31364443
- 32. Howe LJ, Lawson DJ, Davies NM, Pourcain BS, Lewis SJ, Davey Smith G, et al. Genetic evidence for assortative mating on alcohol consumption in the UK Biobank. Nat Commun. 2019;10:5039. pmid:31745073
- 33. Kong A, Thorleifsson G, Frigge ML, Vilhjalmsson BJ, Young AI, Thorgeirsson TE, et al. The nature of nurture: Effects of parental genotypes. Science. 2018;359:424–8. pmid:29371463
- 34. Ornello R, Degan D, Tiseo C, Di Carmine C, Perciballi L, Pistoia F, et al. Distribution and Temporal Trends From 1993 to 2015 of Ischemic Stroke Subtypes: A Systematic Review and Meta-Analysis. Stroke. 2018;49:814–9. pmid:29535272
- 35. Bogiatzi C, Hackam DG, McLeod IA, Spence DJ. Secular Trends in Ischemic Stroke Subtypes and Stroke Risk Factors. Stroke. 2014;45:3208–13. pmid:25213343