Skip to main content
Advertisement
  • Loading metrics

Disentangling the consequences of type 2 diabetes on targeted metabolite profiles using causal inference and interaction QTL analyses

  • Ozvan Bocher ,

    Roles Conceptualization, Data curation, Formal analysis, Methodology, Visualization, Writing – original draft, Writing – review & editing

    ozvan.bocher@helmholtz-munich.de

    Affiliation Institute of Translational Genomics, Helmholtz Zentrum München- German Research Center for Environmental Health, Neuherberg, Germany

  • Archit Singh,

    Roles Data curation, Writing – review & editing

    Affiliations Institute of Translational Genomics, Helmholtz Zentrum München- German Research Center for Environmental Health, Neuherberg, Germany, Munich School for Data Science (MUDS), Helmholtz Zentrum München- German Research Center for Environmental Health, Neuherberg, Germany, Technical University of Munich (TUM), TUM School of Medicine and Health, Graduate School of Experimental Medicine, Munich, Germany

  • Yue Huang,

    Roles Data curation, Writing – review & editing

    Affiliations Institute of Translational Genomics, Helmholtz Zentrum München- German Research Center for Environmental Health, Neuherberg, Germany, Technical University of Munich (TUM), TUM School of Medicine and Health, Graduate School of Experimental Medicine, Munich, Germany

  • Urmo Võsa,

    Roles Formal analysis, Resources, Writing – review & editing

    Affiliation Estonian Genome Centre, Institute of Genomics, University of Tartu, Tartu, Estonia

  • Ene Reimann,

    Roles Data curation, Writing – review & editing

    Affiliations Institute of Translational Genomics, Helmholtz Zentrum München- German Research Center for Environmental Health, Neuherberg, Germany, Estonian Genome Centre, Institute of Genomics, University of Tartu, Tartu, Estonia

  • Ana Arruda,

    Roles Conceptualization, Writing – review & editing

    Affiliations Institute of Translational Genomics, Helmholtz Zentrum München- German Research Center for Environmental Health, Neuherberg, Germany, Munich School for Data Science (MUDS), Helmholtz Zentrum München- German Research Center for Environmental Health, Neuherberg, Germany, Technical University of Munich (TUM), TUM School of Medicine and Health, Graduate School of Experimental Medicine, Munich, Germany

  • Andrei Barysenska,

    Roles Data curation, Writing – review & editing

    Affiliation Institute of Translational Genomics, Helmholtz Zentrum München- German Research Center for Environmental Health, Neuherberg, Germany

  • Anastassia Kolde,

    Roles Data curation, Resources, Writing – review & editing

    Affiliations Estonian Genome Centre, Institute of Genomics, University of Tartu, Tartu, Estonia, Institute of Mathematics and Statistics, University of Tartu, Tartu, Estonia

  • Nigel W. Rayner,

    Roles Resources, Writing – review & editing

    Affiliation Institute of Translational Genomics, Helmholtz Zentrum München- German Research Center for Environmental Health, Neuherberg, Germany

  • Estonian Biobank research team ,

    Membership of Estonian Biobank research team is provided in the Acknowledgements

  • Tõnu Esko,

    Roles Resources, Writing – review & editing

    Affiliation Estonian Genome Centre, Institute of Genomics, University of Tartu, Tartu, Estonia

  • Reedik Mägi,

    Roles Funding acquisition, Resources, Writing – review & editing

    Affiliation Estonian Genome Centre, Institute of Genomics, University of Tartu, Tartu, Estonia

  • Eleftheria Zeggini

    Roles Conceptualization, Funding acquisition, Supervision, Writing – review & editing

    Affiliations Institute of Translational Genomics, Helmholtz Zentrum München- German Research Center for Environmental Health, Neuherberg, Germany, TUM school of medicine and health, Technical University Munich and Klinikum Rechts der Isar, Munich, Germany

Abstract

Circulating metabolite levels have been associated with type 2 diabetes (T2D), but the extent to which T2D affects metabolite levels and their genetic regulation remains to be elucidated. In this study, we investigate the interplay between genetics, metabolomics, and T2D risk in the UK Biobank dataset using the Nightingale panel composed of 249 metabolites, 92% of which correspond to lipids (HDL, IDL, LDL, VLDL) and lipoproteins. By integrating these data with large-scale T2D GWAS from the DIAMANTE meta-analysis through Mendelian randomization analyses, we find 79 metabolites with a causal association to T2D, all spanning lipid-related classes except for Glucose and Tyrosine. Twice as many metabolites are causally affected by T2D liability, spanning almost all tested classes, including branched-chain amino acids. Secondly, using an interaction quantitative trait locus (QTL) analysis, we describe four metabolites consistently replicated in an independent dataset from the Estonian Biobank, for which genetic loci in two different genomic regions show attenuated regulation in T2D cases compared to controls. The significant variants from the interaction QTL analysis are significant QTLs for the corresponding metabolites in the general population but are not associated with T2D risk, pointing towards consequences of T2D on the genetic regulation of metabolite levels. Finally, through differential level analyses, we find 165 metabolites associated with microvascular, macrovascular, or both types of T2D complications, with only a few discriminating between complication classes. Of the 165 metabolites, 40 are not causally linked to T2D in either direction, suggesting biological mechanisms specific to the occurrence of complications. Overall, this work provides a map of the consequences of T2D on Nightingale targeted metabolite levels and on their genetic regulation, enabling a better understanding of the T2D trajectory leading to complications.

Author summary

Type 2 diabetes (T2D) is a complex disease that affects millions of people worldwide, with its progression to complications posing a significant health burden. While the genetics of the disease has been increasingly described through large efforts, the biological mechanisms behind these associations remain to be fully elucidated. Additionally, the consequences of the disease on molecular profiles, and how these changes could lead to further complications, remain poorly understood. Here, we have investigated the relationships between T2D, its complications and 249 targeted metabolites mostly focusing on lipids. We show a causal impact of 79 metabolites on T2D risk, while twice as many metabolites were causally affected by T2D liability. Further, we highlight four metabolites whose genetic regulation appears to be altered in T2D cases, which, along with additional lines of evidence, seems to be a consequence rather than a cause of T2D occurrence. Finally, we found that 165 metabolites were associated with the occurrence of T2D complications, with most also showing causal links to T2D, although 40 of them did not demonstrate such associations. Our study enhances our understanding of the metabolic pathways involved in the trajectory of T2D to complications and represents a first step towards the prevention of the development of complications in the future, a key public health priority.

Introduction

Type 2 diabetes (T2D) is a common, complex disease, with a prevalence that is expected to increase dramatically, and for which the genetics has been largely described through genome-wide association study (GWAS) meta-analysis efforts [13]. The next step towards translating these associations into the clinic is to understand the biological mechanisms behind them. To this end, multi-omics data such as metabolite levels offer great promise, as they enable the study of molecular phenotypes closely implicated in the disease. A large number of metabolites have been associated with T2D, as highlighted in a recent study from Julkunen et al. [4], which described 230 metabolites as nominally associated with incident and prevalent T2D. Consistent associations between T2D risk and metabolites across studies mostly cover increases in various amino acid levels, especially branched-chain amino acids (BCAAs) [57], and dyslipidemia [4,8]. However, the predictive value of metabolite profiles in T2D risk is still debated. Improved prediction of various complex traits using metabolite profiles over genetic scores has been shown[9], but this improvement is rather limited when compared to classical risk factors including age, sex and family history of disease [10,11]. The limited prediction of metabolite profiles may be related to the uncertain causal role in modulating T2D risk, a question investigated using Mendelian randomization (MR) [12]. One particular example is BCAAs, for which some studies have described a causal effect of valine, leucine and/or isoleucine on T2D [1315], while more recent evidence has shown no causal effect of BCAAs on T2D risk in the UK Biobank (UKBB) cohort [16]. Conversely, increasing evidence of a causal effect of T2D liability on metabolite profiles has been shown, with increased alanine levels caused by increased T2D liability being consistently reported [13,17,18]. Conflicting conclusions exist for other metabolites, and little is known about the role of genetic regulation in shaping these relationships. The genetic regulation of metabolite profiles is increasingly described, for example through the latest meta-analysis reporting over 400 independent loci associated with 233 metabolite levels [19]. These metabolite QTLs, also called mQTLs, are usually reported in the general population, and studies are starting to emerge on how they are modified by factors such as diet [20]. However, no study to date has investigated the question of whether the genetic regulation of metabolite levels is affected by the occurrence of T2D.

A large part of the healthcare burden associated with T2D is due to subsequent complications of the disease. T2D complications span both microvascular complications, which refer to complications involving small vessels and have an estimated prevalence of 53%, and macrovascular complications which refer to complications involving large vessels such as arteries and veins with an estimated prevalence of 27% [21]. Increasing evidence is emerging that metabolite profiles are also associated with the risk of developing complications, such as the fatty acid biosynthesis pathway with retinal and renal complications [22], two of the main microvascular complications of T2D, n-3 fatty acids with T2D macrovascular complications [23], or amino acids with both types of T2D complications [24]. However, replication of these associations is still needed, as well as investigating whether the metabolites associated with T2D complications are distinct from the ones causally affected by T2D liability. Exploring the links between metabolite levels and T2D can help us gain insights into how these relationships may influence trajectories towards T2D complications, for which underlying biological mechanisms are still to be unraveled.

Here, we aim to address these questions and elucidate the metabolic consequences of T2D by investigating the links between metabolite profiles, genetics, and the risk of T2D and subsequent complications. For this, we considered profiles of 249 metabolites from the UKBB cohort [25] assessed through the Nightingale panel, mostly composed of lipids and lipoproteins classes (around 92% of metabolites), and covering additional classes such as amino acids (10 metabolites), ketone bodies (4 metabolites), glycolysis-related metabolites (4 metabolites), and markers of inflammation (1 metabolite). These levels were characterized for almost 275,000 participants through two releases, which were here considered for discovery (n = 117,967 individuals) and replication (n = 156,385 individuals), or meta-analyzed when statistically appropriate. We first performed a bi-directional two sample MR analysis, using non-overlapping datasets for the SNP-exposure and SNP-outcome associations to limit potential bias [12]. Previous MR studies performed in the UKBB cohort have used only the first release of data for the 249 metabolites and were either limited to a few metabolites, or only investigated a single causal direction [16,18,26]. Secondly, we performed an interaction mQTL analysis to assess whether there is a different genetic regulation of metabolite levels between T2D patients and controls. Finally, we investigated how metabolite profiles are associated with T2D complications, and whether these associated metabolites overlap with the metabolites causally affected by T2D liability.

Methods

Ethics statement

Individual level data analysis in the Estonian Biobank was carried out under ethical approval [nr 1.1-12/3337] from the Estonian Committee on Bioethics and Human Research (Estonian Ministry of Social Affairs), using data according to release application [nr 6-7/GI/15486 T17] from the Estonian Biobank.

Data and quality control

Genetic data from genotyping arrays and imputation are available for over 500,000 participants in the UKBB cohort. Quality control (QC) was performed at the variant level and at the sample (S1 Appendix). We selected only variants with a minor allele frequency greater than 1% and with an imputation INFO score greater than 0.8. To minimize potential bias in MR due to differences between the exposure and outcome datasets which could be related to having different ancestry background in the two datasets, we chose to focus on unrelated individuals of European ancestry. In total, 408,194 individuals and 9,572,578 variants remained after QC.

A total of 249 metabolite levels obtained from the Nightingale platform using nuclear magnetic resonance (NMR) spectroscopy is available in the UKBB cohort as part of two different releases covering a total of 274,352 individuals: 117,967 in the first release, and 156,385 in the second release. Absolute concentrations cover 168 metabolites, with an additional set of 81 metabolite ratios and percentages, which are further derived from the absolute concentrations (https://biobank.ndph.ox.ac.uk/showcase/refer.cgi?id=1682). To perform metabolite QC, we used the ukbnmr R package (version 1.5) specifically developed to remove technical variations from NMR metabolite measurements in the UKBB [27]. For each release, we considered only metabolite data at the first timepoint, T0, corresponding to a total of 227,607 individuals and 249 metabolites after applying the ukbnmr QC (S1 Appendix). All analyses have been performed on inverse normal transformed values to obtain normally distributed metabolite levels [28], which correspond to ‘metabolite levels’ for the rest of the manuscript.

Definition of phenotypes

T2D status.

T2D status was defined based on the UKBB field 130708, which corresponds to the first date of T2D report (self-reported or ICD10 code). Status was defined at T0 to analyze metabolite profiles in the light of T2D status at the time of profiling (Fig A in S2 Appendix), prevalent cases corresponding to individuals having a T2D diagnosis date before the date of blood sample collection on which metabolites were assayed, and incident cases to individuals having a T2D diagnosis date after the date of blood sample collection. Considering that T2D is often diagnosed with a few years delay [29], we considered HbA1c levels in addition of the field 130708 to recover individuals likely having undiagnosed T2D at T0 among T2D incident cases. We removed from the analysis individuals with any mention of T1D (ICD10 code E10*, field 130706) or gestational diabetes (ICD10 code O24*, field 132202), and individuals diagnosed with T2D before the age of 36 years, in accordance with previous guidelines [30]. Individuals with a mention of T1D or gestational diabetes were also removed from the controls. The final number of T2D cases and controls included in the analyses, passing the genetic data QC and having measured metabolite levels are 3,088 and 88,244 for the first release and 4,302 and 118,555 for the second release, respectively.

T2D complications.

We categorized prevalent cases of T2D at T0 into four mutually exclusive complication groups based on the occurrence of generalized vascular complication events: “microvascular”, “macrovascular”, “micro and macrovascular” complications, and “no complications”. “Microvascular” and “macrovascular” complications were defined based on ICD10 codes (Table A in S3 Appendix). Individuals with an ICD10 code for both types of complications were attributed to the “micro and macrovascular” group and removed from the two other complication groups. We selected individuals with a diagnosis date of complications before the date of metabolite sample collection but after the T2D diagnosis date. If individuals had multiple ICD10 codes for the “microvascular” or the “macrovascular” complication group, the date of the first event was considered. For the “micro and macrovascular” group, the latest date between the “microvascular” and the “macrovascular” onset was considered. The total number of individuals in each complication group assayed in each of the release datasets is presented in Table 1.

thumbnail
Table 1. Number of T2D patients per complication group for each of the released dataset.

https://doi.org/10.1371/journal.pgen.1011346.t001

Statistical analyses

General considerations.

All the analyses have been performed using R version 4.2.0 and adjusted for age, sex, and fasting time (only available in the UKBB), as well as for additional covariates in the analyses involving genetic data (Table B in S3 Appendix). To correct our analyses for multiple testing, we considered FDR-adjusted p-values (Benjamini-Hochberg method, referred to as q-values) to assess significance at 5%. All analyses were performed on the two release datasets separately and meta-analyzed, except for the MR analyses where the same T2D summary statistics were used for both MR analyses on the two release sets (details below).

Mendelian randomization.

Two sample bi-directional MR was performed to assess the causal effects of metabolite levels on T2D risk (forward MR) and the causal effects of T2D liability on metabolite levels (reverse MR). MR was run separately on the two released datasets as it was not possible to perform meta-analysis due to the same outcome data being used in the two MR analyses, namely T2D DIAMANTE meta-analysis. For the first release dataset, we used mQTL summary statistics from Borges et al.[31], which correspond to the QTL map of the 249 metabolites from the first data release that has been generated in the UK Biobank. For the second release, we performed a mQTL analysis using the REGENIE software with the--qt option considering the INT metabolite levels as the phenotype (version 2.2.4) [32]. For the associations between genetic variants and T2D, we used the DIAMANTE 2018 meta-analysis [2], restricted to European ancestry samples, and without the UKBB cohort to avoid sample overlap between the exposure and the outcome data.

Instrumental variables (IVs) were selected from independent variants defined through LD-based clumping using plink [33] with the following parameters: R2<0.001 in windows of 10Mb, a p-value threshold of 2.54x10-10 for the metabolite levels and of 5x10-8 for T2D. The threshold of 2.54x10-10 is a genome-wide threshold corrected by the number of effective tests [34], estimated at 197, which enables to correct the analyses for multiple testing while considering the correlation between the metabolites. For each of these independent variants, the strength of association with the exposure was determined using F-statistics. For the first released dataset and T2D summary statistics, the F-statistic was estimated using , where beta is the effect estimate and se its corresponding standard error. For the second release dataset, individual-level data from the UKBB and the R package ivreg (version 0.6–1) were used to calculate the F-statistic. Only IVs with an F-statistic larger than 10 were retained, which corresponds to a total of 5 to 75 IVs when using the mQTL statistics from Borges et al.[31] based on the first metabolite release, and 4 to 90 IVs when using the results from our mQTL analysis performed on the second metabolite release.

Data were harmonized using the function harmonise_data() with default parameters from the R package TwoSampleMR (version 0.5–6)[35], which was further used to run the MR analyses. RadialMR [36] (version 1.0) was used to assess heterogeneity and remove outliers for metabolites with a significant SNP heterogeneity. MR significance in the first release dataset was assessed based on the inverse variance weighted (IVW) method using q-values at a 5% threshold. Six additional MR methods (MR Egger, weighted median, simple mode, weighted mode, IVW fixed effect, IVW random effect and Steiger filtered IVW) were used as sensitivity analyses to check for concordant direction of causal effect estimate. These sensitivity analyses enable to evaluate causation under different MR assumptions, such as MR-Egger being robust to pleiotropy, or Steiger-filtered IVW removing IVs more strongly associated with the outcome than with the exposure to limit reverse causation effects. We only considered metabolites having a non-significant heterogeneity and pleiotropy estimates, measured by the Q-statistic and the MR-Egger intercept, respectively (5% significance threshold). Significant metabolites were considered as replicated if they had an IVW q-value lower than 5% in the second release dataset with a concordant direction of effect of the IVW method with the first release dataset.

Statin medication, prescribed to lower LDL-related levels, is known to have an impact on T2D [37,38] and could therefore bias the MR results. To limit the impact of statin medication in our MR analyses, we therefore performed one-sample MR in the forward direction restricted to UKBB participants not taking any statin medication. For this, we removed individuals taking medication with an ATC code starting with C10A or C10B corresponding to ‘lipid lowering agents’. We performed this sensitivity analysis in the second release set for the significant metabolites in the forward direction using the ivreg package (version 0.6–1) (96,380 individuals with 1,044 T2D cases and 95,336 controls). We used the same IVs as in the main MR analyses in the second release set and recomputed the F-statistics on the set of individuals not under statin medication to select only IVs with F>10. As for the other sensitivity analyses, we then compared the directions of effect.

Interaction QTL.

We performed an interaction QTL analysis to investigate the genetic regulation of metabolite levels in T2D cases and controls using the REGENIE software with the--interaction option (version 2.2.4) [32] and the following interaction test:

Where SNP*T2D represents the interaction term between prevalent T2D disease status and genotypes. Only the p-value of the interaction test (“ADD-INT_SNPxVAR” column from the REGENIE output) was assessed for statistical significance. Interaction analyses were run on each of the two release datasets separately and were then meta-analyzed using the software METAL [39] to maximize statistical power. We considered suggestive variants if they had a meta-analysis p-value lower than the genome-wide significance threshold of 5x10-8, a nominally significant p-value in both release datasets, and a concordant direction of effect between both datasets. To investigate whether the results from the interaction QTL analyses could be due to the effect of confounding variables, we performed sensitivity analyses where we sequentially adjusted for BMI, lipid-lowering medication, and metformin (S1 Appendix). Lipid medication was considered as ATC code starting with C10A or C10B, and a binary variable was considered indicating if participants were taking any of this medication.

Replication of interaction QTL effects in Estonian Biobank. We replicated the interaction QTL analysis in the Estonian Biobank (EstBB), in which the same metabolite panel has been assayed, for the significant variants identified in the UKBB. The EstBB data freeze including altogether 211,728 biobank participants was applied.

The T2D cases were defined using the same approach as in the UKBB, where T2D diagnosis was based on the ICD10 codes E11* from electronic health registries (EHRs), and further on HbA1c levels for incident T2D cases. Individuals who have consumed insulin at least one year after the T2D diagnosis were excluded (EHRs do not have information about prescription of Insulins and analogies (the Anatomical Therapeutic Chemical code A10A*)). The control group was defined as follows: (1) they do not have ICD10 codes E10*, E11* or O24* marked in their EHRs, (2) they have not been prescribed any of the drugs with the following ATC codes: A10A*, A10BA02, A10BF01, A10BB07, A10BB03, V04CA01, A10BB01, A10BB09, A10BB12, A10BG01, A10BX02, A10BG03, A10BX03, A10BG02.

Similar to UKBB, metabolite data is available from Nightingale NMR spectrometry platform, and a QC was performed on both these data and the genotyping data (S1 Appendix). After QC, 6,237 T2D cases and 92,381 controls were enrolled into the replication analysis. Replication interaction QTL analysis was conducted by fitting a linear model with the interaction term between SNP and T2D status, while using SNP dosage, T2D status, sex, age at agreement, spectrometer serial number and ten first genetic principal components as covariates on each pair of variant and metabolite. Analysis and data processing was implemented into custom scripts and by using R v4.3.1. We considered replicated signals those with a q-value lower than 5% in the EstBB cohort and with a concordant direction of effect.

Analysis of T2D complications.

We analyzed the differences between metabolite levels and the four complication groups defined previously (microvascular, macrovascular, micro and macrovascular, no complications) using a multinomial approach. We used the R package mlogit (version 1.1–1), with T2D individuals without complications representing the reference level. To increase statistical power, we meta-analyzed the results across the two release datasets for each complication group. We declared metabolites as having a significant effect between different complication groups if they had a q-value lower than 5% in the meta-analysis, and a nominal significant p-value in each of the release datasets with a concordant direction of effect. Finally, we performed forward one-sample MR analyses within the UKBB to assess the causal effect of metabolite levels on the risk of developing T2D complications using the R package ivreg (version 0.6–1). We used the same metabolite IVs as for the T2D bi-directional two-sample MR, and only kept the ones having an F-statistic greater than 10 in the subset of T2D individuals.

Results

Causal associations

The upset plot in Fig 1 represents the causal relationships between T2D and metabolites from the bi-directional MR that were significant in the first release dataset and replicated in the second release dataset, and which passed all our sensitivity criteria. This includes no significant pleiotropy and no significant heterogeneity (assessed at 5%), and concordant directions of effect across six sensitivity methods (MR Egger, weighted median, simple mode, weighted mode, IVW fixed effect, IVW random effect and Steiger filtered IVW). Fig 1 also includes the significant associations between metabolite levels and prevalent/incident T2D in the UKBB described by Julkunen et al.[4] using the first metabolite release dataset. All metabolites that showed a significant causal association in either MR direction in our analysis were also associated with prevalent or incident T2D reported by Julkunen et al.[4], supporting the observed MR effects. Interestingly, while among the metabolites significant in the reverse direction, i.e., causally affected by T2D liability, 172 were found to be associated with both incident and prevalent T2D, 8 were found to be associated only with incident T2D and not significant in the forward direction. This finding points to T2D predisposition affecting these 8 metabolites levels, which are all related to very low-density lipoproteins (VLDL). MR estimates from the IVW method in both directions are reported in Table C in S3 Appendix and shown in Figs 2 and 3 as circular plots for 168 absolute metabolite levels and 81 derived ratios.

thumbnail
Fig 1. Upset plot of the forward (ForMR) and reverse (RevMR) MR analyses, along with the association results from Julkunen et al. [4] between metabolite levels and prevalent (Prevalent_Julikunen) and incident (Incident_Julkunen) T2D.

https://doi.org/10.1371/journal.pgen.1011346.g001

thumbnail
Fig 2. Circular plot of the forward (ForMR) and reverse (RevMR) MR analyses with inverse variance weighted (IVW) estimates from the 1st metabolite set (absolute metabolite values).

Metabolites having increased levels associated with T2D occurrence are presented in red and the ones with decreased levels in blue. Non-significant metabolites, or metabolites not replicated in the 2nd metabolite set are colored in grey. Metabolites are grouped according to metabolite classes. The y-axis corresponds to the q-value (qval).

https://doi.org/10.1371/journal.pgen.1011346.g002

thumbnail
Fig 3. Circular plot of the forward (ForMR) and reverse (RevMR) MR analyses with inverse variance weighted (IVW) estimates from the 1st metabolite set (metabolite ratios).

Metabolites having increased levels associated with T2D occurrence are presented in red and the ones with decreased levels in blue. Non-significant metabolites, or metabolites not replicated in the 2nd metabolite set are colored in grey. Metabolites are grouped according to metabolite classes. The y-axis corresponds to the q-value (qval).

https://doi.org/10.1371/journal.pgen.1011346.g003

We identified 78 metabolites significant in the forward direction, i.e., that showed a causal association with T2D risk, mostly spanning lipid classes, especially low-density lipoproteins (LDL) with decreased levels increasing the risk of T2D. While counterintuitive, these results are concordant with previous work using data from the UKBB [4,18], and a recent study describing genetic variants associated with both higher T2D risk and lower LDL levels [40]. Increasing evidence from observational studies also suggests that individuals with low LDL-C have a higher risk of developing T2D [41,42], these observations being retained when restricting the analyses to individuals not being under statin medication. Statin medication, used to lower LDL-C levels, is known to be associated with an increased risk of developing T2D [37,38]. To avoid potential bias in the MR analyses, we performed an additional sensitivity analysis in the forward direction in the second released metabolite restricted to individuals not taking statin medication through one-sample MR within the UKBB. We observed the same negative direction of effect for all LDL-related metabolites, including LDL-C, which retained FDR significance (Table D in S3 Appendix). The magnitude of the effect was lower, which could be due to the lower statistical power of one-sample MR compared to two-sample MR using large T2D summary statistics. These results suggest that the causal effect of LDL-related metabolites is independent of statin medication, in line with previous studies showing that, while statins have a causal effect on LDL-C levels, there is no evidence for a direct causal impact on T2D risk [43,44]. Glucose was the strongest signal in the first dataset (OR = 4.57 [3.15–6.63], p-value = 8.73x10-16), which was replicated in the second released dataset. Additionally, some ratios in high density lipoproteins (HDL) and VLDL classes were found to be causally associated with T2D risk. We found no evidence of a causal role of absolute triglycerides (TG) levels on T2D, although it has been described as a predictor of T2D risk [17,45]. We replicated findings from Mosley et al. [16], which did not find evidence of a causal role of BCAAs on the risk of T2D. Almost all of the significant metabolites in the forward direction were also found to be associated in the reverse direction, highlighting the complex interplay between metabolite profiles and T2D liability.

We found 183 metabolites to be significantly associated in the reverse direction, of which 114 were not found in the forward direction. This includes a negative effect of T2D liability on large and very large HDL, intermediate-density lipoproteins (IDL) and LDL, while the opposite trend is observed for large and extremely large VLDL. Consistent trends were observed for lipoprotein ratios with T2D liability being causally associated with decreases in cholesterol and cholesteryl esters fractions, and with increases in TG ratios. Fatty acids percentages were also affected by T2D liability, with decreases in PUFA and omega-6 percentages, but increases in MUFA percentage. Finally, we observed causal effects of T2D liability on all BCAAs, as well as on tyrosine and alanine. We further compared our results with a reverse MR study that was carried out by Smith et al. on the first release dataset of metabolites from the UKBB [18] using T2D summary statistics that included UKBB samples, and observed a high correlation of the effect estimates and p-values with our study (Fig B in S2 Appendix). Here, we report replication of 93% of the findings from Smith et al. [18] in the second release dataset of metabolites from the UKBB without a potential bias due to sample overlap (Table C in S3 Appendix).

Interaction QTL

We investigated whether there was a different effect of the genetic variants on metabolite profiles, which could explain the causal effects of T2D liability on metabolite profiles, through an interaction mQTL analysis. We identified 222 variants having a significant interaction between the mQTL effect and T2D status, corresponding to 14 metabolites, including glycine and various lipids. Of these, 40 pairs of variants were replicated in the EstBB (Table E in S3 Appendix), covering four metabolites: percentage of free cholesterol in small HDL (S_HDL_FC_pct), percentage of phospholipids in large LDL (L_LDL_PL_pct), percentage of triglycerides in large VLDL (L_VLDL_TG_pct) and percentage of free cholesterol in large VLDL (L_VLDL_FC_pct). These significant interactions mapped to two intergenic regions, one shared by all metabolites except S_HDL_FC_pct, and the second showing significant genetic interactions with T2D status only for S_HDL_FC_pct (Fig C in S2 Appendix). The genetic associations of these variants with metabolite levels are stronger in the control group than in the T2D patients’ group, with some of the variants in the interaction regions being a significant mQTL only in the control group. For instance, rs6073958 is associated with lower levels of S_HDL_FC_pct in the controls group (beta = -0.146, p<1e-300), but not in the patients’ group (beta = -0.002, p = 0.93) (Fig 4).

thumbnail
Fig 4. Boxplot showing metabolite levels according to the genotype in T2D patients and controls (the most significant variant for each of the four metabolites replicated in the EstBB are represented).

The metabolite levels represented are inverse-normal transformed and adjusted for the covariates used in the interaction QTL analyses. For each tested variant (ID is in the form chr_pos_NonEffectAllele_EffectAllele), its frequency in the control and in the T2D patients’ group is provided, along with the beta and p-value of association with metabolite levels. The overall p-value of the interaction test is also given.

https://doi.org/10.1371/journal.pgen.1011346.g004

L_VLDL_TG_pct was significant in both MR directions, while S_HDL_FC_pct and L_VLDL_FC_pct were only significant in the forward and reverse MR analysis respectively (Fig 5). All of the interaction variants replicated in the EstBB are mQTLs for the corresponding metabolites in the overall UKBB cohort but are not associated with T2D risk in the latest T2D GWAS meta-analysis [3]. These findings suggest that the different genetic regulation of metabolite levels observed between T2D patients and controls is a consequence rather than a cause of the disease, in line with the results of the reverse MR for S_HDL_FC_pct and L_VLDL_TG_pct. This also goes in line with the observed interaction effects with the different genetic regulation of metabolites decreasing the differences in metabolite levels between cases and controls. The results remained consistent after adjustment for BMI, lipid-lowering medication or metformin medication (Table F in S3 Appendix).

thumbnail
Fig 5. Bi-directional MR results for the four metabolites significant in the interaction QTL analysis.

The top row corresponds to the forward MR (effects of metabolites on T2D), and the bottom row to the reverse MR (effects of T2D liability on metabolites). Effect sizes and standard error using the inverse variance weighted method (IVW) and the sensitivity methods are represented in red for the first set of metabolites, and in black for the second set of metabolites.

https://doi.org/10.1371/journal.pgen.1011346.g005

To gain more insights into the link between the four replicated metabolites and T2D, we looked at the most significant interaction QTLs. The most significant interaction QTL for L_VLDL_TG_pct, rs3816117, is annotated by the variant effect predictor (VEP [46]) as a modifier variant for cholesteryl ester transfer protein, (CETP, Fig D in S2 Appendix), a protein targeted by drugs to increase HDL-C and decrease LDL-C levels. The significant interaction variant for S_HDL_FC_pct, rs6073958, is a QTL variant of the PLTP gene, which is involved in the transfer of phospholipids from triglycerides to HDL and might be involved in cholesterol metabolism.

Metabolite profiles and T2D complications

We sought to better understand whether alterations in metabolite levels causally affected by T2D liability were associated with the risk of developing T2D complications (comparing T2D patients without complications, with microvascular complications, with macrovascular complications, and with both types of complications). Out of the 249 tested metabolites, 165 (66%) were associated with at least one of the complication groups, of which 156 were associated with the macrovascular group (Fig 6A). Only 3 and 6 signals were exclusive to the ‘microvascular’ and ‘both complications’ group, respectively. This pattern likely reflects the power of the analysis as the macrovascular group is three to five times larger than the two other complications groups (Table 1). Most of the significant metabolites were shared across multiple complication groups, showing that there is a metabolic signature associated with T2D complications.

thumbnail
Fig 6. Association analyses between metabolite levels and T2D complications (microvascular ‘micro’, macrovascular ‘macro’, or both complication groups ‘micro_macro’).

A: Venn diagram of the significant metabolites (meta-analysis q-values lower than 5%) across complications. B: log10(q-values) of associations between metabolite levels (separated into absolute levels and derived ratios) and each complication group.–log10(q-values) are represented for positive associations, and log10(q-values) are represented for negative associations.

https://doi.org/10.1371/journal.pgen.1011346.g006

Four metabolites were more strongly associated with microvascular complications than with macrovascular complications (Fig 6B). This is for example the case for creatinine, which was one of the strongest signals associated with microvascular complications (OR = 1.85 [1.58–2.18], p = 3.59x10-14). Creatinine is used in estimated glomerular filtration rate (eGFR) calculation, a measure used to assess kidney function, and known to be affected in nephropathy which is one of the main T2D microvascular complications [47]. Leucine is another example, for which significant association was found for the microvascular group (OR = 0.76 [0.65–0.89], p = 7.33x10-4), concordant with previous studies describing negative associations between leucine and kidney disease in T2D patients [48]. Increased levels of leucine were found to be putatively caused by an increased T2D liability in our reverse MR analysis, in line with the existing literature [6], showing opposite associations of leucine with T2D and its complications. Overall, we found the profile of associations with metabolites to be similar with the MR estimates observed with T2D liability, with 78% of the metabolites associated with at least one complication and significant in the reverse MR having concordant direction of association between the two analyses, including decreases in most cholesterol metabolites, apolipoprotein B and glycine. Out of the 165 significantly associated metabolites, 54 and 118 were significant in the forward and reverse MR analyses, respectively. Of these, glycine and glycoprotein A were significant only in the reverse direction and associated with microvascular complications, suggesting that increased risk of these complications in T2D patients could be partly explained by an impact of the T2D liability on metabolite levels. A total of 40 metabolites were associated with complications but not significant in the MR analyses, including creatinine, glutamine, and apolipoprotein A1, which were all previously associated with microvascular complications in the literature [47,49,50]. This finding points towards molecular mechanisms involving these metabolites being more specific to disease complications rather than T2D itself.

The metabolite associations with T2D complications stayed stable upon adjustment, with 164, 158 and 156 signals remaining significant upon lipid medication, BMI and metformin adjustment, respectively (Table G in S3 Appendix). Metabolites affected by metformin and BMI adjustment span VLDL and HDL classes, while for instance valine, total VLDL size, albumin and total BCAAs became significant after adjustment. When applying one-sample MR between metabolite levels and the risk of T2D complications, we found no evidence of causal associations, potentially due to the low statistical power associated of the within-T2D UKBB MR analysis. Altogether, these results show that some metabolites causally affected by T2D liability are also associated with T2D complications, while others are specific to the development of complications, even though causality is still to be demonstrated.

Discussion

In this study, we have investigated the links between plasma metabolite profiles, genetics, and the risk of T2D and subsequent complications, to evaluate the metabolic consequences of T2D. We have identified more metabolites as causally affected by T2D liability rather than having a causal effect on disease risk. Further, we have shown that T2D occurrence seems to have an impact on the genetic regulation of metabolite levels. Finally, we describe that metabolite profiles are also associated with T2D complications, though no significant causal relationship could be demonstrated in the present study.

Among the 183 metabolites causally affected by T2D liability, we found positive estimates with BCAAs, as well as with alanine and tyrosine, in agreement with previous studies [13, 18, 26], but we did not find causal effects of BCAAs on T2D risk. Tyrosine was significant in both MR directions, but with opposite direction of effect, with decreased tyrosine levels being causally linked to increased T2D risk and T2D liability having a causal effect on increased tyrosine levels. These findings are in concordance with previous MR studies on the same cohort [26] but not with observational studies, which describe positive associations between tyrosine levels and T2D risk [6]. Furthermore, we found negative associations between tyrosine levels and the three complications groups tested, in line with a previous study [24], suggesting complex relationships between this metabolite and the risk of T2D complications. Further work is required to better disentangle the effect of tyrosine on T2D etiology. Glycine was significant only in the reverse MR analysis and associated with microvascular complications only. This amino acid has been already described as exhibiting lower levels in T2D patients, which could play a role in aggravating glucose dysregulation [51]. Even though not replicated in the EstBB, glycine presented the strongest signal in the interaction QTL analysis in the UKBB, with a weaker genetic regulation in T2D patients compared to controls. A genetic dysregulation of glycine levels might, therefore, be involved in T2D complications etiology, but replication is needed in an independent cohort.

Our findings also provide insights into the role of lipoprotein classes in T2D etiology. For example, TG levels have been described to be positively associated with T2D[40] and are known to positively correlate with glucose levels, a relationship that may be exacerbated in individuals with high polygenic risk scores for T2D[52]. TG levels, which we found to be causally affected by T2D liability and for some of the related percentages significant in our interaction QTL analysis, may therefore contribute to increased glucose levels. For all other types of lipoproteins that were significant in the reverse MR analyses, T2D liability had a causal effect on decreasing their levels. This includes absolute LDL levels and to a lesser extent their related percentages, in line with previous studies [4, 18, 41, 42, 53]. However, these relationships are still debated, and caution is to be taken when interpreting results in studies not restricted to medication-free individuals. For example, Smith et al.[18] showed in a similar reverse MR setting, using age as a proxy for medication use, that lipoprotein associations could be distorted upon medication use. Lipid-related medication adjusted T2D summary statistics are not available, and even if they were, care should be taken in interpreting results from such MR analyses using adjusted summary statistics due to potential collider bias [54]. It is to note that in our one-sample forward MR sensitivity analysis restricted to UKBB participants not under statin medication, LDL-related metabolites retained significance with the same direction of effect as in the two-sample MR analyses. Overall, we observed a causal effect of T2D liability on all lipid classes, including decreases in cholesterol levels, both in HDL and LDL, and increases in triglyceride levels, characteristic of the dyslipidemia observed in T2D[55].

We identified 165 associations between metabolite levels and the risk of developing T2D complications, most of which were shared across complication groups with only few, such as creatinine and glycine, being more strongly associated with microvascular than macrovascular outcomes despite smaller sample size of the microvascular group. A total of 118 metabolites were also found to be significant in the MR analyses, with similar profiles of association. A counter example of this is leucine, which showed associations in opposite directions between the reverse MR analysis and the differential level analysis with T2D complications. The discrepancies observed in our study between the two MR directions, as well as with the risk of complications, highlight the need for further work to better understand disease trajectories of T2D and its complications. A total of 40 metabolites were associated with at least one complication group while not being significant in any of the MR analyses with T2D, suggesting mechanisms specific to T2D complications. However, causal inference to disentangle causation from association in the risk of T2D complications is warranted but challenging given the limited statistical power of MR analysis restricted to T2D patients.

In addition to investigating relationships between metabolite levels and T2D, we described for the first time, to our knowledge, differences in their genetic regulation between T2D patients and controls for 14 metabolites, of which four were replicated in an independent cohort. All of the significant variants identified were found to be significant mQTLs in the overall UKBB but were not associated with T2D risk [3]. These results suggest that deregulated levels of these metabolites are a consequence rather than a cause of the disease, in line with the observations from the bi-directional MR, where evidence is found in the reverse direction for two of these four metabolites. By performing an agnostic genome-wide scan to look for interaction signals, we have observed that all the interaction variants, which are mQTLs at a population-level, led to a decreased magnitude of metabolite genetic regulation in the T2D patients’ group compared to the control group. Additionally, we observed that these interactions resulted in attenuated differences in metabolite levels between cases and controls. The decreased magnitude of genetic regulation in cases might be explained by feedback mechanisms trying to restore homeostasis equilibrium following the occurrence of the disease. Further research is needed to validate this hypothesis and to further interpret these findings, which is challenged by the fact that significant metabolites correspond to percentages and not absolute values. On the other hand, our findings could be of clinical relevance, as for example the interaction variants identified for the percentages of L_LDL_PL, L_VLDL_TG and L_VLDL_FC in our study are regulators of CETP, a target of lipid-lowering drugs, which have been shown to correlate with diabetes incidence [56]. On the other hand, variants in the interaction region for S_HDL_FC_pct are QTL variants of the PLTP gene, which is involved in the transfer of phospholipids from triglycerides to HDL and might be involved in cholesterol metabolism. Investigating the differential genetic regulation of metabolite profiles in T2D patients might therefore help in better understanding how lipid profiles are affected upon T2D, a question still debated across studies [55].

The present work presents some limitations. We have performed bi-directional MR using a similar framework to previous studies in the UKBB based on the first release dataset of metabolite data [16, 18], with the benefit of providing replication in the second release dataset. The use of the same outcome data for the two release datasets prevented the meta-analysis of the results. In this study, we were restricted to the European ancestry DIAMANTE study from 2018 because it was the latest one with summary statistics available without the UKBB samples, which enables us to avoid sample overlap between the exposure and outcome and to limit potential related bias [12]. Our results need to be extended to non-European populations, which will require global efforts to characterize the genetic regulation of molecular traits in these populations, along with methodological developments to deal with multi-ancestry data, especially in MR studies. Additionally, our findings warrant replication in cohorts external to the UKBB, especially for the MR analyses and complication associations. Finally, our study provides useful insights into the metabolic consequences of T2D but is limited by the assayed metabolite panel, which is mostly composed of lipid-related plasma metabolites. Additional metabolite classes, as well as metabolite levels from different tissue types would help in better unravelling biological mechanisms. Furthermore, metabolites were considered individually as exposure, which does not capture overall metabolic pathways. As expected, we observed similar causal effects of highly correlated metabolites belonging to the same class. It would be of interest to integrate metabolites together, for example through multivariate MR analyses [57] within each metabolite class, to assess their relative contribution in shaping T2D risk. In this study, we have here addressed the potential confounding effect of lipid-lowering medications in our MR analyses using restricted one-sample MR. It would however be interesting to use multivariate models to include variables such as BMI to better understand the biological pathways mediating causal relationships between T2D and metabolite profiles.

While MR studies enable the assessment of causality between an exposure and an outcome, they are prone to false positives and based on the liability of the disease but not its occurrence. By using interaction models, we went one step further to describe the consequences of T2D occurrence on metabolite levels and their genetic regulation. We highlight that changes in metabolite profiles can be useful to better understand T2D disease progression, as exemplified by the metabolites associated with the risk of developing T2D complications. Altogether, our results enable a deeper understanding of the metabolic consequences of T2D and provide future directions for the study of the genetic regulation of molecular levels in T2D and its complications to better capture disease trajectory.

Supporting information

S1 Appendix. File containing additional details on the quality control and sensitivity analyses performed.

https://doi.org/10.1371/journal.pgen.1011346.s001

(DOCX)

S2 Appendix. Containing Supplementary Figures A to D.

Fig A: Workflow to define prevalent T2D cases at T0, when metabolite data have been measured. Fig B: Comparison of betas and p-values with Smith et al. 2023 for the reverse MR on the first set of metabolites [18]. Fig C: Manhattan plot of the interaction QTL analysis for the four metabolites significant and replicated in the EstBB. Fig D: Genomic regions around the most significant variants from the interaction QTL analyses, obtained from https://genome.ucsc.edu/.

https://doi.org/10.1371/journal.pgen.1011346.s002

(DOCX)

S3 Appendix. Containing the Supplementary Tables A to G.

Table A: list of ICD10 codes used to define the T2D complication groups. Table B: summary of the analyses performed with the number of individuals kept, the covariates used for the adjustment and the significance threshold. Table C: Results of the bi-directionnal MR analyses (IVW method) in the 1st and 2nd set of metabolites. Table D: Results of the one-sample MR Forward analysis (ivreg estimate) in the 2nd release set of significant metabolites from the main MR analyses. Table E: Variants significant at the genome-wide significance threshold (5e-8) in the interaction analysis and replicated in the Estonian Biobank (q-value <0.05). Table F: Sensitivity analyses for the replicated variants from the interaction QTL analysis. Table G: Results of associations between metabolite levels and the three complication groups (microvascular, macrovascular, and both micro and macro vascular).

https://doi.org/10.1371/journal.pgen.1011346.s003

(XLSX)

Acknowledgments

This research has been conducted using the UK Biobank Resource under Application Number 10205.

The Estonian Biobank research team refers to Andres Metspalu, Lili Milani, Tõnu Esko, Reedik Mägi, Mari Nelis and Georgi Hudjashov.

Data analysis in the EstBB was carried out in part in the High-Performance Computing Center of University of Tartu. The activities of the EstBB are regulated by the Human Genes Research Act, which was adopted in 2000 specifically for the operations of the EstBB.

References

  1. 1. Mahajan A, Spracklen CN, Zhang W, Ng MCY, Petty LE, Kitajima H, et al. Multi-ancestry genetic study of type 2 diabetes highlights the power of diverse populations for discovery and translation. Nat Genet. 2022 May;54(5):560–72. pmid:35551307
  2. 2. Mahajan A, Taliun D, Thurner M, Robertson NR, Torres JM, Rayner NW, et al. Fine-mapping type 2 diabetes loci to single-variant resolution using high-density imputation and islet-specific epigenome maps. Nat Genet. 2018 Nov;50(11):1505–13. pmid:30297969
  3. 3. Suzuki K, Hatzikotoulas K, Southam L, Taylor HJ, Yin X, Lorenz KM, et al. Genetic drivers of heterogeneity in type 2 diabetes pathophysiology. Nature. 2024 Feb 19. pmid:38374256
  4. 4. Julkunen H, Cichonska A, Tiainen M, Koskela H, Nybo K, Makela V, et al. Atlas of plasma NMR biomarkers for health and disease in 118,461 individuals from the UK Biobank. Nat Commun. 2023 Feb 3;14(1):604. pmid:36737450
  5. 5. Morze J, Wittenbecher C, Schwingshackl L, Danielewicz A, Rynkiewicz A, Hu FB, et al. Metabolomics and Type 2 Diabetes Risk: An Updated Systematic Review and Meta-analysis of Prospective Cohort Studies. Diabetes Care. 2022 Apr 1;45(4):1013–24. pmid:35349649
  6. 6. Wang TJ, Larson MG, Vasan RS, Cheng S, Rhee EP, McCabe E, et al. Metabolite profiles and the risk of developing diabetes. Nat Med. 2011 Apr;17(4):448–53. pmid:21423183
  7. 7. Sun Y, Gao HY, Fan ZY, He Y, Yan YX. Metabolomics Signatures in Type 2 Diabetes: A Systematic Review and Integrative Analysis. J Clin Endocrinol Metab. 2020 Apr 1;105(4). pmid:31782507
  8. 8. Jin Q, Ma RCW. Metabolomics in Diabetes and Diabetic Complications: Insights from Epidemiological Studies. Cells. 2021 Oct 21;10(11). pmid:34831057
  9. 9. Nightingale Health Biobank Collaborative G, Jeffrey CB, Tõnu E, Krista F, Luke J-D, Pekka J, et al. Metabolomic and genomic prediction of common diseases in 477,706 participants in three national biobanks. medRxiv. 2023:2023.06.09.23291213.
  10. 10. Bragg F, Trichia E, Aguilar-Ramirez D, Besevic J, Lewington S, Emberson J. Predictive value of circulating NMR metabolic biomarkers for type 2 diabetes risk in the UK Biobank study. BMC Med. 2022 May 3;20(1):159.
  11. 11. Buergel T, Steinfeldt J, Ruyoga G, Pietzner M, Bizzarri D, Vojinovic D, et al. Metabolomic profiles predict individual multidisease outcomes. Nat Med. 2022 Nov;28(11):2309–20. pmid:36138150
  12. 12. Sanderson E, Glymour MM, Holmes MV, Kang H, Morrison J, Munafo MR, et al. Mendelian randomization. Nat Rev Methods Primers. 2022 Feb 10;2.
  13. 13. Porcu E, Gilardi F, Darrous L, Yengo L, Bararpour N, Gasser M, et al. Triangulating evidence from longitudinal and Mendelian randomization studies of metabolomic biomarkers for type 2 diabetes. Sci Rep. 2021 Mar 18;11(1):6197. pmid:33737653
  14. 14. Lotta LA, Scott RA, Sharp SJ, Burgess S, Luan J, Tillin T, et al. Genetic Predisposition to an Impaired Metabolism of the Branched-Chain Amino Acids and Risk of Type 2 Diabetes: A Mendelian Randomisation Analysis. PLoS Med. 2016 Nov;13(11):e1002179. pmid:27898682
  15. 15. Yin X, Li J, Bose D, Okamoto J, Kwon A, Jackson AU, et al. Metabolome-wide Mendelian randomization characterizes heterogeneous and shared causal effects of metabolites on human health. medRxiv. 2023 Jun 29. pmid:37425837
  16. 16. Mosley JD, Shi M, Agamasu D, Vaitinadin NS, Murthy VL, Shah RV, et al. Branched-chain amino acids and type 2 diabetes: a bidirectional Mendelian randomization analysis. Obesity (Silver Spring). 2024 Feb;32(2):423–35. pmid:38269471
  17. 17. Liu J, van Klinken JB, Semiz S, van Dijk KW, Verhoeven A, Hankemeier T, et al. A Mendelian Randomization Study of Metabolite Profiles, Fasting Glucose, and Type 2 Diabetes. Diabetes. 2017 Nov;66(11):2915–26.
  18. 18. Smith ML, Bull CJ, Holmes MV, Davey Smith G, Sanderson E, Anderson EL, et al. Distinct metabolic features of genetic liability to type 2 diabetes and coronary artery disease: a reverse Mendelian randomization study. EBioMedicine. 2023 Apr;90:104503. pmid:36870196
  19. 19. Karjalainen MK, Karthikeyan S, Oliver-Williams C, Sliz E, Allara E, Fung WT, et al. Genome-wide characterization of circulating metabolic biomarkers. Nature. 2024 Mar 6. pmid:38448586
  20. 20. Bar N, Korem T, Weissbrod O, Zeevi D, Rothschild D, Leviatan S, et al. A reference map of potential determinants for the human serum metabolome. Nature. 2020 Dec;588(7836):135–40. pmid:33177712
  21. 21. Litwak L, Goh SY, Hussein Z, Malek R, Prusty V, Khamseh ME. Prevalence of diabetes complications in people with type 2 diabetes mellitus and its association with baseline characteristics in the multinational A1chieve study. Diabetol Metab Syndr. 2013 Oct 24;5(1):57. pmid:24228724
  22. 22. Tomofuji Y, Suzuki K, Kishikawa T, Shojima N, Hosoe J, Inagaki K, et al. Identification of serum metabolome signatures associated with retinal and renal complications of type 2 diabetes. Commun Med (Lond). 2023 Jan 9;3(1):5. pmid:36624208
  23. 23. Harris K, Oshima M, Sattar N, Wurtz P, Jun M, Welsh P, et al. Plasma fatty acids and the risk of vascular disease and mortality outcomes in individuals with type 2 diabetes: results from the ADVANCE study. Diabetologia. 2020 Aug;63(8):1637–47. pmid:32385604
  24. 24. Welsh P, Rankin N, Li Q, Mark PB, Wurtz P, Ala-Korpela M, et al. Circulating amino acids and the risk of macrovascular, microvascular and mortality outcomes in individuals with type 2 diabetes: results from the ADVANCE trial. Diabetologia. 2018 Jul;61(7):1581–91. pmid:29728717
  25. 25. Sudlow C, Gallacher J, Allen N, Beral V, Burton P, Danesh J, et al. UK biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS Med. 2015 Mar;12(3):e1001779. pmid:25826379
  26. 26. Jacky Man Yuen M, Baoting H, Tommy Hon Ting W, Ying L, Shan L, Kenneth L, et al. Evaluating the role of amino acids in type 2 diabetes risk: a Mendelian randomization study. medRxiv. 2023:2023.08.27.23294702.
  27. 27. Ritchie SC, Surendran P, Karthikeyan S, Lambert SA, Bolton T, Pennells L, et al. Quality control and removal of technical variation of NMR metabolic biomarker data in ~120,000 UK Biobank participants. Sci Data. 2023 Jan 31;10(1):64. pmid:36720882
  28. 28. Auer PL, Reiner AP, Leal SM. The effect of phenotypic outliers and non-normality on rare-variant association testing. Eur J Hum Genet. 2016 Aug;24(8):1188–94. pmid:26733287
  29. 29. Gopalan A, Mishra P, Alexeeff SE, Blatchins MA, Kim E, Man AH, et al. Prevalence and predictors of delayed clinical diagnosis of Type 2 diabetes: a longitudinal cohort study. Diabet Med. 2018 Dec;35(12):1655–62. pmid:30175870
  30. 30. Eastwood SV, Mathur R, Atkinson M, Brophy S, Sudlow C, Flaig R, et al. Algorithms for the Capture and Adjudication of Prevalent and Incident Diabetes in UK Biobank. PLoS One. 2016;11(9):e0162388. pmid:27631769
  31. 31. Borges MC, Haycock PC, Zheng J, Hemani G, Holmes MV, Davey Smith G, et al. Role of circulating polyunsaturated fatty acids on cardiovascular diseases risk: analysis using Mendelian randomization and fatty acid genetic association data from over 114,000 UK Biobank participants. BMC Med. 2022 Jun 13;20(1):210. pmid:35692035
  32. 32. Mbatchou J, Barnard L, Backman J, Marcketta A, Kosmicki JA, Ziyatdinov A, et al. Computationally efficient whole-genome regression for quantitative and binary traits. Nat Genet. 2021 Jul;53(7):1097–103. pmid:34017140
  33. 33. Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, Bender D, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007 Sep;81(3):559–75. pmid:17701901
  34. 34. Li J, Ji L. Adjusting multiple testing in multilocus analyses using the eigenvalues of a correlation matrix. Heredity (Edinb). 2005 Sep;95(3):221–7. pmid:16077740
  35. 35. Hemani G, Zheng J, Elsworth B, Wade KH, Haberland V, Baird D, et al. The MR-Base platform supports systematic causal inference across the human phenome. Elife. 2018 May 30;7. pmid:29846171
  36. 36. Bowden J, Spiller W, Del Greco MF, Sheehan N, Thompson J, Minelli C, et al. Improving the visualization, interpretation and analysis of two-sample summary data Mendelian randomization via the Radial plot and Radial regression. Int J Epidemiol. 2018 Aug 1;47(4):1264–78. pmid:29961852
  37. 37. Laakso M, Fernandes Silva L. Statins and risk of type 2 diabetes: mechanism and clinical implications. Front Endocrinol (Lausanne). 2023;14:1239335. pmid:37795366
  38. 38. Cholesterol Treatment Trialists’ Collaboration. Electronic address cnoau, Cholesterol Treatment Trialists C. Effects of statin therapy on diagnoses of new-onset diabetes and worsening glycaemia in large-scale randomised blinded statin trials: an individual participant data meta-analysis. Lancet Diabetes Endocrinol. 2024 May;12(5):306–19.
  39. 39. Willer CJ, Li Y, Abecasis GR. METAL: fast and efficient meta-analysis of genomewide association scans. Bioinformatics. 2010 Sep 1;26(17):2190–1. pmid:20616382
  40. 40. Klimentidis YC, Arora A, Newell M, Zhou J, Ordovas JM, Renquist BJ, et al. Phenotypic and Genetic Characterization of Lower LDL Cholesterol and Increased Type 2 Diabetes Risk in the UK Biobank. Diabetes. 2020 Oct;69(10):2194–205. pmid:32493714
  41. 41. Andersson C, Lyass A, Larson MG, Robins SJ, Vasan RS. Low-density-lipoprotein cholesterol concentrations and risk of incident diabetes: epidemiological and genetic insights from the Framingham Heart Study. Diabetologia. 2015 Dec;58(12):2774–80. pmid:26409460
  42. 42. Feng Q, Wei WQ, Chung CP, Levinson RT, Sundermann AC, Mosley JD, et al. Relationship between very low low-density lipoprotein cholesterol concentrations not due to statin therapy and risk of type 2 diabetes: A US-based cross-sectional observational study using electronic health records. PLoS Med. 2018 Aug;15(8):e1002642. pmid:30153257
  43. 43. Khankari NK, Keaton JM, Walker VM, Lee KM, Shuey MM, Clarke SL, et al. Using Mendelian randomisation to identify opportunities for type 2 diabetes prevention by repurposing medications used for lipid management. EBioMedicine. 2022 Jun;80:104038. pmid:35500537
  44. 44. Smit RAJ, Trompet S, Leong A, Goodarzi MO, Postmus I, Warren H, et al. Statin-induced LDL cholesterol response and type 2 diabetes: a bidirectional two-sample Mendelian randomization study. Pharmacogenomics J. 2020 Jun;20(3):462–70. pmid:31801993
  45. 45. Zhao J, Zhang Y, Wei F, Song J, Cao Z, Chen C, et al. Triglyceride is an independent predictor of type 2 diabetes among middle-aged and older adults: a prospective study with 8-year follow-ups in two cohorts. J Transl Med. 2019 Dec 3;17(1):403. pmid:31801571
  46. 46. McLaren W, Gil L, Hunt SE, Riat HS, Ritchie GR, Thormann A, et al. The Ensembl Variant Effect Predictor. Genome Biol. 2016 Jun 6;17(1):122. pmid:27268795
  47. 47. Levey AS, Coresh J. Chronic kidney disease. Lancet. 2012 Jan 14;379(9811):165–80. pmid:21840587
  48. 48. Tofte N, Vogelzangs N, Mook-Kanamori D, Brahimaj A, Nano J, Ahmadizar F, et al. Plasma Metabolomics Identifies Markers of Impaired Renal Function: A Meta-analysis of 3089 Persons with Type 2 Diabetes. J Clin Endocrinol Metab. 2020 Jul 1;105(7). pmid:32271379
  49. 49. Rhee SY, Jung ES, Park HM, Jeong SJ, Kim K, Chon S, et al. Plasma glutamine and glutamic acid are potential biomarkers for predicting diabetic retinopathy. Metabolomics. 2018;14(7):89. pmid:29950956
  50. 50. Moosaie F, Davatgari RM, Firouzabadi FD, Esteghamati S, Deravi N, Meysamie A, et al. Lipoprotein(a) and Apolipoproteins as Predictors for Diabetic Retinopathy and Its Severity in Adults With Type 2 Diabetes: A Case-Cohort Study. Can J Diabetes. 2020 Jul;44(5):414–21.
  51. 51. Yan-Do R, MacDonald PE. Impaired "Glycine"-mia in Type 2 Diabetes and Potential Mechanisms Contributing to Glucose Homeostasis. Endocrinology. 2017 May 1;158(5):1064–73. pmid:28323968
  52. 52. Lim JE, Kang JO, Ha TW, Jung HU, Kim DJ, Baek EJ, et al. Gene-environment interaction in type 2 diabetes in Korean cohorts: Interaction of a type 2 diabetes polygenic risk score with triglyceride and cholesterol on fasting glucose levels. Genet Epidemiol. 2022 Jul;46(5–6):285–302. pmid:35481584
  53. 53. Besseling J, Kastelein JJ, Defesche JC, Hutten BA, Hovingh GK. Association between familial hypercholesterolemia and prevalence of type 2 diabetes mellitus. JAMA. 2015 Mar 10;313(10):1029–36. pmid:25756439
  54. 54. Hartwig FP, Tilling K, Davey Smith G, Lawlor DA, Borges MC. Bias in two-sample Mendelian randomization when using heritable covariable-adjusted summary associations. Int J Epidemiol. 2021 Nov 10;50(5):1639–50. pmid:33619569
  55. 55. Higuchi S, Izquierdo MC, Haeusler RA. Unexplained reciprocal regulation of diabetes and lipoproteins. Curr Opin Lipidol. 2018 Jun;29(3):186–93. pmid:29708925
  56. 56. Masson W, Lobo M, Siniawski D, Huerin M, Molinero G, Valero R, et al. Therapy with cholesteryl ester transfer protein (CETP) inhibitors and diabetes risk. Diabetes Metab. 2018 Dec;44(6):508–13. pmid:29523487
  57. 57. Sanderson E, Spiller W, Bowden J. Testing and correcting for weak and pleiotropic instruments in two-sample multivariable Mendelian randomization. Stat Med. 2021 Nov 10;40(25):5434–52. pmid:34338327