Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Relationship between insulin resistance surrogate markers with diabetes and dyslipidemia: A Bayesian network analysis of Korean adults

  • Jaeyeop Choi,

    Roles Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Resources, Software, Validation, Visualization, Writing – original draft

    Affiliation Department of Applied Statistics, Gachon University, Seongnam-si, Gyeonggi-do, Korea

  • Jonghyun Kim,

    Roles Data curation, Methodology, Software, Validation

    Affiliation Department of Applied Statistics, Gachon University, Seongnam-si, Gyeonggi-do, Korea

  • Hyun Sook Oh

    Roles Conceptualization, Investigation, Methodology, Project administration, Resources, Supervision, Validation, Writing – original draft, Writing – review & editing

    hoh@gachon.ac.kr (HSO)

    Affiliation Department of Applied Statistics, Gachon University, Seongnam-si, Gyeonggi-do, Korea

Abstract

Insulin resistance (IR) can be optimally assessed using the euglycemic clamp, but practical clinical limitations necessitate surrogate markers. This study leveraged the Bayesian network analysis to evaluate three established IR markers: the Homeostatic Model Assessment of IR (HOMA-IR) using insulin level and fasting blood glucose (FBG), TG-Glucose (TyG) index using triglycerides (TG) and FBG, and TG-to-HDL ratio (TG/HDL ratio) using TG and high-density lipoprotein (HDL), based on the Korean National Health and Nutrition Examination Survey data (2019–2021). Our analysis revealed a sequential association pattern (TG/HDL ratio → TyG index → HOMA-IR), positioning the TyG index as a central connecting marker. The HOMA-IR exhibited strong predictive power for diabetes, while the TG/HDL ratio was most effective for assessing dyslipidemia. However, both had limited crossover utility. In contrast, the TyG index bridged this gap, demonstrating robust predictive capability for both conditions. The Markov blanket analysis illuminated the distinctive metabolic signatures of each marker: The TyG index displayed balanced glucose-lipid metabolic contributions, the HOMA-IR predominantly reflected glucose metabolism and obesity characteristics, and the TG/HDL ratio emphasized lipid metabolism. Notably, the TyG index’s predictive performance showed significant enhancement when integrated with obesity information, contrasting with the HOMA-IR’s minimal response owing to its inherent incorporation of obesity characteristics. These findings position the TyG index as a superior clinical marker, offering both comprehensive predictive capability and enhanced performance through synergistic integration with obesity measures. While each marker demonstrated reliability, the TyG index’s unique combination of versatility and scalability establishes it as an effective tool for comprehensive metabolic risk assessment.

Introduction

Insulin resistance (IR), a condition where tissues show reduced responsiveness to normal insulin levels, fundamentally disrupts glucose and fatty acid regulation [1,2]. This impairment of insulin’s role in mediating the transition between nutrient production and storage contributes significantly to chronic conditions such as diabetes and dyslipidemia.

Hyperglycemia damages blood vessels by triggering inflammation, promoting lipid synthesis, and inducing oxidative stress, leading to vascular remodeling and plaque formation [35]. These changes increase the risk of complications across multiple organs. Accurate IR assessment is crucial for prevention and management. While the euglycemic clamp is the gold standard for measuring IR, its invasiveness, time demands, and cost limit its clinical use [68]. The HOMA-IR offers an alternative approach by evaluating IR through fasting insulin and glucose levels; numerous studies have validated its effectiveness [911]. However, the HOMA-IR’s utility is constrained by the irregular inclusion of insulin measurements in routine blood work and lack of standardized measurement protocols [12].

To overcome these limitations, researchers have developed non-insulin-based indices, including the TyG index and the TG/HDL ratio. These simpler alternatives have been validated as reliable IR markers, with a few studies suggesting the TyG index may be superior to both the HOMA-IR and the TG/HDL ratio in predicting diabetes and cardiovascular outcomes [1318]. However, direct comparisons between these three indicators remain scarce.

Despite the TyG index’s promise, most research has relied on traditional statistical methods such as logistic regression and Cox proportional hazards models. These approaches, while useful for assessing predictive performance, cannot capture non-linear interactions among variables or fully reflect the complexity of biomedical data [19].

The Bayesian network analysis offers a more comprehensive approach by modeling probabilistic relationships without predefined dependent or explanatory variables [20]. This methodology reveals non-linear interdependencies, which traditional models often overlook, by exploring relationships in an unsupervised manner, while also enabling probability estimations through simulations under various conditional scenarios [2022]. Furthermore, its visual representation through Directed Acyclic Graphs (DAGs) enhances the interpretation of variable relationships, rendering it particularly valuable in medical and life science applications [2326].

This study aims to compare the HOMA-IR, TyG index, and TG/HDL ratio as IR surrogate markers for diabetes and dyslipidemia using the Bayesian network analysis. Beyond simple comparison, we explore the pathways through which various clinical and behavioral factors influence these markers. Additionally, we assess their scalability by examining their performance when incorporating key variables such as obesity and hypertension—significant contributors to both diabetes and dyslipidemia. This comprehensive approach provides deeper insights into the markers’ predictive capabilities and their broader clinical applicability.

Materials and methods

Study design and data extraction

This study analyzed data from the Korea National Health and Nutrition Survey (KNHANES) conducted between 2019 and 2021. The KNHANES, established in 1998 by the Korea Centers for Disease Control and Prevention (KCDC), is an ongoing cross-sectional survey designed to assess and enhance the health and nutritional status of the South Korean population. The survey implements a stratified multistage cluster sampling method to ensure representative sampling of non-institutionalized Koreans aged 1 year and older. Detailed methodology is available on the KNHANES Website. The KNHANES 2019–2021 received approval from the Institutional Review Board of the KCDC, in accordance with the principles outlined in the Declaration of Helsinki. The approval numbers are 2018-01-03-C-A, 2018-01-03-2C-A, and 2018-01-03-5C-A. The requirement for written informed consent was waived by the Institutional Review Board, as the analysis was based on anonymous and de-identified data provided by KNHANES. No additional personal or sensitive data was collected.

The 2019–2021 KNHANES dataset was selected as it represents the most recent period with comprehensive insulin measurements, incorporating health surveys, health examinations, and nutrition surveys. From an initial sample of 22,559 participants, we first excluded 5,390 participants because of incomplete health survey and examination data. We excluded 8,921 participants outside the age range of 40–70 years to focus on the population with highest risk and prevalence of diabetes and dyslipidemia, while avoiding younger individuals with lower incidence rates and elderly subjects with potentially confounding comorbidities [27,28]. Further refinement of the study population involved excluding 726 participants based on their medical history: 366 participants with cardiovascular disease history (including stroke, myocardial infarction, and angina), 277 participants with cancer history (including gastric, liver, colorectal, lung, and thyroid cancers), and 83 participants with kidney disease history. After applying these exclusion criteria, the final study population comprised 8,195 participants (Fig 1).

thumbnail
Fig 1. Flow chart for data selection.

Abbreviation: KNHANES, Korea National Health and Nutrition Survey.

https://doi.org/10.1371/journal.pone.0323329.g001

IR indices

All blood samples were collected after an overnight fast of at least 8 hours, in accordance with the KNHANES protocol. In the KNHANES dataset, insulin levels were measured using the Cobas 8000 e801 analyzer (Roche, Germany) with Elecsys Insulin reagents (Roche, Germany). FBG, TG, and HDL levels were measured using the Labospect 008AS system (Hitachi, Japan). The insulin-based index, the HOMA-IR, was calculated as: HOMA-IR = Insulin (μIU/mL) × FBG (mg/dL)/ 405. Non-insulin-based indices were defined as follows: TyG index = ln[TG (mg/dL) × FBG (mg/dL)/ 2], TG/HDL ratio = TG (mg/dL)/ HDL (mg/dL) [27,28]. Participants were categorized into four groups (Q1–Q4) based on the quartiles of each index.

Diabetes and dyslipidemia

Diabetes was defined according to the Korean Diabetes Association guidelines as having a diagnosis of diabetes by a physician, FBG ≥ 126 mg/dL, a hemoglobin A1c (HbA1c) ≥6.5%, or a prescription for anti-diabetic medication or insulin therapy [29]. Dyslipidemia was defined according to the Korea Society of Lipid and Atherosclerosis guidelines as meeting any of the following criteria: a diagnosis of dyslipidemia by a physician, a total cholesterol (TCHOL) ≥240 mg/dL, a low-density lipoprotein cholesterol (LDL) ≥160 mg/dL, HDL ≤ 40 mg/dL, TG ≥ 200 mg/dL, or the prescription of medication for dyslipidemia [30].

Other variables

Socio-demographic factors considered included age, sex, education, and household income. Age was categorized into three groups: 40–49, 50–59, and 60–69 years old. Education level was divided into three categories: Middle school or below, High school, and College or higher. Household income was classified into three groups based on income quartiles as reported in the KNHANES survey: Low/Medium low (lowest and second quartiles), Medium high (third quartile), and High (highest quartile) [31].

Health behaviors included sleep duration, drinking frequency, smoking status, leisure activity, work activity, and muscle activity. Sleep was categorized as < 6 hours or ≥ 6 hours per day, following established thresholds for sleep insufficiency associated with metabolic risk [32]. Drinking was classified as < 1 per month or ≥ 1 per month. Smoking status was classified as no (smoked <5 packs or does not smoke) or yes (smoked ≥5 packs or currently smokes) based on previous epidemiological studies on smoking intensity [33]. Leisure and work activity were categorized as yes or no based on high- or moderate-intensity physical activity at least once a week [34]. Muscle activity was classified as yes if engaging in muscle-strengthening activities at least once a week, and no otherwise.

Clinical variables included systolic blood pressure (SBP), diastolic blood pressure (DBP), FBG, HbA1c, HDL, TG, TCHOL, and LDL. SBP was categorized as < 120 mmHg (Normal), 120–139 mmHg (Caution), and ≥ 140 mmHg (Danger). DBP was classified as < 80 mmHg (Normal), 80–89 mmHg (Caution), and ≥ 90 mmHg (Danger). FBG was categorized as < 100 mg/dL (Normal), 100–125 mg/dL (Caution), and ≥ 126 mg/dL (Danger). HbA1c was divided into <5.7% (Normal), 5.7–6.3% (Caution), and ≥ 6.4% (Danger). HDL was classified as < 40 mg/dL (Danger) and ≥ 40 mg/dL (Normal). TG was categorized as < 150 mg/dL (Normal), 150–199 mg/dL (Caution), and ≥ 200 mg/dL (Danger). TCHOL was classified as < 200 mg/dL (Normal), 200–239 mg/dL (Caution), and ≥ 240 mg/dL (Danger). LDL was classified as < 129 mg/dL (Normal), 130–189 mg/dL (Caution), and ≥ 190 mg/dL (Danger). For unmeasured LDL, Friedewald’s formula was appplied: LDL = TCHOL - HDL - (TG/5) [35].

Hypertension was defined according to the Korean Society of Hypertension guidelines as having a diagnosis of hypertension by a physician, SBP ≥ 140 mmHg, DBP ≥ 90 mmHg, or being prescribed anti-hypertensive medication [36]. Obesity was defined using three different indicators: body mass index (BMI), waist circumference (WC), and waist-to-height ratio (WHtR). Participants with BMI ≥ 25 kg/m² were classified as “yes,” otherwise “no.” Males with WC ≥ 90 cm and females with WC ≥ 85 cm were classified as “yes,” otherwise “no.” Participants with WHtR ≥ 0.5 were classified as “yes,” otherwise “no” [37,38].

Statistical analysis

The characteristics of the participants were described based on diabetes and dyslipidemia status, respectively. For each condition, participants were classified into two groups (Yes/No), and comparisons were made to assess variable distributions according to diabetes and dyslipidemia status. Variables were categorized and expressed as weighted percentages, and the Rao-Scott chi-square tests, accounting for survey weights and sampling design, were performed to identify significant differences between groups.

For each IR index, participants were stratified into quartiles: Q1 (≤ first quartile), Q2 (between the first and second quartiles), Q3 (between the second and third quartiles), and Q4 (> third quartile). Distribution across quartile groups was analyzed by diabetes and dyslipidemia status, with tests for significant differences. Statistical significance was defined a P-value < 0.05 in all tests.

Additionally, weighted mutual information was calculated for diabetes and dyslipidemia to assess the mutual dependence of the other variables. The performance of the HOMA-IR, TyG index, and TG/HDL ratio as predictive markers was compared using the receiver operating characteristic (ROC) curves and the area under the curve (AUC), with higher AUC values indicating greater diagnostic performance. Subsequently, a Bayesian network analysis was conducted to explore the relationship structure between all variables.

Bayesian network

A Bayesian network is a graphical model for probabilistic inference that uses a DAG to represent the joint distribution of variables and their conditional independence relationships [39]. In a DAG, nodes represent random variables , and arcs represent probabilistic dependencies [19,40]. If there is an arc from node to node , then is the parent node of . The joint probability distribution of variables is:

where denotes the set of parent nodes. A Bayesian network is defined by the graph, and the probability distribution, [41].

This study uses DAG analysis to reveal complex relationships between IR markers and metabolic disorders. Unlike traditional statistical methods, DAGs model probabilistic relationships, visualizing direct and indirect connections. This approach illuminates the interplay between IR markers and their relationships with diabetes, dyslipidemia, and other clinical and behavioral factors.

Additionally, the Markov blanket of a node includes its parents, children, and the parents of its children. It is the minimal set of information required to explain while making all other variables conditionally independent [4244].

In this study, the Markov blanket analysis identifies variables with direct probabilistic relationships to each IR measure, filtering out indirect connections. This approach determines the minimal set of variables that provide complete information about each index’s relationships within the network, clarifying how these IR measures relate to clinical factors and to each other in the metabolic structure.

There are three main methods for network structure learning: Score-based learning, Constraint-based learning, and Hybrid learning [45]. Score-based learning finds the graph with the highest objective function, using scores such as the Bayesian score or Information-theoretic score. Constraint-based learning uses conditional independence tests to build the graph. Hybrid learning combines both approaches by restricting the search space through tests and selecting the highest-scoring graph [46].

In this study, network structure learning was conducted using the score-based Hill-climbing algorithm with the Akaike Information Criterion (AIC) as the score. To enhance model robustness, 1,000 bootstrap iterations were performed, and the network structure was refined by including only arcs that appeared in at least 75% of the iterations through model averaging [47-48]. Conditional probability estimation was performed using the “cpquery” function from the “bnlearn” package. All analyses were performed using R version 4.3.2 and the “bnlearn” package version 4.9.1.

Results

General characteristics of the study population

The study sample characteristics was summarized by diabetes and dyslipidemia status (Table 1). Diabetes prevalence was 15.20%, and dyslipidemia was 51.86%. While the overall sex ratio was balanced, males were more prevalent in both groups. Among diabetes patients, frequency increased with age, peaking in the 40s, followed by the 50s and 60s. Dyslipidemia was most common in the 40s, followed by the 60s and 50s.

thumbnail
Table 1. Summary of study participants by diabetes and dyslipidemia.

https://doi.org/10.1371/journal.pone.0323329.t001

The Rao-Scott tests showed that most variables, except sleep duration, drinking frequency, and work activity, were significantly associated with diabetes or dyslipidemia. Obesity (BMI) rates were 38.62% overall, 58.14% in diabetes, and 48.31% in dyslipidemia. Hypertension prevalence was 32.51% overall, 54.67% in diabetes, and 41.61% in dyslipidemia.

Distribution of IR indices across quartile groups

Table 2 shows participant distribution across IR index quartiles, stratified by diabetes and dyslipidemia status. Patients with diabetes showed high IR markers, with the majority falling in the highest quartile group (Q4 or Q3) across all indices. Specifically, for TyG index, 52.66% were in Q4 and 25.53% in Q3. HOMA-IR values were in Q4 at 58.09%, with 21.21% in Q3. The TG/HDL ratio showed 38.28% in Q4 and 28.90% in Q3.

Among dyslipidemia patients, a similar pattern emerged but with slightly different distribution. For TyG index, 42.81% were in Q4 and 25.71% in Q3. HOMA-IR measurements showed 35.48% in Q4 and 28.10% in Q3. The TG/HDL ratio showed 44.48% in Q4 and 24.84% in Q3.

Bayesian network analysis

For diabetes, the HOMA-IR demonstrated the highest predictive performance with both the greatest mutual information (5.11%) and AUC (0.76), followed by the TyG index (4.25%, 0.74) and TG/HDL ratio (1.92%, 0.64) (Figs 2A and 2B). Conversely, for dyslipidemia, the TG/HDL ratio showed the best performance, ranking highest in both mutual information (14.01%) and AUC (0.78), followed by the TyG index (12.05%, 0.77) and the HOMA-IR (5.05%, 0.69) (Figs 2C and 2D).

thumbnail
Fig 2. Mutual information and ROC curve of diabetes (A, B) and dyslipidemia (C, D).

Abbreviation: ROC, receiver operating characteristic; AUC, area under the curve. BMI, body mass index; WC, waist circumference; WHtR, waist-to-height ratio; SBP, systolic blood pressure; FBG, fasting blood glucose; HbA1c, hemoglobin A1c; TG, triglycerides; HDL, high-density lipoprotein; LDL, low-density lipoprotein; TCHOL, total cholesterol; HOMA-IR, homeostasis model assessment of insulin resistance; TyG index, triglyceride-glucose index; TG/HDL, triglyceride-to-high-density lipoprotein cholesterol ratio.

https://doi.org/10.1371/journal.pone.0323329.g002

The DAG structure of the BN model is illustrated in Fig 3. Arc thickness represents AIC values, with thicker arcs indicating stronger connections. The association between the TG/HDL ratio and TyG index was stronger than that between the TyG index and HOMA-IR. Marginal probabilities of each node are provided in the supplementary materials (S1 Fig).

thumbnail
Fig 3. A directed acyclic graph of Bayesian network obtained through hill-climbing learning algorithm with threshold 

= 0.75 by model averaging over 1,000 networks. Arc thickness represents the strength of association based on the Akaike Information Criterion (AIC). Abbreviation: BMI, body mass index; WC, waist circumference; WHtR, waist-to-height ratio; SBP, systolic blood pressure; FBG, fasting blood glucose; HbA1c, hemoglobin A1c; TG, triglycerides; HDL, high-density lipoprotein; LDL, low-density lipoprotein; TCHOL, total cholesterol; HOMA-IR, homeostasis model assessment of insulin resistance; TyG index, triglyceride-glucose index; TG/HDL ratio, triglyceride-to-high-density lipoprotein cholesterol ratio.

https://doi.org/10.1371/journal.pone.0323329.g003

All three IR indices shared FBG, a child node of diabetes, as a parent node. These indices formed a sequential connection structure flowing from TG/HDL ratio through TyG index to HOMA-IR (Fig 3) [46]. The Markov blanket analysis revealed that the TyG index was included in the Markov blankets of both HOMA-IR and TG/HDL ratio, while HOMA-IR and TG/HDL ratio did not appear in each other’s Markov blankets (Fig 4).

thumbnail
Fig 4. Markov blankets of HOMA-IR (A), TG/HDL ratio (B), and TyG index (C).

Abbreviation: BMI, body mass index; WC, waist circumference; WHtR, waist-to-height ratio; SBP, systolic blood pressure; FBG, fasting blood glucose; HbA1c, hemoglobin A1c; TG, triglycerides; HDL, high-density lipoprotein; LDL, low-density lipoprotein; TCHOL, total cholesterol; HOMA-IR, homeostasis model assessment of insulin resistance; TyG index, triglyceride-glucose index; TG/HDL, triglyceride-to-high-density lipoprotein cholesterol ratio.

https://doi.org/10.1371/journal.pone.0323329.g004

Specifically, the Markov blanket of the HOMA-IR included the TyG index and FBG, along with age and obesity indicators such as BMI, WC, and WHtR. The Markov blanket of the TyG index included the HOMA-IR, TG/HDL ratio, FBG, as well as Sex, Age, Frequency of drinking, History of smoking, TG, and HDL. The Markov blanket of the TG/HDL ratio consisted of the TyG index and FBG, as well as Sex, Education, Muscle activity, TG, and HDL.

So, Obesity indicators (BMI, WC, WHtR) were only part of the Markov blanket of the HOMA-IR and were not included in the Markov blankets of the other indices. Meanwhile, FBG acted as a d-separator for the three indices, reinforcing its role as a diagnostic marker for diabetes, with the strongest association observed with the TyG index (Fig. 3) [46].

For dyslipidemia, the lipid clinical variables TG, LDL, HDL, TCHOL, and Age were identified as parent nodes. Among these, HDL had the highest association with the TG/HDL ratio, while TG, which exhibited the highest mutual information with dyslipidemia, showed the strongest association with the TyG index (Figs 2C and 3).

Conditional probabilities

The conditional probabilities of diabetes for the HOMA-IR across quartiles Q1 to Q4 were 0.06, 0.09, 0.16, and 0.34, respectively. For the TyG index, the probabilities were 0.05, 0.10, 0.17, and 0.35. For the TG/HDL ratio, the probabilities were 0.08, 0.12, 0.19, and 0.25. These values are shown in Fig 5A.

thumbnail
Fig 5. Conditional probabilities of diabetes (A) and dyslipidemia (B) across the quartile groups (Q1-Q4) of HOMA-IR, TyG index, and TG/HDL ratio.

Abbreviation: HOMA-IR, homeostasis model assessment of insulin resistance; TyG index, triglyceride-glucose index; TG/HDL, triglyceride-to-high-density lipoprotein cholesterol ratio; Q1, 1st quartile group; Q2, 2nd quartile group; Q3, 3rd quartile group; Q4, 4th quartile group.

https://doi.org/10.1371/journal.pone.0323329.g005

Similarly, the conditional probabilities of dyslipidemia for the HOMA-IR across quartiles Q1 to Q4 were 0.43, 0.48, 0.56, and 0.64, respectively. For the TyG index, the probabilities were 0.36, 0.40, 0.49, and 0.87. For the TG/HDL ratio, the probabilities were 0.36, 0.38, 0.48, and 0.92. These values are shown in Fig 5B.

When additional information on obesity (based on BMI) was provided, the conditional probability of diabetes across quartiles of the TyG index was as illustrated in Fig 6A. For the TyG index in Q4, if the individual was non-obese (BMI = “no”), the conditional probability of diabetes was 0.29. However, when the additional information indicated obesity (BMI = “yes”), the probability increased to 0.39, demonstrating the change in diabetes probability due to the inclusion of obesity information.

thumbnail
Fig 6. Conditional probabilities of diabetes with obesity (based on BMI) given the TyG index (A) and HOMA-IR (B), and conditional probabilities of dyslipidemia with hypertension given the TG/HDL ratio (C) and TyG index (D).

Abbreviation: HOMA-IR, homeostasis model assessment of insulin resistance; TyG index, triglyceride-glucose index; TG/HDL, triglyceride-to-high-density lipoprotein cholesterol ratio; BMI, body mass index.

https://doi.org/10.1371/journal.pone.0323329.g006

However, for the HOMA-IR in Q4, the conditional probability of diabetes was similar between non-obese (BMI = “no”, 0.35) and obese (BMI = “yes”, 0.34) individuals (Fig 6B), suggesting that the diabetes risk associated with high HOMA-IR values was independent of obesity status.

When additional information on hypertension was provided, the conditional probability of dyslipidemia was as depicted in Figs 6C and 6D. For the TG/HDL ratio in Q4, the probability of dyslipidemia was 0.92 when there was no hypertension (Hypertension = “no”), and 0.91 when hypertension was present (Hypertension = “yes”), with only a minimal difference (Fig 6C). For the TyG index in Q4, the conditional probability of dyslipidemia was 0.85 without hypertension (Hypertension = “no”) and slightly increased to 0.86 with hypertension (Hypertension = “yes”) (Fig 6D).

Discussion

This study investigated the relationships between IR indicators (HOMA-IR, TyG index, TG/HDL ratio) and their associations with diabetes and dyslipidemia using the Bayesian network analysis. Our findings reveal that the TyG index serves as a comprehensive biomarker capable of predicting both diabetes and dyslipidemia simultaneously, while the HOMA-IR and the TG/HDL ratio are more effective at predicting individual conditions.

The analysis revealed a sequential connection among the three indices structured as TG/HDL ratio → TyG index → HOMA-IR, with the TyG index functioning as a connecting variable (d-separator) between the other two indices. This pattern is particularly informative because a Markov blanket identifies the complete set of variables that directly influence or are influenced by a target variable, effectively filtering out indirect relationships. The presence of TyG index in both HOMA-IR and TG/HDL ratio’s Markov blankets indicates that it provides essential information for understanding both measures. Conversely, the absence of HOMA-IR from TG/HDL ratio’s Markov blanket (and vice versa) suggests these two indices represent distinct aspects of IR that become conditionally independent when TyG index is known. This implies that while all three indices measure IR, they capture different physiological dimensions, with the TyG index serving as a central link.

For diabetes prediction, the HOMA-IR and TyG index demonstrated comparable excellence through multiple analytical approaches: both showed high mutual information and AUC values in the ROC curve (Figs 2A and 2B) and exhibited strong predictive power in the Bayesian network conditional probability analysis, consistently outperforming the TG/HDL ratio’s performance (Fig 5A). Similarly, in predicting dyslipidemia, the TG/HDL ratio and TyG index showed superior performance across all measures—mutual information, the AUC values, and the Bayesian network conditional probabilities—consistently outperforming the HOMA-IR (Figs 2C, 2D, and 5B).

The strength of connections with FBG, the central variable of diabetes, decreased from the TyG index to the HOMA-IR to the TG/HDL ratio (Fig 3). Similarly, TG, the central variable of dyslipidemia, showed the strongest connection with the TyG index, indicating its robust associations with both glucose and lipid metabolism markers.

The Markov blanket analysis revealed distinct characteristics for each index. The TyG index’s Markov blanket included FBG, TG, and HDL, reflecting balanced contributions from glucose and lipid metabolism, along with health behavior variables such as smoking history and drinking frequency (Fig 4C). The HOMA-IR’s Markov blanket primarily captured glucose metabolism and obesity characteristics, including FBG and obesity-related variables such as BMI, WC, and WHtR (Fig 4A). The TG/HDL ratio’s Markov blanket comprised muscle activity and lipid variables, emphasizing its lipid-centered nature (Fig 4B).

These patterns align with the biological basis of IR: insulin and glucose reflect IR in hepatic cells, while TG and HDL reflect IR in adipose cells [49,50]. The HOMA-IR specifically reflects hepatic IR through insulin and glucose measurements, while the TG/HDL ratio specializes in adipose cell IR through TG and HDL measurements. The TyG index’s incorporation of both FBG and TG enables comprehensive measurement of IR in both tissue types, explaining its central position in the DAG structure linking the TG/HDL ratio and HOMA-IR (Fig 3).

Previous research supports our findings. Studies have shown the TyG index’s superior performance over that of the HOMA-IR in predicting diabetes and atherosclerosis in type 2 diabetes patients, and better predictive capability than the TG/HDL ratio for diabetes risk in prediabetic individuals [15,17,51]. Che et al. [27] demonstrated the TyG index’s greater explanatory power, accounting for 45.8% of dyslipidemia variance and 27.0% of diabetes variance, compared to 40.0% and 11.8%, respectively for the TG/HDL ratio.

When incorporating obesity information, the TyG index showed significant probability value changes based on obesity presence (Fig 6A), while the HOMA-IR showed minimal changes (Fig 6B). This difference arises from the parent–child node relationship between the HOMA-IR and obesity variables, where obesity information is inherently incorporated into the HOMA-IR (Figs 3 and 4A). The TyG index’s Markov blanket excluded obesity variables, allowing direct influence of obesity information on diabetes prediction. This finding aligns with studies showing improved predictive effects of combined TyG-obesity indices (TyG-BMI, TyG-WC, TyG-WHtR) compared to the TyG index alone [5254]. For dyslipidemia, including hypertension information induced minimal probability value changes for both the TyG index and TG/HDL ratio (Figs 6C and 6D), as these indices were parent nodes in dyslipidemia-related clinical variables (Fig 3), already embedding relevant information.

Our findings have significant clinical implications. The TyG index, centrally positioned in our network, offers a cost-effective screening tool using only routine measurements. Its robust associations with both diabetes and dyslipidemia support its use for early risk assessment, potentially improving screening efficiency and resource allocation, especially in resource-limited settings. These insights could enhance early detection and targeted interventions for cardiometabolic diseases.

Limitations

Our research lacked direct IR validation against actual measurements and excluded sleep duration from the DAG structure despite its presence at lower thresholds (0.55–0.70), as it disappeared at stronger thresholds (0.75+) and showed no probabilistic influence on key variables. This exclusion likely had minimal impact, though a 2021 format change and our age-diverse sample may have contributed to this pattern. Additional limitations include potential unmeasured confounders such as dietary patterns, occupational factors, and psychiatric conditions that could influence IR pathways—a constraint inherent to observational studies, limited sample size due to recent insulin measurement data unavailability, possible result variation based on hyperparameter settings, and limited generalizability beyond the Korean population.

Finally, it is important to note that our Bayesian network analysis and the resulting DAG do not establish causal relationships between variables. The arrows in our DAG represent probabilistic associations and conditional dependencies learned from the data, rather than proven causal pathways [19]. While these associations provide valuable insights into the relationships between IR markers and metabolic disorders, causal inference would require different methodological approaches, such as interventional studies or causal inference frameworks with additional assumptions [55]. Therefore, our findings should be interpreted as revealing strong statistical associations rather than definitive causal mechanisms. Given that a network structure model was derived in this study, we propose exploring causal relationship inference based on this model as a potential direction for future research.

However, our use of the Bayesian networks offers advantages over traditional regression models by enabling unsupervised learning of probabilistic relationships and providing a visual representation through DAGs. This approach captures complex interrelationships that regression models may overlook.

Furthermore, existing methods for assessing IR and metabolic disorders are often invasive, costly, or complex, limiting their clinical utility. Our study addresses this gap by employing Bayesian networks to identify probabilistic relationships among IR markers and metabolic disorders, offering a novel and more accessible tool for risk assessment in routine clinical practice.

Conclusions

Diabetes and dyslipidemia are chronic diseases with high comorbidity, significantly increasing cardiovascular risks. While HOMA-IR effectively predicts diabetes and TG/HDL ratio predicts dyslipidemia, both show limited crossover predictive ability. The TyG index bridges this gap by demonstrating excellent predictive performance for both conditions. When combined with obesity information, the TyG index shows enhanced predictive capability compared to HOMA-IR, highlighting its scalability. Additionally, the TyG index’s calculation from routine laboratory tests without insulin measurements makes it more cost-effective and practical for clinical use.

Supporting information

S1 Fig. A directed acyclic graph of Bayesian network with marginal probabilities.

Abbreviation: BMI, body mass index; WC, waist circumference; WHtR, waist-to-height ratio; SBP, systolic blood pressure; FBG, fasting blood glucose; HbA1c, hemoglobin A1c; TG, triglycerides; HDL, high-density lipoprotein; LDL, low-density lipoprotein; TCHOL, total cholesterol; HOMA-IR, homeostasis model assessment of insulin resistance; TyG index, triglyceride-glucose index; TG/HDL ratio, triglyceride-to-high-density lipoprotein cholesterol ratio.

https://doi.org/10.1371/journal.pone.0323329.s001

(TIF)

References

  1. 1. Gerich JE. Contributions of insulin-resistance and insulin-secretory defects to the pathogenesis of type 2 diabetes mellitus. Mayo Clin Proc. 2003;78(4):447–56. pmid:12683697
  2. 2. Lee S-H, Park S-Y, Choi CS. Insulin resistance: from mechanisms to therapeutic strategies. Diabetes Metab J. 2022;46(1):15–37. pmid:34965646
  3. 3. Li Y, Liu Y, Liu S, Gao M, Wang W, Chen K, et al. Diabetic vascular diseases: molecular mechanisms and therapeutic strategies. Signal Transduct Target Ther. 2023;8(1):152. pmid:37037849
  4. 4. Ormazabal V, Nair S, Elfeky O, Aguayo C, Salomon C, Zuñiga FA. Association between insulin resistance and the development of cardiovascular disease. Cardiovasc Diabetol. 2018;17(1):122. pmid:30170598
  5. 5. Ding P-F, Zhang H-S, Wang J, Gao Y-Y, Mao J-N, Hang C-H, et al. Insulin resistance in ischemic stroke: Mechanisms and therapeutic approaches. Front Endocrinol (Lausanne). 2022;13:1092431. pmid:36589857
  6. 6. DeFronzo RA, Tobin JD, Andres R. Glucose clamp technique: a method for quantifying insulin secretion and resistance. Am J Physiol. 1979;237(3):E214-23. pmid:382871
  7. 7. Bergman RN, Phillips LS, Cobelli C. Physiologic evaluation of factors controlling glucose tolerance in man: measurement of insulin sensitivity and beta-cell glucose sensitivity from the response to intravenous glucose. J Clin Invest. 1981;68(6):1456–67. pmid:7033284
  8. 8. Bergman RN, Prager R, Volund A, Olefsky JM. Equivalence of the insulin sensitivity index in man derived by the minimal model method and the euglycemic glucose clamp. J Clin Invest. 1987;79(3):790–800. pmid:3546379
  9. 9. Matthews DR, Hosker JP, Rudenski AS, Naylor BA, Treacher DF, Turner RC. Homeostasis model assessment: insulin resistance and beta-cell function from fasting plasma glucose and insulin concentrations in man. Diabetologia. 1985;28(7):412–9. pmid:3899825
  10. 10. Wallace TM, Levy JC, Matthews DR. Use and abuse of HOMA modeling. Diabetes Care. 2004;27(6):1487–95. pmid:15161807
  11. 11. Ghasemi A, Tohidi M, Derakhshan A, Hasheminia M, Azizi F, Hadaegh F. Cut-off points of homeostasis model assessment of insulin resistance, beta-cell function, and fasting serum insulin to identify future type 2 diabetes: Tehran Lipid and Glucose Study. Acta Diabetol. 2015;52(5):905–15. pmid:25794879
  12. 12. Moon S, Park JS, Ahn Y. The Cut-off values of triglycerides and glucose index for metabolic syndrome in American and Korean Adolescents. J Korean Med Sci. 2017;32(3):427–33. pmid:28145645
  13. 13. McLaughlin T, Abbasi F, Cheal K, Chu J, Lamendola C, Reaven G. Use of metabolic markers to identify overweight individuals who are insulin resistant. Ann Intern Med. 2003;139(10):802–9. pmid:14623617
  14. 14. Guerrero-Romero F, Simental-Mendía LE, González-Ortiz M, Martínez-Abundis E, Ramos-Zavala MG, Hernández-González SO, et al. The product of triglycerides and glucose, a simple measure of insulin sensitivity. Comparison with the euglycemic-hyperinsulinemic clamp. J Clin Endocrinol Metab. 2010;95(7):3347–51. pmid:20484475
  15. 15. Park HM, Lee HS, Lee Y-J, Lee J-H. The triglyceride-glucose index is a more powerful surrogate marker for predicting the prevalence and incidence of type 2 diabetes mellitus than the homeostatic model assessment of insulin resistance. Diabetes Res Clin Pract. 2021;180:109042. pmid:34506839
  16. 16. Zhang Q, Xiao S, Jiao X, Shen Y. The triglyceride-glucose index is a predictor for cardiovascular and all-cause mortality in CVD patients with diabetes or pre-diabetes: evidence from NHANES 2001-2018. Cardiovasc Diabetol. 2023;22(1):279. pmid:37848879
  17. 17. Chen B, Zeng J, Fan M, You Q, Wang C, Wang K, et al. A longitudinal study on the impact of the TyG Index and TG/HDL-C ratio on the risk of type 2 diabetes in Chinese patients with prediabetes. Lipids Health Dis. 2024;23(1):262. pmid:39175004
  18. 18. Lim J, Kim J, Koo SH, Kwon GC. Comparison of triglyceride glucose index, and related parameters to predict insulin resistance in Korean adults: An analysis of the 2007-2010 Korean National Health and Nutrition Examination Survey. PLoS One. 2019;14(3):e0212963. pmid:30845237
  19. 19. Pan J, Ren Z, Li W, Wei Z, Rao H, Ren H, et al. Prevalence of hyperlipidemia in Shanxi Province, China and application of Bayesian networks to analyse its related factors. Sci Rep. 2018;8(1):3750. pmid:29491353
  20. 20. Arora P, Boyne D, Slater JJ, Gupta A, Brenner DR, Druzdzel MJ. Bayesian networks for risk prediction using real-world data: a tool for precision medicine. Value Health. 2019;22(4):439–45. pmid:30975395
  21. 21. Jensen FV. Bayesian networks and decision graphs. Statistics for engineering and information science. 2001.
  22. 22. Yirdaw BE, Debusho LK. Multilevel Bayesian network to model child morbidity using Gibbs sampling. Artif Intell Med. 2024;149:102784. pmid:38462284
  23. 23. VanderWeele TJ, Robins JM. Signed directed acyclic graphs for causal inference. J Royal Stat Soc Series B Stat Methodol. 2010;72(1):111–27.
  24. 24. Garrett KA, Alcalá-Briseño RI, Andersen KF, Buddenhagen CE, Choudhury RA, Fulton JC, et al. Network analysis: a systems framework to address grand challenges in plant pathology. Annu Rev Phytopathol. 2018;56:559–80. pmid:29979928
  25. 25. Ordovas JM, Rios-Insua D, Santos-Lozano A, Lucia A, Torres A, Kosgodagan A, et al. A Bayesian network model for predicting cardiovascular risk. Comput Methods Programs Biomed. 2023;231:107405. pmid:36796167
  26. 26. Howey R, Shin S-Y, Relton C, Davey Smith G, Cordell HJ. Bayesian network analysis incorporating genetic anchors complements conventional Mendelian randomization approaches for exploratory analysis of causal relationships in complex data. PLoS Genet. 2020;16(3):e1008198. pmid:32119656
  27. 27. Che B, Zhong C, Zhang R, Pu L, Zhao T, Zhang Y, et al. Triglyceride-glucose index and triglyceride to high-density lipoprotein cholesterol ratio as potential cardiovascular disease risk factors: an analysis of UK biobank data. Cardiovasc Diabetol. 2023;22(1):34. pmid:36797706
  28. 28. Shin K-A. Triglyceride and glucose (TyG) index is a clinical surrogate marker for the diagnosis of metabolic syndrome. BSL. 2017;23(4):348–54.
  29. 29. Choi JH, Lee KA, Moon JH, Chon S, Kim DJ, Kim HJ, et al. 2023 Clinical Practice Guidelines for diabetes mellitus of the Korean Diabetes Association. Diabetes Metab J. 2023;47(5):575–94. pmid:37793979
  30. 30. The Korean Society of Lipid and Atherosclerosis. Korean guidelines for the management of dyslipidemia. 5th ed. Seoul: The Korean Society of Lipid and Atherosclerosis; 2022.
  31. 31. Lim H, Nguyen T, Choue R, Wang Y. Sociodemographic disparities in the composition of metabolic syndrome components among adults in South Korea. Diabetes Care. 2012;35(10):2028–35. pmid:22837361
  32. 32. Kim J-Y, Yadav D, Ahn SV, Koh S-B, Park JT, Yoon J, et al. A prospective study of total sleep duration and incident metabolic syndrome: the ARIRANG study. Sleep Med. 2015;16(12):1511–5. pmid:26611949
  33. 33. Yu H, Lee CM. Comparison of tobacco use and cessation behavior between conventional cigarette and heated tobacco product users: based on the analyses of the Eighth Korea National Health and Nutrition Examination Survey 2019. Korean J Fam Med. 2022;43(5):296–304. pmid:36168901
  34. 34. Oh HS. Opposite effects of work-related physical activity and leisure-time physical activity on the risk of diabetes in Korean Adults. Int J Environ Res Public Health. 2020;17(16):5812. pmid:32796731
  35. 35. Rhee E-J, Kim HC, Kim JH, Lee EY, Kim BJ, Kim EM, et al. 2018 Guidelines for the management of dyslipidemia in Korea. J Lipid Atheroscler. 2019;8(2):78–131. pmid:32821702
  36. 36. Kim HC, Lee H, Lee H-H, Son D, Cho M, Shin S, et al. Korea Hypertension Fact Sheet 2023: analysis of nationwide population-based data with a particular focus on hypertension in special populations. Clin Hypertens. 2024;30(1):7. pmid:38424634
  37. 37. Kim K-K, Haam J-H, Kim BT, Kim EM, Park JH, Rhee SY, et al. Evaluation and treatment of obesity and its comorbidities: 2022 update of clinical practice Guidelines for Obesity by the Korean Society for the Study of Obesity. J Obes Metab Syndr. 2023;32(1):1–24. pmid:36945077
  38. 38. Browning LM, Hsieh SD, Ashwell M. A systematic review of waist-to-height ratio as a screening tool for the prediction of cardiovascular disease and diabetes: 0·5 could be a suitable global boundary value. Nutr Res Rev. 2010;23(2):247–69. pmid:20819243
  39. 39. Ling Z, Yu K, Liu L, Li J, Zhang Y, Wu X. PSL: an algorithm for Partial Bayesian Network Structure Learning. ACM Trans Knowl Discov Data. 2022;16(5):1–25.
  40. 40. Park E, Chang H-J, Nam HS. A Bayesian Network Model for predicting post-stroke outcomes with available risk factors. Front Neurol. 2018;9:699. pmid:30245663
  41. 41. Pearl J. Probabilistic reasoning in intelligent systems: Networks of plausible inference. Morgan Kaufmann Publishers; 1988.
  42. 42. Kitson NK, Constantinou AC, Guo Z, Liu Y, Chobtham K. A survey of Bayesian Network structure learning. Artif Intell Rev. 2023;56(8):8721–814.
  43. 43. Aliferis CF, Statnikov A, Tsamardinos I, Mani S, Koutsoukos XD. Local causal and Markov blanket induction for causal discovery and feature selection for classification part I: Algorithms and empirical evaluation. J Mach Learn Res. 2010;11: 171–234.
  44. 44. Wang L, Liu Y, Mammadov M, Sun M, Qi S. Discriminative structure learning of bayesian network classifiers from training dataset and testing instance. Entropy (Basel). 2019;21(5):489. pmid:33267204
  45. 45. Daly R, Shen Q, Aitken S. Learning Bayesian networks: approaches and issues. The Knowledge Engineering Review. 2011;26(2):99–157.
  46. 46. Scutari M, Denis J-B. Bayesian Networks. 2021.
  47. 47. Bowles NP, He Y, Huang Y-H, Stecker EC, Seixas A, Thosar SS. Cardiovascular disease risk: it is complicated, but race and ethnicity are key, a Bayesian network analysis. Front Public Health. 2024;12:1364730. pmid:38915752
  48. 48. Fuster-Parra P, Huguet-Torres A, Castro-Sánchez E, Bennasar-Veny M, Yañez AM. Identifying the interplay between protective measures and settings on the SARS-CoV-2 transmission using a Bayesian network. PLoS One. 2024;19(7):e0307041. pmid:38990971
  49. 49. Rosenberg DE, Jabbour SA, Goldstein BJ. Insulin resistance, diabetes and cardiovascular risk: approaches to treatment. Diabetes Obes Metab. 2005;7(6):642–53. pmid:16219008
  50. 50. Kerr AG, Andersson DP, Dahlman I, Rydén M, Arner P. Adipose insulin resistance associates with dyslipidemia independent of liver resistance and involves early hormone signaling. Arterioscler Thromb Vasc Biol. 2023;43(6):1054–65. pmid:37051928
  51. 51. Wang S, Shi J, Peng Y, Fang Q, Mu Q, Gu W, et al. Stronger association of triglyceride glucose index than the HOMA-IR with arterial stiffness in patients with type 2 diabetes: a real-world single-centre study. Cardiovasc Diabetol. 2021;20(1):82. pmid:33888131
  52. 52. Lin Z, Yuan S, Li B, Guan J, He J, Song C, et al. Insulin-based or non-insulin-based insulin resistance indicators and risk of long-term cardiovascular and all-cause mortality in the general population: A 25-year cohort study. Diabetes Metab. 2024;50(5):101566. pmid:39127168
  53. 53. He H-M, Xie Y-Y, Chen Q, Li Y-K, Li X-X, Mu Y-K, et al. The additive effect of the triglyceride-glucose index and estimated glucose disposal rate on long-term mortality among individuals with and without diabetes: a population-based study. Cardiovasc Diabetol. 2024;23(1):307. pmid:39175051
  54. 54. Sun Y, Ji H, Sun W, An X, Lian F. Triglyceride glucose (TyG) index: A promising biomarker for diagnosis and treatment of different diseases. Eur J Intern Med. 2025;131:3–14. pmid:39510865
  55. 55. Howey R, Clark AD, Naamane N, Reynard LN, Pratt AG, Cordell HJ. A Bayesian network approach incorporating imputation of missing data enables exploratory analysis of complex causal biological relationships. PLoS Genet. 2021;17(9):e1009811. pmid:34587167