Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Evaluation of first and second trimester maternal thyroid profile on the prediction of gestational diabetes mellitus and post load glycemia

  • Daniela Mennickent,

    Roles Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Software, Validation, Visualization, Writing – original draft

    Affiliations Facultad de Farmacia, Departamento de Bioquímica Clínica e Inmunología, Universidad de Concepción, Concepción, Chile, Facultad de Farmacia, Departamento de Análisis Instrumental, Universidad de Concepción, Concepción, Chile, Machine Learning Applied in Biomedicine (MLAB), Concepción, Chile

  • Bernel Ortega-Contreras,

    Roles Conceptualization, Data curation, Funding acquisition, Investigation, Writing – review & editing

    Affiliation Facultad de Farmacia, Departamento de Bioquímica Clínica e Inmunología, Universidad de Concepción, Concepción, Chile

  • Sebastián Gutiérrez-Vega,

    Roles Data curation, Investigation, Writing – review & editing

    Affiliation Facultad de Farmacia, Departamento de Bioquímica Clínica e Inmunología, Universidad de Concepción, Concepción, Chile

  • Erica Castro,

    Roles Investigation, Resources, Writing – review & editing

    Affiliation Facultad de Ciencias de la Salud, Departamento de Obstetricia y Puericultura, Universidad de Atacama, Copiapó, Chile

  • Andrés Rodríguez,

    Roles Funding acquisition, Resources, Visualization, Writing – review & editing

    Affiliations Machine Learning Applied in Biomedicine (MLAB), Concepción, Chile, Facultad de Ciencias, Departamento de Ciencias Básicas, Universidad del Bío-Bío, Chillán, Chile, Group of Research and Innovation in Vascular Health (GRIVAS-Health), Chillan, Chile

  • Juan Araya ,

    Roles Conceptualization, Formal analysis, Funding acquisition, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing – review & editing

    eguzman@udec.cl (EG-G); jarayaq@udec.cl (JA)

    Affiliations Facultad de Farmacia, Departamento de Análisis Instrumental, Universidad de Concepción, Concepción, Chile, Machine Learning Applied in Biomedicine (MLAB), Concepción, Chile

  • Enrique Guzmán-Gutiérrez

    Roles Conceptualization, Data curation, Funding acquisition, Methodology, Project administration, Resources, Supervision, Visualization, Writing – original draft, Writing – review & editing

    eguzman@udec.cl (EG-G); jarayaq@udec.cl (JA)

    Affiliations Facultad de Farmacia, Departamento de Bioquímica Clínica e Inmunología, Universidad de Concepción, Concepción, Chile, Machine Learning Applied in Biomedicine (MLAB), Concepción, Chile, Group of Research and Innovation in Vascular Health (GRIVAS-Health), Chillan, Chile

Abstract

Maternal thyroid alterations have been widely associated with the risk of gestational diabetes mellitus (GDM). This study aims to 1) test the first and the second trimester full maternal thyroid profile on the prediction of GDM, both alone and combined with non-thyroid data; and 2) make that prediction independent of the diagnostic criteria, by evaluating the effectiveness of the different maternal variables on the prediction of oral glucose tolerance test (OGTT) post load glycemia. Pregnant women were recruited in Concepción, Chile. GDM diagnosis was performed at 24–28 weeks of pregnancy by an OGTT (n = 54 for normal glucose tolerance, n = 12 for GDM). 75 maternal thyroid and non-thyroid parameters were recorded in the first and the second trimester of pregnancy. Various combinations of variables were assessed for GDM and post load glycemia prediction through different classification and regression machine learning techniques. The best predictive models were simplified by variable selection. Every model was subjected to leave-one-out cross-validation. Our results indicate that thyroid markers are useful for the prediction of GDM and post load glycemia, especially at the second trimester of pregnancy. Thus, they could be used as an alternative screening tool for GDM, independently of the diagnostic criteria used. The final classification models predict GDM with cross-validation areas under the receiver operating characteristic curve of 0.867 (p<0.001) and 0.920 (p<0.001) in the first and the second trimester of pregnancy, respectively. The final regression models predict post load glycemia with cross-validation Spearman r correlation coefficients of 0.259 (p = 0.036) and 0.457 (p<0.001) in the first and the second trimester of pregnancy, respectively. This investigation constitutes the first attempt to test the performance of the whole maternal thyroid profile on GDM and OGTT post load glycemia prediction. Future external validation studies are needed to confirm these findings in larger cohorts and different populations.

1. Introduction

Gestational diabetes mellitus (GDM) is a hyperglycemia state that is diagnosed at the second or third trimester of pregnancy, with glycemia values that vary depending on the diagnostic criteria [1]. This disorder has been linked to several negative effects on maternal and fetal health, short and long-term [24]. The improvement of its diagnosis, whether with early detection tools or with alternative screening tools, could help to prevent those adverse pregnancy outcomes. Predictive models, with “prediction" understood as the forecast of future behaviors or unobserved outcomes [5], are an auspicious means to achieve that goal.

GDM prediction has been typically addressed with strategies based on classic clinical risk factors for this pregnancy complication, i.e. maternal age, body mass index (BMI), family history of diabetes, previous GDM, among others. Nevertheless, approaches that consider only in this type of parameters usually have a poor predictive performance for GDM [68]. Thus, other biomarkers should be investigated and tested in order to complement and boost them.

Maternal thyroid alterations in the first and the second trimester of pregnancy have been widely related to the risk of GDM [912]. Moreover, certain authors have assessed first and second trimester thyroid parameters on the prediction of this pregnancy disorder [1319]. However, the majority of them considered only a few thyroid markers, instead of the whole thyroid profile [1318]. Furthermore, various of them didn’t evaluate these markers in association with other variables [1315], which is usually related to an improved predictive accuracy [20]. In addition, even though some authors used thyroid data in multivariate analysis, they didn’t assess the particular effect of the thyroid profile on GDM prediction, whether by testing it solo or by adding/removing it from the analysis [1619].

Therefore, this investigation aims to: 1) evaluate the first and the second trimester maternal thyroid profile on GDM prediction, both alone and combined with non-thyroid parameters; and 2) make that prediction independent of the diagnostic criteria, by testing the performance of the different maternal variables on post load glycemia prediction.

2. Materials and methods

2.1. Ethical aspects

This biomedical study was approved by the Ethics Committee of the Servicio de Salud Concepción (17-12-88) and was performed in accordance to the World Medical Association Declaration of Helsinki.

2.2. Subjects recruitment

Pregnant women with 10–14 gestational weeks were recruited between 2017 and 2019 at three primary health centers (CESFAM Victor Manuel Fernández, CESFAM Santa Sabina and CESFAM Tucapel) in Concepción, Chile. Subjects with pregestational diabetes mellitus, pregestational thyroid disorders, or any pregnancy pathologies different than GDM were excluded. Women that gave their written informed consent were incorporated into the study, and followed through pregnancy until 24–28 gestational weeks.

2.3. Blood samples collection

Blood samples were collected on the first (at 10–14 gestational weeks) and the second (at 24–28 gestational weeks) trimester of pregnancy, either after 12 hours of fasting or after 2 hours of a 75 g glucose load. Samples were transported to laboratory at 4˚C. Sera (obtained from tubes with no anticoagulant) and plasma (obtained from tubes with sodium fluoride/citrate as anticoagulant) were aliquoted and stored at -80˚C. White blood cells, obtained from EDTA tubes, were stored at -20°C.

2.4. GDM diagnosis and study groups

Pregnant women were subjected to an oral glucose tolerance test (OGTT) at 24–28 gestational weeks. Subjects with post load glycemia > 140 mg/dL (75 g, 2 h) were diagnosed with GDM. In this study, out of 66 subjects, 12 had GDM. The remaining 54 exhibited no alteration in fasting or post load glycemia, i.e. had normal glucose tolerance (NGT).

2.5. Clinical data collection

First trimester maternal clinical variables were obtained from both health centers records and pregnant women statements. Institution-derived parameters involved: age, height, weight, BMI, systolic and diastolic pressure at the first trimester of pregnancy. Self-reported data comprised: supplement consumption, hyperemesis, vaginal bleeding and drug use at the first trimester of pregnancy; and preconception information, i.e. drug use before pregnancy, prior pregnancy complications, prior non-viable pregnancy, fertility problems, polycystic ovary syndrome (PCOS) history, first period age, last period date, personal and family morbid history.

2.6. Biochemical determinations

Fasting and post load plasma glucose were measured using the hexokinase method [21]. Thyroid stimulating hormone (TSH), total triiodothyronine (TT3), total thyroxine (TT4), free T4 (FT4), thyroglobulin (TG), TG antibody (aTG), thyroid peroxidase antibody (aTPO), TSH receptor antibody (TRAb) and fat mass and obesity-associated (FTO) rs9939609 genotype were determined as described by Araya et al [22]. Briefly, serum thyroid hormones and antibodies were measured by different immunoassays. For FTO genotyping, genomic DNA was isolated from white blood cells, and then analyzed by a previously optimized PCR-HRM method. Melt curve results were normalized to identify the subject’s genotype by allele comparison with control samples of known genotype.

2.7. Univariate data analysis

Between the first and second trimester of pregnancy, a total of 75 maternal clinical and biochemical variables were recorded (S1 Dataset). Qualitative data were compared by two-sided Fisher exact test. Quantitative data normality was assessed through Shapiro-Wilk test. Normally distributed variables were compared by unpaired Student t test. Not-normally distributed variables were compared by Mann-Whitney or Wilcoxon matched-pairs signed rank test in unpaired or paired analyses, respectively. Values of p<0.05 were considered statistically significant. Univariate analyses were performed with the software GraphPad Prism 9.2.0 (GraphPad Software Inc, USA).

2.8. Multivariate data analysis

For multivariate analyses, almost all the registered maternal variables were used as predictors, except for height, weight and OGTT glycemia. Height and weight weren’t considered as individual predictors. Instead, the BMI, which comprises both, was utilized. The OGTT glycemia was employed as a label (i.e. NGT or GDM, for exploration) or as the property to predict (i.e. NGT or GDM, for classification; or the OGTT post load glycemia value, for regression).

For classification and regression analyses, the predictors were divided into four categories: thyroid from the first (Thy1T) and the second (Thy2T) trimester of pregnancy, which comprised TSH, TT3, TT4, FT4, TG, aTG, aTPO and TRAb; and non-thyroid from the first (NoThy1T) and the second (NoThy2T) trimester of pregnancy, which included every other recorded predictor. All the possible combinations of these four categories were assessed for GDM and post load glycemia prediction.

Prior to multivariate analyses, qualitative variables were transformed into categorical and all data were normalized by auto-scaling.

2.8.1. Exploratory analysis.

Full auto-scaled data were explored by principal components analysis (PCA). To detect possible outliers, four parameters were evaluated at this stage: sample residual, F ratio, probability and Mahalanobis distance. Subjects were considered outliers if two or more parameters suggested such behavior. Data exploration was performed with the software Pirouette 4.5 (Infometrix Inc, USA).

2.8.2. Classification analysis.

Different subgroups of auto-scaled data were assessed for GDM prediction through five classification ML techniques: logistic regression (LR), linear support vector machine (L-SVM), partial least squares discriminant analysis (PLS-DA), classification and regression tree (CART) and extreme gradient boosting (XGB). Every model was subjected to leave-one-out cross-validation. LR, PLS-DA and XGBoost were executed in PLS_Toolbox R8.9.2 (Eigenvector Research Inc, USA). L-SVM and CART were implemented by coding in MATLAB R2021a (The MathWorks Inc, USA).

The classification predictive performance was assessed by the determination of the models sensitivity (Se), specificity (Sp) and non-error rate (NER) in both calibration and cross-validation. These parameters were calculated as follows: (1) (2) (3)

Where TP, FN, TN and FP are the number of true positives, false negatives, true negatives and false positives, respectively.

The area under the receiver operating characteristic curve (AUC) of the final classification models was determined through the software GraphPad Prism 9.2.0 (GraphPad Software Inc, USA).

2.8.3. Regression analysis.

Different subgroups of auto-scaled data were assessed for post load glycemia prediction through the regression ML technique partial least squares (PLS). Every model was subjected to leave-one-out cross-validation. PLS was executed in PLS_Toolbox R8.9.2 (Eigenvector Research Inc, USA).

The regression predictive performance was assessed by the determination of the models root mean square error (RMSE) and relative error (RE) in both calibration (RMSEC and REC) and cross validation (RMSECV and RECV). These parameters were calculated as follows: (4) (5) (6) (7)

Where ŷi is the predicted post load glycemia for subject i, yi is the actual post load glycemia for subject i, n is the number of subjects in calibration and cross-validation, and ȳ is the mean of the actual post load glycemia values in calibration.

The Spearman r correlation coefficient of the final regression models was determined by the software GraphPad Prism 9.2.0 (GraphPad Software Inc, USA).

2.8.4. Variable selection.

Variable importance was retrieved from PLS-DA and PLS regression vectors. For every model, the contribution of each variable was calculated as follows: (8)

Where bi is the regression vector value for variable i, and n is the number of variables in the classification or regression model.

For each model, variables were sorted according to their percentage contribution. Toward model simplification, the j variables with the highest percentage contribution were selected, with j ranging from 10 to 1.

3. Results and discussion

3.1. Description of the study groups

To characterize and compare the study groups, classic univariate techniques were used. Table 1 shows only the first and second trimester maternal variables that differed significantly between NGT and GDM pregnancies in the univariate approach. The full list, which contains the 75 maternal variables that were recorded and analyzed in this study, is presented in S1 Table.

thumbnail
Table 1. Maternal variables that differ significantly between normal glucose tolerance and gestational diabetes mellitus pregnancies in univariate analysis.

https://doi.org/10.1371/journal.pone.0280513.t001

In the first trimester of pregnancy, the BMI and the systolic pressure were higher in the GDM group, than in the NGT group. Moreover, prior history of GDM, family history of type 2 diabetes (DM2) and family history of hypertension were more frequent in GDM pregnancies. These observations are consistent with a recent Chilean epidemiologic study, in which subjects with a higher pre-pregnancy BMI, previous record of hypertension or GDM, or family history of DM2 or hypertension, presented a greater risk of GDM [23]. Likewise, personal history of anemia was more frequent in GDM women. The relationship between anemia and GDM appears to be dependent on the underlying cause of the former. Indeed, whereas iron deficiency decreases the risk of GDM [24], B12 deficiency increases it [25]. Interestingly, although iron deficiency was the most common cause of anemia among Chilean women between 1981 and 2010 [26], the 2010–2011 National Food Consumption Survey revealed that, while approximately 30% of women had a deficient consumption of iron, nearly 60% of them had insufficient intake of vitamin B12 [27]. The FTO genotype was significantly different between the two groups as well: while the TT genotype was more common in NGT women, the TA and the AA genotypes were more frequent in GDM women. Contradictory results have been reported regarding to the association between FTO rs9939609 polymorphism and GDM [2830]. However, in Chilean adults the risk allele A was linked to an increased BMI [31], which is a risk factor for GDM in this population [23].

In the second trimester of pregnancy, TSH, TT3 and aTG were higher in the GDM group, when compared to the NGT group. In addition, FT4 was lower in GDM pregnancies. Preceding studies describe similar results. For instance, Gutierrez-Vega et al reported that GDM subjects had greater levels of TSH and TT3 than NGT subjects at 24–28 weeks of pregnancy [32]. Likewise, Tang et al found increased levels of aTG and decreased levels of FT4 in GDM pregnant women at 17.0 ± 4.0 weeks of gestation [14]. Furthermore, second trimester fasting glycemia and OGTT post load glycemia were higher in the GDM group, which is in agreement with its diagnostic criteria. The prevalence of GDM in this study was 18.2%, moderately superior than the 13.0% described for Chile in 2015 [23] and to the 11.2% reported for South and Central America in 2015–2018 [1].

To better understand the differences observed between NGT and GDM groups in terms of thyroid markers, a complementary analysis was performed. For each group, TSH, TT3, FT4 and aTG levels were compared between the first and the second trimester of pregnancy. The results are displayed in S2 Table. TSH didn’t vary significantly in any of the groups, although it tended to increase from the first to the second trimester, especially in GDM subjects. The latter accounts for the statistical difference between NGT and GDM pregnancies regarding TSH2T (Table 1). TT3 raised and FT4 decreased from the first to the second trimester in both groups. The magnitude of those changes was greater in GDM women, which explains the significant difference amidst the two groups in terms of TT32T and FT42T (Table 1). aTG lowered from the first to the second trimester in the NGT subjects, whereas it didn’t vary in GDM subjects. This contrasting behavior gives a rational for the statistical difference between NGT and GDM pregnancies in relation to aTG2T (Table 1). The changes observed in thyroid hormones within the NGT group are consistent with the physiology of normal pregnancy, which is characterized by is a slight rise in TSH, a considerable increase in TT3 and a substantial decrease in FT4 between early- and mid-gestation [33]. The fact that these changes are exacerbated in GDM is not yet fully understood, however it has been proposed that it may be linked to maternal weight. High maternal weight, as high maternal BMI, is an important risk factor for GDM. Haddow et al proposed that higher maternal weight would induce higher peripheral deiodinase activity, which in turn would lead to reduced FT4 and increased TT3. Moreover, higher TT3 would induce higher plasma glucose levels, thereby contributing to GDM pathophysiology [34]. This hypothesis, although just beginning to be tested by other authors [3537], provides a feasible explanation for the differences observed between GDM and NGT pregnancies in this study, especially considering that the GDM group presented a higher BMI than the NGT group. In a similar manner, the changes observed in aTG within the NGT group are in line with previous reports, in which the levels of this thyroid antibody decrease from early- to mid-gestation in normal pregnancies [3840]. This decline is due to the physiological immunosuppression that occurs in normal pregnancy to tolerate the fetus and placenta [41]. That process is disrupted in GDM [42], which could explain why the levels of aTG do not decrease amidst trimesters in the GDM group as they do in the NGT group.

These results indicate that, when considered individually, the main differential variables between the two study groups are non-thyroid in the first trimester of pregnancy, and thyroid in the second trimester of pregnancy.

3.2. Exploration of maternal data by PCA

To assess the potential of a multivariate approach to distinguish GDM from NGT pregnancies, a PCA was performed. The unsupervised analysis comprised 72 maternal variables and all the 66 subjects. No outliers were found. The PCA results are displayed in Fig 1.

thumbnail
Fig 1. Exploration of maternal data by principal component analysis.

(A) The scores plot shows the NGT (blue) and GDM (red) subjects projected in the space constituted by PC1 and PC2. (B) The loadings plot shows the variables contribution to PC1 and PC2. The ones that contribute the most to each PC are highlighted (green) and labeled. NGT: Normal glucose tolerance. GDM: Gestational diabetes mellitus. PC: Principal component. 1T: First trimester. 2T: Second trimester. pHELLP: Prior hemolysis elevated liver enzymes and low platelets syndrome. pPEffusion: Prior pleural effusion. pSepticemia: Prior septicemia. fDM2: Family history of type 2 diabetes. fHT: Family history of hypertension. BMI: Body mass index. fHypothyroidism: Family history of hypothyroidism. NonViableP: Prior non-viable pregnancy. pDrugs: Drug use before pregnancy. Supplements: Supplement consumption. FirstPAge: First period age. TSH: Thyroid stimulating hormone. aTG: Thyroglobulin antibody. aTPO: Thyroid peroxidase antibody. TT4: Total thyroxine. FT4: Free thyroxine. TT3: Total triiodothyronine. FBG: Fasting blood glucose.

https://doi.org/10.1371/journal.pone.0280513.g001

Fig 1A shows that NGT and GDM women are partially separated by the conjunction of principal components 1 (PC1) and 2 (PC2). Fig 1B highlights the maternal variables that, when combined, contribute the most to the separation between the two study groups. They are mainly non-thyroid variables from the first trimester, and thyroid variables from the second trimester of pregnancy, which is consistent with the univariate analysis results. However, although more discreetly, some thyroid variables from the first trimester also appear to contribute to the separation between NGT and GDM subjects. A previous study reported similar results. Araya et al used PCA to explore clinical and biochemical maternal data from NGT and GDM pregnancies. After removing the OGTT post load glycemia from the analysis, the separation between two groups was mostly attributable to first trimester clinical variables and second trimester thyroid parameters, with a minor but noticeable contribution of first trimester thyroid data [22]. These observations are also in agreement with other authors, who described that different second trimester thyroid markers are more strongly associated with the risk of GDM, than their first trimester counterparts [14,16,4345].

These results suggest that a supervised multivariate model, i.e. a ML-based model, built with first and second trimester thyroid parameters could be successful in the prediction of GDM.

3.3. Prediction of GDM with ML techniques

To evaluate the ability of maternal thyroid markers to predict GDM, different classification models were developed, covering various subgroups of predictors and ML techniques. Table 2 displays the predictive performance of each model in calibration and cross-validation. It focuses on the NER, which is a global measure of classification quality that doesn’t depend on the relative sizes of the groups of interest [46]. In other words, the NER isn’t biased by the presence of unbalanced groups, which is the case of this study: the NGT and GDM groups comprise 54 and 12 subjects, respectively. The sensitivity and specificity values of each model are presented in S3 Table.

thumbnail
Table 2. Non-error rate of machine learning models that predict gestational diabetes mellitus based on different maternal variables.

https://doi.org/10.1371/journal.pone.0280513.t002

First trimester thyroid variables (Thy1T) solo displayed a poor performance for the prediction of GDM. Their models got an average cross-validation NER (CV-NER) of 51.3%. Furthermore, in most cases the addition of Thy1T to other variables didn’t improve the classification quality. However, the model built only on first trimester predictors that had the highest CV-NER comprises Thy1T (PLS-DA, Thy1T + NoThy1T, CV-NER of 81.0%). The model with second trimester predictors that showed the greatest CV-NER also encompasses Thy1T (PLS-DA, NoThy1T + Thy1T + Thy2T, CV-NER of 87.0%). The top 15 most important variables of those models are presented in Fig 2. Fig 2A concerns the one covering first trimester data only, and Fig 2B the one containing second trimester data. Notably, the 10 major predictors of both models include first trimester thyroid markers, which didn’t come up as relevant in univariate analysis. This supports the idea that emerged in exploratory analysis: although moderately, first trimester thyroid variables contribute to the differentiation between NGT and GDM subjects when they are in association with other data. This view is consistent with the study of Zhu et al, who used LR and clinical and biochemical maternal parameters to predict GDM. They observed that the addition of first trimester TT3 and TT3/FT4 improved the AUC of their classification model, from 0.710 to 0.726 and 0.724, respectively [16].

thumbnail
Fig 2. Variable importance of the machine learning models that predict gestational diabetes mellitus with best performance.

(A) Built on first trimester data only. (B) Including second trimester data. 1T: First trimester. 2T: Second trimester. PLS-DA: Partial least squares discriminant analysis. mAnemia: Personal anemia. FTO: Fat mass and obesity-associated genotype (rs9939609). pGDM: Prior gestational diabetes mellitus. fDM2: Family history of type 2 diabetes. TG: Thyroglobulin. FirstPAge: First period age. pIUGR: Prior intrauterine growth restriction. mInsRes: Personal insulin resistance. TRAb: TSH receptor antibody. fArthritis: Family history of rheumatoid arthritis. NonViableP: Prior non-viable pregnancy. fCVA: Family history of cerebrovascular accident. FertilityP: Fertility problems. fHT: Family history of hypertension. TSH: Thyroid stimulating hormone. FT4: Free thyroxine. BMI: Body mass index. pHELLP: Prior hemolysis elevated liver enzymes and low platelets syndrome. pSepticemia: Prior septicemia. pPEffusion: Prior pleural effusion. mNeoplasia: Personal neoplasia. SysP: Systolic pressure.

https://doi.org/10.1371/journal.pone.0280513.g002

The particular first trimester thyroid variables that are among the top 10 most relevant predictors of the presented models are TG1T, in the first and the second trimester approaches, and TRAb1T, in the first trimester approach. The levels of TG1T and TRAb1T tend to be higher in the GDM group than in the NGT group (S1 Table), yet they didn’t reach statistical significance in univariate analysis. Remarkably, in both models they are more important than age and BMI, two known risk factors for GDM [23]. Few articles have studied the association between these two thyroid markers and the risk of GDM. Bell et al assessed the role of TG1T on the risk of GDM in Finnish pregnant women, and found no association between them, neither solo nor combined with blood iodine levels [47]. Likewise, Wang et al evaluated the association between TRAb1T and the risk of GDM in Shanghainese pregnant women, and found that positive TRAb1T was related to a decreased risk of this pregnancy disorder [48]. Pregnant women from both Finland [47] and Shanghai [48] are known to be iodine deficient. Differently from those populations, Chilean pregnant women have an adequate iodine intake [49], which could partially explain these contrasting results. It is also important to mention that, in the present study, TG1T and TRAb1T only came out as relevant for the prediction of GDM in association with other maternal variables, which wasn’t comprehensively assessed in the mentioned studies. As far as we know, neither TG1T nor TRAb1T had been tested for the prediction of GDM before.

In contrast to Thy1T, second trimester thyroid variables (Thy2T) solo showed an acceptable performance for GDM prediction. Their models reached an average CV-NER of 79.0%. Moreover, the addition of Thy2T to other variables improved the classification performance in most cases. Interestingly, Thy2T presented a higher CV-NER than second trimester non-thyroid variables (NoThy2T, average CV-NER of 53.8%), and the only NoThy2T assessed in this study was fasting blood glucose (FBG2T). This means that Thy2T had a better predictive performance than FBG2T, a parameter that is often used as a screening tool for GDM [50]. Moreover, several second trimester thyroid parameters figure among the top 15 most important predictors of the second trimester model, whereas FBG2T doesn’t. Therefore, Thy2T could be employed as an alternative screening tool for GDM. To our knowledge, this is the first time that thyroid markers are proposed to be used with that purpose.

The particular second trimester thyroid variables that are part of the top 10 most relevant predictors of the second trimester model are TSH2T, FT42T and TG2T. TSH2T and FT42T arose as significant in univariate analysis, but TG2T didn’t. They are more important to the model than age and BMI. Furthermore, TSH2T is more relevant than prior history of GDM and family history of DM2, other known risk factors for this pregnancy complication [23]. Higher TSH2T [43,44] and lower FT42T [14,16,45] levels have been associated to an increased risk of GDM, whereas TG2T hadn’t been linked with the risk of this disorder until now. The levels of this hormone precursor tend to be higher in the GDM group, than in the control group (S1 Table). FT42T has been used for its prediction in univariate analysis, with poor classification performance [14]. However, as far as we can tell, none of these thyroid markers had been used for the prediction of GDM in multivariate analysis before.

These results prove that maternal thyroid variables are useful for the prediction of GDM, especially at the second trimester of pregnancy.

3.4. Prediction of post load glycemia with ML techniques

In order to make the prediction of GDM independent from the diagnostic criteria, the prediction of post load glycemia was conducted. Whereas the prediction of GDM is a binary classification task, the prediction of the continuous parameter post load glycemia is a regression task. Given that PLS-DA stood out in classification, its related ML technique PLS was used for regression. The prediction performance was evaluated by RMSE and RE (also known as normalized RMSE or coefficient of variation of the RMSE), which are usually employed to assess the error of regression models in ML-based studies [51]. Table 3 presents the RE of each PLS model in calibration and cross-validation. Their RMSEs are shown in S4 Table.

thumbnail
Table 3. Relative error of machine learning models that predict post load glycemia based on different maternal variables.

https://doi.org/10.1371/journal.pone.0280513.t003

Thy1T and Thy2T exhibited a similar behavior than the observed in the classification analysis. Thy1T alone presented the highest error for post load glycemia prediction (RECV of 24.7%) and their addition to other variables didn’t improve the regression performance. Thy2T alone displayed a better performance (RECV of 20.1%) and their incorporation to other predictors enhanced the regression quality. Moreover, Thy2T had a lower RECV than FBG2T (NoThy2T, RECV of 22.4%). This is in agreement with the study of Ouzilleau et al, who found a very weak correlation between FBG and OGTT post load glycemia (75 g, 2 h) at 24–28 weeks of pregnancy [52].

In opposition with the observed in the classification analysis, the regression models with the lowest RECV don’t contain Thy1T, neither when built on first trimester predictors only (PLS, NoThy1T, RECV of 21.4%) nor when include second trimester predictors (PLS, NoThy1T + NoThy2T + Thy2T, RECV of 18.7%). However, the latter does include Thy2T. The top 15 most relevant variables of these models are presented in Fig 3. Fig 3A concerns the one comprising first trimester data only, and Fig 3B the one involving second trimester data.

thumbnail
Fig 3. Variable importance of the machine learning models that predict post load glycemia with best performance.

(A) Built on first trimester data only. (B) Including second trimester data. 1T: First trimester. 2T: Second trimester. PLS: Partial least squares. FTO: Fat mass and obesity-associated genotype (rs9939609). pHELLP: Prior hemolysis elevated liver enzymes and low platelets syndrome. pSepticemia: Prior septicemia. pPEffusion: Prior pleural effusion. mAsthma: Personal asthma. fDM2: Family history of type 2 diabetes. pGDM: Prior gestational diabetes mellitus. fHT: Family history of hypertension. FBG: Fasting blood glucose. fHyperthyroidism: Family history of hyperthyroidism. DiaP: Diastolic pressure. mAnemia: Personal anemia. pIUGR: Prior intrauterine growth restriction. FirstPAge: First period age. fArthritis: Family history of rheumatoid arthritis. TSH: Thyroid stimulating hormone. FT4: Free thyroxine. TT4: Total thyroxine. TRAb: TSH receptor antibody.

https://doi.org/10.1371/journal.pone.0280513.g003

The particular thyroid variables that are among the top 10 most important predictors of the second trimester model are TSH2T, FT42T and TT42T. In line with the results described above, TSH2T and FT42T are more relevant to the model than FBG2T. Moreover, the three thyroid markers are more relevant than family history of DM2 and prior GDM, two recognized risk factors for GDM in Chile [23] and worldwide [1]. In contrast to TSH2T and FT42T, TT42T didn’t emerge as significant in univariate analysis and hadn’t been linked to the risk of GDM before. TT42T levels tend to be lower in the GDM group, than in the NGT group (S1 Table). To our knowledge, this is the first time that these thyroid parameters are used to predict post load glycemia in pregnancy. In agreement with was already suggested, they could be used as an alternative screening tool for GDM, independently of its diagnostic criteria.

These results demonstrate that second trimester maternal thyroid variables are useful for the prediction of OGTT post load glycemia.

3.5. Models simplification by variable selection

In order to ease the application of the best classification and regression models, variable selection was performed. Prior ML-based studies carried out similar approaches [6,19]. For each full model, different numbers of variables were selected, which resulted in various simplified models. Their performance is presented in S5 Table. The first and second trimester simplified models with the highest CV-NER (in the case of the prediction of GDM) and the lowest RECV (in the case of the prediction of post load glycemia) were chosen as the final ones. Table 4 displays the NER or RE of the full and the final simplified models, in calibration and cross-validation.

thumbnail
Table 4. Non-error rate and relative error of machine learning models that predict gestational diabetes mellitus and post load glycemia before and after variable selection.

https://doi.org/10.1371/journal.pone.0280513.t004

In classification with first trimester data only, the full model achieved a CV-NER of 81.0%. The selection of the top 4 most important variables increased the CV-NER to 87.0%. In the case of classification including second trimester data, the full model reached a CV-NER of 87.0%. The selection of the top 9 most relevant variables improved the predictive performance, but in a more moderate extent than in the first trimester case (CV-NER of 89.8%). The former simplified model doesn’t contain thyroid predictors, but the latter includes both first and second trimester thyroid markers. It is important to notice that the second trimester model has a better predictive performance than the first trimester model, both before and after variable selection. This could be related to the fact that GDM’s hyperglycemia state is already established at the second trimester of pregnancy. In contrast, at the first trimester of pregnancy the pathophysiological events that lead to the clinical manifestation of GDM are likely to be incipient. Thus, the phenotypic alterations induced by this pregnancy complication will probably be more evident in the second trimester than in the first trimester of pregnancy, which is reflected in the variables’ behavior and the models’ performance. This idea is consistent with was described by other authors. For instance, Catalano et al observed an impaired insulin response and a reduced hepatic insulin sensitivity in GDM women at late gestation, but not at early gestation [53]. Likewise, Wong et al found an increased placental volume in GDM subjects at the second trimester, but not at the first trimester of pregnancy [54]. Similarly, Sovio et al noticed an altered fetal biometry in GDM pregnancies at 28 weeks of gestation, but not at 20 weeks of gestation [55]. On the other hand, the main differential predictors between the first and second trimester models are thyroid. Therefore, the comportment of the models’ performance is in agreement with what was already discussed: thyroid markers are useful for the prediction of GDM, especially at the second trimester of pregnancy.

In regression, the full model with first trimester data alone got a RECV of 21.4%. The selection of the 7 most relevant variables decreased the RECV to 20.7%. In the case of regression comprising second trimester data, the full model yielded a RECV of 18.7%. The selection of the 9 most important variables improved the predictive performance, but in a more discreet degree than in the first trimester case (RECV of 18.4%). Even though none of the simplified models include first trimester thyroid predictors, the latter does contain second trimester thyroid markers. Similar to what was observed in classification, the second trimester model has a better predictive performance than the first trimester model, both before and after variable selection. Together, these observations reaffirm the idea that second trimester thyroid variables are useful for the prediction of OGTT post load glycemia.

Fig 4 presents the cross-validation predictive performance of the final simplified models in a graphical manner. Fig 4A and 4B display the receiver operating characteristic (ROC) curve for the prediction of GDM in the first and the second trimester of pregnancy, respectively. Fig 4C and 4D show the correlation between the actual and the predicted value of post load glycemia in the first and second trimester of pregnancy, respectively. Calibration plots are exhibited in S1 Fig.

thumbnail
Fig 4. Cross-validation performance of the final machine learning models that predict gestational diabetes mellitus and post load glycemia.

A-B) For the prediction of GDM in the first (A) or second (B) trimester of pregnancy by PLS-DA. C-D) For the prediction of post load glycemia in the first (C) or second (D) trimester of pregnancy by PLS. The four plots show the cross-validation performance of the final models with their full set of predictors (black) and after the remotion of biochemical markers (grey). 1T: First trimester. 2T: Second trimester. PLS-DA: Partial least squares discriminant analysis. PLS: Partial least squares. CV: Cross-validation.

https://doi.org/10.1371/journal.pone.0280513.g004

The prediction of GDM with the first trimester model reaches calibration and cross-validation AUCs of 0.914 (95% CI: 0.821–1.000, p<0.001****) and 0.867 (95% CI: 0.741–0.994, p<0.001****), respectively. This classification model covers the top 4 most important predictors presented in Fig 2A, i.e. personal history of anemia, FTO genotype, prior GDM and family history of DM2. The contribution of each predictor to the final model is shown on Panel A in S2 Fig. The relationship between them and GDM was already discussed. Remarkably, the deletion of FTO genotype from the model decreases calibration and cross-validation AUCs to 0.860 (95% CI: 0.722–0.998, p<0.001****) and 0.759 (95% CI: 0.556–0.962, p<0.01**), respectively.

GDM prediction with the second trimester model achieves AUCs of 0.940 (95% CI: 0.860–1.000, p<0.001****) and 0.920 (95% CI: 0.823–1.000, p<0.001****) in calibration and cross-validation, respectively. This classification model comprises the top 9 most relevant variables displayed in Fig 2B, i.e. TSH2T, prior GDM, family history of DM2, FT42T, personal history of anemia, TG2T, TG1T, BMI and past HELLP syndrome. The contribution of each predictor to the final model is presented on Panel B in S2 Fig. The association between the majority of them and GDM was formerly discussed, except for past hemolysis, elevated liver enzymes and low platelet (HELLP) syndrome. Past HELLP syndrome tends to be more frequent in GDM than in NGT pregnancies (S1 Table). The relationship between this syndrome and the development of GDM in a future pregnancy has not been studied. However, HELLP has been linked to long-term glucose intolerance [56]. Furthermore, severe hypertensive pregnancy disorders, including the HELLP syndrome, have been associated with an increased risk of DM2 [57]. Interestingly, the remotion of all biochemical variables from the model, i.e. TSH2T, FT42T, TG2T and TG1T, reduces the predictive accuracy in both calibration and cross-validation, with AUCs of 0.914 (95% CI: 0.836–0.992, p<0.001****) and 0.819 (95% CI: 0.682–0.957, p<0.001***), respectively. This result reinforces the idea that thyroid markers are useful to predict GDM at the second trimester of pregnancy.

It is important to note that the classification models presented in this study were calibrated (i.e. developed or trained), optimized and then validated using the Chilean diagnostic criteria. Therefore, they should only be used to predict GDM under that criteria. Very few articles have proposed strategies to predict GDM in Chilean pregnant women. They used clinical parameters [23], plasma biomarkers [58], plasma extracellular vesicles [59], oral extracellular vesicles [60] and the combination of glycemia with BMI or gingival crevicular fluid placental growth factor (GCF-PlGF) [61]; and got AUCs of 0.739, 0.870, 0.798, 0.814, 0.828 and 0.898, respectively. Those studies employed preconception [23] or first trimester data [5861], and the only ML technique used was LR. None of them validated their models, hence, the reported AUCs are calibration AUCs. The first and second trimester models presented here have similar or better calibration AUCs. Moreover, they are cross-validated. As far as we know, this is the first study that develops and validates ML models for the prediction of GDM in Chilean subjects.

Post load glycemia prediction with the first trimester model yields a Spearman r correlation coefficient of 0.496 (95% CI: 0.282–0.663, p<0.001****) and 0.259 (95% CI: 0.011–0.477, p = 0.036*) in calibration and cross-validation, respectively. This regression model encompasses the top 7 most important predictors exhibited in Fig 3A, i.e. FTO genotype, past HELLP syndrome, prior septicemia, prior pleural effusion, past asthma, family history of DM2 and prior GDM. The contribution of each predictor to the final model is presented on Panel A in S3 Fig. The link between most of them and GDM was discussed above, except for prior septicemia, prior pleural effusion and past asthma. Prior septicemia tends to be more frequent in GDM than in NGT pregnancies (S1 Table). Even though its association with GDM has not been investigated, septicemia has been related to a higher risk of DM2 [62,63]. Likewise, prior pleural effusion leans to be more frequent in GDM than in NGT subjects (S1 Table). There are several long-term follow-up studies of patients affected with pleural effusion [6469], but none of them assessed diabetes or related conditions as outcomes. Hence, more studies are needed to explain the link between pleural effusion and GDM. At last, past asthma tends to be less common in GDM than in NGT pregnant women (S1 Table). In contrast, a meta-analysis reported that maternal asthma was associated with a higher risk of GDM [70]; however, it didn’t evaluate the risk of GDM by ethnicity. A different study revealed that Hispanic subjects with and without asthma had an increased risk of GDM, when compared to White subjects. Interestingly, in the Hispanic cohort, the non-asthmatics had a higher odds ratio (OR) for GDM than the asthmatics [71]. It is interesting to note that, as in the classification first trimester final model, the deletion of FTO genotype from the regression model is associated with an impaired predictive performance, with calibration and cross-validation Spearman r correlation coefficients of 0.375 (95% CI: 0.139–0.571, p<0.01**) and -0.081 (95% CI: -0.324–0.171, p˃0.05 ns), respectively.

The prediction of post load glycemia with the second trimester model gets calibration and cross-validation Spearman r correlation coefficients of 0.520 (95% CI: 0.311–0.681, p<0.001****) and 0.457 (95% CI: 0.234–0.634, p<0.001***), respectively. This regression model involves the top 9 most relevant variables shown in Fig 3B, i.e. TSH2T, FT42T, FBG2T, TT42T, FTO genotype, past HELLP syndrome, prior septicemia, prior pleural effusion and past asthma. The contribution of each predictor to the final model is exhibited on Panel B in S3 Fig. Their relation with GDM was already discussed. Remarkably, as in the classification second trimester final model, the remotion of all biochemical variables from the regression model, i.e. TSH2T, FT42T, FBG2T, TT42T and FTO genotype, leads to a reduced predictive performance, with Spearman r correlation coefficients of 0.425 (95% CI: 0.197–0.609, p<0.001***) and -0.679 (95% CI: -0.794–0.518, p<0.001****) in calibration and cross validation, respectively. This result supports the idea that thyroid-related markers are useful to predict post load glycemia at the second trimester of pregnancy.

It is important to note that, contrary to what happens in the case of classification models, the regression models presented in this work are not restricted to a specific diagnostic criteria. This is because they allow to predict the post load glycemia very value, and then that value can be interpreted as NGT or GDM depending on the criteria applied. The error associated with the prediction of post load glycemia (given in terms of RECV) using the best regression models of this study is 20.7% in the first trimester of pregnancy and 18.4% in the second trimester of pregnancy, regardless of the diagnostic criteria employed. Worldwide, few studies have proposed strategies to predict glycemia in the context of pregnancy. Most of them are focused on blood glucose monitoring for the management of GDM [7274], and not on the prediction of post load glycemia as an early detection or alternative screening tool for this pregnancy disorder. Furthermore, although some of them evaluate the correlation between different variables and fasting glycemia [7577], postprandial glycemia [78], glucose challenge test (GCT) post load glycemia [79,80] and OGTT post load glycemia [8183], they don’t test their predictive performance through error metrics like RMSE and RE. To our knowledge, this is the first study that introduces and properly assesses ML models to predict OGTT post load glycemia so as to make the early detection and screening of GDM independent of the diagnostic criteria.

3.6. Strengths and limitations of this study

This study has some limitations. The sample size is small, thus, the generalization of the findings described here should be made with moderation. Models built on a restricted number of subjects are prone to overfitting. Even though internal validation was carried out to reduce the risk of it, the possibility that the presented models may be associated with some degree of overfitting cannot be ruled out. Future external validation studies are needed to evaluate that possibility. In addition, some of the clinical variables used were self-reported. Hence, the association between those particular parameters and GDM should be interpreted with caution.

Despite its limitations, this investigation has several strengths. Differently from previous studies, it focuses on the capability of the whole thyroid profile to predict GDM, both separately and in association with other maternal variables. Moreover, this work considered thyroid markers from the first and the second trimester of pregnancy, i.e. it tested their potential for both the early detection and the alternative screening of GDM. In addition, it assessed the power of thyroid and other maternal parameters on the prediction of OGTT post load glycemia, which hadn’t been done before. All the presented models were cross-validated, which reduces the chance of overestimating their predictive performance. To our knowledge, this is the first study that performs such a systematic analysis to evaluate the effectiveness of thyroid variables for the prediction of GDM and OGTT post load glycemia.

4. Conclusion

In this work, the maternal thyroid profile from the first and the second trimester of pregnancy was methodically assessed for the prediction of GDM and OGTT post load glycemia, both alone and combined with other variables. Our results indicate that thyroid parameters are useful for the prediction of GDM and OGTT post load glycemia, especially at the second trimester of pregnancy. Therefore, these markers could be employed as an alternative screening tool for GDM, independently of the diagnostic criteria used. Further studies are required to confirm these findings in larger cohorts and different populations.

Despite its limitations, this investigation represents the first attempt to evaluate the effectiveness of the whole thyroid profile on the prediction of GDM and OGTT post load glycemia. Our best models achieved a good predictive performance for both outcomes. Future external validation studies are needed so as to corroborate their potential and bring them closer to clinical practice.

Supporting information

S1 Fig. Calibration performance of the final machine learning models that predict gestational diabetes mellitus and post load glycemia.

A-B) For the prediction of GDM in the first (A) or second (B) trimester of pregnancy by PLS-DA. C-D) For the prediction of post load glycemia in the first (C) or second (D) trimester of pregnancy by PLS. The four plots show the calibration performance of the final models with their full set of predictors (black) and after the remotion of biochemical markers (grey). 1T: First trimester. 2T: Second trimester. PLS-DA: Partial least squares discriminant analysis. PLS: Partial least squares. CV: Cross-validation.

https://doi.org/10.1371/journal.pone.0280513.s001

(TIF)

S2 Fig. Variable importance of the machine learning models that predict gestational diabetes mellitus with best performance after variable selection.

(A) Built on first trimester data only. (B) Including second trimester data. 1T: First trimester. 2T: Second trimester. PLS-DA: Partial least squares discriminant analysis. pGDM: Prior gestational diabetes mellitus. fDM2: Family history of type 2 diabetes. mAnemia: Personal anemia. FTO: Fat mass and obesity-associated genotype (rs9939609). TSH: Thyroid stimulating hormone. FT4: Free thyroxine. TG: Thyroglobulin. BMI: Body mass index. pHELLP: Prior hemolysis elevated liver enzymes and low platelets syndrome.

https://doi.org/10.1371/journal.pone.0280513.s002

(TIF)

S3 Fig. Variable importance of the machine learning models that predict post load glycemia with best performance after variable selection.

(A) Built on first trimester data only. (B) Including second trimester data. 1T: First trimester. 2T: Second trimester. PLS: Partial least squares. FTO: Fat mass and obesity-associated genotype (rs9939609). mAsthma: Personal asthma. pGDM: Prior gestational diabetes mellitus. pHELLP: Prior hemolysis elevated liver enzymes and low platelets syndrome. pSepticemia: Prior septicemia. pPEffusion: Prior pleural effusion. fDM2: Family history of type 2 diabetes. TSH: Thyroid stimulating hormone. FBG: Fasting blood glucose. FT4: Free thyroxine. TT4: Total thyroxine.

https://doi.org/10.1371/journal.pone.0280513.s003

(TIF)

S1 Table. Maternal variables that were recorded in both normal glucose tolerance and gestational diabetes mellitus pregnancies.

NGT: Normal glucose tolerance. GDM: Gestational diabetes mellitus. BMI: Body mass index. HELLP: Hemolysis elevated liver enzymes and low platelets syndrome. IUGR: Intrauterine growth restriction. PCOS: Polycystic ovary syndrome. DM1: Type 1 diabetes. DM2: Type 2 diabetes. FTO: Fat mass and obesity-associated gene. TSH: Thyroid stimulating hormone. TT3: Total triiodothyronine. TT4: Total thyroxine. FT4: Free T4. TG: Thyroglobulin. aTG: TG antibody. aTPO thyroid peroxidase antibody. TRAb: TSH receptor antibody. OGTT: Oral glucose tolerance test. Qualitative variables are presented as percentage (proportion); quantitative variables with normal distribution, as mean ± standard deviation; and quantitative variables with non-normal distribution, as median (interquartile range). NS: Not significant. * p<0.05. ** p<0.01. **** p<0.0001.

https://doi.org/10.1371/journal.pone.0280513.s004

(DOCX)

S2 Table. Maternal thyroid variables that differed significantly between normal glucose tolerance and gestational diabetes mellitus pregnancies.

TSH: Thyroid stimulating hormone. TT3: Total triiodothyronine. FT4: Free thyroxine. aTG: Thyroglobulin antibody. Quantitative variables with non-normal distribution are presented as median (interquartile range). NS: Not significant. * p<0.05. ** p<0.01. *** p<0.001. **** p<0.0001.

https://doi.org/10.1371/journal.pone.0280513.s005

(DOCX)

S3 Table. Sensitivity and specificity of machine learning models that predict gestational diabetes mellitus based on different maternal variables.

a Model with the highest cross-validation non-error rate (CV-NER) for GDM prediction, using 1T data only. b Model with the highest CV-NER for GDM prediction, including 2T data. LR: Logistic regression. L-SVM: Linear support vector machine. PLS-DA: Partial least squares discriminant analysis. CART: Classification and regression tree. XGB: Extreme gradient boosting. 1T: First trimester. 2T: Second trimester. Thy: Thyroid predictors. NoThy: Non-thyroid predictors.

https://doi.org/10.1371/journal.pone.0280513.s006

(DOCX)

S4 Table. Root mean square error of machine learning models that predict post load glycemia based on different maternal variables.

a Model with the lowers RE for GDM prediction, using 1T data only. b Model with the lowest RE for GDM prediction, including 2T data. PLS: Partial least squares. RMSEC: Root mean square error of calibration. RMSECV: Root mean square error of cross-validation. 1T: First trimester. 2T: Second trimester. Thy: Thyroid predictors. NoThy: Non-thyroid predictors.

https://doi.org/10.1371/journal.pone.0280513.s007

(DOCX)

S5 Table. Sensitivity, specificity, non-error rate, root mean square error and relative error of machine learning models that predict gestational diabetes mellitus and post load glycemia before and after variable selection.

a For PLS-DA: NoThy1T + Thy1T. For PLS: NoThy1T. b Model with the lowest RE for post load glycemia prediction, using 1T data only. Predictors: FTO, pHELLP, pSepticemia, pPEffusion, mAsthma, fDM2, pGDM. c Model with the highest NER for GDM prediction, using 1T data only. Predictors: mAnemia, FTO, pGDM, fDM2. d For PLS-DA: NoThy1T + Thy1T + Thy2T. For PLS: NoThy1T + NoThy2T + Thy2T. e Model with the highest NER for GDM prediction, including 2T data. Predictors: TSH2T, pGDM, fDM2, FT42T, mAnemia, TG2T, TG1T, BMI, pHELLP. f Model with the lowest RE for post load glycemia prediction, including 2T data. Predictors: TSH2T, FT42T, FBG2T, TT42T, FTO, pHELLP, pSepticemia, pPEffusion, mAsthma. PLS-DA: Partial least squares discriminant analysis. PLS: Partial least squares. Se: Sensitivity. Sp: Specificity. NER: Non-error rate. RMSE: Root mean square error. RE: Relative error. NoThy: Non-thyroid predictors. Thy: Thyroid predictors. 1T: First trimester. 2T: Second trimester. FTO: Fat mass and obesity-associated genotype (rs9939609). pHELLP: Prior hemolysis elevated liver enzymes and low platelets syndrome. pSepticemia: Prior septicemia. pPEffusion: Prior pleural effusion. mAsthma: Personal asthma. fDM2: Family history of type 2 diabetes. pGDM: Prior gestational diabetes mellitus. mAnemia: Personal anemia. TSH: Thyroid stimulating hormone. FT4: Free thyroxine. TG: Thyroglobulin. BMI: Body mass index. FBG: Fasting blood glucose. TT4: Total thyroxine.

https://doi.org/10.1371/journal.pone.0280513.s008

(DOCX)

S1 Dataset. Full data analyzed in this study.

FTO: Fat mass and obesity-associated. BMI: Body mass index. TSH: Thyroid stimulating hormone. T3: Triiodothyronine. T4: Thyroxine. TG: Thyroglobulin. aTG: TG antibody. aTPO: Thyroid peroxidase antibody. TRAb: TSH receptor antibody. GDM: Gestational diabetes mellitus. HELLP: Hemolysis elevated liver enzymes and low platelets. IUGR: Intrauterine growth restriction. PCOS: Polycystic ovary syndrome. DM1: Type 1 diabetes. DM2: Type 2 diabetes. OGTT: Oral glucose tolerance test. 1T: First trimester. 2T: Second trimester.

https://doi.org/10.1371/journal.pone.0280513.s009

(XLSX)

Acknowledgments

The authors would like to thank all the pregnant women who voluntarily participated in this study; and to the health staff of the primary health centers (CESFAM) Victor Manuel Fernández, Santa Sabina and Tucapel, who collaborated in the collection and the initial storage of blood samples, and the retrieval of institution-derived maternal data.

References

  1. 1. McIntyre HD, Catalano P, Zhang C, Desoye G, Mathiesen ER, Damm P. Gestational diabetes mellitus. Nat Rev Dis Prim. 2019;5: 47. pmid:31296866
  2. 2. Metzger B, Lowe L, Dyer A, Trimble E, Chaovarindr U, Coustan D, et al. Hyperglycemia and Adverse Pregnancy Outcomes. N Engl J Med. 2008;358: 1991–2002. pmid:18463375
  3. 3. Lowe WL, Scholtens DM, Lowe LP, Kuang A, Nodzenski M, Talbot O, et al. Association of gestational diabetes with maternal disorders of glucose metabolism and childhood adiposity. JAMA—J Am Med Assoc. 2018;320: 1005–1016. pmid:30208453
  4. 4. Lowe WL, Scholtens DM, Kuang A, Linder B, Lawrence JM, Lebenthal Y, et al. Hyperglycemia and adverse Pregnancy Outcome follow-up study (HAPO FUS): Maternal gestational diabetes mellitus and childhood glucose metabolism. Diabetes Care. 2019;42: 372–380. pmid:30655380
  5. 5. Bzdok D, Altman N, Krzywinski M. Statistics versus machine learning. Nat Methods. 2018;15: 233–234. pmid:30100822
  6. 6. Artzi NS, Shilo S, Hadar E, Rossman H, Barbash-Hazan S, Ben-Haroush A, et al. Prediction of gestational diabetes based on nationwide electronic health records. Nat Med. 2020;26: 71–76. pmid:31932807
  7. 7. Di Farrar, Simmonds M, Bryant M, Lawlor DA, Dunne F, Tuffnell D, et al. Risk factor screening to identify women requiring oral glucose tolerance testing to diagnose gestational diabetes: A systematic review and meta-analysis and analysis of two pregnancy cohorts. PLoS One. 2017;12: 1–17. pmid:28384264
  8. 8. Mennickent D, Rodríguez A, Farías-Jofré M, Araya J, Guzmán-Gutiérrez E. Machine learning-based models for gestational diabetes mellitus prediction before 24–28 weeks of pregnancy: A review. Artif Intell Med. 2022;132: 102378. pmid:36207076
  9. 9. Yang Y, Li Q, Wang Q, Ma X. Thyroid antibodies and gestational diabetes mellitus: A meta-analysis. Fertil Steril. 2015;104: 665–671.e3. pmid:26151621
  10. 10. Jia M, Wu Y, Lin B, Shi Y, Zhang Q, Lin Y, et al. Meta-analysis of the association between maternal subclinical hypothyroidism and gestational diabetes mellitus. Int J Gynecol Obstet. 2019;144: 239–247. pmid:30578669
  11. 11. Kent NL, Young SL, Akison LK, Cuffe JSM. Is the link between elevated TSH and gestational diabetes mellitus dependant on diagnostic criteria and thyroid antibody status: a systematic review and meta-analysis. Endocrine. 2021;74: 38–49. pmid:33991314
  12. 12. Luo J, Wang X, Yuan L, Guo L. Association of thyroid disorders with gestational diabetes mellitus: a meta-analysis. Endocrine. 2021;73: 550–560. pmid:33983493
  13. 13. Sert UY, Buyuk GN, Engin Ustun Y, Ozgu Erdinc AS. Is there any relationship between thyroid function abnormalities, thyroid antibodies and development of gestational diabetes mellitus (GDM) in pregnant women? Medeni Med J. 2020;35: 195–201. pmid:33110671
  14. 14. Tang L, Li P, Zhou H, Li L. A longitudinal study of thyroid markers during pregnancy and the risk of gestational diabetes mellitus and post-partum glucose metabolism. Diabetes Metab Res Rev. 2021;37. pmid:33486811
  15. 15. Yanachkova V, Kamenov Z. The relationship between thyroid dysfunction during pregnancy and gestational diabetes mellitus. Endokrynol Pol. 2021;72: 226–231. pmid:33619713
  16. 16. Zhu B, Han Y, Deng F, Huang K, Yan S, Hao J, et al. The role of triiodothyronine (T3) and t3/free thyroxine (ft4) in glucose metabolism during pregnancy: The ma’anshan birth cohort study. Endocr Connect. 2021;10: 685–693. pmid:34077392
  17. 17. Qiu H, Yu HY, Wang LY, Yao Q, Wu SN, Yin C, et al. Electronic Health Record Driven Prediction for Gestational Diabetes Mellitus in Early Pregnancy. Sci Rep. 2017;7: 1–13. pmid:29180800
  18. 18. McLaren R, Haberman S, Moscu M, Atallah F, Friedmann H. A Novel and Precise Profiling Tool to Predict Gestational Diabetes. J Diabetes Sci Technol. 2021;15: 891–896. pmid:32787448
  19. 19. Wu YT, Zhang CJ, Mol BW, Kawai A, Li C, Chen L, et al. Early Prediction of Gestational Diabetes Mellitus in the Chinese Population via Advanced Machine Learning. J Clin Endocrinol Metab. 2021;106: E1191–E1205. pmid:33351102
  20. 20. Grobman WA, Stamilio DM. Methods of clinical prediction. Am J Obstet Gynecol. 2006;194: 888–894. pmid:16522430
  21. 21. Villena Gonzales W, Mobashsher A, Abbosh A. The Progress of Glucose Monitoring—A Review of Invasive to Minimally and Non-Invasive Techniques, Devices and Sensors. Sensors. 2019;19: 800. pmid:30781431
  22. 22. Araya J, Rodriguez A, Lagos-SanMartin K, Mennickent D, Gutiérrez-Vega S, Ortega-Contreras B, et al. Maternal thyroid profile in first and second trimester of pregnancy is correlated with gestational diabetes mellitus through machine learning. Placenta. 2021;103. pmid:33099203
  23. 23. Garmendia ML, Mondschein S, Montiel B, Kusanovic JP. Trends and predictors of gestational diabetes mellitus in Chile. Int J Gynecol Obstet. 2020;148: 210–218. pmid:31671205
  24. 24. Tiongco RE, Arceo E, Clemente B, Pineda-Cortel MR. Association of maternal iron deficiency anemia with the risk of gestational diabetes mellitus: a meta-analysis. Arch Gynecol Obstet. 2019;299: 89–95. pmid:30315412
  25. 25. Behere R V., Deshmukh AS, Otiv S, Gupte MD, Yajnik CS. Maternal Vitamin B12 Status During Pregnancy and Its Association With Outcomes of Pregnancy and Health of the Offspring: A Systematic Review and Implications for Policy in India. Front Endocrinol (Lausanne). 2021;12: 1–18. pmid:33912132
  26. 26. Ríos-Castillo I, Brito A, Olivares M, López-de Romaña D, Pizarro F. Low prevalence of iron deficiency anemia between 1981 and 2010 in chilean women of childbearing age. Salud Publica Mex. 2013;55: 478–483. pmid:24626618
  27. 27. Atalah E, Amigo H, Bustos P. Does Chile’s nutritional situation constitute a double burden? Am J Clin Nutr. 2014;100: 1623S–1627S. pmid:25411304
  28. 28. Lin Z, Wang Y, Zhang B, Jin Z. Association of type 2 diabetes susceptible genes GCKR, SLC30A8, and FTO polymorphisms with gestational diabetes mellitus risk: a meta-analysis. Endocrine. 2018;62: 34–45. pmid:30091126
  29. 29. He H, ting Cao W, hui Zeng Y, qing Huang Z, rong Du W, di Guan N, et al. Lack of associations between the FTO polymorphisms and gestational diabetes: A meta-analysis and trial sequential analysis. Gene. 2018;677: 169–175. pmid:30055308
  30. 30. Guo F, Long W, Zhou W, Zhang B, Liu J, Yu B. FTO, GCKR, CDKAL1 and CDKN2A/B gene polymorphisms and the risk of gestational diabetes mellitus: a meta-analysis. Arch Gynecol Obstet. 2018;298: 705–715. pmid:30074065
  31. 31. Petermann F, Villagrán M, Troncoso C, Mardones L, Leiva AM, Martínez MA, et al. Asociación entre el polimorfismo rs9939609 del gen FTO y marcadores de adiposidad en población adulta chilena. Rev Med Chil. 2018;146: 717–726. pmid:30148903
  32. 32. Gutierrez-Vega S, Armella A, Mennickent D, Loyola M, Covarrubias A, Ortega-Contreras B, et al. High levels of maternal total tri-iodothyronine, and low levels of fetal free L-thyroxine and total tri-iodothyronine, are associated with altered deiodinase expression and activity in placenta with gestational diabetes mellitus. PLoS One. 2020;15: 1–15. pmid:33232364
  33. 33. Moleti M, Trimarchi F, Vermiglio F. Thyroid physiology in pregnancy. Endocr Pract. 2014;20: 589–596. pmid:24449667
  34. 34. Haddow JE, Lambert-Messerlian G, Eklund E, Neveux LM, Palomaki GE. Peripheral deiodinase activity: A potential explanation for the association between maternal weight and gestational hyperglycemia. Obstet Med. 2018;11: 73–78. pmid:29997689
  35. 35. Haddow JE, Metzger BE, Lambert-Messerlian G, Eklund E, Coustan D, Catalano P, et al. Maternal BMI, Peripheral Deiodinase Activity, and Plasma Glucose: Relationships between White Women in the HAPO Study. J Clin Endocrinol Metab. 2019;104: 2593–2600. pmid:30753726
  36. 36. Wang Y, Lu S, Xu X, Zhang L, Yang J, Hu W. The interactive effects of pre-pregnancy body mass index, thyroid function, and blood lipid levels on the risk of gestational diabetes mellitus: a crossover analysis. BMC Pregnancy Childbirth. 2022;22: 1–10. pmid:35858832
  37. 37. Zhang S, Wu Y, Pan W, Li G, Zhang D, Li S, et al. Free-Triiodothyronine to Free-Thyroxine Ratio Mediated the Effect of Prepregnancy Body Mass Index or Maternal Weight Gain During Early Pregnancy on Gestational Diabetes Mellitus. Endocr Pract. 2022;28: 398–404. pmid:34995786
  38. 38. Han SM, Han JH, Park JA, Quinn FA, Park J, Oh E. Longitudinal evaluation of thyroid autoimmunity and function in pregnant Korean women. Clin Chem Lab Med. 2013;51: 2295–2301. pmid:23979124
  39. 39. Ekinci EI, Chiu WL, Lu ZX, Sikaris K, Churilov L, Bittar I, et al. A longitudinal study of thyroid autoantibodies in pregnancy: The importance of test timing. Clin Endocrinol (Oxf). 2015;82: 604–610. pmid:25079145
  40. 40. Ollero MD, Pineda J, Martínez de Esteban JP, Toni M, Espada M, Anda E. Optimization of the follow-up of pregnant women with autoimmune thyroid disease. Endocrinol Diabetes y Nutr (English ed). 2019;66: 305–311.
  41. 41. Balucan FS, Morshed SA, Davies TF. Thyroid autoantibodies in pregnancy: Their role, regulation and clinical relevance. J Thyroid Res. 2013;2013. pmid:23691429
  42. 42. Sharma S, Banerjee S, Krueger PM, Blois SM. Immunobiology of Gestational Diabetes Mellitus in Post-Medawar Era. Front Immunol. 2022;12: 1–10. pmid:35046934
  43. 43. Ying H, Tang YP, Bao YR, Su XJ, Cai XY, Li YH, et al. Maternal TSH level and TPOAb status in early pregnancy and their relationship to the risk of gestational diabetes mellitus. Endocrine. 2016;54: 742–750. pmid:27423217
  44. 44. Deng S-Q, Chen H-T, Wang D-Y, Liu B, Chen H-Q, Wang Z-L. Maternal Thyroid-Stimulating Hormone Level and Thyroid Peroxidase Antibody Status in the First and Second Trimester of Pregnancy and Their Relationship with the Risk of Gestational Diabetes Mellitus. Matern Med. 2019;1: 81–85.
  45. 45. Haddow JE, Craig WY, Neveux LM, Palomaki GE, Lambert-Messerlian G, Malone FD, et al. Free thyroxine during early pregnancy and risk for gestational diabetes. PLoS One. 2016;11: 1–11. pmid:26910563
  46. 46. Ballabio D, Grisoni F, Todeschini R. Multivariate comparison of classification performance measures. Chemom Intell Lab Syst. 2018;174: 33–44.
  47. 47. Bell GA, Männistö T, Liu A, Kannan K, Yeung EH, Kim UJ, et al. The joint role of thyroid function and iodine concentration on gestational diabetes risk in a population-based study. Acta Obstet Gynecol Scand. 2019;98: 500–506. pmid:30580457
  48. 48. Wang J, Gong XH, Peng T, Wu JN. Association of Thyroid Function During Pregnancy With the Risk of Pre-eclampsia and Gestational Diabetes Mellitus. Endocr Pract. 2021;27: 819–825. pmid:33831553
  49. 49. Castillo C, Lustig N, Margozzini P, Gomez A, Rojas MP, Muzzo S, et al. Thyroid-stimulating hormone reference ranges in the first trimester of pregnancy in an iodine-sufficient country. Endocrinol Metab. 2018;33: 466–472. pmid:30513561
  50. 50. Li M, Lan JR, Liang JL, Xiong XL. Diagnostic accuracy of fasting plasma glucose as a screening test for gestational diabetes mellitus: A systematic review and meta-analysis. Eur Rev Med Pharmacol Sci. 2020;24: 11172–11186. pmid:33215435
  51. 51. Botchkarev A. A new typology design of performance metrics to measure errors in machine learning regression algorithms. Interdiscip J Information, Knowledge, Manag. 2019;14: 45–76.
  52. 52. Ouzilleau C, Roy MA, Leblanc L, Carpentier A, Maheux P. An observational study comparing 2-hour 75-g oral glucose tolerance with fasting plasma glucose in pregnant women: Both poorly predictive of birth weight. Can Med Assoc J. 2003;168: 403–409. pmid:12591779
  53. 53. Catalano PM, Tyzbir ED, Wolfe RR, Calles J, Roman NM, Amini SB, et al. Carbohydrate metabolism during pregnancy in control subjects and women with gestational diabetes. Am J Physiol—Endocrinol Metab. 1993;264. pmid:8430789
  54. 54. Wong CH, Chen CP, Sun FJ, Chen CY. Comparison of placental three-dimensional power Doppler indices and volume in the first and the second trimesters of pregnancy complicated by gestational diabetes mellitus. J Matern Neonatal Med. 2019;32: 3784–3791. pmid:29716432
  55. 55. Sovio U, Murphy HR, Smith GCS. Accelerated fetal growth prior to diagnosis of gestational diabetes mellitus: A prospective cohort study of nulliparous women. Diabetes Care. 2016;39: 982–987. pmid:27208333
  56. 56. Weitgasser R, Spitzer D, Kartnig I, Zajc M, Staudach A, Sandhofer F. Association of HELLP syndrome with autoimmune antibodies and glucose intolerance. Diabetes Care. 2000;23: 786–790. pmid:10840997
  57. 57. Lykke JA, Langhoff-Roos J, Sibai BM, Funai EF, Triche EW, Paidas MJ. Hypertensive pregnancy disorders and subsequent cardiovascular morbidity and type 2 diabetes mellitus in the mother. Hypertension. 2009;53: 944–951. pmid:19433776
  58. 58. Correa PJ, Venegas P, Palmeiro Y, Albers D, Rice G, Roa J, et al. First trimester prediction of gestational diabetes mellitus using plasma biomarkers: a case-control study. J Perinat Med. 2019;47: 161–168. pmid:30205647
  59. 59. Arias M, Monteiro LJ, Acuña-Gallardo S, Varas-Godoy M, Rice GE, Monckeberg M, et al. Vesículas extracelulares como predictores tempranos de diabetes gestacional. Rev Med Chil. 2019;147: 1503–1509. pmid:32186613
  60. 60. Monteiro LJ, Varas-Godoy M, Monckeberg M, Realini O, Hernández M, Rice G, et al. Oral extracellular vesicles in early pregnancy can identify patients at risk of developing gestational diabetes mellitus. PLoS One. 2018;14: 1–13. pmid:31242249
  61. 61. Chaparro A, Zúñiga E, Varas-Godoy M, Albers D, Ramírez V, Hernández M, et al. Periodontitis and placental growth factor in oral fluids are early pregnancy predictors of gestational diabetes mellitus. J Periodontol. 2018;89: 1052–1060. pmid:29790168
  62. 62. Dombrovskiy VY, Martin AA, Sunderram J, Paz HL. Occurrence and outcomes of sepsis: Influence of race. Crit Care Med. 2007;35: 763–768. pmid:17255870
  63. 63. Hsu CW, Lin CS, Chen SJ, Lin SH, Lin CL, Kao CH. Risk of type 2 diabetes mellitus in patients with acute critical illness: a population-based cohort study. Intensive Care Med. 2016;42: 38–45. pmid:26370689
  64. 64. Ferrer JS, Muñoz XG, Orriols RM, Light RW, Morell FB. Evolution of Idiopathic Pleural Effusion: A Prospective, Long-term Follow-up Study. Chest. 1996;109: 1508–1513. pmid:8769502
  65. 65. Janssen JP, Ramlal S, Mravunac M. The long-term follow up of exudative pleural effusion after nondiagnostic thoracoscopy. J Bronchol. 2004;11: 169–174.
  66. 66. Gunluoglu G, Olcmen A, Gunluoglu MZ, Dincer I, Sayar A, Camsari G, et al. Long-term Outcome of Patients With Undiagnosed Pleural Effusion. Arch Bronconeumol. 2015;51: 632–636. pmid:26216715
  67. 67. Chan CHS, Arnold M, Chan CY, Mak TWL, Hoheisel GB. Clinical and pathological features of tuberculous pleural effusion and its long-term consequences. Respiration. 1991;58: 171–175. pmid:1745850
  68. 68. Kataoka H. Ultrasound Pleural Effusion Sign as a Useful Marker for Identifying Heart Failure Worsening in Established Heart Failure Patients During Follow-Up. Congest Hear Fail. 2012;18: 272–277. pmid:22994441
  69. 69. Hung TH, Tseng CW, Tsai CC, Tsai CC, Tseng KC, Hsieh YH. The long-term outcomes of cirrhotic patients with pleural effusion. Saudi J Gastroenterol. 2018;24: 46–51. pmid:29451184
  70. 70. Wang G, Murphy VE, Namazy J, Powell H, Schatz M, Chambers C, et al. The risk of maternal and placental complications in pregnant women with asthma: A systematic review and meta-analysis. J Matern Neonatal Med. 2014;27: 934–942. pmid:24111742
  71. 71. Flores KF, Robledo CA, Hwang BS, Leishear K, Laughon Grantz K, Mendola P. Does maternal asthma contribute to racial/ethnic disparities in obstetrical and neonatal complications? Ann Epidemiol. 2015;25: 392–397.e1. pmid:25724829
  72. 72. Pustozerov E, Popova P, Tkachuk A, Bolotko Y, Yuldashev Z, Grineva E. Development and evaluation of a mobile personalized blood glucose prediction system for patients with gestational diabetes mellitus. JMIR mHealth uHealth. 2018;6. pmid:29317385
  73. 73. Pustozerov E, Tkachuk A, Vasukova E, Dronova A, Shilova E, Anopova A, et al. The role of glycemic index and glycemic load in the development of real-time postprandial glycemic response prediction models for patients with gestational diabetes. Nutrients. 2020;12. pmid:31979294
  74. 74. Seo W, Park SW, Kim N, Jin SM, Park SM. A personalized blood glucose level prediction model with a fine-tuning strategy: A proof-of-concept study. Comput Methods Programs Biomed. 2021;211: 106424. pmid:34598081
  75. 75. Raúl Delgado M, Victoria Novik A, Felipe Cardemil M, Diego Santander A. Utilidad de la medición de fructosamina como indicador de control en pacientes con diabetes gestacional y pregestacional. Rev Med Chil. 2011;139: 1444–1450.
  76. 76. Kizilgul M, Kan S, Beysel S, Apaydin M, Ozcelik O, Caliskan M, et al. Is fibroblast growth factor 23 a new cardiovascular risk marker in gestational diabetes? Arch Endocrinol Metab. 2017;61: 562–566. pmid:28977159
  77. 77. Ren Z, Zhe D, Li Z, Sun X-P, Yang K, Lin L. Study on the correlation and predictive value of serum pregnancy-associated plasma protein A, triglyceride and serum 25-hydroxyvitamin D levels with gestational diabetes mellitus. World J Clin Cases. 2020;8: 864–873. pmid:32190623
  78. 78. Scioscia M, Kunjara S, Gumaa K, McLean P, Rodeck CH, Rademacher TW. Urinary excretion of inositol phosphoglycan P-type in gestational diabetes mellitus. Diabet Med. 2007;24: 1300–1304. pmid:17956457
  79. 79. Shushan A, Samueloff A. Correlation between fasting glucose in the first trimester and glucose challenge test in the second. Obstet Gynecol. 1998;91: 596–599. pmid:9540948
  80. 80. Kianpour M, Saadatmand F, Nematbakhsh M, Fahami F. Relationship between c-reactive protein and screening test results of gestational diabetes in pregnant women referred to health centers in Isfahan in 2013–2014. Iran J Nurs Midwifery Res. 2019;24: 360–364. pmid:31516522
  81. 81. Eid J, Kechichian T, Benavides E, Thibodeaux L, Salazar AE, Saade GR, et al. The Quantose Insulin Resistance Test for Maternal Insulin Resistance: A Pilot Study. Am J Perinatol. 2020;1. pmid:32894869
  82. 82. Erbağcı MO, Tuna G, Köse S, Dal-Bekar NE, Akış M, Kant M, et al. Association between early oxidative DNA damage and iron status in women with gestational diabetes mellitus. Reprod Toxicol. 2021;103: 171–180. pmid:34051274
  83. 83. Wu Y, Bible PW, Long S, kit Ming W, Ding W, Long Y, et al. Metagenomic analysis reveals gestational diabetes mellitus-related microbial regulators of glucose tolerance. Acta Diabetol. 2020;57: 569–581. pmid:31820107