Development of machine learning models for predicting early pregnancy outcomes based on β-hCG, progesterone, and estradiol

Luming Chen; Fangxiang Mu; Kexin Wang; Fang Wang

doi:10.1371/journal.pone.0348114

Abstract

Objective

This study aims to develop a machine learning model for predicting early pregnancy outcomes by combining baseline levels and dynamic changes of β-human chorionic gonadotropin (β-hCG), progesterone (P), and estradiol (E2).

Methods

This retrospective study screened out 421 patients treated at the Lanzhou University Second Hospital between March 2023 and August 2024. Feature selection was performed using Least Absolute Shrinkage and Selection Operator (LASSO) and Random Forest Recursive Feature Elimination (RF-RFE). Subsequently, we constructed a traditional logistic regression model and five machine learning models: Random Forest (RF), eXtreme Gradient Boosting (XGBoost), k-Nearest Neighbors (KNN), Multilayer Perceptron (MLP) neural network, and Support Vector Machine (SVM). Internal validity was assessed through 5-fold cross-validation. Model performance was measured by the area under the Receiver Operating Characteristic curve (AUC), accuracy, precision, sensitivity, and specificity.

Results

Among the 421 enrolled patients, 263 had ongoing pregnancies while 158 experienced early pregnancy loss (EPL). LR, RF, XGBoost, KNN, MLP, and SVM achieved AUCs of 0.750, 0.784, 0.750, 0.706, 0.755, and 0.749, respectively, with all accuracy and precision metrics exceeding 0.60. Notably, the RF model yielded optimal performance for EPL prediction, attaining the highest AUC (0.784), accuracy (0.729), and precision (0.724).

Conclusion

Integrating dynamic changes in β-hCG, P, and E2 enables effective prediction of early pregnancy outcomes. The RF model exhibited optimal performance, highlighting its potential for clinical implementation as a risk stratification tool based on serial hormone monitoring.

Citation: Chen L, Mu F, Wang K, Wang F (2026) Development of machine learning models for predicting early pregnancy outcomes based on β-hCG, progesterone, and estradiol. PLoS One 21(4): e0348114. https://doi.org/10.1371/journal.pone.0348114

Editor: Rajakumar Anbazhagan, National Institute of Child Health and Human Development (NICHD), NIH, UNITED STATES OF AMERICA

Received: November 23, 2025; Accepted: April 10, 2026; Published: April 27, 2026

Copyright: © 2026 Chen et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: All relevant data are within the manuscript and its Supporting information files.

Funding: Medical Innovation and Development Project of Lanzhou University (Grant No. lzuyxcx-2022-137).

Competing interests: The authors have declared that no competing interests exist.

1. Introduction

Early pregnancy loss (EPL) refers to a non-viable intrauterine pregnancy occurring within the first 12 weeks of gestation, characterized by either an empty gestational sac or a gestational sac containing an embryo or fetus without cardiac activity [1]. Prospective studies indicate that EPL affects 15%–20% of clinically recognized pregnancies [2]. Beyond potential physiological symptoms such as bleeding and abdominal pain, EPL can exert profound psychological consequences on patients and their families, including grief, guilt, depression, and anxiety [3,4]. Therefore, early and accurate prediction of pregnancy outcomes is crucial for enabling timely clinical intervention and providing effective psychological support.

Currently, various biomarkers are utilized to assess early pregnancy development and predict pregnancy outcomes. β-human chorionic gonadotropin (β-hCG) is a key hormone secreted by trophoblast cells following implantation. Evidence indicates that unhealthy pregnancies exhibit slower rises or even declines in maternal serum β-hCG levels, often lacking the characteristic doubling increase [5]. Progesterone (P) is crucial for endometrial receptivity, implantation, and maintaining pregnancy [6]. Ucyigit et al. confirmed that women with lower P levels in early pregnancy suffer an increased risk of pregnancy loss [7]. Estradiol (E2) levels reflect follicular quality and corpus luteum function, contributing to luteal maintenance. Serum E2 levels are typically significantly lower in women experiencing pregnancy loss compared to those with normal pregnancies [8]. Despite their use, the clinical application of these biomarkers faces challenges. Bobdiwala et al. found that single-threshold levels for β-hCG, β-hCG ratios, and progesterone were clinically unreliable for determining early pregnancy viability [9], findings corroborated by Pillai et al. [10]. This highlights the limited predictive utility of traditional approaches relying on single-timepoint measurements or individual biomarkers.

Therefore, the development of a reliable, early-warning method for EPL is imperative to enable timely clinical intervention. This study aims to establish machine learning models for predicting early pregnancy outcomes by integrating both static and dynamic features of β-hCG, P, and E2.

2. Materials and methods

2.1. Participants

This retrospective study consecutively enrolled 1,865 patients with previous pregnancy outcomes treated at the Department of Reproductive Medicine, Lanzhou University Second Hospital, between 11 March 2023 and 26 August 2024. Inclusion criteria were as follows: (1) age between 18 and 45 years; (2) natural intrauterine conception; (3) available data on serum levels of β-hCG, P, E2, and pregnancy outcomes; (4) at least one set of alternate-day measurements of β-hCG, P, and E2 obtained within the first 12 gestational weeks; and (5) a history of pregnancy loss. The exclusion criteria were: (1) adverse pregnancy events occurred after 12 weeks of gestation; (2) chromosomal abnormalities in either parent or the embryo; (3) congenital uterine malformations (including septate, unicornuate, bicornuate, or didelphys uterus); (4) multiple gestations; (5) infertility; and (6) missing or insufficiently frequent data on β-hCG, P, or E2 levels. Ultimately, 421 patients were included in the study and categorized into the EPL group (n = 158) and the ongoing pregnancy group (n = 263), according to pregnancy outcomes at 12 weeks (Fig 1). Approval for the study was granted by the Ethics Committee at the Lanzhou University Second Hospital (Reference No. 2019A-231), and all patients provided their written informed consent.

Download:

Fig 1. Flowchart.

LR: Logistic regression; RF: Random forest; XGBoost: eXtreme Gradient Boosting; KNN: k-nearest neighbors; SVM: Support vector machine; MLP: Multilayer perceptron.

https://doi.org/10.1371/journal.pone.0348114.g001

2.2. Data collection

Maternal characteristics, including age, weight, height, age at menarche, regularity of menstruation, and history of pregnancy loss, were derived from medical records. Body mass index (BMI) was defined as weight divided by the square of height (kg/m²). Pregnancy outcomes were determined through follow-up or by reviewing medical records from the hospital.

2.3. Main outcome

We defined EPL as pregnancy loss before 12 weeks of gestation, which included biochemical pregnancies. Conversely, an ongoing pregnancy is referred to as a viable pregnancy confirmed at the 12-week gestation. In this study, the pregnancy outcomes of all the research subjects were comprehensively determined by the specialists in the Department of Reproductive Medicine, Lanzhou University Second Hospital, based on the results of ultrasound examinations, a series of serum hormone test data, and clinical symptoms. Ultrasound examinations served as the core supporting evidence for diagnosing pregnancy outcomes, providing a crucial guarantee for the accuracy of the research results.

2.4. Serum β-hCG, P, and E2 measurements

Hormonal assessments included initial and alternate-day β-hCG levels, P levels, and E2 levels. All measurements were performed at the Reproductive Center of Lanzhou University Second Hospital, by trained personnel using validated automated immunoassay systems. First and second log(β-hCG) refer to the log-transformed values of two β-hCG measurements taken at a 48-hour interval; β-hCG ratio refers to the ratio of the second β-hCG value to the first; interday estradiol/progesterone level difference is calculated as the difference between the second and first measurements.

2.5. Statistical analysis

2.5.1. Data analysis.

Statistical analysis was performed using R version 4.3.1. Continuous variables were compared using the Mann-Whitney U test or t-test, while categorical variables were compared using the Chi-square or Fisher’s exact tests. Continuous variables with a normal distribution were presented as means with standard deviations (SD), while those with a skewed distribution were presented as medians with interquartile ranges (IQR). Categorical variables were shown as frequencies (percentages). Associations between variables were assessed by Pearson correlation. A two-tailed P < 0.05 is deemed statistically significant.

All statistical analyses were conducted using R software (version 4.3.1; R Core Team, 2023) within the RStudio integrated development environment (version 2023.03.0; RStudio Team, 2023). Specifically, the glmnet package (v4.1-7) was used for LASSO regression, while the caret package (v6.0-94) supported Random Forest-Recursive Feature Elimination (RF-RFE) and model training. The pROC package (v1.18.2) was utilized for receiver operating characteristic (ROC) curve analysis and area under the curve (AUC) calculation. Model construction was facilitated through randomForest (v4.7-1.1), xgboost (v1.7.6.1), and e1071 (v1.7-13) for Random Forest, Extreme Gradient Boosting, and Support Vector Machine algorithms, respectively. Additionally, the class package (v7.3-22) provided the k-Nearest Neighbors algorithm, neuralnet (v1.44.2) was used for Multilayer Perceptron training, and vip (v0.4.1) was employed to visualize variable importance.

2.5.2. Feature selection.

In this research, feature selection was executed through a combination of RF-RFE and the Least Absolute Shrinkage and Selection Operator (LASSO). LASSO was implemented utilizing the glmnet package in R. To ensure robust model performance, the optimal regularization parameter, denoted as λ, was determined through a rigorous process of 10-fold cross-validation. In this approach, the entire dataset was randomly divided into ten distinct subsets. Each subset was utilized once as a validation set, while the remaining subsets collectively served as the training set. This design allowed for a comprehensive evaluation of various λ values during the training phase, with model performance subsequently assessed on the corresponding validation set. Ultimately, the λ that demonstrated optimal performance across all folds was selected as the final regularization parameter for the LASSO model. Simultaneously, RF-RFE was applied using the caret package in R. This technique involved a systematic approach to eliminating features, governed by specific parameters controlling the recursive elimination process. Accuracy served as the primary performance metric for this method. The iterations of feature elimination continued until either a predefined number of features was attained or optimal performance was reached. This dual approach of utilizing both LASSO and RF-RFE provided a comprehensive framework for effective feature selection, ensuring that the most pertinent features were identified for enhancing model performance.

2.5.3. Model development and evaluation.

Six machine learning algorithms were developed to predict EPL, including logistic regression (LR), random forest (RF), extreme gradient boosting (XGBoost), k-nearest neighbors (KNN), multilayer perceptron (MLP), and support vector machine (SVM). Patients were randomly assigned to the training set, and 5-fold cross-validation was performed for internal validation. Model parameters were optimized through a grid search approach, with the best configuration determined after multiple iterations. Diagnostic accuracy and discriminative power were evaluated by means of ROC analysis. Evaluation metrics for each model included accuracy, precision, sensitivity, and specificity. AUC was utilized as the primary metric due to its threshold-independent nature. Conversely, threshold-dependent metrics, such as accuracy and sensitivity, were reported based on the default settings.

2.5.4. Sensitivity analysis.

To evaluate the robustness of the predictive model across different populations, patients with biochemical pregnancies were excluded, and the optimal model was refitted to assess whether its predictive performance remained consistent.

3. Results

3.1. Baseline characteristics

All 421 women were enrolled in this study and underwent alternate-day measurements of serum β-hCG, P, and E2 during the first trimester (Table 1). Among them, 263 had ongoing pregnancies at 12 weeks, while 158 experienced EPL. Significant differences were observed in terms of age, first and second log(β-hCG) values, β-hCG ratio, first and second P levels, second E2 level, interday E2 level difference, and number of pregnancy losses (all P < 0.05). No significant differences were found for other characteristics. S1 Table in S1 File presents the baseline characteristics after excluding biochemical pregnancies. Among the 158 EPLs, 43 were classified as biochemical pregnancies and 115 as clinical pregnancy losses. Significant differences were observed in the first and second log(β-hCG) values between biochemical pregnancy and clinical pregnancy loss (S2 Table in S1 File).

Download:

Table 1. Baseline characteristics.

https://doi.org/10.1371/journal.pone.0348114.t001

3.2. Correlation analysis

Pearson correlation analysis was employed to examine the associations between variables. Strong correlations were observed between the repeated measurements for β-hCG (r = 0.98), P (r = 0.79), and E2 (r = 0.89) (Fig 2).

Download:

Fig 2. Pearson correlation coefficient.

https://doi.org/10.1371/journal.pone.0348114.g002

3.3. Feature selection

Fig 3A illustrates the process of variable simplification and coefficient adjustment in the LASSO regression model. It highlights the importance ranking of the variables included in the model, with each distinct colored line corresponding to a particular variable. As the penalty parameter is increased, the coefficients associated with less significant variables tend to diminish quickly, nearing zero. In contrast, the coefficients of more crucial variables exhibit minimal changes, enabling these key variables to persist throughout the modeling process until its conclusion. Meanwhile, Fig 3B presents the relationship between the mean squared error and the logarithm of the penalty parameter (log(λ)). This trend serves as a foundational element for determining the optimal model selection, guiding researchers in effectively balancing model complexity with predictive accuracy. The LASSO algorithm supported feature selection by regularizing the model, thereby limiting its complexity during the fitting process. Cross-validation was performed within a specified λ range, yielding two key parameters: lambda_min and lambda_lse. Lambda_min corresponded to the λ value that minimized the cross-validation error, while lambda_lse represented the largest λ within one standard error above the minimum cross-validation error. Lambda_lse was chosen to prevent overfitting and enhance model robustness. Similarly, the RF-RFE method performed feature selection by utilizing an RF-based strategy, iteratively training the model to assess the importance of each feature and recursively eliminating irrelevant features. The model achieved its highest accuracy when including 11 variables (Fig 3C). We ultimately selected seven common features from both the LASSO and RF-RFE algorithms to construct the predictive model, including the second β-hCG value, β-hCG ratio, age, interday E2 level difference, first P value, number of pregnancy losses, and BMI (Fig 3D).

Download:

Fig 3. Feature screening.

(A) Regression coefficient variation curve based on LASSO. (B) The optimal λ process was obtained through iterative analysis using the 10-fold cross-validation method based on LASSO. (C) Feature variable screening based on RF-RFE. (D) LASSO combination RF-RFE. LASSO: Least Absolute Contraction and Selection Operator; RF: Random Forest; RFE: Recursive Feature Elimination.

https://doi.org/10.1371/journal.pone.0348114.g003

3.4. Model construction and performance comparison

Based on the selected features, we developed an LR model and five machine learning models. The ROC curves for all six models are presented in Fig 4. Among them, the RF model demonstrated the best discriminatory performance, with an AUC of 0.784, followed by the MLP (AUC = 0.755), LR (AUC = 0.750), XGBoost (AUC = 0.750), SVM (AUC = 0.749), and KNN (AUC = 0.706) (Table 2). The RF model ranked first in terms of accuracy and precision, while ranking third in sensitivity and second in specificity. Overall, the RF model exhibited consistently strong performance across all metrics.

Download:

Table 2. Performance parameters of the six prediction models.

https://doi.org/10.1371/journal.pone.0348114.t002

Download:

Fig 4. The mean ROC curves of the six models.

AUC: area under the curve; LR: Logistic regression; RF: Random forest; XGBoost: eXtreme Gradient Boosting; KNN: k-nearest neighbors; SVM: Support vector machine; MLP: Multilayer perceptron.

https://doi.org/10.1371/journal.pone.0348114.g004

3.5. Sensitivity analysis

To assess the robustness of the findings, we excluded patients with biochemical pregnancies (n = 43) and refitted the RF model using the remaining cohort (n = 378). The model’s predictive performance slightly declined, with an AUC of 0.770 (S3 Table in S1 File), which may be attributed to the reduced sample size. Nevertheless, the model remained effective after excluding biochemical pregnancies, indicating overall robustness despite the minor decrease in performance.

4. Discussion

This study employed machine learning algorithms to integrate serum biomarkers (β-HCG, P, and E2) and their dynamic characteristics for constructing predictive models of early pregnancy outcomes. All models exhibited favorable predictive performance, with the RF model showing the optimal predictive capability. The present study is expected to provide more reliable and objective auxiliary tools for the clinical prediction of early pregnancy outcomes. The sample size can be further expanded, and more dimensions of features can be included to optimize the model’s generalization ability in the future.

Compared with the EPL group, the ongoing pregnancy group exhibited significantly higher levels of log(β-hCG) values, β-hCG ratio, P levels, second E2 level, and interday E2 level difference. Notably, even after excluding biochemical pregnancies, the ongoing pregnancy group still demonstrated elevated P levels, second E2, and interday E2 level difference relative to the EPL group. These results underscore the crucial role of hormonal levels and their dynamic fluctuations in maintaining early pregnancy. It is well-established that low levels or unfavorable trends in β-hCG, P, and E2 are widely recognized as potential indicators of adverse pregnancy outcomes. For instance, Li et al. reported that low levels and increased rates of E2 and β-hCG were associated with adverse pregnancy outcomes [11]. Deng et al. evaluated the predictive value of serum E2, P, and β-hCG before 9 weeks of gestation for predicting pregnancy loss within 12 weeks [8]. Their study identified that specific combinations of low hormone levels at different gestation ages (e.g., low E2 + P or E2 alone at 7–9 weeks, or low β-hCG or P + E2 at 5–6 weeks) could more effectively predict EPL. Additionally, Su et al. investigated the utility of weekly measurements of hCG, P, and E2 in assessing the viability of pregnancy in patients with unexplained recurrent pregnancy loss [12]. Their results similarly supported that repeated hCG and E2 measurements contribute to pregnancy risk assessment in this population. In conclusion, the values of hormone levels and their dynamic changes have significant predictive value for the outcome of early pregnancy.

Combined LASSO regression with RF-RFE, we identified seven core predictive variables: second β-hCG, β-hCG ratio, age, interday E2 level difference, first P value, number of pregnancy losses, and BMI. In line with prior investigation, dynamic changes in β-hCG demonstrated greater predictive value for pregnancy outcomes compared to a single measurement [13]. The predictive significance of age, number of previous pregnancy losses, and BMI—established risk factors for pregnancy loss—was corroborated by our findings [14]. Furthermore, P and E2 are essential hormones for early gestational maintenance [15], and their fluctuations may reflect dynamic alterations related to embryonic development or placental function. Pearson correlation analyses, performed across the entire cohort, revealed high correlations between repeated measurements of the same hormone, confirming both assay stability and data integrity. Crucially, while single measurements were highly correlated, they were not perfectly concordant. The differences between these serial measurements constitute the core of dynamic indicators. Machine learning models, particularly RF, which excels at capturing complex nonlinear relationships, effectively leveraged the information embedded within these dynamic changes to enhance predictive performance.

We assessed the efficacy of six machine learning models for predicting the risk of EPL. All models demonstrated good predictive capability, meeting the threshold criterion for high accuracy proposed by Luo et al. [16]. Among the models, the RF algorithm exhibited the optimal predictive performance, achieving the highest values for AUC, accuracy, and precision. Even after excluding biochemical pregnancies, it showed only a modest decline while maintaining high discriminatory power. This result aligns with multiple studies confirming the advantage of machine learning models in predicting pregnancy outcomes [17–19]. Specifically regarding the application of RF, research by Yehuala et al. demonstrated its superior predictive capability in estimating risk factors for pregnancy losses [20]. Our study further corroborates the effectiveness of RF in this context. It is worth noting, however, that some studies have reported other machine learning models achieving superior performance [21], indicating that the selection of the model still requires further exploration in combination with specific clinical circumstances.

The machine learning model developed in this study is intended to function as a decision-support tool in early pregnancy and reproductive medicine clinics. To facilitate practical implementation, the model requires only seven standardized inputs readily available during routine care: maternal age, BMI, number of previous pregnancy losses, and the static and dynamic values of serum β-hCG, P, and E2 (based on a 48-hour interval). At the point of care, these parameters can be entered into a simplified digital template to generate an immediate risk probability. We propose a risk-stratified management protocol based on these predictions: Low-risk patients continue with standard prenatal care; Moderate-risk patients receive intensified surveillance, including more frequent hormonal and ultrasound monitoring and personalized medical support; and High-risk patients are prioritized for specialist-led interventions, symptom-specific management for threatened miscarriage, and proactive psychological counseling. By utilizing only routine clinical data without requiring additional specialized testing, this model offers a scalable and cost-effective solution for individualized early pregnancy management.

Several limitations should be acknowledged. First, as a retrospective single-center investigation, the generalizability of the findings is limited, and the risk of inherent selection bias cannot be excluded. Prospective, multicentre studies with larger sample sizes are needed for external validation. Furthermore, the model was evaluated solely by internal cross-validation, and its performance in the external population remains to be confirmed. Optimizing thresholds based on specific clinical needs (such as high-sensitivity screening) is a crucial step for clinical translation, future applications can tailor the decision threshold based on specific clinical scenarios and further evaluate model performance under pre-set sensitivity requirements. Finally, not all potentially relevant predictors—such as maternal medical history or ultrasonographic characteristics—were incorporated, and these factors may further influence pregnancy outcomes. In future prospective, multi-center studies, we intend to systematically collect these variables to optimize the model’s comprehensiveness and generalizability.

In summary, we developed machine learning models that incorporate both static and dynamic profiles of serum biomarkers (β-hCG, P, and E2) for the prediction of early pregnancy outcomes. Among the evaluated algorithms, the RF achieved the highest predictive performance, providing promising evidence for the clinical utility of machine learning in EPL risk stratification. Despite the inherent constraints of a retrospective design, our findings underscore the value of integrating dynamic biomarkers with an appropriate machine learning algorithm. Future multicentre prospective studies and iterative model refinement will be essential to translate these results into routine clinical practice and ultimately enhance first-trimester pregnancy management.

Supporting information

S1 File. S1 Table. Baseline characteristics of patients excluded from biochemical pregnancy.

S2 Table. Baseline characteristics of patients with biochemical pregnancy and clinical pregnancy loss. S3 Table. Performance parameters of the RF prediction model excluding patients with biochemical pregnancy.

https://doi.org/10.1371/journal.pone.0348114.s001

(DOCX)

References

1. ACOG practice bulletin no. 200: early pregnancy loss. Obstet Gynecol. 2018;132(5):e197–207. pmid:30157093
- View Article
- PubMed/NCBI
- Google Scholar
2. Mehra VM, Farooqi S, Sriram P, Tunde-Byass M. Diagnosis and management of early pregnancy loss. CMAJ. 2024;196(34):E1162–8. pmid:39406415
- View Article
- PubMed/NCBI
- Google Scholar
3. Chu J, Hardy P, Beeson L, Coomarasamy A. What is the best method for managing early miscarriage? BMJ. 2020;368:l6438. pmid:31959625
- View Article
- PubMed/NCBI
- Google Scholar
4. Li H, Qin S, Xiao F, Li Y, Gao Y, Zhang J, et al. Predicting first-trimester outcome of embryos with cardiac activity in women with recurrent spontaneous abortion. J Int Med Res. 2020;48(6). pmid:32527173
- View Article
- PubMed/NCBI
- Google Scholar
5. Hamza A, Meyberg-Solomayer G, Juhasz-Böss I, Joukhadar R, Takacs Z, Solomayer E-F, et al. Diagnostic methods of ectopic pregnancy and early pregnancy loss: a review of the literature. Geburtshilfe Frauenheilkd. 2016;76(4):377–82. pmid:27134292
- View Article
- PubMed/NCBI
- Google Scholar
6. Vitzthum VJ, Spielvogel H, Thornburg J, West B. A prospective study of early pregnancy loss in humans. Fertil Steril. 2006;86(2):373–9. pmid:16806213
- View Article
- PubMed/NCBI
- Google Scholar
7. Ucyigit A, Fuller JL, Poon LC, Johns J, Ross JA. The significance of low first trimester serum progesterone in ongoing early pregnancies presenting as pregnancies of unknown location. Eur J Obstet Gynecol Reprod Biol. 2021;258:294–8. pmid:33498002
- View Article
- PubMed/NCBI
- Google Scholar
8. Deng W, Sun R, Du J, Wu X, Ma L, Wang M, et al. Prediction of miscarriage in first trimester by serum estradiol, progesterone and β-human chorionic gonadotropin within 9 weeks of gestation. BMC Pregnancy Childbirth. 2022;22(1):112. pmid:35144584
- View Article
- PubMed/NCBI
- Google Scholar
9. Bobdiwala S, Kyriacou C, Christodoulou E, Farren J, Mitchell-Jones N, Al-Memar M, et al. Evaluating cut-off levels for progesterone, β human chorionic gonadotropin and β human chorionic gonadotropin ratio to exclude pregnancy viability in women with a pregnancy of unknown location: a prospective multicenter cohort study. Acta Obstet Gynecol Scand. 2022;101(1):46–55. pmid:34817062
- View Article
- PubMed/NCBI
- Google Scholar
10. Pillai RN, Konje JC, Tincello DG, Potdar N. Role of serum biomarkers in the prediction of outcome in women with threatened miscarriage: a systematic review and diagnostic accuracy meta-analysis. Hum Reprod Update. 2016;22(2):228–39. pmid:26663220
- View Article
- PubMed/NCBI
- Google Scholar
11. Li Y, Zhang J, Zhang K, Wang E, Shu J. Significance of dynamically monitoring serum estrogen and β-human chorionic gonadotropin in early pregnancy assessment. J Clin Lab Anal. 2021;35(1):e23559. pmid:32892443
- View Article
- PubMed/NCBI
- Google Scholar
12. Su R, Wang Y, Lu Y, Lin B, An J. Weekly changes in serum β-human chorionic gonadotropin, estradiol, and progesterone levels for pregnancy assessment in women with unexplained recurrent miscarriage. J Int Med Res. 2025;53(4). pmid:40302658
- View Article
- PubMed/NCBI
- Google Scholar
13. van Mello NM, Mol F, Opmeer BC, Ankum WM, Barnhart K, Coomarasamy A, et al. Diagnostic value of serum hCG on the outcome of pregnancy of unknown location: a systematic review and meta-analysis. Hum Reprod Update. 2012;18(6):603–17. pmid:22956411
- View Article
- PubMed/NCBI
- Google Scholar
14. Quenby S, Gallos ID, Dhillon-Smith RK, Podesek M, Stephenson MD, Fisher J, et al. Miscarriage matters: the epidemiological, physical, psychological, and economic costs of early pregnancy loss. Lancet. 2021;397(10285):1658–67. pmid:33915094
- View Article
- PubMed/NCBI
- Google Scholar
15. Jin C, Peng Y, Luo X, Zhu Q, Lin L, Yang J, et al. Dynamic change of estrogen and progesterone metabolites in human urine during pregnancy. Sci Rep. 2025;15(1):8409. pmid:40069295
- View Article
- PubMed/NCBI
- Google Scholar
16. Luo X, Lin F, Zhu S, Yu M, Zhang Z, Meng L, et al. Mine landslide susceptibility assessment using IVM, ANN and SVM models considering the contribution of affecting factors. PLoS One. 2019;14(4):e0215134. pmid:30973936
- View Article
- PubMed/NCBI
- Google Scholar
17. Qi S, Zheng S, Lu M, Chen A, Chen Y, Fu X. Building a machine learning-based risk prediction model for second-trimester miscarriage. BMC Pregnancy Childbirth. 2024;24(1):738. pmid:39522013
- View Article
- PubMed/NCBI
- Google Scholar
18. Zhang R, Guo Y, Zhai X, Wang J, Hao X, Yang L, et al. Machine learning algorithm based on combined clinical indicators for the prediction of infertility and pregnancy loss. Front Endocrinol (Lausanne). 2025;16:1544724. pmid:40756514
- View Article
- PubMed/NCBI
- Google Scholar
19. Ozer G, Akca A, Yuksel B, Duzguner I, Pehlivanli AC, Kahraman S. Prediction of risk factors for first trimester pregnancy loss in frozen-thawed good-quality embryo transfer cycles using machine learning algorithms. J Assist Reprod Genet. 2023;40(2):279–88. pmid:36399255
- View Article
- PubMed/NCBI
- Google Scholar
20. Yehuala TZ, Mengesha SB, Baykemagn ND. Predicting pregnancy loss and its determinants among reproductive-aged women using supervised machine learning algorithms in Sub-Saharan Africa. Front Glob Womens Health. 2025;6:1456238. pmid:39995747
- View Article
- PubMed/NCBI
- Google Scholar
21. Kubahoniyesu T, Kabano IH. Predicting adverse pregnancy outcome in Rwanda using machine learning techniques. PLoS One. 2024;19(12):e0312447. pmid:39637200
- View Article
- PubMed/NCBI
- Google Scholar

[ref1] 1. ACOG practice bulletin no. 200: early pregnancy loss. Obstet Gynecol. 2018;132(5):e197–207. pmid:30157093
View Article
PubMed/NCBI
Google Scholar

[2] View Article

[3] PubMed/NCBI

[4] Google Scholar

[ref2] 2. Mehra VM, Farooqi S, Sriram P, Tunde-Byass M. Diagnosis and management of early pregnancy loss. CMAJ. 2024;196(34):E1162–8. pmid:39406415
View Article
PubMed/NCBI
Google Scholar

[6] View Article

[7] PubMed/NCBI

[8] Google Scholar

[ref3] 3. Chu J, Hardy P, Beeson L, Coomarasamy A. What is the best method for managing early miscarriage? BMJ. 2020;368:l6438. pmid:31959625
View Article
PubMed/NCBI
Google Scholar

[10] View Article

[11] PubMed/NCBI

[12] Google Scholar

[ref4] 4. Li H, Qin S, Xiao F, Li Y, Gao Y, Zhang J, et al. Predicting first-trimester outcome of embryos with cardiac activity in women with recurrent spontaneous abortion. J Int Med Res. 2020;48(6). pmid:32527173
View Article
PubMed/NCBI
Google Scholar

[14] View Article

[15] PubMed/NCBI

[16] Google Scholar

[ref5] 5. Hamza A, Meyberg-Solomayer G, Juhasz-Böss I, Joukhadar R, Takacs Z, Solomayer E-F, et al. Diagnostic methods of ectopic pregnancy and early pregnancy loss: a review of the literature. Geburtshilfe Frauenheilkd. 2016;76(4):377–82. pmid:27134292
View Article
PubMed/NCBI
Google Scholar

[18] View Article

[19] PubMed/NCBI

[20] Google Scholar

[ref6] 6. Vitzthum VJ, Spielvogel H, Thornburg J, West B. A prospective study of early pregnancy loss in humans. Fertil Steril. 2006;86(2):373–9. pmid:16806213
View Article
PubMed/NCBI
Google Scholar

[22] View Article

[23] PubMed/NCBI

[24] Google Scholar

[ref7] 7. Ucyigit A, Fuller JL, Poon LC, Johns J, Ross JA. The significance of low first trimester serum progesterone in ongoing early pregnancies presenting as pregnancies of unknown location. Eur J Obstet Gynecol Reprod Biol. 2021;258:294–8. pmid:33498002
View Article
PubMed/NCBI
Google Scholar

[26] View Article

[27] PubMed/NCBI

[28] Google Scholar

[ref8] 8. Deng W, Sun R, Du J, Wu X, Ma L, Wang M, et al. Prediction of miscarriage in first trimester by serum estradiol, progesterone and β-human chorionic gonadotropin within 9 weeks of gestation. BMC Pregnancy Childbirth. 2022;22(1):112. pmid:35144584
View Article
PubMed/NCBI
Google Scholar

[30] View Article

[31] PubMed/NCBI

[32] Google Scholar

[ref9] 9. Bobdiwala S, Kyriacou C, Christodoulou E, Farren J, Mitchell-Jones N, Al-Memar M, et al. Evaluating cut-off levels for progesterone, β human chorionic gonadotropin and β human chorionic gonadotropin ratio to exclude pregnancy viability in women with a pregnancy of unknown location: a prospective multicenter cohort study. Acta Obstet Gynecol Scand. 2022;101(1):46–55. pmid:34817062
View Article
PubMed/NCBI
Google Scholar

[34] View Article

[35] PubMed/NCBI

[36] Google Scholar

[ref10] 10. Pillai RN, Konje JC, Tincello DG, Potdar N. Role of serum biomarkers in the prediction of outcome in women with threatened miscarriage: a systematic review and diagnostic accuracy meta-analysis. Hum Reprod Update. 2016;22(2):228–39. pmid:26663220
View Article
PubMed/NCBI
Google Scholar

[38] View Article

[39] PubMed/NCBI

[40] Google Scholar

[ref11] 11. Li Y, Zhang J, Zhang K, Wang E, Shu J. Significance of dynamically monitoring serum estrogen and β-human chorionic gonadotropin in early pregnancy assessment. J Clin Lab Anal. 2021;35(1):e23559. pmid:32892443
View Article
PubMed/NCBI
Google Scholar

[42] View Article

[43] PubMed/NCBI

[44] Google Scholar

[ref12] 12. Su R, Wang Y, Lu Y, Lin B, An J. Weekly changes in serum β-human chorionic gonadotropin, estradiol, and progesterone levels for pregnancy assessment in women with unexplained recurrent miscarriage. J Int Med Res. 2025;53(4). pmid:40302658
View Article
PubMed/NCBI
Google Scholar

[46] View Article

[47] PubMed/NCBI

[48] Google Scholar

[ref13] 13. van Mello NM, Mol F, Opmeer BC, Ankum WM, Barnhart K, Coomarasamy A, et al. Diagnostic value of serum hCG on the outcome of pregnancy of unknown location: a systematic review and meta-analysis. Hum Reprod Update. 2012;18(6):603–17. pmid:22956411
View Article
PubMed/NCBI
Google Scholar

[50] View Article

[51] PubMed/NCBI

[52] Google Scholar

[ref14] 14. Quenby S, Gallos ID, Dhillon-Smith RK, Podesek M, Stephenson MD, Fisher J, et al. Miscarriage matters: the epidemiological, physical, psychological, and economic costs of early pregnancy loss. Lancet. 2021;397(10285):1658–67. pmid:33915094
View Article
PubMed/NCBI
Google Scholar

[54] View Article

[55] PubMed/NCBI

[56] Google Scholar

[ref15] 15. Jin C, Peng Y, Luo X, Zhu Q, Lin L, Yang J, et al. Dynamic change of estrogen and progesterone metabolites in human urine during pregnancy. Sci Rep. 2025;15(1):8409. pmid:40069295
View Article
PubMed/NCBI
Google Scholar

[58] View Article

[59] PubMed/NCBI

[60] Google Scholar

[ref16] 16. Luo X, Lin F, Zhu S, Yu M, Zhang Z, Meng L, et al. Mine landslide susceptibility assessment using IVM, ANN and SVM models considering the contribution of affecting factors. PLoS One. 2019;14(4):e0215134. pmid:30973936
View Article
PubMed/NCBI
Google Scholar

[62] View Article

[63] PubMed/NCBI

[64] Google Scholar

[ref17] 17. Qi S, Zheng S, Lu M, Chen A, Chen Y, Fu X. Building a machine learning-based risk prediction model for second-trimester miscarriage. BMC Pregnancy Childbirth. 2024;24(1):738. pmid:39522013
View Article
PubMed/NCBI
Google Scholar

[66] View Article

[67] PubMed/NCBI

[68] Google Scholar

[ref18] 18. Zhang R, Guo Y, Zhai X, Wang J, Hao X, Yang L, et al. Machine learning algorithm based on combined clinical indicators for the prediction of infertility and pregnancy loss. Front Endocrinol (Lausanne). 2025;16:1544724. pmid:40756514
View Article
PubMed/NCBI
Google Scholar

[70] View Article

[71] PubMed/NCBI

[72] Google Scholar

[ref19] 19. Ozer G, Akca A, Yuksel B, Duzguner I, Pehlivanli AC, Kahraman S. Prediction of risk factors for first trimester pregnancy loss in frozen-thawed good-quality embryo transfer cycles using machine learning algorithms. J Assist Reprod Genet. 2023;40(2):279–88. pmid:36399255
View Article
PubMed/NCBI
Google Scholar

[74] View Article

[75] PubMed/NCBI

[76] Google Scholar

[ref20] 20. Yehuala TZ, Mengesha SB, Baykemagn ND. Predicting pregnancy loss and its determinants among reproductive-aged women using supervised machine learning algorithms in Sub-Saharan Africa. Front Glob Womens Health. 2025;6:1456238. pmid:39995747
View Article
PubMed/NCBI
Google Scholar

[78] View Article

[79] PubMed/NCBI

[80] Google Scholar

[ref21] 21. Kubahoniyesu T, Kabano IH. Predicting adverse pregnancy outcome in Rwanda using machine learning techniques. PLoS One. 2024;19(12):e0312447. pmid:39637200
View Article
PubMed/NCBI
Google Scholar

[82] View Article

[83] PubMed/NCBI

[84] Google Scholar

Figures

Abstract

Objective

Methods

Results

Conclusion

1. Introduction

2. Materials and methods

2.1. Participants

2.2. Data collection

2.3. Main outcome

2.4. Serum β-hCG, P, and E2 measurements

2.5. Statistical analysis

2.5.1. Data analysis.

2.5.2. Feature selection.

2.5.3. Model development and evaluation.

2.5.4. Sensitivity analysis.

3. Results

3.1. Baseline characteristics

3.2. Correlation analysis

3.3. Feature selection

3.4. Model construction and performance comparison

3.5. Sensitivity analysis

4. Discussion

Supporting information

S1 File. S1 Table. Baseline characteristics of patients excluded from biochemical pregnancy.

References