Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

< Back to Article

Table 1.

Descriptive statistics of the study population, grouped by age.

Only age, sex, height and weight were ultimately used in the machine learning models as predictive variables. All variables were, however, used for imputing missing data and constructing synthetic datasets.

More »

Table 1 Expand

Fig 1.

Histogram comparison for each variable comparing the aggregate demographic characteristics of the real training dataset (n = 2408) against synthetic dataset A (n = 2408).

More »

Fig 1 Expand

Fig 2.

Histogram comparison for each variable comparing the aggregate demographic characteristics of the real training dataset (n = 2408) against synthetic dataset B (n = 4816).

More »

Fig 2 Expand

Table 2.

Statistical analysis comparing synthetic data tables to the real training dataset (n = 2408).

Presented are propensity score mean-squared-error and standardised ration of propensity score mean-squared error.

More »

Table 2 Expand

Table 3.

Results of the machine learning models, trained on real or synthetic datasets.

Each was tested on the same test dataset (real data). None of the p-values were <0.05.

More »

Table 3 Expand