Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

< Back to Article

Table 1.

Parameters for true data generating processes (DGP) and outcome generating models (OGM).

In all scenarios, the true vector of coefficients is equal to and the error distribution is set to εN(0, 0.32). 0p denotes the p-dimensional vector of zeros.

More »

Table 1 Expand

Table 2.

Deviations from true DGP and OGM for parametric and Plasmode simulation.

More »

Table 2 Expand

Fig 1.

Relative error in MSE estimation for individual coefficients for different types of Plasmode simulation compared to parametric simulation under assumption of true DGP and OGM.

More »

Fig 1 Expand

Fig 2.

Absolute value of the relative error in MSE estimation averaged over individual coefficients, for different types of Plasmode simulation compared to parametric simulation under the assumption of the true DGP and OGM, for p = 2, n = 100, β = (1, 1, 1)T, σ = 0.3, Cor(Xi, Xj) = 0.2 ∀ij.

More »

Fig 2 Expand

Fig 3.

Absolute value of relative error in MSE estimation for individual coefficients when the assumed feature distribution in parametric simulation deviates from the true distribution, for p = 2, n = 100, β = (1, 1, 1)T, σ = 0.3, Cor(Xi, Xj) = 0.2 ∀ij.

More »

Fig 3 Expand

Fig 4.

Absolute value of relative error in the MSE estimation averaged over individual coefficients for different types of Plasmode simulation compared to parametric simulation, under the assumption of the true data generating process and outcome generating model, for p = 50, n = 100, β = 151, σ = 0.3, Cor(Xi, Xj) = 0.2 ∀ij.

More »

Fig 4 Expand

Fig 5.

Relative error in MSE estimation for individual coefficients when the assumed mean of the marginal distribution of the second feature in parametric simulation deviates from the true mean, for p = 2, n = 100, β = (1, 1, 1)T, σ = 0.3, Cor(Xi, Xj) = 0.2 ∀ij.

N(0,1), N(μ,1) denotes that the first feature is generated from a standard normal (truth), and the second feature is generated from a normal distribution with mean μ instead (deviation).

More »

Fig 5 Expand

Fig 6.

Relative error in MSE estimation for individual coefficients when the assumed variance of the marginal distribution of the second feature in parametric simulation deviates from the true variance, for p = 2, n = 100, β = (1, 1, 1)T, σ = 0.3, Cor(Xi, Xj) = 0.2 ∀ij.

N(0,1), N(0,σ2) denotes that the first feature is generated from a standard normal (truth), and the second feature is generated from a normal distribution with variance σ2 instead (deviation).

More »

Fig 6 Expand

Fig 7.

Relative error in MSE estimation for individual coefficients when the assumed correlation of the features in parametric simulation deviates from true correlation, for p = 2, n = 100, β = (1, 1, 1)T, σ = 0.3, Cor(Xi, Xj) = 0.5 ∀ij.

More »

Fig 7 Expand

Fig 8.

Relative error in MSE estimation for individual coefficients when the assumed correlation of the features in parametric simulation deviates from true correlation, for p = 2, n = 100, β = (1, 1, 1)T, σ = 0.3, Cor(Xi, Xj) = 0.2 ∀ij.

More »

Fig 8 Expand

Fig 9.

Relative error in MSE estimation for individual coefficients when the assumed correlation of the features in parametric simulation deviates from true correlation, for p = 2, n = 100, β = (1, 1, 1)T, σ = 0.3, Cor(Xi, Xj) = 0.2|ij| for ith and jth feature within each of the 5 blocks.

More »

Fig 9 Expand

Fig 10.

Relative error in MSE estimation for individual coefficients when the assumed marginal distribution of the second feature in parametric simulation is misspecified as Gaussian mixture with increasing proportion of data drawn from Gaussian with different expectations (bimodal distribution).

The mean and the variance of the marginal normal distribution of the first feature are set to match those of the second. The mixing proportion is given on the x-axis.

More »

Fig 10 Expand

Fig 11.

Relative error in MSE estimation for individual coefficients when the assumed marginal distribution of the second feature in parametric simulation is misspecified as Gaussian mixture with increasing proportion of data drawn from Gaussian with different variance (contaminated distribution), for p = 2, n = 100, β = (1, 1, 1)T, σ = 0.3, Cor(Xi, Xj) = 0.2 ∀ij.

The mean and the variance of the marginal normal distribution of the first feature are set to match those of the second. The mixing proportion is given on the x-axis.

More »

Fig 11 Expand

Fig 12.

Relative error in MSE estimation for individual coefficients when the assumed marginal distribution of the second feature in parametric simulation is misspecified as log-normal, for p = 2, n = 100, β = (1, 1, 1)T, σ = 0.3, Cor(Xi, Xj) = 0.2 ∀ij.

The mean and the variance of the marginal normal distribution of the first feature are set to match those of the second.

More »

Fig 12 Expand

Fig 13.

Relative error in MSE estimation for individual coefficients when the assumed marginal distribution of the second feature in parametric simulation is misspecified as Bernoulli with different success probabilities, for p = 2, n = 100, β = (1, 1, 1)T, σ = 0.3, Cor(Xi, Xj) = 0.2 ∀ij.

More »

Fig 13 Expand

Fig 14.

Absolute value of relative error in MSE estimation averaged over individual coefficients when the assumed coefficients in parametric and Plasmode simulation are misspecified, for p = 50, n = 100, β = 151, σ = 0.3, Cor(Xi, Xj) = 0.2 ∀ij, βI = (0, 0.02, …, 1)T, βII = 0.0551, βIII = 1051, βIV = 051.

Large outliers for n out of n Bootstrap are not displayed.

More »

Fig 14 Expand

Fig 15.

Absolute value of relative error in MSE estimation averaged over individual coefficients when the assumed error variance in parametric and Plasmode simulation are misspecified for p = 50, n = 100, β = 151, σ = 0.3, Cor(Xi, Xj) = 0.2 ∀ij.

Large outliers for n out of n Bootstrap are not displayed.

More »

Fig 15 Expand

Fig 16.

Absolute value of relative error in MSE estimation averaged over individual coefficients when the assumed error distributions in parametric and Plasmode simulation are misspecified, for p = 50, n = 100, β = 151, σ = 0.3, Cor(Xi, Xj) = 0.2 ∀ij.

Large outliers for n out of n Bootstrap are not displayed.

More »

Fig 16 Expand

Fig 17.

Absolute value of relative error in MSE estimation for individual coefficients when the assumed feature correlation matrix in parametric simulation is misspecified.

True correlation matrix is estimated from the benchmark dataset quake (p = 3, n = 100, β = 14, σ = 0.3).

More »

Fig 17 Expand

Fig 18.

Absolute value of relative error in MSE estimation for individual coefficients when the assumed feature correlation matrix in parametric simulation is misspecified.

True correlation matrix is estimated from benchmark dataset wine_quality (p = 11, n = 100, β = 112, σ = 0.3).

More »

Fig 18 Expand

Fig 19.

Absolute value of relative error in MSE estimation averaged over individual coefficients when the assumed feature correlation matrix in parametric simulation is misspecified.

True correlation matrix is estimated from benchmark dataset Yolanda (p = 100, n = 200, β = 1101, σ = 0.3).

More »

Fig 19 Expand

Fig 20.

Comparison of different resampling types for different numbers of observations resampled from a dataset with 100 observations.

Absolute value of relative error in MSE estimation averaged over individual coefficients when the true model is assumed in parametric and Plasmode simulation, for p = 10, n = 100, β = 111, σ = 0.3, Cor(Xi, Xj) = 0.2 ∀ij.

More »

Fig 20 Expand

Table 3.

Smallest deviations in parametric simulations for which Plasmode simulation is superior to parametric simulation.

p denotes the number of features, n the number of observations. True ρ gives the true correlation structure, scenario type the type of deviation and true value the true parameter value that the deviation refers to.

More »

Table 3 Expand