Machine learning algorithm validation with a limited sample size

Fig 7

Classification with discriminable data using K-Fold CV, Nested CV and Train/Test Split validation methods.

A: Comparison of different validation methods. Dash-dot lines show 95% confidence intervals for 50 runs. B: Size of the 95% confidence intervals. The inset plot shows a finer-grained view of the confidence intervals for K-Fold CV in the sample size range of 20 to 200 (inset plot: N = 20, 22, … 198, 200; main plot: N = 20, 40, … 980, 1000).
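As a rough illustration of the quantities plotted in panels A and B, the sketch below builds the two sample-size grids named in the caption and computes a 95% confidence interval and its size from a set of per-run accuracies. The caption does not say how the intervals were calculated, so the normal approximation for the mean used here is an assumption, and the helper names `ci95` and `ci_size` and the synthetic accuracies are hypothetical rather than taken from the paper.

```python
import random
import statistics

# Sample-size grids as stated in the caption.
main_sizes = list(range(20, 1001, 20))  # N = 20, 40, ..., 980, 1000
inset_sizes = list(range(20, 201, 2))   # N = 20, 22, ..., 198, 200


def ci95(values):
    """Return (lower, upper) bounds of a 95% interval for the mean.

    Assumption: normal approximation (mean +/- 1.96 * standard error).
    The caption does not specify the method actually used.
    """
    mean = statistics.mean(values)
    sem = statistics.stdev(values) / len(values) ** 0.5
    half_width = 1.96 * sem
    return mean - half_width, mean + half_width


def ci_size(values):
    """Return the width of the interval from ci95 (the quantity in panel B)."""
    lower, upper = ci95(values)
    return upper - lower


# Hypothetical accuracies from 50 runs at one sample size.
# These are synthetic numbers for illustration, not the paper's results.
rng = random.Random(0)
accuracies = [min(1.0, max(0.0, rng.gauss(0.9, 0.05))) for _ in range(50)]

lower, upper = ci95(accuracies)
print(f"95% CI: [{lower:.3f}, {upper:.3f}], size: {ci_size(accuracies):.3f}")
```

Repeating the last step for every N in `main_sizes` or `inset_sizes`, with accuracies obtained from each validation method, would give one curve per method of the kind the figure compares.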

