SillyPutty: Improved clustering by optimizing the silhouette width

doi:10.1371/journal.pone.0300358

Fig 1.

SillyPutty algorithm.

Fig 2.

UMAP plots of four examples of simulated data sets using different parameters.

(A) 3 clusters, 600 samples, 5000 features, low noise. (B) 6 clusters, 1000 samples, 5000 features, low noise. (C) 6 clusters, 600 samples, 10000 features, medium noise. (D) 12 clusters, 1000 samples, 10000 features, high noise.

Table 1.

List of cancer genomics datasets generated by the Umpire package.

Table 2.

Average running time of algorithms across all simulations.

Fig 3.

Evaluation of algorithms.

Values were averaged over 19 replicate simulated data sets. Distributions of the averaged values over 27 sets of simulation parameters are displayed in ‘bean plots’ for each of the 13 methods (plus true clusters). (A) Adjusted Rand Index. (B) Mean silhouette width. (C) Normalized within-group sum of squares. (D) Entropy.
