Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

< Back to Article

Fig 1.

Technical route of DKCDC algorithm.

Bound(P) and Inter(P) denote the set of boundary point and the set of internal point respectively. Clust(P) denotes the set of initial clustering label obtained by CDC. Brid(P) denotes the points on the bridge. denotes the points that are obviously deviated from the cluster. denotes the points on the edge of the cluster. denotes unobvious deviation points at the edge of clusters. denotes interior points at the edge of the clusters.

More »

Fig 1 Expand

Fig 2.

The geometric meaning of bridge points, deviation points and less deviated edge points.

More »

Fig 2 Expand

Fig 3.

The four core steps of the DKCDC algorithm and the silhouette coefficients at different steps.

More »

Fig 3 Expand

Fig 4.

A fusion strategy that combines voting and distance methods.

(a) The r-domain of the boundary point pi, which located on the bridge, does not include any internal points. (b) Boundary point pi that deviates from cluster has its r-domain containing one internal point. (c) In small-scale cluster, the r-domain of the boundary point pi contains most of the boundary points. (d) The r-domain of the boundary point pi, which deviates from the cluster, contains more boundary points, and . (e) The number of boundary points within the r-domain of the boundary point pi, which located on the cluster’s edge, is greater than the number of internal points, and . (f) The number of boundary points within the r-domain of the boundary point pi, which deviates from the cluster, is greater than the number of internal points and there are cross-cluster internal points. (g) The r-domain of the boundary point pi, which deviates from the cluster, contains more internal points, but . (h) The number of boundary points within the r-domain of the boundary point pi, which located on the cluster’s edge, is smaller than the number of internal points, but . (u) The number of boundary points within the r-domain of the boundary point pi, which deviates from the cluster, is smaller than the number of internal points and there are cross-cluster internal points. (v) The result of clustering under fusion strategy.

More »

Fig 4 Expand

Fig 5.

The silhouette coefficient obtained by the DKCDC algorithm under different parameters r.

More »

Fig 5 Expand

Fig 6.

The noise distribution under different r values.

More »

Fig 6 Expand

Fig 7.

The overall flow chart of DKCDC algorithm.

More »

Fig 7 Expand

Table 1.

Comparison of clustering results on artificial datasets (Use the Silhouette Coefficient as measurement index).

More »

Table 1 Expand

Fig 8.

The results of DKCDC, CDC, K-Means, DBSCAN, OPTICS, HDBSCAN algorithms on Rotundity dataset: (a)-(f).

More »

Fig 8 Expand

Fig 9.

The results of DKCDC, CDC, K-Means, DBSCAN, OPTICS, HDBSCAN algorithms on Islands dataset: (a)–(f).

More »

Fig 9 Expand

Fig 10.

The results of DKCDC, CDC, K-Means, DBSCAN, OPTICS, HDBSCAN algorithms on Alphabet dataset: (a)–(f).

More »

Fig 10 Expand

Table 2.

UCI datasets.

More »

Table 2 Expand

Table 3.

Comparison of clustering results on UCI datasets.

More »

Table 3 Expand

Fig 11.

Cluster evaluation index comparison chart of different algorithms on UCI datasets: (a)–(c).

More »

Fig 11 Expand