Fig 1.
Spark ecosystem [22].
Fig 2.
Table 1.
A sample medical dataset.
Table 2.
Medical dataset in 4-anonymous model.
Table 3.
Medical dataset in 3-diversity model.
Fig 3.
Generalization tree for age and job attributes.
Fig 4.
Schema of hierarchical data clustering in the Spark framework: (a) Reading the dataset from HDFS into worker nodes. (b) Assigning a unique key to all records. (c) The first round of data clustering, forming sub-clusters. (d) The second round of data clustering, forming smaller sub-clusters.
Fig 5.
Main steps of the proposed three-phase computing model.
Fig 6.
Information loss in anonymous poker hand with ḱ = 3 and different values of k and λ.
Table 4.
Classifier evaluation criteria on the poker hand.
Table 5.
Accuracy criterion on the anonymous poker hand for λ = 4, ḱ = 3 and different values of k.
Table 6.
F1-measure criterion on the anonymous poker hand for λ = 4, ḱ = 3 and different values of k.
Fig 7.
Runtime (sec) of the city block clustering on the poker hand dataset for ḱ = 3.