Ten quick tips for protecting health data using de-identification and perturbation of structured datasets
Fig 2
Checking bivariate correlation before and after perturbation.
A correlation matrix covering every pair of variables shows that the bivariate correlations remain broadly similar after perturbation. Red shading indicates positive correlation and blue shading indicates negative correlation. The value in each cell is the correlation coefficient. A: Original dataset. B: Dataset after perturbation of multiple fields.
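A check of this kind can be sketched in a few lines of Python. The example below is a minimal illustration and not the authors' procedure: the data are synthetic, the column names (`age`, `sbp`, `egfr`) are hypothetical, and zero-mean Gaussian noise stands in for whatever perturbation is actually applied. It computes the Pearson coefficient for every pair of fields before and after perturbation and reports the largest change.

```python
import math
import random

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)

def correlation_matrix(columns):
    """Return {(name_i, name_j): r} for every pair of columns."""
    names = list(columns)
    return {(a, b): pearson(columns[a], columns[b]) for a in names for b in names}

def max_correlation_shift(original, perturbed):
    """Largest absolute change in any pairwise correlation coefficient."""
    before = correlation_matrix(original)
    after = correlation_matrix(perturbed)
    return max(abs(before[key] - after[key]) for key in before)

# Synthetic, hypothetical data (assumption: not taken from the article).
rng = random.Random(0)
age = [rng.uniform(20, 80) for _ in range(500)]
sbp = [100 + 0.5 * a + rng.gauss(0, 10) for a in age]   # rises with age
egfr = [120 - 0.8 * a + rng.gauss(0, 10) for a in age]  # falls with age
original = {"age": age, "sbp": sbp, "egfr": egfr}

# Illustrative perturbation (assumption): small zero-mean Gaussian noise per field.
perturbed = {
    "age": [a + rng.gauss(0, 2) for a in age],
    "sbp": [s + rng.gauss(0, 3) for s in sbp],
    "egfr": [e + rng.gauss(0, 3) for e in egfr],
}

shift = max_correlation_shift(original, perturbed)
print(f"largest change in any correlation coefficient: {shift:.3f}")
```

A small maximum shift suggests the perturbation has preserved the pairwise relationships. What counts as acceptably small depends on the intended analysis, so any threshold is a judgment call rather than a fixed rule.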