Clustering Consumers Based on Trust, Confidence and Giving Behaviour: Data-Driven Model Building for Charitable Involvement in the Australian Not-For-Profit Sector

doi:10.1371/journal.pone.0122133

Fig 1.

Process of the MST-kNN method.

Starting with the complete dataset, a distance matrix is computed which forms the basis for a complete graph. A Minimum Spanning Tree is computed within the complete graph. Then, all edges that are not k-Nearest Neighbors are removed resulting in clusters.

More »

Expand

Fig 2.

Results of the clustering method with k = 3.

Seven clusters were found; Cluster 0 to Cluster 6. Clusters are of varying sizes with the largest cluster (Cluster 5) containing 556 respondents and the smallest cluster (Cluster 4) containing 45. Cluster 0 is shown in light yellow, Cluster 1 in green, Cluster 2 in light green, Cluster 3 in blue, Cluster 4 in light orange, Cluster 5 in orange and Cluster 6 in red.

More »

Expand

Table 1.

Bottom features for each cluster (presented in ascending order of score).

More »

Expand

Table 2.

Top features for each cluster (presented in descending order of score).

More »

Expand

Fig 3.

CM1 Scores of Cluster 0.

The selected top and bottom features are shown in red and green respectively. As can be seen, these coloured features form a “shoulder” on either side of the ‘curve’ as they are characteristically higher or lower than the rest of the bars in this bar chart. The selected bottom and top “shoulders” are also presented in Tables 1 and 2.

More »

Expand

Fig 4.

CM1 Scores of Cluster 1.

The selected top and bottom features are shown in red and green respectively. As can be seen, these coloured features form a “shoulder” on either side of the ‘curve’ as they are characteristically higher or lower than the rest of the bars in this bar chart. The selected bottom and top “shoulders” are also presented in Tables 1 and 2.

More »

Expand

Fig 5.

CM1 Scores of Cluster 2.

The selected top and bottom features are shown in red and green respectively. As can be seen, these coloured features form a “shoulder” on either side of the ‘curve’ as they are characteristically higher or lower than the rest of the bars in this bar chart. The selected bottom and top “shoulders” are also presented in Tables 1 and 2.

More »

Expand

Fig 6.

CM1 Scores of Cluster 3.

The selected top and bottom features are shown in red and green respectively. As can be seen, these coloured features form a “shoulder” on either side of the ‘curve’ as they are characteristically higher or lower than the rest of the bars in this bar chart. The selected bottom and top “shoulders” are also presented in Tables 1 and 2.

More »

Expand

Fig 7.

CM1 Scores of Cluster 4.

The selected top and bottom features are shown in red and green respectively. As can be seen, these coloured features form a “shoulder” on either side of the ‘curve’ as they are characteristically higher or lower than the rest of the bars in this bar chart. The selected bottom and top “shoulders” are also presented in Tables 1 and 2.

More »

Expand

Fig 8.

CM1 Scores of Cluster 5.

The selected top and bottom features are shown in red and green respectively. As can be seen, these coloured features form a “shoulder” on either side of the ‘curve’ as they are characteristically higher or lower than the rest of the bars in this bar chart. The selected bottom and top “shoulders” are also presented in Tables 1 and 2.

More »

Expand

Fig 9.

CM1 Scores of Cluster 6.

The selected top and bottom features are shown in red and green respectively. As can be seen, these coloured features form a “shoulder” on either side of the ‘curve’ as they are characteristically higher or lower than the rest of the bars in this bar chart. The selected bottom and top “shoulders” are also presented in Tables 1 and 2.

More »

Expand

Table 3.

Best ‘simple’ logistic models for assessing cluster partitioning.

For each model, the Fitness was guided by the Area Under the Curve value and is shown as well as the best model found by Eureqa.

More »

Expand

Table 4.

Best ‘simple’ logistic models for Involvement Class.

For each cluster, the Fitness was guided by the Area Under the Curve value is shown as well as the best model found by Eureqa.

More »

Expand