S3CMTF: Fast, accurate, and scalable method for incomplete coupled matrix-tensor factorization

doi:10.1371/journal.pone.0217316

Table 1.

Comparison of our proposed S³CMTF and the existing CMTF methods.

S³CMTF outperforms all other methods in terms of time, accuracy, scalability, memory usage, and parallelizability.

Fig 1.

Comparison of our proposed S³CMTF and the existing methods.

(a) For a fixed number of nonzeros, S³CMTF takes constant time as dimensionality grows, while existing methods become slower. Our sequential method S³CMTF-opt1 is 930× and 54× faster than CMTF-OPT and CMTF-Tucker-ALS, respectively. (b) S³CMTF-opt20 shows the best convergence rate and accuracy on the real-world Yelp dataset. CMTF-Tucker-ALS shows O.O.M. (out-of-memory error) in both experiments.

Table 2.

Table of symbols.

Fig 2.

The scheme for S³CMTF.

Fig 3.

Example hypergraphs induced by the S³CMTF objective function (Eq (7)).

A matrix Y is coupled to the second mode of the input tensor with a coupled factor matrix V. Each node represents a factor row or the core tensor. Each hyperedge contains the factors involved in one SGD update. (a) Induced hypergraph with the core tensor. Every hyperedge corresponding to a tensor entry includes the core tensor. (b) Induced hypergraph without the core tensor. The graph has a sparse structure, as every node is shared by only a few hyperedges.
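The hypergraph construction described in this caption can be sketched in a few lines. This is a minimal illustration, not the authors' implementation: it assumes a 3-mode tensor, and the node labels `U1`, `U2`, `U3`, `G`, and `V` as well as the function names are hypothetical choices made for this example. Each observed entry becomes one hyperedge containing the factor rows (and optionally the core tensor) that an SGD update for that entry would touch.

```python
from collections import Counter


def induced_hyperedges(tensor_idx, matrix_idx, with_core=True):
    """Return one hyperedge (a frozenset of nodes) per observed entry.

    tensor_idx: iterable of (i, j, k) indices of observed tensor entries.
    matrix_idx: iterable of (j, c) indices of observed entries of the
                matrix coupled to the second mode.
    Node names are illustrative assumptions, not the paper's notation.
    """
    edges = []
    for i, j, k in tensor_idx:
        # A tensor entry touches one factor row per mode.
        edge = {("U1", i), ("U2", j), ("U3", k)}
        if with_core:
            # With the core tensor, every tensor hyperedge shares this node.
            edge.add(("G",))
        edges.append(frozenset(edge))
    for j, c in matrix_idx:
        # A coupled-matrix entry touches a second-mode row and a row of V.
        edges.append(frozenset({("U2", j), ("V", c)}))
    return edges


def node_degrees(edges):
    """Count how many hyperedges each node belongs to."""
    return Counter(node for edge in edges for node in edge)


# Toy data: three observed tensor entries and two observed matrix entries.
tensor_idx = [(0, 0, 0), (1, 2, 1), (2, 1, 0)]
matrix_idx = [(0, 0), (2, 1)]

deg_with_core = node_degrees(induced_hyperedges(tensor_idx, matrix_idx, True))
deg_without_core = node_degrees(induced_hyperedges(tensor_idx, matrix_idx, False))
```

In this toy example the core-tensor node belongs to every tensor hyperedge, so any two tensor updates conflict on it, whereas without the core tensor each node is shared by only a few hyperedges, which is the sparsity that panel (b) illustrates.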

Table 3.

Comparison of time complexity (per iteration) and memory usage of our proposed S³CMTF and other CMTF algorithms.

S³CMTF-opt shows the lowest time complexity and S³CMTF-base shows the lowest memory usage. For simplicity, we assume that all modes are of size I, of rank J, and an I × K matrix is coupled to one mode. P is the number of parallel cores. (* indicates the lowest time or memory).

Table 4.

Summary of the data used for experiments.

‘K’ means thousand, and ‘M’ million. Tensors and matrices of density 1 are fully observed.

Fig 4.

Test RMSE of S³CMTF and other CMTF methods over iterations.

S³CMTF-opt20 shows the best convergence rate and accuracy.

Fig 5.

Comparison with SALS-single on the MovieLens dataset.

We compare two non-coupled versions of S³CMTF, S³CMTF-CP-opt and S³CMTF-TUCKER-opt, with the parallel CP decomposition method SALS-single. In (a), we plot one marker per 20 iterations for clarity. (a) S³CMTF converges faster and to a lower error than SALS does. (b) S³CMTF-CP-opt is 2.3× faster than SALS-single.

Fig 6.

Comparison of scalability.

(a) S³CMTF scales linearly as the number of entries increases. (b) S³CMTF-base and S³CMTF-opt show linear speedup as the number of cores grows. O.O.M.: out-of-memory error.

Fig 7.

(a) Gap statistics on U⁽²⁾ of S³CMTF and the Tucker decomposition for the Yelp dataset. S³CMTF outperforms the naive Tucker decomposition in clustering ability. (b) Visualization of the personal recommendation scenario.

Table 5.

Clustering results on business factor U⁽²⁾ found by S³CMTF.

We found dominant spatial and categorical characteristics in each cluster. Businesses in the same cluster tend to be located in adjacent cities and to belong to similar categories.
