Volodymyr Kuleshov · Arun Tejasvi Chaganty · Percy Liang. May 11, 2015


1 Tensor Factorization via Matrix Factorization. Volodymyr Kuleshov · Arun Tejasvi Chaganty · Percy Liang. Stanford University. May 11, 2015.

2-4 Introduction: tensor factorization. An application: community detection. [Figure: a friendship graph over individuals a, b, c, d]

5 Introduction: tensor factorization. An application: community detection. [Figure: the same graph with unknown ('?') community memberships for a, b, c, d]

6 Introduction: tensor factorization. An application: community detection (Anandkumar, Ge, Hsu, and S. Kakade 2013). [Figure: a moment tensor built from the graph, written as a factorization into community components]

7 Introduction: tensor factorization. Applications of tensor factorization:
- Community detection (Anandkumar, Ge, Hsu, and S. Kakade 2013)
- Parsing (Cohen, Satta, and Collins 2013)
- Knowledge base completion (Chang et al.; Singh, Rocktäschel, and Riedel 2015)
- Topic modelling (Anandkumar, Foster, et al.)
- Crowdsourcing (Zhang et al.)
- Mixture models (Anandkumar, Ge, Hsu, S. M. Kakade, et al.)
- Bottlenecked models (Chaganty and Liang 2014)
- ...

8-11 Introduction: tensor factorization. What is tensor (CP) factorization?
- Tensor analogue of matrix eigen-decomposition. For a matrix: M = Σ_{i=1}^k π_i u_i u_iᵀ.
- For a third-order tensor: T = Σ_{i=1}^k π_i u_i ⊗ u_i ⊗ u_i + R.
- Goal: given T with noise R, recover the factors u_i.
[Figures: a matrix and a tensor each written as a sum of k rank-one terms; orthogonal vs. non-orthogonal factors]
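To make the setup concrete, here is a minimal NumPy sketch (not from the talk) that builds a synthetic orthogonal rank-k tensor T = Σ_i π_i u_i ⊗ u_i ⊗ u_i plus Gaussian noise; the function name make_orthogonal_tensor and all parameter choices are illustrative assumptions.

```python
import numpy as np

def make_orthogonal_tensor(d=10, k=5, noise=0.05, seed=0):
    """Build T = sum_i pi_i * u_i (x) u_i (x) u_i + noise, with orthonormal u_i."""
    rng = np.random.default_rng(seed)
    # Orthonormal factors: k columns of a random orthogonal basis.
    U, _ = np.linalg.qr(rng.standard_normal((d, k)))
    pi = rng.uniform(1.0, 2.0, size=k)            # positive weights pi_i
    # Sum of k rank-one terms pi_i * u_i (x) u_i (x) u_i.
    T = np.einsum('i,ai,bi,ci->abc', pi, U, U, U)
    T += noise * rng.standard_normal((d, d, d))   # additive noise tensor R
    return T, pi, U

T, pi, U = make_orthogonal_tensor()
print(T.shape)   # (10, 10, 10)
```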

12-14 Introduction: tensor factorization. Existing tensor factorization algorithms:
- Tensor power method (Anandkumar, Ge, Hsu, S. M. Kakade, et al. 2013): analogue of the matrix power method; sensitive to noise; restricted to orthogonal tensors.
- Alternating least squares (Comon, Luciani, and Almeida 2009; Anandkumar, Ge, and Janzamin 2014): sensitive to initialization.
Our approach: reduce to existing fast and robust matrix algorithms.

15 Outline (section: Orthogonal tensor factorization): Introduction: tensor factorization · Orthogonal tensor factorization · Projections · Non-orthogonal tensor factorization · Related work · Empirical results · Conclusions

16-19 Orthogonal tensor factorization. Tensor factorization via a single matrix factorization:
- Example with k = 3: T = π_1 u_1 ⊗ u_1 ⊗ u_1 + π_2 u_2 ⊗ u_2 ⊗ u_2 + π_3 u_3 ⊗ u_3 ⊗ u_3 + R.
- Project T along a vector w: T(I, I, w) = π_1 (w · u_1) u_1 u_1ᵀ + π_2 (w · u_2) u_2 u_2ᵀ + π_3 (w · u_3) u_3 u_3ᵀ.
- Proposal: run an eigen-decomposition on the projected matrix (see the sketch below).
- Return: the recovered eigenvectors u_i.
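A hedged sketch of this single-projection idea, reusing make_orthogonal_tensor from the earlier snippet (names and parameters are illustrative): project the tensor along one random direction w and read factor estimates off an eigendecomposition of the resulting matrix. When the coefficients π_i (w · u_i) are well separated and the noise is small, the top eigenvectors match the u_i up to sign.

```python
import numpy as np

T, pi, U = make_orthogonal_tensor(d=10, k=5, noise=0.01, seed=0)
d, k = U.shape
rng = np.random.default_rng(1)

w = rng.standard_normal(d)
w /= np.linalg.norm(w)

# M = T(I, I, w): contract the third mode of T with w,
# so M = sum_i pi_i (w . u_i) u_i u_i^T plus projected noise.
M = np.einsum('abc,c->ab', T, w)

# Eigen-decomposition of the (symmetrized) projected matrix.
eigvals, eigvecs = np.linalg.eigh((M + M.T) / 2)
# Keep the k eigenvectors with largest |eigenvalue| as factor estimates.
idx = np.argsort(-np.abs(eigvals))[:k]
U_hat = eigvecs[:, idx]

# Compare with the true factors: entries near 1 mark recovered u_i
# (up to sign and permutation).
print(np.round(np.abs(U_hat.T @ U), 2))
```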

20-22 Orthogonal tensor factorization. Sensitivity of a single matrix projection:
- Problem: eigendecomposition is very sensitive to the eigengap: error in factors ∝ 1 / min(difference in eigenvalues).
- Intuition: if two eigenvalues are equal, the corresponding eigenvectors are arbitrary.
[Figure: a projected matrix written as a sum of rank-one terms]
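A small numerical illustration of this eigengap sensitivity (my own example, not from the talk): a fixed tiny symmetric perturbation barely moves the top eigenvector when the gap is large, but rotates it almost arbitrarily once two eigenvalues nearly coincide.

```python
import numpy as np

rng = np.random.default_rng(2)

def top_eigvec(A):
    """Eigenvector associated with the largest eigenvalue of a symmetric matrix."""
    _, vecs = np.linalg.eigh(A)
    return vecs[:, -1]

E = 1e-3 * rng.standard_normal((3, 3))
E = (E + E.T) / 2                                  # small symmetric perturbation

for gap in [1.0, 1e-2, 1e-4]:
    A = np.diag([2.0, 2.0 - gap, 1.0])             # eigengap between top two = gap
    v, v_pert = top_eigvec(A), top_eigvec(A + E)
    err = np.linalg.norm(v - np.sign(v @ v_pert) * v_pert)
    print(f"eigengap {gap:8.0e}  ->  eigenvector error {err:.3f}")
```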

23-29 Orthogonal tensor factorization. Sensitivity analysis (contd.):
- Single matrix factorization: error in factors ∝ 1 / min(difference in eigenvalues).

30-34 Orthogonal tensor factorization. Sensitivity analysis (contd.) (Cardoso 1994):
- Single matrix factorization: error in factors ∝ 1 / min(difference in eigenvalues).
- Every coordinate pair needs one good projection (with a large eigengap).
- Simultaneous matrix factorization: error in factors ∝ 1 / min(average difference in eigenvalues).
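To see numerically why averaging over projections helps, here is an illustrative computation (my own, under the assumption that the eigenvalues of the projection along w are λ_i(w) = π_i (w · u_i)): the worst pairwise gap within a single random projection is typically much smaller than the worst pairwise gap after averaging over many projections.

```python
import numpy as np

rng = np.random.default_rng(3)
d = k = 8
U, _ = np.linalg.qr(rng.standard_normal((d, k)))     # orthonormal factors
pi = rng.uniform(1.0, 2.0, size=k)

def pairwise_gaps(lam):
    """|lambda_i - lambda_j| over all pairs i < j."""
    return np.abs(lam[:, None] - lam[None, :])[np.triu_indices(len(lam), 1)]

W = rng.standard_normal((25, d))
W /= np.linalg.norm(W, axis=1, keepdims=True)        # 25 random unit projections
lams = (W @ U) * pi                                   # lams[l, i] = pi_i (w_l . u_i)

single_min = np.array([pairwise_gaps(lam).min() for lam in lams])
avg_min = np.mean([pairwise_gaps(lam) for lam in lams], axis=0).min()

print(f"median single-projection min gap: {np.median(single_min):.4f}")
print(f"min of per-pair average gap     : {avg_min:.4f}")
```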

35-37 Orthogonal tensor factorization. Reduction to simultaneous diagonalization:
- Projecting along different vectors w_1, ..., w_L gives matrices M_l = T(I, I, w_l) = Σ_i π_i (w_l · u_i) u_i u_iᵀ.
- The projections share the same factors u_i, so they can be simultaneously diagonalized.

38-42 Orthogonal tensor factorization. Simultaneous diagonalization algorithm:
- Algorithm: simultaneously diagonalize the projected matrices,
  U = argmin_{U : U Uᵀ = I} Σ_{l=1}^L off(Uᵀ M_l U),   where off(A) = Σ_{i≠j} A_ij².
- Optimize using Jacobi angles (Cardoso and Souloumiac 1996); a sketch follows below.
- Widely used in the ICA community.
- Empirically, it appears to converge globally for generic inputs.
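Below is a compact NumPy sketch of orthogonal joint diagonalization by Givens (Jacobi) rotations in the spirit of Cardoso and Souloumiac (1996). It is my own illustrative implementation, not the authors' released code; the function name jacobi_joint_diag and the sweep/tolerance defaults are assumptions.

```python
import numpy as np

def jacobi_joint_diag(mats, n_sweeps=50, tol=1e-12):
    """Find an orthogonal V (approximately) minimizing sum_l off(V^T M_l V)
    by sweeping Givens rotations over coordinate pairs (Jacobi angles).
    mats: array-like of shape (L, d, d), each matrix symmetric."""
    A = np.array(mats, dtype=float).copy()
    L, d, _ = A.shape
    V = np.eye(d)
    for _ in range(n_sweeps):
        rotated = False
        for p in range(d - 1):
            for q in range(p + 1, d):
                # For each matrix, the 2-vector h_l = (a_pp - a_qq, 2 a_pq)
                # determines how a Givens rotation changes the (p, q) entry.
                h = np.stack([A[:, p, p] - A[:, q, q],
                              A[:, p, q] + A[:, q, p]], axis=1)      # (L, 2)
                G = h.T @ h
                # The dominant eigenvector of G encodes (cos 2t, sin 2t) of the
                # angle t minimizing the summed squared off-diagonal entries.
                _, evecs = np.linalg.eigh(G)
                x, y = evecs[:, -1]
                if x < 0:
                    x, y = -x, -y                                    # inner rotation
                r = np.hypot(x, y)
                if r < tol:
                    continue
                c = np.sqrt((x + r) / (2 * r))
                s = y / (2 * r * c)
                if abs(s) < tol:
                    continue
                rotated = True
                R = np.eye(d)
                R[p, p] = R[q, q] = c
                R[p, q], R[q, p] = -s, s
                A = R.T @ A @ R                                      # batched over l
                V = V @ R
        if not rotated:
            break
    return V, A   # columns of V: recovered factors; diag of A[l]: eigenvalues
```

The appeal of the Jacobi-angles update is that each coordinate pair gets a closed-form optimal rotation from the dominant eigenvector of a 2x2 matrix built from all projected matrices, so one sweep over the pairs is cheap even for many projections.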

43 Outline (section: Orthogonal tensor factorization, Projections): Introduction: tensor factorization · Orthogonal tensor factorization · Projections · Non-orthogonal tensor factorization · Related work · Empirical results · Conclusions

44-45 Orthogonal tensor factorization, Projections. Oracle and random projections:
- Hypothetically: oracle projections along the factors themselves work well.
- Practically: use projections along random directions.
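Putting the pieces together, a hedged end-to-end sketch (reusing make_orthogonal_tensor and jacobi_joint_diag from the earlier snippets; the choice L ≈ k log k follows the slides, everything else is illustrative): draw random unit projection directions, project the tensor, jointly diagonalize, and check the recovered factors against the true ones.

```python
import numpy as np

rng = np.random.default_rng(4)
T, pi, U = make_orthogonal_tensor(d=10, k=5, noise=0.01, seed=4)
d, k = U.shape

# L = O(k log k) random unit projection directions.
L = int(np.ceil(k * np.log(k))) + 1
W = rng.standard_normal((L, d))
W /= np.linalg.norm(W, axis=1, keepdims=True)

# Projected matrices M_l = T(I, I, w_l), symmetrized against noise.
mats = np.einsum('abc,lc->lab', T, W)
mats = (mats + mats.transpose(0, 2, 1)) / 2

V, D = jacobi_joint_diag(mats)

# Each true factor should align with some column of V (up to sign).
match = np.abs(V.T @ U)
print(np.round(match.max(axis=0), 3))   # entries close to 1 indicate recovery
```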

46-49 Orthogonal tensor factorization, Projections. Results: orthogonal tensor decomposition, T = Σ_{i=1}^k π_i u_i ⊗ u_i ⊗ u_i + R.
Theorem (random projections). Pick L = Ω(k log k) projections randomly from the unit sphere. Then, with high probability,
  error in factors ≤ O( (‖π‖_1 π_max / π_min²) · ε d · (1 + √(d/L)) ),
where ε is the noise magnitude; the leading part is the oracle error, and the √(d/L) part is the concentration term paid for using random rather than oracle projections.

50 Orthogonal tensor factorization, Projections. Empirical: random vs. oracle projections. [Figure: error vs. number of projections in the orthogonal case. Caption: Comparing random vs. oracle projections (d = k = 10, noise = 0.05)]

51 Outline (section: Non-orthogonal tensor factorization): Introduction: tensor factorization · Orthogonal tensor factorization · Projections · Non-orthogonal tensor factorization · Related work · Empirical results · Conclusions

52-53 Non-orthogonal tensor factorization. Non-orthogonal simultaneous diagonalization:
- The projected matrices M_l = T(I, I, w_l) = Σ_i π_i (w_l · u_i) u_i u_iᵀ now have non-orthogonal factors u_i.
- A single matrix has no unique non-orthogonal factorization.
- Two or more such matrices have a unique non-orthogonal factorization (illustrated in the sketch below).
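The uniqueness claim for two noise-free projections can be illustrated with a classical Jennrich-style computation. This is my own minimal sketch in the square case d = k, not the QRJ1D solver used in the talk: the eigenvectors of M_1 M_2⁻¹ are exactly the non-orthogonal factors.

```python
import numpy as np

rng = np.random.default_rng(5)
d = k = 5

# Non-orthogonal, well-conditioned factors and weights (square case d = k).
U = rng.standard_normal((d, k)) + 3 * np.eye(d)
pi = rng.uniform(1.0, 2.0, size=k)
T = np.einsum('i,ai,bi,ci->abc', pi, U, U, U)       # noise-free tensor

# Two projections: M_l = U diag(pi_i (w_l . u_i)) U^T share the same U.
w1, w2 = rng.standard_normal((2, d))
M1 = np.einsum('abc,c->ab', T, w1)
M2 = np.einsum('abc,c->ab', T, w2)

# Eigenvectors of M1 M2^{-1} = U D1 D2^{-1} U^{-1} recover the columns of U.
evals, evecs = np.linalg.eig(M1 @ np.linalg.inv(M2))
U_hat = np.real(evecs)

# Compare up to sign, scale, and permutation via normalized correlations.
Uh = U_hat / np.linalg.norm(U_hat, axis=0)
Un = U / np.linalg.norm(U, axis=0)
print(np.round(np.abs(Uh.T @ Un).max(axis=0), 3))   # entries near 1: recovered
```

With noise, or with d > k, one would instead use all L projections and a non-orthogonal joint diagonalizer, which is the route the talk takes next.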

54-57 Non-orthogonal tensor factorization. Non-orthogonal simultaneous diagonalization:
- Algorithm: simultaneously diagonalize the projected matrices,
  U = argmin_U Σ_{l=1}^L off(U⁻¹ M_l U⁻ᵀ),   where off(A) = Σ_{i≠j} A_ij².
- The matrices U are not constrained to be orthogonal.
- Optimize using the QRJ1D algorithm (Afsari 2006): only guaranteed to converge locally, but more stable than ALS in practice.
- Sensitivity analysis due to Afsari 2008.

58-60 Non-orthogonal tensor factorization. Results: non-orthogonal tensor decomposition.
Theorem (random projections). Pick L = Ω(k log k) projections randomly from the unit sphere. Then, with high probability,
  error in factors ≤ O( (‖U‖₂² / (1 − µ²)) · (‖π‖_1 π_max / π_min²) · ε d · (1 + √(d/L)) ),
where U = [u_1 ... u_k], µ = max_{i≠j} u_i · u_j, and ε is the noise magnitude; the factor ‖U‖₂² / (1 − µ²) measures non-orthogonality, and the remaining factor matches the orthogonal-case bound.

61 Outline (section: Related work): Introduction: tensor factorization · Orthogonal tensor factorization · Projections · Non-orthogonal tensor factorization · Related work · Empirical results · Conclusions

62-64 Related work. Notes and related work:
- Orthogonal tensor methods can factorize non-orthogonal tensors after a whitening transformation (Anandkumar, Ge, Hsu, S. M. Kakade, et al. 2013), but whitening is itself a major source of error (Souloumiac 2009).
- Simultaneous diagonalization for tensors was proposed by De Lathauwer; it relies on computing the SVD of a d⁴ × k² matrix.
- Simultaneous diagonalization of multiple projections is mentioned in Anandkumar, Ge, Hsu, S. M. Kakade, et al. 2013, but no analysis is presented.

65 Outline (section: Empirical results): Introduction: tensor factorization · Orthogonal tensor factorization · Projections · Non-orthogonal tensor factorization · Related work · Empirical results · Conclusions

66 Empirical results. Community detection (Anandkumar, Ge, Hsu, and S. Kakade 2013). [Figure: a moment tensor built from the friendship graph and its factorization into community components]

67-69 Empirical results. Community detection. [Figure: error vs. recall curves comparing the tensor power method (TPM) and orthogonal joint diagonalization (OJD)]

70 Empirical results. Crowdsourcing (Zhang et al.). [Figure: workers a, b, c providing yes/no (Y/N) labels for items]

71-74 Empirical results. Crowdsourcing (Zhang et al.). [Table: accuracy of TPM, ALS, OJD, NOJD, and MV+EM on the web, rte, birds, and dogs datasets]

75-79 Conclusions:
- Reduce tensor problems to matrix ones with random projections.
- Empirically competitive with the state of the art, with support for non-orthogonal, asymmetric tensors of arbitrary order.
- Open question: is the Jacobi angles algorithm for orthogonal simultaneous diagonalization generically globally convergent?
- Code is available on Github and Codalab.
- Thanks! Questions?
