Reconstruction of full rank algebraic branching programs

Similar documents
Deterministic identity testing for sum of read-once oblivious arithmetic branching programs

Polynomial Identity Testing

Ben Lee Volk. Joint with. Michael A. Forbes Amir Shpilka

November 17, Recent Results on. Amir Shpilka Technion. PIT Survey Oberwolfach

15-251: Great Theoretical Ideas In Computer Science Recitation 9 : Randomized Algorithms and Communication Complexity Solutions

5 Linear Transformations

CPSC 536N: Randomized Algorithms Term 2. Lecture 9

Topics in linear algebra

Polynomial Identity Testing. Amir Shpilka Technion

Explicit Noether Normalization for Simultaneous Conjugation via Polynomial Identity Testing

Introduction to Mobile Robotics Compact Course on Linear Algebra. Wolfram Burgard, Cyrill Stachniss, Kai Arras, Maren Bennewitz

Mobile Robotics 1. A Compact Course on Linear Algebra. Giorgio Grisetti

Linear Projections of the Vandermonde Polynomial

GRE Subject test preparation Spring 2016 Topic: Abstract Algebra, Linear Algebra, Number Theory.

Rank Bound for Depth-3 Identities

Dependence of logarithms on commutative algebraic groups

MATH 2331 Linear Algebra. Section 2.1 Matrix Operations. Definition: A : m n, B : n p. Example: Compute AB, if possible.

An algebraic perspective on integer sparse recovery

CSL361 Problem set 4: Basic linear algebra

Elementary maths for GMT

MATH 304 Linear Algebra Lecture 10: Linear independence. Wronskian.

Finite fields, randomness and complexity. Swastik Kopparty Rutgers University

Tangent spaces, normals and extrema

Determinant Versus Permanent

Formalizing Randomized Matching Algorithms

Linear Algebra. Min Yan

Introduction to Group Theory

Linear Algebra 1 Exam 2 Solutions 7/14/3

Introduction to Mobile Robotics Compact Course on Linear Algebra. Wolfram Burgard, Bastian Steder

MATH 323 Linear Algebra Lecture 12: Basis of a vector space (continued). Rank and nullity of a matrix.

Representations and Linear Actions

MATH 223A NOTES 2011 LIE ALGEBRAS 35

22M: 121 Final Exam. Answer any three in this section. Each question is worth 10 points.

Problem Set 2. Assigned: Mon. November. 23, 2015

Generic Picard-Vessiot extensions for connected by finite groups Kolchin Seminar in Differential Algebra October 22nd, 2005

Determinant versus permanent

Introduction to Mobile Robotics Compact Course on Linear Algebra. Wolfram Burgard, Cyrill Stachniss, Maren Bennewitz, Diego Tipaldi, Luciano Spinello

Solutions to the August 2008 Qualifying Examination

More on Noncommutative Polynomial Identity Testing

REPRESENTATION THEORY OF S n

Homework 11/Solutions. (Section 6.8 Exercise 3). Which pairs of the following vector spaces are isomorphic?

Solution. That ϕ W is a linear map W W follows from the definition of subspace. The map ϕ is ϕ(v + W ) = ϕ(v) + W, which is well-defined since

1. What is the determinant of the following matrix? a 1 a 2 4a 3 2a 2 b 1 b 2 4b 3 2b c 1. = 4, then det

Linear Algebra Massoud Malek

Lecture 10: A (Brief) Introduction to Group Theory (See Chapter 3.13 in Boas, 3rd Edition)

Announcements Wednesday, November 01

18. Cyclotomic polynomials II

Assignment 3. A tutorial on the applications of discrete groups.

1. Let r, s, t, v be the homogeneous relations defined on the set M = {2, 3, 4, 5, 6} by

0.1 Rational Canonical Forms

A Polynomial Time Deterministic Algorithm for Identity Testing Read-Once Polynomials

Background Mathematics (2/2) 1. David Barber

Chapter 2 Notes, Linear Algebra 5e Lay

MATH 205C: STATIONARY PHASE LEMMA

Chapter 2: Linear Independence and Bases

Algebraic structures I

ALGEBRA 8: Linear algebra: characteristic polynomial

Improved Polynomial Identity Testing for Read-Once Formulas

NOTES ON LINEAR ALGEBRA. 1. Determinants

EIGENVALUES AND EIGENVECTORS 3

Equivalence of Polynomial Identity Testing and Deterministic Multivariate Polynomial Factorization

LINEAR ALGEBRA REVIEW

Compute the Fourier transform on the first register to get x {0,1} n x 0.

Learning Restricted Models of Arithmetic Circuits

The Power of Depth 2 Circuits over Algebras

MATH 315 Linear Algebra Homework #1 Assigned: August 20, 2018

88 CHAPTER 3. SYMMETRIES

Lecture 2: January 18

ELEMENTARY SUBALGEBRAS OF RESTRICTED LIE ALGEBRAS

Econ 204 Supplement to Section 3.6 Diagonalization and Quadratic Forms. 1 Diagonalization and Change of Basis

Topic 1: Matrix diagonalization

MATH 31 - ADDITIONAL PRACTICE PROBLEMS FOR FINAL

Lecture notes on Quantum Computing. Chapter 1 Mathematical Background

Review of Linear Algebra Definitions, Change of Basis, Trace, Spectral Theorem

UMA Putnam Talk LINEAR ALGEBRA TRICKS FOR THE PUTNAM

MATH 2210Q MIDTERM EXAM I PRACTICE PROBLEMS

LINEAR ALGEBRA BOOT CAMP WEEK 1: THE BASICS

Reconstruction of ΣΠΣ(2) Circuits over Reals

18.06 Problem Set 8 - Solutions Due Wednesday, 14 November 2007 at 4 pm in

Minimum Polynomials of Linear Transformations

Higher-order Fourier analysis of F n p and the complexity of systems of linear forms

On the complexity of the permanent in various computational models

GALOIS GROUPS OF CUBICS AND QUARTICS (NOT IN CHARACTERISTIC 2)

2018 Fall 2210Q Section 013 Midterm Exam II Solution

Linear Algebra Practice Final

UNDERSTANDING THE DIAGONALIZATION PROBLEM. Roy Skjelnes. 1.- Linear Maps 1.1. Linear maps. A map T : R n R m is a linear map if

DETERMINISTIC POLYNOMIAL IDENTITY TESTS FOR MULTILINEAR BOUNDED-READ FORMULAE

Chapter 3. Vector spaces

MATH 101A: ALGEBRA I, PART D: GALOIS THEORY 11

SYMBOL EXPLANATION EXAMPLE

Math 350 Fall 2011 Notes about inner product spaces. In this notes we state and prove some important properties of inner product spaces.

Introduction. 1 Partially supported by a grant from The John Templeton Foundation

Algebra Review 2. 1 Fields. A field is an extension of the concept of a group.

APPENDIX A. Background Mathematics. A.1 Linear Algebra. Vector algebra. Let x denote the n-dimensional column vector with components x 1 x 2.

EXERCISES AND SOLUTIONS IN LINEAR ALGEBRA

Math 121 Homework 5: Notes on Selected Problems

(a) II and III (b) I (c) I and III (d) I and II and III (e) None are true.

Non-commutative arithmetic circuits with division

Deterministic Identity Testing for Sum of Read-Once Oblivious Arithmetic Branching Programs arxiv: v2 [cs.cc] 16 May 2015

1. Select the unique answer (choice) for each problem. Write only the answer.

Transcription:

Reconstruction of full rank algebraic branching programs Vineet Nair Joint work with: Neeraj Kayal, Chandan Saha, Sebastien Tavenas 1

Arithmetic circuits 2

Reconstruction problem f(x) Q[X] is an m-variate degree d polynomial computable by a size s circuit in circuit class C. Input: α ϵ F m Blackbox access f(α) 3

Reconstruction problem Input: α ϵ F m f(α) Output: A small arithmetic circuit computing f. The algorithm should run in time poly(m,s,d,b) where (b is the bit length of the coefficients of f). 4

Polynomial identity testing (PIT): Input: α ϵ F m f(α) Is f(x) = 0? Randomized algorithm for PIT follows easily from Schwartz-Zippel lemma Unlike PIT no efficient randomized algorithm is known for reconstruction. 5

Previous works Over finite fields [Shp07],[KS09] gave quasi-poly time deterministic reconstruction algorithm for depth three circuits with constant number of product gates. 6

Previous works Over characteristic zero fields [Sinha16] gave a poly time randomized algorithm for depth three circuits with two product gates. [GKL12] gave poly time randomized algorithm for multilinear depth four circuits with two top-level product gates. 7

Previous works [SV09], [MV16] gave deterministic poly time reconstruction for read-once formulas [KS03], [FS13] gave deterministic quasi-poly time reconstruction for ROABPs, set-multilinear ABPs and non-commutative ABPs 8

Average-case reconstruction Progress in reconstruction is slow. Can we do reconstruction for most circuits in a circuit class C? C Efficiently reconstructed 9

Average-case reconstruction Problem definition: The input f is an m variate degree d polynomial picked according to a distribution D on circuit class C Output an efficient reconstruction algorithm for f. [GKL11], [GKQ13] gave randomized poly time algorithm for average-case reconstruction of multilinear formulas and formulas. 10

Algebraic branching programs (ABP) Definition: Consider the product of d matrices as X 1 X 2 X d, where X 1 is a row vector of length w, X d is a column vector of length w and X 2,, X d-1 are wxw matrices. Each entry of X i, i [d] is an affine form in X variables. X = m, example a 0 + a 1 x 1 + + a m. Polynomial computed by the ABP is the entry in the 1x1 matrix computed as above. Length and width of the ABP is w and d respectively. 11

Distribution on ABPs Random ABP: Fix w,d and m. Pick the constants of the linear forms independently and uniformly at random from a large set S Q. Average-case reconstruction: Design a reconstruction algorithm for random(m,w,d,s) ABP. 12

Average-case reconstruction for ABPs Input: Blackbox access to f(x) computable by random(m,w,d,s) ABP. α ϵ F n f(α) Output: A small ABP computing f with high probability. The algorithm should run in time poly(m,w,d,ρ) - (ρ bit length of an element in S). 13

Pseudo-random family A distribution D on m variate degree d polynomial family with seed length s=(md) O(1) generates a pseudo-random family if Every algorithm that distinguishes a polynomial coming from D and uniformly random m-variate polynomial with a non-negligible bias runs in time exponential in s. 14

Candidate family [Aar08] conjectures the family Det n (AX) where every entry of A ϵ F t x m is chosen uniformly at random from a finite field and m << t=n 2 is pseudo-random Example x 1 +x 2 6x 1 +x 2 x 1 +3x 2 5x 1 +4x 2 8x 1 +x 2 10x 1 +x 2 8x 1 +3x 2 3x 1 +2x 2 m = 2, n = 4 8x 1 +2x 2 5x 1 +4x 2 7x 1 +9x 2 11x 1 +x 2 4x 1 +3x 2 9x 1 +3x 2 5x 1 +6x 2 9x 1 +7x 2 15

Iterated matrix multiplication Definition: Consider the product of d matrices as X 1 X 2 X d, where X 1 is a row vector of length w, X d is a column vector of length w and X 2,, X d-1 are wxw matrices. Each entry of X i, i [d] is a distinct variable. The variables are disjoint across matrices. IMM w,d is the entry in 1x1 matrix computed as above. 16

Consequence Det n and IMM w,d are affine projections of each other [Mahajan, Vinay 97]. Hence, it makes sense to ask whether IMM w,d (AX) where A ϵ F t x m is chosen uniformly at random from a finite S Q and m << t = w 2 (d-2) + 2w is pseudorandom. 17

18 Our Contribution

Main result 19

Remarks Does not resolve Aaronson s conjecture For IMM w,d the conjecture holds when m << w 2 d Our result holds when m w 2 d Our result works even if the matrices are not of uniform width. 20

Full rank ABPs If m w 2 d then the affine forms in the ABP are Q-linearly independent with high probability. Full rank ABPs: the set of linear forms in X 1, X 2,, X d are Q-linearly independent. Example: x 1 + x 2 x 2 + x 3 x 3 + x 4 x 4 + x 5 x 5 + x 6 x 6 + x 7 x 13 + x 14 x 7 + x 8 x 8 + x 9 x 9 + x 10 x 14 + x 15 x 10 + x 11 x 11 + x 12 x 12 + x 13 x 15 + x 16 21

Full rank ABPs If m w 2 d then the affine forms in the ABP are Q-linearly independent with high probability. Full rank ABPs: the set of linear forms in X 1, X 2,, X d are Q-linearly independent. Main result: We design an efficient randomized algorithm to reconstruct full rank ABPs. 22

Equivalent polynomials An n-variate polynomial f is equivalent to an n- variate polynomial g if there exists an invertible A ϵ F n xn such that f(x) = g(ax) Equivalence test: g(x) f(x) Is there an invertible A in F nxn such that f(x) = g(ax) 23

Equivalent polynomials Equivalence test: IMM(X) f(x) Is there an invertible A in F nxn such that f(x) = IMM(AX) Remark: Computing a full rank ABP for f is the same as designing an efficient randomized equivalence test for IMM 24

Group of symmetries of IMM Group of symmetries: For an n variate polynomial g(x) it is the set of all invertible A F nxn such that g(ax) = g(x). Characterization by symmetries: g(x) is characterized by its group of symmetries then The group of symmetries of f(x) and g(x) are equal if and only if f(x) is a constant multiple of g(x) Main theorem 2: IMM w,d is characterized by its group of symmetries. 25

26 Proof Ideas

Template of the reconstruction algorithm Assume the input polynomial f is computable by a full rank ABP Compute a full rank ABP 1. Find the layer spaces 2. Glue them together Do a polynomial identity test to check if the polynomial computed by the ABP is f Output `f is not computable by a full rank ABP no yes Output the full rank ABP computing f 27

Pre-processing Let an m variate polynomial f be computed by a width w and length d full rank ABP. The number of edges is n = w 2 (d-2) +2w Two steps of pre-processing: m n Variable reduction: At the end of this step we get an n variate f computable by a full rank ABP Translation equivalence test: The entries in the matrices of the full rank ABP computing f are linear forms (constant term is 0). 2

Multiple full rank ABPs for f Suppose f is computable by a full rank ABP X 1 X 2 X d Then this full rank ABP for f is not unique The following transformations still compute f Transposition Left-right multiplication 29 Corner translations

Transposition Recall X 1 and X d are row and column vectors Since the eventual product is a 1x1 matrix the transpose of the product still computes f Hence f is also computed by T X d T X 2 T X 1 30

Left-right multiplication Let A be a wxw invertible matrix with entries from Q Replace X 2 with X 2 = X 2 A and X 3 = A -1 X 3 f is computed by the product X 1 X 2 X 3 X d 31

Corner translations Let B be an anti-symmetric wxw matrix, then X 1 B T X 1 = 0 x 1 + x 2 x 2 + x 3 x 3 + x 4 0 4 5 x 1 + x 2-4 0 8-5 -8 0 x 2 + x 3 = 0 x 3 + x 4 32

Corner translations Let B 1, B 2,, B w be anti-symmetric wxw matrices. Let Y be the matrix such that the i-th column of Y is B i T X 1 i-th column of matrix Y B i T X 1 33

Corner translations Replace X 2 with X 2 = X 2 + Y Observe that X 1 X 2 = X 1 X 2 as X 1 Y = 0 wxw f is computed by the product X 1 X 2 X 3 X d = X 1 (X 2 + Y) X 3 X d Similarly we can define corner translations for X d-1 34

Uniqueness of the layer spaces Suppose f is computable by a full rank ABP X 1 X 2 X d Let X i denote the Q-linear space spanned by the linear forms in X i X 1,2 and X d-1,d denote the the Q-linear space spanned by the linear forms in X 1,X 2 and X d-1, X d respectively 35

Uniqueness of the layer spaces If X 1 X 2 X d computes f then either X i = X i for i ϵ [d]\{2,d-1} X 1,2 = X 1,2 and X d-1,d = X d-1,d or X i = X d-i for i ϵ [d]\{2,d-1} X 1,2 = X d-1,d and X d-1,d = X 1,2 36

Uniqueness of the layer spaces X 1 X 3 X 4 X d-2 X d X 1,2 X d-1,d 37

Uniqueness of the layer spaces X d X d-2 X d-3 X 3 X 1 X d-1,d X 1,2 38

Group of symmetries of IMM The set of all invertible A ϵ F n xn IMM w,d (AX) = IMM w,d. such that We show that the group of symmetries are generated by the following subgroups T denotes the group corresponding to transpositions M denotes the group corresponding to left-right multilpications C denotes the group corresponding to corner translations 39

Group of symmetries of IMM Main Theorem: G IMM = C H, where H = M T C is a normal subgroup in G IMM and M is a normal subgroup in H We also show that IMM is characterized by its group of symmetries That is any polynomial with the same symmetry group is a constant multiple of IMM 40

Computing the full rank ABP: Step 1 Computing the layer spaces: Study the Lie algebra of the group of symmetries of IMM w,d [Kay12] Lie algebra of the group of symmetries of f is the set of matrices A = (a i,j ) i,j [n] We just use the vector space property of the algebra 41

Invariant spaces Invariant space: Let M: Q n Q n be a linear operator. U Q n is an invariant space if M(U) U The definition can be extended to a set of linear operators {M 1, M 2,, M n } The layer spaces of an f computed by a full rank ABP are intimately connected to the invariant spaces of Lie algebra of f 42

Computing the layer spaces Compute a basis of the Lie algebra of f Easy: Involves solving a set of linear dependencies Compute the irreducible invariant spaces of the Lie algebra of f Since f and IMM are equivalent their Lie algebras are conjugates of each other Compute the layer spaces from the irreducible invariant spaces We show that the layer spaces are in fact the irreducible invariant spaces in some sense 43

Computing the full rank ABP: Step 2 Ordering the layer spaces: We use evaluation dimension to order the layer spaces. Definition: Evaluation Dimension for a polynomial H(X) is defined with respect to a set of variables S X Evaldim S [H(X) ] is equal to dim (span{h(x) x j =ɑ j for x j S,where ɑ j F}) 44

Ordering the layer spaces We make the variables in distinct layers are disjoint by mapping the basis vectors of the layer spaces to distinct variables. Then we find the ordering inductively. 45

Ordering the layer spaces Base Case Evaluation dimension = w Evaluation dimension = w 2 46

Ordering the layer spaces Inductive Step Evaluation dimension = w Evaluation dimension = w 2 47

48 Thank You