A Switching Lemma Tutorial. Benjamin Rossman, University of Toronto and NII, Tokyo
1–2 A Switching Lemma Tutorial* — *an homage to Paul Beame's excellent "A Switching Lemma Primer"
3 Overview 1. Basic Concepts: models of computation (AC0, DNFs/CNFs, decision trees), the random restriction R_p 2. The Classic Switching Lemma (Håstad '86) 3. PARITY Lower Bound 4. Affine Restrictions: the Tseitin expander switching lemma (joint with Pitassi, Servedio and Tan)
4 Models of Computation: AC0 circuits / formulas; DNFs / CNFs (i.e. depth-2 formulas); decision trees; the canonical decision tree of a DNF / CNF
5–6 AC0 Circuits: size = # of gates, depth = # of layers of gates. The complexity class AC0 refers to (sequences of) Boolean functions {0,1}^n → {0,1} computable by poly(n)-size, constant-depth circuits. [figure: a small circuit over input literals x1, ¬x1, ..., x5, ¬x5]
7–8 Depth-2 (DNFs and CNFs): DNF = disjunctive normal form (OR-of-ANDs formula); CNF = conjunctive normal form (AND-of-ORs formula); width = max # of variables in a term/clause; k-DNF = width-k DNF, k-CNF = width-k CNF
9 Decision Trees [figure: a decision tree of depth 3, querying x3 at the root, then x1 or x7, then x4, x7, x8]
10 Decision Trees: The decision-tree depth of a Boolean function f : {0,1}^n → {0,1}, denoted DTdepth(f), is the minimum depth of a decision tree that computes f. DTdepth(PARITY_n) = DTdepth(AND_n) = n; DTdepth(f) = 0 ⟺ f is constant
11–15 Decision Tree to DNF: each 1-leaf contributes one term, the AND of the literals (negated or not according to the branch direction) queried on the path to that leaf — here (x3 ∧ x1 ∧ x4) ∨ (x3 ∧ x1 ∧ x7) ∨ (x3 ∧ x7 ∧ x1) ∨ (x3 ∧ x7 ∧ x8), built up one term per slide [figure: the depth-3 tree with the corresponding branches highlighted]
16–23 Canonical DT of a DNF F = (x1 ∧ x2 ∧ x3) ∨ (x2 ∧ x4) ∨ (x1 ∧ x3 ∧ x7) (literals negated as on the slides): query the free variables of the first term in order; on each outcome that falsifies the term, continue to the first remaining term not yet falsified. A branch ends with output 1 as soon as some term is satisfied, and with output 0 once every term is falsified [figure: the canonical tree built up term by term over eight slides]
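The canonical decision tree can be sketched in code. The following is a minimal illustration, not from the slides: the function name `candt_depth` and the literal encoding (+i for x_i, −i for ¬x_i) are mine. It queries the free variables of the first non-falsified term as a block and recurses on every outcome, returning the depth of the resulting tree.

```python
import itertools

# Depth of the canonical decision tree of a DNF (a hypothetical sketch).
# A DNF is a list of terms; each term is a list of literals, +i for x_i
# and -i for NOT x_i.  R maps variable -> 0/1 (absent variables are free).
def candt_depth(dnf, R=None):
    R = dict(R or {})
    for term in dnf:
        free, falsified = [], False
        for lit in term:
            v = abs(lit)
            if v in R:
                if (R[v] if lit > 0 else 1 - R[v]) == 0:
                    falsified = True   # term dies under R: skip it
                    break
            else:
                free.append(v)
        if falsified:
            continue
        if not free:
            return 0                   # first live term is satisfied: 1-leaf
        # query all free variables of this term, then recurse on each outcome
        worst = 0
        for bits in itertools.product((0, 1), repeat=len(free)):
            R2 = dict(R)
            R2.update(zip(free, bits))
            worst = max(worst, candt_depth(dnf, R2))
        return len(free) + worst
    return 0                           # every term falsified: 0-leaf
```

For instance, `candt_depth([[1, 2]])` is 2: both variables of the single term are queried.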
24 Restrictions: a.k.a. functions [n] → {0,1,*}, a.k.a. subcubes of {0,1}^n
25–28 Restrictions: Consider a Boolean function f : {0,1}^n → {0,1}. A restriction (w.r.t. the variables of f) is a function R : [n] → {0,1,*}, where [n] = {1,...,n}, which we identify with the variables x1,...,xn of f(x). R(i) = 0 or 1 means that the variable xi is fixed to 0 or 1; R(i) = * means that xi is free (unrestricted)
29 Restrictions: For f : {0,1}^n → {0,1} and R : [n] → {0,1,*}, applying R to f gives a Boolean function f|R : {0,1}^Stars(R) → {0,1}
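The semantics of f|R can be sketched directly. A minimal illustration (the helper name `apply_restriction` and the 0-indexed coordinates are mine, not the slides'): fixed coordinates take their value from R, and the restricted function reads one bit per star.

```python
# A sketch of applying a restriction R : range(n) -> {0, 1, '*'} to a
# Boolean function f given as a Python callable on n-bit tuples.
def apply_restriction(f, n, R):
    """Return (f|R, Stars(R)): f|R takes one bit per starred coordinate."""
    stars = [i for i in range(n) if R[i] == '*']
    def f_R(bits):
        x = [R[i] for i in range(n)]
        for i, b in zip(stars, bits):
            x[i] = b
        return f(tuple(x))
    return f_R, stars

# Example: AND_3 with x_1 fixed to 1 behaves as AND on the two free variables
AND3 = lambda x: int(all(x))
f_R, stars = apply_restriction(AND3, 3, {0: 1, 1: '*', 2: '*'})
```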
30 Restrictions are applied syntactically to DNFs / CNFs / decision trees / AC0 circuits, etc.
31–35 Restricting an AC0 Circuit: Consider R = { x1 ↦ 1 }, i.e., R(1) = 1 and R(i) = * for all i ≠ 1 [figure: over five slides, the input literal x1 becomes the constant 1 and ¬x1 becomes 0, and gates fed constants simplify away]
36 Restricting an AC0 Circuit — Gate Elimination Method: n^{1+1/exp(d)} lower bounds against depth-d circuits via deterministic restrictions [Chaudhuri & Radhakrishnan '96]
37–41 Restricting a DNF: R = { x1 ↦ 1, x4 ↦ 0 } applied to F = (x1 ∧ x2 ∧ x3) ∨ (x1 ∧ x3 ∧ x5) ∨ (x2 ∧ x4 ∧ x5) ∨ (x3 ∧ x4 ∧ x6) ∨ (x7 ∧ x8 ∧ x9) ∨ (x4 ∧ x7 ∧ x9) (literals negated as on the slides): each satisfied literal is deleted from its term and each falsified literal deletes its whole term, yielding F|R = (x2 ∧ x3) ∨ (x2 ∧ x5) ∨ (x7 ∧ x8 ∧ x9) ∨ (x7 ∧ x9)
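The syntactic restriction of a DNF follows exactly this rule. A minimal sketch (the function name `restrict_dnf` and the signed-integer literal encoding are mine; the example terms below are generic, not the slides'):

```python
# Syntactically restrict a DNF.  Terms are lists of literals: +i means
# x_i, -i means NOT x_i.  R maps variable -> 0/1; absent variables are free.
def restrict_dnf(dnf, R):
    out = []
    for term in dnf:
        new_term, killed = [], False
        for lit in term:
            v = abs(lit)
            if v in R:
                if (R[v] if lit > 0 else 1 - R[v]) == 0:
                    killed = True      # falsified literal: the term drops out
                    break
                # satisfied literal: simply deleted from the term
            else:
                new_term.append(lit)
        if killed:
            continue
        if not new_term:
            return True                # a term is fully satisfied: F|R = 1
        out.append(new_term)
    return out                         # [] means F|R = 0
```

For example, `restrict_dnf([[1, 2, 3], [-2, 4]], {1: 1})` shrinks the first term to `[2, 3]` and leaves the second untouched.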
42–44 Restricting a Decision Tree: again consider R = { x1 ↦ 1 } [figure: over three slides, each node querying x1 is replaced by its 1-branch subtree]
45 The Random Restriction R_p: Throughout this talk, random objects are in boldface
46 The Random Restriction R_p: For 0 ≤ p ≤ 1, let R_p be the random restriction with R_p(i) = * with prob. p, 0 with prob. (1−p)/2, 1 with prob. (1−p)/2, independently for each variable index i ∈ [n]
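Sampling R_p is a direct transcription of this definition (a sketch; the function name `sample_Rp` is mine):

```python
import random

# Sample R_p: each coordinate is independently '*' with probability p,
# and otherwise a uniform bit (0 or 1, each with probability (1-p)/2).
def sample_Rp(n, p, rng=random):
    return {i: '*' if rng.random() < p else rng.randrange(2)
            for i in range(n)}
```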
47 Effect of R_p: R_p simplifies Boolean functions represented by small decision trees / AC0 circuits / De Morgan formulas, etc. [Subbotovskaya '61, ...]. Certain Boolean functions, like PARITY, maintain their complexity under R_p ⟹ lower bounds!
48–49 Effect of R_p on Boolean Functions: R_p kills AND_n with very high probability (for p ≤ 1/2): Pr[ AND_n|R_p ≡ 0 ] ≥ 1 − (1/2)^{O(n)}. On the other hand, for any R with k stars, PARITY_n|R ≡ PARITY_k or 1 ⊕ PARITY_k. In particular, DTdepth(PARITY_n|R_p) ≈ pn with high probability
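The PARITY claim is easy to check directly: fixing some variables only adds a constant offset to the parity, so the restricted function is PARITY on the stars or its complement. A quick sketch (the helper names are mine, not the slides'):

```python
def parity(bits):
    return sum(bits) % 2

# For a restriction R (a list over the n variables, '*' marking free
# coordinates), PARITY_n|R on the free bits is PARITY_k XOR a constant
# offset given by the fixed bits.
def restricted_parity(R, free_bits):
    offset = sum(v for v in R if v != '*') % 2
    return (offset + parity(free_bits)) % 2
```

So with R = [1, '*', 0, '*'], filling the stars with (1, 0) agrees with parity of the full assignment (1, 1, 0, 0).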
50–51 Effect of R_p on Decision Trees — Decision Tree Switching Lemma: If T is a depth-k decision tree, then Pr[ depth(T|R_p) ≥ l ] ≤ (2pk)^l, which is ≤ 2^{−l} when p ≤ 1/4k
52–53 Effect of R_p on DNFs — Håstad's Switching Lemma (1986): If F is a k-DNF (i.e. an OR of depth-k decision trees), then Pr[ DTdepth(F|R_p) ≥ l ] ≤ (5pk)^l, which is ≤ 2^{−l} when p ≤ 1/10k
54–55 Corollary (usual statement of the S.L.): If F is a k-DNF, then Pr[ F|R_p is not equivalent to an l-CNF ] ≤ (5pk)^l; dually, if F is a k-CNF, then Pr[ F|R_p is not equivalent to an l-DNF ] ≤ (5pk)^l
56 Effect of R_p on DNFs [figure: a width-k DNF, hit by R_p with small p, switches w.h.p. to a depth-l decision tree]
57–63 Depth Reduction: start from an unbounded fan-in circuit whose bottom two layers form k-DNFs/k-CNFs; apply the Switching Lemma to each such gate (taking a union bound over the failure probabilities) to replace them by depth-l decision trees; the two resulting adjacent layers of like gates merge, reducing the depth by one; typically k = l ≈ n^{1/d} [figure: the circuit before and after one round of switching]
64 Depth Reduction [figure: an unbounded fan-in circuit over depth-k decision trees]
65–66 Håstad's Switching Lemma (decision tree version): If T1, ..., Tm are depth-k decision trees, then Pr[ DTdepth((T1 ∨ ... ∨ Tm)|R_p) ≥ l ] ≤ (5pk)^l [figure: the OR of depth-k decision trees switches to a single depth-l decision tree]
67 Depth Reduction: applying the S.L. to the bottom layer of gates
68 PARITY Lower Bound
69–71 Håstad '86: Depth d+1 circuits for PARITY require size exp(Ω(n^{1/d})). (Trivial) Upper Bound: PARITY has depth d+1 circuits of size exp(O(n^{1/d})) [figure: a depth-d tree of parity gates of fan-in n^{1/d}, each realized by a depth-2 circuit of size 2^{n^{1/d}}]
72 PARITY Lower Bound [figure: a circuit over input literals x1, ¬x1, ..., x5, ¬x5]
73–76 PARITY Lower Bound: the bottom-level gates are depth-1 decision trees on n variables; apply an R_p-random restriction ρ1 : [n] → {0,1,*}, then the Switching Lemma to each bottom-level gate (taking a union bound over the failure probabilities), leaving low-depth decision trees on ≈ pn variables
77 PARITY Lower Bound: a 1 − O(ε) fraction have depth 0, an O(ε) fraction have depth 1, an O(ε²) fraction have depth 2, ..., an O(ε^l) fraction have depth l — low-depth decision trees on ≈ pn variables
78–84 PARITY Lower Bound: repeat with R_p-random restrictions ρ2 : Stars(ρ1) → {0,1,*}, ρ3 : Stars(ρ2) → {0,1,*}, ρ4 : Stars(ρ3) → {0,1,*}; after each round the circuit computes low-depth decision trees on ≈ p²n, p³n, p⁴n variables, until only a depth-0 decision tree (i.e. a constant function) remains
85 PARITY Lower Bound: w.h.p., the output is fixed to 0 or 1 after restrictions ρ1ρ2ρ3ρ4 (equivalent to R_{p⁴}). If p⁴n ≥ 1, this shows the original circuit does not compute PARITY_n
86 PARITY Lower Bound: the optimal choice of parameters gives a tight exp(Ω(n^{1/d})) lower bound for depth d+1 circuits
87 Formulas vs. Circuits
88–89 AC0 Circuits vs. AC0 Formulas [figures: a circuit with shared gates, versus a tree-shaped formula over literals]
90 Fact: Every depth d+1 circuit of size S is equivalent to a depth d+1 formula of size at most S^d
91–92 PARITY Lower Bound — Håstad '86: Depth d+1 circuits for PARITY require size exp(Ω(n^{1/d})); consequently, poly-size circuits for PARITY have depth log n / log log n + O(1). R. '15: Depth d+1 formulas for PARITY require size exp(Ω(d·n^{1/d})); consequently, poly-size formulas for PARITY have depth Ω(log n)
93 Two Views of R_p : [n] → {0,1,*}: (1) Independently for each coordinate i ∈ [n], set R_p(i) = * with prob. p, 0 with prob. (1−p)/2, 1 with prob. (1−p)/2. (2) Draw σ ∈ {0,1}^n, a uniform random assignment, and τ ∈ [0,1]^n, a uniform random time-stamp; set R_p(i) = * if τ_i ≤ p, and σ_i if τ_i > p
94–101 [animation: σ ∈ {0,1}^n a uniform random assignment, τ ∈ [0,1]^n uniform random time-stamps, R_p(i) = * if τ_i ≤ p, else σ_i; as the time p runs from 1 down to 0, coordinate i freezes to σ_i once p drops below τ_i — at the start time p = 1 every coordinate is free, at p = 0.99, 0.5, 0.1 fewer and fewer stars remain, and at the end time p = 0 the restriction is the full assignment σ]
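The time-stamp view can be sketched directly, and its payoff is the coupling it provides: for a single draw of (σ, τ), the restrictions R_p are nested as p decreases (a sketch; the function names are mine, not the slides'):

```python
import random

# The time-stamp view of R_p: coordinate i is free iff tau_i <= p,
# and otherwise frozen to sigma_i.
def Rp_from(sigma, tau, p):
    return ['*' if t <= p else s for s, t in zip(sigma, tau)]

def stars(R):
    return {i for i, v in enumerate(R) if v == '*'}

rng = random.Random(0)
n = 12
sigma = [rng.randrange(2) for _ in range(n)]
tau = [rng.random() for _ in range(n)]
# As p decreases, stars only disappear (each freezing to sigma_i), so
# Stars(R_{0.3}) is a subset of Stars(R_{0.7}) for the same (sigma, tau).
```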
102–106 [figure: composing independent restrictions corresponds to lowering the time threshold — applying R_p, then R_p again to the survivors, and so on, yields R_{p²}, R_{p³}, R_{p⁴} at times p², p³, p⁴ on the same timeline]
107–108 Each sub-formula F will have its own stopping time q(F); q(F) is a random variable in [0,1], determined by σ and τ
109–113 [figure: sub-formulas G, H, I, J and the whole formula F marked on the timeline with their stopping times q(G), q(H), q(I), q(J), q(F)]
114–119 [figure: at time p, the restriction R_p switches the bottom width-k sub-formulas to decision trees; a width-k sub-formula F receives the stopping time q(F) = p/14k]
120 Håstad's Switching Lemma: If F is a k-DNF or k-CNF, then Pr[ DTdepth(F|R_{1/14k}) ≥ l ] ≤ e^{−l}. Stopping Time Version: For every Boolean formula F, Pr[ DTdepth(F|R_{q(F)}) ≥ l ] ≤ e^{−l}
121 Technical Main Lemma (tail bound for q(F)): If F has depth d+1, then for all 0 ≤ λ ≤ 1, Pr[ q(F) ≤ λ ] ≤ C^d · size(F) / exp(Ω(d·λ^{−1/d})). The constant C < 8 is defined by C = 1 + Σ_{i=0}^∞ 1/exp(e^{i−1} − (i+1)e^{−2}) + Σ_{i=0}^∞ Σ_{j=0}^∞ 1/exp((j+1)e^{i−1} − (i+j+2)e^{−2})
122 Corollary (of the Technical Main Lemma): Depth d+1 formulas for PARITY require size exp(Ω(d·n^{1/d}) − O(d))
123 Proof of Håstad's Switching Lemma
124–126 Fix a k-DNF F and l ≥ 1. Let BAD := { restrictions R : depth(CanDT(F|R)) ≥ l }. Goal: Pr[ R_p ∈ BAD ] = O(pk)^l — this is exactly the Switching Lemma, Pr[ depth(CanDT(F|R_p)) ≥ l ] = O(pk)^l
127–132 Key idea: we associate each R ∈ BAD with a restriction R* such that (1) |Stars(R*)| = |Stars(R)| − l (pictured with l = 3); in particular, Pr[ R_p = R* ] / Pr[ R_p = R ] = ((1−p)/2p)^l, since for any restriction ρ with s stars, Pr[ R_p = ρ ] = p^s ((1−p)/2)^{n−s}; and (2) the map R ↦ R* is (4k)^l-to-1, i.e. any restriction ρ equals R* for at most (4k)^l distinct R ∈ BAD
133–139 With (1) and (2) in hand:
Pr[ R_p ∈ BAD ] = Σ_{R ∈ BAD} Pr[ R_p = R ]
= Σ_{R ∈ BAD} (2p/(1−p))^l · Pr[ R_p = R* ]
≤ (4p)^l · Σ_{R ∈ BAD} Pr[ R_p = R* ] (w.l.o.g. p ≤ 1/2)
≤ (4p)^l · (4k)^l · Pr[ R_p ∈ {R* : R ∈ BAD} ] (the map R ↦ R* is (4k)^l-to-1)
≤ (16pk)^l (probabilities are ≤ 1). Q.E.D.
140–141 This proves the Switching Lemma in the form Pr[ depth(CanDT(F|R_p)) ≥ l ] ≤ (16pk)^l; a more careful analysis gives (5pk)^l
142–148 Key idea, refined: we associate each R ∈ BAD with a pair (R*, Code(R)) such that (1) |Stars(R*)| = |Stars(R)| − l; (3) Code(R) ∈ ({0,1}² × [k])^l — in particular, Code(R) has (4k)^l possible values; and (4) the map R ↦ (R*, Code(R)) is 1-to-1. Note that (3) & (4) imply (2), that the map R ↦ R* is (4k)^l-to-1. Intuitively, Code(R) is a recipe for inverting R ↦ (R*, Code(R)) given knowledge of R*
149 The encoding R ↦ (R*, Code(R))
150–155 Example (k = 3, l = 4): F = (x1 ∧ x2 ∧ x3) ∨ (x1 ∧ x3 ∧ x5) ∨ (x2 ∧ x4 ∧ x5) ∨ (x3 ∧ x4 ∧ x6) ∨ (x1 ∧ x4 ∧ x7) (literals negated as on the slides) and R = { x1 ↦ 1, x4 ↦ 0 }. [figure: CanDT(F|R), with a long path querying x2, x7, x3, x5] Here R ∈ BAD, since depth(CanDT(F|R)) ≥ l = 4
156–161 Following a long path of CanDT(F|R) gives R~ = { x1 ↦ 1, x4 ↦ 0, x2 ↦ 1, x3 ↦ 1, x5 ↦ 0, x7 ↦ 1 } (the answers actually given along the path) and R* = { x1 ↦ 1, x4 ↦ 0, x2 ↦ 1, x3 ↦ 0, x5 ↦ 1, x7 ↦ 0 } (the same variables, re-fixed so as to satisfy the relevant terms of F)
162–163 Code(R) says: find the first satisfied term of F|R*; the long path begins with x2 ↦ 1, x3 ↦ 1 (i.e. var 2 of term 1 and var 3 of term 1). Fix the variables according to the beginning of the long path
164–167 [figure: the long path of CanDT(F|R), with the segment x2, x3 processed and the next queried variable highlighted]
168–170 Code(R) next says: find the next satisfied term of F|R* (overwriting x2 ↦ 1, x3 ↦ 1); the long path continues with x5 ↦ 0 (i.e. var 3 of the term, answered 0). Fix the variables according to the long path
171–172 Code(R) ∈ ({0,1}² × [k])^l: given knowledge of R* (and F), follow these instructions to recover R (and, along the way, R~). ✓ R* has l fewer stars than R ✓ R ↦ (R*, Code(R)) is 1-to-1
173 A Different Approach
174 A Different Approach — Håstad's Switching Lemma: If F is a k-DNF, then Pr[ depth(CanDT(F|R_p)) ≥ l ] = O(pk)^l. Weak (but flexible) Switching Lemma: If F is a k-DNF, then Pr[ depth(CanDT(F)|R_p) ≥ l ] = O(pk·2^k)^l — note that here the restriction is applied to the canonical decision tree of F itself
175 The Weak Switching Lemma already shows that PARITY requires depth-d circuits of size exp(Ω((log n / d)²))
176 The proof generalizes to affine restrictions! Weak (but flexible) Switching Lemma: If F is a k-DNF, then Pr[ depth(CanDT(F)|R_p) ≥ l ] = O(pk·2^k)^l
177–178 A Different Approach: 1. Decision Tree Switching Lemma 2. k-clipped Decision Trees 3. Arbitrary Distribution of Stars 4. Switching Lemma for Affine Restrictions 5. Tseitin Expander Switching Lemma
179 Binomial Distribution: For a finite set N, let S ⊆_p N denote that S is a p-biased random subset of N, i.e., Pr[ S = S ] = p^{|S|} (1−p)^{|N|−|S|}. For a natural number n and 0 ≤ p ≤ 1, the binomial random variable Bin(n,p) is distributed as |S| where S ⊆_p [n]
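A p-biased subset is sampled by independent inclusion, which makes the |S| ~ Bin(|N|, p) equivalence immediate (a sketch; the function name `p_biased_subset` is mine):

```python
import random

# A p-biased random subset of N: include each element independently
# with probability p.  |S| is then distributed as Bin(|N|, p).
def p_biased_subset(N, p, rng=random):
    return {x for x in N if rng.random() < p}
```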
180 Decision Tree Switching Lemma: If T is a depth-k decision tree, then Pr[ depth(T|R_p) ≥ l ] ≤ (2pk)^l
181–185 Consider a random branch of T: starting at the root, follow each child with probability 1/2 [figure: a decision tree with branch probabilities 1/4, 1/8, 1/16 at the leaves]. Let β(T) ⊆ [n] be the set of variables queried on a random branch of T, and #β(T) its cardinality. Obs: If depth(T) ≥ k, then Pr[ #β(T) ≥ k ] ≥ 2^{−k}
186–187 Claim: The following random variables have the same distribution: 1. β(T|R_p), the set of variables queried on a random branch of T|R_p; 2. S ∩ β(T), a p-biased subset of the variables on a random branch of T. In particular, #β(T|R_p) and Bin(#β(T), p) have the same distribution
188–204 [animation: on the left, sample R_p, form T|R_p, and follow a random branch to obtain β(T|R_p); on the right, follow a random branch of T to obtain β(T) and then take a p-biased subset S ∩ β(T); the two processes produce identically distributed sets, hence #β(T|R_p) ~ Bin(#β(T), p)]
205–215 Decision Tree Switching Lemma: If T is a depth-k decision tree, then Pr[ depth(T|R_p) ≥ l ] ≤ (2pk)^l. Proof:
Pr[ depth(T|R_p) ≥ l ] ≤ Pr_{R_p}[ Pr_{β(T|R_p)}[ #β(T|R_p) ≥ l ] ≥ 2^{−l} ] (recall: depth(T′) ≥ l ⟹ Pr[ #β(T′) ≥ l ] ≥ 2^{−l})
≤ 2^l · Ex_{R_p}[ Pr_{β(T|R_p)}[ #β(T|R_p) ≥ l ] ] (Markov's inequality)
= 2^l · Pr[ #β(T|R_p) ≥ l ]
= 2^l · Pr[ Bin(#β(T), p) ≥ l ] (#β(T|R_p) and Bin(#β(T), p) have the same distribution)
≤ 2^l · Pr[ Bin(k, p) ≥ l ] (#β(T) ≤ depth(T) ≤ k)
≤ 2^l · (pk)^l (union bound)
= (2pk)^l. Q.E.D.
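The final union-bound step, Pr[ Bin(k,p) ≥ l ] ≤ C(k,l)·p^l ≤ (pk)^l, is easy to sanity-check numerically (a sketch for intuition, not part of the proof; the function name `binom_tail` is mine):

```python
from math import comb

# Exact tail of the binomial distribution: Pr[Bin(k, p) >= l].
def binom_tail(k, p, l):
    return sum(comb(k, j) * p**j * (1 - p)**(k - j) for j in range(l, k + 1))
```

For small p the tail sits well below the (pk)^l bound used in the proof.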
216 A Different Approach: 1. Decision Tree Switching Lemma 2. k-clipped Decision Trees 3. Arbitrary Distribution of Stars 4. Switching Lemma for Affine Restrictions 5. Tseitin Expander Switching Lemma
217–218 Recall: Canonical DT of a k-DNF, e.g. (x1 ∧ x2 ∧ x3) ∨ (x2 ∧ x4) ∨ (x1 ∧ x3 ∧ x7) [figure: its canonical tree]. Obs: Every node has distance ≤ k from a leaf!
219–220 Definition: A decision tree is k-clipped if every node has distance ≤ k from a leaf. Lemma: If F is a k-DNF (or k-CNF), then CanDT(F) is k-clipped
221–222 Lemma: If T is k-clipped, then Ex[ (#β(T) choose l) ] ≤ O(k·2^k)^l. (Here #β(T) is stochastically dominated by the number of unbiased coin flips until seeing k consecutive heads.)
223 k-clipped Decision Tree Switching Lemma: If T is a k-clipped decision tree, then Pr[ depth(T↾R_p) ≥ ℓ ] = O(pk·2^k)^ℓ
224 k-clipped Decision Tree Switching Lemma: If T is a k-clipped decision tree, then Pr[ depth(T↾R_p) ≥ ℓ ] = O(pk·2^k)^ℓ. Proof: Pr[ depth(T↾R_p) ≥ ℓ ] ≤ Pr_{R_p}[ Pr_{β(T↾R_p)}[ #β(T↾R_p) ≥ ℓ ] ≥ 2^−ℓ ] ≤ 2^ℓ · Pr[ #β(T↾R_p) ≥ ℓ ] = 2^ℓ · Pr[ Bin(#β(T), p) ≥ ℓ ] (same as before)
225 k-clipped Decision Tree Switching Lemma: If T is a k-clipped decision tree, then Pr[ depth(T↾R_p) ≥ ℓ ] = O(pk·2^k)^ℓ. Proof: Pr[ depth(T↾R_p) ≥ ℓ ] ≤ Pr_{R_p}[ Pr_{β(T↾R_p)}[ #β(T↾R_p) ≥ ℓ ] ≥ 2^−ℓ ] ≤ 2^ℓ · Pr[ #β(T↾R_p) ≥ ℓ ] = 2^ℓ · Pr[ Bin(#β(T), p) ≥ ℓ ] ≤ 2^ℓ · Ex[ (#β(T) choose ℓ) · p^ℓ ] (union bound)
226 k-clipped Decision Tree Switching Lemma: If T is a k-clipped decision tree, then Pr[ depth(T↾R_p) ≥ ℓ ] = O(pk·2^k)^ℓ. Proof: Pr[ depth(T↾R_p) ≥ ℓ ] ≤ Pr_{R_p}[ Pr_{β(T↾R_p)}[ #β(T↾R_p) ≥ ℓ ] ≥ 2^−ℓ ] ≤ 2^ℓ · Pr[ #β(T↾R_p) ≥ ℓ ] = 2^ℓ · Pr[ Bin(#β(T), p) ≥ ℓ ] ≤ 2^ℓ · Ex[ (#β(T) choose ℓ) · p^ℓ ] ≤ 2^ℓ · p^ℓ · O(k·2^k)^ℓ (by the Lemma: if T is k-clipped, then Ex[ (#β(T) choose ℓ) ] ≤ O(k·2^k)^ℓ)
227 k-clipped Decision Tree Switching Lemma: If T is a k-clipped decision tree, then Pr[ depth(T↾R_p) ≥ ℓ ] = O(pk·2^k)^ℓ. Proof: Pr[ depth(T↾R_p) ≥ ℓ ] ≤ Pr_{R_p}[ Pr_{β(T↾R_p)}[ #β(T↾R_p) ≥ ℓ ] ≥ 2^−ℓ ] ≤ 2^ℓ · Pr[ #β(T↾R_p) ≥ ℓ ] = 2^ℓ · Pr[ Bin(#β(T), p) ≥ ℓ ] ≤ 2^ℓ · Ex[ (#β(T) choose ℓ) · p^ℓ ] (union bound) ≤ 2^ℓ · p^ℓ · O(k·2^k)^ℓ = O(pk·2^k)^ℓ. Q.E.D.
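The middle identity, Pr[ #β(T↾R_p) ≥ ℓ ] = Pr[ Bin(#β(T), p) ≥ ℓ ], can be sanity-checked in the simplest setting: for a complete tree that queries every variable along every branch, the restricted tree queries exactly the starred variables, so its depth is distributed as Bin(n, p). A minimal sketch, with hypothetical helper names of my own:

```python
import random

def restrict(tree, rho):
    """Apply a restriction rho (dict: var -> 0, 1, or '*') to a decision tree.
    Trees are ('leaf', bit) or ('node', var, subtree_if_0, subtree_if_1)."""
    if tree[0] == 'leaf':
        return tree
    _, v, t0, t1 = tree
    if rho[v] == 0:
        return restrict(t0, rho)
    if rho[v] == 1:
        return restrict(t1, rho)
    return ('node', v, restrict(t0, rho), restrict(t1, rho))

def depth(tree):
    if tree[0] == 'leaf':
        return 0
    return 1 + max(depth(tree[2]), depth(tree[3]))

def complete_tree(vars_):
    # Complete tree querying vars_ in order; leaf labels are irrelevant here.
    if not vars_:
        return ('leaf', 0)
    sub = complete_tree(vars_[1:])
    return ('node', vars_[0], sub, sub)

rng = random.Random(1)
p, n, trials = 0.3, 3, 50_000
T = complete_tree(list(range(n)))
total = 0
for _ in range(trials):
    # Sample R_p: each variable is '*' with probability p, else a uniform bit.
    rho = {i: '*' if rng.random() < p else rng.randrange(2) for i in range(n)}
    total += depth(restrict(T, rho))
avg = total / trials
print(round(avg, 2))  # expectation is n*p = 0.9, since depth here is Bin(n, p)
```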
228 A Different Approach: 1. Decision Tree Switching Lemma 2. k-clipped Decision Trees 3. Arbitrary Distribution of Stars 4. Switching Lemma for Affine Restrictions 5. Tseitin Expander Switching Lemma
229 Let S ⊆ [n] be an arbitrary distribution of stars
230 Let S ⊆ [n] be an arbitrary distribution of stars. Let u ∼ Unif({0,1}^n)
231 Let S ⊆ [n] be an arbitrary distribution of stars. Let u ∼ Unif({0,1}^n). Define random restriction ρ_{S,u} : [n] → {0,1,∗} by ρ_{S,u}(i) = ∗ if i ∈ S, u_i otherwise
232 Let S ⊆ [n] be an arbitrary distribution of stars. Let u ∼ Unif({0,1}^n). Define random restriction ρ_{S,u} : [n] → {0,1,∗} by ρ_{S,u}(i) = ∗ if i ∈ S, u_i otherwise. If S is a p-random subset of [n] (each i included independently with probability p), then ρ_{S,u} has the same distribution as R_p
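The identification of R_p with ρ_{S,u} for a p-random S can be checked by sampling both two-stage and one-stage and comparing the induced distribution over restrictions; for instance Pr[ρ = (∗, 1)] should equal p·(1−p)/2 under both. A small sketch (sampler names are mine):

```python
import random
from collections import Counter

rng = random.Random(2)
n, p, trials = 2, 0.4, 100_000

def sample_R_p():
    # R_p directly: each coordinate is '*' w.p. p, else a uniform bit.
    return tuple('*' if rng.random() < p else rng.randrange(2) for _ in range(n))

def sample_rho_S_u():
    # Two-stage view: first the star set S (p-random), then uniform u off S.
    S = {i for i in range(n) if rng.random() < p}
    u = [rng.randrange(2) for _ in range(n)]
    return tuple('*' if i in S else u[i] for i in range(n))

a = Counter(sample_R_p() for _ in range(trials))
b = Counter(sample_rho_S_u() for _ in range(trials))
# Pr[rho = ('*', 1)] = p * (1 - p) / 2 = 0.12 under both samplers
pa = a[('*', 1)] / trials
pb = b[('*', 1)] / trials
print(round(pa, 3), round(pb, 3))
```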
233 Let S ⊆ [n] be an arbitrary distribution of stars. Definition: For p ∈ [0,1], we say that S is p-bounded if Pr[ J ⊆ S ] ≤ p^{|J|} for every J ⊆ [n]
234 The following are equivalent: 1. β(T↾ρ_{S,u}) 2. β(T) ∩ S
235 The following are equivalent: 1. β(T↾ρ_{S,u}) 2. β(T) ∩ S [in fact, this equivalence is valid for each fixed S ∈ Supp(S)]
236 The following are equivalent: 1. β(T↾ρ_{S,u}) 2. β(T) ∩ S. If S is p-bounded, then Pr[ |β(T) ∩ S| ≥ ℓ ] ≤ Ex[ (#β(T) choose ℓ) ] · p^ℓ
237 The following are equivalent: 1. β(T↾ρ_{S,u}) 2. β(T) ∩ S. If S is p-bounded, then Pr[ |β(T) ∩ S| ≥ ℓ ] ≤ Ex[ (#β(T) choose ℓ) ] · p^ℓ [since Pr[ J ⊆ S ] ≤ p^ℓ for every J ⊆ [n] of size |J| = ℓ]
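The step to (#β(T) choose ℓ)·p^ℓ is a union bound over the (#β(T) choose ℓ) size-ℓ subsets J of the branch, each with Pr[ J ⊆ S ] ≤ p^ℓ. In the special case of a p-random S, where |β ∩ S| is exactly binomial, the bound can be verified numerically (helper names below are hypothetical):

```python
from math import comb

def binom_tail(m, p, l):
    # Pr[Bin(m, p) >= l], computed exactly.
    return sum(comb(m, j) * p**j * (1 - p)**(m - j) for j in range(l, m + 1))

def union_bound(m, p, l):
    # Union bound over the C(m, l) subsets J of a length-m branch with |J| = l,
    # each satisfying Pr[J subset of S] <= p^l for a p-bounded S.
    return comb(m, l) * p**l

for m in range(1, 12):
    for l in range(1, m + 1):
        assert binom_tail(m, 0.2, l) <= union_bound(m, 0.2, l) + 1e-12
print("union bound verified for m <= 11")
```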
238 Theorem (Switching Lemma for ρ_{S,u}): If S is p-bounded and T is a k-clipped DT, then Pr[ depth(T↾ρ_{S,u}) ≥ ℓ ] = O(pk·2^k)^ℓ
239 Theorem (Switching Lemma for ρ_{S,u}): If S is p-bounded and T is a k-clipped DT, then Pr[ depth(T↾ρ_{S,u}) ≥ ℓ ] = O(pk·2^k)^ℓ. Proof: Pr[ depth(T↾ρ_{S,u}) ≥ ℓ ] ≤ Pr_{S,u}[ Pr_{β(T↾ρ_{S,u})}[ #β(T↾ρ_{S,u}) ≥ ℓ ] ≥ 2^−ℓ ] ≤ 2^ℓ · Pr[ #β(T↾ρ_{S,u}) ≥ ℓ ] = 2^ℓ · Pr[ |β(T) ∩ S| ≥ ℓ ] ≤ 2^ℓ · Ex[ (#β(T) choose ℓ) ] · p^ℓ ≤ 2^ℓ · p^ℓ · O(k·2^k)^ℓ. Q.E.D.
240 A Different Approach: 1. Decision Tree Switching Lemma 2. k-clipped Decision Trees 3. Arbitrary Distribution of Stars 4. Switching Lemma for Affine Restrictions 5. Tseitin Expander Switching Lemma
241 Restrictions [n] → {0,1,∗} are in 1-1 correspondence with subcubes of {0,1}^n. What about the complexity of F↾A (e.g. for F a k-DNF) for a random affine subspace A ⊆ {0,1}^n?
242 Restrictions [n] → {0,1,∗} are in 1-1 correspondence with subcubes of {0,1}^n. What about the complexity of F↾A (e.g. for F a k-DNF) for a random affine subspace A ⊆ {0,1}^n? A = V + u where V is a linear subspace of {0,1}^n
243 Restrictions [n] → {0,1,∗} are in 1-1 correspondence with subcubes of {0,1}^n. What about the complexity of F↾A (e.g. for F a k-DNF) for a random affine subspace A ⊆ {0,1}^n? Affine Switching Lemma: If F is a k-DNF and A is a p-bounded random affine subset of {0,1}^n, then Pr[ DT_depth(F↾A) ≥ ℓ ] = O(pk·2^k)^ℓ
244 Applying an Affine Restriction. Consider an affine set A ⊆ {0,1}^n. For f : {0,1}^n → {0,1}, we define f↾A : A → {0,1}, and DT_depth(f↾A) = min depth of a decision tree T such that f(x) = T(x) for all x ∈ A. We also have a syntactic operation T ↦ T↾A on decision trees
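The semantic quantity DT_depth(f↾A) can be computed by brute force for tiny n, minimizing over which variable to query first. This sketch (the helper `dt_depth_on` is my own, not from the slides) uses PARITY and an example affine subspace given by one linear constraint, under which 3-bit parity collapses to a single query:

```python
from itertools import product

def dt_depth_on(A, f, vars_):
    """Min depth of a decision tree computing f on the points of A,
    by exhaustive search over the first queried variable."""
    values = {f(x) for x in A}
    if len(values) <= 1:
        return 0  # f is constant on A (or A is empty): no queries needed
    best = len(vars_)
    for i in vars_:
        A0 = [x for x in A if x[i] == 0]
        A1 = [x for x in A if x[i] == 1]
        rest = [j for j in vars_ if j != i]
        d = 1 + max(dt_depth_on(A0, f, rest), dt_depth_on(A1, f, rest))
        best = min(best, d)
    return best

n = 3
parity = lambda x: sum(x) % 2
cube = list(product([0, 1], repeat=n))
A = [x for x in cube if x[0] != x[2]]  # affine constraint x_1 + x_3 = 1 (0-indexed)
print(dt_depth_on(cube, parity, list(range(n))),
      dt_depth_on(A, parity, list(range(n))))  # parity: depth 3 on the cube, 1 on A
```

On A, x_1 + x_3 = 1 forces parity(x) = 1 + x_2 (mod 2), so one query suffices.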
245 Applying an Affine Restriction. Example: A = { x ∈ {0,1}^n : x_1 ≠ x_3 } [figure: a decision tree T and the restricted tree T↾A]
246 Let S ⊆ [n] be an arbitrary dist. of stars. Let u ∼ Unif({0,1}^n). Consider random restriction ρ_{S,u}(i) = ∗ if i ∈ S, u_i otherwise
247 Let S ⊆ [n] be an arbitrary dist. of stars. Let V be an arbitrary dist. on linear subspaces of {0,1}^n. Let u ∼ Unif({0,1}^n). Consider random restriction ρ_{S,u}(i) = ∗ if i ∈ S, u_i otherwise
248 Let S ⊆ [n] be an arbitrary dist. of stars. Let V be an arbitrary dist. on linear subspaces of {0,1}^n. Let u ∼ Unif({0,1}^n). Consider random restriction ρ_{S,u}(i) = ∗ if i ∈ S, u_i otherwise. Consider random affine space V + u
249 Let S ⊆ [n] be an arbitrary dist. of stars. Let V be an arbitrary dist. on linear subspaces of {0,1}^n. Let u ∼ Unif({0,1}^n). Consider random restriction ρ_{S,u}(i) = ∗ if i ∈ S, u_i otherwise. Consider random affine space V + u [V is the arrangement of stars]
250 Let S ⊆ [n] be an arbitrary dist. of stars. Let V be an arbitrary dist. on linear subspaces of {0,1}^n. Let u ∼ Unif({0,1}^n). Consider random restriction ρ_{S,u}(i) = ∗ if i ∈ S, u_i otherwise. Consider random affine space V + u [u gives the assignment to non-stars]
251 Let S ⊆ [n] be an arbitrary dist. of stars. Let V be an arbitrary dist. on linear subspaces of {0,1}^n. Let u ∼ Unif({0,1}^n). Consider random restriction ρ_{S,u}(i) = ∗ if i ∈ S, u_i otherwise. Consider random affine space V + u [{b_1, …, b_n} = standard basis for {0,1}^n: if V = Span{ b_i : i ∈ S }, then V + u is the subcube defined by ρ_{S,u}]
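The subcube correspondence is concrete enough to enumerate: for V spanned by the standard basis vectors indexed by S, the coset V + u is exactly the set of points agreeing with u off S. A self-contained check on a toy example (S and u chosen arbitrarily by me):

```python
from itertools import product

n = 4
S = {1, 3}          # star positions (0-indexed)
u = (0, 1, 1, 0)

# The subcube of rho_{S,u}: starred coordinates free, the rest fixed to u.
subcube = {x for x in product([0, 1], repeat=n)
           if all(x[i] == u[i] for i in range(n) if i not in S)}

def add(x, y):
    # Addition in {0,1}^n is coordinatewise XOR.
    return tuple(a ^ b for a, b in zip(x, y))

# V = Span{ b_i : i in S }: all subset-sums of the chosen basis vectors.
basis = [tuple(1 if j == i else 0 for j in range(n)) for i in S]
V = {(0,) * n}
for b in basis:
    V |= {add(v, b) for v in V}
affine = {add(v, u) for v in V}

print(affine == subcube, len(affine))  # the two sets coincide; size 2^|S| = 4
```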
252 Definition: A random set S ⊆ [n] is p-bounded if, for all J ⊆ [n], Pr[ J ⊆ S ] ≤ p^{|J|}. Definition: A random linear subspace V ⊆ {0,1}^n is p-bounded if, for all J ⊆ [n], Pr[ V shatters J ] ≤ p^{|J|}
253 Definition: A random set S ⊆ [n] is p-bounded if, for all J ⊆ [n], Pr[ J ⊆ S ] ≤ p^{|J|}. Definition: A random linear subspace V ⊆ {0,1}^n is p-bounded if, for all J ⊆ [n], Pr[ V shatters J ] ≤ p^{|J|} [V shatters J: for every I ⊆ J there exists v ∈ V such that v_i = 1 for all i ∈ I and v_j = 0 for all j ∈ J \ I]
254 Definition: A random set S ⊆ [n] is p-bounded if, for all J ⊆ [n], Pr[ J ⊆ S ] ≤ p^{|J|}. Definition: A random linear subspace V ⊆ {0,1}^n is p-bounded if, for all J ⊆ [n], Pr[ V shatters J ] ≤ p^{|J|} [V shatters J ⟺ J is independent in the matroid associated with V; shattering: for every I ⊆ J there exists v ∈ V such that v_i = 1 for all i ∈ I and v_j = 0 for all j ∈ J \ I]
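The shattering condition says exactly that projecting V onto the coordinates of J hits every 0/1 pattern on J (the pattern is the indicator of I, with no constraint outside J). That makes it easy to test by enumeration for small n; the helper names here are my own:

```python
from itertools import product

def span(gens, n):
    # All F2 linear combinations of the generator vectors.
    V = {(0,) * n}
    for g in gens:
        V |= {tuple(a ^ b for a, b in zip(v, g)) for v in V}
    return V

def shatters(V, J):
    # V shatters J: for every I subset of J, some v in V has v_i = 1 on I
    # and v_j = 0 on J \ I.  Equivalently, the projection of V onto the
    # coordinates of J realizes every 0/1 pattern on J.
    J = list(J)
    return {tuple(v[j] for j in J) for v in V} == set(product([0, 1], repeat=len(J)))

n = 3
V = span([(1, 1, 0), (0, 0, 1)], n)
print(shatters(V, {0, 2}), shatters(V, {0, 1}))  # True False
```

Here {0, 1} is not shattered because every v ∈ V has v_0 = v_1, so the pattern (1, 0) never appears.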
255 Definition: A random set S ⊆ [n] is p-bounded if, for all J ⊆ [n], Pr[ J ⊆ S ] ≤ p^{|J|}. Definition: A random linear subspace V ⊆ {0,1}^n is p-bounded if, for all J ⊆ [n], Pr[ V shatters J ] ≤ p^{|J|} [the two definitions are equivalent if V = Span{ b_i : i ∈ S }]
256 Suppose A = V + u where V is a p-bounded random linear subspace of {0,1}^n and u ∼ Unif({0,1}^n). Theorem: For every k-clipped decision tree T, Pr[ depth(T↾A) ≥ ℓ ] = O(pk·2^k)^ℓ [essentially the same proof as the Switching Lemma for ρ_{S,u}]
257 Suppose A = V + u where V is a p-bounded random linear subspace of {0,1}^n and u ∼ Unif({0,1}^n). Theorem: For every k-clipped decision tree T, Pr[ depth(T↾A) ≥ ℓ ] = O(pk·2^k)^ℓ. Corollary (Affine Switching Lemma): For every k-DNF F, Pr[ DT_depth(F↾A) ≥ ℓ ] = O(pk·2^k)^ℓ
258 Suppose A = V + u where V is a p-bounded random linear subspace of {0,1}^n and u ∼ Unif({0,1}^n). Theorem: For every k-clipped decision tree T, Pr[ depth(T↾A) ≥ ℓ ] = O(pk·2^k)^ℓ. Corollary (Affine Switching Lemma): For every k-DNF F, Pr[ DT_depth(F↾A) ≥ ℓ ] = O(pk·2^k)^ℓ. Conjecture: O(pk)^ℓ
259 For every fixed V (and random u ∼ Unif({0,1}^n)), the following are equivalent: 1. β(T↾(V + u)), of size #β(T↾(V + u)) 2. the greedy basis of β(T) in the V-matroid, of size rank_V(β(T))
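On my reading of this slide, the length of the restricted branch equals rank_V(β(T)), which can be computed by greedily scanning the branch and keeping each variable that preserves independence (shattering) in the V-matroid. A sketch with my own helper names:

```python
from itertools import product

def span(gens, n):
    # All F2 linear combinations of the generators.
    V = {(0,) * n}
    for g in gens:
        V |= {tuple(a ^ b for a, b in zip(v, g)) for v in V}
    return V

def shatters(V, J):
    # J is independent in the V-matroid iff the projection of V onto the
    # coordinates of J realizes every 0/1 pattern on J.
    J = list(J)
    return len({tuple(v[j] for j in J) for v in V}) == 2 ** len(J)

def greedy_basis(V, beta):
    # Scan the branch's variables in order, keeping each one that preserves
    # independence; the result is a maximal independent subset of beta,
    # of size rank_V(beta).
    basis = []
    for i in beta:
        if shatters(V, basis + [i]):
            basis.append(i)
    return basis

n = 4
V = span([(1, 1, 0, 0), (0, 0, 1, 1)], n)
beta = [0, 1, 2, 3]           # variables queried along a branch of T
print(greedy_basis(V, beta))  # [0, 2]: rank_V(beta) = 2
```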
260 Suppose A = V + u where V is a p-bounded random linear subspace of {0,1}^n, u ∼ Unif({0,1}^n). Final generalization: Replace the ambient space {0,1}^n with an arbitrary affine space B
261 Suppose A = V + u where V is a p-bounded random linear subspace of B, u ∼ Unif(B). Final generalization: Replace the ambient space {0,1}^n with an arbitrary affine space B
262 Suppose A = V + u where V is a p-bounded random linear subspace of B, u ∼ Unif(B). A is a random affine subspace of B
263 Suppose A = V + u where V is a p-bounded random linear subspace of B, u ∼ Unif(B). Theorem: For every B-independent k-clipped decision tree T, Pr[ depth(T↾A) ≥ ℓ ] = O(pk·2^k)^ℓ
264 Suppose A = V + u where V is a p-bounded random linear subspace of B, u ∼ Unif(B). Theorem: For every B-independent k-clipped decision tree T, Pr[ depth(T↾A) ≥ ℓ ] = O(pk·2^k)^ℓ [B-independent: every branch β ⊆ [n] of T is shattered by B, i.e. β is independent in the matroid associated with Lin(B) = { x + y : x, y ∈ B }]
265 A Different Approach: 1. Decision Tree Switching Lemma 2. k-clipped Decision Trees 3. Arbitrary Distribution of Stars 4. Switching Lemma for Affine Restrictions 5. Tseitin Expander Switching Lemma
266 Tseitin Expander Switching Lemma. Theorem (Pitassi-R.-Servedio-Tan 16): Depth-d AC^0-Frege proofs of Tseitin(3-regular expander) require size exp(Ω((log n / d)^2)). Theorem (Beame-Pitassi-Impagliazzo 93, Ben-Sasson 02): Depth-d AC^0-Frege proofs of Tseitin(3-regular expander) require size exp(Ω(n^{1/2^d}))
267 Tseitin Expander Switching Lemma. Theorem (Pitassi-R.-Servedio-Tan 16): Depth-d AC^0-Frege proofs of Tseitin(3-regular expander) require size exp(Ω((log n / d)^2)). Theorem (Beame-Pitassi-Impagliazzo 93, Ben-Sasson 02): Depth-d AC^0-Frege proofs of Tseitin(3-regular expander) require size exp(Ω(n^{1/2^d})) [Improving the AFFINE SWITCHING LEMMA would yield exp(Ω(n^{1/d}))]
268 Suppose A = V + u where V is a p-bounded random linear subspace of {0,1}^n, u ∼ Unif({0,1}^n). Affine Switching Lemma for k-DNFs: For every k-DNF F, Pr[ DT_depth(F↾A) ≥ ℓ ] = O(pk·2^k)^ℓ. Conjecture: O(pk)^ℓ
269 Stay Tuned. Toni Pitassi: Expander S.L. & AC^0-Frege lower bound. Johan Håstad, Avishay Tal: Correlation bounds from improved switching lemmas. Johan Håstad, Rocco Servedio: Random projections, depth hierarchy theorem. Srikanth Srinivasan: Adaptive random restrictions (against AC^0 with few threshold gates)
270 Thanks!
More informationClique is hard on average for regular resolution
Clique is hard on average for regular resolution Ilario Bonacina, UPC Barcelona Tech July 27, 2018 Oxford Complexity Day Talk based on a joint work with: A. Atserias S. de Rezende J. Nordstro m M. Lauria
More informationLower Bounds for Bounded Depth Frege Proofs via Buss-Pudlák Games
Lower Bounds for Bounded Depth Frege Proofs via Buss-Pudlák Games Eli Ben-Sasson Prahladh Harsha Abstract We present a simple proof of the bounded-depth Frege lower bounds of Pitassi et. al. and Krajíček
More informationUpper and Lower Bounds for Tree-like. Cutting Planes Proofs. Abstract. In this paper we study the complexity of Cutting Planes (CP) refutations, and
Upper and Lower Bounds for Tree-like Cutting Planes Proofs Russell Impagliazzo Toniann Pitassi y Alasdair Urquhart z UCSD UCSD and U. of Pittsburgh University of Toronto Abstract In this paper we study
More informationDegree and Sensitivity: tails of two distributions *
Electronic Colloquium on Computational Complexity, Report No. 69 (2016) Degree and Sensitivity: tails of two distributions * Parikshit Gopalan Microsoft Research parik@microsoft.com Rocco A. Servedio Columbia
More informationBias/variance tradeoff, Model assessment and selec+on
Applied induc+ve learning Bias/variance tradeoff, Model assessment and selec+on Pierre Geurts Department of Electrical Engineering and Computer Science University of Liège October 29, 2012 1 Supervised
More information25 Minimum bandwidth: Approximation via volume respecting embeddings
25 Minimum bandwidth: Approximation via volume respecting embeddings We continue the study of Volume respecting embeddings. In the last lecture, we motivated the use of volume respecting embeddings by
More informationAn Introduction to Randomized algorithms
An Introduction to Randomized algorithms C.R. Subramanian The Institute of Mathematical Sciences, Chennai. Expository talk presented at the Research Promotion Workshop on Introduction to Geometric and
More informationQuantum boolean functions
Quantum boolean functions Ashley Montanaro 1 and Tobias Osborne 2 1 Department of Computer Science 2 Department of Mathematics University of Bristol Royal Holloway, University of London Bristol, UK London,
More information