Additive Combinatorics and Computational Complexity

Similar documents
Pseudorandomness in Computer Science and in Additive Combinatorics. Luca Trevisan University of California, Berkeley

Dense Subsets of Pseudorandom Sets

A New Proof of a Theorem of Green, Tao, and Ziegler

Regularity, Boosting, and Efficiently Simulating Every High-Entropy Distribution

The Green-Tao theorem and a relative Szemerédi theorem

Szemerédi s regularity lemma revisited. Lewis Memorial Lecture March 14, Terence Tao (UCLA)

The Green-Tao theorem and a relative Szemerédi theorem

Lecture 3: Randomness in Computation

Last time, we described a pseudorandom generator that stretched its truly random input by one. If f is ( 1 2

The dichotomy between structure and randomness. International Congress of Mathematicians, Aug Terence Tao (UCLA)

The Green-Tao Theorem on arithmetic progressions within the primes. Thomas Bloom

The sum of d small-bias generators fools polynomials of degree d

Roth s Theorem on 3-term Arithmetic Progressions

Applications of model theory in extremal graph combinatorics

Selected Results in Additive Combinatorics: An Exposition

Szemerédi s Lemma for the Analyst

Arithmetic progressions in primes

Erdős and arithmetic progressions

Metric Pseudoentropy: Characterizations and Applications

1 Cryptographic hash functions

Higher-order Fourier analysis of F n p and the complexity of systems of linear forms

1.1 Szemerédi s Regularity Lemma

1 Cryptographic hash functions

On Pseudorandomness w.r.t Deterministic Observers

On Uniform Amplification of Hardness in NP

Roth s Theorem on Arithmetic Progressions

COS598D Lecture 3 Pseudorandom generators from one-way functions

Polynomial Identity Testing and Circuit Lower Bounds

On explicit Ramsey graphs and estimates of the number of sums and products

Quadratic Goldreich-Levin Theorems

CS294: Pseudorandomness and Combinatorial Constructions September 13, Notes for Lecture 5

Pseudorandom Generators

Pseudorandomness and combinatorial constructions

The power and weakness of randomness (when you are short on time) Avi Wigderson Institute for Advanced Study

Extractors using hardness amplification

Notes for Lecture 7. 1 Increasing the Stretch of Pseudorandom Generators

PSEUDORANDOMNESS AND AVERAGE-CASE COMPLEXITY VIA UNIFORM REDUCTIONS

Ramsey theory and the geometry of Banach spaces

Lecture 21: P vs BPP 2

CS294: PCP and Hardness of Approximation January 25, Notes for Lecture 3

Notes for Lecture 18

Minentropy and its Variations for Cryptography

Lecture 12: Constant Degree Lossless Expanders

Norms, XOR lemmas, and lower bounds for GF(2) polynomials and multiparty protocols

arxiv: v1 [math.nt] 14 Sep 2017

Geometric Complexity and Applications CIS 6930 August 26 - September 4, Lecture 1 through 4

Pseudorandomness and Combinatorial Constructions

Limits at Infinity. Horizontal Asymptotes. Definition (Limits at Infinity) Horizontal Asymptotes

The Complexity of the Matroid-Greedoid Partition Problem

On a certain generalization of the Balog-Szemeredi-Gowers theorem

Additive Combinatorics and Szemerédi s Regularity Lemma

THE COMPLEXITY OF CONSTRUCTING PSEUDORANDOM GENERATORS FROM HARD FUNCTIONS

COMPRESSION OF SAMPLABLE SOURCES

1 Introduction The relation between randomized computations with one-sided error and randomized computations with two-sided error is one of the most i

Cycle lengths in sparse graphs

Lecture 5: Derandomization (Part II)

The regularity method and blow-up lemmas for sparse graphs

Average Case Complexity: Levin s Theory

Where do pseudo-random generators come from?

Efficiency Improvements in Constructing Pseudorandom Generators from One-way Functions

Dept of Math., SCU+USTC

On two-point configurations in subsets of pseudo-random sets

Extractors and Pseudorandom Generators

Average-Case Complexity

MTH310 EXAM 2 REVIEW

+ 1 3 x2 2x x3 + 3x 2 + 0x x x2 2x + 3 4

HARDNESS AMPLIFICATION VIA SPACE-EFFICIENT DIRECT PRODUCTS

Szemerédi s Theorem via Ergodic Theory

Worst-Case to Average-Case Reductions Revisited

Towards NEXP versus BPP?

Pseudorandom Generators and Typically-Correct Derandomization

Expander Construction in VNC 1

The Rainbow Turán Problem for Even Cycles

Algorithmic regularity for polynomials and applications

A Noisy-Influence Regularity Lemma for Boolean Functions Chris Jones

On some inequalities between prime numbers

REGULARITY LEMMAS FOR GRAPHS

1 Distributional problems

On an Extremal Hypergraph Problem of Brown, Erdős and Sós

CS151 Complexity Theory. Lecture 9 May 1, 2017

6.080 / Great Ideas in Theoretical Computer Science Spring 2008

On the Power of the Randomized Iterate

From Affine to Two-Source Extractors via Approximate Duality

Computational Entropy and Information Leakage

Model-theoretic distality and incidence combinatorics

Hardness amplification proofs require majority

1 Randomized Computation

Approximation Resistance from Pairwise Independent Subgroups

Additive Combinatorics

A talk given at the Institute of Math., Chinese Academy of Sciences ( ).

Uniform Derandomization

Near-Optimal Algorithms for Maximum Constraint Satisfaction Problems

Simple extractors via crypto pseudo-random generators

Extracting Kolmogorov Complexity with Applications to Dimension Zero-One Laws

Lecture 2: Program Obfuscation - II April 1, 2009

Computer Science Dept.

Least singular value of random matrices. Lewis Memorial Lecture / DIMACS minicourse March 18, Terence Tao (UCLA)

Removal Lemmas with Polynomial Bounds

Higher-order Fourier Analysis and Applications

Transcription:

Additive Combinatorics and Computational Complexity Luca Trevisan U.C. Berkeley Ongoing joint work with Omer Reingold, Madhur Tulsiani, Salil Vadhan

Combinatorics: Studies: Graphs, hypergraphs, set systems Considers: cuts, subgraphs, intersections Typical question: max number of edges in triangle free graph Additive combinatorics Studies: subsets of abelian groups (e.g. primes) Considers: properties that can be described with linear equations Typical question: largest subset of integers in {1,...,N} with no length-3 arithmetic progression

Additive number theory Questions about primes which can be formulated as systems of linear equations Does y=x+2 have infinitely many solutions in primes? Does x+y=2n have solutions in primes for all n>2? Are there infinitely many non-trivial solutions in primes to x-y=y-z y-z=z-w

Additive number theory Questions about primes which can be formulated as systems of linear equations Some questions better approached in broader generality, separating combinatorics and number theory 1. Under what conditions does a set of integers possess a desired property? 2. Do the primes satisfy such conditions? [number theory needed only here]

Additive combinatorics has seen a confluence of Graph and hypergraph theory Analysis Ergodic theory Techniques have been applied in theoretical computer science to Property testing Pseudorandomness and derandomization PCP Lower bounds...

Additive combinatorics Graph and hypergraph theory Regularity Lemmas -> property testing [Alon-...] Analysis Regularity Lemmas in groups -> PCP [Khot-O Donnell] Gowers-Balog-Szemeredi Thm -> low-degree testing [Samorodnitsky] Gowers Uniformity -> PCP [Samorodnitsky-T], -> pseudorandomness [Bogdanov-Viola] -> direct product thms [Viola-Wigderson],... Ergodic theory???

Arithmetic Combinatorics Subsets of rings, properties defined via addition and multiplication Sum-product Theorems Applications to Computer Science: Expanders, extractors [Barak-Impagliazzo-Wigderson,...]

Additive (and arithmetic) Combinatorics What do the very different techniques have in common? Randomness versus structure [Terence Tao, ICM 2006 Lecture] Why so useful in (theoretical) computer science??? In different language, notions of pseudorandomness, indistinguishability, property testing, compact representations (sketches), approximation, worst-case analysis,...

Additive (and arithmetic) Combinatorics http://www.cs.princeton.edu/theory/index.php/ Main/AdditiveCombinatoricsMinicourse [Barak, Charikar, T., Wigderson] Google: additive combinatorics princeton

In this talk The structure of the proof of the Green-Tao Theorem A look at a key combinatorial step Complexity-theoretic interpretation New complexity-theoretic proofs new results in complexity theory

Green-Tao (2004) The primes contain arbitrarily long arithmetic progressions 199, 409, 619, 829, 1039, 1249, 1459, 1669, 1879, 2089

Green-Tao (2004) The primes contain arbitrarily long arithmetic progressions Previously, two lines of attack: The primes are dense Erdos (conj): all sets of density 1/ln N have long APs The primes are pseudorandom Hardy-Littlewood (conj): primes satisfy linear systems as often as random sets of density 1/ln N

Density Erdos (conj): if A is set of integers such that Σ1/a diverges, then A has long APs [~equivalent: if A has density 1/ln N, then long APs] Erdos-Turan (conj, 1940s): if A has constant density, then A has long APs [Szemeredi, 1970s] density 1/log* log* N [Furstenberg, 1970s] density o(1) [Gowers, 1998-01] density 1/loglog N

Pseudorandomness Hardy-Littlewood: take a system of linear equations e.g. y-x = z-y z-y = w-z Then approximately as many solutions in primes as in a random set of density 1/ln N Twin primes conjecture: y-x=2

Pseudorandomness Hopeless for now Strong results for almost primes: integers with few, large, factors E.g.: y-x =2 has infinitely many solutions, x prime, y at most two factors

Green-Tao Idea Pseudorandomness works only for almost primes Density works only for o(1) density Combine the two

Green-Tao Proof 1. Thm: primes have const density in almost primes 2. Thm: almost primes are pseudorandom 3. Thm: [Main] if R is pseudorandom and D has constant density in R, then there is M of constant density indistinguishable from D 4. Thm: [Szemeredi] M has long APs 5. so does D

Thm There is a pseudorandom set of integers R s.t. a constant fraction of R are primes {1,...,N } Primes

Thm There is a pseudorandom set of integers R s.t. a constant fraction of R are primes {1,...,N } R Primes

If R is pseudorandom and D is subset, D > δ R, then there is a model M indistinguishable from D, M > δn {1,...,N } R Primes M

By Szemeredi's Thm, M contains k-ap Because of indistinguishability, so does D {1,...,N } 23 11 17 Primes 23 35 47 M

Key step Dense Model Theorem: If R pseudorandom in X and D subset of R has size > δ R Then there is M of size > δ X that is indistinguishable from D

Key step Dense Model Theorem: If R is pseudorandom distribution in X and for all x, D(x) < R(x)/ δ Then there is M that is indistinguishable from D and for all x, M(x) < 1/ δ X

X R

R D X

R D X M

The Green-Tao(-Ziegler) proof X R D

The Green-Tao(-Ziegler) proof X D

The Green-Tao(-Ziegler) proof X D

The Green-Tao(-Ziegler) proof X D

Find partition of X: i) blocks are large and easily recognizable; ii) D is indistinguishable from i. - Sample z ~ D ii. - Let B be block of z iii. - Sample x ~ B Above distribution is dense: Pr [ x is sampled ] = Pr [ D in B x ] / B x Pr [ D in B x ] < Pr [ R in B x ] / δ Pr[ R in B x ] < B x / X + ε Pr [ x is sampled ] < 1/δ X + ε /δ B x

Find partition of X: (i) blocks are large and easily recognizable; (ii) D is indistinguishable from: - Sample z ~ D - Let B be block of z - Sample x ~ B To find partition - Start from trivial partition - While distribution distinguishable from D - refine partition using level sets of distinguisher Potential function argument: - only need poly(1/ε,1/δ) refinement steps - upper bound on potential function uses pseudorandomness of R

Green-Tao(-Ziegler) Proof Applies to computational setting R poly(ε,δ)-pseudorandom in X against circuits of size O(S*exp(1/δ,1/ε)) D is δ-dense in R Then there is M that is ε-indistinguishable from D by circuits of size S Exponential loss: partition has exp many pieces

New Proof [Reingold, T., Tulsiani, Vadhan, FOCS 08] R poly(ε,δ)-pseudorandom in X against circuits of size O(S*poly(1/δ,1/ε)) D is δ-dense in R Then there is M that is ε-indistinguishable from D by circuits of size S Proof using linear programming duality Same idea independently [Gowers, 2008]

New Proof [Reingold, T., Tulsiani, Vadhan, FOCS 08] As noted by Russell Impagliazzo, can also prove: Suppose D is such that for every f of size S*poly(1/ δ,1/ε) Pr[f(D)=1] < Pr[f(U)] /δ + poly(ε) Then there is M that is δ dense and ε-indistinguishable from D by circuits of size S

Consequence of New Proof [Reingold, T., Tulsiani, Vadhan, FOCS 08] The following 3 properties of D are equivalent: 1. Domination by pseudorandom distribution there is R indistinguishable from U such that for all x, D(x) < R(x)/δδ 2. Pseudodensity for all efficient f Pr[f(D)] < Pr[f(U)]/δ + ε 3. Pseudoentropy (defined by Hastad-Impagliazzo-Luby-Levin) there is M indistinguishable from D such that for all x, M(x) < U(x)/ δ = 1/(δ X )

Consequence of New Proof Suppose G: {0,1} 512 -> {0,1} 100,000 is a cryptographic pseudorandom generator D is a distribution of seeds of entropy 510 (e.g. two digits are fixed) Then there is a distribution M of entropy 99,998 which is indistinguishable from G(D)

New Proof [Reingold, T., Tulsiani, Vadhan, FOCS 08; Gowers 2008] Suppose Dense Model Theorem is false Then there is D dense in pseudorandom R such that for every dense model M there is a distinguisher Then there is a distinguisher that works against all models It can be used to distinguish R from Uniform Contradiction

Comparison with Previous Proof [Reingold, T., Tulsiani, Vadhan, FOCS 08; Gowers 2008] Leads to computational application with polynomial parameters Green-Tao-Ziegler has exponential loss The model is shown to exist non-constructively Green-Tao-Ziegler exhibits model Best of two worlds?

Comparison with Previous Proof Impagliazzo Hard Core Lemma Dense Model Theorem Min-max Iterative partitioning boosting

The ImpagliazzoHard-Core Lemma Suppose g: {0,1} n -> {0,1} is weakly hard-on-average: for all f of size S*poly(1/ε,1/δ) Pr[ f(x)=g(x)] < 1- δ Then there is dense distribution on which g is strongly hard-on-average: For all x, H(x) < 1/ δ 2 n is such that Pr D [f(x) = g(x) ] < ½ + ε for all f of size S

Comparison with Previous Proof Impagliazzo Hard Core Lemma Dense Model Theorem Min-max Iterative partitioning boosting

Comparison with Previous Proof Impagliazzo Hard Core Lemma Dense Model Theorem Min-max Iterative partitioning boosting

Comparison with Previous Proof Impagliazzo Hard Core Lemma Dense Model Theorem Min-max Iterative partitioning boosting

Comparison with Previous Proof Non-constructive Dense Model Theorem Impagliazzo Hard Core Lemma Min-max Constructive Dense Model Theorem Iterative partitioning boosting

Ongoing work by T., Tulsiani, Vadhan

Comparison with Previous Proof Non-constructive Dense Model Theorem Impagliazzo Hard Core Lemma Min-max Constructive Dense Model Theorem Iterative partitioning boosting

Comparison with Previous Proof Non-constructive Dense Model Theorem Impagliazzo Hard Core Lemma Min-max Constructive Dense Model Theorem Iterative partitioning boosting Generalized Szemeredi Regularity Lemma

Comparison with Previous Proof Non-constructive Dense Model Theorem Impagliazzo Hard Core Lemma Min-max Constructive Dense Model Theorem Iterative partitioning boosting Generalized Szemeredi Regularity Lemma

Comparison with Previous Proof Constructive Dense Model Theorem poly complexity Non-constructive Dense Model Theorem Poly complexity Impagliazzo Hard Core Lemma Min-max Constructive Dense Model Theorem Exp complexity Iterative partitioning boosting Generalized Szemeredi Regularity Lemma Poly complexity Generalized Szemeredi Regularity Lemma Exponential complexity

Comparison with Previous Proof Constructive Dense Model Theorem poly complexity Non-constructive Dense Model Theorem Poly complexity Impagliazzo Hard Core Lemma Min-max Constructive Dense Model Theorem Exp complexity Iterative partitioning boosting Generalized Szemeredi Regularity Lemma Poly complexity Generalized Szemeredi Regularity Lemma Exponential complexity

G Regularity Lemma

Regularity Lemma G G G Given: dense graph G, approximation ε Find partition of vertices into Oε(1) blocks G is indistinguishable by cuts from graph G that has weighted complete bipartite subgraph between blocks

Abstract Version Finite space X [edges of complete graph] g: X: -> [0,1] [graph] ε F={ f }, f: X -> [0,1] [char functions of cuts] Can find approximating function A(x) = h(f 1 (x),,f k (x)), k = poly(1/ε), f i () in F Such that for all f in F E f(x)g(x) E f(x)a(x) = < ε

Abstract Version Finite space X [edges of complete graph] g: X: -> [0,1] [graph] ε F={ f }, f: X -> [0,1] [char functions of cuts] Can find approximating function A(x) = h(f 1 (x),,f k (x)), k = poly(1/ε), f i () in F in standard proof, h has exp(1/ε) complexity Such that for all f in F E f(x)g(x) E f(x)a(x) = <f,g-a> < ε

Abstract Version Finite space X [edges of complete graph] g: X: -> [0,1] [graph] ε F={ f }, f: X -> [0,1] [char functions of cuts] Can find approximating function A(x) = h(f 1 (x),,f k (x)), k = poly(1/ε), f i () in F in new boosting proof, h has poly(1/ε) complexity Such that for all f in F E f(x)g(x) E f(x)a(x) = <f,g-a> < ε

Rephrasing Finite space X = {0,1} n Given distribution D such that D(x) < 1/δ2 n (min-entropy > n log 1/δ) There is samplable distribution M: Also M(x) < 1/δ2 n M is (S,ε) indistinguishable from D Samplable (and computable) by circuits of size S*poly(1/ε, 1/δ)

Derive the Hard Core Lemma Given: g weakly hard on average Find efficient approximation A of g Easy to prove: g is very hard on average on distribution in which point x has probability proportional to A(x) g(x) Also easy to prove direct product lemma, Yao s XOR lemma, and other complexity result from polynomial complexity regularity lemma

Constructive Dense Model Theorem poly complexity Non-constructive Dense Model Theorem Poly complexity Impagliazzo Hard Core Lemma Min-max Constructive Dense Model Theorem Exp complexity Iterative partitioning boosting Generalized Szemeredi Regularity Lemma Poly complexity Generalized Szemeredi Regularity Lemma Exponential complexity

Returning to big picture Additive combinatorics Graph theory regularity lemma Analysis Gowers Uniformity Graph Property Testing Pseudorandomness Direct Product Theorems PCP Ergodic Theory??? Finitary Ergodic Theory??

Returning to the big picture Additive combinatorics Graph theory regularity lemma Analysis Gowers Uniformity Ergodic Theory??? Finitary Ergodic Theory?? Graph Property Testing Pseudorandomness Direct Product Theorems PCP Average-case complexity pseudorandomness