Lecture 5: Hyperbolic polynomials

Similar documents
Lecture 3: Boolean variables and the strong Rayleigh property

Lecture 4: Applications: random trees, determinantal measures and sampling

Limits at Infinity. Horizontal Asymptotes. Definition (Limits at Infinity) Horizontal Asymptotes

The Geometry of Polynomials and Applications

CONCENTRATION OF THE MIXED DISCRIMINANT OF WELL-CONDITIONED MATRICES. Alexander Barvinok

Largest dual ellipsoids inscribed in dual cones

Systems of distinct representatives/1

NOTES ON HYPERBOLICITY CONES

Hyperbolicity and stable polynomials in combinatorics and probability

Supplement: Universal Self-Concordant Barrier Functions

Stability, hyperbolicity, and zero localization of functions

ON THE ARITHMETIC-GEOMETRIC MEAN INEQUALITY AND ITS RELATIONSHIP TO LINEAR PROGRAMMING, BAHMAN KALANTARI

HYPERBOLICITY CONES AND IMAGINARY PROJECTIONS

HANDOUT ABOUT THE TABLE OF SIGNS METHOD. The method To find the sign of a rational (including polynomial) function

1 Strict local optimality in unconstrained optimization

arxiv: v1 [math.co] 9 Sep 2007

Math 115 Spring 11 Written Homework 10 Solutions

Integer-Valued Polynomials

UNIFORM BOUNDS FOR BESSEL FUNCTIONS

Analogs of Hodge Riemann relations

1 Directional Derivatives and Differentiability

Real Stable Polynomials: Description and Application

Chapter 1. Preliminaries. The purpose of this chapter is to provide some basic background information. Linear Space. Hilbert Space.

x 2 + 6x 18 x + 2 Name: Class: Date: 1. Find the coordinates of the local extreme of the function y = x 2 4 x.

We simply compute: for v = x i e i, bilinearity of B implies that Q B (v) = B(v, v) is given by xi x j B(e i, e j ) =

arxiv: v1 [cs.it] 3 Nov 2009

Lecture 7: Positive Semidefinite Matrices

Math 350 Fall 2011 Notes about inner product spaces. In this notes we state and prove some important properties of inner product spaces.

MAT 257, Handout 13: December 5-7, 2011.

= lim. (1 + h) 1 = lim. = lim. = lim = 1 2. lim

Lecture 5: Lowerbound on the Permanent and Application to TSP

Scattered Data Interpolation with Polynomial Precision and Conditionally Positive Definite Functions

Lecture 1 INF-MAT3350/ : Some Tridiagonal Matrix Problems

Mathematics 530. Practice Problems. n + 1 }

Trinity Christian School Curriculum Guide

Lecture 1: Entropy, convexity, and matrix scaling CSE 599S: Entropy optimality, Winter 2016 Instructor: James R. Lee Last updated: January 24, 2016

A Criterion for the Stochasticity of Matrices with Specified Order Relations

arxiv: v1 [math.co] 13 Oct 2014

HW1 solutions. 1. α Ef(x) β, where Ef(x) is the expected value of f(x), i.e., Ef(x) = n. i=1 p if(a i ). (The function f : R R is given.

Counting and Gröbner Bases

Lecture 5 : Projections

THE INVERSE FUNCTION THEOREM

= W z1 + W z2 and W z1 z 2

In English, this means that if we travel on a straight line between any two points in C, then we never leave C.

CPSC 536N: Randomized Algorithms Term 2. Lecture 9

Welsh s problem on the number of bases of matroids

Finite-dimensional spaces. C n is the space of n-tuples x = (x 1,..., x n ) of complex numbers. It is a Hilbert space with the inner product

Elements of Positive Definite Kernel and Reproducing Kernel Hilbert Space

Smale s Mean Value Conjecture. and Related Problems. The University of Hong Kong

ARCS IN FINITE PROJECTIVE SPACES. Basic objects and definitions

Schur s Triangularization Theorem. Math 422

Chebyshev coordinates and Salem numbers

b jσ(j), Keywords: Decomposable numerical range, principal character AMS Subject Classification: 15A60

LMI MODELLING 4. CONVEX LMI MODELLING. Didier HENRION. LAAS-CNRS Toulouse, FR Czech Tech Univ Prague, CZ. Universidad de Valladolid, SP March 2009

Algebra II Vocabulary Alphabetical Listing. Absolute Maximum: The highest point over the entire domain of a function or relation.

1 Lyapunov theory of stability

MIT Algebraic techniques and semidefinite optimization February 14, Lecture 3

Unbounded Convex Semialgebraic Sets as Spectrahedral Shadows

On the linear differential equations whose general solutions are elementary entire functions

The proximal mapping

TORIC REDUCTION AND TROPICAL GEOMETRY A.

Linear Algebra. Workbook

ALGEBRAIC GEOMETRY HOMEWORK 3

Solutions to Exercises, Section 2.5

arxiv:math/ v2 [math.oc] 25 May 2004

LECTURE 5: THE METHOD OF STATIONARY PHASE

A NICE PROOF OF FARKAS LEMMA

Competitive and Cooperative Differential Equations

Lecture 2 Linear Codes

Preliminaries Lectures. Dr. Abdulla Eid. Department of Mathematics MATHS 101: Calculus I

Course Description - Master in of Mathematics Comprehensive exam& Thesis Tracks

Course Notes for EE227C (Spring 2018): Convex Optimization and Approximation

On a Class of Multidimensional Optimal Transportation Problems

A PREDICTOR-CORRECTOR PATH-FOLLOWING ALGORITHM FOR SYMMETRIC OPTIMIZATION BASED ON DARVAY'S TECHNIQUE

Zero sum games Proving the vn theorem. Zero sum games. Roberto Lucchetti. Politecnico di Milano

Chapter 3 Second Order Linear Equations

The reverse Hlawka inequality in a Minkowski space

Lecture 6. Numerical methods. Approximation of functions

REAL-ROOTED POLYNOMIALS IN COMBINATORICS. Mirkó Visontai A DISSERTATION. Mathematics. Presented to the Faculties of the University of Pennsylvania

Convex Optimization & Parsimony of L p-balls representation

Lecture 5. Theorems of Alternatives and Self-Dual Embedding

Matrix rigidity and elimination theory

Symmetric Matrices and Eigendecomposition

On self-concordant barriers for generalized power cones

DISCRIMINANTS, SYMMETRIZED GRAPH MONOMIALS, AND SUMS OF SQUARES

The extreme rays of the 5 5 copositive cone

The final is cumulative, but with more emphasis on chapters 3 and 4. There will be two parts.

Lecture 2: Stable Polynomials

Some stochastic inequalities for weighted sums

Math 4242 Fall 2016 (Darij Grinberg): homework set 8 due: Wed, 14 Dec b a. Here is the algorithm for diagonalizing a matrix we did in class:

Energy method for wave equations

An injection from standard fillings to parking functions

Course 311: Michaelmas Term 2005 Part III: Topics in Commutative Algebra

Notes on Chemical Reaction Networks


Selected Examples of CONIC DUALITY AT WORK Robust Linear Optimization Synthesis of Linear Controllers Matrix Cube Theorem A.

Hyperbolicity Cones of Elementary Symmetric Polynomials. Masterarbeit

Classification of root systems

Real Symmetric Matrices and Semidefinite Programming

Lecture 1. 1 Conic programming. MA 796S: Convex Optimization and Interior Point Methods October 8, Consider the conic program. min.

Transcription:

Lecture 5: Hyperbolic polynomials Robin Pemantle University of Pennsylvania pemantle@math.upenn.edu Minerva Lectures at Columbia University 16 November, 2016

Ubiquity of zeros of polynomials The one contribution of mine that I hope will be remembered has consisted in pointing out that all sorts of problems of combinatorics can be viewed as problems of the location of the zeros of certain polynomials... Gian-Carlo Rota (1985) PDE s (hyperbolicity) Number theory (zeta function) Statistical physics and probability (Lee-Yang theory)

Guler1997 Gurvits2006 Convex Programming & Matrix theorems Self concordant barriers and conjectures LEC 5 LEC 5 Baryshnikov Pemantle2011 Borcea Branden Liggett2009 Asymptotics of Multivariate Stability & strong Generating Functions Rayleigh property LEC 6 LEC 3 Garding1951 Hyperbolic polynomials and PDE s LEC 5

Basics of the theory PDE s Other uses: convex programming and matrix theorems Lecture 6: Applications to multivariate generating functions

Hyperbolicity: equivalences and examples

Definitions Definition (stability) A polynomial q is said to be stable if q(z) 0 whenever each coordinate z j is in the strict upper half plane. Definition (hyperbolicity) A homogeneous polynomial p of degree m is said to be hyperbolic in direction x R d if p(x + iy) 0 for all y R d. Proposition (equivalences) Hyperbolicity in direction x is equivalent to the univariate polynomial p(y + tx) having only real roots for all y R d. A real homogeneous polynomial is stable if and only if it is hyperbolic in every direction in the positive orthant.

Example: Lorentzian quadratic Let p be the Lorentzian quadratic t 2 x 2 2 x2 d, where we have renamed x 1 as t because of its interpretation as the time axis in spacetime; then p is hyperbolic in every timelike direction, that is, for each direction x with p(x) > 0. The time axis is left-right

Example: coordinate planes The coordinate function x j is hyperolic in direction y if and only if y j 0 (this is true for any linear polynomial). It is obvious from the definition that the product of polynomials hyperbolic in direction y is again hyperbolic in direction y. It follows that d j=1 x j is hyperbolic in every direction not contained in a coordinate plane, that is, in every open orthant.

Cones of hyperbolicity The real variety V := {p = 0} R d plays a special role in hyperbolicity theory. Proposition (cones of hyperbolicity) Let ξ be a direction of hperbolicity for p and let K(p, ξ) denote the connectied component of the set R d \ V that contains ξ. p is hyperbolic in direction x for every x K(p, ξ). The set K(p, ξ) is an open convex cone; we call this a cone of hyperbolicity for p.

Examples, continued Every component of R 3 \ V is a cone of hyperbolicity for p := xyz The forward and backward light cones are cones of hypebolicity for p := x 2 y 2 z 2

Example: Fortress polynomial w 4 u 2 w 2 v 2 w 2 + 9 25 u2 v 2 is the projective localization of the denominator (cleaned up a bit) of the so-called Fortress generating polynomial. It follows from [BP11, Proposition 2.12] that this polynomial is hyperbolic in certain directions, e.g., forward and backward cones (pictured pointing NE and SW).

Hyperbolicity and PDE s These results are not needed for applications to other areas but they serve to explain where the consructions originate.

Stable evolution of PDE s Let p be a polynomial in d variables and denote by D p the operator p( / x) obtained by replacing each x i by / x i. Let r be a vector in R d, let H r be the hyperplane orthogonal to r, and consider the equation D p (f) = 0 (1) in the halfspace {r x 0} with boundary conditions specified on H r (typically, f and its first d 1 normal derivatives). We say that (1) evolves stably in direction r if convergence of the boundary conditions to 0 implies convergence 1 of the solution to 0. 1 Uniform convergence of the function and its derivatives on compact sets.

Gårding s Theorem Theorem ([Går51, Theorem III]) The equation D p f = 0 evolves stably in direction r if and only if p is hyperbolic in direction r. Let us see why this should be true.

We begin with the observation that if ξ C d is any vector with p(ξ) = 0 then f ξ (x) := exp(iξ x) is a solution to D p f = 0. (Our solutions are allowed to be complex but live on R d.) r ξ. v is complex Exponential growth in direction v when ξ v is non-real; ξ. v is real Bounded growth when ξ v is real.

Stability implies hyperbolicity: Assume WLOG that r = (0,..., 0, 1). Suppose p is not hyperbolic. Unraveling the definition, p has at least one root ξ = (a 1,..., a d 1, a d ± bi), with {a i } and b real and nonzero. In other words, v ξ v is real when restricted to r and non-real on r. Therefore, f ξ grows exponentially in direction r but is bounded on r. The same is true for f λξ. Sending λ, we may take initial conditions going to zero such that f λξ (r) = 1 for all λ.

Hyperbolicity implies stable evolution Sketch: Suppose q is hyperbolic and WLOG r = e d. For every real r = (r 1,..., r d 1 ) in frequency space there are d real values of r d such that q(r) := q(r, r d ) = 0. For each such r, the function f r is a solution to D q (f ) = 0, traveling unitarily. d bounded solutions These d solutions are a unitary basis for the space V r that they span, of solutions to (1) that restrict on H ξ to e ir x, and the same is true at any later time. Difficult argument: d independent solutions in the boundary plane, for each frequency, is enough to give boundary values up to d 1 derivatives.

Light cone Hyperbolicity can also be used to establish finite propagation speed. Contrast to parabolic equations such as the heat equation, where f (x, t) depends on f (y, 0) for all y. Example (2-D wave equation) y Let f solve f tt f yy = 0 with boundary conditions f (0, y) = g(y) and f t (0, y) = h(y). Then an explicit formula for f in the right half plane is given by y+t y t f(t,y) t f (t, y) = 1 2 [ y+t ] g(y + t) + g(y t) + h(u) du. y t

Dual cone The more general result is that the solution from δ-function initial conditions propagates on the dual cone (Paley-Wiener Theorem) and that the solution in general is given by the Riesz kernel. Let K R d be a cone and let K denote the dual cone, that is the set of all y such that x y 0 for all x K. K K*

Theorem (Riesz kernel supported on the dual cone) (i) For each cone K of hyperbolicity of p there is a soluton E x to D p E x = δ x in R d supported on the dual cone K. (ii) This solution is called the Riesz kernel and is defined by E x (r) := (2π) d R d q(x + iy) 1 exp[r (x + iy)] dy. (iii) The boundary value problem D p f = 0 on the halfspace x r > 0 with boundary values g and normal derivatives vanishing to order deg(p) 1 is given by E x (r)g(x)dx. We will discuss the proof next lecture when extracting coefficients of rational multivariate generating functions.

Convex programming and self-concordant barriers

Interior point method Convex programming is the algorithmic location of the minimum of a convex function f on a convex set K R d. Without loss of generality, f (x) = c T x is linear (intersect K with the region above the graph of f ) and the minimum on K occurs on K. The interior point method is to let φ be a barrier function going to infinity at K and to define a family of interior minima x (t) := argmin x c T x + t 1 φ(x) converging to the true minimum x as t. This is useful because from x (t), it is often possible to approximate x (t + t) very quickly.

Inequalities that make this work well Suppose φ C 3 is defined on the interior of a convex cone K, is homogeneous of degree m, and goes to infinity at K. If F := log φ satisfies D 3 F (x)[h, h, h] 2 ( D 2 F (x)[x, x] ) 3/2 then φ serves as a barrier function. If, in addition, DF (x)[h] 2 md 2 F (x)[h, h] then the interior method will converge well.

Hyperbolicity and self-concordant barriers Theorem (Nesterov and Todd 1997; Guler 1997) If p is a homogeneous polynomial vanishing on the boundary of a convex cone K, then φ(x) = p(x) α satisfies the above inequalities, bounding second directional derivative from below by constant multiples of the 2/3-power of the third derivative and the square first derivative, all in the same direction.

Hyperbolicity is universal for homogeneous cones This establishes a long-step interior point algorithm on the cone of hyperbolicity of any hyperbolic polynomial. It turns out this is more general than one might think. Say a cone is homogeneous if the linear maps preserving it as a set act transitively on its interior. Proposition All homogenous convex cones are cones of hyperbolicity for some polynomial.

Matrix theorems and conjectures

van der Waerden conjecture Theorem (van der Waerden conjecture) The minimum permanent of an n n doubly stochastic matrix is n!/n n, uniquely obtained when all entries are 1/n. This was proved by Falikman (value of the minimum) and Egorychev (uniqueness) in 1981. Gurvits (2008) found a much simpler proof using stability. Let C n be the set of homogeneous polynomials of degree n in n variables with nonnegative coefficients.

Permanent as a stable polynomial in C n If A is an n n matrix then its permanent may be represented as a mixed partial derivative of a homogeneous stable polynoimal. Define n n p A (x) := a ij x j. i=1 j=1 Proposition (Gurvits 2008) The polynomial p is stable and per (A) = n x 1 x n p A (0,..., 0).

Gurvits argument Define Cap (p) := inf p(x 1,..., x n ) n i=1 x. i The following two results, also from Gurvits (2008), immediately imply the van der Waerden conjecture. Proposition If A is doubly stochastic then Cap (p A ) = 1. Theorem (Gurvits inequality) If p C n is stable then n x 1 x n p(0,..., 0) Cap (p) n! n n.

More mileage Note: both propositions are immediate and the theorem is a short induction. Further mileage may be obtained from this. The theorem also implies the Schrijver-Valiant conjecture (1980, proved by Schrijver in 1998), concerning the minimum permanent of n n nonnegative integer matrices whose row and column sums are k.

Monotone Column Permanent (MCP) Conjecture Say that the n n matrix A is a monotone column matrix if its entries are real and weakly decreasing down each column, that is, a i,j a i+1,j for 1 i n 1 and 1 j n. Let J n denote the n n matrix of all ones. It was conjectured (Haglund, Ono and Wagner 1999) that whenever A is a monotone column matrix, the univariate polynomial Per (zj n + A) has only real roots. A proof was given there for the case where A is a zero-one matrix.

MCP theorem Theorem (Brändén, Haglund, Ono and Wagner 2009) Let Z n be the diagonal matrix whose entries are the n indeterminates z 1,..., z n. Let A by an n n monotone column matrix. Then Per (J n Z n + A) is a stable polynomial in the variables z 1,..., z n. Specializing to z j z for all j preserves stability, hence the original conjecture follows. The proof is via a lemma making use of the multivariate Pólya-Schur theorem to reduce to the special case, already proved, where all entries are 0 or 1.

End of Lecture 5

References I P. Brändén, J. Haglund, M. Visontai, and D. Wagner. Proof of the monotone column permanent conjecture. In FPSAC 2009, volume AK, pages 443 454, Nancy, 2009. Assoc. Discrete Math. Theor. Comput. Sci. Y. Baryshnikov and R. Pemantle. Asymptotics of multivariate sequences, part III: quadratic points. Adv. Math., 228:3127 3206, 2011. G. Egorychev. The solution of van der Waerden s problem for permanents. Adv. Math., 42:299 305, 1981. D. Falikman. Proof of the van der Waerden s conjecture on the permanent of a doubly stochastic matrix. Mat. Zametki, 29:931 938, 1981. (in Russian).

References II L. Gårding. Linear hyperbolic partial differential equations with constant coefficients. Acta Math., 85:1 62, 1951. O. Güler. Hyperbolic polynomials and interior point methods for convex programming. Math. Oper. Res., 22:350 377, 1997. L. Gurvits. The van der Waerden conjecture for mixed discriminants. Adv. Math., 200:435 454, 2006. J. Haglund, K. Ono, and D. G. Wagner. Theorems and conjectures involving rook polynomials with only real zeros. In Topics in Number Theory, volume 467 of Math. Appl., pages 207 221. Kluwer Academic Publishers, Dordrecht, 1999. R. Pemantle. Hyperbolicity and stable polynomials in combinatorics and probability. In Current Developments in Mathematics, pages 57 124. International Press, Somerville, MA, 2012.