ALGEBRA: From Linear to Non-Linear. Bernd Sturmfels University of California at Berkeley

Similar documents
SPECTRAHEDRA. Bernd Sturmfels UC Berkeley

SPECTRAHEDRA. Bernd Sturmfels UC Berkeley

The Geometry of Semidefinite Programming. Bernd Sturmfels UC Berkeley

CONVEX ALGEBRAIC GEOMETRY. Bernd Sturmfels UC Berkeley

Semidefinite Programming

d A 0 + m t k A k 0 whenever λ min (B k (x)) t k λ max (B k (x)) for k = 1, 2,..., m x n B n (k).

The Algebraic Degree of Semidefinite Programming

Unbounded Convex Semialgebraic Sets as Spectrahedral Shadows

Persistence and Stationary Distributions of Biochemical Reaction Networks

Toric Dynamical Systems

Journal of Symbolic Computation

ALGEBRAIC DEGREE OF POLYNOMIAL OPTIMIZATION. 1. Introduction. f 0 (x)

Tropical Islands. Jan Verschelde

Tropical Algebra. Notes by Bernd Sturmfels for the lecture on May 22, 2018, in the IMPRS Ringvorlesung Introduction to Nonlinear Algebra

Algebraic methods for biochemical reaction network theory. Anne Joyce Shiu

x y = x + y. For example, the tropical sum 4 9 = 4, and the tropical product 4 9 = 13.

Binomial Ideals from Graphs

INITIAL COMPLEX ASSOCIATED TO A JET SCHEME OF A DETERMINANTAL VARIETY. the affine space of dimension k over F. By a variety in A k F

d + x u1 y v p 1 (x, y) = det

Convex Optimization. (EE227A: UC Berkeley) Lecture 28. Suvrit Sra. (Algebra + Optimization) 02 May, 2013

LMI MODELLING 4. CONVEX LMI MODELLING. Didier HENRION. LAAS-CNRS Toulouse, FR Czech Tech Univ Prague, CZ. Universidad de Valladolid, SP March 2009

Gram Spectrahedra. Lynn Chua UC Berkeley. joint work with Daniel Plaumann, Rainer Sinn, and Cynthia Vinzant

Semidefinite Representation of the k-ellipse

6-1 The Positivstellensatz P. Parrilo and S. Lall, ECC

Dissimilarity maps on trees and the representation theory of GL n (C)

QUARTIC SPECTRAHEDRA. Bernd Sturmfels UC Berkeley and MPI Bonn. Joint work with John Christian Ottem, Kristian Ranestad and Cynthia Vinzant

Preliminaries and Complexity Theory

RESEARCH ARTICLE. An extension of the polytope of doubly stochastic matrices

A new parametrization for binary hidden Markov modes

Commuting birth-and-death processes

SPECTRA - a Maple library for solving linear matrix inequalities in exact arithmetic

Describing convex semialgebraic sets by linear matrix inequalities. Markus Schweighofer. Universität Konstanz

Polynomials, Ideals, and Gröbner Bases

5. Grassmannians and the Space of Trees In this lecture we shall be interested in a very particular ideal. The ambient polynomial ring C[p] has ( n

Combinatorial Aspects of Tropical Geometry and its interactions with phylogenetics

Journal of Algebra 226, (2000) doi: /jabr , available online at on. Artin Level Modules.

Positive semidefinite rank

CSC Linear Programming and Combinatorial Optimization Lecture 10: Semidefinite Programming

Mic ael Flohr Representation theory of semi-simple Lie algebras: Example su(3) 6. and 20. June 2003

arxiv: v2 [math.ds] 12 Jul 2011

CHAPTER 7: Systems and Inequalities

Tensors: a geometric view Open lecture November 24, 2014 Simons Institute, Berkeley. Giorgio Ottaviani, Università di Firenze

Convex Optimization of Graph Laplacian Eigenvalues

Polynomial Dynamical Systems as Reaction Networks and Toric Differential Inclusions

Geometry of Phylogenetic Inference

AN INTRODUCTION TO AFFINE TORIC VARIETIES: EMBEDDINGS AND IDEALS

Voronoi Cells of Varieties

ON THE RANK OF A TROPICAL MATRIX

An explicit construction of distinguished representations of polynomials nonnegative over finite sets

Open Problems in Algebraic Statistics

Mathematical and computational methods for understanding the dynamics of biochemical networks

COURSE ON LMI PART I.2 GEOMETRY OF LMI SETS. Didier HENRION henrion

Combinatorics and geometry of E 7

Sparsity of Matrix Canonical Forms. Xingzhi Zhan East China Normal University

MIT Algebraic techniques and semidefinite optimization May 9, Lecture 21. Lecturer: Pablo A. Parrilo Scribe:???

A degree bound for codimension two lattice ideals

UCSD Mathematics Department Tel: (858) Gilman Drive, La Jolla, CA Fax: (858)

Exact algorithms: from Semidefinite to Hyperbolic programming

Rational Sums of Squares and Applications

Spectral Graph Theory and You: Matrix Tree Theorem and Centrality Metrics

Recognizing Graph Theoretic Properties with Polynomial Ideals

Bases for Cluster Algebras from Surfaces

The convex algebraic geometry of linear inverse problems

ACO Comprehensive Exam October 18 and 19, Analysis of Algorithms

arxiv:math/ v1 [math.ac] 11 Nov 2005

SEMIDEFINITE PROGRAM BASICS. Contents

Polytopes and Algebraic Geometry. Jesús A. De Loera University of California, Davis

Shortest paths with negative lengths

Sixty-Four Curves of Degree Six

MATH 8. Unit 1: Rational and Irrational Numbers (Term 1) Unit 2: Using Algebraic Properties to Simplify Expressions - Probability

Toric Ideals, an Introduction

College Algebra To learn more about all our offerings Visit Knewton.com

In English, this means that if we travel on a straight line between any two points in C, then we never leave C.

Lecture 1. Toric Varieties: Basics

College Algebra with Corequisite Support: Targeted Review

A tropical approach to secant dimensions

AN INTRODUCTION TO TROPICAL GEOMETRY

Semidefinite and Second Order Cone Programming Seminar Fall 2001 Lecture 5

Geometry of Gaussoids

Lecture Note 5: Semidefinite Programming for Stability Analysis

chapter 12 MORE MATRIX ALGEBRA 12.1 Systems of Linear Equations GOALS

Quantum Computing Lecture 2. Review of Linear Algebra

Real Algebraic Geometry in Convex Optimization. Cynthia Vinzant

Semi-definite representibility. For fun and profit

HYPERBOLIC POLYNOMIALS, INTERLACERS AND SUMS OF SQUARES

3. Vector spaces 3.1 Linear dependence and independence 3.2 Basis and dimension. 5. Extreme points and basic feasible solutions

TORIC REDUCTION AND TROPICAL GEOMETRY A.

College Algebra with Corequisite Support: A Blended Approach

A SURVEY OF CLUSTER ALGEBRAS

An Algorithm for Solving the Convex Feasibility Problem With Linear Matrix Inequality Constraints and an Implementation for Second-Order Cones

HYPERBOLICITY CONES AND IMAGINARY PROJECTIONS

Combinatorics for algebraic geometers

Ellipsoidal Mixed-Integer Representability

Definition 2.3. We define addition and multiplication of matrices as follows.

THE PYTHAGOREAN THEOREM

MCS 563 Spring 2014 Analytic Symbolic Computation Monday 14 April. Binomial Ideals

College Algebra with Corequisite Support: A Compressed Approach

Test Codes : MIA (Objective Type) and MIB (Short Answer Type) 2007

Convex algebraic geometry, optimization and applications

MATH 7 HONORS. Unit 1: Rational and Irrational Numbers (Term 1) Unit 2: Using Algebraic Properties to Simplify Expressions - Probability

Transcription:

ALGEBRA: From Linear to Non-Linear Bernd Sturmfels University of California at Berkeley John von Neumann Lecture, SIAM Annual Meeting, Pittsburgh, July 13, 2010

Undergraduate Linear Algebra All undergraduate students learn about Gaussian elimination, a general method for solving linear systems of algebraic equations: Input: x + 2y + 3z = 5 7x + 11y + 13z = 17 19x + 23y + 29z = 31

Undergraduate Linear Algebra All undergraduate students learn about Gaussian elimination, a general method for solving linear systems of algebraic equations: Input: Output: x + 2y + 3z = 5 7x + 11y + 13z = 17 19x + 23y + 29z = 31 x = 35/18 y = 2/9 z = 13/6 Solving very large linear systems is central to applied mathematics.

Undergraduate Non-Linear Algebra Lucky undergraduate students also learn about Gröbner bases, a general method for non-linear systems of algebraic equations: Input: x 2 + y 2 + z 2 = 2 x 3 + y 3 + z 3 = 3 x 4 + y 4 + z 4 = 4

Undergraduate Non-Linear Algebra Lucky undergraduate students also learn about Gröbner bases, a general method for non-linear systems of algebraic equations: Input: x 2 + y 2 + z 2 = 2 x 3 + y 3 + z 3 = 3 x 4 + y 4 + z 4 = 4 Output: 3z 12 12z 10 12z 9 +12z 8 +72z 7 66z 6 12z 4 +12z 3 1 = 0 4y 2 + (36z 11 +54z 10 69z 9 252z 8 216z 7 +573z 6 +72z 5 12z 4 99z 3 +10z+3) y + 36z 11 +48z 10 72z 9 234z 8 192z 7 +564z 6 48z 5 +96z 4 96z 3 +10z 2 +8 = 0 4x + 4y + 36z 11 +54z 10 69z 9 252z 8 216z 7 +573z 6 +72z 5 12z 4 99z 3 +10z+3 = 0 Non-linear equations can be intimidating, but they are important.

This Lecture What I shall speak about: Chemical Reaction Networks Convex Algebraic Geometry Tropical Mathematics What others will speak about: Numerical Algebraic Geometry Algebraic Methods in Discrete Optimization Multilinear Algebra and Tensors Algebraic Statistics

Topic 1: CHEMICAL REACTION NETWORKS Chemical species A, B, H, O,... can form complexes that react: A + B 2H + O We write complexes in multiplicative notation: AB H 2 O A chemical reaction network is a collection of such reactions. Associated with each reaction is a rate constant κ R >0. The concentrations c (t) of the species are functions of time t. Relevant authors include F. Horn, R. Jackson, M. Feinberg, E. Sontag, D. Anderson, G. Gnacadja, J. Gunawardena, C. Conradi, J. Stelling, G. Craciun, W. Helton, P. De Leenheer, A. Shiu,... My point: Reaction network theory is important for systems biology, and non-linear algebra can make useful contributions. R. Laubenbacher and BSt: Computer algebra in systems biology, Amer. Math. Monthly 116 (2009) 882-891.

Networks and their Siphons Suppose there are s species that form n complexes. These correspond to monomials c y 1,c y 2,...,c yn in the unknown concentrations c 1,...,c s. Our chemical reaction network is a directed graph G whose nodes are labeled by these monomials. A siphon is a subset S of species such that all reactions in a component of G are shut off when c i = 0 for all i S. Example. If the network is G = { AB H 2 O, AC H 2 O } then S = {A} and S = {B,C} are the minimal siphons of G.

Networks and their Siphons Suppose there are s species that form n complexes. These correspond to monomials c y 1,c y 2,...,c yn in the unknown concentrations c 1,...,c s. Our chemical reaction network is a directed graph G whose nodes are labeled by these monomials. A siphon is a subset S of species such that all reactions in a component of G are shut off when c i = 0 for all i S. Example. If the network is G = { AB H 2 O, AC H 2 O } then S = {A} and S = {B,C} are the minimal siphons of G. Question: Which faces of the positive orthant R s 0 can contain steady states of the reaction network? Proposition. If the face {c R s 0 : c i = 0 for i S} contains a steady state, for some dynamics on G, then S is a siphon.

Primary Decomposition Fix the ring R = Q[c 1,...,c s ]/ c 1 c 2 c s. With a chemical reaction network G we associate three ideals M G = c y 1,c y 2,...,c yn J G = c y i c y j : c y i c y j is a reaction of G I G = c y i (c y i c y j ) : c y i c y j is a reaction of G

Primary Decomposition Fix the ring R = Q[c 1,...,c s ]/ c 1 c 2 c s. With a chemical reaction network G we associate three ideals M G = c y 1,c y 2,...,c yn J G = c y i c y j : c y i c y j is a reaction of G I G = c y i (c y i c y j ) : c y i c y j is a reaction of G Theorem. The minimal siphons of G are the inclusion-minimal sets {i [s] : c i P} where P runs over the minimal primes of I G. If each connected component of G is strongly connected, then we can replace I G by the simpler ideal J G. If G is strongly connected, then we can replace I G by M G. [Anne Shiu and BSt: Siphons in chemical reaction networks, Bulletin of Mathematical Biology (2010)] Monomial primary decomposition is fast: (Example: 50 species and over 1,000,000 minimal siphons. Macaulay2 computes them in 43 seconds. [B. Roune] )

Mass-Action Kinetics The node i of the network G is labeled by the monomial c y i = c y i1 1 cy i2 2 c y is s. Y = (y ij ) is an n s-matrix of non-negative integers. The monomial labels are the entries in the row vector Ψ(c) = ( c y 1, c y 2,..., c yn). Mass-action kinetics specified by the network G and the reaction rates κ ij defines the differential equations dc dt = Ψ(c) A κ Y

Mass-Action Kinetics The node i of the network G is labeled by the monomial c y i = c y i1 1 cy i2 2 c y is s. Y = (y ij ) is an n s-matrix of non-negative integers. The monomial labels are the entries in the row vector Ψ(c) = ( c y 1, c y 2,..., c yn). Mass-action kinetics specified by the network G and the reaction rates κ ij defines the differential equations dc dt = Ψ(c) A κ Y, where the n n-matrix A κ is the Laplacian of G. Example. The network G = c y 1 c y 2 c y 3 has the Laplacian κ 12 κ 12 0 A κ = κ 21 κ 21 κ 23 κ 23 0 κ 32 κ 32

Steady State Analysis The steady states are the solutions c R s 0 Ψ(c) A κ Y = 0 of the s equations These equations are linear in the parameters κ ij but they are non-linear in the variables c 1,...,c s. Mass action on a network G is a toric dynamical system if there exists a solution c R s >0 of the n equations Ψ(c ) A κ = 0.

Steady State Analysis The steady states are the solutions c R s 0 Ψ(c) A κ Y = 0 of the s equations These equations are linear in the parameters κ ij but they are non-linear in the variables c 1,...,c s. Mass action on a network G is a toric dynamical system if there exists a solution c R s >0 of the n equations Ψ(c ) A κ = 0. Theorem (Craciun-Dickenstein-Shiu-St, 2009) For fixed κ, the set of such solutions c is either empty or is a toric variety, i.e. an irreducible variety cut out by binomial equations c u 1 1 cu 2 2 cus s = c v 1 1 cv 2 2 cvs s. For a fixed network G, the set κ for which it is non-empty is also a toric variety, called the moduli space of toric dynamical systems.

Toric Varieties connect the linear world and the non-linear world. If we set γ i = log(c i ) then the binomial equation c u 1 1 cu 2 2 cus s = c v 1 1 cv 2 2 cvs s translates into the linear equation (u 1 v 1 )γ 1 + (u 2 v 2 )γ 2 + + (u s v s )γ s = 0.

Toric Varieties connect the linear world and the non-linear world. If we set γ i = log(c i ) then the binomial equation c u 1 1 cu 2 2 cus s = c v 1 1 cv 2 2 cvs s translates into the linear equation (u 1 v 1 )γ 1 + (u 2 v 2 )γ 2 + + (u s v s )γ s = 0. But binomials have a rich structure that is essential in applications: Test Sets in Integer Programming, Markov Bases in Statistics. In context of bio-chemical reaction networks, the previous theorem elucidates the geometry behind Feinberg s Deficiency Theory, led to recent progress on the Global Attractor Conjecture.

Topic 2: CONVEX ALGEBRAIC GEOMETRY A spectrahedron is the intersection of the cone of positive semidefinite matrices with an affine-linear space. Its algebraic representation is a linear combination of symmetric matrices A 0 + x 1 A 1 + x 2 A 2 + + x m A m 0 ( )

Topic 2: CONVEX ALGEBRAIC GEOMETRY A spectrahedron is the intersection of the cone of positive semidefinite matrices with an affine-linear space. Its algebraic representation is a linear combination of symmetric matrices A 0 + x 1 A 1 + x 2 A 2 + + x m A m 0 ( ) Semidefinite programming is the computational problem of maximizing a linear function over a spectrahedron: Maximize c 1 x 1 + c 2 x 2 + + c m x m subject to ( ) Example: The smallest eigenvalue of a symmetric matrix A is the solution of the SDP: Maximize x subject to A x Id 0.

Multifocal Ellipses The 3-ellipse with foci (0,0),(1,0),(0,1) has the representation 2 3 d + 3x 1 y 1 y 0 y 0 0 0 y 1 d + x 1 0 y 0 y 0 0 y 0 d + x + 1 y 1 0 0 y 0 0 y y 1 d x + 1 0 0 0 y y 0 0 0 d + x 1 y 1 y 0 6 0 y 0 0 y 1 d x 1 0 y 7 4 0 0 y 0 y 0 d x + 1 y 1 5 0 0 0 y 0 y y 1 d 3x + 1 This is a convex curve of degree eight. Its interior consists of all points (x,y) where this symmetric 8 8-matrix is positive definite:

Ellipses are Spectrahedra Theorem: The polynomial equation defining the m-ellipse has degree 2 m if m is odd and degree 2 m ( m m/2) if m is even. It is the determinant of a symmetric matrix of linear polynomials. This representation extends to m-ellipsoids in arbitrary dimensions. [J. Nie, P. Parrilo, BSt.: Semidefinite representation of the k-ellipse, in Algorithms in Algebraic Geometry, I.M.A. Volumes in Mathematics and its Applications, 146, Springer, New York, 2008, pp. 117-132] CONVEX ALGEBRAIC GEOMETRY is the marriage of real algebraic geometry with optimization theory. It concerns convex figures such as ellipses, ellipsoids, and much more... including polyhedra

Convex Polyhedra Linear programming is semidefinite programming for diagonal matrices. If A 0,A 1,...,A m are diagonal n n-matrices then A 0 + x 1 A 1 + x 2 A 2 + + x m A m 0 translates into a system of n linear inequalities in the m unknowns. A spectrahedron defined in this manner is a convex polyhedron:

Spectrahedra are Beautiful......and useful too: in rank minimization, compressed sensing etc...

Non-Linear Convex Hull Computation Input : { (t,t 2,t 3 ) R 3 : 1 t 1 } 1 0.8 0.6 0.4 0.2 y 3 0 0.2 0.4 0.6 0.8 1 1 y 2 0.5 0 The convex hull of this curve is a spectrahedron. ( ) ( ) 1 x x y Output : ± 0 x y y z 1 0.5 0 y 1 0.5 1

Global Optimization of Polynomials Let f (x 1,...,x m ) be a polynomial of even degree 2d. We wish to compute the global minimum x of f (x) on R m. This optimization problem is hard. It is equivalent to Maximize λ such that f (x) λ is non-negative on R m. The following relaxtion gives a lower bound: Maximize λ such that f (x) λ is a sum of squares of polynomials. This is much easier. It is a semidefinite program. In practice, the optimal value of the SDP often agrees with the global minimum, and an optimal point x can be recovered. [P.Parrilo, BSt: Minimizing polynomial functions, DIMACS Ser. 60, Amer.Math.Soc., Providence, 2003, pp. 83 99]

SOS Programming: A Univariate Example Let m = 1, d = 2 and f (x) = 3x 4 + 4x 3 12x 2. Then f (x) λ = ( x 2 x 1 ) 3 2 µ 6 x 2 2 2µ 0 x µ 6 0 λ 1 Our problem is to find (λ,µ) such that the 3 3-matrix is positive semidefinite and λ is maximal.

SOS Programming: A Univariate Example Let m = 1, d = 2 and f (x) = 3x 4 + 4x 3 12x 2. Then f (x) λ = ( x 2 x 1 ) 3 2 µ 6 x 2 2 2µ 0 x µ 6 0 λ 1 Our problem is to find (λ,µ) such that the 3 3-matrix is positive semidefinite and λ is maximal. The optimal solution of this SDP is (λ,µ ) = ( 32, 2). Cholesky factorization reveals the SOS representation f (x) λ = ( ( 3 x 4 ) (x + 2) ) 2 8( ) 2. + x + 2 3 3 We see that the global minimum is x = 2. This approach works for many polynomial optimization problems.

Characterization of Spectrahedra A convex hypersurface of degree d in R n is rigid convex if every line passing through its interior meets the corresponding complex hypersurface in d real points. Theorem (Helton Vinnikov (2006)) Every spectrahedron is rigid convex. The converse is true for n = 2. Open problem: Is every compact convex basic semialgebraic set S the projection of a spectrahedron in higher dimensions?

Topic 3: TROPICAL MATHEMATICS Let s simplify arithmetic: x y = minimum of x and y x y = x + y Example: 3 (4 5) = 3 4 3 5 = 7 8 = 7

Topic 3: TROPICAL MATHEMATICS Let s simplify arithmetic: x y = minimum of x and y x y = x + y Example: 3 (4 5) = 3 4 3 5 = 7 8 = 7 Neutral Elements: x = x 0 x = x

Topic 3: TROPICAL MATHEMATICS Let s simplify arithmetic: x y = minimum of x and y x y = x + y Example: 3 (4 5) = 3 4 3 5 = 7 8 = 7 Neutral Elements: x = x 0 x = x Matrix Multiplication: [ 3 3 0 7 ] [ 4 1 5 2 ] = [ 7 4 4 1 ]

Matrices and Metrics 1 2 D = 0 d 12 d 13 d 14 d 21 0 d 23 d 24 d 31 d 32 0 d 34 d 41 d 42 d 43 0 3 4

Matrices and Metrics 1 2 D = 0 d 12 d 13 d 14 d 21 0 d 23 d 24 d 31 d 32 0 d 34 d 41 d 42 d 43 0 The (i,j)-entry of the matrix D k = D D D is the length of a shortest path from i to j using k steps. To find shortest pairwise distances in a directed graph with n nodes, compute the tropical matrix power D n. 3 4

Matrices and Metrics 1 2 D = 0 d 12 d 13 d 14 d 21 0 d 23 d 24 d 31 d 32 0 d 34 d 41 d 42 d 43 0 The (i,j)-entry of the matrix D k = D D D is the length of a shortest path from i to j using k steps. To find shortest pairwise distances in a directed graph with n nodes, compute the tropical matrix power D n. A symmetric matrix D represents a metric if and only if D 2 = D (triangle inequalities) 3 4

Matrices and Metrics 1 2 D = 0 d 12 d 13 d 14 d 21 0 d 23 d 24 d 31 d 32 0 d 34 d 41 d 42 d 43 0 The (i,j)-entry of the matrix D k = D D D is the length of a shortest path from i to j using k steps. To find shortest pairwise distances in a directed graph with n nodes, compute the tropical matrix power D n. A symmetric matrix D represents a metric if and only if D 2 = D (triangle inequalities) D is a tree metric if it comes from a tree with edge lengths. 1 2 5 1 7 3 4 3 6 2 4 d 12 = 5 + 1 + 7 = 13, etc.

Phylogenetics Q: Is every metric a tree metric? A: No, but biologists care about those that are.

Phylogenetics Q: Is every metric a tree metric? A: No, but biologists care about those that are. Theorem [4 Point Condition]: A metric D is a tree metric if and only if the max-plus polynomial d ij d kl d ik d jl d il d jk attains its maximum twice, for any four leaves i, j, k and l. Theorem 2.34 in [L.Pachter and BSt: Algebraic Statistics for Computational Biology, Cambridge Univ. Press, 2007] 1 2 5 1 7 3 6 2 4 d 12 = 13, d 13 = 11, d 14 = 8, d 23 = 14, d 24 = 9, d 34 = 9. d 12 d 34 d 13 d 24 d 14 d 23 = 22 20 22 = 22. Theorem: The space of trees is the tropical Grassmannian G(2, n).

Closing Thoughts Linear Algebra has been fantastically useful. Non-Linear Algebra is the natural next step. Quiz: How to draw the convex hull of the trigonometric curve (cos(θ),cos(2θ),sin(3θ) )?

Closing Thoughts Linear Algebra has been fantastically useful. Non-Linear Algebra is the natural next step. Quiz: How to draw the convex hull of the trigonometric curve (cos(θ),cos(2θ),sin(3θ) )? Quiz: How many real solutions do these equations have: x 2 + y 2 + z 2 = 2 x 3 + y 3 + z 3 = 3 x 4 + y 4 + z 4 = 4 To find out, please talk to one of the members of the SIAM Activity Group in Algebraic Geometry SI(AG) 2. THANK YOU for your attention!!