The maximum likelihood degree of rank 2 matrices via Euler characteristics

Similar documents
Maximum likelihood for dual varieties

The Maximum Likelihood Degree of Mixtures of Independence Models

THE MAXIMUM LIKELIHOOD DEGREE OF MIXTURES OF INDEPENDENCE MODELS

The Algebraic Degree of Semidefinite Programming

A Gauss-Bonnet theorem for constructible sheaves on reductive groups

Open Problems in Algebraic Statistics

Cohomology jump loci of quasi-projective varieties

Likelihood Geometry. June Huh and Bernd Sturmfels

ALGEBRAIC DEGREE OF POLYNOMIAL OPTIMIZATION. 1. Introduction. f 0 (x)

Resolution of Singularities in Algebraic Varieties

Sixty-Four Curves of Degree Six

EUCLIDEAN DISTANCE DEGREE OF THE MULTIVIEW VARIETY

10. Smooth Varieties. 82 Andreas Gathmann

T -equivariant tensor rank varieties and their K-theory classes

FOUNDATIONS OF ALGEBRAIC GEOMETRY CLASS 37

K3 Surfaces and Lattice Theory

The Geometry of Matrix Rigidity

Introduction to Algebraic Statistics

Numerical local irreducible decomposition

Tensors. Notes by Mateusz Michalek and Bernd Sturmfels for the lecture on June 5, 2018, in the IMPRS Ringvorlesung Introduction to Nonlinear Algebra

NUMERICAL COMPUTATION OF GALOIS GROUPS

The Waring rank of the Vandermonde determinant

Cohomology jump loci of local systems

Semidefinite Programming

Introduction to Arithmetic Geometry Fall 2013 Lecture #17 11/05/2013

arxiv: v1 [math.ag] 24 Apr 2015

Regeneration Homotopies for Solving Systems of Polynomials

arxiv: v2 [math.ag] 3 Dec 2016

David Eklund. May 12, 2017

Voronoi Cells of Varieties

Torus actions on positively curved manifolds

A NEW FAMILY OF SYMPLECTIC FOURFOLDS

The Stable Types and Minimal Whitney Stratifications of Discriminants and Euler Obstruction.

Tensor decomposition and tensor rank

Numerical Local Irreducible Decomposition

On the Waring problem for polynomial rings

Homotopy Techniques for Tensor Decomposition and Perfect Identifiability

CW-complexes. Stephen A. Mitchell. November 1997

Exercises on characteristic classes

arxiv: v4 [math.rt] 9 Jun 2017

The calibrated trifocal variety

Exercise Sheet 7 - Solutions

Torus Knots and q, t-catalan Numbers

Commuting birth-and-death processes

Basic facts and definitions

Euler characteristic of the truncated order complex of generalized noncrossing partitions

Hodge theory for combinatorial geometries

Special Session on Secant Varieties. and Related Topics. Secant Varieties of Grassmann and Segre Varieties. Giorgio Ottaviani

Polynomials, Ideals, and Gröbner Bases

Some constructions of projective varieties

Algebraic Geometry. Question: What regular polygons can be inscribed in an ellipse?

Numerical computation of the genus of an irreducible curve within an algebraic set

Algebraic Geometry (Math 6130)

Topological Data Analysis - Spring 2018

INTRODUCTION TO ALGEBRAIC GEOMETRY, CLASS 14

Multiplicity One Theorems

Vectors. January 13, 2013

The Euclidean Distance Degree of an Algebraic Variety

Arrangements, matroids and codes

Sheaf cohomology and non-normal varieties

ARITHMETICALLY COHEN-MACAULAY BUNDLES ON THREE DIMENSIONAL HYPERSURFACES

A new parametrization for binary hidden Markov modes

π X : X Y X and π Y : X Y Y

mathematics Smoothness in Binomial Edge Ideals Article Hamid Damadi and Farhad Rahmati *

Examples of numerics in commutative algebra and algebraic geo

Math 418 Algebraic Geometry Notes

LYUBEZNIK NUMBERS AND DEPTH. Matteo Varbaro

MULTIVARIABLE CALCULUS, LINEAR ALGEBRA, AND DIFFERENTIAL EQUATIONS

Curtis Heberle MTH 189 Final Paper 12/14/2010. Algebraic Groups

Ma/CS 6b Class 20: Spectral Graph Theory

Elliptic curves over function fields 1

Secant varieties. Marin Petkovic. November 23, 2015

arxiv:math/ v2 [math.ag] 25 Mar 2004

List of Symbols, Notations and Data

SHELLABILITY AND HIGHER COHEN-MACAULAY CONNECTIVITY OF GENERALIZED CLUSTER COMPLEXES

Multivariate Gaussians, semidefinite matrix completion, and convex algebraic geometry

SPACES OF RATIONAL CURVES ON COMPLETE INTERSECTIONS

The Central Curve in Linear Programming

Boolean Product Polynomials and the Resonance Arrangement

DIFFERENTIAL TOPOLOGY AND THE POINCARÉ-HOPF THEOREM

Tensors: a geometric view Open lecture November 24, 2014 Simons Institute, Berkeley. Giorgio Ottaviani, Università di Firenze

Projective Varieties. Chapter Projective Space and Algebraic Sets

Is Every Secant Variety of a Segre Product Arithmetically Cohen Macaulay? Oeding (Auburn) acm Secants March 6, / 23

arxiv: v1 [math.ag] 13 Mar 2019

The weight filtration for real algebraic varieties

The geography of irregular surfaces

Chapter 3: Maximum-Likelihood & Bayesian Parameter Estimation (part 1)

DM559 Linear and Integer Programming. Lecture 2 Systems of Linear Equations. Marco Chiarandini

Kazhdan-Lusztig polynomials of matroids

The Geometry of Semidefinite Programming. Bernd Sturmfels UC Berkeley

QUALIFYING EXAMINATION Harvard University Department of Mathematics Tuesday 10 February 2004 (Day 1)

ECE521 week 3: 23/26 January 2017

Defining Valuation Rings

Geometric interpretations for the algebra of conformal blocks

On the Virtual Fundamental Class

Euler Characteristic of Two-Dimensional Manifolds

Using algebraic geometry for phylogenetic reconstruction

Multiplicity of singularities is not a bi-lipschitz invariant

Advanced Counting Techniques. Chapter 8

THE EULER CHARACTERISTIC OF A LIE GROUP

Transcription:

The maximum likelihood degree of rank 2 matrices via Euler characteristics Jose Israel Rodriguez University of Notre Dame Joint work with Botong Wang Algebraic Statistics 2015 University of Genoa June 9, 2015

A mixture of independence models 1 Consider a pair of four sided dice: one red die and one blue die R 1,B 1. 2 Consider a second pair of four sided dice: one red die and one blue die R 2,B 2. 3 Consider a biased coin C =[c 1,c 2 ] The following map induces a set of probability distributions denoted M 44 15 R 16 and is called the model. 1 ( 3 3 ) ( 3 3 ) M 44 15 R 16 c 1 R 1 B T 1 + c 2 R 2 B T 2 =[p ij ] M 44 is the set of 4 4 nonnegative rank at most 2 matrices. M 44 is a mixture of two independence models.

A mixture of independence models 1 Consider a pair of four sided dice: one red die and one blue die R 1,B 1. 2 Consider a second pair of four sided dice: one red die and one blue die R 2,B 2. 3 Consider a biased coin C =[c 1,c 2 ] The following map induces a set of probability distributions denoted M 44 15 R 16 and is called the model. 1 ( 3 3 ) ( 3 3 ) M 44 15 R 16 c 1 R 1 B T 1 + c 2 R 2 B T 2 =[p ij ] M 44 is the set of 4 4 nonnegative rank at most 2 matrices. M 44 is a mixture of two independence models.

Collecting data and the likelihood function Roll the dice Rolling the dice we may observe the following data: 160 8 16 24 u =[u ij ]= 32 200 16 8 8 24 176 32 16 40 8 232 To each p in the set of probability distributions M 44 we assign the likelihood of p with respect to u by the likelihood function: ( uij 1 l u (p)= u 11,...,u 44) ij p u ij ij. The probability distribution maximizing l u (p) on the set of distributions M 44 is called the maximum likelihood estimate (mle). The mle is the best point of M 44 to describe the observed data. The statistics problem is to determine mle s.

Collecting data and the likelihood function Roll the dice Rolling the dice we may observe the following data: 160 8 16 24 u =[u ij ]= 32 200 16 8 8 24 176 32 16 40 8 232 To each p in the set of probability distributions M 44 we assign the likelihood of p with respect to u by the likelihood function: ( uij 1 l u (p)= u 11,...,u 44) ij p u ij ij. The probability distribution maximizing l u (p) on the set of distributions M 44 is called the maximum likelihood estimate (mle). The mle is the best point of M 44 to describe the observed data. The statistics problem is to determine mle s.

Collecting data and the likelihood function Roll the dice Rolling the dice we may observe the following data: 160 8 16 24 u =[u ij ]= 32 200 16 8 8 24 176 32 16 40 8 232 To each p in the set of probability distributions M 44 we assign the likelihood of p with respect to u by the likelihood function: ( uij 1 l u (p)= u 11,...,u 44) ij p u ij ij. The probability distribution maximizing l u (p) on the set of distributions M 44 is called the maximum likelihood estimate (mle). The mle is the best point of M 44 to describe the observed data. The statistics problem is to determine mle s.

Applied Algebraic Geometry The mle can be determined by solving the likelihood equations. Instead of M 44, we consider its Zariski closure X 44. The Zariski closure is described by zero sets of homogeneous polynomials. The defining polynomials of X 44 are the 3 3 minors of p 11 p 12 p 13 p 14 p 21 p 22 p 23 p 24 p 31 p 32 p 33 p 34 p 41 p 42 p 43 p 44 and the linear constraint p 11 + p 12 + +p 44 p s = 0. The equations define a projective variety of P 16 : rank at 2 matrices We consider the homogenized likelihood function l u (p)= ij (p ij /p s ) u ij on X 44.

Applied Algebraic Geometry The mle can be determined by solving the likelihood equations. Instead of M 44, we consider its Zariski closure X 44. The Zariski closure is described by zero sets of homogeneous polynomials. The defining polynomials of X 44 are the 3 3 minors of p 11 p 12 p 13 p 14 p 21 p 22 p 23 p 24 p 31 p 32 p 33 p 34 p 41 p 42 p 43 p 44 and the linear constraint p 11 + p 12 + +p 44 p s = 0. The equations define a projective variety of P 16 : rank at 2 matrices We consider the homogenized likelihood function l u (p)= ij (p ij /p s ) u ij on X 44.

Applied Algebraic Geometry The mle can be determined by solving the likelihood equations. Instead of M 44, we consider its Zariski closure X 44. The Zariski closure is described by zero sets of homogeneous polynomials. The defining polynomials of X 44 are the 3 3 minors of p 11 p 12 p 13 p 14 p 21 p 22 p 23 p 24 p 31 p 32 p 33 p 34 p 41 p 42 p 43 p 44 and the linear constraint p 11 + p 12 + +p 44 p s = 0. The equations define a projective variety of P 16 : rank at 2 matrices We consider the homogenized likelihood function l u (p)= ij (p ij /p s ) u ij on X 44.

Geometric definition of critical points Critical points can be determined by solving a system of polynomial equations. For the models in this talk, the mle is a critical point of the homogenized likelihood function. The solutions to the likelihood equations are critical points. One way to formulate the likelihood equations is to use Lagrange multipliers. We omit a formal description of the likelihood equations, but instead give a geometric description of critical points.

Geometric definition of critical points (cont.) Critical points can be determined by solving a system of polynomial equations. Let X o denote the open variety X \{coordinate hyperplanes}. X o is the set of points in X which have nonzero coordinates. The gradient of the likelihood function up to scaling equals l u (p)=[ u11 p 11 u 12 p 12... u 44 p 44 u s p s ], u s := u ij. ij The gradient is defined on X o. We say p X o is a complex critical point, whenever l u (p) is orthogonal to the tangent space of X at p and p X o reg. The mle is a critical point (in the cases we consider).

Two experiments and ML degree Two experiments Consider vectorized datasets u for likelihood function l u (p) on X 44. u ={160,8,16,24,32,200,16,8,8,24,176,32,16,40,8,232} 191 complex including 25 real u ={292,45,62,41,142,51,44,42,213,75,67,63,119,85,58,70} 191 complex including 3 real The # of complex solutions was always 191 (this is the ML degree). For general choices of u we get the same number of complex critical points. This number is called the ML degree of a variety.

Two experiments and ML degree Two experiments Consider vectorized datasets u for likelihood function l u (p) on X 44. u ={160,8,16,24,32,200,16,8,8,24,176,32,16,40,8,232} 191 complex including 25 real u ={292,45,62,41,142,51,44,42,213,75,67,63,119,85,58,70} 191 complex including 3 real The # of complex solutions was always 191 (this is the ML degree). For general choices of u we get the same number of complex critical points. This number is called the ML degree of a variety.

Previous Computational Results Consider the mixture model M mn for m-sided red dice and n-sided blue dice. Denote its Zariski closure by X mn. Theorem The ML-degrees of X mn include the following: (m, n) 3 4 5 6 7 8 9 10 11 12 3 10 26 58 122 250 506 1018 2042 4090 8186 4 26 191 843 3119 6776????? Reference: Maximum likelihood for matrices with rank constraints J. Hauenstein, [], and B. Sturmfels using Bertini. Any conjectures? (Hint add 6.) Maximum likelihood geometry in the presence of sampling and model zeros gave supporting evidence for up to n= 15. E. Gross and [] using Macaulay2.

Previous Computational Results Consider the mixture model M mn for m-sided red dice and n-sided blue dice. Denote its Zariski closure by X mn. Theorem The ML-degrees of X mn include the following: (m, n) 3 4 5 6 7 8 9 10 11 12 3 10 26 58 122 250 506 1018 2042 4090 8186 4 26 191 843 3119 6776????? Reference: Maximum likelihood for matrices with rank constraints J. Hauenstein, [], and B. Sturmfels using Bertini. Any conjectures? (Hint add 6.) Maximum likelihood geometry in the presence of sampling and model zeros gave supporting evidence for up to n= 15. E. Gross and [] using Macaulay2.

Euler characteristics and ML degrees Huh proves that the ML degrees are an Euler characteristic in the smooth case. Let X be a smooth variety of P n+1 defined by homogeneous polynomials and the linear constraint p 0 + p 1 + +p n p s = 0. Let X o denote the open variety X \{coordinate hyperplanes}. Theorem [Huh] The ML degree of the smooth variety X equals the signed Euler characteristic of X o, i.e. χ(x o )=( 1) dimx MLdegree(X). The independence model (one sided coin) is smooth but the mixture model is not.

Euler characteristics and ML degrees Huh proves that the ML degrees are an Euler characteristic in the smooth case. Let X be a smooth variety of P n+1 defined by homogeneous polynomials and the linear constraint p 0 + p 1 + +p n p s = 0. Let X o denote the open variety X \{coordinate hyperplanes}. Theorem [Huh] The ML degree of the smooth variety X equals the signed Euler characteristic of X o, i.e. χ(x o )=( 1) dimx MLdegree(X). The independence model (one sided coin) is smooth but the mixture model is not.

Independence model ML degree Use Huh s result to give a topological proof. Let Z denote the Zariski closure of the independence model, a variety of P 16. The following map gives an algebraic geometry parameterization of Z. ([r 1,r 2,r 3,r 4 ],[b 1,b 2,b 3,b 4 ]) P 3 P 3 Z [ ] r i b j, r i b j ij where i,j {1,2,3,4}. Let O denote P 3 \V(x 0 x 1 x 2 x 3 (x 0 + x 1 + x 2 + x 3 )). Then we have a parameterization of X o given by O O X o because ij r i b j =( i r i ) ( j b j ). Using inclusion-exclusion and the additive properties of Euler characteristics we see that χ(o)= 1. By the product property χ(o O)=1. This parameterization is a homeomorphism thus χ(o O)=χ(X o ).

Independence model ML degree Use Huh s result to give a topological proof. Let Z denote the Zariski closure of the independence model, a variety of P 16. The following map gives an algebraic geometry parameterization of Z. ([r 1,r 2,r 3,r 4 ],[b 1,b 2,b 3,b 4 ]) P 3 P 3 Z [ ] r i b j, r i b j ij where i,j {1,2,3,4}. Let O denote P 3 \V(x 0 x 1 x 2 x 3 (x 0 + x 1 + x 2 + x 3 )). Then we have a parameterization of X o given by O O X o because ij r i b j =( i r i ) ( j b j ). Using inclusion-exclusion and the additive properties of Euler characteristics we see that χ(o)= 1. By the product property χ(o O)=1. This parameterization is a homeomorphism thus χ(o O)=χ(X o ).

Independence model ML degree Use Huh s result to give a topological proof. Let Z denote the Zariski closure of the independence model, a variety of P 16. The following map gives an algebraic geometry parameterization of Z. ([r 1,r 2,r 3,r 4 ],[b 1,b 2,b 3,b 4 ]) P 3 P 3 Z [ ] r i b j, r i b j ij where i,j {1,2,3,4}. Let O denote P 3 \V(x 0 x 1 x 2 x 3 (x 0 + x 1 + x 2 + x 3 )). Then we have a parameterization of X o given by O O X o because ij r i b j =( i r i ) ( j b j ). Using inclusion-exclusion and the additive properties of Euler characteristics we see that χ(o)= 1. By the product property χ(o O)=1. This parameterization is a homeomorphism thus χ(o O)=χ(X o ).

ML degrees of singular models The ML degree is a stratified topological invariant. Let (S 1,S 2,...,S k ) denote a Whitney stratification of X o. When X o is smooth the Whitney stratification is (X o ). When k = 2, S1 = X o reg and S 2 = X o sing. Theorem Given reduced irreducible X o with Whitney stratification (S 1,...,S k ), we have χ ( X o reg) = e11 MLdegree ( S 1 ) +e21 MLdegree ( S 2 ) + +ek1 MLdegree ( S k ). The e ij are topological invariants called Euler obstructions, which can be considered as the topological multiplicity of the singularities. This theorem is a corollary of Botong Wang and Nero Budur s result that relates ML degrees to Gaussian degrees. The Euler obstruction e 11 always equals ( 1) dimx o.

ML degrees of singular models The ML degree is a stratified topological invariant. Let (S 1,S 2,...,S k ) denote a Whitney stratification of X o. When X o is smooth the Whitney stratification is (X o ). When k = 2, S1 = X o reg and S 2 = X o sing. Theorem Given reduced irreducible X o with Whitney stratification (S 1,...,S k ), we have χ ( X o reg) = e11 MLdegree ( S 1 ) +e21 MLdegree ( S 2 ) + +ek1 MLdegree ( S k ). The e ij are topological invariants called Euler obstructions, which can be considered as the topological multiplicity of the singularities. This theorem is a corollary of Botong Wang and Nero Budur s result that relates ML degrees to Gaussian degrees. The Euler obstruction e 11 always equals ( 1) dimx o.

ML degrees of singular models The ML degree is a stratified topological invariant. Let (S 1,S 2,...,S k ) denote a Whitney stratification of X o. When X o is smooth the Whitney stratification is (X o ). When k = 2, S1 = X o reg and S 2 = X o sing. Theorem Given reduced irreducible X o with Whitney stratification (S 1,...,S k ), we have χ ( X o reg) = e11 MLdegree ( S 1 ) +e21 MLdegree ( S 2 ) + +ek1 MLdegree ( S k ). The e ij are topological invariants called Euler obstructions, which can be considered as the topological multiplicity of the singularities. This theorem is a corollary of Botong Wang and Nero Budur s result that relates ML degrees to Gaussian degrees. The Euler obstruction e 11 always equals ( 1) dimx o.

Ternary Cubic Example for Singular Case We determine the ML degree of a singular X using the previous theorem. Let X be defined by p 2 (p 1 p 2 ) 2 (p 0 p 2 ) 3 = p 0 + p 1 + p 2 p s = 0. The Whitney stratification of X o consists of S 1 the regular points (so S 1 = X) and S 2 the singular point which is [1:1:1:3], χ(s 1 )=e 11 MLdegree(X)+ e 21 MLdegree ( S 2 ). S 2 is a point so S 2 = S 2 and MLdegree ( S2 ) = 1. The Euler obstruction e 21 is the signed multiplicity of the singular point, i.e. e 21 = 2. In general, the sign depends on the dimension of S2 and the multiplicity is actually the Euler characteristic of a link [Kashiwara]. The Euler obstruction e 11 always equals ( 1) dimx.

Ternary Cubic Example for Singular Case We determine the ML degree of a singular X using the previous theorem. Let X be defined by p 2 (p 1 p 2 ) 2 (p 0 p 2 ) 3 = p 0 + p 1 + p 2 p s = 0. The Whitney stratification of X o consists of S 1 the regular points (so S 1 = X) and S 2 the singular point which is [1:1:1:3], χ(s 1 )=e 11 MLdegree(X)+ e 21 MLdegree ( S 2 ). S 2 is a point so S 2 = S 2 and MLdegree ( S2 ) = 1. The Euler obstruction e 21 is the signed multiplicity of the singular point, i.e. e 21 = 2. In general, the sign depends on the dimension of S2 and the multiplicity is actually the Euler characteristic of a link [Kashiwara]. The Euler obstruction e 11 always equals ( 1) dimx.

Ternary Cubic Example for Singular Case We determine the ML degree of a singular X using the previous theorem. Let X be defined by p 2 (p 1 p 2 ) 2 (p 0 p 2 ) 3 = p 0 + p 1 + p 2 p s = 0. The Whitney stratification of X o consists of S 1 the regular points (so S 1 = X) and S 2 the singular point which is [1:1:1:3], χ(s 1 )=e 11 MLdegree(X)+ e 21 MLdegree ( S 2 ). S 2 is a point so S 2 = S 2 and MLdegree ( S2 ) = 1. The Euler obstruction e 21 is the signed multiplicity of the singular point, i.e. e 21 = 2. In general, the sign depends on the dimension of S2 and the multiplicity is actually the Euler characteristic of a link [Kashiwara]. The Euler obstruction e 11 always equals ( 1) dimx.

Returning to the mixture model We apply the Whitney stratification-ml degree theorem to X o mn. The Whitney stratification of X o = X o mn is given by (S 1,S 2 ) where S 1 are the regular points X o mn \Z o mn and S 2 are the singular points Z o mn. Denote the singular points of X o mn by Z o mn. Z o mn should be thought of as the set of rank 1 matrices (Z mn is the Zariski closure of the independence model) By the theorem we have χ(x o mn \Z o mn )=e 11MLdegree(X mn )+e 21 MLdegree(Z mn ). It is already well known e 11 = 1 and MLdegree(Z mn )=1. The first lemma we would prove determines e 21 : e 21 =( 1) m+n 1 (min{m,n} 1). If we knew χ(x o mn\z o mn), then we would know MLdegree(X mn ).

Returning to the mixture model We apply the Whitney stratification-ml degree theorem to X o mn. The Whitney stratification of X o = X o mn is given by (S 1,S 2 ) where S 1 are the regular points X o mn \Z o mn and S 2 are the singular points Z o mn. Denote the singular points of X o mn by Z o mn. Z o mn should be thought of as the set of rank 1 matrices (Z mn is the Zariski closure of the independence model) By the theorem we have χ(x o mn \Z o mn )=e 11MLdegree(X mn )+e 21 MLdegree(Z mn ). It is already well known e 11 = 1 and MLdegree(Z mn )=1. The first lemma we would prove determines e 21 : e 21 =( 1) m+n 1 (min{m,n} 1). If we knew χ(x o mn\z o mn), then we would know MLdegree(X mn ).

Returning to the mixture model We apply the Whitney stratification-ml degree theorem to X o mn. The Whitney stratification of X o = X o mn is given by (S 1,S 2 ) where S 1 are the regular points X o mn \Z o mn and S 2 are the singular points Z o mn. Denote the singular points of X o mn by Z o mn. Z o mn should be thought of as the set of rank 1 matrices (Z mn is the Zariski closure of the independence model) By the theorem we have χ(x o mn \Z o mn )=e 11MLdegree(X mn )+e 21 MLdegree(Z mn ). It is already well known e 11 = 1 and MLdegree(Z mn )=1. The first lemma we would prove determines e 21 : e 21 =( 1) m+n 1 (min{m,n} 1). If we knew χ(x o mn\z o mn), then we would know MLdegree(X mn ).

Determining the Euler characteristic χ(x o mn \Z o mn ) This is our main theorem. If we knew χ(xmn\z o mn), o then we would know MLdegree(X mn ). Let Λ m be a sequence of m 1 integers (λ 1,λ 2,...,λ m 1 ). Theorem [ - and B. Wang] Fix m greater than or equal to 2. Then, there exists Λ m such that χ(x o mn \Z o mn )=( 1)n 1 1 i m 1 λ i i+ 1 1 i m 1 λ i i+ 1 in 1. Now we prove the conjecture of Hauenstein, [], Sturmfels.

Using the main theorem Fix m=3. ( χ(x3n\z o 3n)=( 1) o n 1 λ1 2 + λ ) ( 2 λ1 3 2 1n 1 + λ ) 2 3 2n 1. χ(x o mn\z o mn)= MLdegree(X 3n )+( 1) 3+n 1 (min{3,n} 1). MLdegree(X 32 )=1 yields the relation λ 1 λ 2 = 0. MLdegree(X 33 )=10 yields the relation λ 2 = 12. MLdegree(X 3n )= ( 2 n+1 6 ) +( 1) n ((min{3,n} 3)) Main idea: For fixed m, if we knew MLdegree(X m2 ),MLdegree(X m3 ),...,MLdegree(X mm ) then we can solve for Λ m =(λ 1,...,λ m 1 ) thereby giving a closed form expression for MLdegree(X mn ) for all n.

Using the main theorem Fix m=3. ( χ(x3n\z o 3n)=( 1) o n 1 λ1 2 + λ ) ( 2 λ1 3 2 1n 1 + λ ) 2 3 2n 1. χ(x o mn\z o mn)= MLdegree(X 3n )+( 1) 3+n 1 (min{3,n} 1). MLdegree(X 32 )=1 yields the relation λ 1 λ 2 = 0. MLdegree(X 33 )=10 yields the relation λ 2 = 12. MLdegree(X 3n )= ( 2 n+1 6 ) +( 1) n ((min{3,n} 3)) Main idea: For fixed m, if we knew MLdegree(X m2 ),MLdegree(X m3 ),...,MLdegree(X mm ) then we can solve for Λ m =(λ 1,...,λ m 1 ) thereby giving a closed form expression for MLdegree(X mn ) for all n.

Do Better Main idea (from previous slide): For fixed m, if we knew MLdegree(X m2 ),MLdegree(X m3 ),...,MLdegree(X mm ) then we can solve for Λ m =(λ 1,...,λ m 1 ) thereby giving a closed form expression for MLdegree(X mn ) for all n. We can recursively determine Λ m thereby giving a closed form formula for MLdegree(X mn ) for fixed m but any n. Note MLdegree(Xmn )=MLdegree(X nm ). Prove λm 1 of Λ m is (m 1)m!. Closed form expressions for fixed m and n m: MLdegX 4n = 25 1 n 1 40 2 n 1 + 23 3 n 1 MLdegX 5n = 90 1 n 1 + 260 2 n 1 270 3 n 1 + 96 4 n 1 MLdegX 6n = 301 1 n 1 1400 2 n 1 +2520 3 n 1 2016 4 n 1 +600 5 n 1

Do Better Main idea (from previous slide): For fixed m, if we knew MLdegree(X m2 ),MLdegree(X m3 ),...,MLdegree(X mm ) then we can solve for Λ m =(λ 1,...,λ m 1 ) thereby giving a closed form expression for MLdegree(X mn ) for all n. We can recursively determine Λ m thereby giving a closed form formula for MLdegree(X mn ) for fixed m but any n. Note MLdegree(Xmn )=MLdegree(X nm ). Prove λm 1 of Λ m is (m 1)m!. Closed form expressions for fixed m and n m: MLdegX 4n = 25 1 n 1 40 2 n 1 + 23 3 n 1 MLdegX 5n = 90 1 n 1 + 260 2 n 1 270 3 n 1 + 96 4 n 1 MLdegX 6n = 301 1 n 1 1400 2 n 1 +2520 3 n 1 2016 4 n 1 +600 5 n 1

Do Better Main idea (from previous slide): For fixed m, if we knew MLdegree(X m2 ),MLdegree(X m3 ),...,MLdegree(X mm ) then we can solve for Λ m =(λ 1,...,λ m 1 ) thereby giving a closed form expression for MLdegree(X mn ) for all n. We can recursively determine Λ m thereby giving a closed form formula for MLdegree(X mn ) for fixed m but any n. Note MLdegree(Xmn )=MLdegree(X nm ). Prove λm 1 of Λ m is (m 1)m!. Closed form expressions for fixed m and n m: MLdegX 4n = 25 1 n 1 40 2 n 1 + 23 3 n 1 MLdegX 5n = 90 1 n 1 + 260 2 n 1 270 3 n 1 + 96 4 n 1 MLdegX 6n = 301 1 n 1 1400 2 n 1 +2520 3 n 1 2016 4 n 1 +600 5 n 1

Using Numerical Algebraic Geometry Witness sets allow us to use parallelizable algorithms. Treat the u ij as parameter values that we can adjust, If we have a set of critical points for generic data, then we can solve any specific instance of data quickly using a parameter homotopy. Critical points of l u for u general are taken to critical points of lu for u specific by a parameter homotopy > 160 8 16 24 32 200 16 8 8 24 176 32 16 40 8 232 191 points > 191points denotes a random complex number.

Thank You Contact information Jose Israel Rodriguez jo.ro@nd.edu http://www.nd.edu/~jrodri18/ SIAM: AG15 in Daejeon, Korea, Aug 3-7. Co-organizing a mini-sympoium with Xiaoxian Tang: Maximum Likelihood Degrees and Critical Points http://www.nd.edu/~jrodri18/quicklinks/ag15rt/

Outline Statistics Mixture model Applied algebraic geometry Critical points Topology ML degree Euler obstructions