Note on the convex hull of the Stiefel manifold

Similar documents
POLARS AND DUAL CONES

1 Introduction CONVEXIFYING THE SET OF MATRICES OF BOUNDED RANK. APPLICATIONS TO THE QUASICONVEXIFICATION AND CONVEXIFICATION OF THE RANK FUNCTION

Week 3: Faces of convex sets

Appendix B Convex analysis

Affine iterations on nonnegative vectors

Optimality Conditions for Nonsmooth Convex Optimization

Generalized Power Method for Sparse Principal Component Analysis

Subdifferential representation of convex functions: refinements and applications

Sufficiency of Cut-Generating Functions

An extrinsic look at the Riemannian Hessian

Existence and uniqueness of solutions for a continuous-time opinion dynamics model with state-dependent connectivity

Split Rank of Triangle and Quadrilateral Inequalities

THE MIXING SET WITH FLOWS

EQUIVALENCE OF TOPOLOGIES AND BOREL FIELDS FOR COUNTABLY-HILBERT SPACES

Elements of Convex Optimization Theory

Rank-Constrainted Optimization: A Riemannian Manifold Approach

Topological properties

Introduction to Real Analysis Alternative Chapter 1

Generalized Shifted Inverse Iterations on Grassmann Manifolds 1

ADJOINTS, ABSOLUTE VALUES AND POLAR DECOMPOSITIONS

Introduction to Functional Analysis

Extending a two-variable mean to a multi-variable mean

Convex Analysis and Economic Theory Winter 2018

SWITCHED SYSTEMS THAT ARE PERIODICALLY STABLE MAY BE UNSTABLE

Chapter 1. Preliminaries. The purpose of this chapter is to provide some basic background information. Linear Space. Hilbert Space.

Functional Analysis Review

Generalized power method for sparse principal component analysis

FUNCTIONAL ANALYSIS LECTURE NOTES: COMPACT SETS AND FINITE-DIMENSIONAL SPACES. 1. Compact Sets

Linear Algebra. Session 12

Geometry and topology of continuous best and near best approximations

Course 212: Academic Year Section 1: Metric Spaces

l(y j ) = 0 for all y j (1)

Lecture 2: A crash course in Real Analysis

Lecture 1: Convex Sets January 23

Linear Algebra Massoud Malek

Elementary linear algebra

Generalized Power Method for Sparse Principal Component Analysis

Existence and Uniqueness

A Review of Linear Programming

When are Sums Closed?

Extreme points of compact convex sets

Information-Theoretic Limits of Matrix Completion

Problem Set 2: Solutions Math 201A: Fall 2016

j=1 [We will show that the triangle inequality holds for each p-norm in Chapter 3 Section 6.] The 1-norm is A F = tr(a H A).

Unbounded Convex Semialgebraic Sets as Spectrahedral Shadows

Bodies of constant width in arbitrary dimension

1 Principal component analysis and dimensional reduction

Singular Value Decomposition

DISCRETIZED CONFIGURATIONS AND PARTIAL PARTITIONS

Conditions for Robust Principal Component Analysis

Strong duality in Lasserre s hierarchy for polynomial optimization

H 2 -optimal model reduction of MIMO systems

On Semicontinuity of Convex-valued Multifunctions and Cesari s Property (Q)

CHAPTER 7. Connectedness

Near convexity, metric convexity, and convexity

4 Countability axioms

Introduction to Convex and Quasiconvex Analysis

ECE 8201: Low-dimensional Signal Models for High-dimensional Data Analysis

Twice Differentiable Characterizations of Convexity Notions for Functions on Full Dimensional Convex Sets

Only Intervals Preserve the Invertibility of Arithmetic Operations

Notes on nets and convergence in topology

1 Directional Derivatives and Differentiability

Spectral Theory, with an Introduction to Operator Means. William L. Green

Lecture Notes 9: Constrained Optimization

NOWHERE LOCALLY UNIFORMLY CONTINUOUS FUNCTIONS

We will begin our study of topology from a set-theoretic point of view. As the subject

Lecture 6 - Convex Sets

Division of the Humanities and Social Sciences. Sums of sets, etc.

Rank-constrained Optimization: A Riemannian Manifold Approach

Subgradient. Acknowledgement: this slides is based on Prof. Lieven Vandenberghes lecture notes. definition. subgradient calculus

Functional Analysis I

Assignment 1: From the Definition of Convexity to Helley Theorem

Robust Solutions to Multi-Objective Linear Programs with Uncertain Data

Principal Component Analysis

On the topology of pointwise convergence on the boundaries of L 1 -preduals. Warren B. Moors

NORMS ON SPACE OF MATRICES

Jónsson posets and unary Jónsson algebras

Convex hull of two quadratic or a conic quadratic and a quadratic inequality

Linear maps preserving AN -operators

Mathematics for Economists

CHAPTER 9. Embedding theorems

On Firmness of the State Space and Positive Elements of a Banach Algebra 1

Universal low-rank matrix recovery from Pauli measurements

Convex Analysis and Economic Theory Fall 2018 Winter 2019

Notes for Functional Analysis

Introduction to Convex Analysis Microeconomics II - Tutoring Class

Chapter 16. Manifolds and Geodesics Manifold Theory. Reading: Osserman [7] Pg , 55, 63-65, Do Carmo [2] Pg ,

Robust Principal Component Pursuit via Alternating Minimization Scheme on Matrix Manifolds

arxiv: v1 [math.fa] 2 Jan 2017

ONP-MF: An Orthogonal Nonnegative Matrix Factorization Algorithm with Application to Clustering

REAL RENORMINGS ON COMPLEX BANACH SPACES

arxiv: v1 [math.na] 9 Feb 2013

Reconstruction from Anisotropic Random Measurements

Computational math: Assignment 1

OPERATOR-LIPSCHITZ ESTIMATES FOR THE SINGULAR VALUE FUNCTIONAL CALCULUS

Math 341: Convex Geometry. Xi Chen

Throughout these notes we assume V, W are finite dimensional inner product spaces over C.

arxiv:math/ v1 [math.sg] 23 Nov 2005

Dimensionality Reduction: PCA. Nicholas Ruozzi University of Texas at Dallas

UNIT 6: The singular value decomposition.

Transcription:

Note on the convex hull of the Stiefel manifold Kyle A. Gallivan P.-A. Absil July 9, 00 Abstract In this note, we characterize the convex hull of the Stiefel manifold and we find its strong convexity parameter. We also introduce the notion of roundness of a set and show that the Stiefel manifold is round. Key words. convex hull, Stiefel manifold, strong convexity Definitions and notation Let p and n be positive integers with p n. The (compact) Stiefel manifold is the set St(p,n) = {X R n p : X T X = I p }, where I p denotes the identity matrix of size p. We view St(p,n) as a subset of R n p endowed with the Frobenius norm. The fact that St(p, n) has a natural manifold structure is not relevant here. The convex hull of a subset V of a vector space E, denoted conv(v), is the smallest convex set containing V. The closed convex hull of a subset of R n is the closure of its convex hull. The support function of a nonempty subset S of R n is the function S S : R n R {+ } defined by R n x S S (x) := sup{ s,x : s S}. The spectral norm X of a matrix X R n p is its induced -norm, which is equal to the largest singular value of X and to the square root of the largest eigenvalue of X T X. We let B sp (p,n) := {X R n p : X } = {X R n p : X T X I p } denote the unit spectral ball. If E is a normed vector space, the convex set S E is strongly convex if there exists a constant σ > 0 such that, for all x,y S and all α [0,], the following inclusion holds: αx+( α)y + σ α( α) x y B S, This paper presents research results of the Belgian Network DYSCO (Dynamical Systems, Control, and Optimization), funded by the Interuniversity Attraction Poles Programme, initiated by the Belgian State, Science Policy Office. The scientific responsibility rests with its authors. Department of Mathematics, Florida State University, Tallahassee, Florida, 3306-450, USA http://www.math.fsu.edu/~gallivan/ Department of Mathematical Engineering, Université catholique de Louvain, B-348 Louvain-la-Neuve, Belgium http://www.inma.ucl.ac.be/~absil/

where B denotes the unit sphere in E; see [JNRS0]. Since S is convex by assumption, x,y S can be equivalently replaced by x,y S where S denotes the boundary of the set. Strong convexity thus guarantees that a straight line cannot be embedded in any portion of the boundary of the convex set. Convex hull of the Stiefel manifold The next result shows that the convex hull of the Stiefel manifold is the (closed) unit ball in the spectral norm. The principle of the proof can be found in [JNRS0, 3.4]. Theorem. (Journée et al.) The convex hull of the Stiefel manifold is the (closed) unit spectral ball: conv(st(p,n)) = B sp (p,n) Proof. This proof is a detailed version of the argument given in [JNRS0, 3.4]. In view of proposition.. and theorem.. in [HUL0], two subsets of R n have the same closed convex hull if and only if they have the same support function. The support function of the unit spectral ball B sp (p,n) is the trace norm: p S Bsp(p,n)(C) = σ i (C). Equivalently stated, the dual norm of the spectral norm is the trace norm; see, e.g., [RFP07, prop..]. The support function of St(p,n) is also the trace norm: i= S St(p,n) (C) = p σ i (C); i= see [JNRS0, prop. 7]. Hence the B sp (p,n) and St(p,n) have the same closed convex hull. Since the spectral norm is a norm, B sp (p,n) is convex, and thus equal to its closed convex hull. Since St(p,n) is compact, its convex hull is compact, hence its closed convex hull is its convex hull. Thus the convex hull of St(p,n) is B sp (p,n). 3 Convex hull of the Stiefel manifold without support functions In this section, we give an alternate proof of Theorem.. It only applies to p n/. On the other hand, it does not need results pertaining to support functions, and it also yields the property where conv(st(p,n)) = conv (St(p,n)), conv (S) := {αs +( α)s : s S,s S,α [0,]} is the union of all line segments with extremities in S. We will use the following results to show that conv(st(p,n)) = B sp (p,n).

Lemma 3. St(p,n) conv (St(p,n)) conv(st(p,n)) () St(p,n) B sp (p,n) () B sp (p,n) is convex (3) Proof. Theinclusions()and()followimmediately fromthedefinitions. Theconvexity ofb sp (p,n) follows from the fact that B sp (p,n) is a ball for a norm (specifically, the spectral norm). We can now characterize conv(st(p, n)). Theorem 3. If p n/ then conv(st(p,n)) = conv (St(p,n)) = B sp (p,n). Proof. Let Q St(p,n) and Q St(p,n). For α [0,], Q α = αq +( α)q conv (St(p,n)). By Lemma 3., St(p,n) B sp (p,n) and B sp (p,n) is convex. Therefore, conv (St(p,n)) B sp (p,n). Now let Z B sp (p,n) and Z = UΣV T be a thin SVD with σ i [0,], i =,...,p, U R n p, and V R p p. Let W R n p be such that [ U W ] is orthonormal. Such a W exists since p n/. Let Q = (UΣ + W I p Σ )V T and Q = (UΣ W I p Σ )V T. Then Q T Q = V(Σ +(I p Σ ))V T = VV T = I p. Likewise for Q. It is easily verified that Z = (Q +Q )/ and so B sp (p,n) conv (St(p,n)). Therefore, B sp (p,n) = conv (St(p,n)). We have St(p,n) B sp (p,n) = conv (St(p,n)) conv(st(p,n)). By Lemma 3., B sp (p,n) is convex and therefore so is conv (St(p,n)). conv (St(p,n)) is then a convex subset of conv(st(p, n)) that contains St(p, n). Since conv(st(p, n)) is the smallest convex set that contains St(p,n) it follows that conv(st(p,n)) = conv (St(p,n)) = B sp (p,n). So conv(st(p,n)) is characterized by the singular values of matrices in R n p and only requires all convex combinations of pairs of matrices from St(p, n). 4 conv(st(p,n)) is not strongly convex for p > When p =, St(,n) is the unit sphere in R n which is clearly strongly convex. This is not the case for p >, as pointed out in [JNRS0, 3.4]. A detailed proof of this result follows. Theorem 4. Let n. If p then conv(st(p,n)) is not strongly convex, in the sense given in Section. Proof. Define X R n p and Y R n p as [ ] X = UI n V T Ik 0, Y = U V T 0 0 n k 3

with k. Note that X conv(st(p,n)) and Y conv(st(p,n)). Let 0 α. We have [ ] Ik 0 Z α = αx +( α)y = U V T 0 αi n k Z α conv(st(p,n)). So the line is completely contained in conv(st(p,n)) hence there is no room to fit a ball around the points on the line. It follows that conv(st(p, n)) is not strongly convex. 5 St(p, n) is round We introduce the notion of roundness, which is weaker than strong convexity. The roundness of a set measures how far away the boundary of the set is from containing aligned points. Definition 5. Let E be a normed vector space and let S be a subset of E, not necessarily convex. The value ρ > 0 is a roundnesslower bound of S if, for all x,y S and all α [0,], the following holds: αx+( α)y + ρ α( α) x y B S =, where B is the unit ball in E. The supremum of the roundness lower bounds is termed the roundness of S and denoted by ρ S. A round set is a set whose roundness is nonzero. It is clear, in view of the previous section, that conv(st(p,n)) is not round. For the general case p n, we can obtain a bound on ρ St(p,n) using the following result that follows directly from the definition of roundness. Lemma 5. Let E be a normed vector space. If V S E, then We then have the following result. ρ S ρ V. Theorem 5.3 The roundness of the Stiefel manifold, ρ St(p,n), satisfies p ρ St(p,n) Proof. The Stiefel manifold St(p,n) is a subset of B p, the sphere of radius p in R n p. So ρ St(p,n) cannot be smaller than ρ B p. It is shown in [JNRS0] that the convexity parameter of the sphere of radius r is r, from which one can deduce the lower bound. For the upper bound, consider X St(p,n) and Y St(p,n) such that only their first column differ. This yields a problem on the unit sphere in R n and therefore ρ St(p,n). If we require p n/ as in Theorem 3., then ρ St(p,n) can be determined. 4

Theorem 5.4 If p n/ then the roundness of St(p,n) is ρ St(p,n) = p. Proof. It is easily shown that St(p,n) = St(p,n). It follows that the set of pairwise convex combinations in Definition 5. is conv (St(p,n)) = B sp (p,n) given in Section. By Lemma 3. we know conv (St(p,n)) = B sp (p,n) = conv(st(p,n)) and therefore our weaker definition checks the intersection of balls around lines in conv(st(p, n)) with St(p, n) rather than conv(st(p, n)). Let Z = UΣV T, Q = (UΣ+W I p Σ )V T, and Q = (UΣ W I p Σ )V T be as in the proof of Theorem 3.. Recall that Q and Q belong to St(p,n) and that Z = αq +( α)q with α = /. Observe that UV T belongs to St(p,n). Hence, from the definition of ρ St(p,n), we must have This yields Z UV T ρ St(p,n) 4 Q Q. ρ St(p,n) p i= ( σ i) p i= ( σ i ), where the σ i s are arbitrary in [0,]. The right-hand side, for σ =... = σ p =: s, is equal to p( s) p( s ) = p(+s). As s goes to, this goes to p. Hence ρ St(p,n) p. Since we also know that ρ St(p,n) p, the result follows. References [HUL0] Jean-Baptiste Hiriart-Urruty and Claude Lemaréchal. Fundamentals of convex analysis. Grundlehren Text Editions. Springer-Verlag, Berlin, 00. Abridged version of ıt Convex analysis and minimization algorithms. I [Springer, Berlin, 993; MR640 (95m:9000)] and ıt II [ibid.; MR9540 (95m:9000)]. [JNRS0] Michel Journée, Yurii Nesterov, Peter Richtárik, and Rodolphe Sepulchre. Generalized power method for sparse principal component analysis. J. Mach. Learn. Res., :57 553, 00. [RFP07] Benjamin Recht, Maryam Fazel, and Pablo A. Parrilo. Guaranteed minimum-rank solutions of linear matrix equations via nuclear norm minimization, 007. arxiv:0706.438. 5