Lecture 1: Background on Convex Analysis

Similar documents
Shiqian Ma, MAT-258A: Numerical Optimization 1. Chapter 4. Subgradient

Subgradients. subgradients and quasigradients. subgradient calculus. optimality conditions via subgradients. directional derivatives

Subgradients. subgradients. strong and weak subgradient calculus. optimality conditions via subgradients. directional derivatives

Subgradient. Acknowledgement: this slides is based on Prof. Lieven Vandenberghes lecture notes. definition. subgradient calculus

LECTURE 12 LECTURE OUTLINE. Subgradients Fenchel inequality Sensitivity in constrained optimization Subdifferential calculus Optimality conditions

Optimization and Optimal Control in Banach Spaces

Convex Functions. Pontus Giselsson

Convex Analysis Background

Math 273a: Optimization Subgradients of convex functions

Chapter 2 Convex Analysis

Optimality Conditions for Nonsmooth Convex Optimization

EE 546, Univ of Washington, Spring Proximal mapping. introduction. review of conjugate functions. proximal mapping. Proximal mapping 6 1

Convex Optimization. (EE227A: UC Berkeley) Lecture 4. Suvrit Sra. (Conjugates, subdifferentials) 31 Jan, 2013

8. Conjugate functions

A function(al) f is convex if dom f is a convex set, and. f(θx + (1 θ)y) < θf(x) + (1 θ)f(y) f(x) = x 3

The proximal mapping

Mathematics 530. Practice Problems. n + 1 }

Convexity in R n. The following lemma will be needed in a while. Lemma 1 Let x E, u R n. If τ I(x, u), τ 0, define. f(x + τu) f(x). τ.

Constrained Optimization and Lagrangian Duality

Primal/Dual Decomposition Methods

Convex Analysis Notes. Lecturer: Adrian Lewis, Cornell ORIE Scribe: Kevin Kircher, Cornell MAE

g 2 (x) (1/3)M 1 = (1/3)(2/3)M.

A SET OF LECTURE NOTES ON CONVEX OPTIMIZATION WITH SOME APPLICATIONS TO PROBABILITY THEORY INCOMPLETE DRAFT. MAY 06

EE/ACM Applications of Convex Optimization in Signal Processing and Communications Lecture 17

6. Proximal gradient method

1 Convexity, concavity and quasi-concavity. (SB )

Extreme Abridgment of Boyd and Vandenberghe s Convex Optimization

Linear and non-linear programming

Lecture: Duality of LP, SOCP and SDP

Constraint qualifications for convex inequality systems with applications in constrained optimization

Math 273a: Optimization Subgradients of convex functions

Lecture 3: Lagrangian duality and algorithms for the Lagrangian dual problem

9. Geometric problems

BASICS OF CONVEX ANALYSIS

CONVEX OPTIMIZATION, DUALITY, AND THEIR APPLICATION TO SUPPORT VECTOR MACHINES. Contents 1. Introduction 1 2. Convex Sets

Lecture 2: Convex functions

3. Convex functions. basic properties and examples. operations that preserve convexity. the conjugate function. quasiconvex functions

3. Convex functions. basic properties and examples. operations that preserve convexity. the conjugate function. quasiconvex functions

Introductory Lectures on Stochastic Optimization

Problem Set 6: Solutions Math 201A: Fall a n x n,

Handout 2: Elements of Convex Analysis

In Progress: Summary of Notation and Basic Results Convex Analysis C&O 663, Fall 2009

CHAPTER 2: CONVEX SETS AND CONCAVE FUNCTIONS. W. Erwin Diewert January 31, 2008.

Lecture 6 : Projected Gradient Descent

Convex Analysis and Optimization Chapter 4 Solutions

LECTURE 25: REVIEW/EPILOGUE LECTURE OUTLINE

Introductory Lectures on Stochastic Optimization

8. Geometric problems

Convex Optimization M2

LECTURE SLIDES ON BASED ON CLASS LECTURES AT THE CAMBRIDGE, MASS FALL 2007 BY DIMITRI P. BERTSEKAS.

Chapter 2: Preliminaries and elements of convex analysis

Geometric problems. Chapter Projection on a set. The distance of a point x 0 R n to a closed set C R n, in the norm, is defined as

Lecture 8. Strong Duality Results. September 22, 2008

LECTURE SLIDES ON CONVEX OPTIMIZATION AND DUALITY THEORY TATA INSTITUTE FOR FUNDAMENTAL RESEARCH MUMBAI, INDIA JANUARY 2009 PART I

Almost Convex Functions: Conjugacy and Duality

A notion of Total Dual Integrality for Convex, Semidefinite and Extended Formulations

Convex Optimization and an Introduction to Congestion Control. Lecture Notes. Fabian Wirth

Convex analysis and profit/cost/support functions

Convex Optimization Conjugate, Subdifferential, Proximation

Lecture 12 Unconstrained Optimization (contd.) Constrained Optimization. October 15, 2008

Semicontinuous functions and convexity

Lectures on Geometry

Conditional Gradient (Frank-Wolfe) Method

Convex Optimization Theory

THE UNIQUE MINIMAL DUAL REPRESENTATION OF A CONVEX FUNCTION

5. Duality. Lagrangian

6. Proximal gradient method

GEOMETRIC APPROACH TO CONVEX SUBDIFFERENTIAL CALCULUS October 10, Dedicated to Franco Giannessi and Diethard Pallaschke with great respect

Lecture 1: Convex Sets January 23

Coordinate Update Algorithm Short Course Subgradients and Subgradient Methods

Lecture 2: Convex Sets and Functions

Dedicated to Michel Théra in honor of his 70th birthday

Convex Optimization Boyd & Vandenberghe. 5. Duality

Convex Optimization Theory. Chapter 5 Exercises and Solutions: Extended Version

8. Geometric problems

SYMBOLIC CONVEX ANALYSIS

arxiv:math/ v1 [math.fa] 4 Feb 1993

Strongly convex functions, Moreau envelopes and the generic nature of convex functions with strong minimizers

HW1 solutions. 1. α Ef(x) β, where Ef(x) is the expected value of f(x), i.e., Ef(x) = n. i=1 p if(a i ). (The function f : R R is given.

Subgradient Method. Ryan Tibshirani Convex Optimization

Continuity of convex functions in normed spaces

The sum of two maximal monotone operator is of type FPV

Convex Optimization and Modeling

Frank-Wolfe Method. Ryan Tibshirani Convex Optimization

Convex Functions. Daniel P. Palomar. Hong Kong University of Science and Technology (HKUST)

LECTURE 4 LECTURE OUTLINE

MATH 829: Introduction to Data Mining and Analysis Computing the lasso solution

5. Subgradient method

Continuous Sets and Non-Attaining Functionals in Reflexive Banach Spaces

Subdifferentials of convex functions

Necessary and Sufficient Conditions for the Existence of a Global Maximum for Convex Functions in Reflexive Banach Spaces

Dual Proximal Gradient Method

CSCI : Optimization and Control of Networks. Review on Convex Optimization

Inequality Constraints

Analysis and Linear Algebra. Lectures 1-3 on the mathematical tools that will be used in C103

Math 273a: Optimization Subgradient Methods

Introduction to Convex and Quasiconvex Analysis

A Brief Review on Convex Optimization

Lecture 7: September 17

Lecture 6: September 17

Transcription:

Lecture 1: Background on Convex Analysis John Duchi PCMI 2016

Outline I Convex sets 1.1 Definitions and examples 2.2 Basic properties 3.3 Projections onto convex sets 4.4 Separating and supporting hyperplanes II Convex functions 1.1 Definitions 2.2 Subgradients and directional derivatives 3.3 Optimality properties 4.4 Calculus rules

Why convex analysis and optimization? Consider problem minimize x f(x) subject to x X. When is this (efficiently) solvable? When things are convex If we can formulate a numerical problem as minimization of a convex function f over a convex set X, then (roughly) it is solvable

Convex sets Definition A set C R n is convex if for any x, y C tx + (1 t)y C for all t C

Examples Hyperplane: Let a R n, b R, C := {x R n : a, x = b}. Polyhedron: Let a 1, a 2,..., a m R n, b R m, C := {x : Ax b} = {x R n : a i, x b i }

Examples Norm balls: let be any norm, C := {x R n : x 1}

Basic properties Intersections: If C α, α A, are all convex sets, then C := is convex. α A C α Minkowski addition: If C, D are convex sets, then C + D := {x + y : x C, y D} is convex.

Basic properties Convex hulls: If x 1,..., x m are points, { m m } Conv{x 1,..., x m } := t i x i : t i 0, t i = 1. i=1 i=1

Projections Let C be a closed convex set. Define π C (x) := argmin{ x y 2 2 }. y C Existence: Assume x = 0 and that 0 C. Show that if x n is such that x n 2 inf y C y 2 then x n is Cauchy.

Projections Characterization of projection π C (x) := argmin y C { } x y 2 2 We have π C (x) is the projection of x onto C if and only if π C (x) x, y π C (x) 0 for all y C.

Separation Properties Separation of a point and convex set: Let C be closed convex and x C. Then there exists a separating hyperplane: there is v 0 and a constant b such that v, x > b and v, y b for all y C.

Separation Properties Separation of a point and convex set: Let C be closed convex and x C. Then there exists a separating hyperplane: there is v 0 and a constant b such that v, x > b and v, y b for all y C. Proof: Show the stronger result v, x v, y + x π C (x) 2 2.

Separation Properties Separation of two convex sets: Let C be convex and compact and D be closed convex. Then there is a non-zero separating hyperplane v inf x C v, x > sup v, y. y D Proof: D C is closed convex and 0 D C

Supporting hyperplanes Supporting hyperplane: A hyperplane {x : v, x = b} supports the set C if C is contained in the halfspace {x : v, x b} and for some y bd C we have v, y = b

Supporting hyperplanes Let C be a convex set with x bd C. Then there exists a non-zero supporting hyperplane H passing through x. That is, a v 0 such that Proof: C {y : v, y b} and v, x = b.

Convex functions A function f is convex if its domain dom f is a convex set and for all x, y dom f we have f(tx + (1 t)y) tf(x) + (1 t)f(y) for all t [0, 1]. (Define f(z) = + for z dom f)

Convex functions and epigraphs Duality between convex sets and functions. A function f is convex if and only if its epigraph is convex epi f := {(x, t) : f(x) t, t R}

Minima of convex functions Why convex? Let x be a local minimizer of f on the convex set C. Then global minmization: Proof: Note that for y C, f(x) f(y) for all y C. f(x + t(y x)) = f((1 t)x + ty) (1 t)f(x) + tf(y).

Subgradients A vector g is a subgradient of f at x if f(y) f(x) + g, y x for all y.

Subdifferential The subdifferential (subgradient set) of f at x is f(x) := {g : f(y) f(x) + g, y x for all y}.

Subdifferential examples Let f(x) = x = max{x, x}. Then 1 if x > 0 f(x) = 1 if x < 0 [ 1, 1] if x = 0.

Existence of subgradients Theorem: Let x int dom f. Then f(x) Proof: Supporting hyperplane to epi f at the point (x, f(x))

Optimality properties Subgradient optimality A point x minimizes f if and only if 0 f(x). Immediate: f(y) f(x) + g, y x

Advanced optimality properties Subgradient optimality Consider problem minimize x f(x) subject to x C. Then x solves problem if and only if there exists g f(x) such that g, y x 0 for all y C.

Subgradient calculus Addition Let f 1,..., f m be convex and f = m i=1 f i. Then f(x) = m { m } f i (x) = g i : g i f i (x) i=1 i=1 (Extends to infinite sums, integrals, etc.)

Subgradient calculus Composition Let A R n m and f : R n R be convex, with h(x) = f(ax). Then h(x) = A T f(ax) = {A T g : g f(ax)}.

Subgradient calculus: maxima Let f i, i = 1,..., m, be convex f(x) = max f i (x) i Then with I(x) = {i : f i (x) = f(x) = max j f j (x)}, f(x) = Conv{g i : g i f i (x), i I(x)}.

Subgradient calculus: suprema Let {f α : α A} be a collection of convex functions, f(x) = sup f α (x). α A Then if the supremum is attained and A(x) = {α : f α (x) = f(x)}, f(x) Conv {g α : g α f α (x), α A(x)}

Examples of subgradients Norms: Recall l -norm,, Then x = Conv x = max x j j { } {e i : e i, x = x } { e i : e i, x = x }

Examples of subgradients General norms: Recall dual norm of norm y = sup x, y and x = sup x, y. x: x 1 y: y 1 Then x = {y : y 1, y, x = x }.