Definition of convex function in a vector space

Similar documents
Part I Formulation of convex optimization problems

Lecture 2: Convex Sets and Functions

A function(al) f is convex if dom f is a convex set, and. f(θx + (1 θ)y) < θf(x) + (1 θ)f(y) f(x) = x 3

Convex Functions. Daniel P. Palomar. Hong Kong University of Science and Technology (HKUST)

Introduction and Math Preliminaries

Lecture 2: Convex functions

MVE165/MMG631 Linear and integer optimization with applications Lecture 13 Overview of nonlinear programming. Ann-Brith Strömberg

EE Applications of Convex Optimization in Signal Processing and Communications Dr. Andre Tkacenko, JPL Third Term

3. Convex functions. basic properties and examples. operations that preserve convexity. the conjugate function. quasiconvex functions

Nonlinear Optimization

MATHEMATICAL ECONOMICS: OPTIMIZATION. Contents

Convex Functions. Wing-Kin (Ken) Ma The Chinese University of Hong Kong (CUHK)

Numerisches Rechnen. (für Informatiker) M. Grepl P. Esser & G. Welper & L. Zhang. Institut für Geometrie und Praktische Mathematik RWTH Aachen

Convex Functions and Optimization

N. L. P. NONLINEAR PROGRAMMING (NLP) deals with optimization models with at least one nonlinear function. NLP. Optimization. Models of following form:

Lecture Note 5: Semidefinite Programming for Stability Analysis

Convex Optimization. (EE227A: UC Berkeley) Lecture 4. Suvrit Sra. (Conjugates, subdifferentials) 31 Jan, 2013

Lecture 4: Convex Functions, Part I February 1

Chapter 2: Preliminaries and elements of convex analysis

4. Convex Sets and (Quasi-)Concave Functions

Introduction to Machine Learning Lecture 7. Mehryar Mohri Courant Institute and Google Research

3. Convex functions. basic properties and examples. operations that preserve convexity. the conjugate function. quasiconvex functions

Common-Knowledge / Cheat Sheet

1 Overview. 2 A Characterization of Convex Functions. 2.1 First-order Taylor approximation. AM 221: Advanced Optimization Spring 2016

Convex optimization COMS 4771

Nonlinear equations. Norms for R n. Convergence orders for iterative methods

j=1 [We will show that the triangle inequality holds for each p-norm in Chapter 3 Section 6.] The 1-norm is A F = tr(a H A).

Lecture 3: Functions of Symmetric Matrices

IE 521 Convex Optimization

Semidefinite Programming Basics and Applications

Microeconomics I. September, c Leopold Sögner

8. Geometric problems

Lagrange duality. The Lagrangian. We consider an optimization program of the form

Lagrangian Duality Theory

Agenda. 1 Cone programming. 2 Convex cones. 3 Generalized inequalities. 4 Linear programming (LP) 5 Second-order cone programming (SOCP)

Convex functions. Definition. f : R n! R is convex if dom f is a convex set and. f ( x +(1 )y) < f (x)+(1 )f (y) f ( x +(1 )y) apple f (x)+(1 )f (y)

Nonlinear Programming Models

Chapter 1. Optimality Conditions: Unconstrained Optimization. 1.1 Differentiable Problems

Convex Optimization. Dani Yogatama. School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA. February 12, 2014

Convex Optimization and Modeling

8. Geometric problems

Introduction to Convex Analysis Microeconomics II - Tutoring Class

Module 04 Optimization Problems KKT Conditions & Solvers

Shiqian Ma, MAT-258A: Numerical Optimization 1. Chapter 4. Subgradient

Convex envelopes, cardinality constrained optimization and LASSO. An application in supervised learning: support vector machines (SVMs)

Examination paper for TMA4145 Linear Methods

Lecture 3. Optimization Problems and Iterative Algorithms

Linear Algebra Lecture Notes-I

Bounded uniformly continuous functions

Introduction to Functional Analysis

Fuzzy Sets. Mirko Navara navara/fl/fset printe.pdf February 28, 2019

EE/ACM Applications of Convex Optimization in Signal Processing and Communications Lecture 17

Extreme Abridgment of Boyd and Vandenberghe s Convex Optimization

2 Two-Point Boundary Value Problems

2) Let X be a compact space. Prove that the space C(X) of continuous real-valued functions is a complete metric space.

Subgradients. subgradients and quasigradients. subgradient calculus. optimality conditions via subgradients. directional derivatives

Paul Schrimpf. October 18, UBC Economics 526. Unconstrained optimization. Paul Schrimpf. Notation and definitions. First order conditions

L. Vandenberghe EE236C (Spring 2016) 18. Symmetric cones. definition. spectral decomposition. quadratic representation. log-det barrier 18-1

An Alternative Proof of Primitivity of Indecomposable Nonnegative Matrices with a Positive Trace

8 Numerical methods for unconstrained problems

Math 273a: Optimization Subgradients of convex functions

Tutorials in Optimization. Richard Socher

Analysis and Linear Algebra. Lectures 1-3 on the mathematical tools that will be used in C103

Local strong convexity and local Lipschitz continuity of the gradient of convex functions

Kernel Method: Data Analysis with Positive Definite Kernels

5. Duality. Lagrangian

Lecture 2: Linear Algebra Review

10-725/36-725: Convex Optimization Prerequisite Topics

Convex Optimization in Classification Problems

Iterative Reweighted Minimization Methods for l p Regularized Unconstrained Nonlinear Programming

STAT 200C: High-dimensional Statistics

Examples include: (a) the Lorenz system for climate and weather modeling (b) the Hodgkin-Huxley system for neuron modeling

Lecture 15 Newton Method and Self-Concordance. October 23, 2008

8. Conjugate functions

Convex analysis and profit/cost/support functions

Course Summary Math 211

Structural and Multidisciplinary Optimization. P. Duysinx and P. Tossings

Math 328 Course Notes

January 29, Introduction to optimization and complexity. Outline. Introduction. Problem formulation. Convexity reminder. Optimality Conditions

A Brief Review on Convex Optimization

Sparse Gaussian conditional random fields

WEAK CONVERGENCE OF RESOLVENTS OF MAXIMAL MONOTONE OPERATORS AND MOSCO CONVERGENCE

Self-Concordant Barrier Functions for Convex Optimization

Linear and non-linear programming

Sobolevology. 1. Definitions and Notation. When α = 1 this seminorm is the same as the Lipschitz constant of the function f. 2.

Interior Point Methods: Second-Order Cone Programming and Semidefinite Programming

i=1 β i,i.e. = β 1 x β x β 1 1 xβ d

Handout 2: Elements of Convex Analysis

Generalized Kernel Classification and Regression

Techinical Proofs for Nonlinear Learning using Local Coordinate Coding

ON GAP FUNCTIONS OF VARIATIONAL INEQUALITY IN A BANACH SPACE. Sangho Kum and Gue Myung Lee. 1. Introduction

Assignment 1: From the Definition of Convexity to Helley Theorem

Summer School: Semidefinite Optimization

THE INVERSE FUNCTION THEOREM FOR LIPSCHITZ MAPS

Support Vector Machines: Maximum Margin Classifiers

Statistical Methods for SVM

From the Newton equation to the wave equation in some simple cases

Convex Optimization M2

Chapter 14 Unconstrained and Constrained Optimization Problems

Linear ODEs. Existence of solutions to linear IVPs. Resolvent matrix. Autonomous linear systems

Transcription:

Convex functions Nonlinear optimization Instituto Superior Técnico and Carnegie Mellon University PhD course João Xavier TAs: Brian Swenson, Shanghang Zhang, Lucas Balthazar

Convex functions

Nonconvex functions

Definition of convex function in a vector space Definition. A function f : V R {+ } in a vector space V is convex if f + and f ((1 α)x + αy) (1 α)f(x) + αf(y) for all x, y S and α [0, 1]. (0(+ ) := 0) Definition. The domain of f is dom f = {x V : f(x) < + }. If f is convex, its domain is a convex set

Convexity is a 1D property Proposition. f is convex if and only φ : R R {+ }, φ(t) = f (p + td), is convex for any p and d in V. f p d

How do we recognize convex functions? List of simple ones + Apply convexity-preserving operations

Simple convex functions affine norms indicators

Affine function in R f(x) =sx + r r s x

Affine function in R 2 f(x) =s T x + r r x 2 x 1

Affine function Definition. An affine function f : V R is a map of the form f(v) = l(v) + r for some linear function l : V R and some r R. Examples: f : R n R, f(x) = s T x + r (s R n, r R) f : R n m R, f(x) = tr ( S T X ) + r (S R n m, r R) f : S n R, f(x) = tr (SX) + r (S S n, r R) Theorem. An affine function is convex.

Example: network flow x 1 x 4 c 1 c 4 x 3 1 s t 1 c 5 x 2 x 5 c 2 c 3 Formulation that minimizes cost: minimize x 1,x 2,x 3,x 4,x 5 c 1 x 1 + c 2 x 2 + c 3 x 3 + c 4 x 4 + c 5 x 5 subject to 1 = x 1 + x 2 x 1 + x 3 = x 4 x 2 = x 3 + x 5 x 4 + x 5 = 1 x 1, x 2, x 3, x 4, x 5 0

Norm in R f(x) = x x

Norm in R 2 f(x) =kxk 2 x 2 x 1

Theorem. A norm is a convex function. Examples: f : R n R, f(x) = x 2 f : R n R, f(x) = x f : S n R, f(x) = X F

Indicators Definition. The indicator of a set S V is the function { 0, if x S i S : V R {+ }, i S (x) = +, otherwise. +1 +1 S Theorem. The indicator of a convex set is a convex function.

Indicators allow to pass constraints to the objective: is equivalent to minimize x subject to f(x) x A x B is equivalent to minimize x subject to f(x) + i B (x) x A is equivalent to minimize x f(x) + i A (x) + i B (x) minimize x f(x) + i A B (x)

Convexity through differentiability Theorem (1st order criterion). Let S be an open convex subset of R n and f : S R be a differentiable function. Then, f is convex f(x) f(y) + f(y) T (x y) for all x, y S. f(x) f(y)+rf(y) T (x y) y x Mostly useful in the direction: offers affine lower bounds to convex functions produces interesting inequalities

Convexity through differentiability Theorem (2nd order criterion). Let S be an open convex subset of R n and f : S R be a twice-differentiable function. Then, f is convex 2 f(y) 0 for all y S. f(x) y x Mostly useful in the direction: proves f is convex Commonly, the 2nd order Taylor expansion f(y) + f(y) T (x y) + 1 2 (y x)t 2 f(y)(y x) is not an upper bound on f(x)

Examples f : R ++ R, f(x) = 1 x f : R ++ R, f(x) = log(x) f : R + R, f(x) = x log(x) with 0 log(0) := 0 f : R n R, f(x) = x T Ax + b T x + c, with A symmetric, is convex iff A 0 f : R n R, f(x 1,..., x n ) = log (e x1 + + e xn ) f : R n R ++ R, f(x, y) = xt x y

Theorem. For any A S n ++ and B S n there exists S R n n such that A = SS T and B = SΛS T where Λ R n n is diagonal, with the eigenvalues of A 1/2 BA 1/2. Further examples of convex functions: f : S n ++ R, f(x) = tr ( X 1) f : S n ++ R, f(x) = log det(x)

Operations that preserve convexity conic combination composition with affine map pointwise supremum

Conic combination preserves convexity Theorem. Let f i : V R {+ } be convex functions and α i 0 for i = 1,..., n. If n i=1 dom f i, then is convex. f = α 1 f 1 + + α n f n Example: basis pursuit with denoising minimize x Ax b 2 2 + ρ x 1 }{{} f(x) for given A R m n, b R m, and ρ > 0

Composition with affine map preserves convexity Theorem. Let A : V W be an affine map, f : W R {+ } be convex, and A(V ) dom f. Then is convex. f A : V R {+ } Example: logistic regression (x k, y k ) R n {0, 1}, k = 1,..., K, is the labeled training set label is generated randomly from feature vector P(Y = y X = x, s, r) = ey(st x+r) 1 + e st x+r what should be the classifier parameters (s, r) R n R? K maximize s,r k=1 log P (Y = y k X = x k, s, r)

Pointwise supremum preserves convexity Theorem. Let f i : V R {+ }, i I, be a family a convex functions and suppose that f = sup i I f i +. Then, f : V R {+ } is convex. Example: fire-station placement p 1,..., p K are the locations of villages what should be the position x of the fire-station? minimize x max { x p 1,, x p K } }{{} f(x)

Example: maximum eigenvalue function λ max : S n R X λ max (X) does not have a closed-form expression variational characterization from linear algebra λ max (X) = sup{q T Xq : q = 1} offers the representation λ max = sup {f q (X) : q Q} with f q : S n R, f q (X) = q T Xq and Q = {q R n : q = 1} f q is a convex function (linear function of X) we conclude λ max is a convex function