Strongly convex functions, Moreau envelopes and the generic nature of convex functions with strong minimizers

Similar documents
Optimization and Optimal Control in Banach Spaces

Convex Optimization Notes

Epiconvergence and ε-subgradients of Convex Functions

BASICS OF CONVEX ANALYSIS

Chapter 2 Convex Analysis

Subdifferential representation of convex functions: refinements and applications

Local strong convexity and local Lipschitz continuity of the gradient of convex functions

Continuity of convex functions in normed spaces

Contents: 1. Minimization. 2. The theorem of Lions-Stampacchia for variational inequalities. 3. Γ -Convergence. 4. Duality mapping.

Chapter 2 Metric Spaces

AW -Convergence and Well-Posedness of Non Convex Functions

(convex combination!). Use convexity of f and multiply by the common denominator to get. Interchanging the role of x and y, we obtain that f is ( 2M ε

Iterative Convex Optimization Algorithms; Part One: Using the Baillon Haddad Theorem

Optimality Conditions for Nonsmooth Convex Optimization

g 2 (x) (1/3)M 1 = (1/3)(2/3)M.

Convex Functions. Pontus Giselsson

On a result of Pazy concerning the asymptotic behaviour of nonexpansive mappings

Functional Analysis. Franck Sueur Metric spaces Definitions Completeness Compactness Separability...

Convex Analysis and Economic Theory AY Elementary properties of convex functions

The resolvent average of monotone operators: dominant and recessive properties

ON GAP FUNCTIONS OF VARIATIONAL INEQUALITY IN A BANACH SPACE. Sangho Kum and Gue Myung Lee. 1. Introduction

On Semicontinuity of Convex-valued Multifunctions and Cesari s Property (Q)

Analysis Finite and Infinite Sets The Real Numbers The Cantor Set

Subgradient Projectors: Extensions, Theory, and Characterizations

Metric Spaces and Topology

THE UNIQUE MINIMAL DUAL REPRESENTATION OF A CONVEX FUNCTION

3.1 Convexity notions for functions and basic properties

Monotone operators and bigger conjugate functions

Dedicated to Michel Théra in honor of his 70th birthday

A SET OF LECTURE NOTES ON CONVEX OPTIMIZATION WITH SOME APPLICATIONS TO PROBABILITY THEORY INCOMPLETE DRAFT. MAY 06

Lecture Notes in Advanced Calculus 1 (80315) Raz Kupferman Institute of Mathematics The Hebrew University

Keywords. 1. Introduction.

Chapter 3: Baire category and open mapping theorems

Convex Analysis and Optimization Chapter 2 Solutions

A Dykstra-like algorithm for two monotone operators

Convex Optimization Conjugate, Subdifferential, Proximation

Convex Functions and Optimization

From now on, we will represent a metric space with (X, d). Here are some examples: i=1 (x i y i ) p ) 1 p, p 1.

SOME REMARKS ON SUBDIFFERENTIABILITY OF CONVEX FUNCTIONS

Existence and Approximation of Fixed Points of. Bregman Nonexpansive Operators. Banach Spaces

Best approximations in normed vector spaces

Course 212: Academic Year Section 1: Metric Spaces

Set, functions and Euclidean space. Seungjin Han

Identifying Active Constraints via Partial Smoothness and Prox-Regularity

Real Analysis Notes. Thomas Goller

Semicontinuous functions and convexity

WEAK CONVERGENCE OF RESOLVENTS OF MAXIMAL MONOTONE OPERATORS AND MOSCO CONVERGENCE

Economics 204 Fall 2011 Problem Set 2 Suggested Solutions

On the convexity of piecewise-defined functions

Convex Analysis Background

GEOMETRIC APPROACH TO CONVEX SUBDIFFERENTIAL CALCULUS October 10, Dedicated to Franco Giannessi and Diethard Pallaschke with great respect

Introduction and Preliminaries

Convex Analysis and Economic Theory Winter 2018

MAT 570 REAL ANALYSIS LECTURE NOTES. Contents. 1. Sets Functions Countability Axiom of choice Equivalence relations 9

Second order forward-backward dynamical systems for monotone inclusion problems

FUNCTIONAL ANALYSIS LECTURE NOTES: COMPACT SETS AND FINITE-DIMENSIONAL SPACES. 1. Compact Sets

Some Properties of the Augmented Lagrangian in Cone Constrained Optimization

ZERO DUALITY GAP FOR CONVEX PROGRAMS: A GENERAL RESULT

Your first day at work MATH 806 (Fall 2015)

Chapter 1: Banach Spaces

MATH 4200 HW: PROBLEM SET FOUR: METRIC SPACES

MA651 Topology. Lecture 10. Metric Spaces.

Topology. Xiaolong Han. Department of Mathematics, California State University, Northridge, CA 91330, USA address:

A Geometric Framework for Nonconvex Optimization Duality using Augmented Lagrangian Functions

Introduction to Convex Analysis Microeconomics II - Tutoring Class

Remark on a Couple Coincidence Point in Cone Normed Spaces

DEFINABLE VERSIONS OF THEOREMS BY KIRSZBRAUN AND HELLY

Real Analysis Math 131AH Rudin, Chapter #1. Dominique Abdi

MAXIMALITY OF SUMS OF TWO MAXIMAL MONOTONE OPERATORS

arxiv: v1 [math.oc] 15 Apr 2016

Convex and Nonconvex Optimization Techniques for the Constrained Fermat-Torricelli Problem

REVIEW OF ESSENTIAL MATH 346 TOPICS

The Brezis-Browder Theorem in a general Banach space

Brøndsted-Rockafellar property of subdifferentials of prox-bounded functions. Marc Lassonde Université des Antilles et de la Guyane

The proximal mapping

Chapter 2. Metric Spaces. 2.1 Metric Spaces

The Subdifferential of Convex Deviation Measures and Risk Functions

A Unified Analysis of Nonconvex Optimization Duality and Penalty Methods with General Augmenting Functions

Contents Ordered Fields... 2 Ordered sets and fields... 2 Construction of the Reals 1: Dedekind Cuts... 2 Metric Spaces... 3

Part III. 10 Topological Space Basics. Topological Spaces

Lecture 1: Background on Convex Analysis

Introduction to Real Analysis Alternative Chapter 1

Stability of efficient solutions for semi-infinite vector optimization problems

Mathematics for Economists

Mathematics II, course

ANALYSIS WORKSHEET II: METRIC SPACES

A Dual Condition for the Convex Subdifferential Sum Formula with Applications

Semicontinuities of Multifunctions and Functions

Division of the Humanities and Social Sciences. Supergradients. KC Border Fall 2001 v ::15.45

Existence and Uniqueness

Math 209B Homework 2

Chapter 1. Optimality Conditions: Unconstrained Optimization. 1.1 Differentiable Problems

P-adic Functions - Part 1

Self-equilibrated Functions in Dual Vector Spaces: a Boundedness Criterion

Examples of Convex Functions and Classifications of Normed Spaces

Self-dual Smooth Approximations of Convex Functions via the Proximal Average

CONTINUOUS CONVEX SETS AND ZERO DUALITY GAP FOR CONVEX PROGRAMS

Some Background Material

Chapter 1. Introduction

On the order of the operators in the Douglas Rachford algorithm

Transcription:

University of Wollongong Research Online Faculty of Engineering and Information Sciences - Papers: Part B Faculty of Engineering and Information Sciences 206 Strongly convex functions, Moreau envelopes and the generic nature of convex functions with strong minimizers Chayne Planiden University of British Columbia, Okanagan, chayne@uow.edu.au Xianfu Wang University of British Columbia, Okanagan, shawn.wang@ubc.ca Publication Details Planiden, C. & Wang, X. (206). Strongly convex functions, Moreau envelopes and the generic nature of convex functions with strong minimizers. SIAM Journal on Optimization, 26 (2), 34-364. Research Online is the open access institutional repository for the University of Wollongong. For further information contact the UOW Library: research-pubs@uow.edu.au

Strongly convex functions, Moreau envelopes and the generic nature of convex functions with strong minimizers Abstract In this work, using Moreau envelopes, we define a complete metric for the set of proper lower semicontinuous convex functions. Under this metric, the convergence of each sequence of convex functions is epiconvergence. We show that the set of strongly convex functions is dense but it is only of the first category. On the other hand, it is shown that the set of convex functions with strong minima is of the second category. Disciplines Engineering Science and Technology Studies Publication Details Planiden, C. & Wang, X. (206). Strongly convex functions, Moreau envelopes and the generic nature of convex functions with strong minimizers. SIAM Journal on Optimization, 26 (2), 34-364. This journal article is available at Research Online: http://ro.uow.edu.au/eispapers/343

Strongly convex functions, Moreau envelopes and the generic nature of convex functions with strong minimizers C. Planiden X. Wang April 3, 206 Abstract In this work, using Moreau envelopes, we define a complete metric for the set of proper lower semicontinuous convex functions in a finite-dimensional space. Under this metric, the convergence of each sequence of convex functions is epi-convergence. We show that the set of strongly convex functions is dense but it is only of the first category. On the other hand, it is shown that the set of convex functions with strong minima is of the second category. AMS Subject Classification: Primary 54E52, 52A4, 90C25; Secondary 49K40. Keywords: Attouch-Wets metric, Baire category, complete metric space, convex function, epiconvergence, epi-topology, generic set, meager set, Moreau envelope, proximal mapping, strong minimizer, strongly convex. Introduction Minimizing convex functions is fundamental in optimization, both in theory and in algorithm design. For most applications, the assertions that can be made about a class of convex functions are of greater value than those concerning a particular problem. This theoretical analysis is valuable for the insights gained on the behaviour of the entire class of functions. Our main result in this paper states that in R n, the set of all proper lower semicontinuous (lsc) convex functions that have strong minimizers is of second category. Studying strong minima is important, because numerical methods usually produce asymptotically minimizing sequences, and we can assert convergence of asymptotically minimizing sequences when the function has a strong minimizer. As shown in Mathematics, University of British Columbia Okanagan, Kelowna, B.C. VV V7, Canada. Research by this author was supported by UBC UGF and by NSERC of Canada. chayne.planiden@alumni.ubc.ca. Mathematics, University of British Columbia Okanagan, Kelowna, B.C. VV V7, Canada. Research by this author was partially supported by an NSERC Discovery Grant. shawn.wang@ubc.ca.

[25], the proximal mapping of a strongly convex function is a contraction, and the proximal point method converges at a linear rate. The strongly convex function is also of great use in optimization problems, as it can significantly increase the rate of convergence of first-order methods such as projected subgradient descent [8], or more generally the forward-backward algorithm [5, Example 27.2]. Although every strongly convex function has a strong minimizer, we show that the set of strongly convex functions is only of the first category. As a proper lsc convex function allows the value infinity, we propose to relate the function to its Moreau envelope. The importance of the Moreau envelope in optimization is clear; it is a regularizing (smoothing) function [20, 2], and in the convex setting it has the same local minima and minimizers as its objective function [27, 30]. The key tool we use is Baire category. A property is said to be generic if it holds for a second category set. We will work in a metric space defined by Moreau envelopes. In this setting, there are many nice properties of the set of Moreau envelopes of proper, lsc, convex functions. This set is proved to be closed and convex. Moreover, as a mapping from the set of proper lsc convex functions to the set of Moreau envelopes of convex functions, the Moreau envelope mapping is bijective. We provide a detailed analysis of functions with strong minima, strongly convex functions, and their Moreau envelopes. The organization of the present work is the following. Section 2 contains notation and definitions, as well as some preliminary facts and lemmas about Baire category, epi-convergence of convex functions, strongly convex functions and strong minimizers that we need to prove the main results. We show that the Moreau envelope of a convex function inherits many nice properties of the convex function, such as coercivity and strong convexity. In Section 3, using Moreau envelopes of convex functions, we propose to use the Attouch-Wets metric on the set of proper lsc convex functions. It turns out that this metric space is complete, and it is isometric to the metric space of Moreau envelopes endowed with uniform convergence on bounded sets. The main results of this paper are presented in Section 4. We give some characterizations of strong minimizers of convex functions, that are essential for our Baire category approach. We establish Baire category classification of the sets of strongly convex functions, convex functions with strong minima, and convex coercive functions. Our main result says that most convex functions have strong minima, which in turn implies that the set of convex functions not having strong minimizers is small. Surprisingly, the set of strongly convex functions is only of the first category. In addition, we show that a convex function is strongly convex if and only if its proximal mapping is a down-scaled proximal mapping. Concluding remarks and areas of future research are mentioned in Section 5. A comparison to literature is in order. In [29], Baire category theory was used to show that most (i.e. a generic set) maximally monotone operators have a unique zero. In [22], a similar track was taken, but it uses the perspective of proximal mappings in particular, ultimately proving that most classes of convex functions have a unique minimizer. The technique of this paper differs in that it is based on functions. We use Moreau envelopes of convex functions, strong minimizers and strongly convex functions rather than subdifferentials. While Beer and Lucchetti obtained a similar result on generic well-posedness of convex optimization, their approach relies on epigraphs of convex functions [7, 8, 9]. Our Moreau envelope approach is more accessible and natural to practical optimizers, because taking the Moreau envelope is a popular regularization method used 2

in the optimization community. We also give a systematic study of strongly convex functions, a special subclass of uniformly convex functions, which is new to the best of our knowledge. In [6], the definition of generic Tikhonov well-posedness of convex problems is given. There, it is assumed that either the convex functions are finite-valued and the set of convex functions is equipped with uniform convergence on bounded sets, or the convex functions are all continuous on the whole space. See also [28] for generic nature of constrained optimization problems, and [7, 6, 9, 24] for well-posedness in optimization. For comprehensive generic results on fixed points of nonexpansive mappings and firmly nonexpansive mappings, we refer the reader to [23]. 2 Preliminaries 2. Notation All functions in this paper have their domains in R n, where R n is Euclidean space equipped with inner product x, y := n x i y i, and induced norm x := x, x. The extended real line R { } i= is denoted R. We denote the set of natural numbers by N. We use dom f for the domain of f, int dom f for the interior of the domain of f, bdry dom f for the boundary of the domain of f, and epi f for the epigraph of f. We use Γ 0 (X) to represent the set of proper lsc convex functions on the space X, with the terms proper, lsc, and convex as defined in [5, 27]. More precisely, f is proper if f(x) and dom f ; f is lsc at x if x k x implies lim inf f(x k) f(x), when this k is true at every x X we call f lsc on X; f is convex if ( x, y dom f)( 0 α ) f(αx + ( α)y) αf(x) + ( α)f(y). The symbol G δ is used to indicate a countable intersection of open sets. The identity mapping or matrix is Id : R n R n : x x. We use B r (x) for the open ball centered at x of radius r, and B r [x] for the closed ball. For a set C R n, its closure is denoted by C. The closed line segment between x, y R n is [x, y] := {λx + ( λ)y : 0 λ }. For a sequence of functions {f γ } Γ 0 (R n p e ), we use f γ f to indicate pointwise convergence, fγ f for epi-convergence, and f u γ f for uniform convergence to the function f. 2.2 Baire category Let (X, d) be a metric space, where X is a set and d is a metric on X. Definition 2.. A set S X is dense in X if every element of X is either in S, or a limit point of S. A set is nowhere dense in X if the interior of its closure in X is empty. Definition 2.2. A set S X is of first category (meager) if S is a union of countably many nowhere dense sets. A set S X is of second category (generic) if X \ S is of first category. The following Baire category theorem is essential for this paper. 3

Fact 2.3 (Baire). ([32, Theorem.47] or [5, Corollary.44]) Let (X, d) be a complete metric space. Then any countable intersection of dense open subsets of X is dense. Fact 2.4. Finite-dimensional space R n is separable. That is, R n has a countable subset that is dense in R n. 2.3 Convex analysis In this section we state several key facts about convex functions that we need in order to prove the main results in subsequent sections. 2.3. Subdifferentials of convex functions Let f Γ 0 (R n ). The set-valued mapping f : R n R n : x { x R n ( y R n ) y x, x + f(x) f(y) } is the subdifferential operator of f. Fact 2.5. [5, Theorem 20.40] If f Γ 0 (R n ), then f is maximally monotone. Fact 2.6. ([27, Theorem 2.4], [3, Theorem 2.5]) For any maximally monotone mapping T : R n R n, the set dom T is almost convex. That is, there exists a convex set C R n such that C dom T C. The same applies to the set ran T. Definition 2.7. The Fenchel conjugate of f : R n R is defined as ( v R n ) f (v) := sup x R n { v, x f(x)}. Fact 2.8. [26, Corollary 23.5.] If f Γ 0 (R n ), then f is the inverse of f in the sense of multivalued mappings, i.e. x f (x ) if and only if x f(x). 2.3.2 Convex functions and their Moreau envelopes Definition 2.9. The Moreau envelope of a proper, lsc function f : R n R is defined as { ( x R n ) e λ f(x) := inf f(y) + } y x 2. y 2λ The associated proximal mapping is the (possibly empty) set of points at which this infimum is achieved, and is denoted Prox λ f : ( x R n ) { Prox λ f(x) := argmin f(y) + } y x 2. y 2λ 4

In terms of the subdifferential of f, the proximal mapping Prox f = (Id + f). In this paper, without loss of generality we use λ =. The theory developed here is equally applicable with any other choice of λ > 0. Fact 2.0. ([5, Proposition 2.29] or [27, Theorem 2.26]) Let f Γ 0 (R n ). Then e f : R n R is continuously differentiable on R n, and its gradient is -Lipschitz continuous, i.e., nonexpansive. e f = Id Prox f One important concept for studying the convergence of extended-valued functions is that of epi-convergence, see, e.g., [, 7, 27]. We first remind the reader of the concepts of limit inferior and limit superior for a sequence of sets. Definition 2.. The distance from a point x to a set S R n is defined by the distance function d S : d S (x) := inf x y. y S Definition 2.2. The limit inferior of a sequence of sets {S γ } γ= is denoted lim inf γ defined lim inf γ S γ := {x R n : lim sup d Sγ (x) = 0}. γ Similarly, the limit superior is denoted lim sup S γ, and is defined γ lim sup S γ := {x R n : lim inf d S γ (x) = 0}. γ γ S γ, and is Definition 2.3. The lower epi-limit of a sequence {f γ } γ= R n is the function having as its epigraph the outer limit of the sequence of sets epi f γ : epi(eliminf γ f γ) := lim sup(epi f γ ). γ Similarly, the upper epi-limit of {f γ } γ= is the function having as its epigraph the inner limit of the sets epi f γ : epi(elimsup f γ ) := lim inf (epi f γ). γ γ When these two functions coincide, the epi-limit is said to exist and the functions are said to epiconverge to f. We denote this by using the following notation: f γ e f if and only if epi f γ epi f. Definition 2.4. Let {f γ } be a sequence of functions with the same domain D. The sequence p converges pointwise to a function f, denoted f γ f, if and only if ( x D) lim f γ (x) = f(x). γ 5

We refer the reader to [7, 9,, 27] for further details on epi-convergence, e.g., continuity, stability and applications in optimization. The analysis of the limit properties of sequences of convex functions via their Moreau envelopes is highlighted by the following fact. Fact 2.5. ([27, Theorem 7.37], []) Let {f γ } γ= Γ 0 (R n ), f Γ 0 (R n ). Then f γ e f if and only if e f γ p e f. Moreover, the pointwise convergence of e f γ to e f is uniform on all bounded subsets of R n, hence yields epi-convergence to e f as well. Remark 2.6. (). Fact 2.5 would not be valid as it stands in infinite-dimensional space. A uniform limit on bounded sets of Moreau envelopes will remain a Moreau envelope, but this can fail to be true when considering pointwise convergence of Moreau envelopes, see, e.g., [, Remark 2.7]. (2). When X is a Banach space, we refer to Beer [7, p. 263]: for {f γ } γ= Γ 0 (X), one has that f γ aw f 0 if and only if e f γ e f 0 uniformly on bounded subsets of X. When X is infinite dimensional, Attouch-Wets convergence is a stronger concept than epi-convergence, see, e.g., [7, p. 235] or [, Theorem 6.2.4]. Two more nice properties about Moreau envelopes are the following. Fact 2.7. [27, Example.46] For any proper, lsc function f : R n R, inf f = inf e f. Lemma 2.8. [26, Theorem 3.5] Let f Γ 0 (R n ). Then ( x R n ) e f(x) + e f (x) = 2 x 2. For more properties of Moreau envelopes of functions, we refer the reader to [, 5, 26, 27]. 2.4 Strong minimizers, coercive convex functions and strongly convex functions We now present some basic properties of strong minimizers, strongly convex functions, and coercive functions. Definition 2.9. A function f : R n R is said to attain a strong minimum at x R n if (i) f( x) f(x) for all x dom f, and (ii) f(x k ) f( x) implies x k x. 6

In existing literature, a function f having a strong minimizer is also known as (R n, f) being well-posed in the sense of Tikhonov, as defined by Dontchev and Zolezzi in [6, Chapter I] and used in [7, 3, 5, 7]. [6, Theorem 2] shows that f has a strong minimizer x if and only if there exists a forcing function c such that f(x) f( x) + c(dist(x, x)) where c : 0 D [0, ) [0, ), c(0) = 0, and c(a n ) 0 a n 0. When f is convex, the forcing function c can be chosen convex, [9, Proposition 0..9]. Chapter III of [6] also contains several results on generic well-posedness. For further information on strong minimizers, we refer readers to [2, 4, 9]. Definition 2.20. Following Rockafellar and Wets (see [27, p. 90]), we will call a function f Γ 0 (R n ) level-coercive if f(x) lim inf x x > 0, and coercive if f(x) lim inf x x =. Definition 2.2. A function f Γ 0 (R n ) is σ-strongly convex if there exists a constant σ > 0 such that f σ 2 2 is convex. Equivalently, f is σ-strongly convex if there exists σ > 0 such that for all λ (0, ) and for all x, y R n, f(λx + ( λ)y) λf(x) + ( λ)f(y) σ 2 λ( λ) x y 2. Fact 2.22. ([0, Exercise 2 p. 83], [27, Theorem.8]) Let f Γ 0 (R n ). Then (i) f is level-coercive if and only if 0 int dom f, and (ii) f is coercive if and only if dom f = R n. Lemma 2.23. The function f Γ 0 (R n ) is σ-strongly convex if and only if e f is convex. σ -strongly σ+ Proof. By [27, Proposition 2.60], f is σ-strongly convex if and only if f is -Lipschitz for σ some σ > 0. Now (e f) = f + 2 2, and (e f) = f + Id. Suppose that f is σ-strongly convex. Since f is -Lipschitz, we have that f + Id is ( + σ σ) - Lipschitz. Hence, (e f) is σ+-lipschitz. Then e σ f is σ -strongly convex, and we have proved σ+ one direction of the lemma. Working backwards with the same argument, the other direction is proved as well. 7

Lemma 2.24. Let f Γ 0 (R n ). Then (i) f is level-coercive if and only if e f is level-coercive, and (ii) f is coercive if and only if e f is coercive. Proof. Since (e f) = f + 2 2, we have dom(e f) = dom f. The result follows from Fact 2.22. Lemma 2.25. Let f Γ 0 (R n ) be strongly convex. Then f is coercive. Proof. Since f is strongly convex, f can be written as g + σ 2 2 for some g Γ 0 (R n ) and σ > 0. Since g is convex, g is bounded below by a hyperplane. That is, there exist x R n and r R such that g(x) x, x + r for all x R n. Hence, f(x) x, x + r + σ 2 x 2 for all x R n. This gives us that f(x) lim inf x x =. Lemma 2.26. Let f : Γ 0 (R n ) R be strongly convex. Then the (unique) minimizer of f is a strong minimizer. Proof. Let f(x k ) inf f(x). Since f is coercive by Lemma 2.25, {x k} k= is bounded. By the x Bolzano-Weierstrass Theorem, {x k } k= has a convergent subsequence x k j x. Since f is lsc, we have that lim inf f(x k) f( x). Hence, k inf x f(x) f( x) inf f(x). x Therefore, f( x) = inf f(x). Since strong convexity implies strict convexity, argmin f(x) = { x} x is unique. As every subsequence of {x k } k= converges to the same limit x, we conclude that x k x. Remark 2.27. See [32, Proposition 3.5.8] for a stronger and more general result regarding Lemmas 2.25 and 2.26. In view of [32, Corollary 3.5.(iii)], when f is strongly convex with strong minimizer x, taking ( x, 0) gr f one has the existence of c > 0 such that f(x) f( x) + c 2 x x 2. 8

From this, we have 2 x k x c [f(x k) f( x)] for all k N. This ensures a bound on the rate of convergence of the minimizing sequence to the minimizer in terms of function values. Note that a convex function can be coercive, but fail to be strongly convex. Consider the following example. Example 2.28. Let f : R R, f(x) = x 4. The function f is coercive and attains a strong minimum at x = 0, but is not strongly convex. Proof. It is clear that f is coercive. By definition, f is strongly convex if and only if there exists σ > 0 such that g(x) := x 4 σ 2 x2 is convex. Since g is a differentiable, univariate function, we know it is convex if and only if its second derivative is nonnegative for all x R. Since g (x) = 2x 2 σ is clearly not nonnegative for any fixed σ > 0 and all x R, we have that g is not convex. Therefore, f is not strongly convex. Clearly, zero is the minimum and minimizer of f. Let {x k } k= R be such that f(x k) f(0) = 0. Then lim x 4 k n Therefore, f attains a strong minimum. The following additional counterexample is thanks to one of the referees. = 0 implies lim n x k = 0. Example 2.29. Let f : R R, f(x) := x p, where p >. Then f is coercive for all such p, but f is strongly convex if and only if p = 2. Proof. It is elementary to show that f is coercive. Let σ > 0, and define g(x) := f(x) σ 2 x2. Then g (x) = p(p )x p 2 σ. In order to conclude that f is strongly convex, we must have g (x) 0 for all x R. (2.) ( However, if p > 2 then inequality (2.) fails for x small, specifically for x < σ p(p ) ) p 2, and if < p < 2 then inequality (2.) fails for x large. Only for p = 2 can we choose σ such that inequality (2.) is true for all x. 3 A complete metric space using Moreau envelopes The principal tool we use is the Baire category theorem. To this end, we need a Baire space. In this section, we establish a complete metric space whose distance function makes use of the Moreau envelope. This metric has been used by Attouch and Wets in [3, page 38]. The distances used in the next section refer to the metric established here. In Beer [7, p. 24], it is left as an exercise to show that the space defined in this section is a complete metric space. For the purpose of self-containment, we include a full proof here. We begin with some properties on the Moreau envelope. Set e (Γ 0 (R n )) := {e f : f Γ 0 (R n )}. 9

Theorem 3.. The set e (Γ 0 (R n )) is a convex set in Γ 0 (R n ). Proof. Let f, f 2 Γ 0 (R n ), λ [0, ]. Then e f, e f 2 e (Γ 0 (R n )). We need to show that λe f + ( λ)e f 2 e (Γ 0 (R n )). By [6, Theorem 6.2] with µ = and n = 2, we have that λe f + ( λ)e f 2 is the Moreau envelope of the proximal average function P (f, λ). By [6, Corollary 5.2], we have that P (f, λ) Γ 0 (R n ). Hence, e P (f, λ) e (Γ 0 (R n )), and we conclude that e (Γ 0 (R n )) is a convex set. On e (Γ 0 (R n )), define a metric by d( f, g) := i= 2 i f g i + f g i, (3.) where f g i := sup f(x) g(x) and f, g e (Γ 0 (R n )). x i Note that a sequence of functions in (e (Γ 0 (R n )), d) converges if and only if the sequence converges uniformly on bounded sets, if and only if the sequence converges pointwise on R n. The latter fails in infinite-dimensional space. Theorem 3.2. The metric space (e (Γ 0 (R n )), d) is complete. Proof. Consider a Cauchy sequence {e f k } k N in (e (Γ 0 (R n )), d) where f k Γ 0 (R n ) for k N. Since f k Γ 0 (R n ), by Fact 2.0, e f k is continuous and differentiable on R n p. Then e f k g where g : R n R is continuous and convex. Our objective is to prove that g is in fact the Moreau envelope of a proper, lsc, convex function. Since (e f k ) k N and g are convex and full-domain, by e [27, Theorem 7.7] we have that e f k g, and e f u k g on bounded sets of R n. By [27, Theorem.34], we have that (e f k ) e g, that is, fk + 2 2 e g and g is proper, lsc, and convex. Hence, fk e g 2 2 by [27, Exercise 7.8(a)]. As {fk } k N is a sequence of convex functions, by [27, Theorem 7.7] we have that g 2 2 is convex. Since g 2 2 is proper, lsc, and convex, there exists h Γ 0 (R n ) such that g 2 2 = h. Applying [27, Theorem.34] again, e p we obtain f k h. Finally, using Fact 2.5 or [27, Theorem 7.37] we see that e f k e h, and e f u k e h on bounded subsets of R n as well. Therefore, (e (Γ 0 (R n )), d) is complete. In view of Fact 2.5, we give the definition of the Attouch-Wets metric on Γ 0 (R n ) as follows. Definition 3.3 (Attouch-Wets metric). For f, g Γ 0 (R n ), define the distance function d : d(f, g) := i= 2 i e f e g i + e f e g i. A referee points out that this can also be obtained by proving that the σ-strong convexity property is preserved in the limit, using the characterization of the strong convexity in terms of the subdifferentials, and the corresponding graphical convergence of the subdifferentials. 0

In Γ 0 (R n ), there are other metrics that induce the same topology as the Attouch-Wets metric; see Remark 3.6 below. In order to prove completeness of the space, we state the following lemma, whose simple proof is omitted. Lemma 3.4. Define a : [0, ) R, a(t) := a) a is an increasing function, and t. Then +t b) t, t 2 0 implies that a(t + t 2 ) a(t ) + a(t 2 ). Proposition 3.5. The space (Γ 0 (R n ), d) where d is the metric defined in Definition 3.3, is a complete metric space. Proof. Items M-M4 below show that (Γ 0 (R n ), d) is a metric space, and item C that follows shows that it is complete. M: Since 2 =, and 0 e f e g i < for all i, i + e f e g i we have that Then i= Hence, d is real-valued, finite, and non-negative. M2: We have 2 i 2 i e f e g i + e f e g i for all i. 0 d(f, g) for all f, g Γ 0 (R n ). d(f, g) = 0 i= 2 i e f e g i + e f e g i = 0, e f e g i = 0 for all i, e f(x) e g(x) = 0 for all x, e f = e g, Hence, d(f, g) = 0 if and only if f = g. M3: The fact that d(f, g) = d(g, f) is trivial. M4: By the triangle inequality, f = g [27, Corollary 3.36]. e f e g i e f e h i + e h e g i for all f, g, h Γ 0 (R n ). By applying Lemma 3.4(a), we have e f e g i + e f e g i e f e h i + e h e g i + e f e h i + e h e g i.

Then we apply Lemma 3.4(b) with t = e f e h i and t 2 = e h e g i, and we have e f e g i + e f e g i e f e h i + e f e h i + e h e g i + e h e g i. Multiplying both sides by and taking the summation over i, we obtain the distance functions, 2 i which yields d(f, g) d(f, h) + d(h, g) for all f, g, h Γ 0 (R n ). C: Let {f k } k= be a Cauchy sequence in (Γ 0(R n ), d). Then for each ε > 0 there exists N ε N such that d(f j, f k ) < ε for all j, k N ε. Fix ε > 0. Then there exists N N such that i= 2 i e f j e f k i + e f j e f k i < ε for all j, k N. Then for any i N fixed, we have e f j e f k i + e f j e f k i < 2 i ε, so that e f j e f k i < 2i ε =: ˆε > 0, 2 i ε for all j, k N. Notice that ˆε 0 as ε 0. This gives us that {e f k } k= is a Cauchy sequence p on B i (0) for each i N, so that e f k g for some finite-valued convex function g. By the same arguments as in the proof of Theorem 3.2, we know that g = e h with h Γ 0 (R n ). Hence f k h in terms of metric d. Therefore, (Γ 0 (R n ), d) is closed, and is a complete metric space. Remark 3.6. (). This result can also be reached via [2, Theorem 2.], using the ρ-hausdorff distance for epigraphs; and this result is also mentioned as an exercise in [7, p. 24]. We refer the reader to [7, Chapter 7] for more details on the Attouch-Wets topology for convex functions. (2). When X is infinite dimensional, on the set of proper lower semicontinuous convex function Γ 0 (X), a variety different topologies show up, such as Kuratowski-Painlevé convergence, Attouch-Wets convergence, Mosco convergence, Choquet-Wijsman convergence, etc. See Beer [7], Lucchetti [9], Borwein-Vanderwerff [], Attouch []. However, when X is finite dimensional, all these convergences coincide [, Theorem 6.2.3]. On the set of Fenchel conjugates (Γ 0 (R n )) := {f : f Γ 0 (R n )} define a metric by ˆd(f, g ) := d(f, g ) for f, g (Γ 0 (R n )). Observe that Γ 0 (R n ) = (Γ 0 (R n )). Corollary 3.7. Consider two metric spaces (Γ 0 (R n ), d) and ((Γ 0 (R n )), ˆd). Define T : (Γ 0 (R n ), d) ((Γ 0 (R n )), ˆd) : f f. Then T is a bijective isometry. Consequently, (Γ 0 (R n ), d) and ((Γ 0 (R n )), ˆd) are isometric. Proof. Clearly T is onto. Also, T is injective because of the Fenchel-Moreau Theorem [5, Theorem 3.32] or [26, Corollary 2.2.]. To see this, let T f = T g. Then f = g, so f = (f ) = (g ) = g. It remains to show that T is an isometry: ( f, g Γ 0 (R n )) d(f, g) = d(f, g ) = ˆd(T f, T g). 2

To see this, using Lemma 2.8 we have d(f, g ) = = = i= i= i= 2 i 2 i 2 i = d(f, g). sup e f (x) e g (x) x i + sup e f (x) e g (x) x i sup x i + sup x i 2 x 2 e f(x) 2 x 2 + e g(x) 2 x 2 e f(x) 2 x 2 + e g(x) sup e g(x) e f(x) x i + sup e g(x) e f(x) x i By Theorem 3.2, (e (Γ 0 (R n )), d) is a complete metric space. Corollary 3.8. Consider two metric spaces (Γ 0 (R n ), d) and (e (Γ 0 (R n )), d). Define T : Γ 0 (R n ) e (Γ 0 (R n )) : f e f. Then T is a bijective isometry, so (Γ 0 (R n ), d) and (e (Γ 0 (R n )), d) are isometric. 4 Baire category results This section is devoted to the main work of this paper. Ultimately, we show that the set of strongly convex functions is a meager (Baire category one) set, while the set of convex functions that attain a strong minimum is a generic (Baire category two) set. 4. Characterizations of the strong minimizer The first proposition describes the relationship between a function and its Moreau envelope, pertaining to the strong minimum. Several more results regarding strong minima follow. Proposition 4.. Let f : R n R. Then f attains a strong minimum at x if and only if e f attains a strong minimum at x. Proof. ( ) Assume that f attains a strong minimum at x. Then min x f(x) = min e f(x) = f( x) = e f( x). x 3

Let {x k } be such that e f(x k ) e f( x). We need to show that x k x. Since for some v k, and f(v k ) f( x), we have e f(x k ) = f(v k ) + 2 v k x k 2 0 2 x k v k 2 + f(v k ) f( x) = e f(x k ) e f( x) 0. (4.) Since both 2 x k v k 2 0 and f(v k ) f( x) 0, equation (4.) tells us that x k v k 0 and f(v k ) f( x). Since x is the strong minimizer of f, we have v k x. Therefore, x k x, and e f attains a strong minimum at x. ( ) Assume that e f attains a strong minimum at x, e f( x) = min e f. Then e f(x k ) e f( x) implies that x k x. Let f(x k ) f( x). We have Since f(x k ) f( x), we obtain f( x) e f( x) e f(x k ) f(x k ). e f(x k ) f( x) = e f( x). Therefore, x k x, and f attains a strong minimum at x. Proposition 4.2. Let f : R n R have a strong minimizer x. Then for all m N, inf f(x) > f( x). x x m Proof. This is clear by the definition of strong minimizer. Indeed, suppose that there exists m N such that f(x) = f( x). Then there exists a sequence {x k } k= with x k x and m inf x x m lim f(x k) = f( x). Since x is the strong minimizer of f, we have x k x, a contradiction. k Corollary 4.3. Let f : R n R have a strong minimizer x. Then for all m N, inf e f(x) > e f( x). x x m Proof. This follows directly from Propositions 4. and 4.2. The next result describes a distinguished property of convex functions defined in R n. Theorem 4.4. Let f Γ 0 (R n ). Then f has a strong minimizer if and only if f has a unique minimizer. 4

Proof. ( ) By definition, if f has a strong minimizer, then that minimizer is unique. ( ) Suppose f has a unique minimizer x. Because f Γ 0 (R n ), by [26, Theorem 8.7], all sublevelsets {x : f(x) α}, for any α f( x), have the same recession cone. Since the recession cone of {x : f(x) f( x)} = { x} is 0, each sublevel set of f is bounded. Since f has a unique minimizer x and all sublevel sets of f are bounded, we have that x is in fact a strong minimizer. Indeed, this follows by applying [, Fact 4.4.8], [, Theorem 4.4.0], and [, Theorem 5.23(e)(c)] in R n because f (0) = argmin f = { x}. Remark 4.5. See also [, Exercise 5.2. p. 234]. Remark 4.6. A more self-contained proof of Theorem 4.4, more in the style of that of Lemma 4.20, is provided below. Proof. Let f(x k ) f( x). We need to show that x k x. Suppose the contrary, x k x. Taking a subsequence if necessary, we can assume that there exists m N such that x k x m for all k N. Select y k such that y k [x k, x] { x : m x x }. m Then for all k, we have that y k m, and that there exists 0 λ k such that By the convexity of f, we have that for all k, y k = λ k x k + ( λ k ) x. f(y k ) λ k f(x k ) + ( λ k )f( x). As {y k } is bounded, λ k is bounded. Taking convergent subsequences if necessary, we may assume that y k ȳ, and λ k λ. Then f(ȳ) lim inf k f(y k) λf( x) + ( λ)f( x) = f( x). As f(ȳ) f( x), we have f(ȳ) = f( x). Since ȳ x, we have ȳ x, which contradicts m the fact that x is the unique minimizer of f. 5

Example 4.7. Theorem 4.4 can fail when the function is nonconvex. Consider the continuous but nonconvex function f : R R, f(x) = x2. x 4 + The function has a unique minimizer x = 0, but the minimizer is not strong, as any sequence {x k } that tends to ± gives a sequence of function values that tends to f( x). Example 4.8. Theorem 4.4 can also fail in infinite-dimensional space. Consider the continuous, convex function f : l 2 R : x f(x) := k x2 k. This function has a unique minimizer x = 0, but x is not a strong minimizer because f(e k ) = k 0 and e k 0. Here, e k = (0,..., 0,, 0,...), where the is in the kth position. See also [7, p. 268, Exercise 2]. Using Proposition 4.2 and Corollary 4.3, we can now single out two sets in Γ 0 (R n ) which are very important for our later proofs. Definition 4.9. For any m N, define the sets U m and E m as follows: { U m := E m := { f Γ 0 (R n ) : there exists z R n such that f Γ 0 (R n ) : there exists z R n such that Proposition 4.0. Let f m N k= inf f(x) f(z) > 0 x z m } inf e f(x) e f(z) > 0 x z m U m. Then f attains a strong minimum on R n. Proof. The proof follows the method of [4, Theorem II.]. Since f each m N there exists z m R n such that f(z m ) < inf f(x). x z m m 6 m N, } U m, we have that for.

Suppose that z p z m m for some p > m. By the definition of z m, we have f(z p ) > f(z m ). (4.2) Since z m z p m > p, we have f(z m ) > f(z p ) by the definition of z p. This contradicts inequality (4.2). Thus, z p z m < for each p > m. m This gives us that {z m } m= is a Cauchy sequence that converges to some x R n. It remains to be shown that x is the strong minimizer of f. Since f is lsc, we have f( x) lim inf f(z m) m ( lim inf m inf f(x). x R n \{ x} inf f(x) x z m m Let {y k } k= Rn be such that f(y k ) f( x), and suppose that y k x. Dropping to a subsequence if necessary, there exists ε > 0 such that y k x ε for all k. Thus, there exists p N such that y k z p for all k N. Hence, p f( x) f(z p ) < ) inf f(x) f(y k ) x z p p for all k N, a contradiction to the fact that f(y k ) f( x). Therefore, x is the strong minimizer of f. Theorem 4.. Let f E m. Then e f attains a strong minimum on R n, so f attains a strong minimum on R n. m N Proof. Applying Proposition 4.0, for each f m N E m, e f has a strong minimizer on R n. By Proposition 4., each corresponding f has the same corresponding strong minimizer. 4.2 The set of strongly convex functions is dense, but first category Next, we turn our attention to the set of strongly convex functions. The objectives here are to show that the set is contained in both U m and E m, dense in (Γ 0 (R n ), d), and meager in (Γ 0 (R n ), d). Theorem 4.2. Let f : R n R be strongly convex. Then f U m and f E m for all m N. Proof. Since f is strongly convex, f has a unique minimizer z. By Lemma 2.26, z is a strong minimizer, so that for any sequence {x k } such that f(x k ) f( x), we must have x k x. We want to show that inf x z m f(x) f(z) > 0. (4.3) 7

For any m N, equation (4.3) is true by Proposition 4.2. Therefore, f U m for all m N. By Lemma 2.23, e f is strongly convex. Therefore, by the same reasoning as above, f E m for all m N. We will need the following characterizations of strongly convex functions in later proofs. Note that (i) (iii) has been done by Rockafellar [25]. Lemma 4.3. Let f Γ 0 (R n ). The following are equivalent: (i) f is strongly convex. (ii) Prox f = k Prox g for some 0 k < and g Γ 0 (R n ). (iii) Prox f = kn for some 0 k < and N : R n R n nonexpansive. Proof. (i) (ii): Assume that f is strongly convex. Then f = g+σq where g Γ 0 (R n ), q = 2 2, and σ > 0. We have ( Prox f = (( + σ) Id + g) = ( + σ) ( Id + g ) ) (4.4) + σ ( = Id + g ) ( ) Id. (4.5) + σ + σ Define g(x) = ( + σ)g(x/( + σ)). Then g Γ 0 (R n ), g = g ( Id +σ), so ( ( )) ( ( Id Prox g = Id + g = ( + σ) Id + g ) + σ + σ ( = ( + σ) + g + σ ) ( Id ) + σ ( Id + σ )) (4.6) (4.7) = ( + σ) Prox f. (4.8) Therefore, Prox f = Prox g. +σ (ii) (i): Assume Prox f = k Prox g for some 0 k < and g Γ 0 (R n ). If k = 0, then f = ι {0}, and f is obviously strongly convex. Let us assume 0 < k <. The assumption (Id + f) = k(id + g) gives Id + f = (Id + g) (Id /k) = Id /k + g (Id /k), so f = (/k ) Id + g(id /k). Since /k > and g (Id /k) is monotone, we have that f is strongly monotone, which implies that f is strongly convex. (ii) (iii): This is clear because Prox g is nonexpansive, see, e.g., [5, Proposition 2.27]. (iii) (ii): Assume Prox f = kn where 0 k < and N is nonexpansive. If k = 0, then Prox f = 0 = 0 0, so (ii) holds because Prox ι {0} = 0. If 0 < k <, then N = /k Prox f. As Prox f = (Id + f) = (q + f) = e (f ), 8

we have N = (e (f )/k). This means that N is nonexpansive and the gradient of a differentiable convex function. By the Baillon-Haddad theorem [4] or [5, Corollary 8.6], N = Prox g for some g Γ 0 (R n ). Therefore, Prox f = k Prox g, i.e., (ii) holds true. Theorem 4.4. The set of strongly convex functions is dense in (Γ 0 (R n ), d). Equivalently, the set of strongly convex functions is dense in (e (Γ 0 (R n )), d). Proof. Let 0 < ε < and f Γ 0 (R n ). It will suffice to find h Γ 0 (R n ) such that h is strongly convex and d(h, f) < ε. For 0 < σ <, define g Γ 0 (R n ) by way of the proximal mapping: Prox g := ( σ) Prox f = ( σ) Prox f +σ Prox ι {0}. Such a g Γ 0 (R n ) does exists because g is the proximal average of f and ι {0} by [6], and g is strongly convex because of Lemma 4.3. Define h Γ 0 (R n ) by Indeed, some calculations give h = ( σ)f Then e h = e g e g(0) + e f(0), so that h := g e g(0) + e f(0). ( ) + σ σ σ q + σe f(0). (4.9) e h(0) = e f(0), (4.0) and Prox h = Prox g. Fix N large enough that < ε. Then 2 i 2 Choose σ such that This gives us that i=n i=n 2 i e f e g i + e f e g i i=n 2 i < ε 2. (4.) 0 < σ < ε 2 ε N(N + Prox f(0) )). (4.2) σn(n + Prox f(0) ) + σn(n + Prox f(0) ) < ε 2. (4.3) By equation (4.0) and the Mean Value Theorem, for some c [x, 0] we have e h(x) e f(x) = e h(x) e f(x) (e h(0) e f(0)) = e h(c) e f(c), x 0 = (Id Prox h)(c) (Id Prox f)(c), x 0 = Prox h(c) + Prox f(c), x 0 = ( σ) Prox f(c) + Prox f(c), x = σ Prox f(c), x. 9

Using the triangle inequality, the Cauchy-Schwarz inequality, and the fact that Prox f is nonexpansive, we obtain e h(x) e f(x) σ Prox f(c) x = σ Prox f(c) Prox f(0) + Prox f(0) x σ( Prox f(c) Prox f(0) + Prox f(0) ) x σ( c + Prox f(0) ) x σ( x + Prox f(0) ) x σn(n + Prox f(0) ), when x N. Therefore, e h e f N σn(n + Prox f(0) ). Applying equation (4.3), this implies that e f e g N + e f e g N σn(n + Prox f(0) ) + σn(n + Prox f(0) ) < ε 2. (4.4) Now considering the first N terms of our d function, we have N i= N e f e g i 2 i + e f e g i i= 2 i e f e g N + e f e g N = e f e g N + e f e g N N i= 2 i < e f e g N + e f e g N. (4.5) When inequality (4.2) holds, combining inequalities (4.), (4.4), and (4.5) yields d(h, f) < ε. Hence, for any arbitrary f Γ 0 (R n ) and 0 < ε <, there exists a strongly convex function h Γ 0 (R n ) such that d(h, f) < ε. That is, the set of strongly convex functions is dense in (Γ 0 (R n ), d). Because (Γ 0 (R n ), d) and (e (Γ 0 (R n )), d) are isometric by Corollary 3.8, it suffices to apply Lemma 2.23. The proof is complete. Remark 4.5. A shorter proof can be provided by approximating f Γ 0 (R n ) with f + ε 2. One can use either [7, Theorem 7.4.5], or the fact that f + ε 2 converges uniformly to f on bounded subsets of dom f. When ε 0, clearly f + ε 2 converges epigraphically to f, which gives the Attouch-Wets convergence because we are in a finite dimension setting. To quantify the epigraphical convergence in terms of the distance d based on Moreau envelopes (Definition 3.3), one has ( )( ) f e (f + ε 2 ) = (2ε + )e 2ε + 2ε + which indeed converges to e f uniformly on bounded sets of R n. + ε 2ε + 2, Theorem 4.6. The set of strongly convex functions is meager in (e (Γ 0 (R n )), d) where d is given by (3.). Equivalently, in (Γ 0 (R n ), d) the set of strongly convex function is meager. 20

Proof. Denote the set of strongly convex functions in e (Γ 0 (R n )) by S. Define { F m := g e (Γ 0 (R n )) : g } 2m 2 is convex on R n. We show that a) S = m N F m, b) for each m N, the set F m is closed in e (Γ 0 (R n )), and c) for each m N, the set F m has empty interior. Then S will have been shown to be a countable union of closed, nowhere dense sets, hence first category. a) ( ) Let f S. Then there exists σ > 0 such that f σ 2 2 is convex. Note that this means f σ 2 2 is convex for all σ (0, σ). Since σ > 0, there exists m N such that 0 < < σ. Hence, f m 2m 2 is convex, and f F m. Therefore, S F m. ( ) Let f F m for some m N. Then f 2m 2 is convex. Thus, with σ =, m we have that there exists σ > 0 such that f σ 2 2 is convex, which is the definition of strong convexity of f. Therefore, F m S, and since this is true for every m N, we have F m S. m N b) Let g F m. Then g 2m 2 is not convex. Equivalently, there exist λ (0, ) and x, y R n such that m N g(λx + ( λ)y) λg(x) ( λ)g(y) λ( λ) x y 2 > 2m. (4.6) Let N > max{ x, y }. Choose ε > 0 such that when d(f, g) < ε for f e (Γ 0 (R n )), we have f g N < ε for some ε > 0. In particular, f(λx + ( λ)y) λf(x) ( λ)f(y) λ( λ) g(λx + ( λ)y) λg(x) ( λ)g(y) = λ( λ) (f g)(λx + ( λ)y) λ(f g)(x) ( λ)(f g)(y) + λ( λ) g(λx + ( λ)y) λg(x) ( λ)g(y) 4 ε > λ( λ) λ( λ). Hence, when ε is sufficiently small, which can be achieved by making ε sufficiently small, we have f(λx + ( λ)y) λf(x) ( λ)f(y) x y 2 > λ( λ) 2m. This gives us, by equation (4.6), that f 2m 2 is not convex. Thus, f F m, so e (Γ 0 (R n )) \ F m is open, and therefore F m is closed. 2

c) That int F m = is equivalent to saying that e (Γ 0 (R n )) \ F m is dense. Thus, it suffices to show that for every ε > 0 and every g e (Γ 0 (R n )), the open ball B ε (g) contains an element of e (Γ 0 (R n )) \ F m. If g e (Γ 0 (R n )) \ F m, then there is nothing to prove. Assume that g F m. Then g is 2m -strongly convex, and has a strong minimizer x by Lemma 2.26. As g e (Γ 0 (R n )), g = e f for some f Γ 0 (R n ). We consider two cases. Case : Suppose that for every k > 0, there exists x k x such that f(x k ) < f( x) + k. Define h k := max { f, f( x) + k}. Then min h k = f( x) + k, f h k < f + k, so that e f e h k e f + k. We have g k := e h k e (Γ 0 (R n )), and g k g i < k for all i N. Choosing k sufficiently large guarantees that d(g k, g) < ε. We see that g k does not have a strong minimizer by noting that for every k, f( x) < f( x) + k, f(x k ) < f( x) + k, and h k( x) = h k (x k ) = f( x) + k. Thus, h k does not have a strong minimizer, which implies that g k = e h k does not either, by Proposition 4.. Therefore, g k F m. Case 2: If Case is not true, then there exists k such that f(x) f( x)+ for every x x. Then k we claim that f(x) = for all x x. Suppose for the purpose of contradiction that there exists x x such that f(x) <. As f Γ 0 (R n ), the function ϕ : [0, ] R defined by ϕ(t) := f(tx + ( t) x) is continuous by [32, Proposition 2..6]. This contradicts the assumption, therefore, f(x) = ι { x} (x) + f( x). Consequently, g(x) = e f(x) = f( x) + 2 x x 2. Now for every j N, define f j : R n R, { f( x), x x j f j (x) :=,, otherwise. We have f j Γ 0 (R n ), and f( x), x x j g j (x) := e f j (x) = ( ), 2 f( x) + x x 2 j, x x >. j Then {g j (x)} j N converges pointwise to e f = g, by [27, Theorem 7.37]. Thus, for sufficiently large j, d(gj, g) < ε. Since g j is constant on B ( x), g j is not strongly j convex, so g j F m. 22

Properties a), b) and c) all together show that the set of strongly convex function is meager in (e (Γ 0 (R n ), d). Note that (e (Γ 0 (R n ), d) and (Γ 0 (R n ), d) are isometric by Corollary 3.8. The proof is complete by using Lemma 2.23. Remark 4.7. (). Theorem 4.6 shows that the set S of strongly convex functions is first category, but what about the larger set U of uniformly convex functions to which S belongs? (See [5, 3, 32] for information on uniform convexity.) It is clear that U is bigger than S, for example [5, Definition 0.5] shows that f S f U, and [5, Exercise 0.7] states that f : R R, f(x) = x 4 is uniformly convex but not strongly convex. So is U generic or meager? This is a question for which the authors have no answer as of yet; it remains an open question. We refer the reader in particular to [32, Proposition 3.5.8, Theorem 3.5.0] for properties and characterizations of uniformly convex functions. (2). Strongly convex functions are important in optimization [5]. In [25], Rockafellar showed that the proximal point method associated to every strongly convex function converges at least at a linear rate to the optimal solution. Corollary 4.8. In (Γ 0 (R n ), d)) the set is first category. D := {f : f is differentiable and f is c-lipschitz for some c > 0} Proof. We know that (Γ 0 (R n ), d) ((Γ 0 (R n )), ˆd) is an isometry, and that f is strongly convex for some c > 0 if and only if f is differentiable on R n with f being -Lipschitz [32, Corollary c 3.5.(i) (vi)]. Since the set of strongly convex functions is first category in ((Γ 0 (R n )), ˆd), the set of differentiable convex functions with f being c-lipschitz for some c > 0 is a first category set in (Γ 0 (R n ), d). 4.3 The set of convex functions with strong minimizers is second category We present properties of the sets U m and E m, and show that the set of convex functions that attain a strong minimum is a generic set in (Γ 0 (R n ), d). Lemma 4.9. The sets U m and E m are dense in (Γ 0 (R n ), d). Proof. This is immediate by combining Theorems 4.2 and 4.4. To continue, we need the following result, which holds in Γ 0 (X) where X is any Banach space. Lemma 4.20. Let f Γ 0 (R n ), m N, and fix z dom f. Then inf f(x) f(z) > 0 if and only if x z m inf f(x) f(z) > 0. m x z m 23

Proof. ( ) Suppose that for z fixed, inf f(x) f(z) > 0. Since x z m we have ( ) Let inf f(x) f(z) > 0. m x z m inf m x z m inf f(x) f(z) inf f(x) f(z), m x z m x z m f(x) f(z) > 0, and suppose that inf f(x) f(z) 0. x z m Then for each k with k N, there exists y k with y k z m such that f(y k) f(z) + k. Take z k [y k, z] { x R n : m x z m}. Then z k = λ k y k + ( λ k )z for some λ k [0, ]. By the convexity of f, we have Now f(z k ) = f(λ k y k + ( λ k )z) λ k f(y k ) + ( λ k )f(z) λ k f(z) + ( λ k )f(z) + λ k k = f(z) + λ k k f(z) + k. inf f(x) f(z k ) f(z) +, so when k we obtain m x z k m This contradicts the fact that inf f(x) f(z) 0. m x z m inf m x z m f(x) f(z) > 0. Therefore, Lemma 4.2. The set E m is an open set in (Γ 0 (R n ), d). Proof. Fix m N, and let f E m. Then there exists z R n such that 0. Hence, by Lemma 4.20, inf e f(x) e f(z) > 0. m x z m inf x z m f(x) f(z) > 0. inf e f(x) e f(z) > x z m Choose j large enough that B m [z] B j (0). Let g Γ 0 (R n ) be such that d(f, g) < ε, where 0 < ε < inf e f(x) e f(z) m x z m 2 (2 j + inf e f(x) e f(z) m x z m ) < 2 j. (4.7) 24

The reason for this bound on ε will become apparent at the end of the proof. Then i= 2 i e f e g i + e f e g i < ε. In particular for our choice of j, we have that 2 j ε < by inequality (4.7), and that Define α := In other words, 2 j e f e g j + e f e g j < ε, e f e g j < 2 j ε( + e f e g j ), sup e f(x) e g(x) ( 2 j ε) < 2 j ε, x j sup e f(x) e g(x) < 2j ε x j 2 j ε. 2j ε. Then sup e 2 j ε f(x) e g(x) < α. Hence, x j e f(x) e g(x) < α for all x with x j. e f(x) α < e g(x) < e f(x) + α for all x with x j. Since B m [z] B j (0), we can take the infimum over m x z m to obtain inf e f(x) α m x z m inf e g(x) m x z m inf e f(x) + α. (4.8) m x z m Using inequality (4.8) together with the fact that e g(z) e f(z) < α yields ( ) inf e g(x) e g(z) m x z m inf e f(x) α m x z m = inf e f(x) e f(z) 2α. m x z m (e f(z) + α) Hence, if we have α < inf e f(x) e f(z) m x z m, (4.9) 2 inf e g(x) e g(z) > 0. (4.20) m x z m 25

Recalling that α = 2j ε, we solve equation (4.9) for ε to obtain 2 j ε ε < inf e f(x) e f(z) m x z m 2 (2 j + inf e f(x) e f(z) m x z m ). Thus, inequality (4.20) is true whenever d(f, g) < ε for any ε that respects inequality (4.7). Applying Lemma 4.20 to inequality (4.20), we conclude that inf e g(x) e g(z) > 0. x z m Hence, if g Γ 0 (R n ) is such that d(f, g) < ε, then g E m. Therefore, E m is open. We are now ready to present the main results of the paper. Theorem 4.22. In X := (Γ 0 (R n ), d), the set S := {f Γ 0 (R n ) : f attains a strong minimum} is generic. Proof. By Lemmas 4.9 and 4.2, we have that E m is open and dense in X. Hence, G := E m m N is a countable intersection of open, dense sets in X, and as such G is generic in X. Let f G. By Corollary 4., f attains a strong minimum on R n. Thus, every element of G attains a strong minimum on R n. Since G is generic in X and G S, we conclude that S is generic in X. Remark 4.23. This result is stated as Exercise 7.5.0 in [7, p. 269]. However, the approach taken there uses the Attouch-Wets topology defined by uniform convergence on bounded subsets of the distance function, associated with epigraphs of convex functions. Theorem 4.24. In X := (Γ 0 (R n ), d), the set S := {f Γ 0 (R n ) : f is coercive} is generic. Proof. Define the set Γ (R n ) := Γ 0 (R n ) + x, in the sense that for any function f Γ 0 (R n ), the function f + x, Γ (R n ). Since any such f + x, is proper, lsc, and convex, we have Γ (R n ) Γ 0 (R n ). Now, since for any f Γ 0 (R n ) we have that f x Γ 0 (R n ), this gives us that f Γ 0 (R n ) + x = Γ (R n ). Therefore, Γ (R n ) = Γ 0 (R n ). By Theorem 4.22, there exists a generic set G Γ 0 (R n ) such that for every f G, f attains a strong minimum at some point x, and hence 0 f(x). Then, given any x fixed, there exists a generic set G x that contains a dense G δ set, such that 0 (f + x )(x). Thus, for each f G x there exists x R n such that x f(x). By Fact 2.4, it is possible to construct the set D := { x i } i= such that D = R n. Then each set G x i, i N, contains a dense G δ set. Therefore, the set G := contains a dense G x i i= G δ set. Let f G. Then for each i N, x i f(x) for some x R n. That is, x i ran f. So D := { x i } ran f, and D ran f. Since D = R n, we have R n = ran f. By i= Facts 2.5 and 2.6, ran f is almost convex; there exists a convex set C such that C ran f C. 26

Then C = R n. As C is convex, by [26, Theorem 6.3] we have the relative interior ri C = ri C, so ri C = R n. Thus, R n = ri C C, which gives us that C = R n. Therefore, ran f = R n. By Fact 2.8, ran f dom(f ). Hence, dom f = R n f(x). By Fact 2.22, we have that =. x lim x Therefore, f is coercive for all f G. Since G is generic in X and G S, we conclude that S is generic in X. Theorem 4.25. In (Γ 0 (R n ), d), the set S := {f Γ 0 (R n ) : dom f = R n } is generic. Proof. Note that (Γ 0 (R n )) = Γ 0 (R n ). In ((Γ 0 (R n )), d), by Theorem 4.24, the set {f (Γ 0 (R n )) : f is coercive} is generic. Since f is coervcive if and only if f has dom f = R n by Fact 2.22, the proof is done. Combining Theorems 4.22, 4.24 and 4.25, we obtain Corollary 4.26. In (Γ 0 (R n ), d), the set is generic. S := {f Γ 0 (R n ) : dom f = R n, dom f = R n, f has a strong minimizer} 5 Conclusion Endowed with the Attouch-Wets metric, based on the Moreau envelope, the set of proper lower semicontinuous convex functions on finite-dimensional space forms a complete metric space. In this complete metric space, the topology is epi-convergence topology. We have proved several Baire category results. In particular, we have shown that in (Γ 0 (R n ), d) the set of strongly convex functions is category one, the set of functions that attain a strong minimum is category two, and the set of coercive functions is category two. Several other results about strongly convex functions and functions with strong minima are included. In future work that has already commenced, we will continue to develop the theory of Moreau envelopes, providing characterizations and illustrative examples of how to calculate them, and extend results in this paper to convex functions defined on Hilbert spaces and to prox-bounded functions on R n. A natural question to ask here, as a referee did, are these genericity results sharp? According to [32, Proposition 3.5.8, Theorem 3.5.0], a uniformly convex function is coercive, has a strong minimizer, and its Fenchel conjugate f is differentiable with f being uniformly continuous. Can more conclusions be drawn, for instance about the set of uniformly convex functions of which the set of strongly convex functions is a subset? The authors do not have the answer, and will consider the question in future research. Acknowledgements The authors wish to thank the two anonymous referees for their valuable suggestions for improvement of this paper, in particular, Remarks 2.6, 3.6, 4.5. The results of their comments were a better overall presentation and a broadened connection to existing literature on the topic. 27