arxiv: v1 [math.oc] 13 Sep 2018

Size: px
Start display at page:

Download "arxiv: v1 [math.oc] 13 Sep 2018"

Transcription

1 Hamilonian Descen Mehods Chris J. Maddison 1,2,*, Daniel Paulin 1,*, Yee Whye Teh 1,2, Brendan O Donoghue 2, and Arnaud Douce 1 arxiv: v1 [mah.oc] 13 Sep Deparmen of Saisics, Universiy of Oxford 2 DeepMind, London, UK * Boh auhors conribued equally o his work. 14 Sepember 2018 Absrac We propose a family of opimizaion mehods ha achieve linear convergence using firsorder gradien informaion and consan sep sizes on a class of convex funcions much larger han he smooh and srongly convex ones. This larger class includes funcions whose second derivaives may be singular or unbounded a heir minima. Our mehods are discreizaions of conformal Hamilonian dynamics, which generalize he classical momenum mehod o model he moion of a paricle wih non-sandard kineic energy exposed o a dissipaive force and he gradien field of he funcion of ineres. They are firs-order in he sense ha hey require only gradien compuaion. Ye, crucially he kineic gradien map can be designed o incorporae informaion abou he convex conjugae in a fashion ha allows for linear convergence on convex funcions ha may be non-smooh or non-srongly convex. We sudy in deail one implici and wo explici mehods. For one explici mehod, we provide condiions under which i converges o saionary poins of non-convex funcions. For all, we provide condiions on he convex funcion and kineic energy pair ha guaranee linear convergence, and show ha hese condiions can be saisfied by funcions wih power growh. In sum, hese mehods expand he class of convex funcions on which linear convergence is possible wih firs-order compuaion. 1 Inroducion We consider he problem of unconsrained minimizaion of a differeniable funcion f : R d R, min fx, 1 x R d by ieraive mehods ha require only he parial derivaives fx = fx/ x n R d of f, known also as firs-order mehods [38, 45, 41]. These mehods produce a sequence of ieraes x i R d, and our emphasis is on hose ha achieve linear convergence, i.e., as a funcion of he ieraion i hey saisfy fx i fx = Oλ i for some rae λ > 1 and x R d a global minimizer. We briefly consider non-convex differeniable f, bu he bulk of our analysis focuses on he case of convex differeniable f. Our resuls will also occasionally require wice differeniabiliy of f. 1

2 Ieraes Objecive x 2 log fxi Gradien descen Classical momenum Hamilonian descen x 1 ieraion i Figure 1: Opimizing fx = [x 1 + x 2 ] 4 + [x 1 /2 x 2 /2] 4 wih hree mehods: gradien descen wih fixed sep size equal o 1/L 0 where L 0 = λ max 2 fx 0 is he maximum eigenvalue of he Hessian 2 f a x 0 ; classical momenum, which is a paricular case of our firs explici mehod wih kp = [p p 2 2 ]/2 and fixed sep size equal o 1/L 0 ; and Hamilonian descen, which is our firs explici mehod wih kp = 3/4[p 1 4/3 + p 2 4/3 ] and a fixed sep size. The convergence raes of firs-order mehods on convex funcions can be broadly separaed by he properies of srong convexiy and Lipschiz smoohness. Taken ogeher hese properies for convex f are equivalen o he condiions ha he following lef hand bound srong convexiy and righ hand bound smoohness hold for some µ, L 0, and all x, y R d, µ 2 x y 2 2 fx fy fy, x y L 2 x y 2 2, 2 where x, y = d n=1 xn y n is he sandard inner produc and x 2 = x, x is he Euclidean norm. For wice differeniable f, hese properies are equivalen o he condiions ha eigenvalues of he marix of second-order parial derivaives 2 fx = 2 fx/ x n x m R d d are everywhere lower bounded by µ and upper bounded by L, respecively. Thus, funcions whose second derivaives are coninuously unbounded or approaching 0, canno be boh srongly convex and smooh. Boh bounds play an imporan role in he performance of firs-order mehods. On he one hand, for smooh and srongly convex f, he ieraes of many firs-order mehods converge linearly. On he oher hand, for any firs-order mehod, here exis smooh convex funcions and non-smooh srongly convex funcions on which is convergence is sub-linear, i.e., fx i fx Oi 2 for any firs-order mehod on smooh convex funcions. See [38, 45, 41] for hese classical resuls and [25] for oher more exoic scenarios. Moreover, for a given mehod i can someimes be very easy o find examples on which is convergence is slow; see Figure 1, in which gradien descen wih a fixed sep size converges slowly on fx = [x 1 + x 2 ] 4 + [x 1 /2 x 2 /2] 4, which is no srongly convex as is Hessian is singular a 0, 0. The cenral assumpion in he wors case analyses of firs-order mehods is ha informaion abou f is resriced o black box evaluaions of f and f locally a poins x R d, see [38, 41]. In his paper we assume addiional access o firs-order informaion of a second differeniable funcion k : R d R and show how k can be designed o incorporae informaion abou f o yield pracical mehods ha converge linearly on convex funcions. These mehods are derived by discreizing 2

3 a sub-linear convergence in coninuous ime linear convergence in coninuous ime b fx = x b /b kp = p a /a linear convergence of 1s explici mehod linear convergence of 2nd explici mehod quadraic suiable for srongly convex and smooh Figure 2: Convergence Regions for Power Funcions. Shown are regions of disinc convergence ypes for Hamilonian descen sysems wih fx = x b /b, kp = p a /a for x, p R and a, b 1,. We show in Secion 2 convergence is linear in coninuous ime iff 1/a + 1/b 1. In Secion 4 we show ha he assumpions of he explici discreizaions can be saisfied if 1/a + 1/b = 1, leaving his as he only suiable pairing for linear convergence. Ligh doed line is he line occupied by classical momenum wih kp = p 2 /2. he conformal Hamilonian sysem [33]. These sysems are parameerized by f, k : R d R and γ 0, wih soluions x, p R 2d, x = kp p = fx γp. 3 From a physical perspecive, hese sysems model he dynamics of a single paricle locaed a x wih momenum p and kineic energy kp being exposed o a force field f and a dissipaive force. For his reason we refer o k as, he kineic energy, and k, he kineic map. When he kineic map k is he ideniy, kp = p, hese dynamics are he coninuous ime analog of Polyak s heavy ball mehod [44]. Le f c x = fx + x fx denoe he cenered version of f, which akes is minimum a 0, wih minimum value 0. Our key observaion in his regard is ha when f is convex, and k is chosen as kp = fc p + fc p/2 where fc p = sup{ x, p f c x : x R d } is he convex conjugae of f c, hese dynamics have linear convergence wih rae independen of f. In oher words, his choice of k acs as a precondiioner, a generalizaion of using kp = p, A 1 p /2 for fx = x, Ax /2. Thus k can exploi global informaion provided by he conjugae fc o condiion convergence for generic convex funcions. To preview he flavor of our resuls in deail, consider he special case of opimizing he power funcion fx = x b /b for x R and b 1, iniialized a x 0 > 0 using sysem 3 or discreizaions of i wih kp = p a /a for p R and a 1,. FOr his choice of f, i can be shown ha fc p = fc p = kp when a = b/b 1. In line wih his, in Secion 2 we show ha 3 exhibis linear convergence in coninuous ime if and only if 1/a + 1/b 1. In Secion 3 we propose wo explici discreizaions wih fixed sep sizes; in Secion 4 we show ha he firs explici discreizaion converges if 1/a + 1/b = 1 and b 2, and he second converges if 1/a + 1/b = 1 and 1 < b 2. This means ha he only suiable pairing corresponds in his case o he choice kp fc p+fc p. Figure 2 summarizes his discussion. Reurning o Figure 1, we can compare 3

4 he use of he kineic energy of Polyak s heavy ball wih a kineic energy ha relaes appropriaely o he convex conjugae of fx = [x 1 + x 2 ] 4 + [x 1 /2 x 2 /2] 4. Mos convex funcions are no simple power funcions, and compuing f c p + f c p exacly is rarely feasible. To make our observaions useful for numerical opimizaion, we show ha linear convergence is sill achievable in coninuous ime even if kp α max{f c p, f c p} for some 0 < α 1 wihin a region defined by x 0. We sudy hree discreizaions of 3, one implici mehod and wo explici ones which are suiable for funcions ha grow asympoically fas or slow, respecively. We prove linear convergence raes for hese under appropriae addiional assumpions. We inroduce a family of kineic energies ha generalize he power funcions o capure disinc power growh near zero and asympoically far from zero. We show ha he addiional assumpions of discreizaion can be saisfied for his family of k. We derive condiions on f ha guaranee he linear convergence of our mehods when paired wih a specific choice of k from his family. These condiions generalize he quadraic growh implied by smoohness and srong convexiy, exending i o general power growh ha may be disinc near he minimum and asympoically far from he minimum, which we refer o as ail and body behavior, respecively. Sep sizes can be fixed independenly of he iniial posiion and ofen dimension, and do no require adapaion, which ofen leads o convergence problems, see [57]. Indeed, we analyze a kineic map k ha resembles he ierae updaes of some popular adapive gradien mehods [13, 59, 18, 27], and show ha i condiions he opimizaion of srongly convex funcions wih very fas growing ails non-smooh. Thus, our mehods provide a framework opimizing poenially non-smooh or non-srongly convex funcions wih linear raes using firs-order compuaion. The organizaion of he paper is as follows. In he res of his secion, we cover noaion, review a few resuls from convex analysis, and give an overview of he relaed lieraure. In Secion 2, we show he linear convergence of 3 under condiions on he relaion beween he kineic energy k and f. We show a parial converse ha in some seings our condiions are necessary. In Secion 3, we presen he hree discreizaions of he coninuous dynamics and sudy he assumpions under which linear raes can be guaraneed for convex funcions. For one of he discreizaions, we also provide condiions under which i converges o saionary poins of non-convex funcions. In Secion 4, we sudy a family of kineic energies suiable for funcions wih power growh. We describe he class of funcions for which he assumpions of he discreizaions can be saisfied when using hese kineic energies. 1.1 Noaion and Convex Analysis Review We le x, y = d n=1 xn y n denoe he sandard inner produc for x, y R d and x 2 = x, x he Euclidean norm. For a differeniable funcion f : R d R, he gradien fx = fx/ x n R d is he vecor of parial derivaives a x. For wice-differeniable f, he Hessian 2 hx = 2 fx/ x n x m R d d is he marix of second-order parial derivaives a x. The noaion x denoes he soluion x : [0, R d o a differenial equaion wih derivaive in denoed x. x i denoes he ieraes x i : {0, 1,...} R d of a discree sysem. Consider a convex funcion h : C R ha is defined on a convex domain C R d and differeniable on he inerior inc. The convex conjugae h : R d R is defined as h p = sup{ x, p hx : x C} 4 and i is iself convex. I is easy o show from he definiion ha if g : C R is anoher convex funcion such ha gx hx for all x C, hen h p g p for all p R d. Because we make 4

5 such exensive use of i, we remind readers of he Fenchel-Young inequaliy: for x C and p R d, x, p hx + h p, 5 which is easily derived from he definiion of h, or see Secion 12 of [47]. Theorem 26.4 of [47], For x inc by x, hx = hx + h hx. 6 Le y R d, c R \ {0}. If gx = hx + y c, hen g p = h p p, y + c Theorem 12.3 [47]. If hx = x b /b for x R and b 1,, hen h p = p a /a where a = b/b 1 page 106 of [47]. If gx = chx, hen g p = ch p/c Table 3.2 [6]. For hese and more on h, we refer readers o [47, 8, 6]. 1.2 Relaed Lieraure Sandard references on convex opimizaion and he convergence analysis of firs-order mehods include [38, 45, 3, 8, 41, 9]. The heavy ball mehod was inroduced by Polyak in his seminal paper [44]. In his paper, local convergence wih linear rae was shown i.e., when he iniial posiion is sufficienly close o he local minimum. For quadraic funcions, i can be shown ha he convergence rae for opimally chosen sep sizes is proporional o he square roo of he condiional number of he Hessian, similarly o conjugae gradien descen see e.g., [46]. As far as we know, global convergence of he heavy ball mehod for non-quadraic funcions was only recenly esablished in [19] and [30], see [22] for an exension o sochasic average gradiens. The heavy ball mehod forms he basis of he some of he mos successful opimizaion mehods for deep learning, see e.g., [54, 27], and he recen review [7]. Hereafer, classical momenum refers o any firs-order discreizaion of he coninuous analog of Polyak s heavy ball wih possibly subopimal sep sizes. Neserov obained upper and lower bounds of maching order for firs-order mehods for smooh convex funcions and smooh srongly convex funcions, see [41]. In Necoara e al. [36], he assumpion of srong convexiy was relaxed, and under a weaker quadraic growh condiion, linear raes were obained by several well known opimizaion mehods. Several oher auhors obained linear raes for various classes of non-srongly convex or non-uniformly smooh funcions, see e.g., [37, 26, 11, 58, 14, 48]. In recen years, here has been ineres in he opimizaion communiy in looking a he coninuous ime ODE limi of opimizaion mehods, when he sep size ends o zero. Su e al. [52, 53] have found he coninuous ime limi of Neserov s acceleraed gradien descen. This resul improves he inuiion abou Neserov s mehod, as he proofs of convergence raes in coninuous ime are raher elegan and clear, while he previous proofs in discree ime are no as ransparen. Follow-ups have sudied he coninuous ime counerpars o acceleraed mirror descen [28] as well as higher order discreizaions of such sysems [55, 56]. Sudying coninuous ime sysems for opimizaion can separae he concerns of designing an opimizer from he difficulies of discreizaion. This perspecive has resuled in numerous oher recen works ha propose new opimizaion mehods, and sudy exising ones via heir coninuous ime limi, see e.g., [4, 1, 15, 24, 10, 16, 17]. Conformal Hamilonian sysems 3 are sudied in geomery [33, 5], because heir soluions preserve symplecic area up o a consan; when γ = 0 symplecic area is exacly preserved, when γ > 0 symplecic area dissipaes uniformly a an exponenial rae [33]. In classical mechanics, 5

6 Hamilonian dynamics sysem 3 wih γ = 0 are used o describe he moion of a paricle exposed o he force field f. Here, he mos common form for k is kp = p, p /2m, where m is he mass, or in relaivisic mechanics, kp = c p, p + m 2 c 2 where c is he speed of ligh, see [21]. In he Markov Chain Mone Carlo lieraure, where discreized Hamilonian dynamics again γ = 0 are used o propose moves in a Meropolis Hasings algorihm [34, 23, 12, 35], k is viewed as a degree of freedom ha can be used o improve he mixing properies of he Markov chain [20, 31]. Sochasic differenial equaions similar o 3 wih γ > 0 have been sudied from he perspecive of designing k [32, 51]. 2 Coninuous Dynamics In his secion, we moivae he discree opimizaion algorihms by inroducing heir coninuous ime counerpars. These sysems are differenial equaions described by a Hamilonian vecor field plus a dissipaion field. Thus, we briefly review Hamilonian dynamics, he coninuous dynamics of Hamilonian descen, and derive convergence raes for convex f in coninuous ime. 2.1 Hamilonian Sysems In he Hamilonian formulaion of mechanics, he evoluion of a paricle exposed o a force field f is described by is locaion x : [0, R d and momenum p : [0, R d as funcions of ime. The sysem is characerized by he oal energy, or Hamilonian, Hx, p = kp + fx fx, 7 where x is one of he global minimizers of f and k : R d R is called he kineic energy. Throughou, we consider kineic energies k ha are a sricly convex funcions wih minimum a k0 = 0. The Hamilonian H defines he rajecory of a paricle x and is momenum p via he ordinary differenial equaion, x = p Hx, p = kp p 8 = x Hx, p = fx. For any soluion of his sysem, he value of he oal energy over ime H = Hx, p is conserved as H = kp, p + fx, x = 0. Thus, he soluions of he Hamilonian field oscillae, exchanging energy from x o p and back again. 2.2 Coninuously Descending he Hamilonian The soluions of a Hamilonian sysem remain in he level se {x, p : H = H 0 }. To drive such a sysem owards saionary poins, he oal energy mus reduce over ime. Consider as a moivaing example he coninuous sysem x = fx γx, which describes Polyak s heavy ball algorihm in coninuous ime [44]. Leing x = p, he heavy ball sysem can be rewrien as x = p p = fx γp. 9 Noe ha his sysem can be viewed as a combinaion of a Hamilonian field wih kp = p, p /2 and a dissipaion field, i.e., x, p = F x, p + Gx, p where F x, p = p, fx and 6

7 Hamilonian Field Dissipaion Field Conformal Hamilonian Field momenum p + = posiion x Figure 3: A visualizaion of a conformal Hamilonian sysem. Gx, p = 0, γp, see Figure 3 for a visualizaion. This is naurally exended o define he more general conformal Hamilonian sysem [33], x = kp p = fx γp. 3 revisied wih γ 0,. When k is convex wih a minimum k0 = 0, hese sysems descend he level ses of he Hamilonian. We can see his by showing ha he oal energy H is reduced along he rajecory x, p, H = kp, p + fx, x = γ kp, p γkp 0, 10 where we have used he convexiy of k, and he fac ha i is minimised a k0 = 0. The following proposiion shows some exisence and uniqueness resuls for he dynamics 3. We say ha H is radially unbounded if Hx, p when x, p 2, e.g., his would be implied if f and k were sricly convex wih unique minima. Proposiion 2.1 Exisence and uniqueness. If f and k are coninuous, k is convex wih a minimum k0 = 0, and H is radially unbounded, hen for every x, p R d, here exiss a soluion x, p of 3 defined for every 0 wih x 0, p 0 = x, p. If in addiion, f and k are coninuously differeniable, hen his soluion is unique. Proof. Firs, only assuming coninuiy, i follows from Peano s exisence heorem [42] ha here exiss a local soluion on an inerval [ a, a] for some a > 0. Le [0, A denoe he righ maximal inerval where a soluion of 3 saisfying ha x 0 = x and p 0 = p exis. From 10, i follows ha H 0, and hence H H 0 for every [0, A. Now by he radial unboundedness of H, and he fac ha H H 0, i follows ha he compac se {x, p : Hx, p H 0 } is never lef by he dynamics, and hence by Theorem 3 of [43] page 91, we mus have A =. The uniqueness under coninuous differeniabiliy follows from he Fundamenal Exisence Uniqueness Theorem on page 74 of [43]. As shown in he nex proposiion, 10 implies ha conformal Hamilonian sysems approach saionary poins of f. 7

8 Proposiion 2.2 Convergence o a saionary poin. Le x, p be a soluion o he sysem 3 wih iniial condiions x 0, p 0 = x, p R 2d, f coninuously differeniable, and k coninuously differeniable, sricly convex wih minimum a 0 and k0 = 0. If f is bounded below and H is radially unbounded, hen fx 2 0. Proof. Since f is bounded below, H 0. Since H is radially unbounded, he se B := {x, p R 2d : Hx, p Hx 0, p 0 + 1} is a compac se ha conains x 0, p 0 in is inerior. Moreover, by 10, we also have x, p B for all > 0. Consider he se M = {x, p : H = 0} B. Since k is sricly convex, his se is equivalen o {x, p : p 2 = 0} B. The larges invarian se of he dynamics 3 inside M is I = {x, p R 2d : p 2 = 0, fx 2 = 0} B. By LaSalle s principle [29], all rajecories sared from B mus approach I. Since f is a coninuous bounded funcion on he compac se B, here is a poin x B such ha fx fx for every x B i.e. he minimum is aained in B by he exreme value heorem see [49]. Moreover, due o he definiion of B, x is in is inerior, hence fx 2 = 0 and herefore x, 0 I. Thus he se I is non-empy noe ha I migh conain oher local minima as well. Remark 1. This consrucion can be generalized by modifying he γp componen of 3 o a more general dissipaion field γdp. If he dissipaion field is everywhere aligned wih he kineic map, kp, Dp 0, hen hese sysems dissipae energy. We have no found alernaives o Dp = γp ha resul in linear convergence in general. 2.3 Coninuous Hamilonian Descen on Convex Funcions In his secion we sudy how k can be designed o condiion he sysem 3 for linear convergence in logfx fx. Alhough he soluions x, p of 3 approach saionary poins under weak condiions, o derive raes we consider he case when f is convex. To moivae our choice of k, consider he quadraic funcion fx = x, Ax /2 wih kp = p, A 1 p /2 for posiive definie symmeric A R d d. Now 3 becomes, x = A 1 p p = Ax γp. 11 By he change of variables v = A 1 p, his is equivalen o x = v v = x γv, 12 which is a universal equaion and hence he convergence rae of 11 is independen of A. Alhough his kineic energy implemens a consan precondiioner for any f, for his specific f k is is convex conjugae f. This suggess he core idea of his paper: aking k relaed in some sense o f for more general convex funcions may condiion he convergence of 3. Indeed, we show in his secion ha, if he kineic energy kp upper bounds a cenered version of f p, hen he convergence of 3 is linear. More precisely, define he following cenered funcion f c : R d R, f c x = fx + x fx. 13 The convex conjugae of f c is given by f c p = f p x, p +fx and is minimized a f c 0 = 0. Imporanly, as we will show in he final lemma of his secion, aking a kineic energy such ha 8

9 kp α maxfc p, fc p for some α 0, 1] suffices o achieve linear raes on any differeniable convex f in coninuous ime. The consan α is included o capure he fac ha k may under esimae fc by some consan facor, so long as i is posiive. If α does no depend in any fashion on f, hen he convergence rae of 3 is independen of f. In Secion 2.4 we also show a parial converse for some simple problems aking a k no saisfying hose assumpions resuls in sublinear convergence for almos every pah excep for one unique curve and is mirror. Remark 2. There is an ineresing connecion o dualiy heory for a specific choice of k. In a sligh abuse of represenaion, consider rewriing he original problem as min fx = min x R d x R d 1 fx + fx. 2 The Fenchel dual of his problem is equivalen o he following problem afer a small reparameerizaion of p see Chaper 31 of [47], 1 max p R d 2 f p f p. The Fenchel dualiy heorem guaranees ha for a given pair of primal-dual variables x, p R d, he dualiy gap beween he primal objecive fx and he dual objecive f p f p/2 is posiive. Thus, fx f p f p/2 = fx fx + f p + f p/2 + fx = fx fx + f c p + f c p/2 0. Thus, for he choice kp = f c p + f c p/2, which as we will show implies linear convergence of 3, he Hamilonian Hx, p is exacly he dualiy gap beween he primal and dual objecives. Linear raes in coninuous ime can be derived by a Lyapunov funcion V : R d d [0, ha summarizes he oal energy of he sysem, conracs exponenially or linearly in log-space, and is posiive unless x, p = x, 0. Ulimaely we are rying o prove a resul of he form V λv for some rae λ > 0. As he energy H is decreasing, i suggess using H as a Lyapunov funcion. Unforunaely, his will no suffice, as H plaeaus insananeously H = 0 a poins on he rajecory where p = 0 despie x possibly being far from x. However, when p = 0, he momenum field reduces o he erm fx and he derivaive of x x, p in is insananeously sricly negaive x x, fx < 0 for convex f unless we are a x, 0. This suggess he family of Lyapunov funcions ha we sudy in his paper, Vx, p = Hx, p + β x x, p, 14 where β 0, γ see he nex lemma for condiions ha guaranee ha i is non-negaive. As wih H, V is used o indicae Vx, p a ime along a soluion o 3. Before moving on o he final lemma of he secion, we prove wo echnical lemmas ha will give us useful conrol over V hroughou he paper. The firs lemma describes how β mus be consrained for V o be posiive and o rack H closely, so ha i is useful for he analysis of he convergence of H and ulimaely f. 9

10 Lemma 2.3 Bounding he raio of H and V. Le x R d, f : R d R convex wih unique minimum x, k : R d R sricly convex wih minimum k0 = 0, α 0, 1] and β 0, α]. If p R d is such ha kp αfc p, hen Hx, p x x, p kp/α + fx fx, α 15 Hx, p Vx, p. 16 α β α If p R d is such ha kp αf c p, hen Proof. Assuming ha kp αf c p, we have Hx, p x x, p kp/α + fx fx, α 17 Vx, p α+β α Hx, p. 18 kp/α + f c x x f c p + f c x x x x, p f c x x + f c x x = x x, p, hence we have follows by rearrangemen. The proof of 17 and 18 is similar. Lemma 2.3 consrains β in erms of α. For a resul like V λv, we will need o conrol β in erms of he magniude γ of he dissipaion field. The following lemma provides consrains on β and, under hose consrains, he opimal β. The proof can be found in Secion A of he Appendix. Lemma 2.4 Convergence raes in coninuous ime for fixed α. Given γ 0, 1, f : R d R differeniable and convex wih unique minimum x, k : R d R differeniable and sricly convex wih minimum k0 = 0. Le x, p R d be he value a ime of a soluion o he sysem 3 such ha here exiss α 0, 1] where kp αfc p. Define αγ αβ βγ β1 γ λα, β, γ = min,. 19 α β 1 β If β 0, minα, γ], hen Finally, V λα, β, γv. 1. The opimal β 0, minα, γ], β = arg max β λα, β, γ and λ = λα, β, γ are given by, β = 1 1+α α + γ2 1 γα 2 + γ2 4, 20 1 λ 1 α 1 γα + γ2 = 1 γα 2 + γ2 4 for 0 < α < 1, 21 γ1 γ 2 γ for α = 1, 10

11 2. If β 0, αγ/2], hen λα, β, γ = β1 γ, and 22 1 β γ β γ 2 1 γ/4 kp βγ x x, p β x x, fx β1 γkp + fx fx + β x x, p. 23 These wo lemmas are sufficien o prove he linear conracion of V and he conracion fx fx α α β H 0 exp λ under he assumpion of consan α and β. Sill, he consan α, which conrols our approximaion of fc may be quie pessimisic if i mus hold globally along x, p as he sysem converges o is minimum. Insead, in he final lemma ha collecs he convergence resul for his secion, we consider he case where α may increase as convergence proceeds. To suppor an improving α, our consan β will now have o vary wih ime and we will be forced o ake slighly subopimal β and λ given by 22 of Lemma 2.4. Sill, he improving α will be imporan in fuure secions for ensuring ha we are able o achieve posiion independen sep sizes. We are now ready o presen he cenral resul of his secion. Under Assumpions A we show linear convergence of 3. In general, he dependence of he rae of linear convergence on f is via he funcion α and he consan C α,γ in our analysis. Assumpions A. A.1 f : R d R differeniable and convex wih unique minimum x. A.2 k : R d R differeniable and sricly convex wih minimum k0 = 0. A.3 γ 0, 1. A.4 There exiss some differeniable non-increasing convex funcion α : [0, 0, 1] and consan C α,γ 0, γ ] such ha for every p R d, and ha for every y [0, kp αkp maxf c p, f c p 24 C α,γ α yy < αy. 25 In paricular, if kp α maxf c p, f c p for a consan α 0, 1], hen he consan funcion αy = α serves as a valid, bu pessimisic choice. Remark 3. Assumpion A.4 can be saisfied if a symmeric lower bound on f is known. example, srong convexiy implies For fx + x fx µ 2 x 2 2. This in urn implies fc p p 2 2 /2µ. Because kp = p 2 2 /2µ is symmeric, i saisfies A.4 which explains why condiions relaing o srong convexiy are necessary for linear convergence of Polyak s heavy ball. 11

12 Theorem 2.5 Convergence bound in coninuous ime wih general α. Given f, k, γ, α, C α,γ saisfying Assumpions A. Le x, p be a soluion o he sysem 3 wih iniial saes x 0, p 0 = x, 0 where x R d. Le α = α3h 0, λ = 1 γcα,γ 4, and W : [0, [0, be he soluion of W = λ α2w W, wih W 0 := H 0 = fx 0 fx. Then for every [0,, we have fx fx 2H 0 exp λ α2w 2H 0 exp λα Proof. By 24 in assumpion A.4, he condiions of Lemma 2.3 hold, and by 15 and 17 we have x x, p kp /αkp + fx fx H αkp. 27 Insead of defining he Lyapunov funcion V exacly as in 14 we ake a ime-dependen β. Specifically, for every 0 le V be he unique soluion v of he equaion v = H + C α,γα2v 2 x x, p 28 in he inerval v [H /2, 3H /2]. To see why his equaion has a unique soluion in v [H /2, 3H /2], noe ha from 27 i follows ha and hence for any such v, we have α 2v x x, p H for every v H 2, H 2 H + C α,γα2v x x, p H. 29 This means ha for v = H 2, he lef hand side of 28 is smaller han he righ hand side, while for v = 3H 2, i is he oher way around. Now using 25 in assumpion A.4 and 27, we have C α,γ α 2V x x, p C α,γ α 2V 2V α2v Thus, by differeniaion, we can see ha 30 implies ha v H Cα,γ v 2 α2v x x, p > 0, < 1, 30 which implies ha 28 has a unique soluion V in [ H 2, 3H 2 ]. Le α = α2v and β = Cα,γ 2 α 2V. By he implici funcion heorem, i follows ha V is differeniable in. Morover, since V = H + C α,γα2v 2 for every 0, by differeniaing boh sides, we obain ha x x, p 31 V = γ β kp, p β γ x x, p β x x, fx + β x x, p 12

13 0 5 Objecive log fx log fx i 1 Soluion & Ieraes x x i log fx x 0 fx = x 4 /4 kp = 3p 4/3 / log fx x 0 fx = x 4 /4 kp = p 2 / ime ime Figure 4: Imporance of Assumpions A. Soluions x and ieraes x i of our firs explici mehod on fx = x 4 /4 wih wo differen choices of k. Noice ha f c p = 3p 4/3 /4 and hus kp = p 2 /2 canno be made o saisfy assumpion A.4. The firs hree erms are equivalen o he emporal derivaive of V wih consan β = β. Since α αkp and β γ, he assumpions of Lemma 2.4 are saisfied locally for α, β and we ge V λα, β, γv + β x x, p = λα, β, γv + C α,γ α x x, p V. Using 22 of Lemma 2.4 for α, β, we have λα, β, γ = β1 γ 1 β V β 1 γv + C α,γ α x x, p V. β 1 γ and Using 30 we have V β1 γ 2 V. Noice ha V 0 = H 0 since we have assumed ha p 0 = 0, and he claim of he lemma follows by Grönwall s inequaliy. The final inequaliy 26 follows from he fac ha α2v α3h 0 = α. 2.4 Parial Lower Bounds In his secion we consider a parial converse of Proposiion 2.5, showing in a simple seing ha if he assumpion kp α maxf c p, f c p of A.4 is violaed, hen he ODE 3 conracs sublinearly. Figure 4 considers he example fx = x 4 /4. If kp = p a /a, hen assumpions A canno 13

14 Two ypical pahs Unique fas pahs 1 momenum p posiion x posiion x 5 log fx ime ime Figure 5: Soluions o he Hamilonian descen sysem wih fx = x 4 /4 and kp = x 2 /2. The righ plos show a numerical approximaion of x η, p η and x η, p η. The lef plos show a numerical approximaion of x θ, p θ and x θ, p θ for θ = η +δ R, which represen ypical pahs. be saisfied for small p unless b 4/3. Figure 4 shows ha an inappropriae choice of kp = p 2 /2 leads o sub-linear convergence boh in coninuous ime and for one of he discreizaions of Secion 3. In conras, he choice of kp = 3p 4/3 /4 resuls in linear convergence, as expeced. Le b, a > 1 and γ > 0. For d = 1 dimension, wih he choice fx := x b /b and kp := p a /a, 3 akes he following form, x = p a 1 signp, 32 p = x b 1 signx γp. Since fx akes is minimum a 0, x, p are expeced o converge o 0, 0 as. There is a rivial soluion: x = p = 0 for every R. The following Lemma shows an exisence and uniqueness resul for his equaion. The proof is included in Secion B of he Appendix. Lemma 2.6 Exisence and uniqueness of soluions of he ODE. Le a, b, γ 0,. For every 0 R and x, p R 2, here is a unique soluion x, p R of he ODE 32 wih x 0 = x, p 0 = p. Eiher x = p = 0 for every R, or x, p 0, 0 for every R. 14

15 Noe ha if x, p is a soluion, and R, hen x +, p + is also a soluion ime ranslaion, and x, p is also a soluion cenral symmery. Noe also ha f p = f p = p b /b for b := 1 1 b 1. Hence if a b, or equivalenly, if 1 b + 1 a 1, he condiions of Proposiion 2.5 are saisfied for some α > 0 in paricular, if a = b, hen α = 1 independenly of x 0, p 0. Hence in such cases, he speed of convergence is linear. For a > b Kp, lim p 0 f p = 0, so he condiions of Proposiion 2.5 are violaed. Now we are ready o sae he main resul in his secion, a heorem characerizing he convergence speeds of x, p o 0, 0 in his siuaion. The proof is included in Secion B of he Appendix. Proposiion 2.7 Lower bounds on he convergence rae in coninuous ime. Suppose ha 1 b + 1 a < 1. For any θ R, we denoe by x θ, p θ he unique soluion of 32 wih x 0 = θ, p 0 = 0. Then here exiss a consan η 0, depending on a and b such ha he pah x η mirrored version x η, p η saisfy ha x η = x η Oexp α for every α < γa 1 as. For any pah x, p ha is no a ime ranslaion of x η, p η 1 x 1 = O ba b a as, so he speed of convergence is sub-linear and no linearly fas. or x η, p η, p η, we have and is Figure 5 illusraes he wo pahs where he convergence is linearly fas for a = 2, b = 4. The main idea in he proof of Proposiion 2.7 is ha we esablish he exisence of a class of rapping ses, i.e. once he pah of he ODE eners one of hem, i never escapes. Convergence raes wihin such ses can be shown o be logarihmic, and i is esablished ha only wo pahs which are symmeric wih respec o he origin avoid each one of he rapping ses, and hey have linear convergence rae. 3 Opimizaion Algorihms In his secion we consider hree discreizaions of he coninuous sysem 3, one implici and wo explici. For hese discreizaions we mus assume more abou he relaionship beween f and k. The implici mehod defines he ieraes as soluion of a local subproblem. The firs and second explici mehods are fully explici, and we mus again make sronger assumpions on f and k. The proofs of all of he resuls in his secion are given in Secion C of he Appendix. 3.1 Implici Mehod Consider he following discree approximaion x i, p i o he coninuous sysem, making he fixed ɛ > 0 finie difference approximaion, xi+1 xi ɛ = x and pi+1 pi ɛ = p, which approximaes he field a he forward poins. x i+1 x i = kp i+1 ɛ 33 p i+1 p i = γp i+1 fx i+1. ɛ 15

16 Since k kp = p, his sysem of equaions corresponds o he saionary condiion of he following subproblem ieraion, which we inroduce as our implici mehod. Implici Mehod. Given f, k : R d R, ɛ, γ 0,, x 0, p 0 R d. Le δ = 1 + γɛ 1 and { x i+1 = arg min ɛk x xi x R d ɛ + ɛδfx δ p i, x } p i+1 = δp i ɛδ fx i The following lemma shows ha he formulaion 34 is well defined. Secion C of he Appendix. The proof is included in Lemma 3.1 Well-definedness of he implici scheme. Suppose ha f and k saisfy assumpions A.1 and A.2, and ɛ, γ 0,. Then 34 has a unique soluion for every x i, p i R d, and his soluion also saisfies 33. As his discreizaion involves solving a poenially cosly subproblem a each ieraion, i requires a relaively ligh assumpion on he compaibiliy of f and k. Assumpions B. B.1 There exiss C f,k 0, such ha for all x, p R d, fx, kp C f,k Hx, p. 35 Remark 4. Smoohness of f implies 1 2 fx 2 2 Lfx fx see of Theorem of [41]. Thus, if f is smooh and kp = 1 2 p 2 2, hen he assumpion B.1 can be saisfied by C f,k = max{1, L}, since fx, kp 1 2 fx kp 2 2 Lfx fx + kp. The following proposiion shows a convergence resul for he implici scheme. Proposiion 3.2 Convergence bound for he implici scheme. Given f, k, γ, α, C α,γ, and 1 γ C f,k saisfying assumpions A and B. Suppose ha ɛ < 2 maxc f,k,1. Le α = α3h 0, and le W 0 = fx 0 fx and for i 0, W i+1 = W i [1 + ɛc α,γ 1 γ 2C f,k ɛα2w i /4] 1. Then for any x 0, p 0 wih p 0 = 0, he ieraes of 33 saisfy for every i 0, fx i fx 2W i 2W 0 [1 + ɛc α,γ 1 γ 2C f,k ɛα /4] i. 1 γ Remark 5. Proposiion 3.2 means ha we can fix any sep size 0 < ɛ < 2 maxc f,k,1 independenly of he iniial poin, and have linear convergence wih conracion rae ha is proporional o α3h 0 iniially and possibly increasing as we ge closer o he opimum. In Secion 4 we inroduce kineic 16

17 energies kp ha behave like p a 2 near 0 and p A 2 in he ails. We will show ha for funcions fx ha behave like x x b 2 near heir minima and x x B 2 in he ails he condiions of assumpions B are saisfied as long as 1 a + 1 b = 1 and 1 A + 1 B 1. In paricular, if we choose kp = p relaivisic kineic energy, hen a = 2 and A = 1, and assumpions B can be shown o hold for every f ha has quadraic behavior near is minimum and no faser han exponenial growh in he ails. 3.2 Firs Explici Mehod, wih Analysis via he Hessian of f The following discree approximaion x i, p i o he coninuous sysem makes a similar finie difference approximaion, i+1 x i x ɛ = x and pi+1 pi ɛ = p for ɛ > 0. In conras o he implici mehod, i approximaes he field a he poin x i, p i+1, making i fully explici wihou any cosly subproblem, x i+1 x i = kp i+1 ɛ p i+1 p i = γp i+1 fx i. ɛ This mehod can be rewrien as our firs explici mehod. Firs Explici Mehod. Given f, k : R d R, ɛ, γ 0,, x 0, p 0 R d. Le δ = 1 + γɛ 1 and p i+1 = δp i ɛδ fx i x i+1 = x i + ɛ kp i This discreizaion explois he convexiy of k by approximaing he coninuous dynamics a he forward poin p i+1, bu is made explici by approximaing a he backward poin x i. Because his mehod approximaes he field a he backward poin x i i requires a kind of smoohness assumpion o preven f from changing oo rapidly beween ieraes. This assumpion is in he form of a condiion on he Hessian of f, and hus we require wice differeniabiliy of f for he firs explici mehod. Because he accumulaion of gradiens of f in he form of p i are modulaed by k, his condiion in fac expresses a requiremen on he ineracion beween k and 2 f, see assumpion C.3. Assumpions C. C.1 There exiss C k 0, such ha for every p R d, kp, p C k kp. 37 C.2 f : R d R convex wih a unique minimum a x and wice coninuously differeniable for every x R d \ {x }. C.3 There exiss D f,k 0, such ha for every p R d, x R d \ {x }, kp, 2 fx kp D f,k α3hx, phx, p

18 Remark 6. If f smooh and wice differeniable hen v, 2 fxv is everywhere bounded by L for v R d such ha v 2 = 1 see Theorem of [41]. Thus, using kp = 1 2 p 2 2, his allows us o saisfy assumpion C.3 wih D f,k = max{1, 2L}, since kp, 2 fx kp L kp 2 2 = 2Lkp fx fx + 2Lkp. Assumpion C.1 is clearly saisfied in his case by C k = 2. The following lemma shows a convergence resul for his discreizaion. Proposiion 3.3 Convergence bound for he firs explici scheme. Given f, k, γ, α, C α,γ, C f,k, C k, D f,k saisfying assumpions A, B, and C, and ha 0 < ɛ < min Le α = α3h 0, W 0 := fx 0 fx, and for i 0, le 1 γ 2 maxc f,k +6D f,k /C α,γ,1, C α,γ W i+1 = W i 1 + ɛc 1 α,γ [1 γ 2ɛC f,k + 6D f,k /C α,γ ] α2w i. 4 Then for any x 0, p 0 wih p 0 = 0, he ieraes 36 saisfy for every i 0, fx i fx 2W i 2W ɛc i α,γ [1 γ 2ɛC f,k + 6D f,k /C α,γ ] α. 4 10C f,k +5γC k. Remark 7. Similar o Remark 5, Proposiion 3.3 implies ha, under suiable assumpions and posiion independen sep sizes, he firs explici mehod can achieve linear convergence wih conracion rae ha is proporional o α3h 0 iniially and possibly increasing as we ge closer o he opimum. In paricular, again as remarked in Remark 5, for fx ha behave like x x b 2 near heir minima and x x B 2 in he ails he condiions of assumpions C can be saisfied for kineic energies ha grow like p a 2 in he body and p A 2 in he ails as long as 1 a + 1 b = 1, 1 A + 1 B 1. The disincion here is ha for he firs explici mehod we will require b, B Second Explici Mehod, wih Analysis via he Hessian of k Our second explici mehod invers relaionship beween f and k from he firs. Again, i makes a fixed ɛ sep approximaion xi+1 xi ɛ = x and pi+1 pi ɛ = p. In conras o he implici 33 and firs explici 36 mehods, i approximaes he field a he poin x i+1, p i. Second Explici Mehod. Given f, k : R d R, ɛ, γ 0,, x 0, p 0 R d. Le, x i+1 = x i + ɛ kp i p i+1 = 1 ɛγp i ɛ fx i This discreizaion explois he convexiy of f by approximaing he coninuous dynamics a he forward poin x i+1, bu is made explici by approximaing a he backward poin p i. As wih he oher explici mehod, i requires a smoohness assumpion o preven k from changing oo rapidly beween ieraes, which is expressed as a requiremen on he ineracion beween f and 2 k, see assumpion D.5. These assumpions can be saisfied for k ha have quadraic or higher power growh and are suiable for f ha may have unbounded second derivaives a heir minima for such f, Assumpions C can no hold. 18

19 Assumpions D. D.1 k : R d R sricly convex wih minimum k0 = 0 and wice coninuously differeniable for every p R d \ {0}. D.2 There exiss C k 0, such ha for every p R d, kp, p C k kp. 40 D.3 There exiss D k 0, such ha for every p R d \ {0}, p, 2 kpp D k kp. 41 D.4 There exiss E k, F k 0, such ha for every p, q R d, kp kq E k kq + F k kp kq, p q. 42 D.5 There exiss D f,k 0, such ha for every x R d, p R d \ {0}, fx, 2 kp fx D f,k α3hx, phx, p. 43 Remark 8. Smoohness of f implies 1 2 fx 2 2 Lfx fx see of Theorem of [41]. Thus, if f is smooh and kp = 1 2 p 2 2, hen he assumpion D.5 can be saisfied by D f,k = max{1, 2L}, since 2 kp = I and fx, 2 kp fx = fx 2 2 2Lfx fx 2Lfx fx + kp. The k-specific assumpions D.2 and D.3 can clearly be saisfied wih C k = D k = 2 in his case. We show ha D.4 can be saisfied in Secion 4. Proposiion 3.4 Convergence bound for he second explici scheme. Given f, k, γ, α, C α,γ, C f,k, C k, D k, D f,k, E k, F k saisfying assumpions A, B, and D, and ha 0 < ɛ < min 1 γ 2C f,k + 6D f,k /C α,γ, 1 γ 8D k 1 + E k, C α,γ 1, 65C f,k + 2γC k + 12γC α,γ 6γ 2. D k F k Le α = α3h 0, W 0 := fx 0 fx, and for i 0, le W i+1 = W i 1 ɛc α,γ [1 γ 2ɛC f,k + 6D f,k /C α,γ ] α2w i. 4 Then for any x 0, p 0 wih p 0 = 0, he ieraes 39 saisfy for every i 0, fx i fx 2W i 2W 0 1 ɛc i α,γ [1 γ 2ɛC f,k + 6D f,k /C α,γ ] α. 4 19

20 0 5 Objecive log fx log fx i 1 Soluion & Ieraes x x i log fx x 0 fx = x 4 /4 kp = p 8/7 7/ Figure 6: Imporance of discreizaion assumpions. Soluions x and ieraes x i of our firs explici mehod on fx = x 4 /4. Wih an inappropriae choice of kineic energy, kp = p 8/7 7/8, he coninuous soluion converges a a linear rae bu he ieraes do no. Remark 9. Similar o Remark 5, Proposiion 3.4 implies ha, under suiable assumpions and for a fixed sep size independen of he iniial poin, he second explici mehod can achieve linear convergence wih conracion rae ha is proporional o α3h 0 iniially and possibly increasing as we ge closer o he opimum. In paricular, again as remarked in Remark 5, for fx ha behave like x x b 2 near heir minima and x x B 2 in he ails he condiions of assumpions D can be saisfied for kineic energies ha grow like p a 2 in he body and p A 2 in he ails as long as 1 a + 1 b = 1, 1 A + 1 B 1. The disincion here is ha for he second explici mehod we will require b, B 2. To conclude he analysis of our mehods on convex funcions, consider he example fx = x 4 /4 from Figure 4. If we ake kp = p a /a, hen assumpion A.4 requires ha a 4/3. Assumpions B and C canno be saisfied as long as a < 4/3, which suggess ha kp = f p is he only suiable choice in his case. Indeed, in Figure 6, we see ha he choice of kp = p 8/7 7/8 resuls in a sysem whose coninuous dynamics converge a a linear rae and whose discree dynamics fail o converge. Noe ha as he coninuous sysems converge he oscillaion frequency increases dramaically, making i difficul for a fixed sep size scheme o approximae. 3.4 Firs Explici Mehod on Non-Convex f We close his secion wih a brief analysis of he convergence of he firs explici mehod on nonconvex f. A radiional requiremen of discreizaions is some degree of smoohness o preven he funcion changing oo rapidly beween poins of approximaion. The noion of Lipschiz smoohness is he sandard one, bu he use of he kineic map k o selec ieraes allows Hamilonian descen mehods o consider he broader definiion of uniform smoohness, as discussed in [60, 2, 61] bu specialized here for our purposes. Uniform smoohness is defined by a norm and a convex non-decreasing funcion σ : [0, [0, ] such ha σ0 = 0. A funcion f : R d R is σ-uniformly smooh, if for all x, y R d, fy fx + fx, y x + σ y x

21 Lipschiz smoohness corresponds o σ = 1 2 2, and generally speaking here exis non-rivial uniformly smooh funcions for σ = 1 b b for 1 < b 2, see, e.g., [40, 60, 2, 61]. Assumpions E. E.1 f : R d R differeniable. E.2 γ 0,. E.3 There exiss a norm on R d, b 1,, D k 0,, D f 0,, σ : [0, [0, ] non-decreasing convex such ha σ0 = 0 and σc c b σ for c, 0, ; for all p R d, σ kp D k kp; 45 and for all x, y R d, fy fy + fx, y x + D f σ y x. 46 Lemma 3.5 Convergence of he firs explici scheme wihou convexiy. Given, f, k, γ, b, D k, D f, σ saisfying assumpions E and A.2. If ɛ 0, b 1 γ/d f D k ], hen he ieraes 36 of he firs explici mehod saisfy and fx i 2 0. H i+1 H i ɛ b D f D k ɛγkp i+1 0, 47 Remark 10. L-Lipschiz coninuiy of he gradiens fx fy 2 L x y 2 for L > 0 wih Euclidean norm 2 implies boh fy fy+ fx, y x + L 2 y x 2 2 and 1 2 fx 2 2 Lfx fx. Thus, if f, k are L f, L k smooh, respecively, hen he condiion for convergence simplifies o ɛ γ/l f L k. 4 Kineic Maps for Funcions wih Power Behavior In his secion we design a family of kineic maps k suiable for a class of funcions f ha exhibi power growh, which we will describe precisely as a se of assumpions. This class includes srongly convex and smooh funcions. However, i is much broader, including funcions wih possibly nonquadraic power behavior and singular or unbounded Hessians. Firs, we show ha his family of kineic energies saisfies he k-specific assumpions of Secion 3. Then we use he generic analysis of Secion 3 o provide a specific se of assumpions on fs and heir mach o he choice of k. As a consequence, his analysis grealy exends he class of funcions for which linear convergence is possible wih fixed sep size firs order compuaion. Sill, his analysis is no mean o be an exhausive caalogue of possible kineic energies for Hamilonian descen. Insead, i serves as an example of how known properies of f can be used o design k. Noe ha, wih a few excepions, he proofs of all of our resuls in his secion are deferred o Secion D of he Appendix. 21

22 4 ' A a x wih a =8/7 ' A a x wih a =2 ' A a x wih a =8 ' A a x x x A =8/7 A =2 A = x Figure 7: Power kineic energies in one dimension. 4.1 Power Kineic Energies We assume a given norm x and is dual p = sup{ x, p : x 1} for x, p R d. Define he family of power kineic energies k, kp = ϕ A a p where ϕ A a = 1 A a + 1 A a 1 A for [0, and a, A [1,. 48 For a = A we recover he sandard power funcions, ϕ a a = a /a. For disinc a A, we have ϕ A a A 1 for large and ϕ A a a 1 for small. Thus, kp p A /A as p and kp p a /a as p 0. See Figure 7 for examples from his family in one dimension. Broadly speaking, his family of kineic energies mus be mached in a conjugae fashion o he body and ail behavior of f. Informally, for his choice of k we will require condiions on f ha correspond o requiring ha i grows like x x b in he body as x x 0 and x x B in he ails as x x for some b, B 1,. In paricular, our growh condiions in he case of f growing like x 2 2 = x, x everywhere will be necessary condiions of srong convexiy and smoohness. More generally, a, A, b, B will be well-mached if 1/a+1/b = 1/A+1/B = 1, bu oher scenarios are possible. Of hese, he conjugae relaionship beween a and b is he mos criical; i capures he asympoic mach beween f and k as x i, p i x, 0, and our analysis requires ha 1/a + 1/b = 1. The mach beween A and B is less criical. In he ideal case, B is known and A = B/B 1. In his case, he discreizaions will converge a a consan fas linear rae. If B is no known, i suffices for 1/A + 1/B 1. The consequence of underesimaing A < B/B 1 will be refleced in a linear, bu non-consan, rae of convergence via α of Assumpion A.4, which depends on he iniial x 0 and slowly improves owards a fas rae as he sysem converges and he regime swiches. We presen a complee analysis and se of condiions on f for wo of he mos useful scenarios. In Proposiion 4.4 we consider he case ha f grows like ϕ B b x x where b, B > 1 are exacly known. In his case convergence proceeds a a fas consan linear rae when mached wih kp = ϕ A a p where a = b/b 1 and A = B/B 1. In Proposiion 4.5 we consider he case ha f grows like ϕ B 2 x x where B 2 is unknown. Here, he convergence is linear wih a non-consan rae when mached wih he relaivisic kineic energy kp = ϕ 1 2 p. The case covered by relaivisic kineic k is paricularly valuable, as i covers a large class of globally non-smooh, bu srongly convex funcions. Table 1 summarizes his, and hroughou he remaining subsecions we flesh ou he deails of hese claims. 22

23 fx grows like ϕ B b x appropriae kp = ϕa a p mehod powers known? body power b ail power B body power a ail power A implici known b > 1 B > 1 a = b/b 1 A = B/B 1 unknown b = 2 B 2 a = 2 A = 1 1s explici known b 2 B 2 a = b/b 1 A = B/B 1 unknown b = 2 B 2 a = 2 A = 1 2nd explici known 1 < b 2 1 < B 2 a = b/b 1 A = B/B 1 Table 1: A summary of he condiions on f and power kineic k considered in his secion ha saisfy he assumpions of Secion 3. Here grows like is an imprecise erm meaning ha f s growh can be bounded in an appropriae way by ϕ B b x ϕb b is defined in 48. The full precise assumpions on f are laid ou in Proposiions 4.4 and 4.5. In paricular, b = B = 2 corresponds o assumpions similar in spiri o srong convexiy and smoohness. Oher combinaions of b, B and a, A are possible. For hese kineic energies o be suiable in our analysis, hey mus a minimum saisfy assumpions A.2, C.1, D.1, D.3, and D.4. Assumpions C.1 and D.3 are clearly saisfied by kp = p a /a for p R wih consans C k = a and D k = aa 1. In he remainder of his subsecion, we provide condiions on he norms and a, A under which assumpions like hese hold for ϕ A a wih muliple power behavior in any finie dimension. In general, he problemaic erms of kp and 2 kp ha arise in high dimensions involve he gradien and Hessian of he norm. The gradien of norm can be deal wih cleanly, bu our analysis requires addiional conrol on he Hessian of he norm. To conrol erms involving 2 p we define a generalizaion of he maximum eigenvalue induced by he norm. Le λ max : R d d R be he funcion defined by λ maxm = sup{ v, Mv : v R d, v = 1}. 49 For symmeric M R d d and Euclidean his is exacly he maximum eigenvalue of M. Now we are able o sae our lemma analyzing power kineic energies. Lemma 4.1 Verifying assumpions on k. Given a norm p on p R d, a, A [1,, and ϕ A a in 48. Define he consan, C a,a = 1 a 1 A 1 A 1 B b b A a + a 1 A a A a 1 kp = ϕ A a p saisfies he following. 1. Convexiy. If a > 1 or A > 1, hen k is sricly convex wih a unique minimum a 0 R d. 2. Conjugae. For all x R d, k x = ϕ A a x. 23

24 3. Gradien. If p is differeniable a p R d \ {0} and a > 1, hen k is differeniable for all p R d, and for all p R d, kp, p max{a, A}kp, 51 ϕ A a kp max{a, A} 1kp. 52 Addiionally, if a, A > 1, define B = A/A 1, b = a/a 1, and hen ϕ B b kp C a,a max{a, A} 1kp. 53 Addiionally, if a, A 2, hen for all p, q R d, kp kq, q + kp kq, p q Hessian. If p is wice coninuously differeniable a p R d \ {0}, hen k is wice coninuously differeniable for all p R d \ {0}, and for all p R d \ {0}, p, 2 kpp max{a, A}max{a, A} 1kp. 55 Addiionally, if a, A 2 and here exiss N [0, such ha p λ max 2 p N for p R d \ {0}, hen for all p R d \ {0} ϕ A/2 a/2 λ max 2 kp max{a, A} 1 + N max{a, A} 2kp. 56 Remark , 54, and 55 ogeher direcly confirm ha hese k saisfy C.1, D.3, and D.4 wih consans C k = max{a, A}, D k = max{a, A}max{a, A} 1, E k = max{a, A} 1, and F k = 1. The oher resuls 52, 53, and 56 will be used in subsequen lemmas along wih assumpions on f o saisfy he remaining assumpions of discreizaion. The assumpion ha p λ max 2 p N in Lemma 4.1 is saisfied by b-norms for b [2,, as he following lemma confirms. I implies ha if p = p b for b 2, we can ake N = b 1 in 56. Lemma 4.2 Bounds on λ max 2 p for b-norms. Given b [2,, le x b = for x R d. Then for x R d \ {0}, x b λ b max 2 x b b 1. d n=1 xn b 1/b The remaining assumpions B.1, C.3, and D.5 involve inner producs beween derivaives of f and k. To conrol hese erms we will use he Fenchel-Young inequaliy. To his end, he conjugaes of ϕ A a will be a crucial componen of our analysis. Lemma 4.3 Convex conjugaes of ϕ A a. Given a, A 1, and ϕ A a in 48. Define B = A/A 1, b = a/a 1. The following hold. 1. Near Conjugae. ϕ B b upper bounds he conjugae ϕa a for all [0,, ϕ A a ϕ B b

Course Notes for EE227C (Spring 2018): Convex Optimization and Approximation

Course Notes for EE227C (Spring 2018): Convex Optimization and Approximation Course Noes for EE7C Spring 018: Convex Opimizaion and Approximaion Insrucor: Moriz Hard Email: hard+ee7c@berkeley.edu Graduae Insrucor: Max Simchowiz Email: msimchow+ee7c@berkeley.edu Ocober 15, 018 3

More information

A Primal-Dual Type Algorithm with the O(1/t) Convergence Rate for Large Scale Constrained Convex Programs

A Primal-Dual Type Algorithm with the O(1/t) Convergence Rate for Large Scale Constrained Convex Programs PROC. IEEE CONFERENCE ON DECISION AND CONTROL, 06 A Primal-Dual Type Algorihm wih he O(/) Convergence Rae for Large Scale Consrained Convex Programs Hao Yu and Michael J. Neely Absrac This paper considers

More information

Physics 235 Chapter 2. Chapter 2 Newtonian Mechanics Single Particle

Physics 235 Chapter 2. Chapter 2 Newtonian Mechanics Single Particle Chaper 2 Newonian Mechanics Single Paricle In his Chaper we will review wha Newon s laws of mechanics ell us abou he moion of a single paricle. Newon s laws are only valid in suiable reference frames,

More information

Optimality Conditions for Unconstrained Problems

Optimality Conditions for Unconstrained Problems 62 CHAPTER 6 Opimaliy Condiions for Unconsrained Problems 1 Unconsrained Opimizaion 11 Exisence Consider he problem of minimizing he funcion f : R n R where f is coninuous on all of R n : P min f(x) x

More information

Chapter 2. First Order Scalar Equations

Chapter 2. First Order Scalar Equations Chaper. Firs Order Scalar Equaions We sar our sudy of differenial equaions in he same way he pioneers in his field did. We show paricular echniques o solve paricular ypes of firs order differenial equaions.

More information

EXERCISES FOR SECTION 1.5

EXERCISES FOR SECTION 1.5 1.5 Exisence and Uniqueness of Soluions 43 20. 1 v c 21. 1 v c 1 2 4 6 8 10 1 2 2 4 6 8 10 Graph of approximae soluion obained using Euler s mehod wih = 0.1. Graph of approximae soluion obained using Euler

More information

Vehicle Arrival Models : Headway

Vehicle Arrival Models : Headway Chaper 12 Vehicle Arrival Models : Headway 12.1 Inroducion Modelling arrival of vehicle a secion of road is an imporan sep in raffic flow modelling. I has imporan applicaion in raffic flow simulaion where

More information

Finish reading Chapter 2 of Spivak, rereading earlier sections as necessary. handout and fill in some missing details!

Finish reading Chapter 2 of Spivak, rereading earlier sections as necessary. handout and fill in some missing details! MAT 257, Handou 6: Ocober 7-2, 20. I. Assignmen. Finish reading Chaper 2 of Spiva, rereading earlier secions as necessary. handou and fill in some missing deails! II. Higher derivaives. Also, read his

More information

Matrix Versions of Some Refinements of the Arithmetic-Geometric Mean Inequality

Matrix Versions of Some Refinements of the Arithmetic-Geometric Mean Inequality Marix Versions of Some Refinemens of he Arihmeic-Geomeric Mean Inequaliy Bao Qi Feng and Andrew Tonge Absrac. We esablish marix versions of refinemens due o Alzer ], Carwrigh and Field 4], and Mercer 5]

More information

Inventory Analysis and Management. Multi-Period Stochastic Models: Optimality of (s, S) Policy for K-Convex Objective Functions

Inventory Analysis and Management. Multi-Period Stochastic Models: Optimality of (s, S) Policy for K-Convex Objective Functions Muli-Period Sochasic Models: Opimali of (s, S) Polic for -Convex Objecive Funcions Consider a seing similar o he N-sage newsvendor problem excep ha now here is a fixed re-ordering cos (> 0) for each (re-)order.

More information

Lecture 20: Riccati Equations and Least Squares Feedback Control

Lecture 20: Riccati Equations and Least Squares Feedback Control 34-5 LINEAR SYSTEMS Lecure : Riccai Equaions and Leas Squares Feedback Conrol 5.6.4 Sae Feedback via Riccai Equaions A recursive approach in generaing he marix-valued funcion W ( ) equaion for i for he

More information

IMPLICIT AND INVERSE FUNCTION THEOREMS PAUL SCHRIMPF 1 OCTOBER 25, 2013

IMPLICIT AND INVERSE FUNCTION THEOREMS PAUL SCHRIMPF 1 OCTOBER 25, 2013 IMPLICI AND INVERSE FUNCION HEOREMS PAUL SCHRIMPF 1 OCOBER 25, 213 UNIVERSIY OF BRIISH COLUMBIA ECONOMICS 526 We have exensively sudied how o solve sysems of linear equaions. We know how o check wheher

More information

An introduction to the theory of SDDP algorithm

An introduction to the theory of SDDP algorithm An inroducion o he heory of SDDP algorihm V. Leclère (ENPC) Augus 1, 2014 V. Leclère Inroducion o SDDP Augus 1, 2014 1 / 21 Inroducion Large scale sochasic problem are hard o solve. Two ways of aacking

More information

3.1.3 INTRODUCTION TO DYNAMIC OPTIMIZATION: DISCRETE TIME PROBLEMS. A. The Hamiltonian and First-Order Conditions in a Finite Time Horizon

3.1.3 INTRODUCTION TO DYNAMIC OPTIMIZATION: DISCRETE TIME PROBLEMS. A. The Hamiltonian and First-Order Conditions in a Finite Time Horizon 3..3 INRODUCION O DYNAMIC OPIMIZAION: DISCREE IME PROBLEMS A. he Hamilonian and Firs-Order Condiions in a Finie ime Horizon Define a new funcion, he Hamilonian funcion, H. H he change in he oal value of

More information

10. State Space Methods

10. State Space Methods . Sae Space Mehods. Inroducion Sae space modelling was briefly inroduced in chaper. Here more coverage is provided of sae space mehods before some of heir uses in conrol sysem design are covered in he

More information

t is a basis for the solution space to this system, then the matrix having these solutions as columns, t x 1 t, x 2 t,... x n t x 2 t...

t is a basis for the solution space to this system, then the matrix having these solutions as columns, t x 1 t, x 2 t,... x n t x 2 t... Mah 228- Fri Mar 24 5.6 Marix exponenials and linear sysems: The analogy beween firs order sysems of linear differenial equaions (Chaper 5) and scalar linear differenial equaions (Chaper ) is much sronger

More information

Chapter 3 Boundary Value Problem

Chapter 3 Boundary Value Problem Chaper 3 Boundary Value Problem A boundary value problem (BVP) is a problem, ypically an ODE or a PDE, which has values assigned on he physical boundary of he domain in which he problem is specified. Le

More information

Hamilton- J acobi Equation: Weak S olution We continue the study of the Hamilton-Jacobi equation:

Hamilton- J acobi Equation: Weak S olution We continue the study of the Hamilton-Jacobi equation: M ah 5 7 Fall 9 L ecure O c. 4, 9 ) Hamilon- J acobi Equaion: Weak S oluion We coninue he sudy of he Hamilon-Jacobi equaion: We have shown ha u + H D u) = R n, ) ; u = g R n { = }. ). In general we canno

More information

Essential Microeconomics : OPTIMAL CONTROL 1. Consider the following class of optimization problems

Essential Microeconomics : OPTIMAL CONTROL 1. Consider the following class of optimization problems Essenial Microeconomics -- 6.5: OPIMAL CONROL Consider he following class of opimizaion problems Max{ U( k, x) + U+ ( k+ ) k+ k F( k, x)}. { x, k+ } = In he language of conrol heory, he vecor k is he vecor

More information

Chapter 6. Systems of First Order Linear Differential Equations

Chapter 6. Systems of First Order Linear Differential Equations Chaper 6 Sysems of Firs Order Linear Differenial Equaions We will only discuss firs order sysems However higher order sysems may be made ino firs order sysems by a rick shown below We will have a sligh

More information

STATE-SPACE MODELLING. A mass balance across the tank gives:

STATE-SPACE MODELLING. A mass balance across the tank gives: B. Lennox and N.F. Thornhill, 9, Sae Space Modelling, IChemE Process Managemen and Conrol Subjec Group Newsleer STE-SPACE MODELLING Inroducion: Over he pas decade or so here has been an ever increasing

More information

arxiv: v1 [math.fa] 9 Dec 2018

arxiv: v1 [math.fa] 9 Dec 2018 AN INVERSE FUNCTION THEOREM CONVERSE arxiv:1812.03561v1 [mah.fa] 9 Dec 2018 JIMMIE LAWSON Absrac. We esablish he following converse of he well-known inverse funcion heorem. Le g : U V and f : V U be inverse

More information

Expert Advice for Amateurs

Expert Advice for Amateurs Exper Advice for Amaeurs Ernes K. Lai Online Appendix - Exisence of Equilibria The analysis in his secion is performed under more general payoff funcions. Wihou aking an explici form, he payoffs of he

More information

Lecture 2-1 Kinematics in One Dimension Displacement, Velocity and Acceleration Everything in the world is moving. Nothing stays still.

Lecture 2-1 Kinematics in One Dimension Displacement, Velocity and Acceleration Everything in the world is moving. Nothing stays still. Lecure - Kinemaics in One Dimension Displacemen, Velociy and Acceleraion Everyhing in he world is moving. Nohing says sill. Moion occurs a all scales of he universe, saring from he moion of elecrons in

More information

MATH 5720: Gradient Methods Hung Phan, UMass Lowell October 4, 2018

MATH 5720: Gradient Methods Hung Phan, UMass Lowell October 4, 2018 MATH 5720: Gradien Mehods Hung Phan, UMass Lowell Ocober 4, 208 Descen Direcion Mehods Consider he problem min { f(x) x R n}. The general descen direcions mehod is x k+ = x k + k d k where x k is he curren

More information

T L. t=1. Proof of Lemma 1. Using the marginal cost accounting in Equation(4) and standard arguments. t )+Π RB. t )+K 1(Q RB

T L. t=1. Proof of Lemma 1. Using the marginal cost accounting in Equation(4) and standard arguments. t )+Π RB. t )+K 1(Q RB Elecronic Companion EC.1. Proofs of Technical Lemmas and Theorems LEMMA 1. Le C(RB) be he oal cos incurred by he RB policy. Then we have, T L E[C(RB)] 3 E[Z RB ]. (EC.1) Proof of Lemma 1. Using he marginal

More information

2. Nonlinear Conservation Law Equations

2. Nonlinear Conservation Law Equations . Nonlinear Conservaion Law Equaions One of he clear lessons learned over recen years in sudying nonlinear parial differenial equaions is ha i is generally no wise o ry o aack a general class of nonlinear

More information

Ordinary Differential Equations

Ordinary Differential Equations Ordinary Differenial Equaions 5. Examples of linear differenial equaions and heir applicaions We consider some examples of sysems of linear differenial equaions wih consan coefficiens y = a y +... + a

More information

Two Coupled Oscillators / Normal Modes

Two Coupled Oscillators / Normal Modes Lecure 3 Phys 3750 Two Coupled Oscillaors / Normal Modes Overview and Moivaion: Today we ake a small, bu significan, sep owards wave moion. We will no ye observe waves, bu his sep is imporan in is own

More information

23.2. Representing Periodic Functions by Fourier Series. Introduction. Prerequisites. Learning Outcomes

23.2. Representing Periodic Functions by Fourier Series. Introduction. Prerequisites. Learning Outcomes Represening Periodic Funcions by Fourier Series 3. Inroducion In his Secion we show how a periodic funcion can be expressed as a series of sines and cosines. We begin by obaining some sandard inegrals

More information

Hamilton- J acobi Equation: Explicit Formulas In this lecture we try to apply the method of characteristics to the Hamilton-Jacobi equation: u t

Hamilton- J acobi Equation: Explicit Formulas In this lecture we try to apply the method of characteristics to the Hamilton-Jacobi equation: u t M ah 5 2 7 Fall 2 0 0 9 L ecure 1 0 O c. 7, 2 0 0 9 Hamilon- J acobi Equaion: Explici Formulas In his lecure we ry o apply he mehod of characerisics o he Hamilon-Jacobi equaion: u + H D u, x = 0 in R n

More information

Convergence of the Neumann series in higher norms

Convergence of the Neumann series in higher norms Convergence of he Neumann series in higher norms Charles L. Epsein Deparmen of Mahemaics, Universiy of Pennsylvania Version 1.0 Augus 1, 003 Absrac Naural condiions on an operaor A are given so ha he Neumann

More information

Lecture 9: September 25

Lecture 9: September 25 0-725: Opimizaion Fall 202 Lecure 9: Sepember 25 Lecurer: Geoff Gordon/Ryan Tibshirani Scribes: Xuezhi Wang, Subhodeep Moira, Abhimanu Kumar Noe: LaTeX emplae couresy of UC Berkeley EECS dep. Disclaimer:

More information

RANDOM LAGRANGE MULTIPLIERS AND TRANSVERSALITY

RANDOM LAGRANGE MULTIPLIERS AND TRANSVERSALITY ECO 504 Spring 2006 Chris Sims RANDOM LAGRANGE MULTIPLIERS AND TRANSVERSALITY 1. INTRODUCTION Lagrange muliplier mehods are sandard fare in elemenary calculus courses, and hey play a cenral role in economic

More information

Notes on online convex optimization

Notes on online convex optimization Noes on online convex opimizaion Karl Sraos Online convex opimizaion (OCO) is a principled framework for online learning: OnlineConvexOpimizaion Inpu: convex se S, number of seps T For =, 2,..., T : Selec

More information

15. Vector Valued Functions

15. Vector Valued Functions 1. Vecor Valued Funcions Up o his poin, we have presened vecors wih consan componens, for example, 1, and,,4. However, we can allow he componens of a vecor o be funcions of a common variable. For example,

More information

Simulation-Solving Dynamic Models ABE 5646 Week 2, Spring 2010

Simulation-Solving Dynamic Models ABE 5646 Week 2, Spring 2010 Simulaion-Solving Dynamic Models ABE 5646 Week 2, Spring 2010 Week Descripion Reading Maerial 2 Compuer Simulaion of Dynamic Models Finie Difference, coninuous saes, discree ime Simple Mehods Euler Trapezoid

More information

1 Review of Zero-Sum Games

1 Review of Zero-Sum Games COS 5: heoreical Machine Learning Lecurer: Rob Schapire Lecure #23 Scribe: Eugene Brevdo April 30, 2008 Review of Zero-Sum Games Las ime we inroduced a mahemaical model for wo player zero-sum games. Any

More information

Technical Report Doc ID: TR March-2013 (Last revision: 23-February-2016) On formulating quadratic functions in optimization models.

Technical Report Doc ID: TR March-2013 (Last revision: 23-February-2016) On formulating quadratic functions in optimization models. Technical Repor Doc ID: TR--203 06-March-203 (Las revision: 23-Februar-206) On formulaing quadraic funcions in opimizaion models. Auhor: Erling D. Andersen Convex quadraic consrains quie frequenl appear

More information

Boundedness and Exponential Asymptotic Stability in Dynamical Systems with Applications to Nonlinear Differential Equations with Unbounded Terms

Boundedness and Exponential Asymptotic Stability in Dynamical Systems with Applications to Nonlinear Differential Equations with Unbounded Terms Advances in Dynamical Sysems and Applicaions. ISSN 0973-531 Volume Number 1 007, pp. 107 11 Research India Publicaions hp://www.ripublicaion.com/adsa.hm Boundedness and Exponenial Asympoic Sabiliy in Dynamical

More information

The Asymptotic Behavior of Nonoscillatory Solutions of Some Nonlinear Dynamic Equations on Time Scales

The Asymptotic Behavior of Nonoscillatory Solutions of Some Nonlinear Dynamic Equations on Time Scales Advances in Dynamical Sysems and Applicaions. ISSN 0973-5321 Volume 1 Number 1 (2006, pp. 103 112 c Research India Publicaions hp://www.ripublicaion.com/adsa.hm The Asympoic Behavior of Nonoscillaory Soluions

More information

Linear Response Theory: The connection between QFT and experiments

Linear Response Theory: The connection between QFT and experiments Phys540.nb 39 3 Linear Response Theory: The connecion beween QFT and experimens 3.1. Basic conceps and ideas Q: How do we measure he conduciviy of a meal? A: we firs inroduce a weak elecric field E, and

More information

Class Meeting # 10: Introduction to the Wave Equation

Class Meeting # 10: Introduction to the Wave Equation MATH 8.5 COURSE NOTES - CLASS MEETING # 0 8.5 Inroducion o PDEs, Fall 0 Professor: Jared Speck Class Meeing # 0: Inroducion o he Wave Equaion. Wha is he wave equaion? The sandard wave equaion for a funcion

More information

EXPLICIT TIME INTEGRATORS FOR NONLINEAR DYNAMICS DERIVED FROM THE MIDPOINT RULE

EXPLICIT TIME INTEGRATORS FOR NONLINEAR DYNAMICS DERIVED FROM THE MIDPOINT RULE Version April 30, 2004.Submied o CTU Repors. EXPLICIT TIME INTEGRATORS FOR NONLINEAR DYNAMICS DERIVED FROM THE MIDPOINT RULE Per Krysl Universiy of California, San Diego La Jolla, California 92093-0085,

More information

4. Advanced Stability Theory

4. Advanced Stability Theory Applied Nonlinear Conrol Nguyen an ien - 4 4 Advanced Sabiliy heory he objecive of his chaper is o presen sabiliy analysis for non-auonomous sysems 41 Conceps of Sabiliy for Non-Auonomous Sysems Equilibrium

More information

(1) (2) Differentiation of (1) and then substitution of (3) leads to. Therefore, we will simply consider the second-order linear system given by (4)

(1) (2) Differentiation of (1) and then substitution of (3) leads to. Therefore, we will simply consider the second-order linear system given by (4) Phase Plane Analysis of Linear Sysems Adaped from Applied Nonlinear Conrol by Sloine and Li The general form of a linear second-order sysem is a c b d From and b bc d a Differeniaion of and hen subsiuion

More information

14 Autoregressive Moving Average Models

14 Autoregressive Moving Average Models 14 Auoregressive Moving Average Models In his chaper an imporan parameric family of saionary ime series is inroduced, he family of he auoregressive moving average, or ARMA, processes. For a large class

More information

PENALIZED LEAST SQUARES AND PENALIZED LIKELIHOOD

PENALIZED LEAST SQUARES AND PENALIZED LIKELIHOOD PENALIZED LEAST SQUARES AND PENALIZED LIKELIHOOD HAN XIAO 1. Penalized Leas Squares Lasso solves he following opimizaion problem, ˆβ lasso = arg max β R p+1 1 N y i β 0 N x ij β j β j (1.1) for some 0.

More information

An Introduction to Malliavin calculus and its applications

An Introduction to Malliavin calculus and its applications An Inroducion o Malliavin calculus and is applicaions Lecure 5: Smoohness of he densiy and Hörmander s heorem David Nualar Deparmen of Mahemaics Kansas Universiy Universiy of Wyoming Summer School 214

More information

18 Biological models with discrete time

18 Biological models with discrete time 8 Biological models wih discree ime The mos imporan applicaions, however, may be pedagogical. The elegan body of mahemaical heory peraining o linear sysems (Fourier analysis, orhogonal funcions, and so

More information

On Measuring Pro-Poor Growth. 1. On Various Ways of Measuring Pro-Poor Growth: A Short Review of the Literature

On Measuring Pro-Poor Growth. 1. On Various Ways of Measuring Pro-Poor Growth: A Short Review of the Literature On Measuring Pro-Poor Growh 1. On Various Ways of Measuring Pro-Poor Growh: A Shor eview of he Lieraure During he pas en years or so here have been various suggesions concerning he way one should check

More information

Supplement for Stochastic Convex Optimization: Faster Local Growth Implies Faster Global Convergence

Supplement for Stochastic Convex Optimization: Faster Local Growth Implies Faster Global Convergence Supplemen for Sochasic Convex Opimizaion: Faser Local Growh Implies Faser Global Convergence Yi Xu Qihang Lin ianbao Yang Proof of heorem heorem Suppose Assumpion holds and F (w) obeys he LGC (6) Given

More information

4 Sequences of measurable functions

4 Sequences of measurable functions 4 Sequences of measurable funcions 1. Le (Ω, A, µ) be a measure space (complee, afer a possible applicaion of he compleion heorem). In his chaper we invesigae relaions beween various (nonequivalen) convergences

More information

Unsteady Flow Problems

Unsteady Flow Problems School of Mechanical Aerospace and Civil Engineering Unseady Flow Problems T. J. Craf George Begg Building, C41 TPFE MSc CFD-1 Reading: J. Ferziger, M. Peric, Compuaional Mehods for Fluid Dynamics H.K.

More information

Second Order Linear Differential Equations

Second Order Linear Differential Equations Second Order Linear Differenial Equaions Second order linear equaions wih consan coefficiens; Fundamenal soluions; Wronskian; Exisence and Uniqueness of soluions; he characerisic equaion; soluions of homogeneous

More information

Homogenization of random Hamilton Jacobi Bellman Equations

Homogenization of random Hamilton Jacobi Bellman Equations Probabiliy, Geomery and Inegrable Sysems MSRI Publicaions Volume 55, 28 Homogenizaion of random Hamilon Jacobi Bellman Equaions S. R. SRINIVASA VARADHAN ABSTRACT. We consider nonlinear parabolic equaions

More information

International Journal of Scientific & Engineering Research, Volume 4, Issue 10, October ISSN

International Journal of Scientific & Engineering Research, Volume 4, Issue 10, October ISSN Inernaional Journal of Scienific & Engineering Research, Volume 4, Issue 10, Ocober-2013 900 FUZZY MEAN RESIDUAL LIFE ORDERING OF FUZZY RANDOM VARIABLES J. EARNEST LAZARUS PIRIYAKUMAR 1, A. YAMUNA 2 1.

More information

ODEs II, Lecture 1: Homogeneous Linear Systems - I. Mike Raugh 1. March 8, 2004

ODEs II, Lecture 1: Homogeneous Linear Systems - I. Mike Raugh 1. March 8, 2004 ODEs II, Lecure : Homogeneous Linear Sysems - I Mike Raugh March 8, 4 Inroducion. In he firs lecure we discussed a sysem of linear ODEs for modeling he excreion of lead from he human body, saw how o ransform

More information

Econ107 Applied Econometrics Topic 7: Multicollinearity (Studenmund, Chapter 8)

Econ107 Applied Econometrics Topic 7: Multicollinearity (Studenmund, Chapter 8) I. Definiions and Problems A. Perfec Mulicollineariy Econ7 Applied Economerics Topic 7: Mulicollineariy (Sudenmund, Chaper 8) Definiion: Perfec mulicollineariy exiss in a following K-variable regression

More information

Some Basic Information about M-S-D Systems

Some Basic Information about M-S-D Systems Some Basic Informaion abou M-S-D Sysems 1 Inroducion We wan o give some summary of he facs concerning unforced (homogeneous) and forced (non-homogeneous) models for linear oscillaors governed by second-order,

More information

On Boundedness of Q-Learning Iterates for Stochastic Shortest Path Problems

On Boundedness of Q-Learning Iterates for Stochastic Shortest Path Problems MATHEMATICS OF OPERATIONS RESEARCH Vol. 38, No. 2, May 2013, pp. 209 227 ISSN 0364-765X (prin) ISSN 1526-5471 (online) hp://dx.doi.org/10.1287/moor.1120.0562 2013 INFORMS On Boundedness of Q-Learning Ieraes

More information

SPECTRAL EVOLUTION OF A ONE PARAMETER EXTENSION OF A REAL SYMMETRIC TOEPLITZ MATRIX* William F. Trench. SIAM J. Matrix Anal. Appl. 11 (1990),

SPECTRAL EVOLUTION OF A ONE PARAMETER EXTENSION OF A REAL SYMMETRIC TOEPLITZ MATRIX* William F. Trench. SIAM J. Matrix Anal. Appl. 11 (1990), SPECTRAL EVOLUTION OF A ONE PARAMETER EXTENSION OF A REAL SYMMETRIC TOEPLITZ MATRIX* William F Trench SIAM J Marix Anal Appl 11 (1990), 601-611 Absrac Le T n = ( i j ) n i,j=1 (n 3) be a real symmeric

More information

A Shooting Method for A Node Generation Algorithm

A Shooting Method for A Node Generation Algorithm A Shooing Mehod for A Node Generaion Algorihm Hiroaki Nishikawa W.M.Keck Foundaion Laboraory for Compuaional Fluid Dynamics Deparmen of Aerospace Engineering, Universiy of Michigan, Ann Arbor, Michigan

More information

Guest Lectures for Dr. MacFarlane s EE3350 Part Deux

Guest Lectures for Dr. MacFarlane s EE3350 Part Deux Gues Lecures for Dr. MacFarlane s EE3350 Par Deux Michael Plane Mon., 08-30-2010 Wrie name in corner. Poin ou his is a review, so I will go faser. Remind hem o go lisen o online lecure abou geing an A

More information

1 Solutions to selected problems

1 Solutions to selected problems 1 Soluions o seleced problems 1. Le A B R n. Show ha in A in B bu in general bd A bd B. Soluion. Le x in A. Then here is ɛ > 0 such ha B ɛ (x) A B. This shows x in B. If A = [0, 1] and B = [0, 2], hen

More information

An Introduction to Backward Stochastic Differential Equations (BSDEs) PIMS Summer School 2016 in Mathematical Finance.

An Introduction to Backward Stochastic Differential Equations (BSDEs) PIMS Summer School 2016 in Mathematical Finance. 1 An Inroducion o Backward Sochasic Differenial Equaions (BSDEs) PIMS Summer School 2016 in Mahemaical Finance June 25, 2016 Chrisoph Frei cfrei@ualbera.ca This inroducion is based on Touzi [14], Bouchard

More information

On-line Adaptive Optimal Timing Control of Switched Systems

On-line Adaptive Optimal Timing Control of Switched Systems On-line Adapive Opimal Timing Conrol of Swiched Sysems X.C. Ding, Y. Wardi and M. Egersed Absrac In his paper we consider he problem of opimizing over he swiching imes for a muli-modal dynamic sysem when

More information

Multi-scale 2D acoustic full waveform inversion with high frequency impulsive source

Multi-scale 2D acoustic full waveform inversion with high frequency impulsive source Muli-scale D acousic full waveform inversion wih high frequency impulsive source Vladimir N Zubov*, Universiy of Calgary, Calgary AB vzubov@ucalgaryca and Michael P Lamoureux, Universiy of Calgary, Calgary

More information

( ) = b n ( t) n " (2.111) or a system with many states to be considered, solving these equations isn t. = k U I ( t,t 0 )! ( t 0 ) (2.

( ) = b n ( t) n  (2.111) or a system with many states to be considered, solving these equations isn t. = k U I ( t,t 0 )! ( t 0 ) (2. Andrei Tokmakoff, MIT Deparmen of Chemisry, 3/14/007-6.4 PERTURBATION THEORY Given a Hamilonian H = H 0 + V where we know he eigenkes for H 0 : H 0 n = E n n, we can calculae he evoluion of he wavefuncion

More information

CHARACTERIZATION OF REARRANGEMENT INVARIANT SPACES WITH FIXED POINTS FOR THE HARDY LITTLEWOOD MAXIMAL OPERATOR

CHARACTERIZATION OF REARRANGEMENT INVARIANT SPACES WITH FIXED POINTS FOR THE HARDY LITTLEWOOD MAXIMAL OPERATOR Annales Academiæ Scieniarum Fennicæ Mahemaica Volumen 31, 2006, 39 46 CHARACTERIZATION OF REARRANGEMENT INVARIANT SPACES WITH FIXED POINTS FOR THE HARDY LITTLEWOOD MAXIMAL OPERATOR Joaquim Marín and Javier

More information

Sliding Mode Extremum Seeking Control for Linear Quadratic Dynamic Game

Sliding Mode Extremum Seeking Control for Linear Quadratic Dynamic Game Sliding Mode Exremum Seeking Conrol for Linear Quadraic Dynamic Game Yaodong Pan and Ümi Özgüner ITS Research Group, AIST Tsukuba Eas Namiki --, Tsukuba-shi,Ibaraki-ken 5-856, Japan e-mail: pan.yaodong@ais.go.jp

More information

CHAPTER 2 Signals And Spectra

CHAPTER 2 Signals And Spectra CHAPER Signals And Specra Properies of Signals and Noise In communicaion sysems he received waveform is usually caegorized ino he desired par conaining he informaion, and he undesired par. he desired par

More information

dy dx = xey (a) y(0) = 2 (b) y(1) = 2.5 SOLUTION: See next page

dy dx = xey (a) y(0) = 2 (b) y(1) = 2.5 SOLUTION: See next page Assignmen 1 MATH 2270 SOLUTION Please wrie ou complee soluions for each of he following 6 problems (one more will sill be added). You may, of course, consul wih your classmaes, he exbook or oher resources,

More information

LECTURE 1: GENERALIZED RAY KNIGHT THEOREM FOR FINITE MARKOV CHAINS

LECTURE 1: GENERALIZED RAY KNIGHT THEOREM FOR FINITE MARKOV CHAINS LECTURE : GENERALIZED RAY KNIGHT THEOREM FOR FINITE MARKOV CHAINS We will work wih a coninuous ime reversible Markov chain X on a finie conneced sae space, wih generaor Lf(x = y q x,yf(y. (Recall ha q

More information

2.3 SCHRÖDINGER AND HEISENBERG REPRESENTATIONS

2.3 SCHRÖDINGER AND HEISENBERG REPRESENTATIONS Andrei Tokmakoff, MIT Deparmen of Chemisry, 2/22/2007 2-17 2.3 SCHRÖDINGER AND HEISENBERG REPRESENTATIONS The mahemaical formulaion of he dynamics of a quanum sysem is no unique. So far we have described

More information

t dt t SCLP Bellman (1953) CLP (Dantzig, Tyndall, Grinold, Perold, Anstreicher 60's-80's) Anderson (1978) SCLP

t dt t SCLP Bellman (1953) CLP (Dantzig, Tyndall, Grinold, Perold, Anstreicher 60's-80's) Anderson (1978) SCLP Coninuous Linear Programming. Separaed Coninuous Linear Programming Bellman (1953) max c () u() d H () u () + Gsusds (,) () a () u (), < < CLP (Danzig, yndall, Grinold, Perold, Ansreicher 6's-8's) Anderson

More information

Vanishing Viscosity Method. There are another instructive and perhaps more natural discontinuous solutions of the conservation law

Vanishing Viscosity Method. There are another instructive and perhaps more natural discontinuous solutions of the conservation law Vanishing Viscosiy Mehod. There are anoher insrucive and perhaps more naural disconinuous soluions of he conservaion law (1 u +(q(u x 0, he so called vanishing viscosiy mehod. This mehod consiss in viewing

More information

Solutions from Chapter 9.1 and 9.2

Solutions from Chapter 9.1 and 9.2 Soluions from Chaper 9 and 92 Secion 9 Problem # This basically boils down o an exercise in he chain rule from calculus We are looking for soluions of he form: u( x) = f( k x c) where k x R 3 and k is

More information

This document was generated at 1:04 PM, 09/10/13 Copyright 2013 Richard T. Woodward. 4. End points and transversality conditions AGEC

This document was generated at 1:04 PM, 09/10/13 Copyright 2013 Richard T. Woodward. 4. End points and transversality conditions AGEC his documen was generaed a 1:4 PM, 9/1/13 Copyrigh 213 Richard. Woodward 4. End poins and ransversaliy condiions AGEC 637-213 F z d Recall from Lecure 3 ha a ypical opimal conrol problem is o maimize (,,

More information

L07. KALMAN FILTERING FOR NON-LINEAR SYSTEMS. NA568 Mobile Robotics: Methods & Algorithms

L07. KALMAN FILTERING FOR NON-LINEAR SYSTEMS. NA568 Mobile Robotics: Methods & Algorithms L07. KALMAN FILTERING FOR NON-LINEAR SYSTEMS NA568 Mobile Roboics: Mehods & Algorihms Today s Topic Quick review on (Linear) Kalman Filer Kalman Filering for Non-Linear Sysems Exended Kalman Filer (EKF)

More information

Oscillation of an Euler Cauchy Dynamic Equation S. Huff, G. Olumolode, N. Pennington, and A. Peterson

Oscillation of an Euler Cauchy Dynamic Equation S. Huff, G. Olumolode, N. Pennington, and A. Peterson PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE ON DYNAMICAL SYSTEMS AND DIFFERENTIAL EQUATIONS May 4 7, 00, Wilmingon, NC, USA pp 0 Oscillaion of an Euler Cauchy Dynamic Equaion S Huff, G Olumolode,

More information

Math Week 14 April 16-20: sections first order systems of linear differential equations; 7.4 mass-spring systems.

Math Week 14 April 16-20: sections first order systems of linear differential equations; 7.4 mass-spring systems. Mah 2250-004 Week 4 April 6-20 secions 7.-7.3 firs order sysems of linear differenial equaions; 7.4 mass-spring sysems. Mon Apr 6 7.-7.2 Sysems of differenial equaions (7.), and he vecor Calculus we need

More information

5. Stochastic processes (1)

5. Stochastic processes (1) Lec05.pp S-38.45 - Inroducion o Teleraffic Theory Spring 2005 Conens Basic conceps Poisson process 2 Sochasic processes () Consider some quaniy in a eleraffic (or any) sysem I ypically evolves in ime randomly

More information

Physics 127b: Statistical Mechanics. Fokker-Planck Equation. Time Evolution

Physics 127b: Statistical Mechanics. Fokker-Planck Equation. Time Evolution Physics 7b: Saisical Mechanics Fokker-Planck Equaion The Langevin equaion approach o he evoluion of he velociy disribuion for he Brownian paricle migh leave you uncomforable. A more formal reamen of his

More information

Notes on Kalman Filtering

Notes on Kalman Filtering Noes on Kalman Filering Brian Borchers and Rick Aser November 7, Inroducion Daa Assimilaion is he problem of merging model predicions wih acual measuremens of a sysem o produce an opimal esimae of he curren

More information

U( θ, θ), U(θ 1/2, θ + 1/2) and Cauchy (θ) are not exponential families. (The proofs are not easy and require measure theory. See the references.

U( θ, θ), U(θ 1/2, θ + 1/2) and Cauchy (θ) are not exponential families. (The proofs are not easy and require measure theory. See the references. Lecure 5 Exponenial Families Exponenial families, also called Koopman-Darmois families, include a quie number of well known disribuions. Many nice properies enjoyed by exponenial families allow us o provide

More information

Math 2142 Exam 1 Review Problems. x 2 + f (0) 3! for the 3rd Taylor polynomial at x = 0. To calculate the various quantities:

Math 2142 Exam 1 Review Problems. x 2 + f (0) 3! for the 3rd Taylor polynomial at x = 0. To calculate the various quantities: Mah 4 Eam Review Problems Problem. Calculae he 3rd Taylor polynomial for arcsin a =. Soluion. Le f() = arcsin. For his problem, we use he formula f() + f () + f ()! + f () 3! for he 3rd Taylor polynomial

More information

Lecture 2 October ε-approximation of 2-player zero-sum games

Lecture 2 October ε-approximation of 2-player zero-sum games Opimizaion II Winer 009/10 Lecurer: Khaled Elbassioni Lecure Ocober 19 1 ε-approximaion of -player zero-sum games In his lecure we give a randomized ficiious play algorihm for obaining an approximae soluion

More information

Lecture 10: The Poincaré Inequality in Euclidean space

Lecture 10: The Poincaré Inequality in Euclidean space Deparmens of Mahemaics Monana Sae Universiy Fall 215 Prof. Kevin Wildrick n inroducion o non-smooh analysis and geomery Lecure 1: The Poincaré Inequaliy in Euclidean space 1. Wha is he Poincaré inequaliy?

More information

11!Hí MATHEMATICS : ERDŐS AND ULAM PROC. N. A. S. of decomposiion, properly speaking) conradics he possibiliy of defining a counably addiive real-valu

11!Hí MATHEMATICS : ERDŐS AND ULAM PROC. N. A. S. of decomposiion, properly speaking) conradics he possibiliy of defining a counably addiive real-valu ON EQUATIONS WITH SETS AS UNKNOWNS BY PAUL ERDŐS AND S. ULAM DEPARTMENT OF MATHEMATICS, UNIVERSITY OF COLORADO, BOULDER Communicaed May 27, 1968 We shall presen here a number of resuls in se heory concerning

More information

Primal-Dual Splitting: Recent Improvements and Variants

Primal-Dual Splitting: Recent Improvements and Variants Primal-Dual Spliing: Recen Improvemens and Varians 1 Thomas Pock and 2 Anonin Chambolle 1 Insiue for Compuer Graphics and Vision, TU Graz, Ausria 2 CMAP & CNRS École Polyechnique, France The proximal poin

More information

2.7. Some common engineering functions. Introduction. Prerequisites. Learning Outcomes

2.7. Some common engineering functions. Introduction. Prerequisites. Learning Outcomes Some common engineering funcions 2.7 Inroducion This secion provides a caalogue of some common funcions ofen used in Science and Engineering. These include polynomials, raional funcions, he modulus funcion

More information

KINEMATICS IN ONE DIMENSION

KINEMATICS IN ONE DIMENSION KINEMATICS IN ONE DIMENSION PREVIEW Kinemaics is he sudy of how hings move how far (disance and displacemen), how fas (speed and velociy), and how fas ha how fas changes (acceleraion). We say ha an objec

More information

Integration Over Manifolds with Variable Coordinate Density

Integration Over Manifolds with Variable Coordinate Density Inegraion Over Manifolds wih Variable Coordinae Densiy Absrac Chrisopher A. Lafore clafore@gmail.com In his paper, he inegraion of a funcion over a curved manifold is examined in he case where he curvaure

More information

Zürich. ETH Master Course: L Autonomous Mobile Robots Localization II

Zürich. ETH Master Course: L Autonomous Mobile Robots Localization II Roland Siegwar Margaria Chli Paul Furgale Marco Huer Marin Rufli Davide Scaramuzza ETH Maser Course: 151-0854-00L Auonomous Mobile Robos Localizaion II ACT and SEE For all do, (predicion updae / ACT),

More information

Ordinary dierential equations

Ordinary dierential equations Chaper 5 Ordinary dierenial equaions Conens 5.1 Iniial value problem........................... 31 5. Forward Euler's mehod......................... 3 5.3 Runge-Kua mehods.......................... 36

More information

A Decentralized Second-Order Method with Exact Linear Convergence Rate for Consensus Optimization

A Decentralized Second-Order Method with Exact Linear Convergence Rate for Consensus Optimization 1 A Decenralized Second-Order Mehod wih Exac Linear Convergence Rae for Consensus Opimizaion Aryan Mokhari, Wei Shi, Qing Ling, and Alejandro Ribeiro Absrac This paper considers decenralized consensus

More information

WEEK-3 Recitation PHYS 131. of the projectile s velocity remains constant throughout the motion, since the acceleration a x

WEEK-3 Recitation PHYS 131. of the projectile s velocity remains constant throughout the motion, since the acceleration a x WEEK-3 Reciaion PHYS 131 Ch. 3: FOC 1, 3, 4, 6, 14. Problems 9, 37, 41 & 71 and Ch. 4: FOC 1, 3, 5, 8. Problems 3, 5 & 16. Feb 8, 018 Ch. 3: FOC 1, 3, 4, 6, 14. 1. (a) The horizonal componen of he projecile

More information

Heat kernel and Harnack inequality on Riemannian manifolds

Heat kernel and Harnack inequality on Riemannian manifolds Hea kernel and Harnack inequaliy on Riemannian manifolds Alexander Grigor yan UHK 11/02/2014 onens 1 Laplace operaor and hea kernel 1 2 Uniform Faber-Krahn inequaliy 3 3 Gaussian upper bounds 4 4 ean-value

More information

POSITIVE SOLUTIONS OF NEUTRAL DELAY DIFFERENTIAL EQUATION

POSITIVE SOLUTIONS OF NEUTRAL DELAY DIFFERENTIAL EQUATION Novi Sad J. Mah. Vol. 32, No. 2, 2002, 95-108 95 POSITIVE SOLUTIONS OF NEUTRAL DELAY DIFFERENTIAL EQUATION Hajnalka Péics 1, János Karsai 2 Absrac. We consider he scalar nonauonomous neural delay differenial

More information