Information Theoretic views of Path Integral Control.

Size: px
Start display at page:

Download "Information Theoretic views of Path Integral Control."

Transcription

1 Information Theoretic views of Path Integral Control. Evangelos A. Theodoro and Emannel Todorov Abstract We derive the connections of Path IntegralPI and Klback-LieblerKL control as presented in machine learning and robotics commnities 4 with earlier work in controls theory on the fndamental dalities between relative entropy and free energy and the logarithmic transformations of diffsions processes 5 0. Or analysis provides an information theoretic view of PI stochastic optimal control based on the dality between Free Energy and Relative Entropy as expressed by the Legendre-Fenchel transformation in Statistical Mechanics. Finally we discss the cases of partial observability and min-max control as stdied in controls and conclde. Introdction Recent developments in nonlinear stochastic optimal control and machine learning sggest new ways to solve optimization problems for dynamical systems in continos state action spaces. The mathematical analysis is based on the qantm-mechanical concept of PI. The PI formalism provides an nified view of Newtonian and Qantm mechanics. In stochastic optimal control theory, path integrals can be sed to represent soltions of partial differential eqations. Here we provide the information theoretic view of path integral control and show its connection to mathematical developments in control theory. The Feynman-Kac lemma will play an essential role in or analysis since it creates bridges between cost fnctions sbject to stochastic differential eqations and partial differential eqations. This paper is organized as follows, in section 2 we show the dalities between free energy and relative entropy. In section 3 we derive PI control based on these dalities and discss the cases of min-max control and partial observability. In section 4 we derive PI based on the Bellman principle for continos and discrete time. In the last section 5 we compare the different approaches to PI control and conclde. 2 Free Energy and Relative Entropy Dalities In this section we derive the fndamental dality relationships between free energy and relative entropy 9. This relationship is important for the derivation of stochastic optimal control. Let Z, Z denote a measrable space and PZ the corresponding probability measre defined on the measrable space. For or analysis we consider the following definitions. Definition : Let P PZ and the fnction J x : Z R be a measrable fnction. Then the term: E J x = log exp ρj xdp is called free energy of J x with respect to P. Definition 2: Let P PZ and Q PZ, the relative entropy of P with respect to Q is defined as: { log H Q P = dp if Q << P and log dp L + otherwise

2 We will also consider the objective fnction: ξx = ρ J E x = ρ 0 log E τ i exp ρj x with J x = φx tn + t N qxds the state dependent cost. The objective fnction above takes the form ξx = E τ 0 i J + ρ 2 V ar J as ρ 0. This form allows s to get the basic intition for constrcting sch objective fnctions. Essentially for small ρ the coss a fnction of the mean the variance. When ρ > 0 the cost fnction is risk sensitive while for ρ < 0 is risk seeking. To derive the basic relationship between free energy and relative entropy we express the expectation E τ 0 i taken nder the measre P as a fnction of the expectation E taken nder the probability measre. More precisely will have: E 0 exp ρj x = exp ρj xdp = exp ρj x dp By taking the logarithm of both sides of the eqations above and making se of the Jensen s ineqality we will have: log E τ 0 i exp ρj x = log exp ρj x dp log exp ρj x dp. We mltiply the ineqality above with ρ for case of ρ < 0 or ρ = and ths we have: ξx = E J x E J x + H Q P where E J x = J x. The ineqality above gives s the dality relationship between relative entropy and free energy. Essentially one cold define the following two minimization problems: The infimm in 2 is attained at Q given by: E J x = inf E J x + Q H Q P = exp J xdp 3 exp J xdp When ρ > 0 the ineqality in becomes from to and the inf in 2 and becomes sp. Therefore we will have that: E J x = sp E J x Q H Q P 4 In the next section we show how ineqality 2 is transformed to a stochastic optimal control problem for the case of markov diffsion processes. 3 Information theoretic view of stochastic optimal control. We consider the ncontrolled and controlled stochastic dynamics of the form: dx = fxdt + Bxdw 0 t and dx = fxdt + Bx dt + dw t with x t R n denoting the state of the system, Bx, t : R n R R n n is the control and diffsions matrix, fx, t : R n R R n the passive dynamics, t R n the control vector and dw R p brownian noise. Notice that the difference between the two diffsions above is on the controls. These controls together with the passive dynamics define a new drift term. For or analysis here we assme B exists. Expectations evalated on trajectories generated by the controlled dynamics and ncontrolled dynamics are represented as E 0 and E respectively. The corresponding probability measres of the aforementioned expectations are P and Q. We contine or analysis with the main resln dp and the definition of the Radon-Nikodým derivative: dp = exp ζ and = exp ζ where according to Girsanov s theorem 2 adapted to the diffsion processes considered here, the 2 2

3 term ζ is expressed as: ζ = 2 t N T dt + t N T dw t. Sbstittion of the Radon-Nikodým derivative into 2 gives the following reslt: ξx = 0 log E exp J x E J x + ζ = E J x + tn T dt 2 5 The right term of the ineqality above corresponds to the cost fnction of a stochastic optimal control problem thas bonded from below by the free energy. Besides providing a lower bond on the objective fnction of the stochastic optimal control problem ineqality 5 expresses also how this lower bond shold be compted. This comptation involves forward sampling of the ncontrolled dynamics, evalation of the expectation of the exponentiated state depended part φx tn and qx t and the logarithmic transformation of this expectation. Srprisingly, ineqality 5 was derived withot relying on any principle of optimality. It only takes the application of Girsanov theorem between controlled and ncontrolled stochastic dynamics and the se of dal relationship between free energy and relative entropy to find the lower bond in 5. Essentially ineqality 5 defines a minimization problem in which the right part of the ineqality is minimized with respect ζ and therefore with respect to control. At the minimm, when = then the right part of the ineqality in 5 reaches its optimal ξx. An important qestion to is the link between 5 and the dynamic programming principle. To find this link the next step is to show that ξx satisfies the HJB eqations and therefore is the corresponding vale fnction. More precisely, we introdce a new variable Φx, t defined as Φx, t = E 0 exp ρj x. The Feynman-Kac lemma 3 tells s that this fnction satisfies the backward Chapman Kolmogorov PDE. Therefore the following eqation holds: t Φ = ρq 0 Φ + f T x Φ + 2 tr xx ΦBB T 6 For ρ = < 0 and since ξx = ρ log Φx, t = log Φx, t we will have that tφ = Φ t ξ, x Φ = Φ x ξ and xx Φ = Φ xx ξ + 2 Φ x ξ x ξ T it can be shown that ξx satisfies the nonlinear PDE: t ξ = q 0 + x ξ T f 2 xξ T BB T x ξ + 2 tr xx ξbb T 7 The nonlinear PDEs above corresponds to the HJB eqation 4 for the case of the minimizing optimal control problem with control weight R = I and therefore, ξx is the corresponding minimizing vale fnction. Note than order to derive the PDEs above we did not se any principle of optimality. Similar reslts can be derived for ρ = > 0 as shown in 6. The analysis so far is smmarized by the following corollary in which we se the fnction signx = x < 0 and signx = x > 0. More precisely we will have: Corollary Consider the expectation operators E 0, E evalated on state trajectories sampled according to ncontrolled and controlled dynamics respectively. The fnction ξx, t specified as: ξx, t = signρ log E 0 exp signρj x is the vale fnction of the stochastic optimal control problems: ξx, = min E t N qx 2 T dt, ρ > 0 and ξx, = max E t N qx + 2 T dt, ρ < Information theoretic view of stochastic optimal control: The case of partial observability. In the partial observable case 9, besides the stochastic dynamics there are also the observation diffsions: 3

4 dy = hxdt + Cxdv 0 t and dy = hxdt + Cx bdt + Cxdv t 8 The term y denotes the observations, b plays the role of control that act on observations, and dv 0 and dv 0 is the observation noise nder the ncontrolled and controlled measrement dynamics. In this case the Radon-Nikodým derivative is expressed as dp = exp ζ with ζ = 2 t N T + b T b dt + t N T dw t + b T dv t. Under the observation dynamics above the free energy is the lower bond of the following expression: ξx = 0 log E exp J x E J x + tn T + b T b dt 9 2 The expectation E 0 is with respect to the process and observations noise dw 0 t, dv 0 t while the expectation E is with respect to dw t, dv t. Again the free energy is the lower bond on a cost thas typically fond in stochastic optimal control. The partial observable case incldes controls in the observations. 3.2 Information theoretic view of stochastic optimal control: The case of min-max optimal control. We consider stochastic dynamics : dx = fxdt + Bx dt + dw 0 t and cost fnction Sx = E 0 exp Lx, dt = E exp 0 t N Lx, dt. Next we define free energy as follows E Lx, = log exp ρlx, dp and se the Legendre transformation that leads to maximization problem: ELx, = sp E Lx,. H Q P By taking into accont then stochastic dynamics and the Radon-Nikodým derivative we will have: inf inf log log exp ρlx, dp = inf sp Q exp ρlx, dp = inf sp π E Lx, E Lx, 2 H Q P tn π T πdt 0 where E is the expectation on trajectories generated based on the diffsion dx = fxdt + Bx dt + πdt + dw t. The term π can be thoght as a destabilizing controller. Eqation 0 is a short bt elegant way to show the eqivalence of min-max control and differential game theory with risk sensitivity. Essentially all that takes is an appropriate definition of free energy and the se of the Legendre transformation. Notice that no dynamic programming argments were sed in this analysis. 4 Derivation based on Bellman Principle: The continos case. We consider stochastic optimal control in the classical sense, as a constrained optimization problem, with the cost fnction nder minimization given by the mathematical expression: V x = min E Jx, = min E tn to Lx,, tdt sbject to the nonlinear stochastic dynamics: dx = Fx, dt + Bxdw with x R n denoting the state of the system, R p the control vector and dw R p brownian noise. 4

5 The fnction Fx, is a nonlinear fnction of the state x and affine in controls and therefore is defined as Fx, = fx + Gx. The matrix Gx R n p is the control matrix, Bx R n p is the diffsion matrix and fx R n are the passive dynamics. The cost fnction Jx, is a fnction of states and controls. Under the optimal controls the cost fnction is eqal to the vale fnction V x. The term Lx,,t is the rnning cost and is expressed as: Lx,, t = q 0 x, t + q x, t + 2 T R. Essentially, the rnning cost has three terms, the first q 0 x t, t is a state-dependent cost, the second term depends on states and controls and the third is the control cost with the term R > 0 the corresponding weight. The stochastic HJB eqation 5, 4 associated with this stochastic optimal control problem is expressed as follows: t V = min L + x V T F + 2 tr xx V BB T. The corresponding optimal control is given by the eqation: x t = R q x, t + Gx T x V x, t. These optimal controls will psh the system dynamics in the direction opposite that of the gradient of the vale fnction x V x, t. The vale fnction satisfies nonlinear, second-order PDE: t V = q + x V T f 2 xv T GR G T x V + 2 tr xx V BB T with qx, t and fx, t defined as qx, t = q 0 x, t 2 q x, t T R q x, t and fx, t = fx, t Gx, tr q x, t and the bondary condition V x tn = φx tn. Given the exponential transformation V x, t = λ log Ψx, t and the assmption λgxr Gx T = BxBx T = Σx t = Σ the reslting PDE is formlated as follows: t Ψ = λ qψ + f T x Ψ + 2 tr xxψσ 2 with bondary condition: Ψxt N = exp λ φxt N. By applying the Feynman-Kac lemma to the Chapman-Kolmogorov PDE 2 yields its soltion in form of an expectation over system trajectories. This soltion is mathematically expressed as: tn Ψ x ti = E 0 exp λ qxdt Ψx tn The expectation E 0 is taken on sample paths generated with the forward sampling of the ncontrolled diffsion eqation dx = fx t δt + Bxdw. The optimal controls are specified as: P I x = R q x, t λgx T xψx,t. Since, the initial vale the fnction V x, t is Ψx,t the minimm of the expectation of the objective fnction Jx,, it can be trivially shown that: 3 tn V x, = λ log E 0 exp λ qxdt Ψx tn E Jx, 4 Note that the ineqality above in similar to 5 when the following eqations hold: q x = 0, R = I, λ =, G = B, B = B. The first three eqalities garantee that Jx, = tn J x ρ T dt are identical, and the last two eqalities make sre that the expectations are evalated nder the same diffsions and therefore E 0 E 0 and E E. Under the conditions above the Kolmogorov PDEs 6 and 2 and the HJB eqations and 7 are identical. 4. Derivation based on Bellman Principle: The discrete case. In the KL control framework 3,4,5 the analysis starts with the application of the Bellman principle of optimality on Markov Decision Processes MDP and nder the rnning cost specified as a sm of a state depended term and the Kllback Leibler Divergence between the transition densities of the controlled and ncontrolled dynamics. In particlar, the rnning coss specified as L x, = qx + H Q P = qx + E log px x, px x. The transition probabilities nder the controlled and ncontrolled dynamics are represented as px x, and px x. Application of the Bellman principle of optimality reslts in the minimization of the qantity: 5

6 V t x = min qx + E log px x, + V t+ x U px x Depending on the stochastic optimal control problem wx is eqal to V x, αv x, V t+ x. For or presentation here we choose wx = V t+ x that corresponds to finite horizon case. The dependent terms in the fnctional above are minimized and ths we will have that: E log px x, px + V t+ x = E px x, log x px x exp V t+ x For the prposes the normalization term GΦx is introdced with Φx = exp wx being the desirability fnction defined as GΦx = px xφx = E Φx 0, we will have that: E log x x px x + V t+x = log GΦx + H px x, px xφx GΦx Sbstittion of the expression above into the Bellman minimization eqation reslts in: min U qx log GΦx + H x x px xφx GΦx. The minimm of the Bellman eqation is attained by: p x x, = px xφx GΦx. The last eqation provides the transition probability nder the optimal control law and in that sense it the optimal transition probability. Clearly the optimal distribtion above is identical to eqations 3. Sbstittion of the optimal distribtion above will resln the Bellman eqation: Φx = exp qxgφx. The link with the continos case is established by writing the Bellman eqation for an MDP with continos state space Φ δt x = exp qxδtgφ δt x. Rearrangement of the terms reslts in: exp qxδt Φ δt x = E 0 Φ δt x Φ δt x. Under the limit δ 0 the eqation reslts the backward Chaplman Kolmogorov PDE in 2 for ρ =. 5 Discssion We show the connection of path integral control framework as presented in the machine learning and robotic commnities 3, 6 8 with work in the control theoretic commnity on risk sensitivity 5, 6, 8, 9. Essentially there are two methodological approaches to derive the path integral framework. In the first, stochastic optimal control is specified as minimization of the objective E Jx, sbject to the controlled dynamics. The HJB PDE is derived based on the Bellman principle of optimality. The exponential transformation of the vale fnction V x and the connection between control cost and variance resln the transformation of the HJB in to the backward Chapman Kolmogorov. The Feynman-Kac lemma is applied and the soltion of the Chapman Kolmogorov PDE together with the lower bond on the objective fnction are provided. The second methodological approach starts with the dality between free energy and relative entropy and the reslting optimization problem as expressed in 2. For diffsion processes affine in control and noise and nder the se of Girsanov s theorem, the aforementioned optimization reslts in formlating the bond ξx of the objective fnction E Jx, which is typically fond in stochastic optimal control. The link to Bellman optimality is established by showing that, this bond ξx satisfies the HJB eqation and therefore is a vale fnction. Inside the class of the stochastic dynamics of markov diffsion processes affine in control and noise, Dynamic Programming is more general since incorporates general cost fnctions and stochastic dynamics. This generalization however, is redced by the assmption regarding control cost and the variance of the noise λgxr Gx T = BxBx T. In the second approach the lower bond ξx of the accmlated trajectory cost E Jx, is derived withot relying on the Bellman Principle. In fact, this lower bond defines a new form of optimality which, as is shown in 5,6 as well as in this work, for the case of diffsion processes is eqivalent to the Bellman principle of optimality. In the KL stochastic optimal control framework the derivation relies on the Bellman Principle of Optimality in discrete time. The reslting distribtion 6

7 p k x x, is optimal since is the distribtion that reslts when actions are optimal. In that sense the KL framework, in its initial formlation 3 does not explicitly provide an optimal control law bnstead it provides the optimal distribtion or optimal transition probability nder the se of optimal control law. For the case of control affine diffsions, KL control framework incorporates control-only and state-only depended terms in contrast to PI derived based on the Bellman principle in which cross terms between controls and states may be considered. The compositionally of optimal controls incldes control laws and ths it can incorporate any analytically derived optimal control as well as PI control. The information theoretic view and in particlar the se of the Legendre transformation and the dality between free energy and relative is an elegant way to derive many important reslts in control theory and machine learning. Frthermore, it allows for generalizations in the sense thaneqality 5 holds for more general models of stochasticity sch as the case of jmp diffsions. References H. J. Kappen, Path integrals and symmetry breaking for optimal control theory, Jornal of Statistical Mechanics: Theory and Experiment, vol., p. P0, H. J. Kappen, An introdction to stochastic control theory, path integrals and reinforcement learning, in Cooperative Behavior in Neral Systems J. Marro, P. L. Garrido, and J. J. Torres, eds., vol. 887 of American Institte of Physics Conference Series, pp. 49 8, Feb E. Todorov, Efficient comptation of optimal actions, Proc Natl Acad Sci U S A, vol. 06, no. 28, pp , E. Todorov, Linearly-solvable markov decision problems, in Advances in Neral Information Processing Systems 9 NIPS 2007 B. Scholkopf, J. Platt, and T. Hoffman, eds., Vancover, BC, Cambridge, MA: MIT Press, W. H. Fleming and H. M. Soner, Controlled Markov processes and viscosity soltions. Applications of mathematics, New York: Springer, 2nd ed., W. H. Fleming and H. M. Soner, Controlled Markov processes and viscosity soltions. Applications of mathematics, New York: Springer, nd ed., W. Fleming, Exit probabilities and optimal stochastic control, Applied Math. Optim, vol. 9, pp , W. H. Fleming and W. M. McEneaney, Risk-sensitive control on an infinite time horizon, SIAM J. Control Optim., vol. 33, pp , November P. Dai Pra, L. Meneghini, and W. Rnggaldier, Connections between stochastic control and dynamic games, Mathematics of Control, Signals, and Systems MCSS, vol. 9, no. 4, pp , S. K. Mitter and N. J. Newton, A variational approach to nonlinear estimation, SIAM J. Control Optim., vol. 42, pp , May H. Tochette, The large deviation approach to statistical mechanics, Physics Reports, vol. 478, pp. 69, I. Karatzas and S. E. Shreve, Brownian Motion and Stochastic Calcls Gradate Texts in Mathematics. Springer, 2nd ed., Agst A. Friedman, Stochastic Differential Eqations And Applications. Academic Press, R. F. Stengel, Optimal control and estimation. Dover books on advanced mathematics, New York: Dover Pblications, E. Todorov, Compositionality of optimal control laws, In Advances in Neral Information Processing Systems, vol. 22, pp , E. Theodoro, J. Bchli, and S. Schaal, A generalized path integral approach to reinforcement learning, Jornal of Machine Learning Research, no., pp , E. Theodoro, Iterative Path Integral Stochastic Optimal Control: Theory and Applications to Motor Control. PhD thesis, niversity of sothern California, May J. Bchli, E. Theodoro, F. Stlp, and S. Schaal, Variable impedance control - a reinforcement learning approach, in Robotics: Science and Systems Conference RSS,

Real Time Stochastic Control and Decision Making: From theory to algorithms and applications

Real Time Stochastic Control and Decision Making: From theory to algorithms and applications Real Time Stochastic Control and Decision Making: From theory to algorithms and applications Evangelos A. Theodorou Autonomous Control and Decision Systems Lab Challenges in control Uncertainty Stochastic

More information

Optimization via the Hamilton-Jacobi-Bellman Method: Theory and Applications

Optimization via the Hamilton-Jacobi-Bellman Method: Theory and Applications Optimization via the Hamilton-Jacobi-Bellman Method: Theory and Applications Navin Khaneja lectre notes taken by Christiane Koch Jne 24, 29 1 Variation yields a classical Hamiltonian system Sppose that

More information

The Linear Quadratic Regulator

The Linear Quadratic Regulator 10 The Linear Qadratic Reglator 10.1 Problem formlation This chapter concerns optimal control of dynamical systems. Most of this development concerns linear models with a particlarly simple notion of optimality.

More information

Lecture 8: September 26

Lecture 8: September 26 10-704: Information Processing and Learning Fall 2016 Lectrer: Aarti Singh Lectre 8: September 26 Note: These notes are based on scribed notes from Spring15 offering of this corse. LaTeX template cortesy

More information

Linearly Solvable Markov Games

Linearly Solvable Markov Games Linearly Solvable Markov Games Krishnamrthy Dvijotham and mo Todorov Abstract Recent work has led to an interesting new theory of linearly solvable control, where the Bellman eqation characterizing the

More information

Remarks on strongly convex stochastic processes

Remarks on strongly convex stochastic processes Aeqat. Math. 86 (01), 91 98 c The Athor(s) 01. This article is pblished with open access at Springerlink.com 0001-9054/1/010091-8 pblished online November 7, 01 DOI 10.1007/s00010-01-016-9 Aeqationes Mathematicae

More information

Essentials of optimal control theory in ECON 4140

Essentials of optimal control theory in ECON 4140 Essentials of optimal control theory in ECON 4140 Things yo need to know (and a detail yo need not care abot). A few words abot dynamic optimization in general. Dynamic optimization can be thoght of as

More information

Stability of Model Predictive Control using Markov Chain Monte Carlo Optimisation

Stability of Model Predictive Control using Markov Chain Monte Carlo Optimisation Stability of Model Predictive Control sing Markov Chain Monte Carlo Optimisation Elilini Siva, Pal Golart, Jan Maciejowski and Nikolas Kantas Abstract We apply stochastic Lyapnov theory to perform stability

More information

B-469 Simplified Copositive and Lagrangian Relaxations for Linearly Constrained Quadratic Optimization Problems in Continuous and Binary Variables

B-469 Simplified Copositive and Lagrangian Relaxations for Linearly Constrained Quadratic Optimization Problems in Continuous and Binary Variables B-469 Simplified Copositive and Lagrangian Relaxations for Linearly Constrained Qadratic Optimization Problems in Continos and Binary Variables Naohiko Arima, Snyong Kim and Masakaz Kojima October 2012,

More information

CHARACTERIZATIONS OF EXPONENTIAL DISTRIBUTION VIA CONDITIONAL EXPECTATIONS OF RECORD VALUES. George P. Yanev

CHARACTERIZATIONS OF EXPONENTIAL DISTRIBUTION VIA CONDITIONAL EXPECTATIONS OF RECORD VALUES. George P. Yanev Pliska Std. Math. Blgar. 2 (211), 233 242 STUDIA MATHEMATICA BULGARICA CHARACTERIZATIONS OF EXPONENTIAL DISTRIBUTION VIA CONDITIONAL EXPECTATIONS OF RECORD VALUES George P. Yanev We prove that the exponential

More information

Admissibility under the LINEX loss function in non-regular case. Hidekazu Tanaka. Received November 5, 2009; revised September 2, 2010

Admissibility under the LINEX loss function in non-regular case. Hidekazu Tanaka. Received November 5, 2009; revised September 2, 2010 Scientiae Mathematicae Japonicae Online, e-2012, 427 434 427 Admissibility nder the LINEX loss fnction in non-reglar case Hidekaz Tanaka Received November 5, 2009; revised September 2, 2010 Abstract. In

More information

Optimal Control of a Heterogeneous Two Server System with Consideration for Power and Performance

Optimal Control of a Heterogeneous Two Server System with Consideration for Power and Performance Optimal Control of a Heterogeneos Two Server System with Consideration for Power and Performance by Jiazheng Li A thesis presented to the University of Waterloo in flfilment of the thesis reqirement for

More information

On Multiobjective Duality For Variational Problems

On Multiobjective Duality For Variational Problems The Open Operational Research Jornal, 202, 6, -8 On Mltiobjective Dality For Variational Problems. Hsain *,, Bilal Ahmad 2 and Z. Jabeen 3 Open Access Department of Mathematics, Jaypee University of Engineering

More information

A Characterization of the Domain of Beta-Divergence and Its Connection to Bregman Variational Model

A Characterization of the Domain of Beta-Divergence and Its Connection to Bregman Variational Model entropy Article A Characterization of the Domain of Beta-Divergence and Its Connection to Bregman Variational Model Hyenkyn Woo School of Liberal Arts, Korea University of Technology and Edcation, Cheonan

More information

ON THE SHAPES OF BILATERAL GAMMA DENSITIES

ON THE SHAPES OF BILATERAL GAMMA DENSITIES ON THE SHAPES OF BILATERAL GAMMA DENSITIES UWE KÜCHLER, STEFAN TAPPE Abstract. We investigate the for parameter family of bilateral Gamma distribtions. The goal of this paper is to provide a thorogh treatment

More information

Sufficient Optimality Condition for a Risk-Sensitive Control Problem for Backward Stochastic Differential Equations and an Application

Sufficient Optimality Condition for a Risk-Sensitive Control Problem for Backward Stochastic Differential Equations and an Application Jornal of Nmerical Mathematics and Stochastics, 9(1) : 48-6, 17 http://www.jnmas.org/jnmas9-4.pdf JNM@S Eclidean Press, LLC Online: ISSN 151-3 Sfficient Optimality Condition for a Risk-Sensitive Control

More information

A Theory of Markovian Time Inconsistent Stochastic Control in Discrete Time

A Theory of Markovian Time Inconsistent Stochastic Control in Discrete Time A Theory of Markovian Time Inconsistent Stochastic Control in Discrete Time Tomas Björk Department of Finance, Stockholm School of Economics tomas.bjork@hhs.se Agatha Mrgoci Department of Economics Aarhs

More information

THE HOHENBERG-KOHN THEOREM FOR MARKOV SEMIGROUPS

THE HOHENBERG-KOHN THEOREM FOR MARKOV SEMIGROUPS THE HOHENBERG-KOHN THEOREM FOR MARKOV SEMIGROUPS OMAR HIJAB Abstract. At the basis of mch of comptational chemistry is density fnctional theory, as initiated by the Hohenberg-Kohn theorem. The theorem

More information

SUBORDINATION RESULTS FOR A CERTAIN SUBCLASS OF NON-BAZILEVIC ANALYTIC FUNCTIONS DEFINED BY LINEAR OPERATOR

SUBORDINATION RESULTS FOR A CERTAIN SUBCLASS OF NON-BAZILEVIC ANALYTIC FUNCTIONS DEFINED BY LINEAR OPERATOR italian jornal of pre and applied mathematics n. 34 215 375 388) 375 SUBORDINATION RESULTS FOR A CERTAIN SUBCLASS OF NON-BAZILEVIC ANALYTIC FUNCTIONS DEFINED BY LINEAR OPERATOR Adnan G. Alamosh Maslina

More information

Part II. Martingale measres and their constrctions 1. The \First" and the \Second" fndamental theorems show clearly how \mar tingale measres" are impo

Part II. Martingale measres and their constrctions 1. The \First and the \Second fndamental theorems show clearly how \mar tingale measres are impo Albert N. Shiryaev (Stelov Mathematical Institte and Moscow State University) ESSENTIALS of the ARBITRAGE THEORY Part I. Basic notions and theorems of the \Arbitrage Theory" Part II. Martingale measres

More information

Mean Value Formulae for Laplace and Heat Equation

Mean Value Formulae for Laplace and Heat Equation Mean Vale Formlae for Laplace and Heat Eqation Abhinav Parihar December 7, 03 Abstract Here I discss a method to constrct the mean vale theorem for the heat eqation. To constrct sch a formla ab initio,

More information

L 1 -smoothing for the Ornstein-Uhlenbeck semigroup

L 1 -smoothing for the Ornstein-Uhlenbeck semigroup L -smoothing for the Ornstein-Uhlenbeck semigrop K. Ball, F. Barthe, W. Bednorz, K. Oleszkiewicz and P. Wolff September, 00 Abstract Given a probability density, we estimate the rate of decay of the measre

More information

RELIABILITY ASPECTS OF PROPORTIONAL MEAN RESIDUAL LIFE MODEL USING QUANTILE FUNC- TIONS

RELIABILITY ASPECTS OF PROPORTIONAL MEAN RESIDUAL LIFE MODEL USING QUANTILE FUNC- TIONS RELIABILITY ASPECTS OF PROPORTIONAL MEAN RESIDUAL LIFE MODEL USING QUANTILE FUNC- TIONS Athors: N.UNNIKRISHNAN NAIR Department of Statistics, Cochin University of Science Technology, Cochin, Kerala, INDIA

More information

Study of the diffusion operator by the SPH method

Study of the diffusion operator by the SPH method IOSR Jornal of Mechanical and Civil Engineering (IOSR-JMCE) e-issn: 2278-684,p-ISSN: 2320-334X, Volme, Isse 5 Ver. I (Sep- Oct. 204), PP 96-0 Stdy of the diffsion operator by the SPH method Abdelabbar.Nait

More information

Decision Oriented Bayesian Design of Experiments

Decision Oriented Bayesian Design of Experiments Decision Oriented Bayesian Design of Experiments Farminder S. Anand*, Jay H. Lee**, Matthew J. Realff*** *School of Chemical & Biomoleclar Engineering Georgia Institte of echnology, Atlanta, GA 3332 USA

More information

Modelling by Differential Equations from Properties of Phenomenon to its Investigation

Modelling by Differential Equations from Properties of Phenomenon to its Investigation Modelling by Differential Eqations from Properties of Phenomenon to its Investigation V. Kleiza and O. Prvinis Kanas University of Technology, Lithania Abstract The Panevezys camps of Kanas University

More information

Discontinuous Fluctuation Distribution for Time-Dependent Problems

Discontinuous Fluctuation Distribution for Time-Dependent Problems Discontinos Flctation Distribtion for Time-Dependent Problems Matthew Hbbard School of Compting, University of Leeds, Leeds, LS2 9JT, UK meh@comp.leeds.ac.k Introdction For some years now, the flctation

More information

Optimal search: a practical interpretation of information-driven sensor management

Optimal search: a practical interpretation of information-driven sensor management Optimal search: a practical interpretation of information-driven sensor management Fotios Katsilieris, Yvo Boers and Hans Driessen Thales Nederland B.V. Hengelo, the Netherlands Email: {Fotios.Katsilieris,

More information

Formulas for stopped diffusion processes with stopping times based on drawdowns and drawups

Formulas for stopped diffusion processes with stopping times based on drawdowns and drawups Stochastic Processes and their Applications 119 (009) 563 578 www.elsevier.com/locate/spa Formlas for stopped diffsion processes with stopping times based on drawdowns and drawps Libor Pospisil, Jan Vecer,

More information

BLOOM S TAXONOMY. Following Bloom s Taxonomy to Assess Students

BLOOM S TAXONOMY. Following Bloom s Taxonomy to Assess Students BLOOM S TAXONOMY Topic Following Bloom s Taonomy to Assess Stdents Smmary A handot for stdents to eplain Bloom s taonomy that is sed for item writing and test constrction to test stdents to see if they

More information

Discussion of The Forward Search: Theory and Data Analysis by Anthony C. Atkinson, Marco Riani, and Andrea Ceroli

Discussion of The Forward Search: Theory and Data Analysis by Anthony C. Atkinson, Marco Riani, and Andrea Ceroli 1 Introdction Discssion of The Forward Search: Theory and Data Analysis by Anthony C. Atkinson, Marco Riani, and Andrea Ceroli Søren Johansen Department of Economics, University of Copenhagen and CREATES,

More information

The Dual of the Maximum Likelihood Method

The Dual of the Maximum Likelihood Method Department of Agricltral and Resorce Economics University of California, Davis The Dal of the Maximm Likelihood Method by Qirino Paris Working Paper No. 12-002 2012 Copyright @ 2012 by Qirino Paris All

More information

The Replenishment Policy for an Inventory System with a Fixed Ordering Cost and a Proportional Penalty Cost under Poisson Arrival Demands

The Replenishment Policy for an Inventory System with a Fixed Ordering Cost and a Proportional Penalty Cost under Poisson Arrival Demands Scientiae Mathematicae Japonicae Online, e-211, 161 167 161 The Replenishment Policy for an Inventory System with a Fixed Ordering Cost and a Proportional Penalty Cost nder Poisson Arrival Demands Hitoshi

More information

Controlled Diffusions and Hamilton-Jacobi Bellman Equations

Controlled Diffusions and Hamilton-Jacobi Bellman Equations Controlled Diffusions and Hamilton-Jacobi Bellman Equations Emo Todorov Applied Mathematics and Computer Science & Engineering University of Washington Winter 2014 Emo Todorov (UW) AMATH/CSE 579, Winter

More information

Characterizations of probability distributions via bivariate regression of record values

Characterizations of probability distributions via bivariate regression of record values Metrika (2008) 68:51 64 DOI 10.1007/s00184-007-0142-7 Characterizations of probability distribtions via bivariate regression of record vales George P. Yanev M. Ahsanllah M. I. Beg Received: 4 October 2006

More information

Pulses on a Struck String

Pulses on a Struck String 8.03 at ESG Spplemental Notes Plses on a Strck String These notes investigate specific eamples of transverse motion on a stretched string in cases where the string is at some time ndisplaced, bt with a

More information

Chem 4501 Introduction to Thermodynamics, 3 Credits Kinetics, and Statistical Mechanics. Fall Semester Homework Problem Set Number 10 Solutions

Chem 4501 Introduction to Thermodynamics, 3 Credits Kinetics, and Statistical Mechanics. Fall Semester Homework Problem Set Number 10 Solutions Chem 4501 Introdction to Thermodynamics, 3 Credits Kinetics, and Statistical Mechanics Fall Semester 2017 Homework Problem Set Nmber 10 Soltions 1. McQarrie and Simon, 10-4. Paraphrase: Apply Eler s theorem

More information

), σ is a parameter, is the Euclidean norm in R d.

), σ is a parameter, is the Euclidean norm in R d. WEIGHTED GRAPH LAPLACIAN AND IMAGE INPAINTING ZUOQIANG SHI, STANLEY OSHER, AND WEI ZHU Abstract. Inspired by the graph Laplacian and the point integral method, we introdce a novel weighted graph Laplacian

More information

Mixed Type Second-order Duality for a Nondifferentiable Continuous Programming Problem

Mixed Type Second-order Duality for a Nondifferentiable Continuous Programming Problem heoretical Mathematics & Applications, vol.3, no.1, 13, 13-144 SSN: 179-9687 (print), 179-979 (online) Scienpress Ltd, 13 Mied ype Second-order Dality for a Nondifferentiable Continos Programming Problem.

More information

Optimal Mean-Variance Portfolio Selection

Optimal Mean-Variance Portfolio Selection Research Report No. 14, 2013, Probab. Statist. Grop Manchester 25 pp Optimal Mean-Variance Portfolio Selection J. L. Pedersen & G. Peskir Assming that the wealth process X is generated self-financially

More information

CHANNEL SELECTION WITH RAYLEIGH FADING: A MULTI-ARMED BANDIT FRAMEWORK. Wassim Jouini and Christophe Moy

CHANNEL SELECTION WITH RAYLEIGH FADING: A MULTI-ARMED BANDIT FRAMEWORK. Wassim Jouini and Christophe Moy CHANNEL SELECTION WITH RAYLEIGH FADING: A MULTI-ARMED BANDIT FRAMEWORK Wassim Joini and Christophe Moy SUPELEC, IETR, SCEE, Avene de la Bolaie, CS 47601, 5576 Cesson Sévigné, France. INSERM U96 - IFR140-

More information

Elements of Coordinate System Transformations

Elements of Coordinate System Transformations B Elements of Coordinate System Transformations Coordinate system transformation is a powerfl tool for solving many geometrical and kinematic problems that pertain to the design of gear ctting tools and

More information

Math 263 Assignment #3 Solutions. 1. A function z = f(x, y) is called harmonic if it satisfies Laplace s equation:

Math 263 Assignment #3 Solutions. 1. A function z = f(x, y) is called harmonic if it satisfies Laplace s equation: Math 263 Assignment #3 Soltions 1. A fnction z f(x, ) is called harmonic if it satisfies Laplace s eqation: 2 + 2 z 2 0 Determine whether or not the following are harmonic. (a) z x 2 + 2. We se the one-variable

More information

Dynamic Optimization of First-Order Systems via Static Parametric Programming: Application to Electrical Discharge Machining

Dynamic Optimization of First-Order Systems via Static Parametric Programming: Application to Electrical Discharge Machining Dynamic Optimization of First-Order Systems via Static Parametric Programming: Application to Electrical Discharge Machining P. Hgenin*, B. Srinivasan*, F. Altpeter**, R. Longchamp* * Laboratoire d Atomatiqe,

More information

Math 116 First Midterm October 14, 2009

Math 116 First Midterm October 14, 2009 Math 116 First Midterm October 14, 9 Name: EXAM SOLUTIONS Instrctor: Section: 1. Do not open this exam ntil yo are told to do so.. This exam has 1 pages inclding this cover. There are 9 problems. Note

More information

Two-media boundary layer on a flat plate

Two-media boundary layer on a flat plate Two-media bondary layer on a flat plate Nikolay Ilyich Klyev, Asgat Gatyatovich Gimadiev, Yriy Alekseevich Krykov Samara State University, Samara,, Rssia Samara State Aerospace University named after academician

More information

On averaged expected cost control as reliability for 1D ergodic diffusions

On averaged expected cost control as reliability for 1D ergodic diffusions On averaged expected cost control as reliability for 1D ergodic diffsions S.V. Anlova 5 6, H. Mai 7 8, A.Y. Veretennikov 9 10 Mon Nov 27 10:24:42 2017 Abstract For a Markov model described by a one-dimensional

More information

Conditions for Approaching the Origin without Intersecting the x-axis in the Liénard Plane

Conditions for Approaching the Origin without Intersecting the x-axis in the Liénard Plane Filomat 3:2 (27), 376 377 https://doi.org/.2298/fil7276a Pblished by Faclty of Sciences and Mathematics, University of Niš, Serbia Available at: http://www.pmf.ni.ac.rs/filomat Conditions for Approaching

More information

Homotopy Perturbation Method for Solving Linear Boundary Value Problems

Homotopy Perturbation Method for Solving Linear Boundary Value Problems International Jornal of Crrent Engineering and Technolog E-ISSN 2277 4106, P-ISSN 2347 5161 2016 INPRESSCO, All Rights Reserved Available at http://inpressco.com/categor/ijcet Research Article Homotop

More information

ON IMPROVED SOBOLEV EMBEDDING THEOREMS. M. Ledoux University of Toulouse, France

ON IMPROVED SOBOLEV EMBEDDING THEOREMS. M. Ledoux University of Toulouse, France ON IMPROVED SOBOLEV EMBEDDING THEOREMS M. Ledox University of Tolose, France Abstract. We present a direct proof of some recent improved Sobolev ineqalities pt forward by A. Cohen, R. DeVore, P. Petrshev

More information

QUANTILE ESTIMATION IN SUCCESSIVE SAMPLING

QUANTILE ESTIMATION IN SUCCESSIVE SAMPLING Jornal of the Korean Statistical Society 2007, 36: 4, pp 543 556 QUANTILE ESTIMATION IN SUCCESSIVE SAMPLING Hosila P. Singh 1, Ritesh Tailor 2, Sarjinder Singh 3 and Jong-Min Kim 4 Abstract In sccessive

More information

A Model-Free Adaptive Control of Pulsed GTAW

A Model-Free Adaptive Control of Pulsed GTAW A Model-Free Adaptive Control of Plsed GTAW F.L. Lv 1, S.B. Chen 1, and S.W. Dai 1 Institte of Welding Technology, Shanghai Jiao Tong University, Shanghai 00030, P.R. China Department of Atomatic Control,

More information

Approach to a Proof of the Riemann Hypothesis by the Second Mean-Value Theorem of Calculus

Approach to a Proof of the Riemann Hypothesis by the Second Mean-Value Theorem of Calculus Advances in Pre Mathematics, 6, 6, 97- http://www.scirp.org/jornal/apm ISSN Online: 6-384 ISSN Print: 6-368 Approach to a Proof of the Riemann Hypothesis by the Second Mean-Vale Theorem of Calcls Alfred

More information

FREQUENCY DOMAIN FLUTTER SOLUTION TECHNIQUE USING COMPLEX MU-ANALYSIS

FREQUENCY DOMAIN FLUTTER SOLUTION TECHNIQUE USING COMPLEX MU-ANALYSIS 7 TH INTERNATIONAL CONGRESS O THE AERONAUTICAL SCIENCES REQUENCY DOMAIN LUTTER SOLUTION TECHNIQUE USING COMPLEX MU-ANALYSIS Yingsong G, Zhichn Yang Northwestern Polytechnical University, Xi an, P. R. China,

More information

Solution of Stochastic Optimal Control Problems and Financial Applications

Solution of Stochastic Optimal Control Problems and Financial Applications Journal of Mathematical Extension Vol. 11, No. 4, (2017), 27-44 ISSN: 1735-8299 URL: http://www.ijmex.com Solution of Stochastic Optimal Control Problems and Financial Applications 2 Mat B. Kafash 1 Faculty

More information

A generalized Alon-Boppana bound and weak Ramanujan graphs

A generalized Alon-Boppana bound and weak Ramanujan graphs A generalized Alon-Boppana bond and weak Ramanjan graphs Fan Chng Department of Mathematics University of California, San Diego La Jolla, CA, U.S.A. fan@csd.ed Sbmitted: Feb 0, 206; Accepted: Jne 22, 206;

More information

Model reduction of nonlinear systems using incremental system properties

Model reduction of nonlinear systems using incremental system properties Model redction of nonlinear systems sing incremental system properties Bart Besselink ACCESS Linnaes Centre & Department of Atomatic Control KTH Royal Institte of Technology, Stockholm, Sweden L2S, Spelec,

More information

Tendon-Driven Control of Biomechanical and Robotic Systems: A Path Integral Reinforcement Learning Approach.

Tendon-Driven Control of Biomechanical and Robotic Systems: A Path Integral Reinforcement Learning Approach. Tendon-Driven Control of Biomechanical and Robotic Systems: A Path Integral Reinforcement Learning Approach. Eric Rombokas, Evangelos Theodoro, Mark Malhotra, Emo Todorov and Yoky Matsoka Abstract We apply

More information

REINFORCEMENT LEARNING AND OPTIMAL ADAPTIVE CONTROL

REINFORCEMENT LEARNING AND OPTIMAL ADAPTIVE CONTROL Lewis c11.tex V1-10/19/2011 4:10pm Page 461 11 REINFORCEMENT LEARNING AND OPTIMAL ADAPTIVE CONTROL In this book we have presented a variety of methods for the analysis and design of optimal control systems.

More information

MAXIMUM AND ANTI-MAXIMUM PRINCIPLES FOR THE P-LAPLACIAN WITH A NONLINEAR BOUNDARY CONDITION. 1. Introduction. ν = λ u p 2 u.

MAXIMUM AND ANTI-MAXIMUM PRINCIPLES FOR THE P-LAPLACIAN WITH A NONLINEAR BOUNDARY CONDITION. 1. Introduction. ν = λ u p 2 u. 2005-Ojda International Conference on Nonlinear Analysis. Electronic Jornal of Differential Eqations, Conference 14, 2006, pp. 95 107. ISSN: 1072-6691. URL: http://ejde.math.txstate.ed or http://ejde.math.nt.ed

More information

Outline. Model Predictive Control: Current Status and Future Challenges. Separation of the control problem. Separation of the control problem

Outline. Model Predictive Control: Current Status and Future Challenges. Separation of the control problem. Separation of the control problem Otline Model Predictive Control: Crrent Stats and Ftre Challenges James B. Rawlings Department of Chemical and Biological Engineering University of Wisconsin Madison UCLA Control Symposim May, 6 Overview

More information

Second-Order Wave Equation

Second-Order Wave Equation Second-Order Wave Eqation A. Salih Department of Aerospace Engineering Indian Institte of Space Science and Technology, Thirvananthapram 3 December 016 1 Introdction The classical wave eqation is a second-order

More information

DYNAMICAL LOWER BOUNDS FOR 1D DIRAC OPERATORS. 1. Introduction We consider discrete, resp. continuous, Dirac operators

DYNAMICAL LOWER BOUNDS FOR 1D DIRAC OPERATORS. 1. Introduction We consider discrete, resp. continuous, Dirac operators DYNAMICAL LOWER BOUNDS FOR D DIRAC OPERATORS ROBERTO A. PRADO AND CÉSAR R. DE OLIVEIRA Abstract. Qantm dynamical lower bonds for continos and discrete one-dimensional Dirac operators are established in

More information

KdV PRESERVES WHITE NOISE

KdV PRESERVES WHITE NOISE KdV PESEVES WHIE NOISE JEEMY QUASEL AND BENEDEK VALKÓ Abstract. It is shown that white noise is an invariant measre for the Korteweg-deVries eqation on. his is a conseqence of recent reslts of Kappeler

More information

Approximate Solution of Convection- Diffusion Equation by the Homotopy Perturbation Method

Approximate Solution of Convection- Diffusion Equation by the Homotopy Perturbation Method Gen. Math. Notes, Vol. 1, No., December 1, pp. 18-114 ISSN 19-7184; Copyright ICSRS Pblication, 1 www.i-csrs.org Available free online at http://www.geman.in Approximate Soltion of Convection- Diffsion

More information

Bridging the Gap Between Multigrid, Hierarchical, and Receding-Horizon Control

Bridging the Gap Between Multigrid, Hierarchical, and Receding-Horizon Control Bridging the Gap Between Mltigrid, Hierarchical, and Receding-Horizon Control Victor M. Zavala Mathematics and Compter Science Division Argonne National Laboratory Argonne, IL 60439 USA (e-mail: vzavala@mcs.anl.gov).

More information

Formal Methods for Deriving Element Equations

Formal Methods for Deriving Element Equations Formal Methods for Deriving Element Eqations And the importance of Shape Fnctions Formal Methods In previos lectres we obtained a bar element s stiffness eqations sing the Direct Method to obtain eact

More information

Effects Of Symmetry On The Structural Controllability Of Neural Networks: A Perspective

Effects Of Symmetry On The Structural Controllability Of Neural Networks: A Perspective 16 American Control Conference (ACC) Boston Marriott Copley Place Jly 6-8, 16. Boston, MA, USA Effects Of Symmetry On The Strctral Controllability Of Neral Networks: A Perspective Andrew J. Whalen 1, Sean

More information

On a class of topological groups. Key Words: topological groups, linearly topologized groups. Contents. 1 Introduction 25

On a class of topological groups. Key Words: topological groups, linearly topologized groups. Contents. 1 Introduction 25 Bol. Soc. Paran. Mat. (3s.) v. 24 1-2 (2006): 25 34. c SPM ISNN-00378712 On a class of topological grops Dinamérico P. Pombo Jr. abstract: A topological grop is an SNS-grop if its identity element possesses

More information

Subcritical bifurcation to innitely many rotating waves. Arnd Scheel. Freie Universitat Berlin. Arnimallee Berlin, Germany

Subcritical bifurcation to innitely many rotating waves. Arnd Scheel. Freie Universitat Berlin. Arnimallee Berlin, Germany Sbcritical bifrcation to innitely many rotating waves Arnd Scheel Institt fr Mathematik I Freie Universitat Berlin Arnimallee 2-6 14195 Berlin, Germany 1 Abstract We consider the eqation 00 + 1 r 0 k2

More information

Lecture: Corporate Income Tax - Unlevered firms

Lecture: Corporate Income Tax - Unlevered firms Lectre: Corporate Income Tax - Unlevered firms Ltz Krschwitz & Andreas Löffler Disconted Cash Flow, Section 2.1, Otline 2.1 Unlevered firms Similar companies Notation 2.1.1 Valation eqation 2.1.2 Weak

More information

Classify by number of ports and examine the possible structures that result. Using only one-port elements, no more than two elements can be assembled.

Classify by number of ports and examine the possible structures that result. Using only one-port elements, no more than two elements can be assembled. Jnction elements in network models. Classify by nmber of ports and examine the possible strctres that reslt. Using only one-port elements, no more than two elements can be assembled. Combining two two-ports

More information

STURM-LIOUVILLE PROBLEMS

STURM-LIOUVILLE PROBLEMS STURM-LIOUVILLE PROBLEMS ANTON ZETTL Mathematics Department, Northern Illinois University, DeKalb, Illinois 60115. Dedicated to the memory of John Barrett. ABSTRACT. Reglar and singlar Strm-Lioville problems

More information

Information Source Detection in the SIR Model: A Sample Path Based Approach

Information Source Detection in the SIR Model: A Sample Path Based Approach Information Sorce Detection in the SIR Model: A Sample Path Based Approach Kai Zh and Lei Ying School of Electrical, Compter and Energy Engineering Arizona State University Tempe, AZ, United States, 85287

More information

Optimal Mean-Variance Portfolio Selection

Optimal Mean-Variance Portfolio Selection Math. Financ. Econ. Vol. 11, No. 2, 2017, 137 160 Research Report No. 14, 2013, Probab. Statist. Grop Manchester 26 pp Optimal Mean-Variance Portfolio Selection J. L. Pedersen & G. Peskir Assming that

More information

Nonlinear parametric optimization using cylindrical algebraic decomposition

Nonlinear parametric optimization using cylindrical algebraic decomposition Proceedings of the 44th IEEE Conference on Decision and Control, and the Eropean Control Conference 2005 Seville, Spain, December 12-15, 2005 TC08.5 Nonlinear parametric optimization sing cylindrical algebraic

More information

1 Undiscounted Problem (Deterministic)

1 Undiscounted Problem (Deterministic) Lectre 9: Linear Qadratic Control Problems 1 Undisconted Problem (Deterministic) Choose ( t ) 0 to Minimize (x trx t + tq t ) t=0 sbject to x t+1 = Ax t + B t, x 0 given. x t is an n-vector state, t a

More information

Section 7.4: Integration of Rational Functions by Partial Fractions

Section 7.4: Integration of Rational Functions by Partial Fractions Section 7.4: Integration of Rational Fnctions by Partial Fractions This is abot as complicated as it gets. The Method of Partial Fractions Ecept for a few very special cases, crrently we have no way to

More information

Feature Selection for Neuro-Dynamic Programming

Feature Selection for Neuro-Dynamic Programming Featre Selection for Nero-Dynamic Programming Day Hang Wei Chen Prashant Mehta Sean Meyn Amit Srana October 4, 211 Abstract Nero-Dynamic Programming encompasses techniqes from both reinforcement learning

More information

Network Coding for Multiple Unicasts: An Approach based on Linear Optimization

Network Coding for Multiple Unicasts: An Approach based on Linear Optimization Network Coding for Mltiple Unicasts: An Approach based on Linear Optimization Danail Traskov, Niranjan Ratnakar, Desmond S. Ln, Ralf Koetter, and Mriel Médard Abstract In this paper we consider the application

More information

An efficient approach to stochastic optimal control. Bert Kappen SNN Radboud University Nijmegen the Netherlands

An efficient approach to stochastic optimal control. Bert Kappen SNN Radboud University Nijmegen the Netherlands An efficient approach to stochastic optimal control Bert Kappen SNN Radboud University Nijmegen the Netherlands Bert Kappen Examples of control tasks Motor control Bert Kappen Pascal workshop, 27-29 May

More information

A family of wave equations with some remarkable properties arxiv: v4 [math-ph] 14 Feb 2018

A family of wave equations with some remarkable properties arxiv: v4 [math-ph] 14 Feb 2018 A family of wave eqations with some remarkable properties arxiv:1602.06525v4 [math-ph] 14 Feb 2018 Priscila Leal da Silva 1, Igor Leite Freire 2 and Júlio Cesar Santos Sampaio 2 1 Departamento de Matemática,

More information

Path Integral Stochastic Optimal Control for Reinforcement Learning

Path Integral Stochastic Optimal Control for Reinforcement Learning Preprint August 3, 204 The st Multidisciplinary Conference on Reinforcement Learning and Decision Making RLDM203 Path Integral Stochastic Optimal Control for Reinforcement Learning Farbod Farshidian Institute

More information

Polymer confined between two surfaces

Polymer confined between two surfaces Appendix 4.A 15 Polymer confined between two srfaces In this appendix we present in detail the calclations of the partition fnction of a polymer confined between srfaces with hard wall bondary conditions.

More information

Asymptotics of dissipative nonlinear evolution equations with ellipticity: different end states

Asymptotics of dissipative nonlinear evolution equations with ellipticity: different end states J. Math. Anal. Appl. 33 5) 5 35 www.elsevier.com/locate/jmaa Asymptotics of dissipative nonlinear evoltion eqations with ellipticity: different end states enjn Dan, Changjiang Zh Laboratory of Nonlinear

More information

Discussion Papers Department of Economics University of Copenhagen

Discussion Papers Department of Economics University of Copenhagen Discssion Papers Department of Economics University of Copenhagen No. 10-06 Discssion of The Forward Search: Theory and Data Analysis, by Anthony C. Atkinson, Marco Riani, and Andrea Ceroli Søren Johansen,

More information

Lecture: Corporate Income Tax

Lecture: Corporate Income Tax Lectre: Corporate Income Tax Ltz Krschwitz & Andreas Löffler Disconted Cash Flow, Section 2.1, Otline 2.1 Unlevered firms Similar companies Notation 2.1.1 Valation eqation 2.1.2 Weak atoregressive cash

More information

Quickest detection of drift change for Brownian motion in generalized Bayesian and minimax settings

Quickest detection of drift change for Brownian motion in generalized Bayesian and minimax settings Statistics & Decisions 24, 445 47 (26) / DOI 1.1524/stnd.26.24.4.445 c Oldenorg Wissenschaftsverlag, München 26 Qickest detection of drift change for Brownian motion in generalized Bayesian and minima

More information

J.A. BURNS AND B.B. KING redced order controllers sensors/actators. The kernels of these integral representations are called fnctional gains. In [4],

J.A. BURNS AND B.B. KING redced order controllers sensors/actators. The kernels of these integral representations are called fnctional gains. In [4], Jornal of Mathematical Systems, Estimation, Control Vol. 8, No. 2, 1998, pp. 1{12 c 1998 Birkhaser-Boston A Note on the Mathematical Modelling of Damped Second Order Systems John A. Brns y Belinda B. King

More information

Research Article Permanence of a Discrete Predator-Prey Systems with Beddington-DeAngelis Functional Response and Feedback Controls

Research Article Permanence of a Discrete Predator-Prey Systems with Beddington-DeAngelis Functional Response and Feedback Controls Hindawi Pblishing Corporation Discrete Dynamics in Natre and Society Volme 2008 Article ID 149267 8 pages doi:101155/2008/149267 Research Article Permanence of a Discrete Predator-Prey Systems with Beddington-DeAngelis

More information

The Heat Equation and the Li-Yau Harnack Inequality

The Heat Equation and the Li-Yau Harnack Inequality The Heat Eqation and the Li-Ya Harnack Ineqality Blake Hartley VIGRE Research Paper Abstract In this paper, we develop the necessary mathematics for nderstanding the Li-Ya Harnack ineqality. We begin with

More information

Optimal Control, Statistics and Path Planning

Optimal Control, Statistics and Path Planning PERGAMON Mathematical and Compter Modelling 33 (21) 237 253 www.elsevier.nl/locate/mcm Optimal Control, Statistics and Path Planning C. F. Martin and Shan Sn Department of Mathematics and Statistics Texas

More information

ρ u = u. (1) w z will become certain time, and at a certain point in space, the value of

ρ u = u. (1) w z will become certain time, and at a certain point in space, the value of THE CONDITIONS NECESSARY FOR DISCONTINUOUS MOTION IN GASES G I Taylor Proceedings of the Royal Society A vol LXXXIV (90) pp 37-377 The possibility of the propagation of a srface of discontinity in a gas

More information

Gradient Projection Anti-windup Scheme on Constrained Planar LTI Systems. Justin Teo and Jonathan P. How

Gradient Projection Anti-windup Scheme on Constrained Planar LTI Systems. Justin Teo and Jonathan P. How 1 Gradient Projection Anti-windp Scheme on Constrained Planar LTI Systems Jstin Teo and Jonathan P. How Technical Report ACL1 1 Aerospace Controls Laboratory Department of Aeronatics and Astronatics Massachsetts

More information

STEP Support Programme. STEP III Hyperbolic Functions: Solutions

STEP Support Programme. STEP III Hyperbolic Functions: Solutions STEP Spport Programme STEP III Hyperbolic Fnctions: Soltions Start by sing the sbstittion t cosh x. This gives: sinh x cosh a cosh x cosh a sinh x t sinh x dt t dt t + ln t ln t + ln cosh a ln ln cosh

More information

A generalized Alon-Boppana bound and weak Ramanujan graphs

A generalized Alon-Boppana bound and weak Ramanujan graphs A generalized Alon-Boppana bond and weak Ramanjan graphs Fan Chng Abstract A basic eigenvale bond de to Alon and Boppana holds only for reglar graphs. In this paper we give a generalized Alon-Boppana bond

More information

Simplified Identification Scheme for Structures on a Flexible Base

Simplified Identification Scheme for Structures on a Flexible Base Simplified Identification Scheme for Strctres on a Flexible Base L.M. Star California State University, Long Beach G. Mylonais University of Patras, Greece J.P. Stewart University of California, Los Angeles

More information

Using Lagrangian relaxation in optimisation of unit commitment and planning- Part2

Using Lagrangian relaxation in optimisation of unit commitment and planning- Part2 Faklteta za Elektrotehniko Eva horin, Heike Brand, Christoph Weber Using Lagrangian relaxation in optimisation of nit commitment and planning- art2 OSCOGEN Discssion aper No. Contract No. ENK5-C-2-94 roject

More information

OPTIMUM EXPRESSION FOR COMPUTATION OF THE GRAVITY FIELD OF A POLYHEDRAL BODY WITH LINEARLY INCREASING DENSITY 1

OPTIMUM EXPRESSION FOR COMPUTATION OF THE GRAVITY FIELD OF A POLYHEDRAL BODY WITH LINEARLY INCREASING DENSITY 1 OPTIMUM EXPRESSION FOR COMPUTATION OF THE GRAVITY FIEL OF A POLYHERAL BOY WITH LINEARLY INCREASING ENSITY 1 V. POHÁNKA2 Abstract The formla for the comptation of the gravity field of a polyhedral body

More information

Affine Invariant Total Variation Models

Affine Invariant Total Variation Models Affine Invariant Total Variation Models Helen Balinsky, Alexander Balinsky Media Technologies aboratory HP aboratories Bristol HP-7-94 Jne 6, 7* Total Variation, affine restoration, Sobolev ineqality,

More information