Finite Horizon Risk Sensitive MDP and Linear Programming

Size: px
Start display at page:

Download "Finite Horizon Risk Sensitive MDP and Linear Programming"

Transcription

1 Finite Horizon Risk Sensitive MDP nd Liner Progrmming Atul Kumr, Veerrun Kvith nd N. Hemchndr IEOR, Indin Institute of Technology Bomby, Indi Abstrct In the context of stndrd Mrkov decision processes (MDPs), the connection between Dynmic Progrm (DP) nd Liner Progrm (LP) is well understood nd is well estblished under sufficiently generl conditions. LP bsed pproch fcilittes solving the constrined MDPs. Multiplictive or Risk sensitive MDPs, introduced to control the fluctutions/vritions round the expected vlue, re reltively less studied objects. DP equtions re considerbly well understood even in the context of Risk MDPs, however the LP connection is not known. We consider finite horizon risk MDP problem nd estblish the connections between the DP nd LP pproches. We ugment the stte spce with suitble component, to obtin the optiml policies for constrined risk MDPs. We pply this results to server selection problem in Ber/M/K/K queues, with constrint on the utiliztion of the fst server. We discuss some interesting structurl properties of the risk optiml policies. I. INTRODUCTION Mrkov Decision Process (MDP) is mthemticl frmework used to solve the problem of sequentil decision mking in stochstic situtions ([6], [3], [], [4] etc). The im of MDP is to find n optiml policy for the decision mkers. A policy is sequence of decisions one for ech time slot, possibly depending upon the (current stte or history of ll sttes) stte of the system. MDP considers running cost t every time step, depending upon the stte nd the ction tken t tht time step, nd obtins n optiml policy tht optimizes the expected vlue of the sum (integrl in cse of continuous time problems) of the running costs over ll the time slots under considertion. In cse of finite horizon problems, it lso considers terminl cost. There cn be three vrieties of MDP problems bsed on the time horizon for which the problem spns. It is finite time horizon problem if the sum cost is considered for finite durtion. In cse of infinite time horizon problems, vrints like discounted cost, verge cost nd totl cost re considered. The focus of this pper is on finite horizon problems. In mny scenrios, the gents re interested not just in the verge cost. But some gents would like to reduce the risk on most of the smple pths. Worst cse nlysis dels with n extreme cse in this direction. While risk sensitive frmework offers vrying rnges of importnce to smple pths nd verge vlue s controlled by prmeter. Depending upon the prmeter, clled risk prmeter, it provides importnce to higher moments of the sum cost. In ll, while verge cost/liner MDPs re concerned bout first moment of the sum cost, risk sensitive MDPs incorporte higher moments of the cost, to control the vribility/fluctutions bout the expected vlue. The liner MPDs re lso viewed s risk neutrl MDPs. The liner MDP is well studied topic nd mny solutions pproches re known. Dynmic progrmming (DP), Liner progrmming (LP), Vlue itertion re some of them ([6], [3], [], [4] etc). DP obtins the vlue function, the optiml cost to go till termintion from ny time nd ny stte, using bckwrd induction. Alterntively vlue functions cn be obtined using solution of n pproprite Liner Progrm (LP). The dul LP directly provides the optiml policy (e.g., [6], []). Reltively risk sensitive MDPs re studied to limited extent. Nevertheless, dynmic progrmming pproch cn be pplied even in the context of finite horizon risk MDP ([5]). To the best of our knowledge, the connection between risk sensitive MDPs nd n pproprite liner progrm is not yet estblished nd this is the min focus of the pper. This connection does not solve the dimensionlity problem. Nevertheless, the dvent of fst LP solvers mkes this very ttrctive lterntive. Further nd more importntly one cn incorporte constrints in the MDP frmework. Our LP connection thus provides computtionl methods to solve such constrined risk sensitive MDPs. Avilbility of resources cn be cptured s suitble constrints nd hence solutions to constrined MDPs re importnt. The work on risk sensitive control is vst nd vried nd we give smple of some of the strnds. The pioneering work is by Howrd nd Mthieson [7]. The bckwrd recursion dynmic progrmming equtions in finite horizon setting re of multiplictive type nd lgorithms to compute optiml polices in this model re known. In generl, the optiml policies in infinite horizon setting tend to be non sttionry, [8], etc. Some ppers identify suitble sufficient conditions tht ensure tht the optiml polices re sttionry nd lso develop lgorithms to compute the sme ([]) or pproximte optiml policies. Other ppers lso explore the reltions between robust MDPs nd risk sensitive MDPs [9]. We develop LP bsed lgorithm tht computes optiml risk sensitive polices for constrined risk MDPs. Nottion: The bold letters represent the vectors, e.g.,

2 y = {y(t, x, )} t,x, represents fesible vector of dul LP (9), given below. While x t n represents the vector x t n = [x n,, x t ]. The rndom vribles re represented by cpitl letters, while their reliztion by the corresponding smll letters. When required to specify the time index, subscript of the time index is used. When not required it is voided. For exmple, x represents reliztion of rndom vrible X t for ny t. If it required to represent reliztion of the pir of rndom vribles X t, X n, then we use x t, x n. The reliztions for rndom vribles of subsequent time slots, like X t, X t+, re represented by (x, x ). II. RISK SENSITIVE MDP FRAMEWORK Risk sensitive MDP, s in the cse of liner MDP, consists of set X of ll possible sttes, set A of ll possible ctions nd n immedite rewrd function r t : X A R for ech time slot t. The terminl cost r T depends only upon the stte x X. The stte, ction spces X, A do not depend the time slot t, tht is we consider the sme set for ll the time indices. It is further chrcterized by trnsition function p : X A X, which defines the ction dependent stte trnsitions. Here p(x x, ) gives the probbility of the stte trnsition from x to x, when ction is chosen. We consider finite horizon problem nd let {X t } t T, {A t } t T respectively represent the trjectories of the stte nd the ction processes. The terminl cost, cost in finl time slot T, depends only upon the (finl) stte. A policy t = (π t, π t+ π T ) is sequence of stte dependent nd possibly rndomized ctions, given for time slots between t nd T. Given policy t, the stte, ction pir evolve rndomly over the time slots t < n < T, with trnsitions given by the following lws: qn t (x, x, ) =P (X n = x, A n = X n = x, A n = ) = π n(x, )p(x x, ) where p(x x, ) = P (X n = x X n = x, A n = ) nd π n(x, ) = P (A n = X n = x ). () The bove evolution further depends upon the initil condition, i.e., the initiliztion of the stting point X t. Let E x,t represent the expecttion opertor with initil condition X t = x nd when the policy t is used. Let E α,t represent the sme expecttion opertor when the initil condition is distributed ccording to α, written s X t α. Here α(x) = P (X t = x). We re interested in optimizing the following risk sensitive objective: J t(α, t ) = γ log ( J t(α, t ) ) where J t(α, t ) = E α,t [ e γ T n=t rn(xn,an)+r T (X T ) ]. () The bove eqution represents the cost to go from time slot t to T under the policy t, with X t α. The vlue function, function of (x, t), is defined s the optiml vlue of the bove risk sensitive objective given the initil condition X t = x: V t (x) := min t D t Jt (x, t ) for ny x X, (3) where D t represents the spce of policies t. Dynmic Progrmming We re interested in the optiml policy = (we discrd in superscript when it strts from ) tht optimizes the risk cost J (x, ), or equivlently policy tht chieves the vlue function, i.e., such tht V (x) = J (x, ) for ll x X. Dynmic progrmming (DP) is well known technique, tht provides solution to such control problems, nd DP equtions re given by bckwrd induction s below for ny x X (see [5]): V T (x) = r T (x), nd for ny t T, { [ ] } V t(x) = min r t(x, ) + A γ log p(x x, )e γv t+(x ). We consider the following trnsltion of the vlue function, to simplify the bove set of equtions: u t (x) = e γvt(x) for ll t T, nd x X. Note by monotonicity nd continuity u t for ny t is minimum vlue of J t given in ( ): u t (x) = min t J t (x, t ). The DP equtions cn now be rewritten s: u T (x) = e γr T (x) for ny x X nd (5) { u t(x) = min e } γr t(x,) p(x x, )u t+(x ) for ny t T, nd x X. (6) For ese of nottions, we bsorb γ into the cost functions {r t }. One needs to solve the bove set of equtions to obtin the vlue function: u = {u t (x); t < T, x X }, nd the optimizers in the minimiztion step will provide us the optiml policy ([6], [5] etc). III. LINEAR PROGRAMMING FORMULATION The dynmic progrmming bsed pproch suffers from the curse of dimension. As we increse the number of sttes nd/or time epochs, the complexity of the problem increses significntly. This results in limited pplicbility of dynmic progrmming. In the context of liner MDPs, it is well known fct tht DP problem cn be reformulted s Liner Progrm (LP), under considerble generlity (see for e.g., [6] in the context of infinite horizon problems). However this conversion my

3 C t,πt, = π t(, )e r t(,)... π t(, )e r t(,) , π t(n, )e r t(n,) P = p(, ) p(, )... p(n, ) p(, ) p(, )... p(n, ) p( N, ) p( N, )... p(n N, ) (4) not solve the problem of dimension. But recent improvements in LP solvers mkes it n ttrctive lterntive. Further nd more importntly the LP bsed pproch extends esily nd provides solutions for constrined MDPs. In the coming sections, s in the cse of liner MDP (see for e.g., [6]), we will obtin two relevnt LPs ( priml nd dul LP). The solution of the priml LP will be the trnslted vlue function vector, u, which is the function vlue on the left hnd side (LHS) of the DP eqution (6), t the optimizer(s). On the other hnd the solution of the dul LP will directly provide the optiml policy D of the control problem (3). We begin with introducing some more nottions. Let N be the number of elements of the stte spce nd without loss of generlity let X = {,, N}. Let u t = [u t (), u t (N)] represent n N dimensionl vector indexed by time t, indictive of the possible vlue function for different sttes t time t. And let the combined vector tht includes the vlue function for ll combintions of time slots nd sttes be rewritten s below: u = {u t (x); t < T, x X } = [u, u, u,, u T ]. Define the opertor tht opertes on the combined vector u by: Lu = [L u, L u,, L T u] where L t u := inf π t C t,πt,p u t+ (7) with the mtrices C t,πt,, P re defined using (4), plced t the top of the pge nd u T = {u T (x); x X } is given by eqution (5). The bove opertor is constructed using the right hnd side (RHS) of the DP eqution (6). We now hve the following theorems (whose proofs re in Appendix): Theorem : Any vector u with Lu u (the component wise inequlity), stisfies: u u. Theorem : Any vector u with Lu u, stisfies: u u. Any vector u tht stisfies the constrint, Lu u, i.e., when u t C t,πt,p u t+ for ll t nd π t, (8) by Theorem, is lower thn the vlue function of risk MDP, u. It is trivil to check tht u lso stisfies (8). Thus it is the gretest lower bound mong ll vectors tht stisfy (8). Hence we hve the following LP for priml. Following similr procedure s in ([6]), we chose nonnegtive vector α(x), x X tht stisfies x X α(x) =. Using this one cn obtin n equivlent LP, whose solution equls the vlue function vector u. Priml Liner Progrm mx {{u t(x)} x X,t T } x X α(x)u (x) subject to: u T (x) b x, for ll x,, u t(x) e r t(x,) p(x x, )u t+(x ) for ll, x nd t T with b x, := e r T (x,) p(x x, )e r T (x ). The vector α cn be interpreted s the distribution of initil stte, X. The dul of the bove LP, is given by: Dul Liner Progrm min y={y(t,x,);t T,x X, A} b x, y(t, x, ) x X subject to: y(, x, ) = α(x ) for ll x X, (9) y(t, x, ) = e rt (x,) p(x x, )y(t, x, ) x for ll t T nd x X. ()

4 Below we give series of results connecting the dul LP (9) nd the trnslted risk MDP (6). Some of the proofs nd results of this section hve similr structure s tht given in [6]. However there re significnt chnges due to risk sensitive nture of the cost. A. Fesible region F nd the set of risk policies D: We sy tht vector y is fesible if it stisfies the dul constrints (9), () nd let F represent this fesible region. We first show one to one correspondence between the two spces, F nd D. Theorem 3: (i) For ny policy D of risk MDP, there exists vector y which stisfies ll the constrints of dul LP (9), i.e., y F. The fesible vector is given by the eqution (see ()): y (, x, ) = α(x )π (x, ) for ll x X, A, y (t, x t, t) = α(x t )e n= rn(xn,n) t n= qn (x n, n x n, n ) t,s t In the bove we define for ll x t X, t A, nd t < T. () q π (x, x, ) := π (x, ). (ii) Given vector y F, define policy y using the following rule: π y,t(x, ) := y(t, x, ) for ll x X, nd A. () y(t, x, ) The vector y y defined by eqution () of point (i) is gin in fesible region nd equls y. Proof: The proof is provided in Appendix. B. Expecttion t optiml policy: To obtin the connection between risk MDP problem nd the dul LP, one needs to study the connection between the risk sensitive cost for given policy nd the dul objective function t the fesible point y, defined using. We lso require similr connecting between the fesible point y nd the corresponding policy y. Further, we would like to solve constrined risk sensitive MDP problems (in section IV). The constrints usully bound the expected vlue of some function of the stte, ction rndom trjectories. In ll, we require the expression for the expected vlue of given function, in terms of the dul vrible y F. As first step, we hve the following, with proof in Appendix: Lemm : Let X α. For ny fesible point y of dul LP, integrble function f nd t < T y(t, x, )f(x, ) = E [ α,y Ψ t f(x t, A t ) ] with x, Ψ t := e t n= rn(xn,an). (3) Further, for ny integrble function f of the lst two sttes X T, X T nd the finl ction A T, we hve: x,,x y(t, x, )p(x x, )f(x,, x ) = E α,y [ Ψ T f(x T, A T, X T ) ]. (4) We hve the sme result when we replce y, y with y, respectively, following exctly similr steps s in the previous theorem. Lemm : For ny policy D of risk MDP nd for ny integrble function f, we hve: x,,x y (T, x, )p(x x, )f(x,, x ) = E α, [ Ψ T f(x T, A T, X T ) ]. (5) If we use the bove theorem with the function, f(x,, x ) = e r T (x,) e r T (x ), we obtin the following for ny (y, y ): y(t, x, ) p(x x, )e r T (x,) e r T (x ) x x, = E α,y [ e T n= rn(xn,an) e r T (X T ) ]. Note tht the LHS is the dul objective (9) t point y nd RHS is the risk cost t policy y. This is the bsic element in proving the equivlence of optiml policies nd optiml dul solutions, given below. C. Optiml policies nd the dul solutions The following theorem shows the reltion between the two optimizers (proof in Appendix). Theorem 4: () If y is n optiml solution of the dul LP, then y defined by () is n optiml policy for risk MDP. (b) If is n optiml policy for risk MDP, then y is n optiml solution of the dul LP. IV. CONSTRAINED RISK MDP We now consider constrined MDP problem, with n dditionl constrint s given below: Subject to: t min J (α, ) (6) E α, [f t (X t, A t )] B, for some set of integrble functions {f t }, initil distribution α nd bound B. The eqution (3) of Lemm could hve been useful in obtining the expecttion defining the constrint, but for the extr fctor Ψ t, s seen from the RHS of the eqution (3). We propose to dd Ψ t s dditionl stte component to the originl Mrkov chin {X t } to tckle this problem. We consider two component Mrkov chin {(X t, Ψ t )} nd

5 the corresponding probbility trnsition mtrix depends explicitly upon time index s below: p t+ (x, ψ t+ x, ψ t, ) = {ψ t+ =ψ te r t (x,) }p(x x, ). With the introduction of the new stte component, for ny dul LP fesible point y we hve: y(t, x, ψ t, ) ψ t f(x, ) = E α,y [f(x t, A t)]. (7) x,ψ t, Thus one cn obtin optiml policy of constrined risk MDP (6) by considering n dditionl stte component nd by dding n extr constrint to the dul LP (9) s below: min [ ] e r T (x,) p(x x, )e r T (x ) y(t, x, ) x (8) subject to: y(t, x, ) = ψ t y(t, x, ψ t, ) for ll t y(, x, ψ, ) = α(x) {ψ =} for ll x, ψ y(t, x, ψ t, ) =,x,ψ t e rt (x,) p(x, ψ t x, ψ t, )y(t, x, ψ t, ) t x,ψ t, for ll t T nd x, ψ t nd y(t, x, ψ t, )ψ tf t(x, ) B. We would like to mention here tht ψ = is lwys initilized to one, Ψ cn tke t mximum X A vlues while Ψ t for ny t cn tke t mximum X t A t possible vlues. There will lso be considerble deletions if the concerned mpping ( t, x t ) e t n= rn(xn,n) is not one-one. One needs to consider this time dependent stte spce while solving the dul LP given bove nd we omit the discussion of these obvious detils. V. APPLICATIONS In [], we pplied LP bsed pproch to solve constrined risk sensitive cost tht rises nturlly in the context of Dely Sensitive Networks (DTNs). The probbility of messge delivery filure with exponentilly distributed contcts turns out to hve risk sensitive form. The direct solution to the power constrined problem works significntly superior in comprison with the solution obtined for model with soft constrints. In this pper we consider nother exmple, which investigtes the effect of risk sensitive cost on optiml policy. As the risk fctor increses, the optiml policies re no more monotone. A. Queueing with losses We consider queueing system with two possible service options. The fst service fcility offers service t rte µ nd is expensive, while the service rte of the slower one is µ with µ < µ. The system cn support t mximum N jobs in prllel nd ny job rrivl tht finds ll the N servers busy, leves the system without service. Aim is to utilize the fst service fcility in n optiml mnner which minimizes the totl number of jobs lost in given time horizon, while mintining the utiliztion of the fst service fcility within given limit. We consider queueing system with Bernoulli rrivls. In every time slot (of unit durtion), customer rrives with probbility δ nd there is no rrivl with probbility δ. The job demnds re exponentilly distributed with prmeter µ (prmeter µ ) when served by the fst (slow) server. Let X t represent the number of customers in the system nd let A t be the indictor of the service type used in time slot t. The flg A t = implies fster service fcility is used cross ll the servers, while A t = implies the use of slower service fcility. A customer leves the system fter service completion, in one time slot with probbility Θ At where Θ := e µ, µ := µ {=} + µ {=}. Thus the trnsition probbility mtrix of this controlled Mrkov chin is given by p(x x, ) Θ N + δnθ N ( Θ ) if x = x = N δθ x {x<n} if x = x + ( δ)( Θ ) x if x = ( ) = x δ x Θ x ( Θ ) x x + ( ) x +( δ) x Θ x ( Θ ) x x if < x x else. With G t representing the flg indicting the rrivl of customer in time slot t, the totl number of customers lost in totl of T time slots is given by: T t= {Xt=N}G t nd we re interested in minimizing the corresponding risk sensitive cost for given [ risk prmeter γ J(x, ) = E x, e γ ] T t= {X t =N}G t. Theorem 5: The required risk sensitive cost hs simpler form s below: [ J(x, ) = E x, e β ] T t= {X t =N} with β = ln (δe γ + ( δ)). (9)

6 Proof : Note tht the rrivls in the time slots with X t = N re lost, becuse ll the servers re busy. These rrivls does not chnge the number in the system in the next time slot X t+ nd hence re independent of the system evolution. By conditioning on the Mrkov chin trjectory {X t } T t= nd becuse of the independence just discussed bove, [ we hve: J(x, ) = E x, T ] g δ t= {X t =N} with g δ := E [ e γgt]. Probbility of fst service, t * γ =. γ = π * t (,), γ = π * t (3,), γ = π * (,), γ =. t π * (3,), γ =. t Let β := ln(g δ ), so tht e β = g δ. We would like to optimize the bove risk sensitive cost under the following constrint for given utiliztion bound B: [ T ] E x, {At=}X t B. t= Bsiclly when fst fcility is chosen s option in ny time slot, X t number of servers re using fst fcility nd hence the bove constrint. Numericl nlysis We obtin the optiml policy for the bove queueing bsed control problem, [ min Ex, e β T t= {X t =N}] such tht [ T ] E x, {At=}X t B, t= by solving the corresponding LP (8), where β depends upon the risk prmeter γ s given by Theorem 5. We did most of the coding in Mtlb except for the LP prt. We used AMPL to model the LP nd Gurobi solver to solve the LP. The solution y of the LP provides the optiml policy y s given by eqution (). In Figure, we consider system with 3 servers. We plot the optiml policy for two vlues of γ. The optiml policy with x = (i.e., with no customers in the system) hs no impct s the server(s) re not utilized. With both vlues of γ, the optiml policy with one customer in the system, i.e., with x =, is to switch off the fst serve fcility t ll time slots. But there is big difference for the remining two sttes x =, 3 nd these policies re plotted in the figure. We plot the probbility of fst service, s dictted by optiml policy, with sttes x = nd x = 3 cross the time slots. When γ =., the optiml policies re threshold type. With x = it switches off the fst fcility t 5th time slot, while with x = 3 it switches off t the nd time slot. The risk cost is close to the liner cost with smll vlues of risk fctor γ, the optiml policies re well understood to be threshold type for liner control nd this explins the figure for the cse with γ =.. With γ =, the risk optiml policies re no more threshold. In fct they re not even monotone s seen Time epochs Fig.. Optiml policy with T =, N = 3, µ =.3, µ =. nd B =.5. For x = optiml policy lwys uses slow server. from the Figure. Further, the probbility of fst service is higher t smller sttes (x = ) with smll γ, while the opposite is true when γ =. With lrger importnce to risk cost, the policy is more cutious t the points of high risk, i.e., when x = 3. We estimte the verge number of customers lost, t optiml policy, using Lemm s below: [ ] E x, [N lost ] = E x, {Xt=N}G t t = y (t, x, )ψ t {x=n} δ. t x,,ψ t The verge number lost equls.37 nd.45 respectively t γ =. nd. This is obvious becuse with high risk fctor, the importnce is drifted wy from the verge number lost. We considered mny more numericl exmples nd found similr chrcteriztion of the optiml policies. CONCLUSIONS We consider finite horizon risk MDP problem nd estblish the connections between the DP nd LP pproches. We show tht the solution of the unconstrined risk MDP problem (3) cn be obtined vi the solution of ny one of the two LPs, priml nd dul. The priml solution provides the vlue function while the dul solution directly provides the risk optiml policy. It is not strightforwrd to extend the solution to the constrined risk MDP problem. We ugment the stte spce with suitble component, tht t ny time slot cptures the effect of the risk cost until tht slot. We propose third LP using the ugmented stte spce trnsitions, which provides the solution to the constrined risk MDP problem. We pply the results so obtined, to study the server selection problem in the context of Bernoulli queues with losses. Our im is to minimize the number of customers lost, i.e., returned without service. We consider minimizing the risk version of the cost nd optimize it under

7 fst server utiliztion constrint. The optiml policy is threshold policy when the risk fctors re close to zero. It is well known tht the risk MDP is close to the liner MDP with smll risk fctors nd hence threshold policy is nticipted. However, with lrge risk fctors the risk optiml policy is no longer threshold type. The policies re not even monotone. Further we notice tht the probbility of choosing fst server is higher t lrger sttes. With higher preference to risk cost, the policy emphsizes utiliztion of the fst server t high risk sttes, the lrger sttes. Thus the proposed LPs re useful in obtining the solutions of the constrined/ unconstrined finite horizon risk MDPs. ACKNOWLEDGEMENTS The work towrds this problem originted with Prof. Eitn Altmn s remrk bout finding the connections between LPs nd risk sensitive MDP problems. APPENDIX: PROOFS The proofs of this section follow similr structure s given in [6]. However there re significnt chnges due to risk sensitive nture of the cost. Proof of Theorem : Consider ny vector u stisfying u Lu. Consider ny policy = [π, π,, π T ]. By definition of the opertor L { } u inf C,,π P u () π C,,π P u C,,π P C,,π P u. C,,π P C,,π P C T,T,π T P u T T T = J ( ) with J ( ) := J (, ) J (, ). J (N, ) This is true for ny policy. Thus. () u inf J ( ) = u. Following exctly similr logic one cn show for ll t < T tht u t u t. Proof of Theorem : Let us consider ny vector u which stisfies u Lu. By definition of L: { } u inf C,,π P u () π Consider ny ɛ, by definition of infimum there exists policy π such tht: u C,,π P u ɛ By boundedness of the mtrices (finite sttes nd ctions) involved nd further choosing the policies π, π etc., inductively we obtin the following for ny incresing sequence of {ɛ i } nd with ɛ = ɛ T : u C,,π P C,,π P u ɛ. C,,π P C,,π P C T,T,π T P u T T ɛ T Note in the bove tht, for exmple, ɛ is chosen such tht (for pproprite choice of ɛ ): ɛ C,,π P ɛ + ɛ. Thus s in the proof of the previous theorem, u C,,π P C,,π P C T,T,π T P u T T ɛ T = J ( ) ɛ with = [π, π,, π T ]. Thus u J ( ) ɛ inf J () ɛ = u ɛ. Thus for ny ɛ > one cn chose pproprite incresing sequence of {ɛ i } such tht u u ɛ. Since ɛ > is rbitrry, consider the limit ɛ nd hence u u. Following exctly similr logic one cn show for ll t < T tht u t u t. Proof of Theorem 3: It is esy to see tht the defined point y stisfies the first constrint (9), s by definition for ll x y (, x, ) = α(x )π (x, ) = α(x ). Define t := t n= q n (x n, n x n, n )e t n= rn(xn,n). Note tht t depends upon the vectors t, x t. Using the bove definition, we cn rewrite

8 y (t, x, ) = t x t α(x ) t. To simplify the nottions, we represent the ction stte pir by z t := (x t, t ), for every t. Considering the right hnd side (RHS) of the second constrint (): e r t ( ) p(x t )y (t, ) =(x t, t ) = e r t ( ) p(x t ) = = ( α(x ) t e r t ( ) α(x ) t z t =( t x t ) α(x ) t π t(z t)p(x t ) t e r t ( ) t qt (z t ) = α(x ) t = y (t, z t). t t Hence the point y stisfies the second constrint (). Prt (ii): Consider ny y F, the policy y defined s in () nd then the point y y defined using policy y s in (). Aim is to prove tht y(t, z t ) = y y (t, z t ) for ll t T, x t X, t A. Fix (t, z t ). As in previous proof, define k,t := t n=k q n (z n z n )e t n=k rn(zn) now including the strting time k. Since y F, it stisfies (9) nd by definition () of y we hve: y y (t, z t) = = = =,t α(x ),t,t α(x)er (z ) π y,(z ) α(x)er (z ) y(, z ) ; using () y(, x, ),t er (z ) y(, z ); using (9). Further expnding,t =,t er(z) q (z z ) nd simplifying s before, we reduce one pir of elements ) (z ) in the summtion: y y (t, z t) =,t er (z ) z e r (z ) q (z z )y(, z ) =,t er (z ) π y,(z ) e r (z ) p(x z )y(, z ) z =,t er (z ) π y,(z ) y(, x, ); using () =,t er (z ) y(, z ) y(, x, ) y(, x, ); using () =,t er (z ) y(, z ). Proceeding in similr wy, we reduce one more pir of elements z = (x, ) summtion, tht is: y y (t, z t) = 3,t er (z ) z e r (z ) q (z z )y(, z ) = 3,t er (z ) π y,(z ) e r (z ) p(x z )y(, z ) z = 3,t er (z ) π y,(z ) y(, x, ); using () = 3,t er (z ) y(, z ) y(, x, ) y(, x, ) = 3,t er (z ) y(, z ). Repeting exctly the sme steps, we eliminte ll the terms till nd including (z t ), to obtin the following (note tht t,t = q t (z t )): y y (t, z t ) = q t (z t )e rt (zt ) y(t, ) = π t (z t ) e rt (zt ) p(x t )y(t, ) = t = y(t, z t ). y(t, z t ) y(t, x t, y(t, x t, t); using () t) t This is true for ll t T. Proof of Lemm : By Theorem (3), y(t, z t ) = y X (t, z t ) for ny t < T. Further, using the definition of y X (t, z t ) from (), one cn rewrite left hnd side

9 (LHS) of (3) z t y(t, z t)f(z t) = z t y X (t, z t)f(z t) = z t f(z t) = z t [ = E y t α(x ) qn (z t n z n )e n= rn(zn) n= (f(z t)e t n= rn(zn)) α(x ) e t t qn (z n z n ) n= ] n= rn(xn,an) f(x t, A t) for ny t < T. [8] Corluppi, Stefno P., nd Steven I. Mrcus. Risk-sensitive nd minimx control of discrete-time, finite-stte Mrkov decision processes. Automtic 35. (999): [9] Osogmi, Tkyuki. Robustness nd risk-sensitivity in Mrkov decision processes. Advnces in Neurl Informtion Processing Systems.. [] Borkr, Vivek S., nd Sen P. Meyn. Risk-sensitive optiml control for Mrkov decision processes with monotone cost. Mthemtics of Opertions Reserch 7. (): 9-9. [] Atul Kumr, Veerrun Kvith nd N. Hemchndr, Power Constrined DTNs: Risk MDP-LP Approch. Interntionl Workshop on DD Communictions held in conjunction with WiOpt 5, Mumbi, Indi. One cn get the second result (4) in exctly similr lines. Proof of Theorem (4): We begin with proof of (). Let g(y) represent the dul objective (see (9)), for ny y F, i.e., g(y) := e rt (zt ) T x T [ ] p(x T z T )e r T (x T ) y(t, z T ). x T X Let y be n optiml solution of the dul LP, nd let y be the corresponding policy given by (). By optimlity nd becuse of eqution (5), for ny D: [ E α, e ] T n=t rn(xn,an)+r T (X T ) = g(y ) g(y ) = E α, y [ e T n=t rn(xn,an)+r T (X T ) ], estblishing the required optimlity. Prt (b) cn be proved in similr wy. REFERENCES [] Atul Kumr, Veerrun Kvith nd N. Hemchndr, Finite horizon risk sensitive MDP nd liner progrmming, 5, Technicl report vilble t [] Altmn, Eitn. Constrined Mrkov decision processes. Vol. 7. CRC Press, 999. [3] Bertseks, Dimitri P., nd Dimitri P. Bertseks. Dynmic progrmming nd optiml control. Vol.. No.. Belmont, MA: Athen Scientific, 995. [4] Feinberg, Eugene A., nd Adm Shwrtz, eds. Hndbook of Mrkov decision processes: methods nd pplictions. Boston, MA: Kluwer Acdemic Publishers,. [5] Stefno P Corluppi nd Steven I Mrcus. Risk-sensitive queueing. In Proceedings of the Annul Allerton Conference on Communiction Control nd Computing, volume 35, pges Citeseer, 997. [6] Mrtin L Putermn. Mrkov decision processes: discrete stochstic dynmic progrmming. John Wiley & Sons, 4. [7] Howrd, Ronld A., nd Jmes E. Mtheson. Risk-sensitive Mrkov decision processes. Mngement Science 8.7 (97):

Power Constrained DTNs: Risk MDP-LP Approach

Power Constrained DTNs: Risk MDP-LP Approach Power Constrined DTNs: Risk MDP-LP Approch Atul Kumr tulkr.in@gmil.com IEOR, IIT Bomby, Indi Veerrun Kvith vkvith@iitb.c.in, IEOR, IIT Bomby, Indi N Hemchndr nh@iitb.c.in, IEOR, IIT Bomby, Indi. Abstrct

More information

Administrivia CSE 190: Reinforcement Learning: An Introduction

Administrivia CSE 190: Reinforcement Learning: An Introduction Administrivi CSE 190: Reinforcement Lerning: An Introduction Any emil sent to me bout the course should hve CSE 190 in the subject line! Chpter 4: Dynmic Progrmming Acknowledgment: A good number of these

More information

Reinforcement learning II

Reinforcement learning II CS 1675 Introduction to Mchine Lerning Lecture 26 Reinforcement lerning II Milos Huskrecht milos@cs.pitt.edu 5329 Sennott Squre Reinforcement lerning Bsics: Input x Lerner Output Reinforcement r Critic

More information

We will see what is meant by standard form very shortly

We will see what is meant by standard form very shortly THEOREM: For fesible liner progrm in its stndrd form, the optimum vlue of the objective over its nonempty fesible region is () either unbounded or (b) is chievble t lest t one extreme point of the fesible

More information

Duality # Second iteration for HW problem. Recall our LP example problem we have been working on, in equality form, is given below.

Duality # Second iteration for HW problem. Recall our LP example problem we have been working on, in equality form, is given below. Dulity #. Second itertion for HW problem Recll our LP emple problem we hve been working on, in equlity form, is given below.,,,, 8 m F which, when written in slightly different form, is 8 F Recll tht we

More information

Online Supplements to Performance-Based Contracts for Outpatient Medical Services

Online Supplements to Performance-Based Contracts for Outpatient Medical Services Jing, Png nd Svin: Performnce-bsed Contrcts Article submitted to Mnufcturing & Service Opertions Mngement; mnuscript no. MSOM-11-270.R2 1 Online Supplements to Performnce-Bsed Contrcts for Outptient Medicl

More information

Reinforcement Learning

Reinforcement Learning Reinforcement Lerning Tom Mitchell, Mchine Lerning, chpter 13 Outline Introduction Comprison with inductive lerning Mrkov Decision Processes: the model Optiml policy: The tsk Q Lerning: Q function Algorithm

More information

THE EXISTENCE-UNIQUENESS THEOREM FOR FIRST-ORDER DIFFERENTIAL EQUATIONS.

THE EXISTENCE-UNIQUENESS THEOREM FOR FIRST-ORDER DIFFERENTIAL EQUATIONS. THE EXISTENCE-UNIQUENESS THEOREM FOR FIRST-ORDER DIFFERENTIAL EQUATIONS RADON ROSBOROUGH https://intuitiveexplntionscom/picrd-lindelof-theorem/ This document is proof of the existence-uniqueness theorem

More information

Advanced Calculus: MATH 410 Notes on Integrals and Integrability Professor David Levermore 17 October 2004

Advanced Calculus: MATH 410 Notes on Integrals and Integrability Professor David Levermore 17 October 2004 Advnced Clculus: MATH 410 Notes on Integrls nd Integrbility Professor Dvid Levermore 17 October 2004 1. Definite Integrls In this section we revisit the definite integrl tht you were introduced to when

More information

Continuous Random Variables

Continuous Random Variables STAT/MATH 395 A - PROBABILITY II UW Winter Qurter 217 Néhémy Lim Continuous Rndom Vribles Nottion. The indictor function of set S is rel-vlued function defined by : { 1 if x S 1 S (x) if x S Suppose tht

More information

{ } = E! & $ " k r t +k +1

{ } = E! & $  k r t +k +1 Chpter 4: Dynmic Progrmming Objectives of this chpter: Overview of collection of clssicl solution methods for MDPs known s dynmic progrmming (DP) Show how DP cn be used to compute vlue functions, nd hence,

More information

Using QM for Windows. Using QM for Windows. Using QM for Windows LEARNING OBJECTIVES. Solving Flair Furniture s LP Problem

Using QM for Windows. Using QM for Windows. Using QM for Windows LEARNING OBJECTIVES. Solving Flair Furniture s LP Problem LEARNING OBJECTIVES Vlu%on nd pricing (November 5, 2013) Lecture 11 Liner Progrmming (prt 2) 10/8/16, 2:46 AM Olivier J. de Jong, LL.M., MM., MBA, CFD, CFFA, AA www.olivierdejong.com Solving Flir Furniture

More information

Chapter 4: Dynamic Programming

Chapter 4: Dynamic Programming Chpter 4: Dynmic Progrmming Objectives of this chpter: Overview of collection of clssicl solution methods for MDPs known s dynmic progrmming (DP) Show how DP cn be used to compute vlue functions, nd hence,

More information

SUMMER KNOWHOW STUDY AND LEARNING CENTRE

SUMMER KNOWHOW STUDY AND LEARNING CENTRE SUMMER KNOWHOW STUDY AND LEARNING CENTRE Indices & Logrithms 2 Contents Indices.2 Frctionl Indices.4 Logrithms 6 Exponentil equtions. Simplifying Surds 13 Opertions on Surds..16 Scientific Nottion..18

More information

2D1431 Machine Learning Lab 3: Reinforcement Learning

2D1431 Machine Learning Lab 3: Reinforcement Learning 2D1431 Mchine Lerning Lb 3: Reinforcement Lerning Frnk Hoffmnn modified by Örjn Ekeberg December 7, 2004 1 Introduction In this lb you will lern bout dynmic progrmming nd reinforcement lerning. It is ssumed

More information

Math& 152 Section Integration by Parts

Math& 152 Section Integration by Parts Mth& 5 Section 7. - Integrtion by Prts Integrtion by prts is rule tht trnsforms the integrl of the product of two functions into other (idelly simpler) integrls. Recll from Clculus I tht given two differentible

More information

Lecture 19: Continuous Least Squares Approximation

Lecture 19: Continuous Least Squares Approximation Lecture 19: Continuous Lest Squres Approximtion 33 Continuous lest squres pproximtion We begn 31 with the problem of pproximting some f C[, b] with polynomil p P n t the discrete points x, x 1,, x m for

More information

Module 6 Value Iteration. CS 886 Sequential Decision Making and Reinforcement Learning University of Waterloo

Module 6 Value Iteration. CS 886 Sequential Decision Making and Reinforcement Learning University of Waterloo Module 6 Vlue Itertion CS 886 Sequentil Decision Mking nd Reinforcement Lerning University of Wterloo Mrkov Decision Process Definition Set of sttes: S Set of ctions (i.e., decisions): A Trnsition model:

More information

Monte Carlo method in solving numerical integration and differential equation

Monte Carlo method in solving numerical integration and differential equation Monte Crlo method in solving numericl integrtion nd differentil eqution Ye Jin Chemistry Deprtment Duke University yj66@duke.edu Abstrct: Monte Crlo method is commonly used in rel physics problem. The

More information

Bellman Optimality Equation for V*

Bellman Optimality Equation for V* Bellmn Optimlity Eqution for V* The vlue of stte under n optiml policy must equl the expected return for the best ction from tht stte: V (s) mx Q (s,) A(s) mx A(s) mx A(s) Er t 1 V (s t 1 ) s t s, t s

More information

Lecture 1. Functional series. Pointwise and uniform convergence.

Lecture 1. Functional series. Pointwise and uniform convergence. 1 Introduction. Lecture 1. Functionl series. Pointwise nd uniform convergence. In this course we study mongst other things Fourier series. The Fourier series for periodic function f(x) with period 2π is

More information

New Expansion and Infinite Series

New Expansion and Infinite Series Interntionl Mthemticl Forum, Vol. 9, 204, no. 22, 06-073 HIKARI Ltd, www.m-hikri.com http://dx.doi.org/0.2988/imf.204.4502 New Expnsion nd Infinite Series Diyun Zhng College of Computer Nnjing University

More information

Numerical integration

Numerical integration 2 Numericl integrtion This is pge i Printer: Opque this 2. Introduction Numericl integrtion is problem tht is prt of mny problems in the economics nd econometrics literture. The orgniztion of this chpter

More information

Acceptance Sampling by Attributes

Acceptance Sampling by Attributes Introduction Acceptnce Smpling by Attributes Acceptnce smpling is concerned with inspection nd decision mking regrding products. Three spects of smpling re importnt: o Involves rndom smpling of n entire

More information

NUMERICAL INTEGRATION. The inverse process to differentiation in calculus is integration. Mathematically, integration is represented by.

NUMERICAL INTEGRATION. The inverse process to differentiation in calculus is integration. Mathematically, integration is represented by. NUMERICAL INTEGRATION 1 Introduction The inverse process to differentition in clculus is integrtion. Mthemticlly, integrtion is represented by f(x) dx which stnds for the integrl of the function f(x) with

More information

Improper Integrals, and Differential Equations

Improper Integrals, and Differential Equations Improper Integrls, nd Differentil Equtions October 22, 204 5.3 Improper Integrls Previously, we discussed how integrls correspond to res. More specificlly, we sid tht for function f(x), the region creted

More information

Chapter 5 : Continuous Random Variables

Chapter 5 : Continuous Random Variables STAT/MATH 395 A - PROBABILITY II UW Winter Qurter 216 Néhémy Lim Chpter 5 : Continuous Rndom Vribles Nottions. N {, 1, 2,...}, set of nturl numbers (i.e. ll nonnegtive integers); N {1, 2,...}, set of ll

More information

Chapter 14. Matrix Representations of Linear Transformations

Chapter 14. Matrix Representations of Linear Transformations Chpter 4 Mtrix Representtions of Liner Trnsformtions When considering the Het Stte Evolution, we found tht we could describe this process using multipliction by mtrix. This ws nice becuse computers cn

More information

13: Diffusion in 2 Energy Groups

13: Diffusion in 2 Energy Groups 3: Diffusion in Energy Groups B. Rouben McMster University Course EP 4D3/6D3 Nucler Rector Anlysis (Rector Physics) 5 Sept.-Dec. 5 September Contents We study the diffusion eqution in two energy groups

More information

ODE: Existence and Uniqueness of a Solution

ODE: Existence and Uniqueness of a Solution Mth 22 Fll 213 Jerry Kzdn ODE: Existence nd Uniqueness of Solution The Fundmentl Theorem of Clculus tells us how to solve the ordinry differentil eqution (ODE) du = f(t) dt with initil condition u() =

More information

221B Lecture Notes WKB Method

221B Lecture Notes WKB Method Clssicl Limit B Lecture Notes WKB Method Hmilton Jcobi Eqution We strt from the Schrödinger eqution for single prticle in potentil i h t ψ x, t = [ ] h m + V x ψ x, t. We cn rewrite this eqution by using

More information

p-adic Egyptian Fractions

p-adic Egyptian Fractions p-adic Egyptin Frctions Contents 1 Introduction 1 2 Trditionl Egyptin Frctions nd Greedy Algorithm 2 3 Set-up 3 4 p-greedy Algorithm 5 5 p-egyptin Trditionl 10 6 Conclusion 1 Introduction An Egyptin frction

More information

Math 1B, lecture 4: Error bounds for numerical methods

Math 1B, lecture 4: Error bounds for numerical methods Mth B, lecture 4: Error bounds for numericl methods Nthn Pflueger 4 September 0 Introduction The five numericl methods descried in the previous lecture ll operte by the sme principle: they pproximte the

More information

7.2 The Definite Integral

7.2 The Definite Integral 7.2 The Definite Integrl the definite integrl In the previous section, it ws found tht if function f is continuous nd nonnegtive, then the re under the grph of f on [, b] is given by F (b) F (), where

More information

1 Online Learning and Regret Minimization

1 Online Learning and Regret Minimization 2.997 Decision-Mking in Lrge-Scle Systems My 10 MIT, Spring 2004 Hndout #29 Lecture Note 24 1 Online Lerning nd Regret Minimiztion In this lecture, we consider the problem of sequentil decision mking in

More information

The Regulated and Riemann Integrals

The Regulated and Riemann Integrals Chpter 1 The Regulted nd Riemnn Integrls 1.1 Introduction We will consider severl different pproches to defining the definite integrl f(x) dx of function f(x). These definitions will ll ssign the sme vlue

More information

1.2. Linear Variable Coefficient Equations. y + b "! = a y + b " Remark: The case b = 0 and a non-constant can be solved with the same idea as above.

1.2. Linear Variable Coefficient Equations. y + b ! = a y + b  Remark: The case b = 0 and a non-constant can be solved with the same idea as above. 1 12 Liner Vrible Coefficient Equtions Section Objective(s): Review: Constnt Coefficient Equtions Solving Vrible Coefficient Equtions The Integrting Fctor Method The Bernoulli Eqution 121 Review: Constnt

More information

Riemann Sums and Riemann Integrals

Riemann Sums and Riemann Integrals Riemnn Sums nd Riemnn Integrls Jmes K. Peterson Deprtment of Biologicl Sciences nd Deprtment of Mthemticl Sciences Clemson University August 26, 203 Outline Riemnn Sums Riemnn Integrls Properties Abstrct

More information

19 Optimal behavior: Game theory

19 Optimal behavior: Game theory Intro. to Artificil Intelligence: Dle Schuurmns, Relu Ptrscu 1 19 Optiml behvior: Gme theory Adversril stte dynmics hve to ccount for worst cse Compute policy π : S A tht mximizes minimum rewrd Let S (,

More information

APPROXIMATE INTEGRATION

APPROXIMATE INTEGRATION APPROXIMATE INTEGRATION. Introduction We hve seen tht there re functions whose nti-derivtives cnnot be expressed in closed form. For these resons ny definite integrl involving these integrnds cnnot be

More information

Riemann Sums and Riemann Integrals

Riemann Sums and Riemann Integrals Riemnn Sums nd Riemnn Integrls Jmes K. Peterson Deprtment of Biologicl Sciences nd Deprtment of Mthemticl Sciences Clemson University August 26, 2013 Outline 1 Riemnn Sums 2 Riemnn Integrls 3 Properties

More information

How do we solve these things, especially when they get complicated? How do we know when a system has a solution, and when is it unique?

How do we solve these things, especially when they get complicated? How do we know when a system has a solution, and when is it unique? XII. LINEAR ALGEBRA: SOLVING SYSTEMS OF EQUATIONS Tody we re going to tlk bout solving systems of liner equtions. These re problems tht give couple of equtions with couple of unknowns, like: 6 2 3 7 4

More information

Lecture notes. Fundamental inequalities: techniques and applications

Lecture notes. Fundamental inequalities: techniques and applications Lecture notes Fundmentl inequlities: techniques nd pplictions Mnh Hong Duong Mthemtics Institute, University of Wrwick Emil: m.h.duong@wrwick.c.uk Februry 8, 207 2 Abstrct Inequlities re ubiquitous in

More information

Taylor Polynomial Inequalities

Taylor Polynomial Inequalities Tylor Polynomil Inequlities Ben Glin September 17, 24 Abstrct There re instnces where we my wish to pproximte the vlue of complicted function round given point by constructing simpler function such s polynomil

More information

Lecture 14: Quadrature

Lecture 14: Quadrature Lecture 14: Qudrture This lecture is concerned with the evlution of integrls fx)dx 1) over finite intervl [, b] The integrnd fx) is ssumed to be rel-vlues nd smooth The pproximtion of n integrl by numericl

More information

Chapter 3 Polynomials

Chapter 3 Polynomials Dr M DRAIEF As described in the introduction of Chpter 1, pplictions of solving liner equtions rise in number of different settings In prticulr, we will in this chpter focus on the problem of modelling

More information

Chapter 4 Contravariance, Covariance, and Spacetime Diagrams

Chapter 4 Contravariance, Covariance, and Spacetime Diagrams Chpter 4 Contrvrince, Covrince, nd Spcetime Digrms 4. The Components of Vector in Skewed Coordintes We hve seen in Chpter 3; figure 3.9, tht in order to show inertil motion tht is consistent with the Lorentz

More information

Driving Cycle Construction of City Road for Hybrid Bus Based on Markov Process Deng Pan1, a, Fengchun Sun1,b*, Hongwen He1, c, Jiankun Peng1, d

Driving Cycle Construction of City Road for Hybrid Bus Based on Markov Process Deng Pan1, a, Fengchun Sun1,b*, Hongwen He1, c, Jiankun Peng1, d Interntionl Industril Informtics nd Computer Engineering Conference (IIICEC 15) Driving Cycle Construction of City Rod for Hybrid Bus Bsed on Mrkov Process Deng Pn1,, Fengchun Sun1,b*, Hongwen He1, c,

More information

Module 6: LINEAR TRANSFORMATIONS

Module 6: LINEAR TRANSFORMATIONS Module 6: LINEAR TRANSFORMATIONS. Trnsformtions nd mtrices Trnsformtions re generliztions of functions. A vector x in some set S n is mpped into m nother vector y T( x). A trnsformtion is liner if, for

More information

Variational Techniques for Sturm-Liouville Eigenvalue Problems

Variational Techniques for Sturm-Liouville Eigenvalue Problems Vritionl Techniques for Sturm-Liouville Eigenvlue Problems Vlerie Cormni Deprtment of Mthemtics nd Sttistics University of Nebrsk, Lincoln Lincoln, NE 68588 Emil: vcormni@mth.unl.edu Rolf Ryhm Deprtment

More information

Problem Set 3 Solutions

Problem Set 3 Solutions Chemistry 36 Dr Jen M Stndrd Problem Set 3 Solutions 1 Verify for the prticle in one-dimensionl box by explicit integrtion tht the wvefunction ψ ( x) π x is normlized To verify tht ψ ( x) is normlized,

More information

DIRECT CURRENT CIRCUITS

DIRECT CURRENT CIRCUITS DRECT CURRENT CUTS ELECTRC POWER Consider the circuit shown in the Figure where bttery is connected to resistor R. A positive chrge dq will gin potentil energy s it moves from point to point b through

More information

Tests for the Ratio of Two Poisson Rates

Tests for the Ratio of Two Poisson Rates Chpter 437 Tests for the Rtio of Two Poisson Rtes Introduction The Poisson probbility lw gives the probbility distribution of the number of events occurring in specified intervl of time or spce. The Poisson

More information

Bernoulli Numbers Jeff Morton

Bernoulli Numbers Jeff Morton Bernoulli Numbers Jeff Morton. We re interested in the opertor e t k d k t k, which is to sy k tk. Applying this to some function f E to get e t f d k k tk d k f f + d k k tk dk f, we note tht since f

More information

( dg. ) 2 dt. + dt. dt j + dh. + dt. r(t) dt. Comparing this equation with the one listed above for the length of see that

( dg. ) 2 dt. + dt. dt j + dh. + dt. r(t) dt. Comparing this equation with the one listed above for the length of see that Arc Length of Curves in Three Dimensionl Spce If the vector function r(t) f(t) i + g(t) j + h(t) k trces out the curve C s t vries, we cn mesure distnces long C using formul nerly identicl to one tht we

More information

MAT612-REAL ANALYSIS RIEMANN STIELTJES INTEGRAL

MAT612-REAL ANALYSIS RIEMANN STIELTJES INTEGRAL MAT612-REAL ANALYSIS RIEMANN STIELTJES INTEGRAL DR. RITU AGARWAL MALVIYA NATIONAL INSTITUTE OF TECHNOLOGY, JAIPUR, INDIA-302017 Tble of Contents Contents Tble of Contents 1 1. Introduction 1 2. Prtition

More information

MAA 4212 Improper Integrals

MAA 4212 Improper Integrals Notes by Dvid Groisser, Copyright c 1995; revised 2002, 2009, 2014 MAA 4212 Improper Integrls The Riemnn integrl, while perfectly well-defined, is too restrictive for mny purposes; there re functions which

More information

CBE 291b - Computation And Optimization For Engineers

CBE 291b - Computation And Optimization For Engineers The University of Western Ontrio Fculty of Engineering Science Deprtment of Chemicl nd Biochemicl Engineering CBE 9b - Computtion And Optimiztion For Engineers Mtlb Project Introduction Prof. A. Jutn Jn

More information

Physics 116C Solution of inhomogeneous ordinary differential equations using Green s functions

Physics 116C Solution of inhomogeneous ordinary differential equations using Green s functions Physics 6C Solution of inhomogeneous ordinry differentil equtions using Green s functions Peter Young November 5, 29 Homogeneous Equtions We hve studied, especilly in long HW problem, second order liner

More information

Review of Calculus, cont d

Review of Calculus, cont d Jim Lmbers MAT 460 Fll Semester 2009-10 Lecture 3 Notes These notes correspond to Section 1.1 in the text. Review of Clculus, cont d Riemnn Sums nd the Definite Integrl There re mny cses in which some

More information

Exam 2, Mathematics 4701, Section ETY6 6:05 pm 7:40 pm, March 31, 2016, IH-1105 Instructor: Attila Máté 1

Exam 2, Mathematics 4701, Section ETY6 6:05 pm 7:40 pm, March 31, 2016, IH-1105 Instructor: Attila Máté 1 Exm, Mthemtics 471, Section ETY6 6:5 pm 7:4 pm, Mrch 1, 16, IH-115 Instructor: Attil Máté 1 17 copies 1. ) Stte the usul sufficient condition for the fixed-point itertion to converge when solving the eqution

More information

1 The Lagrange interpolation formula

1 The Lagrange interpolation formula Notes on Qudrture 1 The Lgrnge interpoltion formul We briefly recll the Lgrnge interpoltion formul. The strting point is collection of N + 1 rel points (x 0, y 0 ), (x 1, y 1 ),..., (x N, y N ), with x

More information

Jim Lambers MAT 169 Fall Semester Lecture 4 Notes

Jim Lambers MAT 169 Fall Semester Lecture 4 Notes Jim Lmbers MAT 169 Fll Semester 2009-10 Lecture 4 Notes These notes correspond to Section 8.2 in the text. Series Wht is Series? An infinte series, usully referred to simply s series, is n sum of ll of

More information

Generalized Fano and non-fano networks

Generalized Fano and non-fano networks Generlized Fno nd non-fno networks Nildri Ds nd Brijesh Kumr Ri Deprtment of Electronics nd Electricl Engineering Indin Institute of Technology Guwhti, Guwhti, Assm, Indi Emil: {d.nildri, bkri}@iitg.ernet.in

More information

The use of a so called graphing calculator or programmable calculator is not permitted. Simple scientific calculators are allowed.

The use of a so called graphing calculator or programmable calculator is not permitted. Simple scientific calculators are allowed. ERASMUS UNIVERSITY ROTTERDAM Informtion concerning the Entrnce exmintion Mthemtics level 1 for Interntionl Bchelor in Communiction nd Medi Generl informtion Avilble time: 2 hours 30 minutes. The exmintion

More information

1.9 C 2 inner variations

1.9 C 2 inner variations 46 CHAPTER 1. INDIRECT METHODS 1.9 C 2 inner vritions So fr, we hve restricted ttention to liner vritions. These re vritions of the form vx; ǫ = ux + ǫφx where φ is in some liner perturbtion clss P, for

More information

Math 8 Winter 2015 Applications of Integration

Math 8 Winter 2015 Applications of Integration Mth 8 Winter 205 Applictions of Integrtion Here re few importnt pplictions of integrtion. The pplictions you my see on n exm in this course include only the Net Chnge Theorem (which is relly just the Fundmentl

More information

Indefinite Integral. Chapter Integration - reverse of differentiation

Indefinite Integral. Chapter Integration - reverse of differentiation Chpter Indefinite Integrl Most of the mthemticl opertions hve inverse opertions. The inverse opertion of differentition is clled integrtion. For exmple, describing process t the given moment knowing the

More information

UNIFORM CONVERGENCE. Contents 1. Uniform Convergence 1 2. Properties of uniform convergence 3

UNIFORM CONVERGENCE. Contents 1. Uniform Convergence 1 2. Properties of uniform convergence 3 UNIFORM CONVERGENCE Contents 1. Uniform Convergence 1 2. Properties of uniform convergence 3 Suppose f n : Ω R or f n : Ω C is sequence of rel or complex functions, nd f n f s n in some sense. Furthermore,

More information

Self-similarity and symmetries of Pascal s triangles and simplices mod p

Self-similarity and symmetries of Pascal s triangles and simplices mod p Sn Jose Stte University SJSU ScholrWorks Fculty Publictions Mthemtics nd Sttistics Februry 2004 Self-similrity nd symmetries of Pscl s tringles nd simplices mod p Richrd P. Kubelk Sn Jose Stte University,

More information

Review of basic calculus

Review of basic calculus Review of bsic clculus This brief review reclls some of the most importnt concepts, definitions, nd theorems from bsic clculus. It is not intended to tech bsic clculus from scrtch. If ny of the items below

More information

Cf. Linn Sennott, Stochastic Dynamic Programming and the Control of Queueing Systems, Wiley Series in Probability & Statistics, 1999.

Cf. Linn Sennott, Stochastic Dynamic Programming and the Control of Queueing Systems, Wiley Series in Probability & Statistics, 1999. Cf. Linn Sennott, Stochstic Dynmic Progrmming nd the Control of Queueing Systems, Wiley Series in Probbility & Sttistics, 1999. D.L.Bricker, 2001 Dept of Industril Engineering The University of Iow MDP

More information

Numerical Analysis: Trapezoidal and Simpson s Rule

Numerical Analysis: Trapezoidal and Simpson s Rule nd Simpson s Mthemticl question we re interested in numericlly nswering How to we evlute I = f (x) dx? Clculus tells us tht if F(x) is the ntiderivtive of function f (x) on the intervl [, b], then I =

More information

Riemann is the Mann! (But Lebesgue may besgue to differ.)

Riemann is the Mann! (But Lebesgue may besgue to differ.) Riemnn is the Mnn! (But Lebesgue my besgue to differ.) Leo Livshits My 2, 2008 1 For finite intervls in R We hve seen in clss tht every continuous function f : [, b] R hs the property tht for every ɛ >

More information

P 3 (x) = f(0) + f (0)x + f (0) 2. x 2 + f (0) . In the problem set, you are asked to show, in general, the n th order term is a n = f (n) (0)

P 3 (x) = f(0) + f (0)x + f (0) 2. x 2 + f (0) . In the problem set, you are asked to show, in general, the n th order term is a n = f (n) (0) 1 Tylor polynomils In Section 3.5, we discussed how to pproximte function f(x) round point in terms of its first derivtive f (x) evluted t, tht is using the liner pproximtion f() + f ()(x ). We clled this

More information

Math 270A: Numerical Linear Algebra

Math 270A: Numerical Linear Algebra Mth 70A: Numericl Liner Algebr Instructor: Michel Holst Fll Qurter 014 Homework Assignment #3 Due Give to TA t lest few dys before finl if you wnt feedbck. Exercise 3.1. (The Bsic Liner Method for Liner

More information

(e) if x = y + z and a divides any two of the integers x, y, or z, then a divides the remaining integer

(e) if x = y + z and a divides any two of the integers x, y, or z, then a divides the remaining integer Divisibility In this note we introduce the notion of divisibility for two integers nd b then we discuss the division lgorithm. First we give forml definition nd note some properties of the division opertion.

More information

Math 426: Probability Final Exam Practice

Math 426: Probability Final Exam Practice Mth 46: Probbility Finl Exm Prctice. Computtionl problems 4. Let T k (n) denote the number of prtitions of the set {,..., n} into k nonempty subsets, where k n. Argue tht T k (n) kt k (n ) + T k (n ) by

More information

Chapter 0. What is the Lebesgue integral about?

Chapter 0. What is the Lebesgue integral about? Chpter 0. Wht is the Lebesgue integrl bout? The pln is to hve tutoril sheet ech week, most often on Fridy, (to be done during the clss) where you will try to get used to the ides introduced in the previous

More information

Numerical Integration

Numerical Integration Chpter 1 Numericl Integrtion Numericl differentition methods compute pproximtions to the derivtive of function from known vlues of the function. Numericl integrtion uses the sme informtion to compute numericl

More information

Vyacheslav Telnin. Search for New Numbers.

Vyacheslav Telnin. Search for New Numbers. Vycheslv Telnin Serch for New Numbers. 1 CHAPTER I 2 I.1 Introduction. In 1984, in the first issue for tht yer of the Science nd Life mgzine, I red the rticle "Non-Stndrd Anlysis" by V. Uspensky, in which

More information

Here we study square linear systems and properties of their coefficient matrices as they relate to the solution set of the linear system.

Here we study square linear systems and properties of their coefficient matrices as they relate to the solution set of the linear system. Section 24 Nonsingulr Liner Systems Here we study squre liner systems nd properties of their coefficient mtrices s they relte to the solution set of the liner system Let A be n n Then we know from previous

More information

Theoretical foundations of Gaussian quadrature

Theoretical foundations of Gaussian quadrature Theoreticl foundtions of Gussin qudrture 1 Inner product vector spce Definition 1. A vector spce (or liner spce) is set V = {u, v, w,...} in which the following two opertions re defined: (A) Addition of

More information

Properties of Integrals, Indefinite Integrals. Goals: Definition of the Definite Integral Integral Calculations using Antiderivatives

Properties of Integrals, Indefinite Integrals. Goals: Definition of the Definite Integral Integral Calculations using Antiderivatives Block #6: Properties of Integrls, Indefinite Integrls Gols: Definition of the Definite Integrl Integrl Clcultions using Antiderivtives Properties of Integrls The Indefinite Integrl 1 Riemnn Sums - 1 Riemnn

More information

The Riemann-Lebesgue Lemma

The Riemann-Lebesgue Lemma Physics 215 Winter 218 The Riemnn-Lebesgue Lemm The Riemnn Lebesgue Lemm is one of the most importnt results of Fourier nlysis nd symptotic nlysis. It hs mny physics pplictions, especilly in studies of

More information

We partition C into n small arcs by forming a partition of [a, b] by picking s i as follows: a = s 0 < s 1 < < s n = b.

We partition C into n small arcs by forming a partition of [a, b] by picking s i as follows: a = s 0 < s 1 < < s n = b. Mth 255 - Vector lculus II Notes 4.2 Pth nd Line Integrls We begin with discussion of pth integrls (the book clls them sclr line integrls). We will do this for function of two vribles, but these ides cn

More information

A Fast and Reliable Policy Improvement Algorithm

A Fast and Reliable Policy Improvement Algorithm A Fst nd Relible Policy Improvement Algorithm Ysin Abbsi-Ydkori Peter L. Brtlett Stephen J. Wright Queenslnd University of Technology UC Berkeley nd QUT University of Wisconsin-Mdison Abstrct We introduce

More information

221A Lecture Notes WKB Method

221A Lecture Notes WKB Method A Lecture Notes WKB Method Hmilton Jcobi Eqution We strt from the Schrödinger eqution for single prticle in potentil i h t ψ x, t = [ ] h m + V x ψ x, t. We cn rewrite this eqution by using ψ x, t = e

More information

Improved Results on Stability of Time-delay Systems using Wirtinger-based Inequality

Improved Results on Stability of Time-delay Systems using Wirtinger-based Inequality Preprints of the 9th World Congress he Interntionl Federtion of Automtic Control Improved Results on Stbility of ime-dely Systems using Wirtinger-bsed Inequlity e H. Lee Ju H. Prk H.Y. Jung O.M. Kwon S.M.

More information

Lecture 3 ( ) (translated and slightly adapted from lecture notes by Martin Klazar)

Lecture 3 ( ) (translated and slightly adapted from lecture notes by Martin Klazar) Lecture 3 (5.3.2018) (trnslted nd slightly dpted from lecture notes by Mrtin Klzr) Riemnn integrl Now we define precisely the concept of the re, in prticulr, the re of figure U(, b, f) under the grph of

More information

NOTES ON HILBERT SPACE

NOTES ON HILBERT SPACE NOTES ON HILBERT SPACE 1 DEFINITION: by Prof C-I Tn Deprtment of Physics Brown University A Hilbert spce is n inner product spce which, s metric spce, is complete We will not present n exhustive mthemticl

More information

Journal of Inequalities in Pure and Applied Mathematics

Journal of Inequalities in Pure and Applied Mathematics Journl of Inequlities in Pure nd Applied Mthemtics GENERALIZATIONS OF THE TRAPEZOID INEQUALITIES BASED ON A NEW MEAN VALUE THEOREM FOR THE REMAINDER IN TAYLOR S FORMULA volume 7, issue 3, rticle 90, 006.

More information

The Wave Equation I. MA 436 Kurt Bryan

The Wave Equation I. MA 436 Kurt Bryan 1 Introduction The Wve Eqution I MA 436 Kurt Bryn Consider string stretching long the x xis, of indeterminte (or even infinite!) length. We wnt to derive n eqution which models the motion of the string

More information

Euler, Ioachimescu and the trapezium rule. G.J.O. Jameson (Math. Gazette 96 (2012), )

Euler, Ioachimescu and the trapezium rule. G.J.O. Jameson (Math. Gazette 96 (2012), ) Euler, Iochimescu nd the trpezium rule G.J.O. Jmeson (Mth. Gzette 96 (0), 36 4) The following results were estblished in recent Gzette rticle [, Theorems, 3, 4]. Given > 0 nd 0 < s

More information

3.4 Numerical integration

3.4 Numerical integration 3.4. Numericl integrtion 63 3.4 Numericl integrtion In mny economic pplictions it is necessry to compute the definite integrl of relvlued function f with respect to "weight" function w over n intervl [,

More information

Minimal DFA. minimal DFA for L starting from any other

Minimal DFA. minimal DFA for L starting from any other Miniml DFA Among the mny DFAs ccepting the sme regulr lnguge L, there is exctly one (up to renming of sttes) which hs the smllest possile numer of sttes. Moreover, it is possile to otin tht miniml DFA

More information

INTRODUCTION TO INTEGRATION

INTRODUCTION TO INTEGRATION INTRODUCTION TO INTEGRATION 5.1 Ares nd Distnces Assume f(x) 0 on the intervl [, b]. Let A be the re under the grph of f(x). b We will obtin n pproximtion of A in the following three steps. STEP 1: Divide

More information

Reversals of Signal-Posterior Monotonicity for Any Bounded Prior

Reversals of Signal-Posterior Monotonicity for Any Bounded Prior Reversls of Signl-Posterior Monotonicity for Any Bounded Prior Christopher P. Chmbers Pul J. Hely Abstrct Pul Milgrom (The Bell Journl of Economics, 12(2): 380 391) showed tht if the strict monotone likelihood

More information

Integral points on the rational curve

Integral points on the rational curve Integrl points on the rtionl curve y x bx c x ;, b, c integers. Konstntine Zeltor Mthemtics University of Wisconsin - Mrinette 750 W. Byshore Street Mrinette, WI 5443-453 Also: Konstntine Zeltor P.O. Box

More information

Student Activity 3: Single Factor ANOVA

Student Activity 3: Single Factor ANOVA MATH 40 Student Activity 3: Single Fctor ANOVA Some Bsic Concepts In designed experiment, two or more tretments, or combintions of tretments, is pplied to experimentl units The number of tretments, whether

More information