Learning to Take Concurrent Actions

Size: px
Start display at page:

Download "Learning to Take Concurrent Actions"

Transcription

1 Learning o Take Concurren Acions Khashayar Rohanimanesh Deparmen of Compuer Science Universiy of Massachuses Amhers, MA 0003 khash@cs.umass.edu Sridhar Mahadevan Deparmen of Compuer Science Universiy of Massachuses Amhers, MA 0003 mahadeva@cs.umass.edu Absrac We invesigae a general semi-markov Decision Process (SMDP) framework for modeling concurren decision making, where agens learn opimal plans over concurren emporally exended acions. We inroduce hree ypes of parallel erminaion schemes all, any and coninue and heoreically and experimenally compare hem. Inroducion We invesigae a general framework for modeling concurren acions. The noion of concurren acion is formalized in a general way, o capure boh siuaions where a single agen can execue muliple parallel processes, as well as he muli-agen case where many agens ac in parallel. Concurrency clearly allows agens o achieve goals more quickly: in making breakfas, we inerleave making oas and coffee wih oher aciviies such as geing milk; in driving, we search for road signs while conrolling he wheel, acceleraor and brakes. Mos previous work on concurrency has focused on parallelizing primiive (uni sep) acions. Reier developed axioms for concurren planning using he siuaion calculus framework [4]. Knoblock [3] and Bouilier [] modify he STRIPS represenaion of acions o allow for concurren acions. These approaches assume deerminisic effecs. Prior work in decision-heoreic planning includes work on muli-dimensional vecor acion spaces [2], and models based on dynamic merging of muliple MDPs [6]. There is also a massive lieraure on concurren processes, dynamic logic, and emporal logic. Pars of hese lines of research deal wih he specificaion and synhesis of concurren acions, including probabilisic ones [8]. In conras, we focus on parallelizing emporally exended acions. The concurrency framework described below significanly exends our previous work [5]. We provide a deailed analysis of hree erminaion schemes for composing parallel acion srucures. The hree schemes any, all, and coninue are illusraed in Figure. We characerize he class of policies under each scheme. We also heoreically compare he opimaliy of he concurren policies under each scheme wih ha of he ypical

2 sequenial case. The heoreical resuls are complemened by an experimenal sudy, which illusrae he rade-offs beween opimaliy and convergence speed, and he advanages of concurrency over sequenialiy. 2 Concurren Acion Model Building on SMDPs, we inroduce he Concurren Acion Model (CAM) (S, A, T, R), where S is a se of saes, A is a se of primary acions, T is a ransiion probabiliy disribuion S (A) S N [0, ], where (A) is he power-se of he primary acions and N is he se of naural numbers, and R is he reward funcion mapping S R. Here, a concurren acion is simply represened as a se of primary acions (hereafer called a muli-acion), where each primary acion is eiher a single sep acion, or a emporally exended acion (e.g., modeled as a closed loop policy over single sep acions [7]). We denoe he se of muli-acions ha can be execued in a sae s by A(s). In pracice, his funcion can capure resource consrains ha limi how many acions an agen can execue in parallel. Thus, he ransiion probabiliy disribuion in pracice may be defined over a much smaller subse han he power-se of primary acions (e.g., in he grid world example in Figure 3, he power se is > 00, bu he se of concurren acions is only 0). S a erminaed a a 2 a 3 +k inerruped S erminaed a a 2 a 3 +k S a erminaed +k Nex muli-acion a = {a, a 2, a 3, a 4 } +k a a a 2 a 3 a 4 a 4 a 4 muli-acion d a = {a, a 2, a 3, a 4 } n d n+ d n muli-acion a = {a, a 2, a 3, a 4 } d n+ Curren muli-acion d n a = {a, a 2, a 3, a 4 } d n+ Coninue o run Figure : Lef: T any erminaion scheme. Middle: T all erminaion scheme. Righ: T coninue erminaion scheme. A principal goal of his paper is o undersand how o define decision epochs for concurren processes, since he primary acions in a muli-acion may no erminae a he same ime. The even of erminaion of a muli-acion can be defined in many ways. Three erminaion schemes are illusraed in Figure. In he T any erminaion scheme (Figure, lef), he nex decision epoch is when he firs primary acion wihin he muli-acion currenly being execued erminaes, where he res of he primary acions ha did no erminae naurally are inerruped (he noion of inerrupion is similar o [7]). In he T all erminaion scheme (Figure, middle), he nex decision epoch is he earlies ime a which all he primary acions wihin he muli-acion currenly being execued have erminaed. We can design oher erminaion schemes by combining T any and T all : for example, anoher erminaion scheme called coninue is one ha always erminaes based on he T any erminaion scheme, bu les hose primary acions ha did no erminae naurally coninue running, while iniiaing new primary acions if hey are going o be useful (Figure, righ). A deerminisic Markovian (memoryless) policy in CAMs is defined as he mapping π : S (A). Noe ha even hough he mapping is defined independen of he

3 erminaion scheme, he behavior of a muli-acion policy depends on he erminaion scheme ha is used in he model. To illusrae his, le < π, τ > (called a policy-erminaion consruc) denoe he process of execuing he muli-acion policy π using he erminaion scheme τ {T any, T all }. To simplify noaion, we only use his form whenever we wan o explicily poin ou wha erminaion scheme is being used for execuing he policy π. For a given Markovian policy, we can wrie he value of ha policy in an arbirary sae given he erminaion mechanism used in he model. Le Θ(π, s, τ) denoe he even of iniiaing he muli-acion π(s ) a ime and erminaing i according o he τ {T any, T all } erminaion scheme. Also le π τ denoe he opimal muli-acion policy wihin he space of policies over muli-acions ha erminae according o he τ {T any, T all } erminaion scheme. To simplify noaion, we may alernaively use τ o denoe opimaliy wih respec o he τ erminaion scheme. Then he opimal value funcion can be wrien as: V τ (s ) = E{r + + γr γ k r +k + γ k max (s a A(s +k ) Q τ +k, a) Θ(π τ, s, τ)} where Q τ (s +k, a) denoes he muli-acion value of execuing a in sae s +k (erminaed using τ) and following he opimal policy π τ hereafer. The policy associaed wih he coninue erminaion scheme is a hisory dependen policy, since for a given sae s, he coninue policy will selec a muli-acion such ha i includes he se of all he primary acions of he muli-acion execued in he previous decision epoch ha did no erminae naurally in he curren sae s (we refer o his se as he coninue-se represened by h ). The coninue policy is defined as he mapping π con : S H (A) in which H is a se of coninue-ses h. Noe ha he value funcion definiion for he coninue policy should be defined over boh sae s and he coninue-se h (represened by s, h ), i.e., V πcon ( s, h ). Le he funcion A(s, h ) reurn he se of muli-acions ha can be execued in sae s ha include he coninuing primary acions in h. Then he coninue policy is formally defined as: π con ( s, h ) = arg max a A(s,h ) Q πcon ( s, h, a). To illusrae his, assume ha he curren sae is s and he muli-acion a = {a, a 2, a 3, a 4 } is execued in sae s. Also, assume ha he primary acion a is he firs acion ha erminaes afer k seps in sae s +k. According o he definiion of he coninue erminaion scheme (ha erminaes based on T any ), he muli-acion a is erminaed a ime + k and we need o selec a new muliacion o execue in sae s +k (wih he coninue-se h +k = {a 2, a 3, a 4 }). The coninue policy will selec he bes muli-acion a +k ha includes he primary acions {a 2, a 3, a 4 }, since hey did no erminae in sae s +k (see Figure, righ). 3 Theoreical Resuls In his secion we presen some of our heoreical resuls comparing he opimaliy of various policies under differen erminaion schemes inroduced in he previous secion. In all of hese heorems we use he parial ordering relaion V π V π2 π π 2, in order o compare differen policies. For lack of space, we abbreviaed he proofs. Noe ha in heorems and 3 which compare he coninue policy wih π any and π all policies, he value funcion is wrien over he pair s, h o be consisen wih he definiion of he coninue policy. This does no influence he original definiion of he value funcion for he opimal policies in T any and T all

4 erminaion schemes, since hey are independen of he coninue-se h. Firs, we compare he opimal muli-acion policies based on he T any erminaion scheme and he coninue policy. Theorem : For every sae s S, and all coninue-se h H, V π con ( s, h ) V any ( s, h ). Proof: By wriing he value funcion definiion for each case we have: V π con ( s, h ) = max a A(s,h ) Qπ con ( s, h, a) max a A(s ) Qπ con ( s, h, a) max Q any ( s, h, a) = V any ( s, h ) a A(s ) The inequaliy holds since he maximizaion in π con is over a smaller se (i.e., A(s, h )) which is a subse of he larger se A(s ) ha is maximized over, in he π any case. Nex, we show ha he opimal plans wih muli-acions ha erminae according o he T any erminaion scheme are beer compared o he opimal plans wih muli-acions ha erminae according o he T all erminaion scheme: Theorem 2: For every sae s S, V all (s) V any (s). Proof: The proof is based on he following lemma which saes ha if we aler he execuion of he opimal muli-acion policy based on T all (i.e., π all ) in such a way ha a every decision epoch he nex muli-acion is sill seleced from π all, bu we erminae i based on T any hen he new policy-erminaion consruc represened by < all, any > is beer han he π all policy. Inuiively his makes sense, since if we inerrup π all (s) when he firs primary acion a i a = π all (s) erminaes in some fuure sae s, due o he opimaliy of π all, execuing π all (s ) is always beer han or equal o coninuing some oher policy such as he one in progress (i.e., π all (s)). Noe ha he proof is no as simple as in he firs heorem since he wo differen policies discussed in his heorem (i.e., π any and π all ) are no being execued using he same erminaion mehod. Lemma : For every sae s S, V all (s) V < all,any> (s). Proof: Le V all n,any(s) denoe he value of following he opimal π all policy in sae s, where for he firs n decision epochs we use he T any erminaion scheme and for he res we use he T all erminaion scheme. By inducion on n, we can show ha V all (s) V all n,any(s), s S and for all n. This suggess ha if we always erminae a muli-acion π all (s ) according o he T any erminaion scheme, we achieve a beer reurn; or mahemaically V all (s) lim n V all n,any(s) = V < all,any> (s). Using Lemma, and he opimaliy of π any in he space of policies wih erminaion scheme according o T any, i follows ha V all (s) V < all,any> (s) V any (s). Nex, we show ha if we execue he coninue policy in which a any decision epoch we always execue he bes se of primary acions along wih hose ones ha were execued in he previous decision epoch and have no erminaed ye, we achieve a beer reurn compared o he case in which we execue he bes se of primary acions, bu always wai unil all of he primary acions erminae before making a new decision: Theorem 3: For every sae s S, and all coninue-se h H, V all ( s, h ) V π con ( s, h ). Proof: In π all policies, muli-acions are execued unil all of he primary acions

5 of ha muli-acion erminae. The coninue policy, however, may also iniiae new useful primary acion in addiion o hose already running which may achieve a beer reurn. Le V all n,con( s, h ) denoe he value of he alered policy π all ha works as follows: for a given sae and coninue-se s, h, he policy π all ( s, h ) is execued while for he firs n decision epochs we use he coninue erminaion scheme (which means erminaing according o T any, and selecing he nex muli-acion according o he coninue policy) and for he res we use he T all erminaion scheme. By inducion on n, i can be shown ha V all ( s, h ) V all n,con( s, h ) for all n. This suggess ha as we increase n, he alered policy behaves more like he coninue policy and hus in he limi we have V all ( s, h ) lim n V all n,con( s, h ) = V πcon ( s, h ) which proves he heorem. Finally we show ha he opimal muli-acion policies based on T all erminaion scheme are as good as he case where he agen always execues a single primary acion a a ime, as i is he case in sandard SMDPs. Noe ha his heorem does no sae ha concurren plans are always beer han sequenial ones; i simply says ha if in a problem, he sequenial execuion of he primary acions is he bes policy, CAM is able o represen and find ha policy. Le π seq represen he opimal policy in he sequenial case, where only one primary acion can be execued a a ime: Theorem 4: For every sae s S, V seq (s) V all (s), in which V seq (s) is he value of he opimal policy when he primary acions are execued one a a ime sequenially. Proof: I suffices o show ha sequenial policies are wihin he space of concurren policies. This holds since a single primary acion can be considered as a muli-acion conaining only one primary acion whose erminaion is consisen wih eiher of he muli-acion erminaion schemes (i.e., in he sequenial case boh T any and T all erminaion schemes are same). Corollary summarizes our heoreical resuls. I shows how differen policies in a concurren acion model using differen erminaion schemes compare o each oher in erms of opimaliy. Corollary : In a concurren acion model and a se of erminaion schemes {T any, T all, coninue}, he following parial ordering holds among he opimal policy based on T any, he opimal policy based on T all, he coninue policy and he opimal sequenial policy: π seq π all π con π any. Proof: This follows immediaely from he above heorems. Figure 2 visually describes he summary of resuls ha we presened in Corollary. According o his figure, he opimal muli-acion policies based on T any and T all, and also coninue muli-acion policies dominae (wih respec o he parial ordering relaion defined over policies) he opimal policies over he sequenial case. Furhermore, policies based on coninue muli-acions dominae he opimal muliacion policies based on T all erminaion scheme, while hemselves being dominaed by he opimal muli-acion policies based on T any erminaion scheme.

6 Muli-acion policies using T any Coninue muli-acion policies Muli-acion policies using T all Policies over sequenial acions Figure 2: Comparison of policies over muli-acions and sequenial primary acions using differen erminaion schemes. 4 Experimenal Resuls In his secion we presen experimenal resuls using a grid world ask comparing various erminaion schemes (see Figure 3). Each hallway connecs wo rooms, and has a door wih wo locks. An agen has o rerieve wo keys and hold boh keys a he same ime in order o open boh locks. The process of picking up keys is modeled as a emporally exended acion ha akes differen amoun of imes for each key. Moreover, keys canno be held indefiniely, since he agen may drop a key occasionally. Therefore he agen needs o find an efficien soluion for picking up he keys in parallel wih navigaion o ac opimally. This is an episodic ask, in which a he beginning of each episode he agen is placed in a fixed posiion (upper lef corner) and he goal of he agen is o navigae o a fixed posiion goal (hallway H3). Agen H0-4 sochasic primiive acions (Up, Down, Lef and Righ) - Fail 0% of imes, when fails i will move randomly o one of he neighbors H H3 (Goal) - 8 muli-sep navigaion acions (o each room s 2 hallways) - One primiive no-op acion - 3 sochasic primiive acions for keys (ge-key, key-nop and puback-key) - 2 muli-sep key acions (pickup-key), one for each key - Drop each key 30% of imes when holding i H2 Figure 3: A navigaion problem ha requires concurren plans. There are wo locks on each door, which need o be opened simulaneously. Rerieving each key akes differen amouns of ime. The agen can execue wo ypes of acion concurrenly: () navigaion acions, and (2) key acions. Navigaion acions include a se of one-sep sochasic navigaion acions (Up, Lef, Down and Righ) ha move he agen in he corresponding direcion wih probabiliy 0.9 and fail wih probabiliy 0.. Upon failure he agen moves insead in one of he oher hree direcions, each wih probabiliy 30. There is also a se of emporally exended acions defined over he one sep navigaion acions ha ranspor he agen from wihin he room o one of he wo hallway cells leading ou of he room (Figure 4 (lef)). Key acions are defined o manipulae each

7 key (ge-key, puback-key, pickup-key, ec). Among hem pickup-key is a emporally exended acion (Figure 4 (righ)). Noe ha each key has is own se of acions. Door is closed & boh keys are ready Primiive acion "ge-key" Primiive acion "key-nop" Door is open Inside he room Primiive acion "puback-key" Muli-sep acion "pickup-key" S 0 S... S 0 Muli-sep hallway acion can be aken 0.7 Targe Hallway Muli-sep hallway acion can no be aken Door is closed & keys are no ready Ouside he room S0 S S 6 S7... Key Ready Key Dropped Key Key S0 S S 2... S 6 Key Ready Key Dropped S0 Figure 4: Lef: he policy associaed wih one of he hallway emporally exended acions. Righ: represenaion of he key pickup acions for each key process. In his example, navigaion acions can be execued concurrenly wih key acions. Acions ha manipulae differen keys can be also execued concurrenly. However, he agen is no allowed o execue more han one navigaion acion, or more han one key acion (from he same key acion se) concurrenly. In order o properly handle concurren execuion of acions, we have used a facored sae space defined by sae variables posiion (04 posiions), key-sae ( saes) and key2-sae (7 saes). In our previous work we showed ha concurren acions formed an SMDP over primiive acions [5], which urns ou o hold for all he erminaion schemes described above. Thus, we can use SMDP Q-learning o compare concurren policies over differen erminaion schemes wih he use of his mehod for purely sequenial policy learning [7]. Afer each decision epoch where he muli-acion a is aken in some sae s and erminaes in sae s, he following updae rule is used: Q(s, a) Q(s, a) + α [ r + γ k max a A(s ) Q(s, a ) Q(s, a) ], where k denoes he number of ime seps since iniiaion of he muli-acion a a sae s and is erminaion a sae s, and r denoes he cumulaive discouned reward over his period. The agen is punished by for each primiive acion. Figure 5 (lef) compares he number of primiive acions aken unil success, and Figure 5 (righ) shows he median number of decision epochs per rial, where for rial n, i is he median of all rials from o n. These daa are averaged over 0 episodes, each consising of 500, 000 rials. As shown in figure 5 (lef), concurren acions over any erminaion scheme yield a faser plan han sequenial execuion. Moreover, he policies learned based on T any (i.e. boh π any and π con ) are also faser han T all. Also, π any achieves higher opimaliy han π con, however he difference is small. We conjecure ha sequenial execuion and T all converge faser compared o T any, due o he frequency wih which muli-acions are erminaed. As shown in Figure 5 (righ), T all makes fewer decisions, compared o T any. This is inuiive since T all erminaes only when all of he primary acions in a muli-acion are compleed, and hence i involves less inerrupion compared o learning based on T any. Noe π con converges faser han π any and i is nearly as good as T any.. We can hink of

8 Median/Trials (seps o goal) Sequenial Acions Concurren Acions: opimal, T-all Concurren Acions: opimal, T-any Concurren Acions: coninue Median/Trials (# of decision epochs) Sequenial Acions Concurren Acions: opimal, T-all Concurren Acions: opimal, T-any Concurren Acions: coninue Trial Trial Figure 5: Lef: moving median of number of seps o he goal. Righ: moving median of number of muli-acion level decision epochs aken o he goal. π con as a blend of T all and T any. Even hough i uses he T any erminaion scheme, i coninues execuing primary acions ha did no erminae naurally when he firs primary acion erminaes, making i similar o T all. 5 Fuure Work Even hough specifying he A(s) se of applicable muli-acions migh significanly reduce he se of choices, we sill may need addiional mechanisms for efficienly searching he space of muli-acions ha can run in parallel. Also, we can addiionally exploi he hierarchical srucure of muli-acions o compile hem ino an effecive policy over primary acions. These are some of he pracical issues ha we will invesigae in fuure work. References [] Craig Bouilier and Ronen Brafman. Planning wih concurren ineracing acions. In Proceedings of he Foureenh Naional Conference on Arificial Inelligence (AAAI 97), 997. [2] P. Cichosz. Learning mulidimensional conrol acions from delayed reinforcemens. In Eighh Inernaional Symposium on Sysem-Modelling-Conrol (SMC-8), Zakopane, Poland, 995. [3] C. A. Knoblock. Generaing parallel execuion plans wih a parial-order planner. In Proceedings of he Second Inernaional Conference on Arificial Inelligence Planning Sysems, Chicago, IL, 994., 994. [4] Ray Reier. Naural acions, concurrency and coninuous ime in he siuaion calculus. Principles of Knowledge Represenaion and Reasoning: Proceedings of he Fifh Inernaional Conference (KR 96), Cambridge MA., November 5-8, 996, 996. [5] Khashayar Rohanimanesh and Sridhar Mahadevan. Decision-heoreic planning wih concurren emporally exended acions. In Proceedings of he 7h Conference on Uncerainy in Arificial Inelligence, 200. [6] S. Singh and David Cohn. How o dynamically merge markov decision processes. Proceedings of NIPS, 998. [7] R. Suon, D. Precup, and S. Singh. Beween MDPs and Semi-MDPs: A framework for emporal absracion in reinforcemen learning. Arificial Inelligence, pages 8 2, 999. [8] Glynn Winskel. Topics in concurrency: Par ii comp. sci. lecure noes. Compuer Science course a he Universiy of Cambridge, 2002.

RL Lecture 7: Eligibility Traces. R. S. Sutton and A. G. Barto: Reinforcement Learning: An Introduction 1

RL Lecture 7: Eligibility Traces. R. S. Sutton and A. G. Barto: Reinforcement Learning: An Introduction 1 RL Lecure 7: Eligibiliy Traces R. S. Suon and A. G. Baro: Reinforcemen Learning: An Inroducion 1 N-sep TD Predicion Idea: Look farher ino he fuure when you do TD backup (1, 2, 3,, n seps) R. S. Suon and

More information

Presentation Overview

Presentation Overview Acion Refinemen in Reinforcemen Learning by Probabiliy Smoohing By Thomas G. Dieerich & Didac Busques Speaer: Kai Xu Presenaion Overview Bacground The Probabiliy Smoohing Mehod Experimenal Sudy of Acion

More information

Overview. COMP14112: Artificial Intelligence Fundamentals. Lecture 0 Very Brief Overview. Structure of this course

Overview. COMP14112: Artificial Intelligence Fundamentals. Lecture 0 Very Brief Overview. Structure of this course OMP: Arificial Inelligence Fundamenals Lecure 0 Very Brief Overview Lecurer: Email: Xiao-Jun Zeng x.zeng@mancheser.ac.uk Overview This course will focus mainly on probabilisic mehods in AI We shall presen

More information

T L. t=1. Proof of Lemma 1. Using the marginal cost accounting in Equation(4) and standard arguments. t )+Π RB. t )+K 1(Q RB

T L. t=1. Proof of Lemma 1. Using the marginal cost accounting in Equation(4) and standard arguments. t )+Π RB. t )+K 1(Q RB Elecronic Companion EC.1. Proofs of Technical Lemmas and Theorems LEMMA 1. Le C(RB) be he oal cos incurred by he RB policy. Then we have, T L E[C(RB)] 3 E[Z RB ]. (EC.1) Proof of Lemma 1. Using he marginal

More information

Diebold, Chapter 7. Francis X. Diebold, Elements of Forecasting, 4th Edition (Mason, Ohio: Cengage Learning, 2006). Chapter 7. Characterizing Cycles

Diebold, Chapter 7. Francis X. Diebold, Elements of Forecasting, 4th Edition (Mason, Ohio: Cengage Learning, 2006). Chapter 7. Characterizing Cycles Diebold, Chaper 7 Francis X. Diebold, Elemens of Forecasing, 4h Ediion (Mason, Ohio: Cengage Learning, 006). Chaper 7. Characerizing Cycles Afer compleing his reading you should be able o: Define covariance

More information

Lecture 2-1 Kinematics in One Dimension Displacement, Velocity and Acceleration Everything in the world is moving. Nothing stays still.

Lecture 2-1 Kinematics in One Dimension Displacement, Velocity and Acceleration Everything in the world is moving. Nothing stays still. Lecure - Kinemaics in One Dimension Displacemen, Velociy and Acceleraion Everyhing in he world is moving. Nohing says sill. Moion occurs a all scales of he universe, saring from he moion of elecrons in

More information

Lecture Notes 2. The Hilbert Space Approach to Time Series

Lecture Notes 2. The Hilbert Space Approach to Time Series Time Series Seven N. Durlauf Universiy of Wisconsin. Basic ideas Lecure Noes. The Hilber Space Approach o Time Series The Hilber space framework provides a very powerful language for discussing he relaionship

More information

Final Spring 2007

Final Spring 2007 .615 Final Spring 7 Overview The purpose of he final exam is o calculae he MHD β limi in a high-bea oroidal okamak agains he dangerous n = 1 exernal ballooning-kink mode. Effecively, his corresponds o

More information

1 Review of Zero-Sum Games

1 Review of Zero-Sum Games COS 5: heoreical Machine Learning Lecurer: Rob Schapire Lecure #23 Scribe: Eugene Brevdo April 30, 2008 Review of Zero-Sum Games Las ime we inroduced a mahemaical model for wo player zero-sum games. Any

More information

Two Coupled Oscillators / Normal Modes

Two Coupled Oscillators / Normal Modes Lecure 3 Phys 3750 Two Coupled Oscillaors / Normal Modes Overview and Moivaion: Today we ake a small, bu significan, sep owards wave moion. We will no ye observe waves, bu his sep is imporan in is own

More information

On Multicomponent System Reliability with Microshocks - Microdamages Type of Components Interaction

On Multicomponent System Reliability with Microshocks - Microdamages Type of Components Interaction On Mulicomponen Sysem Reliabiliy wih Microshocks - Microdamages Type of Componens Ineracion Jerzy K. Filus, and Lidia Z. Filus Absrac Consider a wo componen parallel sysem. The defined new sochasic dependences

More information

SZG Macro 2011 Lecture 3: Dynamic Programming. SZG macro 2011 lecture 3 1

SZG Macro 2011 Lecture 3: Dynamic Programming. SZG macro 2011 lecture 3 1 SZG Macro 2011 Lecure 3: Dynamic Programming SZG macro 2011 lecure 3 1 Background Our previous discussion of opimal consumpion over ime and of opimal capial accumulaion sugges sudying he general decision

More information

Online Appendix to Solution Methods for Models with Rare Disasters

Online Appendix to Solution Methods for Models with Rare Disasters Online Appendix o Soluion Mehods for Models wih Rare Disasers Jesús Fernández-Villaverde and Oren Levinal In his Online Appendix, we presen he Euler condiions of he model, we develop he pricing Calvo block,

More information

Notes for Lecture 17-18

Notes for Lecture 17-18 U.C. Berkeley CS278: Compuaional Complexiy Handou N7-8 Professor Luca Trevisan April 3-8, 2008 Noes for Lecure 7-8 In hese wo lecures we prove he firs half of he PCP Theorem, he Amplificaion Lemma, up

More information

Application of a Stochastic-Fuzzy Approach to Modeling Optimal Discrete Time Dynamical Systems by Using Large Scale Data Processing

Application of a Stochastic-Fuzzy Approach to Modeling Optimal Discrete Time Dynamical Systems by Using Large Scale Data Processing Applicaion of a Sochasic-Fuzzy Approach o Modeling Opimal Discree Time Dynamical Sysems by Using Large Scale Daa Processing AA WALASZE-BABISZEWSA Deparmen of Compuer Engineering Opole Universiy of Technology

More information

Cash Flow Valuation Mode Lin Discrete Time

Cash Flow Valuation Mode Lin Discrete Time IOSR Journal of Mahemaics (IOSR-JM) e-issn: 2278-5728,p-ISSN: 2319-765X, 6, Issue 6 (May. - Jun. 2013), PP 35-41 Cash Flow Valuaion Mode Lin Discree Time Olayiwola. M. A. and Oni, N. O. Deparmen of Mahemaics

More information

Planning in POMDPs. Dominik Schoenberger Abstract

Planning in POMDPs. Dominik Schoenberger Abstract Planning in POMDPs Dominik Schoenberger d.schoenberger@sud.u-darmsad.de Absrac This documen briefly explains wha a Parially Observable Markov Decision Process is. Furhermore i inroduces he differen approaches

More information

Chapter 2. First Order Scalar Equations

Chapter 2. First Order Scalar Equations Chaper. Firs Order Scalar Equaions We sar our sudy of differenial equaions in he same way he pioneers in his field did. We show paricular echniques o solve paricular ypes of firs order differenial equaions.

More information

5. Stochastic processes (1)

5. Stochastic processes (1) Lec05.pp S-38.45 - Inroducion o Teleraffic Theory Spring 2005 Conens Basic conceps Poisson process 2 Sochasic processes () Consider some quaniy in a eleraffic (or any) sysem I ypically evolves in ime randomly

More information

Zürich. ETH Master Course: L Autonomous Mobile Robots Localization II

Zürich. ETH Master Course: L Autonomous Mobile Robots Localization II Roland Siegwar Margaria Chli Paul Furgale Marco Huer Marin Rufli Davide Scaramuzza ETH Maser Course: 151-0854-00L Auonomous Mobile Robos Localizaion II ACT and SEE For all do, (predicion updae / ACT),

More information

2. Nonlinear Conservation Law Equations

2. Nonlinear Conservation Law Equations . Nonlinear Conservaion Law Equaions One of he clear lessons learned over recen years in sudying nonlinear parial differenial equaions is ha i is generally no wise o ry o aack a general class of nonlinear

More information

STATE-SPACE MODELLING. A mass balance across the tank gives:

STATE-SPACE MODELLING. A mass balance across the tank gives: B. Lennox and N.F. Thornhill, 9, Sae Space Modelling, IChemE Process Managemen and Conrol Subjec Group Newsleer STE-SPACE MODELLING Inroducion: Over he pas decade or so here has been an ever increasing

More information

Unit Root Time Series. Univariate random walk

Unit Root Time Series. Univariate random walk Uni Roo ime Series Univariae random walk Consider he regression y y where ~ iid N 0, he leas squares esimae of is: ˆ yy y y yy Now wha if = If y y hen le y 0 =0 so ha y j j If ~ iid N 0, hen y ~ N 0, he

More information

The Asymptotic Behavior of Nonoscillatory Solutions of Some Nonlinear Dynamic Equations on Time Scales

The Asymptotic Behavior of Nonoscillatory Solutions of Some Nonlinear Dynamic Equations on Time Scales Advances in Dynamical Sysems and Applicaions. ISSN 0973-5321 Volume 1 Number 1 (2006, pp. 103 112 c Research India Publicaions hp://www.ripublicaion.com/adsa.hm The Asympoic Behavior of Nonoscillaory Soluions

More information

Logic in computer science

Logic in computer science Logic in compuer science Logic plays an imporan role in compuer science Logic is ofen called he calculus of compuer science Logic plays a similar role in compuer science o ha played by calculus in he physical

More information

10. State Space Methods

10. State Space Methods . Sae Space Mehods. Inroducion Sae space modelling was briefly inroduced in chaper. Here more coverage is provided of sae space mehods before some of heir uses in conrol sysem design are covered in he

More information

Inventory Analysis and Management. Multi-Period Stochastic Models: Optimality of (s, S) Policy for K-Convex Objective Functions

Inventory Analysis and Management. Multi-Period Stochastic Models: Optimality of (s, S) Policy for K-Convex Objective Functions Muli-Period Sochasic Models: Opimali of (s, S) Polic for -Convex Objecive Funcions Consider a seing similar o he N-sage newsvendor problem excep ha now here is a fixed re-ordering cos (> 0) for each (re-)order.

More information

3.1.3 INTRODUCTION TO DYNAMIC OPTIMIZATION: DISCRETE TIME PROBLEMS. A. The Hamiltonian and First-Order Conditions in a Finite Time Horizon

3.1.3 INTRODUCTION TO DYNAMIC OPTIMIZATION: DISCRETE TIME PROBLEMS. A. The Hamiltonian and First-Order Conditions in a Finite Time Horizon 3..3 INRODUCION O DYNAMIC OPIMIZAION: DISCREE IME PROBLEMS A. he Hamilonian and Firs-Order Condiions in a Finie ime Horizon Define a new funcion, he Hamilonian funcion, H. H he change in he oal value of

More information

Kriging Models Predicting Atrazine Concentrations in Surface Water Draining Agricultural Watersheds

Kriging Models Predicting Atrazine Concentrations in Surface Water Draining Agricultural Watersheds 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 Kriging Models Predicing Arazine Concenraions in Surface Waer Draining Agriculural Waersheds Paul L. Mosquin, Jeremy Aldworh, Wenlin Chen Supplemenal Maerial Number

More information

CSE/NB 528 Lecture 14: Reinforcement Learning (Chapter 9)

CSE/NB 528 Lecture 14: Reinforcement Learning (Chapter 9) CSE/NB 528 Lecure 14: Reinforcemen Learning Chaper 9 Image from hp://clasdean.la.asu.edu/news/images/ubep2001/neuron3.jpg Lecure figures are from Dayan & Abbo s book hp://people.brandeis.edu/~abbo/book/index.hml

More information

BU Macro BU Macro Fall 2008, Lecture 4

BU Macro BU Macro Fall 2008, Lecture 4 Dynamic Programming BU Macro 2008 Lecure 4 1 Ouline 1. Cerainy opimizaion problem used o illusrae: a. Resricions on exogenous variables b. Value funcion c. Policy funcion d. The Bellman equaion and an

More information

Expert Advice for Amateurs

Expert Advice for Amateurs Exper Advice for Amaeurs Ernes K. Lai Online Appendix - Exisence of Equilibria The analysis in his secion is performed under more general payoff funcions. Wihou aking an explici form, he payoffs of he

More information

Two Popular Bayesian Estimators: Particle and Kalman Filters. McGill COMP 765 Sept 14 th, 2017

Two Popular Bayesian Estimators: Particle and Kalman Filters. McGill COMP 765 Sept 14 th, 2017 Two Popular Bayesian Esimaors: Paricle and Kalman Filers McGill COMP 765 Sep 14 h, 2017 1 1 1, dx x Bel x u x P x z P Recall: Bayes Filers,,,,,,, 1 1 1 1 u z u x P u z u x z P Bayes z = observaion u =

More information

Applying Genetic Algorithms for Inventory Lot-Sizing Problem with Supplier Selection under Storage Capacity Constraints

Applying Genetic Algorithms for Inventory Lot-Sizing Problem with Supplier Selection under Storage Capacity Constraints IJCSI Inernaional Journal of Compuer Science Issues, Vol 9, Issue 1, No 1, January 2012 wwwijcsiorg 18 Applying Geneic Algorihms for Invenory Lo-Sizing Problem wih Supplier Selecion under Sorage Capaciy

More information

An introduction to the theory of SDDP algorithm

An introduction to the theory of SDDP algorithm An inroducion o he heory of SDDP algorihm V. Leclère (ENPC) Augus 1, 2014 V. Leclère Inroducion o SDDP Augus 1, 2014 1 / 21 Inroducion Large scale sochasic problem are hard o solve. Two ways of aacking

More information

Random Walk with Anti-Correlated Steps

Random Walk with Anti-Correlated Steps Random Walk wih Ani-Correlaed Seps John Noga Dirk Wagner 2 Absrac We conjecure he expeced value of random walks wih ani-correlaed seps o be exacly. We suppor his conjecure wih 2 plausibiliy argumens and

More information

Physics 235 Chapter 2. Chapter 2 Newtonian Mechanics Single Particle

Physics 235 Chapter 2. Chapter 2 Newtonian Mechanics Single Particle Chaper 2 Newonian Mechanics Single Paricle In his Chaper we will review wha Newon s laws of mechanics ell us abou he moion of a single paricle. Newon s laws are only valid in suiable reference frames,

More information

Sequential Importance Resampling (SIR) Particle Filter

Sequential Importance Resampling (SIR) Particle Filter Paricle Filers++ Pieer Abbeel UC Berkeley EECS Many slides adaped from Thrun, Burgard and Fox, Probabilisic Roboics 1. Algorihm paricle_filer( S -1, u, z ): 2. Sequenial Imporance Resampling (SIR) Paricle

More information

0.1 MAXIMUM LIKELIHOOD ESTIMATION EXPLAINED

0.1 MAXIMUM LIKELIHOOD ESTIMATION EXPLAINED 0.1 MAXIMUM LIKELIHOOD ESTIMATIO EXPLAIED Maximum likelihood esimaion is a bes-fi saisical mehod for he esimaion of he values of he parameers of a sysem, based on a se of observaions of a random variable

More information

Bias in Conditional and Unconditional Fixed Effects Logit Estimation: a Correction * Tom Coupé

Bias in Conditional and Unconditional Fixed Effects Logit Estimation: a Correction * Tom Coupé Bias in Condiional and Uncondiional Fixed Effecs Logi Esimaion: a Correcion * Tom Coupé Economics Educaion and Research Consorium, Naional Universiy of Kyiv Mohyla Academy Address: Vul Voloska 10, 04070

More information

Online Convex Optimization Example And Follow-The-Leader

Online Convex Optimization Example And Follow-The-Leader CSE599s, Spring 2014, Online Learning Lecure 2-04/03/2014 Online Convex Opimizaion Example And Follow-The-Leader Lecurer: Brendan McMahan Scribe: Sephen Joe Jonany 1 Review of Online Convex Opimizaion

More information

Temporal Abstraction in Temporal-difference Networks

Temporal Abstraction in Temporal-difference Networks Temporal Absracion in Temporal-difference Neworks Richard S. Suon, Eddie J. Rafols, Anna Koop Deparmen of Compuing Science Universiy of Albera Edmonon, AB, Canada T6G 2E8 {suon,erafols,anna}@cs.ualbera.ca

More information

23.5. Half-Range Series. Introduction. Prerequisites. Learning Outcomes

23.5. Half-Range Series. Introduction. Prerequisites. Learning Outcomes Half-Range Series 2.5 Inroducion In his Secion we address he following problem: Can we find a Fourier series expansion of a funcion defined over a finie inerval? Of course we recognise ha such a funcion

More information

Deep Learning: Theory, Techniques & Applications - Recurrent Neural Networks -

Deep Learning: Theory, Techniques & Applications - Recurrent Neural Networks - Deep Learning: Theory, Techniques & Applicaions - Recurren Neural Neworks - Prof. Maeo Maeucci maeo.maeucci@polimi.i Deparmen of Elecronics, Informaion and Bioengineering Arificial Inelligence and Roboics

More information

A NOTE ON THE STRUCTURE OF BILATTICES. A. Avron. School of Mathematical Sciences. Sackler Faculty of Exact Sciences. Tel Aviv University

A NOTE ON THE STRUCTURE OF BILATTICES. A. Avron. School of Mathematical Sciences. Sackler Faculty of Exact Sciences. Tel Aviv University A NOTE ON THE STRUCTURE OF BILATTICES A. Avron School of Mahemaical Sciences Sacler Faculy of Exac Sciences Tel Aviv Universiy Tel Aviv 69978, Israel The noion of a bilaice was rs inroduced by Ginsburg

More information

1. An introduction to dynamic optimization -- Optimal Control and Dynamic Programming AGEC

1. An introduction to dynamic optimization -- Optimal Control and Dynamic Programming AGEC This documen was generaed a :45 PM 8/8/04 Copyrigh 04 Richard T. Woodward. An inroducion o dynamic opimizaion -- Opimal Conrol and Dynamic Programming AGEC 637-04 I. Overview of opimizaion Opimizaion is

More information

Reading from Young & Freedman: For this topic, read sections 25.4 & 25.5, the introduction to chapter 26 and sections 26.1 to 26.2 & 26.4.

Reading from Young & Freedman: For this topic, read sections 25.4 & 25.5, the introduction to chapter 26 and sections 26.1 to 26.2 & 26.4. PHY1 Elecriciy Topic 7 (Lecures 1 & 11) Elecric Circuis n his opic, we will cover: 1) Elecromoive Force (EMF) ) Series and parallel resisor combinaions 3) Kirchhoff s rules for circuis 4) Time dependence

More information

Math 2142 Exam 1 Review Problems. x 2 + f (0) 3! for the 3rd Taylor polynomial at x = 0. To calculate the various quantities:

Math 2142 Exam 1 Review Problems. x 2 + f (0) 3! for the 3rd Taylor polynomial at x = 0. To calculate the various quantities: Mah 4 Eam Review Problems Problem. Calculae he 3rd Taylor polynomial for arcsin a =. Soluion. Le f() = arcsin. For his problem, we use he formula f() + f () + f ()! + f () 3! for he 3rd Taylor polynomial

More information

Single-Pass-Based Heuristic Algorithms for Group Flexible Flow-shop Scheduling Problems

Single-Pass-Based Heuristic Algorithms for Group Flexible Flow-shop Scheduling Problems Single-Pass-Based Heurisic Algorihms for Group Flexible Flow-shop Scheduling Problems PEI-YING HUANG, TZUNG-PEI HONG 2 and CHENG-YAN KAO, 3 Deparmen of Compuer Science and Informaion Engineering Naional

More information

Matrix Versions of Some Refinements of the Arithmetic-Geometric Mean Inequality

Matrix Versions of Some Refinements of the Arithmetic-Geometric Mean Inequality Marix Versions of Some Refinemens of he Arihmeic-Geomeric Mean Inequaliy Bao Qi Feng and Andrew Tonge Absrac. We esablish marix versions of refinemens due o Alzer ], Carwrigh and Field 4], and Mercer 5]

More information

Supplement for Stochastic Convex Optimization: Faster Local Growth Implies Faster Global Convergence

Supplement for Stochastic Convex Optimization: Faster Local Growth Implies Faster Global Convergence Supplemen for Sochasic Convex Opimizaion: Faser Local Growh Implies Faser Global Convergence Yi Xu Qihang Lin ianbao Yang Proof of heorem heorem Suppose Assumpion holds and F (w) obeys he LGC (6) Given

More information

SOLUTIONS TO ECE 3084

SOLUTIONS TO ECE 3084 SOLUTIONS TO ECE 384 PROBLEM 2.. For each sysem below, specify wheher or no i is: (i) memoryless; (ii) causal; (iii) inverible; (iv) linear; (v) ime invarian; Explain your reasoning. If he propery is no

More information

Speaker Adaptation Techniques For Continuous Speech Using Medium and Small Adaptation Data Sets. Constantinos Boulis

Speaker Adaptation Techniques For Continuous Speech Using Medium and Small Adaptation Data Sets. Constantinos Boulis Speaker Adapaion Techniques For Coninuous Speech Using Medium and Small Adapaion Daa Ses Consaninos Boulis Ouline of he Presenaion Inroducion o he speaker adapaion problem Maximum Likelihood Sochasic Transformaions

More information

More Digital Logic. t p output. Low-to-high and high-to-low transitions could have different t p. V in (t)

More Digital Logic. t p output. Low-to-high and high-to-low transitions could have different t p. V in (t) EECS 4 Spring 23 Lecure 2 EECS 4 Spring 23 Lecure 2 More igial Logic Gae delay and signal propagaion Clocked circui elemens (flip-flop) Wriing a word o memory Simplifying digial circuis: Karnaugh maps

More information

Chapter 21. Reinforcement Learning. The Reinforcement Learning Agent

Chapter 21. Reinforcement Learning. The Reinforcement Learning Agent CSE 47 Chaper Reinforcemen Learning The Reinforcemen Learning Agen Agen Sae u Reward r Acion a Enironmen CSE AI Faculy Why reinforcemen learning Programming an agen o drie a car or fly a helicoper is ery

More information

In this chapter the model of free motion under gravity is extended to objects projected at an angle. When you have completed it, you should

In this chapter the model of free motion under gravity is extended to objects projected at an angle. When you have completed it, you should Cambridge Universiy Press 978--36-60033-7 Cambridge Inernaional AS and A Level Mahemaics: Mechanics Coursebook Excerp More Informaion Chaper The moion of projeciles In his chaper he model of free moion

More information

Chapter 8 The Complete Response of RL and RC Circuits

Chapter 8 The Complete Response of RL and RC Circuits Chaper 8 The Complee Response of RL and RC Circuis Seoul Naional Universiy Deparmen of Elecrical and Compuer Engineering Wha is Firs Order Circuis? Circuis ha conain only one inducor or only one capacior

More information

2.7. Some common engineering functions. Introduction. Prerequisites. Learning Outcomes

2.7. Some common engineering functions. Introduction. Prerequisites. Learning Outcomes Some common engineering funcions 2.7 Inroducion This secion provides a caalogue of some common funcions ofen used in Science and Engineering. These include polynomials, raional funcions, he modulus funcion

More information

Seminar 4: Hotelling 2

Seminar 4: Hotelling 2 Seminar 4: Hoelling 2 November 3, 211 1 Exercise Par 1 Iso-elasic demand A non renewable resource of a known sock S can be exraced a zero cos. Demand for he resource is of he form: D(p ) = p ε ε > A a

More information

Learning Objectives: Practice designing and simulating digital circuits including flip flops Experience state machine design procedure

Learning Objectives: Practice designing and simulating digital circuits including flip flops Experience state machine design procedure Lab 4: Synchronous Sae Machine Design Summary: Design and implemen synchronous sae machine circuis and es hem wih simulaions in Cadence Viruoso. Learning Objecives: Pracice designing and simulaing digial

More information

Hamilton- J acobi Equation: Explicit Formulas In this lecture we try to apply the method of characteristics to the Hamilton-Jacobi equation: u t

Hamilton- J acobi Equation: Explicit Formulas In this lecture we try to apply the method of characteristics to the Hamilton-Jacobi equation: u t M ah 5 2 7 Fall 2 0 0 9 L ecure 1 0 O c. 7, 2 0 0 9 Hamilon- J acobi Equaion: Explici Formulas In his lecure we ry o apply he mehod of characerisics o he Hamilon-Jacobi equaion: u + H D u, x = 0 in R n

More information

Inventory Control of Perishable Items in a Two-Echelon Supply Chain

Inventory Control of Perishable Items in a Two-Echelon Supply Chain Journal of Indusrial Engineering, Universiy of ehran, Special Issue,, PP. 69-77 69 Invenory Conrol of Perishable Iems in a wo-echelon Supply Chain Fariborz Jolai *, Elmira Gheisariha and Farnaz Nojavan

More information

Off-policy TD(λ) with a true online equivalence

Off-policy TD(λ) with a true online equivalence Off-policy TD(λ) wih a rue online equivalence Hado van Hassel A Rupam Mahmood Richard S Suon Reinforcemen Learning and Arificial Inelligence Laboraory Universiy of Albera, Edmonon, AB T6G 2E8 Canada Absrac

More information

Navneet Saini, Mayank Goyal, Vishal Bansal (2013); Term Project AML310; Indian Institute of Technology Delhi

Navneet Saini, Mayank Goyal, Vishal Bansal (2013); Term Project AML310; Indian Institute of Technology Delhi Creep in Viscoelasic Subsances Numerical mehods o calculae he coefficiens of he Prony equaion using creep es daa and Herediary Inegrals Mehod Navnee Saini, Mayank Goyal, Vishal Bansal (23); Term Projec

More information

6.2 Transforms of Derivatives and Integrals.

6.2 Transforms of Derivatives and Integrals. SEC. 6.2 Transforms of Derivaives and Inegrals. ODEs 2 3 33 39 23. Change of scale. If l( f ()) F(s) and c is any 33 45 APPLICATION OF s-shifting posiive consan, show ha l( f (c)) F(s>c)>c (Hin: In Probs.

More information

This document was generated at 1:04 PM, 09/10/13 Copyright 2013 Richard T. Woodward. 4. End points and transversality conditions AGEC

This document was generated at 1:04 PM, 09/10/13 Copyright 2013 Richard T. Woodward. 4. End points and transversality conditions AGEC his documen was generaed a 1:4 PM, 9/1/13 Copyrigh 213 Richard. Woodward 4. End poins and ransversaliy condiions AGEC 637-213 F z d Recall from Lecure 3 ha a ypical opimal conrol problem is o maimize (,,

More information

Some Ramsey results for the n-cube

Some Ramsey results for the n-cube Some Ramsey resuls for he n-cube Ron Graham Universiy of California, San Diego Jozsef Solymosi Universiy of Briish Columbia, Vancouver, Canada Absrac In his noe we esablish a Ramsey-ype resul for cerain

More information

FINM 6900 Finance Theory

FINM 6900 Finance Theory FINM 6900 Finance Theory Universiy of Queensland Lecure Noe 4 The Lucas Model 1. Inroducion In his lecure we consider a simple endowmen economy in which an unspecified number of raional invesors rade asses

More information

The expectation value of the field operator.

The expectation value of the field operator. The expecaion value of he field operaor. Dan Solomon Universiy of Illinois Chicago, IL dsolom@uic.edu June, 04 Absrac. Much of he mahemaical developmen of quanum field heory has been in suppor of deermining

More information

Energy Storage Benchmark Problems

Energy Storage Benchmark Problems Energy Sorage Benchmark Problems Daniel F. Salas 1,3, Warren B. Powell 2,3 1 Deparmen of Chemical & Biological Engineering 2 Deparmen of Operaions Research & Financial Engineering 3 Princeon Laboraory

More information

Notes on Kalman Filtering

Notes on Kalman Filtering Noes on Kalman Filering Brian Borchers and Rick Aser November 7, Inroducion Daa Assimilaion is he problem of merging model predicions wih acual measuremens of a sysem o produce an opimal esimae of he curren

More information

Problem Set 5. Graduate Macro II, Spring 2017 The University of Notre Dame Professor Sims

Problem Set 5. Graduate Macro II, Spring 2017 The University of Notre Dame Professor Sims Problem Se 5 Graduae Macro II, Spring 2017 The Universiy of Nore Dame Professor Sims Insrucions: You may consul wih oher members of he class, bu please make sure o urn in your own work. Where applicable,

More information

1. An introduction to dynamic optimization -- Optimal Control and Dynamic Programming AGEC

1. An introduction to dynamic optimization -- Optimal Control and Dynamic Programming AGEC This documen was generaed a :37 PM, 1/11/018 Copyrigh 018 Richard T. Woodward 1. An inroducion o dynamic opimiaion -- Opimal Conrol and Dynamic Programming AGEC 64-018 I. Overview of opimiaion Opimiaion

More information

1 Differential Equation Investigations using Customizable

1 Differential Equation Investigations using Customizable Differenial Equaion Invesigaions using Cusomizable Mahles Rober Decker The Universiy of Harford Absrac. The auhor has developed some plaform independen, freely available, ineracive programs (mahles) for

More information

Guest Lectures for Dr. MacFarlane s EE3350 Part Deux

Guest Lectures for Dr. MacFarlane s EE3350 Part Deux Gues Lecures for Dr. MacFarlane s EE3350 Par Deux Michael Plane Mon., 08-30-2010 Wrie name in corner. Poin ou his is a review, so I will go faser. Remind hem o go lisen o online lecure abou geing an A

More information

Lecture 4 Notes (Little s Theorem)

Lecture 4 Notes (Little s Theorem) Lecure 4 Noes (Lile s Theorem) This lecure concerns one of he mos imporan (and simples) heorems in Queuing Theory, Lile s Theorem. More informaion can be found in he course book, Bersekas & Gallagher,

More information

15. Vector Valued Functions

15. Vector Valued Functions 1. Vecor Valued Funcions Up o his poin, we have presened vecors wih consan componens, for example, 1, and,,4. However, we can allow he componens of a vecor o be funcions of a common variable. For example,

More information

t is a basis for the solution space to this system, then the matrix having these solutions as columns, t x 1 t, x 2 t,... x n t x 2 t...

t is a basis for the solution space to this system, then the matrix having these solutions as columns, t x 1 t, x 2 t,... x n t x 2 t... Mah 228- Fri Mar 24 5.6 Marix exponenials and linear sysems: The analogy beween firs order sysems of linear differenial equaions (Chaper 5) and scalar linear differenial equaions (Chaper ) is much sronger

More information

Some Basic Information about M-S-D Systems

Some Basic Information about M-S-D Systems Some Basic Informaion abou M-S-D Sysems 1 Inroducion We wan o give some summary of he facs concerning unforced (homogeneous) and forced (non-homogeneous) models for linear oscillaors governed by second-order,

More information

20. Applications of the Genetic-Drift Model

20. Applications of the Genetic-Drift Model 0. Applicaions of he Geneic-Drif Model 1) Deermining he probabiliy of forming any paricular combinaion of genoypes in he nex generaion: Example: If he parenal allele frequencies are p 0 = 0.35 and q 0

More information

International Journal of Scientific & Engineering Research, Volume 4, Issue 10, October ISSN

International Journal of Scientific & Engineering Research, Volume 4, Issue 10, October ISSN Inernaional Journal of Scienific & Engineering Research, Volume 4, Issue 10, Ocober-2013 900 FUZZY MEAN RESIDUAL LIFE ORDERING OF FUZZY RANDOM VARIABLES J. EARNEST LAZARUS PIRIYAKUMAR 1, A. YAMUNA 2 1.

More information

Lab 10: RC, RL, and RLC Circuits

Lab 10: RC, RL, and RLC Circuits Lab 10: RC, RL, and RLC Circuis In his experimen, we will invesigae he behavior of circuis conaining combinaions of resisors, capaciors, and inducors. We will sudy he way volages and currens change in

More information

) were both constant and we brought them from under the integral.

) were both constant and we brought them from under the integral. YIELD-PER-RECRUIT (coninued The yield-per-recrui model applies o a cohor, bu we saw in he Age Disribuions lecure ha he properies of a cohor do no apply in general o a collecion of cohors, which is wha

More information

CHAPTER 12 DIRECT CURRENT CIRCUITS

CHAPTER 12 DIRECT CURRENT CIRCUITS CHAPTER 12 DIRECT CURRENT CIUITS DIRECT CURRENT CIUITS 257 12.1 RESISTORS IN SERIES AND IN PARALLEL When wo resisors are conneced ogeher as shown in Figure 12.1 we said ha hey are conneced in series. As

More information

A Specification Test for Linear Dynamic Stochastic General Equilibrium Models

A Specification Test for Linear Dynamic Stochastic General Equilibrium Models Journal of Saisical and Economeric Mehods, vol.1, no.2, 2012, 65-70 ISSN: 2241-0384 (prin), 2241-0376 (online) Scienpress Ld, 2012 A Specificaion Tes for Linear Dynamic Sochasic General Equilibrium Models

More information

Analyze patterns and relationships. 3. Generate two numerical patterns using AC

Analyze patterns and relationships. 3. Generate two numerical patterns using AC envision ah 2.0 5h Grade ah Curriculum Quarer 1 Quarer 2 Quarer 3 Quarer 4 andards: =ajor =upporing =Addiional Firs 30 Day 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 andards: Operaions and Algebraic Thinking

More information

Retrieval Models. Boolean and Vector Space Retrieval Models. Common Preprocessing Steps. Boolean Model. Boolean Retrieval Model

Retrieval Models. Boolean and Vector Space Retrieval Models. Common Preprocessing Steps. Boolean Model. Boolean Retrieval Model 1 Boolean and Vecor Space Rerieval Models Many slides in his secion are adaped from Prof. Joydeep Ghosh (UT ECE) who in urn adaped hem from Prof. Dik Lee (Univ. of Science and Tech, Hong Kong) Rerieval

More information

Finish reading Chapter 2 of Spivak, rereading earlier sections as necessary. handout and fill in some missing details!

Finish reading Chapter 2 of Spivak, rereading earlier sections as necessary. handout and fill in some missing details! MAT 257, Handou 6: Ocober 7-2, 20. I. Assignmen. Finish reading Chaper 2 of Spiva, rereading earlier secions as necessary. handou and fill in some missing deails! II. Higher derivaives. Also, read his

More information

On Boundedness of Q-Learning Iterates for Stochastic Shortest Path Problems

On Boundedness of Q-Learning Iterates for Stochastic Shortest Path Problems MATHEMATICS OF OPERATIONS RESEARCH Vol. 38, No. 2, May 2013, pp. 209 227 ISSN 0364-765X (prin) ISSN 1526-5471 (online) hp://dx.doi.org/10.1287/moor.1120.0562 2013 INFORMS On Boundedness of Q-Learning Ieraes

More information

Mixing times and hitting times: lecture notes

Mixing times and hitting times: lecture notes Miing imes and hiing imes: lecure noes Yuval Peres Perla Sousi 1 Inroducion Miing imes and hiing imes are among he mos fundamenal noions associaed wih a finie Markov chain. A variey of ools have been developed

More information

Sections 2.2 & 2.3 Limit of a Function and Limit Laws

Sections 2.2 & 2.3 Limit of a Function and Limit Laws Mah 80 www.imeodare.com Secions. &. Limi of a Funcion and Limi Laws In secion. we saw how is arise when we wan o find he angen o a curve or he velociy of an objec. Now we urn our aenion o is in general

More information

Longest Common Prefixes

Longest Common Prefixes Longes Common Prefixes The sandard ordering for srings is he lexicographical order. I is induced by an order over he alphabe. We will use he same symbols (,

More information

ACE 562 Fall Lecture 5: The Simple Linear Regression Model: Sampling Properties of the Least Squares Estimators. by Professor Scott H.

ACE 562 Fall Lecture 5: The Simple Linear Regression Model: Sampling Properties of the Least Squares Estimators. by Professor Scott H. ACE 56 Fall 005 Lecure 5: he Simple Linear Regression Model: Sampling Properies of he Leas Squares Esimaors by Professor Sco H. Irwin Required Reading: Griffihs, Hill and Judge. "Inference in he Simple

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION SUPPLEMENTARY INFORMATION DOI: 0.038/NCLIMATE893 Temporal resoluion and DICE * Supplemenal Informaion Alex L. Maren and Sephen C. Newbold Naional Cener for Environmenal Economics, US Environmenal Proecion

More information

Phys1112: DC and RC circuits

Phys1112: DC and RC circuits Name: Group Members: Dae: TA s Name: Phys1112: DC and RC circuis Objecives: 1. To undersand curren and volage characerisics of a DC RC discharging circui. 2. To undersand he effec of he RC ime consan.

More information

Rapid Termination Evaluation for Recursive Subdivision of Bezier Curves

Rapid Termination Evaluation for Recursive Subdivision of Bezier Curves Rapid Terminaion Evaluaion for Recursive Subdivision of Bezier Curves Thomas F. Hain School of Compuer and Informaion Sciences, Universiy of Souh Alabama, Mobile, AL, U.S.A. Absrac Bézier curve flaening

More information

Economics 8105 Macroeconomic Theory Recitation 6

Economics 8105 Macroeconomic Theory Recitation 6 Economics 8105 Macroeconomic Theory Reciaion 6 Conor Ryan Ocober 11h, 2016 Ouline: Opimal Taxaion wih Governmen Invesmen 1 Governmen Expendiure in Producion In hese noes we will examine a model in which

More information

An Analysis of Actor/Critic Algorithms using Eligibility Traces: Reinforcement Learning with Imperfect Value Functions

An Analysis of Actor/Critic Algorithms using Eligibility Traces: Reinforcement Learning with Imperfect Value Functions An Analysis of Acor/Criic Algorihms using Eligibiliy Traces: Reinforcemen Learning wih Imperfec Value Funcions Hajime Kimura 3 Tokyo Insiue of Technology gen@fe.dis.iech.ac.jp Shigenobu Kobayashi Tokyo

More information

Vehicle Arrival Models : Headway

Vehicle Arrival Models : Headway Chaper 12 Vehicle Arrival Models : Headway 12.1 Inroducion Modelling arrival of vehicle a secion of road is an imporan sep in raffic flow modelling. I has imporan applicaion in raffic flow simulaion where

More information

Bias-Variance Error Bounds for Temporal Difference Updates

Bias-Variance Error Bounds for Temporal Difference Updates Bias-Variance Bounds for Temporal Difference Updaes Michael Kearns AT&T Labs mkearns@research.a.com Sainder Singh AT&T Labs baveja@research.a.com Absrac We give he firs rigorous upper bounds on he error

More information