arxiv: v1 [stat.ml] 9 Aug 2016


On Lower Bounds for Regret in Reinforcement Learning

Ian Osband (Stanford University, Google DeepMind)
Benjamin Van Roy (Stanford University)

August 10, 2016

1 Introduction

This is a brief technical note to clarify the state of lower bounds on regret for reinforcement learning. In particular, this paper:

- Reproduces a lower bound on regret for reinforcement learning, similar to the result of Theorem 5 in the journal UCRL2 paper (Jaksch et al., 2010).
- Clarifies that the proposed proof of Theorem 6 in the REGAL paper (Bartlett and Tewari, 2009) does not hold using the standard techniques without further work. We suggest that this result should instead be considered a conjecture, as it has no rigorous proof.
- Suggests that the conjectured lower bound given by Bartlett and Tewari (2009) is incorrect and that, in fact, it is possible to improve the scaling of the upper bound to match the weaker lower bounds presented in this paper.

2 Problem formulation

We consider the problem of learning to optimize an unknown MDP $M^* = (\mathcal{S}, \mathcal{A}, R^*, P^*)$. $\mathcal{S} = \{1, .., S\}$ is the state space and $\mathcal{A} = \{1, .., A\}$ is the action space. In each timestep $t = 1, 2, ..$ the agent observes a state $s_t \in \mathcal{S}$, selects an action $a_t \in \mathcal{A}$, receives a reward $r_t \sim R^*(s_t, a_t) \in [0, 1]$ and transitions to a new state $s_{t+1} \sim P^*(s_t, a_t)$. We define all random variables with respect to a probability space $(\Omega, \mathcal{F}, \mathbb{P})$.

A policy $\mu$ is a mapping from state $s \in \mathcal{S}$ to action $a \in \mathcal{A}$. For an MDP $M$ and any policy $\mu$ we define the long run average reward starting from state $s$:

$$\lambda^M_\mu(s) := \lim_{T \to \infty} \mathbb{E}_{M,\mu}\left[ \frac{1}{T} \sum_{t=1}^{T} \bar{r}^M(s_t, a_t) \,\middle|\, s_1 = s \right], \qquad (1)$$

where $\bar{r}^M(s, a) := \mathbb{E}[r \mid r \sim R^M(s, a)]$. The subscripts $M, \mu$ indicate that the MDP evolves under $M$ with policy $\mu$. A policy $\mu^M$ is optimal for the MDP $M$ if $\mu^M \in \arg\max_\mu \lambda^M_\mu(s)$ for all $s \in \mathcal{S}$. For the unknown MDP $M^*$ we will often abbreviate sub/superscripts to simply $*$, for example $\lambda^*$ for $\lambda^{M^*}_{\mu^{M^*}}$.

Let $H_t = (s_1, a_1, r_1, .., s_{t-1}, a_{t-1}, r_{t-1})$ denote the history of observations made prior to time $t$. A reinforcement learning algorithm is a deterministic sequence $\{\pi_t \mid t = 1, 2, ..\}$ of functions, each mapping $H_t$ to a probability distribution $\pi_t(H_t)$ over policies, from which the agent samples a policy $\mu_t$ at timestep $t$. We define the regret of a reinforcement learning algorithm $\pi$ up to time $T$:

$$\mathrm{Regret}(T, \pi, M^*, s) := \sum_{t=1}^{T} \{\lambda^*(s) - r_t\} \mid s_1 = s. \qquad (2)$$

The regret of a learning algorithm measures how much worse it performs than the optimal policy in terms of cumulative reward. Any algorithm with $o(T)$ regret will eventually learn the optimal policy. Note that the regret is random, since it depends on the unknown MDP $M^*$, the random sampling of policies and, through the history $H_t$, on the previous transitions and rewards. We will assess and compare algorithm performance in terms of the regret.

2.1 Finite horizon MDPs

We now spend a little time relating the formulation above to so-called finite horizon MDPs (Osband et al., 2013; Dann and Brunskill, 2015). In this setting, an agent interacts repeatedly with an environment over $H \in \mathbb{N}$ timesteps, which we call an episode. A finite horizon MDP $M^* = (\mathcal{S}, \mathcal{A}, R^*, P^*, H, \rho)$ is defined as above, but every $H$ timesteps the state resets according to some initial distribution $\rho$. We call $H$ the horizon of the MDP.

In a finite horizon MDP a typical policy may depend on both the state $s \in \mathcal{S}$ and the timestep $h$ within the episode. To be explicit, we define a policy $\mu$ as a mapping from state $s \in \mathcal{S}$ and period $h = 1, .., H$ to action $a \in \mathcal{A}$. For each MDP $M = (\mathcal{S}, \mathcal{A}, R^M, P^M, H, \rho)$ and policy $\mu$ we define the state-action value function for each period $h$:

$$Q^M_{\mu,h}(s, a) := \mathbb{E}_{M,\mu}\left[ \sum_{j=h}^{H} \bar{r}^M(s_j, a_j) \,\middle|\, s_h = s, a_h = a \right], \qquad (3)$$

and $V^M_{\mu,h}(s) := Q^M_{\mu,h}(s, \mu(s, h))$. Once again, we say a policy $\mu^M$ is optimal for the MDP $M$ if $\mu^M \in \arg\max_\mu V^M_{\mu,h}(s)$ for all $s \in \mathcal{S}$ and $h = 1, ..., H$.

At first glance this might seem at odds with the formulation in Section 2. However, finite horizon MDPs can be thought of as a special case of Section 2 in the expanded state space $\tilde{\mathcal{S}} := \mathcal{S} \times \{1, .., H\}$. In this case it is typical to assume that the agent knows a priori that time $h$ evolves deterministically. To highlight this time evolution within episodes, with some abuse of notation, we let $s_{kh} = s_t$ for $t = (k-1)H + h$, so that $s_{kh}$ is the state in period $h$ of episode $k$. We define $H_{kh}$ analogously.

3 Multi-armed bandit

We call the degenerate MDP with only one state, $S = 1$, a multi-armed bandit with independent arms (Lai and Robbins, 1985). In this setting the actions $a_t \in \mathcal{A}$ are often called arms and the optimal average reward is simply the average reward of the best arm, $\lambda^* = \max_a \bar{r}^*(a)$.
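As a small illustration of the regret definition (2), the sketch below computes the realised regret of one reward trajectory. The function name `regret` and the example trajectory are hypothetical and chosen only for illustration; the optimal average reward is assumed to be known, which is never the case for a learning agent.

```python
def regret(optimal_average_reward, rewards):
    """Realised regret: sum over t of (lambda* - r_t), as in equation (2)."""
    return sum(optimal_average_reward - r for r in rewards)


# Hypothetical trajectory of T = 5 rewards in an environment with lambda* = 0.8.
rewards = [0.0, 1.0, 1.0, 0.0, 1.0]
total = regret(0.8, rewards)       # 5 * 0.8 - 3, up to floating point error
per_step = total / len(rewards)    # average shortfall per timestep
```

An algorithm has $o(T)$ regret exactly when the per-step quantity computed this way tends to zero as the trajectory grows.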

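We now reproduce a lower bound on regret for any learning algorithm in a multi-armed bandit (Bubeck and Cesa-Bianchi, 2012).

Theorem 1 (Lower bound on regret in bandits). Let sup be the supremum over all distributions of rewards such that for each $a = 1, .., A$ the rewards $r^a_1, .., r^a_T \in \{0, 1\}$ are i.i.d., and let inf be the infimum over all reinforcement learning algorithms. Then

$$\inf \sup \left( T \max_a \bar{r}(a) - \mathbb{E}\left[ \sum_{t=1}^{T} \bar{r}(a_t) \right] \right) = \Omega\left(\sqrt{AT}\right). \qquad (4)$$

At a high level, Theorem 1 says that no matter what learning algorithm you choose, there will always be some environment which gives your algorithm $\Omega(\sqrt{AT})$ regret. This is a powerful result, since it means that if we can design an algorithm with an upper bound on regret of $O(\sqrt{AT})$ then this algorithm is in some sense near-optimal (Bubeck and Cesa-Bianchi, 2012).

The intuition for the proof is relatively simple and is presented in Bubeck and Cesa-Bianchi (2012). After any $T$ timesteps there must be some arm which has been pulled at most $T/A$ times. Standard concentration results state that the estimate of a random variable can only be accurate up to $O(1/\sqrt{n})$, where $n$ is the number of observations. Therefore, for the arm with $n \le T/A$ it is difficult to distinguish between $\mathrm{Ber}(1/2)$ and $\mathrm{Ber}(1/2 + \sqrt{A/T})$. This means that, if every arm is $\mathrm{Ber}(1/2)$ but one is $\mathrm{Ber}(1/2 + \sqrt{A/T})$, any algorithm would incur $\Omega(T\sqrt{A/T}) = \Omega(\sqrt{AT})$ regret. In the next section we will see how to make this argument more rigorous.

3.1 Proof of Theorem 1

We consider the problem where all arms are i.i.d. Bernoulli with parameter $\delta$, but one arm $a^*$ has parameter $\delta + \epsilon$ for some $\delta, \epsilon > 0$. We define auxiliary rewards $\tilde{r}^a_t = r^a_t$ for all $a \neq a^*$, but with the rewards of the action $a = a^*$ replaced by the draw $\tilde{r}^{a^*}_t \sim \mathrm{Ber}(\delta)$. We consider an auxiliary sequence of actions $\tilde{a}_t \sim \pi_t(\tilde{H}_t)$, where $\tilde{H}_t = (\tilde{a}_1, \tilde{r}_1, .., \tilde{a}_{t-1}, \tilde{r}_{t-1})$ is the history generated by an agent with no feedback informing them about $a^*$. We introduce the notation $n_a := |\{a_t = a \mid t = 1, .., T\}|$ and $\tilde{n}_a := |\{\tilde{a}_t = a \mid t = 1, .., T\}|$ to denote the number of times arm $a$ has been selected by time $T$ under $a_t$ and $\tilde{a}_t$ respectively. The following lemma establishes a lower bound on the regret realized by the actions $\tilde{a}_t$.

Lemma 1 (Regret of an uninformed agent). For all $\delta, \epsilon > 0$ and all learning algorithms $\pi$,

$$T \max_a \bar{r}(a) - \mathbb{E}\left[ \sum_{t=1}^{T} \bar{r}(\tilde{a}_t) \right] \ge \frac{A - 1}{A} \, \epsilon T.$$

Proof. We have

$$T \max_a \bar{r}(a) - \mathbb{E}\left[ \sum_{t=1}^{T} \bar{r}(\tilde{a}_t) \right] = \mathbb{E}\left[ \sum_{a \neq a^*} \tilde{n}_a \right] \epsilon = \epsilon \left( T - \mathbb{E}[\tilde{n}_{a^*}] \right) = \epsilon T \left( 1 - \frac{1}{A} \right), \qquad (5)$$

where the last step follows from a symmetry argument, since $a^*$ is independent of $\tilde{n}_a$ for all actions $a$.

We now establish that, if $\epsilon$ is sufficiently small, then over a limited time horizon the distribution of the rewards $r^{a_t}_t$ cannot be significantly different from that of the outcomes $\tilde{r}^{\tilde{a}_t}_t$. We compare the conditional distribution $P$ over the choice of actions with the distribution $\tilde{P}$ over the choice of actions which would have arisen under the uninformative data $\tilde{H}_t$. To be more precise, we define $P(z_t \mid H_t) := \mathbb{P}(r^{a_t}_t = z_t \mid H_t)$ and $\tilde{P}(z_t \mid \tilde{H}_t) := \mathbb{P}(\tilde{r}^{\tilde{a}_t}_t = z_t \mid \tilde{H}_t)$. We write $r^T_t := (r^{a_t}_t, .., r^{a_T}_T)$ for the sequence of rewards from time $t$ to $T$, and similarly for $\tilde{r}^T_t$. To quantify the difference between two distributions we will employ the following notion of KL divergence:

$$d_{KL}\left( \tilde{P}(z_t \mid \tilde{H}_t), P(z_t \mid H_t) \right) = \mathbb{E}_{\tilde{P}(z_t \mid \tilde{H}_t)}\left[ \log \frac{\tilde{P}(z_t \mid \tilde{H}_t)}{P(z_t \mid H_t)} \right]. \qquad (6)$$

Lemma 2 (KL divergence of uninformed distribution). For all $\delta, \epsilon > 0$ and all learning algorithms $\pi$,

$$d_{KL}\left( \tilde{P}(z^T_1 \mid \tilde{H}_t), P(z^T_1 \mid H_t) \right) \le \frac{T}{A} \left( \delta \log \frac{\delta}{\delta + \epsilon} + (1 - \delta) \log \frac{1 - \delta}{1 - \delta - \epsilon} \right).$$

Proof. We can apply the chain rule of KL divergence (Bubeck and Cesa-Bianchi, 2012) to obtain

$$d_{KL}\left( \tilde{P}(z^T_1 \mid \tilde{H}_t), P(z^T_1 \mid H_t) \right) = \sum_{t=1}^{T} d_{KL}\left( \tilde{P}(z_t \mid \tilde{H}_t), P(z_t \mid H_t) \right).$$

It follows that

$$d_{KL}\left( \tilde{P}(z_t \mid \tilde{H}_t), P(z_t \mid H_t) \right) = \mathbb{P}(\tilde{a}_t = a^*) \left( \delta \log \frac{\delta}{\delta + \epsilon} + (1 - \delta) \log \frac{1 - \delta}{1 - \delta - \epsilon} \right).$$

We conclude the proof by noting that the actions $\tilde{a}_t$ are selected independently of $a^*$, together with a symmetry argument.

We now use Pinsker's inequality to show that, if the distribution of actions $P$ is close to the choice of actions under uninformative data $\tilde{P}$, then the resulting regret is close to the regret of the uninformative policy.

Lemma 3 (Regret bound in terms of KL divergence). For all $\delta, \epsilon > 0$ and all learning algorithms $\pi$,

$$T \max_a \bar{r}(a) - \mathbb{E}\left[ \sum_{t=1}^{T} \bar{r}(a_t) \right] \ge \epsilon T \left( 1 - \frac{1}{A} - \sqrt{\tfrac{1}{2} \, d_{KL}\left( \tilde{P}(z^T_1), P(z^T_1) \right)} \right).$$

Proof. Pinsker's inequality gives us

$$\mathbb{E}[n_{a^*}] - \mathbb{E}[\tilde{n}_{a^*}] \le T \sqrt{\tfrac{1}{2} \, d_{KL}\left( \tilde{P}(z^T_1), P(z^T_1) \right)}.$$

Since $\mathbb{E}[\tilde{n}_{a^*}] = T/A$, it follows that

$$\mathbb{E}[n_{a^*}] \le T \sqrt{\tfrac{1}{2} \, d_{KL}\left( \tilde{P}(z^T_1), P(z^T_1) \right)} + \frac{T}{A}.$$

We complete the proof through simple substitution in Lemma 1.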
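The quantities used in Lemmas 2 and 3 are easy to evaluate numerically. The following is a minimal sketch, not part of the original note: the function names are hypothetical, logarithms are natural, and the total KL divergence is taken to be the Lemma 2 bound of $T/A$ times the per-step Bernoulli divergence.

```python
import math


def bernoulli_kl(p, q):
    """KL(Ber(p) || Ber(q)) in nats, for 0 < p, q < 1."""
    return p * math.log(p / q) + (1 - p) * math.log((1 - p) / (1 - q))


def pinsker_bound(kl):
    """Pinsker's inequality: total variation distance <= sqrt(kl / 2)."""
    return math.sqrt(kl / 2)


def lemma3_regret_lower_bound(T, A, delta, eps):
    """Right-hand side of Lemma 3 with d_KL replaced by the Lemma 2 bound."""
    kl_total = (T / A) * bernoulli_kl(delta, delta + eps)
    return eps * T * (1 - 1 / A - pinsker_bound(kl_total))


# Hypothetical instance: T = 10000 steps, A = 10 arms, delta = 0.25,
# and eps chosen so that eps^2 = delta * A / (8 * T).
T, A, delta = 10_000, 10, 0.25
eps = math.sqrt(delta * A / (8 * T))
bound = lemma3_regret_lower_bound(T, A, delta, eps)
```

For two Bernoulli distributions the total variation distance is simply $|p - q|$, so `pinsker_bound(bernoulli_kl(p, q))` can be compared directly against that gap.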

To complete the proof of Theorem 1 we can use Lemma 20 from Jaksch et al. (2010).

Proposition 1 (Bound on the KL divergence). For any $0 \le \delta \le \frac{1}{2}$ and $\epsilon \le 1 - 2\delta$ we have

$$\delta \log \frac{\delta}{\delta + \epsilon} + (1 - \delta) \log \frac{1 - \delta}{1 - \delta - \epsilon} \le \frac{\epsilon^2}{\delta \log 2}.$$

We combine Proposition 1 with Lemma 2 and Lemma 3 to say that, for all $\epsilon \le 1 - 2\delta$,

$$T \max_a \bar{r}(a) - \mathbb{E}\left[ \sum_{t=1}^{T} \bar{r}(a_t) \right] \ge \epsilon T \left( 1 - \frac{1}{A} - \epsilon \sqrt{\frac{T}{2 \delta A \log 2}} \right).$$

Setting $\epsilon^2 = \frac{\delta A}{8T}$ makes the bracket a positive constant whenever $A \ge 2$, so that the right-hand side is $\Omega(\sqrt{\delta A T})$. We can choose $\delta = 0.25$ to complete the proof of Theorem 1. We note that better constants are available through a more careful analysis, but this is not our focus in this work.

4 Reinforcement learning

In this section we work to extend the lower bound arguments from bandits to reinforcement learning with $S \ge 2$. As is common in the literature, we begin with a simple two state MDP with known rewards and unknown transitions (Jaksch et al., 2010; Bartlett and Tewari, 2009; Dann and Brunskill, 2015). It is relatively straightforward to extend this flavour of result to MDPs with $S > 2$ simply by concatenating $S/2$ copies of these smaller systems.

State 0 gives a reward of 0 and state 1 gives a reward of 1. All actions from state 0 follow the same law $P(0, a) = (1 - \delta_0, \delta_0)$. In state 1, $P(1, a) = (\delta_1, 1 - \delta_1)$ for all actions apart from $P(1, a^*) = (\delta_1 - \epsilon, 1 - \delta_1 + \epsilon)$. For this simple MDP we distinguish policies in terms of their action in state $s = 1$, since this is the only action which can influence the evolution of the MDP.

Figure 1: A two state MDP which is hard to learn. Dotted lines distinguish the unique optimal policy.

We define $\theta_1 := \frac{\delta_0}{\delta_0 + \delta_1}$ to be the average expected reward under a policy that plays a suboptimal action $a \neq a^*$ in state 1. For convenience we write $\delta^*_1 := \delta_1 - \epsilon$ for the distinguished optimal action and correspondingly $\theta^*_1 := \frac{\delta_0}{\delta_0 + \delta^*_1}$ for the average expected reward under the optimal policy.
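The closed form for $\theta_1$ can be checked against the dynamics of the two state MDP directly. The sketch below is illustrative only: the function names and the parameter values are hypothetical, and the long run average reward is approximated by propagating the probability of being in state 1 over a finite number of steps rather than by taking the limit.

```python
def theta(delta0, delta1):
    """Closed form: long run fraction of time spent in state 1."""
    return delta0 / (delta0 + delta1)


def iterated_average_reward(delta0, delta1, steps=20_000):
    """Average reward over `steps` timesteps, starting from state 0.

    p1 is the exact probability of being in state 1 at the current step.
    State 1 pays reward 1 and state 0 pays reward 0.
    """
    p1 = 0.0
    total = 0.0
    for _ in range(steps):
        total += p1
        # Move 0 -> 1 with probability delta0, stay in 1 with probability 1 - delta1.
        p1 = (1.0 - p1) * delta0 + p1 * (1.0 - delta1)
    return total / steps


delta0, delta1, eps = 0.1, 0.2, 0.05        # hypothetical parameter values
theta_1 = theta(delta0, delta1)             # suboptimal action in state 1
theta_1_star = theta(delta0, delta1 - eps)  # distinguished action a*
approx = iterated_average_reward(delta0, delta1)
```

The finite-horizon average differs from `theta_1` only through the initial transient, which decays geometrically at rate $1 - \delta_0 - \delta_1$.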

4.1 Sketch of REGAL-style lower bounds

In this section we present a quick overview of the style of argument that attempts to solidify the lower bound of Theorem 6 in Bartlett and Tewari (2009). We assume that $\delta_0 \ge \delta_1$ to bound the difference in optimal value,

$$\theta^*_1 - \theta_1 = \frac{\delta_0}{\delta_0 + \delta_1 - \epsilon} - \frac{\delta_0}{\delta_0 + \delta_1} = \frac{\delta_0 \epsilon}{(\delta_0 + \delta_1)(\delta_0 + \delta_1 - \epsilon)} > \frac{\delta_0 \epsilon}{(\delta_0 + \delta_1)^2} \ge \frac{\delta_0 \epsilon}{(2\delta_0)^2} = \frac{\epsilon}{4\delta_0}. \qquad (7)$$

Broadly speaking, this indicates that the agent should obtain expected regret $\Omega(\epsilon/\delta_0)$ every timestep it selects an action $a_t \neq a^*$ whilst in state $s = 1$. All other actions in any other state produce zero regret.

We now note that the problem described by Figure 1 is quite similar to the bandit example from Section 3. The difference here is that actions of the suboptimal arm give expected regret $O(\epsilon/\delta_0)$, rather than $\epsilon$. The arguments we present in this section can be thought of as an attempt to make the sketch proof for Theorem 6 of Bartlett and Tewari (2009) more explicit, if not entirely rigorous.

Our arguments follow the same structure as Section 3: we consider an auxiliary MDP where the optimal action $a^*$ has been replaced by another action with identical transition dynamics. We write $\tilde{a}_t$ for the actions which are taken by this uninformed policy and $\tilde{H}_t$ for the uninformative history that it generates. We begin with a result of similar flavour to Lemma 1.

Lemma 4 (Regret of an uninformed agent). In the environment of Figure 1, for all $\delta, \epsilon > 0$ and all learning algorithms $\pi$,

$$T \max_a \bar{r}(a) - \mathbb{E}\left[ \sum_{t=1}^{T} \bar{r}(\tilde{a}_t) \right] \ge \theta_1 \frac{\epsilon}{4\delta_0} \, T \left( 1 - \frac{1}{A} \right).$$

Proof. We note that the uninformed agent can only incur regret when it makes a sub-optimal decision, which is only possible in state $s = 1$. The proportion of the time the agent spends in state $s = 1$ is lower bounded by $\theta_1$. The regret for any sub-optimal decision while in state $s = 1$ is at least $\frac{\epsilon}{4\delta_0}$ by (7). We follow the arguments from Lemma 1 to obtain our desired result.

We now note that the problem of learning a 2-state transition function is equivalent to estimating a Bernoulli reward. Therefore, we can use Lemma 4 in place of Lemma 1 and repeat a similar argument to the proof of Theorem 1 for multi-armed bandits.
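The value gap bound (7) can be sanity-checked numerically. The sketch below uses hypothetical function names and parameter values; it confirms the bound $\epsilon/(4\delta_0)$ on an instance with $\delta_0 \ge \delta_1$, which is the ordering the chain of inequalities relies on, and shows an instance with $\delta_1$ much larger than $\delta_0$ where the same bound fails.

```python
def theta(delta0, delta1):
    """Average reward delta0 / (delta0 + delta1) of the two state MDP."""
    return delta0 / (delta0 + delta1)


def value_gap(delta0, delta1, eps):
    """theta_1^* - theta_1: gain from playing a* instead of another action."""
    return theta(delta0, delta1 - eps) - theta(delta0, delta1)


def gap_lower_bound(delta0, eps):
    """The bound eps / (4 * delta0) from equation (7)."""
    return eps / (4.0 * delta0)


# delta0 >= delta1: the bound holds.
holds = value_gap(0.2, 0.1, 0.01) >= gap_lower_bound(0.2, 0.01)

# delta1 much larger than delta0: the same bound is violated.
violated = value_gap(0.1, 0.5, 0.01) < gap_lower_bound(0.1, 0.01)
```

The second case illustrates why the ordering of $\delta_0$ and $\delta_1$ matters for the step that replaces $(\delta_0 + \delta_1)^2$ by $(2\delta_0)^2$.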
At a high level we can bound the regret of any agent in terms of the deviation in KL from the distribution of the uninformed agent. For $\epsilon$ small, and over a short enough time window, the distribution of actions chosen by the learning algorithm cannot differ significantly from the actions chosen in the uninformative system. As such, using Pinsker's inequality, the resulting regret of any learning algorithm cannot differ significantly from that of the uninformed algorithm. To make this argument explicit, we use Lemma 2 and Lemma 3 together with Proposition 1 and optimize the resulting bound over $\epsilon$. That is to say, for any learning algorithm $\pi$ and all $\epsilon$ satisfying the conditions of Proposition 1,

$$T \theta^*_1 - \mathbb{E}\left[ \sum_{t=1}^{T} r_t \right] \ge \frac{\epsilon T}{4\delta_0} \, \theta_1 \left( 1 - \frac{1}{A} - \sqrt{\tfrac{1}{2} \, d_{KL}\left( \tilde{P}(z^T_1), P(z^T_1) \right)} \right) \ge \frac{\epsilon T \theta_1}{4\delta_0} \left( 1 - \frac{1}{A} - \epsilon \sqrt{\frac{\theta_1 T}{2 \delta_1 A \log 2}} \right) = \Omega\left( \sqrt{\frac{\delta_1 \theta_1 A T}{\delta_0^2}} \right), \qquad (8)$$

where the KL term is bounded as in the bandit case, with $\delta_1$ in place of $\delta$ and the roughly $\theta_1 T$ timesteps spent in state 1 in place of $T$, and the final step sets $\epsilon = \sqrt{\frac{\delta_1 A}{8 \theta_1 T}}$ with $A \ge 2$.

Now we are left with a problem in completing the argument for Theorem 6 from REGAL. We introduce the notation $T^M_\mu(s, s')$ for the expected number of timesteps to get from state $s$ to $s'$ in MDP $M$ under policy $\mu$. The one-way diameter of an MDP is defined as

$$D_{ow}(M) := \max_s \min_\mu T^M_\mu(s, \bar{s}), \quad \text{where } \bar{s} \text{ is any state with optimal value bias.} \qquad (9)$$

The claim in Theorem 6 of REGAL is that, for any learning algorithm $\pi$, there exists an MDP $M$ such that $\mathrm{Regret}(T, \pi, M) \ge c_0 D_{ow} \sqrt{SAT}$ for some $c_0 > 0$. From the construction of the MDP in Figure 1 it is clear that $D_{ow} = \frac{1}{\delta_0}$, since the only state with optimal value bias is $s = 1$ and the expected time from $s = 0$ to $s = 1$ is $\frac{1}{\delta_0}$. We now examine the behaviour of the remaining free parameters using the definition $\theta_1 = \delta_0/(\delta_0 + \delta_1)$:

$$\sqrt{\frac{\delta_1 \theta_1}{\delta_0^2}} = D_{ow} \sqrt{\delta_1 \theta_1} = D_{ow} \sqrt{\frac{\delta_1 / D_{ow}}{\delta_1 + 1/D_{ow}}} = \sqrt{D_{ow}} \sqrt{\frac{\delta_1 D_{ow}}{1 + \delta_1 D_{ow}}} = O\left(\sqrt{D_{ow}}\right) \quad \text{for any choice of } \delta_1 > 0.$$

This completes the demonstration that the standard proof techniques for lower bounds do not address the problems in the proof of REGAL Theorem 6. In fact, we are only able to establish a lower bound $\Omega(\sqrt{D_{ow} SAT})$ and not $\Omega(D_{ow} \sqrt{SAT})$ as Bartlett and Tewari (2009) had claimed. Further, these bounds are actually weaker than the established results in Jaksch et al. (2010), $\Omega(\sqrt{DSAT})$, where $D(M) := \max_{s \neq s'} \min_\mu T^M_\mu(s, s') \ge D_{ow}$ is the diameter of the MDP.

4.2 Where do the lower bounds lie?

The arguments in Section 4.1 show that existing machinery is not sufficient to establish a proof of Theorem 6 in Bartlett and Tewari (2009). In light of this we suggest that this published result be considered a conjecture, rather than an established theorem. In this note we present an alternative conjecture, that the results of Theorem 6 in Bartlett and Tewari (2009) are not correct. The spirit of this conjecture is similar to Conjecture 1 of Osband and Van Roy (2016), given for finite horizon MDPs.

Conjecture 1 (Tight lower bounds for regret). The lower bounds of Jaksch et al. (2010), $\Omega(\sqrt{DSAT})$, are unimprovable in the sense that there exists some learning algorithm $\pi$ such that, for any MDP $M$ and any $\delta > 0$,

$$\mathrm{Regret}(T, \pi, M) = \tilde{O}\left(\sqrt{DSAT}\right), \qquad (10)$$

with probability at least $1 - \delta$.

4.2.1 What is wrong with the REGAL lower bound?

In order for Conjecture 1 to be true, the sketched proof in Bartlett and Tewari (2009) must be false. Although the arguments of Section 4.1 show that this proof is not yet rigorous, they do not pinpoint any step of the appealing sketched argument which is incorrect. However, we now present an intuitive argument for what may be going wrong in the sketched proof:

- For every timestep $t$ in state $s = 1$ the worst possible decision the agent could make will contribute regret $O(D_{ow})$ in terms of the value. The proposed sketch proof argues that the agent effectively incurs this regret every timestep until it learns the optimal arm.
- If we measure regret in terms of actual shortfall, the instantaneous regret $\lambda^* - r_t$ must be bounded by $O(1)$ per timestep. The bad decisions in state $s = 1$ are only worth $O(D_{ow})$ in value because they might lead to $O(D_{ow})$ of these $O(1)$ instantaneous regret steps occurring in a row.
- Alternatively, we might think of regret in terms of the future value $O(D_{ow})$ which a bad decision at $s = 1$ may be worth; this is the argument that REGAL uses (Bartlett and Tewari, 2009). However, if we do this then the bad decision must be followed by $O(D_{ow})$ timesteps in which we count no additional regret.
- At the moment, the argument for Theorem 6 in Bartlett and Tewari (2009) is doing a type of double-counting for regret. It assigns the maximum $O(D_{ow}(M))$ regret in terms of value at each timestep. However, this analysis ignores that for every one of these bad actions there will be $O(D_{ow}(M))$ periods of time within $s = 0$ where, in terms of the value shortfall, these actions will not incur further regret than has been counted already.

4.2.2 Comparison to existing tight PAC bounds

Another piece of tangentially supporting evidence for Conjecture 1 comes from the recent PAC analysis for finite horizon MDPs (Dann and Brunskill, 2015). The problem formulation given by that paper differs from Bartlett and Tewari (2009) in several ways, but it produces an algorithm, LUCFH, which matches upper and lower bounds for the horizon $H$ in finite horizon MDPs. In finite horizon MDPs, the horizon $H$ is an upper bound on $D_{ow}$. A similar flavour of result is available in discounted MDPs (Lattimore and Hutter, 2012), where the horizon $H$ is replaced with an equivalent timeframe $H = \tilde{O}\left(\frac{1}{1 - \gamma}\right)$.

The analysis for LUCFH in finite horizon MDPs implies that the number of episodes required for $\epsilon$-optimal episodes is $\Theta\left(\frac{H^2}{\epsilon^2}\right)$, where we view all variables other than $H$ and $\epsilon$ as fixed. According to their definition, this would imply $\Theta\left(\frac{H^3}{\epsilon^2}\right)$ timesteps until $\epsilon$-optimal episodes, which is roughly equivalent to $\Theta\left(\frac{H}{\epsilon^2}\right)$ timesteps until $\epsilon$-optimal timesteps.

At a high level the algorithm and analysis from Dann and Brunskill (2015) leverage the sort of phenomenon we describe in Section 4.2.1. This essential argument is refined and made more rigorous through the Bellman equation for local variance, first used in Lattimore and Hutter (2012). It is not generally possible to go from PAC bounds to regret guarantees. However, the spirit of previous analyses and comparable results suggests that the tight bounds of $\Theta\left(\frac{H}{\epsilon^2}\right)$ timesteps until $\epsilon$-optimal timesteps are suggestive of tight regret scaling $\Theta(\sqrt{HT})$.

5 Conclusion

This technical note aims to clarify the current state of lower bounds for regret in reinforcement learning. We reproduce a clear step by step argument for the lower bound on regret given in Bartlett and Tewari (2009). We show that, using standard machinery, this leads to a provable lower bound $\Omega(\sqrt{D_{ow} SAT})$ and that currently there is no proof available for the bound $\Omega(D_{ow} \sqrt{SAT})$ as conjectured in that earlier work. To stimulate thinking on this topic, we present Conjecture 1, that the lower bound $\Omega(\sqrt{D_{ow} SAT})$ is in fact unimprovable. Definitively proving these results one way or another is an exciting area for future research.

Acknowledgements

We would like to thank the authors of Bartlett and Tewari (2009) for their help and dialogue in the discussion of these delicate technical issues. We would also like to thank Daniel Russo for the many hours of discussion and analysis spent in the office on issues like these.

References

Peter L. Bartlett and Ambuj Tewari. REGAL: A regularization based algorithm for reinforcement learning in weakly communicating MDPs. In Proceedings of the 25th Conference on Uncertainty in Artificial Intelligence (UAI 2009), pages 35-42, June 2009.

Sébastien Bubeck and Nicolò Cesa-Bianchi. Regret analysis of stochastic and nonstochastic multi-armed bandit problems. CoRR, 2012.

Christoph Dann and Emma Brunskill. Sample complexity of episodic fixed-horizon reinforcement learning. In Advances in Neural Information Processing Systems, page TBA, 2015.

Thomas Jaksch, Ronald Ortner, and Peter Auer. Near-optimal regret bounds for reinforcement learning. Journal of Machine Learning Research, 11, 2010.

Tze Leung Lai and Herbert Robbins. Asymptotically efficient adaptive allocation rules. Advances in Applied Mathematics, 6(1):4-22, 1985.

Tor Lattimore and Marcus Hutter. PAC bounds for discounted MDPs. In Algorithmic Learning Theory. Springer, 2012.

Ian Osband and Benjamin Van Roy. Why is posterior sampling better than optimism for reinforcement learning. arXiv preprint, 2016.

Ian Osband, Daniel Russo, and Benjamin Van Roy. More efficient reinforcement learning via posterior sampling. In NIPS. Curran Associates, Inc., 2013.


Chapter 4: Dynamic Programming Chpter 4: Dynmic Progrmming Objectives of this chpter: Overview of collection of clssicl solution methods for MDPs known s dynmic progrmming (DP) Show how DP cn be used to compute vlue functions, nd hence,

More information

5.7 Improper Integrals

5.7 Improper Integrals 458 pplictions of definite integrls 5.7 Improper Integrls In Section 5.4, we computed the work required to lift pylod of mss m from the surfce of moon of mss nd rdius R to height H bove the surfce of the

More information

Chapter 5 : Continuous Random Variables

Chapter 5 : Continuous Random Variables STAT/MATH 395 A - PROBABILITY II UW Winter Qurter 216 Néhémy Lim Chpter 5 : Continuous Rndom Vribles Nottions. N {, 1, 2,...}, set of nturl numbers (i.e. ll nonnegtive integers); N {1, 2,...}, set of ll

More information

Tests for the Ratio of Two Poisson Rates

Tests for the Ratio of Two Poisson Rates Chpter 437 Tests for the Rtio of Two Poisson Rtes Introduction The Poisson probbility lw gives the probbility distribution of the number of events occurring in specified intervl of time or spce. The Poisson

More information

Improper Integrals, and Differential Equations

Improper Integrals, and Differential Equations Improper Integrls, nd Differentil Equtions October 22, 204 5.3 Improper Integrls Previously, we discussed how integrls correspond to res. More specificlly, we sid tht for function f(x), the region creted

More information

Continuous Random Variables

Continuous Random Variables STAT/MATH 395 A - PROBABILITY II UW Winter Qurter 217 Néhémy Lim Continuous Rndom Vribles Nottion. The indictor function of set S is rel-vlued function defined by : { 1 if x S 1 S (x) if x S Suppose tht

More information

Chapter 4 Contravariance, Covariance, and Spacetime Diagrams

Chapter 4 Contravariance, Covariance, and Spacetime Diagrams Chpter 4 Contrvrince, Covrince, nd Spcetime Digrms 4. The Components of Vector in Skewed Coordintes We hve seen in Chpter 3; figure 3.9, tht in order to show inertil motion tht is consistent with the Lorentz

More information

UNIFORM CONVERGENCE. Contents 1. Uniform Convergence 1 2. Properties of uniform convergence 3

UNIFORM CONVERGENCE. Contents 1. Uniform Convergence 1 2. Properties of uniform convergence 3 UNIFORM CONVERGENCE Contents 1. Uniform Convergence 1 2. Properties of uniform convergence 3 Suppose f n : Ω R or f n : Ω C is sequence of rel or complex functions, nd f n f s n in some sense. Furthermore,

More information

1.9 C 2 inner variations

1.9 C 2 inner variations 46 CHAPTER 1. INDIRECT METHODS 1.9 C 2 inner vritions So fr, we hve restricted ttention to liner vritions. These re vritions of the form vx; ǫ = ux + ǫφx where φ is in some liner perturbtion clss P, for

More information

{ } = E! & $ " k r t +k +1

{ } = E! & $  k r t +k +1 Chpter 4: Dynmic Progrmming Objectives of this chpter: Overview of collection of clssicl solution methods for MDPs known s dynmic progrmming (DP) Show how DP cn be used to compute vlue functions, nd hence,

More information

MA Handout 2: Notation and Background Concepts from Analysis

MA Handout 2: Notation and Background Concepts from Analysis MA350059 Hndout 2: Nottion nd Bckground Concepts from Anlysis This hndout summrises some nottion we will use nd lso gives recp of some concepts from other units (MA20023: PDEs nd CM, MA20218: Anlysis 2A,

More information

Riemann is the Mann! (But Lebesgue may besgue to differ.)

Riemann is the Mann! (But Lebesgue may besgue to differ.) Riemnn is the Mnn! (But Lebesgue my besgue to differ.) Leo Livshits My 2, 2008 1 For finite intervls in R We hve seen in clss tht every continuous function f : [, b] R hs the property tht for every ɛ >

More information

CS667 Lecture 6: Monte Carlo Integration 02/10/05

CS667 Lecture 6: Monte Carlo Integration 02/10/05 CS667 Lecture 6: Monte Crlo Integrtion 02/10/05 Venkt Krishnrj Lecturer: Steve Mrschner 1 Ide The min ide of Monte Crlo Integrtion is tht we cn estimte the vlue of n integrl by looking t lrge number of

More information

Research Article Moment Inequalities and Complete Moment Convergence

Research Article Moment Inequalities and Complete Moment Convergence Hindwi Publishing Corportion Journl of Inequlities nd Applictions Volume 2009, Article ID 271265, 14 pges doi:10.1155/2009/271265 Reserch Article Moment Inequlities nd Complete Moment Convergence Soo Hk

More information

1 Probability Density Functions

1 Probability Density Functions Lis Yn CS 9 Continuous Distributions Lecture Notes #9 July 6, 28 Bsed on chpter by Chris Piech So fr, ll rndom vribles we hve seen hve been discrete. In ll the cses we hve seen in CS 9, this ment tht our

More information

Generalized Fano and non-fano networks

Generalized Fano and non-fano networks Generlized Fno nd non-fno networks Nildri Ds nd Brijesh Kumr Ri Deprtment of Electronics nd Electricl Engineering Indin Institute of Technology Guwhti, Guwhti, Assm, Indi Emil: {d.nildri, bkri}@iitg.ernet.in

More information

Section 5.1 #7, 10, 16, 21, 25; Section 5.2 #8, 9, 15, 20, 27, 30; Section 5.3 #4, 6, 9, 13, 16, 28, 31; Section 5.4 #7, 18, 21, 23, 25, 29, 40

Section 5.1 #7, 10, 16, 21, 25; Section 5.2 #8, 9, 15, 20, 27, 30; Section 5.3 #4, 6, 9, 13, 16, 28, 31; Section 5.4 #7, 18, 21, 23, 25, 29, 40 Mth B Prof. Audrey Terrs HW # Solutions by Alex Eustis Due Tuesdy, Oct. 9 Section 5. #7,, 6,, 5; Section 5. #8, 9, 5,, 7, 3; Section 5.3 #4, 6, 9, 3, 6, 8, 3; Section 5.4 #7, 8,, 3, 5, 9, 4 5..7 Since

More information

Chapter 0. What is the Lebesgue integral about?

Chapter 0. What is the Lebesgue integral about? Chpter 0. Wht is the Lebesgue integrl bout? The pln is to hve tutoril sheet ech week, most often on Fridy, (to be done during the clss) where you will try to get used to the ides introduced in the previous

More information

Chapter 2 Fundamental Concepts

Chapter 2 Fundamental Concepts Chpter 2 Fundmentl Concepts This chpter describes the fundmentl concepts in the theory of time series models In prticulr we introduce the concepts of stochstic process, men nd covrince function, sttionry

More information

19 Optimal behavior: Game theory

19 Optimal behavior: Game theory Intro. to Artificil Intelligence: Dle Schuurmns, Relu Ptrscu 1 19 Optiml behvior: Gme theory Adversril stte dynmics hve to ccount for worst cse Compute policy π : S A tht mximizes minimum rewrd Let S (,

More information

Decision Networks. CS 188: Artificial Intelligence Fall Example: Decision Networks. Decision Networks. Decisions as Outcome Trees

Decision Networks. CS 188: Artificial Intelligence Fall Example: Decision Networks. Decision Networks. Decisions as Outcome Trees CS 188: Artificil Intelligence Fll 2011 Decision Networks ME: choose the ction which mximizes the expected utility given the evidence mbrell Lecture 17: Decision Digrms 10/27/2011 Cn directly opertionlize

More information

20 MATHEMATICS POLYNOMIALS

20 MATHEMATICS POLYNOMIALS 0 MATHEMATICS POLYNOMIALS.1 Introduction In Clss IX, you hve studied polynomils in one vrible nd their degrees. Recll tht if p(x) is polynomil in x, the highest power of x in p(x) is clled the degree of

More information

Module 6 Value Iteration. CS 886 Sequential Decision Making and Reinforcement Learning University of Waterloo

Module 6 Value Iteration. CS 886 Sequential Decision Making and Reinforcement Learning University of Waterloo Module 6 Vlue Itertion CS 886 Sequentil Decision Mking nd Reinforcement Lerning University of Wterloo Mrkov Decision Process Definition Set of sttes: S Set of ctions (i.e., decisions): A Trnsition model:

More information

Discrete Mathematics and Probability Theory Spring 2013 Anant Sahai Lecture 17

Discrete Mathematics and Probability Theory Spring 2013 Anant Sahai Lecture 17 EECS 70 Discrete Mthemtics nd Proility Theory Spring 2013 Annt Shi Lecture 17 I.I.D. Rndom Vriles Estimting the is of coin Question: We wnt to estimte the proportion p of Democrts in the US popultion,

More information

LECTURE NOTE #12 PROF. ALAN YUILLE

LECTURE NOTE #12 PROF. ALAN YUILLE LECTURE NOTE #12 PROF. ALAN YUILLE 1. Clustering, K-mens, nd EM Tsk: set of unlbeled dt D = {x 1,..., x n } Decompose into clsses w 1,..., w M where M is unknown. Lern clss models p(x w)) Discovery of

More information

Unit #9 : Definite Integral Properties; Fundamental Theorem of Calculus

Unit #9 : Definite Integral Properties; Fundamental Theorem of Calculus Unit #9 : Definite Integrl Properties; Fundmentl Theorem of Clculus Gols: Identify properties of definite integrls Define odd nd even functions, nd reltionship to integrl vlues Introduce the Fundmentl

More information

State space systems analysis (continued) Stability. A. Definitions A system is said to be Asymptotically Stable (AS) when it satisfies

State space systems analysis (continued) Stability. A. Definitions A system is said to be Asymptotically Stable (AS) when it satisfies Stte spce systems nlysis (continued) Stbility A. Definitions A system is sid to be Asymptoticlly Stble (AS) when it stisfies ut () = 0, t > 0 lim xt () 0. t A system is AS if nd only if the impulse response

More information

An approximation to the arithmetic-geometric mean. G.J.O. Jameson, Math. Gazette 98 (2014), 85 95

An approximation to the arithmetic-geometric mean. G.J.O. Jameson, Math. Gazette 98 (2014), 85 95 An pproximtion to the rithmetic-geometric men G.J.O. Jmeson, Mth. Gzette 98 (4), 85 95 Given positive numbers > b, consider the itertion given by =, b = b nd n+ = ( n + b n ), b n+ = ( n b n ) /. At ech

More information

Discrete Mathematics and Probability Theory Summer 2014 James Cook Note 17

Discrete Mathematics and Probability Theory Summer 2014 James Cook Note 17 CS 70 Discrete Mthemtics nd Proility Theory Summer 2014 Jmes Cook Note 17 I.I.D. Rndom Vriles Estimting the is of coin Question: We wnt to estimte the proportion p of Democrts in the US popultion, y tking

More information

( dg. ) 2 dt. + dt. dt j + dh. + dt. r(t) dt. Comparing this equation with the one listed above for the length of see that

( dg. ) 2 dt. + dt. dt j + dh. + dt. r(t) dt. Comparing this equation with the one listed above for the length of see that Arc Length of Curves in Three Dimensionl Spce If the vector function r(t) f(t) i + g(t) j + h(t) k trces out the curve C s t vries, we cn mesure distnces long C using formul nerly identicl to one tht we

More information

Math 270A: Numerical Linear Algebra

Math 270A: Numerical Linear Algebra Mth 70A: Numericl Liner Algebr Instructor: Michel Holst Fll Qurter 014 Homework Assignment #3 Due Give to TA t lest few dys before finl if you wnt feedbck. Exercise 3.1. (The Bsic Liner Method for Liner

More information

ODE: Existence and Uniqueness of a Solution

ODE: Existence and Uniqueness of a Solution Mth 22 Fll 213 Jerry Kzdn ODE: Existence nd Uniqueness of Solution The Fundmentl Theorem of Clculus tells us how to solve the ordinry differentil eqution (ODE) du = f(t) dt with initil condition u() =

More information

Riemann Integrals and the Fundamental Theorem of Calculus

Riemann Integrals and the Fundamental Theorem of Calculus Riemnn Integrls nd the Fundmentl Theorem of Clculus Jmes K. Peterson Deprtment of Biologicl Sciences nd Deprtment of Mthemticl Sciences Clemson University September 16, 2013 Outline Grphing Riemnn Sums

More information

Math& 152 Section Integration by Parts

Math& 152 Section Integration by Parts Mth& 5 Section 7. - Integrtion by Prts Integrtion by prts is rule tht trnsforms the integrl of the product of two functions into other (idelly simpler) integrls. Recll from Clculus I tht given two differentible

More information

Theoretical foundations of Gaussian quadrature

Theoretical foundations of Gaussian quadrature Theoreticl foundtions of Gussin qudrture 1 Inner product vector spce Definition 1. A vector spce (or liner spce) is set V = {u, v, w,...} in which the following two opertions re defined: (A) Addition of

More information

Student Activity 3: Single Factor ANOVA

Student Activity 3: Single Factor ANOVA MATH 40 Student Activity 3: Single Fctor ANOVA Some Bsic Concepts In designed experiment, two or more tretments, or combintions of tretments, is pplied to experimentl units The number of tretments, whether

More information

1B40 Practical Skills

1B40 Practical Skills B40 Prcticl Skills Comining uncertinties from severl quntities error propgtion We usully encounter situtions where the result of n experiment is given in terms of two (or more) quntities. We then need

More information

How to simulate Turing machines by invertible one-dimensional cellular automata

How to simulate Turing machines by invertible one-dimensional cellular automata How to simulte Turing mchines by invertible one-dimensionl cellulr utomt Jen-Christophe Dubcq Déprtement de Mthémtiques et d Informtique, École Normle Supérieure de Lyon, 46, llée d Itlie, 69364 Lyon Cedex

More information

Solution for Assignment 1 : Intro to Probability and Statistics, PAC learning

Solution for Assignment 1 : Intro to Probability and Statistics, PAC learning Solution for Assignment 1 : Intro to Probbility nd Sttistics, PAC lerning 10-701/15-781: Mchine Lerning (Fll 004) Due: Sept. 30th 004, Thursdy, Strt of clss Question 1. Bsic Probbility ( 18 pts) 1.1 (

More information

Decision Networks. CS 188: Artificial Intelligence. Decision Networks. Decision Networks. Decision Networks and Value of Information

Decision Networks. CS 188: Artificial Intelligence. Decision Networks. Decision Networks. Decision Networks and Value of Information CS 188: Artificil Intelligence nd Vlue of Informtion Instructors: Dn Klein nd Pieter Abbeel niversity of Cliforni, Berkeley [These slides were creted by Dn Klein nd Pieter Abbeel for CS188 Intro to AI

More information

Math 1B, lecture 4: Error bounds for numerical methods

Math 1B, lecture 4: Error bounds for numerical methods Mth B, lecture 4: Error bounds for numericl methods Nthn Pflueger 4 September 0 Introduction The five numericl methods descried in the previous lecture ll operte by the sme principle: they pproximte the

More information

Sufficient condition on noise correlations for scalable quantum computing

Sufficient condition on noise correlations for scalable quantum computing Sufficient condition on noise correltions for sclble quntum computing John Presill, 2 Februry 202 Is quntum computing sclble? The ccurcy threshold theorem for quntum computtion estblishes tht sclbility

More information

Entropy and Ergodic Theory Notes 10: Large Deviations I

Entropy and Ergodic Theory Notes 10: Large Deviations I Entropy nd Ergodic Theory Notes 10: Lrge Devitions I 1 A chnge of convention This is our first lecture on pplictions of entropy in probbility theory. In probbility theory, the convention is tht ll logrithms

More information

Lecture 2: Fields, Formally

Lecture 2: Fields, Formally Mth 08 Lecture 2: Fields, Formlly Professor: Pdric Brtlett Week UCSB 203 In our first lecture, we studied R, the rel numbers. In prticulr, we exmined how the rel numbers intercted with the opertions of

More information

Math 8 Winter 2015 Applications of Integration

Math 8 Winter 2015 Applications of Integration Mth 8 Winter 205 Applictions of Integrtion Here re few importnt pplictions of integrtion. The pplictions you my see on n exm in this course include only the Net Chnge Theorem (which is relly just the Fundmentl

More information

Notes on length and conformal metrics

Notes on length and conformal metrics Notes on length nd conforml metrics We recll how to mesure the Eucliden distnce of n rc in the plne. Let α : [, b] R 2 be smooth (C ) rc. Tht is α(t) (x(t), y(t)) where x(t) nd y(t) re smooth rel vlued

More information

A REVIEW OF CALCULUS CONCEPTS FOR JDEP 384H. Thomas Shores Department of Mathematics University of Nebraska Spring 2007

A REVIEW OF CALCULUS CONCEPTS FOR JDEP 384H. Thomas Shores Department of Mathematics University of Nebraska Spring 2007 A REVIEW OF CALCULUS CONCEPTS FOR JDEP 384H Thoms Shores Deprtment of Mthemtics University of Nebrsk Spring 2007 Contents Rtes of Chnge nd Derivtives 1 Dierentils 4 Are nd Integrls 5 Multivrite Clculus

More information

Lecture 3 ( ) (translated and slightly adapted from lecture notes by Martin Klazar)

Lecture 3 ( ) (translated and slightly adapted from lecture notes by Martin Klazar) Lecture 3 (5.3.2018) (trnslted nd slightly dpted from lecture notes by Mrtin Klzr) Riemnn integrl Now we define precisely the concept of the re, in prticulr, the re of figure U(, b, f) under the grph of

More information

3.4 Numerical integration

3.4 Numerical integration 3.4. Numericl integrtion 63 3.4 Numericl integrtion In mny economic pplictions it is necessry to compute the definite integrl of relvlued function f with respect to "weight" function w over n intervl [,

More information

Goals: Determine how to calculate the area described by a function. Define the definite integral. Explore the relationship between the definite

Goals: Determine how to calculate the area described by a function. Define the definite integral. Explore the relationship between the definite Unit #8 : The Integrl Gols: Determine how to clculte the re described by function. Define the definite integrl. Eplore the reltionship between the definite integrl nd re. Eplore wys to estimte the definite

More information

Integral points on the rational curve

Integral points on the rational curve Integrl points on the rtionl curve y x bx c x ;, b, c integers. Konstntine Zeltor Mthemtics University of Wisconsin - Mrinette 750 W. Byshore Street Mrinette, WI 5443-453 Also: Konstntine Zeltor P.O. Box

More information

A Fast and Reliable Policy Improvement Algorithm

A Fast and Reliable Policy Improvement Algorithm A Fst nd Relible Policy Improvement Algorithm Ysin Abbsi-Ydkori Peter L. Brtlett Stephen J. Wright Queenslnd University of Technology UC Berkeley nd QUT University of Wisconsin-Mdison Abstrct We introduce

More information

Non-Linear & Logistic Regression

Non-Linear & Logistic Regression Non-Liner & Logistic Regression If the sttistics re boring, then you've got the wrong numbers. Edwrd R. Tufte (Sttistics Professor, Yle University) Regression Anlyses When do we use these? PART 1: find

More information

Numerical integration

Numerical integration 2 Numericl integrtion This is pge i Printer: Opque this 2. Introduction Numericl integrtion is problem tht is prt of mny problems in the economics nd econometrics literture. The orgniztion of this chpter

More information

CS 188: Artificial Intelligence Spring 2007

CS 188: Artificial Intelligence Spring 2007 CS 188: Artificil Intelligence Spring 2007 Lecture 3: Queue-Bsed Serch 1/23/2007 Srini Nrynn UC Berkeley Mny slides over the course dpted from Dn Klein, Sturt Russell or Andrew Moore Announcements Assignment

More information

I1 = I2 I1 = I2 + I3 I1 + I2 = I3 + I4 I 3

I1 = I2 I1 = I2 + I3 I1 + I2 = I3 + I4 I 3 2 The Prllel Circuit Electric Circuits: Figure 2- elow show ttery nd multiple resistors rrnged in prllel. Ech resistor receives portion of the current from the ttery sed on its resistnce. The split is

More information

Energy Bands Energy Bands and Band Gap. Phys463.nb Phenomenon

Energy Bands Energy Bands and Band Gap. Phys463.nb Phenomenon Phys463.nb 49 7 Energy Bnds Ref: textbook, Chpter 7 Q: Why re there insultors nd conductors? Q: Wht will hppen when n electron moves in crystl? In the previous chpter, we discussed free electron gses,

More information

A recursive construction of efficiently decodable list-disjunct matrices

A recursive construction of efficiently decodable list-disjunct matrices CSE 709: Compressed Sensing nd Group Testing. Prt I Lecturers: Hung Q. Ngo nd Atri Rudr SUNY t Bufflo, Fll 2011 Lst updte: October 13, 2011 A recursive construction of efficiently decodble list-disjunct

More information

The steps of the hypothesis test

The steps of the hypothesis test ttisticl Methods I (EXT 7005) Pge 78 Mosquito species Time of dy A B C Mid morning 0.0088 5.4900 5.5000 Mid Afternoon.3400 0.0300 0.8700 Dusk 0.600 5.400 3.000 The Chi squre test sttistic is the sum of

More information