Online Learning, Regret Minimization, Minimax Optimality, and Correlated Equilibrium
|
|
- Shauna Dorsey
- 5 years ago
- Views:
Transcription
1 Algorihm Online Learning, Regre Minimizaion, Minimax Opimaliy, and Correlaed Equilibrium High level Las ime we discussed noion of Nash equilibrium Saic concep: se of prob Disribuions (p,q, ) such ha nobody has any incenive o deviae Bu doesn alk abou how sysem would ge here Troubling ha even finding one can be hard in large games Wha if agens adap (learn) in ways ha are well-moivaed in erms of heir own rewards? Wha can we say abou he sysem hen? Avrim Blum High level Today: Online learning guaranees ha are achievable when acing in a changing and unpredicable environmen Wha happens when wo players in a zero-sum game boh use such sraegies? Approach minimax opimaliy Gives alernaive proof of minimax heorem Wha happens in a general-sum game? Approaches (he se of) correlaed equilibria Consider he following seing Each morning, you need o pick one of N possible roues o drive o work Bu raffic is differen each day No clear a priori which will be bes When you ge here you find ou how long your roue ook (And maybe ohers oo or maybe no) Is here a sraegy for picking roues so ha in he long run, whaever he sequence of raffic paerns has been, you ve done nearly as well as he bes fixed roue in hindsigh? (In expecaion, over inernal randomness in he algorihm) Yes Robos R Us 32 min No-regre algorihms for repeaed decisions A bi more generally: Algorihm has N opions World chooses cos vecor Can view as marix like his (maybe infinie # cols) No-regre algorihms for repeaed decisions A each ime sep, algorihm picks row, life picks column Alg pays cos for acion chosen Alg ges column as feedback (or jus is own cos in he bandi model) Need o assume some bound on max cos Le s say all coss beween 0 and 1 A each ime sep, algorihm picks row, life picks column Define Alg average pays cos regre for acion in T ime chosen seps as: (avg per-day cos of alg) (avg per-day cos of bes Alg ges column as feedback (or jus is own cos in fixed row in hindsigh) he bandi model) We wan his o go o 0 or beer as T ges large Need o assume some bound on max cos Le s say all coss [called beween a no-regre 0 and 1 algorihm] 1
2 Algorihm Algorihm Some inuiion & properies of no-regre algs Le s look a a small example: des Noe: No rying o compee wih bes adapive sraegy jus bes fixed row (pah) in hindsigh No-regre algorihms can do much beer han playing minimax opimal, and never much worse Exisence of no-regre algs yields immediae proof of minimax hm (will see in a bi) Hisory and developmen (abridged) [Hannan 57, Blackwell 56]: Alg wih avg regre O((N/T) 1/2 ) Re-phrasing, need only T = O(N/ 2 ) seps o ge imeaverage regre down o (will call his quaniy T ) Opimal dependence on T (or ) Game-heoriss viewed #rows N as consan, no so imporan as T, so prey much done Why opimal in T? Say world flips fair coin each day Any alg, in T days, has expeced cos T/2 Bu E[min(# heads,#ails)] = T/2 Θ T So, expeced avg per-day gap is Θ 1/ T des Hisory and developmen (abridged) [Hannan 57, Blackwell 56]: Alg wih avg regre O((N/T) 1/2 ) Re-phrasing, need only T = O(N/ 2 ) seps o ge imeaverage regre down o (will call his quaniy T ) Opimal dependence on T (or ) Game-heoriss viewed #rows N as consan, no so imporan as T, so prey much done Learning-heory 80s-90s: combining exper advice Imagine large class C of N predicion rules Perform (nearly) as well as bes f2c [LilesoneWarmuh 89]: Weighed-majoriy algorihm E[cos] OPT(1+ ) + (log N)/ Regre O((log N)/T) 1/2 T = O((log N)/ 2 ) Opimal as fn of N oo, plus los of work on exac consans, 2 nd order erms, ec [CFHHSW93] Exensions o bandi model (adds exra facor of N) To hink abou his, le s look a he problem of combining exper advice Using exper advice Say we wan o predic he sock marke We solici n expers for heir advice (Will he marke go up or down?) We hen wan o use heir advice somehow o make our predicion Eg, Basic quesion: Is here a sraegy ha allows us o do nearly as well as bes of hese in hindsigh? [ exper = someone wih an opinion No necessarily someone who knows anyhing] Simpler quesion We have n expers One of hese is perfec (never makes a misake) We jus don know which one Can we find a sraegy ha makes no more han lg(n) misakes? Answer: sure Jus ake majoriy voe over all expers ha have been correc so far Each misake cus # available by facor of 2 Noe: his means ok for n o be very large halving algorihm 2
3 Wha if no exper is perfec? One idea: jus run above proocol unil all expers are crossed off, hen repea Makes a mos log(n) misakes per misake of he bes exper (plus iniial log(n)) Seems waseful Consanly forgeing wha we've learned Can we do beer? Weighed Majoriy Algorihm Inuiion: Making a misake doesn' compleely disqualify an exper So, insead of crossing off, jus lower is weigh Weighed Majoriy Alg: Sar wih all expers having weigh 1 Predic based on weighed majoriy voe Penalize misakes by cuing weigh in half Analysis: do nearly as well as bes exper in hindsigh M = # misakes we've made so far m = # misakes bes exper has made so far W = oal weigh (sars a n) Afer each misake, W drops by a leas 25% So, afer M misakes, W is a mos n(3/4) M Weigh of bes exper is (1/2) m So, So, if m is small, hen M is prey small oo consan raio Randomized Weighed Majoriy 24(m + lg n) no so good if he bes exper makes a misake 20% of he ime Can we do beer? Yes Insead of aking majoriy voe, use weighs as probabiliies (eg, if 70% on up, 30% on down, hen pick 70:30) Idea: smooh ou he wors case Also, generalize ½ o 1- M = expeced #misakes unlike mos wors-case bounds, numbers are prey good Analysis Say a ime we have fracion F of weigh on expers ha made misake So, we have probabiliy F of making a misake, and we remove an F fracion of he oal weigh W final = n(1- F 1 )(1 - F 2 ) ln(w final ) = ln(n) + [ln(1 - F )] ln(n) - F (using ln(1-x) < -x) = ln(n) - M ( F = E[# misakes]) If bes exper makes m misakes, hen ln(w final ) > ln((1- ) m ) Now solve: ln(n) - M > m ln(1- ) Summarizing E[# misakes] (1+ )m + -1 log(n) If se =(log(n)/m) 1/2 o balance he wo erms ou (or use guess-and-double), ge bound of E[misakes] m + 2(m log n) 1/2 Since m T, his is a mos m + 2(Tlog n) 1/2 So, avg regre = 2(Tlog n) 1/2 /T! 0 3
4 Wha can we use his for? Can use o combine muliple algorihms o do nearly as well as bes in hindsigh Bu wha abou cases like choosing pahs o work, where expers are differen acions, no differen predicions? Game-heoreic version Wha if expers are acions? (pahs in a nework, rows in a marix game, ) A each ime, each has a loss (cos) in {0,1} Can sill run he algorihm Raher han viewing as pick a predicion wih prob proporional o is weigh, View as pick an exper wih probabiliy proporional o is weigh Choose exper i wih probabiliy p i = w i / i w i Same analysis applies Game-heoreic version Wha if expers are acions? (pahs in a nework, rows in a marix game, ) Wha if losses (coss) in [0,1]? If exper i has cos c i, do: w i à w i (1-c i ) Our expeced cos = i c i w i /W Amoun of weigh removed = i w i c i So, fracion removed = (our cos) Res of proof coninues as before So, now we can drive o work! (assuming full feedback) (1- c 12 )(1- c 11 ) 1 (1- c 22 )(1- c 21 ) 1 (1- c 32 )(1- c 31 ) (1- c n2 )(1- c n1 ) 1 Illusraion c 1 c 2 Guaranee: E[cos] OPT + 2(OPT log n) 1/2 scaling so coss in [0,1] Since OPT T, his is a mos OPT + 2(Tlog n) 1/2 So, regre/ime sep 2(Tlog n) 1/2 /T! 0 Connecions o Minimax Opimaliy Minimax-opimal sraegies Can solve for minimax-opimal sraegies using Linear programming Claim: no-regre sraegies will do nearly as well or beer agains any sequence of opponen plays Do nearly as well as bes fixed choice in hindsigh Implies do nearly as well as bes disrib in hindsigh Implies do nearly as well as minimax opimal! Lef Righ Lef Righ (½,-½) (1,-1) (1,-1) (0,0) 4
5 Proof of minimax hm using RWM Suppose for conradicion i was false This means some game G has V C > V R : If Column player commis firs, here exiss a row ha ges he Row player a leas V C Bu if Row player has o commi firs, he Column player can make him ge only V R Scale marix so payoffs o row are in [-1,0] Say V R = V C - V C V R Proof cond Now, consider playing randomized weighedmajoriy alg as Row, agains Col who plays opimally agains Row s disrib In T seps, Alg ges [bes row in hindsigh] 2(Tlog n) 1/2 BRiH T V C [Bes agains opponen s empirical disribuion] Alg T V R [Each ime, opponen knows your randomized sraegy] Gap is T Conradics assumpion once T > 2(Tlog n) 1/2, or T > 4log(n)/ 2 Proof cond Now, consider playing randomized weighedmajoriy alg as Row, agains Col who plays opimally agains Row s disrib Wha if wo RWMs play each oher? Can anyone see he argumen ha heir ime-average sraegies mus be approaching minimax opimaliy? Noe ha our procedure gives a fas way o compue apx minimax-opimal sraegies, if we can simulae Col (bes-response) quickly Wha if all players minimize regre? Inernal/Swap Regre and Correlaed Equilibria In zero-sum games, empirical frequencies quickly approaches minimax opimal In general-sum games, does behavior quickly (or a all) approach a Nash equilibrium? Afer all, a Nash Eq is exacly a se of disribuions ha are no-regre wr each oher So if he disribuions sabilize, hey mus converge o a Nash equil Well, unforunaely, no 5
6 Wha can we say? If algorihms minimize inernal or swap regre, hen empirical disribuion of play approaches correlaed equilibrium Foser & Vohra, Har & Mas-Colell, Though doesn imply play is sabilizing Wha are inernal/swap regre and correlaed equilibria? More general forms of regre 1 bes exper or exernal regre: Given n sraegies Compee wih bes of hem in hindsigh 2 sleeping exper or regre wih ime-inervals : Given n sraegies, k properies Le S i be se of days saisfying propery i (migh overlap) Wan o simulaneously achieve low regre over each S i 3 inernal or swap regre: like (2), excep ha S i = se of days in which we chose sraegy i Inernal/swap-regre Eg, each day we pick one sock o buy shares in Don wan o have regre of he form every ime I bough IBM, I should have bough Microsof insead Formally, swap regre is wr opimal funcion f:{1,,n}!{1,,n} such ha every ime you played acion j, i plays f(j) Weird why care? Correlaed equilibrium Disribuion over enries in marix, such ha if a rused pary chooses one a random and ells you your par, you have no incenive o deviae Eg, Shapley game R P S R P S -1,-1-1,1 1,-1 1,-1-1,-1-1,1-1,1 1,-1-1,-1-1,-1 In general-sum games, if all players have low swapregre, hen empirical disribuion of play is apx correlaed equilibrium Connecion If all paries run a low swap regre algorihm, hen empirical disribuion of play is an apx correlaed equilibrium Correlaor chooses random ime 2 {1,2,,T} Tells each player o play he acion j hey played in ime (bu does no reveal value of ) Expeced incenive o deviae: j Pr(j)(Regre j) = swap-regre of algorihm So, his suggess correlaed equilibria may be naural hings o see in muli-agen sysems where individuals are opimizing for hemselves Correlaed vs Coarse-correlaed Eq In boh cases: a disribuion over enries in he marix Think of a hird pary choosing from his disr and elling you your par as advice Correlaed equilibrium You have no incenive o deviae, even afer seeing wha he advice is Coarse-Correlaed equilibrium If only choice is o see and follow, or no o see a all, would prefer he former Low exernal-regre ) apx coarse correlaed equilib 6
7 Inernal/swap-regre, cond Algorihms for achieving low regre of his form: Foser & Vohra, Har & Mas-Colell, Fudenberg & Levine Will presen mehod of [BM05] showing how o conver any bes exper algorihm ino one achieving low swap regre Unforunaely, #seps o achieve low swap regre is O(n log n) raher han O(log n) Can conver any bes exper algorihm A ino one achieving low swap regre Idea: Insaniae one copy A j responsible for expeced regre over imes we play j Play p = pq Q Alg Cos vecor c q 2 Allows us o view p j as prob we play acion j, or as prob we play alg A j Give A j feedback of p j c A j guaranees (p j c ) q j min i p j c i + [regre erm] Wrie as: p j (q j c ) min i p j c i + [regre erm] p 2 c A 1 A 2 A n Can conver any bes exper algorihm A ino one achieving low swap regre Idea: Insaniae one copy A j responsible for expeced regre over imes we play j Can conver any bes exper algorihm A ino one achieving low swap regre Idea: Insaniae one copy A j responsible for expeced regre over imes we play j Q Play p = pq Alg Cos vecor c q 2 p 2 c A 1 A 2 Q Play p = pq Alg Cos vecor c q 2 p 2 c A 1 A 2 Sum over j, ge: A n Sum over j, ge: A n p Q c j min i p j c i + n[regre erm] p Q c j min i p j c i + n[regre erm] Our oal cos For each j, can move our prob o is own i=f(j) Our oal cos For each j, can move our prob o is own i=f(j) Wrie as: p j (q j c ) min i p j c i + [regre erm] Ge swap-regre a mos n imes orig exernal regre 7
Topics in Machine Learning Theory
Topics in Machine Learning Theory The Adversarial Muli-armed Bandi Problem, Inernal Regre, and Correlaed Equilibria Avrim Blum 10/8/14 Plan for oday Online game playing / combining exper advice bu: Wha
More informationPlan for Today & Next time (B) Machine Learning Theory. 2-Player Zero-Sum games. Game Theory terminolgy. Minimax-optimal strategies
5-859(B) Machine Learning Theory Learning and Game Theory Avrim Blum Plan for Today & Ne ime 2-player zero-sum games 2-player general-sum games Nash equilibria Correlaed equilibria Inernal/swap regre and
More informationRecap. Online Learning RWM. Avrim Blum. [ACFS02]: applying RWM to bandits
Algorihm Online Learning Recap Noregre algorihms for repeaed decisions: Algorihm has N opions World chooses cos vecor Can view as marix like his (maybe infinie # cols) World life fae Your guide: Avrim
More information1 Review of Zero-Sum Games
COS 5: heoreical Machine Learning Lecurer: Rob Schapire Lecure #23 Scribe: Eugene Brevdo April 30, 2008 Review of Zero-Sum Games Las ime we inroduced a mahemaical model for wo player zero-sum games. Any
More informationMath 2142 Exam 1 Review Problems. x 2 + f (0) 3! for the 3rd Taylor polynomial at x = 0. To calculate the various quantities:
Mah 4 Eam Review Problems Problem. Calculae he 3rd Taylor polynomial for arcsin a =. Soluion. Le f() = arcsin. For his problem, we use he formula f() + f () + f ()! + f () 3! for he 3rd Taylor polynomial
More informationChapter 7: Solving Trig Equations
Haberman MTH Secion I: The Trigonomeric Funcions Chaper 7: Solving Trig Equaions Le s sar by solving a couple of equaions ha involve he sine funcion EXAMPLE a: Solve he equaion sin( ) The inverse funcions
More informationOnline Convex Optimization Example And Follow-The-Leader
CSE599s, Spring 2014, Online Learning Lecure 2-04/03/2014 Online Convex Opimizaion Example And Follow-The-Leader Lecurer: Brendan McMahan Scribe: Sephen Joe Jonany 1 Review of Online Convex Opimizaion
More informationt is a basis for the solution space to this system, then the matrix having these solutions as columns, t x 1 t, x 2 t,... x n t x 2 t...
Mah 228- Fri Mar 24 5.6 Marix exponenials and linear sysems: The analogy beween firs order sysems of linear differenial equaions (Chaper 5) and scalar linear differenial equaions (Chaper ) is much sronger
More informationOnline Learning Applications
Online Learning Applicaions Sepember 19, 2016 In he las lecure we saw he following guaranee for minimizing misakes wih Randomized Weighed Majoriy (RWM). Theorem 1 Le M be misakes of RWM and M i he misakes
More informationLecture 2-1 Kinematics in One Dimension Displacement, Velocity and Acceleration Everything in the world is moving. Nothing stays still.
Lecure - Kinemaics in One Dimension Displacemen, Velociy and Acceleraion Everyhing in he world is moving. Nohing says sill. Moion occurs a all scales of he universe, saring from he moion of elecrons in
More informationChallenge Problems. DIS 203 and 210. March 6, (e 2) k. k(k + 2). k=1. f(x) = k(k + 2) = 1 x k
Challenge Problems DIS 03 and 0 March 6, 05 Choose one of he following problems, and work on i in your group. Your goal is o convince me ha your answer is correc. Even if your answer isn compleely correc,
More informationSolutions Problem Set 3 Macro II (14.452)
Soluions Problem Se 3 Macro II (14.452) Francisco A. Gallego 04/27/2005 1 Q heory of invesmen in coninuous ime and no uncerainy Consider he in nie horizon model of a rm facing adjusmen coss o invesmen.
More informationInventory Analysis and Management. Multi-Period Stochastic Models: Optimality of (s, S) Policy for K-Convex Objective Functions
Muli-Period Sochasic Models: Opimali of (s, S) Polic for -Convex Objecive Funcions Consider a seing similar o he N-sage newsvendor problem excep ha now here is a fixed re-ordering cos (> 0) for each (re-)order.
More informationNotes for Lecture 17-18
U.C. Berkeley CS278: Compuaional Complexiy Handou N7-8 Professor Luca Trevisan April 3-8, 2008 Noes for Lecure 7-8 In hese wo lecures we prove he firs half of he PCP Theorem, he Amplificaion Lemma, up
More informationLinear Response Theory: The connection between QFT and experiments
Phys540.nb 39 3 Linear Response Theory: The connecion beween QFT and experimens 3.1. Basic conceps and ideas Q: How do we measure he conduciviy of a meal? A: we firs inroduce a weak elecric field E, and
More informationEnsamble methods: Bagging and Boosting
Lecure 21 Ensamble mehods: Bagging and Boosing Milos Hauskrech milos@cs.pi.edu 5329 Senno Square Ensemble mehods Mixure of expers Muliple base models (classifiers, regressors), each covers a differen par
More information. Now define y j = log x j, and solve the iteration.
Problem 1: (Disribued Resource Allocaion (ALOHA!)) (Adaped from M& U, Problem 5.11) In his problem, we sudy a simple disribued proocol for allocaing agens o shared resources, wherein agens conend for resources
More informationUnit Root Time Series. Univariate random walk
Uni Roo ime Series Univariae random walk Consider he regression y y where ~ iid N 0, he leas squares esimae of is: ˆ yy y y yy Now wha if = If y y hen le y 0 =0 so ha y j j If ~ iid N 0, hen y ~ N 0, he
More informationT L. t=1. Proof of Lemma 1. Using the marginal cost accounting in Equation(4) and standard arguments. t )+Π RB. t )+K 1(Q RB
Elecronic Companion EC.1. Proofs of Technical Lemmas and Theorems LEMMA 1. Le C(RB) be he oal cos incurred by he RB policy. Then we have, T L E[C(RB)] 3 E[Z RB ]. (EC.1) Proof of Lemma 1. Using he marginal
More informationLecture 2 October ε-approximation of 2-player zero-sum games
Opimizaion II Winer 009/10 Lecurer: Khaled Elbassioni Lecure Ocober 19 1 ε-approximaion of -player zero-sum games In his lecure we give a randomized ficiious play algorihm for obaining an approximae soluion
More informationCompetitive and Cooperative Inventory Policies in a Two-Stage Supply-Chain
Compeiive and Cooperaive Invenory Policies in a Two-Sage Supply-Chain (G. P. Cachon and P. H. Zipkin) Presened by Shruivandana Sharma IOE 64, Supply Chain Managemen, Winer 2009 Universiy of Michigan, Ann
More informationLecture 33: November 29
36-705: Inermediae Saisics Fall 2017 Lecurer: Siva Balakrishnan Lecure 33: November 29 Today we will coninue discussing he boosrap, and hen ry o undersand why i works in a simple case. In he las lecure
More informationKINEMATICS IN ONE DIMENSION
KINEMATICS IN ONE DIMENSION PREVIEW Kinemaics is he sudy of how hings move how far (disance and displacemen), how fas (speed and velociy), and how fas ha how fas changes (acceleraion). We say ha an objec
More informationPhys1112: DC and RC circuits
Name: Group Members: Dae: TA s Name: Phys1112: DC and RC circuis Objecives: 1. To undersand curren and volage characerisics of a DC RC discharging circui. 2. To undersand he effec of he RC ime consan.
More informationBiol. 356 Lab 8. Mortality, Recruitment, and Migration Rates
Biol. 356 Lab 8. Moraliy, Recruimen, and Migraion Raes (modified from Cox, 00, General Ecology Lab Manual, McGraw Hill) Las week we esimaed populaion size hrough several mehods. One assumpion of all hese
More informationFinal Spring 2007
.615 Final Spring 7 Overview The purpose of he final exam is o calculae he MHD β limi in a high-bea oroidal okamak agains he dangerous n = 1 exernal ballooning-kink mode. Effecively, his corresponds o
More informationOperating Systems Exercise 3
Operaing Sysems SS 00 Universiy of Zurich Operaing Sysems Exercise 3 00-06-4 Dominique Emery, s97-60-056 Thomas Bocek, s99-706-39 Philip Iezzi, s99-74-354 Florian Caflisch, s96-90-55 Table page Table page
More informationZürich. ETH Master Course: L Autonomous Mobile Robots Localization II
Roland Siegwar Margaria Chli Paul Furgale Marco Huer Marin Rufli Davide Scaramuzza ETH Maser Course: 151-0854-00L Auonomous Mobile Robos Localizaion II ACT and SEE For all do, (predicion updae / ACT),
More information20. Applications of the Genetic-Drift Model
0. Applicaions of he Geneic-Drif Model 1) Deermining he probabiliy of forming any paricular combinaion of genoypes in he nex generaion: Example: If he parenal allele frequencies are p 0 = 0.35 and q 0
More informationFinish reading Chapter 2 of Spivak, rereading earlier sections as necessary. handout and fill in some missing details!
MAT 257, Handou 6: Ocober 7-2, 20. I. Assignmen. Finish reading Chaper 2 of Spiva, rereading earlier secions as necessary. handou and fill in some missing deails! II. Higher derivaives. Also, read his
More informationEnsamble methods: Boosting
Lecure 21 Ensamble mehods: Boosing Milos Hauskrech milos@cs.pi.edu 5329 Senno Square Schedule Final exam: April 18: 1:00-2:15pm, in-class Term projecs April 23 & April 25: a 1:00-2:30pm in CS seminar room
More informationLaplace transfom: t-translation rule , Haynes Miller and Jeremy Orloff
Laplace ransfom: -ranslaion rule 8.03, Haynes Miller and Jeremy Orloff Inroducory example Consider he sysem ẋ + 3x = f(, where f is he inpu and x he response. We know is uni impulse response is 0 for
More informationSTA 114: Statistics. Notes 2. Statistical Models and the Likelihood Function
STA 114: Saisics Noes 2. Saisical Models and he Likelihood Funcion Describing Daa & Saisical Models A physicis has a heory ha makes a precise predicion of wha s o be observed in daa. If he daa doesn mach
More informationHamilton- J acobi Equation: Weak S olution We continue the study of the Hamilton-Jacobi equation:
M ah 5 7 Fall 9 L ecure O c. 4, 9 ) Hamilon- J acobi Equaion: Weak S oluion We coninue he sudy of he Hamilon-Jacobi equaion: We have shown ha u + H D u) = R n, ) ; u = g R n { = }. ). In general we canno
More informationEcon107 Applied Econometrics Topic 7: Multicollinearity (Studenmund, Chapter 8)
I. Definiions and Problems A. Perfec Mulicollineariy Econ7 Applied Economerics Topic 7: Mulicollineariy (Sudenmund, Chaper 8) Definiion: Perfec mulicollineariy exiss in a following K-variable regression
More informationMath 10B: Mock Mid II. April 13, 2016
Name: Soluions Mah 10B: Mock Mid II April 13, 016 1. ( poins) Sae, wih jusificaion, wheher he following saemens are rue or false. (a) If a 3 3 marix A saisfies A 3 A = 0, hen i canno be inverible. True.
More informationAssignment 6. Tyler Shendruk December 6, 2010
Assignmen 6 Tyler Shendruk December 6, 1 1 Harden Problem 1 Le K be he coupling and h he exernal field in a 1D Ising model. From he lecures hese can be ransformed ino effecive coupling and fields K and
More informationChapter 2. First Order Scalar Equations
Chaper. Firs Order Scalar Equaions We sar our sudy of differenial equaions in he same way he pioneers in his field did. We show paricular echniques o solve paricular ypes of firs order differenial equaions.
More informationGames Against Nature
Advanced Course in Machine Learning Spring 2010 Games Agains Naure Handous are joinly prepared by Shie Mannor and Shai Shalev-Shwarz In he previous lecures we alked abou expers in differen seups and analyzed
More informationdt = C exp (3 ln t 4 ). t 4 W = C exp ( ln(4 t) 3) = C(4 t) 3.
Mah Rahman Exam Review Soluions () Consider he IVP: ( 4)y 3y + 4y = ; y(3) = 0, y (3) =. (a) Please deermine he longes inerval for which he IVP is guaraneed o have a unique soluion. Soluion: The disconinuiies
More informationExpert Advice for Amateurs
Exper Advice for Amaeurs Ernes K. Lai Online Appendix - Exisence of Equilibria The analysis in his secion is performed under more general payoff funcions. Wihou aking an explici form, he payoffs of he
More informationThe Brock-Mirman Stochastic Growth Model
c December 3, 208, Chrisopher D. Carroll BrockMirman The Brock-Mirman Sochasic Growh Model Brock and Mirman (972) provided he firs opimizing growh model wih unpredicable (sochasic) shocks. The social planner
More informationAlgorithms, Games, and Networks January 17, Lecture 2
Algorithms, Games, and Networks January 17, 2013 Lecturer: Avrim Blum Lecture 2 Scribe: Aleksandr Kazachkov 1 Readings for today s lecture Today s topic is online learning, regret minimization, and minimax
More informationFinancial Econometrics Jeffrey R. Russell Midterm Winter 2009 SOLUTIONS
Name SOLUTIONS Financial Economerics Jeffrey R. Russell Miderm Winer 009 SOLUTIONS You have 80 minues o complee he exam. Use can use a calculaor and noes. Try o fi all your work in he space provided. If
More informationVehicle Arrival Models : Headway
Chaper 12 Vehicle Arrival Models : Headway 12.1 Inroducion Modelling arrival of vehicle a secion of road is an imporan sep in raffic flow modelling. I has imporan applicaion in raffic flow simulaion where
More informationFishing limits and the Logistic Equation. 1
Fishing limis and he Logisic Equaion. 1 1. The Logisic Equaion. The logisic equaion is an equaion governing populaion growh for populaions in an environmen wih a limied amoun of resources (for insance,
More informationDiebold, Chapter 7. Francis X. Diebold, Elements of Forecasting, 4th Edition (Mason, Ohio: Cengage Learning, 2006). Chapter 7. Characterizing Cycles
Diebold, Chaper 7 Francis X. Diebold, Elemens of Forecasing, 4h Ediion (Mason, Ohio: Cengage Learning, 006). Chaper 7. Characerizing Cycles Afer compleing his reading you should be able o: Define covariance
More information3.1 More on model selection
3. More on Model selecion 3. Comparing models AIC, BIC, Adjused R squared. 3. Over Fiing problem. 3.3 Sample spliing. 3. More on model selecion crieria Ofen afer model fiing you are lef wih a handful of
More informationMATH 4330/5330, Fourier Analysis Section 6, Proof of Fourier s Theorem for Pointwise Convergence
MATH 433/533, Fourier Analysis Secion 6, Proof of Fourier s Theorem for Poinwise Convergence Firs, some commens abou inegraing periodic funcions. If g is a periodic funcion, g(x + ) g(x) for all real x,
More informationGuest Lectures for Dr. MacFarlane s EE3350 Part Deux
Gues Lecures for Dr. MacFarlane s EE3350 Par Deux Michael Plane Mon., 08-30-2010 Wrie name in corner. Poin ou his is a review, so I will go faser. Remind hem o go lisen o online lecure abou geing an A
More informationTwo Coupled Oscillators / Normal Modes
Lecure 3 Phys 3750 Two Coupled Oscillaors / Normal Modes Overview and Moivaion: Today we ake a small, bu significan, sep owards wave moion. We will no ye observe waves, bu his sep is imporan in is own
More informationSection 3.5 Nonhomogeneous Equations; Method of Undetermined Coefficients
Secion 3.5 Nonhomogeneous Equaions; Mehod of Undeermined Coefficiens Key Terms/Ideas: Linear Differenial operaor Nonlinear operaor Second order homogeneous DE Second order nonhomogeneous DE Soluion o homogeneous
More information) were both constant and we brought them from under the integral.
YIELD-PER-RECRUIT (coninued The yield-per-recrui model applies o a cohor, bu we saw in he Age Disribuions lecure ha he properies of a cohor do no apply in general o a collecion of cohors, which is wha
More information5.1 - Logarithms and Their Properties
Chaper 5 Logarihmic Funcions 5.1 - Logarihms and Their Properies Suppose ha a populaion grows according o he formula P 10, where P is he colony size a ime, in hours. When will he populaion be 2500? We
More informationHOMEWORK # 2: MATH 211, SPRING Note: This is the last solution set where I will describe the MATLAB I used to make my pictures.
HOMEWORK # 2: MATH 2, SPRING 25 TJ HITCHMAN Noe: This is he las soluion se where I will describe he MATLAB I used o make my picures.. Exercises from he ex.. Chaper 2.. Problem 6. We are o show ha y() =
More informationTHE 2-BODY PROBLEM. FIGURE 1. A pair of ellipses sharing a common focus. (c,b) c+a ROBERT J. VANDERBEI
THE 2-BODY PROBLEM ROBERT J. VANDERBEI ABSTRACT. In his shor noe, we show ha a pair of ellipses wih a common focus is a soluion o he 2-body problem. INTRODUCTION. Solving he 2-body problem from scrach
More informationSeminar 4: Hotelling 2
Seminar 4: Hoelling 2 November 3, 211 1 Exercise Par 1 Iso-elasic demand A non renewable resource of a known sock S can be exraced a zero cos. Demand for he resource is of he form: D(p ) = p ε ε > A a
More informationSMT 2014 Calculus Test Solutions February 15, 2014 = 3 5 = 15.
SMT Calculus Tes Soluions February 5,. Le f() = and le g() =. Compue f ()g (). Answer: 5 Soluion: We noe ha f () = and g () = 6. Then f ()g () =. Plugging in = we ge f ()g () = 6 = 3 5 = 5.. There is a
More informationHamilton- J acobi Equation: Explicit Formulas In this lecture we try to apply the method of characteristics to the Hamilton-Jacobi equation: u t
M ah 5 2 7 Fall 2 0 0 9 L ecure 1 0 O c. 7, 2 0 0 9 Hamilon- J acobi Equaion: Explici Formulas In his lecure we ry o apply he mehod of characerisics o he Hamilon-Jacobi equaion: u + H D u, x = 0 in R n
More informationMath 333 Problem Set #2 Solution 14 February 2003
Mah 333 Problem Se #2 Soluion 14 February 2003 A1. Solve he iniial value problem dy dx = x2 + e 3x ; 2y 4 y(0) = 1. Soluion: This is separable; we wrie 2y 4 dy = x 2 + e x dx and inegrae o ge The iniial
More informationLearning a Class from Examples. Training set X. Class C 1. Class C of a family car. Output: Input representation: x 1 : price, x 2 : engine power
Alpaydin Chaper, Michell Chaper 7 Alpaydin slides are in urquoise. Ehem Alpaydin, copyrigh: The MIT Press, 010. alpaydin@boun.edu.r hp://www.cmpe.boun.edu.r/ ehem/imle All oher slides are based on Michell.
More informationSection 5: Chain Rule
Chaper The Derivaive Applie Calculus 11 Secion 5: Chain Rule There is one more ype of complicae funcion ha we will wan o know how o iffereniae: composiion. The Chain Rule will le us fin he erivaive of
More informationCSE/NB 528 Lecture 14: From Supervised to Reinforcement Learning (Chapter 9) R. Rao, 528: Lecture 14
CSE/NB 58 Lecure 14: From Supervised o Reinforcemen Learning Chaper 9 1 Recall from las ime: Sigmoid Neworks Oupu v T g w u g wiui w Inpu nodes u = u 1 u u 3 T i Sigmoid oupu funcion: 1 g a 1 a e 1 ga
More informationCSE/NB 528 Lecture 14: Reinforcement Learning (Chapter 9)
CSE/NB 528 Lecure 14: Reinforcemen Learning Chaper 9 Image from hp://clasdean.la.asu.edu/news/images/ubep2001/neuron3.jpg Lecure figures are from Dayan & Abbo s book hp://people.brandeis.edu/~abbo/book/index.hml
More informationCHAPTER 12 DIRECT CURRENT CIRCUITS
CHAPTER 12 DIRECT CURRENT CIUITS DIRECT CURRENT CIUITS 257 12.1 RESISTORS IN SERIES AND IN PARALLEL When wo resisors are conneced ogeher as shown in Figure 12.1 we said ha hey are conneced in series. As
More informationIntroduction D P. r = constant discount rate, g = Gordon Model (1962): constant dividend growth rate.
Inroducion Gordon Model (1962): D P = r g r = consan discoun rae, g = consan dividend growh rae. If raional expecaions of fuure discoun raes and dividend growh vary over ime, so should he D/P raio. Since
More informationRL Lecture 7: Eligibility Traces. R. S. Sutton and A. G. Barto: Reinforcement Learning: An Introduction 1
RL Lecure 7: Eligibiliy Traces R. S. Suon and A. G. Baro: Reinforcemen Learning: An Inroducion 1 N-sep TD Predicion Idea: Look farher ino he fuure when you do TD backup (1, 2, 3,, n seps) R. S. Suon and
More informationNotes on online convex optimization
Noes on online convex opimizaion Karl Sraos Online convex opimizaion (OCO) is a principled framework for online learning: OnlineConvexOpimizaion Inpu: convex se S, number of seps T For =, 2,..., T : Selec
More informationSection 4.4 Logarithmic Properties
Secion. Logarihmic Properies 59 Secion. Logarihmic Properies In he previous secion, we derived wo imporan properies of arihms, which allowed us o solve some asic eponenial and arihmic equaions. Properies
More informationPredator - Prey Model Trajectories and the nonlinear conservation law
Predaor - Prey Model Trajecories and he nonlinear conservaion law James K. Peerson Deparmen of Biological Sciences and Deparmen of Mahemaical Sciences Clemson Universiy Ocober 28, 213 Ouline Drawing Trajecories
More informationSome Basic Information about M-S-D Systems
Some Basic Informaion abou M-S-D Sysems 1 Inroducion We wan o give some summary of he facs concerning unforced (homogeneous) and forced (non-homogeneous) models for linear oscillaors governed by second-order,
More information= ( ) ) or a system of differential equations with continuous parametrization (T = R
XIII. DIFFERENCE AND DIFFERENTIAL EQUATIONS Ofen funcions, or a sysem of funcion, are paramerized in erms of some variable, usually denoed as and inerpreed as ime. The variable is wrien as a funcion of
More informationFrom Complex Fourier Series to Fourier Transforms
Topic From Complex Fourier Series o Fourier Transforms. Inroducion In he previous lecure you saw ha complex Fourier Series and is coeciens were dened by as f ( = n= C ne in! where C n = T T = T = f (e
More informationMon Apr 9 EP 7.6 Convolutions and Laplace transforms. Announcements: Warm-up Exercise:
Mah 225-4 Week 3 April 9-3 EP 7.6 - convoluions; 6.-6.2 - eigenvalues, eigenvecors and diagonalizabiliy; 7. - sysems of differenial equaions. Mon Apr 9 EP 7.6 Convoluions and Laplace ransforms. Announcemens:
More informationLecture 4 Notes (Little s Theorem)
Lecure 4 Noes (Lile s Theorem) This lecure concerns one of he mos imporan (and simples) heorems in Queuing Theory, Lile s Theorem. More informaion can be found in he course book, Bersekas & Gallagher,
More informationMatlab and Python programming: how to get started
Malab and Pyhon programming: how o ge sared Equipping readers he skills o wrie programs o explore complex sysems and discover ineresing paerns from big daa is one of he main goals of his book. In his chaper,
More informationLearning a Class from Examples. Training set X. Class C 1. Class C of a family car. Output: Input representation: x 1 : price, x 2 : engine power
Alpaydin Chaper, Michell Chaper 7 Alpaydin slides are in urquoise. Ehem Alpaydin, copyrigh: The MIT Press, 010. alpaydin@boun.edu.r hp://www.cmpe.boun.edu.r/ ehem/imle All oher slides are based on Michell.
More informationViterbi Algorithm: Background
Vierbi Algorihm: Background Jean Mark Gawron March 24, 2014 1 The Key propery of an HMM Wha is an HMM. Formally, i has he following ingrediens: 1. a se of saes: S 2. a se of final saes: F 3. an iniial
More informationOnline Learning with Queries
Online Learning wih Queries Chao-Kai Chiang Chi-Jen Lu Absrac The online learning problem requires a player o ieraively choose an acion in an unknown and changing environmen. In he sandard seing of his
More informationE β t log (C t ) + M t M t 1. = Y t + B t 1 P t. B t 0 (3) v t = P tc t M t Question 1. Find the FOC s for an optimum in the agent s problem.
Noes, M. Krause.. Problem Se 9: Exercise on FTPL Same model as in paper and lecure, only ha one-period govenmen bonds are replaced by consols, which are bonds ha pay one dollar forever. I has curren marke
More informationSolutions to Assignment 1
MA 2326 Differenial Equaions Insrucor: Peronela Radu Friday, February 8, 203 Soluions o Assignmen. Find he general soluions of he following ODEs: (a) 2 x = an x Soluion: I is a separable equaion as we
More informationHomework sheet Exercises done during the lecture of March 12, 2014
EXERCISE SESSION 2A FOR THE COURSE GÉOMÉTRIE EUCLIDIENNE, NON EUCLIDIENNE ET PROJECTIVE MATTEO TOMMASINI Homework shee 3-4 - Exercises done during he lecure of March 2, 204 Exercise 2 Is i rue ha he parameerized
More informationProblem Set 5. Graduate Macro II, Spring 2017 The University of Notre Dame Professor Sims
Problem Se 5 Graduae Macro II, Spring 2017 The Universiy of Nore Dame Professor Sims Insrucions: You may consul wih oher members of he class, bu please make sure o urn in your own work. Where applicable,
More informationEchocardiography Project and Finite Fourier Series
Echocardiography Projec and Finie Fourier Series 1 U M An echocardiagram is a plo of how a porion of he hear moves as he funcion of ime over he one or more hearbea cycles If he hearbea repeas iself every
More informationES.1803 Topic 22 Notes Jeremy Orloff
ES.83 Topic Noes Jeremy Orloff Fourier series inroducion: coninued. Goals. Be able o compue he Fourier coefficiens of even or odd periodic funcion using he simplified formulas.. Be able o wrie and graph
More information( ) a system of differential equations with continuous parametrization ( T = R + These look like, respectively:
XIII. DIFFERENCE AND DIFFERENTIAL EQUATIONS Ofen funcions, or a sysem of funcion, are paramerized in erms of some variable, usually denoed as and inerpreed as ime. The variable is wrien as a funcion of
More information2. Nonlinear Conservation Law Equations
. Nonlinear Conservaion Law Equaions One of he clear lessons learned over recen years in sudying nonlinear parial differenial equaions is ha i is generally no wise o ry o aack a general class of nonlinear
More informationTwo Popular Bayesian Estimators: Particle and Kalman Filters. McGill COMP 765 Sept 14 th, 2017
Two Popular Bayesian Esimaors: Paricle and Kalman Filers McGill COMP 765 Sep 14 h, 2017 1 1 1, dx x Bel x u x P x z P Recall: Bayes Filers,,,,,,, 1 1 1 1 u z u x P u z u x z P Bayes z = observaion u =
More informationLinear Time-invariant systems, Convolution, and Cross-correlation
Linear Time-invarian sysems, Convoluion, and Cross-correlaion (1) Linear Time-invarian (LTI) sysem A sysem akes in an inpu funcion and reurns an oupu funcion. x() T y() Inpu Sysem Oupu y() = T[x()] An
More informationHOTELLING LOCATION MODEL
HOTELLING LOCATION MODEL THE LINEAR CITY MODEL The Example of Choosing only Locaion wihou Price Compeiion Le a be he locaion of rm and b is he locaion of rm. Assume he linear ransporaion cos equal o d,
More informationGMM - Generalized Method of Moments
GMM - Generalized Mehod of Momens Conens GMM esimaion, shor inroducion 2 GMM inuiion: Maching momens 2 3 General overview of GMM esimaion. 3 3. Weighing marix...........................................
More informationPhysics 235 Chapter 2. Chapter 2 Newtonian Mechanics Single Particle
Chaper 2 Newonian Mechanics Single Paricle In his Chaper we will review wha Newon s laws of mechanics ell us abou he moion of a single paricle. Newon s laws are only valid in suiable reference frames,
More informationModule 2 F c i k c s la l w a s o s f dif di fusi s o i n
Module Fick s laws of diffusion Fick s laws of diffusion and hin film soluion Adolf Fick (1855) proposed: d J α d d d J (mole/m s) flu (m /s) diffusion coefficien and (mole/m 3 ) concenraion of ions, aoms
More informationComputational Equivalence of Fixed Points and No Regret Algorithms, and Convergence to Equilibria
Compuaional Equivalence of Fixed Poins and No Regre Algorihms, and Convergence o Equilibria Elad Hazan IBM Almaden Research Cener 650 Harry Road San Jose, CA 95120 hazan@us.ibm.com Sayen Kale Compuer Science
More informationSystem of Linear Differential Equations
Sysem of Linear Differenial Equaions In "Ordinary Differenial Equaions" we've learned how o solve a differenial equaion for a variable, such as: y'k5$e K2$x =0 solve DE yx = K 5 2 ek2 x C_C1 2$y''C7$y
More informationNotes 04 largely plagiarized by %khc
Noes 04 largely plagiarized by %khc Convoluion Recap Some ricks: x() () =x() x() (, 0 )=x(, 0 ) R ț x() u() = x( )d x() () =ẋ() This hen ells us ha an inegraor has impulse response h() =u(), and ha a differeniaor
More informationThe average rate of change between two points on a function is d t
SM Dae: Secion: Objecive: The average rae of change beween wo poins on a funcion is d. For example, if he funcion ( ) represens he disance in miles ha a car has raveled afer hours, hen finding he slope
More informationLecture Notes 2. The Hilbert Space Approach to Time Series
Time Series Seven N. Durlauf Universiy of Wisconsin. Basic ideas Lecure Noes. The Hilber Space Approach o Time Series The Hilber space framework provides a very powerful language for discussing he relaionship
More informationChapter 2 The Derivative Applied Calculus 107. We ll need a rule for finding the derivative of a product so we don t have to multiply everything out.
Chaper The Derivaive Applie Calculus 107 Secion 4: Prouc an Quoien Rules The basic rules will le us ackle simple funcions. Bu wha happens if we nee he erivaive of a combinaion of hese funcions? Eample
More informationSection 4.4 Logarithmic Properties
Secion. Logarihmic Properies 5 Secion. Logarihmic Properies In he previous secion, we derived wo imporan properies of arihms, which allowed us o solve some asic eponenial and arihmic equaions. Properies
More information