Lecture 2 October ε-approximation of 2-player zero-sum games
|
|
- Crystal Malone
- 5 years ago
- Views:
Transcription
1 Opimizaion II Winer 009/10 Lecurer: Khaled Elbassioni Lecure Ocober 19 1 ε-approximaion of -player zero-sum games In his lecure we give a randomized ficiious play algorihm for obaining an approximae soluion for -player zero-sum games 11 marix games A -player zero-sum game, or a marix game, is defined by a marix A R m n, called he payoff marix There are wo players wih opposing ineress: he row player (minimizer) and he column player (maximizer) A each sep of he play, he row player selecs a row i, and he column player selecs a column j, hen he row player pays o he column player he value a ij of he (i, j)h enry in he marix Suppose ha he play coninues forever, how o play such a game? Example 1 Consider he payoff marix A = Le us see wha happens if he row player chooses row 1 Then, he column player (being a maximizer) will choose he firs column Bu hen he row player (being a minimizer) will swich o row 3 Then, he column player will find i more profiable o swich o column 3, afer which he row player will swich o row 1, resuling in a cycle! This siuaion describes wha is no an equilibrium Examining he above marix, he row maxima are 1, 0, and 1, respecively So if he row player chooses row 1, he column player would guaranee 1, if he row player chooses row, hen he column player would guaranee 0, and so on In general, if he row player chooses row i, hen he column player would guaranee max j a ij, and hus he row player should choose he row ha minimizes his maximum Similarly, since he row minima are 1, 0, 1, respecively, he column player can guaranee max j min i a ij = 0 Since i happens in his example ha hese wo values are equal, here will be an equilibrium if he row player sicks o playing he nd row and he column player sicks o playing he nd column An equilibrium or a saddle poin is a pair of sraegies for he wo players such ha no player has incenive o swich, assuming ha he oher player does no swich Bu is here always a saddle-poin in pure sraegies as in Example 1 The answer is NO as he following well-known example shows Example Consider he payoff marix [ A = Then min i max j a ij = 1 1 = max j min i a ij
2 Op II Lecure Ocober 19 Winer 009/10 So wha o do? Play wih mixed sraegies Tha is, he row player chooses a probabiliy vecor x S m = {x R m : e T x = 1, x 0}, where e denoes he vecor of all ones of he appropriae dimension, and plays row i wih probabiliy x i Similarly, he column vecor plays according o a probabiliy vecor y S n = {y R n : e T y = 1, y 0} Le us denoe by A 1,, A m he rows of A, by A 1,, A m he columns of A, and by e i he i-uni vecor wih he appropriae dimension Then he expeced value ha he row player would pay if she decided o play row i is j a ijy j = A i y = e T i Ay, and hence her expeced payoff would be i x ia i y = x T Ay Similarly, he expeced payoff of he column player is x T Ay For insance, in Example above, if boh players choose ( 1, 1 ) as heir sraegy, hen he expeced payoff for boh is 0 On he oher hand, if he row player chooses x = ( 1, ), while he column player chooses y = (, 1), he hen payoff is 1 Is any of hese wo pairs of sraegies an equilibrium? And does such an equilibrium exiss in general? The answer is YES as given by he following Theorem Theorem 1 (Von Neumann (198)) For any marix A R m n, min max x T Ay = max min x T Ay (1) x S m y S n x S m Definiion (Saddle poin) A saddle poin in a marix game wih payoff marix A R m n, is a pair of sraegies x S m and y S n such ha Such a pair will be also called an opimal pair y S n min x Sm x T Ay = max y S n (x ) T Ay () Exercise 1 (i) Show ha min x Sm max y Sn x T Ay max y Sn min x Sm x T Ay (ii) Show ha a pair of sraegies x S m and y S n are opimal if and only if for all i, j: (x ) T A j A i y I is worh noing ha a marix game is equivalen o a pair of packing-covering linear programs (LP s) Exercise Le v be he common value in he ideniy (1) Show ha v = min{v x T A ve T, x S m } = max{v Ay ve, y S n } Le ε [0, 1 be a given consan We are ineresed in ε-opimal sraegies, defined as follows Definiion 3 (ε-opimal sraegies) A pair of sraegies x S m and y S n is an ε-opimal pair for a marix game wih payoff marix A R m n if max(x ) T Ay min x T Ay + ε (3) y S n x S m In his lecure, we consider he problem of finding approximae saddle poins of marix games ɛ-approximaion of zero-sum games Inpu: A marix A R m n and a desired accuracy ε Oupu: A pair of sraegies x S m and y S n Objecive: x, y are ε-equilibria -
3 Op II Lecure Ocober 19 Winer 009/10 1 ficiious play Ficiious play is a mehod suggesed by Brown in 1951 [Bro51 o obain a saddle poin for a given marix game Ieraively, he minimizer and he maximizer mainain in X() Z m + and Y () Z n + he frequencies by which rows and columns have been used, respecively, upo ime of he play Then each player updaes his/her sraegy by applying he bes response, given he curren opponen s sraegy The procedure is given below Algorihm 1 ficiious play(a) 1 X(0) := 0 and Y (0) := 0 for = 1,, do 3 i := argmin{a 1 Y ( 1),, A m Y ( 1)}; X() := X( 1) + e i 4 j := argmax{x( 1) T A 1,, X( 1) T A n }; Y () := Y ( 1)+e j Noe ha a each, he vecors X() such pair of sraegies, x X() = lim [Rob51 A bound of ( m+n ρ ε are feasible sraegies The convergence of, y Y () = lim, was esablished by Robinson ) m+n, where ρ = maxi,j a ij, on he ime needed for and Y () convergence o an ε-pair was obained by Shapiro in 1958 [Sha58 The endency in he lieraure is o believe ha his ime is bounded by O( poly(n,m) ) A smoohed version of ε his ficiious play, inroduced in he nex secion, archives such a bound 13 Randomized ficiious play Grigoriadis and Khachiyan (1995) [GK95 inroduced a randomized version of ficiious play, in which he argmin and argmax operaors in seps 3 and 4 above are replaced by a smoohed selecion which picks a row (respecively, a column) wih probabiliy ha decreases (respecively, increases) quickly as he curren response of he opponen o his row (respecively, column) increases More precisely, given he curren vecors of frequencies X() Z m + and Y () Z n +, he row and column players choose, respecively, a row i and a column j according o he (so-called Gibbs) disribuions: p i () p() where p i() = e εa i Y ( 1) and p() = q j () q() where q j() = e εx( 1) T A j and q() = p i () (4) n q j () (5) Here is he algorihm This will be he heme of mos of he algorihms described in he lecures on packing and covering LP s j=1-3
4 Op II Lecure Ocober 19 Winer 009/10 Algorihm randomized ficiious play(a) 1 X(0) := 0 and Y (0) := 0 for = 1,,, T def do = 6 ln(nm) ε 3 Pick i [m and j [n independenly, wih probabiliies p i() and q j(), respecively p() q() 4 X() := X( 1) + e i ; Y () := Y ( 1) + e j 5 reurn ( X(), Y () ) I is he smoohing sep in line 3 ha makes i possible o prove beer bounds on he number of ieraions han hose currenly known for deerminisic ficiious play The analysis, here and in all he algorihms considered in subsequen lecures, will follow more or less he same framework: we define a poenial funcion Φ() = p( + 1) q( + 1), (6) and show ha i does no increase by much from one ieraion o he nex Then his implies, by ieraing, ha he expeced poenial afer ime seps is bounded by he iniial poenial muliplied by some facor, which migh depend exponenially on Since he iniial poenial is he sum of some non-negaive exponenials, each erm in he sum is bounded by he final poenial Taking logs allows us o bound he error in approximaion a ime, as a funcion of, and our choice of he erminaing ime T, when plugged in his funcion, guaranees o make he error less han ε as desired The proof we give here uses ideas from Grigoriadis and Khachiyan (1995) [GK95 and Koufogiannakis and Young [KY07 For he purpose of obaining an approximaion wih an absolue error, we will assume ha all he enries of he marix A are in some fixed range, say [ 1, 1 Scaling he marix A by 1, where he widh parameer ρ is defined as ρ = max ρ i,j a ij, and replacing ε by ε in wha follows, we ge an algorihm ha works wihou his assumpion, bu ρ whose running ime is proporional o ρ We noe ha such dependence on he widh is unavoidable in all known algorihms ha obain ε-approximae soluions and whose running ime is proporional o poly( 1 ) An excepion is when A is non-negaive in which ε his dependence can be removed as we shall see in a laer lecure Exercise 3 Show ha any marix game (1) can be convered ino an equivalen one in which each enry in he marix A is in [a, b, where a, b R Does he same reducion work if we are aiming a an ɛ-approximae saddle poin? Theorem 4 Assuming A [ 1, 1 m n, algorihm randomized ficiious play (n+m) log(n+m) oupus ε-opimal sraegies The oal running ime is O( ) ε Proof: The bound on he running ime is obvious So i remains o show ha he pair oupu by he algorihm is ε-opimal As menioned above, we analyze he change in he poenial funcion (6) Noe ha, due o he random choices of he algorihm, he poenial funcion is a random variable We will prove he following bound Lemma 5 For = 1,,, E[Φ() E[Φ( 1)(1 + ε 6 ) -4
5 Op II Lecure Ocober 19 Winer 009/10 Then by ieraion we ge ha E[Φ() E[Φ(0)(1 + ε ) Φ(0)e ε 3, where he las 6 inequaliy follows by using he inequaliy 1 + x e x, valid for all real x This implies by Markov s inequaliy ha, wih probabiliy a leas 1, afer ieraions, Φ() e ε 3 Φ(0) (7) Noe ha Φ() = i,j e εx()t A j εa i Y () Since each erm in his sum is non-negaive and he sum is bounded by e ε 3 Φ(0), we conclude ha each erm is also bounded by he same bound Taking logs and using Φ(0) = nm, we ge ha or εx() T A j X() T εa iy () A j A i Y () ln(nm) + ε 3 + ln(nm) ε + ε 3 Using he value of = T = 6 ln(nm) a he end of he las ieraion, we ge ha X()T A j ε Y () A i +ε, implying (see Exercise 1(ii)) ha he pair of sraegies oupu by he algorihm is ε-opimal I remains o prove Lemma 5 Proof of Lemma 5 Fix an ieraion Denoe by X = X() and Y = Y () he changes in he vecors X and Y in ieraion, ha is, in sep 4, we use he updaes X( + 1) = X() + X and Y ( + 1) = Y () + Y In he following, we condiion on he values of X() and Y () (so for he momen, we will hink ha he only random evens are hose in sep 3, and hence p(), q(), and φ() are all consans) Le p() = (p 1 (),, p m ()) and q() = (q 1 (),, q n ()) Then E[ X = p() p() and E[ Y = q() q() To esimae he change in Φ( 1), we esimae he changes in p() and q() p( + 1) = = p i ( + 1) = e εa i Y () = e εa i (Y ( 1)+ Y ) p i ()e εa i Y (8) Exercise 4 Show ha, for all δ [ 1, 1, eδ 1 + δ + 3 δ Noe ha εa i Y [ 1, 1 since we have assumed ha a ij 1 Thus he fac in Exercise 4, ogeher wih (8), implies ha p( + 1) [ p i () 1 εa i Y p i () [1 + ε 6 εa i Y + ε 6 (A i Y ) -5 = p() [1 + ε 6 εp()t A Y, p()
6 Op II Lecure Ocober 19 Winer 009/10 where in he second inequaliy we used again he assumpion ha a ij 1 and hence (A i Y ) 1 In fac, his is he only place where his assumpion plays a role in he analysis Taking he expecaion wih respec o Y, we ge by lineariy of expecaion E[ p( + 1) p() [1 + ε 6 εp()t Aq() p() q() Similarly, we can derive E[ q( + 1) q() Thus, using independence of X and Y, we have [1 + ε 6 + εq()t A T p() q() p() [ ( ) E[Φ() = E[ p( + 1) E[ q( + 1) p() q() 1 + ε + 6 ε ) ( ) (1 + ε q() T A T p() p()t Aq() ε q() T A T p() p()t Aq() 6 q() p() p() q() 4 q() p() p() q() Since q()t A T p() q() p() = p()t Aq() p() q(), we ge ha E[Φ() Φ( 1) (1 + ε 6 ) Recalling ha his expecaion was condiional on he values of X() and Y (), he lemma follows by aking he expecaion of boh sides of his inequaliy wih respec o X() and Y () -6
7 Bibliography [Bro51 George W Brown Ieraive soluion of games by ficiious play In: TC Koopmans, Edior, Aciviy Analysis of Producion and Allocaion, pages , 1951 [GK95 Michael D Grigoriadis and Leonid G Khachiyan A sublinear-ime randomized approximaion algorihm for marix games Operaions Research Leers, 18():53 58, 1995 [KY07 Chrisos Koufogiannakis and Neal E Young Beaing simplex for fracional packing and covering linear programs In 48h Annual IEEE Symposium on Foundaions of Compuer Science (FOCS), pages , 007 [Rob51 Julia Robinson An ieraive mehod of solving a game The Annals of Mahemaics, 54():96 301, 1951 [Sha58 Harold N Shapiro Noe on a compuaion mehod in he heory of games Communicaions on Pure and Applied Mahemaics, 11(4): ,
1 Review of Zero-Sum Games
COS 5: heoreical Machine Learning Lecurer: Rob Schapire Lecure #23 Scribe: Eugene Brevdo April 30, 2008 Review of Zero-Sum Games Las ime we inroduced a mahemaical model for wo player zero-sum games. Any
More informationCourse Notes for EE227C (Spring 2018): Convex Optimization and Approximation
Course Noes for EE7C Spring 018: Convex Opimizaion and Approximaion Insrucor: Moriz Hard Email: hard+ee7c@berkeley.edu Graduae Insrucor: Max Simchowiz Email: msimchow+ee7c@berkeley.edu Ocober 15, 018 3
More informationApproximation Algorithms for Unique Games via Orthogonal Separators
Approximaion Algorihms for Unique Games via Orhogonal Separaors Lecure noes by Konsanin Makarychev. Lecure noes are based on he papers [CMM06a, CMM06b, LM4]. Unique Games In hese lecure noes, we define
More informationOnline Convex Optimization Example And Follow-The-Leader
CSE599s, Spring 2014, Online Learning Lecure 2-04/03/2014 Online Convex Opimizaion Example And Follow-The-Leader Lecurer: Brendan McMahan Scribe: Sephen Joe Jonany 1 Review of Online Convex Opimizaion
More informationt is a basis for the solution space to this system, then the matrix having these solutions as columns, t x 1 t, x 2 t,... x n t x 2 t...
Mah 228- Fri Mar 24 5.6 Marix exponenials and linear sysems: The analogy beween firs order sysems of linear differenial equaions (Chaper 5) and scalar linear differenial equaions (Chaper ) is much sronger
More informationA Primal-Dual Type Algorithm with the O(1/t) Convergence Rate for Large Scale Constrained Convex Programs
PROC. IEEE CONFERENCE ON DECISION AND CONTROL, 06 A Primal-Dual Type Algorihm wih he O(/) Convergence Rae for Large Scale Consrained Convex Programs Hao Yu and Michael J. Neely Absrac This paper considers
More informationNotes for Lecture 17-18
U.C. Berkeley CS278: Compuaional Complexiy Handou N7-8 Professor Luca Trevisan April 3-8, 2008 Noes for Lecure 7-8 In hese wo lecures we prove he firs half of he PCP Theorem, he Amplificaion Lemma, up
More informationPENALIZED LEAST SQUARES AND PENALIZED LIKELIHOOD
PENALIZED LEAST SQUARES AND PENALIZED LIKELIHOOD HAN XIAO 1. Penalized Leas Squares Lasso solves he following opimizaion problem, ˆβ lasso = arg max β R p+1 1 N y i β 0 N x ij β j β j (1.1) for some 0.
More informationLecture 9: September 25
0-725: Opimizaion Fall 202 Lecure 9: Sepember 25 Lecurer: Geoff Gordon/Ryan Tibshirani Scribes: Xuezhi Wang, Subhodeep Moira, Abhimanu Kumar Noe: LaTeX emplae couresy of UC Berkeley EECS dep. Disclaimer:
More informationHamilton- J acobi Equation: Weak S olution We continue the study of the Hamilton-Jacobi equation:
M ah 5 7 Fall 9 L ecure O c. 4, 9 ) Hamilon- J acobi Equaion: Weak S oluion We coninue he sudy of he Hamilon-Jacobi equaion: We have shown ha u + H D u) = R n, ) ; u = g R n { = }. ). In general we canno
More informationMATH 5720: Gradient Methods Hung Phan, UMass Lowell October 4, 2018
MATH 5720: Gradien Mehods Hung Phan, UMass Lowell Ocober 4, 208 Descen Direcion Mehods Consider he problem min { f(x) x R n}. The general descen direcions mehod is x k+ = x k + k d k where x k is he curren
More informationVehicle Arrival Models : Headway
Chaper 12 Vehicle Arrival Models : Headway 12.1 Inroducion Modelling arrival of vehicle a secion of road is an imporan sep in raffic flow modelling. I has imporan applicaion in raffic flow simulaion where
More informationFinal Spring 2007
.615 Final Spring 7 Overview The purpose of he final exam is o calculae he MHD β limi in a high-bea oroidal okamak agains he dangerous n = 1 exernal ballooning-kink mode. Effecively, his corresponds o
More informationT L. t=1. Proof of Lemma 1. Using the marginal cost accounting in Equation(4) and standard arguments. t )+Π RB. t )+K 1(Q RB
Elecronic Companion EC.1. Proofs of Technical Lemmas and Theorems LEMMA 1. Le C(RB) be he oal cos incurred by he RB policy. Then we have, T L E[C(RB)] 3 E[Z RB ]. (EC.1) Proof of Lemma 1. Using he marginal
More informationExamples of Dynamic Programming Problems
M.I.T. 5.450-Fall 00 Sloan School of Managemen Professor Leonid Kogan Examples of Dynamic Programming Problems Problem A given quaniy X of a single resource is o be allocaed opimally among N producion
More informationLinear Response Theory: The connection between QFT and experiments
Phys540.nb 39 3 Linear Response Theory: The connecion beween QFT and experimens 3.1. Basic conceps and ideas Q: How do we measure he conduciviy of a meal? A: we firs inroduce a weak elecric field E, and
More informationChristos Papadimitriou & Luca Trevisan November 22, 2016
U.C. Bereley CS170: Algorihms Handou LN-11-22 Chrisos Papadimiriou & Luca Trevisan November 22, 2016 Sreaming algorihms In his lecure and he nex one we sudy memory-efficien algorihms ha process a sream
More informationFinish reading Chapter 2 of Spivak, rereading earlier sections as necessary. handout and fill in some missing details!
MAT 257, Handou 6: Ocober 7-2, 20. I. Assignmen. Finish reading Chaper 2 of Spiva, rereading earlier secions as necessary. handou and fill in some missing deails! II. Higher derivaives. Also, read his
More informationRandom Walk with Anti-Correlated Steps
Random Walk wih Ani-Correlaed Seps John Noga Dirk Wagner 2 Absrac We conjecure he expeced value of random walks wih ani-correlaed seps o be exacly. We suppor his conjecure wih 2 plausibiliy argumens and
More informationOnline Learning Applications
Online Learning Applicaions Sepember 19, 2016 In he las lecure we saw he following guaranee for minimizing misakes wih Randomized Weighed Majoriy (RWM). Theorem 1 Le M be misakes of RWM and M i he misakes
More informationMath Week 14 April 16-20: sections first order systems of linear differential equations; 7.4 mass-spring systems.
Mah 2250-004 Week 4 April 6-20 secions 7.-7.3 firs order sysems of linear differenial equaions; 7.4 mass-spring sysems. Mon Apr 6 7.-7.2 Sysems of differenial equaions (7.), and he vecor Calculus we need
More informationSimulation-Solving Dynamic Models ABE 5646 Week 2, Spring 2010
Simulaion-Solving Dynamic Models ABE 5646 Week 2, Spring 2010 Week Descripion Reading Maerial 2 Compuer Simulaion of Dynamic Models Finie Difference, coninuous saes, discree ime Simple Mehods Euler Trapezoid
More information3.1.3 INTRODUCTION TO DYNAMIC OPTIMIZATION: DISCRETE TIME PROBLEMS. A. The Hamiltonian and First-Order Conditions in a Finite Time Horizon
3..3 INRODUCION O DYNAMIC OPIMIZAION: DISCREE IME PROBLEMS A. he Hamilonian and Firs-Order Condiions in a Finie ime Horizon Define a new funcion, he Hamilonian funcion, H. H he change in he oal value of
More informationChapter 7: Solving Trig Equations
Haberman MTH Secion I: The Trigonomeric Funcions Chaper 7: Solving Trig Equaions Le s sar by solving a couple of equaions ha involve he sine funcion EXAMPLE a: Solve he equaion sin( ) The inverse funcions
More informationAn Introduction to Malliavin calculus and its applications
An Inroducion o Malliavin calculus and is applicaions Lecure 5: Smoohness of he densiy and Hörmander s heorem David Nualar Deparmen of Mahemaics Kansas Universiy Universiy of Wyoming Summer School 214
More informationOn Measuring Pro-Poor Growth. 1. On Various Ways of Measuring Pro-Poor Growth: A Short Review of the Literature
On Measuring Pro-Poor Growh 1. On Various Ways of Measuring Pro-Poor Growh: A Shor eview of he Lieraure During he pas en years or so here have been various suggesions concerning he way one should check
More informationOnline Appendix to Solution Methods for Models with Rare Disasters
Online Appendix o Soluion Mehods for Models wih Rare Disasers Jesús Fernández-Villaverde and Oren Levinal In his Online Appendix, we presen he Euler condiions of he model, we develop he pricing Calvo block,
More informationOnline Appendix for "Customer Recognition in. Experience versus Inspection Good Markets"
Online Appendix for "Cusomer Recogniion in Experience versus Inspecion Good Markes" Bing Jing Cheong Kong Graduae School of Business Beijing, 0078, People s Republic of China, bjing@ckgsbeducn November
More informationThe Arcsine Distribution
The Arcsine Disribuion Chris H. Rycrof Ocober 6, 006 A common heme of he class has been ha he saisics of single walker are ofen very differen from hose of an ensemble of walkers. On he firs homework, we
More informationTom Heskes and Onno Zoeter. Presented by Mark Buller
Tom Heskes and Onno Zoeer Presened by Mark Buller Dynamic Bayesian Neworks Direced graphical models of sochasic processes Represen hidden and observed variables wih differen dependencies Generalize Hidden
More informationLecture 4 Notes (Little s Theorem)
Lecure 4 Noes (Lile s Theorem) This lecure concerns one of he mos imporan (and simples) heorems in Queuing Theory, Lile s Theorem. More informaion can be found in he course book, Bersekas & Gallagher,
More information10. State Space Methods
. Sae Space Mehods. Inroducion Sae space modelling was briefly inroduced in chaper. Here more coverage is provided of sae space mehods before some of heir uses in conrol sysem design are covered in he
More informationChapter 6. Systems of First Order Linear Differential Equations
Chaper 6 Sysems of Firs Order Linear Differenial Equaions We will only discuss firs order sysems However higher order sysems may be made ino firs order sysems by a rick shown below We will have a sligh
More informationExpert Advice for Amateurs
Exper Advice for Amaeurs Ernes K. Lai Online Appendix - Exisence of Equilibria The analysis in his secion is performed under more general payoff funcions. Wihou aking an explici form, he payoffs of he
More informationSolutions Problem Set 3 Macro II (14.452)
Soluions Problem Se 3 Macro II (14.452) Francisco A. Gallego 04/27/2005 1 Q heory of invesmen in coninuous ime and no uncerainy Consider he in nie horizon model of a rm facing adjusmen coss o invesmen.
More informationE β t log (C t ) + M t M t 1. = Y t + B t 1 P t. B t 0 (3) v t = P tc t M t Question 1. Find the FOC s for an optimum in the agent s problem.
Noes, M. Krause.. Problem Se 9: Exercise on FTPL Same model as in paper and lecure, only ha one-period govenmen bonds are replaced by consols, which are bonds ha pay one dollar forever. I has curren marke
More informationEconomics 8105 Macroeconomic Theory Recitation 6
Economics 8105 Macroeconomic Theory Reciaion 6 Conor Ryan Ocober 11h, 2016 Ouline: Opimal Taxaion wih Governmen Invesmen 1 Governmen Expendiure in Producion In hese noes we will examine a model in which
More informationINTRODUCTION TO MACHINE LEARNING 3RD EDITION
ETHEM ALPAYDIN The MIT Press, 2014 Lecure Slides for INTRODUCTION TO MACHINE LEARNING 3RD EDITION alpaydin@boun.edu.r hp://www.cmpe.boun.edu.r/~ehem/i2ml3e CHAPTER 2: SUPERVISED LEARNING Learning a Class
More informationGames Against Nature
Advanced Course in Machine Learning Spring 2010 Games Agains Naure Handous are joinly prepared by Shie Mannor and Shai Shalev-Shwarz In he previous lecures we alked abou expers in differen seups and analyzed
More informationOBJECTIVES OF TIME SERIES ANALYSIS
OBJECTIVES OF TIME SERIES ANALYSIS Undersanding he dynamic or imedependen srucure of he observaions of a single series (univariae analysis) Forecasing of fuure observaions Asceraining he leading, lagging
More information20. Applications of the Genetic-Drift Model
0. Applicaions of he Geneic-Drif Model 1) Deermining he probabiliy of forming any paricular combinaion of genoypes in he nex generaion: Example: If he parenal allele frequencies are p 0 = 0.35 and q 0
More information) were both constant and we brought them from under the integral.
YIELD-PER-RECRUIT (coninued The yield-per-recrui model applies o a cohor, bu we saw in he Age Disribuions lecure ha he properies of a cohor do no apply in general o a collecion of cohors, which is wha
More informationTwo Popular Bayesian Estimators: Particle and Kalman Filters. McGill COMP 765 Sept 14 th, 2017
Two Popular Bayesian Esimaors: Paricle and Kalman Filers McGill COMP 765 Sep 14 h, 2017 1 1 1, dx x Bel x u x P x z P Recall: Bayes Filers,,,,,,, 1 1 1 1 u z u x P u z u x z P Bayes z = observaion u =
More informationThe Brock-Mirman Stochastic Growth Model
c December 3, 208, Chrisopher D. Carroll BrockMirman The Brock-Mirman Sochasic Growh Model Brock and Mirman (972) provided he firs opimizing growh model wih unpredicable (sochasic) shocks. The social planner
More informationMath 10B: Mock Mid II. April 13, 2016
Name: Soluions Mah 10B: Mock Mid II April 13, 016 1. ( poins) Sae, wih jusificaion, wheher he following saemens are rue or false. (a) If a 3 3 marix A saisfies A 3 A = 0, hen i canno be inverible. True.
More informationLecture 33: November 29
36-705: Inermediae Saisics Fall 2017 Lecurer: Siva Balakrishnan Lecure 33: November 29 Today we will coninue discussing he boosrap, and hen ry o undersand why i works in a simple case. In he las lecure
More informationSome Ramsey results for the n-cube
Some Ramsey resuls for he n-cube Ron Graham Universiy of California, San Diego Jozsef Solymosi Universiy of Briish Columbia, Vancouver, Canada Absrac In his noe we esablish a Ramsey-ype resul for cerain
More informationLearning a Class from Examples. Training set X. Class C 1. Class C of a family car. Output: Input representation: x 1 : price, x 2 : engine power
Alpaydin Chaper, Michell Chaper 7 Alpaydin slides are in urquoise. Ehem Alpaydin, copyrigh: The MIT Press, 010. alpaydin@boun.edu.r hp://www.cmpe.boun.edu.r/ ehem/imle All oher slides are based on Michell.
More informationL07. KALMAN FILTERING FOR NON-LINEAR SYSTEMS. NA568 Mobile Robotics: Methods & Algorithms
L07. KALMAN FILTERING FOR NON-LINEAR SYSTEMS NA568 Mobile Roboics: Mehods & Algorihms Today s Topic Quick review on (Linear) Kalman Filer Kalman Filering for Non-Linear Sysems Exended Kalman Filer (EKF)
More informationChapter 3 Boundary Value Problem
Chaper 3 Boundary Value Problem A boundary value problem (BVP) is a problem, ypically an ODE or a PDE, which has values assigned on he physical boundary of he domain in which he problem is specified. Le
More informationSeminar 4: Hotelling 2
Seminar 4: Hoelling 2 November 3, 211 1 Exercise Par 1 Iso-elasic demand A non renewable resource of a known sock S can be exraced a zero cos. Demand for he resource is of he form: D(p ) = p ε ε > A a
More informationMath 334 Fall 2011 Homework 11 Solutions
Dec. 2, 2 Mah 334 Fall 2 Homework Soluions Basic Problem. Transform he following iniial value problem ino an iniial value problem for a sysem: u + p()u + q() u g(), u() u, u () v. () Soluion. Le v u. Then
More informationInventory Analysis and Management. Multi-Period Stochastic Models: Optimality of (s, S) Policy for K-Convex Objective Functions
Muli-Period Sochasic Models: Opimali of (s, S) Polic for -Convex Objecive Funcions Consider a seing similar o he N-sage newsvendor problem excep ha now here is a fixed re-ordering cos (> 0) for each (re-)order.
More informationApplication of a Stochastic-Fuzzy Approach to Modeling Optimal Discrete Time Dynamical Systems by Using Large Scale Data Processing
Applicaion of a Sochasic-Fuzzy Approach o Modeling Opimal Discree Time Dynamical Sysems by Using Large Scale Daa Processing AA WALASZE-BABISZEWSA Deparmen of Compuer Engineering Opole Universiy of Technology
More informationLecture Notes 5: Investment
Lecure Noes 5: Invesmen Zhiwei Xu (xuzhiwei@sju.edu.cn) Invesmen decisions made by rms are one of he mos imporan behaviors in he economy. As he invesmen deermines how he capials accumulae along he ime,
More informationAppendix to Online l 1 -Dictionary Learning with Application to Novel Document Detection
Appendix o Online l -Dicionary Learning wih Applicaion o Novel Documen Deecion Shiva Prasad Kasiviswanahan Huahua Wang Arindam Banerjee Prem Melville A Background abou ADMM In his secion, we give a brief
More informationPolicy regimes Theory
Advanced Moneary Theory and Policy EPOS 2012/13 Policy regimes Theory Giovanni Di Barolomeo giovanni.dibarolomeo@uniroma1.i The moneary policy regime The simple model: x = - s (i - p e ) + x e + e D p
More informationSliding Mode Extremum Seeking Control for Linear Quadratic Dynamic Game
Sliding Mode Exremum Seeking Conrol for Linear Quadraic Dynamic Game Yaodong Pan and Ümi Özgüner ITS Research Group, AIST Tsukuba Eas Namiki --, Tsukuba-shi,Ibaraki-ken 5-856, Japan e-mail: pan.yaodong@ais.go.jp
More informationSupplement for Stochastic Convex Optimization: Faster Local Growth Implies Faster Global Convergence
Supplemen for Sochasic Convex Opimizaion: Faser Local Growh Implies Faser Global Convergence Yi Xu Qihang Lin ianbao Yang Proof of heorem heorem Suppose Assumpion holds and F (w) obeys he LGC (6) Given
More informationEssential Microeconomics : OPTIMAL CONTROL 1. Consider the following class of optimization problems
Essenial Microeconomics -- 6.5: OPIMAL CONROL Consider he following class of opimizaion problems Max{ U( k, x) + U+ ( k+ ) k+ k F( k, x)}. { x, k+ } = In he language of conrol heory, he vecor k is he vecor
More informationOn Boundedness of Q-Learning Iterates for Stochastic Shortest Path Problems
MATHEMATICS OF OPERATIONS RESEARCH Vol. 38, No. 2, May 2013, pp. 209 227 ISSN 0364-765X (prin) ISSN 1526-5471 (online) hp://dx.doi.org/10.1287/moor.1120.0562 2013 INFORMS On Boundedness of Q-Learning Ieraes
More informationSOLUTIONS TO ECE 3084
SOLUTIONS TO ECE 384 PROBLEM 2.. For each sysem below, specify wheher or no i is: (i) memoryless; (ii) causal; (iii) inverible; (iv) linear; (v) ime invarian; Explain your reasoning. If he propery is no
More informationLAPLACE TRANSFORM AND TRANSFER FUNCTION
CHBE320 LECTURE V LAPLACE TRANSFORM AND TRANSFER FUNCTION Professor Dae Ryook Yang Spring 2018 Dep. of Chemical and Biological Engineering 5-1 Road Map of he Lecure V Laplace Transform and Transfer funcions
More informationMatrix Versions of Some Refinements of the Arithmetic-Geometric Mean Inequality
Marix Versions of Some Refinemens of he Arihmeic-Geomeric Mean Inequaliy Bao Qi Feng and Andrew Tonge Absrac. We esablish marix versions of refinemens due o Alzer ], Carwrigh and Field 4], and Mercer 5]
More informationNotes on Kalman Filtering
Noes on Kalman Filering Brian Borchers and Rick Aser November 7, Inroducion Daa Assimilaion is he problem of merging model predicions wih acual measuremens of a sysem o produce an opimal esimae of he curren
More informationMath 333 Problem Set #2 Solution 14 February 2003
Mah 333 Problem Se #2 Soluion 14 February 2003 A1. Solve he iniial value problem dy dx = x2 + e 3x ; 2y 4 y(0) = 1. Soluion: This is separable; we wrie 2y 4 dy = x 2 + e x dx and inegrae o ge The iniial
More informationGuest Lectures for Dr. MacFarlane s EE3350 Part Deux
Gues Lecures for Dr. MacFarlane s EE3350 Par Deux Michael Plane Mon., 08-30-2010 Wrie name in corner. Poin ou his is a review, so I will go faser. Remind hem o go lisen o online lecure abou geing an A
More informationOptimality Conditions for Unconstrained Problems
62 CHAPTER 6 Opimaliy Condiions for Unconsrained Problems 1 Unconsrained Opimizaion 11 Exisence Consider he problem of minimizing he funcion f : R n R where f is coninuous on all of R n : P min f(x) x
More informationVanishing Viscosity Method. There are another instructive and perhaps more natural discontinuous solutions of the conservation law
Vanishing Viscosiy Mehod. There are anoher insrucive and perhaps more naural disconinuous soluions of he conservaion law (1 u +(q(u x 0, he so called vanishing viscosiy mehod. This mehod consiss in viewing
More informationMATH 128A, SUMMER 2009, FINAL EXAM SOLUTION
MATH 28A, SUMME 2009, FINAL EXAM SOLUTION BENJAMIN JOHNSON () (8 poins) [Lagrange Inerpolaion] (a) (4 poins) Le f be a funcion defined a some real numbers x 0,..., x n. Give a defining equaion for he Lagrange
More informationGeorey E. Hinton. University oftoronto. Technical Report CRG-TR February 22, Abstract
Parameer Esimaion for Linear Dynamical Sysems Zoubin Ghahramani Georey E. Hinon Deparmen of Compuer Science Universiy oftorono 6 King's College Road Torono, Canada M5S A4 Email: zoubin@cs.orono.edu Technical
More informationStationary Distribution. Design and Analysis of Algorithms Andrei Bulatov
Saionary Disribuion Design and Analysis of Algorihms Andrei Bulaov Algorihms Markov Chains 34-2 Classificaion of Saes k By P we denoe he (i,j)-enry of i, j Sae is accessible from sae if 0 for some k 0
More informationModal identification of structures from roving input data by means of maximum likelihood estimation of the state space model
Modal idenificaion of srucures from roving inpu daa by means of maximum likelihood esimaion of he sae space model J. Cara, J. Juan, E. Alarcón Absrac The usual way o perform a forced vibraion es is o fix
More informationLearning a Class from Examples. Training set X. Class C 1. Class C of a family car. Output: Input representation: x 1 : price, x 2 : engine power
Alpaydin Chaper, Michell Chaper 7 Alpaydin slides are in urquoise. Ehem Alpaydin, copyrigh: The MIT Press, 010. alpaydin@boun.edu.r hp://www.cmpe.boun.edu.r/ ehem/imle All oher slides are based on Michell.
More informationFinal Exam. Tuesday, December hours
San Francisco Sae Universiy Michael Bar ECON 560 Fall 03 Final Exam Tuesday, December 7 hours Name: Insrucions. This is closed book, closed noes exam.. No calculaors of any kind are allowed. 3. Show all
More informationEE650R: Reliability Physics of Nanoelectronic Devices Lecture 9:
EE65R: Reliabiliy Physics of anoelecronic Devices Lecure 9: Feaures of Time-Dependen BTI Degradaion Dae: Sep. 9, 6 Classnoe Lufe Siddique Review Animesh Daa 9. Background/Review: BTI is observed when he
More informationTechnical Report Doc ID: TR March-2013 (Last revision: 23-February-2016) On formulating quadratic functions in optimization models.
Technical Repor Doc ID: TR--203 06-March-203 (Las revision: 23-Februar-206) On formulaing quadraic funcions in opimizaion models. Auhor: Erling D. Andersen Convex quadraic consrains quie frequenl appear
More informationOnline Learning with Partial Feedback. 1 Online Mirror Descent with Estimated Gradient
Avance Course in Machine Learning Spring 2010 Online Learning wih Parial Feeback Hanous are joinly prepare by Shie Mannor an Shai Shalev-Shwarz In previous lecures we alke abou he general framework of
More informationFINM 6900 Finance Theory
FINM 6900 Finance Theory Universiy of Queensland Lecure Noe 4 The Lucas Model 1. Inroducion In his lecure we consider a simple endowmen economy in which an unspecified number of raional invesors rade asses
More informationVariational Iteration Method for Solving System of Fractional Order Ordinary Differential Equations
IOSR Journal of Mahemaics (IOSR-JM) e-issn: 2278-5728, p-issn: 2319-765X. Volume 1, Issue 6 Ver. II (Nov - Dec. 214), PP 48-54 Variaional Ieraion Mehod for Solving Sysem of Fracional Order Ordinary Differenial
More informationR t. C t P t. + u t. C t = αp t + βr t + v t. + β + w t
Exercise 7 C P = α + β R P + u C = αp + βr + v (a) (b) C R = α P R + β + w (c) Assumpions abou he disurbances u, v, w : Classical assumions on he disurbance of one of he equaions, eg. on (b): E(v v s P,
More information- If one knows that a magnetic field has a symmetry, one may calculate the magnitude of B by use of Ampere s law: The integral of scalar product
11.1 APPCATON OF AMPEE S AW N SYMMETC MAGNETC FEDS - f one knows ha a magneic field has a symmery, one may calculae he magniude of by use of Ampere s law: The inegral of scalar produc Closed _ pah * d
More informationOptima and Equilibria for Traffic Flow on a Network
Opima and Equilibria for Traffic Flow on a Nework Albero Bressan Deparmen of Mahemaics, Penn Sae Universiy bressan@mah.psu.edu Albero Bressan (Penn Sae) Opima and equilibria for raffic flow 1 / 1 A Traffic
More informationLecture 10 Estimating Nonlinear Regression Models
Lecure 0 Esimaing Nonlinear Regression Models References: Greene, Economeric Analysis, Chaper 0 Consider he following regression model: y = f(x, β) + ε =,, x is kx for each, β is an rxconsan vecor, ε is
More informationInternational Journal of Scientific & Engineering Research, Volume 4, Issue 10, October ISSN
Inernaional Journal of Scienific & Engineering Research, Volume 4, Issue 10, Ocober-2013 900 FUZZY MEAN RESIDUAL LIFE ORDERING OF FUZZY RANDOM VARIABLES J. EARNEST LAZARUS PIRIYAKUMAR 1, A. YAMUNA 2 1.
More information6.2 Transforms of Derivatives and Integrals.
SEC. 6.2 Transforms of Derivaives and Inegrals. ODEs 2 3 33 39 23. Change of scale. If l( f ()) F(s) and c is any 33 45 APPLICATION OF s-shifting posiive consan, show ha l( f (c)) F(s>c)>c (Hin: In Probs.
More informationLecture 2-1 Kinematics in One Dimension Displacement, Velocity and Acceleration Everything in the world is moving. Nothing stays still.
Lecure - Kinemaics in One Dimension Displacemen, Velociy and Acceleraion Everyhing in he world is moving. Nohing says sill. Moion occurs a all scales of he universe, saring from he moion of elecrons in
More informationAnnouncements: Warm-up Exercise:
Fri Apr 13 7.1 Sysems of differenial equaions - o model muli-componen sysems via comparmenal analysis hp//en.wikipedia.org/wiki/muli-comparmen_model Announcemens Warm-up Exercise Here's a relaively simple
More informationEE 330 Lecture 23. Small Signal Analysis Small Signal Modelling
EE 330 Lecure 23 Small Signal Analysis Small Signal Modelling Exam 2 Friday March 9 Exam 3 Friday April 13 Review Session for Exam 2: 6:00 p.m. on Thursday March 8 in Room Sweeney 1116 Review from Las
More informationMATH 31B: MIDTERM 2 REVIEW. x 2 e x2 2x dx = 1. ue u du 2. x 2 e x2 e x2] + C 2. dx = x ln(x) 2 2. ln x dx = x ln x x + C. 2, or dx = 2u du.
MATH 3B: MIDTERM REVIEW JOE HUGHES. Inegraion by Pars. Evaluae 3 e. Soluion: Firs make he subsiuion u =. Then =, hence 3 e = e = ue u Now inegrae by pars o ge ue u = ue u e u + C and subsiue he definiion
More informationAn introduction to the theory of SDDP algorithm
An inroducion o he heory of SDDP algorihm V. Leclère (ENPC) Augus 1, 2014 V. Leclère Inroducion o SDDP Augus 1, 2014 1 / 21 Inroducion Large scale sochasic problem are hard o solve. Two ways of aacking
More informationProblem Set #3: AK models
Universiy of Warwick EC9A2 Advanced Macroeconomic Analysis Problem Se #3: AK models Jorge F. Chavez December 3, 2012 Problem 1 Consider a compeiive economy, in which he level of echnology, which is exernal
More informationState-Space Models. Initialization, Estimation and Smoothing of the Kalman Filter
Sae-Space Models Iniializaion, Esimaion and Smoohing of he Kalman Filer Iniializaion of he Kalman Filer The Kalman filer shows how o updae pas predicors and he corresponding predicion error variances when
More informationComments on Window-Constrained Scheduling
Commens on Window-Consrained Scheduling Richard Wes Member, IEEE and Yuing Zhang Absrac This shor repor clarifies he behavior of DWCS wih respec o Theorem 3 in our previously published paper [1], and describes
More informationRobotics I. April 11, The kinematics of a 3R spatial robot is specified by the Denavit-Hartenberg parameters in Tab. 1.
Roboics I April 11, 017 Exercise 1 he kinemaics of a 3R spaial robo is specified by he Denavi-Harenberg parameers in ab 1 i α i d i a i θ i 1 π/ L 1 0 1 0 0 L 3 0 0 L 3 3 able 1: able of DH parameers of
More informationTwo Coupled Oscillators / Normal Modes
Lecure 3 Phys 3750 Two Coupled Oscillaors / Normal Modes Overview and Moivaion: Today we ake a small, bu significan, sep owards wave moion. We will no ye observe waves, bu his sep is imporan in is own
More informationGMM - Generalized Method of Moments
GMM - Generalized Mehod of Momens Conens GMM esimaion, shor inroducion 2 GMM inuiion: Maching momens 2 3 General overview of GMM esimaion. 3 3. Weighing marix...........................................
More informationR.#W.#Erickson# Department#of#Electrical,#Computer,#and#Energy#Engineering# University#of#Colorado,#Boulder#
.#W.#Erickson# Deparmen#of#Elecrical,#Compuer,#and#Energy#Engineering# Universiy#of#Colorado,#Boulder# Chaper 2 Principles of Seady-Sae Converer Analysis 2.1. Inroducion 2.2. Inducor vol-second balance,
More informationStochastic models and their distributions
Sochasic models and heir disribuions Couning cusomers Suppose ha n cusomers arrive a a grocery a imes, say T 1,, T n, each of which akes any real number in he inerval (, ) equally likely The values T 1,,
More informationPredator - Prey Model Trajectories and the nonlinear conservation law
Predaor - Prey Model Trajecories and he nonlinear conservaion law James K. Peerson Deparmen of Biological Sciences and Deparmen of Mahemaical Sciences Clemson Universiy Ocober 28, 213 Ouline Drawing Trajecories
More information