CMSC 474, Introduction to Game Theory Maxmin and Minmax Strategies

Similar documents
1 Bertrand duopoly with incomplete information

(b) Is the game below solvable by iterated strict dominance? Does it have a unique Nash equilibrium?

Sociology 376 Exam 1 Spring 2011 Prof Montgomery

Game Theory. Greg Plaxton Theory in Programming Practice, Spring 2004 Department of Computer Science University of Texas at Austin

Zero-Sum Games Public Strategies Minimax Theorem and Nash Equilibria Appendix. Zero-Sum Games. Algorithmic Game Theory.

1. The F-test for Equality of Two Variances

Chapter 9. Mixed Extensions. 9.1 Mixed strategies

Computing Minmax; Dominance

Bogoliubov Transformation in Classical Mechanics

SOME RESULTS ON INFINITE POWER TOWERS

DIFFERENTIAL EQUATIONS

Introduction to Mechanism Design

Lecture 8: Period Finding: Simon s Problem over Z N

Department of Mechanical Engineering Massachusetts Institute of Technology Modeling, Dynamics and Control III Spring 2002

Computing Minmax; Dominance

Computing Solution Concepts of Normal-Form Games. Song Chong EE, KAIST

Social Studies 201 Notes for March 18, 2005

Assignment for Mathematics for Economists Fall 2016

Comparing Means: t-tests for Two Independent Samples

DIFFERENTIAL EQUATIONS Laplace Transforms. Paul Dawkins

Preemptive scheduling on a small number of hierarchical machines

μ + = σ = D 4 σ = D 3 σ = σ = All units in parts (a) and (b) are in V. (1) x chart: Center = μ = 0.75 UCL =

Quantitative Information Leakage. Lecture 9

Clustering Methods without Given Number of Clusters

V V V V. Vectors. Mike Bailey. Vectors have Direction and Magnitude. Magnitude: x y z. Computer Graphics.

V V The circumflex (^) tells us this is a unit vector

If Y is normally Distributed, then and 2 Y Y 10. σ σ

into a discrete time function. Recall that the table of Laplace/z-transforms is constructed by (i) selecting to get

Social Studies 201 Notes for November 14, 2003

Online Appendix for Corporate Control Activism

Notes on Strategic Substitutes and Complements in Global Games

Laplace Transformation

Lecture 21. The Lovasz splitting-off lemma Topics in Combinatorial Optimization April 29th, 2004

Quantum Games. Quantum Strategies in Classical Games. Presented by Yaniv Carmeli

Problem Set 8 Solutions

NAME (pinyin/italian)... MATRICULATION NUMBER... SIGNATURE

March 18, 2014 Academic Year 2013/14

Game Theory: Lecture 3

online learning Unit Workbook 4 RLC Transients

Secretary problems with competing employers

Math 273 Solutions to Review Problems for Exam 1

Non-myopic Strategies in Prediction Markets

6.302 Feedback Systems Recitation 6: Steady-State Errors Prof. Joel L. Dawson S -

At the end of this lesson, the students should be able to understand:

Unit I Review Worksheet Key

Electronic Theses and Dissertations

Lecture 10 Filtering: Applied Concepts

EconS Advanced Microeconomics II Handout on Bayesian Nash Equilibrium

EE Control Systems LECTURE 14

Introduction to Game Theory

Source slideplayer.com/fundamentals of Analytical Chemistry, F.J. Holler, S.R.Crouch. Chapter 6: Random Errors in Chemical Analysis

Design By Emulation (Indirect Method)

ONLINE APPENDIX: TESTABLE IMPLICATIONS OF TRANSLATION INVARIANCE AND HOMOTHETICITY: VARIATIONAL, MAXMIN, CARA AND CRRA PREFERENCES

IEOR 3106: Fall 2013, Professor Whitt Topics for Discussion: Tuesday, November 19 Alternating Renewal Processes and The Renewal Equation

OPTIMAL STOPPING FOR SHEPP S URN WITH RISK AVERSION

Fermi Distribution Function. n(e) T = 0 T > 0 E F

Given the following circuit with unknown initial capacitor voltage v(0): X(s) Immediately, we know that the transfer function H(s) is

Optimal Fiscal Policy. in a Business Cycle Model without Commitment

STOCHASTIC DIFFERENTIAL GAMES:THE LINEAR QUADRATIC ZERO SUM CASE

Mechanics Physics 151

NCAAPMT Calculus Challenge Challenge #3 Due: October 26, 2011

PhysicsAndMathsTutor.com

EC381/MN308 Probability and Some Statistics. Lecture 7 - Outline. Chapter Cumulative Distribution Function (CDF) Continuous Random Variables

Theoretical Computer Science. Optimal algorithms for online scheduling with bounded rearrangement at the end

Chapter 2: Problem Solutions

Random vs. Deterministic Deployment of Sensors in the Presence of Failures and Placement Errors

Game Theory. Professor Peter Cramton Economics 300

Games with Perfect Information

PHYSICS LAB Experiment 5 Fall 2004 FRICTION

TUTORIAL PROBLEMS 1 - SOLUTIONS RATIONAL CHEREDNIK ALGEBRAS

COHOMOLOGY AS A LOCAL-TO-GLOBAL BRIDGE

Algorithmic Game Theory and Applications. Lecture 4: 2-player zero-sum games, and the Minimax Theorem

Stochastic Neoclassical Growth Model

Online Appendix for Managerial Attention and Worker Performance by Marina Halac and Andrea Prat

ECE 3510 Root Locus Design Examples. PI To eliminate steady-state error (for constant inputs) & perfect rejection of constant disturbances

EXAM 4 -A2 MATH 261: Elementary Differential Equations MATH 261 FALL 2010 EXAMINATION COVER PAGE Professor Moseley

UNIT 15 RELIABILITY EVALUATION OF k-out-of-n AND STANDBY SYSTEMS

Math 201 Lecture 17: Discontinuous and Periodic Functions

ON THE APPROXIMATION ERROR IN HIGH DIMENSIONAL MODEL REPRESENTATION. Xiaoqun Wang

Lecture #9 Continuous time filter

Lecture 9: Shor s Algorithm

arxiv: v2 [math.nt] 30 Apr 2015

DYNAMICS OF ROTATIONAL MOTION

THE SPLITTING SUBSPACE CONJECTURE

Extension of Inagaki General Weighted Operators. and. A New Fusion Rule Class of Proportional Redistribution of Intersection Masses

Eigenvalues and eigenvectors

The Winding Path to RL

Homework 12 Solution - AME30315, Spring 2013

EXAM 4 -B2 MATH 261: Elementary Differential Equations MATH 261 FALL 2012 EXAMINATION COVER PAGE Professor Moseley

Chapter 4. The Laplace Transform Method

Lecture 7: Testing Distributions

Chapter Landscape of an Optimization Problem. Local Search. Coping With NP-Hardness. Gradient Descent: Vertex Cover

PHYS 110B - HW #2 Spring 2004, Solutions by David Pace Any referenced equations are from Griffiths Problem statements are paraphrased

MATEMATIK Datum: Tid: eftermiddag. A.Heintz Telefonvakt: Anders Martinsson Tel.:

Frictional Forces. Friction has its basis in surfaces that are not completely smooth: 1/29

Today: Linear Programming (con t.)

CHAPTER 6. Estimation

Control Systems Analysis and Design by the Root-Locus Method

Chapter 2 Sampling and Quantization. In order to investigate sampling and quantization, the difference between analog

Sample Problems. Lecture Notes Related Rates page 1

Transcription:

CMSC 474, Introduction to Game Theory Maxmin and Minmax Strategie Mohammad T. Hajiaghayi Univerity of Maryland

Wort-Cae Expected Utility For agent i, the wort-cae expected utility of a trategy i i the minimum over all poible combination of trategie for the other agent: Example: Battle of the Sexe ( ) min -i u i i, -i Wife trategy w = {(p, Opera), ( p, Football)} Huband trategy h = {(q, Opera), ( q, Football)} u w (p,q) = pq + ( p)( q) = 3pq p q + Huband Wife Opera Football Opera, 0, 0 Football 0, 0, For any fixed p, u w (p,q) i linear in q e.g., if p = ½, then u w (½,q) = ½ q + ½ 0 q, o the min mut be at q = 0 or q = e.g., min q (½ q + ½) i at q = 0 min q u w (p,q) = min (u w (p,0), u w (p,)) = min ( p, p) We can write u w (p,q) intead of u w ( w, h )

Maxmin Strategie Alo called maximin A maxmin trategy for agent i A trategy i that make i wort-cae expected utility a high a poible: Thi in t necearily unique Often it i mixed argmax min u i i, -i i -i ( ) Agent i maxmin value, or ecurity level, i the maxmin trategy wort-cae expected utility: For player it implifie to max i min u i ( i, -i ) -i max min u,

Wife and huband trategie Example w = {(p, Opera), ( p, Football)} h = {(q, Opera), ( q, Football)} Recall that wife wort-cae expected utility i min q u w (p,q) = min ( p, p) Find p that maximize it Max i at p = p, i.e., p = /3 Wife maxmin value i p = /3 Wife maxmin trategy i {(/3, Opera), (/3, Football)} Similarly, Huband maxmin value i /3 min q u w (p,q) Huband Wife Huband maxmin trategy i {(/3, Opera), (/3, Football)} p p Opera p Football Opera, 0, 0 Football 0, 0,

Quetion Why might an agent i want to ue a maxmin trategy?

Anwer Why might an agent i want to ue a maxmin trategy? Ueful if i i cautiou (want to maximize hi/her wort-cae utility) and doen t have any information about the other agent whether they are rational what their payoff are whether they draw their action choice from known ditribution Ueful if i ha reaon to believe that the other agent objective i to minimize i expected utility e.g., -player zero-um game (we dicu thi later in hi eion) Solution concept: maxmin trategy profile all player ue their maxmin trategie

Minmax Strategie (in -Player Game) Alo called minimax Minmax trategy and minmax value Dual of their maxmin counterpart Suppoe agent want to punih agent, regardle of how it affect agent own payoff Agent minmax trategy againt agent A trategy that minimize the expected utility of bet repone to argmin max u, ( ) Agent minmax value i maximum expected utility if agent play hi/her minmax trategy: min max u, ( ) Minmax trategy profile: both player ue their minmax trategie

Example Wife and huband trategie w = {(p, Opera), ( p, Football)} h = {(q, Opera), ( q, Football)} u h (p,q) = pq + ( p)( q) = 3pq p q + Given wife trategy p, huband expected utility i linear in q e.g., if p = ½, then u h (½,q) = ½ q + Max i at q = 0 or q = max q u h (p,q) = ( p, p) Find p that minimize thi Min i at p + = p p = /3 Huband/ minmax value i /3 Wife minmax trategy i {(/3, Opera), (/3, Football)} Huband Wife p Opera Football Opera, 0, 0 Football 0, 0, p p p

Minmax Strategie in n-agent Game In n-agent game (n > ), agent i uually can t minimize agent j payoff by acting unilaterally But uppoe all the agent gang up on agent j Let * j be a mixed-trategy profile that minimize j maximum payoff, i.e., * j arg min max u j j For every agent i j, a minmax trategy for i i i component of -j * j j, j Agent j minmax value i j maximum payoff againt j * * max u j j, - j j ( ) = min We have equality ince we jut replaced j * by it value above - j ( ) max j u j j, - j

Minimax Theorem (von Neumann, 98) Theorem. Let G be any finite two-player zero-um game. For each player i, i expected utility in any Nah equilibrium = i maxmin value = i minmax value In other word, for every Nah equilibrium ( *, *), u ( *, * ) min maxu (, ) max min u (, ) u ( *, * ) Corollary. For two-player zero-um game:{nah equilibria} = {maxmin trategy profile}= {minmax trategy profile} Note that thi i not neceary true for non-zero-um game a we aw for Battle of Sexe in previou lide Terminology: the value (or minmax value) of G i agent minmax value.

Maximin and Minimax via LP Let-u =u = u and let mixed trategie = x = x,, x k and = y = y,, y r, in which player ha k trategie and player ha r trategie. Then u x, y = i j x i y j u i,j = j y j i x i u i,j We want to find x which optimize v = max x min y Since player i doing hi bet repone (in min y i x i u i,j i minimized. Thu v = j i x i y j u i,j = ( j y j ) min j i x i u i,j = min j u(x,y) u(x,y) ) he et y j > 0 only if We have the following LP to find v and the firt player trategy x max v uch that v i x i u i,j for all j i x i = x i 0 i x i u i,j i x i u i,j for any j

Maximin and Minimax via LP Similarly by writing an LP for minimax value v = min y max u(x,y), we x can obtain the econd player trategy min v uch that v j y j u i,j for all i j y j = y j 0 Note that due to Minimax Theorem v = v (v v i trivial jut by definition). Alo (, )= (x, y) i a Nah equilibrium.

Example: Matching Pennie Head Tail Agent trategy: diplay head with probability p Agent trategy: diplay head with probability q u (p, q) = p q + ( p)( q) p( q) q( p) = p q + 4pq u (p, q) = u (p, q) u (p, q) Head,, Tail,, Want to how that {Nah equilibria} = {maxmin trategy profile} = {minmax trategy profile} = {(p = ½, q = ½)} p q

Example: Matching Pennie Head Tail Find Nah equilibria u (p, q) = p q + 4pq u (p, q) = u (p, q) If p = q = ½, then u = u = 0 If agent change to p ½ and agent keep q = ½, then u (p, ½) = p + p = 0 If agent change to q ½ and agent keep p = ½, then u (½, q) = ( q + q) = 0 Thu p = q = ½ i a Nah equilibrium u (p, q) Head,, Tail,, Are there any other? p q

Example: Matching Pennie Head Tail Show there are no other Nah equilibria u (p, q) = p q + 4pq u (p, q) = u (p, q) Conider any trategy profile (p, q) where p ½ or q ½ or both Several different cae, depending on the exact value of p and q In every one of them, either agent can increae u by changing p, or agent can increae u by changing q, or both So there are no other Nah equilibria u (p, q) Head,, Tail,, p q

Example: Matching Pennie Head Tail Find all maxmin trategy profile u (p, q) = p q + 4pq u (p, q) = u (p, q) Head,, Tail,, If agent trategy i p = ½ then regardle of value of q, u (½, q) = q + q = 0 If agent trategy i p > ½ then bet repone i q = 0 (ee the diagram) u (p, 0) = p < 0 If agent trategy i p < ½ then bet repone i q = u (p, ) = + p < 0 Thu ha one maxmin trategy: p = ½ u (p, q) p q Similarly, ha one maxmin trategy: q = ½

Example: Matching Pennie Head Tail Find all minmax trategy profile u (p, q) = p q + 4pq u (p, q) = u (p, q) Head,, Tail,, If agent trategy i p = ½ then regardle of value of q, u (½, q) = ( q + q) = 0 If agent trategy i p > ½ then bet repone i q = 0 (ee the diagram) u (p, 0) = ( p) > 0 If agent trategy i p < ½ then bet repone i q = u (p, ) = ( + p) > 0 Thu ha one minmax trategy: p = ½ u (p, q) p q Similarly, ha one minmax trategy: q = ½