CS286r Assign One. Answer Key

Similar documents
The Second Anti-Mathima on Game Theory

Online Appendix. t=1 (p t w)q t. Then the first order condition shows that

Hila Etzion. Min-Seok Pang

U.C. Berkeley CS294: Beyond Worst-Case Analysis Luca Trevisan September 5, 2017

2.3 Nilpotent endomorphisms

Module 9. Lecture 6. Duality in Assignment Problems

Endogenous timing in a mixed oligopoly consisting of a single public firm and foreign competitors. Abstract

Vickrey Auction VCG Combinatorial Auctions. Mechanism Design. Algorithms and Data Structures. Winter 2016

In the figure below, the point d indicates the location of the consumer that is under competition. Transportation costs are given by td.

Lecture 3: Probability Distributions

Equilibrium with Complete Markets. Instructor: Dmytro Hryshko

COS 521: Advanced Algorithms Game Theory and Linear Programming

Bayesian epistemology II: Arguments for Probabilism

Problem Set 2 Solutions

Economics 101. Lecture 4 - Equilibrium and Efficiency

xp(x µ) = 0 p(x = 0 µ) + 1 p(x = 1 µ) = µ

Singular Value Decomposition: Theory and Applications

COS 511: Theoretical Machine Learning. Lecturer: Rob Schapire Lecture #16 Scribe: Yannan Wang April 3, 2014

Lecture 4. Instructor: Haipeng Luo

Assortment Optimization under MNL

COS 511: Theoretical Machine Learning. Lecturer: Rob Schapire Lecture # 15 Scribe: Jieming Mao April 1, 2013

2E Pattern Recognition Solutions to Introduction to Pattern Recognition, Chapter 2: Bayesian pattern classification

Lecture 7: Boltzmann distribution & Thermodynamics of mixing

k t+1 + c t A t k t, t=0

Week3, Chapter 4. Position and Displacement. Motion in Two Dimensions. Instantaneous Velocity. Average Velocity

Linear Feature Engineering 11

The Geometry of Logit and Probit

Module 3 LOSSY IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur

Game Theory. Lecture Notes By Y. Narahari. Department of Computer Science and Automation Indian Institute of Science Bangalore, India February 2008

PROBLEM SET 7 GENERAL EQUILIBRIUM

8/25/17. Data Modeling. Data Modeling. Data Modeling. Patrice Koehl Department of Biological Sciences National University of Singapore

1 The Mistake Bound Model

CS 2750 Machine Learning. Lecture 5. Density estimation. CS 2750 Machine Learning. Announcements

Computing Correlated Equilibria in Multi-Player Games

Vapnik-Chervonenkis theory

Linear Approximation with Regularization and Moving Least Squares

and problem sheet 2

10-701/ Machine Learning, Fall 2005 Homework 3

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4)

How Strong Are Weak Patents? Joseph Farrell and Carl Shapiro. Supplementary Material Licensing Probabilistic Patents to Cournot Oligopolists *

Difference Equations

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction

a b a In case b 0, a being divisible by b is the same as to say that

Physics 4B. A positive value is obtained, so the current is counterclockwise around the circuit.

Microeconomics: Auctions

University of California, Davis Date: June 22, 2009 Department of Agricultural and Resource Economics. PRELIMINARY EXAMINATION FOR THE Ph.D.

Stanford University CS359G: Graph Partitioning and Expanders Handout 4 Luca Trevisan January 13, 2011

PROBABILITY PRIMER. Exercise Solutions

Lecture Notes on Linear Regression

Exercise Solutions to Real Analysis

Numerical Heat and Mass Transfer

The exam is closed book, closed notes except your one-page cheat sheet.

Subjective Uncertainty Over Behavior Strategies: A Correction

Welfare Properties of General Equilibrium. What can be said about optimality properties of resource allocation implied by general equilibrium?

C/CS/Phy191 Problem Set 3 Solutions Out: Oct 1, 2008., where ( 00. ), so the overall state of the system is ) ( ( ( ( 00 ± 11 ), Φ ± = 1

STATISTICAL MECHANICS

Bézier curves. Michael S. Floater. September 10, These notes provide an introduction to Bézier curves. i=0

ELASTIC WAVE PROPAGATION IN A CONTINUOUS MEDIUM

Credit Card Pricing and Impact of Adverse Selection

Perfect Competition and the Nash Bargaining Solution

APPENDIX A Some Linear Algebra

Bayesian Learning. Smart Home Health Analytics Spring Nirmalya Roy Department of Information Systems University of Maryland Baltimore County

THE CHINESE REMAINDER THEOREM. We should thank the Chinese for their wonderful remainder theorem. Glenn Stevens

(1 ) (1 ) 0 (1 ) (1 ) 0

Lecture 6. 1 Introduction. 2 Existence of Nash Equilibria. Apple Client 1. Client 2. Client 3. Microsoft

Limited Dependent Variables

Global Sensitivity. Tuesday 20 th February, 2018

Chapter Newton s Method

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X

Pricing and Resource Allocation Game Theoretic Models

j) = 1 (note sigma notation) ii. Continuous random variable (e.g. Normal distribution) 1. density function: f ( x) 0 and f ( x) dx = 1

DISCRETE TIME ATTACKER-DEFENDER GAME

Inner Product. Euclidean Space. Orthonormal Basis. Orthogonal

Engineering Risk Benefit Analysis

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U)

A 2D Bounded Linear Program (H,c) 2D Linear Programming

Bezier curves. Michael S. Floater. August 25, These notes provide an introduction to Bezier curves. i=0

Problem Solving in Math (Math 43900) Fall 2013

THE ROYAL STATISTICAL SOCIETY 2006 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE

Supporting Materials for: Two Monetary Models with Alternating Markets

NP-Completeness : Proofs

Fundamental loop-current method using virtual voltage sources technique for special cases

Math 426: Probability MWF 1pm, Gasson 310 Homework 4 Selected Solutions

Chapter 7 Generalized and Weighted Least Squares Estimation. In this method, the deviation between the observed and expected values of

Expected Value and Variance

Affine transformations and convexity

Example: (13320, 22140) =? Solution #1: The divisors of are 1, 2, 3, 4, 5, 6, 9, 10, 12, 15, 18, 20, 27, 30, 36, 41,

Case A. P k = Ni ( 2L i k 1 ) + (# big cells) 10d 2 P k.

Transfer Functions. Convenient representation of a linear, dynamic model. A transfer function (TF) relates one input and one output: ( ) system

Note on EM-training of IBM-model 1

CS294 Topics in Algorithmic Game Theory October 11, Lecture 7

More metrics on cartesian products

1 Generating functions, continued

1 Matrix representations of canonical matrices

3.1 ML and Empirical Distribution

CS 798: Homework Assignment 2 (Probability)

Foundations of Arithmetic

Implementation and Detection

Tit-For-Tat Equilibria in Discounted Repeated Games with. Private Monitoring

Statistics for Economics & Business

Transcription:

CS286r Assgn One Answer Key 1 Game theory 1.1 1.1.1 Let off-equlbrum strateges also be that people contnue to play n Nash equlbrum. Devatng from any Nash equlbrum s a weakly domnated strategy. That s, any alternatve strategy s has a lower payoff n expectaton than the Nash equlbrum strategy n each stage game. Ths mples that any devaton n the supergame s also weakly domnated by playng a Nash equlbrum n each round. Snce any devaton s weakly domnated the supergame s n subgame perfect equlbrum. 1.1.2 No, these are not the only subgame perfect equlbra of supergames. Grm trgger strateges n the repeated prsoner s dlemma, for example, are subgame perfect equlbra but a Nash equlbrum s not played n each stage game. 1.2 1.2.1 Along the same lnes as the no-trade theorem, Bayesan agents cannot smultaneously both agree they wll proft n a zero-sum game, mplyng both wll ext. Wthout loss of generalty we restrct attenton to two players wth a fnte set of sgnals each. Let X be a jont dstrbuton over the sgnals and π some proposed strategy profle. We can then construct a 2-dmensonal matrx of the expected score for each player gven both sgnals, and each entry n the matrx can be fully descrbed by the expected payoff to the row player. Ths produces Left Rght Top 3-5 Bottom 4 - Each player s expected score for playng under π gven ther sgnal s then a matrx lke the sum of values n each row or the negatve sum of each column (assumng, wthout loss of generalty, that all sgnals are equally lkely. If not we can fnd the 1

greatest common dvsor and break sgnals nto equvalent parts). If the sum of a row s negatve, or sum of a column s postve, then the row (resp. column) player wll ext, so those rows and columns must be removed. Ths reduces our orgnal queston to: can there exst a real-valued matrx where every row sums to a strctly postve number and every column sums to a strctly negatve one? One last smplfyng assumpton gves us an easy answer to ths queston. Assume that the postve and negatve values both use the same real, e.g. every entry n the matrx s ether r or r. Ths s, agan, wthout loss of generalty because we can fnd a gcd and expand each row (also duplcatng and extendng columns) and preserve all sums whle growng the matrx. Then our queston becomes: can there exst a matrx wth more postve terms n each row and more negatve terms n each column? But ths s easly seen to be mpossble snce then most terms would need to be postve and most terms would need to be negatve! Thus we conclude such a matrx cannot exst, and the expected score for each player must be zero (or negatve, but then they would ext). From the technque we can deduce the extenson readly to more than two players. 1.2.2 Followng the no-trade lne of thought a market maker must subsdze a predcton market (make t a constant-sum game) to encourage ratonal agents to partcpate. Wthout a subsdy the game would be zero-sum and all agents mght ext. 2 Buldng a scorng rule 2.1 2.1.1 It s strctly convex. Consder the functon g(p + λq), then t has dervatves dg dλ = q log(p + λq ) d 2 g dλ 2 = 1 q 2 /(p + λq ) log 2 The second dervatve s strctly postve, mplyng the functon s strctly convex. 2.1.2 s(p, ) = g(p) g (p) p + g (p) = p log p p log p + log p = log p 2

2.1.3 The logarthmc scorng rule. 2.2 2.2.1 It s derved from a strctly convex functon. So followng G + R t s strctly convex. 2.2.2 2.3 2.3.1 q log p When an expert s reportng honestly (q == p) they are the same functons. Varous answers are acceptable. The most smple may be that the net score s undefned wthout a pror predcton. 2.3.2 The market maker s worst-case loss s max p log p log p 0 where p 0 s the market s ntal predcton and the logarthmc scorng rule s used. More abstractly we would replace the logarthmc functon n the above wth the scorng rule used. The worst-case loss s realzed when the last expert predcts the realzed outcome perfectly, snce experts always score more for more accurate predctons when usng a strctly proper scorng rule. To mnmmze the worst-case loss we solve sup p nf log p whch (straghtforwardly) has a soluton at the unform vector. Equvalently, the market maker mnmzes ts worst-case loss (when usng the logarthmc scorng rule) by settng ts ntal dstrbuton to an entropy maxmzng one. 2.3.3 q (log p log p ) 3

2.3.4 The score of prevous predctons n a predcton market s not affected by your own predcton. Snce they are constant and solvng an arg max expresson s ndependent of addng or subtractng constant terms, a strctly proper scorng rule maxmzes both a score and net score. 2.4 2.4.1 2.4.2 C(Y ) C(X) = b ln(e Y1/b + e Y2/b ) ln(e X1/b + e X2/b ) The mpled probablty s the prce and mutats mutand for p 2. 2.4.3 p 1 = e X1/b /(e X1/b + e X2/b ) The b parameter controls the rate of change of the market or equvalently ts lqudty. A hgher b means that trades wll have a smaller effect on prces. 2.4.4 Prces are ntrade are n a 1:1 correspondence wth the market s belef of the lkelhood. If the prces s below your beleved probablty you wll buy contracts and f t s above your belefs you wll sell t. 3 Scorng Experts The followng table constructed used Bayes s theorem wll be useful. Far F 11 1 C 3 49 FF 65 FC CF 1 CC 5 Cloudy 4 11 2 3 16 65 8 8 4 5 3.0.5 See the above table. 4

3.0.6 Wth a flat fee an expert has no mmedate ncentve to be accurate snce t s pad the same free regardless of whether t s accurate or not. 3.0. Let c be the cost of acqurng addtonal nformaton. Assume the frst sgnal s F, then the expected score maxmzng predcton s (/11, 4/11). If the true state of the world s far (whch t s 11 of the tme) and f we purchase an addtonal sgnal we get F F /10 of the tme and score log 49/65, otherwse we get F C and score log /. Otherwse, 4/11 of the tme the world s cloudy and 4/10 of the tme we get log 16/65, otherwse we get log 8/. So we compare 11 ( 49 log 10 65 + 3 10 log ) + 4 11 ( 4 16 log 10 65 + 6 10 log 8 ) 11 log 11 + 4 11 log 4 11 ) + c and puttng ths nto Mathematca gves us a value of c.043. Usng the same reasonng for the case where we get C we fnd that c.0408. So f c s less than (roughly).0408 the expert wll always purchase addtonal nformaton. 3.0.8 Gven a value for c as our cost, then we take the least c from the prevous queston (.0408) and solve b = c/c. 3.0.9 Incentves are not affected by the addtve term a snce the change n the functon s ndependent of the predcton qualty. It may, however, be useful to have a postve a to attract a partcpant. Especally snce the logarthmc scorng rule s always negatve, so extng s always a preferred strategy f a = 0. 4 Some research usng scorng rules 4.0.10 Followng the paper where n s the number of outcomes. 4.0.11 p = π u (m ) j π ju (m j ) = π j π j = 1 n The same snce the probablty dstrbuton π and the dervatve of the utlty functon are nvarant. 5

4.0.12 Boundng u below practcally means that the market maker wll not accept trades that mght cause t to lose more than 5 dollars. 4.0.13 Ths market maker has bounded loss, so n that sense t s better. It achves ths by denyng some trades, however. 4.0.14 Let y be the shares outstandng n the market, and x the shares outstandng on outcome x. Then we have that must hold for a trade to occur. (y + 20)/n x + 20 5 6