Observations on Cooperation

Similar documents
Observations on Cooperation

Social Learning and the Shadow of the Past

Evolution & Learning in Games

First Prev Next Last Go Back Full Screen Close Quit. Game Theory. Giorgio Fagiolo

BELIEFS & EVOLUTIONARY GAME THEORY

Extensive Form Games with Perfect Information

A Folk Theorem For Stochastic Games With Finite Horizon

Lecture Notes on Game Theory

Costly Expertise. Dino Gerardi and Leeat Yariv yz. Current Version: December, 2007

ECO421: Reputation. Marcin P ski. March 29, 2018

Bounded Rationality Lecture 2. Full (Substantive, Economic) Rationality

Belief-based Learning

1 Games in Normal Form (Strategic Form)

C31: Game Theory, Lecture 1

by burning money Sjaak Hurkens y CentER, Tilburg University P.O. Box November 1994 Abstract

A (Brief) Introduction to Game Theory

Reciprocity and Trust under an Evolutionary Context

1. Introduction. 2. A Simple Model

Quantum Games Have No News for Economists 1

SONDERFORSCHUNGSBEREICH 504

University of Warwick, Department of Economics Spring Final Exam. Answer TWO questions. All questions carry equal weight. Time allowed 2 hours.

Cooperation Stimulation in Cooperative Communications: An Indirect Reciprocity Game

Spatial Economics and Potential Games

Equilibrium Refinements

Community Enforcement beyond the Prisoner s Dilemma

6 The Principle of Optimality

Informed Principal in Private-Value Environments

Definitions and Proofs

Computation of Efficient Nash Equilibria for experimental economic games

Prisoner s Dilemma. Veronica Ciocanel. February 25, 2013

Understanding and Solving Societal Problems with Modeling and Simulation

ECO421: Communication

Area I: Contract Theory Question (Econ 206)

Variants of Nash Equilibrium. CMPT 882 Computational Game Theory Simon Fraser University Spring 2010 Instructor: Oliver Schulte

Mathematical Economics - PhD in Economics

Preliminary Results on Social Learning with Partial Observations

Wars of Attrition with Budget Constraints

The Scope of Cooperation: Values and Incentives

Game Theory. Solutions to Problem Set 4

When to Ask for an Update: Timing in Strategic Communication

Virtual Robust Implementation and Strategic Revealed Preference

Basic Game Theory. Kate Larson. January 7, University of Waterloo. Kate Larson. What is Game Theory? Normal Form Games. Computing Equilibria

Game Theory and Rationality

Puri cation 1. Stephen Morris Princeton University. July Economics.

When to Ask for an Update: Timing in Strategic Communication. National University of Singapore June 5, 2018

Iterated Strict Dominance in Pure Strategies

Online Appendices for Large Matching Markets: Risk, Unraveling, and Conflation

Asymmetric Social Norms

Game Theory, Population Dynamics, Social Aggregation. Daniele Vilone (CSDC - Firenze) Namur

EVOLUTIONARY GAMES WITH GROUP SELECTION

Introduction to game theory LECTURE 1

6.207/14.15: Networks Lecture 16: Cooperation and Trust in Networks

Bayes Correlated Equilibrium and Comparing Information Structures

General idea. Firms can use competition between agents for. We mainly focus on incentives. 1 incentive and. 2 selection purposes 3 / 101

On the Unique D1 Equilibrium in the Stackelberg Model with Asymmetric Information Janssen, M.C.W.; Maasland, E.

Coordinating Expectations: Global Games with Strategic Substitutes

Game Theory: Spring 2017

Nonlinear Dynamics between Micromotives and Macrobehavior

REPEATED GAMES. Jörgen Weibull. April 13, 2010

Supplementary appendix to the paper Hierarchical cheap talk Not for publication

Perfect Bayesian Equilibrium

Conservative Belief and Rationality

The ambiguous impact of contracts on competition in the electricity market Yves Smeers

9 A Class of Dynamic Games of Incomplete Information:

CS 798: Multiagent Systems

Introduction. 1 University of Pennsylvania, Wharton Finance Department, Steinberg Hall-Dietrich Hall, 3620

Multiagent Systems Motivation. Multiagent Systems Terminology Basics Shapley value Representation. 10.

Mechanism Design: Implementation. Game Theory Course: Jackson, Leyton-Brown & Shoham

Chapter 7. Evolutionary Game Theory

UC Berkeley Haas School of Business Game Theory (EMBA 296 & EWMBA 211) Summer 2016

ONLINE APPENDICES FOR INCENTIVES IN EXPERIMENTS: A THEORETICAL INVESTIGATION BY AZRIELI, CHAMBERS & HEALY

Imitation Processes with Small Mutations

SEQUENTIAL EQUILIBRIA IN BAYESIAN GAMES WITH COMMUNICATION. Dino Gerardi and Roger B. Myerson. December 2005

Cooperation in Social Dilemmas through Position Uncertainty

Persuasion Under Costly Lying

For general queries, contact

Quantum Games. Quantum Strategies in Classical Games. Presented by Yaniv Carmeli

Guilt aversion in trust games

Evolution Through Imitation in a. Single Population 1

CHARACTERIZATIONS OF STABILITY

Ex Post Cheap Talk : Value of Information and Value of Signals

Suggested solutions to the 6 th seminar, ECON4260

Prisoner s Dilemma with Talk

Optimal Income Taxation and Public-Goods Provision with Preference and Productivity Shocks

Learning by (limited) forward looking players

The Robustness of Repeated Game Equilibria to Incomplete Payoff Information

Uncertainty and Disagreement in Equilibrium Models

Trust and Trustworthiness in an Economy with Heterogeneous Individuals

EC3224 Autumn Lecture #04 Mixed-Strategy Equilibrium

Game Theory -- Lecture 4. Patrick Loiseau EURECOM Fall 2016

Learning Equilibrium as a Generalization of Learning to Optimize

STOCHASTIC STABILITY OF GROUP FORMATION IN COLLECTIVE ACTION GAMES. Toshimasa Maruta 1 and Akira Okada 2

Simultaneous Choice Models: The Sandwich Approach to Nonparametric Analysis

EVOLUTIONARY STABILITY FOR TWO-STAGE HAWK-DOVE GAMES

Deceptive Advertising with Rational Buyers

1 Extensive Form Games

Other Equilibrium Notions

Endowment as a Blessing

arxiv: v1 [physics.soc-ph] 3 Nov 2008

Non-zero-sum Game and Nash Equilibarium

Transcription:

Introduction Observations on Cooperation Yuval Heller (Bar Ilan) and Erik Mohlin (Lund) PhD Workshop, BIU, January, 2018 Heller & Mohlin Observations on Cooperation 1 / 20

Introduction Motivating Example Alice interacts with a remote trader, Bob. Both agents have opportunities to shirk/cheat. Alice obtains anecdotal evidence about Bob's past behavior. Alice considers this information when deciding how to act. Alice is unlikely to interact with Bob again. Future partners may ask Bob about Alice's behavior. Heller & Mohlin Observations on Cooperation 2 / 20

Introduction Motivating Example Alice interacts with a remote trader, Bob. Both agents have opportunities to shirk/cheat. Alice obtains anecdotal evidence about Bob's past behavior. Alice considers this information when deciding how to act. Alice is unlikely to interact with Bob again. Future partners may ask Bob about Alice's behavior. Important economic interactions: Hunter-gatherer societies, medieval trade (e.g, Greif, 94)), modern face-to-face interactions (e.g., Bernstein, 1992; Dixit 2003), the sharing economy & online trade (e.g., ebay, Uber, Airbnb). Heller & Mohlin Observations on Cooperation 2 / 20

Introduction Underlying Game: The Prisoner's Dilemma (PD) c d g > 0 - gain of a greedy player. c 1 1 l 1+g l > 0 - loss if the partner defects. d 1+g l 0 0 Heller & Mohlin Observations on Cooperation 3 / 20

Introduction Brief Summary of 1 Novel simple behavior supports stable cooperation. (uniqueness in the restricted set of stationary strategies). 2 Stable cooperation requires observation of 2+ of interactions 3 Observation of partner's past actions: g > l: Only defection is stable. g < l: Cooperation is stable (and robust to any noise). Heller & Mohlin Observations on Cooperation 4 / 20

Introduction Brief Summary of 1 Novel simple behavior supports stable cooperation. (uniqueness in the restricted set of stationary strategies). 2 Stable cooperation requires observation of 2+ of interactions 3 Observation of partner's past actions: g > l: Only defection is stable. g < l: Cooperation is stable (and robust to any noise). 4 Observation of action proles: Cooperation is stable i g < l+1 2. 5 Optimal feedback: Observing partner's actions against cooperation. Heller & Mohlin Observations on Cooperation 4 / 20

Introduction Observation Structure and Environment Strategies and Steady States Solution Concept Observation Structure Basic stationary model (focus of the presentation): Each player privately observes a sample of k actions played by his partner (against other opponents). Agents are restricted to stationary strategies. IID sampling from the partner's stationary behavior. Set of possible signals - m {0,1,2,...,k} (interpreted as the number of observed defections). Heller & Mohlin Observations on Cooperation 5 / 20

Introduction Observation Structure and Environment Strategies and Steady States Solution Concept Observation Structure Basic stationary model (focus of the presentation): Each player privately observes a sample of k actions played by his partner (against other opponents). Agents are restricted to stationary strategies. IID sampling from the partner's stationary behavior. Set of possible signals - m {0,1,2,...,k} (interpreted as the number of observed defections). Alternative model (in the paper): Unrestricted set of strategies, observing the last k actions. All the results hold except uniqueness. Heller & Mohlin Observations on Cooperation 5 / 20

Stationary Strategies Denition (Strategy - s : {0,...,k} ({c,d})) Mapping assigning a mixed action for each possible observation. Interpretation: The agent's behavior conditional on the observed signal. Strategy distribution Distribution σ over the set of strategies (with a nite support). Interpretation: Heterogeneous population.

Stationary Strategies Denition (Strategy - s : {0,...,k} ({c,d})) Mapping assigning a mixed action for each possible observation. Interpretation: The agent's behavior conditional on the observed signal. Strategy distribution Distribution σ over the set of strategies (with a nite support). Interpretation: Heterogeneous population. Example of a Strategy Distribution supp (σ) = {s u,s 1,s 2 } σ (s u ) = ε, σ (s 1 ) = 1 ε, σ (s 6 2) = 5 (1 ε) 6 c m = 0 c m 1 s u 50%, s 1 (m) = s 2 (m) = d m 1, d m 2

Consistent Signal Prole Denition (Signal prole - θ : supp (σ) (M)) θ (s) is interpreted as the distribution of signals observed by agents who are matched with a partner who plays strategy s. Denition (Consistent signal prole ) Signal prole θ and strategy distribution σ jointly induce a behavior prole: a distribution of actions for each strategy. The behavior prole induce a signal prole of observed actions. Consistency: The induced signal prole is θ. Denition (Steady state (σ, θ)) Pair consisting of a strategy distribution and a consistent signal prole.

Commitment Strategies (Crazy Agents) As argued by Ellison (1994), the assumption that all agents are rational and, in equilibrium, best reply to what everyone else is doing is fairly implausible in large populations. We rene our solution concept by requiring robustness to the presence of a few crazy agents (à la Kreps et al., 1982).

Commitment Strategies (Crazy Agents) As argued by Ellison (1994), the assumption that all agents are rational and, in equilibrium, best reply to what everyone else is doing is fairly implausible in large populations. We rene our solution concept by requiring robustness to the presence of a few crazy agents (à la Kreps et al., 1982). Denition (Distribution of Commitments (S c,λ)) S c is a nite set of commitment strategies, and λ (S c ) is a distribution over these strategies. We assume that at least one of the commitment strategies is totally mixed.

Introduction Observation Structure and Environment Strategies and Steady States Solution Concept Nash in Perturbed Environment Denition (Perturbed Environment ((S C,λ),ε n )) A fraction ε n of committed agents plays a strategy according to λ (S C ). Heller & Mohlin Observations on Cooperation 9 / 20

Introduction Observation Structure and Environment Strategies and Steady States Solution Concept Nash in Perturbed Environment Denition (Perturbed Environment ((S C,λ),ε n )) A fraction ε n of committed agents plays a strategy according to λ (S C ). Denition (Nash equilibrium in a perturbed environment - (σn,θ n )) A normal agent cannot improve his payo by deviating. Formally: π (σn,θn ) π s (σn,θn ) for every strategy s, where π (σn,θn ) denotes the mean payo of the 1 ε n normal agents, and π s (σn,θn ) denotes the payo to strategy s, against a population who plays according to (σn,θn ). Heller & Mohlin Observations on Cooperation 9 / 20

Introduction Observation Structure and Environment Strategies and Steady States Solution Concept Perfect Equilibrium Denition (Perfect equilibrium (σ,θ )) The limit of Nash equilibria in a converging sequence of perturbed environments. (i.e., (S C,λ,(ε n ) n ), s.t. ((σ n,θ n ) εn 0 (σ,θ ))) Heller & Mohlin Observations on Cooperation 10 / 20

Introduction Taxonomy of PDs Observation of Actions Other Observation Structures Heller & Mohlin Observations on Cooperation 11 / 20

Prisoner's Dilemma - Taxonomy Oensive (submodular, Takahashi, 2010) PD l < g: stronger incentive to defect against a cooperative partner than against a defective partner. Defensive (supermodular) PD l > g. c d g > 0 - gain of a greedy player. c 1 1 l 1+g l > 0 - loss if the partner defects. d 1+g l 0 0

Prisoner's Dilemma - Taxonomy Oensive (submodular, Takahashi, 2010) PD l < g: stronger incentive to defect against a cooperative partner than against a defective partner. Defensive (supermodular) PD l > g. c d g > 0 - gain of a greedy player. c 1 1 l 1+g l > 0 - loss if the partner defects. d 1+g l 0 0

Introduction Taxonomy of PDs Observation of Actions Other Observation Structures Stable Defection in any PD Claim Defection is perfect equilibrium outcome in any PD. Heller & Mohlin Observations on Cooperation 13 / 20

Defection is the Unique Outcome in Oensive PDs Proposition Assume an oensive PD (l < g) with observation of any number of actions. If (σ,θ ) is a perfect equilibrium then everyone defects.

Defection is the Unique Outcome in Oensive PDs Proposition Assume an oensive PD (l < g) with observation of any number of actions. If (σ,θ ) is a perfect equilibrium then everyone defects. Intuition: Assume to the contrary that normal agents sometimes cooperate. Direct gain from defecting decreases in the partner's prob. of defection. The indirect loss is independent of the current partner's behavior. Incumbents are less likely to defect when observing more defections. If Alice always defects, she outperforms the incumbents.

Stable Cooperation in Defensive PD Proposition Assume g l and observing k 2 actions. Cooperation is strictly perfect. Moreover there is essentially a unique strategy distribution that supports cooperation.

Stable Cooperation in Defensive PD Proposition Assume g l and observing k 2 actions. Cooperation is strictly perfect. Moreover there is essentially a unique strategy distribution that supports cooperation. Essentially Unique Stable State Everyone cooperates when observing no defections. Sketch of proof Everyone defects when observing 2 defections. 0 < q < 1 k of the incumbents defect when observing 1 defection (i.e., q of the agents follow s 1, and the remaining 1 q follow s 2 ). The value of q balances the payo of s 1 and s 2. The value of q depends on the commitment strategies.

Introduction Taxonomy of PDs Observation of Actions Other Observation Structures Other Observation Structures What happens if the signal about the partner depends also on the behavior of other opponents against her? We study three observation structures: 1 The entire action prole. Signals:{CC, DC, CD, DD}. 2 Mutual cooperation or not (=conict). Signals:{CC, not CC}. 3 Observing actions against cooperation. Signals:{CC, DC,?D}. (1)+(2): Cooperation is a perfect equilibrium outcome i g < l+1 2. (3): Cooperation is a perfect equilibrium outcome for all Heller & Mohlin Observations on Cooperation 16 / 20

Introduction Equilibrium Renements Related Literature and Contribution Conclusion Equilibrium Renements In the paper we show that all of our equilibria satisfy two additional requirement: 1 Strict perfection - the equilibrium holds regardless of how the crazy agents behave. 2 Evolutionary stability (a la Maynard-Smith & Price, 1973). Small group of agents who deviate together are outperformed. 3 Robustness to small perturbations of the signal prole. The population converges back to steady state (σ,θ ), from any nearby (inconsistent) state (σ,θ). Heller & Mohlin Observations on Cooperation 17 / 20

Related literature (Partial List): Community Enforcement 1 Contagious equilibria (e.g., Kandori 1992; Ellison, 1994). 2 Applications of belief-free equilibria (Takahashi, 10; Deb, 12). 3 Image scoring (e.g., Nowak & Sigmund, 98). 4 Exogenous reputation mechanisms (e.g., Sugden, 86; Kandori, 92). 5 Structured populations (Cooper & Wallace, 04; Alger & Weibull, 13). 6 Observation of preferences (e.g., Dekel et al., 07; Herold, 12).

Related literature (Partial List): Community Enforcement 1 Contagious equilibria (e.g., Kandori 1992; Ellison, 1994). 2 Applications of belief-free equilibria (Takahashi, 10; Deb, 12). 3 Image scoring (e.g., Nowak & Sigmund, 98). 4 Exogenous reputation mechanisms (e.g., Sugden, 86; Kandori, 92). 5 Structured populations (Cooper & Wallace, 04; Alger & Weibull, 13). 6 Observation of preferences (e.g., Dekel et al., 07; Herold, 12). We show that a plausible requirement of robustness to a few crazy agents qualitatively changes the analysis.

Introduction Equilibrium Renements Related Literature and Contribution Conclusion Directions for Future Research Experiment to test the model's predictions. Realistic, yet tractable, model of online feedback. Applying the methodology to other underlying games. Heller & Mohlin Observations on Cooperation 19 / 20

Introduction Equilibrium Renements Related Literature and Contribution Conclusion Conclusion Introducing robustness against few crazy agents into the setup of community enforcement (Prisoner's Dilemma with random matching). 1 Unique novel simple behavior supports stable cooperation. 2 Stable cooperation requires observation of 2+ interactions 3 Observation of partner's past actions: g > l: Only defection is stable. g < l: Cooperation is stable. 4 Observation of action proles: Cooperation is stable i g < l+1 2. 5 Providing more information may harm cooperation. Heller & Mohlin Observations on Cooperation 20 / 20

Summary of - When is Cooperation Stable? Category of PD Actions Conicts Action proles Action against Coop. Mild (g < l+1 2 ) Defen. Oen. Y N Acute Defen. Y (g > l+1 2 ) Oen. N Stable cooperation requires observation of 2+ interactions. Y N Y N Y Observing a single interaction: Cooperation is not stable if g > 1.

Introduction Equilibrium Renements Related Literature and Contribution Conclusion Backup Slides Heller & Mohlin Observations on Cooperation 22 / 20

Sketch of Proof. η c Everyone cooperates when observing no defections.

Sketch of Proof. η c Everyone cooperates when observing no defections. 0 < q < 1 of the incumbents cooperate when observing 1 defection: If everyone cooperates when m = 1: If Alice deviates and defects with probability δ << 1 when observing m = 0, then she outperforms the incumbents (indirect loss is O ( δ 2) ). If everyone defects when m = 1: The unique consistent behavior is η d (each defection induces k 1 additional defections). Both actions must be best replies when m = 1.

Sketch of Proof. η c Everyone cooperates when observing no defections. 0 < q < 1 of the incumbents cooperate when observing 1 defection: If everyone cooperates when m = 1: If Alice deviates and defects with probability δ << 1 when observing m = 0, then she outperforms the incumbents (indirect loss is O ( δ 2) ). If everyone defects when m = 1: The unique consistent behavior is η d (each defection induces k 1 additional defections). Both actions must be best replies when m = 1. Everyone defects when observing m 2: The gain from defecting is increasing in the probability that the partner is going to defect. Defection is the unique best reply when observing m 2.

Sketch of Proof (continued). Back Let δ << 1 be the average probability of observing m = 1. Let Pr(d m = 1) be the probability that the partner is going to defect conditional on observing m = 1. The direct gain from defecting when observing m = 1 is: δ ((l Pr(d m = 1)) + g Pr (c m = 1)) < δ (l + 1).

Sketch of Proof (continued). Back Let δ << 1 be the average probability of observing m = 1. Let Pr(d m = 1) be the probability that the partner is going to defect conditional on observing m = 1. The direct gain from defecting when observing m = 1 is: δ ((l Pr(d m = 1)) + g Pr (c m = 1)) < δ (l + 1). The indirect loss from defecting when m = 1 is: q (k δ) (l + 1) + O ( δ 2). Unique 0 < q < 1 k balances the gain and the indirect loss. q is robust to perturbations (like the mixed eq. in Hawk-Dove).

Inuence of Cheap Talk Introducing cheap-talk with unrestricted language destabilize the perfect equilibrium in which everyone defects. Experimenting agents use a secret handshake to cooperate among themselves (Robson, 1990). Implications (observation of actions + cheap-talk): Defensive PD - Only the cooperative equilibrium is stable. Oensive PD - No stable equilibrium. The population state cycles between the defective and the cooperative equilibrium (as in the one-shot PD, see Wiseman & Yilankaya, 2001).

Introduction Equilibrium Renements Related Literature and Contribution Conclusion Steady States and Payos - Details Back Fact (standard xed point argument) Each strategy distribution admits a consistent behavior (not necessarily unique). Example (k = 3; Each agent plays the mode (frequently observed action).) 3 consistent behaviors: full cooperation, no cooperation, uniform mixing. The Payo of each incumbent strategy (s supp (σ)) and the average payo in the population are dened in a standard way. π s (σ,η) = s σ (s ) π (η s (s ),η s (s)), π (σ,η) = s supp(σ) σ (s) π s (σ,η). Heller & Mohlin Observations on Cooperation 26 / 20

Introduction Equilibrium Renements Related Literature and Contribution Conclusion Illustration of Stable Cooperation in Defensive PD Heller & Mohlin Observations on Cooperation 27 / 20

Introduction Equilibrium Renements Related Literature and Contribution Conclusion Illustration of Unstable Cooperation in Oensive PD Heller & Mohlin Observations on Cooperation 28 / 20