Best reply dynamics for scoring rules

Similar documents
How Bad is Selfish Voting?

Partial lecture notes THE PROBABILITY OF VIOLATING ARROW S CONDITIONS

How Bad is Selfish Voting?

Points-based rules respecting a pairwise-change-symmetric ordering

Algebraic Voting Theory

Winning probabilities in a pairwise lottery system with three alternatives

CMU Social choice 2: Manipulation. Teacher: Ariel Procaccia

Coalitional Structure of the Muller-Satterthwaite Theorem

Math for Liberal Studies

6.207/14.15: Networks Lecture 24: Decisions in Groups

Wreath Products in Algebraic Voting Theory

Nontransitive Dice and Arrow s Theorem

Convergence to Equilibria in Plurality Voting

Lecture 3: Probability

Incentive-Compatible Voting Rules with Positively Correlated Beliefs

Gibbard Satterthwaite Games

Voting. José M Vidal. September 29, Abstract. The problems with voting.

Introduction to Game Theory Lecture Note 2: Strategic-Form Games and Nash Equilibrium (2)

Multiple Equilibria in the Citizen-Candidate Model of Representative Democracy.

Algebraic Voting Theory

Ties Matter: Complexity of Voting Manipulation Revisited

Voting Handout. Election from the text, p. 47 (for use during class)

Recap Social Choice Fun Game Voting Paradoxes Properties. Social Choice. Lecture 11. Social Choice Lecture 11, Slide 1

APPLIED MECHANISM DESIGN FOR SOCIAL GOOD

Volume 31, Issue 1. Manipulation of the Borda rule by introduction of a similar candidate. Jérôme Serais CREM UMR 6211 University of Caen

Changing Perspectives

Lecture 3 : Probability II. Jonathan Marchini

Probabilistic Aspects of Voting

An NTU Cooperative Game Theoretic View of Manipulating Elections

THE PRICE OF DECEPTION IN SOCIAL CHOICE. A Dissertation Presented to The Academic Faculty. James P. Bailey

Approximation algorithms and mechanism design for minimax approval voting

Survey of Voting Procedures and Paradoxes

"Arrow s Theorem and the Gibbard-Satterthwaite Theorem: A Unified Approach", by Phillip Reny. Economic Letters (70) (2001),

MATH 19-02: HW 5 TUFTS UNIVERSITY DEPARTMENT OF MATHEMATICS SPRING 2018

Possible Winners when New Candidates Are Added: The Case of Scoring Rules

Geometry of distance-rationalization

Dependence and Independence in Social Choice Theory

Strategy-Proofness on the Condorcet Domain

Introduction to Formal Epistemology Lecture 5

Repeated Downsian Electoral Competition

Game Theory and Algorithms Lecture 2: Nash Equilibria and Examples

Combining Voting Rules Together

Nash implementable domains for the Borda count

6.254 : Game Theory with Engineering Applications Lecture 8: Supermodular and Potential Games

CMU Social choice: Advanced manipulation. Teachers: Avrim Blum Ariel Procaccia (this time)

Permutation. Permutation. Permutation. Permutation. Permutation

Complexity of Shift Bribery in Hare, Coombs, Baldwin, and Nanson Elections

Mechanism Design for Bounded Agents

Truth-revealing voting rules for large populations

Algorithmic Game Theory Introduction to Mechanism Design

Mechanism Design Tutorial David C. Parkes, Harvard University Indo-US Lectures Week in Machine Learning, Game Theory and Optimization

FINITE MEMORY DETERMINACY

Iterated Strict Dominance in Pure Strategies

University of Technology, Building and Construction Engineering Department (Undergraduate study) PROBABILITY THEORY

Voting and Mechanism Design

Chapter 1. The Mathematics of Voting

4/17/2012. NE ( ) # of ways an event can happen NS ( ) # of events in the sample space

EC3224 Autumn Lecture #03 Applications of Nash Equilibrium

2005 Hypatia Contest

Dominance Solvable Approval Voting Games

CHAPTER 5: PROBABILITY, COMBINATIONS & PERMUTATIONS

1 The General Definition

MS&E 246: Lecture 4 Mixed strategies. Ramesh Johari January 18, 2007

Approval Voting for Committees: Threshold Approaches

Stackelberg Voting Games: Computational Aspects and Paradoxes

Efficient Algorithms for Hard Problems on Structured Electorates

Game Theory for Linguists

How Credible is the Prediction of a Party-Based Election?

Maximization of Relative Social Welfare on Truthful Cardinal Voting Schemes

10.1. Randomness and Probability. Investigation: Flip a Coin EXAMPLE A CONDENSED LESSON

Computational Aspects of Strategic Behaviour in Elections with Top-Truncated Ballots

Participation Incentives in Randomized Social Choice

1 Quick Sort LECTURE 7. OHSU/OGI (Winter 2009) ANALYSIS AND DESIGN OF ALGORITHMS

Social Dichotomy Functions (extended abstract)

1. Groups Definitions

Multivariate Complexity of Swap Bribery

Intro Prefs & Voting Electoral comp. Political Economics. Ludwig-Maximilians University Munich. Summer term / 37

Incentive-compatible voting rules with positively correlated beliefs

NOTE. A 2 2 Game without the Fictitious Play Property

CMPSCI 240: Reasoning about Uncertainty

Complexity of unweighted coalitional manipulation under some common voting rules

Mechanism Design for Multiagent Systems

Condorcet Efficiency: A Preference for Indifference

Social Choice. Jan-Michael van Linthoudt

Trembling Hand Equilibria of Plurality Voting

Online Learning, Mistake Bounds, Perceptron Algorithm

Voting Theory: Pairwise Comparison

Learning foundation and equilibrium selection in voting environments with private information

epub WU Institutional Repository

Abstract Arrowian Aggregation *

Leveling the Playing Field:

APPLIED MECHANISM DESIGN FOR SOCIAL GOOD

arxiv: v1 [cs.gt] 1 May 2017

THREE BRIEF PROOFS OF ARROW S IMPOSSIBILITY THEOREM JOHN GEANAKOPLOS COWLES FOUNDATION PAPER NO. 1116

Non compensatory Multicriteria Methods

CONNECTING PAIRWISE AND POSITIONAL ELECTION OUTCOMES

Computational Game Theory Spring Semester, 2005/6. Lecturer: Yishay Mansour Scribe: Ilan Cohen, Natan Rubin, Ophir Bleiberg*

Ties Matter: Complexity of Manipulation when Tie-breaking with a Random Vote

The Best Way to Choose a Winner

Minimal manipulability: anonymity and unanimity

Transcription:

Best reply dynamics for scoring rules R. Reyhani M.C Wilson Department of Computer Science University of Auckland 3rd Summer workshop of CMSS, 20-22 Feb 2012

Introduction Voting game Player (voter) action is to submit an expressed vote (possibly different from its sincere preference). We are interested in the equilibrium result to predict the winner of election. Gibbard-Satterthwaite and other theorems show that dominant strategies don t always exist. The common solution concept is Nash Equilibrium (NE). Far too many NE exist and some of them are trivial. Also, voters cannot coordinate on one equilibrium.

Dynamic process of convergence We can use learning models or dynamic process (iterative games) as a coordination device Fudenberg and Levine (1998): In some cases, most learning models do not converge to any equilibrium and just coincide with the notion of rationalizability. However, if it converges, it necessarily finds a NE. It has application for voting in finding consensus: doodle.com, etc.

Best reply dynamics In the voting case it has only been used by Meir et al, for best-reply dynamics (BRD) as far as we know (AAAI 2010). The first player moves, then another one responds, etc Players move one at a time and in each move, they do a best reply. Myopic moves, no communication between players and zero knowledge of others.

Basic setup We have a set C of alternatives (candidates) and set V of voters, with m := C, n := V. Each voter v submits a permutation L(v) of the candidates. This defines the set T of types, and T = m!. A profile is a function V T. A voting situation is a multiset from T with total weight n. The scoring rule determined by a vector w with w 1 w 2 w m 1 w m assigns the score c := t T {v V L(v) = t} w L(v) 1 (c). Special cases: plurality: w = (1, 0, 0,..., 0); antiplurality (veto): w = (1, 1,..., 1, 0); Borda: w = (m 1, m 2,..., 1, 0).

An example of BRD Consider antiplurality system with 2 voters V = {1, 2} and 4 candidates C = {a, b, c, d}, alphabetically tie-breaking. The sincere profile is P 0 = (acbd, bacd). If voters start from sincere state, we have ( d, d){a} 2 ( d, a){b} 1 ( b, a){c} 2 ( b, c){a} The best reply is not unique. For example, the last move by second player can instead be d. However, c (vetoing the current winner) is what we call Restricted Best Reply (RBR) for antiplurality.

Summary of Meir et al Plurality voting Other assumptions: Behaviour : RBR at each step or best reply. Indifference: keep last move or truth-biased. Initial state: sincere profile or arbitrary profile. Tiebreaking: deterministic or uniform random. Voters: unweighted or weighted. Convergence for plurality under red hypotheses in at most m 2 n 2 steps.

Summary of Meir et al Results: Deterministic tiebreaking from an arbitrary initial state converges for unweighted voters. The winner is the sincere winner or a candidate at most 1 point behind initially. In each case, changing each red hypothesis and keeping the others yields examples of non-convergence.

Example of cycle in BRD Example Consider the sincere profile P 0 = (abc, bca) and voting rule Borda and alphabetical tie-breaking. (abc, bca){b} 1 (acb, bca){a} 2 (acb, cba){c} 1 (abc, cba){a} 2 (abc, bca){b}.

Restricted Best Reply Dynamics for antiplurality Best reply is not unique, because several preference orders may yield the same result. Is there a natural restricted best reply which is unique? One answer can be the best reply that maximizes the winning score margin of the new winner.

Restricted Best Reply Dynamics for antiplurality Without loss of generality, we can assume that S i = { c c C} o t is the winner set after the move of player i at time t and is the current winner set. Best reply: a b where b a. Type 1: a / o t and b o t 1 Type 2: a o t and b / o t 1 Type 3: a o t and b o t 1 For plurality and antiplurality, allowing type 2 moves can lead to a cycle. We call type 1 and 3 as RBR. In randomized tie-breaking, the player vetoes the least preferred member of o t 1.

RBRD for antiplurality Set of potential winners Definition (set of potential winners) The set of potential winners at time t, W t, those candidates who have a chance of winning at the next step (time t + 1), depending on the different RBR of voters. W t = {c if a player moves c b at time t + 1, then c o t+1 }

RBRD for antilurality Alphabetical tie-breaking Lemma If t < t then W t W t. Proof. Let c W t 1, and we have an improvement step a b at time t. Then we show c W t. If it is type 3, it is easy to show c a makes c winner. If it is a type 1, let b = o t. Note that b / {a, b}. Then we show c b makes c winner. It uses the transitivity of lexicographic order which may not be true for an arbitrary deterministic tie-breaking rule.

RBRD for antilurality Alphabetical tie-breaking Lemma Each voter has at most one type 1 move and at most m 1 moves of type 3. Proof. 1 Suppose a step a i b is a type 1 move by voter i at time t. With proof by contradiction, we show this improvement step is the first improvement step of voter i. 2 as at every step a i b of type 3, it must hold that a i b because of the definition of improvement step. Conclusion: RBRD for G(V, C, A) with alphabetical tie-breaking will converge to a NE from any state in at most mn steps.

RBRD for antilurality Randomized tie-breaking Definition (Stochastic dominance improvement step) Voter v prefers an outcome with winner set W to an outcome with winner set W is preferred to W if and only if for each k = 1 m, the probability of electing one of the first k candidates given outcome W should be no less than given W. Lemma If t < t then W t W t.

RBRD for antilurality Randomized tie-breaking Lemma Each voter has at most one type 1 move and at most m 1 moves of type 3. Proof. For type 1 move, similarly proof by contradiction. For type 3 move a b, we show the probability of winning of a has increased and b has decreased. Therefore, a i b. Conclusion: Stochastic dominance RBRD for G(V, C, A) with randomized tie-breaking will converge to a NE from any state in at most mn steps.

RBRD for plurality Type 1: a / o t 1 and b o t Type 3: a o t 1 and b o t W t = {c if a player moves a c and a o t 1 then c o t } Lemma If t < t then W t W t. Lemma The number of type 1 moves are at most m and each voter has at most m 1 moves of type 3. Theorem RBRD G(V, C, P), will converge to a NE from any state in at most m + (m 1)n steps.

Cycles for more general scoring rules Cycle for scoring rules close to Plurality: Suppose we have 3 candidates a, b and c and P 0 = (abc, bca). The scoring rule is w = (1, α, 0);α 1 2 and we use alphabetical tie-breaking. (abc, bca){b} 1 (acb, bca){a} 2 (acb, cba){c} 1 (abc, cba){a} 2 (abc, bca){b} general m and n = 2 (ab c, bc a){b} 1 (a cb, bc a){a} 2 (a cb, cb a){c} 1 (ab c, cb a){a} 2 (ab c, bc a){b}

Cycles for more general scoring rules Cycle for scoring rules close to antiplurality: m = 3, n = 4 Suppose we have 3 candidates a, b and c. The sincere profile is P 0 = (abc, bac, cab, bca). Our scoring rule is (1, α, 0); 1 2 α < 1 with alphabetical tie-breaking. (abc, bac, cab, bca){b} 1 (acb, bac, cab, bca){a} 4 (acb, bac, cab, cba){c} 1 (abc, bac, cab, cba){a} 4 (abc, bac, cab, bca){b}

Order of players matters Consider Borda system with 4 voters and 3 candidates, P 0 = (acb, acb, cab, cba) and alphabetically tie-breaking (acb, acb, cab, cba){c} 1 (abc, acb, cab, cba){a} 3 (abc, acb, cba, cba){c} 2 (abc, abc, cba, cba){a} 4 (abc, abc, cba, bca){b} 1 (acb, abc, cba, bca){1} 4 (acb, abc, cba, cba){c} 1 (abc, abc, cba, bca) P is the same as P after fourth move and the cycle starts. Now let s consider another order for the players. (acb, acb, cba, cab){c} 1 (abc, acb, cba, cab){a} 4 (abc, acb, cba, cba){c} 2 (abc, abc, cba, cba){a} 3 (abc, abc, bca, cba){b} 4 (abc, abc, bca, cab){a}

Conclusion The possibility of winning of a candidate depends on the type of improvement step and also candidate s priority in tie-breaking (max d = 2) The number of type 2 moves are not bounded, so we need to use RBR for convergence. We need to use stochastic dominance RBR for randomized tie-breaking. The results of convergence do not happen for a non-linear deterministic tie-breaking rule The order of players matters in convergence, equilibrium result and also speed of convergence.