arxiv:math/ v1 [math.co] 22 Jul 2005

Similar documents
Sequence Analysis, WS 14/15, D. Huson & R. Neher (this part by D. Huson) February 5,

ALGEBRA REVIEW. MULTINOMIAL An algebraic expression consisting of more than one term.

Polygonal Designs: Existence and Construction

arxiv: v2 [math.co] 8 Mar 2018

On Certain C-Test Words for Free Groups

CSE525: Randomized Algorithms and Probabilistic Analysis May 16, Lecture 13

THE AVERAGE NORM OF POLYNOMIALS OF FIXED HEIGHT

MULTIPLAYER ROCK-PAPER-SCISSORS

List Scheduling and LPT Oliver Braun (09/05/2017)

The Weierstrass Approximation Theorem

Birthday Paradox Calculations and Approximation

13.2 Fully Polynomial Randomized Approximation Scheme for Permanent of Random 0-1 Matrices

Combining Classifiers

16 Independence Definitions Potential Pitfall Alternative Formulation. mcs-ftl 2010/9/8 0:40 page 431 #437

a a a a a a a m a b a b

Deflation of the I-O Series Some Technical Aspects. Giorgio Rampa University of Genoa April 2007

Closed-form evaluations of Fibonacci Lucas reciprocal sums with three factors

Module #1: Units and Vectors Revisited. Introduction. Units Revisited EXAMPLE 1.1. A sample of iron has a mass of mg. How many kg is that?

A Note on Scheduling Tall/Small Multiprocessor Tasks with Unit Processing Time to Minimize Maximum Tardiness

In this chapter, we consider several graph-theoretic and probabilistic models

Midterm 1 Sample Solution

Curious Bounds for Floor Function Sums

Lectures 8 & 9: The Z-transform.

What is Probability? (again)

SUR LE CALCUL DES PROBABILITÉS

New upper bound for the B-spline basis condition number II. K. Scherer. Institut fur Angewandte Mathematik, Universitat Bonn, Bonn, Germany.

Chapter 5, Conceptual Questions

Ocean 420 Physical Processes in the Ocean Project 1: Hydrostatic Balance, Advection and Diffusion Answers

arxiv: v1 [math.nt] 14 Sep 2014

Model Fitting. CURM Background Material, Fall 2014 Dr. Doreen De Leon

EXPLICIT CONGRUENCES FOR EULER POLYNOMIALS

Probability and Stochastic Processes: A Friendly Introduction for Electrical and Computer Engineers Roy D. Yates and David J.

Measures of average are called measures of central tendency and include the mean, median, mode, and midrange.

Solutions of some selected problems of Homework 4

ORIGAMI CONSTRUCTIONS OF RINGS OF INTEGERS OF IMAGINARY QUADRATIC FIELDS

26 Impulse and Momentum

Chapter 6: Economic Inequality

R. L. Ollerton University of Western Sydney, Penrith Campus DC1797, Australia

USEFUL HINTS FOR SOLVING PHYSICS OLYMPIAD PROBLEMS. By: Ian Blokland, Augustana Campus, University of Alberta

A Simple Regression Problem

ON SEQUENCES OF NUMBERS IN GENERALIZED ARITHMETIC AND GEOMETRIC PROGRESSIONS

The Frobenius problem, sums of powers of integers, and recurrences for the Bernoulli numbers

Lesson 24: Newton's Second Law (Motion)

Ştefan ŞTEFĂNESCU * is the minimum global value for the function h (x)

A1. Find all ordered pairs (a, b) of positive integers for which 1 a + 1 b = 3

MA304 Differential Geometry

A Self-Organizing Model for Logical Regression Jerry Farlow 1 University of Maine. (1900 words)

Uniform Approximation and Bernstein Polynomials with Coefficients in the Unit Interval

KONINKL. NEDERL. AKADEMIE VAN WETENSCHAPPEN AMSTERDAM Reprinted from Proceedings, Series A, 61, No. 1 and Indag. Math., 20, No.

Last Name Student Number. Last Name Student Number [5] Q1. Circuit analysis. [2] (a) For the following circuit, give the truth table.

NB1140: Physics 1A - Classical mechanics and Thermodynamics Problem set 2 - Forces and energy Week 2: November 2016

Physics 6A. Stress, Strain and Elastic Deformations. Prepared by Vince Zaccone For Campus Learning Assistance Services at UCSB

2 Q 10. Likewise, in case of multiple particles, the corresponding density in 2 must be averaged over all

A REMARK ON PRIME DIVISORS OF PARTITION FUNCTIONS

2. THE FUNDAMENTAL THEOREM: n\n(n)

Constant-Space String-Matching. in Sublinear Average Time. (Extended Abstract) Wojciech Rytter z. Warsaw University. and. University of Liverpool

Least squares fitting with elliptic paraboloids

Course Notes for EE227C (Spring 2018): Convex Optimization and Approximation

Descent polynomials. Mohamed Omar Department of Mathematics, Harvey Mudd College, 301 Platt Boulevard, Claremont, CA , USA,

PRELIMINARIES This section lists for later sections the necessary preliminaries, which include definitions, notations and lemmas.

Homework 3 Solutions CSE 101 Summer 2017

Student Book pages

arxiv: v2 [math.co] 3 Dec 2008

. The univariate situation. It is well-known for a long tie that denoinators of Pade approxiants can be considered as orthogonal polynoials with respe

Fixed-to-Variable Length Distribution Matching

DEPARTMENT OF ECONOMETRICS AND BUSINESS STATISTICS

The concavity and convexity of the Boros Moll sequences

CHARACTER SUMS AND RAMSEY PROPERTIES OF GENERALIZED PALEY GRAPHS. Nicholas Wage Appleton East High School, Appleton, WI 54915, USA.

Optical Properties of Plasmas of High-Z Elements

Left-to-right maxima in words and multiset permutations

A Better Algorithm For an Ancient Scheduling Problem. David R. Karger Steven J. Phillips Eric Torng. Department of Computer Science

arxiv: v1 [cs.ds] 3 Feb 2014

#A52 INTEGERS 10 (2010), COMBINATORIAL INTERPRETATIONS OF BINOMIAL COEFFICIENT ANALOGUES RELATED TO LUCAS SEQUENCES

Research Article Some Formulae of Products of the Apostol-Bernoulli and Apostol-Euler Polynomials

A Simplified Analytical Approach for Efficiency Evaluation of the Weaving Machines with Automatic Filling Repair

Principal Components Analysis

Lecture 21 Principle of Inclusion and Exclusion

A NOTE ON ENTROPY OF LOGIC

Efficient Filter Banks And Interpolators

The Euler-Maclaurin Formula and Sums of Powers

4 = (0.02) 3 13, = 0.25 because = 25. Simi-

Non-Parametric Non-Line-of-Sight Identification 1

Problem Set 2. Chapter 1 Numerical:

Egyptian Mathematics Problem Set

Randomized Recovery for Boolean Compressed Sensing

LATTICE POINT SOLUTION OF THE GENERALIZED PROBLEM OF TERQUEi. AND AN EXTENSION OF FIBONACCI NUMBERS.

On the Existence of Pure Nash Equilibria in Weighted Congestion Games

On Poset Merging. 1 Introduction. Peter Chen Guoli Ding Steve Seiden. Keywords: Merging, Partial Order, Lower Bounds. AMS Classification: 68W40

Finite fields. and we ve used it in various examples and homework problems. In these notes I will introduce more finite fields

International Mathematical Olympiad. Preliminary Selection Contest 2009 Hong Kong. Outline of Solutions

1 Generalization bounds based on Rademacher complexity

Divisibility of Polynomials over Finite Fields and Combinatorial Applications

Chapter II TRIANGULAR NUMBERS

IN modern society that various systems have become more

ADVANCES ON THE BESSIS- MOUSSA-VILLANI TRACE CONJECTURE

PY /005 Practice Test 1, 2004 Feb. 10

A proposal for a First-Citation-Speed-Index Link Peer-reviewed author version

A note on the multiplication of sparse matrices

Figure 1: Equivalent electric (RC) circuit of a neurons membrane

Uncoupled automata and pure Nash equilibria

Transcription:

Distances between the winning nubers in Lottery Konstantinos Drakakis arxiv:ath/0507469v1 [ath.co] 22 Jul 2005 16 March 2005 Abstract We prove an interesting fact about Lottery: the winning 6 nubers (out of 49 in the gae of the Lottery contain two consecutive nubers with a surprisingly high probability (alost 50%. 1 Introduction The gae of lottery exists and has been run in any countries (such as the UK, the US, Gerany, France, Ireland, Australia, Greece, Spain, etc. for a nuber of years. In this gae, the player chooses nubers fro aong the nubers 1,...,n >, the order of the choice being uniportant and the values of n and varying fro country to country; the lottery organizers choose publicly nubers in the sae way, and if they are the sae with the ones the player chose, the player wins. Newspapers usually publish the winning set of nubers along with statistics on the nuber of ties each particular nuber fro 1 to n has appeared in the winning set. It is however a slightly different and ore elusive statistical observation that will be of interest to us here. Soe people have noticed that, in the usual case = 6 and n = 49, it happens very often that at least two of the winning nubers are close to each other. As 6 out of 49 is not really any, this sees at first to be paradoxical, if not altogether wrong, and ay reind us strongly of another very siilar faous paradox, the Birthday Paradox. In this work we will prove that this observation is well founded, even if we adopt the strictest interpretation of nubers being close, i.e. that they be consecutive. Our proble to solve then will be the following: What is the probability that, out of > 0 nubers drawn uniforly randoly fro the range 1,...,n >, at least two are consecutive? We will calculate this probability in two ways below: one quite echanical, by finding a recursion and then solving it by eans of generating functions, and one cobinatorial, which will actually yield a ore general result. We will also see that this proble, at least for the usual values = 6 and n = 49, leads to a novel and unexpected gabling application. 2 First solution Let f(n, be the nuber of ways in which nubers can be chosen out of 1,...,n so that no two are consecutive. For any particular choice, one of the following will hold: Neither 1 nor n is chosen: we have to choose nubers aong 2,...,n 1 and the nuber of ways this can be accoplished in is f(n 2,. 1 and/or n is chosen: the nuber of ways this can be accoplished in is, according to the inclusionexclusion principle, the su of the nuber of ways of choosing 1 and choosing n inus the nuber of ways in choosing both. Observe now that 2 cannot be chosen if 1 is, and that n 1 cannot be chosen if n is. Then, in the first two cases the nuber of choices is f(n 2, 1, and in the last one f(n 4, 2, so that the total nuber of choices if 1 and/or n is chosen is 2f(n 2, 1+f(n 4, 2. Accordingly, suing both cases: f(n, = f(n 2,+2f(n 2, 1 f(n 4, 2 In addition to the recursive forula above, we need soe boundary conditions as well, corresponding to n = 0,1,2,3 and = 0,1. They are provided by the following: We can choose no nubers in only one way: f(n,0 = 1, n 0. 1

We can choose one nuber in n ways: f(n,1 = n, n 0. f(3,2 = 1 Let us now write down the generating function for f(n,: F(z,w = n=4 =2 f(n,z n w The upper boundary for is deterined by the fact that f(n, = 0 if where By ultiplying the recursion forula by z n w, and applying the operator F(n, = F 1(n,+2F 2(n, F 3(n, n 2 +1., we get: n=4 =2 F 1(n, = F 2(n, = F 3(n, = n=4 =2 n=4 =2 n=4 =2 f(n 2,z n w f(n 2, 1z n w f(n 4, 2z n w For each of the three functions, we get F 1(n, = +1 =2 f(n,z n+2 w = z 2 =2 f(n,z n w = z 2 n=4 =2 f(n,z n w +f(3,2z 3 w 2 = = z 2[ F(z,w+z 3 w 2] F 2(n, = +1 =2 = wz 2 n=4 =2 f(n, 1z n+2 w = =1 f(n,z n+2 w +1 = wz 2 =1 f(n,z n w = [ ] f(n,z n w +f(3,2z 3 w 2 + f(n,1z n w = z 2 w F(z,w+z 3 w 2 +w nz n F 3(n, = +1 =2 = w 2 z 4 f(n, 2z n+4 w = n=4 =2 =0 We still need three auxiliary coputations: ( nz n = z nz n 1 z 2 = z = z 2 2 z 1 z (1 z 2 f(n,z n+4 w +2 = w 2 z 4 =0 f(n,z n w +f(3,2z 3 w 2 + f(n,1z n w + f(n,0z n = f(n,z n w = ] = z 4 w [F(z,w+z 2 3 w 2 +w nz n + z n 2

z n = 1 1 z nz n = z ( nz n 1 1 = z = 1 z z (1 z 2 Putting all of the above together, and after soe further algebraic siplifications, we find: F(z,w = w 2 z 4 3+z(z 3+w(z 1 2 (z 1 2 (1 z wz 2 Of course, this is not the full generating function, as the cases n = 1,2,3 and = 0,1 are entirely issing; we oitted the in order to avoid to have to deal with weird boundary conditions such as f( 3, 1 etc. But now we can add the back. Reeber that f(n,0 = 1, n 0 and f(n,1 = n, n 1; but we have already carried out the relevant coputations as auxiliary coputations above. Therefore: F(z,w = F(z,w+z 3 w 2 + 1 1 z + zw (1 z 2 where the first fraction is the generating function for f(n,0 and the second for f(n,1. After soe algebraic siplifications, we find: F(z,w = 1+zw 1 z wz = 1+zw 2 1 z wz = 1 z(1+zw 2 z 1 z(1+wz = 1 [z(1+wz] n = z n=1 n ( n = z n+ 1 w = n=1 =0 =0 n= ( n +1 z n w so that f(n, = ( n +1 If then we draw nubers fro the range 1,...,n, the probability no two are consecutive is: ( n +1 so that the solution to our original proble is: q(n, = p(n, = 1 ( n ( n +1 ( n We should note here that a proof of the forula for f(n, based on induction appears in [1]. 3 Second solution The second solution, cobinatorial in nature, allows us to solve a ore general proble: in how any ways f k (n, can we choose nubers aong the nubers 1,...,n so that the iniu distance between any two of our choices (which we will be calling the distance of our choice is k > 0? There is a very siple forula for that. Iagine we have nubered n balls with the nubers 1,...,n, and that we have chosen the nubers 1 N 1 <... < N n. For every nuber chosen but the last one, reove the nubers of the 1 balls iediately following it; as for the reaining balls, renuber the consecutively and in the order they are. We will end up with n (k 1( 1 balls nubered consecutively fro 1 to n (k 1( 1, and (k 1( 1 blank ones. This final situation will not depend on the balls we chose originally, although the exact positioning of the blank balls aong the nubered ones will. Notice finally that the original nuber of every ball can be recovered: it is the nuber of balls preceding it, including itself! 3

Any valid choice of nubers in the original nubering will correspond to a choice of nubers after renubering, and vice versa: after we choose nubers between 1 and n (k 1( 1, we insert blanks as described above and renuber, getting a valid choice of nubers in the original nubering. This correspondence is obviously bijective. Therefore, f k (n, = ( n (k 1( 1, n > > 1,k 1 For k = 2 we recover the result of our first solution, and hence the sae probability p(n, of at least two choices being consecutive. We also obtain the ore general forula ( n (k 1( 1 p k (n, = 1 ( n for the probability that at least two of the winning nubers have a distance less than k. 4 Application in gabling The probability p(n, can actually be quite large, aybe unexpectedly large: for exaple, for the usual values n = 49 and = 6, we find p(49,6 0.495198. Therefore, the observation that the winning six nubers of the lottery often contain two that are very close is well founded; in alost one gae out of two the winning set of nubers contains two consecutive ones! Moreover, as p(49,6 is very close to 0.5, the proble we just studied can be turned into a successful casino gae: the player bets e that 6 nubers randoly chosen aong 1,...,49 will contain at least two consecutive ones. If this happens, the player gets e fro the house, otherwise the house wins the player s oney. This gae is alost fair, as the player has an alost 50% chance to win; but he actually has slightly less than that, and this gives the house a (profitable advantage! 5 A slight variant What would happen, though, if the player suggests that nubers 1 and n be treated as consecutive as well, naely if we order the nubers on a ring instead of a line? There should now be fewer possible choices for non-consecutive nubers. Indeed, let now g k (n, be the nuber of possible choices of n aong n > 0 nubers so that the iniu distance between any two of the chosen ones is k; in other words, aong any two chosen nubers, with the property that no nuber between the is chosen, there are at least k 1 nubers lying between the. Then, we can split the choices into those in which one nuber aong 1,...,k 1 is chosen, and those in which this is not the case: If one ball aong 1,...,k 1 is chosen, then the reaining 1 balls can be chosen aong n 2k +1 balls (we exclude the chosen ball and the k 1 adjacent balls on either side; but now, by reoving a block of 2k 1 balls fro the circle, we turn it into a line, so the total nuber of choices, for a fixed choice within 1,...,k, is f k (n 2k +1, 1; and since every different choice within 1,...,k leads to different possible choices, the total nuber of choices in this category is (k 1f k (n 2k +1, 1. If no ball is chosen aong 1,...,k 1, then we can just reove the, turn the circle into a line, and renuber: we need to choose balls aong the reaining n k + 1, obeying the distance restrictions, and this can happen in f k (n k+1, ways. Therefore, If we define now g k (n, = (k 1f k (n 2k +1, 1+f k (n k+1,, n > > 1,k 0 p k (n, = 1 g ( k(n, = 1 n ( n k+1 (k 1( 1 ( n 2k +1 (k 1( 2 + 1 ( n we find that p 2(49,6 = p(49,6 0.503203. Therefore, if soe casino agreed to play this variant of the gae with a player, the player would have a slight advantage over the house, and the latter would loose oney! Table 1 gives the values of p k (49,6 and p k (49,6 for k N : 4

k p k (49,6 p k (49,6 1 0 0 2 0.495198 0.503203 3 0.766686 0.806793 4 0.903824 0.937157 5 0.966031 0.984296 6 0.990375 0.997447 7 0.99806 0.999821 8 0.999785 0.999999 9 0.999994 1 10 1 1 Table 1: The probabilities that the winning set of nubers of the standard Lottery has a iniu distance k. 6 Acknowledgeents The author would like to thank an anonyous student of his for counicating to the author his observation about the frequency of appearance of consecutive nubers in the set of the Lottery winning nubers, and thus stiulating hi to write this article. References [1] H. Ryser. Cobinatorial Matheatics Carus Matheatical Monographs (1978 5