Distances between the winning nubers in Lottery Konstantinos Drakakis arxiv:ath/0507469v1 [ath.co] 22 Jul 2005 16 March 2005 Abstract We prove an interesting fact about Lottery: the winning 6 nubers (out of 49 in the gae of the Lottery contain two consecutive nubers with a surprisingly high probability (alost 50%. 1 Introduction The gae of lottery exists and has been run in any countries (such as the UK, the US, Gerany, France, Ireland, Australia, Greece, Spain, etc. for a nuber of years. In this gae, the player chooses nubers fro aong the nubers 1,...,n >, the order of the choice being uniportant and the values of n and varying fro country to country; the lottery organizers choose publicly nubers in the sae way, and if they are the sae with the ones the player chose, the player wins. Newspapers usually publish the winning set of nubers along with statistics on the nuber of ties each particular nuber fro 1 to n has appeared in the winning set. It is however a slightly different and ore elusive statistical observation that will be of interest to us here. Soe people have noticed that, in the usual case = 6 and n = 49, it happens very often that at least two of the winning nubers are close to each other. As 6 out of 49 is not really any, this sees at first to be paradoxical, if not altogether wrong, and ay reind us strongly of another very siilar faous paradox, the Birthday Paradox. In this work we will prove that this observation is well founded, even if we adopt the strictest interpretation of nubers being close, i.e. that they be consecutive. Our proble to solve then will be the following: What is the probability that, out of > 0 nubers drawn uniforly randoly fro the range 1,...,n >, at least two are consecutive? We will calculate this probability in two ways below: one quite echanical, by finding a recursion and then solving it by eans of generating functions, and one cobinatorial, which will actually yield a ore general result. We will also see that this proble, at least for the usual values = 6 and n = 49, leads to a novel and unexpected gabling application. 2 First solution Let f(n, be the nuber of ways in which nubers can be chosen out of 1,...,n so that no two are consecutive. For any particular choice, one of the following will hold: Neither 1 nor n is chosen: we have to choose nubers aong 2,...,n 1 and the nuber of ways this can be accoplished in is f(n 2,. 1 and/or n is chosen: the nuber of ways this can be accoplished in is, according to the inclusionexclusion principle, the su of the nuber of ways of choosing 1 and choosing n inus the nuber of ways in choosing both. Observe now that 2 cannot be chosen if 1 is, and that n 1 cannot be chosen if n is. Then, in the first two cases the nuber of choices is f(n 2, 1, and in the last one f(n 4, 2, so that the total nuber of choices if 1 and/or n is chosen is 2f(n 2, 1+f(n 4, 2. Accordingly, suing both cases: f(n, = f(n 2,+2f(n 2, 1 f(n 4, 2 In addition to the recursive forula above, we need soe boundary conditions as well, corresponding to n = 0,1,2,3 and = 0,1. They are provided by the following: We can choose no nubers in only one way: f(n,0 = 1, n 0. 1
We can choose one nuber in n ways: f(n,1 = n, n 0. f(3,2 = 1 Let us now write down the generating function for f(n,: F(z,w = n=4 =2 f(n,z n w The upper boundary for is deterined by the fact that f(n, = 0 if where By ultiplying the recursion forula by z n w, and applying the operator F(n, = F 1(n,+2F 2(n, F 3(n, n 2 +1., we get: n=4 =2 F 1(n, = F 2(n, = F 3(n, = n=4 =2 n=4 =2 n=4 =2 f(n 2,z n w f(n 2, 1z n w f(n 4, 2z n w For each of the three functions, we get F 1(n, = +1 =2 f(n,z n+2 w = z 2 =2 f(n,z n w = z 2 n=4 =2 f(n,z n w +f(3,2z 3 w 2 = = z 2[ F(z,w+z 3 w 2] F 2(n, = +1 =2 = wz 2 n=4 =2 f(n, 1z n+2 w = =1 f(n,z n+2 w +1 = wz 2 =1 f(n,z n w = [ ] f(n,z n w +f(3,2z 3 w 2 + f(n,1z n w = z 2 w F(z,w+z 3 w 2 +w nz n F 3(n, = +1 =2 = w 2 z 4 f(n, 2z n+4 w = n=4 =2 =0 We still need three auxiliary coputations: ( nz n = z nz n 1 z 2 = z = z 2 2 z 1 z (1 z 2 f(n,z n+4 w +2 = w 2 z 4 =0 f(n,z n w +f(3,2z 3 w 2 + f(n,1z n w + f(n,0z n = f(n,z n w = ] = z 4 w [F(z,w+z 2 3 w 2 +w nz n + z n 2
z n = 1 1 z nz n = z ( nz n 1 1 = z = 1 z z (1 z 2 Putting all of the above together, and after soe further algebraic siplifications, we find: F(z,w = w 2 z 4 3+z(z 3+w(z 1 2 (z 1 2 (1 z wz 2 Of course, this is not the full generating function, as the cases n = 1,2,3 and = 0,1 are entirely issing; we oitted the in order to avoid to have to deal with weird boundary conditions such as f( 3, 1 etc. But now we can add the back. Reeber that f(n,0 = 1, n 0 and f(n,1 = n, n 1; but we have already carried out the relevant coputations as auxiliary coputations above. Therefore: F(z,w = F(z,w+z 3 w 2 + 1 1 z + zw (1 z 2 where the first fraction is the generating function for f(n,0 and the second for f(n,1. After soe algebraic siplifications, we find: F(z,w = 1+zw 1 z wz = 1+zw 2 1 z wz = 1 z(1+zw 2 z 1 z(1+wz = 1 [z(1+wz] n = z n=1 n ( n = z n+ 1 w = n=1 =0 =0 n= ( n +1 z n w so that f(n, = ( n +1 If then we draw nubers fro the range 1,...,n, the probability no two are consecutive is: ( n +1 so that the solution to our original proble is: q(n, = p(n, = 1 ( n ( n +1 ( n We should note here that a proof of the forula for f(n, based on induction appears in [1]. 3 Second solution The second solution, cobinatorial in nature, allows us to solve a ore general proble: in how any ways f k (n, can we choose nubers aong the nubers 1,...,n so that the iniu distance between any two of our choices (which we will be calling the distance of our choice is k > 0? There is a very siple forula for that. Iagine we have nubered n balls with the nubers 1,...,n, and that we have chosen the nubers 1 N 1 <... < N n. For every nuber chosen but the last one, reove the nubers of the 1 balls iediately following it; as for the reaining balls, renuber the consecutively and in the order they are. We will end up with n (k 1( 1 balls nubered consecutively fro 1 to n (k 1( 1, and (k 1( 1 blank ones. This final situation will not depend on the balls we chose originally, although the exact positioning of the blank balls aong the nubered ones will. Notice finally that the original nuber of every ball can be recovered: it is the nuber of balls preceding it, including itself! 3
Any valid choice of nubers in the original nubering will correspond to a choice of nubers after renubering, and vice versa: after we choose nubers between 1 and n (k 1( 1, we insert blanks as described above and renuber, getting a valid choice of nubers in the original nubering. This correspondence is obviously bijective. Therefore, f k (n, = ( n (k 1( 1, n > > 1,k 1 For k = 2 we recover the result of our first solution, and hence the sae probability p(n, of at least two choices being consecutive. We also obtain the ore general forula ( n (k 1( 1 p k (n, = 1 ( n for the probability that at least two of the winning nubers have a distance less than k. 4 Application in gabling The probability p(n, can actually be quite large, aybe unexpectedly large: for exaple, for the usual values n = 49 and = 6, we find p(49,6 0.495198. Therefore, the observation that the winning six nubers of the lottery often contain two that are very close is well founded; in alost one gae out of two the winning set of nubers contains two consecutive ones! Moreover, as p(49,6 is very close to 0.5, the proble we just studied can be turned into a successful casino gae: the player bets e that 6 nubers randoly chosen aong 1,...,49 will contain at least two consecutive ones. If this happens, the player gets e fro the house, otherwise the house wins the player s oney. This gae is alost fair, as the player has an alost 50% chance to win; but he actually has slightly less than that, and this gives the house a (profitable advantage! 5 A slight variant What would happen, though, if the player suggests that nubers 1 and n be treated as consecutive as well, naely if we order the nubers on a ring instead of a line? There should now be fewer possible choices for non-consecutive nubers. Indeed, let now g k (n, be the nuber of possible choices of n aong n > 0 nubers so that the iniu distance between any two of the chosen ones is k; in other words, aong any two chosen nubers, with the property that no nuber between the is chosen, there are at least k 1 nubers lying between the. Then, we can split the choices into those in which one nuber aong 1,...,k 1 is chosen, and those in which this is not the case: If one ball aong 1,...,k 1 is chosen, then the reaining 1 balls can be chosen aong n 2k +1 balls (we exclude the chosen ball and the k 1 adjacent balls on either side; but now, by reoving a block of 2k 1 balls fro the circle, we turn it into a line, so the total nuber of choices, for a fixed choice within 1,...,k, is f k (n 2k +1, 1; and since every different choice within 1,...,k leads to different possible choices, the total nuber of choices in this category is (k 1f k (n 2k +1, 1. If no ball is chosen aong 1,...,k 1, then we can just reove the, turn the circle into a line, and renuber: we need to choose balls aong the reaining n k + 1, obeying the distance restrictions, and this can happen in f k (n k+1, ways. Therefore, If we define now g k (n, = (k 1f k (n 2k +1, 1+f k (n k+1,, n > > 1,k 0 p k (n, = 1 g ( k(n, = 1 n ( n k+1 (k 1( 1 ( n 2k +1 (k 1( 2 + 1 ( n we find that p 2(49,6 = p(49,6 0.503203. Therefore, if soe casino agreed to play this variant of the gae with a player, the player would have a slight advantage over the house, and the latter would loose oney! Table 1 gives the values of p k (49,6 and p k (49,6 for k N : 4
k p k (49,6 p k (49,6 1 0 0 2 0.495198 0.503203 3 0.766686 0.806793 4 0.903824 0.937157 5 0.966031 0.984296 6 0.990375 0.997447 7 0.99806 0.999821 8 0.999785 0.999999 9 0.999994 1 10 1 1 Table 1: The probabilities that the winning set of nubers of the standard Lottery has a iniu distance k. 6 Acknowledgeents The author would like to thank an anonyous student of his for counicating to the author his observation about the frequency of appearance of consecutive nubers in the set of the Lottery winning nubers, and thus stiulating hi to write this article. References [1] H. Ryser. Cobinatorial Matheatics Carus Matheatical Monographs (1978 5