THE SIMPLE URN PROCESS AND THE STOCHASTIC APPROXIMATION OF ITS BEHAVIOR

Similar documents
7.3 Singular points and the method of Frobenius

Rational Functions. Elementary Functions. Algebra with mixed fractions. Algebra with mixed fractions

Math 115 Spring 11 Written Homework 10 Solutions

Some Node Degree Properties of Series-Parallel Graphs Evolving Under a Stochastic Growth Model

DIFFERENTIAL EQUATIONS

Georgia Department of Education Common Core Georgia Performance Standards Framework CCGPS Advanced Algebra Unit 2

Lecture 2. We now introduce some fundamental tools in martingale theory, which are useful in controlling the fluctuation of martingales.

On the game of Memory

Identifying the Graphs of Polynomial Functions

Section 3.1 Quadratic Functions

September Math Course: First Order Derivative

REVIEW OF DIFFERENTIAL CALCULUS

MATH The Chain Rule Fall 2016 A vector function of a vector variable is a function F: R n R m. In practice, if x 1, x n is the input,

Advanced Mathematics Unit 2 Limits and Continuity

Advanced Mathematics Unit 2 Limits and Continuity

Limits at Infinity. Horizontal Asymptotes. Definition (Limits at Infinity) Horizontal Asymptotes

Section Properties of Rational Expressions

The Theory of Second Order Linear Differential Equations 1 Michael C. Sullivan Math Department Southern Illinois University

18.03SC Practice Problems 2

Solving Quadratic & Higher Degree Equations

1.2 Functions and Their Properties Name:

Module 2: Reflecting on One s Problems

56 CHAPTER 3. POLYNOMIAL FUNCTIONS

Announcements. Topics: Homework:

STAT 414: Introduction to Probability Theory

Advanced topic: Space complexity

Functional Limits and Continuity

2tdt 1 y = t2 + C y = which implies C = 1 and the solution is y = 1

d(x n, x) d(x n, x nk ) + d(x nk, x) where we chose any fixed k > N

I forgot to mention last time: in the Ito formula for two standard processes, putting

ECE534, Spring 2018: Solutions for Problem Set #4 Due Friday April 6, 2018

Complex Differentials and the Stokes, Goursat and Cauchy Theorems

AP Calculus Chapter 9: Infinite Series

MATH 56A SPRING 2008 STOCHASTIC PROCESSES

6. Brownian Motion. Q(A) = P [ ω : x(, ω) A )

Chapter 3: Root Finding. September 26, 2005

Final Exam A Name. 20 i C) Solve the equation by factoring. 4) x2 = x + 30 A) {-5, 6} B) {5, 6} C) {1, 30} D) {-5, -6} -9 ± i 3 14

In other words, we are interested in what is happening to the y values as we get really large x values and as we get really small x values.

Determine whether the formula determines y as a function of x. If not, explain. Is there a way to look at a graph and determine if it's a function?

Lecture 5. 1 Chung-Fuchs Theorem. Tel Aviv University Spring 2011

T 1. The value function v(x) is the expected net gain when using the optimal stopping time starting at state x:

Section 1.6 Inverse Functions

Recursive Estimation

Theory of Computation

Numerical method for approximating the solution of an IVP. Euler Algorithm (the simplest approximation method)

3 Polynomial and Rational Functions

APPLICATIONS OF DIFFERENTIATION

Population protocols & Ehrenfest urns scheme

, and rewards and transition matrices as shown below:

Mission 1 Simplify and Multiply Rational Expressions

Chapter 2: Unconstrained Extrema

SYDE 112, LECTURE 7: Integration by Parts

Linear Algebra, Summer 2011, pt. 2

2. If the values for f(x) can be made as close as we like to L by choosing arbitrarily large. lim

STAT 418: Probability and Stochastic Processes

Polynomials Patterns Task

Review all the activities leading to Midterm 3. Review all the problems in the previous online homework sets (8+9+10).

Chapter 3: Polynomial and Rational Functions

The Growth of Functions. A Practical Introduction with as Little Theory as possible

One-Variable Calculus

USA Mathematical Talent Search Round 1 Solutions Year 27 Academic Year

15-451/651: Design & Analysis of Algorithms September 13, 2018 Lecture #6: Streaming Algorithms last changed: August 30, 2018

2 P a g e. Essential Questions:

Limits: An Intuitive Approach

1 Functions of many variables.

Symmetric Inclusion-Exclusion

Mock Final Exam Name. Solve and check the linear equation. 1) (-8x + 8) + 1 = -7(x + 3) A) {- 30} B) {- 6} C) {30} D) {- 28}

Pre-Calculus: Functions and Their Properties (Solving equations algebraically and graphically, matching graphs, tables, and equations, and

Chapter 1 Review of Equations and Inequalities

CHAPTER 3 Further properties of splines and B-splines

Final Exam C Name i D) 2. Solve the equation by factoring. 4) x2 = x + 72 A) {1, 72} B) {-8, 9} C) {-8, -9} D) {8, 9} 9 ± i

1) The line has a slope of ) The line passes through (2, 11) and. 6) r(x) = x + 4. From memory match each equation with its graph.

2, or x 5, 3 x 0, x 2

MADHAVA MATHEMATICS COMPETITION, December 2015 Solutions and Scheme of Marking

KEMATH1 Calculus for Chemistry and Biochemistry Students. Francis Joseph H. Campeña, De La Salle University Manila

Stochastic Processes, Kernel Regression, Infinite Mixture Models

Lecture 7: Sections 2.3 and 2.4 Rational and Exponential Functions. Recall that a power function has the form f(x) = x r where r is a real number.

19. Basis and Dimension

THE INVERSE FUNCTION THEOREM

1. Introductory Examples

P (A G) dp G P (A G)

k-protected VERTICES IN BINARY SEARCH TREES

Lecture 5. If we interpret the index n 0 as time, then a Markov chain simply requires that the future depends only on the present and not on the past.

Brownian Motion and Stochastic Calculus

Selected Exercises on Expectations and Some Probability Inequalities

For a stochastic process {Y t : t = 0, ±1, ±2, ±3, }, the mean function is defined by (2.2.1) ± 2..., γ t,

Graphs of Polynomial Functions

Notes on Measure Theory and Markov Processes

Lecture 5 - Logarithms, Slope of a Function, Derivatives

6.1 Polynomial Functions

MATH 116, LECTURE 13, 14 & 15: Derivatives

Lecture for Week 2 (Secs. 1.3 and ) Functions and Limits

Math Example Set 12A. m10360/homework.html

UNC Charlotte Super Competition Level 3 Test March 4, 2019 Test with Solutions for Sponsors

Pure Mathematics P1

Chapter 4.1 Introduction to Relations

ter. on Can we get a still better result? Yes, by making the rectangles still smaller. As we make the rectangles smaller and smaller, the

M155 Exam 2 Concept Review

School of the Art Institute of Chicago. Calculus. Frank Timmes. flash.uchicago.edu/~fxt/class_pages/class_calc.

The Princeton Review AP Calculus BC Practice Test 2

Transcription:

THE SIMPLE URN PROCESS AND THE STOCHASTIC APPROXIMATION OF ITS BEHAVIOR MICHAEL KANE As a final project for STAT 637 (Deterministic and Stochastic Optimization) the simple urn model is studied, with special attention being payed to its relationship with stochastic approximation. This write-up is based on portions of Pemantle (2007) which provides a survey of random process but, it also draws from other sources. This paper begins by describing the simple urn model. Next, distributions of ball counts and ball proportions are derived. These derivations should help give the reader an understanding of how urn processes behave. After this intuition is developed, a stochastic approximation of its behavior is derived. This way, the relationship between the stochastic results and the previously derived results is clear. Finally, a case where the stochastic approximation may not hold is introduced and an alternative solution is described. The Polya urn process is defined as: 1. THE POLYA URN Place r red balls and g green balls in the urn. repeat Pick a ball l from the urn uniformly at random. if l is a green ball then Replace l and put another green ball into the urn. end else Replace l and put another red ball into the urn. end end ; Algorithm 1: The Polya Urn Process Algorithm Clearly, the urn starts with r + g balls. For the entire paper it will be assumed that r and g are greater than zero. At any time t 0, the number Date: December 24, 2007. 1

THE SIMPLE URN PROCESS AND THE STOCHASTIC APPROXIMATION OF ITS BEHAVIOR2 of balls in the urn is t + r + g. The next step is to understand the distribution of these balls at t. 2. COUNTING BALLS At the first step of the algorithm, there are r + g balls in the urn and one is selected at random. This means that there is an r/(r + g) chance of picking a red ball at the first step. Likewise, there is a g/(r + g) chance of picking a green ball. Following Pollard (1997), let R t be the event that a red ball is chosen at time t. Likewise, let G t be the even that a green ball is chosen at time t. The probability of picking a red ball at time 2 can then be solved as: PR 2 PR 1 R 2 + PG 1 R 2 PR 1 P(R 2 R 1 ) + PG 1 P(R 2 G 1 ) r r + 1 (r + g) (r + g + 1) + g r (r + g) (r + g + 1) r r + g. This shows PR 1 PR 2 r/(r + g). These calculations work similarly for any t 1, giving the result r PR t (r + g). Now, let s examine the probabilities of a sequence of balls. For example, the probability of getting a green ball at time 1 and a red ball at time 2 is g r PG 1 R 2 PG 1 P(R 2 G 1 ) (r + g) (r + g + 1). The probability of getting a red ball at time 1 and a green ball at time 2 is r g PR 1 G 2 PR 1 P(G 2 R 1 ) (r + g) (r + g + 1). The probability of getting two green balls is PG 1 G 2 PG 1 P(G 2 G 1 ) Finally, the probability of getting two red balls is PR 1 R 2 PR 1 P(R 2 R 1 ) g g + 1 (r + g) (r + g + 1). r r + 1 (r + g) (r + g + 1). It may be noted that PR 1 G 2 PG 1 R 2. In fact, the probability of any sequence of balls is only a function of the number of red balls and the number of green balls which appear in the sequence. P{R 1 red, G 2 green, R 3 red} P{R 1 + R 2 + R 3 2}

THE SIMPLE URN PROCESS AND THE STOCHASTIC APPROXIMATION OF ITS BEHAVIOR3 In Panchenko (2007) it is shown that, by de Finetti Theorem, for an exchangeable sequence of Bernoulli random variables, there exists a distribution F on [0, 1] such that 1 ( ) n P(R 1 +... + R n k) x k (1 x) n k df (x). k To find the distribution of F, let s look at the moments. We have already found that the first moment for R k is r/(r + g). The second moment was also found as Solving for the variance we get 0 r(r + 1) (r + g)(r + g + 1). r(r + 1) (r + g)(r + g + 1) r 2 (r + g) (r2 + r)(r + g) r 2 (r + g + 1) 2 (r + g) 2 (r + g + 1) rg (r + g) 2 (r + g + 1) Which is the variance of the β(r, g) distribution. The fact the F is a beta distribution can be confirmed by realizing that moment generating function of the beta distribution and the the higher moments of this process are equal. These result imply that the following equation tells us how many reds we can expect after n steps 1 ( ) n P{k reds after n steps} x k+r 1 n k+g 1 Γ(r + g)) (1 x) 0 k Γ(r)Γ(g) dx ( ) n Γ(r + g) Γ(k + r)γ(n k + g) k Γ(g)Γ(r) Γ(n + g + r) 3. THE DISTRIBUTION OF THE PROPORTION OF RED BALLS In this section, we would like to find the distribution of the proportion of the number of red balls in the urn at any time. Let X t+1 and Y t+1 be the number of red balls and green balls respectively at time t + 1 and let W t+1 be the proportion of red balls at time t + 1. W t+1 r + X t+1 X t+1 + Y t+1 r + X t+1 r + g + t + 1 W t+1 may be written as a function of the random variable X t+1 and constants r, g, and t.

THE SIMPLE URN PROCESS AND THE STOCHASTIC APPROXIMATION OF ITS BEHAVIOR4 W t+1 is a martingale. Since W t+1 is a proportion we are guaranteed P W t+1 <. Also, if we let P t W t+1 denote the probability of W t+1 conditioned on all information up to and including t then, P t W t+1 (X t + 1) X t X t Y t + (X t + Y t + 1) X t + Y t (X t + Y t ) (X t + Y t + 1) X t (X t + Y t + 1) (X t + Y t )(X t + Y t + 1) X t X t + Y t W t From Pollard (2001, page 151) we know that a submartingale {S n : n N 0 } with sup n PS n + < converges almost surely to an integrable limit. Since a martingale is also a submartingale, and 0 W t+1 1 for all t 0, W t+1 must converge to an integrable limit. To find this limit, first realize W n r + X n r + g + n X n as n. n From the last section, we know that the by de Finetti s theorem ( ) n 1 P{X n k} x k (1 x) n k dβ(r, g)(x). k 0 Again, using results from Panchenko (2007), for any function f with range [0, 1], ( ) Xn n ( ) ( ) k n 1 Pf f x k (1 x) n k dβ(r, g)(x) n n k 0 k0 1 0 B u (x)dβ(r, g)(x). B u (x) is a Bernstein Polynomial which, in Wikipedia (2007), is shown to have the property lim B n(f)(x) f(x). n Therefore, ( ) 1 Xn Pf f(x)dβ(r, g)(x) as n. n 0 Any moment of X t /t approaches the same moment of a β(r, g) distribution for large enough t. The distribution of the proportion of red balls approaches a β(r, g) almost surely as n approaches infinity.

THE SIMPLE URN PROCESS AND THE STOCHASTIC APPROXIMATION OF ITS BEHAVIOR5 4. EXTENSIONS TO OTHER URN MODELS The previous sections study the simple urn model in which, at time 0 there are r red balls and g green balls. At each time step, a ball is chosen at random from the urn and that ball along with another ball of the same color are put back. An extension to this process would be, for each iteration, to replace the picked ball along with c more of the same color (c 1 in the simple urn case). In this case, the the for the probability of the number of red balls is almost the exact same. ( ) ( n Γ r+b ) c P{k red balls at time n} k Γ ( ) ( g c Γ g ) Γ ( ) ( ) k + r c Γ n k + g c Γ ( n + g + ) r c c c The corresponding distribution of the proportion of red balls to the number of balls in the urn is β(r/c, g/c). When the algorithm is run with c 1, this corresponds to the OK Corral process. The story used to motivate this process is that there is a shootout between two gangs. Before the shootout starts there are r live members of the first gang and g live members of the second. At any time t, the probability that a member of first gang will be shot is equal to the proportion of size of the second gang to the total number of gunfighters, Y t /(X t + Y t ). It is assumed that each shot is immediately fatal. The process is run until all the members of one of the two gangs is killed. The distributions of interest are the time when one gang kills all members of the other gang and, for the victorious side, the number of surviving gang members. The OK Corral process falls under a more general set of process which are described as sacrificial in Flajolet, Dumas, and Puyhaubert (2006). For these urn processes c < 1. These models generally add the requirement that c divides both r and g so that on population is guaranteed to reach zero. 5. THE STOCHASTIC APPROXIMATION Let s once again consider the simple urn model, c 1. Now, we would like to get an approximation for the number of red balls in the urn at any time. The number of red balls at time t + 1 can be written recursively as (5.1) X t+1 X t + R t+1.

THE SIMPLE URN PROCESS AND THE STOCHASTIC APPROXIMATION OF ITS BEHAVIOR6 Let t+1 X t+1 X t, then we can rewrite Equation (5.1) as t+1 X R t+1 P t R t+1 + P t R t+1 M t+1 + P t R t+1 M t+1 is a martingale with zero expectation. Furthermore, M t+1 R t+1 P t R t+1 R t+1 r + X t r + g + t R t+1 W t. If we look at the total variation distance between R t+1 and W t sup{ PR t+1 PW t } sup { PP t R t+1 PW t } { ( ) } r + Xt+1 sup P PW t r + g + 1 sup { PW t+1 PW t } 0 as t. The last step comes from Section (3), where it was shown that the distribution of the proportion of red balls is converging to a constant distribution. This result tells us that, for t big enough, the change in the number of red balls at time t + 1 behaves like the expectation of the number of balls added. t+1 X P t R t+1 r + X t r + g + t X t r + g + t + r r + g + t. The term r/(r + g + t) approaches zero as t increases. Therefore, it will be ignored. The difference equation can then be approximated by the differential equation which can be rewritten as dx dt dx X X r + g + t dt r + g + t. Taking the indefinite integral and then the log of both sides yields the equation (5.2) X(t) (r + g + t)e m

THE SIMPLE URN PROCESS AND THE STOCHASTIC APPROXIMATION OF ITS BEHAVIOR7 for some constant m. Remembering that initially there are r red balls we can solve for e m e m r (r + g). Substituting back into Equation (5.2) r X(t) (r + g + t) (r + g) r (r + g) t + r. The number of red balls increases linearly with time. This result is somewhat unsurprising. We already know that the distribution of the probability of getting a red ball at some time t is approximately β(r, b) and so we can expect that the number of red balls will increase linearly in time. The derivation serves to confirm this suspicion. This particular example of creating a dynamic system from the expected behavior of a process falls under the general category of stochastic approximation. These processes can be put into the form Z t+1 Z t 1 η n (F (Z t ) + ξ t+1 + D t ) and satisfy the following requirements: F is a vector field on R n. ξ t+1 is a martingale which converges and satisfies satisfies P t ξ t+1 0. The remainder term D t goes to zero as t increases t1 D t/η t <. When these requirements are satisfied, for big enough t, the differential equation dz dt F (Z). approximates the behavior of the process {Z t }. 6. PATH COUNTING Stochastic approximations rely on the convergence of a martingale in its description. The question then arises: what happens if the process does not continue for an arbitrarily long time so that the martingale term does not have enough time to converge? This is potentially the case for the previously described OK Corral process. To understand these processes Flajolet, Dumas, and Puyhaubert (2006) find urn configuration probabilities based on history counting which model the number of ways an urn can have a configuration given the initial condition of r red balls and g green balls. This section will provide a sketch for the justification and then describe some results reached in the paper.

THE SIMPLE URN PROCESS AND THE STOCHASTIC APPROXIMATION OF ITS BEHAVIOR8 As a motivating example, consider the simple urn process with r g 1 going from time 0 and 1. During this iteration there is 1 red ball which may be picked. If it is picked then the ball count at time 2 is 2 red balls and 1 green balls. Similarly, at this iteration, there is 1 green ball to pick. If it is picked, then the ball count at time 2 is 1 red ball and 2 green balls. Compare this to the polynomial xy(1 + x)(1 + y) xy + x 2 y + xy 2 + x 2 y 2 If we take the exponent of the x term as the red ball count and the exponent of the y term as the green ball count, then the terms x 2 y and xy 2 correspond to these cases. The coefficient of the terms correspond to the number of different ways to reach a count of 3 balls. To consider the possible ball counts at time 2, consider the polynomial x 2 y(1 + x)(1 + y)+xy 2 (1 + x)(1 + y) x 2 y + x 3 y + 2x 2 y 2 + xy 3 + x 3 y 2 + xy 2 + x 2 y 3 At time 2 there are 4 balls in the urn corresponding to term x 3 y, 2x 2 y 2, and xy 3. The only way to have 3 red balls and 1 green ball in the urn at this time is if a red ball is picked twice. Likewise, the only way to have 3 green balls and 1 red ball in the urn at time 2 is if a green ball is picked twice. There can be 2 red balls and 2 green balls if a red ball is picked at the first iteration and a green is picked on the second iteration or a green ball is picked at the first iteration and a red is picked at the second. This corresponds to a coefficient of 2 for the term 2x 2 y 2 Now that we can count the total number of paths to a given urn configuration for the simple urn model, let s find the total number of configurations of the urn at a given time. Again, consider the urn model where the balls along with the sample ball is returned to the urn at any time t. At t 1 there are r + g balls to choose from. At time t 2 there are r + g + 1 balls to choose from. Let s r + g, then the total number of different ways to choose balls through time t is s(s + 1)...(s + t 1) t!c t ( n + s 1 t ). This is referred to as the number of histories of the process. From the difference equations describing an urn model, Flajolet, Dumas, and Puyhaubert (2006) give probabilities of urn configurations at given times based on path counts. These difference equations must have a linear recurrence representation, the sum of the rows of the state transition matrix must be equal, and if balls are being taken from the urn, the number being taken must divide the number of balls.

THE SIMPLE URN PROCESS AND THE STOCHASTIC APPROXIMATION OF ITS BEHAVIOR9 After finding a set of difference equations which describe an urn process, a polynomial generating function is derived. To get a configuration probability at time t, a derived differential operator is applied to the polynomial t times and then the generating function variable is set to zero. The calculated number can then be divided by the the total number of paths to get a probability of a path. As a final example, we will once again consider the simple urn model. The linear recurrence representation is [ ] [ ] [ ] [ ] Xt+1 1 0 Rt+1 Xt +. Y t+1 0 1 G t+1 Y t Using the technique described in the paper, the differential system is { dx dt x2 dy. dt y2 Solving this pair, and remembering that initially there are r red balls and g green balls we get { Xt r 1 tr. Y t g 1 tg Which corresponds to a generating function H(x, y, t) of H(x, y, z) x r y g (1 tx) r (1 ty) g. To get the probability that this urn will have a red balls and g green balls at time n, the differential operator is applied n times and x, y, and t is set to zero. Then, this number is divided by the total number of histories to get P{k reds after n steps} ( n k ) Γ(r + g) Γ(g)Γ(r) which agrees with the results of Section (2). Γ(k + r)γ(n k + g) Γ(n + g + r) 7. CONCLUSION AND FUTURE WORK This paper gives several different approaches to solving the simple urn problem. Although it was only mentioned, each of these techniques lend themselves analyses of more complicated processes (some of which were described). And, in many cases, extending the analysis to these processes is straight forward. It may have been noted that the description of the history counting technique left some holes its justification as well as the example. This was mostly due to time constraints. Since they were more relevant to the class,

References 10 analyses from Sections 2, 3, and 5 were deemed more important and less time was spent understanding the history counting technique. Along with gaining a better understanding of the path counting technique, future work may consist of applying stochastic approximation to the graph coloring problem described in Achlioptas and Molloy (1997). This paper describes a greedy graph coloring algorithm which is applied to Erdös-R enyi graphs (G(n, m/n)). Consider the case where n is large. When the algorithm is run, trying to color the graph with k colors, there is a threshold value γ such that if m < γ, the algorithm will succeed with high probability. If m > γ, then the algorithm will fail with high probability. The algorithm works by keeping track of possible colorings for each vertex. When a vertex is colored, adjacent vertices update their list of possible colorings. A stochastic approximation could be used to model these color list sizes. REFERENCES Achlioptas, D. and M. Molloy (1997). The analysis of a list-coloring algorithm on a random graph (extended abstract). Proceedings of the 38th IEEE Symposium on Foundations of Computer Science, 204 212. Flajolet, P., P. Dumas, and V. Puyhaubert (2006). Some exactly solvable models of urn process theory. Proceedings of Fourth Colloquium on Mathematics and Computer Science AG, 59 118. Panchenko, D. (2007). Theory of probability. Lecture Notes from MIT Open Courseware. Pemantle, R. (2007). A survey of random processes with reinforcement. Probability Surveys 4, 1 79. Pollard, D. (1997). Unexpected symmetry. Lecture Notes for STAT 241/541. Pollard, D. (2001). A User s Guide to Measure Theoretic Probability. Cambridge University Press. Wikipedia (2007). Bernstein polynomial. [Online; accessed Dec-21-2007].