arxiv: v2 [math.pr] 29 Jul 2012


Appendix to "Approximating perpetuities"

arXiv:1203.0679v2 [math.PR] 29 Jul 2012

Margarete Knape and Ralph Neininger
Institute for Mathematics, J.W. Goethe-University, 60054 Frankfurt a.M., Germany
Email: {knape, neiningr}@math.uni-frankfurt.de
DOI 10.1007/s11009-012-9299-2

July 25, 2012

Abstract

An algorithm for perfect simulation from the unique solution of the distributional fixed point equation $Y \stackrel{d}{=} UY + U(1-U)$ is constructed, where $Y$ and $U$ are independent and $U$ is uniformly distributed on $[0,1]$. This distribution comes up as a limit distribution in the probabilistic analysis of the Quickselect algorithm. Our simulation algorithm is based on coupling from the past with a multigamma coupler. It has four lines of code.

Keywords: Perfect simulation, perpetuity, Quickselect, coupling from the past, multigamma coupler, key exchanges.

1 Introduction

In a probabilistic analysis of the algorithm Quickselect, Hwang and Tsai [8] showed that, when applied to a uniformly random permutation of length $n$ and selecting a rank of order $o(n)$, the normalized number of key exchanges performed by Quickselect converges in distribution to a limit distribution $\mu$. This limit distribution is characterized as the unique probability measure $\mu = \mathcal{L}(Y)$ such that

$$Y \stackrel{d}{=} UY + U(1-U), \tag{1}$$

where $\stackrel{d}{=}$ (also written $=_d$) denotes equality in distribution and $U$ is uniformly distributed over the unit interval $[0,1]$ and independent of $Y$.

The distribution $\mu$ was studied in [9]. In particular we showed that $\mu$ has a bounded, $1/2$-Hölder continuous density and that $\mu$ is supported by the unit interval $[0,1]$, and we developed a method to numerically approximate the density and the corresponding distribution function. In Remark 2.9 of [9] we noted that this is sufficient to theoretically construct an algorithm for perfect simulation from $\mu$ based on von Neumann's rejection method along the approach taken in Devroye [2]. While the numerical approximations yield an algorithm for perfect simulation in almost surely finite time, the convergence rates of our approximations are poor and the expected running time is infinite. We do not expect such an algorithm to terminate within our lifetimes.

Recently, Fill and Huber [6] published an algorithm for perfect simulation of a related distribution, known as the Dickman distribution and characterized as the unique solution of the distributional fixed point equation $Y \stackrel{d}{=} UY + 1$. This algorithm is based on coupling from the past of a Markov chain with the Dickman distribution as stationary distribution. The method makes use of a multigamma coupler and of a dominating chain to deal with the unbounded support of the Dickman distribution. In fact, Fill and Huber develop their algorithm for a more general class of distributions, the Vervaat perpetuities. Devroye and Fawzi [3] presented a different multigamma coupler and a different dominating chain, resulting in a faster coupling from the past algorithm for the Dickman distribution. Both algorithms are also fully satisfactory from a practical point of view: millions of independent samples from the Dickman distribution can be generated within seconds.

In this note we construct a coupling from the past algorithm for the solution $\mu$ of (1).
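As a quick sanity check that is not part of the original note, the first two moments of $\mu$ can be read off from the fixed point equation $Y \stackrel{d}{=} UY + U(1-U)$, using only the independence of $U$ and $Y$ and the fact that $Y$ is bounded. Such values are convenient for testing any sampler for $\mu$:

$$
\begin{aligned}
E[Y] &= E[U]\,E[Y] + E[U(1-U)] = \tfrac12 E[Y] + \tfrac16
  &&\Longrightarrow\quad E[Y] = \tfrac13,\\
E[Y^2] &= E[U^2]\,E[Y^2] + 2\,E[U^2(1-U)]\,E[Y] + E[U^2(1-U)^2]
        = \tfrac13 E[Y^2] + 2\cdot\tfrac1{12}\cdot\tfrac13 + \tfrac1{30}
  &&\Longrightarrow\quad E[Y^2] = \tfrac2{15},\\
\operatorname{Var}(Y) &= \tfrac2{15} - \tfrac19 = \tfrac1{45}.
\end{aligned}
$$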
Compared to the more difficult Dickman case we benefit from the special analytic structure of the densities $\varphi_x$ of $Ux + U(1-U)$ for $x \in [0,1]$. In particular, we have

$$\inf_{x \in [0,1]} \; \inf_{t \in [0,1/4]} \varphi_x(t) \ge \frac12, \tag{2}$$

which allows for the construction of a multigamma coupler as proposed by Murdoch and Green [10, Section 2.1]. This results in a fast and simple four-line-code algorithm.

Note that a general method described in an unpublished extension of [3], see Fawzi [5], can also be applied to our $\mu$: In [5, Section 4] it is shown that when one is able to perfectly simulate from the solution of $Y \stackrel{d}{=} AY + 1$ with a random $0 \le A \le 1$, this can be turned into an algorithm to simulate from the solution of $Y \stackrel{d}{=} AY + B$, whenever $B \ge 0$ is bounded. Here, $(A, B)$ is independent of $Y$. Hence, this method together with the simulation algorithm for the Dickman distribution also yields an algorithm to simulate from $\mu$.

For general perfect simulation algorithms for another class of perpetuities see Devroye and James [4]. For perfect simulation algorithms from stationary distributions of positive Harris recurrent Markov chains see Hobert and Robert [7].

In the field of exact simulation from nonuniform distributions it is customary to assume that a sequence of independent random variables, identically and uniformly distributed on $[0,1]$, is available and that elementary operations of and between real numbers such as $+$, $-$, $/$, $\times$, $\sqrt{x}$, $\log x$, etc., can be performed with absolute precision; see Devroye [1] for a comprehensive account of nonuniform random number generation.

2 Markov chain and multigamma coupler

An underlying ergodic Markov chain $(X_j)$ on $[0,1]$ having $\mu$ as stationary distribution is given as follows: For all $x \in [0,1]$, given $X_j = x$, we define $X_{j+1}$ to be distributed as $Ux + U(1-U)$ with a uniform $[0,1]$ random variable $U$. In the context of coupling from the past a realization of such a Markov chain is usually constructed with a deterministic update function $\Phi : [0,1] \times [0,1] \to [0,1]$ such that $X_{j+1} := \Phi(X_j, U_{j+1})$ yields a realization of the chain, where $(U_j)$ is a sequence of independent and uniform $[0,1]$ random variables. A trivial choice for $\Phi$ is $(x,u) \mapsto ux + u(1-u)$. However, to make coupling of the chains possible, we follow the construction of a multigamma coupler as described by Murdoch and Green [10].

The construction is as follows: Assume that a probability density $f$ is written as $f = f_1 + f_2$ with measurable, nonnegative functions $f_1, f_2$ such that $\|f_1\| := \int f_1(x)\,dx > 0$ and $\|f_2\| > 0$. Assume that $Y_1, Y_2$ are random variables with densities $f_1/\|f_1\|$ and $f_2/\|f_2\|$ respectively and that $B$ is a Bernoulli($\|f_1\|$) random variable independent of $(Y_1, Y_2)$.
Then the random variable $BY_1 + (1-B)Y_2$ has density $f$.

The aim now is to obtain for the densities $\varphi_x$ of $Ux + U(1-U)$ representations $\varphi_x = r + g_x$ as above, where $r$ is independent of $x \in [0,1]$. Typically this may not be possible since one may have $\inf_x \varphi_x = 0$, such that a non-zero $r$ independent of $x$ does not exist. However, in our particular situation we have (2), hence we are able to choose, e.g.,

$$r(t) := \frac12 \mathbf{1}_{[0,1/4)}(t), \qquad t \in [0,1]. \tag{3}$$

Clearly, $U/4$ has density $r/\|r\|$, and let us assume for the moment that a random variable $Y_x$ with density $g_x/\|g_x\|$ can be simulated via its inverse distribution function (quantile function) $G_x^{-1}$, i.e., $\mathcal{L}(Y_x) = \mathcal{L}(G_x^{-1}(U))$. Then, with a Bernoulli($\|r\|$) random variable $B$, independent of $U$, we have that for all $x \in [0,1]$

$$Ux + U(1-U) \stackrel{d}{=} B\,\frac{U}{4} + (1-B)\,G_x^{-1}(U).$$

Hence, our update function is

$$\Phi : [0,1] \times \{0,1\} \times [0,1] \to [0,1], \qquad (x, b, u) \mapsto b\,\frac{u}{4} + (1-b)\,G_x^{-1}(u).$$

If we construct our Markov chain from the past using $\Phi$, in each step there is a probability of $\|r\| = 1/8$ that all chains couple simultaneously. In other words, we can just start at a Geometric(1/8) distributed time $-N$ in the past, the first instant of $\{B = 1\}$ when moving back into the past. At this time $-N$ we couple all chains via $X_{-N} := U_{-N}/4$ and let the chain run from there until time $0$ using the updates $G^{-1}_{X_j}(U_{j+1})$ for $j = -N, \dots, -1$. It is shown in [10, Section 2.1] that this is a valid implementation of the coupling from the past algorithm in general.

Hence, we need to derive expressions for the functions $G_x^{-1}$ containing only elementary operations. It was calculated in [9, equation (28)] that, for all $t \in [0,1]$, we have

$$\varphi_x(t) = \left((1+x)^2 - 4t\right)^{-1/2} \left(\mathbf{1}_{[0,x)}(t) + 2 \cdot \mathbf{1}_{[x,b_x)}(t)\right)$$

with $b_x := ((1+x)/2)^2$. Hence, with $r$ given in (3) we have $\varphi_x(t) \ge r(t)$ for all $x, t \in [0,1]$. Note that coupling occurs faster when the function $r$ can be chosen larger. For our densities $\varphi_x$ we could as well choose

$$\tilde r(t) = \frac{1}{2\sqrt{1-t}}\,\mathbf{1}_{[0,1/4)}(t), \qquad t \in [0,1].$$

Then we have $\varphi_x(t) \ge \tilde r(t) \ge r(t)$ for all $x, t \in [0,1]$. However, with our choice of $r$ the subsequent inversion of distribution functions can be done in elementary terms.

We need to invert the distribution functions $G_x : [0,1] \to [0,1]$ corresponding to the normalized versions of $g_x = \varphi_x - r$. Since $\|g_x\| = 1 - \|r\| = 7/8$, we have

$$G_x(y) = \frac87 \int_0^y \left(\varphi_x(t) - r(t)\right) dt = \frac87 \left(F_x(y) - \frac12 \left(y \wedge \frac14\right)\right),$$

where

$$F_x(y) := \begin{cases}
\frac12\left(1 + x - \sqrt{(1+x)^2 - 4y}\right), & 0 \le y < x,\\[2pt]
1 - \sqrt{(1+x)^2 - 4y}, & x \le y < b_x,\\[2pt]
1, & b_x \le y,
\end{cases}$$

is the distribution function of $Ux + U(1-U)$. The inversion of $G_x$ can be done by explicit calculations and yields

$$G_x^{-1}(z) = \begin{cases}
-\frac74 z + \sqrt{7z + (1-x)^2} - 1 + x, & \text{if } x \in [0, 1/4],\ z \in [0, q_x],\\[2pt]
-\frac74 z + 2\sqrt{7z + 9 + x(x+2)} - 6, & \text{if } x \in [0, 1/4],\ z \in (q_x, r_x],\\[2pt]
\frac{1}{256}(15 + 8x - 7z)(1 + 8x + 7z), & \text{if } x \in [0, 1/4],\ z \in (r_x, 1],\\[2pt]
-\frac74 z + \sqrt{7z + (1-x)^2} - 1 + x, & \text{if } x \in (1/4, 1],\ z \in [0, s_x],\\[2pt]
\frac{1}{64}(7 + 8x - 7z)(1 + 7z), & \text{if } x \in (1/4, 1],\ z \in (s_x, t_x],\\[2pt]
\frac{1}{256}(15 + 8x - 7z)(1 + 8x + 7z), & \text{if } x \in (1/4, 1],\ z \in (t_x, 1],
\end{cases}$$

where

$$q_x := \frac47 x, \qquad r_x := 1 - \frac87 \sqrt{x(x+2)}, \qquad s_x := \frac17\left(3 + 4x - 4\sqrt{x(x+2)}\right), \qquad t_x := \frac17 (8x - 1).$$

3 The algorithm

Our algorithm Simulate[$Y \stackrel{d}{=} UY + U(1-U)$] has the form discussed in the previous section: It draws on a sequence of independent uniform$[0,1]$ random variables $(U_n)_{n \le 0}$ and an independent geometrically distributed random variable. (The latter can clearly be simulated on the basis of independent uniform$[0,1]$ random variables as well.)

Simulate[$Y \stackrel{d}{=} UY + U(1-U)$]:
    $N \leftarrow$ Geometric(1/8)
    $X \leftarrow U_{-N}/4$
    for $j$ from $-N+1$ to $0$ do $X \leftarrow G_X^{-1}(U_j)$
    return($X$)
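The four-line algorithm can be sketched in Python as follows. This is only an illustration under stated assumptions, not the authors' implementation: the names `g_inv` and `simulate` are hypothetical, double-precision floats stand in for the absolute-precision arithmetic assumed in the text, and $N$ is taken to be the number of non-coupling updates between the coupling instant and time 0 (so $N = 0$ is possible). That reading of Geometric(1/8) is an assumption of the sketch; it is the one under which the sample mean matches the value $1/3$ that follows from taking expectations in $Y \stackrel{d}{=} UY + U(1-U)$.

```python
import math
import random


def g_inv(x, z):
    """Quantile function G_x^{-1}(z), following the case distinction in the text."""
    root = math.sqrt(x * (x + 2.0))
    if x <= 0.25:
        lower = 4.0 * x / 7.0                       # q_x
        upper = 1.0 - 8.0 * root / 7.0              # r_x
    else:
        lower = (3.0 + 4.0 * x - 4.0 * root) / 7.0  # s_x
        upper = (8.0 * x - 1.0) / 7.0               # t_x

    if z <= lower:
        y = -1.75 * z + math.sqrt(7.0 * z + (1.0 - x) ** 2) - 1.0 + x
    elif z <= upper:
        if x <= 0.25:
            y = -1.75 * z + 2.0 * math.sqrt(7.0 * z + 9.0 + x * (x + 2.0)) - 6.0
        else:
            y = (7.0 + 8.0 * x - 7.0 * z) * (1.0 + 7.0 * z) / 64.0
    else:
        y = (15.0 + 8.0 * x - 7.0 * z) * (1.0 + 8.0 * x + 7.0 * z) / 256.0

    # Guard against tiny floating-point excursions outside [0, 1].
    return min(1.0, max(0.0, y))


def simulate(rng=random):
    """Draw one sample via coupling from the past with the multigamma coupler."""
    # Assumption: N counts the updates with B = 0 after the coupling instant,
    # so P(N = k) = (1/8) * (7/8)**k for k = 0, 1, 2, ...
    n = 0
    while rng.random() >= 1.0 / 8.0:
        n += 1
    x = rng.random() / 4.0            # X_{-N} := U_{-N} / 4
    for _ in range(n):                # updates for times -N+1, ..., 0
        x = g_inv(x, rng.random())
    return x


if __name__ == "__main__":
    rng = random.Random(2012)
    samples = [simulate(rng) for _ in range(100_000)]
    print("sample mean:", sum(samples) / len(samples))
```

Passing a seeded `random.Random` instance makes runs reproducible; with the module-level default, each call draws from Python's global generator.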

The analysis of the complexity of this algorithm is trivial as the loop is iterated a random Geometric(1/8) number of times, hence, e.g., on average eight times. In Figure 1 the histogram (normalized to area 1) of the values of 10 million independent samples generated with Simulate[$Y \stackrel{d}{=} UY + U(1-U)$] is plotted. This simulation was done within a few seconds. A numerical approximation of the density of $\mu$ has already been presented in [9, Figure 1].

Figure 1: Histogram of the values of 10 million independent samples from $\mu$ generated with the algorithm Simulate[$Y \stackrel{d}{=} UY + U(1-U)$].

References

[1] Devroye, L. (1986) Nonuniform Random Variate Generation. Springer, New York.
[2] Devroye, L. (2001) Simulating perpetuities. Methodol. Comput. Appl. Probab. 3, 97–115.
[3] Devroye, L. and Fawzi, O. (2010) Simulating the Dickman distribution. Statist. Probab. Lett. 80, 242–247.
[4] Devroye, L. and James, L. (2011) The double CFTP method. ACM Trans. Model. Comput. Simul. 21, 1–20.
[5] Fawzi, O. (2007) Efficient sampling from perpetuities using coupling from the past. Unpublished research report, available via http://www.cs.mcgill.ca/~ofawzi/docs/rapportm.pdf
[6] Fill, J.A. and Huber, M.L. (2010) Perfect simulation of Vervaat perpetuities. Elec. J. Probab. 15, 96–109.
[7] Hobert, J.P. and Robert, C.P. (2004) A mixture representation of π with applications in Markov chain Monte Carlo and perfect sampling. Ann. Appl. Probab. 14, 1295–1305.
[8] Hwang, H.-K. and Tsai, T.-H. (2002) Quickselect and the Dickman function. Combin. Probab. Comput. 11, 353–371.
[9] Knape, M. and Neininger, R. (2008) Approximating perpetuities. Methodol. Comput. Appl. Probab. 10, 507–529.
[10] Murdoch, D.J. and Green, P.J. (1998) Exact sampling from a continuous state space. Scand. J. Statist. 25, 483–502.