Convergence of Feller Processes

Similar documents
Path Decomposition of Markov Processes. Götz Kersting. University of Frankfurt/Main

Weak convergence and Compactness.

SUMMARY OF RESULTS ON PATH SPACES AND CONVERGENCE IN DISTRIBUTION FOR STOCHASTIC PROCESSES

Feller Processes and Semigroups

Weak convergence and Brownian Motion. (telegram style notes) P.J.C. Spreij

Math 209B Homework 2

Functional Limit theorems for the quadratic variation of a continuous time random walk and for certain stochastic integrals

Basic Definitions: Indexed Collections and Random Functions

Convergence & Continuity

Brownian Motion. 1 Definition Brownian Motion Wiener measure... 3

Additive Lévy Processes

Empirical Processes: General Weak Convergence Theory

16 1 Basic Facts from Functional Analysis and Banach Lattices

Introduction to Empirical Processes and Semiparametric Inference Lecture 09: Stochastic Convergence, Continued

PROBABILITY: LIMIT THEOREMS II, SPRING HOMEWORK PROBLEMS

Lecture 12. F o s, (1.1) F t := s>t

Math General Topology Fall 2012 Homework 11 Solutions

Metric Spaces and Topology

1 Weak Convergence in R k

Introduction to Empirical Processes and Semiparametric Inference Lecture 08: Stochastic Convergence

{σ x >t}p x. (σ x >t)=e at.

PROBABILITY: LIMIT THEOREMS II, SPRING HOMEWORK PROBLEMS

Wiener Measure and Brownian Motion

ELEMENTS OF PROBABILITY THEORY

Translation Invariant Exclusion Processes (Book in Progress)

Combinatorics in Banach space theory Lecture 12

Interacting Particle Systems. J.M. Swart July 20, 2011

EXCHANGEABLE MARKOV PROCESSES ON GRAPHS: FELLER CASE

1 Topology Definition of a topology Basis (Base) of a topology The subspace topology & the product topology on X Y 3

Hilbert Spaces. Contents

Bounded uniformly continuous functions

Lecture 21 Representations of Martingales

Stochastic Processes. Winter Term Paolo Di Tella Technische Universität Dresden Institut für Stochastik

Functional Analysis, Math 7320 Lecture Notes from August taken by Yaofeng Su

Problem Set 1: Solutions Math 201A: Fall Problem 1. Let (X, d) be a metric space. (a) Prove the reverse triangle inequality: for every x, y, z X

An introduction to Mathematical Theory of Control

SMSTC (2007/08) Probability.

Existence and approximation of Hunt processes associated with generalized Dirichlet forms

Lebesgue Integration: A non-rigorous introduction. What is wrong with Riemann integration?

Overview of normed linear spaces

6. Brownian Motion. Q(A) = P [ ω : x(, ω) A )

Homework # , Spring Due 14 May Convergence of the empirical CDF, uniform samples

Branching Processes II: Convergence of critical branching to Feller s CSB

Pathwise construction of tree-valued Fleming-Viot processes

REAL VARIABLES: PROBLEM SET 1. = x limsup E k

Invariance Principle for Variable Speed Random Walks on Trees

Measurable Choice Functions

2. Dual space is essential for the concept of gradient which, in turn, leads to the variational analysis of Lagrange multipliers.

2.2 Annihilators, complemented subspaces

REVIEW OF ESSENTIAL MATH 346 TOPICS

Real Analysis Notes. Thomas Goller

Simulation methods for stochastic models in chemistry

Filters in Analysis and Topology

Weakly interacting particle systems on graphs: from dense to sparse

Advanced Calculus Math 127B, Winter 2005 Solutions: Final. nx2 1 + n 2 x, g n(x) = n2 x

MAT 578 FUNCTIONAL ANALYSIS EXERCISES

Section 9.2 introduces the description of Markov processes in terms of their transition probabilities and proves the existence of such processes.

MAS3706 Topology. Revision Lectures, May I do not answer enquiries as to what material will be in the exam.

4 Countability axioms

Generators of Markov Processes

Reflected Brownian Motion

Convergence of Markov Processes. Amanda Turner University of Cambridge

Lecture 4 Lebesgue spaces and inequalities

Semicontinuous functions and convexity

Economics 574 Appendix to 13 Ways

FOUNDATIONS OF MARTINGALE THEORY AND STOCHASTIC CALCULUS FROM A FINANCE PERSPECTIVE

Math 328 Course Notes

Math 118B Solutions. Charles Martin. March 6, d i (x i, y i ) + d i (y i, z i ) = d(x, y) + d(y, z). i=1

THEOREMS, ETC., FOR MATH 515

MATH 102 INTRODUCTION TO MATHEMATICAL ANALYSIS. 1. Some Fundamentals

Homework If the inverse T 1 of a closed linear operator exists, show that T 1 is a closed linear operator.

Recall that if X is a compact metric space, C(X), the space of continuous (real-valued) functions on X, is a Banach space with the norm

Lecture Notes in Advanced Calculus 1 (80315) Raz Kupferman Institute of Mathematics The Hebrew University

Functional Analysis Exercise Class

Stable Lévy motion with values in the Skorokhod space: construction and approximation

LARGE DEVIATIONS FOR STOCHASTIC PROCESSES

On Stopping Times and Impulse Control with Constraint

UNBOUNDED OPERATORS ON HILBERT SPACES. Let X and Y be normed linear spaces, and suppose A : X Y is a linear map.

Some results on primeness in the near-ring of Lipschitz functions on a normed vector space

Chapter 1. Introduction

Martingale Problems. Abhay G. Bhatt Theoretical Statistics and Mathematics Unit Indian Statistical Institute, Delhi

A.Piunovskiy. University of Liverpool Fluid Approximation to Controlled Markov. Chains with Local Transitions. A.Piunovskiy.

7 Complete metric spaces and function spaces

Functional Analysis. Thierry De Pauw

REAL AND COMPLEX ANALYSIS

3 Measurable Functions

Lecture Notes on Metric Spaces

Chapter 4. The dominated convergence theorem and applications

Fonctions on bounded variations in Hilbert spaces

Existence, Uniqueness and Stability of Invariant Distributions in Continuous-Time Stochastic Models

Analysis Preliminary Exam Workshop: Hilbert Spaces

Markov processes and queueing networks

Functional Analysis. Franck Sueur Metric spaces Definitions Completeness Compactness Separability...

Stochastic integration. P.J.C. Spreij

Metric spaces and metrizability

Notions such as convergent sequence and Cauchy sequence make sense for any metric space. Convergent Sequences are Cauchy

Math 5052 Measure Theory and Functional Analysis II Homework Assignment 7

Alternate Characterizations of Markov Processes

Stochastic2010 Page 1

CHAPTER V DUAL SPACES

Transcription:

Chapter 15 Convergence of Feller Processes This chapter looks at the convergence of sequences of Feller processes to a iting process. Section 15.1 lays some ground work concerning weak convergence of processes with cadlag sample paths. Section 15.2 states and proves the central theorem about the convergence of sequences of Feller processes. Section 15.3 examines a particularly important special case, the approximation of ordinary differential equations by pure-jump Markov processes. 15.1 Weak Convergence of Processes with Cadlag Paths (The Skorokhod Topology) Recall that a sequence of random variables X 1, X 2,... converges in distribution d on X, or weakly converges on X, X n X, if and only if E [f(xn )] E [f(x)], for all bounded, continuous functions f. This is still true when X n are random functions, i.e., stochastic processes, only now the relevant functions f are functionals of the sample paths. Definition 161 (Convergence in Finite-Dimensional Distribution) Random fd processes X n on T converge in finite-dimensional distribution on X, X n X, when, J Fin(T ), X n (J) d X(J). Proposition 162 Convergence in finite-dimensional distribution is necessary but not sufficient for convergence in distribution. 80

CHAPTER 15. CONVERGENCE OF FELLER PROCESSES 81 Proof: Necessity is obvious: the coordinate projections π t are continuous functionals of the sample path, so they must converge if the distributions converge. Insufficiency stems from the problem that, even if a sequence of X n all have sample paths in some set U, the iting process might not: recall our example (78) of the version of the Wiener process with unmeasurable suprema. Definition 163 (The Space D) By D(T, Ξ) we denote the space of all cadlag functions from T to Ξ. By default, D will mean D(R +, Ξ). D admits of multiple topologies. For most purposes, the most convenient one is the Skorokhod topology, a.k.a. the J 1 topology or the Skorokhod J 1 topology, which makes D(Ξ) a complete separable metric space when Ξ is itself complete and separable. (See Appendix A2 of Kallenberg.) For our purposes, we need only the following notion and theorem. Definition 164 (Modified Modulus of Continuity) The modified modulus of continuity of a function x D(T, Ξ) at time t T and scale h > 0 is given by w(x, t, h) inf max sup ρ(x(s), x(r)) (15.1) (I k ) k r,s I k where the infimum is over partitions of [0, t) into half-open intervals whose length is at least h (except possibly for the last one). Because x is cadlag, for fixed x and t, w(x, t, h) 0 as h 0. Theorem 165 (Weak Convergence in D(R +, Ξ)) Let Ξ be a complete, separable metric space. Then a sequence of random functions X 1, X 2,... D(R +, Ξ) converges in distribution to X D if and only if i The set T c = {t T : X(t) = X(t )} has a countable dense subset T 0, and the finite-dimensional distributions of the X n converge on those of X on T 0. ii For every t, h 0 sup E [w(x n, t, h) 1] = 0 (15.2) Proof: See Kallenberg, Theorem 16.10, pp. 313 314. Theorem 166 (Sufficient Condition for Weak Convergence) The following three conditions are all equivalent, and all imply condition (ii) in Theorem 165. 1. For any sequence of a.s.-finite F Xn -optional times τ n and positive constants h n 0, ρ(x n (τ n ), X n (τ n + h n )) P 0 (15.3)

CHAPTER 15. CONVERGENCE OF FELLER PROCESSES 82 2. For all t > 0, for all h 0 sup sup E [ρ(x n (σ), X n (τ)) 1] = 0 (15.4) σ,τ where σ and τ are F Xn -optional times σ, τ t, with σ τ τ + h. 3. For all t > 0, δ 0 sup sup sup τ t 0 h δ E [ρ(x n (τ), X n (τ + h)) 1] = 0 (15.5) where the supremum in τ runs over all F Xn -optional times t. Proof: See Kallenberg, Theorem 16.11, pp. 314 315. 15.2 Convergence of Feller Processes We need some technical notions about generators. Definition 167 (Closed and Closable Generators, Closures) A linear operator O on a Banach space B is closed if its graph { f, g B 2 : f Dom(O), g = Of } is a closed set. An operator is closable if the closure of its graph is a function (and not just a relation). The closure of a closable operator is that function. Notice, by the way, that because O is linear, it is closable iff f n 0 and Af n g implies g = 0. Definition 168 (Core of an Operator) Let O be a closed linear operator on a Banach space B. A linear subspace D Dom(O) is a core of O if the closure of O restricted to D is, again O. The idea of a core is that we can get away with knowing how the operator works on a linear subspace, which is often much easier to deal with, rather than controlling how it acts on its whole domain. Proposition 169 The generator of every Feller semigroup is closed. Proof: We need to show that the graph of G contains all of its it points, that is, if f n Dom(G) converges (in L ) on f, and Gf n g, then f Dom(G) and Gf = g. First we show that f Dom(G). (I G)f n = f n Gf n (15.6) n n = f g (15.7)

CHAPTER 15. CONVERGENCE OF FELLER PROCESSES 83 But (I G) 1 = R 1. Since this is a bounded linear operator, we can exchange applying the inverse and taking the it, i.e., R 1 n (I G)f n = R 1 (f g) (15.8) n R 1 (I G)f n = R 1 (f g) (15.9) n f n = R 1 (f g) (15.10) f = R 1 (f g) (15.11) Since the range of the resolvents is contained in the domain of the generator, f Dom(G). We can therefore say that f g = (I G)f, which implies that Gf = g. Hence, the graph of G contains all its it points, and G is closed. Theorem 170 Let X n be a sequence of Feller processes with semigroups K n,t and generators G n, and X be another Feller process with semigroup K t and a generator G containing a core D. Then the following are equivalent. 1. If f D, there exists a sequence of f n Dom(G n ) such that f n f 0 and A n f n Af 0. 2. K n,t K t for every t > 0 3. K n,t f K t f 0 uniformly over f C 0 for bounded positive t 4. If X n (0) d X(0) in Ξ, then X n d X in D. Proof: See Kallenberg, Theorem 19.25, p. 385. Remark. The important versions of the property above are the second convergence of the semigroups and the fourth converge in distribution of the processes. The other two are there to simplify the proof. 15.3 Approximation of Ordinary Differential Equations by Markov Processes The following result, due to Following Kurtz (1970, 1971), is essentially an application of Theorem 170. First, recall that continuous-time, discrete-state Markov processes work essentially like a combination of a Poisson process (giving the time of transitions) with a Markov chain (giving the state moved to on transitions). This can be generalized to continuous-time, continuous-state processes, of what are called pure jump type. Definition 171 (Pure Jump Markov Process) A continuous-parameter Markov process is a pure jump process when its sample paths are piece-wise constant. For each state, there is an exponential distribution of times spent in that state, whose parameter is denoted λ(x), and a transition probability kernel or exit distribution µ(x, B).

CHAPTER 15. CONVERGENCE OF FELLER PROCESSES 84 Observe that pure-jump Markov processes always have cadlag sample paths. Also observe that the average amount of time the process spends in state x, once it jumps there, is 1/λ(x). So the time-average velocity, i.e., rate of change, starting from x, λ(x) (y x)µ(x, dy) Ξ Theorem 172 Let X n be a sequence of pure-jump Markov processes with state spaces Ξ n, holding time parameters λ n and transition probabilities µ n. Suppose that, for all n Ξ n is a Borel-measurable subset of R k for some k. Let Ξ be another measurable subset of R k, on which there exists a function F (x) such that F (x) F (y) M x y for some constant M. Suppose all of the following conditions holds. 1. The time-averaged rate of change is always finite: sup sup λ n (x) y x µ n (x, dy) < (15.12) n x Ξ n Ξ Ξ n 2. There exists a positive sequence ɛ n 0 such that sup λ n (x) y x µ n (x, dy) = 0 (15.13) x Ξ n Ξ y x >ɛ 3. The worst-case difference between F (x) and the time-averaged rates of change goes to zero: sup F (x) λ n(x) (y x)µ n (x, dy) = 0 (15.14) x Ξ n Ξ Let X(s, x 0 ) be the solution to the initial-value problem where the differential is given by F, i.e., for each 0 s t, s X(s, x 0) = F (X(s, x 0 )) (15.15) X(0, x 0 ) = x 0 (15.16) and suppose there exists an η > 0 such that, for all n, { } Ξ n y R k : inf y X(s, x 0) η 0 s t Then X n (0) = x 0 implies that, for every δ > 0, ( P sup X n (s) X(s, x 0 ) > δ 0 s t ) Ξ (15.17) = 0 (15.18)

CHAPTER 15. CONVERGENCE OF FELLER PROCESSES 85 The first conditions on the X n basically make sure that they are Feller processes. The subsequent ones make sure that the mean time-averaged rate of change of the jump processes converges on the instantaneous rate of change of the differential equation, and that, if we re sufficiently close to the solution of the differential equation in R k, we re not in some weird way outside the relevant domains of definition. Even though Theorem 170 is about weak convergence, converging in distribution on a non-random object is the same as converging in probability, which is how we get uniform-in-time convergence in probability for a conclusion. There are, broadly speaking, two kinds of uses for this result. One kind is practical, and has to do with justifying convenient approximations. If n is large, we can get away with using an ODE instead of the noisy stochastic scheme, or alternately we can use stochastic simulation to approximate the solutions of ugly ODEs. The other kind is theoretical, about showing that the large-population it behaves deterministically, even when the individual behavior is stochastic and strongly dependent over time.