On forward improvement iteration for stopping problems

Similar documents
Solutions to HW Assignment 1

1 Convergence in Probability and the Weak Law of Large Numbers

Lecture 19: Convergence

Infinite Sequences and Series

lim za n n = z lim a n n.

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 19 11/17/2008 LAWS OF LARGE NUMBERS II THE STRONG LAW OF LARGE NUMBERS

Random Models. Tusheng Zhang. February 14, 2013

Seunghee Ye Ma 8: Week 5 Oct 28

7.1 Convergence of sequences of random variables

Application to Random Graphs

Convergence of random variables. (telegram style notes) P.J.C. Spreij

MAS111 Convergence and Continuity

J. Stat. Appl. Pro. Lett. 2, No. 1, (2015) 15

Lecture 7: Properties of Random Samples

62. Power series Definition 16. (Power series) Given a sequence {c n }, the series. c n x n = c 0 + c 1 x + c 2 x 2 + c 3 x 3 +

Markov Decision Processes

7.1 Convergence of sequences of random variables

Riesz-Fischer Sequences and Lower Frame Bounds

Chapter 0. Review of set theory. 0.1 Sets

Product measures, Tonelli s and Fubini s theorems For use in MAT3400/4400, autumn 2014 Nadia S. Larsen. Version of 13 October 2014.

M17 MAT25-21 HOMEWORK 5 SOLUTIONS

Introduction to Probability. Ariel Yadin. Lecture 2

An Introduction to Randomized Algorithms

Singular Continuous Measures by Michael Pejic 5/14/10

6a Time change b Quadratic variation c Planar Brownian motion d Conformal local martingales e Hints to exercises...

6.3 Testing Series With Positive Terms

ACO Comprehensive Exam 9 October 2007 Student code A. 1. Graph Theory

Sequences, Mathematical Induction, and Recursion. CSE 2353 Discrete Computational Structures Spring 2018

Lecture 3 : Random variables and their distributions

Generalized Semi- Markov Processes (GSMP)

Definition 4.2. (a) A sequence {x n } in a Banach space X is a basis for X if. unique scalars a n (x) such that x = n. a n (x) x n. (4.

Introduction to Extreme Value Theory Laurens de Haan, ISM Japan, Erasmus University Rotterdam, NL University of Lisbon, PT

Chapter 2 The Monte Carlo Method

Distribution of Random Samples & Limit theorems

Two dependent random walks

Math 155 (Lecture 3)

Mathematical Induction

It is always the case that unions, intersections, complements, and set differences are preserved by the inverse image of a function.

The picture in figure 1.1 helps us to see that the area represents the distance traveled. Figure 1: Area represents distance travelled

Properties of Fuzzy Length on Fuzzy Set

Determinants of order 2 and 3 were defined in Chapter 2 by the formulae (5.1)

5.1. The Rayleigh s quotient. Definition 49. Let A = A be a self-adjoint matrix. quotient is the function. R(x) = x,ax, for x = 0.

Notes 19 : Martingale CLT

Different kinds of Mathematical Induction

BETWEEN QUASICONVEX AND CONVEX SET-VALUED MAPPINGS. 1. Introduction. Throughout the paper we denote by X a linear space and by Y a topological linear

Math 525: Lecture 5. January 18, 2018

Recurrence Relations

The Boolean Ring of Intervals

Axioms of Measure Theory

Hoggatt and King [lo] defined a complete sequence of natural numbers

Gamma Distribution and Gamma Approximation

Chapter 6 Infinite Series

Solution. 1 Solutions of Homework 1. Sangchul Lee. October 27, Problem 1.1

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 3 9/11/2013. Large deviations Theory. Cramér s Theorem

1+x 1 + α+x. x = 2(α x2 ) 1+x

Basics of Probability Theory (for Theory of Computation courses)

Chapter 1. Complex Numbers. Dr. Pulak Sahoo

NEW FAST CONVERGENT SEQUENCES OF EULER-MASCHERONI TYPE

Advanced Stochastic Processes.

Common Coupled Fixed Point of Mappings Satisfying Rational Inequalities in Ordered Complex Valued Generalized Metric Spaces

Sequences and Series of Functions

Benaissa Bernoussi Université Abdelmalek Essaadi, ENSAT de Tanger, B.P. 416, Tanger, Morocco

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 21 11/27/2013

3. Z Transform. Recall that the Fourier transform (FT) of a DT signal xn [ ] is ( ) [ ] = In order for the FT to exist in the finite magnitude sense,

Math 299 Supplement: Real Analysis Nov 2013

MATH 112: HOMEWORK 6 SOLUTIONS. Problem 1: Rudin, Chapter 3, Problem s k < s k < 2 + s k+1

2.1. The Algebraic and Order Properties of R Definition. A binary operation on a set F is a function B : F F! F.

Estimation of Backward Perturbation Bounds For Linear Least Squares Problem

Self-normalized deviation inequalities with application to t-statistic

Relations Among Algebras

Optimal Two-Choice Stopping on an Exponential Sequence

THE STRONG LAW OF LARGE NUMBERS FOR STATIONARY SEQUENCES

Stochastic Matrices in a Finite Field

APPROXIMATION BY BERNSTEIN-CHLODOWSKY POLYNOMIALS

Notes on Snell Envelops and Examples

Exponential Functions and Taylor Series

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 2 9/9/2013. Large Deviations for i.i.d. Random Variables

An elementary proof that almost all real numbers are normal

Chapter 3. Strong convergence. 3.1 Definition of almost sure convergence

STAT Homework 1 - Solutions

SOME TRIGONOMETRIC IDENTITIES RELATED TO POWERS OF COSINE AND SINE FUNCTIONS

(A sequence also can be thought of as the list of function values attained for a function f :ℵ X, where f (n) = x n for n 1.) x 1 x N +k x N +4 x 3

Introduction to Probability. Ariel Yadin. Lecture 7

If a subset E of R contains no open interval, is it of zero measure? For instance, is the set of irrationals in [0, 1] is of measure zero?

Signal Processing. Lecture 02: Discrete Time Signals and Systems. Ahmet Taha Koru, Ph. D. Yildiz Technical University.

Here are some examples of algebras: F α = A(G). Also, if A, B A(G) then A, B F α. F α = A(G). In other words, A(G)

Basis for simulation techniques

MAT1026 Calculus II Basic Convergence Tests for Series

Stochastic Simulation

Optimally Sparse SVMs

A Hilbert Space Central Limit Theorem for Geometrically Ergodic Markov Chains

Math 61CM - Solutions to homework 3

A Note on Integrals & Hybrid Contours in the Complex Plane

CSI 2101 Discrete Structures Winter Homework Assignment #4 (100 points, weight 5%) Due: Thursday, April 5, at 1:00pm (in lecture)

A Note on the Symmetric Powers of the Standard Representation of S n

OPTIMAL STOPPING AND EXIT TIMES FOR SOME CLASSES OF RANDOM PROCESSES. Vladyslav Tomashyk

MATH 312 Midterm I(Spring 2015)

Chapter IV Integration Theory

arxiv: v1 [cs.sc] 2 Jan 2018

Numerical Method for Blasius Equation on an infinite Interval

Transcription:

O forward improvemet iteratio for stoppig problems Mathematical Istitute, Uiversity of Kiel, Ludewig-Mey-Str. 4, D-24098 Kiel, Germay irle@math.ui-iel.de Albrecht Irle Abstract. We cosider the optimal stoppig problem for g(z ), where Z, 1, 2... is a homogeeous Marov sequece. A algorithm, called forward improvemet iteratio, is ivestigated by which a optimal stoppig time ca be computed. Usig a iterative step, this algorithm computes a sequece B 0 B 1 B 2... of subsets of the state space such that the first etrace time ito the itersectio F of these sets is a optimal stoppig time. Keywords. Optimal stoppig, Marov sequece, forward improvemet iteratio. 1 Itroductio We recall the geeral situatio for a problem of optimal stoppig. Startig with a probability space ad a time set T [0, ], we have a filtratio (A t ) t T ad a adapted real-valued stochastic process (X t ) t T. A stoppig rule is a mappig τ : Ω T satisfyig {τ t} A t for all t T, ad we let T deote the set of all stoppig rules. We assume that EX τ exists (possibly ifiite) for all τ T. The aim is to fid a stoppig rule τ satisfyig EX τ sup EX τ τ T ad to compute this supremum, called the value of the stoppig problem. There is a wealth of literature o the theory of optimal stoppig ad its applicatios, see e.g. the recet moograph by Pesir ad Shiryaev (2006), curret research ofte triggered by the applicability i optio pricig problems of mathematical fiace. Differig from the well-ow bacwards iductio approach, the followig algorithm called forward improvemet iteratio (FII) was itroduced i Irle (1980) as a geeral theoretical tool ad was the used for fidig explicit solutios i best choice problems. For FII we let T {0, 1, 2,..., m} for m thus icludig the ifiite time case m. For ay sequece C (C ) T of C A with C m Ω we defie the stoppig rule For C we defie C by C m Ω ad τ (C)(ω) if { : ω C }. C {E(X τ+1 (C) A ) X } C, < m. FII proceeds i the followig way. Let C 0 (Ω) ad by iductio C (C 1 ). Defie D by D C. It is show i Irle (1980) that τ 0 (D) is a optimal stoppig rule uder the coditio, trivially true for m <, that EX lim σ lim EX σ for ay icreasig sequece (σ ) of stoppig rules. This approach was adapted i Irle (2006) to Marovia stoppig problems. Differet to the geeral case where FII wors i the set of sample paths, the adapted algorithm ow wors i the space set of a Marov chai which allows for efficiet umerical calculatios. We cosider T {0, 1, 2,..., } ad a homogeeous Marov process (Z ) < with respect to the uderlyig filtratio. The measurable state space is deoted by (S, S). Let g : S R be measurable. We loo at the optimal stoppig problem for X g(z ), <, X lim sup X,

2 Irle formally writig g(z ) for X i various expressios. We use P z, E z for P ( Z 0 z), E( Z 0 z) ad assume that E z g(z τ ) exists for all stoppig rules τ ad all z S. We are looig for a stoppig rule τ such that for all z S E z g(z τ ) sup E z g(z τ ) V (z), say, τ V referred to as value fuctio. FII wors i the followig way. For a measurable B S set τ (B) if {j : Z j B} ad defie B {z : g(z) E z g(z τ1 (B)} B. Let B 0 S ad by iductio B (B 1 ), furthermore F B. Uder the coditio E z g(z lim σ ) lim E z g(z σ ) for all z ad all icreasig sequeces (σ ) of stoppig rules, τ 0 (F ) is a optimal stoppig rule. The above coditio may be omitted i the case of a fiite state space S ad the the algorithm termiates after at most S steps. I the case of discoutig with 0 < α 1 we cosider the stoppig problem for X α g(z ). For α 1 this reduces to X g(z ), but o the other had, the discouted case may also be viewed as a o-discouted stoppig problem for the space-time chai (Z, ). The algorithmic step of goig from B to B is provided by B {z : g(z) E z α τ1(b) g(z τ1 (B))} B. We refer to Irle (2006) where also various examples with discoutig were treated. I these examples, the algorithmic step of goig from B to B was performed by providig umerical values for the quatities E z α τ 1(B) g(z τ1 (B)) by path wise simulatios of the Marov chai. As show theoretically ad demostrated by actual computatios, FII fids, for fiite state space S, the value fuctio ad the optimal stoppig time i a fiite umber of iteratios. I the examples of Irle (2006), this umber was surprisigly low. The purpose of this ote is twofold. Firstly we shall show that the algorithmic step may be described i terms of a liear equatio which allows the use of fast methods of umerical liear algebra ad the hadlig of very large state spaces. Secodly we shall show how the algorithm may be adapted to cotiuous time Marov chais. 2 The algorithmic step as a liear equatio We cosider a discouted Marovia stoppig problem as described i the itroductio with a fiite state space S, g : S R, 0 < α 1. We assume g 0 for α < 1 to rule out that ifiite observatio is optimal due to discoutig. Let p zy P (Z 1 y Z 0 z), y, z S. For shorter otatio we write h i (B)(z) E z α τ i(b) g(z τi (B)), z S, i 0, 1. Propositio 1 Let B S with P z (τ 0 (B) < ) 1 for all z S. The h 0 (B) is the uique solutio of h(z) g(z), z B, h(z) α y p zy h(y), z S \ B.

Forward improvemet iteratio 3 Proof. Obviously h 0 (B) fulfills h 0 (B)(z) g (z), z B, by defiitio. Furthermore h 0 (B)(z) p zy α h 0 (B)(y), z S \ B, as a well-ow cosequece of the Marov property which is ofte y deoted as the α-harmoicity of h 0 (B) o S \ B. So h 0 (B) provides a solutio. Usig the coditio P z (τ 0 (B) < ) 1, it is immediate from the α-harmoicity that the maximum of h 0 (B) is attaied at some poit i B, this fact ofte beig referred to as discrete maximum priciple. Uiqueess is a immediate cosequece of this priciple. Cosider two solutios h, h ad set f. The f ad f provide solutios correspodig to g(z) 0, z B, ad by the maximum priciple f 0, f 0, hece h h. Corollary 1 Let B 0 S, B (B 1 ), 1, 2,.... The for all, h 0 (B ) is the uique solutio of h(z) g(z), z B, h(z) α y p zy h(y), z S \ B. Furthermore h 1 (B )(z) h 0 (B )(z), z S \ B, h 1 (B )(z) α y p zy h 0 (B )(y), z B. Proof. The relatio betwee h 0 (B ) ad h 1 (B ) is obvious. Hece it is eough to show that P z (τ 0 (B ) < ) 1 for all z. For this defie F {z : g(z) V (z)}, where V is the value fuctio of the discouted stoppig problem. It is well-ow that τ 0 (F ) is a optimal stoppig time ad P z (τ 0 (F ) < ) 1 for all z, see Shiryayev (1978), Chapter 2, Theorem 4. By iductio, B F for all, hece P z (τ 0 (B ) < ) 1 for all z. This corollary shows that the iterative step may be performed by solvig h(z) g(z), z B,h(z) α p zy h(y), z S, ad the usig h(z) h(z), z S \ B, h(z) α p zy h(y), z B y y for the compariso with g(z). Usig appropriate pacages of umerical liear algebra this may be doe for large state spaces. 3 Cotiuous time Marov chais ad radom discoutig We ow treat a cotiuous time Marov chai (Ẑt) t [0, ) with a fiite state space S ad loo at the discouted optimal stoppig problem for X t α t g(ẑt), t [0, ), X lim sup X t, assumig g 0 for α < 1. It is well-ow that a optimal stoppig time exists ad is of the form τ if {t [0, ) : Ẑt B} for some B S, see Shiryayev (1978). So we oly have to loo at first etrace times i this stoppig problem. Deote the jump times of the chai by T 0 0 < T 1 < T 2 <... ad the embedded chai by Z ẐT with trasitio probabilities p zy, z, y S. Give Z 0 z 0,..., Z z, the waitig times T i T i 1, i 1,...,, are idepedet expoetially distributed with parameters λ(z 1 ),..., λ(z ). The trasitio probabilities p zy ad the parameters are computed from the Q-matrix of the chai.

4 Irle Now loo at ay a.s. fiite first etrace time τ if {t : Ẑt B}. The for D (B c ) 1 B E X τ ( )) E 1 {τt} α T g (ẐT ( ) E 1 {(Z0,...,Z ) D } α T T 1 g (Z ) ( ( )) E 1 {(Z0,...,Z ) D } g(z )E α T T 1 Z 1,..., Z ( E 1 {(Z0,...,Z ) D } g(z ) 1 ) α(z i ) ( E g(z τ ) 1 ) α(z i ), where τ if{ : Z B} ad α(z) λ(z) λ(z) log α is the momet geeratig fuctio of a expoetial distributio with parameter λ(z). So the cotiuous time problem is equivalet to a discrete time problem with radom discoutig where X g(z ) 1 α(z i ). The algorithmic step is doe i the followig way. For B S set B {z : g(z) E z (g(z τ1 (B)) τ 1 (B) 1 α(z i ))}. Agai let B 0 S, B (B 1 ) ad F B. Uder the coditio E z X lim σ lim E z X σ for all z ad all icreasig sequeces (σ ) of stoppig rules, τ 0 (F ) agai is a optimal stoppig time. We give a proof of this result i the Appedix. Writig as before h i (B)(z) E z (g(z τi (B)) it follows as i Sectio 2 that h 0 (B) is the uique solutio of τ i (B) 1 α(z i )) h(z) g(z), z B, h(z) α(z) y p zy (y), z S \ B. Furthermore h 1 (B)(z) h 0 (B)(z), z S \ B, h 1 (B)(z) α(z) y p zy h 0 (B)(y), z B. So we may use agai umerical liear algebra for FII. We ow state ad prove the validity of FII i the case of radom discoutig. Theorem 1 Cosider a Marovia stoppig problem with radom discoutig where g : S R, α : S [0, 1] are measurable fuctios. Let 1 X g(z ) α(z i ), 0, 1,..., X lim sup X. Assume that E z X τ exists for all stoppig rules τ, z S ad E z lim X σ lim E z X σ for all z S ad all icreasig sequeces (σ ) of stoppig rules. Defie B, 0, 1, 2,... ad F B as i Sectio 3. The for all z S (i) E z X τ(b 0 ) E z X τ(b 1 )... E z X τ(f ), (ii) E z X τ(f ) V (z), i.e. τ(f ) is optimal.

Forward improvemet iteratio 5 Proof. The proof uses the basic ideas of the proof of Theorem 1 i Irle (1980) adapted to the Marovia settig with discoutig. (a) Let B S, B as above. For σ T (B) set σ if{ σ : Z B } with T (B) {τ : Z τ B o {τ < }. We shall show E z X σ E z X σ from which (i) immediately follows. Let σ < τ +1 (B) 1 {σ,z / B } + σ1 {Zσ B } + 1 {σ }. Obviously σ T (B). For ay <, with a idepedet copy (Z ) of (Z ) {σ,z / B } X σ dp z This shows E z X bσ E z X σ. Next let {σ,z / B } {σ,z / B } {σ,z / B } {σ,z / B } {σ,z / B } g(z ) 1 [ E Z g E ( g(z τ+1 (B) (Z τ 1(B) ) τ 1 (B) 1 ) τ +1(B) 1 i τ +1 (B) 1 g(z τ+1 (B)) X τ+1 (B)dP z. σ 0 σ ad by iductio σ σ 1, α(z i ) ] 1 α(z i ) A ) 1 so that E z X σ E z X σ E z X σ+1... for all. We have σ σ 1 σ 2 σ. Assume that sup σ (ω) < σ (ω) for some ω. The σ (ω) σ +1 (ω) <, for some, hece Z σ (ω) B ad σ +1 (ω) σ (ω). This cotradictio implies σ lim σ, so that by assumptio E z X σ lim E z X σ E z X σ. (b) We shall ow show that for ay stoppig time σ there exists τ T (F ) such that E z X τ E z X σ. Defie σ if{ σ : Z B }, τ if{ σ : Z F }. The σ 0 σ σ 1 σ 2 τ, σ +1 (σ ). It follows from (a) that E z X σ E z X σ E z X σ +1 for all. Sice F B, we obtai lim σ τ, hece by assumptio E z X σ lim E z X σ E z X τ. (c) Let ρ, τ T (F ) such that ρ τ. We shall show E z X ρ E z X τ. Let ρ ρ1 {ρτ} + τ +1 (B )1 {ρ<τ}, ρ ρ1 {ρτ} + τ +1 (F )1 {ρ<τ}. < <

6 Irle The ρ ρ ρ +1 ρ τ, ρ lim ρ. Sice ρ T (F ) we have for ay implyig τ 1 (B ) 1 {ρ < τ} {g(z ) E Z g(z τ 1 (B ) ) α(z i)}, {ρ<τ} X ρ dp z {ρ<z} {ρ<τ} {ρ<τ} g(z ) 1 α(z i ) dp z [E Z g(z τ τ 1 (B ) ) 1 (B ) 1 α(z i 1 )] τ +1 (B ) 1 g(z τ+1 (B )) α(z i ) dp z α(z i ) dp z {ρ<τ} ρ dp z, hece E z X ρ E z X ρ. Lettig we obtai E z X ρ E z X ρ. Defie ρ 0 ρ, ad ρ (ρ 1 ) for 1. The obviously ρ ρ ρ +1, ρ T (F ), ad lim ρ τ. It follows E z X ρ lim E z X ρ E z X τ. (d) Let σ be ay stoppig time. The by (b) there exists τ T (F ) such that E z X σ E z X τ. By defiitio τ 0 (F ) τ, hece by (c) E z X τ0 (F ) E z X τ E z X σ. Refereces Irle, A. (1980). O the Best Choice Problem with Radom Populatio Size. Zeitschrift f. Operatios Research, 24, 177 190. Irle, A. (2006). A Forward Algorithm for Solvig Optimal Stoppig Problems. J. Appl. Probab., 43, 102 113. Pesir, G. ad Shiryaev, A.N. (2006). Optimal Stoppig ad Free-Boudary Problems. Birhaeuser, Basel. Shiryaev, A.N. (1978). Optimal Stoppig Rules. Spriger, New Yor.