Approximating Sepp s constants for te Slepian process Jack Noonan a, Anatoly Zigljavsky a, a Scool of Matematics, Cardiff University, Cardiff, CF4 4AG, UK arxiv:8.0v [mat.pr] 8 Dec 08 Abstract Slepian process S(t) is a stationary Gaussian process wit zero mean and covariance ES(t)S(t ) = max{0, t t }. For any T > 0 and > 0, define F T () = Pr { max t [0,T ] S(t) < } and te constants Λ() = lim T T log F T () and λ() = exp{ Λ()}; we will call tem Sepp s constants. Te aim of te paper is construction of accurate approximations for F T () and ence for te Sepp s constants. We demonstrate tat at least some of te approximations are extremely accurate. Keywords:. Introduction Slepian process, extreme value teory, boundary crossing probability Let S(t), t [0, T ], be a Gaussian process wit mean 0 and covariance ES(t)S(t ) = max{0, t t }. Tis process is often called Slepian process. For any > 0 and x <, define { F T ( x) := Pr max S(t) < } S(0) = x ; (.) t [0,T ] if x we set F T ( x) = 0. Assuming tat x as Gaussian distribution N(0, ), and ence te stationarity of te process S(t), we average F T ( x) and tus define F T () := F T ( x)ϕ(x)dx, were ϕ(x) = (π) / exp{ x /}. Key results on te boundary crossing probabilities for te Slepian process ave been establised by L.Sepp in []. In particular, L.Sepp as derived an explicit formula for F n () wit n integer, see (.) below. As tis explicit formula is quite complicated, in (3.7) in te same paper, L.Sepp as conjectured te existence of te following constant (depending on ) Λ() = lim n n log F n() (.) and raised te question of constructing accurate approximations and bounds for tis constant. Te importance of tis constant is related to te asymptotic relation F T () const[λ()] T as T, (.3) were λ() = exp{ Λ()}. We will call Λ() and λ() Sepp s constants. In tis paper, we are interested in deriving approximations for F T () in te form (.3) and ence for te Sepp s Corresponding autor Email addresses: NoonanJ@cardiff.ac.uk (Jack Noonan), ZigljavskyAA@cardiff.ac.uk (Anatoly Zigljavsky) Preprint submitted to Elsevier December 3, 08
constants. In formulation of approximations, we offer approximations for F T () for all T > and ence approximations for Λ() and λ(). One can apply general results sown in [, 3] and formula (..3) in [4] to approximate λ() but tese results only sow tat λ() as and terefore are of no use ere. On te oter and, te following approximation, also derived from general principles, is simple but more useful. Approximation 0. Poisson Clumping Heuristic, see [5]: F T () exp( ϕ()t ), Λ (0) () = ϕ(), λ (0) () = e ϕ(). For large, tis approximation is quite good, see Tables and below. In Section we derive several new approximations for F T () and λ(). In Section 3 we provide numerical results sowing tat at least some of te approximation derived in Section are extremely accurate. Section 4 contains some minor tecnical details and Section 5 delivers conclusions.. Derivation of approximations for F T () and λ().. Sepp s explicit formula for F n ( x)... Sepp s formula Te following formula is te result (.5) in []: F n ( x) = det ϕ(y i y j+ + ) n i,j=0 dy... dy n+, (.) ϕ(x) D x were T = n is a positive integer, D x = {y,..., y T + x < y < y 3 <... < y n+ }, y 0 = 0, y = x. L.Sepp in [] as also derived explicit formulas for F T ( x) wit non-integral T > 0 but tese formulas are more complicated and are realistically applicable only for small enoug T (say, T 3). From (.) we straigtforwardly obtain F ( ϕ() x) = Φ() Φ(x), (.) ϕ(x) F () = F ( x)ϕ(x)dx = Φ () ϕ() [ Φ() + ϕ() ], were Φ(x) = x ϕ(t)dt. Derivation of explicit formulas for F T ( x) and FT () wit T is relatively easy as te process S(t) is conditionally Markovian in te interval [0, ], see [6]. Formula (.) as been first derived in [7]. In wat follows, F () and F ( 0) (in addition to F () and F ( 0)) play important roles. Te expressions for bot, F () and F ( 0), are more complicated tan expressions for F () and F ( 0). Neverteless, tese expressions can be reduced to a one-dimensional integrals and furter approximated as sown in Appendix; see Section 4.... An alternative representation of te Sepp s formula (.) Let T = n be a positive integer, y 0 = 0, y = x. For i = 0,,..., n we set s i = + y i y i+ wit s 0 = x. It follows from Sepp s proof of (.) tat s 0, s,..., s n ave te meaning of te values of te process S(t) at te times t = 0,,..., n: S(i) = s i (i = 0,,..., n). Te range of te variables s i is (, ). Te variables y,..., y n+ are expressed via s 0,..., s n by y k = k s 0 s... s k (k =,..., n+) wit y 0 = 0. Canging te variables, we obtain te following equivalent expression for te probability (.): F n ( x) =... det ϕ(s i + a i,j ) n i,j=0 ds... ds n, (.3) ϕ(x)
were 0 for i = j a i,j = y i+ y j+ = (i j) s j+... s i+ for i > j (i j) + s i+ +... + s j for i < j. Expression (.3) for te probability F n ( x) implies tat te function p(s 0, s,... s n ) = ϕ(s 0 )F n ( s0 ) det ϕ(s i + a i,j ) n i,j=0. (.4) is te joint probability density function for te values S(0), S(),..., S(n) under te condition S(t) < for all t [0, n]. Since s n is te value of S(n), te formula (.4) also sows te transition density from s 0 = x to s n conditionally S(t) < for all t [0, n]: p (n) (x s n) =... det ϕ(s i + a i,j ) n i,j=0 ds... ds n. (.5) ϕ(x) For tis transition density, p(n) (x z)dz = F n( x)... Approximating λ() troug eigenvalues of integral operators... One-step transition In te case n = we obtain from (.5): ( ) p () ϕ(x) ϕ(x +z) (x z) = ϕ(x) det ϕ() ϕ(z) [ = ϕ(z) e ( z)( x)] (.6) wit z = s <. Let λ () be te largest eigenvalue of te te integral operator wit kernel (.6): λ ()p(z) = p(x)p () (x z)dx, z <, were eigenfunction p(x) is some probability density on (, ]. Te Ruelle-Krasnoselskii-Perron- Frobenius teory of bounded linear positive operators (see e.g. Teorem XIII.43 in [8]) implies tat te maximum eigenvalue λ of te operator wit kernel K(x, z) = p () (x z) is simple, real and positive and te eigenfunction p(x) can be cosen as a probability density. Similarly to wat we ave done below in Section.., we can suggest computing good numerical approximations to λ () using Gauss-Legendre quadrature formulas. However, we suggest to use (4.5) from [9] instead; tis elps us to obtain te following simple but rater accurate approximation to λ (): [ ] ˆλ () = Φ() + ϕ()/ ϕ()[ϕ() + Φ()]/ Φ() e / /. Approximation : F T () F () [ˆλ () ] T (T ); Λ () () = log ˆλ (), λ () () = ˆλ ().... Transition in a twice longer interval Consider now te interval [0, ]. We could ave extended te metod of Section.. and used te eigenvalue (square root of it) for te transition s 0 s wit transition density expressed in (.5) wit n =. Tis would improve Approximation but tis improvement is only marginal. Instead, we will use anoter approac: we consider te transition s s but use te interval [0, ] just for setting up te initial condition for observing S(t) at t [, ]. 3
For n =, te expression (.4) for te joint probability density function for te values S(0), S(), S() under te condition S(t) < for all t [0, ] as te form ϕ(s 0 ) ϕ(s 0 +s ) ϕ(s 0 +s +s ) p(s 0, s, s ) = ϕ(s 0 )F ( s0 ) det ϕ() ϕ(s ) ϕ(s +s ). ϕ( s ) ϕ() ϕ(s ) Denote by p (z), z <, te non-normalized density of S() under te condition S(t) < for all t [0, ] tat satisfies p (z)dz = F (). Using (.6), we obtain p (z) = p () (x z)ϕ(x)dx = Φ()ϕ(z) Φ(z)ϕ(). Ten te transition density from x = s to z = s under te condition S(t) < for all t [0, ] is acieved by integrating s 0 out and renormalising te joint density: ϕ(s 0 ) ϕ(s 0 +x) ϕ(s 0 +x+z) q (x z) = det ϕ() ϕ(x) ϕ(x+z ) ds 0 p (x) ϕ( x) ϕ() ϕ(z) = Φ()ϕ(x) Φ(x)ϕ() det Φ() Φ(x) Φ(x+z ) ϕ() ϕ(x) ϕ(x+z ) ϕ( x) ϕ() ϕ(z) Let λ () be te largest eigenvalue of te integral operator wit kernel q : λ ()q(z) = q(x)q (x z)dx, z <, were eigenfunction q(x) is some probability density on (, ]. Similarly to te case n =, λ () is simple, real and positive eigenvalue of te operator wit kernel K(x, z) = q (x z) and te eigenfunction q(x) can be cosen as a probability density. In numerical examples below we approximate λ () using te metodology described in [0], p.54. It is based on te Gauss-Legendre discretization of te interval [ C, ], wit some large C > 0, into an N-point set x,..., x N (te x i s are te roots of te N-t Legendre polynomial on [ C, ]), and te use of te Gauss-Legendre weigts w i associated wit points x i ; λ () and q(x) are ten approximated by te largest eigenvalue and associated eigenvector of te matrix D / AD /, were D = diag(w i ), and A i,j = q (x i x j ). If N is large enoug ten te resulting approximation ˆλ () to λ () is arbitrarily accurate.. Approximation : F T () F () [ˆλ () ] T (T ); Λ () () = log ˆλ (), λ () () = ˆλ ()...3. Quality of Approximations and Approximation is more accurate tan Approximation 0 but it is still not accurate enoug. Tis is related to te fact tat te process S(t) is not Markovian and te beaviour of S(t) on te interval [i, i + ] depends on all values of S(t) in te interval [i, i] and not only on te value s i = S(i), wic is a simplification we used for derivation of Approximation. Approximation corrects te bias of Approximation by considering twice longer intervals [i, i + ] and using te beaviour of S(t) in te first alf of te interval [i, i + ] just for setting up te initial condition at [i, i + ]. As sown in Section 3, Approximation is muc more accurate tan Approximations 0 and. Te approximations developed in te following section also carefully consider te dependence of S(t) on its past; tey could be made arbitrarily accurate (on expense of increased computational complexity). 4
.3. Furter approximations taking into account te non-markovianity of S(t) As mentioned above, te beaviour of S(t) on te interval [i, i + ] depends on all values of S(t) in te interval [i, i] and not only on te value s i = S(i). Te exact value of te Sepp s constant λ() can be defined as te limit (as i ) of te probability tat S(t) < for all t [i, i + ] under te condition S(t) < for all t i. Using te formula for conditional probability, we obtain λ() = lim i F i ()/F i (). (.7) Waiting a long time witout reacing is not numerically possible and is not wat is really required for computation of λ(). Wat we need is for te process S(t) to (approximately) reac te stationary beaviour in te interval [i, i] under te condition S(t) < for all t < i. Since te memory of S(t) is sort (it follows from te representation S(t) = W (t) W (t + ), were W (t) is te standard Wiener process), tis stationary beaviour of S(t) is practically acieved for very small i, as is also seen from numerical results of Section 3. Moreover, since ratios F i ()/F i () are very close to F i ( 0)/F i ( 0) for i, we can use ratios F i ( 0)/F i ( 0) in (.7) instead. For computing te approximations, it makes integration easier. Te above considerations give rise to several approximations formulated below. We start wit simpler approximations wic are easy to compute and end up wit approximations wic are extremely accurate but are arder to compute. We claim tat for all, Approximation 7 as at least seven correct decimal places as te true value of λ(). However, we would not recommend extremely accurate Approximations 6 and 7 since Approximations 4 and 5 are already very accurate, see Tables and, but are muc easier to compute. Approximation 3, te simplest in te bunc, is also quite accurate. Note tat all approximations for F T () can be applied for any T. [ T Approximation 3: F T () F () λ ()] (3), were λ (3) () = F ( 0)/F ( 0). [ T Approximation 4: F T () F () λ ()] (4), were λ (4) () = F ()/F (). [ T Approximation 5: F T () F () λ ()] (5), were λ (5) () = F 3 ( 0)/F ( 0). [ T 3 Approximation 6: F T () F 3 () λ ()] (6), were λ (6) () = F 3 ()/F (). [ T 4 Approximation 7: F T () F 4 () λ ()] (7), were λ (7) () = F 4 ()/F 3 (). Numerical complexity of tese approximation is related to te necessity of computing eiter F n ( 0) or F n () for suitable n. It follows from (.3) tat F n ( 0) is an n-dimensional integral. Consequently, F n () is an (n + )-dimensional integral. In bot cases, te dimensionality of te integral can be reduced by one, respectively to n and n, wit no furter analytical reduction possible. In view of results of Section 4, computation of Approximations 3 and 4 is easy, computation of Approximation 5 requires numerical evaluation of a one-dimensional integral (wic is not ard) but to compute Approximation 7 we need to approximate a tree-dimensional integral, wic as to be done wit ig precision as oterwise Approximation 7 is not wort using: indeed, Approximations 4 6 are almost as good but are muc easier to compute. As Approximation 7 provides us wit te values wic are practically indistinguisable from te true values of λ(), we use Approximation 7 only for te assessment of te accuracy of oter approximations and do not recommend using it in practice. 3. Numerical results In tis section we discuss te quality of approximations introduced in Sections,. and.3. In Table, we present te values of λ (i) (), i = 0,,..., 7, for a number of different. As mentioned above, λ (7) () is practically te true λ() and terefore we compare all oter approximations against 5
λ (7) (). In Table we present te relative errors of all oter approximations against λ (7) (); tat is, te values λ (i) ()/λ (7) () for i = 0,,..., 6. From tese two tables we see tat Approximations -7 are very accurate across all. =0.5 = =.5 = =.5 =3 =3.5 =4 =4.5 λ (0) () 0.83859 0.785079 0.83430 0.897644 0.9576 0.98679 0.996950 0.999465 0.99998 λ () () 0.43754 0.59656 0.76590 0.88505 0.955674 0.986738 0.996958 0.999466 0.99998 λ () () 0.366973 0.56346 0.746457 0.87979 0.9545 0.986566 0.996939 0.999464 0.99998 λ (3) () 0.36046 0.564075 0.74833 0.880358 0.954548 0.986534 0.996930 0.999463 0.99998 λ (4) () 0.365730 0.56888 0.746559 0.87983 0.954556 0.986570 0.996939 0.999464 0.99998 λ (5) () 0.367994 0.56456 0.74773 0.879946 0.954565 0.98657 0.996939 0.999464 0.99998 λ (6) () 0.3679 0.5643 0.74709 0.879945 0.954566 0.98657 0.996939 0.999464 0.99998 λ (7) () 0.3684 0.564385 0.7479 0.879945 0.954566 0.98657 0.996939 0.999464 0.99998 Table : λ (i) (), i = 0,,..., 7, for different. =0.5 = =.5 = =.5 =3 =3.5 =4 =4.5 λ (0) ().8e+00 3.9e-0.0e-0.0e-0.68e-03.5e-04.4e-05 3.4e-07 6.06e-09 λ () ().4e-0 5.63e-0.07e-0 5.77e-03.6e-03.69e-04.93e-05.88e-06.56e-07 λ () () -3.0e-03 -.0e-03-8.86e-04 -.57e-04-4.56e-05-4.6e-06 -.56e-07-7.8e-09 -.37e-0 λ (3) () -.4e-0-5.49e-04.6e-03 4.70e-04 -.86e-05-3.76e-05-9.3e-06 -.8e-06 -.e-07 λ (4) () -6.48e-03 -.65e-03-7.50e-04 -.9e-04 -.09e-05 -.06e-07.6e-08.35e-09 3.0e- λ (5) () -3.6e-04.49e-04 7.e-05.38e-06 -.45e-06 -.33e-07 -.3e-09 8.48e-.6e- λ (6) () -5.e-04 -.30e-04 -.43e-05-5.73e-08 8.37e-08 3.9e-09 5.45e-.4e- -.63e- Table : Relative errors of λ (i) (), i = 0,,..., 6, against λ (7) (). A plot of te relative errors can be seen in Figure a, were te number next to te line corresponds to te approximation. Approximations,4,6 and 7 are monotonically increasing across all and suggest very accurate lower bounds for te true λ(). Approximations 0 and appear to provide upper bounds for λ() for all. (a) Relative error of λ (i) (), i = 0,..., 6, against λ (7) () (b) λ (0) () (dotted red), λ () () (dased blue) and λ (6) () (solid green) Figure As mentioned in Section..3, Approximation is not as accurate as Approximations 7 because it does not adequately take into account te non-markovianity of S(t). In Figure b we 6
ave plotted λ (0) () (dotted red line), λ () () (dased red line) and λ (6) () (solid green line) for a range of interesting. Visually, all λ (i) () wit i =, 4, 5, 6, 7 would be visually indistinguisable from eac oter on te plot in Figure b and λ (3) () would be very close to tem. In Figure a we illustrate te rate of convergence of [log F n ()]/n in (.) to te Sepp s constant Λ() as n increases. Te dotted black lines correspond to simulation results obtained by computing te probability F n () wit 00,000 simulations for =.5, and 3. Te solid red lines correspond to Approximation 4 for Λ() wit te cosen. Figure a demonstrates ow accurate tis computationally ceap approximation is and also demonstrates te importance of te multiplying constant F () in Approximation 4 to correct for te non-linear beaviour seen for small n. In Figure b we investigate te rate of convergence and accuracy of convergence to Λ() using all approximations, were we ave fixed = 3. In tis figure, Approximations 4, 5, 6 and 7 produce results tat are visually indistinguisable to Approximation and ence are not plotted. Terefore, in tis figure, Approximation can be considered as giving te true Sepp s constant Λ(). Te number next to te line corresponds to wic approximation was used. (a) Rate and accuracy of convergence to te Sepp s constant Λ() using simulations (dotted black line) and using Approximation 4 (solid red) for =.5, and 3. (b) Rate and accuracy of convergence to Λ() wit = 3 using all approximations (numbers next to line correspond to wic approximation as been used). Figure 4. Appendix 4.. Simplified form of F () and its approximation Using (.) and canging te order of integration were suitable, F () can expressed troug a one-dimensional integral as follows: F () = Φ() 3 + ϕ() Φ() + ϕ() ϕ()φ()[φ() + ϕ()] [ ( )Φ() + ϕ() ] + 0 Φ(y) ϕ( y)dy Φ( y)ϕ( [ ) Φ( y) / ] dy. Using approximations for Φ(t), it is possible to approximate F () very accurately. For example, 7
using te approximation (see []) Φ(t) = { 0.5 exp(0.77t 0.46t ) for t 0 0.5 exp( 0.77t 0.46t ) for t > 0, (4.) we obtain F () = Φ() 3 + ϕ() Φ() + ϕ() [ ( )Φ() + ϕ() ] ϕ()φ() [Φ() + ϕ()] + Φ() Φ() 0.5 [ e J(0.96, b, ) π J(.33, b, ) V (.46, b, ) π + V (, b, ) + π π K(.5, b, ) { K(.33, b 3, 0) } ] U(.46, b 4, 0), (4.) π were b = 0.77, b = b 0.77, b =, b 3 = b +.5, b 4 = b +.434, K(x, y, z) = ( ) πe y /(4x) xz y Φ, U(x, y, z) = [ yk(x, y, z) e z(y xz)], (4.3) x x x J(x, y, z) = K(x, y, z) K(x, y, 0) and V (x, y, z) = U(x, y, z) U(x, y, 0). Table 3 sows tat approximation (4.) is very accurate across all of interest. =0.5 = =.5 = =.5 =3 =3.5 =4 =4.5 F () 0.08504 0.50896 0.5068 0.744845 0.900875 0.970790 0.993430 0.998866 0.999849 (4.) 0.084687 0.5003 0.50097 0.744837 0.900875 0.970790 0.993430 0.998866 0.999849 Table 3: Accuracy of approximation (4.) for F () Using (.), we can express F ( 0) as follows: F ( 0) = Φ() ϕ(0) ϕ()φ() ϕ()φ() + ϕ(0) Using (4.), we can obtain te approximation ϕ(0) ϕ( y)φ( y)ϕ(y)dy Φ( y)ϕ( y)ϕ(y)dy. F ( 0) = Φ() ϕ(0) ϕ()φ() ϕ()φ() + ( ) [ πϕ Φ ( ) Φ ( )] ϕ() e.664 {e.434 [K(.46,.664 + 0.77, ) K(.46,.664 + 0.77, )] e.434 [K(.46,.664 0.77, ) K(.46,.664 0.77, )]} [ ] e0.77.46 K(.46,.83 0.77, ) K(.46,.83 0.77, ), (4.4) π were K(x, y, z) can be found in (4.3). Table 4 sows tat approximation (4.4) is very accurate across all of interest. 5. Conclusions In is seminal paper [], L. Sepp derived explicit formulas for F T () = Pr { max t [0,T ] S(t) < }, te distribution of maximum of te so-called Slepian process S(t). As tese explicit formulas are 8
=0.5 = =.5 = =.5 =3 =3.5 =4 =4.5 F ( 0) 0.09039 0.30357 0.576857 0.800758 0.9765 0.9797 0.995607 0.99964 0.999905 (4.4) 0.088648 0.30448 0.57660 0.80079 0.97648 0.9797 0.995607 0.99964 0.999905 Table 4: Accuracy of approximation (4.4) for F ( 0) complicated, in te same paper L. Sepp as introduced a constant Λ() = lim T T log F T () (wic we call Sepp s constant) measuring te rate of decrease of F T () as T grows; L. Sepp also raised te question of constructing accurate approximations and bounds for tis constant. Until now, tis question as not been addressed. To answer it, we ave constructed different approximations for F T () (and ence for Λ()). We ave sown in Section 3 tat at least some of tese approximations are extremely accurate. We ave also provided oter approximations tat are almost as good but are muc simpler to compute. References [] LA Sepp. First passage time for a particular Gaussian process. Te Annals of Matematical Statistics, pages 946 95, 97. [] HJ Landau and LA Sepp. On te supremum of a Gaussian process. Sankyā: Te Indian Journal of Statistics, Series A, pages 369 378, 970. [3] M Marcus and LA Sepp. Sample beavior of Gaussian processes. In Proc. of te Sixt Berkeley Symposium on Mat. Statist. and Prob, volume, pages 43 4, 97. [4] RJ Adler and J Taylor. Random Fields and Geometry. Springer, 007. [5] D Aldous. Probability Approximations via te Poisson Clumping Heuristic. Springer Science & Business Media, 989. [6] CB Mer and JA McFadden. Certain properties of Gaussian processes and teir first-passage times. Journal of te Royal Statistical Society. Series B (Metodological), pages 505 5, 965. [7] D Slepian. First passage time for a particular Gaussian process. Te Annals of Matematical Statistics, 3():60 6, 96. [8] M Reed and B Simon. Metods of Modern Matematical Pysics: Scattering teory Vol. 3. Academic Press, 979. [9] J Noonan and A Zigljavsky. Approximations of te boundary crossing probabilities for te maximum of moving sums. arxiv preprint arxiv:80.099, 08. [0] JL Moamed and LM Delves. Computational Metods for Integral Equations. Cambridge University Press, 985. [] JT Lin. Approximating te normal tail probability and its inverse for use on a pocket calculator. Applied Statistics, 38():69 70, 989. 9