MINIMAX RATES OF CONVERGENCE AND OPTIMALITY OF BAYES FACTOR WAVELET REGRESSION ESTIMATORS UNDER POINTWISE RISKS
|
|
- Julie Blankenship
- 5 years ago
- Views:
Transcription
1 Statistica Siica 19 (2009) : Supplemet S1 S17 1 MINIMAX RATES OF ONVERGENE AND OPTIMALITY OF BAYES FATOR WAVELET REGRESSION ESTIMATORS UNDER POINTWISE RISKS Natalia Bochkia ad Theofais Sapatias Uiversity of Ediburgh ad Uiversity of yprus Supplemetary Material This ote cotais detailed proofs for Theorems 1-6 ad Propositios 1-2. Proofs of most auxiliary statemets are also icluded for completeess. Throughout this ote we use a geeric positive costat which is ot ecessarily the same eve withi a sigle equatio. S1. No-adaptive ad adaptive miimax rates of covergece uder poitwise l u -risks (1 u < ) i the stadard oparametric regressio model. First we prove Theorem 1 (o-adaptive case). Proof of Theorem 1. Lower boud] R u ( f Bpq(A) r t 0 ) u(r 1/p) estimator f ad ay sequece B as ] lim sup u(r 1/p) B sup E f(t 0 ) f(t 0 ) u f Bpq r (A) as is equivalet to the followig: for ay =. (7.1) Suppose ow that (7.1) does ot hold for some estimator f ad some sequece B such that B ad / log (B ) as i.e. lim sup u(r 1/p) B Hece for ay f 0 B r pq(a ) A < A sup f B r pq(a) ] lim sup u(r 1/p) B E f(t 0 ) f 0 (t 0 ) u <. E f(t 0 ) f(t 0 ) u ] <. (7.2) Thus Theorem S1.2 yields that for B ad / log (B ) as lim if (/ log (B ) ) ] u(r 1/p) sup E f(t 0 ) f(t 0 ) u > 0 f Bpq(A) r
2 2 NATALIA BOHKINA AND THEOFANIS SAPATINAS ad hece lim sup u(r 1/p) ] sup E f(t 0 ) f(t 0 ) u = f Bpq(A) r which cotradicts the assumptio made i (7.2). This completes the proof for the lower boud of Theorem 1. Upper boud] The validity of the upper bouds follows from the followig theorem. (Note that Theorems S1.1 ad S1.3 are proved uder a slightly more geeral coditio o the error distributio.) Theorem S1.1. Assume (S1) (S2) ad (S3) 1 u < ad f B r pq(a) with 1 p q A > 0 ad 1/p < r < s. Let ϕ be the pdf of N(0 σ 2 /2) 0 < σ σ σ < ad let ˆf be a hard thresholdig wavelet estimator with threshold t : σ 1/2 for = L L t = σ 1/2 log 2 (2u)] 1/2 for = J 1 1 where 1 = log 2. The for ay t 0 (0 1) ( ) R( u ˆf Bpq(A) r t 0 ) = O u(r 1/p) as. Proof of Theorem S1.1. It is easily see that the risk is bouded by R( u ˆf Bpq(A) r t 0 ) 2 L/2 E(ˆθ k θ k ) u ] 1/u φ k K L 1 (t 0 ) k K L 1 (t 0 ) k K (t 0 ) Term Q 11 + Q 12 i (7.3) is bouded by k K (t 0 ) =J k K (t 0 ) 2 L/2 θ k θ k φ 2 /2 E(ˆθ k θ k ) u ] 1/u ψ 2 /2 θ k θ k ψ 2 /2 θ k ψ = Q 11 + Q 12 + Q 21 + Q 22 + Q 3 ] u. (7.3) 2 L/2 V(ˆθ k )] 1/2 + 2 L/2 θ k θ k 1/2 σ L 1 + r k K L 1 k K L 1 ( = O 1/2) ( + o (r 1/p)) ( ) = o (r 1/p) u
3 POINTWISE OPTIMALITY UNDER GENERAL LOSSES 3 due to Lemma A.4 i Bochkia ad Sapatias (2006) ad the fact that V(ˆθ k ) = O( 1 ). O the other had term Q 3 i (7.3) is bouded by =J k K (t 0 ) 2 /2 θ k = O =J 2 /2 2 (r 1/p+1/2) = O (2 J(r 1/p)) ( (r 1/p)) ( = o (r 1/p) due to Lemma A.4 i Bochkia ad Sapatias (2006) which also implies that term Q 22 i (7.3) is domiated by (r 1/p) (log ) I(p= ). Thus we have that R( u ˆf ] 1/u Bpq(A) r t 0 ) (r 1/p) + k K (t 0 ) ) 2 /2 E θ k ˆθ k u] 1/u. (7.4) osider separately the sums for low ad high resolutio levels usig Lemma A.3 i Bochkia ad Sapatias (2006) to boud the summad i the first sum ad Lemma S5.6 to boud the summad i the secod sum: R u ( ˆf B r pq(a) t 0 )] 1/u (r 1/p)/() /2 k K (t 0 ) 1 2 /2 t E d k θ k u I( d k > t )] 1/u. The first sum is easy to calculate sice by defiitio t is idepedet of for 1 ad it is (r 1/p) equal to. Sice for > 1 t by Lemma A.4 i Bochkia ad Sapatias (2006) ad Lemma S5.7 the secod sum is bouded from above by 1/2 2 /2 (u+1)/2u 2 + 1/2 log ] (u+1)/2u 2 1/2 + k K (t 0 ) 2 /2 θ k 2 (r 1/p) = O ( (r 1/p) ). Thus for ay t 0 (0 1) R u ( ˆf B r pq(a) t 0 ) u(r 1/p) as. This completes the proof of Theorem S1.1. proved. Thus the proof for the upper boud of Theorem 1 is completed ad hece Theorem 1 is We ow prove Theorem 2 (adaptive case). I order to do that we eed some prelimiary results.
4 4 NATALIA BOHKINA AND THEOFANIS SAPATINAS Theorem S1.2. Take (r p q) such that r > 1 p 1 p q ad a sequece B such that B / log (B ) as. Let f be a estimator of f based o observatios from the stadard oparametric regressio model (2.1). If f 0 Bpq(A r ) with 0 < A < A satisfyig ] lim sup u(r 1/p) B E f(t 0 ) f 0 (t 0 ) u < the lim if (/ log (B ) ) ] u(r 1/p) sup E f(t 0 ) f(t 0 ) u > 0. f Bpq(A) r Proof of Theorem S1.2. Let X be a radom variable havig either distributio P θ0 with desity f θ0 or distributio P θ1 with desity f θ1 with respect to some domiatig measure. For ay estimator δ = δ(x) of θ {θ 0 θ 1 } its l u -risk (1 u < ) is defied by R u (δ θ) = E δ(x) θ u. Deote by κ(x) = f θ1 (x)/f θ0 (x) the ratio of the two desities. (κ(x) = for some x is possible with the obvious iterpretatio κ(x)f θ0 (x) = f θ1 (x).) For 1 u < deote by u the value satisfyig 1/u + 1/u = 1 (i.e. u ad u are cougate umbers). Let I u := I u (θ 0 θ 1 ) = E θ0 (κ(x)) u ] 1/u with obvious chage for u = (this is a measure of distace betwee the two distributios P θ0 ad P θ1 ). Take f (t) = γ 1 g(β (t t 0 )) + f 0 (t) t 0 1] where 1) g is a compactly supported ad mootoically decreasig fuctio satisfyig (i) g B r pq(a A ) (ii) g(x) 0 for x (0 1] g(0) > 0 ad (iii) g 2 2 > 0 (such a fuctio is easy to costruct either directly or by usig wavelets); deote by b = uc u g 2 2 ] 1 > 0 where c u = 1/2(u 1)] for u > 1 ad c u = 1 for u = 1; 2) γ = (/(b log (B ))) (r 1/p) ad β = (/(b log (B ))) 1. Note that 2) above implies that γβ 2 = /(b log (B )) ad γ 1 β r 1/p = 1. The i view of Lemma 1 i ai (2003) f B r pq(a). Write P 0 ad P 1 for the probability measures o R geerated from the stadard oparametric regressio model (2.1) with f = f 0 ad f = f respectively. Sice σ 2 is assumed kow we take σ 2 = 1 without loss of geerality. The a sufficiet statistic for the family of probability measures {P 0 P 1 } is give by the log likelihood ratio T = log(dp 1 /dp 0 ) (see e.g. Brow ad Low (1996b)). Set ρ = { } i=1 where t i := i/ i = The g 2 (β (t i t 0 )) γ 2 N( ρ /2 ρ ) uder P 0 T N( ρ /2 ρ ) uder P 1
5 POINTWISE OPTIMALITY UNDER GENERAL LOSSES 5 ad κ(x) = e x. Note that for large ad r > 1/p + 1/2 ρ ρ := 1 0 ρ = f f = γ 2 (1996b)). Now defie ḡ by β 1 g 2 2 g 2 (β (t t 0 )) dt where γ 2 (by followig the argumets of Sectio 5 i Brow ad Low ḡ(β (t t 0 )) = g(β (t i t 0 )) t i 1 < t t i i = with ḡ(0) = g(0). The ρ = 1 0 ḡ 2 (β (t t 0 )) dt. γ 2 Followig the argumets i the proof of Theorem 4 i ai (2003) we ca rewrite I u = ρ e ρ (u 1)/2 = e 2(u 1) for 1 < u < ad for u = Ĩ = κ(x)i(x ρ ) = e ρ. We ca uify the two cases by writig I u = e γ 2 β 1 values of β ad γ i the defiitio of g we get that I u = B 1/u /(ub) e µcu where µ = ρ ρ. Substitutig the e µcu sice γ 2 β 1 /b = log (B ). Sice µ does ot ecessarily goes to zero whe 1/p < r < 1/p + 1/2 we use the assumptio that g is o-egative ad decreasig to obtai the iequality that µ 0 for r > 1/p which is sufficiet for our purpose. Ideed sice g is o-egative ad decreasig for t t i 1 t i ] we get ḡ 2 (β (t t 0 )) = g 2 (β (t i t 0 )) g 2 (β (t t 0 )) implyig that ρ ρ (i.e. µ 0) ad thus that I u = e ρ c u e ρcu = B 1/u. Let ow δ = f (t 0 ) θ 0 = f 0 (t 0 ) ad θ 1 = f (t 0 ). Note that for sufficietly large = f (t 0 ) f 0 (t 0 ) = g(0)γ 1. For some > 0 ad large eough Lemma 2(i) i ai (2003) states that R u (δ θ 1 ) u (1 uε u I u / ) where R u (δ θ 0 ) ε u u; i our case it is give that ε u = 1/u value of ε u ad the upper boud for I u we get γ (r 1/p) B 1/u. Substitutig the ( ) R u (δ θ 1 ) = E f g(0) u { } (t 0 ) f (t 0 ) u 1 u 1/u (r 1/p) B 1/u B 1/u (1 + o(1))γ (g(0)) 1 This completes the proof of Theorem S1.2. ( =b g(0)] u(r 1/p) log B ) u(r 1/p) { } 1 u 1/u (b log(b )) (r 1/p) (1 + o(1))(g(0)) 1 ( ) u(r 1/p) =b g(0)] u(r 1/p) log B (1 + o(1)).
6 6 NATALIA BOHKINA AND THEOFANIS SAPATINAS We are ow ready to prove Theorem 2 (adaptive case). Proof of Theorem 2 Lower boud] osider two Besov classes B r i p i q i (A i ) with r i > 1/p i for i = 1 2. Let 0 < r 2 1/p 2 < r 1 1/p 1. Applyig Theorem S1.2 it immediately follows that if a estimator ˆf satisfies as lim sup l sup f B r 1 p 1 q 1 (A 1 ) E ˆf (t 0 ) f(t 0 ) u ] < for some l > u(r 2 1/p 2 )/(1 + 2(r 2 1/p 2 ) the ] (/ ) lim if u(r 2 1/p 2 ) log 2(r 2 1/p 2 )+1 sup E ˆf (t 0 ) f(t 0 ) u > 0. f B r 2 p 2 q 2 (A 2 ) This completes the proof for the lower boud of Theorem 2. Upper boud] The validity of the upper boud follows from the followig theorem. Theorem S1.3. Assume (S1) (S2) ad (S3) 1 u < ad f B r pq(a) with 1 p q A > 0 ad 1/p < r < s. Let ϕ be the pdf of N(0 σ 2 /2) 0 < σ σ σ < ad let ˆf be a hard thresholdig wavelet estimator with threshold t = σ 1/2 (u log 2) 1/2 for = L L J 1. The for ay t 0 (0 1) ( R( u ˆf Bpq(A) r t 0 ) = O log ) u(r 1/p) as. ( ) 1 Proof of Theorem S1.3. Let 2 be such that 2 2 = 2ν+1 log. From the proof of Theorem S1.1 it follows that we eed to fid a upper boud o (7.4) which we cosider separately for low ad high resolutio levels: 2 k K (t 0 ) 2 /2 E(ˆθ k θ k ) u ] 1/u + k K (t 0 ) 2 /2 E(ˆθ k θ k ) u ] 1/u = R 1 + R 2. By Lemma A.3 i Bochkia ad Sapatias (2006) the first sum is bouded from above by /2 t + O( 1/2 2 2/2 ) = 1/2 2 /2 σ 1/2 + O( 1/2 2 2/2 ) 1/2 log 2 ] 1/2 2 2/2 + O( 1/2 2 2/2 ) ( ) ν/(2ν+1). log I the proof of Theorem S1.1 we showed that if t 1/2 as E d k θ k u I( d k > t ) σ u u/2 t /σ u+1 exp{ (t /σ ) 2 /2}
7 POINTWISE OPTIMALITY UNDER GENERAL LOSSES 7 ( ) which holds here sice for > 2 t 1/2 1/2 2 = log 2 log as. Note that sice 2 < 1 ad t 1/2 for > 2 as we ca apply Lemma S5.7 to obtai a upper boud o the secod sum: 1/2 = ( log 2 /2 σ 1/2 t /σ 1+1/u exp{ (t /σ ) 2 /2u} 2 /2 1/2+1/2u exp{ log 2/2} 1/2 (log ) 1/2+1/2u ) 1/2 (log ) 1+1/2u = o This completes the proof of Theorem S1.3. proved. ( ) ν/(2ν+1). log Thus the proof for the upper boud of Theorem 2 is completed ad hece Theorem 2 is S2. Proof of theorems for o-adaptive Bayes factor wavelet estimators Set ε 1 = 1. Proof of Theorem 3. Followig the proof of Theorem S1.1 we oly eed to cosider (7.4): R = 2 /2 k K (t 0 ) E θ k ˆθ k u] 1/u = Slow + S high where S low represets the sum over idices L 1 ad S high represets the sum over 1 < J 1. Note that for the low resolutio levels 1 ν / = 2 m 1 1/2 = 2 m 1( 1 ) m 1ε 1 1/2 0 as (7.5) sice m 1 ε 1 1/2 = m 1 (r 1/p+1/2) 0. Similarly for the high resolutio levels ν / = 2 m 2 1/2 = 2 m 2( 1 ) m 2ε 1 1/2 as (7.6) sice m 2 ε 1 1/2 = m 2 (r 1/p+1/2) 0. Low resolutio levels L 1. We use Lemma A.3 i Bochkia ad Sapatias (2006) to boud S low from above: S low = 1 2 /2 k K (t 0 ) mi(t θ k ) + O(2 1/2 1/2 ) sice κ u = + x u ϕ (x)dx = cσ u + z u e z β dz < σ u < (uiformly).
8 8 NATALIA BOHKINA AND THEOFANIS SAPATINAS Sice ν / 0 due to (7.5) ad prior h satisfies the assumptios of Lemma S5.5 we ca apply Lemma S5.5 together with the upper boud for the threshold uder power-expoetial errors (Lemma 4(i) i Pesky ad Sapatias (2007)) to obtai the followig boud: S low 1 1 ( 2 /2 t + O(2 1/2 1/2 ) 1/2 2 /2 )] 1/β log β ν 1 + O(2 1 /2 1/2 ) 1 ( 1/2 2 /2 )] 1/β log β ν 1 + O(2 1 /2 1/2 ) = O ( r 1/p ) sice β ν 1 = 2 (a 1 m 1 ) b 1+1/2 b 1+1/2+ε 1 (a 1 m 1 ) + uder the assumptios of the theorem. Thus for the low level sum the rate is optimal. High resolutio levels J 1. We eed to show that the followig sum is bouded by ν/(2ν+1) : S high = 2 /2 k K (t 0 ) E θ k ˆθ k u] 1/u. By Lemma S5.6 S high 2 2 /2 k K (t 0 ) E θ k d k u I( d k > t )] 1/u /2 k K (t 0 ) Due to Lemma A.4 i Bochkia ad Sapatias (2006) the secod sum is bouded by 2 /2 2 (r 1/p+1/2) = 2 (r 1/p) = O( ν/(2ν+1) ) as. Note also that ν / due to (7.6). Now we cosider the first term. osider separately 2 cases: β 1 ad β > β 1. I this case λ ϕ = 0 ad if β > 1 by Lemma 2(ii) i Pesky (2006) θ k. ζ = 1 + o(ν 1 ) < β x implyig that I( d k > t k ) = I(ζ (d k ) > β ) = 0 ad thus the first sum is zero. S5.3: Note that β > 1 sice β = 2 a 2 b 2 b 2+ε 1 (a 2 ) + sice b 2 + ε 1 (a 2 ) + > β > 1. I this case we boud the summads i the first sum usig Lemma S5.7 ad Lemma E θ k d k u I( d k > t ) u/2 t /σ ] u+1 e (t /σ ) β = u/2 log β ] (u+1)/β β 1
9 POINTWISE OPTIMALITY UNDER GENERAL LOSSES 9 sice β for > 1 due to b 2 + ε 1 (a 2 ) + > 0 ad thus t 1/2. Thus the secod sum i this case is bouded by The last sum is equal to = 1/2 2 /2 1/2 log β ] (u+1)/uβ β 1/u 2 /2 b 2 log + a 2 log 2] (u+1)/uβ b 2/u 2 a 2/u 1/2 b2/u (b 2 + ε 1 (a 2 ) + ) log ] (u+1)/uβ 2 (1/2 a 2/u). (b 2+a 2 )/u log ] (u+1)/uβ 1/2 a 2 /u > 0 1/2 b 2/u log ] 1+(u+1)/uβ 1/2 a 2 /u = 0 (1 ε 1)/2 (b 2+ε 1 a 2 )/u log ] (u+1)/uβ 1/2 a 2 /u < 0. Thus this sum coverges to zero at a rate ot slower tha the optimal if b 2 + a 2 > u(1 ε 1 )/2 if a 2 u/2 b 2 + ε 1 a 2 > 0 if a 2 > u/2 which ca be rewritte as b 2 + ε 1 a 2 > (u/2 a 2 )(1 ε 1 ) if a 2 u/2 b 2 + ε 1 a 2 > 0 if a 2 > u/2 or b 2 + ε 1 a 2 > (u/2 a 2 ) + (1 ε 1 ). It is easy to check that this coditio implies b 2 + ε 1 (a 2 ) + > 0 required earlier. This completes the proof of Theorem 3. Proof of Theorem 4. Followig the proof of Theorem S1.1 we oly eed to cosider (7.4) which we cosider separately for low ad high resolutio levels. Low resolutio levels L 1. Sice m 1 r 1/p + 1/2 ν / 0 followig the proof for the low levels of Theorem 3 the sum for the low levels is bouded by the optimal rate plus 1 2 t which achieves the optimal rate if β /ν for the cosidered (Lemma S5.5) i.e. give 1/2+b 1 2 (a 1 m 1 ) 1 if a 1 m 1 > 0 β /ν = 1/2+b 1 2 (a 1 m 1 ) 1/2+b 1 1 if a 1 m 1 = 0 1/2+b 1 if a 1 m 1 < 0
10 10 NATALIA BOHKINA AND THEOFANIS SAPATINAS or equivaletly if 1/2 + b 1 + ε 1 (a 1 m 1 ) 0 if a 1 m 1 > 0 1/2 + b 1 < 0 if a 1 m 1 = 0 1/2 + b 1 0 if a 1 m 1 < 0. High resolutio levels J 1. Sice ϕ is a heavy tailed desity (3.3) by Lemma 2 (ii) i Pesky (2006) ζ (x) = I (x) ϕ ( x) = 1 + o(1) < β sice β = 2 a 2 b 2 b 2+ε 1 (a 2 ) + as due to b 2 + ε 1 (a 2 ) + > 0. Thus I(ζ ( d k ) > β ) = 0 ad the secod sum is zero. This completes the proof of Theorem 4. S3. Proof of theorems for adaptive Bayes factor wavelet estimators To prove adaptive error rates we use aother divisio of the resolutio levels with the critical level 2 defied by 2 = ( ) 1 2(r 1/p) + 1 log 2. (7.7) log Note that this adaptive critical level is smaller tha the o-adaptive critical level 1. Lemma S3.1. Assume (S1) (S2) ad (S3) 1 u < ad f B r pq(a) with 1 p q A > 0 ad 1/p < r < s. Let ϕ be the pdf of N(0 σ 2 /2) 0 < σ σ σ < ad let ˆf BF is the correspodig Bayes Factor estimator. We assume that h is such that ζ (x) icreases for x > 0. Deote f = max(1 ν / ) v = β f /ν as. The for ay t 0 (0 1) R u ( ˆf BF B r pq(a) t 0 )] 1/u ( ) (r 1/p) () log + 1/2 + 1/2 2 2 /2 f log( ϕh v )] 1/2 2 /2 v 1/u log v ] (u+1)/2u. (7.8) Proof of Lemma S3.1. Followig the proof of Theorem S1.1 we oly eed to cosider (7.4) ad usig Lemma A.3 i Bochkia ad Sapatias (2006) ad Lemma S5.6 we have that R u ( ˆf BF B r pq(a) t 0 )] 1/u ν/(2ν+1) /2 2 k K (t 0 ) 2 /2 t E θ k d k u I( d k > t )] 1/u.
11 POINTWISE OPTIMALITY UNDER GENERAL LOSSES 11 By Lemma S5.4 the first sum is bouded by 2 2 /2 t 2 2 /2 σ 1/2 ( 1 + ν2 2 1/2 2 /2 σ f log(ϕh β f /ν ) ] 1/2. ) 1/2 log( ϕh β 1 + /ν 2 ) ] 1/2 By Lemma S5.7 ad Lemma S5.2 the secod sum is bouded by 1/2 1/2 1/2 2 /2 t σ / ] 1+1/u exp{ t σ / ] 2 /u} 2 /2 log(β max(1 /ν ))] (u+1)/2u β max(1 /ν )] 1/u 2 /2 log(β f /ν )] (u+1)/2u β f /ν ] 1/u. Thus Lemma S3.1 is proved. If we boud the logarithmic term i the sums from above we obtai the followig corollary. orollary S3.1. Assume that ν ad β are such that for all = L L+1... J 1 ν c ad β /ν as such that log(β /ν ) B log for some c B > 0. The uder assumptios of Lemma 1 for ay t 0 (0 1) R u ( ˆf BF B r pq(a) t 0 )] 1/u ( ) r 1/p ( + log We are ow ready to prove Theorems 5 ad 6. log ) 1/2 1/2u 2 /2 (β /ν ) 1/u. (7.9) Proof of Theorem 5. By substitutig assumptio (3) of the theorem ito orollary S3.1 we obtai the followig boud: R u ( ˆf BF B r pq(a) t 0 )] 1/u ( ) r 1/p ( + log log ) 1/2 1/2u 2 /2 b/u 2 a/u.
12 12 NATALIA BOHKINA AND THEOFANIS SAPATINAS The latter sum is bouded by = ( ) 1/2 (log ) 1/2u log ( ) 1/2 (log ) 1/2u log b/u 1/2u ε 1(1/2 a/u) 1/2 a/u < 0 b/u 1/2u log 1/2 a/u = 0 b/u 1/2u (1/2 a/u) 1/2 a/u > 0 b/u 1/2u+ε 1(1/2 a/u) a > u/2 b/u 1/2u log a = u/2 b/u 1/2u+1/2 a/u a < u/2 ( ) 1/2 (log ) 1/2u+1 b/u 1/2u+(u/2 a) +/u log ( ) 1/2 ( ) r 1/p (log ) 1/2u+1 log log due to assumptio b + 1/2 (u/2 a) + 0. Thus Theorem 5 is proved. Proof of Theorem 6. Followig the proof of Theorem S1.1 we oly eed to cosider (7.4) ad usig Lemma A.3 i Bochkia ad Sapatias (2006) ad Lemma S5.6 we have that R u ( ˆf BF B r pq(a) t 0 )] 1/u ν/(2ν+1) /2 k K (t 0 ) 2 /2 t E θ k d k u I( d k > t )] 1/u. a) Low resolutio levels. Sice ν / 0 ad h is either heavy tailed (3.3) or ormal we ca apply Lemma 4(i) i Pesky ad Sapatias (2007) to boud the first sum by 1/2 2 2 /2 log(β /ν )] 1/β 1/2 (log ) 1/2 2 2/2 = sice we assume that log(β /ν ) Blog ] β/2. b) High resolutio levels. By Lemma S5.7 we have ( ) (r 1/p)/() log E ˆθ k θ k u I( d k > t ) u/2 t /σ ] u+1 e t /σ ] β ad by Lemma S5.2 we have ϕ (t ) ϕ (0)(β /ν ) 1 sice ν / 0. Hece t σ 1/2 log(β /ν )] 1/β with β /ν implyig E ˆθ k θ k u I( d k > t ) u/2 log(β /ν )] (u+1)/β β /ν ] 1
13 POINTWISE OPTIMALITY UNDER GENERAL LOSSES 13 ad = 2 k K (t 0 ) 2 /2 E ˆθ k θ k u I( d k > t )] 1/u 1/2 1/2u log ] (u+1)/(2u) = 2 2 /2 β /ν ] 1/u. We have the same sum as i the proof of Theorem 5 but with a differet power of the logarithmic factor. Therefore uder the same assumptios as i Theorem 5 we obtai the optimal rate of covergece. Hece Theorem 6 is proved. S4. Proof of propositios for a priori Besov regularity Proof of Propositio 1. Accordig to Theorem 3 i Pesky ad Sapatias (2007) we eed to check that lim 2 (r+1/2) ν 1 β 1/p ] mi(p q) <. Deote κ = mi(p q) 1 ) the the sum ca be writte as κb 1/p 1 2 κ(r+1/2 m 1 a 1 /p) + κb 2/p = κb 1/p+κ(r+1/2 m 1 a 1 /p) + log ] I(r+1/2 1/p m 1 a 1 /p=0) + κb 2/p+κZ(r+1/2 m 2 a 2 /p) log ] I(r+1/2 1/p m 2 a 2 /p=0) 2 κ(r+1/2 m 2 a 2 /p) where Z = 1 if r+1/2 1/p m 2 a 2 /p 0 ad Z = 1 if r+1/2 1/p m 2 a 2 /p < 0. This sum is fiite as if b 1 /p (r + 1/2 m 1 a 1 /p) + 0 ad b 2 /p Z(r + 1/2 m 2 a 2 /p) 0 the iequalities are strict if r + 1/2 m 1 a 1 /p = 0 or r + 1/2 m 2 a 2 /p = 0 respectively. Proof of Propositio 2. Accordig to Theorem 3 i Pesky ad Sapatias (2007) we eed to check that lim 2 (r+1/2) ν 1 β 1/p ] mi(p q) <. Deote κ = mi(p q) 1 ) the the sum ca be writte as κb/p 2 κ(r+1/2 m (a+m)/p) = κb/p+κ(r+1/2 m (a+m)/p) + log ] I(r+1/2 1/p m (a+m)/p=0) which is fiite as if b/p (r + 1/2 m (a + m)/p) + 0 ad the iequality is strict if r + 1/2 m (a + m)/p = 0. S5. Auxiliary lemmas. Lemma S5.2. If ϕ ad h are symmetric uimodal desity fuctios the ϕ ( ( t ) β 1 mi ϕ (0) h(0)ν ).
14 14 NATALIA BOHKINA AND THEOFANIS SAPATINAS Proof of Lemma S5.2. Note that the symmetry ad uimodality of ϕ implies that ϕ (x) ϕ (0) for ay x. Therefore the equatio for the threshold t (see expressio above (3.11)) ca be rewritte as follows β = ζ (t ) = + ϕ ( (t x))ν h(ν x)dx ϕ ( t ) + ϕ (0)ν h(ν x)dx ϕ ( = t ) Similarly by symmetry ad uimodality of h we have β = ζ (t ) = + ϕ ( x)ν h(ν (t x))dx ϕ ( t ) ϕ (0) ϕ ( t ). + ϕ ( x)ν h(0)dx ϕ ( = t ) ν h(0) ϕ ( t ). Rearragig the terms we have ϕ ( { t ) mi β 1 ϕ (0) β 1 h(0)ν / }. (7.10) Thus Lemma S5.2 is proved. The followig lemma is a obvious corollary from Lemma S5.2. Lemma S5.3. If ϕ (x) = c β σ 1 e x/σ β β > 0 ad h is a symmetric uimodal desity the { ( )] } 1/β β t σ max log log (β )] 1/β. h(0)ν Lemma S5.4. Take ϕ (x) N(0 σ 2 /2) ϕ (x)/h(x) ϕh ad let ζ (x) be icreasig for x > 0. The t σ 1/2 ( 1 + ν2 ) 1/2 log ϕh β 1 + ν 2 1/2. Proof of Lemma S5.4. Sice ζ (x) icreases for x > 0 ζ (x) β if ad oly if x t. We ca fid the followig lower boud o ζ (x) usig h(x) 1 ϕh ϕ (x). More precisely ζ (x) = ϕ (x )] 1 ν ϕ ((x y) )h(ν y)dy 1 ϕh ϕ (x )] 1 ν = 1 ϕh = 1 ϕh ν πσ ϕ ((x y) )ϕ (ν y)dy exp { ( + ν 2 )y 2 /σ 2 + 2xy/σ 2 } dy { } ν 2 x 2 exp ν 2 + ( + ν 2. )σ2
15 POINTWISE OPTIMALITY UNDER GENERAL LOSSES 15 { } Take x > 0 such that 1 ν ϕh exp 2 x 2 = β + (+ν 2. The )σ2 ν 2 t x = σ ( 1 + ν2 ) 1/2 log ϕh β 1 + ν 2 1/2. Thus Lemma S5.4 is proved. Lemma S5.5. Let ϕ ad h be symmetric uimodal desities ad ϕ have fiite variaces σ 2 0 < σ σ σ < h has a bouded secod derivative ad ζ (x) icreases for x > 0. The if ν / 0 as β /ν 1 if ad oly if t 0 1/2 for some 1 0 > 0 which deped o ϕ ad h but ot o ν or. Remark S5.1. If the coditio that h has a bouded secod derivative is replaced with the coditios that ϕ (x)/h(x) ϕh ad ϕ have bouded secod derivatives (uiformly i ) the β /ν 1 implies that t 0 1/2. Proof of Lemma S5.5. osider the fuctio ζ (x) at poit x = v where v is idepedet of. The ζ (x ) = ϕ ( + x )] 1 ν ϕ ( y)h(ν (x y))dy = ϕ (v)] 1 ν + ϕ (z)h = ϕ (v)] 1 ν + ϕ (z) h(0) + ( ) ν (v z) = ϕ (v)] 1 ν ( ) ] 2 ν h(0) + O (v 2 + σ 2 ). dz ( ν ) 2 (v z) 2 h (c(v z))/2 ] dz Sice ζ (x) icreases t x if ad oly if ζ (x ) ζ (t ) = β. The last iequality ca be writte as ( ) ] 2 ϕ (v)] h(0) 1 ν + O (v 2 + σ 2 ) β /ν. As ν / 0 the left had side teds to = h(0)/ϕ (v) uiformly i sice σ 2 are bouded from above ad below implyig that β /ν 0 if ad oly if t 1 1/2 where 1 = v is a costat. Thus Lemma S5.5 is proved. We state the obvious lemma below due to its frequet use i the proofs.
16 16 NATALIA BOHKINA AND THEOFANIS SAPATINAS Lemma S5.6. Let ˆθ k be a hard thresholdig estimator of θ k with threshold t based o observatio d k ad 1 u <. The E ˆθ k θ k u = θ k u I( d k t ) + E d k θ k u I( d k > t ) θ k u + E d k θ k u I( d k > t ). Lemma S5.7. If ϕ (x) = β σ 1 exp{ x/σ β } ad t for > 1 as the E θ k d k u I( d k > t )] 1/u β t σ / ] u+1 exp{ t σ / ] β }. Proof of Lemma S5.7. It is easily see that E d k θ k u I( d k > t ) = x θ k u exp{ (x θ k )/σ β }dx σ 2π x >t = σu u/2 2π yσ / +θ k >t y u exp{ y β }dy. Sice for B > max(0 (u + 1)/β 1] 1/β ) we have the followig boud: B z u e z β dz 1 β Bu+1 e Bβ (7.11) E d k θ k u I( d k > t ) σ u u/2 t /σ u+1 exp{ (t /σ ) β } = σ u u/2 (2u log 2) (u+1)/2 e u log 2 by Lemma S5.6 ad for θk A 2 (r 1/p+1/2) A 2 1(r 1/p+1/2) = A (7.12) accordig to Lemma A.4 i Bochkia ad Sapatias (2006). Now we prove (7.11). B z u e zβ dz = x = 1/z] = 1/B 0 x u 2 e x β dx Fuctio x u 2 e x β icreases for x < β/(u + 2)] 1/β sice the Thus for all x 1/B < β/(u + 2)] 1/β B d dx (x u 2 e x β ) = e x β x u 3 (u + 2) + βx β ] < 0. z u e zβ dz = 1/B 0 x u 2 e x β dx B 1 B u+2 e Bβ = B u+1 e Bβ.
17 Thus Lemma S5.7 is proved. POINTWISE OPTIMALITY UNDER GENERAL LOSSES 17 Lemma S5.8. Let ϕ ad h be Studet s t distributios with desity (4.2) with ρ ad γ degrees of freedom respectively 0 < γ < ρ ϕ (x) = σ 1 ϕ(x/σ ) 0 < σ σ σ <. The if ν / 0 as the threshold t defied by (3.10) asymptotically satisfies 1 t /σ = (1 + o(1)) 0 1 β 1 0 β ] 1/(ρ+1) ν σ ( ) ν σ γ ] 1/(ρ γ) for some costats 0 1 > 0 depedig oly o γ ad ρ. β < 0 ν σ ν σ 0 β 0 ( ν σ ) ρ β > 0 ( ν σ ) ρ Proof of Lemma S5.8. We ca obtai the asymptotic expressio for the threshold followig the proof of Lemma 4(ii) i Pesky ad Sapatias (2007) uder the assumptios of the lemma that for some costat 0 > 0 ζ (x) = 0 ν σ F (x) with F (x) = (γ +ν 2 x2 ) (γ+1)/2 (ρ+x 2 /σ 2 )(ρ+1)/2 (1+ o(1)) (due to Lemma 2 i Pesky ad Sapatias (2007)) ad c ρ c 1 γ (1 + o(1)) x < σ / F (x) = c 1 γ ( x/σ ) ρ o(1)] σ / x 1/ν ( x/σ ) ρ+1 (ν x) γ o(1)] x > 1/ν where c k = k (k+1)/2. Sice the threshold satisfies β = ζ (t ) = 0 ν σ F (t ) we obtai that 1 t /σ = (1 + o(1)) 0 1 β 1 0 β ν σ which is the statemet of the lemma. β ν σ < 0 ] 1/(ρ+1) ν σ 0 β ( ) ] ν σ γ+1 1/(ρ γ) β ν σ ν σ 0 ( ν σ ) ρ+1 > 0 ( ν σ ) ρ+1
7.1 Convergence of sequences of random variables
Chapter 7 Limit Theorems Throughout this sectio we will assume a probability space (, F, P), i which is defied a ifiite sequece of radom variables (X ) ad a radom variable X. The fact that for every ifiite
More informationConvergence of random variables. (telegram style notes) P.J.C. Spreij
Covergece of radom variables (telegram style otes).j.c. Spreij this versio: September 6, 2005 Itroductio As we kow, radom variables are by defiitio measurable fuctios o some uderlyig measurable space
More information7.1 Convergence of sequences of random variables
Chapter 7 Limit theorems Throughout this sectio we will assume a probability space (Ω, F, P), i which is defied a ifiite sequece of radom variables (X ) ad a radom variable X. The fact that for every ifiite
More information(A sequence also can be thought of as the list of function values attained for a function f :ℵ X, where f (n) = x n for n 1.) x 1 x N +k x N +4 x 3
MATH 337 Sequeces Dr. Neal, WKU Let X be a metric space with distace fuctio d. We shall defie the geeral cocept of sequece ad limit i a metric space, the apply the results i particular to some special
More informationApplication to Random Graphs
A Applicatio to Radom Graphs Brachig processes have a umber of iterestig ad importat applicatios. We shall cosider oe of the most famous of them, the Erdős-Réyi radom graph theory. 1 Defiitio A.1. Let
More informationIntegrable Functions. { f n } is called a determining sequence for f. If f is integrable with respect to, then f d does exist as a finite real number
MATH 532 Itegrable Fuctios Dr. Neal, WKU We ow shall defie what it meas for a measurable fuctio to be itegrable, show that all itegral properties of simple fuctios still hold, ad the give some coditios
More informationLet us give one more example of MLE. Example 3. The uniform distribution U[0, θ] on the interval [0, θ] has p.d.f.
Lecture 5 Let us give oe more example of MLE. Example 3. The uiform distributio U[0, ] o the iterval [0, ] has p.d.f. { 1 f(x =, 0 x, 0, otherwise The likelihood fuctio ϕ( = f(x i = 1 I(X 1,..., X [0,
More informationLecture 19: Convergence
Lecture 19: Covergece Asymptotic approach I statistical aalysis or iferece, a key to the success of fidig a good procedure is beig able to fid some momets ad/or distributios of various statistics. I may
More informationChapter 6 Infinite Series
Chapter 6 Ifiite Series I the previous chapter we cosidered itegrals which were improper i the sese that the iterval of itegratio was ubouded. I this chapter we are goig to discuss a topic which is somewhat
More informationMASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 3 9/11/2013. Large deviations Theory. Cramér s Theorem
MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/5.070J Fall 203 Lecture 3 9//203 Large deviatios Theory. Cramér s Theorem Cotet.. Cramér s Theorem. 2. Rate fuctio ad properties. 3. Chage of measure techique.
More information1+x 1 + α+x. x = 2(α x2 ) 1+x
Math 2030 Homework 6 Solutios # [Problem 5] For coveiece we let α lim sup a ad β lim sup b. Without loss of geerality let us assume that α β. If α the by assumptio β < so i this case α + β. By Theorem
More informationSequences and Series of Functions
Chapter 6 Sequeces ad Series of Fuctios 6.1. Covergece of a Sequece of Fuctios Poitwise Covergece. Defiitio 6.1. Let, for each N, fuctio f : A R be defied. If, for each x A, the sequece (f (x)) coverges
More informationEcon 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1.
Eco 325/327 Notes o Sample Mea, Sample Proportio, Cetral Limit Theorem, Chi-square Distributio, Studet s t distributio 1 Sample Mea By Hiro Kasahara We cosider a radom sample from a populatio. Defiitio
More information62. Power series Definition 16. (Power series) Given a sequence {c n }, the series. c n x n = c 0 + c 1 x + c 2 x 2 + c 3 x 3 +
62. Power series Defiitio 16. (Power series) Give a sequece {c }, the series c x = c 0 + c 1 x + c 2 x 2 + c 3 x 3 + is called a power series i the variable x. The umbers c are called the coefficiets of
More informationNotes 5 : More on the a.s. convergence of sums
Notes 5 : More o the a.s. covergece of sums Math 733-734: Theory of Probability Lecturer: Sebastie Roch Refereces: Dur0, Sectios.5; Wil9, Sectio 4.7, Shi96, Sectio IV.4, Dur0, Sectio.. Radom series. Three-series
More informationMASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 19 11/17/2008 LAWS OF LARGE NUMBERS II THE STRONG LAW OF LARGE NUMBERS
MASSACHUSTTS INSTITUT OF TCHNOLOGY 6.436J/5.085J Fall 2008 Lecture 9 /7/2008 LAWS OF LARG NUMBRS II Cotets. The strog law of large umbers 2. The Cheroff boud TH STRONG LAW OF LARG NUMBRS While the weak
More informationMAS111 Convergence and Continuity
MAS Covergece ad Cotiuity Key Objectives At the ed of the course, studets should kow the followig topics ad be able to apply the basic priciples ad theorems therei to solvig various problems cocerig covergece
More information1 Convergence in Probability and the Weak Law of Large Numbers
36-752 Advaced Probability Overview Sprig 2018 8. Covergece Cocepts: i Probability, i L p ad Almost Surely Istructor: Alessadro Rialdo Associated readig: Sec 2.4, 2.5, ad 4.11 of Ash ad Doléas-Dade; Sec
More informationTopic 9: Sampling Distributions of Estimators
Topic 9: Samplig Distributios of Estimators Course 003, 2016 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be
More informationECONOMETRIC THEORY. MODULE XIII Lecture - 34 Asymptotic Theory and Stochastic Regressors
ECONOMETRIC THEORY MODULE XIII Lecture - 34 Asymptotic Theory ad Stochastic Regressors Dr. Shalabh Departmet of Mathematics ad Statistics Idia Istitute of Techology Kapur Asymptotic theory The asymptotic
More informationLecture 7: Properties of Random Samples
Lecture 7: Properties of Radom Samples 1 Cotiued From Last Class Theorem 1.1. Let X 1, X,...X be a radom sample from a populatio with mea µ ad variace σ
More informationSelf-normalized deviation inequalities with application to t-statistic
Self-ormalized deviatio iequalities with applicatio to t-statistic Xiequa Fa Ceter for Applied Mathematics, Tiaji Uiversity, 30007 Tiaji, Chia Abstract Let ξ i i 1 be a sequece of idepedet ad symmetric
More informationProduct measures, Tonelli s and Fubini s theorems For use in MAT3400/4400, autumn 2014 Nadia S. Larsen. Version of 13 October 2014.
Product measures, Toelli s ad Fubii s theorems For use i MAT3400/4400, autum 2014 Nadia S. Larse Versio of 13 October 2014. 1. Costructio of the product measure The purpose of these otes is to preset the
More information1 Introduction to reducing variance in Monte Carlo simulations
Copyright c 010 by Karl Sigma 1 Itroductio to reducig variace i Mote Carlo simulatios 11 Review of cofidece itervals for estimatig a mea I statistics, we estimate a ukow mea µ = E(X) of a distributio by
More informations = and t = with C ij = A i B j F. (i) Note that cs = M and so ca i µ(a i ) I E (cs) = = c a i µ(a i ) = ci E (s). (ii) Note that s + t = M and so
3 From the otes we see that the parts of Theorem 4. that cocer us are: Let s ad t be two simple o-egative F-measurable fuctios o X, F, µ ad E, F F. The i I E cs ci E s for all c R, ii I E s + t I E s +
More informationDistribution of Random Samples & Limit theorems
STAT/MATH 395 A - PROBABILITY II UW Witer Quarter 2017 Néhémy Lim Distributio of Radom Samples & Limit theorems 1 Distributio of i.i.d. Samples Motivatig example. Assume that the goal of a study is to
More informationTopic 9: Sampling Distributions of Estimators
Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be
More informationIt is always the case that unions, intersections, complements, and set differences are preserved by the inverse image of a function.
MATH 532 Measurable Fuctios Dr. Neal, WKU Throughout, let ( X, F, µ) be a measure space ad let (!, F, P ) deote the special case of a probability space. We shall ow begi to study real-valued fuctios defied
More information32 estimating the cumulative distribution function
32 estimatig the cumulative distributio fuctio 4.6 types of cofidece itervals/bads Let F be a class of distributio fuctios F ad let θ be some quatity of iterest, such as the mea of F or the whole fuctio
More informationThis exam contains 19 pages (including this cover page) and 10 questions. A Formulae sheet is provided with the exam.
Probability ad Statistics FS 07 Secod Sessio Exam 09.0.08 Time Limit: 80 Miutes Name: Studet ID: This exam cotais 9 pages (icludig this cover page) ad 0 questios. A Formulae sheet is provided with the
More informationThis section is optional.
4 Momet Geeratig Fuctios* This sectio is optioal. The momet geeratig fuctio g : R R of a radom variable X is defied as g(t) = E[e tx ]. Propositio 1. We have g () (0) = E[X ] for = 1, 2,... Proof. Therefore
More informationChapter 3. Strong convergence. 3.1 Definition of almost sure convergence
Chapter 3 Strog covergece As poited out i the Chapter 2, there are multiple ways to defie the otio of covergece of a sequece of radom variables. That chapter defied covergece i probability, covergece i
More informationKernel density estimator
Jauary, 07 NONPARAMETRIC ERNEL DENSITY ESTIMATION I this lecture, we discuss kerel estimatio of probability desity fuctios PDF Noparametric desity estimatio is oe of the cetral problems i statistics I
More informationTopic 9: Sampling Distributions of Estimators
Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be
More informationRandom Walks on Discrete and Continuous Circles. by Jeffrey S. Rosenthal School of Mathematics, University of Minnesota, Minneapolis, MN, U.S.A.
Radom Walks o Discrete ad Cotiuous Circles by Jeffrey S. Rosethal School of Mathematics, Uiversity of Miesota, Mieapolis, MN, U.S.A. 55455 (Appeared i Joural of Applied Probability 30 (1993), 780 789.)
More informationIntroduction to Extreme Value Theory Laurens de Haan, ISM Japan, Erasmus University Rotterdam, NL University of Lisbon, PT
Itroductio to Extreme Value Theory Laures de Haa, ISM Japa, 202 Itroductio to Extreme Value Theory Laures de Haa Erasmus Uiversity Rotterdam, NL Uiversity of Lisbo, PT Itroductio to Extreme Value Theory
More informationMa 4121: Introduction to Lebesgue Integration Solutions to Homework Assignment 5
Ma 42: Itroductio to Lebesgue Itegratio Solutios to Homework Assigmet 5 Prof. Wickerhauser Due Thursday, April th, 23 Please retur your solutios to the istructor by the ed of class o the due date. You
More informationLecture 10 October Minimaxity and least favorable prior sequences
STATS 300A: Theory of Statistics Fall 205 Lecture 0 October 22 Lecturer: Lester Mackey Scribe: Brya He, Rahul Makhijai Warig: These otes may cotai factual ad/or typographic errors. 0. Miimaxity ad least
More informationApproximations and more PMFs and PDFs
Approximatios ad more PMFs ad PDFs Saad Meimeh 1 Approximatio of biomial with Poisso Cosider the biomial distributio ( b(k,,p = p k (1 p k, k λ: k Assume that is large, ad p is small, but p λ at the limit.
More informationn=1 a n is the sequence (s n ) n 1 n=1 a n converges to s. We write a n = s, n=1 n=1 a n
Series. Defiitios ad first properties A series is a ifiite sum a + a + a +..., deoted i short by a. The sequece of partial sums of the series a is the sequece s ) defied by s = a k = a +... + a,. k= Defiitio
More informationUniversity of Colorado Denver Dept. Math. & Stat. Sciences Applied Analysis Preliminary Exam 13 January 2012, 10:00 am 2:00 pm. Good luck!
Uiversity of Colorado Dever Dept. Math. & Stat. Scieces Applied Aalysis Prelimiary Exam 13 Jauary 01, 10:00 am :00 pm Name: The proctor will let you read the followig coditios before the exam begis, ad
More informationLecture 3 The Lebesgue Integral
Lecture 3: The Lebesgue Itegral 1 of 14 Course: Theory of Probability I Term: Fall 2013 Istructor: Gorda Zitkovic Lecture 3 The Lebesgue Itegral The costructio of the itegral Uless expressly specified
More information6.3 Testing Series With Positive Terms
6.3. TESTING SERIES WITH POSITIVE TERMS 307 6.3 Testig Series With Positive Terms 6.3. Review of what is kow up to ow I theory, testig a series a i for covergece amouts to fidig the i= sequece of partial
More informationEconometrica Supplementary Material
Ecoometrica Supplemetary Material SUPPLEMENT TO NONPARAMETRIC INSTRUMENTAL VARIABLES ESTIMATION OF A QUANTILE REGRESSION MODEL : MATHEMATICAL APPENDIX (Ecoometrica, Vol. 75, No. 4, July 27, 1191 128) BY
More informationEntropy Rates and Asymptotic Equipartition
Chapter 29 Etropy Rates ad Asymptotic Equipartitio Sectio 29. itroduces the etropy rate the asymptotic etropy per time-step of a stochastic process ad shows that it is well-defied; ad similarly for iformatio,
More informationON POINTWISE BINOMIAL APPROXIMATION
Iteratioal Joural of Pure ad Applied Mathematics Volume 71 No. 1 2011, 57-66 ON POINTWISE BINOMIAL APPROXIMATION BY w-functions K. Teerapabolar 1, P. Wogkasem 2 Departmet of Mathematics Faculty of Sciece
More informationProblem Set 4 Due Oct, 12
EE226: Radom Processes i Systems Lecturer: Jea C. Walrad Problem Set 4 Due Oct, 12 Fall 06 GSI: Assae Gueye This problem set essetially reviews detectio theory ad hypothesis testig ad some basic otios
More informationA Proof of Birkhoff s Ergodic Theorem
A Proof of Birkhoff s Ergodic Theorem Joseph Hora September 2, 205 Itroductio I Fall 203, I was learig the basics of ergodic theory, ad I came across this theorem. Oe of my supervisors, Athoy Quas, showed
More informationProbability 2 - Notes 10. Lemma. If X is a random variable and g(x) 0 for all x in the support of f X, then P(g(X) 1) E[g(X)].
Probability 2 - Notes 0 Some Useful Iequalities. Lemma. If X is a radom variable ad g(x 0 for all x i the support of f X, the P(g(X E[g(X]. Proof. (cotiuous case P(g(X Corollaries x:g(x f X (xdx x:g(x
More informationFall 2013 MTH431/531 Real analysis Section Notes
Fall 013 MTH431/531 Real aalysis Sectio 8.1-8. Notes Yi Su 013.11.1 1. Defiitio of uiform covergece. We look at a sequece of fuctios f (x) ad study the coverget property. Notice we have two parameters
More informationEcon 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara
Poit Estimator Eco 325 Notes o Poit Estimator ad Cofidece Iterval 1 By Hiro Kasahara Parameter, Estimator, ad Estimate The ormal probability desity fuctio is fully characterized by two costats: populatio
More information1 = δ2 (0, ), Y Y n nδ. , T n = Y Y n n. ( U n,k + X ) ( f U n,k + Y ) n 2n f U n,k + θ Y ) 2 E X1 2 X1
8. The cetral limit theorems 8.1. The cetral limit theorem for i.i.d. sequeces. ecall that C ( is N -separatig. Theorem 8.1. Let X 1, X,... be i.i.d. radom variables with EX 1 = ad EX 1 = σ (,. Suppose
More informationSolutions to Tutorial 5 (Week 6)
The Uiversity of Sydey School of Mathematics ad Statistics Solutios to Tutorial 5 (Wee 6 MATH2962: Real ad Complex Aalysis (Advaced Semester, 207 Web Page: http://www.maths.usyd.edu.au/u/ug/im/math2962/
More informationMATH 472 / SPRING 2013 ASSIGNMENT 2: DUE FEBRUARY 4 FINALIZED
MATH 47 / SPRING 013 ASSIGNMENT : DUE FEBRUARY 4 FINALIZED Please iclude a cover sheet that provides a complete setece aswer to each the followig three questios: (a) I your opiio, what were the mai ideas
More informationInfinite Sequences and Series
Chapter 6 Ifiite Sequeces ad Series 6.1 Ifiite Sequeces 6.1.1 Elemetary Cocepts Simply speakig, a sequece is a ordered list of umbers writte: {a 1, a 2, a 3,...a, a +1,...} where the elemets a i represet
More informationMath 2784 (or 2794W) University of Connecticut
ORDERS OF GROWTH PAT SMITH Math 2784 (or 2794W) Uiversity of Coecticut Date: Mar. 2, 22. ORDERS OF GROWTH. Itroductio Gaiig a ituitive feel for the relative growth of fuctios is importat if you really
More informationMath 299 Supplement: Real Analysis Nov 2013
Math 299 Supplemet: Real Aalysis Nov 203 Algebra Axioms. I Real Aalysis, we work withi the axiomatic system of real umbers: the set R alog with the additio ad multiplicatio operatios +,, ad the iequality
More informationEFFECTIVE WLLN, SLLN, AND CLT IN STATISTICAL MODELS
EFFECTIVE WLLN, SLLN, AND CLT IN STATISTICAL MODELS Ryszard Zieliński Ist Math Polish Acad Sc POBox 21, 00-956 Warszawa 10, Polad e-mail: rziel@impagovpl ABSTRACT Weak laws of large umbers (W LLN), strog
More informationEECS564 Estimation, Filtering, and Detection Hwk 2 Solns. Winter p θ (z) = (2θz + 1 θ), 0 z 1
EECS564 Estimatio, Filterig, ad Detectio Hwk 2 Sols. Witer 25 4. Let Z be a sigle observatio havig desity fuctio where. p (z) = (2z + ), z (a) Assumig that is a oradom parameter, fid ad plot the maximum
More informationLecture Chapter 6: Convergence of Random Sequences
ECE5: Aalysis of Radom Sigals Fall 6 Lecture Chapter 6: Covergece of Radom Sequeces Dr Salim El Rouayheb Scribe: Abhay Ashutosh Doel, Qibo Zhag, Peiwe Tia, Pegzhe Wag, Lu Liu Radom sequece Defiitio A ifiite
More information17. Joint distributions of extreme order statistics Lehmann 5.1; Ferguson 15
17. Joit distributios of extreme order statistics Lehma 5.1; Ferguso 15 I Example 10., we derived the asymptotic distributio of the maximum from a radom sample from a uiform distributio. We did this usig
More informationAn Introduction to Randomized Algorithms
A Itroductio to Radomized Algorithms The focus of this lecture is to study a radomized algorithm for quick sort, aalyze it usig probabilistic recurrece relatios, ad also provide more geeral tools for aalysis
More informationON THE CHOICE OF m IN THE n OUT OF n BOOTSTRAP AND CONFIDENCE BOUNDS FOR EXTREMA
Statistica Siica 182008: Supplemet, S47 S57 ON THE CHOICE OF m IN THE OUT OF BOOTSTRAP AND CONFIDENCE BOUNDS FOR EXTREMA Peter J. Bickel ad Aat Sakov Uiversity of Califoria, Berkeley ad Taor Group 2. Appedix
More informationMathematical Methods for Physics and Engineering
Mathematical Methods for Physics ad Egieerig Lecture otes Sergei V. Shabaov Departmet of Mathematics, Uiversity of Florida, Gaiesville, FL 326 USA CHAPTER The theory of covergece. Numerical sequeces..
More informationResampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.
Jauary 1, 2019 Resamplig Methods Motivatio We have so may estimators with the property θ θ d N 0, σ 2 We ca also write θ a N θ, σ 2 /, where a meas approximately distributed as Oce we have a cosistet estimator
More informationBerry-Esseen bounds for self-normalized martingales
Berry-Essee bouds for self-ormalized martigales Xiequa Fa a, Qi-Ma Shao b a Ceter for Applied Mathematics, Tiaji Uiversity, Tiaji 30007, Chia b Departmet of Statistics, The Chiese Uiversity of Hog Kog,
More informationINFINITE SEQUENCES AND SERIES
11 INFINITE SEQUENCES AND SERIES INFINITE SEQUENCES AND SERIES 11.4 The Compariso Tests I this sectio, we will lear: How to fid the value of a series by comparig it with a kow series. COMPARISON TESTS
More informationTechnical Proofs for Homogeneity Pursuit
Techical Proofs for Homogeeity Pursuit bstract This is the supplemetal material for the article Homogeeity Pursuit, submitted for publicatio i Joural of the merica Statistical ssociatio. B Proofs B. Proof
More informationEE 4TM4: Digital Communications II Probability Theory
1 EE 4TM4: Digital Commuicatios II Probability Theory I. RANDOM VARIABLES A radom variable is a real-valued fuctio defied o the sample space. Example: Suppose that our experimet cosists of tossig two fair
More informationA sequence of numbers is a function whose domain is the positive integers. We can see that the sequence
Sequeces A sequece of umbers is a fuctio whose domai is the positive itegers. We ca see that the sequece,, 2, 2, 3, 3,... is a fuctio from the positive itegers whe we write the first sequece elemet as
More informationOn Random Line Segments in the Unit Square
O Radom Lie Segmets i the Uit Square Thomas A. Courtade Departmet of Electrical Egieerig Uiversity of Califoria Los Ageles, Califoria 90095 Email: tacourta@ee.ucla.edu I. INTRODUCTION Let Q = [0, 1] [0,
More informationChapter 7 Isoperimetric problem
Chapter 7 Isoperimetric problem Recall that the isoperimetric problem (see the itroductio its coectio with ido s proble) is oe of the most classical problem of a shape optimizatio. It ca be formulated
More informationTopics. Homework Problems. MATH 301 Introduction to Analysis Chapter Four Sequences. 1. Definition of convergence of sequences.
MATH 301 Itroductio to Aalysis Chapter Four Sequeces Topics 1. Defiitio of covergece of sequeces. 2. Fidig ad provig the limit of sequeces. 3. Bouded covergece theorem: Theorem 4.1.8. 4. Theorems 4.1.13
More informationMASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 6 9/23/2013. Brownian motion. Introduction
MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/5.070J Fall 203 Lecture 6 9/23/203 Browia motio. Itroductio Cotet.. A heuristic costructio of a Browia motio from a radom walk. 2. Defiitio ad basic properties
More informationDirection: This test is worth 250 points. You are required to complete this test within 50 minutes.
Term Test October 3, 003 Name Math 56 Studet Number Directio: This test is worth 50 poits. You are required to complete this test withi 50 miutes. I order to receive full credit, aswer each problem completely
More informationDefinition 4.2. (a) A sequence {x n } in a Banach space X is a basis for X if. unique scalars a n (x) such that x = n. a n (x) x n. (4.
4. BASES I BAACH SPACES 39 4. BASES I BAACH SPACES Sice a Baach space X is a vector space, it must possess a Hamel, or vector space, basis, i.e., a subset {x γ } γ Γ whose fiite liear spa is all of X ad
More information4. Partial Sums and the Central Limit Theorem
1 of 10 7/16/2009 6:05 AM Virtual Laboratories > 6. Radom Samples > 1 2 3 4 5 6 7 4. Partial Sums ad the Cetral Limit Theorem The cetral limit theorem ad the law of large umbers are the two fudametal theorems
More information5. Likelihood Ratio Tests
1 of 5 7/29/2009 3:16 PM Virtual Laboratories > 9. Hy pothesis Testig > 1 2 3 4 5 6 7 5. Likelihood Ratio Tests Prelimiaries As usual, our startig poit is a radom experimet with a uderlyig sample space,
More informationPROBLEM SET 5 SOLUTIONS 126 = , 37 = , 15 = , 7 = 7 1.
Math 7 Sprig 06 PROBLEM SET 5 SOLUTIONS Notatios. Give a real umber x, we will defie sequeces (a k ), (x k ), (p k ), (q k ) as i lecture.. (a) (5 pts) Fid the simple cotiued fractio represetatios of 6
More informationSequences A sequence of numbers is a function whose domain is the positive integers. We can see that the sequence
Sequeces A sequece of umbers is a fuctio whose domai is the positive itegers. We ca see that the sequece 1, 1, 2, 2, 3, 3,... is a fuctio from the positive itegers whe we write the first sequece elemet
More informationLecture 8: Convergence of transformations and law of large numbers
Lecture 8: Covergece of trasformatios ad law of large umbers Trasformatio ad covergece Trasformatio is a importat tool i statistics. If X coverges to X i some sese, we ofte eed to check whether g(x ) coverges
More informationSequences. Notation. Convergence of a Sequence
Sequeces A sequece is essetially just a list. Defiitio (Sequece of Real Numbers). A sequece of real umbers is a fuctio Z (, ) R for some real umber. Do t let the descriptio of the domai cofuse you; it
More informationMA131 - Analysis 1. Workbook 3 Sequences II
MA3 - Aalysis Workbook 3 Sequeces II Autum 2004 Cotets 2.8 Coverget Sequeces........................ 2.9 Algebra of Limits......................... 2 2.0 Further Useful Results........................
More informationLinear regression. Daniel Hsu (COMS 4771) (y i x T i β)2 2πσ. 2 2σ 2. 1 n. (x T i β y i ) 2. 1 ˆβ arg min. β R n d
Liear regressio Daiel Hsu (COMS 477) Maximum likelihood estimatio Oe of the simplest liear regressio models is the followig: (X, Y ),..., (X, Y ), (X, Y ) are iid radom pairs takig values i R d R, ad Y
More informationLECTURE 2 LEAST SQUARES CROSS-VALIDATION FOR KERNEL DENSITY ESTIMATION
Jauary 3 07 LECTURE LEAST SQUARES CROSS-VALIDATION FOR ERNEL DENSITY ESTIMATION Noparametric kerel estimatio is extremely sesitive to te coice of badwidt as larger values of result i averagig over more
More informationStat410 Probability and Statistics II (F16)
Some Basic Cocepts of Statistical Iferece (Sec 5.) Suppose we have a rv X that has a pdf/pmf deoted by f(x; θ) or p(x; θ), where θ is called the parameter. I previous lectures, we focus o probability problems
More informationExponential Functions and Taylor Series
MATH 4530: Aalysis Oe Expoetial Fuctios ad Taylor Series James K. Peterso Departmet of Biological Scieces ad Departmet of Mathematical Scieces Clemso Uiversity March 29, 2017 MATH 4530: Aalysis Oe Outlie
More informationChapter 6 Principles of Data Reduction
Chapter 6 for BST 695: Special Topics i Statistical Theory. Kui Zhag, 0 Chapter 6 Priciples of Data Reductio Sectio 6. Itroductio Goal: To summarize or reduce the data X, X,, X to get iformatio about a
More informationAssignment 5: Solutions
McGill Uiversity Departmet of Mathematics ad Statistics MATH 54 Aalysis, Fall 05 Assigmet 5: Solutios. Let y be a ubouded sequece of positive umbers satisfyig y + > y for all N. Let x be aother sequece
More informationLecture 6 Simple alternatives and the Neyman-Pearson lemma
STATS 00: Itroductio to Statistical Iferece Autum 06 Lecture 6 Simple alteratives ad the Neyma-Pearso lemma Last lecture, we discussed a umber of ways to costruct test statistics for testig a simple ull
More informationAnalytic Continuation
Aalytic Cotiuatio The stadard example of this is give by Example Let h (z) = 1 + z + z 2 + z 3 +... kow to coverge oly for z < 1. I fact h (z) = 1/ (1 z) for such z. Yet H (z) = 1/ (1 z) is defied for
More informationMath 113 Exam 3 Practice
Math Exam Practice Exam 4 will cover.-., 0. ad 0.. Note that eve though. was tested i exam, questios from that sectios may also be o this exam. For practice problems o., refer to the last review. This
More informationSingular Continuous Measures by Michael Pejic 5/14/10
Sigular Cotiuous Measures by Michael Peic 5/4/0 Prelimiaries Give a set X, a σ-algebra o X is a collectio of subsets of X that cotais X ad ad is closed uder complemetatio ad coutable uios hece, coutable
More informationDirection: This test is worth 150 points. You are required to complete this test within 55 minutes.
Term Test 3 (Part A) November 1, 004 Name Math 6 Studet Number Directio: This test is worth 10 poits. You are required to complete this test withi miutes. I order to receive full credit, aswer each problem
More informationProbability and Statistics
ICME Refresher Course: robability ad Statistics Staford Uiversity robability ad Statistics Luyag Che September 20, 2016 1 Basic robability Theory 11 robability Spaces A probability space is a triple (Ω,
More informationNotes 19 : Martingale CLT
Notes 9 : Martigale CLT Math 733-734: Theory of Probability Lecturer: Sebastie Roch Refereces: [Bil95, Chapter 35], [Roc, Chapter 3]. Sice we have ot ecoutered weak covergece i some time, we first recall
More informationLinear Elliptic PDE s Elliptic partial differential equations frequently arise out of conservation statements of the form
Liear Elliptic PDE s Elliptic partial differetial equatios frequetly arise out of coservatio statemets of the form B F d B Sdx B cotaied i bouded ope set U R. Here F, S deote respectively, the flux desity
More informationECE 901 Lecture 14: Maximum Likelihood Estimation and Complexity Regularization
ECE 90 Lecture 4: Maximum Likelihood Estimatio ad Complexity Regularizatio R Nowak 5/7/009 Review : Maximum Likelihood Estimatio We have iid observatios draw from a ukow distributio Y i iid p θ, i,, where
More informationST5215: Advanced Statistical Theory
ST525: Advaced Statistical Theory Departmet of Statistics & Applied Probability Tuesday, September 7, 2 ST525: Advaced Statistical Theory Lecture : The law of large umbers The Law of Large Numbers The
More informationBIRKHOFF ERGODIC THEOREM
BIRKHOFF ERGODIC THEOREM Abstract. We will give a proof of the poitwise ergodic theorem, which was first proved by Birkhoff. May improvemets have bee made sice Birkhoff s orgial proof. The versio we give
More information