Tail bound inequalities and empirical likelihood for the mean

Similar documents
High Dimensional Empirical Likelihood for Generalized Estimating Equations with Dependent Data

Empirical Likelihood

Empirical likelihood-based methods for the difference of two trimmed means

SMOOTHED BLOCK EMPIRICAL LIKELIHOOD FOR QUANTILES OF WEAKLY DEPENDENT PROCESSES

Introduction to Self-normalized Limit Theory

Test Problems for Probability Theory ,

Phenomena in high dimensions in geometric analysis, random matrices, and computational geometry Roscoff, France, June 25-29, 2012

Math 152. Rumbos Fall Solutions to Assignment #12

Lecture 13: Subsampling vs Bootstrap. Dimitris N. Politis, Joseph P. Romano, Michael Wolf

STAT 512 sp 2018 Summary Sheet

PRINCIPLES OF STATISTICAL INFERENCE

Theoretical Statistics. Lecture 1.

Economics 520. Lecture Note 19: Hypothesis Testing via the Neyman-Pearson Lemma CB 8.1,

Notes on the Multivariate Normal and Related Topics

Bickel Rosenblatt test

Empirical likelihood and self-weighting approach for hypothesis testing of infinite variance processes and its applications

Mathematics Qualifying Examination January 2015 STAT Mathematical Statistics

Mathematical Methods for Neurosciences. ENS - Master MVA Paris 6 - Master Maths-Bio ( )

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.

Chapter 3. Point Estimation. 3.1 Introduction

Introduction to Machine Learning CMU-10701

A COARSENING OF THE STRONG MIXING CONDITION

Mathematics Ph.D. Qualifying Examination Stat Probability, January 2018

Better Bootstrap Confidence Intervals

SOLUTION FOR HOMEWORK 8, STAT 4372

Chapter 4. Theory of Tests. 4.1 Introduction

Unbiased Estimation. Binomial problem shows general phenomenon. An estimator can be good for some values of θ and bad for others.

Theory of Statistical Tests

Spring 2012 Math 541B Exam 1

STAT 200C: High-dimensional Statistics

E X A M. Probability Theory and Stochastic Processes Date: December 13, 2016 Duration: 4 hours. Number of pages incl.

Selected Exercises on Expectations and Some Probability Inequalities

ICSA Applied Statistics Symposium 1. Balanced adjusted empirical likelihood

Example continued. Math 425 Intro to Probability Lecture 37. Example continued. Example

DA Freedman Notes on the MLE Fall 2003

Statistics 135 Fall 2007 Midterm Exam

1 Glivenko-Cantelli type theorems

Outline of GLMs. Definitions

A Functional Central Limit Theorem for an ARMA(p, q) Process with Markov Switching

Exercises and Answers to Chapter 1

Adjusted Empirical Likelihood for Long-memory Time Series Models

Model Specification Testing in Nonparametric and Semiparametric Time Series Econometrics. Jiti Gao

Bootstrap (Part 3) Christof Seiler. Stanford University, Spring 2016, Stats 205

Expectation Maximization (EM) Algorithm. Each has it s own probability of seeing H on any one flip. Let. p 1 = P ( H on Coin 1 )

Continuous Random Variables. and Probability Distributions. Continuous Random Variables and Probability Distributions ( ) ( ) Chapter 4 4.

Empirical Likelihood Inference for Two-Sample Problems

University of California San Diego and Stanford University and

CS145: Probability & Computing

Entrance Exam, Real Analysis September 1, 2017 Solve exactly 6 out of the 8 problems

SYSM 6303: Quantitative Introduction to Risk and Uncertainty in Business Lecture 4: Fitting Data to Distributions

March 1, Florida State University. Concentration Inequalities: Martingale. Approach and Entropy Method. Lizhe Sun and Boning Yang.

HT Introduction. P(X i = x i ) = e λ λ x i

Tail and Concentration Inequalities

Simulation. Alberto Ceselli MSc in Computer Science Univ. of Milan. Part 4 - Statistical Analysis of Simulated Data

Additive functionals of infinite-variance moving averages. Wei Biao Wu The University of Chicago TECHNICAL REPORT NO. 535

Hypothesis testing: theory and methods

Continuous Random Variables. and Probability Distributions. Continuous Random Variables and Probability Distributions ( ) ( )

ST5215: Advanced Statistical Theory

Overall Plan of Simulation and Modeling I. Chapters

Jackknife Empirical Likelihood for the Variance in the Linear Regression Model

Formulas for probability theory and linear models SF2941

Link lecture - Lagrange Multipliers

Extreme Value Analysis and Spatial Extremes

f(x θ)dx with respect to θ. Assuming certain smoothness conditions concern differentiating under the integral the integral sign, we first obtain

P n. This is called the law of large numbers but it comes in two forms: Strong and Weak.

Information Theoretic Asymptotic Approximations for Distributions of Statistics

Bayesian Regularization

Exam C Solutions Spring 2005

2.6.3 Generalized likelihood ratio tests

Maximum Likelihood Estimation

Probability Theory and Statistics. Peter Jochumzen

Chapter 3: Unbiased Estimation Lecture 22: UMVUE and the method of using a sufficient and complete statistic

8 Laws of large numbers

Chapter 6. Estimation of Confidence Intervals for Nodal Maximum Power Consumption per Customer

Estimation of Quantiles

Time-Varying Parameters

Lecture 3: More on regularization. Bayesian vs maximum likelihood learning

Case study: stochastic simulation via Rademacher bootstrap

STA 2201/442 Assignment 2

( ) ( ) Monte Carlo Methods Interested in. E f X = f x d x. Examples:

ECE 275A Homework 7 Solutions

Moment Condition Selection for Dynamic Panel Data Models

Master s Written Examination

7 Influence Functions

PROBABILITY: LIMIT THEOREMS II, SPRING HOMEWORK PROBLEMS

Mathematical statistics

Theory and Applications of Stochastic Systems Lecture Exponential Martingale for Random Walk

Introduction to Estimation Methods for Time Series models Lecture 2

Random Variables. P(x) = P[X(e)] = P(e). (1)

STAT 200C: High-dimensional Statistics

Quantile-quantile plots and the method of peaksover-threshold

For iid Y i the stronger conclusion holds; for our heuristics ignore differences between these notions.

Composite Hypotheses and Generalized Likelihood Ratio Tests

Interval Estimation for AR(1) and GARCH(1,1) Models

Qualifying Exam CS 661: System Simulation Summer 2013 Prof. Marvin K. Nakayama

11 Survival Analysis and Empirical Likelihood

Stein s method and zero bias transformation: Application to CDO pricing

Statistics & Data Sciences: First Year Prelim Exam May 2018

Basic concepts of probability theory

Modelling the risk process

Transcription:

Tail bound inequalities and empirical likelihood for the mean Sandra Vucane 1 1 University of Latvia, Riga 29 th of September, 2011 Sandra Vucane (LU) Tail bound inequalities and EL for the mean 29.09.2011 1 / 21

Motivation Dealing with confidence intervals for mean for small samples sizes from highly skewed, heavy tailed probability distributions Rosenblum and van der Laan (2009) Claimed normal-based methods (z test, t test) and nonparametric bootstrap give poor coverage Provided alternative method using exponential type inequalities Idea: Compare the method proposed by Rosenblum and van der Laan with the empirical likelihood method. Sandra Vucane (LU) Tail bound inequalities and EL for the mean 29.09.2011 2 / 21

Bernstein s inequality X 1,..., X n independent, bounded r.v. P( X i EX i W ) = 1 v n D(X i) Bernstein s inequality (1927) For all x > 0 ( 1 n P n (X i EX i ) > x/ ) n ( 2 exp nx 2 ) 2(v + W nx/3) Alternatives: Bennett s inequality (1962), Hoeffding s inequality (1963), Berry-Esseen inequality (1941, 1942) Sandra Vucane (LU) Tail bound inequalities and EL for the mean 29.09.2011 3 / 21

Example - standard confidence intervals perform poorly Construction of random variable X is as follows: X = A + Y, A and Y are independent random variables, Y P ois(λ) P(A = 0) = 1 δ, P(A = t) = δ, where δ (0, 1) EX = EA + EY = tδ + λ, DX = DA + DY = t 2 δ(1 δ) + λ.!!!for simulations we choose δ = 0.01, t = 50, λ = 1. Sandra Vucane (LU) Tail bound inequalities and EL for the mean 29.09.2011 4 / 21

Example - standard confidence intervals perform poorly Sandra Vucane (LU) Tail bound inequalities and EL for the mean 29.09.2011 5 / 21

Example - standard confidence intervals perform poorly Table: Coverage accuracy for the mean of X from Example, α = 0.05 t B perc B norm B basic B stud Bernst n = 10 0.696 0.644 0.567 0.593 0.760 0.908 n = 20 0.518 0.520 0.477 0.477 0.636 0.848 n = 50 0.429 0.428 0.423 0.426 0.450 0.668 n = 100 0.626 0.623 0.626 0.626 0.619 0.652 n = 200 0.879 0.878 0.879 0.866 0.873 0.881 n = 500 0.905 0.928 0.903 0.875 0.954 0.994 Sandra Vucane (LU) Tail bound inequalities and EL for the mean 29.09.2011 6 / 21

Empirical likelihood method (EL) Empirical likelihood method was introduced by Art B. Owen in 1988. The idea - model data using distributions placing point masses on the data. L(F ) = n P (X = X i ) = n p i, n p i = 1.!!! Empirical likelihood method is the only nonparametric method that admits Bartlett adjustment. Sandra Vucane (LU) Tail bound inequalities and EL for the mean 29.09.2011 7 / 21

Empirical likelihood X 1,..., X n iid with EX i = µ 0 R. g(x, µ) such that E {g(x, µ)} = 0 (g(x i, µ) = X i µ); Empirical likelihood for µ : n L(µ) = P (X = X i ) = n L(µ) is maximized subject to constraints: p i p i 0, i p i = 1, i p i g(x i, µ) = 0. Sandra Vucane (LU) Tail bound inequalities and EL for the mean 29.09.2011 8 / 21

Empirical likelihood Empirical likelihood ratio statistic for µ R(µ) = L(µ) n L(ˆµ) = {1 + λ(µ)g(x i, µ)} 1, Theorem (Owen, 1988) X 1,..., X n i.i.d. with µ 0 <. Then W (µ 0 ) = 2 log R(µ 0 ) d χ 2 1. Sandra Vucane (LU) Tail bound inequalities and EL for the mean 29.09.2011 9 / 21

Bartlett correction Bartlett correction Simple correction of W (µ 0 ) with its mean E{W (µ 0 )} reduces coverage error from O(n 1 ) to O(n 2 ). Edgeworth series Approximate a probability distribution in terms of its cumulants. X 1,..., X n i.i.d. with mean θ 0 and finite variance σ 2. S n = n 1/2 (ˆθ θ 0 )/σ, where ˆθ = X. Edgeworth expansion for S n P(S n x) = Φ(x) + n 1/2 p 1 (x)φ(x) + n 1 p 2 (x)φ(x) +.... Sandra Vucane (LU) Tail bound inequalities and EL for the mean 29.09.2011 10 / 21

Example - standard confidence intervals perform poorly t B perc B norm B basic B stud Bernst n = 10 0.696 0.644 0.567 0.593 0.760 0.908 n = 20 0.518 0.520 0.477 0.477 0.636 0.848 n = 50 0.429 0.428 0.423 0.426 0.450 0.668 n = 100 0.626 0.623 0.626 0.626 0.619 0.652 n = 200 0.879 0.878 0.879 0.866 0.873 0.881 n = 500 0.905 0.928 0.903 0.875 0.954 0.994 Sandra Vucane (LU) Tail bound inequalities and EL for the mean 29.09.2011 11 / 21

Example - standard confidence intervals perform poorly t B perc B norm B basic B stud Bernst n = 10 0.696 0.644 0.567 0.593 0.760 0.908 n = 20 0.518 0.520 0.477 0.477 0.636 0.848 n = 50 0.429 0.428 0.423 0.426 0.450 0.668 n = 100 0.626 0.623 0.626 0.626 0.619 0.652 n = 200 0.879 0.878 0.879 0.866 0.873 0.881 n = 500 0.905 0.928 0.903 0.875 0.954 0.994 t EL EL B EL F EL F B Bernst n = 10 0.696 0.614 0.634 0.690 0.698 0.908 n = 20 0.518 0.522 0.546 0.570 0.581 0.848 n = 50 0.429 0.428 0.430 0.432 0.437 0.668 n = 100 0.626 0.611 0.613 0.611 0.614 0.652 n = 200 0.879 0.865 0.867 0.866 0.867 0.881 n = 500 0.905 0.933 0.944 0.935 0.945 0.994 Sandra Vucane (LU) Tail bound inequalities and EL for the mean 29.09.2011 11 / 21

Example: Lognormal distribution LogN(µ, σ 2 ) σ 2 = 0.5 σ 2 = 1 Sandra Vucane (LU) Tail bound inequalities and EL for the mean 29.09.2011 12 / 21

Example: Lognormal distribution LogN(µ, σ 2 ) σ 2 = 2 σ 2 t EL EL B EL F EL F B B perc B norm B basic B stud Bernst n = 10 0.5 0.894 0.856 0.871 0.893 0.902 0.858 0.852 0.839 0.931 0.978 1.0 0.853 0.822 0.841 0.859 0.873 0.816 0.811 0.772 0.912 0.942 2.0 0.757 0.743 0.763 0.791 0.808 0.739 0.719 0.675 0.877 0.876 5.0 0.518 0.524 0.546 0.566 0.582 0.495 0.466 0.421 0.781 0.663 n = 20 0.5 0.922 0.916 0.928 0.935 0.941 0.904 0.896 0.886 0.943 0.986 1.0 0.866 0.878 0.889 0.901 0.910 0.859 0.845 0.822 0.927 0.972 2.0 0.785 0.802 0.820 0.826 0.839 0.786 0.760 0.732 0.898 0.926 5.0 0.560 0.590 0.611 0.618 0.639 0.569 0.541 0.498 0.797 0.742 Sandra Vucane (LU) Tail bound inequalities and EL for the mean 29.09.2011 13 / 21

Summary for independent case Bernstein s method can be recommended for very sparse data usually with big variance Coverage accuracy for larger samples grow to 1, that makes the comparison more difficult For any distribution we can find such parameter W and v values that the coverage accuracy of Bernstein s method is 95%, but these parameter values usually don t satisfy assumptions Sandra Vucane (LU) Tail bound inequalities and EL for the mean 29.09.2011 14 / 21

Dependent case 1 Empirical likelihood Kitamura (1997) - empirical likelihood for weakly dependent processes Kitamura (1997) - Bartlett correction for blockwise empirical likelihood of the smooth function Nordman et al. (2007) - empirical likelihood for the mean of long-range dependent processes 2 Exponential type inequalities Bosq (1996) Dedecker, J., Doukhan, P., Lang, G., León R., J. R., Louhichi, S. and Prieur, C. (2007) Work in progress: comparison for weakly dependent processes Sandra Vucane (LU) Tail bound inequalities and EL for the mean 29.09.2011 15 / 21

Empirical likelihood for weakly dependent processes {X t } R d valued stationary stochastic process that satisfy strong mixing condition α X (k) 0 as k ; Parameter of interest θ 0 Θ R p, r p; Estimating equation f : R d Θ R r and E {f(x t, θ 0 )} = 0; M length of block, L block number Q = [(N M)/L] + 1 and A N = QM/N; T i (θ) = φ M (B i, θ), where B i is the ith block of observations and mapping φ M : R rm Θ R r has following form φ M (B i, θ) = M f(x (i 1)L+n, Θ)/M Sandra Vucane (LU) Tail bound inequalities and EL for the mean 29.09.2011 16 / 21

Empirical likelihood for weakly dependent processes Empirical likelihood for θ : L(θ) = Q L(θ) is maximized subject to constraints: p i p i > 0, i p i = 1, i p i T i (θ) = 0. Blockwise empirical log-likelihood ratio for θ Q 2 log (L(θ)/L F ) = 2 log (1 + γ N (θ) T i (θ)). Theorem (Kitamura, 1997) Q 1 LR 0 = 2A N log (1 + γ N (θ) T i (θ)) d χ 2 r p. Sandra Vucane (LU) Tail bound inequalities and EL for the mean 29.09.2011 17 / 21

Simulations AR(1) with X t = 0.3X t 1 + ε t N/M 5 10 20 50 200 0.973 0.923 0.852 0.735 500 0.975 0.949 0.909 0.862 1000 0.981 0.948 0.931 0.888 Sandra Vucane (LU) Tail bound inequalities and EL for the mean 29.09.2011 18 / 21

Exponential type inequalities Bosq (1996) Let (X t, t Z) be a zero-mean real-valued process such that sup 1 t n X t b. Then for each integer q [1, n 2 ] and each ε > 0 ( ) ( P( S n > nε) 4 exp ε2 q 8b 2 + 22 1 + 4b ) 1/2 ([ ]) n qα. ε 2q Doukhan (1995) Let (X t, t N) be a zero-mean real-valued process such that σ 2 R +, 1 n, m N : m (X n +... + X n+m ) 2 σ 2 and t N : X t M. Then for each ε > 0, θ = ε2 4 and q ( n 1+θ (1 ε)x 2 ) P( S n x) 4 exp 2(nσ 2 + 2 n β( qθ 1) q. + qmx/3)!!!problem - estimation of mixing coefficients McDonald et al (2011)-method for estimating beta-mixing coefficients Sandra Vucane (LU) Tail bound inequalities and EL for the mean 29.09.2011 19 / 21

Exponential type inequalities Bennett-type inequality Let (X i ) i>0 such that X i, and M i = σ(x k, 1 k i). Let S k = k (X i E(X i )) and S n = max 1 k n S k. Let q be some positive integer, v q some nonnegative number such that [n/q] v q X q[n/q]+1 +... + X n 2 2 + X (i 1)q+1 +... + X iq 2 2 and h(x) = (1 + x) log (1 + x) x. Then for λ > 0, ( P( S n 3λ) 4 exp v ( )) q λqm (qm) 2 h + n λ τ 1,q(q + 1).!!!Problem - estimation of mixing coefficient v q Sandra Vucane (LU) Tail bound inequalities and EL for the mean 29.09.2011 20 / 21

Thank you for your attention! Sandra Vucane (LU) Tail bound inequalities and EL for the mean 29.09.2011 21 / 21