Bickel Rosenblatt test

Similar documents
Can we do statistical inference in a non-asymptotic way? 1

Integral approximation by kernel smoothing

An Introduction to Nonstationary Time Series Analysis

O Combining cross-validation and plug-in methods - for kernel density bandwidth selection O

DEPARTMENT MATHEMATIK ARBEITSBEREICH MATHEMATISCHE STATISTIK UND STOCHASTISCHE PROZESSE

Model Specification Testing in Nonparametric and Semiparametric Time Series Econometrics. Jiti Gao

Nonparametric Regression

Nonparametric Inference via Bootstrapping the Debiased Estimator

Discussion Papers in Economics

Three Papers by Peter Bickel on Nonparametric Curve Estimation

Nonparametric estimation using wavelet methods. Dominique Picard. Laboratoire Probabilités et Modèles Aléatoires Université Paris VII

Test for Discontinuities in Nonparametric Regression

A New Test in Parametric Linear Models with Nonparametric Autoregressive Errors

Malaya J. Mat. 2(1)(2014) 35 42

Variance Function Estimation in Multivariate Nonparametric Regression

Nonparametric Regression. Badr Missaoui

Time Series and Forecasting Lecture 4 NonLinear Time Series

Nonparametric Function Estimation with Infinite-Order Kernels

Local linear multiple regression with variable. bandwidth in the presence of heteroscedasticity

Goodness-of-fit tests for the cure rate in a mixture cure model

Tail bound inequalities and empirical likelihood for the mean

A Bootstrap Test for Conditional Symmetry

Nonparametric regression with martingale increment errors

Estimation of a quadratic regression functional using the sinc kernel

Additive Isotonic Regression

Smooth simultaneous confidence bands for cumulative distribution functions

Spatially Smoothed Kernel Density Estimation via Generalized Empirical Likelihood

Convergence rates in weighted L 1 spaces of kernel density estimators for linear processes

Rank tests for short memory stationarity

Adaptive Kernel Estimation of The Hazard Rate Function

High Dimensional Empirical Likelihood for Generalized Estimating Equations with Dependent Data

Introduction to Regression

The Bootstrap: Theory and Applications. Biing-Shen Kuo National Chengchi University

Gaussian processes for inference in stochastic differential equations

Confidence intervals for kernel density estimation

University, Tempe, Arizona, USA b Department of Mathematics and Statistics, University of New. Mexico, Albuquerque, New Mexico, USA

Supplemental Material for KERNEL-BASED INFERENCE IN TIME-VARYING COEFFICIENT COINTEGRATING REGRESSION. September 2017

Modelling Non-linear and Non-stationary Time Series

Chapter 2: Resampling Maarten Jansen

USING A BIMODAL KERNEL FOR A NONPARAMETRIC REGRESSION SPECIFICATION TEST

Asymptotic Nonequivalence of Nonparametric Experiments When the Smoothness Index is ½

Defect Detection using Nonparametric Regression

A New Approach to Robust Inference in Cointegration

Estimation of the functional Weibull-tail coefficient

M O N A S H U N I V E R S I T Y

Non-parametric Inference and Resampling

Density estimators for the convolution of discrete and continuous random variables

Inference For High Dimensional M-estimates. Fixed Design Results

Nonparametric estimation of extreme risks from heavy-tailed distributions

Transformation and Smoothing in Sample Survey Data

Introduction to Regression

Empirical likelihood for average derivatives of hazard regression functions

STATS 200: Introduction to Statistical Inference. Lecture 29: Course review

Discussion Paper Series

Local linear multivariate. regression with variable. bandwidth in the presence of. heteroscedasticity

11. Bootstrap Methods

Introduction to Regression

A Conditional Approach to Modeling Multivariate Extremes

On variable bandwidth kernel density estimation

LIST OF PUBLICATIONS. 1. J.-P. Kreiss and E. Paparoditis, Bootstrap for Time Series: Theory and Applications, Springer-Verlag, New York, To appear.

Minimum Contrast Empirical Likelihood Manipulation. Testing for Regression Discontinuity Design

Optimal Estimation of a Nonsmooth Functional

Bootstrap. Director of Center for Astrostatistics. G. Jogesh Babu. Penn State University babu.

Single Index Quantile Regression for Heteroscedastic Data

Random Eigenvalue Problems Revisited

Random Matrix Eigenvalue Problems in Probabilistic Structural Mechanics

On a Nonparametric Notion of Residual and its Applications

Density estimation Nonparametric conditional mean estimation Semiparametric conditional mean estimation. Nonparametrics. Gabriel Montes-Rojas

A nonparametric method of multi-step ahead forecasting in diffusion processes

Shape constrained kernel density estimation

HAR Inference: Recommendations for Practice

Nonparametric Identi cation and Estimation of Truncated Regression Models with Heteroskedasticity

Smooth nonparametric estimation of a quantile function under right censoring using beta kernels

On detection of unit roots generalizing the classic Dickey-Fuller approach

Nonparametric confidence intervals. for receiver operating characteristic curves

Section 7: Local linear regression (loess) and regression discontinuity designs

Nonparametric estimation and symmetry tests for

Nonparametric Methods

Local Whittle Likelihood Estimators and Tests for non-gaussian Linear Processes

Limit theory and inference about conditional distributions

Statistical inference on Lévy processes

Acceleration of some empirical means. Application to semiparametric regression

Inference on distributions and quantiles using a finite-sample Dirichlet process

Error distribution function for parametrically truncated and censored data

Confidence Intervals in Ridge Regression using Jackknife and Bootstrap Methods

On Consistent Hypotheses Testing

Asymptotical distribution free test for parameter change in a diffusion model (joint work with Y. Nishiyama) Ilia Negri

Working Paper No Maximum score type estimators

Nonparametric Identification of a Binary Random Factor in Cross Section Data - Supplemental Appendix

Nonparametric Identification and Estimation of a Transformation Model

Weak convergence of Markov chain Monte Carlo II

Maximum Non-extensive Entropy Block Bootstrap

Detection of structural breaks in multivariate time series

Volume 03, Issue 6. Comparison of Panel Cointegration Tests

TESTING SERIAL CORRELATION IN SEMIPARAMETRIC VARYING COEFFICIENT PARTIALLY LINEAR ERRORS-IN-VARIABLES MODEL

Estimating a frontier function using a high-order moments method

An updated review of Goodness-of-Fit tests for regression models

Independent and conditionally independent counterfactual distributions

ECON 721: Lecture Notes on Nonparametric Density and Regression Estimation. Petra E. Todd

Bayesian Aggregation for Extraordinarily Large Dataset

Transcription:

University of Latvia 28.05.2011.

A classical Let X 1,..., X n be i.i.d. random variables with a continuous probability density function f. Consider a simple hypothesis H 0 : f = f 0 with a significance level α and completely specified f 0. Given the kernel density estimate f n (x) = 1 nh n n ( x Xi K i=1 h n ), where h n = h(n), a test statistic can be defined. The classical statistic [Bickel and Rosenblatt(1973)] = nh n ˆT br n [f n (x) f 0 (x)] 2 a(x)dx.

A smoothed modification To avoid bias problems a smoothed version of f 0, namely ) ( z (K hn f 0 )( ) = hn 1 K f 0 (z)dz, where is a convolution operator, is employed. And a(x) 1 is used as the arbitrary weight function, which leads to a modification of the statistic T n = nh d/2 n h n [f n (x) (K hn f 0 )(x)] 2 dx, and for composite hypothesis [fn T = n,ˆθ nhd/2 n (x) (K hn fˆθ )(x)] 2 dx.

Graphical representation 0.0 0.5 1.0 1.5 conv f n diff 2 0.5 0.0 0.5 1.0 1.5 Figure: Convolution, f n and the squared error.

An absolutely regular weakly dependent process Let (X t ) t Z, X t R be a strictly stationary process on a probability space (Ω, F, P). For any two σ-fields A and B F define the following measure of dependence β(a, B) := sup 1 2 I J P(A i B j ) P(A i )P(B j ), i=1 j=1 such that A i A i and B j B j, where i, j A i, B j Ω. Define F L J := σ(x k, J k L), when J L. Definition (X t ) t Z is called absolutely regular or β-mixing if β(n) = sup J Z β(f, J FJ+n ) 0, when n.

Asymptotic distribution of T n Theorem ([Neumann and Paparoditis(2000)]) If certain assumptions are fulfilled, then under H 0, (T n µ) d N(0, σ 2 ), where µ and σ 2 are σ 2 = 2 µ = hn d/2 [ f0 2 (x)dx K 2 (u)du, K(u)K(u + v)du] 2 dv.

Motivation (+) The test statistic can be used for: simple as well as composite hypothesis, independent and dependent identically distributed data without modification. ( ) No procedure for selecting the bandwidth h n.

Simulation study We define by f u the probability density function of the uniform U[0, 1] distribution f u = F u, F u = U[0, 1]. For the process (X t ) t Z we test the single hypothesis H 0 : f = f u versus H 1 : f f u. Suppose that a random variable X has a continuous cumulative distribution function F X, then F X (X ) = Y U[0, 1].

Alternatives close to U[0, 1] [Kallenberg and Ledwina(1995)] uses g 1 (x) = 1 + ρ cos(jπx), k g 2 (x) = exp θ j π j (x) ψ k (θ), j=1 with {π j } the orthonormal Legendre polynomials on [0, 1], ψ k (θ) = log 1 0 exp(θ φ(x))dx, θ Rk. 0.0 0.5 1.0 1.5 2.0 0.0 0.5 1.0 1.5 2.0 0.0 0.2 0.4 0.6 0.8 1.0 g 1 0.0 0.2 0.4 0.6 0.8 1.0 g 2

Simulated power for dependent (AR(1), θ = 0.3) data Table: T n percentage rejections of the true H 0 at 5% significance level with n = 20, 50, 100, 500, 1000 for AR(1) case with φ = 0.3 made with 10,000 replications; h = h 0n 1/4 ; kernel U(0, 1). h 0 n 0.005 0.01 0.02 0.03 0.05 0.1 0.15 0.2 0.25 0.3 20 6.53 6.10 5.42 4.68 4.26 2.97 2.39 1.86 1.48 1.08 50 6.26 5.97 5.31 5.31 4.59 3.59 2.91 2.48 2.07 1.63 100 6.02 5.77 4.98 4.82 4.44 3.69 3.40 3.01 2.56 2.33 500 5.91 5.94 6.00 5.21 5.20 4.34 3.91 3.53 3.13 2.85 1000 5.95 6.05 5.99 4.49 4.29 4.52 4.88 3.66 3.42 3.36 Table: AR(1) case with φ = 0.3. Simulated power for alternatives g 1 and g 2 with n = 50 and 10,000 replications; h = h 0n 1/4, kernel U(0, 1). h 0 ρ j 0.005 0.01 0.02 0.03 0.05 0.1 0.15 0.2 0.25 0.3 0.4 1 6.1 9.2 10.8 12.8 15.2 22.5 28.6 34.4 37.3 41.5 0.5 2 8.9 12.8 16.6 19.2 26.3 38.9 46.0 50.7 54.6 57.5 0.7 4 15.3 21.9 33.8 40.5 53.0 70.0 75.7 77.9 75.4 70.4 0.7 5 16.3 22.6 33.6 39.7 53.6 67.9 70.8 68.1 58.3 42.2 0.7 6 16.1 22.0 32.0 38.9 52.2 64.2 64.8 55.3 39.0 22.2 θ (0, 3) 25.1 25.3 25.2 24.5 27.6 34.6 40.1 45.3 49.8 53.7 (0,-0. 4) 7.8 11.1 15.4 17.1 23.2 33.4 38.4 42.5 44.8 47.0 (0.25,-0. 35) 9.3 11.7 15.5 17.4 23.7 33.8 40.2 45.7 48.5 51.6

Bandwidth selection for nonparametric kernel tests [Gao and Gijbels(2008)] consider a statistic ˆT n (h), similar to T n, for regression fit and derive Edgeworth expansions for size and power functions: α n (h) = P( ˆT n (h) > l α H 0 ) β n (h) = P( ˆT n (h) > l α H 1 ), and where l α is a simulated critical value of ˆT n (h). The Edgeworth expansions of α n (h) and β n (h) are then used to choose a suitable bandwidth β n (h ew ) = max β n(h), h H n(α) with H n (α) = {h : α c min < α n (h) < α + c min } for a small 0 < c min < α.

Gao an Gijbels used Edgeworth expansions for quadratic forms. [Bachmann and Dette(2005)] states that under H 0 (T n /nh) is a degenerate U statistic. For i.i.d. random variables and fixed bandwidth [Tenreiro(2005)] states that statistic I 2 n (h) = T n /h is a V statistic, In 2 (h) = 1 n Q h (X i, X j ), n i,j=1 Q h (u, v) = k(x, u, h)k(x, v, h)dx... [Fan and Linton(2003)] have derived Edgeworth expansions for a regression model specification test statistic, that is also a degenerate U statistic.

[Bachmann and Dette(2005)] T n nh 1 nh K 2 (x)dx [H h (f f 0 )] 2 (x)dx = U n + 2 n where Y i = (K h g h )(Z i ) E[K h g h (Z i )] and U n = 2 n 2 H n (Z i, Z j ) i<j i<j n ( ) 1 Y i + O P, n = 2 n 2 [K h (x Z i ) K h f (x)] [K h (x Z j ) K h f (x)] dx and U n is a degenerate U statistic. i=1

Bibliography I D. Bachmann and H. Dette. A note on the Bickel - Rosenblatt test in autoregressive time series. Stat. Probab. Lett., 74(3):221 234, 2005. P. J. Bickel and M. Rosenblatt. On some global measures ot the deviations of density function estimates. The Annals of Statistics, 1(6):1071 1095, 1973. Y. Fan and O. Linton. Some higher-order theory for a consistent non-parametric model specification test. J. Stat. Plann. Inference, 109(1-2):125 154, 2003. J. Gao and I. Gijbels. Bandwidth selection in nonparametric kernel testing. Journal of the American Statistical Association, 103(484):1584 1594, 2008. W. C. M. Kallenberg and T. Ledwina. Consistency and monte carlo simulation of a data driven version of smooth goodness-of-fit tests. Annals of Statistics, 23(5):1594 1608, 1995. M. H. Neumann and E. Paparoditis. On bootstrapping L 2 -type statistics in density testing. Statistics & Probability Letters, 50(2):137 147, 2000. C. Tenreiro. On the role played by the fixed bandwidth in the bickel-rosenblatt goodness-of-fit test. SORT, 29(2):201 216, 2005.

Thank you!