Double Bootstrap Confidence Intervals in the Two Stage DEA approach. Essex Business School University of Essex

Similar documents
On the Accuracy of Bootstrap Confidence Intervals for Efficiency Levels in Stochastic Frontier Models with Panel Data

Confidence Intervals in Ridge Regression using Jackknife and Bootstrap Methods

Supplement to Quantile-Based Nonparametric Inference for First-Price Auctions

Marginal Screening and Post-Selection Inference

The Nonparametric Bootstrap

Constrained estimation for binary and survival data

The bootstrap. Patrick Breheny. December 6. The empirical distribution function The bootstrap

A better way to bootstrap pairs

Analysis of Type-II Progressively Hybrid Censored Data

Statistics - Lecture One. Outline. Charlotte Wickham 1. Basic ideas about estimation

ST495: Survival Analysis: Hypothesis testing and confidence intervals

Least Absolute Value vs. Least Squares Estimation and Inference Procedures in Regression Models with Asymmetric Error Distributions

Long-Run Covariability

Field Course Descriptions

Interval Estimation III: Fisher's Information & Bootstrapping

LM threshold unit root tests

Monte Carlo Integration

Pubh 8482: Sequential Analysis

Bootstrapping Spring 2014

Exact Inference for the Two-Parameter Exponential Distribution Under Type-II Hybrid Censoring

The comparative studies on reliability for Rayleigh models

Nonparametric Methods II

Supporting Information for Estimating restricted mean. treatment effects with stacked survival models

Random Numbers and Simulation

Characterizing Forecast Uncertainty Prediction Intervals. The estimated AR (and VAR) models generate point forecasts of y t+s, y ˆ

IV Quantile Regression for Group-level Treatments, with an Application to the Distributional Effects of Trade

Likelihood-based inference with missing data under missing-at-random

Drawing Inferences from Statistics Based on Multiyear Asset Returns

Supplemental material to accompany Preacher and Hayes (2008)

Data Analysis and Statistical Methods Statistics 651

arxiv: v5 [stat.me] 13 Feb 2018

A Note on the Scale Efficiency Test of Simar and Wilson

Quantile regression and heteroskedasticity

Model Selection, Estimation, and Bootstrap Smoothing. Bradley Efron Stanford University

Post-exam 2 practice questions 18.05, Spring 2014

Confidence Distribution

Bootstrap Testing in Econometrics

Finite Population Correction Methods

Exponentiated Rayleigh Distribution: A Bayes Study Using MCMC Approach Based on Unified Hybrid Censored Data

Gravity Models, PPML Estimation and the Bias of the Robust Standard Errors

Biost 518 Applied Biostatistics II. Purpose of Statistics. First Stage of Scientific Investigation. Further Stages of Scientific Investigation

Personalized Treatment Selection Based on Randomized Clinical Trials. Tianxi Cai Department of Biostatistics Harvard School of Public Health

STAT 704 Sections IRLS and Bootstrap

11. Bootstrap Methods

Wavelet Methods for Time Series Analysis. Part IV: Wavelet-Based Decorrelation of Time Series

Analysis of Regression and Bayesian Predictive Uncertainty Measures

Confidence intervals for kernel density estimation

BOOTSTRAPPING DIFFERENCES-IN-DIFFERENCES ESTIMATES

STAT Section 2.1: Basic Inference. Basic Definitions

The Number of Bootstrap Replicates in Bootstrap Dickey-Fuller Unit Root Tests

Estimation of Operational Risk Capital Charge under Parameter Uncertainty

A Resampling Method on Pivotal Estimating Functions

Independent and conditionally independent counterfactual distributions

A Comparison of Approaches to Estimating the Time-Aggregated Uncertainty of Savings Estimated from Meter Data

A Survey of Stochastic Frontier Models and Likely Future Developments

Some New Aspects of Dose-Response Models with Applications to Multistage Models Having Parameters on the Boundary

MA 575 Linear Models: Cedric E. Ginestet, Boston University Non-parametric Inference, Polynomial Regression Week 9, Lecture 2

Quantifying Weather Risk Analysis

Reliability of inference (1 of 2 lectures)

Inference in Nonparametric Series Estimation with Data-Dependent Number of Series Terms

ST745: Survival Analysis: Nonparametric methods

Estimation and Hypothesis Testing in LAV Regression with Autocorrelated Errors: Is Correction for Autocorrelation Helpful?

Bootstrapping the Grainger Causality Test With Integrated Data

Volume 03, Issue 6. Comparison of Panel Cointegration Tests

M O N A S H U N I V E R S I T Y

Chapter 9. Bootstrap Confidence Intervals. William Q. Meeker and Luis A. Escobar Iowa State University and Louisiana State University

STATISTICAL INFERENCE IN ACCELERATED LIFE TESTING WITH GEOMETRIC PROCESS MODEL. A Thesis. Presented to the. Faculty of. San Diego State University

Monte Carlo Integration I [RC] Chapter 3

SOME HISTORY OF STOCHASTIC PROGRAMMING

Slack and Net Technical Efficiency Measurement: A Bootstrap Approach

XLVII SIMPÓSIO BRASILEIRO DE PESQUISA OPERACIONAL

Finite Sample Performance of A Minimum Distance Estimator Under Weak Instruments

ORIGINS OF STOCHASTIC PROGRAMMING

Sequential Implementation of Monte Carlo Tests with Uniformly Bounded Resampling Risk

Bootstrap, Jackknife and other resampling methods

Part III. A Decision-Theoretic Approach and Bayesian testing

Small area prediction based on unit level models when the covariate mean is measured with error

Does low participation in cohort studies induce bias? Additional material

Bootstrapping Heteroskedasticity Consistent Covariance Matrix Estimator

Application of Bootstrap Techniques for the Estimation of Target Decomposition Parameters in RADAR Polarimetry

Approximate and Fiducial Confidence Intervals for the Difference Between Two Binomial Proportions

GOODNESS OF FIT TESTS IN STOCHASTIC FRONTIER MODELS. Christine Amsler Michigan State University

Discussant: Lawrence D Brown* Statistics Department, Wharton, Univ. of Penn.

Rank conditional coverage and confidence intervals in high dimensional problems

Double Bootstrap Confidence Interval Estimates with Censored and Truncated Data

Bootstrapping Australian inbound tourism

Simulation. Where real stuff starts

Testing for structural breaks in discrete choice models

Confidence Intervals for the Process Capability Index C p Based on Confidence Intervals for Variance under Non-Normality

Bootstrap and Parametric Inference: Successes and Challenges

Lecture 4: Heteroskedasticity

7 Estimation. 7.1 Population and Sample (P.91-92)

Integrated likelihoods in survival models for highlystratified

Bootstrap & Confidence/Prediction intervals

Higher-Order von Mises Expansions, Bagging and Assumption-Lean Inference

Assessing the effect of a partly unobserved, exogenous, binary time-dependent covariate on -APPENDIX-

Contents. Part I: Fundamentals of Bayesian Inference 1

Analysis of incomplete data in presence of competing risks

Bootstrap prediction intervals for factor models

TESTING FOR NORMALITY IN THE LINEAR REGRESSION MODEL: AN EMPIRICAL LIKELIHOOD RATIO TEST

Transcription:

Double Bootstrap Confidence Intervals in the Two Stage DEA approach D.K. Chronopoulos, C. Girardone and J.C. Nankervis Essex Business School University of Essex 1

Determinants of efficiency DEA can be a useful tool in the hands of managers identify best practices. Efficiency levels might reflect not only the ability of the management, but the effects of contextual factors on firm s performance, as well. A second stage regression analysis on efficiency estimates can help quantify these effects. Understanding these relationships can be of help to: Managers improve firm s performance. Policy makers better assess cost of regulation. 2

Second Stage Regression: The problem The dependency problem: Efficiency measures estimated with DEA are dependent on each other by definition. (The estimator has a convergence rate of 2 p q 1 n + + ). This dependency disappears asymptotically, but generally at a rate slower than the usual n achieved by the truncated or censored MLE. Conventional inference procedures are invalid, when dimensionality of production is greater than 3 ( p + q > 3) (Xue and Harker 1999; Simar and Wilson 2007). The suggested solution: Bootstrap confidence intervals (Simar and Wilson 2007) 3

Aims Examine the convergence properties of the coverage rates of the alternative bootstrap confidence intervals estimators. Investigate the coverage accuracy of double bootstrap confidence intervals. Provide a less computationally demanding algorithm for constructing double bootstrap confidence intervals. 4

Data Generating Process A firm faces an environmental variable Z ~ N (2,4). Given Z, the production efficiency level δ is drawn from f( δ / Z). The conditioning operates through this mechanism δ = Zβ + ε [1], where ε ~ truncatedn(0,1), with left truncation at 1 Zβ. The input(s) are distributed as x U ) y P 1 3/4 xp p 1 P 1 3/ 4 xp p 1 = δ. = p ~ (6,16. We distinguish between single and multi output technologies: Single output: Multi output: ζ = δ. = If 2 then draw α U ) l 1 If Q 2 then additionally draw α ~ U (0,1 α ), for each Q = 1 ~ (0,1. l = 2,..., Q 1. l k1 K Then the output mix is given by yq = αζ q and q= 1,..., Q 1. Q Q 1 = (1 ) k= 1 k, for y α ζ 5

Step 1: Step 2: Step 3: Bootstrap Confidence Intervals Estimate the efficiency levels ˆ δ. Regress ˆ δ on the environmental variable Z using the truncated regression model to obtain ˆβ and σ ˆε estimates. * * Construct pseudo ˆ δ by drawing ε from the parametric distribution of the * errors truncated N 0, ˆ σ ) such that ˆ* δ = Z ˆ β + ε. A bootstrap estimate ( ε * of the parameter of interest is obtained by regressing ˆ δ on Z and denoted ˆ* β. Repeat the procedure J times. The basic bootstrap CI is given by: ˆ β ( ˆ β ˆ β), ˆ β ( ˆ β ˆ β) * * (1 α)( J+ 1) ( α( J+ 1)) The percentile bootstrap CI is given by: * * ˆ ˆ β( α( J 1), β + (1 α)( J+ 1) 6

Double Bootstrap Confidence Intervals Frequently the nominal coverage probability of the bootstrap CI differs from the true one. Step 4: Step 5: For each set of single bootstrap estimates construct a double bootstrap ** * ** ** sample ˆk δ = Zβ + ε k. Again use the truncated regression to obtain ˆ β k. Repeat the process K times. ** * Compute the statistic: U #( ˆ 2 ˆ ˆ = β k β β) K for the basic CI or ˆ** U = #( β ˆ β) K for the percentile CI. k The basic double bootstrap CI is given by: ˆ β ( ˆ β ˆ β), ˆ β ( ˆ β ˆ β) * * ( U ( J+ 1)) ( U ( J+ 1)) (1 a)( J+ 1) ( α ( J+ 1) The percentile double bootstrap CI is given by: ˆ β, α J β * * ( U ( J+ 1)) ( U ( J+ 1) ) ( ( + 1) ((1 α)( J+ 1) 7

The 25 th and 26 th values are the upper and lower bounds of and respectively. Stopping rules for double bootstrap Suppose J = 999 then U (25) and U(975) are required. Start with calculating 50 U and sort them in an increasing order. If the is greater than the current bound of and smaller than U then not all U 51 U (25) (975) K double bootstrap estimations are required. U (25) U(975) 8

Monte Carlo evidence - Single bootstrap Table 1. Estimated coverages of confidence intervals generated by conventio single bootstrap methods n Basic Boot. Alg.- Nominal significance Percentile Boot. Alg.- Nominal significance Asympt. Normal Apr.- Nominal significance 0.90 0.95 0.90 0.95 0.90 0.95 p = q = 1 100 0.83 0.89 0.88 0.94 0.85 0.90 200 0.86 0.91 0.88 0.93 0.87 0.92 400 0.88 0.93 0.90 0.94 0.89 0.93 1200 5000 10000 15000 100 0.88 0.94 0.89 0.94 0.89 0.94 0.90 0.95 0.90 0.95 0.90 0.95 0.90 0.96 0.90 0.96 0.90 0.96 0.91 0.95 0.91 0.95 0.91 0.95 0.76 0.81 0.82 0.88 - - 200 0.80 0.84 0.83 0.89 - - 400 0.80 0.87 0.83 0.90 - - 1200 0.83 0.88 0.85 0.90 - - 5000 0.82 0.90 0.84 0.91 - - 10000 0.85 0.91 0.85 0.92 - - 15000 0.87 0.93 0.88 0.93 - - 100 0.69 0.73 0.79 0.85 - - 200 0.76 0.80 0.81 0.87 - - 400 0.72 0.78 0.77 0.85 - - 1200 0.69 0.79 0.73 0.83 - - 5000 0.65 0.75 0.66 0.77 - - 10000 0.58 0.71 0.59 0.74 - - 15000 0.60 0.70 0.60 0.71 - - Notes: Results based on 1,000 Monte Carlo trials p = q = 2 p = q = 3 9

Monte Carlo evidence - Double bootstrap Table 2. Estimated coverages of confidence intervals generated by percentile single and double bootstrap methods n Percentile Single Boot. Nominal significance Percentile Double Boot. Nominal significance 0.90 0.95 0.90 0.95 p = q = 1 100 0.88 0.94 0.90 0.95 200 0.88 0.93 0.90 0.95 400 0.90 0.94 0.91 0.96 p = q = 2 100 0.82 0.88 0.86 0.93 200 0.83 0.89 0.86 0.92 400 0.83 0.90 0.86 0.93 p = q = 3 100 0.79 0.85 0.83 0.90 200 0.81 0.87 0.85 0.92 400 0.77 0.85 0.80 0.90 Notes: Results based on 1,000 Monte Carlo trials 10

Conclusions Correlation of efficiency estimates disappears, but not fast enough. Need for alternative inference making method. Bootstrap offers a good alternative, but single bootstrap CIs do not have good coverage rates (the dimensionality problem of the efficiency estimator carries over to the second stage regression). Double bootstrap offers a significant improvement but at a considerable computational cost. This computational burden can be reduced by adopting deterministic stopping rules (in the spirit of Nankervis(2005)). 11