BOOTSTRAP METHOD FOR TESTING OF EQUALITY OF SEVERAL MEANS. M. Krishna Reddy, B. Naveen Kumar and Y. Ramu

Similar documents
Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Statistics II Final Exam 26/6/18

Statistics for Economics & Business

Joint Statistical Meetings - Biopharmaceutical Section

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6

Basic Business Statistics, 10/e

Lecture 6 More on Complete Randomized Block Design (RBD)

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands

Chapter 11: Simple Linear Regression and Correlation

x = , so that calculated

Department of Statistics University of Toronto STA305H1S / 1004 HS Design and Analysis of Experiments Term Test - Winter Solution

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation

/ n ) are compared. The logic is: if the two

Answers Problem Set 2 Chem 314A Williamsen Spring 2000

Durban Watson for Testing the Lack-of-Fit of Polynomial Regression Models without Replications

CHAPTER 6 GOODNESS OF FIT AND CONTINGENCY TABLE PREPARED BY: DR SITI ZANARIAH SATARI & FARAHANIM MISNI

STATISTICS QUESTIONS. Step by Step Solutions.

UCLA STAT 13 Introduction to Statistical Methods for the Life and Health Sciences. Chapter 11 Analysis of Variance - ANOVA. Instructor: Ivo Dinov,

Statistics for Business and Economics

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA

LINEAR REGRESSION ANALYSIS. MODULE VIII Lecture Indicator Variables

experimenteel en correlationeel onderzoek

Simulated Power of the Discrete Cramér-von Mises Goodness-of-Fit Tests

STAT 3008 Applied Regression Analysis

Topic- 11 The Analysis of Variance

Chapter 8 Indicator Variables

Comparison of Regression Lines

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4)

Lecture 4 Hypothesis Testing

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

1-FACTOR ANOVA (MOTIVATION) [DEVORE 10.1]

Statistics Chapter 4

7.1. Single classification analysis of variance (ANOVA) Why not use multiple 2-sample 2. When to use ANOVA

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U)

Economics 130. Lecture 4 Simple Linear Regression Continued

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Chapter 13: Multiple Regression

Chapter 14 Simple Linear Regression

Topic 23 - Randomized Complete Block Designs (RCBD)

Lecture 6: Introduction to Linear Regression

A Robust Method for Calculating the Correlation Coefficient

2016 Wiley. Study Session 2: Ethical and Professional Standards Application

Comparison of the Population Variance Estimators. of 2-Parameter Exponential Distribution Based on. Multiple Criteria Decision Making Method

Modeling and Simulation NETW 707

Chapter 11: I = 2 samples independent samples paired samples Chapter 12: I 3 samples of equal size J one-way layout two-way layout

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition)

18. SIMPLE LINEAR REGRESSION III

Scatter Plot x

x yi In chapter 14, we want to perform inference (i.e. calculate confidence intervals and perform tests of significance) in this setting.

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

ANOVA. The Observations y ij

Interval Estimation in the Classical Normal Linear Regression Model. 1. Introduction

Statistical Hypothesis Testing for Returns to Scale Using Data Envelopment Analysis

Correlation and Regression. Correlation 9.1. Correlation. Chapter 9

Copyright 2017 by Taylor Enterprises, Inc., All Rights Reserved. Adjusted Control Limits for P Charts. Dr. Wayne A. Taylor

Chapter 12 Analysis of Covariance

2E Pattern Recognition Solutions to Introduction to Pattern Recognition, Chapter 2: Bayesian pattern classification

28. SIMPLE LINEAR REGRESSION III

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore

ANSWERS CHAPTER 9. TIO 9.2: If the values are the same, the difference is 0, therefore the null hypothesis cannot be rejected.

Copyright 2017 by Taylor Enterprises, Inc., All Rights Reserved. Adjusted Control Limits for U Charts. Dr. Wayne A. Taylor

Introduction to Regression

Lecture 16 Statistical Analysis in Biomaterials Research (Part II)

Introduction to Analysis of Variance (ANOVA) Part 1

Methods of Detecting Outliers in A Regression Analysis Model.

Reduced slides. Introduction to Analysis of Variance (ANOVA) Part 1. Single factor

UNIVERSITY OF TORONTO Faculty of Arts and Science. December 2005 Examinations STA437H1F/STA1005HF. Duration - 3 hours

# c i. INFERENCE FOR CONTRASTS (Chapter 4) It's unbiased: Recall: A contrast is a linear combination of effects with coefficients summing to zero:

Introduction to Generalized Linear Models

STAT 511 FINAL EXAM NAME Spring 2001

Statistics for Managers Using Microsoft Excel/SPSS Chapter 14 Multiple Regression Models

x i1 =1 for all i (the constant ).

A nonparametric two-sample wald test of equality of variances

[The following data appear in Wooldridge Q2.3.] The table below contains the ACT score and college GPA for eight college students.

BOOTSTRAP METHOD FOR TESTING THE EQUALITY OF MEANS: IN CASE OF HETEROSCEDASTICITY

Lecture Notes for STATISTICAL METHODS FOR BUSINESS II BMGT 212. Chapters 14, 15 & 16. Professor Ahmadi, Ph.D. Department of Management

Here is the rationale: If X and y have a strong positive relationship to one another, then ( x x) will tend to be positive when ( y y)

Biostatistics 360 F&t Tests and Intervals in Regression 1

Chapter 15 Student Lecture Notes 15-1

F statistic = s2 1 s 2 ( F for Fisher )

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers

A Method for Analyzing Unreplicated Experiments Using Information on the Intraclass Correlation Coefficient

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

Chapter 5: Hypothesis Tests, Confidence Intervals & Gauss-Markov Result

SIMPLE LINEAR REGRESSION

Boostrapaggregating (Bagging)

a. (All your answers should be in the letter!

Resource Allocation and Decision Analysis (ECON 8010) Spring 2014 Foundations of Regression Analysis

Negative Binomial Regression

Online Appendix to: Axiomatization and measurement of Quasi-hyperbolic Discounting

PHYS 450 Spring semester Lecture 02: Dealing with Experimental Uncertainties. Ron Reifenberger Birck Nanotechnology Center Purdue University

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Learning Objectives for Chapter 11

Using Multivariate Rank Sum Tests to Evaluate Effectiveness of Computer Applications in Teaching Business Statistics

Appendix B: Resampling Algorithms

Professor Chris Murray. Midterm Exam

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction

Stat 543 Exam 2 Spring 2016

Transcription:

BOOTSTRAP METHOD FOR TESTING OF EQUALITY OF SEVERAL MEANS M. Krshna Reddy, B. Naveen Kumar and Y. Ramu Department of Statstcs, Osmana Unversty, Hyderabad -500 007, Inda. nanbyrozu@gmal.com, ramu0@gmal.com Abstract: In ths paper, a graphcal procedure usng bootstrap method s developed as an alternatve to the ANOVA to test the hypothess on equalty of several means. An example s gven to demonstrate the advantage of bootstrap graphcal procedure over the ANOVA n decson makng pont of vew. Keywords: Mean, Analyss of varance, Bootstrap method.. Introducton The analyss of varance frequently referred to by the contracton ANOVA s a statstcal technque specally desgned to test whether the means of more than two quanttatve populatons are equal. Bascally, ths method conssts of classfyng and crossclassfyng statstcal results and testng whether the means of specfed classfcaton dffer sgnfcantly. In ths study, applcablty of bootstrap method for testng of several means s dscussed and ths bootstrap method can be treated as an alternatve method to ANOVA when the samples are very small. Statstcal nference based on data resamplng has drawn a great deal of attenton n recent years. The man dea about these resamplng methods s not to assume much about the underlyng populaton dstrbuton and nstead tres to get the nformaton about the populaton from the data tself varous types of resamplng methods. Bootstrap method (Efron, 979) use the relatonshp between the sample and resamples drawn from the sample,

to approxmate the relatonshp between the populaton and samples drawn from t. Wth the bootstrap method, the basc sample s treated as the populaton and a Monte Carlo style procedure s conducted on t. Ths s done by randomly drawng a large number of resamples of sze n from ths orgnal sample wth replacement [] []. Both bootstrap and tradtonal parametrc nference seek to acheve the same goal usng lmted nformaton to estmate the samplng dstrbuton of the chosen estmatorθˆ. The estmate wll be used to make nferences about a populaton parameterθ. The key dfference between these nferental approaches s how they obtan ths samplng dstrbuton where as tradtonal parametrc nference utlzes a pror assumptons about the shape of θˆ s dstrbuton, the non-parametrc bootstrap s dstrbuton free, whch means that t s not dependent on a partcular class of dstrbutons. Wth the bootstrap method, the entre samplng dstrbuton θˆ s estmated by relyng on the fact that the sample s dstrbuton s good estmate of the populaton dstrbuton [] [].. Bootstrap method for testng of equalty of several means Let { X,,,..., k, j =,,..., } = represent k ndependent random samples of j n szes n,, n,... nk respectvely and we assume that X ~ (, j N σ ) µ for =,,..., k. Here, we are nterested n testng the null hypothess. H : 0 µ = µ = K = µ k = µ (Unknown) aganst the alternatve hypothess H : µ µ K µ. ANOVA s used for testng k H 0 n the lterature. Ths test demonstrates only the statstcal sgnfcance of the equalty of means beng compared. The Bootstrap graphcal procedure for testng H 0 s gven n the followng steps.

k Let the avalable combned sample s{ Z j, j =,, LN} of sze N = n =,. Draw the B- bootstrap samples of sze N wth replacement from the combned sample { j,, LN} Z j, = and the b th bootstrap sample of sze N s gven by { y b, =,, K, N and b =,, K, B} (.). Compute the mean for the b th bootstrap sample. y N b = y b N =, b =,, L, B. (.) 3. Obtan the samplng dstrbuton of mean usng B-bootstrap estmates and compute the mean and standard error of mean. y B = y b B b= B. and s = ( y b y ) B b= (.3) 4. The lower decson lne (LDL) and the upper decson lne (UDL) for the comparson of each of the x are gven by LDL = y UDL = y Z + Z α / α / s s (.4) where Zα / s the crtcal value at α level and Z 0.05 =. 96at 5% level. 5. Plot x aganst the decson lnes. If any one of the ponts plotted les outsde the respectve decson lnes, H 0 s rejected at α level and conclude that the means are not homogenous. 3

The proposed method s very useful n handlng of small samples of sze less than 30. Ths method not only tests the sgnfcant dfference among the means but also dentfy the source of heterogenety of means. 3. Numercal Example The lfetmes (n hours) of samples from three dfferent brands of batteres were recorded wth the followng results [3]. Table. Lfetmes (n hours) of the batteres Brand X X X 3 40 60 60 30 40 50 50 55 70 50 65 65 30 75 40 We wsh to test whether the three brands have same average lfetmes or not. We wll assume that the three samples come from normal populatons wth common (unknown) standard devaton. From the data, we have n =, n = 4, n 6, x = 40, x = 55 and x3 = 60. 5 3 = One-way ANOVA s carred out to test the hypothess H : µ = µ = µ 0 3 and the results gven below. Table. ANOVA Source of Varaton SS df MS F P-value F crt. Between Brands 40 570.000 4.75 0.040 3.885 Wthn Brands 600 33.333 Total 740 4 4

Snce the sgnfcant probablty value s 0.040, whch s less than the level of sgnfcance α = 0. 05, thus we reject the null hypothess and we may conclude that the mean lfetmes of three brands are not homogenous. We apply bootstrap method to test the same hypothess for the gven example. The procedure s explaned n the followng steps.. Draw B (=000) bootstrap sample of sze N=5 from the combned sample of sze N=5.. Compute the mean for each bootstrap sample usng (.) and form the samplng dstrbuton. 3. Compute the mean and standard devaton of the B-bootstrap estmates of mean usng (.3) and t s observed that y = 5. 044 and s = 3. 604 4. The decson lnes are computed usng (.4) and are obtaned at α = 0. 05 s as follows. LDL=44.980 and UDL=59.08. 5. Prepare a chart as n Fgure wth the above decson lnes and plot the ponts x (=,, 3). From Fg., we observe that x and x3 le outsde the decson lnes. Hence, H 0 may be rejected and we may conclude that the three populaton means are not same. 4. Conclusons From the above study, we observe that the concluson s same, that s Ho s rejected by both ANOVA and bootstrap method and bootstrap method not only tests the sgnfcant dfference among the means but also dentfes the source of heterogenety of means. 5

Snce x 3 les above the UDL, therefore the customer s advsed to purchase the brand X 3 batteres and try to avod the brand X batteres. References. Efron, B (979), Bootstrap methods: Another look at the Jackknfe, Ann. Statst., 7, pp -6.. Efron, B, Tbshran, R.J (993), An ntroducton to the Bootstrap, Chapman & Hall, New York. 3. Rohatg, V.K. (976), An Introducton to probablty theory and mathematcal statstcs, Wley Eastern Ltd., pp 55-56. Fgure. Chart of decson lnes x 3 UDL x LDL x 6