Estimating Confidence Interval of Mean Using. Classical, Bayesian, and Bootstrap Approaches

Similar documents
The Sampling Distribution of the Maximum. Likelihood Estimators for the Parameters of. Beta-Binomial Distribution

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.

Double Stage Shrinkage Estimator of Two Parameters. Generalized Exponential Distribution

Approximate Confidence Interval for the Reciprocal of a Normal Mean with a Known Coefficient of Variation

Interval Estimation (Confidence Interval = C.I.): An interval estimate of some population parameter is an interval of the form (, ),

DS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10

Sample Size Determination (Two or More Samples)

MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND.

The (P-A-L) Generalized Exponential Distribution: Properties and Estimation

Bootstrap Intervals of the Parameters of Lognormal Distribution Using Power Rule Model and Accelerated Life Tests

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.

Chapter 20. Comparing Two Proportions. BPS - 5th Ed. Chapter 20 1

Confidence Interval for Standard Deviation of Normal Distribution with Known Coefficients of Variation

Chapter 6 Sampling Distributions

BIOS 4110: Introduction to Biostatistics. Breheny. Lab #9

BIOSTATISTICS. Lecture 5 Interval Estimations for Mean and Proportion. dr. Petr Nazarov

ENGI 4421 Confidence Intervals (Two Samples) Page 12-01

Random Variables, Sampling and Estimation

Expectation and Variance of a random variable

Topic 9: Sampling Distributions of Estimators

7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals

3 Resampling Methods: The Jackknife

Bayesian and E- Bayesian Method of Estimation of Parameter of Rayleigh Distribution- A Bayesian Approach under Linex Loss Function

A goodness-of-fit test based on the empirical characteristic function and a comparison of tests for normality

Exam II Review. CEE 3710 November 15, /16/2017. EXAM II Friday, November 17, in class. Open book and open notes.

Binomial Distribution

Lecture 5. Materials Covered: Chapter 6 Suggested Exercises: 6.7, 6.9, 6.17, 6.20, 6.21, 6.41, 6.49, 6.52, 6.53, 6.62, 6.63.

Parameter, Statistic and Random Samples

Inferential Statistics. Inference Process. Inferential Statistics and Probability a Holistic Approach. Inference Process.

Stat 200 -Testing Summary Page 1

Economics Spring 2015

Comparing Two Populations. Topic 15 - Two Sample Inference I. Comparing Two Means. Comparing Two Pop Means. Background Reading

Topic 9: Sampling Distributions of Estimators

STA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to:

Investigating the Significance of a Correlation Coefficient using Jackknife Estimates

Computing Confidence Intervals for Sample Data

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4

Class 23. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

4.5 Multiple Imputation

t distribution [34] : used to test a mean against an hypothesized value (H 0 : µ = µ 0 ) or the difference

Comparison Study of Series Approximation. and Convergence between Chebyshev. and Legendre Series

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics

Bayesian Methods: Introduction to Multi-parameter Models

STATISTICAL method is one branch of mathematical

Confidence Intervals รศ.ดร. อน นต ผลเพ ม Assoc.Prof. Anan Phonphoem, Ph.D. Intelligent Wireless Network Group (IWING Lab)

KLMED8004 Medical statistics. Part I, autumn Estimation. We have previously learned: Population and sample. New questions

Improved Class of Ratio -Cum- Product Estimators of Finite Population Mean in two Phase Sampling

Access to the published version may require journal subscription. Published with permission from: Elsevier.

Linear Regression Models

Properties and Hypothesis Testing

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara

Understanding Samples

Chapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc.

Topic 9: Sampling Distributions of Estimators

Statistics 511 Additional Materials

University of California, Los Angeles Department of Statistics. Hypothesis testing

Rule of probability. Let A and B be two events (sets of elementary events). 11. If P (AB) = P (A)P (B), then A and B are independent.

MATH/STAT 352: Lecture 15

Lecture 1. Statistics: A science of information. Population: The population is the collection of all subjects we re interested in studying.

Basis for simulation techniques

Confidence interval for the two-parameter exponentiated Gumbel distribution based on record values

Module 1 Fundamentals in statistics

Big Picture. 5. Data, Estimates, and Models: quantifying the accuracy of estimates.

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics

MBACATÓLICA. Quantitative Methods. Faculdade de Ciências Económicas e Empresariais UNIVERSIDADE CATÓLICA PORTUGUESA 9. SAMPLING DISTRIBUTIONS

Lecture 2: Monte Carlo Simulation

Stat 319 Theory of Statistics (2) Exercises

1 Inferential Methods for Correlation and Regression Analysis

MidtermII Review. Sta Fall Office Hours Wednesday 12:30-2:30pm Watch linear regression videos before lab on Thursday

1.010 Uncertainty in Engineering Fall 2008

Maximum likelihood estimation from record-breaking data for the generalized Pareto distribution

G. R. Pasha Department of Statistics Bahauddin Zakariya University Multan, Pakistan

Chapter 13, Part A Analysis of Variance and Experimental Design

Parameter, Statistic and Random Samples

Goodness-Of-Fit For The Generalized Exponential Distribution. Abstract

z is the upper tail critical value from the normal distribution

Chapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc.

HYPOTHESIS TESTS FOR ONE POPULATION MEAN WORKSHEET MTH 1210, FALL 2018

A proposed discrete distribution for the statistical modeling of

STAT431 Review. X = n. n )

On an Application of Bayesian Estimation

This is an introductory course in Analysis of Variance and Design of Experiments.

Stat 421-SP2012 Interval Estimation Section

R. van Zyl 1, A.J. van der Merwe 2. Quintiles International, University of the Free State

Common Large/Small Sample Tests 1/55

Class 27. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

Topic 10: Introduction to Estimation

Exponential Families and Bayesian Inference

(7 One- and Two-Sample Estimation Problem )

Overview of Estimation

STAT 350 Handout 19 Sampling Distribution, Central Limit Theorem (6.6)

f(x i ; ) L(x; p) = i=1 To estimate the value of that maximizes L or equivalently ln L we will set =0, for i =1, 2,...,m p x i (1 p) 1 x i i=1

AAEC/ECON 5126 FINAL EXAM: SOLUTIONS

Final Examination Solutions 17/6/2010

ECONOMETRIC THEORY. MODULE XIII Lecture - 34 Asymptotic Theory and Stochastic Regressors

Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1.

Modified Decomposition Method by Adomian and. Rach for Solving Nonlinear Volterra Integro- Differential Equations

A quick activity - Central Limit Theorem and Proportions. Lecture 21: Testing Proportions. Results from the GSS. Statistics and the General Population

This chapter focuses on two experimental designs that are crucial to comparative studies: (1) independent samples and (2) matched pair samples.

New Results for the Fibonacci Sequence Using Binet s Formula

Transcription:

Iteratioal Joural of Mathematical Aalysis Vol. 8, 2014, o. 48, 2375-2383 HIKARI Ltd, www.m-hikari.com http://dx.doi.org/10.12988/ijma.2014.49287 Estimatig Cofidece Iterval of Mea Usig Classical, Bayesia, ad Bootstrap Approaches Solimu Departmet of Mathematics Faculty of Mathematics ad Natural Scieces Uiversity of Brawijaya Jala Vetera Malag-Idoesia Copyright 2014 Solimu. This is a ope access article distributed uder the Creative Commos Attributio Licese, which permits urestricted use, distributio, ad reproductio i ay medium, provided the origial work is properly cited. Abstract I oe study, sometimes observed the characteristics of a populatio (eg the media, variace, media, or proportio). Give the limitatios ad costraits, it is ot possible to observe the whole of the populatio elemets. Alterative estimatio step is performed usig a populatio sample draw at radom from a populatio. I this study, the media iterval estimatio of the populatio ( ). There are three methods that will be studied is the classical method, Bayes approach, ad a bootstrap approach. This study focused o estimatig meas usig the third approach ad compare the results obtaied from the three approaches. Test results usig the data of the populatio obtaied a estimate of the value of the middle third populatio is relatively the same method, i which the bootstrap method produces the smallest cofidece iterval. Keywords: Cofidece iterval, classic, Bayes, ad Bootstrap 1 Itroductio All observatios, whether fiite or ifiite, comprise of what s kow as populatio. I a study, sometimes characteristics of a populatio are observed. Several statistical measures are used to discover the characteristics of populatios, such as mea, variace, media, or proportio.

2376 Solimu I statistical iferece we wat to draw coclusios o populatios, although it s impossible or impractical for us to observe all idividuals i a populatio. With various limitatios ad obstacles, it s impossible to observe the etire populatio elemets. A alterative step is estimatig populatios usig a radomly collected sample from a populatio. Oe of populatio parameter estimatio system based o sample statistics is cofidece iterval which is a system which produces represetative parameter estimatios. Statistical iferece theory icludes all methods used i drawig coclusios or geeralizig a populatio. The curret tedecy i estimatig a populatio parameter is the developmet of classical method which bases its coclusio oly o iformatio from a radom sample from the populatio. Two ew methods discussed i this study are Bayesia ad Bootstrap methods. Bayesia method uses or combies subjective kowledge o the distributio of ukow parameter opportuity with iformatio from data sample. Bootstrap method uses classical method which uses resamplig. Based o the backgroud above, the problem discussed was How is the use of iterval estimatio of the meas of a populatio usig classical method, Bayesia approach, ad bootstrap approach, ad the compariso of the three methods? The purpose of this study was usig iterval estimatio of the meas of a populatio usig classical method, Bayesia approach, ad bootstrap approach, ad the compariso of the three methods. The beefit of this study was researchers ca use Bayesia approach ad bootstrap approach as alteratives i parameter estimatio, aside from the popular classical method. 2 Materials ad Methods The populatio of a data is assumed to be ormally distributed with X N(µ, 2 ) where expectatio value of X is with mea µ ad variace 2. Populatio parameters µ ad 2 are ukow. Mea sample X ad variace s 2 are estimators of the mea ad variace of the populatio: 1 2 2 1 ˆ X X i da ˆ s X i X i 1 1 where Xi is radom variable take radomly from a populatio. Expectatio value of average sample is E ( X ) = µ ad Stadard deviatio Se ( X ) =. For a small sample ( < 30) the populatio distributes ormally (X N (µ, 2 )) ad 2 is ukow ad estimated by s 2, so it ca be formulated that: i 1 2

Estimatig cofidece iterval of mea 2377 X µ s t-1 where t-1 is produced from t distributio with degree of freedom -1, so cofidece iterval for mea is: P( t s 1 / 2, < µ < X 1 t s 1 / 2, ) = 95% X 1 I classical approach, cofidece iterval estimatio comes from asymptotic sample drawig theory, while for Bayesia approach, cofidece iterval estimatio comes from posterior distributio from the geeratio of sample data from data ad some cocetratio of prior distributio of parameters. At the first level of the model, it s assumed that the distributio of the sample is ormal Level 1 (DATA): Xi N(µ, 2 ). At the secod level, it specifies prior distributio for μ Level 2 (PRIOR): µ N(μμ, 2 μ ). At the third or last level, it specifies hyperprior distributio for 2, μμ, 2 μ. Level 3 (HYPERPRIOR): P( 2 ), P(μμ) ad P( 2 μ). This Bayesia approach geerate a sample for uobserved parameters µ (1), µ (2),, µ (k) of distributio µ. Every sample geeratio estimates posterior distributio for µ ad calculates posterior mea. Estimator of cofidece iterval of mea with cofidece level 95% is obtaied from percetile 2.5% ad 97.5% of the simulatio. Bootstrap method uses resamplig method. It s assumed that data distributio is ukow. x1,x2,...,x is a radom sample of F which is a ukow distributio, = (F) is parameter ad ˆ T( x,..., x ) is the ˆ * * 1 1 * estimatio of. Estimator T( x,...,x ) obtaied from bootstrap sample * * ( x1,...,x ) is called bootstrap replicatio for ˆ. This study uses data of bowlig scores preseted i Table 1.

2378 Solimu Table 1: Data Score Bowlig Game No Score No Score 1 93 12 72 2 119 13 118 3 110 14 73 4 72 15 102 5 99 16 122 6 85 17 70 7 53 18 81 8 70 19 130 9 66 20 97 10 142 21 89 11 63 22 27 Usig three method, classical, Bayesia, ad bootstrap, estimatio of cofidece level of a populatio was coducted. The software used were SPLUS ad Wibugs: 3 Result ad Discussio Figure 1 shows histogram ad data cocetratio fuctio. We ca see that the data has asymmetrical distributio. Estimator of the mea of the populatio is 88, 77 with stadard deviatio 5, 90. Cofidece iterval 95% for estimatio of the mea of the populatio is [76, 51; 101.05]. 0 2 4 6 0.0 0.002 0.004 0.006 0.008 0.010 0.012 0.014 20 40 60 80 100 120 140 160 data 0 50 100 150 data Figure 1: Histograms ad Desity Fuctio Data

Estimatig cofidece iterval of mea 2379 This Bayesia approach used software Wibugs. First, it defied model data ad estimatio of iitial value. Next, it performed simulatio with iteratio 10000. Figure 2 shows the mea obtaied i every simulatio. The fial part used aalysis based o 1001 st to 10000 th iteratios. mu 120.0 100.0 80.0 60.0 40.0 1000 2500 5000 7500 10000 iteratio Figure 2: Trace plots for the media populatio (after the disposal of the first observatio i 1000) Estimatio of cocetratio fuctio for posterior distributio for mea of the populatio is preseted i Figure 3. Estimatio of cofidece iterval of the mea of the populatio is obtaied from quatile values 2,5% ad 97,5% from the simulatio. 0.08 0.06 0.04 0.02 0.0 mu sample: 9001 40.0 60.0 80.0 100.0 Figure 3: Posterior desity fuctio for the distributio of the media populatio Estimator of the mea of posterior distributio is 88, 58 ad stadard deviatio is 6, 18. Cofidece iterval 95% for mea of the populatio is [76, 46; 100, 30]. Estimatio usig Maximum Likelihood i Bootstrap approach is 88, 78 with stadard deviatio 5, 75. Cofidece iterval 95% for mea of the populatio is [77, 50; 99, 95]. Figure 4 shows histogram ad cocetratio fuctio for the mea of the sample based o the result of 1000 bootstrap iteratios.

2380 Solimu 0 200 400 600 800 1000 1200 1400 dx$y 0.0 0.02 0.04 0.06 70 80 90 100 110 theta.x 70 80 90 100 110 theta Figure 4: Histograms ad Desity Fuctio Cetral Value Based o 1000 Repetitio Bootstrap Samples The results of the three methods are preseted i the followig table: Table 2: Estimatio of Cetral Value, Stadard Deviatio ad Cofidece Iterval Methods Classical, Bayes ad Bootstrap Method Cetral Value Stadard Deviatio Lower limit Hose cofidece Upper Limit Width Clasic 88,77 5,90 76,51 101,05 24,54 Bayes 88,58 6,18 76,46 100,30 23,84 Bootstrap 88,78 5,74 77,50 99,95 22,45 Table 2 shows that the mea obtaied from the three approaches were early the same, especially classical ad bootstrap methods. Similarly for stadard deviatio Bayesia method had the biggest stadard deviatio, ad Bootstrap method had the smallest stadard deviatio. Similarly for cofidece iterval, Bootstrap method had the smallest width of cofidece iterval. The mai differeces

Estimatig cofidece iterval of mea 2381 of the three methods were: 1) classical ad Bayesia methods required distributio assumptio to base the data, while bootstrap method did t assume data with certai distributio. 2) Classical method was derived from multiplicatio with critical value. This made the cofidece iterval produced to be symmetrical with mea estimator. While i Bayesia ad Bootstrap methods, cofidece iterval approaches used quatile 2,5% ad 97,5% which produced asymmetrical cofidece iterval. 4 Coclusio Cofidece iterval estimatio could use Classical, Bayesia ad Bootstrap methods. I the applicatio, by usig data of a populatio, three methods were relatively similar. Bootstrap method had the smallest width of cofidece iterval, idicatig that this method was more thorough ad recommeded. Ackowledgemets. May thaks to Uiversity of Brawijaya for fiacial support. Refereces [1]. Dukic, V., da Hoga, J.W. A hierarchical bayesia approach to modelig embryo implatatio followig i vitro fertilizatio. http://biostatistics.oxfordjourals.org/cgi/reprit/3/3/361.pdf. [2]. Friedma, N., Goldszmidt, M., ad Wyer, A. Data aalysis with Bayesia etworks: a bootstrap approach. Http://www.cs.huji.ac.il/~ir/Abstracts/FGW2.html. [3]. Matthew, J. B., Falciai, F., Ghahramai, Z., Ragel, C., da Wild, D. L.. A Bayesia approach to recostructig geetic regulatory etworks with hidde factors. Http://bioiformatics.oxfordjourals.org/cgi/cotet/full/21/3/349. [4]. Walpole, R.E. 1995. Pegatar Statistika. (I Idoesia) PT. Gramedia Pustaka Utama, Idoesia. Received: August 12, 2014 Appedix 1. Code

2382 Solimu Clasical Method data<-c(93,119,110,72,99,85,53,70,66,142,63,72,118,73,102,122,70,81,130,97,89,27) par(mfrow=c(1,2)) hist(data,col=0,class=7) dx<-desity(data) data<-dx$x plot(data,dx$y,type="l") s.data<-sum(data) ssq.data<-sum(data^2) <-legth(data) # histogram of replicates # desity estimate # sum the data # sum of square the data # sample size ml.x<-s.data/ mea ml.sd<-sqrt((ssq.data-s.data^2/)/(-1)) for stadard deviatio ml.se<-ml.sd/sqrt() df<--1 CIL<-ml.x-qt(0.975,df)*ml.se CIU<-ml.x+qt(0.975,df)*ml.se # maximum likelihood estimator for # maximum likelihood estimator # stadard error # degree of freedom # lower limit CI # upper limit CI ml.x ml.se CIL CIU Bayesia Approach model { } for( i i 1 : N ) { data[i] ~ dorm(mu,tau.c) } tau.c ~ dgamma(0.001,0.001) mu ~ dorm(alpha,tau.alpha) alpha ~ dorm(0.0,1.0e-6) tau.alpha ~ dgamma(0.001,0.001) list(n=22, data=c(93,119,110,72,99,85,53,70,66,142,63,72,118,73,102,122,70,81,130,97,89, 27)) list(mu=10, alpha = 0, tau.c = 1, tau.alpha = 1)

Estimatig cofidece iterval of mea 2383 Bootstrapig Method data<-c(93,119,110,72,99,85,53,70,66,142,63,72,118,73,102,122,70,81,130,97, 89,27) B<-10000 # umber of bootstrap theta.x<-c(1:b) # vector to keep the theta for (i i 1:B) { data.boot<-sample(data,size=,replace=t) # draw o-parametric bootstrap sample theta.x[i]<-mea(data.boot) # calculate theta } mu<-mea(theta.x) # mea of theta sd<-stdev(theta.x) # stadard deviatio of theta CIL<-quatile(theta.x,probs=0.025) # lower limit cofidece iterval CIU<-quatile(theta.x,probs=0.975) # upper limit cofidece iterval mu sd CIL CIU par(mfrow=c(1,2)) hist(theta.x,col=0,class=) dx<-desity(theta.x) theta<-dx$x plot(theta,dx$y,type="l") # histogram of replicates # desity estimate