Sampling, Sampling Distribution and Normality

Presented by: Mahendra Adhi Nugroho, M.Sc
4/17/11

Sources:
- Anderson, Sweeney, Williams, Statistics for Business and Economics, 6e, Pearson Education, Inc., 2007
- Sugiyono, Statistika untuk Penelitian, Alfabeta, Bandung, 2007

Slide footer: Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc., Chap 7

Tools of Business Statistics
- Descriptive statistics: collecting, presenting, and describing data
- Inferential statistics: drawing conclusions and/or making decisions concerning a population based only on sample data

Populations and Samples
A population is the set of all items or individuals of interest. Examples:
- All likely voters in the next election
- All parts produced today
- All sales receipts for November
A sample is a subset of the population. Examples:
- 1,000 voters selected at random for interview
- A few parts selected for destructive testing
- Random receipts selected for audit

Population vs. Sample
[Figure: a population of items labeled a to z, and a sample drawn from it: b, c, g, i, n, o, r, u, y]

Why Sample?
- Less time consuming than a census
- Less costly to administer than a census
- It is possible to obtain statistical results of a sufficiently high precision based on samples

Sampling Methods
Probability sampling:
1. Simple random sampling
2. Proportionate stratified random sampling
3. Disproportionate stratified random sampling
4. Cluster sampling
Non-probability sampling:
1. Systematic sampling
2. Quota sampling
3. Incidental sampling
4. Purposive sampling
5. Saturated sampling
6. Snowball sampling
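The first two probability sampling methods listed above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names, the fixed seed, and the rounding rule used for proportional allocation are hypothetical choices, not something the slides specify.

```python
import random


def simple_random_sample(population, n, seed=0):
    """Draw n distinct units; every unit has the same chance of selection."""
    rng = random.Random(seed)  # fixed seed only so the sketch is repeatable
    return rng.sample(population, n)


def proportionate_stratified_sample(strata, n, seed=0):
    """Draw from each stratum in proportion to its share of the population.

    strata maps a stratum name to the list of its units. Allocation uses
    simple rounding (an assumption), so the total can differ slightly
    from n when the shares are not whole numbers.
    """
    rng = random.Random(seed)
    total = sum(len(units) for units in strata.values())
    sample = {}
    for name, units in strata.items():
        k = round(n * len(units) / total)
        sample[name] = rng.sample(units, k)
    return sample


# Hypothetical population: 60 units in stratum A, 40 in stratum B.
strata = {"A": list(range(60)), "B": list(range(100, 140))}
everyone = strata["A"] + strata["B"]

srs = simple_random_sample(everyone, 10)
stratified = proportionate_stratified_sample(strata, 10)
print(len(srs), {name: len(units) for name, units in stratified.items()})
```

With a 60/40 split and n = 10, the stratified draw takes 6 units from A and 4 from B, whereas the simple random sample may land on any split.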

Inferential Statistics
Making statements about a population by examining sample results:
Sample statistics (known) -> Inference -> Population parameters (unknown, but can be estimated from sample evidence)

Inferential Statistics
Drawing conclusions and/or making decisions concerning a population based on sample results.
- Estimation, e.g., estimate the population mean weight using the sample mean weight
- Hypothesis testing, e.g., use sample evidence to test the claim that the population mean weight is 120 pounds

Sampling Distributions
A sampling distribution is a distribution of all of the possible values of a statistic for a given size sample selected from a population.

Types of sampling distributions:
- Sampling distribution of the sample mean
- Sampling distribution of the sample proportion
- Sampling distribution of the sample variance
Note: this chapter discusses only the sampling distribution of the sample mean.

Developing a Sampling Distribution
Assume there is a population:
- Population size $N = 4$
- Random variable $X$ is the age of individuals
- Values of $X$: 18, 20, 22, 24 (years), for individuals A, B, C, D

Summary measures for the population distribution:
$\mu = \frac{\sum X_i}{N} = \frac{18 + 20 + 22 + 24}{4} = 21$
$\sigma = \sqrt{\frac{\sum (X_i - \mu)^2}{N}} = 2.236$

[Figure: population distribution, $P(x) = 0.25$ for each of $x$ = 18, 20, 22, 24 (A, B, C, D); uniform distribution]

Developing a Sampling Distribution (continued)
Now consider all possible samples of size $n = 2$.

16 possible samples (sampling with replacement):

1st Obs    2nd Observation
           18      20      22      24
18         18,18   18,20   18,22   18,24
20         20,18   20,20   20,22   20,24
22         22,18   22,20   22,22   22,24
24         24,18   24,20   24,22   24,24

16 sample means:

1st Obs    2nd Observation
           18   20   22   24
18         18   19   20   21
20         19   20   21   22
22         20   21   22   23
24         21   22   23   24

Sampling Distribution of All Sample Means
[Figure: sample means distribution, $P(\bar{X})$ against $\bar{X}$ = 18, 19, 20, 21, 22, 23, 24; no longer uniform]

Summary measures of this sampling distribution:
$E(\bar{X}) = \frac{\sum \bar{X}_i}{16} = \frac{18 + 19 + 21 + \cdots + 24}{16} = 21 = \mu$
$\sigma_{\bar{X}} = \sqrt{\frac{\sum (\bar{X}_i - \mu)^2}{16}} = \sqrt{\frac{(18 - 21)^2 + (19 - 21)^2 + \cdots + (24 - 21)^2}{16}} = 1.58$

Comparing the Population with its Sampling Distribution
- Population ($N = 4$): $\mu = 21$, $\sigma = 2.236$
- Sample means distribution ($n = 2$): $\mu_{\bar{X}} = 21$, $\sigma_{\bar{X}} = 1.58$
[Figure: the uniform population distribution over 18, 20, 22, 24 next to the peaked distribution of sample means over 18 to 24]

Expected Value of Sample Mean
Let $X_1, X_2, \ldots, X_n$ represent a random sample from a population. The sample mean value of these observations is defined as
$\bar{X} = \frac{1}{n} \sum_{i=1}^{n} X_i$

Standard Error of the Mean
Different samples of the same size from the same population will yield different sample means. A measure of the variability in the mean from sample to sample is given by the standard error of the mean:
$\sigma_{\bar{X}} = \frac{\sigma}{\sqrt{n}}$
Note that the standard error of the mean decreases as the sample size increases.
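The four-value population example above can be checked by brute force. The sketch below enumerates all 16 samples of size 2 drawn with replacement, computes the mean and standard deviation of the sample means, and compares the latter with $\sigma / \sqrt{n}$. The variable names are illustrative only.

```python
from itertools import product
from math import sqrt

population = [18, 20, 22, 24]  # ages of individuals A, B, C, D
n = 2                          # sample size

# Population mean and standard deviation (divide by N, not N - 1).
mu = sum(population) / len(population)
sigma = sqrt(sum((x - mu) ** 2 for x in population) / len(population))

# All ordered samples of size n, drawn with replacement.
samples = list(product(population, repeat=n))
means = [sum(s) / n for s in samples]

# Mean and standard deviation of the sampling distribution of the mean.
mean_of_means = sum(means) / len(means)
se = sqrt(sum((m - mean_of_means) ** 2 for m in means) / len(means))

print(len(samples), mu, round(sigma, 3), mean_of_means, round(se, 2))
print(round(sigma / sqrt(n), 2))  # standard error from the formula
```

Both routes give 1.58, so the enumeration agrees with the standard error formula for this population.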

If the Population is Normal
If a population is normal with mean $\mu$ and standard deviation $\sigma$, the sampling distribution of $\bar{X}$ is also normally distributed with
$\mu_{\bar{X}} = \mu$ and $\sigma_{\bar{X}} = \frac{\sigma}{\sqrt{n}}$

Z-value for Sampling Distribution of the Mean
Z-value for the sampling distribution of $\bar{X}$:
$Z = \frac{\bar{X} - \mu}{\sigma_{\bar{X}}} = \frac{\bar{X} - \mu}{\sigma / \sqrt{n}}$
where:
- $\bar{X}$ = sample mean
- $\mu$ = population mean
- $\sigma$ = population standard deviation
- $n$ = sample size

Finite Population Correction
Apply the finite population correction if:
- a population member cannot be included more than once in a sample (sampling is without replacement), and
- the sample is large relative to the population ($n$ is greater than about 5% of $N$)
Then
$\mathrm{Var}(\bar{X}) = \frac{\sigma^2}{n} \cdot \frac{N - n}{N - 1}$ or $\sigma_{\bar{X}} = \frac{\sigma}{\sqrt{n}} \sqrt{\frac{N - n}{N - 1}}$

Sampling Distribution Properties
$\mu_{\bar{X}} = \mu$ (i.e. $\bar{X}$ is unbiased)
[Figure: normal population distribution and normal sampling distribution (has the same mean)]

Sampling Distribution Properties (continued)
For sampling with replacement: as $n$ increases, $\sigma_{\bar{X}}$ decreases.
[Figure: sampling distributions for a smaller sample size and a larger sample size]

If the Population is not Normal
We can apply the Central Limit Theorem: even if the population is not normal, sample means from the population will be approximately normal as long as the sample size is large enough.
Properties of the sampling distribution:
$\mu_{\bar{X}} = \mu$ and $\sigma_{\bar{X}} = \frac{\sigma}{\sqrt{n}}$
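The z-value and finite population correction formulas above translate directly into code. The following is a minimal sketch; the function names and the numbers in the usage lines (a population mean of 100, standard deviation of 15, sample of 25) are made up for illustration and do not come from the slides.

```python
from math import sqrt


def standard_error(sigma, n, N=None):
    """Standard error of the mean, sigma / sqrt(n).

    If a population size N is given, the finite population correction
    sqrt((N - n) / (N - 1)) is applied (sampling without replacement).
    """
    se = sigma / sqrt(n)
    if N is not None:
        se *= sqrt((N - n) / (N - 1))
    return se


def z_value(xbar, mu, sigma, n, N=None):
    """Z-value of a sample mean under the sampling distribution of the mean."""
    return (xbar - mu) / standard_error(sigma, n, N)


# Hypothetical numbers: mu = 100, sigma = 15, sample of n = 25.
print(standard_error(15, 25))         # no correction
print(standard_error(15, 25, N=100))  # n is 25% of N, so the correction matters
print(z_value(106, 100, 15, 25))
```

The corrected standard error is always smaller than the uncorrected one, because a sample drawn without replacement from a small population covers a meaningful share of it.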

Central Limit Theorem
As the sample size gets large enough, the sampling distribution becomes almost normal regardless of the shape of the population.

If the Population is not Normal (continued)
Sampling distribution properties:
- Central tendency: $\mu_{\bar{X}} = \mu$
- Variation: $\sigma_{\bar{X}} = \frac{\sigma}{\sqrt{n}}$
[Figure: population distribution and sampling distribution (becomes normal as $n$ increases), shown for a smaller and a larger sample size]

How Large is Large Enough?
- For most distributions, $n > 25$ will give a sampling distribution that is nearly normal
- For normal population distributions, the sampling distribution of the mean is always normally distributed

Deciding Sample Size
- Isaac and Michael approach:
  $s = \frac{\lambda^2 \cdot N \cdot P \cdot Q}{d^2 (N - 1) + \lambda^2 \cdot P \cdot Q}$
  Use the table in Sugiyono, page 71.
- Harry King nomogram: maximum sample size is 2000.

Suggested Sample Size
- A proper sample size for research is 30 to 500.
- When the sample is divided into categories, each category needs at least 30 samples.
- For multivariate data analysis (correlation or multiple regression), the sample size should be at least 10 times the number of variables (independent and dependent).
- For a simple experimental design with experiment and control groups, use 10 samples in each group.

Normality: Normal Curve
A data set may be normally distributed if the number of observations and the standard deviation above the mean are the same as those below the mean.
[Figure: histogram of Number against Value]
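The Isaac and Michael formula above is easy to evaluate directly. The slides do not define its symbols, so this sketch rests on assumptions: $\lambda^2$ is taken to be the chi-square critical value with 1 degree of freedom (3.841 at the 5% level), $P = Q = 0.5$, and $d = 0.05$. These are common default choices rather than values confirmed by the source, and the function name is hypothetical.

```python
def isaac_michael_sample_size(N, chi2=3.841, P=0.5, d=0.05):
    """Sample size s for a population of size N.

    chi2: lambda squared, assumed here to be the chi-square critical
          value with 1 degree of freedom (3.841 at the 5% level).
    P:    assumed population proportion; Q = 1 - P.
    d:    assumed tolerated difference between sample and population.
    """
    Q = 1 - P
    return (chi2 * N * P * Q) / (d ** 2 * (N - 1) + chi2 * P * Q)


for N in (100, 1000, 10000):
    print(N, round(isaac_michael_sample_size(N), 1))
```

Under these assumptions the required sample size grows with $N$ but levels off near $\lambda^2 P Q / d^2 \approx 384$, which is why very large populations do not need proportionally larger samples.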

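Normality: Normal Curve
Divide the normal curve into 6 areas based on standard deviation values:

Area (in units of s from the mean)    Share of the curve
-3s to -2s                            2.7%
-2s to -1s                            13.53%
-1s to mean                           34.13%
mean to +1s                           34.13%
+1s to +2s                            13.53%
+2s to +3s                            2.7%

Each value is located on the curve through its z-score:
$z = \frac{x_i - \bar{x}}{s}$

Normality Test Using Chi Square
1. Decide the number of class intervals. In this case use 6 class intervals, because the normal curve used for the chi-square test is divided into 6 parts.
2. Decide the interval width.
3. Compute the chi-square value and compare it with the value stated in the chi-square table.

$\chi^2 = \sum \frac{(f_o - f_e)^2}{f_e}$
where $f_o$ is the observed frequency and $f_e$ is the expected frequency in each class.

Let me win! If I cannot be a winner, let me be brave in the attempt!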
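The six-area division and the chi-square statistic above can be sketched as follows. This is not the slides' exact procedure: it classifies observations by z-score bands with cut points at -2, -1, 0, 1 and 2, leaves the two outer bands open-ended so the expected shares sum to 1, and computes the shares from the normal CDF instead of using the rounded percentages in the table. With those assumptions the shares come out at about 2.28%, 13.59% and 34.13% on each side. All names are illustrative.

```python
from math import erf, sqrt

CUTS = [-2.0, -1.0, 0.0, 1.0, 2.0]  # z-score boundaries between the 6 bands


def normal_cdf(z):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + erf(z / sqrt(2.0)))


def band_probabilities():
    """Share of the normal curve in each of the 6 bands (outer bands open)."""
    edges = [float("-inf")] + CUTS + [float("inf")]
    return [normal_cdf(b) - normal_cdf(a) for a, b in zip(edges, edges[1:])]


def chi_square_normality(data):
    """Return (chi-square statistic, observed counts, expected counts)."""
    n = len(data)
    mean = sum(data) / n
    s = sqrt(sum((x - mean) ** 2 for x in data) / (n - 1))  # sample std dev
    observed = [0] * 6
    for x in data:
        z = (x - mean) / s
        band = sum(z > c for c in CUTS)  # number of cut points below z: 0..5
        observed[band] += 1
    expected = [p * n for p in band_probabilities()]
    chi2 = sum((fo - fe) ** 2 / fe for fo, fe in zip(observed, expected))
    return chi2, observed, expected


# Hypothetical data: the integers 1..100, which are uniform, not normal.
chi2, observed, expected = chi_square_normality(list(range(1, 101)))
print([round(100 * p, 2) for p in band_probabilities()])
print(observed, round(chi2, 2))
```

The resulting statistic is then compared with the critical value from a chi-square table, as the last step of the procedure describes; a larger statistic means the observed counts sit further from what a normal curve would predict.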