Stat 529 (Winter 2011) Experimental Design for the Two-Sample Problem. Motivation: Designing a new silver coins experiment

Similar documents
MBA 605, Business Analytics Donald D. Conant, Ph.D. Master of Business Administration

Chapter 7 Comparison of two independent samples

Disadvantages of using many pooled t procedures. The sampling distribution of the sample means. The variability between the sample means

Sampling distribution of t. 2. Sampling distribution of t. 3. Example: Gas mileage investigation. II. Inferential Statistics (8) t =

Inference for the mean of a population. Testing hypotheses about a single mean (the one sample t-test). The sign test for matched pairs

CHAPTER 7. Hypothesis Testing

Section 9.4. Notation. Requirements. Definition. Inferences About Two Means (Matched Pairs) Examples

Tests for Two Coefficient Alphas

7.2 One-Sample Correlation ( = a) Introduction. Correlation analysis measures the strength and direction of association between

Prepared by: Prof. Dr Bahaman Abu Samah Department of Professional Development and Continuing Education Faculty of Educational Studies Universiti

Design of Engineering Experiments Part 2 Basic Statistical Concepts Simple comparative experiments

Introduction to Business Statistics QM 220 Chapter 12

Chapter 9 Inferences from Two Samples

AMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015

10.2: The Chi Square Test for Goodness of Fit

One-Way ANOVA. Some examples of when ANOVA would be appropriate include:

Topic 15: Simple Hypotheses

STATISTICS 141 Final Review

Inference with Simple Regression

Chapter 23. Inferences About Means. Monday, May 6, 13. Copyright 2009 Pearson Education, Inc.

Chapter 24. Comparing Means

Analysis of Covariance. The following example illustrates a case where the covariate is affected by the treatments.

STA 101 Final Review

ME3620. Theory of Engineering Experimentation. Spring Chapter IV. Decision Making for a Single Sample. Chapter IV

Analysis of Variance (ANOVA)

Single Sample Means. SOCY601 Alan Neustadtl

Sampling Distributions: Central Limit Theorem

WISE Power Tutorial Answer Sheet

CIVL /8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8

Review: General Approach to Hypothesis Testing. 1. Define the research question and formulate the appropriate null and alternative hypotheses.

Lecture Slides. Elementary Statistics Eleventh Edition. by Mario F. Triola. and the Triola Statistics Series 9.1-1

Statistics Primer. ORC Staff: Jayme Palka Peter Boedeker Marcus Fagan Trey Dejong

HYPOTHESIS TESTING: THE CHI-SQUARE STATISTIC

One-Way Repeated Measures Contrasts

The simple linear regression model discussed in Chapter 13 was written as

Independent Samples ANOVA

Contrasts (in general)

Chapter 23: Inferences About Means

1 Introduction to One-way ANOVA

CHAPTER 10 Comparing Two Populations or Groups

CHAPTER 10 Comparing Two Populations or Groups

EX1. One way ANOVA: miles versus Plug. a) What are the hypotheses to be tested? b) What are df 1 and df 2? Verify by hand. , y 3

Using SPSS for One Way Analysis of Variance

The Chi-Square Distributions

10.4 Hypothesis Testing: Two Independent Samples Proportion

The Chi-Square Distributions

PSY 216. Assignment 9 Answers. Under what circumstances is a t statistic used instead of a z-score for a hypothesis test

Analysis of Variance (ANOVA)

STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots. March 8, 2015

Comparing Means from Two-Sample

1 Introduction to Minitab

HYPOTHESIS TESTING. Hypothesis Testing

Last week: Sample, population and sampling distributions finished with estimation & confidence intervals

Partitioning the Parameter Space. Topic 18 Composite Hypotheses

Chapter 20 Comparing Groups

Simple Linear Regression: One Qualitative IV

Stats Review Chapter 14. Mary Stangler Center for Academic Success Revised 8/16

Lecture 17: Small-Sample Inferences for Normal Populations. Confidence intervals for µ when σ is unknown

The t-statistic. Student s t Test

Objectives Simple linear regression. Statistical model for linear regression. Estimating the regression parameters

The t-test: A z-score for a sample mean tells us where in the distribution the particular mean lies

Chapter 23. Inference About Means

Last two weeks: Sample, population and sampling distributions finished with estimation & confidence intervals

determine whether or not this relationship is.

STAT22200 Spring 2014 Chapter 5

Business Statistics. Lecture 10: Course Review

Exam 2 (KEY) July 20, 2009

LAB 2. HYPOTHESIS TESTING IN THE BIOLOGICAL SCIENCES- Part 2

9-6. Testing the difference between proportions /20

Lecture 3: Inference in SLR

STAT 328 (Statistical Packages)

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS

Hypothesis Testing. Hypothesis: conjecture, proposition or statement based on published literature, data, or a theory that may or may not be true

5 Basic Steps in Any Hypothesis Test

Example: Four levels of herbicide strength in an experiment on dry weight of treated plants.

Political Science 236 Hypothesis Testing: Review and Bootstrapping

1 Descriptive statistics. 2 Scores and probability distributions. 3 Hypothesis testing and one-sample t-test. 4 More on t-tests

Statistics: CI, Tolerance Intervals, Exceedance, and Hypothesis Testing. Confidence intervals on mean. CL = x ± t * CL1- = exp

Inference for Distributions Inference for the Mean of a Population. Section 7.1

Analysis of Variance: Part 1

Six Sigma Black Belt Study Guides

An inferential procedure to use sample data to understand a population Procedures

BIOL Biometry LAB 6 - SINGLE FACTOR ANOVA and MULTIPLE COMPARISON PROCEDURES

Factorial Independent Samples ANOVA

EC2001 Econometrics 1 Dr. Jose Olmo Room D309

Lecture 2. Estimating Single Population Parameters 8-1

Midterm 1 and 2 results

Outline. PubH 5450 Biostatistics I Prof. Carlin. Confidence Interval for the Mean. Part I. Reviews

INTERVAL ESTIMATION AND HYPOTHESES TESTING

Inferences for Regression

Chapter 9. Inferences from Two Samples. Objective. Notation. Section 9.2. Definition. Notation. q = 1 p. Inferences About Two Proportions

DESIGNING EXPERIMENTS AND ANALYZING DATA A Model Comparison Perspective

One sample problem. sample mean: ȳ = . sample variance: s 2 = sample standard deviation: s = s 2. y i n. i=1. i=1 (y i ȳ) 2 n 1

INFERENCE FOR REGRESSION

Chapter 8 of Devore , H 1 :

Lecture 14. Analysis of Variance * Correlation and Regression. The McGraw-Hill Companies, Inc., 2000

Lecture 14. Outline. Outline. Analysis of Variance * Correlation and Regression Analysis of Variance (ANOVA)

Survey on Population Mean

CE3502. Environmental Measurements, Monitoring & Data Analysis. ANOVA: Analysis of. T-tests: Excel options

This gives us an upper and lower bound that capture our population mean.

Transcription:

Stat 529 (Winter 2011) Experimental Design for the Two-Sample Problem Reading: 2.4 2.6. Motivation: Designing a new silver coins experiment Sample size calculations Margin of error for the pooled two sample t CI The power of the pooled two sample t-test The paired experiment versus the two independent samples experiment Comparing the standard errors for the teachers example 1

Motivation: Designing a new silver coins experiment Problem: to distinguish whether there was a change in the silver content in coins minted during the reign of.... Design: Two samples are chosen from coins minted in the early and late periods of the reign. Analysis: A two-sample pooled t-test along with a 95% CI confidence interval for µ 1 µ 2, the difference in the mean silver content. Goals: 1. To provide an interval with a margin of error of 0.2. 2. What sample sizes would we need to detect a difference of 0.2 with a two-sided two-sample pooled t-test at level α = 0.05 with 90% power? 2

Experimental design Experimental design is the act of evaluating and choosing between different experiments. Sample size calculations are commonly used either before or after an experimental design is chosen. Here are two different approaches to sample size calculation for the two-sample pooled problem: 1. You want to select the sample sizes for a C% confidence interval for µ 1 µ 2 with a certain margin of error, m. 2. You want to select the samples sizes for testing H 0 : µ 1 = µ 2 with a certain significance level and power. 3

The margin of error Remember the pooled t-based 100(1 α)% confidence interval for µ 1 µ 2 is where S.E.(Y 1 Y 2 ) = In the above interval, the margin of error is m = For the silver coins example for a confidence level of C = 95%, say we want a margin of error of m = Why is this hard to solve? 4

Making approximations We have m = t n1 +n 2 2(0.975) s p 1 n 1 + 1 n 2. Approximation 1: Setting n 1 = n 2 = n we have: Approximation 2: Plug-in an estimate of s p (we will use the value of 0.474 from the data from Manuel I s reign). Approximation 3: Setting the df = we get a guess for n: 5

Assumptions, assumptions This calculation for n 1 and n 2 assumes that 1. A two-sample pooled t procedure will be appropriate. 2. We can actually obtain samples of size n 1 and n 2. 3. The variability in the sample we will collect is similar to that of our Byzantine coins. 6

The power of a significance test Remember, the power of a significance test is related to the probability of a type II error: Power = 1 P (Type II error) = 1 P (fail to reject H 0 when H 0 is false) = P (reject H 0 when H 0 is false). We need to be specific about what when H 0 is false means. For two-sample t-test: H 0 true: H 0 false: We must specify what specific value of µ 1 µ 2 in the alternative hypothesis we mean when we say H 0 is false in order to compute power. 7

The power of the pooled two sample t-test We use MINITAB. Stat Power and Sample Size 2-Sample t. Under Options select the Alternative Hypothesis and Significance Level. Then enter any two of the following three items: 1. Sample sizes: 2. Differences: (the difference between the µ 1 µ 2 value under H a and the µ 1 µ 2 value under H 0 ). 3. Power values: Enter the Standard deviation (s p ) and click OK. 8

Power calculation for the silver coins What sample sizes would we need to detect a difference of 0.2 with a two-sided two-sample pooled t-test at level α = 0.05 with 90% power? Power and Sample Size 2-Sample t Test Testing mean 1 = mean 2 (versus not =) Calculating power for mean 1 = mean 2 + difference Alpha = 0.05 Assumed standard deviation = 0.474 Sample Target Difference Size Power Actual Power 0.2 120 0.9 0.902368 The sample size is for each group. We need n 1 = n 2 = 9

The paired experiment versus the two independent samples experiment To compare these two experiments, we will evaluate the standard error (S.E.) under each setup for the Spanish teachers. We could also compare the margin of errors of the confidence intervals. 10

Spanish teachers example Suppose that the data had been collected on two separate sets of 20 teachers. One set at the beginning of the course. One set at the end of the course. Here are the statistics required for the two sample analysis: (check this for yourself) For sample 1 (pre): n 1 = 20 Y 1 = 27.3 s 2 1 = 25.38 For sample 2 (post): n 2 = 20 Y 2 = 28.75 s 2 2 = 22.51 The pooled estimate of σ is (n 1 1)s 2 1 s p = + (n 2 1)s 2 2 (n 1 1) + (n 2 1) = 4.89. 11

The two sample analysis (exercise!) Hypotheses: H 0 : µ 1 = µ 2 versus H a : µ 1 < µ 2, where µ 1 is the pre-mean score, and µ 2 is the post-mean score. The two-sample pooled t statistic is t = Y 1 Y 2 δ s p 1 n 1 + 1 n 2. 1.45 = 1 4.89 20 + 1 20 = 1.45 1.546 = 0.937. If T is a t distributed random variable on n 1 + n 2 2 = 20 + 20 2 = 38 degrees of freedom, the p-value is P (T < 0.937) = 0.177. Conclusion: 12

Comparing the SEs for the two experimental designs The paired experiment: Test the same n = 20 teachers before and after the course. The S.E. for the sample mean of the differences, Y, is S.E.(Y ) = s d n = 0.716. The non-paired experiment: Test different teachers before and after the course. n 1 = n 2 = 20. 1 S.E.(Y 1 Y 2 ) = s p + 1 = 1.546. n 1 n 2 Was it wise to originally use the paired t-test? 13