T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS

Similar documents
CHAPTER 10 Comparing Two Populations or Groups

CHAPTER 10 Comparing Two Populations or Groups

Outline. PubH 5450 Biostatistics I Prof. Carlin. Confidence Interval for the Mean. Part I. Reviews

Chapter 24. Comparing Means. Copyright 2010 Pearson Education, Inc.

Review: General Approach to Hypothesis Testing. 1. Define the research question and formulate the appropriate null and alternative hypotheses.

2011 Pearson Education, Inc

7.2 One-Sample Correlation ( = a) Introduction. Correlation analysis measures the strength and direction of association between

STA Module 11 Inferences for Two Population Means

STA Rev. F Learning Objectives. Two Population Means. Module 11 Inferences for Two Population Means

CIVL /8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8

AMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015

STA Module 10 Comparing Two Proportions

Section 9.4. Notation. Requirements. Definition. Inferences About Two Means (Matched Pairs) Examples

LAB 2. HYPOTHESIS TESTING IN THE BIOLOGICAL SCIENCES- Part 2

In a one-way ANOVA, the total sums of squares among observations is partitioned into two components: Sums of squares represent:

Two-Sample Inferential Statistics

An inferential procedure to use sample data to understand a population Procedures

Inference About Two Means: Independent Samples

Statistics for Managers Using Microsoft Excel Chapter 9 Two Sample Tests With Numerical Data

Levene's Test of Equality of Error Variances a

HYPOTHESIS TESTING. Hypothesis Testing

CBA4 is live in practice mode this week exam mode from Saturday!

The t-test: A z-score for a sample mean tells us where in the distribution the particular mean lies

Chapter 8. Inferences Based on a Two Samples Confidence Intervals and Tests of Hypothesis

Chapter 5 Confidence Intervals

Lectures 5 & 6: Hypothesis Testing

CHAPTER 10 Comparing Two Populations or Groups

Introduction to Business Statistics QM 220 Chapter 12

16.400/453J Human Factors Engineering. Design of Experiments II

Basic Statistics. 1. Gross error analyst makes a gross mistake (misread balance or entered wrong value into calculation).

Business Statistics. Lecture 10: Course Review

Sampling distribution of t. 2. Sampling distribution of t. 3. Example: Gas mileage investigation. II. Inferential Statistics (8) t =

Design of Engineering Experiments Part 2 Basic Statistical Concepts Simple comparative experiments

Statistical Methods in Natural Resources Management ESRM 304

Study Ch. 9.3, #47 53 (45 51), 55 61, (55 59)

Statistical Analysis for QBIC Genetics Adapted by Ellen G. Dow 2017

Single Sample Means. SOCY601 Alan Neustadtl

MBA 605, Business Analytics Donald D. Conant, Ph.D. Master of Business Administration

Exam 2 (KEY) July 20, 2009

Two Sample Problems. Two sample problems

ANOVA Analysis of Variance

Test 3 Practice Test A. NOTE: Ignore Q10 (not covered)

Hypothesis Testing hypothesis testing approach formulation of the test statistic

Questions 3.83, 6.11, 6.12, 6.17, 6.25, 6.29, 6.33, 6.35, 6.50, 6.51, 6.53, 6.55, 6.59, 6.60, 6.65, 6.69, 6.70, 6.77, 6.79, 6.89, 6.

Introduction to Statistical Data Analysis III

Chapter 9. Inferences from Two Samples. Objective. Notation. Section 9.2. Definition. Notation. q = 1 p. Inferences About Two Proportions

Statistics for IT Managers

Chapter 7 Comparison of two independent samples

STAT 515 fa 2016 Lec Statistical inference - hypothesis testing

LECTURE 5. Introduction to Econometrics. Hypothesis testing

CHAPTER 9: HYPOTHESIS TESTING

STA 101 Final Review

Chapter 9 Inferences from Two Samples

Sociology 6Z03 Review II

BIOL Biometry LAB 6 - SINGLE FACTOR ANOVA and MULTIPLE COMPARISON PROCEDURES

POLI 443 Applied Political Research

Pooled Variance t Test

Performance Evaluation and Comparison

Empirical Evaluation (Ch 5)

Hypothesis testing I. - In particular, we are talking about statistical hypotheses. [get everyone s finger length!] n =

Note: k = the # of conditions n = # of data points in a condition N = total # of data points

y ˆ i = ˆ " T u i ( i th fitted value or i th fit)

Chapter 22. Comparing Two Proportions 1 /29

Business Statistics. Lecture 5: Confidence Intervals

Linear Combinations of Group Means

CHAPTER 7. Hypothesis Testing

Chapter 23: Inferences About Means

The legacy of Sir Ronald A. Fisher. Fisher s three fundamental principles: local control, replication, and randomization.

Chapter 22. Comparing Two Proportions 1 /30

How do we compare the relative performance among competing models?

THE ROYAL STATISTICAL SOCIETY 2015 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE MODULE 3

Tables Table A Table B Table C Table D Table E 675

CH.9 Tests of Hypotheses for a Single Sample

Inferential statistics

Advanced Experimental Design

An Analysis of College Algebra Exam Scores December 14, James D Jones Math Section 01

Lecture 15: Inference Based on Two Samples

1 Statistical inference for a population mean

Descriptive Statistics

4.1 Hypothesis Testing

Lecture Slides. Elementary Statistics. by Mario F. Triola. and the Triola Statistics Series

Ch 2: Simple Linear Regression

T-test: means of Spock's judge versus all other judges 1 12:10 Wednesday, January 5, judge1 N Mean Std Dev Std Err Minimum Maximum

Psychology 282 Lecture #4 Outline Inferences in SLR

Notes for Week 13 Analysis of Variance (ANOVA) continued WEEK 13 page 1

HYPOTHESIS TESTING II TESTS ON MEANS. Sorana D. Bolboacă

Acknowledge error Smaller samples, less spread

Relating Graph to Matlab

Hypotheses and Errors

Lecture 9 Two-Sample Test. Fall 2013 Prof. Yao Xie, H. Milton Stewart School of Industrial Systems & Engineering Georgia Tech

Correlation 1. December 4, HMS, 2017, v1.1

MATH 644: Regression Analysis Methods

Chapter 7. Inference for Distributions. Introduction to the Practice of STATISTICS SEVENTH. Moore / McCabe / Craig. Lecture Presentation Slides

Psych 230. Psychological Measurement and Statistics

For tests of hypotheses made about population, we consider. only tests having a hypothesis of. Note that under the assumption of

The independent-means t-test:

Epidemiology Principles of Biostatistics Chapter 10 - Inferences about two populations. John Koval

Statistics Primer. ORC Staff: Jayme Palka Peter Boedeker Marcus Fagan Trey Dejong

Statistical Inference

Statistics: CI, Tolerance Intervals, Exceedance, and Hypothesis Testing. Confidence intervals on mean. CL = x ± t * CL1- = exp

Transcription:

ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS In our work on hypothesis testing, we used the value of a sample statistic to challenge an accepted value of a population parameter. We focused only on the mean and proportion. The goal was to show that there was a large enough difference between the value of a sample statistic and the population parameter and that the difference could not be due to chance alone. When the difference was large, we argued that the population parameter has changed. When the difference was small, we say that the population parameter has not changed. This approach, however did not give us any information of what causes the change in the population parameter. In this part of our course we will be comparing two populations and use independent and dependent samples to determine what causes the change. Definition of Independent Vs. Dependent Samples Two samples drawn from two populations are independent if the selection of one sample from one population does not affect the selection of the second sample from the second population. Otherwise the samples are dependent. Mean, Standard Deviation and Sampling Distribution of Suppose we select two independent large samples from two different populations that are referred to as Population 1 and Population 2. : 1 : 2 :. 1 :. 2 : 1 30 : 2 30 : 1 : 2 The Central Limit Theorem says that: 1. is approximately normally distributed with mean and std. dev 2. is approximately normally distributed with mean and std. dev Using these results we state the following: 1. The mean of is 2. The Standard deviation of is 1

3. The shape of the sampling distribution of is approximately normal regardless of the shape of the two populations. This is so because the difference between two normally distributed random variables is also normally distributed. 4. For the above to hold true, the samples must both be large. Sampling Distribution, Mean and Std. Dev. of For two large and independent samples selected from two different populations, the sampling distribution of is approximately normal with mean and std. dev. as follows: The mean of is The Standard Deviation of is The value of which gives an estimate of is calculated as Interval Estimation of Confidence Interval for : The 1 100% is if if Examples to be given in class Hypothesis Testing about The value of the test statistic is computed as follows: Inferences about the two population means for small and independent samples: Equal Standard Deviation How and when to use the t-distribution to make inferences about The t-distribution is used to make inferences about when the following assumptions hold true. 1. The two populations from which the two samples are drawn are approximately normally distributed. 2. The samples are small ( 30 30 and independent. 3. The std. dev. of the two populations are unknown but they are assumed to be equal. 2

Pooled Standard deviation for Two Samples The pooled std. dev. for two samples is computed as 1 1 2 Where are the sizes of the two samples and are the variances of the two samples. Here, is called an estimator of. When is used as an estimator of, the std. dev. of is estimated by Estimator of the Standard Deviation of The estimator of the standard deviation of is : 1 1 Now we are ready to discuss the procedures that are used to make confidence intervals and test hypothesis about for small and independent samples selected from two population with unknown but equal standard deviations. Interval Estimation of As we mentioned earlier the difference is the Point Estimator of. The following formulas gives the confidence intervals for when the t-distribution is used. Confidence Interval for. The 1 100% where the value of t is obtained from the t-distribution table for the given confidence level and 2 degrees of freedom. Hypothesis Testing about. When the three hypotheses mentioned earlier are satisfied, the t-distribution is used to make hypothesis test about. The test statistic t in this case is calculated as follows: The value of the test statistic t for is 3

Inferences about the two population means for small and independent samples: Unequal Standard Deviation If all the 3 assumptions made above for the Equal Std. Dev are satisfied, but the std. dev. are not only unknown but unequal, then the procedures for finding confidence intervals and testing of hypothesis about remains similar to what we have learned except for two differences: When the Std. dev. are unknown and unequal: 1. The Degrees of Freedom are no longer given by 2 2. The Std. Dev. of is not calculated using the pooled std. dev. Degrees of Freedom If 1. The two populations from which the two samples are drawn are approximately normally distributed. 2. The samples are small ( 30 30 and independent. 3. The std. dev. of the two populations are unknown and unequal. Then the t-distribution is used to make inferences about and the degrees of freedom for the t-distribution are given by: 1 1 The number given by this formula is always rounded down for df Because the two std. dev. of the two populations are not known, we use as a point estimator of. The following formula is used to calculate the standard deviation 4

Interval Estimation of The difference between the sample means is the point estimator of the difference between the population means. The following formulas give the confidence interval for when the t-distribution is used. The population std. dev. are unknown and are presumed to be unequal. Confidence Interval for The 1 100% where the value of t is obtained from the t-distribution. Hypothesis Testing about The value of the test statistic is computed as follows: POPULATION PROPORTIONS Inferences about the difference between two Population Proportions for large and independent samples Mean, Standard Deviation and Sampling Distribution of For two large and independent samples selected from two different populations, the sampling distribution of is approximately normal with mean and std. dev. as follows: The mean of is The Standard Deviation of is Where 1 1 To construct a confidence interval and test of hypothesis about for large and independent samples, we use the normal distribution. As was indicated earlier, that in the case of proportion, the sample is large if 5 5. In the case of two samples, both samples are large if: 5 & 5 5 & 5 5

Interval Estimation of The difference between the two sample proportions is the point estimator for the difference between the two population proportions. Because we do not know we cannot calculate the value of Therefore we use as the point estimator of. We construct the confidence interval for using the following formulas: Confidence Interval for : The 1 100% is where Examples to be given in class Hypothesis Testing about The testing of hypothesis about for large and independent samples, involve the same five steps as learned earlier. We will calculate the standard deviation of as When a test of hypothesis about is performed the null hypothesis is and the values of are not known. Assuming the null hypothesis is true and, a common value of denoted by is calculated using the following formulas: In order to decide which one of these formulas are to be used, depends on whether the values of or the values of are known. Note that are the number of elements that each of the two samples possess a certain characteristics. The quantity is called the Pooled sample proportion. Using the value of the Pooled sample proportion the standard deviation of is given by the formula: 1 1 1 The Test Statistic z for is 6

The Tree Diagram below is presented as an aid to understand how the formulas are connected and which one to use in calculating the test statistic. DECISION Large Sample Small Sample σ σ σ σ 1 1 2 1 1 7