Chapter 8: Confidence Interval Estimation: Further Topics

Similar documents
Chapter 3. Comparing two populations

Econ 325: Introduction to Empirical Economics

ST 371 (IX): Theories of Sampling Distributions

1 Statistical inference for a population mean

MAT 2379, Introduction to Biostatistics, Sample Calculator Questions 1. MAT 2379, Introduction to Biostatistics

Psy 420 Final Exam Fall 06 Ainsworth. Key Name

Sampling Distributions: Central Limit Theorem

Lecture 10: Comparing two populations: proportions

Review 6. n 1 = 85 n 2 = 75 x 1 = x 2 = s 1 = 38.7 s 2 = 39.2

Confidence Intervals, Testing and ANOVA Summary

Hypothesis testing: Steps

Marketing Research Session 10 Hypothesis Testing with Simple Random samples (Chapter 12)

Content by Week Week of October 14 27

Purposes of Data Analysis. Variables and Samples. Parameters and Statistics. Part 1: Probability Distributions

σ. We further know that if the sample is from a normal distribution then the sampling STAT 2507 Assignment # 3 (Chapters 7 & 8)

2.57 when the critical value is 1.96, what decision should be made?

Hypothesis testing for µ:

QUEEN S UNIVERSITY FINAL EXAMINATION FACULTY OF ARTS AND SCIENCE DEPARTMENT OF ECONOMICS APRIL 2018

Hypothesis testing: Steps

The point value of each problem is in the left-hand margin. You must show your work to receive any credit, except in problem 1. Work neatly.

Chapter 10: Inferences based on two samples

Practice Questions: Statistics W1111, Fall Solutions

Chapter 10. Correlation and Regression. McGraw-Hill, Bluman, 7th ed., Chapter 10 1

Statistical Inference Theory Lesson 46 Non-parametric Statistics

STATISTICS 141 Final Review

Two-Sample Inferential Statistics

UNIVERSITY OF TORONTO Faculty of Arts and Science

Sections 7.1 and 7.2. This chapter presents the beginning of inferential statistics. The two major applications of inferential statistics

Chapter 8: Confidence Intervals

MATH 250 / SPRING 2011 SAMPLE QUESTIONS / SET 3

Basic Statistics and Probability Chapter 9: Inferences Based on Two Samples: Confidence Intervals and Tests of Hypotheses

1 Hypothesis testing for a single mean

ST 305: Final Exam ( ) = P(A)P(B A) ( ) = P(A) + P(B) ( ) = 1 P( A) ( ) = P(A) P(B) ( ) σ X 2 = σ a+bx. σ ˆp. σ X +Y. σ X Y. σ Y. σ X. σ n.

Test 3 Practice Test A. NOTE: Ignore Q10 (not covered)

Inferences Based on Two Samples

Extra Exam Empirical Methods VU University Amsterdam, Faculty of Exact Sciences , July 2, 2015

Chapter 6 Estimation and Sample Sizes

Chapter 18. Sampling Distribution Models. Bin Zou STAT 141 University of Alberta Winter / 10

Statistics 300. Time for Lane

1 MA421 Introduction. Ashis Gangopadhyay. Department of Mathematics and Statistics. Boston University. c Ashis Gangopadhyay

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. describes the.

Analysis of Variance. Source DF Squares Square F Value Pr > F. Model <.0001 Error Corrected Total

STAT Chapter 9: Two-Sample Problems. Paired Differences (Section 9.3)

PHP2510: Principles of Biostatistics & Data Analysis. Lecture X: Hypothesis testing. PHP 2510 Lec 10: Hypothesis testing 1

Hint: The following equation converts Celsius to Fahrenheit: F = C where C = degrees Celsius F = degrees Fahrenheit

Comparing Means from Two-Sample

Chapter 10. Correlation and Regression. McGraw-Hill, Bluman, 7th ed., Chapter 10 1

Basic Linear Model. Chapters 4 and 4: Part II. Basic Linear Model

Exam 1 Solutions. Problem Points Score Total 145

The variable θ is called the parameter of the model, and the set Ω is called the parameter space.

SALES AND MARKETING Department MATHEMATICS. 2nd Semester. Bivariate statistics. Tutorials and exercises

hypotheses. P-value Test for a 2 Sample z-test (Large Independent Samples) n > 30 P-value Test for a 2 Sample t-test (Small Samples) n < 30 Identify α

Midterm 1 and 2 results

2011 Pearson Education, Inc

1988 Chilean Elections

Inferences About Two Population Proportions

Lecture 14. Analysis of Variance * Correlation and Regression. The McGraw-Hill Companies, Inc., 2000

Lecture 14. Outline. Outline. Analysis of Variance * Correlation and Regression Analysis of Variance (ANOVA)

Estadística I Exercises Chapter 4 Academic year 2015/16

2007 First-Class Mail Rates for Flats* Weight (oz) Rate (dollars) Weight (oz) Rate (dollars)

The point value of each problem is in the left-hand margin. You must show your work to receive any credit, except on problems 1 & 2. Work neatly.

Multiple Choice Practice Set 1

Test statistic P value Reject/fail to reject Conclusion in Context:

Data, Statistics, and Probability Practice Questions

Confidence Intervals for the Mean of Non-normal Data Class 23, Jeremy Orloff and Jonathan Bloom

n i n T Note: You can use the fact that t(.975; 10) = 2.228, t(.95; 10) = 1.813, t(.975; 12) = 2.179, t(.95; 12) =

[ z = 1.48 ; accept H 0 ]

their contents. If the sample mean is 15.2 oz. and the sample standard deviation is 0.50 oz., find the 95% confidence interval of the true mean.

i=1 X i/n i=1 (X i X) 2 /(n 1). Find the constant c so that the statistic c(x X n+1 )/S has a t-distribution. If n = 8, determine k such that

ECON 427: ECONOMIC FORECASTING. Ch1. Getting started OTexts.org/fpp2/

There s a science to it!

Statistics for Business and Economics

Student s t-distribution. The t-distribution, t-tests, & Measures of Effect Size

ECO220Y Review and Introduction to Hypothesis Testing Readings: Chapter 12

Chapter 6: SAMPLING DISTRIBUTIONS

GEOG 100E Introduction to Geography (5 credits)

Multiple Regression Analysis: The Problem of Inference

Lecture 11. Data Description Estimation

Tables Table A Table B Table C Table D Table E 675

Gov 2000: 6. Hypothesis Testing

LECTURE 12 CONFIDENCE INTERVAL AND HYPOTHESIS TESTING

Confidence Intervals for the Sample Mean

Hypothesis Testing in Action

Mathematical statistics

This paper is not to be removed from the Examination Halls

STA 2101/442 Assignment 2 1

Comparison of Two Population Means

The independent-means t-test:

Quantitative Introduction ro Risk and Uncertainty in Business Module 5: Hypothesis Testing

Review: General Approach to Hypothesis Testing. 1. Define the research question and formulate the appropriate null and alternative hypotheses.

104 Business Research Methods - MCQs

Inference for the Regression Coefficient

Power of a test. Hypothesis testing

Point Estimation and Confidence Interval

You are allowed two hours to answer this question paper. All questions are compulsory.

Chapter 22. Comparing Two Proportions. Bin Zou STAT 141 University of Alberta Winter / 15

MATH 1150 Chapter 2 Notation and Terminology

MAT2377. Rafa l Kulik. Version 2015/November/23. Rafa l Kulik

Problem Set 4 - Solutions

Chapter 4: Probability and Probability Distributions

Transcription:

Chapter 8: Confidence Interval Estimation: Further Topics Department of Mathematics Izmir University of Economics Week 11 2014-2015

Introduction In this chapter we will focus on inferential statements concerning estimation of estimation of two populations and the comparison of two means from normally distributed populations (mainly) and two population proportions.

Confidence Intervals for the Difference Between TWO Normally Distributed Population Means Our analysis is based on the sample results. We present two sampling schemes for analyzing means: The first sampling scheme is for dependent samples and the second scheme is for independent samples. For independent samples we will discuss the theory (similar to Chapter 7) when population variance is known and when the population variance is unknown. When the population variance is unknown we consider the following cases: assuming that the population variances are equal or NOT assuming that they are equal.

Confidence Intervals for the Difference Between TWO Normal Population Means: DEPENDENT Samples In this section, we first deal with the sampling schemes for DEPENDENT samples, that is, the values in one sample are influenced by the values in the other sample. This procedure can be also called as matched pairs. Assumptions: Let x 1, x 2,..., x n be the values of the observations from the population with mean µ x, y 1, y 2,..., y n be the matched sampled values from the population with mean µ y, d i = x i y i be the n differences, d = d i be the sample mean for the n differences, and s d = n (di d) 2 n 1 be the sample standard deviation for the n differences. The population distribution of the differences is normal.

The 100(1 α)% confidence interval for the difference between means µ d = µ x µ y is d t n 1, α 2 s d < µ d < d s d + t n 1, α n 2. n The confidence interval can be also written as d ± ME.

Example: A medical study was conducted to compare the difference in effectiveness of two particular drugs in lowering cholesterol levels. The research team used a paired sample approach to control variation in reduction that might be due to factors other than the drug itself. Each member of a pair was matched by age, weight, lifestyle, and other pertinent factors. Drug X was given to one person randomly selected in each pair and drug Y was given to the other individual in the pair. After a specified amount of time each persons cholesterol level was measured again. Suppose that a random sample of nine pairs of patients with known cholesterol problems is selected from the large populations of participants. Estimate with a 99% confidence level the mean difference in the effectiveness of the two drugs X and Y to lower cholesterol where Pair Drug X Drug Y Differences (d i = x i y i ) 1 29 26 3 2 32 27 5 3 31 28 3 4 32 27 5 5 30 6 32 30 2 7 29 26 3 8 31 33-2 9 30 36-6

Confidence Intervals for the Difference Between TWO Normal Population Means: INDEPENDENT Samples with KNOWN population variances Assumptions: Suppose we have two independent random samples of sizes n x and n y observations from normally distributed populations with means µ x and µ y, and variances σ 2 x and σ 2 y. The sample means are x and ȳ.

The 100(1 α)% confidence interval for µ x µ y is σx x ȳ z 2 α 2 + σ2 y < µ x µ y < x ȳ + z α n x n 2 y The confidence interval can be also written as σx 2 + σ2 y. n x n y x ȳ ± ME.

Example: From a very large university, independent random samples of 120 students majoring in marketing and 90 students majoring in finance were selected. The mean GPA for the random sample of marketing majors was found to be 3.08, and the mean GPA for the random sample of finance majors was 2.88. From similar past studies the population standard deviation for the marketing majors is assumed to be 0.42; similarly, the population standard deviation for the finance majors is 0.64. Denoting the population mean for marketing majors by µ x and the population mean for finance majors by µ y, find a 95 % confidence interval for µ x µ y.

Confidence Intervals for the Difference Between TWO Normal Population Means: INDEPENDENT Samples with UNKNOWN population variances In this section we consider the case where the population variances are unknown. At this point, one should ask whether the unknown population variances are assumed to be equal or not. For this purpose, we present two situations: A. Population variances are assumed to be EQUAL. B. Population variances are NOT assumed to be EQUAL.

Confidence Intervals for the Difference Between TWO Normal Population Means: INDEPENDENT Samples with UNKNOWN population variances (Case A: Population variances are assumed to be EQUAL) Assumptions: Suppose we have two independent random samples of sizes n x and n y observations from normally distributed populations with means µ x and µ y, and an unknown but common variance. The observed sample means are x and ȳ and the observed sample variances are s 2 x and s 2 y.

The 100(1 α)% confidence interval for µ x µ y is x ȳ t nx +n y 2, α 2 where s 2 p n x + s2 p n y < µ x µ y < x ȳ + t nx +n y 2, α 2 s 2 p = (nx 1)s2 x + (n y 1)s 2 y n x + n y 2 is called the pooled sample variance. The confidence interval can be also written as x ȳ ± ME. s 2 p n x + s2 p n y,

Example: The residents of St.Paul, Minnesota, complain that traffic speeding fines given in their city are higher than the traffic speeding fines that are given in nearby Minneapolis. Independent random samples of the amounts paid by residents for speeding tickets in each of the two cities over the last 3 months were obtained. These amounts were St.Paul Minneapolis 100 95 125 87 135 100 128 75 140 110 142 105 128 85 137 95 156 142 Assuming equal population variances, find a 95% confidence interval for the difference in the mean costs of speeding tickets in these two cities.

Confidence Intervals for the Difference Between TWO Normal Population Means: INDEPENDENT Samples with UNKNOWN population variances (Case B: Population variances are NOT assumed to be EQUAL) Assumptions: Suppose we have two independent random samples of sizes n x and n y observations from normally distributed populations with means µ x and µ y, and unknown and NOT EQUAL variances. The observed sample means are x and ȳ and the observed sample variances are s 2 x and s 2 y.

The 100(1 α)% confidence interval for µ x µ y is x ȳ t v, α 2 s 2 x n x + s2 y n y < µ x µ y < x ȳ + t v, α 2 where the degrees of freedom is v = [ s 2 x n x + s2 y n y ] 2 ( ( ) s 2 2 s 2 ) 2 y x nx + ny n x 1 n y 1 The confidence interval can be also written as. s 2 x n x + s2 y n y, x ȳ ± ME.

Example: An accounting firm conducted a random sample of the accounts payable for the east and the west offices of one of its clients. Among these two independent samples they wanted to estimate the difference between the population mean values of the payables. The sample statistics obtained were as follows: East office(population x) West office (population y) Sample mean 290 250 Sample size 16 11 Sample standard deviation 15 50 We do not assume that the unknown population variances are equal. Estimate the difference between the mean values of the payables for the two offices. Use a 95% confidence level.

Confidence Interval for the Difference Between TWO Population Proportions (Large Samples) Often a comparison of two population proportions is of much interest. In this section, we consider confidence intervals for the difference between two population proportions with independent large samples taken from these two populations. Assumptions: Suppose we have two independent random samples of sizes n x and n y with proportions of "successes" p x and p y, respectively. Sample proportions are denoted as ˆp x and ˆp y. Note that, we examine the random variable ˆp x ˆp y.

If the sample sizes are large (generally at least 40 observations in each sample), a 100(1 α)% confidence interval for the difference between two population proportions, (p x p y ), is ˆpx (1 ˆp x ) ˆpy (1 ˆpy ) ˆpx (1 ˆp x ) ˆpy (1 ˆpy ) ˆp x ˆp y z α + < p x p y < ˆp x ˆp y z α +. 2 n x n y 2 n x n y The confidence interval can be also written as (ˆp x ˆp y ) ± ME.

Example: During a presidential election year, many forecasts are made to determine how voters perceive a particular candidate. In a random sample of 120 registered voters in precinct X, 107 indicated that they supported the candidate in question. In an independent random sample of 141 registered voters in precinct Y, only 73 indicated support for the same candidate. The respective population proportions are denoted p x and p y. Find a 95 % confidence interval for the population difference (p x p y).

Example: In a random sample of n x = 120 large retailers, 85 used regression as a method of forecasting. In an independent random sample of n y = 163 small retailers, 78 used regression as a method of forecasting. Find a 98% confidence interval for the difference between the two population proportions.