Chapter 23: Inferences About Means

Similar documents
Chapter 23. Inferences About Means. Monday, May 6, 13. Copyright 2009 Pearson Education, Inc.

Chapter 24. Comparing Means

Chapter 23. Inference About Means

Ch18 links / ch18 pdf links Ch18 image t-dist table

AMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015

Inferences for Regression

Section 9.4. Notation. Requirements. Definition. Inferences About Two Means (Matched Pairs) Examples

INTRODUCTION TO ANALYSIS OF VARIANCE

Sociology 6Z03 Review II

hypotheses. P-value Test for a 2 Sample z-test (Large Independent Samples) n > 30 P-value Test for a 2 Sample t-test (Small Samples) n < 30 Identify α

Business Statistics. Lecture 10: Course Review

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS

Statistical inference provides methods for drawing conclusions about a population from sample data.

y = a + bx 12.1: Inference for Linear Regression Review: General Form of Linear Regression Equation Review: Interpreting Computer Regression Output

Using Tables and Graphing Calculators in Math 11

Chapter 24. Comparing Means. Copyright 2010 Pearson Education, Inc.

STA Module 11 Inferences for Two Population Means

STA Rev. F Learning Objectives. Two Population Means. Module 11 Inferences for Two Population Means

Ch Inference for Linear Regression

Sampling distribution of t. 2. Sampling distribution of t. 3. Example: Gas mileage investigation. II. Inferential Statistics (8) t =

appstats27.notebook April 06, 2017

Correlation and Regression

CHAPTER 10 Comparing Two Populations or Groups

CHAPTER 10 Comparing Two Populations or Groups

The Chi-Square Distributions

Chapter 9 Inferences from Two Samples

The Chi-Square Distributions

LAB 2. HYPOTHESIS TESTING IN THE BIOLOGICAL SCIENCES- Part 2

STA Module 10 Comparing Two Proportions

Statistical Inference for Means

Sampling Distributions

7.2 One-Sample Correlation ( = a) Introduction. Correlation analysis measures the strength and direction of association between

Stat 427/527: Advanced Data Analysis I

Chapter 9. Inferences from Two Samples. Objective. Notation. Section 9.2. Definition. Notation. q = 1 p. Inferences About Two Proportions

Chapter 7. Inference for Distributions. Introduction to the Practice of STATISTICS SEVENTH. Moore / McCabe / Craig. Lecture Presentation Slides

Chapter 12 - Lecture 2 Inferences about regression coefficient

Chapter 27 Summary Inferences for Regression

+ Specify 1 tail / 2 tail

Describing distributions with numbers

Sampling Distributions: Central Limit Theorem

determine whether or not this relationship is.

Lecture Slides. Elementary Statistics. by Mario F. Triola. and the Triola Statistics Series

DETAILED CONTENTS PART I INTRODUCTION AND DESCRIPTIVE STATISTICS. 1. Introduction to Statistics

Difference between means - t-test /25

11: Comparing Group Variances. Review of Variance

10/31/2012. One-Way ANOVA F-test

Chapter 8: Estimating with Confidence

Stat 529 (Winter 2011) Experimental Design for the Two-Sample Problem. Motivation: Designing a new silver coins experiment

The t-test: A z-score for a sample mean tells us where in the distribution the particular mean lies

4.1 Hypothesis Testing

Two-Sample Inferential Statistics

df=degrees of freedom = n - 1

Chapter 8: Estimating with Confidence

Harvard University. Rigorous Research in Engineering Education

Ch. 7. One sample hypothesis tests for µ and σ

Nonparametric tests. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 704: Data Analysis I

Single Sample Means. SOCY601 Alan Neustadtl

Chapter 22. Comparing Two Proportions 1 /29

GEOMETRIC -discrete A discrete random variable R counts number of times needed before an event occurs

9.5 t test: one μ, σ unknown

CHAPTER 7. Hypothesis Testing

Questions 3.83, 6.11, 6.12, 6.17, 6.25, 6.29, 6.33, 6.35, 6.50, 6.51, 6.53, 6.55, 6.59, 6.60, 6.65, 6.69, 6.70, 6.77, 6.79, 6.89, 6.

Data Analysis and Statistical Methods Statistics 651

Review 6. n 1 = 85 n 2 = 75 x 1 = x 2 = s 1 = 38.7 s 2 = 39.2

Two-sample t-tests. - Independent samples - Pooled standard devation - The equal variance assumption

Student s t-distribution. The t-distribution, t-tests, & Measures of Effect Size

The Difference in Proportions Test

23. MORE HYPOTHESIS TESTING

Lecture Slides. Elementary Statistics Eleventh Edition. by Mario F. Triola. and the Triola Statistics Series 9.1-1

Inference for the mean of a population. Testing hypotheses about a single mean (the one sample t-test). The sign test for matched pairs

MATH Chapter 21 Notes Two Sample Problems

Chapter 22. Comparing Two Proportions 1 /30

Lecture 26: Chapter 10, Section 2 Inference for Quantitative Variable Confidence Interval with t

Inference About Means and Proportions with Two Populations. Chapter 10

Hypothesis Testing hypothesis testing approach formulation of the test statistic

Introduction to Business Statistics QM 220 Chapter 12

Distribution-Free Procedures (Devore Chapter Fifteen)

Review of Statistics 101

Data Analysis and Statistical Methods Statistics 651

Dover- Sherborn High School Mathematics Curriculum Probability and Statistics

Statistical Calculations and Tests Using the TI 83/84.

HYPOTHESIS TESTING. Hypothesis Testing

Objectives Simple linear regression. Statistical model for linear regression. Estimating the regression parameters

STA 101 Final Review

Chapter 26: Comparing Counts (Chi Square)

16.3 One-Way ANOVA: The Procedure

Stats Review Chapter 14. Mary Stangler Center for Academic Success Revised 8/16

Advanced Experimental Design

Correlation Analysis

Lecture 3: Inference in SLR

16.400/453J Human Factors Engineering. Design of Experiments II

Chapter 1 Statistical Inference

y ˆ i = ˆ " T u i ( i th fitted value or i th fit)

The goodness-of-fit test Having discussed how to make comparisons between two proportions, we now consider comparisons of multiple proportions.

Introduction to hypothesis testing

STATISTICS 141 Final Review

EC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix)

Stat 101 Exam 1 Important Formulas and Concepts 1

Can you tell the relationship between students SAT scores and their college grades?

Lecture 18: Analysis of variance: ANOVA

Transcription:

Chapter 3: Inferences About Means Sample of Means: number of observations in one sample the population mean (theoretical mean) sample mean (observed mean) is the theoretical standard deviation of the population mean (you will use this only when the standard deviation of the population is known, i.e., it will not be used in this chapter) sample standard deviation (observed standard deviation with 1 in the denominator) is the standard error of the sample mean (use this when the standard deviation of the population is unknown, in which case you calculate the standard deviation of the sample,. This is what will be used in this chapter) t distribution:, General Student s t Model with 1 degrees of freedom, mean and standard deviation t value: If the sample size is large enough, the sample of means approximates a Normal Distribution. Confidence Interval: One sample interval for the mean: Standard Error: Degrees of Freedom: High : As, t distribition Normal Distribution On the TI 84 Calculator: o STAT CALC 1 Var Stats to get the required statistics for o,, you have the value; you want the desired area o, you have the area; you want the desired value Page 1 of 10

Using vs. If is known, use and If is NOT known, use and Key Assumptions and Conditions for a Difference of Two Proportions (RITN): : Data should be sampled at random or generated from a properly randomized experiment. Independence: Sample values are independent of each other. Condition: 10% of the population for each group. Nearly Normal Condition: the source population should be nearly Normal, i.e., unimodal and symmetric. Check this for the sample using: o A histogram o A plot of Normal scores Size of Sample Required to Use t Distribution : Population should be normally distributed : Population should be nearly normal (unimodal and symmetric) : Generally safe unless the population is very skewed (also see special considerations below) Special Considerations Are Required For (MOBS): Multimodal data (separate the data so that it becomes unimodal) Outliers (do analysis with and without outliers) Bias (remove the bias or use special techniques) Skewed data (increase the value of n, or use special techniques) Two Sets of Language for a Confidence Interval (p. 54 BVD Textbook) A 95% confidence interval tells us that we will capture the true value of the statistic (identify which statistic) in 95% of the confidence intervals developed from samples of the same size. I am 95% confident that the true value of the statistic (state which statistic) is between the lower and upper values of my confidence interval. Page of 10

Hypothesis Testing (One sample test) : : or : or : t value: Standard Error: Degrees of Freedom: 1 Relationship between Intervals and Tests A Level C Confidence Interval contains all of the possible null hypothesis values that would NOT be rejected by a sided hypothesis test with. Finding the Required Sample Size for a Given Margin of Error (ME) Use the formula:. Note: the book (BVD) substitutes for, using the formula which does not provide a precise value of n. The process described below is better, but you may not be required to know it for this course. This process should be iterative because the appears twice in the equation we need to solve once in the calculation of and once in the denominator of. Steps (example on next page): Note: the book (BVD) stops after Step 1. 1. Assume a preliminary value of to calculate the value of. Alternatively, replace with in the first calculation (i.e., in the first iteration, use the z interval formula to calculate a preliminary of ).. Leave the in the denominator because that is what we are trying to calculate. 3. Calculate the value of in the denominator from the formula: 4. Use the calculated value of to calculate a new value of. 5. Repeat Steps 3 and 4 until you hone in on a single value of.. Page 3 of 10

Example: Finding the Required Sample Size for a Given Margin of Error (ME) Note: The book (BVD) describes a non iterative process, which does not generate a precise value of n. The process laid out in this example goes beyond that to calculate a more precise value of n. Assume we are given the following: % %.. Find the required value of. Step 1: Begin with the z interval formula: interval is 1.960. Step : The formula becomes:. 1.96. Step 3: Solve to get. Remember to round up!. The value for a 95% confidence or 1.96.. Step 4: Using the new value of, calculate a tail t value:.101 Step 5: Calculate n in the formula... or... Solve to get. Remember to round up! Repeat: Using the new value of, calculate a tail t value:.080... or... Solve to get. Remember to round up! Repeat: Using the new value of, calculate a tail t value:.086... or... Solve to get. Remember to round up! We now find ourselves in a repeating loop between and. In such a case, we would use the higher number to assure we get the proper confidence level. In this example, we would use. Page 4 of 10

Chapter 4: Comparing Means Boxplots Draw boxplots to compare the distributions of the two samples. The analysis may need to be run twice if there are outliers once with and once without outliers. Sample of Means: Means: x 1 = mean of Sample 1 x = mean of Sample μ(x 1 x ) = x 1 x Variance: Var(x 1 x ) = σ 1 + σ Standard Deviation: SD(x 1 x ) = σ 1 + σ n n Not used in this chapter. Standard Error: SE(x 1 x ) = s 1 + s n Confidence Interval: Two-sample t-interval for the difference of means: (x 1 x ) ± t df SE(x 1 x ) Standard Error: SE(x ) = s 1 + s Degrees of Freedom: df = n s 1 n1 + s n 1 n1 1 s 1 n1 + 1 n 1 s n Margin of Error (ME) On the TI-84 Calculator: o STAT TESTS -SampTTest: Enter the requested information and hit ENTER o Read the Degrees of Freedom and P-Value from the output Page 5 of 10

Key Assumptions and Conditions for a Difference of Two Proportions (RITN): : Data within each group should be sampled at random or generated from a properly randomized experiment. Independence: Sample values within each group are independent of each other. In addition the two groups must be independent of each other. Data must not be paired (paired data e.g., data on husbands and wives are handled in Chapter 5). Ten-Percent Condition: n 10% of the population for each group. Nearly Normal Condition: the source population for each sample should be nearly Normal, i.e., unimodal and symmetric. Check this for the sample using: o A histogram o A plot of Normal scores Size of Sample Required to Use t-distribution n < 15: Population should be normally distributed 15 n 40: Population should be nearly normal (unimodal and symmetric) n > 40: Generally safe unless the population is very skewed (also see special considerations below) Special Considerations Are Required For (MOBS): Multimodal data (separate the data so that it becomes unimodal) Outliers (do analysis with and without outliers) Bias (remove the bias or use special techniques) Skewed data (increase the value of n, or use special techniques) Hypothesis Testing (Two-sample t-test) H 0 : μ 1 μ = 0 Usually, 0 will be equal to zero. H A : μ 1 μ 0 or H A : μ 1 μ < 0 or H A : μ 1 μ > 0 t-value: t df = (x 1 x ) 0 SE(x 1 x ) Standard Error: SE(x 1 x ) = s 1 + s Degrees of Freedom: see formula on previous page (difference in means) (tested difference) = (standard error of the mean) n Page 6 of 10

Pooled t-interval and t-test Additional assumption required: the two samples must have equal variances. This can happen, for example, in a randomized comparative experiment where the two samples are from the same population. The Pooled t-interval and t-test are rarely used. s pooled = ( 1) s 1 + (n 1) s ( 1) + (n 1) SE pooled = s pooled + s pooled n = s pooled 1 + 1 n Confidence Interval: (x 1 x ) ± t df SE pooled (x 1 x ) Hypothesis Test: t = (x 1 x ) 0 SE pooled (x 1 x ) df = + n Note: usually, 0 will be equal to zero. Page 7 of 10

Chapter 5: Inferences About Paired Data When two samples are not independent of each other, they are called Paired Data. When paired data are analyzed, we look at the differences of the paired values and use the techniques described in Chapter 3. Pairing From an experiment: a type of blocking From an observational study: a type of matching Sample of Means: number of paired observations of differences (data which are not paired are excluded from the Paired Data analysis) theoretical mean of the differences in the underlying populations sample mean of the differences of the pairs (observed mean of differences) sample standard deviation (observed standard deviation of the differences with 1 in the denominator) is the standard error of the mean of the differences Tested Difference in Means:. Usually this value is zero; i.e, 0. t value: Confidence Interval: Paired interval for the mean: Standard Error: Degrees of Freedom: On the TI 84 Calculator: o Enter the two samples into and ; then or o STAT CALC 1 Var Stats to get the required statistics for o,, you have the value; you want the desired area o, you have the area; you want the desired value Page 8 of 10

Key Assumptions and Conditions for a Difference of Two Proportions (RITN): : Data within each group should be sampled at random or generated from a properly randomized experiment. Independence: Sample values are independent of each other. However, the two samples must NOT be independent of each other. Ten-Percent Condition: n 10% of the population for each group. Nearly Normal Condition: the source population should be nearly Normal, i.e., unimodal and symmetric. Check this for the sample using: o A histogram o A plot of Normal scores Paired Data: the data must pair naturally. You cannot decide to pair independent samples. Whether or not data are paired is generally easy to determine if you know how the data were collected. Hypothesis Testing (Paired t-test) H 0 : μ d = 0 Usually, 0 will be equal to zero. H A : μ d 0 or H A : μ d < 0 or H A : μ d > 0 t-value: t n 1 = d 0 SE(d) Standard Error: SEd = s d n Degrees of Freedom: df = n 1 = (sample mean) (theoretical mean) (standard error of the mean) Sample Size Requirements and Special Considerations are the same as they are in Chapters 3 and 4. Page 9 of 10

Proportions Independence within sample Sample Size Success/Failure Difference of Proportions Independence between and within samples Sample Size Success/Failure Statistical Inference Summary known not known Means Means Difference of Means Paired Mean Independence within sample Sample Size Success/Failure Independence within sample Nearly Normal Independence between and within samples Nearly Normal Independence within sample Nearly Normal Paired data Hypothesis Testing : : 0 : : : : (1) () (3) (3) Confidence Interval Notes: (1) the pooled value of ; () If is the same in the two populations, pooled values of s and SE may be used, but pooling is not required. 1 (4) 1 (3) Usually, 0. (4) Page 10 of 10