Chapters 4-6: Inference with two samples Read sections 4.2.5, 5.2, 5.3, 6.2

Size: px
Start display at page:

Download "Chapters 4-6: Inference with two samples Read sections 4.2.5, 5.2, 5.3, 6.2"

Transcription

1 Chapters 4-6: Inference with two samples Read sections 45, 5, 53, 6 COMPARING TWO POPULATION MEANS When presented with two samples that you wish to compare, there are two possibilities: I independent samples and II paired samples I INDEPENDENT SAMPLES: Select a SRS of size from populatio which has mean µ and standard deviation σ Independently from the first sample, select a SRS of size from populatio which has mean µ and standard deviation σ If sampling a finite population without replacement, then 05N and 05N The difference between the two population means is the parameter of interest, µ µ The statistic X X is a point estimator of the parameter µ µ Note about Study Design: Completely Randomized Design Experiment Two treatment groups from a CRD are independent In a CRD, it is possible to claim that the treatment caused the difference in means If the individuals in the CRD were chosen from a SRS, then conclusions about µ µ can be extended to the populations from which the individuals were drawn However, if the individuals were not a SRS, then conclusions to larger populations are dubious Observational Study Two samples to be compared in an observational study are considered to be independent if individuals were randomly chosen from each respective population Do not claim that an explanatory variable caused the difference in means Facts about the sampling distribution of X X : µ X X = µ µ, so X X is an unbiased point estimator of µ µ σ = σ X X + σ σ and σ X = X + σ IfthesamplingdistributionsofX andx areapproximately ( normal,thenthesamplingdistribution σ of X X is approximately normal, ie, X X N µ µ, + σ )

2 Un-pooled Approach to estimating and testing µ µ (53 and 53): A Confidence Interval for µ µ is X X ± t α,df S + S df = (V +V ) V + V and V = S and V = S A conservative degree of freedom estimate is df = min(, ) Check the assumptions that the SRS s are independent and that X and X are normal! A Hypothesis Test for µ µ is: Hypotheses: H 0 : µ µ = 0 where 0 is a specific value (usually 0) H a : µ µ > 0 Choose one: H a : µ µ < 0 H a : µ µ 0 NOTE: Perform steps -4 assuming that H 0 is true! Assumptions: (a) SRS s from each population (b) The two SRS s are independent of each other (c) If sampling finite populations without replacement, then 05N and 05N (d) The sampling distributions of X and X must be at least approximately normal (so the data is normal OR, 5 for symmetric non-normal data OR, 30 for skewed data) If H 0 is true, then ( X X ) N ( 0, σ + σ ) 3 Test Statistic: T = X X 0 S + S When H 0 is true, T t(df) where df = (V +V ) V + V A conservative degree of freedom estimate is df = min(, ) 4 The p-value is the probability of obtaining a sample statistic that is as extreme or more extreme than what was actually observed assuming that H 0 is true Alternative Hypothesis p-value H a : µ µ > 0 P(T > t) H a : µ µ < 0 P(T < t) H a : µ µ 0 P(T > t ) 5 Decision: Given a predetermined significance level α: Reject H 0 if p-value α Fail to Reject H 0 if p-value > α 6 Conclusion: a statement of how much evidence there is for the alternative hypothesis If you reject H 0, then you conclude The evidence suggests H a

3 If you fail to reject H 0, then you conclude The evidence fails to suggest H a In each case, specify H a in terms of the problem EXAMPLE: To compare the mean cholesterol content (in milligrams) of grilled chicken sandwiches from Arby s and McDonald s, you randomly select several sandwiches from each restaurant and measure their cholesterol contents From Arby s, = 5 sandwiches are randomly selected, x = 60 mg and s = 359 mg From McDonald s, = sandwiches are randomly selected, x = 70 mg and s = 4 mg Construct an 90% confidence interval for the difference in mean cholesterol content of grilled chicken sandwiches Is there sufficient evidence to conclude the McDonald s grilled chicken sandwiches have a higher cholesterol content, on average, compared to Arby s? Hypotheses: Check assumptions: 3 Test statistic: 4 p-value: 5 Decision: 6 Conclusion: 3

4 Pooled Approach to testing and estimating µ µ (536): Assume that σ = σ, called the homogeneity of variance assumption The pooled approach is more powerful than the un-pooled approach (ie Power= β is larger) if σ and σ are truly equal since the degrees of freedom for the pooled approach is larger than the degrees of freedom for the un-pooled approach If σ p = σ = σ, then σ X X = σ p + where σ p is the pooled population variance Confidence Interval for µ µ is X X ± t α,df S p + The pooled sample variance is S p = ( )S +( )S +, an unbiased estimator of σ p df = + Check the assumptions that the SRS s are independent and that X and X are normal! Hypothesis Testing for µ µ : Hypotheses: H 0 : µ µ = 0 where 0 is a specific value (usually 0) H a : µ µ > 0 Choose one: H a : µ µ < 0 H a : µ µ 0 NOTE: Perform steps -4 assuming that H 0 is true! Assumptions: (a) SRS s from each population (b) The two SRS s are independent of each other (c) If sampling finite populations without replacement, then 05N and 05N (d) The sampling distributions of X and X must be at least approximately normal (so the data is normal OR, 5 for symmetric non-normal data OR, 30 for skewed data) If H 0 is true, then ( X X ) N ( 0,σ p + ) (e) Homogeneity of variance: σ = σ If one sample standard deviation is more than twice the other, then this assumption is suspect 3 Test Statistic: T = X X 0 S p + If H 0 is true, T t(df) where df = + 4, 5 and 6 Find the p-value, make a Decision, and give a Conclusion as in the un-pooled case EXAMPLE: Redo the grilled chicken problem using a pooled variance this time 4

5 II PAIRED SAMPLES (5 and 5): Samples can be paired when: From a SRS of size n, two measurements (X i,y i ) are recorded for each individual, such as when there are before-treatment and after-treatment measurements From two SRS s, each of size n, two measurements (X i,y i ) are recorded for two similar individuals, such as when there are blocks of two individuals, and each individual gets a different treatment The paired measurements are subtracted, d i = X i Y i The differences d,d,,d n form a new set sample of data, which is analyzed using the one sample procedures from Chapter 0 The sample mean difference, d, is an unbiased point estimator of the parameter, µd, the mean of the population of differences Your book uses the notation x d instead of d The sampling distribution of d has mean µ d = µ d and sampling variability σ d = σ d n A Confidence Interval for µ d is d ± t α n Check the Assumption that d is normal!,n S d A Hypothesis Test for µ d : Hypotheses: H 0 : µ d = δ 0 where δ 0 is a specific value (usually 0) H a : µ d > δ 0 Choose one: H a : µ d < δ 0 H a : µ d δ 0 NOTE: Perform steps -4 assuming that H 0 is true! Assumptions: Same as for hypothesis testing using one sample: (a) The data must be a SRS (b) If sampling a finite population without replacement, then 05N n (c) The sampling distribution of d must be at least approximately normal (so the differences are normal OR n 5 for symmetric non-normal differences OR n 30 for skewed differences) If H 0 is true, then d ( N δ 0, σ d n ) 3 The Test Statistic is T = d δ 0 S d n 4 p-value where T t(n ) when H 0 is true Alternative Hypothesis p-value H a : µ d > δ 0 P(T > t) H a : µ d < δ 0 P(T < t) H a : µ d δ 0 P(T > t ) 5 and 6 Make a Decision and give a Conclusion 5

6 EXAMPLE: An herbal medicine is tested o4 randomly selected patients with sleeping disorders The table shows the hours of sleep patients got during one night without using the herbal medicine and during another night after the herbal medicine was administered Patient Without medicine With medicine Difference The mean difference in sleep is d = and the sample standard deviation of the differences is s d = 0677 Construct an 95% confidence interval for µ d Is there sufficient evidence to suggest the herbal medicine provides a longer night s sleep, on average? Hypotheses: Check assumptions: 3 Test statistic: 4 p-value: 5 Decision: 6 Conclusion: 6

7 COMPARING POPULATION PROPORTIONS FROM INDEPENDENT SAMPLES Collect binary data from a SRS of size from populatio, which has mean p Collect binary data from a SRS of size from populatio, which has mean p If sampling a finite population without replacement, then 05N and 05N When comparing two population proportions, the difference between the two population proportions is the parameter of interest, p p The statistic ˆp ˆp is a point estimator of the parameter p p Note about Study Design: See notes for comparing two population means Facts about the sampling distribution of ˆp ˆp (6): µˆp ˆp = p p so ˆp ˆp is an unbiased point estimator of p p σ ˆp = p ( p ) ˆp + p ( p ) and = p ( p ) σˆp ˆp + p ( p ) If p 0, ( p ) 0, p 0, and ( p ) 0 then p ( p ) (ˆp ˆp ) N p p, + p ( p ) Estimating and Testing p p (6 and 63): ˆp ( ˆp AConfidence Interval for p p is ˆp ˆp ± z ) α + ˆp ( ˆp ) that the SRS s are independent and that ˆp and ˆp are normal! Checktheassumptions A Hypothesis Test for p p : Hypotheses: H 0 : p p = 0 where 0 is a specific value (usually 0) H a : p p > 0 Choose one: H a : p p < 0 H a : p p 0 NOTE: Perform steps -4 assuming that H 0 is true! Assumptions: Under H 0, if 0 = 0, then p = p = p, and so an estimate of p is ˆp = total successes total sample size = count +count + (a) SRS s from each population (b) The two SRS s are independent of each other (c) If sampling a finite population without replacement, then 05N and 05N (d) The sampling distributions for ˆp and ˆp must be approximately normal 7

8 If 0 0, then check the sample size like this: ˆp 0, ( ˆp ) 0, ˆp 0, and ( ˆp ) 0 If 0 = 0, use ˆp to check the sample size: ˆp 0, ( ˆp) 0, ˆp 0, and ( ˆp) 0 For large sample sizes, if H 0 is true, p ( p ) (ˆp ˆp ) N 0, + p ( p ) For small sample sizes, consider Fisher s exact test (65 and 653) 3 Test Statistic: If 0 0, then Z = ˆp ˆp 0 ˆp ( ˆp ) + ˆp ( ˆp ) If 0 = 0, then p = p = p and now Z = ˆp ˆp 0 ˆp( ˆp) + ˆp( ˆp) Z N(0, ) if H 0 is true 4 p-value: Alternative Hypothesis p-value H a : p p > 0 P(Z > z) H a : p p < 0 P(Z < z) H a : p p 0 P(Z > z ) 5 and 6 Make a Decision and give a Conclusion EXAMPLE: Among 00 randomly selected male car occupants over the age of 8, 7% wear seat belts Among 380 randomly selected female car occupants over the age of 8, 84% wear seat belts Test the claim that both genders have the same rate of seat belt use Use α = 005 Does there appear to be a gender gap? Hypotheses: Check assumptions: 3 Test statistic: 4 p-value: 8

9 5 Decision: 6 Conclusion: Construct a 90% CI to estimate the true difference in the proportions of seat belt users in the male and female populations R code # Comparing Two Population Means using UNPOOLED Variance: # Arby s and McDonald s chicken sandwiches Example > ttest(arby,mcd,conflevel=090,varequal=false,alternative="twosided") Two Sample t-test data: Arby and McD t = , df = 437, p-value = e-09 alternative hypothesis: true difference in means is not equal to 0 90 percent confidence interval: sample estimates: mean of x mean of y # Comparing Two Population Means using POOLED Variance: # Arby s and McDonald s chicken sandwiches Example > ttest(arby,mcd,conflevel=090,varequal=true,alternative="twosided") Two Sample t-test data: Arby and McD t = , df = 5, p-value = 786e-09 alternative hypothesis: true difference in means is not equal to 0 90 percent confidence interval: sample estimates: mean of x mean of y # Comparing Two Population Means using Matched Pairs t-test: 9

10 # Herbal Medicine Example # Here s the data given as a table: > nomed=c(,4,34,37,5,5,5,3,55,58,4,48,9,45) > med=c(9,33,35,44,5,5,5,3,6,65,44,47,3,47) > rbind(nomed,med,nomed-med) [,] [,] [,3] [,4] [,5] [,6] [,7] [,8] [,9] [,0] [,] [,] [,3] [,4] nomed med > ttest(nomed,med,conflevel=095,paired=true,alternative="less") Paired t-test data: nomed and med t = -4094, df = 3, p-value = alternative hypothesis: true difference in means is less than 0 95 percent confidence interval: -Inf sample estimates: mean of the differences # Comparing Two Population Proportions: # Seat Belt Example > proptest(c(7*00,84*380),c(00,380),conflevel=95,alternative="twosided", correct=f) -sample test for equality of proportions without continuity correction data: c(07 * 00, 084 * 380) out of c(00, 380) X-squared = 96686, df =, p-value < e-6 alternative hypothesis: twosided 95 percent confidence interval: sample estimates: prop prop Exercises Two independent means on p63: 55, 56 (answers: Yes because n is large; test statistic is -578, p-value is tiny (< 0 6 ), so the evidence suggests that the age difference is not due to chance), odd, 538 (answers: T (if first sample is symmetric and the second is skewed) TF) Paired means on p59: 55, 56 (answers: TTTF), 57, 58 (answers: YYN), odd, 54 (answer: F), 57, 59 Two independent proportions on p37: odd, 637 0

Inferences About Two Population Proportions

Inferences About Two Population Proportions Inferences About Two Population Proportions MATH 130, Elements of Statistics I J. Robert Buchanan Department of Mathematics Fall 2018 Background Recall: for a single population the sampling proportion

More information

CIVL /8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8

CIVL /8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8 CIVL - 7904/8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8 Chi-square Test How to determine the interval from a continuous distribution I = Range 1 + 3.322(logN) I-> Range of the class interval

More information

Midterm 1 and 2 results

Midterm 1 and 2 results Midterm 1 and 2 results Midterm 1 Midterm 2 ------------------------------ Min. :40.00 Min. : 20.0 1st Qu.:60.00 1st Qu.:60.00 Median :75.00 Median :70.0 Mean :71.97 Mean :69.77 3rd Qu.:85.00 3rd Qu.:85.0

More information

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS In our work on hypothesis testing, we used the value of a sample statistic to challenge an accepted value of a population parameter. We focused only

More information

Two-Sample Inferential Statistics

Two-Sample Inferential Statistics The t Test for Two Independent Samples 1 Two-Sample Inferential Statistics In an experiment there are two or more conditions One condition is often called the control condition in which the treatment is

More information

Chapter 7 Comparison of two independent samples

Chapter 7 Comparison of two independent samples Chapter 7 Comparison of two independent samples 7.1 Introduction Population 1 µ σ 1 1 N 1 Sample 1 y s 1 1 n 1 Population µ σ N Sample y s n 1, : population means 1, : population standard deviations N

More information

Chapter. Hypothesis Testing with Two Samples. Copyright 2015, 2012, and 2009 Pearson Education, Inc. 1

Chapter. Hypothesis Testing with Two Samples. Copyright 2015, 2012, and 2009 Pearson Education, Inc. 1 Chapter 8 Hypothesis Testing with Two Samples Copyright 2015, 2012, and 2009 Pearson Education, Inc 1 Two Sample Hypothesis Test Compares two parameters from two populations Sampling methods: Independent

More information

5 Basic Steps in Any Hypothesis Test

5 Basic Steps in Any Hypothesis Test 5 Basic Steps in Any Hypothesis Test Step 1: Determine hypotheses (H0 and Ha). H0: μ d = 0 (μ 1 μ 2 =0) Ha: μ d > 0 (μ 1 μ 2 >0) upper-sided Ha: : μ d 0 (μ 1 μ 2 0) two-sided Step 2: Verify necessary conditions,

More information

Ch. 7. One sample hypothesis tests for µ and σ

Ch. 7. One sample hypothesis tests for µ and σ Ch. 7. One sample hypothesis tests for µ and σ Prof. Tesler Math 18 Winter 2019 Prof. Tesler Ch. 7: One sample hypoth. tests for µ, σ Math 18 / Winter 2019 1 / 23 Introduction Data Consider the SAT math

More information

Lecture 10: Comparing two populations: proportions

Lecture 10: Comparing two populations: proportions Lecture 10: Comparing two populations: proportions Problem: Compare two sets of sample data: e.g. is the proportion of As in this semester 152 the same as last Fall? Methods: Extend the methods introduced

More information

Problem Set 4 - Solutions

Problem Set 4 - Solutions Problem Set 4 - Solutions Econ-310, Spring 004 8. a. If we wish to test the research hypothesis that the mean GHQ score for all unemployed men exceeds 10, we test: H 0 : µ 10 H a : µ > 10 This is a one-tailed

More information

Psychology 282 Lecture #4 Outline Inferences in SLR

Psychology 282 Lecture #4 Outline Inferences in SLR Psychology 282 Lecture #4 Outline Inferences in SLR Assumptions To this point we have not had to make any distributional assumptions. Principle of least squares requires no assumptions. Can use correlations

More information

Hypothesis testing: Steps

Hypothesis testing: Steps Review for Exam 2 Hypothesis testing: Steps Exam 2 Review 1. Determine appropriate test and hypotheses 2. Use distribution table to find critical statistic value(s) representing rejection region 3. Compute

More information

7 Estimation. 7.1 Population and Sample (P.91-92)

7 Estimation. 7.1 Population and Sample (P.91-92) 7 Estimation MATH1015 Biostatistics Week 7 7.1 Population and Sample (P.91-92) Suppose that we wish to study a particular health problem in Australia, for example, the average serum cholesterol level for

More information

AMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015

AMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015 AMS7: WEEK 7. CLASS 1 More on Hypothesis Testing Monday May 11th, 2015 Testing a Claim about a Standard Deviation or a Variance We want to test claims about or 2 Example: Newborn babies from mothers taking

More information

Chapter 7: Statistical Inference (Two Samples)

Chapter 7: Statistical Inference (Two Samples) Chapter 7: Statistical Inference (Two Samples) Shiwen Shen University of South Carolina 2016 Fall Section 003 1 / 41 Motivation of Inference on Two Samples Until now we have been mainly interested in a

More information

Class 24. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

Class 24. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700 Class 4 Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science Copyright 013 by D.B. Rowe 1 Agenda: Recap Chapter 9. and 9.3 Lecture Chapter 10.1-10.3 Review Exam 6 Problem Solving

More information

Exam 2 (KEY) July 20, 2009

Exam 2 (KEY) July 20, 2009 STAT 2300 Business Statistics/Summer 2009, Section 002 Exam 2 (KEY) July 20, 2009 Name: USU A#: Score: /225 Directions: This exam consists of six (6) questions, assessing material learned within Modules

More information

ME3620. Theory of Engineering Experimentation. Spring Chapter IV. Decision Making for a Single Sample. Chapter IV

ME3620. Theory of Engineering Experimentation. Spring Chapter IV. Decision Making for a Single Sample. Chapter IV Theory of Engineering Experimentation Chapter IV. Decision Making for a Single Sample Chapter IV 1 4 1 Statistical Inference The field of statistical inference consists of those methods used to make decisions

More information

Confidence Intervals, Testing and ANOVA Summary

Confidence Intervals, Testing and ANOVA Summary Confidence Intervals, Testing and ANOVA Summary 1 One Sample Tests 1.1 One Sample z test: Mean (σ known) Let X 1,, X n a r.s. from N(µ, σ) or n > 30. Let The test statistic is H 0 : µ = µ 0. z = x µ 0

More information

What is Experimental Design?

What is Experimental Design? One Factor ANOVA What is Experimental Design? A designed experiment is a test in which purposeful changes are made to the input variables (x) so that we may observe and identify the reasons for change

More information

Chapter 9. Inferences from Two Samples. Objective. Notation. Section 9.2. Definition. Notation. q = 1 p. Inferences About Two Proportions

Chapter 9. Inferences from Two Samples. Objective. Notation. Section 9.2. Definition. Notation. q = 1 p. Inferences About Two Proportions Chapter 9 Inferences from Two Samples 9. Inferences About Two Proportions 9.3 Inferences About Two s (Independent) 9.4 Inferences About Two s (Matched Pairs) 9.5 Comparing Variation in Two Samples Objective

More information

Lecture 26 Section 8.4. Wed, Oct 14, 2009

Lecture 26 Section 8.4. Wed, Oct 14, 2009 PDFs n = Lecture 26 Section 8.4 Hampden-Sydney College Wed, Oct 14, 2009 Outline PDFs n = 1 2 PDFs n = 3 4 5 6 Outline PDFs n = 1 2 PDFs n = 3 4 5 6 PDFs n = Exercise 8.12, page 528. Suppose that 60% of

More information

For tests of hypotheses made about population, we consider. only tests having a hypothesis of. Note that under the assumption of

For tests of hypotheses made about population, we consider. only tests having a hypothesis of. Note that under the assumption of CONFIDENCE INTERVAL ESTIMATE OF p 1 p 2 The confidence interval estimate of the is: where the of is given by Rounding: Round the confidence interval limits to significant digits. HYPOTHESIS TESTS For tests

More information

Statistics 251: Statistical Methods

Statistics 251: Statistical Methods Statistics 251: Statistical Methods 1-sample Hypothesis Tests Module 9 2018 Introduction We have learned about estimating parameters by point estimation and interval estimation (specifically confidence

More information

Inference for Distributions Inference for the Mean of a Population. Section 7.1

Inference for Distributions Inference for the Mean of a Population. Section 7.1 Inference for Distributions Inference for the Mean of a Population Section 7.1 Statistical inference in practice Emphasis turns from statistical reasoning to statistical practice: Population standard deviation,

More information

Analysis of Variance (ANOVA)

Analysis of Variance (ANOVA) Analysis of Variance (ANOVA) Much of statistical inference centers around the ability to distinguish between two or more groups in terms of some underlying response variable y. Sometimes, there are but

More information

Hypothesis testing: Steps

Hypothesis testing: Steps Review for Exam 2 Hypothesis testing: Steps Repeated-Measures ANOVA 1. Determine appropriate test and hypotheses 2. Use distribution table to find critical statistic value(s) representing rejection region

More information

Testing a Claim about the Difference in 2 Population Means Independent Samples. (there is no difference in Population Means µ 1 µ 2 = 0) against

Testing a Claim about the Difference in 2 Population Means Independent Samples. (there is no difference in Population Means µ 1 µ 2 = 0) against Section 9 2A Lecture Testing a Claim about the Difference i Population Means Independent Samples Test H 0 : µ 1 = µ 2 (there is no difference in Population Means µ 1 µ 2 = 0) against H 1 : µ 1 > µ 2 or

More information

Difference between means - t-test /25

Difference between means - t-test /25 Difference between means - t-test 1 Discussion Question p492 Ex 9-4 p492 1-3, 6-8, 12 Assume all variances are not equal. Ignore the test for variance. 2 Students will perform hypothesis tests for two

More information

10.1. Comparing Two Proportions. Section 10.1

10.1. Comparing Two Proportions. Section 10.1 /6/04 0. Comparing Two Proportions Sectio0. Comparing Two Proportions After this section, you should be able to DETERMINE whether the conditions for performing inference are met. CONSTRUCT and INTERPRET

More information

7.2 One-Sample Correlation ( = a) Introduction. Correlation analysis measures the strength and direction of association between

7.2 One-Sample Correlation ( = a) Introduction. Correlation analysis measures the strength and direction of association between 7.2 One-Sample Correlation ( = a) Introduction Correlation analysis measures the strength and direction of association between variables. In this chapter we will test whether the population correlation

More information

STAT Chapter 8: Hypothesis Tests

STAT Chapter 8: Hypothesis Tests STAT 515 -- Chapter 8: Hypothesis Tests CIs are possibly the most useful forms of inference because they give a range of reasonable values for a parameter. But sometimes we want to know whether one particular

More information

Purposes of Data Analysis. Variables and Samples. Parameters and Statistics. Part 1: Probability Distributions

Purposes of Data Analysis. Variables and Samples. Parameters and Statistics. Part 1: Probability Distributions Part 1: Probability Distributions Purposes of Data Analysis True Distributions or Relationships in the Earths System Probability Distribution Normal Distribution Student-t Distribution Chi Square Distribution

More information

STAT Chapter 9: Two-Sample Problems. Paired Differences (Section 9.3)

STAT Chapter 9: Two-Sample Problems. Paired Differences (Section 9.3) STAT 515 -- Chapter 9: Two-Sample Problems Paired Differences (Section 9.3) Examples of Paired Differences studies: Similar subjects are paired off and one of two treatments is given to each subject in

More information

Lecture 5: ANOVA and Correlation

Lecture 5: ANOVA and Correlation Lecture 5: ANOVA and Correlation Ani Manichaikul amanicha@jhsph.edu 23 April 2007 1 / 62 Comparing Multiple Groups Continous data: comparing means Analysis of variance Binary data: comparing proportions

More information

Chapter 23. Inferences About Means. Monday, May 6, 13. Copyright 2009 Pearson Education, Inc.

Chapter 23. Inferences About Means. Monday, May 6, 13. Copyright 2009 Pearson Education, Inc. Chapter 23 Inferences About Means Sampling Distributions of Means Now that we know how to create confidence intervals and test hypotheses about proportions, we do the same for means. Just as we did before,

More information

Discrete Multivariate Statistics

Discrete Multivariate Statistics Discrete Multivariate Statistics Univariate Discrete Random variables Let X be a discrete random variable which, in this module, will be assumed to take a finite number of t different values which are

More information

Lecture 28 Chi-Square Analysis

Lecture 28 Chi-Square Analysis Lecture 28 STAT 225 Introduction to Probability Models April 23, 2014 Whitney Huang Purdue University 28.1 χ 2 test for For a given contingency table, we want to test if two have a relationship or not

More information

Section 9.4. Notation. Requirements. Definition. Inferences About Two Means (Matched Pairs) Examples

Section 9.4. Notation. Requirements. Definition. Inferences About Two Means (Matched Pairs) Examples Objective Section 9.4 Inferences About Two Means (Matched Pairs) Compare of two matched-paired means using two samples from each population. Hypothesis Tests and Confidence Intervals of two dependent means

More information

Lecture 9. Selected material from: Ch. 12 The analysis of categorical data and goodness of fit tests

Lecture 9. Selected material from: Ch. 12 The analysis of categorical data and goodness of fit tests Lecture 9 Selected material from: Ch. 12 The analysis of categorical data and goodness of fit tests Univariate categorical data Univariate categorical data are best summarized in a one way frequency table.

More information

MAT 2379, Introduction to Biostatistics, Sample Calculator Questions 1. MAT 2379, Introduction to Biostatistics

MAT 2379, Introduction to Biostatistics, Sample Calculator Questions 1. MAT 2379, Introduction to Biostatistics MAT 2379, Introduction to Biostatistics, Sample Calculator Questions 1 MAT 2379, Introduction to Biostatistics Sample Calculator Problems for the Final Exam Note: The exam will also contain some problems

More information

Chapter 10: STATISTICAL INFERENCE FOR TWO SAMPLES. Part 1: Hypothesis tests on a µ 1 µ 2 for independent groups

Chapter 10: STATISTICAL INFERENCE FOR TWO SAMPLES. Part 1: Hypothesis tests on a µ 1 µ 2 for independent groups Chapter 10: STATISTICAL INFERENCE FOR TWO SAMPLES Part 1: Hypothesis tests on a µ 1 µ 2 for independent groups Sections 10-1 & 10-2 Independent Groups It is common to compare two groups, and do a hypothesis

More information

Sampling Distributions: Central Limit Theorem

Sampling Distributions: Central Limit Theorem Review for Exam 2 Sampling Distributions: Central Limit Theorem Conceptually, we can break up the theorem into three parts: 1. The mean (µ M ) of a population of sample means (M) is equal to the mean (µ)

More information

LECTURE 12 CONFIDENCE INTERVAL AND HYPOTHESIS TESTING

LECTURE 12 CONFIDENCE INTERVAL AND HYPOTHESIS TESTING LECTURE 1 CONFIDENCE INTERVAL AND HYPOTHESIS TESTING INTERVAL ESTIMATION Point estimation of : The inference is a guess of a single value as the value of. No accuracy associated with it. Interval estimation

More information

Chapter 7. Inference for Distributions. Introduction to the Practice of STATISTICS SEVENTH. Moore / McCabe / Craig. Lecture Presentation Slides

Chapter 7. Inference for Distributions. Introduction to the Practice of STATISTICS SEVENTH. Moore / McCabe / Craig. Lecture Presentation Slides Chapter 7 Inference for Distributions Introduction to the Practice of STATISTICS SEVENTH EDITION Moore / McCabe / Craig Lecture Presentation Slides Chapter 7 Inference for Distributions 7.1 Inference for

More information

Hypothesis testing. Data to decisions

Hypothesis testing. Data to decisions Hypothesis testing Data to decisions The idea Null hypothesis: H 0 : the DGP/population has property P Under the null, a sample statistic has a known distribution If, under that that distribution, the

More information

Difference Between Pair Differences v. 2 Samples

Difference Between Pair Differences v. 2 Samples 1 Sectio1.1 Comparing Two Proportions Learning Objectives After this section, you should be able to DETERMINE whether the conditions for performing inference are met. CONSTRUCT and INTERPRET a confidence

More information

Sleep data, two drugs Ch13.xls

Sleep data, two drugs Ch13.xls Model Based Statistics in Biology. Part IV. The General Linear Mixed Model.. Chapter 13.3 Fixed*Random Effects (Paired t-test) ReCap. Part I (Chapters 1,2,3,4), Part II (Ch 5, 6, 7) ReCap Part III (Ch

More information

Review: General Approach to Hypothesis Testing. 1. Define the research question and formulate the appropriate null and alternative hypotheses.

Review: General Approach to Hypothesis Testing. 1. Define the research question and formulate the appropriate null and alternative hypotheses. 1 Review: Let X 1, X,..., X n denote n independent random variables sampled from some distribution might not be normal!) with mean µ) and standard deviation σ). Then X µ σ n In other words, X is approximately

More information

STATISTICS 141 Final Review

STATISTICS 141 Final Review STATISTICS 141 Final Review Bin Zou bzou@ualberta.ca Department of Mathematical & Statistical Sciences University of Alberta Winter 2015 Bin Zou (bzou@ualberta.ca) STAT 141 Final Review Winter 2015 1 /

More information

CHAPTER 9, 10. Similar to a courtroom trial. In trying a person for a crime, the jury needs to decide between one of two possibilities:

CHAPTER 9, 10. Similar to a courtroom trial. In trying a person for a crime, the jury needs to decide between one of two possibilities: CHAPTER 9, 10 Hypothesis Testing Similar to a courtroom trial. In trying a person for a crime, the jury needs to decide between one of two possibilities: The person is guilty. The person is innocent. To

More information

hypotheses. P-value Test for a 2 Sample z-test (Large Independent Samples) n > 30 P-value Test for a 2 Sample t-test (Small Samples) n < 30 Identify α

hypotheses. P-value Test for a 2 Sample z-test (Large Independent Samples) n > 30 P-value Test for a 2 Sample t-test (Small Samples) n < 30 Identify α Chapter 8 Notes Section 8-1 Independent and Dependent Samples Independent samples have no relation to each other. An example would be comparing the costs of vacationing in Florida to the cost of vacationing

More information

Chapter 26: Comparing Counts (Chi Square)

Chapter 26: Comparing Counts (Chi Square) Chapter 6: Comparing Counts (Chi Square) We ve seen that you can turn a qualitative variable into a quantitative one (by counting the number of successes and failures), but that s a compromise it forces

More information

Epidemiology Principles of Biostatistics Chapter 10 - Inferences about two populations. John Koval

Epidemiology Principles of Biostatistics Chapter 10 - Inferences about two populations. John Koval Epidemiology 9509 Principles of Biostatistics Chapter 10 - Inferences about John Koval Department of Epidemiology and Biostatistics University of Western Ontario What is being covered 1. differences in

More information

Population Variance. Concepts from previous lectures. HUMBEHV 3HB3 one-sample t-tests. Week 8

Population Variance. Concepts from previous lectures. HUMBEHV 3HB3 one-sample t-tests. Week 8 Concepts from previous lectures HUMBEHV 3HB3 one-sample t-tests Week 8 Prof. Patrick Bennett sampling distributions - sampling error - standard error of the mean - degrees-of-freedom Null and alternative/research

More information

Independent Samples t tests. Background for Independent Samples t test

Independent Samples t tests. Background for Independent Samples t test Independent Samples t tests Dr. J. Kyle Roberts Southern Methodist University Simmons School of Education and Human Development Department of Teaching and Learning Background for Independent Samples t

More information

CHAPTER 4 Analysis of Variance. One-way ANOVA Two-way ANOVA i) Two way ANOVA without replication ii) Two way ANOVA with replication

CHAPTER 4 Analysis of Variance. One-way ANOVA Two-way ANOVA i) Two way ANOVA without replication ii) Two way ANOVA with replication CHAPTER 4 Analysis of Variance One-way ANOVA Two-way ANOVA i) Two way ANOVA without replication ii) Two way ANOVA with replication 1 Introduction In this chapter, expand the idea of hypothesis tests. We

More information

Testing Independence

Testing Independence Testing Independence Dipankar Bandyopadhyay Department of Biostatistics, Virginia Commonwealth University BIOS 625: Categorical Data & GLM 1/50 Testing Independence Previously, we looked at RR = OR = 1

More information

Hypothesis Testing Problem. TMS-062: Lecture 5 Hypotheses Testing. Alternative Hypotheses. Test Statistic

Hypothesis Testing Problem. TMS-062: Lecture 5 Hypotheses Testing. Alternative Hypotheses. Test Statistic Hypothesis Testing Problem TMS-062: Lecture 5 Hypotheses Testing Same basic situation as befe: Data: random i. i. d. sample X 1,..., X n from a population and we wish to draw inference about unknown population

More information

Inference for Distributions Inference for the Mean of a Population

Inference for Distributions Inference for the Mean of a Population Inference for Distributions Inference for the Mean of a Population PBS Chapter 7.1 009 W.H Freeman and Company Objectives (PBS Chapter 7.1) Inference for the mean of a population The t distributions The

More information

Performance Evaluation and Comparison

Performance Evaluation and Comparison Outline Hong Chang Institute of Computing Technology, Chinese Academy of Sciences Machine Learning Methods (Fall 2012) Outline Outline I 1 Introduction 2 Cross Validation and Resampling 3 Interval Estimation

More information

Two sided, two sample t-tests. a) IQ = 100 b) Average height for men = c) Average number of white blood cells per cubic millimeter is 7,000.

Two sided, two sample t-tests. a) IQ = 100 b) Average height for men = c) Average number of white blood cells per cubic millimeter is 7,000. Two sided, two sample t-tests. I. Brief review: 1) We are interested in how a sample compares to some pre-conceived notion. For example: a) IQ = 100 b) Average height for men = 5 10. c) Average number

More information

Inference for Regression Inference about the Regression Model and Using the Regression Line, with Details. Section 10.1, 2, 3

Inference for Regression Inference about the Regression Model and Using the Regression Line, with Details. Section 10.1, 2, 3 Inference for Regression Inference about the Regression Model and Using the Regression Line, with Details Section 10.1, 2, 3 Basic components of regression setup Target of inference: linear dependency

More information

An inferential procedure to use sample data to understand a population Procedures

An inferential procedure to use sample data to understand a population Procedures Hypothesis Test An inferential procedure to use sample data to understand a population Procedures Hypotheses, the alpha value, the critical region (z-scores), statistics, conclusion Two types of errors

More information

Hypothesis testing for µ:

Hypothesis testing for µ: University of California, Los Angeles Department of Statistics Statistics 10 Elements of a hypothesis test: Hypothesis testing Instructor: Nicolas Christou 1. Null hypothesis, H 0 (always =). 2. Alternative

More information

Comparison of Two Population Means

Comparison of Two Population Means Comparison of Two Population Means Esra Akdeniz March 15, 2015 Independent versus Dependent (paired) Samples We have independent samples if we perform an experiment in two unrelated populations. We have

More information

Questions 3.83, 6.11, 6.12, 6.17, 6.25, 6.29, 6.33, 6.35, 6.50, 6.51, 6.53, 6.55, 6.59, 6.60, 6.65, 6.69, 6.70, 6.77, 6.79, 6.89, 6.

Questions 3.83, 6.11, 6.12, 6.17, 6.25, 6.29, 6.33, 6.35, 6.50, 6.51, 6.53, 6.55, 6.59, 6.60, 6.65, 6.69, 6.70, 6.77, 6.79, 6.89, 6. Chapter 7 Reading 7.1, 7.2 Questions 3.83, 6.11, 6.12, 6.17, 6.25, 6.29, 6.33, 6.35, 6.50, 6.51, 6.53, 6.55, 6.59, 6.60, 6.65, 6.69, 6.70, 6.77, 6.79, 6.89, 6.112 Introduction In Chapter 5 and 6, we emphasized

More information

Inferential statistics

Inferential statistics Inferential statistics Inference involves making a Generalization about a larger group of individuals on the basis of a subset or sample. Ahmed-Refat-ZU Null and alternative hypotheses In hypotheses testing,

More information

Sampling Distribution of a Sample Proportion

Sampling Distribution of a Sample Proportion Sampling Distribution of a Sample Proportion Lecture 26 Section 8.4 Robb T. Koether Hampden-Sydney College Mon, Mar 1, 2010 Robb T. Koether (Hampden-Sydney College) Sampling Distribution of a Sample Proportion

More information

Inferences About Two Proportions

Inferences About Two Proportions Inferences About Two Proportions Quantitative Methods II Plan for Today Sampling two populations Confidence intervals for differences of two proportions Testing the difference of proportions Examples 1

More information

The t-statistic. Student s t Test

The t-statistic. Student s t Test The t-statistic 1 Student s t Test When the population standard deviation is not known, you cannot use a z score hypothesis test Use Student s t test instead Student s t, or t test is, conceptually, very

More information

Statistics for IT Managers

Statistics for IT Managers Statistics for IT Managers 95-796, Fall 2012 Module 2: Hypothesis Testing and Statistical Inference (5 lectures) Reading: Statistics for Business and Economics, Ch. 5-7 Confidence intervals Given the sample

More information

Lecture Slides. Elementary Statistics Eleventh Edition. by Mario F. Triola. and the Triola Statistics Series 9.1-1

Lecture Slides. Elementary Statistics Eleventh Edition. by Mario F. Triola. and the Triola Statistics Series 9.1-1 Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by Mario F. Triola Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. 9.1-1 Chapter 9 Inferences

More information

Ch18 links / ch18 pdf links Ch18 image t-dist table

Ch18 links / ch18 pdf links Ch18 image t-dist table Ch18 links / ch18 pdf links Ch18 image t-dist table ch18 (inference about population mean) exercises: 18.3, 18.5, 18.7, 18.9, 18.15, 18.17, 18.19, 18.27 CHAPTER 18: Inference about a Population Mean The

More information

df=degrees of freedom = n - 1

df=degrees of freedom = n - 1 One sample t-test test of the mean Assumptions: Independent, random samples Approximately normal distribution (from intro class: σ is unknown, need to calculate and use s (sample standard deviation)) Hypotheses:

More information

Contingency Tables. Safety equipment in use Fatal Non-fatal Total. None 1, , ,128 Seat belt , ,878

Contingency Tables. Safety equipment in use Fatal Non-fatal Total. None 1, , ,128 Seat belt , ,878 Contingency Tables I. Definition & Examples. A) Contingency tables are tables where we are looking at two (or more - but we won t cover three or more way tables, it s way too complicated) factors, each

More information

Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami

Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami Parametric Assumptions The observations must be independent. Dependent variable should be continuous

More information

Solution: First note that the power function of the test is given as follows,

Solution: First note that the power function of the test is given as follows, Problem 4.5.8: Assume the life of a tire given by X is distributed N(θ, 5000 ) Past experience indicates that θ = 30000. The manufacturere claims the tires made by a new process have mean θ > 30000. Is

More information

23. Inference for regression

23. Inference for regression 23. Inference for regression The Practice of Statistics in the Life Sciences Third Edition 2014 W. H. Freeman and Company Objectives (PSLS Chapter 23) Inference for regression The regression model Confidence

More information

Statistical Analysis for QBIC Genetics Adapted by Ellen G. Dow 2017

Statistical Analysis for QBIC Genetics Adapted by Ellen G. Dow 2017 Statistical Analysis for QBIC Genetics Adapted by Ellen G. Dow 2017 I. χ 2 or chi-square test Objectives: Compare how close an experimentally derived value agrees with an expected value. One method to

More information

Chapter 9 Inferences from Two Samples

Chapter 9 Inferences from Two Samples Chapter 9 Inferences from Two Samples 9-1 Review and Preview 9-2 Two Proportions 9-3 Two Means: Independent Samples 9-4 Two Dependent Samples (Matched Pairs) 9-5 Two Variances or Standard Deviations Review

More information

STA 101 Final Review

STA 101 Final Review STA 101 Final Review Statistics 101 Thomas Leininger June 24, 2013 Announcements All work (besides projects) should be returned to you and should be entered on Sakai. Office Hour: 2 3pm today (Old Chem

More information

DETERMINE whether the conditions for performing inference are met. CONSTRUCT and INTERPRET a confidence interval to compare two proportions.

DETERMINE whether the conditions for performing inference are met. CONSTRUCT and INTERPRET a confidence interval to compare two proportions. Section 0. Comparing Two Proportions Learning Objectives After this section, you should be able to DETERMINE whether the conditions for performing inference are met. CONSTRUCT and INTERPRET a confidence

More information

Chapters 4-6: Estimation

Chapters 4-6: Estimation Chapters 4-6: Estimation Read sections 4. (except 4..3), 4.5.1, 5.1 (except 5.1.5), 6.1 (except 6.1.3) Point Estimation (4.1.1) Point Estimator - A formula applied to a data set which results in a single

More information

1 Statistical inference for a population mean

1 Statistical inference for a population mean 1 Statistical inference for a population mean 1. Inference for a large sample, known variance Suppose X 1,..., X n represents a large random sample of data from a population with unknown mean µ and known

More information

Probability and Statistics

Probability and Statistics The big picture Probability and Statistics Sample Population 1) Data Collection Data ) Explanatory Data Analysis (EDA) Inference on Relationship Between two Variables 4) Inference 3) Probability The Big

More information

The Empirical Rule, z-scores, and the Rare Event Approach

The Empirical Rule, z-scores, and the Rare Event Approach Overview The Empirical Rule, z-scores, and the Rare Event Approach Look at Chebyshev s Rule and the Empirical Rule Explore some applications of the Empirical Rule How to calculate and use z-scores Introducing

More information

Inference for Regression Inference about the Regression Model and Using the Regression Line

Inference for Regression Inference about the Regression Model and Using the Regression Line Inference for Regression Inference about the Regression Model and Using the Regression Line PBS Chapter 10.1 and 10.2 2009 W.H. Freeman and Company Objectives (PBS Chapter 10.1 and 10.2) Inference about

More information

HYPOTHESIS TESTING II TESTS ON MEANS. Sorana D. Bolboacă

HYPOTHESIS TESTING II TESTS ON MEANS. Sorana D. Bolboacă HYPOTHESIS TESTING II TESTS ON MEANS Sorana D. Bolboacă OBJECTIVES Significance value vs p value Parametric vs non parametric tests Tests on means: 1 Dec 14 2 SIGNIFICANCE LEVEL VS. p VALUE Materials and

More information

FinalExamReview. Sta Fall Provided: Z, t and χ 2 tables

FinalExamReview. Sta Fall Provided: Z, t and χ 2 tables Final Exam FinalExamReview Sta 101 - Fall 2017 Duke University, Department of Statistical Science When: Wednesday, December 13 from 9:00am-12:00pm What to bring: Scientific calculator (graphing calculator

More information

HYPOTHESIS TESTING. Hypothesis Testing

HYPOTHESIS TESTING. Hypothesis Testing MBA 605 Business Analytics Don Conant, PhD. HYPOTHESIS TESTING Hypothesis testing involves making inferences about the nature of the population on the basis of observations of a sample drawn from the population.

More information

Business Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing

Business Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing Business Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing Agenda Introduction to Estimation Point estimation Interval estimation Introduction to Hypothesis Testing Concepts en terminology

More information

The Difference in Proportions Test

The Difference in Proportions Test Overview The Difference in Proportions Test Dr Tom Ilvento Department of Food and Resource Economics A Difference of Proportions test is based on large sample only Same strategy as for the mean We calculate

More information

MATH Notebook 3 Spring 2018

MATH Notebook 3 Spring 2018 MATH448001 Notebook 3 Spring 2018 prepared by Professor Jenny Baglivo c Copyright 2010 2018 by Jenny A. Baglivo. All Rights Reserved. 3 MATH448001 Notebook 3 3 3.1 One Way Layout........................................

More information

MATH Chapter 21 Notes Two Sample Problems

MATH Chapter 21 Notes Two Sample Problems MATH 1070 - Chapter 21 Notes Two Sample Problems Recall: So far, we have dealt with inference (confidence intervals and hypothesis testing) pertaining to: Single sample of data. A matched pairs design

More information

One sample problem. sample mean: ȳ = . sample variance: s 2 = sample standard deviation: s = s 2. y i n. i=1. i=1 (y i ȳ) 2 n 1

One sample problem. sample mean: ȳ = . sample variance: s 2 = sample standard deviation: s = s 2. y i n. i=1. i=1 (y i ȳ) 2 n 1 One sample problem Population mean E(y) = µ, variance: Var(y) = σ 2 = E(y µ) 2, standard deviation: σ = σ 2. Normal distribution: y N(µ, σ 2 ). Standard normal distribution: z N(0, 1). If y N(µ, σ 2 ),

More information

Quantitative Methods for Economics, Finance and Management (A86050 F86050)

Quantitative Methods for Economics, Finance and Management (A86050 F86050) Quantitative Methods for Economics, Finance and Management (A86050 F86050) Matteo Manera matteo.manera@unimib.it Marzio Galeotti marzio.galeotti@unimi.it 1 This material is taken and adapted from Guy Judge

More information

Introduction to Statistical Data Analysis III

Introduction to Statistical Data Analysis III Introduction to Statistical Data Analysis III JULY 2011 Afsaneh Yazdani Preface Major branches of Statistics: - Descriptive Statistics - Inferential Statistics Preface What is Inferential Statistics? The

More information

Sampling Distribution of a Sample Proportion

Sampling Distribution of a Sample Proportion Sampling Distribution of a Sample Proportion Lecture 26 Section 8.4 Robb T. Koether Hampden-Sydney College Mon, Oct 10, 2011 Robb T. Koether (Hampden-Sydney College) Sampling Distribution of a Sample Proportion

More information