Chapter Eight: Assessment of Relationships 1/42

Size: px
Start display at page:

Download "Chapter Eight: Assessment of Relationships 1/42"

Transcription

1 Chapter Eight: Assessment of Relationships 1/42

2 8.1 Introduction 2/42 Background This chapter deals, primarily, with two topics. The Pearson product-moment correlation coefficient. The chi-square test for independence.

3 8.2 Pearson Product-Moment Correlation Coefficient 3/42 The Pearson Product-Moment Correlation Coefficient The conceptual equation for the P-M correlation coefficient is (x x) (y ȳ) r = [ ] [ (x x) 2 ] (y ȳ) 2 where r is the sample correlation coefficient, x and y are the two variables to be correlated and n is the number of paired observations.

4 8.2 Pearson Product-Moment Correlation Coefficient 4/42 The Pearson Product-Moment Correlation Coefficient The computational form of r is given by xy ( P x)( P y) n r = [ ] [ x 2 (P x) 2 ] y 2 (P y) 2 n n

5 8.2 Pearson Product-Moment Correlation Coefficient 5/42 Example Calculate the P-M correlation coefficient for the access and wellness scores provided here. Table: Wellness and access to medical care scores for 15 subjects. Subject Access Wellness Subject Access Wellness Number Score Score Number Score Score

6 8.2 Pearson Product-Moment Correlation Coefficient 6/42 Solution Letting access scores equal x and wellness scores equal y, we note that x = 99, y = 83, xy = 700, x 2 = 869 and y 2 = 575. By Equation 8.2 xy ( P x)( P y) n r = [ ] [ x 2 (P x) 2 ] y 2 (P y) 2 = = [ 869 (99)2 15 n 700 (99)(83) [215.60] [115.73] =.964 ] [575 (83)2 15 ] n

7 8.2 Pearson Product-Moment Correlation Coefficient 7/42 Information Provided By P-M The P-M coefficient provides two pieces of information about the relationship of x and y. Nature of the relationship. Strength of the relationship.

8 8.2 Pearson Product-Moment Correlation Coefficient 8/42 Nature Of The Relationship When P-M takes a positive value it indicates that high values of x tend to be associated with high values of y while low values of x tend to be associated with low values of y. When P-M takes a negative value it indicates that high values of x tend to be associated with low values of y while low values of x tend to be associated with high values of y.

9 Depiction Of Positive x y Relationship Figure: Bivariate plot of positively correlated health care access scores and wellness index scores Wellness Index Score Access Score 8.2 Pearson Product-Moment Correlation Coefficient 9/42

10 8.2 Pearson Product-Moment Correlation Coefficient 10/42 Depiction Of Negative x y Relationship Figure: Bivariate plot of negatively correlated percents of students on free or reduced lunch and percents using bicycle helmets in nine schools. Bicycle Helmet Wear Free and Reduced Lunch

11 8.2 Pearson Product-Moment Correlation Coefficient 11/42 Strength Of The Relationship P-M can take values between 1 and 1. The P-M is at maximum strength when it takes the value 1.0 or 1.0. P-M loses strength as it recedes toward zero. At zero, P-M is at minimum strength.

12 8.2 Pearson Product-Moment Correlation Coefficient 12/42 P-M= 1.0 A correlation coefficient of 1.0 means that each subject made exactly the same score on the two variables when scaling differences are eliminated by expressing the two variables in terms of z scores. If you use Equation 8.2 to correlate x and y in the table on the following slide, (#13)you will find that P-M= 1.0.

13 8.2 Pearson Product-Moment Correlation Coefficient 13/42 Data Set Where r = 1.0. Table: Data set where r = 1.0. Subject Number x y

14 8.2 Pearson Product-Moment Correlation Coefficient 14/42 Data From Slide #13 Converted To z Scores Table: x and y values from Slide #13 expressed as z scores. Subject Number z x z y

15 8.2 Pearson Product-Moment Correlation Coefficient 15/42 Plot Of x and y When r = 1.0 Because P-M assesses the degree to which x and y are linearly related, when r = 1.0 the bivariate plot of x and y shows that the individual points fall on a positively sloped line. This can be seen in the figure on the next slide (#16). When r is positive but less than 1.0, the plot suggests a linear relationship but does not constitute a line showing that there is not a perfect linear relationship.

16 8.2 Pearson Product-Moment Correlation Coefficient 16/42 Bivariate Plot For Data Where r = 1.0 Figure: Bivariate plot of data from slide 13 for which r = y x

17 8.2 Pearson Product-Moment Correlation Coefficient 17/42 P-M= 1.0 When r = 1.0, the magnitude of each subject s z score on the two variables is the same but is always opposite in sign. Thus, if a subject is 1.5 standard deviations above the mean on variable x she will be 1.5 standard deviation below the mean on variable y. If you use Equation 8.2 to correlate x and y in the table on the following slide, (#18)you will find that P-M= 1.0.

18 8.2 Pearson Product-Moment Correlation Coefficient 18/42 Data Set Where r = 1.0. Table: Data set where r = 1.0. Subject Number x y

19 8.2 Pearson Product-Moment Correlation Coefficient 19/42 Data From Slide #18 Converted To z Scores Table: x and y values from Slide #18 expressed as z scores. Subject Number z x z y

20 8.2 Pearson Product-Moment Correlation Coefficient 20/42 Plot Of x and y When r = 1.0 Because P-M assesses the degree to which x and y are linearly related, when r = 1.0 the bivariate plot of x and y shows that the individual points fall on a negatively sloped line. This can be seen in the figure on the next slide (#21). When r is negative but greater than 1.0, the plot suggests a linear relationship but does not constitute a line showing that there is not a perfect linear relationship.

21 8.2 Pearson Product-Moment Correlation Coefficient 21/42 Bivariate Plot For Data Where r = 1.0 Figure: Bivariate plot of data from slide 18 for which r = y x

22 8.2 Pearson Product-Moment Correlation Coefficient 22/42 Cause-Effect Relationships The fact that two variables are correlated should not be used as evidence that one variable causes the other. Two variables may be correlated because one variable causes the other. Two variables may be correlated without one of the two variables causing the other.

23 8.2 Pearson Product-Moment Correlation Coefficient 23/42 Test Of H 0 : ρ = 0 A test of H 0 : ρ = 0 may be carried out by means of the following test statistic. r t = 1 r 2 n 2 where r is the P-M correlation coefficient and n is the number of pairs of observations. The degrees of freedom for the test critical value is n 1.

24 8.2 Pearson Product-Moment Correlation Coefficient 24/42 Test Of H 0 : ρ = 0 (continued) Alternative hypotheses. Two-Tailed One-Tailed One-Tailed H A : ρ 0 H A : ρ < 0 H A : ρ > 0

25 8.2 Pearson Product-Moment Correlation Coefficient 25/42 Example The correlation between 15 pairs of wellness and access to medical care scores is.964. Use this information to test the hypothesis H 0 : ρ = 0 against the alternative H A : ρ > 0.

26 8.2 Pearson Product-Moment Correlation Coefficient 26/42 Solution Obtained t is t = r 1 r 2 n 2 = = Reference to Appendix B shows that critical t for a one-tailed test conducted at α =.05 with 13 degrees of freedom is Because obtained t exceeds this value, the null hypothesis is rejected. As a result, the researcher can be assured that the population correlation is greater than 0.

27 8.2 Pearson Product-Moment Correlation Coefficient 27/42 Test Of H 0 : ρ = ρ 0 A test of H 0 : ρ = ρ 0 may be carried out by means of the following test statistic. ( ) ( ).5 ln 1+r 1 r.5 ln 1+ρ0 1 ρ 0 Z = 1 n 3 Here, ln is the natural log, ρ 0 is the hypothesized value of the population correlation coefficient and n is the number of pairs. This statistic is approximately normally distributed so that the test may be conducted by reference to the normal curve.

28 8.2 Pearson Product-Moment Correlation Coefficient 28/42 Example A researcher knows that the correlation between a shortened form of the Attitude Toward Risky Sexual Behaviors Assessment Scale and a more elaborate form of the scale is.57. After modifying the shortened version and administering it along with the elaborate version to 18 subjects the researcher finds that the newly modified form correlates.71 with the elaborate form. Use this information to perform a two-tailed test of the null hypothesis H 0 : ρ =.57

29 8.2 Pearson Product-Moment Correlation Coefficient 29/42 Solution By Equation 8.5 Z = = = ( ).5 ln 1+r 1 r ( ).5 ln 1+ρ0 1 ρ 0 1 n 3 ( ).5 ln ln ( =.926 )

30 8.2 Pearson Product-Moment Correlation Coefficient 30/42 Solution (continued) From Appendix A it can be seen that the critical Z values for a two-tailed test conducted at α =.05 are 1.96 and Because obtained Z is between these two values, the null hypothesis is not rejected. This means that the researcher has been unable to demonstrate a change in correlation after modifying the short form of the scale.

31 8.2 Pearson Product-Moment Correlation Coefficient 31/42 CI For Estimation Of ρ A confidence interval for the estimation of ρ is provided by the following equations. L = U = (1 + F ) r + (1 F ) (1 + F ) + (1 F ) r (1 + F ) r (1 F ) (1 + F ) (1 F ) r (1) (2) In these equations r is the sample correlation coefficient and F is the appropriate value from Appendix C. The degrees of freedom for F are n 2 for both numerator and denominator degrees of freedom.

32 8.2 Pearson Product-Moment Correlation Coefficient 32/42 Example Use the Risky Sexual Behaviors Assessment Scale data alluded to previously to form a two-sided 95% confidence interval for the estimation of ρ. The observed correlation in that study was.71 which was obtained from data collected on 18 subjects. Use the resulting interval to perform a two-tailed test of H 0 : ρ =.57 at α =.05. How did you obtain your result?

33 8.2 Pearson Product-Moment Correlation Coefficient 33/42 Solution By Equations 8.6 and 8.7 and L = U = ( ).71 + (1 2.76) ( ) + (1 2.76).71 = =.363 ( ).71 (1 2.76) ( ) (1 2.76).71 = =.884 F = 2.76 was obtained by entering Appendix C for a two-sided confidence interval with numerator and denominator degrees of freedom of 18 2 = 16. The researcher can, therefore, be 95 percent confident that the population correlation coefficient lies between.363 and.884. The two-tailed hypothesis test would not be rejected because the hypothesized value of.57 lies between the two limits.

34 8.3 The Chi-Square Test For Independence 34/42 Chi-Square Test: Introduction The chi-square test for independence is used to test the null hypothesis that two discrete variables are independent against the alternative that they are not independent. The chi-square test for independence is simply a more general form of the 2 by k chi-square test.

35 8.3 The Chi-Square Test For Independence 35/42 The Test Statistic Obtained chi-square is calculated by by the following: χ 2 = [ ] (f o f e ) 2 all cells Here f o and f e are the observed and expected frequencies respectively. The f e are the numbers to be expected in each cell if the null hypothesis is true and are computed as follows. f e f e = (N R) (N C ) N Here N R is the row total for the cell whose expected frequency is being calculated and N C is the column total for the same cell.

36 8.3 The Chi-Square Test For Independence 36/42 The Test Statistic (continued) The degrees of freedom for the test statistic are computed by χ 2 df = (j 1) (k 1) where j and k are the number of rows and columns in the table respectively.

37 8.3 The Chi-Square Test For Independence 37/42 j By k Chi-Square Table Table: Depiction of a j by k chi-square table. Variable One Category Category Category Category One Two k Category f o11 f o12 f o1k One f e11 f e12 f e1k Category f o21 f o22 f o2k Variable Two f e21 f e22 f e2k Two Category Category f oj1 f oj2 f ojk j f ej1 f ej2 f ejk.

38 8.3 The Chi-Square Test For Independence 38/42 Example Suppose a survey is conducted in three rural counties to determine vaccination status against hepatitis B. It is found that in county one, 41 persons have been vaccinated, 126 have not been vaccinated and 452 do not know their status. In county two, 202 have been vaccinated, 210 have not been vaccinated and 440 do not know their status. In the last county 330 had been vaccinated, 614 had not been vaccinated, and 680 did not know their status. Use this data to perform a chi-square analysis. Interpret the results.

39 8.3 The Chi-Square Test For Independence 39/42 Solution The data are arranged for chi-square analysis as follows. Table: Data from survey arranged for chi-square analysis. Vaccinated Not Unknown Vaccinated County [41] [126] [452] 619 One (114.60) (190.00) ( County [202] [210] [440] 852 Two (157.74) (261.52) (432.74) County [330] [614] [680] 1624 Three (300.66) (498.48) (824.86) N = 3095

40 8.3 The Chi-Square Test For Independence 40/42 Solution (continued) Calculations of expected values for the cells in the first row are as follows. f e11 = f e12 = f e13 = ( NOne ) ( NV ) = N ( ) ( ) NOne NNV N ( ) ( ) NOne NU N = = (619) (573) 3095 (619) (950) 3095 (619) (1572) 3095 = = =

41 8.3 The Chi-Square Test For Independence 41/42 Solution (continued) Obtained chi-square is then χ 2 = [ ] (f o f e ) 2 all cells f e ( )2 ( )2 ( )2 = ( )2 ( )2 ( ) ( )2 ( )2 ( ) = =

42 8.3 The Chi-Square Test For Independence 42/42 Solution (continued) The critical value is obtained by entering Appendix D with (j 1) (k 1) = (3 1) (3 1) = 4 degrees of freedom Appendix D shows that for α =.05 and four degrees of freedom, critical χ 2 is Because is greater than 9.488, the null hypothesis is rejected. We conclude that vaccination status and county of residence are not independent. This means that vaccination status depends on the county of residence. Said yet another way, the proportions of vaccinated, not vaccinated and unknown status, persons in the three counties are not the same.

7.2 One-Sample Correlation ( = a) Introduction. Correlation analysis measures the strength and direction of association between

7.2 One-Sample Correlation ( = a) Introduction. Correlation analysis measures the strength and direction of association between 7.2 One-Sample Correlation ( = a) Introduction Correlation analysis measures the strength and direction of association between variables. In this chapter we will test whether the population correlation

More information

Statistics Introductory Correlation

Statistics Introductory Correlation Statistics Introductory Correlation Session 10 oscardavid.barrerarodriguez@sciencespo.fr April 9, 2018 Outline 1 Statistics are not used only to describe central tendency and variability for a single variable.

More information

THE PEARSON CORRELATION COEFFICIENT

THE PEARSON CORRELATION COEFFICIENT CORRELATION Two variables are said to have a relation if knowing the value of one variable gives you information about the likely value of the second variable this is known as a bivariate relation There

More information

Can you tell the relationship between students SAT scores and their college grades?

Can you tell the relationship between students SAT scores and their college grades? Correlation One Challenge Can you tell the relationship between students SAT scores and their college grades? A: The higher SAT scores are, the better GPA may be. B: The higher SAT scores are, the lower

More information

UNIT 4 RANK CORRELATION (Rho AND KENDALL RANK CORRELATION

UNIT 4 RANK CORRELATION (Rho AND KENDALL RANK CORRELATION UNIT 4 RANK CORRELATION (Rho AND KENDALL RANK CORRELATION Structure 4.0 Introduction 4.1 Objectives 4. Rank-Order s 4..1 Rank-order data 4.. Assumptions Underlying Pearson s r are Not Satisfied 4.3 Spearman

More information

Part III: Unstructured Data

Part III: Unstructured Data Inf1-DA 2010 2011 III: 51 / 89 Part III Unstructured Data Data Retrieval: III.1 Unstructured data and data retrieval Statistical Analysis of Data: III.2 Data scales and summary statistics III.3 Hypothesis

More information

AMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015

AMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015 AMS7: WEEK 7. CLASS 1 More on Hypothesis Testing Monday May 11th, 2015 Testing a Claim about a Standard Deviation or a Variance We want to test claims about or 2 Example: Newborn babies from mothers taking

More information

Correlation 1. December 4, HMS, 2017, v1.1

Correlation 1. December 4, HMS, 2017, v1.1 Correlation 1 December 4, 2017 1 HMS, 2017, v1.1 Chapter References Diez: Chapter 7 Navidi, Chapter 7 I don t expect you to learn the proofs what will follow. Chapter References 2 Correlation The sample

More information

Upon completion of this chapter, you should be able to:

Upon completion of this chapter, you should be able to: 1 Chaptter 7:: CORRELATIION Upon completion of this chapter, you should be able to: Explain the concept of relationship between variables Discuss the use of the statistical tests to determine correlation

More information

Inferences for Correlation

Inferences for Correlation Inferences for Correlation Quantitative Methods II Plan for Today Recall: correlation coefficient Bivariate normal distributions Hypotheses testing for population correlation Confidence intervals for population

More information

Chapter Seven: Multi-Sample Methods 1/52

Chapter Seven: Multi-Sample Methods 1/52 Chapter Seven: Multi-Sample Methods 1/52 7.1 Introduction 2/52 Introduction The independent samples t test and the independent samples Z test for a difference between proportions are designed to analyze

More information

HYPOTHESIS TESTING: THE CHI-SQUARE STATISTIC

HYPOTHESIS TESTING: THE CHI-SQUARE STATISTIC 1 HYPOTHESIS TESTING: THE CHI-SQUARE STATISTIC 7 steps of Hypothesis Testing 1. State the hypotheses 2. Identify level of significant 3. Identify the critical values 4. Calculate test statistics 5. Compare

More information

Inferential statistics

Inferential statistics Inferential statistics Inference involves making a Generalization about a larger group of individuals on the basis of a subset or sample. Ahmed-Refat-ZU Null and alternative hypotheses In hypotheses testing,

More information

POLI 443 Applied Political Research

POLI 443 Applied Political Research POLI 443 Applied Political Research Session 4 Tests of Hypotheses The Normal Curve Lecturer: Prof. A. Essuman-Johnson, Dept. of Political Science Contact Information: aessuman-johnson@ug.edu.gh College

More information

Black White Total Observed Expected χ 2 = (f observed f expected ) 2 f expected (83 126) 2 ( )2 126

Black White Total Observed Expected χ 2 = (f observed f expected ) 2 f expected (83 126) 2 ( )2 126 Psychology 60 Fall 2013 Practice Final Actual Exam: This Wednesday. Good luck! Name: To view the solutions, check the link at the end of the document. This practice final should supplement your studying;

More information

Correlation Analysis

Correlation Analysis Simple Regression Correlation Analysis Correlation analysis is used to measure strength of the association (linear relationship) between two variables Correlation is only concerned with strength of the

More information

Hypothesis Testing hypothesis testing approach

Hypothesis Testing hypothesis testing approach Hypothesis Testing In this case, we d be trying to form an inference about that neighborhood: Do people there shop more often those people who are members of the larger population To ascertain this, we

More information

Statistical Analysis for QBIC Genetics Adapted by Ellen G. Dow 2017

Statistical Analysis for QBIC Genetics Adapted by Ellen G. Dow 2017 Statistical Analysis for QBIC Genetics Adapted by Ellen G. Dow 2017 I. χ 2 or chi-square test Objectives: Compare how close an experimentally derived value agrees with an expected value. One method to

More information

10.4 Hypothesis Testing: Two Independent Samples Proportion

10.4 Hypothesis Testing: Two Independent Samples Proportion 10.4 Hypothesis Testing: Two Independent Samples Proportion Example 3: Smoking cigarettes has been known to cause cancer and other ailments. One politician believes that a higher tax should be imposed

More information

Inferences about central values (.)

Inferences about central values (.) Inferences about central values (.) ]µnormal., 5 # Inferences about. using data: C", C#,..., C8 (collected as a random sample) Point estimate How good is the estimate?.s œc 1 œ C" C# âc8 8 Confidence interval

More information

Quantitative Analysis and Empirical Methods

Quantitative Analysis and Empirical Methods Hypothesis testing Sciences Po, Paris, CEE / LIEPP Introduction Hypotheses Procedure of hypothesis testing Two-tailed and one-tailed tests Statistical tests with categorical variables A hypothesis A testable

More information

Section 9.4. Notation. Requirements. Definition. Inferences About Two Means (Matched Pairs) Examples

Section 9.4. Notation. Requirements. Definition. Inferences About Two Means (Matched Pairs) Examples Objective Section 9.4 Inferences About Two Means (Matched Pairs) Compare of two matched-paired means using two samples from each population. Hypothesis Tests and Confidence Intervals of two dependent means

More information

9 Correlation and Regression

9 Correlation and Regression 9 Correlation and Regression SW, Chapter 12. Suppose we select n = 10 persons from the population of college seniors who plan to take the MCAT exam. Each takes the test, is coached, and then retakes the

More information

INTERVAL ESTIMATION AND HYPOTHESES TESTING

INTERVAL ESTIMATION AND HYPOTHESES TESTING INTERVAL ESTIMATION AND HYPOTHESES TESTING 1. IDEA An interval rather than a point estimate is often of interest. Confidence intervals are thus important in empirical work. To construct interval estimates,

More information

STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots. March 8, 2015

STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots. March 8, 2015 STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots March 8, 2015 The duality between CI and hypothesis testing The duality between CI and hypothesis

More information

Review of Statistics 101

Review of Statistics 101 Review of Statistics 101 We review some important themes from the course 1. Introduction Statistics- Set of methods for collecting/analyzing data (the art and science of learning from data). Provides methods

More information

Correlation and Regression Analysis. Linear Regression and Correlation. Correlation and Linear Regression. Three Questions.

Correlation and Regression Analysis. Linear Regression and Correlation. Correlation and Linear Regression. Three Questions. 10/8/18 Correlation and Regression Analysis Correlation Analysis is the study of the relationship between variables. It is also defined as group of techniques to measure the association between two variables.

More information

Review 6. n 1 = 85 n 2 = 75 x 1 = x 2 = s 1 = 38.7 s 2 = 39.2

Review 6. n 1 = 85 n 2 = 75 x 1 = x 2 = s 1 = 38.7 s 2 = 39.2 Review 6 Use the traditional method to test the given hypothesis. Assume that the samples are independent and that they have been randomly selected ) A researcher finds that of,000 people who said that

More information

Chapte The McGraw-Hill Companies, Inc. All rights reserved.

Chapte The McGraw-Hill Companies, Inc. All rights reserved. 12er12 Chapte Bivariate i Regression (Part 1) Bivariate Regression Visual Displays Begin the analysis of bivariate data (i.e., two variables) with a scatter plot. A scatter plot - displays each observed

More information

The t-statistic. Student s t Test

The t-statistic. Student s t Test The t-statistic 1 Student s t Test When the population standard deviation is not known, you cannot use a z score hypothesis test Use Student s t test instead Student s t, or t test is, conceptually, very

More information

Topic 21 Goodness of Fit

Topic 21 Goodness of Fit Topic 21 Goodness of Fit Contingency Tables 1 / 11 Introduction Two-way Table Smoking Habits The Hypothesis The Test Statistic Degrees of Freedom Outline 2 / 11 Introduction Contingency tables, also known

More information

Confidence Intervals, Testing and ANOVA Summary

Confidence Intervals, Testing and ANOVA Summary Confidence Intervals, Testing and ANOVA Summary 1 One Sample Tests 1.1 One Sample z test: Mean (σ known) Let X 1,, X n a r.s. from N(µ, σ) or n > 30. Let The test statistic is H 0 : µ = µ 0. z = x µ 0

More information

Spearman Rho Correlation

Spearman Rho Correlation Spearman Rho Correlation Learning Objectives After studying this Chapter, you should be able to: know when to use Spearman rho, Calculate Spearman rho coefficient, Interpret the correlation coefficient,

More information

Econ 325: Introduction to Empirical Economics

Econ 325: Introduction to Empirical Economics Econ 325: Introduction to Empirical Economics Chapter 9 Hypothesis Testing: Single Population Ch. 9-1 9.1 What is a Hypothesis? A hypothesis is a claim (assumption) about a population parameter: population

More information

POLI 443 Applied Political Research

POLI 443 Applied Political Research POLI 443 Applied Political Research Session 6: Tests of Hypotheses Contingency Analysis Lecturer: Prof. A. Essuman-Johnson, Dept. of Political Science Contact Information: aessuman-johnson@ug.edu.gh College

More information

Lecture 5: ANOVA and Correlation

Lecture 5: ANOVA and Correlation Lecture 5: ANOVA and Correlation Ani Manichaikul amanicha@jhsph.edu 23 April 2007 1 / 62 Comparing Multiple Groups Continous data: comparing means Analysis of variance Binary data: comparing proportions

More information

Statistics Handbook. All statistical tables were computed by the author.

Statistics Handbook. All statistical tables were computed by the author. Statistics Handbook Contents Page Wilcoxon rank-sum test (Mann-Whitney equivalent) Wilcoxon matched-pairs test 3 Normal Distribution 4 Z-test Related samples t-test 5 Unrelated samples t-test 6 Variance

More information

Advanced Experimental Design

Advanced Experimental Design Advanced Experimental Design Topic Four Hypothesis testing (z and t tests) & Power Agenda Hypothesis testing Sampling distributions/central limit theorem z test (σ known) One sample z & Confidence intervals

More information

Hypothesis Tests and Estimation for Population Variances. Copyright 2014 Pearson Education, Inc.

Hypothesis Tests and Estimation for Population Variances. Copyright 2014 Pearson Education, Inc. Hypothesis Tests and Estimation for Population Variances 11-1 Learning Outcomes Outcome 1. Formulate and carry out hypothesis tests for a single population variance. Outcome 2. Develop and interpret confidence

More information

χ test statistics of 2.5? χ we see that: χ indicate agreement between the two sets of frequencies.

χ test statistics of 2.5? χ we see that: χ indicate agreement between the two sets of frequencies. I. T or F. (1 points each) 1. The χ -distribution is symmetric. F. The χ may be negative, zero, or positive F 3. The chi-square distribution is skewed to the right. T 4. The observed frequency of a cell

More information

MATH 240. Chapter 8 Outlines of Hypothesis Tests

MATH 240. Chapter 8 Outlines of Hypothesis Tests MATH 4 Chapter 8 Outlines of Hypothesis Tests Test for Population Proportion p Specify the null and alternative hypotheses, ie, choose one of the three, where p is some specified number: () H : p H : p

More information

The goodness-of-fit test Having discussed how to make comparisons between two proportions, we now consider comparisons of multiple proportions.

The goodness-of-fit test Having discussed how to make comparisons between two proportions, we now consider comparisons of multiple proportions. The goodness-of-fit test Having discussed how to make comparisons between two proportions, we now consider comparisons of multiple proportions. A common problem of this type is concerned with determining

More information

Correlation. A statistics method to measure the relationship between two variables. Three characteristics

Correlation. A statistics method to measure the relationship between two variables. Three characteristics Correlation Correlation A statistics method to measure the relationship between two variables Three characteristics Direction of the relationship Form of the relationship Strength/Consistency Direction

More information

Bivariate Relationships Between Variables

Bivariate Relationships Between Variables Bivariate Relationships Between Variables BUS 735: Business Decision Making and Research 1 Goals Specific goals: Detect relationships between variables. Be able to prescribe appropriate statistical methods

More information

Correlation and Regression

Correlation and Regression Correlation and Regression October 25, 2017 STAT 151 Class 9 Slide 1 Outline of Topics 1 Associations 2 Scatter plot 3 Correlation 4 Regression 5 Testing and estimation 6 Goodness-of-fit STAT 151 Class

More information

Sampling Distributions: Central Limit Theorem

Sampling Distributions: Central Limit Theorem Review for Exam 2 Sampling Distributions: Central Limit Theorem Conceptually, we can break up the theorem into three parts: 1. The mean (µ M ) of a population of sample means (M) is equal to the mean (µ)

More information

14: Correlation. Introduction Scatter Plot The Correlational Coefficient Hypothesis Test Assumptions An Additional Example

14: Correlation. Introduction Scatter Plot The Correlational Coefficient Hypothesis Test Assumptions An Additional Example 14: Correlation Introduction Scatter Plot The Correlational Coefficient Hypothesis Test Assumptions An Additional Example Introduction Correlation quantifies the extent to which two quantitative variables,

More information

Psychology 282 Lecture #4 Outline Inferences in SLR

Psychology 282 Lecture #4 Outline Inferences in SLR Psychology 282 Lecture #4 Outline Inferences in SLR Assumptions To this point we have not had to make any distributional assumptions. Principle of least squares requires no assumptions. Can use correlations

More information

Testing Independence

Testing Independence Testing Independence Dipankar Bandyopadhyay Department of Biostatistics, Virginia Commonwealth University BIOS 625: Categorical Data & GLM 1/50 Testing Independence Previously, we looked at RR = OR = 1

More information

Statistics for Managers Using Microsoft Excel

Statistics for Managers Using Microsoft Excel Statistics for Managers Using Microsoft Excel 7 th Edition Chapter 1 Chi-Square Tests and Nonparametric Tests Statistics for Managers Using Microsoft Excel 7e Copyright 014 Pearson Education, Inc. Chap

More information

Midterm 2 - Solutions

Midterm 2 - Solutions Ecn 102 - Analysis of Economic Data University of California - Davis February 23, 2010 Instructor: John Parman Midterm 2 - Solutions You have until 10:20am to complete this exam. Please remember to put

More information

The Chi-Square Distributions

The Chi-Square Distributions MATH 183 The Chi-Square Distributions Dr. Neal, WKU The chi-square distributions can be used in statistics to analyze the standard deviation σ of a normally distributed measurement and to test the goodness

More information

Final Exam - Solutions

Final Exam - Solutions Ecn 102 - Analysis of Economic Data University of California - Davis March 19, 2010 Instructor: John Parman Final Exam - Solutions You have until 5:30pm to complete this exam. Please remember to put your

More information

y response variable x 1, x 2,, x k -- a set of explanatory variables

y response variable x 1, x 2,, x k -- a set of explanatory variables 11. Multiple Regression and Correlation y response variable x 1, x 2,, x k -- a set of explanatory variables In this chapter, all variables are assumed to be quantitative. Chapters 12-14 show how to incorporate

More information

Lecture 2. Estimating Single Population Parameters 8-1

Lecture 2. Estimating Single Population Parameters 8-1 Lecture 2 Estimating Single Population Parameters 8-1 8.1 Point and Confidence Interval Estimates for a Population Mean Point Estimate A single statistic, determined from a sample, that is used to estimate

More information

Linear Correlation and Regression Analysis

Linear Correlation and Regression Analysis Linear Correlation and Regression Analysis Set Up the Calculator 2 nd CATALOG D arrow down DiagnosticOn ENTER ENTER SCATTER DIAGRAM Positive Linear Correlation Positive Correlation Variables will tend

More information

Institute of Actuaries of India

Institute of Actuaries of India Institute of Actuaries of India Subject CT3 Probability & Mathematical Statistics May 2011 Examinations INDICATIVE SOLUTION Introduction The indicative solution has been written by the Examiners with the

More information

Harvard University. Rigorous Research in Engineering Education

Harvard University. Rigorous Research in Engineering Education Statistical Inference Kari Lock Harvard University Department of Statistics Rigorous Research in Engineering Education 12/3/09 Statistical Inference You have a sample and want to use the data collected

More information

Mathematical statistics

Mathematical statistics November 15 th, 2018 Lecture 21: The two-sample t-test Overview Week 1 Week 2 Week 4 Week 7 Week 10 Week 14 Probability reviews Chapter 6: Statistics and Sampling Distributions Chapter 7: Point Estimation

More information

Binary Logistic Regression

Binary Logistic Regression The coefficients of the multiple regression model are estimated using sample data with k independent variables Estimated (or predicted) value of Y Estimated intercept Estimated slope coefficients Ŷ = b

More information

Inferences for Regression

Inferences for Regression Inferences for Regression An Example: Body Fat and Waist Size Looking at the relationship between % body fat and waist size (in inches). Here is a scatterplot of our data set: Remembering Regression In

More information

2 Regression Analysis

2 Regression Analysis FORK 1002 Preparatory Course in Statistics: 2 Regression Analysis Genaro Sucarrat (BI) http://www.sucarrat.net/ Contents: 1 Bivariate Correlation Analysis 2 Simple Regression 3 Estimation and Fit 4 T -Test:

More information

CIVL /8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8

CIVL /8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8 CIVL - 7904/8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8 Chi-square Test How to determine the interval from a continuous distribution I = Range 1 + 3.322(logN) I-> Range of the class interval

More information

Chapter 4. Regression Models. Learning Objectives

Chapter 4. Regression Models. Learning Objectives Chapter 4 Regression Models To accompany Quantitative Analysis for Management, Eleventh Edition, by Render, Stair, and Hanna Power Point slides created by Brian Peterson Learning Objectives After completing

More information

Ch. 16: Correlation and Regression

Ch. 16: Correlation and Regression Ch. 1: Correlation and Regression With the shift to correlational analyses, we change the very nature of the question we are asking of our data. Heretofore, we were asking if a difference was likely to

More information

Contingency Tables. Safety equipment in use Fatal Non-fatal Total. None 1, , ,128 Seat belt , ,878

Contingency Tables. Safety equipment in use Fatal Non-fatal Total. None 1, , ,128 Seat belt , ,878 Contingency Tables I. Definition & Examples. A) Contingency tables are tables where we are looking at two (or more - but we won t cover three or more way tables, it s way too complicated) factors, each

More information

Correlation and simple linear regression S5

Correlation and simple linear regression S5 Basic medical statistics for clinical and eperimental research Correlation and simple linear regression S5 Katarzyna Jóźwiak k.jozwiak@nki.nl November 15, 2017 1/41 Introduction Eample: Brain size and

More information

Sampling distribution of t. 2. Sampling distribution of t. 3. Example: Gas mileage investigation. II. Inferential Statistics (8) t =

Sampling distribution of t. 2. Sampling distribution of t. 3. Example: Gas mileage investigation. II. Inferential Statistics (8) t = 2. The distribution of t values that would be obtained if a value of t were calculated for each sample mean for all possible random of a given size from a population _ t ratio: (X - µ hyp ) t s x The result

More information

10: Crosstabs & Independent Proportions

10: Crosstabs & Independent Proportions 10: Crosstabs & Independent Proportions p. 10.1 P Background < Two independent groups < Binary outcome < Compare binomial proportions P Illustrative example ( oswege.sav ) < Food poisoning following church

More information

Review: General Approach to Hypothesis Testing. 1. Define the research question and formulate the appropriate null and alternative hypotheses.

Review: General Approach to Hypothesis Testing. 1. Define the research question and formulate the appropriate null and alternative hypotheses. 1 Review: Let X 1, X,..., X n denote n independent random variables sampled from some distribution might not be normal!) with mean µ) and standard deviation σ). Then X µ σ n In other words, X is approximately

More information

GROUPED DATA E.G. FOR SAMPLE OF RAW DATA (E.G. 4, 12, 7, 5, MEAN G x / n STANDARD DEVIATION MEDIAN AND QUARTILES STANDARD DEVIATION

GROUPED DATA E.G. FOR SAMPLE OF RAW DATA (E.G. 4, 12, 7, 5, MEAN G x / n STANDARD DEVIATION MEDIAN AND QUARTILES STANDARD DEVIATION FOR SAMPLE OF RAW DATA (E.G. 4, 1, 7, 5, 11, 6, 9, 7, 11, 5, 4, 7) BE ABLE TO COMPUTE MEAN G / STANDARD DEVIATION MEDIAN AND QUARTILES Σ ( Σ) / 1 GROUPED DATA E.G. AGE FREQ. 0-9 53 10-19 4...... 80-89

More information

Finding Relationships Among Variables

Finding Relationships Among Variables Finding Relationships Among Variables BUS 230: Business and Economic Research and Communication 1 Goals Specific goals: Re-familiarize ourselves with basic statistics ideas: sampling distributions, hypothesis

More information

STAT 328 (Statistical Packages)

STAT 328 (Statistical Packages) Department of Statistics and Operations Research College of Science King Saud University Exercises STAT 328 (Statistical Packages) nashmiah r.alshammari ^-^ Excel and Minitab - 1 - Write the commands of

More information

Multiple comparisons - subsequent inferences for two-way ANOVA

Multiple comparisons - subsequent inferences for two-way ANOVA 1 Multiple comparisons - subsequent inferences for two-way ANOVA the kinds of inferences to be made after the F tests of a two-way ANOVA depend on the results if none of the F tests lead to rejection of

More information

WISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, Academic Year Exam Version: A

WISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, Academic Year Exam Version: A WISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, 2016-17 Academic Year Exam Version: A INSTRUCTIONS TO STUDENTS 1 The time allowed for this examination paper is 2 hours. 2 This

More information

12.10 (STUDENT CD-ROM TOPIC) CHI-SQUARE GOODNESS- OF-FIT TESTS

12.10 (STUDENT CD-ROM TOPIC) CHI-SQUARE GOODNESS- OF-FIT TESTS CDR4_BERE601_11_SE_C1QXD 1//08 1:0 PM Page 1 110: (Student CD-ROM Topic) Chi-Square Goodness-of-Fit Tests CD1-1 110 (STUDENT CD-ROM TOPIC) CHI-SQUARE GOODNESS- OF-FIT TESTS In this section, χ goodness-of-fit

More information

Section 9.5. Testing the Difference Between Two Variances. Bluman, Chapter 9 1

Section 9.5. Testing the Difference Between Two Variances. Bluman, Chapter 9 1 Section 9.5 Testing the Difference Between Two Variances Bluman, Chapter 9 1 This the last day the class meets before spring break starts. Please make sure to be present for the test or make appropriate

More information

Mathematical Notation Math Introduction to Applied Statistics

Mathematical Notation Math Introduction to Applied Statistics Mathematical Notation Math 113 - Introduction to Applied Statistics Name : Use Word or WordPerfect to recreate the following documents. Each article is worth 10 points and should be emailed to the instructor

More information

Analysis of Covariance. The following example illustrates a case where the covariate is affected by the treatments.

Analysis of Covariance. The following example illustrates a case where the covariate is affected by the treatments. Analysis of Covariance In some experiments, the experimental units (subjects) are nonhomogeneous or there is variation in the experimental conditions that are not due to the treatments. For example, a

More information

Lecture 9. Selected material from: Ch. 12 The analysis of categorical data and goodness of fit tests

Lecture 9. Selected material from: Ch. 12 The analysis of categorical data and goodness of fit tests Lecture 9 Selected material from: Ch. 12 The analysis of categorical data and goodness of fit tests Univariate categorical data Univariate categorical data are best summarized in a one way frequency table.

More information

PLSC PRACTICE TEST ONE

PLSC PRACTICE TEST ONE PLSC 724 - PRACTICE TEST ONE 1. Discuss briefly the relationship between the shape of the normal curve and the variance. 2. What is the relationship between a statistic and a parameter? 3. How is the α

More information

1 A Review of Correlation and Regression

1 A Review of Correlation and Regression 1 A Review of Correlation and Regression SW, Chapter 12 Suppose we select n = 10 persons from the population of college seniors who plan to take the MCAT exam. Each takes the test, is coached, and then

More information

Tables Table A Table B Table C Table D Table E 675

Tables Table A Table B Table C Table D Table E 675 BMTables.indd Page 675 11/15/11 4:25:16 PM user-s163 Tables Table A Standard Normal Probabilities Table B Random Digits Table C t Distribution Critical Values Table D Chi-square Distribution Critical Values

More information

Outline for Today. Review of In-class Exercise Bivariate Hypothesis Test 2: Difference of Means Bivariate Hypothesis Testing 3: Correla

Outline for Today. Review of In-class Exercise Bivariate Hypothesis Test 2: Difference of Means Bivariate Hypothesis Testing 3: Correla Outline for Today 1 Review of In-class Exercise 2 Bivariate hypothesis testing 2: difference of means 3 Bivariate hypothesis testing 3: correlation 2 / 51 Task for ext Week Any questions? 3 / 51 In-class

More information

Unit 6 - Introduction to linear regression

Unit 6 - Introduction to linear regression Unit 6 - Introduction to linear regression Suggested reading: OpenIntro Statistics, Chapter 7 Suggested exercises: Part 1 - Relationship between two numerical variables: 7.7, 7.9, 7.11, 7.13, 7.15, 7.25,

More information

Association Between Variables Measured at the Interval-Ratio Level: Bivariate Correlation and Regression

Association Between Variables Measured at the Interval-Ratio Level: Bivariate Correlation and Regression Association Between Variables Measured at the Interval-Ratio Level: Bivariate Correlation and Regression Last couple of classes: Measures of Association: Phi, Cramer s V and Lambda (nominal level of measurement)

More information

Chapter 12 - Part I: Correlation Analysis

Chapter 12 - Part I: Correlation Analysis ST coursework due Friday, April - Chapter - Part I: Correlation Analysis Textbook Assignment Page - # Page - #, Page - # Lab Assignment # (available on ST webpage) GOALS When you have completed this lecture,

More information

Psych 230. Psychological Measurement and Statistics

Psych 230. Psychological Measurement and Statistics Psych 230 Psychological Measurement and Statistics Pedro Wolf December 9, 2009 This Time. Non-Parametric statistics Chi-Square test One-way Two-way Statistical Testing 1. Decide which test to use 2. State

More information

Sociology 6Z03 Review II

Sociology 6Z03 Review II Sociology 6Z03 Review II John Fox McMaster University Fall 2016 John Fox (McMaster University) Sociology 6Z03 Review II Fall 2016 1 / 35 Outline: Review II Probability Part I Sampling Distributions Probability

More information

Chapter 9. Correlation and Regression

Chapter 9. Correlation and Regression Chapter 9 Correlation and Regression Lesson 9-1/9-2, Part 1 Correlation Registered Florida Pleasure Crafts and Watercraft Related Manatee Deaths 100 80 60 40 20 0 1991 1993 1995 1997 1999 Year Boats in

More information

The One-Way Repeated-Measures ANOVA. (For Within-Subjects Designs)

The One-Way Repeated-Measures ANOVA. (For Within-Subjects Designs) The One-Way Repeated-Measures ANOVA (For Within-Subjects Designs) Logic of the Repeated-Measures ANOVA The repeated-measures ANOVA extends the analysis of variance to research situations using repeated-measures

More information

This gives us an upper and lower bound that capture our population mean.

This gives us an upper and lower bound that capture our population mean. Confidence Intervals Critical Values Practice Problems 1 Estimation 1.1 Confidence Intervals Definition 1.1 Margin of error. The margin of error of a distribution is the amount of error we predict when

More information

16.400/453J Human Factors Engineering. Design of Experiments II

16.400/453J Human Factors Engineering. Design of Experiments II J Human Factors Engineering Design of Experiments II Review Experiment Design and Descriptive Statistics Research question, independent and dependent variables, histograms, box plots, etc. Inferential

More information

Retrieve and Open the Data

Retrieve and Open the Data Retrieve and Open the Data 1. To download the data, click on the link on the class website for the SPSS syntax file for lab 1. 2. Open the file that you downloaded. 3. In the SPSS Syntax Editor, click

More information

Introduction to Statistical Data Analysis Lecture 7: The Chi-Square Distribution

Introduction to Statistical Data Analysis Lecture 7: The Chi-Square Distribution Introduction to Statistical Data Analysis Lecture 7: The Chi-Square Distribution James V. Lambers Department of Mathematics The University of Southern Mississippi James V. Lambers Statistical Data Analysis

More information

Problem Set 4 - Solutions

Problem Set 4 - Solutions Problem Set 4 - Solutions Econ-310, Spring 004 8. a. If we wish to test the research hypothesis that the mean GHQ score for all unemployed men exceeds 10, we test: H 0 : µ 10 H a : µ > 10 This is a one-tailed

More information

i=1 X i/n i=1 (X i X) 2 /(n 1). Find the constant c so that the statistic c(x X n+1 )/S has a t-distribution. If n = 8, determine k such that

i=1 X i/n i=1 (X i X) 2 /(n 1). Find the constant c so that the statistic c(x X n+1 )/S has a t-distribution. If n = 8, determine k such that Math 47 Homework Assignment 4 Problem 411 Let X 1, X,, X n, X n+1 be a random sample of size n + 1, n > 1, from a distribution that is N(µ, σ ) Let X = n i=1 X i/n and S = n i=1 (X i X) /(n 1) Find the

More information

Inference for Regression

Inference for Regression Inference for Regression Section 9.4 Cathy Poliak, Ph.D. cathy@math.uh.edu Office in Fleming 11c Department of Mathematics University of Houston Lecture 13b - 3339 Cathy Poliak, Ph.D. cathy@math.uh.edu

More information

4/22/2010. Test 3 Review ANOVA

4/22/2010. Test 3 Review ANOVA Test 3 Review ANOVA 1 School recruiter wants to examine if there are difference between students at different class ranks in their reported intensity of school spirit. What is the factor? How many levels

More information

The simple linear regression model discussed in Chapter 13 was written as

The simple linear regression model discussed in Chapter 13 was written as 1519T_c14 03/27/2006 07:28 AM Page 614 Chapter Jose Luis Pelaez Inc/Blend Images/Getty Images, Inc./Getty Images, Inc. 14 Multiple Regression 14.1 Multiple Regression Analysis 14.2 Assumptions of the Multiple

More information