Correlation and Regression

Size: px
Start display at page:

Download "Correlation and Regression"

Transcription

1 Correlation and Regression 1

2 Overview Introduction Scatter Plots Correlation Regression Coefficient of Determination 2

3 Objectives of the topic 1. Draw a scatter plot for a set of ordered pairs. 2. Compute the correlation coefficient. 3. Test the hypothesis H 0 : ρ = Compute the equation of the regression line. 5. Compute the coefficient of determination. 6. Have a working idea of the concept of multiple regression ( it is not examinable) 3

4 Introduction Correlation is a statistical method used to determine whether a linear relationship between variables exists. In this course reference will be made only to pair wise correlation. i.e the correlation between only two variables Regression is a statistical method used to describe and estimate the nature of the relationship between variables that is, positive or negative, linear or nonlinear. 4

5 introduction If there exists a linear relationship, then regression will be used to estimate the equation for the linear relationship Correlation and regression therefore work hand in hand. They are complements not substitutes. 5

6 Introduction to correlation and regression The purpose of this topic is to answer these questions statistically: 1. Are two variables related?- is there a particular observable or logical relationship between two variables? If one variable increases, what happens to the other variable? Does it increase or decrease? Consider sales and revenue; motivation and worker performance; exchange rate and imports; smoking and lung cancer; weight and blood pressure; interest rate and investment; study time and score. 6

7 Introduction to correlation and regression 2. If there is a relationship, what type of relationship exists? Is it positive or negative? 3. What is the strength of the linear relationship? Is it a weak relationship or a strong relationship? 4. What kind of predictions can be made from the relationship? 7

8 Introduction To answer question 1,2 and 3,the correlation coefficient, a numerical measure to determine whether two variables are related and to determine the strength of the relationship between the variables is used. To answer question 3, regression will be used. 8

9 Scatter Plots and Correlation The first step in correlation is to have a conceptual or logical understanding of the relation between the two variables. What do you think is the relationship between exchange rate and imports. The next step is to construct a scatter plot for the data to confirm the conceptual relationship A scatter plot is a graph of the ordered pairs (x, y) of numbers consisting of the independent variable x and the dependent variable y. 9

10 Scatter plots and correlation Correlation is symmetric- i.e it does not matter which of the two variables is dependent and which is independent 10

11 Example1: Car Rental Companies Construct a scatter plot for the data shown for car rental companies in the United States for a recent year. Step 1: Draw and label the x and y axes and denote one of the variables by X and the other by Y. Step 2: Plot each point on the graph. 11

12 Example 1 What do you think is the conceptual or logical relationship between the number of cars given out to hire and the sales revenue made. Positive? Negative? No relationship? As more cars are given out to hire does the sales revenue of the firm decrease or increase? The scatter plot is shown in the following diagram 12

13 Example : Car Rental Companies Positive Relationship cars (in ten thousands) 13

14 Example 2: Absences/Final Grades Construct a scatter plot for the data obtained in a study on the number of absences and the final grades of seven randomly selected students from a statistics class. Step 1: Draw and label the x and y axes and denote one of the variables by X and the other by Y Step 2: Plot each point on the graph. 14

15 Example -2: Absences/Final Grades Negative Relationship number of absentees 15

16 Correlation-interpretation After the scatter plot, the next step is to numerically calculate the correlation coefficient between the variables and interpret the coefficient The correlation coefficient computed from the sample data measures the strength and direction of a linear relationship between two variables. There are several types of correlation coefficients. Two will be explained in this course; The two explained in this course are the Pearson product moment correlation coefficient (PPMC) and the Spearman s Rank correlation coefficient 16

17 Correlation-interpretation The symbol for the sample correlation coefficient is r. The symbol for the population correlation coefficient is. 17

18 Correlation-interpretation The range of the correlation coefficient is from 1 to 1. If there is a strong positive linear relationship between the variables, the value of r will be close to 1. If there is a strong negative linear relationship between the variables, the value of r will be close to 1. 18

19 Correlation-interpretation If r=1, then there exists a perfect positive relationship between the variables If r=-1, then there exists a perfect negative relationship between the variables If r=0, then there exists no relationship between the variables If 0<r<0.5, then there exists a weak positive relationship between the variables If 0.5<r<1, then there exists a strong positive relationship between the variables 19

20 Correlation-interpretation If -0.5<r<0, then there exists a weak negative relationship between the variables If -0.5<r<-1, then there exists a strong negative relationship between the variables 20

21 Correlation-interpretation 21

22 Pearson s Product Moment Correlation Coefficient (PPMCC) 22

23 Correlation Coefficient (PPMCC) The formula for the correlation coefficient (PPMCC) is given by r n xy x y n x x n y y where n is the number of data pairs. 23

24 Correlation coefficient (PPMCC) It can also be expressed in notation form as: r S S xx xy S Where S n xy x y ; xy yy 2 2 Syy n y y S 2 2 xx n x x 24

25 PPMCC- alternative/easier formula Alternatively, the formula can be written as: r XY n XY X nx Y ny Y Where and X are mean of Y and X respectively 25

26 Example 1: Car Rental Companies Compute the correlation coefficient for the data in Example 1. Company A B C D E F Cars x (in 10,000s) Σx = Income y (in billions) xy x 2 y Σy = Σxy = Σx 2 = Σy 2 =

27 Example 1: Car Rental Companies Compute the correlation coefficient for the data in Example 1. Σx = 153.8, Σy = 18.7, Σxy = , Σx 2 = , Σy 2 = 80.67, n = 6 r r r n xy x y n x x n y y (strong positive relationship) 27

28 Example 10-5: Absences/Final Grades Compute the correlation coefficient for the data in Example Student A B C D E F Number of absences, x Final Grade y (pct.) xy x 2 y ,724 7,396 1,849 5,476 3,364 8,100 G ,084 Σx = 57 Σy = 511 Σxy = 3745 Σx 2 = 579 Σy 2 = 38,993 28

29 Alternative calculation We can calculate for the mean of Y and X. the mean of X r and Y are given as and respectively ( )(3.1167) ( ( ) 2 ( (3.1167) 2 r Can you interpret the correlation coefficient? There is a strong positive correlation between car rentals and sales revenue 29

30 Example 2: Absences/Final Grades Compute the correlation coefficient for the data in Example 2. Σx = 57, Σy = 511, Σxy = 3745, Σx 2 = 579, Σy 2 = 38,993, n = 7 r r r n xy x y n x x n y y , (strong negative relationship) 30

31 Spearman s Rank correlation coefficient 31

32 Formula for spearman s rank correlation coefficient The formulas is given by: r s 1 6 nn d 2 2 ( 1) Where n is the number of observations. d is the difference in ranks between X and Y. The interpretation for the spearman s correlation is the same as for the PPMCC 32

33 STEP BY STEP APPROACH 1. Rank X and Y in an ascending or descending order 2. Calculate the difference between the ranks 3. Find the square of the difference and calculate 33

34 Example-Step by step approach Consider the data from example 1 COMPANY CARS(in 10,000s) Income (in billions) A 63 7 B C D E F

35 Example-Step by step approach Represent one of the variable by X and the other by Y. Remember that correlation is symmetric Order the data in ascending order Find the difference in the ranks Calculate the correlation coefficient 35

36 solution company Cars(X) Income (y) Rank of X Rank of Y A B C D E F d 4 d 2 36

37 solution The spearman s rank correlation coefficient is given as: 6(4) r s 2 6(6 1) Note: the answer will not necessarily be the same but the interpretation will be the same. In this case, there exists a strong positive correlation between car rentals and revenue 37

38 Special cases How do we rank the data when two or more data points have the same rank? When two scores tie in rank, both are given the mean of the two ranks they would occupy and the next rank is eliminated to keep n (the number of observations) consistent. For example, if two data points tied for 4 th place, both would receive a rank of 4.5 ((4 + 5) 2), and the next data point would be ranked number 6. if three points tied for 4 th place, the three would receive a rank of 5 (( ) 3), and the next school would be ranked number 7. 38

39 example Consider the following example company Cars(X) Income (y) Rank of X Rank of Y A B C D E F d d

40 solution The spearman s rank correlation coefficient is given as: 6(10.5) r s 2 6(6 1) 40

Chapter 10. Correlation and Regression. McGraw-Hill, Bluman, 7th ed., Chapter 10 1

Chapter 10. Correlation and Regression. McGraw-Hill, Bluman, 7th ed., Chapter 10 1 Chapter 10 Correlation and Regression McGraw-Hill, Bluman, 7th ed., Chapter 10 1 Chapter 10 Overview Introduction 10-1 Scatter Plots and Correlation 10- Regression 10-3 Coefficient of Determination and

More information

Chapter 10. Correlation and Regression. McGraw-Hill, Bluman, 7th ed., Chapter 10 1

Chapter 10. Correlation and Regression. McGraw-Hill, Bluman, 7th ed., Chapter 10 1 Chapter 10 Correlation and Regression McGraw-Hill, Bluman, 7th ed., Chapter 10 1 Example 10-2: Absences/Final Grades Please enter the data below in L1 and L2. The data appears on page 537 of your textbook.

More information

CORRELATION AND REGRESSION

CORRELATION AND REGRESSION CORRELATION AND REGRESSION CORRELATION The correlation coefficient is a number, between -1 and +1, which measures the strength of the relationship between two sets of data. The closer the correlation coefficient

More information

Slide 7.1. Theme 7. Correlation

Slide 7.1. Theme 7. Correlation Slide 7.1 Theme 7 Correlation Slide 7.2 Overview Researchers are often interested in exploring whether or not two variables are associated This lecture will consider Scatter plots Pearson correlation coefficient

More information

Correlation and Regression

Correlation and Regression Elementary Statistics A Step by Step Approach Sixth Edition by Allan G. Bluman http://www.mhhe.com/math/stat/blumanbrief SLIDES PREPARED BY LLOYD R. JAISINGH MOREHEAD STATE UNIVERSITY MOREHEAD KY Updated

More information

regression analysis is a type of inferential statistics which tells us whether relationships between two or more variables exist

regression analysis is a type of inferential statistics which tells us whether relationships between two or more variables exist regression analysis is a type of inferential statistics which tells us whether relationships between two or more variables exist sales $ (y - dependent variable) advertising $ (x - independent variable)

More information

Measuring Associations : Pearson s correlation

Measuring Associations : Pearson s correlation Measuring Associations : Pearson s correlation Scatter Diagram A scatter diagram is a graph that shows that the relationship between two variables measured on the same individual. Each individual in the

More information

CORELATION - Pearson-r - Spearman-rho

CORELATION - Pearson-r - Spearman-rho CORELATION - Pearson-r - Spearman-rho Scatter Diagram A scatter diagram is a graph that shows that the relationship between two variables measured on the same individual. Each individual in the set is

More information

CORRELATION ANALYSIS. Dr. Anulawathie Menike Dept. of Economics

CORRELATION ANALYSIS. Dr. Anulawathie Menike Dept. of Economics CORRELATION ANALYSIS Dr. Anulawathie Menike Dept. of Economics 1 What is Correlation The correlation is one of the most common and most useful statistics. It is a term used to describe the relationship

More information

CORRELATION. compiled by Dr Kunal Pathak

CORRELATION. compiled by Dr Kunal Pathak CORRELATION compiled by Dr Kunal Pathak Flow of Presentation Definition Types of correlation Method of studying correlation a) Scatter diagram b) Karl Pearson s coefficient of correlation c) Spearman s

More information

MEI STRUCTURED MATHEMATICS STATISTICS 2, S2. Practice Paper S2-A

MEI STRUCTURED MATHEMATICS STATISTICS 2, S2. Practice Paper S2-A MEI Mathematics in Education and Industry MEI STRUCTURED MATHEMATICS STATISTICS, S Practice Paper S-A Additional materials: Answer booklet/paper Graph paper MEI Examination formulae and tables (MF) TIME

More information

Correlation. Engineering Mathematics III

Correlation. Engineering Mathematics III Correlation Correlation Finding the relationship between two quantitative variables without being able to infer causal relationships Correlation is a statistical technique used to determine the degree

More information

Statistics S1 Advanced Subsidiary

Statistics S1 Advanced Subsidiary Paper Reference(s) 6683 Edexcel GCE Statistics S1 Advanced Subsidiary Friday 14 January 2005 Morning Time: 1 hour 30 minutes Materials required for examination Items included with question papers Mathematical

More information

Correlation and Regression

Correlation and Regression Correlation and Regression. ITRDUCTI Till now, we have been working on one set of observations or measurements e.g. heights of students in a class, marks of students in an exam, weekly wages of workers

More information

Correlation: Relationships between Variables

Correlation: Relationships between Variables Correlation Correlation: Relationships between Variables So far, nearly all of our discussion of inferential statistics has focused on testing for differences between group means However, researchers are

More information

Correlation and Regression

Correlation and Regression Edexcel GCE Core Mathematics S1 Correlation and Regression Materials required for examination Mathematical Formulae (Green) Items included with question papers Nil Advice to Candidates You must ensure

More information

About Bivariate Correlations and Linear Regression

About Bivariate Correlations and Linear Regression About Bivariate Correlations and Linear Regression TABLE OF CONTENTS About Bivariate Correlations and Linear Regression... 1 What is BIVARIATE CORRELATION?... 1 What is LINEAR REGRESSION... 1 Bivariate

More information

CORRELATION AND REGRESSION

CORRELATION AND REGRESSION CORRELATION AND REGRESSION CORRELATION Introduction CORRELATION problems which involve measuring the strength of a relationship. Correlation Analysis involves various methods and techniques used for studying

More information

12.7. Scattergrams and Correlation

12.7. Scattergrams and Correlation 12.7. Scattergrams and Correlation 1 Objectives A. Make a scattergram and determine if there is a positive, a negative, or no correlation for the data. B. Find and interpret the coefficient of correlation

More information

UGRC 120 Numeracy Skills

UGRC 120 Numeracy Skills UGRC 120 Numeracy Skills Session 7 MEASURE OF LINEAR ASSOCIATION & RELATION Lecturer: Dr. Ezekiel N. N. Nortey/Mr. Enoch Nii Boi Quaye, Statistics Contact Information: ennortey@ug.edu.gh/enbquaye@ug.edu.gh

More information

Correlation & Regression. Dr. Moataza Mahmoud Abdel Wahab Lecturer of Biostatistics High Institute of Public Health University of Alexandria

Correlation & Regression. Dr. Moataza Mahmoud Abdel Wahab Lecturer of Biostatistics High Institute of Public Health University of Alexandria بسم الرحمن الرحيم Correlation & Regression Dr. Moataza Mahmoud Abdel Wahab Lecturer of Biostatistics High Institute of Public Health University of Alexandria Correlation Finding the relationship between

More information

Can you tell the relationship between students SAT scores and their college grades?

Can you tell the relationship between students SAT scores and their college grades? Correlation One Challenge Can you tell the relationship between students SAT scores and their college grades? A: The higher SAT scores are, the better GPA may be. B: The higher SAT scores are, the lower

More information

UNIT 4 RANK CORRELATION (Rho AND KENDALL RANK CORRELATION

UNIT 4 RANK CORRELATION (Rho AND KENDALL RANK CORRELATION UNIT 4 RANK CORRELATION (Rho AND KENDALL RANK CORRELATION Structure 4.0 Introduction 4.1 Objectives 4. Rank-Order s 4..1 Rank-order data 4.. Assumptions Underlying Pearson s r are Not Satisfied 4.3 Spearman

More information

Nonparametric Statistics Notes

Nonparametric Statistics Notes Nonparametric Statistics Notes Chapter 5: Some Methods Based on Ranks Jesse Crawford Department of Mathematics Tarleton State University (Tarleton State University) Ch 5: Some Methods Based on Ranks 1

More information

Reminder: Student Instructional Rating Surveys

Reminder: Student Instructional Rating Surveys Reminder: Student Instructional Rating Surveys You have until May 7 th to fill out the student instructional rating surveys at https://sakai.rutgers.edu/portal/site/sirs The survey should be available

More information

PhysicsAndMathsTutor.com

PhysicsAndMathsTutor.com 1. A personnel manager wants to find out if a test carried out during an employee s interview and a skills assessment at the end of basic training is a guide to performance after working for the company

More information

Solutionbank S1 Edexcel AS and A Level Modular Mathematics

Solutionbank S1 Edexcel AS and A Level Modular Mathematics file://c:\users\buba\kaz\ouba\s1_6_a_1.html Exercise A, Question 1 The following scatter diagrams were drawn. a Describe the type of correlation shown by each scatter diagram. b Interpret each correlation.

More information

Business Mathematics and Statistics (MATH0203) Chapter 1: Correlation & Regression

Business Mathematics and Statistics (MATH0203) Chapter 1: Correlation & Regression Business Mathematics and Statistics (MATH0203) Chapter 1: Correlation & Regression Dependent and independent variables The independent variable (x) is the one that is chosen freely or occur naturally.

More information

Interactions. Interactions. Lectures 1 & 2. Linear Relationships. y = a + bx. Slope. Intercept

Interactions. Interactions. Lectures 1 & 2. Linear Relationships. y = a + bx. Slope. Intercept Interactions Lectures 1 & Regression Sometimes two variables appear related: > smoking and lung cancers > height and weight > years of education and income > engine size and gas mileage > GMAT scores and

More information

Statistics 1. Edexcel Notes S1. Mathematical Model. A mathematical model is a simplification of a real world problem.

Statistics 1. Edexcel Notes S1. Mathematical Model. A mathematical model is a simplification of a real world problem. Statistics 1 Mathematical Model A mathematical model is a simplification of a real world problem. 1. A real world problem is observed. 2. A mathematical model is thought up. 3. The model is used to make

More information

OCR Maths S1. Topic Questions from Papers. Bivariate Data

OCR Maths S1. Topic Questions from Papers. Bivariate Data OCR Maths S1 Topic Questions from Papers Bivariate Data PhysicsAndMathsTutor.com 1 The scatter diagrams below illustrate three sets of bivariate data, A, B and C. State, with an explanation in each case,

More information

REVIEW 8/2/2017 陈芳华东师大英语系

REVIEW 8/2/2017 陈芳华东师大英语系 REVIEW Hypothesis testing starts with a null hypothesis and a null distribution. We compare what we have to the null distribution, if the result is too extreme to belong to the null distribution (p

More information

LI EAR REGRESSIO A D CORRELATIO

LI EAR REGRESSIO A D CORRELATIO CHAPTER 6 LI EAR REGRESSIO A D CORRELATIO Page Contents 6.1 Introduction 10 6. Curve Fitting 10 6.3 Fitting a Simple Linear Regression Line 103 6.4 Linear Correlation Analysis 107 6.5 Spearman s Rank Correlation

More information

Lecture 14. Analysis of Variance * Correlation and Regression. The McGraw-Hill Companies, Inc., 2000

Lecture 14. Analysis of Variance * Correlation and Regression. The McGraw-Hill Companies, Inc., 2000 Lecture 14 Analysis of Variance * Correlation and Regression Outline Analysis of Variance (ANOVA) 11-1 Introduction 11-2 Scatter Plots 11-3 Correlation 11-4 Regression Outline 11-5 Coefficient of Determination

More information

Lecture 14. Outline. Outline. Analysis of Variance * Correlation and Regression Analysis of Variance (ANOVA)

Lecture 14. Outline. Outline. Analysis of Variance * Correlation and Regression Analysis of Variance (ANOVA) Outline Lecture 14 Analysis of Variance * Correlation and Regression Analysis of Variance (ANOVA) 11-1 Introduction 11- Scatter Plots 11-3 Correlation 11-4 Regression Outline 11-5 Coefficient of Determination

More information

Upon completion of this chapter, you should be able to:

Upon completion of this chapter, you should be able to: 1 Chaptter 7:: CORRELATIION Upon completion of this chapter, you should be able to: Explain the concept of relationship between variables Discuss the use of the statistical tests to determine correlation

More information

Correlation and regression

Correlation and regression NST 1B Experimental Psychology Statistics practical 1 Correlation and regression Rudolf Cardinal & Mike Aitken 11 / 12 November 2003 Department of Experimental Psychology University of Cambridge Handouts:

More information

Statistics S1 Advanced/Advanced Subsidiary

Statistics S1 Advanced/Advanced Subsidiary Paper Reference(s) 6683 Edexcel GCE Statistics S1 Advanced/Advanced Subsidiary Tuesday 2 November 2004 Morning Time: 1 hour 30 minutes Materials required for examination Answer Book (AB16) Graph Paper

More information

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE IN STATISTICS, 2011 MODULE 4 : Linear models Time allowed: One and a half hours Candidates should answer THREE questions. Each question

More information

determine whether or not this relationship is.

determine whether or not this relationship is. Section 9-1 Correlation A correlation is a between two. The data can be represented by ordered pairs (x,y) where x is the (or ) variable and y is the (or ) variable. There are several types of correlations

More information

Math Analysis Notes Mrs. Atkinson 1

Math Analysis Notes Mrs. Atkinson 1 Name: Math Analysis Chapter 7 Notes Day 6: Section 7-1 Solving Systems of Equations with Two Variables; Sections 7-1: Solving Systems of Equations with Two Variables Solving Systems of equations with two

More information

There is another type of correlation coefficient however called SPEARMAN S RANK CORRELATION COEFFICIENT (denoted r S ).

There is another type of correlation coefficient however called SPEARMAN S RANK CORRELATION COEFFICIENT (denoted r S ). Spearman s Rank Correlation Coefficient The learning objectives of this unit are to: * introduce Spearman s rank correlation coefficient as an alternative measure of correlation to the PMCC; * to understand

More information

PhysicsAndMathsTutor.com

PhysicsAndMathsTutor.com 1. The product moment correlation coefficient is denoted by r and Spearman s rank correlation coefficient is denoted by r s. (a) Sketch separate scatter diagrams, with five points on each diagram, to show

More information

Correlation. We don't consider one variable independent and the other dependent. Does x go up as y goes up? Does x go down as y goes up?

Correlation. We don't consider one variable independent and the other dependent. Does x go up as y goes up? Does x go down as y goes up? Comment: notes are adapted from BIOL 214/312. I. Correlation. Correlation A) Correlation is used when we want to examine the relationship of two continuous variables. We are not interested in prediction.

More information

INSTITUTE OF ACTUARIES OF INDIA

INSTITUTE OF ACTUARIES OF INDIA INSTITUTE OF ACTUARIES OF INDIA EXAMINATIONS 4 th November 008 Subject CT3 Probability & Mathematical Statistics Time allowed: Three Hours (10.00 13.00 Hrs) Total Marks: 100 INSTRUCTIONS TO THE CANDIDATES

More information

A LEVEL MATHEMATICS QUESTIONBANKS REGRESSION AND CORRELATION. 1. Sketch scatter diagrams with at least 5 points to illustrate the following:

A LEVEL MATHEMATICS QUESTIONBANKS REGRESSION AND CORRELATION. 1. Sketch scatter diagrams with at least 5 points to illustrate the following: 1. Sketch scatter diagrams with at least 5 points to illustrate the following: a) Data with a product moment correlation coefficient of 1 b) Data with a rank correlation coefficient of 1, but product moment

More information

Solutionbank S1 Edexcel AS and A Level Modular Mathematics

Solutionbank S1 Edexcel AS and A Level Modular Mathematics Page 1 of 2 Exercise A, Question 1 As part of a statistics project, Gill collected data relating to the length of time, to the nearest minute, spent by shoppers in a supermarket and the amount of money

More information

CORRELATION AND SIMPLE REGRESSION 10.0 OBJECTIVES 10.1 INTRODUCTION

CORRELATION AND SIMPLE REGRESSION 10.0 OBJECTIVES 10.1 INTRODUCTION UNIT 10 CORRELATION AND SIMPLE REGRESSION STRUCTURE 10.0 Objectives 10.1 Introduction 10. Correlation 10..1 Scatter Diagram 10.3 The Correlation Coefficient 10.3.1 Karl Pearson s Correlation Coefficient

More information

1)I have 4 red pens, 3 purple pens and 1 green pen that I use for grading. If I randomly choose a pen,

1)I have 4 red pens, 3 purple pens and 1 green pen that I use for grading. If I randomly choose a pen, Introduction to Statistics 252 1I have 4 red pens, 3 purple pens and 1 green pen that I use for grading. If I randomly choose a pen, a what is the probability that I choose a red pen? 4 8 1 2 b what is

More information

UNIT-IV CORRELATION AND REGRESSION

UNIT-IV CORRELATION AND REGRESSION Correlation coefficient: UNIT-IV CORRELATION AND REGRESSION The quantity r, called the linear correlation coefficient, measures the strength and the direction of a linear relationship between two variables.

More information

Correlation. A statistics method to measure the relationship between two variables. Three characteristics

Correlation. A statistics method to measure the relationship between two variables. Three characteristics Correlation Correlation A statistics method to measure the relationship between two variables Three characteristics Direction of the relationship Form of the relationship Strength/Consistency Direction

More information

4.1 Introduction. 4.2 The Scatter Diagram. Chapter 4 Linear Correlation and Regression Analysis

4.1 Introduction. 4.2 The Scatter Diagram. Chapter 4 Linear Correlation and Regression Analysis 4.1 Introduction Correlation is a technique that measures the strength (or the degree) of the relationship between two variables. For example, we could measure how strong the relationship is between people

More information

GROUP-RANK CORRELATION COEFFICIENT

GROUP-RANK CORRELATION COEFFICIENT GROUP-RANK CORRELATION COEFFICIENT By S. V. SIMHADRI (Department of Electrical Communication Engineering Indian Institution of Science Bangalore- 12, India) [Received: October 30, 1972] ABSTRACT A new

More information

Relationships between variables. Visualizing Bivariate Distributions: Scatter Plots

Relationships between variables. Visualizing Bivariate Distributions: Scatter Plots SFBS Course Notes Part 7: Correlation Bivariate relationships (p. 1) Linear transformations (p. 3) Pearson r : Measuring a relationship (p. 5) Interpretation of correlations (p. 10) Relationships between

More information

IOP2601. Some notes on basic mathematical calculations

IOP2601. Some notes on basic mathematical calculations IOP601 Some notes on basic mathematical calculations The order of calculations In order to perform the calculations required in this module, there are a few steps that you need to complete. Step 1: Choose

More information

Paper Reference(s) 6683 Edexcel GCE Statistics S1 Advanced/Advanced Subsidiary Thursday 5 June 2003 Morning Time: 1 hour 30 minutes

Paper Reference(s) 6683 Edexcel GCE Statistics S1 Advanced/Advanced Subsidiary Thursday 5 June 2003 Morning Time: 1 hour 30 minutes Paper Reference(s) 6683 Edexcel GCE Statistics S1 Advanced/Advanced Subsidiary Thursday 5 June 2003 Morning Time: 1 hour 30 minutes Materials required for examination Answer Book (AB16) Graph Paper (ASG2)

More information

Simple Linear Regression

Simple Linear Regression 9-1 l Chapter 9 l Simple Linear Regression 9.1 Simple Linear Regression 9.2 Scatter Diagram 9.3 Graphical Method for Determining Regression 9.4 Least Square Method 9.5 Correlation Coefficient and Coefficient

More information

Correlation. January 11, 2018

Correlation. January 11, 2018 Correlation January 11, 2018 Contents Correlations The Scattterplot The Pearson correlation The computational raw-score formula Survey data Fun facts about r Sensitivity to outliers Spearman rank-order

More information

Chapter 4 Data with Two Variables

Chapter 4 Data with Two Variables Chapter 4 Data with Two Variables 1 Scatter Plots and Correlation and 2 Pearson s Correlation Coefficient Looking for Correlation Example Does the number of hours you watch TV per week impact your average

More information

due date: third day of class estimated time: 10 hours (for planning purposes only; work until you finish)

due date: third day of class estimated time: 10 hours (for planning purposes only; work until you finish) Honors Geometry Summer Work 2015 due date: third day of class estimated time: 10 hours (for planning purposes only; work until you finish) Dear Honors Geometry Students, This assignment is designed for

More information

Mathematics Revision Guide. Algebra. Grade C B

Mathematics Revision Guide. Algebra. Grade C B Mathematics Revision Guide Algebra Grade C B 1 y 5 x y 4 = y 9 Add powers a 3 a 4.. (1) y 10 y 7 = y 3 (y 5 ) 3 = y 15 Subtract powers Multiply powers x 4 x 9...(1) (q 3 ) 4...(1) Keep numbers without

More information

Correlation and Linear Regression

Correlation and Linear Regression Correlation and Linear Regression Correlation: Relationships between Variables So far, nearly all of our discussion of inferential statistics has focused on testing for differences between group means

More information

3.1 Events, Sample Spaces, and Probability

3.1 Events, Sample Spaces, and Probability Chapter 3 Probability Probability is the tool that allows the statistician to use sample information to make inferences about or to describe the population from which the sample was drawn. 3.1 Events,

More information

Chapter 14. Statistical versus Deterministic Relationships. Distance versus Speed. Describing Relationships: Scatterplots and Correlation

Chapter 14. Statistical versus Deterministic Relationships. Distance versus Speed. Describing Relationships: Scatterplots and Correlation Chapter 14 Describing Relationships: Scatterplots and Correlation Chapter 14 1 Statistical versus Deterministic Relationships Distance versus Speed (when travel time is constant). Income (in millions of

More information

Chapter 4 Data with Two Variables

Chapter 4 Data with Two Variables Chapter 4 Data with Two Variables 1 Scatter Plots and Correlation and 2 Pearson s Correlation Coefficient Looking for Correlation Example Does the number of hours you watch TV per week impact your average

More information

Chapter 12 - Lecture 2 Inferences about regression coefficient

Chapter 12 - Lecture 2 Inferences about regression coefficient Chapter 12 - Lecture 2 Inferences about regression coefficient April 19th, 2010 Facts about slope Test Statistic Confidence interval Hypothesis testing Test using ANOVA Table Facts about slope In previous

More information

PhysicsAndMathsTutor.com

PhysicsAndMathsTutor.com 1. A researcher claims that, at a river bend, the water gradually gets deeper as the distance from the inner bank increases. He measures the distance from the inner bank, b cm, and the depth of a river,

More information

Linear Regression. Simple linear regression model determines the relationship between one dependent variable (y) and one independent variable (x).

Linear Regression. Simple linear regression model determines the relationship between one dependent variable (y) and one independent variable (x). Linear Regression Simple linear regression model determines the relationship between one dependent variable (y) and one independent variable (x). A dependent variable is a random variable whose variation

More information

Class 11 Maths Chapter 15. Statistics

Class 11 Maths Chapter 15. Statistics 1 P a g e Class 11 Maths Chapter 15. Statistics Statistics is the Science of collection, organization, presentation, analysis and interpretation of the numerical data. Useful Terms 1. Limit of the Class

More information

Solutionbank S1 Edexcel AS and A Level Modular Mathematics

Solutionbank S1 Edexcel AS and A Level Modular Mathematics file://c:\users\buba\kaz\ouba\s1_7_a_1.html Exercise A, Question 1 An NHS trust has the following data on operations. Number of operating theatres 5 6 7 8 Number of operations carried out per day 25 30

More information

Correlation. Patrick Breheny. November 15. Descriptive statistics Inference Summary

Correlation. Patrick Breheny. November 15. Descriptive statistics Inference Summary Correlation Patrick Breheny November 15 Patrick Breheny University of Iowa Biostatistical Methods I (BIOS 5710) 1 / 21 Introduction Descriptive statistics Generally speaking, scientific questions often

More information

THE ROYAL STATISTICAL SOCIETY 2008 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE (MODULAR FORMAT) MODULE 4 LINEAR MODELS

THE ROYAL STATISTICAL SOCIETY 2008 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE (MODULAR FORMAT) MODULE 4 LINEAR MODELS THE ROYAL STATISTICAL SOCIETY 008 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE (MODULAR FORMAT) MODULE 4 LINEAR MODELS The Society provides these solutions to assist candidates preparing for the examinations

More information

Bivariate Relationships Between Variables

Bivariate Relationships Between Variables Bivariate Relationships Between Variables BUS 735: Business Decision Making and Research 1 Goals Specific goals: Detect relationships between variables. Be able to prescribe appropriate statistical methods

More information

IB Questionbank Mathematical Studies 3rd edition. Bivariate data. 179 min 172 marks

IB Questionbank Mathematical Studies 3rd edition. Bivariate data. 179 min 172 marks IB Questionbank Mathematical Studies 3rd edition Bivariate data 179 min 17 marks 1. The heat output in thermal units from burning 1 kg of wood changes according to the wood s percentage moisture content.

More information

Chapter 10. Correlation and Regression. Lecture 1 Sections:

Chapter 10. Correlation and Regression. Lecture 1 Sections: Chapter 10 Correlation and Regression Lecture 1 Sections: 10.1 10. You will now be introduced to important methods for making inferences based on sample data that come in pairs. In the previous chapter,

More information

Mathematics: applications and interpretation SL

Mathematics: applications and interpretation SL Mathematics: applications and interpretation SL Chapter 1: Approximations and error A Rounding numbers B Approximations C Errors in measurement D Absolute and percentage error The first two sections of

More information

PS2: Two Variable Statistics

PS2: Two Variable Statistics PS2: Two Variable Statistics LT2: Measuring Correlation and Line of best fit by eye. LT3: Linear regression LT4: The χ 2 test of independence. 1 Pearson's Correlation Coefficient In examinations you are

More information

Bivariate Paired Numerical Data

Bivariate Paired Numerical Data Bivariate Paired Numerical Data Pearson s correlation, Spearman s ρ and Kendall s τ, tests of independence University of California, San Diego Instructor: Ery Arias-Castro http://math.ucsd.edu/~eariasca/teaching.html

More information

Statistics Introductory Correlation

Statistics Introductory Correlation Statistics Introductory Correlation Session 10 oscardavid.barrerarodriguez@sciencespo.fr April 9, 2018 Outline 1 Statistics are not used only to describe central tendency and variability for a single variable.

More information

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1 Lecture Slides Elementary Statistics Tenth Edition and the Triola Statistics Series by Mario F. Triola Slide 1 Chapter 10 Correlation and Regression 10-1 Overview 10-2 Correlation 10-3 Regression 10-4

More information

Discovering Correlation in Data. Vinh Nguyen Research Fellow in Data Science Computing and Information Systems DMD 7.

Discovering Correlation in Data. Vinh Nguyen Research Fellow in Data Science Computing and Information Systems DMD 7. Discovering Correlation in Data Vinh Nguyen (vinh.nguyen@unimelb.edu.au) Research Fellow in Data Science Computing and Information Systems DMD 7.14 Discovering Correlation Why is correlation important?

More information

Statistics: revision

Statistics: revision NST 1B Experimental Psychology Statistics practical 5 Statistics: revision Rudolf Cardinal & Mike Aitken 29 / 30 April 2004 Department of Experimental Psychology University of Cambridge Handouts: Answers

More information

Topic 10 - Linear Regression

Topic 10 - Linear Regression Topic 10 - Linear Regression Least squares principle Hypothesis tests/confidence intervals/prediction intervals for regression 1 Linear Regression How much should you pay for a house? Would you consider

More information

Correlation and Regression

Correlation and Regression Correlation and Regression Linear Correlation: Does one variable increase or decrease linearly with another? Is there a linear relationship between two or more variables? Types of linear relationships:

More information

Math Sec 4 CST Topic 7. Statistics. i.e: Add up all values and divide by the total number of values.

Math Sec 4 CST Topic 7. Statistics. i.e: Add up all values and divide by the total number of values. Measures of Central Tendency Statistics 1) Mean: The of all data values Mean= x = x 1+x 2 +x 3 + +x n n i.e: Add up all values and divide by the total number of values. 2) Mode: Most data value 3) Median:

More information

Introduction and Descriptive Statistics p. 1 Introduction to Statistics p. 3 Statistics, Science, and Observations p. 5 Populations and Samples p.

Introduction and Descriptive Statistics p. 1 Introduction to Statistics p. 3 Statistics, Science, and Observations p. 5 Populations and Samples p. Preface p. xi Introduction and Descriptive Statistics p. 1 Introduction to Statistics p. 3 Statistics, Science, and Observations p. 5 Populations and Samples p. 6 The Scientific Method and the Design of

More information

Linear Correlation and Regression Analysis

Linear Correlation and Regression Analysis Linear Correlation and Regression Analysis Set Up the Calculator 2 nd CATALOG D arrow down DiagnosticOn ENTER ENTER SCATTER DIAGRAM Positive Linear Correlation Positive Correlation Variables will tend

More information

Dependence. MFM Practitioner Module: Risk & Asset Allocation. John Dodson. September 11, Dependence. John Dodson. Outline.

Dependence. MFM Practitioner Module: Risk & Asset Allocation. John Dodson. September 11, Dependence. John Dodson. Outline. MFM Practitioner Module: Risk & Asset Allocation September 11, 2013 Before we define dependence, it is useful to define Random variables X and Y are independent iff For all x, y. In particular, F (X,Y

More information

Correlation. Quantitative Aptitude & Business Statistics

Correlation. Quantitative Aptitude & Business Statistics Correlation Statistics Correlation Correlation is the relationship that exists between two or more variables. If two variables are related to each other in such a way that change increases a corresponding

More information

Analysing data: regression and correlation S6 and S7

Analysing data: regression and correlation S6 and S7 Basic medical statistics for clinical and experimental research Analysing data: regression and correlation S6 and S7 K. Jozwiak k.jozwiak@nki.nl 2 / 49 Correlation So far we have looked at the association

More information

Biostatistics: Correlations

Biostatistics: Correlations Biostatistics: s One of the most common errors we find in the press is the confusion between correlation and causation in scientific and health-related studies. In theory, these are easy to distinguish

More information

Algebra. Topic: Manipulate simple algebraic expressions.

Algebra. Topic: Manipulate simple algebraic expressions. 30-4-10 Algebra Days: 1 and 2 Topic: Manipulate simple algebraic expressions. You need to be able to: Use index notation and simple instances of index laws. Collect like terms Multiply a single term over

More information

Econ 300/QAC 201: Quantitative Methods in Economics/Applied Data Analysis. 12th Class 6/23/10

Econ 300/QAC 201: Quantitative Methods in Economics/Applied Data Analysis. 12th Class 6/23/10 Econ 300/QAC 201: Quantitative Methods in Economics/Applied Data Analysis 12th Class 6/23/10 In God we trust, all others must use data. --Edward Deming hand out review sheet, answer, point to old test,

More information

4/22/2010. Test 3 Review ANOVA

4/22/2010. Test 3 Review ANOVA Test 3 Review ANOVA 1 School recruiter wants to examine if there are difference between students at different class ranks in their reported intensity of school spirit. What is the factor? How many levels

More information

ALGEBRA 2 Summer Review Assignments Graphing

ALGEBRA 2 Summer Review Assignments Graphing ALGEBRA 2 Summer Review Assignments Graphing To be prepared for algebra two, and all subsequent math courses, you need to be able to accurately and efficiently find the slope of any line, be able to write

More information

8.1 Frequency Distribution, Frequency Polygon, Histogram page 326

8.1 Frequency Distribution, Frequency Polygon, Histogram page 326 page 35 8 Statistics are around us both seen and in ways that affect our lives without us knowing it. We have seen data organized into charts in magazines, books and newspapers. That s descriptive statistics!

More information

EXAMINATIONS OF THE HONG KONG STATISTICAL SOCIETY

EXAMINATIONS OF THE HONG KONG STATISTICAL SOCIETY EXAMINATIONS OF THE HONG KONG STATISTICAL SOCIETY MODULE 4 : Linear models Time allowed: One and a half hours Candidates should answer THREE questions. Each question carries 20 marks. The number of marks

More information

8th Grade Common Core Math

8th Grade Common Core Math 8th Grade Common Core Math Booklet 5 Functions Part 2 One of the Main Idea of Functions: Use functions to model relationships between quantities What are functions? Functions are like machines. You give

More information

Mathematical studies Standard level Paper 2

Mathematical studies Standard level Paper 2 Mathematical studies Standard level Paper 2 Friday 5 May 2017 (morning) 1 hour 30 minutes Instructions to candidates ydo not open this examination paper until instructed to do so. ya graphic display calculator

More information

Mathematics Level D: Lesson 2 Representations of a Line

Mathematics Level D: Lesson 2 Representations of a Line Mathematics Level D: Lesson 2 Representations of a Line Targeted Student Outcomes Students graph a line specified by a linear function. Students graph a line specified by an initial value and rate of change

More information