Gotta Keep It Correlatin

Similar documents
II. Descriptive Statistics D. Linear Correlation and Regression. 1. Linear Correlation

a is some real number (called the coefficient) other

Name Date MIDTERM REVIEW II: SYSTEM OF EQUATIONS & INEQUALITIES, FUNCTIONS, LINE REGRESSION, AND LINEAR EQUATIONS

Response Variable denoted by y it is the variable that is to be predicted measure of the outcome of an experiment also called the dependent variable

Chapter 12 Correlation

multiplies all measures of center and the standard deviation and range by k, while the variance is multiplied by k 2.

Mth 95 Notes Module 1 Spring Section 4.1- Solving Systems of Linear Equations in Two Variables by Graphing, Substitution, and Elimination

First, note that the LS residuals are orthogonal to the regressors. X Xb X y = 0 ( normal equations ; (k 1) ) So,

CALCULUS BASIC SUMMER REVIEW

Regression, Inference, and Model Building

Essential Question How can you recognize an arithmetic sequence from its graph?

Worksheet 23 ( ) Introduction to Simple Linear Regression (continued)

ST 305: Exam 3 ( ) = P(A)P(B A) ( ) = P(A) + P(B) ( ) = 1 P( A) ( ) = P(A) P(B) σ X 2 = σ a+bx. σ ˆp. σ X +Y. σ X Y. σ X. σ Y. σ n.

Continuous Functions

Chapter 4 - Summarizing Numerical Data

Dr. Maddah ENMG 617 EM Statistics 11/26/12. Multiple Regression (2) (Chapter 15, Hines)

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY

Linear Regression Analysis. Analysis of paired data and using a given value of one variable to predict the value of the other

7.1 Finding Rational Solutions of Polynomial Equations

STP 226 ELEMENTARY STATISTICS

Paired Data and Linear Correlation

Correlation Regression

3/3/2014. CDS M Phil Econometrics. Types of Relationships. Types of Relationships. Types of Relationships. Vijayamohanan Pillai N.

Lyman Memorial High School. Honors Pre-Calculus Prerequisite Packet. Name:

We will conclude the chapter with the study a few methods and techniques which are useful

11 Correlation and Regression

MATH CALCULUS II Objectives and Notes for Test 4

Chapter If n is odd, the median is the exact middle number If n is even, the median is the average of the two middle numbers

P.3 Polynomials and Special products

Regression, Part I. A) Correlation describes the relationship between two variables, where neither is independent or a predictor.

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics

Northwest High School s Algebra 2/Honors Algebra 2 Summer Review Packet

STA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to:

( ) 2 + k The vertex is ( h, k) ( )( x q) The x-intercepts are x = p and x = q.

Correlation and Covariance

STP 226 EXAMPLE EXAM #1

S Y Y = ΣY 2 n. Using the above expressions, the correlation coefficient is. r = SXX S Y Y

DAWSON COLLEGE DEPARTMENT OF MATHEMATICS 201-BZS-05 PROBABILITY AND STATISTICS FALL 2015 FINAL EXAM

G r a d e 1 1 P r e - C a l c u l u s M a t h e m a t i c s ( 3 0 S )

Summary: CORRELATION & LINEAR REGRESSION. GC. Students are advised to refer to lecture notes for the GC operations to obtain scatter diagram.

Final Examination Solutions 17/6/2010

University of California, Los Angeles Department of Statistics. Simple regression analysis

Open book and notes. 120 minutes. Cover page and six pages of exam. No calculators.

Continuous Data that can take on any real number (time/length) based on sample data. Categorical data can only be named or categorised

Linear Regression Models

Least-Squares Regression

Ismor Fischer, 1/11/

Exponential and Trigonometric Functions Lesson #1

U8L1: Sec Equations of Lines in R 2

Topic 9: Sampling Distributions of Estimators

Algebra of Least Squares

Because it tests for differences between multiple pairs of means in one test, it is called an omnibus test.

Tennessee Department of Education

Statistical Properties of OLS estimators

TMA4245 Statistics. Corrected 30 May and 4 June Norwegian University of Science and Technology Department of Mathematical Sciences.

Understanding Dissimilarity Among Samples

6.3 Testing Series With Positive Terms

Dept. of maths, MJ College.

Number of fatalities X Sunday 4 Monday 6 Tuesday 2 Wednesday 0 Thursday 3 Friday 5 Saturday 8 Total 28. Day

Bivariate Sample Statistics Geog 210C Introduction to Spatial Data Analysis. Chris Funk. Lecture 7

4755 Mark Scheme June Question Answer Marks Guidance M1* Attempt to find M or 108M -1 M 108 M1 A1 [6] M1 A1

Overview. p 2. Chapter 9. Pooled Estimate of. q = 1 p. Notation for Two Proportions. Inferences about Two Proportions. Assumptions

MATHEMATICS: PAPER III (LO 3 AND LO 4) PLEASE READ THE FOLLOWING INSTRUCTIONS CAREFULLY

Economics 250 Assignment 1 Suggested Answers. 1. We have the following data set on the lengths (in minutes) of a sample of long-distance phone calls

Lecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting

Academic. Grade 9 Assessment of Mathematics. Released assessment Questions

Lecture 11 Simple Linear Regression

Most text will write ordinary derivatives using either Leibniz notation 2 3. y + 5y= e and y y. xx tt t

Introduction to Signals and Systems, Part V: Lecture Summary

The Method of Least Squares. To understand least squares fitting of data.

1 Inferential Methods for Correlation and Regression Analysis

MA Lesson 26 Notes Graphs of Rational Functions (Asymptotes) Limits at infinity

NATIONAL SENIOR CERTIFICATE EXAMINATION MATHEMATICS P2 SEPTEMBER 2016 GRADE 12. This question paper consists of 13 pages including the formula sheet

Pearson Edexcel Level 3 Advanced Subsidiary and Advanced GCE in Statistics

Lecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting

Measures of Spread: Standard Deviation

U8L1: Sec Equations of Lines in R 2

Polynomial Functions and Their Graphs

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics

SIMPLE LINEAR REGRESSION AND CORRELATION ANALYSIS

24.1 Confidence Intervals and Margins of Error

LESSON 2: SIMPLIFYING RADICALS

AP CALCULUS AB 2003 SCORING GUIDELINES (Form B)

Algebra II Notes Unit Seven: Powers, Roots, and Radicals

3.2 Properties of Division 3.3 Zeros of Polynomials 3.4 Complex and Rational Zeros of Polynomials

Assessment and Modeling of Forests. FR 4218 Spring Assignment 1 Solutions

REGRESSION (Physics 1210 Notes, Partial Modified Appendix A)

GRADE 12 SEPTEMBER 2012 MATHEMATICS P2

INSTRUCTIONS (A) 1.22 (B) 0.74 (C) 4.93 (D) 1.18 (E) 2.43

Chapter Vectors

Data Analysis and Statistical Methods Statistics 651

Math 105: Review for Final Exam, Part II - SOLUTIONS

MATHEMATICS 9740 (HIGHER 2)

Section 14. Simple linear regression.

In exercises 1 and 2, (a) write the repeating decimal as a geometric series and (b) write its sum as the ratio of two integers _

Unit 4: Polynomial and Rational Functions

Solutions to Odd Numbered End of Chapter Exercises: Chapter 4

Chapter 2 Feedback Control Theory Continued

Stat 139 Homework 7 Solutions, Fall 2015

NATIONAL SENIOR CERTIFICATE GRADE 12

Transcription:

Gotta Keep It Correlati Correlatio.2 Learig Goals I this lesso, ou will: Determie the correlatio coefficiet usig a formula. Iterpret the correlatio coefficiet for a set of data. ew Stud Liks Dark Chocolate to Heart Health. Video Games Show to NBoost I.Q. College Graduates Live Loger, New Stud Fids. You have probabl see or heard headlies similar to these i magazies, o TV, ad olie. Each oe of these headlies is the result of a correlatioal stud. I a correlatioal stud, researchers compare two variables to see how the are associated. The do this through the use of surves or eve b researchig documets such as medical records. 2012 Caregie Learig What methods do ou thik researchers could have used to produce the results metioed i the headlies above? 533

Problem 1 Associate, Formulate, Correlate! Recall that data comparig two variables ca show a positive associatio, a egative associatio, or o associatio. 1. Describe the tpe of associatio betwee the idepedet ad depedet variables show o each scatterplot. The, draw a lie of best fit for each, if possible. a. Miles per Gallo Weight of Vehicle b. c. Height Grades o Algebra Test IQ Score Time Spet Studig 2012 Caregie Learig 534 Chapter Correlatio ad Residuals

A measure of how well a liear regressio lie fits a set of data is called correlatio. The correlatio coefficiet is a value betwee 21 ad 1 which idicates how close the data are to formig a straight lie. The closer the correlatio coefficiet is to 1 or 21, the stroger the liear relatioship is betwee the two variables. The variable r is used to represet the correlatio coefficiet. I remember that the correlatio coefficiet either falls betwee 1 ad 0 if the data show a egative associatio, or betwee 0 ad 1 if the data show a positive associatio. 2. Determie whether the poits i each scatter plot have a positive correlatio, a egative correlatio, or o correlatio. Four possible r-values are give. Circle the r-value ou thik is most appropriate. Eplai our reasoig for each. a. 8 7 r 5 0. r 5 20. r 5 0.0 r 5 20.0 6 5 4 3 2 1 0 1 2 3 4 5 6 7 8 The closer the r-value gets to 0, the less of a liear relatioship there is i the data! 2012 Caregie Learig b. 8 7 6 5 4 3 2 r 5 0.7 r 5 20.7 r 5 0.07 r 5 20.07 1 0 1 2 3 4 5 6 7 8.2 Correlatio 535

c. 8 7 6 5 4 3 2 1 0 1 2 3 4 5 6 7 8 r 5 1 r 5 0.5 r 5 0.01 You ca calculate the correlatio coefficiet of a data set usig this formula: ( i 2 )( i ) r 5 ( i 2 ) 2 ( i ) 2 i51 i51 Most of the pieces of this formula look familiar. I thik we used them i the formula for stadard deviatio! Let s determie the correlatio coefficiet of this data set usig the formula. (23, 23), (1, 2) ad (3, 4) Look at the umerator of the formula first. ( i 2 )( i 2 ) Determie the mea of the -values ad the mea of the -values. 5 1 3 5 1 Keep i mid that the otatio just tells ou that ou will be determiig the sum of all the data values. 2012 Caregie Learig 536 Chapter Correlatio ad Residuals

Notice these differeces are used throughout the formula. Determie the differece betwee each data value ad the mea for both the -coordiates ad the -coordiates. Determie the product of the differeces i each pair. The, determie the sum of those products. This is our umerator. ( i 2 ) ( 23 2 1 3 ) ( 1 2 1 3 ) 5 2 3 ( 3 2 1 3 ) 5 8 3 10 5 2 3 ( i 2 )( i ) ( i 2 ) (23 2 1) 5 24 (2 2 1) 5 1 (4 2 1) 5 3 ( 2 10 3? 24 )5 40 3 ( 2 3? 1 ) 5 2 3 40 3 1 2 1 8 5 22 3 ( 8 3? 3 ) 5 8 Now let s aalze the deomiator of the formula. ( i 2 ) 2 ( i ) 2 Determie the sum of the squares of the differeces betwee each value ad its mea. ( i 2 ) 2 ( 2 10 3 ) 2 5 100 ( 2 3 ) 2 5 4 ( 8 3 ) 2 5 64 5 56 3 ( i ) 2 (24) 2 5 16 (1) 2 5 1 5 26 (3) 2 5 2012 Caregie Learig Determie the square root of each sum. Determie the product of these two values. This is our deomiator. ( i 2 ) 2 ( i 2 ) 2 56 3 4.32 26 5.0 ( i 2 ) 2 ( i ) 2 (4.32)(5.0) 5 22.02768.2 Correlatio 537

3. Put the pieces together. Determie the correlatio coefficiet of the data set. 4. Iterpret the correlatio coefficiet of the data set. Problem 2 The Doctor Will See You Now The Ceter for Disease Cotrol collected data o the percet of childre, aged 12 to 1, that were cosidered obese betwee the ears 171 ad 2007. The data are give i the table. Year Percet of Obese Childre 171 6.4 What do ou otice as ou read through the data? 176 5.0 188 10.5 1 14.8 2001 16.7 2003 17.4 2005 17.8 2012 Caregie Learig 2007 18.1 538 Chapter Correlatio ad Residuals

1. Idetif the idepedet ad depedet quatities i this problem situatio. 2. Costruct a scatter plot of the data usig our graphig calculator. a. Sketch the scatter plot. Label the aes. b. Do ou thik a liear regressio equatio would best describe this situatio? Eplai our reasoig. 3. Use a graphig calculator to determie whether a lie of best fit is appropriate for these data. a. Determie ad iterpret the liear regressio equatio. Wait! There s a r ad a r 2 value o m calculator. Which oe do I use? 2012 Caregie Learig b. Determie the correlatio coefficiet. c. Would a lie of best fit be appropriate for this data set? Eplai our reasoig..2 Correlatio 53

4. The amout of atibiotic that remais i our bod over a period of time varies from oe drug to the et. The table give shows the amout of Atibiotic X that remais i our bod over a period of two das. Time (hours) 0 6 12 18 24 30 36 42 48 Amout of Atibiotic X i Bod (mg) 60 36 22 13 7.8 4.7 2.8 1.7 1 a. Determie ad iterpret a liear regressio equatio for this data set. b. Determie ad iterpret the correlatio coefficiet of this data set. c. Does it seem appropriate to use a lie of best fit? If o, eplai our reasoig. If es, determie ad iterpret the least squares regressio equatio. d. Sketch a scatter plot of the data. Amout of Atibiotic X i the Bod (mg) 0 80 70 60 50 40 30 20 10 0 5 10 15 20 25 30 35 40 45 Time (hours) e. Look at the graph of the data. Do ou still agree with our aswer to part (c)? Eplai our reasoig. 2012 Caregie Learig Be prepared to share our solutios ad methods. 540 Chapter Correlatio ad Residuals