Chapter 3. Introduction to Linear Correlation and Regression Part 3

Size: px
Start display at page:

Download "Chapter 3. Introduction to Linear Correlation and Regression Part 3"

Transcription

1 Tuesday, December 12, 2000 Ch3 Intro Correlation Pt 3 Page: 1 Richard Lowry, All rights reserved. Chapter 3. Introduction to Linear Correlation and Regression Part 3 Regression The appearance of the term regression at this point (literally, backward movement) is something of an historical accident. It could just as easily have been called progression. The basic concept is the same we found for correlation, though now it has added into it the visual imagery of movement essentially, of two things, two variables, moving together. As indicated earlier, correlation and regression are two sides of the same statistical coin. When you measure the linear correlation of two variables, what you are in effect doing is laying out a straight line that best fits the average "together-movement" of these two variables. That line is spoken of as the line of regression, and its utility is not only as a device for helping us to visualize the relationship between the two variables. It can also serve very usefully as a basis for making rational predictions. To illustrate, consider again our 1993 SAT correlation. Assuming that the negative correlation for that year is likely to occur in subsequent years, you are now in a position to predict a state's average SAT score for some subsequent year, before the results are reported, simply on the basis of knowing the percentage of students within the state who take the SAT that year. If 10% of the high school seniors within a state take the SAT, it is a fairly safe bet that the average combined SAT score for that state will be somewhere in the vicinity of 1,010 perhaps a bit higher or lower, but in any event somewhere in the vicinity. If 70% of the high school seniors in some other state take the SAT, it is a fairly safe bet that the average for that state will be nowhere near 1,010, but rather somewhere in the vicinity of 880. Regression analysis provides a rational foundation for making such

2 Tuesday, December 12, 2000 Ch3 Intro Correlation Pt 3 Page: 2 predictions; it also provides a basis for specifying precisely what we mean by "somewhere in the vicinity." As we noted earlier, when you perform the computational procedures for linear correlation and regression, what you are essentially doing is defining the straight line that best fits the bivariate distribution of data points. The criterion for "best fit" is that the sum of the squared vertical distances between the data points and the regression line must be as small as possible. The slant of the resulting line will correspond to the direction of correlation (upward, +; downward, ); and the tightness of the data points around the line will correspond to the strength of the correlation. You can think of the regression line as representing the average relationship that exists between X and Y, as observed within this particular sample. The location and orientation of the regression line are defined by two quantities, spoken of as regression constants, that can be easily derived from the results of calculations already performed in Table 3.2. These are a = the point at which the line crosses the Y axis (the 'intercept'); and b = the rate at which the line angles upward or downward along the X axis (the 'slope'). The computational formulas for these two quantities are quite simple and can be introduced without elaborate comment: For the slope: b = SC XY SS X and for the intercept: a = M Y bm X Before we perform these calculations for the SAT data, I think it might be useful to illustrate the process with a simpler data set. For this purpose, consider yet again the pairing of X i and values that produced the positive correlation shown in Example II of Figure 3.3.

3 Tuesday, December 12, 2000 Ch3 Intro Correlation Pt 3 Page: 3 Pair a b c d e f X i means SS X = 17.5 SS Y = 70.0 SC XY = 23.0 Given these previously calculated values: slope: b = = SC XY SS X = intercept: a = M Y bm X = 7.0 [1.31(3.5)] = 2.4 In the following graph I show the same figure that appears above, but now constructed in such a way as to emphasize the intercept and slope of the regression line. The intercept, shown on the left-hand side of the graph, is the point at which the regression line crosses the vertical Y axis providing that the Y axis is lined up with the point on the horizontal axis where X is equal to zero. (Be careful with this, because scatter plots do not always begin the X axis at X = 0.) The slope of the regression line is indicated by the green pattern in the graph that looks like a flight of stairs. What this pattern shows is that for each increase of one unit in the value of X, the value of ncreases by 1.31 units. Thus, when X is equal to zero, s equal to the intercept, which is 2.4; when X = 1.0, s equal to the intercept plus (+) 1.31 (i.e., = 3.71); when X = 2.0, s equal to the intercept plus 2.62 (i.e., = 5.02); and so on.

4 Tuesday, December 12, 2000 Ch3 Intro Correlation Pt 3 Page: 4 Now we perform the same calculations for the data set of our 1993 SAT correlation. In Table 3.2 we have already arrived at the summary values mean of X = mean of Y = SS X = 36, SS Y = 231, SC XY = X = percentage of high school seniors taking the SAT Y = state average combined SAT score Given these values, the slope of the regression line can be calculated as b = = SC XY SS X , = 2.17 and the intercept as a = M Y bm X = [ 2.17(36.32)] = For this data set, the regression line intercepts the vertical axis at the point where Y is equal to , and then slants downward ( ) 2.17 units of Y for each unit of X. Thus, when X is equal to zero, s equal to ; when X = 10, s equal to the intercept minus 2.17x10 (i.e., = ); when X = 20, s equal to the intercept minus 2.17x20 (i.e., = ); and so on.

5 Tuesday, December 12, 2000 Ch3 Intro Correlation Pt 3 Page: 5 These are the mechanics of regression in a nutshell; now to the logic and strategy of prediction. If the observed correlation between two variables, X and Y, proves to be statistically significant unlikely to have occurred through mere chance coincidence the rational presumption is that it pertains not just to this particular sample of X i pairs, but to the relationship between X and n general. And once you know the relationship between X and n general, you are then in a position to figure out the value of that is likely to be associated with any particular newly observed value of X i. The procedure for making such a prediction is illustrated pictorially below. From the observed correlation in this 1993 sample, we infer that the general relationship between X and Y can be described by a regression line that has an intercept of a = 1, and a slope of b = Suppose, now, that for some subsequent year a certain state has X i = 10% of its high school seniors taking the SAT. If you wanted to predict, the average SAT score for that state, the obvious way to proceed would be to start with the observed value of X i = 10%, go straight up to the line of regression, and then turn left to see where you end up on the Y axis. That will be your predicted value of, which as you can see from the graph is

6 Tuesday, December 12, 2000 Ch3 Intro Correlation Pt 3 Page: 6 something quite close to Y = 1,010. For X i = 50%, on the other hand, the predicted value is in the vicinity of Y = 925. In practice, of course, the predicted values of are not arrived at graphically, but through calculation. For any particular observed linear correlation between two variables, X and Y, the value of to be predicted on the basis of a newly observed value of X i is given by the following formula. Please note, however, that this version of the formula is only preliminary. There is something we will need to add to it a bit later. Try this formula out with a few different values of X i and you will see that it is arriving mathematically, hence more precisely, at the same result that would be reached through the graphical method shown above. The formula does it by starting at a, the point at which the regression line intercepts the Y axis, and then moving up or down the Y axis (depending on the direction of the correlation) one unit of slope (b) for each unit of X. for X i = 10% = 1, ( 2.17 x 10) and for X i = 50% = 1, = 1, ( 2.17 x 50) = Now we are of course not claiming for either of these cases that the actual values of will fall precisely at the points we have calculated. All we can rationally assert is that actual values of for the case where X i = 10% will tend to approximate the predicted regression-line value of 1,009.65; that actual values of for the case where X i = 50% will tend to approximate the predicted regression-line value of ; and so on for any other values of X i that fall within the range of X i values observed within the sample. It will probably be intuitively obvious to you that the strength of this "tendency to approximate" will be determined by the strength of the correlation observed within the original sample: The stronger the observed correlation, the more closely the actual values of will tend to approximate their predicted values; and conversely, the weaker the correlation, the greater will be the tendency of the actual values of to deviate from their predicted values. A moment ago I indicated that the formula for a predicted value of

7 Tuesday, December 12, 2000 Ch3 Intro Correlation Pt 3 Page: 7 needs to have something added to it. What needs to be added is a measure of probable error, something that reflects the strength of the observed correlation, hence the strength of the tendency for actual values of to approximate their predicted values. Although the full conceptual background for this step will not be available until we have covered some basic concepts of probability, it is possible at this point to convey at least a practical working knowledge of it. Within the context of linear regression, the measure of probable error is a quantity spoken of as the standard error of estimate. Essentially, it is a kind of standard deviation. Here again is the scatter plot for the 1993 SAT correlation. In your mind's eye, please try to envision a green line extending straight up or straight down from each of the blue data points to the red regression line. Each of these imaginary green lines is a measure of the degree to which the associated data point deviates (along the Y axis) from the regression line. Square each of these distances, then take the sum of those squares, and you will have a sum of squared deviates. In statistical parlance, each deviate (the imaginary green line) is spoken of as a residual, so the sum of their squares can be denoted as the sum of squared residuals, which we will abbreviate as SS residual. At any rate, divide this sum of squared deviates (residuals) by N, and you will have a variance. Take the square root of that variance, and you will have a standard deviation. As it happens, the sum of squared residuals can be arrived at mathematically through the simple formula SS residual = SS Y x (1 r 2 ) Recall that r 2 is the proportion of variability in Y that is associated with variability in X, and that 1 r 2 is the proportion (residual) that is not associated with variability in X. Multiplying SS Y by 1 r 2 therefore gives you the proportion of SS Y that is residual, "left over," not accounted for by the correlation between X and Y.

8 Tuesday, December 12, 2000 Ch3 Intro Correlation Pt 3 Page: 8 For the 1993 SAT example, this yield SS residual = 231, x ( ) = 60, Divide this quantity by N, and you will have the residual variance of Y: 60,184.38/50 = 1, Take the square root of it, and you will have the standard deviation of the residuals: sqrt[1,203.69] = ±34.69 This standard deviation of the residuals is almost, but not quite, equivalent to the standard error of estimate. The difference is that the quantity we have just calculated is purely descriptive it pertains only to this particular sample of paired X i values whereas the standard error of estimate aims to reach beyond the sample into the realm of events as yet unobserved. This extension from the particular sample of X i pairs to the relationship between X and n general is achieved through the simple expedient of dividing SS residual by N 2 rather than by N. The rationale for this N 2 denominator will have to wait until a later chapter. For the moment, suffice it to say that the standard error of estimate, which we will abbreviate as SE, is given by the formula SE = sqrt[(ss residual ) / (N 2)] For the present example, our standard error of estimate is therefore SE = sqrt[60, / (50 2)] = ±35.41 In brief: On the basis of what we have observed within our sample of X i pairs, we estimate that if the regression line of the sample were to be applied to the entire population of pairs, the Y residuals of the population would have a standard deviation somewhere very close to ± The next version of the SAT scatter plot shows how all of this applies to the task of prediction. A parallel line drawn units of Y above the regression line will give you +1 standard error of estimate; one drawn units of Y below the regression line will give 1 standard error of estimate; and the inference (details in a later chapter) is that the range between +1 SE and 1 SE will include approximately two-thirds of all the X i pairs within the population. Thus, when you predict an unknown value of according to the formula

9 Tuesday, December 12, 2000 Ch3 Intro Correlation Pt 3 Page: 9 the true value of has about a two-thirds chance of falling within plus-or-minus points of your predicted value, that is, within plus-or-minus 1 standard error of estimate. In making predictions of this type, the convention is to state the predicted value not simply as but rather as 'predicted Y' plus-or-minus 1 standard error of estimate. That is ±SE Thus, our predicted state average SAT scores for the cases where 10% and 50% of a state's high school seniors take the test are, in their full form for X i = 10% = 1, ( 2.17 x 10)±35.41 and for X i = 50% = 1,009.65±35.41 = 1, ( 2.17 x 50)±35.41 = ±35.41 That is, for X i = 10% we predict that the corresponding value of has a two-thirds chance of falling between Y = and Y = 1,045.06; for X i = 50%, we predict that the corresponding value of has a two-thirds chance of falling between Y = and Y = ; and so on. Providing that the sample is adequately representative of the relationship between X and n general, we can expect approximately two-thirds of the entire 'population' of X i pairs to fall within the range defined by plus-or-minus 1 standard error of estimate, and only about one-third to fall outside that range. Hence, any particular prediction of the general form ±SE will have about a two-thirds chance of catching the true value of in its net and only a one-third chance of missing it. Another way of expressing this concept is in terms of confidence. For a linear-regression prediction of this general form, you can be about two-thirds confident that the true value of will fall within ±1SE of the predicted value. In a later chapter we will examine procedures by which you can increase the confidence you might have in an estimate or a prediction to much higher levels such as 95% or 99%.

10 Tuesday, December 12, 2000 Ch3 Intro Correlation Pt 3 Page: 10 But the proof, as they say, is in the pudding. If you examine the SAT data for any testing year subsequent to 1993, you will find that about two-thirds of the actual values of do in fact fall within the range defined by the regression line of the 1993 sample, plus-or-minus 1SE. Hence any particular prediction of the form ±SE would have had about a two-thirds chance of falling within the net. In Part 2 of this chapter we noted briefly that the first question to be asked of an observed correlation is whether it comes from anything other than mere chance coincidence. It is now time to take that question up in greater depth; however, as it is a question whose implications extend far beyond the confines of correlation and regression, we will make it a separate chapter. *Note, however, that Chapter 3 also has two subchapters examining a couple of aspects of correlation not covered in the main body of the chapter. End of Chapter 3. Return to Top of Chapter 3, Part 3 Go to Subchapter 3a [Partial Correlation] Go to Subchapter 3b [Rank-Order Correlation] Go to Chapter 4 [A First Glance at the Question of Statistical Significance] Home Click this link only if the present page does not appear in a frameset headed by the logo Concepts and Applications of Inferential Statistics

Chapter 14. One-Way Analysis of Variance for Independent Samples Part 2

Chapter 14. One-Way Analysis of Variance for Independent Samples Part 2 Tuesday, December 12, 2000 One-Way ANOVA: Independent Samples: II Page: 1 Richard Lowry, 1999-2000 All rights reserved. Chapter 14. One-Way Analysis of Variance for Independent Samples Part 2 For the items

More information

Intro to Linear Regression

Intro to Linear Regression Intro to Linear Regression Introduction to Regression Regression is a statistical procedure for modeling the relationship among variables to predict the value of a dependent variable from one or more predictor

More information

9 Correlation and Regression

9 Correlation and Regression 9 Correlation and Regression SW, Chapter 12. Suppose we select n = 10 persons from the population of college seniors who plan to take the MCAT exam. Each takes the test, is coached, and then retakes the

More information

Intro to Linear Regression

Intro to Linear Regression Intro to Linear Regression Introduction to Regression Regression is a statistical procedure for modeling the relationship among variables to predict the value of a dependent variable from one or more predictor

More information

t-test for b Copyright 2000 Tom Malloy. All rights reserved. Regression

t-test for b Copyright 2000 Tom Malloy. All rights reserved. Regression t-test for b Copyright 2000 Tom Malloy. All rights reserved. Regression Recall, back some time ago, we used a descriptive statistic which allowed us to draw the best fit line through a scatter plot. We

More information

Correlation and Linear Regression

Correlation and Linear Regression Correlation and Linear Regression Correlation: Relationships between Variables So far, nearly all of our discussion of inferential statistics has focused on testing for differences between group means

More information

Do not copy, post, or distribute

Do not copy, post, or distribute 14 CORRELATION ANALYSIS AND LINEAR REGRESSION Assessing the Covariability of Two Quantitative Properties 14.0 LEARNING OBJECTIVES In this chapter, we discuss two related techniques for assessing a possible

More information

Important note: Transcripts are not substitutes for textbook assignments. 1

Important note: Transcripts are not substitutes for textbook assignments. 1 In this lesson we will cover correlation and regression, two really common statistical analyses for quantitative (or continuous) data. Specially we will review how to organize the data, the importance

More information

Week 8: Correlation and Regression

Week 8: Correlation and Regression Health Sciences M.Sc. Programme Applied Biostatistics Week 8: Correlation and Regression The correlation coefficient Correlation coefficients are used to measure the strength of the relationship or association

More information

BIOSTATISTICS NURS 3324

BIOSTATISTICS NURS 3324 Simple Linear Regression and Correlation Introduction Previously, our attention has been focused on one variable which we designated by x. Frequently, it is desirable to learn something about the relationship

More information

Lecture 18: Simple Linear Regression

Lecture 18: Simple Linear Regression Lecture 18: Simple Linear Regression BIOS 553 Department of Biostatistics University of Michigan Fall 2004 The Correlation Coefficient: r The correlation coefficient (r) is a number that measures the strength

More information

Chapter 2: simple regression model

Chapter 2: simple regression model Chapter 2: simple regression model Goal: understand how to estimate and more importantly interpret the simple regression Reading: chapter 2 of the textbook Advice: this chapter is foundation of econometrics.

More information

Relationships between variables. Visualizing Bivariate Distributions: Scatter Plots

Relationships between variables. Visualizing Bivariate Distributions: Scatter Plots SFBS Course Notes Part 7: Correlation Bivariate relationships (p. 1) Linear transformations (p. 3) Pearson r : Measuring a relationship (p. 5) Interpretation of correlations (p. 10) Relationships between

More information

regression analysis is a type of inferential statistics which tells us whether relationships between two or more variables exist

regression analysis is a type of inferential statistics which tells us whether relationships between two or more variables exist regression analysis is a type of inferential statistics which tells us whether relationships between two or more variables exist sales $ (y - dependent variable) advertising $ (x - independent variable)

More information

Error Correcting Codes Prof. Dr. P. Vijay Kumar Department of Electrical Communication Engineering Indian Institute of Science, Bangalore

Error Correcting Codes Prof. Dr. P. Vijay Kumar Department of Electrical Communication Engineering Indian Institute of Science, Bangalore (Refer Slide Time: 00:15) Error Correcting Codes Prof. Dr. P. Vijay Kumar Department of Electrical Communication Engineering Indian Institute of Science, Bangalore Lecture No. # 03 Mathematical Preliminaries:

More information

Review of Statistics 101

Review of Statistics 101 Review of Statistics 101 We review some important themes from the course 1. Introduction Statistics- Set of methods for collecting/analyzing data (the art and science of learning from data). Provides methods

More information

WISE Regression/Correlation Interactive Lab. Introduction to the WISE Correlation/Regression Applet

WISE Regression/Correlation Interactive Lab. Introduction to the WISE Correlation/Regression Applet WISE Regression/Correlation Interactive Lab Introduction to the WISE Correlation/Regression Applet This tutorial focuses on the logic of regression analysis with special attention given to variance components.

More information

Probability and Statistics

Probability and Statistics Probability and Statistics Kristel Van Steen, PhD 2 Montefiore Institute - Systems and Modeling GIGA - Bioinformatics ULg kristel.vansteen@ulg.ac.be CHAPTER 4: IT IS ALL ABOUT DATA 4a - 1 CHAPTER 4: IT

More information

INTRODUCTION TO ANALYSIS OF VARIANCE

INTRODUCTION TO ANALYSIS OF VARIANCE CHAPTER 22 INTRODUCTION TO ANALYSIS OF VARIANCE Chapter 18 on inferences about population means illustrated two hypothesis testing situations: for one population mean and for the difference between two

More information

Sampling. What is the purpose of sampling: Sampling Terms. Sampling and Sampling Distributions

Sampling. What is the purpose of sampling: Sampling Terms. Sampling and Sampling Distributions Sampling and Sampling Distributions Normal Distribution Aims of Sampling Basic Principles of Probability Types of Random Samples Sampling Distributions Sampling Distribution of the Mean Standard Error

More information

Two-Sample Inferential Statistics

Two-Sample Inferential Statistics The t Test for Two Independent Samples 1 Two-Sample Inferential Statistics In an experiment there are two or more conditions One condition is often called the control condition in which the treatment is

More information

Suppose we have the set of all real numbers, R, and two operations, +, and *. Then the following are assumed to be true.

Suppose we have the set of all real numbers, R, and two operations, +, and *. Then the following are assumed to be true. Algebra Review In this appendix, a review of algebra skills will be provided. Students sometimes think that there are tricks needed to do algebra. Rather, algebra is a set of rules about what one may and

More information

1 A Review of Correlation and Regression

1 A Review of Correlation and Regression 1 A Review of Correlation and Regression SW, Chapter 12 Suppose we select n = 10 persons from the population of college seniors who plan to take the MCAT exam. Each takes the test, is coached, and then

More information

Correlation. We don't consider one variable independent and the other dependent. Does x go up as y goes up? Does x go down as y goes up?

Correlation. We don't consider one variable independent and the other dependent. Does x go up as y goes up? Does x go down as y goes up? Comment: notes are adapted from BIOL 214/312. I. Correlation. Correlation A) Correlation is used when we want to examine the relationship of two continuous variables. We are not interested in prediction.

More information

5 Systems of Equations

5 Systems of Equations Systems of Equations Concepts: Solutions to Systems of Equations-Graphically and Algebraically Solving Systems - Substitution Method Solving Systems - Elimination Method Using -Dimensional Graphs to Approximate

More information

Chapter 11. Correlation and Regression

Chapter 11. Correlation and Regression Chapter 11. Correlation and Regression The word correlation is used in everyday life to denote some form of association. We might say that we have noticed a correlation between foggy days and attacks of

More information

WEIGHTED LEAST SQUARES. Model Assumptions for Weighted Least Squares: Recall: We can fit least squares estimates just assuming a linear mean function.

WEIGHTED LEAST SQUARES. Model Assumptions for Weighted Least Squares: Recall: We can fit least squares estimates just assuming a linear mean function. 1 2 WEIGHTED LEAST SQUARES Recall: We can fit least squares estimates just assuming a linear mean function. Without the constant variance assumption, we can still conclude that the coefficient estimators

More information

Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras

Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras Lecture - 39 Regression Analysis Hello and welcome to the course on Biostatistics

More information

Practical Algebra. A Step-by-step Approach. Brought to you by Softmath, producers of Algebrator Software

Practical Algebra. A Step-by-step Approach. Brought to you by Softmath, producers of Algebrator Software Practical Algebra A Step-by-step Approach Brought to you by Softmath, producers of Algebrator Software 2 Algebra e-book Table of Contents Chapter 1 Algebraic expressions 5 1 Collecting... like terms 5

More information

Section Linear Correlation and Regression. Copyright 2013, 2010, 2007, Pearson, Education, Inc.

Section Linear Correlation and Regression. Copyright 2013, 2010, 2007, Pearson, Education, Inc. Section 13.7 Linear Correlation and Regression What You Will Learn Linear Correlation Scatter Diagram Linear Regression Least Squares Line 13.7-2 Linear Correlation Linear correlation is used to determine

More information

9. Linear Regression and Correlation

9. Linear Regression and Correlation 9. Linear Regression and Correlation Data: y a quantitative response variable x a quantitative explanatory variable (Chap. 8: Recall that both variables were categorical) For example, y = annual income,

More information

Modern Algebra Prof. Manindra Agrawal Department of Computer Science and Engineering Indian Institute of Technology, Kanpur

Modern Algebra Prof. Manindra Agrawal Department of Computer Science and Engineering Indian Institute of Technology, Kanpur Modern Algebra Prof. Manindra Agrawal Department of Computer Science and Engineering Indian Institute of Technology, Kanpur Lecture 02 Groups: Subgroups and homomorphism (Refer Slide Time: 00:13) We looked

More information

Introduction to Error Analysis

Introduction to Error Analysis Introduction to Error Analysis This is a brief and incomplete discussion of error analysis. It is incomplete out of necessity; there are many books devoted entirely to the subject, and we cannot hope to

More information

Linear Regression. Simple linear regression model determines the relationship between one dependent variable (y) and one independent variable (x).

Linear Regression. Simple linear regression model determines the relationship between one dependent variable (y) and one independent variable (x). Linear Regression Simple linear regression model determines the relationship between one dependent variable (y) and one independent variable (x). A dependent variable is a random variable whose variation

More information

10. Alternative case influence statistics

10. Alternative case influence statistics 10. Alternative case influence statistics a. Alternative to D i : dffits i (and others) b. Alternative to studres i : externally-studentized residual c. Suggestion: use whatever is convenient with the

More information

Business Statistics. Lecture 9: Simple Regression

Business Statistics. Lecture 9: Simple Regression Business Statistics Lecture 9: Simple Regression 1 On to Model Building! Up to now, class was about descriptive and inferential statistics Numerical and graphical summaries of data Confidence intervals

More information

The role of multiple representations in the understanding of ideal gas problems Madden S. P., Jones L. L. and Rahm J.

The role of multiple representations in the understanding of ideal gas problems Madden S. P., Jones L. L. and Rahm J. www.rsc.org/ xxxxxx XXXXXXXX The role of multiple representations in the understanding of ideal gas problems Madden S. P., Jones L. L. and Rahm J. Published in Chemistry Education Research and Practice,

More information

Algebra 1 Khan Academy Video Correlations By SpringBoard Activity and Learning Target

Algebra 1 Khan Academy Video Correlations By SpringBoard Activity and Learning Target Algebra 1 Khan Academy Video Correlations By SpringBoard Activity and Learning Target SB Activity Activity 1 Investigating Patterns 1-1 Learning Targets: Identify patterns in data. Use tables, graphs,

More information

2 Representing Motion 4 How Fast? MAINIDEA Write the Main Idea for this section.

2 Representing Motion 4 How Fast? MAINIDEA Write the Main Idea for this section. 2 Representing Motion 4 How Fast? MAINIDEA Write the Main Idea for this section. REVIEW VOCABULARY absolute value Recall and write the definition of the Review Vocabulary term. absolute value NEW VOCABULARY

More information

AP Calculus AB Summer Assignment

AP Calculus AB Summer Assignment AP Calculus AB Summer Assignment Name: When you come back to school, it is my epectation that you will have this packet completed. You will be way behind at the beginning of the year if you haven t attempted

More information

Wednesday, September 19 Handout: Ordinary Least Squares Estimation Procedure The Mechanics

Wednesday, September 19 Handout: Ordinary Least Squares Estimation Procedure The Mechanics Amherst College Department of Economics Economics Fall 2012 Wednesday, September 19 Handout: Ordinary Least Squares Estimation Procedure he Mechanics Preview Best Fitting Line: Income and Savings Clint

More information

MATH 1130 Exam 1 Review Sheet

MATH 1130 Exam 1 Review Sheet MATH 1130 Exam 1 Review Sheet The Cartesian Coordinate Plane The Cartesian Coordinate Plane is a visual representation of the collection of all ordered pairs (x, y) where x and y are real numbers. This

More information

Regression and correlation. Correlation & Regression, I. Regression & correlation. Regression vs. correlation. Involve bivariate, paired data, X & Y

Regression and correlation. Correlation & Regression, I. Regression & correlation. Regression vs. correlation. Involve bivariate, paired data, X & Y Regression and correlation Correlation & Regression, I 9.07 4/1/004 Involve bivariate, paired data, X & Y Height & weight measured for the same individual IQ & exam scores for each individual Height of

More information

appstats27.notebook April 06, 2017

appstats27.notebook April 06, 2017 Chapter 27 Objective Students will conduct inference on regression and analyze data to write a conclusion. Inferences for Regression An Example: Body Fat and Waist Size pg 634 Our chapter example revolves

More information

Mathematics-I Prof. S.K. Ray Department of Mathematics and Statistics Indian Institute of Technology, Kanpur. Lecture 1 Real Numbers

Mathematics-I Prof. S.K. Ray Department of Mathematics and Statistics Indian Institute of Technology, Kanpur. Lecture 1 Real Numbers Mathematics-I Prof. S.K. Ray Department of Mathematics and Statistics Indian Institute of Technology, Kanpur Lecture 1 Real Numbers In these lectures, we are going to study a branch of mathematics called

More information

SOLUTIONS FOR PROBLEMS 1-30

SOLUTIONS FOR PROBLEMS 1-30 . Answer: 5 Evaluate x x + 9 for x SOLUTIONS FOR PROBLEMS - 0 When substituting x in x be sure to do the exponent before the multiplication by to get (). + 9 5 + When multiplying ( ) so that ( 7) ( ).

More information

SKILL BUILDER TEN. Graphs of Linear Equations with Two Variables. If x = 2 then y = = = 7 and (2, 7) is a solution.

SKILL BUILDER TEN. Graphs of Linear Equations with Two Variables. If x = 2 then y = = = 7 and (2, 7) is a solution. SKILL BUILDER TEN Graphs of Linear Equations with Two Variables A first degree equation is called a linear equation, since its graph is a straight line. In a linear equation, each term is a constant or

More information

B. Weaver (24-Mar-2005) Multiple Regression Chapter 5: Multiple Regression Y ) (5.1) Deviation score = (Y i

B. Weaver (24-Mar-2005) Multiple Regression Chapter 5: Multiple Regression Y ) (5.1) Deviation score = (Y i B. Weaver (24-Mar-2005) Multiple Regression... 1 Chapter 5: Multiple Regression 5.1 Partial and semi-partial correlation Before starting on multiple regression per se, we need to consider the concepts

More information

Overview. Overview. Overview. Specific Examples. General Examples. Bivariate Regression & Correlation

Overview. Overview. Overview. Specific Examples. General Examples. Bivariate Regression & Correlation Bivariate Regression & Correlation Overview The Scatter Diagram Two Examples: Education & Prestige Correlation Coefficient Bivariate Linear Regression Line SPSS Output Interpretation Covariance ou already

More information

22 Approximations - the method of least squares (1)

22 Approximations - the method of least squares (1) 22 Approximations - the method of least squares () Suppose that for some y, the equation Ax = y has no solutions It may happpen that this is an important problem and we can t just forget about it If we

More information

ABE Math Review Package

ABE Math Review Package P a g e ABE Math Review Package This material is intended as a review of skills you once learned and wish to review before your assessment. Before studying Algebra, you should be familiar with all of the

More information

Describing the Relationship between Two Variables

Describing the Relationship between Two Variables 1 Describing the Relationship between Two Variables Key Definitions Scatter : A graph made to show the relationship between two different variables (each pair of x s and y s) measured from the same equation.

More information

MATH 1150 Chapter 2 Notation and Terminology

MATH 1150 Chapter 2 Notation and Terminology MATH 1150 Chapter 2 Notation and Terminology Categorical Data The following is a dataset for 30 randomly selected adults in the U.S., showing the values of two categorical variables: whether or not the

More information

STEP 1: Ask Do I know the SLOPE of the line? (Notice how it s needed for both!) YES! NO! But, I have two NO! But, my line is

STEP 1: Ask Do I know the SLOPE of the line? (Notice how it s needed for both!) YES! NO! But, I have two NO! But, my line is EQUATIONS OF LINES 1. Writing Equations of Lines There are many ways to define a line, but for today, let s think of a LINE as a collection of points such that the slope between any two of those points

More information

Mrs. Poyner/Mr. Page Chapter 3 page 1

Mrs. Poyner/Mr. Page Chapter 3 page 1 Name: Date: Period: Chapter 2: Take Home TEST Bivariate Data Part 1: Multiple Choice. (2.5 points each) Hand write the letter corresponding to the best answer in space provided on page 6. 1. In a statistics

More information

Simple Linear Regression Using Ordinary Least Squares

Simple Linear Regression Using Ordinary Least Squares Simple Linear Regression Using Ordinary Least Squares Purpose: To approximate a linear relationship with a line. Reason: We want to be able to predict Y using X. Definition: The Least Squares Regression

More information

A Quick Algebra Review

A Quick Algebra Review 1. Simplifying Epressions. Solving Equations 3. Problem Solving 4. Inequalities 5. Absolute Values 6. Linear Equations 7. Systems of Equations 8. Laws of Eponents 9. Quadratics 10. Rationals 11. Radicals

More information

LOOKING FOR RELATIONSHIPS

LOOKING FOR RELATIONSHIPS LOOKING FOR RELATIONSHIPS One of most common types of investigation we do is to look for relationships between variables. Variables may be nominal (categorical), for example looking at the effect of an

More information

1/11/2011. Chapter 4: Variability. Overview

1/11/2011. Chapter 4: Variability. Overview Chapter 4: Variability Overview In statistics, our goal is to measure the amount of variability for a particular set of scores, a distribution. In simple terms, if the scores in a distribution are all

More information

Slope Fields: Graphing Solutions Without the Solutions

Slope Fields: Graphing Solutions Without the Solutions 8 Slope Fields: Graphing Solutions Without the Solutions Up to now, our efforts have been directed mainly towards finding formulas or equations describing solutions to given differential equations. Then,

More information

ACCESS TO SCIENCE, ENGINEERING AND AGRICULTURE: MATHEMATICS 1 MATH00030 SEMESTER / Lines and Their Equations

ACCESS TO SCIENCE, ENGINEERING AND AGRICULTURE: MATHEMATICS 1 MATH00030 SEMESTER / Lines and Their Equations ACCESS TO SCIENCE, ENGINEERING AND AGRICULTURE: MATHEMATICS 1 MATH00030 SEMESTER 1 017/018 DR. ANTHONY BROWN. Lines and Their Equations.1. Slope of a Line and its y-intercept. In Euclidean geometry (where

More information

Ch. 17. DETERMINATION OF SAMPLE SIZE

Ch. 17. DETERMINATION OF SAMPLE SIZE LOGO Ch. 17. DETERMINATION OF SAMPLE SIZE Dr. Werner R. Murhadi www.wernermurhadi.wordpress.com Descriptive and Inferential Statistics descriptive statistics is Statistics which summarize and describe

More information

Hawai`i Post-Secondary Math Survey -- Content Items

Hawai`i Post-Secondary Math Survey -- Content Items Hawai`i Post-Secondary Math Survey -- Content Items Section I: Number sense and numerical operations -- Several numerical operations skills are listed below. Please rank each on a scale from 1 (not essential)

More information

Chapter 5 - Differentiating Functions

Chapter 5 - Differentiating Functions Chapter 5 - Differentiating Functions Section 5.1 - Differentiating Functions Differentiation is the process of finding the rate of change of a function. We have proven that if f is a variable dependent

More information

Last few slides from last time

Last few slides from last time Last few slides from last time Example 3: What is the probability that p will fall in a certain range, given p? Flip a coin 50 times. If the coin is fair (p=0.5), what is the probability of getting an

More information

QUADRATIC EQUATIONS M.K. HOME TUITION. Mathematics Revision Guides Level: GCSE Higher Tier

QUADRATIC EQUATIONS M.K. HOME TUITION. Mathematics Revision Guides Level: GCSE Higher Tier Mathematics Revision Guides Quadratic Equations Page 1 of 8 M.K. HOME TUITION Mathematics Revision Guides Level: GCSE Higher Tier QUADRATIC EQUATIONS Version: 3.1 Date: 6-10-014 Mathematics Revision Guides

More information

CHAPTER 4 DESCRIPTIVE MEASURES IN REGRESSION AND CORRELATION

CHAPTER 4 DESCRIPTIVE MEASURES IN REGRESSION AND CORRELATION STP 226 ELEMENTARY STATISTICS CHAPTER 4 DESCRIPTIVE MEASURES IN REGRESSION AND CORRELATION Linear Regression and correlation allows us to examine the relationship between two or more quantitative variables.

More information

Harvard University. Rigorous Research in Engineering Education

Harvard University. Rigorous Research in Engineering Education Statistical Inference Kari Lock Harvard University Department of Statistics Rigorous Research in Engineering Education 12/3/09 Statistical Inference You have a sample and want to use the data collected

More information

Topic 10 - Linear Regression

Topic 10 - Linear Regression Topic 10 - Linear Regression Least squares principle Hypothesis tests/confidence intervals/prediction intervals for regression 1 Linear Regression How much should you pay for a house? Would you consider

More information

Correlation Analysis

Correlation Analysis Simple Regression Correlation Analysis Correlation analysis is used to measure strength of the association (linear relationship) between two variables Correlation is only concerned with strength of the

More information

Information Sources. Class webpage (also linked to my.ucdavis page for the class):

Information Sources. Class webpage (also linked to my.ucdavis page for the class): STATISTICS 108 Outline for today: Go over syllabus Provide requested information I will hand out blank paper and ask questions Brief introduction and hands-on activity Information Sources Class webpage

More information

9.2 Multiplication Properties of Radicals

9.2 Multiplication Properties of Radicals Section 9.2 Multiplication Properties of Radicals 885 9.2 Multiplication Properties of Radicals Recall that the equation x 2 = a, where a is a positive real number, has two solutions, as indicated in Figure

More information

REVIEW 8/2/2017 陈芳华东师大英语系

REVIEW 8/2/2017 陈芳华东师大英语系 REVIEW Hypothesis testing starts with a null hypothesis and a null distribution. We compare what we have to the null distribution, if the result is too extreme to belong to the null distribution (p

More information

THE ROYAL STATISTICAL SOCIETY 2008 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE (MODULAR FORMAT) MODULE 4 LINEAR MODELS

THE ROYAL STATISTICAL SOCIETY 2008 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE (MODULAR FORMAT) MODULE 4 LINEAR MODELS THE ROYAL STATISTICAL SOCIETY 008 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE (MODULAR FORMAT) MODULE 4 LINEAR MODELS The Society provides these solutions to assist candidates preparing for the examinations

More information

Chapter 1 Statistical Inference

Chapter 1 Statistical Inference Chapter 1 Statistical Inference causal inference To infer causality, you need a randomized experiment (or a huge observational study and lots of outside information). inference to populations Generalizations

More information

Module 8: Linear Regression. The Applied Research Center

Module 8: Linear Regression. The Applied Research Center Module 8: Linear Regression The Applied Research Center Module 8 Overview } Purpose of Linear Regression } Scatter Diagrams } Regression Equation } Regression Results } Example Purpose } To predict scores

More information

Chapter 3 Representations of a Linear Relation

Chapter 3 Representations of a Linear Relation Chapter 3 Representations of a Linear Relation The purpose of this chapter is to develop fluency in the ways of representing a linear relation, and in extracting information from these representations.

More information

Simple linear regression

Simple linear regression Simple linear regression Biometry 755 Spring 2008 Simple linear regression p. 1/40 Overview of regression analysis Evaluate relationship between one or more independent variables (X 1,...,X k ) and a single

More information

Mathematics GRADE 8 Teacher Packet

Mathematics GRADE 8 Teacher Packet COMMON CORE Standards Plus Mathematics GRADE 8 Teacher Packet Copyright 01 Learning Plus Associates All Rights Reserved; International Copyright Secured. Permission is hereby granted to teachers to reprint

More information

Regression Analysis in R

Regression Analysis in R Regression Analysis in R 1 Purpose The purpose of this activity is to provide you with an understanding of regression analysis and to both develop and apply that knowledge to the use of the R statistical

More information

Water tank. Fortunately there are a couple of objectors. Why is it straight? Shouldn t it be a curve?

Water tank. Fortunately there are a couple of objectors. Why is it straight? Shouldn t it be a curve? Water tank (a) A cylindrical tank contains 800 ml of water. At t=0 (minutes) a hole is punched in the bottom, and water begins to flow out. It takes exactly 100 seconds for the tank to empty. Draw the

More information

DIFFERENTIAL EQUATIONS

DIFFERENTIAL EQUATIONS DIFFERENTIAL EQUATIONS Basic Concepts Paul Dawkins Table of Contents Preface... Basic Concepts... 1 Introduction... 1 Definitions... Direction Fields... 8 Final Thoughts...19 007 Paul Dawkins i http://tutorial.math.lamar.edu/terms.aspx

More information

For those of you who are taking Calculus AB concurrently with AP Physics, I have developed a

For those of you who are taking Calculus AB concurrently with AP Physics, I have developed a AP Physics C: Mechanics Greetings, For those of you who are taking Calculus AB concurrently with AP Physics, I have developed a brief introduction to Calculus that gives you an operational knowledge of

More information

Experimental Design and Graphical Analysis of Data

Experimental Design and Graphical Analysis of Data Experimental Design and Graphical Analysis of Data A. Designing a controlled experiment When scientists set up experiments they often attempt to determine how a given variable affects another variable.

More information

Module 1 Linear Regression

Module 1 Linear Regression Regression Analysis Although many phenomena can be modeled with well-defined and simply stated mathematical functions, as illustrated by our study of linear, exponential and quadratic functions, the world

More information

Algebra vocabulary CARD SETS Frame Clip Art by Pixels & Ice Cream

Algebra vocabulary CARD SETS Frame Clip Art by Pixels & Ice Cream Algebra vocabulary CARD SETS 1-7 www.lisatilmon.blogspot.com Frame Clip Art by Pixels & Ice Cream Algebra vocabulary Game Materials: one deck of Who has cards Objective: to match Who has words with definitions

More information

Basics of Experimental Design. Review of Statistics. Basic Study. Experimental Design. When an Experiment is Not Possible. Studying Relations

Basics of Experimental Design. Review of Statistics. Basic Study. Experimental Design. When an Experiment is Not Possible. Studying Relations Basics of Experimental Design Review of Statistics And Experimental Design Scientists study relation between variables In the context of experiments these variables are called independent and dependent

More information

Semiconductor Optoelectronics Prof. M. R. Shenoy Department of physics Indian Institute of Technology, Delhi

Semiconductor Optoelectronics Prof. M. R. Shenoy Department of physics Indian Institute of Technology, Delhi Semiconductor Optoelectronics Prof. M. R. Shenoy Department of physics Indian Institute of Technology, Delhi Lecture - 18 Optical Joint Density of States So, today we will discuss the concept of optical

More information

Quantitative Bivariate Data

Quantitative Bivariate Data Statistics 211 (L02) - Linear Regression Quantitative Bivariate Data Consider two quantitative variables, defined in the following way: X i - the observed value of Variable X from subject i, i = 1, 2,,

More information

Error Correcting Codes Prof. Dr. P Vijay Kumar Department of Electrical Communication Engineering Indian Institute of Science, Bangalore

Error Correcting Codes Prof. Dr. P Vijay Kumar Department of Electrical Communication Engineering Indian Institute of Science, Bangalore (Refer Slide Time: 00:54) Error Correcting Codes Prof. Dr. P Vijay Kumar Department of Electrical Communication Engineering Indian Institute of Science, Bangalore Lecture No. # 05 Cosets, Rings & Fields

More information

Module 03 Lecture 14 Inferential Statistics ANOVA and TOI

Module 03 Lecture 14 Inferential Statistics ANOVA and TOI Introduction of Data Analytics Prof. Nandan Sudarsanam and Prof. B Ravindran Department of Management Studies and Department of Computer Science and Engineering Indian Institute of Technology, Madras Module

More information

X = X X n, + X 2

X = X X n, + X 2 CS 70 Discrete Mathematics for CS Fall 2003 Wagner Lecture 22 Variance Question: At each time step, I flip a fair coin. If it comes up Heads, I walk one step to the right; if it comes up Tails, I walk

More information

UNIT 12 ~ More About Regression

UNIT 12 ~ More About Regression ***SECTION 15.1*** The Regression Model When a scatterplot shows a relationship between a variable x and a y, we can use the fitted to the data to predict y for a given value of x. Now we want to do tests

More information

AP Calculus AB Summer Assignment

AP Calculus AB Summer Assignment AP Calculus AB Summer Assignment Name: When you come back to school, you will be epected to have attempted every problem. These skills are all different tools that you will pull out of your toolbo this

More information

Chapter 4. Forces and the Laws of Motion. CH 4 Forces and the Laws of Motion.notebook. April 09, Changes in Motion. A. Force

Chapter 4. Forces and the Laws of Motion. CH 4 Forces and the Laws of Motion.notebook. April 09, Changes in Motion. A. Force CH 4 Forces and the Laws of Motion.notebook Chapter 4 A. Force April 09, 2015 Changes in Motion Forces and the Laws of Motion 1. Defined as the cause of an acceleration, or the change in an object s motion,

More information

STAT 285: Fall Semester Final Examination Solutions

STAT 285: Fall Semester Final Examination Solutions Name: Student Number: STAT 285: Fall Semester 2014 Final Examination Solutions 5 December 2014 Instructor: Richard Lockhart Instructions: This is an open book test. As such you may use formulas such as

More information

Maximum-Likelihood Estimation: Basic Ideas

Maximum-Likelihood Estimation: Basic Ideas Sociology 740 John Fox Lecture Notes Maximum-Likelihood Estimation: Basic Ideas Copyright 2014 by John Fox Maximum-Likelihood Estimation: Basic Ideas 1 I The method of maximum likelihood provides estimators

More information

Chapter 27 Summary Inferences for Regression

Chapter 27 Summary Inferences for Regression Chapter 7 Summary Inferences for Regression What have we learned? We have now applied inference to regression models. Like in all inference situations, there are conditions that we must check. We can test

More information

ALGEBRAIC PRINCIPLES

ALGEBRAIC PRINCIPLES ALGEBRAIC PRINCIPLES Numbers and Operations Standard: 1 Understands and applies concepts of numbers and operations Power 1: Understands numbers, ways of representing numbers, relationships among numbers,

More information

Regression Analysis IV... More MLR and Model Building

Regression Analysis IV... More MLR and Model Building Regression Analysis IV... More MLR and Model Building This session finishes up presenting the formal methods of inference based on the MLR model and then begins discussion of "model building" (use of regression

More information