Simple linear regression: linear relationship between two qunatitative variables. Linear Regression. The regression line

Size: px
Start display at page:

Download "Simple linear regression: linear relationship between two qunatitative variables. Linear Regression. The regression line"

Transcription

1 Linear Regression Simple linear regression: linear relationship etween two qunatitative variales The regression line Facts aout least-squares regression Residuals Influential oservations Cautions aout Correlation and Regression Correlation/regression using averages Lurking variales Association is not causation Correlation tells us aout strength (scatter) and direction of the linear relationship etween two quantitative variales. In addition, we would like to have a numerical description of how oth variales var together. For instance, is one variale increasing faster than the other one? And we would like to make predictions ased on that numerical description. But which line est descries our data? Here we will ask Minita to decide

2 The linear regression line A regression line is a straight line that descries how a response variale changes as an eplanator variale changes. We often use a regression line to predict the value of for a given value of. In regression, the distinction etween eplanator and response variales is important. Rätta linjers ekvation (repetition) A Statistical model Simple linear regression model X Y Y E { Y } slope is = dependent variale = independent variale = -intercept = slope of the line X ε= error variale, normall distriuted aout (mean is zero) with constant standard deviation. The error variale accounts for all the variales, oth measurale and unmeasurale that are not part of the model 2

3 Estimating regression line for data The least-squares regression (minsta-kvadrat regression) line is the unique line such that the sum of the squared vertical () distances etween the data points and the line is the smallest possile. Distances etween the oserved points and line are called residuals of the model for the data. Residuals can e oth positive and negative. Estimation of β and β β and β are unknown and are estimated ( Minita) from the data least square regression are and respectivel We get the estimated model: That is, we get ˆ ˆ -hat is the predicted response for an is the oserved value is the slope is the intercept An eample For randoml selected 6 students numer of studing hours for the eam and eam marks are recorded student hours marks () () Total mean

4 Eample... Minita output: Regression Analsis: Marks versus Hours The regression equation is Marks =,3 +,65 Hours Predictor Coef SE Coef T P Constant,33 5,56,22,837 Hours,6486,324 2,45, S = 5,5386 R-Sq = 97,5% R-Sq(adj) = 96,9% Interpretations: =.3: when a student spends no hours for studing he/she gets.3 marks =.65: for each additional one hour of stud student gets an increase of.65 in his/her mark Another eample: ads and revenue In order to see the relationship etween numer of advertisents in local radio services ads and revenue (in SEK s) during a month, the manager of a fast food compan recorded them for several months. ads revenue 327, , , , , , , , Epected revenue ads = When the numer of advertisements increases with one, the monthl revenue increases with SEK (in ). = When no advertising is done in local radios, the monthl revunue is SEK (in s) 4

5 5 s s r E. s s r E. Coefficient of determination, r 2 r 2 represents the percentage of the variance in (vertical scatter from the regression line) that can e eplained changes in. s s r r 2, the coefficient of determination, is the square of the correlation coefficient.

6 r = - r 2 = Changes in eplain % of the variations in. Y can e entirel predicted for an given value of. r =.87 r 2 =.76 r = r 2 = Changes in eplain % of the variations in. The value(s) takes is (are) entirel independent of what value takes. Here the change in onl eplains 76% of the change in. The rest of the change in (the vertical scatter, shown as red arrows) must e eplained something other than. hours and marks eample Coefficient of determination S = 5,5386 R-Sq = 97,5% R-Sq(adj) = 96,9% 97.5% of the variance of marks is eplained hours. Facts aout least-squares regression 6

7 Etrapolation!!! Etrapolation is the use of a regression line for predictions outside the range of values used to otain the line.!!! This can e a ver stupid thing to do, as seen here. The intercept Sometimes the -intercept is not iologicall possile. Here we have negative lood alcohol content, which makes no sense But the negative value is appropriate for the equation of the regression line. -intercept shows negative lood alcohol There is a lot of scatter in the data, and the line is just an estimate. Residuals The distances from each point to the least-squares regression line give us potentiall useful information aout the contriution of individual data points to the overall pattern of scatter. These distances are called residuals. Points aove the line have a positive residual. The sum of these residuals is alwas. Points elow the line have a negative residual. Predicted ŷ Oserved dist. ( ˆ) residual 7

8 Residual plots Residuals are randoml scattered good! Curved pattern means the relationship ou are looking at is not linear. A change in variailit across plot is a warning sign. You need to find out wh it is, and rememer that predictions made in areas of larger variailit will not e as good. Regression diagnostics: ads and revenue 8

9 Assessing the Model The least squares method produces a regression line whether or not there are linear relationship etween X and Y. Consequentl, it is important to assess how well the linear model fits the data. Check if we have reasonal high R-square value. Check if regression assumptions are fulfilled: do the residuals of the model. Do the residuals have constant variation for all predicted values Draw a scatter plot of residuals against predicted values to see if spread is the same 2. Do the residuals independent from each other Draw a scatter plot of residuals against predicted values to see if the follow a regular pattern 3. Normal with aout zero: draw a histogram of the residuals and normal P-P plot Alwas plot our data A correlation coefficient and a regression line can e calculated for an relationship etween two quantitative variales. However, outliers can greatl influence the results. Also, running a linear regression on a nonlinear association is not onl meaningless ut misleading. log_, 8,7 2, 2, 2,8 2,5, 9,9 2,29, 8, 2,9 9, 6,8,92 4, 8, 2,9 5, 22, 3,9 6, 27, 3,3 4, 8,2 2,9 Y is transformed into log_ to get a linear relationship 7, 33, 3,5 3, 3,5 2,6 So, make sure to alwas plot our data efore ou run a correlation or regression analsis. However, making the scatterplots shows us that the correlation/ regression analsis is not appropriate for all data sets. Moderate linear association; regression OK. Ovious nonlinear relationship; regression not OK. One point deviates from the highl linear pattern; this outlier must e eamined closel efore proceeding. Just one ver influential point; all other points have the same value; a redesign is due here. 9

10 Lurking variales A lurking variale is a variale not included in the stud design that does have an effect on the variales studied. Lurking variales can falsel suggest a relationship. What is the lurking variale in these eamples? How could ou answer if ou didn t know anthing aout the topic? Strong positive association etween numer of firefighters at a fire site and the amount of damage a fire does. Negative association etween moderate amounts of wine drinking and death rates from heart disease in developed nations. Simpsons parado (s i Moore). Eempel: Vilket sjukhus är det ättre? 29 Men om vi även vet om patientens tillstånd innan operationen... 3

11 Vocaular: lurking vs. confounding A lurking variale is a variale that is not among the eplanator or response variales in a stud and et ma influence the interpretation of relationships among those variales. Two variales are confounded when their effects on a response variale cannot e distinguished from each other. The confounded variales ma e either eplanator variales or lurking variales. But ou often see them used interchangeal Association is not causation An association etween an eplanator variale and a response variale, even if it is ver strong, is not itself good evidence that changes in actuall cause changes in. Eample: There is a high positive correlation etween the numer of television sets per person () and the average life epectanc () for the world s nations. Could we lengthen the lives of people in Rwanda shipping them TV sets? The est wa to get evidence that causes is to do an eperiment in which we change and keep lurking variales under control. Caution efore rushing into a correlation or a regression analsis Do not use a regression on inappropriate data. Pattern in the residuals Presence of large outliers Use residual plots for help. Clumped data falsel appearing linear Beware of lurking variales. Avoid etrapolating (going eond interpolation). A relationship, however strong, does not itself impl causation.

12 Linear regression eample (multiple) 2

Relationships Regression

Relationships Regression Relationships Regression BPS chapter 5 2006 W.H. Freeman and Company Objectives (BPS chapter 5) Regression Regression lines The least-squares regression line Using technology Facts about least-squares

More information

Objectives. 2.3 Least-squares regression. Regression lines. Prediction and Extrapolation. Correlation and r 2. Transforming relationships

Objectives. 2.3 Least-squares regression. Regression lines. Prediction and Extrapolation. Correlation and r 2. Transforming relationships Objectives 2.3 Least-squares regression Regression lines Prediction and Extrapolation Correlation and r 2 Transforming relationships Adapted from authors slides 2012 W.H. Freeman and Company Straight Line

More information

Topic - 12 Linear Regression and Correlation

Topic - 12 Linear Regression and Correlation Topic 1 Linear Regression and Correlation Correlation & Regression Univariate & Bivariate tatistics U: frequenc distribution, mean, mode, range, standard deviation B: correlation two variables Correlation

More information

Chapter 11. Correlation and Regression

Chapter 11. Correlation and Regression Chapter 11 Correlation and Regression Correlation A relationship between two variables. The data can be represented b ordered pairs (, ) is the independent (or eplanator) variable is the dependent (or

More information

Chapter 14. Statistical versus Deterministic Relationships. Distance versus Speed. Describing Relationships: Scatterplots and Correlation

Chapter 14. Statistical versus Deterministic Relationships. Distance versus Speed. Describing Relationships: Scatterplots and Correlation Chapter 14 Describing Relationships: Scatterplots and Correlation Chapter 14 1 Statistical versus Deterministic Relationships Distance versus Speed (when travel time is constant). Income (in millions of

More information

Looking at data: relationships

Looking at data: relationships Looking at data: relationships Least-squares regression IPS chapter 2.3 2006 W. H. Freeman and Company Objectives (IPS chapter 2.3) Least-squares regression p p The regression line Making predictions:

More information

Chapter 2: Looking at Data Relationships (Part 3)

Chapter 2: Looking at Data Relationships (Part 3) Chapter 2: Looking at Data Relationships (Part 3) Dr. Nahid Sultana Chapter 2: Looking at Data Relationships 2.1: Scatterplots 2.2: Correlation 2.3: Least-Squares Regression 2.5: Data Analysis for Two-Way

More information

Review of Regression Basics

Review of Regression Basics Review of Regression Basics When describing a Bivariate Relationship: Make a Scatterplot Strength, Direction, Form Model: y-hat=a+bx Interpret slope in context Make Predictions Residual = Observed-Predicted

More information

Chapter 27 Summary Inferences for Regression

Chapter 27 Summary Inferences for Regression Chapter 7 Summary Inferences for Regression What have we learned? We have now applied inference to regression models. Like in all inference situations, there are conditions that we must check. We can test

More information

AP Statistics Unit 6 Note Packet Linear Regression. Scatterplots and Correlation

AP Statistics Unit 6 Note Packet Linear Regression. Scatterplots and Correlation Scatterplots and Correlation Name Hr A scatterplot shows the relationship between two quantitative variables measured on the same individuals. variable (y) measures an outcome of a study variable (x) may

More information

Review of Regression Basics

Review of Regression Basics Review of Regression Basics When describing a Bivariate Relationship: Make a plot Strength, Direction, Form Model: yhata+b Interpret slope in contet Make Predictions Residual ObservedPredicted Assess the

More information

1. Define the following terms (1 point each): alternative hypothesis

1. Define the following terms (1 point each): alternative hypothesis 1 1. Define the following terms (1 point each): alternative hypothesis One of three hypotheses indicating that the parameter is not zero; one states the parameter is not equal to zero, one states the parameter

More information

Correlation and regression. Correlation and regression analysis. Measures of association. Why bother? Positive linear relationship

Correlation and regression. Correlation and regression analysis. Measures of association. Why bother? Positive linear relationship 1 Correlation and regression analsis 12 Januar 2009 Monda, 14.00-16.00 (C1058) Frank Haege Department of Politics and Public Administration Universit of Limerick frank.haege@ul.ie www.frankhaege.eu Correlation

More information

ONLINE PAGE PROOFS. Relationships between two numerical variables

ONLINE PAGE PROOFS. Relationships between two numerical variables 14 Relationships between two numerical variables 14.1 Kick off with CAS 14.2 Scatterplots and basic correlation 14.3 Further correlation coefficients 14.4 Making predictions 14.5 Review 14.1 Kick off with

More information

Chapter 5 Friday, May 21st

Chapter 5 Friday, May 21st Chapter 5 Friday, May 21 st Overview In this Chapter we will see three different methods we can use to describe a relationship between two quantitative variables. These methods are: Scatterplot Correlation

More information

Biostatistics in Research Practice - Regression I

Biostatistics in Research Practice - Regression I Biostatistics in Research Practice - Regression I Simon Crouch 30th Januar 2007 In scientific studies, we often wish to model the relationships between observed variables over a sample of different subjects.

More information

The Mean Version One way to write the One True Regression Line is: Equation 1 - The One True Line

The Mean Version One way to write the One True Regression Line is: Equation 1 - The One True Line Chapter 27: Inferences for Regression And so, there is one more thing which might vary one more thing aout which we might want to make some inference: the slope of the least squares regression line. The

More information

Linear correlation. Contents. 1 Linear correlation. 1.1 Introduction. Anthony Tanbakuchi Department of Mathematics Pima Community College

Linear correlation. Contents. 1 Linear correlation. 1.1 Introduction. Anthony Tanbakuchi Department of Mathematics Pima Community College Introductor Statistics Lectures Linear correlation Testing two variables for a linear relationship Anthon Tanbakuchi Department of Mathematics Pima Communit College Redistribution of this material is prohibited

More information

Lecture 4 Scatterplots, Association, and Correlation

Lecture 4 Scatterplots, Association, and Correlation Lecture 4 Scatterplots, Association, and Correlation Previously, we looked at Single variables on their own One or more categorical variable In this lecture: We shall look at two quantitative variables.

More information

Chapter 9 Regression. 9.1 Simple linear regression Linear models Least squares Predictions and residuals.

Chapter 9 Regression. 9.1 Simple linear regression Linear models Least squares Predictions and residuals. 9.1 Simple linear regression 9.1.1 Linear models Response and eplanatory variables Chapter 9 Regression With bivariate data, it is often useful to predict the value of one variable (the response variable,

More information

Chapter 13 Student Lecture Notes Department of Quantitative Methods & Information Systems. Business Statistics

Chapter 13 Student Lecture Notes Department of Quantitative Methods & Information Systems. Business Statistics Chapter 13 Student Lecture Notes 13-1 Department of Quantitative Methods & Information Sstems Business Statistics Chapter 14 Introduction to Linear Regression and Correlation Analsis QMIS 0 Dr. Mohammad

More information

Chapter 3: Describing Relationships

Chapter 3: Describing Relationships Chapter 3: Describing Relationships Section 3.2 The Practice of Statistics, 4 th edition For AP* STARNES, YATES, MOORE Chapter 3 Describing Relationships 3.1 Scatterplots and Correlation 3.2 Section 3.2

More information

Chapter 8. Linear Regression. Copyright 2010 Pearson Education, Inc.

Chapter 8. Linear Regression. Copyright 2010 Pearson Education, Inc. Chapter 8 Linear Regression Copyright 2010 Pearson Education, Inc. Fat Versus Protein: An Example The following is a scatterplot of total fat versus protein for 30 items on the Burger King menu: Copyright

More information

Ch. 3 Review - LSRL AP Stats

Ch. 3 Review - LSRL AP Stats Ch. 3 Review - LSRL AP Stats Multiple Choice Identify the choice that best completes the statement or answers the question. Scenario 3-1 The height (in feet) and volume (in cubic feet) of usable lumber

More information

Lecture 4 Scatterplots, Association, and Correlation

Lecture 4 Scatterplots, Association, and Correlation Lecture 4 Scatterplots, Association, and Correlation Previously, we looked at Single variables on their own One or more categorical variables In this lecture: We shall look at two quantitative variables.

More information

Chapter 3: Describing Relationships

Chapter 3: Describing Relationships Chapter 3: Describing Relationships Section 3.2 The Practice of Statistics, 4 th edition For AP* STARNES, YATES, MOORE Chapter 3 Describing Relationships 3.1 Scatterplots and Correlation 3.2 Section 3.2

More information

ab is shifted horizontally by h units. ab is shifted vertically by k units.

ab is shifted horizontally by h units. ab is shifted vertically by k units. Algera II Notes Unit Eight: Eponential and Logarithmic Functions Sllaus Ojective: 8. The student will graph logarithmic and eponential functions including ase e. Eponential Function: a, 0, Graph of an

More information

Chapter 6: Exploring Data: Relationships Lesson Plan

Chapter 6: Exploring Data: Relationships Lesson Plan Chapter 6: Exploring Data: Relationships Lesson Plan For All Practical Purposes Displaying Relationships: Scatterplots Mathematical Literacy in Today s World, 9th ed. Making Predictions: Regression Line

More information

Graphs and polynomials

Graphs and polynomials 1 1A The inomial theorem 1B Polnomials 1C Division of polnomials 1D Linear graphs 1E Quadratic graphs 1F Cuic graphs 1G Quartic graphs Graphs and polnomials AreAS of STud Graphs of polnomial functions

More information

What is the easiest way to lose points when making a scatterplot?

What is the easiest way to lose points when making a scatterplot? Day #1: Read 141-142 3.1 Describing Relationships Why do we study relationships between two variables? Read 143-144 Page 144: Check Your Understanding Read 144-149 How do you know which variable to put

More information

Chapter 9. Correlation and Regression

Chapter 9. Correlation and Regression Chapter 9 Correlation and Regression Lesson 9-1/9-2, Part 1 Correlation Registered Florida Pleasure Crafts and Watercraft Related Manatee Deaths 100 80 60 40 20 0 1991 1993 1995 1997 1999 Year Boats in

More information

AP Statistics Bivariate Data Analysis Test Review. Multiple-Choice

AP Statistics Bivariate Data Analysis Test Review. Multiple-Choice Name Period AP Statistics Bivariate Data Analysis Test Review Multiple-Choice 1. The correlation coefficient measures: (a) Whether there is a relationship between two variables (b) The strength of the

More information

Chapter 6. Exploring Data: Relationships

Chapter 6. Exploring Data: Relationships Chapter 6 Exploring Data: Relationships For All Practical Purposes: Effective Teaching A characteristic of an effective instructor is fairness and consistenc in grading and evaluating student performance.

More information

Correlation. Bret Hanlon and Bret Larget. Department of Statistics University of Wisconsin Madison. December 6, Correlation 1 / 25

Correlation. Bret Hanlon and Bret Larget. Department of Statistics University of Wisconsin Madison. December 6, Correlation 1 / 25 Correlation Bret Hanlon and Bret Larget Department of Statistics Universit of Wisconsin Madison December 6, 2 Correlation / 25 The Big Picture We have just completed a length series of lectures on ANOVA

More information

Start with review, some new definitions, and pictures on the white board. Assumptions in the Normal Linear Regression Model

Start with review, some new definitions, and pictures on the white board. Assumptions in the Normal Linear Regression Model Start with review, some new definitions, and pictures on the white board. Assumptions in the Normal Linear Regression Model A1: There is a linear relationship between X and Y. A2: The error terms (and

More information

y n 1 ( x i x )( y y i n 1 i y 2

y n 1 ( x i x )( y y i n 1 i y 2 STP3 Brief Class Notes Instructor: Ela Jackiewicz Chapter Regression and Correlation In this chapter we will explore the relationship between two quantitative variables, X an Y. We will consider n ordered

More information

CHAPTER. Scatterplots

CHAPTER. Scatterplots CHAPTER 7 Two-Variable Data Analysis IN THIS CHAPTER Summary: In the previous chapter we used eploratory data analysis to help us understand what a one-variable data set was saying to us. In this chapter

More information

Chapter 12 Summarizing Bivariate Data Linear Regression and Correlation

Chapter 12 Summarizing Bivariate Data Linear Regression and Correlation Chapter 1 Summarizing Bivariate Data Linear Regression and Correlation This chapter introduces an important method for making inferences about a linear correlation (or relationship) between two variables,

More information

Chapter 8. Linear Regression. The Linear Model. Fat Versus Protein: An Example. The Linear Model (cont.) Residuals

Chapter 8. Linear Regression. The Linear Model. Fat Versus Protein: An Example. The Linear Model (cont.) Residuals Chapter 8 Linear Regression Copyright 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide 8-1 Copyright 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Fat Versus

More information

Dependence and scatter-plots. MVE-495: Lecture 4 Correlation and Regression

Dependence and scatter-plots. MVE-495: Lecture 4 Correlation and Regression Dependence and scatter-plots MVE-495: Lecture 4 Correlation and Regression It is common for two or more quantitative variables to be measured on the same individuals. Then it is useful to consider what

More information

Non-Linear Regression Samuel L. Baker

Non-Linear Regression Samuel L. Baker NON-LINEAR REGRESSION 1 Non-Linear Regression 2006-2008 Samuel L. Baker The linear least squares method that you have een using fits a straight line or a flat plane to a unch of data points. Sometimes

More information

f 0 ab a b: base f

f 0 ab a b: base f Precalculus Notes: Unit Eponential and Logarithmic Functions Sllaus Ojective: 9. The student will sketch the graph of a eponential, logistic, or logarithmic function. 9. The student will evaluate eponential

More information

Section 3.3. How Can We Predict the Outcome of a Variable? Agresti/Franklin Statistics, 1of 18

Section 3.3. How Can We Predict the Outcome of a Variable? Agresti/Franklin Statistics, 1of 18 Section 3.3 How Can We Predict the Outcome of a Variable? Agresti/Franklin Statistics, 1of 18 Regression Line Predicts the value for the response variable, y, as a straight-line function of the value of

More information

STATISTICAL DATA ANALYSIS IN EXCEL

STATISTICAL DATA ANALYSIS IN EXCEL Microarra Center STATISTICAL DATA ANALYSIS IN EXCEL Lecture 5 Linear Regression dr. Petr Nazarov 14-1-213 petr.nazarov@crp-sante.lu Statistical data analsis in Ecel. 5. Linear regression OUTLINE Lecture

More information

Conditions for Regression Inference:

Conditions for Regression Inference: AP Statistics Chapter Notes. Inference for Linear Regression We can fit a least-squares line to any data relating two quantitative variables, but the results are useful only if the scatterplot shows a

More information

Linear Regression. Linear Regression. Linear Regression. Did You Mean Association Or Correlation?

Linear Regression. Linear Regression. Linear Regression. Did You Mean Association Or Correlation? Did You Mean Association Or Correlation? AP Statistics Chapter 8 Be careful not to use the word correlation when you really mean association. Often times people will incorrectly use the word correlation

More information

Related Example on Page(s) R , 148 R , 148 R , 156, 157 R3.1, R3.2. Activity on 152, , 190.

Related Example on Page(s) R , 148 R , 148 R , 156, 157 R3.1, R3.2. Activity on 152, , 190. Name Chapter 3 Learning Objectives Identify explanatory and response variables in situations where one variable helps to explain or influences the other. Make a scatterplot to display the relationship

More information

Analysis of Bivariate Data

Analysis of Bivariate Data Analysis of Bivariate Data Data Two Quantitative variables GPA and GAES Interest rates and indices Tax and fund allocation Population size and prison population Bivariate data (x,y) Case corr&reg 2 Independent

More information

appstats27.notebook April 06, 2017

appstats27.notebook April 06, 2017 Chapter 27 Objective Students will conduct inference on regression and analyze data to write a conclusion. Inferences for Regression An Example: Body Fat and Waist Size pg 634 Our chapter example revolves

More information

Higher. Polynomials and Quadratics. Polynomials and Quadratics 1

Higher. Polynomials and Quadratics. Polynomials and Quadratics 1 Higher Mathematics Polnomials and Quadratics Contents Polnomials and Quadratics 1 1 Quadratics EF 1 The Discriminant EF Completing the Square EF Sketching Paraolas EF 7 5 Determining the Equation of a

More information

Practice Questions for Exam 1

Practice Questions for Exam 1 Practice Questions for Exam 1 1. A used car lot evaluates their cars on a number of features as they arrive in the lot in order to determine their worth. Among the features looked at are miles per gallon

More information

Chapter 5 Least Squares Regression

Chapter 5 Least Squares Regression Chapter 5 Least Squares Regression A Royal Bengal tiger wandered out of a reserve forest. We tranquilized him and want to take him back to the forest. We need an idea of his weight, but have no scale!

More information

2-3. Linear Regression and Correlation. Vocabulary

2-3. Linear Regression and Correlation. Vocabulary Chapter 2 Lesson 2-3 Linear Regression and Correlation BIG IDEA The regression line is the line of best fi t to data. The correlation coeffi cient measures the strength and direction of a linear pattern

More information

Simple Linear Regression Using Ordinary Least Squares

Simple Linear Regression Using Ordinary Least Squares Simple Linear Regression Using Ordinary Least Squares Purpose: To approximate a linear relationship with a line. Reason: We want to be able to predict Y using X. Definition: The Least Squares Regression

More information

STA Module 5 Regression and Correlation. Learning Objectives. Learning Objectives (Cont.) Upon completing this module, you should be able to:

STA Module 5 Regression and Correlation. Learning Objectives. Learning Objectives (Cont.) Upon completing this module, you should be able to: STA 2023 Module 5 Regression and Correlation Learning Objectives Upon completing this module, you should be able to: 1. Define and apply the concepts related to linear equations with one independent variable.

More information

Introduction Direct Variation Rates of Change Scatter Plots. Introduction. EXAMPLE 1 A Mathematical Model

Introduction Direct Variation Rates of Change Scatter Plots. Introduction. EXAMPLE 1 A Mathematical Model APPENDIX B Mathematical Modeling B1 Appendi B Mathematical Modeling B.1 Modeling Data with Linear Functions Introduction Direct Variation Rates of Change Scatter Plots Introduction The primar objective

More information

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 y 1 2 3 4 5 6 7 x Data Analysis and Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasini/teaching.html Lecture 32 Suhasini Subba Rao Previous lecture We are interested in whether a dependent

More information

Historical Note. Regression. Line of Best Fit

Historical Note. Regression. Line of Best Fit 11 4 Regression Objective 4. Compute the equation of the regression line. In studing relationships between two variables, collect the data and then construct a scatter plot. The purpose of the scatter

More information

Graphs and polynomials

Graphs and polynomials 5_6_56_MQVMM - _t Page Frida, Novemer 8, 5 :5 AM MQ Maths Methods / Final Pages / 8//5 Graphs and polnomials VCEcoverage Areas of stud Units & Functions and graphs Algera In this chapter A The inomial

More information

14.2 Choosing Among Linear, Quadratic, and Exponential Models

14.2 Choosing Among Linear, Quadratic, and Exponential Models Name Class Date 14.2 Choosing Among Linear, Quadratic, and Eponential Models Essential Question: How do ou choose among, linear, quadratic, and eponential models for a given set of data? Resource Locker

More information

INFERENCE FOR REGRESSION

INFERENCE FOR REGRESSION CHAPTER 3 INFERENCE FOR REGRESSION OVERVIEW In Chapter 5 of the textbook, we first encountered regression. The assumptions that describe the regression model we use in this chapter are the following. We

More information

22S39: Class Notes / November 14, 2000 back to start 1

22S39: Class Notes / November 14, 2000 back to start 1 Model diagnostics Interpretation of fitted regression model 22S39: Class Notes / November 14, 2000 back to start 1 Model diagnostics 22S39: Class Notes / November 14, 2000 back to start 2 Model diagnostics

More information

Lecture 18: Simple Linear Regression

Lecture 18: Simple Linear Regression Lecture 18: Simple Linear Regression BIOS 553 Department of Biostatistics University of Michigan Fall 2004 The Correlation Coefficient: r The correlation coefficient (r) is a number that measures the strength

More information

12Variation UNCORRECTED PAGE PROOFS

12Variation UNCORRECTED PAGE PROOFS Variation. Kick off with CAS. Direct, inverse and joint variation. Data transformations. Data modelling. Review U N C O R R EC TE D PA G E PR O O FS. Kick off with CAS Please refer to the Resources ta

More information

Example: Can an increase in non-exercise activity (e.g. fidgeting) help people gain less weight?

Example: Can an increase in non-exercise activity (e.g. fidgeting) help people gain less weight? Example: Can an increase in non-exercise activity (e.g. fidgeting) help people gain less weight? 16 subjects overfed for 8 weeks Explanatory: change in energy use from non-exercise activity (calories)

More information

Linear Regression Communication, skills, and understanding Calculator Use

Linear Regression Communication, skills, and understanding Calculator Use Linear Regression Communication, skills, and understanding Title, scale and label the horizontal and vertical axes Comment on the direction, shape (form), and strength of the relationship and unusual features

More information

23. Inference for regression

23. Inference for regression 23. Inference for regression The Practice of Statistics in the Life Sciences Third Edition 2014 W. H. Freeman and Company Objectives (PSLS Chapter 23) Inference for regression The regression model Confidence

More information

Math 243 OpenStax Chapter 12 Scatterplots and Linear Regression OpenIntro Section and

Math 243 OpenStax Chapter 12 Scatterplots and Linear Regression OpenIntro Section and Math 243 OpenStax Chapter 12 Scatterplots and Linear Regression OpenIntro Section 2.1.1 and 8.1-8.2.6 Overview Scatterplots Explanatory and Response Variables Describing Association The Regression Equation

More information

Business Statistics. Lecture 9: Simple Regression

Business Statistics. Lecture 9: Simple Regression Business Statistics Lecture 9: Simple Regression 1 On to Model Building! Up to now, class was about descriptive and inferential statistics Numerical and graphical summaries of data Confidence intervals

More information

1. Create a scatterplot of this data. 2. Find the correlation coefficient.

1. Create a scatterplot of this data. 2. Find the correlation coefficient. How Fast Foods Compare Company Entree Total Calories Fat (grams) McDonald s Big Mac 540 29 Filet o Fish 380 18 Burger King Whopper 670 40 Big Fish Sandwich 640 32 Wendy s Single Burger 470 21 1. Create

More information

Correlation and simple linear regression S5

Correlation and simple linear regression S5 Basic medical statistics for clinical and eperimental research Correlation and simple linear regression S5 Katarzyna Jóźwiak k.jozwiak@nki.nl November 15, 2017 1/41 Introduction Eample: Brain size and

More information

Number Plane Graphs and Coordinate Geometry

Number Plane Graphs and Coordinate Geometry Numer Plane Graphs and Coordinate Geometr Now this is m kind of paraola! Chapter Contents :0 The paraola PS, PS, PS Investigation: The graphs of paraolas :0 Paraolas of the form = a + + c PS Fun Spot:

More information

appstats8.notebook October 11, 2016

appstats8.notebook October 11, 2016 Chapter 8 Linear Regression Objective: Students will construct and analyze a linear model for a given set of data. Fat Versus Protein: An Example pg 168 The following is a scatterplot of total fat versus

More information

Chapter 6 Scatterplots, Association and Correlation

Chapter 6 Scatterplots, Association and Correlation Chapter 6 Scatterplots, Association and Correlation Looking for Correlation Example Does the number of hours you watch TV per week impact your average grade in a class? Hours 12 10 5 3 15 16 8 Grade 70

More information

Stat 101 Exam 1 Important Formulas and Concepts 1

Stat 101 Exam 1 Important Formulas and Concepts 1 1 Chapter 1 1.1 Definitions Stat 101 Exam 1 Important Formulas and Concepts 1 1. Data Any collection of numbers, characters, images, or other items that provide information about something. 2. Categorical/Qualitative

More information

Nov 13 AP STAT. 1. Check/rev HW 2. Review/recap of notes 3. HW: pg #5,7,8,9,11 and read/notes pg smartboad notes ch 3.

Nov 13 AP STAT. 1. Check/rev HW 2. Review/recap of notes 3. HW: pg #5,7,8,9,11 and read/notes pg smartboad notes ch 3. Nov 13 AP STAT 1. Check/rev HW 2. Review/recap of notes 3. HW: pg 179 184 #5,7,8,9,11 and read/notes pg 185 188 1 Chapter 3 Notes Review Exploring relationships between two variables. BIVARIATE DATA Is

More information

Warm-up Using the given data Create a scatterplot Find the regression line

Warm-up Using the given data Create a scatterplot Find the regression line Time at the lunch table Caloric intake 21.4 472 30.8 498 37.7 335 32.8 423 39.5 437 22.8 508 34.1 431 33.9 479 43.8 454 42.4 450 43.1 410 29.2 504 31.3 437 28.6 489 32.9 436 30.6 480 35.1 439 33.0 444

More information

AP Statistics L I N E A R R E G R E S S I O N C H A P 7

AP Statistics L I N E A R R E G R E S S I O N C H A P 7 AP Statistics 1 L I N E A R R E G R E S S I O N C H A P 7 The object [of statistics] is to discover methods of condensing information concerning large groups of allied facts into brief and compendious

More information

Information Sources. Class webpage (also linked to my.ucdavis page for the class):

Information Sources. Class webpage (also linked to my.ucdavis page for the class): STATISTICS 108 Outline for today: Go over syllabus Provide requested information I will hand out blank paper and ask questions Brief introduction and hands-on activity Information Sources Class webpage

More information

Pre-Calculus Multiple Choice Questions - Chapter S8

Pre-Calculus Multiple Choice Questions - Chapter S8 1 If every man married a women who was exactly 3 years younger than he, what would be the correlation between the ages of married men and women? a Somewhat negative b 0 c Somewhat positive d Nearly 1 e

More information

Data transformation. Core: Data analysis. Chapter 5

Data transformation. Core: Data analysis. Chapter 5 Chapter 5 5 Core: Data analsis Data transformation ISBN 978--7-56757-3 Jones et al. 6 66 Core Chapter 5 Data transformation 5A Introduction You first encountered data transformation in Chapter where ou

More information

Test 3A AP Statistics Name:

Test 3A AP Statistics Name: Test 3A AP Statistics Name: Part 1: Multiple Choice. Circle the letter corresponding to the best answer. 1. Other things being equal, larger automobile engines consume more fuel. You are planning an experiment

More information

Mathematical Notation Math Introduction to Applied Statistics

Mathematical Notation Math Introduction to Applied Statistics Mathematical Notation Math 113 - Introduction to Applied Statistics Name : Use Word or WordPerfect to recreate the following documents. Each article is worth 10 points and can be printed and given to the

More information

The following formulas related to this topic are provided on the formula sheet:

The following formulas related to this topic are provided on the formula sheet: Student Notes Prep Session Topic: Exploring Content The AP Statistics topic outline contains a long list of items in the category titled Exploring Data. Section D topics will be reviewed in this session.

More information

Correlation & Simple Regression

Correlation & Simple Regression Chapter 11 Correlation & Simple Regression The previous chapter dealt with inference for two categorical variables. In this chapter, we would like to examine the relationship between two quantitative variables.

More information

Correlation and Regression

Correlation and Regression Correlation and Regression Marc H. Mehlman marcmehlman@yahoo.com University of New Haven All models are wrong. Some models are useful. George Box the statistician knows that in nature there never was a

More information

AMS 7 Correlation and Regression Lecture 8

AMS 7 Correlation and Regression Lecture 8 AMS 7 Correlation and Regression Lecture 8 Department of Applied Mathematics and Statistics, University of California, Santa Cruz Suumer 2014 1 / 18 Correlation pairs of continuous observations. Correlation

More information

28. SIMPLE LINEAR REGRESSION III

28. SIMPLE LINEAR REGRESSION III 28. SIMPLE LINEAR REGRESSION III Fitted Values and Residuals To each observed x i, there corresponds a y-value on the fitted line, y = βˆ + βˆ x. The are called fitted values. ŷ i They are the values of

More information

More Statistical Inference

More Statistical Inference 4 PART FOUR More Statistical Inference CHAPTER 9 Correlation and Regression CHAPTER 10 Chi-Square Tests and the F-Distribution CHAPTER 11 Nonparametric Tests 493 C H A P T E R 9 Correlation and Regression

More information

Inference for Regression Inference about the Regression Model and Using the Regression Line

Inference for Regression Inference about the Regression Model and Using the Regression Line Inference for Regression Inference about the Regression Model and Using the Regression Line PBS Chapter 10.1 and 10.2 2009 W.H. Freeman and Company Objectives (PBS Chapter 10.1 and 10.2) Inference about

More information

Vocabulary. Fitting a Line to Data. Lesson 2-2 Linear Models

Vocabulary. Fitting a Line to Data. Lesson 2-2 Linear Models Lesson 2-2 Linear Models BIG IDEA The sum of squared deviations is a statistic for determining which of two lines fi ts the data better. A linear function is a set of ordered pairs (, ) satisfing an equation

More information

Correlation and Regression

Correlation and Regression Correlation and Regression Dr. Bob Gee Dean Scott Bonney Professor William G. Journigan American Meridian University 1 Learning Objectives Upon successful completion of this module, the student should

More information

Section 0.4 Inverse functions and logarithms

Section 0.4 Inverse functions and logarithms Section 0.4 Inverse functions and logarithms (5/3/07) Overview: Some applications require not onl a function that converts a numer into a numer, ut also its inverse, which converts ack into. In this section

More information

STAT 3900/4950 MIDTERM TWO Name: Spring, 2015 (print: first last ) Covered topics: Two-way ANOVA, ANCOVA, SLR, MLR and correlation analysis

STAT 3900/4950 MIDTERM TWO Name: Spring, 2015 (print: first last ) Covered topics: Two-way ANOVA, ANCOVA, SLR, MLR and correlation analysis STAT 3900/4950 MIDTERM TWO Name: Spring, 205 (print: first last ) Covered topics: Two-way ANOVA, ANCOVA, SLR, MLR and correlation analysis Instructions: You may use your books, notes, and SPSS/SAS. NO

More information

IT 403 Practice Problems (2-2) Answers

IT 403 Practice Problems (2-2) Answers IT 403 Practice Problems (2-2) Answers #1. Which of the following is correct with respect to the correlation coefficient (r) and the slope of the leastsquares regression line (Choose one)? a. They will

More information

Exam 3 Practice Questions Psych , Fall 9

Exam 3 Practice Questions Psych , Fall 9 Vocabular Eam 3 Practice Questions Psch 3101-100, Fall 9 Rather than choosing some practice terms at random, I suggest ou go through all the terms in the vocabular lists. The real eam will ask for definitions

More information

SMAM 319 Exam1 Name. a B.The equation of a line is 3x + y =6. The slope is a. -3 b.3 c.6 d.1/3 e.-1/3

SMAM 319 Exam1 Name. a B.The equation of a line is 3x + y =6. The slope is a. -3 b.3 c.6 d.1/3 e.-1/3 SMAM 319 Exam1 Name 1. Pick the best choice. (10 points-2 each) _c A. A data set consisting of fifteen observations has the five number summary 4 11 12 13 15.5. For this data set it is definitely true

More information

Mathematical Ideas Modelling data, power variation, straightening data with logarithms, residual plots

Mathematical Ideas Modelling data, power variation, straightening data with logarithms, residual plots Kepler s Law Level Upper secondary Mathematical Ideas Modelling data, power variation, straightening data with logarithms, residual plots Description and Rationale Many traditional mathematics prolems

More information

Ch 13 & 14 - Regression Analysis

Ch 13 & 14 - Regression Analysis Ch 3 & 4 - Regression Analysis Simple Regression Model I. Multiple Choice:. A simple regression is a regression model that contains a. only one independent variable b. only one dependent variable c. more

More information

STA 218: Statistics for Management

STA 218: Statistics for Management Al Nosedal. University of Toronto. Fall 2017 My momma always said: Life was like a box of chocolates. You never know what you re gonna get. Forrest Gump. Problem How much do people with a bachelor s degree

More information