Q1: What is the interpretation of the number 4.1? A: There were 4.1 million visits to ER by people 85 and older, Q2: What percent of people 65-74

Size: px
Start display at page:

Download "Q1: What is the interpretation of the number 4.1? A: There were 4.1 million visits to ER by people 85 and older, Q2: What percent of people 65-74"

Transcription

1 Lecture 4 This week lab:exam 1! Review lectures, practice labs 1 to 4 and homework 1 to 5!!!!! Need help? See me during my office hrs, or goto open lab or GS 211. Bring your picture ID and simple calculator.(note computation could be done in Excel) Do not be late.

2 Q1: What is the interpretation of the number 4.1? A: There were 4.1 million visits to ER by people 85 and older, Q2: What percent of people visited the ER between 2009 and 2010? A: About 40%,

3 Q3: What is the total number of ER visits for senior citizens between 2009 and 2010? A: 19.6 millions, Q4: What do you think was the main purpose of this chart? What would be the conclusion? A: Although percentagewise the 85+ population visits ER the most their actual number is the smallest.

4 Q: What % of women age in 2009 were pregnant? Q:Which age group experienced the biggest drop from 1990 to 2009? Q: Which age group had the highest rate of pregnancy in 1990? A: 13.8%, A: year, A: year,

5 Q: Which age group had the highest rate of pregnancy in 2009? Q:Which age group experienced the biggest jump from 1990 to 2009? Q: What do you think was the main purpose of this chart? What would be the conclusion? A :25-29 year, A: year A :There exists a reverse trend! As years go by younger women are less often pregnant while older woman are more often pregnant

6 Q: For which period was the CIA estimate the highest? A , Q: For the period what was the Official Soviet Income Growth? A about 7-8%, Q: For which period do we see the highest discrepancy between Khanin s and CIA estimates? A

7 Q1. Which month has the highest tornado related deaths? A: April, Q2. Which two months have the highest number of tornados? A: May and June

8 Q3. Which month has the lowest number of tornado related deaths? A: July, Q4. The blue chart (tornado deaths) is shifted toward left compared to the red chart. What could be the likely reason for this? A: People tend to underestimate the danger and are unprepared at the beginning of tornado season.

9 Example Compare the # of tornadoes of the months of January, March, May and July. Which of the three months is the most correlated to Jan? Produce a scatter plot of the X axis representing January and the Y axis the month most correlated to January. What is the equation of the regression line and the R^2? If you see 30 tornadoes in January how many tornadoes would you expect to see in the month most correlated to January?

10 jan vs may y = x R² = # of tornadoes in May, Y = * = Round to the nearest integer, Y = 193 tornadoes is one would expect to see in May if 30 tornadoes occurred in January.

11 A few more examples Q1. Consider the Chart and imagine if one would randomly pick a month, what is the probability that this month has more than 4000 tornadoes total? A1. From the chart we can see that there are only three months for which this happened: April, May and June, thus the probability is 3/12 = 1/4 Q2. Consider the Chart and imagine if one would randomly pick two different months, what is the probability that each of them would have more than 5000 tornadoes in total. A2. We have already seen that one can construct 66 different 2-pair months. (Just remember the correlation table). But there is only one pair that satisfies the requirement: (May, June). Thus only one-out-of-66 pairs would work. And probability is 1/66.

12 A few more examples Q3. Out of three statistical measurements we have learned: Median, Average and Standard Deviation, which one is NOT sensitive to outliers? A3. Median. Q4. Which of the three measurements, Median, Average and Standard Deviation, describe the center of the data? A4. Median and Average Note: In literature, we often use Mean and Average indistinguishably.

13 Multiple Regression Week of July 11 th : Exam 2!!!!! Do not miss Lab 5 and Lab 6! Exam 2 will cover materials from lab 0 to lab 6 and of course all lectures Review homework 2 to 8!

14 Warm up: Data in question are SAT.txt. So far we have learned how to work with individual charts based on two variables ( x variable and y variable). Q1.What is the interpretation of on High School chart? A: This is a tricky one. Students typically write: For each 1 point increase in High School GPA one expects increase in College GPA; Another alternative and better answer: Given two students for which High School GPA differs by 1 point, one could expect that their College GPA will differ by points. This second interpretation is useful for College administrators Q2.The largest R square is on the High School vs College GPA chart. How would you comment on this information? A: It looks like High school GPA has the closest relation to College GPA.

15 Warm up: Data in question are SAT.txt. So far we have learned how to work with individual charts based on two variables ( x variable and y variable). Q4.What is the interpretation of the number on the third chart? A: The number should not be interpreted. A person with zero High school GPA does not go to College. Q5. What does each dot represent on the chart? A: A student

16 Breakdown of R^2 from 0 to 0.2 from 0.2 to 0.4 from 0.4 to 0.6 from 0.6 to 0.85 from 0.85 to 1 (poor) (decent) (good) (very good) (excellent) So far so good. Clearly each of the three variables (high school GPA, SAT score and letters) had a positive influence on the college GPA. In other words, if one would like to predict the performance of an incoming freshman, each of these three predictors would be relevant.

17 So how do we compute a student s College GPA based on the letters, HS GPA and SAT? For example: If student A has quality of letter = 8 then his predicted College GPA = (8) = Now if his High school GPA = 3.4 then his predicted College GPA = (3.4) = And if his SAT score is 1050 then his predicated College GPA = (1050) =

18 Warm up: Data in question are SAT.txt. So far we have learned how to work with individual charts based on two variables ( x variable and y variable). Natural question: If one really wants to predict the students performance (and many admission officers do) it would make sense to combine these three variables and try to predict the student s performance based on all three factors at once. But how to do so? For this we need multiple regression.

19 Multiple regression (using the same SAT data). To do this we need to click on Data, then on Data analysis, then on Regression. In dialog box for the input X-variables highlight all three columns, (and do not forget to include Labels) As far as Excel is concerned this highlighting of three columns (instead just one) is the only difference between Regression and Multiple Regression. After performing the above highlighting $B$1: $D$101 should appear in the box for Input X range $A1$:$A$101 should appear in the box for Input Y range) Do not forget to click on the box for Labels and then OK.

20 The following table should appear: We cannot visualize d- dimensions! One of the main and most important problems regarding multiple regressions is the visualization issue. Namely, unlike the one dim case where we can plot the chart and fit the line, here we have 3-dim inputs and 1-dim output and there are no mathematical ways to plot this chart; so we cannot see what is going on. Instead we must depend on the regression table. Predictions: Imagine a student with a good high school GPA=3.5, SAT=1300, and with a letter quality of 9, how do we use the Table above and predict his College GPA????

21 Multiple Regression Analysis Multiple regression analysis is a powerful technique used for predicting the unknown value of a variable from the known value of two or more variables- also called the predictors. More precisely, multiple regression analysis helps us to predict the value of Y for the given values of X 1, X 2,, X k By multiple regression, we mean models with just one dependent(y) and two or more independent (explanatory) variables(xs). The variable whose value is to be predicted is known as the dependent variable and the ones whose known values are used for prediction are known independent (explanatory) variables.

22 The Multiple Regression Model coefficient Standard Error t stat P-value Lower 95% Upper 95% Lower 95% Upper 95% Intercept Variable X Variable X E Variable Xk In general, the multiple regression equation of Y on X 1, X 2,, X k from the table above is given by: Y = b 0 + b 1 X 1 + b 2 X b k X k Here b 0 is the intercept and b 1, b 2, b 3,, b k are analogous to the slope in linear regression equation and are also called regression coefficients. They can be interpreted the same way as slope. Thus if b i = 2.5, it would indicates that Y will increase by 2.5 units if X i increased by 1 unit.

23 The Multiple Regression Model In general, the multiple regression equation of Y on X 1, X 2,, X k is given by: Y = b 0 + b 1 X 1 + b 2 X b k X k Use the above table on SAT data, predict an incoming student s College GPA if his High school GPA = 3.5, SAT = 1300, and Letter quality = 9. From the table above we can read that the predictive model is, Y, the College GPA = Y intercept + coefficient of GPA high* HS GPA + coefficient of SAT*SAT + coefficient of letters *letters College GPA = * * *9 = Thus the predicted College GPA is Important: WE USED ALL THREE VARIABLES!

24 How Good Is the Regression? Once a multiple regression equation has been constructed, one can check how good it is (in terms of predictive ability) by examining the coefficient of determination (R-square, R^2). R- square always lies between 0 and 1. All software provides it whenever regression procedure is run. The closer R^2 is to 1, the better is the model and its prediction. In our case of the SAT data, the predictive model of College GPA based on High school GPA, SAT, and letters has R^2 = which is considered a decent model. Breakdown of R^2 from 0 to 0.2 from 0.2 to 0.4 from 0.4 to 0.6 from 0.6 to 0.85 (poor) (decent) (good) (very good) from 0.85 to 1 (excellent)

25 Let us practice a bit: In general, the multiple regression equation of Y on X 1, X 2,, X k is given by: Y = b 0 + b 1 X 1 + b 2 X b k X k High School GPA SAT Letters Prediction (round to 2 decimal places) ?? ?? ?? ?? 2.85

26 We start with a review: Data in question are SAT.txt. Clearly, each of the three variables (high school GPA, SAT score and letters) had a positive influence on the college GPA. In other words, if one would like to predict the performance of an incoming freshman, each of these three predictors would be relevant. So how do we compute a student s College GPA based on the letters, HS GPA and SAT?

27 So how do we compute a student s College GPA based on the letters, HS GPA and SAT? For example: If student A has quality of letter = 8 then his predicted College GPA = (8) = Now if his High school GPA = 3.4 then his predicted College GPA = (3.4) = And if his SAT score is 1050 then his predicated College GPA = (1050) =

28 So how do we compute a student s College GPA based on the letters, HS GPA and SAT? Natural question: If one really wants to predict the students performance (and many admission officers do) it would make sense to combine these three variables and try to predict the student s performance based on all three factors at once. But how to do so? For this we need multiple regression.

29 Multiple regression (using the same SAT data). To do this we need to click on Data, then on Data analysis, then on Regression. In dialog box for the input X-variables highlight all three columns, (and do not forget to include Labels) As far as Excel is concerned this highlighting of three columns (instead just one) is the only difference between Regression and Multiple Regression. After performing the above highlighting $B$1: $D$101 should appear in the box for Input X range $A1$:$A$101 should appear in the box for Input Y range) Do not forget to click on the box for Labels and then OK.

30 The following table should appear: We cannot visualize d- dimensions! One of the main and most important problems regarding multiple regressions is the visualization issue. Namely, unlike the one dim case where we can plot the chart and fit the line, here we have 3-dim inputs and 1-dim output and there are no mathematical ways to plot this chart; so we cannot see what is going on. Instead we must depend on the regression table.

31 The Multiple Regression Model coefficient Standard Error t stat P-value Lower 95% Upper 95% Lower 95% Upper 95% Intercept Variable X Variable X E Variable Xk In general, the multiple regression equation of Y on X 1, X 2,, X k from the table above is given by: Y = b 0 + b 1 X 1 + b 2 X b k X k Here b 0 is the intercept and b 1, b 2, b 3,, b k are analogous to the slope in linear regression equation and are also called regression coefficients. They can be interpreted the same way as slope. Thus if b i = 2.5, it would indicates that Y will increase by 2.5 units if X i increased by 1 unit.

32 The Multiple Regression Model In general, the multiple regression equation of Y on X 1, X 2,, X k is given by: Y = b 0 + b 1 X 1 + b 2 X b k X k Use the above table on SAT data, predict an incoming student s College GPA if his High school GPA = 3.5, SAT = 1300, and Letter quality = 9. From the table above we can read that the predictive model is, Y, the College GPA = Y intercept + coefficient of GPA high* HS GPA + coefficient of SAT*SAT + coefficient of letters *letters College GPA = * * *9 = Thus the predicted College GPA is Important: WE USED ALL THREE VARIABLES!

33 Practice Use Cars04-1 data. Your TASK is to use engine s size (litre), cylinders, horsepower, and weight (pounds) to predict the retail price of a car. Create the regression table.

34 Using the table predict the retail price of a car if the car s engine size = 4 litre, and the car has 6 cylinders, 400 horsepower, and its weight is 3200 pounds. Retail price = *engine size *cylinders *horsepower *weight. = * * * *3200. = $55,868.45

35 Back to SAT.txt data, after performing multiple regression with variable Y= College GPA we get the table below. Imagine a student with a good high school GPA=3.5, SAT=1300, and with letter quality of 9. The table below allows us to predict his College GPA!! Regression Statistics Multiple R R Square Adjusted R Square Standard Error Observations 100 ANOVA df SS MS F gnificance F Regression E-10 Residual Total Coefficientsandard Err t Stat P-value Lower 95%Upper 95%ower 95.0%pper 95.0% Intercept GPA High SAT Letters From this table we can read that the predictive model is: College GPA= GPA High * SAT * Letters * In our case this becomes College GPA= * * * =2.963 Thus predicted College GPA is Question: Clearly we cannot be 100% sure that an incoming student with these given credentials will have the College GPA of exactly Thus, the prediction 2.96 is only an approximation. But how accurate is this approximation? The regression table comes to the rescue: the number we use is highlighted above. Standard Error= In other words, statistical analysis implies that for these particular credentials we can expect that student s college GPA will be 2.96+/ Another way to state this: the predicted GPA is in the interval [2.37, 3.52]

36 A bit more practice Regression Statistics Multiple R R Square Adjusted R Square Standard Error Observations 100 ANOVA df SS MS F gnificance F Regression E-10 Residual Total Coefficientsandard Err t Stat P-value Lower 95%Upper 95%ower 95.0%pper 95.0% Intercept GPA High SAT Letters Q1. What would be the interpretation for the coefficient GPA High =0.376? A For each point increase in Highs School GPA we expect a point increase in College GPA. Q2. What is the interpretation of the number ? A. The intercept has no interpretation here (it is impossible to have a student with 0 high school GPA and zero SAT going to college).

37 A bit more practice Regression Statistics Multiple R R Square Adjusted R Square Standard Error Observations 100 ANOVA df SS MS F gnificance F Regression E-10 Residual Total Coefficientsandard Err t Stat P-value Lower 95%Upper 95%ower 95.0%pper 95.0% Intercept GPA High SAT Letters Imagine the following scenario: Student A s High School GPA is 1 point higher than Student B s. On the other hand, Student B s SAT score is higher by 200 points. Their letters are of the same strength. Q3. Which of the two students will have a higher predicted College GPA? A: Student A will gain points due to his high school GPA and student B will gain 200*0.0012=0.24 points due to his superior SAT. Overall, Student A will have predicted GPA higher by points (since =0.1363) Q4. What are the predicted GPA s for students A and B? A: It is impossible to state the predictions for students A and B since we do not know their actual credentials.

38 Using Sail boat data, we can make the following chart and the regression table. Regression Statistics Multiple R R Square Adjusted R Square Standard Error Observations 20 ANOVA df SS MS F nificance Regression E-09 Residual Total Observe: Chart The line equation on the chart is Y=1.0129X R^2 =08529 The chart offers visual information; we can actually see the dots (i.e. sail boats), the trend line and how well they fit. It does not extend beyond 1-dim input X Coefficientndard Er t Stat P-valueower 95% Intercept Feet E Table The intercept is and the coefficient next to Feet is Equation of the line: Y=1.0129X R Square= No visualization but it contains many more numbers, some of which we already used (and some of them we will use soon). Easily extends to d-dimensional input

39 More practice Regression Statistics Multiple R R Square Adjusted R Square Standard Error Observations 20 ANOVA df SS MS F nificance Regression E-09 Residual Total Coefficientndard Er t Stat P-valueower 95% Intercept Feet E Given the table and chart answer the following questions: Q1. What is the predicted weight for a sail boat that is 30 feet long? A: pounds = * = = * 1000 pounds = = about pounds Q2. This prediction comes with certain error estimate. What is it? In other words what is the interval prediction for this weight? A: the error is 4.41 thus the interval is [7960, 16780] pounds (remember, the units are in thousands of pounds and we truncated the decimals) =[ , ] =[7.9598, ] = [7.9598*1000, *1000] = [7959.8, ]= about [7960,16780] pounds!

40 More practice Regression Statistics Multiple R R Square Adjusted R Square Standard Error Observations 20 ANOVA df SS MS F nificance Regression E-09 Residual Total Coefficientndard Er t Stat P-valueower 95% Intercept Feet E Given the table and chart answer the following questions: Q3. What is the interpretation of the slope 1.01? A: For each foot increase the boat s weight increases by about 1010 pounds Q4. What is the interpretation for the intercept ? A: The intercept has no real life interpretation here (no sailboat is zero feet long).

Announcements: You can turn in homework until 6pm, slot on wall across from 2202 Bren. Make sure you use the correct slot! (Stats 8, closest to wall)

Announcements: You can turn in homework until 6pm, slot on wall across from 2202 Bren. Make sure you use the correct slot! (Stats 8, closest to wall) Announcements: You can turn in homework until 6pm, slot on wall across from 2202 Bren. Make sure you use the correct slot! (Stats 8, closest to wall) We will cover Chs. 5 and 6 first, then 3 and 4. Mon,

More information

q3_3 MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

q3_3 MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. q3_3 MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Provide an appropriate response. 1) In 2007, the number of wins had a mean of 81.79 with a standard

More information

Business Statistics. Lecture 9: Simple Regression

Business Statistics. Lecture 9: Simple Regression Business Statistics Lecture 9: Simple Regression 1 On to Model Building! Up to now, class was about descriptive and inferential statistics Numerical and graphical summaries of data Confidence intervals

More information

University of California, Berkeley, Statistics 131A: Statistical Inference for the Social and Life Sciences. Michael Lugo, Spring 2012

University of California, Berkeley, Statistics 131A: Statistical Inference for the Social and Life Sciences. Michael Lugo, Spring 2012 University of California, Berkeley, Statistics 3A: Statistical Inference for the Social and Life Sciences Michael Lugo, Spring 202 Solutions to Exam Friday, March 2, 202. [5: 2+2+] Consider the stemplot

More information

Math 147 Lecture Notes: Lecture 12

Math 147 Lecture Notes: Lecture 12 Math 147 Lecture Notes: Lecture 12 Walter Carlip February, 2018 All generalizations are false, including this one.. Samuel Clemens (aka Mark Twain) (1835-1910) Figures don t lie, but liars do figure. Samuel

More information

Statistics 100 Exam 2 March 8, 2017

Statistics 100 Exam 2 March 8, 2017 STAT 100 EXAM 2 Spring 2017 (This page is worth 1 point. Graded on writing your name and net id clearly and circling section.) PRINT NAME (Last name) (First name) net ID CIRCLE SECTION please! L1 (MWF

More information

FSA Algebra I End-of-Course Review Packet

FSA Algebra I End-of-Course Review Packet FSA Algebra I End-of-Course Review Packet Table of Contents MAFS.912.N-RN.1.2 EOC Practice... 3 MAFS.912.N-RN.2.3 EOC Practice... 5 MAFS.912.N-RN.1.1 EOC Practice... 8 MAFS.912.S-ID.1.1 EOC Practice...

More information

Review of Multiple Regression

Review of Multiple Regression Ronald H. Heck 1 Let s begin with a little review of multiple regression this week. Linear models [e.g., correlation, t-tests, analysis of variance (ANOVA), multiple regression, path analysis, multivariate

More information

Section 5: Dummy Variables and Interactions

Section 5: Dummy Variables and Interactions Section 5: Dummy Variables and Interactions Carlos M. Carvalho The University of Texas at Austin McCombs School of Business http://faculty.mccombs.utexas.edu/carlos.carvalho/teaching/ 1 Example: Detecting

More information

Chapter Learning Objectives. Regression Analysis. Correlation. Simple Linear Regression. Chapter 12. Simple Linear Regression

Chapter Learning Objectives. Regression Analysis. Correlation. Simple Linear Regression. Chapter 12. Simple Linear Regression Chapter 12 12-1 North Seattle Community College BUS21 Business Statistics Chapter 12 Learning Objectives In this chapter, you learn:! How to use regression analysis to predict the value of a dependent

More information

Chapter 16. Simple Linear Regression and Correlation

Chapter 16. Simple Linear Regression and Correlation Chapter 16 Simple Linear Regression and Correlation 16.1 Regression Analysis Our problem objective is to analyze the relationship between interval variables; regression analysis is the first tool we will

More information

MATH 1150 Chapter 2 Notation and Terminology

MATH 1150 Chapter 2 Notation and Terminology MATH 1150 Chapter 2 Notation and Terminology Categorical Data The following is a dataset for 30 randomly selected adults in the U.S., showing the values of two categorical variables: whether or not the

More information

Intro to Linear Regression

Intro to Linear Regression Intro to Linear Regression Introduction to Regression Regression is a statistical procedure for modeling the relationship among variables to predict the value of a dependent variable from one or more predictor

More information

Intro to Linear Regression

Intro to Linear Regression Intro to Linear Regression Introduction to Regression Regression is a statistical procedure for modeling the relationship among variables to predict the value of a dependent variable from one or more predictor

More information

5. Let W follow a normal distribution with mean of μ and the variance of 1. Then, the pdf of W is

5. Let W follow a normal distribution with mean of μ and the variance of 1. Then, the pdf of W is Practice Final Exam Last Name:, First Name:. Please write LEGIBLY. Answer all questions on this exam in the space provided (you may use the back of any page if you need more space). Show all work but do

More information

1 A Review of Correlation and Regression

1 A Review of Correlation and Regression 1 A Review of Correlation and Regression SW, Chapter 12 Suppose we select n = 10 persons from the population of college seniors who plan to take the MCAT exam. Each takes the test, is coached, and then

More information

AP Final Review II Exploring Data (20% 30%)

AP Final Review II Exploring Data (20% 30%) AP Final Review II Exploring Data (20% 30%) Quantitative vs Categorical Variables Quantitative variables are numerical values for which arithmetic operations such as means make sense. It is usually a measure

More information

Inferences for Regression

Inferences for Regression Inferences for Regression An Example: Body Fat and Waist Size Looking at the relationship between % body fat and waist size (in inches). Here is a scatterplot of our data set: Remembering Regression In

More information

Answer Key. 9.1 Scatter Plots and Linear Correlation. Chapter 9 Regression and Correlation. CK-12 Advanced Probability and Statistics Concepts 1

Answer Key. 9.1 Scatter Plots and Linear Correlation. Chapter 9 Regression and Correlation. CK-12 Advanced Probability and Statistics Concepts 1 9.1 Scatter Plots and Linear Correlation Answers 1. A high school psychologist wants to conduct a survey to answer the question: Is there a relationship between a student s athletic ability and his/her

More information

Section 3: Simple Linear Regression

Section 3: Simple Linear Regression Section 3: Simple Linear Regression Carlos M. Carvalho The University of Texas at Austin McCombs School of Business http://faculty.mccombs.utexas.edu/carlos.carvalho/teaching/ 1 Regression: General Introduction

More information

SMAM 319 Exam1 Name. a B.The equation of a line is 3x + y =6. The slope is a. -3 b.3 c.6 d.1/3 e.-1/3

SMAM 319 Exam1 Name. a B.The equation of a line is 3x + y =6. The slope is a. -3 b.3 c.6 d.1/3 e.-1/3 SMAM 319 Exam1 Name 1. Pick the best choice. (10 points-2 each) _c A. A data set consisting of fifteen observations has the five number summary 4 11 12 13 15.5. For this data set it is definitely true

More information

9. Linear Regression and Correlation

9. Linear Regression and Correlation 9. Linear Regression and Correlation Data: y a quantitative response variable x a quantitative explanatory variable (Chap. 8: Recall that both variables were categorical) For example, y = annual income,

More information

ECON 497 Midterm Spring

ECON 497 Midterm Spring ECON 497 Midterm Spring 2009 1 ECON 497: Economic Research and Forecasting Name: Spring 2009 Bellas Midterm You have three hours and twenty minutes to complete this exam. Answer all questions and explain

More information

12.7. Scattergrams and Correlation

12.7. Scattergrams and Correlation 12.7. Scattergrams and Correlation 1 Objectives A. Make a scattergram and determine if there is a positive, a negative, or no correlation for the data. B. Find and interpret the coefficient of correlation

More information

Final Exam - Solutions

Final Exam - Solutions Ecn 102 - Analysis of Economic Data University of California - Davis March 17, 2010 Instructor: John Parman Final Exam - Solutions You have until 12:30pm to complete this exam. Please remember to put your

More information

ST430 Exam 1 with Answers

ST430 Exam 1 with Answers ST430 Exam 1 with Answers Date: October 5, 2015 Name: Guideline: You may use one-page (front and back of a standard A4 paper) of notes. No laptop or textook are permitted but you may use a calculator.

More information

Additional practice with these ideas can be found in the problems for Tintle Section P.1.1

Additional practice with these ideas can be found in the problems for Tintle Section P.1.1 Psych 10 / Stats 60, Practice Problem Set 3 (Week 3 Material) Part 1: Decide if each variable below is quantitative, ordinal, or categorical. If the variable is categorical, also decide whether or not

More information

Chapter 8. Linear Regression /71

Chapter 8. Linear Regression /71 Chapter 8 Linear Regression 1 /71 Homework p192 1, 2, 3, 5, 7, 13, 15, 21, 27, 28, 29, 32, 35, 37 2 /71 3 /71 Objectives Determine Least Squares Regression Line (LSRL) describing the association of two

More information

4.1 Introduction. 4.2 The Scatter Diagram. Chapter 4 Linear Correlation and Regression Analysis

4.1 Introduction. 4.2 The Scatter Diagram. Chapter 4 Linear Correlation and Regression Analysis 4.1 Introduction Correlation is a technique that measures the strength (or the degree) of the relationship between two variables. For example, we could measure how strong the relationship is between people

More information

LAB 5 INSTRUCTIONS LINEAR REGRESSION AND CORRELATION

LAB 5 INSTRUCTIONS LINEAR REGRESSION AND CORRELATION LAB 5 INSTRUCTIONS LINEAR REGRESSION AND CORRELATION In this lab you will learn how to use Excel to display the relationship between two quantitative variables, measure the strength and direction of the

More information

Lesson 4 Linear Functions and Applications

Lesson 4 Linear Functions and Applications In this lesson, we take a close look at Linear Functions and how real world situations can be modeled using Linear Functions. We study the relationship between Average Rate of Change and Slope and how

More information

Mathematics for Economics MA course

Mathematics for Economics MA course Mathematics for Economics MA course Simple Linear Regression Dr. Seetha Bandara Simple Regression Simple linear regression is a statistical method that allows us to summarize and study relationships between

More information

Stat 101 L: Laboratory 5

Stat 101 L: Laboratory 5 Stat 101 L: Laboratory 5 The first activity revisits the labeling of Fun Size bags of M&Ms by looking distributions of Total Weight of Fun Size bags and regular size bags (which have a label weight) of

More information

Final Exam - Solutions

Final Exam - Solutions Ecn 102 - Analysis of Economic Data University of California - Davis March 19, 2010 Instructor: John Parman Final Exam - Solutions You have until 5:30pm to complete this exam. Please remember to put your

More information

Section 2.5 from Precalculus was developed by OpenStax College, licensed by Rice University, and is available on the Connexions website.

Section 2.5 from Precalculus was developed by OpenStax College, licensed by Rice University, and is available on the Connexions website. Section 2.5 from Precalculus was developed by OpenStax College, licensed by Rice University, and is available on the Connexions website. It is used under a Creative Commons Attribution-NonCommercial- ShareAlike

More information

Chapter 16. Simple Linear Regression and dcorrelation

Chapter 16. Simple Linear Regression and dcorrelation Chapter 16 Simple Linear Regression and dcorrelation 16.1 Regression Analysis Our problem objective is to analyze the relationship between interval variables; regression analysis is the first tool we will

More information

Examining Relationships. Chapter 3

Examining Relationships. Chapter 3 Examining Relationships Chapter 3 Scatterplots A scatterplot shows the relationship between two quantitative variables measured on the same individuals. The explanatory variable, if there is one, is graphed

More information

CRP 272 Introduction To Regression Analysis

CRP 272 Introduction To Regression Analysis CRP 272 Introduction To Regression Analysis 30 Relationships Among Two Variables: Interpretations One variable is used to explain another variable X Variable Independent Variable Explaining Variable Exogenous

More information

STAT 350 Final (new Material) Review Problems Key Spring 2016

STAT 350 Final (new Material) Review Problems Key Spring 2016 1. The editor of a statistics textbook would like to plan for the next edition. A key variable is the number of pages that will be in the final version. Text files are prepared by the authors using LaTeX,

More information

Stat 20 Midterm 1 Review

Stat 20 Midterm 1 Review Stat 20 Midterm Review February 7, 2007 This handout is intended to be a comprehensive study guide for the first Stat 20 midterm exam. I have tried to cover all the course material in a way that targets

More information

Lecture 4 Scatterplots, Association, and Correlation

Lecture 4 Scatterplots, Association, and Correlation Lecture 4 Scatterplots, Association, and Correlation Previously, we looked at Single variables on their own One or more categorical variable In this lecture: We shall look at two quantitative variables.

More information

Lecture 4 Scatterplots, Association, and Correlation

Lecture 4 Scatterplots, Association, and Correlation Lecture 4 Scatterplots, Association, and Correlation Previously, we looked at Single variables on their own One or more categorical variables In this lecture: We shall look at two quantitative variables.

More information

3.2: Least Squares Regressions

3.2: Least Squares Regressions 3.2: Least Squares Regressions Section 3.2 Least-Squares Regression After this section, you should be able to INTERPRET a regression line CALCULATE the equation of the least-squares regression line CALCULATE

More information

Copyright, Nick E. Nolfi MPM1D9 Unit 6 Statistics (Data Analysis) STA-1

Copyright, Nick E. Nolfi MPM1D9 Unit 6 Statistics (Data Analysis) STA-1 UNIT 6 STATISTICS (DATA ANALYSIS) UNIT 6 STATISTICS (DATA ANALYSIS)... 1 INTRODUCTION TO STATISTICS... 2 UNDERSTANDING STATISTICS REQUIRES A CHANGE IN MINDSET... 2 UNDERSTANDING SCATTER PLOTS #1... 3 UNDERSTANDING

More information

Ch. 16: Correlation and Regression

Ch. 16: Correlation and Regression Ch. 1: Correlation and Regression With the shift to correlational analyses, we change the very nature of the question we are asking of our data. Heretofore, we were asking if a difference was likely to

More information

Inference with Simple Regression

Inference with Simple Regression 1 Introduction Inference with Simple Regression Alan B. Gelder 06E:071, The University of Iowa 1 Moving to infinite means: In this course we have seen one-mean problems, twomean problems, and problems

More information

Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model

Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model EPSY 905: Multivariate Analysis Lecture 1 20 January 2016 EPSY 905: Lecture 1 -

More information

Scatterplots and Correlation

Scatterplots and Correlation Bivariate Data Page 1 Scatterplots and Correlation Essential Question: What is the correlation coefficient and what does it tell you? Most statistical studies examine data on more than one variable. Fortunately,

More information

Talking feet: Scatterplots and lines of best fit

Talking feet: Scatterplots and lines of best fit Talking feet: Scatterplots and lines of best fit Student worksheet What does your foot say about your height? Can you predict people s height by how long their feet are? If a Grade 10 student s foot is

More information

Can you tell the relationship between students SAT scores and their college grades?

Can you tell the relationship between students SAT scores and their college grades? Correlation One Challenge Can you tell the relationship between students SAT scores and their college grades? A: The higher SAT scores are, the better GPA may be. B: The higher SAT scores are, the lower

More information

Univariate analysis. Simple and Multiple Regression. Univariate analysis. Simple Regression How best to summarise the data?

Univariate analysis. Simple and Multiple Regression. Univariate analysis. Simple Regression How best to summarise the data? Univariate analysis Example - linear regression equation: y = ax + c Least squares criteria ( yobs ycalc ) = yobs ( ax + c) = minimum Simple and + = xa xc xy xa + nc = y Solve for a and c Univariate analysis

More information

This module focuses on the logic of ANOVA with special attention given to variance components and the relationship between ANOVA and regression.

This module focuses on the logic of ANOVA with special attention given to variance components and the relationship between ANOVA and regression. WISE ANOVA and Regression Lab Introduction to the WISE Correlation/Regression and ANOVA Applet This module focuses on the logic of ANOVA with special attention given to variance components and the relationship

More information

MINI LESSON. Lesson 2a Linear Functions and Applications

MINI LESSON. Lesson 2a Linear Functions and Applications MINI LESSON Lesson 2a Linear Functions and Applications Lesson Objectives: 1. Compute AVERAGE RATE OF CHANGE 2. Explain the meaning of AVERAGE RATE OF CHANGE as it relates to a given situation 3. Interpret

More information

MATH 2200 PROBABILITY AND STATISTICS M2200FL081.1

MATH 2200 PROBABILITY AND STATISTICS M2200FL081.1 MATH 2200 PROBABILITY AND STATISTICS M2200FL081.1 In almost all problems, I have given the answers to four significant digits. If your answer is slightly different from one of mine, consider that to be

More information

Using a Graphing Calculator

Using a Graphing Calculator Using a Graphing Calculator Unit 1 Assignments Bridge to Geometry Name Date Period Warm Ups Name Period Date Friday Directions: Today s Date Tuesday Directions: Today s Date Wednesday Directions: Today

More information

Keller: Stats for Mgmt & Econ, 7th Ed July 17, 2006

Keller: Stats for Mgmt & Econ, 7th Ed July 17, 2006 Chapter 17 Simple Linear Regression and Correlation 17.1 Regression Analysis Our problem objective is to analyze the relationship between interval variables; regression analysis is the first tool we will

More information

Regression Models REVISED TEACHING SUGGESTIONS ALTERNATIVE EXAMPLES

Regression Models REVISED TEACHING SUGGESTIONS ALTERNATIVE EXAMPLES M04_REND6289_10_IM_C04.QXD 5/7/08 2:49 PM Page 46 4 C H A P T E R Regression Models TEACHING SUGGESTIONS Teaching Suggestion 4.1: Which Is the Independent Variable? We find that students are often confused

More information

MEP Y7 Practice Book B

MEP Y7 Practice Book B 8 Quantitative Data 8. Presentation In this section we look at how vertical line diagrams can be used to display discrete quantitative data. (Remember that discrete data can only take specific numerical

More information

Do Now 18 Balance Point. Directions: Use the data table to answer the questions. 2. Explain whether it is reasonable to fit a line to the data.

Do Now 18 Balance Point. Directions: Use the data table to answer the questions. 2. Explain whether it is reasonable to fit a line to the data. Do Now 18 Do Now 18 Balance Point Directions: Use the data table to answer the questions. 1. Calculate the balance point.. Explain whether it is reasonable to fit a line to the data.. The data is plotted

More information

PS2: Two Variable Statistics

PS2: Two Variable Statistics PS2: Two Variable Statistics LT2: Measuring Correlation and Line of best fit by eye. LT3: Linear regression LT4: The χ 2 test of independence. 1 Pearson's Correlation Coefficient In examinations you are

More information

H l o t lol t M t c M D gc o ed u o g u al a 1 g A al lg Al e g b e r r 1 a

H l o t lol t M t c M D gc o ed u o g u al a 1 g A al lg Al e g b e r r 1 a Holt Algebra McDougal 1 Algebra 1 Warm Up Line of Best Fit Identify the slope and the y-intercept. 1. y = -2x + 1 m = -2, b = 1 2. y = 2 2 x - 4 m=, b = -4 3 3 Identify the correlation (positive, negative,

More information

Honors Algebra 1 - Fall Final Review

Honors Algebra 1 - Fall Final Review Name: Period Date: Honors Algebra 1 - Fall Final Review This review packet is due at the beginning of your final exam. In addition to this packet, you should study each of your unit reviews and your notes.

More information

Chapter 3: Examining Relationships Review Sheet

Chapter 3: Examining Relationships Review Sheet Review Sheet 1. A study is conducted to determine if one can predict the yield of a crop based on the amount of yearly rainfall. The response variable in this study is A) the yield of the crop. D) either

More information

Complete Week 9 Package

Complete Week 9 Package Complete Week 9 Package Algebra1Teachers @ 2015 Table of Contents Unit 3 Pacing Chart -------------------------------------------------------------------------------------------- 1 Day 41 Bellringer --------------------------------------------------------------------------------------------

More information

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 y 1 2 3 4 5 6 7 x Data Analysis and Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasini/teaching.html Lecture 32 Suhasini Subba Rao Previous lecture We are interested in whether a dependent

More information

Business Statistics 41000: Homework # 5

Business Statistics 41000: Homework # 5 Business Statistics 41000: Homework # 5 Drew Creal Due date: Beginning of class in week # 10 Remarks: These questions cover Lectures #7, 8, and 9. Question # 1. Condence intervals and plug-in predictive

More information

Correlation and Linear Regression

Correlation and Linear Regression Correlation and Linear Regression Correlation: Relationships between Variables So far, nearly all of our discussion of inferential statistics has focused on testing for differences between group means

More information

Lecture 18 MA Applied Statistics II D 2004

Lecture 18 MA Applied Statistics II D 2004 Lecture 18 MA 2612 - Applied Statistics II D 2004 Today 1. Examples of multiple linear regression 2. The modeling process (PNC 8.4) 3. The graphical exploration of multivariable data (PNC 8.5) 4. Fitting

More information

Chapter 6 The Standard Deviation as a Ruler and the Normal Model

Chapter 6 The Standard Deviation as a Ruler and the Normal Model Chapter 6 The Standard Deviation as a Ruler and the Normal Model Overview Key Concepts Understand how adding (subtracting) a constant or multiplying (dividing) by a constant changes the center and/or spread

More information

ECON3150/4150 Spring 2015

ECON3150/4150 Spring 2015 ECON3150/4150 Spring 2015 Lecture 3&4 - The linear regression model Siv-Elisabeth Skjelbred University of Oslo January 29, 2015 1 / 67 Chapter 4 in S&W Section 17.1 in S&W (extended OLS assumptions) 2

More information

FSA Algebra I End-of-Course Review Packet

FSA Algebra I End-of-Course Review Packet FSA Algebra I End-of-Course Review Packet Table of Contents MAFS.912.N-RN.1.2 EOC Practice... 3 MAFS.912.N-RN.2.3 EOC Practice... 5 MAFS.912.N-RN.1.1 EOC Practice... 8 MAFS.912.S-ID.1.1 EOC Practice...

More information

HUDM4122 Probability and Statistical Inference. February 2, 2015

HUDM4122 Probability and Statistical Inference. February 2, 2015 HUDM4122 Probability and Statistical Inference February 2, 2015 Special Session on SPSS Thursday, April 23 4pm-6pm As of when I closed the poll, every student except one could make it to this I am happy

More information

Correlation and Regression

Correlation and Regression Correlation and Regression Dr. Bob Gee Dean Scott Bonney Professor William G. Journigan American Meridian University 1 Learning Objectives Upon successful completion of this module, the student should

More information

Regression of Inflation on Percent M3 Change

Regression of Inflation on Percent M3 Change ECON 497 Final Exam Page of ECON 497: Economic Research and Forecasting Name: Spring 2006 Bellas Final Exam Return this exam to me by midnight on Thursday, April 27. It may be e-mailed to me. It may be

More information

Interactions. Interactions. Lectures 1 & 2. Linear Relationships. y = a + bx. Slope. Intercept

Interactions. Interactions. Lectures 1 & 2. Linear Relationships. y = a + bx. Slope. Intercept Interactions Lectures 1 & Regression Sometimes two variables appear related: > smoking and lung cancers > height and weight > years of education and income > engine size and gas mileage > GMAT scores and

More information

Scatterplots. 3.1: Scatterplots & Correlation. Scatterplots. Explanatory & Response Variables. Section 3.1 Scatterplots and Correlation

Scatterplots. 3.1: Scatterplots & Correlation. Scatterplots. Explanatory & Response Variables. Section 3.1 Scatterplots and Correlation 3.1: Scatterplots & Correlation Scatterplots A scatterplot shows the relationship between two quantitative variables measured on the same individuals. The values of one variable appear on the horizontal

More information

Unit 6 - Introduction to linear regression

Unit 6 - Introduction to linear regression Unit 6 - Introduction to linear regression Suggested reading: OpenIntro Statistics, Chapter 7 Suggested exercises: Part 1 - Relationship between two numerical variables: 7.7, 7.9, 7.11, 7.13, 7.15, 7.25,

More information

Lecture 1: Description of Data. Readings: Sections 1.2,

Lecture 1: Description of Data. Readings: Sections 1.2, Lecture 1: Description of Data Readings: Sections 1.,.1-.3 1 Variable Example 1 a. Write two complete and grammatically correct sentences, explaining your primary reason for taking this course and then

More information

Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model

Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 1: August 22, 2012

More information

appstats27.notebook April 06, 2017

appstats27.notebook April 06, 2017 Chapter 27 Objective Students will conduct inference on regression and analyze data to write a conclusion. Inferences for Regression An Example: Body Fat and Waist Size pg 634 Our chapter example revolves

More information

Correlation & Simple Regression

Correlation & Simple Regression Chapter 11 Correlation & Simple Regression The previous chapter dealt with inference for two categorical variables. In this chapter, we would like to examine the relationship between two quantitative variables.

More information

MAT 0022C/0028C Final Exam Review. BY: West Campus Math Center

MAT 0022C/0028C Final Exam Review. BY: West Campus Math Center MAT 0022C/0028C Final Exam Review BY: West Campus Math Center Factoring Topics #1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18 Problem Solving (Word Problems) #19, 20, 21, 22, 23, 24, 25,

More information

Analysis of Bivariate Data

Analysis of Bivariate Data Analysis of Bivariate Data Data Two Quantitative variables GPA and GAES Interest rates and indices Tax and fund allocation Population size and prison population Bivariate data (x,y) Case corr&reg 2 Independent

More information

Chapter 3: Examining Relationships

Chapter 3: Examining Relationships Chapter 3 Review Chapter 3: Examining Relationships 1. A study is conducted to determine if one can predict the yield of a crop based on the amount of yearly rainfall. The response variable in this study

More information

LECTURE 15: SIMPLE LINEAR REGRESSION I

LECTURE 15: SIMPLE LINEAR REGRESSION I David Youngberg BSAD 20 Montgomery College LECTURE 5: SIMPLE LINEAR REGRESSION I I. From Correlation to Regression a. Recall last class when we discussed two basic types of correlation (positive and negative).

More information

Unit 8: Exponential & Logarithmic Functions

Unit 8: Exponential & Logarithmic Functions Date Period Unit 8: Eponential & Logarithmic Functions DAY TOPIC ASSIGNMENT 1 8.1 Eponential Growth Pg 47 48 #1 15 odd; 6, 54, 55 8.1 Eponential Decay Pg 47 48 #16 all; 5 1 odd; 5, 7 4 all; 45 5 all 4

More information

Complete Week 8 Package

Complete Week 8 Package Complete Week 8 Package Algebra1Teachers @ 2015 Table of Contents Unit 3 Pacing Chart -------------------------------------------------------------------------------------------- 1 Lesson Plans --------------------------------------------------------------------------------------------

More information

1. Use Scenario 3-1. In this study, the response variable is

1. Use Scenario 3-1. In this study, the response variable is Chapter 8 Bell Work Scenario 3-1 The height (in feet) and volume (in cubic feet) of usable lumber of 32 cherry trees are measured by a researcher. The goal is to determine if volume of usable lumber can

More information

Linear Regression. Linear Regression. Linear Regression. Did You Mean Association Or Correlation?

Linear Regression. Linear Regression. Linear Regression. Did You Mean Association Or Correlation? Did You Mean Association Or Correlation? AP Statistics Chapter 8 Be careful not to use the word correlation when you really mean association. Often times people will incorrectly use the word correlation

More information

Chapter 27 Summary Inferences for Regression

Chapter 27 Summary Inferences for Regression Chapter 7 Summary Inferences for Regression What have we learned? We have now applied inference to regression models. Like in all inference situations, there are conditions that we must check. We can test

More information

This document contains 3 sets of practice problems.

This document contains 3 sets of practice problems. P RACTICE PROBLEMS This document contains 3 sets of practice problems. Correlation: 3 problems Regression: 4 problems ANOVA: 8 problems You should print a copy of these practice problems and bring them

More information

Econometrics Homework 1

Econometrics Homework 1 Econometrics Homework Due Date: March, 24. by This problem set includes questions for Lecture -4 covered before midterm exam. Question Let z be a random column vector of size 3 : z = @ (a) Write out z

More information

Algebra 1. Statistics and the Number System Day 3

Algebra 1. Statistics and the Number System Day 3 Algebra 1 Statistics and the Number System Day 3 MAFS.912. N-RN.1.2 Which expression is equivalent to 5 m A. m 1 5 B. m 5 C. m 1 5 D. m 5 A MAFS.912. N-RN.1.2 Which expression is equivalent to 5 3 g A.

More information

STATISTICS 141 Final Review

STATISTICS 141 Final Review STATISTICS 141 Final Review Bin Zou bzou@ualberta.ca Department of Mathematical & Statistical Sciences University of Alberta Winter 2015 Bin Zou (bzou@ualberta.ca) STAT 141 Final Review Winter 2015 1 /

More information

Further Mathematics 2018 CORE: Data analysis Chapter 2 Summarising numerical data

Further Mathematics 2018 CORE: Data analysis Chapter 2 Summarising numerical data Chapter 2: Summarising numerical data Further Mathematics 2018 CORE: Data analysis Chapter 2 Summarising numerical data Extract from Study Design Key knowledge Types of data: categorical (nominal and ordinal)

More information

Chapter 12 : Linear Correlation and Linear Regression

Chapter 12 : Linear Correlation and Linear Regression Chapter 1 : Linear Correlation and Linear Regression Determining whether a linear relationship exists between two quantitative variables, and modeling the relationship with a line, if the linear relationship

More information

Simple Linear Regression Using Ordinary Least Squares

Simple Linear Regression Using Ordinary Least Squares Simple Linear Regression Using Ordinary Least Squares Purpose: To approximate a linear relationship with a line. Reason: We want to be able to predict Y using X. Definition: The Least Squares Regression

More information

Chapter 7. Linear Regression (Pt. 1) 7.1 Introduction. 7.2 The Least-Squares Regression Line

Chapter 7. Linear Regression (Pt. 1) 7.1 Introduction. 7.2 The Least-Squares Regression Line Chapter 7 Linear Regression (Pt. 1) 7.1 Introduction Recall that r, the correlation coefficient, measures the linear association between two quantitative variables. Linear regression is the method of fitting

More information

Graphing Equations in Slope-Intercept Form 4.1. Positive Slope Negative Slope 0 slope No Slope

Graphing Equations in Slope-Intercept Form 4.1. Positive Slope Negative Slope 0 slope No Slope Slope-Intercept Form y = mx + b m = slope b = y-intercept Graphing Equations in Slope-Intercept Form 4.1 Positive Slope Negative Slope 0 slope No Slope Example 1 Write an equation in slope-intercept form

More information

AP Statistics - Chapter 2A Extra Practice

AP Statistics - Chapter 2A Extra Practice AP Statistics - Chapter 2A Extra Practice 1. A study is conducted to determine if one can predict the yield of a crop based on the amount of yearly rainfall. The response variable in this study is A) yield

More information