Information Sources. Class webpage (also linked to my.ucdavis page for the class):

Similar documents
Recall, Positive/Negative Association:

HOMEWORK (due Wed, Jan 23): Chapter 3: #42, 48, 74

Announcements: You can turn in homework until 6pm, slot on wall across from 2202 Bren. Make sure you use the correct slot! (Stats 8, closest to wall)

Unit 6 - Introduction to linear regression

AP Statistics L I N E A R R E G R E S S I O N C H A P 7

Chapter 6. September 17, Please pick up a calculator and take out paper and something to write with. Association and Correlation.

Unit 6 - Simple linear regression

Linear Regression. Linear Regression. Linear Regression. Did You Mean Association Or Correlation?

Answer Key. 9.1 Scatter Plots and Linear Correlation. Chapter 9 Regression and Correlation. CK-12 Advanced Probability and Statistics Concepts 1

appstats8.notebook October 11, 2016

Intro to Linear Regression

Intro to Linear Regression

Warm-up Using the given data Create a scatterplot Find the regression line

STA Module 5 Regression and Correlation. Learning Objectives. Learning Objectives (Cont.) Upon completing this module, you should be able to:

Simple Linear Regression

Chapter Goals. To understand the methods for displaying and describing relationship among variables. Formulate Theories.

Probability Distributions

Chapter 8. Linear Regression. The Linear Model. Fat Versus Protein: An Example. The Linear Model (cont.) Residuals

Chapter 7. Scatterplots, Association, and Correlation. Copyright 2010 Pearson Education, Inc.

Chapter 8. Linear Regression. Copyright 2010 Pearson Education, Inc.

Chapter 27 Summary Inferences for Regression

Chapter 5 Friday, May 21st

Simple Linear Regression Using Ordinary Least Squares

Midterm 2 - Solutions

Data Analysis and Statistical Methods Statistics 651

Business Statistics. Lecture 9: Simple Regression

Copyright, Nick E. Nolfi MPM1D9 Unit 6 Statistics (Data Analysis) STA-1

Chapter 3: Describing Relationships

AP Statistics Unit 6 Note Packet Linear Regression. Scatterplots and Correlation

Inferences for Regression

Multiple Regression. Inference for Multiple Regression and A Case Study. IPS Chapters 11.1 and W.H. Freeman and Company

1. Create a scatterplot of this data. 2. Find the correlation coefficient.

Objectives Simple linear regression. Statistical model for linear regression. Estimating the regression parameters

Scatterplots and Correlation

Correlation and Linear Regression

Mrs. Poyner/Mr. Page Chapter 3 page 1

Chapter 4 Describing the Relation between Two Variables

Section 3.3. How Can We Predict the Outcome of a Variable? Agresti/Franklin Statistics, 1of 18

BNAD 276 Lecture 10 Simple Linear Regression Model

Describing Bivariate Data

SECTION I Number of Questions 42 Percent of Total Grade 50

Chapter 10 Correlation and Regression

UNIT 12 ~ More About Regression

ECON 497 Midterm Spring

Ecn Analysis of Economic Data University of California - Davis February 23, 2010 Instructor: John Parman. Midterm 2. Name: ID Number: Section:

Chapter 7 Summary Scatterplots, Association, and Correlation

Lecture 18: Simple Linear Regression

Chapter 10. Regression. Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania

Correlation and Regression

3.1 Scatterplots and Correlation

10.1: Scatter Plots & Trend Lines. Essential Question: How can you describe the relationship between two variables and use it to make predictions?

appstats27.notebook April 06, 2017

Ch 13 & 14 - Regression Analysis

Answer Key: Problem Set 6

Chapter 5 Least Squares Regression

9. Linear Regression and Correlation

Chapter 8. Linear Regression /71

Chapter 4. Regression Models. Learning Objectives

Chapter 12: Estimation

Chapter 19 Sir Migo Mendoza

y = a + bx 12.1: Inference for Linear Regression Review: General Form of Linear Regression Equation Review: Interpreting Computer Regression Output

The response variable depends on the explanatory variable.

Chapter 14. Statistical versus Deterministic Relationships. Distance versus Speed. Describing Relationships: Scatterplots and Correlation

THE PEARSON CORRELATION COEFFICIENT

Regression Analysis and Forecasting Prof. Shalabh Department of Mathematics and Statistics Indian Institute of Technology-Kanpur

In-Class Activities Activity 26-1: House Prices

Comparing Quantitative Variables

Chapter 2: Looking at Data Relationships (Part 3)

Regression and Correlation

28. SIMPLE LINEAR REGRESSION III

Overview. Overview. Overview. Specific Examples. General Examples. Bivariate Regression & Correlation

LECTURE 15: SIMPLE LINEAR REGRESSION I

Chapter 10-Regression

Review of Multiple Regression

Unit 5: Regression. Marius Ionescu 09/22/2011

SMAM 314 Exam 42 Name

AP Statistics. Chapter 6 Scatterplots, Association, and Correlation

Math 147 Lecture Notes: Lecture 12

Conditions for Regression Inference:

(ii) Scan your answer sheets INTO ONE FILE only, and submit it in the drop-box.

Essential Statistics. Gould Ryan Wong

t-test for b Copyright 2000 Tom Malloy. All rights reserved. Regression

AMS 7 Correlation and Regression Lecture 8

Chapter 6: Exploring Data: Relationships Lesson Plan

Objectives. 2.3 Least-squares regression. Regression lines. Prediction and Extrapolation. Correlation and r 2. Transforming relationships

Lecture 14. Analysis of Variance * Correlation and Regression. The McGraw-Hill Companies, Inc., 2000

Lecture 14. Outline. Outline. Analysis of Variance * Correlation and Regression Analysis of Variance (ANOVA)

Overview. 4.1 Tables and Graphs for the Relationship Between Two Variables. 4.2 Introduction to Correlation. 4.3 Introduction to Regression 3.

Introduction to Uncertainty and Treatment of Data

REVIEW 8/2/2017 陈芳华东师大英语系

3.2: Least Squares Regressions

Lecture 4 Scatterplots, Association, and Correlation

Lecture 4 Scatterplots, Association, and Correlation

Stat 20 Midterm 1 Review

Chapter 5: Ordinary Least Squares Estimation Procedure The Mechanics Chapter 5 Outline Best Fitting Line Clint s Assignment Simple Regression Model o

Chapter 3: Describing Relationships

1. Use Scenario 3-1. In this study, the response variable is

Sampling, Frequency Distributions, and Graphs (12.1)

Ch. 3 Review - LSRL AP Stats

Chapter 7 Linear Regression

Transcription:

STATISTICS 108 Outline for today: Go over syllabus Provide requested information I will hand out blank paper and ask questions Brief introduction and hands-on activity

Information Sources Class webpage (also linked to my.ucdavis page for the class): http://stat.ucdavis.edu/~utts/st108 Includes link to syllabus Syllabus.pdf

Purple Paper Please provide the following information: 1. Name 2. Major 3. Year in school 4. Something interesting about yourself 5. Why you are taking this class

Purple paper, continued 6. On a 1 to 5 scale, how familiar and comfortable are you with these? 1=none, 5 = completely a. Summation notation b. Normal distribution c. Hypothesis testing d. Confidence intervals e. Sampling distributions f. Scatter plots and simple linear regression

Purple paper, continued 7. The following data: a. Your height, in inches b. Your handspan in centimeters, defined as the distance covered on the ruler by your stretched hand from the tip of the thumb to the tip of the small finger. c. Your residual (to be explained!)

Regression Used to describe the relationship between a response variable and one or more predictor variables. Used to predict a future response using known, current values of the predictors. Switch to power point slides from Brooks/Cole to accompany Mind On Statistics by Utts/Heckard

IMPORTANT NOTE The remaining slides are from Power point presentations to accompany Mind on Statistics, by Utts and Heckard and are copyright Brooks/Cole. They are not to be copied or used for purposes other than this class.

Three Tools we will use Scatterplot, a two-dimensional graph of data values Correlation, a statistic that measures the strength and direction of a linear relationship Regression equation, an equation that describes the average relationship between a response and explanatory variable Copyright 2004 Brooks/Cole, a division of Thomson Learning, Inc. 1

5.1 Looking for Patterns with Scatterplots Questions to Ask about a Scatterplot What is the average pattern? Does it look like a straight line or is it curved? What is the direction of the pattern? How much do individual points vary from the average pattern? Are there any unusual data points? Copyright 2004 Brooks/Cole, a division of Thomson Learning, Inc. 2

Positive/Negative Association Two variables have a positive association when the values of one variable tend to increase as the values of the other variable increase. Two variables have a negative association when the values of one variable tend to decrease as the values of the other variable increase. Copyright 2004 Brooks/Cole, a division of Thomson Learning, Inc. 3

Example 5.1 Height and Handspan Data: Height (in.) Span (cm) 71 23.5 69 22.0 66 18.5 64 20.5 71 21.0 72 24.0 67 19.5 65 20.5 76 24.5 67 20.0 70 23.0 62 17.0 and so on, for n = 167 observations. Data shown are the first 12 observations of a data set that includes the heights (in inches) and fully stretched handspans (in centimeters) of 167 college students. Copyright 2004 Brooks/Cole, a division of Thomson Learning, Inc. 4

Example 5.1 Height and Handspan Taller people tend to have greater handspan measurements than shorter people do. When two variables tend to increase together, we say that they have a positive association. The handspan and height measurements may have a linear relationship. Copyright 2004 Brooks/Cole, a division of Thomson Learning, Inc. 5

Example 5.2 Driver Age and Maximum Legibility Distance of Highway Signs A research firm determined the maximum distance at which each of 30 drivers could read a newly designed sign. The 30 participants in the study ranged in age from 18 to 82 years old. We want to examine the relationship between age and the sign legibility distance. Copyright 2004 Brooks/Cole, a division of Thomson Learning, Inc. 6

Example 5.2 Driver Age and Maximum Legibility Distance of Highway Signs We see a negative association with a linear pattern. We will use a straight-line equation to model this relationship. Copyright 2004 Brooks/Cole, a division of Thomson Learning, Inc. 7

Example 5.3 The Development of Musical Preferences The 108 participants in the study ranged in age from 16 to 86 years old. We want to examine the relationship between song-specific age (age in the year the song was popular) and musical preference (positive score => above average, negative score => below average). Copyright 2004 Brooks/Cole, a division of Thomson Learning, Inc. 8

Example 5.3 The Development of Musical Preferences Popular music preferences acquired in late adolescence and early adulthood. The association is nonlinear. Copyright 2004 Brooks/Cole, a division of Thomson Learning, Inc. 9

5.2 Describing Linear Patterns with a Regression Line When the best equation for describing the relationship between x and y is a straight line, the equation is called the regression line. Two purposes of the regression line: to estimate the average value of y at any specified value of x to predict the value of y for an individual, given that individual s x value Copyright 2004 Brooks/Cole, a division of Thomson Learning, Inc. 10

Example 5.1 Height and Handspan (cont) Regression equation: Handspan = -3 + 0.35 Height Estimate the average handspan for people 60 inches tall: Average handspan = -3 + 0.35(60) = 18 cm. Predict the handspan for someone who is 60 inches tall: Predicted handspan = -3 + 0.35(60) = 18 cm. Copyright 2004 Brooks/Cole, a division of Thomson Learning, Inc. 11

Example 5.1 Height and Handspan (cont) Regression equation: Handspan = -3 + 0.35 Height Slope = 0.35 => Handspan increases by 0.35 cm, on average, for each increase of 1 inch in height. In a statistical relationship, there is variation from the average pattern. Copyright 2004 Brooks/Cole, a division of Thomson Learning, Inc. 12

The Equation for the Regression Line ŷ yˆ = b + b x 0 1 is spoken as y-hat, and it is also referred to either as predicted y or estimated y. b 0 is the intercept of the straight line. The intercept is the value of y when x = 0. b 1 is the slope of the straight line. The slope tells us how much of an increase (or decrease) there is for the y variable when the x variable increases by one unit. The sign of the slope tells us whether y increases or decreases when x increases. Copyright 2004 Brooks/Cole, a division of Thomson Learning, Inc. 13

Prediction Errors and Residuals Prediction Error = difference between the observed value of y and the predicted value ŷ. Residual = ( y yˆ ) Copyright 2004 Brooks/Cole, a division of Thomson Learning, Inc. 14

Let s predict your handspan Record these on your purple sheet Regression equation: yˆ = b0 + b1 x Handspan (cm) = -3 + 0.35 Height (inches) Calculate your predicted handspan: Examples: -3 + (.35)(60 inches) = 18 cm -3 + (.35)(65 inches) = 19.75 cm -3 + (.35)(70 inches) = 21.5 cm Find your residual: (actual handspan predicted handspan) Copyright 2004 Brooks/Cole, a division of Thomson Learning, Inc. 15

5.3 Measuring Strength and Direction with Correlation Correlation r indicates the strength and the direction of a straight-line relationship. The strength of the relationship is determined by the closeness of the points to a straight line. The direction is determined by whether one variable generally increases or generally decreases when the other variable increases. Copyright 2004 Brooks/Cole, a division of Thomson Learning, Inc. 16

Example 5.1 Height and Handspan (cont) Regression equation: Handspan = -3 + 0.35 Height Correlation r = +0.74 => a somewhat strong positive linear relationship. Copyright 2004 Brooks/Cole, a division of Thomson Learning, Inc. 17

Example 5.2 Driver Age and Maximum Legibility Distance of Highway Signs (cont) Regression equation: Distance = 577-3 Age Correlation r = -0.8 => a somewhat strong negative linear association. Copyright 2004 Brooks/Cole, a division of Thomson Learning, Inc. 18

Example 5.6 Left and Right Handspans If you know the span of a person s right hand, can you accurately predict his/her left handspan? Correlation r = +0.95 => a very strong positive linear relationship. Copyright 2004 Brooks/Cole, a division of Thomson Learning, Inc. 19

Example 5.7 Verbal SAT and GPA Grade point averages (GPAs) and verbal SAT scores for a sample of 100 university students. Correlation r = 0.485 => a moderately strong positive linear relationship. Copyright 2004 Brooks/Cole, a division of Thomson Learning, Inc. 20

Example 5.8 Age and Hours of TV Viewing Relationship between age and hours of daily television viewing for 1913 survey respondents. Correlation r = 0.12 => a weak connection. Note: a few claimed to watch more than 20 hours/day! Copyright 2004 Brooks/Cole, a division of Thomson Learning, Inc. 21

Example 5.9 Hours of Sleep and Hours of Study Relationship between reported hours of sleep the previous 24 hours and the reported hours of study during the same period for a sample of 116 college students. Correlation r = 0.36 => a not too strong negative association. Copyright 2004 Brooks/Cole, a division of Thomson Learning, Inc. 22

Summary Regression is used to do two things: Predict future values using information available now Estimate the average relationship between a response and one or more other variables Regression only works for linear relationships Copyright 2004 Brooks/Cole, a division of Thomson Learning, Inc. 23