HUDM4122 Probability and Statistical Inference. February 2, 2015


1 HUDM4122 Probability and Statistical Inference February 2, 2015

2 Special Session on SPSS Thursday, April 23 4pm-6pm As of when I closed the poll, every student except one could make it to this I am happy to meet individually with students who can't make this session

3 And people say pie charts aren't informative From Jeanine DeFalco

4 Homework 1 How did it go? How did you like working with the ASSISTments system? Too few problems? Too many? Just right?

5 What the homework covered Computing the mean, median, mode Symmetric and skewed distributions Variance Standard Deviation

6 Difficulties with rounding Sorry about that I'll try to be clearer next time

7 Difficult Problems 3. You are given n=8 measurements: 3, 2, 5, 6, 4, 4, 3, 4. What is the median? We had answers 4, 4.5, 5 Anyone want to explain any of these answers?
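One way to check Problem 3 is to sort the eight values and average the two middle ones, since n = 8 is even. A minimal sketch using only Python's standard library (the variable names are just for illustration):

```python
from statistics import median

measurements = [3, 2, 5, 6, 4, 4, 3, 4]

# The median is defined on the ordered values, so sort first.
ordered = sorted(measurements)

# n is even, so the median is the mean of the two middle ordered values.
n = len(ordered)
manual_median = (ordered[n // 2 - 1] + ordered[n // 2]) / 2

print(ordered)
print(manual_median, median(measurements))
```

Taking the middle of the unsorted list, or forgetting to average the two middle values, are common ways to land on a different answer.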

8 Difficult Problems 9. You are given 6 measurements: 5, 4, 4, 6, 8, 6. Calculate the sample variance, $s^2$ We had answers , 2.3, 2.5, 3.1, 3.3, 11.5 Anyone want to explain any of these answers?
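For Problem 9, a short sketch that computes the sample variance step by step may help explain some of the answers. It assumes the usual definition (sum of squared deviations divided by n - 1) and also shows what dividing by n would give:

```python
from statistics import mean, variance

data = [5, 4, 4, 6, 8, 6]

x_bar = mean(data)
# Sum of squared deviations from the mean (not yet divided by anything).
sum_sq_dev = sum((x - x_bar) ** 2 for x in data)

sample_var = sum_sq_dev / (len(data) - 1)   # divide by n - 1: sample variance
population_var = sum_sq_dev / len(data)     # divide by n: population variance

print(x_bar, sum_sq_dev, sample_var, population_var)
print(variance(data))                       # library check of the sample variance
```

Stopping at the sum of squared deviations gives 11.5, while dividing that sum by n - 1 = 5 gives 2.3.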

9 Questions? Comments?

10 Beyond these topics, in the last class We looked at how to create and interpret Box Plots And discussed Bimodal Distributions Mean Absolute Deviation Percentiles Z scores

11 Questions? Comments?

12 Today Ch. 3 in Mendenhall, Beaver, & Beaver

13 Today Scatterplots Covariance The Pearson Correlation Coefficient Regression Lines

14 Univariate Data A single variable is collected Height

15 Bivariate Data Two variables are collected (for the same data point) Height Drum-Playing Skill

16 Multivariate Data 3+ variables are collected [Table: Name, Height, Drum-Playing Skill for John Lennon, Paul McCartney, George Harrison, Ringo Starr; surviving values: 5, 6, 8]

17 Univariate Data Last Class

18 Bivariate Data Today

19 Scatterplot Shows the relationship between two variables

20 Are more expensive brands of peanut butter better? From InterMath intermath.coe.uga.edu

21 Dependent and Independent Variables Dependent Variable From InterMath intermath.coe.uga.edu Independent Variable

22 The Independent Variable Influences the Dependent Variable (Maybe) Dependent Variable From InterMath intermath.coe.uga.edu Independent Variable

23 (You don't always have to be sure) Dependent Variable From InterMath intermath.coe.uga.edu Independent Variable

24 Data Miners Would Instead Say Predictor and Predicted Variables Predicted Variable From InterMath intermath.coe.uga.edu Predictor Variable

25 I like this terminology better because it's neutral on causation Predicted Variable From InterMath intermath.coe.uga.edu Predictor Variable

26 From InterMath intermath.coe.uga.edu Anyways

27 So which brand of peanut butter should you buy? From InterMath intermath.coe.uga.edu

28 Which brand of peanut butter should a gourmet buy? From InterMath intermath.coe.uga.edu

29 Which brand of peanut butter should a gourmet buy? From InterMath intermath.coe.uga.edu

30 From InterMath intermath.coe.uga.edu How about a frugal person?

31 From InterMath intermath.coe.uga.edu How about a frugal person?

32 Who should buy this peanut butter? From InterMath intermath.coe.uga.edu

33 From InterMath intermath.coe.uga.edu How about this one?

34 From InterMath intermath.coe.uga.edu A lot of variability, right?

35 Questions? Comments?

36 Let's discuss some of the properties of scatterplots

37 What can you say about the relationship between Price and Quality? [Scatterplot: Snorgles, Quality vs. Price]

38 What can you say about the relationship between Price and Quality? [Scatterplot: Frungles, Quality vs. Price]

39 What can you say about the relationship between Price and Quality? [Scatterplot: Trandles, Quality vs. Price]

40 So in other words Spend your hard earned dollars on expensive snorgles But save your money on frungles and trandles

41 Questions? Comments?

42 Quick comment on scatterplots Scatterplots are great

43 Quick comment on scatterplots Scatterplots are great

44 Quick comment on scatterplots But they don't scale to really big data sets If your scatterplot just looks like a giant blob or a grid, try a heat map We won't go into detail on heat maps (there's a lot to cover today), but I wanted to put that in your brains

45 Linear functions All these graphs can be described by linear functions, a.k.a. straight lines

46 Linear functions All these graphs can be described by linear functions, a.k.a. straight lines [Scatterplot: Snorgles, Quality vs. Price]

47 Figuring out what the best-fitting line is Is the simplest case of linear regression Linear regression is a sophisticated statistical modeling method Focus of HUDM5122 This is just the simplest application of it

48 Linear Regression: X and Y If the two variables have a linear (straight line) relationship Then we can predict Y's value from X

49 Finding Y from X If you buy a new snorgle that costs $200, what is its quality likely to be? [Scatterplot: Snorgles, Quality vs. Price]

50 Finding Y from X If you buy a new snorgle that costs $120, what is its quality likely to be? [Scatterplot: Snorgles, Quality vs. Price]

51 There's a better way to do this [Scatterplot: Snorgles, Quality vs. Price]

52 We can create a mathematical function [Scatterplot: Snorgles, Quality vs. Price]

53 We can create a mathematical function Y = A + BX, where B is the slope and A is the Y-intercept [Scatterplot: Snorgles, Quality vs. Price]

54 In this specific case Quality = A + (B)(Price) [Scatterplot: Snorgles, Quality vs. Price]

55 And the exact values are Quality = (Price) [Scatterplot: Snorgles, Quality vs. Price]

56 So if you buy a new snorgle for $200, what is its quality likely to be? Quality = (Price) [Scatterplot: Snorgles, Quality vs. Price]

57 How was your earlier estimate? Quality = (Price) [Scatterplot: Snorgles, Quality vs. Price]

58 So if you buy a new snorgle for $120, what is its quality likely to be? Quality = (Price) [Scatterplot: Snorgles, Quality vs. Price]

59 How was your earlier estimate? Quality = (Price) [Scatterplot: Snorgles, Quality vs. Price]

60 What we just did is called interpolation We used a formula to find an unknown value of Y For a value of X Where the value of X was between The minimum and maximum known values of the X variable
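As a rough illustration of interpolation, the sketch below plugs a price into a line Y = A + BX and checks whether that price lies between the smallest and largest known X values. The coefficients and the list of known prices are hypothetical stand-ins, not the values from the slides.

```python
def predict(a, b, x):
    """Return the predicted Y on the line Y = A + B*X."""
    return a + b * x


def is_interpolation(x, known_x):
    """True when x lies between the minimum and maximum known X values."""
    return min(known_x) <= x <= max(known_x)


# Hypothetical intercept, slope, and observed prices (NOT from the slides).
A, B = 10.0, 1.5
known_prices = [50.0, 100.0, 150.0, 250.0, 300.0]

for price in (120.0, 200.0):
    quality = predict(A, B, price)
    print(price, quality, is_interpolation(price, known_prices))
```

Both $120 and $200 fall inside the hypothetical price range, so both predictions here count as interpolation.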

61 How did I compute A and B? There are at least three answers

62 Answer #1

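63 Answer #1: Least Squares Regression Formulas I used the magical Least Squares Regression Formulas $B = \frac{\sum_{i=1}^{n}(x_i-\bar{x})(y_i-\bar{y})}{\sum_{i=1}^{n}(x_i-\bar{x})^2}$ $A = \bar{y} - B\bar{x}$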
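A minimal sketch of the standard closed-form least squares formulas for a line Y = A + BX: the slope B is the sum of products of X and Y deviations divided by the sum of squared X deviations, and the intercept is A = ȳ - B·x̄. The function name and the data are made up for illustration; the slide's own notation may have differed.

```python
def least_squares_line(xs, ys):
    """Return (A, B) for the least squares line Y = A + B*X."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    # Sum of products of deviations, and sum of squared X deviations.
    s_xy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    s_xx = sum((x - x_bar) ** 2 for x in xs)
    b = s_xy / s_xx          # slope
    a = y_bar - b * x_bar    # Y-intercept
    return a, b


# Made-up data lying exactly on Y = 2 + 3X, so the fit should recover A=2, B=3.
xs = [1.0, 2.0, 3.0, 4.0]
ys = [5.0, 8.0, 11.0, 14.0]
a, b = least_squares_line(xs, ys)
print(a, b)
```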

64 Answer #2: Least Squares Regression Formulas I computed first order partial derivatives in order to discover the magical Least Squares Regression Formulas Here's a pretty good explanation (if you know Calculus) mjelde-james/ageco317/read/simple-9.doc

65 Answer #3: Minimize the Sum of Squared Residuals I found the values for A and B that made the sum of squared residuals the smallest A residual is how much each predicted value for Y differs from the actual value for Y The formula computes this easily (after derivation), but you can also do this in Excel! If we have time at the end, I'll show you this
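The "smallest sum of squared residuals" idea can also be tried by brute force. The sketch below is not the Excel demo; it swaps in a coarse grid search over candidate (A, B) pairs on made-up data, just to show that the best pair is the one with the lowest sum of squared residuals.

```python
def sum_squared_residuals(a, b, xs, ys):
    """Sum over all points of (actual Y - predicted Y)^2 for Y = A + B*X."""
    return sum((y - (a + b * x)) ** 2 for x, y in zip(xs, ys))


# Made-up, slightly noisy data (hypothetical).
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.1, 3.9, 6.2, 7.8, 10.1]

# Candidate intercepts from -2.0 to 2.0 and slopes from 0.0 to 4.0, in steps of 0.1.
candidates = [(a / 10, b / 10) for a in range(-20, 21) for b in range(0, 41)]

best_a, best_b = min(
    candidates,
    key=lambda ab: sum_squared_residuals(ab[0], ab[1], xs, ys),
)
print(best_a, best_b, sum_squared_residuals(best_a, best_b, xs, ys))
```

A finer grid (or the closed-form formulas) would give a more precise answer; the grid only finds the best pair among the candidates it is given.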

66 Questions? Comments?

67 Slope

68 Slope Slope (B) = 1 Slope (B) =

69 Slope Slope (B) = Slope (B) = 0.5 Slope (B) =

70 Slope Goes between positive infinity and negative infinity Can anyone draw Y = 1 1X Y = 1 3X Y = 1 1/3X

71 Questions? Comments?

72 So far we've looked at very clean data

73 Real data is usually messier

74 Here's some data I downloaded from greatschools.net Every suburban high school in Allegheny County, Pennsylvania Household income of region Percent of students who got top score on state standardized exam

75 [Scatterplot: Percent of Students "Advanced" on Exam vs. Average Household Income]

76 Formula Y = X [Scatterplot: Percent of Students "Advanced" on Exam vs. Average Household Income]

77 What percent advanced for a school with avg income = 30K? Y = X [Scatterplot: Percent of Students "Advanced" on Exam vs. Average Household Income]

78 What percent advanced for a school with avg income = 120K? Y = X [Scatterplot: Percent of Students "Advanced" on Exam vs. Average Household Income]

79 What we just did is called extrapolation We used a formula to find an unknown value of Y For a value of X Where the value of X was outside The minimum and maximum known values of the X variable

80 Extrapolation can be dangerous As we just saw in the last example
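One way extrapolation can go wrong: a percentage has to stay between 0 and 100, but a straight line has no such limit. The sketch below uses hypothetical coefficients and a hypothetical observed income range (the values from the slides did not survive), so it only illustrates the kind of problem that can appear.

```python
# Hypothetical line: percent advanced = A + B * (income in $K). NOT the slide's values.
A, B = -40.0, 1.2
observed_income_range = (45.0, 100.0)   # hypothetical min and max income, in $K


def predicted_percent_advanced(income_k):
    """Predicted percent of students scoring Advanced for a given income."""
    return A + B * income_k


results = {}
for income_k in (30.0, 120.0):
    pct = predicted_percent_advanced(income_k)
    lo, hi = observed_income_range
    extrapolating = not (lo <= income_k <= hi)
    possible = 0.0 <= pct <= 100.0
    results[income_k] = (pct, extrapolating, possible)
    print(income_k, pct, extrapolating, possible)
```

With these made-up numbers, both incomes lie outside the observed range and both predictions are impossible percentages.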

81 Questions? Comments?

82 In the last lecture We looked at the variability of a single variable

83 In the last lecture We looked at the variability of a single variable There can be variability in a relationship too

84 In the last lecture We looked at the variability of a single variable There can be variability in a relationship too

85 In the last lecture We looked at the variability of a single variable There can be variability in a relationship too Stemming from the variability of each of the variables involved

86 In the last lecture We looked at the variability of a single variable There can be variability in a relationship too Stemming from the variability of each of the variables involved

87 From McDonald, 2014,

88 We can measure that variability

89 We can measure that variability Using the correlation coefficient

90 We can measure that variability Using the correlation coefficient Also called Pearson Correlation Pearson Product-Moment Correlation

91 Important Note Even though many people refer to the Pearson correlation as the correlation A correlation is simply any relationship between two or more variables

92 Important Note Even though many people refer to the Pearson correlation as the correlation A correlation is simply any relationship between two or more variables When A s value changes, does B change in the same direction?

93 Beyond Pearson correlation

94 Beyond Pearson correlation (and this class)

95 Beyond Pearson correlation (and this class) Spearman's ρ Kendall's τ Goodman and Kruskal's γ Intraclass correlation

96 Pearson correlation Is written r

97 Pearson correlation Looks at the strength of the linear relationship between two variables

98 Pearson correlation Looks at the strength of the linear relationship between two variables If it's a nonlinear relationship, you want something that can model nonlinear correlations, like Spearman's ρ

99 Pearson correlation Looks at the strength of the linear relationship between two variables Looks at the quality of a linear model of the relationship between two variables

100 Pearson correlation Looks at the strength of the linear relationship between two variables Looks at the quality of a linear model of the relationship between two variables Yes, the A + BX we just looked at

101 Close relationship Moderate relationship Weak relationship From McDonald, 2014,

102 What is a good correlation? 1.0 perfect 0.0 none -1.0 perfectly negatively correlated In between depends on the field

103 What is a good correlation? 1.0 perfect 0.0 none -1.0 perfectly negatively correlated In between depends on the field In physics correlation of 0.8 is weak! In education correlation of 0.3 is good

104 Scoping Correlations Time on Task and Learning Cigarette Smoking and Lifespan - 0.3

105 From D. Boigelot, Wikipedia Pearson correlation values

106 How do we compute correlation?

107 First we have to compute the covariance Covariance is to Pearson Correlation As Variance is to Standard Deviation

108 First we have to compute the covariance Covariance is to Pearson Correlation As Variance is to Standard Deviation i.e. the same idea, but its values aren't interpretable

109 First we have to compute the covariance Written $s_{xy}$

110 First we have to compute the covariance Written $s_{xy} = \frac{\sum_{i=1}^{n}(x_i-\bar{x})(y_i-\bar{y})}{n-1}$

111 First we have to compute the covariance $s_{xy} = \frac{\sum_{i=1}^{n}(x_i-\bar{x})(y_i-\bar{y})}{n-1}$ In other words Compute means for X and Y Take each deviation for X Take each deviation for Y For each i Multiply the i-th deviation X by the i-th deviation Y Add them all together Divide by n-1

112 What does it mean to multiply deviations together? Note that we're multiplying deviations Not absolute deviations or standard deviations I.e. sometimes the values are positive, sometimes they're negative

113 And also note We're not comparing all values to each other Just the 1st X to the 1st Y And the 2nd X to the 2nd Y And the 3rd X to the 3rd Y And so on

114 So If the 1st X is really above the mean 1st Y is really above the mean Then you'll add a big positive number to the covariance $\frac{\sum_{i=1}^{n}(x_i-\bar{x})(y_i-\bar{y})}{n-1}$

115 So If the 1st X is really below the mean 1st Y is really below the mean Then you'll add a big positive number to the covariance $\frac{\sum_{i=1}^{n}(x_i-\bar{x})(y_i-\bar{y})}{n-1}$

116 So If the 1st X is really above the mean 1st Y is really below the mean Then you'll add a big negative number to the covariance $\frac{\sum_{i=1}^{n}(x_i-\bar{x})(y_i-\bar{y})}{n-1}$

117 So If the 1st X is really below the mean 1st Y is really above the mean Then you'll add a big negative number to the covariance $\frac{\sum_{i=1}^{n}(x_i-\bar{x})(y_i-\bar{y})}{n-1}$

118 So If the 1st X is near the mean 1st Y is near the mean Then you'll add a number close to zero to the covariance $\frac{\sum_{i=1}^{n}(x_i-\bar{x})(y_i-\bar{y})}{n-1}$
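The covariance recipe (means, deviations, pairwise products, sum, divide by n - 1) can be sketched as follows. The data are made up for illustration, and the function name is just a convenient label.

```python
def sample_covariance(xs, ys):
    """Sample covariance: sum of paired deviation products divided by n - 1."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    x_dev = [x - x_bar for x in xs]     # each deviation for X
    y_dev = [y - y_bar for y in ys]     # each deviation for Y
    # Multiply the i-th X deviation by the i-th Y deviation (same i only).
    products = [dx * dy for dx, dy in zip(x_dev, y_dev)]
    for i, p in enumerate(products, start=1):
        print(f"pair {i}: product of deviations = {p:+.2f}")
    return sum(products) / (n - 1)


# Made-up paired data (hypothetical).
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.0, 4.0, 5.0, 4.0, 5.0]
cov = sample_covariance(xs, ys)
print(cov)
```

Pairs where both values sit on the same side of their means contribute positive products; pairs near either mean contribute products close to zero.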

119 Questions? Comments?

120 Let's do an example together Compute means for X and Y Take each deviation for X Take each deviation for Y For each i Multiply the i-th deviation X by the i-th deviation Y Add them all together Divide by n-1 X Y

121 Please do another example in pairs Compute means for X and Y Take each deviation for X Take each deviation for Y For each i Multiply the i-th deviation X by the i-th deviation Y Add them all together Divide by n-1 X Y

122 Questions? Comments?

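123 Computing Pearson Correlation $r = \frac{s_{xy}}{s_x s_y}$

124 Returning to our example $r = \frac{s_{xy}}{s_x s_y}$ X Y

125 Please compute the Pearson Correlation in pairs $r = \frac{s_{xy}}{s_x s_y}$ $s_{xy} = \frac{\sum_{i=1}^{n}(x_i-\bar{x})(y_i-\bar{y})}{n-1}$ X Y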
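A minimal sketch of the Pearson correlation as the covariance divided by the product of the two sample standard deviations. The data are made up (not the class example, whose values are not shown here), and the function name is hypothetical.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation: covariance / (std. dev. of X * std. dev. of Y)."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    s_xy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys)) / (n - 1)
    s_x = sqrt(sum((x - x_bar) ** 2 for x in xs) / (n - 1))
    s_y = sqrt(sum((y - y_bar) ** 2 for y in ys) / (n - 1))
    return s_xy / (s_x * s_y)


# Made-up paired data (hypothetical).
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.0, 4.0, 5.0, 4.0, 5.0]
r = pearson_r(xs, ys)
print(round(r, 4))

# Points lying exactly on a rising straight line give r = 1 (up to rounding).
r_line = pearson_r(xs, [2 * x + 1 for x in xs])
print(r_line)
```

Unlike the covariance, r always falls between -1 and 1, which is what makes it interpretable.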

126 Questions? Comments?

127 Correlations can be vulnerable to outliers
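To see how much a single point can matter, the sketch below computes r for a small made-up data set with almost no linear relationship, then adds one extreme point and recomputes. The data and the helper name are hypothetical.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation; the (n - 1) factors cancel, so they are omitted."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    s_xy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    s_xx = sum((x - x_bar) ** 2 for x in xs)
    s_yy = sum((y - y_bar) ** 2 for y in ys)
    return s_xy / sqrt(s_xx * s_yy)


# Made-up data with a weak linear relationship.
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [3.0, 1.0, 4.0, 2.0, 3.0]
r_without = pearson_r(xs, ys)

# The same data plus one extreme point far from the rest.
r_with = pearson_r(xs + [20.0], ys + [20.0])

print(round(r_without, 3), round(r_with, 3))
```

One far-away point is enough to turn a weak correlation into one that looks very strong, which is why a scatterplot is worth checking alongside r.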

128 Anscombe's Quartet: Same correlation, very different relationships Image from Wikipedia

129 Anscombe's Quartet: Same correlation, very different relationships Noisy Linear Relationship Nonlinear Relationship Outlier Super Outlier! Image from Wikipedia

130 Implication of Anscombe's Quartet Don't just compute a Pearson correlation, see a high value, and declare victory Actually look at a scatterplot

131 So, that's correlation

132 So, that's correlation Useful way to see if two variables are related to each other (linearly)

133 Please remember Correlation is not causation!

134 Questions? Comments?

135 If we have time Demo finding A and B for Y = A + Bx In Excel Using Sum of Squared Residuals Residual = Difference Between Predicted Y and Actual Y

136 Questions? Comments?

137 Upcoming Classes 2/4 Introduction to Probability Ch /9 No class 2/11 Permutations, Combinations, Unions, and Complements Ch. 4.4 HW2 due

138 Homework 2 Due in 9 days In the ASSISTments system

139 Questions? Comments?

Business Statistics. Lecture 10: Correlation and Linear Regression

Business Statistics. Lecture 10: Correlation and Linear Regression Business Statistics Lecture 10: Correlation and Linear Regression Scatterplot A scatterplot shows the relationship between two quantitative variables measured on the same individuals. It displays the Form

More information

REVIEW 8/2/2017 陈芳华东师大英语系

REVIEW 8/2/2017 陈芳华东师大英语系 REVIEW Hypothesis testing starts with a null hypothesis and a null distribution. We compare what we have to the null distribution, if the result is too extreme to belong to the null distribution (p

More information

Business Statistics. Lecture 9: Simple Regression

Business Statistics. Lecture 9: Simple Regression Business Statistics Lecture 9: Simple Regression 1 On to Model Building! Up to now, class was about descriptive and inferential statistics Numerical and graphical summaries of data Confidence intervals

More information

MATH 1150 Chapter 2 Notation and Terminology

MATH 1150 Chapter 2 Notation and Terminology MATH 1150 Chapter 2 Notation and Terminology Categorical Data The following is a dataset for 30 randomly selected adults in the U.S., showing the values of two categorical variables: whether or not the

More information

PS2.1 & 2.2: Linear Correlations PS2: Bivariate Statistics

PS2.1 & 2.2: Linear Correlations PS2: Bivariate Statistics PS2.1 & 2.2: Linear Correlations PS2: Bivariate Statistics LT1: Basics of Correlation LT2: Measuring Correlation and Line of best fit by eye Univariate (one variable) Displays Frequency tables Bar graphs

More information

Linear Regression. Linear Regression. Linear Regression. Did You Mean Association Or Correlation?

Linear Regression. Linear Regression. Linear Regression. Did You Mean Association Or Correlation? Did You Mean Association Or Correlation? AP Statistics Chapter 8 Be careful not to use the word correlation when you really mean association. Often times people will incorrectly use the word correlation

More information

Chapter 8. Linear Regression. The Linear Model. Fat Versus Protein: An Example. The Linear Model (cont.) Residuals

Chapter 8. Linear Regression. The Linear Model. Fat Versus Protein: An Example. The Linear Model (cont.) Residuals Chapter 8 Linear Regression Copyright 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide 8-1 Copyright 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Fat Versus

More information

TOPIC: Descriptive Statistics Single Variable

TOPIC: Descriptive Statistics Single Variable TOPIC: Descriptive Statistics Single Variable I. Numerical data summary measurements A. Measures of Location. Measures of central tendency Mean; Median; Mode. Quantiles - measures of noncentral tendency

More information

1. Create a scatterplot of this data. 2. Find the correlation coefficient.

1. Create a scatterplot of this data. 2. Find the correlation coefficient. How Fast Foods Compare Company Entree Total Calories Fat (grams) McDonald s Big Mac 540 29 Filet o Fish 380 18 Burger King Whopper 670 40 Big Fish Sandwich 640 32 Wendy s Single Burger 470 21 1. Create

More information

Relationships between variables. Visualizing Bivariate Distributions: Scatter Plots

Relationships between variables. Visualizing Bivariate Distributions: Scatter Plots SFBS Course Notes Part 7: Correlation Bivariate relationships (p. 1) Linear transformations (p. 3) Pearson r : Measuring a relationship (p. 5) Interpretation of correlations (p. 10) Relationships between

More information

appstats8.notebook October 11, 2016

appstats8.notebook October 11, 2016 Chapter 8 Linear Regression Objective: Students will construct and analyze a linear model for a given set of data. Fat Versus Protein: An Example pg 168 The following is a scatterplot of total fat versus

More information

AMS 7 Correlation and Regression Lecture 8

AMS 7 Correlation and Regression Lecture 8 AMS 7 Correlation and Regression Lecture 8 Department of Applied Mathematics and Statistics, University of California, Santa Cruz Suumer 2014 1 / 18 Correlation pairs of continuous observations. Correlation

More information

Chapter 8. Linear Regression. Copyright 2010 Pearson Education, Inc.

Chapter 8. Linear Regression. Copyright 2010 Pearson Education, Inc. Chapter 8 Linear Regression Copyright 2010 Pearson Education, Inc. Fat Versus Protein: An Example The following is a scatterplot of total fat versus protein for 30 items on the Burger King menu: Copyright

More information

PS2: Two Variable Statistics

PS2: Two Variable Statistics PS2: Two Variable Statistics LT2: Measuring Correlation and Line of best fit by eye. LT3: Linear regression LT4: The χ 2 test of independence. 1 Pearson's Correlation Coefficient In examinations you are

More information

Chapter 3: Examining Relationships

Chapter 3: Examining Relationships Chapter 3: Examining Relationships Most statistical studies involve more than one variable. Often in the AP Statistics exam, you will be asked to compare two data sets by using side by side boxplots or

More information

Correlation. A statistics method to measure the relationship between two variables. Three characteristics

Correlation. A statistics method to measure the relationship between two variables. Three characteristics Correlation Correlation A statistics method to measure the relationship between two variables Three characteristics Direction of the relationship Form of the relationship Strength/Consistency Direction

More information

Statistical View of Least Squares

Statistical View of Least Squares May 23, 2006 Purpose of Regression Some Examples Least Squares Purpose of Regression Purpose of Regression Some Examples Least Squares Suppose we have two variables x and y Purpose of Regression Some Examples

More information

Can you tell the relationship between students SAT scores and their college grades?

Can you tell the relationship between students SAT scores and their college grades? Correlation One Challenge Can you tell the relationship between students SAT scores and their college grades? A: The higher SAT scores are, the better GPA may be. B: The higher SAT scores are, the lower

More information

Data files for today. CourseEvalua2on2.sav pontokprediktorok.sav Happiness.sav Ca;erplot.sav

Data files for today. CourseEvalua2on2.sav pontokprediktorok.sav Happiness.sav Ca;erplot.sav Correlation Data files for today CourseEvalua2on2.sav pontokprediktorok.sav Happiness.sav Ca;erplot.sav Defining Correlation Co-variation or co-relation between two variables These variables change together

More information

9. Linear Regression and Correlation

9. Linear Regression and Correlation 9. Linear Regression and Correlation Data: y a quantitative response variable x a quantitative explanatory variable (Chap. 8: Recall that both variables were categorical) For example, y = annual income,

More information

Lecture 3. The Population Variance. The population variance, denoted σ 2, is the sum. of the squared deviations about the population

Lecture 3. The Population Variance. The population variance, denoted σ 2, is the sum. of the squared deviations about the population Lecture 5 1 Lecture 3 The Population Variance The population variance, denoted σ 2, is the sum of the squared deviations about the population mean divided by the number of observations in the population,

More information

Relationships Regression

Relationships Regression Relationships Regression BPS chapter 5 2006 W.H. Freeman and Company Objectives (BPS chapter 5) Regression Regression lines The least-squares regression line Using technology Facts about least-squares

More information

Chapter 12 Summarizing Bivariate Data Linear Regression and Correlation

Chapter 12 Summarizing Bivariate Data Linear Regression and Correlation Chapter 1 Summarizing Bivariate Data Linear Regression and Correlation This chapter introduces an important method for making inferences about a linear correlation (or relationship) between two variables,

More information

Contents. Acknowledgments. xix

Contents. Acknowledgments. xix Table of Preface Acknowledgments page xv xix 1 Introduction 1 The Role of the Computer in Data Analysis 1 Statistics: Descriptive and Inferential 2 Variables and Constants 3 The Measurement of Variables

More information

AP Final Review II Exploring Data (20% 30%)

AP Final Review II Exploring Data (20% 30%) AP Final Review II Exploring Data (20% 30%) Quantitative vs Categorical Variables Quantitative variables are numerical values for which arithmetic operations such as means make sense. It is usually a measure

More information

AP Statistics. Chapter 6 Scatterplots, Association, and Correlation

AP Statistics. Chapter 6 Scatterplots, Association, and Correlation AP Statistics Chapter 6 Scatterplots, Association, and Correlation Objectives: Scatterplots Association Outliers Response Variable Explanatory Variable Correlation Correlation Coefficient Lurking Variables

More information

Unit 6 - Introduction to linear regression

Unit 6 - Introduction to linear regression Unit 6 - Introduction to linear regression Suggested reading: OpenIntro Statistics, Chapter 7 Suggested exercises: Part 1 - Relationship between two numerical variables: 7.7, 7.9, 7.11, 7.13, 7.15, 7.25,

More information

CRP 272 Introduction To Regression Analysis

CRP 272 Introduction To Regression Analysis CRP 272 Introduction To Regression Analysis 30 Relationships Among Two Variables: Interpretations One variable is used to explain another variable X Variable Independent Variable Explaining Variable Exogenous

More information

11 Correlation and Regression

11 Correlation and Regression Chapter 11 Correlation and Regression August 21, 2017 1 11 Correlation and Regression When comparing two variables, sometimes one variable (the explanatory variable) can be used to help predict the value

More information

Regression and correlation. Correlation & Regression, I. Regression & correlation. Regression vs. correlation. Involve bivariate, paired data, X & Y

Regression and correlation. Correlation & Regression, I. Regression & correlation. Regression vs. correlation. Involve bivariate, paired data, X & Y Regression and correlation Correlation & Regression, I 9.07 4/1/004 Involve bivariate, paired data, X & Y Height & weight measured for the same individual IQ & exam scores for each individual Height of

More information

Review. Midterm Exam. Midterm Review. May 6th, 2015 AMS-UCSC. Spring Session 1 (Midterm Review) AMS-5 May 6th, / 24

Review. Midterm Exam. Midterm Review. May 6th, 2015 AMS-UCSC. Spring Session 1 (Midterm Review) AMS-5 May 6th, / 24 Midterm Exam Midterm Review AMS-UCSC May 6th, 2015 Spring 2015. Session 1 (Midterm Review) AMS-5 May 6th, 2015 1 / 24 Topics Topics We will talk about... 1 Review Spring 2015. Session 1 (Midterm Review)

More information

Stat 101 Exam 1 Important Formulas and Concepts 1

Stat 101 Exam 1 Important Formulas and Concepts 1 1 Chapter 1 1.1 Definitions Stat 101 Exam 1 Important Formulas and Concepts 1 1. Data Any collection of numbers, characters, images, or other items that provide information about something. 2. Categorical/Qualitative

More information

Correlation and regression

Correlation and regression NST 1B Experimental Psychology Statistics practical 1 Correlation and regression Rudolf Cardinal & Mike Aitken 11 / 12 November 2003 Department of Experimental Psychology University of Cambridge Handouts:

More information

Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras

Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras Lecture - 39 Regression Analysis Hello and welcome to the course on Biostatistics

More information

MATH 1070 Introductory Statistics Lecture notes Relationships: Correlation and Simple Regression

MATH 1070 Introductory Statistics Lecture notes Relationships: Correlation and Simple Regression MATH 1070 Introductory Statistics Lecture notes Relationships: Correlation and Simple Regression Objectives: 1. Learn the concepts of independent and dependent variables 2. Learn the concept of a scatterplot

More information

Draft Proof - Do not copy, post, or distribute. Chapter Learning Objectives REGRESSION AND CORRELATION THE SCATTER DIAGRAM

Draft Proof - Do not copy, post, or distribute. Chapter Learning Objectives REGRESSION AND CORRELATION THE SCATTER DIAGRAM 1 REGRESSION AND CORRELATION As we learned in Chapter 9 ( Bivariate Tables ), the differential access to the Internet is real and persistent. Celeste Campos-Castillo s (015) research confirmed the impact

More information

Correlation and Regression

Correlation and Regression Correlation and Regression Dr. Bob Gee Dean Scott Bonney Professor William G. Journigan American Meridian University 1 Learning Objectives Upon successful completion of this module, the student should

More information

Chapter 14. Statistical versus Deterministic Relationships. Distance versus Speed. Describing Relationships: Scatterplots and Correlation

Chapter 14. Statistical versus Deterministic Relationships. Distance versus Speed. Describing Relationships: Scatterplots and Correlation Chapter 14 Describing Relationships: Scatterplots and Correlation Chapter 14 1 Statistical versus Deterministic Relationships Distance versus Speed (when travel time is constant). Income (in millions of

More information

appstats27.notebook April 06, 2017

appstats27.notebook April 06, 2017 Chapter 27 Objective Students will conduct inference on regression and analyze data to write a conclusion. Inferences for Regression An Example: Body Fat and Waist Size pg 634 Our chapter example revolves

More information

Ecn Analysis of Economic Data University of California - Davis February 23, 2010 Instructor: John Parman. Midterm 2. Name: ID Number: Section:

Ecn Analysis of Economic Data University of California - Davis February 23, 2010 Instructor: John Parman. Midterm 2. Name: ID Number: Section: Ecn 102 - Analysis of Economic Data University of California - Davis February 23, 2010 Instructor: John Parman Midterm 2 You have until 10:20am to complete this exam. Please remember to put your name,

More information

SESSION 5 Descriptive Statistics

SESSION 5 Descriptive Statistics SESSION 5 Descriptive Statistics Descriptive statistics are used to describe the basic features of the data in a study. They provide simple summaries about the sample and the measures. Together with simple

More information

bivariate correlation bivariate regression multiple regression

bivariate correlation bivariate regression multiple regression bivariate correlation bivariate regression multiple regression Today Bivariate Correlation Pearson product-moment correlation (r) assesses nature and strength of the linear relationship between two continuous

More information

THE PEARSON CORRELATION COEFFICIENT

THE PEARSON CORRELATION COEFFICIENT CORRELATION Two variables are said to have a relation if knowing the value of one variable gives you information about the likely value of the second variable this is known as a bivariate relation There

More information

Wed, June 26, (Lecture 8-2). Nonlinearity. Significance test for correlation R-squared, SSE, and SST. Correlation in SPSS.

Wed, June 26, (Lecture 8-2). Nonlinearity. Significance test for correlation R-squared, SSE, and SST. Correlation in SPSS. Wed, June 26, (Lecture 8-2). Nonlinearity. Significance test for correlation R-squared, SSE, and SST. Correlation in SPSS. Last time, we looked at scatterplots, which show the interaction between two variables,

More information
