HUDM4122 Probability and Statistical Inference. February 2, 2015
|
|
- Myles Lang
- 6 years ago
- Views:
Transcription
1 HUDM4122 Probability and Statistical Inference February 2, 2015
2 Special Session on SPSS Thursday, April 23 4pm-6pm As of when I closed the poll, every student except one could make it to this I am happy to meet individually with students who can t make this session
3 And people say pie charts aren t informative From Jeanine DeFalco
4 Homework 1 How did it go? How did you like working with the ASSISTments system? Too few problems? Too many? Just right?
5 What the homework covered Computing the mean, median, mode Symmetric and skewed distributions Variance Standard Deviation
6 Difficulties with rounding Sorry about that I ll try to be clearer next time
7 Difficult Problems 3. You are given n=8 measurements: 3, 2, 5, 6, 4, 4, 3, 4. What is the median? We had answers 4, 4.5, 5 Anyone want to explain any of these answers?
8 Difficult Problems 9. You are given 6 measurements: 5, 4, 4, 6, 8, 6. Calculate the sample variance, s 2 We had answers , 2.3, 2.5, 3.1, 3.3, 11.5 Anyone want to explain any of these answers?
9 Questions? Comments?
10 Beyond these topics, in the last class We looked at how to create and interpret Box Plots And discussed Bimodal Distributions Mean Absolute Deviation Percentiles Z scores
11 Questions? Comments?
12 Today Ch. 3 in Mendenhall, Beaver, & Beaver
13 Today Scatterplots Covariance The Pearson Correlation Coefficient Regression Lines
14 Univariate Data A single variable is collected Height
15 Bivariate Data Two variables are collected (for the same data point) Height Drum-Playing Skill
16 Multivariate Data 3+ variables are collected Name Height Drum-Playing Skill John Lennon Paul McCartney George Harrison Ringo Starr 5 6 8
17 Univariate Data Last Class
18 Bivariate Data Today
19 Scatterplot Shows the relationship between two variables
20 Are more expensive brands of peanut butter better? From InterMath intermath.coe.uga.edu
21 Dependent and Independent Variables Dependent Variable From InterMath intermath.coe.uga.edu Independent Variable
22 The Independent Variable Influences the Dependent Variable (Maybe) Dependent Variable From InterMath intermath.coe.uga.edu Independent Variable
23 (You don t always have to be sure) Dependent Variable From InterMath intermath.coe.uga.edu Independent Variable
24 Data Miners Would Instead Say Predictor and Predicted Variables Predicted Variable From InterMath intermath.coe.uga.edu Predictor Variable
25 I like this terminology better because it s neutral on causation Predicted Variable From InterMath intermath.coe.uga.edu Predictor Variable
26 From InterMath intermath.coe.uga.edu Anyways
27 So which brand of peanut butter should you buy? From InterMath intermath.coe.uga.edu
28 Which brand of peanut butter should a gourmet buy? From InterMath intermath.coe.uga.edu
29 Which brand of peanut butter should a gourmet buy? From InterMath intermath.coe.uga.edu
30 From InterMath intermath.coe.uga.edu How about a frugal person?
31 From InterMath intermath.coe.uga.edu How about a frugal person?
32 Who should buy this peanut butter? From InterMath intermath.coe.uga.edu
33 From InterMath intermath.coe.uga.edu How about this one?
34 From InterMath intermath.coe.uga.edu A lot of variability, right?
35 Questions? Comments?
36 Let s discuss some of the properties of scatterplots
37 What can you say about the relationship between Price and Quality? 300 Snorgles Quality Price
38 What can you say about the relationship between Price and Quality? 250 Frungles 200 Quality Price
39 What can you say about the relationship between Price and Quality? Quality Trandles Price
40 So in other words Spend your hard earned dollars on expensive snorgles But save your money on frungles and trandles
41 Questions? Comments?
42 Quick comment on scatterplots Scatterplots are great
43 Quick comment on scatterplots Scatterplots are great
44 Quick comment on scatterplots But they don t scale to really big data sets If your scatterplot just looks like a giant blob or a grid, try a heat map We won t go into detail on heat maps there s a lot to cover today -- but I wanted to put that in your brains
45 Linear functions All these graphs can be described by linear functions, a.k.a. straight lines
46 Linear functions All these graphs can be described by linear functions, a.k.a. straight lines Snorgles Quality Price
47 Figuring out what the best-fitting line is Is the simplest case of linear regression Linear regression is a sophisticated statistical modeling method Focus of HUDM5122 This is just the simplest application of it
48 Linear Regression: X and Y If the two variables have a linear (straight line) relationship Then we can predict Y s value from X
49 Finding Y from X If you buy a new snorgle that costs $200, what is its quality likely to be? Snorgles Quality Price
50 Finding Y from X If you buy a new snorgle that costs $120, what is its quality likely to be? Snorgles Quality Price
51 There s a better way to do this Snorgles Quality Price
52 We can create a mathematical function Snorgles Quality Price
53 We can create a mathematical function Y= A + BX Slope Y-intercept Snorgles Quality Price
54 In this specific case Quality = A + (B)(Price) Snorgles Quality Price
55 And the exact values are Quality = (Price) Snorgles Quality Price
56 So if you buy a new snorgle for $200, what is its quality likely to be? Quality = (Price) Snorgles Quality Price
57 How was your earlier estimate? Quality = (Price) Snorgles Quality Price
58 So if you buy a new snorgle for $120, what is its quality likely to be? Quality = (Price) Snorgles Quality Price
59 How was your earlier estimate? Quality = (Price) Snorgles Quality Price
60 What we just did is called interpolation We used a formula to find an unknown value of Y For a value of X Where the value of X was between The minimum and maximum known values of the X variable
61 How did I compute A and B? There s at least three answers
62 Answer #1
63 Answer#1: Least Squares Regression Formulas I used the magical Least Squares Regression Formulas = =
64 Answer#2: Least Squares Regression Formulas I computed first order partial derivatives in order to discover the magical Least Squares Regression Formulas Here s a pretty good explanation (if you know Calculus) mjelde-james/ageco317/read/simple-9.doc
65 Answer#3: Minimize the Sum of Squared Residuals I found the values for A and B that made the sum of squared residuals the smallest A residual is how much each predicted value for Y differs from the actual value for Y The formula computes this easily (after derivation), but you can also do this in Excel! If we have time at the end, I ll show you this
66 Questions? Comments?
67 Slope
68 Slope Slope (B) = 1 Slope (B) =
69 Slope Slope (B) = Slope (B) = 0.5 Slope (B) =
70 Slope Goes between positive infinity and negative infinity Can anyone draw Y = 1 1X Y = 1 3X Y = 1 1/3X
71 Questions? Comments?
72 So far we ve looked at very clean data
73 Real data is usually messier
74 Here s some data I downloaded from greatschools.net Every suburban high school in Allegheny County, Pennsylvania Household income of region Percent of students who got top score on state standardized exam
75 100 Percent of Studnets "Advanced" on Exam Average Household Income
76 Formula Y = X 100 Percent of Studnets "Advanced" on Exam Average Household Income
77 What percent advanced for a school with avg income = 30K? Y = X 100 Percent of Studnets "Advanced" on Exam Average Household Income
78 What percent advanced for a school with avg income = 120K? Y = X 100 Percent of Studnets "Advanced" on Exam Average Household Income
79 What we just did is called extrapolation We used a formula to find an unknown value of Y For a value of X Where the value of X was outside The minimum and maximum known values of the X variable
80 Extrapolation can be dangerous As we just saw in the last example
81 Questions? Comments?
82 In the last lecture We looked at the variability of a single variable
83 In the last lecture We looked at the variability of a single variable There can be variability in a relationship too
84 In the last lecture We looked at the variability of a single variable There can be variability in a relationship too
85 In the last lecture We looked at the variability of a single variable There can be variability in a relationship too Stemming from the variability of each of the variables involved
86 In the last lecture We looked at the variability of a single variable There can be variability in a relationship too Stemming from the variability of each of the variables involved
87 From McDonald, 2014,
88 We can measure that variability
89 We can measure that variability Using the correlation coefficient
90 We can measure that variability Using the correlation coefficient Also called Pearson Correlation Pearson Product-Moment Correlation
91 Important Note Even though many people refer to the Pearson correlation as the correlation A correlation is simply any relationship between two or more variables
92 Important Note Even though many people refer to the Pearson correlation as the correlation A correlation is simply any relationship between two or more variables When A s value changes, does B change in the same direction?
93 Beyond Pearson correlation
94 Beyond Pearson correlation (and this class)
95 Beyond Pearson correlation (and this class) Spearman s ρ Kendall s τ Goodman and Kruskal s γ Intraclass correlation
96 Is written r Pearson correlation
97 Pearson correlation Looks at the strength of the linear relationship between two variables
98 Pearson correlation Looks at the strength of the linear relationship between two variables If it s a nonlinear relationship, you want something that can model nonlinear correlations, like Spearman s ρ
99 Pearson correlation Looks at the strength of the linear relationship between two variables Looks at the quality of a linear model of the relationship between two variables
100 Pearson correlation Looks at the strength of the linear relationship between two variables Looks at the quality of a linear model of the relationship between two variables Yes, the Ax + B we just looked at
101 Close relationship Moderate relationship Weak relationship From McDonald, 2014,
102 What is a good correlation? 1.0 perfect 0.0 none -1.0 perfectly negatively correlated In between depends on the field
103 What is a good correlation? 1.0 perfect 0.0 none -1.0 perfectly negatively correlated In between depends on the field In physics correlation of 0.8 is weak! In education correlation of 0.3 is good
104 Scoping Correlations Time on Task and Learning Cigarette Smoking and Lifespan - 0.3
105 From D. Boigelot, Wikipedia Pearson correlation values
106 How do we compute correlation?
107 First we have to compute the covariance Covariance is to Pearson Correlation As Variance is to Standard Deviation
108 First we have to compute the covariance Covariance is to Pearson Correlation As Variance is to Standard Deviation i.e. the same idea, but its values aren t interpretable
109 Written First we have to compute the covariance
110 Written First we have to compute the covariance = ( )( ) ( )
111 First we have to compute the covariance = ( )( ) ( ) In other words Compute means for X and Y Take each deviation for X Take each deviation for Y For each i Multiply the i-th deviation X by the i-th deviation Y Add them all together Divide by n-1
112 What does it mean to multiply deviations together? Note that we re multiplying deviations Not absolute deviations or standard deviations I.e. sometimes the values are positive, sometimes they re negative
113 And also note We re not comparing all values to each other Just the 1 st X to the 1 st Y And the 2 nd X to the 2 nd Y And the 3 rd X to the 3 rd Y And so on
114 So If the 1 st X is really above the mean 1 st Y is really above the mean Then you ll add a big positive number to the covariance ( )( ) ( 1)
115 So If the 1 st X is really below the mean 1 st Y is really below the mean Then you ll add a big positive number to the covariance ( )( ) ( 1)
116 So If the 1 st X is really above the mean 1 st Y is really below the mean Then you ll add a big negative number to the covariance ( )( ) ( 1)
117 So If the 1 st X is really below the mean 1 st Y is really above the mean Then you ll add a big negative number to the covariance ( )( ) ( 1)
118 So If the 1 st X is near the mean 1 st Y is near the mean Then you ll add a number close to zero to the covariance ( )( ) ( 1)
119 Questions? Comments?
120 Let s do an example together Compute means for X and Y Take each deviation for X Take each deviation for Y For each i Multiply the i-th deviation X by the i-th deviation Y Add them all together Divide by n-1 X Y
121 Please do another example in pairs Compute means for X and Y Take each deviation for X Take each deviation for Y For each i Multiply the i-th deviation X by the i-th deviation Y Add them all together Divide by n-1 X Y
122 Questions? Comments?
123 Computing Pearson Correlation =
124 Returning to our example = X Y
125 Please compute the Pearson Correlation in = = ( )( ) ( ) pairs X Y
126 Questions? Comments?
127 Correlations can be vulnerable to outliers
128 Anscombe s Quartet: Same correlation, very different relationships Image from Wikipedia
129 Anscombe s Quartet: Same correlation, very different relationships Noisy Linear Relationship Nonlinear Relationship Outlier Super Outlier! Image from Wikipedia
130 Implication of Anscombe s Quartet Don t just compute a Pearson correlation, see a high value, and declare victory Actually look at a scatterplot
131 So, that s correlation
132 So, that s correlation Useful way to see if two variables are related to each other (linearly)
133 Please remember Correlation is not causation!
134 Questions? Comments?
135 If we have time Demo finding A and B for Y = A + Bx In Excel Using Sum of Squared Residuals Residual = Difference Between Predicted Y and Actual Y
136 Questions? Comments?
137 Upcoming Classes 2/4 Introduction to Probability Ch /9 No class 2/11 Permutations, Combinations, Unions, and Complements Ch. 4.4 HW2 due
138 Homework 2 Due in 9 days In the ASSISTments system
139 Questions? Comments?
Business Statistics. Lecture 10: Correlation and Linear Regression
Business Statistics Lecture 10: Correlation and Linear Regression Scatterplot A scatterplot shows the relationship between two quantitative variables measured on the same individuals. It displays the Form
More informationREVIEW 8/2/2017 陈芳华东师大英语系
REVIEW Hypothesis testing starts with a null hypothesis and a null distribution. We compare what we have to the null distribution, if the result is too extreme to belong to the null distribution (p
More informationBusiness Statistics. Lecture 9: Simple Regression
Business Statistics Lecture 9: Simple Regression 1 On to Model Building! Up to now, class was about descriptive and inferential statistics Numerical and graphical summaries of data Confidence intervals
More informationMATH 1150 Chapter 2 Notation and Terminology
MATH 1150 Chapter 2 Notation and Terminology Categorical Data The following is a dataset for 30 randomly selected adults in the U.S., showing the values of two categorical variables: whether or not the
More informationPS2.1 & 2.2: Linear Correlations PS2: Bivariate Statistics
PS2.1 & 2.2: Linear Correlations PS2: Bivariate Statistics LT1: Basics of Correlation LT2: Measuring Correlation and Line of best fit by eye Univariate (one variable) Displays Frequency tables Bar graphs
More informationLinear Regression. Linear Regression. Linear Regression. Did You Mean Association Or Correlation?
Did You Mean Association Or Correlation? AP Statistics Chapter 8 Be careful not to use the word correlation when you really mean association. Often times people will incorrectly use the word correlation
More informationChapter 8. Linear Regression. The Linear Model. Fat Versus Protein: An Example. The Linear Model (cont.) Residuals
Chapter 8 Linear Regression Copyright 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide 8-1 Copyright 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Fat Versus
More informationTOPIC: Descriptive Statistics Single Variable
TOPIC: Descriptive Statistics Single Variable I. Numerical data summary measurements A. Measures of Location. Measures of central tendency Mean; Median; Mode. Quantiles - measures of noncentral tendency
More information1. Create a scatterplot of this data. 2. Find the correlation coefficient.
How Fast Foods Compare Company Entree Total Calories Fat (grams) McDonald s Big Mac 540 29 Filet o Fish 380 18 Burger King Whopper 670 40 Big Fish Sandwich 640 32 Wendy s Single Burger 470 21 1. Create
More informationRelationships between variables. Visualizing Bivariate Distributions: Scatter Plots
SFBS Course Notes Part 7: Correlation Bivariate relationships (p. 1) Linear transformations (p. 3) Pearson r : Measuring a relationship (p. 5) Interpretation of correlations (p. 10) Relationships between
More informationappstats8.notebook October 11, 2016
Chapter 8 Linear Regression Objective: Students will construct and analyze a linear model for a given set of data. Fat Versus Protein: An Example pg 168 The following is a scatterplot of total fat versus
More informationAMS 7 Correlation and Regression Lecture 8
AMS 7 Correlation and Regression Lecture 8 Department of Applied Mathematics and Statistics, University of California, Santa Cruz Suumer 2014 1 / 18 Correlation pairs of continuous observations. Correlation
More informationChapter 8. Linear Regression. Copyright 2010 Pearson Education, Inc.
Chapter 8 Linear Regression Copyright 2010 Pearson Education, Inc. Fat Versus Protein: An Example The following is a scatterplot of total fat versus protein for 30 items on the Burger King menu: Copyright
More informationPS2: Two Variable Statistics
PS2: Two Variable Statistics LT2: Measuring Correlation and Line of best fit by eye. LT3: Linear regression LT4: The χ 2 test of independence. 1 Pearson's Correlation Coefficient In examinations you are
More informationChapter 3: Examining Relationships
Chapter 3: Examining Relationships Most statistical studies involve more than one variable. Often in the AP Statistics exam, you will be asked to compare two data sets by using side by side boxplots or
More informationCorrelation. A statistics method to measure the relationship between two variables. Three characteristics
Correlation Correlation A statistics method to measure the relationship between two variables Three characteristics Direction of the relationship Form of the relationship Strength/Consistency Direction
More informationStatistical View of Least Squares
May 23, 2006 Purpose of Regression Some Examples Least Squares Purpose of Regression Purpose of Regression Some Examples Least Squares Suppose we have two variables x and y Purpose of Regression Some Examples
More informationCan you tell the relationship between students SAT scores and their college grades?
Correlation One Challenge Can you tell the relationship between students SAT scores and their college grades? A: The higher SAT scores are, the better GPA may be. B: The higher SAT scores are, the lower
More informationData files for today. CourseEvalua2on2.sav pontokprediktorok.sav Happiness.sav Ca;erplot.sav
Correlation Data files for today CourseEvalua2on2.sav pontokprediktorok.sav Happiness.sav Ca;erplot.sav Defining Correlation Co-variation or co-relation between two variables These variables change together
More information9. Linear Regression and Correlation
9. Linear Regression and Correlation Data: y a quantitative response variable x a quantitative explanatory variable (Chap. 8: Recall that both variables were categorical) For example, y = annual income,
More informationLecture 3. The Population Variance. The population variance, denoted σ 2, is the sum. of the squared deviations about the population
Lecture 5 1 Lecture 3 The Population Variance The population variance, denoted σ 2, is the sum of the squared deviations about the population mean divided by the number of observations in the population,
More informationRelationships Regression
Relationships Regression BPS chapter 5 2006 W.H. Freeman and Company Objectives (BPS chapter 5) Regression Regression lines The least-squares regression line Using technology Facts about least-squares
More informationChapter 12 Summarizing Bivariate Data Linear Regression and Correlation
Chapter 1 Summarizing Bivariate Data Linear Regression and Correlation This chapter introduces an important method for making inferences about a linear correlation (or relationship) between two variables,
More informationContents. Acknowledgments. xix
Table of Preface Acknowledgments page xv xix 1 Introduction 1 The Role of the Computer in Data Analysis 1 Statistics: Descriptive and Inferential 2 Variables and Constants 3 The Measurement of Variables
More informationAP Final Review II Exploring Data (20% 30%)
AP Final Review II Exploring Data (20% 30%) Quantitative vs Categorical Variables Quantitative variables are numerical values for which arithmetic operations such as means make sense. It is usually a measure
More informationAP Statistics. Chapter 6 Scatterplots, Association, and Correlation
AP Statistics Chapter 6 Scatterplots, Association, and Correlation Objectives: Scatterplots Association Outliers Response Variable Explanatory Variable Correlation Correlation Coefficient Lurking Variables
More informationUnit 6 - Introduction to linear regression
Unit 6 - Introduction to linear regression Suggested reading: OpenIntro Statistics, Chapter 7 Suggested exercises: Part 1 - Relationship between two numerical variables: 7.7, 7.9, 7.11, 7.13, 7.15, 7.25,
More informationCRP 272 Introduction To Regression Analysis
CRP 272 Introduction To Regression Analysis 30 Relationships Among Two Variables: Interpretations One variable is used to explain another variable X Variable Independent Variable Explaining Variable Exogenous
More information11 Correlation and Regression
Chapter 11 Correlation and Regression August 21, 2017 1 11 Correlation and Regression When comparing two variables, sometimes one variable (the explanatory variable) can be used to help predict the value
More informationRegression and correlation. Correlation & Regression, I. Regression & correlation. Regression vs. correlation. Involve bivariate, paired data, X & Y
Regression and correlation Correlation & Regression, I 9.07 4/1/004 Involve bivariate, paired data, X & Y Height & weight measured for the same individual IQ & exam scores for each individual Height of
More informationReview. Midterm Exam. Midterm Review. May 6th, 2015 AMS-UCSC. Spring Session 1 (Midterm Review) AMS-5 May 6th, / 24
Midterm Exam Midterm Review AMS-UCSC May 6th, 2015 Spring 2015. Session 1 (Midterm Review) AMS-5 May 6th, 2015 1 / 24 Topics Topics We will talk about... 1 Review Spring 2015. Session 1 (Midterm Review)
More informationStat 101 Exam 1 Important Formulas and Concepts 1
1 Chapter 1 1.1 Definitions Stat 101 Exam 1 Important Formulas and Concepts 1 1. Data Any collection of numbers, characters, images, or other items that provide information about something. 2. Categorical/Qualitative
More informationCorrelation and regression
NST 1B Experimental Psychology Statistics practical 1 Correlation and regression Rudolf Cardinal & Mike Aitken 11 / 12 November 2003 Department of Experimental Psychology University of Cambridge Handouts:
More informationBiostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras
Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras Lecture - 39 Regression Analysis Hello and welcome to the course on Biostatistics
More informationMATH 1070 Introductory Statistics Lecture notes Relationships: Correlation and Simple Regression
MATH 1070 Introductory Statistics Lecture notes Relationships: Correlation and Simple Regression Objectives: 1. Learn the concepts of independent and dependent variables 2. Learn the concept of a scatterplot
More informationDraft Proof - Do not copy, post, or distribute. Chapter Learning Objectives REGRESSION AND CORRELATION THE SCATTER DIAGRAM
1 REGRESSION AND CORRELATION As we learned in Chapter 9 ( Bivariate Tables ), the differential access to the Internet is real and persistent. Celeste Campos-Castillo s (015) research confirmed the impact
More informationCorrelation and Regression
Correlation and Regression Dr. Bob Gee Dean Scott Bonney Professor William G. Journigan American Meridian University 1 Learning Objectives Upon successful completion of this module, the student should
More informationChapter 14. Statistical versus Deterministic Relationships. Distance versus Speed. Describing Relationships: Scatterplots and Correlation
Chapter 14 Describing Relationships: Scatterplots and Correlation Chapter 14 1 Statistical versus Deterministic Relationships Distance versus Speed (when travel time is constant). Income (in millions of
More informationappstats27.notebook April 06, 2017
Chapter 27 Objective Students will conduct inference on regression and analyze data to write a conclusion. Inferences for Regression An Example: Body Fat and Waist Size pg 634 Our chapter example revolves
More informationEcn Analysis of Economic Data University of California - Davis February 23, 2010 Instructor: John Parman. Midterm 2. Name: ID Number: Section:
Ecn 102 - Analysis of Economic Data University of California - Davis February 23, 2010 Instructor: John Parman Midterm 2 You have until 10:20am to complete this exam. Please remember to put your name,
More informationSESSION 5 Descriptive Statistics
SESSION 5 Descriptive Statistics Descriptive statistics are used to describe the basic features of the data in a study. They provide simple summaries about the sample and the measures. Together with simple
More informationbivariate correlation bivariate regression multiple regression
bivariate correlation bivariate regression multiple regression Today Bivariate Correlation Pearson product-moment correlation (r) assesses nature and strength of the linear relationship between two continuous
More informationTHE PEARSON CORRELATION COEFFICIENT
CORRELATION Two variables are said to have a relation if knowing the value of one variable gives you information about the likely value of the second variable this is known as a bivariate relation There
More informationWed, June 26, (Lecture 8-2). Nonlinearity. Significance test for correlation R-squared, SSE, and SST. Correlation in SPSS.
Wed, June 26, (Lecture 8-2). Nonlinearity. Significance test for correlation R-squared, SSE, and SST. Correlation in SPSS. Last time, we looked at scatterplots, which show the interaction between two variables,
More informationOverview. Overview. Overview. Specific Examples. General Examples. Bivariate Regression & Correlation
Bivariate Regression & Correlation Overview The Scatter Diagram Two Examples: Education & Prestige Correlation Coefficient Bivariate Linear Regression Line SPSS Output Interpretation Covariance ou already
More informationChapter 9: Roots and Irrational Numbers
Chapter 9: Roots and Irrational Numbers Index: A: Square Roots B: Irrational Numbers C: Square Root Functions & Shifting D: Finding Zeros by Completing the Square E: The Quadratic Formula F: Quadratic
More informationSlide 7.1. Theme 7. Correlation
Slide 7.1 Theme 7 Correlation Slide 7.2 Overview Researchers are often interested in exploring whether or not two variables are associated This lecture will consider Scatter plots Pearson correlation coefficient
More informationMidterm 2 - Solutions
Ecn 102 - Analysis of Economic Data University of California - Davis February 23, 2010 Instructor: John Parman Midterm 2 - Solutions You have until 10:20am to complete this exam. Please remember to put
More informationSampling, Frequency Distributions, and Graphs (12.1)
1 Sampling, Frequency Distributions, and Graphs (1.1) Design: Plan how to obtain the data. What are typical Statistical Methods? Collect the data, which is then subjected to statistical analysis, which
More informationMath 147 Lecture Notes: Lecture 12
Math 147 Lecture Notes: Lecture 12 Walter Carlip February, 2018 All generalizations are false, including this one.. Samuel Clemens (aka Mark Twain) (1835-1910) Figures don t lie, but liars do figure. Samuel
More informationChapter 10 Correlation and Regression
Chapter 10 Correlation and Regression 10-1 Review and Preview 10-2 Correlation 10-3 Regression 10-4 Variation and Prediction Intervals 10-5 Multiple Regression 10-6 Modeling Copyright 2010, 2007, 2004
More informationBig Data Analysis with Apache Spark UC#BERKELEY
Big Data Analysis with Apache Spark UC#BERKELEY This Lecture: Relation between Variables An association A trend» Positive association or Negative association A pattern» Could be any discernible shape»
More informationSubject CS1 Actuarial Statistics 1 Core Principles
Institute of Actuaries of India Subject CS1 Actuarial Statistics 1 Core Principles For 2019 Examinations Aim The aim of the Actuarial Statistics 1 subject is to provide a grounding in mathematical and
More informationSTA Module 5 Regression and Correlation. Learning Objectives. Learning Objectives (Cont.) Upon completing this module, you should be able to:
STA 2023 Module 5 Regression and Correlation Learning Objectives Upon completing this module, you should be able to: 1. Define and apply the concepts related to linear equations with one independent variable.
More informationMA 1125 Lecture 15 - The Standard Normal Distribution. Friday, October 6, Objectives: Introduce the standard normal distribution and table.
MA 1125 Lecture 15 - The Standard Normal Distribution Friday, October 6, 2017. Objectives: Introduce the standard normal distribution and table. 1. The Standard Normal Distribution We ve been looking at
More informationdetermine whether or not this relationship is.
Section 9-1 Correlation A correlation is a between two. The data can be represented by ordered pairs (x,y) where x is the (or ) variable and y is the (or ) variable. There are several types of correlations
More informationChapter 12 - Part I: Correlation Analysis
ST coursework due Friday, April - Chapter - Part I: Correlation Analysis Textbook Assignment Page - # Page - #, Page - # Lab Assignment # (available on ST webpage) GOALS When you have completed this lecture,
More informationBIOSTATISTICS NURS 3324
Simple Linear Regression and Correlation Introduction Previously, our attention has been focused on one variable which we designated by x. Frequently, it is desirable to learn something about the relationship
More informationChapter 2: Tools for Exploring Univariate Data
Stats 11 (Fall 2004) Lecture Note Introduction to Statistical Methods for Business and Economics Instructor: Hongquan Xu Chapter 2: Tools for Exploring Univariate Data Section 2.1: Introduction What is
More informationAn Introduction to Multilevel Models. PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 25: December 7, 2012
An Introduction to Multilevel Models PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 25: December 7, 2012 Today s Class Concepts in Longitudinal Modeling Between-Person vs. +Within-Person
More informationMATH 10 INTRODUCTORY STATISTICS
MATH 10 INTRODUCTORY STATISTICS Tommy Khoo Your friendly neighbourhood graduate student. It is Time for Homework! ( ω `) First homework + data will be posted on the website, under the homework tab. And
More informationNAME: DATE: SECTION: MRS. KEINATH
1 Vocabulary and Formulas: Correlation coefficient The correlation coefficient, r, measures the direction and strength of a linear relationship between two variables. Formula: = 1 x i x y i y r. n 1 s
More informationMATH 10 INTRODUCTORY STATISTICS
MATH 10 INTRODUCTORY STATISTICS Ramesh Yapalparvi It is Time for Homework! ( ω `) First homework + data will be posted on the website, under the homework tab. And also sent out via email. 30% weekly homework.
More informationCourse Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model
Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 1: August 22, 2012
More informationIntroduction. ECN 102: Analysis of Economic Data Winter, J. Parman (UC-Davis) Analysis of Economic Data, Winter 2011 January 4, / 51
Introduction ECN 102: Analysis of Economic Data Winter, 2011 J. Parman (UC-Davis) Analysis of Economic Data, Winter 2011 January 4, 2011 1 / 51 Contact Information Instructor: John Parman Email: jmparman@ucdavis.edu
More informationq3_3 MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.
q3_3 MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Provide an appropriate response. 1) In 2007, the number of wins had a mean of 81.79 with a standard
More informationSTAT 350 Final (new Material) Review Problems Key Spring 2016
1. The editor of a statistics textbook would like to plan for the next edition. A key variable is the number of pages that will be in the final version. Text files are prepared by the authors using LaTeX,
More information1 Correlation and Inference from Regression
1 Correlation and Inference from Regression Reading: Kennedy (1998) A Guide to Econometrics, Chapters 4 and 6 Maddala, G.S. (1992) Introduction to Econometrics p. 170-177 Moore and McCabe, chapter 12 is
More informationUnit 6 - Simple linear regression
Sta 101: Data Analysis and Statistical Inference Dr. Çetinkaya-Rundel Unit 6 - Simple linear regression LO 1. Define the explanatory variable as the independent variable (predictor), and the response variable
More informationAn introduction to plotting data
An introduction to plotting data Eric D. Black California Institute of Technology v2.0 1 Introduction Plotting data is one of the essential skills every scientist must have. We use it on a near-daily basis
More informationLinear Regression and Correlation. February 11, 2009
Linear Regression and Correlation February 11, 2009 The Big Ideas To understand a set of data, start with a graph or graphs. The Big Ideas To understand a set of data, start with a graph or graphs. If
More informationOrdinary Least Squares Regression Explained: Vartanian
Ordinary Least Squares Regression Explained: Vartanian When to Use Ordinary Least Squares Regression Analysis A. Variable types. When you have an interval/ratio scale dependent variable.. When your independent
More informationThe Simple Linear Regression Model
The Simple Linear Regression Model Lesson 3 Ryan Safner 1 1 Department of Economics Hood College ECON 480 - Econometrics Fall 2017 Ryan Safner (Hood College) ECON 480 - Lesson 3 Fall 2017 1 / 77 Bivariate
More informationLECTURE 15: SIMPLE LINEAR REGRESSION I
David Youngberg BSAD 20 Montgomery College LECTURE 5: SIMPLE LINEAR REGRESSION I I. From Correlation to Regression a. Recall last class when we discussed two basic types of correlation (positive and negative).
More informationMath Literacy. Curriculum (457 topics)
Math Literacy This course covers the topics shown below. Students navigate learning paths based on their level of readiness. Institutional users may customize the scope and sequence to meet curricular
More informationStatistical View of Least Squares
Basic Ideas Some Examples Least Squares May 22, 2007 Basic Ideas Simple Linear Regression Basic Ideas Some Examples Least Squares Suppose we have two variables x and y Basic Ideas Simple Linear Regression
More informationChapter 3: Examining Relationships
Chapter 3: Examining Relationships 3.1 Scatterplots 3.2 Correlation 3.3 Least-Squares Regression Fabric Tenacity, lb/oz/yd^2 26 25 24 23 22 21 20 19 18 y = 3.9951x + 4.5711 R 2 = 0.9454 3.5 4.0 4.5 5.0
More informationreview session gov 2000 gov 2000 () review session 1 / 38
review session gov 2000 gov 2000 () review session 1 / 38 Overview Random Variables and Probability Univariate Statistics Bivariate Statistics Multivariate Statistics Causal Inference gov 2000 () review
More informationInterpreting Correlation & Examining Cause and Effect
LESSON 15 Interpreting Correlation & Examining Cause and Effect LEARNING OBJECTIVES Today I am: exploring linear relationships between data sets. So that I can: determine if the r-value will be closer
More informationStatistical Distribution Assumptions of General Linear Models
Statistical Distribution Assumptions of General Linear Models Applied Multilevel Models for Cross Sectional Data Lecture 4 ICPSR Summer Workshop University of Colorado Boulder Lecture 4: Statistical Distributions
More informationAIM HIGH SCHOOL. Curriculum Map W. 12 Mile Road Farmington Hills, MI (248)
AIM HIGH SCHOOL Curriculum Map 2923 W. 12 Mile Road Farmington Hills, MI 48334 (248) 702-6922 www.aimhighschool.com COURSE TITLE: Statistics DESCRIPTION OF COURSE: PREREQUISITES: Algebra 2 Students will
More informationBivariate statistics: correlation
Research Methods for Political Science Bivariate statistics: correlation Dr. Thomas Chadefaux Assistant Professor in Political Science Thomas.chadefaux@tcd.ie 1 Bivariate relationships: interval-ratio
More informationChapter 27 Summary Inferences for Regression
Chapter 7 Summary Inferences for Regression What have we learned? We have now applied inference to regression models. Like in all inference situations, there are conditions that we must check. We can test
More informationTeaching a Prestatistics Course: Propelling Non-STEM Students Forward
Teaching a Prestatistics Course: Propelling Non-STEM Students Forward Jay Lehmann College of San Mateo MathNerdJay@aol.com www.pearsonhighered.com/lehmannseries Learning Is in the Details Detailing concepts
More informationRelationships between variables. Association Examples: Smoking is associated with heart disease. Weight is associated with height.
Relationships between variables. Association Examples: Smoking is associated with heart disease. Weight is associated with height. Income is associated with education. Functional relationships between
More informationIntroduction to Linear Regression
Introduction to Linear Regression James H. Steiger Department of Psychology and Human Development Vanderbilt University James H. Steiger (Vanderbilt University) Introduction to Linear Regression 1 / 46
More informationRegression Analysis. BUS 735: Business Decision Making and Research
Regression Analysis BUS 735: Business Decision Making and Research 1 Goals and Agenda Goals of this section Specific goals Learn how to detect relationships between ordinal and categorical variables. Learn
More informationADMS2320.com. We Make Stats Easy. Chapter 4. ADMS2320.com Tutorials Past Tests. Tutorial Length 1 Hour 45 Minutes
We Make Stats Easy. Chapter 4 Tutorial Length 1 Hour 45 Minutes Tutorials Past Tests Chapter 4 Page 1 Chapter 4 Note The following topics will be covered in this chapter: Measures of central location Measures
More information1 A Review of Correlation and Regression
1 A Review of Correlation and Regression SW, Chapter 12 Suppose we select n = 10 persons from the population of college seniors who plan to take the MCAT exam. Each takes the test, is coached, and then
More informationChapter 4 Data with Two Variables
Chapter 4 Data with Two Variables 1 Scatter Plots and Correlation and 2 Pearson s Correlation Coefficient Looking for Correlation Example Does the number of hours you watch TV per week impact your average
More informationt-test for b Copyright 2000 Tom Malloy. All rights reserved. Regression
t-test for b Copyright 2000 Tom Malloy. All rights reserved. Regression Recall, back some time ago, we used a descriptive statistic which allowed us to draw the best fit line through a scatter plot. We
More information8/28/2017. Both examine linear (straight line) relationships Correlation works with a pair of scores One score on each of two variables (X and Y)
PS 5101: Advanced Statistics for Psychological and Behavioral Research 1 Both examine linear (straight line) relationships Correlation works with a pair of scores One score on each of two variables ( and
More informationQuadratic Equations Part I
Quadratic Equations Part I Before proceeding with this section we should note that the topic of solving quadratic equations will be covered in two sections. This is done for the benefit of those viewing
More informationFinal Exam - Solutions
Ecn 102 - Analysis of Economic Data University of California - Davis March 17, 2010 Instructor: John Parman Final Exam - Solutions You have until 12:30pm to complete this exam. Please remember to put your
More informationSTAT 3900/4950 MIDTERM TWO Name: Spring, 2015 (print: first last ) Covered topics: Two-way ANOVA, ANCOVA, SLR, MLR and correlation analysis
STAT 3900/4950 MIDTERM TWO Name: Spring, 205 (print: first last ) Covered topics: Two-way ANOVA, ANCOVA, SLR, MLR and correlation analysis Instructions: You may use your books, notes, and SPSS/SAS. NO
More informationWarm-up Using the given data Create a scatterplot Find the regression line
Time at the lunch table Caloric intake 21.4 472 30.8 498 37.7 335 32.8 423 39.5 437 22.8 508 34.1 431 33.9 479 43.8 454 42.4 450 43.1 410 29.2 504 31.3 437 28.6 489 32.9 436 30.6 480 35.1 439 33.0 444
More informationSTP 420 INTRODUCTION TO APPLIED STATISTICS NOTES
INTRODUCTION TO APPLIED STATISTICS NOTES PART - DATA CHAPTER LOOKING AT DATA - DISTRIBUTIONS Individuals objects described by a set of data (people, animals, things) - all the data for one individual make
More informationRecall, Positive/Negative Association:
ANNOUNCEMENTS: Remember that discussion today is not for credit. Go over R Commander. Go to 192 ICS, except at 4pm, go to 192 or 174 ICS. TODAY: Sections 5.3 to 5.5. Note this is a change made in the daily
More informationCorrelation and Regression Theory 1) Multivariate Statistics
Correlation and Regression Theory 1) Multivariate Statistics What is a multivariate data set? How to statistically analyze this data set? Is there any kind of relationship between different variables in
More informationLearning Objectives for Stat 225
Learning Objectives for Stat 225 08/20/12 Introduction to Probability: Get some general ideas about probability, and learn how to use sample space to compute the probability of a specific event. Set Theory:
More information