PS2.1 & 2.2: Linear Correlations PS2: Bivariate Statistics

Size: px
Start display at page:

Download "PS2.1 & 2.2: Linear Correlations PS2: Bivariate Statistics"

Transcription

1 PS2.1 & 2.2: Linear Correlations PS2: Bivariate Statistics LT1: Basics of Correlation LT2: Measuring Correlation and Line of best fit by eye Univariate (one variable) Displays Frequency tables Bar graphs Histograms Cumulative frequency graphs Box and whisker plots Stem and leaf plots Pie charts Statistics Measures of center Mean Median Mode Measure of spread Range Interquartile range Variance Standard deviation 1

2 Bivariate Statistics Find how two variables are related Height and weight SAT scores and GPA Sleep and GPA Passing a driving test and gender Which of these are quantitative discrete, quantitative continuous or qualitative? Statistics and displays are different because we want to measure the strength of the association; the techniques vary depending on the type of data Bivariate Statistics: What to determine. Determine the type of variables (categorical, quantitative discrete / continuous). Age and length of hair Eye color and shirt brand choice. Careful not to mix quantitative and qualitative information. For the mixed forms, you would have to run an ANOVA test, which we will not be covering and isn t apart of the IB program. Sometimes students think that it would be 2 variables, but really are only 1. height and hair Color. Race and SAT scores. Determine which variable is the independent variable and which variable is the dependent variable. Independent variable-predictor variable (x-axis) Ex. Age Dependent variable-result of the predictor (y-axis) Ex. Weight 2

3 IA Talk. Picking a topic 1) When you pick a topic, you need to focus on a question that you want answered. 2) Students tend to come up with questions that are really just talking about 2 variable being tested. Rather, come up with a topic you want to explore and then use the bi-variate stats to help you answer the questions. Example: Tennis and the affects on achievement. Are Blondes happier then brunettes? Picking a topic and a question to ask. All of you should pick a topic and a questions that is relevant to something your interested in. Example: I m interested in records. Maybe I can see if people who listen to vinyl records are more creative than people who listen to digital music. What would my title be? Analog Music and Creativity What would my big questions be? Are people who listen to vinyl more creative? How would I test this? What do you think? Lets move to the IA handouts Writing Guide and Good Ideas vs. Bad Ideas. 3

4 Correlation: an overview Data - Both variables must be quantitative and continuous* Correlation - Refers to the strength and direction of the relationship between two variables Displays Scatterplots display data on a graph. Statistics - Correlation coefficient, coefficient of determination, and line of best fit help us understand the data *a correlation can be done with quantitative discrete, but gaps appear in the calculations or assumptions are made from the data. Characteristics of Correlation There are several characteristics we consider when describing the correlation between two variables: direction, linearity, strength, outliers, and causation. Direction: Upward = positive correlation. Both independent and dependent variables are increasing. Downward = Negative correlation: As the independent variable increases, the dependent variable decreases. Random: = No correlation. Strength: Strong, moderate and weak (how close are they to a perfect correlation?). 4

5 Linearity! Sometimes they are non-linear. Look. Which one is non-linear? Which one is linear? Outliers. We observe and investigate any outliers, or isolated points which do not follow the trend formed by the main body of data. If an outlier is the result of a recording or graphing error, it should be discarded. However, if the outlier proves to be a genuine piece of data, it should be kept. Sometimes an outlier shows important information on a demographic or sections of the data that isn t represented. 5

6 Causation?? Look at this example: There is a strong positive correlation between children s arm length and their running speed. Does this mean children with longer arms can run faster? Correlation does not mean causation! Blatant confounding variable? - Age Another example: In Springfield the number of stray cats where collected, and in Eugene the number of meat items at taco bell were collected over several years and a strong negative correlation was found between the variables. The implication is that as the number of stray cats decreases, the number of meat dishes increases. This could simply mean that over time, one variable increased and the other decreased. Examples of Causal assumptions. Here is one: Detroit has one of the highest arson rates in the nation. It also has one of the most employed fire districts. If we graphed the amount of arsons in surrounding districts and the amount of firefighters employed, we find that there is a strong positive correlation Detroit Total arsons: 957 Total firefighters: 830. Surrounding Districts Total arsons: 207 Total firefighters: 210. What are we saying about arsons and firefighter? Do firefighters cause more arsons? What is the confounding variable? 6

7 10/19/2017 Pearson s product-moment Correlation Coefficient It is important to get a more precise measure of the strength of linear correlation between two variables. We achieve this using Pearson s product-moment correlation coefficient r. Calculating r can be tricky, but our calculators will help us out quite a bit. For a set of n data given as ordered pairs,,,, (, ),., (, ), Pearson s correlation coefficient is = All values of PPMCC should be Positive numbers are positively correlated, while negative numbers are negatively correlated. The size of r indicates the strength. Pearson's Correlation Coefficient In examinations you are expected to calculate r using technology. Therefore, you will put the data in your calculator and run a linreg(ax+b) test to find. This method is simple and shouldn t be too much of a worry on your paper. However, calculating r using the formula is recommended for the Internal Assessment to get full marks on the mathematic process portion. For us as practice we will only be looking at a few points rather than a multitude of points due to time. The formula is: = ()() You may recognize the bottom. What does this look similar to? The top is called the covariance, which tells us what happens to a specific point compared to the mean. So if we test to see how each point is associated with the mean, we come up with, or a representation of correlation from -1 to 1. 7

8 Pearson s product-moment Correlation Coefficient The other calculation for Pearson s Correlation Coefficient: = Standard Deviation of x Standard Deviation of y Covariance of x and y. Sometimes, all we need is to find the standard deviation of x and y, with a given covariance, and we can calculate What is the precise scale for r? 1.0 = Perfect correlation Very strong Strong Moderately Strong.5 Moderate Moderately Weak For both positive and negative r values Weak Very Weak 0 Perfectly no correlation 8

9 Scatterplot Patterns: Name the correlation. Strong, positive Moderately strong, positive Moderately weak, positive No correlation Scatterplot Patterns: Continue to name the correlation. Strong, negative Moderately strong, negative Weak, negative No correlation 9

10 10/19/2017 You try. Give some examples of variables that would have a positive correlation or a negative correlation. Try to identify the independent variable and the dependent variable. Calculating by hand Using the long equation = ()() example and find the values needed to find r., Lets look at an Example: Daisy investigates how the volume of water in a pot affects the time it takes to boil on the stove. The results are given in the table. Find and interpret Pearson s correlation coefficient between the two variables. The table is on the next slide. Pot Volume (, ) Time to boil ( min) A 1 2 B 2 4 C 4 7 D

11 10/19/2017 Calculating by hand (use stat edit) = Pot ()() Volume (, ) Time to boil ( min) A 1 2, each portion section is needed to be solved. ( )( ) B 2 4 C 4 7 D 6 9 total Calculating mean of, should be a cinch! = =, = = = Try One On Your Own Period 7 s test scores are as follows as well as their IQ for 5 people. by hand, find what type of correlation there is by interpreting r. Person Score (x) IQ (y) Total Calculating mean of, should be a cinch! = =, = = ( )( ) = 11

12 Try One On Your Own Period 7 s test scores are as follows as well as their IQ for 5 people. by hand, find what type of correlation there is by interpreting r. Score Person IQ (y) ( )( ) (x) Total = = =, = = Now that you ve done this once or twice; Click Here for something amazing by an amazing person. Start at 1:21 Score IQ (y) ( )( ) (x)

13 Bug Weight (g) Length (mm) Graphing a Scatterplot 1) Enter the data in Stat Edit 2) Press 2 nd Make sure scatterplot is on and are the two lists. (also make sure nothing is in and no other stat plots are on). 3) Press zoom 9 or. 4) There you have a scatterplot! Bugs Turn on r value 1) 2 nd catalog 2) Press D, and scroll down to diagnostic on, enter, then enter. Finding the r value (Pearson s Correlation Coefficient) 1) Enter the data in Stat Edit 2) Press Stat 3) Press calculate (older = enter). 4) There you have r! : The Coefficient of Determination. To help describe the correlation between two variables, we can also calculate the coefficient of determination r 2. This is simply the square of Pearson s Product moment correlation coefficient, and as such the direction of correlation is eliminated. describes the direction of the correlation and how correlated something is. describes the type of correlation at each point. In other words, it describes the percent in which one variable will follow the correlation. How often will a given variable depend on the other variable? Do not get these confused. The IA specifically states to dock points if students get these mixed up as evidence that the student doesn t know what stands for. 13

14 Homework 11A #1, 3, 5 11B.1 #2-6 11B.2 #1-3 11B.3 P. 327 #1-4 14

PS2: Two Variable Statistics

PS2: Two Variable Statistics PS2: Two Variable Statistics LT2: Measuring Correlation and Line of best fit by eye. LT3: Linear regression LT4: The χ 2 test of independence. 1 Pearson's Correlation Coefficient In examinations you are

More information

Chapter 2: Tools for Exploring Univariate Data

Chapter 2: Tools for Exploring Univariate Data Stats 11 (Fall 2004) Lecture Note Introduction to Statistical Methods for Business and Economics Instructor: Hongquan Xu Chapter 2: Tools for Exploring Univariate Data Section 2.1: Introduction What is

More information

CS 361: Probability & Statistics

CS 361: Probability & Statistics January 24, 2018 CS 361: Probability & Statistics Relationships in data Standard coordinates If we have two quantities of interest in a dataset, we might like to plot their histograms and compare the two

More information

Shape, Outliers, Center, Spread Frequency and Relative Histograms Related to other types of graphical displays

Shape, Outliers, Center, Spread Frequency and Relative Histograms Related to other types of graphical displays Histograms: Shape, Outliers, Center, Spread Frequency and Relative Histograms Related to other types of graphical displays Sep 9 1:13 PM Shape: Skewed left Bell shaped Symmetric Bi modal Symmetric Skewed

More information

Stat 101 Exam 1 Important Formulas and Concepts 1

Stat 101 Exam 1 Important Formulas and Concepts 1 1 Chapter 1 1.1 Definitions Stat 101 Exam 1 Important Formulas and Concepts 1 1. Data Any collection of numbers, characters, images, or other items that provide information about something. 2. Categorical/Qualitative

More information

Correlation A relationship between two variables As one goes up, the other changes in a predictable way (either mostly goes up or mostly goes down)

Correlation A relationship between two variables As one goes up, the other changes in a predictable way (either mostly goes up or mostly goes down) Two-Variable Statistics Correlation A relationship between two variables As one goes up, the other changes in a predictable way (either mostly goes up or mostly goes down) Positive Correlation As one variable

More information

What is statistics? Statistics is the science of: Collecting information. Organizing and summarizing the information collected

What is statistics? Statistics is the science of: Collecting information. Organizing and summarizing the information collected What is statistics? Statistics is the science of: Collecting information Organizing and summarizing the information collected Analyzing the information collected in order to draw conclusions Two types

More information

AP Final Review II Exploring Data (20% 30%)

AP Final Review II Exploring Data (20% 30%) AP Final Review II Exploring Data (20% 30%) Quantitative vs Categorical Variables Quantitative variables are numerical values for which arithmetic operations such as means make sense. It is usually a measure

More information

Steps to take to do the descriptive part of regression analysis:

Steps to take to do the descriptive part of regression analysis: STA 2023 Simple Linear Regression: Least Squares Model Steps to take to do the descriptive part of regression analysis: A. Plot the data on a scatter plot. Describe patterns: 1. Is there a strong, moderate,

More information

MEASURING THE SPREAD OF DATA: 6F

MEASURING THE SPREAD OF DATA: 6F CONTINUING WITH DESCRIPTIVE STATS 6E,6F,6G,6H,6I MEASURING THE SPREAD OF DATA: 6F othink about this example: Suppose you are at a high school football game and you sample 40 people from the student section

More information

HUDM4122 Probability and Statistical Inference. February 2, 2015

HUDM4122 Probability and Statistical Inference. February 2, 2015 HUDM4122 Probability and Statistical Inference February 2, 2015 Special Session on SPSS Thursday, April 23 4pm-6pm As of when I closed the poll, every student except one could make it to this I am happy

More information

SCATTERPLOTS. We can talk about the correlation or relationship or association between two variables and mean the same thing.

SCATTERPLOTS. We can talk about the correlation or relationship or association between two variables and mean the same thing. SCATTERPLOTS When we want to know if there is some sort of relationship between 2 numerical variables, we can use a scatterplot. It gives a visual display of the relationship between the 2 variables. Graphing

More information

Linear Regression. Linear Regression. Linear Regression. Did You Mean Association Or Correlation?

Linear Regression. Linear Regression. Linear Regression. Did You Mean Association Or Correlation? Did You Mean Association Or Correlation? AP Statistics Chapter 8 Be careful not to use the word correlation when you really mean association. Often times people will incorrectly use the word correlation

More information

Topic 3: Introduction to Statistics. Algebra 1. Collecting Data. Table of Contents. Categorical or Quantitative? What is the Study of Statistics?!

Topic 3: Introduction to Statistics. Algebra 1. Collecting Data. Table of Contents. Categorical or Quantitative? What is the Study of Statistics?! Topic 3: Introduction to Statistics Collecting Data We collect data through observation, surveys and experiments. We can collect two different types of data: Categorical Quantitative Algebra 1 Table of

More information

MATH 10 INTRODUCTORY STATISTICS

MATH 10 INTRODUCTORY STATISTICS MATH 10 INTRODUCTORY STATISTICS Tommy Khoo Your friendly neighbourhood graduate student. Week 1 Chapter 1 Introduction What is Statistics? Why do you need to know Statistics? Technical lingo and concepts:

More information

Bivariate Data Summary

Bivariate Data Summary Bivariate Data Summary Bivariate data data that examines the relationship between two variables What individuals to the data describe? What are the variables and how are they measured Are the variables

More information

Unit Six Information. EOCT Domain & Weight: Algebra Connections to Statistics and Probability - 15%

Unit Six Information. EOCT Domain & Weight: Algebra Connections to Statistics and Probability - 15% GSE Algebra I Unit Six Information EOCT Domain & Weight: Algebra Connections to Statistics and Probability - 15% Curriculum Map: Describing Data Content Descriptors: Concept 1: Summarize, represent, and

More information

Univariate (one variable) data

Univariate (one variable) data Bivariate Data Univariate (one variable) data Involves only a single variable So cannot describe associations or relationships Descriptive Statistics Central tendencies: mean, median, mode Dispersion:

More information

8/4/2009. Describing Data with Graphs

8/4/2009. Describing Data with Graphs Describing Data with Graphs 1 A variable is a characteristic that changes or varies over time and/or for different individuals or objects under consideration. Examples: Hair color, white blood cell count,

More information

Regression and correlation. Correlation & Regression, I. Regression & correlation. Regression vs. correlation. Involve bivariate, paired data, X & Y

Regression and correlation. Correlation & Regression, I. Regression & correlation. Regression vs. correlation. Involve bivariate, paired data, X & Y Regression and correlation Correlation & Regression, I 9.07 4/1/004 Involve bivariate, paired data, X & Y Height & weight measured for the same individual IQ & exam scores for each individual Height of

More information

Statistics 1. Edexcel Notes S1. Mathematical Model. A mathematical model is a simplification of a real world problem.

Statistics 1. Edexcel Notes S1. Mathematical Model. A mathematical model is a simplification of a real world problem. Statistics 1 Mathematical Model A mathematical model is a simplification of a real world problem. 1. A real world problem is observed. 2. A mathematical model is thought up. 3. The model is used to make

More information

4.1 Introduction. 4.2 The Scatter Diagram. Chapter 4 Linear Correlation and Regression Analysis

4.1 Introduction. 4.2 The Scatter Diagram. Chapter 4 Linear Correlation and Regression Analysis 4.1 Introduction Correlation is a technique that measures the strength (or the degree) of the relationship between two variables. For example, we could measure how strong the relationship is between people

More information

UNIVERSITY OF MASSACHUSETTS Department of Biostatistics and Epidemiology BioEpi 540W - Introduction to Biostatistics Fall 2004

UNIVERSITY OF MASSACHUSETTS Department of Biostatistics and Epidemiology BioEpi 540W - Introduction to Biostatistics Fall 2004 UNIVERSITY OF MASSACHUSETTS Department of Biostatistics and Epidemiology BioEpi 50W - Introduction to Biostatistics Fall 00 Exercises with Solutions Topic Summarizing Data Due: Monday September 7, 00 READINGS.

More information

Contents. Acknowledgments. xix

Contents. Acknowledgments. xix Table of Preface Acknowledgments page xv xix 1 Introduction 1 The Role of the Computer in Data Analysis 1 Statistics: Descriptive and Inferential 2 Variables and Constants 3 The Measurement of Variables

More information

1-1. Chapter 1. Sampling and Descriptive Statistics by The McGraw-Hill Companies, Inc. All rights reserved.

1-1. Chapter 1. Sampling and Descriptive Statistics by The McGraw-Hill Companies, Inc. All rights reserved. 1-1 Chapter 1 Sampling and Descriptive Statistics 1-2 Why Statistics? Deal with uncertainty in repeated scientific measurements Draw conclusions from data Design valid experiments and draw reliable conclusions

More information

AP Statistics. Chapter 6 Scatterplots, Association, and Correlation

AP Statistics. Chapter 6 Scatterplots, Association, and Correlation AP Statistics Chapter 6 Scatterplots, Association, and Correlation Objectives: Scatterplots Association Outliers Response Variable Explanatory Variable Correlation Correlation Coefficient Lurking Variables

More information

BNG 495 Capstone Design. Descriptive Statistics

BNG 495 Capstone Design. Descriptive Statistics BNG 495 Capstone Design Descriptive Statistics Overview The overall goal of this short course in statistics is to provide an introduction to descriptive and inferential statistical methods, with a focus

More information

M 140 Test 1 B Name (1 point) SHOW YOUR WORK FOR FULL CREDIT! Problem Max. Points Your Points Total 75

M 140 Test 1 B Name (1 point) SHOW YOUR WORK FOR FULL CREDIT! Problem Max. Points Your Points Total 75 M 140 est 1 B Name (1 point) SHOW YOUR WORK FOR FULL CREDI! Problem Max. Points Your Points 1-10 10 11 10 12 3 13 4 14 18 15 8 16 7 17 14 otal 75 Multiple choice questions (1 point each) For questions

More information

PubHlth 540 Fall Summarizing Data Page 1 of 18. Unit 1 - Summarizing Data Practice Problems. Solutions

PubHlth 540 Fall Summarizing Data Page 1 of 18. Unit 1 - Summarizing Data Practice Problems. Solutions PubHlth 50 Fall 0. Summarizing Data Page of 8 Unit - Summarizing Data Practice Problems Solutions #. a. Qualitative - ordinal b. Qualitative - nominal c. Quantitative continuous, ratio d. Qualitative -

More information

What is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty.

What is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty. What is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty. Statistics is a field of study concerned with the data collection,

More information

Relationships between variables. Visualizing Bivariate Distributions: Scatter Plots

Relationships between variables. Visualizing Bivariate Distributions: Scatter Plots SFBS Course Notes Part 7: Correlation Bivariate relationships (p. 1) Linear transformations (p. 3) Pearson r : Measuring a relationship (p. 5) Interpretation of correlations (p. 10) Relationships between

More information

AP Statistics Two-Variable Data Analysis

AP Statistics Two-Variable Data Analysis AP Statistics Two-Variable Data Analysis Key Ideas Scatterplots Lines of Best Fit The Correlation Coefficient Least Squares Regression Line Coefficient of Determination Residuals Outliers and Influential

More information

= n 1. n 1. Measures of Variability. Sample Variance. Range. Sample Standard Deviation ( ) 2. Chapter 2 Slides. Maurice Geraghty

= n 1. n 1. Measures of Variability. Sample Variance. Range. Sample Standard Deviation ( ) 2. Chapter 2 Slides. Maurice Geraghty Chapter Slides Inferential Statistics and Probability a Holistic Approach Chapter Descriptive Statistics This Course Material by Maurice Geraghty is licensed under a Creative Commons Attribution-ShareAlike.

More information

Introduction to Statistics

Introduction to Statistics Introduction to Statistics Data and Statistics Data consists of information coming from observations, counts, measurements, or responses. Statistics is the science of collecting, organizing, analyzing,

More information

STATISTICS 1 REVISION NOTES

STATISTICS 1 REVISION NOTES STATISTICS 1 REVISION NOTES Statistical Model Representing and summarising Sample Data Key words: Quantitative Data This is data in NUMERICAL FORM such as shoe size, height etc. Qualitative Data This is

More information

TOPIC: Descriptive Statistics Single Variable

TOPIC: Descriptive Statistics Single Variable TOPIC: Descriptive Statistics Single Variable I. Numerical data summary measurements A. Measures of Location. Measures of central tendency Mean; Median; Mode. Quantiles - measures of noncentral tendency

More information

Further Mathematics 2018 CORE: Data analysis Chapter 2 Summarising numerical data

Further Mathematics 2018 CORE: Data analysis Chapter 2 Summarising numerical data Chapter 2: Summarising numerical data Further Mathematics 2018 CORE: Data analysis Chapter 2 Summarising numerical data Extract from Study Design Key knowledge Types of data: categorical (nominal and ordinal)

More information

Chapter 6. September 17, Please pick up a calculator and take out paper and something to write with. Association and Correlation.

Chapter 6. September 17, Please pick up a calculator and take out paper and something to write with. Association and Correlation. Please pick up a calculator and take out paper and something to write with. Sep 17 8:08 AM Chapter 6 Scatterplots, Association and Correlation Copyright 2015, 2010, 2007 Pearson Education, Inc. Chapter

More information

, (1) e i = ˆσ 1 h ii. c 2016, Jeffrey S. Simonoff 1

, (1) e i = ˆσ 1 h ii. c 2016, Jeffrey S. Simonoff 1 Regression diagnostics As is true of all statistical methodologies, linear regression analysis can be a very effective way to model data, as along as the assumptions being made are true. For the regression

More information

MATH 10 INTRODUCTORY STATISTICS

MATH 10 INTRODUCTORY STATISTICS MATH 10 INTRODUCTORY STATISTICS Tommy Khoo Your friendly neighbourhood graduate student. It is Time for Homework! ( ω `) First homework + data will be posted on the website, under the homework tab. And

More information

Chapter 8. Linear Regression. Copyright 2010 Pearson Education, Inc.

Chapter 8. Linear Regression. Copyright 2010 Pearson Education, Inc. Chapter 8 Linear Regression Copyright 2010 Pearson Education, Inc. Fat Versus Protein: An Example The following is a scatterplot of total fat versus protein for 30 items on the Burger King menu: Copyright

More information

Slide 7.1. Theme 7. Correlation

Slide 7.1. Theme 7. Correlation Slide 7.1 Theme 7 Correlation Slide 7.2 Overview Researchers are often interested in exploring whether or not two variables are associated This lecture will consider Scatter plots Pearson correlation coefficient

More information

2. LECTURE 2. Objectives

2. LECTURE 2. Objectives 2. LECTURE 2 Objectives I understand the distinction between independent variable(s) and the corresponding dependent variable as well as why that distinction was chosen for the situation. I can define

More information

Histograms allow a visual interpretation

Histograms allow a visual interpretation Chapter 4: Displaying and Summarizing i Quantitative Data s allow a visual interpretation of quantitative (numerical) data by indicating the number of data points that lie within a range of values, called

More information

STA220H1F Term Test Oct 26, Last Name: First Name: Student #: TA s Name: or Tutorial Room:

STA220H1F Term Test Oct 26, Last Name: First Name: Student #: TA s Name: or Tutorial Room: STA0HF Term Test Oct 6, 005 Last Name: First Name: Student #: TA s Name: or Tutorial Room: Time allowed: hour and 45 minutes. Aids: one sided handwritten aid sheet + non-programmable calculator Statistical

More information

Math 223 Lecture Notes 3/15/04 From The Basic Practice of Statistics, bymoore

Math 223 Lecture Notes 3/15/04 From The Basic Practice of Statistics, bymoore Math 223 Lecture Notes 3/15/04 From The Basic Practice of Statistics, bymoore Chapter 3 continued Describing distributions with numbers Measuring spread of data: Quartiles Definition 1: The interquartile

More information

review session gov 2000 gov 2000 () review session 1 / 38

review session gov 2000 gov 2000 () review session 1 / 38 review session gov 2000 gov 2000 () review session 1 / 38 Overview Random Variables and Probability Univariate Statistics Bivariate Statistics Multivariate Statistics Causal Inference gov 2000 () review

More information

Review of Multiple Regression

Review of Multiple Regression Ronald H. Heck 1 Let s begin with a little review of multiple regression this week. Linear models [e.g., correlation, t-tests, analysis of variance (ANOVA), multiple regression, path analysis, multivariate

More information

Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model

Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model EPSY 905: Multivariate Analysis Lecture 1 20 January 2016 EPSY 905: Lecture 1 -

More information

Performance of fourth-grade students on an agility test

Performance of fourth-grade students on an agility test Starter Ch. 5 2005 #1a CW Ch. 4: Regression L1 L2 87 88 84 86 83 73 81 67 78 83 65 80 50 78 78? 93? 86? Create a scatterplot Find the equation of the regression line Predict the scores Chapter 5: Understanding

More information

Chapter 1 Introduction & 1.1: Analyzing Categorical Data

Chapter 1 Introduction & 1.1: Analyzing Categorical Data Chapter 1 Chapter 1 Introduction & 1.1: Analyzing Categorical Data Population Sample Make an inference about the population. Collect data from a representative sample... Perform Data Analysis, keeping

More information

Describing Bivariate Relationships

Describing Bivariate Relationships Describing Bivariate Relationships Bivariate Relationships What is Bivariate data? When exploring/describing a bivariate (x,y) relationship: Determine the Explanatory and Response variables Plot the data

More information

REVIEW 8/2/2017 陈芳华东师大英语系

REVIEW 8/2/2017 陈芳华东师大英语系 REVIEW Hypothesis testing starts with a null hypothesis and a null distribution. We compare what we have to the null distribution, if the result is too extreme to belong to the null distribution (p

More information

Session 4 2:40 3:30. If neither the first nor second differences repeat, we need to try another

Session 4 2:40 3:30. If neither the first nor second differences repeat, we need to try another Linear Quadratics & Exponentials using Tables We can classify a table of values as belonging to a particular family of functions based on the math operations found on any calculator. First differences

More information

THE PEARSON CORRELATION COEFFICIENT

THE PEARSON CORRELATION COEFFICIENT CORRELATION Two variables are said to have a relation if knowing the value of one variable gives you information about the likely value of the second variable this is known as a bivariate relation There

More information

appstats8.notebook October 11, 2016

appstats8.notebook October 11, 2016 Chapter 8 Linear Regression Objective: Students will construct and analyze a linear model for a given set of data. Fat Versus Protein: An Example pg 168 The following is a scatterplot of total fat versus

More information

ADMS2320.com. We Make Stats Easy. Chapter 4. ADMS2320.com Tutorials Past Tests. Tutorial Length 1 Hour 45 Minutes

ADMS2320.com. We Make Stats Easy. Chapter 4. ADMS2320.com Tutorials Past Tests. Tutorial Length 1 Hour 45 Minutes We Make Stats Easy. Chapter 4 Tutorial Length 1 Hour 45 Minutes Tutorials Past Tests Chapter 4 Page 1 Chapter 4 Note The following topics will be covered in this chapter: Measures of central location Measures

More information

Describing distributions with numbers

Describing distributions with numbers Describing distributions with numbers A large number or numerical methods are available for describing quantitative data sets. Most of these methods measure one of two data characteristics: The central

More information

CRP 272 Introduction To Regression Analysis

CRP 272 Introduction To Regression Analysis CRP 272 Introduction To Regression Analysis 30 Relationships Among Two Variables: Interpretations One variable is used to explain another variable X Variable Independent Variable Explaining Variable Exogenous

More information

Simple Regression Model. January 24, 2011

Simple Regression Model. January 24, 2011 Simple Regression Model January 24, 2011 Outline Descriptive Analysis Causal Estimation Forecasting Regression Model We are actually going to derive the linear regression model in 3 very different ways

More information

Chapter 7 Linear Regression

Chapter 7 Linear Regression Chapter 7 Linear Regression 1 7.1 Least Squares: The Line of Best Fit 2 The Linear Model Fat and Protein at Burger King The correlation is 0.76. This indicates a strong linear fit, but what line? The line

More information

Elementary Statistics

Elementary Statistics Elementary Statistics Q: What is data? Q: What does the data look like? Q: What conclusions can we draw from the data? Q: Where is the middle of the data? Q: Why is the spread of the data important? Q:

More information

Vocabulary: Samples and Populations

Vocabulary: Samples and Populations Vocabulary: Samples and Populations Concept Different types of data Categorical data results when the question asked in a survey or sample can be answered with a nonnumerical answer. For example if we

More information

M & M Project. Think! Crunch those numbers! Answer!

M & M Project. Think! Crunch those numbers! Answer! M & M Project Think! Crunch those numbers! Answer! Chapters 1-2 Exploring Data and Describing Location in a Distribution Univariate Data: Length Stemplot and Frequency Table Stem (Units Digit) 0 1 1 Leaf

More information

Section 5.4. Ken Ueda

Section 5.4. Ken Ueda Section 5.4 Ken Ueda Students seem to think that being graded on a curve is a positive thing. I took lasers 101 at Cornell and got a 92 on the exam. The average was a 93. I ended up with a C on the test.

More information

Reminder: Univariate Data. Bivariate Data. Example: Puppy Weights. You weigh the pups and get these results: 2.5, 3.5, 3.3, 3.1, 2.6, 3.6, 2.

Reminder: Univariate Data. Bivariate Data. Example: Puppy Weights. You weigh the pups and get these results: 2.5, 3.5, 3.3, 3.1, 2.6, 3.6, 2. TP: To review Standard Deviation, Residual Plots, and Correlation Coefficients HW: Do a journal entry on each of the calculator tricks in this lesson. Lesson slides will be posted with notes. Do Now: Write

More information

DETAILED CONTENTS PART I INTRODUCTION AND DESCRIPTIVE STATISTICS. 1. Introduction to Statistics

DETAILED CONTENTS PART I INTRODUCTION AND DESCRIPTIVE STATISTICS. 1. Introduction to Statistics DETAILED CONTENTS About the Author Preface to the Instructor To the Student How to Use SPSS With This Book PART I INTRODUCTION AND DESCRIPTIVE STATISTICS 1. Introduction to Statistics 1.1 Descriptive and

More information

PS5: Two Variable Statistics LT3: Linear regression LT4: The test of independence.

PS5: Two Variable Statistics LT3: Linear regression LT4: The test of independence. PS5: Two Variable Statistics LT3: Linear regression LT4: The test of independence. Example by eye. On a hot day, nine cars were left in the sun in a car parking lot. The length of time each car was left

More information

Chapter 7. Scatterplots, Association, and Correlation. Copyright 2010 Pearson Education, Inc.

Chapter 7. Scatterplots, Association, and Correlation. Copyright 2010 Pearson Education, Inc. Chapter 7 Scatterplots, Association, and Correlation Copyright 2010 Pearson Education, Inc. Looking at Scatterplots Scatterplots may be the most common and most effective display for data. In a scatterplot,

More information

Unit 2. Describing Data: Numerical

Unit 2. Describing Data: Numerical Unit 2 Describing Data: Numerical Describing Data Numerically Describing Data Numerically Central Tendency Arithmetic Mean Median Mode Variation Range Interquartile Range Variance Standard Deviation Coefficient

More information

Chapter 6 The Normal Distribution

Chapter 6 The Normal Distribution Chapter 6 The Normal PSY 395 Oswald Outline s and area The normal distribution The standard normal distribution Setting probable limits on a score/observation Measures related to 2 s and Area The idea

More information

5.1 Bivariate Relationships

5.1 Bivariate Relationships Chapter 5 Summarizing Bivariate Data Source: TPS 5.1 Bivariate Relationships What is Bivariate data? When exploring/describing a bivariate (x,y) relationship: Determine the Explanatory and Response variables

More information

Correlation and regression

Correlation and regression NST 1B Experimental Psychology Statistics practical 1 Correlation and regression Rudolf Cardinal & Mike Aitken 11 / 12 November 2003 Department of Experimental Psychology University of Cambridge Handouts:

More information

Chapter 19 Sir Migo Mendoza

Chapter 19 Sir Migo Mendoza The Linear Regression Chapter 19 Sir Migo Mendoza Linear Regression and the Line of Best Fit Lesson 19.1 Sir Migo Mendoza Question: Once we have a Linear Relationship, what can we do with it? Something

More information

Chapter 8. Linear Regression /71

Chapter 8. Linear Regression /71 Chapter 8 Linear Regression 1 /71 Homework p192 1, 2, 3, 5, 7, 13, 15, 21, 27, 28, 29, 32, 35, 37 2 /71 3 /71 Objectives Determine Least Squares Regression Line (LSRL) describing the association of two

More information

STT 315 This lecture is based on Chapter 2 of the textbook.

STT 315 This lecture is based on Chapter 2 of the textbook. STT 315 This lecture is based on Chapter 2 of the textbook. Acknowledgement: Author is thankful to Dr. Ashok Sinha, Dr. Jennifer Kaplan and Dr. Parthanil Roy for allowing him to use/edit some of their

More information

Chapter 6 The Standard Deviation as a Ruler and the Normal Model

Chapter 6 The Standard Deviation as a Ruler and the Normal Model Chapter 6 The Standard Deviation as a Ruler and the Normal Model Overview Key Concepts Understand how adding (subtracting) a constant or multiplying (dividing) by a constant changes the center and/or spread

More information

y n 1 ( x i x )( y y i n 1 i y 2

y n 1 ( x i x )( y y i n 1 i y 2 STP3 Brief Class Notes Instructor: Ela Jackiewicz Chapter Regression and Correlation In this chapter we will explore the relationship between two quantitative variables, X an Y. We will consider n ordered

More information

The science of learning from data.

The science of learning from data. STATISTICS (PART 1) The science of learning from data. Numerical facts Collection of methods for planning experiments, obtaining data and organizing, analyzing, interpreting and drawing the conclusions

More information

Math 147 Lecture Notes: Lecture 12

Math 147 Lecture Notes: Lecture 12 Math 147 Lecture Notes: Lecture 12 Walter Carlip February, 2018 All generalizations are false, including this one.. Samuel Clemens (aka Mark Twain) (1835-1910) Figures don t lie, but liars do figure. Samuel

More information

Lecture 1: Descriptive Statistics

Lecture 1: Descriptive Statistics Lecture 1: Descriptive Statistics MSU-STT-351-Sum 15 (P. Vellaisamy: MSU-STT-351-Sum 15) Probability & Statistics for Engineers 1 / 56 Contents 1 Introduction 2 Branches of Statistics Descriptive Statistics

More information

8/28/2017. Both examine linear (straight line) relationships Correlation works with a pair of scores One score on each of two variables (X and Y)

8/28/2017. Both examine linear (straight line) relationships Correlation works with a pair of scores One score on each of two variables (X and Y) PS 5101: Advanced Statistics for Psychological and Behavioral Research 1 Both examine linear (straight line) relationships Correlation works with a pair of scores One score on each of two variables ( and

More information

STATISTICS/MATH /1760 SHANNON MYERS

STATISTICS/MATH /1760 SHANNON MYERS STATISTICS/MATH 103 11/1760 SHANNON MYERS π 100 POINTS POSSIBLE π YOUR WORK MUST SUPPORT YOUR ANSWER FOR FULL CREDIT TO BE AWARDED π YOU MAY USE A SCIENTIFIC AND/OR A TI-83/84/85/86 CALCULATOR ONCE YOU

More information

Arvind Borde / MAT , Week 5: Relationships I

Arvind Borde / MAT , Week 5: Relationships I Arvind Borde / MAT 19.001, Week 5: Relationships I 1 Review of Standard Deviation Population (N observations) Sample (sample size n) (xi µ) σ = (xi x) s = N n 1 µ = mean x = mean Where are most of the

More information

STA Module 5 Regression and Correlation. Learning Objectives. Learning Objectives (Cont.) Upon completing this module, you should be able to:

STA Module 5 Regression and Correlation. Learning Objectives. Learning Objectives (Cont.) Upon completing this module, you should be able to: STA 2023 Module 5 Regression and Correlation Learning Objectives Upon completing this module, you should be able to: 1. Define and apply the concepts related to linear equations with one independent variable.

More information

Statistical Methods. by Robert W. Lindeman WPI, Dept. of Computer Science

Statistical Methods. by Robert W. Lindeman WPI, Dept. of Computer Science Statistical Methods by Robert W. Lindeman WPI, Dept. of Computer Science gogo@wpi.edu Descriptive Methods Frequency distributions How many people were similar in the sense that according to the dependent

More information

Probability Distributions

Probability Distributions CONDENSED LESSON 13.1 Probability Distributions In this lesson, you Sketch the graph of the probability distribution for a continuous random variable Find probabilities by finding or approximating areas

More information

Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model

Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 1: August 22, 2012

More information

HOMEWORK (due Wed, Jan 23): Chapter 3: #42, 48, 74

HOMEWORK (due Wed, Jan 23): Chapter 3: #42, 48, 74 ANNOUNCEMENTS: Grades available on eee for Week 1 clickers, Quiz and Discussion. If your clicker grade is missing, check next week before contacting me. If any other grades are missing let me know now.

More information

Relationships Regression

Relationships Regression Relationships Regression BPS chapter 5 2006 W.H. Freeman and Company Objectives (BPS chapter 5) Regression Regression lines The least-squares regression line Using technology Facts about least-squares

More information

Chapter 8. Linear Regression. The Linear Model. Fat Versus Protein: An Example. The Linear Model (cont.) Residuals

Chapter 8. Linear Regression. The Linear Model. Fat Versus Protein: An Example. The Linear Model (cont.) Residuals Chapter 8 Linear Regression Copyright 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide 8-1 Copyright 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Fat Versus

More information

Chapter 6 Scatterplots, Association and Correlation

Chapter 6 Scatterplots, Association and Correlation Chapter 6 Scatterplots, Association and Correlation Looking for Correlation Example Does the number of hours you watch TV per week impact your average grade in a class? Hours 12 10 5 3 15 16 8 Grade 70

More information

1. Create a scatterplot of this data. 2. Find the correlation coefficient.

1. Create a scatterplot of this data. 2. Find the correlation coefficient. How Fast Foods Compare Company Entree Total Calories Fat (grams) McDonald s Big Mac 540 29 Filet o Fish 380 18 Burger King Whopper 670 40 Big Fish Sandwich 640 32 Wendy s Single Burger 470 21 1. Create

More information

Describing Distributions With Numbers

Describing Distributions With Numbers Describing Distributions With Numbers October 24, 2012 What Do We Usually Summarize? Measures of Center. Percentiles. Measures of Spread. A Summary Statement. Choosing Numerical Summaries. 1.0 What Do

More information

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages: Glossary The ISI glossary of statistical terms provides definitions in a number of different languages: http://isi.cbs.nl/glossary/index.htm Adjusted r 2 Adjusted R squared measures the proportion of the

More information

Descriptive Univariate Statistics and Bivariate Correlation

Descriptive Univariate Statistics and Bivariate Correlation ESC 100 Exploring Engineering Descriptive Univariate Statistics and Bivariate Correlation Instructor: Sudhir Khetan, Ph.D. Wednesday/Friday, October 17/19, 2012 The Central Dogma of Statistics used to

More information

HOLLOMAN S AP STATISTICS BVD CHAPTER 08, PAGE 1 OF 11. Figure 1 - Variation in the Response Variable

HOLLOMAN S AP STATISTICS BVD CHAPTER 08, PAGE 1 OF 11. Figure 1 - Variation in the Response Variable Chapter 08: Linear Regression There are lots of ways to model the relationships between variables. It is important that you not think that what we do is the way. There are many paths to the summit We are

More information

appstats27.notebook April 06, 2017

appstats27.notebook April 06, 2017 Chapter 27 Objective Students will conduct inference on regression and analyze data to write a conclusion. Inferences for Regression An Example: Body Fat and Waist Size pg 634 Our chapter example revolves

More information

Regression, part II. I. What does it all mean? A) Notice that so far all we ve done is math.

Regression, part II. I. What does it all mean? A) Notice that so far all we ve done is math. Regression, part II I. What does it all mean? A) Notice that so far all we ve done is math. 1) One can calculate the Least Squares Regression Line for anything, regardless of any assumptions. 2) But, if

More information

MATH 10 INTRODUCTORY STATISTICS

MATH 10 INTRODUCTORY STATISTICS MATH 10 INTRODUCTORY STATISTICS Ramesh Yapalparvi It is Time for Homework! ( ω `) First homework + data will be posted on the website, under the homework tab. And also sent out via email. 30% weekly homework.

More information