The response variable depends on the explanatory variable.
|
|
- Branden Edwards
- 5 years ago
- Views:
Transcription
1 A response variable measures an outcome of study. > dependent variables An explanatory variable attempts to explain the observed outcomes. > independent variables The response variable depends on the explanatory variable. Example: We think that car weight helps explain accident deaths. Explanatory variable: car weight Response variable: accident death rate 1
2 A scatterplot is the most effective way to display the relationship between two quantitative variables measured on the same individuals. Values of one variable appear on the horizontal axis and values of the other variable appear on the vertical axis. Each individual in the data appears as a point in the graph. Always plot the explanatory variable (if there is one) on the horizontal axis (x-axis). If there is no explanatory-response distinction, either variable can go on the horizontal axis. 2
3 3
4 4
5 Examining a Scatterplot In any graph of data, look for the overall pattern and for striking deviations from that pattern. You can describe the overall pattern of a scatterplot by the form, direction, and strength of the relationship. An important kind of deviation is an outlier, an individual that falls outside of the overall pattern of the relationship. Form: shape of scatterplot 5
6 Interpret the scatterplot to the right. Direction: Decreases from left to right. The higer percentage of people taking the SAT, the lower the mean math score was. There is a negative association. Form: The relationship is slightly curved. Clusters/gaps In about half the states, less than 25% took the SAT, and the other half more than 40% took it. Strength: Moderately strong. States with similar percentage of people taking the SAT tend to have similar mean math scores. Outliers: There appears to be two outliers: (20, 500) and (88, 460). 6
7 Describe what the scatterplot reveals about the relationship between body weight and backpack weight. (Direction, Form, Strength, Outliers) *Hint: First describe the general pattern. Then identify any deviations from the pattern. 7
8 Positive Association, Negative Association Two variables are positively associated when above average values of one tend to accompany above average values of the other. And below average values also tend to occur together. Examples: Positive Association: Backpack weight generally increases as body weight increases Negative Association: The mean SAT score goes down as the percent of graduates taking the test increases. 8
9 Thursday Oct. 20th 9
10 Tuesday October 27th 1. You have data for many years on the average price of a barrel of oil and the average retail price of a gallon of gas. If you want to see how well the price of oil predicts the price of gas, then you should make a scatterplot with as the explanatory variable. a) the price of oil b) the price of gas c) the year d) either oil price or gas price e) time 2. A study was designed to determine if smoking influences life expectancy. What will the explanatory and response variables in this study be? 10
11 1. Describe the direction of the relationship. Explain why this makes sense. 2. What form does the relationship take? Why are there two clusters of points? 11
12 1. Describe the direction of the relationship. Explain why this makes sense. Positive Association. The longer the duration, the longer the interval. 2. What form does the relationship take? Why are there two clusters of points? Roughly linear. There are two clusters around 2 and 4.5, Most eruptions fall into two categories shorter (around 2 minutes) and longer (around 4.5 minutes). 3. How strong is the relationships? justify your answer. Fairly strong. The points don't deviate from a linear form too much. 4. Are there any outliers? There are a couple that could be but for the most part they are all in the overall pattern. 5. What information does the family need to predict when the next eruption will occur? The duration of the previous eruption. 12
13 The two scatterplots above show the same data set using two different scales. Since it's easy to be fooled by different scales or amount of space around points in a scatterplot, we need a numerical measure to supplement the graph. 13
14 Correlation The correlation (r) measures the direction and strength of the linear relationship between two quantitative variables 14
15 Correlation The correlation (r) measures the direction and strength of the linear relationship between two quantitative variables Suppose that we have data on variables x and y for n individuals. The values for the first individual are x 1 and y 1, the values for the second individual are x 2 and y 2. The mean and standard deviations of the two variables are and for the x values, and and for the y values. The correlation between x and y is: 15
16 Correlation The correlation (r) measures the direction and strength of the linear relationship between two quantitative variables summation: "add these terms for all individuals" mean of x values mean of y values n = sample size the x and y values for the term. standard deviation of x values standard deviation of y values 16
17 17
18 Interpreting Correlation 1. r is always a number between 1 and 1. r > 0 indicates a positive association and r < 0 indicates a negative association. r values near 0 indicate a very weak linear relationship. r = 1 and r = 1 only occur in the case of a perfect linear relationship where all points lie exactly on the line. 2. Since r uses the standardized values of the observations, r does not change when we change units of measurements of x, y, or both. 3. Correlation makes no distinction between explanatory and response variables. (Doesn't matter which variable you call x, which you call y) 4. Correlation, r, has no unit of measurement. 5. Correlation does not describe curved relationships between variables, only linear relationships. A correlation of 0 doesn't guarantee that there's no relationship, just that there's no linear relationship. 6. Correlation is not resistant: r is strongly affected by a few outlying observations. 7. Correlation is not a complete summary of two variable data. 18
19 19
20 Wednesday October 28th 1. The following scatter plot shows reading test scores against IQ test scores for 14 fifth grade students. There is one outlier in the plot, what are the scores for that child? 2. In a scatterplot of the average price of a barrel of oil and the average retail price of a gallon of gas, you expect to see... 20
21 21
22 Least Squares Regression is a method for finding a line that summarizes the relationship between two variables. A regression line is a straight line that describes how a response variable (y) changes as an explanatory variable (x) changes. A regression line is often used to predict the value of y for a given x value. Regression, unlike correlation, requires that you have an explanatory and a response variable. A regression line is a model for the data 22
23 the y intercept, the predicted value of y when x=0. (y hat): the predicted value of the response variable y for a given value of the explanatory variable x. The slope, the amount by which y is predicted to change when x increases by one unit. 23
24 Everyone knows that cars and trucks lose value the more they are driven. Can we predict the pice of a used Ford F 150 SuperCrew 4x4 if we know how many miles it has on the odometer? A random sample of 16 used F 150s was selected from among those listed for sale at autotrader.com. The number of miles driven and price (in dollars) were recorded for each of the trucks, here's the data: 24
25 Example 1: Identify the slope and y intercept from the regression line and interpret each value in the context. 25
26 Example 1: Identify the slope and y intercept from the regression line and interpret each value in the context. 26
27 Back to the Ford F 150 problem... Example 1: How much would a Ford F 150 be worth if it has 100,000 miles on it? 27
28 Example 2: How much would a Ford F 150 be worth if it has 300,000 miles on it? 28
29 Monday October 24th The distribution of scores on the Chapter 2 Test are as follows: 89, 88, 79, 89, 58, 84, 95, 79, 93, 92, 91, 94, 70, 93, 92, 87, 91, 73, 50, 91 What measure of center and spread would you choose to describe the data? Which is higher, the median or the mean? Graph the data and describe the distribution (SOCS). 29
30 30
31 Example 3: Find and interpret the residual for the Ford F 150 that had 70,583 miles driven and a price of $21,994? 31
32 The least squares regression line of y on x is the line that makes the sum of the squares of the vertical distances of the data points from the line as small as possible. 32
33 A residual plot is a scatterplot of the residuals against the explanatory variable. Residual plots help us assess whether a linear model is appropriate. The mean of the least squares residuals is always zero. A residual plot in effect turns the regression line horizontal. It magnifies the deviations of the points from the line, making it easier to see unusual observations and patterns. If the regression line captures the overall pattern of the data, there should be no pattern in the residuals. 33
34 Examining a Residual Plot 1. The residual plot should show no obvious patterns. Ideally it would look like the plot to the right. 2. A curved pattern in a residual plot shows that the relationship is NOT linear. 3. The residuals should be relatively small in size. 4. Increasing or decreasing spread bout the line as x increases indicates that a prediction of y will be less accurate for largers x values. 5. Individual points with large residuals are outliers because they lie far from the line that describes the overall pattern. 6. Individual points that are extreme in the direction of x may not have large residuals, but can be important. 34
35 An outlier is an observation that lies outside the overall pattern of the other observations. An observation is influential for a statistical calculation if removing it would markedly change the result of the calculation. Points that are outliers in the x direction of a scatterplot are often influential for the least squares regression line. 35
36 Tuesday October 25th Some data was collected on the weight of a male lab rat for the first 25 weeks after its birth. A scatterplot of the weight (in grams) and time since birth (in weeks) shows a fairly strong, positive linear relationship. The linear regression equation models the data fairly well: 1. What is the slope of the regression line? Explain what it means in context. 2. What is the y intercept? Explain what it means in context. 3. Predict the rat's weight after 16 weeks. Show your work. 4. Should you use the line to predict the rat's weight at age 2 years? 36
37 37
38 38
39 39
40 Standard Deviation of the Residuals The average prediction error (or the mean of the residuals) is 0 whenever we use the least squares regression line. That's because the positive and negative residuals "balance out". But that doesn't tell us how far off the predictions are, on average. So, we can say that our predictions are "off" by an average of. This value gives the approximate size of a "typical" or "average" prediction error (residual) 40
41 The coefficient of determination: (or "r sq") The coefficient of determiniation, is the fraction of the variation in the values of y that is accounted for by the least squares regression line of y on x. (tells us how well the least squares regression line predicts values of the response variable y) We can calcluate using the following formula: measures the total variation in the y values. is the sum of the squared errors The ratio tells us what proportion of the total variation in y still remains after using the regression line the predict the values of the response variable. *The least squares regression line accounts for % of the variation in [response variable name]. 41
42 42
43 slope=1.109; For every 1 mpg in the city, the hwy mpg is predicted to increase by mpg. y int=4.62; when the city mileage is zero, we predict a hwy mileage of 4.62 mpg. 43
44 Here's a residual plot for the least squares regression of pack weight on body weight for the 8 hikers. 44
45 Tuesday November 3rd Create a residual plot of the F 150 data 45
46 46
47 Monday Oct. 31st Refer to pg. 165 for data table 1. Calculate the standard deviation of the residuals for the F 150 problem. Interpret what it means in the context. 2. Calculate the coefficient of determination and interpret what it means in the context. 47
48 We can give the equation of the least squares regression line in terms of the means and standard deviations of the two variables and their correlation. where and We know that every least squares regression line passes through the point. 48
49 49
50 With all data: Excluding Child 18: Excluding Child 19: 6. What do you notice? 50
51 Removing child 18 has a strong influence on the position of the regression line. However, removing child 19 has little effect on the regression line. A point that is extreme in the x direction with no other points near it pulls the line toward itself. We call these points influential. 51
52 Recall: The coefficient of determination: How to interpret: "The least squares regression line accounts for % of the variation in [response variable name]." Standard Deviation of the Residuals: This value gives the approximate size of a "typical" or "average" prediction error (residual) How to interpret: "Our predictions are "off" by an average of [response variable name]." 52
53 Bottom Line: Association does NOT imply causation! 53
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships Section 3.2 The Practice of Statistics, 4 th edition For AP* STARNES, YATES, MOORE Chapter 3 Describing Relationships 3.1 Scatterplots and Correlation 3.2 Section 3.2
More informationScatterplots. 3.1: Scatterplots & Correlation. Scatterplots. Explanatory & Response Variables. Section 3.1 Scatterplots and Correlation
3.1: Scatterplots & Correlation Scatterplots A scatterplot shows the relationship between two quantitative variables measured on the same individuals. The values of one variable appear on the horizontal
More informationChapter 3: Describing Relationships
Chapter 3: Describing Relationships Section 3.2 The Practice of Statistics, 4 th edition For AP* STARNES, YATES, MOORE Chapter 3 Describing Relationships 3.1 Scatterplots and Correlation 3.2 Section 3.2
More informationChapter 3: Examining Relationships
Chapter 3: Examining Relationships Most statistical studies involve more than one variable. Often in the AP Statistics exam, you will be asked to compare two data sets by using side by side boxplots or
More information3.2: Least Squares Regressions
3.2: Least Squares Regressions Section 3.2 Least-Squares Regression After this section, you should be able to INTERPRET a regression line CALCULATE the equation of the least-squares regression line CALCULATE
More informationCHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships 3.1 Scatterplots and Correlation The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers Scatterplots and Correlation Learning
More informationBivariate Data Summary
Bivariate Data Summary Bivariate data data that examines the relationship between two variables What individuals to the data describe? What are the variables and how are they measured Are the variables
More informationScatterplots and Correlation
Bivariate Data Page 1 Scatterplots and Correlation Essential Question: What is the correlation coefficient and what does it tell you? Most statistical studies examine data on more than one variable. Fortunately,
More informationLecture 4 Scatterplots, Association, and Correlation
Lecture 4 Scatterplots, Association, and Correlation Previously, we looked at Single variables on their own One or more categorical variable In this lecture: We shall look at two quantitative variables.
More informationAP Statistics Unit 6 Note Packet Linear Regression. Scatterplots and Correlation
Scatterplots and Correlation Name Hr A scatterplot shows the relationship between two quantitative variables measured on the same individuals. variable (y) measures an outcome of a study variable (x) may
More informationLecture 4 Scatterplots, Association, and Correlation
Lecture 4 Scatterplots, Association, and Correlation Previously, we looked at Single variables on their own One or more categorical variables In this lecture: We shall look at two quantitative variables.
More informationChapter 6: Exploring Data: Relationships Lesson Plan
Chapter 6: Exploring Data: Relationships Lesson Plan For All Practical Purposes Displaying Relationships: Scatterplots Mathematical Literacy in Today s World, 9th ed. Making Predictions: Regression Line
More informationAP Statistics Bivariate Data Analysis Test Review. Multiple-Choice
Name Period AP Statistics Bivariate Data Analysis Test Review Multiple-Choice 1. The correlation coefficient measures: (a) Whether there is a relationship between two variables (b) The strength of the
More informationExamining Relationships. Chapter 3
Examining Relationships Chapter 3 Scatterplots A scatterplot shows the relationship between two quantitative variables measured on the same individuals. The explanatory variable, if there is one, is graphed
More informationChapter 3: Examining Relationships
Chapter 3 Review Chapter 3: Examining Relationships 1. A study is conducted to determine if one can predict the yield of a crop based on the amount of yearly rainfall. The response variable in this study
More informationAP STATISTICS Name: Period: Review Unit IV Scatterplots & Regressions
AP STATISTICS Name: Period: Review Unit IV Scatterplots & Regressions Know the definitions of the following words: bivariate data, regression analysis, scatter diagram, correlation coefficient, independent
More informationChapter 12 Summarizing Bivariate Data Linear Regression and Correlation
Chapter 1 Summarizing Bivariate Data Linear Regression and Correlation This chapter introduces an important method for making inferences about a linear correlation (or relationship) between two variables,
More informationChapter 8. Linear Regression /71
Chapter 8 Linear Regression 1 /71 Homework p192 1, 2, 3, 5, 7, 13, 15, 21, 27, 28, 29, 32, 35, 37 2 /71 3 /71 Objectives Determine Least Squares Regression Line (LSRL) describing the association of two
More informationChapter 6. Exploring Data: Relationships. Solutions. Exercises:
Chapter 6 Exploring Data: Relationships Solutions Exercises: 1. (a) It is more reasonable to explore study time as an explanatory variable and the exam grade as the response variable. (b) It is more reasonable
More information3.1 Scatterplots and Correlation
3.1 Scatterplots and Correlation Most statistical studies examine data on more than one variable. In many of these settings, the two variables play different roles. Explanatory variable (independent) predicts
More informationExample: Can an increase in non-exercise activity (e.g. fidgeting) help people gain less weight?
Example: Can an increase in non-exercise activity (e.g. fidgeting) help people gain less weight? 16 subjects overfed for 8 weeks Explanatory: change in energy use from non-exercise activity (calories)
More informationChapter 5 Friday, May 21st
Chapter 5 Friday, May 21 st Overview In this Chapter we will see three different methods we can use to describe a relationship between two quantitative variables. These methods are: Scatterplot Correlation
More informationChapter 7. Scatterplots, Association, and Correlation
Chapter 7 Scatterplots, Association, and Correlation Bin Zou (bzou@ualberta.ca) STAT 141 University of Alberta Winter 2015 1 / 29 Objective In this chapter, we study relationships! Instead, we investigate
More informationRelated Example on Page(s) R , 148 R , 148 R , 156, 157 R3.1, R3.2. Activity on 152, , 190.
Name Chapter 3 Learning Objectives Identify explanatory and response variables in situations where one variable helps to explain or influences the other. Make a scatterplot to display the relationship
More informationM 140 Test 1 B Name (1 point) SHOW YOUR WORK FOR FULL CREDIT! Problem Max. Points Your Points Total 75
M 140 est 1 B Name (1 point) SHOW YOUR WORK FOR FULL CREDI! Problem Max. Points Your Points 1-10 10 11 10 12 3 13 4 14 18 15 8 16 7 17 14 otal 75 Multiple choice questions (1 point each) For questions
More informationSem. 1 Review Ch. 1-3
AP Stats Sem. 1 Review Ch. 1-3 Name 1. You measure the age, marital status and earned income of an SRS of 1463 women. The number and type of variables you have measured is a. 1463; all quantitative. b.
More informationUnit 6 - Introduction to linear regression
Unit 6 - Introduction to linear regression Suggested reading: OpenIntro Statistics, Chapter 7 Suggested exercises: Part 1 - Relationship between two numerical variables: 7.7, 7.9, 7.11, 7.13, 7.15, 7.25,
More informationNov 13 AP STAT. 1. Check/rev HW 2. Review/recap of notes 3. HW: pg #5,7,8,9,11 and read/notes pg smartboad notes ch 3.
Nov 13 AP STAT 1. Check/rev HW 2. Review/recap of notes 3. HW: pg 179 184 #5,7,8,9,11 and read/notes pg 185 188 1 Chapter 3 Notes Review Exploring relationships between two variables. BIVARIATE DATA Is
More informationThe empirical ( ) rule
The empirical (68-95-99.7) rule With a bell shaped distribution, about 68% of the data fall within a distance of 1 standard deviation from the mean. 95% fall within 2 standard deviations of the mean. 99.7%
More informationObjectives. 2.3 Least-squares regression. Regression lines. Prediction and Extrapolation. Correlation and r 2. Transforming relationships
Objectives 2.3 Least-squares regression Regression lines Prediction and Extrapolation Correlation and r 2 Transforming relationships Adapted from authors slides 2012 W.H. Freeman and Company Straight Line
More informationPractice Questions for Exam 1
Practice Questions for Exam 1 1. A used car lot evaluates their cars on a number of features as they arrive in the lot in order to determine their worth. Among the features looked at are miles per gallon
More information1. Use Scenario 3-1. In this study, the response variable is
Chapter 8 Bell Work Scenario 3-1 The height (in feet) and volume (in cubic feet) of usable lumber of 32 cherry trees are measured by a researcher. The goal is to determine if volume of usable lumber can
More informationHOMEWORK (due Wed, Jan 23): Chapter 3: #42, 48, 74
ANNOUNCEMENTS: Grades available on eee for Week 1 clickers, Quiz and Discussion. If your clicker grade is missing, check next week before contacting me. If any other grades are missing let me know now.
More informationBasic Practice of Statistics 7th
Basic Practice of Statistics 7th Edition Lecture PowerPoint Slides In Chapter 4, we cover Explanatory and response variables Displaying relationships: Scatterplots Interpreting scatterplots Adding categorical
More informationDescribing Bivariate Relationships
Describing Bivariate Relationships Bivariate Relationships What is Bivariate data? When exploring/describing a bivariate (x,y) relationship: Determine the Explanatory and Response variables Plot the data
More informationIT 403 Practice Problems (2-2) Answers
IT 403 Practice Problems (2-2) Answers #1. Which of the following is correct with respect to the correlation coefficient (r) and the slope of the leastsquares regression line (Choose one)? a. They will
More informationLinear Regression Communication, skills, and understanding Calculator Use
Linear Regression Communication, skills, and understanding Title, scale and label the horizontal and vertical axes Comment on the direction, shape (form), and strength of the relationship and unusual features
More informationSampling, Frequency Distributions, and Graphs (12.1)
1 Sampling, Frequency Distributions, and Graphs (1.1) Design: Plan how to obtain the data. What are typical Statistical Methods? Collect the data, which is then subjected to statistical analysis, which
More informationChapter 7. Association, and Correlation. Scatterplots & Correlation. Scatterplots & Correlation. Stat correlation.
Stat 1010 - correlation Chapter 7 n Scatterplots, Association, and Correlation 1 n Here, we see a positive relationship between a bear s age and its neck diameter. As a bear gets older, it tends to have
More informationThe following formulas related to this topic are provided on the formula sheet:
Student Notes Prep Session Topic: Exploring Content The AP Statistics topic outline contains a long list of items in the category titled Exploring Data. Section D topics will be reviewed in this session.
More informationAnnouncements: You can turn in homework until 6pm, slot on wall across from 2202 Bren. Make sure you use the correct slot! (Stats 8, closest to wall)
Announcements: You can turn in homework until 6pm, slot on wall across from 2202 Bren. Make sure you use the correct slot! (Stats 8, closest to wall) We will cover Chs. 5 and 6 first, then 3 and 4. Mon,
More informationMathematics Level D: Lesson 2 Representations of a Line
Mathematics Level D: Lesson 2 Representations of a Line Targeted Student Outcomes Students graph a line specified by a linear function. Students graph a line specified by an initial value and rate of change
More informationDetermine is the equation of the LSRL. Determine is the equation of the LSRL of Customers in line and seconds to check out.. Chapter 3, Section 2
3.2c Computer Output, Regression to the Mean, & AP Formulas Be sure you can locate: the slope, the y intercept and determine the equation of the LSRL. Slope is always in context and context is x value.
More informationUnit 6 - Simple linear regression
Sta 101: Data Analysis and Statistical Inference Dr. Çetinkaya-Rundel Unit 6 - Simple linear regression LO 1. Define the explanatory variable as the independent variable (predictor), and the response variable
More informationCh. 3 Review - LSRL AP Stats
Ch. 3 Review - LSRL AP Stats Multiple Choice Identify the choice that best completes the statement or answers the question. Scenario 3-1 The height (in feet) and volume (in cubic feet) of usable lumber
More informationReview of Regression Basics
Review of Regression Basics When describing a Bivariate Relationship: Make a Scatterplot Strength, Direction, Form Model: y-hat=a+bx Interpret slope in context Make Predictions Residual = Observed-Predicted
More information5.1 Bivariate Relationships
Chapter 5 Summarizing Bivariate Data Source: TPS 5.1 Bivariate Relationships What is Bivariate data? When exploring/describing a bivariate (x,y) relationship: Determine the Explanatory and Response variables
More informationMrs. Poyner/Mr. Page Chapter 3 page 1
Name: Date: Period: Chapter 2: Take Home TEST Bivariate Data Part 1: Multiple Choice. (2.5 points each) Hand write the letter corresponding to the best answer in space provided on page 6. 1. In a statistics
More informationInt Math 1 Statistic and Probability. Name:
Name: Int Math 1 1. Juan wants to rent a house. He gathers data on many similar houses. The distance from the center of the city, x, and the monthly rent for each house, y, are shown in the scatter plot.
More informationScatterplots and Correlations
Scatterplots and Correlations Section 4.1 1 New Definitions Explanatory Variable: (independent, x variable): attempts to explain observed outcome. Response Variable: (dependent, y variable): measures outcome
More informationappstats8.notebook October 11, 2016
Chapter 8 Linear Regression Objective: Students will construct and analyze a linear model for a given set of data. Fat Versus Protein: An Example pg 168 The following is a scatterplot of total fat versus
More informationQ 1 = 23.8 M = Q 3 = 29.8 IQR = 6 The numbers are in order and there are 18 pieces of data so the median is the average of the 9th and 10th
Sample Exam #1, Math 01 1. Use the data set given below to answer all of the following questions. 14.0, 18.4, 1.6,.1, 3.8, 4.3, 5.9, 6.5, 7.5, 9., 9.3, 9.4, 9.7, 9.8, 30., 30.8, 31.9, 33.5 HaL Use the
More informationCorrelation. Relationship between two variables in a scatterplot. As the x values go up, the y values go down.
Correlation Relationship between two variables in a scatterplot. As the x values go up, the y values go up. As the x values go up, the y values go down. There is no relationship between the x and y values
More informationAnalyzing Lines of Fit
4.5 Analyzing Lines of Fit Essential Question How can you analytically find a line of best fit for a scatter plot? Finding a Line of Best Fit Work with a partner. The scatter plot shows the median ages
More informationRecall, Positive/Negative Association:
ANNOUNCEMENTS: Remember that discussion today is not for credit. Go over R Commander. Go to 192 ICS, except at 4pm, go to 192 or 174 ICS. TODAY: Sections 5.3 to 5.5. Note this is a change made in the daily
More informationAP Statistics Unit 2 (Chapters 7-10) Warm-Ups: Part 1
AP Statistics Unit 2 (Chapters 7-10) Warm-Ups: Part 1 2. A researcher is interested in determining if one could predict the score on a statistics exam from the amount of time spent studying for the exam.
More informationScatterplots. STAT22000 Autumn 2013 Lecture 4. What to Look in a Scatter Plot? Form of an Association
Scatterplots STAT22000 Autumn 2013 Lecture 4 Yibi Huang October 7, 2013 21 Scatterplots 22 Correlation (x 1, y 1 ) (x 2, y 2 ) (x 3, y 3 ) (x n, y n ) A scatter plot shows the relationship between two
More informationTHE PEARSON CORRELATION COEFFICIENT
CORRELATION Two variables are said to have a relation if knowing the value of one variable gives you information about the likely value of the second variable this is known as a bivariate relation There
More informationIf the roles of the variable are not clear, then which variable is placed on which axis is not important.
Chapter 6 - Scatterplots, Association, and Correlation February 6, 2015 In chapter 6-8, we look at ways to compare the relationship of 2 quantitative variables. First we will look at a graphical representation,
More informationIntro to Stats Lecture 11
Outliers and influential points Intro to Stats Lecture 11 Collect data this week! Midterm is coming! Terms X outliers: observations outlying the overall pattern of the X- variable Y outliers: observations
More information1) A residual plot: A)
1) A residual plot: A) B) C) D) E) displays residuals of the response variable versus the independent variable. displays residuals of the independent variable versus the response variable. displays residuals
More informationTest 3A AP Statistics Name:
Test 3A AP Statistics Name: Part 1: Multiple Choice. Circle the letter corresponding to the best answer. 1. Other things being equal, larger automobile engines consume more fuel. You are planning an experiment
More informationUNIT 12 ~ More About Regression
***SECTION 15.1*** The Regression Model When a scatterplot shows a relationship between a variable x and a y, we can use the fitted to the data to predict y for a given value of x. Now we want to do tests
More informationUsing a Graphing Calculator
Using a Graphing Calculator Unit 1 Assignments Bridge to Geometry Name Date Period Warm Ups Name Period Date Friday Directions: Today s Date Tuesday Directions: Today s Date Wednesday Directions: Today
More informationIF YOU HAVE DATA VALUES:
Unit 02 Review Ways to obtain a line of best fit IF YOU HAVE DATA VALUES: 1. In your calculator, choose STAT > 1.EDIT and enter your x values into L1 and your y values into L2 2. Choose STAT > CALC > 8.
More informationMultiple Choice Circle the letter corresponding to the best answer for each of the problems below (4 pts each)
Math 221 Hypothetical Exam 1, Wi2008, (Chapter 1-5 in Moore, 4th) April 3, 2063 S. K. Hyde, S. Barton, P. Hurst, K. Yan Name: Show all your work to receive credit. All answers must be justified to get
More informationMath 243 OpenStax Chapter 12 Scatterplots and Linear Regression OpenIntro Section and
Math 243 OpenStax Chapter 12 Scatterplots and Linear Regression OpenIntro Section 2.1.1 and 8.1-8.2.6 Overview Scatterplots Explanatory and Response Variables Describing Association The Regression Equation
More informationINFERENCE FOR REGRESSION
CHAPTER 3 INFERENCE FOR REGRESSION OVERVIEW In Chapter 5 of the textbook, we first encountered regression. The assumptions that describe the regression model we use in this chapter are the following. We
More informationWe will now find the one line that best fits the data on a scatter plot.
General Education Statistics Class Notes Least-Squares Regression (Section 4.2) We will now find the one line that best fits the data on a scatter plot. We have seen how two variables can be correlated
More informationMATH 1150 Chapter 2 Notation and Terminology
MATH 1150 Chapter 2 Notation and Terminology Categorical Data The following is a dataset for 30 randomly selected adults in the U.S., showing the values of two categorical variables: whether or not the
More informationLecture 3. The Population Variance. The population variance, denoted σ 2, is the sum. of the squared deviations about the population
Lecture 5 1 Lecture 3 The Population Variance The population variance, denoted σ 2, is the sum of the squared deviations about the population mean divided by the number of observations in the population,
More informationAP Statistics Two-Variable Data Analysis
AP Statistics Two-Variable Data Analysis Key Ideas Scatterplots Lines of Best Fit The Correlation Coefficient Least Squares Regression Line Coefficient of Determination Residuals Outliers and Influential
More informationAlgebra 1 Practice Test Modeling with Linear Functions Unit 6. Name Period Date
Name Period Date Vocabular: Define each word and give an example.. Correlation 2. Residual plot. Translation Short Answer: 4. Statement: If a strong correlation is present between two variables, causation
More informationAMS 7 Correlation and Regression Lecture 8
AMS 7 Correlation and Regression Lecture 8 Department of Applied Mathematics and Statistics, University of California, Santa Cruz Suumer 2014 1 / 18 Correlation pairs of continuous observations. Correlation
More informationAP Statistics. Chapter 6 Scatterplots, Association, and Correlation
AP Statistics Chapter 6 Scatterplots, Association, and Correlation Objectives: Scatterplots Association Outliers Response Variable Explanatory Variable Correlation Correlation Coefficient Lurking Variables
More informationKeller: Stats for Mgmt & Econ, 7th Ed July 17, 2006
Chapter 17 Simple Linear Regression and Correlation 17.1 Regression Analysis Our problem objective is to analyze the relationship between interval variables; regression analysis is the first tool we will
More informationLinear Regression and Correlation. February 11, 2009
Linear Regression and Correlation February 11, 2009 The Big Ideas To understand a set of data, start with a graph or graphs. The Big Ideas To understand a set of data, start with a graph or graphs. If
More informationPre-Calculus Multiple Choice Questions - Chapter S8
1 If every man married a women who was exactly 3 years younger than he, what would be the correlation between the ages of married men and women? a Somewhat negative b 0 c Somewhat positive d Nearly 1 e
More informationChapter 5 Least Squares Regression
Chapter 5 Least Squares Regression A Royal Bengal tiger wandered out of a reserve forest. We tranquilized him and want to take him back to the forest. We need an idea of his weight, but have no scale!
More informationAP Final Review II Exploring Data (20% 30%)
AP Final Review II Exploring Data (20% 30%) Quantitative vs Categorical Variables Quantitative variables are numerical values for which arithmetic operations such as means make sense. It is usually a measure
More informationStudy Guide AP Statistics
Study Guide AP Statistics Name: Part 1: Multiple Choice. Circle the letter corresponding to the best answer. 1. Other things being equal, larger automobile engines are less fuel-efficient. You are planning
More informationBIVARIATE DATA data for two variables
(Chapter 3) BIVARIATE DATA data for two variables INVESTIGATING RELATIONSHIPS We have compared the distributions of the same variable for several groups, using double boxplots and back-to-back stemplots.
More informationName Date. Answers 1.
Name Date Honors Algebra 2 Summer Work Due at Meet the Teacher Night Show all work. You will be graded on accuracy and completion. Partial credit will be given on problems where work is not shown. 1. Plot
More informationInference for Regression Inference about the Regression Model and Using the Regression Line
Inference for Regression Inference about the Regression Model and Using the Regression Line PBS Chapter 10.1 and 10.2 2009 W.H. Freeman and Company Objectives (PBS Chapter 10.1 and 10.2) Inference about
More informationAP Statistics - Chapter 2A Extra Practice
AP Statistics - Chapter 2A Extra Practice 1. A study is conducted to determine if one can predict the yield of a crop based on the amount of yearly rainfall. The response variable in this study is A) yield
More informationChapter 4 Data with Two Variables
Chapter 4 Data with Two Variables 1 Scatter Plots and Correlation and 2 Pearson s Correlation Coefficient Looking for Correlation Example Does the number of hours you watch TV per week impact your average
More informationChapter (7) Continuous Probability Distributions Examples Normal probability distribution
Chapter (7) Continuous robability Distributions Examples Normal probability distribution Example () How to find the area under the normal curve? If 50 Find & 6 60.8 50 X 60.8 6 0.8 p 6 0.5 0.464 0.964.8
More informationChapter 2 Modeling with Linear Functions
Chapter Modeling with Linear Functions Homework.1 1. a. b. c. In 199, t = 1. Notice on the scattergram, when t = 1, p is approximately 4.5. Therefore, 4.5% of deaths due to car accidents in 199 were due
More informationFish act Water temp
A regression of the amount of calories in a serving of breakfast cereal vs. the amount of fat gave the following results: Calories = 97.53 + 9.6525(Fat). Which of the following is FALSE? a) It is estimated
More informationMath 223 Lecture Notes 3/15/04 From The Basic Practice of Statistics, bymoore
Math 223 Lecture Notes 3/15/04 From The Basic Practice of Statistics, bymoore Chapter 3 continued Describing distributions with numbers Measuring spread of data: Quartiles Definition 1: The interquartile
More informationWhat is the easiest way to lose points when making a scatterplot?
Day #1: Read 141-142 3.1 Describing Relationships Why do we study relationships between two variables? Read 143-144 Page 144: Check Your Understanding Read 144-149 How do you know which variable to put
More informationMini-Lecture 4.1 Scatter Diagrams and Correlation
Mini-Lecture 4.1 Scatter Diagrams and Correlation Objectives 1. Draw and interpret scatter diagrams 2. Describe the properties of the linear correlation coefficient 3. Compute and interpret the linear
More informationChapter 5: Exploring Data: Distributions Lesson Plan
Lesson Plan Exploring Data Displaying Distributions: Histograms Interpreting Histograms Displaying Distributions: Stemplots Describing Center: Mean and Median Describing Variability: The Quartiles The
More informationScatterplots and Correlation
Scatterplots and Correlation Al Nosedal University of Toronto Summer 2017 Al Nosedal University of Toronto Scatterplots and Correlation Summer 2017 1 / 65 My momma always said: Life was like a box of chocolates.
More informationChapter 4 Data with Two Variables
Chapter 4 Data with Two Variables 1 Scatter Plots and Correlation and 2 Pearson s Correlation Coefficient Looking for Correlation Example Does the number of hours you watch TV per week impact your average
More informationEstimating a Population Mean
Estimating a Population Mean MATH 130, Elements of Statistics I J. Robert Buchanan Department of Mathematics Fall 2017 Objectives At the end of this lesson we will be able to: obtain a point estimate for
More informationChapter Goals. To understand the methods for displaying and describing relationship among variables. Formulate Theories.
Chapter Goals To understand the methods for displaying and describing relationship among variables. Formulate Theories Interpret Results/Make Decisions Collect Data Summarize Results Chapter 7: Is There
More informationMATH 1070 Introductory Statistics Lecture notes Relationships: Correlation and Simple Regression
MATH 1070 Introductory Statistics Lecture notes Relationships: Correlation and Simple Regression Objectives: 1. Learn the concepts of independent and dependent variables 2. Learn the concept of a scatterplot
More informationChapter (4) Discrete Probability Distributions Examples
Chapter (4) Discrete Probability Distributions Examples Example () Two balanced dice are rolled. Let X be the sum of the two dice. Obtain the probability distribution of X. Solution When the two balanced
More informationy = a + bx 12.1: Inference for Linear Regression Review: General Form of Linear Regression Equation Review: Interpreting Computer Regression Output
12.1: Inference for Linear Regression Review: General Form of Linear Regression Equation y = a + bx y = dependent variable a = intercept b = slope x = independent variable Section 12.1 Inference for Linear
More information