Stat 101 L: Laboratory 5

Similar documents
AP Statistics L I N E A R R E G R E S S I O N C H A P 7

Announcements: You can turn in homework until 6pm, slot on wall across from 2202 Bren. Make sure you use the correct slot! (Stats 8, closest to wall)

appstats8.notebook October 11, 2016

STATISTICS 1 REVISION NOTES

The data in this answer key is sample data only. Student answers will vary based on personal data.

MEASUREMENT VARIATION

Lecture 18: Simple Linear Regression

ASSIGNMENT 3 SIMPLE LINEAR REGRESSION. Old Faithful

Stat 101 Exam 1 Important Formulas and Concepts 1

WISE Regression/Correlation Interactive Lab. Introduction to the WISE Correlation/Regression Applet

Chapter 3: Examining Relationships

Chapter 4 Describing the Relation between Two Variables

BIOSTATISTICS NURS 3324

Section 11: Quantitative analyses: Linear relationships among variables

Linear Regression. Linear Regression. Linear Regression. Did You Mean Association Or Correlation?

B.U.G. Newsletter. As one semester comes to an end by Jennifer L. Brown. December WÜA UÜÉãÇ

Determination of Density 1

Chapter 8. Linear Regression. The Linear Model. Fat Versus Protein: An Example. The Linear Model (cont.) Residuals

Describing the Relationship between Two Variables

Determine is the equation of the LSRL. Determine is the equation of the LSRL of Customers in line and seconds to check out.. Chapter 3, Section 2

Part Possible Score Base 5 5 MC Total 50

Chapter 5: Ordinary Least Squares Estimation Procedure The Mechanics Chapter 5 Outline Best Fitting Line Clint s Assignment Simple Regression Model o

Chapter 2: Looking at Data Relationships (Part 3)

MATH 1150 Chapter 2 Notation and Terminology

The following formulas related to this topic are provided on the formula sheet:

Temperature measurement

SMA 6304 / MIT / MIT Manufacturing Systems. Lecture 10: Data and Regression Analysis. Lecturer: Prof. Duane S. Boning

Topic 10 - Linear Regression

Inferences for Regression

Ch Inference for Linear Regression

22S39: Class Notes / November 14, 2000 back to start 1

Lecture 4 Scatterplots, Association, and Correlation

Lecture 4 Scatterplots, Association, and Correlation

9. Linear Regression and Correlation

The SuperBall Lab. Objective. Instructions

Chapter 8. Linear Regression. Copyright 2010 Pearson Education, Inc.

Paper Reference(s) 6683 Edexcel GCE Statistics S1 Advanced/Advanced Subsidiary Thursday 5 June 2003 Morning Time: 1 hour 30 minutes

Chapter 5 Least Squares Regression

Foundations for Functions

Section 3.3. How Can We Predict the Outcome of a Variable? Agresti/Franklin Statistics, 1of 18

Least Squares Regression

Correlation and Regression

STAT 512 MidTerm I (2/21/2013) Spring 2013 INSTRUCTIONS

Keystone Exam Review: Module 2. Linear Functions and Data Organizations

Applied Regression Modeling: A Business Approach Chapter 3: Multiple Linear Regression Sections

Simple Linear Regression

Q1: What is the interpretation of the number 4.1? A: There were 4.1 million visits to ER by people 85 and older, Q2: What percent of people 65-74

Regression Models for Time Trends: A Second Example. INSR 260, Spring 2009 Bob Stine

SYSTEMS OF THREE EQUATIONS

SIGNIFICANT FIGURES BEGIN

Stat 500 Midterm 2 12 November 2009 page 0 of 11

Regression and Models with Multiple Factors. Ch. 17, 18

Basic Statistics. 1. Gross error analyst makes a gross mistake (misread balance or entered wrong value into calculation).

Scatterplots and Correlation

PS2: Two Variable Statistics

ST430 Exam 1 with Answers

Introduction to Determining Power Law Relationships

Data Science for Engineers Department of Computer Science and Engineering Indian Institute of Technology, Madras

Relationships between variables. Visualizing Bivariate Distributions: Scatter Plots

Chapter 12 : Linear Correlation and Linear Regression

Water tank. Fortunately there are a couple of objectors. Why is it straight? Shouldn t it be a curve?

STAT 350 Final (new Material) Review Problems Key Spring 2016

Chapter 10 Regression Analysis

AP STATISTICS Name: Period: Review Unit IV Scatterplots & Regressions

Section Linear Correlation and Regression. Copyright 2013, 2010, 2007, Pearson, Education, Inc.

Interactions. Interactions. Lectures 1 & 2. Linear Relationships. y = a + bx. Slope. Intercept

Introduction to Physics Physics 114 Eyres

Chapter Goals. To understand the methods for displaying and describing relationship among variables. Formulate Theories.

LAB 5 INSTRUCTIONS LINEAR REGRESSION AND CORRELATION

Graphing Equations in Slope-Intercept Form 4.1. Positive Slope Negative Slope 0 slope No Slope

S12 - HS Regression Labs Workshop. Linear. Quadratic (not required) Logarithmic. Exponential. Power

Observations Homework Checkpoint quizzes Chapter assessments (Possibly Projects) Blocks of Algebra

Stat 101: Lecture 6. Summer 2006

Introduction to Simple Linear Regression

MODELING. Simple Linear Regression. Want More Stats??? Crickets and Temperature. Crickets and Temperature 4/16/2015. Linear Model

Talking feet: Scatterplots and lines of best fit

Information Sources. Class webpage (also linked to my.ucdavis page for the class):

Solve Systems of Equations Algebraically

Statistics 1. Edexcel Notes S1. Mathematical Model. A mathematical model is a simplification of a real world problem.

Chapter 7. Scatterplots, Association, and Correlation

Math 8 I CAN Statements

Business Statistics. Lecture 10: Course Review

1. In Activity 1-1, part 3, how do you think graph a will differ from graph b? 3. Draw your graph for Prediction 2-1 below:

a) Do you see a pattern in the scatter plot, or does it look like the data points are

Chapter 6. Exploring Data: Relationships

Name. The data below are airfares to various cities from Baltimore, MD (including the descriptive statistics).

Important note: Transcripts are not substitutes for textbook assignments. 1

Chapter 10. Regression. Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania

PubH 7405: REGRESSION ANALYSIS SLR: DIAGNOSTICS & REMEDIES

1 A Review of Correlation and Regression

Name Period Date Ch. 5 Systems of Linear Equations Review Guide

q3_3 MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Chapter 3: Examining Relationships

Chapter 12 - Part I: Correlation Analysis

UNIVERSITY OF TORONTO Faculty of Arts and Science

appstats27.notebook April 06, 2017

STAT 458 Lab 4 Linear Regression Analysis

Exam Empirical Methods VU University Amsterdam, Faculty of Exact Sciences h, February 12, 2015

Introduction to Measurement Physics 114 Eyres

Prob and Stats, Sep 23

Transcription:

Stat 101 L: Laboratory 5 The first activity revisits the labeling of Fun Size bags of M&Ms by looking distributions of Total Weight of Fun Size bags and regular size bags (which have a label weight) of M&Ms. In activity 2 you will get a feel for fitting a straight line to data by doing visual regression on heights and hand spans. In activity 3 you will use JMP to compute least squares regression lines for heights and hand spans. Activity 1: The JMP output (this is the same output from Lab #3) gives the distribution of the Total Weight of 325 Fun Size bags of plain M&Ms. In Lab #3 we determined that Total Weight could be modeled with a Normal model. Fun Size bags of M&Ms are sold as part of a larger package. The Fair Packaging and Labeling Act says that Fun Size bags cannot be sold individually because they do not carry a label weight of contents. If we wanted to sell individual Fun Size bags of M&Ms we would need to determine an accurate label weight of contents. a) Based on the distribution of Total Weight what should the label weight for an individual Fun Size bag be? Explain your choice briefly. b) Using a Normal model with μ =19.8 g and σ =1.4 g, what proportion of Fun Size bags have a Total Weight greater than the label weight you chose in a)? c) M&Ms are also sold in larger individual packages that carry a label weight of contents. The distribution and summary statistics for the Total Weights of a sample of 100 regular size bags whose label weight is 47.9 g are given on a separate sheet. Comment on the label weight relative to the distribution of Total Weight. d) Using a Normal model with μ =49.7 g and σ =1.2 g, what proportion of regular size bags have a Total Weight greater than the label weight? e) Given what you have discovered in c) & d) come up with a label weight for Fun Size bags that is consistent with the label weight of regular size bags. Explain the reasoning behind your choice of label weight. f) The Fair Packaging and Labeling Act says: Variations from the stated weight or mass, measure, or numerical count shall be permitted when caused by unavoidable deviations in weighing, measuring, or counting the contents of individual packages which occur in good packaging practice: Provided, that such variations shall not be permitted to such extent that the average of the quantities in the packages comprising a shipment or other delivery of the commodity is below the quantity stated, and no unreasonable shortage in any package will be permitted even though overages in other packages in the same shipment or delivery compensate for such shortage. Variations from stated quantity of contents shall not be unreasonably large. Comment on whether or not your label weight in e) satisfies the Fair Packaging and Labeling Act. 1

Activity 2: In Lab #2 you measured heights and hand spans of the students in this class. Does the height of a person tell us something about that person s right hand span? Does right hand span tell us something about left hand span? The focus of this activity is to get a feel for relating one variable to another using visual regression. On the answer sheet you will see scatter plots of right hand span (response, y) versus height (explanatory, x) and left hand span (response, y) versus right hand span (explanatory, x). For each plot do the following: a) Describe the general association between the explanatory variable, x and response variable, y. Be sure to mention form, direction, strength and the presence of outliers. b) Add a visual regression line to the plot. Use the ruler to locate a line by eye that you believe best captures the general trend in the relationship between explanatory and response variables. Be sure to extend your visual regression line so that it intersects the y-axis. c) Give the y-intercept for your visual regression line. d) Give the slope for your visual regression line. e) Give the equation for your visual regression line. Which visual regression line do you think will be closer to the corresponding least squares regression line? Explain your reasoning briefly. Activity 3: Use JMP to calculate the least squares regression lines for right hand span on height and left hand span on right hand span. The data are in the file Height.JMP on the course web page under Laboratory 3. Follow the directions in the Guide to Using JMP in Stat 101 on scatter diagrams and regression coefficients. Fit right hand span, y by height, x and fit left hand span, y by right hand span, x. Turn in the JMP output with your lab. Use the output to answer the following questions about each regression equation. a) Give the estimates of the y-intercept and slope for the least squares regression line. b) Give an interpretation of the value of the slope within the context of the problem. c) Give the least squares regression equation. d) Give the value, and an interpretation, of R 2. e) Describe the plot of residuals versus the explanatory variable. What does this plot tell you about the fit of the least squares regression line? f) Based on the data, do you think a linear regression line is a good approximation for the relationship between the two variables? Explain briefly. 2

Distribution of the Total Weight for 325 Bags of Fun Size M&Ms.99.95.90.75.50.25.10.05.01 3 2 1 0-1 -2-3 Normal Quantile Plot 75 50 25 Count 15 16 17 18 19 20 21 22 23 24 25 Total Weight (g) Quantiles Moments 100.0% maximum 23.0 Mean 19.84 75.0% quartile 21.0 Std Dev 1.4290634 50.0% median 20.0 N 325 25.0% quartile 19.0 0.0% minimum 16.0 3

Distribution of the Total Weight for 100 Regular Size bags of M&Ms.99.95.90.75.50.25.10.05.01 3 2 1 0-1 -2-3 Normal Quantile Plot 30 20 Count 10 45 46 47 48 49 50 51 52 53 54 55 Total Weight (g) Quantiles Moments 100.0% maximum 53.0 Mean 49.68 75.0% quartile 50.0 Std Dev 1.2299298 50.0% median 50.0 N 100 25.0% quartile 49.0 0.0% minimum 46.0 4

Stat 101 L: Laboratory 5 Answer Sheet Names: Activity 1: a) Based on the distribution of Total Weight what should the label weight for an individual Fun Size bag be? Explain your choice briefly. b) Using a Normal model with μ =19.8 g and σ =1.4 g, what proportion of Fun Size bags have a Total Weight greater than the label weight you chose in a)? c) M&Ms are also sold in larger individual packages that carry a label weight of contents. The distribution and summary statistics for the Total Weights of a sample of 100 regular size bags whose label weight is 47.9 g are given on a separate sheet. Comment on the label weight relative to the distribution of Total Weight. d) Using a Normal model with μ =49.7 g and σ =1.2 g, what proportion of regular size bags have a Total Weight greater than the label weight? 5

e) Given what you have discovered in c) & d) come up with a label weight for Fun Size bags that is consistent with the label weight of regular size bags. Explain the reasoning behind your choice of label weight. f) Comment on whether or not your label weight in e) satisfies the Fair Packaging and Labeling Act. Activity 2: Right Hand Span and Height a) Describe the general association between the explanatory variable, height and response variable, right hand span. Be sure to mention form, direction, strength and the presence of outliers. b) Add a visual regression line to the plot. Use the ruler to locate a line by eye that you believe best captures the general trend in the relationship between explanatory and response variables. Be sure to extend your visual regression line so that it intersects the y-axis. c) Give the y-intercept for your visual regression line. d) Give the slope for your visual regression line. e) Give the equation for your visual regression line. 6

Right Hand Span (cm) By Height (cm) 30 25 20 Right Hand Span (cm) 15 10 5 0 0 50 100 150 200 Height (cm) 7

Left Hand Span (cm) By Right Hand Span (cm) 30 25 20 Left Hand Span (cm) 15 10 5 0 0 5 10 15 20 25 30 Right Hand Span (cm) 8

Activity 2: Left Hand Span and Right Hand Span a) Describe the general association between the explanatory variable, right hand span and response variable, left hand span. Be sure to mention form, direction, strength and the presence of outliers. b) Add a visual regression line to the plot. Use the ruler to locate a line by eye that you believe best captures the general trend in the relationship between explanatory and response variables. Be sure to extend your visual regression line so that it intersects the y-axis. c) Give the y-intercept for your visual regression line. d) Give the slope for your visual regression line. e) Give the equation for your visual regression line. Activity 3: Right Hand Span and Height a) Give the estimates of the y-intercept and slope for the least squares regression line. b) Give an interpretation of the value of the slope within the context of the problem. c) Give the least squares regression equation. d) Give the value, and an interpretation, of R 2. 9

e) Describe the plot of residuals versus the explanatory variable. What does this plot tell you about the fit of the least squares regression line? f) Based on the data, do you think a linear regression line is a good approximation for the relationship between the two variables? Explain briefly. Left Hand Span and Right Hand Span a) Give the estimates of the y-intercept and slope for the least squares regression line. b) Give an interpretation of the value of the slope within the context of the problem. c) Give the least squares regression equation. d) Give the value, and an interpretation, of R 2. e) Describe the plot of residuals versus the explanatory variable. What does this plot tell you about the fit of the least squares regression line? f) Based on the data, do you think a linear regression line is a good approximation for the relationship between the two variables? Explain briefly. 10