Answers to Homework 4

Size: px
Start display at page:

Download "Answers to Homework 4"

Transcription

1 Answers to Homework 4 1. (a) Let p be the true probability of an auto having automatic transmission. The observed proportion of autos is.847, and a 95% confidence interval for p has the form p ± z.025 p(1 p)/n. Substituting in the observed values gives.847 ± (1.96) (.847)(.153)/1203 =.847 ±.02. The exact interval, which we can get from Minitab, is ( , ), which is unsurprisingly very similar to the standard interval. (b) We can actually just get this from Minitab: One-Sample T: City mileage Variable N Mean StDev SE Mean 90% CI City mileage (20.307, ) Of course, this is based on the usual formula X ± t.05 n 1 s/ n. This uses the exact t-critical value of (based on 1202 degrees of freedom), but if you used (based on df = ) you will get virtually the same answer. (c) We now want a prediction interval, X ± t s A prediction interval for n mileage is thus ±( )(5.882) miles per gallon = ±9.687, or (10.9, 30.3) (d) We are assuming that these data can be viewed as a random sample from a normal distribution. We already discussed in Homework 1 why these are not a random sample, but have also said that we will ignore that here. What about normality? This is apparently problematic here, as a histogram of city mileages shows a noticeable right tail, with some very high mileages coming from hybrid cars: c 2017, Jeffrey S. Simonoff 1

2 This probably has not had a strong effect on the confidence interval, as with a sample this large from a not-greatly-nonnormal distribution the Central Limit Theorem has likely taken over, but this has likely affected the prediction interval. The obvious thing to do to try to fix the prediction interval is to take logs, construct the interval in the logged scale, and then exponentiate the interval endpoints to get back to the original scale. This will probably work better, since logged mileages are certainly closer to normally distributed (although still somewhat right-tailed): This output gives us the sample mean and standard deviation of the logged variable: c 2017, Jeffrey S. Simonoff 2

3 Descriptive Statistics: Logged city mileage Variable N Mean SE Mean StDev Minimum Q1 Logged city mileage Variable Median Q3 Maximum Logged city mileage The prediction interval in the logged scale is ±( )(.1132) 1 + 1/1203 = ± = (1.1118, ). Antilogging the two endpoints gives the prediction interval (12.9, 30.5) miles per gallon, being shifted up at the low end in particular (with an estimate of the typical mileage being the geometric mean = 19.9 miles per gallon). 2. (a) Let p be the probability that a yogurt has excellent taste. The number of such yogurts is 7 (of 23), so the observed proportion is.304, and a 99% confidence interval for p has the form p ± z.005 p(1 p)/n. Substituting in the observed values gives.304 ± (2.576) (.304)(.696)/23 =.304 ±.247 = (0.057, 0.551). The exact interval, which we can get from Minitab, is ( , ), which is noticeably different from the approximate interval, since the sample is small and the observed proportion is not very close to.5. (b) After using the calculator to create the price per serving variable, the standard confidence interval is available from Minitab: One-Sample T: Price per serving Variable N Mean StDev SE Mean 95% CI Price per serving (1.227, 1.649) (c) The prediction interval is X ± t s 1 + 1, or n 1.44 ± (2.074)(0.487)(1.022) = 1.44 ± 1.03 = ($0.31, $2.47). (d) This seems to be a suspect prediction interval, as the lower end is $0.50 less than the minimum value in the sample. The reason for this is clear from a normal plot of the variable (a histogram would work too): c 2017, Jeffrey S. Simonoff 3

4 The variable is reasonably normally distributed, except for one clear outlier, the extremely expensive Maple Hill Creamery Strawberry at $3.00 per serving. This could have inflated both the mean and standard deviation, so it should be omitted to see what effect it might have had. We should also look more carefully into this observation to see if we can understand why it is so unusual. If you go to the Maple Hill Creamery web site, you can see that their major selling point is as follows: Since 2009, we ve been crafting uniquely delicious 100% grass-fed organic dairy products, all while building a 100% grass-fed organic Milkshed in New York State. It makes sense that it would be more expensive to make diary products when cows are not fed corn or grain, and presumably this increased cost is passed along to consumers. It is also possible that the grass-fed organic system is a selling point that allows them to charge a premium price, although we can note that according to Consumer Reports its taste is relatively substandard, being one of only three of the yogurts that has a taste rating of only Good. Here is output after removing this point: One-Sample T: Price per serving Variable N Mean StDev SE Mean 95% CI Price per serving (1.2088, ) The confidence interval is about 25% narrower, reflecting the large drop in the standard deviation. The prediction interval is 1.37 ± (2.08)(0.357)(1.024) = 1.37 ± 0.76 = ($0.61, $2.13). It still appears to be too long at the low end, which could reflect a slightly short tail on the left side. c 2017, Jeffrey S. Simonoff 4

5 3. (a) We are looking for a prediction interval, which takes the form X ± t s 1 + 1, n or 15.1 ± (1.982)(15.4)(1.004) = 15.1 ± 30.6 = ( 15.5, 45.7). The interval goes into impossible negative values. The variable is, in fact, distinctly long right-tailed, with more than half of the floods having duration of 10 days or less and more than three-quarters having duration less than three weeks, but some lasting as long as months. You didn t have the data, of course, so you can t do this, but I can. Here is a histogram of the duration variable: The obvious solution is to take logs, construct a prediction interval in the logged scale, and then antilog. The logged variable is quite reasonably normally-distributed, so this should result in a reasonable interval: c 2017, Jeffrey S. Simonoff 5

6 Descriptive Statistics: Logged days Variable N Mean StDev Minimum Q1 Logged days Variable Median Q3 Maximum Logged days The interval is ± (1.982)(.3816)(1.004) = ±.759 = (.245, 1.763), which when antilogged gives the interval (1.8, 57.9) days. with a geometric mean of 10 days, clearly representing the shorter left tail and much longer right tail. (b) Let p be the true probability that there is at least one fatality from a flood. The observed proportion of movies is 78 =.696, and a 95% confidence interval for p 112 has the form p ± z.025 p(1 p)/n. Substituting in the observed values gives.696 ± (1.96) (.696)(.304)/112 =.696 ±.085 = (.611,.781). The exact interval, which we can get from Minitab, is ( , ). We are assuming for the approximate interval that the sample size is large enough to appeal to the normal approximation to the binomial, which seems to be okay (given the similarity in the exact and approximate intervals). More seriously, we are assuming that the probability of at least one fatality is the same for all floods, and whether or not a fatality occurs in any flood is independent of that in any other flood. The first assumption is clearly not true; as we discussed in class, there are big differences in character between floods from hurricanes and floods from snow melt (for example), and the chances of a fatality in one type is very different from that another. Further, as the 2017 hurricane season showed, there are local (in time) changes in weather patterns, and knowledge of the severity of one flood could tell us something about the severity of another flood, which would violate independence. c 2017, Jeffrey S. Simonoff 6

Is economic freedom related to economic growth?

Is economic freedom related to economic growth? Is economic freedom related to economic growth? It is an article of faith among supporters of capitalism: economic freedom leads to economic growth. The publication Economic Freedom of the World: 2003

More information

Describing Center: Mean and Median Section 5.4

Describing Center: Mean and Median Section 5.4 Describing Center: Mean and Median Section 5.4 Look at table 5.2 at the right. We are going to make the dotplot of the city gas mileages of midsize cars. How to describe the center of a distribution: x

More information

Chapter 16. Simple Linear Regression and Correlation

Chapter 16. Simple Linear Regression and Correlation Chapter 16 Simple Linear Regression and Correlation 16.1 Regression Analysis Our problem objective is to analyze the relationship between interval variables; regression analysis is the first tool we will

More information

, (1) e i = ˆσ 1 h ii. c 2016, Jeffrey S. Simonoff 1

, (1) e i = ˆσ 1 h ii. c 2016, Jeffrey S. Simonoff 1 Regression diagnostics As is true of all statistical methodologies, linear regression analysis can be a very effective way to model data, as along as the assumptions being made are true. For the regression

More information

Keller: Stats for Mgmt & Econ, 7th Ed July 17, 2006

Keller: Stats for Mgmt & Econ, 7th Ed July 17, 2006 Chapter 17 Simple Linear Regression and Correlation 17.1 Regression Analysis Our problem objective is to analyze the relationship between interval variables; regression analysis is the first tool we will

More information

Chapter 16. Simple Linear Regression and dcorrelation

Chapter 16. Simple Linear Regression and dcorrelation Chapter 16 Simple Linear Regression and dcorrelation 16.1 Regression Analysis Our problem objective is to analyze the relationship between interval variables; regression analysis is the first tool we will

More information

Descriptive Statistics: cal. Is it reasonable to use a t test to test hypotheses about the mean? Hypotheses: Test Statistic: P value:

Descriptive Statistics: cal. Is it reasonable to use a t test to test hypotheses about the mean? Hypotheses: Test Statistic: P value: 1. Many consumers pay careful attention to stated nutritional contents on packaged foods when making purchases. It is therefore important that the information be accurate. A random sample of n = 12 frozen

More information

Sampling distribution of t. 2. Sampling distribution of t. 3. Example: Gas mileage investigation. II. Inferential Statistics (8) t =

Sampling distribution of t. 2. Sampling distribution of t. 3. Example: Gas mileage investigation. II. Inferential Statistics (8) t = 2. The distribution of t values that would be obtained if a value of t were calculated for each sample mean for all possible random of a given size from a population _ t ratio: (X - µ hyp ) t s x The result

More information

Chapters 4-6: Estimation

Chapters 4-6: Estimation Chapters 4-6: Estimation Read sections 4. (except 4..3), 4.5.1, 5.1 (except 5.1.5), 6.1 (except 6.1.3) Point Estimation (4.1.1) Point Estimator - A formula applied to a data set which results in a single

More information

3 GRAPHICAL DISPLAYS OF DATA

3 GRAPHICAL DISPLAYS OF DATA some without indicating nonnormality. If a sample of 30 observations contains 4 outliers, two of which are extreme, would it be reasonable to assume the population from which the data were collected has

More information

Confidence intervals

Confidence intervals Confidence intervals We now want to take what we ve learned about sampling distributions and standard errors and construct confidence intervals. What are confidence intervals? Simply an interval for which

More information

Describing distributions with numbers

Describing distributions with numbers Describing distributions with numbers A large number or numerical methods are available for describing quantitative data sets. Most of these methods measure one of two data characteristics: The central

More information

Note that we are looking at the true mean, μ, not y. The problem for us is that we need to find the endpoints of our interval (a, b).

Note that we are looking at the true mean, μ, not y. The problem for us is that we need to find the endpoints of our interval (a, b). Confidence Intervals 1) What are confidence intervals? Simply, an interval for which we have a certain confidence. For example, we are 90% certain that an interval contains the true value of something

More information

ANOVA: Analysis of Variation

ANOVA: Analysis of Variation ANOVA: Analysis of Variation The basic ANOVA situation Two variables: 1 Categorical, 1 Quantitative Main Question: Do the (means of) the quantitative variables depend on which group (given by categorical

More information

Chapter 4.notebook. August 30, 2017

Chapter 4.notebook. August 30, 2017 Sep 1 7:53 AM Sep 1 8:21 AM Sep 1 8:21 AM 1 Sep 1 8:23 AM Sep 1 8:23 AM Sep 1 8:23 AM SOCS When describing a distribution, make sure to always tell about three things: shape, outliers, center, and spread

More information

Chapter 7: Hypothesis Testing - Solutions

Chapter 7: Hypothesis Testing - Solutions Chapter 7: Hypothesis Testing - Solutions 7.1 Introduction to Hypothesis Testing The problem with applying the techniques learned in Chapter 5 is that typically, the population mean (µ) and standard deviation

More information

Electricity Case: Statistical Analysis of Electric Power Outages

Electricity Case: Statistical Analysis of Electric Power Outages CREATE Research Archive Published Articles & Papers 2005 Electricity Case: Statistical Analysis of Electric Power Outages Jeffrey S. Simonoff New York University Carlos E. Restrepo New York University,

More information

Ch 13 & 14 - Regression Analysis

Ch 13 & 14 - Regression Analysis Ch 3 & 4 - Regression Analysis Simple Regression Model I. Multiple Choice:. A simple regression is a regression model that contains a. only one independent variable b. only one dependent variable c. more

More information

Chapter 6. Exploring Data: Relationships. Solutions. Exercises:

Chapter 6. Exploring Data: Relationships. Solutions. Exercises: Chapter 6 Exploring Data: Relationships Solutions Exercises: 1. (a) It is more reasonable to explore study time as an explanatory variable and the exam grade as the response variable. (b) It is more reasonable

More information

Math 140 Introductory Statistics

Math 140 Introductory Statistics Math 140 Introductory Statistics Professor Silvia Fernández Chapter 2 Based on the book Statistics in Action by A. Watkins, R. Scheaffer, and G. Cobb. Visualizing Distributions Recall the definition: The

More information

Math 140 Introductory Statistics

Math 140 Introductory Statistics Visualizing Distributions Math 140 Introductory Statistics Professor Silvia Fernández Chapter Based on the book Statistics in Action by A. Watkins, R. Scheaffer, and G. Cobb. Recall the definition: The

More information

Sections 6.1 and 6.2: The Normal Distribution and its Applications

Sections 6.1 and 6.2: The Normal Distribution and its Applications Sections 6.1 and 6.2: The Normal Distribution and its Applications Definition: A normal distribution is a continuous, symmetric, bell-shaped distribution of a variable. The equation for the normal distribution

More information

Hypothesis testing I. - In particular, we are talking about statistical hypotheses. [get everyone s finger length!] n =

Hypothesis testing I. - In particular, we are talking about statistical hypotheses. [get everyone s finger length!] n = Hypothesis testing I I. What is hypothesis testing? [Note we re temporarily bouncing around in the book a lot! Things will settle down again in a week or so] - Exactly what it says. We develop a hypothesis,

More information

STP 420 INTRODUCTION TO APPLIED STATISTICS NOTES

STP 420 INTRODUCTION TO APPLIED STATISTICS NOTES INTRODUCTION TO APPLIED STATISTICS NOTES PART - DATA CHAPTER LOOKING AT DATA - DISTRIBUTIONS Individuals objects described by a set of data (people, animals, things) - all the data for one individual make

More information

Physics 509: Bootstrap and Robust Parameter Estimation

Physics 509: Bootstrap and Robust Parameter Estimation Physics 509: Bootstrap and Robust Parameter Estimation Scott Oser Lecture #20 Physics 509 1 Nonparametric parameter estimation Question: what error estimate should you assign to the slope and intercept

More information

The simple linear regression model discussed in Chapter 13 was written as

The simple linear regression model discussed in Chapter 13 was written as 1519T_c14 03/27/2006 07:28 AM Page 614 Chapter Jose Luis Pelaez Inc/Blend Images/Getty Images, Inc./Getty Images, Inc. 14 Multiple Regression 14.1 Multiple Regression Analysis 14.2 Assumptions of the Multiple

More information

SMAM 314 Exam 3 Name. 1. Mark the following statements true or false. (6 points 2 each)

SMAM 314 Exam 3 Name. 1. Mark the following statements true or false. (6 points 2 each) SMAM 314 Exam 3 Name 1. Mark the following statements true or false. (6 points 2 each) F A. A t test on independent samples is appropriate when the results of an algebra test are being compared for the

More information

IF YOU HAVE DATA VALUES:

IF YOU HAVE DATA VALUES: Unit 02 Review Ways to obtain a line of best fit IF YOU HAVE DATA VALUES: 1. In your calculator, choose STAT > 1.EDIT and enter your x values into L1 and your y values into L2 2. Choose STAT > CALC > 8.

More information

CS 147: Computer Systems Performance Analysis

CS 147: Computer Systems Performance Analysis CS 147: Computer Systems Performance Analysis Advanced Regression Techniques CS 147: Computer Systems Performance Analysis Advanced Regression Techniques 1 / 31 Overview Overview Overview Common Transformations

More information

23. MORE HYPOTHESIS TESTING

23. MORE HYPOTHESIS TESTING 23. MORE HYPOTHESIS TESTING The Logic Behind Hypothesis Testing For simplicity, consider testing H 0 : µ = µ 0 against the two-sided alternative H A : µ µ 0. Even if H 0 is true (so that the expectation

More information

Inference for the mean of a population. Testing hypotheses about a single mean (the one sample t-test). The sign test for matched pairs

Inference for the mean of a population. Testing hypotheses about a single mean (the one sample t-test). The sign test for matched pairs Stat 528 (Autumn 2008) Inference for the mean of a population (One sample t procedures) Reading: Section 7.1. Inference for the mean of a population. The t distribution for a normal population. Small sample

More information

Performance of fourth-grade students on an agility test

Performance of fourth-grade students on an agility test Starter Ch. 5 2005 #1a CW Ch. 4: Regression L1 L2 87 88 84 86 83 73 81 67 78 83 65 80 50 78 78? 93? 86? Create a scatterplot Find the equation of the regression line Predict the scores Chapter 5: Understanding

More information

χ L = χ R =

χ L = χ R = Chapter 7 3C: Examples of Constructing a Confidence Interval for the true value of the Population Standard Deviation σ for a Normal Population. Example 1 Construct a 95% confidence interval for the true

More information

Chapter 10 Regression Analysis

Chapter 10 Regression Analysis Chapter 10 Regression Analysis Goal: To become familiar with how to use Excel 2007/2010 for Correlation and Regression. Instructions: You will be using CORREL, FORECAST and Regression. CORREL and FORECAST

More information

The response variable depends on the explanatory variable.

The response variable depends on the explanatory variable. A response variable measures an outcome of study. > dependent variables An explanatory variable attempts to explain the observed outcomes. > independent variables The response variable depends on the explanatory

More information

Business Statistics. Lecture 5: Confidence Intervals

Business Statistics. Lecture 5: Confidence Intervals Business Statistics Lecture 5: Confidence Intervals Goals for this Lecture Confidence intervals The t distribution 2 Welcome to Interval Estimation! Moments Mean 815.0340 Std Dev 0.8923 Std Error Mean

More information

Chapter 7. Inference for Distributions. Introduction to the Practice of STATISTICS SEVENTH. Moore / McCabe / Craig. Lecture Presentation Slides

Chapter 7. Inference for Distributions. Introduction to the Practice of STATISTICS SEVENTH. Moore / McCabe / Craig. Lecture Presentation Slides Chapter 7 Inference for Distributions Introduction to the Practice of STATISTICS SEVENTH EDITION Moore / McCabe / Craig Lecture Presentation Slides Chapter 7 Inference for Distributions 7.1 Inference for

More information

One-Sample and Two-Sample Means Tests

One-Sample and Two-Sample Means Tests One-Sample and Two-Sample Means Tests 1 Sample t Test The 1 sample t test allows us to determine whether the mean of a sample data set is different than a known value. Used when the population variance

More information

CHAPTER 1. Introduction

CHAPTER 1. Introduction CHAPTER 1 Introduction Engineers and scientists are constantly exposed to collections of facts, or data. The discipline of statistics provides methods for organizing and summarizing data, and for drawing

More information

determine whether or not this relationship is.

determine whether or not this relationship is. Section 9-1 Correlation A correlation is a between two. The data can be represented by ordered pairs (x,y) where x is the (or ) variable and y is the (or ) variable. There are several types of correlations

More information

download instant at

download instant at Answers to Odd-Numbered Exercises Chapter One: An Overview of Regression Analysis 1-3. (a) Positive, (b) negative, (c) positive, (d) negative, (e) ambiguous, (f) negative. 1-5. (a) The coefficients in

More information

Intro to Stats Lecture 11

Intro to Stats Lecture 11 Outliers and influential points Intro to Stats Lecture 11 Collect data this week! Midterm is coming! Terms X outliers: observations outlying the overall pattern of the X- variable Y outliers: observations

More information

Chapter 14 Multiple Regression Analysis

Chapter 14 Multiple Regression Analysis Chapter 14 Multiple Regression Analysis 1. a. Multiple regression equation b. the Y-intercept c. $374,748 found by Y ˆ = 64,1 +.394(796,) + 9.6(694) 11,6(6.) (LO 1) 2. a. Multiple regression equation b.

More information

MA 1125 Lecture 15 - The Standard Normal Distribution. Friday, October 6, Objectives: Introduce the standard normal distribution and table.

MA 1125 Lecture 15 - The Standard Normal Distribution. Friday, October 6, Objectives: Introduce the standard normal distribution and table. MA 1125 Lecture 15 - The Standard Normal Distribution Friday, October 6, 2017. Objectives: Introduce the standard normal distribution and table. 1. The Standard Normal Distribution We ve been looking at

More information

Lecture 26: Chapter 10, Section 2 Inference for Quantitative Variable Confidence Interval with t

Lecture 26: Chapter 10, Section 2 Inference for Quantitative Variable Confidence Interval with t Lecture 26: Chapter 10, Section 2 Inference for Quantitative Variable Confidence Interval with t t Confidence Interval for Population Mean Comparing z and t Confidence Intervals When neither z nor t Applies

More information

Chapter 12: Inference about One Population

Chapter 12: Inference about One Population Chapter 1: Inference about One Population 1.1 Introduction In this chapter, we presented the statistical inference methods used when the problem objective is to describe a single population. Sections 1.

More information

M 225 Test 1 B Name SHOW YOUR WORK FOR FULL CREDIT! Problem Max. Points Your Points Total 75

M 225 Test 1 B Name SHOW YOUR WORK FOR FULL CREDIT! Problem Max. Points Your Points Total 75 M 225 Test 1 B Name SHOW YOUR WORK FOR FULL CREDIT! Problem Max. Points Your Points 1-13 13 14 3 15 8 16 4 17 10 18 9 19 7 20 3 21 16 22 2 Total 75 1 Multiple choice questions (1 point each) 1. Look at

More information

Estimating σ 2. We can do simple prediction of Y and estimation of the mean of Y at any value of X.

Estimating σ 2. We can do simple prediction of Y and estimation of the mean of Y at any value of X. Estimating σ 2 We can do simple prediction of Y and estimation of the mean of Y at any value of X. To perform inferences about our regression line, we must estimate σ 2, the variance of the error term.

More information

Diploma Part 2. Quantitative Methods. Examiners Suggested Answers

Diploma Part 2. Quantitative Methods. Examiners Suggested Answers Diploma Part 2 Quantitative Methods Examiners Suggested Answers Q1 (a) A frequency distribution is a table or graph (i.e. a histogram) that shows the total number of measurements that fall in each of a

More information

Stat 2300 International, Fall 2006 Sample Midterm. Friday, October 20, Your Name: A Number:

Stat 2300 International, Fall 2006 Sample Midterm. Friday, October 20, Your Name: A Number: Stat 2300 International, Fall 2006 Sample Midterm Friday, October 20, 2006 Your Name: A Number: The Midterm consists of 35 questions: 20 multiple-choice questions (with exactly 1 correct answer) and 15

More information

Margin of Error. What is margin of error and why does it exist?

Margin of Error. What is margin of error and why does it exist? Trig Honors Margin of Error Name: What is margin of error and why does it exist? Thanks to a network of GEO stationary and polar orbiting satellites providing better data, faster supercomputers, and improved

More information

Review of the Normal Distribution

Review of the Normal Distribution Sampling and s Normal Distribution Aims of Sampling Basic Principles of Probability Types of Random Samples s of the Mean Standard Error of the Mean The Central Limit Theorem Review of the Normal Distribution

More information

Unit 10: Simple Linear Regression and Correlation

Unit 10: Simple Linear Regression and Correlation Unit 10: Simple Linear Regression and Correlation Statistics 571: Statistical Methods Ramón V. León 6/28/2004 Unit 10 - Stat 571 - Ramón V. León 1 Introductory Remarks Regression analysis is a method for

More information

Announcements. Lecture 1 - Data and Data Summaries. Data. Numerical Data. all variables. continuous discrete. Homework 1 - Out 1/15, due 1/22

Announcements. Lecture 1 - Data and Data Summaries. Data. Numerical Data. all variables. continuous discrete. Homework 1 - Out 1/15, due 1/22 Announcements Announcements Lecture 1 - Data and Data Summaries Statistics 102 Colin Rundel January 13, 2013 Homework 1 - Out 1/15, due 1/22 Lab 1 - Tomorrow RStudio accounts created this evening Try logging

More information

Inference for Regression Inference about the Regression Model and Using the Regression Line

Inference for Regression Inference about the Regression Model and Using the Regression Line Inference for Regression Inference about the Regression Model and Using the Regression Line PBS Chapter 10.1 and 10.2 2009 W.H. Freeman and Company Objectives (PBS Chapter 10.1 and 10.2) Inference about

More information

SMAM 314 Practice Final Examination Winter 2003

SMAM 314 Practice Final Examination Winter 2003 SMAM 314 Practice Final Examination Winter 2003 You may use your textbook, one page of notes and a calculator. Please hand in the notes with your exam. 1. Mark the following statements True T or False

More information

Applied Regression Analysis. Section 2: Multiple Linear Regression

Applied Regression Analysis. Section 2: Multiple Linear Regression Applied Regression Analysis Section 2: Multiple Linear Regression 1 The Multiple Regression Model Many problems involve more than one independent variable or factor which affects the dependent or response

More information

Regression Models. Chapter 4. Introduction. Introduction. Introduction

Regression Models. Chapter 4. Introduction. Introduction. Introduction Chapter 4 Regression Models Quantitative Analysis for Management, Tenth Edition, by Render, Stair, and Hanna 008 Prentice-Hall, Inc. Introduction Regression analysis is a very valuable tool for a manager

More information

Confidence Interval for the mean response

Confidence Interval for the mean response Week 3: Prediction and Confidence Intervals at specified x. Testing lack of fit with replicates at some x's. Inference for the correlation. Introduction to regression with several explanatory variables.

More information

Confidence Intervals. Confidence interval for sample mean. Confidence interval for sample mean. Confidence interval for sample mean

Confidence Intervals. Confidence interval for sample mean. Confidence interval for sample mean. Confidence interval for sample mean Confidence Intervals Confidence interval for sample mean The CLT tells us: as the sample size n increases, the sample mean is approximately Normal with mean and standard deviation Thus, we have a standard

More information

Continuous distributions

Continuous distributions Continuous distributions In contrast to discrete random variables, like the Binomial distribution, in many situations the possible values of a random variable cannot be counted. For example, the measurement

More information

Chapter 8: Sampling Variability and Sampling Distributions

Chapter 8: Sampling Variability and Sampling Distributions Chapter 8: Sampling Variability and Sampling Distributions These notes reflect material from our text, Statistics, Learning from Data, First Edition, by Roxy Peck, published by CENGAGE Learning, 2015.

More information

Homework 2. For the homework, be sure to give full explanations where required and to turn in any relevant plots.

Homework 2. For the homework, be sure to give full explanations where required and to turn in any relevant plots. Homework 2 1 Data analysis problems For the homework, be sure to give full explanations where required and to turn in any relevant plots. 1. The file berkeley.dat contains average yearly temperatures for

More information

Chapter 1 Skills Points and Linear Equations

Chapter 1 Skills Points and Linear Equations Example 1. Solve We have Chapter 1 Skills Points and Linear Equations t = 3 t t = 3 t for t. = ( t)( t) = ( t) = 3( t) = 4 4t = 6 3t = = t t = 3 ( t)( t) t Example. Solve We have = A + Bt A Bt for t. =

More information

Note that we are looking at the true mean, μ, not y. The problem for us is that we need to find the endpoints of our interval (a, b).

Note that we are looking at the true mean, μ, not y. The problem for us is that we need to find the endpoints of our interval (a, b). Confidence Intervals 1) What are confidence intervals? Simply, an interval for which we have a certain confidence. For example, we are 90% certain that an interval contains the true value of something

More information

Business Statistics. Lecture 10: Course Review

Business Statistics. Lecture 10: Course Review Business Statistics Lecture 10: Course Review 1 Descriptive Statistics for Continuous Data Numerical Summaries Location: mean, median Spread or variability: variance, standard deviation, range, percentiles,

More information

Business Statistics Midterm Exam Fall 2015 Russell. Please sign here to acknowledge

Business Statistics Midterm Exam Fall 2015 Russell. Please sign here to acknowledge Business Statistics Midterm Exam Fall 5 Russell Name Do not turn over this page until you are told to do so. You will have hour and 3 minutes to complete the exam. There are a total of points divided into

More information

Section 9 2B:!! Using Confidence Intervals to Estimate the Difference ( µ 1 µ 2 ) in Two Population Means using Two Independent Samples.

Section 9 2B:!! Using Confidence Intervals to Estimate the Difference ( µ 1 µ 2 ) in Two Population Means using Two Independent Samples. Section 9 2B:!! Using Confidence Intervals to Estimate the Difference ( µ 1 µ 2 ) in Two Population Means using Two Independent Samples Requirements 1.A random sample of each population is taken. The sample

More information

Estimating a Population Mean

Estimating a Population Mean Estimating a Population Mean MATH 130, Elements of Statistics I J. Robert Buchanan Department of Mathematics Fall 2017 Objectives At the end of this lesson we will be able to: obtain a point estimate for

More information

The Flight of the Space Shuttle Challenger

The Flight of the Space Shuttle Challenger The Flight of the Space Shuttle Challenger On January 28, 1986, the space shuttle Challenger took off on the 25 th flight in NASA s space shuttle program. Less than 2 minutes into the flight, the spacecraft

More information

22S39: Class Notes / November 14, 2000 back to start 1

22S39: Class Notes / November 14, 2000 back to start 1 Model diagnostics Interpretation of fitted regression model 22S39: Class Notes / November 14, 2000 back to start 1 Model diagnostics 22S39: Class Notes / November 14, 2000 back to start 2 Model diagnostics

More information

Interactions. Interactions. Lectures 1 & 2. Linear Relationships. y = a + bx. Slope. Intercept

Interactions. Interactions. Lectures 1 & 2. Linear Relationships. y = a + bx. Slope. Intercept Interactions Lectures 1 & Regression Sometimes two variables appear related: > smoking and lung cancers > height and weight > years of education and income > engine size and gas mileage > GMAT scores and

More information

Lesson 2: Exploring Quadratic Relations Quad Regression Unit 5 Quadratic Relations

Lesson 2: Exploring Quadratic Relations Quad Regression Unit 5 Quadratic Relations (A) Lesson Context BIG PICTURE of this UNIT: CONTEXT of this LESSON: How do we analyze and then work with a data set that shows both increase and decrease What is a parabola and what key features do they

More information

Midterm 2 - Solutions

Midterm 2 - Solutions Ecn 102 - Analysis of Economic Data University of California - Davis February 24, 2010 Instructor: John Parman Midterm 2 - Solutions You have until 10:20am to complete this exam. Please remember to put

More information

Lecture 6: Chapter 4, Section 2 Quantitative Variables (Displays, Begin Summaries)

Lecture 6: Chapter 4, Section 2 Quantitative Variables (Displays, Begin Summaries) Lecture 6: Chapter 4, Section 2 Quantitative Variables (Displays, Begin Summaries) Summarize with Shape, Center, Spread Displays: Stemplots, Histograms Five Number Summary, Outliers, Boxplots Cengage Learning

More information

Chapter 1: Climate and the Atmosphere

Chapter 1: Climate and the Atmosphere Chapter 1: Climate and the Atmosphere ECC: 1.2.1 WARM-UP Students complete and discuss their responses to prompts in an Anticipation Guide. (10 min) Anticipation Guide. The expectation is that you will

More information

INFERENCE FOR REGRESSION

INFERENCE FOR REGRESSION CHAPTER 3 INFERENCE FOR REGRESSION OVERVIEW In Chapter 5 of the textbook, we first encountered regression. The assumptions that describe the regression model we use in this chapter are the following. We

More information

Five people were asked approximately how many hours of TV they watched per week. Their responses were as follows.

Five people were asked approximately how many hours of TV they watched per week. Their responses were as follows. Exit icket Sample Solutions Five people were asked approximately how many hours of V they watched per week. heir responses were as follows. 1. Find the mean number of hours of V watched for these five

More information

Dealing with the assumption of independence between samples - introducing the paired design.

Dealing with the assumption of independence between samples - introducing the paired design. Dealing with the assumption of independence between samples - introducing the paired design. a) Suppose you deliberately collect one sample and measure something. Then you collect another sample in such

More information

CRP 272 Introduction To Regression Analysis

CRP 272 Introduction To Regression Analysis CRP 272 Introduction To Regression Analysis 30 Relationships Among Two Variables: Interpretations One variable is used to explain another variable X Variable Independent Variable Explaining Variable Exogenous

More information

5.1 Extreme Values of Functions

5.1 Extreme Values of Functions 5.1 Extreme Values of Functions Lesson Objective To be able to find maximum and minimum values (extrema) of functions. To understand the definition of extrema on an interval. This is called optimization

More information

Analyzing Lines of Fit

Analyzing Lines of Fit 4.5 Analyzing Lines of Fit Essential Question How can you analytically find a line of best fit for a scatter plot? Finding a Line of Best Fit Work with a partner. The scatter plot shows the median ages

More information

Multiple Regression Theory 2006 Samuel L. Baker

Multiple Regression Theory 2006 Samuel L. Baker MULTIPLE REGRESSION THEORY 1 Multiple Regression Theory 2006 Samuel L. Baker Multiple regression is regression with two or more independent variables on the right-hand side of the equation. Use multiple

More information

Modeling Data with Functions

Modeling Data with Functions Chapter 11 Modeling Data with Functions 11.1 Data Modeling Concepts 1 11.1.1 Conceptual Explanations: Modeling Data with Functions In school, you generally start with a function and work from there to

More information

AP Statistics Cumulative AP Exam Study Guide

AP Statistics Cumulative AP Exam Study Guide AP Statistics Cumulative AP Eam Study Guide Chapters & 3 - Graphs Statistics the science of collecting, analyzing, and drawing conclusions from data. Descriptive methods of organizing and summarizing statistics

More information

Statistical Intervals (One sample) (Chs )

Statistical Intervals (One sample) (Chs ) 7 Statistical Intervals (One sample) (Chs 8.1-8.3) Confidence Intervals The CLT tells us that as the sample size n increases, the sample mean X is close to normally distributed with expected value µ and

More information

The chart below shows the fraction and decimal forms of some rational numbers. Write,, or in each blank to make a true sentence.

The chart below shows the fraction and decimal forms of some rational numbers. Write,, or in each blank to make a true sentence. Study Guide Pages 9 99 Rational Numbers The chart below shows the fraction and decimal forms of some rational numbers. Rational Number 0.7 0.8 Fraction Form Decimal Form.0 0.66. 0.7 0.8 You can compare

More information

Probability Distributions

Probability Distributions CONDENSED LESSON 13.1 Probability Distributions In this lesson, you Sketch the graph of the probability distribution for a continuous random variable Find probabilities by finding or approximating areas

More information

Exam details. Final Review Session. Things to Review

Exam details. Final Review Session. Things to Review Exam details Final Review Session Short answer, similar to book problems Formulae and tables will be given You CAN use a calculator Date and Time: Dec. 7, 006, 1-1:30 pm Location: Osborne Centre, Unit

More information

Decision 411: Class 3

Decision 411: Class 3 Decision 411: Class 3 Discussion of HW#1 Introduction to seasonal models Seasonal decomposition Seasonal adjustment on a spreadsheet Forecasting with seasonal adjustment Forecasting inflation Log transformation

More information

Decision 411: Class 3

Decision 411: Class 3 Decision 411: Class 3 Discussion of HW#1 Introduction to seasonal models Seasonal decomposition Seasonal adjustment on a spreadsheet Forecasting with seasonal adjustment Forecasting inflation Poor man

More information

Open book, but no loose leaf notes and no electronic devices. Points (out of 200) are in parentheses. Put all answers on the paper provided to you.

Open book, but no loose leaf notes and no electronic devices. Points (out of 200) are in parentheses. Put all answers on the paper provided to you. ISQS 5347 Final Exam Spring 2017 Open book, but no loose leaf notes and no electronic devices. Points (out of 200) are in parentheses. Put all answers on the paper provided to you. 1. Recall the commute

More information

Marquette University Executive MBA Program Statistics Review Class Notes Summer 2018

Marquette University Executive MBA Program Statistics Review Class Notes Summer 2018 Marquette University Executive MBA Program Statistics Review Class Notes Summer 2018 Chapter One: Data and Statistics Statistics A collection of procedures and principles

More information

Business Statistics. Chapter 14 Introduction to Linear Regression and Correlation Analysis QMIS 220. Dr. Mohammad Zainal

Business Statistics. Chapter 14 Introduction to Linear Regression and Correlation Analysis QMIS 220. Dr. Mohammad Zainal Department of Quantitative Methods & Information Systems Business Statistics Chapter 14 Introduction to Linear Regression and Correlation Analysis QMIS 220 Dr. Mohammad Zainal Chapter Goals After completing

More information

Sampling Distribution Models. Chapter 17

Sampling Distribution Models. Chapter 17 Sampling Distribution Models Chapter 17 Objectives: 1. Sampling Distribution Model 2. Sampling Variability (sampling error) 3. Sampling Distribution Model for a Proportion 4. Central Limit Theorem 5. Sampling

More information

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages: Glossary The ISI glossary of statistical terms provides definitions in a number of different languages: http://isi.cbs.nl/glossary/index.htm Adjusted r 2 Adjusted R squared measures the proportion of the

More information

Physics with Matlab and Mathematica Exercise #1 28 Aug 2012

Physics with Matlab and Mathematica Exercise #1 28 Aug 2012 Physics with Matlab and Mathematica Exercise #1 28 Aug 2012 You can work this exercise in either matlab or mathematica. Your choice. A simple harmonic oscillator is constructed from a mass m and a spring

More information

Decision 411: Class 3

Decision 411: Class 3 Decision 411: Class 3 Discussion of HW#1 Introduction to seasonal models Seasonal decomposition Seasonal adjustment on a spreadsheet Forecasting with seasonal adjustment Forecasting inflation Poor man

More information

1 MA421 Introduction. Ashis Gangopadhyay. Department of Mathematics and Statistics. Boston University. c Ashis Gangopadhyay

1 MA421 Introduction. Ashis Gangopadhyay. Department of Mathematics and Statistics. Boston University. c Ashis Gangopadhyay 1 MA421 Introduction Ashis Gangopadhyay Department of Mathematics and Statistics Boston University c Ashis Gangopadhyay 1.1 Introduction 1.1.1 Some key statistical concepts 1. Statistics: Art of data analysis,

More information

Chapter 4: Regression Models

Chapter 4: Regression Models Sales volume of company 1 Textbook: pp. 129-164 Chapter 4: Regression Models Money spent on advertising 2 Learning Objectives After completing this chapter, students will be able to: Identify variables,

More information