CS 147: Computer Systems Performance Analysis

Size: px
Start display at page:

Download "CS 147: Computer Systems Performance Analysis"

Transcription

1 CS 147: Computer Systems Performance Analysis Advanced Regression Techniques CS 147: Computer Systems Performance Analysis Advanced Regression Techniques 1 / 31

2 Overview Overview Overview Common Transformations Handling Outliers Common Transformations Handling Outliers 2 / 31

3 Linear regression assumes a linear relationship between predictor and response What if it isn t linear? You need to fit some other type of function to the relationship Linear regression assumes a linear relationship between predictor and response What if it isn t linear? You need to fit some other type of function to the relationship 3 / 31

4 4 / 31 When To Use When To Use When To Use Easiest to tell by sight Make a scatter plot If plot looks non-linear, try curvilinear regression Or if non-linear relationship is suspected for other reasons Relationship should be convertible to a linear form Easiest to tell by sight Make a scatter plot If plot looks non-linear, try curvilinear regression Or if non-linear relationship is suspected for other reasons Relationship should be convertible to a linear form

5 5 / 31 Common Transformations Types of Common Transformations Types of Types of Many possible types, based on a variety of relationships: y = ax b y = a + b/x y = ab x Etc., ad infinitum Many possible types, based on a variety of relationships: y = ax b y = a + b/x y = ab x Etc., ad infinitum

6 6 / 31 Common Transformations Transform Them to Linear Forms Common Transformations Transform Them to Linear Forms Transform Them to Linear Forms Apply logarithms, multiplication, division, whatever to produce something in linear form I.e., y = a + b something Or a similar form If predictor appears in more than one transformed predictor variable, correlation is likely! Apply logarithms, multiplication, division, whatever to produce something in linear form I.e., y = a + b something Or a similar form If predictor appears in more than one transformed predictor variable, correlation is likely!

7 Common Transformations Sample Transformations Common Transformations Sample Transformations Sample Transformations For y = ae bx take logarithm of y, do regression on log y = b0 + b1x, let b = b1, a = e b 0 For y = a + b log x, take log of x before fitting parameters, let b = b1, a = b0 For y = ax b, take log of both x and y, let b = b1, a = e b 0 For y = ae bx take logarithm of y, do regression on log y = b 0 + b 1 x, let b = b 1, a = e b 0 For y = a + b log x, take log of x before fitting parameters, let b = b 1, a = b 0 For y = ax b, take log of both x and y, let b = b 1, a = e b 0 7 / 31

8 8 / 31 Common Transformations Corrections to Jain p. 257 (Early Editions) Common Transformations Corrections to Jain p. 257 Corrections to Jain p. 257 (Early Editions) Nonlinear y = a + b/x y = 1/(a + bx) y = x(a + bx) y = ab x Linear y = a+b(1/x) (1/y) = a + bx (x/y) = a + bx ln y = ln a + x ln b y = a + bx n y = a + b(x n ) Nonlinear Linear y = a + b/x y = a+b(1/x) y = 1/(a + bx) (1/y) = a + bx y = x(a + bx) (x/y) = a + bx y = ab x ln y = ln a + x ln b y = a + bx n y = a + b(x n )

9 Use some function of response variable y in place of y itself Curvilinear regression is one example But techniques are more generally applicable Use some function of response variable y in place of y itself Curvilinear regression is one example But techniques are more generally applicable 9 / 31

10 When To Transform? When To Transform? When To Transform? If known properties of measured system suggest it If data s range covers several orders of magnitude If homogeneous variance assumption of residuals (homoscedasticity) is violated If known properties of measured system suggest it If data s range covers several orders of magnitude If homogeneous variance assumption of residuals (homoscedasticity) is violated 10 / 31

11 Transforming Due To (Lack of) Homoscedasticity Transforming Due To (Lack of) Homoscedasticity Transforming Due To (Lack of) Homoscedasticity If spread of scatter plot of residual vs. predicted response isn t homogeneous, Then residuals are still functions of the predictor variables Transformation of response may solve the problem If spread of scatter plot of residual vs. predicted response isn t homogeneous, Then residuals are still functions of the predictor variables Transformation of response may solve the problem 11 / 31

12 12 / 31 What Transformation To Use? Compute standard deviation of residuals Plot as function of mean of observations Assuming multiple experiments for single set of predictor values Check for linearity: if linear, use a log transform If variance against mean of observations is linear, use square-root transform If standard deviation against mean squared is linear, use inverse (1/y) transform If standard deviation against mean to a power is linear, use power transform More covered in the book What Transformation To Use? What Transformation To Use? Compute standard deviation of residuals Plot as function of mean of observations Assuming multiple experiments for single set of predictor values Check for linearity: if linear, use a log transform If variance against mean of observations is linear, use square-root transform If standard deviation against mean squared is linear, use inverse (1/y) transform If standard deviation against mean to a power is linear, use power transform More covered in the book

13 13 / 31 General Transformation Principle General Transformation Principle General Transformation Principle For some observed relation between standard deviation and mean, s = g(y): 1 let h(y) = g(y) dy transform to w = h(y) and regress on w For some observed relation between standard deviation and mean, s = g(y): 1 let h(y) = g(y) dy transform to w = h(y) and regress on w

14 14 / 31 Example: Log Transformation Example: Log Transformation Example: Log Transformation If standard deviation against mean is linear, then g(y) = ay 1 So h(y) = ay dy = 1 a ln y If standard deviation against mean is linear, then g(y) = ay 1 So h(y) = ay dy = 1 a ln y

15 15 / 31 Confidence Intervals for Nonlinear Regressions Confidence Intervals for Nonlinear Regressions Confidence Intervals for Nonlinear Regressions For nonlinear fits using general (e.g., exponential) transformations: Confidence intervals apply to transformed parameters Not valid to perform inverse transformation before calculating intervals Must express confidence intervals in transformed domain For nonlinear fits using general (e.g., exponential) transformations: Confidence intervals apply to transformed parameters Not valid to perform inverse transformation before calculating intervals Must express confidence intervals in transformed domain

16 16 / 31 Handling Outliers Outliers Handling Outliers Outliers Outliers Atypical observations might be outliers Measurements that are not truly characteristic By chance, several standard deviations out Or mistakes might have been made in measurement Which leads to a problem: Do you include outliers in analysis or not? Atypical observations might be outliers Measurements that are not truly characteristic By chance, several standard deviations out Or mistakes might have been made in measurement Which leads to a problem: Do you include outliers in analysis or not?

17 17 / 31 Handling Outliers Deciding How To Handle Outliers Handling Outliers Deciding How To Handle Outliers Deciding How To Handle Outliers 1. Find them (by looking at scatter plot) 2. Check carefully for experimental error 3. Repeat experiments at predictor values for each outlier 4. Decide whether to include or omit outliers Or do analysis both ways Question: Is last point in last lecture s example an outlier on rating vs. year plot? 1. Find them (by looking at scatter plot) 2. Check carefully for experimental error 3. Repeat experiments at predictor values for each outlier 4. Decide whether to include or omit outliers Or do analysis both ways Question: Is last point in last lecture s example an outlier on rating vs. year plot?

18 Handling Outliers Rating vs. Year Handling Outliers Rating vs. Year Rating vs. Year 8 6 Rating Year 8 6 Rating Year 18 / 31

19 in Regression in Regression in Regression Generally based on taking shortcuts Or not being careful Or not understanding some fundamental principle of statistics Generally based on taking shortcuts Or not being careful Or not understanding some fundamental principle of statistics 19 / 31

20 Not Verifying Linearity Not Verifying Linearity Not Verifying Linearity Draw the scatter plot If it s not linear, check for curvilinear possibilities Misleading to use linear regression when relationship isn t linear Draw the scatter plot If it s not linear, check for curvilinear possibilities Misleading to use linear regression when relationship isn t linear 20 / 31

21 21 / 31 Relying on Results Without Visual Verification Relying on Results Without Visual Verification Relying on Results Without Visual Verification Always check scatter plot as part of regression Examine predicted line vs. actual points Particularly important if regression is done automatically Always check scatter plot as part of regression Examine predicted line vs. actual points Particularly important if regression is done automatically

22 22 / 31 Some Nonlinear Examples Some Nonlinear Examples Some Nonlinear Examples

23 23 / 31 Attaching Importance to Parameter Values Attaching Importance to Parameter Values Attaching Importance to Parameter Values Numerical values of regression parameters depend on scale of predictor variables So just because a particular parameter s value seems small or large, not necessarily an indication of importance E.g., converting seconds to microseconds doesn t change anything fundamental But magnitude of associated parameter changes Numerical values of regression parameters depend on scale of predictor variables So just because a particular parameter s value seems small or large, not necessarily an indication of importance E.g., converting seconds to microseconds doesn t change anything fundamental But magnitude of associated parameter changes

24 Not Specifying Confidence Intervals Not Specifying Confidence Intervals Not Specifying Confidence Intervals Samples of observations are random Thus, regression yields parameters with random properties Without confidence interval, impossible to understand what a parameter really means Samples of observations are random Thus, regression yields parameters with random properties Without confidence interval, impossible to understand what a parameter really means 24 / 31

25 Not Calculating Coefficient of Determination Not Calculating Coefficient of Determination Not Calculating Coefficient of Determination Without R 2, difficult to determine how much of variance is explained by the regression Even if R 2 looks good, safest to also perform an F-test Not that much extra effort Without R 2, difficult to determine how much of variance is explained by the regression Even if R 2 looks good, safest to also perform an F-test Not that much extra effort 25 / 31

26 Using Coefficient of Correlation Improperly Using Coefficient of Correlation Improperly Using Coefficient of Correlation Improperly Coefficient of determination is R 2 Coefficient of correlation is R R 2 gives percentage of variance explained by regression, not R E.g., if R is.5, R 2 is.25 And regression explains 25% of variance Not 50%! Coefficient of determination is R 2 Coefficient of correlation is R R 2 gives percentage of variance explained by regression, not R E.g., if R is.5, R 2 is.25 And regression explains 25% of variance Not 50%! 26 / 31

27 27 / 31 Using Highly Correlated Predictor Variables Using Highly Correlated Predictor Variables Using Highly Correlated Predictor Variables If two predictor variables are highly correlated, using both degrades regression E.g., likely to be correlation between an executable s on-disk and in-core sizes So don t use both as predictors of run time Means you need to understand your predictor variables as well as possible If two predictor variables are highly correlated, using both degrades regression E.g., likely to be correlation between an executable s on-disk and in-core sizes So don t use both as predictors of run time Means you need to understand your predictor variables as well as possible

28 28 / 31 Using Regression Beyond Range of Observations Using Regression Beyond Range of Observations Using Regression Beyond Range of Observations Regression is based on observed behavior in a particular sample Most likely to predict accurately within range of that sample Far outside the range, who knows? E.g., regression on run time of executables smaller than size of main memory may not predict performance of executables that need VM activity Regression is based on observed behavior in a particular sample Most likely to predict accurately within range of that sample Far outside the range, who knows? E.g., regression on run time of executables smaller than size of main memory may not predict performance of executables that need VM activity

29 Measuring Too Little of the Range Measuring Too Little of the Range Measuring Too Little of the Range Converse of prevoius mistake Regression only predicts well near range of observations If you don t measure commonly used range, regression won t predict much E.g., if many programs are bigger than main memory, only measuring those that are smaller is a mistake Converse of prevoius mistake Regression only predicts well near range of observations If you don t measure commonly used range, regression won t predict much E.g., if many programs are bigger than main memory, only measuring those that are smaller is a mistake 29 / 31

30 30 / 31 Using Too Many Predictor Variables Using Too Many Predictor Variables Using Too Many Predictor Variables Adding more predictors does not necessarily improve model! More likely to run into multicollinearity problems So what variables to choose? It s an art Subject of much of this course Adding more predictors does not necessarily improve model! More likely to run into multicollinearity problems So what variables to choose? It s an art Subject of much of this course

31 Assuming a Good Predictor Is a Good Controller Assuming a Good Predictor Is a Good Controller Assuming a Good Predictor Is a Good Controller Often, a goal of regression is finding control variables But correlation isn t necessarily control Just because variable A is related to variable B, you may not be able to control values of B by varying A E.g., if number of hits on a Web page is correlated to server bandwidth, you might not boost hits by increasing bandwidth Often, a goal of regression is finding control variables But correlation isn t necessarily control Just because variable A is related to variable B, you may not be able to control values of B by varying A E.g., if number of hits on a Web page is correlated to server bandwidth, you might not boost hits by increasing bandwidth 31 / 31

CS 5014: Research Methods in Computer Science

CS 5014: Research Methods in Computer Science Computer Science Clifford A. Shaffer Department of Computer Science Virginia Tech Blacksburg, Virginia Fall 2010 Copyright c 2010 by Clifford A. Shaffer Computer Science Fall 2010 1 / 207 Correlation and

More information

Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras

Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras Lecture - 39 Regression Analysis Hello and welcome to the course on Biostatistics

More information

An introduction to plotting data

An introduction to plotting data An introduction to plotting data Eric D. Black California Institute of Technology v2.0 1 Introduction Plotting data is one of the essential skills every scientist must have. We use it on a near-daily basis

More information

Chapter 1 Review of Equations and Inequalities

Chapter 1 Review of Equations and Inequalities Chapter 1 Review of Equations and Inequalities Part I Review of Basic Equations Recall that an equation is an expression with an equal sign in the middle. Also recall that, if a question asks you to solve

More information

10. Alternative case influence statistics

10. Alternative case influence statistics 10. Alternative case influence statistics a. Alternative to D i : dffits i (and others) b. Alternative to studres i : externally-studentized residual c. Suggestion: use whatever is convenient with the

More information

Chapter 12 - Part I: Correlation Analysis

Chapter 12 - Part I: Correlation Analysis ST coursework due Friday, April - Chapter - Part I: Correlation Analysis Textbook Assignment Page - # Page - #, Page - # Lab Assignment # (available on ST webpage) GOALS When you have completed this lecture,

More information

Wed, June 26, (Lecture 8-2). Nonlinearity. Significance test for correlation R-squared, SSE, and SST. Correlation in SPSS.

Wed, June 26, (Lecture 8-2). Nonlinearity. Significance test for correlation R-squared, SSE, and SST. Correlation in SPSS. Wed, June 26, (Lecture 8-2). Nonlinearity. Significance test for correlation R-squared, SSE, and SST. Correlation in SPSS. Last time, we looked at scatterplots, which show the interaction between two variables,

More information

Multiple Linear Regression estimation, testing and checking assumptions

Multiple Linear Regression estimation, testing and checking assumptions Multiple Linear Regression estimation, testing and checking assumptions Lecture No. 07 Example 1 The president of a large chain of fast-food restaurants has randomly selected 10 franchises and recorded

More information

REVIEW 8/2/2017 陈芳华东师大英语系

REVIEW 8/2/2017 陈芳华东师大英语系 REVIEW Hypothesis testing starts with a null hypothesis and a null distribution. We compare what we have to the null distribution, if the result is too extreme to belong to the null distribution (p

More information

Inference with Simple Regression

Inference with Simple Regression 1 Introduction Inference with Simple Regression Alan B. Gelder 06E:071, The University of Iowa 1 Moving to infinite means: In this course we have seen one-mean problems, twomean problems, and problems

More information

Variance. Standard deviation VAR = = value. Unbiased SD = SD = 10/23/2011. Functional Connectivity Correlation and Regression.

Variance. Standard deviation VAR = = value. Unbiased SD = SD = 10/23/2011. Functional Connectivity Correlation and Regression. 10/3/011 Functional Connectivity Correlation and Regression Variance VAR = Standard deviation Standard deviation SD = Unbiased SD = 1 10/3/011 Standard error Confidence interval SE = CI = = t value for

More information

1 Measurement Uncertainties

1 Measurement Uncertainties 1 Measurement Uncertainties (Adapted stolen, really from work by Amin Jaziri) 1.1 Introduction No measurement can be perfectly certain. No measuring device is infinitely sensitive or infinitely precise.

More information

Estimating σ 2. We can do simple prediction of Y and estimation of the mean of Y at any value of X.

Estimating σ 2. We can do simple prediction of Y and estimation of the mean of Y at any value of X. Estimating σ 2 We can do simple prediction of Y and estimation of the mean of Y at any value of X. To perform inferences about our regression line, we must estimate σ 2, the variance of the error term.

More information

:Effects of Data Scaling We ve already looked at the effects of data scaling on the OLS statistics, 2, and R 2. What about test statistics?

:Effects of Data Scaling We ve already looked at the effects of data scaling on the OLS statistics, 2, and R 2. What about test statistics? MRA: Further Issues :Effects of Data Scaling We ve already looked at the effects of data scaling on the OLS statistics, 2, and R 2. What about test statistics? 1. Scaling the explanatory variables Suppose

More information

Introduction to Algorithms / Algorithms I Lecturer: Michael Dinitz Topic: Intro to Learning Theory Date: 12/8/16

Introduction to Algorithms / Algorithms I Lecturer: Michael Dinitz Topic: Intro to Learning Theory Date: 12/8/16 600.463 Introduction to Algorithms / Algorithms I Lecturer: Michael Dinitz Topic: Intro to Learning Theory Date: 12/8/16 25.1 Introduction Today we re going to talk about machine learning, but from an

More information

JUST THE MATHS UNIT NUMBER 5.3. GEOMETRY 3 (Straight line laws) A.J.Hobson

JUST THE MATHS UNIT NUMBER 5.3. GEOMETRY 3 (Straight line laws) A.J.Hobson JUST THE MATHS UNIT NUMBER 5.3 GEOMETRY 3 (Straight line laws) by A.J.Hobson 5.3.1 Introduction 5.3.2 Laws reducible to linear form 5.3.3 The use of logarithmic graph paper 5.3.4 Exercises 5.3.5 Answers

More information

Algebra Year 10. Language

Algebra Year 10. Language Algebra Year 10 Introduction In Algebra we do Maths with numbers, but some of those numbers are not known. They are represented with letters, and called unknowns, variables or, most formally, literals.

More information

Exam Question 10: Differential Equations. June 19, Applied Mathematics: Lecture 6. Brendan Williamson. Introduction.

Exam Question 10: Differential Equations. June 19, Applied Mathematics: Lecture 6. Brendan Williamson. Introduction. Exam Question 10: June 19, 2016 In this lecture we will study differential equations, which pertains to Q. 10 of the Higher Level paper. It s arguably more theoretical than other topics on the syllabus,

More information

CORRELATION. suppose you get r 0. Does that mean there is no correlation between the data sets? many aspects of the data may a ect the value of r

CORRELATION. suppose you get r 0. Does that mean there is no correlation between the data sets? many aspects of the data may a ect the value of r Introduction to Statistics in Psychology PS 1 Professor Greg Francis Lecture 11 correlation Is there a relationship between IQ and problem solving ability? CORRELATION suppose you get r 0. Does that mean

More information

Regression Analysis Primer DEO PowerPoint, Bureau of Labor Market Statistics

Regression Analysis Primer DEO PowerPoint, Bureau of Labor Market Statistics Regression Analysis Primer DEO PowerPoint, Bureau of Labor Market Statistics September 27-30, 2017 Regression Analysis Stephen Birch, Economic Consultant LTIP Technical Lead, Projections Managing Partnership

More information

Multicollinearity occurs when two or more predictors in the model are correlated and provide redundant information about the response.

Multicollinearity occurs when two or more predictors in the model are correlated and provide redundant information about the response. Multicollinearity Read Section 7.5 in textbook. Multicollinearity occurs when two or more predictors in the model are correlated and provide redundant information about the response. Example of multicollinear

More information

Using Microsoft Excel

Using Microsoft Excel Using Microsoft Excel Objective: Students will gain familiarity with using Excel to record data, display data properly, use built-in formulae to do calculations, and plot and fit data with linear functions.

More information

Trendlines Simple Linear Regression Multiple Linear Regression Systematic Model Building Practical Issues

Trendlines Simple Linear Regression Multiple Linear Regression Systematic Model Building Practical Issues Trendlines Simple Linear Regression Multiple Linear Regression Systematic Model Building Practical Issues Overfitting Categorical Variables Interaction Terms Non-linear Terms Linear Logarithmic y = a +

More information

AP Statistics. The only statistics you can trust are those you falsified yourself. RE- E X P R E S S I N G D A T A ( P A R T 2 ) C H A P 9

AP Statistics. The only statistics you can trust are those you falsified yourself. RE- E X P R E S S I N G D A T A ( P A R T 2 ) C H A P 9 AP Statistics 1 RE- E X P R E S S I N G D A T A ( P A R T 2 ) C H A P 9 The only statistics you can trust are those you falsified yourself. Sir Winston Churchill (1874-1965) (Attribution to Churchill is

More information

Modeling Population Growth

Modeling Population Growth Exponential Growth 1.4 19 Modeling Population Growth Example. Modeling the size of a population. We would like to build a simple model to predict the size of a population in 10 years. A very macro-level

More information

- measures the center of our distribution. In the case of a sample, it s given by: y i. y = where n = sample size.

- measures the center of our distribution. In the case of a sample, it s given by: y i. y = where n = sample size. Descriptive Statistics: One of the most important things we can do is to describe our data. Some of this can be done graphically (you should be familiar with histograms, boxplots, scatter plots and so

More information

Understanding Exponents Eric Rasmusen September 18, 2018

Understanding Exponents Eric Rasmusen September 18, 2018 Understanding Exponents Eric Rasmusen September 18, 2018 These notes are rather long, but mathematics often has the perverse feature that if someone writes a long explanation, the reader can read it much

More information

Correlation & Simple Regression

Correlation & Simple Regression Chapter 11 Correlation & Simple Regression The previous chapter dealt with inference for two categorical variables. In this chapter, we would like to examine the relationship between two quantitative variables.

More information

Discrete Mathematics and Probability Theory Summer 2014 James Cook Note 5

Discrete Mathematics and Probability Theory Summer 2014 James Cook Note 5 CS 70 Discrete Mathematics and Probability Theory Summer 2014 James Cook Note 5 Modular Arithmetic In several settings, such as error-correcting codes and cryptography, we sometimes wish to work over a

More information

Math. Manipulation. Determine the best function of this type. Now: Visually. Later: Using a computer

Math. Manipulation. Determine the best function of this type. Now: Visually. Later: Using a computer Data Fitting 1.3 & 3.2 11 The next few days Goals: Understand function fitting, introduce Mathematica Frame of reference: Formulation. Suppose the problem has been properly formulated. Problem statement

More information

irst we need to know that there are many ways to indicate multiplication; for example the product of 5 and 7 can be written in a variety of ways:

irst we need to know that there are many ways to indicate multiplication; for example the product of 5 and 7 can be written in a variety of ways: CH 2 VARIABLES INTRODUCTION F irst we need to know that there are many ways to indicate multiplication; for example the product of 5 and 7 can be written in a variety of ways: 5 7 5 7 5(7) (5)7 (5)(7)

More information

LECTURE 15: SIMPLE LINEAR REGRESSION I

LECTURE 15: SIMPLE LINEAR REGRESSION I David Youngberg BSAD 20 Montgomery College LECTURE 5: SIMPLE LINEAR REGRESSION I I. From Correlation to Regression a. Recall last class when we discussed two basic types of correlation (positive and negative).

More information

Chapter 11 Linear Regression

Chapter 11 Linear Regression Chapter 11 Linear Regression Linear regression is a methodology that allows us to examine the relationship between two continuously measured variables where we believe that values of one variable may influence

More information

Lecture 10: F -Tests, ANOVA and R 2

Lecture 10: F -Tests, ANOVA and R 2 Lecture 10: F -Tests, ANOVA and R 2 1 ANOVA We saw that we could test the null hypothesis that β 1 0 using the statistic ( β 1 0)/ŝe. (Although I also mentioned that confidence intervals are generally

More information

, (1) e i = ˆσ 1 h ii. c 2016, Jeffrey S. Simonoff 1

, (1) e i = ˆσ 1 h ii. c 2016, Jeffrey S. Simonoff 1 Regression diagnostics As is true of all statistical methodologies, linear regression analysis can be a very effective way to model data, as along as the assumptions being made are true. For the regression

More information

Relationships Regression

Relationships Regression Relationships Regression BPS chapter 5 2006 W.H. Freeman and Company Objectives (BPS chapter 5) Regression Regression lines The least-squares regression line Using technology Facts about least-squares

More information

appstats27.notebook April 06, 2017

appstats27.notebook April 06, 2017 Chapter 27 Objective Students will conduct inference on regression and analyze data to write a conclusion. Inferences for Regression An Example: Body Fat and Waist Size pg 634 Our chapter example revolves

More information

Announcements. J. Parman (UC-Davis) Analysis of Economic Data, Winter 2011 February 8, / 45

Announcements. J. Parman (UC-Davis) Analysis of Economic Data, Winter 2011 February 8, / 45 Announcements Solutions to Problem Set 3 are posted Problem Set 4 is posted, It will be graded and is due a week from Friday You already know everything you need to work on Problem Set 4 Professor Miller

More information

Correlation and regression

Correlation and regression NST 1B Experimental Psychology Statistics practical 1 Correlation and regression Rudolf Cardinal & Mike Aitken 11 / 12 November 2003 Department of Experimental Psychology University of Cambridge Handouts:

More information

Chapter 27 Summary Inferences for Regression

Chapter 27 Summary Inferences for Regression Chapter 7 Summary Inferences for Regression What have we learned? We have now applied inference to regression models. Like in all inference situations, there are conditions that we must check. We can test

More information

Preptests 55 Answers and Explanations (By Ivy Global) Section 4 Logic Games

Preptests 55 Answers and Explanations (By Ivy Global) Section 4 Logic Games Section 4 Logic Games Questions 1 6 There aren t too many deductions we can make in this game, and it s best to just note how the rules interact and save your time for answering the questions. 1. Type

More information

2 Analogies between addition and multiplication

2 Analogies between addition and multiplication Problem Analysis The problem Start out with 99% water. Some of the water evaporates, end up with 98% water. How much of the water evaporates? Guesses Solution: Guesses: Not %. 2%. 5%. Not 00%. 3%..0%..5%.

More information

Regression Analysis. A statistical procedure used to find relations among a set of variables.

Regression Analysis. A statistical procedure used to find relations among a set of variables. Regression Analysis A statistical procedure used to find relations among a set of variables. Understanding relations Mapping data enables us to examine (describe) where things occur (e.g., areas where

More information

Response Surface Methods

Response Surface Methods Response Surface Methods 3.12.2014 Goals of Today s Lecture See how a sequence of experiments can be performed to optimize a response variable. Understand the difference between first-order and second-order

More information

E.g. The set RE of regular expressions is given by the following rules:

E.g. The set RE of regular expressions is given by the following rules: 1 Lecture Summary We gave a brief overview of inductively defined sets, inductively defined functions, and proofs by structural induction We showed how to prove that a machine is correct using induction

More information

Math101, Sections 2 and 3, Spring 2008 Review Sheet for Exam #2:

Math101, Sections 2 and 3, Spring 2008 Review Sheet for Exam #2: Math101, Sections 2 and 3, Spring 2008 Review Sheet for Exam #2: 03 17 08 3 All about lines 3.1 The Rectangular Coordinate System Know how to plot points in the rectangular coordinate system. Know the

More information

Making Piecewise Functions Continuous and Differentiable by Dave Slomer

Making Piecewise Functions Continuous and Differentiable by Dave Slomer Making Piecewise Functions Continuous and Differentiable by Dave Slomer Piecewise-defined functions are applied in areas such as Computer Assisted Drawing (CAD). Many piecewise functions in textbooks are

More information

Chapter 7 Summary Scatterplots, Association, and Correlation

Chapter 7 Summary Scatterplots, Association, and Correlation Chapter 7 Summary Scatterplots, Association, and Correlation What have we learned? We examine scatterplots for direction, form, strength, and unusual features. Although not every relationship is linear,

More information

Lecture 12. Functional form

Lecture 12. Functional form Lecture 12. Functional form Multiple linear regression model β1 + β2 2 + L+ β K K + u Interpretation of regression coefficient k Change in if k is changed by 1 unit and the other variables are held constant.

More information

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1 Lecture Slides Elementary Statistics Tenth Edition and the Triola Statistics Series by Mario F. Triola Slide 1 Chapter 10 Correlation and Regression 10-1 Overview 10-2 Correlation 10-3 Regression 10-4

More information

Chapter 9. Correlation and Regression

Chapter 9. Correlation and Regression Chapter 9 Correlation and Regression Lesson 9-1/9-2, Part 1 Correlation Registered Florida Pleasure Crafts and Watercraft Related Manatee Deaths 100 80 60 40 20 0 1991 1993 1995 1997 1999 Year Boats in

More information

STAT Checking Model Assumptions

STAT Checking Model Assumptions STAT 704 --- Checking Model Assumptions Recall we assumed the following in our model: (1) The regression relationship between the response and the predictor(s) specified in the model is appropriate (2)

More information

Regression and correlation. Correlation & Regression, I. Regression & correlation. Regression vs. correlation. Involve bivariate, paired data, X & Y

Regression and correlation. Correlation & Regression, I. Regression & correlation. Regression vs. correlation. Involve bivariate, paired data, X & Y Regression and correlation Correlation & Regression, I 9.07 4/1/004 Involve bivariate, paired data, X & Y Height & weight measured for the same individual IQ & exam scores for each individual Height of

More information

Logarithmic and Exponential Equations and Change-of-Base

Logarithmic and Exponential Equations and Change-of-Base Logarithmic and Exponential Equations and Change-of-Base MATH 101 College Algebra J. Robert Buchanan Department of Mathematics Summer 2012 Objectives In this lesson we will learn to solve exponential equations

More information

ACCESS TO SCIENCE, ENGINEERING AND AGRICULTURE: MATHEMATICS 1 MATH00030 SEMESTER /2018

ACCESS TO SCIENCE, ENGINEERING AND AGRICULTURE: MATHEMATICS 1 MATH00030 SEMESTER /2018 ACCESS TO SCIENCE, ENGINEERING AND AGRICULTURE: MATHEMATICS 1 MATH00030 SEMESTER 1 2017/2018 DR. ANTHONY BROWN 1. Arithmetic and Algebra 1.1. Arithmetic of Numbers. While we have calculators and computers

More information

Predator - Prey Model Trajectories and the nonlinear conservation law

Predator - Prey Model Trajectories and the nonlinear conservation law Predator - Prey Model Trajectories and the nonlinear conservation law James K. Peterson Department of Biological Sciences and Department of Mathematical Sciences Clemson University October 28, 2013 Outline

More information

Remedial Measures Wrap-Up and Transformations Box Cox

Remedial Measures Wrap-Up and Transformations Box Cox Remedial Measures Wrap-Up and Transformations Box Cox Dr. Frank Wood Frank Wood, fwood@stat.columbia.edu Linear Regression Models Lecture 1, Slide 1 Last Class Graphical procedures for determining appropriateness

More information

Suppose we have the set of all real numbers, R, and two operations, +, and *. Then the following are assumed to be true.

Suppose we have the set of all real numbers, R, and two operations, +, and *. Then the following are assumed to be true. Algebra Review In this appendix, a review of algebra skills will be provided. Students sometimes think that there are tricks needed to do algebra. Rather, algebra is a set of rules about what one may and

More information

Lecture 30. DATA 8 Summer Regression Inference

Lecture 30. DATA 8 Summer Regression Inference DATA 8 Summer 2018 Lecture 30 Regression Inference Slides created by John DeNero (denero@berkeley.edu) and Ani Adhikari (adhikari@berkeley.edu) Contributions by Fahad Kamran (fhdkmrn@berkeley.edu) and

More information

Please bring the task to your first physics lesson and hand it to the teacher.

Please bring the task to your first physics lesson and hand it to the teacher. Pre-enrolment task for 2014 entry Physics Why do I need to complete a pre-enrolment task? This bridging pack serves a number of purposes. It gives you practice in some of the important skills you will

More information

One important way that you can classify differential equations is as linear or nonlinear.

One important way that you can classify differential equations is as linear or nonlinear. In This Chapter Chapter 1 Looking Closely at Linear First Order Differential Equations Knowing what a first order linear differential equation looks like Finding solutions to first order differential equations

More information

CS 361: Probability & Statistics

CS 361: Probability & Statistics January 24, 2018 CS 361: Probability & Statistics Relationships in data Standard coordinates If we have two quantities of interest in a dataset, we might like to plot their histograms and compare the two

More information

CS 147: Computer Systems Performance Analysis

CS 147: Computer Systems Performance Analysis CS 147: Computer Systems Performance Analysis CS 147: Computer Systems Performance Analysis 1 / 34 Overview Overview Overview Adding Replications Adding Replications 2 / 34 Two-Factor Design Without Replications

More information

Mathematical Modelling Lecture 4 Fitting Data

Mathematical Modelling Lecture 4 Fitting Data Lecture 4 Fitting Data phil.hasnip@york.ac.uk Overview of Course Model construction dimensional analysis Experimental input fitting Finding a best answer optimisation Tools for constructing and manipulating

More information

Modeling Data with Functions

Modeling Data with Functions Chapter 11 Modeling Data with Functions 11.1 Data Modeling Concepts 1 11.1.1 Conceptual Explanations: Modeling Data with Functions In school, you generally start with a function and work from there to

More information

Regression, part II. I. What does it all mean? A) Notice that so far all we ve done is math.

Regression, part II. I. What does it all mean? A) Notice that so far all we ve done is math. Regression, part II I. What does it all mean? A) Notice that so far all we ve done is math. 1) One can calculate the Least Squares Regression Line for anything, regardless of any assumptions. 2) But, if

More information

Nonlinear Regression Functions

Nonlinear Regression Functions Nonlinear Regression Functions (SW Chapter 8) Outline 1. Nonlinear regression functions general comments 2. Nonlinear functions of one variable 3. Nonlinear functions of two variables: interactions 4.

More information

Linear Regression. Linear Regression. Linear Regression. Did You Mean Association Or Correlation?

Linear Regression. Linear Regression. Linear Regression. Did You Mean Association Or Correlation? Did You Mean Association Or Correlation? AP Statistics Chapter 8 Be careful not to use the word correlation when you really mean association. Often times people will incorrectly use the word correlation

More information

General Least Squares Fitting

General Least Squares Fitting Appendix B General Least Squares Fitting B.1 Introduction Previously you have done curve fitting in two dimensions. Now you will learn how to extend that to multiple dimensions. B.1.1 Linearizable Non-linear

More information

Chapter 16. Simple Linear Regression and dcorrelation

Chapter 16. Simple Linear Regression and dcorrelation Chapter 16 Simple Linear Regression and dcorrelation 16.1 Regression Analysis Our problem objective is to analyze the relationship between interval variables; regression analysis is the first tool we will

More information

Finding Limits Graphically and Numerically

Finding Limits Graphically and Numerically Finding Limits Graphically and Numerically 1. Welcome to finding limits graphically and numerically. My name is Tuesday Johnson and I m a lecturer at the University of Texas El Paso. 2. With each lecture

More information

Stat 101: Lecture 6. Summer 2006

Stat 101: Lecture 6. Summer 2006 Stat 101: Lecture 6 Summer 2006 Outline Review and Questions Example for regression Transformations, Extrapolations, and Residual Review Mathematical model for regression Each point (X i, Y i ) in the

More information

Math Precalculus I University of Hawai i at Mānoa Spring

Math Precalculus I University of Hawai i at Mānoa Spring Math 135 - Precalculus I University of Hawai i at Mānoa Spring - 2014 Created for Math 135, Spring 2008 by Lukasz Grabarek and Michael Joyce Send comments and corrections to lukasz@math.hawaii.edu Contents

More information

Descriptive Statistics (And a little bit on rounding and significant digits)

Descriptive Statistics (And a little bit on rounding and significant digits) Descriptive Statistics (And a little bit on rounding and significant digits) Now that we know what our data look like, we d like to be able to describe it numerically. In other words, how can we represent

More information

Any of 27 linear and nonlinear models may be fit. The output parallels that of the Simple Regression procedure.

Any of 27 linear and nonlinear models may be fit. The output parallels that of the Simple Regression procedure. STATGRAPHICS Rev. 9/13/213 Calibration Models Summary... 1 Data Input... 3 Analysis Summary... 5 Analysis Options... 7 Plot of Fitted Model... 9 Predicted Values... 1 Confidence Intervals... 11 Observed

More information

Six Sigma Black Belt Study Guides

Six Sigma Black Belt Study Guides Six Sigma Black Belt Study Guides 1 www.pmtutor.org Powered by POeT Solvers Limited. Analyze Correlation and Regression Analysis 2 www.pmtutor.org Powered by POeT Solvers Limited. Variables and relationships

More information

Lecture 4: Multivariate Regression, Part 2

Lecture 4: Multivariate Regression, Part 2 Lecture 4: Multivariate Regression, Part 2 Gauss-Markov Assumptions 1) Linear in Parameters: Y X X X i 0 1 1 2 2 k k 2) Random Sampling: we have a random sample from the population that follows the above

More information

2. Outliers and inference for regression

2. Outliers and inference for regression Unit6: Introductiontolinearregression 2. Outliers and inference for regression Sta 101 - Spring 2016 Duke University, Department of Statistical Science Dr. Çetinkaya-Rundel Slides posted at http://bit.ly/sta101_s16

More information

Linear Regression 9/23/17. Simple linear regression. Advertising sales: Variance changes based on # of TVs. Advertising sales: Normal error?

Linear Regression 9/23/17. Simple linear regression. Advertising sales: Variance changes based on # of TVs. Advertising sales: Normal error? Simple linear regression Linear Regression Nicole Beckage y " = β % + β ' x " + ε so y* " = β+ % + β+ ' x " Method to assess and evaluate the correlation between two (continuous) variables. The slope of

More information

Lecture 10: Powers of Matrices, Difference Equations

Lecture 10: Powers of Matrices, Difference Equations Lecture 10: Powers of Matrices, Difference Equations Difference Equations A difference equation, also sometimes called a recurrence equation is an equation that defines a sequence recursively, i.e. each

More information

Algebra. Here are a couple of warnings to my students who may be here to get a copy of what happened on a day that you missed.

Algebra. Here are a couple of warnings to my students who may be here to get a copy of what happened on a day that you missed. This document was written and copyrighted by Paul Dawkins. Use of this document and its online version is governed by the Terms and Conditions of Use located at. The online version of this document is

More information

Finite Mathematics : A Business Approach

Finite Mathematics : A Business Approach Finite Mathematics : A Business Approach Dr. Brian Travers and Prof. James Lampes Second Edition Cover Art by Stephanie Oxenford Additional Editing by John Gambino Contents What You Should Already Know

More information

ISQS 5349 Spring 2013 Final Exam

ISQS 5349 Spring 2013 Final Exam ISQS 5349 Spring 2013 Final Exam Name: General Instructions: Closed books, notes, no electronic devices. Points (out of 200) are in parentheses. Put written answers on separate paper; multiple choices

More information

Essentials of Intermediate Algebra

Essentials of Intermediate Algebra Essentials of Intermediate Algebra BY Tom K. Kim, Ph.D. Peninsula College, WA Randy Anderson, M.S. Peninsula College, WA 9/24/2012 Contents 1 Review 1 2 Rules of Exponents 2 2.1 Multiplying Two Exponentials

More information

Prof. Bodrero s Guide to Derivatives of Trig Functions (Sec. 3.5) Name:

Prof. Bodrero s Guide to Derivatives of Trig Functions (Sec. 3.5) Name: Prof. Bodrero s Guide to Derivatives of Trig Functions (Sec. 3.5) Name: Objectives: Understand how the derivatives of the six basic trig functions are found. Be able to find the derivative for each of

More information

2.1 The Tangent and Velocity Problems

2.1 The Tangent and Velocity Problems 2.1 The Tangent and Velocity Problems Ex: When you jump off a swing, where do you go? Ex: Can you approximate this line with another nearby? How would you get a better approximation? Ex: A cardiac monitor

More information

A particularly nasty aspect of this is that it is often difficult or impossible to tell if a model fails to satisfy these steps.

A particularly nasty aspect of this is that it is often difficult or impossible to tell if a model fails to satisfy these steps. ECON 497: Lecture 6 Page 1 of 1 Metropolitan State University ECON 497: Research and Forecasting Lecture Notes 6 Specification: Choosing the Independent Variables Studenmund Chapter 6 Before we start,

More information

Lecture 10 Multiple Linear Regression

Lecture 10 Multiple Linear Regression Lecture 10 Multiple Linear Regression STAT 512 Spring 2011 Background Reading KNNL: 6.1-6.5 10-1 Topic Overview Multiple Linear Regression Model 10-2 Data for Multiple Regression Y i is the response variable

More information

- a value calculated or derived from the data.

- a value calculated or derived from the data. Descriptive statistics: Note: I'm assuming you know some basics. If you don't, please read chapter 1 on your own. It's pretty easy material, and it gives you a good background as to why we need statistics.

More information

AP Statistics. Chapter 6 Scatterplots, Association, and Correlation

AP Statistics. Chapter 6 Scatterplots, Association, and Correlation AP Statistics Chapter 6 Scatterplots, Association, and Correlation Objectives: Scatterplots Association Outliers Response Variable Explanatory Variable Correlation Correlation Coefficient Lurking Variables

More information

Measurements and Data Analysis

Measurements and Data Analysis Measurements and Data Analysis 1 Introduction The central point in experimental physical science is the measurement of physical quantities. Experience has shown that all measurements, no matter how carefully

More information

PHYSICS 15a, Fall 2006 SPEED OF SOUND LAB Due: Tuesday, November 14

PHYSICS 15a, Fall 2006 SPEED OF SOUND LAB Due: Tuesday, November 14 PHYSICS 15a, Fall 2006 SPEED OF SOUND LAB Due: Tuesday, November 14 GENERAL INFO The goal of this lab is to determine the speed of sound in air, by making measurements and taking into consideration the

More information

Fault Tolerant Computing CS 530DL

Fault Tolerant Computing CS 530DL Fault Tolerant Computing CS 530DL Additional Lecture Notes Modeling Yashwant K. Malaiya Colorado State University March 8, 2017 1 Quantitative models Derived from first principles: Arguments are actual

More information

Chapter 8. Linear Regression. Copyright 2010 Pearson Education, Inc.

Chapter 8. Linear Regression. Copyright 2010 Pearson Education, Inc. Chapter 8 Linear Regression Copyright 2010 Pearson Education, Inc. Fat Versus Protein: An Example The following is a scatterplot of total fat versus protein for 30 items on the Burger King menu: Copyright

More information

Psych 10 / Stats 60, Practice Problem Set 10 (Week 10 Material), Solutions

Psych 10 / Stats 60, Practice Problem Set 10 (Week 10 Material), Solutions Psych 10 / Stats 60, Practice Problem Set 10 (Week 10 Material), Solutions Part 1: Conceptual ideas about correlation and regression Tintle 10.1.1 The association would be negative (as distance increases,

More information

Chapter 7. Scatterplots, Association, and Correlation. Copyright 2010 Pearson Education, Inc.

Chapter 7. Scatterplots, Association, and Correlation. Copyright 2010 Pearson Education, Inc. Chapter 7 Scatterplots, Association, and Correlation Copyright 2010 Pearson Education, Inc. Looking at Scatterplots Scatterplots may be the most common and most effective display for data. In a scatterplot,

More information

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 y 1 2 3 4 5 6 7 x Data Analysis and Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasini/teaching.html Lecture 32 Suhasini Subba Rao Previous lecture We are interested in whether a dependent

More information

Keller: Stats for Mgmt & Econ, 7th Ed July 17, 2006

Keller: Stats for Mgmt & Econ, 7th Ed July 17, 2006 Chapter 17 Simple Linear Regression and Correlation 17.1 Regression Analysis Our problem objective is to analyze the relationship between interval variables; regression analysis is the first tool we will

More information

Chapter 4. Regression Models. Learning Objectives

Chapter 4. Regression Models. Learning Objectives Chapter 4 Regression Models To accompany Quantitative Analysis for Management, Eleventh Edition, by Render, Stair, and Hanna Power Point slides created by Brian Peterson Learning Objectives After completing

More information

32. SOLVING LINEAR EQUATIONS IN ONE VARIABLE

32. SOLVING LINEAR EQUATIONS IN ONE VARIABLE get the complete book: /getfulltextfullbook.htm 32. SOLVING LINEAR EQUATIONS IN ONE VARIABLE classifying families of sentences In mathematics, it is common to group together sentences of the same type

More information