NATCOR Regression Modelling for Time Series

Size: px
Start display at page:

Download "NATCOR Regression Modelling for Time Series"

Transcription

1 Universität Hamburg Institut für Wirtschaftsinformatik Prof. Dr. D.B. Preßmar Professor Robert Fildes NATCOR Regression Modelling for Time Series The material presented has been developed with the substantial help of Dr. Sven Crone

2 Forecasting Methods RECAP: Extrapolative vs. Explanatory (Causal) Methods Volume Volume Time Use only past historical data to recognise patterns and extrapolate them into the future. Promotion BOGOF Price Relationships between dependent and independent Variables are identified and assumed to hold in the future Use additional explanatory variables to predict the future. There is no a-priori superiority of either approach depends on problem & data

3 Using regression in time series forecasting research In market modelling Diffusion of new technologies Optimization of the marketing mix promotions One important issue: Decoupling Revenue management the optimization from the forecasting is suboptimal In retail, hotels, transport all models depend on demand Best models policies under certainty with sensitivity testing are often far from best under forecast uncertainty In operations Collaboration In forecasting and econometric research Aggregation, non-linearity, conditions where causal methods add value Analysis of behavioural and field experiments

4 Agenda Regression. Recap: Multiple Regression Concept of Causal Models Choosing the variables 2. Recap: Specifying Regression Models Estimating a multivariate Regression Model Validating a Regression Model 3. Extending the basic Regression for Time Series Modelling (deterministic) Seasonality Modelling Outliers & Level Shifts Including Dummy Variables for Promotions / Events Including Time Lags 4. The Model building Process

5 CAUSAL FORECASTING a simple regression model Inputs or Explanatory variables X, X 2 Output or Dependent variable, Y Y = α + β X + β X t t 2 2 t error t Past relationships in the data are identified and assumed to hold in the future Estimated model: Y = a + b X + b X t t 2 2 t error t

6 Deciding the Causal Variables Variable to be forecast Retail Petrol price How do you decide the variables? Prior Research Experience Experts Brainstorming

7 Preliminary Model Development Graphical Analysis and Preliminary Model Development Suppose we are interested in the level of gas prices as a function of various explanatory variables. Observe Gas Prices (=Y t ) over n time periods, t =, 2,, n Step : DDD (Draw the D- Diagram), a time plot of Y against time Step 2: produce a scatter plot of Y against each explanatory variable X j o For step 2, identify possible variables: E.g. Personal Disposable Income Unemployment S&P 500 Index Price of crude oil Data: Jan 996 Dec 200

8 A Case Study on the Price of Gasoline Suppose we are interested in predicting the price of (unleaded regular) gasoline (at the pump), given the price of crude oil at the refinery. We examine monthly data. The price of crude oil takes some time to have its effect on the pump price, so we lag the price of crude by one month. Define the variables: Y = Unleaded X = L_crude Q: Why else might we use a lagged value for the X variable? 8

9 Graphical Analysis and Preliminary Model Development Data: Jan 996 Dec 200

10 Regression Correlation Correlation does not imply causation!!!. The solution to fire fighting! The more firemen fighting a fire, the more damage there is going to be. Therefore firemen cause damage. 2. Reducing crime levels is simple... Since the 950s, both the atmospheric CO 2 level has increased sharply at the same time as global temperature. Hotter weather always leads time higher crime levels. Hence, reducing atmospheric CO 2 will lead to decreasing crime. But check out Bernard Shaw in Ord and Fildes

11 Graphical Analysis and Preliminary Model Development First row: Pearson correlation Second row: P-Value Measures the linear relationship Data: Jan 96-Dec 200

12 Agenda Regression. Recap: Multiple Regression Concept of Causal Models Choosing the variables 2. Recap: Specifying Regression Models Estimating a multivariate Regression Model Validating a Regression Model 3. Extending the basic Regression for Time Series Modelling (deterministic) Seasonality Modelling Outliers & Level Shifts Including Dummy Variables for Promotions / Events Including Time Lags 4. The Model building Process (Lecture 5)

13 The Multiple Regression Model Assume a linear relation between Y and X,,X p where: Y = β + β X + β X + + β X + ε p p β 0 = intercept (value of Y when all X j = 0) β j = expected effect of X j on Y, all other factors fixed ε = random error Expected value of Y given the {X j }: So Or ( X) = = β0 + β + β βp p EY Y X X X Y = [Expected value] + [Random error] Observed= Signal + Noise

14 The Method of Ordinary Least Squares (OLS) Define error = Observed Fitted e= Y EY ( X) i i i Estimate the intercept and slope coefficients by minimizing the sum of squared errors (SSE). That is, choose the coefficients { b to minimize: 0, b,, b K } n n 2 2 i ( i 0 i 2 2i K Ki) i= i= SSE = e = Y b b X b X b X

15 Least Squares Estimators Equivalent to Maximum Likelihood Unbiased On average the estimators are centred on the true parameter value Efficient E( β ˆ) = β There is no other linear estimator with a smaller variance var( ˆ) β Consistency var( β*) for any other estimator β * As the sample size, n, increases (to ) the estimator β ˆn β

16 Regression Multivariate Models Ordinary Least Squares Linear Regression (OLS) The maths = t t y y y y 2 2 ( ) p p t p t tp x x x x X x x + = = t t ε ε ε ε 2 0 ( ) p p β β β β + = y = β 0 + x β + x 2 β 2 + x 3 β 3 +. ε y 2 = β 0 + x 2 β + x 22 β 2 + x 23 β 3 +. ε 2 y t = β 0 + x t β + x t2 β 2 + x t3 β 3 +.ε t

17 Ordinary Least Squares Linear Regression (OLS) The maths y = Xβ + ε Minimize, a scalar ee Regression Multivariate Models ˆβ = ( ) X X X y TIP! Where is the constant? = p x 2 p if X β t p 0 x x tp is a constant X = transpose of matrix X, switch between rows and columns X - = inverse of matrix X, XX - = I

18 Testing Individual Coefficients Y = β + β X + β X + + β X + ε p p Test of a coefficient H 0 : slope coefficient for X i is zero [X i does not add value to model, given other variables in the model] H A : slope coefficient for X i is not zero [X i adds value, given other variables in the model] Test Statistic t = b i / SE(b i ) b i = slope coefficient SE(b i ) = standard error of b i

19 Testing the Model The estimated coefficients are random unbiased The decision rule for testing the individual model coefficients is: o Reject H 0 if P < α o where P is the observed significance level, and o α is the significance level used for testing, typically o The rule implies that we do not reject H 0 if P > α. o Rule applies for all testing including F test for model With a variable insignificant with Null hypothesis of no linear effect, i.e b =0, what do we then do? Y = β + β X + β X + + β X + ε p p

20 The Multiple Regression Model Example 8.: The Regression Model for Unleaded Gasoline prices The regression equation is Unleaded = L_ crude L_Unemp L_SP L_PDI Predictor Coef SE Coef T P Constant L_ crude L_ Unemp L_SP L_PDI Q: Interpretation do model make sense?

21 How can we compare different models? Let s look at fit. Date Unleaded Crude ResCrude ResPDI 5-Jan Feb Yˆ i = a + 5-Mar bx 5-Apr i 5-May e i = Y i 5-Jun Yˆ residual 5-Jul i = 5-Aug Sep Oct Nov Dec RMSE What are good measures of the fit of our model. the individual residuals tell us everything, but can we summarise their information?

22 Partition of the Sum of Squares Total Sum of Squares: How Good is the Fitted Line? SST = S = ( Y Y ) Sum of Squared Errors (Unexplained Variation): Sum of Squares accounted for by the regression equation: i= n The sums of squares are partitioned: SST = SSR + SSE Our aim: to build a model that explains the variation in Y, SST i= n i= YY i= n i= SSE = ( Y Yˆ ) SSR = ( Yˆ Y ) i= i i i 2 2 i 2

23 ERROR VARIANCE The residual variation is also used to calculate the error variance (and standard deviation) 2 ( ) 2 Y 2 i Yi s = σ = n p It measures the accuracy with which the model explains the data. The accuracy of a prediction using the model depends on it. MINITAB & SPSS carries out the calculation automatically calculating a confidence interval, I. ie. 90% of the time, actual lies within the Range. [ Υˆ Ι, Υˆ Ι] You need to select the 'confidence' probability within which you expect your the observation to lie. p p +

24 Explained MS F = = Un explained MS = n i= n i= ( Yˆ Y) i ( Y Yˆ ) i i 2 2 K ( n K ) F-test Explained SS Explained df Unexplained SS Unexplained df MS = Mean Square SS = Sum of Squares df = degree of freedom m = number of parameters (coefficients in equation) = K+ with K explanatory variables and a constant n = number of observations

25 SUMMARY STATISTICS The 'F' statistic and associated p value measures the overall explanatory power of the explanatory (independent) variables. The model should be chosen so that at least some elements of it are likely to have strong explanatory power. The F test is therefore not particularly important. The 't' tests measure the impact of individual variables. Methods of comparing alternative models are therefore needed. Commonly used * R 2 - a measure of the overall adequacy of the model * σ or s - the standard deviation of the error term - measures the uncertainty around the predictions i.e. the model fit

26 TESTING THE ADEQUACY OF A MODEL Is this a good model Criteria include * Interpretation of the estimated model * Strength of relationship between Xs and Y * Overall adequacy of Model * Validity of Assumptions * Errors in Predictions No simple rules

27 Linear model is: STRENGTH OF THE RELATIONSHIP Does the independent variable (x) affect the dependent variable (y) Y = β0 + β X + ( error) A unit change in X produces an (linear additive) impact of β in Y. Interpreting the Model: should make sense, i.e correct sign, low standard error Elasticity The proportionate change in Y relative to the proportionate change in X is measured as: Y Y X X

28 Assumption of normality Checking the Assumptions Plot a histogram Plot a P-P plot Statistical tests:. Kolmogorov-Smirnov 2. Chi-square

29 Assumption of constant variance Checking the Assumptions Most of the residuals contained in a fixed corridor constant variance Statistical tests:. Split residuals in two sub-samples 2. Hypothesis testing: a. H0: σ = σ 2 b. Hα: σ σ 2 Also check for:. Systematic patterns 2. Trend 3. Seasonality None should be present!

30 Unpredictability of residuals Checking the Assumptions Plot residuals against:. Predicted values (Ŷ) 2. All explanatory variables (X) 3. Time There should be no patterns Linear Nonlinear patterns What is this point? Outlier? Statistical tests:. Correlation Looks more or less random

31 Detecting serial correlation on residuals Residual Autocorrelation Function (ACF) Checking the Assumptions Sample Autocorrelation Strong serial correlation Model invalid! χ Lag Statistical tests: Ljung- Box test Residuals should be uncorrelated There should be no significant lags Can be used to check for season/trend (there should be none!) 2 LB = r where r is the k th order autocorrelation 2 k rk = corr(e t,e t k) k coefficient With degrees of freedom

32 Checking the Assumptions

33 Analysis of Residuals for Gas Price Data Residuals appear to be approximately normal (Probability Plot and Histogram), but there are some outliers o Check the original data to identify the outliers and to determine possible explanations Model does not capture time dependence Zig-zag pattern in Residuals vs. Order Errors are not homoscedastic See in Residuals vs. Fitted Value Increased volatility in the later part of the series See in Residuals vs. Order Some evidence of seasonal pattern Look for peaks every 2 months in Residuals vs. Order

34 Analysis of Residuals for Gas Price Data Graph residuals The Autocorrelation Function (ACF) allows us to check for dependence at a range of possible lags. Ljung-Box stat rk = corr(e t,e t k) 2 LB = r k NB: The Durbin-Watson test examines only first order autocorrelation: invalid with lags Values centred at between 0 and 4 2

35 Forecasting with Multiple Regression Example: One-step-ahead forecasts for gas prices We use the four-variable model for gas prices as an illustration. Crude, Unemployment, S&P, PDI Estimate the model to December 2008 One-step-ahead forecasts were generated from equation, the forecast for January 2009 uses the December 2008 values of the explanatory variables. The regression model is Unleaded = L_crude L_Unemp L_S&P L_PDI The values for December 2008 are as follows: L_crude: 4.2, L_Unemp: 7.3, L_S&P: 888.6, L_PDI: The forecast for January 2009 is then F = * * * * =.889.

36 Forecasting with Multiple Regression Given values of the inputs The point forecast is given by: F = b + bx + bx + + b X The Prediction Interval is given by: F ± t ( n K ) * SE( Y F ). where t denotes the appropriate percentage point from t- tables Assumes Xs are known Example { X, X,, X } n+, n+,2 n+, K n+ 0 n+, 2 n+,2 K n+, K n+ α /2 n+ n+ K = 4 and n = 55, so DF = 50. The SE for the point forecast is found to be (use R, not SPSS). Using t (50) =.976, we find that the 95 percent prediction interval is.8975 ± (.976)*(0.695) =.8975 ± = [.563, 2.233].

37 Issues in Multivariate Regression Establishing the variables to include Testing for insignificant coefficients (& variables) Estimation, summary statistics and testing (same as in the bivariate single explanatory variable case) Simplifying the model Removing inappropriate variables Avoiding spurious relationships Comparing different models

38 Summary - A Multivariate Example The process of model building Establish a default model Graph the data Any features that need explaining? Estimate models Check assumptions: Diagnostic checks Compare models Interpretation Summary statistics (is the model fit for purpose?) Revise model

39 Gas prices model building I. Graph focus variable retail price 2. Identify possible explanatory variables L_Personal Disposable Income L_Unemployment L_S&P 500 Index L_Price of crude oil Seasonal dummies And? 3. Estimated model is: + seasonal dummies Model Coefficients a U sta da d ed Coefficients B Std. Error t Sig. L_Crude L_PDI 4.994E L_Unemp L_Prod L_SP 4.078E

40 Gas prices model building II Summary Stats Diagnostics Outliers? Residuals (approx normal), homoscedastic Serial (auto) correlation? Tests, graphs Add lags Relationship with Xs? Stability Split? Forecasting performance Compared to other regression specifications + univariate benchmark

41 Take-Aways Start the modeling process by careful consideration of available theory and previous empirical studies Carry out a full preliminary analysis of the data to look for associations and for unusual observations Test both the overall model and the individual components Examine the validity of the underlying assumptions Make sure that the model is sensible with respect to the signs and magnitudes of the slope coefficients Use a hold-out sample to evaluate forecasting performance.

42 Questions? Basic reference: Ord, K and Fildes, R (203) Principles of Business Forecasting, South- Western, Cengage

Basic Business Statistics 6 th Edition

Basic Business Statistics 6 th Edition Basic Business Statistics 6 th Edition Chapter 12 Simple Linear Regression Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value of a dependent variable based

More information

Statistics for Managers using Microsoft Excel 6 th Edition

Statistics for Managers using Microsoft Excel 6 th Edition Statistics for Managers using Microsoft Excel 6 th Edition Chapter 13 Simple Linear Regression 13-1 Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value of

More information

Ch 2: Simple Linear Regression

Ch 2: Simple Linear Regression Ch 2: Simple Linear Regression 1. Simple Linear Regression Model A simple regression model with a single regressor x is y = β 0 + β 1 x + ɛ, where we assume that the error ɛ is independent random component

More information

Business Statistics. Chapter 14 Introduction to Linear Regression and Correlation Analysis QMIS 220. Dr. Mohammad Zainal

Business Statistics. Chapter 14 Introduction to Linear Regression and Correlation Analysis QMIS 220. Dr. Mohammad Zainal Department of Quantitative Methods & Information Systems Business Statistics Chapter 14 Introduction to Linear Regression and Correlation Analysis QMIS 220 Dr. Mohammad Zainal Chapter Goals After completing

More information

Business Statistics. Lecture 10: Correlation and Linear Regression

Business Statistics. Lecture 10: Correlation and Linear Regression Business Statistics Lecture 10: Correlation and Linear Regression Scatterplot A scatterplot shows the relationship between two quantitative variables measured on the same individuals. It displays the Form

More information

STAT 212 Business Statistics II 1

STAT 212 Business Statistics II 1 STAT 1 Business Statistics II 1 KING FAHD UNIVERSITY OF PETROLEUM & MINERALS DEPARTMENT OF MATHEMATICAL SCIENCES DHAHRAN, SAUDI ARABIA STAT 1: BUSINESS STATISTICS II Semester 091 Final Exam Thursday Feb

More information

TIME SERIES ANALYSIS AND FORECASTING USING THE STATISTICAL MODEL ARIMA

TIME SERIES ANALYSIS AND FORECASTING USING THE STATISTICAL MODEL ARIMA CHAPTER 6 TIME SERIES ANALYSIS AND FORECASTING USING THE STATISTICAL MODEL ARIMA 6.1. Introduction A time series is a sequence of observations ordered in time. A basic assumption in the time series analysis

More information

Correlation Analysis

Correlation Analysis Simple Regression Correlation Analysis Correlation analysis is used to measure strength of the association (linear relationship) between two variables Correlation is only concerned with strength of the

More information

Multiple Regression. Peerapat Wongchaiwat, Ph.D.

Multiple Regression. Peerapat Wongchaiwat, Ph.D. Peerapat Wongchaiwat, Ph.D. wongchaiwat@hotmail.com The Multiple Regression Model Examine the linear relationship between 1 dependent (Y) & 2 or more independent variables (X i ) Multiple Regression Model

More information

Suan Sunandha Rajabhat University

Suan Sunandha Rajabhat University Forecasting Exchange Rate between Thai Baht and the US Dollar Using Time Series Analysis Kunya Bowornchockchai Suan Sunandha Rajabhat University INTRODUCTION The objective of this research is to forecast

More information

LINEAR REGRESSION ANALYSIS. MODULE XVI Lecture Exercises

LINEAR REGRESSION ANALYSIS. MODULE XVI Lecture Exercises LINEAR REGRESSION ANALYSIS MODULE XVI Lecture - 44 Exercises Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Exercise 1 The following data has been obtained on

More information

Inferences for Regression

Inferences for Regression Inferences for Regression An Example: Body Fat and Waist Size Looking at the relationship between % body fat and waist size (in inches). Here is a scatterplot of our data set: Remembering Regression In

More information

Ref.: Spring SOS3003 Applied data analysis for social science Lecture note

Ref.:   Spring SOS3003 Applied data analysis for social science Lecture note SOS3003 Applied data analysis for social science Lecture note 05-2010 Erling Berge Department of sociology and political science NTNU Spring 2010 Erling Berge 2010 1 Literature Regression criticism I Hamilton

More information

ECON The Simple Regression Model

ECON The Simple Regression Model ECON 351 - The Simple Regression Model Maggie Jones 1 / 41 The Simple Regression Model Our starting point will be the simple regression model where we look at the relationship between two variables In

More information

Estimating σ 2. We can do simple prediction of Y and estimation of the mean of Y at any value of X.

Estimating σ 2. We can do simple prediction of Y and estimation of the mean of Y at any value of X. Estimating σ 2 We can do simple prediction of Y and estimation of the mean of Y at any value of X. To perform inferences about our regression line, we must estimate σ 2, the variance of the error term.

More information

Mathematics for Economics MA course

Mathematics for Economics MA course Mathematics for Economics MA course Simple Linear Regression Dr. Seetha Bandara Simple Regression Simple linear regression is a statistical method that allows us to summarize and study relationships between

More information

(ii) Scan your answer sheets INTO ONE FILE only, and submit it in the drop-box.

(ii) Scan your answer sheets INTO ONE FILE only, and submit it in the drop-box. FINAL EXAM ** Two different ways to submit your answer sheet (i) Use MS-Word and place it in a drop-box. (ii) Scan your answer sheets INTO ONE FILE only, and submit it in the drop-box. Deadline: December

More information

Lecture Prepared By: Mohammad Kamrul Arefin Lecturer, School of Business, North South University

Lecture Prepared By: Mohammad Kamrul Arefin Lecturer, School of Business, North South University Lecture 15 20 Prepared By: Mohammad Kamrul Arefin Lecturer, School of Business, North South University Modeling for Time Series Forecasting Forecasting is a necessary input to planning, whether in business,

More information

We like to capture and represent the relationship between a set of possible causes and their response, by using a statistical predictive model.

We like to capture and represent the relationship between a set of possible causes and their response, by using a statistical predictive model. Statistical Methods in Business Lecture 5. Linear Regression We like to capture and represent the relationship between a set of possible causes and their response, by using a statistical predictive model.

More information

Figure 1. Time Series Plot of arrivals from Western Europe

Figure 1. Time Series Plot of arrivals from Western Europe FORECASTING TOURIST ARRIVALS TO SRI LANKA FROM WESTERN EUROPE K. M. U. B. Konarasinghe 1 * 1 Institute of Mathematics & Management, Nugegoda, Sri Lanka INTRODUCTION Sri Lanka was re-emerging after defeating

More information

Lecture 11: Simple Linear Regression

Lecture 11: Simple Linear Regression Lecture 11: Simple Linear Regression Readings: Sections 3.1-3.3, 11.1-11.3 Apr 17, 2009 In linear regression, we examine the association between two quantitative variables. Number of beers that you drink

More information

The simple linear regression model discussed in Chapter 13 was written as

The simple linear regression model discussed in Chapter 13 was written as 1519T_c14 03/27/2006 07:28 AM Page 614 Chapter Jose Luis Pelaez Inc/Blend Images/Getty Images, Inc./Getty Images, Inc. 14 Multiple Regression 14.1 Multiple Regression Analysis 14.2 Assumptions of the Multiple

More information

FinQuiz Notes

FinQuiz Notes Reading 10 Multiple Regression and Issues in Regression Analysis 2. MULTIPLE LINEAR REGRESSION Multiple linear regression is a method used to model the linear relationship between a dependent variable

More information

Inference for the Regression Coefficient

Inference for the Regression Coefficient Inference for the Regression Coefficient Recall, b 0 and b 1 are the estimates of the slope β 1 and intercept β 0 of population regression line. We can shows that b 0 and b 1 are the unbiased estimates

More information

Regression Analysis. BUS 735: Business Decision Making and Research. Learn how to detect relationships between ordinal and categorical variables.

Regression Analysis. BUS 735: Business Decision Making and Research. Learn how to detect relationships between ordinal and categorical variables. Regression Analysis BUS 735: Business Decision Making and Research 1 Goals of this section Specific goals Learn how to detect relationships between ordinal and categorical variables. Learn how to estimate

More information

The Multiple Regression Model

The Multiple Regression Model Multiple Regression The Multiple Regression Model Idea: Examine the linear relationship between 1 dependent (Y) & or more independent variables (X i ) Multiple Regression Model with k Independent Variables:

More information

Review of Statistics 101

Review of Statistics 101 Review of Statistics 101 We review some important themes from the course 1. Introduction Statistics- Set of methods for collecting/analyzing data (the art and science of learning from data). Provides methods

More information

Chapte The McGraw-Hill Companies, Inc. All rights reserved.

Chapte The McGraw-Hill Companies, Inc. All rights reserved. 12er12 Chapte Bivariate i Regression (Part 1) Bivariate Regression Visual Displays Begin the analysis of bivariate data (i.e., two variables) with a scatter plot. A scatter plot - displays each observed

More information

Section 3: Simple Linear Regression

Section 3: Simple Linear Regression Section 3: Simple Linear Regression Carlos M. Carvalho The University of Texas at Austin McCombs School of Business http://faculty.mccombs.utexas.edu/carlos.carvalho/teaching/ 1 Regression: General Introduction

More information

Chapter 16. Simple Linear Regression and dcorrelation

Chapter 16. Simple Linear Regression and dcorrelation Chapter 16 Simple Linear Regression and dcorrelation 16.1 Regression Analysis Our problem objective is to analyze the relationship between interval variables; regression analysis is the first tool we will

More information

The Simple Regression Model. Part II. The Simple Regression Model

The Simple Regression Model. Part II. The Simple Regression Model Part II The Simple Regression Model As of Sep 22, 2015 Definition 1 The Simple Regression Model Definition Estimation of the model, OLS OLS Statistics Algebraic properties Goodness-of-Fit, the R-square

More information

Lecture 9: Linear Regression

Lecture 9: Linear Regression Lecture 9: Linear Regression Goals Develop basic concepts of linear regression from a probabilistic framework Estimating parameters and hypothesis testing with linear models Linear regression in R Regression

More information

Chapter 14 Student Lecture Notes Department of Quantitative Methods & Information Systems. Business Statistics. Chapter 14 Multiple Regression

Chapter 14 Student Lecture Notes Department of Quantitative Methods & Information Systems. Business Statistics. Chapter 14 Multiple Regression Chapter 14 Student Lecture Notes 14-1 Department of Quantitative Methods & Information Systems Business Statistics Chapter 14 Multiple Regression QMIS 0 Dr. Mohammad Zainal Chapter Goals After completing

More information

Lecture 10 Multiple Linear Regression

Lecture 10 Multiple Linear Regression Lecture 10 Multiple Linear Regression STAT 512 Spring 2011 Background Reading KNNL: 6.1-6.5 10-1 Topic Overview Multiple Linear Regression Model 10-2 Data for Multiple Regression Y i is the response variable

More information

Any of 27 linear and nonlinear models may be fit. The output parallels that of the Simple Regression procedure.

Any of 27 linear and nonlinear models may be fit. The output parallels that of the Simple Regression procedure. STATGRAPHICS Rev. 9/13/213 Calibration Models Summary... 1 Data Input... 3 Analysis Summary... 5 Analysis Options... 7 Plot of Fitted Model... 9 Predicted Values... 1 Confidence Intervals... 11 Observed

More information

Simple and Multiple Linear Regression

Simple and Multiple Linear Regression Sta. 113 Chapter 12 and 13 of Devore March 12, 2010 Table of contents 1 Simple Linear Regression 2 Model Simple Linear Regression A simple linear regression model is given by Y = β 0 + β 1 x + ɛ where

More information

Homework 2: Simple Linear Regression

Homework 2: Simple Linear Regression STAT 4385 Applied Regression Analysis Homework : Simple Linear Regression (Simple Linear Regression) Thirty (n = 30) College graduates who have recently entered the job market. For each student, the CGPA

More information

Econometrics Summary Algebraic and Statistical Preliminaries

Econometrics Summary Algebraic and Statistical Preliminaries Econometrics Summary Algebraic and Statistical Preliminaries Elasticity: The point elasticity of Y with respect to L is given by α = ( Y/ L)/(Y/L). The arc elasticity is given by ( Y/ L)/(Y/L), when L

More information

Six Sigma Black Belt Study Guides

Six Sigma Black Belt Study Guides Six Sigma Black Belt Study Guides 1 www.pmtutor.org Powered by POeT Solvers Limited. Analyze Correlation and Regression Analysis 2 www.pmtutor.org Powered by POeT Solvers Limited. Variables and relationships

More information

Multiple Regression Analysis

Multiple Regression Analysis 1 OUTLINE Basic Concept: Multiple Regression MULTICOLLINEARITY AUTOCORRELATION HETEROSCEDASTICITY REASEARCH IN FINANCE 2 BASIC CONCEPTS: Multiple Regression Y i = β 1 + β 2 X 1i + β 3 X 2i + β 4 X 3i +

More information

LECTURE 6. Introduction to Econometrics. Hypothesis testing & Goodness of fit

LECTURE 6. Introduction to Econometrics. Hypothesis testing & Goodness of fit LECTURE 6 Introduction to Econometrics Hypothesis testing & Goodness of fit October 25, 2016 1 / 23 ON TODAY S LECTURE We will explain how multiple hypotheses are tested in a regression model We will define

More information

Ch 13 & 14 - Regression Analysis

Ch 13 & 14 - Regression Analysis Ch 3 & 4 - Regression Analysis Simple Regression Model I. Multiple Choice:. A simple regression is a regression model that contains a. only one independent variable b. only one dependent variable c. more

More information

Lecture Prepared By: Mohammad Kamrul Arefin Lecturer, School of Business, North South University

Lecture Prepared By: Mohammad Kamrul Arefin Lecturer, School of Business, North South University Lecture 15 20 Prepared By: Mohammad Kamrul Arefin Lecturer, School of Business, North South University Modeling for Time Series Forecasting Forecasting is a necessary input to planning, whether in business,

More information

Time Series Analysis of United States of America Crude Oil and Petroleum Products Importations from Saudi Arabia

Time Series Analysis of United States of America Crude Oil and Petroleum Products Importations from Saudi Arabia International Journal of Applied Science and Technology Vol. 5, No. 5; October 2015 Time Series Analysis of United States of America Crude Oil and Petroleum Products Importations from Saudi Arabia Olayan

More information

ECON3150/4150 Spring 2015

ECON3150/4150 Spring 2015 ECON3150/4150 Spring 2015 Lecture 3&4 - The linear regression model Siv-Elisabeth Skjelbred University of Oslo January 29, 2015 1 / 67 Chapter 4 in S&W Section 17.1 in S&W (extended OLS assumptions) 2

More information

Multiple Regression Analysis

Multiple Regression Analysis 1 OUTLINE Analysis of Data and Model Hypothesis Testing Dummy Variables Research in Finance 2 ANALYSIS: Types of Data Time Series data Cross-Sectional data Panel data Trend Seasonal Variation Cyclical

More information

: The model hypothesizes a relationship between the variables. The simplest probabilistic model: or.

: The model hypothesizes a relationship between the variables. The simplest probabilistic model: or. Chapter Simple Linear Regression : comparing means across groups : presenting relationships among numeric variables. Probabilistic Model : The model hypothesizes an relationship between the variables.

More information

Bivariate Regression Analysis. The most useful means of discerning causality and significance of variables

Bivariate Regression Analysis. The most useful means of discerning causality and significance of variables Bivariate Regression Analysis The most useful means of discerning causality and significance of variables Purpose of Regression Analysis Test causal hypotheses Make predictions from samples of data Derive

More information

STAT Chapter 11: Regression

STAT Chapter 11: Regression STAT 515 -- Chapter 11: Regression Mostly we have studied the behavior of a single random variable. Often, however, we gather data on two random variables. We wish to determine: Is there a relationship

More information

Correlation and Regression

Correlation and Regression Correlation and Regression October 25, 2017 STAT 151 Class 9 Slide 1 Outline of Topics 1 Associations 2 Scatter plot 3 Correlation 4 Regression 5 Testing and estimation 6 Goodness-of-fit STAT 151 Class

More information

STA 108 Applied Linear Models: Regression Analysis Spring Solution for Homework #6

STA 108 Applied Linear Models: Regression Analysis Spring Solution for Homework #6 STA 8 Applied Linear Models: Regression Analysis Spring 011 Solution for Homework #6 6. a) = 11 1 31 41 51 1 3 4 5 11 1 31 41 51 β = β1 β β 3 b) = 1 1 1 1 1 11 1 31 41 51 1 3 4 5 β = β 0 β1 β 6.15 a) Stem-and-leaf

More information

Chapter Goals. To understand the methods for displaying and describing relationship among variables. Formulate Theories.

Chapter Goals. To understand the methods for displaying and describing relationship among variables. Formulate Theories. Chapter Goals To understand the methods for displaying and describing relationship among variables. Formulate Theories Interpret Results/Make Decisions Collect Data Summarize Results Chapter 7: Is There

More information

Lab: Box-Jenkins Methodology - US Wholesale Price Indicator

Lab: Box-Jenkins Methodology - US Wholesale Price Indicator Lab: Box-Jenkins Methodology - US Wholesale Price Indicator In this lab we explore the Box-Jenkins methodology by applying it to a time-series data set comprising quarterly observations of the US Wholesale

More information

5.1 Model Specification and Data 5.2 Estimating the Parameters of the Multiple Regression Model 5.3 Sampling Properties of the Least Squares

5.1 Model Specification and Data 5.2 Estimating the Parameters of the Multiple Regression Model 5.3 Sampling Properties of the Least Squares 5.1 Model Specification and Data 5. Estimating the Parameters of the Multiple Regression Model 5.3 Sampling Properties of the Least Squares Estimator 5.4 Interval Estimation 5.5 Hypothesis Testing for

More information

Diagnostics of Linear Regression

Diagnostics of Linear Regression Diagnostics of Linear Regression Junhui Qian October 7, 14 The Objectives After estimating a model, we should always perform diagnostics on the model. In particular, we should check whether the assumptions

More information

Ch 3: Multiple Linear Regression

Ch 3: Multiple Linear Regression Ch 3: Multiple Linear Regression 1. Multiple Linear Regression Model Multiple regression model has more than one regressor. For example, we have one response variable and two regressor variables: 1. delivery

More information

Unit 10: Simple Linear Regression and Correlation

Unit 10: Simple Linear Regression and Correlation Unit 10: Simple Linear Regression and Correlation Statistics 571: Statistical Methods Ramón V. León 6/28/2004 Unit 10 - Stat 571 - Ramón V. León 1 Introductory Remarks Regression analysis is a method for

More information

28. SIMPLE LINEAR REGRESSION III

28. SIMPLE LINEAR REGRESSION III 28. SIMPLE LINEAR REGRESSION III Fitted Values and Residuals To each observed x i, there corresponds a y-value on the fitted line, y = βˆ + βˆ x. The are called fitted values. ŷ i They are the values of

More information

Regression Analysis II

Regression Analysis II Regression Analysis II Measures of Goodness of fit Two measures of Goodness of fit Measure of the absolute fit of the sample points to the sample regression line Standard error of the estimate An index

More information

2.4.3 Estimatingσ Coefficient of Determination 2.4. ASSESSING THE MODEL 23

2.4.3 Estimatingσ Coefficient of Determination 2.4. ASSESSING THE MODEL 23 2.4. ASSESSING THE MODEL 23 2.4.3 Estimatingσ 2 Note that the sums of squares are functions of the conditional random variables Y i = (Y X = x i ). Hence, the sums of squares are random variables as well.

More information

4.1 Least Squares Prediction 4.2 Measuring Goodness-of-Fit. 4.3 Modeling Issues. 4.4 Log-Linear Models

4.1 Least Squares Prediction 4.2 Measuring Goodness-of-Fit. 4.3 Modeling Issues. 4.4 Log-Linear Models 4.1 Least Squares Prediction 4. Measuring Goodness-of-Fit 4.3 Modeling Issues 4.4 Log-Linear Models y = β + β x + e 0 1 0 0 ( ) E y where e 0 is a random error. We assume that and E( e 0 ) = 0 var ( e

More information

Chapter 16. Simple Linear Regression and Correlation

Chapter 16. Simple Linear Regression and Correlation Chapter 16 Simple Linear Regression and Correlation 16.1 Regression Analysis Our problem objective is to analyze the relationship between interval variables; regression analysis is the first tool we will

More information

Homoskedasticity. Var (u X) = σ 2. (23)

Homoskedasticity. Var (u X) = σ 2. (23) Homoskedasticity How big is the difference between the OLS estimator and the true parameter? To answer this question, we make an additional assumption called homoskedasticity: Var (u X) = σ 2. (23) This

More information

TESTING FOR CO-INTEGRATION

TESTING FOR CO-INTEGRATION Bo Sjö 2010-12-05 TESTING FOR CO-INTEGRATION To be used in combination with Sjö (2008) Testing for Unit Roots and Cointegration A Guide. Instructions: Use the Johansen method to test for Purchasing Power

More information

The multiple regression model; Indicator variables as regressors

The multiple regression model; Indicator variables as regressors The multiple regression model; Indicator variables as regressors Ragnar Nymoen University of Oslo 28 February 2013 1 / 21 This lecture (#12): Based on the econometric model specification from Lecture 9

More information

Univariate ARIMA Models

Univariate ARIMA Models Univariate ARIMA Models ARIMA Model Building Steps: Identification: Using graphs, statistics, ACFs and PACFs, transformations, etc. to achieve stationary and tentatively identify patterns and model components.

More information

Review of Statistics

Review of Statistics Review of Statistics Topics Descriptive Statistics Mean, Variance Probability Union event, joint event Random Variables Discrete and Continuous Distributions, Moments Two Random Variables Covariance and

More information

Simple Linear Regression

Simple Linear Regression Simple Linear Regression ST 430/514 Recall: A regression model describes how a dependent variable (or response) Y is affected, on average, by one or more independent variables (or factors, or covariates)

More information

Chapter 13 Student Lecture Notes Department of Quantitative Methods & Information Systems. Business Statistics

Chapter 13 Student Lecture Notes Department of Quantitative Methods & Information Systems. Business Statistics Chapter 13 Student Lecture Notes 13-1 Department of Quantitative Methods & Information Sstems Business Statistics Chapter 14 Introduction to Linear Regression and Correlation Analsis QMIS 0 Dr. Mohammad

More information

MODELLING TIME SERIES WITH CONDITIONAL HETEROSCEDASTICITY

MODELLING TIME SERIES WITH CONDITIONAL HETEROSCEDASTICITY MODELLING TIME SERIES WITH CONDITIONAL HETEROSCEDASTICITY The simple ARCH Model Eva Rubliková Ekonomická univerzita Bratislava Manuela Magalhães Hill Department of Quantitative Methods, INSTITUTO SUPERIOR

More information

REED TUTORIALS (Pty) LTD ECS3706 EXAM PACK

REED TUTORIALS (Pty) LTD ECS3706 EXAM PACK REED TUTORIALS (Pty) LTD ECS3706 EXAM PACK 1 ECONOMETRICS STUDY PACK MAY/JUNE 2016 Question 1 (a) (i) Describing economic reality (ii) Testing hypothesis about economic theory (iii) Forecasting future

More information

Business Statistics. Lecture 9: Simple Regression

Business Statistics. Lecture 9: Simple Regression Business Statistics Lecture 9: Simple Regression 1 On to Model Building! Up to now, class was about descriptive and inferential statistics Numerical and graphical summaries of data Confidence intervals

More information

Inference for Regression

Inference for Regression Inference for Regression Section 9.4 Cathy Poliak, Ph.D. cathy@math.uh.edu Office in Fleming 11c Department of Mathematics University of Houston Lecture 13b - 3339 Cathy Poliak, Ph.D. cathy@math.uh.edu

More information

y response variable x 1, x 2,, x k -- a set of explanatory variables

y response variable x 1, x 2,, x k -- a set of explanatory variables 11. Multiple Regression and Correlation y response variable x 1, x 2,, x k -- a set of explanatory variables In this chapter, all variables are assumed to be quantitative. Chapters 12-14 show how to incorporate

More information

Finding Relationships Among Variables

Finding Relationships Among Variables Finding Relationships Among Variables BUS 230: Business and Economic Research and Communication 1 Goals Specific goals: Re-familiarize ourselves with basic statistics ideas: sampling distributions, hypothesis

More information

Chapter 3 Multiple Regression Complete Example

Chapter 3 Multiple Regression Complete Example Department of Quantitative Methods & Information Systems ECON 504 Chapter 3 Multiple Regression Complete Example Spring 2013 Dr. Mohammad Zainal Review Goals After completing this lecture, you should be

More information

LI EAR REGRESSIO A D CORRELATIO

LI EAR REGRESSIO A D CORRELATIO CHAPTER 6 LI EAR REGRESSIO A D CORRELATIO Page Contents 6.1 Introduction 10 6. Curve Fitting 10 6.3 Fitting a Simple Linear Regression Line 103 6.4 Linear Correlation Analysis 107 6.5 Spearman s Rank Correlation

More information

Basic Business Statistics, 10/e

Basic Business Statistics, 10/e Chapter 4 4- Basic Business Statistics th Edition Chapter 4 Introduction to Multiple Regression Basic Business Statistics, e 9 Prentice-Hall, Inc. Chap 4- Learning Objectives In this chapter, you learn:

More information

Applied Econometrics. Professor Bernard Fingleton

Applied Econometrics. Professor Bernard Fingleton Applied Econometrics Professor Bernard Fingleton Regression A quick summary of some key issues Some key issues Text book JH Stock & MW Watson Introduction to Econometrics 2nd Edition Software Gretl Gretl.sourceforge.net

More information

Regression: Main Ideas Setting: Quantitative outcome with a quantitative explanatory variable. Example, cont.

Regression: Main Ideas Setting: Quantitative outcome with a quantitative explanatory variable. Example, cont. TCELL 9/4/205 36-309/749 Experimental Design for Behavioral and Social Sciences Simple Regression Example Male black wheatear birds carry stones to the nest as a form of sexual display. Soler et al. wanted

More information

SIMPLE REGRESSION ANALYSIS. Business Statistics

SIMPLE REGRESSION ANALYSIS. Business Statistics SIMPLE REGRESSION ANALYSIS Business Statistics CONTENTS Ordinary least squares (recap for some) Statistical formulation of the regression model Assessing the regression model Testing the regression coefficients

More information

Formal Statement of Simple Linear Regression Model

Formal Statement of Simple Linear Regression Model Formal Statement of Simple Linear Regression Model Y i = β 0 + β 1 X i + ɛ i Y i value of the response variable in the i th trial β 0 and β 1 are parameters X i is a known constant, the value of the predictor

More information

Simple Linear Regression

Simple Linear Regression 9-1 l Chapter 9 l Simple Linear Regression 9.1 Simple Linear Regression 9.2 Scatter Diagram 9.3 Graphical Method for Determining Regression 9.4 Least Square Method 9.5 Correlation Coefficient and Coefficient

More information

Econometrics I Lecture 3: The Simple Linear Regression Model

Econometrics I Lecture 3: The Simple Linear Regression Model Econometrics I Lecture 3: The Simple Linear Regression Model Mohammad Vesal Graduate School of Management and Economics Sharif University of Technology 44716 Fall 1397 1 / 32 Outline Introduction Estimating

More information

Univariate analysis. Simple and Multiple Regression. Univariate analysis. Simple Regression How best to summarise the data?

Univariate analysis. Simple and Multiple Regression. Univariate analysis. Simple Regression How best to summarise the data? Univariate analysis Example - linear regression equation: y = ax + c Least squares criteria ( yobs ycalc ) = yobs ( ax + c) = minimum Simple and + = xa xc xy xa + nc = y Solve for a and c Univariate analysis

More information

THE ROYAL STATISTICAL SOCIETY 2008 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE (MODULAR FORMAT) MODULE 4 LINEAR MODELS

THE ROYAL STATISTICAL SOCIETY 2008 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE (MODULAR FORMAT) MODULE 4 LINEAR MODELS THE ROYAL STATISTICAL SOCIETY 008 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE (MODULAR FORMAT) MODULE 4 LINEAR MODELS The Society provides these solutions to assist candidates preparing for the examinations

More information

Midterm 2 - Solutions

Midterm 2 - Solutions Ecn 102 - Analysis of Economic Data University of California - Davis February 24, 2010 Instructor: John Parman Midterm 2 - Solutions You have until 10:20am to complete this exam. Please remember to put

More information

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages: Glossary The ISI glossary of statistical terms provides definitions in a number of different languages: http://isi.cbs.nl/glossary/index.htm Adjusted r 2 Adjusted R squared measures the proportion of the

More information

Chapter Learning Objectives. Regression Analysis. Correlation. Simple Linear Regression. Chapter 12. Simple Linear Regression

Chapter Learning Objectives. Regression Analysis. Correlation. Simple Linear Regression. Chapter 12. Simple Linear Regression Chapter 12 12-1 North Seattle Community College BUS21 Business Statistics Chapter 12 Learning Objectives In this chapter, you learn:! How to use regression analysis to predict the value of a dependent

More information

Multiple Regression Analysis. Basic Estimation Techniques. Multiple Regression Analysis. Multiple Regression Analysis

Multiple Regression Analysis. Basic Estimation Techniques. Multiple Regression Analysis. Multiple Regression Analysis Multiple Regression Analysis Basic Estimation Techniques Herbert Stocker herbert.stocker@uibk.ac.at University of Innsbruck & IIS, University of Ramkhamhaeng Regression Analysis: Statistical procedure

More information

TMA4255 Applied Statistics V2016 (5)

TMA4255 Applied Statistics V2016 (5) TMA4255 Applied Statistics V2016 (5) Part 2: Regression Simple linear regression [11.1-11.4] Sum of squares [11.5] Anna Marie Holand To be lectured: January 26, 2016 wiki.math.ntnu.no/tma4255/2016v/start

More information

Regression Analysis. BUS 735: Business Decision Making and Research

Regression Analysis. BUS 735: Business Decision Making and Research Regression Analysis BUS 735: Business Decision Making and Research 1 Goals and Agenda Goals of this section Specific goals Learn how to detect relationships between ordinal and categorical variables. Learn

More information

Dummy Variables. Susan Thomas IGIDR, Bombay. 24 November, 2008

Dummy Variables. Susan Thomas IGIDR, Bombay. 24 November, 2008 IGIDR, Bombay 24 November, 2008 The problem of structural change Model: Y i = β 0 + β 1 X 1i + ɛ i Structural change, type 1: change in parameters in time. Y i = α 1 + β 1 X i + e 1i for period 1 Y i =

More information

REVIEW 8/2/2017 陈芳华东师大英语系

REVIEW 8/2/2017 陈芳华东师大英语系 REVIEW Hypothesis testing starts with a null hypothesis and a null distribution. We compare what we have to the null distribution, if the result is too extreme to belong to the null distribution (p

More information

Multivariate Regression Model Results

Multivariate Regression Model Results Updated: August, 0 Page of Multivariate Regression Model Results 4 5 6 7 8 This exhibit provides the results of the load model forecast discussed in Schedule. Included is the forecast of short term system

More information

The Simple Linear Regression Model

The Simple Linear Regression Model The Simple Linear Regression Model Lesson 3 Ryan Safner 1 1 Department of Economics Hood College ECON 480 - Econometrics Fall 2017 Ryan Safner (Hood College) ECON 480 - Lesson 3 Fall 2017 1 / 77 Bivariate

More information

Nature vs. nurture? Lecture 18 - Regression: Inference, Outliers, and Intervals. Regression Output. Conditions for inference.

Nature vs. nurture? Lecture 18 - Regression: Inference, Outliers, and Intervals. Regression Output. Conditions for inference. Understanding regression output from software Nature vs. nurture? Lecture 18 - Regression: Inference, Outliers, and Intervals In 1966 Cyril Burt published a paper called The genetic determination of differences

More information

Inference for Regression Simple Linear Regression

Inference for Regression Simple Linear Regression Inference for Regression Simple Linear Regression IPS Chapter 10.1 2009 W.H. Freeman and Company Objectives (IPS Chapter 10.1) Simple linear regression p Statistical model for linear regression p Estimating

More information

36-309/749 Experimental Design for Behavioral and Social Sciences. Sep. 22, 2015 Lecture 4: Linear Regression

36-309/749 Experimental Design for Behavioral and Social Sciences. Sep. 22, 2015 Lecture 4: Linear Regression 36-309/749 Experimental Design for Behavioral and Social Sciences Sep. 22, 2015 Lecture 4: Linear Regression TCELL Simple Regression Example Male black wheatear birds carry stones to the nest as a form

More information

Midterm 2 - Solutions

Midterm 2 - Solutions Ecn 102 - Analysis of Economic Data University of California - Davis February 23, 2010 Instructor: John Parman Midterm 2 - Solutions You have until 10:20am to complete this exam. Please remember to put

More information