Using regression to study economic relationships is called econometrics. econo = of or pertaining to the economy. metrics = measurement

Similar documents
ECON The Simple Regression Model

Business Statistics. Lecture 9: Simple Regression

LECTURE 5. Introduction to Econometrics. Hypothesis testing

Multiple Regression Analysis. Basic Estimation Techniques. Multiple Regression Analysis. Multiple Regression Analysis

Correlation Analysis

Semester 2, 2015/2016

The Simple Linear Regression Model

Business Statistics. Lecture 10: Correlation and Linear Regression

ECON 450 Development Economics

Mathematics for Economics MA course

Exponential smoothing is, like the moving average forecast, a simple and often used forecasting technique

Review of Statistics

ECON2228 Notes 2. Christopher F Baum. Boston College Economics. cfb (BC Econ) ECON2228 Notes / 47

statistical sense, from the distributions of the xs. The model may now be generalized to the case of k regressors:

Lectures 5 & 6: Hypothesis Testing

STAT Chapter 11: Regression

EC4051 Project and Introductory Econometrics

Wooldridge, Introductory Econometrics, 4th ed. Chapter 2: The simple regression model

ECON 497 Midterm Spring

SIMPLE REGRESSION ANALYSIS. Business Statistics

Econometrics. 4) Statistical inference

22 Approximations - the method of least squares (1)

Chapter 2 The Simple Linear Regression Model: Specification and Estimation

Making sense of Econometrics: Basics

Chapter 14 Simple Linear Regression (A)

Contest Quiz 3. Question Sheet. In this quiz we will review concepts of linear regression covered in lecture 2.

Regression Analysis and Forecasting Prof. Shalabh Department of Mathematics and Statistics Indian Institute of Technology-Kanpur

Inferences for Regression

ECON3150/4150 Spring 2015

Linear Regression with one Regressor

Class time (Please Circle): 11:10am-12:25pm. or 12:45pm-2:00pm

Lecture 14 Simple Linear Regression

(ii) Scan your answer sheets INTO ONE FILE only, and submit it in the drop-box.

Ch 2: Simple Linear Regression

The regression model with one fixed regressor cont d

Chapter 11 Linear Regression

CHAPTER 5 FUNCTIONAL FORMS OF REGRESSION MODELS

1 Correlation and Inference from Regression

ACE 564 Spring Lecture 8. Violations of Basic Assumptions I: Multicollinearity and Non-Sample Information. by Professor Scott H.

Regression Analysis. BUS 735: Business Decision Making and Research. Learn how to detect relationships between ordinal and categorical variables.

2 Prediction and Analysis of Variance

Trendlines Simple Linear Regression Multiple Linear Regression Systematic Model Building Practical Issues

5.1 Model Specification and Data 5.2 Estimating the Parameters of the Multiple Regression Model 5.3 Sampling Properties of the Least Squares

Lab 07 Introduction to Econometrics

Ordinary Least Squares Regression Explained: Vartanian

The simple linear regression model discussed in Chapter 13 was written as

Applied Econometrics (QEM)

Lecture 12. Functional form

LINEAR REGRESSION ANALYSIS. MODULE XVI Lecture Exercises

Notebook Tab 6 Pages 183 to ConteSolutions

Modelling the Electric Power Consumption in Germany

Statistics Boot Camp. Dr. Stephanie Lane Institute for Defense Analyses DATAWorks 2018

Regression #8: Loose Ends

Design & Analysis of Experiments 7E 2009 Montgomery

Interactions. Interactions. Lectures 1 & 2. Linear Relationships. y = a + bx. Slope. Intercept

Chapter 3 Multiple Regression Complete Example

Chapter 7: Correlation and regression

Regression analysis is a tool for building mathematical and statistical models that characterize relationships between variables Finds a linear

Binary Logistic Regression

Statistics and Quantitative Analysis U4320. Segment 10 Prof. Sharyn O Halloran

Statistics for Managers using Microsoft Excel 6 th Edition

Do not copy, post, or distribute

Applied Econometrics (MSc.) Lecture 3 Instrumental Variables

1 Regression with Time Series Variables

REVIEW 8/2/2017 陈芳华东师大英语系

INFERENCE FOR REGRESSION

ECON4515 Finance theory 1 Diderik Lund, 5 May Perold: The CAPM

Section Least Squares Regression

Statistics II. Management Degree Management Statistics IIDegree. Statistics II. 2 nd Sem. 2013/2014. Management Degree. Simple Linear Regression

Essential of Simple regression

1. The OLS Estimator. 1.1 Population model and notation

Econometric Methods and Applications II Chapter 2: Simultaneous equations. Econometric Methods and Applications II, Chapter 2, Slide 1

Econ 110: Introduction to Economic Theory. 8th Class 2/7/11

Outline. Nature of the Problem. Nature of the Problem. Basic Econometrics in Transportation. Autocorrelation

LECTURE 6. Introduction to Econometrics. Hypothesis testing & Goodness of fit

1 A Non-technical Introduction to Regression

Variance Decomposition in Regression James M. Murray, Ph.D. University of Wisconsin - La Crosse Updated: October 04, 2017

Block 3. Introduction to Regression Analysis

Chapter 16. Simple Linear Regression and dcorrelation

Lecture 2 Simple Linear Regression STAT 512 Spring 2011 Background Reading KNNL: Chapter 1

Lecture 10 Multiple Linear Regression

Correlation 1. December 4, HMS, 2017, v1.1

Biostatistics. Chapter 11 Simple Linear Correlation and Regression. Jing Li

ECON 497: Lecture Notes 10 Page 1 of 1

11.5 Regression Linear Relationships

Lecture-1: Introduction to Econometrics

Variance Decomposition and Goodness of Fit

Chapter 14 Student Lecture Notes Department of Quantitative Methods & Information Systems. Business Statistics. Chapter 14 Multiple Regression

LECTURE 2: SIMPLE REGRESSION I

Section 3: Simple Linear Regression

CHAPTER 4 & 5 Linear Regression with One Regressor. Kazu Matsuda IBEC PHBU 430 Econometrics

Lecture 3: Linear Models. Bruce Walsh lecture notes Uppsala EQG course version 28 Jan 2012

4.1 Least Squares Prediction 4.2 Measuring Goodness-of-Fit. 4.3 Modeling Issues. 4.4 Log-Linear Models

LECTURE 1. Introduction to Econometrics

Lecture 3: Inference in SLR

Basic Business Statistics 6 th Edition


Chapter 4: Regression Models

Finding Relationships Among Variables

Scatter plot of data from the study. Linear Regression

Transcription:

EconS 450 Forecasting part 3 Forecasting with Regression Using regression to study economic relationships is called econometrics econo = of or pertaining to the economy metrics = measurement Econometrics What we attempt to do by calculating regression parameters is find a model that best mimics the true underlying data generating process. 1

Methodology of Econometrics Economic analysis proceeds along the following lines: 1. Statement of theory or hypothesis 2. Specification of the statistical model 3. Collection of data Methodology of Econometrics 4. Estimation of the parameters of the model 5. Tests of the hypothesis derived from the model 6. Forecasting or prediction Methodology of Econometrics In forecasting, we don t need to worry as much about step 5. T d ill di h (i l di 5) Today we will discuss each step (including 5) using a demand function example. 2

Suppose you wish to examine how a proposed price increase will effect the demand for coffee. 1. Statement of theory: the law of demand when price increases, consumers tend to buy less of a product. Specification of the mathematical model of the law of demand. We will assume that the relationship between Q and P is linear: Q = b 1 + b 2 P Q = b 1 + b 2 P Here b 1 represents the intercept, while b 2 is the slope parameter. If the law of demand holds, we would expect b 2 <0 (we would expect b 1 >0 why?). 3

The preceding equation is an example of a mathematical model. 2. To convert it to a statistical model, we write it as: Q = b 1 + b 2 P + e Where e is the stochastic error term. Remember, our model which relates quantity and price, carries with it a number of ceteris paribus assumptions. All variables that may affect Q which we do not include in the model are assumed to be held constant. 3.In collecting data, there is one simple rule: The more data you can obtain, the better off you ae are. You want to find a consistent series, with no time gaps and no missing observations. 4

Data Issues There are three basic types of data: time series cross-sectionalsectional pooled Data Issues Time series data are: collected over time collected at regular intervals quantitative or qualitative Data Issues Cross-sectional data are: collected at one point in time collected for one or more cross-sectional units quantitative or qualitative 5

Data Issues Pooled data are: time series collected for two or more cross sectional units quantitative or qualitative Data Issues When forecasting, we are mostly concerned with time series data. When estimating supply or demand, you may wish to use either time series or pooled data. 4.Once we have the model and data, we can obtain parameter estimates as discussed in the last lecture. The estimates are values which minimize the The estimates are values which minimize the sum of squared errors (where the error refers to the value of the error term for each period). 6

For example, our estimation yields the following: Q 76. 05 388. P There is a hat over Q to indicate that its value is based on estimates of the parameters. 5. Having estimated the demand function, we may want to find out if the results conform to our theory. For example: can we say with statistical proof that b 2 <0 can we show that b 1 >0 In order to answer these questions, we must perform a hypothesis test. For example, we may wish to test the hypothesis that b 1 = 0. We would write it like this: H o : b 1 = 0 7

H o : b 1 = 0 is called the null hypothesis In this case the alternative hypothesis would be written: H a : b 2 0 This hypothesis can be tested using a t-test. When testing a hypothesis of H o : b 1 = 0 The test is calculated by dividing the parameter estimate by its standard error. The result from dividing the estimate by its standard error are compared to a table value. A rule of thumb is that if the resulting value A rule of thumb is that if the resulting value > 1.96 then we reject the null hypothesis. 8

6. Forecasting Suppose that given our estimated demand function, we want to know what quantity would be demanded if price = $4.50 A forecast could be obtained from: Q = 76.05 3.88 (4.50) = 58.59 When forecasting time series, you will want to re-estimate the parameters with each subsequent realization and forecast only onestep-ahead. Basic Regression The most basic regression approach is known as Ordinary Least Squares (OLS). Under certain assumptions the OLS estimator has some very attractive statistical properties. The Least Squares portion of the name indicates that the OLS estimator is one that minimizes squared errors. 9

Summarizing Characteristics of Economic Data Given an economic data set, we wish to be able to describe it in terms of its behavior over time. We would like our data to be well behaved though economic data is often not so well behaved. Finding the Center of the Data As we have mentioned in class there are three basic measures of central tendency, the mean, the median and the mode. The Mean: x T i 1 T x i Finding the Center of the Data The Median: The numeric value separating the higher half of a sample from the lower half. The middle of the data. In the case where there is no single middle value, the median is usually defined as the mean of the middle values. 10

Finding the Center of the Data The Mode: The mode is the value that occurs most frequently in the data set. The mode may not be unique, and there may actually be more than one mode or no mode at all. Finding the Dispersion of the Data It is also important to know the range of possible outcomes in your data. This applies both to forecasting and risk analysis. We measure dispersion i based on the tendency of the data to be near or far from the mean. The most common measure is the sample variance. Finding the Dispersion of the Data The variances is the second moment of a distribution of data (the mean is the first) a moment is, loosely speaking, a quantitative measure of the shape of a set of points. s 2 x T i1 ( x i x) T 1 2 11

Finding the Dispersion of the Data Two things should be obvious from the formula for the variance: The larger the variance, the greater the dispersion The variance is dependent on the scale of the mean (in other words you can t directly compare variances for two series of data with widely different mean values). Now suppose that the data are available in time series and that X t = aggregate real disposable personal income in year t Y t = aggregate real vacation expenditure in year t. Whether the data are time series or cross sectional, the simplest version of the twovariable model is: Yi X i ei 12

Let the residuals from any fitted straight line be denoted by: ei Yi Y ˆ i Yi a bx i i 1,2,..., n The values of a and b are determined such that the sum of squared residuals is minimized. 2 SSE ei f ( a, b) As with any unconstrained optimization problem, the solution is found where the first derivatives with respect to the arguments are equal to zero. In this case, e b 2 e a 2 2 2 and ( Y a bx ) 2 X ( Y a bx ) 2 e 0 Xe 0 Simplifying these gives us the for the linear regression of Y on X, that is: Y na b X XY ax b We can re-write that first equation as: a Y bx X 2 13

Substituting for a in the second normal equation gives: n i i1 b n ( X X )( Y Y ) i1 ( X i X ) Thus, the least-squares slope can be estimated from the equation, and the result substituted in to calculate the intercept. i 2 The least-squares line has three important properties. It minimizes sum of squared residuals (errors), it passes through the mean point ( X, Y ) and the least squares residuals have zero correlation in the sample with the values of X Fortunately, we don t have to memorize all of these formulas, or try to translate them to a spreadsheet. Excel has a built in Regression program that Excel has a built in Regression program that can be found (if it is installed) under the Data tab by selecting Data Analysis 14

How Good is Your Model? One measure (albeit an imperfect one) of the goodness of a model is the coefficient of determination or R 2. R 2 indicates the proportion of the total variation in the dependent variable explained by the independent variables. How Good is Your Model? R 2 can be likened to an in-sample measure of forecasting accuracy. How well y i predicts y i Can only be measured in-sample How Good is Your Model? Perhaps a better measure of the goodness of your model is how well it forecasts outof-sample. You can think of MSE as a kind of out-ofsample R 2. 15