ALR 2014 Caual Modelling Workshop
|
|
- Silas Burke
- 6 years ago
- Views:
Transcription
1 ALR 2014 Caual Modelling Workshop Example 1: Interupted time series The impact of public transportation strikes on use of a bicycle share program in London: Interrupted time series design Fuller D, Sahlqvist S, Cummins S, Ogilvie D. The impact of public transportation strikes on use of a bicycle share program in London: Interrupted time series design. Preventive Medicine. 2011;54(1):74â 76. Analysis set up Set working directory and importing the data into R options(scipen = 1, digits = 2) opts_chunk$set(warning = FALSE, message = FALSE, error = FALSE) setwd("/users/dogleg/dropbox/conferences/2014 ALR/Workshop Data/") boris_data <- read.csv(" Installing necessary packages install.packages("ggplot2", repos = " library(ggplot2) install.packages("car", repos = " library(car) Objectives We want to examine the impact of 2 tube (read subway) strikes in London on the use of the Boris Bike program. The strikes happened on September 6th, 2010 and October 4th, First we need to generate the appropriate variables for the analysis. First, we need 2 dummy variables that indicate before and after each strike. Creating necessary variables Creating dichotomous variables to assess the immediate impact of the tube strike Key dates that we need for the analysis are the tube strikes 1. September 6, October 4, 2010 The first strike happened at time 39 (September 6, 2010) so we need to create a variable that is 0 before time 39, 1 between time 39 and 65, and 0 again after time 65. boris_data$strike1 <- 1 boris_data$strike1[boris_data$time < 39] <- "0" boris_data$strike1[boris_data$time > 66] <- "0" boris_data$strike1 <- as.numeric(boris_data$strike1) The second strike happened at time 65 (October 4, 2010) so we need to create a variable that is 1 before time 65 and 0 after after time 65.
2 boris_data$strike2 <- boris_data$time - 66 boris_data$strike2 <- ifelse(boris_data$time < 67, c("0"), c("1")) boris_data$strike2 <- as.numeric(boris_data$strike2) Creating continuous variables to assess the change in impact over time Creating a variable that is 0 before the first ban and linear after the first ban boris_data$slope2 <- boris_data$time - 38 boris_data$slope2[boris_data$time < 39] <- "0" boris_data$slope2 <- as.numeric(boris_data$slope2) Creating a variable that is 0 before the second ban and linear after the second ban boris_data$slope3 <- boris_data$time - 66 boris_data$slope3[boris_data$time < 67] <- "0" boris_data$slope3 <- as.numeric(boris_data$slope3) Creating a categorical variable that defines each strike period (i.e., pre, strike1 and strike 2). This is mostly to simply our life when graphing. boris_data$strike[boris_data$time < 39] <- "0" boris_data$strike[boris_data$time > 38 & boris_data$time < 67] <- "1" boris_data$strike[boris_data$time > 66] <- "2" Dataset structure Your dataset should look like below. Notice that the dummy variables change at the correct times. library(xtable) cut_data <- boris_data[30:75, ] print(xtable(cut_data), type = "html") start_date duration_sec t_trip duration_min time time1 time2 t0_int t1_int t2_int strike1 strike2 slope2 slope3 strike Aug Aug Aug Aug Sep Sep Sep Sep Sep Sep Sep Sep Sep Sep Sep Sep Sep Sep
3 48 15-Sep Sep Sep Sep Sep Sep Sep Sep Sep Sep Sep Sep Sep Sep Sep Sep Oct Oct Oct Oct Oct Oct Oct Oct Oct Oct Oct Oct Some simple scatter plots analysis Scatter plot of the Date (x-axis) and Total number of trips (y-axis). Make sure the data is sorted by Date. library(ggplot2) ggplot(boris_data, aes(x = time, y = t_trip)) + geom_point(shape = 1) + xlab("days") + ylab("number of Boris Bike Trips") + theme_bw()
4 Scatter plot of the Date (x-axis) and Total number of trips (y-axis) adding the linear fit (blue) and dashed red lines to represent the two strikes. ggplot(boris_data, aes(x = time, y = t_trip)) + geom_point(shape = 1) + stat_smooth(method = "lm", se = FALSE) + geom_vline(xintercept = 37, colour = "red", linetype = "longdash") + geom_vline(xintercept = 66, colour = "red", linetype = "longdash") + xlab("days") + ylab("number of Boris Bike Trips") + theme_bw() Regression of each of the newly created variables on the total number of trips Regression equation
5 y^ = β 0 + β 1 Tim e 1 + β 2 Strike β 3 Slope β 4 Strike β 5 Slope ε trip_reg <- lm(t_trip ~ time + strike1 + slope2 + strike2 + slope3, data = boris_data) print(xtable(trip_reg), type = "html") Estimate Std. Error t value Pr(> t ) (Intercept) time strike slope strike slope Interpretation of the coefficients is as follows: Beta0 (Intercept) â Value of dependent variable at baseline Beta1 (time1) - Trend prior to 1st intervention implementation Beta2 (strike1) - Difference between the last point prior and first point post the 1st intervention implementation Beta3 (slope2) â Change in trend from pre to post 1st implementation Beta4 (strike2) - Difference between last point in 1st implementation period and first point in 2nd implementation period Beta5 (slope3) â Change in trend from 1st to 2nd implementation periods Getting predicted values from the regression and mergning them to the "boris_data"" data set trip_hat <- fitted(trip_reg) # predicted values boris_data$trip_hat <- fitted(trip_reg) Getting residual values from the regression and mergning them to the "boris_data"" data set trip_resid <- residuals(trip_reg) # residuals boris_data$trip_resid <- (trip_resid) Graph the regression plot ggplot(boris_data, aes(x = time, y = t_trip)) + geom_point(shape = 1) + geom_smooth(aes(group = strike), method = "lm", se = FALSE) + geom_vline(xintercept = 37, colour = "red", linetype = "longdash") + geom_vline(xintercept = 66, colour = "red", linetype = "longdash") + xlab("days") + ylab("number of Boris Bike Trips") + theme_bw()
6 Notes. The plot is a new linear model for each strike period. It is mostly for descriptive purposes. Including CI's in this graph is incorrect as they will be based on the actual model but rather the model used in the graph. That's why I've set the "se=false." Model fit diagnostics Fitted residuals, Normal Q-Q plot opar <- par(mfrow = c(2, 2), oma = c(0, 0, 1.1, 0)) plot(trip_reg, las = 1) The Durbin-Watson statistic is a simple numerical method for checking serial dependence.
7 If the Durbinâ Watson statistic is substantially less than 2, there is evidence of positive serial correlation. As a rough rule of thumb, if Durbinâ Watson is less than 1.0, there may be cause for alarm. Small values of d indicate successive error terms are, on average, close in value to one another, or positively correlated.wiki library(car) durbinwatsontest(trip_reg) ## lag Autocorrelation D-W Statistic p-value ## ## Alternative hypothesis: rho!= 0 Additional notes 1. We could easily add non linear terms (e.g., quadratic) to the models by squaring or cubing the time, slope2 and slope 3 variables. This might help with model fit. 2. My guess is that there is likely a weekend/weekday autocorrelation. We could add a weekend/weekday dummy variable, which would likely improve model fit. 3. If serial autocorrelation continus to be a problem (after adding weekend/weekday variable) we could add Auto-Regressive Integrated Moving Average (ARIMA) components to the model. 4. There is a package designed for interrupted time series regression in R. I have purposefully not shown this package because it is better to understand the model running the analysis the "manual" way. install.packages("segmented", repos = " library(segmented) The end
lm statistics Chris Parrish
lm statistics Chris Parrish 2017-04-01 Contents s e and R 2 1 experiment1................................................. 2 experiment2................................................. 3 experiment3.................................................
More informationDiagnostics of Linear Regression
Diagnostics of Linear Regression Junhui Qian October 7, 14 The Objectives After estimating a model, we should always perform diagnostics on the model. In particular, we should check whether the assumptions
More informationRegression Analysis: Exploring relationships between variables. Stat 251
Regression Analysis: Exploring relationships between variables Stat 251 Introduction Objective of regression analysis is to explore the relationship between two (or more) variables so that information
More informationEco and Bus Forecasting Fall 2016 EXERCISE 2
ECO 5375-701 Prof. Tom Fomby Eco and Bus Forecasting Fall 016 EXERCISE Purpose: To learn how to use the DTDS model to test for the presence or absence of seasonality in time series data and to estimate
More informationNon-independence due to Time Correlation (Chapter 14)
Non-independence due to Time Correlation (Chapter 14) When we model the mean structure with ordinary least squares, the mean structure explains the general trends in the data with respect to our dependent
More informationExample of data analysis homework
Example of data analysis homework Average temperature in Corvallis This is an example of how your response for the data analysis section of the homework should be presented. Note: Use section headings
More information( ), which of the coefficients would end
Discussion Sheet 29.7.9 Qualitative Variables We have devoted most of our attention in multiple regression to quantitative or numerical variables. MR models can become more useful and complex when we consider
More informationMultiple Regression and Model Building Lecture 20 1 May 2006 R. Ryznar
Multiple Regression and Model Building 11.220 Lecture 20 1 May 2006 R. Ryznar Building Models: Making Sure the Assumptions Hold 1. There is a linear relationship between the explanatory (independent) variable(s)
More informationOutline. Nature of the Problem. Nature of the Problem. Basic Econometrics in Transportation. Autocorrelation
1/30 Outline Basic Econometrics in Transportation Autocorrelation Amir Samimi What is the nature of autocorrelation? What are the theoretical and practical consequences of autocorrelation? Since the assumption
More informationRegression Models for Time Trends: A Second Example. INSR 260, Spring 2009 Bob Stine
Regression Models for Time Trends: A Second Example INSR 260, Spring 2009 Bob Stine 1 Overview Resembles prior textbook occupancy example Time series of revenue, costs and sales at Best Buy, in millions
More information3 Non-linearities and Dummy Variables
3 Non-linearities and Dummy Variables Reading: Kennedy (1998) A Guide to Econometrics, Chapters 3, 5 and 6 Aim: The aim of this section is to introduce students to ways of dealing with non-linearities
More information1 Forecasting House Starts
1396, Time Series, Week 5, Fall 2007 1 In this handout, we will see the application example on chapter 5. We use the same example as illustrated in the textbook and fit the data with several models of
More information22s:152 Applied Linear Regression. Returning to a continuous response variable Y...
22s:152 Applied Linear Regression Generalized Least Squares Returning to a continuous response variable Y... Ordinary Least Squares Estimation The classical models we have fit so far with a continuous
More information22s:152 Applied Linear Regression. In matrix notation, we can write this model: Generalized Least Squares. Y = Xβ + ɛ with ɛ N n (0, Σ)
22s:152 Applied Linear Regression Generalized Least Squares Returning to a continuous response variable Y Ordinary Least Squares Estimation The classical models we have fit so far with a continuous response
More informationLECTURE 11. Introduction to Econometrics. Autocorrelation
LECTURE 11 Introduction to Econometrics Autocorrelation November 29, 2016 1 / 24 ON PREVIOUS LECTURES We discussed the specification of a regression equation Specification consists of choosing: 1. correct
More informationExample. Multiple Regression. Review of ANOVA & Simple Regression /749 Experimental Design for Behavioral and Social Sciences
36-309/749 Experimental Design for Behavioral and Social Sciences Sep. 29, 2015 Lecture 5: Multiple Regression Review of ANOVA & Simple Regression Both Quantitative outcome Independent, Gaussian errors
More informationReading Assignment. Serial Correlation and Heteroskedasticity. Chapters 12 and 11. Kennedy: Chapter 8. AREC-ECON 535 Lec F1 1
Reading Assignment Serial Correlation and Heteroskedasticity Chapters 1 and 11. Kennedy: Chapter 8. AREC-ECON 535 Lec F1 1 Serial Correlation or Autocorrelation y t = β 0 + β 1 x 1t + β x t +... + β k
More informationThe Model Building Process Part I: Checking Model Assumptions Best Practice (Version 1.1)
The Model Building Process Part I: Checking Model Assumptions Best Practice (Version 1.1) Authored by: Sarah Burke, PhD Version 1: 31 July 2017 Version 1.1: 24 October 2017 The goal of the STAT T&E COE
More informationScenario 5: Internet Usage Solution. θ j
Scenario : Internet Usage Solution Some more information would be interesting about the study in order to know if we can generalize possible findings. For example: Does each data point consist of the total
More informationLECTURE 10: MORE ON RANDOM PROCESSES
LECTURE 10: MORE ON RANDOM PROCESSES AND SERIAL CORRELATION 2 Classification of random processes (cont d) stationary vs. non-stationary processes stationary = distribution does not change over time more
More informationTIME SERIES ANALYSIS AND FORECASTING USING THE STATISTICAL MODEL ARIMA
CHAPTER 6 TIME SERIES ANALYSIS AND FORECASTING USING THE STATISTICAL MODEL ARIMA 6.1. Introduction A time series is a sequence of observations ordered in time. A basic assumption in the time series analysis
More informationAnalysis of Covariance: Comparing Regression Lines
Chapter 7 nalysis of Covariance: Comparing Regression ines Suppose that you are interested in comparing the typical lifetime (hours) of two tool types ( and ). simple analysis of the data given below would
More informationWeek 8 Hour 1: More on polynomial fits. The AIC
Week 8 Hour 1: More on polynomial fits. The AIC Hour 2: Dummy Variables Hour 3: Interactions Stat 302 Notes. Week 8, Hour 3, Page 1 / 36 Interactions. So far we have extended simple regression in the following
More informationFinQuiz Notes
Reading 10 Multiple Regression and Issues in Regression Analysis 2. MULTIPLE LINEAR REGRESSION Multiple linear regression is a method used to model the linear relationship between a dependent variable
More informationCHAPTER 21: TIME SERIES ECONOMETRICS: SOME BASIC CONCEPTS
CHAPTER 21: TIME SERIES ECONOMETRICS: SOME BASIC CONCEPTS 21.1 A stochastic process is said to be weakly stationary if its mean and variance are constant over time and if the value of the covariance between
More informationThe Model Building Process Part I: Checking Model Assumptions Best Practice
The Model Building Process Part I: Checking Model Assumptions Best Practice Authored by: Sarah Burke, PhD 31 July 2017 The goal of the STAT T&E COE is to assist in developing rigorous, defensible test
More informationChapter 4 Describing the Relation between Two Variables
Chapter 4 Describing the Relation between Two Variables 4.1 Scatter Diagrams and Correlation The is the variable whose value can be explained by the value of the or. A is a graph that shows the relationship
More informationDummy Variables. Susan Thomas IGIDR, Bombay. 24 November, 2008
IGIDR, Bombay 24 November, 2008 The problem of structural change Model: Y i = β 0 + β 1 X 1i + ɛ i Structural change, type 1: change in parameters in time. Y i = α 1 + β 1 X i + e 1i for period 1 Y i =
More informationITSx: Policy Analysis Using Interrupted Time Series
ITSx: Policy Analysis Using Interrupted Time Series Week 3 Slides Michael Law, Ph.D. The University of British Columbia Layout of the weeks 1. Introduction, setup, data sources 2. Single series interrupted
More informationAnalysis. Components of a Time Series
Module 8: Time Series Analysis 8.2 Components of a Time Series, Detection of Change Points and Trends, Time Series Models Components of a Time Series There can be several things happening simultaneously
More informationECON 4230 Intermediate Econometric Theory Exam
ECON 4230 Intermediate Econometric Theory Exam Multiple Choice (20 pts). Circle the best answer. 1. The Classical assumption of mean zero errors is satisfied if the regression model a) is linear in the
More informationIntroduction and Single Predictor Regression. Correlation
Introduction and Single Predictor Regression Dr. J. Kyle Roberts Southern Methodist University Simmons School of Education and Human Development Department of Teaching and Learning Correlation A correlation
More informationLDA Midterm Due: 02/21/2005
LDA.665 Midterm Due: //5 Question : The randomized intervention trial is designed to answer the scientific questions: whether social network method is effective in retaining drug users in treatment programs,
More informationAnalysis of binary repeated measures data with R
Analysis of binary repeated measures data with R Right-handed basketball players take right and left-handed shots from 3 locations in a different random order for each player. Hit or miss is recorded.
More informationMath 3330: Solution to midterm Exam
Math 3330: Solution to midterm Exam Question 1: (14 marks) Suppose the regression model is y i = β 0 + β 1 x i + ε i, i = 1,, n, where ε i are iid Normal distribution N(0, σ 2 ). a. (2 marks) Compute the
More informationMultiple Regression Analysis
1 OUTLINE Basic Concept: Multiple Regression MULTICOLLINEARITY AUTOCORRELATION HETEROSCEDASTICITY REASEARCH IN FINANCE 2 BASIC CONCEPTS: Multiple Regression Y i = β 1 + β 2 X 1i + β 3 X 2i + β 4 X 3i +
More informationTime Series Analysis of United States of America Crude Oil and Petroleum Products Importations from Saudi Arabia
International Journal of Applied Science and Technology Vol. 5, No. 5; October 2015 Time Series Analysis of United States of America Crude Oil and Petroleum Products Importations from Saudi Arabia Olayan
More information1.) Fit the full model, i.e., allow for separate regression lines (different slopes and intercepts) for each species
Lecture notes 2/22/2000 Dummy variables and extra SS F-test Page 1 Crab claw size and closing force. Problem 7.25, 10.9, and 10.10 Regression for all species at once, i.e., include dummy variables for
More informationOLSQ. Function: Usage:
OLSQ OLSQ (HCTYPE=robust SE type, HI, ROBUSTSE, SILENT, TERSE, UNNORM, WEIGHT=name of weighting variable, WTYPE=weight type) dependent variable list of independent variables ; Function: OLSQ is the basic
More informationRegression Analysis Primer DEO PowerPoint, Bureau of Labor Market Statistics
Regression Analysis Primer DEO PowerPoint, Bureau of Labor Market Statistics September 27-30, 2017 Regression Analysis Stephen Birch, Economic Consultant LTIP Technical Lead, Projections Managing Partnership
More informationApplied Econometrics. Applied Econometrics. Applied Econometrics. Applied Econometrics. What is Autocorrelation. Applied Econometrics
Autocorrelation 1. What is 2. What causes 3. First and higher orders 4. Consequences of 5. Detecting 6. Resolving Learning Objectives 1. Understand meaning of in the CLRM 2. What causes 3. Distinguish
More informationLast updated: Oct 18, 2012 LINEAR REGRESSION PSYC 3031 INTERMEDIATE STATISTICS LABORATORY. J. Elder
Last updated: Oct 18, 2012 LINEAR REGRESSION Acknowledgements 2 Some of these slides have been sourced or modified from slides created by A. Field for Discovering Statistics using R. Simple Linear Objectives
More informationChapter 5 Exercises 1
Chapter 5 Exercises 1 Data Analysis & Graphics Using R, 2 nd edn Solutions to Exercises (December 13, 2006) Preliminaries > library(daag) Exercise 2 For each of the data sets elastic1 and elastic2, determine
More informationNATCOR Regression Modelling for Time Series
Universität Hamburg Institut für Wirtschaftsinformatik Prof. Dr. D.B. Preßmar Professor Robert Fildes NATCOR Regression Modelling for Time Series The material presented has been developed with the substantial
More informationSpurious regressions PE The last lecture discussed the idea of non-stationarity and described how many economic time series are non-stationary.
Spurious regressions PE 1.. The last lecture discussed the idea of non-stationarity and described how many economic time series are non-stationary. Unfortunately non-stationarity trips up many of the conventional
More informationAnswers: Problem Set 9. Dynamic Models
Answers: Problem Set 9. Dynamic Models 1. Given annual data for the period 1970-1999, you undertake an OLS regression of log Y on a time trend, defined as taking the value 1 in 1970, 2 in 1972 etc. The
More informationAUTOCORRELATION. Phung Thanh Binh
AUTOCORRELATION Phung Thanh Binh OUTLINE Time series Gauss-Markov conditions The nature of autocorrelation Causes of autocorrelation Consequences of autocorrelation Detecting autocorrelation Remedial measures
More informationSuan Sunandha Rajabhat University
Forecasting Exchange Rate between Thai Baht and the US Dollar Using Time Series Analysis Kunya Bowornchockchai Suan Sunandha Rajabhat University INTRODUCTION The objective of this research is to forecast
More informationMS&E 226: Small Data
MS&E 226: Small Data Lecture 2: Linear Regression (v3) Ramesh Johari rjohari@stanford.edu September 29, 2017 1 / 36 Summarizing a sample 2 / 36 A sample Suppose Y = (Y 1,..., Y n ) is a sample of real-valued
More information11.1 Gujarati(2003): Chapter 12
11.1 Gujarati(2003): Chapter 12 Time Series Data 11.2 Time series process of economic variables e.g., GDP, M1, interest rate, echange rate, imports, eports, inflation rate, etc. Realization An observed
More informationFirstly, the dataset is cleaned and the years and months are separated to provide better distinction (sample below).
Project: Forecasting Sales Step 1: Plan Your Analysis Answer the following questions to help you plan out your analysis: 1. Does the dataset meet the criteria of a time series dataset? Make sure to explore
More informationExercise Sheet 6: Solutions
Exercise Sheet 6: Solutions R.G. Pierse 1. (a) Regression yields: Dependent Variable: LC Date: 10/29/02 Time: 18:37 Sample(adjusted): 1950 1985 Included observations: 36 after adjusting endpoints C 0.244716
More informationCircle the single best answer for each multiple choice question. Your choice should be made clearly.
TEST #1 STA 4853 March 6, 2017 Name: Please read the following directions. DO NOT TURN THE PAGE UNTIL INSTRUCTED TO DO SO Directions This exam is closed book and closed notes. There are 32 multiple choice
More informationLinear Regression. In this lecture we will study a particular type of regression model: the linear regression model
1 Linear Regression 2 Linear Regression In this lecture we will study a particular type of regression model: the linear regression model We will first consider the case of the model with one predictor
More informationAGEC 621 Lecture 16 David Bessler
AGEC 621 Lecture 16 David Bessler This is a RATS output for the dummy variable problem given in GHJ page 422; the beer expenditure lecture (last time). I do not expect you to know RATS but this will give
More informationComparing Nested Models
Comparing Nested Models ST 370 Two regression models are called nested if one contains all the predictors of the other, and some additional predictors. For example, the first-order model in two independent
More informationLecture 5: Omitted Variables, Dummy Variables and Multicollinearity
Lecture 5: Omitted Variables, Dummy Variables and Multicollinearity R.G. Pierse 1 Omitted Variables Suppose that the true model is Y i β 1 + β X i + β 3 X 3i + u i, i 1,, n (1.1) where β 3 0 but that the
More informationWorkshop 7.4a: Single factor ANOVA
-1- Workshop 7.4a: Single factor ANOVA Murray Logan November 23, 2016 Table of contents 1 Revision 1 2 Anova Parameterization 2 3 Partitioning of variance (ANOVA) 10 4 Worked Examples 13 1. Revision 1.1.
More informationInterrupted Time Series Analysis for Single Series and Comparative Designs: Using Administrative Data for Healthcare Impact Assessment
Interrupted Time Series Analysis for Single Series and Comparative Designs: Using Administrative Data for Healthcare Impact Assessment Joseph M. Caswell, Ph.D. Lead Analyst Institute for Clinical Evaluative
More informationHeteroskedasticity in Panel Data
Essex Summer School in Social Science Data Analysis Panel Data Analysis for Comparative Research Heteroskedasticity in Panel Data Christopher Adolph Department of Political Science and Center for Statistics
More informationCircle a single answer for each multiple choice question. Your choice should be made clearly.
TEST #1 STA 4853 March 4, 215 Name: Please read the following directions. DO NOT TURN THE PAGE UNTIL INSTRUCTED TO DO SO Directions This exam is closed book and closed notes. There are 31 questions. Circle
More informationF9 F10: Autocorrelation
F9 F10: Autocorrelation Feng Li Department of Statistics, Stockholm University Introduction In the classic regression model we assume cov(u i, u j x i, x k ) = E(u i, u j ) = 0 What if we break the assumption?
More informationReminder: Univariate Data. Bivariate Data. Example: Puppy Weights. You weigh the pups and get these results: 2.5, 3.5, 3.3, 3.1, 2.6, 3.6, 2.
TP: To review Standard Deviation, Residual Plots, and Correlation Coefficients HW: Do a journal entry on each of the calculator tricks in this lesson. Lesson slides will be posted with notes. Do Now: Write
More informationSTAT 572 Assignment 5 - Answers Due: March 2, 2007
1. The file glue.txt contains a data set with the results of an experiment on the dry sheer strength (in pounds per square inch) of birch plywood, bonded with 5 different resin glues A, B, C, D, and E.
More informationHeteroskedasticity in Panel Data
Essex Summer School in Social Science Data Analysis Panel Data Analysis for Comparative Research Heteroskedasticity in Panel Data Christopher Adolph Department of Political Science and Center for Statistics
More informationExercise Sheet 5: Solutions
Exercise Sheet 5: Solutions R.G. Pierse 2. Estimation of Model M1 yields the following results: Date: 10/24/02 Time: 18:06 C -1.448432 0.696587-2.079327 0.0395 LPC -0.306051 0.272836-1.121740 0.2640 LPF
More informationChapter 8: Correlation & Regression
Chapter 8: Correlation & Regression We can think of ANOVA and the two-sample t-test as applicable to situations where there is a response variable which is quantitative, and another variable that indicates
More informationChapte The McGraw-Hill Companies, Inc. All rights reserved.
12er12 Chapte Bivariate i Regression (Part 1) Bivariate Regression Visual Displays Begin the analysis of bivariate data (i.e., two variables) with a scatter plot. A scatter plot - displays each observed
More informationVARIANCE ANALYSIS OF WOOL WOVEN FABRICS TENSILE STRENGTH USING ANCOVA MODEL
ANNALS OF THE UNIVERSITY OF ORADEA FASCICLE OF TEXTILES, LEATHERWORK VARIANCE ANALYSIS OF WOOL WOVEN FABRICS TENSILE STRENGTH USING ANCOVA MODEL VÎLCU Adrian 1, HRISTIAN Liliana 2, BORDEIANU Demetra Lăcrămioara
More informationPart II { Oneway Anova, Simple Linear Regression and ANCOVA with R
Part II { Oneway Anova, Simple Linear Regression and ANCOVA with R Gilles Lamothe February 21, 2017 Contents 1 Anova with one factor 2 1.1 The data.......................................... 2 1.2 A visual
More informationEconometric Forecasting Overview
Econometric Forecasting Overview April 30, 2014 Econometric Forecasting Econometric models attempt to quantify the relationship between the parameter of interest (dependent variable) and a number of factors
More informationANOVA (Analysis of Variance) output RLS 11/20/2016
ANOVA (Analysis of Variance) output RLS 11/20/2016 1. Analysis of Variance (ANOVA) The goal of ANOVA is to see if the variation in the data can explain enough to see if there are differences in the means.
More informationEcon 427, Spring Problem Set 3 suggested answers (with minor corrections) Ch 6. Problems and Complements:
Econ 427, Spring 2010 Problem Set 3 suggested answers (with minor corrections) Ch 6. Problems and Complements: 1. (page 132) In each case, the idea is to write these out in general form (without the lag
More informationMULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS
MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS Page 1 MSR = Mean Regression Sum of Squares MSE = Mean Squared Error RSS = Regression Sum of Squares SSE = Sum of Squared Errors/Residuals α = Level
More informationMultiple Regression Part I STAT315, 19-20/3/2014
Multiple Regression Part I STAT315, 19-20/3/2014 Regression problem Predictors/independent variables/features Or: Error which can never be eliminated. Our task is to estimate the regression function f.
More informationFinancial Time Series Analysis: Part II
Department of Mathematics and Statistics, University of Vaasa, Finland Spring 2017 1 Unit root Deterministic trend Stochastic trend Testing for unit root ADF-test (Augmented Dickey-Fuller test) Testing
More informationEstimation. John M. Drake & Pejman Rohani
Estimation John M. Drake & Pejman Rohani 1 Estimating R 0 So far in this class we have focused on the theory of infectious disease. Often, however, we will want to apply this theory to particular situations.
More informationDiscriminant Analysis
Chapter 16 Discriminant Analysis A researcher collected data on two external features for two (known) sub-species of an insect. She can use discriminant analysis to find linear combinations of the features
More informationSTAT Regression Methods
STAT 501 - Regression Methods Unit 9 Examples Example 1: Quake Data Let y t = the annual number of worldwide earthquakes with magnitude greater than 7 on the Richter scale for n = 99 years. Figure 1 gives
More information22s:152 Applied Linear Regression
22s:152 Applied Linear Regression Chapter 7: Dummy Variable Regression So far, we ve only considered quantitative variables in our models. We can integrate categorical predictors by constructing artificial
More informationCHAPTER 4 & 5 Linear Regression with One Regressor. Kazu Matsuda IBEC PHBU 430 Econometrics
CHAPTER 4 & 5 Linear Regression with One Regressor Kazu Matsuda IBEC PHBU 430 Econometrics Introduction Simple linear regression model = Linear model with one independent variable. y = dependent variable
More informationDensity Temp vs Ratio. temp
Temp Ratio Density 0.00 0.02 0.04 0.06 0.08 0.10 0.12 Density 0.0 0.2 0.4 0.6 0.8 1.0 1. (a) 170 175 180 185 temp 1.0 1.5 2.0 2.5 3.0 ratio The histogram shows that the temperature measures have two peaks,
More informationEconomics 308: Econometrics Professor Moody
Economics 308: Econometrics Professor Moody References on reserve: Text Moody, Basic Econometrics with Stata (BES) Pindyck and Rubinfeld, Econometric Models and Economic Forecasts (PR) Wooldridge, Jeffrey
More informationAugust 2018 ALGEBRA 1
August 0 ALGEBRA 3 0 3 Access to Algebra course :00 Algebra Orientation Course Introduction and Reading Checkpoint 0.0 Expressions.03 Variables.0 3.0 Translate Words into Variable Expressions DAY.0 Translate
More informationEconometrics Part Three
!1 I. Heteroskedasticity A. Definition 1. The variance of the error term is correlated with one of the explanatory variables 2. Example -- the variance of actual spending around the consumption line increases
More information1.3 Linear Functions
1.3 Linear Functions A function of the form f(x) = ax + b, where a and b are constants, is called a linear function. The graph of f(x) = ax + b or y = mx + b, is a line, with slope a (or m), and y intercept
More informationRef.: Spring SOS3003 Applied data analysis for social science Lecture note
SOS3003 Applied data analysis for social science Lecture note 05-2010 Erling Berge Department of sociology and political science NTNU Spring 2010 Erling Berge 2010 1 Literature Regression criticism I Hamilton
More informationCHAPTER 8 MODEL DIAGNOSTICS. 8.1 Residual Analysis
CHAPTER 8 MODEL DIAGNOSTICS We have now discussed methods for specifying models and for efficiently estimating the parameters in those models. Model diagnostics, or model criticism, is concerned with testing
More informationReader s Guide ANALYSIS OF VARIANCE. Quantitative and Qualitative Research, Debate About Secondary Analysis of Qualitative Data BASIC STATISTICS
ANALYSIS OF VARIANCE Analysis of Covariance (ANCOVA) Analysis of Variance (ANOVA) Main Effect Model I ANOVA Model II ANOVA Model III ANOVA One-Way ANOVA Two-Way ANOVA ASSOCIATION AND CORRELATION Association
More informationRegression on Faithful with Section 9.3 content
Regression on Faithful with Section 9.3 content The faithful data frame contains 272 obervational units with variables waiting and eruptions measuring, in minutes, the amount of wait time between eruptions,
More informationA. Incorrect! Apply the rational root test to determine if any rational roots exist.
College Algebra - Problem Drill 13: Zeros of Polynomial Functions No. 1 of 10 1. Determine which statement is true given f() = 3 + 4. A. f() is irreducible. B. f() has no real roots. C. There is a root
More informationBIOL 458 BIOMETRY Lab 9 - Correlation and Bivariate Regression
BIOL 458 BIOMETRY Lab 9 - Correlation and Bivariate Regression Introduction to Correlation and Regression The procedures discussed in the previous ANOVA labs are most useful in cases where we are interested
More informationModeling and Forecasting Currency in Circulation in Sri Lanka
Modeling and Forecasting Currency in Circulation in Sri Lanka Rupa Dheerasinghe 1 Abstract Currency in circulation is typically estimated either by specifying a currency demand equation based on the theory
More information1. Time-dependent data in general
Lecture 10 Program 1. Time-dependent data in general 2. Repeated Measurements 3. Time series 4. Time series that depend on covariate time-series 1 Time-dependent data: Outcomes that are measured at several
More informationData Analysis 1 LINEAR REGRESSION. Chapter 03
Data Analysis 1 LINEAR REGRESSION Chapter 03 Data Analysis 2 Outline The Linear Regression Model Least Squares Fit Measures of Fit Inference in Regression Other Considerations in Regression Model Qualitative
More informationMidterm 2 - Solutions
Ecn 102 - Analysis of Economic Data University of California - Davis February 23, 2010 Instructor: John Parman Midterm 2 - Solutions You have until 10:20am to complete this exam. Please remember to put
More informationCategorical Predictor Variables
Categorical Predictor Variables We often wish to use categorical (or qualitative) variables as covariates in a regression model. For binary variables (taking on only 2 values, e.g. sex), it is relatively
More informationMultiple Regression Basic
Chapter 304 Multiple Regression Basic Introduction Multiple Regression Analysis refers to a set of techniques for studying the straight-line relationships among two or more variables. Multiple regression
More informationLinear Regression. Simple linear regression model determines the relationship between one dependent variable (y) and one independent variable (x).
Linear Regression Simple linear regression model determines the relationship between one dependent variable (y) and one independent variable (x). A dependent variable is a random variable whose variation
More informationG. S. Maddala Kajal Lahiri. WILEY A John Wiley and Sons, Ltd., Publication
G. S. Maddala Kajal Lahiri WILEY A John Wiley and Sons, Ltd., Publication TEMT Foreword Preface to the Fourth Edition xvii xix Part I Introduction and the Linear Regression Model 1 CHAPTER 1 What is Econometrics?
More information