LINEAR REGRESSION ANALYSIS

Similar documents
Chapter 5 Transformation and Weighting to Correct Model Inadequacies

Chapter 14 Logistic Regression Models

Multiple Linear Regression Analysis

ECONOMETRIC THEORY. MODULE VIII Lecture - 26 Heteroskedasticity

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Multivariate Transformation of Variables and Maximum Likelihood Estimation

STATISTICAL PROPERTIES OF LEAST SQUARES ESTIMATORS. x, where. = y - ˆ " 1

STK3100 and STK4100 Autumn 2017

STK3100 and STK4100 Autumn 2018

Maximum Likelihood Estimation

( ) = ( ) ( ) Chapter 13 Asymptotic Theory and Stochastic Regressors. Stochastic regressors model

Special Instructions / Useful Data

CHAPTER VI Statistical Analysis of Experimental Data

{ }{ ( )} (, ) = ( ) ( ) ( ) Chapter 14 Exercises in Sampling Theory. Exercise 1 (Simple random sampling): Solution:

Point Estimation: definition of estimators

X ε ) = 0, or equivalently, lim

Simple Linear Regression

Chapter 5 Properties of a Random Sample

TESTS BASED ON MAXIMUM LIKELIHOOD

STA302/1001-Fall 2008 Midterm Test October 21, 2008

UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS

Ordinary Least Squares Regression. Simple Regression. Algebra and Assumptions.

Analysis of Variance with Weibull Data

LECTURE - 4 SIMPLE RANDOM SAMPLING DR. SHALABH DEPARTMENT OF MATHEMATICS AND STATISTICS INDIAN INSTITUTE OF TECHNOLOGY KANPUR

A New Family of Transformations for Lifetime Data

ESS Line Fitting

L(θ X) s 0 (1 θ 0) m s. (s/m) s (1 s/m) m s

Sampling Theory MODULE X LECTURE - 35 TWO STAGE SAMPLING (SUB SAMPLING)

Lecture 7. Confidence Intervals and Hypothesis Tests in the Simple CLR Model

Lecture Notes Types of economic variables

Chapter 8. Inferences about More Than Two Population Central Values

STK4011 and STK9011 Autumn 2016

UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS

ε. Therefore, the estimate

M2S1 - EXERCISES 8: SOLUTIONS

Lecture 8: Linear Regression

Dr. Shalabh. Indian Institute of Technology Kanpur

THE ROYAL STATISTICAL SOCIETY GRADUATE DIPLOMA

Functions of Random Variables

1 Solution to Problem 6.40

ENGI 3423 Simple Linear Regression Page 12-01

Summary of the lecture in Biostatistics

ECON 5360 Class Notes GMM

12.2 Estimating Model parameters Assumptions: ox and y are related according to the simple linear regression model

Sampling Theory MODULE V LECTURE - 14 RATIO AND PRODUCT METHODS OF ESTIMATION

THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE

Lecture Note to Rice Chapter 8

Objectives of Multiple Regression

STA 108 Applied Linear Models: Regression Analysis Spring Solution for Homework #1

Estimation of Stress- Strength Reliability model using finite mixture of exponential distributions

THE ROYAL STATISTICAL SOCIETY 2016 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE MODULE 5

Multiple Choice Test. Chapter Adequacy of Models for Regression

The number of observed cases The number of parameters. ith case of the dichotomous dependent variable. the ith case of the jth parameter

Bounds on the expected entropy and KL-divergence of sampled multinomial distributions. Brandon C. Roy

THE ROYAL STATISTICAL SOCIETY 2016 EXAMINATIONS SOLUTIONS GRADUATE DIPLOMA MODULE 2

Chapter 4 (Part 1): Non-Parametric Classification (Sections ) Pattern Classification 4.3) Announcements

Logistic regression (continued)

2.28 The Wall Street Journal is probably referring to the average number of cubes used per glass measured for some population that they have chosen.

best estimate (mean) for X uncertainty or error in the measurement (systematic, random or statistical) best

Wu-Hausman Test: But if X and ε are independent, βˆ. ECON 324 Page 1

Econometric Methods. Review of Estimation

Introduction to local (nonparametric) density estimation. methods

COV. Violation of constant variance of ε i s but they are still independent. The error term (ε) is said to be heteroscedastic.

Lecture 1 Review of Fundamental Statistical Concepts

Probability and. Lecture 13: and Correlation

Chapter 4 Multiple Random Variables

Arithmetic Mean and Geometric Mean

22 Nonparametric Methods.

Lecture 3. Sampling, sampling distributions, and parameter estimation

Solving Constrained Flow-Shop Scheduling. Problems with Three Machines

Multiple Regression. More than 2 variables! Grade on Final. Multiple Regression 11/21/2012. Exam 2 Grades. Exam 2 Re-grades

Statistics MINITAB - Lab 5

Chapter 3 Sampling For Proportions and Percentages

Chapter 13 Student Lecture Notes 13-1

Continuous Distributions

X X X E[ ] E X E X. is the ()m n where the ( i,)th. j element is the mean of the ( i,)th., then

Qualifying Exam Statistical Theory Problem Solutions August 2005

CHAPTER 3 POSTERIOR DISTRIBUTIONS

Statistics: Unlocking the Power of Data Lock 5

: At least two means differ SST

Simulation Output Analysis

Simple Linear Regression Analysis

Statistical modelling and latent variables (2)

ρ < 1 be five real numbers. The

9 U-STATISTICS. Eh =(m!) 1 Eh(X (1),..., X (m ) ) i.i.d

Simple Linear Regression

Comparison of Parameters of Lognormal Distribution Based On the Classical and Posterior Estimates

Unimodality Tests for Global Optimization of Single Variable Functions Using Statistical Methods

Linear Regression with One Regressor

i 2 σ ) i = 1,2,...,n , and = 3.01 = 4.01

residual. (Note that usually in descriptions of regression analysis, upper-case

Chapter 4 Multiple Random Variables

C-1: Aerodynamics of Airfoils 1 C-2: Aerodynamics of Airfoils 2 C-3: Panel Methods C-4: Thin Airfoil Theory

Chapter 2 Supplemental Text Material

Linear Regression Linear Regression with Shrinkage. Some slides are due to Tommi Jaakkola, MIT AI Lab

Chapter Business Statistics: A First Course Fifth Edition. Learning Objectives. Correlation vs. Regression. In this chapter, you learn:

CHAPTER 6. d. With success = observation greater than 10, x = # of successes = 4, and

6.867 Machine Learning

Training Sample Model: Given n observations, [[( Yi, x i the sample model can be expressed as (1) where, zero and variance σ

4. Standard Regression Model and Spatial Dependence Tests

Transcription:

LINEAR REGRESSION ANALYSIS MODULE V Lecture - Correctg Model Iadequaces Through Trasformato ad Weghtg Dr. Shalabh Departmet of Mathematcs ad Statstcs Ida Isttute of Techology Kapur

Aalytcal methods for selectg a trasformato o study varable The Box-Cox method Suppose the ormalty ad/or costat varace of study varable y ca be corrected through a power trasformato o y. Ths meas y s to be trasformed as y where s the parameter to be determed. For example, f 0.5, the the trasformato s square root ad y s used as study varable place of y. Now the lear regresso model has parameters smultaeously the βσ, ad. Box ad Cox method tells how to estmate ad parameters of the model usg the method of maxmum lkelhood. y 0 Note that as approaches zero, approaches to. So there s a problem at because ths makes all the observato y to be uty. It s meagless that all the observato o study varable are costat. So there s a y dscotuty at. Oe approach to solve ths dffculty s to use as a study varable. 0 Note that as y 0, l y y for 0 W l y for 0.. So a possble soluto s to use the trasformed study varable as

3, So famly W s cotuous. Stll t has a drawback. As chages, the value of W chage dramatcally. So t s dffcult to obta the best value of. If dfferet aalyst obta dfferet values of the t wll ft dfferet models. It may the ot be approprate to compare the models wth dfferet values of y for 0 ( ) y V y* y* l y for 0 y y yy y / * (... ) where y * s the geometrc mea of s as whch s costat.. So t s preferable to use a alteratve form For calculato purpose, we ca use l y* l y. Whe V s appled to each y, we get we use t to ft a lear model V ( V V V ) V Xβ + ε,,..., ' as a vector of observato o trasformed study varable ad usg least squares or maxmum lkelhood method. The quatty y the deomator s related to the th power of Jacoba of trasformato. See how: *

4 We wat to covert y to Let ( ) y ( ) y y W ; 0. ( ) as y y, y,..., y ', W ( W, W,..., W )'. Note that f W W y W y y, the y y 0. I geeral, W y f j y j 0 f j. The Jacoba of trasformato s gve by y J( y W). W W y y

5 W W W y y y y y W W W JW ( y) y y y W W W y y y y y Y J( y W) y. JW ( ) 0 0 0 0 0 0 0 0 0 y Sce ths s a Jacoba whe we wat to trasform the whole vector y to whole vector W. If a dvdual y s to be trasform to W, the take ts geometrc mea as The quatty JY ( W) y J( y W) y. esures that ut volume s preserved movg from the set of y to the set of V. Ths s a factor whch scales ad esures that the resdual sum of squares obtaed from dfferet values of ca be compared.

6 To fd the approprate famly, cosder where ( ) y V X β + ε y y, ~ N (0, I ). ( ) ε σ y * Applyg method of maxmum lkelhood for lkelhood fucto for ( y ), ( ) ε L y exp πσ σ εε ' exp πσ σ ( ) ( ) ( y Xβ)'( y Xβ) exp πσ σ σ ( ) ( ) ( ) ( y Xβ)'( y Xβ) l L y l σ ( gorg costat).

7 Solvg ( ) l L y β ( ) l L y σ gves the maxmum lkelhood estmators ( ) ˆ( ) ( ' ) X X X ' β y 0 0 for a gve value of. ( ) ( ) y ' Hy ˆ σ ( ) y ' I X( X ' X) X ' y ( ) ( ) Substtutg these estmates the log lkelhood fucto L( ) l ˆ σ l res( ) [ SS ] where s the sum of squares due to resduals whch s a fucto of. Now maxmze wth respect to It s dffcult to obta ay closed form of the estmator of ( ) l L y SS ( ) re s L( ).. So we maxmze t umercally. gves The fucto l res( ) [ SS ] s called as the Box-Cox objectve fucto.

8 Let be the value of whch maxmzes the Box-Cox objectve fucto. The uder farly geeral codtos, for ay other max has approxmately [ ] [ ] l SS ( ) l SS ( ) re s re s max χ () Ths s explaed as follows: dstrbuto. Ths result s based o the large sample behavour of the lkelhood rato statstc. The lkelhood rato test statstc our case s Max L Ωo η η Max L Ω Max Ω o σ Max Ω σ ˆ σ ( ) ˆ σ ( max ) / SSres( ) / SSres( max )

9 lη SS ( ) re s max l SSres( ) SS ( ) res lη l SSres( max ) l re s( ) l re s( max ) [ SS ] [ SS ] L( ) + L( ) max where L( ) l re s( ) [ SS ] L( max ) l [ SSre s( max )]. lη χ () Sce uder certa regularty codtos, coverges dstrbuto to whe the ull hypothess s true, so l η ~ χ () χ () or l η ~ χ () or L( max ) L( )~.