STAT 3008 Applied Regression Analysis
|
|
- Polly Dalton
- 6 years ago
- Views:
Transcription
1 STAT 3008 Appled Regresson Analyss Tutoral : Smple Lnear Regresson LAI Chun He Department of Statstcs, The Chnese Unversty of Hong Kong 1 Model Assumpton To quantfy the relatonshp between two factors, say X and Y, we have to at least assume the type of relatonshp they have, could t be lnear, logarthm, quadratc, etc. In the experment, the factor Y s lkely to be affected by the factor X. In usual experment context, X and Y are known as ndependent and dependent factors respectvely. But n lnear regresson, X and Y are known as predctor and response respectvely. Now, we state our model assumpton: Smple Lnear Regresson Model The model relatng the two factors are assumed to be: y = Y X=x = β 0 + β 1 x + e where Ee = 0, Vare = σ and e s are..d. Therefore, we have EY X = x = β 0 + β 1 x VarY X = x = σ Remark 1.1. The values x here are known constants, nstead of some realsed observatons from a random varable X. For regresson of random predctors, you may refer to [1]. Remark 1.. It should be notced that the response y s a random varable. The data set {x, y } n =1 conssts of realsed values from the random varables. Least Square Estmator Essentally, we want to ft a straght lne to the set of ponts on the Cartesan plane. However, there are many ways to defne good n a ft. The smplest way s to consder the total vertcal dstance between the ponts and the lne. The best ft lne s therefore the lne whch mnmses the dstance,.e. whch mnmses n RSSβ 0, β 1 = [y β 0 + β 1 x ] Mnmsng the total dstance s equvalent to mnmsng the total squared dstance. Hence, we have the Least Square Method. By elementary multvarate calculus and statstcal concepts, the dervaton performed n the lesson yelds the followng results. s @sta.cuhk.edu.hk 1
2 LEAST SQUARE ESTIMATOR Least Squares Estmator If we defne the followng notatons, x = 1 n ȳ = 1 n x y = SYY = SXY = x x = x n x y ȳ = y n ȳ x xy ȳ = x y n x ȳ then the estmators are gven by ˆβ 0 = ȳ ˆβ 1 x ˆβ 1 = x xy ȳ x x ˆσ = ê = SXY n = SYY SXY /. n Under our estmated model, we therefore have the ftted values gven x and the resdual,.e. the dfference between the ftted value and the realsed value. ŷ = ÊY X = x = ˆβ 0 + ˆβ 1 x ê = y ŷ = y ˆβ 0 ˆβ 1 x Remark.1. For the detaled dervaton, please refer to the lecture notes. You should be famlar wth ther dervaton as they may be tested n mdterm and fnal exam. Remark.. ˆβ0 and ˆβ 1 can be wrtten as lnear combnatons of y : [ x x 1 n x ˆβ 1 = y and ˆβ0 = ] x x y. Ths s useful when dervng the dstrbuton and consstency of the estmators n Exercse 4.1. Remark.3. By the dervatve condton of RSS wth respect to β 0, we have ê = 0 and ȳ = ˆβ 0 + ˆβ 1 x. Exercse Fall Mdterm #3 Use the smple lnear regresson model to ft a straght lne on two data ponts:, 4, 1, 3. What are the values of ˆβ 0 and ˆβ 1?
3 LEAST SQUARE ESTIMATOR Exercse.. Show that x ê = 0 and, therefore ŷ ê = 0. Exercse.3. Show that ˆσ s an unbased estmator of σ,.e. E ˆσ = σ. 3
4 3 ANALYSIS OF VARIANCEANOVA 3 Analyss of VaranceANOVA It s natural that we are nterested n whether our model assumpton s correct. The basc queston wll be f Y s really related to X. Mathematcally, ths queston s equvalent to askng whether β 1 s zero n our model. The most common way s the Analyss of Varance, whch compares two models of dfferent mean functons. We want to test the followng hypothess: H 0 : EY X = x = β 0 vs H 1 : EY X = x = β 0 + β 1 x. Therefore, we need some test statstcs related to these two hypothess and, most mportantly, of known dstrbutons. Consder the resdual sum of square for the two models. For H 0, we have ÊY X = x = ˆβ 0 = ȳ RSS H0 = y ȳ = SYY. On the other hand, for H 1, we have RSS H1 = [ y ˆβ0 + ˆβ ] SXY 1 x = SYY. Snce we use more varables to ft the ponts n H 1, t must be true that RSS H0 RSS H1. Therefore, H 1 s vald only when RSS H0 RSS H1. For easy comparson, we defne the Sum of Square due to Regresson as SSreg = RSS H0 RSS H1 Equvalently, H 1 s vald only when SSreg 0. = SXY. 3.1 Dstrbutons of Estmators To defne how large s large, we need the dstrbutons as well so that we can defne large n a probablstc sense. Under H 0, the dstrbuton of the sum of squares are gven below, By smple algebra, we rewrte By Central Lmt Theorem, we have SSreg = SXY = [ x x y ]. x x y N 0, σ. Therefore, we know SSreg /σ χ 1. Also, t s known that n ˆσ /σ χ n. The test statstc s thus gven by F = SSreg /σ 1 n σ ˆ H 1 /σ n = SSreg ˆσ H 1 F 1, n. For sgnfcance level α, we reject H 0 f the p-value s smaller than F 1 α 1, n. Remark 3.1. That n ˆσ /σ χ n s due to a fact that the degree of freedom s the number of values n the statstcs that are free to vary. The detaled proof of the dstrbuton of n ˆσ /σ ncludes the use of quadratc forms, whch s beyond the scope. Interested students may refer to []. 4
5 3. ANOVA Table 3 ANALYSIS OF VARIANCEANOVA Remark 3.. It should be notced that the ˆσ here s under the model n H 1. The reason of usng ths nstead of ˆσ under H 0 s examned n Exercse 3.1. Exercse 3.1. What s the estmator of σ under H 0? Explan why the use of t n the denomnator makes no sense. Exercse 3.. Show that y ȳ = y ŷ + ŷ ȳ 3. ANOVA Table Thanks to the result of Exercse 3., we have a neat and tdy representaton of ANOVA, whch s called the ANOVA table. Source df SS MS F p-value Regresson 1 SSreg SSreg /1 SSreg / ˆσ P F 1,n 1 > F Resdual n 1 RSS H1 ˆσ = RSS /n Total n SYY You should be extremely famlar wth the above table because t appears n every mdterm. We wll practce ths n Exercse 4.3 and 4.4. Remark 3.3. Therefore, we can defne the Coeffcent of Determnaton to be R = SSreg [0, 1]. SYY The realsed value summarses the strength of relatonshp between the sampled response and predctors. 5
6 4 INTERVALS, TESTS AND BAND 4 Intervals, Tests and Band Besdes testng the mean functons by ANOVA, we wll also want to perform test on ndvdual parameters. Therefore, we need the dstrbutons of the estmator. We begn ths secton wth an exercse. Exercse 4.1. Prove that ˆβ 0 and ˆβ 1 are unbased and fnd ther varance and asymptotc dstrbuton. 4.1 Confdence Intervals and Tests for Intercept and Slope Wth the dstrbutons of the estmator and some facts n statstcs, we can construct the test statstcs from the dstrbuton derved n Exercse 4.1. Confdence Interval and Test for Intercept If we want to test whether the ntercept s a certan value β 0,.e. H 0 : β 0 = β 0 vs H 1 : β 0 β 0, then the test statstc s t = ˆβ 0 β0 se ˆβ 0 tn where se ˆβ 1 0 = ˆσ n + x. Therefore, for sgnfcance level α, we reject H 0 when t > t 1 α n. Also, the 1 α 100% confdence nterval of β 0 s gven by ˆβ 0 t 1 α n se ˆβ 0 β 0 ˆβ 0 +t 1 α n se ˆβ 0. Confdence Interval and Test for Slope Smlarly, for the test of slope,.e. the test statstc s H 0 : β 1 = β 1 vs H 1 : β 1 β 1, t = ˆβ 1 β1 se ˆβ 1 tn where se ˆβ ˆσ 0 =. Therefore, for sgnfcance level α, we reject H 0 when t > t 1 α n. Also, the 1 α 100% confdence nterval of β 1 s gven by ˆβ 1 t 1 α n se ˆβ 1 β 1 ˆβ 1 +t 1 α n se ˆβ 1. 6
7 4.1 Confdence Intervals and Tests for Intercept and Slope 4 INTERVALS, TESTS AND BAND Remark 4.1. Obvously, a test of zero slope,.e. s equvalent to testng H 0 : β 1 = 0 vs H 1 : β 1 0, H 0 : EY X = x = β 0 vs H 1 : EY X = x = β 0 + β 1 x, whch s our ANOVA F-test n Secton 3. Therefore, they should gve the same result. Mathematcally, f we look at the t-statstcs, t = t = ˆβ 1 0 se ˆβ 1 = ˆβ1 ˆσ/ ˆβ 1 ˆσ / = ˆβ 1 = SXY ˆσ ˆσ = SSreg = F. ˆσ In general, we have F 1, m = χ 1 χ m/m = Z χ m/m = Z = tm χ m/m Exercse 4.. Construct a 95% confdence nterval for the slope from the data set {1, 1, 4, 9, 10, 10}, gven t = Bosco argues that the confdence nterval you construct has a 95% probablty of ncludng the true slope. Explan whether he s correct. Exercse Fall Mdterm #1 Fll n the mssng values n the followng tables of regresson output from a data set of sze 100. ANOVA Table Source df SS MS F Regresson Resdual Total Coeffcent Table Varable Coeffcent s.e. t-statstcs p-value Constant X n = ˆσ = R =
8 4.1 Confdence Intervals and Tests for Intercept and Slope 4 INTERVALS, TESTS AND BAND Exercse Sprng Mdterm #1 Fll n the mssng values n the followng tables of regresson output. In R, t s found that qf1 9.5e 9, 1, 6 = Also, x = 5.15, ȳ = , = ANOVA Table Source df SS MS F p-value Regresson 9.5e-09 Resdual Total Coeffcent Table Varable Coeffcent s.e. t-statstcs p-value Constant X n = ˆσ = R = 8
9 4. Confdence and Predcton Intervals 4 INTERVALS, TESTS AND BAND 4. Confdence and Predcton Intervals Besdes the true ntercept and slope, we are often nterested n the mean of response gven a predctor x,.e. EY X = x. Ths can be constructed from the ftted value ŷ snce t s an unbased estmator of the mean. For example, we consder the mean IQ of a student who scores 98 n the mdterm. On the other hand, nstead of the mean, we may also be nterested n the response tself,.e. y X=x. In ths case, we want to make a predcton on what the outcome wll be, gven x. Here, for example, we are lookng for the IQ of a student who scores 98 n the mdterm. Therefore, we want to construct ntervals for the mean and the predcton. The results are lsted below. Confdence Interval for Mean Gven a predctor value x, the true value and estmaton of the mean are respectvely EY X = x = β 0 + β 1 x and ŷ = ÊY X = x = ˆβ 0 + ˆβ 1 x. The estmaton uncertanty of the mean s Var ŷ EY X = x = Var ˆβ 0 + ˆβ 1 x = σ 1 n + x x Defne the standard error of ft as. 1 seftŷ x = ˆσ n + x x n ˆσ σ = n seftŷ x 1 n + x x χ n. Therefore, the 1 α 100% confdence nterval for the mean s gven by ŷ t 1 α seftŷ x EY X = x ŷ + t 1 α seftŷ x. Predcton Interval for Response Gven a predctor value x, the response and ts estmaton are respectvely y X=x = β 0 + β 1 x + e and ŷ = ˆβ 0 + ˆβ 1 x. The estmaton uncertanty of the predcton s Var ŷ y X=x = Var ˆβ0 + ˆβ 1 x + e = σ n + x x. Defne the standard error of predcton as sepredŷ x = ˆσ x x + n n ˆσ σ = n seftŷ x n + χ n. x x Therefore, the 1 α 100% confdence nterval for the mean s gven by ŷ t 1 α sepredŷ x y X=x ŷ + t 1 α sepredŷ x. Remark 4.. The estmaton uncertanty of predcton and mean dffers only by σ. The extra uncertanty comes from the error term n the new observaton that we wants to predct. Compare y X=x = β 0 + β 1 x + e and EY X = x = β 0 + β 1 x. Due to the extra uncertanty, predcton nterval ncludes and s larger than the confdence nterval for mean. 9
10 4.3 Confdence Band 4 INTERVALS, TESTS AND BAND 4.3 Confdence Band In the prevous subsecton, we construct confdence nterval of mean for a certan pont x. It s temptng to connect all the upper lmts and lower lmts of confdence ntervals,.e. ˆβ 0 + ˆβ 1 x ± t 1 α seftŷ x, x and say that ths random band has a 1 α 100% probablty of ncludng the true mean lne EY X = x = β 0 + β 1 x. However, ths s wrong see Remark 4.3 and Exercse 4.5. The correct band s gven below. Confdence Band for Mean Functon The 1 α 100% confdence band of the mean functon s gven by Cx = ˆβ 0 + ˆβ 1 x ± F 1 α, n seftŷ x, x. Therefore, t s true that PrThe mean lne les n the confdence band = Pr x, EY X = x Cx = 1 α. Remark 4.3. For confdence nterval Cx = ˆβ 0 + ˆβ 1 x ± t 1 α seftŷ x, we have by defnton x, Pr EY X = x Cx = 1 α. Ths relatonshp holds for each pont,.e. pontwse. Whle for the confdence band, we have Pr x, EY X = x Cx = 1 α. Here, the ncluson s for the entre lne. The two cases are dfferent. Exercse 4.5. Explan, why t s wrong to say the band, ˆβ 0 + ˆβ 1 x ± t 1 α seftŷ x, x has a 1 α 100% probablty of ncludng the mean lne EY X = x = β 0 + β 1 x. 10
11 REFERENCES Exercse 4.6. For the data set {1, 1, 4, 9, 10, 10}, construct 1. a 95% confdence nterval and a 95% nterval for the pont x = 3, and. a 95% confdence band. 3. What s the value of the band when x = 3? You are gven t = and F 0.95, 1 = Resduals To check whether our model assumpton s vald, a good way s to look at the resdual plot. Recall that the resduals ê = y ŷ = y ˆβ 0 ˆβ 1 x so ths gves Eê = Ey β 0 β 1 x = 0 and Varê = σ n + x x. The data set gves a set of realsed ê. Accordng to our observaton above, these realsed resduals should have mean close to zero, and have constant varance for all value x. A plot that satsfes the above crtera s a null plot, whch ndcates that the model assumpton s vald and the regresson s a good ft. 6 Appendx For more reference, you may refer to the followng text books. References [1] Douglas C. MONTGOMERY, Elzabeth A. PECK and G. Geoffrey VINING 006. Introducton to Lnear Regresson Analyss, Wley. [] Robert V. HOGG and Allen T. CRAIG Introducton to Mathematcal Statstcs, Pearson. 11
Chapter 11: Simple Linear Regression and Correlation
Chapter 11: Smple Lnear Regresson and Correlaton 11-1 Emprcal Models 11-2 Smple Lnear Regresson 11-3 Propertes of the Least Squares Estmators 11-4 Hypothess Test n Smple Lnear Regresson 11-4.1 Use of t-tests
More informationEconomics 130. Lecture 4 Simple Linear Regression Continued
Economcs 130 Lecture 4 Contnued Readngs for Week 4 Text, Chapter and 3. We contnue wth addressng our second ssue + add n how we evaluate these relatonshps: Where do we get data to do ths analyss? How do
More information1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands
Content. Inference on Regresson Parameters a. Fndng Mean, s.d and covarance amongst estmates.. Confdence Intervals and Workng Hotellng Bands 3. Cochran s Theorem 4. General Lnear Testng 5. Measures of
More informationDepartment of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6
Department of Quanttatve Methods & Informaton Systems Tme Seres and Ther Components QMIS 30 Chapter 6 Fall 00 Dr. Mohammad Zanal These sldes were modfed from ther orgnal source for educatonal purpose only.
More informationInterval Estimation in the Classical Normal Linear Regression Model. 1. Introduction
ECONOMICS 35* -- NOTE 7 ECON 35* -- NOTE 7 Interval Estmaton n the Classcal Normal Lnear Regresson Model Ths note outlnes the basc elements of nterval estmaton n the Classcal Normal Lnear Regresson Model
More informationNANYANG TECHNOLOGICAL UNIVERSITY SEMESTER I EXAMINATION MTH352/MH3510 Regression Analysis
NANYANG TECHNOLOGICAL UNIVERSITY SEMESTER I EXAMINATION 014-015 MTH35/MH3510 Regresson Analyss December 014 TIME ALLOWED: HOURS INSTRUCTIONS TO CANDIDATES 1. Ths examnaton paper contans FOUR (4) questons
More informationStatistics for Economics & Business
Statstcs for Economcs & Busness Smple Lnear Regresson Learnng Objectves In ths chapter, you learn: How to use regresson analyss to predct the value of a dependent varable based on an ndependent varable
More informationStatistics MINITAB - Lab 2
Statstcs 20080 MINITAB - Lab 2 1. Smple Lnear Regresson In smple lnear regresson we attempt to model a lnear relatonshp between two varables wth a straght lne and make statstcal nferences concernng that
More informationComparison of Regression Lines
STATGRAPHICS Rev. 9/13/2013 Comparson of Regresson Lnes Summary... 1 Data Input... 3 Analyss Summary... 4 Plot of Ftted Model... 6 Condtonal Sums of Squares... 6 Analyss Optons... 7 Forecasts... 8 Confdence
More informationChapter 13: Multiple Regression
Chapter 13: Multple Regresson 13.1 Developng the multple-regresson Model The general model can be descrbed as: It smplfes for two ndependent varables: The sample ft parameter b 0, b 1, and b are used to
More informationLecture 6: Introduction to Linear Regression
Lecture 6: Introducton to Lnear Regresson An Manchakul amancha@jhsph.edu 24 Aprl 27 Lnear regresson: man dea Lnear regresson can be used to study an outcome as a lnear functon of a predctor Example: 6
More informationChapter 14 Simple Linear Regression
Chapter 4 Smple Lnear Regresson Chapter 4 - Smple Lnear Regresson Manageral decsons often are based on the relatonshp between two or more varables. Regresson analss can be used to develop an equaton showng
More informationa. (All your answers should be in the letter!
Econ 301 Blkent Unversty Taskn Econometrcs Department of Economcs Md Term Exam I November 8, 015 Name For each hypothess testng n the exam complete the followng steps: Indcate the test statstc, ts crtcal
More informationECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics
ECOOMICS 35*-A Md-Term Exam -- Fall Term 000 Page of 3 pages QUEE'S UIVERSITY AT KIGSTO Department of Economcs ECOOMICS 35* - Secton A Introductory Econometrcs Fall Term 000 MID-TERM EAM ASWERS MG Abbott
More informationSTAT 3340 Assignment 1 solutions. 1. Find the equation of the line which passes through the points (1,1) and (4,5).
(out of 15 ponts) STAT 3340 Assgnment 1 solutons (10) (10) 1. Fnd the equaton of the lne whch passes through the ponts (1,1) and (4,5). β 1 = (5 1)/(4 1) = 4/3 equaton for the lne s y y 0 = β 1 (x x 0
More informationLinear Regression Analysis: Terminology and Notation
ECON 35* -- Secton : Basc Concepts of Regresson Analyss (Page ) Lnear Regresson Analyss: Termnology and Notaton Consder the generc verson of the smple (two-varable) lnear regresson model. It s represented
More informationLecture 4 Hypothesis Testing
Lecture 4 Hypothess Testng We may wsh to test pror hypotheses about the coeffcents we estmate. We can use the estmates to test whether the data rejects our hypothess. An example mght be that we wsh to
More information[The following data appear in Wooldridge Q2.3.] The table below contains the ACT score and college GPA for eight college students.
PPOL 59-3 Problem Set Exercses n Smple Regresson Due n class /8/7 In ths problem set, you are asked to compute varous statstcs by hand to gve you a better sense of the mechancs of the Pearson correlaton
More informationStatistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation
Statstcs for Managers Usng Mcrosoft Excel/SPSS Chapter 13 The Smple Lnear Regresson Model and Correlaton 1999 Prentce-Hall, Inc. Chap. 13-1 Chapter Topcs Types of Regresson Models Determnng the Smple Lnear
More informationBasic Business Statistics, 10/e
Chapter 13 13-1 Basc Busness Statstcs 11 th Edton Chapter 13 Smple Lnear Regresson Basc Busness Statstcs, 11e 009 Prentce-Hall, Inc. Chap 13-1 Learnng Objectves In ths chapter, you learn: How to use regresson
More informationFirst Year Examination Department of Statistics, University of Florida
Frst Year Examnaton Department of Statstcs, Unversty of Florda May 7, 010, 8:00 am - 1:00 noon Instructons: 1. You have four hours to answer questons n ths examnaton.. You must show your work to receve
More informationStatistics for Business and Economics
Statstcs for Busness and Economcs Chapter 11 Smple Regresson Copyrght 010 Pearson Educaton, Inc. Publshng as Prentce Hall Ch. 11-1 11.1 Overvew of Lnear Models n An equaton can be ft to show the best lnear
More informationBiostatistics 360 F&t Tests and Intervals in Regression 1
Bostatstcs 360 F&t Tests and Intervals n Regresson ORIGIN Model: Y = X + Corrected Sums of Squares: X X bar where: s the y ntercept of the regresson lne (translaton) s the slope of the regresson lne (scalng
More informationLecture 3 Stat102, Spring 2007
Lecture 3 Stat0, Sprng 007 Chapter 3. 3.: Introducton to regresson analyss Lnear regresson as a descrptve technque The least-squares equatons Chapter 3.3 Samplng dstrbuton of b 0, b. Contnued n net lecture
More informationLecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding
Recall: man dea of lnear regresson Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 8 Lnear regresson can be used to study an
More informationLecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding
Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 008 Recall: man dea of lnear regresson Lnear regresson can be used to study
More informationLecture Notes for STATISTICAL METHODS FOR BUSINESS II BMGT 212. Chapters 14, 15 & 16. Professor Ahmadi, Ph.D. Department of Management
Lecture Notes for STATISTICAL METHODS FOR BUSINESS II BMGT 1 Chapters 14, 15 & 16 Professor Ahmad, Ph.D. Department of Management Revsed August 005 Chapter 14 Formulas Smple Lnear Regresson Model: y =
More informationProperties of Least Squares
Week 3 3.1 Smple Lnear Regresson Model 3. Propertes of Least Squares Estmators Y Y β 1 + β X + u weekly famly expendtures X weekly famly ncome For a gven level of x, the expected level of food expendtures
More informationIntroduction to Regression
Introducton to Regresson Dr Tom Ilvento Department of Food and Resource Economcs Overvew The last part of the course wll focus on Regresson Analyss Ths s one of the more powerful statstcal technques Provdes
More informationwhere I = (n x n) diagonal identity matrix with diagonal elements = 1 and off-diagonal elements = 0; and σ 2 e = variance of (Y X).
11.4.1 Estmaton of Multple Regresson Coeffcents In multple lnear regresson, we essentally solve n equatons for the p unnown parameters. hus n must e equal to or greater than p and n practce n should e
More informationEcon Statistical Properties of the OLS estimator. Sanjaya DeSilva
Econ 39 - Statstcal Propertes of the OLS estmator Sanjaya DeSlva September, 008 1 Overvew Recall that the true regresson model s Y = β 0 + β 1 X + u (1) Applyng the OLS method to a sample of data, we estmate
More informationSTATISTICS QUESTIONS. Step by Step Solutions.
STATISTICS QUESTIONS Step by Step Solutons www.mathcracker.com 9//016 Problem 1: A researcher s nterested n the effects of famly sze on delnquency for a group of offenders and examnes famles wth one to
More informationEcon107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4)
I. Classcal Assumptons Econ7 Appled Econometrcs Topc 3: Classcal Model (Studenmund, Chapter 4) We have defned OLS and studed some algebrac propertes of OLS. In ths topc we wll study statstcal propertes
More informationRegression Analysis. Regression Analysis
Regresson Analyss Smple Regresson Multvarate Regresson Stepwse Regresson Replcaton and Predcton Error 1 Regresson Analyss In general, we "ft" a model by mnmzng a metrc that represents the error. n mn (y
More informationx yi In chapter 14, we want to perform inference (i.e. calculate confidence intervals and perform tests of significance) in this setting.
The Practce of Statstcs, nd ed. Chapter 14 Inference for Regresson Introducton In chapter 3 we used a least-squares regresson lne (LSRL) to represent a lnear relatonshp etween two quanttatve explanator
More informationY = β 0 + β 1 X 1 + β 2 X β k X k + ε
Chapter 3 Secton 3.1 Model Assumptons: Multple Regresson Model Predcton Equaton Std. Devaton of Error Correlaton Matrx Smple Lnear Regresson: 1.) Lnearty.) Constant Varance 3.) Independent Errors 4.) Normalty
More informationChapter 9: Statistical Inference and the Relationship between Two Variables
Chapter 9: Statstcal Inference and the Relatonshp between Two Varables Key Words The Regresson Model The Sample Regresson Equaton The Pearson Correlaton Coeffcent Learnng Outcomes After studyng ths chapter,
More informationStatistics II Final Exam 26/6/18
Statstcs II Fnal Exam 26/6/18 Academc Year 2017/18 Solutons Exam duraton: 2 h 30 mn 1. (3 ponts) A town hall s conductng a study to determne the amount of leftover food produced by the restaurants n the
More informationChapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise.
Chapter - The Smple Lnear Regresson Model The lnear regresson equaton s: where y + = β + β e for =,..., y and are observable varables e s a random error How can an estmaton rule be constructed for the
More informationTopic 7: Analysis of Variance
Topc 7: Analyss of Varance Outlne Parttonng sums of squares Breakdown the degrees of freedom Expected mean squares (EMS) F test ANOVA table General lnear test Pearson Correlaton / R 2 Analyss of Varance
More information/ n ) are compared. The logic is: if the two
STAT C141, Sprng 2005 Lecture 13 Two sample tests One sample tests: examples of goodness of ft tests, where we are testng whether our data supports predctons. Two sample tests: called as tests of ndependence
More informationLecture 16 Statistical Analysis in Biomaterials Research (Part II)
3.051J/0.340J 1 Lecture 16 Statstcal Analyss n Bomaterals Research (Part II) C. F Dstrbuton Allows comparson of varablty of behavor between populatons usng test of hypothess: σ x = σ x amed for Brtsh statstcan
More informationx = , so that calculated
Stat 4, secton Sngle Factor ANOVA notes by Tm Plachowsk n chapter 8 we conducted hypothess tests n whch we compared a sngle sample s mean or proporton to some hypotheszed value Chapter 9 expanded ths to
More informatione i is a random error
Chapter - The Smple Lnear Regresson Model The lnear regresson equaton s: where + β + β e for,..., and are observable varables e s a random error How can an estmaton rule be constructed for the unknown
More informationNegative Binomial Regression
STATGRAPHICS Rev. 9/16/2013 Negatve Bnomal Regresson Summary... 1 Data Input... 3 Statstcal Model... 3 Analyss Summary... 4 Analyss Optons... 7 Plot of Ftted Model... 8 Observed Versus Predcted... 10 Predctons...
More informationCorrelation and Regression. Correlation 9.1. Correlation. Chapter 9
Chapter 9 Correlaton and Regresson 9. Correlaton Correlaton A correlaton s a relatonshp between two varables. The data can be represented b the ordered pars (, ) where s the ndependent (or eplanator) varable,
More informationDr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur
Analyss of Varance and Desgn of Experment-I MODULE VII LECTURE - 3 ANALYSIS OF COVARIANCE Dr Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur Any scentfc experment s performed
More informationLearning Objectives for Chapter 11
Chapter : Lnear Regresson and Correlaton Methods Hldebrand, Ott and Gray Basc Statstcal Ideas for Managers Second Edton Learnng Objectves for Chapter Usng the scatterplot n regresson analyss Usng the method
More information17 - LINEAR REGRESSION II
Topc 7 Lnear Regresson II 7- Topc 7 - LINEAR REGRESSION II Testng and Estmaton Inferences about β Recall that we estmate Yˆ ˆ β + ˆ βx. 0 μ Y X x β0 + βx usng To estmate σ σ squared error Y X x ε s ε we
More informationDr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur
Analyss of Varance and Desgn of Experment-I MODULE VIII LECTURE - 34 ANALYSIS OF VARIANCE IN RANDOM-EFFECTS MODEL AND MIXED-EFFECTS EFFECTS MODEL Dr Shalabh Department of Mathematcs and Statstcs Indan
More informationSTAT 405 BIOSTATISTICS (Fall 2016) Handout 15 Introduction to Logistic Regression
STAT 45 BIOSTATISTICS (Fall 26) Handout 5 Introducton to Logstc Regresson Ths handout covers materal found n Secton 3.7 of your text. You may also want to revew regresson technques n Chapter. In ths handout,
More informationx i1 =1 for all i (the constant ).
Chapter 5 The Multple Regresson Model Consder an economc model where the dependent varable s a functon of K explanatory varables. The economc model has the form: y = f ( x,x,..., ) xk Approxmate ths by
More information4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA
4 Analyss of Varance (ANOVA) 5 ANOVA 51 Introducton ANOVA ANOVA s a way to estmate and test the means of multple populatons We wll start wth one-way ANOVA If the populatons ncluded n the study are selected
More informationProfessor Chris Murray. Midterm Exam
Econ 7 Econometrcs Sprng 4 Professor Chrs Murray McElhnney D cjmurray@uh.edu Mdterm Exam Wrte your answers on one sde of the blank whte paper that I have gven you.. Do not wrte your answers on ths exam.
More informationβ0 + β1xi. You are interested in estimating the unknown parameters β
Ordnary Least Squares (OLS): Smple Lnear Regresson (SLR) Analytcs The SLR Setup Sample Statstcs Ordnary Least Squares (OLS): FOCs and SOCs Back to OLS and Sample Statstcs Predctons (and Resduals) wth OLS
More informationReminder: Nested models. Lecture 9: Interactions, Quadratic terms and Splines. Effect Modification. Model 1
Lecture 9: Interactons, Quadratc terms and Splnes An Manchakul amancha@jhsph.edu 3 Aprl 7 Remnder: Nested models Parent model contans one set of varables Extended model adds one or more new varables to
More informationRegression. The Simple Linear Regression Model
Regresson Smple Lnear Regresson Model Least Squares Method Coeffcent of Determnaton Model Assumptons Testng for Sgnfcance Usng the Estmated Regresson Equaton for Estmaton and Predcton Resdual Analss: Valdatng
More informationPolynomial Regression Models
LINEAR REGRESSION ANALYSIS MODULE XII Lecture - 6 Polynomal Regresson Models Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur Test of sgnfcance To test the sgnfcance
More informationANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U)
Econ 413 Exam 13 H ANSWERS Settet er nndelt 9 deloppgaver, A,B,C, som alle anbefales å telle lkt for å gøre det ltt lettere å stå. Svar er gtt . Unfortunately, there s a prntng error n the hnt of
More informationTHE ROYAL STATISTICAL SOCIETY 2006 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE
THE ROYAL STATISTICAL SOCIETY 6 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE PAPER I STATISTICAL THEORY The Socety provdes these solutons to assst canddates preparng for the eamnatons n future years and for
More informationBiostatistics. Chapter 11 Simple Linear Correlation and Regression. Jing Li
Bostatstcs Chapter 11 Smple Lnear Correlaton and Regresson Jng L jng.l@sjtu.edu.cn http://cbb.sjtu.edu.cn/~jngl/courses/2018fall/b372/ Dept of Bonformatcs & Bostatstcs, SJTU Recall eat chocolate Cell 175,
More information2016 Wiley. Study Session 2: Ethical and Professional Standards Application
6 Wley Study Sesson : Ethcal and Professonal Standards Applcaton LESSON : CORRECTION ANALYSIS Readng 9: Correlaton and Regresson LOS 9a: Calculate and nterpret a sample covarance and a sample correlaton
More informationUsing T.O.M to Estimate Parameter of distributions that have not Single Exponential Family
IOSR Journal of Mathematcs IOSR-JM) ISSN: 2278-5728. Volume 3, Issue 3 Sep-Oct. 202), PP 44-48 www.osrjournals.org Usng T.O.M to Estmate Parameter of dstrbutons that have not Sngle Exponental Famly Jubran
More informationSTAT 511 FINAL EXAM NAME Spring 2001
STAT 5 FINAL EXAM NAME Sprng Instructons: Ths s a closed book exam. No notes or books are allowed. ou may use a calculator but you are not allowed to store notes or formulas n the calculator. Please wrte
More informationMidterm Examination. Regression and Forecasting Models
IOMS Department Regresson and Forecastng Models Professor Wllam Greene Phone: 22.998.0876 Offce: KMC 7-90 Home page: people.stern.nyu.edu/wgreene Emal: wgreene@stern.nyu.edu Course web page: people.stern.nyu.edu/wgreene/regresson/outlne.htm
More informationChapter 14 Simple Linear Regression Page 1. Introduction to regression analysis 14-2
Chapter 4 Smple Lnear Regresson Page. Introducton to regresson analyss 4- The Regresson Equaton. Lnear Functons 4-4 3. Estmaton and nterpretaton of model parameters 4-6 4. Inference on the model parameters
More informationDiagnostics in Poisson Regression. Models - Residual Analysis
Dagnostcs n Posson Regresson Models - Resdual Analyss 1 Outlne Dagnostcs n Posson Regresson Models - Resdual Analyss Example 3: Recall of Stressful Events contnued 2 Resdual Analyss Resduals represent
More informationChapter 8 Indicator Variables
Chapter 8 Indcator Varables In general, e explanatory varables n any regresson analyss are assumed to be quanttatve n nature. For example, e varables lke temperature, dstance, age etc. are quanttatve n
More informationDepartment of Statistics University of Toronto STA305H1S / 1004 HS Design and Analysis of Experiments Term Test - Winter Solution
Department of Statstcs Unversty of Toronto STA35HS / HS Desgn and Analyss of Experments Term Test - Wnter - Soluton February, Last Name: Frst Name: Student Number: Instructons: Tme: hours. Ads: a non-programmable
More information# c i. INFERENCE FOR CONTRASTS (Chapter 4) It's unbiased: Recall: A contrast is a linear combination of effects with coefficients summing to zero:
1 INFERENCE FOR CONTRASTS (Chapter 4 Recall: A contrast s a lnear combnaton of effects wth coeffcents summng to zero: " where " = 0. Specfc types of contrasts of nterest nclude: Dfferences n effects Dfferences
More informationLecture 2: Prelude to the big shrink
Lecture 2: Prelude to the bg shrnk Last tme A slght detour wth vsualzaton tools (hey, t was the frst day... why not start out wth somethng pretty to look at?) Then, we consdered a smple 120a-style regresson
More informationChapter 15 - Multiple Regression
Chapter - Multple Regresson Chapter - Multple Regresson Multple Regresson Model The equaton that descrbes how the dependent varable y s related to the ndependent varables x, x,... x p and an error term
More informationDO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR. Introductory Econometrics 1 hour 30 minutes
25/6 Canddates Only January Examnatons 26 Student Number: Desk Number:...... DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR Department Module Code Module Ttle Exam Duraton
More informationThe Ordinary Least Squares (OLS) Estimator
The Ordnary Least Squares (OLS) Estmator 1 Regresson Analyss Regresson Analyss: a statstcal technque for nvestgatng and modelng the relatonshp between varables. Applcatons: Engneerng, the physcal and chemcal
More informationStatistics Chapter 4
Statstcs Chapter 4 "There are three knds of les: les, damned les, and statstcs." Benjamn Dsrael, 1895 (Brtsh statesman) Gaussan Dstrbuton, 4-1 If a measurement s repeated many tmes a statstcal treatment
More informationJoint Statistical Meetings - Biopharmaceutical Section
Iteratve Ch-Square Test for Equvalence of Multple Treatment Groups Te-Hua Ng*, U.S. Food and Drug Admnstraton 1401 Rockvlle Pke, #200S, HFM-217, Rockvlle, MD 20852-1448 Key Words: Equvalence Testng; Actve
More informationANOVA. The Observations y ij
ANOVA Stands for ANalyss Of VArance But t s a test of dfferences n means The dea: The Observatons y j Treatment group = 1 = 2 = k y 11 y 21 y k,1 y 12 y 22 y k,2 y 1, n1 y 2, n2 y k, nk means: m 1 m 2
More informationResource Allocation and Decision Analysis (ECON 8010) Spring 2014 Foundations of Regression Analysis
Resource Allocaton and Decson Analss (ECON 800) Sprng 04 Foundatons of Regresson Analss Readng: Regresson Analss (ECON 800 Coursepak, Page 3) Defntons and Concepts: Regresson Analss statstcal technques
More informationTwo-factor model. Statistical Models. Least Squares estimation in LM two-factor model. Rats
tatstcal Models Lecture nalyss of Varance wo-factor model Overall mean Man effect of factor at level Man effect of factor at level Y µ + α + β + γ + ε Eε f (, ( l, Cov( ε, ε ) lmr f (, nteracton effect
More informationLinear regression. Regression Models. Chapter 11 Student Lecture Notes Regression Analysis is the
Chapter 11 Student Lecture Notes 11-1 Lnear regresson Wenl lu Dept. Health statstcs School of publc health Tanjn medcal unversty 1 Regresson Models 1. Answer What Is the Relatonshp Between the Varables?.
More informationCIS526: Machine Learning Lecture 3 (Sept 16, 2003) Linear Regression. Preparation help: Xiaoying Huang. x 1 θ 1 output... θ M x M
CIS56: achne Learnng Lecture 3 (Sept 6, 003) Preparaton help: Xaoyng Huang Lnear Regresson Lnear regresson can be represented by a functonal form: f(; θ) = θ 0 0 +θ + + θ = θ = 0 ote: 0 s a dummy attrbute
More informationBOOTSTRAP METHOD FOR TESTING OF EQUALITY OF SEVERAL MEANS. M. Krishna Reddy, B. Naveen Kumar and Y. Ramu
BOOTSTRAP METHOD FOR TESTING OF EQUALITY OF SEVERAL MEANS M. Krshna Reddy, B. Naveen Kumar and Y. Ramu Department of Statstcs, Osmana Unversty, Hyderabad -500 007, Inda. nanbyrozu@gmal.com, ramu0@gmal.com
More informationHere is the rationale: If X and y have a strong positive relationship to one another, then ( x x) will tend to be positive when ( y y)
Secton 1.5 Correlaton In the prevous sectons, we looked at regresson and the value r was a measurement of how much of the varaton n y can be attrbuted to the lnear relatonshp between y and x. In ths secton,
More information3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X
Statstcs 1: Probablty Theory II 37 3 EPECTATION OF SEVERAL RANDOM VARIABLES As n Probablty Theory I, the nterest n most stuatons les not on the actual dstrbuton of a random vector, but rather on a number
More informationScatter Plot x
Construct a scatter plot usng excel for the gven data. Determne whether there s a postve lnear correlaton, negatve lnear correlaton, or no lnear correlaton. Complete the table and fnd the correlaton coeffcent
More informationChapter 4: Regression With One Regressor
Chapter 4: Regresson Wth One Regressor Copyrght 2011 Pearson Addson-Wesley. All rghts reserved. 1-1 Outlne 1. Fttng a lne to data 2. The ordnary least squares (OLS) lne/regresson 3. Measures of ft 4. Populaton
More information8/25/17. Data Modeling. Data Modeling. Data Modeling. Patrice Koehl Department of Biological Sciences National University of Singapore
8/5/17 Data Modelng Patrce Koehl Department of Bologcal Scences atonal Unversty of Sngapore http://www.cs.ucdavs.edu/~koehl/teachng/bl59 koehl@cs.ucdavs.edu Data Modelng Ø Data Modelng: least squares Ø
More information18. SIMPLE LINEAR REGRESSION III
8. SIMPLE LINEAR REGRESSION III US Domestc Beers: Calores vs. % Alcohol Ftted Values and Resduals To each observed x, there corresponds a y-value on the ftted lne, y ˆ ˆ = α + x. The are called ftted values.
More information4.3 Poisson Regression
of teratvely reweghted least squares regressons (the IRLS algorthm). We do wthout gvng further detals, but nstead focus on the practcal applcaton. > glm(survval~log(weght)+age, famly="bnomal", data=baby)
More information28. SIMPLE LINEAR REGRESSION III
8. SIMPLE LINEAR REGRESSION III Ftted Values and Resduals US Domestc Beers: Calores vs. % Alcohol To each observed x, there corresponds a y-value on the ftted lne, y ˆ = βˆ + βˆ x. The are called ftted
More informationF8: Heteroscedasticity
F8: Heteroscedastcty Feng L Department of Statstcs, Stockholm Unversty What s so-called heteroscedastcty In a lnear regresson model, we assume the error term has a normal dstrbuton wth mean zero and varance
More informationLinear Approximation with Regularization and Moving Least Squares
Lnear Approxmaton wth Regularzaton and Movng Least Squares Igor Grešovn May 007 Revson 4.6 (Revson : March 004). 5 4 3 0.5 3 3.5 4 Contents: Lnear Fttng...4. Weghted Least Squares n Functon Approxmaton...
More informationDr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur
Analyss of Varance and Desgn of Exerments-I MODULE III LECTURE - 2 EXPERIMENTAL DESIGN MODELS Dr. Shalabh Deartment of Mathematcs and Statstcs Indan Insttute of Technology Kanur 2 We consder the models
More informationPubH 7405: REGRESSION ANALYSIS. SLR: INFERENCES, Part II
PubH 7405: REGRESSION ANALSIS SLR: INFERENCES, Part II We cover te topc of nference n two sessons; te frst sesson focused on nferences concernng te slope and te ntercept; ts s a contnuaton on estmatng
More informationIntroduction to Dummy Variable Regressors. 1. An Example of Dummy Variable Regressors
ECONOMICS 5* -- Introducton to Dummy Varable Regressors ECON 5* -- Introducton to NOTE Introducton to Dummy Varable Regressors. An Example of Dummy Varable Regressors A model of North Amercan car prces
More informationSystematic Error Illustration of Bias. Sources of Systematic Errors. Effects of Systematic Errors 9/23/2009. Instrument Errors Method Errors Personal
9/3/009 Sstematc Error Illustraton of Bas Sources of Sstematc Errors Instrument Errors Method Errors Personal Prejudce Preconceved noton of true value umber bas Prefer 0/5 Small over large Even over odd
More informationLaboratory 1c: Method of Least Squares
Lab 1c, Least Squares Laboratory 1c: Method of Least Squares Introducton Consder the graph of expermental data n Fgure 1. In ths experment x s the ndependent varable and y the dependent varable. Clearly
More informationGoodness of fit and Wilks theorem
DRAFT 0.0 Glen Cowan 3 June, 2013 Goodness of ft and Wlks theorem Suppose we model data y wth a lkelhood L(µ) that depends on a set of N parameters µ = (µ 1,...,µ N ). Defne the statstc t µ ln L(µ) L(ˆµ),
More informationJanuary Examinations 2015
24/5 Canddates Only January Examnatons 25 DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR STUDENT CANDIDATE NO.. Department Module Code Module Ttle Exam Duraton (n words)
More informationThe Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction
ECONOMICS 5* -- NOTE (Summary) ECON 5* -- NOTE The Multple Classcal Lnear Regresson Model (CLRM): Specfcaton and Assumptons. Introducton CLRM stands for the Classcal Lnear Regresson Model. The CLRM s also
More information