This is the linearity assumption. The expected value of Y is really a straight-line function of x.
- Gregory Knight
- 6 years ago
B90.30 / C.007 NOTES for Wednesday 4 FEB 00

Handouts: SimpleRegressionInference, CoefInterp, Matrix notation
Pamphlets on multiple regression, variable selection

We need to talk about inferences that we make with regression. First we will need to be very clear about the model assumptions. We'll do this for simple regression ($K = 1$) and later for multiple regression ($K \ge 2$).

$$E(Y_i) = \beta_0 + \beta_1 x_i$$

This is the linearity assumption. The expected value of Y is really a straight-line function of x.

$$Y_i = \beta_0 + \beta_1 x_i + \varepsilon_i$$

This is the additive error (noise) assumption. In particular, the right side is not $(\beta_0 + \beta_1 x_i)\,\varepsilon_i$. It is also not $\beta_0\, x_i^{\beta_1} e^{\varepsilon_i}$.

$\varepsilon_1, \varepsilon_2, \ldots, \varepsilon_n$ are independent random variables, each with the same distribution

The independence means that $\varepsilon_i$ and $\varepsilon_j$ are independent random variables when $i \ne j$. It also means that $\varepsilon_i$ is independent of $x_i$ and of all the other independent variables. The same distribution assumption really means that the $\varepsilon_i$'s are a random sample.

$$E(\varepsilon_i) = 0 \quad\text{and}\quad SD(\varepsilon_i) = \sigma$$

The $\varepsilon_i$'s are assumed to be taken from a population with mean zero. This is a costless assumption. If $E(\varepsilon_i) = \psi \ne 0$, then we would rewrite the model as $Y_i = (\beta_0 + \psi) + \beta_1 x_i + (\varepsilon_i - \psi)$. In this rewritten form $(\beta_0 + \psi)$ is an unknown intercept and the $(\varepsilon_i - \psi)$ are unobserved noise terms with mean 0.

The $\varepsilon_i$'s are assumed to all have the same standard deviation. We need to be wary about possible violations of this assumption, as it is fairly common for $SD(\varepsilon_i)$ to be larger for those data points that have large $x_i$.
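The "costless assumption" argument above can be checked with a small simulation. The sketch below is only an illustration: the parameter values, the sample size, and the helper name `fit_line` are hypothetical choices, not part of the notes. It generates noise with mean $\psi$ rather than 0 and shows that least squares simply estimates the intercept $\beta_0 + \psi$ while the slope is unaffected.

```python
import random

def fit_line(x, y):
    """Least-squares (intercept, slope) for simple regression."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    s_xx = sum((xi - x_bar) ** 2 for xi in x)
    s_xy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    b1 = s_xy / s_xx
    return y_bar - b1 * x_bar, b1

random.seed(1)
# Hypothetical values chosen only for illustration.
beta0, beta1, sigma, psi = 2.0, 0.5, 1.0, 3.0
x = [random.uniform(0, 10) for _ in range(5000)]
# The noise has mean psi instead of 0.
y = [beta0 + beta1 * xi + random.gauss(psi, sigma) for xi in x]

b0, b1 = fit_line(x, y)
print("estimated intercept:", b0, "(compare beta0 + psi =", beta0 + psi, ")")
print("estimated slope:", b1, "(compare beta1 =", beta1, ")")
```

Nothing in the data can separate $\beta_0$ from $\psi$, which is why assuming $E(\varepsilon_i) = 0$ costs nothing.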
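This says also that $SD(Y_i) = SD(\beta_0 + \beta_1 x_i + \varepsilon_i) = SD(\varepsilon_i) = \sigma$. The signal $\beta_0 + \beta_1 x_i$ is not random, and it does not contribute to the standard deviation.

$\varepsilon_1, \varepsilon_2, \ldots, \varepsilon_n$ are normally distributed

Believe it or not, we don't need to make this assumption in all cases. If our only concerns are about means, standard deviations, and correlations we do not need to invoke normality. The normal distribution assumption is critical for the inferential acts of confidence intervals, hypothesis tests, and predictions. Any activity that gets to using the t distribution or the F distribution must be based on the assumption of normality.

As a consequence of all these assumptions, including normality, we can show

$$b_1 \sim N\!\left(\beta_1,\ \frac{\sigma^2}{S_{xx}}\right)$$

The ~ means "is distributed as." The notation N(mean, variance) is also used here. That is, $b_1$ is normally distributed with mean $\beta_1$ and with variance $\sigma^2 / S_{xx}$. The standard deviation is of course $\sigma / \sqrt{S_{xx}}$.

There is an interesting derivation for this.

$$b_1 = \frac{S_{xy}}{S_{xx}} = \frac{\sum (x_i - \bar{x})(Y_i - \bar{Y})}{S_{xx}} = \frac{\sum (x_i - \bar{x})\,Y_i}{S_{xx}}$$

It is interesting that we can drop $\bar{Y}$ from this arithmetic.

$$= \frac{\sum (x_i - \bar{x})(\beta_0 + \beta_1 x_i + \varepsilon_i)}{S_{xx}} = \frac{\sum (x_i - \bar{x})(\beta_1 x_i + \varepsilon_i)}{S_{xx}}$$

Note that $\beta_0$ disappears.

$$= \beta_1 \frac{\sum (x_i - \bar{x})\,x_i}{S_{xx}} + \frac{\sum (x_i - \bar{x})\,\varepsilon_i}{S_{xx}} = \beta_1 \frac{\sum (x_i - \bar{x})(x_i - \bar{x})}{S_{xx}} + \frac{\sum (x_i - \bar{x})\,\varepsilon_i}{S_{xx}}$$

Note that $\bar{x}$ can be reintroduced.

$$= \beta_1 \frac{S_{xx}}{S_{xx}} + \frac{\sum (x_i - \bar{x})\,\varepsilon_i}{S_{xx}} = \beta_1 + \frac{\sum (x_i - \bar{x})\,\varepsilon_i}{S_{xx}}$$

Recall that $S_{xx} = \sum (x_i - \bar{x})^2$.

Since this is a linear combination of the $\varepsilon_i$'s, it must be normally distributed. (The $x_i$'s are treated as non-random.) Since $E(\varepsilon_i) = 0$, this also shows that $E(b_1) = \beta_1$. All that remains is finding $\mathrm{Var}(b_1)$.

$$\mathrm{Var}(b_1) = \mathrm{Var}\!\left(\beta_1 + \frac{\sum (x_i - \bar{x})\,\varepsilon_i}{S_{xx}}\right) = \mathrm{Var}\!\left(\frac{\sum (x_i - \bar{x})\,\varepsilon_i}{S_{xx}}\right)$$

Since $\beta_1$ is not random, it is not involved in the variance.

$$= \frac{1}{S_{xx}^2}\,\mathrm{Var}\!\left(\sum (x_i - \bar{x})\,\varepsilon_i\right) = \frac{1}{S_{xx}^2} \sum (x_i - \bar{x})^2\,\mathrm{Var}(\varepsilon_i) = \frac{1}{S_{xx}^2}\, S_{xx}\, \sigma^2 = \frac{\sigma^2}{S_{xx}}$$

$$b_0 \sim N\!\left(\beta_0,\ \sigma^2\left(\frac{1}{n} + \frac{\bar{x}^2}{S_{xx}}\right)\right)$$

$$\mathrm{Cov}(b_0, b_1) = -\sigma^2\,\frac{\bar{x}}{S_{xx}}$$

(This does not require the normal assumption.)

$$\frac{(n-2)\,s_\varepsilon^2}{\sigma^2} \sim \chi^2_{n-2}$$

This is the chi-squared distribution. You will also see this expressed as $s_\varepsilon^2 \sim \sigma^2\,\dfrac{\chi^2_{n-2}}{n-2}$.

It can be shown that the expected value of a chi-squared random variable is its degrees of freedom. Thus $E(\chi^2_{n-2}) = n - 2$, leading us to $E(s_\varepsilon^2) = \sigma^2$. We will say $s_\varepsilon^2$ is an unbiased estimate of $\sigma^2$. It does not follow that $s_\varepsilon$ is an unbiased estimate of $\sigma$.

Since $MS_{\text{resid}} = \dfrac{SS_{\text{resid}}}{n-2} = s_\varepsilon^2$, this result can be expressed as $\dfrac{SS_{\text{resid}}}{\sigma^2} \sim \chi^2_{n-2}$.

$s_\varepsilon$ is statistically independent of $(b_0, b_1)$

A derived quantity, such as $b_1$, has a standard deviation that depends on unknown things (usually $\sigma$). Here $SD(b_1) = \sigma / \sqrt{S_{xx}}$. We can estimate this standard deviation by estimating the unknown thing, and we call this estimate the standard error. Since $s_\varepsilon$ estimates $\sigma$, we write $SE(b_1) = s_\varepsilon / \sqrt{S_{xx}}$ = standard error of $b_1$. Alas, not everyone uses this strict definition for standard error.

In many problems with normal distributions and estimates of $\sigma$ from a chi-squared distribution, we can create

$$\frac{\text{random quantity} - E(\text{random quantity})}{SE(\text{random quantity})}$$

and it will have a t distribution. The exact statements are a bit tricky, and that will be omitted here. The degrees of freedom is the degrees of freedom number from the chi-squared distribution. This leads us to

$$\frac{b_1 - E(b_1)}{SE(b_1)} = \frac{b_1 - \beta_1}{s_\varepsilon / \sqrt{S_{xx}}} \sim t_{n-2}$$

This can be used to create a $1 - \alpha$ confidence interval for $\beta_1$. Thus the $1 - \alpha$ confidence interval for $\beta_1$ is

$$b_1 \pm t_{\alpha/2;\,n-2}\,\frac{s_\varepsilon}{\sqrt{S_{xx}}}$$

Similar logic gets a $1 - \alpha$ confidence interval for $\beta_0$. The $1 - \alpha$ confidence interval for $\beta_0$ is

$$b_0 \pm t_{\alpha/2;\,n-2}\; s_\varepsilon \sqrt{\frac{1}{n} + \frac{\bar{x}^2}{S_{xx}}}$$

The intervals for $\beta_0$ and $\beta_1$ are not statistically independent.

If it happens that $\beta_1 = 0$, then $\dfrac{SS_{\text{regression}}}{\sigma^2}$ is distributed as $\chi^2_1$, independent of $SS_{\text{resid}}$. If $\beta_1 \ne 0$, this distribution is a non-central chi-squared, but that is another story.

If it happens that $\beta_1 = 0$, then $F = \dfrac{MS_{\text{regression}}}{MS_{\text{resid}}}$ follows the F distribution with $(1, n-2)$ degrees of freedom. If $\beta_1 \ne 0$, the distribution is non-central F and tends to be big. (That is why we reject $\beta_1 = 0$ with large F statistics.)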
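The slope inference above can be traced on a tiny data set. The following is a minimal sketch, assuming five made-up data points and a hypothetical helper `slope_inference`; the critical value 3.182 is the tabled upper 2.5% point of the t distribution with 3 degrees of freedom and is passed in by hand so that only the standard library is needed. It also confirms numerically that in simple regression the F statistic is the square of the t statistic for the slope.

```python
import math

def slope_inference(x, y, t_crit):
    """Slope estimate, its standard error, t and F statistics, and a CI."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    s_xx = sum((xi - x_bar) ** 2 for xi in x)
    s_xy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    b1 = s_xy / s_xx
    b0 = y_bar - b1 * x_bar
    ss_resid = sum((yi - b0 - b1 * xi) ** 2 for xi, yi in zip(x, y))
    ms_resid = ss_resid / (n - 2)          # this is s_eps squared
    se_b1 = math.sqrt(ms_resid / s_xx)     # s_eps / sqrt(S_xx)
    ss_regression = b1 ** 2 * s_xx         # regression sum of squares (1 df)
    return {
        "b0": b0,
        "b1": b1,
        "se_b1": se_b1,
        "t": b1 / se_b1,                   # tests beta1 = 0
        "F": ss_regression / ms_resid,     # MS_regression / MS_resid
        "ci": (b1 - t_crit * se_b1, b1 + t_crit * se_b1),
    }

# Made-up data, n = 5, so the t distribution has n - 2 = 3 degrees of freedom.
x = [1, 2, 3, 4, 5]
y = [2.1, 3.9, 6.2, 7.8, 10.1]
out = slope_inference(x, y, t_crit=3.182)  # 95% interval, t_{0.025; 3}
print(out)
```

For other sample sizes or confidence levels the value of `t_crit` has to be replaced by the matching table entry.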
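Let's suppose that we have a target value $x_{\text{new}}$ in mind, and we wanted to predict the population average Y that would result from this $x_{\text{new}}$. This is asking for a prediction of $E\,Y_{\text{new}} = \beta_0 + \beta_1 x_{\text{new}}$, which is something we generally do not need to do!

The point prediction is clearly $b_0 + b_1 x_{\text{new}}$. We can obtain the variance of this prediction:

$$\mathrm{Var}(b_0 + b_1 x_{\text{new}}) = \mathrm{Var}(b_0) + 2\,x_{\text{new}}\,\mathrm{Cov}(b_0, b_1) + x_{\text{new}}^2\,\mathrm{Var}(b_1)$$

$$= \sigma^2\left(\frac{1}{n} + \frac{\bar{x}^2}{S_{xx}}\right) - 2\,x_{\text{new}}\,\sigma^2\,\frac{\bar{x}}{S_{xx}} + x_{\text{new}}^2\,\frac{\sigma^2}{S_{xx}}$$

$$= \sigma^2\left(\frac{1}{n} + \frac{\bar{x}^2 - 2\,x_{\text{new}}\,\bar{x} + x_{\text{new}}^2}{S_{xx}}\right) = \sigma^2\left(\frac{1}{n} + \frac{(x_{\text{new}} - \bar{x})^2}{S_{xx}}\right)$$

In this expression, $\bar{x}$ and $S_{xx}$ are based on the original n data points, and $x_{\text{new}}$ is not part of the original data.

It follows that we can make a t distribution out of this:

$$\frac{(b_0 + b_1 x_{\text{new}}) - (\beta_0 + \beta_1 x_{\text{new}})}{s_\varepsilon \sqrt{\dfrac{1}{n} + \dfrac{(x_{\text{new}} - \bar{x})^2}{S_{xx}}}} \sim t_{n-2}$$

Then, of course, we can give a $1 - \alpha$ confidence interval for $\beta_0 + \beta_1 x_{\text{new}}$. Here it is:

$$b_0 + b_1 x_{\text{new}} \pm t_{\alpha/2;\,n-2}\; s_\varepsilon \sqrt{\frac{1}{n} + \frac{(x_{\text{new}} - \bar{x})^2}{S_{xx}}}$$

This is all logically correct, but it begs the question. Why exactly might you want a confidence interval for $\beta_0 + \beta_1 x_{\text{new}}$?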
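The algebra that collapses the three-term variance into a single expression is easy to check numerically. This is a small sketch with made-up x values and a hypothetical function name; it evaluates both forms of $\mathrm{Var}(b_0 + b_1 x_{\text{new}})$ from the formulas for $\mathrm{Var}(b_0)$, $\mathrm{Var}(b_1)$ and $\mathrm{Cov}(b_0, b_1)$ and prints them side by side.

```python
def var_of_fitted_mean(x, x_new, sigma2):
    """Return Var(b0 + b1*x_new) computed two ways: term by term, and collapsed."""
    n = len(x)
    x_bar = sum(x) / n
    s_xx = sum((xi - x_bar) ** 2 for xi in x)
    var_b0 = sigma2 * (1 / n + x_bar ** 2 / s_xx)
    var_b1 = sigma2 / s_xx
    cov_b0_b1 = -sigma2 * x_bar / s_xx
    term_by_term = var_b0 + 2 * x_new * cov_b0_b1 + x_new ** 2 * var_b1
    collapsed = sigma2 * (1 / n + (x_new - x_bar) ** 2 / s_xx)
    return term_by_term, collapsed

x = [1, 2, 3, 4, 5]          # made-up design points, x_bar = 3
for x_new in (3, 5, 7):
    print(x_new, var_of_fitted_mean(x, x_new, sigma2=4.0))
```

The variance is smallest at $x_{\text{new}} = \bar{x}$ and grows with the squared distance from $\bar{x}$, which is why the confidence limits for the line flare out at the ends.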
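Vastly more useful is a prediction for $Y_{\text{new}}$. The obvious prediction is $\hat{Y}_{\text{new}} = b_0 + b_1 x_{\text{new}}$. In this expression, we regard $x_{\text{new}}$ as non-random, but we certainly must regard $b_0$ and $b_1$ (and thus $\hat{Y}_{\text{new}}$) as random.

Let's use the symbol $Y_{\text{new}}$ as the new value, which we'll see eventually. The quantity $\hat{Y}_{\text{new}} - Y_{\text{new}}$ is the prediction error. We can get to the statistical properties of this by using the model on $Y_{\text{new}}$. Specifically, we have $Y_{\text{new}} = \beta_0 + \beta_1 x_{\text{new}} + \varepsilon_{\text{new}}$. The new noise term $\varepsilon_{\text{new}}$ is statistically independent of everything that came before it.

Certainly the prediction error $\hat{Y}_{\text{new}} - Y_{\text{new}}$ will follow a normal distribution. We can show that the mean is zero.

$$E[\hat{Y}_{\text{new}} - Y_{\text{new}}] = E\big[(b_0 + b_1 x_{\text{new}}) - (\beta_0 + \beta_1 x_{\text{new}} + \varepsilon_{\text{new}})\big]$$

$$= (E\,b_0 + E\,b_1\, x_{\text{new}}) - (\beta_0 + \beta_1 x_{\text{new}} + E(\varepsilon_{\text{new}})) = 0$$

The variance is trickier:

$$\mathrm{Var}[\hat{Y}_{\text{new}} - Y_{\text{new}}] = \mathrm{Var}\big[(b_0 + b_1 x_{\text{new}}) - (\beta_0 + \beta_1 x_{\text{new}} + \varepsilon_{\text{new}})\big] = \mathrm{Var}\big[(b_0 + b_1 x_{\text{new}}) - \varepsilon_{\text{new}}\big]$$

$$= \mathrm{Var}(b_0) + 2\,x_{\text{new}}\,\mathrm{Cov}(b_0, b_1) + x_{\text{new}}^2\,\mathrm{Var}(b_1) + \mathrm{Var}(\varepsilon_{\text{new}})$$

$$= \sigma^2\left(\frac{1}{n} + \frac{\bar{x}^2 - 2\,x_{\text{new}}\,\bar{x} + x_{\text{new}}^2}{S_{xx}}\right) + \sigma^2 = \sigma^2\left(1 + \frac{1}{n} + \frac{(x_{\text{new}} - \bar{x})^2}{S_{xx}}\right)$$

This leads to a t distribution:

$$\frac{\hat{Y}_{\text{new}} - Y_{\text{new}}}{s_\varepsilon \sqrt{1 + \dfrac{1}{n} + \dfrac{(x_{\text{new}} - \bar{x})^2}{S_{xx}}}} \sim t_{n-2}$$

You will frequently see the very useful $1 - \alpha$ prediction interval for $Y_{\text{new}}$:

$$b_0 + b_1 x_{\text{new}} \pm t_{\alpha/2;\,n-2}\; s_\varepsilon \sqrt{1 + \frac{1}{n} + \frac{(x_{\text{new}} - \bar{x})^2}{S_{xx}}}$$

It should be noted that the intervals for two new values, say at $x_{\text{new},1}$ and $x_{\text{new},2}$, are not statistically independent of each other.

You will also see plots of prediction bands. These are not a confidence region for the line.

It must be noted that these distributional results depend on the correctness of the model. It is curious to note that the results hold exactly only if we make no effort to check the integrity of the model!

Let's think about that. If we check the model, perhaps by examining the data or looking at residual plots, we might take actions. We might replace Y by its logarithm. We might delete a point as being an outlier. The actions are, of course, possible sources of errors.

As a simple illustration of that, let's recall the simple (pre-regression) problem based on a sample of values $w_1, w_2, \ldots, w_n$. We make the model that these are a sample from a normal population with mean $\mu_w$ and standard deviation $\sigma_w$. Then a 95% confidence interval for $\mu_w$ is $\bar{w} \pm t_{0.025;\,n-1}\,\dfrac{s_w}{\sqrt{n}}$. This is true if the model is true and we don't examine our data! Suppose that we have a rule for finding outliers and throwing them away. Whether we make correct or incorrect decisions in any particular case, we will still be messing up the 95% confidence.

Of course, every practicing statistician would look at the data to check for assumptions. We would look very foolish if we did not. But let's at least be honest and admit that looking at the data may alter our confidence values!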
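To make the difference between the prediction interval for $Y_{\text{new}}$ and the confidence interval for $\beta_0 + \beta_1 x_{\text{new}}$ concrete, here is a minimal sketch. The data are made up, the function name `intervals_at` is hypothetical, and the critical value 3.182 (the upper 2.5% point of t with 3 degrees of freedom) is supplied by hand from a table. The only difference between the two half-widths is the extra "1 +" under the square root, which accounts for $\mathrm{Var}(\varepsilon_{\text{new}})$.

```python
import math

def intervals_at(x, y, x_new, t_crit):
    """Point prediction plus CI and prediction-interval half-widths at x_new."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    s_xx = sum((xi - x_bar) ** 2 for xi in x)
    s_xy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    b1 = s_xy / s_xx
    b0 = y_bar - b1 * x_bar
    ss_resid = sum((yi - b0 - b1 * xi) ** 2 for xi, yi in zip(x, y))
    s_eps = math.sqrt(ss_resid / (n - 2))
    core = 1 / n + (x_new - x_bar) ** 2 / s_xx
    center = b0 + b1 * x_new
    ci_half = t_crit * s_eps * math.sqrt(core)        # for beta0 + beta1*x_new
    pi_half = t_crit * s_eps * math.sqrt(1 + core)    # for Y_new itself
    return center, ci_half, pi_half

x = [1, 2, 3, 4, 5]                  # made-up data, n - 2 = 3 degrees of freedom
y = [2.1, 3.9, 6.2, 7.8, 10.1]
for x_new in (3, 6):
    center, ci_half, pi_half = intervals_at(x, y, x_new, t_crit=3.182)
    print(x_new, center, "CI +/-", ci_half, "PI +/-", pi_half)
```

As n grows the confidence half-width shrinks toward zero, while the prediction half-width cannot fall below roughly $t \cdot s_\varepsilon$, because the new noise term never averages away.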
Residuals also help identify model failure, especially through using the residual-versus-fitted plot. (There is a section in the Simple Linear Regression pamphlet on this; the commonly encountered problem is spreading residuals. The cure is taking logarithms of Y.)

Let's examine this in the context of the salary data set. This is M\SALARY.MTP. Here is the residual versus fitted plot:

[Figure: Residuals Versus the Fitted Values (response is SALARY); vertical axis Residual, horizontal axis Fitted Value]

This is a clear case of expanding residuals. The cure for this one is to replace Y by log Y.

* We recommend base-e logarithms.
* If the Y's have some zeroes or negative values, use log(Y + c), choosing c to elevate all the values to just above zero.

We can point out also the gross size phenomenon. Variables that run over several orders of magnitude are almost always treated with logarithms.

The residual versus fitted plot will also let us detect curvature. It can find weird values (outliers?), but we have to be careful with such funny points.

If any point is weird, it must be checked for correct coding. If the point is not recorded correctly, fix it and start over. If it cannot be fixed, then discard it, but record this action in the report.
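The log(Y + c) advice above can be sketched in a few lines. The helper name `log_with_shift` and the `margin` parameter are hypothetical; in particular, how far above zero the smallest value should land ("just above zero") is a judgment call, and the default of 1.0 used here is only an assumption for illustration.

```python
import math

def log_with_shift(values, margin=1.0):
    """Natural logs of values + c, where c lifts the smallest value to `margin`.

    If every value is already positive, no shift is applied (c = 0).
    """
    smallest = min(values)
    c = 0.0 if smallest > 0 else margin - smallest
    return c, [math.log(v + c) for v in values]

# Made-up responses that include a zero and a negative value
# and run over several orders of magnitude.
y = [-3.0, 0.0, 10.0, 2000.0]
c, log_y = log_with_shift(y)
print("shift c =", c)
print("log(Y + c) =", log_y)
```

The shift c becomes part of the model, so it should be reported along with the regression.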
If the point is correct, try to decide if it has undue impact on the regression. If removing the point gives a big change in b, then you might consider removing it. In any case, if you remove a point, that action must be included in the report.

For multiple regression the model is

$$Y_i = \beta_0 + \beta_1 x_{i1} + \beta_2 x_{i2} + \beta_3 x_{i3} + \cdots + \beta_K x_{iK} + \varepsilon_i$$

We say that this is regression with K predictors. For $K = 1$, we call it simple regression and for $K \ge 2$, it is multiple regression.

The notation is a problem. The subscript-free version of the model is

$$Y = \beta_0 + \beta_1 x_1 + \beta_2 x_2 + \beta_3 x_3 + \cdots + \beta_K x_K + \varepsilon$$

In this form, the independent variables would be described as $X_1, X_2, \ldots, X_K$. (We can use upper case letters when we think of them as variables.)

In applications, these will have names. The model might look like

$$\text{SALES}_i = \beta_0 + \beta_{\text{SOCK}}\,\text{SOCK}_i + \beta_{\text{SHIRT}}\,\text{SHIRT}_i + \beta_{\text{PANT}}\,\text{PANT}_i + \varepsilon_i$$

This would avoid the confusion of the double subscripts on the x terms.

The estimating criterion is again least squares. Specifically, we try to minimize

$$\sum_{i=1}^{n} \Big( Y_i - \big[\, b_0 + b_1 x_{i1} + b_2 x_{i2} + b_3 x_{i3} + \cdots + b_K x_{iK} \,\big] \Big)^2$$

You'll also see this as

$$\sum_{i=1}^{n} \Big( Y_i - \sum_{j=0}^{K} b_j x_{ij} \Big)^2$$

This requires that we identify $x_{i0} = 1$ for every i.

We have a handout on the matrix notation. A critical issue is that the solution vector $b = (X'X)^{-1} X'Y$ requires the inversion of a p-by-p matrix. This can fail for a number of reasons:

* Rank(X) < p. The p columns of X are not linearly independent vectors. The handout on matrix notation shows this.
* Not enough data, meaning n < p. In that case, the matrix X has more columns than rows. This automatically makes Rank(X) < p.

There is also the strange computational concern in which Rank(X) is just barely p. This happens when one of the columns of X is very, very close to being an exact linear combination of other columns. We won't discuss this in detail, as the problem shows up as collinearity. The measure of this concern is the ratio

$$\frac{\text{largest eigenvalue of } (X'X)}{\text{smallest eigenvalue of } (X'X)}$$

There is this difficult interpretation issue with regard to the estimated coefficients. Examine the animal set. Consider the regression of Gestation on (Bweight, Tsleep).

Get matrix plot. Discover that Bweight needs logs. Then run the regression, finding that Gestation also needs logs. For coherence, we'll need to eliminate any animal that is not complete on these three variables.

Here is the regression:

[Minitab output: Regression Analysis: log(G) versus log(W), Tsleep. Coefficient table with columns Predictor, Coef, SE Coef, T, P and rows Constant, log(W), Tsleep; R-Sq = 6.%, R-Sq(adj) = 60.7%; Analysis of Variance table with columns Source, DF, SS, MS, F, P and rows Regression, Residual Error, Total.]

This is a good regression (although it is dependent in an absurd way on the style in which the animals were chosen).

Let's look at the estimated coefficient for log(BW). The interpretation here is that a one unit change in log(BW) leads to a change of 0.78 in log(G), holding Tsleep constant.

There is another way to look at this.

Let log(BW)[Tsleep] be the residual from regressing log(BW) on Tsleep. That is, this is the part of log(BW) that cannot be explained by Tsleep.

Similarly, let log(G)[Tsleep] be the residual from regressing log(G) on Tsleep. This is the part of log(G) that cannot be explained by Tsleep.

Now... if we regress log(G)[Tsleep] on log(BW)[Tsleep], we get

[Minitab output: Regression Analysis: log(G)[Tsleep] versus log(W)[Tsleep]. Coefficient table with columns Predictor, Coef, SE Coef, T, P and rows Constant, log(W)[Tsleep]; R-Sq = 3.5%, R-Sq(adj) = 3.%; Analysis of Variance table with columns Source, DF, SS, MS, F, P and rows Regression, Residual Error, Total.]

Note that the slope coefficient is 0.78! As a side thought question, why is the intercept zero?

Many things need to be checked in a multiple regression. The set CONDO09.mtp is useful for this.

* The residual versus fitted should be patternless. If the residuals expand to the right, you'll need to use log(Y), or perhaps log(Y + c).
* With other strange patterns in the residual versus fitted plot, you may wish to plot the residuals against each independent variable. This will suggest that we have curvature in DELEV. It is hard to know if this was worth correcting, but we did.
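The residual-on-residual regression described above reproduces the multiple regression coefficient exactly, not just approximately. The sketch below shows this with made-up numbers; the variables `x1`, `x2`, `y` merely stand in for log(BW), Tsleep and log(G), and all function names are hypothetical. The two-predictor coefficient is obtained from the normal equations written in centered sums of squares and cross-products.

```python
def simple_fit(x, y):
    """Least-squares (intercept, slope) of y on a single predictor x."""
    n = len(x)
    x_bar, y_bar = sum(x) / n, sum(y) / n
    s_xx = sum((a - x_bar) ** 2 for a in x)
    s_xy = sum((a - x_bar) * (b - y_bar) for a, b in zip(x, y))
    slope = s_xy / s_xx
    return y_bar - slope * x_bar, slope

def residuals(x, y):
    """Residuals from regressing y on x: the part of y not explained by x."""
    b0, b1 = simple_fit(x, y)
    return [b - (b0 + b1 * a) for a, b in zip(x, y)]

def coef_of_first(x1, x2, y):
    """Coefficient of x1 in the least-squares regression of y on (x1, x2)."""
    n = len(y)
    m1, m2, my = sum(x1) / n, sum(x2) / n, sum(y) / n
    s11 = sum((a - m1) ** 2 for a in x1)
    s22 = sum((b - m2) ** 2 for b in x2)
    s12 = sum((a - m1) * (b - m2) for a, b in zip(x1, x2))
    s1y = sum((a - m1) * (c - my) for a, c in zip(x1, y))
    s2y = sum((b - m2) * (c - my) for b, c in zip(x2, y))
    return (s22 * s1y - s12 * s2y) / (s11 * s22 - s12 ** 2)

# Made-up data; x1 plays the role of log(BW), x2 of Tsleep, y of log(G).
x1 = [1, 2, 3, 4, 5, 6]
x2 = [2, 1, 4, 3, 6, 5]
y = [3.1, 3.9, 7.2, 7.8, 11.1, 11.9]

direct = coef_of_first(x1, x2, y)
intercept, partial = simple_fit(residuals(x2, x1), residuals(x2, y))
print("multiple-regression coefficient of x1:", direct)
print("residual-on-residual slope:", partial, "intercept:", intercept)
```

Both residual series average to zero, since each comes from a regression with an intercept, and a fitted line always passes through the point of means.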
Some people worry about interactions of predictors. The general method for doing this is to add a product term. In the animal regression of log(G) on {log(W), Tsleep}, consider also the regression of log(G) on {log(W), Tsleep, log(W) × Tsleep}. That is, the regression is extended with an ordinary product. For this example, the improvement in $R^2$ is trifling, and the new variable (the product) is not statistically significant. With a p-value of 0.0 it does look interesting.

Warning: If you do variable selection and you decide to keep the log(W) × Tsleep interaction, you must also keep the predictors log(W) and Tsleep. In other words, never consider regressing log(G) on {log(W), log(W) × Tsleep}.

In a multiple regression, the t statistics are used to test hypotheses about individual coefficients. If the model is

$$Y_i = \beta_0 + \beta_H H_i + \beta_K K_i + \beta_Q Q_i + \beta_V V_i + \varepsilon_i,$$

then the t statistic printed in the line for $b_Q$ (the estimate of $\beta_Q$) tests the hypothesis $H_0(Q)$: $\beta_Q = 0$ against the alternative $H_1(Q)$: $\beta_Q \ne 0$. This must be interpreted in the context of this regression and with the predictors {H, K, V}. This is most definitely not independent of the tests on $\beta_H$, $\beta_K$, or $\beta_V$.

Suppose instead that you wished to test the hypothesis $H_0$: $\beta_H = \beta_K$ against the alternative $H_1$: $\beta_H \ne \beta_K$. This kind of inference is not automated in Minitab. We can get it, but we'll have to work for it. As a side note, you can only consider $\beta_H = \beta_K$ if variables H and K are in the same units.

We are going to use the partial F test. This is explained neatly in Chatterjee and Hadi.

Let's suppose that we have a full model and a reduced model. In our example,

the full model is $Y_i = \beta_0 + \beta_H H_i + \beta_K K_i + \beta_Q Q_i + \beta_V V_i + \varepsilon_i$

the reduced model is $Y_i = \beta_0 + \beta_S S_i + \beta_Q Q_i + \beta_V V_i + \varepsilon_i$, where $S_i = H_i + K_i$
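A requirement for using this partial F test is that the reduced model must be a sub-model of the full model. This means that there is some choice for the parameters in the full model that produces the reduced model.

Start from

$$Y_i = \beta_0 + \beta_H H_i + \beta_K K_i + \beta_Q Q_i + \beta_V V_i + \varepsilon_i$$

and then make the choice that $\beta_H = \beta_K$ to write

$$Y_i = \beta_0 + \beta_H H_i + \beta_H K_i + \beta_Q Q_i + \beta_V V_i + \varepsilon_i$$

This can be regrouped as

$$Y_i = \beta_0 + \beta_H (H_i + K_i) + \beta_Q Q_i + \beta_V V_i + \varepsilon_i$$

Now renaming $H_i + K_i$ as $S_i$ and also letting $\beta_S$ be another name for the assumed common value of $\beta_H$ and $\beta_K$ will produce the reduced model.

Do regressions for both the full model and the reduced model. Observe that

$$SS_{\text{regression}}(\text{full}) \ge SS_{\text{regression}}(\text{reduced})$$

$$SS_{\text{resid}}(\text{full}) \le SS_{\text{resid}}(\text{reduced})$$

$$SS_{\text{regression}}(\text{full}) - SS_{\text{regression}}(\text{reduced}) = SS_{\text{resid}}(\text{reduced}) - SS_{\text{resid}}(\text{full})$$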
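The "two runs, then assemble" procedure for the partial F test can be sketched as follows. Everything here is an illustration under stated assumptions: the data are simulated (with the true $\beta_H = \beta_K$), the helper `ols_ss_resid` is a hypothetical name, and the least-squares fit is done by solving the normal equations with plain Gaussian elimination so that only the standard library is needed. The statistic assembled at the end is the residual sum of squares difference, divided by the number of restricted parameters, over the full-model mean square residual.

```python
import random

def ols_ss_resid(columns, y):
    """Fit y on an intercept plus the given predictor columns by least squares
    (normal equations, Gaussian elimination) and return the residual SS."""
    n = len(y)
    X = [[1.0] + [col[i] for col in columns] for i in range(n)]
    p = len(X[0])
    A = [[sum(X[i][r] * X[i][s] for i in range(n)) for s in range(p)] for r in range(p)]
    c = [sum(X[i][r] * y[i] for i in range(n)) for r in range(p)]
    for k in range(p):                      # forward elimination with pivoting
        piv = max(range(k, p), key=lambda r: abs(A[r][k]))
        A[k], A[piv] = A[piv], A[k]
        c[k], c[piv] = c[piv], c[k]
        for r in range(k + 1, p):
            f = A[r][k] / A[k][k]
            for s in range(k, p):
                A[r][s] -= f * A[k][s]
            c[r] -= f * c[k]
    b = [0.0] * p
    for k in reversed(range(p)):            # back substitution
        b[k] = (c[k] - sum(A[k][s] * b[s] for s in range(k + 1, p))) / A[k][k]
    return sum((y[i] - sum(X[i][j] * b[j] for j in range(p))) ** 2 for i in range(n))

random.seed(0)
n = 30
H, K, Q, V = ([random.gauss(0, 1) for _ in range(n)] for _ in range(4))
# Simulated response in which beta_H = beta_K = 2 really holds.
y = [1 + 2 * h + 2 * k + 0.5 * q - v + random.gauss(0, 0.5)
     for h, k, q, v in zip(H, K, Q, V)]

S = [h + k for h, k in zip(H, K)]           # the combined predictor H + K
ss_full = ols_ss_resid([H, K, Q, V], y)
ss_reduced = ols_ss_resid([S, Q, V], y)

nu = 1                                      # five parameters versus four
df_resid_full = n - 5
partial_f = ((ss_reduced - ss_full) / nu) / (ss_full / df_resid_full)
print("SS_resid full:", ss_full, "reduced:", ss_reduced)
print("partial F:", partial_f, "on", (nu, df_resid_full), "degrees of freedom")
```

Because both fits share the same total sum of squares, the residual difference used here equals the regression sum of squares difference in the last identity above. Turning the statistic into a p-value still needs an F distribution table or a statistics package.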
The partial F statistic can be written several ways:

$$F = \frac{\{SS_{\text{regression}}(\text{full}) - SS_{\text{regression}}(\text{reduced})\} / \nu}{SS_{\text{resid}}(\text{full}) / DF_{\text{resid}}(\text{full})} = \frac{\{SS_{\text{resid}}(\text{reduced}) - SS_{\text{resid}}(\text{full})\} / \nu}{SS_{\text{resid}}(\text{full}) / DF_{\text{resid}}(\text{full})}$$

$$= \frac{\{SS_{\text{regression}}(\text{full}) - SS_{\text{regression}}(\text{reduced})\} / \nu}{MS_{\text{resid}}(\text{full})} = \frac{\{SS_{\text{resid}}(\text{reduced}) - SS_{\text{resid}}(\text{full})\} / \nu}{MS_{\text{resid}}(\text{full})}$$

In every form, the numerator is based on a sum of squares difference. How much better is the fit with the full model?

The value $\nu$ is the number of parameters that distinguish $H_0$ from $H_1$. For this example,

the full model has $E(Y_i) = \beta_0 + \beta_H H_i + \beta_K K_i + \beta_Q Q_i + \beta_V V_i$ and it takes five parameters to specify this ($\beta_0, \beta_H, \beta_K, \beta_Q, \beta_V$)

the reduced model has $E(Y_i) = \beta_0 + \beta_S S_i + \beta_Q Q_i + \beta_V V_i$ and it takes four parameters to specify this ($\beta_0, \beta_S, \beta_Q, \beta_V$)

For this example, $\nu = 1$.

The denominator of the partial F statistic is the mean square residual from the full model. This denominator estimates $\sigma^2$ whether $H_0$ is true or not.

The degrees of freedom for the partial F are $(\nu, DF_{\text{resid}}(\text{full}))$. The rule is to reject $H_0$ at level $\alpha$ if the partial F equals or exceeds $F^{\alpha}_{\nu,\,DF_{\text{resid}}(\text{full})}$, the upper $\alpha$ point for the F distribution with $(\nu, DF_{\text{resid}}(\text{full}))$ degrees of freedom.

We are out of the habit of looking up cutoff points for the F distribution because most software prints the p-value. We would just reject at level $\alpha$ if $p \le \alpha$.

The partial F is not arranged by Minitab. You have to do two runs and assemble this yourself. This also means that you won't get a p-value. You could then use printed tables of the F distribution, which give the upper 5% and 1% points. Since Minitab is already active for the work you're doing, you can get the p-value yourself.

Let's suppose that your partial F works out to the value 7. with (, 7) degrees of freedom. Use Calc ⇒ Probability Distributions ⇒ F and then fill in the resulting panel as follows:

This starts with the default radio button setting at Cumulative probability, which is exactly what you want. The output is this:

[Minitab output: Cumulative Distribution Function, F distribution with the numerator and denominator DF as entered; columns x and P( X <= x ).]

The output gives the probability to the left of your partial F value. The probability to the right (corresponding to the rejection region for $H_0$ versus $H_1$) is then one minus that value. You can report the p
Lecture 7. Confidence Intervals and Hypothesis Tests in the Simple CLR Model
Lecture 7. Cofdece Itervals ad Hypothess Tests the Smple CLR Model I lecture 6 we troduced the Classcal Lear Regresso (CLR) model that s the radom expermet of whch the data Y,,, K, are the outcomes. The
More informationThe equation is sometimes presented in form Y = a + b x. This is reasonable, but it s not the notation we use.
INTRODUCTORY NOTE ON LINEAR REGREION We have data of the form (x y ) (x y ) (x y ) These wll most ofte be preseted to us as two colum of a spreadsheet As the topc develops we wll see both upper case ad
More informationSTATISTICAL PROPERTIES OF LEAST SQUARES ESTIMATORS. x, where. = y - ˆ " 1
STATISTICAL PROPERTIES OF LEAST SQUARES ESTIMATORS Recall Assumpto E(Y x) η 0 + η x (lear codtoal mea fucto) Data (x, y ), (x 2, y 2 ),, (x, y ) Least squares estmator ˆ E (Y x) ˆ " 0 + ˆ " x, where ˆ
More informationENGI 3423 Simple Linear Regression Page 12-01
ENGI 343 mple Lear Regresso Page - mple Lear Regresso ometmes a expermet s set up where the expermeter has cotrol over the values of oe or more varables X ad measures the resultg values of aother varable
More informationSimple Linear Regression
Statstcal Methods I (EST 75) Page 139 Smple Lear Regresso Smple regresso applcatos are used to ft a model descrbg a lear relatoshp betwee two varables. The aspects of least squares regresso ad correlato
More informationMultiple Regression. More than 2 variables! Grade on Final. Multiple Regression 11/21/2012. Exam 2 Grades. Exam 2 Re-grades
STAT 101 Dr. Kar Lock Morga 11/20/12 Exam 2 Grades Multple Regresso SECTIONS 9.2, 10.1, 10.2 Multple explaatory varables (10.1) Parttog varablty R 2, ANOVA (9.2) Codtos resdual plot (10.2) Trasformatos
More informationOrdinary Least Squares Regression. Simple Regression. Algebra and Assumptions.
Ordary Least Squares egresso. Smple egresso. Algebra ad Assumptos. I ths part of the course we are gog to study a techque for aalysg the lear relatoshp betwee two varables Y ad X. We have pars of observatos
More informationObjectives of Multiple Regression
Obectves of Multple Regresso Establsh the lear equato that best predcts values of a depedet varable Y usg more tha oe eplaator varable from a large set of potetal predctors {,,... k }. Fd that subset of
More informationChapter 13 Student Lecture Notes 13-1
Chapter 3 Studet Lecture Notes 3- Basc Busess Statstcs (9 th Edto) Chapter 3 Smple Lear Regresso 4 Pretce-Hall, Ic. Chap 3- Chapter Topcs Types of Regresso Models Determg the Smple Lear Regresso Equato
More informationSummary of the lecture in Biostatistics
Summary of the lecture Bostatstcs Probablty Desty Fucto For a cotuos radom varable, a probablty desty fucto s a fucto such that: 0 dx a b) b a dx A probablty desty fucto provdes a smple descrpto of the
More informationEconometric Methods. Review of Estimation
Ecoometrc Methods Revew of Estmato Estmatg the populato mea Radom samplg Pot ad terval estmators Lear estmators Ubased estmators Lear Ubased Estmators (LUEs) Effcecy (mmum varace) ad Best Lear Ubased Estmators
More informationMultiple Choice Test. Chapter Adequacy of Models for Regression
Multple Choce Test Chapter 06.0 Adequac of Models for Regresso. For a lear regresso model to be cosdered adequate, the percetage of scaled resduals that eed to be the rage [-,] s greater tha or equal to
More informationECON 482 / WH Hong The Simple Regression Model 1. Definition of the Simple Regression Model
ECON 48 / WH Hog The Smple Regresso Model. Defto of the Smple Regresso Model Smple Regresso Model Expla varable y terms of varable x y = β + β x+ u y : depedet varable, explaed varable, respose varable,
More information12.2 Estimating Model parameters Assumptions: ox and y are related according to the simple linear regression model
1. Estmatg Model parameters Assumptos: ox ad y are related accordg to the smple lear regresso model (The lear regresso model s the model that says that x ad y are related a lear fasho, but the observed
More informationStatistics: Unlocking the Power of Data Lock 5
STAT 0 Dr. Kar Lock Morga Exam 2 Grades: I- Class Multple Regresso SECTIONS 9.2, 0., 0.2 Multple explaatory varables (0.) Parttog varablty R 2, ANOVA (9.2) Codtos resdual plot (0.2) Exam 2 Re- grades Re-
More informationMultiple Linear Regression Analysis
LINEA EGESSION ANALYSIS MODULE III Lecture - 4 Multple Lear egresso Aalyss Dr. Shalabh Departmet of Mathematcs ad Statstcs Ida Isttute of Techology Kapur Cofdece terval estmato The cofdece tervals multple
More informationresidual. (Note that usually in descriptions of regression analysis, upper-case
Regresso Aalyss Regresso aalyss fts or derves a model that descres the varato of a respose (or depedet ) varale as a fucto of oe or more predctor (or depedet ) varales. The geeral regresso model s oe of
More informationUNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS
UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS Exam: ECON430 Statstcs Date of exam: Frday, December 8, 07 Grades are gve: Jauary 4, 08 Tme for exam: 0900 am 00 oo The problem set covers 5 pages Resources allowed:
More informationSTA302/1001-Fall 2008 Midterm Test October 21, 2008
STA3/-Fall 8 Mdterm Test October, 8 Last Name: Frst Name: Studet Number: Erolled (Crcle oe) STA3 STA INSTRUCTIONS Tme allowed: hour 45 mutes Ads allowed: A o-programmable calculator A table of values from
More informationClass 13,14 June 17, 19, 2015
Class 3,4 Jue 7, 9, 05 Pla for Class3,4:. Samplg dstrbuto of sample mea. The Cetral Lmt Theorem (CLT). Cofdece terval for ukow mea.. Samplg Dstrbuto for Sample mea. Methods used are based o CLT ( Cetral
More informationChapter 14 Logistic Regression Models
Chapter 4 Logstc Regresso Models I the lear regresso model X β + ε, there are two types of varables explaatory varables X, X,, X k ad study varable y These varables ca be measured o a cotuous scale as
More informationLinear Regression with One Regressor
Lear Regresso wth Oe Regressor AIM QA.7. Expla how regresso aalyss ecoometrcs measures the relatoshp betwee depedet ad depedet varables. A regresso aalyss has the goal of measurg how chages oe varable,
More informationExample: Multiple linear regression. Least squares regression. Repetition: Simple linear regression. Tron Anders Moger
Example: Multple lear regresso 5000,00 4000,00 Tro Aders Moger 0.0.007 brthweght 3000,00 000,00 000,00 0,00 50,00 00,00 50,00 00,00 50,00 weght pouds Repetto: Smple lear regresso We defe a model Y = β0
More informationLecture Notes Types of economic variables
Lecture Notes 3 1. Types of ecoomc varables () Cotuous varable takes o a cotuum the sample space, such as all pots o a le or all real umbers Example: GDP, Polluto cocetrato, etc. () Dscrete varables fte
More informationESS Line Fitting
ESS 5 014 17. Le Fttg A very commo problem data aalyss s lookg for relatoshpetwee dfferet parameters ad fttg les or surfaces to data. The smplest example s fttg a straght le ad we wll dscuss that here
More informationChapter Business Statistics: A First Course Fifth Edition. Learning Objectives. Correlation vs. Regression. In this chapter, you learn:
Chapter 3 3- Busess Statstcs: A Frst Course Ffth Edto Chapter 2 Correlato ad Smple Lear Regresso Busess Statstcs: A Frst Course, 5e 29 Pretce-Hall, Ic. Chap 2- Learg Objectves I ths chapter, you lear:
More informationLecture 8: Linear Regression
Lecture 8: Lear egresso May 4, GENOME 56, Sprg Goals Develop basc cocepts of lear regresso from a probablstc framework Estmatg parameters ad hypothess testg wth lear models Lear regresso Su I Lee, CSE
More informationChapter 8. Inferences about More Than Two Population Central Values
Chapter 8. Ifereces about More Tha Two Populato Cetral Values Case tudy: Effect of Tmg of the Treatmet of Port-We tas wth Lasers ) To vestgate whether treatmet at a youg age would yeld better results tha
More informationProbability and. Lecture 13: and Correlation
933 Probablty ad Statstcs for Software ad Kowledge Egeers Lecture 3: Smple Lear Regresso ad Correlato Mocha Soptkamo, Ph.D. Outle The Smple Lear Regresso Model (.) Fttg the Regresso Le (.) The Aalyss of
More informationMean is only appropriate for interval or ratio scales, not ordinal or nominal.
Mea Same as ordary average Sum all the data values ad dvde by the sample sze. x = ( x + x +... + x Usg summato otato, we wrte ths as x = x = x = = ) x Mea s oly approprate for terval or rato scales, ot
More informationUNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS
UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS Postpoed exam: ECON430 Statstcs Date of exam: Jauary 0, 0 Tme for exam: 09:00 a.m. :00 oo The problem set covers 5 pages Resources allowed: All wrtte ad prted
More informationChapter 8: Statistical Analysis of Simulated Data
Marquette Uversty MSCS600 Chapter 8: Statstcal Aalyss of Smulated Data Dael B. Rowe, Ph.D. Departmet of Mathematcs, Statstcs, ad Computer Scece Copyrght 08 by Marquette Uversty MSCS600 Ageda 8. The Sample
More informationStatistics MINITAB - Lab 5
Statstcs 10010 MINITAB - Lab 5 PART I: The Correlato Coeffcet Qute ofte statstcs we are preseted wth data that suggests that a lear relatoshp exsts betwee two varables. For example the plot below s of
More informationX X X E[ ] E X E X. is the ()m n where the ( i,)th. j element is the mean of the ( i,)th., then
Secto 5 Vectors of Radom Varables Whe workg wth several radom varables,,..., to arrage them vector form x, t s ofte coveet We ca the make use of matrx algebra to help us orgaze ad mapulate large umbers
More informationLecture 2: Linear Least Squares Regression
Lecture : Lear Least Squares Regresso Dave Armstrog UW Mlwaukee February 8, 016 Is the Relatoshp Lear? lbrary(car) data(davs) d 150) Davs$weght[d]
More informationRecall MLR 5 Homskedasticity error u has the same variance given any values of the explanatory variables Var(u x1,...,xk) = 2 or E(UU ) = 2 I
Chapter 8 Heterosedastcty Recall MLR 5 Homsedastcty error u has the same varace gve ay values of the eplaatory varables Varu,..., = or EUU = I Suppose other GM assumptos hold but have heterosedastcty.
More informationENGI 4421 Joint Probability Distributions Page Joint Probability Distributions [Navidi sections 2.5 and 2.6; Devore sections
ENGI 441 Jot Probablty Dstrbutos Page 7-01 Jot Probablty Dstrbutos [Navd sectos.5 ad.6; Devore sectos 5.1-5.] The jot probablty mass fucto of two dscrete radom quattes, s, P ad p x y x y The margal probablty
More information( ) = ( ) ( ) Chapter 13 Asymptotic Theory and Stochastic Regressors. Stochastic regressors model
Chapter 3 Asmptotc Theor ad Stochastc Regressors The ature of eplaator varable s assumed to be o-stochastc or fed repeated samples a regresso aalss Such a assumpto s approprate for those epermets whch
More informationMultivariate Transformation of Variables and Maximum Likelihood Estimation
Marquette Uversty Multvarate Trasformato of Varables ad Maxmum Lkelhood Estmato Dael B. Rowe, Ph.D. Assocate Professor Departmet of Mathematcs, Statstcs, ad Computer Scece Copyrght 03 by Marquette Uversty
More informationECONOMETRIC THEORY. MODULE VIII Lecture - 26 Heteroskedasticity
ECONOMETRIC THEORY MODULE VIII Lecture - 6 Heteroskedastcty Dr. Shalabh Departmet of Mathematcs ad Statstcs Ida Isttute of Techology Kapur . Breusch Paga test Ths test ca be appled whe the replcated data
More informationSTA 108 Applied Linear Models: Regression Analysis Spring Solution for Homework #1
STA 08 Appled Lear Models: Regresso Aalyss Sprg 0 Soluto for Homework #. Let Y the dollar cost per year, X the umber of vsts per year. The the mathematcal relato betwee X ad Y s: Y 300 + X. Ths s a fuctoal
More informationSimple Linear Regression
Correlato ad Smple Lear Regresso Berl Che Departmet of Computer Scece & Iformato Egeerg Natoal Tawa Normal Uversty Referece:. W. Navd. Statstcs for Egeerg ad Scetsts. Chapter 7 (7.-7.3) & Teachg Materal
More informationENGI 4421 Propagation of Error Page 8-01
ENGI 441 Propagato of Error Page 8-01 Propagato of Error [Navd Chapter 3; ot Devore] Ay realstc measuremet procedure cotas error. Ay calculatos based o that measuremet wll therefore also cota a error.
More informationb. There appears to be a positive relationship between X and Y; that is, as X increases, so does Y.
.46. a. The frst varable (X) s the frst umber the par ad s plotted o the horzotal axs, whle the secod varable (Y) s the secod umber the par ad s plotted o the vertcal axs. The scatterplot s show the fgure
More information4. Standard Regression Model and Spatial Dependence Tests
4. Stadard Regresso Model ad Spatal Depedece Tests Stadard regresso aalss fals the presece of spatal effects. I case of spatal depedeces ad/or spatal heterogeet a stadard regresso model wll be msspecfed.
More informationLecture 3. Sampling, sampling distributions, and parameter estimation
Lecture 3 Samplg, samplg dstrbutos, ad parameter estmato Samplg Defto Populato s defed as the collecto of all the possble observatos of terest. The collecto of observatos we take from the populato s called
More informationMidterm Exam 1, section 2 (Solution) Thursday, February hour, 15 minutes
coometrcs, CON Sa Fracsco State Uverst Mchael Bar Sprg 5 Mdterm xam, secto Soluto Thursda, Februar 6 hour, 5 mutes Name: Istructos. Ths s closed book, closed otes exam.. No calculators of a kd are allowed..
More informationTHE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE