Lecture 1: Empirical Research and Sampling

Size: px
Start display at page:

Download "Lecture 1: Empirical Research and Sampling"

Transcription

1 Lectre Notes o Advaced coometrcs Lectre : mprcal Research ad Samplg Taash Yamao Fall Semester 5 Wh do we std coometrcs? For almost all of s ths class, we std ecoometrcs becase we eed to codct sold emprcal aalses, stead of advacg ecoometrc theor. More precsel for o, o std ecoometrcs becase o eed t as a tool to wrte a thess or dssertato to complete or program. Some research papers do ot clde ecoometrc aalses bt se coceptal framewors ad sbject-matter owledge to spport ther argmets. The ma add descrptve data wth some basc tables ad fgres. I most cases academc stdes, however, coceptal ad descrptve data aalses ma ot be eogh. Recet ears, data are becomg avalable more tha before ad powerfl ecoometrc pacages are easer to se tha ever. Ths, t s mch easer for researchers to test ther hpotheses emprcall wth good data ad sophstcated ecoometrcs methods. Ths, there are fewer ecses for ot codctg ecoometrc-aalses. Althogh ecoometrcs ca be a powerfl tool to test hpotheses, t s qte eas to msse t. To covce readers that o have terestg ad relable reslts, o frst eed to covce them that o have arrved robst coclsos based o sod data ad well-thoght-ot methods. I hope ths corse wll help o o ths pot. Ths corse ses two tetboos: Itrodctor coometrcs d edto b Jeffre Wooldrge ad coometrcs Aalss 5 th edto b Wllam H. Greee. The frst boo s wrtte for emprcal researchers who wat to lear how to codct ecoometrc aalses. As the sb-ttle of the frst boo -- A Moder Approach -- sggests, ths boo also covers recet developmets appled ecoometrcs. The feld of Appled coometrcs s developg rapdl de to mprovemets persoal compters ad epadg avalablt of data. B sg ecoometrc-pacages that are crretl avalable the maret, t does ot tae mch tme to estmate complcated models. Large data are avalable throgh Iteret or are sold CD-ROMs. Ths meas that a wde varet of methods are avalable to o. Prevosl, those advaced methods are ol avalable to people who had advaced owledge o ecoometrcs. Ths there was lttle eed for trodctor tetboos to epla advaced methods. However, becase those advaced methods are crretl avalable from software pacages, sch as STATA, there s a eed for a trodctor

2 tetboo to epla advaced methods for people who do ot have advaced owledge o ecoometrcs. Wooldrdge realzes ths ad has wrtte the tetboo. However, becase ths tetboo covers ma methods a sgle volme, t does ot epla theor detals. Ths, I se the secod tetboo Greee s coometrc Aalss to std ecoometrc theor detals. specall, the secod boo eplas ecoometrc theor matr. coometrc theor ca be eplaed clearl wth matr as o wll fd the lectres. I wll provde o a basc trag matr calclato ths corse. I start m lectre o how to codct emprcal projects becase the ma prpose of ths corse s to provde o a tool to wrte emprcal papers. I th t s mportat for o to have a clear dea o how o are gog to se ecoometrcs or research. Carrg ot a mprcal Project Posg a qesto There are some was to pose a research qesto: New Theor New Isses New Data New Methods Chaged or Dfferet vromets New Theor: Yo ma pose ths tpe of qestos whe o come p wth a ew theor to loo at a sse. Yo ma provde a formal theor to a old sse. For stace, people ew that edcated people gaed some retrs from edcato eve before Chcago ecoomsts, sch as T.W. Shltz ad Gar Becer. However, Chcago ecoomsts created a ew theor of hma captal b coceptalzg edcato as vestmet hma captal. New Isses: Yo ma pose ths tpe of qestos whe o recogze emergg sses. These sses ma have ested before, bt o ma arge that these sses have ga mportace. Iteratoal trade ad face were ot mportat whe t was dffclt to trasfer goods or moe across cotres. Bt toda, t s ver dffclt to fd a place wthot teratoall-traded goods the world. As a reslt, teratoal trade ad face sses have gaed ther mportace ecoomc theor. New Data: Althogh ma data sets have become avalable recetl, ths s a relatvel ew pheomeo. There are ma emprcal sses that have ot bee eamed emprcall becase approprate data were ot avalable before. Good data are hard to come b. So whe o come across a ewl avalable data wth hgh qalt, o shold wrte a paper b sag Ths paper ses a ewl avalable data set whch eables s to overcome lmtatos prevos stdes

3 New Methods: o ca advace the owledge amog researchers b applg a ew ad more approprate method o old sses. B stdg ecoometrcs, o wll be able to appl advaced methods o old sses. Chaged or Dfferet vromets: hs s the most commo practce. Ths s to appl a estg research qesto ad method a chaged over tme evromet or dfferet regos or cotres. Yo ma codct a old research whe o sspect that the evromet has chaged over tme ad that reslts wold be dfferet from prevos oes. Or o ma codct a estg research a dfferet place or cotr. Bt how do o ow f or dea s reall ew? The aswer: Dg the lteratre. Lteratre Revew The most commo place to loo for research qestos s papers jorals we call them lteratre. It trs ot that o are ot the ol oe who eeds to codct ecoomc aalses. Ma people have doe or are dog eactl what o are dog, ad o ca fd other people s efforts ther pblshed papers jorals, boos, ad o Iteret. Althogh o ma wat to come p wth a brad-ew-qesto that obod has thoght abot t before, there are few qestos that people have ot thoght abot. Chaces are that some ecoomsts have thoght abot or seemgl-brad-ew-deas ma ears ago, f ot ma decades ago, ad have doe fe aalses. As o revew lteratre, tr to arrow dow or focs ad fd a le of research that s o the same research qesto. For stace, o ma wat to std edcato developg cotres ad start readg abot t. As o go throgh prevos stdes o wll fd stdes o a the effects of edcato o farm prodctvt, b the effects of mothers edcato o chld health, c the ecoomc retrs of edcato labor marets, d the effects of school qalt o test-scorg, ad so o. ach oe of them cold be called a le of research. At the ed, alog the le of research, o shold loo for a gap the lteratre. Ad or paper, o shold be able to sa: Prevos stdes have fod these reslts before, bt we stll do ot ow the aswer to ths qesto. The prpose of ths paper, therefore, s to fd a aswer to ths qesto. I or aalss, o do ot eed to cover all of the prevos stdes o the sse o the le of research that o are terested. If o are wrtg a joral paper, or readers are sall eperts o the sbject. Probabl the ow abot prevos stdes better tha o do. Ths, o jst eed to meto relevat stdes to detf a gap the lteratre that o are gog to fll. 3

4 Data Collecto Hgh qalt data ca lead researchers to hgh qalt emprcal research. coometrcs ca help o to prodce hgh qalt emprcal research from hgh qalt data. Bt ecoometrcs ca ot help o to prodce hgh qalt emprcal research from low qalt data. The qalt of the data draws the pper lmt o the qalt of research. A emprcal aalss sg low qalt data wth ver advaced ecoometrc techqes s ot sefl practcall, eve f the techqe tself s mpressve. O the other had, a smple aalss wth hgh qalt data ca provde sefl formato. I am a strog belever of the latter method. Large data sets are sefl bt ot ecessarl better tha small data becase ofte large data sets lac detaled formato, althogh large observatos ca help arrowg stadard errors. More ad more data are becomg avalable from Iteret. Ths s great. Bt whe o se sch data, o frst eed to derstad what o get. It s dageros to se data wthot a fll owledge o how the data were collected ad arraged. Whe o collect data orself from felds or from secodar data sorces, o shold th ver carefll f o wll be able to aswer or research qestos wth the data o collect. If ot, o eed to search for dfferet data or chage or research qestos. Otherwse after ma moths or ears of efforts, o wold be told b or advsers or revewers of or paper that Well, t seems o ca t aswer or qestos wth or data coometrc Aalss The prpose of or research s to fd aswers to or research qestos ad covce people readers that o have relable reslts. To acheve or goal, o ma ot eed the most advaced ecoometrc methods. For the most cases, o probabl do ot eed the most advaced ecoometrc methods. Smple methods are sall sffcet ad powerfl. Yo shold, however, std advaced methods for followg reasos. Frst, o shold be able to choose the most approprate method amog all methods. Secod, o shold have a fll derstadg o the method of or choce. Thrd, o shold recogze the lmtatos of the method. Wrtg a mprcal Paper After or aalses, o eed to smmarze or fdgs a cocse maer. A short paper s desrable. Based o m lmted epereces, I wold sggest: a b Start wth a short paper wth a specfc qesto Wor wth people who have pblcatos 4

5 c d Mmc other papers Keep o wrtg; f o have bee stc oe secto of or paper, o shold start wrtg other sectos Wrtg a emprcal paper s ver dffclt. Bt o are ot aloe. Do t be dscoraged! Referece Brorse, B. Wade. 987 Observatos o the Joral Pblcato Process, North Cetral Joral of Agrcltral coomcs, Vol. 9, No.. O glsh Wrtg, I sggest: Str ad Whte The lemets of Stle, ew edto, Massachsetts: Ala ad Baco. [Ths s the classc.] O Coer, P.T. 996 Woe Is I: The grammarphobe s gde to better glsh pla glsh, New Yor: Pttam s Sos Pblshers. [Ths s f to read.] Data Ch. Wooldrdge; Ch. Lohr, 999 Oce o have research qestos, o eed to collect data. Yo ma collect data from secodar data sorces sch as offcal statstcs from govermets or other stttos, or o ma collect data orself wth or colleages. There are several tpes of data: Tme Seres Data Cross-Sectoal Data Pooled Cross Sectos Pael or Logtdal Data Whe o cosder sg a data set, o eed to derstad the followg: Observato Ut: a object o whch a measremet s tae Target poplato: the complete collecto of observatos we wat to std Sample: a sbset of poplato Sampled poplato: the poplato from whch the sample was tae Samplg t: the t of actal sample Samplg frame: the lst of samplg ts Yo eed to fd the data that wll eable o to fd aswers to or research qestos. For stace, f o are terested dvdal behavor, o wat to have dvdallevel data. Yo ma th ths s obvos bt ma cases o ma ot fd dvdallevel data avalable to o. Whe o caot fd dvdal-level data to std dvdal behavor, o ma be forced to se aggregated data. Sppose, for stace, that o are terested wrtg a research paper o the mpacts of a droght o farm prodcto bt ol have access to aggregated data. To carr ot a aalss, o eed to ow how the data were collected from farmers. Whe was the srve codcted? Before or after the droght? Who were the target poplato? 5

6 Cotr, some regos, or some dstrcts? Were hard-ht areas clded the sampled poplato? If ot, there s o pot of sg the data. Were hard-ht areas combed wth ot-so-hard-ht areas? Farmers ot-so-hard-ht areas mght have ejoed hgh otpt prces, created b the droght. Wthot a good derstadg o data, ecoometrc aalses caot provde good aalses. Some Importat Cocepts Casalt ad Ceters Parbs I scetfc research, researchers create epermetal evromet labs. I sch evromets, researchers ca carefll cotrol factors, sch as temperatre or hmdt. Uder carefll cotrolled evromets, t s relatvel eas to chage oe varable, holdg other varables costat, ad eame a reacto a object. Becase all of the other varables are held costat, ceters parbs, t s eas ad relable to fd a casal effect of the varable, whch was chaged, o the object. I socal scece, sch as ecoomcs, t s mpossble to cotrol the socal evromet. What we ca do s to observe chages people s behavor der smltaeos chages ma factors. Whe ma factors chage at the same tme, we ma fd two varables chagg the same drecto. Ths cold be jst a cocdece a assocato. Or ths cold be a casal effect of oe factor to aother a casal effect. For stace, we fd that the demad for ar-codtoers was hgh drg the past smmer ad that the hosehold come was also hgh at the same tme. Ths, o fd a assocato betwee the demad for ar-codtoers ad the hosehold come. Bt o also fd that the temperatre was ver hgh the past smmer. Ths, t s possble that the demad for ar-codtoer was hgh ot becase of the hgh come bt becase of the hgh temperatre. Ths, the hosehold come dd ot have a casal effect o the demad for the ar-codtoers bt the hgh temperatre had a casal effect. Dstgshg a assocato from a casal effect s oe of the most dffclt sses emprcal research. Omtted Varables Wth ecoometrcs, what we hope to do s to solate a effect of oe varable from other effects of other varables. B cldg all of the varables that affect a otcome, o are hopg to cotrol for other factors. A major problem s that o ofte do ot have formato o some mportat varables. We call those varables observed varables. Some of observed varables are observable bt ot observed, whle other varables cold smpl be observable. People s ablt ad taste are eamples of observable varables. 6

7 Whe there are observed varables, those varables wll be omtted from ecoometrcs aalses. Whe mportat varables are omtted from ecoometrc models, omtted varables ma case bases estmated coeffcets the omtted varables problem. For stace, sppose that o wat to measre a casal effect of agrcltral credt o farm prodctvt. Yo have formato o farm prodctvt ad agrcltral credt, bt o do ot have formato o the edcato of farmers. Sppose o fd a postve assocato betwee farm prodctvt ad agrcltral credt: the hgher the amot of agrcltral credt, the hgher the farm prodctvt. Bt t s possble that the credt s gve to edcated farmers whose farm prodctvt s hgh eve wthot the credt. I ths case, we wold fd a postve assocato betwee credt ad farm prodctvt eve whe credt does ot have a postve mpacts o farm prodctvt. I ths case, what we wold le to do s to measre the mpacts of credt o farm prodctvt amog farmers wth the same level of edcato. Bt we wll ot be able to do ths wthot the edcato formato of farmers. ve whe edcato formato s avalable, we do ot have the formato o farmers ablt whch ca ot be measred fll b formal edcato geeral. The omtted varable problem s oe of the most commo ad seros problems crosssecto aalses. We wll dscss ths problem etesvel ths corse. Ad later ths corse, we wll lear methods to overcome ths problem. Smltaet The drecto of the casal effect ma ot be oe-wa. Two factors ma flece each other. I ths sese the drecto of casalt s ot clear. For stace, the prevos eample of farm prodctvt ad agrcltral credt ca be cosdered as a smltaeos problem. Oe cold arge that agrcltral credt creases farm prodctvt, whle other wold arge that agrcltral credt s gve to hghl prodctve farmers. I ma cases, we ca cosder a smltaeos problem as a omtted varables problem. For stace, the above smltaeos problem ca be cosdered as a omtted varables problem: f we have the formato o all of the characterstcs that represet farmers ablt to repa credt, the we ca compare two farmers who have the same ablt to repa bt oe s gve credt whle the other s ot. If we fd that the farmer who s gve credt has hgher farm prodctvt tha the other farmer, wthot credt, who otherwse has the same repamet ablt, the we ma coclde that there s a casal effect of credt o farm prodcto. The problem, however, s that t s dffclt to observe all of the characterstcs that represet farmers ablt as data. Ths the smltaeos problem betwee agrcltral credt ad farm prodctvt rema as a omtted varables problem of farmers ablt. For ow, I wold jst wat o to be aware of dffereces betwee a assocato ad a 7

8 casal effect. What we sall wat to ow s a casal effect of a varable, whch s ofte a mportat polc varable, ot a assocato. Ubasedess ad Precseess I coometrcs ad Statstcs, t s mportat to clearl dstgsh basedess ad precseess. A estmate cold be Based ad Imprecse, Based bt Precse, Ubased bt Imprecse, ad Ubased ad Precse Accrate. See Fgre. Lohr 999. I coometrcs ad Statstcs, we sppose that there s oe tre vale, b. Ths cold be a average mber e.g., the average come, epedtre, age or a sze of a mpact of oe varable o aother e.g., the mpact of smog oe pac of tobacco o the probablt of havg the lg cacer. B sg data ad ecoometrc methods, we estmate the average mber or the sze of the mpact. Let s deote the estmated mber as. Whe the estmated mber estmator,, s ver dfferet from the tre mber, b, for some sstematc reasos, we call the estmator based. For stace, f we sample people from a telephoe boo, we mss people who are ot regstered the telephoe boo. If the people who are ot o the telephoe boo are sstematcall dfferet from the people o the telephoe boo, the a estmator b sg data from the telephoe-boo sample wold be based from the tre vale of the etre poplato, cldg both people who are o the telephoe boo ad who are ot. The estmator from the telephoe-boo sample, however, cold be precse. For stace, as the mber of sample from the telephoe boo creases, the estmator becomes more precsel estmated. Bt becase the sample s ot represetg the etre poplato, the precsel estmated estmator s stll based. We wll dscss more o ths sse throghot ths corse. Fgre smmarzes the dscsso o basess ad precseess. A: based bt ot precse B: precse bt based C: based ad precse Fgre. Basess ad Precseess Sorce: Fgre., page 9, Lohr 999 8

9 Taash Yamao Fall Semester 5 Lectre : The Smple Regresso Model I ths lectre we revew the smple bvarate lear regresso model. We focs o statstcal assmptos to obta based estmators. A good derstadg o the assmptos of the sgle lear regresso model wll help o to derstad the mportace of each assmpto of the mltvarate regresso model. Defe the smple lear regresso SLR model: $ $ where s a depedet varable, s a depedet varable or a covarate, ad s the error term or dstrbace. We wat to obta ow two parameters $ ad $. These two parameters are called tre vales that ca ot be observed. What we ca do s to estmate these two parameters. We deote the estmated parameters as ad. We ma call them as estmated coeffcets or estmators. There are several was to estmate the two parameters. The most commo method s the least sqared method. Least Sqared Resdals Method: Defe $ ad $ : ow parameters or coeffcets ad : Least Sqared estmates b ad b : coeffcets that are sed to obta ad b ad b are two varables that cold tae a vales. We se these two varables whle we are searchg for ad. Whe the sm of sqared resdals s mmzed, b ad b wold be eqal to ad, respectvel. Defe b b s the resdal. Mmzg Problem: Fd a par of b ad b that mmzes the sm of sqared resdals: 9

10 m b b Frst order codtos F.O.C. wth respect to w.r.t b ad b are w.r.t. b w.r.t. b Notce that b ad b are replaced b ad becase the frst order dervatves are set to be zero ad. From, we have From, B sg, we have b

11 We ca orgaze ths to fd, Ths, the least sqared estmates are 3 4 Ubasedess of SLR I ths sb-secto, we eame the baasedess of the smple ler regresso SLR estmates. To show the baasedess, we eed for assmptos, whch are specfed below. We eed to derstad wh we eed each assmpto ad how the SLR estmates wll be based whe oe of the for assmptos s volated. Assmptos: SLR Lear parameters: $ $ SLR Radom samplg ad are radom SLR 3 Zero codtoal mea SLR 4 Sample varato - > From 4, we have

12 Here, we have sed SLR. 5 At ths pot, we ca see that: f the epected vale of the secod term s zero, the the epected vale of wll be eqal to, the ow tre vale. B tag the epectato of the both sdes of 5, we have 6 Net, we have mportat steps: becase s jst a costat. B the defto, we ow that Cov. Uder SLR3, we assme Cov, ths we have.

13 Ths, we have Therefore der assmptos SLR-4, the least sqared estmate s based. Whe s based, s also based. However, SLR 3 s a ver strog assmpto especall a sgle lear regresso model becase the dstrbace,, cldes so ma mportat omtted varables. Whe SLR 3 s volated, we have a omtted varable problem. We wll dscss abot the omtted varables problem the et lectre. For ow, we smpl assme that the for assmptos are satsfed. Varaces of OLS stmators For the OLS estmates to be the most effcet estmates amog ma other estmates, we eed to add oe more assmpto: SLR 5 Homosedastct: Var F The homosedastct assmpto dcates that the sze of the varace of s costat or does ot deped o. Whe ths assmpto s volated, the we sa the error term ehbts heterosedastct. Note that eve der the heterosedastct, the least sqared estmates are ot based as we have show prevosl. For ow, we assme the homosedastct ad obta the varaces of estmates. B sg 4, we have Var From SLR5, Var F Var Var [ ] σ 3

14 σ σ 7 Note: the larger the F, the larger the Var, ad the larger the varace, the smaller the Var. Chec the tetboo page 56 for the varace of the estmated costat term, Var. The Stadard rror of We ca get the stadard devato of b tag the postve sqared root of the varace: sd Var σ However, F s ow. Ths, we have to estmate F. The based estmator of F s SSR σ. Ths s adjsted b the - degrees of freedom. The degree of freedom s - becase we have observatos wth two estmators, whch clde a tercept.. Ths the stadard error of s se σ SSR / The stadard error for ca be obtaed b replacg the tet. σ wth σ eqato.58 4

15 R-sqared The R-sqared s the fracto of the sample varato that s eplaed b. R SS / SST SSR / SST SS: the eplaed sm of sqares; SSR: the resdal sm of sqares; SST: the total sm of sqares. SST SS SSR Regresso Throgh the Org Sppose we have a model sch as $. Least Sqared Approach: m b Frst order codtos F.O.C. wth respect to w.r.t b ad b are ~ w.r.t. b ~ 8 Method of Momets Approach To tae the method of momet approach, we start wth two assmptos. The frst assmpto s ot strog. Bt the secod assmpto s ver strog. For the least sqared estmates to be based, however, we stll eed to assme the same secod assmpto. 5

16 Ths, these assmptos are also eeded for least sqared estmates to be based. Both approaches reach the same estmates. Two assmptos are: 9 Ths we have two restrctos ad two ow parameters $ ad $. Ths we are able to solve these two eqatos. These are eactl the same as ad. Ths we obta the same estmates: 3 ad 4. 6

17 Lectre 3: Smple Omtted Varables Problem I ths lectre, we epad the smple lear regresso model to mltvarate ler model. The ma motvato to clde more tha oe varable to a regresso model s that t s ver dffclt to detf the effect of oe depedet varable,, o the depedet varable,, wthot cosderg the other factors. For stace, we ca measre the effect of schoolg more precsel f we compare people who have the eactl the same characterstcs, ecept the schoolg. There have bee ma stdes o tws. If we fd a dfferece wage rates betwee the two detcal people ecept the schoolg, the we ca coclde that the dfferece s de to the dfferece the schoolg. Ufortatel, socal scece, t s mpossble to fd eve two persos eactl the same ecept oe factor. ve tws are dfferet ma was. The problem of omttg mportat varables regresso aalses s called the omtted varables problem, ad ths s the core of ma problems ecoometrcs. I start ths lectre wth the smplest case of the omtted varables problem. Omtted Varables Problem: a smple case Sppose that the tre mode shold be: $ $ $. 3- Bt sppose that we have formato ol o bt ot so that we ca ol estmate a smple bvarate model wth bt ot. Accordg to the least sqared approach from the prevos lectre, we ow that the least sqared estmators are However, the tre model s 3-. Ths, we ca replace 3-3 b sg the tre model 3-, 7

18 8 As we dd before, we ca smplf the rght had sde of the eqato: Ths, b tag the epectato of both sdes of the eqato, the last term becomes zero der assmptos SLR -4, whle the secod term remas: ~ 3-4 Ths dcates that the least sqared estmate of, wll be based f the secod term of 3-4 s ot zero,.e., ad are ot correlated. Note, frther, that the last part of the secod term s smpl a least sqared estmator of a smple bvarate regresso model betwee ad : * * e ~ δ Ths, 3-4 ca be wrtte as ~ δ Therefore the bas s ~ δ. Ths, the sgs of ~ δ ad determe the drecto of the bas pward or dowward. If the two parameters, ~ δ ad, have the same sgs both

19 postve or both egatve, the the drecto of bas s postve. Bt f the parameters have the opposte sgs, the the drecto of bas s egatve. For stace, let s sa that we wat to estmate the effect of agrcltral credt o farm prodctvt bt do ot have formato o the edcato of farmers. Ths, the error term of a bvarate model cldes edcato as a omtted varable: farm_prod $ $ credt edc Becase edc s correlated wth farm_prod ad credt postvel, postve. The drecto of a potetal bas s postve. ~ adδ are both ~ The magtde of the bas depeds o the absolte vales of ad δ. If the correlatos betwee ad ad betwee ad are strog, the the sze of the bas becomes bgger. O the other had, f oe of the two correlatos s wea the sze of the bas becomes small. ample 3-: Idvdal No-farm Icome Ugada I rral areas of Ugada, o-farm come provdes mch eeded cash to farm hoseholds. Most edcated me ad wome rral areas have opporttes to hold reglar jobs, mag a costat mothl wage throghot a ear. Other people who are ot fortate eogh to hold reglar jobs are lel to ear o-farm come from small self-emploed bsesses sch as mag basets or tradg goods. Sppose that we are terested the geder dffereces o-farm come ad wat to test a hpothess that wome mae less o-farm come tha me. Bt a smple comparso of o-farm come betwee me ad wome does ot provde a relable test for ths hpothess f characterstcs of me ad wome are ot smlar. Oe major factor o-farm come s edcato. If me are better edcated tha wome ad me mae more o-farm come tha wome, we ca ot be sre f the hgh o-farm come s a reslt of geder or edcato. For eample, let s se the data from Ugada. The data are collected b FASID collaborato wth Maerere Uverst Ugada 3. The data come from 94 hoseholds. Amog them, we fd 648 people who eared some come from o-farm actvtes. Table dcates average o-farm come US$, average schoolg ears, age, ad observatos for me ad wome. Table. Idvdal No-farm Icome Ugada 9

20 table female, cmea ocus mea ed mea age age f%8.f row; female meaocus meaed meaage Nage Total As o ca see Table, me are slghtl better edcated ad mae more o-farm come. To compare o-farm come betwee me ad wome whle holdg edcato levels costat, we have created categores for edcato: o edcato categor, -4 ears of schoolg, 5-7 ears of schoolg, 8- ears of schoolg 3, ad more tha ears of schoolg 4. The we have calclated average o-farm come for each categor for me ad wome separatel. Althogh wome mae less o-farm come tha me geeral, t-tests dcate that the dfferece s statstcall sgfcat at the 5 percet level ol for Categor 3. Table. No-farm Icome b Geder ad dcato. table edcat female, cmea ocus f%8.f row; female edcat Dfferece p-vale Total Now, let s cosder whch drecto the bas wold be f we estmate the o-farm come eqato wth a female dmm bt ot edcato. dcato the depedet varable s postvel correlated wth edcato the omtted varable, ad edcato the omtted varable s egatvel correlated wth the female dmm the depedet varable. Ths, accordg to the theor above, the drecto of the bas shold be egatve. lo-farm come $ $ female edc. reg loc female Sorce SS df MS Nmber of obs F, Model Prob > F. Resdal R-sqared Adj R-sqared.84 Total Root MS

21 loc Coef. Std. rr. t P> t [95% Cof. Iterval] female _cos reg loc female ed Sorce SS df MS Nmber of obs F, Model Prob > F. Resdal R-sqared Adj R-sqared.894 Total Root MS loc Coef. Std. rr. t P> t [95% Cof. Iterval] female ed _cos The reslts the frst model dcate that wome mae abot 63 percet less o-farm come tha me, bt the reslts the secod model dcate that the dfferece s abot 55 percet. Ths, the reslts dcate that the estmated coeffcet was based dowward whe ed was ot clded. Of corse, there are probabl ma other varables that shold be clded ths model to estmate the geder dfferece more precsel. We wll come bac to ths model later ths corse. d of ample Mltvarate Lear Regresso Model: Illstrato It s clear that we eed to clde more tha oe varable regresso models to tae to accot ma factors. How ma varables shold be clded? We wll std how we shold select varables based o derlg ecoomc theor ad emprcal aalses later. For ow, let s cosder a geeral case wth depedet varables pls a tercept. A mltvarate ler regresso model wth -depedet varables s $ $ $. $. for,,. where,, ad are observato s, ad. As the bvarate model, let s start wth the least sqared resdals approach, whch s called the Ordar Least Sqared OLS regresso. Least Sqared Resdals Approach

22 As the bvarate regresso, we wat to fd a set of estmators, $ j, that mmze the sm of sqared resdals: m b b b b F.O.C. w.r.t. b w.r.t. b 3- w.r.t. b Whe there s ol oe depedet varable ths, we cold solve the frst order codtos easl. Bt whe there are depedet varables, solvg the frst order codtos for varables wold be complcated. Matr maes t easer.

23 Lectre 4: Mltvarate Regresso Model Matr Form I ths lectre, we rewrte the mltple regresso model matr form. A geeral mltple-regresso model ca be wrtte as $ $ $. $. for,,. I matr form, we ca rewrte ths model as X % We wat to estmate %. Least Sqared Resdal Approach Matr Form Please see Lectre Note A for detals The strateg the least sqared resdal approach s the same as the bvarate lear regresso model. Frst, we calclate the sm of sqared resdals ad, secod, fd a set of estmators that mmze the sm. Ths, the mmzg problem of the sm of the sqared resdals matr form s m t X bt - Xb Notce here that t s a scalar or mber sch as, becase ts a matr ad s a matr ad the prodct of these two matrces s a matr ths a scalar. The, we ca tae the frst dervatve of ths object fcto matr form. Frst, we smplf the matrces: t t btxt - Xb t btxt - txb btxtxb t btxt - btxt btxtxb A-: btxt txb 3

24 4 t btxt btxtxb The, b tag the frst dervatve wth respect to b, we have: t/ b Xt XtX b So the frst order codto F.O.C. s Xt XtX Β 4A- ad 4A-3 The, XtX Β Xt 4- Notce that I have replaced b wth Β becase Β satsf the F.O.C, b defto. Mltpl the verse matr of XtX o the both sdes, ad we have: Β XtX - Xt 4- Ths s the least sqared estmator for the mltvarate regresso lear model matr form. We call t as the Ordar Least Sqared OLS estmator. Note that the frst order codtos 4- ca be wrtte matr form as Xt - X Β Ths s the same as the frst order codtos, codtos, we derved the prevos lectre ote: b

25 5 b b ample 4- : A bvarate lear regresso matr form As a eample, let s cosder a bvarate model matr form. A bvarate model s $ $. for,,. I matr form, ths s X % From 4-, we have Β XtX - Xt 4-3 Let s cosder each compoet 4-3. XtX Ths s a sqare matr. Ths, the verse matr of XtX s, XtX -

26 6 The secod term s Xt Therefore we have all the compoets of the rght-had sde of 3-4. Ths the OLS estmators are: Β XtX - Xt

27 d of ample 4- Ubasedess of OLS Wooldrdge Apped I ths sb-secto, we show the basedess of OLS der the followg assmptos. Assmptos: Lear parameters: X % Zero codtoal mea: X 3 No perfect colleart: X has ra. From 4-3, we ow the OLS estmators are Β XtX - Xt We ca replace wth the poplato model, Β XtX - Xt X % XtX - XtX % XtX - Xt % XtX - Xt 4-4 Tae the epectato o the both sdes of the eqato: Β % XtX - Xt X From MLR 3, we have X. Ths, Β % The OLS estmators are based. The Varace of OLS stmators Net, we 7

28 8 Assmpto: 4 Homosedastct: Var XF ad Cov, j, ths Var X F I Becase of ths assmpto, we have [ ] I σ σ σ σ Therefore, VarΒ Var [% XtX - Xt] Var [XtX - Xt ] [XtX - Xt t X XtX - ] XtX - Xt [ t] X XtX - XtX - Xt F I X XtX - 4: Homosedastct F XtX - XtX XtX - VarΒ F XtX GAUSS-MARKOV Theorem: Uder assmptos 4, Β s the Best Lear Ubased stmator BLU.

29 ample 4-: Step b Step Regresso stmato b STATA I ths sb-secto, I wold le to show o how the matr calclatos we have stded are sed ecoometrcs pacages. Of corse, practces o do ot create matr programs: ecoometrcs pacages alread have blt- programs. The followg are matr calclatos wth STATA sg data called, NFIcomeUgada.dta. Here we wat to estmate the followg model: l_fcome $ $ female $ edc $ edcsq. All the varables are defed ample 3-. Descrptve formato abot the varables are here:. s; Varable Obs Mea Std. Dev. M Ma female ed edsq l_fcome Frst, we eed to defe matrces. I STATA, o ca load specfc varables data to matrces. The commad s called mmat. Here we create a matr, called, cotag the depedet varable, l_fcome, ad a set of depedet varables, called, cotag female, edc, edcsq.. mmat l_fcome, matr. mmat female edc edcsq, matr The, we create some compoets: XtX, XtX -, ad Xt:. matr '*;. mat lst ; smmetrc [4,4] female ed edsq cost female 44 ed edsq cost matr smv;. mat lst ; smmetrc [4,4] female ed edsq cost female.944 ed edsq e-6 cost

30 Here s Xt:. matr '*;. mat lst ; [4,] l_fcome female ed edsq cost Therefore the OLS estmators are XtX - Xt:. ** stmatg b hat;. matr bhat*;. mat lst bhat; bhat[4,] l_fcome female ed.4488 edsq cost ** stmatg stadard error for b hat;. matr e-*bhat;. matr sse'*e/648--3;. matr vecdag;. mat lst ss; smmetrc ss[,] l_fcome l_fcome mat lst ; [,4] female ed edsq cost r e Let s verf what we have fod.. reg l_fcome female ed edsq; Sorce SS df MS Nmber of obs F 3, Model Prob > F. Resdal R-sqared Adj R-sqared.93 Total Root MS l_fcome Coef. Std. rr. t P> t [95% Cof. Iterval] female ed edsq _cos ed of do-fle 3

31 Lectre 5: OLS Iferece der Fte-Sample Propertes I ths lectre, we std OLS ferece der fte-sample propertes where estmators are eact ad dstrbtos are precsel defed eve wth small sample szes. We start wth the basedess of OLS der the followg assmptos. Ubasedess of OLS Wooldrdge Apped Assmptos: Lear parameters: X % Zero codtoal mea: X 3 No perfect colleart: X has ra. Aga, der fte-sample propertes, these assmptos hold for a sample szes. From 4-3, we ow the OLS estmators are Β XtX - Xt We ca replace wth the poplato model, Β XtX - Xt X % XtX - XtX % XtX - Xt % XtX - Xt 5- Tae the epectato o the both sdes of the eqato: Β % XtX - Xt X From, we have X. Ths, Β % The OLS estmators are based. The Varace of OLS stmators 3

32 3 Net, we add oe more assmpto abot the varace of the error term ad cosder the varace the OLS estmators. Assmpto: 4 Homosedastct: Var XF ad Cov, j, ths Var X F I Becase of ths assmpto, we have [ ] I σ σ σ σ Therefore, VarΒ ] [ X ΒΒ Β Β ] [ X X X X X X X X X X I X X X σ 4: Homosedastct F XtX - XtX XtX - VarΒ F XtX - 5- GAUSS-MARKOV Theorem: Uder assmptos 4, Β s the Best Lear Ubased stmator BLU. See Wooldrdge, pp 79.

33 So far, we have obtaed OLS estmatos for Β ad Var Β. Bt we eed to ow the shape of the fll samplg dstrbto of Β order to codct statstcal tests, sch as t-tests or F-tests. The dstrbto of OLS estmator Β depeds o the derlg dstrbto of the errors. Ths, we mae the followg assmpto aga, der ftesample propertes. Assmpto 5 Normalt of rrors: ~ N, F I Note that N, F I dcates a mltvarate ormal dstrbto of wth mea ad the varace-covarace matr F I. Remember aga that ol assmptos -3 are ecessar to have based OLS estmators. I addto, assmpto 4 s eeded to show that the OLS estmators are the best lear based estmator BLU, the Gass-Marov theorem. We eed assmpto 5 to codct statstcal tests. Assmptos -5 are collectvel called as the Classcal Lear Model CLM assmptos. The model wth all assmptos -5 s called the classcal lear model. The OLS estmators wth the CLM assmptos are the mmm varace based estmators. Ths dcates that the OLS estmators are the most effcet estmators amog all models ot ol amog lear models. Normalt of Β Uder the CLM assmptos -5, Β codtoal o X s dstrbted as mltvarate ormal wth mea Β ad varace-covarace matr F XtX -. Β ~ N[ Β, σ X X Ths s a mltvarate ormal dstrbto, whch meas each elemet of Β s ormall dstrbted: ~ N[, σ X X ] XtX - s the -th dagoal elemet of XtX -. Let s deote the -th dagoal elemet of XtX - as S. Ths, ] 33

34 σ X X S. σ. S σ S σ S... S.... σ S.. Ths s the varace-covarace matr of the OLS estmator. O the dagoal, there are varaces of the OLS estmators. Off-the dagoal, there are covarace betwee estmators. The, becase each OLS estmator s assmed to be ormall dstrbted, we ca obta a stadard ormal dstrbto of a OSL estmator b sbtractg the mea ad dvdg t b the stadard devato: z. σ S However, F s ow. Ths we se a estmator of F stead. A based estmator of F s s s the sm of sqared errors. Remember s a prodct of a matr ad a matr, whch gves a sgle mber. Therefore b replacg F wth s, we have t s S / σ S. [ s / σ ]/ Ths rato has a t-dstrbto wth -- degree of freedom. It has a t-dstrbto becase t s a rato of a varable that has a stadard ormal dstrbto the omator the parethess ad a varable that has a ch-sqared dstrbto dvded b --. The stadard error of, se, s s S. Testg a Hpothess o I most cases we wat to test the ll hpothess H : wth the t-statstcs t-test: - / se ~ t --. Whe we test the ll hpothess, the t-statstcs s jst a rato of a OLS estmator over ts stadard error. 34

35 We ma test the ll hpothess agast the oe-sded alteratve or two-sded alteratves. Oe-sded Alteratve H : > We reject the ll hpothess H whe the t-statstcs s greater tha the crtcal vale for a reasoable crtcal vale, sch as 5 percet: t > c Whe the oe-sded alteratve s the opposte: H : < We reject the ll hpothess H whe the t-statstcs s smaller tha the crtcal vale for a reasoable crtcal vale, sch as 5 percet: t < -c. Two-sded Alteratves H : We reject the ll hpothess H whe the absolte vale of t-statstcs s greater tha the crtcal vale for a reasoable crtcal vale, sch as 5 percet: t > c For a two-taled test, c s chose for the half of the crtcal level. For the 5 percet level, the crtcal vale for.5 percet wth -- degree of freedom shold be chose. Testg agast a o-zero costat Yo ca test a hpothess test le H : agast H : b sg a t-test t-test: - / se ~ t --. Testg a Jot Hpotheses Test o ' s Sppose we have a mltvarate model: $ $ $ $ 3 $ 4 $ 5. Sometmes we wat to test to see whether a grop of varables jotl has effects of. Sppose we wat to ow whether depedet varables 3, 4, ad 5 jotl have effects o. Ths the ll hpothess s 35

36 H : The ll hpothess, therefore, poses a qesto whether these three varables ca be eclded from the model. Ths the hpothess s also called eclso restrctos. A model wth the eclso s called the restrcted model: $ $ $. O the other had, the model wthot the eclso s called the restrcted model: $ $ $ $ 3 $ 4 $ 5. We ca geeralze ths problem b chagg the mber of restrctos from three to q. The jot sgfcace of q varables s measred b how mch the sm of sqared resdals SSR creases whe the q-varables are eclded. Let deote the SSR of the restrcted ad restrcted models as SSR r ad SSR r, respectvel. Of corse the SSR r s smaller tha the SSR r becase the restrcted model has more varables tha the restrcted model. Bt the qesto s how mch compared wth the orgal sze of SSR. The F-statstcs s defed as F-test: F SSR SSR r r SSRr / q. / The merator measres the chage SSR, movg from restrcted model to restrcted model, per oe restrcto. Le percetage, the chage SSR s dvded b the sze of SSR at the startg pot, the SSR r stadardzed b the degree of freedom. The above defto s based o how mch the models caot epla, SSR s. Istead, we ca measre the cotrbto of a set of varables b asg how mch of the eplaator power s lost b ecldg a set of q varables. The F-statstcs ca be re-defed as F-test: F R R r r Rr / q. / Aga, becase the restrcted model has more varables, t has a larger R-sqared tha the restrcted model. Ths the merator s alwas postve. The merator measres the loss the eplaator power, per oe restrcto, whe movg from the restrcted model to the restrcted model. Ths chage s dvded b the eplaed varato b the restrcted model, stadardzed b the degree of freedom. If the decrease eplaator power s relatvel large, the the set of q-varables s cosdered a jotl sgfcat the model. Ths these q-varables shold sta the model. 36

37 Lectre 6: OLS Asmptotc Propertes Revewg: Fte-Sample Propertes of Least Sqares I the prevos secto, we have stded fte-sample propertes of OLS estmators. Uder the fte-sample propertes, the OLS estmators are based ad the error terms are ormall dstrbted eve whe sample szes are small. Here s a qc revew: Assmptos: Lear parameters: X % Zero codtoal mea: X 3 No perfect colleart: X has ra. Uder these assmptos, the OLS estmators are based becase: Ths, Β XtX - Xt becase of 3 XtX - ests Β XtX - Xt X % from % XtX - Xt Β % XtX - Xt X from Β % based To fd the varace of the OLS estmators, we added oe more assmpto. 4 Homosedastct: Var XF ad Cov, j, ths Var X F I Therefore, VarΒ [ Β Β ΒΒ X ] X X X [ X ] X X X X X X σ I X X X from 4 F XtX - Fall, for s to be able to codct statstcal tests, we eed to ow the dstrbto of the OLS estmators, the we added a assmpto abot the dstrbto of the error term: 5 Normalt of rrors: ~ N, F I Uder the assmptos -5, each of followg estmators has a eact dstrbto. Β ~ N[ Β, σ X X ] Normal dstrbto 37

38 Ths holds for t t dstrbto s S F SSR SSR r r SSRr / q / F dstrbto However, these are strog assmptos ad ca be relaed easl b sg asmptotc theor. Therefore, ths lectre, we std the asmptotc propertes or large sample propertes of the OLS estmators. Uder the asmptotc propertes, the propertes of the OLS estmators deped o the sample sze. I short, we ca show that the OLS estmators cold be based wth a small sample sze bt cosstet wth a sffcetl large sample sze. Cosstec stead of basedess Frst, we eed to defe cosstec. Sppose W s a estmator of θ o a sample of Y, Y,, Y of sze. The, W s a cosstet estmator of θ f for ever e >, P W - θ > e as. Ths sas that the probablt that the absolte dfferece betwee W ad θ beg larger tha e goes to zero as gets bgger. Whch meas that ths probablt cold be o-zero whle s ot large. For stace, let s sa that we are terested fdg the average come of Amerca people ad tae small samples radoml. Let s assme that the small samples clde Bll Gates b chace. The sample mea come s wa over the poplato average. Ths, whe sample szes are small, the probablt that the dfferece betwee the sample ad poplato averages s larger tha e, whch s a postve mber, ca be o-zero. However, the dfferece betwee the sample ad poplato averages wold be smaller as the sample sze gets bgger as log as the samplg s properl doe. As a reslt, as the sample sze goes to ft, the probablt that the dfferece betwee the two averages s bgger tha e o matter how small e s becomes zero. I other words, we sa that θ s the probablt lmt of W : plm W θ. Uder the fte-sample propertes, we sa that W s based, W θ. Uder the asmptotc propertes, we sa that W s cosstet becase W coverges to θ as gets larger. The OLS estmators From prevos lectres, we ow the OLS estmators ca be wrtte as Β XtX - Xt 38

39 39 Β B XtX - Xt I the matr form, we ca eame the probablt lmt of OLS Β Β X p X X p lm lm Here, we assme that Q X X p lm ad assme that Q - ests. From, we have lm X p. Ths, lm Β Β Q p Ths, we have show that the OLS estmator s cosstet. Nest, we focs o the asmmetrc ferece of the OLS estmator. The asmptotc dstrbto of the OLS estmator s derved b wrtg Β Β X X X ΒΒ X X X The probablt lmt of Β Β goes to zero becase the cosstec of Β. The varace of Β Β s ΒΒ Β Β X X X X X X X X X X X X From 4, the probablt lmt of goes to F I. Ad also we assmed plm of X X s Q. Ths, Q X X Q σ QQ Q σ

40 4 Q σ Therefore, the asmptotc dstrbto of the OLS estmator s ] [, ~ Β Β Q N a σ. From ths, we ca treat the OLS estmator, Β, as f t s appromatel ormall dstrbted wth mea Β ad varace-covarace matr Q / σ. ample 6-: Cosstec of OLS stmators Bvarate Lear stmato A bvarate model: $ $ ad To eame the basedess of the OLS estmator, we tae the epectato Uder the assmpto of zero codtoal mea SLR 3:, we ca separate the epectato of ad :. Ths we eed the SLR 3 to show the OLS estmator s based. Now, sppose we have a volato of SLR 3 ad ca ot show the basedess of the OLS estmator. We cosder a cosstec of the OLS estmator. lm lm lm p p p lm lm lm p p p

41 cov, p lm var p lm f cov, Ths, as log as the covarace betwee ad s zero, the OLS estmator of a bvarate model cosstet. d of ample 6-4

42 Lectre 7: OLS Frther Isses I ths lectre, we wll dscss some practcal sses related to OLS estmatos, sch as fctoal forms ad terpretatos of several tpes of varables. For detals, please read Wooldrdge chapter 6 ad 7. Measremet rror the Depedet Varable Let * deote the varable that we wold le to epla: * $ $ $ $. However, we ca ol observe whch s a measred varable of * wth measremet errors. e - * B replacg * wth ad e, we get $ $ $ $ e. Ths, f e satsf the OLS assmptos sch as e X, the OLS estmators are based or cosstet. Bt the varace of the dstrbace s larger b Vare wth the measremet error e tha wthot. Note, however, that the measremet error the depedet varable cold be correlated wth depedet varables [Cov e ]. I that case, the estmators wll be based. Measremet rror a Idepedet Varable Let * deote a depedet varable, whch cold be observed wth the measremet error, e, where e. e - * 8- * $ $ $ $ - e. Assmpto : Cov, e Uder ths assmpto, the error term - $ e has zero mea ad correlated wth the depedet varables. Ths the estmators are based cosstet. The error varace, however, s bgger b $ e. Assmpto : Cov *, e 4

43 43 Ths assmpto s called the Classc rrors--varables CV assmpto. Becase e - *, ad e mst be correlated der the assmpto : Cov, e e * e e σ e Ths, we have the omtted varables problem, whch gves cosstet estmators of all depedet varables. The Atteato Bas For a bvarate regresso model t s eas to show the eact bas cased b the CV. Now, s measred wth the measremet errors, stead of. I a bvarate regresso model, the least sqare estmator ca be wrtte as e Ths, the probablt lmt of s, cov lm σ σ σ σ σ σ σ < e e e e e Var Var e p Ths, the plm s alwas closer to zero or based toward zero tha. Ths s called the famos atteato bas OLS de to classcal errors--varables. For a mltvarate regresso model, the probablt lmt of s

44 σ r p lm < σ σ r e where r α α s the poplato error the eqato α r Aga the mplcato s the same as before. The estmated coeffcet of the varable wth measremet errors s based toward zero or less lel to reject the ll hpothess. Data Scalg Ma varables we se have ts, sch as moetar ts ad qatt ts. The bottom le s that data scalg does ot chage mch. Scalg p/dow a depedet varable: α. α α α α If o scale p/dow the depedet varable b ", the OLS estmators ad stadard errors wll be also scaled p/dow b ", bt ot t-statstcs. Ths the sgfcace level remas the same as before scalg. Scalg p/dow a depedet varable: / α α. If o scale p/dow oe depedet varable, the estmated coeffcet of the depedet varable wll be scale dow/p b the same scale. Aga the t-statstcs or sgfcace level does ot chage. Logarthmc Forms For a small chage, a chage log tmes, log, s appromatel close to a percetage chage, /. Therefore, we ca terpret the followg cases sg percetage chages: log-log: / log log / Oe percet chage chages b percet. s a elastct. 44

45 / log-level: log Oe t chage chages b percet. 3 level-log: log / Oe percet chage chages b. 4 level-level: Oe t chage chages b. Whe a chage log s ot small, the appromato betwee a chage log ad a chage ma ot be accrate. For stace, the log-level model gves s log log. 8- If the chage log s small, the there s o problem of terpretg ths as oe t of chages b percet, becase log log /. Bt whe a chage log s ot small, the appromato ma ot be appromate. Ths we eed to trasform 8- as: log log log / / ep / ep / ep % [ep ] Ths oe t chages b [ep ] percetage. Qadratc Form / Iterpretato: > ad < a crease creases wth a dmshg rate < ad > a crease decreases wth a dmshg rate Trg Pot: At the trg pot, the frst dervatve of wth respect to s zero: 45

46 46 / Ths the vale of at the trg ths pot s / * Iteracto Terms 3 The mpact of o s 3 / A Dmm Varable or where A grop of observatos wth s called a base, bechmar, or referece grop. The estmated coeffcet of measres the dfferece averages ŷ amog observatos for ad the base grop, holdg other varables costat.,,,,,, The grop B, wth, has a lower or hgher tha the base grop, wth. Iteracto Terms wth Dmmes 3 or where Whe s a cotos varable, 3 measres a dfferece the effect of o betwee a grop wth ad a grop wth, or a dfferece slopes of : / / 3 whe whe

47 Mltcolleart From the prevos lectre, we ow that the varace of OLS estmators s Var Β σ X X. The varace of a estmator,, s σ S. Ths ca be wrtte as Var σ R See Wooldrdge pp94 or Greee pp57 R s the R-sqared the regresso of agast all other varables. I other words, R s the proporto of the total varato that ca be eplaed b the other depedet varables. If two varables are perfectl correlated, the R wll be oe for both of the perfectl correlated varables, ad the varace of those two varables wll ot be measred. Obvosl, o eed to drop oe of the two perfectl correlated varables. I STATA, STATA drops oe of perfectl correlated varables atomatcall. So f o see a dropped varable STATA otpts, o shold sspect that o have clded perfectl correlated varables wthot realzg. ve f two or more varables are ot perfectl correlated, f the are hghl correlated hgh R, the varace of estmators wll be large. Ths problem s called mltcolleart. The problem of mltcolleart ca be avoded to some etet b collectg more data ad crease varace depedet varables. For stace, let s th abot a sample of for dvdals. All of for cold be male ad marred. I ths case, a geder varable ad a martal stats varable wll be perfectl correlated. Sppose three of them have collage edcato. The a edcato varable wll be hghl correlated wth the geder ad martal-stats varables. Of corse, correlatos betwee these varables wll dsappear to some etet as the sze of sample ad the varato varables crease. Whe dobt, codct a F-test o varables that o sspect casg mltcolleart. A tpcal smptom of mltcolleart s A hgh jot sgfcace ad low dvdal sgfcace a hgh F-statstcs bt low t-statstcs A smple solto s to eep them the model. If or ma focs s o a varable whch s ot the part of mltcolleart, the t s ot a seros problem to have mltcolleart or model. Yo cold drop oe of hghl correlated varables, bt b dog so ma create a omtted varable problem. Remember that a omtted varable problem ca case bases o all of estmators. Ths cold be a more seros problem tha mltcolleart. 47

Lecture 2: The Simple Regression Model

Lecture 2: The Simple Regression Model Lectre Notes o Advaced coometrcs Lectre : The Smple Regresso Model Takash Yamao Fall Semester 5 I ths lectre we revew the smple bvarate lear regresso model. We focs o statstcal assmptos to obta based estmators.

More information

DISTURBANCE TERMS. is a scalar and x i

DISTURBANCE TERMS. is a scalar and x i DISTURBANCE TERMS I a feld of research desg, we ofte have the qesto abot whether there s a relatoshp betwee a observed varable (sa, ) ad the other observed varables (sa, x ). To aswer the qesto, we ma

More information

Fundamentals of Regression Analysis

Fundamentals of Regression Analysis Fdametals of Regresso Aalyss Regresso aalyss s cocered wth the stdy of the depedece of oe varable, the depedet varable, o oe or more other varables, the explaatory varables, wth a vew of estmatg ad/or

More information

( ) = ( ) ( ) Chapter 13 Asymptotic Theory and Stochastic Regressors. Stochastic regressors model

( ) = ( ) ( ) Chapter 13 Asymptotic Theory and Stochastic Regressors. Stochastic regressors model Chapter 3 Asmptotc Theor ad Stochastc Regressors The ature of eplaator varable s assumed to be o-stochastc or fed repeated samples a regresso aalss Such a assumpto s approprate for those epermets whch

More information

Lecture Notes 2. The ability to manipulate matrices is critical in economics.

Lecture Notes 2. The ability to manipulate matrices is critical in economics. Lecture Notes. Revew of Matrces he ablt to mapulate matrces s crtcal ecoomcs.. Matr a rectagular arra of umbers, parameters, or varables placed rows ad colums. Matrces are assocated wth lear equatos. lemets

More information

Suggested Answers, Problem Set 4 ECON The R 2 for the unrestricted model is by definition u u u u

Suggested Answers, Problem Set 4 ECON The R 2 for the unrestricted model is by definition u u u u Da Hgerma Fall 9 Sggested Aswers, Problem Set 4 ECON 333 The F-test s defed as ( SSEr The R for the restrcted model s by defto SSE / ( k ) R ( SSE / SST ) so therefore, SSE SST ( R ) ad lkewse SSEr SST

More information

Lecture 7. Confidence Intervals and Hypothesis Tests in the Simple CLR Model

Lecture 7. Confidence Intervals and Hypothesis Tests in the Simple CLR Model Lecture 7. Cofdece Itervals ad Hypothess Tests the Smple CLR Model I lecture 6 we troduced the Classcal Lear Regresso (CLR) model that s the radom expermet of whch the data Y,,, K, are the outcomes. The

More information

Recall MLR 5 Homskedasticity error u has the same variance given any values of the explanatory variables Var(u x1,...,xk) = 2 or E(UU ) = 2 I

Recall MLR 5 Homskedasticity error u has the same variance given any values of the explanatory variables Var(u x1,...,xk) = 2 or E(UU ) = 2 I Chapter 8 Heterosedastcty Recall MLR 5 Homsedastcty error u has the same varace gve ay values of the eplaatory varables Varu,..., = or EUU = I Suppose other GM assumptos hold but have heterosedastcty.

More information

Lecture Notes Forecasting the process of estimating or predicting unknown situations

Lecture Notes Forecasting the process of estimating or predicting unknown situations Lecture Notes. Ecoomc Forecastg. Forecastg the process of estmatg or predctg ukow stuatos Eample usuall ecoomsts predct future ecoomc varables Forecastg apples to a varet of data () tme seres data predctg

More information

Objectives of Multiple Regression

Objectives of Multiple Regression Obectves of Multple Regresso Establsh the lear equato that best predcts values of a depedet varable Y usg more tha oe eplaator varable from a large set of potetal predctors {,,... k }. Fd that subset of

More information

Summary of the lecture in Biostatistics

Summary of the lecture in Biostatistics Summary of the lecture Bostatstcs Probablty Desty Fucto For a cotuos radom varable, a probablty desty fucto s a fucto such that: 0 dx a b) b a dx A probablty desty fucto provdes a smple descrpto of the

More information

Statistics. Correlational. Dr. Ayman Eldeib. Simple Linear Regression and Correlation. SBE 304: Linear Regression & Correlation 1/3/2018

Statistics. Correlational. Dr. Ayman Eldeib. Simple Linear Regression and Correlation. SBE 304: Linear Regression & Correlation 1/3/2018 /3/08 Sstems & Bomedcal Egeerg Departmet SBE 304: Bo-Statstcs Smple Lear Regresso ad Correlato Dr. Ama Eldeb Fall 07 Descrptve Orgasg, summarsg & descrbg data Statstcs Correlatoal Relatoshps Iferetal Geeralsg

More information

Econometric Methods. Review of Estimation

Econometric Methods. Review of Estimation Ecoometrc Methods Revew of Estmato Estmatg the populato mea Radom samplg Pot ad terval estmators Lear estmators Ubased estmators Lear Ubased Estmators (LUEs) Effcecy (mmum varace) ad Best Lear Ubased Estmators

More information

Simple Linear Regression

Simple Linear Regression Statstcal Methods I (EST 75) Page 139 Smple Lear Regresso Smple regresso applcatos are used to ft a model descrbg a lear relatoshp betwee two varables. The aspects of least squares regresso ad correlato

More information

Lecture 1: Introduction to Regression

Lecture 1: Introduction to Regression Lecture : Itroducto to Regresso A Eample: Eplag State Homcde Rates What kds of varables mght we use to epla/predct state homcde rates? Let s cosder just oe predctor for ow: povert Igore omtted varables,

More information

Chapter Two. An Introduction to Regression ( )

Chapter Two. An Introduction to Regression ( ) ubject: A Itroducto to Regresso Frst tage Chapter Two A Itroducto to Regresso (018-019) 1 pg. ubject: A Itroducto to Regresso Frst tage A Itroducto to Regresso Regresso aalss s a statstcal tool for the

More information

An Expansion of the Derivation of the Spline Smoothing Theory Alan Kaylor Cline

An Expansion of the Derivation of the Spline Smoothing Theory Alan Kaylor Cline A Epaso of the Derato of the Sple Smoothg heory Ala Kaylor Cle he classc paper "Smoothg by Sple Fctos", Nmersche Mathematk 0, 77-83 967) by Chrsta Resch showed that atral cbc sples were the soltos to a

More information

Lecture Notes Types of economic variables

Lecture Notes Types of economic variables Lecture Notes 3 1. Types of ecoomc varables () Cotuous varable takes o a cotuum the sample space, such as all pots o a le or all real umbers Example: GDP, Polluto cocetrato, etc. () Dscrete varables fte

More information

UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS

UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS Exam: ECON430 Statstcs Date of exam: Frday, December 8, 07 Grades are gve: Jauary 4, 08 Tme for exam: 0900 am 00 oo The problem set covers 5 pages Resources allowed:

More information

Midterm Exam 1, section 1 (Solution) Thursday, February hour, 15 minutes

Midterm Exam 1, section 1 (Solution) Thursday, February hour, 15 minutes coometrcs, CON Sa Fracsco State Uversty Mchael Bar Sprg 5 Mdterm am, secto Soluto Thursday, February 6 hour, 5 mutes Name: Istructos. Ths s closed book, closed otes eam.. No calculators of ay kd are allowed..

More information

ECON 482 / WH Hong The Simple Regression Model 1. Definition of the Simple Regression Model

ECON 482 / WH Hong The Simple Regression Model 1. Definition of the Simple Regression Model ECON 48 / WH Hog The Smple Regresso Model. Defto of the Smple Regresso Model Smple Regresso Model Expla varable y terms of varable x y = β + β x+ u y : depedet varable, explaed varable, respose varable,

More information

Lecture 1: Introduction to Regression

Lecture 1: Introduction to Regression Lecture : Itroducto to Regresso A Eample: Eplag State Homcde Rates What kds of varables mght we use to epla/predct state homcde rates? Let s cosder just oe predctor for ow: povert Igore omtted varables,

More information

Lecture 3. Sampling, sampling distributions, and parameter estimation

Lecture 3. Sampling, sampling distributions, and parameter estimation Lecture 3 Samplg, samplg dstrbutos, ad parameter estmato Samplg Defto Populato s defed as the collecto of all the possble observatos of terest. The collecto of observatos we take from the populato s called

More information

4. Standard Regression Model and Spatial Dependence Tests

4. Standard Regression Model and Spatial Dependence Tests 4. Stadard Regresso Model ad Spatal Depedece Tests Stadard regresso aalss fals the presece of spatal effects. I case of spatal depedeces ad/or spatal heterogeet a stadard regresso model wll be msspecfed.

More information

ε. Therefore, the estimate

ε. Therefore, the estimate Suggested Aswers, Problem Set 3 ECON 333 Da Hugerma. Ths s ot a very good dea. We kow from the secod FOC problem b) that ( ) SSE / = y x x = ( ) Whch ca be reduced to read y x x = ε x = ( ) The OLS model

More information

i 2 σ ) i = 1,2,...,n , and = 3.01 = 4.01

i 2 σ ) i = 1,2,...,n , and = 3.01 = 4.01 ECO 745, Homework 6 Le Cabrera. Assume that the followg data come from the lear model: ε ε ~ N, σ,,..., -6. -.5 7. 6.9 -. -. -.9. -..6.4.. -.6 -.7.7 Fd the mamum lkelhood estmates of,, ad σ ε s.6. 4. ε

More information

Lecture 3 Probability review (cont d)

Lecture 3 Probability review (cont d) STATS 00: Itroducto to Statstcal Iferece Autum 06 Lecture 3 Probablty revew (cot d) 3. Jot dstrbutos If radom varables X,..., X k are depedet, the ther dstrbuto may be specfed by specfyg the dvdual dstrbuto

More information

B-spline curves. 1. Properties of the B-spline curve. control of the curve shape as opposed to global control by using a special set of blending

B-spline curves. 1. Properties of the B-spline curve. control of the curve shape as opposed to global control by using a special set of blending B-sple crve Copyrght@, YZU Optmal Desg Laboratory. All rghts reserved. Last pdated: Yeh-Lag Hs (--9). ote: Ths s the corse materal for ME Geometrc modelg ad compter graphcs, Ya Ze Uversty. art of ths materal

More information

Sampling Theory MODULE V LECTURE - 14 RATIO AND PRODUCT METHODS OF ESTIMATION

Sampling Theory MODULE V LECTURE - 14 RATIO AND PRODUCT METHODS OF ESTIMATION Samplg Theor MODULE V LECTUE - 4 ATIO AND PODUCT METHODS OF ESTIMATION D. SHALABH DEPATMENT OF MATHEMATICS AND STATISTICS INDIAN INSTITUTE OF TECHNOLOG KANPU A mportat objectve a statstcal estmato procedure

More information

Midterm Exam 1, section 2 (Solution) Thursday, February hour, 15 minutes

Midterm Exam 1, section 2 (Solution) Thursday, February hour, 15 minutes coometrcs, CON Sa Fracsco State Uverst Mchael Bar Sprg 5 Mdterm xam, secto Soluto Thursda, Februar 6 hour, 5 mutes Name: Istructos. Ths s closed book, closed otes exam.. No calculators of a kd are allowed..

More information

Comparison of Dual to Ratio-Cum-Product Estimators of Population Mean

Comparison of Dual to Ratio-Cum-Product Estimators of Population Mean Research Joural of Mathematcal ad Statstcal Sceces ISS 30 6047 Vol. 1(), 5-1, ovember (013) Res. J. Mathematcal ad Statstcal Sc. Comparso of Dual to Rato-Cum-Product Estmators of Populato Mea Abstract

More information

STATISTICAL PROPERTIES OF LEAST SQUARES ESTIMATORS. x, where. = y - ˆ " 1

STATISTICAL PROPERTIES OF LEAST SQUARES ESTIMATORS. x, where. = y - ˆ  1 STATISTICAL PROPERTIES OF LEAST SQUARES ESTIMATORS Recall Assumpto E(Y x) η 0 + η x (lear codtoal mea fucto) Data (x, y ), (x 2, y 2 ),, (x, y ) Least squares estmator ˆ E (Y x) ˆ " 0 + ˆ " x, where ˆ

More information

Chapter 11 Systematic Sampling

Chapter 11 Systematic Sampling Chapter stematc amplg The sstematc samplg techue s operatoall more coveet tha the smple radom samplg. It also esures at the same tme that each ut has eual probablt of cluso the sample. I ths method of

More information

Third handout: On the Gini Index

Third handout: On the Gini Index Thrd hadout: O the dex Corrado, a tala statstca, proposed (, 9, 96) to measure absolute equalt va the mea dfferece whch s defed as ( / ) where refers to the total umber of dvduals socet. Assume that. The

More information

Correlation and Simple Linear Regression

Correlation and Simple Linear Regression Correlato ad Smple Lear Regresso Berl Che Departmet of Computer Scece & Iformato Egeerg Natoal Tawa Normal Uverst Referece:. W. Navd. Statstcs for Egeerg ad Scetsts. Chapter 7 (7.-7.3) & Teachg Materal

More information

Correlation and Regression Analysis

Correlation and Regression Analysis Chapter V Correlato ad Regresso Aalss R. 5.. So far we have cosdered ol uvarate dstrbutos. Ma a tme, however, we come across problems whch volve two or more varables. Ths wll be the subject matter of the

More information

Econometrics. 3) Statistical properties of the OLS estimator

Econometrics. 3) Statistical properties of the OLS estimator 30C0000 Ecoometrcs 3) Statstcal propertes of the OLS estmator Tmo Kuosmae Professor, Ph.D. http://omepre.et/dex.php/tmokuosmae Today s topcs Whch assumptos are eeded for OLS to work? Statstcal propertes

More information

2.160 System Identification, Estimation, and Learning Lecture Notes No. 17 April 24, 2006

2.160 System Identification, Estimation, and Learning Lecture Notes No. 17 April 24, 2006 .6 System Idetfcato, Estmato, ad Learg Lectre Notes No. 7 Aprl 4, 6. Iformatve Expermets. Persstece of Exctato Iformatve data sets are closely related to Persstece of Exctato, a mportat cocept sed adaptve

More information

Mean is only appropriate for interval or ratio scales, not ordinal or nominal.

Mean is only appropriate for interval or ratio scales, not ordinal or nominal. Mea Same as ordary average Sum all the data values ad dvde by the sample sze. x = ( x + x +... + x Usg summato otato, we wrte ths as x = x = x = = ) x Mea s oly approprate for terval or rato scales, ot

More information

STA 108 Applied Linear Models: Regression Analysis Spring Solution for Homework #1

STA 108 Applied Linear Models: Regression Analysis Spring Solution for Homework #1 STA 08 Appled Lear Models: Regresso Aalyss Sprg 0 Soluto for Homework #. Let Y the dollar cost per year, X the umber of vsts per year. The the mathematcal relato betwee X ad Y s: Y 300 + X. Ths s a fuctoal

More information

Ordinary Least Squares Regression. Simple Regression. Algebra and Assumptions.

Ordinary Least Squares Regression. Simple Regression. Algebra and Assumptions. Ordary Least Squares egresso. Smple egresso. Algebra ad Assumptos. I ths part of the course we are gog to study a techque for aalysg the lear relatoshp betwee two varables Y ad X. We have pars of observatos

More information

= 2. Statistic - function that doesn't depend on any of the known parameters; examples:

= 2. Statistic - function that doesn't depend on any of the known parameters; examples: of Samplg Theory amples - uemploymet househol cosumpto survey Raom sample - set of rv's... ; 's have ot strbuto [ ] f f s vector of parameters e.g. Statstc - fucto that oes't epe o ay of the ow parameters;

More information

X X X E[ ] E X E X. is the ()m n where the ( i,)th. j element is the mean of the ( i,)th., then

X X X E[ ] E X E X. is the ()m n where the ( i,)th. j element is the mean of the ( i,)th., then Secto 5 Vectors of Radom Varables Whe workg wth several radom varables,,..., to arrage them vector form x, t s ofte coveet We ca the make use of matrx algebra to help us orgaze ad mapulate large umbers

More information

UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS

UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS Postpoed exam: ECON430 Statstcs Date of exam: Jauary 0, 0 Tme for exam: 09:00 a.m. :00 oo The problem set covers 5 pages Resources allowed: All wrtte ad prted

More information

ENGI 3423 Simple Linear Regression Page 12-01

ENGI 3423 Simple Linear Regression Page 12-01 ENGI 343 mple Lear Regresso Page - mple Lear Regresso ometmes a expermet s set up where the expermeter has cotrol over the values of oe or more varables X ad measures the resultg values of aother varable

More information

STA302/1001-Fall 2008 Midterm Test October 21, 2008

STA302/1001-Fall 2008 Midterm Test October 21, 2008 STA3/-Fall 8 Mdterm Test October, 8 Last Name: Frst Name: Studet Number: Erolled (Crcle oe) STA3 STA INSTRUCTIONS Tme allowed: hour 45 mutes Ads allowed: A o-programmable calculator A table of values from

More information

Can we take the Mysticism Out of the Pearson Coefficient of Linear Correlation?

Can we take the Mysticism Out of the Pearson Coefficient of Linear Correlation? Ca we tae the Mstcsm Out of the Pearso Coeffcet of Lear Correlato? Itroducto As the ttle of ths tutoral dcates, our purpose s to egeder a clear uderstadg of the Pearso coeffcet of lear correlato studets

More information

9.1 Introduction to the probit and logit models

9.1 Introduction to the probit and logit models EC3000 Ecoometrcs Lecture 9 Probt & Logt Aalss 9. Itroducto to the probt ad logt models 9. The logt model 9.3 The probt model Appedx 9. Itroducto to the probt ad logt models These models are used regressos

More information

Johns Hopkins University Department of Biostatistics Math Review for Introductory Courses

Johns Hopkins University Department of Biostatistics Math Review for Introductory Courses Johs Hopks Uverst Departmet of Bostatstcs Math Revew for Itroductor Courses Ratoale Bostatstcs courses wll rel o some fudametal mathematcal relatoshps, fuctos ad otato. The purpose of ths Math Revew s

More information

12.2 Estimating Model parameters Assumptions: ox and y are related according to the simple linear regression model

12.2 Estimating Model parameters Assumptions: ox and y are related according to the simple linear regression model 1. Estmatg Model parameters Assumptos: ox ad y are related accordg to the smple lear regresso model (The lear regresso model s the model that says that x ad y are related a lear fasho, but the observed

More information

Johns Hopkins University Department of Biostatistics Math Review for Introductory Courses

Johns Hopkins University Department of Biostatistics Math Review for Introductory Courses Johs Hopks Uverst Departmet of Bostatstcs Math Revew for Itroductor Courses Ratoale Bostatstcs courses wll rel o some fudametal mathematcal relatoshps, fuctos ad otato. The purpose of ths Math Revew s

More information

Simple Linear Regression - Scalar Form

Simple Linear Regression - Scalar Form Smple Lear Regresso - Scalar Form Q.. Model Y X,..., p..a. Derve the ormal equatos that mmze Q. p..b. Solve for the ordary least squares estmators, p..c. Derve E, V, E, V, COV, p..d. Derve the mea ad varace

More information

ESS Line Fitting

ESS Line Fitting ESS 5 014 17. Le Fttg A very commo problem data aalyss s lookg for relatoshpetwee dfferet parameters ad fttg les or surfaces to data. The smplest example s fttg a straght le ad we wll dscuss that here

More information

Generalized Linear Models. Statistical Models. Classical Linear Regression Why easy formulation if complicated formulation exists?

Generalized Linear Models. Statistical Models. Classical Linear Regression Why easy formulation if complicated formulation exists? Statstcal Models Geeralzed Lear Models Classcal lear regresso complcated formlato of smple model, strctral ad radom compoet of the model Lectre 5 Geeralzed Lear Models Geeralzed lear models geeral descrpto

More information

Chapter 8. Inferences about More Than Two Population Central Values

Chapter 8. Inferences about More Than Two Population Central Values Chapter 8. Ifereces about More Tha Two Populato Cetral Values Case tudy: Effect of Tmg of the Treatmet of Port-We tas wth Lasers ) To vestgate whether treatmet at a youg age would yeld better results tha

More information

THE ROYAL STATISTICAL SOCIETY 2016 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE MODULE 5

THE ROYAL STATISTICAL SOCIETY 2016 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE MODULE 5 THE ROYAL STATISTICAL SOCIETY 06 EAMINATIONS SOLUTIONS HIGHER CERTIFICATE MODULE 5 The Socety s provdg these solutos to assst cadtes preparg for the examatos 07. The solutos are teded as learg ads ad should

More information

Chapter 2 Supplemental Text Material

Chapter 2 Supplemental Text Material -. Models for the Data ad the t-test Chapter upplemetal Text Materal The model preseted the text, equato (-3) s more properl called a meas model. ce the mea s a locato parameter, ths tpe of model s also

More information

Multiple Regression. More than 2 variables! Grade on Final. Multiple Regression 11/21/2012. Exam 2 Grades. Exam 2 Re-grades

Multiple Regression. More than 2 variables! Grade on Final. Multiple Regression 11/21/2012. Exam 2 Grades. Exam 2 Re-grades STAT 101 Dr. Kar Lock Morga 11/20/12 Exam 2 Grades Multple Regresso SECTIONS 9.2, 10.1, 10.2 Multple explaatory varables (10.1) Parttog varablty R 2, ANOVA (9.2) Codtos resdual plot (10.2) Trasformatos

More information

ECONOMETRIC THEORY. MODULE VIII Lecture - 26 Heteroskedasticity

ECONOMETRIC THEORY. MODULE VIII Lecture - 26 Heteroskedasticity ECONOMETRIC THEORY MODULE VIII Lecture - 6 Heteroskedastcty Dr. Shalabh Departmet of Mathematcs ad Statstcs Ida Isttute of Techology Kapur . Breusch Paga test Ths test ca be appled whe the replcated data

More information

Chapter 5 Properties of a Random Sample

Chapter 5 Properties of a Random Sample Lecture 6 o BST 63: Statstcal Theory I Ku Zhag, /0/008 Revew for the prevous lecture Cocepts: t-dstrbuto, F-dstrbuto Theorems: Dstrbutos of sample mea ad sample varace, relatoshp betwee sample mea ad sample

More information

Homework Solution (#5)

Homework Solution (#5) Homework Soluto (# Chapter : #6,, 8(b, 3, 4, 44, 49, 3, 9 ad 7 Chapter. Smple Lear Regresso ad Correlato.6 (6 th edto 7, old edto Page 9 Rafall volume ( vs Ruoff volume ( : 9 8 7 6 4 3 : a. Yes, the scatter-plot

More information

Multiple Choice Test. Chapter Adequacy of Models for Regression

Multiple Choice Test. Chapter Adequacy of Models for Regression Multple Choce Test Chapter 06.0 Adequac of Models for Regresso. For a lear regresso model to be cosdered adequate, the percetage of scaled resduals that eed to be the rage [-,] s greater tha or equal to

More information

Simple Linear Regression and Correlation.

Simple Linear Regression and Correlation. Smple Lear Regresso ad Correlato. Correspods to Chapter 0 Tamhae ad Dulop Sldes prepared b Elzabeth Newto (MIT) wth some sldes b Jacquele Telford (Johs Hopks Uverst) Smple lear regresso aalss estmates

More information

Previous lecture. Lecture 8. Learning outcomes of this lecture. Today. Statistical test and Scales of measurement. Correlation

Previous lecture. Lecture 8. Learning outcomes of this lecture. Today. Statistical test and Scales of measurement. Correlation Lecture 8 Emprcal Research Methods I434 Quattatve Data aalss II Relatos Prevous lecture Idea behd hpothess testg Is the dfferece betwee two samples a reflecto of the dfferece of two dfferet populatos or

More information

STA 105-M BASIC STATISTICS (This is a multiple choice paper.)

STA 105-M BASIC STATISTICS (This is a multiple choice paper.) DCDM BUSINESS SCHOOL September Mock Eamatos STA 0-M BASIC STATISTICS (Ths s a multple choce paper.) Tme: hours 0 mutes INSTRUCTIONS TO CANDIDATES Do ot ope ths questo paper utl you have bee told to do

More information

{ }{ ( )} (, ) = ( ) ( ) ( ) Chapter 14 Exercises in Sampling Theory. Exercise 1 (Simple random sampling): Solution:

{ }{ ( )} (, ) = ( ) ( ) ( ) Chapter 14 Exercises in Sampling Theory. Exercise 1 (Simple random sampling): Solution: Chapter 4 Exercses Samplg Theory Exercse (Smple radom samplg: Let there be two correlated radom varables X ad A sample of sze s draw from a populato by smple radom samplg wthout replacemet The observed

More information

TESTS BASED ON MAXIMUM LIKELIHOOD

TESTS BASED ON MAXIMUM LIKELIHOOD ESE 5 Toy E. Smth. The Basc Example. TESTS BASED ON MAXIMUM LIKELIHOOD To llustrate the propertes of maxmum lkelhood estmates ad tests, we cosder the smplest possble case of estmatg the mea of the ormal

More information

Regresso What s a Model? 1. Ofte Descrbe Relatoshp betwee Varables 2. Types - Determstc Models (o radomess) - Probablstc Models (wth radomess) EPI 809/Sprg 2008 9 Determstc Models 1. Hypothesze

More information

hp calculators HP 30S Statistics Averages and Standard Deviations Average and Standard Deviation Practice Finding Averages and Standard Deviations

hp calculators HP 30S Statistics Averages and Standard Deviations Average and Standard Deviation Practice Finding Averages and Standard Deviations HP 30S Statstcs Averages ad Stadard Devatos Average ad Stadard Devato Practce Fdg Averages ad Stadard Devatos HP 30S Statstcs Averages ad Stadard Devatos Average ad stadard devato The HP 30S provdes several

More information

Lecture Note to Rice Chapter 8

Lecture Note to Rice Chapter 8 ECON 430 HG revsed Nov 06 Lecture Note to Rce Chapter 8 Radom matrces Let Y, =,,, m, =,,, be radom varables (r.v. s). The matrx Y Y Y Y Y Y Y Y Y Y = m m m s called a radom matrx ( wth a ot m-dmesoal dstrbuto,

More information

CLASS NOTES. for. PBAF 528: Quantitative Methods II SPRING Instructor: Jean Swanson. Daniel J. Evans School of Public Affairs

CLASS NOTES. for. PBAF 528: Quantitative Methods II SPRING Instructor: Jean Swanson. Daniel J. Evans School of Public Affairs CLASS NOTES for PBAF 58: Quattatve Methods II SPRING 005 Istructor: Jea Swaso Dael J. Evas School of Publc Affars Uversty of Washgto Ackowledgemet: The structor wshes to thak Rachel Klet, Assstat Professor,

More information

( ) ( ) ( ( )) ( ) ( ) ( ) ( ) ( ) = ( ) ( ) + ( ) ( ) = ( ( )) ( ) + ( ( )) ( ) Review. Second Derivatives for f : y R. Let A be an m n matrix.

( ) ( ) ( ( )) ( ) ( ) ( ) ( ) ( ) = ( ) ( ) + ( ) ( ) = ( ( )) ( ) + ( ( )) ( ) Review. Second Derivatives for f : y R. Let A be an m n matrix. Revew + v, + y = v, + v, + y, + y, Cato! v, + y, + v, + y geeral Let A be a atr Let f,g : Ω R ( ) ( ) R y R Ω R h( ) f ( ) g ( ) ( ) ( ) ( ( )) ( ) dh = f dg + g df A, y y A Ay = = r= c= =, : Ω R he Proof

More information

CHAPTER VI Statistical Analysis of Experimental Data

CHAPTER VI Statistical Analysis of Experimental Data Chapter VI Statstcal Aalyss of Expermetal Data CHAPTER VI Statstcal Aalyss of Expermetal Data Measuremets do ot lead to a uque value. Ths s a result of the multtude of errors (maly radom errors) that ca

More information

Chapter 13 Student Lecture Notes 13-1

Chapter 13 Student Lecture Notes 13-1 Chapter 3 Studet Lecture Notes 3- Basc Busess Statstcs (9 th Edto) Chapter 3 Smple Lear Regresso 4 Pretce-Hall, Ic. Chap 3- Chapter Topcs Types of Regresso Models Determg the Smple Lear Regresso Equato

More information

Example: Multiple linear regression. Least squares regression. Repetition: Simple linear regression. Tron Anders Moger

Example: Multiple linear regression. Least squares regression. Repetition: Simple linear regression. Tron Anders Moger Example: Multple lear regresso 5000,00 4000,00 Tro Aders Moger 0.0.007 brthweght 3000,00 000,00 000,00 0,00 50,00 00,00 50,00 00,00 50,00 weght pouds Repetto: Smple lear regresso We defe a model Y = β0

More information

Multiple Linear Regression Analysis

Multiple Linear Regression Analysis LINEA EGESSION ANALYSIS MODULE III Lecture - 4 Multple Lear egresso Aalyss Dr. Shalabh Departmet of Mathematcs ad Statstcs Ida Isttute of Techology Kapur Cofdece terval estmato The cofdece tervals multple

More information

Simulation Output Analysis

Simulation Output Analysis Smulato Output Aalyss Summary Examples Parameter Estmato Sample Mea ad Varace Pot ad Iterval Estmato ermatg ad o-ermatg Smulato Mea Square Errors Example: Sgle Server Queueg System x(t) S 4 S 4 S 3 S 5

More information

Mu Sequences/Series Solutions National Convention 2014

Mu Sequences/Series Solutions National Convention 2014 Mu Sequeces/Seres Solutos Natoal Coveto 04 C 6 E A 6C A 6 B B 7 A D 7 D C 7 A B 8 A B 8 A C 8 E 4 B 9 B 4 E 9 B 4 C 9 E C 0 A A 0 D B 0 C C Usg basc propertes of arthmetc sequeces, we fd a ad bm m We eed

More information

Study of Correlation using Bayes Approach under bivariate Distributions

Study of Correlation using Bayes Approach under bivariate Distributions Iteratoal Joural of Scece Egeerg ad Techolog Research IJSETR Volume Issue Februar 4 Stud of Correlato usg Baes Approach uder bvarate Dstrbutos N.S.Padharkar* ad. M.N.Deshpade** *Govt.Vdarbha Isttute of

More information

MEASURES OF DISPERSION

MEASURES OF DISPERSION MEASURES OF DISPERSION Measure of Cetral Tedecy: Measures of Cetral Tedecy ad Dsperso ) Mathematcal Average: a) Arthmetc mea (A.M.) b) Geometrc mea (G.M.) c) Harmoc mea (H.M.) ) Averages of Posto: a) Meda

More information

SPECIAL CONSIDERATIONS FOR VOLUMETRIC Z-TEST FOR PROPORTIONS

SPECIAL CONSIDERATIONS FOR VOLUMETRIC Z-TEST FOR PROPORTIONS SPECIAL CONSIDERAIONS FOR VOLUMERIC Z-ES FOR PROPORIONS Oe s stctve reacto to the questo of whether two percetages are sgfcatly dfferet from each other s to treat them as f they were proportos whch the

More information

STK3100 and STK4100 Autumn 2017

STK3100 and STK4100 Autumn 2017 SK3 ad SK4 Autum 7 Geeralzed lear models Part III Covers the followg materal from chaters 4 ad 5: Sectos 4..5, 4.3.5, 4.3.6, 4.4., 4.4., ad 4.4.3 Sectos 5.., 5.., ad 5.5. Ørulf Borga Deartmet of Mathematcs

More information

Chapter 13, Part A Analysis of Variance and Experimental Design. Introduction to Analysis of Variance. Introduction to Analysis of Variance

Chapter 13, Part A Analysis of Variance and Experimental Design. Introduction to Analysis of Variance. Introduction to Analysis of Variance Chapter, Part A Aalyss of Varace ad Epermetal Desg Itroducto to Aalyss of Varace Aalyss of Varace: Testg for the Equalty of Populato Meas Multple Comparso Procedures Itroducto to Aalyss of Varace Aalyss

More information

b. There appears to be a positive relationship between X and Y; that is, as X increases, so does Y.

b. There appears to be a positive relationship between X and Y; that is, as X increases, so does Y. .46. a. The frst varable (X) s the frst umber the par ad s plotted o the horzotal axs, whle the secod varable (Y) s the secod umber the par ad s plotted o the vertcal axs. The scatterplot s show the fgure

More information

THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE

THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE THE ROYAL STATISTICAL SOCIETY 00 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE PAPER I STATISTICAL THEORY The Socety provdes these solutos to assst caddates preparg for the examatos future years ad for the

More information

ln( weekly earn) age age

ln( weekly earn) age age Problem Set 4, ECON 3033 (Due at the start of class, Wedesday, February 4, 04) (Questos marked wth a * are old test questos) Bll Evas Sprg 08. Cosder a multvarate regresso model of the form y 0 x x. Wrte

More information

UNIT 7 RANK CORRELATION

UNIT 7 RANK CORRELATION UNIT 7 RANK CORRELATION Rak Correlato Structure 7. Itroucto Objectves 7. Cocept of Rak Correlato 7.3 Dervato of Rak Correlato Coeffcet Formula 7.4 Te or Repeate Raks 7.5 Cocurret Devato 7.6 Summar 7.7

More information

Handout #1. Title: Foundations of Econometrics. POPULATION vs. SAMPLE

Handout #1. Title: Foundations of Econometrics. POPULATION vs. SAMPLE Hadout #1 Ttle: Foudatos of Ecoometrcs Course: Eco 367 Fall/015 Istructor: Dr. I-Mg Chu POPULATION vs. SAMPLE From the Bureau of Labor web ste (http://www.bls.gov), we ca fd the uemploymet rate for each

More information

CHAPTER 3 POSTERIOR DISTRIBUTIONS

CHAPTER 3 POSTERIOR DISTRIBUTIONS CHAPTER 3 POSTERIOR DISTRIBUTIONS If scece caot measure the degree of probablt volved, so much the worse for scece. The practcal ma wll stck to hs apprecatve methods utl t does, or wll accept the results

More information

Analysis of Variance with Weibull Data

Analysis of Variance with Weibull Data Aalyss of Varace wth Webull Data Lahaa Watthaacheewaul Abstract I statstcal data aalyss by aalyss of varace, the usual basc assumptos are that the model s addtve ad the errors are radomly, depedetly, ad

More information

Linear Regression with One Regressor

Linear Regression with One Regressor Lear Regresso wth Oe Regressor AIM QA.7. Expla how regresso aalyss ecoometrcs measures the relatoshp betwee depedet ad depedet varables. A regresso aalyss has the goal of measurg how chages oe varable,

More information

STK4011 and STK9011 Autumn 2016

STK4011 and STK9011 Autumn 2016 STK4 ad STK9 Autum 6 Pot estmato Covers (most of the followg materal from chapter 7: Secto 7.: pages 3-3 Secto 7..: pages 3-33 Secto 7..: pages 35-3 Secto 7..3: pages 34-35 Secto 7.3.: pages 33-33 Secto

More information

2SLS Estimates ECON In this case, begin with the assumption that E[ i

2SLS Estimates ECON In this case, begin with the assumption that E[ i SLS Estmates ECON 3033 Bll Evas Fall 05 Two-Stage Least Squares (SLS Cosder a stadard lear bvarate regresso model y 0 x. I ths case, beg wth the assumto that E[ x] 0 whch meas that OLS estmates of wll

More information

Regression. Linear Regression. A Simple Data Display. A Batch of Data. The Mean is 220. A Value of 474. STAT Handout Module 15 1 st of June 2009

Regression. Linear Regression. A Simple Data Display. A Batch of Data. The Mean is 220. A Value of 474. STAT Handout Module 15 1 st of June 2009 STAT Hadout Module 5 st of Jue 9 Lear Regresso Regresso Joh D. Sork, M.D. Ph.D. Baltmore VA Medcal Ceter GRCC ad Uversty of Marylad School of Medce Claude D. Pepper Older Amercas Idepedece Ceter Reducg

More information

Simple Linear Regression

Simple Linear Regression Correlato ad Smple Lear Regresso Berl Che Departmet of Computer Scece & Iformato Egeerg Natoal Tawa Normal Uversty Referece:. W. Navd. Statstcs for Egeerg ad Scetsts. Chapter 7 (7.-7.3) & Teachg Materal

More information

Generative classification models

Generative classification models CS 75 Mache Learg Lecture Geeratve classfcato models Mlos Hauskrecht mlos@cs.ptt.edu 539 Seott Square Data: D { d, d,.., d} d, Classfcato represets a dscrete class value Goal: lear f : X Y Bar classfcato

More information

ENGI 4421 Joint Probability Distributions Page Joint Probability Distributions [Navidi sections 2.5 and 2.6; Devore sections

ENGI 4421 Joint Probability Distributions Page Joint Probability Distributions [Navidi sections 2.5 and 2.6; Devore sections ENGI 441 Jot Probablty Dstrbutos Page 7-01 Jot Probablty Dstrbutos [Navd sectos.5 ad.6; Devore sectos 5.1-5.] The jot probablty mass fucto of two dscrete radom quattes, s, P ad p x y x y The margal probablty

More information

Chapter 10 Two Stage Sampling (Subsampling)

Chapter 10 Two Stage Sampling (Subsampling) Chapter 0 To tage amplg (usamplg) I cluster samplg, all the elemets the selected clusters are surveyed oreover, the effcecy cluster samplg depeds o sze of the cluster As the sze creases, the effcecy decreases

More information

Chapter Business Statistics: A First Course Fifth Edition. Learning Objectives. Correlation vs. Regression. In this chapter, you learn:

Chapter Business Statistics: A First Course Fifth Edition. Learning Objectives. Correlation vs. Regression. In this chapter, you learn: Chapter 3 3- Busess Statstcs: A Frst Course Ffth Edto Chapter 2 Correlato ad Smple Lear Regresso Busess Statstcs: A Frst Course, 5e 29 Pretce-Hall, Ic. Chap 2- Learg Objectves I ths chapter, you lear:

More information

Handout #8. X\Y f(x) 0 1/16 1/ / /16 3/ / /16 3/16 0 3/ /16 1/16 1/8 g(y) 1/16 1/4 3/8 1/4 1/16 1

Handout #8. X\Y f(x) 0 1/16 1/ / /16 3/ / /16 3/16 0 3/ /16 1/16 1/8 g(y) 1/16 1/4 3/8 1/4 1/16 1 Hadout #8 Ttle: Foudatos of Ecoometrcs Course: Eco 367 Fall/05 Istructor: Dr. I-Mg Chu Lear Regresso Model So far we have focused mostly o the study of a sgle radom varable, ts correspodg theoretcal dstrbuto,

More information