so that you can see that it explains some, but certainly not nearly all!, of the variation.

Size: px
Start display at page:

Download "so that you can see that it explains some, but certainly not nearly all!, of the variation."

Transcription

1 Lecture Notes 9 Econ 20150, Prncples of Statstcs Kevn R Foster, CCNY Fall 2012 Multple Regresson more than one X varable Regressng just one varable on another can be helpful and useful (and provdes a great graphcal ntuton) but t doesn't get us very far. Here's a graph of the relaton of age and wage, so that you can see that t explans some, but certanly not nearly all!, of the varaton. Consder ths example, usng data from the March 2010 CPS. We lmt ourselves to only examnng people wth a non-zero annual wage/salary who are workng fulltme (WSAL_VAL > 0 & HRCHECK = 2). We look at the dfferent wages reported by people who label themselves as whte, Afrcan-Amercan, Asan, Natve Amercan, and Hspanc. There are

2 62,043 whtes, 9,101 Afrcan-Amercans, 4476 Asans, 2149 Natve Amercans, and 12,401 Hspancs n the data who fulfll ths condton. The average yearly salary for whtes s $50,782; for Afrcan-Amercans t s $39,131; for Asans $57,541; for Natve Amercans $38,036; for Hspancs t s $36,678. Conventonal statstcal tests fnd that these averages are sgnfcantly dfferent. Does ths prove dscrmnaton? No; there are many other reasons why groups of people could have dfferent ncomes such as educatonal level or age or a multtude of other factors. (But t s not nconsstent wth a hypothess of racsm: remember the dfference, when evaluatng hypotheses, between 'not rejectng' or 'acceptng'). We mght reasonably break these numbers down further. These groups of people are dfferent n a varety of ways. Ther average ages are dfferent between Hspancs, averagng years, and non-hspancs, averagng years. So how much of the wage dfference, for Hspancs, s due to the fact that they're younger? We could do an ANOVA on ths but that would omt other factors. The populatons also dfferent n gender ratos. For whtes, 57% were male; for Afrcan- Amercans 46% were male; for Hspancs 59% were male. Snce gender also affects ncome, we mght thnk some of the wage gap could be due, not to racal dscrmnaton, but to gender dscrmnaton. But then they're also dfferent n educatonal attanment! Among the Hspanc workers, 30% had not fnshed hgh school; for Afrcan-Amercans 8.8% had not; for whtes 9% had not fnshed wth a dploma. And 12% of whtes had an advanced degree whle 8.3% of Afrcan Amercans and 4.2% of Hspancs had such credentals. The dfferent fractons n educatonal attanment add credblty to the hypothess that not all racal/ethnc varaton means dscrmnaton (n the labor market, at least there could be dscrmnaton n educaton so certan groups get less or worse educaton). Fnally they're dfferent n what secton of the country they lve n, as measured by Census regon. So how can we keep all of these dfferent factors straght? Multple Regresson From the standpont of just usng SPSS, there s no dfference for the user between a unvarate and multvarate lnear regresson. Agan use "Analyze\ Regresson\ Lnear..." but then add a bunch of varables to the "Independent(s)" box. In formulas, model has k explanatory varables for each of 1,2, n have n > k) y x x x 0 1 1, 2 2, k k, observatons (must

3 Each coeffcent estmate, notated as ˆ j, has standardzed dstrbuton as t wth (n k) degrees of freedom. Each coeffcent represents the amount by whch the y would be expected to change, for a y small change n the partcular x-varable (.e. j ). x Note that you must be a bt careful specfyng the varables. The CPS codes educatonal attanment wth a bunch of numbers from 31 to 46 but these numbers have no nherent meanng. So too race, geography, ndustry, and occupaton. If a person graduates hgh school then ther grade codng changes from 38 to 39 but ths must be coded wth a dummy varable. If a person moves from New York to North Dakota then ths ncreases ther state code from 36 to 38; ths s not the same change as would occur for someone movng from North Dakota to Oklahoma (40) nor s t half of the change as would occur for someone movng from New York to North Carolna (37). Each state needs a dummy varable. A multvarate regresson can control for all of the dfferent changes to focus on each tem ndvdually. So we mght model a person's wage/salary value as a functon of ther age, ther gender, race/ethncty (Afrcan-Amercan, Asan, Natve Amercan, Hspanc), f they're an mmgrant, sx educatonal varables (hgh school dploma, some college but no degree, Assocate's n vocatonal feld, Assocate's n academc feld, a 4-year degree, or advanced degree), f they're marred or dvorced/wdowed/separated, f they're a unon member, and f they're a veteran. Results (from the sample above, of March 2010 fulltme workers wth nonzero wage), are gven by SPSS as: j Model Summary Model R R Square Adjusted R Square Std. Error of the Estmate a a. Predctors: (Constant), Veteran (any), Afrcan Amercan, Educaton: Assocate n vocatonal, Unon member, Educaton: Assocate n academc, Natve Amercan Indan or Alaskan or Hawaan, Dvorced or Wdowed or Separated, Asan, Educaton: Advanced Degree, Hspanc, Female, Educaton: Some College but no degree, Demographcs, Age, Educaton: 4-yr degree, Immgrant, Marred, Educaton: Hgh School Dploma ANOVA b Model Sum of Squares df Mean Square F Sg.

4 1 Regresson 4.416E E a Resdual 1.704E E9 Total 2.146E a. Predctors: (Constant), Veteran (any), Afrcan Amercan, Educaton: Assocate n vocatonal, Unon member, Educaton: Assocate n academc, Natve Amercan Indan or Alaskan or Hawaan, Dvorced or Wdowed or Separated, Asan, Educaton: Advanced Degree, Hspanc, Female, Educaton: Some College but no degree, Demographcs, Age, Educaton: 4-yr degree, Immgrant, Marred, Educaton: Hgh School Dploma b. Dependent Varable: Total wage and salary earnngs amount - Person Coeffcents a Unstandardzed Coeffcents Standardzed Coeffcents Model B Std. Error Beta t Sg. 1 (Constant) Demographcs, Age Female Afrcan Amercan Asan Natve Amercan Indan or Alaskan or Hawaan Hspanc Immgrant Educaton: Hgh School Dploma Educaton: Some College but no degree Educaton: Assocate n vocatonal Educaton: Assocate n academc Educaton: 4-yr degree

5 Educaton: Advanced Degree Marred Dvorced or Wdowed or Separated Unon member Veteran (any) a. Dependent Varable: Total wage and salary earnngs amount - Person For the "Coeffcents" table, the "Unstandardzed coeffcent B" s the estmate of ˆ, the "Std. se ˆ. (In Error" of the unstandardzed coeffcent s the standard error of that estmate, economcs we don't generally use the standardzed beta, whch dvdes the coeffcent estmate by ˆ the standard error of X.) The "t" gven n the table s the t-statstc, t and "Sg." s ts p- se ˆ value the probablty, f the coeffcent were actually zero, of seeng an estmate as large as the one that you got. (Go back and revew f you don't remember all of the detals of ths.) So see Excel sheet to show how to get predcted wages for dfferent groups. Can then nterpret the resdual from the regresson. - Statstcal sgnfcance of coeffcent estmates s more complcated for multple regresson, we can ask whether a group of varables are jontly sgnfcant, whch takes a more complcated test. The dfference between the overall regresson ft and the sgnfcance of any partcular estmate s that a hypothess test of one partcular coeffcent tests f that parameter s zero; s ˆ β = 0? Ths uses the t-statstc t and compares t to a Normal or t dstrbuton se ˆ (dependng on the degrees of freedom). The test of the regresson sgnfcance tests f ALL of the slope coeffcents are smultaneously zero; f β 1 = β 2 = β 3 =... = β K = 0. The latter s much more restrctve. The predcted value of y s notated as ŷ, where yˆ ˆ ˆ ˆ ˆ 0 1x1 2x2 kxk. Its standard error s the standard error of the regresson, gven by SPSS as "Standard Error of the Estmate." y yˆ y ˆ ˆ ˆ ˆ The resdual s 0 1x1 2x2 kxk. The resdual of, for example, a wage regresson can be nterpreted as the part of the wage that s not explaned by the factors wthn the model.

6 Resduals are often used n analyses of productvty. Suppose I am analyzng a chan's stores to fgure out whch are managed best. I know that there are many reasons for varaton n revenues and cost so I can get data on those: how many workers are there and ther pay, the locaton of the store relatve to traffc, the rent pad, any sales or promotons gong on, etc. If I ŷ, of what proft would have run a regresson on all of those factors then I get an estmate, been expected, gven external factors. Then the dfference represents the unexplaned or resdual amount of varaton: some stores would have been expected to be proftable and are ndeed; some are not lvng up to potental; some would not have been expected to do so well but somethng s gong on so they're dong much better than expected. Why do we always leave out a dummy varable? Multcollnearty. OLS basc assumptons: o The condtonal dstrbuton of u gven X has a mean of zero. Ths s a complcated way of sayng somethng very basc: I have no addtonal nformaton outsde of the model, whch would allow me to make better guesses. It can also be expressed as mplyng a zero correlaton between X and u. We wll work up to other methods that ncorporate addtonal nformaton. o The X and errors are..d. Ths s often not precsely true; on the other hand t mght be roughly rght, and t gves us a place to start. o X and errors don't have values that are "too extreme." Ths s techncal (about exstence of fourth moments) and broadly true, whenever the X and Y data have a lmt on the amount of varaton, although there mght be partcular crcumstances where t s questonable (sometmes n fnance). So f these are true then the OLS are unbased and consstent. So E ˆ 0 0 and E ˆ 1 1. The normal dstrbuton, as the sample gets large, allows us to make hypothess tests about the values of the betas. In partcular, f you look back to the "eyeball" data at the begnnng, you wll recall that a zero value for the slope, 1, s mportant. It mples no relatonshp between the varables. So we wll commonly test the estmated values of aganst a null hypothess that they are zero. Nonlnear Regresson (more properly, How to Jam Nonlneartes nto a Lnear Regresson) X, X 2, X 3, X r ln(x), ln(y), both ln(y) & ln(x) dummy varables nteractons of dummes nteractons of dummy/contnuous nteractons of contnuous varables

7 There are many examples of, and reasons for, nonlnearty. In fact we can thnk that the most general case s nonlnearty and a lnear functonal form s just a convenent smplfcaton whch s sometmes useful. But sometmes the smplfcaton has a hgh prce. For example, my kds beleve that age and heght are closely related whch s true for ther sample (.e. mostly kds of a young age, for whom there s a tght relatonshp, plus 2 parents who are aged and tall). If my sample were all chldren then that mght be a decent smplfcaton; f my sample were adults then that's lousy. The usual justfcaton for a lnear regresson s that, for any dfferentable functon, the Taylor Theorem delvers a lnear functon as beng a close approxmaton but ths s only wthn a neghborhood. We need to work to get a good approxmaton. Nonlnear terms We can return to our regresson usng CPS data. Frst, we mght want to ask why our regresson s lnear. Ths s mostly convenence, and we can easly add non-lnear terms such as Age 2, f we thnk that the typcal age/wage profle looks lke ths: Wage Age So the regresson would be: Wage Age Age (where the term "..." ndcates "other stuff" that should be n the regresson). As we remember from calculus, dwage Age dage so that the extra boost n wage from another brthday mght fall as the person gets older, and even turn negatve f the estmate of 2 0 (a bt of algebra can solve for the top of the hll dwage by fndng the Age that sets 0 dage ). We can add hgher-order effects as well. Some labor econometrcans argue for ncludng Age 3 and Age 4 terms, whch can trace out some complcated wage/age profles. However we need to be careful of "overfttng" addng more explanatory varables wll never lower the R 2. Logarthms

8 Smlarly can specfy X or Y as ln(x) and/or ln(y). But we've got to be careful: remember from math (or theory of nsurance from Intermedate Mcro) that E[ln(Y)] IS NOT EQUAL TO ln(e[y])! In cases where we're regressng on wages, ths means that the log of the average wage s not equal to the average log wage. (Try t. Go ahead, I'll wat.) When both X and Y are measured n logs then the coeffcents have an easy economc dy 1 dx nterpretaton. Recall from calculus that wth y ln x and, so dy % x -- our dx x x usual frend, the percent change. So n a regresson where both X and Y are n logarthms, then y % y j s the elastcty of Y wth respect to X. x % x Also, f Y s n logs and D s a dummy varable, then the coeffcent on the dummy varable s just the percent change when D swtches from zero to one. So the choce of whether to specfy Y as levels or logs s equvalent to askng whether dummy varables are better specfed as havng a constant level effect (.e. women make $10,000 less than men) or havng a percent change effect (women make 25% less than men). As usual there s no general answer that one or the other s always rght! Recall our dscusson of dummy varables, that take values of just 0 or 1, whch we ll represent as D. Snce, unlke the contnuous varable Age, D takes just two values, t represents a shft of the constant term. So the regresson, Wage Age D u shows that people wth D=0 have ntercept of just 0, whle those wth D=1 have ntercept equal to Graphcally, ths s: We need not assume that the 3 term s postve f t were negatve, t would just shft the lne downward. We do however assume that the rate at whch age ncreases wages s the same for both genders the lnes are parallel.

9 Dummy Varables Interactng wth Other Explanatory Varables The assumpton about parallel lnes wth the same slopes can be modfed by addng nteracton terms: defne a varable as the product of the dummy tmes age, so the regresson s Wage 0 1Age 3D 4D Age u Wage Wage so that, for those wth D=0, as before = 1 but for those wth D=1, 14. Age Age Graphcally, so now the ntercepts and slopes are dfferent. So we mght wonder f men and women have a smlar wage-age profle. We could ft a number of possble specfcatons that are varatons of our basc model that wage depends on age and age-squared. The frst possble varaton s smply that: 2 Wage Age Age D u, whch allows the wage profle lnes to have dfferent ntercept-values but otherwse to be parallel (the same hump pont where wages have ther maxmum value), as shown by ths graph: w Age The next varaton would be to allow the lnes to have dfferent slopes as well as dfferent ntercepts:

10 Wage Age Age D 4D Age 5D Age u whch allows the two groups to have dfferent-shaped wage-age profles, as n ths graph: w Age (The wage-age profles mght ntersect or they mght not t depends on the sample data.) Ths specfcaton, wth a dummy varable multplyng each term: the constant and all the explanatory varables, s equvalent to runnng two separate regressons: one for men and one for women: D 0 D 1 Wage Age Age u male male male Wage Age Age e female female female female Where the new coeffcents are related to the old by the denttes: 0 0 3, female female 1 1 4, and Sometmes breakng up the regressons s easer, f there are large datasets and many nteractons. Testng f All the New Varable Coeffcents are Zero You're wonderng how to tell f all of these new nteractons are worthwhle. Smple: Hypothess Testng! There are varous formulas, some more complcated, but for the case of homoskedastcty the formula s relatvely smple. Why any formula at all why not look at the t-tests ndvdually? Because the ndvdual t-tests are askng f each ndvdual coeffcent s zero, not f t s zero and others as well are also zero. That would be a stronger test. To measure how much a group of varables contrbutes to the regresson, we look at the resdual values how much s stll unexplaned, after the varous models? And snce ths s OLS, we look at the squared resduals. SPSS outputs the "Sum of Squares" for the Resduals n the box labeled "ANOVA". We compare the sum of squares from the two models and see how much t has gone down wth the extra varables. A bg decrease ndcates that the new

11 varables are dong good work. And how do we know, how bg s "bg"? Compare t to some gven dstrbuton, n ths case the F dstrbuton. Bascally we look at the percent change n the sum of squares, so somethng lke: SSR0 SSR1 F SSR1 wth the wavy equals sgn to show that we're not qute done. Note that model 0 s the orgnal model and model 1 s the model wth the addtonal regressors, whch wll have a smaller resdual (so ths F can never be negatve). To make ths equal, we need to make t a bt lke an elastcty what s the percent change n the number of varables n the model? Suppose that we have N observatons and that the orgnal model has K varables, to whch we're consderng addng Q more observatons. Then the orgnal model has (N K 1) degrees of freedom [that "1" s for the constant term] whle the new model has (N K Q 1) degrees of freedom, so the Q dfference s Q. So the percent change n degrees of freedom s. Then the full N K Q 1 formula for the F test s SSR0 SSR 1 SSR1 F. Q N K Q 1 Whch s, admttedly, fugly. But we know ts dstrbuton, t's F wth (Q, N-K-Q-1) degrees of freedom the F-dstrbuton has 2 sets of degrees of freedom. Calculate that F, then use Excel to calculate FDIST(F,Q,N-K-Q-1), whch wll output a p-value for the test. If the p-value s less than 5%, reject the null hypothess. Multple Dummy Varables Multple dummy varables, D 1,, D 2,,,D J,, operate on the same basc prncple. Of course we can then have many further nteractons! Suppose we have dummes for educaton and mmgrant status. The coeffcent on educaton would tell us how the typcal person (whether mmgrant or natve) fares, whle the coeffcent on mmgrant would tell us how the typcal mmgrant (whatever her educaton) fares. An nteracton of more than Bachelor s degree wth Immgrant would tell how the typcal hghly-educated mmgrant would do beyond how the typcal mmgrant and typcal hghly-educated person would do (whch mght be dfferent, for both ends of the educaton scale). Many, Many Dummy Varables Don't let the name fool you you'd have to be a dummy not to use lots of dummy varables. For example regressons to explan people's wages mght use dummy varables for the ndustry n whch a person works. Regressons about fnancal data such as stock prces mght nclude dummes for the days of the week and months of the year.

12 Dummes for ndustres are often denoted wth labels lke "two-dgt" or "three-dgt" or smlar jargon. To understand ths, you need to understand how the government classfes ndustres. A specfc ndustry mght get a 4-dgt code where each dgt makes a further more detaled classfcaton. The frst dgt refers to the broad secton of the economy, as goods pass from the frst producers (farmers and mners, frst dgt zero) to manufacturers (1 n the frst dgt for non-durable manufacturers such as meat processng, 2 for durable manufacturng, 3 for hghertech goods) to transportaton, communcatons and utltes (4), to wholesale trade (5) then retal (6). The 7's begn wth FIRE (Fnance, Insurance, and Real Estate) then servces n the later 7 and early 8 dgts whle the 9 s for governments. The second and thrd dgts gve more detal: e.g. 377 s for sawmlls, 378 for plywood and engneered wood, 379 for prefabrcated wood homes. Some data sets mght gve you 5-dgt or even 6-dgt nformaton. These classfcatons date back to the 1930s and 1940s so some parts show ther age: the everncreasng number of computer parts go where plan "offce supples" used to be. The CPS data dstngushes between "major ndustres" wth 16 categores and "detaled ndustry" wth about 50. Creatng 50 dummy varables could be tresome so I recommend that you use SPSS's syntax edtor that makes cut-and-paste work easer. For example use the buttons to "compute" the frst dummy varable then "Paste Syntax" to see the general form. Then copy-and-paste and change the number for the 51 varables: COMPUTE d_nd1 = (a_dtnd EQ 1). COMPUTE d_nd2 = (a_dtnd EQ 2). COMPUTE d_nd3 = (a_dtnd EQ 3). COMPUTE d_nd4 = (a_dtnd EQ 4). COMPUTE d_nd5 = (a_dtnd EQ 5). COMPUTE d_nd6 = (a_dtnd EQ 6). COMPUTE d_nd7 = (a_dtnd EQ 7). You get the dea take ths up to 51. Then add them to your regresson! In other models such as predctons of sales, the specfcaton mght nclude a tme trend (as dscussed earler) plus dummy varables for days of the week or months of the year, to represent the typcal sales for, say, "a Monday n June". If you're lazy lke me, you mght not want to do all of ths mousework. (And f you really have a lot of varables, then you don't even have to be lazy.) There must be an easer way! There s. SPSS s a graphcal nterface that bascally wrtes SPSS code, whch s then submtted to the program. Clckng the buttons s wrtng computer code. Look agan at ths screen, where I've started codng the next dummy varable, ed_hs (from Transform\Compute Varables )

13 That lttle button, "Paste," can be a lot of help. It pastes the SPSS code that you just created wth buttons nto the SPSS Syntax Edtor.

14 Why s ths helpful? Because you can copy and paste these lnes of code, f you are only gong to make small changes to create a bunch of new varables. So, for example, the educaton dummes could be created wth ths code: COMPUTE ed_hs = (A_HGA = 39). VARIABLE LABELS ed_hs 'Hgh School Dploma'. COMPUTE ed_smc = (A_HGA > 39) & (A_HGA < 43). VARIABLE LABELS ed_smc 'Some College'. COMPUTE ed_coll = (A_HGA = 43). VARIABLE LABELS ed_coll 'College 4 Year Degree'. COMPUTE ed_adv = (A_HGA > 43). VARIABLE LABELS ed_adv 'Advanced Degree'. EXECUTE. Then choose "Run\All" from the drop-down menus to have SPSS execute the code. You can really see the tme-savng element f, for example, you want to create dummes for geographcal area. There s a code, GEDIV, that tells what secton of the country the respondent lves n. Agan these numbers have absolutely no nherent value, they're just codes from 1, New England, to 9, Pacfc regon. We can't put GEDIV nto a regresson but we can put geographc dummes. So we use the same procedure to create these: COMPUTE geo_1 = (GEDIV = 1). COMPUTE geo_2 = (GEDIV = 2). COMPUTE geo_3 = (GEDIV = 3). COMPUTE geo_4 = (GEDIV = 4). COMPUTE geo_5 = (GEDIV = 5). COMPUTE geo_6 = (GEDIV = 6). COMPUTE geo_7 = (GEDIV = 7). COMPUTE geo_8 = (GEDIV = 8). COMPUTE geo_9 = (GEDIV = 9). EXECUTE. You can begn to realze the tme-savng capablty here. Later we mght create 50 detaled ndustry and 25 detaled occupaton dummes. If at some pont you get stuck (maybe the "Run" returns errors) or f you don't know the syntax to create a varable, you can go back to the button-pushng dalogue box. The fnal advantage s that, f you want to do the same commands on a dfferent dataset (say, the March 2009) then as long as you have saved the syntax you can easly submt t agan. Wth enough dummy varables we can start to create some respectable regressons! Use "Data\Select Cases " to use only those wth a non-zero wage. Then do a regresson of wage on Age, race & ethncty (create some dummy varables for these), educatonal attanment, and geographc regon.

15 Why am I makng you do all of ths? Because I want you to realze all of the choces that go nto creatng a regresson or dong just about anythng wth data. There are a host of choces avalable to you. Some choces are rather conventonal (for example, the educaton breakdown I used above) but you need to know the feld n order to know what assumptons are common. Sometmes these commonplace assumptons conceal mportant nformaton. You want to do enough expermentaton to understand whch of your choces are crucal to your results. Then you can begn to understand how people mght analyze the exact same data but come to varyng conclusons. If your results contradct someone else's, then you have to fgure out what are the mportant assumptons that create the dfference.

Lecture Notes 11 Econ 20150, Principles of Statistics Kevin R Foster, CCNY Spring 2012

Lecture Notes 11 Econ 20150, Principles of Statistics Kevin R Foster, CCNY Spring 2012 Lecture Notes 11 Econ 20150, Prncples of Statstcs Kevn R Foster, CCNY Sprng 2012 Multple Regresson more than one X varable Regressng just one varable on another can be helpful and useful (and provdes a

More information

Multiple Regression more than one X variable

Multiple Regression more than one X variable Class Oct 24 Kevn R Foster, CCNY, ECO B2000 Fall 2013 Multple Regresson more than one X varable Regressng just one varable on another can be helpful and useful (and provdes a great graphcal ntuton) but

More information

Lecture 6: Introduction to Linear Regression

Lecture 6: Introduction to Linear Regression Lecture 6: Introducton to Lnear Regresson An Manchakul amancha@jhsph.edu 24 Aprl 27 Lnear regresson: man dea Lnear regresson can be used to study an outcome as a lnear functon of a predctor Example: 6

More information

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 008 Recall: man dea of lnear regresson Lnear regresson can be used to study

More information

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding Recall: man dea of lnear regresson Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 8 Lnear regresson can be used to study an

More information

Chapter 13: Multiple Regression

Chapter 13: Multiple Regression Chapter 13: Multple Regresson 13.1 Developng the multple-regresson Model The general model can be descrbed as: It smplfes for two ndependent varables: The sample ft parameter b 0, b 1, and b are used to

More information

Basically, if you have a dummy dependent variable you will be estimating a probability.

Basically, if you have a dummy dependent variable you will be estimating a probability. ECON 497: Lecture Notes 13 Page 1 of 1 Metropoltan State Unversty ECON 497: Research and Forecastng Lecture Notes 13 Dummy Dependent Varable Technques Studenmund Chapter 13 Bascally, f you have a dummy

More information

(more properly, How to Jam Nonlinearities into a Linear Regression)

(more properly, How to Jam Nonlinearities into a Linear Regression) Lecture Notes 7 Econ B2000, Statstcs and Introducton to Econometrcs Kevn R Foster, the Cty College of New York, CUNY Fall 2014 Nonlnear Regresson (more properly, How to Jam Nonlneartes nto a Lnear Regresson)

More information

Reminder: Nested models. Lecture 9: Interactions, Quadratic terms and Splines. Effect Modification. Model 1

Reminder: Nested models. Lecture 9: Interactions, Quadratic terms and Splines. Effect Modification. Model 1 Lecture 9: Interactons, Quadratc terms and Splnes An Manchakul amancha@jhsph.edu 3 Aprl 7 Remnder: Nested models Parent model contans one set of varables Extended model adds one or more new varables to

More information

Chapter 11: Simple Linear Regression and Correlation

Chapter 11: Simple Linear Regression and Correlation Chapter 11: Smple Lnear Regresson and Correlaton 11-1 Emprcal Models 11-2 Smple Lnear Regresson 11-3 Propertes of the Least Squares Estmators 11-4 Hypothess Test n Smple Lnear Regresson 11-4.1 Use of t-tests

More information

January Examinations 2015

January Examinations 2015 24/5 Canddates Only January Examnatons 25 DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR STUDENT CANDIDATE NO.. Department Module Code Module Ttle Exam Duraton (n words)

More information

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands Content. Inference on Regresson Parameters a. Fndng Mean, s.d and covarance amongst estmates.. Confdence Intervals and Workng Hotellng Bands 3. Cochran s Theorem 4. General Lnear Testng 5. Measures of

More information

DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR. Introductory Econometrics 1 hour 30 minutes

DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR. Introductory Econometrics 1 hour 30 minutes 25/6 Canddates Only January Examnatons 26 Student Number: Desk Number:...... DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR Department Module Code Module Ttle Exam Duraton

More information

/ n ) are compared. The logic is: if the two

/ n ) are compared. The logic is: if the two STAT C141, Sprng 2005 Lecture 13 Two sample tests One sample tests: examples of goodness of ft tests, where we are testng whether our data supports predctons. Two sample tests: called as tests of ndependence

More information

[The following data appear in Wooldridge Q2.3.] The table below contains the ACT score and college GPA for eight college students.

[The following data appear in Wooldridge Q2.3.] The table below contains the ACT score and college GPA for eight college students. PPOL 59-3 Problem Set Exercses n Smple Regresson Due n class /8/7 In ths problem set, you are asked to compute varous statstcs by hand to gve you a better sense of the mechancs of the Pearson correlaton

More information

Comparison of Regression Lines

Comparison of Regression Lines STATGRAPHICS Rev. 9/13/2013 Comparson of Regresson Lnes Summary... 1 Data Input... 3 Analyss Summary... 4 Plot of Ftted Model... 6 Condtonal Sums of Squares... 6 Analyss Optons... 7 Forecasts... 8 Confdence

More information

Statistics for Economics & Business

Statistics for Economics & Business Statstcs for Economcs & Busness Smple Lnear Regresson Learnng Objectves In ths chapter, you learn: How to use regresson analyss to predct the value of a dependent varable based on an ndependent varable

More information

PBAF 528 Week Theory Is the variable s place in the equation certain and theoretically sound? Most important! 2. T-test

PBAF 528 Week Theory Is the variable s place in the equation certain and theoretically sound? Most important! 2. T-test PBAF 528 Week 6 How do we choose our model? How do you decde whch ndependent varables? If you want to read more about ths, try Studenmund, A.H. Usng Econometrcs Chapter 7. (ether 3 rd or 4 th Edtons) 1.

More information

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4)

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4) I. Classcal Assumptons Econ7 Appled Econometrcs Topc 3: Classcal Model (Studenmund, Chapter 4) We have defned OLS and studed some algebrac propertes of OLS. In ths topc we wll study statstcal propertes

More information

Midterm Examination. Regression and Forecasting Models

Midterm Examination. Regression and Forecasting Models IOMS Department Regresson and Forecastng Models Professor Wllam Greene Phone: 22.998.0876 Offce: KMC 7-90 Home page: people.stern.nyu.edu/wgreene Emal: wgreene@stern.nyu.edu Course web page: people.stern.nyu.edu/wgreene/regresson/outlne.htm

More information

Chapter 8 Multivariate Regression Analysis

Chapter 8 Multivariate Regression Analysis Chapter 8 Multvarate Regresson Analyss 8.3 Multple Regresson wth K Independent Varables 8.4 Sgnfcance tests of Parameters Populaton Regresson Model For K ndependent varables, the populaton regresson and

More information

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6 Department of Quanttatve Methods & Informaton Systems Tme Seres and Ther Components QMIS 30 Chapter 6 Fall 00 Dr. Mohammad Zanal These sldes were modfed from ther orgnal source for educatonal purpose only.

More information

28. SIMPLE LINEAR REGRESSION III

28. SIMPLE LINEAR REGRESSION III 8. SIMPLE LINEAR REGRESSION III Ftted Values and Resduals US Domestc Beers: Calores vs. % Alcohol To each observed x, there corresponds a y-value on the ftted lne, y ˆ = βˆ + βˆ x. The are called ftted

More information

Economics 130. Lecture 4 Simple Linear Regression Continued

Economics 130. Lecture 4 Simple Linear Regression Continued Economcs 130 Lecture 4 Contnued Readngs for Week 4 Text, Chapter and 3. We contnue wth addressng our second ssue + add n how we evaluate these relatonshps: Where do we get data to do ths analyss? How do

More information

Chapter 5 Multilevel Models

Chapter 5 Multilevel Models Chapter 5 Multlevel Models 5.1 Cross-sectonal multlevel models 5.1.1 Two-level models 5.1.2 Multple level models 5.1.3 Multple level modelng n other felds 5.2 Longtudnal multlevel models 5.2.1 Two-level

More information

Chapter 15 - Multiple Regression

Chapter 15 - Multiple Regression Chapter - Multple Regresson Chapter - Multple Regresson Multple Regresson Model The equaton that descrbes how the dependent varable y s related to the ndependent varables x, x,... x p and an error term

More information

Lecture Notes for STATISTICAL METHODS FOR BUSINESS II BMGT 212. Chapters 14, 15 & 16. Professor Ahmadi, Ph.D. Department of Management

Lecture Notes for STATISTICAL METHODS FOR BUSINESS II BMGT 212. Chapters 14, 15 & 16. Professor Ahmadi, Ph.D. Department of Management Lecture Notes for STATISTICAL METHODS FOR BUSINESS II BMGT 1 Chapters 14, 15 & 16 Professor Ahmad, Ph.D. Department of Management Revsed August 005 Chapter 14 Formulas Smple Lnear Regresson Model: y =

More information

Statistics for Business and Economics

Statistics for Business and Economics Statstcs for Busness and Economcs Chapter 11 Smple Regresson Copyrght 010 Pearson Educaton, Inc. Publshng as Prentce Hall Ch. 11-1 11.1 Overvew of Lnear Models n An equaton can be ft to show the best lnear

More information

Negative Binomial Regression

Negative Binomial Regression STATGRAPHICS Rev. 9/16/2013 Negatve Bnomal Regresson Summary... 1 Data Input... 3 Statstcal Model... 3 Analyss Summary... 4 Analyss Optons... 7 Plot of Ftted Model... 8 Observed Versus Predcted... 10 Predctons...

More information

18. SIMPLE LINEAR REGRESSION III

18. SIMPLE LINEAR REGRESSION III 8. SIMPLE LINEAR REGRESSION III US Domestc Beers: Calores vs. % Alcohol Ftted Values and Resduals To each observed x, there corresponds a y-value on the ftted lne, y ˆ ˆ = α + x. The are called ftted values.

More information

Correlation and Regression. Correlation 9.1. Correlation. Chapter 9

Correlation and Regression. Correlation 9.1. Correlation. Chapter 9 Chapter 9 Correlaton and Regresson 9. Correlaton Correlaton A correlaton s a relatonshp between two varables. The data can be represented b the ordered pars (, ) where s the ndependent (or eplanator) varable,

More information

Chapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise.

Chapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise. Chapter - The Smple Lnear Regresson Model The lnear regresson equaton s: where y + = β + β e for =,..., y and are observable varables e s a random error How can an estmaton rule be constructed for the

More information

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation Statstcs for Managers Usng Mcrosoft Excel/SPSS Chapter 13 The Smple Lnear Regresson Model and Correlaton 1999 Prentce-Hall, Inc. Chap. 13-1 Chapter Topcs Types of Regresson Models Determnng the Smple Lnear

More information

x i1 =1 for all i (the constant ).

x i1 =1 for all i (the constant ). Chapter 5 The Multple Regresson Model Consder an economc model where the dependent varable s a functon of K explanatory varables. The economc model has the form: y = f ( x,x,..., ) xk Approxmate ths by

More information

Chapter 8 Indicator Variables

Chapter 8 Indicator Variables Chapter 8 Indcator Varables In general, e explanatory varables n any regresson analyss are assumed to be quanttatve n nature. For example, e varables lke temperature, dstance, age etc. are quanttatve n

More information

Dummy variables in multiple variable regression model

Dummy variables in multiple variable regression model WESS Econometrcs (Handout ) Dummy varables n multple varable regresson model. Addtve dummy varables In the prevous handout we consdered the followng regresson model: y x 2x2 k xk,, 2,, n and we nterpreted

More information

STATISTICS QUESTIONS. Step by Step Solutions.

STATISTICS QUESTIONS. Step by Step Solutions. STATISTICS QUESTIONS Step by Step Solutons www.mathcracker.com 9//016 Problem 1: A researcher s nterested n the effects of famly sze on delnquency for a group of offenders and examnes famles wth one to

More information

Y = β 0 + β 1 X 1 + β 2 X β k X k + ε

Y = β 0 + β 1 X 1 + β 2 X β k X k + ε Chapter 3 Secton 3.1 Model Assumptons: Multple Regresson Model Predcton Equaton Std. Devaton of Error Correlaton Matrx Smple Lnear Regresson: 1.) Lnearty.) Constant Varance 3.) Independent Errors 4.) Normalty

More information

Lecture 4 Hypothesis Testing

Lecture 4 Hypothesis Testing Lecture 4 Hypothess Testng We may wsh to test pror hypotheses about the coeffcents we estmate. We can use the estmates to test whether the data rejects our hypothess. An example mght be that we wsh to

More information

Module Contact: Dr Susan Long, ECO Copyright of the University of East Anglia Version 1

Module Contact: Dr Susan Long, ECO Copyright of the University of East Anglia Version 1 UNIVERSITY OF EAST ANGLIA School of Economcs Man Seres PG Examnaton 016-17 ECONOMETRIC METHODS ECO-7000A Tme allowed: hours Answer ALL FOUR Questons. Queston 1 carres a weght of 5%; Queston carres 0%;

More information

Chapter 14 Simple Linear Regression

Chapter 14 Simple Linear Regression Chapter 4 Smple Lnear Regresson Chapter 4 - Smple Lnear Regresson Manageral decsons often are based on the relatonshp between two or more varables. Regresson analss can be used to develop an equaton showng

More information

Statistics II Final Exam 26/6/18

Statistics II Final Exam 26/6/18 Statstcs II Fnal Exam 26/6/18 Academc Year 2017/18 Solutons Exam duraton: 2 h 30 mn 1. (3 ponts) A town hall s conductng a study to determne the amount of leftover food produced by the restaurants n the

More information

NANYANG TECHNOLOGICAL UNIVERSITY SEMESTER I EXAMINATION MTH352/MH3510 Regression Analysis

NANYANG TECHNOLOGICAL UNIVERSITY SEMESTER I EXAMINATION MTH352/MH3510 Regression Analysis NANYANG TECHNOLOGICAL UNIVERSITY SEMESTER I EXAMINATION 014-015 MTH35/MH3510 Regresson Analyss December 014 TIME ALLOWED: HOURS INSTRUCTIONS TO CANDIDATES 1. Ths examnaton paper contans FOUR (4) questons

More information

ECONOMETRICS - FINAL EXAM, 3rd YEAR (GECO & GADE)

ECONOMETRICS - FINAL EXAM, 3rd YEAR (GECO & GADE) ECONOMETRICS - FINAL EXAM, 3rd YEAR (GECO & GADE) June 7, 016 15:30 Frst famly name: Name: DNI/ID: Moble: Second famly Name: GECO/GADE: Instructor: E-mal: Queston 1 A B C Blank Queston A B C Blank Queston

More information

Statistics for Managers Using Microsoft Excel/SPSS Chapter 14 Multiple Regression Models

Statistics for Managers Using Microsoft Excel/SPSS Chapter 14 Multiple Regression Models Statstcs for Managers Usng Mcrosoft Excel/SPSS Chapter 14 Multple Regresson Models 1999 Prentce-Hall, Inc. Chap. 14-1 Chapter Topcs The Multple Regresson Model Contrbuton of Indvdual Independent Varables

More information

Here is the rationale: If X and y have a strong positive relationship to one another, then ( x x) will tend to be positive when ( y y)

Here is the rationale: If X and y have a strong positive relationship to one another, then ( x x) will tend to be positive when ( y y) Secton 1.5 Correlaton In the prevous sectons, we looked at regresson and the value r was a measurement of how much of the varaton n y can be attrbuted to the lnear relatonshp between y and x. In ths secton,

More information

Limited Dependent Variables

Limited Dependent Variables Lmted Dependent Varables. What f the left-hand sde varable s not a contnuous thng spread from mnus nfnty to plus nfnty? That s, gven a model = f (, β, ε, where a. s bounded below at zero, such as wages

More information

Soc 3811 Basic Social Statistics Third Midterm Exam Spring 2010

Soc 3811 Basic Social Statistics Third Midterm Exam Spring 2010 Soc 3811 Basc Socal Statstcs Thrd Mdterm Exam Sprng 2010 Your Name [50 ponts]: ID #: Your TA: Kyungmn Baek Meghan Zacher Frank Zhang INSTRUCTIONS: (A) Wrte your name on the lne at top front of every sheet.

More information

Linear Feature Engineering 11

Linear Feature Engineering 11 Lnear Feature Engneerng 11 2 Least-Squares 2.1 Smple least-squares Consder the followng dataset. We have a bunch of nputs x and correspondng outputs y. The partcular values n ths dataset are x y 0.23 0.19

More information

Basic Business Statistics, 10/e

Basic Business Statistics, 10/e Chapter 13 13-1 Basc Busness Statstcs 11 th Edton Chapter 13 Smple Lnear Regresson Basc Busness Statstcs, 11e 009 Prentce-Hall, Inc. Chap 13-1 Learnng Objectves In ths chapter, you learn: How to use regresson

More information

III. Econometric Methodology Regression Analysis

III. Econometric Methodology Regression Analysis Page Econ07 Appled Econometrcs Topc : An Overvew of Regresson Analyss (Studenmund, Chapter ) I. The Nature and Scope of Econometrcs. Lot s of defntons of econometrcs. Nobel Prze Commttee Paul Samuelson,

More information

Statistics MINITAB - Lab 2

Statistics MINITAB - Lab 2 Statstcs 20080 MINITAB - Lab 2 1. Smple Lnear Regresson In smple lnear regresson we attempt to model a lnear relatonshp between two varables wth a straght lne and make statstcal nferences concernng that

More information

2016 Wiley. Study Session 2: Ethical and Professional Standards Application

2016 Wiley. Study Session 2: Ethical and Professional Standards Application 6 Wley Study Sesson : Ethcal and Professonal Standards Applcaton LESSON : CORRECTION ANALYSIS Readng 9: Correlaton and Regresson LOS 9a: Calculate and nterpret a sample covarance and a sample correlaton

More information

Department of Statistics University of Toronto STA305H1S / 1004 HS Design and Analysis of Experiments Term Test - Winter Solution

Department of Statistics University of Toronto STA305H1S / 1004 HS Design and Analysis of Experiments Term Test - Winter Solution Department of Statstcs Unversty of Toronto STA35HS / HS Desgn and Analyss of Experments Term Test - Wnter - Soluton February, Last Name: Frst Name: Student Number: Instructons: Tme: hours. Ads: a non-programmable

More information

x = , so that calculated

x = , so that calculated Stat 4, secton Sngle Factor ANOVA notes by Tm Plachowsk n chapter 8 we conducted hypothess tests n whch we compared a sngle sample s mean or proporton to some hypotheszed value Chapter 9 expanded ths to

More information

Lecture 2: Prelude to the big shrink

Lecture 2: Prelude to the big shrink Lecture 2: Prelude to the bg shrnk Last tme A slght detour wth vsualzaton tools (hey, t was the frst day... why not start out wth somethng pretty to look at?) Then, we consdered a smple 120a-style regresson

More information

STAT 3008 Applied Regression Analysis

STAT 3008 Applied Regression Analysis STAT 3008 Appled Regresson Analyss Tutoral : Smple Lnear Regresson LAI Chun He Department of Statstcs, The Chnese Unversty of Hong Kong 1 Model Assumpton To quantfy the relatonshp between two factors,

More information

Introduction to Regression

Introduction to Regression Introducton to Regresson Dr Tom Ilvento Department of Food and Resource Economcs Overvew The last part of the course wll focus on Regresson Analyss Ths s one of the more powerful statstcal technques Provdes

More information

Now we relax this assumption and allow that the error variance depends on the independent variables, i.e., heteroskedasticity

Now we relax this assumption and allow that the error variance depends on the independent variables, i.e., heteroskedasticity ECON 48 / WH Hong Heteroskedastcty. Consequences of Heteroskedastcty for OLS Assumpton MLR. 5: Homoskedastcty var ( u x ) = σ Now we relax ths assumpton and allow that the error varance depends on the

More information

e i is a random error

e i is a random error Chapter - The Smple Lnear Regresson Model The lnear regresson equaton s: where + β + β e for,..., and are observable varables e s a random error How can an estmaton rule be constructed for the unknown

More information

Answers Problem Set 2 Chem 314A Williamsen Spring 2000

Answers Problem Set 2 Chem 314A Williamsen Spring 2000 Answers Problem Set Chem 314A Wllamsen Sprng 000 1) Gve me the followng crtcal values from the statstcal tables. a) z-statstc,-sded test, 99.7% confdence lmt ±3 b) t-statstc (Case I), 1-sded test, 95%

More information

Lecture 3 Stat102, Spring 2007

Lecture 3 Stat102, Spring 2007 Lecture 3 Stat0, Sprng 007 Chapter 3. 3.: Introducton to regresson analyss Lnear regresson as a descrptve technque The least-squares equatons Chapter 3.3 Samplng dstrbuton of b 0, b. Contnued n net lecture

More information

Kernel Methods and SVMs Extension

Kernel Methods and SVMs Extension Kernel Methods and SVMs Extenson The purpose of ths document s to revew materal covered n Machne Learnng 1 Supervsed Learnng regardng support vector machnes (SVMs). Ths document also provdes a general

More information

( )( ) [ ] [ ] ( ) 1 = [ ] = ( ) 1. H = X X X X is called the hat matrix ( it puts the hats on the Y s) and is of order n n H = X X X X.

( )( ) [ ] [ ] ( ) 1 = [ ] = ( ) 1. H = X X X X is called the hat matrix ( it puts the hats on the Y s) and is of order n n H = X X X X. ( ) ( ) where ( ) 1 ˆ β = X X X X β + ε = β + Aε A = X X 1 X [ ] E ˆ β β AE ε β so ˆ = + = β s unbased ( )( ) [ ] ˆ Cov β = E ˆ β β ˆ β β = E Aεε A AE ε ε A Aσ IA = σ AA = σ X X = [ ] = ( ) 1 Ftted values

More information

Learning Objectives for Chapter 11

Learning Objectives for Chapter 11 Chapter : Lnear Regresson and Correlaton Methods Hldebrand, Ott and Gray Basc Statstcal Ideas for Managers Second Edton Learnng Objectves for Chapter Usng the scatterplot n regresson analyss Usng the method

More information

Linear regression. Regression Models. Chapter 11 Student Lecture Notes Regression Analysis is the

Linear regression. Regression Models. Chapter 11 Student Lecture Notes Regression Analysis is the Chapter 11 Student Lecture Notes 11-1 Lnear regresson Wenl lu Dept. Health statstcs School of publc health Tanjn medcal unversty 1 Regresson Models 1. Answer What Is the Relatonshp Between the Varables?.

More information

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction ECONOMICS 5* -- NOTE (Summary) ECON 5* -- NOTE The Multple Classcal Lnear Regresson Model (CLRM): Specfcaton and Assumptons. Introducton CLRM stands for the Classcal Lnear Regresson Model. The CLRM s also

More information

Linear Regression Analysis: Terminology and Notation

Linear Regression Analysis: Terminology and Notation ECON 35* -- Secton : Basc Concepts of Regresson Analyss (Page ) Lnear Regresson Analyss: Termnology and Notaton Consder the generc verson of the smple (two-varable) lnear regresson model. It s represented

More information

Analytical Chemistry Calibration Curve Handout

Analytical Chemistry Calibration Curve Handout I. Quck-and Drty Excel Tutoral Analytcal Chemstry Calbraton Curve Handout For those of you wth lttle experence wth Excel, I ve provded some key technques that should help you use the program both for problem

More information

Homework Assignment 3 Due in class, Thursday October 15

Homework Assignment 3 Due in class, Thursday October 15 Homework Assgnment 3 Due n class, Thursday October 15 SDS 383C Statstcal Modelng I 1 Rdge regresson and Lasso 1. Get the Prostrate cancer data from http://statweb.stanford.edu/~tbs/elemstatlearn/ datasets/prostate.data.

More information

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U)

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U) Econ 413 Exam 13 H ANSWERS Settet er nndelt 9 deloppgaver, A,B,C, som alle anbefales å telle lkt for å gøre det ltt lettere å stå. Svar er gtt . Unfortunately, there s a prntng error n the hnt of

More information

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore Sesson Outlne Introducton to classfcaton problems and dscrete choce models. Introducton to Logstcs Regresson. Logstc functon and Logt functon. Maxmum Lkelhood Estmator (MLE) for estmaton of LR parameters.

More information

BIO Lab 2: TWO-LEVEL NORMAL MODELS with school children popularity data

BIO Lab 2: TWO-LEVEL NORMAL MODELS with school children popularity data Lab : TWO-LEVEL NORMAL MODELS wth school chldren popularty data Purpose: Introduce basc two-level models for normally dstrbuted responses usng STATA. In partcular, we dscuss Random ntercept models wthout

More information

ANOVA. The Observations y ij

ANOVA. The Observations y ij ANOVA Stands for ANalyss Of VArance But t s a test of dfferences n means The dea: The Observatons y j Treatment group = 1 = 2 = k y 11 y 21 y k,1 y 12 y 22 y k,2 y 1, n1 y 2, n2 y k, nk means: m 1 m 2

More information

Resource Allocation and Decision Analysis (ECON 8010) Spring 2014 Foundations of Regression Analysis

Resource Allocation and Decision Analysis (ECON 8010) Spring 2014 Foundations of Regression Analysis Resource Allocaton and Decson Analss (ECON 800) Sprng 04 Foundatons of Regresson Analss Readng: Regresson Analss (ECON 800 Coursepak, Page 3) Defntons and Concepts: Regresson Analss statstcal technques

More information

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers Psychology 282 Lecture #24 Outlne Regresson Dagnostcs: Outlers In an earler lecture we studed the statstcal assumptons underlyng the regresson model, ncludng the followng ponts: Formal statement of assumptons.

More information

x yi In chapter 14, we want to perform inference (i.e. calculate confidence intervals and perform tests of significance) in this setting.

x yi In chapter 14, we want to perform inference (i.e. calculate confidence intervals and perform tests of significance) in this setting. The Practce of Statstcs, nd ed. Chapter 14 Inference for Regresson Introducton In chapter 3 we used a least-squares regresson lne (LSRL) to represent a lnear relatonshp etween two quanttatve explanator

More information

The Granular Origins of Aggregate Fluctuations : Supplementary Material

The Granular Origins of Aggregate Fluctuations : Supplementary Material The Granular Orgns of Aggregate Fluctuatons : Supplementary Materal Xaver Gabax October 12, 2010 Ths onlne appendx ( presents some addtonal emprcal robustness checks ( descrbes some econometrc complements

More information

β0 + β1xi. You are interested in estimating the unknown parameters β

β0 + β1xi. You are interested in estimating the unknown parameters β Ordnary Least Squares (OLS): Smple Lnear Regresson (SLR) Analytcs The SLR Setup Sample Statstcs Ordnary Least Squares (OLS): FOCs and SOCs Back to OLS and Sample Statstcs Predctons (and Resduals) wth OLS

More information

Polynomial Regression Models

Polynomial Regression Models LINEAR REGRESSION ANALYSIS MODULE XII Lecture - 6 Polynomal Regresson Models Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur Test of sgnfcance To test the sgnfcance

More information

a. (All your answers should be in the letter!

a. (All your answers should be in the letter! Econ 301 Blkent Unversty Taskn Econometrcs Department of Economcs Md Term Exam I November 8, 015 Name For each hypothess testng n the exam complete the followng steps: Indcate the test statstc, ts crtcal

More information

Diagnostics in Poisson Regression. Models - Residual Analysis

Diagnostics in Poisson Regression. Models - Residual Analysis Dagnostcs n Posson Regresson Models - Resdual Analyss 1 Outlne Dagnostcs n Posson Regresson Models - Resdual Analyss Example 3: Recall of Stressful Events contnued 2 Resdual Analyss Resduals represent

More information

Econ107 Applied Econometrics Topic 9: Heteroskedasticity (Studenmund, Chapter 10)

Econ107 Applied Econometrics Topic 9: Heteroskedasticity (Studenmund, Chapter 10) I. Defnton and Problems Econ7 Appled Econometrcs Topc 9: Heteroskedastcty (Studenmund, Chapter ) We now relax another classcal assumpton. Ths s a problem that arses often wth cross sectons of ndvduals,

More information

Chapter 9: Statistical Inference and the Relationship between Two Variables

Chapter 9: Statistical Inference and the Relationship between Two Variables Chapter 9: Statstcal Inference and the Relatonshp between Two Varables Key Words The Regresson Model The Sample Regresson Equaton The Pearson Correlaton Coeffcent Learnng Outcomes After studyng ths chapter,

More information

since [1-( 0+ 1x1i+ 2x2 i)] [ 0+ 1x1i+ assumed to be a reasonable approximation

since [1-( 0+ 1x1i+ 2x2 i)] [ 0+ 1x1i+ assumed to be a reasonable approximation Econ 388 R. Butler 204 revsons Lecture 4 Dummy Dependent Varables I. Lnear Probablty Model: the Regresson model wth a dummy varables as the dependent varable assumpton, mplcaton regular multple regresson

More information

Correlation and Regression

Correlation and Regression Correlaton and Regresson otes prepared by Pamela Peterson Drake Index Basc terms and concepts... Smple regresson...5 Multple Regresson...3 Regresson termnology...0 Regresson formulas... Basc terms and

More information

University of California at Berkeley Fall Introductory Applied Econometrics Final examination

University of California at Berkeley Fall Introductory Applied Econometrics Final examination SID: EEP 118 / IAS 118 Elsabeth Sadoulet and Daley Kutzman Unversty of Calforna at Berkeley Fall 01 Introductory Appled Econometrcs Fnal examnaton Scores add up to 10 ponts Your name: SID: 1. (15 ponts)

More information

Introduction to Analysis of Variance (ANOVA) Part 1

Introduction to Analysis of Variance (ANOVA) Part 1 Introducton to Analss of Varance (ANOVA) Part 1 Sngle factor The logc of Analss of Varance Is the varance explaned b the model >> than the resdual varance In regresson models Varance explaned b regresson

More information

Econ Statistical Properties of the OLS estimator. Sanjaya DeSilva

Econ Statistical Properties of the OLS estimator. Sanjaya DeSilva Econ 39 - Statstcal Propertes of the OLS estmator Sanjaya DeSlva September, 008 1 Overvew Recall that the true regresson model s Y = β 0 + β 1 X + u (1) Applyng the OLS method to a sample of data, we estmate

More information

Chapter 6. Supplemental Text Material

Chapter 6. Supplemental Text Material Chapter 6. Supplemental Text Materal S6-. actor Effect Estmates are Least Squares Estmates We have gven heurstc or ntutve explanatons of how the estmates of the factor effects are obtaned n the textboo.

More information

Durban Watson for Testing the Lack-of-Fit of Polynomial Regression Models without Replications

Durban Watson for Testing the Lack-of-Fit of Polynomial Regression Models without Replications Durban Watson for Testng the Lack-of-Ft of Polynomal Regresson Models wthout Replcatons Ruba A. Alyaf, Maha A. Omar, Abdullah A. Al-Shha ralyaf@ksu.edu.sa, maomar@ksu.edu.sa, aalshha@ksu.edu.sa Department

More information

Chapter 14 Simple Linear Regression Page 1. Introduction to regression analysis 14-2

Chapter 14 Simple Linear Regression Page 1. Introduction to regression analysis 14-2 Chapter 4 Smple Lnear Regresson Page. Introducton to regresson analyss 4- The Regresson Equaton. Lnear Functons 4-4 3. Estmaton and nterpretaton of model parameters 4-6 4. Inference on the model parameters

More information

LOGIT ANALYSIS. A.K. VASISHT Indian Agricultural Statistics Research Institute, Library Avenue, New Delhi

LOGIT ANALYSIS. A.K. VASISHT Indian Agricultural Statistics Research Institute, Library Avenue, New Delhi LOGIT ANALYSIS A.K. VASISHT Indan Agrcultural Statstcs Research Insttute, Lbrary Avenue, New Delh-0 02 amtvassht@asr.res.n. Introducton In dummy regresson varable models, t s assumed mplctly that the dependent

More information

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics ECOOMICS 35*-A Md-Term Exam -- Fall Term 000 Page of 3 pages QUEE'S UIVERSITY AT KIGSTO Department of Economcs ECOOMICS 35* - Secton A Introductory Econometrcs Fall Term 000 MID-TERM EAM ASWERS MG Abbott

More information

Continuous vs. Discrete Goods

Continuous vs. Discrete Goods CE 651 Transportaton Economcs Charsma Choudhury Lecture 3-4 Analyss of Demand Contnuous vs. Dscrete Goods Contnuous Goods Dscrete Goods x auto 1 Indfference u curves 3 u u 1 x 1 0 1 bus Outlne Data Modelng

More information

A dummy variable equal to 1 if the nearby school is in regular session and 0 otherwise;

A dummy variable equal to 1 if the nearby school is in regular session and 0 otherwise; Lehrstuhl für Betrebswrtschaftslehre, Emprsche Wrtschaftsforschung Otto-von-Guercke-Unverstät Magdeburg, Postfach 410, 39016 Magdeburg Prof. Dr. Dr. Bodo Vogt Otto-von-Guercke-Unverstät Magdeburg Fakultät

More information

Reduced slides. Introduction to Analysis of Variance (ANOVA) Part 1. Single factor

Reduced slides. Introduction to Analysis of Variance (ANOVA) Part 1. Single factor Reduced sldes Introducton to Analss of Varance (ANOVA) Part 1 Sngle factor 1 The logc of Analss of Varance Is the varance explaned b the model >> than the resdual varance In regresson models Varance explaned

More information

Statistics Chapter 4

Statistics Chapter 4 Statstcs Chapter 4 "There are three knds of les: les, damned les, and statstcs." Benjamn Dsrael, 1895 (Brtsh statesman) Gaussan Dstrbuton, 4-1 If a measurement s repeated many tmes a statstcal treatment

More information

Linear Approximation with Regularization and Moving Least Squares

Linear Approximation with Regularization and Moving Least Squares Lnear Approxmaton wth Regularzaton and Movng Least Squares Igor Grešovn May 007 Revson 4.6 (Revson : March 004). 5 4 3 0.5 3 3.5 4 Contents: Lnear Fttng...4. Weghted Least Squares n Functon Approxmaton...

More information

STAT 511 FINAL EXAM NAME Spring 2001

STAT 511 FINAL EXAM NAME Spring 2001 STAT 5 FINAL EXAM NAME Sprng Instructons: Ths s a closed book exam. No notes or books are allowed. ou may use a calculator but you are not allowed to store notes or formulas n the calculator. Please wrte

More information