Statistical registers by restricted neighbor imputation

Size: px
Start display at page:

Download "Statistical registers by restricted neighbor imputation"

Transcription

1 Statstcal regsters by restrcted neghbor mputaton an applcaton to the Norwegan Agrculture Survey Abstract Nna Hagesæther 1 and L-Chun Zhang Statstcs Norway In ths paper we mplement the method of Zhang and Nordbotten (28) for constructng a statstcal regster by the use of neghbor-mputaton wth restrcton (RENI) Emprcal results from the Norwegan Agrculture Survey are shown, and further research on the topc s dscussed 1 Introducton Based on a survey, one may want to estmate the populaton total or mean for several varables To use avalable data more effcently, we can combne data from admnstratve sources and statstcal surveys A way to do ths s to use a sample survey wth target varables and an admnstratve regster wth auxlary varables and construct survey weghts for the sample unts Ths method s common n statstcal producton at the Natonal Statstcal Insttutes An alternatve approach s to predct the values of the target varables for every unt outsde the sample and construct a statstcal regster Statstcal tables wll then be constructed based on all the populaton unts, each of them havng the weght equal to one If one assumes that the probablty of non-response s condtonally ndependent of the varable of nterests gven the auxlary varables, the non-respondent unts n the sample can be predcted n the same way as unts outsde the sample Zhang and Nordbotten (28) proposed three goals they would lke to acheve n constructng a statstcal regster: 1 It should yeld effcent estmates of populaton totals of nterest 2 It should contan correct co-varances among the survey varables, as well as between the survey and auxlary varables 3 It should be non-stochastc, such that the statstcs can be reproduced on repetton The frst goal s tradtonally assocated wth the producton of statstcal tables The second one s a natural requrement now that a statstcal regster s a complete mcro-data fle The last condton s mportant for the acceptablty and face-value n offcal statstcs: e f a procedure s repeated, the results should be exactly the same In the paper they present a table where common mputaton methods are classfed wth respect to the trple-goal crteron: 1 e-mal: nh@ssbno 1

2 Table 1: Trple-goal classfcaton of common mputaton methods, from Zhang and Nordbotten (28) Method (1) (2) (3) Regresson Predcton Not always No Yes Random regresson mputaton No No, f multvarate No Multple Imputaton Not always No, f multvarate No Predctve mean matchng Not always No, f multvarate Yes, n theory Artfcal Neural Network Usually not No, f categorcal Yes Nearest neghbor mputaton Usually not Yes, non-parametrc Yes, n theory Nearest neghbor mputaton (NNI) emerges as the only practcal approach n terms of preservng the co-varances among all the varables It s therefore a prncpal method for the constructon of a statstcal regster Consder the smple stuaton wth a sample ( x 1, y1),, ( x r, y r) where y s a vector of all target varables known for the respondents r, 1 r, and all x -values are observed The NNI method mputes y by y, where r+ 1 N and N s the number of unts n the populaton, and satsfes x x = mn x x l 1 l r and denotes the chosen dstance metrc In the case of multple nearest neghbors, the donor s randomly selected as one of them Chen and Shao (2) showed that NNI yelds asymptotcally consstent estmators of the populaton totals as well as the fnte populaton dstrbutons of nterest However, ther emprcal results also ndcate that the method s often not effcent Zhang and Nordbotten suggested a new approach called RENI (restrcted neghbor mputaton), by mposng restrctons n the mputed totals, whch may be obtaned separately from the NNI such as through a regresson predcton A smple modfcaton s ntroduced n order to make the NNI nonstochastc, and addtonal neghbors are ncluded as potental donors, not only the nearest In secton 2, the algorthm for the RENI approach s descrbed In secton 3, we show emprcal results where RENI s appled to real data from the Norwegan Agrculture Survey In secton 4 we dscuss some further work that we have planned 2 Algorthm for RENI The algorthm to produce a RENI based statstcal regster n Zhang and Nordbotten (28) s descrbed as follows The ump-start (JS) phase Denote by R the set of recevers Denote by D the set of donors Let x be the varable (or varables) based on whch the dstance metrc (and the NNI) s defned 1 Set the counter d = for all D 2 For each R : a) Let m be the number of nearest neghbor (NN) donors, where m 1 b) For each NN-donor, ncrease ts counter d by 1/m 2

3 3 Let Y R denote the column vector of margnal restrctons for the recevers Let y be the correspondng vector of varables for D Put d = d g and g = 1+ (Y Y % ) A% y ' T 1 R R where Y % R = dy and D T A % = dyy It s easly verfed that D dy ' D = Y R ' 4 Let d = a + u, where a s the largest nteger that satsfes a d Sort the recevers n the ncreasng order of m For = 1,, R : a) Fnd the frst NN donor wth a 1 Impute y = y, and decrease a by 1 b) Do nothng f there s no NN-donor wth postve a ' The fne-tune (FT) phase Denote by R the remanng set of recevers that have not yet been ' mputed Extend x to x so that for each R' there s now a unque orderng among the potental donors by x and, some addtonal nformaton z For nstance, z can be the post zp code, the dentfcaton number of unt, and so on Notce that z s not consdered nformatve The unque orderng ensures that the procedure s non-stochastc 1 Set k =1 For each R', fnd the NN-donor and set y y = Let 1 Δ be the dstance between YR' = YR y and YR';k 1 = y accordng to some chosen metrc R\R' = R' 2 Set k =2 For R', let D ;k = 2 contan ts two closest NN-donors a) For each R' b) For = 1,, R', set, fnd the NN-donor and set I I = = for D ;k = 2 that yelds a closer mputed total to c) Repeat step 2b untl no changes can be made Calculate Δ 2 between Y R' Y R' and R';k 2 Y = 3 Stop f Δ 2 =, and use the mputatons from Step 2 Otherwse, stop f Δ2 Δ 1, and use the mputaton from Step 1 Otherwse, set k = 3 and let D ;k = 3 contan the three closest NNdonors for R' a) For each R' b) For = 1,, R', set, fnd the NN-donor and set I I = = for D ;k = 3 that yelds a closer mputed total to c) Repeat step 3b untl no changes can be made Calculate Δ 3 between Y R' Y R' and R';k 3 Y = 4 Stop f Δ 3 =, and use the mputatons from Step 3 Otherwse, stop f Δ3 Δ 2, and use the mputaton from Step 2 Otherwse, set k = 4 and let D ;k = 4 contan the four closest NNdonors for R' 3

4 The followng observatons are worth notng: The JS s desgned to speed up the process Y R may consst of optonal target varables At each teraton of the FT phase the donor s the one among the k nearest neghbors that best satsfes the restrctons The consstency of the NNI remans, as long as the dfference between the condtonal expectatons of a unt and ts k -th nearest neghbor s bounded by the dstance between them through a fnte constant Devaton from the margnal restrctons of the mputed totals can be reduced n two ways Frstly, one may ncrease k to allow for greater combnatoral flexblty Secondly, one may reduce the amount of mputatons acheved by the JS, or even skppng completely over t In constructng Δ, one can put a larger weght on the target varables that are consdered more mportant In return these wll be closer to the restrcton, on the expense of the other varables consdered less mportant The weght s called varable-weght and denoted by W If one wants to ntroduce restrctons on a more detaled level than the mputaton class, ths can be ncorporated n the algorthm Y R can be extended to nclude sub-populatons of the mputaton class, and the Δ can be calculated also wth respect to the sub-populatons A weght between and 1, called margnal weght and denoted by W, may be ncluded, to balance between the mputaton class total and the sub-populaton total If W =, no subpopulaton totals are ncluded n Δ, and f W = 1, only sub-populaton totals are ncluded 3 Emprcal results We have tested RENI on the Norwegan Agrculture Survey 26 Agrculture Survey 26 There are 126 respondng unts whch are the donors There are 4993 non-respondng and out-of-sample unts whch are the recevers The Agrculture Survey 26 has 84 varables of nterest: 42 of them are sze varables and the other 42 are ndcator varables dependng on whether the sze s postve Leasng, nvestment and mantenance were the most mportant topcs ncludng the followng varables, all gven n Norwegan kroner (VAT excluded): Leasng: leasng (pad leasng rent for machnes), leasng1 (value of new leasng contract, contract 1) and leasng2 (value of new leasng contract, contract 2) Investment: _fxed (sum nvested n fxed techncal equpment n outbuldngs), _buld (sum nvested n outbuldngs), mach_new (purchase of other new machnes and tools) and mach_used (purchase of other used machnes and tools) We have also ncluded sale_mach (sales of other machnes and tools) Mantenance: m_buld (mantenance of buldngs and fxed equpment), m_tractor (mantenance of tractors and combne harvesters), m_car (mantenance of cars and trucks) and m_mach (mantenance of machnes and tools) 4

5 Estmated numbers are publshed at the followng classfcaton levels: Farmng system/actvty (FA): 1=gran and olseed crops, 2=other agrcultural crops, 3=garden crops, 4=cattle, mlk producton, 5=cattle, meat producton, 6=cattle, mlk and meat producton, 7=sheep, 8=other roughage anmals, 9=swne and poultry, 1=mxed crop producton, 11=mxed farm anmal producton, 12=crop and farm anmal producton Class of farmland n decares: 1=-4, 2=5-49, 3=5-99, 4=1-199, 5=2-499 and 6=5+ County The mputaton class s FA Ths means that recevers can only get a donor wth the same farmng actvty Due to lmted space we show only results four dfferent mputaton classes: FA-2: 266 recevers and 727 donors The sample fracton s about the same as for the whole populaton, and the proporton of recevers s about average FA-4: 9984 recevers and 3296 donors Ths s the largest mputaton class n ths study FA-1: 384 recevers and 243 donors Ths s the smallest mputaton class n ths study, wth a hgh samplng fracton above 6 percent FA-11: 586 recevers and 34 donors Ths s another one of the smallest mputaton classes The varable used to fnd the nearest neghbor wthn each mputaton class s farmland n decares and an dentty number of the farms, where the latter of whch s non-nformatve The restrctons are the publshed totals of each mputaton class, obtaned by stratfed rato estmaton 31 Performance n satsfyng the restrctons Frst we examne how well the algorthm s able to satsfy the mposed restrctons Ths s mportant n order to realze the effcency goal Fgure 1 below shows the effect of k on Δ, where Δ s calculated as the un-weghted sum of relatve dfferences between the restrcton and the mputed class total for all varables Recall that k =1 means that only the nearest neghbor may be the donor, k =2 means that the donor may be selected among the two nearest neghbors, and so on It s seen that Δ decreases rapdly as k ncreases from 1 to 3, and there s bascally no change n Δ beyond k =6 Fgure 1: How vares wth k, for farmng actvtes (FA) 2, 4 1 and FA-2 FA FA-1 FA-11 k 5

6 We examne next how the JS affects the computaton tme Fve dfferent scenaros of the JS phase where used, varyng from no JS at all to JS wth respect to all varables of nterest The results are summarzed n Table 2-5, where s the same as above We notce that the number of unts that s left for the FT phase depends mostly on whether the JS s employed or not, but less on the exact restrctons used n the JS phase The computaton tme depends on number of teratons and number of recevers at the FT phase Twce as many recevers, or twce as many teratons, takes about twce as long tme --- the relatonshp s lnear as shown n Fgure 2 Snce the number of teratons does not seem to depend on the JS at all, the JS can sgnfcantly reduce the computaton tme for large mputaton classes such as FA-4 Meanwhle, the restrctons n the JS do affect the fnal Generally, t seems that the sze varables more readly put the JS n the rght drecton, and usng all the restrctons at the JS acheves the best fttng of the fnal mputed totals In concluson, the JS performed as ntended, and usng all the restrctons at the JS yelded fnal mputaton totals that were bascally as good as wthout the JS at all The latter opton requres of course much more computaton tme n large mputaton classes Table 2: Shows how and computaton tme depend on restrctons and number of unts n the fne-tune phase JS = ump-start, FT = fne-tune Farmng actvty 2 Seconds Iteratons, k=6, k=1 Restrctons # n JS # n FT No JS ,16 22, Wth JS ,57 15,48 5 sze varables Wth JS ,48 12,7 All sze varables Wth JS 9 4 5,23 12,84 All ndcator varables Wth JS ,9 11,95 All varables Table 3: Shows how and computaton tme depend on restrctons and number of unts n the fne-tune phase JS = ump-start, FT = fne-tune Farmng actvty 4 Seconds Iteratons, k=6, k=1 Restrctons # n JS # n FT No JS ,17 33, Wth JS ,2 11,34 5 sze varables Wth JS 3 4 2,29 8,8 All sze varables Wth JS ,93 1,17 All ndcator varables Wth JS ,97 8,43 All varables Table 4: Shows how and computaton tme depend on restrctons and number of unts n the fne-tune phase JS = ump-start, FT = fne-tune Farmng actvty 1 Seconds Iteratons, k=6, k=1 Restrctons # n JS # n FT No JS 3,5 4 6,11 37, Wth JS 1,5 5 7,45 32,18 5 sze varables Wth JS 1,5 4 6,13 33,6 All sze varables Wth JS 1 4 8,95 3,8 All ndcator varables Wth JS 1,5 4 7,29 34,87 All varables

7 Table 5: Shows how and computaton tme depend on restrctons and number of unts n the fne-tune phase JS = ump-start, FT = fne-tune Farmng actvty 11 Seconds Iteratons, k=6, k=1 Restrctons # n JS # n FT No JS 9,5 5 4,1 29, Wth JS 1,7 4 7,49 3,31 5 sze varables Wth JS 2,5 5 4,74 27,78 All sze varables Wth JS 2, ,82 All ndcator varables Wth JS 2,4 4 2,8 28,8 All varables Fgure 2: Shows how computaton tme (measured n seconds) dvded on number of teratons vares wth number of unts n FT, for farmng actvty (FA) 2, 4, 1 and 11 Number of unts n FT Seconds used dvded on number of teratons, FA = 2 Number of unts n FT Seconds used dvded on number of teratons, FA = Number of unts n FT ,25,5,75 1 Seconds used dvded on number of teratons, FA = 1 Number of unts n FT ,5 1 1,5 2 Seconds used dvded on number of teratons, FA = 11 Table 6 and 7 show how the mputed totals n FA-4 and FA-1, respectvely, satsfy the restrctons on the 12 most mportant varables lsted above All the mputatons were carred out usng full JS and wth k =6 Fve dfferent ways of calculatng have been used For the frst scenaro, all varables are consdered equally mportant (that s, W=1), so that s calculated as the un-weghted sum of the relatve dfferences, and only totals at the mputaton class level are consdered when calculatng (that s, W =) For all other scenaros the weghts for the 12 mportant varables are ncreased to W=1, e these relatve dfferences weght ten tmes as much as the rest n the calculaton of For the thrd scenaro, W =1, whch means that the restrctons for sub-populatons (n ths case, margnals of class and regon) n calculaton s ncluded to a small extent For the fourth and ffth, the W s ncreased to 5 and 9 Notce that n scenaro three fve, the number of mputaton restrctons s ncreased to 84 from 84 n the frst two scenaros The results ndcate that there s a lmt on the number of restrctons that can be satsfed n any gven mputaton class Moreover, the varables wth skewed dstrbutons are more lkely to have problems Here a skewed dstrbuton s ndcated by a small number (or proporton) of donors wth postve 7

8 values It follows that n practce one may need to gve prorty to certan mputaton restrctons, for nstance the varables of the most nterest or the restrctons that have the lowest varances Table 6: Table of results from fve dfferent scenaros of RENI compared to restrcton, for the mportant varables The second column shows number of donors wthn the mputaton class that has a value larger than W = varable weght, W = margnal weght Farmng actvty 4 Varable Donors > Restrcton W=1, W'= W=1, W'= W=1, W'=1 W=1, W'=5 W=1, W'=9 leasng ,,,,, leasng ,,,, -,1 leasng ,1,,,1,11 m_buld ,,,,,2 _fxed ,,,, -,2 _buld ,,,,, m_tractor ,,,,, m_car ,,,,,4 m_mach ,,,,, mach_new ,,,,,1 mach_used ,,,, -,2 sale_mach ,,,,1,6 Table 7: Table of results from fve dfferent scenaros of RENI compared to restrcton, for the mportant varables The second column shows number of donors wthn the mputaton class that has a value larger than W = varable weght, W = margnal weght Farmng actvty 1 Varable Donors > Restrcton W=1, W'= W=1, W'= W=1, W'=1 W=1, W'=5 W=1, W'=9 leasng ,2,,,,23 leasng ,5,6,5,12,27 leasng ,4 -,4 -,4 -,4 -,4 m_buld ,,,, -,4 _fxed ,4 -,5,9,11,14 _buld ,9,,3,7,19 m_tractor ,,,,2,2 m_car ,11,,,, m_mach ,,1,,7,15 mach_new , -,1,,14,17 mach_used ,5,1 -,1,,1 sale_mach ,26,,5,21,25 32 Co-varances of the mputed values It s essental that the mposed restrctons do not damage the consstency property of the NNI for the estmaton of the fnte populaton dstrbutons In partcular, wth regard to the second goal, the mputed data should contan correct co-varances among the survey varables We now compare the varances and co-varances n the statstcal regster obtaned by the RENI and the correspondng estmates by the weghtng method Table 8 shows that the correlatons by and large are very close under the two approaches n FA-1, and the stuaton s very smlar n the other mputaton classes for whch the detals are omtted due to the lmt of space The same concluson holds n terms of the coeffcent of varaton under the two approaches (Table 9) For the co-varaton among ndcator varables, we look at ther cross-tables Table 1 show these are very close under the two approaches n FA-1 Agan, the stuaton s very smlar n the other mputaton classes such that the detals are omtted 8

9 Table 8: Co-varances for RENI (above dagonal) and weghtng (under dagonal) Farmng actvty 1 leasng leasng1 leasng2 m_buld _fxed _buld m_tractor m_car m_mach mach_new mach_used sale_mach leasng,59,24,25,44,37,3,25,37 -,3,12,7 leasng1,59,43,13,47,44,14,1,25 -,1,17,1 leasng2,25,44,12,5,16,11,9,21 -,2 -,2 -,2 m_buld,25,14,14,7,8,35,59,4,13,23,7 _fxed,45,47,5,6,8,5 -,2,14 -,3 -,2 -,2 _buld,37,42,15,8,79,7 -,2,17 -,3, -,2 m_tractor,28,13,12,35,5,8,42,53,17,13,11 m_car,21,9,1,61 -,1 -,1,47,49,11,15,11 m_mach,35,25,24,41,16,18,57,52,28,18,3 mach_new -,1 -,2 -,2,15 -,3 -,3,2,15,32,11,6 mach_used,11,18 -,1,27 -,2,,13,16,17,15,6 sale_mach,9,11 -,2,7 -,1 -,2,12,13,32,56,6 Table 9: Coeffcent of varaton for RENI and weghtng, for all farmng actvtes FA = 2 FA = 4 FA = 1 FA = 11 R-NNI Weghtng R-NNI Weghtng R-NNI Weghtng R-NNI Weghtng leasng,2,19,3,3,97,97,47,47 leasng1,29,29,6,6,191,28,98,95 leasng2,153,149,18,18,836,823,229,226 m_buld,1,1,2,1,84,82,25,25 _fxed,46,46,9,8,173,164,9,94 _buld,3,3,9,9,164,166,79,73 m_tractor,7,7,1,1,43,43,23,23 m_car,12,12,4,4,91,91,48,45 m_mach,1,1,1,1,53,53,28,31 mach_new,25,26,4,3,132,117,88,81 mach_used,36,34,5,5,237,25,66,67 sale_mach,31,31,6,6,225,227,126,134 Table 1: Cross-table for the ndcator varable of the mportant varables for the RENI (above the dagonal) and weghtng method (under the dagonal) Farmng actvty 1 I_leasng I_leasng1 I_leasng2 I_m_bul I fxed I buld I_m_tractor I_m_car I_m_mach I_mach_new I_mach_used I_sale_mach I_leasng I_leasng I_leasng I_m_buld I fxed I buld I_m_tractor I_m_car I_m_mach I_mach_new I_mach_used I_sale_mach

10 4 Summary and dscusson of future works Applcaton of the RENI to the Norwegan Agrculture Survey 26 suggests that the method s capable of fulfllng the trple-goal crteron for the constructon of statstcal regsters The most problematc ssue that we have encountered concerns how well the mputed totals satsfy the mposed restrctons A large number of restrctons can easly be obtaned by method of weghtng However, not all the weghted totals are equally relable One may choose those wth small coeffcents of varaton and/or those that are consdered mportant It may also be helpful to examne systematcally the effectve choce of mputaton restrctons For nstance, gven the number of restrctons to be mposed, whch ones should be chosen n order to obtan n addton the closest agreement of the rest totals? The defnton of the dstance metrc s another ssue one mght look nto For nstance, what s a sutable balance between the absolute and relatve dfferences? Partal mssng/non-response s as common as unt mssng/non-response In general, a donor may not have exactly the same values as the observed ones for the recevers, when observatons are mssng partally It seems natural n such a stuaton that the observed values of the donors should be adusted before they are mputed for the recever Otherwse, the covarance between the varables may be dstorted Ths requres extendng the NNI to partally mssng data More generally, the donor and recever may not exactly match on the x-values, ether Thus, the extenson may have a more general bearng on the methodology 5 References Zhang, L C and Nordbotten, S (28) Predcton and mputaton n ISEE: Tools for more effcent use of combned data sources Proceedngs from UN/ECE Workshop Sesson on Statstcal Edtng n Venna Geneva 28 Chen, J and Shao, J (2) Nearest neghbor mputaton for survey data Journal of Offcal Statstcs, vol 16,

Comparison of Regression Lines

Comparison of Regression Lines STATGRAPHICS Rev. 9/13/2013 Comparson of Regresson Lnes Summary... 1 Data Input... 3 Analyss Summary... 4 Plot of Ftted Model... 6 Condtonal Sums of Squares... 6 Analyss Optons... 7 Forecasts... 8 Confdence

More information

Chapter 13: Multiple Regression

Chapter 13: Multiple Regression Chapter 13: Multple Regresson 13.1 Developng the multple-regresson Model The general model can be descrbed as: It smplfes for two ndependent varables: The sample ft parameter b 0, b 1, and b are used to

More information

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity LINEAR REGRESSION ANALYSIS MODULE IX Lecture - 30 Multcollnearty Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur 2 Remedes for multcollnearty Varous technques have

More information

Negative Binomial Regression

Negative Binomial Regression STATGRAPHICS Rev. 9/16/2013 Negatve Bnomal Regresson Summary... 1 Data Input... 3 Statstcal Model... 3 Analyss Summary... 4 Analyss Optons... 7 Plot of Ftted Model... 8 Observed Versus Predcted... 10 Predctons...

More information

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction ECONOMICS 5* -- NOTE (Summary) ECON 5* -- NOTE The Multple Classcal Lnear Regresson Model (CLRM): Specfcaton and Assumptons. Introducton CLRM stands for the Classcal Lnear Regresson Model. The CLRM s also

More information

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4)

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4) I. Classcal Assumptons Econ7 Appled Econometrcs Topc 3: Classcal Model (Studenmund, Chapter 4) We have defned OLS and studed some algebrac propertes of OLS. In ths topc we wll study statstcal propertes

More information

Parametric fractional imputation for missing data analysis. Jae Kwang Kim Survey Working Group Seminar March 29, 2010

Parametric fractional imputation for missing data analysis. Jae Kwang Kim Survey Working Group Seminar March 29, 2010 Parametrc fractonal mputaton for mssng data analyss Jae Kwang Km Survey Workng Group Semnar March 29, 2010 1 Outlne Introducton Proposed method Fractonal mputaton Approxmaton Varance estmaton Multple mputaton

More information

Kernel Methods and SVMs Extension

Kernel Methods and SVMs Extension Kernel Methods and SVMs Extenson The purpose of ths document s to revew materal covered n Machne Learnng 1 Supervsed Learnng regardng support vector machnes (SVMs). Ths document also provdes a general

More information

A Robust Method for Calculating the Correlation Coefficient

A Robust Method for Calculating the Correlation Coefficient A Robust Method for Calculatng the Correlaton Coeffcent E.B. Nven and C. V. Deutsch Relatonshps between prmary and secondary data are frequently quantfed usng the correlaton coeffcent; however, the tradtonal

More information

x = , so that calculated

x = , so that calculated Stat 4, secton Sngle Factor ANOVA notes by Tm Plachowsk n chapter 8 we conducted hypothess tests n whch we compared a sngle sample s mean or proporton to some hypotheszed value Chapter 9 expanded ths to

More information

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands Content. Inference on Regresson Parameters a. Fndng Mean, s.d and covarance amongst estmates.. Confdence Intervals and Workng Hotellng Bands 3. Cochran s Theorem 4. General Lnear Testng 5. Measures of

More information

2016 Wiley. Study Session 2: Ethical and Professional Standards Application

2016 Wiley. Study Session 2: Ethical and Professional Standards Application 6 Wley Study Sesson : Ethcal and Professonal Standards Applcaton LESSON : CORRECTION ANALYSIS Readng 9: Correlaton and Regresson LOS 9a: Calculate and nterpret a sample covarance and a sample correlaton

More information

/ n ) are compared. The logic is: if the two

/ n ) are compared. The logic is: if the two STAT C141, Sprng 2005 Lecture 13 Two sample tests One sample tests: examples of goodness of ft tests, where we are testng whether our data supports predctons. Two sample tests: called as tests of ndependence

More information

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers Psychology 282 Lecture #24 Outlne Regresson Dagnostcs: Outlers In an earler lecture we studed the statstcal assumptons underlyng the regresson model, ncludng the followng ponts: Formal statement of assumptons.

More information

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analyss of Varance and Desgn of Experment-I MODULE VII LECTURE - 3 ANALYSIS OF COVARIANCE Dr Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur Any scentfc experment s performed

More information

Developing a Data Validation Tool Based on Mendelian Sampling Deviations

Developing a Data Validation Tool Based on Mendelian Sampling Deviations Developng a Data Valdaton Tool Based on Mendelan Samplng Devatons Flppo Bscarn, Stefano Bffan and Fabola Canaves A.N.A.F.I. Italan Holsten Breeders Assocaton Va Bergamo, 292 Cremona, ITALY Abstract A more

More information

Chapter 11: Simple Linear Regression and Correlation

Chapter 11: Simple Linear Regression and Correlation Chapter 11: Smple Lnear Regresson and Correlaton 11-1 Emprcal Models 11-2 Smple Lnear Regresson 11-3 Propertes of the Least Squares Estmators 11-4 Hypothess Test n Smple Lnear Regresson 11-4.1 Use of t-tests

More information

Computation of Higher Order Moments from Two Multinomial Overdispersion Likelihood Models

Computation of Higher Order Moments from Two Multinomial Overdispersion Likelihood Models Computaton of Hgher Order Moments from Two Multnomal Overdsperson Lkelhood Models BY J. T. NEWCOMER, N. K. NEERCHAL Department of Mathematcs and Statstcs, Unversty of Maryland, Baltmore County, Baltmore,

More information

Multivariate Ratio Estimator of the Population Total under Stratified Random Sampling

Multivariate Ratio Estimator of the Population Total under Stratified Random Sampling Open Journal of Statstcs, 0,, 300-304 ttp://dx.do.org/0.436/ojs.0.3036 Publsed Onlne July 0 (ttp://www.scrp.org/journal/ojs) Multvarate Rato Estmator of te Populaton Total under Stratfed Random Samplng

More information

STAT 3008 Applied Regression Analysis

STAT 3008 Applied Regression Analysis STAT 3008 Appled Regresson Analyss Tutoral : Smple Lnear Regresson LAI Chun He Department of Statstcs, The Chnese Unversty of Hong Kong 1 Model Assumpton To quantfy the relatonshp between two factors,

More information

Lecture 3 Stat102, Spring 2007

Lecture 3 Stat102, Spring 2007 Lecture 3 Stat0, Sprng 007 Chapter 3. 3.: Introducton to regresson analyss Lnear regresson as a descrptve technque The least-squares equatons Chapter 3.3 Samplng dstrbuton of b 0, b. Contnued n net lecture

More information

Composite Hypotheses testing

Composite Hypotheses testing Composte ypotheses testng In many hypothess testng problems there are many possble dstrbutons that can occur under each of the hypotheses. The output of the source s a set of parameters (ponts n a parameter

More information

Statistics for Economics & Business

Statistics for Economics & Business Statstcs for Economcs & Busness Smple Lnear Regresson Learnng Objectves In ths chapter, you learn: How to use regresson analyss to predct the value of a dependent varable based on an ndependent varable

More information

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X Statstcs 1: Probablty Theory II 37 3 EPECTATION OF SEVERAL RANDOM VARIABLES As n Probablty Theory I, the nterest n most stuatons les not on the actual dstrbuton of a random vector, but rather on a number

More information

Module 9. Lecture 6. Duality in Assignment Problems

Module 9. Lecture 6. Duality in Assignment Problems Module 9 1 Lecture 6 Dualty n Assgnment Problems In ths lecture we attempt to answer few other mportant questons posed n earler lecture for (AP) and see how some of them can be explaned through the concept

More information

Chapter 12 Analysis of Covariance

Chapter 12 Analysis of Covariance Chapter Analyss of Covarance Any scentfc experment s performed to know somethng that s unknown about a group of treatments and to test certan hypothess about the correspondng treatment effect When varablty

More information

Econ Statistical Properties of the OLS estimator. Sanjaya DeSilva

Econ Statistical Properties of the OLS estimator. Sanjaya DeSilva Econ 39 - Statstcal Propertes of the OLS estmator Sanjaya DeSlva September, 008 1 Overvew Recall that the true regresson model s Y = β 0 + β 1 X + u (1) Applyng the OLS method to a sample of data, we estmate

More information

The Minimum Universal Cost Flow in an Infeasible Flow Network

The Minimum Universal Cost Flow in an Infeasible Flow Network Journal of Scences, Islamc Republc of Iran 17(2): 175-180 (2006) Unversty of Tehran, ISSN 1016-1104 http://jscencesutacr The Mnmum Unversal Cost Flow n an Infeasble Flow Network H Saleh Fathabad * M Bagheran

More information

Statistics MINITAB - Lab 2

Statistics MINITAB - Lab 2 Statstcs 20080 MINITAB - Lab 2 1. Smple Lnear Regresson In smple lnear regresson we attempt to model a lnear relatonshp between two varables wth a straght lne and make statstcal nferences concernng that

More information

Estimation: Part 2. Chapter GREG estimation

Estimation: Part 2. Chapter GREG estimation Chapter 9 Estmaton: Part 2 9. GREG estmaton In Chapter 8, we have seen that the regresson estmator s an effcent estmator when there s a lnear relatonshp between y and x. In ths chapter, we generalzed the

More information

j) = 1 (note sigma notation) ii. Continuous random variable (e.g. Normal distribution) 1. density function: f ( x) 0 and f ( x) dx = 1

j) = 1 (note sigma notation) ii. Continuous random variable (e.g. Normal distribution) 1. density function: f ( x) 0 and f ( x) dx = 1 Random varables Measure of central tendences and varablty (means and varances) Jont densty functons and ndependence Measures of assocaton (covarance and correlaton) Interestng result Condtonal dstrbutons

More information

Chapter 9: Statistical Inference and the Relationship between Two Variables

Chapter 9: Statistical Inference and the Relationship between Two Variables Chapter 9: Statstcal Inference and the Relatonshp between Two Varables Key Words The Regresson Model The Sample Regresson Equaton The Pearson Correlaton Coeffcent Learnng Outcomes After studyng ths chapter,

More information

Difference Equations

Difference Equations Dfference Equatons c Jan Vrbk 1 Bascs Suppose a sequence of numbers, say a 0,a 1,a,a 3,... s defned by a certan general relatonshp between, say, three consecutve values of the sequence, e.g. a + +3a +1

More information

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U)

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U) Econ 413 Exam 13 H ANSWERS Settet er nndelt 9 deloppgaver, A,B,C, som alle anbefales å telle lkt for å gøre det ltt lettere å stå. Svar er gtt . Unfortunately, there s a prntng error n the hnt of

More information

Economics 130. Lecture 4 Simple Linear Regression Continued

Economics 130. Lecture 4 Simple Linear Regression Continued Economcs 130 Lecture 4 Contnued Readngs for Week 4 Text, Chapter and 3. We contnue wth addressng our second ssue + add n how we evaluate these relatonshps: Where do we get data to do ths analyss? How do

More information

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation Statstcs for Managers Usng Mcrosoft Excel/SPSS Chapter 13 The Smple Lnear Regresson Model and Correlaton 1999 Prentce-Hall, Inc. Chap. 13-1 Chapter Topcs Types of Regresson Models Determnng the Smple Lnear

More information

Sampling Theory MODULE VII LECTURE - 23 VARYING PROBABILITY SAMPLING

Sampling Theory MODULE VII LECTURE - 23 VARYING PROBABILITY SAMPLING Samplng heory MODULE VII LECURE - 3 VARYIG PROBABILIY SAMPLIG DR. SHALABH DEPARME OF MAHEMAICS AD SAISICS IDIA ISIUE OF ECHOLOGY KAPUR he smple random samplng scheme provdes a random sample where every

More information

Chapter 5. Solution of System of Linear Equations. Module No. 6. Solution of Inconsistent and Ill Conditioned Systems

Chapter 5. Solution of System of Linear Equations. Module No. 6. Solution of Inconsistent and Ill Conditioned Systems Numercal Analyss by Dr. Anta Pal Assstant Professor Department of Mathematcs Natonal Insttute of Technology Durgapur Durgapur-713209 emal: anta.bue@gmal.com 1 . Chapter 5 Soluton of System of Lnear Equatons

More information

SIMPLE LINEAR REGRESSION

SIMPLE LINEAR REGRESSION Smple Lnear Regresson and Correlaton Introducton Prevousl, our attenton has been focused on one varable whch we desgnated b x. Frequentl, t s desrable to learn somethng about the relatonshp between two

More information

Chapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise.

Chapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise. Chapter - The Smple Lnear Regresson Model The lnear regresson equaton s: where y + = β + β e for =,..., y and are observable varables e s a random error How can an estmaton rule be constructed for the

More information

Online Appendix to: Axiomatization and measurement of Quasi-hyperbolic Discounting

Online Appendix to: Axiomatization and measurement of Quasi-hyperbolic Discounting Onlne Appendx to: Axomatzaton and measurement of Quas-hyperbolc Dscountng José Lus Montel Olea Tomasz Strzaleck 1 Sample Selecton As dscussed before our ntal sample conssts of two groups of subjects. Group

More information

Lecture 3: Probability Distributions

Lecture 3: Probability Distributions Lecture 3: Probablty Dstrbutons Random Varables Let us begn by defnng a sample space as a set of outcomes from an experment. We denote ths by S. A random varable s a functon whch maps outcomes nto the

More information

Estimation of the Probability of Success Based on Communication History

Estimation of the Probability of Success Based on Communication History Workng paper, presented at 7-th Valenca Meetng n Bayesan Statstcs, June 22 Estmaton of the Probablty of Success Based on Communcaton Hstory Arkady E Shemyakn Unversty of St Thomas, Sant Paul, Mnnesota,

More information

Factor models with many assets: strong factors, weak factors, and the two-pass procedure

Factor models with many assets: strong factors, weak factors, and the two-pass procedure Factor models wth many assets: strong factors, weak factors, and the two-pass procedure Stanslav Anatolyev 1 Anna Mkusheva 2 1 CERGE-EI and NES 2 MIT December 2017 Stanslav Anatolyev and Anna Mkusheva

More information

Lecture 4: November 17, Part 1 Single Buffer Management

Lecture 4: November 17, Part 1 Single Buffer Management Lecturer: Ad Rosén Algorthms for the anagement of Networs Fall 2003-2004 Lecture 4: November 7, 2003 Scrbe: Guy Grebla Part Sngle Buffer anagement In the prevous lecture we taled about the Combned Input

More information

This column is a continuation of our previous column

This column is a continuation of our previous column Comparson of Goodness of Ft Statstcs for Lnear Regresson, Part II The authors contnue ther dscusson of the correlaton coeffcent n developng a calbraton for quanttatve analyss. Jerome Workman Jr. and Howard

More information

STAT 511 FINAL EXAM NAME Spring 2001

STAT 511 FINAL EXAM NAME Spring 2001 STAT 5 FINAL EXAM NAME Sprng Instructons: Ths s a closed book exam. No notes or books are allowed. ou may use a calculator but you are not allowed to store notes or formulas n the calculator. Please wrte

More information

Exponential Type Product Estimator for Finite Population Mean with Information on Auxiliary Attribute

Exponential Type Product Estimator for Finite Population Mean with Information on Auxiliary Attribute Avalable at http://pvamu.edu/aam Appl. Appl. Math. ISSN: 193-9466 Vol. 10, Issue 1 (June 015), pp. 106-113 Applcatons and Appled Mathematcs: An Internatonal Journal (AAM) Exponental Tpe Product Estmator

More information

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA 4 Analyss of Varance (ANOVA) 5 ANOVA 51 Introducton ANOVA ANOVA s a way to estmate and test the means of multple populatons We wll start wth one-way ANOVA If the populatons ncluded n the study are selected

More information

Homework Assignment 3 Due in class, Thursday October 15

Homework Assignment 3 Due in class, Thursday October 15 Homework Assgnment 3 Due n class, Thursday October 15 SDS 383C Statstcal Modelng I 1 Rdge regresson and Lasso 1. Get the Prostrate cancer data from http://statweb.stanford.edu/~tbs/elemstatlearn/ datasets/prostate.data.

More information

ECONOMETRICS - FINAL EXAM, 3rd YEAR (GECO & GADE)

ECONOMETRICS - FINAL EXAM, 3rd YEAR (GECO & GADE) ECONOMETRICS - FINAL EXAM, 3rd YEAR (GECO & GADE) June 7, 016 15:30 Frst famly name: Name: DNI/ID: Moble: Second famly Name: GECO/GADE: Instructor: E-mal: Queston 1 A B C Blank Queston A B C Blank Queston

More information

ISSN: ISO 9001:2008 Certified International Journal of Engineering and Innovative Technology (IJEIT) Volume 3, Issue 1, July 2013

ISSN: ISO 9001:2008 Certified International Journal of Engineering and Innovative Technology (IJEIT) Volume 3, Issue 1, July 2013 ISSN: 2277-375 Constructon of Trend Free Run Orders for Orthogonal rrays Usng Codes bstract: Sometmes when the expermental runs are carred out n a tme order sequence, the response can depend on the run

More information

Chapter 3 Describing Data Using Numerical Measures

Chapter 3 Describing Data Using Numerical Measures Chapter 3 Student Lecture Notes 3-1 Chapter 3 Descrbng Data Usng Numercal Measures Fall 2006 Fundamentals of Busness Statstcs 1 Chapter Goals To establsh the usefulness of summary measures of data. The

More information

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore Sesson Outlne Introducton to classfcaton problems and dscrete choce models. Introducton to Logstcs Regresson. Logstc functon and Logt functon. Maxmum Lkelhood Estmator (MLE) for estmaton of LR parameters.

More information

Correlation and Regression. Correlation 9.1. Correlation. Chapter 9

Correlation and Regression. Correlation 9.1. Correlation. Chapter 9 Chapter 9 Correlaton and Regresson 9. Correlaton Correlaton A correlaton s a relatonshp between two varables. The data can be represented b the ordered pars (, ) where s the ndependent (or eplanator) varable,

More information

Statistical Inference. 2.3 Summary Statistics Measures of Center and Spread. parameters ( population characteristics )

Statistical Inference. 2.3 Summary Statistics Measures of Center and Spread. parameters ( population characteristics ) Ismor Fscher, 8//008 Stat 54 / -8.3 Summary Statstcs Measures of Center and Spread Dstrbuton of dscrete contnuous POPULATION Random Varable, numercal True center =??? True spread =???? parameters ( populaton

More information

USE OF DOUBLE SAMPLING SCHEME IN ESTIMATING THE MEAN OF STRATIFIED POPULATION UNDER NON-RESPONSE

USE OF DOUBLE SAMPLING SCHEME IN ESTIMATING THE MEAN OF STRATIFIED POPULATION UNDER NON-RESPONSE STATISTICA, anno LXXV, n. 4, 015 USE OF DOUBLE SAMPLING SCHEME IN ESTIMATING THE MEAN OF STRATIFIED POPULATION UNDER NON-RESPONSE Manoj K. Chaudhary 1 Department of Statstcs, Banaras Hndu Unversty, Varanas,

More information

Simulated Power of the Discrete Cramér-von Mises Goodness-of-Fit Tests

Simulated Power of the Discrete Cramér-von Mises Goodness-of-Fit Tests Smulated of the Cramér-von Mses Goodness-of-Ft Tests Steele, M., Chaselng, J. and 3 Hurst, C. School of Mathematcal and Physcal Scences, James Cook Unversty, Australan School of Envronmental Studes, Grffth

More information

NUMERICAL DIFFERENTIATION

NUMERICAL DIFFERENTIATION NUMERICAL DIFFERENTIATION 1 Introducton Dfferentaton s a method to compute the rate at whch a dependent output y changes wth respect to the change n the ndependent nput x. Ths rate of change s called the

More information

Population element: 1 2 N. 1.1 Sampling with Replacement: Hansen-Hurwitz Estimator(HH)

Population element: 1 2 N. 1.1 Sampling with Replacement: Hansen-Hurwitz Estimator(HH) Chapter 1 Samplng wth Unequal Probabltes Notaton: Populaton element: 1 2 N varable of nterest Y : y1 y2 y N Let s be a sample of elements drawn by a gven samplng method. In other words, s s a subset of

More information

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analyss of Varance and Desgn of Experment-I MODULE VIII LECTURE - 34 ANALYSIS OF VARIANCE IN RANDOM-EFFECTS MODEL AND MIXED-EFFECTS EFFECTS MODEL Dr Shalabh Department of Mathematcs and Statstcs Indan

More information

Linear Regression Analysis: Terminology and Notation

Linear Regression Analysis: Terminology and Notation ECON 35* -- Secton : Basc Concepts of Regresson Analyss (Page ) Lnear Regresson Analyss: Termnology and Notaton Consder the generc verson of the smple (two-varable) lnear regresson model. It s represented

More information

Lecture 12: Classification

Lecture 12: Classification Lecture : Classfcaton g Dscrmnant functons g The optmal Bayes classfer g Quadratc classfers g Eucldean and Mahalanobs metrcs g K Nearest Neghbor Classfers Intellgent Sensor Systems Rcardo Guterrez-Osuna

More information

Lecture 12: Discrete Laplacian

Lecture 12: Discrete Laplacian Lecture 12: Dscrete Laplacan Scrbe: Tanye Lu Our goal s to come up wth a dscrete verson of Laplacan operator for trangulated surfaces, so that we can use t n practce to solve related problems We are mostly

More information

Supplementary Notes for Chapter 9 Mixture Thermodynamics

Supplementary Notes for Chapter 9 Mixture Thermodynamics Supplementary Notes for Chapter 9 Mxture Thermodynamcs Key ponts Nne major topcs of Chapter 9 are revewed below: 1. Notaton and operatonal equatons for mxtures 2. PVTN EOSs for mxtures 3. General effects

More information

A LINEAR PROGRAM TO COMPARE MULTIPLE GROSS CREDIT LOSS FORECASTS. Dr. Derald E. Wentzien, Wesley College, (302) ,

A LINEAR PROGRAM TO COMPARE MULTIPLE GROSS CREDIT LOSS FORECASTS. Dr. Derald E. Wentzien, Wesley College, (302) , A LINEAR PROGRAM TO COMPARE MULTIPLE GROSS CREDIT LOSS FORECASTS Dr. Derald E. Wentzen, Wesley College, (302) 736-2574, wentzde@wesley.edu ABSTRACT A lnear programmng model s developed and used to compare

More information

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analyss of Varance and Desgn of Exerments-I MODULE III LECTURE - 2 EXPERIMENTAL DESIGN MODELS Dr. Shalabh Deartment of Mathematcs and Statstcs Indan Insttute of Technology Kanur 2 We consder the models

More information

Number of cases Number of factors Number of covariates Number of levels of factor i. Value of the dependent variable for case k

Number of cases Number of factors Number of covariates Number of levels of factor i. Value of the dependent variable for case k ANOVA Model and Matrx Computatons Notaton The followng notaton s used throughout ths chapter unless otherwse stated: N F CN Y Z j w W Number of cases Number of factors Number of covarates Number of levels

More information

Cokriging Partial Grades - Application to Block Modeling of Copper Deposits

Cokriging Partial Grades - Application to Block Modeling of Copper Deposits Cokrgng Partal Grades - Applcaton to Block Modelng of Copper Deposts Serge Séguret 1, Julo Benscell 2 and Pablo Carrasco 2 Abstract Ths work concerns mneral deposts made of geologcal bodes such as breccas

More information

BIO Lab 2: TWO-LEVEL NORMAL MODELS with school children popularity data

BIO Lab 2: TWO-LEVEL NORMAL MODELS with school children popularity data Lab : TWO-LEVEL NORMAL MODELS wth school chldren popularty data Purpose: Introduce basc two-level models for normally dstrbuted responses usng STATA. In partcular, we dscuss Random ntercept models wthout

More information

Parametric fractional imputation for missing data analysis

Parametric fractional imputation for missing data analysis Secton on Survey Research Methods JSM 2008 Parametrc fractonal mputaton for mssng data analyss Jae Kwang Km Wayne Fuller Abstract Under a parametrc model for mssng data, the EM algorthm s a popular tool

More information

Efficient nonresponse weighting adjustment using estimated response probability

Efficient nonresponse weighting adjustment using estimated response probability Effcent nonresponse weghtng adjustment usng estmated response probablty Jae Kwang Km Department of Appled Statstcs, Yonse Unversty, Seoul, 120-749, KOREA Key Words: Regresson estmator, Propensty score,

More information

STATISTICS QUESTIONS. Step by Step Solutions.

STATISTICS QUESTIONS. Step by Step Solutions. STATISTICS QUESTIONS Step by Step Solutons www.mathcracker.com 9//016 Problem 1: A researcher s nterested n the effects of famly sze on delnquency for a group of offenders and examnes famles wth one to

More information

Chapter 5 Multilevel Models

Chapter 5 Multilevel Models Chapter 5 Multlevel Models 5.1 Cross-sectonal multlevel models 5.1.1 Two-level models 5.1.2 Multple level models 5.1.3 Multple level modelng n other felds 5.2 Longtudnal multlevel models 5.2.1 Two-level

More information

Laboratory 3: Method of Least Squares

Laboratory 3: Method of Least Squares Laboratory 3: Method of Least Squares Introducton Consder the graph of expermental data n Fgure 1. In ths experment x s the ndependent varable and y the dependent varable. Clearly they are correlated wth

More information

Report on Image warping

Report on Image warping Report on Image warpng Xuan Ne, Dec. 20, 2004 Ths document summarzed the algorthms of our mage warpng soluton for further study, and there s a detaled descrpton about the mplementaton of these algorthms.

More information

e i is a random error

e i is a random error Chapter - The Smple Lnear Regresson Model The lnear regresson equaton s: where + β + β e for,..., and are observable varables e s a random error How can an estmaton rule be constructed for the unknown

More information

The Geometry of Logit and Probit

The Geometry of Logit and Probit The Geometry of Logt and Probt Ths short note s meant as a supplement to Chapters and 3 of Spatal Models of Parlamentary Votng and the notaton and reference to fgures n the text below s to those two chapters.

More information

Lecture 10 Support Vector Machines II

Lecture 10 Support Vector Machines II Lecture 10 Support Vector Machnes II 22 February 2016 Taylor B. Arnold Yale Statstcs STAT 365/665 1/28 Notes: Problem 3 s posted and due ths upcomng Frday There was an early bug n the fake-test data; fxed

More information

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6 Department of Quanttatve Methods & Informaton Systems Tme Seres and Ther Components QMIS 30 Chapter 6 Fall 00 Dr. Mohammad Zanal These sldes were modfed from ther orgnal source for educatonal purpose only.

More information

Module 3 LOSSY IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur

Module 3 LOSSY IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur Module 3 LOSSY IMAGE COMPRESSION SYSTEMS Verson ECE IIT, Kharagpur Lesson 6 Theory of Quantzaton Verson ECE IIT, Kharagpur Instructonal Objectves At the end of ths lesson, the students should be able to:

More information

Introduction to Regression

Introduction to Regression Introducton to Regresson Dr Tom Ilvento Department of Food and Resource Economcs Overvew The last part of the course wll focus on Regresson Analyss Ths s one of the more powerful statstcal technques Provdes

More information

Lecture Notes on Linear Regression

Lecture Notes on Linear Regression Lecture Notes on Lnear Regresson Feng L fl@sdueducn Shandong Unversty, Chna Lnear Regresson Problem In regresson problem, we am at predct a contnuous target value gven an nput feature vector We assume

More information

Bayesian predictive Configural Frequency Analysis

Bayesian predictive Configural Frequency Analysis Psychologcal Test and Assessment Modelng, Volume 54, 2012 (3), 285-292 Bayesan predctve Confgural Frequency Analyss Eduardo Gutérrez-Peña 1 Abstract Confgural Frequency Analyss s a method for cell-wse

More information

On Outlier Robust Small Area Mean Estimate Based on Prediction of Empirical Distribution Function

On Outlier Robust Small Area Mean Estimate Based on Prediction of Empirical Distribution Function On Outler Robust Small Area Mean Estmate Based on Predcton of Emprcal Dstrbuton Functon Payam Mokhtaran Natonal Insttute of Appled Statstcs Research Australa Unversty of Wollongong Small Area Estmaton

More information

Discussion of Extensions of the Gauss-Markov Theorem to the Case of Stochastic Regression Coefficients Ed Stanek

Discussion of Extensions of the Gauss-Markov Theorem to the Case of Stochastic Regression Coefficients Ed Stanek Dscusson of Extensons of the Gauss-arkov Theorem to the Case of Stochastc Regresson Coeffcents Ed Stanek Introducton Pfeffermann (984 dscusses extensons to the Gauss-arkov Theorem n settngs where regresson

More information

Lecture 6: Introduction to Linear Regression

Lecture 6: Introduction to Linear Regression Lecture 6: Introducton to Lnear Regresson An Manchakul amancha@jhsph.edu 24 Aprl 27 Lnear regresson: man dea Lnear regresson can be used to study an outcome as a lnear functon of a predctor Example: 6

More information

Chapter 15 - Multiple Regression

Chapter 15 - Multiple Regression Chapter - Multple Regresson Chapter - Multple Regresson Multple Regresson Model The equaton that descrbes how the dependent varable y s related to the ndependent varables x, x,... x p and an error term

More information

Chapter 8 Indicator Variables

Chapter 8 Indicator Variables Chapter 8 Indcator Varables In general, e explanatory varables n any regresson analyss are assumed to be quanttatve n nature. For example, e varables lke temperature, dstance, age etc. are quanttatve n

More information

Statistical Evaluation of WATFLOOD

Statistical Evaluation of WATFLOOD tatstcal Evaluaton of WATFLD By: Angela MacLean, Dept. of Cvl & Envronmental Engneerng, Unversty of Waterloo, n. ctober, 005 The statstcs program assocated wth WATFLD uses spl.csv fle that s produced wth

More information

Markov Chain Monte Carlo Lecture 6

Markov Chain Monte Carlo Lecture 6 where (x 1,..., x N ) X N, N s called the populaton sze, f(x) f (x) for at least one {1, 2,..., N}, and those dfferent from f(x) are called the tral dstrbutons n terms of mportance samplng. Dfferent ways

More information

Problem Set 9 Solutions

Problem Set 9 Solutions Desgn and Analyss of Algorthms May 4, 2015 Massachusetts Insttute of Technology 6.046J/18.410J Profs. Erk Demane, Srn Devadas, and Nancy Lynch Problem Set 9 Solutons Problem Set 9 Solutons Ths problem

More information

since [1-( 0+ 1x1i+ 2x2 i)] [ 0+ 1x1i+ assumed to be a reasonable approximation

since [1-( 0+ 1x1i+ 2x2 i)] [ 0+ 1x1i+ assumed to be a reasonable approximation Econ 388 R. Butler 204 revsons Lecture 4 Dummy Dependent Varables I. Lnear Probablty Model: the Regresson model wth a dummy varables as the dependent varable assumpton, mplcaton regular multple regresson

More information

Small Area Estimation for Business Surveys

Small Area Estimation for Business Surveys ASA Secton on Survey Research Methods Small Area Estmaton for Busness Surveys Hukum Chandra Southampton Statstcal Scences Research Insttute, Unversty of Southampton Hghfeld, Southampton-SO17 1BJ, U.K.

More information

Uncertainty as the Overlap of Alternate Conditional Distributions

Uncertainty as the Overlap of Alternate Conditional Distributions Uncertanty as the Overlap of Alternate Condtonal Dstrbutons Olena Babak and Clayton V. Deutsch Centre for Computatonal Geostatstcs Department of Cvl & Envronmental Engneerng Unversty of Alberta An mportant

More information

DETERMINATION OF UNCERTAINTY ASSOCIATED WITH QUANTIZATION ERRORS USING THE BAYESIAN APPROACH

DETERMINATION OF UNCERTAINTY ASSOCIATED WITH QUANTIZATION ERRORS USING THE BAYESIAN APPROACH Proceedngs, XVII IMEKO World Congress, June 7, 3, Dubrovn, Croata Proceedngs, XVII IMEKO World Congress, June 7, 3, Dubrovn, Croata TC XVII IMEKO World Congress Metrology n the 3rd Mllennum June 7, 3,

More information

Basically, if you have a dummy dependent variable you will be estimating a probability.

Basically, if you have a dummy dependent variable you will be estimating a probability. ECON 497: Lecture Notes 13 Page 1 of 1 Metropoltan State Unversty ECON 497: Research and Forecastng Lecture Notes 13 Dummy Dependent Varable Technques Studenmund Chapter 13 Bascally, f you have a dummy

More information

Topic 23 - Randomized Complete Block Designs (RCBD)

Topic 23 - Randomized Complete Block Designs (RCBD) Topc 3 ANOVA (III) 3-1 Topc 3 - Randomzed Complete Block Desgns (RCBD) Defn: A Randomzed Complete Block Desgn s a varant of the completely randomzed desgn (CRD) that we recently learned. In ths desgn,

More information

Primer on High-Order Moment Estimators

Primer on High-Order Moment Estimators Prmer on Hgh-Order Moment Estmators Ton M. Whted July 2007 The Errors-n-Varables Model We wll start wth the classcal EIV for one msmeasured regressor. The general case s n Erckson and Whted Econometrc

More information

Durban Watson for Testing the Lack-of-Fit of Polynomial Regression Models without Replications

Durban Watson for Testing the Lack-of-Fit of Polynomial Regression Models without Replications Durban Watson for Testng the Lack-of-Ft of Polynomal Regresson Models wthout Replcatons Ruba A. Alyaf, Maha A. Omar, Abdullah A. Al-Shha ralyaf@ksu.edu.sa, maomar@ksu.edu.sa, aalshha@ksu.edu.sa Department

More information