This module is part of the. Memobust Handbook. on Methodology of Modern Business Statistics

Size: px
Start display at page:

Download "This module is part of the. Memobust Handbook. on Methodology of Modern Business Statistics"

Transcription

1 Ths module s part of the Memobust Handbook on Methodology of Modern Busness Statstcs 26 March 2014

2 Method: Generalsed Regresson Estmator Contents General secton Summary General descrpton of the method Propertes of GREG estmator Partcular cases and extensons Preparatory phase Examples not tool specfc The Small-Medum Enterprses Survey and the current samplng strategy Examples tool specfc Glossary References Specfc secton Interconnectons wth other modules Admnstratve secton... 16

3 General secton 1. Summary The basc estmator (see Weghtng and Estmaton Man Module ) of a target parameter expands the observed values on the sample unts usng drect weghts, whch are the nverse of the ncluson probabltes. Generalsed regresson estmator s a model asssted estmator desgned to mprove the accuracy (see Qualty Aspects Qualty of Statstcs ) of the estmates by means of auxlary nformaton. GREG estmator guarantees the coherence between samplng estmates and known totals of the auxlary varables, as well. In fact, t s a specal case of a calbraton estmator (see Weghtng and Estmaton Calbraton ) when the Eucldean dstance s used. 2. General descrpton of the method In the estmaton phase, the sample values are weghted to represent also unobserved unts. When auxlary nformaton s avalable at unt or doman level, a GREG estmator can be used n order to reduce the varance of the estmates by usng the relatonshp between the target varable and the auxlary varables. At the same tme the resultng weghts allow calbraton to the known totals. Let y, x be the target varable and the vector of auxlary varables, respectvely. The GREG estmator (Cassel, Särndal, and Wretman, 1976) can be expressed as a sum of the Horvtz Thompson estmator (HT) (see Weghtng and Estmaton Man Module ) and a weghted dfference between known totals and ther HT estmator: where tˆ GREG = d y + βˆ t X d x, (1) s s d, =1, n, s the drect weght equal to the nverse of the ncluson probablty, t X s the vector of known populaton totals; moreover βˆ s an estmate of the vector of regresson coeffcents of y on x, gven by wth 1 ˆβ = d q x x dq x y s, s q scale factors chosen properly, e.g., to account for heteroscedastcty. For example, when the varablty of the target y depends on enterprses sze, z, the q can be chosen as In general, z may also be one of the covarates n the regresson model. Alternatvely, the GREG estmator can be formulated n terms of predcted values for the target varables calculated on the bass of a lnear relatonshp between y and x. More specfcally, these predcted values are used n the estmaton together wth the resduals from the model, evaluated for sample unts,.e., the GREG estmator can be wrtten as ( y yˆ ) N N tˆ = y + d e = y + d GREG = 1 ˆ s = 1 ˆ s, (2) z. 3

4 where y ˆ = x βˆ s the predcted value accordng to the lnear model that relates y and x, e s the evaluated resdual for a unt n the sample. Fnally, the GREG estmator can be convenently formulated as a weghted sum of sample values: t ˆ = d g y = w y GREG s, s s, s (3) where the correcton factor g s of the drect weghts s gven by g, s 1 = 1+ t d x X d q x x q x j s j j j s j j j j (4) whch does not depend on the target varable y. 2.1 Propertes of GREG estmator A fundamental property of the GREG estmator s that t s nearly desgn unbased (Särndal et al., 1992). The lnear GREG estmator s motvated va the lnear assstng model (Särndal et al., 1992) E( y ) = β x, V 2 ( y ) σ =. (5) However, the knowledge of all x values s not necessary to evaluate lnear GREG, because the knowledge of totals suffces to calculate the new weghts, w, (see secton 12 below). The regresson coeffcent n (5) can be estmated at natonal level or for a dsaggregated level, e.g., NUTS2. Ths level s referred as model group { U(p) }. In case of sub-natonal model group, the known totals need to be avalable at ths level. An mportant feature of the lnear GREG s that the weghtng system does not depend on the target varable but only on x values, as (4) shows. The GREG estmator s calbrated to the known totals of the assstng model, that s w = s, s x t X. In fact GREG s a partcular case of a calbraton estmator (see Weghtng and Estmaton Calbraton ) when usng the Eucldean dstance. Moreover, all the calbraton estmators can be asymptotcally approxmated by the GREG (Devlle and Särndal, 1992). Another relevant property of GREG estmator s that the evaluaton of ts varance (see Qualty Aspects Qualty of Statstcs ) s based on the varance of the resduals ( y ˆ ) s (Särndal et al., 1992). As a consequence of ths, the hgher the fttng of the lnear workng model the lower the varance of GREG estmator and therefore the hgher ts accuracy. On the contrary, f the model underlyng the GREG s not approprate for the target varable, a too large varaton of weghts may ncrease the varance wth respect to the HT estmator. In fact, varablty of weghts unrelated wth the target varable can ncrease the varance of the estmates, an approxmaton of ths mpact s gven (Ksh, 1995) by y 4

5 2 1+ CV, w s, (6) where CV stands for the coeffcent of varaton of fnal weghts. A possble drawback of the GREG estmator s that t can produce negatve weghts (cf. secton 10 below); on the contrary, n the framework of the calbraton estmator, t s possble to obtan weghts always postve usng dfferent dstance functons (see Weghtng and Estmaton Calbraton ). 2.2 Partcular cases and extensons The rato estmator s a specal case of GREG asssted by a model wth only one covarate, obtanable f the varance of the target varable s assumed to be a lnear functon of the auxlary varable 2 V ( y ) = σ x (Devlle, Särndal, 1992). Extended GREG estmators are defned replacng the assstng model (4) wth more general (nonlnear, generalsed, or mxed) models. The non-lnear GREG estmators (e.g., Lehtonen and Vejanen, 1998) requre a separate model fttng for every target varable; hence, an mportant drawback of ths knd of model asssted estmators s that they do not produce a unque system of weghts unformly applcable. On the other sde, the nonlnear GREG may gve a consderably reducton n varance, as a result of the more refned models that can be consdered when there s complete unt level auxlary nformaton. 3. Preparatory phase 4. Examples not tool specfc 4.1 The Small-Medum Enterprses Survey and the current samplng strategy Small and Medum-szed Enterprses (SME) sample survey s carred out annually by sendng a postal questonnare wth the purpose of nvestgatng proft-and-loss account of enterprses wth less than 100 persons employed, as requested by SBS EU Councl Regulaton n. 58/97 (Eurostat, 2003) and n. 295/2008. The unts nvolved n the survey have also the possblty to fll n an electronc questonnare and transmt t to Istat va web. The survey covers enterprses belongng to the followng economc actvtes accordng to the Nace Rev.1.1 classfcaton: - Sectons C, D, E, F, G, H, I, J (dvson 67), K; - Sectons M, N and O for the enterprses operatng n the prvate sector. Man varables of nterest asked to the SME sampled enterprses are Turnover, Value added at factor cost, Employment, Total purchases of goods and servces, Personnel costs, Wages and salares, Producton value. They are also asked to specfy ther economc actvty sector and geographcal locaton n order to test the correctness of the frame wth respect to these nformaton. Totals of varables of nterest are estmated wth reference to three typologes of domans of study. 5

6 4.1.1 Frame of nterest For the reference year 2007, the populaton of nterest for SME sample surveys s about 4.5 mllons actve enterprses. The frame for SME survey s represented by the Italan Statstcal Busness Regster (SBR). It results from the logcal and physcal combnaton of data from both statstcal sources (surveys) and admnstratve sources (Tax Regster, Regster of Enterprses and Local Unts, Socal Securty Regster, Work Accdent Insurance Regster, Regster of the Electrc Power Board) treated wth statstcal methodologes. Varables n the regster are both quanttatve (Average number of employees n the year t-1, Number of employees n date 31/12/year t-1, Independent employment n date 31/12/year t-1, Number of enterprses) and qualtatve (Geographcal locaton, Economc actvty accordng to Nace Rev dgt). From the Fscal Regster s also provded the VAT Turnover, whch represents a good proxy of the varable Turnover asked to the sampled enterprses by questonnare Samplng desgn (allocaton and doman of estmates) SME s a mult-purpose and mult-doman survey and t produces statstcs on several varables (manly economc and employment varables) for three types of domans, each defnng a partton of the populaton of nterest (see Tables 1 and 2). Table 1: Types of SME Survey domans Type of doman Number of Code Descrpton Domans DOM1 Class of economc actvty (4-dgt Nace Rev.1*) 461 DOM2 Group of economc actvty (3-dgt Nace Rev.1) by sze-class of employment DOM3 Dvson of economc actvty (2-dgt Nace Rev.1) by regon 984 *Nace Rev.1 = Statstcal Classfcaton of Economc Actvtes n the European Communtes Table 2: Defnton of Sze-classes of employment for doman DOM3 of SME Survey Nace Rev dgt level Sze-classes of employment 10-45; 1-9; 10-19; 20-49; 50-99; 50-52; 1; 2-9; 10-19; 20-49; 50-99; 55;60-64;67;70-74; 1; 2-9; 10-19; 20-49; 50-99; 80; 85; 90; 92; 93; 1-9; 10-19; 20-49; 50-99; Samplng desgn of the SME survey s a one stage stratfed random samplng, wth the strata defned by the combnaton of the modalty of the characters Nace Rev.1.1 economc actvty, sze class and admnstratve regon. A fxed number of enterprses are selected n each stratum wthout replacement and wth equal probabltes. The number of unts to be selected n each stratum s defned as a soluton of a lnear nteger problem (Bethel, 1989). In partcular, the mnmum sample sze s determned n order to ensure that the varance of samplng estmates of the varable of nterest n each doman does not exceed a gven threshold, n terms of coeffcent of varaton. The varables of nterest used for sample allocaton are Number of persons employed, Turnover, Value added at factor cost, whose mean and varance are estmated n each stratum by data from the frame and data collected from the prevous survey, respectvely. 6

7 About 103,000 of small and medum-szed enterprses (unts) are ncluded n the sample. The samplng unts are drawn by applyng JALES procedure (Ohlsson, 1995) n order to take under control the total statstcal burden, by achevng a negatve co-ordnaton among samples drawn from the same selecton regster The weghtng procedure After calculatng the total non-response correctng factors as the rato of the number of sampled unts and the number of respondng unts belongng to approprate weghtng adjustment cells, the weght of every sngle enterprse s further modfed n order to match known or alternatvely estmated populaton totals called benchmarks. In partcular, known totals of selected auxlary varables on the Busness Regster (Average number of employees n the year t-1, Number of enterprses) are currently used to correct for sample-survey nonresponse or for coverage error resultng from frame undercoverage or unt duplcaton. Practcal aspects n the applcaton of the weghtng procedure n the contest of SME survey The evaluaton of fnal weghts for SME survey s usually carred out usng the selected auxlary varables, for the three types of domans descrbed n Table 1. The optmsaton problem underlyng the GREG estmaton process can be therefore formulated n the followng way: the model group { U(p) } s defned as the dvson of economc actvty (2-dgt Nace Rev.1.1) of the frame (the updated Busness Regster); the domans of nterest are represented by the three typologes of parttons (descrbed n Tables 1 and 2); the auxlary varables are dentfed by x 1 = Number of enterprses x 2 = Average number of employees n the year t-1; for each enterprse, the vector x of the auxlary varables has been defned as follows: ' ( x', x' ) x = 1 2, combnaton of two vectors 1 ' 1 { λ ( j )} x =, d { αλ (j )} x ' wth d=1,,3; j=1,.., J d, 2 = d where, accordng to the updated Busness Regster nformaton: x ' and x' 2 whose form s, respectvely: - λ j ) s a dchotomous varable whose value s equal to 1 f the unt belongs to doman j d and ( d equal to 0 otherwse; -α s the number of employed of enterprse ; for each model group { U(p) },.e., for each dvson of economc actvty (2-dgt Nace Rev.1.1), the known populaton totals calculated on the updated frame, are expressed by: X U(p) ( λ (j λ (j α λ (j α λ (j ) = U(p) x ' = U(p) ),..., U(p) 3 ), U(p) 1 ),..., U(p) 3 ). 1 7

8 An example In Table 3A the NACE code of every sngle doman of nterest s lsted n each cell; n the nput data set of the weghtng procedure each of them s replaced by the respectve populaton total, n terms of the auxlary varable Average number of employees n the year t-1 (a smlar specfcaton s done n terms of the auxlary varable Number of enterprses): Table 3A: Example of benchmark specfcaton (known totals) DOMAIN DOM1: Nace-4 dgt (codes) DOM2: Nace-3 dgt * Sze class (codes) DOM3: Nace-2dgt *Nuts Nace 2 dgt Tx1 Tx2 Tx3 Tx4 Txj d Tx15 Tx16 Tx17 Tx18 Tx180 Tx181 Tx * * * *1-9.. north central south * * * * north central south * * * *1-9.. north central south * * * *50+.. north central south * * * *1-9.. north central south * north central south * * * *1-9.. north central south * * * *50+ north central south For each respondng unt (enterprse), the vector of the auxlary varable Average number of employees n the year t-1 can be expressed as n Table 3B, whether or not the unt belongs to the doman represented on the cell: Table 3B: Example of sample data specfcaton (α k = number of persons employed of enterprse k) DOM1:Nace-4 dgt (x 2 -values) DOM2: Nace-3 dgt * Sze class (x 2 --values) DOM3: Nace-2dgt *Nuts Unt dentfer Doman Nace2 q k Drect weght x1 x2 x3 x4 xj d x15 x16 x17 x18 X180 X181 X α 1 22 α α α α 2 1,4 0 α α α α 3 10, α α α α 4 3 α α α α 5 6,4 0 α α α α k.... n p 14 α np α np 0.. α np α np 0 0 8

9 The overall number of benchmarks (constraned estmates) n the optmsaton process s equal to 182. In spte of the consderable number of constrants to be satsfed, the weghtng process ends wth a good convergence between fnal estmates and know populaton totals (see Table 4). Table 4A: Example of output of the weghtng procedure for a doman of nterest (2 dgt NACE): Check on the auxlary varables Doman code =14 Constrant code Known Totals (1) Fnal Estmates of X varables (2) Drect Estmates of the X varables (3) (2)-(1) (3)-(1) Samplng unts Table 4B: Example of output of the weghtng procedure for a doman of nterest (2 dgt NACE): Fnal weghts Unt dentfer Doman Nace2 q k Drect weght Fnal weght 1 10 α , α 2 1, α 3 10, α α 5 6,4 4,2 α k n p 14 α np The estmator effect for the fnal weghts has been calculated on the sample of respondng enterprses wth less than 100 persons employed at dvson of actvty level (NACE Rev dgt), for the followng subset of target varables: 1. Turnover (code ) 9

10 2. Value added at factor cost (code ) 3. Personnel costs (code ) 4. Gross nvestment n tangble goods (code ) 5. Number of employees(code ) 6. Wages and salares (code ). The estmator effect values confrm the hgher effcency ganed by usng the GREG estmator nstead of the drect estmaton for most of the consdered dvsons of actvtes and target varables; the man excepton concerns the varable Gross nvestment n tangble goods, whch s hardly predctable by a model. Moreover, the varables Turnover and Value added at factor cost have an estmator effect hgher than 1 for some dvsons,.e., 73- Research and development and 74- Other busness actvtes, that are charactersed by specalsed actvtes where the hgh amounts nvoced by the enterprses can be attaned by a relatvely small number of sklled employees. In concluson, apart from a small group of economc actvty classes, the varable average number of employees has shown a good correlaton wth the followng target varables of nterest: turnover, producton value, whereas t s not enough correlated wth Gross nvestment n tangble goods. 5. Examples tool specfc 6. Glossary For defntons of terms used n ths module, please refer to the separate Glossary provded as part of the handbook. 7. References Bethel, J. (1989), Sample Allocaton n Multvarate Surveys. Survey Methodology 15, Bredt, F. J. and Opsomer, J. D. (2000), Local Polynomal Regresson Estmators n Survey Samplng. The Annals of Statstcs 28, Cassel, C. M., Särndal, C.-E., and Wretman, J. H. (1976), Some Results on Generalzed Dfference Estmaton and Generalzed Regresson Estmaton for Fnte Populatons. Bometrka 63, Devlle, J. C. and Särndal, C.-E. (1992), Calbraton Estmators n Survey Samplng. Journal of the Amercan Statstcal Assocaton 87, Devlle, J. C., Särndal, C.-E., and Sautory, O. (1993), Generalzed Rakng Procedures n Survey Samplng. Journal of the Amercan Statstcal Assocaton 88, Hedln, D., Falvey, H., Chambers, R., and Kokc, P. (2001), Does the Model Matter for GREG Estmaton? A Busness Survey Example. Journal of Offcal Statstcs 17, Ksh, L. (1995), Methods for Desgn Effects. Journal of Offcal Statstcs 11, Lehtonen, R. and Vejanen, A. (1998), Logstc Generalzed Regresson Estmators. Survey Methodology 24,

11 Montanar, G. E. and Ranall, M. G. (2005), Nonparametrc Model Calbraton Estmaton n Survey Samplng. Journal of the Amercan Statstcal Assocaton 100, Ohlsson, E. (1995), Coordnaton of Samples usng Permanent Random Numbers. In: Busness Survey Methods (eds. Cox, B. G., Bnder, D. A., Chnnapa, B. N., Chrstanson, A., Colledge, M. J., and Kott, P. S.), Wley, New York, Särndal, C.-E., Swensson, B., and Wretman, J. (1992), Model Asssted Survey Samplng. Sprnger Verlag, New York. 11

12 Specfc secton 8. Purpose of the method The method s used for estmaton, when auxlary nformaton s avalable at unt or doman level. It can be used to reduce the varance of the estmates f a strong correlaton between the target varable and the auxlary varables exsts. At the same tme, GREG allows to calbrate to the known populaton totals of the auxlary varables x. Ths means that GREG s a partcular case of a calbraton estmator when the dstance functon s lnear,.e., the fnal weghts that satsfy the calbraton equatons w are chosen to mnmse the followng dstance wth the ntal weghts d: 9. Recommended use of the method ( w d ) 2d s 1. GREG s recommended when a lnear relatonshp between target y and covarate varables x s present, y = β x + ε. 10. Possble dsadvantages of the method 1. GREG can ntroduce a large varaton n weghts that can cause an ncrease n varance, see formula (6) to quantfy the mpact. 2. Possbly correcton weghts g too far from unty or negatve fnal weghts as the correcton factors (see formula (4)) can be n some cases a negatve quantty. 3. Even beng asymptotcally unbased, bas can be ntroduced f sample sze s too small (see also secton 14). 4. GREG can be very senstve to presence of outlers (see Weghtng and Estmaton Outler Treatment ); an llustratve example wth dscusson can be found n Hedln et al. (2001). Ths ssue s very relevant to busness survey where target varables are typcally non-normal and very skewed. 11. Varants of the method 1. Specfc case: Rato regresson. 2. Non-lnear GREG estmators. Expresson (2) can be appled on general models. In fact, the predcton 2. ŷ, that for GREG s based on lnear model can be based on more complex models f the target varable for example s not normal. An example of non-lnear GREG s logstc GREG whch s based on logstc model when the target varable s a bnary varable. The use of more complex models, however, requres more detaled nformaton on the x varable w.r.t. the knowledge of populaton total that s needed by (lnear) GREG. 12. Input data 1. Ds-nput1 = elementary sample data contanng covarates, drect weghts and scale coeffcents q, model group (.e., level for whch the model s specfed). 12

13 2. Ds-nput2 = known totals on the covarates for each model group. 13. Logcal precondtons 1. Mssng values 1. GREG s calculated on sample values on DS-nput1 after mputaton anyway, varance estmaton s affected by the mputaton. 2. Ds-nput2 cannot contan mssng values. 2. Erroneous values Other qualty related precondtons Other types of precondtons 1. If the auxlary varables are categorcal, the known totals for dfferent parttons should not be n conflct. 14. Tunng parameters 1. Choce of the auxlary covarates n the model, a rule of thumb for the choce of categorcal varable s to defne categores so that the sample totals are greater than Choce of the model group level. 3. Choce of q. 15. Recommended use of the ndvdual varants of the method 1. Non-lnear GREG can be used when auxlary varables are avalable for each unt n the populaton and the relatonshp wth the target varable s markedly non-lnear. 16. Output data 1. Ds-output1 = elementary sample data set contanng the new fnal weghts. 17. Propertes of the output data 1. The fnal weghts allows to satsfy the mplct constrants gven by the known totals of the auxlary varables. 18. Unt of nput data sutable for the method Sample unts, also separately by model group. 19. User nteracton - not tool specfc 1. Choce of auxlary covarates. 2. Choce of the group level. 13

14 3. Choce of q. 20. Loggng ndcators 1. The run tme of the applcaton. 2. Iteratons to attan convergence n the estmaton process. 3. Characterstcs of the nput data, for nstance problem sze. 21. Qualty ndcators of the output data 1. The coeffcent of varaton of the fnal weghts n comparson wth the basc weghts. 2. Presence of negatve weghts, n ths case t may be approprate to consder a dfferent underlyng model or to use a calbraton estmator wth a functon that allows to restrct the range of fnal weghts (see the module Weghtng and Estmaton Calbraton ). 3. Varance, coeffcent of varaton of produced estmates. 4. Check of equalty of sample estmates of x and known populaton totals. 22. Actual use of the method 1. Interconnectons wth other modules 23. Themes that refer explctly to ths module 1. Weghtng and Estmaton Man Module 2. Qualty Aspects Qualty of Statstcs 24. Related methods descrbed n other modules 1. Weghtng and Estmaton Calbraton 2. Weghtng and Estmaton Outler Treatment 25. Mathematcal technques used by the method descrbed n ths module 1. Matrx algebra 26. GSBPM phases where the method descrbed n ths module s used Calculate weghts Calculate aggregates 27. Tools that mplement the method descrbed n ths module 1. CALMAR (Devlle, Särndal and Sautory 1993) 2. CLAN (Statstcs Sweden) 3. BASCULA (The Netherlands) 14

15 4. GES (StatCan) 5. GENESEES (ISTAT) 6. Survey, an R package downloadable from the CRAN 7. Samplng, an R package downloadable from the CRAN 8. REgenesees (ISTAT), an R package downloadable from the JonUP: Process step performed by the method Estmaton 15

16 Admnstratve secton 29. Module code Weghtng and Estmaton-M-Generalsed Regresson Estmator 30. Verson hstory Verson Date Descrpton of changes Author Insttute frst verson Loredana D Consglo, Clauda De Vts, Crstna Cascano second verson Loredana D Consglo, Clauda De Vts, Crstna Cascano second verson wth correctons second verson after EB revew prelmnary release fnal verson wthn the Memobust project Loredana D Consglo, Clauda De Vts, Crstna Cascano Loredana D Consglo, Clauda De Vts, Crstna Cascano ISTAT ISTAT ISTAT ISTAT 31. Template verson and prnt date Template verson used 1.0 p 4 d.d Prnt date :32 16

Estimation: Part 2. Chapter GREG estimation

Estimation: Part 2. Chapter GREG estimation Chapter 9 Estmaton: Part 2 9. GREG estmaton In Chapter 8, we have seen that the regresson estmator s an effcent estmator when there s a lnear relatonshp between y and x. In ths chapter, we generalzed the

More information

Efficient nonresponse weighting adjustment using estimated response probability

Efficient nonresponse weighting adjustment using estimated response probability Effcent nonresponse weghtng adjustment usng estmated response probablty Jae Kwang Km Department of Appled Statstcs, Yonse Unversty, Seoul, 120-749, KOREA Key Words: Regresson estmator, Propensty score,

More information

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction ECONOMICS 5* -- NOTE (Summary) ECON 5* -- NOTE The Multple Classcal Lnear Regresson Model (CLRM): Specfcaton and Assumptons. Introducton CLRM stands for the Classcal Lnear Regresson Model. The CLRM s also

More information

Nonparametric model calibration estimation in survey sampling

Nonparametric model calibration estimation in survey sampling Ames February 18, 004 Nonparametrc model calbraton estmaton n survey samplng M. Govanna Ranall Department of Statstcs, Colorado State Unversty (Jont work wth G.E. Montanar, Dpartmento d Scenze Statstche,

More information

Comparison of Regression Lines

Comparison of Regression Lines STATGRAPHICS Rev. 9/13/2013 Comparson of Regresson Lnes Summary... 1 Data Input... 3 Analyss Summary... 4 Plot of Ftted Model... 6 Condtonal Sums of Squares... 6 Analyss Optons... 7 Forecasts... 8 Confdence

More information

Small Area Estimation for Business Surveys

Small Area Estimation for Business Surveys ASA Secton on Survey Research Methods Small Area Estmaton for Busness Surveys Hukum Chandra Southampton Statstcal Scences Research Insttute, Unversty of Southampton Hghfeld, Southampton-SO17 1BJ, U.K.

More information

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4)

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4) I. Classcal Assumptons Econ7 Appled Econometrcs Topc 3: Classcal Model (Studenmund, Chapter 4) We have defned OLS and studed some algebrac propertes of OLS. In ths topc we wll study statstcal propertes

More information

A Bound for the Relative Bias of the Design Effect

A Bound for the Relative Bias of the Design Effect A Bound for the Relatve Bas of the Desgn Effect Alberto Padlla Banco de Méxco Abstract Desgn effects are typcally used to compute sample szes or standard errors from complex surveys. In ths paper, we show

More information

Negative Binomial Regression

Negative Binomial Regression STATGRAPHICS Rev. 9/16/2013 Negatve Bnomal Regresson Summary... 1 Data Input... 3 Statstcal Model... 3 Analyss Summary... 4 Analyss Optons... 7 Plot of Ftted Model... 8 Observed Versus Predcted... 10 Predctons...

More information

A note on regression estimation with unknown population size

A note on regression estimation with unknown population size Statstcs Publcatons Statstcs 6-016 A note on regresson estmaton wth unknown populaton sze Mchael A. Hdroglou Statstcs Canada Jae Kwang Km Iowa State Unversty jkm@astate.edu Chrstan Olver Nambeu Statstcs

More information

Generalized Linear Methods

Generalized Linear Methods Generalzed Lnear Methods 1 Introducton In the Ensemble Methods the general dea s that usng a combnaton of several weak learner one could make a better learner. More formally, assume that we have a set

More information

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity LINEAR REGRESSION ANALYSIS MODULE IX Lecture - 30 Multcollnearty Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur 2 Remedes for multcollnearty Varous technques have

More information

Module 3 LOSSY IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur

Module 3 LOSSY IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur Module 3 LOSSY IMAGE COMPRESSION SYSTEMS Verson ECE IIT, Kharagpur Lesson 6 Theory of Quantzaton Verson ECE IIT, Kharagpur Instructonal Objectves At the end of ths lesson, the students should be able to:

More information

A Robust Method for Calculating the Correlation Coefficient

A Robust Method for Calculating the Correlation Coefficient A Robust Method for Calculatng the Correlaton Coeffcent E.B. Nven and C. V. Deutsch Relatonshps between prmary and secondary data are frequently quantfed usng the correlaton coeffcent; however, the tradtonal

More information

Bias-correction under a semi-parametric model for small area estimation

Bias-correction under a semi-parametric model for small area estimation Bas-correcton under a sem-parametrc model for small area estmaton Laura Dumtrescu, Vctora Unversty of Wellngton jont work wth J. N. K. Rao, Carleton Unversty ICORS 2017 Workshop on Robust Inference for

More information

Statistics II Final Exam 26/6/18

Statistics II Final Exam 26/6/18 Statstcs II Fnal Exam 26/6/18 Academc Year 2017/18 Solutons Exam duraton: 2 h 30 mn 1. (3 ponts) A town hall s conductng a study to determne the amount of leftover food produced by the restaurants n the

More information

Uncertainty in measurements of power and energy on power networks

Uncertainty in measurements of power and energy on power networks Uncertanty n measurements of power and energy on power networks E. Manov, N. Kolev Department of Measurement and Instrumentaton, Techncal Unversty Sofa, bul. Klment Ohrdsk No8, bl., 000 Sofa, Bulgara Tel./fax:

More information

A Design Effect Measure for Calibration Weighting in Cluster Samples

A Design Effect Measure for Calibration Weighting in Cluster Samples JSM 04 - Survey Research Methods Secton A Desgn Effect Measure for Calbraton Weghtng n Cluster Samples Kmberly Henry and Rchard Vallant Statstcs of Income, Internal Revenue Servce 77 K Street, E, Washngton,

More information

Chapter 9: Statistical Inference and the Relationship between Two Variables

Chapter 9: Statistical Inference and the Relationship between Two Variables Chapter 9: Statstcal Inference and the Relatonshp between Two Varables Key Words The Regresson Model The Sample Regresson Equaton The Pearson Correlaton Coeffcent Learnng Outcomes After studyng ths chapter,

More information

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analyss of Varance and Desgn of Experment-I MODULE VII LECTURE - 3 ANALYSIS OF COVARIANCE Dr Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur Any scentfc experment s performed

More information

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore Sesson Outlne Introducton to classfcaton problems and dscrete choce models. Introducton to Logstcs Regresson. Logstc functon and Logt functon. Maxmum Lkelhood Estmator (MLE) for estmaton of LR parameters.

More information

Weighted Estimating Equations with Response Propensities in Terms of Covariates Observed only for Responders

Weighted Estimating Equations with Response Propensities in Terms of Covariates Observed only for Responders Weghted Estmatng Equatons wth Response Propenstes n Terms of Covarates Observed only for Responders Erc V. Slud, U.S. Census Bureau, CSRM Unv. of Maryland, Mathematcs Dept. NISS Mssng Data Workshop, November

More information

2016 Wiley. Study Session 2: Ethical and Professional Standards Application

2016 Wiley. Study Session 2: Ethical and Professional Standards Application 6 Wley Study Sesson : Ethcal and Professonal Standards Applcaton LESSON : CORRECTION ANALYSIS Readng 9: Correlaton and Regresson LOS 9a: Calculate and nterpret a sample covarance and a sample correlaton

More information

Chapter 13: Multiple Regression

Chapter 13: Multiple Regression Chapter 13: Multple Regresson 13.1 Developng the multple-regresson Model The general model can be descrbed as: It smplfes for two ndependent varables: The sample ft parameter b 0, b 1, and b are used to

More information

Linear Approximation with Regularization and Moving Least Squares

Linear Approximation with Regularization and Moving Least Squares Lnear Approxmaton wth Regularzaton and Movng Least Squares Igor Grešovn May 007 Revson 4.6 (Revson : March 004). 5 4 3 0.5 3 3.5 4 Contents: Lnear Fttng...4. Weghted Least Squares n Functon Approxmaton...

More information

Chapter 5. Solution of System of Linear Equations. Module No. 6. Solution of Inconsistent and Ill Conditioned Systems

Chapter 5. Solution of System of Linear Equations. Module No. 6. Solution of Inconsistent and Ill Conditioned Systems Numercal Analyss by Dr. Anta Pal Assstant Professor Department of Mathematcs Natonal Insttute of Technology Durgapur Durgapur-713209 emal: anta.bue@gmal.com 1 . Chapter 5 Soluton of System of Lnear Equatons

More information

The Ordinary Least Squares (OLS) Estimator

The Ordinary Least Squares (OLS) Estimator The Ordnary Least Squares (OLS) Estmator 1 Regresson Analyss Regresson Analyss: a statstcal technque for nvestgatng and modelng the relatonshp between varables. Applcatons: Engneerng, the physcal and chemcal

More information

Chapter 8 Indicator Variables

Chapter 8 Indicator Variables Chapter 8 Indcator Varables In general, e explanatory varables n any regresson analyss are assumed to be quanttatve n nature. For example, e varables lke temperature, dstance, age etc. are quanttatve n

More information

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6 Department of Quanttatve Methods & Informaton Systems Tme Seres and Ther Components QMIS 30 Chapter 6 Fall 00 Dr. Mohammad Zanal These sldes were modfed from ther orgnal source for educatonal purpose only.

More information

CHAPTER 5 NUMERICAL EVALUATION OF DYNAMIC RESPONSE

CHAPTER 5 NUMERICAL EVALUATION OF DYNAMIC RESPONSE CHAPTER 5 NUMERICAL EVALUATION OF DYNAMIC RESPONSE Analytcal soluton s usually not possble when exctaton vares arbtrarly wth tme or f the system s nonlnear. Such problems can be solved by numercal tmesteppng

More information

Basically, if you have a dummy dependent variable you will be estimating a probability.

Basically, if you have a dummy dependent variable you will be estimating a probability. ECON 497: Lecture Notes 13 Page 1 of 1 Metropoltan State Unversty ECON 497: Research and Forecastng Lecture Notes 13 Dummy Dependent Varable Technques Studenmund Chapter 13 Bascally, f you have a dummy

More information

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers Psychology 282 Lecture #24 Outlne Regresson Dagnostcs: Outlers In an earler lecture we studed the statstcal assumptons underlyng the regresson model, ncludng the followng ponts: Formal statement of assumptons.

More information

Introduction to Regression

Introduction to Regression Introducton to Regresson Dr Tom Ilvento Department of Food and Resource Economcs Overvew The last part of the course wll focus on Regresson Analyss Ths s one of the more powerful statstcal technques Provdes

More information

Discussion of Extensions of the Gauss-Markov Theorem to the Case of Stochastic Regression Coefficients Ed Stanek

Discussion of Extensions of the Gauss-Markov Theorem to the Case of Stochastic Regression Coefficients Ed Stanek Dscusson of Extensons of the Gauss-arkov Theorem to the Case of Stochastc Regresson Coeffcents Ed Stanek Introducton Pfeffermann (984 dscusses extensons to the Gauss-arkov Theorem n settngs where regresson

More information

x i1 =1 for all i (the constant ).

x i1 =1 for all i (the constant ). Chapter 5 The Multple Regresson Model Consder an economc model where the dependent varable s a functon of K explanatory varables. The economc model has the form: y = f ( x,x,..., ) xk Approxmate ths by

More information

Statistics for Economics & Business

Statistics for Economics & Business Statstcs for Economcs & Busness Smple Lnear Regresson Learnng Objectves In ths chapter, you learn: How to use regresson analyss to predct the value of a dependent varable based on an ndependent varable

More information

REPLICATION VARIANCE ESTIMATION UNDER TWO-PHASE SAMPLING IN THE PRESENCE OF NON-RESPONSE

REPLICATION VARIANCE ESTIMATION UNDER TWO-PHASE SAMPLING IN THE PRESENCE OF NON-RESPONSE STATISTICA, anno LXXIV, n. 3, 2014 REPLICATION VARIANCE ESTIMATION UNDER TWO-PHASE SAMPLING IN THE PRESENCE OF NON-RESPONSE Muqaddas Javed 1 Natonal College of Busness Admnstraton and Economcs, Lahore,

More information

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X Statstcs 1: Probablty Theory II 37 3 EPECTATION OF SEVERAL RANDOM VARIABLES As n Probablty Theory I, the nterest n most stuatons les not on the actual dstrbuton of a random vector, but rather on a number

More information

Chapter 5 Multilevel Models

Chapter 5 Multilevel Models Chapter 5 Multlevel Models 5.1 Cross-sectonal multlevel models 5.1.1 Two-level models 5.1.2 Multple level models 5.1.3 Multple level modelng n other felds 5.2 Longtudnal multlevel models 5.2.1 Two-level

More information

Chapter 3 Describing Data Using Numerical Measures

Chapter 3 Describing Data Using Numerical Measures Chapter 3 Student Lecture Notes 3-1 Chapter 3 Descrbng Data Usng Numercal Measures Fall 2006 Fundamentals of Busness Statstcs 1 Chapter Goals To establsh the usefulness of summary measures of data. The

More information

III. Econometric Methodology Regression Analysis

III. Econometric Methodology Regression Analysis Page Econ07 Appled Econometrcs Topc : An Overvew of Regresson Analyss (Studenmund, Chapter ) I. The Nature and Scope of Econometrcs. Lot s of defntons of econometrcs. Nobel Prze Commttee Paul Samuelson,

More information

4DVAR, according to the name, is a four-dimensional variational method.

4DVAR, according to the name, is a four-dimensional variational method. 4D-Varatonal Data Assmlaton (4D-Var) 4DVAR, accordng to the name, s a four-dmensonal varatonal method. 4D-Var s actually a drect generalzaton of 3D-Var to handle observatons that are dstrbuted n tme. The

More information

Comparison of the Population Variance Estimators. of 2-Parameter Exponential Distribution Based on. Multiple Criteria Decision Making Method

Comparison of the Population Variance Estimators. of 2-Parameter Exponential Distribution Based on. Multiple Criteria Decision Making Method Appled Mathematcal Scences, Vol. 7, 0, no. 47, 07-0 HIARI Ltd, www.m-hkar.com Comparson of the Populaton Varance Estmators of -Parameter Exponental Dstrbuton Based on Multple Crtera Decson Makng Method

More information

Chapter 11: Simple Linear Regression and Correlation

Chapter 11: Simple Linear Regression and Correlation Chapter 11: Smple Lnear Regresson and Correlaton 11-1 Emprcal Models 11-2 Smple Lnear Regresson 11-3 Propertes of the Least Squares Estmators 11-4 Hypothess Test n Smple Lnear Regresson 11-4.1 Use of t-tests

More information

Chapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise.

Chapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise. Chapter - The Smple Lnear Regresson Model The lnear regresson equaton s: where y + = β + β e for =,..., y and are observable varables e s a random error How can an estmaton rule be constructed for the

More information

USE OF DOUBLE SAMPLING SCHEME IN ESTIMATING THE MEAN OF STRATIFIED POPULATION UNDER NON-RESPONSE

USE OF DOUBLE SAMPLING SCHEME IN ESTIMATING THE MEAN OF STRATIFIED POPULATION UNDER NON-RESPONSE STATISTICA, anno LXXV, n. 4, 015 USE OF DOUBLE SAMPLING SCHEME IN ESTIMATING THE MEAN OF STRATIFIED POPULATION UNDER NON-RESPONSE Manoj K. Chaudhary 1 Department of Statstcs, Banaras Hndu Unversty, Varanas,

More information

Homework Assignment 3 Due in class, Thursday October 15

Homework Assignment 3 Due in class, Thursday October 15 Homework Assgnment 3 Due n class, Thursday October 15 SDS 383C Statstcal Modelng I 1 Rdge regresson and Lasso 1. Get the Prostrate cancer data from http://statweb.stanford.edu/~tbs/elemstatlearn/ datasets/prostate.data.

More information

Correlation and Regression. Correlation 9.1. Correlation. Chapter 9

Correlation and Regression. Correlation 9.1. Correlation. Chapter 9 Chapter 9 Correlaton and Regresson 9. Correlaton Correlaton A correlaton s a relatonshp between two varables. The data can be represented b the ordered pars (, ) where s the ndependent (or eplanator) varable,

More information

Econometrics of Panel Data

Econometrics of Panel Data Econometrcs of Panel Data Jakub Mućk Meetng # 8 Jakub Mućk Econometrcs of Panel Data Meetng # 8 1 / 17 Outlne 1 Heterogenety n the slope coeffcents 2 Seemngly Unrelated Regresson (SUR) 3 Swamy s random

More information

Conditional and unconditional models in modelassisted estimation of finite population totals

Conditional and unconditional models in modelassisted estimation of finite population totals Unversty of Wollongong Research Onlne Faculty of Informatcs - Papers Archve) Faculty of Engneerng and Informaton Scences 2011 Condtonal and uncondtonal models n modelasssted estmaton of fnte populaton

More information

Maximizing Overlap of Large Primary Sampling Units in Repeated Sampling: A comparison of Ernst s Method with Ohlsson s Method

Maximizing Overlap of Large Primary Sampling Units in Repeated Sampling: A comparison of Ernst s Method with Ohlsson s Method Maxmzng Overlap of Large Prmary Samplng Unts n Repeated Samplng: A comparson of Ernst s Method wth Ohlsson s Method Red Rottach and Padrac Murphy 1 U.S. Census Bureau 4600 Slver Hll Road, Washngton DC

More information

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity LINEAR REGRESSION ANALYSIS MODULE IX Lecture - 31 Multcollnearty Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur 6. Rdge regresson The OLSE s the best lnear unbased

More information

Econ107 Applied Econometrics Topic 9: Heteroskedasticity (Studenmund, Chapter 10)

Econ107 Applied Econometrics Topic 9: Heteroskedasticity (Studenmund, Chapter 10) I. Defnton and Problems Econ7 Appled Econometrcs Topc 9: Heteroskedastcty (Studenmund, Chapter ) We now relax another classcal assumpton. Ths s a problem that arses often wth cross sectons of ndvduals,

More information

Lecture 6: Introduction to Linear Regression

Lecture 6: Introduction to Linear Regression Lecture 6: Introducton to Lnear Regresson An Manchakul amancha@jhsph.edu 24 Aprl 27 Lnear regresson: man dea Lnear regresson can be used to study an outcome as a lnear functon of a predctor Example: 6

More information

x = , so that calculated

x = , so that calculated Stat 4, secton Sngle Factor ANOVA notes by Tm Plachowsk n chapter 8 we conducted hypothess tests n whch we compared a sngle sample s mean or proporton to some hypotheszed value Chapter 9 expanded ths to

More information

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition)

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition) Count Data Models See Book Chapter 11 2 nd Edton (Chapter 10 1 st Edton) Count data consst of non-negatve nteger values Examples: number of drver route changes per week, the number of trp departure changes

More information

Credit Card Pricing and Impact of Adverse Selection

Credit Card Pricing and Impact of Adverse Selection Credt Card Prcng and Impact of Adverse Selecton Bo Huang and Lyn C. Thomas Unversty of Southampton Contents Background Aucton model of credt card solctaton - Errors n probablty of beng Good - Errors n

More information

LINEAR REGRESSION ANALYSIS. MODULE VIII Lecture Indicator Variables

LINEAR REGRESSION ANALYSIS. MODULE VIII Lecture Indicator Variables LINEAR REGRESSION ANALYSIS MODULE VIII Lecture - 7 Indcator Varables Dr. Shalabh Department of Maematcs and Statstcs Indan Insttute of Technology Kanpur Indcator varables versus quanttatve explanatory

More information

Resource Allocation and Decision Analysis (ECON 8010) Spring 2014 Foundations of Regression Analysis

Resource Allocation and Decision Analysis (ECON 8010) Spring 2014 Foundations of Regression Analysis Resource Allocaton and Decson Analss (ECON 800) Sprng 04 Foundatons of Regresson Analss Readng: Regresson Analss (ECON 800 Coursepak, Page 3) Defntons and Concepts: Regresson Analss statstcal technques

More information

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation Statstcs for Managers Usng Mcrosoft Excel/SPSS Chapter 13 The Smple Lnear Regresson Model and Correlaton 1999 Prentce-Hall, Inc. Chap. 13-1 Chapter Topcs Types of Regresson Models Determnng the Smple Lnear

More information

Nonparametric Regression Estimation. of Finite Population Totals. under Two-Stage Sampling

Nonparametric Regression Estimation. of Finite Population Totals. under Two-Stage Sampling Nonparametrc Regresson Estmaton of Fnte Populaton Totals under Two-Stage Samplng J-Yeon Km Iowa State Unversty F. Jay Bredt Colorado State Unversty Jean D. Opsomer Iowa State Unversty ay 21, 2003 Abstract

More information

Statistical registers by restricted neighbor imputation

Statistical registers by restricted neighbor imputation Statstcal regsters by restrcted neghbor mputaton an applcaton to the Norwegan Agrculture Survey Abstract Nna Hagesæther 1 and L-Chun Zhang Statstcs Norway In ths paper we mplement the method of Zhang and

More information

Lecture 3 Stat102, Spring 2007

Lecture 3 Stat102, Spring 2007 Lecture 3 Stat0, Sprng 007 Chapter 3. 3.: Introducton to regresson analyss Lnear regresson as a descrptve technque The least-squares equatons Chapter 3.3 Samplng dstrbuton of b 0, b. Contnued n net lecture

More information

28. SIMPLE LINEAR REGRESSION III

28. SIMPLE LINEAR REGRESSION III 8. SIMPLE LINEAR REGRESSION III Ftted Values and Resduals US Domestc Beers: Calores vs. % Alcohol To each observed x, there corresponds a y-value on the ftted lne, y ˆ = βˆ + βˆ x. The are called ftted

More information

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analyss of Varance and Desgn of Experment-I MODULE VIII LECTURE - 34 ANALYSIS OF VARIANCE IN RANDOM-EFFECTS MODEL AND MIXED-EFFECTS EFFECTS MODEL Dr Shalabh Department of Mathematcs and Statstcs Indan

More information

CSci 6974 and ECSE 6966 Math. Tech. for Vision, Graphics and Robotics Lecture 21, April 17, 2006 Estimating A Plane Homography

CSci 6974 and ECSE 6966 Math. Tech. for Vision, Graphics and Robotics Lecture 21, April 17, 2006 Estimating A Plane Homography CSc 6974 and ECSE 6966 Math. Tech. for Vson, Graphcs and Robotcs Lecture 21, Aprl 17, 2006 Estmatng A Plane Homography Overvew We contnue wth a dscusson of the major ssues, usng estmaton of plane projectve

More information

Parametric fractional imputation for missing data analysis. Jae Kwang Kim Survey Working Group Seminar March 29, 2010

Parametric fractional imputation for missing data analysis. Jae Kwang Kim Survey Working Group Seminar March 29, 2010 Parametrc fractonal mputaton for mssng data analyss Jae Kwang Km Survey Workng Group Semnar March 29, 2010 1 Outlne Introducton Proposed method Fractonal mputaton Approxmaton Varance estmaton Multple mputaton

More information

e i is a random error

e i is a random error Chapter - The Smple Lnear Regresson Model The lnear regresson equaton s: where + β + β e for,..., and are observable varables e s a random error How can an estmaton rule be constructed for the unknown

More information

Statistics for Business and Economics

Statistics for Business and Economics Statstcs for Busness and Economcs Chapter 11 Smple Regresson Copyrght 010 Pearson Educaton, Inc. Publshng as Prentce Hall Ch. 11-1 11.1 Overvew of Lnear Models n An equaton can be ft to show the best lnear

More information

T E C O L O T E R E S E A R C H, I N C.

T E C O L O T E R E S E A R C H, I N C. T E C O L O T E R E S E A R C H, I N C. B rdg n g En g neern g a nd Econo mcs S nce 1973 THE MINIMUM-UNBIASED-PERCENTAGE ERROR (MUPE) METHOD IN CER DEVELOPMENT Thrd Jont Annual ISPA/SCEA Internatonal Conference

More information

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 008 Recall: man dea of lnear regresson Lnear regresson can be used to study

More information

Computation of Higher Order Moments from Two Multinomial Overdispersion Likelihood Models

Computation of Higher Order Moments from Two Multinomial Overdispersion Likelihood Models Computaton of Hgher Order Moments from Two Multnomal Overdsperson Lkelhood Models BY J. T. NEWCOMER, N. K. NEERCHAL Department of Mathematcs and Statstcs, Unversty of Maryland, Baltmore County, Baltmore,

More information

On Outlier Robust Small Area Mean Estimate Based on Prediction of Empirical Distribution Function

On Outlier Robust Small Area Mean Estimate Based on Prediction of Empirical Distribution Function On Outler Robust Small Area Mean Estmate Based on Predcton of Emprcal Dstrbuton Functon Payam Mokhtaran Natonal Insttute of Appled Statstcs Research Australa Unversty of Wollongong Small Area Estmaton

More information

Inner Product. Euclidean Space. Orthonormal Basis. Orthogonal

Inner Product. Euclidean Space. Orthonormal Basis. Orthogonal Inner Product Defnton 1 () A Eucldean space s a fnte-dmensonal vector space over the reals R, wth an nner product,. Defnton 2 (Inner Product) An nner product, on a real vector space X s a symmetrc, blnear,

More information

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding Recall: man dea of lnear regresson Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 8 Lnear regresson can be used to study an

More information

Chapter 14: Logit and Probit Models for Categorical Response Variables

Chapter 14: Logit and Probit Models for Categorical Response Variables Chapter 4: Logt and Probt Models for Categorcal Response Varables Sect 4. Models for Dchotomous Data We wll dscuss only ths secton of Chap 4, whch s manly about Logstc Regresson, a specal case of the famly

More information

Linear Regression Analysis: Terminology and Notation

Linear Regression Analysis: Terminology and Notation ECON 35* -- Secton : Basc Concepts of Regresson Analyss (Page ) Lnear Regresson Analyss: Termnology and Notaton Consder the generc verson of the smple (two-varable) lnear regresson model. It s represented

More information

ISSN: ISO 9001:2008 Certified International Journal of Engineering and Innovative Technology (IJEIT) Volume 3, Issue 1, July 2013

ISSN: ISO 9001:2008 Certified International Journal of Engineering and Innovative Technology (IJEIT) Volume 3, Issue 1, July 2013 ISSN: 2277-375 Constructon of Trend Free Run Orders for Orthogonal rrays Usng Codes bstract: Sometmes when the expermental runs are carred out n a tme order sequence, the response can depend on the run

More information

9. Binary Dependent Variables

9. Binary Dependent Variables 9. Bnar Dependent Varables 9. Homogeneous models Log, prob models Inference Tax preparers 9.2 Random effects models 9.3 Fxed effects models 9.4 Margnal models and GEE Appendx 9A - Lkelhood calculatons

More information

Problem Set 9 Solutions

Problem Set 9 Solutions Desgn and Analyss of Algorthms May 4, 2015 Massachusetts Insttute of Technology 6.046J/18.410J Profs. Erk Demane, Srn Devadas, and Nancy Lynch Problem Set 9 Solutons Problem Set 9 Solutons Ths problem

More information

STAT 3008 Applied Regression Analysis

STAT 3008 Applied Regression Analysis STAT 3008 Appled Regresson Analyss Tutoral : Smple Lnear Regresson LAI Chun He Department of Statstcs, The Chnese Unversty of Hong Kong 1 Model Assumpton To quantfy the relatonshp between two factors,

More information

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analyss of Varance and Desgn of Exerments-I MODULE III LECTURE - 2 EXPERIMENTAL DESIGN MODELS Dr. Shalabh Deartment of Mathematcs and Statstcs Indan Insttute of Technology Kanur 2 We consder the models

More information

Kernel Methods and SVMs Extension

Kernel Methods and SVMs Extension Kernel Methods and SVMs Extenson The purpose of ths document s to revew materal covered n Machne Learnng 1 Supervsed Learnng regardng support vector machnes (SVMs). Ths document also provdes a general

More information

This column is a continuation of our previous column

This column is a continuation of our previous column Comparson of Goodness of Ft Statstcs for Lnear Regresson, Part II The authors contnue ther dscusson of the correlaton coeffcent n developng a calbraton for quanttatve analyss. Jerome Workman Jr. and Howard

More information

Cathy Walker March 5, 2010

Cathy Walker March 5, 2010 Cathy Walker March 5, 010 Part : Problem Set 1. What s the level of measurement for the followng varables? a) SAT scores b) Number of tests or quzzes n statstcal course c) Acres of land devoted to corn

More information

Time-Varying Systems and Computations Lecture 6

Time-Varying Systems and Computations Lecture 6 Tme-Varyng Systems and Computatons Lecture 6 Klaus Depold 14. Januar 2014 The Kalman Flter The Kalman estmaton flter attempts to estmate the actual state of an unknown dscrete dynamcal system, gven nosy

More information

An (almost) unbiased estimator for the S-Gini index

An (almost) unbiased estimator for the S-Gini index An (almost unbased estmator for the S-Gn ndex Thomas Demuynck February 25, 2009 Abstract Ths note provdes an unbased estmator for the absolute S-Gn and an almost unbased estmator for the relatve S-Gn for

More information

Continuous vs. Discrete Goods

Continuous vs. Discrete Goods CE 651 Transportaton Economcs Charsma Choudhury Lecture 3-4 Analyss of Demand Contnuous vs. Dscrete Goods Contnuous Goods Dscrete Goods x auto 1 Indfference u curves 3 u u 1 x 1 0 1 bus Outlne Data Modelng

More information

Supporting Information

Supporting Information Supportng Informaton The neural network f n Eq. 1 s gven by: f x l = ReLU W atom x l + b atom, 2 where ReLU s the element-wse rectfed lnear unt, 21.e., ReLUx = max0, x, W atom R d d s the weght matrx to

More information

NUMERICAL DIFFERENTIATION

NUMERICAL DIFFERENTIATION NUMERICAL DIFFERENTIATION 1 Introducton Dfferentaton s a method to compute the rate at whch a dependent output y changes wth respect to the change n the ndependent nput x. Ths rate of change s called the

More information

Population element: 1 2 N. 1.1 Sampling with Replacement: Hansen-Hurwitz Estimator(HH)

Population element: 1 2 N. 1.1 Sampling with Replacement: Hansen-Hurwitz Estimator(HH) Chapter 1 Samplng wth Unequal Probabltes Notaton: Populaton element: 1 2 N varable of nterest Y : y1 y2 y N Let s be a sample of elements drawn by a gven samplng method. In other words, s s a subset of

More information

Robust Small Area Estimation Using a Mixture Model

Robust Small Area Estimation Using a Mixture Model Robust Small Area Estmaton Usng a Mxture Model Jule Gershunskaya U.S. Bureau of Labor Statstcs Partha Lahr JPSM, Unversty of Maryland, College Park, USA ISI Meetng, Dubln, August 23, 2011 Parameter of

More information

Bayesian predictive Configural Frequency Analysis

Bayesian predictive Configural Frequency Analysis Psychologcal Test and Assessment Modelng, Volume 54, 2012 (3), 285-292 Bayesan predctve Confgural Frequency Analyss Eduardo Gutérrez-Peña 1 Abstract Confgural Frequency Analyss s a method for cell-wse

More information

Lecture 10 Support Vector Machines II

Lecture 10 Support Vector Machines II Lecture 10 Support Vector Machnes II 22 February 2016 Taylor B. Arnold Yale Statstcs STAT 365/665 1/28 Notes: Problem 3 s posted and due ths upcomng Frday There was an early bug n the fake-test data; fxed

More information

Exponential Type Product Estimator for Finite Population Mean with Information on Auxiliary Attribute

Exponential Type Product Estimator for Finite Population Mean with Information on Auxiliary Attribute Avalable at http://pvamu.edu/aam Appl. Appl. Math. ISSN: 193-9466 Vol. 10, Issue 1 (June 015), pp. 106-113 Applcatons and Appled Mathematcs: An Internatonal Journal (AAM) Exponental Tpe Product Estmator

More information

Multivariate Ratio Estimator of the Population Total under Stratified Random Sampling

Multivariate Ratio Estimator of the Population Total under Stratified Random Sampling Open Journal of Statstcs, 0,, 300-304 ttp://dx.do.org/0.436/ojs.0.3036 Publsed Onlne July 0 (ttp://www.scrp.org/journal/ojs) Multvarate Rato Estmator of te Populaton Total under Stratfed Random Samplng

More information

Annexes. EC.1. Cycle-base move illustration. EC.2. Problem Instances

Annexes. EC.1. Cycle-base move illustration. EC.2. Problem Instances ec Annexes Ths Annex frst llustrates a cycle-based move n the dynamc-block generaton tabu search. It then dsplays the characterstcs of the nstance sets, followed by detaled results of the parametercalbraton

More information

Limited Dependent Variables

Limited Dependent Variables Lmted Dependent Varables. What f the left-hand sde varable s not a contnuous thng spread from mnus nfnty to plus nfnty? That s, gven a model = f (, β, ε, where a. s bounded below at zero, such as wages

More information

Factor models with many assets: strong factors, weak factors, and the two-pass procedure

Factor models with many assets: strong factors, weak factors, and the two-pass procedure Factor models wth many assets: strong factors, weak factors, and the two-pass procedure Stanslav Anatolyev 1 Anna Mkusheva 2 1 CERGE-EI and NES 2 MIT December 2017 Stanslav Anatolyev and Anna Mkusheva

More information

18. SIMPLE LINEAR REGRESSION III

18. SIMPLE LINEAR REGRESSION III 8. SIMPLE LINEAR REGRESSION III US Domestc Beers: Calores vs. % Alcohol Ftted Values and Resduals To each observed x, there corresponds a y-value on the ftted lne, y ˆ ˆ = α + x. The are called ftted values.

More information