A Design Effect Measure for Calibration Weighting in Cluster Samples

Size: px
Start display at page:

Download "A Design Effect Measure for Calibration Weighting in Cluster Samples"

Transcription

1 JSM 04 - Survey Research Methods Secton A Desgn Effect Measure for Calbraton Weghtng n Cluster Samples Kmberly Henry and Rchard Vallant Statstcs of Income, Internal Revenue Servce 77 K Street, E, Washngton, DC 000 Unverstes of Mchgan & Maryland, College Park, MD Abstract We propose a model-based extenson of weghtng desgn-effect measures for two-stage samplng when calbraton weghtng s used. Our proposed desgn effect measure captures the jont effects of a nonepsem samplng desgn, unequal weghts produced usng calbraton adjustments, and the strength of the assocaton between an analyss varable and the auxlares used n calbraton. he proposed measure s compared to exstng desgn effect measures n an example nvolvng household-type data. Key words: Auxlary data; Ksh weghtng desgn effect; Spencer desgn effect; generalzed regresson estmator. Introducton he most popular measure for gaugng the effect of dfferental weghtng on the precson of an estmator s Ksh s (965, 99) desgn-based desgn effect. Gabler et al. (999) showed that, for cluster samplng, ths estmator s a specal form of a desgn effect produced usng varances from random effects models, wth and wthout ntra-class correlatons. Spencer (000) proposed a smple model-based approach that depends on a sngle covarate to estmate the mpact on varance of usng varable weghts. However, these approaches do not provde a summary measure of the mpact of the gans n precson that may accrue from samplng wth varyng probabltes and usng a calbraton estmator lke the general regresson (GREG) estmator. Whle the Ksh desgn effects attempt to measure the mpact of varable weghts, as noted n Ksh (99), they are nformatve only under specal crcumstances, do not account for alternatve varables of nterest, and can ncorrectly measure the mpact of dfferental weghtng n some crcumstances. Spencer s approach holds for wth-replacement sngle-stage samplng for a very smple estmator of the total constructed wth nverse-probablty weghts wth no further adjustments. here are also few emprcal examples comparng these measures n the lterature. Henry and Vallant (03) extended ths to gauge the mpact of dfferental calbraton weghts n sngle-stage samples. In partcular, the exstng measures, revewed n Secton, may not accurately produce desgn effects for unequal weghtng nduced by calbraton adjustments. hese are often appled to reduce varances and correct for undercoverage and/or nonresponse n surveys (e.g., Särndal and Lundström 005; Kott 009). When the calbraton covarates are correlated wth the coverage/response mechansm, calbraton weghts can mprove the mean squared error (MSE) of an estmator. In many applcatons, snce calbraton nvolves element-level adjustments, calbraton weghts can vary more than the base weghts or categorybased nonresponse or poststratfcaton adjustments (Kalton and Cervantes-Flores 003; Brck and Montaqula 009). hus, an deal measure of the mpact of calbraton weghts ncorporates not only the correlaton between the survey varable of nterest y and the weghts, but also the correlaton between y and the calbraton covarates x to avod penalzng weghts for the mere sake that they vary. 564

2 JSM 04 - Survey Research Methods Secton We extend these exstng desgn effects to produce a new measure that summarzes the mpact of calbraton weght adjustments before and after they are appled to two-stage cluster-sample survey weghts. he proposed measure n Secton 3 accounts for the jont effect of a non-epsem sample desgn and unequal weght adjustments n the larger class of calbraton estmators. Our summary measure ncorporates the survey varable lke Spencer s model, but also uses a generalzed regresson varance to reflect multple calbraton covarates and the cluster sample desgn. In secton 4, we apply the estmators n a case study nvolvng household-type survey data and demonstrate emprcally how the proposed estmator outperforms the exstng methods n the presence of unequal calbraton weghts.. Exstng Methods In ths secton, we specfy notaton and summarze the exstng desgn effect measures for cluster samplng. he assumptons used to derve each of these are also presented... otaton We consder that a fnte populaton of M elements s parttoned nto clusters, wth szes M, and denoted by U, j :,,, j,, M. On element, j, an analyss varable y j s observed. he populaton total of the y and M j M y Y j, where y s s M y j j and the populaton varance s M M and Y M. We select a sample of n clusters m elements wthn cluster, usng two-stage samplng from U and obtan a set of s, j :,, n, j,, m respondents. he total sample sze of elements s m m. In the followng sectons, when a two-stage desgn s consdered, we assume that clusters are selected wth replacement. he selecton probablty of cluster on a sngle draw s p. Wthn cluster, the sample of m elements s selected va smple random samplng wthout replacement (srswor). Assumng that we have probablty-wth-replacement (pwr) samplng of clusters, the probablty of selecton for n clusters s approxmately p np (f p s not too large), where p s the one-draw selecton probablty. he second-stage selecton probablty s j m M for element j n cluster. hen the overall selecton probablty s approxmately j j npm M. We estmate the total from the sample usng wth the pwr-estmator ˆ n m M n m pwr, y y j j w j j y j np m, w M np m. where j n 565

3 JSM 04 - Survey Research Methods Secton.. GREG Weght Adjustments Case weghts resultng from calbraton on benchmark auxlary varables can be defned wth a global regresson model for the survey varables (see Kott 009 for a revew). Devlle and Särndal (99) proposed the calbraton approach that nvolves mnmzng a dstance functon between the base weghts and fnal weghts to obtan an optmal set of survey weghts. Specfyng alternatve calbraton dstance functons produces alternatve estmators. Suppose that a sngle-stage probablty sample of m elements s selected wth beng the selecton probablty of element and x a vector of p auxlares assocated wth element. A least squares dstance functon produces the general regresson estmator (GREG): ˆ ˆ ˆ B ˆ g y, () GREG Hy x Hx s where ˆHy s y s the Horvtz-hompson estmator of the populaton total of y, ˆHx s x s the vector of H estmated totals for the auxlary varables, x x s the correspondng vector of known totals, ˆ B A X V Π y s the regresson coeffcent, wth s s ss s s As XsVss Πs X s, X s s the matrx of x values n the sample, ss dag v varance matrx specfed under the workng model (defned below), and Π dag x Hx s expresson for the GREG estmator n (), g v case weghts are w g for each sample element. V s the dagonal of the s. In the second ˆ A x s the g-weght, such that the he GREG estmator for a total s model-unbased under the assocated workng model, y x β, ~ 0, v. he GREG s consstent and approxmately desgn-unbased when the sample sze s large (Devlle and Särndal 99). When the model s correct, the GREG estmator acheves effcency gans. If the model s ncorrect, then the effcency gans wll be dampened (or nonexstent) but the GREG estmator s stll approxmately desgn-unbased. Relevant to ths work, the varance of the GREG estmator can be used to approxmate the varance of any calbraton estmator (Devlle and Särndal 99) when the sample sze s large. hs result holds for multstage samplng and allows us to produce one desgn effect measure applcable to all estmators n the famly of calbraton estmators..3. Drect Desgn-Effect Measures For a gven non-epsem sample and estmator ˆ for the fnte populaton total, one defnton for the drect desgn effect (Ksh 965) s ˆ ˆ srswr ˆ srswr Deff Var Var. () where ˆsrswr ks k M y m s the expanson estmator under smple random samplng wth ˆsrswr replacement (srswr) and Var M y m. We refer to ths as a drect estmator because t uses theoretcal varances n the numerator and denomnator. he alternatves that are presented subsequently use varous approxmatons to the components n (). he desgn effect n () measures the sze of the 3 566

4 JSM 04 - Survey Research Methods Secton varance of the estmator ˆ under the desgn, relatve to the varance of the estmator of the same total f an srswr of the same sze had been used. We can approxmate the varance of any calbraton estmator ˆcal usng the approxmate varance of the GREG (Devlle and Särndal 99), n whch case Deff ˆ Var ˆ Var ˆ. (3) cal GREG cal srswr srswr As descrbed n Sec. 3, our proposed desgn effect s a model-based approxmaton to (3)..4. Ksh s Haphazard-Samplng Desgn-Effect for Unequal Sngle-stage Sample Weghts Ksh (965, 990) proposed the desgn effect due to weghtng as a measure to quantfy the loss of precson due to usng unequal and neffcent weghts. For,, w w w m denotng the weghts from a smple random sample wthout replacement (srswor) of m elements, ths measure s deff K w CV w, (4) m w s w s where s CV w m w w w s the coeffcent of varaton of the weghts, and w m w. Expresson (4) s derved from the rato of the varance of the weghted survey mean s under dsproportonate stratfed srswor to the varance under proportonate stratfed srswor when all stratum element varances are equal (Ksh 99). Wth equal stratum varances, samplng wth a proportonal allocaton to strata s optmal, whch leads to all elements havng the same weght..5. Ksh s Measure for Cluster Samplng Weghts Ksh (987) proposed a smlar measure for cluster samplng. Assume that there are G unque weghts n s such that the m g elements wthn each cluster have the same weght, denoted by w g w g for g,, G, m g s the number of elements wthn weghtng class g and m m g g s the total number of elements n the sample. We estmate the populaton mean Y M usng the weghted sample mean G G w g jg gj gj g jg gj y w y w. Ksh s (987) decomposton model for w assumes that the G weghtng classes are randomly ( haphazardly ) formed wth respect to y j, assumng that the y j have a common varance and that s s an equal-probablty sample n whch the varaton among the m g s wthn s s not sgnfcant. he resultng desgn effect s G y 4 567

5 JSM 04 - Survey Research Methods Secton G m w gm g g KC w c G w g gm g deff y m n, (5) where m n m s the average cluster sze and c the measure of ntra-cluster homogenety. he frst component n (5) s the cluster-sample equvalent of (4), and can be wrtten n a smlar form usng the squared CV of the weghts f mg. he second (5) component s the standard desgn effect due to cluster samplng (e.g., Ksh 965). Expresson (5) may not hold f there s varaton n the m g across clusters (Park 004) or moderate correlaton between the survey characterstc and weghts (Park and Lee 004)..6. Gabler et al. s Measure for Cluster Samplng Gabler et al. (999) used a model to justfy measure (5) that assumes y j s a realzaton from a one-way random effects model (.e., a one-way AOVA-type model wth only a random cluster-level ntercept term plus an error) that assumes the followng covarance structure:, j j Covyj, y j e, j j. (6) 0 Cov yj, y j for, j j and 0 otherwse. Gabler et al. (999) take the rato of the model-based varance of the weghted survey mean under a model wth covarance structure (6) to the varance under the uncorrelated errors verson and derve If the elements are uncorrelated, then (6) reduces to G m w gm g g G wm g g deff y m w g e g where where n G G g g g g g g g G m w gm g g UB deff yw mg e G wm g g g m w m w m, (7). hey also establshed an upper bound for (7): n G n G g g g g g g g m m w m w m, (8) s a weghted average of cluster szes. Park (004) further extends ths approach to three-stage samplng, assumng that a systematc samplng s used n the frst stage to select the clusters. Lynn and Gabler (005) provde examples of specal cases of (7), such as equal samplng/coverage/response rates across domans

6 JSM 04 - Survey Research Methods Secton Lynn and Gabler (005) rewrte 3 m g G G G g, g g g g g mvar g w g g mw g g g Cov m m w m m w n Gabler et al. (999) s desgn effect. Assumng certan restrctons on the weghts, they propose a survey plannng approxmaton, mg m CVm n CV w, where G G CV m G m m uses the squared CV of cluster sample szes across clusters, g g g G G and CV w m w m g g g g g g g w uses the squared CV of weghts across observatons..7. Park and Lee s Measures for Unequal Cluster Samplng Park and Lee (004) extend the Gabler et al. (999) desgn effect to account for unequal samplng weghts wthn a two-stage cluster sample that selects m elements from each PSU: where ˆ * Hy y deff m W SyB m S y SyB M S y S yb M Y Y * m0 Q Y CV y y, CV m y pm Y 0 varablty,, (9), S M M y Y y j j s the wthn-cluster s the across-cluster varablty among cluster means, M W CVy y Y, CVy Sy Y, and Y y j M. he term Q MM pm s equal to zero f p M M or the cluster probablty of selecton s proportonal to the cluster sze M. he equvalent of (9) for the weghted sample mean s * deffpl yw m0 Wd, (0) * Wd m0 CVy Q pm D Y CVd m0, CVd Sd D, where M j M D d M, and D d M for the transformed varable dj yj Y. j j.8. Spencer s Model-based Measure for Sngle-Stage pwr Samplng Spencer (000) derves a desgn-effect measure to more fully account for the effect on varances of weghts that are correlated wth the survey varable of nterest. hs measure was not developed for cluster samplng, but we used ths modelng approach n Sec. 3 for a proposed desgn effect. he sample s assumed to be selected wth varyng probabltes and wth replacement (pwr). Suppose that p s the one-draw probablty of selectng element, whch s correlated wth y and that a lnear model holds for 6 569

7 JSM 04 - Survey Research Methods Secton y : y p. A partcular case of ths would be p x, where x s a measure of sze assocated wth element. If the entre fnte populaton were avalable, then the ordnary least squares and, where, estmates of and are A Y BP B U y Y p P U p P Y P are the fnte populaton means for y and p. he fnte populaton varance of the resduals, e y A Bp e yp y Y yp y, where yp s the fnte, s U populaton correlaton between y and he estmated total studed by Spencer s, ˆpwr, y Var n p y p U p. he usual base weght under pwr-samplng s w np ˆpwr y s. w y, wth desgn-varance n sngle-stage samplng. Spencer substtuted the model-based values for y nto the pwr-estmator s varance and took ts rato to the varance of the estmated total usng srswr to produce the followng desgn effect for unequal weghtng (see Appendx n Spencer 000; modfed for our notaton): A mw mw m w ew e Amew Deff e w S yp. () M M y My My Assumng that the correlatons n the last two terms of () are neglgble, Spencer approxmates () W w n p wth the frst two terms n (), where U U s the average weght n the populaton. When yp s zero and y s large, measure () s approxmately equvalent to Ksh s measure (4). However, Spencer s method does ncorporate the survey varable y, unlke (4), and mplctly reflects the dependence of y on the selecton probabltes p. 3. Proposed Desgn Effect Measures Henry and Vallant s (03) approach n sngle-stage samplng can be extended to produce a new weghtng desgn effect measure for a calbraton estmator n cluster samplng. We produce the desgn effect n four steps: () constructng a lnear approxmaton to the GREG estmator; () obtanng the varance of ths lnear approxmaton; (3) substtutng our model-based components nto the GREG varance; and (4) takng the rato of the model-based varance to the varance of the pwr-estmator of the total under srswr. Frst, a lnearzaton approxmaton to the GREG estmator (Exp n Särndal et al. 99) assumng clusters are selected pwr s ˆ ˆ ˆ ˆ Bˆ GREG pwr, y x pwr, x ˆ pwr, y x pwr, x BU we s js j j x BU () 7 570

8 JSM 04 - Survey Research Methods Secton x s the known populaton total of x, ˆ pwr, x ej yj j u GREG x U w s j s je j where s the vector of pwr-estmators, B U s the populaton coeffcent, x B s the resdual for element, j, and x j s a row vector of calbraton covarates. From (), ˆ B. Under the two-stage samplng assumptons (Sec..), w M np m j and the approxmaton varance of the GREG n () s also the varance of ˆ GREG x BU M s j s ej npm, (3) wˆ where w s e np and ˆ M m e. he desgn-varance of (3) s where e js j ˆ eu M m M GREG U j U Var p E e e n p np m M M E U e j j e U M m E U S Ue n p np m M, eu e ju j, S M Ue M e j e U j, and M e e M U j j, (4). he true deff for ˆGREG s defned as deff ˆ Var ˆ Var ˆ. (5) rue GREG GREG srswr 3.. Vallant et al. s Relatve Varance Deff Vallant, Dever, and Kreuter (03, Sec. 9..3) gve a desgn effect for totals n cluster samples usng the rato of the relatve varance of the estmator of the total when clusters are selected pwr and m elements wthn clusters are selected va srswor over the equvalent expresson under srswr. her measure can be modfed by usng (4) whch uses the GREG-based resduals. Assumng that m elements are selected n each cluster, the relatve varance of the GREG (.e., the varance of the GREG over the squared total) can be rewrtten as RelVar ˆ ˆ GREG RelVar srswr ke e m, (6) RelVar ˆsrswr y nmy e Be Be We, Be p eu p EU s the where, We M SUe e e e e e y, where ej yj j U between-cluster relatve varance component, k B W RelVar B W s the wthn-cluster component, x B s the resdual for element 8 57

9 JSM 04 - Survey Research Methods Secton j, E U x j s a row vector of calbraton covarates (ncludng an ntercept f one s used), and e. Usng the form of the relatve varance n (6), the assocated desgn effect s U ju j ˆ deffvdk cal ke e m. (7) When calbraton s more effcent for a gven y -varable, the term k e wll be less than and acts as a dampenng factor, reducng the overall desgn effect. 3.. An Alternatve Deff We follow Spencer s approach and substtute model values n varance (4) to formulate a desgn-effect measure. However, here we substtute n the model-based equvalent to e, not y, as Spencer does. Suppose that the second-stage samplng fracton s neglgble,.e., m M 0. o match the theoretcal varance formulaton n (4), consder the model y A x B e, where A, B are the fnte populaton model parameters, and j j U j U j x from prevous sectons s equal to j j U x x. Smlar to Henry and Vallant (03), we smplfy thngs by reformulatng the model as uj yj x jbu, such that e j u j U and ncorporate UM not as a cluster-level (random) ntercept, smply an algebrac expresson. he model assumng a random ntercept and error term s equvalent to Gabler et al. s (999) random effects model. Substtutng the GREG resduals e j nto the varance (4) and takng ts rato to the varance of the pwr- estmator n srswr, Var ˆ M nm wll produce our approxmate desgn effect due to srs srswr y unequal calbraton weghtng. Assumng that the M M and the wthn-cluster samplng fractons are neglgble, we obtan the followng approxmate desgn effect: deff u nmw U U nm U G M y nm y M are large enough such that nmw u, w u u. (8) where u yj xjbu ju, M M uw u w nm M ws U M y M y m, U u, UM, M, S U M u u j j u, U u U 9 57

10 JSM 04 - Survey Research Methods Secton, u u U u u U between the terms wthn each subscrpt., w W W, and. he -components are the fnte populaton correlatons Some approxmatons to (8) exst. If the correlatons n (8) are neglgble, then we obtan mw u U U m U M ws m U deff u G0 M M, whch can m y nm y M y be estmated wth the correlaton terms removed from (8). Assumng that M are close enough such that M M, M M, and 0, then (8) becomes mw u U M U m U deffg M y y M y nm w M uw u Uuuw nm M ws Uu. (9) M m y M y he Ksh measure s also a specal case of (9), when there are no cluster-level effects. In partcular, suppose that for all elements x j 0, and there s no cluster samplng. hen, uj yj,, U M MY, u y, and 0, and (9) reduces to nmw M. he estmator of ths from deff w, defned n (4). a partcular sample s Ksh s measure, K When the relatonshp between the calbraton covarates x and y s strong, the varance u should be smaller than y. In ths case, measure (8) s smaller than the Ksh and Park and Lee estmates. Varable weghts produced from calbraton adjustments are thus not as penalzed (shown by overly hgh desgn effects) as they would be usng the Ksh. However, f we have neffectve calbraton, or a weak relatonshp between x and y, then u can be greater than y, producng a desgn effect greater than one. hs s llustrated n Sec. 4 wth a populaton that mmcs household-type data. We also examne the extent to whch the correlaton components n our proposed desgn effect (8) are sgnfcant, or large enough to nfluence the exact measure. u y 4. Emprcal Example Usng Household Data he MDarea.pop dataset n the R Pracools package (Vallant et al. 04) s used n ths secton as an example. hs dataset contans 403,997 persons generated from data from the 000 decennal U.S. Census for Anne Arundel County n the state of Maryland and s descrbed n detal n Vallant et al. (03). Indvdual values for each person were generated usng models. Groupngs to form the varables PSU and SSU were done after sortng the census fle by tract and block group wthn tract

11 JSM 04 - Survey Research Methods Secton We treated block groups wthn Census tracts as the cluster (PSU) and selected persons as the elements (SSU). PSUs wth a small number of persons (less than 500) were excluded, leavng a pseudo-populaton of 74 PSU s and 397,065 elements. A poststratfed estmator was used, wth poststrata defned by the cross-classfcaton of gender and 5 age groups (less than 5, 5-9, 0-4, 5-7, 8-, -4, 5-9, 30-34, 35-39, 40-44, 45-49, 50-54, 55-59, 60-64, 65 and older). hus, the weghtng groups are post-strata, whch cut across the PSU s and SSU s. We examne sx varables of nterest, y, y, and y3, fcttous contnuous varables on the fle, created an addtonal varable y4 usng a lnear model wth the poststratfcaton varables as covarates, and two bnary varables ndcatng presence/absence of nsurance coverage and a hosptal stay. Fgure shows a parwse plot of the pseudo-populaton, ncludng plots of the varable values aganst each other n the lower left panels, hstograms on the dagonal panels, and the correlatons among the varables n the upper rght panels. hs plot mmcs household-type data patterns. Fgure. Pseudopopulaton Values and Loess Lnes for Desgn Effect Evaluaton Eght desgn effects are compared, wth results shown n able : he drect desgn effect measure deff rue computed from (5). hs reflects the combned effects of cluster samplng and poststratfcaton; he Ksh measure he Ksh measure Park and Lee s measure deff K (4) computed usng the GREG weghts deff KC (5) for cluster samplng; deff PL n (0); deff VDK n (7); 574

12 JSM 04 - Survey Research Methods Secton hree proposed measures: () deff G, the exact proposed desgn effect n (8), () deff G0, the zerocorrelaton approxmaton to deff G, and deff G, the equal-cluster szed approxmaton (9). All of these are meant to show the precson gans (f any) of the cluster samplng combned wth GREG estmaton. ote that we do not nclude the Gabler et al. (999) or Lynn and Gabler (005) measures, whch correspond to desgn effects of the weghted survey mean. Park and Lee (004) descrbe the crcumstances n whch desgn effects for means and totals are dfferent. Snce our proposed desgn effect s for estmaton of totals, we focus our emprcal comparson on comparng t to only the exstng measures for totals. he results are shown n able. able. Populaton Desgn Effects, by Varable of Interest Varable of Interest Desgn Effects y * y * y 3 * y 4 * Hosptal Stay** Insurance Coverage** deff rue Ksh Sngle-stage deff K Cluster deff Park and Lee Vallant et al. KC deff PL deff VDK Proposed Exact deff G Zero-corr. approx. deff G Equal cluster approx. deff G * contnuous varables; ** categorcal varables he true populaton desgn effects, denoted by deff rue, for the calbraton totals range from 0. for y 4 to 5. for y 3. hese are populaton values, not subject to samplng varablty. otably they are less than one for y and y 4, whch means that poststratfyng by age and gender wll mprove estmates of these varables totals. he Ksh sngle-stage K CV w, whch s constant regardless of the varable of nterest. he cluster sample extenson deff are all very large, 3.0 or hgher. he Park and Lee deff s smply KC 575

13 JSM 04 - Survey Research Methods Secton deff PL s account for the unequal cluster weghts but all exceed one. hs also occurred for the approxmatons, deff G0 and deff G. hs means for all of these alternatves, the deff s are too large. In partcular, one would ncorrectly beleve that calbraton would not be benefcal for y and y 4. he Vallant et al. deff VDK equals deff rue snce they are based on the same formula; the alternatve deff. estmator deff G s very close to rue Whle the Park and Lee deff PL was not desgned to capture the gans n calbraton, t could by ncorporatng the k e -factor used n deff VDK. he other components between these two deff s were nearly dentcal. hs s shown n able. able. Vallant et al. and Park and Lee Deff Components, by Varable of Interest Varable of Interest Desgn Effect Component y * y * y 3 * y 4 * Hosptal Insurance Stay** Coverage** Park and Lee deff PL * W y m Vallant et al. e deff VDK k e m e ke me * contnuous varables; ** categorcal varables he and e values are close, such that dfferences between these deff s are due to the k e -factor. Whle the Park and Lee deff PL was not desgned to account for calbraton effcency, able demonstrates how t can easly be adapted by ncorporatng k e. 5. Dscusson, Lmtatons, and Conclusons We propose new desgn effects that gauge the mpact of calbraton weghtng adjustments on an estmated total n cluster samplng. Exstng desgn effects nclude Ksh s (965) desgn effect due to weghtng and Park and Lee s (004) for unequal two-stage cluster weghts. ether of these reflect effcency gans due to calbraton. he Ksh deff s a reasonable measure f equal weghtng s optmal or nearly so, but does not reveal effcences that may accrue from samplng wth varyng probabltes. he Park and Lee deff does sgnal whether the H (or pwr) estmator n varyng probablty samplng s more effcent than srs, but does not reflect any gans from usng a calbraton estmator

14 JSM 04 - Survey Research Methods Secton he new desgn effects ntroduced n Sec. 3 measure the mpact of both samplng wth varyng probabltes and of usng a calbraton estmator, lke the GREG, that takes advantage of auxlary nformaton. As we demonstrate emprcally, the proposed desgn effects do not penalze unequal weghts when the relatonshp between the survey varable and calbraton covarate s strong. We also demonstrated emprcally that the correlaton components n the proposed measure deff G can be mportant n some stuatons. It s not overly dffcult to calculate these components, so we recommend ncorporatng them when possble to avod overly hgh estmates of the desgn effects. In cases where the auxlary nformaton s neffectve or s not used, the proposed measure approxmates Ksh s deff. One of our proposed measures, deff G, uses the model underlyng the general regresson estmator to extend the Spencer measure. he other alternatve, deff VDK, has a form smlar to the famlar m found n many texts. he survey varable, covarates, and weghts are requred to produce both new desgn effects. Snce the varance (4) s approxmately correct n large samples for all calbraton estmators, the new desgn effects should reflect the effects of many forms of commonly used weghtng adjustment methods, ncludng poststratfcaton, other forms of the GREG estmator, and rakng. Although desgn effects that do account for these adjustments can be computed drectly from estmated varances, t s mportant for practtoners to understand that the exstng deff s do not reflect any gans from those adjustments. he alternatve deff s ntroduced n ths paper, thus, serve as a correctve to that defcency. K References Brck, M.J., and Montaqula, J. M. (009). onresponse and weghtng n D. Pfeffermann and C. R. Rao (eds.), Handbook of Statstcs, Sample Surveys: Desgn, Methods and Applcatons, 9A. Amsterdam: Elsever BV. Devlle, J.-C. and Särndal, C. E. (99). Calbraton Estmators n Survey Samplng. Journal of the Amercan Statstcal Assocaton, 87, Gabler, S., Haeder, S., and Lahr, P. (999). A model-based justfcaton of Ksh s formula for desgn effects for weghtng and clusterng. Survey Methodology, 5, Henry, K., and Vallant, R. (03). A Desgn Effect Measure for Calbraton Weghts n Sngle-Stage Samples. Proceedngs of the Secton on Survey Research Methods, Amercan Statstcal Assocaton, to appear. Kalton, G., and Flores-Cervantes, I. (003). Weghtng Methods. Journal of Offcal Statstcs, 9: Ksh, L. (965). Survey Samplng, ew York: John Wley & Sons. Ksh, L. (987). Weghtng n Deft. he Survey Statstcan, June

15 JSM 04 - Survey Research Methods Secton Ksh, L. (990). Weghtng: Why, When, and How? Proceedngs of the Jont Statstcal Meetngs, Secton on Survey Research Methods, Amercan Statstcal Assocaton, -9. Ksh, L. (99). Weghtng for unequal P. Journal of Offcal Statstcs, 8, Kott, P. (009). Calbraton weghtng: combnng probablty samples and lnear predcton models, n D. Pfeffermann and C. R. Rao (eds.), Handbook of Statstcs, Sample Surveys: Inference and Analyss, 9B. Amsterdam: Elsever BV. Lynn, P. and Gabler, S. (005). Approxmatons to b* n the predcton of Desgn effects due to clusterng, Survey Methodology, 3, Park, I. (004). Assessng Complex Sample Desgns va Desgn Effect Decompostons. Proceedngs of the Secton on Survey Research Methods, Amercan Statstcal Assocaton, Park, I. and Lee, H. (004). Desgn effects for the weghted mean and total estmators under complex survey samplng. Survey Methodology, 30, Särndal, C. E., and Lundström, S. (005). Estmaton n Surveys wth onresponse, ew York: John Wley and Sons. Särndal, C.E., Swensson, B. and Wretman, J. (99). Model Asssted Survey Samplng, Berln: Sprnger. Spencer, B. D. (000). An Approxmate Desgn Effect for Unequal Weghtng When Measurements May Correlate Wth Selecton Probabltes. Survey Methodology, 6, Vallant, R., Dever, J. A., and Kreuter, F., (03). Practcal ools for Desgnng and Weghtng Survey Samples. ew York: Sprnger. Vallant, R., Dever, J.A., and Kreuter, F. (04). Pracools: ools for Desgnng and Weghtng Survey Samples. R package verson

Estimation: Part 2. Chapter GREG estimation

Estimation: Part 2. Chapter GREG estimation Chapter 9 Estmaton: Part 2 9. GREG estmaton In Chapter 8, we have seen that the regresson estmator s an effcent estmator when there s a lnear relatonshp between y and x. In ths chapter, we generalzed the

More information

Comparison of Regression Lines

Comparison of Regression Lines STATGRAPHICS Rev. 9/13/2013 Comparson of Regresson Lnes Summary... 1 Data Input... 3 Analyss Summary... 4 Plot of Ftted Model... 6 Condtonal Sums of Squares... 6 Analyss Optons... 7 Forecasts... 8 Confdence

More information

Efficient nonresponse weighting adjustment using estimated response probability

Efficient nonresponse weighting adjustment using estimated response probability Effcent nonresponse weghtng adjustment usng estmated response probablty Jae Kwang Km Department of Appled Statstcs, Yonse Unversty, Seoul, 120-749, KOREA Key Words: Regresson estmator, Propensty score,

More information

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4)

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4) I. Classcal Assumptons Econ7 Appled Econometrcs Topc 3: Classcal Model (Studenmund, Chapter 4) We have defned OLS and studed some algebrac propertes of OLS. In ths topc we wll study statstcal propertes

More information

Discussion of Extensions of the Gauss-Markov Theorem to the Case of Stochastic Regression Coefficients Ed Stanek

Discussion of Extensions of the Gauss-Markov Theorem to the Case of Stochastic Regression Coefficients Ed Stanek Dscusson of Extensons of the Gauss-arkov Theorem to the Case of Stochastc Regresson Coeffcents Ed Stanek Introducton Pfeffermann (984 dscusses extensons to the Gauss-arkov Theorem n settngs where regresson

More information

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction ECONOMICS 5* -- NOTE (Summary) ECON 5* -- NOTE The Multple Classcal Lnear Regresson Model (CLRM): Specfcaton and Assumptons. Introducton CLRM stands for the Classcal Lnear Regresson Model. The CLRM s also

More information

USE OF DOUBLE SAMPLING SCHEME IN ESTIMATING THE MEAN OF STRATIFIED POPULATION UNDER NON-RESPONSE

USE OF DOUBLE SAMPLING SCHEME IN ESTIMATING THE MEAN OF STRATIFIED POPULATION UNDER NON-RESPONSE STATISTICA, anno LXXV, n. 4, 015 USE OF DOUBLE SAMPLING SCHEME IN ESTIMATING THE MEAN OF STRATIFIED POPULATION UNDER NON-RESPONSE Manoj K. Chaudhary 1 Department of Statstcs, Banaras Hndu Unversty, Varanas,

More information

Lecture 6: Introduction to Linear Regression

Lecture 6: Introduction to Linear Regression Lecture 6: Introducton to Lnear Regresson An Manchakul amancha@jhsph.edu 24 Aprl 27 Lnear regresson: man dea Lnear regresson can be used to study an outcome as a lnear functon of a predctor Example: 6

More information

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers Psychology 282 Lecture #24 Outlne Regresson Dagnostcs: Outlers In an earler lecture we studed the statstcal assumptons underlyng the regresson model, ncludng the followng ponts: Formal statement of assumptons.

More information

Computation of Higher Order Moments from Two Multinomial Overdispersion Likelihood Models

Computation of Higher Order Moments from Two Multinomial Overdispersion Likelihood Models Computaton of Hgher Order Moments from Two Multnomal Overdsperson Lkelhood Models BY J. T. NEWCOMER, N. K. NEERCHAL Department of Mathematcs and Statstcs, Unversty of Maryland, Baltmore County, Baltmore,

More information

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity LINEAR REGRESSION ANALYSIS MODULE IX Lecture - 30 Multcollnearty Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur 2 Remedes for multcollnearty Varous technques have

More information

Chapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise.

Chapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise. Chapter - The Smple Lnear Regresson Model The lnear regresson equaton s: where y + = β + β e for =,..., y and are observable varables e s a random error How can an estmaton rule be constructed for the

More information

Econ Statistical Properties of the OLS estimator. Sanjaya DeSilva

Econ Statistical Properties of the OLS estimator. Sanjaya DeSilva Econ 39 - Statstcal Propertes of the OLS estmator Sanjaya DeSlva September, 008 1 Overvew Recall that the true regresson model s Y = β 0 + β 1 X + u (1) Applyng the OLS method to a sample of data, we estmate

More information

A note on regression estimation with unknown population size

A note on regression estimation with unknown population size Statstcs Publcatons Statstcs 6-016 A note on regresson estmaton wth unknown populaton sze Mchael A. Hdroglou Statstcs Canada Jae Kwang Km Iowa State Unversty jkm@astate.edu Chrstan Olver Nambeu Statstcs

More information

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands Content. Inference on Regresson Parameters a. Fndng Mean, s.d and covarance amongst estmates.. Confdence Intervals and Workng Hotellng Bands 3. Cochran s Theorem 4. General Lnear Testng 5. Measures of

More information

x i1 =1 for all i (the constant ).

x i1 =1 for all i (the constant ). Chapter 5 The Multple Regresson Model Consder an economc model where the dependent varable s a functon of K explanatory varables. The economc model has the form: y = f ( x,x,..., ) xk Approxmate ths by

More information

Exponential Type Product Estimator for Finite Population Mean with Information on Auxiliary Attribute

Exponential Type Product Estimator for Finite Population Mean with Information on Auxiliary Attribute Avalable at http://pvamu.edu/aam Appl. Appl. Math. ISSN: 193-9466 Vol. 10, Issue 1 (June 015), pp. 106-113 Applcatons and Appled Mathematcs: An Internatonal Journal (AAM) Exponental Tpe Product Estmator

More information

A Bound for the Relative Bias of the Design Effect

A Bound for the Relative Bias of the Design Effect A Bound for the Relatve Bas of the Desgn Effect Alberto Padlla Banco de Méxco Abstract Desgn effects are typcally used to compute sample szes or standard errors from complex surveys. In ths paper, we show

More information

Statistics for Economics & Business

Statistics for Economics & Business Statstcs for Economcs & Busness Smple Lnear Regresson Learnng Objectves In ths chapter, you learn: How to use regresson analyss to predct the value of a dependent varable based on an ndependent varable

More information

Composite Hypotheses testing

Composite Hypotheses testing Composte ypotheses testng In many hypothess testng problems there are many possble dstrbutons that can occur under each of the hypotheses. The output of the source s a set of parameters (ponts n a parameter

More information

Linear Approximation with Regularization and Moving Least Squares

Linear Approximation with Regularization and Moving Least Squares Lnear Approxmaton wth Regularzaton and Movng Least Squares Igor Grešovn May 007 Revson 4.6 (Revson : March 004). 5 4 3 0.5 3 3.5 4 Contents: Lnear Fttng...4. Weghted Least Squares n Functon Approxmaton...

More information

x = , so that calculated

x = , so that calculated Stat 4, secton Sngle Factor ANOVA notes by Tm Plachowsk n chapter 8 we conducted hypothess tests n whch we compared a sngle sample s mean or proporton to some hypotheszed value Chapter 9 expanded ths to

More information

Sampling Theory MODULE VII LECTURE - 23 VARYING PROBABILITY SAMPLING

Sampling Theory MODULE VII LECTURE - 23 VARYING PROBABILITY SAMPLING Samplng heory MODULE VII LECURE - 3 VARYIG PROBABILIY SAMPLIG DR. SHALABH DEPARME OF MAHEMAICS AD SAISICS IDIA ISIUE OF ECHOLOGY KAPUR he smple random samplng scheme provdes a random sample where every

More information

e i is a random error

e i is a random error Chapter - The Smple Lnear Regresson Model The lnear regresson equaton s: where + β + β e for,..., and are observable varables e s a random error How can an estmaton rule be constructed for the unknown

More information

Negative Binomial Regression

Negative Binomial Regression STATGRAPHICS Rev. 9/16/2013 Negatve Bnomal Regresson Summary... 1 Data Input... 3 Statstcal Model... 3 Analyss Summary... 4 Analyss Optons... 7 Plot of Ftted Model... 8 Observed Versus Predcted... 10 Predctons...

More information

Chapter 11: Simple Linear Regression and Correlation

Chapter 11: Simple Linear Regression and Correlation Chapter 11: Smple Lnear Regresson and Correlaton 11-1 Emprcal Models 11-2 Smple Lnear Regresson 11-3 Propertes of the Least Squares Estmators 11-4 Hypothess Test n Smple Lnear Regresson 11-4.1 Use of t-tests

More information

Chapter 13: Multiple Regression

Chapter 13: Multiple Regression Chapter 13: Multple Regresson 13.1 Developng the multple-regresson Model The general model can be descrbed as: It smplfes for two ndependent varables: The sample ft parameter b 0, b 1, and b are used to

More information

Economics 130. Lecture 4 Simple Linear Regression Continued

Economics 130. Lecture 4 Simple Linear Regression Continued Economcs 130 Lecture 4 Contnued Readngs for Week 4 Text, Chapter and 3. We contnue wth addressng our second ssue + add n how we evaluate these relatonshps: Where do we get data to do ths analyss? How do

More information

Robust Small Area Estimation Using a Mixture Model

Robust Small Area Estimation Using a Mixture Model Robust Small Area Estmaton Usng a Mxture Model Jule Gershunskaya U.S. Bureau of Labor Statstcs Partha Lahr JPSM, Unversty of Maryland, College Park, USA ISI Meetng, Dubln, August 23, 2011 Parameter of

More information

Weighted Estimating Equations with Response Propensities in Terms of Covariates Observed only for Responders

Weighted Estimating Equations with Response Propensities in Terms of Covariates Observed only for Responders Weghted Estmatng Equatons wth Response Propenstes n Terms of Covarates Observed only for Responders Erc V. Slud, U.S. Census Bureau, CSRM Unv. of Maryland, Mathematcs Dept. NISS Mssng Data Workshop, November

More information

Population element: 1 2 N. 1.1 Sampling with Replacement: Hansen-Hurwitz Estimator(HH)

Population element: 1 2 N. 1.1 Sampling with Replacement: Hansen-Hurwitz Estimator(HH) Chapter 1 Samplng wth Unequal Probabltes Notaton: Populaton element: 1 2 N varable of nterest Y : y1 y2 y N Let s be a sample of elements drawn by a gven samplng method. In other words, s s a subset of

More information

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6 Department of Quanttatve Methods & Informaton Systems Tme Seres and Ther Components QMIS 30 Chapter 6 Fall 00 Dr. Mohammad Zanal These sldes were modfed from ther orgnal source for educatonal purpose only.

More information

Multivariate Ratio Estimator of the Population Total under Stratified Random Sampling

Multivariate Ratio Estimator of the Population Total under Stratified Random Sampling Open Journal of Statstcs, 0,, 300-304 ttp://dx.do.org/0.436/ojs.0.3036 Publsed Onlne July 0 (ttp://www.scrp.org/journal/ojs) Multvarate Rato Estmator of te Populaton Total under Stratfed Random Samplng

More information

Resource Allocation and Decision Analysis (ECON 8010) Spring 2014 Foundations of Regression Analysis

Resource Allocation and Decision Analysis (ECON 8010) Spring 2014 Foundations of Regression Analysis Resource Allocaton and Decson Analss (ECON 800) Sprng 04 Foundatons of Regresson Analss Readng: Regresson Analss (ECON 800 Coursepak, Page 3) Defntons and Concepts: Regresson Analss statstcal technques

More information

Small Area Estimation for Business Surveys

Small Area Estimation for Business Surveys ASA Secton on Survey Research Methods Small Area Estmaton for Busness Surveys Hukum Chandra Southampton Statstcal Scences Research Insttute, Unversty of Southampton Hghfeld, Southampton-SO17 1BJ, U.K.

More information

A Comparative Study for Estimation Parameters in Panel Data Model

A Comparative Study for Estimation Parameters in Panel Data Model A Comparatve Study for Estmaton Parameters n Panel Data Model Ahmed H. Youssef and Mohamed R. Abonazel hs paper examnes the panel data models when the regresson coeffcents are fxed random and mxed and

More information

/ n ) are compared. The logic is: if the two

/ n ) are compared. The logic is: if the two STAT C141, Sprng 2005 Lecture 13 Two sample tests One sample tests: examples of goodness of ft tests, where we are testng whether our data supports predctons. Two sample tests: called as tests of ndependence

More information

Lecture 4 Hypothesis Testing

Lecture 4 Hypothesis Testing Lecture 4 Hypothess Testng We may wsh to test pror hypotheses about the coeffcents we estmate. We can use the estmates to test whether the data rejects our hypothess. An example mght be that we wsh to

More information

Chapter 5 Multilevel Models

Chapter 5 Multilevel Models Chapter 5 Multlevel Models 5.1 Cross-sectonal multlevel models 5.1.1 Two-level models 5.1.2 Multple level models 5.1.3 Multple level modelng n other felds 5.2 Longtudnal multlevel models 5.2.1 Two-level

More information

LECTURE 9 CANONICAL CORRELATION ANALYSIS

LECTURE 9 CANONICAL CORRELATION ANALYSIS LECURE 9 CANONICAL CORRELAION ANALYSIS Introducton he concept of canoncal correlaton arses when we want to quantfy the assocatons between two sets of varables. For example, suppose that the frst set of

More information

Using the estimated penetrances to determine the range of the underlying genetic model in casecontrol

Using the estimated penetrances to determine the range of the underlying genetic model in casecontrol Georgetown Unversty From the SelectedWorks of Mark J Meyer 8 Usng the estmated penetrances to determne the range of the underlyng genetc model n casecontrol desgn Mark J Meyer Neal Jeffres Gang Zheng Avalable

More information

j) = 1 (note sigma notation) ii. Continuous random variable (e.g. Normal distribution) 1. density function: f ( x) 0 and f ( x) dx = 1

j) = 1 (note sigma notation) ii. Continuous random variable (e.g. Normal distribution) 1. density function: f ( x) 0 and f ( x) dx = 1 Random varables Measure of central tendences and varablty (means and varances) Jont densty functons and ndependence Measures of assocaton (covarance and correlaton) Interestng result Condtonal dstrbutons

More information

A Robust Method for Calculating the Correlation Coefficient

A Robust Method for Calculating the Correlation Coefficient A Robust Method for Calculatng the Correlaton Coeffcent E.B. Nven and C. V. Deutsch Relatonshps between prmary and secondary data are frequently quantfed usng the correlaton coeffcent; however, the tradtonal

More information

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analyss of Varance and Desgn of Experment-I MODULE VIII LECTURE - 34 ANALYSIS OF VARIANCE IN RANDOM-EFFECTS MODEL AND MIXED-EFFECTS EFFECTS MODEL Dr Shalabh Department of Mathematcs and Statstcs Indan

More information

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding Recall: man dea of lnear regresson Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 8 Lnear regresson can be used to study an

More information

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 008 Recall: man dea of lnear regresson Lnear regresson can be used to study

More information

Lecture 3 Stat102, Spring 2007

Lecture 3 Stat102, Spring 2007 Lecture 3 Stat0, Sprng 007 Chapter 3. 3.: Introducton to regresson analyss Lnear regresson as a descrptve technque The least-squares equatons Chapter 3.3 Samplng dstrbuton of b 0, b. Contnued n net lecture

More information

Parametric fractional imputation for missing data analysis. Jae Kwang Kim Survey Working Group Seminar March 29, 2010

Parametric fractional imputation for missing data analysis. Jae Kwang Kim Survey Working Group Seminar March 29, 2010 Parametrc fractonal mputaton for mssng data analyss Jae Kwang Km Survey Workng Group Semnar March 29, 2010 1 Outlne Introducton Proposed method Fractonal mputaton Approxmaton Varance estmaton Multple mputaton

More information

Kernel Methods and SVMs Extension

Kernel Methods and SVMs Extension Kernel Methods and SVMs Extenson The purpose of ths document s to revew materal covered n Machne Learnng 1 Supervsed Learnng regardng support vector machnes (SVMs). Ths document also provdes a general

More information

Sampling Theory MODULE V LECTURE - 17 RATIO AND PRODUCT METHODS OF ESTIMATION

Sampling Theory MODULE V LECTURE - 17 RATIO AND PRODUCT METHODS OF ESTIMATION Samplng Theory MODULE V LECTURE - 7 RATIO AND PRODUCT METHODS OF ESTIMATION DR. SHALABH DEPARTMENT OF MATHEMATICS AND STATISTICS INDIAN INSTITUTE OF TECHNOLOG KANPUR Propertes of separate rato estmator:

More information

9. Binary Dependent Variables

9. Binary Dependent Variables 9. Bnar Dependent Varables 9. Homogeneous models Log, prob models Inference Tax preparers 9.2 Random effects models 9.3 Fxed effects models 9.4 Margnal models and GEE Appendx 9A - Lkelhood calculatons

More information

Statistics II Final Exam 26/6/18

Statistics II Final Exam 26/6/18 Statstcs II Fnal Exam 26/6/18 Academc Year 2017/18 Solutons Exam duraton: 2 h 30 mn 1. (3 ponts) A town hall s conductng a study to determne the amount of leftover food produced by the restaurants n the

More information

Simulated Power of the Discrete Cramér-von Mises Goodness-of-Fit Tests

Simulated Power of the Discrete Cramér-von Mises Goodness-of-Fit Tests Smulated of the Cramér-von Mses Goodness-of-Ft Tests Steele, M., Chaselng, J. and 3 Hurst, C. School of Mathematcal and Physcal Scences, James Cook Unversty, Australan School of Envronmental Studes, Grffth

More information

Durban Watson for Testing the Lack-of-Fit of Polynomial Regression Models without Replications

Durban Watson for Testing the Lack-of-Fit of Polynomial Regression Models without Replications Durban Watson for Testng the Lack-of-Ft of Polynomal Regresson Models wthout Replcatons Ruba A. Alyaf, Maha A. Omar, Abdullah A. Al-Shha ralyaf@ksu.edu.sa, maomar@ksu.edu.sa, aalshha@ksu.edu.sa Department

More information

Chapter 8 Indicator Variables

Chapter 8 Indicator Variables Chapter 8 Indcator Varables In general, e explanatory varables n any regresson analyss are assumed to be quanttatve n nature. For example, e varables lke temperature, dstance, age etc. are quanttatve n

More information

Conditional and unconditional models in modelassisted estimation of finite population totals

Conditional and unconditional models in modelassisted estimation of finite population totals Unversty of Wollongong Research Onlne Faculty of Informatcs - Papers Archve) Faculty of Engneerng and Informaton Scences 2011 Condtonal and uncondtonal models n modelasssted estmaton of fnte populaton

More information

NUMERICAL DIFFERENTIATION

NUMERICAL DIFFERENTIATION NUMERICAL DIFFERENTIATION 1 Introducton Dfferentaton s a method to compute the rate at whch a dependent output y changes wth respect to the change n the ndependent nput x. Ths rate of change s called the

More information

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics ECOOMICS 35*-A Md-Term Exam -- Fall Term 000 Page of 3 pages QUEE'S UIVERSITY AT KIGSTO Department of Economcs ECOOMICS 35* - Secton A Introductory Econometrcs Fall Term 000 MID-TERM EAM ASWERS MG Abbott

More information

Number of cases Number of factors Number of covariates Number of levels of factor i. Value of the dependent variable for case k

Number of cases Number of factors Number of covariates Number of levels of factor i. Value of the dependent variable for case k ANOVA Model and Matrx Computatons Notaton The followng notaton s used throughout ths chapter unless otherwse stated: N F CN Y Z j w W Number of cases Number of factors Number of covarates Number of levels

More information

Linear Regression Analysis: Terminology and Notation

Linear Regression Analysis: Terminology and Notation ECON 35* -- Secton : Basc Concepts of Regresson Analyss (Page ) Lnear Regresson Analyss: Termnology and Notaton Consder the generc verson of the smple (two-varable) lnear regresson model. It s represented

More information

Chapter 15 - Multiple Regression

Chapter 15 - Multiple Regression Chapter - Multple Regresson Chapter - Multple Regresson Multple Regresson Model The equaton that descrbes how the dependent varable y s related to the ndependent varables x, x,... x p and an error term

More information

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X Statstcs 1: Probablty Theory II 37 3 EPECTATION OF SEVERAL RANDOM VARIABLES As n Probablty Theory I, the nterest n most stuatons les not on the actual dstrbuton of a random vector, but rather on a number

More information

Variance Estimation for the General Regression Estimator

Variance Estimation for the General Regression Estimator Varance Estmaton for the General Regresson Estmator Rchard Vallant Westat 1650 Research Boulevard Rockvlle MD 0850 and Jont Program for Survey Methodology Unversty of Maryland Unversty of Mchgan February

More information

Chapter 14 Simple Linear Regression Page 1. Introduction to regression analysis 14-2

Chapter 14 Simple Linear Regression Page 1. Introduction to regression analysis 14-2 Chapter 4 Smple Lnear Regresson Page. Introducton to regresson analyss 4- The Regresson Equaton. Lnear Functons 4-4 3. Estmaton and nterpretaton of model parameters 4-6 4. Inference on the model parameters

More information

β0 + β1xi and want to estimate the unknown

β0 + β1xi and want to estimate the unknown SLR Models Estmaton Those OLS Estmates Estmators (e ante) v. estmates (e post) The Smple Lnear Regresson (SLR) Condtons -4 An Asde: The Populaton Regresson Functon B and B are Lnear Estmators (condtonal

More information

Primer on High-Order Moment Estimators

Primer on High-Order Moment Estimators Prmer on Hgh-Order Moment Estmators Ton M. Whted July 2007 The Errors-n-Varables Model We wll start wth the classcal EIV for one msmeasured regressor. The general case s n Erckson and Whted Econometrc

More information

2016 Wiley. Study Session 2: Ethical and Professional Standards Application

2016 Wiley. Study Session 2: Ethical and Professional Standards Application 6 Wley Study Sesson : Ethcal and Professonal Standards Applcaton LESSON : CORRECTION ANALYSIS Readng 9: Correlaton and Regresson LOS 9a: Calculate and nterpret a sample covarance and a sample correlaton

More information

JAB Chain. Long-tail claims development. ASTIN - September 2005 B.Verdier A. Klinger

JAB Chain. Long-tail claims development. ASTIN - September 2005 B.Verdier A. Klinger JAB Chan Long-tal clams development ASTIN - September 2005 B.Verder A. Klnger Outlne Chan Ladder : comments A frst soluton: Munch Chan Ladder JAB Chan Chan Ladder: Comments Black lne: average pad to ncurred

More information

Topic 23 - Randomized Complete Block Designs (RCBD)

Topic 23 - Randomized Complete Block Designs (RCBD) Topc 3 ANOVA (III) 3-1 Topc 3 - Randomzed Complete Block Desgns (RCBD) Defn: A Randomzed Complete Block Desgn s a varant of the completely randomzed desgn (CRD) that we recently learned. In ths desgn,

More information

Appendix B: Resampling Algorithms

Appendix B: Resampling Algorithms 407 Appendx B: Resamplng Algorthms A common problem of all partcle flters s the degeneracy of weghts, whch conssts of the unbounded ncrease of the varance of the mportance weghts ω [ ] of the partcles

More information

Estimating Design Effects in Small Areas/Domains through Aggregation

Estimating Design Effects in Small Areas/Domains through Aggregation Estmatng Desgn Effects n Small Areas/Domans through Aggregaton Jerry J. Maples Abstract Estmates of drect survey varances for areas wth small sample szes s problematc especally when combned wth complex

More information

Uncertainty as the Overlap of Alternate Conditional Distributions

Uncertainty as the Overlap of Alternate Conditional Distributions Uncertanty as the Overlap of Alternate Condtonal Dstrbutons Olena Babak and Clayton V. Deutsch Centre for Computatonal Geostatstcs Department of Cvl & Envronmental Engneerng Unversty of Alberta An mportant

More information

Chapter 7 Generalized and Weighted Least Squares Estimation. In this method, the deviation between the observed and expected values of

Chapter 7 Generalized and Weighted Least Squares Estimation. In this method, the deviation between the observed and expected values of Chapter 7 Generalzed and Weghted Least Squares Estmaton The usual lnear regresson model assumes that all the random error components are dentcally and ndependently dstrbuted wth constant varance. When

More information

Generalized Linear Methods

Generalized Linear Methods Generalzed Lnear Methods 1 Introducton In the Ensemble Methods the general dea s that usng a combnaton of several weak learner one could make a better learner. More formally, assume that we have a set

More information

Introduction to Regression

Introduction to Regression Introducton to Regresson Dr Tom Ilvento Department of Food and Resource Economcs Overvew The last part of the course wll focus on Regresson Analyss Ths s one of the more powerful statstcal technques Provdes

More information

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA 4 Analyss of Varance (ANOVA) 5 ANOVA 51 Introducton ANOVA ANOVA s a way to estmate and test the means of multple populatons We wll start wth one-way ANOVA If the populatons ncluded n the study are selected

More information

Basic Business Statistics, 10/e

Basic Business Statistics, 10/e Chapter 13 13-1 Basc Busness Statstcs 11 th Edton Chapter 13 Smple Lnear Regresson Basc Busness Statstcs, 11e 009 Prentce-Hall, Inc. Chap 13-1 Learnng Objectves In ths chapter, you learn: How to use regresson

More information

This column is a continuation of our previous column

This column is a continuation of our previous column Comparson of Goodness of Ft Statstcs for Lnear Regresson, Part II The authors contnue ther dscusson of the correlaton coeffcent n developng a calbraton for quanttatve analyss. Jerome Workman Jr. and Howard

More information

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore Sesson Outlne Introducton to classfcaton problems and dscrete choce models. Introducton to Logstcs Regresson. Logstc functon and Logt functon. Maxmum Lkelhood Estmator (MLE) for estmaton of LR parameters.

More information

Difference Equations

Difference Equations Dfference Equatons c Jan Vrbk 1 Bascs Suppose a sequence of numbers, say a 0,a 1,a,a 3,... s defned by a certan general relatonshp between, say, three consecutve values of the sequence, e.g. a + +3a +1

More information

Statistics for Business and Economics

Statistics for Business and Economics Statstcs for Busness and Economcs Chapter 11 Smple Regresson Copyrght 010 Pearson Educaton, Inc. Publshng as Prentce Hall Ch. 11-1 11.1 Overvew of Lnear Models n An equaton can be ft to show the best lnear

More information

Properties of Least Squares

Properties of Least Squares Week 3 3.1 Smple Lnear Regresson Model 3. Propertes of Least Squares Estmators Y Y β 1 + β X + u weekly famly expendtures X weekly famly ncome For a gven level of x, the expected level of food expendtures

More information

STAT 3008 Applied Regression Analysis

STAT 3008 Applied Regression Analysis STAT 3008 Appled Regresson Analyss Tutoral : Smple Lnear Regresson LAI Chun He Department of Statstcs, The Chnese Unversty of Hong Kong 1 Model Assumpton To quantfy the relatonshp between two factors,

More information

Econometrics of Panel Data

Econometrics of Panel Data Econometrcs of Panel Data Jakub Mućk Meetng # 8 Jakub Mućk Econometrcs of Panel Data Meetng # 8 1 / 17 Outlne 1 Heterogenety n the slope coeffcents 2 Seemngly Unrelated Regresson (SUR) 3 Swamy s random

More information

CHAPTER 5 NUMERICAL EVALUATION OF DYNAMIC RESPONSE

CHAPTER 5 NUMERICAL EVALUATION OF DYNAMIC RESPONSE CHAPTER 5 NUMERICAL EVALUATION OF DYNAMIC RESPONSE Analytcal soluton s usually not possble when exctaton vares arbtrarly wth tme or f the system s nonlnear. Such problems can be solved by numercal tmesteppng

More information

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analyss of Varance and Desgn of Experment-I MODULE VII LECTURE - 3 ANALYSIS OF COVARIANCE Dr Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur Any scentfc experment s performed

More information

Potential gains from using unit level cost information in a model-assisted framework

Potential gains from using unit level cost information in a model-assisted framework Unversty of Wollongong Research Onlne Centre for Statstcal & Survey Methodology Workng Paper Seres Faculty of Engneerng and Informaton Scences 2013 Potental gans from usng unt level cost nformaton n a

More information

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation Statstcs for Managers Usng Mcrosoft Excel/SPSS Chapter 13 The Smple Lnear Regresson Model and Correlaton 1999 Prentce-Hall, Inc. Chap. 13-1 Chapter Topcs Types of Regresson Models Determnng the Smple Lnear

More information

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analyss of Varance and Desgn of Exerments-I MODULE III LECTURE - 2 EXPERIMENTAL DESIGN MODELS Dr. Shalabh Deartment of Mathematcs and Statstcs Indan Insttute of Technology Kanur 2 We consder the models

More information

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U)

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U) Econ 413 Exam 13 H ANSWERS Settet er nndelt 9 deloppgaver, A,B,C, som alle anbefales å telle lkt for å gøre det ltt lettere å stå. Svar er gtt . Unfortunately, there s a prntng error n the hnt of

More information

Chat eld, C. and A.J.Collins, Introduction to multivariate analysis. Chapman & Hall, 1980

Chat eld, C. and A.J.Collins, Introduction to multivariate analysis. Chapman & Hall, 1980 MT07: Multvarate Statstcal Methods Mke Tso: emal mke.tso@manchester.ac.uk Webpage for notes: http://www.maths.manchester.ac.uk/~mkt/new_teachng.htm. Introducton to multvarate data. Books Chat eld, C. and

More information

Global Sensitivity. Tuesday 20 th February, 2018

Global Sensitivity. Tuesday 20 th February, 2018 Global Senstvty Tuesday 2 th February, 28 ) Local Senstvty Most senstvty analyses [] are based on local estmates of senstvty, typcally by expandng the response n a Taylor seres about some specfc values

More information

Testing for seasonal unit roots in heterogeneous panels

Testing for seasonal unit roots in heterogeneous panels Testng for seasonal unt roots n heterogeneous panels Jesus Otero * Facultad de Economía Unversdad del Rosaro, Colomba Jeremy Smth Department of Economcs Unversty of arwck Monca Gulett Aston Busness School

More information

Supplementary Notes for Chapter 9 Mixture Thermodynamics

Supplementary Notes for Chapter 9 Mixture Thermodynamics Supplementary Notes for Chapter 9 Mxture Thermodynamcs Key ponts Nne major topcs of Chapter 9 are revewed below: 1. Notaton and operatonal equatons for mxtures 2. PVTN EOSs for mxtures 3. General effects

More information

Inner Product. Euclidean Space. Orthonormal Basis. Orthogonal

Inner Product. Euclidean Space. Orthonormal Basis. Orthogonal Inner Product Defnton 1 () A Eucldean space s a fnte-dmensonal vector space over the reals R, wth an nner product,. Defnton 2 (Inner Product) An nner product, on a real vector space X s a symmetrc, blnear,

More information

This module is part of the. Memobust Handbook. on Methodology of Modern Business Statistics

This module is part of the. Memobust Handbook. on Methodology of Modern Business Statistics Ths module s part of the Memobust Handbook on Methodology of Modern Busness Statstcs 26 March 2014 Method: Generalsed Regresson Estmator Contents General secton... 3 1. Summary... 3 2. General descrpton

More information

Interval Estimation in the Classical Normal Linear Regression Model. 1. Introduction

Interval Estimation in the Classical Normal Linear Regression Model. 1. Introduction ECONOMICS 35* -- NOTE 7 ECON 35* -- NOTE 7 Interval Estmaton n the Classcal Normal Lnear Regresson Model Ths note outlnes the basc elements of nterval estmaton n the Classcal Normal Lnear Regresson Model

More information

BOOTSTRAP METHOD FOR TESTING OF EQUALITY OF SEVERAL MEANS. M. Krishna Reddy, B. Naveen Kumar and Y. Ramu

BOOTSTRAP METHOD FOR TESTING OF EQUALITY OF SEVERAL MEANS. M. Krishna Reddy, B. Naveen Kumar and Y. Ramu BOOTSTRAP METHOD FOR TESTING OF EQUALITY OF SEVERAL MEANS M. Krshna Reddy, B. Naveen Kumar and Y. Ramu Department of Statstcs, Osmana Unversty, Hyderabad -500 007, Inda. nanbyrozu@gmal.com, ramu0@gmal.com

More information

STATISTICS QUESTIONS. Step by Step Solutions.

STATISTICS QUESTIONS. Step by Step Solutions. STATISTICS QUESTIONS Step by Step Solutons www.mathcracker.com 9//016 Problem 1: A researcher s nterested n the effects of famly sze on delnquency for a group of offenders and examnes famles wth one to

More information

PHYS 450 Spring semester Lecture 02: Dealing with Experimental Uncertainties. Ron Reifenberger Birck Nanotechnology Center Purdue University

PHYS 450 Spring semester Lecture 02: Dealing with Experimental Uncertainties. Ron Reifenberger Birck Nanotechnology Center Purdue University PHYS 45 Sprng semester 7 Lecture : Dealng wth Expermental Uncertantes Ron Refenberger Brck anotechnology Center Purdue Unversty Lecture Introductory Comments Expermental errors (really expermental uncertantes)

More information