arxiv: v1 [stat.me] 27 Aug 2015

Size: px
Start display at page:

Download "arxiv: v1 [stat.me] 27 Aug 2015"

Transcription

1 Submtted to Statstcal Scence Fractonal Imputaton n Survey Samplng: A Comparatve Revew Shu Yang and Jae Kwang Km Harvard Unversty and Iowa State Unversty arxv: v1 [stat.me] 27 Aug 2015 Abstract. Fractonal mputaton (FI) s a relatvely new method of mputaton for handlng tem nonresponse n survey samplng. In FI, several mputed values wth ther fractonal weghts are created for each mssng tem. Each fractonal weght represents the condtonal probablty of the mputed value gven the observed data, and the parameters n the condtonal probabltes are often computed by an teratve method such as EM algorthm. The underlyng model for FI can be fully parametrc, semparametrc, or nonparametrc, dependng on plausblty of assumptons and the data structure. In ths paper, we gve an overvew of FI, ntroduce key deas and methods to readers who are new to the FI lterature, and hghlght some new development. We also provde gudance on practcal mplementaton of FI and vald nferental tools after mputaton. We demonstrate the emprcal performance of FI wth respect to multple mputaton usng a pseudo fnte populaton generated from a sample n Monthly Retal Trade Survey n US Census Bureau. Key words and phrases: Item nonresponse, Mssng at random, Monte Carlo EM, Multple mputaton, Synthetc mputaton. 1. INTRODUCTION In survey samplng, t s a common practce to collect data on a large number of tems. Even when a sampled unt responds to the survey, ths unt may not respond to some tems. In ths scenaro, mputaton can be used to create a complete data set by fllng n mssng values wth plausble values to facltate data analyses. The goal of mputaton s three-fold: Frst, by provdng complete data, subsequent analyses are easy to mplement and can acheve consstency among dfferent users. Second, mputaton reduces the selecton bas assocated wth only usng the respondent set, whch may not necessarly represent the orgnal sample. Thrd, the mputed data can ncorporate extra nformaton so that the resultng analyses are statstcally effcent and coherent. Combnng nformaton from several surveys or creatng synthetc data from planned mssngness are cases n pont (Schenker and Raghunathan 2007). When the mputed data set s released to the publc, t should meet the goal of multple uses both for planned and unplanned parameters (Hazza, 2009). Room 437A, HSPH2, 655 Huntngton Ave, Boston, MA (e-mal: shuyang@hsph.harvard.com) Snedecor Hall, Iowa State Unversty, Ames, IA (e-mal: jkm@astate.edu). 1 msart-sts ver. 2014/10/16 fle: paper_revew_fi_sts.tex date: August 28, 2015

2 2 S. YANG AND J. K. KIM In a typcal survey stuaton, the mputers may know some of the parameters of nterest at the tme of mputaton, but hardly know the full set of possble parameters to be estmated from the data. Sngle mputaton, such as hot deck mputaton, regresson mputaton and stochastc regresson mputaton, replaces each of the mssng data wth one plausble value. Although sngle mputaton has been wdely used, one drawback s that t does not take nto account of the full uncertanty of mssng data and often falls short of multple-purpose estmaton. Multple mputaton (MI) has been proposed by Rubn (1976) to replace each of mssng data wth multple plausble values to reflect the full uncertanty n the predcton of mssng data. Several authors (Rubn 1987; Lttle and Rubn 2002; Schafer 1997) have promoted MI as a standard approach for general-purpose estmaton under tem nonresponse n survey samplng. Although the varance estmaton formula of Rubn (1987) s smple and easy to apply, t s not always consstent (Fay 1992; Wang and Robns 1998; Km et al. 2006). For usng the MI varance estmaton formula, the congenalty condton of Meng (1994) needs to be met, whch can be restrctve for general-purpose nference. For example, Km (2011) ponted out that a MI procedure that s congenal for mean estmaton s not necessarly congenal for proporton estmaton. Fractonal mputaton (FI) s another effectve mputaton tool for generalpurpose estmaton wth ts advantage of not requrng the congenalty condton. FI was orgnally proposed by Kalton and Ksh (1984) to reduce the varance of sngle mputaton methods by replacng each mssng value wth several plausble values at dfferentable probabltes reflected through fractonal weghts. Fay (1996), Km and Fuller (2004), Fuller and Km (2005), Durrant (2005), Durrant and Sknner (2006) dscussed FI as a nonparametrc mputaton method for descrptve parameters of nterest n survey samplng. Km (2011) and Km and Yang (2014) presented FI under fully parametrc model assumptons. More generally, FI can also serve as a computatonal tool for mplementng the expectaton step (E-step) n the EM algorthm (We and Tanner 1990; Km 2011). When the condtonal expectaton n the E-step s not avalable n a closed form, parametrc FI of Km (2011) smplfes computaton by drawng on the mportance samplng dea. Through fractonal weghts, FI can reduce the burden of teratve computaton, such as Markov Chan Monte Carlo, for evaluatng the condtonal expectaton assocated wth mssng data. Km and Hong (2012) extended parametrc FI to a more general class of ncomplete data, ncludng measurement error models. Despte these advantages, FI n appled research has not been wdely used due to lack of good nformaton that provdes researchers wth comprehensve understandng of ths approach. The goal of ths paper s to brng more attenton to FI by revewng exstng research on FI, ntroducng key deas and methods, and hghlghtng some new development, manly n the context of survey samplng. Ths paper also provdes gudance on practcal mplementatons and applcatons of FI. Ths paper s organzed as follows. Secton 2 provdes the basc setup and Secton 3 ntroduces FI under parametrc model assumptons. Secton 4 dscusses a nonparametrc approach to FI, specally n the context of hot deck mputaton. Secton 5 ntroduces synthetc data mputaton usng FI n the context of two-phase samplng and statstcal matchng. Secton 6 deals wth practcal conmsart-sts ver. 2014/10/16 fle: paper_revew_fi_sts.tex date: August 28, 2015

3 FRACTIONAL IMPUTATION 3 sderatons and varatons of FI, ncludng mputaton szes, choces of proposal dstrbutons and doubly robust FI. Secton 7 compares FI wth MI n terms of effcency of the pont estmator and the varance estmator. Secton 8 demonstrates a smulaton study based on an actual data set. A dscusson concludes ths paper n Secton BASIC SETUP Consder a fnte populaton of N unts dentfed by a set of ndces U = {1, 2,, N} wth N known. The p-dmensonal study varable y = (y 1,, y p ), assocated wth each unt n the populaton, s subject to mssngness. We assume that the fnte populaton at hand s a realzaton from an nfnte populaton, called a superpopulaton. In the superpopulaton model, we often postulate a parametrc dstrbuton, f(y; θ), wth the parameter θ Ω. We can express the densty for the jont dstrbuton of y as (2.1) f(y; θ) = f 1 (y 1 ; θ 1 )f 2 (y 2 y 1 ; θ 2 ) f p (y p y 1,, y p 1 ; θ p ) where θ k s the parameter n the condtonal dstrbuton of y k gven y 1,, y k 1. Now let A denote the set of ndces for unts n a sample selected by a probablty samplng mechansm. Each unt s assocated wth a samplng weght, the nverse of the probablty of beng selected to the sample, denoted by w. We are nterested n estmatng η, defned as a (unque) soluton to the populaton estmatng equaton N =1 U(η; y ) = 0. For example, a populaton mean of y can be obtaned by lettng U(η; y ) = η y, a populaton proporton of y less than a threshold c can be obtaned by specfyng U(η; y ) = η I {y <c}, where I s an ndcator functon, a populaton medan of y can be obtaned by choosng U(η; y ) = 0.5 I {y <η}, and so on. Under complete response, a consstent estmator of η s obtaned by solvng (2.2) w U(η; y ) = 0. A Godambe and Thompson (1986), Bnder and Patak (1994) and Rao, Yung, and Hdroglou (2002) have done rgorous nvestgatons on the estmator obtaned from (2.2) under complex samplng. In the presence of mssng data, frst consder decomposng y = (y obs,, y ms, ), where y obs, and y ms, are the observed and mssng part of y, respectvely. We assume that the response mechansm s mssng at random (MAR) n the sense of Rubn (1976). That s, the probablty of nonresponse does not depend on the mssng value tself. Under MAR, a consstent estmator of η can be obtaned by solvng the condtonal estmatng equaton, gven the observed data y obs = (y obs,1,..., y obs,n ), (2.3) w E{U(η; y ) y obs, } = 0, A where the above condtonal expectaton s taken wth respect to the predcton model (also called the mputaton model), (2.4) f(y ms, y obs, ; θ) = f(y obs,, y ms, ; θ) f(yobs,, y ms, ; θ)dy ms,, msart-sts ver. 2014/10/16 fle: paper_revew_fi_sts.tex date: August 28, 2015

4 4 S. YANG AND J. K. KIM Table 1 Comparson of two approaches of nference wth mssng data Bayesan Frequentst Model Posteror dstrbuton Predcton model f(latent, θ Obs.) f(latent Obs., θ) Learnng algorthm Data augmentaton EM algorthm Predcton Imputaton(I)-step Expectaton(E)-step Parameter update Posteror(P)-step Maxmzaton(M)-step Imputaton Multple mputaton Fractonal mputaton Varance estmaton Rubn s formula Lnearzaton or replcaton whch depends on the unknown parameter θ. Imputaton s thus a computatonal tool for computng the condtonal expectaton n (2.3) for arbtrary choces of the estmatng functon U(η; y). The resultng condtonal expectaton usng mputaton can be called the mputed estmatng functon. Table 1 presents a summary of Bayesan and frequentst approaches of statstcal nference wth mssng data. In the Bayesan approach, θ s treated as a random varable and the reference dstrbuton s the jont dstrbuton of θ and the latent (mssng) data, gven the observed data. On the other hand, n the frequentst approach, θ s treated as fxed and the reference dstrbuton s the condtonal dstrbuton of the latent data, condtonal on the observed data, for a gven parameter θ. The learnng algorthm, that s, the algorthm for updatng nformaton for parameters from observed data, for the Bayesan approach s data augmentaton (Tanner and Wong 1987), whle the learnng algorthm for the frequentst approach s usually the EM algorthm. MI s a Bayesan mputaton method and the mputed estmatng functon s computed wth respect to the posteror predctve dstrbuton, ˆ f(y ms, y obs ) = f(y ms, y obs, ; θ)p(θ y obs )dθ, whch s the average of the predctve dstrbuton f(y ms, y obs, ; θ) over the posteror dstrbuton of θ. On the other hand, n the frequentst approach, the condtonal expectaton n (2.3) s taken wth respect to the predcton model (2.4) evaluated at θ = ˆθ, a consstent estmator of θ. For example, one can use the pseudo MLE ˆθ of θ obtaned by solvng the pseudo mean score equaton (Lous 1982; Pfeffermann et al. 1998), (2.5) S(θ) = w E{S(θ; y ) y,obs ; θ} = 0, A where S(θ; y ) = log f(y ; θ)/ θ. Whle the Bayesan approach to mputaton, especally n the context of MI, s well studed n the lterature, the frequentst approach to mputaton s somewhat sparse. FI has been proposed to fll n ths mportant gap. In FI, the condtonal expectaton n (2.3) s computed by a weghted mean of the mputed estmatng functons (2.6) E{U(η; y ) y obs, } = M w ju(η; y obs,, y (j) ms, ). msart-sts ver. 2014/10/16 fle: paper_revew_fi_sts.tex date: August 28, 2015

5 FRACTIONAL IMPUTATION 5 where y (j) ms,, for j = 1,..., M, are M mputed values for y ms, (f y s completely observed, y (j) ms, y ms,), wj are the fractonal weghts that satsfes w j 0, M w j = 1 and M w wjs(ˆθ; y obs,, y (j) ms, ) = 0. A Once the FI data are constructed, the FI estmator of η s obtaned by solvng (2.7) M w wju(η; y obs,, y (j) ms, ) = 0. A In general, the FI method augments the orgnal data set as (2.8) S F I = { δ (w, y ) + (1 δ ) ( w w j, y j) ; j = 1,..., M, A }, where δ s the ndcator of full response for y, and yj = (y obs,, y (j) ms, ). If (2.6) holds for an arbtrary U functon, the resultng estmator s approxmately unbased for a farly large class of parameters, whch makes the mputaton attractve for general-purpose estmaton. Km (2011) used the mportance samplng technque to satsfy (2.6) for general U functons, whch wll be presented n the next secton. 3. PARAMETRIC FRACTIONAL IMPUTATION Parametrc Fractonal Imputaton (PFI), proposed by Km (2011), features a parametrc model for fractonal mputatons, and parameters n the mputaton model are estmated by a computatonally effcent EM algorthm. To compute the condtonal estmatng equaton n (2.3) by PFI, for each mssng value y ms,, generate M mputed values, denoted by {y (1) ms,,..., y (M) ms, } from a proposal dstrbuton h(y ms, y obs, ). How to choose a proposal dstrbuton wll be dscussed n Secton 6.2. Once the mputed values are generated from h( ), compute wj f(y (j) ms, y obs,; ˆθ) h(y (j) ms, y obs,), subject to M w j = 1, as the fractonal weghts assgned to y j = (y obs,, y (j) ms, ), where ˆθ s the pseudo MLE of θ to be determned by the EM algorthm below. Snce M w j = 1, the above fractonal weght s the same as w j = w j (ˆθ), where (3.1) wj(θ) f(y obs,, y (j) ms, ; θ) h(y (j) ms, y obs,), whch only requres the knowledge of the jont dstrbuton f(y; θ) and the proposal dstrbuton h. The pseudo MLE of θ can be computed by solvng the mputed mean score equaton, (3.2) M w wj(θ)s(θ; y obs,, y (j) ms, ) = 0. A msart-sts ver. 2014/10/16 fle: paper_revew_fi_sts.tex date: August 28, 2015

6 6 S. YANG AND J. K. KIM To solve (3.2), we can ether use the Newton method or the followng EM algorthm: I-step. For each mssng value y ms,, M mputed values are generated from a proposal dstrbuton h(y ms, y obs, ). W-step. Usng the current value of the parameter estmates ˆθ (t), compute the fractonal weghts as wj(t) f(y obs,, y (j) ms, ; ˆθ (t) )/h(y (j) ms, y obs,), subject to M w j(t) = 1. M-step. Update the parameter ˆθ (t+1) by solvng the mputed score equaton, M w wj(t) S(θ; y j) = 0, A where yj = (y obs,, y (j) ms, ) and S(θ; y) = log f(y; θ)/ θ s the score functon of θ. Iteraton. Set t = t+1 and go to the W-step. Stop f ˆθ (t+1) meets the convergence crteron. Here, the I-step s the mputaton step, the W-step s the weghtng step, and the M-step s the maxmzaton step. The I- and W-steps can be combned to mplement the E-step of the EM algorthm. Unlke the Monte Carlo EM (MCEM) method, mputed values are not changed for each EM teraton only the fractonal weghts are changed. Thus, the FI method has computatonal advantages over the MCEM method. Convergence s acheved because the mputed values are not changed. Km (2011) showed that gven the M mputed values, y (1) ms,,..., y (M) ms,, the sequence of estmators {ˆθ (0), ˆθ (1),...} from the W-and M- steps converges to a statonary pont ˆθ M for fxed M. The statonary pont ˆθ M converges to the pseudo MLE of θ as M. The resultng weght wj after convergence s the fractonal weght assgned to yj = (y obs,, y (j) ms, ). We may add an addtnal step to montor the dstrbuton of the fractonal weghts so that no extremely large fractonal weghts domnate the weghts. Once the fractonal mputed data s constructed from the above steps, t can be used to estmate other parameters of nterest. That s, we can use (2.7) to estmate η from the FI data set. We now consder a bvarate mssng data example to llustrate the use of the EM algorthm n FI. Example 1. Suppose a probablty sample conssts of n unts of z = (x, y 1, y 2 ) wth samplng weght w, where x s always observed and y = (y 1, y 2 ) s subject to mssngness. Let A 11, A 10, A 01, and A 00 be the partton of the sample based on the mssng pattern, where subscrpt 1/0 n the -th poston denote that the -th y tem s observed/mssng, respectvely. For example, A 10 s the set of the sample wth y 1 observed and y 2 mssng. The condtonal expectaton n (2.3) nvolves evaluatng the condtonal dstrbuton of y ms, gven the observed data x and y obs, for each mssng pattern, msart-sts ver. 2014/10/16 fle: paper_revew_fi_sts.tex date: August 28, 2015

7 whch s then decomposed nto FRACTIONAL IMPUTATION 7 w E{U(η; z ) x, y obs, } = w U(η; x, y 1, y 2 )+ w E{U(η; x, Y 1, Y 2 ) A A 11 A 00 x }+ w E{U(η; x, Y 1, y 2 ) x, y 2 }+ w E{U(η; x, y 1, Y 2 ) x, y 1 }. A 01 A 10 Suppose the jont dstrbuton n (2.1) s (3.3) f(x, y 1, y 2 ; θ) = f x (x; θ 0 )f 1 (y 1 x; θ 1 )f 2 (y 2 x, y 1 ; θ 2 ). From the full respondent sample n A 11, obtan ˆθ 1(0) and ˆθ 2(0), whch are ntal parameter estmates for θ 1 and θ 2. In the I-step, for each mssng value y ms,, generate M mputed values from h(y ms, x, y obs, ) = f(y ms, x, y obs, ; ˆθ (0) ), where f 2 (y 2 x, y 1 ; ˆθ 2(0) ) f A 10 (3.4) f(y ms, x, y obs, ; ˆθ (0) ) = f(y 1 x, y 2 ; ˆθ (0) ) f A 01 f(y 1, y 2 x ; ˆθ (0) ) f A 00 and (3.5) f(y 1 x, y 2 ; ˆθ (0) ) = f 1 (y 1 x ; ˆθ 1(0) )f 2 (y 2 x, y 1 ; ˆθ 2(0) ) f1 (y 1 x ; ˆθ 1(0) )f 2 (y 2 x, y 1 ; ˆθ 2(0) )dy 1. Note that the margnal dstrbuton of x, f x (x; θ 0 ), s not used n (3.5). Except for some specal cases such as when both f 1 and f 2 are normal dstrbutons, the condtonal dstrbuton n (3.5) s not n a known form. Thus, some computatonal tools such as Metropols-Hastng (Hastngs 1970) or SIR (Samplng Importance Resamplng, Smth and Gelfand 1992) are needed to generate samples from (3.5) for A 01. For example, the SIR conssts of the followng steps: 1. Generate B (say B = 100) Monte Carlo samples, denoted by y (1) 1,, y (B) 1, from f 1 (y 1 x ; ˆθ 1(0) ). 2. Among the B samples obtaned from Step 1, select one sample wth the selecton probablty proportonal to f 2 (y 2 x, y (k) 1 ; ˆθ 2(0) ), where y (k) 1 s the k-th sample from Step 1 (k = 1,, B). 3. Repeat Step 1 and Step 2 ndependently M tmes to obtan M mputed values. Once we obtan M mputed values of y 1, we can use h(y 1 x, y 2 ) f 1 (y 1 x ; ˆθ 1(0) )f 2 (y 2 x, y 1 ; ˆθ 2(0) ) as the proposal densty n (3.4). Snce M w j = 1, we do not need to compute the normalzng constant n (3.5). For A 10, M mputed values of y 2 are generated from f 2 (y 2 x, y 1 ; ˆθ 2(0) ). For A (00), M mputed values of y 1 are generated from f 1 (y 1 x ; ˆθ 1(0) ) and then M mputed values of y 2 are generated from f 2 (y 2 x, y1 ; ˆθ 2(0) ). msart-sts ver. 2014/10/16 fle: paper_revew_fi_sts.tex date: August 28, 2015

8 8 S. YANG AND J. K. KIM In the W-step, the fractonal weghts are computed by wth M observed. w j(t) wj(t) f 1(y (j) 1 = 1, where y (j) 1 x ; ˆθ 1(t) )f 2 (y (j) 2 x, y 1 ; ˆθ 2(t) ) h(y (j) ms, x, y obs, ) = y 1 f y 1 s observed and y (j) 2 = y 2 f y 2 s The above example covers a broad range of applcatons n the mssng data lterature, such as mssng covarate problems, measurement error models, generalzed lnear mxed models, and so on. Yang and Km (2014) consdered regresson analyses wth mssng covarates n survey data usng FI, where n the current notaton, f(y 2 x, y 1 ) s a regresson model wth y 2 and x fully observed and y 1 subject to mssngness. In generalzed lnear mxed models, f(y 2 x, y 1 ) s a generalzed lnear mxed model where y 1 s the latent random effect. See Yang, Km, and Zhu (2013) for usng FI to estmate parameters n the generalzed lnear mxed models. For varance estmaton, note that the mputed estmator ˆη F I obtaned from the mputed estmatng equaton (2.7) depends on ˆθ obtaned from (3.2). To reflect ths dependence, we can wrte ˆη F I = ˆη F I (ˆθ). To account for the samplng varablty of ˆθ n the mputed estmator ˆη F I, ether the lnearzaton method or replcaton methods can be used. In the lnearzaton method, the mputaton model s needed n order to compute partal dervatves of the score functons. To avod dsclosng the mputaton model, replcaton methods are often preferred (Rao and Shao 1992). To mplement the replcaton varance estmaton n FI, we frst obtan the k-th replcate pseudo MLE ˆθ [k] of ˆθ by solvng (3.6) S [k] (θ) A w [k] M wj(θ)s(θ; yj) = 0, where w [k] s the k-th replcaton weght and wj (θ) s defned n (3.1). To obtan ˆθ [k] from (3.6), ether EM algorthm or the one-step Newton method can be used. EM algorthm can be mplemented smlarly as before. For the one-step Newton method, we have where { } ˆθ [k] = ˆθ 1 θ S [k] T (ˆθ) A w [k] M wj(ˆθ)s(ˆθ; yj), θ S [k] T (θ) = M w [k] w j(θ)ṡ(θ; y j) + A A 2 M S(θ; y j) wj(θ)s(θ; yj) w [k] M wj(θ) msart-sts ver. 2014/10/16 fle: paper_revew_fi_sts.tex date: August 28, 2015

9 FRACTIONAL IMPUTATION 9 wth Ṡ(θ; y) = S(θ; y)/ θt and B 2 = BB T. Once ˆθ [k] s obtaned, we obtan the k-th replcate ˆη [k] of ˆη by solvng A for η, where w [k] j = w j (ˆθ [k] ). w [k] M w [k] j U(η; y j) = 0 4. NONPARAMETRIC FRACTIONAL IMPUTATION 4.1 Fractonal Hot Deck Imputaton Hot deck mputaton uses observed responses from the sample as mputed values. The unt wth a mssng value s called the recpent and the unt provdng the value for the mputaton s called the donor. Durrant (2009), Hazza (2009) and Andrdge and Lttle (2010) provded comprehensve overvews of hot deck mputaton n survey samplng. The attractve features of hot deck mputaton nclude the followng. Frst, unlke model-based mputaton methods that generate artfcal mputed values, n hot deck mputaton, only plausble values can be mputed, and therefore dstrbutonal propertes of the data are preserved. For example, mputed values for categorcal varables wll also be categorcal, as observed from the respondents. Second, compared to fully parametrc methods, hot deck mputaton makes less or no dstrbutonal assumptons and therefore s more robust. For these reasons, hot deck mputaton s a wdely used mputaton method, especally n household surveys. Fractonal hot deck mputaton (FHDI) combnes the deas of FI and hot deck mputaton. It s effcent (due to FI), and t nherts the aforementoned good propertes of hot deck mputaton. Km and Fuller (2004), Fuller and Km (2005), and Km and Yang (2014) consdered FHDI for unvarate mssng data. We now descrbe a multvarate FHDI procedure to deal wth mssng data wth an arbtrary mssng pattern (Im et al. 2015). We frst consder categorcal data. Let z = (z 1,..., z K ) be the vector of study varables that take categorcal values. Let z = (z 1,..., z K ) be the -th realzaton of z. Let δ j be the response ndcator varable for z j. That s, δ j = 1 f z j s observed and δ j = 0 otherwse. Assume that the response mechansm s MAR. Based on δ = (δ 1,..., δ K ), the orgnal observaton z can be decomposed nto (z obs,, z ms, ), whch are the mssng and observed part of z, respectvely. Let D = {z (1) ms,,..., z (M ) ms, } be the set of all possble values of z ms,, that s, (z obs,, z (j) ms, ) s one of the actually observed value n the respondents, for j = 1,..., M, wth M > 0. If all of M possble values are taken as the mputed values for z ms,, the fractonal weght assgned to the j-th mputed value z (j) ms, s (4.1) w j = π(z obs,, z (j) ms, ) k D π(z obs,, z (k) ms, ), where π(z) s the jont probablty of z. If the jont probablty s nonparametrcally modeled, t s computed by A (4.2) π(z) = w j D wj I{(z obs,, z (j) ms, ) = z} A w, msart-sts ver. 2014/10/16 fle: paper_revew_fi_sts.tex date: August 28, 2015

10 10 S. YANG AND J. K. KIM where z (j) ms, z ms, and wj = M 1, for j = 1,..., M 1, f z s completely observed. To compute (4.1) and (4.2), EM algorthm by weghtng (Ibrahm 1990) can be used, wth the ntal values of fractonal weghts beng wj(0) = M 1. Equatons (4.1) and (4.2) correspond to the E-step and M-step of the EM algorthm, respectvely. The M-step (4.2) can be changed f there s a parametrc model for the jont probablty π(z). For example, f the jont probablty can be modeled by a multnomal dstrbuton wth parameter α, say π(z; α), then the M-step replaces (4.2) wth solvng the mputed score equaton of α to update the estmate of α. For contnuous data y = (y 1,..., y K ), we consder a dscrete approxmaton. Dscretze each contnuous varable by dvdng ts range nto a small fnte number of segments (for example, quantles). Let z k denote the dscrete verson of y k. Note that z k s observed only f y k s observed. Let the support of z, denoted by {z 1,..., z G }, whch s the same as the sample support of z from the full respondents, specfy donor cells. The jont probablty of z, denoted by π(z g ), for g = 1,..., G, can be obtaned by the EM algorthm for categorcal mssng data as descrbed above. As n the categorcal mssng data problem, let D = {z (1) ms,,..., z (M ) ms, } be the set of all possble values of z ms,. Usng a fnte mxture model, a nonparametrc approxmaton of f(y ms, y obs, ) s M (4.3) f(y ms, y obs, ) P (z = z (j) y obs, )f(y ms, z (j) ). Each z (j) = (z obs,, z (j) ms, ) defnes an mputaton cell. The approxmaton n (4.3) s based on the assumpton that (4.4) P (y ms y obs, z) = P (y ms z), whch requres (approxmate) condtonal ndependence between y ms and y obs gven z. Thus, we assume that the covarance structure between tems are captured by the dscrete approxmaton and the wthn cell errors can be safely assumed to be ndependent. Once the mputaton cells are formed to satsfy (4.4), we select m g mputed values for y ms,, denoted by y (j) = (y obs,, y (j) ms, ), for j = 1,..., m g, randomly from the full respondents n the same cell, wth the selecton probablty proportonal to the samplng weghts. The fnal fractonal weghts assgned to y (j) s wj = ˆP (z (j) ms, y obs,)m 1 g. Ths FHDI procedure resembles a two-phase stratfed samplng (Rao 1973, Km et al. 2006), where formng the mputaton cells corresponds to stratfcaton (phase one) and conductng hot deck mputaton corresponds to stratfed samplng (phase two). For more detals, see Im, Km, and Fuller (2015). If we select all possble donors n the same cell, the resultng FI estmator s fully effcent n the sense that t does not ntroduce addtonal randomness due to hot deck mputaton. Such fractonal hot deck mputaton s called fully effcent fractonal mputaton (FEFI). The FEFI opton s currently avalable at Proc Surveympute n SAS (SAS Insttute Inc. 2015). 4.2 Nonparametrc Fractonal Imputaton Usng Kernels In real-data applcatons, nonparametrc methods are preferred f less s known about the true underlyng data model. Hot deck mputaton makes less or no dsmsart-sts ver. 2014/10/16 fle: paper_revew_fi_sts.tex date: August 28, 2015

11 FRACTIONAL IMPUTATION 11 trbutonal assumptons and therefore s more robust than fully parametrc methods. In what follows, we dscuss an alternatve way of calculatng the fractonal weghts that lnks the FI estmator to some well-known nonparametrc estmators, such as Nadaraya-Watson kernel regresson estmator (Nadaraya 1964). For smplcty, suppose we have bvarate data (x, y ) where x s completely observed and y s subject to mssng. Assume the mssng data mechansm s MAR. Let δ be the response ndcator that takes the value one f y s observed and takes zero otherwse. We are nterested n estmatng η, whch s defned through E{U(η; X, Y )} = 0. Let A R = { A; δ = 1} be the ndex set of respondents. To calculate the condtonal estmatng equaton (2.3) nonparametrcally, we use the followng fractonal mputaton: for each unt wth δ = 0, r = A R mputed values of y are taken from A R, denoted by y (1) the Kernel-based fractonal weghts w j = K h(x x (j) where K h ( ) s the kernel functon wth bandwdth h and x (j) assocated wth y (j) (4.5) A w,, y (r), and compute )/ k A R K h (x x (k) ), s the covarate. The resultng FI estmatng equaton can be wrtten as δ U(η; x, y ) + (1 δ ) wju(η; x, y (j) ) = 0, j A R where the nonparametrc fractonal weghts measure the degrees of smlarty based on the dstance between x and x (j). The FI estmator uses Û(η; x ) j A R wj U(η; x, y (j) ) to approxmate E{U(η; x, y ) x } nonparametrcally. For fxed η, Û(η; x ) s often called the Nadaraya-Watson kernel regresson estmator of E{U(η; x, y ) x } n the nonparametrc estmaton framework. Note that ths FI estmator does not rely on any parametrc model assumptons and so s nonparametrc; however t s not assumpton free because t makes an mplct assumpton of the contnuty of E{U(η; x, y) x } through the choce of kernels to defne the smlarty (Nadaraya 1964). Notably, whle the convergence of Û(η; x ) to E{U(η; x, y ) x } does not acheve the order of O p (1/ n), the soluton ˆη F I to (4.5) satsfes ˆη F I η = O p (1/ n) under some regularty condtons, whch was proved by Wang and Chen (2009) n the IID setup. Such kernel-based nonparametrc fractonal mputaton can be drectly applcable to complex survey samplng scenaros. More developments are expected by couplng FI wth other nonparametrc methods such as those usng the nearest neghbor mputaton method (Chen and Shao 2001; Ktamura et al. 2009; Km et al. 2011) or predctve mean matchng (Vnk et al. 2014). 5. SYNTHETIC DATA IMPUTATION Synthetc mputaton s a technque of creatng mputed values for the unobserved tems by ncorporatng nformaton from other surveys. For example, suppose that there are two ndependent surveys, called Survey 1 and Survey 2, and we observe x from Survey 1 and observe (x, y ) from Survey 2. In ths case, we may want to create synthetc values of y n Survey 1 by frst fttng a model relatng y to x to the data from Survey 2 and then predctng y assocated wth x observed n Survey 1. Synthetc mputaton s partcularly useful when Survey 1 s a large scale survey and tem y s very expensve to measure. Schenker and Raghunathan (2007) reported several applcatons of synthetc mputaton, usng msart-sts ver. 2014/10/16 fle: paper_revew_fi_sts.tex date: August 28, 2015

12 12 S. YANG AND J. K. KIM a model-based method to estmate parameters assocated wth varables not observed n Survey 1 but observed n a much smaller Survey 2. In one applcaton, both self-reported health measurements x and clncal measurements from physcal examnatons y for a small sample A 2 of ndvduals were observed. In the much larger Survey 1, only self-reported measurements, x were observed. Only the mputed or synthetc data from Survey 1 and assocated survey weghts were released to the publc. The setup of two ndependent samples wth common tems s often called nonnested two-phase samplng. Two-phase samplng can be treated as a mssng data problem, where the mssngness s planned and the response probablty s known. 5.1 Fractonal Imputaton for Two-phase Samplng In two-phase samplng, suppose we observe x n the frst-phase sample and observe (x, y ) n the second-phase sample, where the second-phase sample s not necessarly nested wthn the frst-phase sample. Let A 1 and w 1 be the set of ndces and the set of samplng weghts for the frst-phase sample, respectvely. Let A 2 and w 2 be the correspondng sets for the second-phase sample. Assume a workng model m(x ; β) for E(y x ). For estmaton of the populaton total of y, the two-phase regresson estmator can be wrtten as (5.1) Ŷ tp = w 1 m(x ; ˆβ) + w 2 {y m(x ; ˆβ)}, A 1 A 2 where the subscrpt tp stands for two-phase, and ˆβ s estmated from the second-phase sample. The two-phase regresson estmator s effcent f the workng model s well-specfed. The frst term of (5.1) s called the projecton estmator. Note that f the second term of (5.1) s equal to zero, the two-phase regresson estmator s equvalent to the projecton estmator. Some asymptotc propertes of the two-phase estmator and varance estmaton methods have been dscussed n Km, Navarro, and Fuller (2006), and Km and Yu (2011a). Km and Rao (2012) dscussed asymptotc propertes of the projecton estmator under non-nested two-phase samplng. In a large scale survey, t s a common practce to produce estmates for domans. Creatng an mputed data set for the frst-phase sample, often called mass mputaton, s one method for ncorporatng the second-phase nformaton nto the frst-phase sample. Bredt and Fuller (1996) dscussed the possblty of usng mputaton to get mproved estmates for domans. Fuller (2003) nvestgated mass mputaton n the context of two-phase samplng. The FI procedure can be used to obtan the two-phase regresson estmator n (5.1) and, at the same tme, mprove doman estmaton. Note that the two-phase regresson estmator (5.1) can be wrtten as (5.2) Ŷ F EF I = A 1, j A 2 w 1 w jy (j) where y (j) = ŷ + ê j, ŷ = m(x ; ˆβ), ê j = y j ŷ j, w j = w j2/( k A 2 w k2 ), and we assume A 1 w 1 = A 2 w 2. The expresson (5.2) mples that we mpute all the elements n the frst-phase sample, ncludng the elements that also belong to the second-phase sample. The estmator (5.2) s computed usng an msart-sts ver. 2014/10/16 fle: paper_revew_fi_sts.tex date: August 28, 2015

13 FRACTIONAL IMPUTATION 13 augmented data set of n 1 n 2 records, where n 1 and n 2 are the szes of A 1 and A 2, respectvely, and the (, j)-th record has an (mputed) observaton y (j) = ŷ + ê j wth weght w 1 wj. That s, for each unt A 1, we mpute n 2 values of y (j) wth fractonal weght wj. The method n (5.2) mputes all the elements n A 2 and s called fully effcent fractonal mputaton (FEFI) method, accordng to Fuller and Km (2005). The FEFI estmator s algebracally equvalent to the two-phase regresson estmator of the populaton total of y, and can also provde consstent estmates for other parameters such as populaton quantles. If t s desrable to lmt the number of mputatons to a small value m (m < n 2 ), FI usng the regresson weghtng method n Fuller and Km (2005) can be adopted. We frst select m values of y (j), denoted by y (1),, y (m), among the set of n 2 mputed values {y (j) The fractonal weghts w j (5.3) m w j ; j A 2 } usng an effcent samplng method. assgned to the selected y (j) ( 1, y (j) ) = j A 2 w j ( 1, y (j) are determned so that holds for each A 1. The fractonal weght satsfyng (5.3) can be computed usng the regresson weghtng method or the emprcal lkelhood method, see secton 6.1 for detals. The resultng FI data y (j) wth weghts w 1 w j are constructed wth n 1 m records, whch ntegrate avalable nformaton from two phases. Replcaton varance estmaton wth FI, smlar to Fuller and Km (2005), can be developed. See Secton 8.7 of Km and Shao (2013). 5.2 Fractonal Imputaton for Statstcal Matchng Statstcal matchng s used to ntegrate two or more data sets when nformaton avalable for matchng records for ndvdual partcpants across data sets s ncomplete. Statstcal matchng can be vewed as a mssng data problem where a researcher wants to perform a jont analyss of varables not jontly observed. Statstcal matchng technques can be used to construct fully augmented data fles to enable statstcally vald data analyss. Table 2 A Smple Data Structure for Matchng X Y 1 Y 2 Sample A o o Sample B o o ) To smplfy the setup, suppose that there are two surveys, Survey A and Survey B, each contanng a random sample wth partal nformaton about the populaton. Suppose that we observe x and y 1 from the Survey A sample and observe x and y 2 from the Survey B sample. Table 2 llustrates a smple data structure for matchng. Wthout loss of generalzablty, consder mputng y 1 n Survey B, snce mputng y 2 n Survey A s symmetrc. Under ths setup, we can use FI to generate y 1 from the condtonal dstrbuton of y 1 gven the observatons. That s, we generate y 1 from (5.4) f (y 1 x, y 2 ) f (y 2 x, y 1 ) f (y 1 x). msart-sts ver. 2014/10/16 fle: paper_revew_fi_sts.tex date: August 28, 2015

14 14 S. YANG AND J. K. KIM Of note, assumptons are needed to dentfy the parameters n the jont model. For example, Km, Berg, and Park (2015) used an nstrumental varable assumpton to dentfy the model. To generate y 1 from (5.4), the EM algorthm by FI can be used. For more detals, see Km, Berg, and Park (2015). 6. FRACTIONAL IMPUTATION VARIANTS 6.1 The Choce of M and Calbraton Fractonal Imputaton The choce of the mputaton sze M s a matter of tradeoff between statstcal effcency and computaton effcency: small M may lead to large varablty n Monte Carlo approxmaton; whereas large M may ncrease computatonal cost. The magntude of the mputaton error s usually O(1/ M), whch can be reduced for large M. Thus, f computatonal power allows, the larger M, the better. In survey practces, a large mputaton sze may not be desrable. Thus, nstead of releasng to publc large number of mputed values for each mssng tem, a subset of ntal mputaton values can be selected to reduce the mputaton sze. In ths case, the FI procedure can be developed n three stages. The frst stage, called Fully Effcent Fractonal Imputaton (FEFI), computes the pseudo MLE of parameters n the superpopulaton model wth suffcently large mputaton sze M, say M = 1, 000. The second stage s the Samplng Stage, whch selects small m (say, m = 10) mputed values from the set of M mputed values. The thrd stage s Calbraton Weghtng, whch nvolves constructng the fnal fractonal weghts for the m fnal mputed values to satsfy some calbraton constrants. Ths procedure can be called Calbraton FI. The FEFI step s the same as n the prevous secton. In what follows, we descrbe the last two stages n detals. In the Samplng Stage, a subset of mputed values are selected to reduce the mputaton sze. For each, we have M mputed values yj = (y obs,, y (j) ms, ) wth ther fractonal weghts w j. We treat y = {y j, j = 1,..., M} as a weghted fnte populaton wth weght w j and use an unequal probablty samplng method such as probablty-proporton-to-sze (PPS) samplng to select a sample of sze m, say m = 10, from y usng wj as the selecton probablty. Let ỹ1,..., ỹ m be the m elements sampled from y. The ntal fractonal weghts for the sampled m mputed values are gven by w j0 = m 1. Ths set of fractonal weghts may not necessarly satsfy the mputed score equaton (6.1) m w A w js(ˆθ; ỹ j) = 0, where ˆθ s the pseudo MLE of θ computed at the FEFI stage. It s desrable for the soluton to the mputed score equaton wth small m to be equal to the pseudo MLE of θ, whch specfes the calbraton constrants. At the Calbraton Weghtng stage, the ntal set of weghts are modfed to satsfy the constrant (6.1). Fndng the calbrated fractonal weghts can be acheved by the regresson weghtng technque, by whch the fractonal weghts that satsfy (6.1) and m w j = 1. The regresson fractonal weghts are constructed by (6.2) w j = w j0 + w j0 (S j S ), msart-sts ver. 2014/10/16 fle: paper_revew_fi_sts.tex date: August 28, 2015

15 where S j = S(ˆθ; y j ), S = m w j0 S j, and FRACTIONAL IMPUTATION 15 = { A m w w j0s j} T { A m w w j0(s j S ) 2 } 1. Note that some of the fractonal weghts computed by (6.2) can take negatve values. To avod negatve weghts, alternatve algorthms other than regresson weghtng should be used. For example, the fractonal weghts of the form w j = w j0 exp( S j ) m k=1 w k0 exp( S k ) are approxmately equal to the regresson fractonal weghts n (6.2) and are always postve. 6.2 The Choce of the Proposal Dstrbuton PFI s based on samplng from an mportance samplng densty h called the proposal dstrbuton. The choce of the proposal dstrbuton s somewhat arbtrary. However, wth fnte samples and mputatons, a well-specfed proposal dstrbuton may mprove the performance of the mputaton estmator. There are a number of ways to specfy the proposal dstrbuton and to assess the goodness of specfcaton. For a planned parameter, e.g., η, the populaton mean of y, Km (2011) showed the optmal h that makes Monte Carlo approxmaton varance of ȳ M w j y j as small as possble, s gven by h (y ms, y obs, ) = f(y ms, y obs,, ˆθ) y E{y y obs,, ˆθ} E{ y E{y y obs,, ˆθ} y obs,, ˆθ}, where ˆθ s the MLE of θ. For general-purpose estmaton, η s often unknown at the tme of mputaton accordng to Fay (1992), h(y ms, y obs, ) = f(y ms, y obs, ; ˆθ) s a reasonable choce n terms of statstcal effcency. For mportance samplng, snce we do not know ˆθ at the outset of the EM algorthm, we may want to have a good ntal guess θ 0 and use h(y ms, x, y obs, ) = f(y ms, x, y obs, ; θ 0 ). If we don t have a good ntal guess of the true value of θ, we can use a pror dstrbuton π(θ) to get h(y ms, y obs, ) = f(y ms, y obs, ; θ)π(θ)dθ. We now dscuss a specal choce of the proposal dstrbuton h, based on the realzed values of the varables havng mssng values, whch s akn to hot deck mputaton. Wthout loss of generalty, assume that y s observed n the frst r elements, y s mssng n the remanng (n r) elements, and x s completely observed n the sample. Usng the mportance samplng dea, we assgn a fractonal weght to donor y j (1 j r) for the mssng tem y (r +1 n) by choosng h(y j ) = f(y j δ j = 1). In calculatng the fractonal weghts, we approxmate f(y j δ j = 1) by ts emprcal dstrbuton n 1 N R k=1 δ kf (y j x k ), where n R s the number of respondents. The EM algorthm takes the followng steps: I-step For each mssng value y, = r + 1,..., n, take all values n A R = {y 1,..., y r } as donors. msart-sts ver. 2014/10/16 fle: paper_revew_fi_sts.tex date: August 28, 2015

16 16 S. YANG AND J. K. KIM W-step Wth the current estmate of θ, denoted by ˆθ (t),compute the fractonal weghts by (6.3) w j(t) f(y j x ; ˆθ (t) ) k A R w k f(y j x k ; ˆθ (t) ) M-step Update the parameter ˆθ (t+1) by solvng the followng mputed score equaton, ˆθ (t+1) : soluton to r S(θ; x, y ) + =1 n r =r+1 w (t) j S(θ; x, y j ) = 0. Iteraton Set t = t+1 and go to the W-step. Stop f ˆθ (t+1) meets the convergence crteron. The semparametrc fractonal mputaton (SFI) estmator of Ȳ s ˆȲ SF I = 1 r n r y + w n jy j. =1 =r+1 Km and Yang (2014) showed that the resultng estmator gans robustness. It s less senstve aganst the departure from the assumed condtonal regresson model. 6.3 Doubly Robust Fractonal Imputaton Suppose we have bvarate data (x, y ) where x s completely observed and y s subject to mssng and mssng data mechansm s MAR. Assume also an outcome regresson (OR) model, gven by E(y x ) = m(x ; β 0 ), and the response propensty (RP) model, gven by P (δ = 1 x, y ) = P (δ = 1 x ) = π(x ; φ 0 ). Denote the set of respondents as A R = {, δ = 1}, where δ s the response ndcator of y. We are nterested n the populaton total η = N =1 y. Note that not both the OR and RP models are needed to construct consstent estmators of η. For example, ˆη 1 = A w m(x ; ˆβ), wth ˆβ beng a consstent estmator of β 0, s consstent to η under the OR model and ˆη 2 = A R w y /π(x ; ˆφ), wth ˆφ beng a consstent estmator of φ 0, s consstent to η under the RP model. An estmator of η s doubly robust f t s consstent f ether the OR model or the RP model s correct, but not necessarly both. Ths property guards the estmator from possble model msspecfcatons. The DR estmators have been extensvely studed n the lterature, ncludng Robns, Rotntzky, and Zhao (1994), Bang and Robns (2005), Tan (2006), Kang and Schafer (2007), Cao, Tsats, and Davdan (2009), and Km and Hazza (2014). We now dscuss a fractonal mputaton estmator that has the double robustness feature. For each mssng y, let yj = ŷ + ê j be the j-th mputed value from the donor j A R, where ŷ = m(x ; ˆβ) wth ˆβ ftted under the OR model and ê j = y j m(x ; ˆβ). If A R w 1/π(x j ; ˆφ) = A w, each unt j A R represents 1/π(x j ; ˆφ) copes of the sample. Then, the fractonal weght wj assocated wth the j-th mputed value yj s proportonal to {1/π(x j; ˆφ) 1} over the donor pool msart-sts ver. 2014/10/16 fle: paper_revew_fi_sts.tex date: August 28, 2015

17 A R (mnus one because y j tself counts one), that s, (6.4) w j = FRACTIONAL IMPUTATION 17 w j {1/π(x j ; φ 0 ) 1} k A w kδ k {1/π(x k ; ˆφ) 1}. Under ths weght constructon, the fractonal mputaton estmator s gven by (6.5) ˆη F I = n w δ y + (1 δ ){ δ j wjy j}. A We show that the fractonal mputaton estmator ˆη F I n (6.5) s doubly robust. Frst notce that ˆη F I s algebracally equal to (6.6) ˆη F I = ] w [m(x ; ˆβ) δ + π(x A ; ˆφ) {y m(x ; ˆβ)}. Let ˆη n = A w y be the full sample estmator of of η, then ˆη F I ˆη n = A w { δ π(x ; ˆφ) 1 } {y m(x ; ˆβ)}. Ths s an asymptotcally unbased estmator of zero f ether the OR model or the RP model s correct, but not necessarly both. Km and Hazza (2014) dscussed effcent estmaton of (β, φ) n survey samplng. 7. COMPARISON WITH MULTIPLE IMPUTATION 7.1 Statstcal Effcency In the presence of mssng data wth MAR, multple mputaton (MI) s a popular method. It s thus of nterest to compare the behavor of these two methods. We start from a smple settng wth the complete data z beng randomly drawn from a populaton whose densty s f(z; θ), where θ R d s an unknown parameter to be estmated. Suppose that m complete data sets are created by mputng the mssng data z ms from the posteror predctve dstrbuton gven the observed data z obs f(z ms z obs ) = f(z ms z obs ; θ)π(θ z obs )dθ, where π(θ z obs ) s the posteror dstrbuton of θ. The MI estmator of θ, denoted by ˆθ MI s ˆθ MI = m 1 m k=1 ˆθ (k), where ˆθ (k) s the MLE estmator appled to the k-th mputed data set. Rubn s formula s used for varance estmaton n MI, ˆV MI (ˆθ MI ) = W m + (1 + m 1 )B m, where W m = m 1 m k=1 ˆV (k), B m = (m 1) 1 m k=1 (ˆθ (k) ˆθ MI ) 2, and ˆV (k) s the varance estmator of ˆθ under complete response appled to the k-th mputed data set. msart-sts ver. 2014/10/16 fle: paper_revew_fi_sts.tex date: August 28, 2015

18 18 S. YANG AND J. K. KIM Of note, Bayesan MI s a smulaton-based method and thus ntroduce addtonal nose. Ths explans why the asymptotc varance of the MI estmator, gven by Wang and Robns (1998), (7.1) V MI = I 1 obs + m 1 I 1 comi ms I 1 com + m 1 J T I 1 obs J, s strctly larger than the asymptotc varance of the FI estmator (7.2) V F I = I 1 obs + m 1 I 1 comi ms I 1 com, where I com = E{S(θ) 2 }, I obs = E{S obs (θ) 2 }, I ms = I com I obs, S(θ) = S(Z; θ) = log f(z; θ)/ θ s the log lkelhood score f the data were completely observed and S obs (θ) = E{S(θ) Z obs } s the score functon of the observed data log lkelhood, J = I ms Icom 1 s the fracton of mssng nformaton matrx (Rubn 1987, Chapter 4). Ths dfference between (7.1) and (7.2) can be szable for a small m. Furthermore, for a large m, although the MI estmator s effcent, the nference s neffcent snce Rubn s varance estmator of the MI estmator s only weakly unbased, that s ˆV MI (ˆθ MI ) converges n dstrbuton nstead of coverages n probablty to V MI. Ths leads to much broader confdence ntervals and less powerful tests than a consstent varance estmator would do (Nelsen 2003). For MI nference to be vald for general-purpose estmaton, mputatons must be proper accordng to Rubn (1987). A suffcent condton s gven by Meng (1994). The so-called congenalty condton, mposed on both the mputaton model and the form of subsequent complete-sample analyses, s qute restrctve for general-purpose estmaton. Otherwse, as dscussed by Fay (1992; 1996), Kott (1995), Bnder and Sun (1996), Robns and Wang (2000), Nelsen (2003), and Km et al. (2006), the MI varance estmator s not always consstent. Km et al. (2011) ponted out that MI that s congenal for mean estmaton s not necessarly congenal for proporton estmaton. Yang and Km (2015b) showed that the MI varance estmator can be postvely or negatvely based when the method of moments estmator s used as the complete-sample estmator. In contrast, FI, as we dscussed n secton 4, does not requre congenalty and always results n a consstent varance estmator for general-purpose estmaton. 7.2 Imputaton under Informatve Samplng Under nformatve samplng, the MAR assumpton s subtle. We assume that the response mechansm s MAR at the populaton level, now referred to as populaton mssng at random (PMAR), to be dstngushed from the concept of sample mssng at random (SMAR). For smplcty, assume y s a one-dmensonal varable whch s subject to mssng, δ s ts response ndcator, and I s the sample ncluson ndcator. PMAR assumes that y δ x, that s, MAR holds at the populaton level, f(y x) = f(y x, δ). On the other hand, SMAR assumes Y δ (x, I = 1), that s, MAR holds at the sample level, f(y x, I = 1) = f(y x, I = 1, δ). The two assumptons are not testable emprcally. The plausblty of these assumptons should be judged by subject matter experts. Often, PMAR s more realstc because an ndvdual s decson on whether or not to respond to a survey depends on hs or her own characterstcs, rather than the fact of hm or her beng n the sample or not. msart-sts ver. 2014/10/16 fle: paper_revew_fi_sts.tex date: August 28, 2015

19 FRACTIONAL IMPUTATION 19 δ I Y X U Fgure 1. A drected acyclc graph (DAG) for a setup where PMAR holds but SMAR does not hold. Varable U s latent n the sense that t s never observed. For nonnformatve samplng desgn, we have P (I = 1 x, y) = P (I = 1 x), under whch PMAR mples SMAR; however for nformatve samplng desgn, PMAR does not necessarly mply SMAR. In such cases, usng an mputaton model ftted to the sample data for generatng mputatons can result n based estmaton. FI does not requre SMAR to hold besdes PMAR. Under PMAR, we have f(y x, δ = 0) = f(y x). Let f(y x; β) be a parametrc model of f(y x). The parameter β can be consstently estmated by solvng (2.5), even under nformatve samplng. Snce FI generates the mputatons from f(y x; ˆβ), wth a consstent estmator ˆβ, the resultng FI estmator s approxmately unbased (Berg et al. 2015). Whereas, MI tends to problematc under nformatve samplng. By usng an augmented model, where the mputaton model s augmented to nclude samplng weghts or some functon of them, as f(y x, w), the MI pont estmator was clamed to be approxmately unbased (Rubn 1996; Schenker et al. 2006). However, as ponted out by Berg, Km, and Sknner (2015), t s not always true. For example, Y s condtonally ndependent of δ gven X as presented n Fgure 1. However, Y s not condtonally ndependent of δ gven X and I. Augmentng X by ncludng samplng weghts does not solve the problem. The exstence of the latent varable U, whch s correlated wth I and δ, makes SMAR unachevable. 8. SIMULATION STUDY We nvestgated the performance of FI compared to MI by a lmted smulaton study usng an artfcal fnte populaton generated from real survey data. The pseudo fnte populaton was generated from a sngle month of the U.S. Census Bureau s Monthly Retal Trade Survey (MRTS). Each month, the MRTS surveys a sample of about 12, 000 retal busnesses wth pad employees to collect data on sales and nventores. The MRTS s an economc ndcator survey whose monthly estmates are nputs to the Gross Domestc Product estmates. The MRTS sample desgn s typcal of busness surveys, employng one-stage stratfed samplng wth stratfcaton based on major ndustry, further substratfed by the estmated annual sales. The sample desgn requres hgher samplng rates n strata wth larger unts than n strata wth smaller unts. More detals about MRTS can be found n Mulry, Olver, and Kaputa (2014). The orgnal populaton fle contans 19, 601 retal busnesses stratfed nto 16 strata, wth a strata dentfer (h), sales (y), and nventory values (x). For smulaton purpose, we focus on the frst 5 strata as a fnte populaton, consstng of 7, 260 retal busnesses. Fgure 2 shows the scatter plot of sales and nventory msart-sts ver. 2014/10/16 fle: paper_revew_fi_sts.tex date: August 28, 2015

Parametric fractional imputation for missing data analysis. Jae Kwang Kim Survey Working Group Seminar March 29, 2010

Parametric fractional imputation for missing data analysis. Jae Kwang Kim Survey Working Group Seminar March 29, 2010 Parametrc fractonal mputaton for mssng data analyss Jae Kwang Km Survey Workng Group Semnar March 29, 2010 1 Outlne Introducton Proposed method Fractonal mputaton Approxmaton Varance estmaton Multple mputaton

More information

Parametric fractional imputation for missing data analysis

Parametric fractional imputation for missing data analysis Secton on Survey Research Methods JSM 2008 Parametrc fractonal mputaton for mssng data analyss Jae Kwang Km Wayne Fuller Abstract Under a parametrc model for mssng data, the EM algorthm s a popular tool

More information

Efficient nonresponse weighting adjustment using estimated response probability

Efficient nonresponse weighting adjustment using estimated response probability Effcent nonresponse weghtng adjustment usng estmated response probablty Jae Kwang Km Department of Appled Statstcs, Yonse Unversty, Seoul, 120-749, KOREA Key Words: Regresson estmator, Propensty score,

More information

Markov Chain Monte Carlo (MCMC), Gibbs Sampling, Metropolis Algorithms, and Simulated Annealing Bioinformatics Course Supplement

Markov Chain Monte Carlo (MCMC), Gibbs Sampling, Metropolis Algorithms, and Simulated Annealing Bioinformatics Course Supplement Markov Chan Monte Carlo MCMC, Gbbs Samplng, Metropols Algorthms, and Smulated Annealng 2001 Bonformatcs Course Supplement SNU Bontellgence Lab http://bsnuackr/ Outlne! Markov Chan Monte Carlo MCMC! Metropols-Hastngs

More information

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X Statstcs 1: Probablty Theory II 37 3 EPECTATION OF SEVERAL RANDOM VARIABLES As n Probablty Theory I, the nterest n most stuatons les not on the actual dstrbuton of a random vector, but rather on a number

More information

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4)

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4) I. Classcal Assumptons Econ7 Appled Econometrcs Topc 3: Classcal Model (Studenmund, Chapter 4) We have defned OLS and studed some algebrac propertes of OLS. In ths topc we wll study statstcal propertes

More information

Estimation: Part 2. Chapter GREG estimation

Estimation: Part 2. Chapter GREG estimation Chapter 9 Estmaton: Part 2 9. GREG estmaton In Chapter 8, we have seen that the regresson estmator s an effcent estmator when there s a lnear relatonshp between y and x. In ths chapter, we generalzed the

More information

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity LINEAR REGRESSION ANALYSIS MODULE IX Lecture - 30 Multcollnearty Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur 2 Remedes for multcollnearty Varous technques have

More information

Computation of Higher Order Moments from Two Multinomial Overdispersion Likelihood Models

Computation of Higher Order Moments from Two Multinomial Overdispersion Likelihood Models Computaton of Hgher Order Moments from Two Multnomal Overdsperson Lkelhood Models BY J. T. NEWCOMER, N. K. NEERCHAL Department of Mathematcs and Statstcs, Unversty of Maryland, Baltmore County, Baltmore,

More information

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction ECONOMICS 5* -- NOTE (Summary) ECON 5* -- NOTE The Multple Classcal Lnear Regresson Model (CLRM): Specfcaton and Assumptons. Introducton CLRM stands for the Classcal Lnear Regresson Model. The CLRM s also

More information

Global Sensitivity. Tuesday 20 th February, 2018

Global Sensitivity. Tuesday 20 th February, 2018 Global Senstvty Tuesday 2 th February, 28 ) Local Senstvty Most senstvty analyses [] are based on local estmates of senstvty, typcally by expandng the response n a Taylor seres about some specfc values

More information

Markov Chain Monte Carlo Lecture 6

Markov Chain Monte Carlo Lecture 6 where (x 1,..., x N ) X N, N s called the populaton sze, f(x) f (x) for at least one {1, 2,..., N}, and those dfferent from f(x) are called the tral dstrbutons n terms of mportance samplng. Dfferent ways

More information

Module 3 LOSSY IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur

Module 3 LOSSY IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur Module 3 LOSSY IMAGE COMPRESSION SYSTEMS Verson ECE IIT, Kharagpur Lesson 6 Theory of Quantzaton Verson ECE IIT, Kharagpur Instructonal Objectves At the end of ths lesson, the students should be able to:

More information

A Robust Method for Calculating the Correlation Coefficient

A Robust Method for Calculating the Correlation Coefficient A Robust Method for Calculatng the Correlaton Coeffcent E.B. Nven and C. V. Deutsch Relatonshps between prmary and secondary data are frequently quantfed usng the correlaton coeffcent; however, the tradtonal

More information

The Geometry of Logit and Probit

The Geometry of Logit and Probit The Geometry of Logt and Probt Ths short note s meant as a supplement to Chapters and 3 of Spatal Models of Parlamentary Votng and the notaton and reference to fgures n the text below s to those two chapters.

More information

ECONOMETRICS II (ECO 2401S) University of Toronto. Department of Economics. Winter 2017 Instructor: Victor Aguirregabiria

ECONOMETRICS II (ECO 2401S) University of Toronto. Department of Economics. Winter 2017 Instructor: Victor Aguirregabiria ECOOMETRICS II ECO 40S Unversty of Toronto Department of Economcs Wnter 07 Instructor: Vctor Agurregabra SOLUTIO TO FIAL EXAM Tuesday, Aprl 8, 07 From :00pm-5:00pm 3 hours ISTRUCTIOS: - Ths s a closed-book

More information

Small Area Interval Estimation

Small Area Interval Estimation .. Small Area Interval Estmaton Partha Lahr Jont Program n Survey Methodology Unversty of Maryland, College Park (Based on jont work wth Masayo Yoshmor, Former JPSM Vstng PhD Student and Research Fellow

More information

Bias-correction under a semi-parametric model for small area estimation

Bias-correction under a semi-parametric model for small area estimation Bas-correcton under a sem-parametrc model for small area estmaton Laura Dumtrescu, Vctora Unversty of Wellngton jont work wth J. N. K. Rao, Carleton Unversty ICORS 2017 Workshop on Robust Inference for

More information

Efficient estimation in missing data and survey sampling problems

Efficient estimation in missing data and survey sampling problems Graduate Theses and Dssertatons Iowa State Unversty Capstones, Theses and Dssertatons 2012 Effcent estmaton n mssng data and survey samplng problems Sxa Chen Iowa State Unversty Follow ths and addtonal

More information

CSci 6974 and ECSE 6966 Math. Tech. for Vision, Graphics and Robotics Lecture 21, April 17, 2006 Estimating A Plane Homography

CSci 6974 and ECSE 6966 Math. Tech. for Vision, Graphics and Robotics Lecture 21, April 17, 2006 Estimating A Plane Homography CSc 6974 and ECSE 6966 Math. Tech. for Vson, Graphcs and Robotcs Lecture 21, Aprl 17, 2006 Estmatng A Plane Homography Overvew We contnue wth a dscusson of the major ssues, usng estmaton of plane projectve

More information

Weighted Estimating Equations with Response Propensities in Terms of Covariates Observed only for Responders

Weighted Estimating Equations with Response Propensities in Terms of Covariates Observed only for Responders Weghted Estmatng Equatons wth Response Propenstes n Terms of Covarates Observed only for Responders Erc V. Slud, U.S. Census Bureau, CSRM Unv. of Maryland, Mathematcs Dept. NISS Mssng Data Workshop, November

More information

A note on regression estimation with unknown population size

A note on regression estimation with unknown population size Statstcs Publcatons Statstcs 6-016 A note on regresson estmaton wth unknown populaton sze Mchael A. Hdroglou Statstcs Canada Jae Kwang Km Iowa State Unversty jkm@astate.edu Chrstan Olver Nambeu Statstcs

More information

Numerical Heat and Mass Transfer

Numerical Heat and Mass Transfer Master degree n Mechancal Engneerng Numercal Heat and Mass Transfer 06-Fnte-Dfference Method (One-dmensonal, steady state heat conducton) Fausto Arpno f.arpno@uncas.t Introducton Why we use models and

More information

Lecture 12: Discrete Laplacian

Lecture 12: Discrete Laplacian Lecture 12: Dscrete Laplacan Scrbe: Tanye Lu Our goal s to come up wth a dscrete verson of Laplacan operator for trangulated surfaces, so that we can use t n practce to solve related problems We are mostly

More information

Stat260: Bayesian Modeling and Inference Lecture Date: February 22, Reference Priors

Stat260: Bayesian Modeling and Inference Lecture Date: February 22, Reference Priors Stat60: Bayesan Modelng and Inference Lecture Date: February, 00 Reference Prors Lecturer: Mchael I. Jordan Scrbe: Steven Troxler and Wayne Lee In ths lecture, we assume that θ R; n hgher-dmensons, reference

More information

2E Pattern Recognition Solutions to Introduction to Pattern Recognition, Chapter 2: Bayesian pattern classification

2E Pattern Recognition Solutions to Introduction to Pattern Recognition, Chapter 2: Bayesian pattern classification E395 - Pattern Recognton Solutons to Introducton to Pattern Recognton, Chapter : Bayesan pattern classfcaton Preface Ths document s a soluton manual for selected exercses from Introducton to Pattern Recognton

More information

On an Extension of Stochastic Approximation EM Algorithm for Incomplete Data Problems. Vahid Tadayon 1

On an Extension of Stochastic Approximation EM Algorithm for Incomplete Data Problems. Vahid Tadayon 1 On an Extenson of Stochastc Approxmaton EM Algorthm for Incomplete Data Problems Vahd Tadayon Abstract: The Stochastc Approxmaton EM (SAEM algorthm, a varant stochastc approxmaton of EM, s a versatle tool

More information

Conjugacy and the Exponential Family

Conjugacy and the Exponential Family CS281B/Stat241B: Advanced Topcs n Learnng & Decson Makng Conjugacy and the Exponental Famly Lecturer: Mchael I. Jordan Scrbes: Bran Mlch 1 Conjugacy In the prevous lecture, we saw conjugate prors for the

More information

A note on multiple imputation for method of moments estimation

A note on multiple imputation for method of moments estimation Statstcs Publcatons Statstcs 2-2016 A note on multple mputaton for method of moments estmaton Shu Yang Harvard Unversty Jae Kwang Km Iowa State Unversty, jkm@astate.edu Follow ths and addtonal works at:

More information

Linear Approximation with Regularization and Moving Least Squares

Linear Approximation with Regularization and Moving Least Squares Lnear Approxmaton wth Regularzaton and Movng Least Squares Igor Grešovn May 007 Revson 4.6 (Revson : March 004). 5 4 3 0.5 3 3.5 4 Contents: Lnear Fttng...4. Weghted Least Squares n Functon Approxmaton...

More information

LOW BIAS INTEGRATED PATH ESTIMATORS. James M. Calvin

LOW BIAS INTEGRATED PATH ESTIMATORS. James M. Calvin Proceedngs of the 007 Wnter Smulaton Conference S G Henderson, B Bller, M-H Hseh, J Shortle, J D Tew, and R R Barton, eds LOW BIAS INTEGRATED PATH ESTIMATORS James M Calvn Department of Computer Scence

More information

MATH 829: Introduction to Data Mining and Analysis The EM algorithm (part 2)

MATH 829: Introduction to Data Mining and Analysis The EM algorithm (part 2) 1/16 MATH 829: Introducton to Data Mnng and Analyss The EM algorthm (part 2) Domnque Gullot Departments of Mathematcal Scences Unversty of Delaware Aprl 20, 2016 Recall 2/16 We are gven ndependent observatons

More information

Additional Codes using Finite Difference Method. 1 HJB Equation for Consumption-Saving Problem Without Uncertainty

Additional Codes using Finite Difference Method. 1 HJB Equation for Consumption-Saving Problem Without Uncertainty Addtonal Codes usng Fnte Dfference Method Benamn Moll 1 HJB Equaton for Consumpton-Savng Problem Wthout Uncertanty Before consderng the case wth stochastc ncome n http://www.prnceton.edu/~moll/ HACTproect/HACT_Numercal_Appendx.pdf,

More information

8 : Learning in Fully Observed Markov Networks. 1 Why We Need to Learn Undirected Graphical Models. 2 Structural Learning for Completely Observed MRF

8 : Learning in Fully Observed Markov Networks. 1 Why We Need to Learn Undirected Graphical Models. 2 Structural Learning for Completely Observed MRF 10-708: Probablstc Graphcal Models 10-708, Sprng 2014 8 : Learnng n Fully Observed Markov Networks Lecturer: Erc P. Xng Scrbes: Meng Song, L Zhou 1 Why We Need to Learn Undrected Graphcal Models In the

More information

Comparison of Regression Lines

Comparison of Regression Lines STATGRAPHICS Rev. 9/13/2013 Comparson of Regresson Lnes Summary... 1 Data Input... 3 Analyss Summary... 4 Plot of Ftted Model... 6 Condtonal Sums of Squares... 6 Analyss Optons... 7 Forecasts... 8 Confdence

More information

Maximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models

Maximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models ECO 452 -- OE 4: Probt and Logt Models ECO 452 -- OE 4 Maxmum Lkelhood Estmaton of Bnary Dependent Varables Models: Probt and Logt hs note demonstrates how to formulate bnary dependent varables models

More information

Limited Dependent Variables

Limited Dependent Variables Lmted Dependent Varables. What f the left-hand sde varable s not a contnuous thng spread from mnus nfnty to plus nfnty? That s, gven a model = f (, β, ε, where a. s bounded below at zero, such as wages

More information

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA 4 Analyss of Varance (ANOVA) 5 ANOVA 51 Introducton ANOVA ANOVA s a way to estmate and test the means of multple populatons We wll start wth one-way ANOVA If the populatons ncluded n the study are selected

More information

Composite Hypotheses testing

Composite Hypotheses testing Composte ypotheses testng In many hypothess testng problems there are many possble dstrbutons that can occur under each of the hypotheses. The output of the source s a set of parameters (ponts n a parameter

More information

EEE 241: Linear Systems

EEE 241: Linear Systems EEE : Lnear Systems Summary #: Backpropagaton BACKPROPAGATION The perceptron rule as well as the Wdrow Hoff learnng were desgned to tran sngle layer networks. They suffer from the same dsadvantage: they

More information

j) = 1 (note sigma notation) ii. Continuous random variable (e.g. Normal distribution) 1. density function: f ( x) 0 and f ( x) dx = 1

j) = 1 (note sigma notation) ii. Continuous random variable (e.g. Normal distribution) 1. density function: f ( x) 0 and f ( x) dx = 1 Random varables Measure of central tendences and varablty (means and varances) Jont densty functons and ndependence Measures of assocaton (covarance and correlaton) Interestng result Condtonal dstrbutons

More information

BOOTSTRAP METHOD FOR TESTING OF EQUALITY OF SEVERAL MEANS. M. Krishna Reddy, B. Naveen Kumar and Y. Ramu

BOOTSTRAP METHOD FOR TESTING OF EQUALITY OF SEVERAL MEANS. M. Krishna Reddy, B. Naveen Kumar and Y. Ramu BOOTSTRAP METHOD FOR TESTING OF EQUALITY OF SEVERAL MEANS M. Krshna Reddy, B. Naveen Kumar and Y. Ramu Department of Statstcs, Osmana Unversty, Hyderabad -500 007, Inda. nanbyrozu@gmal.com, ramu0@gmal.com

More information

STAT 3008 Applied Regression Analysis

STAT 3008 Applied Regression Analysis STAT 3008 Appled Regresson Analyss Tutoral : Smple Lnear Regresson LAI Chun He Department of Statstcs, The Chnese Unversty of Hong Kong 1 Model Assumpton To quantfy the relatonshp between two factors,

More information

Testing for seasonal unit roots in heterogeneous panels

Testing for seasonal unit roots in heterogeneous panels Testng for seasonal unt roots n heterogeneous panels Jesus Otero * Facultad de Economía Unversdad del Rosaro, Colomba Jeremy Smth Department of Economcs Unversty of arwck Monca Gulett Aston Busness School

More information

Chapter 11: Simple Linear Regression and Correlation

Chapter 11: Simple Linear Regression and Correlation Chapter 11: Smple Lnear Regresson and Correlaton 11-1 Emprcal Models 11-2 Smple Lnear Regresson 11-3 Propertes of the Least Squares Estmators 11-4 Hypothess Test n Smple Lnear Regresson 11-4.1 Use of t-tests

More information

SDMML HT MSc Problem Sheet 4

SDMML HT MSc Problem Sheet 4 SDMML HT 06 - MSc Problem Sheet 4. The recever operatng characterstc ROC curve plots the senstvty aganst the specfcty of a bnary classfer as the threshold for dscrmnaton s vared. Let the data space be

More information

Generalized Linear Methods

Generalized Linear Methods Generalzed Lnear Methods 1 Introducton In the Ensemble Methods the general dea s that usng a combnaton of several weak learner one could make a better learner. More formally, assume that we have a set

More information

Lecture Notes on Linear Regression

Lecture Notes on Linear Regression Lecture Notes on Lnear Regresson Feng L fl@sdueducn Shandong Unversty, Chna Lnear Regresson Problem In regresson problem, we am at predct a contnuous target value gven an nput feature vector We assume

More information

Statistical registers by restricted neighbor imputation

Statistical registers by restricted neighbor imputation Statstcal regsters by restrcted neghbor mputaton an applcaton to the Norwegan Agrculture Survey Abstract Nna Hagesæther 1 and L-Chun Zhang Statstcs Norway In ths paper we mplement the method of Zhang and

More information

Kernel Methods and SVMs Extension

Kernel Methods and SVMs Extension Kernel Methods and SVMs Extenson The purpose of ths document s to revew materal covered n Machne Learnng 1 Supervsed Learnng regardng support vector machnes (SVMs). Ths document also provdes a general

More information

Negative Binomial Regression

Negative Binomial Regression STATGRAPHICS Rev. 9/16/2013 Negatve Bnomal Regresson Summary... 1 Data Input... 3 Statstcal Model... 3 Analyss Summary... 4 Analyss Optons... 7 Plot of Ftted Model... 8 Observed Versus Predcted... 10 Predctons...

More information

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U)

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U) Econ 413 Exam 13 H ANSWERS Settet er nndelt 9 deloppgaver, A,B,C, som alle anbefales å telle lkt for å gøre det ltt lettere å stå. Svar er gtt . Unfortunately, there s a prntng error n the hnt of

More information

Population element: 1 2 N. 1.1 Sampling with Replacement: Hansen-Hurwitz Estimator(HH)

Population element: 1 2 N. 1.1 Sampling with Replacement: Hansen-Hurwitz Estimator(HH) Chapter 1 Samplng wth Unequal Probabltes Notaton: Populaton element: 1 2 N varable of nterest Y : y1 y2 y N Let s be a sample of elements drawn by a gven samplng method. In other words, s s a subset of

More information

NUMERICAL DIFFERENTIATION

NUMERICAL DIFFERENTIATION NUMERICAL DIFFERENTIATION 1 Introducton Dfferentaton s a method to compute the rate at whch a dependent output y changes wth respect to the change n the ndependent nput x. Ths rate of change s called the

More information

Chapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise.

Chapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise. Chapter - The Smple Lnear Regresson Model The lnear regresson equaton s: where y + = β + β e for =,..., y and are observable varables e s a random error How can an estmaton rule be constructed for the

More information

Boostrapaggregating (Bagging)

Boostrapaggregating (Bagging) Boostrapaggregatng (Baggng) An ensemble meta-algorthm desgned to mprove the stablty and accuracy of machne learnng algorthms Can be used n both regresson and classfcaton Reduces varance and helps to avod

More information

Chapter 5 Multilevel Models

Chapter 5 Multilevel Models Chapter 5 Multlevel Models 5.1 Cross-sectonal multlevel models 5.1.1 Two-level models 5.1.2 Multple level models 5.1.3 Multple level modelng n other felds 5.2 Longtudnal multlevel models 5.2.1 Two-level

More information

Supplementary Notes for Chapter 9 Mixture Thermodynamics

Supplementary Notes for Chapter 9 Mixture Thermodynamics Supplementary Notes for Chapter 9 Mxture Thermodynamcs Key ponts Nne major topcs of Chapter 9 are revewed below: 1. Notaton and operatonal equatons for mxtures 2. PVTN EOSs for mxtures 3. General effects

More information

Bayesian predictive Configural Frequency Analysis

Bayesian predictive Configural Frequency Analysis Psychologcal Test and Assessment Modelng, Volume 54, 2012 (3), 285-292 Bayesan predctve Confgural Frequency Analyss Eduardo Gutérrez-Peña 1 Abstract Confgural Frequency Analyss s a method for cell-wse

More information

Report on Image warping

Report on Image warping Report on Image warpng Xuan Ne, Dec. 20, 2004 Ths document summarzed the algorthms of our mage warpng soluton for further study, and there s a detaled descrpton about the mplementaton of these algorthms.

More information

Discussion of Extensions of the Gauss-Markov Theorem to the Case of Stochastic Regression Coefficients Ed Stanek

Discussion of Extensions of the Gauss-Markov Theorem to the Case of Stochastic Regression Coefficients Ed Stanek Dscusson of Extensons of the Gauss-arkov Theorem to the Case of Stochastc Regresson Coeffcents Ed Stanek Introducton Pfeffermann (984 dscusses extensons to the Gauss-arkov Theorem n settngs where regresson

More information

On Outlier Robust Small Area Mean Estimate Based on Prediction of Empirical Distribution Function

On Outlier Robust Small Area Mean Estimate Based on Prediction of Empirical Distribution Function On Outler Robust Small Area Mean Estmate Based on Predcton of Emprcal Dstrbuton Functon Payam Mokhtaran Natonal Insttute of Appled Statstcs Research Australa Unversty of Wollongong Small Area Estmaton

More information

Lecture 3: Probability Distributions

Lecture 3: Probability Distributions Lecture 3: Probablty Dstrbutons Random Varables Let us begn by defnng a sample space as a set of outcomes from an experment. We denote ths by S. A random varable s a functon whch maps outcomes nto the

More information

x = , so that calculated

x = , so that calculated Stat 4, secton Sngle Factor ANOVA notes by Tm Plachowsk n chapter 8 we conducted hypothess tests n whch we compared a sngle sample s mean or proporton to some hypotheszed value Chapter 9 expanded ths to

More information

Introduction to Regression

Introduction to Regression Introducton to Regresson Dr Tom Ilvento Department of Food and Resource Economcs Overvew The last part of the course wll focus on Regresson Analyss Ths s one of the more powerful statstcal technques Provdes

More information

RELIABILITY ASSESSMENT

RELIABILITY ASSESSMENT CHAPTER Rsk Analyss n Engneerng and Economcs RELIABILITY ASSESSMENT A. J. Clark School of Engneerng Department of Cvl and Envronmental Engneerng 4a CHAPMAN HALL/CRC Rsk Analyss for Engneerng Department

More information

On mutual information estimation for mixed-pair random variables

On mutual information estimation for mixed-pair random variables On mutual nformaton estmaton for mxed-par random varables November 3, 218 Aleksandr Beknazaryan, Xn Dang and Haln Sang 1 Department of Mathematcs, The Unversty of Msssspp, Unversty, MS 38677, USA. E-mal:

More information

Lecture 3 Stat102, Spring 2007

Lecture 3 Stat102, Spring 2007 Lecture 3 Stat0, Sprng 007 Chapter 3. 3.: Introducton to regresson analyss Lnear regresson as a descrptve technque The least-squares equatons Chapter 3.3 Samplng dstrbuton of b 0, b. Contnued n net lecture

More information

Lecture 10 Support Vector Machines II

Lecture 10 Support Vector Machines II Lecture 10 Support Vector Machnes II 22 February 2016 Taylor B. Arnold Yale Statstcs STAT 365/665 1/28 Notes: Problem 3 s posted and due ths upcomng Frday There was an early bug n the fake-test data; fxed

More information

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers Psychology 282 Lecture #24 Outlne Regresson Dagnostcs: Outlers In an earler lecture we studed the statstcal assumptons underlyng the regresson model, ncludng the followng ponts: Formal statement of assumptons.

More information

Linear Regression Analysis: Terminology and Notation

Linear Regression Analysis: Terminology and Notation ECON 35* -- Secton : Basc Concepts of Regresson Analyss (Page ) Lnear Regresson Analyss: Termnology and Notaton Consder the generc verson of the smple (two-varable) lnear regresson model. It s represented

More information

Finite Mixture Models and Expectation Maximization. Most slides are from: Dr. Mario Figueiredo, Dr. Anil Jain and Dr. Rong Jin

Finite Mixture Models and Expectation Maximization. Most slides are from: Dr. Mario Figueiredo, Dr. Anil Jain and Dr. Rong Jin Fnte Mxture Models and Expectaton Maxmzaton Most sldes are from: Dr. Maro Fgueredo, Dr. Anl Jan and Dr. Rong Jn Recall: The Supervsed Learnng Problem Gven a set of n samples X {(x, y )},,,n Chapter 3 of

More information

Statistics for Economics & Business

Statistics for Economics & Business Statstcs for Economcs & Busness Smple Lnear Regresson Learnng Objectves In ths chapter, you learn: How to use regresson analyss to predct the value of a dependent varable based on an ndependent varable

More information

Appendix B. The Finite Difference Scheme

Appendix B. The Finite Difference Scheme 140 APPENDIXES Appendx B. The Fnte Dfference Scheme In ths appendx we present numercal technques whch are used to approxmate solutons of system 3.1 3.3. A comprehensve treatment of theoretcal and mplementaton

More information

Non-Mixture Cure Model for Interval Censored Data: Simulation Study ABSTRACT

Non-Mixture Cure Model for Interval Censored Data: Simulation Study ABSTRACT Malaysan Journal of Mathematcal Scences 8(S): 37-44 (2014) Specal Issue: Internatonal Conference on Mathematcal Scences and Statstcs 2013 (ICMSS2013) MALAYSIAN JOURNAL OF MATHEMATICAL SCIENCES Journal

More information

e i is a random error

e i is a random error Chapter - The Smple Lnear Regresson Model The lnear regresson equaton s: where + β + β e for,..., and are observable varables e s a random error How can an estmaton rule be constructed for the unknown

More information

More metrics on cartesian products

More metrics on cartesian products More metrcs on cartesan products If (X, d ) are metrc spaces for 1 n, then n Secton II4 of the lecture notes we defned three metrcs on X whose underlyng topologes are the product topology The purpose of

More information

MMA and GCMMA two methods for nonlinear optimization

MMA and GCMMA two methods for nonlinear optimization MMA and GCMMA two methods for nonlnear optmzaton Krster Svanberg Optmzaton and Systems Theory, KTH, Stockholm, Sweden. krlle@math.kth.se Ths note descrbes the algorthms used n the author s 2007 mplementatons

More information

1 Binary Response Models

1 Binary Response Models Bnary and Ordered Multnomal Response Models Dscrete qualtatve response models deal wth dscrete dependent varables. bnary: yes/no, partcpaton/non-partcpaton lnear probablty model LPM, probt or logt models

More information

Difference Equations

Difference Equations Dfference Equatons c Jan Vrbk 1 Bascs Suppose a sequence of numbers, say a 0,a 1,a,a 3,... s defned by a certan general relatonshp between, say, three consecutve values of the sequence, e.g. a + +3a +1

More information

Factor models with many assets: strong factors, weak factors, and the two-pass procedure

Factor models with many assets: strong factors, weak factors, and the two-pass procedure Factor models wth many assets: strong factors, weak factors, and the two-pass procedure Stanslav Anatolyev 1 Anna Mkusheva 2 1 CERGE-EI and NES 2 MIT December 2017 Stanslav Anatolyev and Anna Mkusheva

More information

Notes on Frequency Estimation in Data Streams

Notes on Frequency Estimation in Data Streams Notes on Frequency Estmaton n Data Streams In (one of) the data streamng model(s), the data s a sequence of arrvals a 1, a 2,..., a m of the form a j = (, v) where s the dentty of the tem and belongs to

More information

Problem Set 9 Solutions

Problem Set 9 Solutions Desgn and Analyss of Algorthms May 4, 2015 Massachusetts Insttute of Technology 6.046J/18.410J Profs. Erk Demane, Srn Devadas, and Nancy Lynch Problem Set 9 Solutons Problem Set 9 Solutons Ths problem

More information

An adaptive SMC scheme for ABC. Bayesian Computation (ABC)

An adaptive SMC scheme for ABC. Bayesian Computation (ABC) An adaptve SMC scheme for Approxmate Bayesan Computaton (ABC) (ont work wth Prof. Mke West) Department of Statstcal Scence - Duke Unversty Aprl/2011 Approxmate Bayesan Computaton (ABC) Problems n whch

More information

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics ECOOMICS 35*-A Md-Term Exam -- Fall Term 000 Page of 3 pages QUEE'S UIVERSITY AT KIGSTO Department of Economcs ECOOMICS 35* - Secton A Introductory Econometrcs Fall Term 000 MID-TERM EAM ASWERS MG Abbott

More information

Chapter 8 Indicator Variables

Chapter 8 Indicator Variables Chapter 8 Indcator Varables In general, e explanatory varables n any regresson analyss are assumed to be quanttatve n nature. For example, e varables lke temperature, dstance, age etc. are quanttatve n

More information

Feb 14: Spatial analysis of data fields

Feb 14: Spatial analysis of data fields Feb 4: Spatal analyss of data felds Mappng rregularly sampled data onto a regular grd Many analyss technques for geophyscal data requre the data be located at regular ntervals n space and/or tme. hs s

More information

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands Content. Inference on Regresson Parameters a. Fndng Mean, s.d and covarance amongst estmates.. Confdence Intervals and Workng Hotellng Bands 3. Cochran s Theorem 4. General Lnear Testng 5. Measures of

More information

Hidden Markov Models & The Multivariate Gaussian (10/26/04)

Hidden Markov Models & The Multivariate Gaussian (10/26/04) CS281A/Stat241A: Statstcal Learnng Theory Hdden Markov Models & The Multvarate Gaussan (10/26/04) Lecturer: Mchael I. Jordan Scrbes: Jonathan W. Hu 1 Hdden Markov Models As a bref revew, hdden Markov models

More information

Maximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models

Maximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models ECO 452 -- OE 4: Probt and Logt Models ECO 452 -- OE 4 Mamum Lkelhood Estmaton of Bnary Dependent Varables Models: Probt and Logt hs note demonstrates how to formulate bnary dependent varables models for

More information

A New Method for Estimating Overdispersion. David Fletcher and Peter Green Department of Mathematics and Statistics

A New Method for Estimating Overdispersion. David Fletcher and Peter Green Department of Mathematics and Statistics A New Method for Estmatng Overdsperson Davd Fletcher and Peter Green Department of Mathematcs and Statstcs Byron Morgan Insttute of Mathematcs, Statstcs and Actuaral Scence Unversty of Kent, England Overvew

More information

Simulated Power of the Discrete Cramér-von Mises Goodness-of-Fit Tests

Simulated Power of the Discrete Cramér-von Mises Goodness-of-Fit Tests Smulated of the Cramér-von Mses Goodness-of-Ft Tests Steele, M., Chaselng, J. and 3 Hurst, C. School of Mathematcal and Physcal Scences, James Cook Unversty, Australan School of Envronmental Studes, Grffth

More information

Econ Statistical Properties of the OLS estimator. Sanjaya DeSilva

Econ Statistical Properties of the OLS estimator. Sanjaya DeSilva Econ 39 - Statstcal Propertes of the OLS estmator Sanjaya DeSlva September, 008 1 Overvew Recall that the true regresson model s Y = β 0 + β 1 X + u (1) Applyng the OLS method to a sample of data, we estmate

More information

Lecture 6: Introduction to Linear Regression

Lecture 6: Introduction to Linear Regression Lecture 6: Introducton to Lnear Regresson An Manchakul amancha@jhsph.edu 24 Aprl 27 Lnear regresson: man dea Lnear regresson can be used to study an outcome as a lnear functon of a predctor Example: 6

More information

The EM Algorithm (Dempster, Laird, Rubin 1977) The missing data or incomplete data setting: ODL(φ;Y ) = [Y;φ] = [Y X,φ][X φ] = X

The EM Algorithm (Dempster, Laird, Rubin 1977) The missing data or incomplete data setting: ODL(φ;Y ) = [Y;φ] = [Y X,φ][X φ] = X The EM Algorthm (Dempster, Lard, Rubn 1977 The mssng data or ncomplete data settng: An Observed Data Lkelhood (ODL that s a mxture or ntegral of Complete Data Lkelhoods (CDL. (1a ODL(;Y = [Y;] = [Y,][

More information

REPLICATION VARIANCE ESTIMATION UNDER TWO-PHASE SAMPLING IN THE PRESENCE OF NON-RESPONSE

REPLICATION VARIANCE ESTIMATION UNDER TWO-PHASE SAMPLING IN THE PRESENCE OF NON-RESPONSE STATISTICA, anno LXXIV, n. 3, 2014 REPLICATION VARIANCE ESTIMATION UNDER TWO-PHASE SAMPLING IN THE PRESENCE OF NON-RESPONSE Muqaddas Javed 1 Natonal College of Busness Admnstraton and Economcs, Lahore,

More information

Appendix B: Resampling Algorithms

Appendix B: Resampling Algorithms 407 Appendx B: Resamplng Algorthms A common problem of all partcle flters s the degeneracy of weghts, whch conssts of the unbounded ncrease of the varance of the mportance weghts ω [ ] of the partcles

More information

Small Area Estimation for Business Surveys

Small Area Estimation for Business Surveys ASA Secton on Survey Research Methods Small Area Estmaton for Busness Surveys Hukum Chandra Southampton Statstcal Scences Research Insttute, Unversty of Southampton Hghfeld, Southampton-SO17 1BJ, U.K.

More information

MLE and Bayesian Estimation. Jie Tang Department of Computer Science & Technology Tsinghua University 2012

MLE and Bayesian Estimation. Jie Tang Department of Computer Science & Technology Tsinghua University 2012 MLE and Bayesan Estmaton Je Tang Department of Computer Scence & Technology Tsnghua Unversty 01 1 Lnear Regresson? As the frst step, we need to decde how we re gong to represent the functon f. One example:

More information

CS 2750 Machine Learning. Lecture 5. Density estimation. CS 2750 Machine Learning. Announcements

CS 2750 Machine Learning. Lecture 5. Density estimation. CS 2750 Machine Learning. Announcements CS 750 Machne Learnng Lecture 5 Densty estmaton Mlos Hauskrecht mlos@cs.ptt.edu 539 Sennott Square CS 750 Machne Learnng Announcements Homework Due on Wednesday before the class Reports: hand n before

More information