Adaptively Transformed Mixed Model Prediction of General Finite Population Parameters

Size: px
Start display at page:

Download "Adaptively Transformed Mixed Model Prediction of General Finite Population Parameters"

Transcription

1 Adaptvely Transformed Mxed Model Predcton of General Fnte Populaton Parameters By Shonosuke Sugasawa and Tatsuya Kubokawa Aprl 2018 CENTER FOR RESEARCH AND EDUCATION FOR POLICY EVALUATION DISCUSSION PAPER NO. 19 CENTER FOR RESEARCH AND EDUCATION FOR POLICY EVALUATION (CREPE) THE UNIVERSITY OF TOKYO

2 Adaptvely Transformed Mxed Model Predcton of General Fnte Populaton Parameters SHONOSUKE SUGASAWA Rsk Analyss Research Center, The Insttute of Statstcal Mathematcs TATSUYA KUBOKAWA Faculty of Economcs, Unversty of Tokyo Abstract. For estmatng area-specfc parameters (quanttes) n a fnte populaton, a mxed model predcton approach s attractve. However, ths approach strongly depends on the normalty assumpton of the response values although we often encounter a non-normal case n practce. In such a case, transformng observatons to make them close to normalty s a useful tool, but the problem of selectng sutable transformaton stll remans open. To overcome the dffculty, we here propose a new emprcal best predctng method by usng a parametrc famly of transformatons to estmate a sutable transformaton based on the data. We suggest a smple estmatng method for transformaton parameters based on the profle lkelhood functon, whch acheves consstency under some condtons on transformaton functons. For measurng varablty of pont predcton, we construct an emprcal Bayes confdence nterval of the populaton parameter of nterest. Through smulaton studes, we nvestgate some numercal performances of the proposed methods. Fnally, we apply the proposed method to synthetc ncome data n Spansh provnces n whch the resultng estmates ndcate that the commonly used log-transformaton s not approprate. Key words: Confdence nterval; Emprcal Bayes; Fnte populaton; Mean squared error; Random effect; Small area estmaton. Ths verson: May 11,

3 1 Introducton The mxed model predcton based on random effect models has been wdely used n small area estmaton (Rao and Molna, 2015). The random effect models used n small area estmaton are manly dvded nto two models: the Fay-Herrot model (Fay and Herrot, 1979) and the nested error regresson model (Battese et al., 1988). Especally, the nested error regresson model has been used for estmatng populaton parameters n a fnte populaton. Here we consder a fnte populaton consstng of m areas and each area has N unts for = 1,..., m. Let Y j be a characterstc of the jth unt n the th area, the man purpose s to estmate the area-specfc parameter defned as µ = 1 N T (Y j ), (1) N j=1 where T ( ) s a known (user-specfed) functon. The smplest choce s T (x) = x, n whch µ corresponds to the fnte populaton mean, and many lteratures have been focused on ths case; Chambers et al. (2014), Jang and Lahr (2006), Lahr and Mukherjee (2007) and Schmt et al. (2016). On the other hand, as noted by Molna and Rao (2010), other forms of T ( ) are used n practce. For example, n poverty mappng, we often use FGT poverty measure T (x) = {(z x)/z} α I(x < z) (Foster et al., 1984), notng that µ corresponds to the poverty rate when α = 0. If all the unts Y j n the th area were observed, we could calculate the true value of µ. However, only a part of the unts are avalable n practce. Let n (< N ) be the number of sampled unts and y s = {y j, j = 1,..., n, = 1,..., m} be the sampled data. It s known that the drect estmator of µ usng the observed unts has hgh varablty, especally n the case that n s much smaller than N. In real applcaton, some covarates assocated wth Y j are avalable not only for sampled but also for nonsampled unts, whch are denoted by x j wth j = 1,..., N and = 1,..., m. Hence, one ams to estmate µ based on the sampled data and nformaton on covarates. To ths end, a typcal strategy s to assume that all the unts follow the nested error 2

4 regresson model: Y j = x t jβ + v + ε j, j = 1,..., N, = 1,..., m, (2) where x j and β are p-dmensonal vectors of covarates and regresson coeffcents, v s the area-specfc effect whch follows N(0, τ 2 ) and ε j s a samplng error dstrbuted as N(0, σ 2 ). Then, the condtonal dstrbuton of the non-sampled data Y j gven all the sampled data y s s gven by Y j y s N (x tjβ + n τ 2 σ 2 + n τ 2 (ȳ x t β), σ 2 τ 2 ) σ 2 + n τ 2, j = n + 1,..., N, (3) whch follows from the normalty of Y j under the model (2). Then the best predctor of µ under squared error loss s the condtonal expectaton E[µ y ], whch has the form µ E[µ y s ] = 1 { n T (y j ) + N j=1 N j=n +1 } E[T (Y j ) y s ]. (4) Here, the expectaton E[T (Y j ) y s ] can be computed va the Monte Carlo ntegraton by generatng a large number of random samples from the condtonal dstrbuton (3). Moreover, the best predctor µ depends on the unknown model parameters β, τ 2 and σ 2 n the model (2), so that these parameters should be replaced wth ther estmated counterparts. To ths end, one can estmate the model parameters n the model (2) based on the sampled data y s based on, for example, the maxmum lkelhood or restrcted maxmum lkelhood methods. It s observed that the key assumpton n dervng the best predctor (4) s the normalty of Y j n (2), whch enables us to obtan the smple expresson of the condtonal dstrbuton (3). However, we often encounter the case where the normalty assumpton s not plausble for Y j. In fact, n poverty mappng, Y j s a welfare varable lke ncome, thereby the dstrbuton of Y j could be skewed. In ths case, Molna and Rao (2010) proposed assumng the nested error model (2) for the transformed varables H(Y j ) nstead of Y j. If Y j s rght skewed, one may use H(x) = log x. However, we stll suffer from the msspecfcaton of the transformaton and the predctor of µ 3

5 under msspecfed transformaton would be based, thereby selectng sutable transformatons from the data s desred to valdate the predcton method. To overcome the dffculty, we propose the adaptvely transformed emprcal best predctng method n whch we use a parametrc famly of transformatons for data transformaton nstead of the use of specfed transformatons. We derve a form of the best predctor of µ and provde a smple estmatng method for transformaton parameters based on profle lkelhood functon, whch produces a consstent estmator under some regularty condtons. We also construct an emprcal Bayes confdence nterval of µ for measurng the varablty of the pont predcton. The proposed ntervals are shown to have O(m 1 ) coverage error, and we also suggest the parametrc bootstrap calbraton for confdence ntervals wth further accuracy. As related methods, L and Lahr (2007) suggested to use the Box-Cox transformaton (Box and Cox, 1964) for the data transformaton for robust estmaton of fnte populaton totals whle ther method was developed under models wthout random effects. Hence, our method would be more effcent. Concernng the emprcal Bayes confdence ntervals, Nandram (1999) derved an emprcal Bayes confdence ntervals of the fnte populaton means, whch corresponds to the case takng T (x) = x n (1). Ths paper s organzed as follows: In Secton 2, we descrbe the proposed predcton method as well as parameter estmaton of the model parameters. In Secton 3, we construct an emprcal Bayes confdence nterval of µ. In Secton 4, we present the results from smulaton studes and a data applcaton. In Secton 5, we gve conclusons and some dscussons. The techncal proofs are gven n Appendx. 2 Adaptvely Transformed Emprcal Best Predcton 2.1 Transformed best predctor Let H λ ( ) be a famly of transformatons wth parameter λ. The transformaton parameter λ mght be multdmensonal, but we treat λ as a scalar parameter for notatonal smplcty. The assumptons and specfc choces of H λ ( ) wll be dscussed n the subsequent secton. We assume that the transformed varable H λ (y j ) follows the nested 4

6 error regresson model: H λ (Y j ) = x t jβ + v + ε j, j = 1,..., N, = 1,..., m, (5) where x j and β are p-dmensonal vectors of covarates and regresson coeffcents, v and ε j are an area-specfc effect and a samplng error, respectvely. Here we assume that v and ε j are mutually ndependent and dstrbuted as v N(0, τ 2 ) and ε j N(0, σ 2 ) wth unknown two varance parameters τ 2 and σ 2. It s worth notng that, owng to the area effect v, the unts n the same area are mutually correlated whle the unts n the dfferent area are ndependent. Specfcally, from (5), t holds Cor(H λ (Y j ), H λ (Y k )) = (τ 2 + σ 2 ) 1 τ 2, j k, thereby the unts n the same area are mutually correlated and the degree of correlaton s determned by the rato τ 2 /σ 2. From the normalty assumptons of v and ε j, t follows that H λ (Y j ) N(x t j β, τ 2 + σ 2 ). Thus, the transformaton parameter λ can be chosen to make the transformed data H λ (y j ) close to normalty. We defne ϕ = (β t, τ 2, σ 2, λ) t, as the vector of unknown model parameters n (5). The estmaton procedure wll be gven n the subsequent secton. Let y s = {y j, j = 1,..., n, = 1,..., m} be the sampled data. From the model (5), we have H λ (Y j ) y s N(θ j, s 2 + σ2 ), j = n + 1,..., N, where θ j = x t jβ + τ 2 σ 2 + n τ 2 n j=1 (H λ (y j ) x t σ jβ), s = 2 τ 2 σ 2 + n τ 2. (6) Hence, the best predctor of µ gven n (1) can be obtaned as µ (y s ; ϕ) E[µ y s ] = 1 n T (y N j ) + j=1 N j=n +1 E[T H 1 λ (u j)], (7) where the expectaton s taken wth respect to u j N(θ j, s 2 + σ2 ), and T H 1 λ ( ) s the composte functon of T ( ) and H 1 λ, the nverse functon of H λ( ). Although the expectaton E[T H 1 λ (u j)] does not have a closed form n general, t can be easly computed va the Monte Carlo ntegraton. We call the best predctor (7) adaptvely 5

7 transformed best predctor (ATBP). 2.2 Estmaton of structural parameters We here consder estmatng the unknown model parameters ϕ n (5) based on the margnal lkelhood functon. Notng that the log-margnal lkelhood functon of ϕ s gven by L(ϕ) = 1 2 m log Σ 1 2 =1 1 2 m n log 2π + =1 m =1 {H λ (y ) X β} t Σ 1 {H λ (y ) X β} m n log H λ (y j), =1 j=1 (8) where (Σ ) kl = τ 2 +σ 2 I(k = l), H λ (y ) = (H λ (y 1 ),..., H λ (y n )) t, X = (x t 1,..., xt n ) t, and H λ ( ) denotes the dervatve of H λ( ). The maxmum lkelhood estmator of ϕ can be defned as the maxmzer of L(ϕ). For maxmzng the lkelhood functon L(ϕ), we frst note that the profle lkelhood functon of λ can be expressed as m n PL(λ) = ML(λ) + log H λ (y j), (9) =1 j=1 where ML(λ) s the maxmum lkelhood of the nested error regresson model wth response values H λ (y j ) and covarate vectors x j, whch can be effcently carred out by usng well-developed numercal method (e.g. Molna and Marhuenda, 2015). Usng the ease of the pont evaluaton of the profle lkelhood PL(λ), we can obtan the maxmzer of PL(λ) by usng, for example, the golden secton method (Brent et al., 1973). Once we obtan the estmator λ, we get the estmators of other parameters by applyng the nested error regresson model to the data set {H λ(y j ), x j }. For estmatng the two varance parameters τ 2 and σ 2, the restrcted maxmum lkelhood (RML) method (Jang, 1996) mght be more attractve than the maxmum lkelhood method. To mplement the RML estmaton, the frst three terms n (8) need to be changed to the restrcted maxmum lkelhood, but the transformaton parameter λ can be easly estmated n the same manner as the maxmum lkelhood 6

8 method based on the profle lkelhood functon. However, n ths paper, we consder only the maxmum lkelhood estmator for smplcty. 2.3 Class of transformatons We here consder the concrete choce of the famly of transformatons H λ ( ). To begn wth, we gve some condtons to be satsfed by the transformatons. Assumpton 1. (Class of transformatons) 1. H λ s a dfferentable and monotone functon, and the range of H λ s R for all λ. 2. For fxed x, H λ (x) as the functon of λ s dfferentable. 3. The functon H λ (w)/ λ, 2 H λ (w)/ λ 2 and 2 log H λ (w)/ λ2 wth w = H 1 λ (x) are bounded from the upper by C 1{exp(C 2 x) + exp( C 2 x)} wth some constants C 1, C 2 > 0. The frst condton s crucal n ths context. If the range of H λ s not R, but some subset A R, the nverse functon H 1 λ problems n computng the best predctor (7). cannot be defned on R \ A, whch causes When the observatons are postve valued, the Box-Cox (BC) transformaton (Box and Cox, 1964), H λ (x) = λ 1 (x λ 1) for λ 0 and H 0 (x) = log(x), s wdely used. However, t s known that the range of BC transformaton s truncated and not whole real lne, so that the BC transformaton cannot be used n ths context. An alternatve transformaton, called dual power (DP) transformaton, has been suggested by Yang (2006): H DP λ (x) = xλ x λ, x > 0, λ > 0, (10) 2λ where lm λ 0 H DP (x) = log x. It can be seen as the mean of two BC transformatons, λ and t s easy to confrm that the range of DPT s R, so that DPT can be used as a parametrc famly ncludng log-transformaton n ths context. The expresson of the nverse functon s requred n computng the transformed best predctor (7), and the 7

9 Jacoban s also needed for computng the profle lkelhood functon (9). These are gven by ( H DP( 1) λ (x) = λx + ) 1/λ 1 + λ 2 x 2 and dhλ DP(x) = 1 dx 2 (xλ 1 + x λ 1 ). In the context of small area estmaton, the DP transformaton was used n Sugasawa and Kubokawa (2017) n the Fay-Herrot model. The orgnal DP transformaton (10) can be used when the response varables are postve. When response varables are real valued, one may use the shfted-dp transformaton of the form H λ,c (x) = {(x + c) λ (x + c) λ }/2λ, where c (mn(y j ) + ε, ) wth specfed small ε > 0. Another attractve transformaton s the snh-arcsnh (SS) transformaton suggested n Jones and Pewsey (2009) n the context of dstrbuton theory, whch has the form H SS a,b (x) = snh(b snh 1 (x) a), x (, ), a (, ), b (0, ) (11) where snh(x) = (e x e x )/2 s the hyperbolc sne functon, snh 1 (x) = log(x + x 2 + 1), and two transformaton parameter a and b control skewness and tal heavness, respectvely. The nverse transformaton and the Jacoban are obtaned as H SS( 1) a,b (x) = snh(b 1 snh 1 (x) + a), and dha,b SS(x) 1 + Ha,b SS = b (x)2 dx 1 + x 2. These transformatons wll be used and compared n the applcaton presented n Secton Large sample propertes We here consder the large sample propertes of the estmator of structural parameters. To ths end, we assume the followng condton: Assumpton 2. (Assumptons under large m) 1. The true parameter vector ϕ 0 s an nteror pont of the parameter space Φ. 8

10 2. 0 < mn =1,...,m N max =1,...,m N <. 3. The elements of X are unformly bounded and X t X s postve defnte. 4. m 1 m =1 Xt Σ 1 X converges to a postve defnte matrx as m. Snce the asymptotc varance and covarance matrx of MLE can be derved from the Fsher nformaton matrx, we frst provde the Fsher nformaton matrx n the followng Theorem, where the proof s gven n Appendx. Theorem 1. We defne the Fsher nformaton I ϕk ϕ j = E[ 2 L(ϕ)/ ϕ k ϕ j ], then t follows that I τ 2 τ 2 = 1 2 I ββ = I λβ = I λλ = m =1 m =1 (1 t n Σ 1 1 n ) 2, I τ 2 σ 2 = 1 2 m =1 X t Σ 1 X, I βτ 2 = I βσ 2 = 0, I λσ 2 = m =1 m [ E =1 [ ] X t Σ 1 E H (1) λ (y ), I λτ 2 = H (1) λ ] (y ) t Σ 1 H (1) λ (y ) + m =1 1 t n Σ 2 1 n, I σ 2 σ 2 = 1 2 =1 =1 m =1 tr (Σ 2 ), m [ ] E z t Σ 2 H (1) λ (y ), m [ E z t Σ 1 1 n 1 t n Σ 1 [ ] E z t Σ 1 H (2) λ (y ) m n =1 j=1 H (1) ] λ (y ), [ ] 2 E λ 2 log H λ (y j), where H (k) λ (y ) = k H λ (y )/ λ k for k = 1, 2, z = H λ (y ) X β, and E[ ] denotes the expectaton wth respect to y j s followng the model (5). Then, under Assumptons 1 and 2, the maxmum lkelhood estmator ϕ s asymptotcally dstrbuted as ϕ N(ϕ, I 1 ϕ ). From Theorem 1, t s observed that the nformaton matrx of (β t, τ 2, σ 2 ) does not depend on the transformaton parameter λ, and ther expressons are the same as those of the tradtonal nested error regresson models. Whle the two varance parameters τ 2 and σ 2 are orthogonal to β n the sense that I βτ 2 = I βσ 2 = 0, the transformaton parameter λ s not orthogonal to the others. The expectatons appeared n the Fsher matrx s not analytcally tractable, but t can be easly estmated by replacng the expectaton wth ts sample counterpart. In the case that λ s multdmensonal, the extenson of Theorem 1 s straghtforward. The expressons of H (k) λ (y ) and 2 log H λ (y j)/ λ 2 could be analytcally complcated and requre tedous algebrac 9

11 calculatons. In such a case, the numercal dervatve can be useful snce we need to compute only the pont values of the dervatves. 3 Emprcal Bayes Confdence Intervals 3.1 Asymptotcally vald confdence ntervals Measurng the varablty of the transformed emprcal best predctor µ s an mportant ssue n practce. Tradtonally, the mean squared error (MSE) of µ has been used, and several methods rangng from analytcal method (Prasad and Rao, 1990) to numercal methods (Hall and Mat, 2006) have been consdered. On the other hand, an emprcal Bayes confdence nterval of µ s more preferable snce t can provde dstrbutonal nformaton than MSE though constructon of the confdence nterval s generally dffcult. Here, we derve an asymptotcally vald emprcal Bayes confdence nterval of µ. The key to the confdence nterval s the condtonal dstrbuton of µ gven y s. Notng that Cov(H λ (Y j ), H λ (Y k ) y s ) = Var(v y s ) = s 2 for j k, t follows that (H λ (Y,n +1),..., H λ (Y N )) t y s N((θ,n +1,..., θ N ) t, s 2 1 N n 1 t N n + σ 2 I N n ), namely, the each component has the expresson H λ (Y j ) y = θ j + s z + σw j, j = n + 1,..., N, where z and w j are mutually ndependent standard normal random varables, and θ j and s are defned n (6). Then the posteror dstrbuton of µ can be expressed as µ y d = 1 N n T (y j ) + j=1 N j=n +1 T H 1 λ (θ j + s z + σw j ), (12) whch s a complex functon of standard normal random varables z and w j. However, random samples from the condtonal dstrbuton (12) can be easly smulated. We defne Q a (y, ϕ) as the lower 100a% quantle pont of the posteror dstr- 10

12 buton of µ wth the true ϕ, whch satsfes P(µ Q a (y, ϕ) y ) = a. Hence, the Bayes confdence nterval of µ wth nomnal level 1 α s obtaned as I α = (Q α/2 (y, ϕ), Q 1 α/2 (y, ϕ)), whch holds that P(µ I α ) = 1 α. However, the nterval I α depends on the unknown parameter ϕ, so that the feasble verson of I α s obtaned by replacng ϕ wth ts estmator ϕ, namely I N α = (Q α/2 (y s, ϕ), Q 1 α/2 (y s, ϕ)), (13) whch we call nave emprcal Bayes confdence nterval of µ. The two quantles appeared n (13) can be computed by generatng a large number of random samples from the condtonal dstrbuton (12). Owng to the asymptotc propertes of ϕ, the coverage probablty of the nave nterval (13) converges to the nomnal level as the number of areas m tends to nfnty as shown n the followng theorem proved n Appendx. Theorem 2. Under Assumptons 1 and 2, t holds P(µ I N α ) = 1 α + O(m 1 ). 3.2 Bootstrap calbrated ntervals As shown n Theorem 2, the coverage error of the nave nterval (13) s of order m 1, whch s not necessarly neglgble when m s not suffcently large. Snce the number of m s usually moderate n practce, the calbrated ntervals wth hgher accuracy would be valuable. Followng Chatterjee, et al. (2008), Hall and Mat (2006), we construct a second order corrected emprcal Bayes confdence nterval I C α satsfyng P (µ I C α ) = 1 α + o(m 1 ). To begn wth, we defne the bootstrap estmator of the coverage probablty of the nave nterval. Let Y j estmated model (5) wth ϕ = ϕ, and y s Moreover, let µ be the parametrc bootstrap samples generated from the be the bootstrap verson of µ based on Y j s. = {Y j, j = 1,..., n, = 1,..., m}. Snce the coverage probablty s P(Q a/2 (y s, ϕ) µ Q 1 a/2 (y s, ϕ)), ts parametrc bootstrap estmator can be defned as CP(a) = E [ I { Q a/2 (ys, ϕ) µ Q 1 a/2 (ys, ϕ) }], 11

13 where the expectaton s taken wth respect to the bootstrap samples Yj s. Based on the coverage probablty, we defne the calbrated nomnal level a as the soluton of the equaton CP(a ) = 1 α, whch can be solved by the bsectonal method (Brent, 1973). Then, the calbrated nterval s gven by I C α = (Q a /2(y s, ϕ), Q 1 a /2(y s, ϕ)), (14) whch has second order accuracy as shown n the followng theorem proved n Appendx. Theorem 3. Under Assumptons 1 and 2, t holds P(µ I C α ) = 1 α + o(m 1 ). 4 Numercal Studes 4.1 Evaluaton of predcton errors We frst evaluate the predcton errors of the proposed predctors together wth some exstng methods. To ths end, we consdered the followng data generatng processes: (A) (2λ) 1 (Y λ j Y λ j ) = β 0 + β 1 X j + v + ε j, v N(0, τ 2 ), ε j N(0, σ 2 ) (B) (2λ) 1 (Y λ j Y λ j ) = β 0 + β 1 X j + v + ε j, v t 5 (0, τ 2 ), ε j t 5 (0, σ 2 ) (C) Y j = exp(β 0 + β 1 X j )v ε j, v Γ(1/τ 2, 1/τ 2 ), ε j Γ(1/σ 2, 1/σ 2 ) (D) Y j = 0.2 exp(u j ) + 0.8U 2 j, U j = β 0 + β 1 X j + v + ε j, v N(0, τ 2 ), ε j N(0, σ 2 ), where = 1,..., m, j = 1,..., N, β 0 = 1, β 1 = 3, τ = 0.3, σ = 0.7, and X j were ntally generated from U(1, 2) and fxed through smulaton experments. In model () and (), we consdered three values for λ, λ = 0, 0.2 and 0.4. In ths study, we set N = 200 and m = 25, and we focus on estmatng the rato of the observaton wth values under z, namely µ = 1 N N I(Y j < z), = 1,..., m, (15) j=1 where z s defned as 0.6 tmes medan of Y j s. 12

14 Concernng the area sample szes, we dvded m = 25 areas nto fve groups wth equal number of areas, and we set the same number of n wthn the same groups. The group pattern of n we consdered was (20, 40, 60, 80, 100). Among the generated Y 1,..., Y N, we used frst n observatons y 1 (= Y 1 ),..., y n (= Y n ) as the sampled data. Then, based on the sampled data y j s and covarates X j s, we computed the predcted value of µ based on the four methods: the proposed flexble transformed emprcal best predcton (ATP) method wth DP transformaton (10), the transformed emprcal best predcton (TP) method proposed by Molna and Rao (2010) wth logtransformaton, the emprcal best predcton (EBP) method by drectly applyng the nested error regresson model to the non-transfdemd observaton y j, and the drect estmator (DE) gven by µ D = 1 n I(y j < z), = 1,..., m. n j=1 It should be noted that the TP method s correctly specfed n scenaro (A) wth λ = 0 whle the ATP method s overfttng n ths case. In the other cases n scenaro (A), the ATP method uses the same model as the data generatng model. Scenaro (B) s smlar to (A), but the dstrbuton of error terms have the t-dstrbuton. In scenaro (C) and (D), the data generaton models do not concdes wth any methods. To compare the performances of the four methods, we computed the square root of mean squared error (RMSE) defned as RMSE = 1 R R r=1 ( µ (r) ) 2, µ (r) where R = 2000 n ths study, µ (r) and µ (r) are the estmated and true values of µ, respectvely, n the rth teraton. The obtaned values of RMSEs are averaged over the same groups and the results are reported n Table 1. From Table 1, we can observe that the proposed method provdes better estmates than three exstng methods n almost all cases. As mentoned n the above, ATP method s overfttng n scenaro (A) wth λ = 0 whle TP method s correctly specfed. 13

15 However, the results show that the performances between ATP and TP are almost the same, whch mght ndcate that the MSE nflaton due to overfllng s not serous. The smlar observaton can be done n scenaro (B) wth λ = 0. On the other hand, n the other cases, the proposed ATP method can mprove the estmaton accuracy of TP method as well as EBP and DE methods, by adaptvely estmatng the transformaton parameter from the data. 4.2 Fnte sample evaluaton of emprcal Bayes confdence ntervals We next evaluate the fnte sample performances of the emprcal Bayes confdence ntervals gven n Secton 3. To ths end, we consdered the followng data generatng process for populaton varables Y j : (2λ) 1 (Y λ j Y λ j ) = β 0 + β 1 X j + v + ε j, v N(0, τ 2 ), ε j N(0, σ 2 ), where j = 1,..., N and = 1,..., m wth n = 200. We set the true parameter values λ = 0.3, β 0 = 1, β 1 = 3, τ = 0.3, σ = 0.7, and X j were ntally generated from the unform dstrbuton on (1, 2), whch were fxed through smulaton runs. We focused on the same populaton parameter gven n (15). Among the generated Y 1,..., Y N, the frst n = 50 observatons Y 1,..., Y n were used as the sampled data y 1,..., y n. Then, based on y j s and X j s, we computed two types pf confdence ntervals for µ, nave confdence nterval (13) and bootstrap calbrated confdence nterval (14), whch are denoted by NCI and BCI, respectvely. To evaluate the performances of two confdence ntervals, based on R = 2000 smulaton runs, we computed the emprcal coverage probablty (CP) and the average length of confdence nterval (AL), whch are defned as where µ (r) CP = 1 R R r=1 I(µ (r) s the true value and CI (r) CI (r) ) and AL = 1 R R r=1 CI (r), s NCI or BCI n the rth teraton. In Fgure, we show the obtaned CP and AL n each area for two cases m = 20 and m =

16 Table 1: The group-wse averaged values of smulated square root of mean squared errors (RMSE) for four methods, proposed adaptvely transformed predcton (ATP) method, Molna and Rao s transformed predcton (TP) method, emprcal best predcton (EBP) method wthout any data transformatons, and drect estmator (DE) for eght scenaros. All the values n the table are multpled by 100. Area sample sze n Scenaro Method ATP (A) λ = 0 TP EBP DE ATP (A) λ = 0.2 TP EBP DE ATP (A) λ = 0.4 TP EBP DE ATP (B) λ = 0 TP EBP DE ATP (B) λ = 0.2 TP EBP DE ATP (B) λ = 0.4 TP EBP DE ATP (C) TP EBP DE ATP (D) TP EBP DE Concernng CP, the nave method tends to produce shorter confdence ntervals, so that the coverage probablty s smaller than the nomnal level for all areas, whch s more serous n case m = 20 than m = 30. Ths comes from the accuracy of NCI presented n Theorem 2, whch mentons that the coverage accuracy of NCI s O(m 1 ). On the 15

17 other hand, bootstrap method can mprove the drawbacks of the nave method, and provdes reasonable CP around the nomnal level under both m = 20 and m = 30. The results clearly support the theoretcal property gven n Theorem 3 presentng BCI s second order accurate. Snce undervaluaton of estmaton rsk may produce serous problems n practce, we should be use the bootstrap method when the number of areas s not large. m=20 m=20 CP BCI NCI AL BCI NCI area area m=30 m=30 CP BCI NCI AL BCI NCI area area Fgure 1: Smulated coverage probablty (CP) and average length (AL) of two confdence ntervals, nave confdence nterval (NCI) and bootstrap calbrated confdence nterval (BCI) for m = 20 (upper) and m = 30 (lower). 16

18 4.3 Example: poverty mappng n Span We appled the proposed method to estmaton of poverty ndcators n Spansh provnces, usng the synthetc ncome data avalable n sae package (Molna and Marhuenda, 2015) n R language, n whch the equalzed annual net ncome are gven. The smlar data set was used n Molna and Rao (2010) and Molna et al. (2014). As auxlary varables, we consdered the ndcators of the fve qunquennal groupngs of the varable age, the ndcator of havng Spansh natonalty, the ndcators of the three levels of the varable educaton level, and the ndcators of the three categores of the varable employment, wth categores unemployed, employed and nactve. For each auxlary varable, one of the categores was consdered as base reference, omttng the correspondng ndcator and then ncludng an ntercept n the model. The poverty measures we focused on were the FGT poverty measures (Foster et al., 1984): ( ) x z α T (x) = I(x < z), z where z s a fxed poverty lne, and t corresponds to poverty ncdence or head count rato (α = 0), poverty gap (α = 1) and poverty severalty (α = 2). In ths example, we focused on poverty rato (α = 0), and we set z as the 0.6 tmes the medan of ncomes. Let E j be the ncome of jth ndvdual n th area. Such data are avalable for m = 52 areas and the sample szes are are rangng from 20 to Snce the small porton of E j take negatve values, we assume the nested error regresson model wth shfted-dpt: SDP: (2λ) 1 { (E j + c) 1/λ (E j + c) 1/λ} = x t jβ + v + ε j, (16) notng that the model has two transformaton parameters λ and c. We also consdered two submodel of (16). In both models, we set c = c mn(e j ) + 1 to ensure that E j + c s postve for all (, j). The frst submodel s dened by puttng c = c n (16), whch s referred to SDP-s. The second sub-model s the shfted-log transformaton 17

19 model: SL: log(e j + c ) = x t jβ + v + ε j, (17) whch has no longer parameters and was used n Molna and Rao (2010). Fnally, we also appled the model wth snh-arcsnh transformaton presented n Secton 2.3: SS: snh(b snh 1 (E j ) a) = x t jβ + v + ε j, (18) whch has two transformaton parameter a and b. By maxmzng the profle lkelhood functon of transformaton parameters, we obtaned as follows: (SDP) λ = ( ), ĉ = 4319 (170.69) (SDP-s) λ = ( ) (SS) â = ( ), b = ( ), where the values n the parentheses are the correspondng standard errors calculated from the Fsher nformaton matrx gven n Theorem 1. From the above result, t can be observed that the approxmate 95% confdence ntervals of the transformaton parameter λ n SDP and SDP-s are bounded from 0, whch means that the logtransformed model would be napproprate. Moreover, we computed AIC and BIC based on the maxmum margnal lkelhood, and the results are gven n Table 2 n whch the values scaled by the number of sampled unts (N = 17199) are reported. The results show that the SDP fts the best among the four models n terms of both AIC and BIC whle the SL model fts the worst. Hence, the use of parametrc transformaton can mprove AIC and BIC n ths applcaton. To see the fttng of the models n terms of normalty assumpton of the error terms, we computed the standardzed resduals defned as r j = Ĥ(y j ) x t j β τ 2 + σ 2, j = 1,..., n, = 1,..., m, 18

20 where Ĥ s the estmated transformaton functon, notng that r j s asymptotcally follow the standard normal dstrbutons f the assumed model s correctly specfed. In Fgure 2, we shows QQ-plots of r j s of the four models. We can observe that the normalty assumptons n the three models wth parametrc transformatons, SDP, SDP-s and SS, seem plausble from Fgure 2. However, the QQ-plot for SL shows that the dstrbuton of standardzed resduals s skewed and the normalty assumpton would not be approprate. Fnally, we calculated the estmated values of the poverty rates µ from the drect estmator (DE), and four model based methods. For computng the emprcal best predctor of µ, we used 500 random samples for Monte Carlo ntegraton. The obtaned values are gven n Table 3. In the parentheses, we provded the bootstrap emprcal Bayes confdence ntervals of µ wth 200 bootstrap teratons. It can be seen that the drect estmator produces qute dfferent estmates of µ from the model based methods when area sample sze n s not large lke n Avla and Tarragona. On the other hand, n provnces wth large samples szes, the dfferences of estmates between DE and the other model methods are relatvely small. We can also observe that SL method tend to produce smaller estmates than the other model based methods. However, from AIC and BIC values and QQ-plot n Fgure 2, the valdty of SL method s hghly doubtful n ths case, so that the predcted values gven n Table 3 would not be relable. As shown n Table 3, the use of dfferent transformaton functon leads to sgnfcantly dfferent predcted values of µ. Hence, t would be valuable to select an adequate transformaton functon by estmatng transformaton parameters based on the sampled data. Table 2: AIC and BIC of four models. The values are scaled by the number of sampled unts (N = 17199). SDP SDP-s SS SL AIC BIC

21 SDPs Sample Quantles Sample Quantles SDP Theoretcal Quantles Theoretcal Quantles SS SL Sample Quantles Sample Quantles Theoretcal Quantles Theoretcal Quantles Fgure 2: QQ-plots of standardzed resduals n four models. 5 Conclusons and Dscusson We have ntroduced the use of a parametrc famly of transformatons for estmatng (predctng) general area specfc parameters based on the mxed effects models. We have provded the best predctor of the parameter as well as the maxmum lkelhood method for estmatng model parameters. Moreover, for measurng varablty of the predctor, we constructed an emprcal Bayes confdence nterval of the area parameter. The smulaton and emprcal studes have revealed that the use of parametrc 20

22 Table 3: Estmated poverty rates from the drect estmator (DE) and four model based methods n fve provnces. The bootstrap emprcal Bayes confdence ntervals are gven n the parenthess. area n DE SDP SDP-s SS SL Avla (1.63, 4.05) (1.87, 4.31) (2.00, 4.50) (1.57, 3.62) Tarragona (6.02, 10.42) (6.38, 10.82) (7.28,11.27) (5.83, 9.35) Santander (5.75, 8.07) (6.03, 8.03) (6.38, 8.49) (5.21, 6.99) Sevlla (4.15, 5.75) (4.45,6.07) (4.77, 6.35) (3.73, 4.90) Ovedo (4.14, 5.44) (4.45, 5.51) (4.75, 6.00) (3.77, 4.74) transformatons would mprove the predcton accuracy of the exstng method usng specfed transformatons. Although we consdered an emprcal Bayes approach n ths paper, the herarchcal Bayes approach as consdered n Molna et al. (2014), by assgnng some pror dstrbutons for model parameters, would be useful. Moreover, one may use more flexble method for the regresson part lke the penalzed splne as used n Opsomer et al. (2008). The detaled nvestgaton of these ssues are left to a valuable future study. Acknowledgement Shonosuke Sugasawa and Tatsuya Kubokawa are supported by Grant-n-Ad for Scentfc Research (16H07406, 15H01943 and ) from Japan Socety for the Promoton of Scence. Appendx A1. Proof of Theorem 1. From the lkelhood functon (8), ts frst order derva- 21

23 tves are gven by L m β = X t Σ 1 z, =1 L σ 2 = 1 2 =1 m =1 tr (Σ 1 ) 1 2 L τ 2 = 1 2 m =1 L m λ = z t Σ 1 H (1) λ (y ) + m =1 z t Σ 2 z, m n =1 j=1 1 t n Σ 1 1 n 1 2 λ log H λ (y j), m =1 z t Σ 1 1 n 1 t n Σ 1 z where z = H λ (y ) X β. Snce E[z ] = 0, t follows that E[ 2 L/ β τ 2 ] = E[ 2 L/ β σ 2 ] = 0. The other elements of the Fsher nformaton can be obtaned by a straghtforward calculaton. Moreover, under Assumptons 1 and 2, the each element of the Fsher nformaton matrx s fnte, so that the asymptotc normalty of ϕ follows. A2. Proof of Theorem 2. Let ϕ 0 s the true values of parameters. It suffces to show that P (µ Q a (y, ϕ)) = a + O(m 1 ) for a (0, 1). We frst note that It holds that P (µ Q a (y, ϕ)) = E[P (µ Q a (y, ϕ) y s )] = E[F (Q a (y, ϕ); y, ϕ 0 )], where F ( ; y, ϕ 0 ) s a dstrbuton functon of µ gven y. Let G(y, ϕ, ϕ 0 ) = F (Q a (y, ϕ); y, ϕ 0 ), notng that 0 G(y, ϕ, ϕ 0 ) 1 and G(y, ϕ 0, ϕ 0 ) = a. The Taylor expanson of G(y, ϕ, ϕ 0 ) shows that G(y, ϕ, ϕ 0 ) = G(y, ϕ 0, ϕ 0 ) + j G ϕj (y, ϕ, ϕ 0 ) ϕ=ϕ0 ( ϕ j ϕ j ) + 1 G ϕj ϕ 2 k (y, ϕ, ϕ 0 ) ϕ=ϕ0 ( ϕ j ϕ j )( ϕ k ϕ k ) j,k + 1 G ϕj ϕ 6 k ϕ l (y, ϕ, ϕ 0 ) ϕ=ϕ ( ϕ j ϕ j )( ϕ k ϕ k )( ϕ l ϕ l ), j,k,l where ϕ s on the lne connectng ϕ and ϕ 0. Then, t follows that P (µ Q a (y, ϕ)) = E[G(y, ϕ, ϕ 0 )] = a + R R R 3, 22

24 where [ R 1 = E G ϕ (y, ϕ, ϕ 0 ) ] ϕ=ϕ0 ( ϕ ϕ 0 ) R 2 = [ E G ϕj ϕ k (y, ϕ, ϕ 0 ) ] ϕ=ϕ0 ( ϕ j ϕ j )( ϕ k ϕ k ) j,k R 3 = [ E G ϕj ϕ k ϕ l (y, ϕ, ϕ 0 ) ] ϕ=ϕ ( ϕ j ϕ j )( ϕ k ϕ k )( ϕ l ϕ l ). j,k,l Usng the Cauchy-Schwarz nequalty, we have [ E G ϕj ϕ k (y, ϕ, ϕ 0 ) ] ϕ=ϕ0 ( ϕ j ϕ j )( ϕ k ϕ k ) { } 1 { } 1 E[( ϕ j ϕ j ) 4 4 ] E[( ϕ k ϕ k ) 4 4 ] E [G ϕj ϕ k (y, ϕ, ϕ 0 ) 2 ϕ=ϕ0 ]. From the asymptotc normalty of ϕ gven n Theorem 1, t holds that E[ ϕ k ϕ k r ] = O(m r/2 ). Moreover, snce the range of G(y, ϕ, ϕ 0 ) s (0, 1), the partal dervatves of G(y, ϕ, ϕ 0 ) are bounded. Then, we obtan R 2 = O(m 1 ). Usng the smlar evaluaton, we can show that R 3 = O(m 1 ). Regardng R 1, t s noted that [ E G ϕ (y, ϕ, ϕ 0 ) ] [ ] ϕ=ϕ0 ( ϕ ϕ 0 ) = E G ϕ (y, ϕ, ϕ 0 )E[ ϕ ϕ 0 y ]. From Lohr and Rao (2009), t holds E[ ϕ ϕ 0 y ] = m 1 b ϕ I 1 ϕ L (y, ϕ 0 )/ ϕ + o p (m 1 ), where m =1 L (y, ϕ 0 ) L(ϕ) and b ϕ = lm m me[ ϕ ϕ 0 ] s the asymptotc bas of ϕ. Hence, we have [ ] E G ϕ (y, ϕ, ϕ 0 )E[ ϕ ϕ 0 y ] = 1 [ m E [G ϕ(y, ϕ, ϕ 0 )] b ϕ E G ϕ (y, ϕ, ϕ 0 )I 1 ϕ ] ϕ L (y ; ϕ 0 ) + o(m 1 ), whch s O(m 1 ). Therefore, the proof s completed. A3. Proof of Theorem 3. From the proof of Theorem 2, we have F a (ϕ 0 ) P (µ Q a (y, ϕ)) = a + c a(ϕ 0 ) m + o(m 1 ), 23

25 where c a (ϕ) s a smooth functon of ϕ. Let a and â be satsfyng F a (ϕ 0 ) = a and F a ( ϕ) = a, respectvely. Then, from the above expanson, we have â a = o(m 1 ), thereby P (µ Qâ (y, ϕ)) = P (µ Q a (y, ϕ)) + o(m 1 ) = a + o(m 1 ), whch completes the proof. A4. Checkng assumptons of transformatons. We here check the assumpton 3 n Assumpton 1 for the dual power (DP) transformaton (10) and snh-arcsnh (SS) transformaton (11). (DP transformaton) puttng x = t for t > 0, we have We frst note that H 1 λ (x) = O(x1/λ ) as x. By H 1 λ (x) = ( 1 + λ 2 t 2 λt) 1/λ = as t. A straghtforward calculaton shows that 1 ( 1 + λ 2 t 2 + λt) 1/λ = O(t 1/λ ) H λ (x) λ = xλ log x + x λ log x 2λ + xλ x λ 2λ 2, thereby, t follows that H λ λ (H 1 λ (x)) = O( x log x ) + O( x 1 log x ) + O( x ) + O( x 1 ) = O( x log x ) as x. Moreover, snce 2 H λ (x) λ 2 = xλ (log x) 2 x λ (log x) 2 xλ x λ 2λ λ 3, the smlar evaluaton leads to 2 H λ (w)/ λ 2 = O( x (log x ) 2 ) as x. Regardng 2 log H λ (x)/ λ2, t holds that 2 log H λ (w) λ 2 = 4(log w) 2 w 2 (w λ 1 + w λ 1 ) 2 = O((log x )2 x 2 ) 24

26 as x, so that the DP transformaton satsfes the assumpton. When the locaton parameter s used, namely, H λ,c (x) = {(x + c) λ (x + c) λ }/2λ, t s noted that k H λ,c (x)/ c k = k H λ,c (x)/ x k, so that the qute smlar evaluaton shows that the shfted-dp transformaton also satsfes the assumpton. (SS transformaton) It follows that H a,b (x) a = cosh(b snh 1 (x) a), H a,b (x) b = cosh(b snh 1 (x) a) snh 1 (x). Note that snh 1 (x) = O(log x ) as x, so that H 1 a,b (x) = O(exp(b 1 log x )) = O( x 1/b ). Then, we have H a,b a (H 1 a,b (x)) = O(exp(b log x 1/b )) = O( x ), H a,b b (H 1 a,b (x)) = O(exp(b log x 1/b ) log x 1/b ) = O( x log x ), as x. Moreover, t holds that 2 H a,b (x) 2 a 2 H a,b (x) a b = snh(b snh 1 (x) a), 2 H a,b (x) 2 b = snh(b snh 1 (x) a) snh 1 (x), = snh(b snh 1 (x) a){snh 1 (x)} 2 thereby the smlar evaluaton shows that 2 H a,b (x)/ 2 a = O( x ), 2 H a,b (x)/ 2 b = O( x (log x ) 2 ) and 2 H a,b (x)/ a b = O( x log x ) as x. On the other hand, a straghtforward calculaton shows that a log H a,b (x) = H a,b(x) H a,b (x) 1 + H a,b (x) 2, a b log H a,b (x) = 1 b + H a,b(x) H a,b (x) 1 + H a,b (x) 2, b whch are bounded by the functon H a,b (x)/ a and H a,b (x)/ b, respectvely. A straghtforward calculatons show that the second partal dervatves of log H a,b (x) are bounded by polynomal functons of the second partal dervatves of H a,b (x) and H a,b (x), thereby the assumpton s satsfed. 25

27 References [1] Battese, G.E., Harter, R.M. and Fuller, W.A. (1988). An error-components model for predcton of county crop areas usng survey and satellte data. Journal of the Amercan Statstcal Assocaton, 83, [2] Brent, R. (1973). Algorthms for Mnmzaton wthout Dervatves. Englewood Clffs N.J. Prentce-Hall. [3] Box, G.E.P. and Cox, D.R. (1964). An analyss of transformaton (wth dscusson). Journal of the Royal Statstcal Socety: Seres B, 26, [4] Chambers, R., Chandra, H., Salvat, N. and Tzavds, N. (2014) Outlner robust small area estmaton. Journal of the Royal Statstcal Socety: Seres B, 76, [5] Chatterjee, S., Lahr, P. and L, H. (2008). Parametrc bootstrap approxmaton to the dstrbuton of EBLUP and related predctons ntervals n lnear mxed models. The Annals of Statstcs, 36, [6] Fay, R. and Herrot, R. (1979). Estmators of ncome for small area places: an applcaton of James Sten procedures to census. Journal of the Amercan Statstcal Assocaton, 74, [7] Foster, J., Greer, J. and Thorbecke, E. (1984). A class of decomposable poverty measures. Econometrca, 52, [8] Hall, P. and Mat, T. (2006). On parametrc bootstrap methods for small area predcton. Journal of the Royal Statstcal Socety: Seres B, 68, [9] Jang, J. (1996). REML estmaton: asymptotc behavor and related topcs. The Annals of Statstcs, 24, [10] Jang, J. (2010). Large Sample Technques for Statstcs. Sprnger. [11] Jang, J. and Lahr, P. (2006). Estmaton of fnte populaton doman means: a model-asssted emprcal best predcton approach. Journal of the Amercan Statstcal Assocaton, 101, [12] Jones, M. C. and Pewsey, A. (2009). Snh-arcsnh dstrbutons. Bometrka, 96,

28 [13] Lahr, P. and Mukherjee, K. (2007). On the desgn consstency property of herarchcal Bayes estmators n fnte populaton samplng. The Annals of Statstcs, 35, [14] L, Y, and Lahr, P. (2007). Robust model-based and model-asssted predctors of the fnte populaton total. Journal of the Amercan Statstcal Assocaton, 102, [15] Lohr, S. L. and Rao, J. N. K. (2009). Jackknfe estmaton of mean squared error of small area predctors n nonlnear mxed models. Bometrka, 96, [16] Molna, I. and Marhuenda, Y. (2015). sae: an R package for small area estmaton. The R Journal, 7, [17] Molna, I. and Rao, J. N. K. (2010). Small area estmaton of poverty ndcators. Canadan Journal of Statstcs, 38, [18] Molna, I., Nandram, B. and Rao, J. N. K. (2014). Small area estmaton of general parameters wth applcaton to poverty ndcators: A herarchcal Bayes approach. The Annals of Appled Statstcs, 8, [19] Nandram, B. (1999). An emprcal Bayes predcton nterval for the fnte populaton mean of a small area. Statstca Snca, 9, [20] Opsomer, J. D., Claeskens, G., Ranall, M. G., Kauermann, G. and Bredt, F. J. (2008). Non-parametrc small area estmaton usng penalzed splne regresson. Journal of the Royal Statstcal Socety, B, 70, [21] Rao, J.N.K. and Molna, I. (2015) Small Area Estmaton, 2nd Edton. Wley. [22] Schmd, T., Tzavds, N., Münnch, R. and Chambers, R. (2016) Outler robust small-area estmaton under spatal correlaton. Scandnavan Journal of Statstcs, 43, [23] Sugasawa, S. and Kubokawa, T. (2017). Transformng response values n small area predcton. Computatonal Statstcs & Data Analyss, n press. [24] Yang, Z. L. (2006). A modfed famly of power transformatons. Economcs Letters, 92,

29 [25] Yoshmor, M. and Lahr, P. (2014). A second-order effcent emprcal Bayes confdence nterval. The Annals of Statstcs, 42,

arxiv: v3 [stat.me] 11 Jun 2018

arxiv: v3 [stat.me] 11 Jun 2018 Adaptvely Transformed Mxed Model Predcton of General Fnte Populaton Parameters SHONOSUKE SUGASAWA Center for Spatal Informaton Scence, The Unversty of Tokyo arxv:1705.04136v3 [stat.me] 11 Jun 2018 TATSUYA

More information

Small Area Interval Estimation

Small Area Interval Estimation .. Small Area Interval Estmaton Partha Lahr Jont Program n Survey Methodology Unversty of Maryland, College Park (Based on jont work wth Masayo Yoshmor, Former JPSM Vstng PhD Student and Research Fellow

More information

On Outlier Robust Small Area Mean Estimate Based on Prediction of Empirical Distribution Function

On Outlier Robust Small Area Mean Estimate Based on Prediction of Empirical Distribution Function On Outler Robust Small Area Mean Estmate Based on Predcton of Emprcal Dstrbuton Functon Payam Mokhtaran Natonal Insttute of Appled Statstcs Research Australa Unversty of Wollongong Small Area Estmaton

More information

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4)

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4) I. Classcal Assumptons Econ7 Appled Econometrcs Topc 3: Classcal Model (Studenmund, Chapter 4) We have defned OLS and studed some algebrac propertes of OLS. In ths topc we wll study statstcal propertes

More information

Bias-correction under a semi-parametric model for small area estimation

Bias-correction under a semi-parametric model for small area estimation Bas-correcton under a sem-parametrc model for small area estmaton Laura Dumtrescu, Vctora Unversty of Wellngton jont work wth J. N. K. Rao, Carleton Unversty ICORS 2017 Workshop on Robust Inference for

More information

Chapter 11: Simple Linear Regression and Correlation

Chapter 11: Simple Linear Regression and Correlation Chapter 11: Smple Lnear Regresson and Correlaton 11-1 Emprcal Models 11-2 Smple Lnear Regresson 11-3 Propertes of the Least Squares Estmators 11-4 Hypothess Test n Smple Lnear Regresson 11-4.1 Use of t-tests

More information

Numerical Heat and Mass Transfer

Numerical Heat and Mass Transfer Master degree n Mechancal Engneerng Numercal Heat and Mass Transfer 06-Fnte-Dfference Method (One-dmensonal, steady state heat conducton) Fausto Arpno f.arpno@uncas.t Introducton Why we use models and

More information

Linear Approximation with Regularization and Moving Least Squares

Linear Approximation with Regularization and Moving Least Squares Lnear Approxmaton wth Regularzaton and Movng Least Squares Igor Grešovn May 007 Revson 4.6 (Revson : March 004). 5 4 3 0.5 3 3.5 4 Contents: Lnear Fttng...4. Weghted Least Squares n Functon Approxmaton...

More information

Estimation: Part 2. Chapter GREG estimation

Estimation: Part 2. Chapter GREG estimation Chapter 9 Estmaton: Part 2 9. GREG estmaton In Chapter 8, we have seen that the regresson estmator s an effcent estmator when there s a lnear relatonshp between y and x. In ths chapter, we generalzed the

More information

Global Sensitivity. Tuesday 20 th February, 2018

Global Sensitivity. Tuesday 20 th February, 2018 Global Senstvty Tuesday 2 th February, 28 ) Local Senstvty Most senstvty analyses [] are based on local estmates of senstvty, typcally by expandng the response n a Taylor seres about some specfc values

More information

Chapter 13: Multiple Regression

Chapter 13: Multiple Regression Chapter 13: Multple Regresson 13.1 Developng the multple-regresson Model The general model can be descrbed as: It smplfes for two ndependent varables: The sample ft parameter b 0, b 1, and b are used to

More information

Chapter 5 Multilevel Models

Chapter 5 Multilevel Models Chapter 5 Multlevel Models 5.1 Cross-sectonal multlevel models 5.1.1 Two-level models 5.1.2 Multple level models 5.1.3 Multple level modelng n other felds 5.2 Longtudnal multlevel models 5.2.1 Two-level

More information

Parametric fractional imputation for missing data analysis. Jae Kwang Kim Survey Working Group Seminar March 29, 2010

Parametric fractional imputation for missing data analysis. Jae Kwang Kim Survey Working Group Seminar March 29, 2010 Parametrc fractonal mputaton for mssng data analyss Jae Kwang Km Survey Workng Group Semnar March 29, 2010 1 Outlne Introducton Proposed method Fractonal mputaton Approxmaton Varance estmaton Multple mputaton

More information

NUMERICAL DIFFERENTIATION

NUMERICAL DIFFERENTIATION NUMERICAL DIFFERENTIATION 1 Introducton Dfferentaton s a method to compute the rate at whch a dependent output y changes wth respect to the change n the ndependent nput x. Ths rate of change s called the

More information

Robust Small Area Estimation Using a Mixture Model

Robust Small Area Estimation Using a Mixture Model Robust Small Area Estmaton Usng a Mxture Model Jule Gershunskaya U.S. Bureau of Labor Statstcs Partha Lahr JPSM, Unversty of Maryland, College Park, USA ISI Meetng, Dubln, August 23, 2011 Parameter of

More information

Module 3 LOSSY IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur

Module 3 LOSSY IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur Module 3 LOSSY IMAGE COMPRESSION SYSTEMS Verson ECE IIT, Kharagpur Lesson 6 Theory of Quantzaton Verson ECE IIT, Kharagpur Instructonal Objectves At the end of ths lesson, the students should be able to:

More information

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers Psychology 282 Lecture #24 Outlne Regresson Dagnostcs: Outlers In an earler lecture we studed the statstcal assumptons underlyng the regresson model, ncludng the followng ponts: Formal statement of assumptons.

More information

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X Statstcs 1: Probablty Theory II 37 3 EPECTATION OF SEVERAL RANDOM VARIABLES As n Probablty Theory I, the nterest n most stuatons les not on the actual dstrbuton of a random vector, but rather on a number

More information

Small Area Estimation: Methods, Applications and New Developments. J. N. K. Rao. Carleton University, Ottawa, Canada

Small Area Estimation: Methods, Applications and New Developments. J. N. K. Rao. Carleton University, Ottawa, Canada Small Area Estmaton: Methods, Applcatons and New Developments J. N. K. Rao Carleton Unversty, Ottawa, Canada Paper presented at the NTTS 2013 Conference, Brussels, March 2013 1 Introducton Censuses and

More information

Comparison of Regression Lines

Comparison of Regression Lines STATGRAPHICS Rev. 9/13/2013 Comparson of Regresson Lnes Summary... 1 Data Input... 3 Analyss Summary... 4 Plot of Ftted Model... 6 Condtonal Sums of Squares... 6 Analyss Optons... 7 Forecasts... 8 Confdence

More information

4.3 Poisson Regression

4.3 Poisson Regression of teratvely reweghted least squares regressons (the IRLS algorthm). We do wthout gvng further detals, but nstead focus on the practcal applcaton. > glm(survval~log(weght)+age, famly="bnomal", data=baby)

More information

Small Area Estimation Under Spatial Nonstationarity

Small Area Estimation Under Spatial Nonstationarity Small Area Estmaton Under Spatal Nonstatonarty Hukum Chandra Indan Agrcultural Statstcs Research Insttute, New Delh Ncola Salvat Unversty of Psa Ray Chambers Unversty of Wollongong Nkos Tzavds Unversty

More information

Efficient nonresponse weighting adjustment using estimated response probability

Efficient nonresponse weighting adjustment using estimated response probability Effcent nonresponse weghtng adjustment usng estmated response probablty Jae Kwang Km Department of Appled Statstcs, Yonse Unversty, Seoul, 120-749, KOREA Key Words: Regresson estmator, Propensty score,

More information

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U)

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U) Econ 413 Exam 13 H ANSWERS Settet er nndelt 9 deloppgaver, A,B,C, som alle anbefales å telle lkt for å gøre det ltt lettere å stå. Svar er gtt . Unfortunately, there s a prntng error n the hnt of

More information

Small Area Estimation: Methods and Applications. J. N. K. Rao. Carleton University, Ottawa, Canada

Small Area Estimation: Methods and Applications. J. N. K. Rao. Carleton University, Ottawa, Canada Small Area Estmaton: Methods and Applcatons J. N. K. Rao Carleton Unversty, Ottawa, Canada Paper presented at the Semnar Applcatons of Small Area Estmaton Technques n the Socal Scences, October 3 5, 2012,

More information

Computing MLE Bias Empirically

Computing MLE Bias Empirically Computng MLE Bas Emprcally Kar Wa Lm Australan atonal Unversty January 3, 27 Abstract Ths note studes the bas arses from the MLE estmate of the rate parameter and the mean parameter of an exponental dstrbuton.

More information

Small Area Estimation for Business Surveys

Small Area Estimation for Business Surveys ASA Secton on Survey Research Methods Small Area Estmaton for Busness Surveys Hukum Chandra Southampton Statstcal Scences Research Insttute, Unversty of Southampton Hghfeld, Southampton-SO17 1BJ, U.K.

More information

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands Content. Inference on Regresson Parameters a. Fndng Mean, s.d and covarance amongst estmates.. Confdence Intervals and Workng Hotellng Bands 3. Cochran s Theorem 4. General Lnear Testng 5. Measures of

More information

Negative Binomial Regression

Negative Binomial Regression STATGRAPHICS Rev. 9/16/2013 Negatve Bnomal Regresson Summary... 1 Data Input... 3 Statstcal Model... 3 Analyss Summary... 4 Analyss Optons... 7 Plot of Ftted Model... 8 Observed Versus Predcted... 10 Predctons...

More information

2016 Wiley. Study Session 2: Ethical and Professional Standards Application

2016 Wiley. Study Session 2: Ethical and Professional Standards Application 6 Wley Study Sesson : Ethcal and Professonal Standards Applcaton LESSON : CORRECTION ANALYSIS Readng 9: Correlaton and Regresson LOS 9a: Calculate and nterpret a sample covarance and a sample correlaton

More information

An R implementation of bootstrap procedures for mixed models

An R implementation of bootstrap procedures for mixed models The R User Conference 2009 July 8-10, Agrocampus-Ouest, Rennes, France An R mplementaton of bootstrap procedures for mxed models José A. Sánchez-Espgares Unverstat Poltècnca de Catalunya Jord Ocaña Unverstat

More information

Lecture 6: Introduction to Linear Regression

Lecture 6: Introduction to Linear Regression Lecture 6: Introducton to Lnear Regresson An Manchakul amancha@jhsph.edu 24 Aprl 27 Lnear regresson: man dea Lnear regresson can be used to study an outcome as a lnear functon of a predctor Example: 6

More information

ECONOMETRICS II (ECO 2401S) University of Toronto. Department of Economics. Winter 2017 Instructor: Victor Aguirregabiria

ECONOMETRICS II (ECO 2401S) University of Toronto. Department of Economics. Winter 2017 Instructor: Victor Aguirregabiria ECOOMETRICS II ECO 40S Unversty of Toronto Department of Economcs Wnter 07 Instructor: Vctor Agurregabra SOLUTIO TO FIAL EXAM Tuesday, Aprl 8, 07 From :00pm-5:00pm 3 hours ISTRUCTIOS: - Ths s a closed-book

More information

Chapter 9: Statistical Inference and the Relationship between Two Variables

Chapter 9: Statistical Inference and the Relationship between Two Variables Chapter 9: Statstcal Inference and the Relatonshp between Two Varables Key Words The Regresson Model The Sample Regresson Equaton The Pearson Correlaton Coeffcent Learnng Outcomes After studyng ths chapter,

More information

Statistics II Final Exam 26/6/18

Statistics II Final Exam 26/6/18 Statstcs II Fnal Exam 26/6/18 Academc Year 2017/18 Solutons Exam duraton: 2 h 30 mn 1. (3 ponts) A town hall s conductng a study to determne the amount of leftover food produced by the restaurants n the

More information

Statistics for Economics & Business

Statistics for Economics & Business Statstcs for Economcs & Busness Smple Lnear Regresson Learnng Objectves In ths chapter, you learn: How to use regresson analyss to predct the value of a dependent varable based on an ndependent varable

More information

ANOMALIES OF THE MAGNITUDE OF THE BIAS OF THE MAXIMUM LIKELIHOOD ESTIMATOR OF THE REGRESSION SLOPE

ANOMALIES OF THE MAGNITUDE OF THE BIAS OF THE MAXIMUM LIKELIHOOD ESTIMATOR OF THE REGRESSION SLOPE P a g e ANOMALIES OF THE MAGNITUDE OF THE BIAS OF THE MAXIMUM LIKELIHOOD ESTIMATOR OF THE REGRESSION SLOPE Darmud O Drscoll ¹, Donald E. Ramrez ² ¹ Head of Department of Mathematcs and Computer Studes

More information

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation Statstcs for Managers Usng Mcrosoft Excel/SPSS Chapter 13 The Smple Lnear Regresson Model and Correlaton 1999 Prentce-Hall, Inc. Chap. 13-1 Chapter Topcs Types of Regresson Models Determnng the Smple Lnear

More information

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6 Department of Quanttatve Methods & Informaton Systems Tme Seres and Ther Components QMIS 30 Chapter 6 Fall 00 Dr. Mohammad Zanal These sldes were modfed from ther orgnal source for educatonal purpose only.

More information

Composite Hypotheses testing

Composite Hypotheses testing Composte ypotheses testng In many hypothess testng problems there are many possble dstrbutons that can occur under each of the hypotheses. The output of the source s a set of parameters (ponts n a parameter

More information

The Geometry of Logit and Probit

The Geometry of Logit and Probit The Geometry of Logt and Probt Ths short note s meant as a supplement to Chapters and 3 of Spatal Models of Parlamentary Votng and the notaton and reference to fgures n the text below s to those two chapters.

More information

Population Design in Nonlinear Mixed Effects Multiple Response Models: extension of PFIM and evaluation by simulation with NONMEM and MONOLIX

Population Design in Nonlinear Mixed Effects Multiple Response Models: extension of PFIM and evaluation by simulation with NONMEM and MONOLIX Populaton Desgn n Nonlnear Mxed Effects Multple Response Models: extenson of PFIM and evaluaton by smulaton wth NONMEM and MONOLIX May 4th 007 Carolne Bazzol, Sylve Retout, France Mentré Inserm U738 Unversty

More information

Chapter 5. Solution of System of Linear Equations. Module No. 6. Solution of Inconsistent and Ill Conditioned Systems

Chapter 5. Solution of System of Linear Equations. Module No. 6. Solution of Inconsistent and Ill Conditioned Systems Numercal Analyss by Dr. Anta Pal Assstant Professor Department of Mathematcs Natonal Insttute of Technology Durgapur Durgapur-713209 emal: anta.bue@gmal.com 1 . Chapter 5 Soluton of System of Lnear Equatons

More information

USE OF DOUBLE SAMPLING SCHEME IN ESTIMATING THE MEAN OF STRATIFIED POPULATION UNDER NON-RESPONSE

USE OF DOUBLE SAMPLING SCHEME IN ESTIMATING THE MEAN OF STRATIFIED POPULATION UNDER NON-RESPONSE STATISTICA, anno LXXV, n. 4, 015 USE OF DOUBLE SAMPLING SCHEME IN ESTIMATING THE MEAN OF STRATIFIED POPULATION UNDER NON-RESPONSE Manoj K. Chaudhary 1 Department of Statstcs, Banaras Hndu Unversty, Varanas,

More information

/ n ) are compared. The logic is: if the two

/ n ) are compared. The logic is: if the two STAT C141, Sprng 2005 Lecture 13 Two sample tests One sample tests: examples of goodness of ft tests, where we are testng whether our data supports predctons. Two sample tests: called as tests of ndependence

More information

Computation of Higher Order Moments from Two Multinomial Overdispersion Likelihood Models

Computation of Higher Order Moments from Two Multinomial Overdispersion Likelihood Models Computaton of Hgher Order Moments from Two Multnomal Overdsperson Lkelhood Models BY J. T. NEWCOMER, N. K. NEERCHAL Department of Mathematcs and Statstcs, Unversty of Maryland, Baltmore County, Baltmore,

More information

On an Extension of Stochastic Approximation EM Algorithm for Incomplete Data Problems. Vahid Tadayon 1

On an Extension of Stochastic Approximation EM Algorithm for Incomplete Data Problems. Vahid Tadayon 1 On an Extenson of Stochastc Approxmaton EM Algorthm for Incomplete Data Problems Vahd Tadayon Abstract: The Stochastc Approxmaton EM (SAEM algorthm, a varant stochastc approxmaton of EM, s a versatle tool

More information

Uncertainty as the Overlap of Alternate Conditional Distributions

Uncertainty as the Overlap of Alternate Conditional Distributions Uncertanty as the Overlap of Alternate Condtonal Dstrbutons Olena Babak and Clayton V. Deutsch Centre for Computatonal Geostatstcs Department of Cvl & Envronmental Engneerng Unversty of Alberta An mportant

More information

Heteroscedastic Nested Error Regression Models with Variance Functions

Heteroscedastic Nested Error Regression Models with Variance Functions Heteroscedastc Nested Error Regresson Models wth Varance Functons Shonosuke Sugasawa and Tatsuya Kubokawa Abstract The artcle consders a nested error regresson model wth heteroscedastc varance functons

More information

9. Binary Dependent Variables

9. Binary Dependent Variables 9. Bnar Dependent Varables 9. Homogeneous models Log, prob models Inference Tax preparers 9.2 Random effects models 9.3 Fxed effects models 9.4 Margnal models and GEE Appendx 9A - Lkelhood calculatons

More information

This column is a continuation of our previous column

This column is a continuation of our previous column Comparson of Goodness of Ft Statstcs for Lnear Regresson, Part II The authors contnue ther dscusson of the correlaton coeffcent n developng a calbraton for quanttatve analyss. Jerome Workman Jr. and Howard

More information

Non-Mixture Cure Model for Interval Censored Data: Simulation Study ABSTRACT

Non-Mixture Cure Model for Interval Censored Data: Simulation Study ABSTRACT Malaysan Journal of Mathematcal Scences 8(S): 37-44 (2014) Specal Issue: Internatonal Conference on Mathematcal Scences and Statstcs 2013 (ICMSS2013) MALAYSIAN JOURNAL OF MATHEMATICAL SCIENCES Journal

More information

Basically, if you have a dummy dependent variable you will be estimating a probability.

Basically, if you have a dummy dependent variable you will be estimating a probability. ECON 497: Lecture Notes 13 Page 1 of 1 Metropoltan State Unversty ECON 497: Research and Forecastng Lecture Notes 13 Dummy Dependent Varable Technques Studenmund Chapter 13 Bascally, f you have a dummy

More information

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA 4 Analyss of Varance (ANOVA) 5 ANOVA 51 Introducton ANOVA ANOVA s a way to estmate and test the means of multple populatons We wll start wth one-way ANOVA If the populatons ncluded n the study are selected

More information

Bayesian predictive Configural Frequency Analysis

Bayesian predictive Configural Frequency Analysis Psychologcal Test and Assessment Modelng, Volume 54, 2012 (3), 285-292 Bayesan predctve Confgural Frequency Analyss Eduardo Gutérrez-Peña 1 Abstract Confgural Frequency Analyss s a method for cell-wse

More information

x = , so that calculated

x = , so that calculated Stat 4, secton Sngle Factor ANOVA notes by Tm Plachowsk n chapter 8 we conducted hypothess tests n whch we compared a sngle sample s mean or proporton to some hypotheszed value Chapter 9 expanded ths to

More information

Limited Dependent Variables

Limited Dependent Variables Lmted Dependent Varables. What f the left-hand sde varable s not a contnuous thng spread from mnus nfnty to plus nfnty? That s, gven a model = f (, β, ε, where a. s bounded below at zero, such as wages

More information

Durban Watson for Testing the Lack-of-Fit of Polynomial Regression Models without Replications

Durban Watson for Testing the Lack-of-Fit of Polynomial Regression Models without Replications Durban Watson for Testng the Lack-of-Ft of Polynomal Regresson Models wthout Replcatons Ruba A. Alyaf, Maha A. Omar, Abdullah A. Al-Shha ralyaf@ksu.edu.sa, maomar@ksu.edu.sa, aalshha@ksu.edu.sa Department

More information

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity LINEAR REGRESSION ANALYSIS MODULE IX Lecture - 30 Multcollnearty Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur 2 Remedes for multcollnearty Varous technques have

More information

Multivariate Ratio Estimator of the Population Total under Stratified Random Sampling

Multivariate Ratio Estimator of the Population Total under Stratified Random Sampling Open Journal of Statstcs, 0,, 300-304 ttp://dx.do.org/0.436/ojs.0.3036 Publsed Onlne July 0 (ttp://www.scrp.org/journal/ojs) Multvarate Rato Estmator of te Populaton Total under Stratfed Random Samplng

More information

Stat260: Bayesian Modeling and Inference Lecture Date: February 22, Reference Priors

Stat260: Bayesian Modeling and Inference Lecture Date: February 22, Reference Priors Stat60: Bayesan Modelng and Inference Lecture Date: February, 00 Reference Prors Lecturer: Mchael I. Jordan Scrbe: Steven Troxler and Wayne Lee In ths lecture, we assume that θ R; n hgher-dmensons, reference

More information

Polynomial Regression Models

Polynomial Regression Models LINEAR REGRESSION ANALYSIS MODULE XII Lecture - 6 Polynomal Regresson Models Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur Test of sgnfcance To test the sgnfcance

More information

Non-parametric bootstrap mean squared error estimation for M-quantile estimates of small area means, quantiles and poverty indicators *

Non-parametric bootstrap mean squared error estimation for M-quantile estimates of small area means, quantiles and poverty indicators * Non-parametrc bootstrap mean squared error maton for M-quantle mates of small area means quantles and poverty ndcators * Stefano Marchett 1 Monca Prates 2 Nos zavds 3 1 Unversty of Psa e-mal: stefano.marchett@for.unp.t

More information

Appendix B. The Finite Difference Scheme

Appendix B. The Finite Difference Scheme 140 APPENDIXES Appendx B. The Fnte Dfference Scheme In ths appendx we present numercal technques whch are used to approxmate solutons of system 3.1 3.3. A comprehensve treatment of theoretcal and mplementaton

More information

Chapter 8 Indicator Variables

Chapter 8 Indicator Variables Chapter 8 Indcator Varables In general, e explanatory varables n any regresson analyss are assumed to be quanttatve n nature. For example, e varables lke temperature, dstance, age etc. are quanttatve n

More information

Kernel Methods and SVMs Extension

Kernel Methods and SVMs Extension Kernel Methods and SVMs Extenson The purpose of ths document s to revew materal covered n Machne Learnng 1 Supervsed Learnng regardng support vector machnes (SVMs). Ths document also provdes a general

More information

One-sided finite-difference approximations suitable for use with Richardson extrapolation

One-sided finite-difference approximations suitable for use with Richardson extrapolation Journal of Computatonal Physcs 219 (2006) 13 20 Short note One-sded fnte-dfference approxmatons sutable for use wth Rchardson extrapolaton Kumar Rahul, S.N. Bhattacharyya * Department of Mechancal Engneerng,

More information

Here is the rationale: If X and y have a strong positive relationship to one another, then ( x x) will tend to be positive when ( y y)

Here is the rationale: If X and y have a strong positive relationship to one another, then ( x x) will tend to be positive when ( y y) Secton 1.5 Correlaton In the prevous sectons, we looked at regresson and the value r was a measurement of how much of the varaton n y can be attrbuted to the lnear relatonshp between y and x. In ths secton,

More information

First Year Examination Department of Statistics, University of Florida

First Year Examination Department of Statistics, University of Florida Frst Year Examnaton Department of Statstcs, Unversty of Florda May 7, 010, 8:00 am - 1:00 noon Instructons: 1. You have four hours to answer questons n ths examnaton.. You must show your work to receve

More information

Properties of Least Squares

Properties of Least Squares Week 3 3.1 Smple Lnear Regresson Model 3. Propertes of Least Squares Estmators Y Y β 1 + β X + u weekly famly expendtures X weekly famly ncome For a gven level of x, the expected level of food expendtures

More information

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction ECONOMICS 5* -- NOTE (Summary) ECON 5* -- NOTE The Multple Classcal Lnear Regresson Model (CLRM): Specfcaton and Assumptons. Introducton CLRM stands for the Classcal Lnear Regresson Model. The CLRM s also

More information

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analyss of Varance and Desgn of Experment-I MODULE VIII LECTURE - 34 ANALYSIS OF VARIANCE IN RANDOM-EFFECTS MODEL AND MIXED-EFFECTS EFFECTS MODEL Dr Shalabh Department of Mathematcs and Statstcs Indan

More information

ASYMPTOTIC PROPERTIES OF ESTIMATES FOR THE PARAMETERS IN THE LOGISTIC REGRESSION MODEL

ASYMPTOTIC PROPERTIES OF ESTIMATES FOR THE PARAMETERS IN THE LOGISTIC REGRESSION MODEL Asymptotc Asan-Afrcan Propertes Journal of Estmates Economcs for and the Econometrcs, Parameters n Vol. the Logstc, No., Regresson 20: 65-74 Model 65 ASYMPTOTIC PROPERTIES OF ESTIMATES FOR THE PARAMETERS

More information

Using T.O.M to Estimate Parameter of distributions that have not Single Exponential Family

Using T.O.M to Estimate Parameter of distributions that have not Single Exponential Family IOSR Journal of Mathematcs IOSR-JM) ISSN: 2278-5728. Volume 3, Issue 3 Sep-Oct. 202), PP 44-48 www.osrjournals.org Usng T.O.M to Estmate Parameter of dstrbutons that have not Sngle Exponental Famly Jubran

More information

Chapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise.

Chapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise. Chapter - The Smple Lnear Regresson Model The lnear regresson equaton s: where y + = β + β e for =,..., y and are observable varables e s a random error How can an estmaton rule be constructed for the

More information

Statistics for Business and Economics

Statistics for Business and Economics Statstcs for Busness and Economcs Chapter 11 Smple Regresson Copyrght 010 Pearson Educaton, Inc. Publshng as Prentce Hall Ch. 11-1 11.1 Overvew of Lnear Models n An equaton can be ft to show the best lnear

More information

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics ECOOMICS 35*-A Md-Term Exam -- Fall Term 000 Page of 3 pages QUEE'S UIVERSITY AT KIGSTO Department of Economcs ECOOMICS 35* - Secton A Introductory Econometrcs Fall Term 000 MID-TERM EAM ASWERS MG Abbott

More information

Generalized Linear Methods

Generalized Linear Methods Generalzed Lnear Methods 1 Introducton In the Ensemble Methods the general dea s that usng a combnaton of several weak learner one could make a better learner. More formally, assume that we have a set

More information

Notes on Frequency Estimation in Data Streams

Notes on Frequency Estimation in Data Streams Notes on Frequency Estmaton n Data Streams In (one of) the data streamng model(s), the data s a sequence of arrvals a 1, a 2,..., a m of the form a j = (, v) where s the dentty of the tem and belongs to

More information

A Robust Method for Calculating the Correlation Coefficient

A Robust Method for Calculating the Correlation Coefficient A Robust Method for Calculatng the Correlaton Coeffcent E.B. Nven and C. V. Deutsch Relatonshps between prmary and secondary data are frequently quantfed usng the correlaton coeffcent; however, the tradtonal

More information

ANOVA. The Observations y ij

ANOVA. The Observations y ij ANOVA Stands for ANalyss Of VArance But t s a test of dfferences n means The dea: The Observatons y j Treatment group = 1 = 2 = k y 11 y 21 y k,1 y 12 y 22 y k,2 y 1, n1 y 2, n2 y k, nk means: m 1 m 2

More information

Parameters Estimation of the Modified Weibull Distribution Based on Type I Censored Samples

Parameters Estimation of the Modified Weibull Distribution Based on Type I Censored Samples Appled Mathematcal Scences, Vol. 5, 011, no. 59, 899-917 Parameters Estmaton of the Modfed Webull Dstrbuton Based on Type I Censored Samples Soufane Gasm École Supereure des Scences et Technques de Tuns

More information

Parametric fractional imputation for missing data analysis

Parametric fractional imputation for missing data analysis Secton on Survey Research Methods JSM 2008 Parametrc fractonal mputaton for mssng data analyss Jae Kwang Km Wayne Fuller Abstract Under a parametrc model for mssng data, the EM algorthm s a popular tool

More information

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analyss of Varance and Desgn of Experment-I MODULE VII LECTURE - 3 ANALYSIS OF COVARIANCE Dr Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur Any scentfc experment s performed

More information

Department of Statistics University of Toronto STA305H1S / 1004 HS Design and Analysis of Experiments Term Test - Winter Solution

Department of Statistics University of Toronto STA305H1S / 1004 HS Design and Analysis of Experiments Term Test - Winter Solution Department of Statstcs Unversty of Toronto STA35HS / HS Desgn and Analyss of Experments Term Test - Wnter - Soluton February, Last Name: Frst Name: Student Number: Instructons: Tme: hours. Ads: a non-programmable

More information

The Two-scale Finite Element Errors Analysis for One Class of Thermoelastic Problem in Periodic Composites

The Two-scale Finite Element Errors Analysis for One Class of Thermoelastic Problem in Periodic Composites 7 Asa-Pacfc Engneerng Technology Conference (APETC 7) ISBN: 978--6595-443- The Two-scale Fnte Element Errors Analyss for One Class of Thermoelastc Problem n Perodc Compostes Xaoun Deng Mngxang Deng ABSTRACT

More information

Joint Statistical Meetings - Biopharmaceutical Section

Joint Statistical Meetings - Biopharmaceutical Section Iteratve Ch-Square Test for Equvalence of Multple Treatment Groups Te-Hua Ng*, U.S. Food and Drug Admnstraton 1401 Rockvlle Pke, #200S, HFM-217, Rockvlle, MD 20852-1448 Key Words: Equvalence Testng; Actve

More information

The Ordinary Least Squares (OLS) Estimator

The Ordinary Least Squares (OLS) Estimator The Ordnary Least Squares (OLS) Estmator 1 Regresson Analyss Regresson Analyss: a statstcal technque for nvestgatng and modelng the relatonshp between varables. Applcatons: Engneerng, the physcal and chemcal

More information

Basic Business Statistics, 10/e

Basic Business Statistics, 10/e Chapter 13 13-1 Basc Busness Statstcs 11 th Edton Chapter 13 Smple Lnear Regresson Basc Busness Statstcs, 11e 009 Prentce-Hall, Inc. Chap 13-1 Learnng Objectves In ths chapter, you learn: How to use regresson

More information

Inductance Calculation for Conductors of Arbitrary Shape

Inductance Calculation for Conductors of Arbitrary Shape CRYO/02/028 Aprl 5, 2002 Inductance Calculaton for Conductors of Arbtrary Shape L. Bottura Dstrbuton: Internal Summary In ths note we descrbe a method for the numercal calculaton of nductances among conductors

More information

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition)

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition) Count Data Models See Book Chapter 11 2 nd Edton (Chapter 10 1 st Edton) Count data consst of non-negatve nteger values Examples: number of drver route changes per week, the number of trp departure changes

More information

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore Sesson Outlne Introducton to classfcaton problems and dscrete choce models. Introducton to Logstcs Regresson. Logstc functon and Logt functon. Maxmum Lkelhood Estmator (MLE) for estmaton of LR parameters.

More information

Lecture 16 Statistical Analysis in Biomaterials Research (Part II)

Lecture 16 Statistical Analysis in Biomaterials Research (Part II) 3.051J/0.340J 1 Lecture 16 Statstcal Analyss n Bomaterals Research (Part II) C. F Dstrbuton Allows comparson of varablty of behavor between populatons usng test of hypothess: σ x = σ x amed for Brtsh statstcan

More information

Simulated Power of the Discrete Cramér-von Mises Goodness-of-Fit Tests

Simulated Power of the Discrete Cramér-von Mises Goodness-of-Fit Tests Smulated of the Cramér-von Mses Goodness-of-Ft Tests Steele, M., Chaselng, J. and 3 Hurst, C. School of Mathematcal and Physcal Scences, James Cook Unversty, Australan School of Envronmental Studes, Grffth

More information

STAT 3008 Applied Regression Analysis

STAT 3008 Applied Regression Analysis STAT 3008 Appled Regresson Analyss Tutoral : Smple Lnear Regresson LAI Chun He Department of Statstcs, The Chnese Unversty of Hong Kong 1 Model Assumpton To quantfy the relatonshp between two factors,

More information

A Comparative Study for Estimation Parameters in Panel Data Model

A Comparative Study for Estimation Parameters in Panel Data Model A Comparatve Study for Estmaton Parameters n Panel Data Model Ahmed H. Youssef and Mohamed R. Abonazel hs paper examnes the panel data models when the regresson coeffcents are fxed random and mxed and

More information

Chapter 4: Regression With One Regressor

Chapter 4: Regression With One Regressor Chapter 4: Regresson Wth One Regressor Copyrght 2011 Pearson Addson-Wesley. All rghts reserved. 1-1 Outlne 1. Fttng a lne to data 2. The ordnary least squares (OLS) lne/regresson 3. Measures of ft 4. Populaton

More information

MMA and GCMMA two methods for nonlinear optimization

MMA and GCMMA two methods for nonlinear optimization MMA and GCMMA two methods for nonlnear optmzaton Krster Svanberg Optmzaton and Systems Theory, KTH, Stockholm, Sweden. krlle@math.kth.se Ths note descrbes the algorthms used n the author s 2007 mplementatons

More information

More metrics on cartesian products

More metrics on cartesian products More metrcs on cartesan products If (X, d ) are metrc spaces for 1 n, then n Secton II4 of the lecture notes we defned three metrcs on X whose underlyng topologes are the product topology The purpose of

More information

A New Method for Estimating Overdispersion. David Fletcher and Peter Green Department of Mathematics and Statistics

A New Method for Estimating Overdispersion. David Fletcher and Peter Green Department of Mathematics and Statistics A New Method for Estmatng Overdsperson Davd Fletcher and Peter Green Department of Mathematcs and Statstcs Byron Morgan Insttute of Mathematcs, Statstcs and Actuaral Scence Unversty of Kent, England Overvew

More information