M-Quantile Regression for Binary Data with Application to Small Area Estimation

Size: px
Start display at page:

Download "M-Quantile Regression for Binary Data with Application to Small Area Estimation"

Transcription

1 University of Wollongong Research Online Centre for Statistical & Survey Methoology Working Paper Series Faculty of Engineering an Information Sciences 2012 M-Quantile Regression for Binary Data with Application to Small Area Estimation Ray Chambers University of Wollongong, Nicola Salvati University of Pisa Nikos Tzaviis University of Southampton Recommene Citation Chambers, Ray; Salvati, Nicola; an Tzaviis, Nikos, M-Quantile Regression for Binary Data with Application to Small Area Estimation, Centre for Statistical an Survey Methoology, University of Wollongong, Working Paper 12-12, 2012, Research Online is the open access institutional repository for the University of Wollongong. For further information contact the UOW Library:

2 Centre for Statistical an Survey Methoology The University of Wollongong Working Paper M-Quantile Regression for Binary Data with Application to Small Area Estimation Ray Chambers, Nicola Salvati an Nikos Tzaviis Copyright 2008 by the Centre for Statistical & Survey Methoology, UOW. Work in progress, no part of this paper may be reprouce without permission from the Centre. Centre for Statistical & Survey Methoology, University of Wollongong, Wollongong NSW Phone , Fax anica@uow.eu.au

3 Biometrika (2012), xx, x, pp C 2007 Biometrika Trust Printe in Great Britain M-Quantile Regression for Binary Data with Application to Small Area Estimation BY RAY CHAMBERS Centre for Statistical an Survey Methoology, University of Wollongong, New South Wales 2522, Australia ray@uow.eu.au NICOLA SALVATI Dipartimento i Statistica e Matematica Applicata all Economia, Università i Pisa, Via Riolfi, 10 - I56124 Pisa, Italy, salvati@ec.unipi.it NIKOS TZAVIDIS Southampton Statistical Sciences Research Institute, University of Southampton, Southampton SO17 1BJ, UK, n.tzaviis@soton.ac.uk SUMMARY M-quantile regression moels are a robust an flexible alternative to ranom effects moels, particularly in small area estimation. However quantiles, an more generally M-quantiles, are only uniquely efine for continuous variables. In this paper we exten the M-quantile regression approach to binary ata, an more generally to count ata. This approach is then applie to estimation of a small area proportion, where a popular alternative approach is to use a plugin version of the Empirical Best (EB) preictor base on a generalise linear mixe moel for the unerlying binary variable. Results from both moel-base an esign-base simulations comparing the binary M-quantile an the plug-in EB preictors emonstrate the usefulness of the M-quantile approach in this case. The paper conclues with two illustrative applications. The first aresses estimation of the number of unemploye people age 16 an above resient in the Unitary Authorities an Local Authority Districts of Great Britain. The secon consiers estimation of the number of poor househols in each of the Local Labour Systems of the Tuscany region of Italy. Some key wors: Influence function; M-estimation; Bootstrap methos; Simulation experiments. 1. INTRODUCTION The increasing eman for reliable small area statistics has le to the evelopment of a number of efficient moel-base small area estimation (SAE) methos (Rao, 2003; Jiang & Lahiri, 2006). For example, the empirical best linear unbiase preictor (EBLUP) base on a linear mixe moel (LMM) is often recommene when the target of inference is the small area average of a continuous response variable (Battese et al., 1988; Prasa & Rao, 1990). However, using a mixe moel to characterise ifferences between small areas requires strong istributional assumptions, a formal specification of the ranom part of the moel an oes not easily allow for outlier robust inference. An alternative approach is to use M-quantile moels (Breckling & Chambers, 1988) to characterise these ifferences (Chambers & Tzaviis, 2006). Unlike traitional mixe moels,

4 2 M-quantile moels o not epen on strong istributional assumptions an automatically provie outlier robust inference. Unfortunately, many survey variables are categorical in nature an are therefore not suite to stanar SAE methos base on LMMs. One option in such cases is to aopt a Hierarchical Bayes approach (Malec et al., 1997; Nanram et al., 1999) or to use Empirical Bayes (MacGibbon & Tomberlin, 1989; Farrell et al., 1997). Alternatively, if a frequentist approach is preferre, one can follow Jiang & Lahiri (2001) who propose an empirical best preictor (EBP) for a binary response, or Jiang (2003) who extens these results to generalise linear mixe moels (GLMMs). However, application of these moels in SAE is not straightforwar, as computing the maximum likelihoo estimates of the fixe an ranom effects parameters of a GLMM can require evaluation of high imensional integrals. Furthermore, the EBP is moel epenent an estimates of GLMM parameters can be very sensitive to outliers or epartures from unerlying istributional assumptions. Large eviations from the expecte response as well as outlying points in the space of the explanatory variables are known to have a large influence on classical maximum likelihoo inference base on generalise linear moels (GLMs). This has lea to the evelopment of robust alternative methos for fitting these moels (Pregibon, 1982; Preisser & Qaqish, 1999; Cantoni & Ronchetti, 2001). But evelopment of corresponing robust methos for GLMM-base SAE is limite. Maiti (2001) escribes a Hierarchical Bayes approach to fitting a GLMM base on an outlier-robust normal mixture prior for the ranom effects, while Sinha (2004) proposes robust estimation of the fixe effects an the variance components of a GLMM, using a Metropolis algorithm to approximate the posterior istribution of the ranom effects. Similarly, Noh & Lee (2007) propose methos that allow robust inferences for GLMs, with extension to GLMMs. However, only Maiti (2001) applies these methos to SAE. To the best of our knowlege, there is no existing theory for robust SAE base on GLMMs in the frequentist framework. In this paper we therefore propose a new approach to SAE for iscrete ata base on M-quantile moelling. This allows straightforwar extension of the existing M-quantile approach for continuous ata to the case where the response is binary or, more generally, a count. As with M-quantile moelling of a continuous response (Chambers & Tzaviis, 2006) ranom effects are avoie an between area variation in the response is characterise by variation in area-specific values of quantile-like coefficients. Outlier-robust inference is also automatic in case of both misclassification error an measurement error. In the next Section we summarise estimation of a small area proportion base on a GLMM. In Section 3 we then exten the M-quantile regression approach to binary ata an, more generally, to count ata. In Section 4 we buil on the extension to binary ata to efine the corresponing M-quantile coefficients of the sample units. As with the continuous case, these coefficients capture small area effects in the ata, leaing to an M-quantile estimator of a small area proportion. In the same Section, we also propose three estimators of the mean square error (MSE) of this estimator: an analytical estimator base on first orer approximations to the variances of solutions to estimating equations, an two bootstrap methos that exten ieas set out in Tzaviis et al. (2010) an Chambers & Chanra (2012). In Section 5, we present results from moel-base an esign-base simulation stuies aime at assessing the performance of the ifferent small area preictors consiere in this paper. In Section 6 we use the M-quantile approach escribe in this paper to estimate: (1) the number of unemploye people age 16 an above resient in each of 406 Unitary Authorities an Local Authority Districts (UALAD s) of the UK; an (2) the number of poor househols in each of the 57 Local Labour Systems (LLS s) of the Tuscany region of Italy. Finally, in Section 7 we conclue the paper with a iscussion of our results an outstaning research issues.

5 2. SMALL AREA ESTIMATION BASED ON GENERALISED LINEAR MIXED MODELS Let U enote a finite population of size N which can be partitione into D omains or small areas, with U enoting small area. The small area population sizes N ; = 1,..., D are assume known. Let y j be the value of the variable of interest (typically a iscrete or a categorical variable) for unit j in area, an let x j enote a p 1 vector of unit level covariates (incluing an intercept). It is assume that the values of x j are known for all units in the population, as are the values z of a q 1 vector of area level covariates. The aim is to use the sample values of y j an the population values of x j an z to infer the values θ ; = 1,..., D of a small area characteristic of interest. To save notation, in what follows we use E s to enote the expectation conitional on this information. It is well known that the minimum mean square error preictor of θ is then E s [θ ]. In many cases θ = N 1 j U f(y j ) where f is a known function. The minimum mean square error preictor of θ is then N 1 { j s f(y j ) + j r E s [f(y j )]}, where s enotes the n sample units in small area an r enotes the N n remaining (i.e. non-sample) units in this area. In general, the conitional expectation E s [f(y j )] can be ifficult to evaluate, an so is replace by a suitable approximation. One such approximation is E[f(y j ) u ] where the u ; = 1,..., D are q-imensional inepenent ranom effects characterising the between-area ifferences in the istribution of y j given x j (see Rao, 2003; Jiang & Lahiri, 2006; González-Manteiga et al., 2007). This can be formalise by assuming a generalise linear mixe moel (GLMM) for µ j = E[y j u ] of the form g(µ j ) = η j = x T j β + zt u, (1) where g is a known invertible link function. When y j is binary-value a popular choice for g is the logistic link function an the iniviual y j values in area are taken to be inepenent Bernoulli outcomes with µ j = E[y j u ] = P (y j = 1 u ) = exp{η j }(1 + exp{η j }) 1 (2) an V ar[y j u ] = µ j (1 µ j ). The q-imensional vector u is generally assume to be inepenently istribute between areas an to follow a normal istribution with mean 0 an covariance matrix Σ u. The matrix Σ u is allowe to epen on parameters δ = (δ 1,..., δ K ), which are then referre to as the variance components of the GLMM, while the vector β in (1) is referre to as the fixe effects parameter of this moel. We focus on the situation where the target of inference is the small area proportion, ȳ = N 1 j U y j an the Bernoulli-Logistic GLMM (2) is assume. In this case the approximation to the minimum mean square error preictor of ȳ is N 1 [ j s y j + j r µ j ]. Since µ j epens on β an u, a further stage of approximation is require, where unknown parameters are replace by suitable estimates. This leas to the plug-in version of the Empirical Best Preictor (EBP) for the area proportion ȳ uner (2), ˆȳ EBP = N 1 { y j + j s j r ˆµ j 3 }, (3) where ˆµ j = exp{ˆη j }(1 + exp{ˆη j }) 1, ˆη j = x T j ˆβ + z T û, ˆβ is the vector of the estimate fixe effects an û enotes the vector of the estimate area-specific ranom effects. In the simplest case, q = 1 an z = 1, in which case the u are scalar small area effects. We refer to (3) in this case as a ranom intercepts EBP. For more etails on this preictor, incluing estimation of its MSE, see Saei & Chambers (2003), Jiang & Lahiri (2006) an González-Manteiga et al. (2007).

6 4 Unfortunately, espite their attractive properties as far as moelling non-normal response variables are concerne, application of GLMMs in small area estimation is not always straightforwar. In particular, the likelihoo function efine by a GLMM can involve high-imensional integrals which cannot be evaluate analytically (see Mc Culloch, 1994, 1997; Song et al., 2005). In such cases numerical approximations can be use, as for example in the R function glmer in the package lme4. Alternatively, estimation of the moel parameters in (2) can be carrie out using an iterative proceure that combines Maximum Penalize Quasi-Likelihoo (MPQL) estimation of β an u with REML estimation of δ. See Saei & Chambers (2003). In the empirical results reporte in Section 5, we use glmer for parameter estimation. 3. M-QUANTILE REGRESSION MODELS FOR BINARY DATA In this Section we evelop an extension of linear M-quantile regression to binary ata. Since M-quantile moels o not epen on how areas are specifie, we also rop the subscript in this Section. We start by summarising M-quantile regression for a continuous response M-quantile regression for a continuous response M-quantile regression (Breckling & Chambers, 1988) is a quantile-like generalisation of regression base on influence functions (M-regression). The M-quantile of orer q of a continuous ranom variable Y with istribution function F (Y ) is the value Q q that satisfies ( Y Qq ) ψ q F (Y ) = 0, (4) σ q where ψ q (t) = 2ψ(t){qI(t > 0) + (1 q)i(t 0)} an ψ is a user-efine influence function. Here σ q is a suitable measure of the scale of the ranom variable Y Q q. Note that when ψ(t) = t we obtain the expectile of orer q, which represents a quantile-like generalisation of the mean (Newey & Powell, 1987), an when ψ(t) = sgn(t) we obtain the stanar quantile of orer q (Koenker & Bassett, 1978). Breckling & Chambers (1988) efine a linear M-quantile regression moel as one where the ψ-base M-quantile Q q (X; ψ) of orer q of the conitional istribution of y given the vector of p auxiliary variables X satisfies Q q (X; ψ) = Xβ q. (5) Let (y j, x j ; j = 1,..., n) enote the available ata. For specifie q an continuous ψ, an estimate ˆβ q of β q is obtaine by solving the estimating equation n 1 n ψ q (r jq )x j = 0, (6) i=1 where r jq = y j Q q (x j ; ψ), ψ q (r jq ) = 2ψ(ˆσ q 1 r jq ){qi(r jq > 0) + (1 q)i(r jq 0)} an ˆσ q is a suitable robust estimator of scale, i.e. ˆσ q = meian r jq / In this paper we will always use the Huber Proposal 2 influence function ψ(t) = ti( c < t < c) + c sgn(t)i( t c). Provie the tuning constant c is boune away from zero, we can solve (6) using stanar iteratively re-weighte least squares (IRLS) M-quantile regression for a binary response: an estimating equation approach There is no obvious efinition of a quantile regression function when Y is binary since the orer q quantile of a binary variable is not unique. However, provie the unerlying influence

7 function ψ is continuous an monotone non-ecreasing, the M-quantiles of a binary variable o exist an are unique. This is easily seen by consiering the solution to (4) when Y is binary, with P (Y = 1) = p. In this case (4) becomes ( ) ( ) 1 Qq Qq pqψ = (1 p) (1 q) ψ. σ q It is easy to see that when ψ(t) = t an q = 0.5, the solution to this estimating equation is Q 0.5 = p, as shoul be the case. Furthermore, when both p an q lie strictly between 0 an 1, the preceing assumptions about ψ ensure that Q q also lies strictly between 0 an 1 an is monotone non-ecreasing in q for fixe p. It is also monotone non-ecreasing in p for fixe q uner the assumption of a fixe scale parameter. A proof of this is available from the authors on request. In the same way that we impose a linear specification (5) on Q q (X; ψ) in the continuous case, we can impose an appropriate continuous (in q) specification on Q q (X; ψ) in the binary case. In particular, it seems sensible to replace (5) by the linear logistic specification Q q (x j ; ψ) = exp(xt j β q) 1 + exp(x T j β q). (7) In orer to estimate the parameter β q we consier the extension to the M-quantile case of the Cantoni & Ronchetti (2001) approach to robust estimation of the parameters of a GLM. In particular these authors propose a robustifie version of the maximum likelihoo estimating equations for a GLM of the form: n { Ψ(β) := n 1 1 } ψ(r j )w(x j ) σ(µ j ) µ j a(β) = 0, (8) j=1 where r j = y j µ j σ(µ j ) are the Pearson resiuals, E[Y j] = µ j, V ar[y j ] = σ 2 (µ j ), µ i is the erivative of µ j with respect to β an a(β) = 1 n n j=1 E[ψ(r 1 j)]w(x j ) σ(µ j ) µ j ensures the Fisher consistency of the solution to (8). The boune influence function ψ is use to control outliers in y, whereas the weights w are use to ownweight the leverage points. When w(x j ) = 1 j Cantoni & Ronchetti (2001) refer to the solution to (8) as the Huber quasi-likelihoo estimator. When ψ is the ientity function, (8) reuces to the usual maximum likelihoo estimating equations for a GLM. Uner the M-quantile framework the estimating equations (8) can be re-written as n { Ψ(β q ) := n 1 1 } ψ q (r jq )w(x j ) σ(q q (x j ; ψ)) Q q(x j ; ψ) a(β q ) = 0, (9) j=1 where r jq = y j Q q(x j ;ψ) σ(q q(x j ;ψ)), σ(q q(x j ; ψ)) = [Q q (x j ; ψ)(1 Q q (x j ; ψ))] 1/2, Q q(x j ; ψ) = σ 2 (Q q (x j ; ψ))x j an a(β q ) is a bias correction term: { a(β q ) = ψ q ( 1 Qq (x j ; ψ) σ(q q (x j ; ψ)) σ q ) Q q (x j ; ψ) ψ q ( Qq (x j ; ψ) σ(q q (x j ; ψ)) ) } (1 Q q (x j ; ψ)). Setting w(x j ) = 1 j leas to a Huber quasi-likelihoo M-quantile estimator. An alternative choice is w(x j ) = 1 h j where h j is the jth iagonal element of the hat matrix H = X(X T X) 1 X T. This leas to a Mallows type M-quantile estimator. In either case, the estimating equations (9) can be solve numerically using a Fisher scoring proceure to obtain an estimate ˆβ q of β q. 5

8 6 Note that when q = 0.5, (9) reuces to (8). Moreover, (6) is a special case of (9) if the the linear link function Q q (x j ; ψ) = x T j β q is use an the tuning constant c in the Huber influence function tens to infinity (i.e. ψ is the ientity function). This estimating equation approach applies quite generally. For example, it can be use when Y is a Poisson ranom variable. In this case the most appealing specification for the M-quantiles of the conitional istribution of Y is log-linear. That is, Q q (x j ; ψ) = k exp(x j β q ), (10) where k is an offset term. The parameter β q can then be estimate by solving (9) with σ(q q (x j ; ψ)) = Q q (x j ; ψ) 1/2, Q q(x j ; ψ) = Q q (x j ; ψ)x j an { a(β q ) = 2w q (r jq ) cp (Y j i 2 + 1) cp (Y j i 1 ) + Q q(x j ; ψ) } σ(q q (x j ; ψ)) [P (Y j = i 1 ) P (Y j = i 2 )], with i 1 = Q q (x j ; ψ) cσ(q q (x j ; ψ)), i 2 = Q q (x j ; ψ) + cσ(q q (x j ; ψ)) an w q (r jq ) = [qi(r jq > 0) + (1 q)i(r jq 0)]. Assuming that ψ is a continuous monotone non-ecreasing function, we can write own a first orer sanwich-type approximation to the variance of (9) of the form Here V ar( ˆβ q ) = n 1{ E V ar{ψ(β q )} = n 1 n j=1 where [ { E ψq 2 yj Q q (x j ; ψ) }] = σ(q q (x j ; ψ)) [ Ψ(βq ) ]} 1 [{ V ar{ψ(βq )} β q E [ Ψ(βq ) β q ]} 1] T. (11) { [ { x j σ 2 (Q q (x j ; ψ))e ψq 2 yj Q q (x j ; ψ) }] } x T j σ(q q (x j ; ψ)) { ψ 2 q n a 2 j(β q ), j=1 ( 1 Qq (x j ; ψ) ) ( Q q (x j ; ψ) + ψ 2 Qq (x j ; ψ) q σ(q q (x j ; ψ)) σ(q q (x j ; ψ)) a 2 j (β q) is the square of the bias correction term for unit j, an the expectation E B(β q ) = n 1 n j=1 {{ σ(q q (x j ; ψ)) ψ q ( 1 Qq (x j ; ψ) σ(q q (x j ; ψ)) ) } (1 Q q (x j ; ψ)), [ ] Ψ(βq) β q is ) ( Qq (x j ; ψ) )} } + ψ q σ 2 (Q q (x j ; ψ))x j x T j. σ(q q (x j ; ψ)) An estimator of (11) is then efine by plugging in estimates of unknown quantities into these expressions. Denoting these plug-in estimates by a hat leas to a variance estimator for ˆβ q of the form V ar( ˆβ q ) = n 1 ˆB 1 ( ˆβ q ) V ar{ψ( ˆβ q )}[ ˆB 1 ( ˆβ q )] T. (12) R routines for estimation an inference using M-quantile regression moels with binary an Poisson ata are available from the authors M-quantile regression for a binary response: an econometric approach The estimating equation approach escribe in the previous subsection oes not strictly apply to stanar quantile regression for binary ata, an quantile regression for binary ata has been evelope in the econometric literature using a latent variable concept. However, as we now show, the two approaches are very closely relate, since the econometric approach can be shown to be equivalent to the solution of an estimating equation analogous

9 to (6). Since we confine ourselves to stanar quantiles in this subsection, we rop the influence function ψ from our notation an, following Koras (2006), we assume that the observe values y j represent the outcome of a continuously istribute latent variable. That is, the observe value y j is generate by an unobserve (latent) real value yj in the sense that y j = I(yj > 0). Let Q q(x j ) enote the conitional quantile function of this latent variable. Since y j = I(yj > 0) is a monotonic transformation of y j, the qth conitional quantile of y j shoul be the same transformation of the qth conitional quantile of yj. That is Q q (x j ) = I(Q q(x j ) > 0). Given that Q q(x) = Xβ q, it follows that Q q (x j ) = I(x T j β q > 0) an a maximum score estimator for β q, efine by ˆβ q = max b=1 n 1 n {y j (1 q)}i(x T j b > 0) (13) j=1 was suggeste by Manski (1975, 1985). Put I j (b) = I{y j < I(x T j b > 0)}. Since I{y j < I j (b)} = (1 y j )I j (b), we can, after some simplification, show that (13) reuces to ˆβ q = min b=1 n 1 n j=1 [ ] qi{y j I j (b)} + (1 q)i{y j < I j (b)} y j I j (b). (14) This is equivalent to fitting the quantile regression moel Q q (x j ) = I(x T j β q) to the observe y j, subject to the restriction β q = 1, or, in what amounts to the same thing, solving (6) with ψ(t) = sgn(t), subject to this restriction. Note that the restriction is necessary in orer to ensure that β q is ientifiable (since the scale of yj is unknown) an so (13) has a solution. A smoothe version of (13) has been propose by Horowitz (1992) as having better finite sample properties: ˆβ q = max b=1 n 1 n j=1 {y j (1 q)}f (σ 1 n x T j b), (15) where F is an appropriately chosen smooth cumulative istribution function efine on the entire real line an σ n 0 as n. The same simplifying steps as those leaing to (14) allow us to write (15) as ˆβ q = min b=1 n 1 n [ qi{y j F (σn 1 j=1 ] x T j b)} + (1 q)i{y j < F (σ 1 x T j b)} n y j F (σ 1 n since 0 < F (t) < 1 I{y j < F (σn 1 x T j b)} = 1 y j. That is, this smoothe loss function for regression quantiles for binary ata leas to essentially the same estimator as the logistic ( ) 1 formulation (7). In fact, if we put F (t) = exp{σn 1 x T j b} 1 + exp{σn 1 x T j b} then as σn 0, F (σ 1 n x T j b) exp{xt j b} (1 + exp{x T j b} ) 1, an we en up with the quantile analogue of the solution to (6), with Q q (x j ; ψ) efine by (7) an subject to the restriction β q = 1. 7 x T j b),

10 8 4. ESTIMATION OF SMALL AREA PROPORTIONS USING M-QUANTILE REGRESSION MODELS Much of the ata measure in business, labour force an living conitions surveys is binary in character, an there is a growing nee for reliable small area estimates base on these ata. From now on therefore we focus on using M-quantile regression moels for binary ata to estimate the small area averages (i.e. the small area proportions) efine by a binary outcome variable Point estimation The mixe moels employe in small area estimation use ranom area effects to account for between-area variation. These moels epen on istributional assumptions for the ranom part of the moel an o not easily allow for outlier robust inference. An alternative approach to moelling the variability associate with the conitional istribution of Y given X is via M- quantile regression moels. These moels o not epen on strong istributional assumptions or on a preefine small area geography, an outlier robust inference is automatic. A key concept when using an M-quantile regression moel for small area estimation is that of the M-quantile coefficient q j for a unit j s. When the variable Y is continuous this is the value q j such that y j = Q qj (x j ; ψ). Note that M-quantile coefficients are etermine at population level. If between area variation is an important part of overall population variability then units within an area will have similar M-quantile coefficients. Provie there are sample observations in area, estimate values of their M-quantile coefficients are efine by substituting ˆQ qj (x j ; ψ) in the preceing efinition. An area specific M-quantile coefficient, ˆθ can then be efine as the average value of the estimate M-quantile coefficients in area, otherwise we set ˆθ = 0.5. Following Chambers & Tzaviis (2006), the M-quantile preictor of the average ȳ in small area is ˆȳ MQ { = N 1 } y j + (x ˆQˆθ j ; ψ). (16) j s j r When Y is binary, an we moel its regression M-quantile of orer q via (7), the natural extension ( 1 of this approach is to put ˆQˆθ (x j ; ψ) = exp{x T j ˆβˆθ } 1 + exp{x T j }) ˆβˆθ in (16). However, this begs the question of how one efines ˆθ, since the estimating equation y j = ˆQ qj (x j ; ψ) for the estimate M-quantile coefficient of a continuous y j no longer has a solution when y j is binary. We therefore iscuss extension of the M-quantile coefficient concept to binary Y before we consier inference base on (16) M-quantile coefficients for binary ata A first step in efining M-quantile coefficients for binary ata is to note that any reasonable efinition of this concept has to associate a larger M-quantile coefficient with a value y j = 1 compare with a value y j = 0 at the same value of x j. The next thing to note is that the solution m j to the equation ˆQ mj (x j ; ψ) = 0.5 can be interprete as a measure of the propensity for y j = 1 to be observe relative to the propensity for y j = 0 to be observe at x j. A value m j < 0.5 inicates that y j = 1 is more likely than y j = 0 an vice versa. This leas to our first efinition of an estimate M-quantile coefficient when Y is binary. DEFINITION A: Given binary ata with fitte M-quantile regression function ˆQ q (x j ; ψ), the estimate M-quantile coefficient for observation j is q j = (m j + y j )/2, where ˆQ mj (x j ; ψ) = 0.5. Note that provie ˆQ q (x j ; ψ) is monotone in q at x j, the above efinition of an estimate M- quantile coefficient shoul be unique. In orer to unerstan the motivation for this efinition,

11 suppose that y j = 0 at x j an that there are many more Y = 0 than Y = 1 near x j. Then (a) y j = 0 is not unusual, an (b) we anticipate that the monotone increasing function f(q) = ˆQ q (x j ; ψ) will only excee half for values of q close to one. That is, m j will be close to one an so q j will be slightly less than half. On the other han, suppose y j = 1 but there are still many more Y = 0 than Y = 1 near x j. Then (a) y j = 1 is unusual, an (b) we still anticipate that the monotone increasing function f(q) = ˆQ q (x j ; ψ) will only excee half for values of q close to one. Here q j will be close to one. Conversely, suppose that there are many more observations with Y = 1 than with Y = 0 near x j, so m j is close to zero. Then if y j = 0 (an unusual value) we expect q j will also be close to zero, while if y j = 1 (not unusual) we expect q j will be slightly greater than a half. The estimate M-quantile coefficients allow us to inex the sample ata. A somewhat ifferent inexing base on quantile regression moelling of Y is escribe in Koras (2006). This takes a latent variable approach an the resulting inex is essentially efine by a quantilebase estimate of P (y j = 1 x j ). Uner linearity of the conitional quantiles of this latent variable, we have alreay seen that Q q (x j ) = I(x T j β q > 0) an so P (y j = 1 x j ) = 1 h j, where x T j β h j = 0. Consequently, given an estimate ˆβ q for each value 0 < q < 1 we can inex the sample observations by p j = 1 h j where x T j ˆβ hj = 0. Note that this inex oes not epen on y j, an so cannot reflect iniviual effects, which woul seem to limit its usefulness in characterising how groups iffer after covariate effects have been taken into account. However, we can use the approach leaing to Definition A to exten this inex by allowing it to reflect iniviual effects.this leas to our secon efinition of an estimate M-quantile coefficient for the binary case. DEFINITION B: Given binary ata with fitte M-quantile regression function ˆQ q (x j ; ψ), the estimate M-quantile coefficient for observation j is q j = (h j + y j )/2, where x T j ˆβ hj = 0. Note that if x T j ˆβ q = 0 ˆQ q (x j ; ψ) = 0.5 then Definition B an Definition A are ientical. This conition will hol, for example, whenever ψ is the ientity function an Q q (x j ; ψ) = Q q (x j ) = F (x T j β q) where F (t) is a istribution function that satisfies F (0) = 0.5. Unfortunately, both Definition A an Definition B have a serious eficiency. This follows from the fact that in applications where h j varies aroun some constant, say h, q j will be concentrate near (1 + h)/2 an h/2. Furthermore, it is impossible to observe q j = 0.5 in general. An extreme case is where there is no relationship between y j an x j, an y j = 1 is just as likely as y j = 0. In this case h j = 0.5, an there are just two possible values of q j, 0.75 (y j = 1) an 0.25 (y j = 0). The basic reason for this behaviour is that both Definition A an Definition B compute q j on the same scale as y j. This makes sense when the istribution of y j is measure on a linear scale. However, in the binary case the istribution of y j is linear in the logistic scale, an so it makes sense to efine q j in the same way. That is, we replace q j an h j in Definition B by ˆQ qj (x j ; ψ) an ˆQ 0.5 (x j ; ψ) respectively, leaing to our thir, an final, efinition of q j : DEFINITION C: Given binary ata with fitte M-quantile regression function ˆQ q (x j ; ψ), the estimate M-quantile coefficient for observation j is q j, where ˆQ qj (x j ; ψ) = ( ˆQ 0.5 (x j ; ψ) + y j )/2. Note that uner a logistic specification for ˆQ q (x j ; ψ), using Definition C is equivalent to efining q j as the solution to y j = xt j β q j, where ( 0.5{ yj ˆQ0.5 (x j ; ψ) + y j } ) = log 1 0.5{ ˆQ. 0.5 (x j ; ψ) + y j } 9

12 10 True Area Effects True Area Effects Estimate Parametric Area Effects Estimate M-quantile Coefficients (Definition C) Fig. 1. Estimate area effects vs. true area effects (left plot) an estimate M-quantile coefficients vs. true area effects (right plot) in a Monte-Carlo simulation with D = 200 an n = 25. The value yj above can be thought of as a pseuo-value that behaves like the unobservable latent variable whose istribution etermines that of y j. In the rest of this paper, an particularly in the simulation experiments reporte in Section 5, we use Definition C when calculating estimate M-quantile coefficients. Efficient estimates of area effects are necessary for small area estimation via GLMMs. Similarly, estimation of M-quantile coefficients is necessary for small area estimation using the binary M-quantile moel propose in this paper. A natural question is then the strength of the relationship between the actual area effects an the estimate M-quantile coefficients. Some empirical evience for such a relationship is isplaye in Figure 1. These scatterplots show how area effects estimate using the glmer function in R an M-quantile coefficients estimate via Definition C are relate to true area effects. The simulate ata unerpinning these plots were generate using D = 200 areas, each with a sample size of n = 25. At each simulation, values of x j were inepenently rawn as Normal(0, 1) an corresponing values of y j were then generate as Bernoulli(p j ) with p j = exp{η j }(1 + exp{η j }) 1 an η j = x j + u. The small area effects u were inepenently rawn as Normal(0, 1). Figure 1 shows how the estimate area effects an the estimate M-quantile coefficients are relate to the true area effects in one Monte-Carlo simulation. Over 1, 000 simulations, the average correlation between the true area effects an estimate area effects was 0.89, an the corresponing average correlation between the true area effects an the estimate M-quantile coefficients was These results suggest that M-quantile coefficients are comparable to estimate area effects obtaine using stanar GLMM fitting proceures as far as capturing intra-area (omain) variability is concerne. Note also that these simulations buil on ata generate via a GLMM. In real applications, where GLMM assumptions may be violate, we expect that an M-quantile approach shoul offer a robust alternative for small area estimation.

13 4 3. Mean square error estimation To start, we propose a MSE estimator for (16) base on the linearisation approach set out in Chambers et al. (2009). This assumes that the working moel for inference conitions on the realise values of the area effects, an so the MSE of interest is conitional an equal to a conitional preiction variance plus a square conitional preiction bias. In orer to conserve space, we omit some technical etails in the following evelopment, but these are available from the authors upon request. We also assume that the estimate area-level M-quantile coefficient values θ have negligible variability an so can be treate as fixe. A first orer approximation to the conitional preiction variance of (16) is then V ar(ˆȳ MQ ȳ θ ) = N 2 { [ ] V ar ˆQθ (x j ; ψ) j r {[ N 2 Q θ (x j ; ψ)x T j j r + } V ar(y j ), j r + } V ar(y j ) j r ] V ar( ˆβ [ ] T θ ) Q θ (x j ; ψ)x T j j r 11 which can be estimate by {[ ] [ V ar(ˆȳ MQ ) = N 2 (x ˆQˆθ j ; ψ)x T j V ar( ˆβˆθ ) j r + } V ar(yj ). j r j r ˆQˆθ (x j ; ψ)x T j ] T Here V ar( ˆβˆθ ) is the sanwich-type estimator (12) an V ar(y j ) can be calculate either by (i) using the sample ata from area, V ar(y j ) = ˆȳ (1 ˆȳ ) or by (ii) pooling ata from the entire sample, in which case V ar(y j ) = ˆȳ(1 ˆȳ). Note that the poole estimator shoul lea to more stable preiction variance estimates when area sample sizes are very small. The conitional preiction bias can be approximate using the results of Copas (1988): E(ˆȳ MQ ȳ θ ) 1 2N { β θ { } 1 { [{ Ψ(β θ ) tr β θ } Q θ (x j ; ψ), j r with corresponing plug-in estimator Bias(ˆȳ MQ ) = 1 { } 1{ Ψ(β θ ) 2N β βθ = ˆβˆθ θ { }. β θ j r Q θ (x j ; ψ) βθ = ˆβˆθ The estimator of the conitional MSE of ˆȳ MQ mse A (ˆȳ MQ is then [{ tr β θ β T θ Ψ(β θ ) } V ar( ˆβ ]} θ ) β θ β T θ Ψ(β θ ) βθ = ˆβˆθ } V ar( ˆβˆθ ) ) = V ar(ˆȳ MQ ) + { Bias(ˆȳ MQ )} 2. (17) We also propose two ifferent bootstrap-base methos for estimating the MSE of (16) - a nonparametric bootstrap an a ranom effect block bootstrap. In orer to save space, the compu- ]}

14 12 tational etails of each bootstrap proceure are set out in the Appenix. Here we summarise the main characteristics of each metho. The nonparametric bootstrap-base estimator (hereafter NPB) of the MSE of ˆȳ MQ is efine by using the GLMM representation (2) to generate bootstrap populations of binary values that mimic the real population, sampling from this bootstrap population an calculating (16). This proceure is an extension to binary ata of the metho propose by Tzaviis et al. (2010). The bootstrap proceure is nonparametric in the sense that it replicates area effects in the population by expressing the linear component of the logistic M-quantile regression moel (7) as γ jq = x T j β x T (β θ β 0.5 ), (18) where x T is the vector of area averages of the moel covariates. The last term on the right-han sie of (18) is then interprete as a pseuo-ranom effect for area. The ranom effect block bootstrap, see Chambers & Chanra (2012), is a robust alternative to parametric bootstrap methos for clustere ata. It is free of both the istribution an the epenence assumptions of the usual parametric bootstrap for such ata an is consistent when the mixe moel assumption is vali. In particular, it preserves area effects by bootstrap resampling within areas. We aapt this proceure (hereafter REBB) for estimating the istribution of the M-quantile preictor (16) by resampling the marginal logistic scale resiuals r MQ j = x T j ( ˆβˆθ ˆβ 0.5 ) within each area to generate bootstrap values of P (y j = 1 x j ) for the population units making up the area. Bootstrap binary population values are then obtaine using Bernoulli simulation. 5. SIMULATION STUDIES We present results from two types of simulation stuies that are use to examine the performance of the small area estimators iscusse in the preceing Sections. In Subsection 5 1 below we report results from moel-base simulations where population ata are first generate using a GLMM an a sample is then rawn from this simulate population using a pre-specifie esign. The properties of estimators of small area proportions are assesse by comparing the estimates that they generate base on the sample ata to the corresponing population proportions. In Subsection 5 2 we report results from a esign-base simulation. Here a realistic finite population is simulate by resampling from ata collecte in a sample survey, an then repeate samples are rawn from it using the same (or similar) sample esign as the original survey. In this case the survey ata were source from the European Union Statistics on Income an Living Conitions (EU-SILC) 2005 survey, an the istribution of estimates of small area proportions generate uner repeate sampling is use to compare the properties of ifferent estimators of these proportions. Two ifferent M-quantile versions of (16) were investigate in the simulations, both base on a linear logistic M-quantile moel efine by a Huber influence function with tuning constant c. In the first, referre to as M-quantile below, c = 1.345, while the secon, referre to as Expectile below, c = 100. These estimators were compare with the EBP (3) uner a GLMM with logistic link function an with the irect estimator (the sample proportion). Both MSE estimation an confience interval coverage performance were evaluate using the analytic an two bootstrap methos escribe in Subsection 4 3 for the M-quantile preictor. Note that the logistic M-quantile linear regression fit unerpinning the M-quantile an Expectile preictors was obtaine using an extene version of a M-quantile linear regression moel function for SAE written in R. The parameters of the GLMM use in the EBP were estimate using the function glmer in R.

15 5 1. Moel-base simulations In each simulation we generate N = 5, 000 population values of X an Y in D = 50 small areas with N = 100, = 1,..., D. Iniviual x j values were rawn inepenently at each simulation as Uniform(a, b ), for a = 1 an b = /4, = 1,..., D, j = 1,..., N. Values of y j were then generate as Bernoulli(p j ) with p j = exp{η j }(1 + exp{η j }) 1 an η j = x j β + u. The small area effects u were inepenently rawn from a normal istribution with mean 0 an variance ϕ = 0.25, an β = 1 (González-Manteiga et al., 2007). Population values generate uner this scenario are enote by (0) below. In aition, we generate ata corresponing to a combine misclassification error an measurement error scenario, enote (M) below. In this scenario, a ranom 1% sample of the x j values were replace by 20 (introucing measurement error) an the corresponing y j values were set to 0 (introucing misclassification error). For each of these scenarios T = 1, 000 Monte-Carlo populations were generate. For each generate population an for each area we then took simple ranom samples without replacement of sizes n = 10 an n = 20 so that the overall sample sizes were n = 500 an n = 1, 000. For each sample the M-quantile an Expectile preictors, the EBP an the irect estimator were use to estimate the small area proportions ȳ, = 1,..., D. The performances of ifferent small area estimators for area were evaluate with respect to two criteria: their average error T 1 T t=1 (ˆȳ t ȳ t ) an the square root of their average square error T 1 T t=1 (ˆȳ t ȳ t ) 2. These are enote Bias an RMSE respectively below. Here ȳ t enotes the actual area value at simulation t, with preicte value ˆȳ t. The meian values of Bias an RMSE over the D small areas are set out in Table 1, where we see that claims in the literature (Chambers & Tzaviis, 2006) about the superior outlier robustness of the M-quantile preictor compare with the EBP an the Expectile preictor certainly hol true in these simulations. In particular, uner the (0) scenario the EBP performs better than the M- quantile an Expectile preictors in terms of Bias, whereas the M-quantile preictor is the best uner the (M) scenario. In terms of RMSE, there is nothing to choose between EBP, M-quantile an Expectile uner the (0) scenario, while uner the (M) scenario the M-quantile preictor is clearly superior. Table 1. Moel-base simulation results: Preictors of small area proportions. n = 10 n = 20 Preictor/Scenario (0) (M) (0) (M) Meian values of Bias EBP M-quantile Expectile Direct Meian values of RMSE EBP M-quantile Expectile Direct In orer to evaluate the performances of the MSE estimators propose in Subsection 4 3 we use the ata generate for the scenario with D = 50 an n = 10 an also carrie out a further moel-base simulation stuy with the same sample sizes within the small areas but

16 14 with D = 100 an N = 100. Again, T = 1, 000 Monte-Carlo populations were generate, with iniviual x j values rawn inepenently as Uniform(a, b ), with a = 1 an b = /8, = 1,..., D, j = 1,..., N. For each generate population a simple ranom sample without replacement of size n = 10 was rawn from each area, the M-quantile preictor calculate as well as its linearisation MSE estimator (17) an the two bootstrap MSE estimators NPB an REBB, both base on 100 bootstrap iterations. The behaviour of these MSE estimators for each scenario is isplaye in Table 2 where we show the meians of their area specific Bias an RMSE values, expresse in relative terms (%). We also show the meian empirical coverage rates for nominal 95 per cent confience intervals base on these methos. In the case of (17) these intervals were efine by the small area estimate plus or minus twice the value of the square root of (17). For NPB an REBB these intervals were base on the 2.5 an the 97.5 percentiles of the relevant bootstrap istribution. Examination of Table 2 shows that all three MSE estimation methos ten to be biase low, but all generate nominal 95 per cent confience intervals with acceptable coverage. Overall, the REBB bootstrap estimator seems preferable because it shows smaller or similar bias an more stability than both the linearisation-base estimator (17) an the NPB bootstrap estimator. Table 2. Moel-base simulation results: MSE estimators. Meian values Meian values Meian values of Relative Bias (%) Relative RMSE (%) of Coverage Rate (%) D = 50 Estimator/Scenario (0) (M) (0) (M) (0) (M) mse A (ˆȳ MQ ) mse NP B (ˆȳ MQ ) mse REBB (ˆȳ MQ ) D = 100 mse A (ˆȳ MQ ) mse NP B (ˆȳ MQ ) mse REBB (ˆȳ MQ ) Design-base simulation The population unerpinning the esign-base simulation is base on ata collecte from a sample of 1, 560 househols sprea across 64 municipalities (out of 287) in Tuscany They were collecte by ISTAT as part of the European Union Statistics on Income an Living Conitions (EU-SILC) 2005 survey. These survey ata provie information on a variety of issues relate to living conitions of the people in Tuscany, incluing etails on income an non-income imensions of poverty in the region, an form the basis of poverty assessment in this region. Here we efine the sample municipalities as our small areas of interest. Since nine municipalities ha sample sizes less than three, these were combine with ajacent municipalities, leaing to a total of 55 small areas. We use these sample househols to generate a population of N = 74, 951 househols by sampling with replacement from the original sample of 1, 560 househols with probabilities proportional to their sample weights. Given this (fixe) population, we then inepenently rew 1, 000 stratifie ranom samples from it, with the 55 (reefine) municipalities serving as strata, an with the sample size for each municipality fixe to be the same as in the original sample. Note that these sample sizes varie from 4 to 116. The binary variable of interest Y was efine

17 15 Pearson Resiuals Sample Quantiles Municipality Theoretical Quantiles Fig. 2. Moel fit iagnostics for a logistic mixe moel fit to the EU-SILC ata: This shows the istribution of Pearson resiuals by municipality (left) an a normal probability plot of the estimate municipality effects (right). as taking the value 1 if the equivalise income of a househol was below the meian income for the simulate population an 0 otherwise. The aim was to compare the repeate sampling performance of ifferent preictors of the proportion of househols below meian equivalise income at municipality level, using househol ownership status (a strong inicator of poverty), the age of the hea of the househol, the employment status of the hea of the househol, the gener of the hea of the househol, the years of eucation of the hea of the househol an the househol size as covariates. Figure 2 shows selecte iagnostics for a logistic mixe moel fit to Y base on the original EU-SILC ata an using these covariates. The istribution of Pearson resiuals inicates the presence of potential influential observations in the ata, with a number of large resiuals ( r j > 2). Further evience for the presence of these influential observations is obtaine when we fit the moel using a robust metho (Cantoni & Ronchetti, 2001) an note that although most observations receive a weight of 1 in this fit, there are 19 (about 1% of the overall sample) that receive a weight of less than 0.7. The normal probability plot shown in Figure 2 also inicates that the Gaussian assumption for the istribution of the ranom effects in this GLMM is oubtful. Using a moel that relaxes this assumption, such as an M-quantile moel with a boune influence function, therefore seems reasonable for these ata. Table 3 shows the five point summaries of the municipality level istributions of Bias an RMSE generate uner repeate sampling for the same four estimators that were the focus of the moel-base simulations reporte in the previous Subsection. These show that the M-quantile an Expectile preictors have similar performance an both generally perform better than the EBP an the Direct estimator in terms of RMSE. In particular, if one focuses on meian performance, then the M-quantile preictor seems the best of the four. Table 4 shows the meian values of Bias an RMSE, expresse in relative terms, of the three MSE estimators for the M-quantile preictor. In this case the NPB metho performs best, with the REBB an the linearisation-base estimator (17) isplaying a somewhat larger positive Bias

18 16 Table 3. Design-base simulation results: Distributions of Bias an RMSE for preictors of the proportion of househols with below meian income. Preictor Min Q1 Meian Mean Q3 Max Bias EBP M-quantile Expectile Direct RMSE EBP M-quantile Expectile Direct an slightly more instability. Again, we see that meian coverage performance of all three MSE estimation methos appears acceptable. Table 4. Design-base simulation results: Performances of MSE estimators. Estimator Meian Meian Meian Relative Bias (%) Relative RMSE (%) Coverage Rate (%) mse A (ˆȳ MQ ) mse NP B (ˆȳ MQ ) mse REBB (ˆȳ MQ ) APPLICATIONS 6 1. Estimates of ILO unemployment for UALADs of Great Britain In this Subsection we apply the M-quantile moelling approach to estimating the number of unemploye people age 16 an over in each of 406 Unitary Authorities an Local Authority Districts (UALADs) of Great Britain. We use the ILO unemployment efinition an ata from a sample of about 169, 000 people age 16 an over who participate in the UK Labour Force Survey (LFS) carrie out by the Office for National Statistics (ONS) in 2000, with M-quantile moel covariates specifie by prior stuies of small area estimation of UK labour force characteristics (Molina et al., 2007). In particular, we use the following covariates: sex-age category of an iniviual (6 categories corresponing to Male/Female an age groups 16 25, an > 40), government office region of the UALAD (12 categories), ONS socio-economic classification of the UALAD (7 categories) an total of registere unemploye in the sex-age group for the UALAD. In orer to compare M-quantile estimates with EBP estimates, the same covariates were also use to fit a logistic mixe moel to the LFS ata, an corresponing EBP-type estimates were compute. All estimates of proportions were scale up to estimates of counts by multiplying by population counts for each sex-age group in a UALAD an then summing over these groups within a UALAD. Figure 3 shows the normal probability plot of estimate UALAD ranom effects obtaine by fitting a logistic mixe moel to the sample ata. A Shapiro-Wilk normality test rejects the null hypothesis of normality for these estimate ranom effects (p-value = ). Apply-

19 ing a robust fitting metho (Cantoni & Ronchetti, 2001) to this ata set, we note that although most observations receive a weight of 1, about 3.5% receive weights less than 0.7. Using a less parametric moel, e.g. an M-quantile moel with a boune influence function, therefore seems reasonable for these ata. 17 Sample Quantiles Theoretical Quantiles Fig. 3. Normal probability plot of estimate UALAD ranom effects for proportion of unemploye population age 16 an over, base on a logistic mixe moel fit to UK LFS ata. In orer to assess the resulting M-quantile estimates we note that moel-base small area estimates shoul be both consistent with corresponing unbiase irect estimates an more precise than them. Figure 4 plots the M-quantile estimates of total numbers of unemploye against corresponing irect estimates for each UALAD. We can see that the M-quantile estimates appear to be generally consistent with the irect estimates, although the relationship between these two sets of estimates iverges in the larger UALADs, where the M-quantile estimates ten to be larger. This is consistent with the right skewe istribution of estimate area effects shown in Figure 3. Overall, however, the correlation of 0.78 between the M-quantile estimates an the irect estimates is reasonably high. In orer to assess the gain in precision from using moel-base estimates instea of the irect estimates, we can look at the istribution of the ratios of the estimate coefficients of variation (CVs) of the irect an the moel-base estimates. A value greater than 1 for this ratio inicates that the estimate CV of the moel-base estimate is less than that of the irect estimate. Figure 5 shows the relationship between these ratios an the number of unemploye people in the LFS sample in each UALAD. Two sets of ratios are plotte - those corresponing to EBP estimates (re) an those corresponing to M-quantile estimates (blue). Note that the CV for the M-quantile estimate is calculate as [mse REBB (ˆȳ MQ )] 1/2 /ˆȳ MQ, while that for the EBP esti-

Poisson M-quantile regression for small area estimation

Poisson M-quantile regression for small area estimation University of Wollongong Research Online Centre for Statistical & Survey Methoology Working Paper Series Faculty of Engineering an Information Sciences 2013 Poisson M-quantile regression for small area

More information

Estimation of District Level Poor Households in the State of. Uttar Pradesh in India by Combining NSSO Survey and

Estimation of District Level Poor Households in the State of. Uttar Pradesh in India by Combining NSSO Survey and Int. Statistical Inst.: Proc. 58th Worl Statistical Congress, 2011, Dublin (Session CPS039) p.6567 Estimation of District Level Poor Househols in the State of Uttar Praesh in Inia by Combining NSSO Survey

More information

Survey Sampling. 1 Design-based Inference. Kosuke Imai Department of Politics, Princeton University. February 19, 2013

Survey Sampling. 1 Design-based Inference. Kosuke Imai Department of Politics, Princeton University. February 19, 2013 Survey Sampling Kosuke Imai Department of Politics, Princeton University February 19, 2013 Survey sampling is one of the most commonly use ata collection methos for social scientists. We begin by escribing

More information

Survey-weighted Unit-Level Small Area Estimation

Survey-weighted Unit-Level Small Area Estimation Survey-weighte Unit-Level Small Area Estimation Jan Pablo Burgar an Patricia Dörr Abstract For evience-base regional policy making, geographically ifferentiate estimates of socio-economic inicators are

More information

A comparison of small area estimators of counts aligned with direct higher level estimates

A comparison of small area estimators of counts aligned with direct higher level estimates A comparison of small area estimators of counts aligne with irect higher level estimates Giorgio E. Montanari, M. Giovanna Ranalli an Cecilia Vicarelli Abstract Inirect estimators for small areas use auxiliary

More information

This module is part of the. Memobust Handbook. on Methodology of Modern Business Statistics

This module is part of the. Memobust Handbook. on Methodology of Modern Business Statistics This moule is part of the Memobust Hanbook on Methoology of Moern Business Statistics 26 March 2014 Metho: EBLUP Unit Level for Small Area Estimation Contents General section... 3 1. Summary... 3 2. General

More information

The Role of Models in Model-Assisted and Model- Dependent Estimation for Domains and Small Areas

The Role of Models in Model-Assisted and Model- Dependent Estimation for Domains and Small Areas The Role of Moels in Moel-Assiste an Moel- Depenent Estimation for Domains an Small Areas Risto Lehtonen University of Helsini Mio Myrsylä University of Pennsylvania Carl-Eri Särnal University of Montreal

More information

A Modification of the Jarque-Bera Test. for Normality

A Modification of the Jarque-Bera Test. for Normality Int. J. Contemp. Math. Sciences, Vol. 8, 01, no. 17, 84-85 HIKARI Lt, www.m-hikari.com http://x.oi.org/10.1988/ijcms.01.9106 A Moification of the Jarque-Bera Test for Normality Moawa El-Fallah Ab El-Salam

More information

Topic 7: Convergence of Random Variables

Topic 7: Convergence of Random Variables Topic 7: Convergence of Ranom Variables Course 003, 2016 Page 0 The Inference Problem So far, our starting point has been a given probability space (S, F, P). We now look at how to generate information

More information

A COMPARISON OF SMALL AREA AND CALIBRATION ESTIMATORS VIA SIMULATION

A COMPARISON OF SMALL AREA AND CALIBRATION ESTIMATORS VIA SIMULATION SAISICS IN RANSIION new series an SURVEY MEHODOLOGY 133 SAISICS IN RANSIION new series an SURVEY MEHODOLOGY Joint Issue: Small Area Estimation 014 Vol. 17, No. 1, pp. 133 154 A COMPARISON OF SMALL AREA

More information

Small Area Estimation: A Review of Methods Based on the Application of Mixed Models. Ayoub Saei, Ray Chambers. Abstract

Small Area Estimation: A Review of Methods Based on the Application of Mixed Models. Ayoub Saei, Ray Chambers. Abstract Small Area Estimation: A Review of Methos Base on the Application of Mixe Moels Ayoub Saei, Ray Chambers Abstract This is the review component of the report on small area estimation theory that was prepare

More information

Non-Parametric Bootstrap Mean. Squared Error Estimation For M- Quantile Estimators Of Small Area. Averages, Quantiles And Poverty

Non-Parametric Bootstrap Mean. Squared Error Estimation For M- Quantile Estimators Of Small Area. Averages, Quantiles And Poverty Working Paper M11/02 Methodology Non-Parametric Bootstrap Mean Squared Error Estimation For M- Quantile Estimators Of Small Area Averages, Quantiles And Poverty Indicators Stefano Marchetti, Nikos Tzavidis,

More information

Estimating Unemployment for Small Areas in Navarra, Spain

Estimating Unemployment for Small Areas in Navarra, Spain Estimating Unemployment for Small Areas in Navarra, Spain Ugarte, M.D., Militino, A.F., an Goicoa, T. Departamento e Estaística e Investigación Operativa, Universia Pública e Navarra Campus e Arrosaía,

More information

A Review of Multiple Try MCMC algorithms for Signal Processing

A Review of Multiple Try MCMC algorithms for Signal Processing A Review of Multiple Try MCMC algorithms for Signal Processing Luca Martino Image Processing Lab., Universitat e València (Spain) Universia Carlos III e Mari, Leganes (Spain) Abstract Many applications

More information

Computing Exact Confidence Coefficients of Simultaneous Confidence Intervals for Multinomial Proportions and their Functions

Computing Exact Confidence Coefficients of Simultaneous Confidence Intervals for Multinomial Proportions and their Functions Working Paper 2013:5 Department of Statistics Computing Exact Confience Coefficients of Simultaneous Confience Intervals for Multinomial Proportions an their Functions Shaobo Jin Working Paper 2013:5

More information

Non-parametric bootstrap mean squared error estimation for M-quantile estimates of small area means, quantiles and poverty indicators

Non-parametric bootstrap mean squared error estimation for M-quantile estimates of small area means, quantiles and poverty indicators Non-parametric bootstrap mean squared error estimation for M-quantile estimates of small area means, quantiles and poverty indicators Stefano Marchetti 1 Nikos Tzavidis 2 Monica Pratesi 3 1,3 Department

More information

Time-of-Arrival Estimation in Non-Line-Of-Sight Environments

Time-of-Arrival Estimation in Non-Line-Of-Sight Environments 2 Conference on Information Sciences an Systems, The Johns Hopkins University, March 2, 2 Time-of-Arrival Estimation in Non-Line-Of-Sight Environments Sinan Gezici, Hisashi Kobayashi an H. Vincent Poor

More information

Deliverable 2.2. Small Area Estimation of Indicators on Poverty and Social Exclusion

Deliverable 2.2. Small Area Estimation of Indicators on Poverty and Social Exclusion Deliverable 2.2 Small Area Estimation of Inicators on Poverty an Social Exclusion Version: 2011 Risto Lehtonen, Ari Veijanen, Mio Myrsylä an Maria Valaste The project FP7-SSH-2007-217322 AMELI is supporte

More information

This module is part of the. Memobust Handbook. on Methodology of Modern Business Statistics

This module is part of the. Memobust Handbook. on Methodology of Modern Business Statistics This moule is part of the Memobust Hanbook on Methoology of Moern Business Statistics 26 March 2014 Metho: Balance Sampling for Multi-Way Stratification Contents General section... 3 1. Summary... 3 2.

More information

Tutorial on Maximum Likelyhood Estimation: Parametric Density Estimation

Tutorial on Maximum Likelyhood Estimation: Parametric Density Estimation Tutorial on Maximum Likelyhoo Estimation: Parametric Density Estimation Suhir B Kylasa 03/13/2014 1 Motivation Suppose one wishes to etermine just how biase an unfair coin is. Call the probability of tossing

More information

Robust Forward Algorithms via PAC-Bayes and Laplace Distributions. ω Q. Pr (y(ω x) < 0) = Pr A k

Robust Forward Algorithms via PAC-Bayes and Laplace Distributions. ω Q. Pr (y(ω x) < 0) = Pr A k A Proof of Lemma 2 B Proof of Lemma 3 Proof: Since the support of LL istributions is R, two such istributions are equivalent absolutely continuous with respect to each other an the ivergence is well-efine

More information

Improving Estimation Accuracy in Nonrandomized Response Questioning Methods by Multiple Answers

Improving Estimation Accuracy in Nonrandomized Response Questioning Methods by Multiple Answers International Journal of Statistics an Probability; Vol 6, No 5; September 207 ISSN 927-7032 E-ISSN 927-7040 Publishe by Canaian Center of Science an Eucation Improving Estimation Accuracy in Nonranomize

More information

Outlier robust small area estimation

Outlier robust small area estimation University of Wollongong Research Online Faculty of Engineering and Information Sciences - Papers: Part A Faculty of Engineering and Information Sciences 2014 Outlier robust small area estimation Ray Chambers

More information

Optimization of Geometries by Energy Minimization

Optimization of Geometries by Energy Minimization Optimization of Geometries by Energy Minimization by Tracy P. Hamilton Department of Chemistry University of Alabama at Birmingham Birmingham, AL 3594-140 hamilton@uab.eu Copyright Tracy P. Hamilton, 1997.

More information

LATTICE-BASED D-OPTIMUM DESIGN FOR FOURIER REGRESSION

LATTICE-BASED D-OPTIMUM DESIGN FOR FOURIER REGRESSION The Annals of Statistics 1997, Vol. 25, No. 6, 2313 2327 LATTICE-BASED D-OPTIMUM DESIGN FOR FOURIER REGRESSION By Eva Riccomagno, 1 Rainer Schwabe 2 an Henry P. Wynn 1 University of Warwick, Technische

More information

Least-Squares Regression on Sparse Spaces

Least-Squares Regression on Sparse Spaces Least-Squares Regression on Sparse Spaces Yuri Grinberg, Mahi Milani Far, Joelle Pineau School of Computer Science McGill University Montreal, Canaa {ygrinb,mmilan1,jpineau}@cs.mcgill.ca 1 Introuction

More information

Lecture Introduction. 2 Examples of Measure Concentration. 3 The Johnson-Lindenstrauss Lemma. CS-621 Theory Gems November 28, 2012

Lecture Introduction. 2 Examples of Measure Concentration. 3 The Johnson-Lindenstrauss Lemma. CS-621 Theory Gems November 28, 2012 CS-6 Theory Gems November 8, 0 Lecture Lecturer: Alesaner Mąry Scribes: Alhussein Fawzi, Dorina Thanou Introuction Toay, we will briefly iscuss an important technique in probability theory measure concentration

More information

The total derivative. Chapter Lagrangian and Eulerian approaches

The total derivative. Chapter Lagrangian and Eulerian approaches Chapter 5 The total erivative 51 Lagrangian an Eulerian approaches The representation of a flui through scalar or vector fiels means that each physical quantity uner consieration is escribe as a function

More information

MODELLING DEPENDENCE IN INSURANCE CLAIMS PROCESSES WITH LÉVY COPULAS ABSTRACT KEYWORDS

MODELLING DEPENDENCE IN INSURANCE CLAIMS PROCESSES WITH LÉVY COPULAS ABSTRACT KEYWORDS MODELLING DEPENDENCE IN INSURANCE CLAIMS PROCESSES WITH LÉVY COPULAS BY BENJAMIN AVANZI, LUKE C. CASSAR AND BERNARD WONG ABSTRACT In this paper we investigate the potential of Lévy copulas as a tool for

More information

SYNCHRONOUS SEQUENTIAL CIRCUITS

SYNCHRONOUS SEQUENTIAL CIRCUITS CHAPTER SYNCHRONOUS SEUENTIAL CIRCUITS Registers an counters, two very common synchronous sequential circuits, are introuce in this chapter. Register is a igital circuit for storing information. Contents

More information

'HVLJQ &RQVLGHUDWLRQ LQ 0DWHULDO 6HOHFWLRQ 'HVLJQ 6HQVLWLYLW\,1752'8&7,21

'HVLJQ &RQVLGHUDWLRQ LQ 0DWHULDO 6HOHFWLRQ 'HVLJQ 6HQVLWLYLW\,1752'8&7,21 Large amping in a structural material may be either esirable or unesirable, epening on the engineering application at han. For example, amping is a esirable property to the esigner concerne with limiting

More information

Transmission Line Matrix (TLM) network analogues of reversible trapping processes Part B: scaling and consistency

Transmission Line Matrix (TLM) network analogues of reversible trapping processes Part B: scaling and consistency Transmission Line Matrix (TLM network analogues of reversible trapping processes Part B: scaling an consistency Donar e Cogan * ANC Eucation, 308-310.A. De Mel Mawatha, Colombo 3, Sri Lanka * onarecogan@gmail.com

More information

Chapter 6: Energy-Momentum Tensors

Chapter 6: Energy-Momentum Tensors 49 Chapter 6: Energy-Momentum Tensors This chapter outlines the general theory of energy an momentum conservation in terms of energy-momentum tensors, then applies these ieas to the case of Bohm's moel.

More information

Function Spaces. 1 Hilbert Spaces

Function Spaces. 1 Hilbert Spaces Function Spaces A function space is a set of functions F that has some structure. Often a nonparametric regression function or classifier is chosen to lie in some function space, where the assume structure

More information

Contributors: France, Germany, Italy, Netherlands, Norway, Poland, Spain, Switzerland

Contributors: France, Germany, Italy, Netherlands, Norway, Poland, Spain, Switzerland ESSNET ON SMALL AREA ESTIMATION REPORT ON WORK PACKAGE 5 CASE STUDIES FINAL VERSION REVISION FEBRUARY 1 Contributors: France, Germany, Italy, Netherlans, Norway, Polan, Spain, Switzerlan Contents 1. Overview

More information

Logarithmic spurious regressions

Logarithmic spurious regressions Logarithmic spurious regressions Robert M. e Jong Michigan State University February 5, 22 Abstract Spurious regressions, i.e. regressions in which an integrate process is regresse on another integrate

More information

IPA Derivatives for Make-to-Stock Production-Inventory Systems With Backorders Under the (R,r) Policy

IPA Derivatives for Make-to-Stock Production-Inventory Systems With Backorders Under the (R,r) Policy IPA Derivatives for Make-to-Stock Prouction-Inventory Systems With Backorers Uner the (Rr) Policy Yihong Fan a Benamin Melame b Yao Zhao c Yorai Wari Abstract This paper aresses Infinitesimal Perturbation

More information

Parameter estimation: A new approach to weighting a priori information

Parameter estimation: A new approach to weighting a priori information Parameter estimation: A new approach to weighting a priori information J.L. Mea Department of Mathematics, Boise State University, Boise, ID 83725-555 E-mail: jmea@boisestate.eu Abstract. We propose a

More information

arxiv: v1 [hep-lat] 19 Nov 2013

arxiv: v1 [hep-lat] 19 Nov 2013 HU-EP-13/69 SFB/CPP-13-98 DESY 13-225 Applicability of Quasi-Monte Carlo for lattice systems arxiv:1311.4726v1 [hep-lat] 19 ov 2013, a,b Tobias Hartung, c Karl Jansen, b Hernan Leovey, Anreas Griewank

More information

Schrödinger s equation.

Schrödinger s equation. Physics 342 Lecture 5 Schröinger s Equation Lecture 5 Physics 342 Quantum Mechanics I Wenesay, February 3r, 2010 Toay we iscuss Schröinger s equation an show that it supports the basic interpretation of

More information

Spurious Significance of Treatment Effects in Overfitted Fixed Effect Models Albrecht Ritschl 1 LSE and CEPR. March 2009

Spurious Significance of Treatment Effects in Overfitted Fixed Effect Models Albrecht Ritschl 1 LSE and CEPR. March 2009 Spurious Significance of reatment Effects in Overfitte Fixe Effect Moels Albrecht Ritschl LSE an CEPR March 2009 Introuction Evaluating subsample means across groups an time perios is common in panel stuies

More information

THE VAN KAMPEN EXPANSION FOR LINKED DUFFING LINEAR OSCILLATORS EXCITED BY COLORED NOISE

THE VAN KAMPEN EXPANSION FOR LINKED DUFFING LINEAR OSCILLATORS EXCITED BY COLORED NOISE Journal of Soun an Vibration (1996) 191(3), 397 414 THE VAN KAMPEN EXPANSION FOR LINKED DUFFING LINEAR OSCILLATORS EXCITED BY COLORED NOISE E. M. WEINSTEIN Galaxy Scientific Corporation, 2500 English Creek

More information

Balancing Expected and Worst-Case Utility in Contracting Models with Asymmetric Information and Pooling

Balancing Expected and Worst-Case Utility in Contracting Models with Asymmetric Information and Pooling Balancing Expecte an Worst-Case Utility in Contracting Moels with Asymmetric Information an Pooling R.B.O. erkkamp & W. van en Heuvel & A.P.M. Wagelmans Econometric Institute Report EI2018-01 9th January

More information

Test of Hypotheses in a Time Trend Panel Data Model with Serially Correlated Error Component Disturbances

Test of Hypotheses in a Time Trend Panel Data Model with Serially Correlated Error Component Disturbances HE UNIVERSIY OF EXAS A SAN ANONIO, COLLEGE OF BUSINESS Working Paper SERIES Date September 25, 205 WP # 000ECO-66-205 est of Hypotheses in a ime ren Panel Data Moel with Serially Correlate Error Component

More information

A Fay Herriot Model for Estimating the Proportion of Households in Poverty in Brazilian Municipalities

A Fay Herriot Model for Estimating the Proportion of Households in Poverty in Brazilian Municipalities Int. Statistical Inst.: Proc. 58th Worl Statistical Congress, 2011, Dublin (Session CPS016) p.4218 A Fay Herriot Moel for Estimating the Proportion of Househols in Poverty in Brazilian Municipalities Quintaes,

More information

R package sae: Methodology

R package sae: Methodology R package sae: Methoology Isabel Molina *, Yolana Marhuena March, 2015 Contents 1 Introuction 3 2 Function irect 3 2.1 Sampling without replacement.................... 3 2.2 Sampling with replacement......................

More information

β ˆ j, and the SD path uses the local gradient

β ˆ j, and the SD path uses the local gradient Proceeings of the 00 Winter Simulation Conference E. Yücesan, C.-H. Chen, J. L. Snowon, an J. M. Charnes, es. RESPONSE SURFACE METHODOLOGY REVISITED Ebru Angün Jack P.C. Kleijnen Department of Information

More information

Flexible High-Dimensional Classification Machines and Their Asymptotic Properties

Flexible High-Dimensional Classification Machines and Their Asymptotic Properties Journal of Machine Learning Research 16 (2015) 1547-1572 Submitte 1/14; Revise 9/14; Publishe 8/15 Flexible High-Dimensional Classification Machines an Their Asymptotic Properties Xingye Qiao Department

More information

Perturbation Analysis and Optimization of Stochastic Flow Networks

Perturbation Analysis and Optimization of Stochastic Flow Networks IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. XX, NO. Y, MMM 2004 1 Perturbation Analysis an Optimization of Stochastic Flow Networks Gang Sun, Christos G. Cassanras, Yorai Wari, Christos G. Panayiotou,

More information

6 General properties of an autonomous system of two first order ODE

6 General properties of an autonomous system of two first order ODE 6 General properties of an autonomous system of two first orer ODE Here we embark on stuying the autonomous system of two first orer ifferential equations of the form ẋ 1 = f 1 (, x 2 ), ẋ 2 = f 2 (, x

More information

TEMPORAL AND TIME-FREQUENCY CORRELATION-BASED BLIND SOURCE SEPARATION METHODS. Yannick DEVILLE

TEMPORAL AND TIME-FREQUENCY CORRELATION-BASED BLIND SOURCE SEPARATION METHODS. Yannick DEVILLE TEMPORAL AND TIME-FREQUENCY CORRELATION-BASED BLIND SOURCE SEPARATION METHODS Yannick DEVILLE Université Paul Sabatier Laboratoire Acoustique, Métrologie, Instrumentation Bât. 3RB2, 8 Route e Narbonne,

More information

Nonparametric Small Area Estimation via M-quantile Regression using Penalized Splines

Nonparametric Small Area Estimation via M-quantile Regression using Penalized Splines Nonparametric Small Estimation via M-quantile Regression using Penalized Splines Monica Pratesi 10 August 2008 Abstract The demand of reliable statistics for small areas, when only reduced sizes of the

More information

Gaussian processes with monotonicity information

Gaussian processes with monotonicity information Gaussian processes with monotonicity information Anonymous Author Anonymous Author Unknown Institution Unknown Institution Abstract A metho for using monotonicity information in multivariate Gaussian process

More information

Research Article When Inflation Causes No Increase in Claim Amounts

Research Article When Inflation Causes No Increase in Claim Amounts Probability an Statistics Volume 2009, Article ID 943926, 10 pages oi:10.1155/2009/943926 Research Article When Inflation Causes No Increase in Claim Amounts Vytaras Brazauskas, 1 Bruce L. Jones, 2 an

More information

Robust Low Rank Kernel Embeddings of Multivariate Distributions

Robust Low Rank Kernel Embeddings of Multivariate Distributions Robust Low Rank Kernel Embeings of Multivariate Distributions Le Song, Bo Dai College of Computing, Georgia Institute of Technology lsong@cc.gatech.eu, boai@gatech.eu Abstract Kernel embeing of istributions

More information

Convergence of Random Walks

Convergence of Random Walks Chapter 16 Convergence of Ranom Walks This lecture examines the convergence of ranom walks to the Wiener process. This is very important both physically an statistically, an illustrates the utility of

More information

Modeling the effects of polydispersity on the viscosity of noncolloidal hard sphere suspensions. Paul M. Mwasame, Norman J. Wagner, Antony N.

Modeling the effects of polydispersity on the viscosity of noncolloidal hard sphere suspensions. Paul M. Mwasame, Norman J. Wagner, Antony N. Submitte to the Journal of Rheology Moeling the effects of polyispersity on the viscosity of noncolloial har sphere suspensions Paul M. Mwasame, Norman J. Wagner, Antony N. Beris a) epartment of Chemical

More information

A variance decomposition and a Central Limit Theorem for empirical losses associated with resampling designs

A variance decomposition and a Central Limit Theorem for empirical losses associated with resampling designs Mathias Fuchs, Norbert Krautenbacher A variance ecomposition an a Central Limit Theorem for empirical losses associate with resampling esigns Technical Report Number 173, 2014 Department of Statistics

More information

Lecture 6: Generalized multivariate analysis of variance

Lecture 6: Generalized multivariate analysis of variance Lecture 6: Generalize multivariate analysis of variance Measuring association of the entire microbiome with other variables Distance matrices capture some aspects of the ata (e.g. microbiome composition,

More information

Inverse Theory Course: LTU Kiruna. Day 1

Inverse Theory Course: LTU Kiruna. Day 1 Inverse Theory Course: LTU Kiruna. Day Hugh Pumphrey March 6, 0 Preamble These are the notes for the course Inverse Theory to be taught at LuleåTekniska Universitet, Kiruna in February 00. They are not

More information

The Exact Form and General Integrating Factors

The Exact Form and General Integrating Factors 7 The Exact Form an General Integrating Factors In the previous chapters, we ve seen how separable an linear ifferential equations can be solve using methos for converting them to forms that can be easily

More information

Lecture 2: Correlated Topic Model

Lecture 2: Correlated Topic Model Probabilistic Moels for Unsupervise Learning Spring 203 Lecture 2: Correlate Topic Moel Inference for Correlate Topic Moel Yuan Yuan First of all, let us make some claims about the parameters an variables

More information

Estimating International Migration on the Base of Small Area Techniques

Estimating International Migration on the Base of Small Area Techniques MPRA Munich Personal RePEc Archive Estimating International Migration on the Base of Small Area echniques Vergil Voineagu an Nicoleta Caragea an Silvia Pisica 2013 Online at http://mpra.ub.uni-muenchen.e/48775/

More information

under the null hypothesis, the sign test (with continuity correction) rejects H 0 when α n + n 2 2.

under the null hypothesis, the sign test (with continuity correction) rejects H 0 when α n + n 2 2. Assignment 13 Exercise 8.4 For the hypotheses consiere in Examples 8.12 an 8.13, the sign test is base on the statistic N + = #{i : Z i > 0}. Since 2 n(n + /n 1) N(0, 1) 2 uner the null hypothesis, the

More information

Math 342 Partial Differential Equations «Viktor Grigoryan

Math 342 Partial Differential Equations «Viktor Grigoryan Math 342 Partial Differential Equations «Viktor Grigoryan 6 Wave equation: solution In this lecture we will solve the wave equation on the entire real line x R. This correspons to a string of infinite

More information

inflow outflow Part I. Regular tasks for MAE598/494 Task 1

inflow outflow Part I. Regular tasks for MAE598/494 Task 1 MAE 494/598, Fall 2016 Project #1 (Regular tasks = 20 points) Har copy of report is ue at the start of class on the ue ate. The rules on collaboration will be release separately. Please always follow the

More information

Track Initialization from Incomplete Measurements

Track Initialization from Incomplete Measurements Track Initialiation from Incomplete Measurements Christian R. Berger, Martina Daun an Wolfgang Koch Department of Electrical an Computer Engineering, University of Connecticut, Storrs, Connecticut 6269,

More information

One-dimensional I test and direction vector I test with array references by induction variable

One-dimensional I test and direction vector I test with array references by induction variable Int. J. High Performance Computing an Networking, Vol. 3, No. 4, 2005 219 One-imensional I test an irection vector I test with array references by inuction variable Minyi Guo School of Computer Science

More information

Nonparametric Additive Models

Nonparametric Additive Models Nonparametric Aitive Moels Joel L. Horowitz The Institute for Fiscal Stuies Department of Economics, UCL cemmap working paper CWP20/2 Nonparametric Aitive Moels Joel L. Horowitz. INTRODUCTION Much applie

More information

Table of Common Derivatives By David Abraham

Table of Common Derivatives By David Abraham Prouct an Quotient Rules: Table of Common Derivatives By Davi Abraham [ f ( g( ] = [ f ( ] g( + f ( [ g( ] f ( = g( [ f ( ] g( g( f ( [ g( ] Trigonometric Functions: sin( = cos( cos( = sin( tan( = sec

More information

Simple Tests for Exogeneity of a Binary Explanatory Variable in Count Data Regression Models

Simple Tests for Exogeneity of a Binary Explanatory Variable in Count Data Regression Models Communications in Statistics Simulation an Computation, 38: 1834 1855, 2009 Copyright Taylor & Francis Group, LLC ISSN: 0361-0918 print/1532-4141 online DOI: 10.1080/03610910903147789 Simple Tests for

More information

Linear First-Order Equations

Linear First-Order Equations 5 Linear First-Orer Equations Linear first-orer ifferential equations make up another important class of ifferential equations that commonly arise in applications an are relatively easy to solve (in theory)

More information

Robustness and Perturbations of Minimal Bases

Robustness and Perturbations of Minimal Bases Robustness an Perturbations of Minimal Bases Paul Van Dooren an Froilán M Dopico December 9, 2016 Abstract Polynomial minimal bases of rational vector subspaces are a classical concept that plays an important

More information

arxiv:hep-th/ v1 3 Feb 1993

arxiv:hep-th/ v1 3 Feb 1993 NBI-HE-9-89 PAR LPTHE 9-49 FTUAM 9-44 November 99 Matrix moel calculations beyon the spherical limit arxiv:hep-th/93004v 3 Feb 993 J. Ambjørn The Niels Bohr Institute Blegamsvej 7, DK-00 Copenhagen Ø,

More information

Role of parameters in the stochastic dynamics of a stick-slip oscillator

Role of parameters in the stochastic dynamics of a stick-slip oscillator Proceeing Series of the Brazilian Society of Applie an Computational Mathematics, v. 6, n. 1, 218. Trabalho apresentao no XXXVII CNMAC, S.J. os Campos - SP, 217. Proceeing Series of the Brazilian Society

More information

A. Exclusive KL View of the MLE

A. Exclusive KL View of the MLE A. Exclusive KL View of the MLE Lets assume a change-of-variable moel p Z z on the ranom variable Z R m, such as the one use in Dinh et al. 2017: z 0 p 0 z 0 an z = ψz 0, where ψ is an invertible function

More information

Similarity Measures for Categorical Data A Comparative Study. Technical Report

Similarity Measures for Categorical Data A Comparative Study. Technical Report Similarity Measures for Categorical Data A Comparative Stuy Technical Report Department of Computer Science an Engineering University of Minnesota 4-92 EECS Builing 200 Union Street SE Minneapolis, MN

More information

Qubit channels that achieve capacity with two states

Qubit channels that achieve capacity with two states Qubit channels that achieve capacity with two states Dominic W. Berry Department of Physics, The University of Queenslan, Brisbane, Queenslan 4072, Australia Receive 22 December 2004; publishe 22 March

More information

Separation of Variables

Separation of Variables Physics 342 Lecture 1 Separation of Variables Lecture 1 Physics 342 Quantum Mechanics I Monay, January 25th, 2010 There are three basic mathematical tools we nee, an then we can begin working on the physical

More information

Multi-View Clustering via Canonical Correlation Analysis

Multi-View Clustering via Canonical Correlation Analysis Technical Report TTI-TR-2008-5 Multi-View Clustering via Canonical Correlation Analysis Kamalika Chauhuri UC San Diego Sham M. Kakae Toyota Technological Institute at Chicago ABSTRACT Clustering ata in

More information

THE EFFICIENCIES OF THE SPATIAL MEDIAN AND SPATIAL SIGN COVARIANCE MATRIX FOR ELLIPTICALLY SYMMETRIC DISTRIBUTIONS

THE EFFICIENCIES OF THE SPATIAL MEDIAN AND SPATIAL SIGN COVARIANCE MATRIX FOR ELLIPTICALLY SYMMETRIC DISTRIBUTIONS THE EFFICIENCIES OF THE SPATIAL MEDIAN AND SPATIAL SIGN COVARIANCE MATRIX FOR ELLIPTICALLY SYMMETRIC DISTRIBUTIONS BY ANDREW F. MAGYAR A issertation submitte to the Grauate School New Brunswick Rutgers,

More information

Reconstructible Phylogenetic Networks: Do Not Distinguish the Indistinguishable

Reconstructible Phylogenetic Networks: Do Not Distinguish the Indistinguishable Reconstructible Phylogenetic Networks: Do Not Distinguish the Inistinguishable Fabio Pari, Celine Scornavacca To cite this version: Fabio Pari, Celine Scornavacca. Reconstructible Phylogenetic Networks:

More information

Florian Heiss; Viktor Winschel: Estimation with Numerical Integration on Sparse Grids

Florian Heiss; Viktor Winschel: Estimation with Numerical Integration on Sparse Grids Florian Heiss; Viktor Winschel: Estimation with Numerical Integration on Sparse Gris Munich Discussion Paper No. 2006-15 Department of Economics University of Munich Volkswirtschaftliche Fakultät Luwig-Maximilians-Universität

More information

ensembles When working with density operators, we can use this connection to define a generalized Bloch vector: v x Tr x, v y Tr y

ensembles When working with density operators, we can use this connection to define a generalized Bloch vector: v x Tr x, v y Tr y Ph195a lecture notes, 1/3/01 Density operators for spin- 1 ensembles So far in our iscussion of spin- 1 systems, we have restricte our attention to the case of pure states an Hamiltonian evolution. Toay

More information

Combining Time Series and Cross-sectional Data for Current Employment Statistics Estimates 1

Combining Time Series and Cross-sectional Data for Current Employment Statistics Estimates 1 JSM015 - Surey Research Methos Section Combining Time Series an Cross-sectional Data for Current Employment Statistics Estimates 1 Julie Gershunskaya U.S. Bureau of Labor Statistics, Massachusetts Ae NE,

More information

Estimation of Complex Small Area Parameters with Application to Poverty Indicators

Estimation of Complex Small Area Parameters with Application to Poverty Indicators 1 Estimation of Complex Small Area Parameters with Application to Poverty Indicators J.N.K. Rao School of Mathematics and Statistics, Carleton University (Joint work with Isabel Molina from Universidad

More information

. Using a multinomial model gives us the following equation for P d. , with respect to same length term sequences.

. Using a multinomial model gives us the following equation for P d. , with respect to same length term sequences. S 63 Lecture 8 2/2/26 Lecturer Lillian Lee Scribes Peter Babinski, Davi Lin Basic Language Moeling Approach I. Special ase of LM-base Approach a. Recap of Formulas an Terms b. Fixing θ? c. About that Multinomial

More information

How to Minimize Maximum Regret in Repeated Decision-Making

How to Minimize Maximum Regret in Repeated Decision-Making How to Minimize Maximum Regret in Repeate Decision-Making Karl H. Schlag July 3 2003 Economics Department, European University Institute, Via ella Piazzuola 43, 033 Florence, Italy, Tel: 0039-0-4689, email:

More information

Disease mapping via negative binomial M-quantile regression

Disease mapping via negative binomial M-quantile regression University of Wollongong Research Online Centre for Statistical & Survey Methodology Working Paper Series Faculty of Engineering and Information Sciences 2013 Disease mapping via negative binomial M-quantile

More information

19 Eigenvalues, Eigenvectors, Ordinary Differential Equations, and Control

19 Eigenvalues, Eigenvectors, Ordinary Differential Equations, and Control 19 Eigenvalues, Eigenvectors, Orinary Differential Equations, an Control This section introuces eigenvalues an eigenvectors of a matrix, an iscusses the role of the eigenvalues in etermining the behavior

More information

Hyperbolic Moment Equations Using Quadrature-Based Projection Methods

Hyperbolic Moment Equations Using Quadrature-Based Projection Methods Hyperbolic Moment Equations Using Quarature-Base Projection Methos J. Koellermeier an M. Torrilhon Department of Mathematics, RWTH Aachen University, Aachen, Germany Abstract. Kinetic equations like the

More information

Switching Time Optimization in Discretized Hybrid Dynamical Systems

Switching Time Optimization in Discretized Hybrid Dynamical Systems Switching Time Optimization in Discretize Hybri Dynamical Systems Kathrin Flaßkamp, To Murphey, an Sina Ober-Blöbaum Abstract Switching time optimization (STO) arises in systems that have a finite set

More information

Inter-domain Gaussian Processes for Sparse Inference using Inducing Features

Inter-domain Gaussian Processes for Sparse Inference using Inducing Features Inter-omain Gaussian Processes for Sparse Inference using Inucing Features Miguel Lázaro-Greilla an Aníbal R. Figueiras-Vial Dep. Signal Processing & Communications Universia Carlos III e Mari, SPAIN {miguel,arfv}@tsc.uc3m.es

More information

Closed and Open Loop Optimal Control of Buffer and Energy of a Wireless Device

Closed and Open Loop Optimal Control of Buffer and Energy of a Wireless Device Close an Open Loop Optimal Control of Buffer an Energy of a Wireless Device V. S. Borkar School of Technology an Computer Science TIFR, umbai, Inia. borkar@tifr.res.in A. A. Kherani B. J. Prabhu INRIA

More information

Quantile function expansion using regularly varying functions

Quantile function expansion using regularly varying functions Quantile function expansion using regularly varying functions arxiv:705.09494v [math.st] 9 Aug 07 Thomas Fung a, an Eugene Seneta b a Department of Statistics, Macquarie University, NSW 09, Australia b

More information

Sparse Reconstruction of Systems of Ordinary Differential Equations

Sparse Reconstruction of Systems of Ordinary Differential Equations Sparse Reconstruction of Systems of Orinary Differential Equations Manuel Mai a, Mark D. Shattuck b,c, Corey S. O Hern c,a,,e, a Department of Physics, Yale University, New Haven, Connecticut 06520, USA

More information

Lecture 6: Calculus. In Song Kim. September 7, 2011

Lecture 6: Calculus. In Song Kim. September 7, 2011 Lecture 6: Calculus In Song Kim September 7, 20 Introuction to Differential Calculus In our previous lecture we came up with several ways to analyze functions. We saw previously that the slope of a linear

More information

05 The Continuum Limit and the Wave Equation

05 The Continuum Limit and the Wave Equation Utah State University DigitalCommons@USU Founations of Wave Phenomena Physics, Department of 1-1-2004 05 The Continuum Limit an the Wave Equation Charles G. Torre Department of Physics, Utah State University,

More information

Math Notes on differentials, the Chain Rule, gradients, directional derivative, and normal vectors

Math Notes on differentials, the Chain Rule, gradients, directional derivative, and normal vectors Math 18.02 Notes on ifferentials, the Chain Rule, graients, irectional erivative, an normal vectors Tangent plane an linear approximation We efine the partial erivatives of f( xy, ) as follows: f f( x+

More information

Equilibrium in Queues Under Unknown Service Times and Service Value

Equilibrium in Queues Under Unknown Service Times and Service Value University of Pennsylvania ScholarlyCommons Finance Papers Wharton Faculty Research 1-2014 Equilibrium in Queues Uner Unknown Service Times an Service Value Laurens Debo Senthil K. Veeraraghavan University

More information