University of Michigan School of Public Health

Size: px
Start display at page:

Download "University of Michigan School of Public Health"

Transcription

1 University of Michigan School of Public Health The University of Michigan Deartment of Biostatistics Working Paer Series ear 003 Paer 5 Robust Likelihood-based Analysis of Multivariate Data with Missing Values Rod Little An Hyonggin University of Michigan, rlittle@umich.edu University of Michigan, hyongg@umich.edu This working aer is hosted by The Berkeley Electronic Press (beress) and may not be commercially reroduced without the ermission of the coyright holder. htt://biostats.beress.com/umichbiostat/aer5 Coyright c 003 by the authors.

2 Robust Likelihood-based Analysis of Multivariate Data with Missing Values Rod Little and An Hyonggin Abstract The model-based aroach to inference from multivariate data with missing values is reviewed. Regression rediction is most useful when the covariates are redictive of the missing values and the robability of being missing, and in these circumstances redictions are articularly sensitive to model missecification. The use of enalized slines of the roensity score is roosed to yield robust model-based inference under the missing at random (MAR) assumtion, assuming monotone missing data. Simulation comarisons with other methods suggest that the method works well in a wide range of oulations, with little loss of efficiency relative to arametric models when the latter are correct. Extensions to more general atterns are outlined.

3 Draft Dec 4, 003 Robust likelihood-based analysis of multivariate data with missing values Roderick Little and Hyonggin An University of Michigan Hosted by The Berkeley Electronic Press

4 Abstract The model-based aroach to inference from multivariate data with missing values is reviewed. Regression rediction is most useful when the covariates are redictive of the missing values and the robability of being missing, and in these circumstances redictions are articularly sensitive to model missecification. The use of enalized slines of the roensity score is roosed to yield robust model-based inference under the missing at random (MAR) assumtion, assuming monotone missing data. Simulation comarisons with other methods suggest that the method works well in a wide range of oulations, with little loss of efficiency relative to arametric models when the latter are correct. Extensions to more general atterns are outlined. KEWORDS: double robustness, incomlete data, enalized slines, regression imutation, weighting. Introduction Missing values arise in emirical studies for many reasons. For examle, in longitudinal studies, data are missing because of attrition, when subjects dro out rior to the end of the study. In most surveys, some individuals rovide no information because of non-contact or refusal to resond (unit nonresonse). Other individuals are contacted and rovide some information, but fail to answer some of the questions (item nonresonse). Often indices are constructed by summing values of articular items. For examle, in economic studies, total net worth is a combination of values of individual assets or liabilities, some of which may be missing. If any of the items that form the index are missing, some rocedure is needed to deal with the missing data. htt://biostats.beress.com/umichbiostat/aer5

5 The missing data attern simly indicates which values in the data set are observed and which are missing. Secifically, let = ( ) denote an ( n ) rectangular dataset without y ij missing values, with ith row yi = ( yi,..., yi) where y ij is the value of variable j for subject i. With missing values, the attern of missing data is defined by the missing-data indicator matrix M = ( ) with ith row mi = ( mi,..., mi), such that m ij = if y ij is missing and m ij = 0 if y ij is m ij resent. We assume throughout that ( y, m ) are indeendent over i. i i Some methods for handling missing data aly to any attern of missing data, whereas other methods assume a secial attern. For simlicity we consider methods for the simle attern of univariate nonresonse, where missingness is confined to a single variable, say and,..., are fully observed. In Section 7 we discuss extensions of our methods to more general atterns, such as monotone missing data, where the variables can be arranged so that,,..., j+ is missing for all cases where j is missing, for all j =,..., -. This attern arises commonly in longitudinal data subject to attrition. The erformance of alternative missing-data methods deends strongly on the missingdata mechanism, which concerns the reasons why values are missing, and in articular whether missingness deends on the values of variables in the data set. For examle, subjects in a longitudinal intervention may more likely to dro out of a study because they feel the treatment was ineffective, which might be related to a oor value of an outcome measure. Rubin (976) treated M as a random matrix, and characterized the missing-data mechanism by the conditional distribution of M given, say f ( M, φ ), where φ denotes unknown arameters. When missingness does not deend on the values of the data, missing or observed, that is: f ( M, φ) = f ( M φ) for all, φ, Hosted by The Berkeley Electronic Press

6 the data are called missing comletely at random (MCAR). With the excetion of lanned missing-data designs, MCAR is a strong assumtion, and missingness often does deend on recorded variables. Let obs denote the observed values of and mis the missing values. A less restrictive assumtion is that missingness deends only on values obs that are observed, and not on values mis that are missing. That is: f ( M, φ) = f ( M, φ) for all, φ. The missing data mechanism is then called missing at random (MAR). Many methods for handling missing data assume the mechanism is MCAR or MAR, and yield biased estimates when the data are not MAR (NMAR). The main ideas of this article can be summarized in the following roositions: (a) When the missing data mechanism is unknown and NMAR, methodological otions are limited and not very aealing to the ractitioner. Thus, in studies where missing data are likely to arise, efforts should be made to render the MAR assumtion lausible, by measuring covariates that characterize nonresondents (Little and Rubin, 999). (b) The most useful covariates for nonresonse adjustment are (i) redictive of the missing values mis and (ii) redictive of the missing data indicator M. Of the two, criterion (i) is the most imortant, since conditioning on a covariate that is redictive of M but not of mis leads to a loss of efficiency without a comensating reduction in bias. Section 3 resents an analysis in suort of these statements. (c) All missing-data adjustments require modeling assumtions relating the missing data to observed covariates. Sensitivity to assumtions is a articularly serious issue for analysis involving covariates that are useful for missing-data adjustments, as described in (b). (d) Given (a) - (c), missing-data methods based on MAR and models that make relatively weak assumtions relating the covariates to the missing data are useful. Methods of this kind based on obs mis 3 htt://biostats.beress.com/umichbiostat/aer5

7 roensity slines are roosed in Sections 4 and 5 below, for the secial case of univariate nonresonse. These methods are assessed by simulation in Section 6. Some extensions of these methods to more general missing data roblems are outlined in Section 7, and Section 8 resents concluding remarks.. Limitations of NMAR analyses when the missing data mechanism is unknown. There is an extensive literature of methods for NMAR missing-data mechanisms; early examles include Heckman s (976) roosals for handling selectivity bias, and Rubin s (977) Bayesian analysis. See also Little and Rubin (00, chater 5). The difficulty of the roblem can be seen by considering the simlest situation of a single variable (that is, = ), observed for r cases and missing for n - r cases, with no covariate information. Suose the resondent values of are indeendently distributed with mean µ R and variance σ, and the nonresondent values are indeendently distributed with mean µ NR and variance σ. If the observations are indeendent, then MCAR=MAR, and µ R = µ NR. In that case, the samle mean y based on the r comlete cases is unbiased, and in many cases otimal for the mean. If, on the other hand, the data are NMAR, the bias of y for inference about the overall mean is easily seen to be f λσ, where f = ( n r)/ n is the fraction of missing values and / λ = ( µ µ )/ σ is / R NR the standardized difference in resondent and nonresondent means. Assuming asymtotic normality and ignoring t corrections, the noncoverage rate of the usual 95% confidence interval y ±.96 s / r based on the comlete cases is Φ (.96 + rfλ) +Φ (.96 rfλ), 4 Hosted by The Berkeley Electronic Press

8 Table. Coverage of 95% confidence interval for oulation mean when the resondent mean has a bias f λ = 0.. Resondent samle size Coverage rate (%) where Φ denotes the normal cumulative density function. Table tabulates this noncoverage rate as a function of the resondent samle size r, for a fixed bias of f λ = 0.. Clearly bias has an increasing distorting effect on the noncoverage as the samle size increases. Analysis otions are clearly limited in the absence of information about the nonresondents. Other than assuming the bias away, the only alternative is to widen the interval to allow for otential bias. Three aroaches to this are: (a) To develo bounds for the quantity of interest that include all ossible values of the missing data. For examle, for a binary outcome, one might calculate the samle roortion with all missing values imuted as one, and all missing values imuted as zero (Horowitz and Manski, 000). This aroach tends to be very conservative, and is limited to variables that have finite suort. (b) Conduct a sensitivity analysis for alternative models for nonignorable nonresonse (Rubin, 977; Little and Wang, 996; Scharfstein, Rotnitsky and Robins, 999). (c) Add a rior distribution for the nonresondent values and aly the Bayesian aradigm. For examle, Rubin (977) considers the model: µ R ~ const. ; µ NR µ R ~ N( µ R, λσ ) An alternative aroach is to attemt to measure covariates that cature differences between resondents and nonresondents, so that the missing-data mechanism can be considered MAR. For the remainder of this aer we consider models under the assumtion that the missing 5 htt://biostats.beress.com/umichbiostat/aer5

9 data are MAR, while recognizing that residual deendence of the missing data indicators on missing values of the data may require one of the aroaches (a) - (c) delineated above. 3. Covariates to the rescue? Suose now that fully observed covariates are available, and let,... denote the variables observed for all n cases, and r cases. The mean of can be written as the variable with missing values, observed for the first µ = E[( M ) ] + EME [ ( X)], and E[( M) ] can be estimated from the comlete cases. To estimate the second term EME [ ( X )], note that under MAR, E ( X) = E ( X, M = 0) = E ( X, M = ). Hence for incomlete cases ( M = ) one can estimate E ( X ) from the comlete cases and redict the for each incomlete case by substituting the X for that case into the regression formula. If the regression is linear, this leads to the regression estimate: µ ˆ r n ˆ = n yi + yi i= i= r+ where { yi, i =,..., r} are the observed values of, and () yˆ = βˆ + βˆ y is the rediction i 0 j= j ij from the regression of on (,..., ), comuted on the r comlete cases. Eq. () is the maximum likelihood (ML) estimate of µ for a variety of models, including multivariate normality for (,..., ) (e.g., see Little and Rubin, 00). The imact of regressing on covariates for inference about µ can be assessed by comaring the mean squared error of µ ˆ relative to the estimate based on the comlete cases, 6 Hosted by The Berkeley Electronic Press

10 y = r yi / r. Cons ider this comarison for a single covariate ( = ), where and are i= bivariate normal, and the missing data are MAR. The regression estimate () is then unbiased for µ with mean squared error (e.g. Little and Rubin, 00) ignoring ˆ ( ) mse( µ ) = ( σ / r) ( ρ ) + ( r / n) ρ + ( ρ )( r/ n), O(/ r ) terms, where ρ is the correlation between resondent values of and, and is the difference in the nonresondent and resondent mean of, divided by the resondent variance of. Note the ρ measures the association between and and measures the association between and M. The mean squared error of y is mse( y ) = ( σ / r) + ( r/ n) ρσ, where the first term on the right side is the variance and the second term is the bias. Subtracting and simlifying yields mse( y ) mse( µ ˆ ) = ( r/ n) σ ( r/ n) ρ + ρ / r ( r/ n)( ρ ) / r. () The first term in the square arenthesis in Eq. () is O () and is the bias that has been eliminated by the regression of on (more generally under NMAR one exects the regression to reduce bias, although it could increase). Both ρ and must be large for this term to be substantial. The second and third terms in the square arenthesis reresent variance reduction from the regression on. This variance reduction is substantial when ρ is large. In fact, if ρ is small and is large, as when is redictive of M but not redictive of, the net value of these terms may be negative, reflecting an increase in variance from the regression on. These results are summarized in Table. Eq. () generalizes to a multivariate set of redictors, with the 7 htt://biostats.beress.com/umichbiostat/aer5

11 Table. Effect on bias and variance of the estimated mean of of regression on a fullyobserved covariate, for combinations of the association between and ( ρ ) and the association between and M ( ). ρ Low Low bias change: 0 variance change: 0 High bias change: 0 variance change: ρ High bias change : 0 variance change: bias change : variance change: obvious generalizations of ρ and. Clearly, the key for both bias and variance reduction is that is a good redictor of. 4. Robust MAR inference with a single covariate 4.. Robust Prediction In the revious section we noted that the key to reduce mean squared error for inference about the mean of is to find redictors that are redictive of and the missing data indicator M. These are the circumstances under which inference are most sensitive to missecification of the regression of on,...,, since the bias reduction is deendent on an aroriate secification of the model relating to,...,. Thus we now consider robust alternatives to the linear additive model (). We first consider the case of a single covariate, =. Extensions to more than one covariate are discussed in Sections 5 and 7. Standard regression modeling methods, such as adding olynomial terms and interactions to the regression in (), are useful strategies. Perhas the simlest way to weaken assumtions 8 Hosted by The Berkeley Electronic Press

12 about the relationshi between and a continuous covariate is to grou the covariate into categories and regress on dummy variables for the categories. The resulting regression estimate, µ ˆ C = y, (3) c c c= is the average of the resondent mean y c in each category weighted by the samle roortion = n / n in that category. c c An attractive alternative to categorization of continuous covariates is to fit a smooth but relatively nonarametric relationshi between and the covariate (Cheng, 994). For examle, one might model the regression of on via a enalized sline (Eilers and Marx, 996, Ruert and Carroll, 000) with a ower-truncated sline basis: ( φ σ ) (, φ)~ N s (, ), q j ( φ) = φ + φ + φ ( τ ) s, y, 0 j i q+ k k + j= k= K q, (4) q q where q is the degree of olynomial, ( x) x I( x 0) =, τ + < < τk are selected fixed knots, and K is the total number of knots. Then, the enalized least-squares estimator φˆ = ( φ ) T 0,, φ q + K can be obtained by minimizing the enalized sum of squared errors n q K K j q yi φ0 φjyi φq+ k( yi τ k) + λ ζ ( φq+ k), + i= j= k= k= where ζ is a suitable nonnegative function, and λ is a smoothing arameter. The smoothing arameter can be estimated by generalized cross validation or by ML for a linear mixed model, treating ( φ,, 0 φq ) T as a fixed arameter vector and ( φ φ ) T,, as a random vector. Cheng q+ q+ K (994) achieves nonarametric smoothing by another method, kernel regression; an attractive feature of the ML version of enalized slines is that they are easily imlemented with widely 9 htt://biostats.beress.com/umichbiostat/aer5

13 available software such as PROC MIXED in SAS (SAS, 99) and lme( ) in S-lus (Pinheiro and Bates, 000). 4.. Weighting the comlete cases. An alternative to rediction, commonly used for unit nonresonse adjustments in samle surveys, is to weight the comlete cases by the inverse of an estimate of the robability of resonse (e.g. Little and Rubin, 003, Section 3.3). The mean of can be written as: µ ( M ) M, = E / E π( ) π( ) where π ( ) = Pr( M = 0 ) is the robability that is observed given. The denominator in this equation can be ignored under correct secification of the π ( ), since it then equals one. Relacing oulation quantities by samle estimates yields the weighted comlete-case estimate: or where the weight r r µ ˆ yw = wy i i / wi i= i=, (5) r µ ˆ yw = wy i i / n, (5A) i= w i for resondents is a recirocal of an estimate of y i π ( ). If is groued into categories, and resondents in category c are weighted by the inverse of the estimated resonse rate r / n in category c, then the resulting estimator (5) or (5A) is identical to the c c regression estimate (3). Note that if the true resonse rate is the same for all the categories c, as when the data are MCAR, then weighting by the true resonse rate yields the unweighted samle mean y based on the comlete cases, which is less efficient if the categorized covariate is redictive of resonse. This is a simle and instructive illustration of increased efficiency when weights are estimated from the samle rather than from oulation arameters (e.g. Robins, Rotnitsky and Zhao, 994). 0 Hosted by The Berkeley Electronic Press

14 In section 4. we used slines to smooth the redictions from a regression model. A different use of smoothing is to smooth the weights in the weighted estimator (5). That is, the weights are relaced by the inverse of the estimated roensity to resond, comuted by fitting a sline to the logistic regression of the missing-data indicator M on. w = / π ( φˆ ), π ( φ) = Pr( M = y, φ), i i i i i ( π φ ) logit ( ) = s ( y ; φ), i M i (6) where sm( yi; φ ) is a sline for the binary outcome M i analogous to (5), with unknown arameters φ, and ˆ φ is an estimate of φ. The latter can be obtained by fitting a generalized linear mixed model for the sline regression of M on (Breslow and Clayton, 993). The utility of slines for rediction, as in Eq. (4), and for weighting, as in Eq. (6) is comared in the simulations in Section Calibration estimators weighting: The mean of can be written in a way that combines the features of rediction and µ ( M ) = E E + E E π( ) ( ( )) ( ( )) Estimating quantities in this exression leads to a calibration estimator of the form r n µ ˆ ˆ ˆ = n wi( yi yi) + n yi i= i=. (7) where the redictions y ˆi from the model are calibrated by adding a term consisting of weighted residuals from the model. The estimator (7) has roerties of semi-arametric efficiency and double-robustness (Robins, Rotnitsky and Zhao, 994; Robins and Rotnitsky, 00), in the sense that the estimate is consistent if just one of the models for rediction and weighting are correctly secified. However, since the calibration of the redictions is to correct effects of. htt://biostats.beress.com/umichbiostat/aer5

15 model missecification, we believe that the calibration of the redictions in Eq. (7) is unnecessary if the rediction model does not make strong arametric assumtions, as in (4). This conjecture is suorted by the simulation studies in Section Robust MAR Inferences with More than One Covariate With sufficient samle size, a enalized sline rovides a useful model for redictions based on a single covariate. With several covariates an additive model might be fitted with slines on the continuous covariates. In articular, Scharfstein and Izzary (003) consider a flexible class of estimators that includes (7) as a secial case where the roensity score model and mean model follow generalized additive regressions. We roose here a rediction model that addresses the curse of dimensionality by focusing the sline on a articular function of the covariates most sensitive to model missecification, namely the roensity score. Suose that is subject to missing values and,..., are fully observed covariates, and 3 so that there are at least covariates. We first define the logit of the roensity score for to be observed, given the covariates,..., : ( ) = logit Pr( M = 0,..., ). (8) The key roerty of the roensity score is that conditional on the roensity score and assuming MAR, missingness of does not deend on,..., (Rosenbaum and Rubin, 983). Thus the mean of can be written as: µ = E[( M ) ] + EM [ E ( )]. This motivates the following method: Hosted by The Berkeley Electronic Press

16 (a) Estimate by a logistic regression of M on (,..., ), yielding estimated roensity ˆ ; this regression can include nonlinear terms and interactions in (,..., ), and with sufficient data could be modeled by a sline as in (6). (b) Predict the missing values of by a sline regression of on ˆ. Since variables other than ˆ may be good redictors of, the other covariates are entered in the regression arametrically, for examle as linear additive terms. More secifically, we relace one of the redictor variables, say, by multicollinearity; we then redict the missing values of, to avoid using the following model for the distribution of,... given : (( ) Σ) ( + ) (,..., )~ N s ( ),..., s ( ), (,,...,, β)~ N s ( ) g (,,...,, β), σ, (9) where s ( ) = E ( ) is a sline for the regression of j on j j, = s ( ), j j j j =,...,, and g is a arametric function indexed by unknown arameters β, with the roerty that g (,0,...,0, β ) = 0 for all β. (0) Examles of functions g() that satisfy (0) include a linear additive model for (,..., ), g (,,...,, ) =, () β βj j j= and a model that includes first order interactions between and (,..., ): β = βj j + β j+ j j= j=. g (,,...,, ) 3 htt://biostats.beress.com/umichbiostat/aer5

17 We call Eq. (9) a roensity sline rediction model. The idea of exlicitly including the roensity score as a covariate in the rediction model was reviously roosed by David et al. (983) and in a more general context in Robins (999). The use of a sline on one regressor variable is an alication of u and Ruert s (00) artially linear single-index model in the missing-data setting. The following theorem defines a double robustness roerty of rediction estimates of the mean of based on the model (9). Theorem. Let µ ˆ be the rediction estimator () based on the model (9), and assume MAR. Then µ ˆ is a consistent estimator of µ if either (a) the mean of given (,,..., ) in the model (9) is correctly secified, or (b) the roensity is correctly secified, and (b) E ( ) = s ( ) for j=,...,, that is, the slines j j s ( ) correctly model the regressions of j j on for j =,...,. The robustness feature of condition (b) is that the regression function g does not have to be correctly secified. Outline roof of Theorem. Consistency under (a) follows from the usual roerties of rediction under a well-secified regression model. For consistency under (b) and (b), we need to show that n ( i ) yˆ /( n r) E ( M = ) as ( n r). () i= r+ Let ˆi y be a rediction for a nonresondent ( i = r+,..., n). Note that yˆ = s ( ˆy ) + g( yˆ, yˆ,..., yˆ ; β ), i i i i i 4 Hosted by The Berkeley Electronic Press

18 where yˆ ˆ ( ˆ ij = yij sj yi) and s ˆ j denotes the samle estimate of the sline s j. Since by assumtion the roensity model and the slines are correctly secified, yˆ y = s ( y ) + gy (, y,..., y ; β ), i i i i i i where β is the limiting value of ˆβ, and the estimates y ˆi and s ˆ j have been relaced by limiting values y i and s j. Now ( β ) ( β ) ( β ) E( y y ) = s ( y ) + E g( y, y,..., y ; ) y i i i i i i i s ( y ) + g y, E( y y ),... E( y y ); i i i i i i = s ( y ) + g y,0,...0; = s ( y ) = E( y y ). i i i i i Hence the conditional exectation of y ˆi given y i converges to E y y, which equals ( i i) E y y m = by the balancing roerty of roensity scores. Hence ( i i, i ) n i= r+ y ˆ /( ) i n r converges to E ( M = ), as required for consistency. The double robustness roerty in this theorem is more restrictive than the double robustness roerty for the calibration estimator (7), which requires only (a) or (b) and does not require (b) in addition to (b). On the other hand, (b) is weak, since the univariate regressions on are modeled nonarametrically in (9) with slines { s ( )}. The inclusion of g in (9) does not affect consistency even if it is missecified, and it has the otential of imroving efficiency, as for examle when the variables (,..., ) are redictive of but the roensity score is not. The roerties of roensity sline rediction are further exlored in the simulation study in the next section. j 5 htt://biostats.beress.com/umichbiostat/aer5

19 6. Simulations We conducted three simulation studies to examine the erformance of the estimators of the mean of with missing data under MAR. The first simulation study involved a single fullyobserved covariate, generated from a uniform distribution between and, and one variable with missing values, generated from a normal distribution with one of four mean structures: (I) constant : ( 6, ) N, (II) linear : N( 6 0, ) +, ( ), (III) cubic: N ( ) + +, and ( 6 5sin, ) (IV) sine : N ( π ) +. The exected value of is 6 for four mean structures, and (II) (IV) model a strong redictive relationshi between and. We created missing values from four different models for the roensity to resond: ( ) (I) constant (MCAR) : r( M ) (II) linear : logit r( M 0 ) logit = 0 = 0.5, ( ) = 3 (III) cubic : ( ) =, 3 ( r M ) logit = 0 = 3, and (IV) sine: ( ) ( r M ) ( π ) logit = 0 =.5sin. The resonse rate for all these roensity structures is 0.5, and (II) (IV) model a strong redictive relationshi between and M. The non-mcar simulations are thus focused on the fourth cell of Table, when a well-secified regression adjustment has strong gains. For each combination of mean and roensity structure, 500 simulated data sets with samle size n=00 6 Hosted by The Berkeley Electronic Press

20 were generated. Then, six estimators of the mean of are comared with the mean of before deletion (BD), namely: (CC) The comlete-case estimate, deleting the incomlete cases. (LP) The rediction estimator () based on linear regression. (SP) The rediction estimator () based on a enalized sline regression model (4). (LW) The weighted estimator (5) with weights comuted as the inverse of the resonse roensity estimated by linear logistic regression of the missing-data indicator on. (SW) The weighted estimator (5) with weights comuted as the inverse of the resonse roensity estimated by sline logistic regression of the missing-data indicator on, as in (6). (SPW) The calibration estimator (7) with redictions comuted as for SP and weights comuted as for SW. For the enalized slines methods, we chose 0 equally saced fixed knots over and a truncated linear basis. The results from this simulation study are summarized in Table 3 and Figure. For each combination of mean structure and resonse roensity structure and for each estimator, the standardized bias STDBIAS = 00 (bias/emirical standard error), is tabulated, where bias is the deviation of the average estimate over the 500 simulated data sets from the true arameter value, and the emirical standard error is standard deviation of the estimates over the 500 simulated data sets. Also the relative root mean squared error comared with the BD estimator RRMSE = 00 (RMSE(estimator) RMSE(BD))/RMSE(BD) is tabulated, where RMSE is the square root of the average squared deviation of the estimate from the true value over the 500 simulated data sets. The RRMSE values for methods other than 7 htt://biostats.beress.com/umichbiostat/aer5

21 CC are dislayed in Figure, with means and medians as summaries of overall erformance. We conclude the following from Table 3 and Figure : () In terms of median or mean RRMSE over roblems, SP and SPW are the best methods, SW, LW and LP are intermediate, and CC is much worse than the other methods. In articular, methods that include redictions based a sline (SP or SPW) do better than the method that uses weights based on a sline (SW). () The bias of SP is minor and comarable to that of SPW, and the biases and RRMSE s of SP and SPW are very similar. This suggests that calibration is not needed for SP, although it does not hurt the method. (3) For the constant mean model, and are indeendent; in this case the missing data mechanism is always MCAR since missingness deends on a variable that is indeendent of. None of the methods dislay bias in this situation, as theory would redict. For data from the constant roensity model, the RRMSE s of all the methods are very similar; for data from the other roensity models, CC analysis is best. For non-constant mean models and missing-data mechanisms other than MCAR, CC analysis has a large bias and is not cometitive with other methods. (4) When and are linearly related, LP is the best method, as redicted by theory, but SP is nearly as good, showing little loss in efficiency. LW and SW are noticeably inferior in this case. (5) When and are not linearly related and data are not MCAR, LP redictably suffers from bias from model missecification; SP does much better in these cases since it is not based on a linearity assumtion. (6) When the model for the roensity is not linear, there is some evidence that SW is better than LW, consistent with the fact that SW does not make a linearity assumtion for the logit of the roensity. However gains are less dramatic than for SP over LP. 8 Hosted by The Berkeley Electronic Press

22 The second simulation study involved two fully observed covariates and, and one variable 3 with missing values. We generated and as indeendent uniform deviates between and, and 3 from a normal distribution with one of four mean structures: (I) constant: ( 0, ) N, ( 0 3, ) (II) linear : N ( ) + +, ( , ) (III) additive : N ( ) ( ) + +, ( 0 4, ) (IV) non-additive : N ( ) The exected value of 3 for all these mean structures is 0. We simulated four resonse roensity structures, all of which yield an exected resonse rate of 0.5: ( ) (I) constant : r( M ) logit = 0, = 0.5, ( ) (II) linear : logit ( 0, ) r M = = +, 3 3 ( r M ) (III) additive : logit ( 0, ) = = +, ( ) logit r M = 0, = (IV) non-additive : ( ) For each combination of mean and resonse roensity structures, 500 simulated data sets of size n=00 were generated. Then, we comared the mean before deletion (BD) with the following eight estimators of the mean of 3 from the incomlete data: (CC) The comlete-case mean. (LP) Regression rediction from a linear regression of 3 on and. (ASP) Additive Sline Prediction, namely the rediction estimator () based on an additive regression model of 3 on enalized slines for and. 9 htt://biostats.beress.com/umichbiostat/aer5

23 (LW) The weighted estimator (5) with weights comuted as the inverse of the resonse roensity estimated by linear logistic regression of the missing-data indicator on and. (ASW) Additive Sline Weighting, namely the weighted estimator (5) with weights comuted as the inverse of the resonse roensity estimated by an additive sline logistic regression of the missing-data indicator on and. (ASPW) The calibration estimator (7) with redictions comuted as for ASP and weights comuted as for ASW. (SPP) Penalized sline roensity rediction based on a regression of 3 on the sline of the linear redictor of the estimated roensity to resond from a linear logistic regression of M on, ; that is, Eq. (9) with g = 0. (SPPL) Penalized sline roensity rediction based on Eq. (9) with a linear arametric term for. That is, Eq. (9) with g given by Eq. (). We chose 5 equally saced knots over and resectively and a truncated linear basis for the ASW and ASPW. We also chose 0 equally saced knots over the estimated resonse roensity and a linear truncated basis for the SPP and SPPL. Results in Table 4 and Figure can be summarized as follows: () The roensity sline rediction methods (SPP, SPPL) and the rediction methods based on additive slines (ASP, ASPW) do best overall, followed by the linear rediction methods LP and the weighting methods, LW and ASW; CC is much worse than the other methods since it is very biased excet for simulations with a constant mean model or an MCAR resonse roensity. () The methods based on additive slines (ASP, ASPW) do slightly better than roensity sline methods (SPP, SPPL) when the rediction model is additive, and considerably worse when the rediction model is non-additive. In that case the additive models are missecified, and 3, 0 Hosted by The Berkeley Electronic Press

24 the calibration estimator is also biased when the roensity model is not additive. In fact all the methods were biased for this rather demanding roblem, but the roensity sline methods have less bias than the others. (3) Little gain was seen from adding to the roensity sline, since SPP and SPPL erformed very similarly. Greater gains might be exected in roblems with more useful covariates. (4) Results for ASPW are very similar to those of ASP, suggesting that calibration of ASP has little effect for this articular set of roblems. (5) As in the first simulation, the weighting methods LW and ASW are less efficient than the rediction methods, and smoothing the weight by a sline does not aear to hel much. This simulation study does not dislay the otential for SPPL to yield gains in recision over SPP when the covariates other than the roensity are redictive of the outcome. We thus simulated two additional cases where the logit of resonse was linear in +, but the mean was more strongly correlated with. In the first case the mean of 3 is a nonlinear additive function of and, namely 3, ~ N( (3 3) 3 -(3/7)(3 3) 3, ). In the second case the mean of 3 is a nonlinear non-additive function of and, namely 3, ~ N( 0 ( ), ): The results of these simulations are shown in Table 5. Note that in these cases both SPP and SPPL dislay minimal bias, but SPPL shows the exected gain in recision, reflected in lower RRMSE s. As might be exected, the methods that fit additive slines, ASP and ASPW, have the lowest RRMSE s in the additive case, but are inferior to SPP and SPPL in the non-additive case. The results of any simulation study are limited by the choice of oulations simulated, and should be interreted with caution. Our conclusion from these simulations is that the redictions from sline models can yield relatively robust estimates of the oulation mean. htt://biostats.beress.com/umichbiostat/aer5

25 With several covariates, additive slines work well when the effects of the variables are additive, but the roensity sline method rovides an attractive alternative way of addressing the curse of dimensionality. 7. Extensions to Monotone and General Patterns The roensity sline model (9) of the revious section can be extended to a monotone attern by alying it sequentially to each block of missing variables. Missing values of covariates are relaced by their redictions in this sequence of regressions. Multile imutation versions of this aroach, where draws from the redictive distribution are imuted rather than means, and extensions to general atterns of missing data based on the sequential imutation methods of Raghunathan et al. (00), will be examined in future research. 8. Conclusions Desite the large literature devoted to nonignorable missing data adjustments, we believe that the key to successful treatment of missing data is to measure covariates that are redictive of the missing values, and to model carefully the relationshis between the missing variables and these covariates. Likelihood-based methods based on multivariate models for the data are useful tools for making efficient use of the available data, but standard models such as the multivariate normal imly linear additive relationshis between the variables that may be too simlistic in certain settings. We roose easily-fitted sline models that yield regression redictions that are more robust to nonlinearity in the relationshi between the missing variables and the covariates, under the MAR assumtion. A key idea is to single out the roensity score for this robust form of modeling. Hosted by The Berkeley Electronic Press

26 A limitation of the work on roensity sline methods described here is that it is focuses on oint estimation. Inferences for the roensity sline rediction model require valid estimates of standard errors, and ideally Student t corrections for small samles. Possible aroaches to estimating standard errors include: (a) comuting the estimate on a set of bootstra samles, and calculating a bootstra standard error from the samle variance over the bootstra samles, or from ercentiles of the bootstra distribution; (b) ignoring samling error in estimating the roensity π (,..., ), and using asymtotic standard errors for the model (9) based on standard linear mixed model formulae. (c) using the roensity sline rediction model to multily-imute draws from the redictive distribution of the missing values, and then using multile imutation methods for estimating the variance (e.g. Rubin, 987; Little and Rubin, 00, chater 0). Simulations comaring these aroaches are currently underway. Future work will also consider extensions to general atterns of missing data and non-normal outcomes, based on extensions of the sequential imutation method of Raghunathan et al. (00). Acknowledgments This research was suorted by National Science Foundation Grant DMS We greatly areciate the helful comments of Bin Nan, Trivellore Raghunathan, an associate editor and a referee. 3 htt://biostats.beress.com/umichbiostat/aer5

27 References Breslow, N.E. and Clayton, D.G. (993). Aroximate inference in generalized linear mixed models. Journal of the American Statistical Association, 88, 9-5. Cheng, P.E. (994). Nonarametric Estimation of Mean Functionals with Data Missing at Random. Journal of the American Statistical Association, 89, David, M., Little, R.J.A., Samuhel, M.E. and Triest, R.K. (983). Imutation models based on the roensity to resond. Proceedings of the Business and Economics Section, American Statistical Association 983, Eilers, P.H.C. and Marx B.D. (996). Flexible smoothing with B-slines and enalties (with discussion), Statistical Science, 89-. Heckman, J. (976). The common structure of statistical models of truncation, samle selection and limited deendent variables, and a simle estimator for such models, Annals of Economic and Social Measurement 5, Horowitz, J.L. and Manski, C.F. (000). Nonarametric analysis of randomized exeriments with missing covariate and outcome data, Journal of the American Statistical Association, 95, (with discussion). Little, R.J. and Rubin, D.B. (999). Comment on Adjusting for Non-Ignorable Dro-out Using Semiarametric Models by D. O. Scharfstein, A. Rotnitsky and J.M. Robins. Journal of the American Statistical Association, 94, Little R.J.A. and Rubin, D.B. (00). Statistical Analysis with Missing Data, Wiley, New ork. Little, R.J., and Wang, -X (996). Pattern-mixture models for multivariate incomlete data with covariates, Biometrics 5, Hosted by The Berkeley Electronic Press

28 Pinheiro, J.C. and Bates, D.M. (000). Mixed-Effects Models in S and S-PLUS, New ork: Sringer-Verlag. Raghunathan, T. E., Lekowski, J.M., VanHoewyk, J. and Solenberger, P. (00). A multivariate technique for multily imuting missing values using a sequence of regression models. Survey Methodology, 7, Robins, J.M. and Rotnitsky (00). Comment on Inference for semiarametric models: some questions and an answer by P. Bickel and J. Kwon. Statistica Sinica,, Robins, J.M., Rotnitsky, A. and Zhao, L.P. (994). Estimation of regression coefficients when some regressors are not always observed. Journal of the American Statistical Association, 89, Rosenbaum, P.R. and Rubin, D.B. (983). The central role of the roensity score in observational studies for causal effects, Biometrika 70, Rubin, D.B. (976). Inference and missing data, Biometrika 63, Rubin, D.B. (977). Formalizing subjective notions about the effect of nonresondents in samle surveys, Journal of the American Statistical Association 7, Ruert, D. and Carroll R.J. (000). Satially adative enalties for sline fitting, Australia and New Zealand Journal of Statistics 4, SAS (99). The Mixed Procedure, Chater 6 in SAS/STAT Software: Changes and Enhancements, Release 6.07, Technical Reort P-9, SAS Institute, Inc., Cary: NC. Scharfstein, D. and Irizarry, R. Generalized additive selection models for the analysis of nonignorable missing data. Biometrics, in ress. 5 htt://biostats.beress.com/umichbiostat/aer5

29 Scharfstein, D., Rotnitsky, A. and Robins, J. (999). Adjusting for nonignorable droout using semiarametric models, J. Am. Statist. Assoc. 94, (with discussion). u,. and Ruert, D. (00). Penalized sline estimation for artially linear single-index models, Journal of the American Statistical Association 97, Hosted by The Berkeley Electronic Press

30 Table 3. Simulation Study Comaring Estimators with a Single Covariate. Proensity Model Constant (MCAR)(I) Linear (II) Cubic (III) SINE (IV) Mean Model STDBIAS RRMSE STDBIAS RRMSE STDBIAS RRMSE STDBIAS RRMSE Constant (I) BD CC LP SP LW SW SPW BD Linear (II) CC LP SP LW SW SPW BD CC Cubic (III) LP SP LW SW SPW BD CC SINE (IV) LP SP LW SW SPW htt://biostats.beress.com/umichbiostat/aer5

31 Table 4. Simulation Study Comaring Estimators with Two Covariates. Proensity Model Constant (MCAR) (I) Linear (II) Additive (III) Non-Additive (IV) Mean Model STDBIAS RRMSE STDBIAS RRMSE STDBIAS RRMSE STDBIAS RRMSE Constant (I) Linear (II) Additive (III) Non- Additive (IV) BD CC LP ASP LW ASW SPP SPPL ASPW BD CC LP ASP LW ASW SPP SPPL ASPW BD CC LP ASP LW ASW SPP SPPL ASPW BD CC LP ASP LW ASW SPP SPPL ASPW Hosted by The Berkeley Electronic Press

32 Table 5. Sulemental Simulations where Covariates Other than the Proensity Score are Predictive of Outcome. Additive Mean Non-Additive Mean METHOD STDBIAS RRMSE STDBIAS RRMSE BD CC LP ASP LW ASW SPP SPPL -3 ASPW htt://biostats.beress.com/umichbiostat/aer5

33 Figure. Histograms of RRMMSE for Methods other than CC Analysis, for Simulation with a Single Covariate SW SPW mean = 34.6 mean = median = 9.0 median = Percent of Total 60 LP mean = 47.4 SP mean =.7 0 LW mean = 4.0 median = 35.0 median = 0.0 median = Histograms of RRMSE for Simulation 30 Hosted by The Berkeley Electronic Press

34 Figure. Histograms of RRMMSE for Methods other than CC Analysis, for Simulation with Two Covariates SPP SPPL ASPW mean = 8. mean = 7.9 mean = median = 8.5 median = 9.5 median = Percent of Total 60 LP mean = 43.0 ASP mean = 43.0 LW mean = ASW mean = 54.8 median = 6.5 median = 5.5 median = 34.5 median = Histograms of RRMSE for Simulation 3 htt://biostats.beress.com/umichbiostat/aer5

Extensions of the Penalized Spline Propensity Prediction Method of Imputation

Extensions of the Penalized Spline Propensity Prediction Method of Imputation Extensions of the Penalized Sline Proensity Prediction Method of Imutation by Guangyu Zhang A dissertation submitted in artial fulfillment of the requirements for the degree of Doctor of Philosohy (Biostatistics)

More information

A Comparison between Biased and Unbiased Estimators in Ordinary Least Squares Regression

A Comparison between Biased and Unbiased Estimators in Ordinary Least Squares Regression Journal of Modern Alied Statistical Methods Volume Issue Article 7 --03 A Comarison between Biased and Unbiased Estimators in Ordinary Least Squares Regression Ghadban Khalaf King Khalid University, Saudi

More information

4. Score normalization technical details We now discuss the technical details of the score normalization method.

4. Score normalization technical details We now discuss the technical details of the score normalization method. SMT SCORING SYSTEM This document describes the scoring system for the Stanford Math Tournament We begin by giving an overview of the changes to scoring and a non-technical descrition of the scoring rules

More information

Hotelling s Two- Sample T 2

Hotelling s Two- Sample T 2 Chater 600 Hotelling s Two- Samle T Introduction This module calculates ower for the Hotelling s two-grou, T-squared (T) test statistic. Hotelling s T is an extension of the univariate two-samle t-test

More information

Estimation of the large covariance matrix with two-step monotone missing data

Estimation of the large covariance matrix with two-step monotone missing data Estimation of the large covariance matrix with two-ste monotone missing data Masashi Hyodo, Nobumichi Shutoh 2, Takashi Seo, and Tatjana Pavlenko 3 Deartment of Mathematical Information Science, Tokyo

More information

Combining Logistic Regression with Kriging for Mapping the Risk of Occurrence of Unexploded Ordnance (UXO)

Combining Logistic Regression with Kriging for Mapping the Risk of Occurrence of Unexploded Ordnance (UXO) Combining Logistic Regression with Kriging for Maing the Risk of Occurrence of Unexloded Ordnance (UXO) H. Saito (), P. Goovaerts (), S. A. McKenna (2) Environmental and Water Resources Engineering, Deartment

More information

AI*IA 2003 Fusion of Multiple Pattern Classifiers PART III

AI*IA 2003 Fusion of Multiple Pattern Classifiers PART III AI*IA 23 Fusion of Multile Pattern Classifiers PART III AI*IA 23 Tutorial on Fusion of Multile Pattern Classifiers by F. Roli 49 Methods for fusing multile classifiers Methods for fusing multile classifiers

More information

General Linear Model Introduction, Classes of Linear models and Estimation

General Linear Model Introduction, Classes of Linear models and Estimation Stat 740 General Linear Model Introduction, Classes of Linear models and Estimation An aim of scientific enquiry: To describe or to discover relationshis among events (variables) in the controlled (laboratory)

More information

Use of Transformations and the Repeated Statement in PROC GLM in SAS Ed Stanek

Use of Transformations and the Repeated Statement in PROC GLM in SAS Ed Stanek Use of Transformations and the Reeated Statement in PROC GLM in SAS Ed Stanek Introduction We describe how the Reeated Statement in PROC GLM in SAS transforms the data to rovide tests of hyotheses of interest.

More information

Using the Divergence Information Criterion for the Determination of the Order of an Autoregressive Process

Using the Divergence Information Criterion for the Determination of the Order of an Autoregressive Process Using the Divergence Information Criterion for the Determination of the Order of an Autoregressive Process P. Mantalos a1, K. Mattheou b, A. Karagrigoriou b a.deartment of Statistics University of Lund

More information

Tests for Two Proportions in a Stratified Design (Cochran/Mantel-Haenszel Test)

Tests for Two Proportions in a Stratified Design (Cochran/Mantel-Haenszel Test) Chater 225 Tests for Two Proortions in a Stratified Design (Cochran/Mantel-Haenszel Test) Introduction In a stratified design, the subects are selected from two or more strata which are formed from imortant

More information

Research Note REGRESSION ANALYSIS IN MARKOV CHAIN * A. Y. ALAMUTI AND M. R. MESHKANI **

Research Note REGRESSION ANALYSIS IN MARKOV CHAIN * A. Y. ALAMUTI AND M. R. MESHKANI ** Iranian Journal of Science & Technology, Transaction A, Vol 3, No A3 Printed in The Islamic Reublic of Iran, 26 Shiraz University Research Note REGRESSION ANALYSIS IN MARKOV HAIN * A Y ALAMUTI AND M R

More information

Estimating function analysis for a class of Tweedie regression models

Estimating function analysis for a class of Tweedie regression models Title Estimating function analysis for a class of Tweedie regression models Author Wagner Hugo Bonat Deartamento de Estatística - DEST, Laboratório de Estatística e Geoinformação - LEG, Universidade Federal

More information

SAS for Bayesian Mediation Analysis

SAS for Bayesian Mediation Analysis Paer 1569-2014 SAS for Bayesian Mediation Analysis Miočević Milica, Arizona State University; David P. MacKinnon, Arizona State University ABSTRACT Recent statistical mediation analysis research focuses

More information

Morten Frydenberg Section for Biostatistics Version :Friday, 05 September 2014

Morten Frydenberg Section for Biostatistics Version :Friday, 05 September 2014 Morten Frydenberg Section for Biostatistics Version :Friday, 05 Setember 204 All models are aroximations! The best model does not exist! Comlicated models needs a lot of data. lower your ambitions or get

More information

On split sample and randomized confidence intervals for binomial proportions

On split sample and randomized confidence intervals for binomial proportions On slit samle and randomized confidence intervals for binomial roortions Måns Thulin Deartment of Mathematics, Usala University arxiv:1402.6536v1 [stat.me] 26 Feb 2014 Abstract Slit samle methods have

More information

Scaling Multiple Point Statistics for Non-Stationary Geostatistical Modeling

Scaling Multiple Point Statistics for Non-Stationary Geostatistical Modeling Scaling Multile Point Statistics or Non-Stationary Geostatistical Modeling Julián M. Ortiz, Steven Lyster and Clayton V. Deutsch Centre or Comutational Geostatistics Deartment o Civil & Environmental Engineering

More information

A New Asymmetric Interaction Ridge (AIR) Regression Method

A New Asymmetric Interaction Ridge (AIR) Regression Method A New Asymmetric Interaction Ridge (AIR) Regression Method by Kristofer Månsson, Ghazi Shukur, and Pär Sölander The Swedish Retail Institute, HUI Research, Stockholm, Sweden. Deartment of Economics and

More information

Chapter 3. GMM: Selected Topics

Chapter 3. GMM: Selected Topics Chater 3. GMM: Selected oics Contents Otimal Instruments. he issue of interest..............................2 Otimal Instruments under the i:i:d: assumtion..............2. he basic result............................2.2

More information

Bayesian Spatially Varying Coefficient Models in the Presence of Collinearity

Bayesian Spatially Varying Coefficient Models in the Presence of Collinearity Bayesian Satially Varying Coefficient Models in the Presence of Collinearity David C. Wheeler 1, Catherine A. Calder 1 he Ohio State University 1 Abstract he belief that relationshis between exlanatory

More information

Finite Mixture EFA in Mplus

Finite Mixture EFA in Mplus Finite Mixture EFA in Mlus November 16, 2007 In this document we describe the Mixture EFA model estimated in Mlus. Four tyes of deendent variables are ossible in this model: normally distributed, ordered

More information

A Bound on the Error of Cross Validation Using the Approximation and Estimation Rates, with Consequences for the Training-Test Split

A Bound on the Error of Cross Validation Using the Approximation and Estimation Rates, with Consequences for the Training-Test Split A Bound on the Error of Cross Validation Using the Aroximation and Estimation Rates, with Consequences for the Training-Test Slit Michael Kearns AT&T Bell Laboratories Murray Hill, NJ 7974 mkearns@research.att.com

More information

Partial Identification in Triangular Systems of Equations with Binary Dependent Variables

Partial Identification in Triangular Systems of Equations with Binary Dependent Variables Partial Identification in Triangular Systems of Equations with Binary Deendent Variables Azeem M. Shaikh Deartment of Economics University of Chicago amshaikh@uchicago.edu Edward J. Vytlacil Deartment

More information

Estimation of Separable Representations in Psychophysical Experiments

Estimation of Separable Representations in Psychophysical Experiments Estimation of Searable Reresentations in Psychohysical Exeriments Michele Bernasconi (mbernasconi@eco.uninsubria.it) Christine Choirat (cchoirat@eco.uninsubria.it) Raffaello Seri (rseri@eco.uninsubria.it)

More information

LOGISTIC REGRESSION. VINAYANAND KANDALA M.Sc. (Agricultural Statistics), Roll No I.A.S.R.I, Library Avenue, New Delhi

LOGISTIC REGRESSION. VINAYANAND KANDALA M.Sc. (Agricultural Statistics), Roll No I.A.S.R.I, Library Avenue, New Delhi LOGISTIC REGRESSION VINAANAND KANDALA M.Sc. (Agricultural Statistics), Roll No. 444 I.A.S.R.I, Library Avenue, New Delhi- Chairerson: Dr. Ranjana Agarwal Abstract: Logistic regression is widely used when

More information

Plotting the Wilson distribution

Plotting the Wilson distribution , Survey of English Usage, University College London Setember 018 1 1. Introduction We have discussed the Wilson score interval at length elsewhere (Wallis 013a, b). Given an observed Binomial roortion

More information

Towards understanding the Lorenz curve using the Uniform distribution. Chris J. Stephens. Newcastle City Council, Newcastle upon Tyne, UK

Towards understanding the Lorenz curve using the Uniform distribution. Chris J. Stephens. Newcastle City Council, Newcastle upon Tyne, UK Towards understanding the Lorenz curve using the Uniform distribution Chris J. Stehens Newcastle City Council, Newcastle uon Tyne, UK (For the Gini-Lorenz Conference, University of Siena, Italy, May 2005)

More information

Semiparametric Efficiency in GMM Models with Nonclassical Measurement Error

Semiparametric Efficiency in GMM Models with Nonclassical Measurement Error Semiarametric Efficiency in GMM Models with Nonclassical Measurement Error Xiaohong Chen New York University Han Hong Duke University Alessandro Tarozzi Duke University August 2005 Abstract We study semiarametric

More information

arxiv: v1 [physics.data-an] 26 Oct 2012

arxiv: v1 [physics.data-an] 26 Oct 2012 Constraints on Yield Parameters in Extended Maximum Likelihood Fits Till Moritz Karbach a, Maximilian Schlu b a TU Dortmund, Germany, moritz.karbach@cern.ch b TU Dortmund, Germany, maximilian.schlu@cern.ch

More information

arxiv: v2 [stat.me] 3 Nov 2014

arxiv: v2 [stat.me] 3 Nov 2014 onarametric Stein-tye Shrinkage Covariance Matrix Estimators in High-Dimensional Settings Anestis Touloumis Cancer Research UK Cambridge Institute University of Cambridge Cambridge CB2 0RE, U.K. Anestis.Touloumis@cruk.cam.ac.uk

More information

An Improved Generalized Estimation Procedure of Current Population Mean in Two-Occasion Successive Sampling

An Improved Generalized Estimation Procedure of Current Population Mean in Two-Occasion Successive Sampling Journal of Modern Alied Statistical Methods Volume 15 Issue Article 14 11-1-016 An Imroved Generalized Estimation Procedure of Current Poulation Mean in Two-Occasion Successive Samling G. N. Singh Indian

More information

ESTIMATION OF THE RECIPROCAL OF THE MEAN OF THE INVERSE GAUSSIAN DISTRIBUTION WITH PRIOR INFORMATION

ESTIMATION OF THE RECIPROCAL OF THE MEAN OF THE INVERSE GAUSSIAN DISTRIBUTION WITH PRIOR INFORMATION STATISTICA, anno LXVIII, n., 008 ESTIMATION OF THE RECIPROCAL OF THE MEAN OF THE INVERSE GAUSSIAN DISTRIBUTION WITH PRIOR INFORMATION 1. INTRODUCTION The Inverse Gaussian distribution was first introduced

More information

Information collection on a graph

Information collection on a graph Information collection on a grah Ilya O. Ryzhov Warren Powell February 10, 2010 Abstract We derive a knowledge gradient olicy for an otimal learning roblem on a grah, in which we use sequential measurements

More information

Unobservable Selection and Coefficient Stability: Theory and Evidence

Unobservable Selection and Coefficient Stability: Theory and Evidence Unobservable Selection and Coefficient Stability: Theory and Evidence Emily Oster Brown University and NBER August 9, 016 Abstract A common aroach to evaluating robustness to omitted variable bias is to

More information

Statistics II Logistic Regression. So far... Two-way repeated measures ANOVA: an example. RM-ANOVA example: the data after log transform

Statistics II Logistic Regression. So far... Two-way repeated measures ANOVA: an example. RM-ANOVA example: the data after log transform Statistics II Logistic Regression Çağrı Çöltekin Exam date & time: June 21, 10:00 13:00 (The same day/time lanned at the beginning of the semester) University of Groningen, Det of Information Science May

More information

Modeling and Estimation of Full-Chip Leakage Current Considering Within-Die Correlation

Modeling and Estimation of Full-Chip Leakage Current Considering Within-Die Correlation 6.3 Modeling and Estimation of Full-Chi Leaage Current Considering Within-Die Correlation Khaled R. eloue, Navid Azizi, Farid N. Najm Deartment of ECE, University of Toronto,Toronto, Ontario, Canada {haled,nazizi,najm}@eecg.utoronto.ca

More information

arxiv: v3 [physics.data-an] 23 May 2011

arxiv: v3 [physics.data-an] 23 May 2011 Date: October, 8 arxiv:.7v [hysics.data-an] May -values for Model Evaluation F. Beaujean, A. Caldwell, D. Kollár, K. Kröninger Max-Planck-Institut für Physik, München, Germany CERN, Geneva, Switzerland

More information

Approximating min-max k-clustering

Approximating min-max k-clustering Aroximating min-max k-clustering Asaf Levin July 24, 2007 Abstract We consider the roblems of set artitioning into k clusters with minimum total cost and minimum of the maximum cost of a cluster. The cost

More information

Characteristics of Beam-Based Flexure Modules

Characteristics of Beam-Based Flexure Modules Shorya Awtar e-mail: shorya@mit.edu Alexander H. Slocum e-mail: slocum@mit.edu Precision Engineering Research Grou, Massachusetts Institute of Technology, Cambridge, MA 039 Edi Sevincer Omega Advanced

More information

Information collection on a graph

Information collection on a graph Information collection on a grah Ilya O. Ryzhov Warren Powell October 25, 2009 Abstract We derive a knowledge gradient olicy for an otimal learning roblem on a grah, in which we use sequential measurements

More information

State Estimation with ARMarkov Models

State Estimation with ARMarkov Models Deartment of Mechanical and Aerosace Engineering Technical Reort No. 3046, October 1998. Princeton University, Princeton, NJ. State Estimation with ARMarkov Models Ryoung K. Lim 1 Columbia University,

More information

Some methods for handling missing values in outcome variables. Roderick J. Little

Some methods for handling missing values in outcome variables. Roderick J. Little Some methods for handling missing values in outcome variables Roderick J. Little Missing data principles Likelihood methods Outline ML, Bayes, Multiple Imputation (MI) Robust MAR methods Predictive mean

More information

Chapter 13 Variable Selection and Model Building

Chapter 13 Variable Selection and Model Building Chater 3 Variable Selection and Model Building The comlete regsion analysis deends on the exlanatory variables ent in the model. It is understood in the regsion analysis that only correct and imortant

More information

System Reliability Estimation and Confidence Regions from Subsystem and Full System Tests

System Reliability Estimation and Confidence Regions from Subsystem and Full System Tests 009 American Control Conference Hyatt Regency Riverfront, St. Louis, MO, USA June 0-, 009 FrB4. System Reliability Estimation and Confidence Regions from Subsystem and Full System Tests James C. Sall Abstract

More information

On-Line Appendix. Matching on the Estimated Propensity Score (Abadie and Imbens, 2015)

On-Line Appendix. Matching on the Estimated Propensity Score (Abadie and Imbens, 2015) On-Line Aendix Matching on the Estimated Proensity Score Abadie and Imbens, 205 Alberto Abadie and Guido W. Imbens Current version: August 0, 205 The first art of this aendix contains additional roofs.

More information

STK4900/ Lecture 7. Program

STK4900/ Lecture 7. Program STK4900/9900 - Lecture 7 Program 1. Logistic regression with one redictor 2. Maximum likelihood estimation 3. Logistic regression with several redictors 4. Deviance and likelihood ratio tests 5. A comment

More information

Probability Estimates for Multi-class Classification by Pairwise Coupling

Probability Estimates for Multi-class Classification by Pairwise Coupling Probability Estimates for Multi-class Classification by Pairwise Couling Ting-Fan Wu Chih-Jen Lin Deartment of Comuter Science National Taiwan University Taiei 06, Taiwan Ruby C. Weng Deartment of Statistics

More information

Elements of Asymptotic Theory. James L. Powell Department of Economics University of California, Berkeley

Elements of Asymptotic Theory. James L. Powell Department of Economics University of California, Berkeley Elements of Asymtotic Theory James L. Powell Deartment of Economics University of California, Berkeley Objectives of Asymtotic Theory While exact results are available for, say, the distribution of the

More information

One-way ANOVA Inference for one-way ANOVA

One-way ANOVA Inference for one-way ANOVA One-way ANOVA Inference for one-way ANOVA IPS Chater 12.1 2009 W.H. Freeman and Comany Objectives (IPS Chater 12.1) Inference for one-way ANOVA Comaring means The two-samle t statistic An overview of ANOVA

More information

substantial literature on emirical likelihood indicating that it is widely viewed as a desirable and natural aroach to statistical inference in a vari

substantial literature on emirical likelihood indicating that it is widely viewed as a desirable and natural aroach to statistical inference in a vari Condence tubes for multile quantile lots via emirical likelihood John H.J. Einmahl Eindhoven University of Technology Ian W. McKeague Florida State University May 7, 998 Abstract The nonarametric emirical

More information

arxiv:cond-mat/ v2 25 Sep 2002

arxiv:cond-mat/ v2 25 Sep 2002 Energy fluctuations at the multicritical oint in two-dimensional sin glasses arxiv:cond-mat/0207694 v2 25 Se 2002 1. Introduction Hidetoshi Nishimori, Cyril Falvo and Yukiyasu Ozeki Deartment of Physics,

More information

Adaptive estimation with change detection for streaming data

Adaptive estimation with change detection for streaming data Adative estimation with change detection for streaming data A thesis resented for the degree of Doctor of Philosohy of the University of London and the Diloma of Imerial College by Dean Adam Bodenham Deartment

More information

Covariance Matrix Estimation for Reinforcement Learning

Covariance Matrix Estimation for Reinforcement Learning Covariance Matrix Estimation for Reinforcement Learning Tomer Lancewicki Deartment of Electrical Engineering and Comuter Science University of Tennessee Knoxville, TN 37996 tlancewi@utk.edu Itamar Arel

More information

VIBRATION ANALYSIS OF BEAMS WITH MULTIPLE CONSTRAINED LAYER DAMPING PATCHES

VIBRATION ANALYSIS OF BEAMS WITH MULTIPLE CONSTRAINED LAYER DAMPING PATCHES Journal of Sound and Vibration (998) 22(5), 78 85 VIBRATION ANALYSIS OF BEAMS WITH MULTIPLE CONSTRAINED LAYER DAMPING PATCHES Acoustics and Dynamics Laboratory, Deartment of Mechanical Engineering, The

More information

Biostat Methods STAT 5500/6500 Handout #12: Methods and Issues in (Binary Response) Logistic Regression

Biostat Methods STAT 5500/6500 Handout #12: Methods and Issues in (Binary Response) Logistic Regression Biostat Methods STAT 5500/6500 Handout #12: Methods and Issues in (Binary Resonse) Logistic Regression Recall general χ 2 test setu: Y 0 1 Trt 0 a b Trt 1 c d I. Basic logistic regression Previously (Handout

More information

Deriving Indicator Direct and Cross Variograms from a Normal Scores Variogram Model (bigaus-full) David F. Machuca Mory and Clayton V.

Deriving Indicator Direct and Cross Variograms from a Normal Scores Variogram Model (bigaus-full) David F. Machuca Mory and Clayton V. Deriving ndicator Direct and Cross Variograms from a Normal Scores Variogram Model (bigaus-full) David F. Machuca Mory and Clayton V. Deutsch Centre for Comutational Geostatistics Deartment of Civil &

More information

MATHEMATICAL MODELLING OF THE WIRELESS COMMUNICATION NETWORK

MATHEMATICAL MODELLING OF THE WIRELESS COMMUNICATION NETWORK Comuter Modelling and ew Technologies, 5, Vol.9, o., 3-39 Transort and Telecommunication Institute, Lomonosov, LV-9, Riga, Latvia MATHEMATICAL MODELLIG OF THE WIRELESS COMMUICATIO ETWORK M. KOPEETSK Deartment

More information

CHAPTER-II Control Charts for Fraction Nonconforming using m-of-m Runs Rules

CHAPTER-II Control Charts for Fraction Nonconforming using m-of-m Runs Rules CHAPTER-II Control Charts for Fraction Nonconforming using m-of-m Runs Rules. Introduction: The is widely used in industry to monitor the number of fraction nonconforming units. A nonconforming unit is

More information

On the Fluid Dependence of Rock Compressibility: Biot-Gassmann Refined

On the Fluid Dependence of Rock Compressibility: Biot-Gassmann Refined Downloaded 0/9/3 to 99.86.4.8. Redistribution subject to SEG license or coyright; see Terms of Use at htt://library.seg.org/ On the luid Deendence of Rock Comressibility: Biot-Gassmann Refined Leon Thomsen,

More information

Solved Problems. (a) (b) (c) Figure P4.1 Simple Classification Problems First we draw a line between each set of dark and light data points.

Solved Problems. (a) (b) (c) Figure P4.1 Simple Classification Problems First we draw a line between each set of dark and light data points. Solved Problems Solved Problems P Solve the three simle classification roblems shown in Figure P by drawing a decision boundary Find weight and bias values that result in single-neuron ercetrons with the

More information

Adaptive Estimation of the Regression Discontinuity Model

Adaptive Estimation of the Regression Discontinuity Model Adative Estimation of the Regression Discontinuity Model Yixiao Sun Deartment of Economics Univeristy of California, San Diego La Jolla, CA 9293-58 Feburary 25 Email: yisun@ucsd.edu; Tel: 858-534-4692

More information

A SIMPLE PLASTICITY MODEL FOR PREDICTING TRANSVERSE COMPOSITE RESPONSE AND FAILURE

A SIMPLE PLASTICITY MODEL FOR PREDICTING TRANSVERSE COMPOSITE RESPONSE AND FAILURE THE 19 TH INTERNATIONAL CONFERENCE ON COMPOSITE MATERIALS A SIMPLE PLASTICITY MODEL FOR PREDICTING TRANSVERSE COMPOSITE RESPONSE AND FAILURE K.W. Gan*, M.R. Wisnom, S.R. Hallett, G. Allegri Advanced Comosites

More information

Notes on Instrumental Variables Methods

Notes on Instrumental Variables Methods Notes on Instrumental Variables Methods Michele Pellizzari IGIER-Bocconi, IZA and frdb 1 The Instrumental Variable Estimator Instrumental variable estimation is the classical solution to the roblem of

More information

Shadow Computing: An Energy-Aware Fault Tolerant Computing Model

Shadow Computing: An Energy-Aware Fault Tolerant Computing Model Shadow Comuting: An Energy-Aware Fault Tolerant Comuting Model Bryan Mills, Taieb Znati, Rami Melhem Deartment of Comuter Science University of Pittsburgh (bmills, znati, melhem)@cs.itt.edu Index Terms

More information

Asymptotically Optimal Simulation Allocation under Dependent Sampling

Asymptotically Optimal Simulation Allocation under Dependent Sampling Asymtotically Otimal Simulation Allocation under Deendent Samling Xiaoing Xiong The Robert H. Smith School of Business, University of Maryland, College Park, MD 20742-1815, USA, xiaoingx@yahoo.com Sandee

More information

CONVOLVED SUBSAMPLING ESTIMATION WITH APPLICATIONS TO BLOCK BOOTSTRAP

CONVOLVED SUBSAMPLING ESTIMATION WITH APPLICATIONS TO BLOCK BOOTSTRAP Submitted to the Annals of Statistics arxiv: arxiv:1706.07237 CONVOLVED SUBSAMPLING ESTIMATION WITH APPLICATIONS TO BLOCK BOOTSTRAP By Johannes Tewes, Dimitris N. Politis and Daniel J. Nordman Ruhr-Universität

More information

Downloaded from jhs.mazums.ac.ir at 9: on Monday September 17th 2018 [ DOI: /acadpub.jhs ]

Downloaded from jhs.mazums.ac.ir at 9: on Monday September 17th 2018 [ DOI: /acadpub.jhs ] Iranian journal of health sciences 013; 1(): 56-60 htt://jhs.mazums.ac.ir Original Article Comaring Two Formulas of Samle Size Determination for Prevalence Studies Hamed Tabesh 1 *Azadeh Saki Fatemeh Pourmotahari

More information

#A64 INTEGERS 18 (2018) APPLYING MODULAR ARITHMETIC TO DIOPHANTINE EQUATIONS

#A64 INTEGERS 18 (2018) APPLYING MODULAR ARITHMETIC TO DIOPHANTINE EQUATIONS #A64 INTEGERS 18 (2018) APPLYING MODULAR ARITHMETIC TO DIOPHANTINE EQUATIONS Ramy F. Taki ElDin Physics and Engineering Mathematics Deartment, Faculty of Engineering, Ain Shams University, Cairo, Egyt

More information

A Qualitative Event-based Approach to Multiple Fault Diagnosis in Continuous Systems using Structural Model Decomposition

A Qualitative Event-based Approach to Multiple Fault Diagnosis in Continuous Systems using Structural Model Decomposition A Qualitative Event-based Aroach to Multile Fault Diagnosis in Continuous Systems using Structural Model Decomosition Matthew J. Daigle a,,, Anibal Bregon b,, Xenofon Koutsoukos c, Gautam Biswas c, Belarmino

More information

Ratio Estimators in Simple Random Sampling Using Information on Auxiliary Attribute

Ratio Estimators in Simple Random Sampling Using Information on Auxiliary Attribute ajesh Singh, ankaj Chauhan, Nirmala Sawan School of Statistics, DAVV, Indore (M.., India Florentin Smarandache Universit of New Mexico, USA atio Estimators in Simle andom Samling Using Information on Auxiliar

More information

MODELING THE RELIABILITY OF C4ISR SYSTEMS HARDWARE/SOFTWARE COMPONENTS USING AN IMPROVED MARKOV MODEL

MODELING THE RELIABILITY OF C4ISR SYSTEMS HARDWARE/SOFTWARE COMPONENTS USING AN IMPROVED MARKOV MODEL Technical Sciences and Alied Mathematics MODELING THE RELIABILITY OF CISR SYSTEMS HARDWARE/SOFTWARE COMPONENTS USING AN IMPROVED MARKOV MODEL Cezar VASILESCU Regional Deartment of Defense Resources Management

More information

Paper C Exact Volume Balance Versus Exact Mass Balance in Compositional Reservoir Simulation

Paper C Exact Volume Balance Versus Exact Mass Balance in Compositional Reservoir Simulation Paer C Exact Volume Balance Versus Exact Mass Balance in Comositional Reservoir Simulation Submitted to Comutational Geosciences, December 2005. Exact Volume Balance Versus Exact Mass Balance in Comositional

More information

John Weatherwax. Analysis of Parallel Depth First Search Algorithms

John Weatherwax. Analysis of Parallel Depth First Search Algorithms Sulementary Discussions and Solutions to Selected Problems in: Introduction to Parallel Comuting by Viin Kumar, Ananth Grama, Anshul Guta, & George Karyis John Weatherwax Chater 8 Analysis of Parallel

More information

Robustness of multiple comparisons against variance heterogeneity Dijkstra, J.B.

Robustness of multiple comparisons against variance heterogeneity Dijkstra, J.B. Robustness of multile comarisons against variance heterogeneity Dijkstra, J.B. Published: 01/01/1983 Document Version Publisher s PDF, also known as Version of Record (includes final age, issue and volume

More information

On the Relationship Between Packet Size and Router Performance for Heavy-Tailed Traffic 1

On the Relationship Between Packet Size and Router Performance for Heavy-Tailed Traffic 1 On the Relationshi Between Packet Size and Router Performance for Heavy-Tailed Traffic 1 Imad Antonios antoniosi1@southernct.edu CS Deartment MO117 Southern Connecticut State University 501 Crescent St.

More information

DEPARTMENT OF ECONOMICS ISSN DISCUSSION PAPER 20/07 TWO NEW EXPONENTIAL FAMILIES OF LORENZ CURVES

DEPARTMENT OF ECONOMICS ISSN DISCUSSION PAPER 20/07 TWO NEW EXPONENTIAL FAMILIES OF LORENZ CURVES DEPARTMENT OF ECONOMICS ISSN 1441-549 DISCUSSION PAPER /7 TWO NEW EXPONENTIAL FAMILIES OF LORENZ CURVES ZuXiang Wang * & Russell Smyth ABSTRACT We resent two new Lorenz curve families by using the basic

More information

The following document is intended for online publication only (authors webpage).

The following document is intended for online publication only (authors webpage). The following document is intended for online ublication only (authors webage). Sulement to Identi cation and stimation of Distributional Imacts of Interventions Using Changes in Inequality Measures, Part

More information

Bayesian nonparametric approaches for ROC curve inference

Bayesian nonparametric approaches for ROC curve inference Bayesian nonarametric aroaches for ROC curve inference Vanda Inácio de Carvalho, Alejandro Jara, and Miguel de Carvalho Abstract The develoment of medical diagnostic tests is of great imortance in clinical

More information

Yixi Shi. Jose Blanchet. IEOR Department Columbia University New York, NY 10027, USA. IEOR Department Columbia University New York, NY 10027, USA

Yixi Shi. Jose Blanchet. IEOR Department Columbia University New York, NY 10027, USA. IEOR Department Columbia University New York, NY 10027, USA Proceedings of the 2011 Winter Simulation Conference S. Jain, R. R. Creasey, J. Himmelsach, K. P. White, and M. Fu, eds. EFFICIENT RARE EVENT SIMULATION FOR HEAVY-TAILED SYSTEMS VIA CROSS ENTROPY Jose

More information

Hidden Predictors: A Factor Analysis Primer

Hidden Predictors: A Factor Analysis Primer Hidden Predictors: A Factor Analysis Primer Ryan C Sanchez Western Washington University Factor Analysis is a owerful statistical method in the modern research sychologist s toolbag When used roerly, factor

More information

Casimir Force Between the Two Moving Conductive Plates.

Casimir Force Between the Two Moving Conductive Plates. Casimir Force Between the Two Moving Conductive Plates. Jaroslav Hynecek 1 Isetex, Inc., 95 Pama Drive, Allen, TX 751 ABSTRACT This article resents the derivation of the Casimir force for the two moving

More information

Developing A Deterioration Probabilistic Model for Rail Wear

Developing A Deterioration Probabilistic Model for Rail Wear International Journal of Traffic and Transortation Engineering 2012, 1(2): 13-18 DOI: 10.5923/j.ijtte.20120102.02 Develoing A Deterioration Probabilistic Model for Rail Wear Jabbar-Ali Zakeri *, Shahrbanoo

More information

On the Maximum and Minimum of Multivariate Pareto Random Variables

On the Maximum and Minimum of Multivariate Pareto Random Variables ISSN 1392 124X (rint) ISSN 2335 884X (online) INFORMATION TECHNOLOGY AND CONTROL 213 Vol.42 No.3 On the Maximum Minimum of Multivariate Pareto Rom Variables Jungsoo Woo Deartment of Statistics Yeungnam

More information

Chapter 10. Supplemental Text Material

Chapter 10. Supplemental Text Material Chater 1. Sulemental Tet Material S1-1. The Covariance Matri of the Regression Coefficients In Section 1-3 of the tetbook, we show that the least squares estimator of β in the linear regression model y=

More information

Nonparametric estimation of Exact consumer surplus with endogeneity in price

Nonparametric estimation of Exact consumer surplus with endogeneity in price Nonarametric estimation of Exact consumer surlus with endogeneity in rice Anne Vanhems February 7, 2009 Abstract This aer deals with nonarametric estimation of variation of exact consumer surlus with endogenous

More information

A MIXED CONTROL CHART ADAPTED TO THE TRUNCATED LIFE TEST BASED ON THE WEIBULL DISTRIBUTION

A MIXED CONTROL CHART ADAPTED TO THE TRUNCATED LIFE TEST BASED ON THE WEIBULL DISTRIBUTION O P E R A T I O N S R E S E A R C H A N D D E C I S I O N S No. 27 DOI:.5277/ord73 Nasrullah KHAN Muhammad ASLAM 2 Kyung-Jun KIM 3 Chi-Hyuck JUN 4 A MIXED CONTROL CHART ADAPTED TO THE TRUNCATED LIFE TEST

More information

Elements of Asymptotic Theory. James L. Powell Department of Economics University of California, Berkeley

Elements of Asymptotic Theory. James L. Powell Department of Economics University of California, Berkeley Elements of Asymtotic Theory James L. Powell Deartment of Economics University of California, Berkeley Objectives of Asymtotic Theory While exact results are available for, say, the distribution of the

More information

E cient Semiparametric Estimation of Quantile Treatment E ects

E cient Semiparametric Estimation of Quantile Treatment E ects E cient Semiarametric Estimation of Quantile Treatment E ects Sergio Firo y First Draft: ovember 2002 This Draft: January 2004 Abstract This aer resents calculations of semiarametric e ciency bounds for

More information

Performance of lag length selection criteria in three different situations

Performance of lag length selection criteria in three different situations MPRA Munich Personal RePEc Archive Performance of lag length selection criteria in three different situations Zahid Asghar and Irum Abid Quaid-i-Azam University, Islamabad Aril 2007 Online at htts://mra.ub.uni-muenchen.de/40042/

More information

DETC2003/DAC AN EFFICIENT ALGORITHM FOR CONSTRUCTING OPTIMAL DESIGN OF COMPUTER EXPERIMENTS

DETC2003/DAC AN EFFICIENT ALGORITHM FOR CONSTRUCTING OPTIMAL DESIGN OF COMPUTER EXPERIMENTS Proceedings of DETC 03 ASME 003 Design Engineering Technical Conferences and Comuters and Information in Engineering Conference Chicago, Illinois USA, Setember -6, 003 DETC003/DAC-48760 AN EFFICIENT ALGORITHM

More information

Decoding First-Spike Latency: A Likelihood Approach. Rick L. Jenison

Decoding First-Spike Latency: A Likelihood Approach. Rick L. Jenison eurocomuting 38-40 (00) 39-48 Decoding First-Sike Latency: A Likelihood Aroach Rick L. Jenison Deartment of Psychology University of Wisconsin Madison WI 53706 enison@wavelet.sych.wisc.edu ABSTRACT First-sike

More information

New Schedulability Test Conditions for Non-preemptive Scheduling on Multiprocessor Platforms

New Schedulability Test Conditions for Non-preemptive Scheduling on Multiprocessor Platforms New Schedulability Test Conditions for Non-reemtive Scheduling on Multirocessor Platforms Technical Reort May 2008 Nan Guan 1, Wang Yi 2, Zonghua Gu 3 and Ge Yu 1 1 Northeastern University, Shenyang, China

More information

Linear diophantine equations for discrete tomography

Linear diophantine equations for discrete tomography Journal of X-Ray Science and Technology 10 001 59 66 59 IOS Press Linear diohantine euations for discrete tomograhy Yangbo Ye a,gewang b and Jiehua Zhu a a Deartment of Mathematics, The University of Iowa,

More information

Aggregate Prediction With. the Aggregation Bias

Aggregate Prediction With. the Aggregation Bias 100 Aggregate Prediction With Disaggregate Models: Behavior of the Aggregation Bias Uzi Landau, Transortation Research nstitute, Technion-srael nstitute of Technology, Haifa Disaggregate travel demand

More information

Lecture 3 Consistency of Extremum Estimators 1

Lecture 3 Consistency of Extremum Estimators 1 Lecture 3 Consistency of Extremum Estimators 1 This lecture shows how one can obtain consistency of extremum estimators. It also shows how one can find the robability limit of extremum estimators in cases

More information

Estimating Time-Series Models

Estimating Time-Series Models Estimating ime-series Models he Box-Jenkins methodology for tting a model to a scalar time series fx t g consists of ve stes:. Decide on the order of di erencing d that is needed to roduce a stationary

More information

Asymptotic Properties of the Markov Chain Model method of finding Markov chains Generators of..

Asymptotic Properties of the Markov Chain Model method of finding Markov chains Generators of.. IOSR Journal of Mathematics (IOSR-JM) e-issn: 78-578, -ISSN: 319-765X. Volume 1, Issue 4 Ver. III (Jul. - Aug.016), PP 53-60 www.iosrournals.org Asymtotic Proerties of the Markov Chain Model method of

More information

The non-stochastic multi-armed bandit problem

The non-stochastic multi-armed bandit problem Submitted for journal ublication. The non-stochastic multi-armed bandit roblem Peter Auer Institute for Theoretical Comuter Science Graz University of Technology A-8010 Graz (Austria) auer@igi.tu-graz.ac.at

More information

Uniform Law on the Unit Sphere of a Banach Space

Uniform Law on the Unit Sphere of a Banach Space Uniform Law on the Unit Shere of a Banach Sace by Bernard Beauzamy Société de Calcul Mathématique SA Faubourg Saint Honoré 75008 Paris France Setember 008 Abstract We investigate the construction of a

More information

Supplementary Materials for Robust Estimation of the False Discovery Rate

Supplementary Materials for Robust Estimation of the False Discovery Rate Sulementary Materials for Robust Estimation of the False Discovery Rate Stan Pounds and Cheng Cheng This sulemental contains roofs regarding theoretical roerties of the roosed method (Section S1), rovides

More information