A Non-parametric \Trim and Fill" Method of. Sue Taylor. Department of Preventive Medicine and Biometrics,

Size: px
Start display at page:

Download "A Non-parametric \Trim and Fill" Method of. Sue Taylor. Department of Preventive Medicine and Biometrics,"

Transcription

1 A Non-parametric \Trim and Fill" Method of Assessing Publication Bias in Meta-analysis Sue Taylor Department of Preventive Medicine and Biometrics, University of Colorado Health Sciences Center, Denver, CO 8262 USA and Richard Tweedie Department of Statistics, Colorado State University, Fort Collins, CO 8523 USA September 16, 1998 Abstract Meta-analysis collects and synthesizes results from individual studies to estimate an overall eect size. If published studies are chosen, say through a literature review, an inherent selection bias may arise, since for example, studies may tend to be published more readily if they are statistically signicant, or deemed to be more `interesting' in terms of the impact of their outcomes. We develop a simple rank-based data augmentation technique, formalizing the use of funnel plots, to estimate and adjust for the numbers and outcomes of missing studies. Several non-parametric estimators are proposed for the number of missing studies, and their properties are developed analytically and through simulations. We apply the method to simulated and epidemiological data sets, and show it is both eective and consistent with other criteria in the literature. Corresponding author's address: tweedie@stat.colostate.edu Key words: Meta-analysis; Publication bias; Missing studies; File drawer problem; Funnel plots; Data augmentation; lung cancer; passive smoking 1

2 1 Introduction 1.1 The publication bias problem There has been an enormous recent increase in the use of meta-analysis as a statistical technique for combining the results of many individual analyses in areas of clinical trials, epidemiology, sociology and psychology (Hedges and Olkin, 1985; Olkin, 1992; Cooper and Hedges, 199). In all these contexts, one well-documented concern is the need to collect all relevant studies, both published and unpublished, if the subsequent inferences are to be valid (Iyengar and Greenhouse, 1988; Dear and Begg, 1992; Hedges, 1992; Begg, 199; Begg and Mazumdar, 199; Gleser and Olkin, 1996; Egger et al, 1997). The use of a non-representative proportion of signicant studies, or studies dierentially giving results in a \positive" direction, will lead to a non-representative set of studies in the meta-analysis data set. A standard meta-analysis model will then result in a conclusion biased toward signicance or positivity. This is particularly problematic for a meta-analysis whose data come solely from the published scientic literature, but studies may be suppressed for many other reasons than failure to be published (see Cooper (1998), Givens, Smith and Tweedie (1997) and the discussion thereto, and Misakian and Bero (1998)). This phenomenon has become known as `publication bias', or the `le-drawer problem' (Iyengar and Greenhouse, 1988). In this paper we describe a new way to account for the magnitude of this problem. There are two steps to our approach. Firstly, we estimate the number of missing studies using methods based on symmetry assumptions. These are simple to implement, in contrast to most existing methods, and they appear in practice to pick up the \missing studies" indicated visually by funnel plots (Light and Pillemer, 198), as described in Section 1.2. Secondly, we impute the missing values using a \trim and ll" approach that enables us to derive estimates of the overall eect on the inferences in the meta-analysis due to the publication bias. We show that this dual approach works well in simulation studies and illustrate its use on an epidemiological example. Rather than merely testing for the possibility of publication bias, as in, say, Begg (199) or Egger et al (1997), the trim and ll algorithm appears to give a method of evaluating and also adjusting for the possibility of publication bias, so that one can tell whether the magnitude of the problem warrants consideration or not. 1.2 Funnel plots and publication bias There are several methods which have been proposed to detect the existence of publication bias in a meta-analysis. Perhaps the most common is the funnel plot (Light and Pillemer, 198), and related graphical methods for visually determining the existence of missing studies (Galbraith, 1988). Figure 1(a) shows a funnel plot using data from a simulation of 35 studies, each of which supplies an estimate Y i of the eect in question in the i th study, and an estimate of the variance i 2 within that study. The funnel plots we use depict?1 i graphed against Y i, so that the most precise 2

3 estimates (for example, those from the largest studies) are at the top of the funnel, and those from less precise, or smaller, studies are at the base of the funnel. The data were generated as in Section with Y i N(; i 2 ) so that the true mean here is ; the sample used has metaanalyzed estimate (using the random eects model below) b = :8 with 95% condence interval (CI) (?:18; :178), which is not signicant at the usual 5% level. The fact that there is a \funnel" shape is not based on any detailed modeling: it relies on two empirical observations, namely (F1) The variances i 2 of studies in a meta-analysis are not identical, but are distributed in such a way that there are fewer precise studies and rather more imprecise studies: this appears to be borne out by data in, for example, Light and Pillemer (198, Chapter 3) amongst many others. (F2) At any xed level of i 2, studies are symmetrically distributed around the true mean. This is justied if we use the normal model in Section 2.1, but clearly is a much weaker assumption than such normality. FIGURE 1 (a), FIGURE 1 (b), FIGURE 1 (c) NEAR HERE To illustrate the eect of publication bias, in Figure 1(b) we show the same funnel plot as in Figure 1(a), but with the `left-most' 5 studies suppressed. This gives the typical pattern that is taken to indicate publication bias (Light and Pillemer, 198): the assumption is that, whether because of editorial policy or author inaction or other reasons, these papers (which show, say, no signicance, or perhaps the reverse eect (namely < ) from that envisaged when carrying out the studies (namely > )) are the ones that might not be published (see Cooper, 1998 pp 5-55). Such suppression will aect the estimate of. Indeed in this example it increases b to :12 with 95% condence interval (CI) (:37; :21), which is now signicant at the 5% level: in other words, suppressing these ve studies leads to an incorrect inference by usual standards, and increases the estimated mean by almost 6%. The evaluation of funnel plots is subjective in much of the literature. A number of more quantitative methods of detecting publication bias have been proposed: these range from a simplistic method of moments approach (Sugita, 1992,199), to more sophisticated methods such as a rank correlation test (Begg, 199; Begg and Mazumdar 199) based on the form of a biased funnel plot as in Figure 1(b), a method based on p-values (Gleser and Olkin, 1996), or a regression based test (Egger et al, 1997) based on the Galbraith (1988) version of a funnel plot. These tests are all designed to detect rather than adjust for publication bias. There also exist several quantitative methods which estimate the number of missing studies and, by explicitly modeling the probability of publication, provide estimates of the eect of the missing studies on the overall eect size (Dear and Begg, 1992; Hedges, 1992; Givens et al, 1997; Smith, Givens and Tweedie, 1998). All are complex and highly computer intensive to run, and Dear and Dobson (1997) noted \previous 3

4 methods have not been much used... (and)... the value of any new statistical methodology depends, in part, on the extent to which it is adopted"; they also noted that \the culture of meta-analysis has traditionally favoured very simple methods". We now develop a simple technique that seems to meet many of the objections to other methods. 2 Adjusting for Publication Bias 2.1 Fixed and Random Eects Models We rst describe a standard structure for a meta-analysis in the absence of publication bias. We assume we have n individual studies, all of which are addressing the same problem; and that there is some global \eect size" (such as log relative risks, risk dierences or log mortality ratios in clinical or epidemiological trials; or dierences in performance in sociological experiments) which is relevant to the overall problem, and which each study attempts to measure. For each j = 1; : : :; n, study j produces an eect size Y j which estimates, and an estimated within-study variance j 2. The random eects (RE) model which is commonly used to combine the Y j is then Y j = + j + j (1) where j N(; 2 ) is introduced to account for heterogeneity between studies, and j N(; 2 j ) represents the within-study variability of study j. The RE approach has been argued (NRC Report, 1992) to be preferable to the xed eects (FE) model which assumes that 2 =, so that any heterogeneity between studies is purely random. This model leads, through normal theory, to the meta-analyzed estimate of given by b = P Yj w j = P w j where w j = ( 2 j + 2 )?1, and with Var[b ] = 1= P wj : In tting this estimator, it is usually assumed that the j 2 are known. In the FE model we take 2 = ; in the RE model there are various moment-based and maximum likelihood approaches giving estimates of 2 (Biggersta and Tweedie, 1997), the most common being the DerSimonian-Laird estimator (DerSimonian and Laird, 1986), which we use in Section 5 and Section 6 below. 2.2 Modeling Publication Bias Now we modify the model above to account for publication bias. We assume that in addition to the n observed studies, there are an additional k relevant studies which are not observed, due to publication bias. The value of k, and the eect sizes which might have been found from these k studies, are unknown and must be estimated; and uncertainty about these estimates must be reected in the nal meta-analysis inference. The frequentist models of Dear and Begg (1992) and Hedges (1992), and the Bayesian models of Givens et al (1997), estimate both k and the positions of the missing studies using the normal

5 models above together with a model of the probabilities that studies are missing in dierent p-value intervals. In contrast, our approach is non-parametric, and relies only on symmetry assumptions, which are certainly satised by both the FE and RE models above. To describe this, we rst recall the standard assumption behind the Wilcoxon test for a collection X i ; i = 1; : : :; N of random variables (Marascuilo and McSweeney, 1977). If the hypothesis is that the median of each of the X i =, then each observed value is assumed to have arisen by rst generating jx i j and then generating the sign of X i according to an independent set of Bernoulli variables taking values?1; 1 with equal probability. If r i denotes the rank of jx i j, and if W + N = X Xi> r i is the sum of the ranks associated with the positive X i, then we say that W + N distribution. has a Wilcoxon Now let us suppose that there are originally n + k studies with this structure, and that k values from the set X j have been suppressed, leaving a set of n observed studies. Our key assumption is: The suppression has taken place in such a way that it is the k values of the X j with the most extreme negative ranks that have been suppressed. This might be expected to lead to a truncated funnel plot such as we have in Figure 1(b), for example. We will call this model for the overall set of studies the suppressed Bernoulli model. We are now left with n observed values, and these remaining jx i j are re-ranked as r i : these ranks run now from 1 to n. We will consider three statistics based on this set of observed ranks. Firstly, we let denote the length of the rightmost run of ranks associated with positive values of the observed X i : that is, if h is the index of the most negative of the X i, and if r h is its absolute rank, then = n? r h. Secondly, we consider the Wilcoxon statistic for the observed set of variables omitting the suppressed variables. In order to distinguish the suppressed data-set from a set of random variables each of which is symmetric, we will dene the `trimmed' rank test statistic for the observed n values as T n = X Xi> r i : (2) The distributions of and T n depend on k, although we omit this in the notation. Based on these quantities we dene three estimators of k, given by R =? 1; (3) L = T n? n(n + 1) 2n? 1 () 5

6 and q Q = n? 1=2? 2n 2? T n + 1=: (5) The forms of L and Q are based on method of moments considerations developed in Section 3.3. The main analytic results of this paper are the following, which are proved in Section 3. Theorem 2.1 Under the assumption that the median of the original X i is, and under the suppressed Bernoulli model, (i) the estimator R has mean and variance given by (ii) the estimator L has mean and variance given by where E[R ] = k ; Var[R ] = 2k + 2; (6) E[L ] = k? k 2 =(2n? 1); Var[L ] = 16Var(T n )=(2n? 1) 2 ; (7) Var(T n ) = 2?1 [n(n + 1)(2n + 1) + 1k k k? 18nk 2? 18nk + 6n 2 k ]; (8) (iii) the estimator Q has mean and variance given (approximately) by 2Var[T n ] E[Q ] k + ; Var[Q [(n? 1=2) 2? k (2n? k? 1)] 3 ] 2 Var[T n ] (n? 1=2) 2? k (2n? k? 1) : (9) Remark: If we consider the asymptotic situation where n is large and k is of smaller order than n, then E[L ] k ; E[Q ] k ; E[R ] = k Var[L ] n=3; Var[Q ] n=3; Var[R ] = o(n) Hence both L and Q will have similar behavior, although in practice Q is usually of greater magnitude than L and occasionally appears to give excessively large values. It can be shown that the MSE of L is smaller than that of R for certain realistic ranges of n; k, and it is more robust against certain congurations of data than is R : for example, if there is just one very negative value of X i followed by a missing collection of studies (which violates the exact assumption we have made but not the spirit of publication bias), then R will be zero although L may be non-zero. Remark: Since we have an expression for the bias of L, we can in principle remove this to at least O(n?2 ). If we dene L 1 = L [1? (1=2n)] + L 2 =(2n? 1)? 1=6? 1=n (1) then it is possible to check that, again for n large and k xed and smaller than n, E[L 1 ] = k + O(n?2 ): However, this seems to make only marginal dierence in simulations (Taylor, 1998). 6

7 3 Estimators of k 3.1 The estimator R under the suppressed Bernoulli model We rst examine the simple estimator R dened by (3). Begin by considering the sequence of signed ranks in the original (unsuppressed) data set. Thus, for example, if the original set of studies is of size n + k = 1, suppose the ranks and their associated signs are?1;?2; 3; ; 5;?6;?7; 8; 9;?1; 11; 12;?13; 1: Now suppose we suppress k = 3 studies, which (since we suppress the three most negative studies) means we suppress those corresponding to?13;?1;?7. The signed ranks after ranking the remaining 11 studies are?1;?2; 3; ; 5;?6; 7; 8; 9; 1; 11: We can consider this as being in two groups: the set f7; 8; 9; 1; 11g which were \promoted" to be the top ve ranks, with positive sign, due to the suppression; and the remaining six ranks, which still have the same random permutation as before. Thus in this case the run of positive ranks is of length ve, and = 5. But we can see that is just the number of positive terms we need in order to reach the rst negative value which is not suppressed, which is the (k + 1) st negative term in the original sequence; and hence is a negative binomial variable corresponding to a sequence of simple Bernoulli coin-tosses. Using the negative binomial form (Casella and Berger (199), p. 96) we thus have immediately the following result, which gives (6). Theorem 3.1 The distribution of the estimator R is given by P(R = m) = k + m + 1 m k +m+2 (11) Remark: The major drawback to this estimator is that it is non-robust to the existence of a relatively isolated negative term at the right hand end of the sequence of ranks. Of course, this is consistent with the suppression hypothesis we have used: but in practice, we may nd funnel plots with one such isolated value and the appearance of a gap in the next run of positive values. The use of R may be inappropriate in this case and the next set of estimators, based on T n, seem rather better in this context. 3.2 Moments of T n under the suppressed Bernoulli model We now nd the rst and second moments of T n and show that we can use these to derive the forms L and Q as in () and (5). 7

8 Theorem 3.2 For given xed k, under the suppressed Bernoulli model where W + N T n = n + (n? 1) + : : : + (n? + 1) + W + n? (12) has the Wilcoxon distribution; and has the negative binomial distribution k + m? 1 1 k +m P( = m) = : (13) m 2 The rst two moments of this mixture distribution are and V ar[t n ] given by (8). E[T n ] = [n(n + 1) + k (2n? k? 1)]= (1) Proof By removing the \top" k negative ranks we are left with matching positive ranks, and we denote the number of these by. By the suppressed Bernoulli model these have the distribution (13) (Casella and Berger (199), p. 96): note that includes one less \positive run" than. After allowing for these positive ranks, the remaining n? of the ranks are independently positive and negative leading to the nal term in (12). Recalling that W m + has mean and variance given for xed m by E[W m] + = m(m + 1)= and Var[W m] + = m(m + 1)(2m + 1)=2, and using the fact that E[] = k and E[ 2 ] = k 2 + 2k, we nd E[T n ] = E [E[W + n? + (n? + 1) + + nj]] = P 1 = [ (n?)(n?+1) + n(n+1)? (n?)(n?+1) ]P[ = ] 2 2 = n(n+1)? (2n+1) E[] + 1 Var[] + 1 [E[]]2 + (n )E[]? 1 2 E[2 ] (15) Note that this shows that = n(n+1) + k [2n?k?1] : E[T n ] = E[W n + ] + k [2n? k? 1] : (16) A similar calculation shows that, if we actually know the value of, E[T n j] = E[W + n (2n? + 1) ] + : 8

9 Using this, for the variance we have Var[T n ] = E [Var[T n j]] + Var [E[T n j]] = P 1 = Var[W + n? + (n? + 1) + + n]p[ = ] + Var [E[W + n ] + (2n?+1) ] = P 1 = Var[W + n? ]P[ = ] + Var [ (2n?+1) ] = 1 2 (P (n? )(n? + 1)(2(n? ) + 1)P[ = ]) + E[[ [2n?+1] ] 2 ]? [E[ [2n?+1] ]] 2 = 1 2 [2n3 + 3n 2 + n? [6n 2 + 6n + 1]E[] + [6n + 3]E[ 2 ]? 2E[ 3 ]] [E[ ]? [n + 2]E[ 3 ] + [n 2 + n + 1]E[ 2 ]]? [ 1 [[2n + 1]E[]? E[2 ]]] 2 = Var[W + n ] [1k3 + 27k k? 18nk 2? 18nk + 6n 2 k ] (17) where we have used the fact that E[ 3 ] = k 3 + 6k 2 + 6k and E[ ] = k + 12k k k ; and this gives our result. ut Remark: Note that in using this approach we are assuming that n is suciently larger than k that will not be truncated prior to actually nding k negative values: in practice this seems to be a reasonable assumption given that we should not be considering such methods if k is overly large with respect to n. We have also ignored the fact that we know that : using the distribution of conditioned on this inequality would be more accurate, although the algebra involved would be considerably more tedious, and the results below indicate that the estimators we use have acceptable properties without this extra level of complication. 3.3 Estimating k using T n We assume for the purposes of this section that the median of the X i is known, and take it as. We rst consider the method of moments estimator for k which follows by solving (1): this gives Q = (n? 1=2)? p (n? 1=2) 2? [T n? n(n + 1)=] (18) which gives (5). In dening this estimator, we are assuming that only the negative root of the quadratic (1) is relevant. The other root leads to a (rounded) estimate of k which is always as large as n: our choice of Q therefore implicitly requires a belief that we are in fact observing at least 1=2 of the real studies, which seems the more realistic situation. More importantly, for Q to be well dened, the discriminant 2n 2?T n +1= must be non-negative: that is, we must have T n < n 2 =2 + 1=16 for this estimator to be dened. Simulations show that when k is large with relation to n, and when n itself is small, this may be violated quite frequently. 9

10 Thus, although we do evaluate properties of Q, it clearly has drawbacks in practice. We do note, however, that the discriminant is negative only for large values of T n ; and that at the point where it is zero, our estimate would be n? 1=2. It is thus logical to use this latter, rather large, estimate as a lower bound when Q is complex. Because of these concerns, we now consider a second estimator based on (1). If we ignore the quadratic term involving k in (1), which is relatively small, at least for large n, then from the linearization T n? n(n + 1)= (2n? 1)k = we nd the explicit linear estimator as given in () above. We now prove the remainder of Theorem 2.1. (i) For the estimator L, we have, using (16) L = (T n? n(n + 1)=) ; (19) 2n? 1 E[L ] = [2n?1] E[T n? E[W + n ]] k = [2n?k?1] [2n?1] = k? k2 2n?1 ; (2) Var[L ] = ( 2n?1 )2 Var[T n? E[W + n ]] = ( 2n?1 )2 Var[T n ]: (ii) For the estimator Q we write Q = f(x) = (n? 1 2 )? q(n? 1 2 )2? X where X = T n? E[W + n ] and use a Taylor series expansion about = E[X]. Using the rst-order approximation E[Q ] f() we get from (16) that E[Q ] (n? 1 2 )? q(n? 1 2 )2? (E[T n ]? E[W + n ]) = (n? 1 2 )? q(n? 1 2 )2? k (2n? k? 1) = k : (21) Adding a second-order term, we use the approximation E[Q ] f() f ()Var[X] to give: E[Q ] k + 2Var[Tn] [(n? 1 2 )2?k (2n?k?1)] 3 2 : (22) 1

11 To derive the variance of Q we use the approximation that Var[Q ] [f ()] 2 Var[X] so that 2 Var[Q ] [ p ] 2 Var[T (n? 1 n ] 2 )2?k (2n?k?1) = Var[Tn] (n? : (23) 1 2 )2?k (2n?k?1) ut Properties of the estimators In using any of the estimators above, we will round to the nearest non-negative integer, since algorithmically we need to trim whole studies rather than fractions of studies. Thus in practice we use R + = maxf; R g; L + = [maxf; L + 1=2g]; Q + = [maxf; Q + 1=2g]; (2) where [x] is the integer part of x. We now show using simulations that the means and variances of these estimators, which we shall call \empirical" versions, are acceptably close to those in Theorem 2.1 for the ranges of n; k that we address, and thus the use of the theoretical values seems justied. To carry out the simulations we generated, for each of the cases below, 5 sets of N = n + k normal variates, each with mean zero and variances 2 i, the i being taken from a?(3; 1=9) density. Figure 2 shows the means of the simulations for R + ; L+ ; Q+ compared with the theoretical means of R ; L ; Q from Theorem 2.1. Over the range of N we have compared, when k = 5 or 1, there is very little dierence between the analytic and empirical forms in general: the means of L and L + dier by no more than.6 in the former case and no more than.16 in the latter, for example. We see that R is even more stable and that the analytic forms for Q (which we recall were only approximate) are also accurate except in the relatively extreme case that n = 15 and k = 1. When k = the eect of truncating to ensure non-negative empirical estimators is also clear, and typically means are higher than the analytic forms, as we would expect. FIGURE 2 NEAR HERE The variances of the empirical estimators are also generally well approximated by the analytic forms in the cases where k = 5 or 1: details are in Taylor (1998). The exception is again when k = 1 for Q : here we seem to need n to be greater than around 35 for the approximations to work. When k = the eect of rounding all the negative values to zero is to diminish the variances, and this seems also to be the case for L when k = 5. In these cases the analytic forms are quite conservative in the sense of leading to excessive estimates of variance in all cases. 11

12 5 The iterative \Trim and Fill" Algorithm The results above show the behavior of our estimators when the value of is known. The more dicult situation that arises in meta-analysis is caused because is unknown, so that the number and position of any missing studies is correlated with the true value of. To handle this situation we develop an iterative algorithm. Using the estimators above we proceed as follows: Step 1: Estimate b (1) using the RE estimator. Construct the initial set of centered values Y (1) i = Y i? b (1) ; i = 1; : : :; n; and estimate b k (1) using one of the estimators above (say L + for example) applied to the set Y (1) i. Step 2: Relying on a symmetry argument, remove k (1) values from the right hand end of the set of values Y i, and estimate b (2) based on the trimmed \symmetric" set of n? k b(1) values, i.e. fy 1 ; : : :; Y (1) n? b k g. Construct the next set of centered values and estimate b k (2) from the set Y (2) i ; i = 1; : : :; n. Y (2) i = Y i? b (2) ; i = 1; : : :; n; Step 3: Remove k b(2) values from the right hand end; re-estimate b (3) based on the leftmost set of n? k b(2) values; construct Y (3) i = Y i? b (3) ; i = 1; : : :; n; estimate k b(3) from Y (3) i. Step : Continue until an iteration J where k b(j) = k b(j?1) := k b, at which point we also have b (J) = b (J?1). Step 5: As a nal step, \ll" the funnel plot by using the imputed symmetric values with associated imputed standard errors Y j = 2b (J)? Yn?j+1 ; j = 1; : : :; b k j = n?j+1 ; j = 1; : : :; b k : Estimate the nal value of using the full augmented data-set by b (F ) = P n Y 1 iw i + P b k Y 1 P n 1 w i + P b k 1 w i with imputed 95% CI given by h b (F ) 1:96? q Pn w 1 i + P ; b (F ) + b k 1 w i 12 i w i 1:96 q Pn 1 w i + P b k 1 w i (25) i ; (26)

13 here w i = [i 2 + F 2 ]?1 and w i = [( i )2 + F 2 ]?1, and F 2 is estimated from the entire lled data set fy 1 ; : : :; Y n ; Y 1 ; : : :; Yb g, using the DerSimonian-Laird estimator. k We make the following observations on this algorithm: (i) Note that nding b (J) in Step is only possible if b k n? 1, and in practice we truncate the algorithm if this value is reached. (ii) When we have a xed eect model, it is clear that the algorithm must converge, since the b (j) have to decrease by denition, and so k b(j) increases. In the real (non-simulated) examples in Taylor (1998) we found that convergence never took more than iterations. However if the data are tted by a random eects model it is possible that the b (j) may oscillate, and so k b(j) will also oscillate. This appears to be rare in simulations but consideration of the possibility is required. (iii) The nal estimate reects the variability that should be imputed in the missing studies: thus we are not only adjusting the mean but also the condence interval for these imputed values. In fact, one can check that in the FE case the sum in (25) only depends on the n? b k values in the trimmed data set, so the nal mean in this context does not depend on the location but only on the number of imputed missing values. In the RE case this is not so, since the imputed (lled) values may change the estimate of 2. 6 Two examples We now apply the method to two dierent examples, the simulated data set in Section 1.2, where we know the underlying structure, and a meta-analysis of 35 studies of the eect of environmental tobacco smoke (ETS) on lung cancer, studied in Givens et al (1998) using a relatively complex Bayesian approach. We compare the results obtained with those from three other methods: the estimate of the number of unreported studies calculated by the \simple" method of equation (2) of Gleser and Olkin (1996), which is comparable in calculation eort to our estimates of k ; and the tests for publication bias of Begg (199) and Egger et al (1997). Example: simulated data In the simulated data shown in Figure 1, we know ve studies have been suppressed, and that this changes a statistically insignicant result on the original data to a statistically signicant result if publication bias is not considered. On this data set the Gleser and Olkin (1996) method estimates that there are 11 missing studies. The two-sided p-values from the Begg (199) and Egger et al (1997) methods are respectively.1 and., indicating that the null hypothesis of no suppression is not supported. Note that Begg (199) points out that his test is not powerful and a value such as this should be taken as an indication of the possible existence of publication bias. 13

14 Table 1 shows the way in which the various estimators behave on this example. As is often found in examples, we have R + L+ Q+. There is relatively little dierence on the lled value of and its CI, no matter which estimator is used to ll. Figure 1(c) is lled using the actual number, i.e. using L +. Table 1: Eect of publication bias using estimators of k for the data in Figure 1 Estimator Imputed n Value of k (SE) No. iterations b 95% CI Original data (-.18,.178) Suppressed data (.37,.21) R + 3 (3.2) 2.92 (-.1,.185) L (3.7) 5.82 (-.11,.176) Q (.6) 6.79 (-.13,.17) Example: Lung cancer and environmental tobacco smoke (ETS) Givens et al (1997) use a Bayesian approach to estimate and adjust for publication bias in a set of 35 studies which assess the risk of lung cancer in non-smoking women exposed to spousal smoking. The data consist of relative risks and associated condence intervals, and have also been studied by Lee (1992) and Mengersen, Tweedie and Biggersta (1995). We nd that R + = L+ = 8, and Q+ = 9. In this case the two-sided p- values from the Begg (199) and Egger et al (1997) methods are respectively.16 and.5, also indicative of publication bias. In contrast, the method of Gleser and Olkin (1996) estimates that k = : this is because this method is very susceptible to just one study with a large p-value against the null hypothesis (in this case due to a Y i which is only slightly negative but has a very small variance). FIGURE 3 NEAR HERE After lling eight studies, in accord with R + or L+, the overall relative risk moves from 1.2 [1.8, 1.3] to 1.12 [1.1, 1.25], almost identical to the more complicated procedure of Givens et al (1997) and the more subjective analysis of Mengersen, Tweedie and Biggersta (1995). This is illustrated in Figure 3, where the original data are shown as solid symbols, with the \lled" data as open symbols. The upper panel uses the appropriate eect size in the funnel plot, whereas the lower panel gives the meta-analyzed estimates and CIs in the original scale. 7 Discussion A meta-analysis based on only a subset of all relevant studies may result in biased conclusions. It is a common belief, backed by several empirical assessments, that studies are not uniformly 1

15 likely to be published in scientic journals (Cooper, 1998, pp 5-55; Dickersin, Min and Meinert, 1992). Easterbrook et al. (1991) suggest that statistical signicance is a major determining factor of publication. Some researchers (e.g. students with Masters' or Ph.D. theses) may not submit a nonsignicant result for publication, and editors may not publish nonsignicant results even if they are submitted (British Medical Journal, 1983). Evaluating this eect is dicult, since the missing studies inuence the mean that is estimated in the meta-analysis. As DuMouchel and Harris note in the comments on Givens et al (1997), \attempts to assess publication bias beyond simple graphs like the funnel plot seem to involve a tour de force of modeling, and as such are bound to run up against resistance from those who are not statistical modeling wonks". The method we propose here avoids the need for such a cumbersome model by using a simple approach to calculating k (in terms of computational burden and assumptions), and by applying this iteratively as we trim and ll the data. The simulations carried out indicate that the non-parametric estimators we have derived for the number of missing studies work well, giving realistic estimates using either R + or L+ in particular. Comparisons of these estimators are given in more detail in Taylor (1998), and in the real examples examined there, the trim and ll method matches the subjective impression of bias given by funnel plots, and appears to give results consistent with those of Begg (199) or Egger et al (1997). Of course, we do not actually know that the lled studies match reality in any close way. But in the sense that we can see the potential for incorrect conclusions if publication bias does exist, the trim and ll method does seem to provide a robust diagnostic method to enable us to decide which meta-analyses are safe from this bias, and which must be treated with considerable caution. References Begg, C. B. (199). Publication bias. In The Handbook of Research Synthesis, Cooper, H. and Hedges, L. V., eds., 399{9. Russell Sage Foundation, New York. Begg, C. B. and Mazumdar, M. (199). Operating characteristics of a rank correlation test for publication bias. Biometrics, 5: 188{111. Biggersta, B. J., and Tweedie, R. L. (1997). Incorporating variability in estimates of heterogeneity in the random eects model in meta-analysis. Statistics in Medicine, 16: 753{768. British Medical Journal Editorial Sta (1983). The editor regrets... (editorial). British Medical Journal, 28:58. Casella, G. and Berger, R. L. (199). Statistical Inference. Duxbury Press, Belmont, California. 15

16 Cooper, H. (1998). Synthesizing Research. Sage Publications, Thousand Oaks. Cooper, H. and Hedges, L. V., eds. (199). The Handbook of Research Synthesis. Russell Sage Foundation, New York. Dear, K. and Begg, C. (1992). An approach for assessing publication bias prior to performing a meta-analysis. Statistical Science, 7:237{25. Dear, K. and Dobson, A. (1997). Comment on Givens, G. H., Smith, D. D., and Tweedie, R. L. (1997). Publication bias in meta-analysis: A Bayesian data-augmentation approach to account for issues exemplied in the passive smoking debate (with discussion). Statistical Science, 12:25{26. DerSimonian, R. and Laird, N. M. (1986). Meta-analysis in clinical trials. Trials, 7:177{188. Controlled Clinical Dickersin, K., Min, Y., and Meinert, C. (1992). Factors inuencing publication of research results. Journal of the American Medical Association, 267:37{378. DuMouchel, W. and Harris, J. (1997). Comment on Givens, G. H., Smith, D. D., and Tweedie, R. L. (1997). Publication bias in meta-analysis: A Bayesian data-augmentation approach to account for issues exemplied in the passive smoking debate (with discussion). Statistical Science, 12:2{25. Easterbrook, P., Berlin, J., Gopalan, R., and Matthews, D. (1991). Publication bias in clinical research. Lancet, 337:867{872. Egger, M., Smith, G. D., Schneider, M., and Minder, C. (1997). Bias in meta-analysis detected by a simple, graphical test. British Medical Journal, 315:629{63. Galbraith, R. (1988). A note on the graphical presentation of estimated odds ratios from several clinical trials. Statistics in Medicine, 7:889{89. Givens, G. H., Smith, D. D., and Tweedie, R. L. (1997). Publication bias in meta-analysis: A Bayesian data-augmentation approach to account for issues exemplied in the passive smoking debate (with discussion). Statistical Science, 12:221{25. Gleser, L. J. and Olkin, I. (1996). Models for estimating the number of unpublished studies. Statistics in Medicine, 15:293{257. Hedges, L. (1992). Modeling publication selection eects in meta-analysis. Statistical Science, 7:227{236. Hedges, L. and Olkin, I. (1985). Statistical Methods for Meta-analysis. Academic Press, New York. Iyengar, S. and Greenhouse, J. B. (1988). Selection models and the le drawer problem (with discussion). Statistical Science, 3:19{135. Lee, P. N. (1992). Environmental Tobacco Smoke and Mortality. Karger, Basel. Light, R. and Pillemer, D. (198). Summing Up: the Science of Reviewing Research. Harvard University Press, Cambridge. Marascuilo, L. A. and McSweeney, M. (1977). Nonparametric and distribution-free methods for the social sciences. Brooks/Cole Publishing Co, Monterey. 16

17 Mengersen, K., Tweedie, R., and Biggersta, B. (1995). The impact of method choice in metaanalysis. Australian Journal of Statistics, 37:19{. Misakian, A. L. and Bero, L. A. (1998). Publication bias and research on passive smoking: Comparison of published and unpublished studies. Journal of the American Medical Association, 28:25{ 253. NRC Committee on Applied and Theoretical Statistics (1992). Combining Information: Statistical Issues and Opportunities for Research. National Academy Press, Washington. Olkin, I. (1992). Meta-analysis: methods for combining independent studies. Statistical Science, 7:226. Sugita, M., Kanamori, M., Izuno, T., and Miyakawa, M. (1992). Estimating a summarized odds ratio whilst eliminating publication bias in meta-analysis. Japanese Journal of Clinical Oncology, 22:35{358. Sugita, M., Yamaguchi, N., Izuno, T., Kanamori, M., and Kasuga, H. (199). Publication probability of a study on odds ratio value: Circumstantial evidence for publication bias in medical study areas. Tokai Journal of Experimental and Clinical Medicine, 19:29{37. Taylor, S. J. (1998). Eects of publication bias in meta-analysis. PhD dissertation, University of Colorado Health Sciences Center, Department of Preventive Medicine and Biometrics. Tweedie, R. L., Scott, D. S., Biggersta, B. J., and Mengersen, K. L. (1996). Bayesian metaanalysis, with application to studies of ETS and lung cancer. Lung Cancer, 1: S171{S19. 17

18 1 8 1/standard error Effect size Figure 1a. Funnel plot of 35 simulated studies: overall effect size shown is.8 with 95% CI of (-.18,.178) /standard error Effect size Figure 1b. Plot with five 'left-most' studies suppressed: overall effect size now is estimated as.12 with 95% CI of (.37,.21) /standard error Effect size Figure 1c. 'Filled' funnel plot: overall effect size is estimated using trim and fill as.82 with 95% CI of (-.11,.176).

19 15 R + Mean No of observed studies 15 L + Mean No of observed studies 15 Q + Mean No of observed studies Figure 2: Analytic means (filled symbols) and simulated means (open symbols) for each of the estimators shown, at values of k =,5,1 19

20 12 1 1/standard error Log relative risk Relative risk Figure 3: Top panel is a funnel plot of log relative risks of lung cancer associated with ETS; bottom panel shows overall relative risk and 95% CI before and after allowing for publication bias. 2

1 Introduction The second half of the twentieth century has witnessed an explosive growth in the scientic literature. For example, between 1940 and 19

1 Introduction The second half of the twentieth century has witnessed an explosive growth in the scientic literature. For example, between 1940 and 19 Modeling Publication Bias Using Weighted Distributions in a Bayesian Framework Daniel T. Larose and Dipak K. Dey* Abstract Meta-analysis refers to the quantitative synthesis of evidence from a set of related

More information

Power Calculations for Preclinical Studies Using a K-Sample Rank Test and the Lehmann Alternative Hypothesis

Power Calculations for Preclinical Studies Using a K-Sample Rank Test and the Lehmann Alternative Hypothesis Power Calculations for Preclinical Studies Using a K-Sample Rank Test and the Lehmann Alternative Hypothesis Glenn Heller Department of Epidemiology and Biostatistics, Memorial Sloan-Kettering Cancer Center,

More information

Introduction to Statistical Analysis

Introduction to Statistical Analysis Introduction to Statistical Analysis Changyu Shen Richard A. and Susan F. Smith Center for Outcomes Research in Cardiology Beth Israel Deaconess Medical Center Harvard Medical School Objectives Descriptive

More information

Probability and Probability Distributions. Dr. Mohammed Alahmed

Probability and Probability Distributions. Dr. Mohammed Alahmed Probability and Probability Distributions 1 Probability and Probability Distributions Usually we want to do more with data than just describing them! We might want to test certain specific inferences about

More information

ADJUSTED POWER ESTIMATES IN. Ji Zhang. Biostatistics and Research Data Systems. Merck Research Laboratories. Rahway, NJ

ADJUSTED POWER ESTIMATES IN. Ji Zhang. Biostatistics and Research Data Systems. Merck Research Laboratories. Rahway, NJ ADJUSTED POWER ESTIMATES IN MONTE CARLO EXPERIMENTS Ji Zhang Biostatistics and Research Data Systems Merck Research Laboratories Rahway, NJ 07065-0914 and Dennis D. Boos Department of Statistics, North

More information

Prediction of ordinal outcomes when the association between predictors and outcome diers between outcome levels

Prediction of ordinal outcomes when the association between predictors and outcome diers between outcome levels STATISTICS IN MEDICINE Statist. Med. 2005; 24:1357 1369 Published online 26 November 2004 in Wiley InterScience (www.interscience.wiley.com). DOI: 10.1002/sim.2009 Prediction of ordinal outcomes when the

More information

Mantel-Haenszel Test Statistics. for Correlated Binary Data. Department of Statistics, North Carolina State University. Raleigh, NC

Mantel-Haenszel Test Statistics. for Correlated Binary Data. Department of Statistics, North Carolina State University. Raleigh, NC Mantel-Haenszel Test Statistics for Correlated Binary Data by Jie Zhang and Dennis D. Boos Department of Statistics, North Carolina State University Raleigh, NC 27695-8203 tel: (919) 515-1918 fax: (919)

More information

Augustin: Some Basic Results on the Extension of Quasi-Likelihood Based Measurement Error Correction to Multivariate and Flexible Structural Models

Augustin: Some Basic Results on the Extension of Quasi-Likelihood Based Measurement Error Correction to Multivariate and Flexible Structural Models Augustin: Some Basic Results on the Extension of Quasi-Likelihood Based Measurement Error Correction to Multivariate and Flexible Structural Models Sonderforschungsbereich 386, Paper 196 (2000) Online

More information

Re-sampling and exchangeable arrays University Ave. November Revised January Summary

Re-sampling and exchangeable arrays University Ave. November Revised January Summary Re-sampling and exchangeable arrays Peter McCullagh Department of Statistics University of Chicago 5734 University Ave Chicago Il 60637 November 1997 Revised January 1999 Summary The non-parametric, or

More information

Linear Regression and Its Applications

Linear Regression and Its Applications Linear Regression and Its Applications Predrag Radivojac October 13, 2014 Given a data set D = {(x i, y i )} n the objective is to learn the relationship between features and the target. We usually start

More information

Maximum Likelihood Estimation

Maximum Likelihood Estimation Connexions module: m11446 1 Maximum Likelihood Estimation Clayton Scott Robert Nowak This work is produced by The Connexions Project and licensed under the Creative Commons Attribution License Abstract

More information

Statistical Inference

Statistical Inference Statistical Inference Bernhard Klingenberg Institute of Statistics Graz University of Technology Steyrergasse 17/IV, 8010 Graz www.statistics.tugraz.at February 12, 2008 Outline Estimation: Review of concepts

More information

Antonietta Mira. University of Pavia, Italy. Abstract. We propose a test based on Bonferroni's measure of skewness.

Antonietta Mira. University of Pavia, Italy. Abstract. We propose a test based on Bonferroni's measure of skewness. Distribution-free test for symmetry based on Bonferroni's Measure Antonietta Mira University of Pavia, Italy Abstract We propose a test based on Bonferroni's measure of skewness. The test detects the asymmetry

More information

STATS 200: Introduction to Statistical Inference. Lecture 29: Course review

STATS 200: Introduction to Statistical Inference. Lecture 29: Course review STATS 200: Introduction to Statistical Inference Lecture 29: Course review Course review We started in Lecture 1 with a fundamental assumption: Data is a realization of a random process. The goal throughout

More information

Quantifying heterogeneity in a meta-analysis

Quantifying heterogeneity in a meta-analysis STATISTICS IN MEDICINE Statist. Med. 2002; 21:1539 1558 (DOI: 10.1002/sim.1186) Quantifying heterogeneity in a meta-analysis Julian P. T. Higgins ; and Simon G. Thompson MRC Biostatistics Unit; Institute

More information

Discrete Dependent Variable Models

Discrete Dependent Variable Models Discrete Dependent Variable Models James J. Heckman University of Chicago This draft, April 10, 2006 Here s the general approach of this lecture: Economic model Decision rule (e.g. utility maximization)

More information

Averaging Estimators for Regressions with a Possible Structural Break

Averaging Estimators for Regressions with a Possible Structural Break Averaging Estimators for Regressions with a Possible Structural Break Bruce E. Hansen University of Wisconsin y www.ssc.wisc.edu/~bhansen September 2007 Preliminary Abstract This paper investigates selection

More information

A Monte Carlo Simulation of the Robust Rank- Order Test Under Various Population Symmetry Conditions

A Monte Carlo Simulation of the Robust Rank- Order Test Under Various Population Symmetry Conditions Journal of Modern Applied Statistical Methods Volume 12 Issue 1 Article 7 5-1-2013 A Monte Carlo Simulation of the Robust Rank- Order Test Under Various Population Symmetry Conditions William T. Mickelson

More information

Confidence Intervals of the Simple Difference between the Proportions of a Primary Infection and a Secondary Infection, Given the Primary Infection

Confidence Intervals of the Simple Difference between the Proportions of a Primary Infection and a Secondary Infection, Given the Primary Infection Biometrical Journal 42 (2000) 1, 59±69 Confidence Intervals of the Simple Difference between the Proportions of a Primary Infection and a Secondary Infection, Given the Primary Infection Kung-Jong Lui

More information

r(equivalent): A Simple Effect Size Indicator

r(equivalent): A Simple Effect Size Indicator r(equivalent): A Simple Effect Size Indicator The Harvard community has made this article openly available. Please share how this access benefits you. Your story matters Citation Rosenthal, Robert, and

More information

Non-parametric Inference and Resampling

Non-parametric Inference and Resampling Non-parametric Inference and Resampling Exercises by David Wozabal (Last update. Juni 010) 1 Basic Facts about Rank and Order Statistics 1.1 10 students were asked about the amount of time they spend surfing

More information

Likelihood Ratio Tests and Intersection-Union Tests. Roger L. Berger. Department of Statistics, North Carolina State University

Likelihood Ratio Tests and Intersection-Union Tests. Roger L. Berger. Department of Statistics, North Carolina State University Likelihood Ratio Tests and Intersection-Union Tests by Roger L. Berger Department of Statistics, North Carolina State University Raleigh, NC 27695-8203 Institute of Statistics Mimeo Series Number 2288

More information

Rewrap ECON November 18, () Rewrap ECON 4135 November 18, / 35

Rewrap ECON November 18, () Rewrap ECON 4135 November 18, / 35 Rewrap ECON 4135 November 18, 2011 () Rewrap ECON 4135 November 18, 2011 1 / 35 What should you now know? 1 What is econometrics? 2 Fundamental regression analysis 1 Bivariate regression 2 Multivariate

More information

Performance of Low Density Parity Check Codes. as a Function of Actual and Assumed Noise Levels. David J.C. MacKay & Christopher P.

Performance of Low Density Parity Check Codes. as a Function of Actual and Assumed Noise Levels. David J.C. MacKay & Christopher P. Performance of Low Density Parity Check Codes as a Function of Actual and Assumed Noise Levels David J.C. MacKay & Christopher P. Hesketh Cavendish Laboratory, Cambridge, CB3 HE, United Kingdom. mackay@mrao.cam.ac.uk

More information

An introduction to biostatistics: part 1

An introduction to biostatistics: part 1 An introduction to biostatistics: part 1 Cavan Reilly September 6, 2017 Table of contents Introduction to data analysis Uncertainty Probability Conditional probability Random variables Discrete random

More information

G. Larry Bretthorst. Washington University, Department of Chemistry. and. C. Ray Smith

G. Larry Bretthorst. Washington University, Department of Chemistry. and. C. Ray Smith in Infrared Systems and Components III, pp 93.104, Robert L. Caswell ed., SPIE Vol. 1050, 1989 Bayesian Analysis of Signals from Closely-Spaced Objects G. Larry Bretthorst Washington University, Department

More information

TUTORIAL IN BIOSTATISTICS META-ANALYSIS: FORMULATING, EVALUATING, COMBINING, AND REPORTING

TUTORIAL IN BIOSTATISTICS META-ANALYSIS: FORMULATING, EVALUATING, COMBINING, AND REPORTING STATISTICS IN MEDICINE Statist. Med. 18, 321 359 (1999) TUTORIAL IN BIOSTATISTICS META-ANALYSIS: FORMULATING, EVALUATING, COMBINING, AND REPORTING SHARON-LISE T. NORMAND Department of Health Care Policy,

More information

Learning with Ensembles: How. over-tting can be useful. Anders Krogh Copenhagen, Denmark. Abstract

Learning with Ensembles: How. over-tting can be useful. Anders Krogh Copenhagen, Denmark. Abstract Published in: Advances in Neural Information Processing Systems 8, D S Touretzky, M C Mozer, and M E Hasselmo (eds.), MIT Press, Cambridge, MA, pages 190-196, 1996. Learning with Ensembles: How over-tting

More information

Hypothesis Testing. Part I. James J. Heckman University of Chicago. Econ 312 This draft, April 20, 2006

Hypothesis Testing. Part I. James J. Heckman University of Chicago. Econ 312 This draft, April 20, 2006 Hypothesis Testing Part I James J. Heckman University of Chicago Econ 312 This draft, April 20, 2006 1 1 A Brief Review of Hypothesis Testing and Its Uses values and pure significance tests (R.A. Fisher)

More information

A characterization of consistency of model weights given partial information in normal linear models

A characterization of consistency of model weights given partial information in normal linear models Statistics & Probability Letters ( ) A characterization of consistency of model weights given partial information in normal linear models Hubert Wong a;, Bertrand Clare b;1 a Department of Health Care

More information

Managing Uncertainty

Managing Uncertainty Managing Uncertainty Bayesian Linear Regression and Kalman Filter December 4, 2017 Objectives The goal of this lab is multiple: 1. First it is a reminder of some central elementary notions of Bayesian

More information

EXAMINATION: QUANTITATIVE EMPIRICAL METHODS. Yale University. Department of Political Science

EXAMINATION: QUANTITATIVE EMPIRICAL METHODS. Yale University. Department of Political Science EXAMINATION: QUANTITATIVE EMPIRICAL METHODS Yale University Department of Political Science January 2014 You have seven hours (and fifteen minutes) to complete the exam. You can use the points assigned

More information

Inference for correlated effect sizes using multiple univariate meta-analyses

Inference for correlated effect sizes using multiple univariate meta-analyses Europe PMC Funders Group Author Manuscript Published in final edited form as: Stat Med. 2016 April 30; 35(9): 1405 1422. doi:10.1002/sim.6789. Inference for correlated effect sizes using multiple univariate

More information

Stochastic dominance with imprecise information

Stochastic dominance with imprecise information Stochastic dominance with imprecise information Ignacio Montes, Enrique Miranda, Susana Montes University of Oviedo, Dep. of Statistics and Operations Research. Abstract Stochastic dominance, which is

More information

Estimators for the binomial distribution that dominate the MLE in terms of Kullback Leibler risk

Estimators for the binomial distribution that dominate the MLE in terms of Kullback Leibler risk Ann Inst Stat Math (0) 64:359 37 DOI 0.007/s0463-00-036-3 Estimators for the binomial distribution that dominate the MLE in terms of Kullback Leibler risk Paul Vos Qiang Wu Received: 3 June 009 / Revised:

More information

An accurate test for homogeneity of odds ratios based on Cochran s Q-statistic

An accurate test for homogeneity of odds ratios based on Cochran s Q-statistic Kulinskaya and Dollinger TECHNICAL ADVANCE An accurate test for homogeneity of odds ratios based on Cochran s Q-statistic Elena Kulinskaya 1* and Michael B Dollinger 2 * Correspondence: e.kulinskaya@uea.ac.uk

More information

Sigmaplot di Systat Software

Sigmaplot di Systat Software Sigmaplot di Systat Software SigmaPlot Has Extensive Statistical Analysis Features SigmaPlot is now bundled with SigmaStat as an easy-to-use package for complete graphing and data analysis. The statistical

More information

Approximate and Fiducial Confidence Intervals for the Difference Between Two Binomial Proportions

Approximate and Fiducial Confidence Intervals for the Difference Between Two Binomial Proportions Approximate and Fiducial Confidence Intervals for the Difference Between Two Binomial Proportions K. Krishnamoorthy 1 and Dan Zhang University of Louisiana at Lafayette, Lafayette, LA 70504, USA SUMMARY

More information

Introduction and mathematical preliminaries

Introduction and mathematical preliminaries Chapter Introduction and mathematical preliminaries Contents. Motivation..................................2 Finite-digit arithmetic.......................... 2.3 Errors in numerical calculations.....................

More information

Business Statistics. Tommaso Proietti. Linear Regression. DEF - Università di Roma 'Tor Vergata'

Business Statistics. Tommaso Proietti. Linear Regression. DEF - Università di Roma 'Tor Vergata' Business Statistics Tommaso Proietti DEF - Università di Roma 'Tor Vergata' Linear Regression Specication Let Y be a univariate quantitative response variable. We model Y as follows: Y = f(x) + ε where

More information

Selection on Observables: Propensity Score Matching.

Selection on Observables: Propensity Score Matching. Selection on Observables: Propensity Score Matching. Department of Economics and Management Irene Brunetti ireneb@ec.unipi.it 24/10/2017 I. Brunetti Labour Economics in an European Perspective 24/10/2017

More information

CONTENTS OF DAY 2. II. Why Random Sampling is Important 10 A myth, an urban legend, and the real reason NOTES FOR SUMMER STATISTICS INSTITUTE COURSE

CONTENTS OF DAY 2. II. Why Random Sampling is Important 10 A myth, an urban legend, and the real reason NOTES FOR SUMMER STATISTICS INSTITUTE COURSE 1 2 CONTENTS OF DAY 2 I. More Precise Definition of Simple Random Sample 3 Connection with independent random variables 4 Problems with small populations 9 II. Why Random Sampling is Important 10 A myth,

More information

percentage of problems with ( 1 lb/ub ) <= x percentage of problems with ( 1 lb/ub ) <= x n= n=8 n= n=32 n= log10( x )

percentage of problems with ( 1 lb/ub ) <= x percentage of problems with ( 1 lb/ub ) <= x n= n=8 n= n=32 n= log10( x ) Soft vs. Hard Bounds in Probabilistic Robustness Analysis Xiaoyun Zhu Yun Huang John Doyle California Institute of Technology, Pasadena, CA 925 Abstract The relationship between soft vs. hard bounds and

More information

An explicit dynamic model of segregation

An explicit dynamic model of segregation An explicit dynamic model of segregation Gian-Italo Bischi Dipartimento di Economia e Metodi Quantitativi Universita di Urbino "Carlo Bo" e-mail:gian.bischi@uniurb.it Ugo Merlone Dip. di Statistica e Matematica

More information

Linear Models 1. Isfahan University of Technology Fall Semester, 2014

Linear Models 1. Isfahan University of Technology Fall Semester, 2014 Linear Models 1 Isfahan University of Technology Fall Semester, 2014 References: [1] G. A. F., Seber and A. J. Lee (2003). Linear Regression Analysis (2nd ed.). Hoboken, NJ: Wiley. [2] A. C. Rencher and

More information

2 Inference for Multinomial Distribution

2 Inference for Multinomial Distribution Markov Chain Monte Carlo Methods Part III: Statistical Concepts By K.B.Athreya, Mohan Delampady and T.Krishnan 1 Introduction In parts I and II of this series it was shown how Markov chain Monte Carlo

More information

On estimation of the Poisson parameter in zero-modied Poisson models

On estimation of the Poisson parameter in zero-modied Poisson models Computational Statistics & Data Analysis 34 (2000) 441 459 www.elsevier.com/locate/csda On estimation of the Poisson parameter in zero-modied Poisson models Ekkehart Dietz, Dankmar Bohning Department of

More information

Spring 2012 Math 541B Exam 1

Spring 2012 Math 541B Exam 1 Spring 2012 Math 541B Exam 1 1. A sample of size n is drawn without replacement from an urn containing N balls, m of which are red and N m are black; the balls are otherwise indistinguishable. Let X denote

More information

What p values really mean (and why I should care) Francis C. Dane, PhD

What p values really mean (and why I should care) Francis C. Dane, PhD What p values really mean (and why I should care) Francis C. Dane, PhD Session Objectives Understand the statistical decision process Appreciate the limitations of interpreting p values Value the use of

More information

Essentials of Intermediate Algebra

Essentials of Intermediate Algebra Essentials of Intermediate Algebra BY Tom K. Kim, Ph.D. Peninsula College, WA Randy Anderson, M.S. Peninsula College, WA 9/24/2012 Contents 1 Review 1 2 Rules of Exponents 2 2.1 Multiplying Two Exponentials

More information

Bootstrap tests. Patrick Breheny. October 11. Bootstrap vs. permutation tests Testing for equality of location

Bootstrap tests. Patrick Breheny. October 11. Bootstrap vs. permutation tests Testing for equality of location Bootstrap tests Patrick Breheny October 11 Patrick Breheny STA 621: Nonparametric Statistics 1/14 Introduction Conditioning on the observed data to obtain permutation tests is certainly an important idea

More information

STAT 830 Hypothesis Testing

STAT 830 Hypothesis Testing STAT 830 Hypothesis Testing Richard Lockhart Simon Fraser University STAT 830 Fall 2018 Richard Lockhart (Simon Fraser University) STAT 830 Hypothesis Testing STAT 830 Fall 2018 1 / 30 Purposes of These

More information

Meta-analysis. 21 May Per Kragh Andersen, Biostatistics, Dept. Public Health

Meta-analysis. 21 May Per Kragh Andersen, Biostatistics, Dept. Public Health Meta-analysis 21 May 2014 www.biostat.ku.dk/~pka Per Kragh Andersen, Biostatistics, Dept. Public Health pka@biostat.ku.dk 1 Meta-analysis Background: each single study cannot stand alone. This leads to

More information

LETTER TO THE EDITOR

LETTER TO THE EDITOR STATISTICS IN MEDICINE Statist. Med. 2004; 23:685 693 LETTER TO THE EDITOR Comment on: Condence limits for the ratio of two rates based on likelihood scores: non-iterative method by P. L. Graham, K. Mengersen

More information

Group Sequential Tests for Delayed Responses. Christopher Jennison. Lisa Hampson. Workshop on Special Topics on Sequential Methodology

Group Sequential Tests for Delayed Responses. Christopher Jennison. Lisa Hampson. Workshop on Special Topics on Sequential Methodology Group Sequential Tests for Delayed Responses Christopher Jennison Department of Mathematical Sciences, University of Bath, UK http://people.bath.ac.uk/mascj Lisa Hampson Department of Mathematics and Statistics,

More information

Binomial Distribution *

Binomial Distribution * OpenStax-CNX module: m11024 1 Binomial Distribution * David Lane This work is produced by OpenStax-CNX and licensed under the Creative Commons Attribution License 3.0 When you ip a coin, there are two

More information

In: Proc. BENELEARN-98, 8th Belgian-Dutch Conference on Machine Learning, pp 9-46, 998 Linear Quadratic Regulation using Reinforcement Learning Stephan ten Hagen? and Ben Krose Department of Mathematics,

More information

Illustrating the Implicit BIC Prior. Richard Startz * revised June Abstract

Illustrating the Implicit BIC Prior. Richard Startz * revised June Abstract Illustrating the Implicit BIC Prior Richard Startz * revised June 2013 Abstract I show how to find the uniform prior implicit in using the Bayesian Information Criterion to consider a hypothesis about

More information

Example continued. Math 425 Intro to Probability Lecture 37. Example continued. Example

Example continued. Math 425 Intro to Probability Lecture 37. Example continued. Example continued : Coin tossing Math 425 Intro to Probability Lecture 37 Kenneth Harris kaharri@umich.edu Department of Mathematics University of Michigan April 8, 2009 Consider a Bernoulli trials process with

More information

Schiestel s Derivation of the Epsilon Equation and Two Equation Modeling of Rotating Turbulence

Schiestel s Derivation of the Epsilon Equation and Two Equation Modeling of Rotating Turbulence NASA/CR-21-2116 ICASE Report No. 21-24 Schiestel s Derivation of the Epsilon Equation and Two Equation Modeling of Rotating Turbulence Robert Rubinstein NASA Langley Research Center, Hampton, Virginia

More information

Mixture distributions in Exams MLC/3L and C/4

Mixture distributions in Exams MLC/3L and C/4 Making sense of... Mixture distributions in Exams MLC/3L and C/4 James W. Daniel Jim Daniel s Actuarial Seminars www.actuarialseminars.com February 1, 2012 c Copyright 2012 by James W. Daniel; reproduction

More information

Introduction and Descriptive Statistics p. 1 Introduction to Statistics p. 3 Statistics, Science, and Observations p. 5 Populations and Samples p.

Introduction and Descriptive Statistics p. 1 Introduction to Statistics p. 3 Statistics, Science, and Observations p. 5 Populations and Samples p. Preface p. xi Introduction and Descriptive Statistics p. 1 Introduction to Statistics p. 3 Statistics, Science, and Observations p. 5 Populations and Samples p. 6 The Scientific Method and the Design of

More information

6.867 Machine Learning

6.867 Machine Learning 6.867 Machine Learning Problem set 1 Solutions Thursday, September 19 What and how to turn in? Turn in short written answers to the questions explicitly stated, and when requested to explain or prove.

More information

var D (B) = var(b? E D (B)) = var(b)? cov(b; D)(var(D))?1 cov(d; B) (2) Stone [14], and Hartigan [9] are among the rst to discuss the role of such ass

var D (B) = var(b? E D (B)) = var(b)? cov(b; D)(var(D))?1 cov(d; B) (2) Stone [14], and Hartigan [9] are among the rst to discuss the role of such ass BAYES LINEAR ANALYSIS [This article appears in the Encyclopaedia of Statistical Sciences, Update volume 3, 1998, Wiley.] The Bayes linear approach is concerned with problems in which we want to combine

More information

Probability: Why do we care? Lecture 2: Probability and Distributions. Classical Definition. What is Probability?

Probability: Why do we care? Lecture 2: Probability and Distributions. Classical Definition. What is Probability? Probability: Why do we care? Lecture 2: Probability and Distributions Sandy Eckel seckel@jhsph.edu 22 April 2008 Probability helps us by: Allowing us to translate scientific questions into mathematical

More information

Evaluating the Performance of Estimators (Section 7.3)

Evaluating the Performance of Estimators (Section 7.3) Evaluating the Performance of Estimators (Section 7.3) Example: Suppose we observe X 1,..., X n iid N(θ, σ 2 0 ), with σ2 0 known, and wish to estimate θ. Two possible estimators are: ˆθ = X sample mean

More information

Early selection in a randomized phase II clinical trial

Early selection in a randomized phase II clinical trial STATISTICS IN MEDICINE Statist. Med. 2002; 21:1711 1726 (DOI: 10.1002/sim.1150) Early selection in a randomized phase II clinical trial Seth M. Steinberg ; and David J. Venzon Biostatistics and Data Management

More information

Confidence Distribution

Confidence Distribution Confidence Distribution Xie and Singh (2013): Confidence distribution, the frequentist distribution estimator of a parameter: A Review Céline Cunen, 15/09/2014 Outline of Article Introduction The concept

More information

Küchenhoff: An exact algorithm for estimating breakpoints in segmented generalized linear models

Küchenhoff: An exact algorithm for estimating breakpoints in segmented generalized linear models Küchenhoff: An exact algorithm for estimating breakpoints in segmented generalized linear models Sonderforschungsbereich 386, Paper 27 (996) Online unter: http://epububuni-muenchende/ Projektpartner An

More information

PROD. TYPE: COM ARTICLE IN PRESS. Computational Statistics & Data Analysis ( )

PROD. TYPE: COM ARTICLE IN PRESS. Computational Statistics & Data Analysis ( ) COMSTA 28 pp: -2 (col.fig.: nil) PROD. TYPE: COM ED: JS PAGN: Usha.N -- SCAN: Bindu Computational Statistics & Data Analysis ( ) www.elsevier.com/locate/csda Transformation approaches for the construction

More information

1 Introduction Overview of the Book How to Use this Book Introduction to R 10

1 Introduction Overview of the Book How to Use this Book Introduction to R 10 List of Tables List of Figures Preface xiii xv xvii 1 Introduction 1 1.1 Overview of the Book 3 1.2 How to Use this Book 7 1.3 Introduction to R 10 1.3.1 Arithmetic Operations 10 1.3.2 Objects 12 1.3.3

More information

Review of Statistics 101

Review of Statistics 101 Review of Statistics 101 We review some important themes from the course 1. Introduction Statistics- Set of methods for collecting/analyzing data (the art and science of learning from data). Provides methods

More information

Equivalence of random-effects and conditional likelihoods for matched case-control studies

Equivalence of random-effects and conditional likelihoods for matched case-control studies Equivalence of random-effects and conditional likelihoods for matched case-control studies Ken Rice MRC Biostatistics Unit, Cambridge, UK January 8 th 4 Motivation Study of genetic c-erbb- exposure and

More information

Core Courses for Students Who Enrolled Prior to Fall 2018

Core Courses for Students Who Enrolled Prior to Fall 2018 Biostatistics and Applied Data Analysis Students must take one of the following two sequences: Sequence 1 Biostatistics and Data Analysis I (PHP 2507) This course, the first in a year long, two-course

More information

Reports of the Institute of Biostatistics

Reports of the Institute of Biostatistics Reports of the Institute of Biostatistics No 02 / 2008 Leibniz University of Hannover Natural Sciences Faculty Title: Properties of confidence intervals for the comparison of small binomial proportions

More information

Potential Cases, Methodologies, and Strategies of Synthesis of. Solutions in Distributed Expert Systems. Minjie Zhang

Potential Cases, Methodologies, and Strategies of Synthesis of. Solutions in Distributed Expert Systems. Minjie Zhang Potential Cases, Methodologies, and Strategies of Synthesis of Solutions in Distributed Expert Systems Minjie Zhang School of Computer, and Information Sciences Edith Cowan University, WA 6027, Australia

More information

\It is important that there be options in explication, and equally important that. Joseph Retzer, Market Probe, Inc., Milwaukee WI

\It is important that there be options in explication, and equally important that. Joseph Retzer, Market Probe, Inc., Milwaukee WI Measuring The Information Content Of Regressors In The Linear Model Using Proc Reg and SAS IML \It is important that there be options in explication, and equally important that the candidates have clear

More information

Traditional Pathway: Algebra II

Traditional Pathway: Algebra II Traditional Pathway: Algebra II Building on their work with linear, quadratic, and exponential functions, students extend their repertoire of functions to include polynomial, rational, and radical functions.

More information

Richard D Riley was supported by funding from a multivariate meta-analysis grant from

Richard D Riley was supported by funding from a multivariate meta-analysis grant from Bayesian bivariate meta-analysis of correlated effects: impact of the prior distributions on the between-study correlation, borrowing of strength, and joint inferences Author affiliations Danielle L Burke

More information

X = X X n, + X 2

X = X X n, + X 2 CS 70 Discrete Mathematics for CS Fall 2003 Wagner Lecture 22 Variance Question: At each time step, I flip a fair coin. If it comes up Heads, I walk one step to the right; if it comes up Tails, I walk

More information

Logistic regression: Why we often can do what we think we can do. Maarten Buis 19 th UK Stata Users Group meeting, 10 Sept. 2015

Logistic regression: Why we often can do what we think we can do. Maarten Buis 19 th UK Stata Users Group meeting, 10 Sept. 2015 Logistic regression: Why we often can do what we think we can do Maarten Buis 19 th UK Stata Users Group meeting, 10 Sept. 2015 1 Introduction Introduction - In 2010 Carina Mood published an overview article

More information

Neutral Bayesian reference models for incidence rates of (rare) clinical events

Neutral Bayesian reference models for incidence rates of (rare) clinical events Neutral Bayesian reference models for incidence rates of (rare) clinical events Jouni Kerman Statistical Methodology, Novartis Pharma AG, Basel BAYES2012, May 10, Aachen Outline Motivation why reference

More information

1/sqrt(B) convergence 1/B convergence B

1/sqrt(B) convergence 1/B convergence B The Error Coding Method and PICTs Gareth James and Trevor Hastie Department of Statistics, Stanford University March 29, 1998 Abstract A new family of plug-in classication techniques has recently been

More information

401 Review. 6. Power analysis for one/two-sample hypothesis tests and for correlation analysis.

401 Review. 6. Power analysis for one/two-sample hypothesis tests and for correlation analysis. 401 Review Major topics of the course 1. Univariate analysis 2. Bivariate analysis 3. Simple linear regression 4. Linear algebra 5. Multiple regression analysis Major analysis methods 1. Graphical analysis

More information

Introduction to Machine Learning. Lecture 2

Introduction to Machine Learning. Lecture 2 Introduction to Machine Learning Lecturer: Eran Halperin Lecture 2 Fall Semester Scribe: Yishay Mansour Some of the material was not presented in class (and is marked with a side line) and is given for

More information

Achieve Recommended Pathway: Algebra II

Achieve Recommended Pathway: Algebra II Units Standard Clusters Mathematical Practice Standards Perform arithmetic operations with complex numbers. Use complex numbers in polynomial identities and equations. Interpret the structure of expressions.

More information

EVALUATING THE REPEATABILITY OF TWO STUDIES OF A LARGE NUMBER OF OBJECTS: MODIFIED KENDALL RANK-ORDER ASSOCIATION TEST

EVALUATING THE REPEATABILITY OF TWO STUDIES OF A LARGE NUMBER OF OBJECTS: MODIFIED KENDALL RANK-ORDER ASSOCIATION TEST EVALUATING THE REPEATABILITY OF TWO STUDIES OF A LARGE NUMBER OF OBJECTS: MODIFIED KENDALL RANK-ORDER ASSOCIATION TEST TIAN ZHENG, SHAW-HWA LO DEPARTMENT OF STATISTICS, COLUMBIA UNIVERSITY Abstract. In

More information

Statistics 3858 : Maximum Likelihood Estimators

Statistics 3858 : Maximum Likelihood Estimators Statistics 3858 : Maximum Likelihood Estimators 1 Method of Maximum Likelihood In this method we construct the so called likelihood function, that is L(θ) = L(θ; X 1, X 2,..., X n ) = f n (X 1, X 2,...,

More information

The Bootstrap is Inconsistent with Probability Theory

The Bootstrap is Inconsistent with Probability Theory The Bootstrap is Inconsistent with Probability Theory David H. Wolpert SFI WORKING PAPER: 1995-10-091 SFI Working Papers contain accounts of scientific work of the author(s) and do not necessarily represent

More information

LECTURE 15 + C+F. = A 11 x 1x1 +2A 12 x 1x2 + A 22 x 2x2 + B 1 x 1 + B 2 x 2. xi y 2 = ~y 2 (x 1 ;x 2 ) x 2 = ~x 2 (y 1 ;y 2 1

LECTURE 15 + C+F. = A 11 x 1x1 +2A 12 x 1x2 + A 22 x 2x2 + B 1 x 1 + B 2 x 2. xi y 2 = ~y 2 (x 1 ;x 2 ) x 2 = ~x 2 (y 1 ;y 2  1 LECTURE 5 Characteristics and the Classication of Second Order Linear PDEs Let us now consider the case of a general second order linear PDE in two variables; (5.) where (5.) 0 P i;j A ij xix j + P i,

More information

Data Analysis and Uncertainty Part 1: Random Variables

Data Analysis and Uncertainty Part 1: Random Variables Data Analysis and Uncertainty Part 1: Random Variables Instructor: Sargur N. University at Buffalo The State University of New York srihari@cedar.buffalo.edu 1 Topics 1. Why uncertainty exists? 2. Dealing

More information

EXAM IN TMA4255 EXPERIMENTAL DESIGN AND APPLIED STATISTICAL METHODS

EXAM IN TMA4255 EXPERIMENTAL DESIGN AND APPLIED STATISTICAL METHODS Norges teknisk naturvitenskapelige universitet Institutt for matematiske fag Side 1 av 8 Contact during exam: Bo Lindqvist Tel. 975 89 418 EXAM IN TMA4255 EXPERIMENTAL DESIGN AND APPLIED STATISTICAL METHODS

More information

460 HOLGER DETTE AND WILLIAM J STUDDEN order to examine how a given design behaves in the model g` with respect to the D-optimality criterion one uses

460 HOLGER DETTE AND WILLIAM J STUDDEN order to examine how a given design behaves in the model g` with respect to the D-optimality criterion one uses Statistica Sinica 5(1995), 459-473 OPTIMAL DESIGNS FOR POLYNOMIAL REGRESSION WHEN THE DEGREE IS NOT KNOWN Holger Dette and William J Studden Technische Universitat Dresden and Purdue University Abstract:

More information

Prerequisite: STATS 7 or STATS 8 or AP90 or (STATS 120A and STATS 120B and STATS 120C). AP90 with a minimum score of 3

Prerequisite: STATS 7 or STATS 8 or AP90 or (STATS 120A and STATS 120B and STATS 120C). AP90 with a minimum score of 3 University of California, Irvine 2017-2018 1 Statistics (STATS) Courses STATS 5. Seminar in Data Science. 1 Unit. An introduction to the field of Data Science; intended for entering freshman and transfers.

More information

Statistical mechanics of classical systems

Statistical mechanics of classical systems Statistical mechanics of classical systems States and ensembles A microstate of a statistical system is specied by the complete information about the states of all microscopic degrees of freedom of the

More information

Previous lecture. P-value based combination. Fixed vs random effects models. Meta vs. pooled- analysis. New random effects testing.

Previous lecture. P-value based combination. Fixed vs random effects models. Meta vs. pooled- analysis. New random effects testing. Previous lecture P-value based combination. Fixed vs random effects models. Meta vs. pooled- analysis. New random effects testing. Interaction Outline: Definition of interaction Additive versus multiplicative

More information

Semiparametric Regression

Semiparametric Regression Semiparametric Regression Patrick Breheny October 22 Patrick Breheny Survival Data Analysis (BIOS 7210) 1/23 Introduction Over the past few weeks, we ve introduced a variety of regression models under

More information

Non-parametric Inference and Resampling

Non-parametric Inference and Resampling Non-parametric Inference and Resampling Exercises by David Wozabal (Last update 3. Juni 2013) 1 Basic Facts about Rank and Order Statistics 1.1 10 students were asked about the amount of time they spend

More information

Comparing Measures of Central Tendency *

Comparing Measures of Central Tendency * OpenStax-CNX module: m11011 1 Comparing Measures of Central Tendency * David Lane This work is produced by OpenStax-CNX and licensed under the Creative Commons Attribution License 1.0 1 Comparing Measures

More information

ON STATISTICAL INFERENCE UNDER ASYMMETRIC LOSS. Abstract. We introduce a wide class of asymmetric loss functions and show how to obtain

ON STATISTICAL INFERENCE UNDER ASYMMETRIC LOSS. Abstract. We introduce a wide class of asymmetric loss functions and show how to obtain ON STATISTICAL INFERENCE UNDER ASYMMETRIC LOSS FUNCTIONS Michael Baron Received: Abstract We introduce a wide class of asymmetric loss functions and show how to obtain asymmetric-type optimal decision

More information