Hypothesis Testing Based on the Maximum of Two Statistics from Weighted and Unweighted Estimating Equations

Size: px
Start display at page:

Download "Hypothesis Testing Based on the Maximum of Two Statistics from Weighted and Unweighted Estimating Equations"

Transcription

1 Hypothesis Testing Based on the Maximum of Two Statistics from Weighted and Unweighted Estimating Equations Takeshi Emura and Hisayuki Tsukuma Abstract For testing the regression parameter in multivariate failure time data analysis, the solution to partial likelihood score equation provides useful test statistic. Proper weight insertion in the score equation allows us to use more powerful test statistic, especially when failure times within a cluster are strongly correlated. In some cases two solutions to weighted and unweighted score equations, which are calculated from the same data, are very different. This means two statistics based on the solutions to weighted and unweighted score equations yield different results of the testing. If one has no prior information in advance to prefer one statistic over the other, it is natural to consider the maximum of the two test statistics for testing the regression parameter. In this paper, we discuss the way to approximate the asymptotic distribution of the maximum statistic, and construct hypothesis testing based on the maximum statistic. Simulation studies are carried out to see the validity of the approximation of the maximum statistic and its power properties. 1 Introduction Let (T i1,..., T in ), i = 1,..., K, be K clusters of n-dimensional multivariate failure time data and let (z i1,..., z in ) be K clusters of the corresponding p-dimensional covariate vectors. Then, marginal Cox (1972)-type hazard models for the jth member in cluster i, with common regression parameter and baseline hazard function, formulates the hazard function of T ij given z ij as λ(t z ij ) = e zt ijβ λ (t), i = 1,..., K, j = 1,..., n. (1.1) Here, β is a p-vector of unknown common regression parameters and λ (t) is an unknown, unspecified common baseline hazard function. Cai and Prentice (1997) introduced estimating equation for the regression parameter β, by incorporating weight matrices into partial likelihood score equation. They proved that the solution to the estimating equation, ˆβ, is consistent and asymptotically normal. Let (C i1,..., C in ) be censoring time. Assume that, given z ij, T ij and C ij are independent for all i, j. Considering the right censoring situation, observed data are X ij = min{t ij, C ij }, δ ij = I(T ij C ij ), and z ij for every i, j. Assume that (T i1,..., T in, C i1,..., C in, z i1,..., z in ) are independent and identically distributed(i.i.d.)from the same probability model. We use only time independent covariates for brevity, but our methods can easily be extended to the case of time dependent covariates with finite variations. Define N ij (t) = I(X ij t, δ ij = 1), Y ij (t) = I(X ij t), then, M ij (t) = N ij (t) t Y ij (s)e zt ijβ λ (s)ds Graduate School of Science and Technology, Chiba University, 1-33 Yayoi-cho, Chiba , Japan; s.s.takeshi@ezweb.ne.jp The institute of Statistical Mathematics, Minami-Azabu, Minato-ku, Tokyo , Japan; tsukuma@ism.ac.jp 1

2 is a martingale with respect to a marginal filtration (see Spiekerman and Lin (1998)). An estimated martingale replaces this martingale with its empirical counterpart, and thus, ˆM ij (β, t) = N ij (t) t t Y ij (s)e zt ijβ dˆλ (β, s), where ˆΛ (β, t) = K n i=1 dn ij(s)/( K n k=1 l=1 Y kl(s)e zt β kl ) and β is an arbitrary p-vector. Then, estimating equation of Cai and Prentice (1997) are defined as i=1 { k=1 } z ik w ikj (β, s) d ˆM ij (β, s) =. Here K weight matrices w i11 (β, t) w i1n (β, t) W i (β, t) =....., i = 1,... K, (1.2) w in1 (β, t) w inn (β, t) are arbitrary functions of β and t and T > is a fixed constant. In case every W i (β, t) is an identity matrix, (1) is partial likelihood score equation, and we call these unweighted estimating equation throughout this paper. Cai and Prentice (1997) showed in their simulation studies that the inverses of some correlation matrices between martingales allow the solution to the estimating equation to have smaller variance than the solution to the unweighted estimating equation. One might not know whether weight insertion achieves great efficiency prior to the analysis of the experiment. It is expected that weight insertion always reduces the variance of the regression estimator when the weight matrices (1.2) are properly selected and are completely known. However, the proper weight matrices have generally to be estimated from data, and the deviation of the weight matrix estimators from the true values often causes the efficiency of the regression parameter estimator to reduce slightly. One might calculate two test statistics from both weighted estimating and unweighed estimating equation. Usually the two analyses yield very similar results, but in some cases, the results can be very different. On the nonparametric test of the equality of two survival curves, the quite similar problem of whether one uses the Mantel s logrank test or the generalized Wilcoxon test is discussed earlier by Tarone (1981). He suggests the maximum of the logrank test statistic and the Wilcoxon test statistic when none of prior information to prefer one statistic over the other is given. In this paper, we consider the problem of the level α hypothesis testing of H : β = β in favor of the alternative H 1 : β β under model (1.1). Let ˆβ 1 be the solution to partial likelihood score equation, or unweighted estimating equation, U 1 (β) = i=1 z ij d ˆM ij (β, s) =, (1.3) and let ˆβ 2 be the solution to weighted estimating equation U 2 (β) = i=1 { k=1 } z ik w ikj (β, s) d ˆM ij (β, s) =. (1.4) For each l(l = 1, 2), asymptotic normality of K(ˆβ l β ) (l = 1, 2) and consistency of their variance estimators ˆΩ l (ˆβ l ), which are described in Cai and Prentice (1997), yields two different Wald type hypothesis testings: One is the test which rejects H whenever Q 1 = K(ˆβ 1 β ) T ˆΩ1 (ˆβ 1 ) 1 (ˆβ 1 β ) > χ 2 p(α) (1.5) 2

3 and the other is the test which rejects H whenever Q 2 = K(ˆβ 2 β ) T ˆΩ2 (ˆβ 2 ) 1 (ˆβ 2 β ) > χ 2 p(α). (1.6) Here χ 2 p(α) is a percentage point of chi square distribution with degrees of freedom p. Since one might not be able to select a statistic prior to the analysis of experiment, we will examine the next test which rejects H whenever Q max = max{q 1, Q 2 } > ˆχ 2 max(α). (1.7) Here ˆχ 2 max(α), which is depend on data, is an estimator of a percentage point we describe later. The use of the third statistic (1.7) is regarded as the direct application of Tarone (1981) s idea to the marginal Cox-type regression analysis. In Section 2, we examine the asymptotic distribution of Q max, and construct an asymptotically level α test based on Q max. In Section 3, simulation studies are provided under several parameter configurations to see the validity of the approximation of the statistic Q max, and to compare the power properties of the three test statistics. In Section 4, it is concluded from our simulation studies that the empirical power of the test (1.7) is higher than the power of either the test (1.5) or the test (1.6) for most of the alternatives. 2 Theoretical results and test based on Q max 2.1 Notation and theoretical results We consider β an arbitrary element in a compact set B( R p ), which contains the true value β. Define a 1,ij (β, t) = z ij and a 2,ij (β, t) = n k=1 z ikw ikj (β, t), and the following notation: S () (β, t) = 1 K S (1) (β, t) = 1 K S (2) l (β, t) = 1 K S (3) l (β, t) = 1 K i=1 i=1 i=1 i=1 E l (β, t) = S(2) l (β, t) S () (β, t) Y ij (t)e zt ijβ, Y ij (t)z ij e zt ijβ, Y ij (t)a l,ij (β, t)e zt ijβ (l = 1, 2), Y ij (t)a l,ij (β, t)z T ije zt ijβ (l = 1, 2), (l = 1, 2), V l (β, t) = S(3) l (β, t) S () (β, t) S(2) l (β, t)s (1) (β, t) T S () (β, t) 2 (l = 1, 2). Under the model assumptions of the listed earlier the regularity conditions outlined in Section 3 of Cai and Prentice (1997), there exist non-random functions, on B [, T ], s (r) l (β, t), (r =, 1, 2, 3, 4), e l (β, t), and v l (β, t) such that sup β,t S (r) l (β, t) s (r) l (β, t) P, sup β,t E l (β, t) e l (β, t) P and sup β,t V l (β, t) v l (β, t) P (K ) for (l = 1, 2), and the matrix A l (β) = v l (β, s)s () (β, s)λ (s)ds 3

4 is assumed to be positive definite and symmetric. Define D l,i = { } a l,ij (β, s) e l (β, s) dm ij (s) (l = 1, 2), then K random 2p-vertors (D T 1,i, D T 2,i) are i.i.d. and the covariance matrices between D l,i and D m,i are { { } } Σ lm (β ) = E a l,1j (β, s) e l (β, s) dm 1j (s) { { } T a m,1j (β, s) e m (β, s) dm 1j (s)} (l, m = 1, 2). Asymptotic distribution of (ˆβ 1, ˆβ 2 ) such that U 1 (ˆβ 1 ) = and U 2 (ˆβ 2 ) = is summarized in the next proposition. Proposition 1 Suppose the model assumptions listed earlier and the regularity conditions A-D of the section 3 in Cai and Prentice (1997, pp 2) hold. Then, ( K(ˆβ 1 β ) T, K(ˆβ 2 β ) T ) d (W T 1, W T 2 ) (K ), where both W 1 and W 2, are p-vectors and (W T 1, W T 2 ) is 2p-dimensional normal distribution with E(W l ) = and E(W l W T m) = A l (β ) 1 Σ lm (β )A m (β ) 1 Ω lm (β ) for l, m = 1, 2. Proof. By the same argument in the Appendix of Cai and Prentice (1997, pp 212), K(ˆβl β ) = 1 K A l (β ) 1 D l,i + o p 1 (1), K i=1 for l = 1, 2, where o p 1 (1) is a p 1 vector with o p (1) elements. To establish multivariate convergence, it is sufficient by the Cramer-Wold device to prove 2 l=1 c T l K(ˆβl β ) d 2 c T l W l for all c 1 R p and c 2 R p. The central limit theorem for i.i.d. random vectors establishes 2 l=1 c T l l=1 1 K A l (β ) 1 d D l,i N(, σ), K i=1 where σ = 2 2 l=1 m=1 ct l A l(β ) 1 Σ lm (β )A m (β ) 1 c l. It is easy to see that the distribution of 2 l=1 ct l W l is equal to N(, σ). If the weight functions are unknown, one needs to replace w ikj (β, s) by its estimator ŵ ikj (β, s). We suspect that, by adding some regularity conditions such as the condition E of Cai and Prentice (1997), Proposition 1 can be extended to the case where weight functions are estimated. Note that the covariance matrix of (W T 1, W T 2 ) can be estimated by the following plug-in estimators and we define them for the estimating equation with both deterministic weights and estimated weights. ˆΩ ll (ˆβ l ) = Âl(ˆβ l ) 1 ˆΣl (ˆβ l )(Âl(ˆβ l ) 1 ) T (l = 1, 2), (2.1) 4

5 ˆΩ lm (ˆβ l, ˆβ m ) = Âl(ˆβ l ) 1 ˆΣlm (ˆβ l, ˆβ m )(Âm(ˆβ m ) 1 ) T (l m), (2.2) Â l (β) = 1 K ˆΣ 11 (β) = 1 K ˆΣ 22 (β) = 1 K ˆΣ 12 (β 1, β 2 ) = 1 K i=1 { i=1 { { i=1 { { i=1 { ˆΣ 21 (β 2, β 1 ) = ˆΣ 12 (β 1, β 2 ) T. ˆV l (β, s)dn ij (s) (l = 1, 2) { zij E 1 (β, s) } d ˆM ij (β, s)} { zij E 1 (β, s) } d ˆM } T ij (β, s), {â2,ij (β, s) Ê2(β, s) } d ˆM ij (β, s)} {â2,ij (β, s) Ê2(β, s) } d ˆM } T ij (β, s), { zij E 1 (β 1, s) } d ˆM ij (β 1, s)} {â2,ij (β 2, s) Ê2(β 2, s) } d ˆM } T ij (β 2, s), We define â 2,ij, Ê 2, and ˆV 2 as a 2,ij, E 2, and V 2 respectively if weights are not estimated, and as a 2,ij, E 2, and V 2 with w ikj (β, s) replaced by its estimator ŵ ikj (β, s) if weights are estimated. The estimators (2.1) are suggested by Cai and Prentice (1997), and they point out that these are consistent estimators for Ω 11 (β ) and Ω 22 (β ). We suggest the estimator (2.2), which is a natural extension of (2.1) to Ω lm (β ) (l, m = 1, 2), and it may be a consistent estimator. 2.2 The test based on the maximum statistics Q max Proposition 1 and the estimator of the covariance matrix, (2.1) and (2.2), allow us to approximate the percentage point of Q max. According to Proposition 1, the distribution of ( K(ˆβ 1 β ) T, K(ˆβ 2 β ) T ) is approximated by (W T 1, W T 2 ). The unknown distribution of (W T 1, W T 2 ) is further approximated by the known 2p-dimensional normal distribution [ Ŵ 1 Ŵ 2 ] ([ N 2p ], [ ˆΩ 11 (ˆβ 1 ) ˆΩ12 (ˆβ 1, ˆβ 2 ) ˆΩ 21 (ˆβ 2, ˆβ 1 ) ˆΩ22 (ˆβ 2 ) A pair of the known random variables (Ŵ T 1 ˆΩ 11 (ˆβ 1 ) 1 Ŵ 1, Ŵ T 2 ˆΩ 22 (ˆβ 2 ) 1 Ŵ 2 ) can approximate the distribution of (Q 1, Q 2 ) as long as ˆΩ ll (ˆβ l ) (l = 1, 2) are positive definite. Exactly the same approximation can be obtained by using standardization of the normal distribution of Ŵ l (l = 1, 2). Define ˆV l = ˆΩ ll (ˆβ l ) 1/2 Ŵ l and ˆΩ ll (ˆβ l ) 1/2 ˆΩll (ˆβ l ) 1/2 = ˆΩ ll (ˆβ l ) 1 (l = 1, 2). Then the distribution of ( ˆV T 1, ˆV T 2 ) is 2p-dimensional normal with mean and covariance matrix given by ) ( Ip ˆΓ12 ˆΓ T 12 where I p is a p p identity matrix and ˆΓ 12 = ˆΩ 11 (ˆβ 1 ) 1/2 ˆΩ12 (ˆβ 1, ˆβ 2 ) ˆΩ 22 (ˆβ 2 ) 1/2. Again, the known distribution ( ˆV T 1 ˆV 1, ˆV T 2 ˆV 2 ) can approximate the distribution of (Q 1, Q 2 ). The percentage 5 I p, ]).

6 point of Q max = max{q 1, Q 2 }, say ˆχ 2 max(α) in (1.7), is approximated by the percentage point of the distribution of max{ŵ T 1 ˆΩ 11 (ˆβ 1 ) 1 Ŵ 1, Ŵ T 2 ˆΩ 22 (ˆβ 2 ) 1 Ŵ 2 } or max{ ˆV T 1 ˆV 1, ˆV T 2 ˆV 2 }, for example, via Monte Carlo simulations. Similar to the fact mentioned in Fleming and Harrington (1991, pp.28) for weighted logrank tests, the test (1.7) is closely related to the p-values of the tests (1.5) and (1.6). Suppose one computes the Wald statistics Q 1 and Q 2, and corresponding p-values p 1 and p 2. The test which rejects H if min{p 1, p 2 } p(α) such that P H (min{p 1, p 2 } p(α)) = α is conceivable as long as p(α) can be computed. From the definition of p-value, we can see min{p 1, p 2 } p(α) Q max χ 2 p(p(α)). and therefore, the value p(α) is approximated by ˆp(α) = 1 F χ 2 p (ˆχ 2 max(α)), where F χ 2 p is the distribution function of χ 2 p. Hence, the test (1.7) is exactly the same as the test which rejects H whenever min{p 1, p 2 } ˆp(α). 3 Simulation Studies We focused on the case where p = 1 in (1.1) throughout our simulation studies and hence β is a scalor in which H : β = β versus H 1 : β β are tested. The models we used were 2- and 3-dimensional models of Clayton and Cuzick (1985): The conditional survival function for (T 1,..., T n ) given (z 1,..., z n ) is { S(t 1,..., t n z 1,..., z n ) = exp ( t j θe z jβ ) 1/θ (n 1)} (n = 2, 3). The parameter θ determines the degree of dependence between T j and T k, (j, k = 1,..., n), and in simulation studies we used θ =.25 (strongly correlated), θ = 1.5 (weakly correlated) and θ = 1 (nearly independent). The covariates z 1,..., z n were considered to be mutually independent, { 1} valued random variable with P (z 1 = 1) = = P (z n = 1) =.5. The censoring variables C i1,..., C in were independently generated from exponential distributions with hazard function being µ 1,..., µ n respectively. The number of clusters, K, was fixed at 1. The unweighted score equation (1.3) was solved by using Newton-Raphson procedure and ˆβ 1 denotes this solution. For weight matrices, we specified w i11 w i12 w i1n w i21 w i22 w i2n = w in1 w in2 w inn 1 ρ i12 (z i1, z i2 ) ρ i1n (z i1, z in ) ρ i12 (z i1, z i2 ) 1 ρ i2n (z i2, z in ) ρ i1n (z i1, z in ) ρ i2n (z i2, z in ) 1 where ρ ijk (z ij, z ik ) = corr{m ij (X ij ), M ik (X ik ) z ij, z ik } is a correlation between two unknown random variables given (z ij, z ik ) for i = 1,..., K, and j, k = 1,..., n. This weight matrices are slightly different from those used by Cai and Prentice (1997) in their simulation studies, in that we do not need to know potential censoring times for the failing individuals. Since these weight matrices are unknown, we replaced these by their empirical estimators. The restriction of z ij to be { 1} valued is crucial since the number of parameters in the all weight matrices reduces significantly. In the case of n = 2, only four parameters, ρ 12 (, ), ρ 12 (, 1), ρ 12 (1, ) and ρ 12 (1, 1) determine the weight matrices, and in the case of n = 3, so do only 12 parameters. Each value of ρ ijk (z ij, z ik ) was estimated by the product moment correlation coefficient of the group of pseudodata ( ˆM i j( ˆβ 1, X i j), ˆM i k( ˆβ 1, X i k)), i R(z ij, z ik ), where R(z ij, z ik ) = {i z i j = z ij, z i k = z ik }. When #R(z ij, z ik ) = or 1, we set ˆρ ijk (z ij, z ik ) =. 6 1,

7 ˆβ 2 denotes the solution to the estimating equation (1.4) with these estimated weights, and this was solved by using Newton-Raphson procedure. Rarely do extremely large values of the estimated weights cause the Newton-Raphson algorithm to stop. We regulated these phenomena by letting the values of Q l (l = 1, 2) be whenever U l (β)/ β <.1 (l = 1, 2). All simulations were based on 1, runs, in which ERR1 and ERR2 indicate the total number of this type of error regulations in both U 1 (β) and U 2 (β) respectively. For each run, the statistics Q 1, Q 2 and Q max and the estimator ˆΓ 12 were computed. The quantiles of the Q max, that is ˆχ 2 max(.5), were approximated by the quantiles based on the 1, random samples from the distribution of ( ˆV 1, ˆV 2 ) for every run. First, our aim in this simulation studies is to see how well the distribution of ( ˆV 1, ˆV 2 ) approximates that of ( K( ˆβ 1 β )/{ˆΩ 1 ( ˆβ 1 ) 1/2 }, K( ˆβ 2 β )/{ˆΩ 2 ( ˆβ 2 ) 1/2 }). Since the distribution of ( ˆV 1, ˆV 2 ) is completely determined by the estimator ˆΓ 12, it is worth seeing the performance of ˆΓ 12 compared with the actual correlation coefficient of ( K( ˆβ 1 β )/{ˆΩ 1 ( ˆβ 1 ) 1/2 }, K( ˆβ 2 β )/{ˆΩ 2 ( ˆβ 2 ) 1/2 }). Mean and standard deviation of ˆΓ 12 and product moment correlation coefficient of ( K( ˆβ 1 β )/{ˆΩ 1 ( ˆβ 1 ) 1/2 }, K( ˆβ 2 β )/{ˆΩ 2 ( ˆβ 2 ) 1/2 }) are listed as Γ 12, SD(ˆΓ 12 ), and r respectively in Table 1. P denotes the probability of censoring calculated from the actual sample. For the test of β =, that is H : β =, the empirical size and power of Q 1, Q 2 and Q max based on these 5% points are given in Tables 2 and 3. Based on the Tables 2 and 3, power functions are drawn in Figures Concluding remarks We have discussed the way to approximate the asymptotic distribution of Q max to construct the hypothesis testing (1.7). As shown in Table 1, the approximation of Q max by ( ˆV 1, ˆV 2 ) performed well in terms of the correlation coefficient of ( K( ˆβ 1 β )/{ˆΩ 1 ( ˆβ 1 ) 1/2 }, K( ˆβ 2 β )/{ˆΩ 2 ( ˆβ 2 ) 1/2 }) and ˆΓ 12. Empirical size of Q max listed in Tables 2 and 3 appears to slightly larger than the nominal 5% level, especially in the case of 3-dimensional Clayton model. At almost every alternatives, the empirical power of the test based on Q max did not become the worst ones compared to Q 1 and Q 2. Therefore, we can conclude that the test (1.7) tends to prevent the power reduction caused by selecting the wrong estimating equation. Acknowledgement The authors would like to thank Dr. Yoshihiko Konno for helpful comments. References [1] Cai, J. and Prentice, R.L. (1997). Regression estimation using multivariate failure time data and a common baseline hazard function model, Lifetime Data Analysis 3, [2] Clayton, D.G. and Cuzick, J. (1985). Multivariate generalizations of the proportional hazards model (with discussion), Journal of the Royal Statistical Society Series A 148, [3] Fleming, T.R. and Harrington, D.P. (1991). Counting Process and Survival Analysis, Wiley: New York. [4] Spiekerman, C.F. and Lin, D.Y. (1998). Marginal regression models for multivariate failure time data, Journal of the American Statistical Association 93,

8 [5] Tarone, R.E. (1981). On the distribution of the maximum of the logrank statistics and the modified Wilcoxon statistics, Biometrics 37,

9 Table 1: Mean and standard deviation of ˆΓ 12 and product moment correlation coefficient of ( K( ˆβ 1 β )/{ˆΩ 1 ( ˆβ 1 ) 1/2 }, K( ˆβ 2 β )/{ˆΩ 2 ( ˆβ 2 ) 1/2 }) 2 dimensional Clayton model θ β P Γ12 SD(ˆΓ 12 ) r ERR1 ERR2 µ 1 = µ 2 = µ 1 = µ 2 = dimensional Clayton model θ β P Γ12 SD(ˆΓ 12 ) r ERR1 ERR2 µ 1 = µ 2 = µ 3 = µ 1 = µ 2 = µ 3 =

10 Table 2: Empirical power of tests based on Q1, Q2, and Qmax for nominal 5 % significance level under 2 dimensional Clayton model (* indicates the worst empirical power among Q1, Q2, and Qmax) µ1 = µ2 =.5 β (true) θ =.25 Q Q Qmax ERR1 ERR2 θ = 1.5 Q Q Qmax ERR1 ERR2 θ = 1. Q Q Qmax ERR1 ERR2 µ1 = µ2 = 1.5 β (true) θ =.25 Q Q Qmax ERR1 ERR2 θ = 1.5 Q Q Qmax ERR1 ERR2 θ = 1. Q Q Qmax ERR1 ERR2 1

11 Figure 1: Comparison of powers of Q 1 (dots), Q 2 (dashes), and Q max (plain) for 5% significance level under 2 dimensional Clayton model: Clayton dependence parameters θ =.25 (upper figure), θ = 1.5 and θ = 1 (lower figure) with µ 1 = µ 2 =.5 11

12 Figure 2: Comparison of powers of Q 1 (dots), Q 2 (dashes), and Q max (plain) for 5% significance level under 2 dimensional Clayton model: Clayton dependence parameters θ =.25(upper figure), θ = 1.5 and θ = 1 (lower figure) with µ 1 = µ 2 =

13 Table 3: Empirical power of tests based on Q1, Q2, and Qmax for nominal 5 % significance level under 3 dimensional Clayton model (* indicates the worst empirical power among Q1, Q2, and Qmax) µ1 = µ2 = µ3 =.5 β (true) θ =.25 Q Q Qmax ERR1 ERR θ = 1.5 Q Q Qmax ERR1 ERR θ = 1. Q Q Qmax ERR1 ERR2 1 µ1 = µ2 = µ3 = 1.5 β (true) θ =.25 Q Q Qmax ERR1 ERR θ = 1.5 Q Q Qmax ERR1 ERR2 1 θ = 1. Q Q Qmax ERR1 ERR2 1 13

14 Figure 3: Comparison of powers of Q 1 (dots), Q 2 (dashes), and Q max,(plain) for 5% significance level under 3 dimensional Clayton model: Clayton dependence parameters θ =.25(upper figure), θ = 1.5 and θ = 1(lower figure) with µ 1 = µ 2 = µ 3 =.5 14

15 Figure 4: Comparison of powers of Q 1 (dots), Q 2 (dashes), and Q max,(plain) for 5% significance level under 3 dimensional Clayton model: Clayton dependence parameters θ =.25(upper figure), θ = 1.5 and θ = 1(lower figure) with µ 1 = µ 2 = µ 3 =

FULL LIKELIHOOD INFERENCES IN THE COX MODEL

FULL LIKELIHOOD INFERENCES IN THE COX MODEL October 20, 2007 FULL LIKELIHOOD INFERENCES IN THE COX MODEL BY JIAN-JIAN REN 1 AND MAI ZHOU 2 University of Central Florida and University of Kentucky Abstract We use the empirical likelihood approach

More information

Statistics 262: Intermediate Biostatistics Non-parametric Survival Analysis

Statistics 262: Intermediate Biostatistics Non-parametric Survival Analysis Statistics 262: Intermediate Biostatistics Non-parametric Survival Analysis Jonathan Taylor & Kristin Cobb Statistics 262: Intermediate Biostatistics p.1/?? Overview of today s class Kaplan-Meier Curve

More information

University of California, Berkeley

University of California, Berkeley University of California, Berkeley U.C. Berkeley Division of Biostatistics Working Paper Series Year 24 Paper 153 A Note on Empirical Likelihood Inference of Residual Life Regression Ying Qing Chen Yichuan

More information

Other Survival Models. (1) Non-PH models. We briefly discussed the non-proportional hazards (non-ph) model

Other Survival Models. (1) Non-PH models. We briefly discussed the non-proportional hazards (non-ph) model Other Survival Models (1) Non-PH models We briefly discussed the non-proportional hazards (non-ph) model λ(t Z) = λ 0 (t) exp{β(t) Z}, where β(t) can be estimated by: piecewise constants (recall how);

More information

Survival Analysis for Case-Cohort Studies

Survival Analysis for Case-Cohort Studies Survival Analysis for ase-ohort Studies Petr Klášterecký Dept. of Probability and Mathematical Statistics, Faculty of Mathematics and Physics, harles University, Prague, zech Republic e-mail: petr.klasterecky@matfyz.cz

More information

Part III. Hypothesis Testing. III.1. Log-rank Test for Right-censored Failure Time Data

Part III. Hypothesis Testing. III.1. Log-rank Test for Right-censored Failure Time Data 1 Part III. Hypothesis Testing III.1. Log-rank Test for Right-censored Failure Time Data Consider a survival study consisting of n independent subjects from p different populations with survival functions

More information

Tests of independence for censored bivariate failure time data

Tests of independence for censored bivariate failure time data Tests of independence for censored bivariate failure time data Abstract Bivariate failure time data is widely used in survival analysis, for example, in twins study. This article presents a class of χ

More information

FULL LIKELIHOOD INFERENCES IN THE COX MODEL: AN EMPIRICAL LIKELIHOOD APPROACH

FULL LIKELIHOOD INFERENCES IN THE COX MODEL: AN EMPIRICAL LIKELIHOOD APPROACH FULL LIKELIHOOD INFERENCES IN THE COX MODEL: AN EMPIRICAL LIKELIHOOD APPROACH Jian-Jian Ren 1 and Mai Zhou 2 University of Central Florida and University of Kentucky Abstract: For the regression parameter

More information

Frailty Models and Copulas: Similarities and Differences

Frailty Models and Copulas: Similarities and Differences Frailty Models and Copulas: Similarities and Differences KLARA GOETHALS, PAUL JANSSEN & LUC DUCHATEAU Department of Physiology and Biometrics, Ghent University, Belgium; Center for Statistics, Hasselt

More information

1 Introduction. 2 Residuals in PH model

1 Introduction. 2 Residuals in PH model Supplementary Material for Diagnostic Plotting Methods for Proportional Hazards Models With Time-dependent Covariates or Time-varying Regression Coefficients BY QIQING YU, JUNYI DONG Department of Mathematical

More information

Goodness-of-fit test for the Cox Proportional Hazard Model

Goodness-of-fit test for the Cox Proportional Hazard Model Goodness-of-fit test for the Cox Proportional Hazard Model Rui Cui rcui@eco.uc3m.es Department of Economics, UC3M Abstract In this paper, we develop new goodness-of-fit tests for the Cox proportional hazard

More information

Modelling Survival Events with Longitudinal Data Measured with Error

Modelling Survival Events with Longitudinal Data Measured with Error Modelling Survival Events with Longitudinal Data Measured with Error Hongsheng Dai, Jianxin Pan & Yanchun Bao First version: 14 December 29 Research Report No. 16, 29, Probability and Statistics Group

More information

STAT Sample Problem: General Asymptotic Results

STAT Sample Problem: General Asymptotic Results STAT331 1-Sample Problem: General Asymptotic Results In this unit we will consider the 1-sample problem and prove the consistency and asymptotic normality of the Nelson-Aalen estimator of the cumulative

More information

STAT331. Cox s Proportional Hazards Model

STAT331. Cox s Proportional Hazards Model STAT331 Cox s Proportional Hazards Model In this unit we introduce Cox s proportional hazards (Cox s PH) model, give a heuristic development of the partial likelihood function, and discuss adaptations

More information

Full likelihood inferences in the Cox model: an empirical likelihood approach

Full likelihood inferences in the Cox model: an empirical likelihood approach Ann Inst Stat Math 2011) 63:1005 1018 DOI 10.1007/s10463-010-0272-y Full likelihood inferences in the Cox model: an empirical likelihood approach Jian-Jian Ren Mai Zhou Received: 22 September 2008 / Revised:

More information

PhD course in Advanced survival analysis. One-sample tests. Properties. Idea: (ABGK, sect. V.1.1) Counting process N(t)

PhD course in Advanced survival analysis. One-sample tests. Properties. Idea: (ABGK, sect. V.1.1) Counting process N(t) PhD course in Advanced survival analysis. (ABGK, sect. V.1.1) One-sample tests. Counting process N(t) Non-parametric hypothesis tests. Parametric models. Intensity process λ(t) = α(t)y (t) satisfying Aalen

More information

A comparison study of the nonparametric tests based on the empirical distributions

A comparison study of the nonparametric tests based on the empirical distributions 통계연구 (2015), 제 20 권제 3 호, 1-12 A comparison study of the nonparametric tests based on the empirical distributions Hyo-Il Park 1) Abstract In this study, we propose a nonparametric test based on the empirical

More information

Survival Analysis Math 434 Fall 2011

Survival Analysis Math 434 Fall 2011 Survival Analysis Math 434 Fall 2011 Part IV: Chap. 8,9.2,9.3,11: Semiparametric Proportional Hazards Regression Jimin Ding Math Dept. www.math.wustl.edu/ jmding/math434/fall09/index.html Basic Model Setup

More information

Semiparametric Regression

Semiparametric Regression Semiparametric Regression Patrick Breheny October 22 Patrick Breheny Survival Data Analysis (BIOS 7210) 1/23 Introduction Over the past few weeks, we ve introduced a variety of regression models under

More information

Efficiency Comparison Between Mean and Log-rank Tests for. Recurrent Event Time Data

Efficiency Comparison Between Mean and Log-rank Tests for. Recurrent Event Time Data Efficiency Comparison Between Mean and Log-rank Tests for Recurrent Event Time Data Wenbin Lu Department of Statistics, North Carolina State University, Raleigh, NC 27695 Email: lu@stat.ncsu.edu Summary.

More information

UNIVERSITY OF CALIFORNIA, SAN DIEGO

UNIVERSITY OF CALIFORNIA, SAN DIEGO UNIVERSITY OF CALIFORNIA, SAN DIEGO Estimation of the primary hazard ratio in the presence of a secondary covariate with non-proportional hazards An undergraduate honors thesis submitted to the Department

More information

Linear rank statistics

Linear rank statistics Linear rank statistics Comparison of two groups. Consider the failure time T ij of j-th subject in the i-th group for i = 1 or ; the first group is often called control, and the second treatment. Let n

More information

On the Breslow estimator

On the Breslow estimator Lifetime Data Anal (27) 13:471 48 DOI 1.17/s1985-7-948-y On the Breslow estimator D. Y. Lin Received: 5 April 27 / Accepted: 16 July 27 / Published online: 2 September 27 Springer Science+Business Media,

More information

Composite likelihood and two-stage estimation in family studies

Composite likelihood and two-stage estimation in family studies Biostatistics (2004), 5, 1,pp. 15 30 Printed in Great Britain Composite likelihood and two-stage estimation in family studies ELISABETH WREFORD ANDERSEN The Danish Epidemiology Science Centre, Statens

More information

Optimal Weight Functions for Marginal Proportional Hazards Analysis of Clustered Failure Time Data

Optimal Weight Functions for Marginal Proportional Hazards Analysis of Clustered Failure Time Data Lifetime Data Analysis, 8, 5 19, 22 # 22 Kluwer Academic Publishers. Printed in The Netherlands. Optimal Weight Functions for Marginal Proportional Hazards Analysis of Clustered Failure Time Data ROBERT

More information

TMA 4275 Lifetime Analysis June 2004 Solution

TMA 4275 Lifetime Analysis June 2004 Solution TMA 4275 Lifetime Analysis June 2004 Solution Problem 1 a) Observation of the outcome is censored, if the time of the outcome is not known exactly and only the last time when it was observed being intact,

More information

A note on profile likelihood for exponential tilt mixture models

A note on profile likelihood for exponential tilt mixture models Biometrika (2009), 96, 1,pp. 229 236 C 2009 Biometrika Trust Printed in Great Britain doi: 10.1093/biomet/asn059 Advance Access publication 22 January 2009 A note on profile likelihood for exponential

More information

Power and Sample Size Calculations with the Additive Hazards Model

Power and Sample Size Calculations with the Additive Hazards Model Journal of Data Science 10(2012), 143-155 Power and Sample Size Calculations with the Additive Hazards Model Ling Chen, Chengjie Xiong, J. Philip Miller and Feng Gao Washington University School of Medicine

More information

Regularization in Cox Frailty Models

Regularization in Cox Frailty Models Regularization in Cox Frailty Models Andreas Groll 1, Trevor Hastie 2, Gerhard Tutz 3 1 Ludwig-Maximilians-Universität Munich, Department of Mathematics, Theresienstraße 39, 80333 Munich, Germany 2 University

More information

Chapter 7 Fall Chapter 7 Hypothesis testing Hypotheses of interest: (A) 1-sample

Chapter 7 Fall Chapter 7 Hypothesis testing Hypotheses of interest: (A) 1-sample Bios 323: Applied Survival Analysis Qingxia (Cindy) Chen Chapter 7 Fall 2012 Chapter 7 Hypothesis testing Hypotheses of interest: (A) 1-sample H 0 : S(t) = S 0 (t), where S 0 ( ) is known survival function,

More information

Modeling the scale parameter ϕ A note on modeling correlation of binary responses Using marginal odds ratios to model association for binary responses

Modeling the scale parameter ϕ A note on modeling correlation of binary responses Using marginal odds ratios to model association for binary responses Outline Marginal model Examples of marginal model GEE1 Augmented GEE GEE1.5 GEE2 Modeling the scale parameter ϕ A note on modeling correlation of binary responses Using marginal odds ratios to model association

More information

Answer Key for STAT 200B HW No. 8

Answer Key for STAT 200B HW No. 8 Answer Key for STAT 200B HW No. 8 May 8, 2007 Problem 3.42 p. 708 The values of Ȳ for x 00, 0, 20, 30 are 5/40, 0, 20/50, and, respectively. From Corollary 3.5 it follows that MLE exists i G is identiable

More information

Efficiency of Profile/Partial Likelihood in the Cox Model

Efficiency of Profile/Partial Likelihood in the Cox Model Efficiency of Profile/Partial Likelihood in the Cox Model Yuichi Hirose School of Mathematics, Statistics and Operations Research, Victoria University of Wellington, New Zealand Summary. This paper shows

More information

Quasi-likelihood Scan Statistics for Detection of

Quasi-likelihood Scan Statistics for Detection of for Quasi-likelihood for Division of Biostatistics and Bioinformatics, National Health Research Institutes & Department of Mathematics, National Chung Cheng University 17 December 2011 1 / 25 Outline for

More information

Introduction to Empirical Processes and Semiparametric Inference Lecture 25: Semiparametric Models

Introduction to Empirical Processes and Semiparametric Inference Lecture 25: Semiparametric Models Introduction to Empirical Processes and Semiparametric Inference Lecture 25: Semiparametric Models Michael R. Kosorok, Ph.D. Professor and Chair of Biostatistics Professor of Statistics and Operations

More information

Logistic regression model for survival time analysis using time-varying coefficients

Logistic regression model for survival time analysis using time-varying coefficients Logistic regression model for survival time analysis using time-varying coefficients Accepted in American Journal of Mathematical and Management Sciences, 2016 Kenichi SATOH ksatoh@hiroshima-u.ac.jp Research

More information

Empirical likelihood and self-weighting approach for hypothesis testing of infinite variance processes and its applications

Empirical likelihood and self-weighting approach for hypothesis testing of infinite variance processes and its applications Empirical likelihood and self-weighting approach for hypothesis testing of infinite variance processes and its applications Fumiya Akashi Research Associate Department of Applied Mathematics Waseda University

More information

Analysis of Time-to-Event Data: Chapter 4 - Parametric regression models

Analysis of Time-to-Event Data: Chapter 4 - Parametric regression models Analysis of Time-to-Event Data: Chapter 4 - Parametric regression models Steffen Unkel Department of Medical Statistics University Medical Center Göttingen, Germany Winter term 2018/19 1/25 Right censored

More information

PENALIZED LIKELIHOOD PARAMETER ESTIMATION FOR ADDITIVE HAZARD MODELS WITH INTERVAL CENSORED DATA

PENALIZED LIKELIHOOD PARAMETER ESTIMATION FOR ADDITIVE HAZARD MODELS WITH INTERVAL CENSORED DATA PENALIZED LIKELIHOOD PARAMETER ESTIMATION FOR ADDITIVE HAZARD MODELS WITH INTERVAL CENSORED DATA Kasun Rathnayake ; A/Prof Jun Ma Department of Statistics Faculty of Science and Engineering Macquarie University

More information

STAT 6350 Analysis of Lifetime Data. Failure-time Regression Analysis

STAT 6350 Analysis of Lifetime Data. Failure-time Regression Analysis STAT 6350 Analysis of Lifetime Data Failure-time Regression Analysis Explanatory Variables for Failure Times Usually explanatory variables explain/predict why some units fail quickly and some units survive

More information

A Bivariate Weibull Regression Model

A Bivariate Weibull Regression Model c Heldermann Verlag Economic Quality Control ISSN 0940-5151 Vol 20 (2005), No. 1, 1 A Bivariate Weibull Regression Model David D. Hanagal Abstract: In this paper, we propose a new bivariate Weibull regression

More information

Ph.D. Qualifying Exam Friday Saturday, January 3 4, 2014

Ph.D. Qualifying Exam Friday Saturday, January 3 4, 2014 Ph.D. Qualifying Exam Friday Saturday, January 3 4, 2014 Put your solution to each problem on a separate sheet of paper. Problem 1. (5166) Assume that two random samples {x i } and {y i } are independently

More information

Statistics and econometrics

Statistics and econometrics 1 / 36 Slides for the course Statistics and econometrics Part 10: Asymptotic hypothesis testing European University Institute Andrea Ichino September 8, 2014 2 / 36 Outline Why do we need large sample

More information

Lecture 12. Multivariate Survival Data Statistics Survival Analysis. Presented March 8, 2016

Lecture 12. Multivariate Survival Data Statistics Survival Analysis. Presented March 8, 2016 Statistics 255 - Survival Analysis Presented March 8, 2016 Dan Gillen Department of Statistics University of California, Irvine 12.1 Examples Clustered or correlated survival times Disease onset in family

More information

SEMIPARAMETRIC REGRESSION WITH TIME-DEPENDENT COEFFICIENTS FOR FAILURE TIME DATA ANALYSIS

SEMIPARAMETRIC REGRESSION WITH TIME-DEPENDENT COEFFICIENTS FOR FAILURE TIME DATA ANALYSIS Statistica Sinica 2 (21), 853-869 SEMIPARAMETRIC REGRESSION WITH TIME-DEPENDENT COEFFICIENTS FOR FAILURE TIME DATA ANALYSIS Zhangsheng Yu and Xihong Lin Indiana University and Harvard School of Public

More information

Survival Regression Models

Survival Regression Models Survival Regression Models David M. Rocke May 18, 2017 David M. Rocke Survival Regression Models May 18, 2017 1 / 32 Background on the Proportional Hazards Model The exponential distribution has constant

More information

Web Appendix for Hierarchical Adaptive Regression Kernels for Regression with Functional Predictors by D. B. Woodard, C. Crainiceanu, and D.

Web Appendix for Hierarchical Adaptive Regression Kernels for Regression with Functional Predictors by D. B. Woodard, C. Crainiceanu, and D. Web Appendix for Hierarchical Adaptive Regression Kernels for Regression with Functional Predictors by D. B. Woodard, C. Crainiceanu, and D. Ruppert A. EMPIRICAL ESTIMATE OF THE KERNEL MIXTURE Here we

More information

4. Comparison of Two (K) Samples

4. Comparison of Two (K) Samples 4. Comparison of Two (K) Samples K=2 Problem: compare the survival distributions between two groups. E: comparing treatments on patients with a particular disease. Z: Treatment indicator, i.e. Z = 1 for

More information

TESTS FOR LOCATION WITH K SAMPLES UNDER THE KOZIOL-GREEN MODEL OF RANDOM CENSORSHIP Key Words: Ke Wu Department of Mathematics University of Mississip

TESTS FOR LOCATION WITH K SAMPLES UNDER THE KOZIOL-GREEN MODEL OF RANDOM CENSORSHIP Key Words: Ke Wu Department of Mathematics University of Mississip TESTS FOR LOCATION WITH K SAMPLES UNDER THE KOIOL-GREEN MODEL OF RANDOM CENSORSHIP Key Words: Ke Wu Department of Mathematics University of Mississippi University, MS38677 K-sample location test, Koziol-Green

More information

Outline. Frailty modelling of Multivariate Survival Data. Clustered survival data. Clustered survival data

Outline. Frailty modelling of Multivariate Survival Data. Clustered survival data. Clustered survival data Outline Frailty modelling of Multivariate Survival Data Thomas Scheike ts@biostat.ku.dk Department of Biostatistics University of Copenhagen Marginal versus Frailty models. Two-stage frailty models: copula

More information

11 Survival Analysis and Empirical Likelihood

11 Survival Analysis and Empirical Likelihood 11 Survival Analysis and Empirical Likelihood The first paper of empirical likelihood is actually about confidence intervals with the Kaplan-Meier estimator (Thomas and Grunkmeier 1979), i.e. deals with

More information

On the Comparison of Fisher Information of the Weibull and GE Distributions

On the Comparison of Fisher Information of the Weibull and GE Distributions On the Comparison of Fisher Information of the Weibull and GE Distributions Rameshwar D. Gupta Debasis Kundu Abstract In this paper we consider the Fisher information matrices of the generalized exponential

More information

Chapter 4: Constrained estimators and tests in the multiple linear regression model (Part III)

Chapter 4: Constrained estimators and tests in the multiple linear regression model (Part III) Chapter 4: Constrained estimators and tests in the multiple linear regression model (Part III) Florian Pelgrin HEC September-December 2010 Florian Pelgrin (HEC) Constrained estimators September-December

More information

STAT331. Combining Martingales, Stochastic Integrals, and Applications to Logrank Test & Cox s Model

STAT331. Combining Martingales, Stochastic Integrals, and Applications to Logrank Test & Cox s Model STAT331 Combining Martingales, Stochastic Integrals, and Applications to Logrank Test & Cox s Model Because of Theorem 2.5.1 in Fleming and Harrington, see Unit 11: For counting process martingales with

More information

Goodness-Of-Fit for Cox s Regression Model. Extensions of Cox s Regression Model. Survival Analysis Fall 2004, Copenhagen

Goodness-Of-Fit for Cox s Regression Model. Extensions of Cox s Regression Model. Survival Analysis Fall 2004, Copenhagen Outline Cox s proportional hazards model. Goodness-of-fit tools More flexible models R-package timereg Forthcoming book, Martinussen and Scheike. 2/38 University of Copenhagen http://www.biostat.ku.dk

More information

STT 843 Key to Homework 1 Spring 2018

STT 843 Key to Homework 1 Spring 2018 STT 843 Key to Homework Spring 208 Due date: Feb 4, 208 42 (a Because σ = 2, σ 22 = and ρ 2 = 05, we have σ 2 = ρ 2 σ σ22 = 2/2 Then, the mean and covariance of the bivariate normal is µ = ( 0 2 and Σ

More information

TECHNICAL REPORT # 59 MAY Interim sample size recalculation for linear and logistic regression models: a comprehensive Monte-Carlo study

TECHNICAL REPORT # 59 MAY Interim sample size recalculation for linear and logistic regression models: a comprehensive Monte-Carlo study TECHNICAL REPORT # 59 MAY 2013 Interim sample size recalculation for linear and logistic regression models: a comprehensive Monte-Carlo study Sergey Tarima, Peng He, Tao Wang, Aniko Szabo Division of Biostatistics,

More information

Approximate Distributions of the Likelihood Ratio Statistic in a Structural Equation with Many Instruments

Approximate Distributions of the Likelihood Ratio Statistic in a Structural Equation with Many Instruments CIRJE-F-466 Approximate Distributions of the Likelihood Ratio Statistic in a Structural Equation with Many Instruments Yukitoshi Matsushita CIRJE, Faculty of Economics, University of Tokyo February 2007

More information

Modelling geoadditive survival data

Modelling geoadditive survival data Modelling geoadditive survival data Thomas Kneib & Ludwig Fahrmeir Department of Statistics, Ludwig-Maximilians-University Munich 1. Leukemia survival data 2. Structured hazard regression 3. Mixed model

More information

Lecture 5 Models and methods for recurrent event data

Lecture 5 Models and methods for recurrent event data Lecture 5 Models and methods for recurrent event data Recurrent and multiple events are commonly encountered in longitudinal studies. In this chapter we consider ordered recurrent and multiple events.

More information

Chapter 2 Inference on Mean Residual Life-Overview

Chapter 2 Inference on Mean Residual Life-Overview Chapter 2 Inference on Mean Residual Life-Overview Statistical inference based on the remaining lifetimes would be intuitively more appealing than the popular hazard function defined as the risk of immediate

More information

Estimation and Testing for Common Cycles

Estimation and Testing for Common Cycles Estimation and esting for Common Cycles Anders Warne February 27, 2008 Abstract: his note discusses estimation and testing for the presence of common cycles in cointegrated vector autoregressions A simple

More information

Simple techniques for comparing survival functions with interval-censored data

Simple techniques for comparing survival functions with interval-censored data Simple techniques for comparing survival functions with interval-censored data Jinheum Kim, joint with Chung Mo Nam jinhkim@suwon.ac.kr Department of Applied Statistics University of Suwon Comparing survival

More information

SAMPLE SIZE ESTIMATION FOR SURVIVAL OUTCOMES IN CLUSTER-RANDOMIZED STUDIES WITH SMALL CLUSTER SIZES BIOMETRICS (JUNE 2000)

SAMPLE SIZE ESTIMATION FOR SURVIVAL OUTCOMES IN CLUSTER-RANDOMIZED STUDIES WITH SMALL CLUSTER SIZES BIOMETRICS (JUNE 2000) SAMPLE SIZE ESTIMATION FOR SURVIVAL OUTCOMES IN CLUSTER-RANDOMIZED STUDIES WITH SMALL CLUSTER SIZES BIOMETRICS (JUNE 2000) AMITA K. MANATUNGA THE ROLLINS SCHOOL OF PUBLIC HEALTH OF EMORY UNIVERSITY SHANDE

More information

Cox s proportional hazards/regression model - model assessment

Cox s proportional hazards/regression model - model assessment Cox s proportional hazards/regression model - model assessment Rasmus Waagepetersen September 27, 2017 Topics: Plots based on estimated cumulative hazards Cox-Snell residuals: overall check of fit Martingale

More information

Supplemental Material for KERNEL-BASED INFERENCE IN TIME-VARYING COEFFICIENT COINTEGRATING REGRESSION. September 2017

Supplemental Material for KERNEL-BASED INFERENCE IN TIME-VARYING COEFFICIENT COINTEGRATING REGRESSION. September 2017 Supplemental Material for KERNEL-BASED INFERENCE IN TIME-VARYING COEFFICIENT COINTEGRATING REGRESSION By Degui Li, Peter C. B. Phillips, and Jiti Gao September 017 COWLES FOUNDATION DISCUSSION PAPER NO.

More information

STAT 331. Martingale Central Limit Theorem and Related Results

STAT 331. Martingale Central Limit Theorem and Related Results STAT 331 Martingale Central Limit Theorem and Related Results In this unit we discuss a version of the martingale central limit theorem, which states that under certain conditions, a sum of orthogonal

More information

Sample Size and Power Considerations for Longitudinal Studies

Sample Size and Power Considerations for Longitudinal Studies Sample Size and Power Considerations for Longitudinal Studies Outline Quantities required to determine the sample size in longitudinal studies Review of type I error, type II error, and power For continuous

More information

Econometrics II - EXAM Answer each question in separate sheets in three hours

Econometrics II - EXAM Answer each question in separate sheets in three hours Econometrics II - EXAM Answer each question in separate sheets in three hours. Let u and u be jointly Gaussian and independent of z in all the equations. a Investigate the identification of the following

More information

Lecture 3. Inference about multivariate normal distribution

Lecture 3. Inference about multivariate normal distribution Lecture 3. Inference about multivariate normal distribution 3.1 Point and Interval Estimation Let X 1,..., X n be i.i.d. N p (µ, Σ). We are interested in evaluation of the maximum likelihood estimates

More information

Lecture 2: Martingale theory for univariate survival analysis

Lecture 2: Martingale theory for univariate survival analysis Lecture 2: Martingale theory for univariate survival analysis In this lecture T is assumed to be a continuous failure time. A core question in this lecture is how to develop asymptotic properties when

More information

A Simple, Graphical Procedure for Comparing Multiple Treatment Effects

A Simple, Graphical Procedure for Comparing Multiple Treatment Effects A Simple, Graphical Procedure for Comparing Multiple Treatment Effects Brennan S. Thompson and Matthew D. Webb May 15, 2015 > Abstract In this paper, we utilize a new graphical

More information

Multivariate Survival Data With Censoring.

Multivariate Survival Data With Censoring. 1 Multivariate Survival Data With Censoring. Shulamith Gross and Catherine Huber-Carol Baruch College of the City University of New York, Dept of Statistics and CIS, Box 11-220, 1 Baruch way, 10010 NY.

More information

Central Limit Theorem ( 5.3)

Central Limit Theorem ( 5.3) Central Limit Theorem ( 5.3) Let X 1, X 2,... be a sequence of independent random variables, each having n mean µ and variance σ 2. Then the distribution of the partial sum S n = X i i=1 becomes approximately

More information

UNIVERSITÄT POTSDAM Institut für Mathematik

UNIVERSITÄT POTSDAM Institut für Mathematik UNIVERSITÄT POTSDAM Institut für Mathematik Testing the Acceleration Function in Life Time Models Hannelore Liero Matthias Liero Mathematische Statistik und Wahrscheinlichkeitstheorie Universität Potsdam

More information

The outline for Unit 3

The outline for Unit 3 The outline for Unit 3 Unit 1. Introduction: The regression model. Unit 2. Estimation principles. Unit 3: Hypothesis testing principles. 3.1 Wald test. 3.2 Lagrange Multiplier. 3.3 Likelihood Ratio Test.

More information

Review. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 770: Categorical Data Analysis

Review. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 770: Categorical Data Analysis Review Timothy Hanson Department of Statistics, University of South Carolina Stat 770: Categorical Data Analysis 1 / 22 Chapter 1: background Nominal, ordinal, interval data. Distributions: Poisson, binomial,

More information

On the Behavior of Marginal and Conditional Akaike Information Criteria in Linear Mixed Models

On the Behavior of Marginal and Conditional Akaike Information Criteria in Linear Mixed Models On the Behavior of Marginal and Conditional Akaike Information Criteria in Linear Mixed Models Thomas Kneib Department of Mathematics Carl von Ossietzky University Oldenburg Sonja Greven Department of

More information

Residuals and model diagnostics

Residuals and model diagnostics Residuals and model diagnostics Patrick Breheny November 10 Patrick Breheny Survival Data Analysis (BIOS 7210) 1/42 Introduction Residuals Many assumptions go into regression models, and the Cox proportional

More information

arxiv:math/ v1 [math.st] 18 May 2005

arxiv:math/ v1 [math.st] 18 May 2005 Prospective survival analysis with a general semiparametric arxiv:math/55387v1 [math.st] 18 May 25 shared frailty model - a pseudo full likelihood approach Malka Gorfine 1 Department of Mathematics, Bar-Ilan

More information

Ph.D. Qualifying Exam Friday Saturday, January 6 7, 2017

Ph.D. Qualifying Exam Friday Saturday, January 6 7, 2017 Ph.D. Qualifying Exam Friday Saturday, January 6 7, 2017 Put your solution to each problem on a separate sheet of paper. Problem 1. (5106) Let X 1, X 2,, X n be a sequence of i.i.d. observations from a

More information

Econometric Methods for Panel Data

Econometric Methods for Panel Data Based on the books by Baltagi: Econometric Analysis of Panel Data and by Hsiao: Analysis of Panel Data Robert M. Kunst robert.kunst@univie.ac.at University of Vienna and Institute for Advanced Studies

More information

1 Glivenko-Cantelli type theorems

1 Glivenko-Cantelli type theorems STA79 Lecture Spring Semester Glivenko-Cantelli type theorems Given i.i.d. observations X,..., X n with unknown distribution function F (t, consider the empirical (sample CDF ˆF n (t = I [Xi t]. n Then

More information

Asymptotic Statistics-III. Changliang Zou

Asymptotic Statistics-III. Changliang Zou Asymptotic Statistics-III Changliang Zou The multivariate central limit theorem Theorem (Multivariate CLT for iid case) Let X i be iid random p-vectors with mean µ and and covariance matrix Σ. Then n (

More information

Cox s proportional hazards model and Cox s partial likelihood

Cox s proportional hazards model and Cox s partial likelihood Cox s proportional hazards model and Cox s partial likelihood Rasmus Waagepetersen October 12, 2018 1 / 27 Non-parametric vs. parametric Suppose we want to estimate unknown function, e.g. survival function.

More information

Typical Survival Data Arising From a Clinical Trial. Censoring. The Survivor Function. Mathematical Definitions Introduction

Typical Survival Data Arising From a Clinical Trial. Censoring. The Survivor Function. Mathematical Definitions Introduction Outline CHL 5225H Advanced Statistical Methods for Clinical Trials: Survival Analysis Prof. Kevin E. Thorpe Defining Survival Data Mathematical Definitions Non-parametric Estimates of Survival Comparing

More information

Continuous Time Survival in Latent Variable Models

Continuous Time Survival in Latent Variable Models Continuous Time Survival in Latent Variable Models Tihomir Asparouhov 1, Katherine Masyn 2, Bengt Muthen 3 Muthen & Muthen 1 University of California, Davis 2 University of California, Los Angeles 3 Abstract

More information

Interval Estimation for Parameters of a Bivariate Time Varying Covariate Model

Interval Estimation for Parameters of a Bivariate Time Varying Covariate Model Pertanika J. Sci. & Technol. 17 (2): 313 323 (2009) ISSN: 0128-7680 Universiti Putra Malaysia Press Interval Estimation for Parameters of a Bivariate Time Varying Covariate Model Jayanthi Arasan Department

More information

Bootstrapping high dimensional vector: interplay between dependence and dimensionality

Bootstrapping high dimensional vector: interplay between dependence and dimensionality Bootstrapping high dimensional vector: interplay between dependence and dimensionality Xianyang Zhang Joint work with Guang Cheng University of Missouri-Columbia LDHD: Transition Workshop, 2014 Xianyang

More information

Package Rsurrogate. October 20, 2016

Package Rsurrogate. October 20, 2016 Type Package Package Rsurrogate October 20, 2016 Title Robust Estimation of the Proportion of Treatment Effect Explained by Surrogate Marker Information Version 2.0 Date 2016-10-19 Author Layla Parast

More information

Statistical Analysis of Spatio-temporal Point Process Data. Peter J Diggle

Statistical Analysis of Spatio-temporal Point Process Data. Peter J Diggle Statistical Analysis of Spatio-temporal Point Process Data Peter J Diggle Department of Medicine, Lancaster University and Department of Biostatistics, Johns Hopkins University School of Public Health

More information

Lecture 15. Hypothesis testing in the linear model

Lecture 15. Hypothesis testing in the linear model 14. Lecture 15. Hypothesis testing in the linear model Lecture 15. Hypothesis testing in the linear model 1 (1 1) Preliminary lemma 15. Hypothesis testing in the linear model 15.1. Preliminary lemma Lemma

More information

Lecture 5: LDA and Logistic Regression

Lecture 5: LDA and Logistic Regression Lecture 5: and Logistic Regression Hao Helen Zhang Hao Helen Zhang Lecture 5: and Logistic Regression 1 / 39 Outline Linear Classification Methods Two Popular Linear Models for Classification Linear Discriminant

More information

Decomposition of Parsimonious Independence Model Using Pearson, Kendall and Spearman s Correlations for Two-Way Contingency Tables

Decomposition of Parsimonious Independence Model Using Pearson, Kendall and Spearman s Correlations for Two-Way Contingency Tables International Journal of Statistics and Probability; Vol. 7 No. 3; May 208 ISSN 927-7032 E-ISSN 927-7040 Published by Canadian Center of Science and Education Decomposition of Parsimonious Independence

More information

A Recursive Formula for the Kaplan-Meier Estimator with Mean Constraints

A Recursive Formula for the Kaplan-Meier Estimator with Mean Constraints Noname manuscript No. (will be inserted by the editor) A Recursive Formula for the Kaplan-Meier Estimator with Mean Constraints Mai Zhou Yifan Yang Received: date / Accepted: date Abstract In this note

More information

Exact Inference for the Two-Parameter Exponential Distribution Under Type-II Hybrid Censoring

Exact Inference for the Two-Parameter Exponential Distribution Under Type-II Hybrid Censoring Exact Inference for the Two-Parameter Exponential Distribution Under Type-II Hybrid Censoring A. Ganguly, S. Mitra, D. Samanta, D. Kundu,2 Abstract Epstein [9] introduced the Type-I hybrid censoring scheme

More information

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A. 1. Let P be a probability measure on a collection of sets A. (a) For each n N, let H n be a set in A such that H n H n+1. Show that P (H n ) monotonically converges to P ( k=1 H k) as n. (b) For each n

More information

Frailty Modeling for clustered survival data: a simulation study

Frailty Modeling for clustered survival data: a simulation study Frailty Modeling for clustered survival data: a simulation study IAA Oslo 2015 Souad ROMDHANE LaREMFiQ - IHEC University of Sousse (Tunisia) souad_romdhane@yahoo.fr Lotfi BELKACEM LaREMFiQ - IHEC University

More information

Published online: 10 Apr 2012.

Published online: 10 Apr 2012. This article was downloaded by: Columbia University] On: 23 March 215, At: 12:7 Publisher: Taylor & Francis Informa Ltd Registered in England and Wales Registered Number: 172954 Registered office: Mortimer

More information

MAS3301 / MAS8311 Biostatistics Part II: Survival

MAS3301 / MAS8311 Biostatistics Part II: Survival MAS3301 / MAS8311 Biostatistics Part II: Survival M. Farrow School of Mathematics and Statistics Newcastle University Semester 2, 2009-10 1 13 The Cox proportional hazards model 13.1 Introduction In the

More information