The partially monotone tensor spline estimation of joint distribution function with bivariate current status data

Size: px
Start display at page:

Download "The partially monotone tensor spline estimation of joint distribution function with bivariate current status data"

Transcription

1 University of Iowa Iowa Research Online Theses and Dissertations Summer 00 The partially monotone tensor spline estimation of joint distribution function with bivariate current status data Yuan Wu University of Iowa Copyright 00 Yuan Wu This dissertation is available at Iowa Research Online: Recommended Citation Wu, Yuan. "The partially monotone tensor spline estimation of joint distribution function with bivariate current status data." PhD (Doctor of Philosophy) thesis, University of Iowa, Follow this and additional works at: Part of the Applied Mathematics Commons

2 THE PARTIALLY MONOTONE TENSOR SPLINE ESTIMATION OF JOINT DISTRIBUTION FUNCTION WITH BIVARIATE CURRENT STATUS DATA by Yuan Wu An Abstract Of a thesis submitted in partial fulfillment of the requirements for the Doctor of Philosophy degree in Applied Mathematical and Computational Sciences in the Graduate College of The University of Iowa July 00 Thesis Supervisor: Professor Ying Zhang

3 ABSTRACT The analysis of joint distribution function with bivariate event time data is a challenging problem both theoretically and numerically. This thesis develops a tensor splinebased nonparametric maximum likelihood estimation method to estimate the joint distribution function with bivariate current status data. Tensor I-splines are developed to replace the traditional tensor B-splines in approximating joint distribution function in order to simplify the restricted maximum likelihood estimation problem in computing. The generalized gradient projection algorithm is used to compute the restricted optimization problem. We show that the proposed tensor splinebased nonparametric estimator is consistent and that the rate of convergence can be as good as n /. Simulation studies with moderate sample sizes show that the finite-sample performance of the proposed estimator is generally satisfactory. Abstract Approved: Thesis Supervisor Title and Department Date

4 THE PARTIALLY MONOTONE TENSOR SPLINE ESTIMATION OF JOINT DISTRIBUTION FUNCTION WITH BIVARIATE CURRENT STATUS DATA by Yuan Wu A thesis submitted in partial fulfillment of the requirements for the Doctor of Philosophy degree in Applied Mathematical and Computational Sciences in the Graduate College of The University of Iowa July 00 Thesis Supervisor: Professor Ying Zhang

5 Graduate College The University of Iowa Iowa City, Iowa CERTIFICATE OF APPROVAL PH.D. THESIS This is to certify that the Ph.D. thesis of Yuan Wu has been approved by the Examining Committee for the thesis requirement for the Doctor of Philosophy degree in Applied Mathematical and Computational Sciences at the July 00 graduation. Thesis Committee: Ying Zhang, Thesis Supervisor Weimin Han Jian Huang Michael Jones Lihe Wang

6 ACKNOWLEDGEMENTS I am grateful to all those people who have made this dissertation possible and because of whom my graduate experience has been one that I will cherish forever. I deeply owe my gratitude to my advisor, Dr. Ying Zhang for his time and energy he put in developing my research skills and advising my thesis work, for his encouragement and support in setting my career goal, and for his patience in correcting my English writing. His wide knowledge and logical way of thinking have been of great value for me. Without his bright insights and personal guidance, I would never have been able to finish my dissertation. I am really grateful to my academic advisor in the Department of Mathematics, Dr. Lihe Wang for his caring during these years at the University of Iowa. I would never forget he supported me financially and in all other possible ways when I had a difficult time due to my spoken English. I would like to express my warm and sincere thanks to faculties and staffs in the Department of Mathematics, especially to Dr. Weimin Han and Dr. Yi Li. They both helped me in so many ways during my study at the University of Iowa. I am grateful to Dr. Jian Huang and Dr. Michael Jones for their detailed review and valuable comments on this work. I owe my thanks to my wife Xiaoli, who brought our newborn son, Aaron. Without her love and encouragement, it would have been impossible for me to finish this work. My special gratitude is due to my parents, my parents-in-law. ii

7 ABSTRACT The analysis of joint distribution function with bivariate event time data is a challenging problem both theoretically and numerically. This thesis develops a tensor splinebased nonparametric maximum likelihood estimation method to estimate the joint distribution function with bivariate current status data. Tensor I-splines are developed to replace the traditional tensor B-splines in approximating joint distribution function in order to simplify the restricted maximum likelihood estimation problem in computing. The generalized gradient projection algorithm is used to compute the restricted optimization problem. We show that the proposed tensor splinebased nonparametric estimator is consistent and that the rate of convergence can be as good as n /. Simulation studies with moderate sample sizes show that the finite-sample performance of the proposed estimator is generally satisfactory. iii

8 TABLE OF CONTENTS LIST OF TABLES vi LIST OF FIGURES viii CHAPTER INTRODUCTION Background of Bivariate Current Status Data Thesis Objective and Proposed Method TECHNICAL BACKGROUNDS Splines The B-splines The M-splines The I-splines Monotone Spline Functions Partially Monotone Tensor Spline Functions Results on Empirical Process Theory Some Useful Results on B-splines TENSOR SPLINE-BASED SIEVE NONPARAMETRIC MAXIMUM LIKE- LIHOOD ESTIMATION METHOD Likelihood of Bivariate Current Status Data Spline-based Maximum Likelihood Estimation B-spline-based Estimation I-spline-based Estimation ASYMPTOTIC PROPERTIES Regularity Conditions Consistency Convergence Rate Technical Lemmas NUMERICAL STUDIES Computing Algorithm iv

9 5. Simulation Studies Design of the Studies Results of the Studies DISCUSSION AND FUTURE WORK REFERENCES APPENDIX v

10 LIST OF TABLES 5. Comparison of Bias and MSE of the proposed sieve maximum likelihood estimation and the maximum pseudo likelihood estimation of bivariate Clayton Copula distributed event times when the underlying distribution is correctly specified with sample size n = 00, Kendall s τ = 0.5 (the association parameter α = 5/) Comparison of Bias and MSE of the proposed sieve maximum likelihood estimation and the maximum pseudo likelihood estimation of bivariate Clayton Copula distributed event times when the underlying distribution is correctly specified with sample size n = 00, Kendall s τ = 0.5 (the association parameter α = 5/) Comparison of Bias and MSE of the proposed sieve maximum likelihood estimation and the maximum pseudo likelihood estimation of bivariate Clayton Copula distributed event times when the underlying distribution is correctly specified with sample size n = 00, Kendall s τ = 0.50 (the association parameter α = )) The comparison of Bias and MSE of the proposed sieve maximum likelihood estimation and the maximum pseudo likelihood estimation of bivariate Clayton Copula distributed event times when the underlying distribution is correctly specified with sample size n = 00, Kendall s τ = 0.50 (the association parameter α = ) Comparison of Bias and MSE of the proposed sieve maximum likelihood estimation and the maximum pseudo likelihood estimation of bivariate Clayton Copula distributed event times when the underlying distribution is correctly specified with sample size n = 00, Kendall s τ = 0.75 (the association parameter α = 7) Comparison of Bias and MSE of the proposed sieve maximum likelihood estimation and the pseudo maximum likelihood estimation of bivariate Clayton Copula distributed event times when the underlying distribution is correctly specified with sample size n = 00, Kendall s τ = 0.75 (the association parameter α = 7) vi

11 5.7 Comparison of Bias and MSE of the proposed sieve maximum likelihood estimation and the maximum pseudo likelihood estimation of bivariate Gumbel Copula distributed event times when the underlying distribution is incorrectly specified as bivariate Clayton Copula model with sample size n = 00, Kendall s τ = 0.5 (the association parameter α = /) Comparison of Bias and MSE of the proposed sieve maximum likelihood estimation and the maximum pseudo likelihood estimation of bivariate Gumbel Copula distributed event times when the underlying distribution is incorrectly specified as bivariate Clayton Copula model with sample size n = 00, Kendall s τ = 0.5 (the association parameter α = /) Comparison of Bias and MSE of the proposed sieve maximum likelihood estimation and the maximum pseudo likelihood estimation of bivariate Gumbel Copula distributed event times when the underlying distribution is incorrectly specified as bivariate Clayton Copula model with sample size n = 00, Kendall s τ = 0.50 (the association parameter α = ) Comparison of Bias and MSE of the proposed sieve maximum likelihood estimation and the maximum pseudo likelihood estimation of bivariate Gumbel Copula distributed event times when the underlying distribution is incorrectly specified as bivariate Clayton Copula model and sample size n = 00, Kendall s τ = 0.50 (the association parameter α = ) Comparison of Bias and MSE of the proposed sieve maximum likelihood estimation and the maximum pseudo likelihood estimation of bivariate Gumbel Copula distributed event times when the underlying distribution is incorrectly specified as bivariate Clayton Copula model with sample size n = 00, Kendall s τ = 0.75 (the association parameter α = ) Comparison of Bias and MSE of the proposed sieve maximum likelihood estimation and maximum pseudo likelihood estimation of bivariate Gumbel Copula distributed event times when the underlying distribution is incorrectly specified as bivariate Clayton Copula model with sample size n = 00, Kendall s τ = 0.75 (the association parameter α = ) vii

12 LIST OF FIGURES 5. Comparison of the proposed sieve maximum likelihood estimation ((a), (c)) and the maximum pseudo likelihood estimation ((b), (d)) for the joint distribution function of bivariate Clayton Copula model with the underlying distribution being correctly specified when sample size n = 00, Kendall s τ = 0.5 (the association parameter α = 5/) Comparison of the proposed sieve maximum likelihood estimation (Sieve) and the nonparametric maximum likelihood estimation (Pseudo) for the marginal distribution function of T when (T, T ) is distributed according to Clayton Copula model and n = 00, Kendall s τ = 0.5 (the association parameter α = 5/) Comparison of the proposed sieve maximum likelihood estimation ((a), (c)) and the maximum pseudo likelihood estimation ((b), (d)) for the joint distribution function of bivariate Clayton Copula model with the underlying distribution being correctly specified when sample size n = 00, Kendall s τ = 0.5 (the association parameter α = 5/) Comparison of the proposed sieve maximum likelihood estimation (Sieve) and the nonparametric maximum likelihood estimation (Pseudo) for the marginal distribution function of T when (T, T ) is distributed according to Clayton Copula model and n = 00, Kendall s τ = 0.5 (the association parameter α = 5/) Comparison of the proposed sieve maximum likelihood estimation ((a), (c)) and the maximum pseudo likelihood estimation ((b), (d)) for the joint distribution function of bivariate Clayton Copula model with the underlying distribution being correctly specified when sample size n = 00, Kendall s τ = 0.50 (the association parameter α = ) Comparison of the proposed sieve maximum likelihood estimation (Sieve) and the nonparametric maximum likelihood estimation (Pseudo) for the marginal distribution function of T when (T, T ) is distributed according to Clayton Copula model and n = 00, Kendall s τ = 0.50 (the association parameter α = ) viii

13 5.7 Comparison of the proposed sieve maximum likelihood estimation ((a), (c)) and the maximum pseudo likelihood estimation ((b), (d)) for the joint distribution function of bivariate Clayton Copula model with the underlying distribution being correctly specified when sample size n = 00, Kendall s τ = 0.50 (the association parameter α = ) Comparison of the proposed sieve maximum likelihood estimation (Sieve) and the nonparametric maximum likelihood estimation (Pseudo) for the marginal distribution function of T when (T, T ) is distributed according to Clayton Copula model and n = 00, Kendall s τ = 0.50 (the association parameter α = ) Comparison of the proposed sieve maximum likelihood estimation ((a), (c)) and the maximum pseudo likelihood estimation ((b), (d)) for the joint distribution function of bivariate Clayton Copula model with the underlying distribution being correctly specified when sample size n = 00, Kendall s τ = 0.75 (the association parameter α = 7) Comparison of the proposed sieve maximum likelihood estimation (Sieve) and the nonparametric maximum likelihood estimation (Pseudo) for the marginal distribution function of T when (T, T ) is distributed according to Clayton Copula model and n = 00, Kendall s τ = 0.75 (the association parameter α = 7) Comparison of the proposed sieve maximum likelihood estimation ((a), (c)) and the maximum pseudo likelihood estimation ((b), (d)) for the joint distribution function of bivariate Clayton Copula model with the underlying distribution being correctly specified when sample size n = 00, Kendall s τ = 0.75 (the association parameter α = 7) Comparison of the proposed sieve maximum likelihood estimation (Sieve) and the nonparametric maximum likelihood estimation (Pseudo) for the marginal distribution function of T when (T, T ) is distributed according to Clayton Copula model and n = 00, Kendall s τ = 0.75 (the association parameter α = 7) Comparison of the proposed sieve maximum likelihood estimation ((a), (c)) and the maximum pseudo likelihood estimation ((b), (d)) for the joint distribution function of bivariate Clayton Copula model with the underlying distribution being correctly specified when sample size n = 00, Kendall s τ = 0.5 (the association parameter α = /) ix

14 5. Comparison of the proposed sieve maximum likelihood estimation (Sieve) and the nonparametric maximum likelihood estimation (Pseudo) for the marginal distribution function of T when (T, T ) is distributed according to Gumbel Copula model and n = 00, Kendall s τ = 0.5 (the association parameter α = /) Comparison of the proposed sieve maximum likelihood estimation ((a), (c)) and the maximum pseudo likelihood estimation ((b), (d)) for the joint distribution function of bivariate Gumbel Copula model with the underlying distribution being incorrectly specified as bivariate Clayton Copula model when sample size n = 00, Kendall s τ = 0.5 (the association parameter α = /) Comparison of the proposed sieve maximum likelihood estimation (Sieve) and the nonparametric maximum likelihood estimation (Pseudo) for the marginal distribution function of T when (T, T ) is distributed according to Gumbel Copula model and n = 00, Kendall s τ = 0.5 (the association parameter α = /) Comparison of the proposed sieve maximum likelihood estimation ((a), (c)) and the maximum pseudo likelihood estimation ((b), (d)) for the joint distribution function of bivariate Gumbel Copula model with the underlying distribution being incorrectly specified as bivariate Clayton Copula model when sample size n = 00, Kendall s τ = 0.50 (the association parameter α = ) Comparison of the proposed sieve maximum likelihood estimation (Sieve) and the nonparametric maximum likelihood estimation (Pseudo) for the marginal distribution function of T when (T, T ) is distributed according to Gumbel Copula model and n = 00, Kendall s τ = 0.50 (the association parameter α = ) Comparison of the proposed sieve maximum likelihood estimation ((a), (c)) and the maximum pseudo likelihood estimation ((b), (d)) for the joint distribution function of bivariate Gumbel Copula model with the underlying distribution being incorrectly specified as bivariate Clayton Copula model when sample size n = 00, Kendall s τ = 0.50 (the association parameter α = ) Comparison of the proposed sieve maximum likelihood estimation (Sieve) and the nonparametric maximum likelihood estimation (Pseudo) for the marginal distribution function of T when (T, T ) is distributed according to Gumbel Copula model and n = 00, Kendall s τ = 0.50 (the association parameter α = ) x

15 5. Comparison of the proposed sieve maximum likelihood estimation ((a), (c)) and the maximum pseudo likelihood estimation ((b), (d)) for the joint distribution function of bivariate Gumbel Copula model with the underlying distribution being incorrectly specified as bivariate Clayton Copula model when sample size n = 00, Kendall s τ = 0.75 (the association parameter α = ) Comparison of the proposed sieve maximum likelihood estimation (Sieve) and the nonparametric maximum likelihood estimation (Pseudo) for the marginal distribution function of T when (T, T ) is distributed according to Gumbel Copula model and n = 00, Kendall s τ = 0.75 (the association parameter α = ) Comparison of the proposed sieve maximum likelihood estimation ((a), (c)) and the maximum pseudo likelihood estimation ((b), (d)) for the joint distribution function of bivariate Gumbel Copula model with the underlying distribution being incorrectly specified as bivariate Clayton Copula model when sample size n = 00, Kendall s τ = 0.75 (the association parameter α = ) Comparison of the proposed sieve maximum likelihood estimation (Sieve) and the nonparametric maximum likelihood estimation (Pseudo) for the marginal distribution function of T when (T, T ) is distributed according to Gumbel Copula model and n = 00, Kendall s τ = 0.75 (the association parameter α = ) xi

16 CHAPTER INTRODUCTION. Background of Bivariate Current Status Data In some survival analysis applications, observation of the random event time T is restricted to the knowledge of whether or not T exceeds a random monitoring time C. This type of data is known as current status data, and sometimes referred to as interval censored data case I (Groenoboom and Wellner (99)). Current status data arise naturally in many applications, see for example, in animal tumorigenicity experiments (Hoel and Walburg (97), Finkelstein and Wolfe (985), and Finkelstein (986)). In these examples, for each experimental animal, T is the time from exposure to a potential carcinogen until occurrence of the tumor, and C is the time, on the same scale, of sacrifice. Upon sacrifice, the presence or absence of the tumor can be determined providing current status information on T. Another example of current status data arises in studies of the distribution of the age at weaning (Diamond et al. (986), Diamond and McDonald (99), and Grummer-Strawn (99)). Here T represents the age of a child at weaning and C the age at observation. It also arises in studies of human immunodeficiency virus (HIV) and acquired immunodeficiency syndrome (AIDS) ( Shiboski and Jewell (99), and Jewell et al. (99)). The univariate current status data have been thoroughly studied in recent years. Groenoboom and Wellner (99) studied the asymptotic properties of the nonparametric maximum likelihood estimator of the distribution function with current status data. Huang (996) considered Cox s proportional hazards model with current status data and showed

17 that the MLE of the regression parameter is asymptotically normal with n convergence rate, even through the MLE of the baseline cumulative hazard function only converges at n / rate. For a review of regression models for interval censored data, see Huang and Wellner (997). Bivariate event time data occur in many applications. For example, in an Australian twin study (Duffy et al. (990)) the researchers were interested in times to a certain event such as disease or disease-related symptoms in both twins. As in univariate case, both failure times can be censored. Some work has addressed the estimation of the joint distribution function of the correlated event times with bivariate right censored data. Analogous to the well-known Kaplan-Meier estimator for survival function, the bivariate Kaplan-Meier estimation on plane was proposed by Dabrowska (988). Kooperberg (998) discussed a tensor-spline estimation of the logarithm of joint density function with bivariate right censored data without studying the asymptotic properties of the estimator. For bivariate interval censored data case, a nonparametric two-stage method was proposed to estimate the joint distribution function in the literature. In this method, first the non-zero mass intersection rectangles are found, and then EM algorithm is applied to find the MLE of the joint distribution (Betensky and Finkelstein (999) and Yu, et al. (000)). In the second stage the EM algorithm could be replaced by the more efficient Iterative Convex Minarant method proposed by Groenoboom and Wellner (99). But this nonparametric estimation is not uniquely defined (Yu, Wong and He (000)) because the non-zero mass rectangles can not be uniquely determined. Moreover, the asymptotic properties of this type of nonparametric estimators

18 are difficulty to study. This thesis focuses on bivariate current status data. Let (T, T ) be the two event times of interest and (C, C ) the two corresponding random monitoring times. In this setting, bivariate current status data consist of (C, C, = I(T C ), = I(T C )), where I( ) is the indicator function. This data structure arises in the studies of two diseases in same patients or some common disease for two correlated subjects. For example, Wang and Ding (000) studied whether or not the onsets of hypertension and diabetes are correlated for people in two towns in Taiwan. By assuming a bivariate copula model Wang and Ding (000) proposed a two-stage estimation of the association parameter of two event times, in which the joint distribution of the failure time variables is assumed to follow a bivariate copula model (Nelsen (006)). First the nonparametric estimates of the marginal distributions are obtained, and then the association parameter is estimated by the maximum pseudo-likelihood method. This two-stage method facilitates an easy estimator of the joint distribution function through copula model, and is the only available method in the literature to estimate the joint distribution function with bivariate current status data. But if the copula model is not correctly specified, this estimator could be seriously biased. Jewell et al. (005) studied the relationship between the time to HIV infection to the partner and the time to diagnosis of AIDS for the index case by estimating smooth functionals of marginal distribution functions. In both examples, the bivariate event times have the same monitoring time, that is C = C = C. Hence the joint distribution function can be only studied on the diagonal, that is, only F (c, c) is identifiable. However, the common censoring assump-

19 tion may not be true. For example, suppose a study is conducted to explore the times to first use of marijuana between siblings. Although the interview may be conducted at same times for both siblings, the ages at interview (monitoring times) of the siblings are different and these give arise to the bivariate current status data with possibly different C and C. Ding and Wang (00) proposed a nonparametric procedure for testing marginal dependence in the general scenario when two censoring times for two events could be different. But their goal was not the estimation of joint distribution function. This thesis is concerned about the estimation of joint distribution function in the general scenario when C and C may or may not be equal.. Thesis Objective and Proposed Method In this thesis we propose to estimate the joint distribution function with current status data, using the tensor spline-based sieve maximum likelihood method. Conventionally, the estimate can be defined using the method similar to Betensky and Finkelstein (999), and Yu et al. (000). For these methods, the non-zero masses are determined for the intersection rectangles made by the collection of monitoring times. In addition to the uniqueness problem, as sample size increases, this problem is a high dimensional problem which is hard to deal with. To overcome these difficulties, we proposed a partially monotone tensor spline-based sieve estimation method for the the general bivariate current status data described in Section.. Under some regularity conditions, we show the tensor spline estimator is consistent and we derive the convergence rate of this estimator. Our simulation studies indicate that the proposed tensor spline estimation method performs very well and

20 5 better than a three-stage pseudo likelihood method extended from Wang and Ding method (000). The rest of thesis is organized as follows. In Chapter, Some technical backgrounds are introduced. First, we introduce the B-splines and the I-splines. We study the equivalency between the B-spines and the I- splines. The partially monotone tensor B-splines are used in studying the asymptotic properties of the proposed estimator. The partially monotone tensor I-splines are used in computing the restricted maximum likelihood estimate, since the constraints of the problem using the tensor I-splines is much simpler than that using the tensor B-splines. Second we introduce some basic concepts and results on empirical process theory, which will be heavily used in consistency proof and the derivation of convergence rate. Finally, we derive some useful technical results on the B-splines and the tensor B-splines. In Chapter, the tensor spline-based sieve nonparametric maximum likelihood estimation method of joint distribution function with bivariate current status data is proposed. First, we derive the likelihood of bivariate current status data. Second we derive the likelihood with the B-splines, in which the partially monotone tensor B-spline function is used for the joint distribution function and the monotone B-spline functions are used for the two marginal distribution functions and we represent the spline-based sieve nonparametric maximum likelihood estimation problem as a constrained optimization problem with respect to the coefficients of the B-splines. Finally we similarly derive the likelihood with the I-splines and represent the problem in terms of the I-splines in order to have a set of constraints that are easily dealt with in computation.

21 6 In Chapter, the asymptotic properties of the proposed estimator are studied using the technical results on the B-splines. Some regularity conditions are provided for the theoretical development regarding the joint distribution function, the censoring time distribution functions and the B-splines. First, we show that the proposed estimator is consistent. Second, we derive the convergence rate of the proposed estimator, which can be as good as n /. In Chapter 5, the numerical studies are carried out. First, the generalized gradient projection method is introduced to compute the estimate. Then, extensive simulation studies are conducted to examine the finite sample performance of the proposed method. We also compare the proposed method with the Wang and Ding s method (000). In Chapter 6, we describe the further applications of the proposed method are discussed.

22 7 CHAPTER TECHNICAL BACKGROUNDS In this chapter, we describe three versions of spline functions and their relationships. We also summarize some results on modern empirical process theory that will be heavily used in studying the asymptotic properties of the spline-based nonparametric estimator of the joint distribution function with bivariate current status data.. Splines In this section, we introduce three types of splines, the B-splines, the M-splines, and the I-splines. We describe the constructions of these splines and their relationships... The B-splines The normalized B-splines or simply the B-splines for brevity in this thesis, can be evaluated by the de Boor algorithm (de Boor, 00) as follows. For a partition or a knot sequence, that is, a nondecreasing sequence {u i }, the B-splines of order with this knot sequence are the characteristic functions given by, u i s < u i+, Ni (s) = 0, otherwise. (.) If u i = u i+, N i (s) = 0. The key characteristics of these functions are (i) N i (s) for i =,, are right continuous; (ii) i N i (s) =.

23 8 recursively by From the B-splines of order (.) the B-splines of higher order can be obtained Ni k (s) = ωi k (s)n k i (s) + ( ωi+(s))n k k i+ (s), with ωi k (s) = s u i u i+k u i, u i+k u i, 0, otherwise. If u i+k u i and u i+k u i+, we have Ni k (s) = s u i N k i (s) + u i+k s N k i+ (s). (.) u i+k u i u i+k u i+ There are some important properties for the B-splines which will be used throughout the rest of thesis and are summarized below. (B) (Theorem.8, Schumaker (98)) The B-splines have support on several knot intervals. Specifically, for the B-splines of order l, if u j s < u j+, then Ni(s) l 0, j l + i j, = 0, else. (B) (Theorem.0, Schumaker (98)) We already mentioned that the B-splines of order form a partition of unity. Actually, this property is true for the B-splines of any order. Specifically, for the B-splines of order l Ni(s) s l we have Ni(s) l =. i

24 9 (B) (Theorem 5.9, Schumaker (98)) The derivatives of the B-splines can be calculated by: N l i(s) s = l N l l i (s) N l i+ u i+l u i u i+l u (s). i+.. The M-splines In numerical analysis the M-splines are non-negative spline functions. In this thesis the M-splines are used to construct the I-splines. Curry and Schoenberg (966) proposed the construction of the M-splines as follows. Suppose a knot sequence is given by {u i }, the M-splines of order for this knot sequence are defined as Mi (s) = u i+ u i, u i s < u i+, 0, otherwise. (.) From the M-splines of order (.) the M-splines of higher order can be obtained recursively by Mi(s) l = l[(s u i)m l i (s) + (u i+l s)m l i+ (s)]. (.) (l )(u i+l u i ) Lemma.. Suppose the M-splines given by (.) and the B-splines given by (.) are associated with the same knot sequence, then they are closely related by M l i(s) = l u i+l u i N l i(s). (.5) Proof. (i) Note that M i (s) = u i+ u i N i (s), so the relationship is true for l =. (ii) Suppose the relation holds for l = k, then M k i (s) = k[(s u i)m k i (s) + (u i+k s)m k i+ (s)] (k )(u i+k u i )

25 0 = k[(s u i) k u i+k u i N k i (s) + (u i+k s) k u i+k u i+ N k i+ (s)] (k )(u i+k u i ) = = k s u i [ N k i (s) + u i+k s N k i+ u i+k u i u i+k u i u i+k u (s)] i+ k u i+k u i N k i (s), which implies (.5) holds for l = k. By induction (.5) holds for any positive integer l, which completes the proof... The I-splines Ramsay (988) proposed the I-splines, which are monotone functions constrained between 0 and. The I-splines can be used as spline basis functions for regression analysis and data transformation when monotonicity is desired. The I-splines are constructed through the M-splines as follows. Suppose the M-splines M l i s have the knot sequence {u i } p+l+ satisfying u = = u l+ < u l+ < < u p < u p+ = = u p+l+, (.6) where the first and the last l + knots are equal because the M-splines are of order l. The I-splines are defined as, i =, Ii(s) l = s M l L i(t)dt, < i p, (.7) with L s U, where L and U are the left and the right end point of the knot sequence {u i } p+l+, respectively. Since M l i(s) s given by (.) are piecewise polynomials of degree l, then I l i(s) s given by (.7) are piecewise polynomials of degree l. De Boor (00) showed U L M l i(t) =

26 for positive integers i and l, which, along with the fact that Mi(t) l are nonnegative, implies that the I-splines in (.7) are monotone non-decreasing function constrained between 0 and. Suppose the knot sequence {u i } p+l+ of the M-splines M l+ s satisfies (.6), a more convenient expression of the I-splines was given by Ramsay (988) as the following: i Ii(s) l = 0, i > j, j m=i (u m+l+ u m )M l+ m (s)/(l + ), j l + i j, (.8), i < j l +, for u j s < u j+ and i p. The following Lemma. and Lemma. together indicate that the I-splines given by (.7) are equivalent to the I-splines given by (.8). Lemma.. Suppose M l+ i s in (.8) and N l+ i s are associated with the same knot sequence, then the I-splines given by (.8) can be expressed by for i p. I l i(s) = p m=i N l+ m (s), (.9) Proof. By Lemma., (.8) can be further expressed in terms of the B-splines: Ii(s) l = 0, i > j, j m=i N l+ m (s), j l + i j, (.0), i < j l +,

27 for u j s < u j+ and i p. By the B-splines property (B) in Section... for the B-spline of order l +, if u j s < u j+, N l+ m (s) 0, j (l + ) + m j, = 0, else. So the expression of the I-splines given by (.0) can be rewritten as I l i(s) = p m=i Nm l+ (s). Lemma.. Suppose M l+ i s in (.8) and N l+ i s in (.9) are associated with the same knot sequence, the I-splines given by (.9) are equivalent to the I-splines given by (.7). Proof. (i) We show p m=i N m l+ (s) = s L M l i(t)dt for i =,..., p. We shall demonstrate this by the following two steps: (a) Prove ( p ( p m=i N m l+ (s)) = ( s L M i l(t)dt) s s m=i N m l+ s (s)) = m=i as follows. p l { N l l u m+l u m(s) N m u m+l+ u m+(s)} l m+ l = N u i+l u i(s) l + i p m=i l = N u i+l u i(s) l = Mi(s) l i = ( s L M l i(t)dt) s l l u m+l+ u m+ N l m+(s) (b) p m=i N l+ m (L) = 0 = L L M l i(t)dt.

28 (a) and (b) imply for i =,..., p, p m=i N m l+ (s) and s L M l i(t)dt have same derivative and they both are zero at the left end point of the knot sequence. Therefore p m=i N m l+ (s) = s M l L i(t)dt, for i =,..., p. (ii) For i =, it is trivial because of property (B) in Section... Remark.. (.9) provides a much easier way to compute the I-splines than using definition (.7) due to the available softwares for the B-splines. In Appendix the explicit form of the I-spline with order is given, along with the steps to to construct the I-splines through the B-splines in statistics package R... Monotone Spline Functions In the literature, {N l i : i =,,, l =,, } are referred to as the B-spline basis functions. In this thesis, we denote the B-spline functions as the linear combinations of the B-spline basis functions. Suppose we have the lth-order B-spline basis functions N l i(s) with knot sequence {u i } p+l satisfying u = = u l < u l+ < u p < u p+ = = u p+l. A B-spline function of order l is given by f(s) = p β i Ni(s). l (.) i= It is obvious by (B) that j f(s) = β i Ni(s) l i=j l+ for u j s < u j+.

29 According to (B) and (.), the derivative of f(s) is given by f s = = p N β i(s) l i s p l β i { N l i (s) u i+l u i i= i= p = i= (l )(β i+ β i ) N l i+ u i+l u (s), i+ l u i+l u i+ N l i+ (s)} (.) the last equality due to the fact that N l (u) 0 and N l p+(u) 0. Lemma.. If 0 β β β p, the B-spline function f(s) given by (.) is a nonnegative and nondecreasing function. Proof. It is obvious that f(s) is nonnegative. By (.), the derivative of f(s) is nonnegative as well. Similarly the I-spline functions in the thesis are denoted by linear combinations of the I-spline basis functions {I l i : i =,,, l =,, }. Specifically, a I-spline function of (l )th-order is given by f(s) = p i= γ i I l i (s). (.) Lemma.5. If γ i 0 for i =,, p, f(s) given by (.) is nonnegative and nondecreasing. Proof. Because the I-spline basis functions are nonnegative and nondecreasing, γ i 0 for i =,, p are sufficient to guarantee f(s) for being nonnegative and nondecreasing. Lemma.6. f(s) = p i= β ini(s) l with 0 β β β p and f(s) = p i= γ ii l i (s) with γ i 0 for i =,, p are equivalent to each other.

30 5 Proof. For a monotone spline function expressed in the I-splines by (.), substitute I l i (s) = p j=i N j(s), l it follows that f(s) = p i { γ j }Ni(s). l i= j= By γ i 0, i =,, p, we can rewrite the spline function (.) as f(s) = p β i Ni(s), l i= where β i = i j= γ j, i =,..., p. Then 0 β β β p. Remark.. Lemma.6 implies that a nonnegative and nondecreasing spline function can be equivalently expressed by either the B-splines or the I-splines...5 Partially Monotone Tensor Spline Functions Suppose we have the lth-order B-splines N (),l i (s) with knot sequence {u i } p+l satisfying u = = u l < u l+ < u p < u p+ = = u p+l and the lth-order B-splines N (),l j (t) with knot sequence {v i } q+l satisfying v = = v l < v l+ < v q < v q+ = = v q+l. The tensor B-spline functions are given by f(s, t) = p i= q j= α i,j N (),l i (s)n (),l j (t). (.) By (B) the partial derivatives of the tensor B-spline function can be expressed by f(s, t) s = p i= q j= (l )(α i+,j α i,j ) N (),l i+ (s)n (),l j (t), (.5) u i+l u i+ and f(s, t) t = q p i= j= (l )(α i,j+ α i,j ) v j+l v j+ N (),l i (s)n (),l j+ (t). (.6)

31 6 Lemma.7. If 0 α,j α,j α p,j for j =,,, q and 0 α i, α i, α i,q for i =,,, p, the tensor B-spline function f(s, t) given by (.) is nonnegative and nondecreasing in both s and t directions. Proof. It is obvious that f(s, t) is nonnegative. By (.5) and (.6), both partial derivatives of f(s, t) are nonnegative. Similarly the tensor I-spline functions are given by Lemma.8. If η i,j f(s, t) = p i= q i= η i,j I (),l i (s)i (),l j (t). (.7) 0 for i =,, p and j =,, q, f(s, t) given by (.7) is nonnegative and nondecreasing in both s and t directions. Proof. Because the I-spline basis functions are nonnegative and nondecreasing, η i,j 0 for i =,..., p and j =,..., q are sufficient to guarantee f(s, t) for being nonnegative and nondecreasing in both s and t directions. Let S I = {f(s, t) = and p i= q j= S B = {f(s, t) = η i,j I (),l i p i= q j= (s)i (),l j (t) : η i,j 0 for i =,..., p and j =,..., q}, α i,j N (),l i (s)n (),l j (t) : α i,js satisfy (.8)}, 0 α,j α,j α p,j, for j =,,, q, 0 α i, α i, α i,q, for i =,,, p, (α i+,j+ α i+,j ) (α i,j+ α i,j ) 0, (.8) for i =,,, p and j =,,, q.

32 7 Lemma.9. Set S B is equivalent to set S I. Proof. Substitute I (),l i = p x=i N x (),l and I (),l j = q y=j N y (),l in (.7), then f(s, t) = p i= q { j= i j x= y= η x,y }N (),l i (s)n (),l j (t). (.9) Let α i,j = i x= j y= η x,y, then (.9) can be written as f(s, t) = p i= q j= α i,j N (),l i (s)n (),l j (t). Then the following results (i), (ii), (iii) and (iv) can be easily obtained. (i) α, = η, 0. (ii) For i =,..., p and j =,..., q, α i,j+ α i,j = = j+ i i η x,y x= y= i η x,j+ 0. x= x= y= j x= y= (iii) For i =,..., p and j =,..., q, we have i+ j i α i+,j α i,j = η x,y = j η i+,y 0. y= j x= y= η x,y η x,y

33 8 (iv) For i =,..., p and j =,..., q, i+ j+ (α i+,j+ α i+,j ) (α i,j+ α i,j ) =( η x,y x= y= ( i+ j+ i η x,y x= y= i+ = η x,j+ x= x= y= i x= j η x,y ) i x= y= η x,j+ j η x,y ) (.0) By (i), (ii), (iii) and (iv), S I and S B are equivalent. =η i+,j+ 0. Remark.. Lemma.9 implies the partially monotone tensor I-spline functions are not equivalent to the partially monotone tensor B-spline functions, since there must be an extra condition (.0) for the partially monotone tensor B-spline function to make them equivalent. However this condition is very necessary for estimating the joint distribution function, because it corresponds to the fact that the mass of the joint distribution on any rectangle region of its domain is nonnegative.. Results on Empirical Process Theory Empirical process theory is a very powerful tool in studying the asymptotic properties of nonparametric or semiparametric estimator in Statistics. In this section we introduce some basic concepts and results which will be used in our theoretical development. Definition.. (Covering number, van der Vaart and Wellner (996), P. 8) The covering number N(ɛ, F, ) is the minimal number of balls {g : g f < ɛ} of radius ɛ needed to cover the set F. The entropy with covering H(ɛ, F, ) is the logarithm of the covering number.

34 9 Definition.. (Bracketing number, van der Vaart and Wellner (996), P. 8) Given two functions l and u, the bracket [l, u] is the set of all functions f with l f u. An ɛ-bracket is a bracket [l, u] with u l < ɛ. The bracketing number N [ ] (ɛ, F, ) is the minimum number of ɛ-brackets needed to cover F. The entropy with bracketing H [ ] (ɛ, F, ) is the logarithm of the bracketing number. Definition.. (VC-index and VC-class of sets, van der Vaart and Wellner (996), P. - 5) Let C a collection of subsets of a set X. An arbitrary set of n points {x,..., x n } possesses n subsets. Say that C picks out a certain subset from {x,..., x n } if this can be formed as a set of the form C {x,..., x n } for a C in C. The collection C is said to shatter {x,..., x n } if each of its n subsets can be picked out in this manner. The VC-index V (C) of the class C is the smallest n for which no set of size n is shattered by C. A collection of measurable sets C is called a VC-class of sets if its VC-index is finite. Definition.. (Symmetric convex hull, van der Vaart and Wellner (996), P. ) The symmetric convex hull sconv(f) of a class of functions is defined as the set of functions m i= α if i, with m i= α i and each f i contained in F. Definition.5. (VC-class of functions, van der Vaart and Wellner (996), P. ) The subgraph of a function f : X R is the subset of X R given by {(x, t) : t < f(x)}. A collection F of measurable functions on a sample space is called a VC-class of functions, if the collection of all subgraphs of the functions in F forms a VC-classes of sets in X R. Definition.6. (Envelop function, van der Vaart and Wellner (996), P. 8) An envelop function of a class F is any function x F (x) such that f(x) F (x), for every x and

35 0 f F. In the following lemmas, the L r (Q) norm associated with probability measure Q is defined by f Lr (Q) = (Q f r ) /r = ( f r dq) /r, for r > 0, the empirical measure P n of a sample of random elements x,..., x n is defined by P n f = n n f(x i ), i= and accordingly G n is denoted as n(p n P ). Lemma.0. (Theorem 5.7 in van der Vaart (998)) Let M n be random functions and let M be a fixed function of θ such that for every ɛ > 0 sup M n (θ) M(θ) P 0, θ Θ sup M(θ) < M(θ 0 ). θ:d(θ,θ 0 ) ɛ Then any sequence of estimators ˆθ n with M n (ˆθ n ) M n (θ 0 ) o P () converges in probability to θ 0. Lemma.. (Theorem.6.7 in van der Vaart and Wellner (996)) For a VC-class of functions with measurable envelop function F and r, one has for any probability measure Q with F Lr(Q) > 0, N(ɛ F Lr (Q), F, L r (Q)) KV (F)(6e) V (F) ( ɛ )r(v (F) ), for a universal constant K and 0 < ɛ <.

36 Lemma.. (Theorem.6.9 in van der Vaart and Wellner (996)) Let Q be a probability measure, and let F be a class of measurable functions with measurable square integrable envelop F such that QF < and N(ɛ F L (Q), F, L (Q)) C( ɛ )V for 0 < ɛ <. Then there exists a constant K that depends on C and V only such that N(ɛ F L (Q), sconv(f), L (Q)) K( ɛ )V/(V +). Lemma.. (Theorem.5. in van der Vaart and Wellner (996)) Let F be a class of measurable functions that satisfies 0 sup Q log N(ɛ F L (Q), F, L (Q))dɛ <, where the envelop function F of F is square integrable and the supremum is taken over all finitely discrete probability measures Q with F L (Q) > 0. Let the classes F δ = {f g : f, g F, f g L (P ) < δ} and F δ be P -measurable for every δ > 0. If P F <, then F is P -Donsker. Lemma.. (Example.0.7 in van der Vaart and Wellner (996)) If F and G are P - Donsker Classes with sup f F G P f <, then the pairwise infima F G, the pairwise suprema F G, and pairwise sums F + G are P -Donsker classes. Lemma.5. (Corollary.. in van der Vaart and Wellner (996)) Let F be a class of measurable functions and the semi-norm ρ P on F be defined as ρ P (f) = {P (f P f) } /. Then F being P -Donsker class implies that, if f n f as n in semi-norm ρ P for all f n and f in F, then (P n P )(f n f) = o P (n / ). Lemma.6. (Lemma.. in van der Vaart and Wellner (996)) Let F be the class of measurable functions such that P f < δ and f M for every f in F. Then there

37 exists K > 0, which is related to M, such that E P G n F K J [ ] {δ, F, L (P )}[ + J [ ] {δ, F, L (P )} δ ], n where J [ ] {δ, F, L (P )} = δ 0 + log N[ ] {ɛ, F, L (P )}dɛ and G n F = sup f F G n f. Lemma.7. (Theorem.. in van der Vaart and Wellner (996)) For each n, let M n and M n be stochastic processes indexed by a set Θ Let θ n Θ and 0 δ n < η be arbitrary. Suppose that, for every n and δ n < δ η E sup M n (θ) M n (θ n ) δ, δ/<d n(θ,θ n) δ,θ Θ n sup n[(mn M n )(θ) (M n M n )(θ n )] + Cφ(δ), δ/<d n (θ,θ n ) δ,θ Θ n for functions φ such that δ φ(δ)/δ α is decreasing on (δ n, η), for some α <. Let r n Cδ n satisfy r nφ( r n ) n, for every n. If the sequence ˆθ n takes its values in Θ n and satisfies M n (ˆθ n ) M n (θ n ) O P (rn ) and d n (ˆθ n, θ n ) converges to zero in probability, then r n d n (ˆθ n, θ n ) = O P ().. Some Useful Results on B-splines Lemma.8. (Jackson type Theorem, De boor (00), P. 9) Suppose g(x) is a function with the continuous derivative dw g(x). Then there exists a B-spline function dx w Ag(x) = p i= β in l i(x) with order l of the B-spline basis functions satisfying l w + and have knot sequence {u i } p with L = u = = u l < u l+ < < u p < u p+ = = u p+l = U, such that g Ag c T w dw g dx w

38 for some constant c > 0 depending on l only, where T = max l i p (u i+ u i ). Lemma.9. The result of Lemma.8 can be generalized to bivariate function and is given in Lemma.9. Suppose g(x, y) is a bivariate function with the continuous mixed derivatives of order w, w mg = w g(x,y) x m y w m for m =,,..., w. Then there exists a bivariate tensor B- spline function Ag(x, y) = p i= q j= α i,jn (),l i (x)n (),l j (y) with order l in both directions satisfying l w + and have knot sequences {u i } p+l with L = u = = u l < u l+ < < u p < u p+ = = u p+l = U, {v i } q+l with L = v = = v l < v l+ < < v q < v q+ = = v q+l = U, such that g Ag c T w ( g w, ), for some constant c > 0 depending on l only, where T = max{max l i p (u i+ u i ), max l j q (v j+ v j )}, and g w, = max 0 m w w g x m y. w m Proof. We define ω(g; h) = max{ g(x, y ) g(x, y ) : x x h, y y h, x, x [L, U ], y, y [L, U ]}. Then ω(g; h) is a monotone and subadditivity function of h, that is, ω(g; h ) ω(g; h + h ) ω(g; h ) + ω(g; h ) for nonnegative h

39 and h. The monotonicity of ω(g; h) is obvious by the definition. The proof of subadditivity is as follows. For any (x, y ) and (x, y ) with x x h + h and y y h + h, we can find (x, y ) such that x x h, y y h and x x h, y y h. Therefore, for any x x h + h and y y h + h, we have g(x, y ) g(x, y ) g(x, y ) g(x, y ) + g(x, y ) g(x, y ) max g(x, y ) g(x, y ) x x h y y h + max x x h y y h g(x, y ) g(x, y ) (.) =ω(g; h ) + ω(g; h ). By (.), ω(g; h + h ) ω(g; h ) + ω(g; h ) for nonnegative h and h, that is, subadditivity of ω(g; h) holds. By choosing τ < τ < < τ p in [L, U ] and ξ < ξ < < ξ q in [L, U ], we can construct a partially monotone tensor B-spline function Ag to approximate the smooth function g on [L, U ] [L, U ] as follows. Ag(x, y) = p i= q j= g(τ i, ξ j )N (),l i (x)n (),l j (y) = p { i= q j= N (),l j (y)g(τ i, ξ j )}N (),l i (x). For (ˆx, ŷ) in [u j, u j +] [v j, v j +] [L, U ] [L, U ], Ag(ˆx, ŷ) = j j i=j + l j=j + l g(τ i, ξ j )N (),l i (ˆx)N (),l j (ŷ), (.)

40 5 by property (B) in Section... Also by property (B) in Section.., we have g(ˆx, ŷ) = g(ˆx, ŷ) = g(ˆx, ŷ) = g(ˆx, ŷ) j i=j + l j i=j + l j N (),l i (ˆx) { j j=j + l j i=j + l j=j + l N (),l j N (),l i (ŷ)}n (),l i (ˆx) (ˆx)N (),l j (ŷ). (.) By (.) and (.), g(ˆx, ŷ) Ag(ˆx, ŷ) = Then we have g(ˆx, ŷ) Ag(ˆx, ŷ) j j i=j + l j=j + l j j i=j + l j=j + l {g(ˆx, ŷ) g(τ i, ξ j )}N (),l g(ˆx, ŷ) g(τ i, ξ j ) N (),l i max g(ˆx, ŷ) g(τ i, ξ j ). j + l i j j + l j j i (ˆx)N (),l j (ŷ). (ˆx)N (),l j (ŷ) Now choose the τ i s and ξ j appropriately. since the number of τ i s is greater than the number of subintervals made from knot sequence {u i } p+l, in order to guarantee that τ i+ τ i > 0 for i =,, p, we might choose τ i = u + (i )(u l+ u l ) l, i =,..., l, u i, i = l +,..., p. (.) Similarly, we might choose ξ j = v + (j )(v l+ v l ) l, j =,..., l, v j, j = l +,..., q, (.5)

41 6 (.) and (.5) imply τ i u i T and ξ j v j T for i =,..., p and j =,..., q. We also know u i ˆx u j + u j l+ l T for j l < i j and ˆx [u j, u j +] and v j ŷ v j + v j l+ l T for j l < j j and ŷ [v j, v j +]. So we have for j l < i j and ˆx [u j, u j +] τ i ˆx (l + ) T, and for j l < j j and ŷ [v j, v j +] ξ j ŷ (l + ) T. Therefore we have max g(ˆx, ŷ) g(τ i, ξ j ) max{ g(x, y ) g(x, y ) : j + l i j j + l j j x x (l + ) T, y y (l + ) T } (.6) =ω(g; (l + ) T ) =(l + )ω(g; T ), the last inequality is by subadditivity of ω(g; h). By (.6) we have g Ag = which means the distance between g and ψ l,l sup L x U L y U g(x, y) Ag(x, y) (l + )ω(g; T ), d(g, ψ l,l ) = inf s ψ l,l g s (l + )ω(g; T ), (.7)

Introduction to Empirical Processes and Semiparametric Inference Lecture 13: Entropy Calculations

Introduction to Empirical Processes and Semiparametric Inference Lecture 13: Entropy Calculations Introduction to Empirical Processes and Semiparametric Inference Lecture 13: Entropy Calculations Michael R. Kosorok, Ph.D. Professor and Chair of Biostatistics Professor of Statistics and Operations Research

More information

Introduction to Empirical Processes and Semiparametric Inference Lecture 12: Glivenko-Cantelli and Donsker Results

Introduction to Empirical Processes and Semiparametric Inference Lecture 12: Glivenko-Cantelli and Donsker Results Introduction to Empirical Processes and Semiparametric Inference Lecture 12: Glivenko-Cantelli and Donsker Results Michael R. Kosorok, Ph.D. Professor and Chair of Biostatistics Professor of Statistics

More information

Estimation of Conditional Kendall s Tau for Bivariate Interval Censored Data

Estimation of Conditional Kendall s Tau for Bivariate Interval Censored Data Communications for Statistical Applications and Methods 2015, Vol. 22, No. 6, 599 604 DOI: http://dx.doi.org/10.5351/csam.2015.22.6.599 Print ISSN 2287-7843 / Online ISSN 2383-4757 Estimation of Conditional

More information

Inferences about Parameters of Trivariate Normal Distribution with Missing Data

Inferences about Parameters of Trivariate Normal Distribution with Missing Data Florida International University FIU Digital Commons FIU Electronic Theses and Dissertations University Graduate School 7-5-3 Inferences about Parameters of Trivariate Normal Distribution with Missing

More information

Introduction to Empirical Processes and Semiparametric Inference Lecture 02: Overview Continued

Introduction to Empirical Processes and Semiparametric Inference Lecture 02: Overview Continued Introduction to Empirical Processes and Semiparametric Inference Lecture 02: Overview Continued Michael R. Kosorok, Ph.D. Professor and Chair of Biostatistics Professor of Statistics and Operations Research

More information

PENALIZED LIKELIHOOD PARAMETER ESTIMATION FOR ADDITIVE HAZARD MODELS WITH INTERVAL CENSORED DATA

PENALIZED LIKELIHOOD PARAMETER ESTIMATION FOR ADDITIVE HAZARD MODELS WITH INTERVAL CENSORED DATA PENALIZED LIKELIHOOD PARAMETER ESTIMATION FOR ADDITIVE HAZARD MODELS WITH INTERVAL CENSORED DATA Kasun Rathnayake ; A/Prof Jun Ma Department of Statistics Faculty of Science and Engineering Macquarie University

More information

The extreme points of symmetric norms on R^2

The extreme points of symmetric norms on R^2 Graduate Theses and Dissertations Iowa State University Capstones, Theses and Dissertations 2008 The extreme points of symmetric norms on R^2 Anchalee Khemphet Iowa State University Follow this and additional

More information

The nonparametric least-squares method for estimating monotone functions with intervalcensored

The nonparametric least-squares method for estimating monotone functions with intervalcensored University of Iowa Iowa Research Online Theses and Dissertations Spring 2012 The nonparametric least-squares method for estimating monotone functions with intervalcensored observations Gang Cheng University

More information

Nonparametric estimation for current status data with competing risks

Nonparametric estimation for current status data with competing risks Nonparametric estimation for current status data with competing risks Marloes Henriëtte Maathuis A dissertation submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy

More information

Two-level lognormal frailty model and competing risks model with missing cause of failure

Two-level lognormal frailty model and competing risks model with missing cause of failure University of Iowa Iowa Research Online Theses and Dissertations Spring 2012 Two-level lognormal frailty model and competing risks model with missing cause of failure Xiongwen Tang University of Iowa Copyright

More information

Estimation of the Mean Function with Panel Count Data Using Monotone Polynomial Splines

Estimation of the Mean Function with Panel Count Data Using Monotone Polynomial Splines Estimation of the Mean Function with Panel Count Data Using Monotone Polynomial Splines By MINGGEN LU, YING ZHANG Department of Biostatistics, The University of Iowa, 200 Hawkins Drive, C22 GH Iowa City,

More information

Topics in Current Status Data. Karen Michelle McKeown. A dissertation submitted in partial satisfaction of the. requirements for the degree of

Topics in Current Status Data. Karen Michelle McKeown. A dissertation submitted in partial satisfaction of the. requirements for the degree of Topics in Current Status Data by Karen Michelle McKeown A dissertation submitted in partial satisfaction of the requirements for the degree of Doctor in Philosophy in Biostatistics in the Graduate Division

More information

1 Glivenko-Cantelli type theorems

1 Glivenko-Cantelli type theorems STA79 Lecture Spring Semester Glivenko-Cantelli type theorems Given i.i.d. observations X,..., X n with unknown distribution function F (t, consider the empirical (sample CDF ˆF n (t = I [Xi t]. n Then

More information

Reduced [tau]_n-factorizations in Z and [tau]_nfactorizations

Reduced [tau]_n-factorizations in Z and [tau]_nfactorizations University of Iowa Iowa Research Online Theses and Dissertations Summer 2013 Reduced [tau]_n-factorizations in Z and [tau]_nfactorizations in N Alina Anca Florescu University of Iowa Copyright 2013 Alina

More information

1. Introduction In many biomedical studies, the random survival time of interest is never observed and is only known to lie before an inspection time

1. Introduction In many biomedical studies, the random survival time of interest is never observed and is only known to lie before an inspection time ASYMPTOTIC PROPERTIES OF THE GMLE WITH CASE 2 INTERVAL-CENSORED DATA By Qiqing Yu a;1 Anton Schick a, Linxiong Li b;2 and George Y. C. Wong c;3 a Dept. of Mathematical Sciences, Binghamton University,

More information

Doubly Indexed Infinite Series

Doubly Indexed Infinite Series The Islamic University of Gaza Deanery of Higher studies Faculty of Science Department of Mathematics Doubly Indexed Infinite Series Presented By Ahed Khaleel Abu ALees Supervisor Professor Eissa D. Habil

More information

Part III. Hypothesis Testing. III.1. Log-rank Test for Right-censored Failure Time Data

Part III. Hypothesis Testing. III.1. Log-rank Test for Right-censored Failure Time Data 1 Part III. Hypothesis Testing III.1. Log-rank Test for Right-censored Failure Time Data Consider a survival study consisting of n independent subjects from p different populations with survival functions

More information

Lecture 5 Models and methods for recurrent event data

Lecture 5 Models and methods for recurrent event data Lecture 5 Models and methods for recurrent event data Recurrent and multiple events are commonly encountered in longitudinal studies. In this chapter we consider ordered recurrent and multiple events.

More information

ESTIMATING STATISTICAL CHARACTERISTICS UNDER INTERVAL UNCERTAINTY AND CONSTRAINTS: MEAN, VARIANCE, COVARIANCE, AND CORRELATION ALI JALAL-KAMALI

ESTIMATING STATISTICAL CHARACTERISTICS UNDER INTERVAL UNCERTAINTY AND CONSTRAINTS: MEAN, VARIANCE, COVARIANCE, AND CORRELATION ALI JALAL-KAMALI ESTIMATING STATISTICAL CHARACTERISTICS UNDER INTERVAL UNCERTAINTY AND CONSTRAINTS: MEAN, VARIANCE, COVARIANCE, AND CORRELATION ALI JALAL-KAMALI Department of Computer Science APPROVED: Vladik Kreinovich,

More information

Product-limit estimators of the survival function with left or right censored data

Product-limit estimators of the survival function with left or right censored data Product-limit estimators of the survival function with left or right censored data 1 CREST-ENSAI Campus de Ker-Lann Rue Blaise Pascal - BP 37203 35172 Bruz cedex, France (e-mail: patilea@ensai.fr) 2 Institut

More information

Harvard University. Harvard University Biostatistics Working Paper Series

Harvard University. Harvard University Biostatistics Working Paper Series Harvard University Harvard University Biostatistics Working Paper Series Year 2008 Paper 85 Semiparametric Maximum Likelihood Estimation in Normal Transformation Models for Bivariate Survival Data Yi Li

More information

Statistical Properties of Numerical Derivatives

Statistical Properties of Numerical Derivatives Statistical Properties of Numerical Derivatives Han Hong, Aprajit Mahajan, and Denis Nekipelov Stanford University and UC Berkeley November 2010 1 / 63 Motivation Introduction Many models have objective

More information

Nonparametric predictive inference with parametric copulas for combining bivariate diagnostic tests

Nonparametric predictive inference with parametric copulas for combining bivariate diagnostic tests Nonparametric predictive inference with parametric copulas for combining bivariate diagnostic tests Noryanti Muhammad, Universiti Malaysia Pahang, Malaysia, noryanti@ump.edu.my Tahani Coolen-Maturi, Durham

More information

Estimation of the Bivariate and Marginal Distributions with Censored Data

Estimation of the Bivariate and Marginal Distributions with Censored Data Estimation of the Bivariate and Marginal Distributions with Censored Data Michael Akritas and Ingrid Van Keilegom Penn State University and Eindhoven University of Technology May 22, 2 Abstract Two new

More information

Semiparametric maximum likelihood estimation in normal transformation models for bivariate survival data

Semiparametric maximum likelihood estimation in normal transformation models for bivariate survival data Biometrika (28), 95, 4,pp. 947 96 C 28 Biometrika Trust Printed in Great Britain doi: 1.193/biomet/asn49 Semiparametric maximum likelihood estimation in normal transformation models for bivariate survival

More information

A Study on the Correlation of Bivariate And Trivariate Normal Models

A Study on the Correlation of Bivariate And Trivariate Normal Models Florida International University FIU Digital Commons FIU Electronic Theses and Dissertations University Graduate School 11-1-2013 A Study on the Correlation of Bivariate And Trivariate Normal Models Maria

More information

STATISTICAL ANALYSIS OF MULTIVARIATE INTERVAL-CENSORED FAILURE TIME DATA. A Dissertation Presented. the Faculty of the Graduate School

STATISTICAL ANALYSIS OF MULTIVARIATE INTERVAL-CENSORED FAILURE TIME DATA. A Dissertation Presented. the Faculty of the Graduate School STATISTICAL ANALYSIS OF MULTIVARIATE INTERVAL-CENSORED FAILURE TIME DATA A Dissertation Presented to the Faculty of the Graduate School University of Missouri-Columbia In Partial Fulfillment Of the Requirements

More information

STAT331. Cox s Proportional Hazards Model

STAT331. Cox s Proportional Hazards Model STAT331 Cox s Proportional Hazards Model In this unit we introduce Cox s proportional hazards (Cox s PH) model, give a heuristic development of the partial likelihood function, and discuss adaptations

More information

Efficiency of Profile/Partial Likelihood in the Cox Model

Efficiency of Profile/Partial Likelihood in the Cox Model Efficiency of Profile/Partial Likelihood in the Cox Model Yuichi Hirose School of Mathematics, Statistics and Operations Research, Victoria University of Wellington, New Zealand Summary. This paper shows

More information

Frailty Probit model for multivariate and clustered interval-censor

Frailty Probit model for multivariate and clustered interval-censor Frailty Probit model for multivariate and clustered interval-censored failure time data University of South Carolina Department of Statistics June 4, 2013 Outline Introduction Proposed models Simulation

More information

Estimation for state space models: quasi-likelihood and asymptotic quasi-likelihood approaches

Estimation for state space models: quasi-likelihood and asymptotic quasi-likelihood approaches University of Wollongong Research Online University of Wollongong Thesis Collection 1954-2016 University of Wollongong Thesis Collections 2008 Estimation for state space models: quasi-likelihood and asymptotic

More information

A Least-Squares Approach to Consistent Information Estimation in Semiparametric Models

A Least-Squares Approach to Consistent Information Estimation in Semiparametric Models A Least-Squares Approach to Consistent Information Estimation in Semiparametric Models Jian Huang Department of Statistics University of Iowa Ying Zhang and Lei Hua Department of Biostatistics University

More information

CHAPTER 10 Shape Preserving Properties of B-splines

CHAPTER 10 Shape Preserving Properties of B-splines CHAPTER 10 Shape Preserving Properties of B-splines In earlier chapters we have seen a number of examples of the close relationship between a spline function and its B-spline coefficients This is especially

More information

Semi-parametric predictive inference for bivariate data using copulas

Semi-parametric predictive inference for bivariate data using copulas Semi-parametric predictive inference for bivariate data using copulas Tahani Coolen-Maturi a, Frank P.A. Coolen b,, Noryanti Muhammad b a Durham University Business School, Durham University, Durham, DH1

More information

arxiv:submit/ [math.st] 6 May 2011

arxiv:submit/ [math.st] 6 May 2011 A Continuous Mapping Theorem for the Smallest Argmax Functional arxiv:submit/0243372 [math.st] 6 May 2011 Emilio Seijo and Bodhisattva Sen Columbia University Abstract This paper introduces a version of

More information

GOODNESS-OF-FIT TESTS FOR ARCHIMEDEAN COPULA MODELS

GOODNESS-OF-FIT TESTS FOR ARCHIMEDEAN COPULA MODELS Statistica Sinica 20 (2010), 441-453 GOODNESS-OF-FIT TESTS FOR ARCHIMEDEAN COPULA MODELS Antai Wang Georgetown University Medical Center Abstract: In this paper, we propose two tests for parametric models

More information

ON THE CHEBYSHEV POLYNOMIALS. Contents. 2. A Result on Linear Functionals on P n 4 Acknowledgments 7 References 7

ON THE CHEBYSHEV POLYNOMIALS. Contents. 2. A Result on Linear Functionals on P n 4 Acknowledgments 7 References 7 ON THE CHEBYSHEV POLYNOMIALS JOSEPH DICAPUA Abstract. This paper is a short exposition of several magnificent properties of the Chebyshev polynomials. The author illustrates how the Chebyshev polynomials

More information

Lecture 2: Uniform Entropy

Lecture 2: Uniform Entropy STAT 583: Advanced Theory of Statistical Inference Spring 218 Lecture 2: Uniform Entropy Lecturer: Fang Han April 16 Disclaimer: These notes have not been subjected to the usual scrutiny reserved for formal

More information

MULTIVARIATE DISCRETE PHASE-TYPE DISTRIBUTIONS

MULTIVARIATE DISCRETE PHASE-TYPE DISTRIBUTIONS MULTIVARIATE DISCRETE PHASE-TYPE DISTRIBUTIONS By MATTHEW GOFF A dissertation submitted in partial fulfillment of the requirements for the degree of DOCTOR OF PHILOSOPHY WASHINGTON STATE UNIVERSITY Department

More information

asymptotic normality of nonparametric M-estimators with applications to hypothesis testing for panel count data

asymptotic normality of nonparametric M-estimators with applications to hypothesis testing for panel count data asymptotic normality of nonparametric M-estimators with applications to hypothesis testing for panel count data Xingqiu Zhao and Ying Zhang The Hong Kong Polytechnic University and Indiana University Abstract:

More information

Tests of independence for censored bivariate failure time data

Tests of independence for censored bivariate failure time data Tests of independence for censored bivariate failure time data Abstract Bivariate failure time data is widely used in survival analysis, for example, in twins study. This article presents a class of χ

More information

The Kth M-ary Partition Function

The Kth M-ary Partition Function Indiana University of Pennsylvania Knowledge Repository @ IUP Theses and Dissertations (All) Fall 12-2016 The Kth M-ary Partition Function Laura E. Rucci Follow this and additional works at: http://knowledge.library.iup.edu/etd

More information

Information geometry for bivariate distribution control

Information geometry for bivariate distribution control Information geometry for bivariate distribution control C.T.J.Dodson + Hong Wang Mathematics + Control Systems Centre, University of Manchester Institute of Science and Technology Optimal control of stochastic

More information

and Comparison with NPMLE

and Comparison with NPMLE NONPARAMETRIC BAYES ESTIMATOR OF SURVIVAL FUNCTIONS FOR DOUBLY/INTERVAL CENSORED DATA and Comparison with NPMLE Mai Zhou Department of Statistics, University of Kentucky, Lexington, KY 40506 USA http://ms.uky.edu/

More information

Effectiveness of the hybrid Levine equipercentile and modified frequency estimation equating methods under the common-item nonequivalent groups design

Effectiveness of the hybrid Levine equipercentile and modified frequency estimation equating methods under the common-item nonequivalent groups design University of Iowa Iowa Research Online Theses and Dissertations 27 Effectiveness of the hybrid Levine equipercentile and modified frequency estimation equating methods under the common-item nonequivalent

More information

An Approach to Constructing Good Two-level Orthogonal Factorial Designs with Large Run Sizes

An Approach to Constructing Good Two-level Orthogonal Factorial Designs with Large Run Sizes An Approach to Constructing Good Two-level Orthogonal Factorial Designs with Large Run Sizes by Chenlu Shi B.Sc. (Hons.), St. Francis Xavier University, 013 Project Submitted in Partial Fulfillment of

More information

Empirical Risk Minimization

Empirical Risk Minimization Empirical Risk Minimization Fabrice Rossi SAMM Université Paris 1 Panthéon Sorbonne 2018 Outline Introduction PAC learning ERM in practice 2 General setting Data X the input space and Y the output space

More information

Optimal Treatment Regimes for Survival Endpoints from a Classification Perspective. Anastasios (Butch) Tsiatis and Xiaofei Bai

Optimal Treatment Regimes for Survival Endpoints from a Classification Perspective. Anastasios (Butch) Tsiatis and Xiaofei Bai Optimal Treatment Regimes for Survival Endpoints from a Classification Perspective Anastasios (Butch) Tsiatis and Xiaofei Bai Department of Statistics North Carolina State University 1/35 Optimal Treatment

More information

Ph.D. Qualifying Exam Friday Saturday, January 6 7, 2017

Ph.D. Qualifying Exam Friday Saturday, January 6 7, 2017 Ph.D. Qualifying Exam Friday Saturday, January 6 7, 2017 Put your solution to each problem on a separate sheet of paper. Problem 1. (5106) Let X 1, X 2,, X n be a sequence of i.i.d. observations from a

More information

Stochastic Comparisons of Order Statistics from Generalized Normal Distributions

Stochastic Comparisons of Order Statistics from Generalized Normal Distributions A^VÇÚO 1 33 ò 1 6 Ï 2017 c 12 Chinese Journal of Applied Probability and Statistics Dec. 2017 Vol. 33 No. 6 pp. 591-607 doi: 10.3969/j.issn.1001-4268.2017.06.004 Stochastic Comparisons of Order Statistics

More information

A FRAILTY MODEL APPROACH FOR REGRESSION ANALYSIS OF BIVARIATE INTERVAL-CENSORED SURVIVAL DATA

A FRAILTY MODEL APPROACH FOR REGRESSION ANALYSIS OF BIVARIATE INTERVAL-CENSORED SURVIVAL DATA Statistica Sinica 23 (2013), 383-408 doi:http://dx.doi.org/10.5705/ss.2011.151 A FRAILTY MODEL APPROACH FOR REGRESSION ANALYSIS OF BIVARIATE INTERVAL-CENSORED SURVIVAL DATA Chi-Chung Wen and Yi-Hau Chen

More information

Notes, March 4, 2013, R. Dudley Maximum likelihood estimation: actual or supposed

Notes, March 4, 2013, R. Dudley Maximum likelihood estimation: actual or supposed 18.466 Notes, March 4, 2013, R. Dudley Maximum likelihood estimation: actual or supposed 1. MLEs in exponential families Let f(x,θ) for x X and θ Θ be a likelihood function, that is, for present purposes,

More information

Inconsistency of the MLE for the joint distribution. of interval censored survival times. and continuous marks

Inconsistency of the MLE for the joint distribution. of interval censored survival times. and continuous marks Inconsistency of the MLE for the joint distribution of interval censored survival times and continuous marks By M.H. Maathuis and J.A. Wellner Department of Statistics, University of Washington, Seattle,

More information

An introduction to Birkhoff normal form

An introduction to Birkhoff normal form An introduction to Birkhoff normal form Dario Bambusi Dipartimento di Matematica, Universitá di Milano via Saldini 50, 0133 Milano (Italy) 19.11.14 1 Introduction The aim of this note is to present an

More information

Inference on Association Measure for Bivariate Survival Data with Hybrid Censoring

Inference on Association Measure for Bivariate Survival Data with Hybrid Censoring Inference on Association Measure for Bivariate Survival Data with Hybrid Censoring By SUHONG ZHANG, YING ZHANG, KATHRYN CHALONER Department of Biostatistics, The University of Iowa, C22 GH, 200 Hawkins

More information

Lifetime prediction and confidence bounds in accelerated degradation testing for lognormal response distributions with an Arrhenius rate relationship

Lifetime prediction and confidence bounds in accelerated degradation testing for lognormal response distributions with an Arrhenius rate relationship Scholars' Mine Doctoral Dissertations Student Research & Creative Works Spring 01 Lifetime prediction and confidence bounds in accelerated degradation testing for lognormal response distributions with

More information

Computing and Communicating Functions over Sensor Networks

Computing and Communicating Functions over Sensor Networks Computing and Communicating Functions over Sensor Networks Solmaz Torabi Dept. of Electrical and Computer Engineering Drexel University solmaz.t@drexel.edu Advisor: Dr. John M. Walsh 1/35 1 Refrences [1]

More information

Empirical Processes: General Weak Convergence Theory

Empirical Processes: General Weak Convergence Theory Empirical Processes: General Weak Convergence Theory Moulinath Banerjee May 18, 2010 1 Extended Weak Convergence The lack of measurability of the empirical process with respect to the sigma-field generated

More information

The Liapunov Method for Determining Stability (DRAFT)

The Liapunov Method for Determining Stability (DRAFT) 44 The Liapunov Method for Determining Stability (DRAFT) 44.1 The Liapunov Method, Naively Developed In the last chapter, we discussed describing trajectories of a 2 2 autonomous system x = F(x) as level

More information

A Recursive Formula for the Kaplan-Meier Estimator with Mean Constraints

A Recursive Formula for the Kaplan-Meier Estimator with Mean Constraints Noname manuscript No. (will be inserted by the editor) A Recursive Formula for the Kaplan-Meier Estimator with Mean Constraints Mai Zhou Yifan Yang Received: date / Accepted: date Abstract In this note

More information

Introduction to Empirical Processes and Semiparametric Inference Lecture 08: Stochastic Convergence

Introduction to Empirical Processes and Semiparametric Inference Lecture 08: Stochastic Convergence Introduction to Empirical Processes and Semiparametric Inference Lecture 08: Stochastic Convergence Michael R. Kosorok, Ph.D. Professor and Chair of Biostatistics Professor of Statistics and Operations

More information

Lebesgue Measure on R n

Lebesgue Measure on R n CHAPTER 2 Lebesgue Measure on R n Our goal is to construct a notion of the volume, or Lebesgue measure, of rather general subsets of R n that reduces to the usual volume of elementary geometrical sets

More information

Research Article Strong Convergence of a Projected Gradient Method

Research Article Strong Convergence of a Projected Gradient Method Applied Mathematics Volume 2012, Article ID 410137, 10 pages doi:10.1155/2012/410137 Research Article Strong Convergence of a Projected Gradient Method Shunhou Fan and Yonghong Yao Department of Mathematics,

More information

Inferences on a Normal Covariance Matrix and Generalized Variance with Monotone Missing Data

Inferences on a Normal Covariance Matrix and Generalized Variance with Monotone Missing Data Journal of Multivariate Analysis 78, 6282 (2001) doi:10.1006jmva.2000.1939, available online at http:www.idealibrary.com on Inferences on a Normal Covariance Matrix and Generalized Variance with Monotone

More information

Survival Analysis for Interval Censored Data - Nonparametric Estimation

Survival Analysis for Interval Censored Data - Nonparametric Estimation Survival Analysis for Interval Censored Data - Nonparametric Estimation Seminar of Statistics, ETHZ Group 8, 02. May 2011 Martina Albers, Nanina Anderegg, Urs Müller Overview Examples Nonparametric MLE

More information

Zero sum games Proving the vn theorem. Zero sum games. Roberto Lucchetti. Politecnico di Milano

Zero sum games Proving the vn theorem. Zero sum games. Roberto Lucchetti. Politecnico di Milano Politecnico di Milano General form Definition A two player zero sum game in strategic form is the triplet (X, Y, f : X Y R) f (x, y) is what Pl1 gets from Pl2, when they play x, y respectively. Thus g

More information

Empirical Processes & Survival Analysis. The Functional Delta Method

Empirical Processes & Survival Analysis. The Functional Delta Method STAT/BMI 741 University of Wisconsin-Madison Empirical Processes & Survival Analysis Lecture 3 The Functional Delta Method Lu Mao lmao@biostat.wisc.edu 3-1 Objectives By the end of this lecture, you will

More information

Lecture 3. Truncation, length-bias and prevalence sampling

Lecture 3. Truncation, length-bias and prevalence sampling Lecture 3. Truncation, length-bias and prevalence sampling 3.1 Prevalent sampling Statistical techniques for truncated data have been integrated into survival analysis in last two decades. Truncation in

More information

The Closed Form Reproducing Polynomial Particle Shape Functions for Meshfree Particle Methods

The Closed Form Reproducing Polynomial Particle Shape Functions for Meshfree Particle Methods The Closed Form Reproducing Polynomial Particle Shape Functions for Meshfree Particle Methods by Hae-Soo Oh Department of Mathematics, University of North Carolina at Charlotte, Charlotte, NC 28223 June

More information

Approximation of Survival Function by Taylor Series for General Partly Interval Censored Data

Approximation of Survival Function by Taylor Series for General Partly Interval Censored Data Malaysian Journal of Mathematical Sciences 11(3): 33 315 (217) MALAYSIAN JOURNAL OF MATHEMATICAL SCIENCES Journal homepage: http://einspem.upm.edu.my/journal Approximation of Survival Function by Taylor

More information

Bayesian Nonparametric Point Estimation Under a Conjugate Prior

Bayesian Nonparametric Point Estimation Under a Conjugate Prior University of Pennsylvania ScholarlyCommons Statistics Papers Wharton Faculty Research 5-15-2002 Bayesian Nonparametric Point Estimation Under a Conjugate Prior Xuefeng Li University of Pennsylvania Linda

More information

Efficient Semiparametric Estimators via Modified Profile Likelihood in Frailty & Accelerated-Failure Models

Efficient Semiparametric Estimators via Modified Profile Likelihood in Frailty & Accelerated-Failure Models NIH Talk, September 03 Efficient Semiparametric Estimators via Modified Profile Likelihood in Frailty & Accelerated-Failure Models Eric Slud, Math Dept, Univ of Maryland Ongoing joint project with Ilia

More information

Semiparametric Regression Analysis of Panel Count Data and Interval-Censored Failure Time Data

Semiparametric Regression Analysis of Panel Count Data and Interval-Censored Failure Time Data University of South Carolina Scholar Commons Theses and Dissertations 2016 Semiparametric Regression Analysis of Panel Count Data and Interval-Censored Failure Time Data Bin Yao University of South Carolina

More information

Semiparametric Regression Analysis of Bivariate Interval-Censored Data

Semiparametric Regression Analysis of Bivariate Interval-Censored Data University of South Carolina Scholar Commons Theses and Dissertations 12-15-2014 Semiparametric Regression Analysis of Bivariate Interval-Censored Data Naichen Wang University of South Carolina - Columbia

More information

Suppose R is an ordered ring with positive elements P.

Suppose R is an ordered ring with positive elements P. 1. The real numbers. 1.1. Ordered rings. Definition 1.1. By an ordered commutative ring with unity we mean an ordered sextuple (R, +, 0,, 1, P ) such that (R, +, 0,, 1) is a commutative ring with unity

More information

CHAPTER 3 Further properties of splines and B-splines

CHAPTER 3 Further properties of splines and B-splines CHAPTER 3 Further properties of splines and B-splines In Chapter 2 we established some of the most elementary properties of B-splines. In this chapter our focus is on the question What kind of functions

More information

S. Mrówka introduced a topological space ψ whose underlying set is the. natural numbers together with an infinite maximal almost disjoint family(madf)

S. Mrówka introduced a topological space ψ whose underlying set is the. natural numbers together with an infinite maximal almost disjoint family(madf) PAYNE, CATHERINE ANN, M.A. On ψ (κ, M) spaces with κ = ω 1. (2010) Directed by Dr. Jerry Vaughan. 30pp. S. Mrówka introduced a topological space ψ whose underlying set is the natural numbers together with

More information

Experimental designs for multiple responses with different models

Experimental designs for multiple responses with different models Graduate Theses and Dissertations Graduate College 2015 Experimental designs for multiple responses with different models Wilmina Mary Marget Iowa State University Follow this and additional works at:

More information

Wavelets on hierarchical trees

Wavelets on hierarchical trees University of Iowa Iowa Research Online Theses and Dissertations Fall 2016 Wavelets on hierarchical trees Lu Yu University of Iowa Copyright 2016 Lu Yu This dissertation is available at Iowa Research Online:

More information

Convex analysis and profit/cost/support functions

Convex analysis and profit/cost/support functions Division of the Humanities and Social Sciences Convex analysis and profit/cost/support functions KC Border October 2004 Revised January 2009 Let A be a subset of R m Convex analysts may give one of two

More information

Copulas. MOU Lili. December, 2014

Copulas. MOU Lili. December, 2014 Copulas MOU Lili December, 2014 Outline Preliminary Introduction Formal Definition Copula Functions Estimating the Parameters Example Conclusion and Discussion Preliminary MOU Lili SEKE Team 3/30 Probability

More information

Pairwise rank based likelihood for estimating the relationship between two homogeneous populations and their mixture proportion

Pairwise rank based likelihood for estimating the relationship between two homogeneous populations and their mixture proportion Pairwise rank based likelihood for estimating the relationship between two homogeneous populations and their mixture proportion Glenn Heller and Jing Qin Department of Epidemiology and Biostatistics Memorial

More information

STAT 512 sp 2018 Summary Sheet

STAT 512 sp 2018 Summary Sheet STAT 5 sp 08 Summary Sheet Karl B. Gregory Spring 08. Transformations of a random variable Let X be a rv with support X and let g be a function mapping X to Y with inverse mapping g (A = {x X : g(x A}

More information

Invariant measures and the compactness of the domain

Invariant measures and the compactness of the domain ANNALES POLONICI MATHEMATICI LXIX.(998) Invariant measures and the compactness of the domain by Marian Jab loński (Kraków) and Pawe l Góra (Montreal) Abstract. We consider piecewise monotonic and expanding

More information

Likelihood and Fairness in Multidimensional Item Response Theory

Likelihood and Fairness in Multidimensional Item Response Theory Likelihood and Fairness in Multidimensional Item Response Theory or What I Thought About On My Holidays Giles Hooker and Matthew Finkelman Cornell University, February 27, 2008 Item Response Theory Educational

More information

Supplementary Material for Nonparametric Operator-Regularized Covariance Function Estimation for Functional Data

Supplementary Material for Nonparametric Operator-Regularized Covariance Function Estimation for Functional Data Supplementary Material for Nonparametric Operator-Regularized Covariance Function Estimation for Functional Data Raymond K. W. Wong Department of Statistics, Texas A&M University Xiaoke Zhang Department

More information

Paradoxical Results in Multidimensional Item Response Theory

Paradoxical Results in Multidimensional Item Response Theory UNC, December 6, 2010 Paradoxical Results in Multidimensional Item Response Theory Giles Hooker and Matthew Finkelman UNC, December 6, 2010 1 / 49 Item Response Theory Educational Testing Traditional model

More information

Modular Monochromatic Colorings, Spectra and Frames in Graphs

Modular Monochromatic Colorings, Spectra and Frames in Graphs Western Michigan University ScholarWorks at WMU Dissertations Graduate College 12-2014 Modular Monochromatic Colorings, Spectra and Frames in Graphs Chira Lumduanhom Western Michigan University, chira@swu.ac.th

More information

NETWORK-REGULARIZED HIGH-DIMENSIONAL COX REGRESSION FOR ANALYSIS OF GENOMIC DATA

NETWORK-REGULARIZED HIGH-DIMENSIONAL COX REGRESSION FOR ANALYSIS OF GENOMIC DATA Statistica Sinica 213): Supplement NETWORK-REGULARIZED HIGH-DIMENSIONAL COX REGRESSION FOR ANALYSIS OF GENOMIC DATA Hokeun Sun 1, Wei Lin 2, Rui Feng 2 and Hongzhe Li 2 1 Columbia University and 2 University

More information

Verifying Regularity Conditions for Logit-Normal GLMM

Verifying Regularity Conditions for Logit-Normal GLMM Verifying Regularity Conditions for Logit-Normal GLMM Yun Ju Sung Charles J. Geyer January 10, 2006 In this note we verify the conditions of the theorems in Sung and Geyer (submitted) for the Logit-Normal

More information

Frailty Models and Copulas: Similarities and Differences

Frailty Models and Copulas: Similarities and Differences Frailty Models and Copulas: Similarities and Differences KLARA GOETHALS, PAUL JANSSEN & LUC DUCHATEAU Department of Physiology and Biometrics, Ghent University, Belgium; Center for Statistics, Hasselt

More information

Ef ciency considerations in the additive hazards model with current status data

Ef ciency considerations in the additive hazards model with current status data 367 Statistica Neerlandica (2001) Vol. 55, nr. 3, pp. 367±376 Ef ciency considerations in the additive hazards model with current status data D. Ghosh 1 Department of Biostatics, University of Michigan,

More information

Semiparametric Gaussian Copula Models: Progress and Problems

Semiparametric Gaussian Copula Models: Progress and Problems Semiparametric Gaussian Copula Models: Progress and Problems Jon A. Wellner University of Washington, Seattle 2015 IMS China, Kunming July 1-4, 2015 2015 IMS China Meeting, Kunming Based on joint work

More information

Durham E-Theses. Predictive Inference with Copulas for Bivariate Data MUHAMMAD, NORYANTI

Durham E-Theses. Predictive Inference with Copulas for Bivariate Data MUHAMMAD, NORYANTI Durham E-Theses Predictive Inference with Copulas for Bivariate Data MUHAMMAD, NORYANTI How to cite: MUHAMMAD, NORYANTI (2016) Predictive Inference with Copulas for Bivariate Data, Durham theses, Durham

More information

Regularization in Cox Frailty Models

Regularization in Cox Frailty Models Regularization in Cox Frailty Models Andreas Groll 1, Trevor Hastie 2, Gerhard Tutz 3 1 Ludwig-Maximilians-Universität Munich, Department of Mathematics, Theresienstraße 39, 80333 Munich, Germany 2 University

More information

Lebesgue measure and integration

Lebesgue measure and integration Chapter 4 Lebesgue measure and integration If you look back at what you have learned in your earlier mathematics courses, you will definitely recall a lot about area and volume from the simple formulas

More information

Machine Learning and Computational Statistics, Spring 2017 Homework 2: Lasso Regression

Machine Learning and Computational Statistics, Spring 2017 Homework 2: Lasso Regression Machine Learning and Computational Statistics, Spring 2017 Homework 2: Lasso Regression Due: Monday, February 13, 2017, at 10pm (Submit via Gradescope) Instructions: Your answers to the questions below,

More information

Nonparametric rank based estimation of bivariate densities given censored data conditional on marginal probabilities

Nonparametric rank based estimation of bivariate densities given censored data conditional on marginal probabilities Hutson Journal of Statistical Distributions and Applications (26 3:9 DOI.86/s4488-6-47-y RESEARCH Open Access Nonparametric rank based estimation of bivariate densities given censored data conditional

More information

Semiparametric Gaussian Copula Models: Progress and Problems

Semiparametric Gaussian Copula Models: Progress and Problems Semiparametric Gaussian Copula Models: Progress and Problems Jon A. Wellner University of Washington, Seattle European Meeting of Statisticians, Amsterdam July 6-10, 2015 EMS Meeting, Amsterdam Based on

More information

Maximum likelihood estimation of a log-concave density based on censored data

Maximum likelihood estimation of a log-concave density based on censored data Maximum likelihood estimation of a log-concave density based on censored data Dominic Schuhmacher Institute of Mathematical Statistics and Actuarial Science University of Bern Joint work with Lutz Dümbgen

More information