Bayesian Cure Rate Frailty Models with Application to a Root Canal Therapy Study

Size: px
Start display at page:

Download "Bayesian Cure Rate Frailty Models with Application to a Root Canal Therapy Study"

Transcription

1 Bometrcs 61, June 2005 DOI: /j x Bayesan Cure Rate Fralty Models wth Applcaton to a Root Canal Therapy Study Guosheng Yn Department of Bostatstcs and Appled Mathematcs, M. D. Anderson Cancer Center, The Unversty of Texas, Houston, Texas 77030, U.S.A. emal: gyn@odn.mdacc.tmc.edu Summary. Due to natural or artfcal clusterng, multvarate survval data often arse n bomedcal studes, for example, a dental study nvolvng multple teeth from each subject. A certan proporton of subjects n the populaton who are not expected to experence the event of nterest are consdered to be cured or nsusceptble. To model correlated or clustered falure tme data ncorporatng a survvng fracton, we propose two forms of cure rate fralty models. One model naturally ntroduces fralty based on bologcal consderatons whle the other s motvated from the Cox proportonal hazards fralty model. We formulate the lkelhood functons based on pecewse constant hazards and derve the full condtonal dstrbutons for Gbbs samplng n the Bayesan paradgm. As opposed to the Cox fralty model, the proposed methods demonstrate great potental n modelng multvarate survval data wth a cure fracton. We llustrate the cure rate fralty models wth a root canal therapy data set. Key words: Bayesan nference; Cure fracton; Fralty model; Gbbs samplng; Multvarate falure tme data; Proportonal hazards. 1. Introducton As possbly the last endeavor to save a natural tooth before extracton, root canal therapy (RCT) corrects the dsorder and dysfuncton of the dental pulp when the cares or restoratons are deep. RCT usually nvolves removng the tooth crown and the affected pulpal tssue, followed by cleanng the surroundng nfected area to provde a healthy and bondable surface for a permanent fller. After fllng, a crown s fabrcated to complete the procedure. Many root-canal-flled (RCF) teeth last for a lfetme whle some may be lost shortly after completon of endodontc therapy. Non-RCF teeth can be lost due to nonrestorable cares, advanced alveolar bone loss, or catastrophc fracture. Besdes the above reasons, RCF teeth can be lost due to endodontc mshaps (e.g., perforaton) or postendodontc restoratons (e.g., vertcal root fracture from ntracanal posts). To quantfy the degree to whch endodontc nvolvement affects tooth survval, a retrospectve RCT study was conducted n the School of Dentstry at the Unversty of North Carolna at Chapel Hll (Caplan et al., 2005). Usng databases of the Kaser Permanente Northwest Dvson Dental Care Program, and followng the crtera of the study desgn, 202 elgble patents were dentfed. Each patent contrbuted data from one RCF tooth and one smlar non-rcf tooth. If the contralateral tooth was present, t was selected as the matchng non-rcf tooth. If that tooth was mssng or was already endodontcally treated, a tooth of the same type (.e., anteror, premolar, or molar) adjacent to the contralateral tooth was selected. Follow-up for both the RCF and non-rcf teeth started on the ndex date and contnued through the date of extracton or the end of the study, whchever came frst. Tme to extracton was the outcome of nterest whch could be censored by the study termnaton or by the RCT. From a dental scentfc perspectve, many RCF teeth are consdered sound and wll last for a lfetme after successful endodontcal treatment, and thus can be vewed as cured. If a sgnfcant number of patents are cured and thus rsk-free of the dsease of nterest, the populaton s then a mxture of susceptble and nsusceptble subjects. In these cases, the Cox (1972) proportonal hazards model may not be approprate, because t nherently assumes that all the subjects have the same susceptblty to the dsease and wll eventually experence the event over a suffcently long perod of follow-up. Cure rate models are ntended to model falure tme data wth a survvng fracton, whch becomes ncreasngly mportant and popular n clncal trals and medcal research, especally n varous types of oncology studes, such as breast cancer, leukema, and melanoma. et S (t Z )beaproper survval functon (.e., lm t S (t Z )=0), where Z and Z may share common components and the frst component of Z s 1. The mxture cure model proposed by Berkson and Gage (1952) assumes that a certan probablty θ(z )ofbeng cured s mxed wth the remanng 1 θ(z )ofnot beng cured, 552 S pop ( t Z, Z ) = θ(z )+{1 θ(z )}S ( t Z ), (1) where S pop (t Z, Z ) s the populaton survval functon and S (t Z ) s that for the uncured subjects. The cure fracton n model (1) s lm t S pop (t Z, Z ) = θ(z ). A logstc regresson structure s usually assumed so that

2 Bayesan Cure Rate Fralty Models 553 θ(z )=exp(β Z )/{1 +exp(β Z )}, where β s the parameter vector of nterest ncludng an ntercept. The mxture cure model (1) has been extensvely studed n the lterature, ncludng Gray and Tsats (1989), Kuk and Chen (1992), Taylor (1995), Maller and Zhou (1996), Sy and Taylor (2000), Peng and Dear (2000), and Betensky and Schoenfeld (2001), among others. Although (1) s ntutvely attractve and wdely used, t does not have a proportonal hazards structure n the presence of covarates, whch s an undesrable feature when dong covarate analyss. In the Bayesan computaton, f β takes an mproper unform pror,.e., π(β) 1, the posteror dstrbuton based on (1) s mproper (Chen, Ibrahm, and Snha, 1999). In partcular, when modelng the heterogenety n the populaton usng fralty models, t would be more appealng and convenent to employ the proportonal hazards modelng scheme. An alternatve defnton of the unvarate cure rate model, whch has been nvestgated by Yakovlev and Tsodkov (1996), Tsodkov (1998), Chen et al. (1999), and Ibrahm, Chen, and Snha (2001a) among others, s gven by S pop (t Z )=exp{ θ(z )F (t)}, (2) where F(t) saproper cumulatve dstrbuton functon. The correspondng cure rate n model (2) s lm t S pop (t Z )= exp{ θ(z )}. For subject, θ(z )slnked to the covarates va the exponental relaton θ(z )=exp(β Z ). For a more detaled dscusson of (2), see Chapter 5 of the book by Ibrahm, Chen, and Snha (2001b). The aforementoned methods are based on a crtcal assumpton that the survval tmes are ndependent. However, we often encounter multvarate falure tme data n bomedcal research where the correlaton may be nduced by natural or artfcal clusterng effects. In famly studes of genetc dseases, ltter-matched mce experments, or ophthalmologc research, observatons n the same cluster or group may be correlated. The underlyng correlaton needs to be carefully adjusted to ensure vald estmaton and nference. In the RCT example, each subject contrbuted a par of observatons from RCF and non-rcf teeth. Clearly, data collected on two teeth from the same ndvdual cannot be assumed to be ndependent. Extensve research has been carred out for multvarate falure tme data. The fralty model and the margnal model are the most popular approaches. Focusng on subject-specfc effects, the fralty model explctly formulates the nature of the underlyng dependence structure. The margnal model takes a populaton-average approach to model the margnal mean whle treatng the correlaton as a nusance. In ths artcle, we concentrate on the fralty model whch accommodates the ntraclass correlaton through an unobservable random effect, or a fralty. For the lth subject n the th cluster, wth a covarate vector Z l ( =1,..., n; l =1,..., ), the usual Cox shared fralty model s gven by λ(t Z l,w )=λ(t)w exp(β Z l ), (3) where λ(t) sthe unknown and unspecfed baselne hazard functon and W s the unobservable fralty nduced by the th cluster. Condtonal on W, the falure tmes n cluster are assumed to be ndependent. The most studed parametrc assumpton s that the W s are ndependent and dentcally dstrbuted (..d.) from a gamma densty wth mean 1 (Clayton, 1978). Another popular dstrbuton for W s the postve stable dstrbuton (Hougaard, 2000), whch preserves the proportonal hazards structure uncondtonally (after ntegratng W out). Semparametrc Bayesan methods for multvarate falure tme data have been proposed n varous contexts (Clayton, 1991; Snha, 1993; Sargent, 1998; Qou, Ravshanker, and Dey, 1999; among others). However, lmted attenton has been pad to the research n multvarate cure rate models. In the frequentst framework, Chatterjee and Shh (2001) proposed a margnal approach usng bvarate Copula models. Prce and Manatunga (2001) mposed fralty to account for correlaton and conducted the maxmum lkelhood estmaton under a parametrc model assumpton. Both methods were based on the mxture cure model (1). In the Bayesan paradgm, Chen, Ibrahm, and Snha (2002) generalzed the work of Chen et al. (1999) to bvarate falure tme data by ntroducng a postve stable fralty, where an llustratve example was gven for smultaneously modelng two dstnct events,.e., tme to cancer relapse and tme to death. A recent revew paper by Tsodkov, Ibrahm, and Yakovlev (2003) gves a comprehensve treatment and dscusson of the development of the cure rate model (2). Motvated by the RCT study, n whch cure s apparently a possblty and correlaton naturally arses from the pared teeth of the same subject, we propose two new cure rate fralty models for multvarate falure tme data wth a survvng fracton. The proposed methods are closely related to the unvarate cure rate model (2), because t mght not be ntutvely convenent to ncorporate fralty to model (1), and the nterpretaton could be potentally cumbersome. We formulate the model n a Bayesan framework, so that the pror nformaton can be easly ncorporated through hstorcal data. The rest of ths artcle s organzed as follows. In Secton 2, we motvate one form of a cure rate fralty model from a clonogenc tumor cell example, and propose the other by mmckng the Cox fralty model. In Secton 3, we derve the lkelhood functons for the proposed cure fralty models wthn the Bayesan paradgm, and obtan the full condtonal dstrbutons based on sutable pror dstrbutons. In Secton 4, we propose a model selecton technque usng the devance nformaton crteron (DIC) proposed by Spegelhalter et al. (2002). We llustrate the proposed methods wth the RCT example n Secton 5, and provde concludng remarks n Secton Cure Rate Fralty Models We ntroduce a cure rate fralty model that s motvated by the followng clonogenc tumor cell example. For the th ndvdual n the populaton, let N be the number of tumor cells that have the potental of metastaszng,.e., N s the number of metastass-competent tumor cells. Assume that N has a Posson dstrbuton wth mean θ(z ). Gven N = K, let (X 1,..., X K )bethe promoton tmes for all the K tumor cells n the th subject. That s X k (k =1,..., K) sthe tme for the kth metastass-competent tumor cell n subject to produce a detectable tumor mass. Because the N cells belong to the same subject, we assume a random effect W for subject to account for the wthn-subject correlaton among

3 554 Bometrcs, June 2005 the X k s. Condtonal on N and W,weassume that the X k s are..d. from F(t). Here, we emphasze that both N and (X 1,...,X N ) are unobservable random varables. The tme to cancer relapse for the th subject, whch s observed, s defned as T = mn(x 1,...,X N ). Therefore, the populaton survval functon for the th subject s gven by S pop (t Z,W ) =Pr(N =0)+ =Pr(N =0)+ = exp{ θ(z )} + Pr(T >t N = K, W )Pr(N = K) K=1 Pr(X 1 >t,...,x K >t N K=1 = K, W )Pr(N = K) exp{ W Λ(t)K} K=1 θ(z ) K exp{ θ(z )} K! = exp θ(z )+θ(z ) exp{ W Λ(t)}], (4) where Λ(t) s the common cumulatve hazard functon of the X k s. Note that λ(t) =dλ(t)/dt, then the populaton hazard functon of (4) s λ pop (t Z,W )=θ(z )W λ(t) exp{ W Λ(t)}. (5) Model (5), referred to as the promoton tme cure rate fralty model, s a generalzaton of the work of Chen et al. (1999), whch dd not consder the heterogenety of metastaszng tumor cells from dfferent subjects. The formulaton of model (5) s substantally dfferent from that of Chen et al. (2002) where they ncorporated the fralty through the Posson means to model parallel dstnct types of falures. In contrast, we focus on the tme to the same type of event whle correlaton arses from the clusterng effects. We assume W n (5) to be a random varable from a gamma dstrbuton wth mean 1 and varance η 1,.e., W Ga(η, η). In the lmt η,model (5) reduces to the unvarate cure rate model (2). Asde from the bologcal motvaton, model (5) s sutable for correlated falure tme data wth a cure fracton n a wde varety of contexts. Thus, clustered survval data wth a cure fracton, whch may not ft the defnton of a metastaszng tumor cell gven above, can stll be modeled by (5). For ease of exposton, we formulate the cure rate fralty models n the followng setup. Suppose that there are n clusters, and wthn cluster, there are subjects. For = 1,..., n, and l =1,...,, let T l be the falure tme for the lth member n the th cluster, C l be the censorng varable, and Y l = mn(t l, C l )bethe observed tme. Defne the censorng ndcator ν l = I(T l C l ), where I( ) sthe ndcator functon. et Z l be the (p +1) 1vector of bounded covarates, where the frst component of Z l s 1 correspondng to the ntercept. Falure tmes are assumed to be ndependent of censorng tmes condtonal on Z l. Wthn cluster, {(T l, C l, Z l ), l =1,..., } may be dependent but exchangeable. We frst propose a promoton tme cure rate fralty model, of whch the populaton hazard s λ (1) pop(t Z l,w )=λ(t)w exp{ Λ(t)W } exp(β Z l ). (6) As an alternatve, we then ntroduce a dfferent form of cure rate fralty model that s analogous to the Cox fralty model (3). Model (2) can be rewrtten as λ pop (t Z ) = f(t) exp(β Z ), where f(t) s an unknown baselne densty functon. Hence, we propose the followng cure gamma fralty model, λ (2) pop(t Z l,w )=f(t)w exp(β Z l ). (7) Both cure rate fralty models (6) and (7) are constructed to take the wthn-cluster correlaton nto consderaton, whch apparently reduces to the unvarate case f W kelhoods and Full Condtonals We assume a pecewse exponental dstrbuton for the baselne hazard functon λ(t). The pecewse exponental model s useful and smple for modelng survval data, whch serves as a benchmark for comparsons wth other semparametrc and fully parametrc models. The lkelhood functon s constructed as follows. et J be the fnte number of parttons of the tme axs,.e., 0 <s 1 < <s J, wth s J >y l for =1,..., n; l =1,...,.Thus, we have J ntervals, (0, s 1 ], (s 1, s 2 ],...,(s J 1, s J ], where each nterval contans at least one falure and a reasonable way to allocate the data s to balance the number of events among ntervals. The pecewse exponental model assumes that λ(y) =λ j for y (s, s j ], j =1,..., J. Defne δ lj =1fthe lth subject n the th cluster fals or s censored n the jth nterval, and 0 otherwse. When J = 1, namely wth no partton, the baselne hazard reduces to that of an exponental dstrbuton wth λ(t) λ 1.Byncreasng J, wewould obtan fner parttons of the tme scale such that a more flexble structure of the underlyng baselne hazard can be captured. et D denote the observed data, W = (W 1,..., W n ) and λ = (λ 1,..., λ J ). The random effects W ( =1,..., n) are usually assumed to follow a gamma dstrbuton, W Ga(η, η), wth mean 1 and varance η 1.Thus, the condtonal lkelhood functon concernng model (6) s gven by (1) (β, λ W,D)= n (β, λ W,D), where =1 (1) (1) (β, λ W,D) = λ(yl )W exp{ Λ(y l )W } exp(β Z l ) ] ν l { 1 exp{ Λ(y l )W }] exp(β Z l )} { { J = λ j W exp λ j (y l s ) j=1 } ] + λ q (s q s q 1 ) W exp(β Z l ) ( { δlj 1 exp { λ j (y l s ) } νl δ lj } ]) } + λ q (s q s q 1 ) W exp(β Z l ).

4 Bayesan Cure Rate Fralty Models 555 Smlarly, for model (7), (2) (β, λ W,D)= n λ W,D), where (2) (β, λ W,D) = {f(y l )W exp(β Z l )} ν l { F (y l )W exp(β Z l )} { J = λ j exp λ j (y l s ) j=1 } λ q (s q s q 1 ) W exp(β Z l ) { δlj 1 exp { λ j (y l s ) =1 (2) ] νl δ lj (β, }] } λ q (s q s q 1 ) W exp(β Z l ). We take nonnformatve prors for all the parameters such that the lkelhood functons domnate the posteror dstrbutons. Wthout loss of generalty, we assume that β and λ are ndependent, and ther components are ndependent, a pror. Specfcally, we take β k N(µ, σ 2 ) for k =0,1,..., p, and λ j Ga(α, γ) for j =1,..., J. Furthermore, we take W Ga(η, η) and assume that η Ga(a, b), where the hyperparameters a and b are chosen to yeld a large pror varance for W. et U V ] denote the posteror dstrbuton of U gven V. For m =1,2;k =0,1,..., p; j =1,..., J; and =1,..., n, the full condtonal dstrbutons of the parameters are gven as follows: βk β ( k), λ, W,D ] (m) (β, λ W,D)π(β k ), λj β, λ ( j), W,D ] (m) (β, λ W,D)π(λ j ), W β, λ, W ( ),η,d ] (m) (β, λ W,D)W η 1 exp( ηw ), ( n ) η 1 { ( n )} η nη+a 1 W exp η W + b η W,D] =1 =1 {Γ(η)} n, where β ( k) s the rest of β after deletng the kth component, λ ( j) and W ( ) are defned smlarly, and π(β k ) and π(λ j ) are the pror denstes. In partcular for model (7),.e., m =2, due to the conjugate property, the complete condtonal dstrbuton of W has the closed form of ( { J Ga η + ν l,η+ δ lj 1 exp λ j (y l s ) j=1 }] ) λ q (s q s q 1 ) exp(β Z l ). 4. Model Adequacy Evaluaton An mportant part of selectng regresson models s evaluatng the adequacy of the model ft. It s crtcal to compare several competng models for a gven data set and select the one that best fts the data. The DIC, recently proposed by Spegelhalter et al. (2002), s a Bayesan model selecton crteron, whch s gven by DIC = Dev(β, λ)+p Dev. The term p Dev s a penalty term for model complexty, whch s reflected by the effectve number of parameters n the model. The devance s obtaned from the condtonal lkelhood,.e., Dev(β, λ) = 2 log (β, λ W,D), Dev(β, λ) sthe posteror mean of Dev(β, λ),p Dev = Dev(β, λ) Dev( β, λ) and thus DIC = 2Dev(β, λ) Dev( β, λ), where β and λ are the posteror means of β and λ, respectvely. Wth nonnformatve prors, DIC s approxmately equvalent to the Akake nformaton crteron (AIC) proposed by Akake (1973). Specfcally, for the proposed cure fralty models, m =1and 2, DIC m = 4 G G log (m)( β g], λ g] Wg],D ) g=1 +2log (m) ( β, λ W,D), where β g], λ g], and W g] are the correspondng posteror samples of the gth Gbbs teraton, W s the posteror mean, and G s the number of Gbbs teratons after burn-n. The smaller the DIC value, the better the model fts. 5. Example As an llustraton, we appled the proposed methods to the RCT data. In ths analyss, we had three covarates: the RCF tooth ndcator, tooth type, and pocket varables. We combned the anteror and premolar teeth together (nonmolar), because there were relatvely fewer anteror teeth. Among 404 teeth n the data set, there were 176 molars and 228 nonmolars (64 anterors and 164 premolars). Pocket depths had been recorded at sx stes for each tooth. If at least one of the sx perodontal pockets was 5 mm, a bnary varable took a value of 1 (31%), and was otherwse 0 (69%). Fgure 1 shows the Kaplan Meer survval curves for four groups stratfed by the root canal treatment and tooth type. The approprateness of the applcaton of cure rate models needs to be examned cautously. The survval curves level off and show plateaus at the tal parts, whch suggests a possblty of cure. The length of the follow-up was 2916 days, and the last event occurred at the 2678th day for the RCF group and the 2662th day for the non-rcf group. There were 56 teeth censored between the last event and the end of the study for each group. A general gudelne for the proper usage of cure rate models s to have a suffcent perod of follow-up, and a strong bologcal justfcaton for the cure or mmune possblty, as n the RCT study. We ncorporated a gamma fralty W to account for the correlated observatons from the pared teeth of the same patent. We took β =(β 0, β 1, β 2, β 3 ) and λ =(λ 1,..., λ J ) to

5 556 Bometrcs, June 2005 Survval Probablty Non-RCF Non-Molar Non-RCF Molar RCF Non-Molar RCF Molar Tme to Extracton (days) Fgure 1. Kaplan Meer curves stratfed by the root canal treatment and tooth type. be ndependent, a pror, and gave them nonnformatve pror dstrbutons, for example, β k N(0, 100) for k =0,1,2,3, and λ j Ga(α, γ) wth α =2and γ = 0.1, and ndependent for j =1,..., J. For the fralty, we took W Ga(η, η), where η Ga(a, b) and we specfed a =2and b = 100. We chose prors n such a way that the lkelhood functons clearly domnated the posteror dstrbutons. We took the parttons of the tme scale, J =1upto J =5.Alarger J gves more flexblty to model the baselne hazard, whereas at the same tme t brngs n more unknown parameters (the λ j s) that need to be estmated. For comparson, we also appled the Cox shared gamma fralty model (3) to the data, for whch the correspondng condtonal lkelhood based on the pecewse constant hazards s Cox (β, λ W,D)= n (β, λ W,D), where =1 Cox Cox (β, λ W,D) = {λ(y l )W exp(β Z l )} ν l yl j=1 { δlj 0 {λ(t)w exp(β Z l )} dt J = {λ j W exp(β Z l )} δ lj ν l λ j (y l s ) } ] + λ q (s q s q 1 ) W exp(β Z l ). ] We ran 30,000 Gbbs samples for each Markov Chan Monte Carlo (MCMC) chan and recorded a sample every fve teratons, after 3000 burn-ns. The chans appeared to mx well and the convergence could usually be acheved after 500 teratons. We used the dagnostc methods recommended by Cowles and Carln (1996) to montor the chans. Table 1 shows the DICs wth respect to three competng models and fve dfferent J s. The DIC statstcs clearly ndcate that the promoton tme fralty model wth J =3s the best fttng one, wth the smallest DIC = Table 2 summarzes the analyss of the RCT data under the Cox gamma fralty model and the proposed cure rate fralty models, usng J = 1 and 3, respectvely. We present the posteror mean, standard devaton, and 95% hghest posteror densty (HPD) nterval for each parameter. The three dfferent models consstently show that the root canal treatment sgnfcantly reduced tooth survval, whereas the tooth type and pocket varables were not mportant factors. We carred out senstvty analyses on the pror dstrbutons by varyng the hyperparameters (σ and γ). The results n Table 3 demonstrate that the posteror estmaton s very robust wth respect to a wde range of prors. Table 1 Model selecton crteron based on DIC wth respect to J, for the RCT data Fralty model Cox gamma Promoton tme Cure gamma J

6 Bayesan Cure Rate Fralty Models 557 Table 2 The posteror mean, standard devaton, and 95% HPD nterval for the RCT data J Fralty model Covarate Mean Std. Dev. 95% HPD nterval 1 Cox gamma Root (1.7106, ) Molar ( , ) Pocket ( , ) Fralty (η) (0.2192, ) Promoton tme Intercept ( , ) Root (1.7108, ) Molar ( , ) Pocket ( , ) Fralty (η) (0.2068, ) Cure gamma Intercept ( , ) Root (1.7791, ) Molar ( , ) Pocket ( , ) Fralty (η) (0.2137, ) 3 Cox gamma Root (1.8133, ) Molar ( , ) Pocket ( , ) Fralty (η) (0.2034, ) Promoton tme Intercept ( , ) Root (1.5256, ) Molar ( , ) Pocket ( , ) Fralty (η) (0.1952, ) Cure gamma Intercept ( , ) Root (1.9098, ) Molar ( , ) Pocket ( , ) Fralty (η) (0.2154, ) Table 3 Senstvty analyss wth dfferent prors, based on the promoton tme fralty model wth J =3 Std. 95% HPD σ γ Covarate Mean Dev. nterval Intercept ( , ) Root (1.4973, ) Molar ( , ) Pocket ( , ) Fralty (η) (0.1890, ) Intercept ( , ) Root (1.5857, ) Molar ( , ) Pocket ( , ) Fralty (η) (0.1947, ) Intercept ( , ) Root (1.5463, ) Molar ( , ) Pocket ( , ) Fralty (η) (0.1809, ) Intercept ( , ) Root (1.4965, ) Molar ( , ) Pocket ( , ) Fralty (η) (0.2064, ) 6. Remarks We have proposed cure rate fralty models for multvarate falure tme data ncorporatng a survval fracton n the Bayesan paradgm. The proposed methods have attractve features n model formulaton and Bayesan computaton. Model (6) has a strong bologcal motvaton whle model (7) s statstcally and computatonally desrable. It s partcularly appealng that the full condtonal dstrbuton of W under model (7) has a closed form due to conjugacy. Both cure fralty models reduce to the same unvarate case (2) when all the observatons are ndependent. The exstence of nsusceptble or mmune ndvduals n the populaton s the key condton for the applcablty of cure rate models. In such stuatons, censored data are a mxture of cured subjects and uncured subjects who are censored due to ncomplete follow-up. For..d. survval data, Maller and Zhou (1995) have nvestgated the Kaplan Meer estmator of the cumulatve rsk functon for testng for suffcent follow-up and a heterogeneous populaton. The development of a correspondng formal test wth correlated falure tme data s mportant and requres further nvestgaton. Acknowledgements We would lke to thank the edtor, an assocate edtor, and two referees for ther constructve comments and suggestons, whch led to substantal mprovement of the artcle. We thank Dongln Zeng and Joseph Ibrahm for helpful dscussons, and Danel Caplan n the School of Dentstry at the Unversty of North Carolna at Chapel Hll, for provdng the RCT data. References Akake, H. (1973). Informaton theory and an extenson of the maxmum lkelhood prncple. In Internatonal

7 558 Bometrcs, June 2005 Symposum on Informaton Theory, B.N.Petrov and F. Csak (eds), Budapest: Akadema Kado. Berkson, J. and Gage, R. P. (1952). Survval curve for cancer patents followng treatment. Journal of the Amercan Statstcal Assocaton 47, Betensky, R. A. and Schoenfeld, D. A. (2001). Nonparametrc estmaton n a cure model wth random cure tmes. Bometrcs 57, Caplan, J. D., Ca, J., Yn, G., and Whte, B. A. (2005). Root canal flled versus non-root canal flled teeth: A retrospectve comparson of survval tmes. Journal of Publc Health Dentstry, npress. Chatterjee, N. and Shh, J. (2001). A bvarate cure-mxture approach for modelng famlal assocaton n dseases. Bometrcs 57, Chen, M. H., Ibrahm, J. G., and Snha, D. (1999). A new Bayesan model for survval data wth a survvng fracton. Journal of the Amercan Statstcal Assocaton 94, Chen, M. H., Ibrahm, J. G., and Snha, D. (2002). Bayesan nference for multvarate survval data wth a cure fracton. Journal of Multvarate Analyss 80, Clayton, D. G. (1978). A model for assocaton n bvarate lfe tables and ts applcaton n epdemologcal studes of famlar tendency n chronc dsease ncdence. Bometrka 65, Clayton, D. G. (1991). A Monte Carlo method for Bayesan nference n fralty models. Bometrcs 47, Cowles, M. K. and Carln, B. P. (1996). Markov chan Monte Carlo convergence dagnostcs: A comparatve revew. Journal of the Amercan Statstcal Assocaton 91, Cox, D. R. (1972). Regresson models and lfe-tables (wth dscusson). Journal of the Royal Statstcal Socety, Seres B 34, Gray, R. J. and Tsats, A. A. (1989). A lnear rank test for use when the man nterest s n dfferences n cure rates. Bometrcs 45, Hougaard, P. (2000). Analyss of Multvarate Survval Data. New York: Sprnger. Ibrahm, J. G., Chen, M. H., and Snha, D. (2001a). Bayesan semparametrc models for survval data wth a cure fracton. Bometrcs 57, Ibrahm, J. G., Chen, M. H., and Snha, D. (2001b). Bayesan Survval Analyss. New York: Sprnger. Kuk, A. Y. C. and Chen, C. H. (1992). A mxture model combnng logstc regresson wth proportonal hazards regresson. Bometrka 79, Maller, R. A. and Zhou, S. (1995). Testng for the presence of mmune or cured ndvduals n censored survval data. Bometrcs 51, Maller, R. A. and Zhou, X. (1996). Survval Analyss wth ong-term Survvors. New York: Wley. Peng, Y. and Dear, K. B. G. (2000). A nonparametrc mxture model for cure rate estmaton. Bometrcs 56, Prce, D.. and Manatunga, A. K. (2001). Modellng survval data wth a cured fracton usng fralty models. Statstcs n Medcne 20, Qou, Z., Ravshanker, N., and Dey, D. K. (1999). Multvarate survval analyss wth postve stable fraltes. Bometrcs 55, Sargent, D. J. (1998). A general framework for random effects survval analyss n the Cox proportonal hazards settng. Bometrcs 54, Snha, D. (1993). Semparametrc Bayesan analyss of multple event tme data. Journal of the Amercan Statstcal Assocaton 88, Spegelhalter, D. J., Best, N. G., Carln, B. P., and van der nde, A. (2002). Bayesan measures of model complexty and ft. Journal of the Royal Statstcal Socety, Seres B 64, Sy, J. P. and Taylor, J. M. G. (2000). Estmaton n a Cox proportonal hazards cure model. Bometrcs 56, Taylor, J. M. G. (1995). Sem-parametrc estmaton n falure tme mxture models. Bometrcs 51, Tsodkov, A. (1998). A proportonal hazards model takng account of long-term survvors. Bometrcs 54, Tsodkov, A., Ibrahm, J. G., and Yakovlev, A. Y. (2003). Estmatng cure rates from survval data: An alternatve to two-component mxture models. Journal of the Amercan Statstcal Assocaton 98, Yakovlev, A. Y. and Tsodkov, A. D. (1996). Stochastc Models of Tumor atency and Ther Bostatstcal Applcatons. Hackensack, New Jersey: World Scentfc. Receved March Revsed June Accepted August 2004.

Non-Mixture Cure Model for Interval Censored Data: Simulation Study ABSTRACT

Non-Mixture Cure Model for Interval Censored Data: Simulation Study ABSTRACT Malaysan Journal of Mathematcal Scences 8(S): 37-44 (2014) Specal Issue: Internatonal Conference on Mathematcal Scences and Statstcs 2013 (ICMSS2013) MALAYSIAN JOURNAL OF MATHEMATICAL SCIENCES Journal

More information

Parametric fractional imputation for missing data analysis. Jae Kwang Kim Survey Working Group Seminar March 29, 2010

Parametric fractional imputation for missing data analysis. Jae Kwang Kim Survey Working Group Seminar March 29, 2010 Parametrc fractonal mputaton for mssng data analyss Jae Kwang Km Survey Workng Group Semnar March 29, 2010 1 Outlne Introducton Proposed method Fractonal mputaton Approxmaton Varance estmaton Multple mputaton

More information

Chapter 20 Duration Analysis

Chapter 20 Duration Analysis Chapter 20 Duraton Analyss Duraton: tme elapsed untl a certan event occurs (weeks unemployed, months spent on welfare). Survval analyss: duraton of nterest s survval tme of a subject, begn n an ntal state

More information

Computation of Higher Order Moments from Two Multinomial Overdispersion Likelihood Models

Computation of Higher Order Moments from Two Multinomial Overdispersion Likelihood Models Computaton of Hgher Order Moments from Two Multnomal Overdsperson Lkelhood Models BY J. T. NEWCOMER, N. K. NEERCHAL Department of Mathematcs and Statstcs, Unversty of Maryland, Baltmore County, Baltmore,

More information

Simulated Power of the Discrete Cramér-von Mises Goodness-of-Fit Tests

Simulated Power of the Discrete Cramér-von Mises Goodness-of-Fit Tests Smulated of the Cramér-von Mses Goodness-of-Ft Tests Steele, M., Chaselng, J. and 3 Hurst, C. School of Mathematcal and Physcal Scences, James Cook Unversty, Australan School of Envronmental Studes, Grffth

More information

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4)

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4) I. Classcal Assumptons Econ7 Appled Econometrcs Topc 3: Classcal Model (Studenmund, Chapter 4) We have defned OLS and studed some algebrac propertes of OLS. In ths topc we wll study statstcal propertes

More information

Comparison of Regression Lines

Comparison of Regression Lines STATGRAPHICS Rev. 9/13/2013 Comparson of Regresson Lnes Summary... 1 Data Input... 3 Analyss Summary... 4 Plot of Ftted Model... 6 Condtonal Sums of Squares... 6 Analyss Optons... 7 Forecasts... 8 Confdence

More information

How its computed. y outcome data λ parameters hyperparameters. where P denotes the Laplace approximation. k i k k. Andrew B Lawson 2013

How its computed. y outcome data λ parameters hyperparameters. where P denotes the Laplace approximation. k i k k. Andrew B Lawson 2013 Andrew Lawson MUSC INLA INLA s a relatvely new tool that can be used to approxmate posteror dstrbutons n Bayesan models INLA stands for ntegrated Nested Laplace Approxmaton The approxmaton has been known

More information

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X Statstcs 1: Probablty Theory II 37 3 EPECTATION OF SEVERAL RANDOM VARIABLES As n Probablty Theory I, the nterest n most stuatons les not on the actual dstrbuton of a random vector, but rather on a number

More information

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition)

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition) Count Data Models See Book Chapter 11 2 nd Edton (Chapter 10 1 st Edton) Count data consst of non-negatve nteger values Examples: number of drver route changes per week, the number of trp departure changes

More information

Spatial Frailty Survival Model for Infection of Tuberculosis among HIV Infected Individuals

Spatial Frailty Survival Model for Infection of Tuberculosis among HIV Infected Individuals SSRG Internatonal Journal of Medcal Scence (SSRG-IJMS) Volume 4 Issue 2 February 217 Spatal Fralty Survval Model for Infecton of uberculoss among HIV Infected Indvduals Srnvasan R 1, Ponnuraja C 1, Rajendran

More information

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction ECONOMICS 5* -- NOTE (Summary) ECON 5* -- NOTE The Multple Classcal Lnear Regresson Model (CLRM): Specfcaton and Assumptons. Introducton CLRM stands for the Classcal Lnear Regresson Model. The CLRM s also

More information

Comparison of the Population Variance Estimators. of 2-Parameter Exponential Distribution Based on. Multiple Criteria Decision Making Method

Comparison of the Population Variance Estimators. of 2-Parameter Exponential Distribution Based on. Multiple Criteria Decision Making Method Appled Mathematcal Scences, Vol. 7, 0, no. 47, 07-0 HIARI Ltd, www.m-hkar.com Comparson of the Populaton Varance Estmators of -Parameter Exponental Dstrbuton Based on Multple Crtera Decson Makng Method

More information

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers Psychology 282 Lecture #24 Outlne Regresson Dagnostcs: Outlers In an earler lecture we studed the statstcal assumptons underlyng the regresson model, ncludng the followng ponts: Formal statement of assumptons.

More information

Chapter 13: Multiple Regression

Chapter 13: Multiple Regression Chapter 13: Multple Regresson 13.1 Developng the multple-regresson Model The general model can be descrbed as: It smplfes for two ndependent varables: The sample ft parameter b 0, b 1, and b are used to

More information

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands Content. Inference on Regresson Parameters a. Fndng Mean, s.d and covarance amongst estmates.. Confdence Intervals and Workng Hotellng Bands 3. Cochran s Theorem 4. General Lnear Testng 5. Measures of

More information

Bayesian predictive Configural Frequency Analysis

Bayesian predictive Configural Frequency Analysis Psychologcal Test and Assessment Modelng, Volume 54, 2012 (3), 285-292 Bayesan predctve Confgural Frequency Analyss Eduardo Gutérrez-Peña 1 Abstract Confgural Frequency Analyss s a method for cell-wse

More information

Time-Varying Coefficient Model with Linear Smoothing Function for Longitudinal Data in Clinical Trial

Time-Varying Coefficient Model with Linear Smoothing Function for Longitudinal Data in Clinical Trial Tme-Varyng Coeffcent Model wth Lnear Smoothng Functon for Longtudnal Data n Clncal Tral Masanor Ito, Toshhro Msum and Hdek Hrooka Bostatstcs Group, Data Scence Dept., Astellas Pharma Inc. Introducton In

More information

BOOTSTRAP METHOD FOR TESTING OF EQUALITY OF SEVERAL MEANS. M. Krishna Reddy, B. Naveen Kumar and Y. Ramu

BOOTSTRAP METHOD FOR TESTING OF EQUALITY OF SEVERAL MEANS. M. Krishna Reddy, B. Naveen Kumar and Y. Ramu BOOTSTRAP METHOD FOR TESTING OF EQUALITY OF SEVERAL MEANS M. Krshna Reddy, B. Naveen Kumar and Y. Ramu Department of Statstcs, Osmana Unversty, Hyderabad -500 007, Inda. nanbyrozu@gmal.com, ramu0@gmal.com

More information

x = , so that calculated

x = , so that calculated Stat 4, secton Sngle Factor ANOVA notes by Tm Plachowsk n chapter 8 we conducted hypothess tests n whch we compared a sngle sample s mean or proporton to some hypotheszed value Chapter 9 expanded ths to

More information

A joint frailty-copula model between disease progression and death for meta-analysis

A joint frailty-copula model between disease progression and death for meta-analysis CSA-KSS-JSS Specal Invted Sessons 4 / / 6 A jont fralty-copula model between dsease progresson and death for meta-analyss 3/5/7 Takesh Emura Graduate Insttute of Statstcs Natonal Central Unversty TAIWAN

More information

Chapter 8 Indicator Variables

Chapter 8 Indicator Variables Chapter 8 Indcator Varables In general, e explanatory varables n any regresson analyss are assumed to be quanttatve n nature. For example, e varables lke temperature, dstance, age etc. are quanttatve n

More information

On an Extension of Stochastic Approximation EM Algorithm for Incomplete Data Problems. Vahid Tadayon 1

On an Extension of Stochastic Approximation EM Algorithm for Incomplete Data Problems. Vahid Tadayon 1 On an Extenson of Stochastc Approxmaton EM Algorthm for Incomplete Data Problems Vahd Tadayon Abstract: The Stochastc Approxmaton EM (SAEM algorthm, a varant stochastc approxmaton of EM, s a versatle tool

More information

A Bayesian Approach to Arrival Rate Forecasting for Inhomogeneous Poisson Processes for Mobile Calls

A Bayesian Approach to Arrival Rate Forecasting for Inhomogeneous Poisson Processes for Mobile Calls A Bayesan Approach to Arrval Rate Forecastng for Inhomogeneous Posson Processes for Moble Calls Mchael N. Nawar Department of Computer Engneerng Caro Unversty Caro, Egypt mchaelnawar@eee.org Amr F. Atya

More information

Statistical inference for generalized Pareto distribution based on progressive Type-II censored data with random removals

Statistical inference for generalized Pareto distribution based on progressive Type-II censored data with random removals Internatonal Journal of Scentfc World, 2 1) 2014) 1-9 c Scence Publshng Corporaton www.scencepubco.com/ndex.php/ijsw do: 10.14419/jsw.v21.1780 Research Paper Statstcal nference for generalzed Pareto dstrbuton

More information

Cure Rate Models. Dr.R. Elangovan 1 * and B. Jayakumar 2. ISSN (Print) : ISSN (Online) :

Cure Rate Models. Dr.R. Elangovan 1 * and B. Jayakumar 2. ISSN (Print) : ISSN (Online) : Asa Pacfc Journal of Research Vol: I. Issue XXXVIII, Aprl 206 ISSN (Prnt : 2320-5504 ISSN (Onlne : 2347-4793 Cure Rate Models Dr.R. Elangovan * and B. Jayakumar 2 Professor, Department of Statstcs, Annamala

More information

Chapter 11: Simple Linear Regression and Correlation

Chapter 11: Simple Linear Regression and Correlation Chapter 11: Smple Lnear Regresson and Correlaton 11-1 Emprcal Models 11-2 Smple Lnear Regresson 11-3 Propertes of the Least Squares Estmators 11-4 Hypothess Test n Smple Lnear Regresson 11-4.1 Use of t-tests

More information

A Robust Method for Calculating the Correlation Coefficient

A Robust Method for Calculating the Correlation Coefficient A Robust Method for Calculatng the Correlaton Coeffcent E.B. Nven and C. V. Deutsch Relatonshps between prmary and secondary data are frequently quantfed usng the correlaton coeffcent; however, the tradtonal

More information

Issues To Consider when Estimating Health Care Costs with Generalized Linear Models (GLMs): To Gamma/Log Or Not To Gamma/Log? That Is The New Question

Issues To Consider when Estimating Health Care Costs with Generalized Linear Models (GLMs): To Gamma/Log Or Not To Gamma/Log? That Is The New Question Issues To Consder when Estmatng Health Care Costs wth Generalzed Lnear Models (GLMs): To Gamma/Log Or Not To Gamma/Log? That Is The New Queston ISPOR 20th Annual Internatonal Meetng May 19, 2015 Jalpa

More information

Conjugacy and the Exponential Family

Conjugacy and the Exponential Family CS281B/Stat241B: Advanced Topcs n Learnng & Decson Makng Conjugacy and the Exponental Famly Lecturer: Mchael I. Jordan Scrbes: Bran Mlch 1 Conjugacy In the prevous lecture, we saw conjugate prors for the

More information

Estimation of accelerated failure time models with random effects

Estimation of accelerated failure time models with random effects Retrospectve Theses and Dssertatons Iowa State Unversty Capstones, Theses and Dssertatons 6 Estmaton of accelerated falure tme models wth random effects Yaqn Wang Iowa State Unversty Follow ths and addtonal

More information

ANOMALIES OF THE MAGNITUDE OF THE BIAS OF THE MAXIMUM LIKELIHOOD ESTIMATOR OF THE REGRESSION SLOPE

ANOMALIES OF THE MAGNITUDE OF THE BIAS OF THE MAXIMUM LIKELIHOOD ESTIMATOR OF THE REGRESSION SLOPE P a g e ANOMALIES OF THE MAGNITUDE OF THE BIAS OF THE MAXIMUM LIKELIHOOD ESTIMATOR OF THE REGRESSION SLOPE Darmud O Drscoll ¹, Donald E. Ramrez ² ¹ Head of Department of Mathematcs and Computer Studes

More information

Chapter 9: Statistical Inference and the Relationship between Two Variables

Chapter 9: Statistical Inference and the Relationship between Two Variables Chapter 9: Statstcal Inference and the Relatonshp between Two Varables Key Words The Regresson Model The Sample Regresson Equaton The Pearson Correlaton Coeffcent Learnng Outcomes After studyng ths chapter,

More information

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA 4 Analyss of Varance (ANOVA) 5 ANOVA 51 Introducton ANOVA ANOVA s a way to estmate and test the means of multple populatons We wll start wth one-way ANOVA If the populatons ncluded n the study are selected

More information

Introduction to Generalized Linear Models

Introduction to Generalized Linear Models INTRODUCTION TO STATISTICAL MODELLING TRINITY 00 Introducton to Generalzed Lnear Models I. Motvaton In ths lecture we extend the deas of lnear regresson to the more general dea of a generalzed lnear model

More information

Semiparametric geographically weighted generalised linear modelling in GWR 4.0

Semiparametric geographically weighted generalised linear modelling in GWR 4.0 Semparametrc geographcally weghted generalsed lnear modellng n GWR 4.0 T. Nakaya 1, A. S. Fotherngham 2, M. Charlton 2, C. Brunsdon 3 1 Department of Geography, Rtsumekan Unversty, 56-1 Tojn-kta-mach,

More information

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analyss of Varance and Desgn of Experment-I MODULE VII LECTURE - 3 ANALYSIS OF COVARIANCE Dr Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur Any scentfc experment s performed

More information

STK4080/9080 Survival and event history analysis

STK4080/9080 Survival and event history analysis SK48/98 Survval and event hstory analyss Lecture 7: Regresson modellng Relatve rsk regresson Regresson models Assume that we have a sample of n ndvduals, and let N (t) count the observed occurrences of

More information

A Note on Test of Homogeneity Against Umbrella Scale Alternative Based on U-Statistics

A Note on Test of Homogeneity Against Umbrella Scale Alternative Based on U-Statistics J Stat Appl Pro No 3 93- () 93 NSP Journal of Statstcs Applcatons & Probablty --- An Internatonal Journal @ NSP Natural Scences Publshng Cor A Note on Test of Homogenety Aganst Umbrella Scale Alternatve

More information

Regression Analysis of Clustered Failure Time Data under the Additive Hazards Model

Regression Analysis of Clustered Failure Time Data under the Additive Hazards Model A^VÇÚO 1 33 ò 1 5 Ï 217 c 1 Chnese Journal of Appled Probablty and Statstcs Oct., 217, Vol. 33, No. 5, pp. 517-528 do: 1.3969/j.ssn.11-4268.217.5.8 Regresson Analyss of Clustered Falure Tme Data under

More information

International Journal of Industrial Engineering Computations

International Journal of Industrial Engineering Computations Internatonal Journal of Industral Engneerng Computatons 4 (03) 47 46 Contents lsts avalable at GrowngScence Internatonal Journal of Industral Engneerng Computatons homepage: www.growngscence.com/ec Modelng

More information

UNIVERSITY OF TORONTO Faculty of Arts and Science. December 2005 Examinations STA437H1F/STA1005HF. Duration - 3 hours

UNIVERSITY OF TORONTO Faculty of Arts and Science. December 2005 Examinations STA437H1F/STA1005HF. Duration - 3 hours UNIVERSITY OF TORONTO Faculty of Arts and Scence December 005 Examnatons STA47HF/STA005HF Duraton - hours AIDS ALLOWED: (to be suppled by the student) Non-programmable calculator One handwrtten 8.5'' x

More information

Comparison on Modelling the Relative. Risk Estimation: Bayesian Study

Comparison on Modelling the Relative. Risk Estimation: Bayesian Study Appled Mathematcal Scences, Vol. 4, 00, no. 54, 663-68 Comparson on Modellng the Relatve Rsk Estmaton: Bayesan Study Rafda M. Elobad and Noor Akma I. Laboratory of Appled & Computatonal Statstcs Insttute

More information

Explaining the Stein Paradox

Explaining the Stein Paradox Explanng the Sten Paradox Kwong Hu Yung 1999/06/10 Abstract Ths report offers several ratonale for the Sten paradox. Sectons 1 and defnes the multvarate normal mean estmaton problem and ntroduces Sten

More information

Proportional Hazards Regression Model for Time to Event Breast Cancer Data: A Bayesian Approach

Proportional Hazards Regression Model for Time to Event Breast Cancer Data: A Bayesian Approach IJIRST Internatonal Journal for Innovatve Research n Scence & Technology Volume 3 Issue 08 January 2017 ISSN (onlne): 2349-6010 Proportonal Hazards Regresson Model for Tme to Event Breast Cancer Data:

More information

Joint Statistical Meetings - Biopharmaceutical Section

Joint Statistical Meetings - Biopharmaceutical Section Iteratve Ch-Square Test for Equvalence of Multple Treatment Groups Te-Hua Ng*, U.S. Food and Drug Admnstraton 1401 Rockvlle Pke, #200S, HFM-217, Rockvlle, MD 20852-1448 Key Words: Equvalence Testng; Actve

More information

Maximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models

Maximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models ECO 452 -- OE 4: Probt and Logt Models ECO 452 -- OE 4 Maxmum Lkelhood Estmaton of Bnary Dependent Varables Models: Probt and Logt hs note demonstrates how to formulate bnary dependent varables models

More information

Statistics Chapter 4

Statistics Chapter 4 Statstcs Chapter 4 "There are three knds of les: les, damned les, and statstcs." Benjamn Dsrael, 1895 (Brtsh statesman) Gaussan Dstrbuton, 4-1 If a measurement s repeated many tmes a statstcal treatment

More information

Lecture 3 Stat102, Spring 2007

Lecture 3 Stat102, Spring 2007 Lecture 3 Stat0, Sprng 007 Chapter 3. 3.: Introducton to regresson analyss Lnear regresson as a descrptve technque The least-squares equatons Chapter 3.3 Samplng dstrbuton of b 0, b. Contnued n net lecture

More information

BIO Lab 2: TWO-LEVEL NORMAL MODELS with school children popularity data

BIO Lab 2: TWO-LEVEL NORMAL MODELS with school children popularity data Lab : TWO-LEVEL NORMAL MODELS wth school chldren popularty data Purpose: Introduce basc two-level models for normally dstrbuted responses usng STATA. In partcular, we dscuss Random ntercept models wthout

More information

Global Sensitivity. Tuesday 20 th February, 2018

Global Sensitivity. Tuesday 20 th February, 2018 Global Senstvty Tuesday 2 th February, 28 ) Local Senstvty Most senstvty analyses [] are based on local estmates of senstvty, typcally by expandng the response n a Taylor seres about some specfc values

More information

GROUP SEQUENTIAL TEST OF NON-PARAMETRIC STATISTICS FOR SURVIVAL DATA

GROUP SEQUENTIAL TEST OF NON-PARAMETRIC STATISTICS FOR SURVIVAL DATA Hacettepe Journal of Mathematcs and Statstcs Volume 34 (2005), 67 74 GROUP SEQUETIAL TEST OF O-PARAMETRIC STATISTICS FOR SURVIVAL DATA Yaprak Parlak Demrhan and Sevl Bacanlı Receved 2 : 02 : 2005 : Accepted

More information

An Introduction to Censoring, Truncation and Sample Selection Problems

An Introduction to Censoring, Truncation and Sample Selection Problems An Introducton to Censorng, Truncaton and Sample Selecton Problems Thomas Crossley SPIDA June 2003 1 A. Introducton A.1 Basc Ideas Most of the statstcal technques we study are for estmatng (populaton)

More information

Estimation of the Mean of Truncated Exponential Distribution

Estimation of the Mean of Truncated Exponential Distribution Journal of Mathematcs and Statstcs 4 (4): 84-88, 008 ISSN 549-644 008 Scence Publcatons Estmaton of the Mean of Truncated Exponental Dstrbuton Fars Muslm Al-Athar Department of Mathematcs, Faculty of Scence,

More information

Hidden Markov Models & The Multivariate Gaussian (10/26/04)

Hidden Markov Models & The Multivariate Gaussian (10/26/04) CS281A/Stat241A: Statstcal Learnng Theory Hdden Markov Models & The Multvarate Gaussan (10/26/04) Lecturer: Mchael I. Jordan Scrbes: Jonathan W. Hu 1 Hdden Markov Models As a bref revew, hdden Markov models

More information

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analyss of Varance and Desgn of Experment-I MODULE VIII LECTURE - 34 ANALYSIS OF VARIANCE IN RANDOM-EFFECTS MODEL AND MIXED-EFFECTS EFFECTS MODEL Dr Shalabh Department of Mathematcs and Statstcs Indan

More information

Tail Dependence Comparison of Survival Marshall-Olkin Copulas

Tail Dependence Comparison of Survival Marshall-Olkin Copulas Tal Dependence Comparson of Survval Marshall-Olkn Copulas Hajun L Department of Mathematcs and Department of Statstcs Washngton State Unversty Pullman, WA 99164, U.S.A. lh@math.wsu.edu January 2006 Abstract

More information

Andreas C. Drichoutis Agriculural University of Athens. Abstract

Andreas C. Drichoutis Agriculural University of Athens. Abstract Heteroskedastcty, the sngle crossng property and ordered response models Andreas C. Drchouts Agrculural Unversty of Athens Panagots Lazards Agrculural Unversty of Athens Rodolfo M. Nayga, Jr. Texas AMUnversty

More information

Chapter 5 Multilevel Models

Chapter 5 Multilevel Models Chapter 5 Multlevel Models 5.1 Cross-sectonal multlevel models 5.1.1 Two-level models 5.1.2 Multple level models 5.1.3 Multple level modelng n other felds 5.2 Longtudnal multlevel models 5.2.1 Two-level

More information

Introduction to Regression

Introduction to Regression Introducton to Regresson Dr Tom Ilvento Department of Food and Resource Economcs Overvew The last part of the course wll focus on Regresson Analyss Ths s one of the more powerful statstcal technques Provdes

More information

USE OF DOUBLE SAMPLING SCHEME IN ESTIMATING THE MEAN OF STRATIFIED POPULATION UNDER NON-RESPONSE

USE OF DOUBLE SAMPLING SCHEME IN ESTIMATING THE MEAN OF STRATIFIED POPULATION UNDER NON-RESPONSE STATISTICA, anno LXXV, n. 4, 015 USE OF DOUBLE SAMPLING SCHEME IN ESTIMATING THE MEAN OF STRATIFIED POPULATION UNDER NON-RESPONSE Manoj K. Chaudhary 1 Department of Statstcs, Banaras Hndu Unversty, Varanas,

More information

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity LINEAR REGRESSION ANALYSIS MODULE IX Lecture - 30 Multcollnearty Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur 2 Remedes for multcollnearty Varous technques have

More information

Testing for seasonal unit roots in heterogeneous panels

Testing for seasonal unit roots in heterogeneous panels Testng for seasonal unt roots n heterogeneous panels Jesus Otero * Facultad de Economía Unversdad del Rosaro, Colomba Jeremy Smth Department of Economcs Unversty of arwck Monca Gulett Aston Busness School

More information

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore Sesson Outlne Introducton to classfcaton problems and dscrete choce models. Introducton to Logstcs Regresson. Logstc functon and Logt functon. Maxmum Lkelhood Estmator (MLE) for estmaton of LR parameters.

More information

Limited Dependent Variables

Limited Dependent Variables Lmted Dependent Varables. What f the left-hand sde varable s not a contnuous thng spread from mnus nfnty to plus nfnty? That s, gven a model = f (, β, ε, where a. s bounded below at zero, such as wages

More information

Managing Snow Risks: The Case of City Governments and Ski Resorts

Managing Snow Risks: The Case of City Governments and Ski Resorts Managng Snow Rsks: The Case of Cty Governments and Sk Resorts Haruyosh Ito* Assstant Professor of Fnance Graduate School of Internatonal Management Internatonal Unversty of Japan 777 Kokusacho, Mnam Uonuma-sh

More information

Web-based Supplementary Materials for Inference for the Effect of Treatment. on Survival Probability in Randomized Trials with Noncompliance and

Web-based Supplementary Materials for Inference for the Effect of Treatment. on Survival Probability in Randomized Trials with Noncompliance and Bometrcs 000, 000 000 DOI: 000 000 0000 Web-based Supplementary Materals for Inference for the Effect of Treatment on Survval Probablty n Randomzed Trals wth Noncomplance and Admnstratve Censorng by Ne,

More information

Durban Watson for Testing the Lack-of-Fit of Polynomial Regression Models without Replications

Durban Watson for Testing the Lack-of-Fit of Polynomial Regression Models without Replications Durban Watson for Testng the Lack-of-Ft of Polynomal Regresson Models wthout Replcatons Ruba A. Alyaf, Maha A. Omar, Abdullah A. Al-Shha ralyaf@ksu.edu.sa, maomar@ksu.edu.sa, aalshha@ksu.edu.sa Department

More information

Uncertainty as the Overlap of Alternate Conditional Distributions

Uncertainty as the Overlap of Alternate Conditional Distributions Uncertanty as the Overlap of Alternate Condtonal Dstrbutons Olena Babak and Clayton V. Deutsch Centre for Computatonal Geostatstcs Department of Cvl & Envronmental Engneerng Unversty of Alberta An mportant

More information

Semiparametric Methods of Time Scale Selection

Semiparametric Methods of Time Scale Selection 1 Semparametrc Methods of Tme Scale Selecton Therry Duchesne Unversty of Toronto, Toronto, Canada Abstract: In several relablty applcatons, there may not be a unque plausble scale n whch to analyze falure

More information

Jon Deeks and Julian Higgins. on Behalf of the Statistical Methods Group of The Cochrane Collaboration. April 2005

Jon Deeks and Julian Higgins. on Behalf of the Statistical Methods Group of The Cochrane Collaboration. April 2005 Standard statstcal algorthms n Cochrane revews Verson 5 Jon Deeks and Julan Hggns on Behalf of the Statstcal Methods Group of The Cochrane Collaboraton Aprl 005 Data structure Consder a meta-analyss of

More information

Discussion of Extensions of the Gauss-Markov Theorem to the Case of Stochastic Regression Coefficients Ed Stanek

Discussion of Extensions of the Gauss-Markov Theorem to the Case of Stochastic Regression Coefficients Ed Stanek Dscusson of Extensons of the Gauss-arkov Theorem to the Case of Stochastc Regresson Coeffcents Ed Stanek Introducton Pfeffermann (984 dscusses extensons to the Gauss-arkov Theorem n settngs where regresson

More information

4.3 Poisson Regression

4.3 Poisson Regression of teratvely reweghted least squares regressons (the IRLS algorthm). We do wthout gvng further detals, but nstead focus on the practcal applcaton. > glm(survval~log(weght)+age, famly="bnomal", data=baby)

More information

/ n ) are compared. The logic is: if the two

/ n ) are compared. The logic is: if the two STAT C141, Sprng 2005 Lecture 13 Two sample tests One sample tests: examples of goodness of ft tests, where we are testng whether our data supports predctons. Two sample tests: called as tests of ndependence

More information

DrPH Seminar Session 3. Quantitative Synthesis. Qualitative Synthesis e.g., GRADE

DrPH Seminar Session 3. Quantitative Synthesis. Qualitative Synthesis e.g., GRADE DrPH Semnar Sesson 3 Quanttatve Synthess Focusng on Heterogenety Qualtatve Synthess e.g., GRADE Me Chung, PhD, MPH Research Assstant Professor Nutrton/Infecton Unt, Department of Publc Health and Communty

More information

THE ROYAL STATISTICAL SOCIETY 2006 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE

THE ROYAL STATISTICAL SOCIETY 2006 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE THE ROYAL STATISTICAL SOCIETY 6 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE PAPER I STATISTICAL THEORY The Socety provdes these solutons to assst canddates preparng for the eamnatons n future years and for

More information

Supplementary Notes for Chapter 9 Mixture Thermodynamics

Supplementary Notes for Chapter 9 Mixture Thermodynamics Supplementary Notes for Chapter 9 Mxture Thermodynamcs Key ponts Nne major topcs of Chapter 9 are revewed below: 1. Notaton and operatonal equatons for mxtures 2. PVTN EOSs for mxtures 3. General effects

More information

Sampling Theory MODULE VII LECTURE - 23 VARYING PROBABILITY SAMPLING

Sampling Theory MODULE VII LECTURE - 23 VARYING PROBABILITY SAMPLING Samplng heory MODULE VII LECURE - 3 VARYIG PROBABILIY SAMPLIG DR. SHALABH DEPARME OF MAHEMAICS AD SAISICS IDIA ISIUE OF ECHOLOGY KAPUR he smple random samplng scheme provdes a random sample where every

More information

and V is a p p positive definite matrix. A normal-inverse-gamma distribution.

and V is a p p positive definite matrix. A normal-inverse-gamma distribution. OSR Journal of athematcs (OSR-J) e-ssn: 78-578, p-ssn: 39-765X. Volume 3, ssue 3 Ver. V (ay - June 07), PP 68-7 www.osrjournals.org Comparng The Performance of Bayesan And Frequentst Analyss ethods of

More information

The optimal delay of the second test is therefore approximately 210 hours earlier than =2.

The optimal delay of the second test is therefore approximately 210 hours earlier than =2. THE IEC 61508 FORMULAS 223 The optmal delay of the second test s therefore approxmately 210 hours earler than =2. 8.4 The IEC 61508 Formulas IEC 61508-6 provdes approxmaton formulas for the PF for smple

More information

Negative Binomial Regression

Negative Binomial Regression STATGRAPHICS Rev. 9/16/2013 Negatve Bnomal Regresson Summary... 1 Data Input... 3 Statstcal Model... 3 Analyss Summary... 4 Analyss Optons... 7 Plot of Ftted Model... 8 Observed Versus Predcted... 10 Predctons...

More information

STAT 3008 Applied Regression Analysis

STAT 3008 Applied Regression Analysis STAT 3008 Appled Regresson Analyss Tutoral : Smple Lnear Regresson LAI Chun He Department of Statstcs, The Chnese Unversty of Hong Kong 1 Model Assumpton To quantfy the relatonshp between two factors,

More information

A new construction of 3-separable matrices via an improved decoding of Macula s construction

A new construction of 3-separable matrices via an improved decoding of Macula s construction Dscrete Optmzaton 5 008 700 704 Contents lsts avalable at ScenceDrect Dscrete Optmzaton journal homepage: wwwelsevercom/locate/dsopt A new constructon of 3-separable matrces va an mproved decodng of Macula

More information

RELIABILITY ASSESSMENT

RELIABILITY ASSESSMENT CHAPTER Rsk Analyss n Engneerng and Economcs RELIABILITY ASSESSMENT A. J. Clark School of Engneerng Department of Cvl and Envronmental Engneerng 4a CHAPMAN HALL/CRC Rsk Analyss for Engneerng Department

More information

Using the estimated penetrances to determine the range of the underlying genetic model in casecontrol

Using the estimated penetrances to determine the range of the underlying genetic model in casecontrol Georgetown Unversty From the SelectedWorks of Mark J Meyer 8 Usng the estmated penetrances to determne the range of the underlyng genetc model n casecontrol desgn Mark J Meyer Neal Jeffres Gang Zheng Avalable

More information

2016 Wiley. Study Session 2: Ethical and Professional Standards Application

2016 Wiley. Study Session 2: Ethical and Professional Standards Application 6 Wley Study Sesson : Ethcal and Professonal Standards Applcaton LESSON : CORRECTION ANALYSIS Readng 9: Correlaton and Regresson LOS 9a: Calculate and nterpret a sample covarance and a sample correlaton

More information

Change Point Estimation for Pareto Type-II Model

Change Point Estimation for Pareto Type-II Model Journal of Modern Appled Statstcal Methods Volume 3 Issue Artcle 5--04 Change Pont Estmaton for Pareto Type-II Model Gyan Prakash S. N. Medcal College, Agra, U. P., Inda, ggyanj@yahoo.com Follow ths and

More information

THE APPLICATION OF LINEAR MIXED-EFFECTS MODEL TO THE EFFECT OF MICROCURRENT ON DECUBITUS WOUNDS. A STUDY IN LIMBURG PROVINCE OF BELGIUM

THE APPLICATION OF LINEAR MIXED-EFFECTS MODEL TO THE EFFECT OF MICROCURRENT ON DECUBITUS WOUNDS. A STUDY IN LIMBURG PROVINCE OF BELGIUM The 3 rd Internatonal Conference on Mathematcs and Statstcs (ICoMS-3) Insttut Pertanan Bogor, Indonesa, 5-6 August 008 THE APPLICATIO OF LIEAR MIXED-EFFECTS MODEL TO THE EFFECT OF MICROCURRET O DECUBITUS

More information

Statistics for Economics & Business

Statistics for Economics & Business Statstcs for Economcs & Busness Smple Lnear Regresson Learnng Objectves In ths chapter, you learn: How to use regresson analyss to predct the value of a dependent varable based on an ndependent varable

More information

CS-433: Simulation and Modeling Modeling and Probability Review

CS-433: Simulation and Modeling Modeling and Probability Review CS-433: Smulaton and Modelng Modelng and Probablty Revew Exercse 1. (Probablty of Smple Events) Exercse 1.1 The owner of a camera shop receves a shpment of fve cameras from a camera manufacturer. Unknown

More information

Computing MLE Bias Empirically

Computing MLE Bias Empirically Computng MLE Bas Emprcally Kar Wa Lm Australan atonal Unversty January 3, 27 Abstract Ths note studes the bas arses from the MLE estmate of the rate parameter and the mean parameter of an exponental dstrbuton.

More information

Markov Chain Monte Carlo (MCMC), Gibbs Sampling, Metropolis Algorithms, and Simulated Annealing Bioinformatics Course Supplement

Markov Chain Monte Carlo (MCMC), Gibbs Sampling, Metropolis Algorithms, and Simulated Annealing Bioinformatics Course Supplement Markov Chan Monte Carlo MCMC, Gbbs Samplng, Metropols Algorthms, and Smulated Annealng 2001 Bonformatcs Course Supplement SNU Bontellgence Lab http://bsnuackr/ Outlne! Markov Chan Monte Carlo MCMC! Metropols-Hastngs

More information

Multivariate Ratio Estimator of the Population Total under Stratified Random Sampling

Multivariate Ratio Estimator of the Population Total under Stratified Random Sampling Open Journal of Statstcs, 0,, 300-304 ttp://dx.do.org/0.436/ojs.0.3036 Publsed Onlne July 0 (ttp://www.scrp.org/journal/ojs) Multvarate Rato Estmator of te Populaton Total under Stratfed Random Samplng

More information

Homework 9 STAT 530/J530 November 22 nd, 2005

Homework 9 STAT 530/J530 November 22 nd, 2005 Homework 9 STAT 530/J530 November 22 nd, 2005 Instructor: Bran Habng 1) Dstrbuton Q-Q plot Boxplot Heavy Taled Lght Taled Normal Skewed Rght Department of Statstcs LeConte 203 ch-square dstrbuton, Telephone:

More information

Stat 642, Lecture notes for 01/27/ d i = 1 t. n i t nj. n j

Stat 642, Lecture notes for 01/27/ d i = 1 t. n i t nj. n j Stat 642, Lecture notes for 01/27/05 18 Rate Standardzaton Contnued: Note that f T n t where T s the cumulatve follow-up tme and n s the number of subjects at rsk at the mdpont or nterval, and d s the

More information

A nonparametric two-sample wald test of equality of variances

A nonparametric two-sample wald test of equality of variances Unversty of Wollongong Research Onlne Centre for Statstcal & Survey Methodology Workng Paper Seres Faculty of Engneerng and Informaton Scences 0 A nonparametrc two-sample wald test of equalty of varances

More information

x i1 =1 for all i (the constant ).

x i1 =1 for all i (the constant ). Chapter 5 The Multple Regresson Model Consder an economc model where the dependent varable s a functon of K explanatory varables. The economc model has the form: y = f ( x,x,..., ) xk Approxmate ths by

More information

1 Binary Response Models

1 Binary Response Models Bnary and Ordered Multnomal Response Models Dscrete qualtatve response models deal wth dscrete dependent varables. bnary: yes/no, partcpaton/non-partcpaton lnear probablty model LPM, probt or logt models

More information

Polynomial Regression Models

Polynomial Regression Models LINEAR REGRESSION ANALYSIS MODULE XII Lecture - 6 Polynomal Regresson Models Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur Test of sgnfcance To test the sgnfcance

More information

Lab 4: Two-level Random Intercept Model

Lab 4: Two-level Random Intercept Model BIO 656 Lab4 009 Lab 4: Two-level Random Intercept Model Data: Peak expratory flow rate (pefr) measured twce, usng two dfferent nstruments, for 17 subjects. (from Chapter 1 of Multlevel and Longtudnal

More information