Cure Rate Models. Dr.R. Elangovan 1 * and B. Jayakumar 2. ISSN (Print) : ISSN (Online) :

Size: px
Start display at page:

Download "Cure Rate Models. Dr.R. Elangovan 1 * and B. Jayakumar 2. ISSN (Print) : ISSN (Online) :"

Transcription

1 Asa Pacfc Journal of Research Vol: I. Issue XXXVIII, Aprl 206 ISSN (Prnt : ISSN (Onlne : Cure Rate Models Dr.R. Elangovan * and B. Jayakumar 2 Professor, Department of Statstcs, Annamala Unversty, Annamalanagar , Tamlnadu, Inda. 2 Research Scholar, Department of Statstcs, Annamala Unversty, Annamalanagar , Tamlnadu, Inda. *Presented n the Workshop on Relablty Theory and Survval Analyss. Department of Statstcs, Punjab Unversty, Chandgarh, Inda. 4th to 6th of November 205. Abstract: models, used for modelng tme-to-event data, conssts of a cured (non-susceptble fracton and an uncured (susceptble fracton and have become ncreasngly popular n the analyss of data from clncal trals. Because of the great mprovements n the treatment of cancer and some other dseases, cure rate models have become ncreasngly popular n the analyss of data from many clncal trals. Tradtonal methods of survval analyss, ncludng the well-known Cox s regresson model, assume that no patent s cured but that all reman at rsk of death or relapse. These models are concerned wth survval only and do not accommodate the possblty of cure.. For example, certan types of cancers, lke breast cancer, leukema, a substantal proporton of patents may now be cured by treatment,.e., cured proporton. The patents who are cured are called mmunes or long-term survvors, whle the remanng patents who develop a recurrence of the dseases are termed susceptble. The populaton of nterest s thus dvded nto two groups vz., cured and noncured. models provdes satsfactory models n such cases. These models are more sutable to analyze data wth cure proportons, as n the above example. Much of work has been done by many researchers n ths area. To name a few Anderson, Balakrshnan, Chatterjee Chen, Clayton, Gray, Pal, Rodrgues and Yn. Keywords: Tme-to-Event Data, Cured Fracton, Cox s Regresson Model, Cure Rate Model.. INTRODUCTION models, used for modelng tme-to-event data, consst of a survvng fracton and have become ncreasngly popular n the analyss of data from clncal trals. Recently, the developments of new drugs and treatment regmens have resulted n patents lvng longer wth certan types of cancer and heart dsease. models have applcatons n a wde array of areas such as bomedcal studes, crmnology, fnance, demography, manufacturng, and ndustral relablty. At an ndvdual level, dagnoss of cancer s regarded as a human tragedy. At the level of socety, cancer s one of the major chronc dseases, causng a notable amount of health admnstratve costs. Prognoss and possble cure from cancer are thus mportant measures of lfetmes whch can be assessed by analyzng the survval of cancer patents. Dfferent statstcal approaches are used for analyzng the cancer survval data. The results of survval analyss for cancer patents have been wdely presented and reported for dfferent human sub populatons of the globe Woolson (98, Kardaun (983, Beadle et al., (984, Sedmak et al., (989. However, very few survval results at natonal level are avalable for the populatons of Pakstan refer to Khan et al., (2004. McGarty (974 has mentoned that for adoptng any sutable. Statstcal technque for analyzng survval data, t should be assumed that the statstcal model embodes the evaluaton of some natural process wth the beleve that the model s a useful approxmaton of the real process. Several approaches have been proposed n the lterature by Leung et al. (997 and Lttle and Rubn (2002 for analyzng the survval data. In ths context, t s proposed to develop cure rate models and analyze such models usng the nferental methods avalable n the lterature based on EM algorthm. The analytcal results are substantated through real data example. 2. EARLY DEVELOPMENTS IN CURE RATE MODELS Destructve weghted Posson cure rate models have been dscussed by Rodrgues et.al. (20. Left truncated and rght censored Webull data and lkelhood nference wth an llustraton have been dscussed by Balakrshnan et.al. (202. EM algorthm-based lkelhood estmaton for some cure rate models s dscussed by Balakrshnan and pal (202. Lognormal lfetmes and lkelhood-based nference for flexble cure rate models based on COM- Posson famly s dscussed by (203. Expectaton maxmzaton-based lkelhood nference for flexble cure rate models wth Webull lfetmes s dscussed by Balakrshnan and Pal (203. An EM algorthm for the estmaton of parameters of a flexble cure rate model wth generalzed gamma lfetme and model dscrmnaton usng lkelhood-and nformaton-based methods has been dscussed by Balakrshnan and Pal (204. Latent cure rate model under repar system and threshold effect have been dscussed by Balakrshnan, et.al. (

2 49 Dr.R. Elangovan and B. Jayakumar 3. CURE RATE MODELS: models are consstng of a cured (non- susceptble fracton and an uncured (susceptble fracton. It s assumed that the survval functon for the entre populaton S (t s a mxture of cured and uncured subjects. Hence ths model s known as a mxture cure rate model and can be wrtten as S(t = S (t + ( + S (t ( c uc Where s a proporton of patents cured on treatment and - s the proporton of patents uncured and wth respect to the cure rate model we can assume a partcular dstrbuton for the survval functon of the cured groups S c (t and survval functon of the uncured groups S uc (t. Snce those who fall nto the cured part of the equaton ( wll never experence the event of nterest, the lm t S t =, and the survval dstrbuton functon S(t for the entre populaton of patents, ths leads to a parametrc survval model. The popular dstrbuton consdered are Gamma, Webull, Extreme value and ts becomes, S (t = ( + S (t (2 uc The probablty mxture densty functon correspondng to the above equaton (2 s f(t = ( f (t (3 uc And the hazard functon of the populaton of patents from equatons (2 and (3 s h(t ( f (t = uc ( Suc ( t (4 The mxture model may be parametrc or non-parametrc dependng on whether s specfed or not. Suppose there are n patents enterng to a clncal study. Let t, =, 2,, n be the observed survval tme for the th patent and let δ be a censorng ndcator defned such that d t = f s censored 0 otherwse Lkelhood for the mxture cure rate model, whch was ntally ntroduced by De Angels et.al., (997 s gven by d { } { ( ( } n ( q ( ( d L t, = f t + S t. = uc uc (5 Where are estmated by use of a lnk functon, usually the log-log, dentty, Logt or probt functons. For a detaled study, refer to Anderson (2007, Lambert et.al.,( Bayesan Cure Rate Survval Model The survval functon for T, and the survval functon for the populaton usng equaton S(t = + ( + S uc (t. We get S (t = exp( q + ( exp( q S ( t pop Where S pop (t s standard cure rate model wth cure rate equal to φ=exp(-θ and survval functon for the non-cured populaton s S uc (t. If the covarates depend on θ through the relatonshp θ=exp (x β, where x s a p x vector of regresson coeffcent for the cured and non-cured group. For the cured group, the sgn of regresson coeffcents affects the cure fracton. Those negatve regresson co-effcent leads to a large cure fracton, when the correspondng covarate takes postve values. For the non-cured group the regresson coeffcents affect the hazards functon. Specfcally, a negatve hazard, whereas a postve regresson coeffcent leads to a smaller hazard, when the correspondng covarate takes a postve value Bayesan Cure Rate Survval Model wth Fralty In ths secton the cure rate survval model can ncorporate random effects (RE to ft lfe tme data set. We apply ths method to examne patterns of clncal trals usng survval data. Models for fralty n multvarate cure fracton models consdered by Yn (2008.Thus for tmes t j observed on subjects and events j, Yn proposed multplcatve fralty at subject level combned wth Posson regresson for n the cure fracton sex p(-θ j. One opton takes S ( t = exp( - q Z Ft ( uc j j j wth Hazard rates h uc (t j = θ j Z f (t j. In ths context usng the above two equatons n the Bayesan cure rate fralty wth canoncal lnk functon n the settng of generalzed lnear models refer to Sundaram and Venkatesan ( Estmaton of the parameters usng the EM algorthm We frst note that the ndcator varable I takes on the value f the lfetme of a subject s actually observed. However, f the lfetme of a subject s censored, the subject mght belong to the cured group or mght belong to the susceptble group. Thus, the value of the ndcator varable I s unknown f the subject s lfetme s censored. Ths actually ntroduces the mssng data and motvates uc

3 50 Cure Rate Models us to employ the EM algorthm for the determnaton of the MLEs of the model parameters. Interested readers may refer to the book by McLachlan and Krshnan (2008 for a detaled dscusson on the EM algorthm. Some nterestng result regardng EM algorthm can be seen n Balakrshnan and Pal (202, 203 and 204. Usng EM algorthm, the real data example s shown n secton REAL DATA EXAMPLES Case (: Parametrc, Non-Parametrc approaches to cure rate models usng cancer patents. Dfferent statstcal approaches are used for analyzng the cancer survval data. The results of survval analyss for cancer patents have been wdely presented and reported for dfferent human sub populatons of the globe, refer to Woolson (98, Kardaun(983, Beadle et al.(984, Sedmak et.al.(989.wth the followng objectves.. To estmate the survval functon usng the standard Kaplan-Meer Estmator, 2. To estmate the cumulatve hazard functon, usng the Nelson- Aalen estmator, 3. To ft an approprate parametrc lfetme model based on Anderson- Darlng goodness of ft test. The relevant lfetme data on the patents of cancer n accord wth the Cancer Hosptal n Chenna was selected. Ths hosptal receves ts patents from a wde area n the lmts of Southern Inda. A retrospectve smple random sample desgn was used; the lfetme data on 200 male and 40 female patents of cancer belongng to dfferent classes was selected. These 340 patents of cancer were treated n Cancer Hosptal n Chenna and other Multspecalty hosptals located n Chenna and combatore and the data were collected usng January 200 to December 204. The regstraton tme was January, 200 to December 3, Assumpton and notatons In ths study the generalzed type-i censorng was consdered. For more convenence, the censorng was due to the followng reasons: a. A patent emgrated out of the study area was mpossble to follow. b. An ndvdual survved past the end of the study perod. c. The censorng was non-nformatve. For the representaton of the data consdered n ths study, each ndvdual had ts own specfc lfetme whch was rescaled at startng tme to t 0 = 0. Accordng to Klen and Moeschberger (997.. was taken as a non-negatve random varable, the tme untl the event of nterest (death due to cancer occurred.. It was assumed to be ndependently and dentcally dstrbuted wth probablty densty functon f(t the survval functon S(t and hazard functon h(t.. C r was the fxed rght censorng tme; T and C r were assumed to be ndependent. The exact lfetme of an ndvdual was known f and only f T was less than or equal to C r. v. Pars of random varables convenently represented the data, (X,δwhere δ was the censorng ndcator and X was equal to, f the lfetme was observed, and to C r f t was censored and X=mn(T,Cr Parametrc approach Consderng the lfetme parametrc model as the useful approxmaton of the real process, three lfetme models vz., Gamma, Webull and Extreme value dstrbuton were consdered. The Anderson-Darlng test, whch makes the use of these specfc lfetme dstrbutons n calculatng crtcal values, was defned wth H 0 : The data followed a specfed parametrc lfetme model. H a : The data dd not follow the specfed lfetme model. The Anderson and Darlng (954 test statstc was (2 n 2 n A = logfy ( log( ( FYn n = + + where F the cumulatve dstrbuton functon of the specfed dstrbuton was, Y was the ordered data and n was the number of observatons. The test was a one sded test and the hypothess that the dstrbuton of a specfc form was rejected f the test statstc A was greater than the crtcal value. From the class of specfed lfetme dstrbutons, the parametrc lfetme model, whch one has the mnmum Anderson-Darlng (adjusted value, gave the better ft Nonparametrc approach Cox and Oakes (984 and Kalbflesch and Prentce (2002 presented a nonparametrc approach to estmate survval functon usng standard Kaplan Meer (KM technque (Kaplan and Meer, 958. There were D dstnct tmes wth t < t 2 <.t D,d deaths or events occurred at tme and were the number of ndvduals who were at rsk at tme t. The KM estmator was defned as for all values of t n the range where there was data: f t < t S( t = d f t t < t < t Y

4 5 Dr.R. Elangovan and B. Jayakumar It was obvous from KM estmator, for t < t, S ( t = and when then ( 0 S t = for t t Cox and Oakes (984 also establshed the varance of the KM estmator usng Greenwood s relaton as: V S ( t = S ( t 2 t ty Y d ( d Moreover, KM estmator was also used to estmate the = The Nelson Aalen (NA estmator of the cumulatve hazard rate was defned up to the largest observed tme on the study (see Aalen, 978: cumulatve hazard functon; H ( t In S ( t H ( t 0 = t t d Y f t t f t t The estmated varance of the NA estmator was gven by 2 s H ( t d =. t ty The Webull model The probablty densty functon of a Webull random varables k k x ( x k l e, f( x; l, k= l l x 0 0 x < 0, Where k > 0 s the shape parameter and λ>0 s the scale parameter of the dstrbuton. The Webull dstrbuton s related to a number of other probablty dstrbuton; n partcular, t nterpolates between the exponental dstrbuton (k = and the Raylegn dstrbuton (k = 2 and λ= 2σ f the quantty X s a tme- to- falure, the Webull dstrbuton gves a dstrbuton for whch the falure rate s proportonal to a power of tme. The shape parameter k, s that power plus one, and so ths parameter can be nterpreted drectly as follows: A value of k < ndcates that the falure rate decreases over tme. Ths happens f there s sgnfcant nfant mortalty, or defectve tems falng early and the falure rate decreasng over tme as the defectve tems are weeded out of the populaton. A value k = ndcates that the falure rate s constant over tme. Ths mght suggest random external events are causng mortalty, or falure. A value of k > ndcates that the falure rate ncreases wth tme. Ths happens f there s an agng process, or parts that are more lkely to fal as tme go on The Gamma model The two parameter Gamma dstrbutons s one of the mportant models n relablty for descrbng the falure tme dstrbuton of the tems under test. The two parameter Gamma dstrbuton wth probablty densty functon s gven by x s p F( x; s, p = e x ; x 0, s> 0; p< p ps Where σ s a scale parameter and p s a shape parameter. The hazard rate µ(t or λ(t of the gamma lfe dstrbuton are ncreasng functon of t ndcatng the agng effect. If P =, then Gamma becomes an exponental dstrbuton havng the constant hazard-rate µt ( =. s For p<, the falure rate functons are decreasng. For the ntegral value of p, the Gamma dstrbuton arses as a sum of p dentcally and ndependently dstrbuted exponental random varables. If p tems ware on test and t s assumed that the falure tme dstrbuton s exponental wth parameter σ, then total tme on test would be a Gamma varable wth parameters p and σ The Extreme value model The Dstrbuton often referred to as the Extreme value dstrbuton (Type s the lmtng dstrbuton of the mnmum of a large number of unbounded dentcally dstrbuted random varables. The probablty dstrbuton functon and cumulatve dstrbuton functon are gven by x m x m b b f( x = e exp e, < X <, b > 0. b x m F x e X b ( = exp, < <, b > 0. If the values are bounded below (as s the case wth tmes of falure then the lmtng dstrbuton s the Webull. Usng the parametrc and non-parametrc approach as dscussed n secton 4 the results were shown n table, table 2 and table 3. On the whole, t appeared from Table that the female group survved more than the male group based on the two descrptve measures. These descrptve measures dd not compare the two groups at dfferent tme ponts n

5 52 Cure Rate Models tme of follow up; however, such a vsual comparson of gender survval by usng the standard non-parametrc KM and NA methods for the group of bone tumor patents s presented n Fg. and Fg. 2, respectvely. It s apparent from Fg. that female group consstently lay above that for male group partcularly upto 75 years of age. Ths dfference ndcates that female patents are the Table : Cancer group Descrptve measures of survval tme and hazard rate Average survval tme n years (T Average hazard rate (h Males Females Males Females Bone tumor Bran tumor Lung Cancer Leukema Lver cancer Oran cancer Overall Table 2: Statstcal sgnfcance test of gender survval Test Ch-Square Degrees of Freedom P-Value Log-Rank Wlcoxon better survvors. Fg. 2 s the plot of the cumulatve hazard rate for bone tumor patents, whch also shows that female prognoss of survval were better than ther male counterparts. In a smlar fashon, the survval prognoss about the remanng types of the cancer patents can also be determned. To explore the statstcal sgnfcance of gender survval, Log-rank and Wlcoxon test were used. In Table 2, the p-values for both tests were near to zero whch provded the strong statstcal evdence that males were dyng faster than females. The emprcal results of descrptve characterstcs due to nonparametrc KM and best ftted Parametrc Webull model approaches are presented n Tables 3 and 4 respectvely. Table 3 shows that female patents had greater mean survval tme (MST of 64. years than years for males. Aldna et al. (2004 also used the Kaplan-Meer approach to estmate the mean survval tme for the esophageal cancer patents n Pakstan. They estmated the mean age of 56 years n 59 percent male and 4 percent female patents, whle n ths study, 58 percent males and 42 percent females had mean age of 42.4 and 5 years, respectvely. Table 3 also shows that the estmate of medan survval tme for males was 49 years progressng to 64 years for female patents, whch agan confrms the survval superorty of females. From the parametrc pont of vew, the Webull dstrbuton seemed to be the best ftted lfe tme model based on the lower values of adjusted Anderson- Darlng test for males and females for all classes of cancer. The parameter estmates along wth the major characterstcs of the dstrbuton are shown n Table 4. Mean survval tme (MST Table 3: Descrptve characterstcs of bone tumor patents by Kaplan-Meer procedure MST Standard Error 95% Normal CI Lower Upper Medan Q Q3 F M F M F M F M F M F M F M Parameter Table 4: Parameter estmates and major characterstcs of nterest of Webull Dstrbuton for bone tumor patents Estmate Standard error 95.0% Normal CI Male Female Male Female Lower Male Lower Female Upper Male Shape Upper Female Scale Mean Survval tme Standard devaton Medan Frst Quartle Thrd Quartle

6 53 Dr.R. Elangovan and B. Jayakumar Fg.. Comparson of survval functon of male and female bone tumor patents by usng KM estmator Fg. 2. Comparson of cumulatve hazard functon of male bone tumor and female patents by usng NA estmator of males (53.24 years was lower than years for females. The percentage devatons of MST for males and females were 0.73 and.84, whch ndcated a very close estmaton of MST by usng both approaches.e. nonparametrc and parametrc. By observng the comparatve graphs of survval functon and hazard functon based on KM estmator, NA estmator and Webull lfetme model, t was observed that at young age survval rate of tumor patents was hghest, whle as age ncreased survval rate decreased. Case ( Bayesan Cure Rate Survval Models for Pulmonary Tuberculoss Followng the approach are Anderson (2007 and Chen et.al. (999 and the data set avalable usng TRC-ICMR (2007, the data relatng to Sex, Age, Weght and Doses relatng to 360 Cancer patents are collected and Mean, SD, MCSE and Coverage probabltes based and teraton relatng to Gamma, Webull, and Extreme value cure rate models has been calculated

7 54 Cure Rate Models Table 5: Posteror summares of Pulmonary Tuberculoss data under Bayesan Gamma Cure Rate Survval model for n=360 Censored % =0 at and teratons Parameters l= ; a=. 000;DIC = 787 Covarates Mean SD MCSE Percentles Iteraton 25% 50% 75% Table 6: Posteror summares of Pulmonary Tuberculoss data under Bayesan Gamma Survval model for n=360 Censored % =0 at and teratons Parameters l= ; a= ;DIC = 784 Covarates Mean SD MCSE Percentles Iteraton 25% 50% 75% Sex(β Table 7: Posteror summares of Pulmonary Tuberculoss data under Bayesan Webull Cure Rate Survval Model for n=360 Censored % =0 at and teratons Parameters l= ; a=. 7079;DIC = 689 Covarates Mean SD MCSE Percentles Iteraton 25% 50% 75%

8 55 Dr.R. Elangovan and B. Jayakumar Table 7: Contnued Table 8: Posteror summares of Pulmonary Tuberculoss data under Bayesan Webull Cure Rate Survval Model for n= 360, Censored % =0 at and teratons Parameters l= ; a= ;DIC = 58 Covarates Mean SD MCSE Percentles Iteraton 25% 50% 75% Table 9: Posteror summares of Pulmonary Tuberculoss data under Bayesan Extreme value Cure Rate survval Model for n= 360, censored % = 0 at and teratons Parameters l= ; a= ;DIC = 842 Covarates Mean SD MCSE Percentles Iteraton 25% 50% 75%

9 56 Cure Rate Models Table 0: Posteror summares of Pulmonary Tuberculoss data under Bayesan Extreme value survval model for n = 360, censored % = 0 at and teratons Parameters l= ; a= ; DIC = 445 Covarates Mean SD MCSE Percentles Iteraton 25% 50% 75% l=0.267 α=.0000 DIC=857 l=0.528 α= DIC=774 l=0.505 α=.7089 DIC=697 l=0.450 α= DIC=578 l=0.995 α= DIC=842 l=0.356 α=.053 DIC=445 Fg. : Bayesan Cure Rate model (Intercept l=0.267 α=.0000 DIC=857 l=0.528 α= DIC=774 l=0.505 α=.7089 DIC=697 l=0.450 α= DIC=578 l=0.995 α= DIC=842 l=0.356 α=.053 DIC=445 Fg. 2: Bayesan Cure Rate model (Sex

10 57 Dr.R. Elangovan and B. Jayakumar l=0.267 α=.0000 DIC=857 l=0.528 α= DIC=774 l=0.505 α=.7089 DIC=697 l=0.450 α= DIC=578 l=0.995 α= DIC=842 l=0.356 α=.053 DIC=445 Fg. 3: Bayesan Cure Rate model (Age l=0.267 α=.0000 DIC=857 l=0.528 α= DIC=774 l=0.505 α=.7089 DIC=697 l=0.450 α= DIC=578 l=0.995 α= DIC=842 l=0.356 α=.053 DIC=445 Fg. 4: Bayesan Cure Rate model (Weght and results are shown Table 5, Table 6, Table 7, Table 8, Table 9 and Table 0. The results were also depcted n Fg, Fg 2, Fg 3, Fg 4, Fg 5, and Fg 6 respectvely. The smulatons were carred out usng SAS avalable at Tamlnadu Dr. M.G.R Medcal Unversty located at Chenna. The parametrc Bayesan Cure Rate survval model wth and wthout fralty for lfetme censored pulmonary tuberculoss data have been ftted. The parameter have been estmated (unknown shape, scale, Posteror Summares of regresson coeffcents and random effects usng MCMC technques wth the help of SAS. The detaled analyses have been carred out n Tamlnadu Dr. M. G. R Medcal Unversty, Gundy, based on to teraton. It s observed that from Table 5 to 0 the Bayesan cure rate survval model performance very well and best ft to the data based on to teraton. It s further observed that the data well ft to the rght censored pulmonary Tuberculoss data compared wth cancer data set dscussed n secton 4.

11 58 Cure Rate Models l=0.267 α=.0000 DIC=857 l=0.528 α= DIC=774 l=0.505 α=.7089 DIC=697 l=0.450 α= DIC=578 l=0.995 α= DIC=842 l=0.356 α=.053 DIC=445 Fg. 5: Bayesan Cure Rate model (Doses l=0.267 α=.0000 DIC=857 l=0.528 α= DIC=774 l=0.505 α=.7089 DIC=697 l=0.450 α= DIC=578 l=0.995 α= DIC=842 l=0.356 α=.053 DIC=445 Fg. 6: Bayesan Cure Rate model ( ACKNOWLEDGEMENT The authors ndebted to Mr. M. Shanmugam, Head, Computer secton, Tamlnadu Dr. M. G. R Medcal Unversty, Chenna, for hs encouragement and data analyss and smulaton to complete ths work. REFERENCE 3. Balakrshnan, N. and Pal (204. An EM algorthm for the estmaton of parameters of a flexble cure rate model wth generalzed gamma lfetme and model dscrmnaton usng lkelhood-and nformaton-based methods Computatonal Statstcs, 30 (, Balakrshnan, N. and Pal, S. (202. EM algorthm-based lkelhood estmaton for some cure rate models. Journal of Statstcal Theory and Practce, 6, Balakrshnan, N. and Pal, S. (203. Expectaton maxmzaton-based lkelhood nference for flexble cure rate models wth Webull lfetmes. Statstcal Methods n Medcal Research (To appear. DOI: 0.77/ Balakrshnan, N. and Pal, S. (203. Lognormal lfetmes and lkelhood-based nference for flexble cure rate models based

12 59 Dr.R. Elangovan and B. Jayakumar on COM-Posson famly. Computatonal Statstcs & Data Analyss, 67, Josemar Rodrgues, N. Balakrshnan, Gauss M. Cordero, Maro de Castro and Vcente G. Cancho (205. Latent cure rate model under repar system and threshold effect. Journal of Statstcal Computaton and Smulaton. Vol. 85, No. 4, Kaplan, E. L. and P. Meer, (958. Nonparametrc estmaton from ncomplete observatons. J. Amer.Stat. Assoc., 53: Kardaun, O., (983. Statstcal analyss of male larynx cancer patents: A case study. Statstcal Nederlandca, 37: Khan, T. H., S. Iqbal, M. Abram and A. S. Warrach, (2004. The relatonshp of age, sex and martal status wth the prevalence of cancer n the patents vstng Nshtar Hosptal, Multan, Pakstan.. Leung, K. M., R. M. Elashoff and A. A. Aff, (997. Censorng ssues n survval analyss. AnnualRevew of Publc Health, 8: M.R. Gupta and Y.Chen (200. Theory and use of the EM Algorthm do: 0.56/ A well wrtten short book on EM Algorthm, ncludng detal dervaton of EM Algorthm. 3. McGarty, T. P., (974. Stochastc System and State Estmaton. John Wley and sons Inc., New York,USA. 4. Pakstan J. Zool., 36: Klen, J. P. and M. L. Moeschberger, (997. Survval Analyss: Technques for Censored and Truncated Data. Sprnger-Verlang Inc., New York, USA. 5. Rodrgues, J., de Castro, M., Balakrshnan, N. and Cancho, V. G. (20. Destructve weghted Posson cure rate models. Lfetme Data Analyss 7; Sedmak, D. D., T. A. Meneke, D. S. Knechtges and J. Anderson, (989. Prognostc sgnfcance ofcytokeratn-postve breast cancer metastases. Modern Pathol., 2: Torey, P. and A. Broom, (2007. Cancer patent s negotatons of therapeutc optons n Pakstan. Qualtatve Hlth. Res., 7: Woolson, R. F., (98. Rank test and a one-sample log rank test for comparng observed survval data to a standard populaton. Bometrcs, 37:

Non-Mixture Cure Model for Interval Censored Data: Simulation Study ABSTRACT

Non-Mixture Cure Model for Interval Censored Data: Simulation Study ABSTRACT Malaysan Journal of Mathematcal Scences 8(S): 37-44 (2014) Specal Issue: Internatonal Conference on Mathematcal Scences and Statstcs 2013 (ICMSS2013) MALAYSIAN JOURNAL OF MATHEMATICAL SCIENCES Journal

More information

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4)

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4) I. Classcal Assumptons Econ7 Appled Econometrcs Topc 3: Classcal Model (Studenmund, Chapter 4) We have defned OLS and studed some algebrac propertes of OLS. In ths topc we wll study statstcal propertes

More information

Chapter 20 Duration Analysis

Chapter 20 Duration Analysis Chapter 20 Duraton Analyss Duraton: tme elapsed untl a certan event occurs (weeks unemployed, months spent on welfare). Survval analyss: duraton of nterest s survval tme of a subject, begn n an ntal state

More information

x = , so that calculated

x = , so that calculated Stat 4, secton Sngle Factor ANOVA notes by Tm Plachowsk n chapter 8 we conducted hypothess tests n whch we compared a sngle sample s mean or proporton to some hypotheszed value Chapter 9 expanded ths to

More information

Comparison of Regression Lines

Comparison of Regression Lines STATGRAPHICS Rev. 9/13/2013 Comparson of Regresson Lnes Summary... 1 Data Input... 3 Analyss Summary... 4 Plot of Ftted Model... 6 Condtonal Sums of Squares... 6 Analyss Optons... 7 Forecasts... 8 Confdence

More information

GROUP SEQUENTIAL TEST OF NON-PARAMETRIC STATISTICS FOR SURVIVAL DATA

GROUP SEQUENTIAL TEST OF NON-PARAMETRIC STATISTICS FOR SURVIVAL DATA Hacettepe Journal of Mathematcs and Statstcs Volume 34 (2005), 67 74 GROUP SEQUETIAL TEST OF O-PARAMETRIC STATISTICS FOR SURVIVAL DATA Yaprak Parlak Demrhan and Sevl Bacanlı Receved 2 : 02 : 2005 : Accepted

More information

Linear Regression Analysis: Terminology and Notation

Linear Regression Analysis: Terminology and Notation ECON 35* -- Secton : Basc Concepts of Regresson Analyss (Page ) Lnear Regresson Analyss: Termnology and Notaton Consder the generc verson of the smple (two-varable) lnear regresson model. It s represented

More information

Parametric fractional imputation for missing data analysis. Jae Kwang Kim Survey Working Group Seminar March 29, 2010

Parametric fractional imputation for missing data analysis. Jae Kwang Kim Survey Working Group Seminar March 29, 2010 Parametrc fractonal mputaton for mssng data analyss Jae Kwang Km Survey Workng Group Semnar March 29, 2010 1 Outlne Introducton Proposed method Fractonal mputaton Approxmaton Varance estmaton Multple mputaton

More information

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition)

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition) Count Data Models See Book Chapter 11 2 nd Edton (Chapter 10 1 st Edton) Count data consst of non-negatve nteger values Examples: number of drver route changes per week, the number of trp departure changes

More information

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA 4 Analyss of Varance (ANOVA) 5 ANOVA 51 Introducton ANOVA ANOVA s a way to estmate and test the means of multple populatons We wll start wth one-way ANOVA If the populatons ncluded n the study are selected

More information

Lecture 6: Introduction to Linear Regression

Lecture 6: Introduction to Linear Regression Lecture 6: Introducton to Lnear Regresson An Manchakul amancha@jhsph.edu 24 Aprl 27 Lnear regresson: man dea Lnear regresson can be used to study an outcome as a lnear functon of a predctor Example: 6

More information

STAT 405 BIOSTATISTICS (Fall 2016) Handout 15 Introduction to Logistic Regression

STAT 405 BIOSTATISTICS (Fall 2016) Handout 15 Introduction to Logistic Regression STAT 45 BIOSTATISTICS (Fall 26) Handout 5 Introducton to Logstc Regresson Ths handout covers materal found n Secton 3.7 of your text. You may also want to revew regresson technques n Chapter. In ths handout,

More information

Chapter 8 Indicator Variables

Chapter 8 Indicator Variables Chapter 8 Indcator Varables In general, e explanatory varables n any regresson analyss are assumed to be quanttatve n nature. For example, e varables lke temperature, dstance, age etc. are quanttatve n

More information

Negative Binomial Regression

Negative Binomial Regression STATGRAPHICS Rev. 9/16/2013 Negatve Bnomal Regresson Summary... 1 Data Input... 3 Statstcal Model... 3 Analyss Summary... 4 Analyss Optons... 7 Plot of Ftted Model... 8 Observed Versus Predcted... 10 Predctons...

More information

Chapter 13: Multiple Regression

Chapter 13: Multiple Regression Chapter 13: Multple Regresson 13.1 Developng the multple-regresson Model The general model can be descrbed as: It smplfes for two ndependent varables: The sample ft parameter b 0, b 1, and b are used to

More information

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands Content. Inference on Regresson Parameters a. Fndng Mean, s.d and covarance amongst estmates.. Confdence Intervals and Workng Hotellng Bands 3. Cochran s Theorem 4. General Lnear Testng 5. Measures of

More information

RELIABILITY ASSESSMENT

RELIABILITY ASSESSMENT CHAPTER Rsk Analyss n Engneerng and Economcs RELIABILITY ASSESSMENT A. J. Clark School of Engneerng Department of Cvl and Envronmental Engneerng 4a CHAPMAN HALL/CRC Rsk Analyss for Engneerng Department

More information

Web-based Supplementary Materials for Inference for the Effect of Treatment. on Survival Probability in Randomized Trials with Noncompliance and

Web-based Supplementary Materials for Inference for the Effect of Treatment. on Survival Probability in Randomized Trials with Noncompliance and Bometrcs 000, 000 000 DOI: 000 000 0000 Web-based Supplementary Materals for Inference for the Effect of Treatment on Survval Probablty n Randomzed Trals wth Noncomplance and Admnstratve Censorng by Ne,

More information

CS-433: Simulation and Modeling Modeling and Probability Review

CS-433: Simulation and Modeling Modeling and Probability Review CS-433: Smulaton and Modelng Modelng and Probablty Revew Exercse 1. (Probablty of Smple Events) Exercse 1.1 The owner of a camera shop receves a shpment of fve cameras from a camera manufacturer. Unknown

More information

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X Statstcs 1: Probablty Theory II 37 3 EPECTATION OF SEVERAL RANDOM VARIABLES As n Probablty Theory I, the nterest n most stuatons les not on the actual dstrbuton of a random vector, but rather on a number

More information

Computing MLE Bias Empirically

Computing MLE Bias Empirically Computng MLE Bas Emprcally Kar Wa Lm Australan atonal Unversty January 3, 27 Abstract Ths note studes the bas arses from the MLE estmate of the rate parameter and the mean parameter of an exponental dstrbuton.

More information

Chapter 11: Simple Linear Regression and Correlation

Chapter 11: Simple Linear Regression and Correlation Chapter 11: Smple Lnear Regresson and Correlaton 11-1 Emprcal Models 11-2 Smple Lnear Regresson 11-3 Propertes of the Least Squares Estmators 11-4 Hypothess Test n Smple Lnear Regresson 11-4.1 Use of t-tests

More information

Hydrological statistics. Hydrological statistics and extremes

Hydrological statistics. Hydrological statistics and extremes 5--0 Stochastc Hydrology Hydrologcal statstcs and extremes Marc F.P. Berkens Professor of Hydrology Faculty of Geoscences Hydrologcal statstcs Mostly concernes wth the statstcal analyss of hydrologcal

More information

Chapter 9: Statistical Inference and the Relationship between Two Variables

Chapter 9: Statistical Inference and the Relationship between Two Variables Chapter 9: Statstcal Inference and the Relatonshp between Two Varables Key Words The Regresson Model The Sample Regresson Equaton The Pearson Correlaton Coeffcent Learnng Outcomes After studyng ths chapter,

More information

Basically, if you have a dummy dependent variable you will be estimating a probability.

Basically, if you have a dummy dependent variable you will be estimating a probability. ECON 497: Lecture Notes 13 Page 1 of 1 Metropoltan State Unversty ECON 497: Research and Forecastng Lecture Notes 13 Dummy Dependent Varable Technques Studenmund Chapter 13 Bascally, f you have a dummy

More information

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U)

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U) Econ 413 Exam 13 H ANSWERS Settet er nndelt 9 deloppgaver, A,B,C, som alle anbefales å telle lkt for å gøre det ltt lettere å stå. Svar er gtt . Unfortunately, there s a prntng error n the hnt of

More information

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore Sesson Outlne Introducton to classfcaton problems and dscrete choce models. Introducton to Logstcs Regresson. Logstc functon and Logt functon. Maxmum Lkelhood Estmator (MLE) for estmaton of LR parameters.

More information

STAT 3008 Applied Regression Analysis

STAT 3008 Applied Regression Analysis STAT 3008 Appled Regresson Analyss Tutoral : Smple Lnear Regresson LAI Chun He Department of Statstcs, The Chnese Unversty of Hong Kong 1 Model Assumpton To quantfy the relatonshp between two factors,

More information

Change Point Estimation for Pareto Type-II Model

Change Point Estimation for Pareto Type-II Model Journal of Modern Appled Statstcal Methods Volume 3 Issue Artcle 5--04 Change Pont Estmaton for Pareto Type-II Model Gyan Prakash S. N. Medcal College, Agra, U. P., Inda, ggyanj@yahoo.com Follow ths and

More information

Chapter 5 Multilevel Models

Chapter 5 Multilevel Models Chapter 5 Multlevel Models 5.1 Cross-sectonal multlevel models 5.1.1 Two-level models 5.1.2 Multple level models 5.1.3 Multple level modelng n other felds 5.2 Longtudnal multlevel models 5.2.1 Two-level

More information

This column is a continuation of our previous column

This column is a continuation of our previous column Comparson of Goodness of Ft Statstcs for Lnear Regresson, Part II The authors contnue ther dscusson of the correlaton coeffcent n developng a calbraton for quanttatve analyss. Jerome Workman Jr. and Howard

More information

UNIVERSITY OF TORONTO Faculty of Arts and Science. December 2005 Examinations STA437H1F/STA1005HF. Duration - 3 hours

UNIVERSITY OF TORONTO Faculty of Arts and Science. December 2005 Examinations STA437H1F/STA1005HF. Duration - 3 hours UNIVERSITY OF TORONTO Faculty of Arts and Scence December 005 Examnatons STA47HF/STA005HF Duraton - hours AIDS ALLOWED: (to be suppled by the student) Non-programmable calculator One handwrtten 8.5'' x

More information

A Robust Method for Calculating the Correlation Coefficient

A Robust Method for Calculating the Correlation Coefficient A Robust Method for Calculatng the Correlaton Coeffcent E.B. Nven and C. V. Deutsch Relatonshps between prmary and secondary data are frequently quantfed usng the correlaton coeffcent; however, the tradtonal

More information

/ n ) are compared. The logic is: if the two

/ n ) are compared. The logic is: if the two STAT C141, Sprng 2005 Lecture 13 Two sample tests One sample tests: examples of goodness of ft tests, where we are testng whether our data supports predctons. Two sample tests: called as tests of ndependence

More information

Statistics Chapter 4

Statistics Chapter 4 Statstcs Chapter 4 "There are three knds of les: les, damned les, and statstcs." Benjamn Dsrael, 1895 (Brtsh statesman) Gaussan Dstrbuton, 4-1 If a measurement s repeated many tmes a statstcal treatment

More information

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analyss of Varance and Desgn of Experment-I MODULE VII LECTURE - 3 ANALYSIS OF COVARIANCE Dr Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur Any scentfc experment s performed

More information

COMPARISON OF SOME RELIABILITY CHARACTERISTICS BETWEEN REDUNDANT SYSTEMS REQUIRING SUPPORTING UNITS FOR THEIR OPERATIONS

COMPARISON OF SOME RELIABILITY CHARACTERISTICS BETWEEN REDUNDANT SYSTEMS REQUIRING SUPPORTING UNITS FOR THEIR OPERATIONS Avalable onlne at http://sck.org J. Math. Comput. Sc. 3 (3), No., 6-3 ISSN: 97-537 COMPARISON OF SOME RELIABILITY CHARACTERISTICS BETWEEN REDUNDANT SYSTEMS REQUIRING SUPPORTING UNITS FOR THEIR OPERATIONS

More information

Simulated Power of the Discrete Cramér-von Mises Goodness-of-Fit Tests

Simulated Power of the Discrete Cramér-von Mises Goodness-of-Fit Tests Smulated of the Cramér-von Mses Goodness-of-Ft Tests Steele, M., Chaselng, J. and 3 Hurst, C. School of Mathematcal and Physcal Scences, James Cook Unversty, Australan School of Envronmental Studes, Grffth

More information

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics ECOOMICS 35*-A Md-Term Exam -- Fall Term 000 Page of 3 pages QUEE'S UIVERSITY AT KIGSTO Department of Economcs ECOOMICS 35* - Secton A Introductory Econometrcs Fall Term 000 MID-TERM EAM ASWERS MG Abbott

More information

Department of Statistics University of Toronto STA305H1S / 1004 HS Design and Analysis of Experiments Term Test - Winter Solution

Department of Statistics University of Toronto STA305H1S / 1004 HS Design and Analysis of Experiments Term Test - Winter Solution Department of Statstcs Unversty of Toronto STA35HS / HS Desgn and Analyss of Experments Term Test - Wnter - Soluton February, Last Name: Frst Name: Student Number: Instructons: Tme: hours. Ads: a non-programmable

More information

Generalized Linear Methods

Generalized Linear Methods Generalzed Lnear Methods 1 Introducton In the Ensemble Methods the general dea s that usng a combnaton of several weak learner one could make a better learner. More formally, assume that we have a set

More information

Answers Problem Set 2 Chem 314A Williamsen Spring 2000

Answers Problem Set 2 Chem 314A Williamsen Spring 2000 Answers Problem Set Chem 314A Wllamsen Sprng 000 1) Gve me the followng crtcal values from the statstcal tables. a) z-statstc,-sded test, 99.7% confdence lmt ±3 b) t-statstc (Case I), 1-sded test, 95%

More information

LINEAR REGRESSION ANALYSIS. MODULE VIII Lecture Indicator Variables

LINEAR REGRESSION ANALYSIS. MODULE VIII Lecture Indicator Variables LINEAR REGRESSION ANALYSIS MODULE VIII Lecture - 7 Indcator Varables Dr. Shalabh Department of Maematcs and Statstcs Indan Insttute of Technology Kanpur Indcator varables versus quanttatve explanatory

More information

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers Psychology 282 Lecture #24 Outlne Regresson Dagnostcs: Outlers In an earler lecture we studed the statstcal assumptons underlyng the regresson model, ncludng the followng ponts: Formal statement of assumptons.

More information

On an Extension of Stochastic Approximation EM Algorithm for Incomplete Data Problems. Vahid Tadayon 1

On an Extension of Stochastic Approximation EM Algorithm for Incomplete Data Problems. Vahid Tadayon 1 On an Extenson of Stochastc Approxmaton EM Algorthm for Incomplete Data Problems Vahd Tadayon Abstract: The Stochastc Approxmaton EM (SAEM algorthm, a varant stochastc approxmaton of EM, s a versatle tool

More information

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding Recall: man dea of lnear regresson Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 8 Lnear regresson can be used to study an

More information

Spatial Frailty Survival Model for Infection of Tuberculosis among HIV Infected Individuals

Spatial Frailty Survival Model for Infection of Tuberculosis among HIV Infected Individuals SSRG Internatonal Journal of Medcal Scence (SSRG-IJMS) Volume 4 Issue 2 February 217 Spatal Fralty Survval Model for Infecton of uberculoss among HIV Infected Indvduals Srnvasan R 1, Ponnuraja C 1, Rajendran

More information

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 008 Recall: man dea of lnear regresson Lnear regresson can be used to study

More information

BIO Lab 2: TWO-LEVEL NORMAL MODELS with school children popularity data

BIO Lab 2: TWO-LEVEL NORMAL MODELS with school children popularity data Lab : TWO-LEVEL NORMAL MODELS wth school chldren popularty data Purpose: Introduce basc two-level models for normally dstrbuted responses usng STATA. In partcular, we dscuss Random ntercept models wthout

More information

STK4080/9080 Survival and event history analysis

STK4080/9080 Survival and event history analysis SK48/98 Survval and event hstory analyss Lecture 7: Regresson modellng Relatve rsk regresson Regresson models Assume that we have a sample of n ndvduals, and let N (t) count the observed occurrences of

More information

Lecture Notes on Linear Regression

Lecture Notes on Linear Regression Lecture Notes on Lnear Regresson Feng L fl@sdueducn Shandong Unversty, Chna Lnear Regresson Problem In regresson problem, we am at predct a contnuous target value gven an nput feature vector We assume

More information

How its computed. y outcome data λ parameters hyperparameters. where P denotes the Laplace approximation. k i k k. Andrew B Lawson 2013

How its computed. y outcome data λ parameters hyperparameters. where P denotes the Laplace approximation. k i k k. Andrew B Lawson 2013 Andrew Lawson MUSC INLA INLA s a relatvely new tool that can be used to approxmate posteror dstrbutons n Bayesan models INLA stands for ntegrated Nested Laplace Approxmaton The approxmaton has been known

More information

Module 3 LOSSY IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur

Module 3 LOSSY IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur Module 3 LOSSY IMAGE COMPRESSION SYSTEMS Verson ECE IIT, Kharagpur Lesson 6 Theory of Quantzaton Verson ECE IIT, Kharagpur Instructonal Objectves At the end of ths lesson, the students should be able to:

More information

Time-Varying Coefficient Model with Linear Smoothing Function for Longitudinal Data in Clinical Trial

Time-Varying Coefficient Model with Linear Smoothing Function for Longitudinal Data in Clinical Trial Tme-Varyng Coeffcent Model wth Lnear Smoothng Functon for Longtudnal Data n Clncal Tral Masanor Ito, Toshhro Msum and Hdek Hrooka Bostatstcs Group, Data Scence Dept., Astellas Pharma Inc. Introducton In

More information

Computation of Higher Order Moments from Two Multinomial Overdispersion Likelihood Models

Computation of Higher Order Moments from Two Multinomial Overdispersion Likelihood Models Computaton of Hgher Order Moments from Two Multnomal Overdsperson Lkelhood Models BY J. T. NEWCOMER, N. K. NEERCHAL Department of Mathematcs and Statstcs, Unversty of Maryland, Baltmore County, Baltmore,

More information

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction ECONOMICS 5* -- NOTE (Summary) ECON 5* -- NOTE The Multple Classcal Lnear Regresson Model (CLRM): Specfcaton and Assumptons. Introducton CLRM stands for the Classcal Lnear Regresson Model. The CLRM s also

More information

The Geometry of Logit and Probit

The Geometry of Logit and Probit The Geometry of Logt and Probt Ths short note s meant as a supplement to Chapters and 3 of Spatal Models of Parlamentary Votng and the notaton and reference to fgures n the text below s to those two chapters.

More information

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation Statstcs for Managers Usng Mcrosoft Excel/SPSS Chapter 13 The Smple Lnear Regresson Model and Correlaton 1999 Prentce-Hall, Inc. Chap. 13-1 Chapter Topcs Types of Regresson Models Determnng the Smple Lnear

More information

MLE and Bayesian Estimation. Jie Tang Department of Computer Science & Technology Tsinghua University 2012

MLE and Bayesian Estimation. Jie Tang Department of Computer Science & Technology Tsinghua University 2012 MLE and Bayesan Estmaton Je Tang Department of Computer Scence & Technology Tsnghua Unversty 01 1 Lnear Regresson? As the frst step, we need to decde how we re gong to represent the functon f. One example:

More information

Sample Size Calculation Based on the Semiparametric Analysis of Short-term and Long-term Hazard Ratios. Yi Wang

Sample Size Calculation Based on the Semiparametric Analysis of Short-term and Long-term Hazard Ratios. Yi Wang Sample Sze Calculaton Based on the Semparametrc Analyss of Short-term and Long-term Hazard Ratos Y Wang Submtted n partal fulfllment of the requrements for the degree of Doctor of Phlosophy under the Executve

More information

Statistics for Economics & Business

Statistics for Economics & Business Statstcs for Economcs & Busness Smple Lnear Regresson Learnng Objectves In ths chapter, you learn: How to use regresson analyss to predct the value of a dependent varable based on an ndependent varable

More information

Correlation and Regression. Correlation 9.1. Correlation. Chapter 9

Correlation and Regression. Correlation 9.1. Correlation. Chapter 9 Chapter 9 Correlaton and Regresson 9. Correlaton Correlaton A correlaton s a relatonshp between two varables. The data can be represented b the ordered pars (, ) where s the ndependent (or eplanator) varable,

More information

Joint Statistical Meetings - Biopharmaceutical Section

Joint Statistical Meetings - Biopharmaceutical Section Iteratve Ch-Square Test for Equvalence of Multple Treatment Groups Te-Hua Ng*, U.S. Food and Drug Admnstraton 1401 Rockvlle Pke, #200S, HFM-217, Rockvlle, MD 20852-1448 Key Words: Equvalence Testng; Actve

More information

Parameters Estimation of the Modified Weibull Distribution Based on Type I Censored Samples

Parameters Estimation of the Modified Weibull Distribution Based on Type I Censored Samples Appled Mathematcal Scences, Vol. 5, 011, no. 59, 899-917 Parameters Estmaton of the Modfed Webull Dstrbuton Based on Type I Censored Samples Soufane Gasm École Supereure des Scences et Technques de Tuns

More information

4.3 Poisson Regression

4.3 Poisson Regression of teratvely reweghted least squares regressons (the IRLS algorthm). We do wthout gvng further detals, but nstead focus on the practcal applcaton. > glm(survval~log(weght)+age, famly="bnomal", data=baby)

More information

USE OF DOUBLE SAMPLING SCHEME IN ESTIMATING THE MEAN OF STRATIFIED POPULATION UNDER NON-RESPONSE

USE OF DOUBLE SAMPLING SCHEME IN ESTIMATING THE MEAN OF STRATIFIED POPULATION UNDER NON-RESPONSE STATISTICA, anno LXXV, n. 4, 015 USE OF DOUBLE SAMPLING SCHEME IN ESTIMATING THE MEAN OF STRATIFIED POPULATION UNDER NON-RESPONSE Manoj K. Chaudhary 1 Department of Statstcs, Banaras Hndu Unversty, Varanas,

More information

DERIVATION OF THE PROBABILITY PLOT CORRELATION COEFFICIENT TEST STATISTICS FOR THE GENERALIZED LOGISTIC DISTRIBUTION

DERIVATION OF THE PROBABILITY PLOT CORRELATION COEFFICIENT TEST STATISTICS FOR THE GENERALIZED LOGISTIC DISTRIBUTION Internatonal Worshop ADVANCES IN STATISTICAL HYDROLOGY May 3-5, Taormna, Italy DERIVATION OF THE PROBABILITY PLOT CORRELATION COEFFICIENT TEST STATISTICS FOR THE GENERALIZED LOGISTIC DISTRIBUTION by Sooyoung

More information

The Occurrence and Timing of Events: The Application of Event History Models in Accounting and Finance Research

The Occurrence and Timing of Events: The Application of Event History Models in Accounting and Finance Research The Occurrence and Tmng of Events: The Applcaton of Event Hstory Models n Accountng and Fnance Research Marc J. LeClere Assstant Professor Department of Accountng School of Busness Admnstraton Loyola Unversty

More information

MATH 829: Introduction to Data Mining and Analysis The EM algorithm (part 2)

MATH 829: Introduction to Data Mining and Analysis The EM algorithm (part 2) 1/16 MATH 829: Introducton to Data Mnng and Analyss The EM algorthm (part 2) Domnque Gullot Departments of Mathematcal Scences Unversty of Delaware Aprl 20, 2016 Recall 2/16 We are gven ndependent observatons

More information

Economics 130. Lecture 4 Simple Linear Regression Continued

Economics 130. Lecture 4 Simple Linear Regression Continued Economcs 130 Lecture 4 Contnued Readngs for Week 4 Text, Chapter and 3. We contnue wth addressng our second ssue + add n how we evaluate these relatonshps: Where do we get data to do ths analyss? How do

More information

DETERMINATION OF UNCERTAINTY ASSOCIATED WITH QUANTIZATION ERRORS USING THE BAYESIAN APPROACH

DETERMINATION OF UNCERTAINTY ASSOCIATED WITH QUANTIZATION ERRORS USING THE BAYESIAN APPROACH Proceedngs, XVII IMEKO World Congress, June 7, 3, Dubrovn, Croata Proceedngs, XVII IMEKO World Congress, June 7, 3, Dubrovn, Croata TC XVII IMEKO World Congress Metrology n the 3rd Mllennum June 7, 3,

More information

Lecture 3 Stat102, Spring 2007

Lecture 3 Stat102, Spring 2007 Lecture 3 Stat0, Sprng 007 Chapter 3. 3.: Introducton to regresson analyss Lnear regresson as a descrptve technque The least-squares equatons Chapter 3.3 Samplng dstrbuton of b 0, b. Contnued n net lecture

More information

Using the estimated penetrances to determine the range of the underlying genetic model in casecontrol

Using the estimated penetrances to determine the range of the underlying genetic model in casecontrol Georgetown Unversty From the SelectedWorks of Mark J Meyer 8 Usng the estmated penetrances to determne the range of the underlyng genetc model n casecontrol desgn Mark J Meyer Neal Jeffres Gang Zheng Avalable

More information

BOOTSTRAP METHOD FOR TESTING OF EQUALITY OF SEVERAL MEANS. M. Krishna Reddy, B. Naveen Kumar and Y. Ramu

BOOTSTRAP METHOD FOR TESTING OF EQUALITY OF SEVERAL MEANS. M. Krishna Reddy, B. Naveen Kumar and Y. Ramu BOOTSTRAP METHOD FOR TESTING OF EQUALITY OF SEVERAL MEANS M. Krshna Reddy, B. Naveen Kumar and Y. Ramu Department of Statstcs, Osmana Unversty, Hyderabad -500 007, Inda. nanbyrozu@gmal.com, ramu0@gmal.com

More information

Estimation of the Mean of Truncated Exponential Distribution

Estimation of the Mean of Truncated Exponential Distribution Journal of Mathematcs and Statstcs 4 (4): 84-88, 008 ISSN 549-644 008 Scence Publcatons Estmaton of the Mean of Truncated Exponental Dstrbuton Fars Muslm Al-Athar Department of Mathematcs, Faculty of Scence,

More information

Chapter 3 Describing Data Using Numerical Measures

Chapter 3 Describing Data Using Numerical Measures Chapter 3 Student Lecture Notes 3-1 Chapter 3 Descrbng Data Usng Numercal Measures Fall 2006 Fundamentals of Busness Statstcs 1 Chapter Goals To establsh the usefulness of summary measures of data. The

More information

Hidden Markov Models & The Multivariate Gaussian (10/26/04)

Hidden Markov Models & The Multivariate Gaussian (10/26/04) CS281A/Stat241A: Statstcal Learnng Theory Hdden Markov Models & The Multvarate Gaussan (10/26/04) Lecturer: Mchael I. Jordan Scrbes: Jonathan W. Hu 1 Hdden Markov Models As a bref revew, hdden Markov models

More information

Time to dementia onset: competing risk analysis with Laplace regression

Time to dementia onset: competing risk analysis with Laplace regression Tme to dementa onset: competng rsk analyss wth Laplace regresson Gola Santon, Debora Rzzuto, Laura Fratglon 4 th Nordc and Baltc STATA Users group meetng, Stockholm, November 20 Agng Research Center (ARC),

More information

January Examinations 2015

January Examinations 2015 24/5 Canddates Only January Examnatons 25 DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR STUDENT CANDIDATE NO.. Department Module Code Module Ttle Exam Duraton (n words)

More information

XII.3 The EM (Expectation-Maximization) Algorithm

XII.3 The EM (Expectation-Maximization) Algorithm XII.3 The EM (Expectaton-Maxzaton) Algorth Toshnor Munaata 3/7/06 The EM algorth s a technque to deal wth varous types of ncoplete data or hdden varables. It can be appled to a wde range of learnng probles

More information

Copyright 2010 Cengage Learning, Inc. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part.

Copyright 2010 Cengage Learning, Inc. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. yes to (3) two-sample problem? no to (4) underlyng dstrbuton normal or can centrallmt theorem be assumed to hold? and yes to (5) underlyng dstrbuton bnomal? We now refer to the flowchart at the end of

More information

a. (All your answers should be in the letter!

a. (All your answers should be in the letter! Econ 301 Blkent Unversty Taskn Econometrcs Department of Economcs Md Term Exam I November 8, 015 Name For each hypothess testng n the exam complete the followng steps: Indcate the test statstc, ts crtcal

More information

Statistical inference for generalized Pareto distribution based on progressive Type-II censored data with random removals

Statistical inference for generalized Pareto distribution based on progressive Type-II censored data with random removals Internatonal Journal of Scentfc World, 2 1) 2014) 1-9 c Scence Publshng Corporaton www.scencepubco.com/ndex.php/ijsw do: 10.14419/jsw.v21.1780 Research Paper Statstcal nference for generalzed Pareto dstrbuton

More information

Structure and Drive Paul A. Jensen Copyright July 20, 2003

Structure and Drive Paul A. Jensen Copyright July 20, 2003 Structure and Drve Paul A. Jensen Copyrght July 20, 2003 A system s made up of several operatons wth flow passng between them. The structure of the system descrbes the flow paths from nputs to outputs.

More information

Chapter 15 - Multiple Regression

Chapter 15 - Multiple Regression Chapter - Multple Regresson Chapter - Multple Regresson Multple Regresson Model The equaton that descrbes how the dependent varable y s related to the ndependent varables x, x,... x p and an error term

More information

Laboratory 3: Method of Least Squares

Laboratory 3: Method of Least Squares Laboratory 3: Method of Least Squares Introducton Consder the graph of expermental data n Fgure 1. In ths experment x s the ndependent varable and y the dependent varable. Clearly they are correlated wth

More information

Here is the rationale: If X and y have a strong positive relationship to one another, then ( x x) will tend to be positive when ( y y)

Here is the rationale: If X and y have a strong positive relationship to one another, then ( x x) will tend to be positive when ( y y) Secton 1.5 Correlaton In the prevous sectons, we looked at regresson and the value r was a measurement of how much of the varaton n y can be attrbuted to the lnear relatonshp between y and x. In ths secton,

More information

Durban Watson for Testing the Lack-of-Fit of Polynomial Regression Models without Replications

Durban Watson for Testing the Lack-of-Fit of Polynomial Regression Models without Replications Durban Watson for Testng the Lack-of-Ft of Polynomal Regresson Models wthout Replcatons Ruba A. Alyaf, Maha A. Omar, Abdullah A. Al-Shha ralyaf@ksu.edu.sa, maomar@ksu.edu.sa, aalshha@ksu.edu.sa Department

More information

Statistical tables are provided Two Hours UNIVERSITY OF MANCHESTER. Date: Wednesday 4 th June 2008 Time: 1400 to 1600

Statistical tables are provided Two Hours UNIVERSITY OF MANCHESTER. Date: Wednesday 4 th June 2008 Time: 1400 to 1600 Statstcal tables are provded Two Hours UNIVERSITY OF MNCHESTER Medcal Statstcs Date: Wednesday 4 th June 008 Tme: 1400 to 1600 MT3807 Electronc calculators may be used provded that they conform to Unversty

More information

ANSWERS CHAPTER 9. TIO 9.2: If the values are the same, the difference is 0, therefore the null hypothesis cannot be rejected.

ANSWERS CHAPTER 9. TIO 9.2: If the values are the same, the difference is 0, therefore the null hypothesis cannot be rejected. ANSWERS CHAPTER 9 THINK IT OVER thnk t over TIO 9.: χ 2 k = ( f e ) = 0 e Breakng the equaton down: the test statstc for the ch-squared dstrbuton s equal to the sum over all categores of the expected frequency

More information

Homework 9 STAT 530/J530 November 22 nd, 2005

Homework 9 STAT 530/J530 November 22 nd, 2005 Homework 9 STAT 530/J530 November 22 nd, 2005 Instructor: Bran Habng 1) Dstrbuton Q-Q plot Boxplot Heavy Taled Lght Taled Normal Skewed Rght Department of Statstcs LeConte 203 ch-square dstrbuton, Telephone:

More information

Linear Approximation with Regularization and Moving Least Squares

Linear Approximation with Regularization and Moving Least Squares Lnear Approxmaton wth Regularzaton and Movng Least Squares Igor Grešovn May 007 Revson 4.6 (Revson : March 004). 5 4 3 0.5 3 3.5 4 Contents: Lnear Fttng...4. Weghted Least Squares n Functon Approxmaton...

More information

Maximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models

Maximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models ECO 452 -- OE 4: Probt and Logt Models ECO 452 -- OE 4 Maxmum Lkelhood Estmaton of Bnary Dependent Varables Models: Probt and Logt hs note demonstrates how to formulate bnary dependent varables models

More information

Stat 543 Exam 2 Spring 2016

Stat 543 Exam 2 Spring 2016 Stat 543 Exam 2 Sprng 206 I have nether gven nor receved unauthorzed assstance on ths exam. Name Sgned Date Name Prnted Ths Exam conssts of questons. Do at least 0 of the parts of the man exam. I wll score

More information

ANOMALIES OF THE MAGNITUDE OF THE BIAS OF THE MAXIMUM LIKELIHOOD ESTIMATOR OF THE REGRESSION SLOPE

ANOMALIES OF THE MAGNITUDE OF THE BIAS OF THE MAXIMUM LIKELIHOOD ESTIMATOR OF THE REGRESSION SLOPE P a g e ANOMALIES OF THE MAGNITUDE OF THE BIAS OF THE MAXIMUM LIKELIHOOD ESTIMATOR OF THE REGRESSION SLOPE Darmud O Drscoll ¹, Donald E. Ramrez ² ¹ Head of Department of Mathematcs and Computer Studes

More information

The EM Algorithm (Dempster, Laird, Rubin 1977) The missing data or incomplete data setting: ODL(φ;Y ) = [Y;φ] = [Y X,φ][X φ] = X

The EM Algorithm (Dempster, Laird, Rubin 1977) The missing data or incomplete data setting: ODL(φ;Y ) = [Y;φ] = [Y X,φ][X φ] = X The EM Algorthm (Dempster, Lard, Rubn 1977 The mssng data or ncomplete data settng: An Observed Data Lkelhood (ODL that s a mxture or ntegral of Complete Data Lkelhoods (CDL. (1a ODL(;Y = [Y;] = [Y,][

More information

A Note on Test of Homogeneity Against Umbrella Scale Alternative Based on U-Statistics

A Note on Test of Homogeneity Against Umbrella Scale Alternative Based on U-Statistics J Stat Appl Pro No 3 93- () 93 NSP Journal of Statstcs Applcatons & Probablty --- An Internatonal Journal @ NSP Natural Scences Publshng Cor A Note on Test of Homogenety Aganst Umbrella Scale Alternatve

More information

Heterogeneous Treatment Effect Analysis

Heterogeneous Treatment Effect Analysis Heterogeneous Treatment Effect Analyss Ben Jann ETH Zurch In cooperaton wth Jenne E. Brand (UCLA) and Yu Xe (Unversty of Mchgan) German Stata Users Group Meetng Berln, June 25, 2010 Ben Jann (ETH Zurch)

More information

Managing Snow Risks: The Case of City Governments and Ski Resorts

Managing Snow Risks: The Case of City Governments and Ski Resorts Managng Snow Rsks: The Case of Cty Governments and Sk Resorts Haruyosh Ito* Assstant Professor of Fnance Graduate School of Internatonal Management Internatonal Unversty of Japan 777 Kokusacho, Mnam Uonuma-sh

More information

Credit Card Pricing and Impact of Adverse Selection

Credit Card Pricing and Impact of Adverse Selection Credt Card Prcng and Impact of Adverse Selecton Bo Huang and Lyn C. Thomas Unversty of Southampton Contents Background Aucton model of credt card solctaton - Errors n probablty of beng Good - Errors n

More information