STK4080/9080 Survival and event history analysis
|
|
- Donna Hood
- 6 years ago
- Views:
Transcription
1 SK48/98 Survval and event hstory analyss Lecture 7: Regresson modellng Relatve rsk regresson Regresson models Assume that we have a sample of n ndvduals, and let N (t) count the observed occurrences of the event of nterest for ndvdual as a functon of (study) tme t We have the decomposton dn ( t) = λ ( t) dt + dm ( t) Partal lkelhood Estmaton of cumulatve hazards and survval probabltes Martngale resduals and model check Stratfed models observaton sgnal nose We wll consder regresson models where the ntensty process λ ( t) for ndvdual depends on a vector of (possbly) tme-dependent covarates x ( t) = ( x ( t), x ( t),..., x ( t)) 2 p 2 he ntensty process for ndvdual may be gven as λ ( t) = Y ( t) α( t x ) at rsk ndcator hazard rate (ntensty) (tme-dependency of covarates suppressed n the notaton) A regresson model specfes how the hazard rate depends on the covarates We wll consder two types of regresson models: Relatve rsk regresson models (secton 4.) Addtve regresson models (secton 4.2) 3 A note on covarates We assume that the ntensty processes depend on the covarate processes x ( t) = ( x ( t), x ( t),..., x ( t)) =,..., n 2 p hroughout we wll assume that the covarate processes are predctable hs mples that: fxed covarates should be measured n advance (.e. at tme zero) and reman fxed throughout the study the values at tme t of tme-dependent covarates should be known "ust before" tme t You should never let covarates depend on nformaton from the future! 4
2 It s useful to dstngush between external (or exogenous) and nternal (or endogenous) covarates Examples of external covarates are: Fxed covarates Defned tme-dependent covarates: the complete covarate path s gven at the outset of the study (e.g. a person's age at study tme t ) Ancllary tme-dependent covarates: the path of a stochastc process that s not nfluenced by the event beng studed (e.g. observed level of ar polluton) me-dependent covarates that are not external, are called nternal One example of an nternal covarate s a bomarker measured for the ndvduals durng follow-up Interpretaton of regresson analyses wth nternal tme-dependent covarates s not at all straghtforward! 5 Relatve rsk regresson models Assume that the hazard rate for ndvdual takes the form α ( t x ) = α ) (, x ( t)) baselne hazard ( t rβ hazard rato (relatve rsk) β α ( t) We assume r (, ) =, so the baselne hazard s the hazard for an ndvdual wth all covarates equal to zero We make no assumptons of the form of the baselne hazard hus the model contans a nonparametrc part (the baselne hazard) and a parametrc part (the relatve rsk functon) We say that the model s semparametrc 6 he common choce of relatve rsk functon s ( ) ( β β ) r( β, x ( t)) = exp β x ( t) = exp x ( t) + + x ( t) p p whch gves Cox's regresson model Consder two ndvduals, ndexed and 2, and assume that all components of x ( t) and ( t) are equal, 2 except the -th component where x ( t) = x ( t) + hen: α( t x2) α( t x ) ( β x t ) ( β x t ) α( t)exp 2( ) = α ( t)exp ( ) x 2 { ( 2 t t )} = exp β x ( ) x ( ) = e β hus e β s the hazard rato for one unt's ncrease n the -th covarate, keepng all other covarates constant 7 Other possble choces of the relatve rsk functon are: he addtve rsk functon: r( β, x ( t)) = + β x ( t) he excess relatve rsk functon: p r( β, x ( t)) = { + β x ( t)} = Cox regresson s the only relatve rsk regresson model mplemented n R 8
3 Partal lkelhood and estmaton of β Ordnary ML-estmaton does not work for the relatve rsk regresson models (due to the nonparametrc baselne) Instead we have to use a partal lkelhood We wll se how ths may be derved he ntensty process of N (t) s gven as λ ( t) = Y ( t) α( t x ) = Y ( t) α ( t) r( β, x ( )) t he ntensty process of the aggregated countng process takes the form (assumng no ont events) We consder the condtonal probablty of observng an event for ndvdual at tme t, gven the past and gven that an event s observed at tme t : ( t) = P( dn ( t) = dn ( t) =, F ) π t P( dn( t) = F t ) = P( dn ( t) = F ) t hen the ntensty process of N (t) may be factorzed as 9 We obtan the partal lkelhood by multplyng together the condtonal probabltes over all observed event tmes (thereby dsregardng the nformaton on the regresson coeffcents contaned n the aggregated process) hen, f s the ndex of the ndvdual who experences an event at, the partal lkelhood becomes We wll show (later) that the maxmum partal lkelhood estmator enoys "the usual propertes" of ML-estmators hus s approxmately multvarate normally dstrbuted around the true value of wth a covarance matrx that may be estmated by, where s the observed nformaton matrx For general relatve rsk functons t may be better to use the expected nformaton matrx. But as ths concdes wth the observed nformaton matrx for Cox regresson, we wll not go nto these detals (cf. secton 4..5) where s the rsk set at 2
4 o test the null hypothess H : β =, we may use the Wald test statstc ˆ β Z = se ( ˆ β ) whch s approxmately standard normally dstrbuted under the null hypothess o obtan a confdence nterval for the hazard rato we transform the lmts of the standard confdence nterval for to get the 95% confdence nterval : β { ˆ β ˆ ± se β } exp.96 ( ) e β 3 o test the smple null hypothess H : β = β for a specfed value of β (typcally β = ) we may apply the usual lkelhood based tests statstcs: he lkelhood rato test statstc: he score test statstc: where = Uβ ( ) Iβ ( ) Uβ ( ) 2 χ SC he Wald test statstc: χ = ( βˆ β ) Iβ ( ˆ)( βˆ β ) 2 W s the vector of score functons All the test statstcs are approxmately ch-squared dstrbuted wth p df under the null hypothess 4 All the tests may be generalzed to a composte null hypothess, where on want to test the hypothess that r of the regresson coeffcents are zero (or equvalently, after a reparameterzaton, that there are r lnear restrctons among the regresson coeffcents) In partcular f s the maxmum partal lkelhood estmator under the null hypothess, the lkelhood rato test statstc takes the form and t s approxmately ch-squared dstrbuted wth r df under the null hypothess 5 Usng R For llustraton we use the melanoma data (cf practcal exercses and 2) # Read data: path=" melanoma=read.table(path,header=) # We frst consder the model wth log-thckness as the only covarate: ft.t=coxph(surv(lfetme,status==)~log2(thckn),data=melanoma) summary(ft.t) # Note that we use base 2 logarthms for ease of nterpretaton # hen we consder the model wth log-thckness and sex as covarates: ft.ts=coxph(surv(lfetme,status==)~log2(thckn)+sex,data=melanoma) summary(ft.ts) # Note that snce sex s a bnary covarate (coded and 2), we get the same # estmates f we treat sex as a numerc covarate or as a categorcal # covarate [by usng factor(sex) n the coxph-command] # he two models may be compared usng the lkelhood rato test: anova(ft.t,ft.ts,test="chsq") 6
5 Estmaton of cumulatve hazards and survval probabltes We wll estmate the cumulatve baselne hazard A t t ( ) ( ) = α u du We take the aggregated countng process as our startng pont Its ntensty process s gven by For a gven value ofβ, we may therefore estmate A ( t) by Snce s unknown, we replace t by to obtan the Breslow estmator: If we had knownβ, ths would have been an example of the multplcatve ntensty model 7 8 If all covarates are fxed, the cumulatve hazard correspondng to an ndvdual wth a gven covarate vector s x he correspondng survval functon s gven by and t may be estmated by and t may be estmated by For a gven path of an external tme-dependent covarate, the cumulatve hazard may be estmated by Alternatvely we may use (as s done n R): { A ˆ t x } Sɶ ( t x ) = exp ( ) For practcal purposes there s lttle dfference between the two estmators he estmators of the cumulatve hazards and survval functons are approxmately normal and ther varances may be estmated as descrbed n secton 4..6 (whch s not part of the currculum) 2
6 Usng R For llustraton we contnue to use the melanoma data # We frst consder ulceraton as the only covarate and start by # makng Nelson-Aalen plots for patents wth and wthout ulceraton: ft.su=coxph(surv(lfetme,status==)~strata(ulcer),data=melanoma) surv.su=survft(ft.su) plot(surv.su,fun="cumhaz", mark.tme=f,xlm=c(,),ylm=c(,.7), xlab="years snce operaton",ylab="cumulatve hazard",lty=:2) legend("topleft",c("ulceraton","no ulceraton"),lty=:2) # We then ft a Cox model wth ulceraton as the only covarate and plot # the model based estmates of the cumulatve hazards n the same plot: ft.u=coxph(surv(lfetme,status==)~ulcer,data=melanoma) surv.u=survft(ft.u,newdata=data.frame(ulcer=c(,2))) lnes(surv.u,fun="cumhaz", mark.tme=f,conf.nt=f, lty=:2,col="red") # We then consder the model wth ulceraton and log-thckness ft.ut=coxph(surv(lfetme,status==)~ulcer+log2(thckn),data=melanoma) summary(ft.ut) # We wll plot the cumulatve hazards for the four covarate combnatons # ) ulcer=2, thckn= # 2) ulcer=2, thckn=4 # 3) ulcer=, thckn=4 # 3) ulcer=, thckn=8 new.covarates=data.frame(ulcer=c(2,2,,),thckn=c(,4,4,8)) surv.ut=survft(ft.ut,newdata= new.covarates) plot(surv.ut,fun="cumhaz", mark.tme=f, xlm=c(,), xlab="years snce operaton",ylab="cumulatve hazard",lty=:4) legend("topleft",c("","2","3","4"), lty=:4) # o plot the survval functons for the same combnatons of the # covarates we ust omt the "cumhaz" opton: plot(surv.ut,mark.tme=f, xlm=c(,), xlab="years snce peraton",lty=:4) legend("bottomleft",c("","2","3","4"), lty=:4) 2 22 Martngale resduals and model check We know that the processes wth Λ ( t) = t λ ( u) du = Y ( u) r( β, x ( u)) α ( u) du M ( t) = N ( t) Λ ( t) are martngales f the model s correctly specfed Λ ( t) ˆβ β α u du We may estmate by nsertng for and da ˆ ( u) for ( ) where A ˆ ( t) s the Breslow estmator t 23 Estmated cumulatve ntensty processes: Λ ˆ ( t ) = ( ) ( ˆ, ( )) ˆ t Y u r β x u d A ( u ) Y ( ) ( ˆ r β, x ( )) r( βˆ, x ( )) = t l l R Martngale resdual processes Mˆ ( t) = N ( t) Λˆ ( t) Martngale resduals M ˆ = M ˆ ( τ ) where τ s upper tme lmt of study 24
7 In the ABG-book (secton 4..3) a method s descrbed for checkng goodness-of-ft for relatve rsk regresson models usng grouped martngale resdual processes We wll not consder ths method, but rather present the methods of Ln et al (Bometrka 993) for checkng the assumptons of Cox regresson usng cumulatve sums of martngale resdual processes So consder Cox's regresson model wth fxed covarates: α( t x) = α ( t)exp( β x) he model assumes: ) Log-lnearty: 2) Proportonal hazards: log{ α( t x)} = log{ α ( t)} + β x α( t x2) = exp{ β ( x2 x)} (ndependent of tme) α( t x ) For checkng log-lnearty,.e. f the k-th covarate has correct functonal form, we may consder ( ) = ( ) ˆ n k k = W x I x x M n = I( x x) N ( τ ) I( x x) exp ( βˆ x ) ( βˆ x ) k k = exp R l l R he two terms are the observed and expected number of falures for covarate values x 27 Illustraton for melanoma data wth ulceraton and tumor thckness (not log-transformed) as covarates If the model s correctly specfed, the test process should fluctuate around zero So «large» values ndcate that the covarate has a wrong functonal form But how large s «large»? umor thckness 28
8 Ln et al. (993) showed that f the model s correctly specfed, Wk ( x) s asymptotcally dstrubuted as a mean zero Gaussan process he lmtng dstrbuton s ntractable, but Ln et al. suggested a way to approxmate the dstrbuton usng Monte Carlo smulatons he trck s to consder an asymptotc approxmaton of Wk ( x) and to replace dm (t) n ths approxmaton by G dn (t) where the G 's are sampled from a standard normal dstrbuton (keepng the data fxed) 29 Cumulatve MG-resduals Plot of the observed test process together wth 5 smulated processes (assumng a correct model) umor thckness he computaton may be performed usng the tmereg package n R, cf. below and secton 6.2 n Martnussen & Scheke (Sprnger 26) he plot ndcates that the model predcts too many deaths for thn tumors o get a P-value we compare sup Wk ( x) wth smulated processes, gvng P=.64 3 For a model wth log thckness and ulceraton we get the followng result: Cumulatve MG-resduals -5 5 For checkng proportonal hazards, we for the k-th covarate consder ( ) ˆ = n n exp( βˆ x ) Uk t xk M ( t) = xk N ( t) xk ˆ = exp β x Illustraton for melanoma data wth log tumor thckness and ulceraton: prop(ulcer) ( ) = t R l l R prop(log2(thckn)) log tumor thckness Here the assumpton of a log-lnear effect seems fne (P=.3) 3 Cumulatve MG-resduals P= Cumulatve MG-resduals P= Years snce operaton Years snce operaton
9 Usng R For llustraton we contnue to use the melanoma data We wll use the tmereg package so ths needs to be nstalled and loaded # We frst consder a model wth ulceraton and thckness (not log-transformed) ft.ut=cox.aalen(surv(lfetme,status==)~prop(ulcer)+prop(thckn), data=melanoma, weghted.test=, resduals=,rate.sm=,n.sm=) #Check of log-lnearty resds.ut=cum.resduals(ft.ut,data=melanoma,cum.resd=) plot(resds.ut,score=2,xlab="umor thckness") summary(resds.ut) # We then check log-lnearty and proportonal hazards for a model wth log-transformed thckness ft.ult=cox.aalen(surv(lfetme,status==)~prop(ulcer)+prop(log2(thckn)), data=melanoma, weghted.test=, resduals=,rate.sm=,n.sm=) resds.ult=cum.resduals(ft.ult,data=melanoma,cum.resd=) plot(resds.ult,score=2,xlab="log tumor thckness") summary(resds.ult) par(mfrow=c(,2)) plot(ft.ult,score=,xlab="years snce operaton") summary(ft.ult) Stratfed models So far we have assumed a common baselne hazard for all ndvduals,.e. α ( t x ) = α ) (, x ( t)) ( t rβ When ths s not a realstc assumpton, one may adopt a stratfed verson of the model hen the study popolaton s grouped nto k strata, and for an ndvdual n stratum s we assume that the hazard takes the form: α ( t x,stratu m s) = α ( t) r( β, x ( t)) Note that the effects of the covarates are assumed to be the same accross strata, whle the baselne hazard may vary between strata s We now estmate β by maxmzng the partal lkelhood We may estmate the stratum-specfc cumulatve baselne hazards t A ( t) ( u) du = α s s where are the observed event tmes n by the Breslow estmators stratum s and s the rsk set n ths stratum at tme s he maxmum partal lkelhood estmator enoys smlar propertes as for the stuaton wthout stratfcaton and statstcal test may be performed as before 35 As before these provde the bass for estmatng cumulatve hazards and survval functons for gven values of fxed covarates (or gven paths of external tme-varyng covarates) 36
10 Usng R For llustraton we contnue to use the melanoma data # We ft a model where we stratfy on ulceraton use log-thckness as covarate ft.strat=coxph(surv(lfetme,status==)~log2(thckn)+strata(ulcer), data=melanoma) summary(ft.strat) # We may plot the cumuatve baselne hazards for the two ulceraton strata: baselne.covar=data.frame(thckn=) surv.strat=survft(ft.strat,newdata=baselne.covar) plot(surv.strat,fun="cumhaz", mark.tme=f,xlm=c(,), xlab="years snce operaton",ylab="cumulatve hazard",lty=:2) 37
Maximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models
ECO 452 -- OE 4: Probt and Logt Models ECO 452 -- OE 4 Maxmum Lkelhood Estmaton of Bnary Dependent Varables Models: Probt and Logt hs note demonstrates how to formulate bnary dependent varables models
More informationPredictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore
Sesson Outlne Introducton to classfcaton problems and dscrete choce models. Introducton to Logstcs Regresson. Logstc functon and Logt functon. Maxmum Lkelhood Estmator (MLE) for estmaton of LR parameters.
More informationx i1 =1 for all i (the constant ).
Chapter 5 The Multple Regresson Model Consder an economc model where the dependent varable s a functon of K explanatory varables. The economc model has the form: y = f ( x,x,..., ) xk Approxmate ths by
More informationComparison of Regression Lines
STATGRAPHICS Rev. 9/13/2013 Comparson of Regresson Lnes Summary... 1 Data Input... 3 Analyss Summary... 4 Plot of Ftted Model... 6 Condtonal Sums of Squares... 6 Analyss Optons... 7 Forecasts... 8 Confdence
More informationLinear Approximation with Regularization and Moving Least Squares
Lnear Approxmaton wth Regularzaton and Movng Least Squares Igor Grešovn May 007 Revson 4.6 (Revson : March 004). 5 4 3 0.5 3 3.5 4 Contents: Lnear Fttng...4. Weghted Least Squares n Functon Approxmaton...
More informationANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U)
Econ 413 Exam 13 H ANSWERS Settet er nndelt 9 deloppgaver, A,B,C, som alle anbefales å telle lkt for å gøre det ltt lettere å stå. Svar er gtt . Unfortunately, there s a prntng error n the hnt of
More informationComposite Hypotheses testing
Composte ypotheses testng In many hypothess testng problems there are many possble dstrbutons that can occur under each of the hypotheses. The output of the source s a set of parameters (ponts n a parameter
More information1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands
Content. Inference on Regresson Parameters a. Fndng Mean, s.d and covarance amongst estmates.. Confdence Intervals and Workng Hotellng Bands 3. Cochran s Theorem 4. General Lnear Testng 5. Measures of
More informationChapter 13: Multiple Regression
Chapter 13: Multple Regresson 13.1 Developng the multple-regresson Model The general model can be descrbed as: It smplfes for two ndependent varables: The sample ft parameter b 0, b 1, and b are used to
More informationDiagnostics in Poisson Regression. Models - Residual Analysis
Dagnostcs n Posson Regresson Models - Resdual Analyss 1 Outlne Dagnostcs n Posson Regresson Models - Resdual Analyss Example 3: Recall of Stressful Events contnued 2 Resdual Analyss Resduals represent
More informationBIO Lab 2: TWO-LEVEL NORMAL MODELS with school children popularity data
Lab : TWO-LEVEL NORMAL MODELS wth school chldren popularty data Purpose: Introduce basc two-level models for normally dstrbuted responses usng STATA. In partcular, we dscuss Random ntercept models wthout
More informationMaximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models
ECO 452 -- OE 4: Probt and Logt Models ECO 452 -- OE 4 Mamum Lkelhood Estmaton of Bnary Dependent Varables Models: Probt and Logt hs note demonstrates how to formulate bnary dependent varables models for
More informationLinear Regression Analysis: Terminology and Notation
ECON 35* -- Secton : Basc Concepts of Regresson Analyss (Page ) Lnear Regresson Analyss: Termnology and Notaton Consder the generc verson of the smple (two-varable) lnear regresson model. It s represented
More informationChapter 11: Simple Linear Regression and Correlation
Chapter 11: Smple Lnear Regresson and Correlaton 11-1 Emprcal Models 11-2 Smple Lnear Regresson 11-3 Propertes of the Least Squares Estmators 11-4 Hypothess Test n Smple Lnear Regresson 11-4.1 Use of t-tests
More informationSTAT 3008 Applied Regression Analysis
STAT 3008 Appled Regresson Analyss Tutoral : Smple Lnear Regresson LAI Chun He Department of Statstcs, The Chnese Unversty of Hong Kong 1 Model Assumpton To quantfy the relatonshp between two factors,
More informationDr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur
Analyss of Varance and Desgn of Experment-I MODULE VII LECTURE - 3 ANALYSIS OF COVARIANCE Dr Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur Any scentfc experment s performed
More informationNegative Binomial Regression
STATGRAPHICS Rev. 9/16/2013 Negatve Bnomal Regresson Summary... 1 Data Input... 3 Statstcal Model... 3 Analyss Summary... 4 Analyss Optons... 7 Plot of Ftted Model... 8 Observed Versus Predcted... 10 Predctons...
More informationLecture 6: Introduction to Linear Regression
Lecture 6: Introducton to Lnear Regresson An Manchakul amancha@jhsph.edu 24 Aprl 27 Lnear regresson: man dea Lnear regresson can be used to study an outcome as a lnear functon of a predctor Example: 6
More informationStat 642, Lecture notes for 01/27/ d i = 1 t. n i t nj. n j
Stat 642, Lecture notes for 01/27/05 18 Rate Standardzaton Contnued: Note that f T n t where T s the cumulatve follow-up tme and n s the number of subjects at rsk at the mdpont or nterval, and d s the
More information2E Pattern Recognition Solutions to Introduction to Pattern Recognition, Chapter 2: Bayesian pattern classification
E395 - Pattern Recognton Solutons to Introducton to Pattern Recognton, Chapter : Bayesan pattern classfcaton Preface Ths document s a soluton manual for selected exercses from Introducton to Pattern Recognton
More informationNested case-control and case-cohort studies
Outne: Nested case-contro and case-cohort studes Ørnuf Borgan Department of Mathematcs Unversty of Oso NORBIS course Unversty of Oso 4-8 December 217 1 Radaton and breast cancer data Nested case contro
More informationLogistic regression with one predictor. STK4900/ Lecture 7. Program
Logstc regresson wth one redctor STK49/99 - Lecture 7 Program. Logstc regresson wth one redctor 2. Maxmum lkelhood estmaton 3. Logstc regresson wth several redctors 4. Devance and lkelhood rato tests 5.
More informationChapter 20 Duration Analysis
Chapter 20 Duraton Analyss Duraton: tme elapsed untl a certan event occurs (weeks unemployed, months spent on welfare). Survval analyss: duraton of nterest s survval tme of a subject, begn n an ntal state
More informationStatistics for Economics & Business
Statstcs for Economcs & Busness Smple Lnear Regresson Learnng Objectves In ths chapter, you learn: How to use regresson analyss to predct the value of a dependent varable based on an ndependent varable
More information( t) Cox regression part 2. Outline: Recapitulation. Estimation of cumulative hazards and survival probabilites. Ørnulf Borgan
Outline: Cox regression part 2 Ørnulf Borgan Department of Mathematics University of Oslo Recapitulation Estimation of cumulative hazards and survival probabilites Assumptions for Cox regression and check
More informationSee Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition)
Count Data Models See Book Chapter 11 2 nd Edton (Chapter 10 1 st Edton) Count data consst of non-negatve nteger values Examples: number of drver route changes per week, the number of trp departure changes
More informationDO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR. Introductory Econometrics 1 hour 30 minutes
25/6 Canddates Only January Examnatons 26 Student Number: Desk Number:...... DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR Department Module Code Module Ttle Exam Duraton
More informationTests of Single Linear Coefficient Restrictions: t-tests and F-tests. 1. Basic Rules. 2. Testing Single Linear Coefficient Restrictions
ECONOMICS 35* -- NOTE ECON 35* -- NOTE Tests of Sngle Lnear Coeffcent Restrctons: t-tests and -tests Basc Rules Tests of a sngle lnear coeffcent restrcton can be performed usng ether a two-taled t-test
More informationEcon107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4)
I. Classcal Assumptons Econ7 Appled Econometrcs Topc 3: Classcal Model (Studenmund, Chapter 4) We have defned OLS and studed some algebrac propertes of OLS. In ths topc we wll study statstcal propertes
More informationPsychology 282 Lecture #24 Outline Regression Diagnostics: Outliers
Psychology 282 Lecture #24 Outlne Regresson Dagnostcs: Outlers In an earler lecture we studed the statstcal assumptons underlyng the regresson model, ncludng the followng ponts: Formal statement of assumptons.
More informationProfessor Chris Murray. Midterm Exam
Econ 7 Econometrcs Sprng 4 Professor Chrs Murray McElhnney D cjmurray@uh.edu Mdterm Exam Wrte your answers on one sde of the blank whte paper that I have gven you.. Do not wrte your answers on ths exam.
More informationJanuary Examinations 2015
24/5 Canddates Only January Examnatons 25 DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR STUDENT CANDIDATE NO.. Department Module Code Module Ttle Exam Duraton (n words)
More information1 Binary Response Models
Bnary and Ordered Multnomal Response Models Dscrete qualtatve response models deal wth dscrete dependent varables. bnary: yes/no, partcpaton/non-partcpaton lnear probablty model LPM, probt or logt models
More informationsince [1-( 0+ 1x1i+ 2x2 i)] [ 0+ 1x1i+ assumed to be a reasonable approximation
Econ 388 R. Butler 204 revsons Lecture 4 Dummy Dependent Varables I. Lnear Probablty Model: the Regresson model wth a dummy varables as the dependent varable assumpton, mplcaton regular multple regresson
More informationDepartment of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6
Department of Quanttatve Methods & Informaton Systems Tme Seres and Ther Components QMIS 30 Chapter 6 Fall 00 Dr. Mohammad Zanal These sldes were modfed from ther orgnal source for educatonal purpose only.
More informationECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics
ECOOMICS 35*-A Md-Term Exam -- Fall Term 000 Page of 3 pages QUEE'S UIVERSITY AT KIGSTO Department of Economcs ECOOMICS 35* - Secton A Introductory Econometrcs Fall Term 000 MID-TERM EAM ASWERS MG Abbott
More informationLimited Dependent Variables and Panel Data. Tibor Hanappi
Lmted Dependent Varables and Panel Data Tbor Hanapp 30.06.2010 Lmted Dependent Varables Dscrete: Varables that can take onl a countable number of values Censored/Truncated: Data ponts n some specfc range
More informationSTAT 511 FINAL EXAM NAME Spring 2001
STAT 5 FINAL EXAM NAME Sprng Instructons: Ths s a closed book exam. No notes or books are allowed. ou may use a calculator but you are not allowed to store notes or formulas n the calculator. Please wrte
More informationUNIVERSITY OF TORONTO Faculty of Arts and Science. December 2005 Examinations STA437H1F/STA1005HF. Duration - 3 hours
UNIVERSITY OF TORONTO Faculty of Arts and Scence December 005 Examnatons STA47HF/STA005HF Duraton - hours AIDS ALLOWED: (to be suppled by the student) Non-programmable calculator One handwrtten 8.5'' x
More informationNon-Mixture Cure Model for Interval Censored Data: Simulation Study ABSTRACT
Malaysan Journal of Mathematcal Scences 8(S): 37-44 (2014) Specal Issue: Internatonal Conference on Mathematcal Scences and Statstcs 2013 (ICMSS2013) MALAYSIAN JOURNAL OF MATHEMATICAL SCIENCES Journal
More informationLecture 3 Stat102, Spring 2007
Lecture 3 Stat0, Sprng 007 Chapter 3. 3.: Introducton to regresson analyss Lnear regresson as a descrptve technque The least-squares equatons Chapter 3.3 Samplng dstrbuton of b 0, b. Contnued n net lecture
More informationTHE ROYAL STATISTICAL SOCIETY 2006 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE
THE ROYAL STATISTICAL SOCIETY 6 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE PAPER I STATISTICAL THEORY The Socety provdes these solutons to assst canddates preparng for the eamnatons n future years and for
More informationComputation of Higher Order Moments from Two Multinomial Overdispersion Likelihood Models
Computaton of Hgher Order Moments from Two Multnomal Overdsperson Lkelhood Models BY J. T. NEWCOMER, N. K. NEERCHAL Department of Mathematcs and Statstcs, Unversty of Maryland, Baltmore County, Baltmore,
More informationLINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity
LINEAR REGRESSION ANALYSIS MODULE IX Lecture - 30 Multcollnearty Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur 2 Remedes for multcollnearty Varous technques have
More informationEconomics 130. Lecture 4 Simple Linear Regression Continued
Economcs 130 Lecture 4 Contnued Readngs for Week 4 Text, Chapter and 3. We contnue wth addressng our second ssue + add n how we evaluate these relatonshps: Where do we get data to do ths analyss? How do
More informationMLE and Bayesian Estimation. Jie Tang Department of Computer Science & Technology Tsinghua University 2012
MLE and Bayesan Estmaton Je Tang Department of Computer Scence & Technology Tsnghua Unversty 01 1 Lnear Regresson? As the frst step, we need to decde how we re gong to represent the functon f. One example:
More informationChapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise.
Chapter - The Smple Lnear Regresson Model The lnear regresson equaton s: where y + = β + β e for =,..., y and are observable varables e s a random error How can an estmaton rule be constructed for the
More informationLINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity
LINEAR REGRESSION ANALYSIS MODULE IX Lecture - 31 Multcollnearty Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur 6. Rdge regresson The OLSE s the best lnear unbased
More informationQUASI-LIKELIHOOD APPROACH TO RATER AGREEMENT PLUS LINEAR BY LINEAR ASSOCIATION MODEL FOR ORDINAL CONTINGENCY TABLES
Journal of Statstcs: Advances n Theory and Applcatons Volume 6, Number, 26, Pages -5 Avalable at http://scentfcadvances.co.n DOI: http://dx.do.org/.8642/jsata_72683 QUASI-LIKELIHOOD APPROACH TO RATER AGREEMENT
More informationLecture Notes on Linear Regression
Lecture Notes on Lnear Regresson Feng L fl@sdueducn Shandong Unversty, Chna Lnear Regresson Problem In regresson problem, we am at predct a contnuous target value gven an nput feature vector We assume
More informationDurban Watson for Testing the Lack-of-Fit of Polynomial Regression Models without Replications
Durban Watson for Testng the Lack-of-Ft of Polynomal Regresson Models wthout Replcatons Ruba A. Alyaf, Maha A. Omar, Abdullah A. Al-Shha ralyaf@ksu.edu.sa, maomar@ksu.edu.sa, aalshha@ksu.edu.sa Department
More informationStatistics for Business and Economics
Statstcs for Busness and Economcs Chapter 11 Smple Regresson Copyrght 010 Pearson Educaton, Inc. Publshng as Prentce Hall Ch. 11-1 11.1 Overvew of Lnear Models n An equaton can be ft to show the best lnear
More informationLecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding
Recall: man dea of lnear regresson Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 8 Lnear regresson can be used to study an
More informationBasically, if you have a dummy dependent variable you will be estimating a probability.
ECON 497: Lecture Notes 13 Page 1 of 1 Metropoltan State Unversty ECON 497: Research and Forecastng Lecture Notes 13 Dummy Dependent Varable Technques Studenmund Chapter 13 Bascally, f you have a dummy
More informationLecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding
Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 008 Recall: man dea of lnear regresson Lnear regresson can be used to study
More informationEconometrics of Panel Data
Econometrcs of Panel Data Jakub Mućk Meetng # 8 Jakub Mućk Econometrcs of Panel Data Meetng # 8 1 / 17 Outlne 1 Heterogenety n the slope coeffcents 2 Seemngly Unrelated Regresson (SUR) 3 Swamy s random
More informationConvergence of random processes
DS-GA 12 Lecture notes 6 Fall 216 Convergence of random processes 1 Introducton In these notes we study convergence of dscrete random processes. Ths allows to characterze phenomena such as the law of large
More information4.1. Lecture 4: Fitting distributions: goodness of fit. Goodness of fit: the underlying principle
Lecture 4: Fttng dstrbutons: goodness of ft Goodness of ft Testng goodness of ft Testng normalty An mportant note on testng normalty! L4.1 Goodness of ft measures the extent to whch some emprcal dstrbuton
More informationChapter 12 Analysis of Covariance
Chapter Analyss of Covarance Any scentfc experment s performed to know somethng that s unknown about a group of treatments and to test certan hypothess about the correspondng treatment effect When varablty
More informationSTAT 405 BIOSTATISTICS (Fall 2016) Handout 15 Introduction to Logistic Regression
STAT 45 BIOSTATISTICS (Fall 26) Handout 5 Introducton to Logstc Regresson Ths handout covers materal found n Secton 3.7 of your text. You may also want to revew regresson technques n Chapter. In ths handout,
More informationResource Allocation and Decision Analysis (ECON 8010) Spring 2014 Foundations of Regression Analysis
Resource Allocaton and Decson Analss (ECON 800) Sprng 04 Foundatons of Regresson Analss Readng: Regresson Analss (ECON 800 Coursepak, Page 3) Defntons and Concepts: Regresson Analss statstcal technques
More information4.3 Poisson Regression
of teratvely reweghted least squares regressons (the IRLS algorthm). We do wthout gvng further detals, but nstead focus on the practcal applcaton. > glm(survval~log(weght)+age, famly="bnomal", data=baby)
More informationAPPROXIMATE PRICES OF BASKET AND ASIAN OPTIONS DUPONT OLIVIER. Premia 14
APPROXIMAE PRICES OF BASKE AND ASIAN OPIONS DUPON OLIVIER Prema 14 Contents Introducton 1 1. Framewor 1 1.1. Baset optons 1.. Asan optons. Computng the prce 3. Lower bound 3.1. Closed formula for the prce
More informationMarginal Effects in Probit Models: Interpretation and Testing. 1. Interpreting Probit Coefficients
ECON 5 -- NOE 15 Margnal Effects n Probt Models: Interpretaton and estng hs note ntroduces you to the two types of margnal effects n probt models: margnal ndex effects, and margnal probablty effects. It
More information4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA
4 Analyss of Varance (ANOVA) 5 ANOVA 51 Introducton ANOVA ANOVA s a way to estmate and test the means of multple populatons We wll start wth one-way ANOVA If the populatons ncluded n the study are selected
More informationECONOMETRICS II (ECO 2401S) University of Toronto. Department of Economics. Winter 2017 Instructor: Victor Aguirregabiria
ECOOMETRICS II ECO 40S Unversty of Toronto Department of Economcs Wnter 07 Instructor: Vctor Agurregabra SOLUTIO TO FIAL EXAM Tuesday, Aprl 8, 07 From :00pm-5:00pm 3 hours ISTRUCTIOS: - Ths s a closed-book
More informationA Comparative Study for Estimation Parameters in Panel Data Model
A Comparatve Study for Estmaton Parameters n Panel Data Model Ahmed H. Youssef and Mohamed R. Abonazel hs paper examnes the panel data models when the regresson coeffcents are fxed random and mxed and
More informationIntroduction to Generalized Linear Models
INTRODUCTION TO STATISTICAL MODELLING TRINITY 00 Introducton to Generalzed Lnear Models I. Motvaton In ths lecture we extend the deas of lnear regresson to the more general dea of a generalzed lnear model
More information/ n ) are compared. The logic is: if the two
STAT C141, Sprng 2005 Lecture 13 Two sample tests One sample tests: examples of goodness of ft tests, where we are testng whether our data supports predctons. Two sample tests: called as tests of ndependence
More informationChapter 9: Statistical Inference and the Relationship between Two Variables
Chapter 9: Statstcal Inference and the Relatonshp between Two Varables Key Words The Regresson Model The Sample Regresson Equaton The Pearson Correlaton Coeffcent Learnng Outcomes After studyng ths chapter,
More informationLimited Dependent Variables
Lmted Dependent Varables. What f the left-hand sde varable s not a contnuous thng spread from mnus nfnty to plus nfnty? That s, gven a model = f (, β, ε, where a. s bounded below at zero, such as wages
More informationSample Size Calculation Based on the Semiparametric Analysis of Short-term and Long-term Hazard Ratios. Yi Wang
Sample Sze Calculaton Based on the Semparametrc Analyss of Short-term and Long-term Hazard Ratos Y Wang Submtted n partal fulfllment of the requrements for the degree of Doctor of Phlosophy under the Executve
More informationRELIABILITY ASSESSMENT
CHAPTER Rsk Analyss n Engneerng and Economcs RELIABILITY ASSESSMENT A. J. Clark School of Engneerng Department of Cvl and Envronmental Engneerng 4a CHAPMAN HALL/CRC Rsk Analyss for Engneerng Department
More informationProperties of Least Squares
Week 3 3.1 Smple Lnear Regresson Model 3. Propertes of Least Squares Estmators Y Y β 1 + β X + u weekly famly expendtures X weekly famly ncome For a gven level of x, the expected level of food expendtures
More informationParameters Estimation of the Modified Weibull Distribution Based on Type I Censored Samples
Appled Mathematcal Scences, Vol. 5, 011, no. 59, 899-917 Parameters Estmaton of the Modfed Webull Dstrbuton Based on Type I Censored Samples Soufane Gasm École Supereure des Scences et Technques de Tuns
More informationThe Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction
ECONOMICS 5* -- NOTE (Summary) ECON 5* -- NOTE The Multple Classcal Lnear Regresson Model (CLRM): Specfcaton and Assumptons. Introducton CLRM stands for the Classcal Lnear Regresson Model. The CLRM s also
More informationHere is the rationale: If X and y have a strong positive relationship to one another, then ( x x) will tend to be positive when ( y y)
Secton 1.5 Correlaton In the prevous sectons, we looked at regresson and the value r was a measurement of how much of the varaton n y can be attrbuted to the lnear relatonshp between y and x. In ths secton,
More informationStatistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation
Statstcs for Managers Usng Mcrosoft Excel/SPSS Chapter 13 The Smple Lnear Regresson Model and Correlaton 1999 Prentce-Hall, Inc. Chap. 13-1 Chapter Topcs Types of Regresson Models Determnng the Smple Lnear
More informationDepartment of Statistics University of Toronto STA305H1S / 1004 HS Design and Analysis of Experiments Term Test - Winter Solution
Department of Statstcs Unversty of Toronto STA35HS / HS Desgn and Analyss of Experments Term Test - Wnter - Soluton February, Last Name: Frst Name: Student Number: Instructons: Tme: hours. Ads: a non-programmable
More information3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X
Statstcs 1: Probablty Theory II 37 3 EPECTATION OF SEVERAL RANDOM VARIABLES As n Probablty Theory I, the nterest n most stuatons les not on the actual dstrbuton of a random vector, but rather on a number
More information[ ] λ λ λ. Multicollinearity. multicollinearity Ragnar Frisch (1934) perfect exact. collinearity. multicollinearity. exact
Multcollnearty multcollnearty Ragnar Frsch (934 perfect exact collnearty multcollnearty K exact λ λ λ K K x+ x+ + x 0 0.. λ, λ, λk 0 0.. x perfect ntercorrelated λ λ λ x+ x+ + KxK + v 0 0.. v 3 y β + β
More informationGlobal Sensitivity. Tuesday 20 th February, 2018
Global Senstvty Tuesday 2 th February, 28 ) Local Senstvty Most senstvty analyses [] are based on local estmates of senstvty, typcally by expandng the response n a Taylor seres about some specfc values
More informationFirst Year Examination Department of Statistics, University of Florida
Frst Year Examnaton Department of Statstcs, Unversty of Florda May 7, 010, 8:00 am - 1:00 noon Instructons: 1. You have four hours to answer questons n ths examnaton.. You must show your work to receve
More informationChapter 15 - Multiple Regression
Chapter - Multple Regresson Chapter - Multple Regresson Multple Regresson Model The equaton that descrbes how the dependent varable y s related to the ndependent varables x, x,... x p and an error term
More information9. Binary Dependent Variables
9. Bnar Dependent Varables 9. Homogeneous models Log, prob models Inference Tax preparers 9.2 Random effects models 9.3 Fxed effects models 9.4 Margnal models and GEE Appendx 9A - Lkelhood calculatons
More informationBasic Business Statistics, 10/e
Chapter 13 13-1 Basc Busness Statstcs 11 th Edton Chapter 13 Smple Lnear Regresson Basc Busness Statstcs, 11e 009 Prentce-Hall, Inc. Chap 13-1 Learnng Objectves In ths chapter, you learn: How to use regresson
More informationwhere I = (n x n) diagonal identity matrix with diagonal elements = 1 and off-diagonal elements = 0; and σ 2 e = variance of (Y X).
11.4.1 Estmaton of Multple Regresson Coeffcents In multple lnear regresson, we essentally solve n equatons for the p unnown parameters. hus n must e equal to or greater than p and n practce n should e
More informationMultivariate Ratio Estimator of the Population Total under Stratified Random Sampling
Open Journal of Statstcs, 0,, 300-304 ttp://dx.do.org/0.436/ojs.0.3036 Publsed Onlne July 0 (ttp://www.scrp.org/journal/ojs) Multvarate Rato Estmator of te Populaton Total under Stratfed Random Samplng
More informationInfluence Diagnostics on Competing Risks Using Cox s Model with Censored Data. Jalan Gombak, 53100, Kuala Lumpur, Malaysia.
Proceedngs of the 8th WSEAS Internatonal Conference on APPLIED MAHEMAICS, enerfe, Span, December 16-18, 5 (pp14-138) Influence Dagnostcs on Competng Rsks Usng Cox s Model wth Censored Data F. A. M. Elfak
More informationOutline. Recall that Aalen additive hazards model and the semiparametric version
Outlne Clustered survval data (addtve models) Addtve-Multplcatve hazards model Advanced Survval Analyss 21 Copenhagen Gulana Cortese gco@bostat.ku.dk Clustered survval data Margnal addtve models Addtve-Multplcatve
More informationChapter 6. Supplemental Text Material
Chapter 6. Supplemental Text Materal S6-. actor Effect Estmates are Least Squares Estmates We have gven heurstc or ntutve explanatons of how the estmates of the factor effects are obtaned n the textboo.
More information28. SIMPLE LINEAR REGRESSION III
8. SIMPLE LINEAR REGRESSION III Ftted Values and Resduals US Domestc Beers: Calores vs. % Alcohol To each observed x, there corresponds a y-value on the ftted lne, y ˆ = βˆ + βˆ x. The are called ftted
More informatione i is a random error
Chapter - The Smple Lnear Regresson Model The lnear regresson equaton s: where + β + β e for,..., and are observable varables e s a random error How can an estmaton rule be constructed for the unknown
More informationStatistics II Final Exam 26/6/18
Statstcs II Fnal Exam 26/6/18 Academc Year 2017/18 Solutons Exam duraton: 2 h 30 mn 1. (3 ponts) A town hall s conductng a study to determne the amount of leftover food produced by the restaurants n the
More informationChapter 7 Generalized and Weighted Least Squares Estimation. In this method, the deviation between the observed and expected values of
Chapter 7 Generalzed and Weghted Least Squares Estmaton The usual lnear regresson model assumes that all the random error components are dentcally and ndependently dstrbuted wth constant varance. When
More informationParametric fractional imputation for missing data analysis. Jae Kwang Kim Survey Working Group Seminar March 29, 2010
Parametrc fractonal mputaton for mssng data analyss Jae Kwang Km Survey Workng Group Semnar March 29, 2010 1 Outlne Introducton Proposed method Fractonal mputaton Approxmaton Varance estmaton Multple mputaton
More information8/25/17. Data Modeling. Data Modeling. Data Modeling. Patrice Koehl Department of Biological Sciences National University of Singapore
8/5/17 Data Modelng Patrce Koehl Department of Bologcal Scences atonal Unversty of Sngapore http://www.cs.ucdavs.edu/~koehl/teachng/bl59 koehl@cs.ucdavs.edu Data Modelng Ø Data Modelng: least squares Ø
More informationLINEAR REGRESSION ANALYSIS. MODULE VIII Lecture Indicator Variables
LINEAR REGRESSION ANALYSIS MODULE VIII Lecture - 7 Indcator Varables Dr. Shalabh Department of Maematcs and Statstcs Indan Insttute of Technology Kanpur Indcator varables versus quanttatve explanatory
More informationDr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur
Analyss of Varance and Desgn of Exerments-I MODULE III LECTURE - 2 EXPERIMENTAL DESIGN MODELS Dr. Shalabh Deartment of Mathematcs and Statstcs Indan Insttute of Technology Kanur 2 We consder the models
More informationHydrological statistics. Hydrological statistics and extremes
5--0 Stochastc Hydrology Hydrologcal statstcs and extremes Marc F.P. Berkens Professor of Hydrology Faculty of Geoscences Hydrologcal statstcs Mostly concernes wth the statstcal analyss of hydrologcal
More information