AFT Models and Empirical Likelihood


 Alisha Skinner
 1 years ago
 Views:
Transcription
1 AFT Models and Empirical Likelihood Mai Zhou Department of Statistics, University of Kentucky Collaborators: Gang Li (UCLA); A. Bathke; M. Kim (Kentucky)
2 Accelerated Failure Time (AFT) models: Y = log(t ) = Xβ + ɛ. The responses T, log(t ) or Y are subject to right censoring. We study the use of empirical likelihood method (Owen 2001) for inference about β.
3 Owen (1991) studied the use of empirical likelihood with linear models. (no censoring) He defines the Empirical Likelihood as: EL = n i=1 p i ; where pi = 1; p i 0. Is this a likelihood of the Y s or is this the likelihood for the ɛ s? Either interpretation is plausible here.
4 Owen (1991) identified two different linear models from the viewpoint of Empirical Likelihood (EL): regression models and correlation models, which came from Freedman (1981) bootstrap study of linear models.
5 Correlation Model: We observe iid random vectors (Y i, X i ); i = 1,..., n where Y i are one dimensional responses, X i are kdimensional covariates. Unknown parameter β is the minimizer of g(b) = E Y X T b 2.
6 Regression Model: The covariates x i ; i = 1,..., n are fixed constants, observable kdimensional vectors. Y i are independent random variables with distributions having location parameters x t i β.
7 The EL for the two types of models appears identical in Owen (1991) as given above, but have different interpretations and require different proofs for the generalized Wilks theorem under different assumptions. Owen (1991) showed the EL Wilks Theorem holds for both models: 2 log max EL 1/n χ 2 where the max of EL is taken over p i such that 0 = X i ψ(y i X i β 0 )p i.
8 Notice for a given (noncensored) data set, under either regression or correlation model the two EL ratios have the same value and the least squares estimator ˆβ is also the same under either model. So the two different models only pertain to two different sets of assumptions, under which the EL Wilks theorem hold.
9 With censored responses, not all Y i are available. Estimation equations, as well as the empirical likelihood needs to be changed. Censored data: randomly right censoring: Q i = min(y i, C i ) ; δ i = I[Y i C i ] where C i are the censoring times.
10 We see two different ways of defining the empirical likelihood for censored data, corresponding to the two types of linear model: Define the empirical likelihood based on the censored residuals : e i (b) = Q i X i b; δ i : EL (res) = n i=1 (p i ) δ i 1 p j 1 δi. e j e i Notice the order in the summation is based on the resid
11 uals. This empirical likelihood is more suitable for regression models.
12 The second approach is to consider the empirical likelihood for (censored) vectors: (Q i, δ i, X i ): which we shall call it casewise EL. EL (case) = n i=1 (p i ) δ i 1 p j 1 δi. Q j Q i Notice the order in the summation of the EL is based on Q s. This empirical likelihood is more suitable for correlation models.
13 The two definitions of the EL is parallel to the two ways of bootstrapping in linear models: (1) resample the residuals e i, for the regression models. (2) resample the vectors or cases (Y i, X i ), for the correlation models. See for example Freedman (1981).
14 We consider two types of estimates ˆβ suitable for the two types of EL and AFT models: (1) BuckleyJames (1979) estimate. Rely on the fact that the errors (residuals) are iid. More suitable for the regression models. (2) Case weighted estimators (Koul, Susarla, Van Ryzin (1982) [not 1981!], Zhou (1992), Stute (1993, 1994, 1999); Huang, Ma, Xie (2005) etc.) Idea is based on the
15 (iid) cases (Y i, X i ) or (Q i, δ i, X i ). correlation models. More suitable for the
16 Let us first consider the BuckleyJames estimator and EL (res). Let e i (b) = Q i X i b, be the (censored) residuals. The BuckleyJames estimator of β is the solution to the estimation equation 0 = n X i i=1 δ i e i (b) + (1 δ i ) e j (b) ˆF (e j ) j:e j >e i 1 ˆF (e i ),
17 where ˆF ( ) is the KaplanMeier estimator computed from (e i (b), δ i ). The expression in blue is an estimation of the conditional expectation, Ê(Y i X i b Q i, β = b, δ i = 0). There are two summation signs in the above equation. (one for index i one for index j). Exchange the order of the summations we have 0 = n j=1 δ j e j (b) X j + i:e i <e j,δ i =0 X i ˆF (e j ) 1 ˆF (e i ).
18 This form of BuckleyJames estimation equation will lead to the constraint equations we use with the censored empirical likelihood EL res.
19 The above defined EL (res) is to be maximized with and without the following added constraint equations (with b = β 0 ): 0 = n j=1 p j δ j e j (b) X j + i:e i <e j,δ i =0 X i ˆF (e j ) 1 ˆF (e i ) 1 ˆF (e j ). Notice the same p i appears in this constraint equations and in the definition of EL (res).
20 Without this constraint, the EL res is maximized at the KaplanMeier estimator, computed from e i (β 0 ), δ i. Under null hypothesis, i.e. when b = β 0, the 2logEL res ratio has asymptotically a chi squared distribution: 2 log max EL res max EL res χ 2 Proof See Zhou and Li (2004).
21 Simulation confirmation of the chi squared limit: QQ plot of 2 log ELR
22 QQ plot of 2 log ELR
23 QQ plot of 2 log ELR
24 QQ plot of 2 log ELR
25 Now let us consider the correlation model and the case weighted estimator.
26 Many authors have proposed and studied the caseweighted estimator, including large sample property of the estimator under various assumptions. The earliest reference I can find is Koul, Susarla and Van Ryzin (1982), the latest reference is Huang, Ma and Xie (2005). In between there are: Zhou (1992), Stute (1993, 1994, 1999), Gross and Lai (1996), and van der Laan and Robins (2003): in their book they call this inverse censoring probability weight : since ˆF (Q i ) = δ i 1 Ĝ(Q i ) = w i
27 The case weighted estimator is defined by the estimating equations (cf. Zhou 1992) 0 = n i=1 w i δ i X i ψ(q i X i b) where w i is the jump of the KaplanMeier estimator at Q i computed from (Q i, δ i ). ψ() is monotone, usually the derivative of ρ(). The estimator ˆβ is easy to obtain, no iteration needed.
28 The constraint equations to work with the EL case are 0 = n i=1 p i δ i X i ψ(q i X i b). The EL (case) is to be maximized with and with out the above constraint equations. We have a chi square limit theorem (Wilks): Under null hypothesis, we have 2 log max EL case max EL case χ 2
29 in distribution. In the denominator, the max of EL case occurs when p i = jumps of the KaplanMeier estimator based on Q i, δ i.
30 If we interprets the constraint equations as constraints on the marginal DF of Y, then (since the EL is also on the Y marginal), existing results apply.
31 The constraint equation can be thought of as a constraint on the marginal CDF of the Y g(y, x)df Y (y) = 0
32 R Censored Data AFT Models Regression Model BuckleyJames Est. EL(residual) Wilks Theorem Correlation Model Weighted Est. EL(case) Wilks Theorem
33 Notice with censored data, the two estimators are different and the two EL are also different. Yet they both have EL ratio converge to chisquare distribution under certain conditions.
34 Simulation results for correlation AFT models: QQ plot of 2 log ELR
35 QQ plot of 2 log ELR
36 QQ plot of 2 log ELR
37 QQ plot of 2 log ELR
38 Less censoring: QQ plot of 2 log ELR
39 QQ plot of 2 log ELR
40 Quantile regression is an important model in econometrics (Koenker 2005) Quantile of Y i = βx i Estimate β by arg min n i=1 ρ(y i βx i ) where ρ is the so called check function.
41 Estimating equation for β 0 = n i=1 X i ψ(y i βx i ) where ψ needs to be defined carefully.
42 Median regression: ψ(t) = 1, 0 or 1 depend on t > 0, t = 0 or t < 0. Variance of the ˆβ is hard to estimate. Using EL, we do not need to estimate the variance of ˆβ.
43 QQ plot of 2 log ELR
44 QQ plot of 2 log ELR
45 QQ plot of 2 log ELR
46 QQ plot of 2 log ELR
47 QQ plot of 2 log ELR
48 25% Quantile regression: ψ(t) = 0.5, 0.5 or 1.5 depend on t > 0, t = 0 or t < 0. 75% Quantile regression: ψ(t) = 1.5, 0.5 or 0.5 depend on t > 0, t = 0 or t < 0. These functions equal to 2 times the derivative of the so called check functions defined in Koenker Quantile Regression. We take this definition so that the median regression ψ is the more commonly used 1/+1 function.
49 The (heteroscedastic) model we are simulating next is generated by Y = X + 0.5(1 + X)ɛ where ɛ is iid Normal(0,1). X is uniform (0,1) and Y are right censored by exp(1). For 25% quantile regression, the true regression line is Y = X
50 For 75% quantile regression, the true regression line is Y = X From the plot we see that the 25% quantile regression is less sensitive to the right censoring, compared to median regression and 75% quantile regression.
51 QQ plot of 2 log ELR
52 QQ plot of 2 log ELR
53 QQ plot of 2 log ELR
54 QQ plot of 2 log ELR
55 QQ plot of 2 log ELR
56 QQ plot of 2 log ELR
57 QQ plot of 2 log ELR
58 QQ plot of 2 log ELR
59 QQ plot of 2 log ELR
60 QQ plot of 2 log ELR
61 QQ plot of 2 log ELR
62 QQ plot of 2 log ELR
63 Example: Wei 1995) Smallcell Lung Cancer Data (Ying, Jung & log(t ) = β 0 + X 1 β 1 + X 2 β 2 + σ(x 1, X 2 )ɛ X 1 = indicator of treatment; X 2 = age at entry.
64 Contour plot of 2 log ELR for β 1 β 2 ; β 0 fixed.
65 Comments about heteroscedasticity. The correlation model can accommodate some heteroscedasticity. When Y is censored, X can be missing. Censoring: C independent of Y. (stronger than conditional independence usually assumed by regression model and BuckleyJames estimator). But BuckleyJames assumes iis error.
66 References Buckley, J. J. and James, I. R. (1979). Linear regression with censored data. Biometrika, Freedman, D.A. (1981). Bootstrapping regression models. Annals of Statist., 9, Gross, S. and Lai, T.L. (1996). Nonparametric estimation and regression analysis with left truncated and right censored data. J. Amer. Statist. Assoc. 91, Huang, J., Ma, S. and Xie, H. (2005) Least Absolute Deviations Estimation for the Accelerated Failure Time Model. Tech Report #350 Univ. of Iowa, Dept. Stat. and Actuarial Science. Koul, H., Susarla, V, and Van Ryzin, J. (1982). Least squares regression analysis with censored survival data. In Topics in Applied Statistics, Ed.Y.P. Chaubey and T.D. Dwivedi, pp Marcel Dekker, New York. Laan, M. van der, and Robins, J. M. (2003) Unified Methods for Censored Longitudinal Data and Causality. Springer, New York. Owen, A (1991), Empirical likelihood for linear models. Annals of Statist., 19, Owen, A. (2001). Empirical likelihood. Chapman & Hall, London. Stute, W. (1999). Nonlinear Censored Regression. Statistica Sinica, 9, 4,
67 Jin, Z, Lin, D. Y. and Ying, Z. (2006). On leastsquares regression with censored data Biometrika Zhou, M. (1992). Mestimation in censored linear models. Biometrika, 79, Zhou, M. and Li, G. (2004). Empirical likelihood analysis of the BuckleyJames estimator.
Empirical likelihood ratio with arbitrarily censored/truncated data by EM algorithm
Empirical likelihood ratio with arbitrarily censored/truncated data by EM algorithm Mai Zhou 1 University of Kentucky, Lexington, KY 40506 USA Summary. Empirical likelihood ratio method (Thomas and Grunkmier
More informationA COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky
A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky Empirical likelihood with right censored data were studied by Thomas and Grunkmier (1975), Li (1995),
More informationSTAT 331. Accelerated Failure Time Models. Previously, we have focused on multiplicative intensity models, where
STAT 331 Accelerated Failure Time Models Previously, we have focused on multiplicative intensity models, where h t z) = h 0 t) g z). These can also be expressed as H t z) = H 0 t) g z) or S t z) = e Ht
More informationFULL LIKELIHOOD INFERENCES IN THE COX MODEL
October 20, 2007 FULL LIKELIHOOD INFERENCES IN THE COX MODEL BY JIANJIAN REN 1 AND MAI ZHOU 2 University of Central Florida and University of Kentucky Abstract We use the empirical likelihood approach
More informationGoodnessofFit Tests for Time Series Models: A ScoreMarked Empirical Process Approach
GoodnessofFit Tests for Time Series Models: A ScoreMarked Empirical Process Approach By Shiqing Ling Department of Mathematics Hong Kong University of Science and Technology Let {y t : t = 0, ±1, ±2,
More informationNonparametric rank based estimation of bivariate densities given censored data conditional on marginal probabilities
Hutson Journal of Statistical Distributions and Applications (26 3:9 DOI.86/s4488647y RESEARCH Open Access Nonparametric rank based estimation of bivariate densities given censored data conditional
More informationUNIVERSITÄT POTSDAM Institut für Mathematik
UNIVERSITÄT POTSDAM Institut für Mathematik Testing the Acceleration Function in Life Time Models Hannelore Liero Matthias Liero Mathematische Statistik und Wahrscheinlichkeitstheorie Universität Potsdam
More informationMedian regression using inverse censoring weights
Median regression using inverse censoring weights Sundarraman Subramanian 1 Department of Mathematical Sciences New Jersey Institute of Technology Newark, New Jersey Gerhard Dikta Department of Applied
More informationUniversity of California, Berkeley
University of California, Berkeley U.C. Berkeley Division of Biostatistics Working Paper Series Year 2009 Paper 251 Nonparametric population average models: deriving the form of approximate population
More informationEstimation of Conditional Kendall s Tau for Bivariate Interval Censored Data
Communications for Statistical Applications and Methods 2015, Vol. 22, No. 6, 599 604 DOI: http://dx.doi.org/10.5351/csam.2015.22.6.599 Print ISSN 22877843 / Online ISSN 23834757 Estimation of Conditional
More informationTargeted Maximum Likelihood Estimation for Adaptive Designs: Adaptive Randomization in Community Randomized Trial
Targeted Maximum Likelihood Estimation for Adaptive Designs: Adaptive Randomization in Community Randomized Trial Mark J. van der Laan 1 University of California, Berkeley School of Public Health laan@berkeley.edu
More informationQuantile regression and heteroskedasticity
Quantile regression and heteroskedasticity José A. F. Machado J.M.C. Santos Silva June 18, 2013 Abstract This note introduces a wrapper for qreg which reports standard errors and t statistics that are
More informationFrom semi to nonparametric inference in general time scale models
From semi to nonparametric inference in general time scale models Thierry DUCHESNE duchesne@matulavalca Département de mathématiques et de statistique Université Laval Québec, Québec, Canada Research
More informationFull likelihood inferences in the Cox model: an empirical likelihood approach
Ann Inst Stat Math 2011) 63:1005 1018 DOI 10.1007/s104630100272y Full likelihood inferences in the Cox model: an empirical likelihood approach JianJian Ren Mai Zhou Received: 22 September 2008 / Revised:
More informationInvestigation of goodnessoffit test statistic distributions by random censored samples
d samples Investigation of goodnessoffit test statistic distributions by random censored samples Novosibirsk State Technical University November 22, 2010 d samples Outline 1 Nonparametric goodnessoffit
More informationEstimation and Inference of Quantile Regression. for Survival Data under Biased Sampling
Estimation and Inference of Quantile Regression for Survival Data under Biased Sampling Supplementary Materials: Proofs of the Main Results S1 Verification of the weight function v i (t) for the lengthbiased
More informationOutline of GLMs. Definitions
Outline of GLMs Definitions This is a short outline of GLM details, adapted from the book Nonparametric Regression and Generalized Linear Models, by Green and Silverman. The responses Y i have density
More informationNegative Multinomial Model and Cancer. Incidence
Generalized Linear Model under the Extended Negative Multinomial Model and Cancer Incidence S. Lahiri & Sunil K. Dhar Department of Mathematical Sciences, CAMS New Jersey Institute of Technology, Newar,
More informationEconomics 536 Lecture 7. Introduction to Specification Testing in Dynamic Econometric Models
University of Illinois Fall 2016 Department of Economics Roger Koenker Economics 536 Lecture 7 Introduction to Specification Testing in Dynamic Econometric Models In this lecture I want to briefly describe
More informationComputational treatment of the error distribution in nonparametric regression with rightcensored and selectionbiased data
Computational treatment of the error distribution in nonparametric regression with rightcensored and selectionbiased data Géraldine Laurent 1 and Cédric Heuchenne 2 1 QuantOM, HECManagement School of
More informationDiscussion of the paper Inference for Semiparametric Models: Some Questions and an Answer by Bickel and Kwon
Discussion of the paper Inference for Semiparametric Models: Some Questions and an Answer by Bickel and Kwon Jianqing Fan Department of Statistics Chinese University of Hong Kong AND Department of Statistics
More informationNONPARAMETRIC BAYES ESTIMATOR OF SURVIVAL FUNCTIONS FOR DOUBLY/INTERVAL CENSORED DATA
Statistica Sinica 14(2004, 533546 NONPARAMETRIC BAYES ESTIMATOR OF SURVIVAL FUNCTIONS FOR DOUBLY/INTERVAL CENSORED DATA Mai Zhou University of Kentucky Abstract: The nonparametric Bayes estimator with
More informationAnalysis of least absolute deviation
Analysis of least absolute deviation By KANI CHEN Department of Mathematics, HKUST, Kowloon, Hong Kong makchen@ust.hk ZHILIANG YING Department of Statistics, Columbia University, NY, NY, 10027, U.S.A.
More informationA Review on Empirical Likelihood Methods for Regression
Noname manuscript No. (will be inserted by the editor) A Review on Empirical Likelihood Methods for Regression Song Xi Chen Ingrid Van Keilegom Received: date / Accepted: date Abstract We provide a review
More informationLogistic regression model for survival time analysis using timevarying coefficients
Logistic regression model for survival time analysis using timevarying coefficients Accepted in American Journal of Mathematical and Management Sciences, 2016 Kenichi SATOH ksatoh@hiroshimau.ac.jp Research
More informationRank Regression Analysis of Multivariate Failure Time Data Based on Marginal Linear Models
doi: 10.1111/j.14679469.2005.00487.x Published by Blacwell Publishing Ltd, 9600 Garsington Road, Oxford OX4 2DQ, UK and 350 Main Street, Malden, MA 02148, USA Vol 33: 1 23, 2006 Ran Regression Analysis
More information[y i α βx i ] 2 (2) Q = i=1
Least squares fits This section has no probability in it. There are no random variables. We are given n points (x i, y i ) and want to find the equation of the line that best fits them. We take the equation
More informationINVERSE PROBABILITY WEIGHTED ESTIMATION FOR GENERAL MISSING DATA PROBLEMS
IVERSE PROBABILITY WEIGHTED ESTIMATIO FOR GEERAL MISSIG DATA PROBLEMS Jeffrey M. Wooldridge Department of Economics Michigan State University East Lansing, MI 488241038 (517) 3535972 wooldri1@msu.edu
More informationLecture 3. Truncation, lengthbias and prevalence sampling
Lecture 3. Truncation, lengthbias and prevalence sampling 3.1 Prevalent sampling Statistical techniques for truncated data have been integrated into survival analysis in last two decades. Truncation in
More informationSmoothed and Corrected Score Approach to Censored Quantile Regression With Measurement Errors
Smoothed and Corrected Score Approach to Censored Quantile Regression With Measurement Errors Yuanshan Wu, Yanyuan Ma, and Guosheng Yin Abstract Censored quantile regression is an important alternative
More informationasymptotic normality of nonparametric Mestimators with applications to hypothesis testing for panel count data
asymptotic normality of nonparametric Mestimators with applications to hypothesis testing for panel count data Xingqiu Zhao and Ying Zhang The Hong Kong Polytechnic University and Indiana University Abstract:
More informationBusiness Statistics. Lecture 10: Correlation and Linear Regression
Business Statistics Lecture 10: Correlation and Linear Regression Scatterplot A scatterplot shows the relationship between two quantitative variables measured on the same individuals. It displays the Form
More informationChapter 2 Inference on Mean Residual LifeOverview
Chapter 2 Inference on Mean Residual LifeOverview Statistical inference based on the remaining lifetimes would be intuitively more appealing than the popular hazard function defined as the risk of immediate
More informationOn leastsquares regression with censored data
Biometrika (2006), 93, 1, pp. 147 161 2006 Biometrika Trust Printed in Great Britain On leastsquares regression with censored data BY ZHEZHEN JIN Department of Biostatistics, Columbia University, New
More informationStatistics 135 Fall 2008 Final Exam
Name: SID: Statistics 135 Fall 2008 Final Exam Show your work. The number of points each question is worth is shown at the beginning of the question. There are 10 problems. 1. [2] The normal equations
More informationAN EMPIRICAL LIKELIHOOD RATIO TEST FOR NORMALITY
Econometrics Working Paper EWP0401 ISSN 14856441 Department of Economics AN EMPIRICAL LIKELIHOOD RATIO TEST FOR NORMALITY Lauren Bin Dong & David E. A. Giles Department of Economics, University of Victoria
More informationLecture 5 Models and methods for recurrent event data
Lecture 5 Models and methods for recurrent event data Recurrent and multiple events are commonly encountered in longitudinal studies. In this chapter we consider ordered recurrent and multiple events.
More informationA lackoffit test for quantile regression models with highdimensional covariates
A lackoffit test for quantile regression models with highdimensional covariates Mercedes CondeAmboage 1, César SánchezSellero 1,2 and Wenceslao GonzálezManteiga 1 arxiv:1502.05815v1 [stat.me] 20
More informationStatistical inference based on nonsmooth estimating functions
Biometrika (2004), 91, 4, pp. 943 954 2004 Biometrika Trust Printed in Great Britain Statistical inference based on nonsmooth estimating functions BY L. TIAN Department of Preventive Medicine, Northwestern
More informationSmooth nonparametric estimation of a quantile function under right censoring using beta kernels
Smooth nonparametric estimation of a quantile function under right censoring using beta kernels Chanseok Park 1 Department of Mathematical Sciences, Clemson University, Clemson, SC 29634 Short Title: Smooth
More informationPackage emplik. August 13, 2016
Maintainer Mai Zhou Version 1.03 Depends R (>= 3.0), quantreg Suggests KMsurv, boot Package emplik August 13, 2016 Author Mai Zhou. (Art Owen for el.test(). Yifan Yang for C code.) Empirical
More informationDivideandcombine Strategies in Statistical Modeling for Massive Data
Divideandcombine Strategies in Statistical Modeling for Massive Data Liqun Yu Washington University in St. Louis March 30, 2017 Liqun Yu (WUSTL) D&C Statistical Modeling for Massive Data March 30, 2017
More informationAsymmetric least squares estimation and testing
Asymmetric least squares estimation and testing Whitney Newey and James Powell Princeton University and University of WisconsinMadison January 27, 2012 Outline ALS estimators Large sample properties Asymptotic
More informationNotes largely based on Statistical Methods for Reliability Data by W.Q. Meeker and L. A. Escobar, Wiley, 1998 and on their class notes.
Unit 2: Models, Censoring, and Likelihood for FailureTime Data Notes largely based on Statistical Methods for Reliability Data by W.Q. Meeker and L. A. Escobar, Wiley, 1998 and on their class notes. Ramón
More informationHazard Function, Failure Rate, and A Rule of Thumb for Calculating Empirical Hazard Function of ContinuousTime Failure Data
Hazard Function, Failure Rate, and A Rule of Thumb for Calculating Empirical Hazard Function of ContinuousTime Failure Data Fengfeng Li,2, Gang Xie,2, Yong Sun,2, Lin Ma,2 CRC for Infrastructure and
More informationParameters Estimation for a Linear Exponential Distribution Based on Grouped Data
International Mathematical Forum, 3, 2008, no. 33, 16431654 Parameters Estimation for a Linear Exponential Distribution Based on Grouped Data A. Alkhedhairi Department of Statistics and O.R. Faculty
More informationTypical Survival Data Arising From a Clinical Trial. Censoring. The Survivor Function. Mathematical Definitions Introduction
Outline CHL 5225H Advanced Statistical Methods for Clinical Trials: Survival Analysis Prof. Kevin E. Thorpe Defining Survival Data Mathematical Definitions Nonparametric Estimates of Survival Comparing
More informationLikelihood Construction, Inference for Parametric Survival Distributions
Week 1 Likelihood Construction, Inference for Parametric Survival Distributions In this section we obtain the likelihood function for noninformatively rightcensored survival data and indicate how to make
More informationEconomics 536 Lecture 21 Counts, Tobit, Sample Selection, and Truncation
University of Illinois Fall 2016 Department of Economics Roger Koenker Economics 536 Lecture 21 Counts, Tobit, Sample Selection, and Truncation The simplest of this general class of models is Tobin s (1958)
More informationBayesian Inference on Joint Mixture Models for SurvivalLongitudinal Data with Multiple Features. Yangxin Huang
Bayesian Inference on Joint Mixture Models for SurvivalLongitudinal Data with Multiple Features Yangxin Huang Department of Epidemiology and Biostatistics, COPH, USF, Tampa, FL yhuang@health.usf.edu January
More informationEmpirical Likelihood Methods for PretestPosttest Studies
Empirical Likelihood Methods for PretestPosttest Studies by Min Chen A thesis presented to the University of Waterloo in fulfillment of the thesis requirement for the degree of Doctor of Philosophy in
More informationStat 5101 Lecture Notes
Stat 5101 Lecture Notes Charles J. Geyer Copyright 1998, 1999, 2000, 2001 by Charles J. Geyer May 7, 2001 ii Stat 5101 (Geyer) Course Notes Contents 1 Random Variables and Change of Variables 1 1.1 Random
More informationCURE MODEL WITH CURRENT STATUS DATA
Statistica Sinica 19 (2009), 233249 CURE MODEL WITH CURRENT STATUS DATA Shuangge Ma Yale University Abstract: Current status data arise when only random censoring time and event status at censoring are
More informationSTAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and QQ plots. March 8, 2015
STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and QQ plots March 8, 2015 The duality between CI and hypothesis testing The duality between CI and hypothesis
More informationLinear Regression (1/1/17)
STA613/CBB540: Statistical methods in computational biology Linear Regression (1/1/17) Lecturer: Barbara Engelhardt Scribe: Ethan Hada 1. Linear regression 1.1. Linear regression basics. Linear regression
More informationCourse 4 Solutions November 2001 Exams
Course 4 Solutions November 001 Exams November, 001 Society of Actuaries Question #1 From the YuleWalker equations: ρ φ + ρφ 1 1 1. 1 1+ ρ ρφ φ Substituting the given quantities yields: 0.53 φ + 0.53φ
More informationThe HodrickPrescott Filter
The HodrickPrescott Filter A Special Case of Penalized Spline Smoothing Alex Trindade Dept. of Mathematics & Statistics, Texas Tech University Joint work with Rob Paige, Missouri University of Science
More informationGARCH Models Estimation and Inference
GARCH Models Estimation and Inference Eduardo Rossi University of Pavia December 013 Rossi GARCH Financial Econometrics  013 1 / 1 Likelihood function The procedure most often used in estimating θ 0 in
More informationApplied linear statistical models: An overview
Applied linear statistical models: An overview Gunnar Stefansson 1 Dept. of Mathematics Univ. Iceland August 27, 2010 Outline Some basics Course: Applied linear statistical models This lecture: A description
More informationLecture 3: Multiple Regression
Lecture 3: Multiple Regression R.G. Pierse 1 The General Linear Model Suppose that we have k explanatory variables Y i = β 1 + β X i + β 3 X 3i + + β k X ki + u i, i = 1,, n (1.1) or Y i = β j X ji + u
More informationWhen Should We Use Linear Fixed Effects Regression Models for Causal Inference with Panel Data?
When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Panel Data? Kosuke Imai Department of Politics Center for Statistics and Machine Learning Princeton University Joint
More informationStat 642, Lecture notes for 04/12/05 96
Stat 642, Lecture notes for 04/12/05 96 HosmerLemeshow Statistic The HosmerLemeshow Statistic is another measure of lack of fit. Hosmer and Lemeshow recommend partitioning the observations into 10 equal
More informationReliability Engineering I
Happiness is taking the reliability final exam. Reliability Engineering I ENM/MSC 565 Review for the Final Exam Vital Statistics What R&M concepts covered in the course When Monday April 29 from 4:30 6:00
More informationWhen Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?
When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data? Kosuke Imai Department of Politics Center for Statistics and Machine Learning Princeton University
More informationSTAT331. Cox s Proportional Hazards Model
STAT331 Cox s Proportional Hazards Model In this unit we introduce Cox s proportional hazards (Cox s PH) model, give a heuristic development of the partial likelihood function, and discuss adaptations
More informationAnalysis of Cure Rate Survival Data Under Proportional Odds Model
Analysis of Cure Rate Survival Data Under Proportional Odds Model Yu Gu 1,, Debajyoti Sinha 1, and Sudipto Banerjee 2, 1 Department of Statistics, Florida State University, Tallahassee, Florida 32310 5608,
More informationUnderstanding Ding s Apparent Paradox
Submitted to Statistical Science Understanding Ding s Apparent Paradox Peter M. Aronow and Molly R. OfferWestort Yale University 1. INTRODUCTION We are grateful for the opportunity to comment on A Paradox
More informationGOODNESS OF FIT TESTS FOR PARAMETRIC REGRESSION MODELS BASED ON EMPIRICAL CHARACTERISTIC FUNCTIONS
K Y B E R N E T I K A V O L U M E 4 5 2 0 0 9, N U M B E R 6, P A G E S 9 6 0 9 7 1 GOODNESS OF FIT TESTS FOR PARAMETRIC REGRESSION MODELS BASED ON EMPIRICAL CHARACTERISTIC FUNCTIONS Marie Hušková and
More informationDiscrete Dependent Variable Models
Discrete Dependent Variable Models James J. Heckman University of Chicago This draft, April 10, 2006 Here s the general approach of this lecture: Economic model Decision rule (e.g. utility maximization)
More informationFormulary Applied Econometrics
Department of Economics Formulary Applied Econometrics c c Seminar of Statistics University of Fribourg Formulary Applied Econometrics 1 Rescaling With y = cy we have: ˆβ = cˆβ With x = Cx we have: ˆβ
More information1 Appendix A: Matrix Algebra
Appendix A: Matrix Algebra. Definitions Matrix A =[ ]=[A] Symmetric matrix: = for all and Diagonal matrix: 6=0if = but =0if 6= Scalar matrix: the diagonal matrix of = Identity matrix: the scalar matrix
More informationGeneralized Linear Model under the Extended Negative Multinomial Model and Cancer Incidence
Generalized Linear Model under the Extended Negative Multinomial Model and Cancer Incidence Sunil Kumar Dhar Center for Applied Mathematics and Statistics, Department of Mathematical Sciences, New Jersey
More informationUsing Estimating Equations for Spatially Correlated A
Using Estimating Equations for Spatially Correlated Areal Data December 8, 2009 Introduction GEEs Spatial Estimating Equations Implementation Simulation Conclusion Typical Problem Assess the relationship
More informationPreliminaries The bootstrap Bias reduction Hypothesis tests Regression Confidence intervals Time series Final remark. Bootstrap inference
1 / 171 Bootstrap inference Francisco CribariNeto Departamento de Estatística Universidade Federal de Pernambuco Recife / PE, Brazil email: cribari@gmail.com October 2013 2 / 171 Unpaid advertisement
More informationChapter 1. Linear Regression with One Predictor Variable
Chapter 1. Linear Regression with One Predictor Variable 1.1 Statistical Relation Between Two Variables To motivate statistical relationships, let us consider a mathematical relation between two mathematical
More informationLeast Absolute Value vs. Least Squares Estimation and Inference Procedures in Regression Models with Asymmetric Error Distributions
Journal of Modern Applied Statistical Methods Volume 8 Issue 1 Article 13 512009 Least Absolute Value vs. Least Squares Estimation and Inference Procedures in Regression Models with Asymmetric Error
More informationAn Introduction to Nonstationary Time Series Analysis
An Introduction to Analysis Ting Zhang 1 tingz@bu.edu Department of Mathematics and Statistics Boston University August 15, 2016 Boston University/Keio University Workshop 2016 A Presentation Friendly
More informationSampling Distributions
Merlise Clyde Duke University September 8, 2016 Outline Topics Normal Theory Chisquared Distributions Student t Distributions Readings: Christensen Apendix C, Chapter 12 Prostate Example > library(lasso2);
More informationLarge Sample Properties of Estimators in the Classical Linear Regression Model
Large Sample Properties of Estimators in the Classical Linear Regression Model 7 October 004 A. Statement of the classical linear regression model The classical linear regression model can be written in
More informationGeneral Linear Model: Statistical Inference
Chapter 6 General Linear Model: Statistical Inference 6.1 Introduction So far we have discussed formulation of linear models (Chapter 1), estimability of parameters in a linear model (Chapter 4), least
More informationCovariance and Correlation
Covariance and Correlation ST 370 The probability distribution of a random variable gives complete information about its behavior, but its mean and variance are useful summaries. Similarly, the joint probability
More informationFrailty Models and Copulas: Similarities and Differences
Frailty Models and Copulas: Similarities and Differences KLARA GOETHALS, PAUL JANSSEN & LUC DUCHATEAU Department of Physiology and Biometrics, Ghent University, Belgium; Center for Statistics, Hasselt
More informationModified Simes Critical Values Under Positive Dependence
Modified Simes Critical Values Under Positive Dependence Gengqian Cai, Sanat K. Sarkar Clinical Pharmacology Statistics & Programming, BDS, GlaxoSmithKline Statistics Department, Temple University, Philadelphia
More informationSome Monte Carlo Evidence for Adaptive Estimation of UnitTime Varying Heteroscedastic Panel Data Models
Some Monte Carlo Evidence for Adaptive Estimation of UnitTime Varying Heteroscedastic Panel Data Models G. R. Pasha Department of Statistics, Bahauddin Zakariya University Multan, Pakistan Email: drpasha@bzu.edu.pk
More informationEmpirical Bayes Moderation of Asymptotically Linear Parameters
Empirical Bayes Moderation of Asymptotically Linear Parameters Nima Hejazi Division of Biostatistics University of California, Berkeley stat.berkeley.edu/~nhejazi nimahejazi.org twitter/@nshejazi github/nhejazi
More informationMaster s Written Examination  Solution
Master s Written Examination  Solution Spring 204 Problem Stat 40 Suppose X and X 2 have the joint pdf f X,X 2 (x, x 2 ) = 2e (x +x 2 ), 0 < x < x 2
More informationUniversity of California San Diego and Stanford University and
First International Workshop on Functional and Operatorial Statistics. Toulouse, June 1921, 2008 Ksample Subsampling Dimitris N. olitis andjoseph.romano University of California San Diego and Stanford
More informationResampling and the Bootstrap
Resampling and the Bootstrap Axel Benner Biostatistics, German Cancer Research Center INF 280, D69120 Heidelberg benner@dkfz.de Resampling and the Bootstrap 2 Topics Estimation and Statistical Testing
More informationTesting GoodnessofFit of a Uniform Truncation Model
Testing GoodnessofFit of a Uniform Truncation Model Micha Mandel and Rebecca A. Betensky Department of Biostatistics, Harvard School of Public Health, Boston, Massachusetts 02115 email: mmandel@hsph.harvard.edu
More informationA NOTE ON ROBUST ESTIMATION IN LOGISTIC REGRESSION MODEL
Discussiones Mathematicae Probability and Statistics 36 206 43 5 doi:0.75/dmps.80 A NOTE ON ROBUST ESTIMATION IN LOGISTIC REGRESSION MODEL Tadeusz Bednarski Wroclaw University email: t.bednarski@prawo.uni.wroc.pl
More informationWhat s New in Econometrics? Lecture 14 Quantile Methods
What s New in Econometrics? Lecture 14 Quantile Methods Jeff Wooldridge NBER Summer Institute, 2007 1. Reminders About Means, Medians, and Quantiles 2. Some Useful Asymptotic Results 3. Quantile Regression
More informationHeteroskedasticityRobust Inference in Finite Samples
HeteroskedasticityRobust Inference in Finite Samples Jerry Hausman and Christopher Palmer Massachusetts Institute of Technology December 011 Abstract Since the advent of heteroskedasticityrobust standard
More informationCOS513: FOUNDATIONS OF PROBABILISTIC MODELS LECTURE 9: LINEAR REGRESSION
COS513: FOUNDATIONS OF PROBABILISTIC MODELS LECTURE 9: LINEAR REGRESSION SEAN GERRISH AND CHONG WANG 1. WAYS OF ORGANIZING MODELS In probabilistic modeling, there are several ways of organizing models:
More informationMultistate Modeling and Applications
Multistate Modeling and Applications Yang Yang Department of Statistics University of Michigan, Ann Arbor IBM Research Graduate Student Workshop: Statistics for a Smarter Planet Yang Yang (UM, Ann Arbor)
More informationSequential Procedure for Testing Hypothesis about Mean of Latent Gaussian Process
Applied Mathematical Sciences, Vol. 4, 2010, no. 62, 30833093 Sequential Procedure for Testing Hypothesis about Mean of Latent Gaussian Process Julia Bondarenko HelmutSchmidt University Hamburg University
More informationOn robust and efficient estimation of the center of. Symmetry.
On robust and efficient estimation of the center of symmetry Howard D. Bondell Department of Statistics, North Carolina State University Raleigh, NC 276958203, U.S.A (email: bondell@stat.ncsu.edu) Abstract
More informationGeneralized linear models
Generalized linear models Douglas Bates November 01, 2010 Contents 1 Definition 1 2 Links 2 3 Estimating parameters 5 4 Example 6 5 Model building 8 6 Conclusions 8 7 Summary 9 1 Generalized Linear Models
More informationPanel Data Seminar. Discrete Response Models. CrestInsee. 11 April 2008
Panel Data Seminar Discrete Response Models Romain Aeberhardt Laurent Davezies CrestInsee 11 April 2008 Aeberhardt and Davezies (CrestInsee) Panel Data Seminar 11 April 2008 1 / 29 Contents Overview
More informationIntroduction to Regression Analysis. Dr. Devlina Chatterjee 11 th August, 2017
Introduction to Regression Analysis Dr. Devlina Chatterjee 11 th August, 2017 What is regression analysis? Regression analysis is a statistical technique for studying linear relationships. One dependent
More informationMultivariate Survival Data With Censoring.
1 Multivariate Survival Data With Censoring. Shulamith Gross and Catherine HuberCarol Baruch College of the City University of New York, Dept of Statistics and CIS, Box 11220, 1 Baruch way, 10010 NY.
More information