Ensemble estimation and variable selection with semiparametric regression models

Size: px
Start display at page:

Download "Ensemble estimation and variable selection with semiparametric regression models"

Transcription

1 Ensemble estimation and variable selection with semiparametric regression models Sunyoung Shin Department of Mathematical Sciences University of Texas at Dallas Joint work with Jason Fine, Yufeng Liu, and Steve Cole. July 30, 2018

2 Multicenter AIDS cohort study (MACS) Largest HIV AIDS prospective cohort study created to examine natural history of AIDS: aidscohortstudy.org (Kaslow, 1987) Clinical research sites Los Angeles: UCLA Chicago: Northwestern University Pittsburgh/Columbus: University of Pittsburgh Baltimore/Washington DC: Johns Hopkins University 2 / 27

3 Multicenter AIDS cohort study (MACS) Participants: 7,000 homosexual men Study design: Every six months, the participants biological and behavioral data are collected from a physical exam, questionnaires and laboratory testing. Goal: Find risk factors for the natural history of HIV infection. Response: time to HIV infection from the date of birth Possible risk factors: demographics, drug usage, sexual behaviors/disease, and medical histories. 3 / 27

4 Prospective doubly censored data (PDCD) Many subjects are infected at the time of study enrollment while those who are uninfected at enrollment may not be infected during the course of follow-up. Subjects Not Monitoring Event Right Censoring Not Time 4 / 27

5 Decomposition of prospective doubly censored data Subjects Not Monitoring Event Right Censoring Not Subjects Not infected Not infected Not infected Not infected Time Monitoring Not infected Time Subjects Time Not Monitoring Event Right Censoring Not 5 / 27

6 Decomposition of prospecitve doubly censored data Subjects Not infected Not infected Subjects Current status data Not infected Prospective doubly censored data Not infected Monitoring Not infected Not Monitoring Event Right Censoring Time Subjects Not Not Monitoring Event Right Censoring Left truncated right censored data Not Time Time 6 / 27

7 Ensemble of current status data and left truncated right censored data Full likelihood for prospective doubly censored data Maximization is difficult. Approximate EM algorithms (Kim, 2010; Su and Wang, 2016) Component likelihoods for current status data and left truncated right censored data Easy to maximize both component likelihoods. Both have been well studied, with theoretical and computational issues addressed rigorously. Ensemble method Separate component estimation Efficient combination of the component estimators 7 / 27

8 Semiparametric regression models Semiparametric regression model with (θ, Λ) θ: p-dimensional regression parameter Λ: an infinite dimensional nuisance parameter True parameter: (θ 0, Λ 0 ) Sparse θ 0 = (θ T 10, θ T 20) T = (θ T 10, 0) T, where θ 10 R s and θ 20 = 0. 8 / 27

9 Factorizable semiparametric likelihoods Data: n independent identically distributed observations, (z 1,, z n ) The log likelihood based on the data: l n (θ, Λ) = n i=1 l θ,λ(z i ) The log likelihood separates into fixed K component log likelihoods (Cox, 2001): l n (θ, Λ) = K lθ,λ k (z). k=1 Goals Efficient estimation of regression parameter Selection of the zero coefficients of regression parameter Oracle estimation of the nonzero coefficients 9 / 27

10 PDCD: Likelihood construction (C i, T i, R i, x i ): enrollment time, failure time, study termination time, and covariate, i = 1,, n (C i, C i Y i, δ i, ν i, x i ) is observed, where Y i = T i R i, δ i = I (T i C i ) is infection status at enrollment, ν i = I (T i R i ) is right censoring status. n i=1 L(C i, Y i, δ i, ν i x i ): F (C i x i ) δ i f (Y i x i ) ν i (1 δ i ) (1 F (Y i x i )) (1 ν i )(1 δ i ) 10 / 27

11 PDCD: Likelihood factorization n i=1 L(C i, Y i, δ i, ν i x i ): F (C i x i ) δ i f (Y i x i ) ν i (1 δ i ) (1 F (Y i x i )) (1 ν i )(1 δ i ) n i=1 L(C i, δ i x i ): F (C i x i ) δ i (1 F (C i x i )) 1 δ i n i=1 L(Y i, ν i C i, δ i, x i ): f (Y i x i ) {[ 1 F (C i x i ) ]ν i [ 1 F (Y i x i ) 1 F (C i x i ) ]1 ν i } 1 δ i 11 / 27

12 Cox model with PDCD Cox model is designed to estimate the effect of covariates on the hazard rate as well as the baseline hazard function. (Cox, 1972) The model parameter has two components (β, H) β: the effect parameter H: the baseline survival curve/hazard function The conditional hazard rate is assumed to satisfy H(t x) = H(t)exp(xβ). n l PDCD n (β, H) = log[1 exp{ exp(x T i β)h(c i )}] δ i (1 δ i ){ exp(x T i β)h(c i )} i=1 n + i=1 n ν i (1 δ i ){x T i + (1 ν i )(1 δ i ){ exp(x T i=1 β + logh(y i ) exp(x T i β)h(y i ) + exp(x T i β)h(c i )} i β)h(y i ) + exp(x T i β)h(c i )}. 12 / 27

13 PDCD: Likelihoods in proportional hazards models l PDCD n (β, H) l LTRC n (β, H) n i=1 n ν i (1 δ i ){x T i β + logh(y i ) exp(x T i β)h(y i ) + exp(x T i β)h(c i )} + (1 ν i )(1 δ i ){ exp(x T i β)h(y i ) + exp(x T i β)h(c i )} i=1 n i=1 log[1 exp{ exp(x T i l CS n (β, H) n β)h(c i )}] δ i (1 δ i ){ exp(x T i β)h(c i )} i=1 13 / 27

14 I. Initial ensemble estimation Efficient estimators of θ from the component likelihoods: ˆθ k F, k = 1,, K An estimator of the kth asymptotic inverse covariance, I k θ 0,Λ 0 : Î k F Initial ensemble estimator K ˆθ F = argmin (θ ˆθ k F ) T ÎF k (θ ˆθ k K F ) = ( Î k K F ) 1 ( Î k ˆθ k F F ) θ k=1 k=1 k=1 An estimator of I θ0,λ 0 : ÎF = K k=1 Î F k (Cox, 2001) 14 / 27

15 II. Ensemble variable selection Intermediate estimator with ensemble variable selection ˆθ E,λn = argmin{(θ ˆθ F ) T p Î F (θ ˆθ F ) + λ n θ j / ˆθ Fj }. θ j=1 Least squares approximation of the profile likelihood of θ with adaptive lasso penalty (Wang and Leng, 2007; Zou, 2006) Tuning parameter minimizing modified BIC is selected: BIC λn = (ˆθ E,λn ˆθ F ) T Î F (ˆθ E,λn ˆθ F )+logn p j=1 I (ˆθ E,λn,j 0)/n The selected model from ensemble variable selection: A = {j ˆθ E,λn,j 0} 15 / 27

16 III. Refit & ensemble re-estimation Refit component estimators: ˆθ k Θ R p Its subvector indexed by the model, A: ˆθ k A = (ˆθ j k, j A) R A Its asymptotic inverse covariance estimator: ÎA k R A R A Ensemble re-estimator by A: ˆθ A = ( K k=1 Î A k ) 1 ( K k=1 Î Aˆθ k k ) R A Its asymptotic inverse covariance estimator: Î A = K k=1 Î k A R A R A The resulting estimator: ˆθ = [ˆθ A ; ˆθ A c ] = [ˆθ A ; 0] R p 16 / 27

17 Uncorrelated condition C1. Component score functions for θ are pairwise uncorrelated, E[ l k θ 0,Λ 0 l k T θ 0,Λ 0 ] = 0, k k. C2. Component tangent spaces are pairwise orthogonal, Ṗ k θ 0,Λ 0 Ṗ k θ 0,Λ 0, k k. C3. Component score functions for θ are orthogonal to all other component tangent spaces, l k θ 0,Λ 0 Ṗ k θ 0,Λ 0, k k. 17 / 27

18 Full efficient score and information additivity Proposition 1 Under Conditions 1-3, l θ 0,Λ 0 = K k=1 l k θ 0,Λ 0, and E(l k θ 0,Λ 0 l k T θ 0,Λ 0 ) = 0, k k. Consequently, I θ 0,Λ 0 = K k=1 I k θ 0,Λ 0. Example Data: i.i.d observations of (Z i, W i, X i ) X i : a covariate, W i and Z i : dependent variables X i (θ 0, Λ 0) There is a semiparametric regression model which leads to a full log likelihood: n i=1 l θ,λ (Z i, W i X i ) = n i=1 n lθ,λ(w 1 i X i ) + lθ,λ(z 2 i W i, X i ) i=1 18 / 27

19 Assumptions on the initial estimation A1. Semiparametric efficient estimation of component regression parameters ˆθ k F, k = 1,, K, are regular, asymptotically linear and semiparametric efficient with the component likelihoods such that n 1/2 (ˆθ k F θ 0 ) = n 1/2 n i=1 I θ k 1 0,Λ 0 lθ k 0,Λ 0 (z i ) + o p (1), and n 1/2 (ˆθ k F θ 0 ) D N(0, Iθ k 1 0,Λ 0 ). A2. Consistent estimation of component efficient information A consistent estimator of Iθ k 0,Λ 0, ÎF k, exists for k = 1,, K. Remark. The convergence rates of component nuisance parameter estimators may be slower than the convergence rate of n 1/2. 19 / 27

20 Asymptotic properties Theorem 1 (Asymptotic Efficiency) n 1/2 (ˆθ F θ 0 ) = O p (1) and n 1/2 (ˆθ F θ 0 ) D N(0, I 1 θ 0,Λ 0 ). Theorem 2 (Selection Consistency) If n 1/2 λ n 0 and nλ n, then n 1/2 (ˆθ θ 0 ) = O p (1) and P(ˆθ 2 = 0) 1. Theorem 3 (Oracle Property) If n 1/2 λ n 0 and nλ n, then n 1/2 (ˆθ 1 θ 10 ) D N(0, I a 1 θ 10,Λ 0 ). 20 / 27

21 Simulation set-up Data from the following exponential hazard model: h(t x) = exp(x T β), β = (0.8, 0, 0, 1, 0, 0, 0.6, 0, 0, 0) x N(0, Σ), Σ ij = 0.5 i j C i follows an exponential distribution, R i follows an exponential distribution shifted by C i. Censoring rates: (20%, 20%), (30%, 30%) n = 250 or n = / 27

22 Simulation results n = 250 (20%, 20%) RMSE TP FP UF CF OF Oracle CS MLE LSA Refit LSA Oracle LTRC MLE LSA Refit LSA Oracle EE Ensemble EVS Refit on CS Refit on LTRC Refit Ensemble Table : RMSE: Relative mean squared errors to the ensemble oracle estimator, TP: true positives, FP: false positives, (UF, CF, OF): ratios of simulated datasets which are underfitted, correctly fitted or overfitted to the true model 22 / 27

23 MACS data analysis Covariates EE (SE) EVS LTRC (SE) CS (SE) ENS (SE) UNEMP 0.19 (0.09).... BLACK 0.71 (0.08) * (0.18) 0.69 (0.08) 0.70 (0.08) HISPA 0.28 (0.10) * (0.22) 0.32 (0.13) 0.20 (0.11) OTHER 0.06 (0.20).... PRECOL (0.07) *.... COL (0.08) * (0.13) (0.06) (0.06) POSCOL (0.07) * (0.12) (0.11) (0.07) NDRNK (0.01).... PACKS (0.03).... NEEDL 0.54 (0.10) * (0.22) 0.45 (0.12) 0.57 (0.10) COK2Y 0.49 (0.06) * (0.10) 0.55 (0.09) 0.41 (0.07) HAS2Y 0.12 (0.08) *.... MSX2Y 0.31 (0.08) * (0.14) 0.27 (0.09) 0.24 (0.07) OPI2Y (0.12) / 27

24 MACS data analysis Covariates EE (SE) EVS LTRC (SE) CS (SE) ENS (SE) DIABE (0.29).... GONOE 0.53 (0.08) * (0.10) 0.50 (0.14) 0.62 (0.08) RADTE (0.21).... WARTE 0.37 (0.05) * (0.11) 0.39 (0.05) 0.34 (0.05) CON2P 0.07 (0.23).... CON2Y (0.21).... REC2P 0.58 (0.05) * (0.12) 0.59 (0.05) 0.57 ( 0.05) REC2Y 0.06 (0.05).... Table : EE: initial ensemble estimator, EVE: intermediate estimator with ensemble variable selection, LTRC, CS: refit estimators, ENS: ensemble re-estimator 24 / 27

25 Summary & Future research Summary Prospective doubly censored data Semiparametric factorizable likelihoods Ensemble method Efficient combination of component estimators Lasso penalization Future works Ensemble method in the high dimensional context, p >> n Ensemble estimation of nonparametric nuisance parameter, Λ 25 / 27

26 Bibliography I D.R. Cox. Regression models and life-tables. Journal of the Royal Statistical Society. Series B (Methodological), pages , DR Cox. Some remarks on likelihood factorization. Lecture Notes-Monograph Series, pages , C. R Kaslow, R. A. & Rinaldo. The multicenter aids cohort study: rationale, organization, and selected characteristics of the participants. American Journal of Epidemiology, 126(1): , Y Kim. Asymptotic properties of the maximum likelihood estimator for the proportional hazards model with doubly censored data. Journal of Multivariate Analysis, 101(101): , Y-R. Su and J-L Wang. Semiparametric efficient estimation for shared-frailty models with doubly-censored clustered data. The Annals of Statistics, 44: , R. Tibshirani. Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society. Series B (Methodological), pages , H. Wang and C. Leng. Unified lasso estimation by least squares approximation. Journal of the American Statistical Association, 102(479): , H. Zou. The adaptive lasso and its oracle properties. Journal of the American Statistical Association, 101(476): , 2006.

27 Thank you! 27 / 27

28 Preliminary l θ0,λ 0 (z i ) = l θ,λ0 (z i )/( θ) θ=θ0 L 2 (P θ0,λ 0 ): the score function for θ at θ 0 l θ0,λ(t)(z i )/( t) t=0 : score function for the submodel of Λ Λ(t): one-dimensional parametric submodels of Λ Λ(t) Λ 0 as t 0 Ṗ θ0,λ 0 L 2 (P θ0,λ 0 ): tangent space for Λ at the true distribution closed span of the tangent set, which is a collection of score functions for one-dimensional submodels l θ 0,Λ 0 = l θ0,λ 0 0 l θ0,λ 0 : efficient score function at (θ 0, Λ 0 ), where 0 is the projection onto Ṗ θ0,λ 0 in L 2 (P θ0,λ 0 ). Iθ 0,Λ 0 = E(lθ 0,Λ 0 lθ T 0,Λ 0 ): efficient information matrix. Similarly, define them with respect to component likelihoods: l θ k 0,Λ 0 (z i ), lθ k 0,Λ(t) (z i)/( t) t=0, Ṗθ k 0,Λ 0, lθ k 0,Λ 0, and Iθ k 0,Λ / 27

29 Application of ensemble method I. Efficient MLEs: ˆβ CS F, ˆβ LTRC F Consistent variance estimator: ÎF CS 1, ÎF LTRC 1 Computation: iterative convex minorant algorithm, partial likelihood maximization Huang (1996), Andersen et al. (1997) Full ensemble estimator: ˆβ F = (Î CS F Î F = Î CS F + Î LTRC F + Î LTRC F ) 1 (Î LTRC F ˆβ CS F + ÎF LTRC ˆβ LTRC F ) 29 / 27

30 Application of ensemble method II. Intermediate estimator with ensemble variable selection: ˆβ E,λn = argmin(β ˆβ F ) T Î F (β ˆβ F ) + λ n β j / ˆβ Fj β R p The selected model: M = {j ˆβ Eλn,j 0} p j=1 30 / 27

31 Application of ensemble method III. Refit estimators: ˆβ CS, ˆβ LTRC R p Its subvector indexed by M: ˆβ LTRC M, ˆβ CS M Its asymptotic covariances: Î M CS 1, Î M LTRC 1 Ensemble re-estimator indexed by M: ˆβ M = (ÎM CS + ÎM LTRC ) 1 (ÎM CS Î M = (ÎM CS + ÎM LTRC ) ˆβ CS M + ÎM LTRC Final estimator: ˆβ = [ˆβ M ; ˆβ M c ] = [ˆβ M ; 0] R p LTRC ˆβ M ) 31 / 27

32 I. Full ensemble estimation MLE from lβ,h CS CS (C, δ x): ˆβ F A consistent bootstrap variance estimator, Î F CS 1 Computation: the iterative convex minorant (ICM) algorithm MLE from lβ,h LTRC(Y, ν C, δ, x): ˆβ LTRC F A consistent variance estimator, Î F LTRC Computation: the partial log likelihood Full ensemble estimator: ˆβ F = (Î CS F + Î LTRC F ) 1 (Î LTRC F ˆβ CS F + ÎF LTRC ˆβ LTRC F ) Its inverse covariance estimator is Î F = Î F CS + Î F LTRC 32 / 27

33 II. Ensemble variable selection Intermediate estimator with ensemble variable selection ˆβ E,λn = argmin(β ˆβ F ) T ÎF (β ˆβ F ) + λ n β j / ˆβ Fj, β R p p j=1 Selection of optimal tuning parameter: minimizing the modified BIC Computation: the path-finding algorithm least angle regression (lars) (Efron et al., 2004) The selected model: M = {j ˆβ Eλn,j 0} 33 / 27

34 III. Refit ensemble estimation Refit estimators based on M: ˆβ CS and ˆβ LTRC ˆβ LTRC M ˆβ CS M Its subvector indexed by M: and Its asymptotic inverse covariances: Î M CS LTRC and ÎM Computation: maximizing the component likelihoods under a constraint that {β j = 0, for j M Ensemble re-estimator indexed by M: ˆβ M = (Î CS M + Î LTRC M ) 1 (Î CS M ˆβ CS M + ÎM LTRC ˆβ LTRC M ) Asymptotic inverse covariance estimator: Î M Final estimator: ˆβ = [ˆβ M ; ˆβ M c ] = [ˆβ M ; 0] CS = (ÎM + Î M LTRC ) 34 / 27

35 Norm of a matrix A R m m Frobenius norm: A F = tr(a T A) Spectral norm: A 2 = λ max (A T A), λ max (A T A) is the largest eigenvalue of A T A. m m L 1 norm: A 1 = a ij i=1 j=1 35 / 27

36 Theoretical properties Corollary 4 (Asymptotic Efficiency) We have n 1/2 (ˆβ F β 0 ) D N(0, I 1 β 0,H 0 ). Corollary 5 (Selection Consistency) If n 1/2 λ n 0 and nλ n, then n 1/2 (ˆβ β 0 ) = O p (1) and P(M = M T ) 1. Corollary 6 (Oracle Property) If n 1/2 λ n 0 and nλ n, then n 1/2 (ˆβ a β 0a ) D N(0, I a 1 β 0a,H 0 ). 36 / 27

37 Baseline hazard rate estimators 1. The baseline hazard function estimator, Ĥ LTRC, in Cox model with left-truncated and right-censored data has a regular convergence rate, n 1/2. 2. The convergence rate of Ĥ CS in Cox model with current status data is n 1/3. Regardless of the convergence rate of the component baseline hazard rate estimators, our proposed method establishes valid asymptotical properties for estimation of β. Practically, we may use Ĥ LTRC as a baseline hazard function estimator. 37 / 27

38 Decomposition into marginal and conditional likelihoods Data: n independent identically distributed observations of (Z i, W i, X i ) X i is a covariate and W i and Z i are dependent variables. A semiparametric regression model with parameters (θ, Λ) which leads to a full log likelihood, n l θ,λ (Z i, W i X i ). i=1 Distribution of X i is independent of (θ, Λ). l θ,λ (Z, W X) = Decomposition of full log likelihood into the marginal and conditional log likelihoods: lθ,λ 1 n (W X) = lθ,λ 1 (W i X i ) and lθ,λ 2 n (Z W, X) = lθ,λ 2 (Z i W i, X i ). i=1 i=1 38 / 27

39 Variance estimators of ˆβ 1 n = 250 CS LTRC (20%, 20%) Oracle MLE Refit Oracle MLE Refit LSA LSA ESE ASE % coverage Ensemble Oracle EE Refit Refit Refit on CS on LTRC Ensemble ESE ASE % coverage Table : ESE: Empirical standard errors, ASE: Average estimated standard errors, 95% empirical coverage probabilities of ˆβ 1 39 / 27

40 Norm of empirical covariances between two component MLEs Censoring rates (20%, 20%) (30%, 30%) Norm n = 250 n = 500 n = 250 n = 500 Frobenius Spectral L / 27

CONTRIBUTIONS TO PENALIZED ESTIMATION. Sunyoung Shin

CONTRIBUTIONS TO PENALIZED ESTIMATION. Sunyoung Shin CONTRIBUTIONS TO PENALIZED ESTIMATION Sunyoung Shin A dissertation submitted to the faculty of the University of North Carolina at Chapel Hill in partial fulfillment of the requirements for the degree

More information

Introduction to Empirical Processes and Semiparametric Inference Lecture 25: Semiparametric Models

Introduction to Empirical Processes and Semiparametric Inference Lecture 25: Semiparametric Models Introduction to Empirical Processes and Semiparametric Inference Lecture 25: Semiparametric Models Michael R. Kosorok, Ph.D. Professor and Chair of Biostatistics Professor of Statistics and Operations

More information

Regularization in Cox Frailty Models

Regularization in Cox Frailty Models Regularization in Cox Frailty Models Andreas Groll 1, Trevor Hastie 2, Gerhard Tutz 3 1 Ludwig-Maximilians-Universität Munich, Department of Mathematics, Theresienstraße 39, 80333 Munich, Germany 2 University

More information

Lecture 5 Models and methods for recurrent event data

Lecture 5 Models and methods for recurrent event data Lecture 5 Models and methods for recurrent event data Recurrent and multiple events are commonly encountered in longitudinal studies. In this chapter we consider ordered recurrent and multiple events.

More information

Approximation of Survival Function by Taylor Series for General Partly Interval Censored Data

Approximation of Survival Function by Taylor Series for General Partly Interval Censored Data Malaysian Journal of Mathematical Sciences 11(3): 33 315 (217) MALAYSIAN JOURNAL OF MATHEMATICAL SCIENCES Journal homepage: http://einspem.upm.edu.my/journal Approximation of Survival Function by Taylor

More information

Smoothly Clipped Absolute Deviation (SCAD) for Correlated Variables

Smoothly Clipped Absolute Deviation (SCAD) for Correlated Variables Smoothly Clipped Absolute Deviation (SCAD) for Correlated Variables LIB-MA, FSSM Cadi Ayyad University (Morocco) COMPSTAT 2010 Paris, August 22-27, 2010 Motivations Fan and Li (2001), Zou and Li (2008)

More information

Lasso Maximum Likelihood Estimation of Parametric Models with Singular Information Matrices

Lasso Maximum Likelihood Estimation of Parametric Models with Singular Information Matrices Article Lasso Maximum Likelihood Estimation of Parametric Models with Singular Information Matrices Fei Jin 1,2 and Lung-fei Lee 3, * 1 School of Economics, Shanghai University of Finance and Economics,

More information

Multivariate Survival Analysis

Multivariate Survival Analysis Multivariate Survival Analysis Previously we have assumed that either (X i, δ i ) or (X i, δ i, Z i ), i = 1,..., n, are i.i.d.. This may not always be the case. Multivariate survival data can arise in

More information

Power and Sample Size Calculations with the Additive Hazards Model

Power and Sample Size Calculations with the Additive Hazards Model Journal of Data Science 10(2012), 143-155 Power and Sample Size Calculations with the Additive Hazards Model Ling Chen, Chengjie Xiong, J. Philip Miller and Feng Gao Washington University School of Medicine

More information

Lecture 14: Variable Selection - Beyond LASSO

Lecture 14: Variable Selection - Beyond LASSO Fall, 2017 Extension of LASSO To achieve oracle properties, L q penalty with 0 < q < 1, SCAD penalty (Fan and Li 2001; Zhang et al. 2007). Adaptive LASSO (Zou 2006; Zhang and Lu 2007; Wang et al. 2007)

More information

STAT331. Cox s Proportional Hazards Model

STAT331. Cox s Proportional Hazards Model STAT331 Cox s Proportional Hazards Model In this unit we introduce Cox s proportional hazards (Cox s PH) model, give a heuristic development of the partial likelihood function, and discuss adaptations

More information

Modelling geoadditive survival data

Modelling geoadditive survival data Modelling geoadditive survival data Thomas Kneib & Ludwig Fahrmeir Department of Statistics, Ludwig-Maximilians-University Munich 1. Leukemia survival data 2. Structured hazard regression 3. Mixed model

More information

Survival Analysis for Case-Cohort Studies

Survival Analysis for Case-Cohort Studies Survival Analysis for ase-ohort Studies Petr Klášterecký Dept. of Probability and Mathematical Statistics, Faculty of Mathematics and Physics, harles University, Prague, zech Republic e-mail: petr.klasterecky@matfyz.cz

More information

Selection of Smoothing Parameter for One-Step Sparse Estimates with L q Penalty

Selection of Smoothing Parameter for One-Step Sparse Estimates with L q Penalty Journal of Data Science 9(2011), 549-564 Selection of Smoothing Parameter for One-Step Sparse Estimates with L q Penalty Masaru Kanba and Kanta Naito Shimane University Abstract: This paper discusses the

More information

The Adaptive Lasso and Its Oracle Properties Hui Zou (2006), JASA

The Adaptive Lasso and Its Oracle Properties Hui Zou (2006), JASA The Adaptive Lasso and Its Oracle Properties Hui Zou (2006), JASA Presented by Dongjun Chung March 12, 2010 Introduction Definition Oracle Properties Computations Relationship: Nonnegative Garrote Extensions:

More information

Statistical Methods for Handling Incomplete Data Chapter 2: Likelihood-based approach

Statistical Methods for Handling Incomplete Data Chapter 2: Likelihood-based approach Statistical Methods for Handling Incomplete Data Chapter 2: Likelihood-based approach Jae-Kwang Kim Department of Statistics, Iowa State University Outline 1 Introduction 2 Observed likelihood 3 Mean Score

More information

Frailty Modeling for clustered survival data: a simulation study

Frailty Modeling for clustered survival data: a simulation study Frailty Modeling for clustered survival data: a simulation study IAA Oslo 2015 Souad ROMDHANE LaREMFiQ - IHEC University of Sousse (Tunisia) souad_romdhane@yahoo.fr Lotfi BELKACEM LaREMFiQ - IHEC University

More information

Statistical Inference

Statistical Inference Statistical Inference Liu Yang Florida State University October 27, 2016 Liu Yang, Libo Wang (Florida State University) Statistical Inference October 27, 2016 1 / 27 Outline The Bayesian Lasso Trevor Park

More information

Univariate shrinkage in the Cox model for high dimensional data

Univariate shrinkage in the Cox model for high dimensional data Univariate shrinkage in the Cox model for high dimensional data Robert Tibshirani January 6, 2009 Abstract We propose a method for prediction in Cox s proportional model, when the number of features (regressors)

More information

Frailty Models and Copulas: Similarities and Differences

Frailty Models and Copulas: Similarities and Differences Frailty Models and Copulas: Similarities and Differences KLARA GOETHALS, PAUL JANSSEN & LUC DUCHATEAU Department of Physiology and Biometrics, Ghent University, Belgium; Center for Statistics, Hasselt

More information

Support Vector Hazard Regression (SVHR) for Predicting Survival Outcomes. Donglin Zeng, Department of Biostatistics, University of North Carolina

Support Vector Hazard Regression (SVHR) for Predicting Survival Outcomes. Donglin Zeng, Department of Biostatistics, University of North Carolina Support Vector Hazard Regression (SVHR) for Predicting Survival Outcomes Introduction Method Theoretical Results Simulation Studies Application Conclusions Introduction Introduction For survival data,

More information

Outline. Frailty modelling of Multivariate Survival Data. Clustered survival data. Clustered survival data

Outline. Frailty modelling of Multivariate Survival Data. Clustered survival data. Clustered survival data Outline Frailty modelling of Multivariate Survival Data Thomas Scheike ts@biostat.ku.dk Department of Biostatistics University of Copenhagen Marginal versus Frailty models. Two-stage frailty models: copula

More information

Efficiency of Profile/Partial Likelihood in the Cox Model

Efficiency of Profile/Partial Likelihood in the Cox Model Efficiency of Profile/Partial Likelihood in the Cox Model Yuichi Hirose School of Mathematics, Statistics and Operations Research, Victoria University of Wellington, New Zealand Summary. This paper shows

More information

Sparse Linear Models (10/7/13)

Sparse Linear Models (10/7/13) STA56: Probabilistic machine learning Sparse Linear Models (0/7/) Lecturer: Barbara Engelhardt Scribes: Jiaji Huang, Xin Jiang, Albert Oh Sparsity Sparsity has been a hot topic in statistics and machine

More information

Penalized Empirical Likelihood and Growing Dimensional General Estimating Equations

Penalized Empirical Likelihood and Growing Dimensional General Estimating Equations 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 Biometrika (0000), 00, 0, pp. 1 15 C 0000 Biometrika Trust Printed

More information

Semiparametric Regression

Semiparametric Regression Semiparametric Regression Patrick Breheny October 22 Patrick Breheny Survival Data Analysis (BIOS 7210) 1/23 Introduction Over the past few weeks, we ve introduced a variety of regression models under

More information

Analysis of Gamma and Weibull Lifetime Data under a General Censoring Scheme and in the presence of Covariates

Analysis of Gamma and Weibull Lifetime Data under a General Censoring Scheme and in the presence of Covariates Communications in Statistics - Theory and Methods ISSN: 0361-0926 (Print) 1532-415X (Online) Journal homepage: http://www.tandfonline.com/loi/lsta20 Analysis of Gamma and Weibull Lifetime Data under a

More information

Pairwise rank based likelihood for estimating the relationship between two homogeneous populations and their mixture proportion

Pairwise rank based likelihood for estimating the relationship between two homogeneous populations and their mixture proportion Pairwise rank based likelihood for estimating the relationship between two homogeneous populations and their mixture proportion Glenn Heller and Jing Qin Department of Epidemiology and Biostatistics Memorial

More information

Integrated likelihoods in survival models for highlystratified

Integrated likelihoods in survival models for highlystratified Working Paper Series, N. 1, January 2014 Integrated likelihoods in survival models for highlystratified censored data Giuliana Cortese Department of Statistical Sciences University of Padua Italy Nicola

More information

Estimating subgroup specific treatment effects via concave fusion

Estimating subgroup specific treatment effects via concave fusion Estimating subgroup specific treatment effects via concave fusion Jian Huang University of Iowa April 6, 2016 Outline 1 Motivation and the problem 2 The proposed model and approach Concave pairwise fusion

More information

Analysing geoadditive regression data: a mixed model approach

Analysing geoadditive regression data: a mixed model approach Analysing geoadditive regression data: a mixed model approach Institut für Statistik, Ludwig-Maximilians-Universität München Joint work with Ludwig Fahrmeir & Stefan Lang 25.11.2005 Spatio-temporal regression

More information

Sparse survival regression

Sparse survival regression Sparse survival regression Anders Gorst-Rasmussen gorst@math.aau.dk Department of Mathematics Aalborg University November 2010 1 / 27 Outline Penalized survival regression The semiparametric additive risk

More information

The Iterated Lasso for High-Dimensional Logistic Regression

The Iterated Lasso for High-Dimensional Logistic Regression The Iterated Lasso for High-Dimensional Logistic Regression By JIAN HUANG Department of Statistics and Actuarial Science, 241 SH University of Iowa, Iowa City, Iowa 52242, U.S.A. SHUANGE MA Division of

More information

Longitudinal + Reliability = Joint Modeling

Longitudinal + Reliability = Joint Modeling Longitudinal + Reliability = Joint Modeling Carles Serrat Institute of Statistics and Mathematics Applied to Building CYTED-HAROSA International Workshop November 21-22, 2013 Barcelona Mainly from Rizopoulos,

More information

Nonconcave Penalized Likelihood with A Diverging Number of Parameters

Nonconcave Penalized Likelihood with A Diverging Number of Parameters Nonconcave Penalized Likelihood with A Diverging Number of Parameters Jianqing Fan and Heng Peng Presenter: Jiale Xu March 12, 2010 Jianqing Fan and Heng Peng Presenter: JialeNonconcave Xu () Penalized

More information

Survival Analysis Math 434 Fall 2011

Survival Analysis Math 434 Fall 2011 Survival Analysis Math 434 Fall 2011 Part IV: Chap. 8,9.2,9.3,11: Semiparametric Proportional Hazards Regression Jimin Ding Math Dept. www.math.wustl.edu/ jmding/math434/fall09/index.html Basic Model Setup

More information

Statistical Data Mining and Machine Learning Hilary Term 2016

Statistical Data Mining and Machine Learning Hilary Term 2016 Statistical Data Mining and Machine Learning Hilary Term 2016 Dino Sejdinovic Department of Statistics Oxford Slides and other materials available at: http://www.stats.ox.ac.uk/~sejdinov/sdmml Naïve Bayes

More information

High-dimensional regression with unknown variance

High-dimensional regression with unknown variance High-dimensional regression with unknown variance Christophe Giraud Ecole Polytechnique march 2012 Setting Gaussian regression with unknown variance: Y i = f i + ε i with ε i i.i.d. N (0, σ 2 ) f = (f

More information

Lecture 3. Truncation, length-bias and prevalence sampling

Lecture 3. Truncation, length-bias and prevalence sampling Lecture 3. Truncation, length-bias and prevalence sampling 3.1 Prevalent sampling Statistical techniques for truncated data have been integrated into survival analysis in last two decades. Truncation in

More information

Lecture 3 September 1

Lecture 3 September 1 STAT 383C: Statistical Modeling I Fall 2016 Lecture 3 September 1 Lecturer: Purnamrita Sarkar Scribe: Giorgio Paulon, Carlos Zanini Disclaimer: These scribe notes have been slightly proofread and may have

More information

PENALIZED LIKELIHOOD PARAMETER ESTIMATION FOR ADDITIVE HAZARD MODELS WITH INTERVAL CENSORED DATA

PENALIZED LIKELIHOOD PARAMETER ESTIMATION FOR ADDITIVE HAZARD MODELS WITH INTERVAL CENSORED DATA PENALIZED LIKELIHOOD PARAMETER ESTIMATION FOR ADDITIVE HAZARD MODELS WITH INTERVAL CENSORED DATA Kasun Rathnayake ; A/Prof Jun Ma Department of Statistics Faculty of Science and Engineering Macquarie University

More information

Theoretical results for lasso, MCP, and SCAD

Theoretical results for lasso, MCP, and SCAD Theoretical results for lasso, MCP, and SCAD Patrick Breheny March 2 Patrick Breheny High-Dimensional Data Analysis (BIOS 7600) 1/23 Introduction There is an enormous body of literature concerning theoretical

More information

A Confidence Region Approach to Tuning for Variable Selection

A Confidence Region Approach to Tuning for Variable Selection A Confidence Region Approach to Tuning for Variable Selection Funda Gunes and Howard D. Bondell Department of Statistics North Carolina State University Abstract We develop an approach to tuning of penalized

More information

Ph.D. Qualifying Exam Friday Saturday, January 6 7, 2017

Ph.D. Qualifying Exam Friday Saturday, January 6 7, 2017 Ph.D. Qualifying Exam Friday Saturday, January 6 7, 2017 Put your solution to each problem on a separate sheet of paper. Problem 1. (5106) Let X 1, X 2,, X n be a sequence of i.i.d. observations from a

More information

Estimation of Conditional Kendall s Tau for Bivariate Interval Censored Data

Estimation of Conditional Kendall s Tau for Bivariate Interval Censored Data Communications for Statistical Applications and Methods 2015, Vol. 22, No. 6, 599 604 DOI: http://dx.doi.org/10.5351/csam.2015.22.6.599 Print ISSN 2287-7843 / Online ISSN 2383-4757 Estimation of Conditional

More information

Adaptive Piecewise Polynomial Estimation via Trend Filtering

Adaptive Piecewise Polynomial Estimation via Trend Filtering Adaptive Piecewise Polynomial Estimation via Trend Filtering Liubo Li, ShanShan Tu The Ohio State University li.2201@osu.edu, tu.162@osu.edu October 1, 2015 Liubo Li, ShanShan Tu (OSU) Trend Filtering

More information

On Measurement Error Problems with Predictors Derived from Stationary Stochastic Processes and Application to Cocaine Dependence Treatment Data

On Measurement Error Problems with Predictors Derived from Stationary Stochastic Processes and Application to Cocaine Dependence Treatment Data On Measurement Error Problems with Predictors Derived from Stationary Stochastic Processes and Application to Cocaine Dependence Treatment Data Yehua Li Department of Statistics University of Georgia Yongtao

More information

Beyond GLM and likelihood

Beyond GLM and likelihood Stat 6620: Applied Linear Models Department of Statistics Western Michigan University Statistics curriculum Core knowledge (modeling and estimation) Math stat 1 (probability, distributions, convergence

More information

Multistate models and recurrent event models

Multistate models and recurrent event models Multistate models Multistate models and recurrent event models Patrick Breheny December 10 Patrick Breheny Survival Data Analysis (BIOS 7210) 1/22 Introduction Multistate models In this final lecture,

More information

Estimation for two-phase designs: semiparametric models and Z theorems

Estimation for two-phase designs: semiparametric models and Z theorems Estimation for two-phase designs:semiparametric models and Z theorems p. 1/27 Estimation for two-phase designs: semiparametric models and Z theorems Jon A. Wellner University of Washington Estimation for

More information

FULL LIKELIHOOD INFERENCES IN THE COX MODEL

FULL LIKELIHOOD INFERENCES IN THE COX MODEL October 20, 2007 FULL LIKELIHOOD INFERENCES IN THE COX MODEL BY JIAN-JIAN REN 1 AND MAI ZHOU 2 University of Central Florida and University of Kentucky Abstract We use the empirical likelihood approach

More information

Grouped variable selection in high dimensional partially linear additive Cox model

Grouped variable selection in high dimensional partially linear additive Cox model University of Iowa Iowa Research Online Theses and Dissertations Fall 2010 Grouped variable selection in high dimensional partially linear additive Cox model Li Liu University of Iowa Copyright 2010 Li

More information

Variable Selection and Model Choice in Survival Models with Time-Varying Effects

Variable Selection and Model Choice in Survival Models with Time-Varying Effects Variable Selection and Model Choice in Survival Models with Time-Varying Effects Boosting Survival Models Benjamin Hofner 1 Department of Medical Informatics, Biometry and Epidemiology (IMBE) Friedrich-Alexander-Universität

More information

Frailty Probit model for multivariate and clustered interval-censor

Frailty Probit model for multivariate and clustered interval-censor Frailty Probit model for multivariate and clustered interval-censored failure time data University of South Carolina Department of Statistics June 4, 2013 Outline Introduction Proposed models Simulation

More information

Maximum likelihood estimation for Cox s regression model under nested case-control sampling

Maximum likelihood estimation for Cox s regression model under nested case-control sampling Biostatistics (2004), 5, 2,pp. 193 206 Printed in Great Britain Maximum likelihood estimation for Cox s regression model under nested case-control sampling THOMAS H. SCHEIKE Department of Biostatistics,

More information

Models for Multivariate Panel Count Data

Models for Multivariate Panel Count Data Semiparametric Models for Multivariate Panel Count Data KyungMann Kim University of Wisconsin-Madison kmkim@biostat.wisc.edu 2 April 2015 Outline 1 Introduction 2 3 4 Panel Count Data Motivation Previous

More information

Likelihood Construction, Inference for Parametric Survival Distributions

Likelihood Construction, Inference for Parametric Survival Distributions Week 1 Likelihood Construction, Inference for Parametric Survival Distributions In this section we obtain the likelihood function for noninformatively rightcensored survival data and indicate how to make

More information

Confidence Intervals for Low-dimensional Parameters with High-dimensional Data

Confidence Intervals for Low-dimensional Parameters with High-dimensional Data Confidence Intervals for Low-dimensional Parameters with High-dimensional Data Cun-Hui Zhang and Stephanie S. Zhang Rutgers University and Columbia University September 14, 2012 Outline Introduction Methodology

More information

Graduate Econometrics I: Maximum Likelihood II

Graduate Econometrics I: Maximum Likelihood II Graduate Econometrics I: Maximum Likelihood II Yves Dominicy Université libre de Bruxelles Solvay Brussels School of Economics and Management ECARES Yves Dominicy Graduate Econometrics I: Maximum Likelihood

More information

Sample-weighted semiparametric estimates of cause-specific cumulative incidence using left-/interval censored data from electronic health records

Sample-weighted semiparametric estimates of cause-specific cumulative incidence using left-/interval censored data from electronic health records 1 / 22 Sample-weighted semiparametric estimates of cause-specific cumulative incidence using left-/interval censored data from electronic health records Noorie Hyun, Hormuzd A. Katki, Barry I. Graubard

More information

CURE MODEL WITH CURRENT STATUS DATA

CURE MODEL WITH CURRENT STATUS DATA Statistica Sinica 19 (2009), 233-249 CURE MODEL WITH CURRENT STATUS DATA Shuangge Ma Yale University Abstract: Current status data arise when only random censoring time and event status at censoring are

More information

University of California, Berkeley

University of California, Berkeley University of California, Berkeley U.C. Berkeley Division of Biostatistics Working Paper Series Year 2009 Paper 248 Application of Time-to-Event Methods in the Assessment of Safety in Clinical Trials Kelly

More information

VARIABLE SELECTION AND STATISTICAL LEARNING FOR CENSORED DATA. Xiaoxi Liu

VARIABLE SELECTION AND STATISTICAL LEARNING FOR CENSORED DATA. Xiaoxi Liu VARIABLE SELECTION AND STATISTICAL LEARNING FOR CENSORED DATA Xiaoxi Liu A dissertation submitted to the faculty of the University of North Carolina at Chapel Hill in partial fulfillment of the requirements

More information

Generalized Elastic Net Regression

Generalized Elastic Net Regression Abstract Generalized Elastic Net Regression Geoffroy MOURET Jean-Jules BRAULT Vahid PARTOVINIA This work presents a variation of the elastic net penalization method. We propose applying a combined l 1

More information

An Algorithm for Bayesian Variable Selection in High-dimensional Generalized Linear Models

An Algorithm for Bayesian Variable Selection in High-dimensional Generalized Linear Models Proceedings 59th ISI World Statistics Congress, 25-30 August 2013, Hong Kong (Session CPS023) p.3938 An Algorithm for Bayesian Variable Selection in High-dimensional Generalized Linear Models Vitara Pungpapong

More information

Various types of likelihood

Various types of likelihood Various types of likelihood 1. likelihood, marginal likelihood, conditional likelihood, profile likelihood, adjusted profile likelihood 2. semi-parametric likelihood, partial likelihood 3. empirical likelihood,

More information

Multistate models and recurrent event models

Multistate models and recurrent event models and recurrent event models Patrick Breheny December 6 Patrick Breheny University of Iowa Survival Data Analysis (BIOS:7210) 1 / 22 Introduction In this final lecture, we will briefly look at two other

More information

Efficient Estimation of Population Quantiles in General Semiparametric Regression Models

Efficient Estimation of Population Quantiles in General Semiparametric Regression Models Efficient Estimation of Population Quantiles in General Semiparametric Regression Models Arnab Maity 1 Department of Statistics, Texas A&M University, College Station TX 77843-3143, U.S.A. amaity@stat.tamu.edu

More information

High-dimensional Covariance Estimation Based On Gaussian Graphical Models

High-dimensional Covariance Estimation Based On Gaussian Graphical Models High-dimensional Covariance Estimation Based On Gaussian Graphical Models Shuheng Zhou, Philipp Rutimann, Min Xu and Peter Buhlmann February 3, 2012 Problem definition Want to estimate the covariance matrix

More information

MISCELLANEOUS TOPICS RELATED TO LIKELIHOOD. Copyright c 2012 (Iowa State University) Statistics / 30

MISCELLANEOUS TOPICS RELATED TO LIKELIHOOD. Copyright c 2012 (Iowa State University) Statistics / 30 MISCELLANEOUS TOPICS RELATED TO LIKELIHOOD Copyright c 2012 (Iowa State University) Statistics 511 1 / 30 INFORMATION CRITERIA Akaike s Information criterion is given by AIC = 2l(ˆθ) + 2k, where l(ˆθ)

More information

STATISTICAL INFERENCE IN ACCELERATED LIFE TESTING WITH GEOMETRIC PROCESS MODEL. A Thesis. Presented to the. Faculty of. San Diego State University

STATISTICAL INFERENCE IN ACCELERATED LIFE TESTING WITH GEOMETRIC PROCESS MODEL. A Thesis. Presented to the. Faculty of. San Diego State University STATISTICAL INFERENCE IN ACCELERATED LIFE TESTING WITH GEOMETRIC PROCESS MODEL A Thesis Presented to the Faculty of San Diego State University In Partial Fulfillment of the Requirements for the Degree

More information

Regularization and Variable Selection via the Elastic Net

Regularization and Variable Selection via the Elastic Net p. 1/1 Regularization and Variable Selection via the Elastic Net Hui Zou and Trevor Hastie Journal of Royal Statistical Society, B, 2005 Presenter: Minhua Chen, Nov. 07, 2008 p. 2/1 Agenda Introduction

More information

UNIVERSITÄT POTSDAM Institut für Mathematik

UNIVERSITÄT POTSDAM Institut für Mathematik UNIVERSITÄT POTSDAM Institut für Mathematik Testing the Acceleration Function in Life Time Models Hannelore Liero Matthias Liero Mathematische Statistik und Wahrscheinlichkeitstheorie Universität Potsdam

More information

A New Bayesian Variable Selection Method: The Bayesian Lasso with Pseudo Variables

A New Bayesian Variable Selection Method: The Bayesian Lasso with Pseudo Variables A New Bayesian Variable Selection Method: The Bayesian Lasso with Pseudo Variables Qi Tang (Joint work with Kam-Wah Tsui and Sijian Wang) Department of Statistics University of Wisconsin-Madison Feb. 8,

More information

Likelihood inference in the presence of nuisance parameters

Likelihood inference in the presence of nuisance parameters Likelihood inference in the presence of nuisance parameters Nancy Reid, University of Toronto www.utstat.utoronto.ca/reid/research 1. Notation, Fisher information, orthogonal parameters 2. Likelihood inference

More information

STA414/2104 Statistical Methods for Machine Learning II

STA414/2104 Statistical Methods for Machine Learning II STA414/2104 Statistical Methods for Machine Learning II Murat A. Erdogdu & David Duvenaud Department of Computer Science Department of Statistical Sciences Lecture 3 Slide credits: Russ Salakhutdinov Announcements

More information

Chapter 2 Inference on Mean Residual Life-Overview

Chapter 2 Inference on Mean Residual Life-Overview Chapter 2 Inference on Mean Residual Life-Overview Statistical inference based on the remaining lifetimes would be intuitively more appealing than the popular hazard function defined as the risk of immediate

More information

ISyE 691 Data mining and analytics

ISyE 691 Data mining and analytics ISyE 691 Data mining and analytics Regression Instructor: Prof. Kaibo Liu Department of Industrial and Systems Engineering UW-Madison Email: kliu8@wisc.edu Office: Room 3017 (Mechanical Engineering Building)

More information

Unified LASSO Estimation via Least Squares Approximation

Unified LASSO Estimation via Least Squares Approximation Unified LASSO Estimation via Least Squares Approximation Hansheng Wang and Chenlei Leng Peking University & National University of Singapore First version: May 25, 2006. Revised on March 23, 2007. Abstract

More information

Lecture 2 Machine Learning Review

Lecture 2 Machine Learning Review Lecture 2 Machine Learning Review CMSC 35246: Deep Learning Shubhendu Trivedi & Risi Kondor University of Chicago March 29, 2017 Things we will look at today Formal Setup for Supervised Learning Things

More information

Improving Efficiency of Inferences in Randomized Clinical Trials Using Auxiliary Covariates

Improving Efficiency of Inferences in Randomized Clinical Trials Using Auxiliary Covariates Improving Efficiency of Inferences in Randomized Clinical Trials Using Auxiliary Covariates Anastasios (Butch) Tsiatis Department of Statistics North Carolina State University http://www.stat.ncsu.edu/

More information

Quantile Regression for Residual Life and Empirical Likelihood

Quantile Regression for Residual Life and Empirical Likelihood Quantile Regression for Residual Life and Empirical Likelihood Mai Zhou email: mai@ms.uky.edu Department of Statistics, University of Kentucky, Lexington, KY 40506-0027, USA Jong-Hyeon Jeong email: jeong@nsabp.pitt.edu

More information

Survival Analysis. Lu Tian and Richard Olshen Stanford University

Survival Analysis. Lu Tian and Richard Olshen Stanford University 1 Survival Analysis Lu Tian and Richard Olshen Stanford University 2 Survival Time/ Failure Time/Event Time We will introduce various statistical methods for analyzing survival outcomes What is the survival

More information

Unsupervised Learning Techniques Class 07, 1 March 2006 Andrea Caponnetto

Unsupervised Learning Techniques Class 07, 1 March 2006 Andrea Caponnetto Unsupervised Learning Techniques 9.520 Class 07, 1 March 2006 Andrea Caponnetto About this class Goal To introduce some methods for unsupervised learning: Gaussian Mixtures, K-Means, ISOMAP, HLLE, Laplacian

More information

A note on L convergence of Neumann series approximation in missing data problems

A note on L convergence of Neumann series approximation in missing data problems A note on L convergence of Neumann series approximation in missing data problems Hua Yun Chen Division of Epidemiology & Biostatistics School of Public Health University of Illinois at Chicago 1603 West

More information

FEATURE SCREENING IN ULTRAHIGH DIMENSIONAL

FEATURE SCREENING IN ULTRAHIGH DIMENSIONAL Statistica Sinica 26 (2016), 881-901 doi:http://dx.doi.org/10.5705/ss.2014.171 FEATURE SCREENING IN ULTRAHIGH DIMENSIONAL COX S MODEL Guangren Yang 1, Ye Yu 2, Runze Li 2 and Anne Buu 3 1 Jinan University,

More information

TECHNICAL REPORT NO. 1091r. A Note on the Lasso and Related Procedures in Model Selection

TECHNICAL REPORT NO. 1091r. A Note on the Lasso and Related Procedures in Model Selection DEPARTMENT OF STATISTICS University of Wisconsin 1210 West Dayton St. Madison, WI 53706 TECHNICAL REPORT NO. 1091r April 2004, Revised December 2004 A Note on the Lasso and Related Procedures in Model

More information

Effect of outliers on the variable selection by the regularized regression

Effect of outliers on the variable selection by the regularized regression Communications for Statistical Applications and Methods 2018, Vol. 25, No. 2, 235 243 https://doi.org/10.29220/csam.2018.25.2.235 Print ISSN 2287-7843 / Online ISSN 2383-4757 Effect of outliers on the

More information

UNIVERSITY of PENNSYLVANIA CIS 520: Machine Learning Final, Fall 2013

UNIVERSITY of PENNSYLVANIA CIS 520: Machine Learning Final, Fall 2013 UNIVERSITY of PENNSYLVANIA CIS 520: Machine Learning Final, Fall 2013 Exam policy: This exam allows two one-page, two-sided cheat sheets; No other materials. Time: 2 hours. Be sure to write your name and

More information

On Mixture Regression Shrinkage and Selection via the MR-LASSO

On Mixture Regression Shrinkage and Selection via the MR-LASSO On Mixture Regression Shrinage and Selection via the MR-LASSO Ronghua Luo, Hansheng Wang, and Chih-Ling Tsai Guanghua School of Management, Peing University & Graduate School of Management, University

More information

Linear Model Selection and Regularization

Linear Model Selection and Regularization Linear Model Selection and Regularization Recall the linear model Y = β 0 + β 1 X 1 + + β p X p + ɛ. In the lectures that follow, we consider some approaches for extending the linear model framework. In

More information

Model comparison and selection

Model comparison and selection BS2 Statistical Inference, Lectures 9 and 10, Hilary Term 2008 March 2, 2008 Hypothesis testing Consider two alternative models M 1 = {f (x; θ), θ Θ 1 } and M 2 = {f (x; θ), θ Θ 2 } for a sample (X = x)

More information

Outline. Frailty modelling of Multivariate Survival Data. Clustered survival data. Clustered survival data

Outline. Frailty modelling of Multivariate Survival Data. Clustered survival data. Clustered survival data Outline Frailty modelling of Multivariate Survival Data Thomas Scheike ts@biostat.ku.dk Department of Biostatistics University of Copenhagen Marginal versus Frailty models. Two-stage frailty models: copula

More information

GOODNESS-OF-FIT TESTS FOR ARCHIMEDEAN COPULA MODELS

GOODNESS-OF-FIT TESTS FOR ARCHIMEDEAN COPULA MODELS Statistica Sinica 20 (2010), 441-453 GOODNESS-OF-FIT TESTS FOR ARCHIMEDEAN COPULA MODELS Antai Wang Georgetown University Medical Center Abstract: In this paper, we propose two tests for parametric models

More information

Likelihood-Based Methods

Likelihood-Based Methods Likelihood-Based Methods Handbook of Spatial Statistics, Chapter 4 Susheela Singh September 22, 2016 OVERVIEW INTRODUCTION MAXIMUM LIKELIHOOD ESTIMATION (ML) RESTRICTED MAXIMUM LIKELIHOOD ESTIMATION (REML)

More information

Tuning Parameter Selection in Regularized Estimations of Large Covariance Matrices

Tuning Parameter Selection in Regularized Estimations of Large Covariance Matrices Tuning Parameter Selection in Regularized Estimations of Large Covariance Matrices arxiv:1308.3416v1 [stat.me] 15 Aug 2013 Yixin Fang 1, Binhuan Wang 1, and Yang Feng 2 1 New York University and 2 Columbia

More information

The picasso Package for Nonconvex Regularized M-estimation in High Dimensions in R

The picasso Package for Nonconvex Regularized M-estimation in High Dimensions in R The picasso Package for Nonconvex Regularized M-estimation in High Dimensions in R Xingguo Li Tuo Zhao Tong Zhang Han Liu Abstract We describe an R package named picasso, which implements a unified framework

More information

Some Curiosities Arising in Objective Bayesian Analysis

Some Curiosities Arising in Objective Bayesian Analysis . Some Curiosities Arising in Objective Bayesian Analysis Jim Berger Duke University Statistical and Applied Mathematical Institute Yale University May 15, 2009 1 Three vignettes related to John s work

More information

EXAM IN STATISTICAL MACHINE LEARNING STATISTISK MASKININLÄRNING

EXAM IN STATISTICAL MACHINE LEARNING STATISTISK MASKININLÄRNING EXAM IN STATISTICAL MACHINE LEARNING STATISTISK MASKININLÄRNING DATE AND TIME: August 30, 2018, 14.00 19.00 RESPONSIBLE TEACHER: Niklas Wahlström NUMBER OF PROBLEMS: 5 AIDING MATERIAL: Calculator, mathematical

More information

Large sample theory for merged data from multiple sources

Large sample theory for merged data from multiple sources Large sample theory for merged data from multiple sources Takumi Saegusa University of Maryland Division of Statistics August 22 2018 Section 1 Introduction Problem: Data Integration Massive data are collected

More information