Likelihood Ratio Tests. that Certain Variance Components Are Zero. Ciprian M. Crainiceanu. Department of Statistical Science
|
|
- Loren King
- 5 years ago
- Views:
Transcription
1 1 Likelihood Ratio Tests that Certain Variance Components Are Zero Ciprian M. Crainiceanu Department of Statistical Science Work done jointly with David Ruppert, School of ORIE Cornell University
2 2 Outline A. Examples of problems (Ruppert, Wand and Carroll, 2003) B. Penalized splines as a particular case of LMMs C. Null hypotheses including zero random effects variance in LMMs D. LRTs and F-type tests for this type of hypotheses E. Null distributions and power properties of LRT and RLRT F. Other applications: additive, nonlinear models, goodness-of-fit G. Conclusions
3 3 Mother age vs. child birthweight data ML fit GCV fit birthweight (grams) maternal age (years)
4 4 Janka hardness data linear fit p spline fit log(janka hardness) density
5 5 Nonparametric framework Consider the regression equation, where ɛ i are i.i.d. N ( 0,σɛ 2 ) y i = m (x i ) + ɛ i, Null hypothesis: m( ) is a p q degree polynomial m (x,β) = β 0 + β 1 x β p q x p q Alternative hypothesis: m( ) is a regression spline m (x,θ) = β 0 + β 1 x β p x p + Idea: use likelihood ratio tests K k=1 b k (x κ k ) p +
6 6 P-splines as LMMs Penalized sum of squares estimation criterion (avoid overfitting) n {y i m (x i ;θ)} λ θt Dθ i=1 The Penalized Spline criterion is equivalent to 1 σ 2 ɛ Y Xβ Zb λσɛ 2 b T Σ 1 b, If σb 2 = λσ2 ɛ then minimizing this criterion ML for the LMM Y = Xβ + Zb + ɛ, Cov b ɛ = σ 2 b Σ O K n O n K σ 2 ɛi n
7 7 Smoothing parameter estimation ML(Θ) = n log(σ 2 ɛ) + log {det(v λ )} + (y Xβ)T V 1 λ (y Xβ) REML(Θ) = ML(Θ) (p + 1) log(σ 2 ɛ) + log σ 2 ɛ { det(x T V 1 λ X) }, where Θ = (β, σ 2 ɛ, λ), Cov(Y ) = σ 2 ɛv λ, V λ = I n + λzσz T. Generalized Cross Validation (GCV) can be used to estimate λ (not suited for LRT testing).
8 8 Hypotheses ) H 0 : β p q+1 =... = β p = 0 and λ = 0 (σ b 2 = 0 ) H A : β p q β p 0 λ > 0 (σ b 2 > 0 Why is the problem hard? Non-standard hypothesis under the null (boundary problem) Correlated observations (at least under the alternative) Technicalities (e.g. fixed number of knots, not o(n)!)
9 9 Likelihood Ratio Tests Likelihood and Restricted Likelihood Ratio Tests LRT n = inf H 0 ML(Θ) inf H A ML(Θ) RLRT n = inf H 0 REML(Θ) inf H A REML(Θ) Note: RLRT makes sense only when the fixed effects are the same under H 0 and H A Generality acknowledgement: Same considerations hold for any LMM with one variance component!
10 10 Asymptotic Theory Trivialities Asymptotic distribution is useful iff Provides a good approximation of the finite sample distribution(s) Is much simpler than the finite sample distribution(s) Asymptotic reasoning makes sense Key fact: Asymptotics is just an approximation of finite sample
11 11 Boundary problem asymptotics Test for one parameter on the boundary (q = 0) For independent observations under the alternative: Chernoff (1954), Moran (1971), Chant (1974), Self & Liang (1987, 1995) LRT n 0.5χ χ2 1 Longitudinal Mixed Effects Model, same is true if K Stram and Lee (1994) Y i = X i β + Z i b i + ɛ i
12 12 Pinheiro and Bates (2000), simulations RLRT n : 0.5χ χ2 1, LRT n : 0.65χ χ2 1 Shephard and Harvey (1990) found p in a related model
13 13 Probability mass at zero Test for zero random effects variance in LMM with one variance component (Crainiceanu, Ruppert, Vogelsang, 2002). 1st order conditions for local minimum at λ = 0 β ML(Θ) = 0, σɛ 2 ML(Θ) = 0, λ ML(Θ) 0 Null finite sample LRT n mass at zero ( ) u T P P 0 ZΣZ T P 0 u u T 1 P 0 u n tr(zσzt ) ( 1 u N(0,I n ) and P 0 = I n X X X) T X T
14 14 One-way ANOVA: best case scenario Model Y ik = β 0 + b k + ɛ ik i = 1,...,I; k = 1,...,K; b N(0,σ 2 b I K); ɛ N(0,σ 2 ɛi n ) H 0 : σ 2 b = 0 vs. H A : σ 2 b > 0 Asymptotic (I, K cons.) probability mass at zero ) ) p ML (K) = P (χ 2 K 1 < K p REML (K) = P (χ 2 K 1 < K 1
15 15 ANOVA: Asymptotic mass at zero LRT RLRT 0.8 Probability Number of levels (K)
16 16 Non-parametric testing example Constant mean (p = 0) vs. piecewise constant spline K Y i = β 0 + b k I{x i > κ k } + ɛ i k=1 i = 1,...,n; k = 1,...,K; b N(0,σ 2 b I K); ɛ N(0,σ 2 ɛi n ) H 0 : σ 2 b = 0 vs. H A : σ 2 b > 0 Asymptotic (n, K cons.) Example: x i are equally spaced, κ k is the sample quantile corresponding to k/k + 1
17 17 Asymptotic mass at zero ML REML 0.85 Probability Number of knots
18 18 Cracking the nut 0.5 : 0.5 chi-square mixture approximation useless in this case Calculation of probability mass at zero (finite sample and asymptotic): simple 0.5 : improved by p n : 1 p n approximation Maybe non-zero part of the distribution is not χ 2 1 What happens when q > 0? Number of knots (levels) fixed not o(n)!
19 19 LRT n = n log h(λ,w,µ,ξ) = n log Distribution of LRT n (I) ( q ) 1 + s=1 u2 s n p 1 s=1 ws 2 { 1 + N n(λ,w,µ) D n (λ,w,µ) N n (λ,w,µ) = D n (λ,w,µ) = u s, w s, are i.i.d. N(0, 1) K s=1 K s=1 + sup h(λ,w,µ,ξ) λ 0 } K s=1 λµ s,n 1 + λµ s,n w 2 s w 2 s 1 + λµ s,n + n p 1 s=k+1 log(1 + λξ s,n ) µ s,n ; ξ s,n - K eigenvalues of Σ 1/2 Z T P 0 ZΣ 1/2 ; Σ 1/2 Z T ZΣ 1/2 w 2 s
20 20 Distribution of LRT n (II) Simulation of the finite sample distribution is very simple For K = 20: 5,000 sim/sec (2.66Ghz CPU) Simulations are feasible as long as we can diagonalize matrices Finite sample probability mass at zero for LRT n (q=0) Ks=1 P µ s,n ws 2 n p 1 s=1 ws 2 1 K ξ s,n n s=1 ξ s,n µ s,n for RLRT n (one eigenvalue dominates the others) Wilks phenomenon in finite sample
21 21 What kind of asymptotics? Number of observations n Number of knots K-fixed Conditions on Σ 1/2 Z T P 0 ZΣ 1/2 and Σ 1/2 Z T ZΣ 1/2 n α µ s,n µ s and n α ξ s,n ξ s Many observations, mean function has at most 1 + p + K degrees of freedom, asymptotic structure of K K design matrices
22 22 LRT n asymptotic distribution LRT n q s=1 u 2 s + sup d 0 K s=1 dµ s 1 + dµ s w 2 s K log(1 + dξ s ) s=1 u s, w s are i.i.d. N(0, 1); first part corresponds to q fixed effects; second part corresponds to zero variance; independent Asymptotic probability mass at zero (q = 0) K K LRT : P µ s ws 2 RLRT : P s=1 s=1 ξ s K s=1 µ s w 2 s K s=1 µ s
23 23 Proof details Establish convergence in C[0, ) of the profile LRT (finite dimensional convergence + tightness) Show that a Continuous Mapping Theorem type result holds Catch: sup is not continuous on C[0, ) x n (t) = (n 2 t + n n 3 )I{n 1/n < t n} + ni{t > n} 0 sup t 0 Crainiceanu and Ruppert (2002) x n (t)
24 24 One-way ANOVA: asymptotics LRT n K { XK 1 log ( XK )} { I X K > 1 } RLRT n (K 1) {X K 1 log (X K )} I {X K > 1} X K χ2 K 1 K, X K χ2 K 1 K 1 When K the two distributions 0.5χ χ2 1 Probability at zero always > 0.5. Is the non-zero part χ 2 1?
25 25 QQ plot for LRT LRT>0 ANOVA I: K=3, 5, 20 levels 9 χ Q =6.63
26 26 (R)LRT of linearity Linear mean (p = 1) vs. linear spline (q = 0) K Y i = β 0 + β 1 x i + b k (x i κ k ) + + ɛ i k=1 i = 1,...,n; k = 1,...,K; b N(0,σ 2 b I K); ɛ N(0,σ 2 ɛi n ) H 0 : σ 2 b = 0 vs. H A : σ 2 b > 0 Asymptotic (n, K cons.) Example: x i are equally spaced, κ k is the sample quantile for k/k + 1. Finite sample and asymptotic LRT n distributions χ 2 0!
27 27 Testing linear regression vs. penalized linear spline (K=20 knots). REML Quantiles of distributions n=50 n=100 n= 0.5:0.5 mixture q 0.66 q 0.95 q 0.99 q Quantiles of the asymptotic distribution (n= )
28 28 F and F-type tests Hastie and Tibshirani (1990), Cantoni and Hastie (2002) H 0 : λ = λ 0 vs. H A : λ = λ 1 F γ0,γ 1 = (RSS 0 RSS 1 )/(γ 0 γ 1 ) RSS 1 /γ 1, R λ0,λ 1 = Y T ( Sλ1 S λ0 ) Y Y T ( I n S λ1 ) Y γ λ = tr (I n S λ ) 2 = # d.f. of residuals, S λ = smoother matrix. H 0 : λ = 0 vs. H A : λ > 0 (λ = ˆλ). The null distribution of R 0,λ1 has no mass at zero The null distribution of R 0,ˆλ has >> 0.5 mass at zero
29 29 Power properties Test constant mean vs. a general alternative piecewise constant spline (RLRT) linear spline (LRT) Types of alternatives considered increasing, concave, periodic Crainiceanu, Ruppert, Claeskens, Wand, 2002
30 30 Notations R test is from Cantoni and Hastie (2002) F test is from Hastie and Tibshirani (1990) C : alternative is modeled by a piecewise constant spline L : alternative is modeled by a linear spline 1 : estimate under the alternative has DF one greater than p O : the design matrix is orthogonalized ML : smoothing parameter is estimated using (RE)ML GCV : smoothing parameter is estimated using GCV
31 31 Tests Average Maximum Minimum RLRT-C R-GCV-L R-ML-C F-ML-L R-ML-L F-ML-C F-GCV-L LRT-L LRT-L-O F-1-L R-1-L R-GCV-C RLRT-C-O
32 32 Power results No most powerful test for the three alternatives considered RLRT-C has good power compared to competing tests LRT-L is worse (probability mass at zero) Other good tests exist but their null distributions have to be simulated (5,000 simulations, n = 100, K = 20) R-GCV-L: 30 min vs. RLRT-C: 1 sec Our approach: R and F tests include variability in ˆλ
33 33 Mother age/child birthweight - no effect K = 10 K = 20 value p-value value p-value RLRT-C F-ML-C F-GCV-C LRT-L F-ML-L F-GCV-L
34 34 Janka hardness - linearity K = 5 K = 10 value p-value value p-value RLRT-L F-ML
35 35 Null: more than one covariate y i = m 1 (x 1i ) + m 2 (x 2i ) + ɛ i Both m 1 ( ) and m 2 ( ) can be modeled as splines. The model is Y = X 1 β 1 + X 2 β 2 + Z 1 b 1 + Z 2 b 2 + ɛ b 1 N(0,σ1 2); b 2 N(0,σ2 2); ɛ N(0,σ2 ɛ); independent Tests σ1 2 = 0 and/or σ2 2 = 0 tests vs. nonparametric H A Spectral decomposition of (R)LRT n for LMMs with > 1 variance components (Crainiceanu, Ruppert, Claeskens, Wand, 2002) Warning: theoretical result, practical limitations
36 36 Null: nonlinear regression Linearity in parameters is the only special property of polynomials A parametric regression function can be embedded in a larger space (e.g. using p-splines) Y = f(x, β 1 ) + Xβ 2 + Zb + ɛ b N(0,σ 2 b Σ), ɛ N(0,σ2 ɛi n ), independent H 0 : β 2 = 0, σ 2 b = 0 vs. H A : β 2 0 or σ 2 b 0
37 37 LRT for non-linear regression H 0 : Y i = a + b exp(cx i ) + ɛ i H A : Y i = a + b exp(cx i ) + βx i + K k=1 b k (x i κ k ) + n = 100, x i equally spaced in [0, 2], K = 20 equally spaced knots. 5,000 simulations / 20 minutes Distribution LRT n for 2 parameters (β = 0, σ 2 b = 0): χ2 1 WARNING: NO RLRT!
38 38 Model linearization Let ˆβ 1 be the MLE under the null Y = f(x, β 1 ) + ɛ Approximate f(x,β 1 ) by a first order Taylor expansion f(x,β 1 ) = f(x, ˆβ 1 ) + [ ] T f(x, β ˆβ 1 ) (β 1 ˆβ 1 ) 1 Test for the null of linear regression against a general alternative This problem is solved! RLRT can be used!
39 39 Goodness-of-fit Testing (constant, linear mean, etc.) against a general alternative is testing goodness-of-fit Classical goodness-of-fit tests use standardized residuals If the null model is correct, standardized residuals under the null should behave like white noise We can use (R)LRT for no-effect of the estimated residuals r i = µ + e i Empirical approach: Ruppert, Wand, Carroll, 2002
40 40 (R)LRT bootstrap R(LRT) distribution for LMM with more than one variance component (feasible: L = 2, 3) Exact linear, non-linear, GLM null distributions Low order smoothers make simulations tractable Hard to implement in standard software Recommended when unsure about theoretical results Our initial approach to inference (that s how we got p 0 >> 0.5)
41 41 Conclusions Testing for null including zero random effects variance in LMMs Unified testing theory (penalized likelihood models) (R)LRT null finite sample, asymptotic distributions + power Standard asymptotics for i.i.d. data does not apply Feasible extensions: additive, nonlinear models, GLMs P-splines are a powerful and flexible tool for (R)LRT testing of parametric versus nonparametric models
Some properties of Likelihood Ratio Tests in Linear Mixed Models
Some properties of Likelihood Ratio Tests in Linear Mixed Models Ciprian M. Crainiceanu David Ruppert Timothy J. Vogelsang September 19, 2003 Abstract We calculate the finite sample probability mass-at-zero
More informationExact Likelihood Ratio Tests for Penalized Splines
Exact Likelihood Ratio Tests for Penalized Splines By CIPRIAN CRAINICEANU, DAVID RUPPERT, GERDA CLAESKENS, M.P. WAND Department of Biostatistics, Johns Hopkins University, 615 N. Wolfe Street, Baltimore,
More informationRestricted Likelihood Ratio Tests in Nonparametric Longitudinal Models
Restricted Likelihood Ratio Tests in Nonparametric Longitudinal Models Short title: Restricted LR Tests in Longitudinal Models Ciprian M. Crainiceanu David Ruppert May 5, 2004 Abstract We assume that repeated
More informationLikelihood ratio testing for zero variance components in linear mixed models
Likelihood ratio testing for zero variance components in linear mixed models Sonja Greven 1,3, Ciprian Crainiceanu 2, Annette Peters 3 and Helmut Küchenhoff 1 1 Department of Statistics, LMU Munich University,
More informationRLRsim: Testing for Random Effects or Nonparametric Regression Functions in Additive Mixed Models
RLRsim: Testing for Random Effects or Nonparametric Regression Functions in Additive Mixed Models Fabian Scheipl 1 joint work with Sonja Greven 1,2 and Helmut Küchenhoff 1 1 Department of Statistics, LMU
More informationON EXACT INFERENCE IN LINEAR MODELS WITH TWO VARIANCE-COVARIANCE COMPONENTS
Ø Ñ Å Ø Ñ Ø Ð ÈÙ Ð Ø ÓÒ DOI: 10.2478/v10127-012-0017-9 Tatra Mt. Math. Publ. 51 (2012), 173 181 ON EXACT INFERENCE IN LINEAR MODELS WITH TWO VARIANCE-COVARIANCE COMPONENTS Júlia Volaufová Viktor Witkovský
More informationOn testing an unspecified function through a linear mixed effects model with. multiple variance components
Biometrics 63, 1?? December 2011 DOI: 10.1111/j.1541-0420.2005.00454.x On testing an unspecified function through a linear mixed effects model with multiple variance components Yuanjia Wang and Huaihou
More informationHypothesis Testing in Smoothing Spline Models
Hypothesis Testing in Smoothing Spline Models Anna Liu and Yuedong Wang October 10, 2002 Abstract This article provides a unified and comparative review of some existing test methods for the hypothesis
More informationPenalized Splines, Mixed Models, and Recent Large-Sample Results
Penalized Splines, Mixed Models, and Recent Large-Sample Results David Ruppert Operations Research & Information Engineering, Cornell University Feb 4, 2011 Collaborators Matt Wand, University of Wollongong
More informationThe Hodrick-Prescott Filter
The Hodrick-Prescott Filter A Special Case of Penalized Spline Smoothing Alex Trindade Dept. of Mathematics & Statistics, Texas Tech University Joint work with Rob Paige, Missouri University of Science
More informationOn the Behavior of Marginal and Conditional Akaike Information Criteria in Linear Mixed Models
On the Behavior of Marginal and Conditional Akaike Information Criteria in Linear Mixed Models Thomas Kneib Institute of Statistics and Econometrics Georg-August-University Göttingen Department of Statistics
More informationLikelihood Ratio Tests for Dependent Data with Applications to Longitudinal and Functional Data Analysis
Likelihood Ratio Tests for Dependent Data with Applications to Longitudinal and Functional Data Analysis Ana-Maria Staicu Yingxing Li Ciprian M. Crainiceanu David Ruppert October 13, 2013 Abstract The
More informationNow consider the case where E(Y) = µ = Xβ and V (Y) = σ 2 G, where G is diagonal, but unknown.
Weighting We have seen that if E(Y) = Xβ and V (Y) = σ 2 G, where G is known, the model can be rewritten as a linear model. This is known as generalized least squares or, if G is diagonal, with trace(g)
More informationOn the Behavior of Marginal and Conditional Akaike Information Criteria in Linear Mixed Models
On the Behavior of Marginal and Conditional Akaike Information Criteria in Linear Mixed Models Thomas Kneib Department of Mathematics Carl von Ossietzky University Oldenburg Sonja Greven Department of
More informationStatistics 203: Introduction to Regression and Analysis of Variance Course review
Statistics 203: Introduction to Regression and Analysis of Variance Course review Jonathan Taylor - p. 1/?? Today Review / overview of what we learned. - p. 2/?? General themes in regression models Specifying
More informationStat 710: Mathematical Statistics Lecture 31
Stat 710: Mathematical Statistics Lecture 31 Jun Shao Department of Statistics University of Wisconsin Madison, WI 53706, USA Jun Shao (UW-Madison) Stat 710, Lecture 31 April 13, 2009 1 / 13 Lecture 31:
More informationNonparametric Small Area Estimation Using Penalized Spline Regression
Nonparametric Small Area Estimation Using Penalized Spline Regression 0verview Spline-based nonparametric regression Nonparametric small area estimation Prediction mean squared error Bootstrapping small
More informationPeter Hoff Linear and multilinear models April 3, GLS for multivariate regression 5. 3 Covariance estimation for the GLM 8
Contents 1 Linear model 1 2 GLS for multivariate regression 5 3 Covariance estimation for the GLM 8 4 Testing the GLH 11 A reference for some of this material can be found somewhere. 1 Linear model Recall
More informationMaster s Written Examination
Master s Written Examination Option: Statistics and Probability Spring 016 Full points may be obtained for correct answers to eight questions. Each numbered question which may have several parts is worth
More informationLinear Regression. In this problem sheet, we consider the problem of linear regression with p predictors and one intercept,
Linear Regression In this problem sheet, we consider the problem of linear regression with p predictors and one intercept, y = Xβ + ɛ, where y t = (y 1,..., y n ) is the column vector of target values,
More informationNonparametric Inference In Functional Data
Nonparametric Inference In Functional Data Zuofeng Shang Purdue University Joint work with Guang Cheng from Purdue Univ. An Example Consider the functional linear model: Y = α + where 1 0 X(t)β(t)dt +
More informationNonconcave Penalized Likelihood with A Diverging Number of Parameters
Nonconcave Penalized Likelihood with A Diverging Number of Parameters Jianqing Fan and Heng Peng Presenter: Jiale Xu March 12, 2010 Jianqing Fan and Heng Peng Presenter: JialeNonconcave Xu () Penalized
More informationDiscussion of the paper Inference for Semiparametric Models: Some Questions and an Answer by Bickel and Kwon
Discussion of the paper Inference for Semiparametric Models: Some Questions and an Answer by Bickel and Kwon Jianqing Fan Department of Statistics Chinese University of Hong Kong AND Department of Statistics
More informationSmoothness Selection. Simon Wood Mathematical Sciences, University of Bath, U.K.
Smoothness Selection Simon Wood Mathematical Sciences, Universit of Bath, U.K. Smoothness selection approaches The smoothing model i = f ( i ) + ɛ i, ɛ i N(0, σ 2 ), is represented via a basis epansion
More informationRestricted Maximum Likelihood in Linear Regression and Linear Mixed-Effects Model
Restricted Maximum Likelihood in Linear Regression and Linear Mixed-Effects Model Xiuming Zhang zhangxiuming@u.nus.edu A*STAR-NUS Clinical Imaging Research Center October, 015 Summary This report derives
More informationStat 5101 Lecture Notes
Stat 5101 Lecture Notes Charles J. Geyer Copyright 1998, 1999, 2000, 2001 by Charles J. Geyer May 7, 2001 ii Stat 5101 (Geyer) Course Notes Contents 1 Random Variables and Change of Variables 1 1.1 Random
More informationA NOTE ON A NONPARAMETRIC REGRESSION TEST THROUGH PENALIZED SPLINES
Statistica Sinica 24 (2014), 1143-1160 doi:http://dx.doi.org/10.5705/ss.2012.230 A NOTE ON A NONPARAMETRIC REGRESSION TEST THROUGH PENALIZED SPLINES Huaihou Chen 1, Yuanjia Wang 2, Runze Li 3 and Katherine
More informationRestricted Likelihood Ratio Lack of Fit Tests using Mixed Spline Models
Restricted Likelihood Ratio Lack of Fit Tests using Mixed Spline Models Gerda Claeskens Texas A&M University, College Station, USA and Université catholique de Louvain, Louvain-la-Neuve, Belgium Summary
More informationInversion Base Height. Daggot Pressure Gradient Visibility (miles)
Stanford University June 2, 1998 Bayesian Backtting: 1 Bayesian Backtting Trevor Hastie Stanford University Rob Tibshirani University of Toronto Email: trevor@stat.stanford.edu Ftp: stat.stanford.edu:
More information1 Mixed effect models and longitudinal data analysis
1 Mixed effect models and longitudinal data analysis Mixed effects models provide a flexible approach to any situation where data have a grouping structure which introduces some kind of correlation between
More informationBIO5312 Biostatistics Lecture 13: Maximum Likelihood Estimation
BIO5312 Biostatistics Lecture 13: Maximum Likelihood Estimation Yujin Chung November 29th, 2016 Fall 2016 Yujin Chung Lec13: MLE Fall 2016 1/24 Previous Parametric tests Mean comparisons (normality assumption)
More information[y i α βx i ] 2 (2) Q = i=1
Least squares fits This section has no probability in it. There are no random variables. We are given n points (x i, y i ) and want to find the equation of the line that best fits them. We take the equation
More informationRegression, Ridge Regression, Lasso
Regression, Ridge Regression, Lasso Fabio G. Cozman - fgcozman@usp.br October 2, 2018 A general definition Regression studies the relationship between a response variable Y and covariates X 1,..., X n.
More informationHigh-dimensional regression with unknown variance
High-dimensional regression with unknown variance Christophe Giraud Ecole Polytechnique march 2012 Setting Gaussian regression with unknown variance: Y i = f i + ε i with ε i i.i.d. N (0, σ 2 ) f = (f
More informationA Significance Test for the Lasso
A Significance Test for the Lasso Lockhart R, Taylor J, Tibshirani R, and Tibshirani R Ashley Petersen May 14, 2013 1 Last time Problem: Many clinical covariates which are important to a certain medical
More informationGeneralized Additive Models
Generalized Additive Models The Model The GLM is: g( µ) = ß 0 + ß 1 x 1 + ß 2 x 2 +... + ß k x k The generalization to the GAM is: g(µ) = ß 0 + f 1 (x 1 ) + f 2 (x 2 ) +... + f k (x k ) where the functions
More information9. Model Selection. statistical models. overview of model selection. information criteria. goodness-of-fit measures
FE661 - Statistical Methods for Financial Engineering 9. Model Selection Jitkomut Songsiri statistical models overview of model selection information criteria goodness-of-fit measures 9-1 Statistical models
More informationLecture 2: Linear Models. Bruce Walsh lecture notes Seattle SISG -Mixed Model Course version 23 June 2011
Lecture 2: Linear Models Bruce Walsh lecture notes Seattle SISG -Mixed Model Course version 23 June 2011 1 Quick Review of the Major Points The general linear model can be written as y = X! + e y = vector
More informationIntegrated Likelihood Estimation in Semiparametric Regression Models. Thomas A. Severini Department of Statistics Northwestern University
Integrated Likelihood Estimation in Semiparametric Regression Models Thomas A. Severini Department of Statistics Northwestern University Joint work with Heping He, University of York Introduction Let Y
More informationCan we do statistical inference in a non-asymptotic way? 1
Can we do statistical inference in a non-asymptotic way? 1 Guang Cheng 2 Statistics@Purdue www.science.purdue.edu/bigdata/ ONR Review Meeting@Duke Oct 11, 2017 1 Acknowledge NSF, ONR and Simons Foundation.
More informationSemiparametric Mixed Model for Evaluating Pathway-Environment Interaction
Semiparametric Mixed Model for Evaluating Pathway-Environment Interaction arxiv:1206.2716v1 [stat.me] 13 Jun 2012 Zaili Fang 1, Inyoung Kim 1, and Jeesun Jung 2 June 14, 2012 1 Department of Statistics,
More informationLinear Mixed Models. One-way layout REML. Likelihood. Another perspective. Relationship to classical ideas. Drawbacks.
Linear Mixed Models One-way layout Y = Xβ + Zb + ɛ where X and Z are specified design matrices, β is a vector of fixed effect coefficients, b and ɛ are random, mean zero, Gaussian if needed. Usually think
More informationGeneralized, Linear, and Mixed Models
Generalized, Linear, and Mixed Models CHARLES E. McCULLOCH SHAYLER.SEARLE Departments of Statistical Science and Biometrics Cornell University A WILEY-INTERSCIENCE PUBLICATION JOHN WILEY & SONS, INC. New
More informationChapter 4: Constrained estimators and tests in the multiple linear regression model (Part III)
Chapter 4: Constrained estimators and tests in the multiple linear regression model (Part III) Florian Pelgrin HEC September-December 2010 Florian Pelgrin (HEC) Constrained estimators September-December
More informationBasis Penalty Smoothers. Simon Wood Mathematical Sciences, University of Bath, U.K.
Basis Penalty Smoothers Simon Wood Mathematical Sciences, University of Bath, U.K. Estimating functions It is sometimes useful to estimate smooth functions from data, without being too precise about the
More informationPh.D. Qualifying Exam Friday Saturday, January 3 4, 2014
Ph.D. Qualifying Exam Friday Saturday, January 3 4, 2014 Put your solution to each problem on a separate sheet of paper. Problem 1. (5166) Assume that two random samples {x i } and {y i } are independently
More informationStatistical Inference
Statistical Inference Liu Yang Florida State University October 27, 2016 Liu Yang, Libo Wang (Florida State University) Statistical Inference October 27, 2016 1 / 27 Outline The Bayesian Lasso Trevor Park
More informationNonstationary Regressors using Penalized Splines
Estimation and Inference for Varying-coefficient Models with Nonstationary Regressors using Penalized Splines Haiqiang Chen, Ying Fang and Yingxing Li Wang Yanan Institute for Studies in Economics, MOE
More informationA Modern Look at Classical Multivariate Techniques
A Modern Look at Classical Multivariate Techniques Yoonkyung Lee Department of Statistics The Ohio State University March 16-20, 2015 The 13th School of Probability and Statistics CIMAT, Guanajuato, Mexico
More informationRecap. HW due Thursday by 5 pm Next HW coming on Thursday Logistic regression: Pr(G = k X) linear on the logit scale Linear discriminant analysis:
1 / 23 Recap HW due Thursday by 5 pm Next HW coming on Thursday Logistic regression: Pr(G = k X) linear on the logit scale Linear discriminant analysis: Pr(G = k X) Pr(X G = k)pr(g = k) Theory: LDA more
More informationIntroduction to Estimation Methods for Time Series models Lecture 2
Introduction to Estimation Methods for Time Series models Lecture 2 Fulvio Corsi SNS Pisa Fulvio Corsi Introduction to Estimation () Methods for Time Series models Lecture 2 SNS Pisa 1 / 21 Estimators:
More informationOn the Behavior of Marginal and Conditional Akaike Information Criteria in Linear Mixed Models
On the Behavior of Marginal and Conditional Akaike Information Criteria in Linear Mixed Models Thomas Kneib Department of Mathematics Carl von Ossietzky University Oldenburg Sonja Greven Department of
More informationLecture 3: Linear Models. Bruce Walsh lecture notes Uppsala EQG course version 28 Jan 2012
Lecture 3: Linear Models Bruce Walsh lecture notes Uppsala EQG course version 28 Jan 2012 1 Quick Review of the Major Points The general linear model can be written as y = X! + e y = vector of observed
More informationStat/F&W Ecol/Hort 572 Review Points Ané, Spring 2010
1 Linear models Y = Xβ + ɛ with ɛ N (0, σ 2 e) or Y N (Xβ, σ 2 e) where the model matrix X contains the information on predictors and β includes all coefficients (intercept, slope(s) etc.). 1. Number of
More informationWeb Appendix for Hierarchical Adaptive Regression Kernels for Regression with Functional Predictors by D. B. Woodard, C. Crainiceanu, and D.
Web Appendix for Hierarchical Adaptive Regression Kernels for Regression with Functional Predictors by D. B. Woodard, C. Crainiceanu, and D. Ruppert A. EMPIRICAL ESTIMATE OF THE KERNEL MIXTURE Here we
More informationA brief introduction to mixed models
A brief introduction to mixed models University of Gothenburg Gothenburg April 6, 2017 Outline An introduction to mixed models based on a few examples: Definition of standard mixed models. Parameter estimation.
More informationIllustration of the Varying Coefficient Model for Analyses the Tree Growth from the Age and Space Perspectives
TR-No. 14-06, Hiroshima Statistical Research Group, 1 11 Illustration of the Varying Coefficient Model for Analyses the Tree Growth from the Age and Space Perspectives Mariko Yamamura 1, Keisuke Fukui
More informationParametric Modelling of Over-dispersed Count Data. Part III / MMath (Applied Statistics) 1
Parametric Modelling of Over-dispersed Count Data Part III / MMath (Applied Statistics) 1 Introduction Poisson regression is the de facto approach for handling count data What happens then when Poisson
More informationREGRESSION WITH SPATIALLY MISALIGNED DATA. Lisa Madsen Oregon State University David Ruppert Cornell University
REGRESSION ITH SPATIALL MISALIGNED DATA Lisa Madsen Oregon State University David Ruppert Cornell University SPATIALL MISALIGNED DATA 10 X X X X X X X X 5 X X X X X 0 X 0 5 10 OUTLINE 1. Introduction 2.
More informationStatistics for analyzing and modeling precipitation isotope ratios in IsoMAP
Statistics for analyzing and modeling precipitation isotope ratios in IsoMAP The IsoMAP uses the multiple linear regression and geostatistical methods to analyze isotope data Suppose the response variable
More informationModelling Ireland s exchange rates: from EMS to EMU
From the SelectedWorks of Derek Bond November, 2007 Modelling Ireland s exchange rates: from EMS to EMU Derek Bond, University of Ulster Available at: https://works.bepress.com/derek_bond/15/ Background
More informationNonparametric Regression. Badr Missaoui
Badr Missaoui Outline Kernel and local polynomial regression. Penalized regression. We are given n pairs of observations (X 1, Y 1 ),...,(X n, Y n ) where Y i = r(x i ) + ε i, i = 1,..., n and r(x) = E(Y
More informationA Tolerance Interval Approach for Assessment of Agreement in Method Comparison Studies with Repeated Measurements
A Tolerance Interval Approach for Assessment of Agreement in Method Comparison Studies with Repeated Measurements Pankaj K. Choudhary 1 Department of Mathematical Sciences, University of Texas at Dallas
More informationIntroduction to machine learning and pattern recognition Lecture 2 Coryn Bailer-Jones
Introduction to machine learning and pattern recognition Lecture 2 Coryn Bailer-Jones http://www.mpia.de/homes/calj/mlpr_mpia2008.html 1 1 Last week... supervised and unsupervised methods need adaptive
More informationSome General Types of Tests
Some General Types of Tests We may not be able to find a UMP or UMPU test in a given situation. In that case, we may use test of some general class of tests that often have good asymptotic properties.
More informationLinear Regression. September 27, Chapter 3. Chapter 3 September 27, / 77
Linear Regression Chapter 3 September 27, 2016 Chapter 3 September 27, 2016 1 / 77 1 3.1. Simple linear regression 2 3.2 Multiple linear regression 3 3.3. The least squares estimation 4 3.4. The statistical
More informationAFT Models and Empirical Likelihood
AFT Models and Empirical Likelihood Mai Zhou Department of Statistics, University of Kentucky Collaborators: Gang Li (UCLA); A. Bathke; M. Kim (Kentucky) Accelerated Failure Time (AFT) models: Y = log(t
More informationAPTS course: 20th August 24th August 2018
APTS course: 20th August 24th August 2018 Flexible Regression Preliminary Material Claire Miller & Tereza Neocleous The term flexible regression refers to a wide range of methods which provide flexibility
More informationChapter 8: Hypothesis Testing Lecture 9: Likelihood ratio tests
Chapter 8: Hypothesis Testing Lecture 9: Likelihood ratio tests Throughout this chapter we consider a sample X taken from a population indexed by θ Θ R k. Instead of estimating the unknown parameter, we
More informationLecture 32: Asymptotic confidence sets and likelihoods
Lecture 32: Asymptotic confidence sets and likelihoods Asymptotic criterion In some problems, especially in nonparametric problems, it is difficult to find a reasonable confidence set with a given confidence
More informationCh 2: Simple Linear Regression
Ch 2: Simple Linear Regression 1. Simple Linear Regression Model A simple regression model with a single regressor x is y = β 0 + β 1 x + ɛ, where we assume that the error ɛ is independent random component
More informationLinear Methods for Prediction
Chapter 5 Linear Methods for Prediction 5.1 Introduction We now revisit the classification problem and focus on linear methods. Since our prediction Ĝ(x) will always take values in the discrete set G we
More informationQuick Review on Linear Multiple Regression
Quick Review on Linear Multiple Regression Mei-Yuan Chen Department of Finance National Chung Hsing University March 6, 2007 Introduction for Conditional Mean Modeling Suppose random variables Y, X 1,
More informationsimple if it completely specifies the density of x
3. Hypothesis Testing Pure significance tests Data x = (x 1,..., x n ) from f(x, θ) Hypothesis H 0 : restricts f(x, θ) Are the data consistent with H 0? H 0 is called the null hypothesis simple if it completely
More informationSchool of Education, Culture and Communication Division of Applied Mathematics
School of Education, Culture and Communication Division of Applied Mathematics MASTER THESIS IN MATHEMATICS / APPLIED MATHEMATICS Estimation and Testing the Quotient of Two Models by Marko Dimitrov Masterarbete
More informationECON 5350 Class Notes Functional Form and Structural Change
ECON 5350 Class Notes Functional Form and Structural Change 1 Introduction Although OLS is considered a linear estimator, it does not mean that the relationship between Y and X needs to be linear. In this
More informationBinary choice 3.3 Maximum likelihood estimation
Binary choice 3.3 Maximum likelihood estimation Michel Bierlaire Output of the estimation We explain here the various outputs from the maximum likelihood estimation procedure. Solution of the maximum likelihood
More informationData Mining Stat 588
Data Mining Stat 588 Lecture 9: Basis Expansions Department of Statistics & Biostatistics Rutgers University Nov 01, 2011 Regression and Classification Linear Regression. E(Y X) = f(x) We want to learn
More informationA significance test for the lasso
1 First part: Joint work with Richard Lockhart (SFU), Jonathan Taylor (Stanford), and Ryan Tibshirani (Carnegie-Mellon Univ.) Second part: Joint work with Max Grazier G Sell, Stefan Wager and Alexandra
More informationThe Restricted Likelihood Ratio Test at the Boundary in Autoregressive Series
The Restricted Likelihood Ratio Test at the Boundary in Autoregressive Series Willa W. Chen Rohit S. Deo July 6, 009 Abstract. The restricted likelihood ratio test, RLRT, for the autoregressive coefficient
More informationAn Introduction to Functional Data Analysis
An Introduction to Functional Data Analysis Chongzhi Di Fred Hutchinson Cancer Research Center cdi@fredhutch.org Biotat 578A: Special Topics in (Genetic) Epidemiology November 10, 2015 Textbook Ramsay
More informationGeneral Linear Model: Statistical Inference
Chapter 6 General Linear Model: Statistical Inference 6.1 Introduction So far we have discussed formulation of linear models (Chapter 1), estimability of parameters in a linear model (Chapter 4), least
More informationTesting Some Covariance Structures under a Growth Curve Model in High Dimension
Department of Mathematics Testing Some Covariance Structures under a Growth Curve Model in High Dimension Muni S. Srivastava and Martin Singull LiTH-MAT-R--2015/03--SE Department of Mathematics Linköping
More informationSupplementary Materials for Residuals and Diagnostics for Ordinal Regression Models: A Surrogate Approach
Supplementary Materials for Residuals and Diagnostics for Ordinal Regression Models: A Surrogate Approach Part A: Figures and tables Figure 2: An illustration of the sampling procedure to generate a surrogate
More informationLet us first identify some classes of hypotheses. simple versus simple. H 0 : θ = θ 0 versus H 1 : θ = θ 1. (1) one-sided
Let us first identify some classes of hypotheses. simple versus simple H 0 : θ = θ 0 versus H 1 : θ = θ 1. (1) one-sided H 0 : θ θ 0 versus H 1 : θ > θ 0. (2) two-sided; null on extremes H 0 : θ θ 1 or
More informationMA 575 Linear Models: Cedric E. Ginestet, Boston University Midterm Review Week 7
MA 575 Linear Models: Cedric E. Ginestet, Boston University Midterm Review Week 7 1 Random Vectors Let a 0 and y be n 1 vectors, and let A be an n n matrix. Here, a 0 and A are non-random, whereas y is
More informationStat 579: Generalized Linear Models and Extensions
Stat 579: Generalized Linear Models and Extensions Linear Mixed Models for Longitudinal Data Yan Lu April, 2018, week 15 1 / 38 Data structure t1 t2 tn i 1st subject y 11 y 12 y 1n1 Experimental 2nd subject
More informationA General Framework for Variable Selection in Linear Mixed Models with Applications to Genetic Studies with Structured Populations
A General Framework for Variable Selection in Linear Mixed Models with Applications to Genetic Studies with Structured Populations Joint work with Karim Oualkacha (UQÀM), Yi Yang (McGill), Celia Greenwood
More informationStatistical Inference
Statistical Inference J. Daunizeau Institute of Empirical Research in Economics, Zurich, Switzerland Brain and Spine Institute, Paris, France SPM Course Edinburgh, April 2011 Image time-series Spatial
More informationEcon 583 Final Exam Fall 2008
Econ 583 Final Exam Fall 2008 Eric Zivot December 11, 2008 Exam is due at 9:00 am in my office on Friday, December 12. 1 Maximum Likelihood Estimation and Asymptotic Theory Let X 1,...,X n be iid random
More informationStat 135, Fall 2006 A. Adhikari HOMEWORK 10 SOLUTIONS
Stat 135, Fall 2006 A. Adhikari HOMEWORK 10 SOLUTIONS 1a) The model is cw i = β 0 + β 1 el i + ɛ i, where cw i is the weight of the ith chick, el i the length of the egg from which it hatched, and ɛ i
More informationA Resampling Method on Pivotal Estimating Functions
A Resampling Method on Pivotal Estimating Functions Kun Nie Biostat 277,Winter 2004 March 17, 2004 Outline Introduction A General Resampling Method Examples - Quantile Regression -Rank Regression -Simulation
More informationSTAT5044: Regression and Anova. Inyoung Kim
STAT5044: Regression and Anova Inyoung Kim 2 / 51 Outline 1 Matrix Expression 2 Linear and quadratic forms 3 Properties of quadratic form 4 Properties of estimates 5 Distributional properties 3 / 51 Matrix
More informationSTA 2201/442 Assignment 2
STA 2201/442 Assignment 2 1. This is about how to simulate from a continuous univariate distribution. Let the random variable X have a continuous distribution with density f X (x) and cumulative distribution
More information1 One-way analysis of variance
LIST OF FORMULAS (Version from 21. November 2014) STK2120 1 One-way analysis of variance Assume X ij = µ+α i +ɛ ij ; j = 1, 2,..., J i ; i = 1, 2,..., I ; where ɛ ij -s are independent and N(0, σ 2 ) distributed.
More informationPart IB Statistics. Theorems with proof. Based on lectures by D. Spiegelhalter Notes taken by Dexter Chua. Lent 2015
Part IB Statistics Theorems with proof Based on lectures by D. Spiegelhalter Notes taken by Dexter Chua Lent 2015 These notes are not endorsed by the lecturers, and I have modified them (often significantly)
More informationCh 3: Multiple Linear Regression
Ch 3: Multiple Linear Regression 1. Multiple Linear Regression Model Multiple regression model has more than one regressor. For example, we have one response variable and two regressor variables: 1. delivery
More informationStat 5102 Final Exam May 14, 2015
Stat 5102 Final Exam May 14, 2015 Name Student ID The exam is closed book and closed notes. You may use three 8 1 11 2 sheets of paper with formulas, etc. You may also use the handouts on brand name distributions
More informationMultilevel Models in Matrix Form. Lecture 7 July 27, 2011 Advanced Multivariate Statistical Methods ICPSR Summer Session #2
Multilevel Models in Matrix Form Lecture 7 July 27, 2011 Advanced Multivariate Statistical Methods ICPSR Summer Session #2 Today s Lecture Linear models from a matrix perspective An example of how to do
More informationAIR FORCE INSTITUTE OF TECHNOLOGY
TOPICS IN STATISTICAL CALIBRATION DISSERTATION Brandon M. Greenwell, Civilian AFIT-ENC-DS-14-M-01 DEPARTMENT OF THE AIR FORCE AIR UNIVERSITY AIR FORCE INSTITUTE OF TECHNOLOGY Wright-Patterson Air Force
More information