Function-on-Scalar Regression with the refund Package

Size: px
Start display at page:

Download "Function-on-Scalar Regression with the refund Package"

Transcription

1 University of Haifa From the SelectedWorks of Philip T. Reiss July 30, 2012 Function-on-Scalar Regression with the refund Package Philip T. Reiss, New York University Available at:

2 Function-on-scalar regression with the refund package Philip T. Reiss New York University and Nathan Kline Institute Joint Statistical Meetings San Diego, July 30, 2012 Joint work with Lei Huang (NYU / Johns Hopkins), Lan Huo (NYU), Fabian Scheipl (LMU Munich) Research supported in part by grant DMS , U.S. National Science Foundation 1 / 22

3 2 / 22 Outline The refund package Function-on-scalar regression: raw vs. preprocessed responses Within-function dependence Hypothesis testing

4 3 / 22 Regression functions in the refund package (Crainiceanu et al., 2012) Predictors Scalar Functional Responses Scalar fpcr, (l)pfr, (l)peer, wcr, wnet Functional fosr, pffr pffr

5 4 / 22 A function-on-scalar regression example: Midsagittal corpus callosum width

6 5 / 22 We ll consider corpus callosum width functions derived from 98 controls 70 subjects with very mild dementia, and 28 with mild dementia, based on MRI scans from the Open Access Series of Imaging Studies (OASIS) database (Marcus et al., 2007). Image processing at Nathan Kline Institute: each individual s corpus callosum was extracted by an automated procedure developed by Babak Ardekani; width at each of 99 points, from anterior to posterior, along the midsagittal plane was computed by Alvin Bachman.

7 We thereby obtain 196 corpus callosum width curves (functional responses)... Effect age Effect Control Very mild Mild Position Position... which we are primarily interested in regressing on diagnostic group. 6 / 22

8 7 / 22 Function-on-scalar regression with raw-data responses Given response functions y 1,..., y n each observed at s 1,..., s T S, and scalar predictor matrix X = (x im ) 1 i n,1 m p (usually x i1 1), we consider the repeated-measures varying-coefficient model y i (s t) = x im β m(s t) + ε i (s t), (1) where the error functions ε i arise from a mean-zero process on S. Representing the coefficient functions as β m(s) = θ T mb(s) for some θ m R K (m = 1,..., p), where b(s) = [b 1 (s),..., b K (s)] T denotes basis functions (often B-splines), we reduce (1) to the n T matrix equation Y = XΘB T + E where Y = [y i (s t)] 1 i n,1 t T, Θ (p K ) is the coefficient matrix, B (T K ) is the basis function evaluation matrix, and E = [ε i (s t)] 1 i n,1 t T. Our estimate of Θ is the minimizer of a penalized double SSE, say n T [ ] 2 [y i (s t) (XΘB T ) it ] 2 + λ m θ T mb (s) ds. i=1 t=1 }{{} β m (s)2

9 8 / 22 Criterion n i=1 T [y i (s t) (XΘB T ) it ] 2 + t=1 [ 2 λ m θ T mb (s)] ds can be rewritten in the generalized ridge regression form where vec(y T ) (X B)vec(Θ T ) 2 + vec(θ T ) T (Λ P)vec(Θ T ) = y X B θ 2 + θ T P λ θ, (2) y = vec(y T ), θ = vec(θ T ), X B = X B, [ ] P λ = Λ P = diag(λ 1,..., λ p) b k (s)b l (s)ds For given λ 1,..., λ p, (2) is minimized by ˆθ = (X T BX B + P Λ ) 1 X T By. 1 k,l K.

10 9 / 22 From raw to preprocessed responses Sometimes (e.g., large data sets) one works with preprocessed response functions given by an n K matrix C of basis coefficients such that y i (s) = c T i b(s) for i = 1,..., n. fda package (Ramsay and Silverman, 2005; Ramsay et al., 2009) chooses Θ to minimize the penalized integrated sum of squared errors n [ 2 [c T i b(s) x T i Θb(s)] 2 ds + λ m θ T mb (s)] ds. i=1 Solution can be derived (Reiss et al., 2010) as limit of raw-response fit θ = (X T BX B + P Λ ) 1 X T By for uniform grid s 1,..., s T as T : 1 T X T BX B = 1 T (X 1 B)T (X B) = T (X T X) (B T B) [ ] = (X T 1 T X) b k (s t)b l (s t) T t=1 [ ] (X T X) b k (s)b l (s)ds fosr works with either raw or preprocessed responses. 1 k,l K 1 k,l K.

11 We simulated 3 groups 50 functional responses from y i (s) = µ(s) + β gp(i) (s) + η i (s) }{{} GP(0,σ 2 η ρ s t ) + ξ i (s) }{{} WN(σ 2 ξ =0.22 ) σ η = 0.2 σ η = and estimated µ and β 1, β 2, β 3 using raw vs. preprocessed responses. 10 / 22

12 1000 mean integrated squared error: raw vs. preprocessed σ η = 0.2 σ η = 0.5 T = 30 T = 120 µ β 1 β 2 β µ β 1 β 2 β µ β 1 β 2 β µ β 1 β 2 β / 22

13 12 / 22 Extension to function-on-concurrent-function regression? The raw-response design matrix X (n p) B (T K ) has (i, m) block x im B = [x im b k (s t )] 1 t T,1 k K (1 i n, 1 m p). For the concurrent function-on-function model y i (s) = x im (s)β m (s) + ε i (s) (i.e., time-varying predictors), the (i, m) block becomes [x im (s t )b k (s t )] 1 t T,1 k K. For preprocessed responses, the extension is more complex!

14 13 / 22 Callosum thickness example Intercept Very mild Mild Coefficient function Coefficient function Coefficient function Position Position Position Brain size Female Male Age Coefficient function Coefficient function Coefficient function Position Position Position

15 14 / 22 Residual dependence When minimizing the penalized OLS criterion n y i i=1 2 x im β m + λ m β m(s) 2 ds, where y i = [y i (s 1 ),..., y i (s T )] T, β m = [β m(s 1 ),..., β m(s T )] T, selection of λ 1,..., λ p by REML or GCV is not straightforward since in general Σ = Cov(y i x i1,..., x ip ) I. Three ways to take this within-function dependence into account in smoothness selection: 1. leave-one-function-out cross-validation 2. penalized generalized least squares 3. functional mixed-effects models extensions of repeated-measures methodology to varying-coefficient models.

16 15 / 22 Method 1: Leave-one-function-out cross-validation This is the classical method (Rice and Silverman, 1991; Ramsay and Silverman, 2005): minimize n (y i ŷ ( i) i ) 2. (3) i=1 For given λ, fosr uses the leave-one-function-out lemma y i ŷ ( i) i = (I H ii ) 1 (y i ŷ i ) to compute (3) without refitting. But, there s no standard algorithm for minimizing (3) wrt multiple λ s.

17 16 / 22 Method 2: Penalized generalized least squares Reiss et al. (2010) consider replacing the penalized OLS criterion n y i i=1 2 x im β m + λ m β m(s) 2 ds with the penalized GLS criterion [ n y i i=1 ] T [ x im β ˆΣ 1 m y i ] x im β m + λ m β m(s) 2 ds, where ˆΣ is an estimate of Cov(y i x i1,..., x ip ) (cf. Krafty et al., 2008). Intuition: Equivalent to penalized OLS for prewhitened responses ˆΣ 1/2 y 1,..., ˆΣ 1/2 y n. Viewing these as approximately IID lets us use mgcv (Wood, 2006, 2011) for multiple smoothing parameter selection. Can iterate between updated estimates of Σ and of β 1,..., β p.

18 17 / 22 Method 3: Functional mixed effects Idea: y i (s) = x im β m (s)+η i (s) + ξ i (s), i.e., decompose residual function ε i (s) into random effect function η i (s) + noise function ξ i (s). Express η i (s) as a linear combination of the leading functional principal components. This is implemented for pffr, but not yet in fosr (for version 0.1-6).

19 Details for functional mixed effects The penalized OLS criterion can be written as n y i (x i1 B... x ip B) i=1 θ 1. θ p Consider instead the doubly penalized criterion 2 + λ mθ T mpθ m. β u i {}}{{}}{ X i n θ {}}{ 1 Z i {}}{ y i (x i1 B... x ip B). ( ρ 1 φ 1... u i1 ρ C φ C ). θ p u ic i=1 2 n + λ mθ T mpθ m+λ u u i 2, i=1 where φ j = [φ j (s 1 ),..., φ j (s T )] T is the jth eigenfunction of the residual covariance operator, and ρ j is the jth eigenvector ( u ij s iid). 18 / 22

20 Hypothesis testing Test M 0 vs. M A via pointwise F-statistics (Ramsay and Silverman, 2005) F(s) = [SSEM (s) 0 SSEM (s)]/(p A A p 0 ). SSE MA /(n p A ) Declare significance at t such that F(t) > 100(1 α) percentile, }{{} estimated by permutation under M 0, of max F(s). s Diagnostic group effect F statistics Location 19 / 22

21 20 / 22 Great Books

22 21 / 22 And many thanks to Thank you! Eva Petkova, Lassell Lu and Aaron Chen (NYU), Todd Ogden (Columbia), Ciprian Crainiceanu (Johns Hopkins) and Giles Hooker (Cornell), for valuable discussions; the NIH, for supporting refund via grant 5R01EB (PI: Ogden) and for funding the OASIS project via grants P50 AG05681, P01 AG03991, R01 AG021910, P50 MH071616, U24 RR021382, R01 MH56584; Babak Ardekani, Al Bachman and Sang Han Lee (Nathan Kline Institute), for providing the corpus callosum data derived from the OASIS database.

23 22 / 22 References Crainiceanu, C. M., P. T. Reiss, J. Goldsmith, L. Huang, L. Huo, and F. Scheipl (2012). refund: Regression with functional data. R package version Krafty, R. T., P. A. Gimotty, D. Holtz, G. Coukos, and W. Guo (2008). Varying coefficient model with unknown within-subject covariance for analysis of tumor growth curves. Biometrics 64, Marcus, D., T. Wang, J. Parker, J. Csernansky, J. Morris, and R. Buckner (2007). Open Access Series of Imaging Studies (OASIS): cross-sectional MRI data in young, middle aged, nondemented, and demented older adults. Journal of Cognitive Neuroscience 19(9), Ramsay, J. O., G. Hooker, and S. Graves (2009). Functional Data Analysis with R and MATLAB. New York: Springer. Ramsay, J. O. and B. W. Silverman (2005). Functional Data Analysis (2nd ed.). New York: Springer. Reiss, P. T., L. Huang, and M. Mennes (2010). Fast function-on-scalar regression with penalized basis expansions. International Journal of Biostatistics 6(1), article 28. Rice, J. and B. Silverman (1991). Estimating the mean and covariance structure nonparametrically when the data are curves. Journal of the Royal Statistical Society, Series B 53(1), Wood, S. N. (2006). Generalized Additive Models: An Introduction with R. Boca Raton, FL: Chapman & Hall. Wood, S. N. (2011). Fast stable restricted maximum likelihood and marginal likelihood estimation of semiparametric generalized linear models. Journal of the Royal Statistical Society: Series B 73(1), 3 36.

Simultaneous Confidence Bands for the Coefficient Function in Functional Regression

Simultaneous Confidence Bands for the Coefficient Function in Functional Regression University of Haifa From the SelectedWorks of Philip T. Reiss August 7, 2008 Simultaneous Confidence Bands for the Coefficient Function in Functional Regression Philip T. Reiss, New York University Available

More information

Semiparametric Methods for Mapping Brain Development

Semiparametric Methods for Mapping Brain Development University of Haifa From the SelectedWorks of Philip T. Reiss May, 2012 Semiparametric Methods for Mapping Brain Development Philip T. Reiss Yin-Hsiu Chen Lan Huo Available at: https://works.bepress.com/phil_reiss/27/

More information

Flexible penalized regression for functional data...and other complex data objects

Flexible penalized regression for functional data...and other complex data objects University of Haifa From the SelectedWorks of Philip T. Reiss November, 2015 Flexible penalized regression for functional data...and other complex data objects Philip T. Reiss Available at: https://works.bepress.com/phil_reiss/41/

More information

SUPPLEMENTARY APPENDICES FOR WAVELET-DOMAIN REGRESSION AND PREDICTIVE INFERENCE IN PSYCHIATRIC NEUROIMAGING

SUPPLEMENTARY APPENDICES FOR WAVELET-DOMAIN REGRESSION AND PREDICTIVE INFERENCE IN PSYCHIATRIC NEUROIMAGING Submitted to the Annals of Applied Statistics SUPPLEMENTARY APPENDICES FOR WAVELET-DOMAIN REGRESSION AND PREDICTIVE INFERENCE IN PSYCHIATRIC NEUROIMAGING By Philip T. Reiss, Lan Huo, Yihong Zhao, Clare

More information

Varying-smoother models for functional responses

Varying-smoother models for functional responses Varying-smoother models for functional responses Philip T. Reiss 1,2,, Lei Huang 3, Huaihou Chen 4, and Stan Colcombe 2 arxiv:1412.0778v1 stat.me 2 Dec 2014 1 Department of Child and Adolescent Psychiatry

More information

Modeling Multiple Correlated Functional Outcomes with Spatially. Heterogeneous Shape Characteristics

Modeling Multiple Correlated Functional Outcomes with Spatially. Heterogeneous Shape Characteristics Biometrics 000, 1 25 DOI: 000 000 0000 Modeling Multiple Correlated Functional Outcomes with Spatially Heterogeneous Shape Characteristics Kunlaya Soiaporn 1,, David Ruppert 2,, and Raymond J. Carroll

More information

Regression When the Predictors Are Images

Regression When the Predictors Are Images University of Haifa From the SelectedWorks of Philip T. Reiss May, 2009 Regression When the Predictors Are Images Philip T. Reiss, New York University Available at: https://works.bepress.com/phil_reiss/15/

More information

Statistics 203: Introduction to Regression and Analysis of Variance Course review

Statistics 203: Introduction to Regression and Analysis of Variance Course review Statistics 203: Introduction to Regression and Analysis of Variance Course review Jonathan Taylor - p. 1/?? Today Review / overview of what we learned. - p. 2/?? General themes in regression models Specifying

More information

Likelihood ratio testing for zero variance components in linear mixed models

Likelihood ratio testing for zero variance components in linear mixed models Likelihood ratio testing for zero variance components in linear mixed models Sonja Greven 1,3, Ciprian Crainiceanu 2, Annette Peters 3 and Helmut Küchenhoff 1 1 Department of Statistics, LMU Munich University,

More information

Regularization in Cox Frailty Models

Regularization in Cox Frailty Models Regularization in Cox Frailty Models Andreas Groll 1, Trevor Hastie 2, Gerhard Tutz 3 1 Ludwig-Maximilians-Universität Munich, Department of Mathematics, Theresienstraße 39, 80333 Munich, Germany 2 University

More information

Supplementary Material to General Functional Concurrent Model

Supplementary Material to General Functional Concurrent Model Supplementary Material to General Functional Concurrent Model Janet S. Kim Arnab Maity Ana-Maria Staicu June 17, 2016 This Supplementary Material contains six sections. Appendix A discusses modifications

More information

On the Identifiability of the Functional Convolution. Model

On the Identifiability of the Functional Convolution. Model On the Identifiability of the Functional Convolution Model Giles Hooker Abstract This report details conditions under which the Functional Convolution Model described in Asencio et al. (2013) can be identified

More information

Fundamental Issues in Bayesian Functional Data Analysis. Dennis D. Cox Rice University

Fundamental Issues in Bayesian Functional Data Analysis. Dennis D. Cox Rice University Fundamental Issues in Bayesian Functional Data Analysis Dennis D. Cox Rice University 1 Introduction Question: What are functional data? Answer: Data that are functions of a continuous variable.... say

More information

FUNCTIONAL DATA ANALYSIS. Contribution to the. International Handbook (Encyclopedia) of Statistical Sciences. July 28, Hans-Georg Müller 1

FUNCTIONAL DATA ANALYSIS. Contribution to the. International Handbook (Encyclopedia) of Statistical Sciences. July 28, Hans-Georg Müller 1 FUNCTIONAL DATA ANALYSIS Contribution to the International Handbook (Encyclopedia) of Statistical Sciences July 28, 2009 Hans-Georg Müller 1 Department of Statistics University of California, Davis One

More information

Introduction to Functional Data Analysis A CSCU Workshop. Giles Hooker Biological Statistics and Computational Biology

Introduction to Functional Data Analysis A CSCU Workshop. Giles Hooker Biological Statistics and Computational Biology Introduction to Functional Data Analysis A CSCU Workshop Giles Hooker Biological Statistics and Computational Biology gjh27@cornell.edu www.bscb.cornell.edu/ hooker/fdaworkshop 1 / 26 Agenda What is Functional

More information

Tutorial on Functional Data Analysis

Tutorial on Functional Data Analysis Tutorial on Functional Data Analysis Ana-Maria Staicu Department of Statistics, North Carolina State University astaicu@ncsu.edu SAMSI April 5, 2017 A-M Staicu Tutorial on Functional Data Analysis April

More information

Morphometrics with SPM12

Morphometrics with SPM12 Morphometrics with SPM12 John Ashburner Wellcome Trust Centre for Neuroimaging, 12 Queen Square, London, UK. What kind of differences are we looking for? Usually, we try to localise regions of difference.

More information

General Functional Concurrent Model

General Functional Concurrent Model General Functional Concurrent Model Janet S. Kim Arnab Maity Ana-Maria Staicu June 17, 2016 Abstract We propose a flexible regression model to study the association between a functional response and multiple

More information

Web Appendix for Hierarchical Adaptive Regression Kernels for Regression with Functional Predictors by D. B. Woodard, C. Crainiceanu, and D.

Web Appendix for Hierarchical Adaptive Regression Kernels for Regression with Functional Predictors by D. B. Woodard, C. Crainiceanu, and D. Web Appendix for Hierarchical Adaptive Regression Kernels for Regression with Functional Predictors by D. B. Woodard, C. Crainiceanu, and D. Ruppert A. EMPIRICAL ESTIMATE OF THE KERNEL MIXTURE Here we

More information

Functional Mixed Effects Spectral Analysis

Functional Mixed Effects Spectral Analysis Joint with Robert Krafty and Martica Hall June 4, 2014 Outline Introduction Motivating example Brief review Functional mixed effects spectral analysis Estimation Procedure Application Remarks Introduction

More information

Final Review. Yang Feng. Yang Feng (Columbia University) Final Review 1 / 58

Final Review. Yang Feng.   Yang Feng (Columbia University) Final Review 1 / 58 Final Review Yang Feng http://www.stat.columbia.edu/~yangfeng Yang Feng (Columbia University) Final Review 1 / 58 Outline 1 Multiple Linear Regression (Estimation, Inference) 2 Special Topics for Multiple

More information

Missing Data Issues in the Studies of Neurodegenerative Disorders: the Methodology

Missing Data Issues in the Studies of Neurodegenerative Disorders: the Methodology Missing Data Issues in the Studies of Neurodegenerative Disorders: the Methodology Sheng Luo, PhD Associate Professor Department of Biostatistics & Bioinformatics Duke University Medical Center sheng.luo@duke.edu

More information

Variable Selection for Generalized Additive Mixed Models by Likelihood-based Boosting

Variable Selection for Generalized Additive Mixed Models by Likelihood-based Boosting Variable Selection for Generalized Additive Mixed Models by Likelihood-based Boosting Andreas Groll 1 and Gerhard Tutz 2 1 Department of Statistics, University of Munich, Akademiestrasse 1, D-80799, Munich,

More information

ADAPTIVE SPECTRAL ANALYSIS OF REPLICATED NONSTATIONARY TIME SERIES

ADAPTIVE SPECTRAL ANALYSIS OF REPLICATED NONSTATIONARY TIME SERIES ADAPTIVE SPECTRAL ANALYSIS OF REPLICATED NONSTATIONARY TIME SERIES ROBERT KRAFTY Department of Biostatistics University of Pittsburgh Joint work with: Scott Bruce Department of Statistical Science, Temple

More information

Generalized Functional Concurrent Model

Generalized Functional Concurrent Model Biometrics, 1 25 2015 Generalized Functional Concurrent Model Janet S. Kim, Arnab Maity, and Ana-Maria Staicu Department of Statistics, North Carolina State University, Raleigh, North Carolina 27695, U.S.A.

More information

Smooth Scalar-on-Image Regression via. Spatial Bayesian Variable Selection

Smooth Scalar-on-Image Regression via. Spatial Bayesian Variable Selection Smooth Scalar-on-Image Regression via Spatial Bayesian Variable Selection Jeff Goldsmith 1,*, Lei Huang 2, and Ciprian M. Crainiceanu 2 1 Department of Biostatistics, Columbia University School of Public

More information

Some properties of Likelihood Ratio Tests in Linear Mixed Models

Some properties of Likelihood Ratio Tests in Linear Mixed Models Some properties of Likelihood Ratio Tests in Linear Mixed Models Ciprian M. Crainiceanu David Ruppert Timothy J. Vogelsang September 19, 2003 Abstract We calculate the finite sample probability mass-at-zero

More information

Asymptotic Multivariate Kriging Using Estimated Parameters with Bayesian Prediction Methods for Non-linear Predictands

Asymptotic Multivariate Kriging Using Estimated Parameters with Bayesian Prediction Methods for Non-linear Predictands Asymptotic Multivariate Kriging Using Estimated Parameters with Bayesian Prediction Methods for Non-linear Predictands Elizabeth C. Mannshardt-Shamseldin Advisor: Richard L. Smith Duke University Department

More information

Functional Linear Mixed Models for. Irregularly or Sparsely Sampled Data

Functional Linear Mixed Models for. Irregularly or Sparsely Sampled Data Functional Linear Mixed Models for Irregularly or Sparsely Sampled Data Jona Cederbaum 1, Marianne Pouplier 2, Phil Hoole 2, and Sonja Greven 1 arxiv:150801686v1 [statme] 7 Aug 2015 1 Department of Statistics,

More information

RLRsim: Testing for Random Effects or Nonparametric Regression Functions in Additive Mixed Models

RLRsim: Testing for Random Effects or Nonparametric Regression Functions in Additive Mixed Models RLRsim: Testing for Random Effects or Nonparametric Regression Functions in Additive Mixed Models Fabian Scheipl 1 joint work with Sonja Greven 1,2 and Helmut Küchenhoff 1 1 Department of Statistics, LMU

More information

for Sparsely Observed Functional Data University of Glasgow

for Sparsely Observed Functional Data University of Glasgow Spatial Correlation Estimation for Sparsely Observed Functional Data Workshop on High dimensional and dependent functional data, Bristol, Sep 2012 Surajit Ray University of Glasgow Contributors t Chong

More information

Uncertainty Quantification for Inverse Problems. November 7, 2011

Uncertainty Quantification for Inverse Problems. November 7, 2011 Uncertainty Quantification for Inverse Problems November 7, 2011 Outline UQ and inverse problems Review: least-squares Review: Gaussian Bayesian linear model Parametric reductions for IP Bias, variance

More information

Some Theories about Backfitting Algorithm for Varying Coefficient Partially Linear Model

Some Theories about Backfitting Algorithm for Varying Coefficient Partially Linear Model Some Theories about Backfitting Algorithm for Varying Coefficient Partially Linear Model 1. Introduction Varying-coefficient partially linear model (Zhang, Lee, and Song, 2002; Xia, Zhang, and Tong, 2004;

More information

Inference After Variable Selection

Inference After Variable Selection Department of Mathematics, SIU Carbondale Inference After Variable Selection Lasanthi Pelawa Watagoda lasanthi@siu.edu June 12, 2017 Outline 1 Introduction 2 Inference For Ridge and Lasso 3 Variable Selection

More information

FuncICA for time series pattern discovery

FuncICA for time series pattern discovery FuncICA for time series pattern discovery Nishant Mehta and Alexander Gray Georgia Institute of Technology The problem Given a set of inherently continuous time series (e.g. EEG) Find a set of patterns

More information

Applied Regression. Applied Regression. Chapter 2 Simple Linear Regression. Hongcheng Li. April, 6, 2013

Applied Regression. Applied Regression. Chapter 2 Simple Linear Regression. Hongcheng Li. April, 6, 2013 Applied Regression Chapter 2 Simple Linear Regression Hongcheng Li April, 6, 2013 Outline 1 Introduction of simple linear regression 2 Scatter plot 3 Simple linear regression model 4 Test of Hypothesis

More information

Incorporating Covariates in Skewed Functional Data Models

Incorporating Covariates in Skewed Functional Data Models Incorporating Covariates in Skewed Functional Data Models Meng Li, Ana-Maria Staicu and Howard D. Bondell Department of Statistics, North Carolina State University ana-maria staicu@ncsu.edu November 5,

More information

LONGITUDINAL FUNCTIONAL MODELS WITH STRUCTURED PENALTIES

LONGITUDINAL FUNCTIONAL MODELS WITH STRUCTURED PENALTIES Johns Hopkins University, Dept. of Biostatistics Working Papers 11-2-2012 LONGITUDINAL FUNCTIONAL MODELS WITH STRUCTURED PENALTIES Madan G. Kundu Indiana University School of Medicine, Department of Biostatistics

More information

Sparse regularization for functional logistic regression models

Sparse regularization for functional logistic regression models Sparse regularization for functional logistic regression models Hidetoshi Matsui The Center for Data Science Education and Research, Shiga University 1-1-1 Banba, Hikone, Shiga, 5-85, Japan. hmatsui@biwako.shiga-u.ac.jp

More information

Ph.D. Qualifying Exam Friday Saturday, January 6 7, 2017

Ph.D. Qualifying Exam Friday Saturday, January 6 7, 2017 Ph.D. Qualifying Exam Friday Saturday, January 6 7, 2017 Put your solution to each problem on a separate sheet of paper. Problem 1. (5106) Let X 1, X 2,, X n be a sequence of i.i.d. observations from a

More information

Statistics for analyzing and modeling precipitation isotope ratios in IsoMAP

Statistics for analyzing and modeling precipitation isotope ratios in IsoMAP Statistics for analyzing and modeling precipitation isotope ratios in IsoMAP The IsoMAP uses the multiple linear regression and geostatistical methods to analyze isotope data Suppose the response variable

More information

An Introduction to Functional Data Analysis

An Introduction to Functional Data Analysis An Introduction to Functional Data Analysis Chongzhi Di Fred Hutchinson Cancer Research Center cdi@fredhutch.org Biotat 578A: Special Topics in (Genetic) Epidemiology November 10, 2015 Textbook Ramsay

More information

Likelihood Ratio Tests. that Certain Variance Components Are Zero. Ciprian M. Crainiceanu. Department of Statistical Science

Likelihood Ratio Tests. that Certain Variance Components Are Zero. Ciprian M. Crainiceanu. Department of Statistical Science 1 Likelihood Ratio Tests that Certain Variance Components Are Zero Ciprian M. Crainiceanu Department of Statistical Science www.people.cornell.edu/pages/cmc59 Work done jointly with David Ruppert, School

More information

COMPARING PARAMETRIC AND SEMIPARAMETRIC ERROR CORRECTION MODELS FOR ESTIMATION OF LONG RUN EQUILIBRIUM BETWEEN EXPORTS AND IMPORTS

COMPARING PARAMETRIC AND SEMIPARAMETRIC ERROR CORRECTION MODELS FOR ESTIMATION OF LONG RUN EQUILIBRIUM BETWEEN EXPORTS AND IMPORTS Applied Studies in Agribusiness and Commerce APSTRACT Center-Print Publishing House, Debrecen DOI: 10.19041/APSTRACT/2017/1-2/3 SCIENTIFIC PAPER COMPARING PARAMETRIC AND SEMIPARAMETRIC ERROR CORRECTION

More information

Chapter 4 - Fundamentals of spatial processes Lecture notes

Chapter 4 - Fundamentals of spatial processes Lecture notes Chapter 4 - Fundamentals of spatial processes Lecture notes Geir Storvik January 21, 2013 STK4150 - Intro 2 Spatial processes Typically correlation between nearby sites Mostly positive correlation Negative

More information

On the Behavior of Marginal and Conditional Akaike Information Criteria in Linear Mixed Models

On the Behavior of Marginal and Conditional Akaike Information Criteria in Linear Mixed Models On the Behavior of Marginal and Conditional Akaike Information Criteria in Linear Mixed Models Thomas Kneib Institute of Statistics and Econometrics Georg-August-University Göttingen Department of Statistics

More information

Bayesian variable selection via. Penalized credible regions. Brian Reich, NCSU. Joint work with. Howard Bondell and Ander Wilson

Bayesian variable selection via. Penalized credible regions. Brian Reich, NCSU. Joint work with. Howard Bondell and Ander Wilson Bayesian variable selection via penalized credible regions Brian Reich, NC State Joint work with Howard Bondell and Ander Wilson Brian Reich, NCSU Penalized credible regions 1 Motivation big p, small n

More information

Multiple QTL mapping

Multiple QTL mapping Multiple QTL mapping Karl W Broman Department of Biostatistics Johns Hopkins University www.biostat.jhsph.edu/~kbroman [ Teaching Miscellaneous lectures] 1 Why? Reduce residual variation = increased power

More information

PENALIZED LIKELIHOOD PARAMETER ESTIMATION FOR ADDITIVE HAZARD MODELS WITH INTERVAL CENSORED DATA

PENALIZED LIKELIHOOD PARAMETER ESTIMATION FOR ADDITIVE HAZARD MODELS WITH INTERVAL CENSORED DATA PENALIZED LIKELIHOOD PARAMETER ESTIMATION FOR ADDITIVE HAZARD MODELS WITH INTERVAL CENSORED DATA Kasun Rathnayake ; A/Prof Jun Ma Department of Statistics Faculty of Science and Engineering Macquarie University

More information

Functional Generalized Additive Models

Functional Generalized Additive Models This article was downloaded by: [Cornell University] On: 27 November 2012, At: 05:50 Publisher: Taylor & Francis Informa Ltd Registered in England and Wales Registered Number: 1072954 Registered office:

More information

Functional time series

Functional time series Rob J Hyndman Functional time series with applications in demography 4. Connections, extensions and applications Outline 1 Yield curves 2 Electricity prices 3 Dynamic updating with partially observed functions

More information

DESIGN-ADAPTIVE MINIMAX LOCAL LINEAR REGRESSION FOR LONGITUDINAL/CLUSTERED DATA

DESIGN-ADAPTIVE MINIMAX LOCAL LINEAR REGRESSION FOR LONGITUDINAL/CLUSTERED DATA Statistica Sinica 18(2008), 515-534 DESIGN-ADAPTIVE MINIMAX LOCAL LINEAR REGRESSION FOR LONGITUDINAL/CLUSTERED DATA Kani Chen 1, Jianqing Fan 2 and Zhezhen Jin 3 1 Hong Kong University of Science and Technology,

More information

Simple Linear Regression

Simple Linear Regression Simple Linear Regression In simple linear regression we are concerned about the relationship between two variables, X and Y. There are two components to such a relationship. 1. The strength of the relationship.

More information

Compressive Inference

Compressive Inference Compressive Inference Weihong Guo and Dan Yang Case Western Reserve University and SAMSI SAMSI transition workshop Project of Compressive Inference subgroup of Imaging WG Active members: Garvesh Raskutti,

More information

Smooth Common Principal Component Analysis

Smooth Common Principal Component Analysis 1 Smooth Common Principal Component Analysis Michal Benko Wolfgang Härdle Center for Applied Statistics and Economics benko@wiwi.hu-berlin.de Humboldt-Universität zu Berlin Motivation 1-1 Volatility Surface

More information

Incorporating Covariates in Skewed Functional Data Models

Incorporating Covariates in Skewed Functional Data Models Incorporating Covariates in Skewed Functional Data Models Meng Li, Ana-Maria Staicu and Howard D. Bondell Department of Statistics, North Carolina State University Abstract Most existing models for functional

More information

A hierarchical group ICA model for assessing covariate effects on brain functional networks

A hierarchical group ICA model for assessing covariate effects on brain functional networks A hierarchical group ICA model for assessing covariate effects on brain functional networks Ying Guo Joint work with Ran Shi Department of Biostatistics and Bioinformatics Emory University 06/03/2013 Ying

More information

Analysing geoadditive regression data: a mixed model approach

Analysing geoadditive regression data: a mixed model approach Analysing geoadditive regression data: a mixed model approach Institut für Statistik, Ludwig-Maximilians-Universität München Joint work with Ludwig Fahrmeir & Stefan Lang 25.11.2005 Spatio-temporal regression

More information

Likelihood Ratio Tests for Dependent Data with Applications to Longitudinal and Functional Data Analysis

Likelihood Ratio Tests for Dependent Data with Applications to Longitudinal and Functional Data Analysis Likelihood Ratio Tests for Dependent Data with Applications to Longitudinal and Functional Data Analysis Ana-Maria Staicu Yingxing Li Ciprian M. Crainiceanu David Ruppert October 13, 2013 Abstract The

More information

Methods for scalar-on-function regression

Methods for scalar-on-function regression University of Haifa From the SelectedWorks of Philip T. Reiss August, 2017 Methods for scalar-on-function regression Philip T. Reiss Jeff Goldsmith, Columbia University Han Lin Shang, Australian National

More information

Learned-Loss Boosting

Learned-Loss Boosting Noname manuscript No. (will be inserted by the editor) Learned-Loss Boosting Giles Hooker, James O. Ramsay Department of Psychology McGill University Montreal, QC, H2X 2G3 Canada giles.hooker@mcgill.ca

More information

Bayesian Inference on Joint Mixture Models for Survival-Longitudinal Data with Multiple Features. Yangxin Huang

Bayesian Inference on Joint Mixture Models for Survival-Longitudinal Data with Multiple Features. Yangxin Huang Bayesian Inference on Joint Mixture Models for Survival-Longitudinal Data with Multiple Features Yangxin Huang Department of Epidemiology and Biostatistics, COPH, USF, Tampa, FL yhuang@health.usf.edu January

More information

STAT 730 Chapter 4: Estimation

STAT 730 Chapter 4: Estimation STAT 730 Chapter 4: Estimation Timothy Hanson Department of Statistics, University of South Carolina Stat 730: Multivariate Analysis 1 / 23 The likelihood We have iid data, at least initially. Each datum

More information

Hierarchical Modelling for Univariate Spatial Data

Hierarchical Modelling for Univariate Spatial Data Hierarchical Modelling for Univariate Spatial Data Sudipto Banerjee 1 and Andrew O. Finley 2 1 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A. 2 Department

More information

Maximum Smoothed Likelihood for Multivariate Nonparametric Mixtures

Maximum Smoothed Likelihood for Multivariate Nonparametric Mixtures Maximum Smoothed Likelihood for Multivariate Nonparametric Mixtures David Hunter Pennsylvania State University, USA Joint work with: Tom Hettmansperger, Hoben Thomas, Didier Chauveau, Pierre Vandekerkhove,

More information

A NOTE ON A NONPARAMETRIC REGRESSION TEST THROUGH PENALIZED SPLINES

A NOTE ON A NONPARAMETRIC REGRESSION TEST THROUGH PENALIZED SPLINES Statistica Sinica 24 (2014), 1143-1160 doi:http://dx.doi.org/10.5705/ss.2012.230 A NOTE ON A NONPARAMETRIC REGRESSION TEST THROUGH PENALIZED SPLINES Huaihou Chen 1, Yuanjia Wang 2, Runze Li 3 and Katherine

More information

Functional SVD for Big Data

Functional SVD for Big Data Functional SVD for Big Data Pan Chao April 23, 2014 Pan Chao Functional SVD for Big Data April 23, 2014 1 / 24 Outline 1 One-Way Functional SVD a) Interpretation b) Robustness c) CV/GCV 2 Two-Way Problem

More information

Regression, Ridge Regression, Lasso

Regression, Ridge Regression, Lasso Regression, Ridge Regression, Lasso Fabio G. Cozman - fgcozman@usp.br October 2, 2018 A general definition Regression studies the relationship between a response variable Y and covariates X 1,..., X n.

More information

Diagnostics for Linear Models With Functional Responses

Diagnostics for Linear Models With Functional Responses Diagnostics for Linear Models With Functional Responses Qing Shen Edmunds.com Inc. 2401 Colorado Ave., Suite 250 Santa Monica, CA 90404 (shenqing26@hotmail.com) Hongquan Xu Department of Statistics University

More information

Score test for random changepoint in a mixed model

Score test for random changepoint in a mixed model Score test for random changepoint in a mixed model Corentin Segalas and Hélène Jacqmin-Gadda INSERM U1219, Biostatistics team, Bordeaux GDR Statistiques et Santé October 6, 2017 Biostatistics 1 / 27 Introduction

More information

Modeling motor learning using heteroskedastic functional principal components analysis

Modeling motor learning using heteroskedastic functional principal components analysis Journal of the American Statistical Association ISSN: 0162-1459 (Print) 1537-274X (Online) Journal homepage: http://www.tandfonline.com/loi/uasa20 Modeling motor learning using heteroskedastic functional

More information

Rejoinder. 1 Phase I and Phase II Profile Monitoring. Peihua Qiu 1, Changliang Zou 2 and Zhaojun Wang 2

Rejoinder. 1 Phase I and Phase II Profile Monitoring. Peihua Qiu 1, Changliang Zou 2 and Zhaojun Wang 2 Rejoinder Peihua Qiu 1, Changliang Zou 2 and Zhaojun Wang 2 1 School of Statistics, University of Minnesota 2 LPMC and Department of Statistics, Nankai University, China We thank the editor Professor David

More information

Mixture of Gaussian Processes and its Applications

Mixture of Gaussian Processes and its Applications Mixture of Gaussian Processes and its Applications Mian Huang, Runze Li, Hansheng Wang, and Weixin Yao The Pennsylvania State University Technical Report Series #10-102 College of Health and Human Development

More information

Approaches to Modeling Menstrual Cycle Function

Approaches to Modeling Menstrual Cycle Function Approaches to Modeling Menstrual Cycle Function Paul S. Albert (albertp@mail.nih.gov) Biostatistics & Bioinformatics Branch Division of Epidemiology, Statistics, and Prevention Research NICHD SPER Student

More information

Integrated Likelihood Estimation in Semiparametric Regression Models. Thomas A. Severini Department of Statistics Northwestern University

Integrated Likelihood Estimation in Semiparametric Regression Models. Thomas A. Severini Department of Statistics Northwestern University Integrated Likelihood Estimation in Semiparametric Regression Models Thomas A. Severini Department of Statistics Northwestern University Joint work with Heping He, University of York Introduction Let Y

More information

Technical Vignette 5: Understanding intrinsic Gaussian Markov random field spatial models, including intrinsic conditional autoregressive models

Technical Vignette 5: Understanding intrinsic Gaussian Markov random field spatial models, including intrinsic conditional autoregressive models Technical Vignette 5: Understanding intrinsic Gaussian Markov random field spatial models, including intrinsic conditional autoregressive models Christopher Paciorek, Department of Statistics, University

More information

mboost - Componentwise Boosting for Generalised Regression Models

mboost - Componentwise Boosting for Generalised Regression Models mboost - Componentwise Boosting for Generalised Regression Models Thomas Kneib & Torsten Hothorn Department of Statistics Ludwig-Maximilians-University Munich 13.8.2008 Boosting in a Nutshell Boosting

More information

Covariance function estimation in Gaussian process regression

Covariance function estimation in Gaussian process regression Covariance function estimation in Gaussian process regression François Bachoc Department of Statistics and Operations Research, University of Vienna WU Research Seminar - May 2015 François Bachoc Gaussian

More information

Restricted Maximum Likelihood in Linear Regression and Linear Mixed-Effects Model

Restricted Maximum Likelihood in Linear Regression and Linear Mixed-Effects Model Restricted Maximum Likelihood in Linear Regression and Linear Mixed-Effects Model Xiuming Zhang zhangxiuming@u.nus.edu A*STAR-NUS Clinical Imaging Research Center October, 015 Summary This report derives

More information

Tolerance Bands for Functional Data

Tolerance Bands for Functional Data Tolerance Bands for Functional Data Lasitha N. Rathnayake and Pankaj K. Choudhary 1 Department of Mathematical Sciences, FO 35 University of Texas at Dallas Richardson, TX 75080-3021, USA Abstract Often

More information

Small Area Modeling of County Estimates for Corn and Soybean Yields in the US

Small Area Modeling of County Estimates for Corn and Soybean Yields in the US Small Area Modeling of County Estimates for Corn and Soybean Yields in the US Matt Williams National Agricultural Statistics Service United States Department of Agriculture Matt.Williams@nass.usda.gov

More information

Ridge and Lasso Regression

Ridge and Lasso Regression enote 8 1 enote 8 Ridge and Lasso Regression enote 8 INDHOLD 2 Indhold 8 Ridge and Lasso Regression 1 8.1 Reading material................................. 2 8.2 Presentation material...............................

More information

Hierarchical Modelling for Univariate Spatial Data

Hierarchical Modelling for Univariate Spatial Data Spatial omain Hierarchical Modelling for Univariate Spatial ata Sudipto Banerjee 1 and Andrew O. Finley 2 1 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A.

More information

Hierarchical Modeling for Univariate Spatial Data

Hierarchical Modeling for Univariate Spatial Data Hierarchical Modeling for Univariate Spatial Data Geography 890, Hierarchical Bayesian Models for Environmental Spatial Data Analysis February 15, 2011 1 Spatial Domain 2 Geography 890 Spatial Domain This

More information

MS&E 226: Small Data. Lecture 11: Maximum likelihood (v2) Ramesh Johari

MS&E 226: Small Data. Lecture 11: Maximum likelihood (v2) Ramesh Johari MS&E 226: Small Data Lecture 11: Maximum likelihood (v2) Ramesh Johari ramesh.johari@stanford.edu 1 / 18 The likelihood function 2 / 18 Estimating the parameter This lecture develops the methodology behind

More information

Linear regression methods

Linear regression methods Linear regression methods Most of our intuition about statistical methods stem from linear regression. For observations i = 1,..., n, the model is Y i = p X ij β j + ε i, j=1 where Y i is the response

More information

Modeling Real Estate Data using Quantile Regression

Modeling Real Estate Data using Quantile Regression Modeling Real Estate Data using Semiparametric Quantile Regression Department of Statistics University of Innsbruck September 9th, 2011 Overview 1 Application: 2 3 4 Hedonic regression data for house prices

More information

Chapter 4 - Fundamentals of spatial processes Lecture notes

Chapter 4 - Fundamentals of spatial processes Lecture notes TK4150 - Intro 1 Chapter 4 - Fundamentals of spatial processes Lecture notes Odd Kolbjørnsen and Geir Storvik January 30, 2017 STK4150 - Intro 2 Spatial processes Typically correlation between nearby sites

More information

Estimating prediction error in mixed models

Estimating prediction error in mixed models Estimating prediction error in mixed models benjamin saefken, thomas kneib georg-august university goettingen sonja greven ludwig-maximilians-university munich 1 / 12 GLMM - Generalized linear mixed models

More information

Hierarchical Linear Models

Hierarchical Linear Models Hierarchical Linear Models Statistics 220 Spring 2005 Copyright c 2005 by Mark E. Irwin The linear regression model Hierarchical Linear Models y N(Xβ, Σ y ) β σ 2 p(β σ 2 ) σ 2 p(σ 2 ) can be extended

More information

The Poisson transform for unnormalised statistical models. Nicolas Chopin (ENSAE) joint work with Simon Barthelmé (CNRS, Gipsa-LAB)

The Poisson transform for unnormalised statistical models. Nicolas Chopin (ENSAE) joint work with Simon Barthelmé (CNRS, Gipsa-LAB) The Poisson transform for unnormalised statistical models Nicolas Chopin (ENSAE) joint work with Simon Barthelmé (CNRS, Gipsa-LAB) Part I Unnormalised statistical models Unnormalised statistical models

More information

Bayesian Linear Models

Bayesian Linear Models Eric F. Lock UMN Division of Biostatistics, SPH elock@umn.edu 03/07/2018 Linear model For observations y 1,..., y n, the basic linear model is y i = x 1i β 1 +... + x pi β p + ɛ i, x 1i,..., x pi are predictors

More information

Stat 5100 Handout #26: Variations on OLS Linear Regression (Ch. 11, 13)

Stat 5100 Handout #26: Variations on OLS Linear Regression (Ch. 11, 13) Stat 5100 Handout #26: Variations on OLS Linear Regression (Ch. 11, 13) 1. Weighted Least Squares (textbook 11.1) Recall regression model Y = β 0 + β 1 X 1 +... + β p 1 X p 1 + ε in matrix form: (Ch. 5,

More information

ftsa: An R Package for Analyzing Functional Time Series by Han Lin Shang

ftsa: An R Package for Analyzing Functional Time Series by Han Lin Shang CONTRIBUTED RESEARCH ARTICLES 64 ftsa: An R Package for Analyzing Functional Time Series by Han Lin Shang Abstract Recent advances in computer recording and storing technology have tremendously increased

More information

Introduction to Simple Linear Regression

Introduction to Simple Linear Regression Introduction to Simple Linear Regression Yang Feng http://www.stat.columbia.edu/~yangfeng Yang Feng (Columbia University) Introduction to Simple Linear Regression 1 / 68 About me Faculty in the Department

More information

Applied Econometrics (QEM)

Applied Econometrics (QEM) Applied Econometrics (QEM) The Simple Linear Regression Model based on Prinicples of Econometrics Jakub Mućk Department of Quantitative Economics Jakub Mućk Applied Econometrics (QEM) Meeting #2 The Simple

More information

Nonparametric Regression. Badr Missaoui

Nonparametric Regression. Badr Missaoui Badr Missaoui Outline Kernel and local polynomial regression. Penalized regression. We are given n pairs of observations (X 1, Y 1 ),...,(X n, Y n ) where Y i = r(x i ) + ε i, i = 1,..., n and r(x) = E(Y

More information

Restricted Likelihood Ratio Tests in Nonparametric Longitudinal Models

Restricted Likelihood Ratio Tests in Nonparametric Longitudinal Models Restricted Likelihood Ratio Tests in Nonparametric Longitudinal Models Short title: Restricted LR Tests in Longitudinal Models Ciprian M. Crainiceanu David Ruppert May 5, 2004 Abstract We assume that repeated

More information

Outline. Remedial Measures) Extra Sums of Squares Standardized Version of the Multiple Regression Model

Outline. Remedial Measures) Extra Sums of Squares Standardized Version of the Multiple Regression Model Outline 1 Multiple Linear Regression (Estimation, Inference, Diagnostics and Remedial Measures) 2 Special Topics for Multiple Regression Extra Sums of Squares Standardized Version of the Multiple Regression

More information

Functional Data Analysis

Functional Data Analysis FDA 1-1 Functional Data Analysis Michal Benko Institut für Statistik und Ökonometrie Humboldt-Universität zu Berlin email:benko@wiwi.hu-berlin.de FDA 1-2 Outline of this talk: Introduction Turning discrete

More information