Smooth Common Principal Component Analysis

1 Smooth Common Principal Component Analysis
Michal Benko, Wolfgang Härdle
Center for Applied Statistics and Economics, Humboldt-Universität zu Berlin

2 Motivation 1-1 Volatility Surface
Figure 1: Implied volatility surfaces at two dates $t_1$ and $t_2$, ODAX (axes: moneyness, maturity, volatility). XFGiv02.xpl

3 Motivation 1-2 Time Series of PCs: 1 month
Figure 2: 1st, 2nd and 3rd principal components for the 1-month maturity (horizontal axis: time).

4 Motivation 1-3 Parallel Coordinate Plot: 1st Eigenvector
Figure 3: First eigenvectors (separate PCA) for the 1-, 2- and 3-month maturities (axes: index of moneyness, factor loading). Indices 1 to 6 correspond to moneyness $\kappa \in \{0.85, 0.90, 0.95, 1.00, 1.05, 1.10\}$.

5 Motivation 1-4 Parallel Coordinate Plot: 2nd Eigenvector
Figure 4: Second eigenvectors (separate PCA) for the 1-, 2- and 3-month maturities (axes: index of moneyness, factor loading). Indices 1 to 6 correspond to moneyness $\kappa \in \{0.85, 0.90, 0.95, 1.00, 1.05, 1.10\}$.

6 Motivation 1-5 Simulated CPC Model Figure 5: Simulated CPC model as observable in vole data; compare Flury (1988)

7 Motivation 1-6 CPC Coordinate Plot: First Three Eigenvectors
Figure 6: First three eigenvectors under CPC for the 1-, 2- and 3-month maturities (axes: index of eigenvectors, factor loading). Indices 1 to 6 correspond to moneyness $\kappa \in \{0.85, 0.90, 0.95, 1.00, 1.05, 1.10\}$. XFGiv06.xpl

8 Motivation 1-7
+ CPC for IV yields the desired dimension reduction
+ trading signals and strategies can be obtained from the CPC model
- the problem, though, is of a functional nature
- eigenvectors are rough
? is it possible to (elegantly) combine CPC & FDA

9 Motivation 1-8 Outline
1. Motivation
2. Functional Data Analysis
3. Functional Data Example
4. Smooth Functional Principal Components Analysis
5. Smooth Common Principal Components
6. Application

10 Functional Data Analysis 2-1 Functional Data Analysis: Problem
Analyze (random) functions on an interval $J$: $X(t)$, $t \in J$,
using $n$ observations of $X$: $X_1(t), \ldots, X_n(t)$.

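11 Functional Data Analysis 2-2 Moments
$$X_\mu(t) = \operatorname{E} X(t), \quad t \in J$$
$$\operatorname{Var}_X(t) = \operatorname{E}\{X(t) - \operatorname{E} X(t)\}^2, \quad t \in J$$
$$\Gamma(s, t) = \operatorname{E}\big[\{X(s) - \operatorname{E} X(s)\}\{X(t) - \operatorname{E} X(t)\}\big], \quad s, t \in J$$
In order to simplify the notation, assume $X_\mu(t) = 0$, $t \in J$.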

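12 Functional Data Analysis 2-3 Summary Statistics
$$\bar{X}(t) = \frac{1}{n} \sum_{i=1}^{n} X_i(t)$$
$$\hat{\sigma}^2(t) = \frac{1}{n} \sum_{i=1}^{n} \{X_i(t) - \bar{X}(t)\}^2$$
$$\hat{\Gamma}(s, t) = \frac{1}{n} \sum_{i=1}^{n} \{X_i(s) - \bar{X}(s)\}\{X_i(t) - \bar{X}(t)\}$$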
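As a rough illustration of these summary statistics, the sketch below evaluates them for curves that are assumed to be observed on a common grid, stored row-wise in an array `X`. The function name `summary_statistics` and the toy data are illustrative assumptions, not part of the original slides.

```python
import numpy as np

def summary_statistics(X):
    """Pointwise mean, variance and covariance of curves on a common grid.

    X is an (n, p) array: row i holds X_i evaluated at p grid points
    (an assumption of this sketch).
    """
    n = X.shape[0]
    mean = X.mean(axis=0)                      # sample mean function
    centered = X - mean
    var = (centered ** 2).sum(axis=0) / n      # pointwise variance, 1/n scaling
    cov = centered.T @ centered / n            # sample covariance on the grid
    return mean, var, cov

# Toy data: random multiples of one sine curve plus small noise.
rng = np.random.default_rng(0)
grid = np.linspace(0.0, 1.0, 50)
X = (rng.normal(size=(20, 1)) * np.sin(2 * np.pi * grid)
     + rng.normal(scale=0.1, size=(20, 50)))
mean, var, cov = summary_statistics(X)
```

The diagonal of `cov` coincides with `var`, which mirrors the relation between the covariance function at $s = t$ and the variance function.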

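13 Functional PCA 3-1 Functional Principal Components Analysis
Analogue to multivariate PCA in functional space:
$$\arg\max_{\langle \gamma_l, \gamma_k \rangle = \delta_{lk},\; l \le k} \operatorname{Var}\langle \gamma_k, X \rangle \qquad (1)$$
where $\langle f, g \rangle \stackrel{\mathrm{def}}{=} \int_J f(t) g(t)\,dt$.
The results are the eigenfunctions $\gamma_k$.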

14 Functional PCA 3-2 Theoretical Aspects of FDA
Eigenfunctions are solutions of the eigenequation (a homogeneous Fredholm equation of the second kind):
$$\int_J \Gamma(s, t)\,\gamma(t)\,dt = \lambda \gamma(s) \qquad (2)$$
Ramsay, J., & Silverman, B. (1997)

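15 Functional PCA 3-3 Eigenelements
$f_k = \langle \gamma_k, X \rangle$: principal components
$\gamma_k(t)$: eigenfunctions
$\lambda_k$: eigenvalues,
$$\lambda_k = \operatorname{Var}(f_k) = \operatorname{Var}\Big\{\int_J \gamma_k(t) X(t)\,dt\Big\} = \int_J \int_J \gamma_k(s)\,\Gamma(s, t)\,\gamma_k(t)\,ds\,dt$$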

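16 Functional PCA 3-4 How to Estimate Functional Principal Components?
Replace $\Gamma$ in (2) by the sample covariance function $\hat{\Gamma}$:
$$\int_J \hat{\Gamma}(s, t)\,\hat{\gamma}(t)\,dt = \hat{\lambda} \hat{\gamma}(s) \qquad (3)$$
The resulting $\hat{\lambda}_k$, $\hat{\gamma}_k$ are consistent estimators of $\lambda_k$, $\gamma_k$.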
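One simple way to approximate the empirical eigenequation numerically is to discretize the integral. The sketch below assumes curves observed on an equispaced grid with spacing `h` and replaces the integral by a Riemann sum, so the problem becomes an ordinary symmetric eigenproblem. This is a stand-in for illustration, not the basis-expansion route taken in the slides; `fpca_on_grid` and the toy data are hypothetical.

```python
import numpy as np

def fpca_on_grid(X, h, n_components=2):
    """Discretized functional PCA on an equispaced grid (Riemann-sum quadrature).

    X: (n, p) array of curves on a common grid with spacing h (assumed).
    Returns eigenvalues, eigenfunctions on the grid, and component scores.
    """
    centered = X - X.mean(axis=0)
    cov = centered.T @ centered / X.shape[0]        # sample covariance on the grid
    eigvals, eigvecs = np.linalg.eigh(h * cov)      # integral ~ h * matrix product
    order = np.argsort(eigvals)[::-1][:n_components]
    lam = eigvals[order]
    gamma = eigvecs[:, order].T / np.sqrt(h)        # rescale so that h * sum(gamma^2) = 1
    scores = centered @ gamma.T * h                 # f_k = <gamma_k, X - mean>
    return lam, gamma, scores

# Toy data: two random amplitudes on sine and cosine shapes.
rng = np.random.default_rng(1)
grid = np.linspace(0.0, 1.0, 101)
h = grid[1] - grid[0]
X = (rng.normal(scale=2.0, size=(200, 1)) * np.sin(2 * np.pi * grid)
     + rng.normal(scale=0.5, size=(200, 1)) * np.cos(2 * np.pi * grid))
lam, gamma, scores = fpca_on_grid(X, h)
```

With this scaling, the empirical variance of each score column equals the corresponding eigenvalue, in line with $\lambda_k = \operatorname{Var}(f_k)$.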

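17 Functional PCA 3-5 Basis Function Expansion
Assume that $X_i(t)$ can be expanded in terms of a function basis $\Phi = (\Phi_1, \ldots, \Phi_L)^\top$:
$$X_i(t) \approx \sum_{l=1}^{L} c_{il}\,\Phi_l(t) \qquad (4)$$
Simultaneous expansion of all $X_i$, $i = 1, \ldots, n$: $X \approx C\Phi$, where $C = (c_{il})$ is the $n \times L$ coefficient matrix.
Summary statistics (with $\bar{C}$ the row vector of column means of $C$):
$$\bar{X}(t) \approx \bar{C}\,\Phi(t) \qquad (5)$$
$$\hat{\Gamma}(s, t) \approx \Phi(s)^\top \operatorname{Cov}(C)\,\Phi(t) \qquad (6)$$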

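18 Functional PCA 3-6 Fourier Basis
A well-known example of a functional basis is the Fourier basis:
$$(1, \sin \omega t, \cos \omega t, \sin 2\omega t, \cos 2\omega t, \ldots)$$
where $\omega$ is the frequency and determines the period, which equals the length of the interval: $|J| = 2\pi/\omega$.
Functional expansion using the Fourier basis:
$$X_i(t) \approx \frac{a_{0i}}{2} + \sum_{l=1}^{L} (a_{li} \cos \omega l t + b_{li} \sin \omega l t)$$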
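A minimal sketch of how the Fourier coefficients might be estimated from curves sampled on a grid. It uses ordinary least squares via `numpy.linalg.lstsq` rather than the FFT mentioned later in the slides; the helper names `fourier_design` and `fit_fourier` and the test curves are assumptions made for illustration.

```python
import numpy as np

def fourier_design(t, omega, L):
    """Design matrix with columns 1/2, cos(w t), sin(w t), ..., cos(L w t), sin(L w t)."""
    cols = [np.full_like(t, 0.5)]              # constant column so its coefficient is a_0
    for l in range(1, L + 1):
        cols.append(np.cos(omega * l * t))
        cols.append(np.sin(omega * l * t))
    return np.column_stack(cols)

def fit_fourier(t, X, omega, L):
    """Least-squares Fourier coefficients for each row of X (curves on grid t)."""
    Phi = fourier_design(t, omega, L)
    coef, *_ = np.linalg.lstsq(Phi, X.T, rcond=None)
    return coef.T, Phi                         # coefficient matrix C has shape (n, 2L + 1)

# Two noise-free curves with known coefficients, period 2*pi (omega = 1).
t = np.linspace(0.0, 2.0 * np.pi, 200, endpoint=False)
X = np.vstack([
    1.0 + 2.0 * np.cos(t) - 0.5 * np.sin(2.0 * t),   # a_0 = 2, a_1 = 2, b_2 = -0.5
    3.0 * np.sin(t),                                 # b_1 = 3
])
C, Phi = fit_fourier(t, X, omega=1.0, L=2)
```

Each row of `C` is ordered as $(a_0, a_1, b_1, a_2, b_2)$, and `Phi @ C.T` reproduces the curves on the grid.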

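19 Functional PCA 3-7
The eigenequation (2) can then be expressed as
$$\Phi(t)^\top \operatorname{Cov}(C)\,W b = \lambda\,\Phi(t)^\top b$$
where
$$\gamma(t) \approx \sum_{l=1}^{L} b_l\,\Phi_l(t) = \Phi(t)^\top b, \qquad b = (b_1, b_2, \ldots, b_L)^\top,$$
$$W = (w_{l_1, l_2}), \qquad w_{l_1, l_2} \stackrel{\mathrm{def}}{=} \langle \Phi_{l_1}, \Phi_{l_2} \rangle.$$
Note: $\langle \gamma_l, \gamma_k \rangle$ corresponds to $b_l^\top W b_k$, where $b_l$ and $b_k$ here denote the coefficient vectors of $\gamma_l$ and $\gamma_k$.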
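Since the relation above must hold for all $t$, it reduces to the matrix problem $\operatorname{Cov}(C)\,W b = \lambda b$. The sketch below solves it by symmetrizing with $W^{1/2}$, which is one common route and not necessarily the one used by the authors. The function name and the random inputs are hypothetical.

```python
import numpy as np

def fpca_from_coefficients(C, W, n_components=2):
    """Solve Cov(C) W b = lambda b with normalization b' W b = 1.

    C: (n, L) basis coefficients; W: (L, L) symmetric positive definite
    Gram matrix of the basis (both assumed given).
    """
    cov_c = np.cov(C, rowvar=False, bias=True)
    w_vals, w_vecs = np.linalg.eigh(W)
    W_half = (w_vecs * np.sqrt(w_vals)) @ w_vecs.T       # W^{1/2}
    W_half_inv = (w_vecs / np.sqrt(w_vals)) @ w_vecs.T   # W^{-1/2}
    # Symmetric problem: W^{1/2} Cov(C) W^{1/2} u = lambda u, with b = W^{-1/2} u.
    lam, U = np.linalg.eigh(W_half @ cov_c @ W_half)
    order = np.argsort(lam)[::-1][:n_components]
    B = W_half_inv @ U[:, order]                         # columns are coefficient vectors b_k
    return lam[order], B

# Random coefficients and a random positive definite Gram matrix for illustration.
rng = np.random.default_rng(2)
C = rng.normal(size=(40, 5))
A = rng.normal(size=(5, 5))
W = A @ A.T + 5.0 * np.eye(5)
lam, B = fpca_from_coefficients(C, W)
cov_c = np.cov(C, rowvar=False, bias=True)
```

For an orthonormal basis, $W$ is the identity and the problem collapses to the usual eigendecomposition of $\operatorname{Cov}(C)$.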

20 Functional PCA 3-8 Remark
In most practical applications, even $X_i(t)$ can be observed only on discrete grids.
Data: each function $X_i$, $i = 1, \ldots, n$, is observed only on a discrete grid $(t_{1i}, \ldots, t_{pi}) \in J$:
$$(X_i(t_{1i}), X_i(t_{2i}), \ldots, X_i(t_{pi})), \quad i = 1, \ldots, n$$

21 Functional PCA 3-9
The function $X_i(t)$ itself needs to be estimated from $(X_i(t_{1i}), X_i(t_{2i}), \ldots, X_i(t_{pi}))$; that is, for the basis expansion technique we need to estimate the coefficient matrix $\hat{C}$. This can be done with standard numerical or statistical tools.

22 Functional PCA 3-10 Functional Data Example
Figure: Temperature functions. Canadian temperature dataset using FFT with L = 31. XCSfdaex01.xpl

23 Functional PCA 3-11 Mean and Variance Function
Figure: Mean and variance functions of the temperature functions (two panels). XCSfdaMom.xpl

24 Functional PCA 3-12 Covariance Function
Figure: Covariance function. XCSfdaCOV.xpl

25 Smooth Principal Components 4-1 Smooth Principal Components Analysis
Eigenfunctions are usually too rough for clear interpretation, e.g.:
Figure: Functional PCA. Eigenfunctions for the temperature dataset, using Fourier expansion with L = 31. XCSfda02.xpl

26 Smooth Principal Components 4-2 IDEA: smooth eigenfunctions
1. Plug the roughness penalty into the objective function: SPCA 1
2. Plug the roughness penalty into the scalar product: SPCA 2

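27 Smooth Principal Components 4-3 Roughness Penalty in Objective Function
$$\arg\max_{\langle \gamma_l, \gamma_k \rangle = \delta_{lk},\; l \le k} \big( \operatorname{Var}\langle \gamma_k, X \rangle - \alpha_k \langle \gamma_k'', \gamma_k'' \rangle \big) \qquad (7)$$
Idea: penalize rough functions directly in the maximization criterion.
The smoothing parameter $\alpha_k$ has to be chosen (e.g. by a CV criterion).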

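28 Smooth Principal Components 4-4 Change Inner Product
$$\arg\max_{(\gamma_l, \gamma_k) = \delta_{lk},\; l \le k} \operatorname{Var}\langle \gamma_k, X \rangle \qquad (8)$$
The inner product $(\cdot, \cdot)$ is of Sobolev type, e.g. $(f, g) = \langle f, g \rangle + \alpha_k \langle f'', g'' \rangle$.
The idea is to insert the roughness penalty into the inner product.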

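29 Smooth Principal Components 4-5 Remark
Changing the objective function:
$$\max \frac{\operatorname{Var}\langle \gamma_k, X \rangle - \alpha_k \langle \gamma_k'', \gamma_k'' \rangle}{\langle \gamma_k, \gamma_k \rangle}$$
Changing the inner product:
$$\max \frac{\operatorname{Var}\langle \gamma_k, X \rangle}{\langle \gamma_k, \gamma_k \rangle + \alpha_k \langle \gamma_k'', \gamma_k'' \rangle}$$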
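The second variant (changing the inner product) can be sketched in coefficient space. The code below assumes an orthonormal Fourier basis, so that $W = I$ and the second-derivative penalty matrix is diagonal with entries $(\omega l)^4$, and a single common smoothing parameter `alpha` for all components. Under these assumptions the problem becomes $\operatorname{Cov}(C)\,b = \lambda (I + \alpha K) b$. All names here are hypothetical.

```python
import numpy as np

def smoothed_fpca_fourier(C, omega, alpha, n_components=2):
    """Smoothed PCA via a penalized inner product, orthonormal Fourier basis assumed.

    C: (n, 2L + 1) coefficients ordered (const, cos 1, sin 1, ..., cos L, sin L).
    Solves Cov(C) b = lambda (I + alpha K) b with K = diag((omega * l)^4).
    """
    L = (C.shape[1] - 1) // 2
    freqs = np.concatenate(([0.0], np.repeat(np.arange(1, L + 1), 2))) * omega
    penalty = freqs ** 4                          # <phi_l'', phi_l''> for each basis function
    s = 1.0 / np.sqrt(1.0 + alpha * penalty)      # diagonal of (I + alpha K)^{-1/2}
    cov_c = np.cov(C, rowvar=False, bias=True)
    lam, U = np.linalg.eigh(s[:, None] * cov_c * s[None, :])
    order = np.argsort(lam)[::-1][:n_components]
    B = s[:, None] * U[:, order]                  # b = (I + alpha K)^{-1/2} u
    return lam[order], B, penalty

def roughness(b, penalty):
    """Ratio <gamma'', gamma''> / <gamma, gamma> in coefficient space."""
    return float(b @ (penalty * b) / (b @ b))

rng = np.random.default_rng(3)
C = rng.normal(size=(60, 7))                      # L = 3, illustrative coefficients
lam0, B0, penalty = smoothed_fpca_fourier(C, omega=1.0, alpha=0.0)
lam1, B1, _ = smoothed_fpca_fourier(C, omega=1.0, alpha=0.5)
cov_c = np.cov(C, rowvar=False, bias=True)
```

Increasing `alpha` cannot increase the roughness ratio of the leading component, which is the smoothing effect the penalty is meant to deliver.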

30 Smooth Principal Components 4-6 Problems
How to choose $\alpha$ in the roughness penalty? Is, for example, cross-validation appropriate?
How to choose $\Phi$ and $L$?

31 Smooth Principal Components 4-7 Example: Roughness Penalty by Changing the Norm
Figure: Smoothed eigenfunctions (Fourier), 2000. XCSfda03.xpl

32 Common Principal Components 5-1 Common Principal Components
k-sample problem in the multivariate framework:
$k$ random vectors: $X^{(1)}, X^{(2)}, \ldots, X^{(k)} \in \mathbb{R}^p$.
Model: $\Psi_j = \operatorname{Cov}(X^{(j)}) = \Gamma \Lambda_j \Gamma^\top$,
i.e. the eigenvectors are the same across samples, while the eigenvalues (variances) differ.
Let us denote the sample covariance matrices by $S_j$.

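33 Common Principal Components 5-2 Maximum Likelihood Estimation
Further assume $S_i \sim W_p(n_i, \Psi_i / n_i)$. The likelihood is
$$L(\Psi_1, \Psi_2, \ldots, \Psi_k) = C \prod_{i=1}^{k} \exp\Big\{\operatorname{tr}\Big(-\frac{n_i}{2} \Psi_i^{-1} S_i\Big)\Big\} (\det \Psi_i)^{-n_i/2}$$
Maximization of $L$ is equivalent to minimizing
$$\Phi(\Gamma) = \prod_{i=1}^{k} \left[ \frac{\det \operatorname{diag}(\Gamma^\top S_i \Gamma)}{\det(\Gamma^\top S_i \Gamma)} \right]^{n_i} \qquad (9)$$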

34 Common Principal Components 5-3 Testing the CPC Hypothesis
Test statistic:
$$\chi^2_{CPC} = -2 \log \frac{L(\hat{\Psi}_1, \hat{\Psi}_2, \ldots, \hat{\Psi}_k)}{L(S_1, S_2, \ldots, S_k)} = \sum_{i=1}^{k} n_i \log \frac{\det \hat{\Psi}_i}{\det S_i} \sim \chi^2_{(k-1)p(p-1)/2}$$
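The following sketch evaluates the test statistic for a given orthogonal matrix of common eigenvectors. It does not implement the FG algorithm: the matrix `Gamma` is simply supplied by the caller, and the fitted covariances are taken as $\hat{\Psi}_i = \Gamma \operatorname{diag}(\Gamma^\top S_i \Gamma) \Gamma^\top$. The function name and the toy matrices are assumptions for illustration.

```python
import numpy as np

def cpc_lr_statistic(S_list, n_list, Gamma):
    """Sum of n_i * log(det Psi_hat_i / det S_i) and the chi-square degrees of freedom.

    Psi_hat_i is built from the supplied orthogonal Gamma, so its determinant is
    the product of the diagonal entries of Gamma' S_i Gamma.
    """
    stat = 0.0
    for S, n in zip(S_list, n_list):
        lam_hat = np.diag(Gamma.T @ S @ Gamma)      # estimated eigenvalues of sample i
        _, logdet_S = np.linalg.slogdet(S)
        stat += n * (np.sum(np.log(lam_hat)) - logdet_S)
    k, p = len(S_list), S_list[0].shape[0]
    df = (k - 1) * p * (p - 1) // 2
    return stat, df

# Two covariance matrices that share eigenvectors exactly (CPC model holds).
rng = np.random.default_rng(4)
Gamma_true, _ = np.linalg.qr(rng.normal(size=(3, 3)))
S1 = Gamma_true @ np.diag([5.0, 2.0, 1.0]) @ Gamma_true.T
S2 = Gamma_true @ np.diag([3.0, 4.0, 0.5]) @ Gamma_true.T
stat_true, df = cpc_lr_statistic([S1, S2], [30, 30], Gamma_true)

# A different orthogonal matrix gives a strictly larger value.
Gamma_other, _ = np.linalg.qr(rng.normal(size=(3, 3)))
stat_other, _ = cpc_lr_statistic([S1, S2], [30, 30], Gamma_other)
```

The statistic equals $\log \Phi(\Gamma)$ from criterion (9), so it is nonnegative by Hadamard's inequality and vanishes when $\Gamma$ diagonalizes all $S_i$ simultaneously.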

35 Common Principal Components 5-4 Estimation of SCPC
SPCA is implemented through spectral analysis of matrices.
For the multivariate CPC criterion (9), the FG algorithm may be employed.
Remark: In view of the minimization property of the FG algorithm this is a reasonable approach; however, the obtained estimator may not be an ML estimator.

36 Common Principal Components 5-5 Testing the SCPC Hypothesis
Idea motivated by the application: test the distribution of the principal components $f_i^j$.
Takeda, Y., & Sugiyama, T. (2003) use this idea for testing λ 1 1 = λ
A small simulation indicates poor behavior for the SCPC hypothesis.

37 Common Principal Components 5-6 Simulated SCPC Estimation Example
$$Y_1(t) = X_{1,1} \exp\Big\{-\frac{(t - 500)^2}{50^2}\Big\} + X_{2,1} \exp\Big\{-\frac{(t - 600)^2}{25^2}\Big\}$$
$$Y_2(t) = X_{1,2} \exp\Big\{-\frac{(t - 500)^2}{50^2}\Big\} + X_{2,2} \exp\Big\{-\frac{(t - 600)^2}{25^2}\Big\}$$
where
$X_{1,1} \sim N(7, 0.5)$, $X_{2,1} \sim N(3, 0.5)$,
$X_{1,2} \sim N(7, 0.75)$, $X_{2,2} \sim N(3, 0.5)$.
See Caglar, H., & Caglar, N. (2003)

38 Common Principal Components 5-7 Simulation Setup
Functions $X_i$ simulated on a discrete grid
Basis: Fourier basis, $L = 21$
Coefficients estimated using FFT
$T = \{350, 351, \ldots\}$
Number of simulated processes: $n = 30$
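A minimal sketch of how the two simulated samples might be generated. Several details are assumptions rather than facts from the slides: the second parameter of $N(\cdot, \cdot)$ is read as a variance, the exponents are taken as $-(t - c)^2 / s^2$ with $(c, s) = (500, 50)$ and $(600, 25)$, and the upper end of the grid (`t_max = 650`) is hypothetical because it is not given in the source.

```python
import numpy as np

def simulate_processes(n=30, t_max=650, seed=0):
    """Simulate two samples of curves sharing the same two bump-shaped components.

    t_max is a hypothetical grid end point; normal parameters are (mean, variance)
    by assumption.
    """
    rng = np.random.default_rng(seed)
    t = np.arange(350, t_max + 1, dtype=float)
    bump1 = np.exp(-((t - 500.0) ** 2) / 50.0 ** 2)   # first common shape
    bump2 = np.exp(-((t - 600.0) ** 2) / 25.0 ** 2)   # second common shape
    X11 = rng.normal(7.0, np.sqrt(0.5), size=(n, 1))
    X21 = rng.normal(3.0, np.sqrt(0.5), size=(n, 1))
    X12 = rng.normal(7.0, np.sqrt(0.75), size=(n, 1))
    X22 = rng.normal(3.0, np.sqrt(0.5), size=(n, 1))
    Y1 = X11 * bump1 + X21 * bump2
    Y2 = X12 * bump1 + X22 * bump2
    return t, Y1, Y2

t, Y1, Y2 = simulate_processes()
```

Both samples are built from the same two shapes and differ only in the distribution of the random amplitudes, which is the structure a common principal components model is meant to capture.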

39 Common Principal Components 5-8 Simulated Processes
Figure (panels: Simulated Processes Y1, Simulated Processes Y2): Simulated processes $\{Y_{1i}(t)\}_{i=1}^{n}$, $\{Y_{2i}(t)\}_{i=1}^{n}$, $t \in T$; blue: true function.

40 Common Principal Components 5-9 Fourier Expansion of the Simulated Processes
Figure (panels: Estimated Functions Y1, Estimated Functions Y2): Fourier expansion of the simulated processes $\{\hat{Y}_{1i}(t)\}_{i=1}^{n}$, $\{\hat{Y}_{2i}(t)\}_{i=1}^{n}$ (Fast Fourier Transformation, $L = 21$).

41 Common Principal Components 5-10 Principal Components
Figure: 1st components of the simulated processes (smoothing parameter $\alpha$): Process 1, Process 2, FG estimator (CPC).

42 Common Principal Components 5-11 Principal Components
Figure: 2nd components of the simulated processes (smoothing parameter $\alpha$): Process 1, Process 2, FG estimator (CPC).

43 Common Principal Components 5-12 Challenges in Implied Volatility Modelling
1. Data Preparation MD*Base
2. Strings
3. Time Shift of the strings

44 Common Principal Components 5-13 Summary and Outlook
estimation of SCPC using simultaneous diagonalization
statistical properties of the estimation procedure
proper functional bases and diagonalization method for the applications
testing the SCPC hypothesis

45 References 6-1 References
Dauxois, J., Pousse, A., & Romain, Y. (1982). Asymptotic Theory for the Principal Component Analysis of a Vector Random Function: Some Applications to Statistical Inference, Journal of Multivariate Analysis 12.
Rice, J., & Silverman, B. (1991). Estimating the Mean and Covariance Structure Nonparametrically when the Data are Curves, Journal of the Royal Statistical Society, Ser. B 53.
Flury, B. (1988). Common Principal Components and Related Models, Wiley, New York.
Ramsay, J., & Silverman, B. (1997). Functional Data Analysis, Springer, New York.
Härdle, W. (1990). Applied Nonparametric Regression, Econometric Society Monographs.

46 References 6-2
Fengler, M., & Härdle, W. (2003). Voles, Volas, Values, Talk CASE, HU-Berlin.
Fengler, M., Härdle, W., & Villa, C. (2001). The Dynamics of Implied Volatilities: A CPC Approach, under submission.
Fengler, M., Härdle, W., & Mammen, E. (2003). Implied Volatility String Dynamics, under submission.
Takeda, Y., & Sugiyama, T. (2003). Some tests in Functional Principal Components Analysis, unpublished, Chuo University.
Caglar, H., & Caglar, N. (2003). The PCA of Continuous Sample Curves with Higher-order B-spline Functions, unpublished, Istanbul University.


From Data To Functions Howdowegofrom. Basis Expansions From multiple linear regression: The Monomial Basis. The Monomial Basis From Data To Functions Howdowegofrom Basis Expansions From multiple linear regression: data to functions? Or if there is curvature: y i = β 0 + x 1i β 1 + x 2i β 2 + + ɛ i y i = β 0 + x i β 1 + xi 2 β

More information

Approximate Kernel Methods

Approximate Kernel Methods Lecture 3 Approximate Kernel Methods Bharath K. Sriperumbudur Department of Statistics, Pennsylvania State University Machine Learning Summer School Tübingen, 207 Outline Motivating example Ridge regression

More information

FUNCTIONAL DATA ANALYSIS

FUNCTIONAL DATA ANALYSIS FUNCTIONAL DATA ANALYSIS Hans-Georg Müller Department of Statistics University of California, Davis One Shields Ave., Davis, CA 95616, USA. e-mail: mueller@wald.ucdavis.edu KEY WORDS: Autocovariance Operator,

More information

Functional responses, functional covariates and the concurrent model

Functional responses, functional covariates and the concurrent model Functional responses, functional covariates and the concurrent model Page 1 of 14 1. Predicting precipitation profiles from temperature curves Precipitation is much harder to predict than temperature.

More information

Statistics 910, #5 1. Regression Methods

Statistics 910, #5 1. Regression Methods Statistics 910, #5 1 Overview Regression Methods 1. Idea: effects of dependence 2. Examples of estimation (in R) 3. Review of regression 4. Comparisons and relative efficiencies Idea Decomposition Well-known

More information

Time Varying Hierarchical Archimedean Copulae (HALOC)

Time Varying Hierarchical Archimedean Copulae (HALOC) Time Varying Hierarchical Archimedean Copulae () Wolfgang Härdle Ostap Okhrin Yarema Okhrin Ladislaus von Bortkiewicz Chair of Statistics C.A.S.E. Center for Applied Statistics and Economics Humboldt-Universität

More information

Structure in Data. A major objective in data analysis is to identify interesting features or structure in the data.

Structure in Data. A major objective in data analysis is to identify interesting features or structure in the data. Structure in Data A major objective in data analysis is to identify interesting features or structure in the data. The graphical methods are very useful in discovering structure. There are basically two

More information

A direct formulation for sparse PCA using semidefinite programming

A direct formulation for sparse PCA using semidefinite programming A direct formulation for sparse PCA using semidefinite programming A. d Aspremont, L. El Ghaoui, M. Jordan, G. Lanckriet ORFE, Princeton University & EECS, U.C. Berkeley A. d Aspremont, INFORMS, Denver,

More information

Can we do statistical inference in a non-asymptotic way? 1

Can we do statistical inference in a non-asymptotic way? 1 Can we do statistical inference in a non-asymptotic way? 1 Guang Cheng 2 Statistics@Purdue www.science.purdue.edu/bigdata/ ONR Review Meeting@Duke Oct 11, 2017 1 Acknowledge NSF, ONR and Simons Foundation.

More information

Vector autoregressions, VAR

Vector autoregressions, VAR 1 / 45 Vector autoregressions, VAR Chapter 2 Financial Econometrics Michael Hauser WS17/18 2 / 45 Content Cross-correlations VAR model in standard/reduced form Properties of VAR(1), VAR(p) Structural VAR,

More information

Functional modeling of longitudinal data

Functional modeling of longitudinal data CHAPTER 1 Functional modeling of longitudinal data 1.1 Introduction Hans-Georg Müller Longitudinal studies are characterized by data records containing repeated measurements per subject, measured at various

More information

CS281 Section 4: Factor Analysis and PCA

CS281 Section 4: Factor Analysis and PCA CS81 Section 4: Factor Analysis and PCA Scott Linderman At this point we have seen a variety of machine learning models, with a particular emphasis on models for supervised learning. In particular, we

More information

Conditional functional principal components analysis

Conditional functional principal components analysis Conditional functional principal components analysis Hervé Cardot CESAER, UMR INRA-ENESAD. March 27, 2006 Abstract This work proposes an extension of the functional principal components analysis, or Karhunen-Loève

More information

CSC 411: Lecture 09: Naive Bayes

CSC 411: Lecture 09: Naive Bayes CSC 411: Lecture 09: Naive Bayes Class based on Raquel Urtasun & Rich Zemel s lectures Sanja Fidler University of Toronto Feb 8, 2015 Urtasun, Zemel, Fidler (UofT) CSC 411: 09-Naive Bayes Feb 8, 2015 1

More information

Functional time series

Functional time series Rob J Hyndman Functional time series with applications in demography 4. Connections, extensions and applications Outline 1 Yield curves 2 Electricity prices 3 Dynamic updating with partially observed functions

More information

Gaussian Processes. Le Song. Machine Learning II: Advanced Topics CSE 8803ML, Spring 2012

Gaussian Processes. Le Song. Machine Learning II: Advanced Topics CSE 8803ML, Spring 2012 Gaussian Processes Le Song Machine Learning II: Advanced Topics CSE 8803ML, Spring 01 Pictorial view of embedding distribution Transform the entire distribution to expected features Feature space Feature

More information

Sparse PCA with applications in finance

Sparse PCA with applications in finance Sparse PCA with applications in finance A. d Aspremont, L. El Ghaoui, M. Jordan, G. Lanckriet ORFE, Princeton University & EECS, U.C. Berkeley Available online at www.princeton.edu/~aspremon 1 Introduction

More information

Basics of Multivariate Modelling and Data Analysis

Basics of Multivariate Modelling and Data Analysis Basics of Multivariate Modelling and Data Analysis Kurt-Erik Häggblom 6. Principal component analysis (PCA) 6.1 Overview 6.2 Essentials of PCA 6.3 Numerical calculation of PCs 6.4 Effects of data preprocessing

More information

Support Vector Method for Multivariate Density Estimation

Support Vector Method for Multivariate Density Estimation Support Vector Method for Multivariate Density Estimation Vladimir N. Vapnik Royal Halloway College and AT &T Labs, 100 Schultz Dr. Red Bank, NJ 07701 vlad@research.att.com Sayan Mukherjee CBCL, MIT E25-201

More information

Modeling Multi-Way Functional Data With Weak Separability

Modeling Multi-Way Functional Data With Weak Separability Modeling Multi-Way Functional Data With Weak Separability Kehui Chen Department of Statistics University of Pittsburgh, USA @CMStatistics, Seville, Spain December 09, 2016 Outline Introduction. Multi-way

More information

Lecture 1: OLS derivations and inference

Lecture 1: OLS derivations and inference Lecture 1: OLS derivations and inference Econometric Methods Warsaw School of Economics (1) OLS 1 / 43 Outline 1 Introduction Course information Econometrics: a reminder Preliminary data exploration 2

More information

The loss function and estimating equations

The loss function and estimating equations Chapter 6 he loss function and estimating equations 6 Loss functions Up until now our main focus has been on parameter estimating via the maximum likelihood However, the negative maximum likelihood is

More information

Bayesian Decision Theory

Bayesian Decision Theory Bayesian Decision Theory Selim Aksoy Department of Computer Engineering Bilkent University saksoy@cs.bilkent.edu.tr CS 551, Fall 2017 CS 551, Fall 2017 c 2017, Selim Aksoy (Bilkent University) 1 / 46 Bayesian

More information

Functional SVD for Big Data

Functional SVD for Big Data Functional SVD for Big Data Pan Chao April 23, 2014 Pan Chao Functional SVD for Big Data April 23, 2014 1 / 24 Outline 1 One-Way Functional SVD a) Interpretation b) Robustness c) CV/GCV 2 Two-Way Problem

More information

Likelihood Ratio Tests. that Certain Variance Components Are Zero. Ciprian M. Crainiceanu. Department of Statistical Science

Likelihood Ratio Tests. that Certain Variance Components Are Zero. Ciprian M. Crainiceanu. Department of Statistical Science 1 Likelihood Ratio Tests that Certain Variance Components Are Zero Ciprian M. Crainiceanu Department of Statistical Science www.people.cornell.edu/pages/cmc59 Work done jointly with David Ruppert, School

More information

LINEAR MODELS FOR CLASSIFICATION. J. Elder CSE 6390/PSYC 6225 Computational Modeling of Visual Perception

LINEAR MODELS FOR CLASSIFICATION. J. Elder CSE 6390/PSYC 6225 Computational Modeling of Visual Perception LINEAR MODELS FOR CLASSIFICATION Classification: Problem Statement 2 In regression, we are modeling the relationship between a continuous input variable x and a continuous target variable t. In classification,

More information

Asymptotic Multivariate Kriging Using Estimated Parameters with Bayesian Prediction Methods for Non-linear Predictands

Asymptotic Multivariate Kriging Using Estimated Parameters with Bayesian Prediction Methods for Non-linear Predictands Asymptotic Multivariate Kriging Using Estimated Parameters with Bayesian Prediction Methods for Non-linear Predictands Elizabeth C. Mannshardt-Shamseldin Advisor: Richard L. Smith Duke University Department

More information

Lecture 16: Small Sample Size Problems (Covariance Estimation) Many thanks to Carlos Thomaz who authored the original version of these slides

Lecture 16: Small Sample Size Problems (Covariance Estimation) Many thanks to Carlos Thomaz who authored the original version of these slides Lecture 16: Small Sample Size Problems (Covariance Estimation) Many thanks to Carlos Thomaz who authored the original version of these slides Intelligent Data Analysis and Probabilistic Inference Lecture

More information

TAMS39 Lecture 10 Principal Component Analysis Factor Analysis

TAMS39 Lecture 10 Principal Component Analysis Factor Analysis TAMS39 Lecture 10 Principal Component Analysis Factor Analysis Martin Singull Department of Mathematics Mathematical Statistics Linköping University, Sweden Content - Lecture Principal component analysis

More information

Principal Component Analysis-I Geog 210C Introduction to Spatial Data Analysis. Chris Funk. Lecture 17

Principal Component Analysis-I Geog 210C Introduction to Spatial Data Analysis. Chris Funk. Lecture 17 Principal Component Analysis-I Geog 210C Introduction to Spatial Data Analysis Chris Funk Lecture 17 Outline Filters and Rotations Generating co-varying random fields Translating co-varying fields into

More information

Hypothesis Testing For Multilayer Network Data

Hypothesis Testing For Multilayer Network Data Hypothesis Testing For Multilayer Network Data Jun Li Dept of Mathematics and Statistics, Boston University Joint work with Eric Kolaczyk Outline Background and Motivation Geometric structure of multilayer

More information

News Shocks: Different Effects in Boom and Recession?

News Shocks: Different Effects in Boom and Recession? News Shocks: Different Effects in Boom and Recession? Maria Bolboaca, Sarah Fischer University of Bern Study Center Gerzensee June 7, 5 / Introduction News are defined in the literature as exogenous changes

More information