Modeling Multi-Way Functional Data With Weak Separability

Size: px
Start display at page:

Download "Modeling Multi-Way Functional Data With Weak Separability"

Transcription

1 Modeling Multi-Way Functional Data With Weak Separability Kehui Chen Department of Statistics University of Pittsburgh, Seville, Spain December 09, 2016

2 Outline Introduction. Multi-way FPCA, marginal FPCA and product FPCA. Weak separability. Data examples.

3 Based on joint work with Brian Lynch (Pittsburgh), Pedro Delicado (Barcelona) and Hans-Georg Müller (Davis). K. Chen and H.G. Müller (2012), Modeling repeated functional observations, JASA. K. Chen, P. Delicado and H.G. Müller (2016), Modeling functional-valued stochastic processes, with applications to fertility dynamics, JRSSB. B. Lynch and K. Chen (2016+), Weak Separability: Concepts and Inference. In Preparation. Partially supported by NSF

4 Functional Data Functional data consist of fully or partially observed samples of random functions [0, T] R q, usually q = 1. As infinite-dimensional objects, functional data require dimension reduction. Such data are ubiquitous longitudinal studies; tracking and monitoring. Any data observed over a continuum for many individuals.

5 Function-Valued Stochastic Processes Traditional: T X(t) R, X L 2 (T ), X is smooth Now the value of the process at each t T is a random function X(, t) on a domain S: with T X(, t) L 2 (S), X L 2 (S T ), E(X(s, t)) = µ(s, t), C(s, t; u, v) = cov(x(s, t), X(u, v)).

6 Fertility Data ASFR(.,t) for USA ASFR(s,t) for USA ASFR s= t=1960 t=1980 t=2000 s= t=

7 Other Examples of Multi-Way Functional Data Multi-way functional data: the time index t of the stochastic process and the argument s of the observed functions; or the other way around. X(s, t) L 2 (S T ). Data obtained from tracking apps where animal s/people s 24-hour profiles of activities are recorded every day. EEG/fMRI data with spatial and/or temporal index, or even with longitudinal follow-ups over years. Different from traditional spatial-temporal data, as we consider a sample of n realizations.

8 Karhunen-Loève (KL) Representation X(s, t) = µ(s, t) + Z r γ r (s, t), s S, t T. (1) r=1 Here {γ r : r 1} are the eigenfunctions of the linear operator with kernel C(s, t; u, v) = cov(x(s, t), X(u, v)). {Z r = γ r (s, t)x c (s, t)dsdt : r 1} are the (uncorrelated) functional principal components (FPCs) with E(Z r ) = 0. Multi-Way FPCA: Modeling the variability in the random process and dimension reduction. Serve as building blocks for further modeling and regularization.

9 Challenges with Multi-Way FPCA A non-parametric modeling of C(s, t; u, v) = cov(x(s, t), X(u, v)) is very difficult. For dense regular design with p 1 grids in T and p 2 grids in S, a nonparametric estimation of C(s, t; u, v) based on sample covariance has a huge dimension p 2 1 p2 2. Covariance estimation for sparse designs requires 4-dimensional smoothing. The effects of s and t are hard to separate and to visualize.

10 Marginal Kernel Define the the marginal covariance functions t C(s, t; u, t)dt C S (s, u) = s,t C(s, t; s, t)dsdt, C T (t, v) = C(s, t; s, v)ds. C(s, t; s, t)dsdt s s,t The marginal kernels are normalized to have s C S(s, s)ds = 1 and t C T (t, t)dt = 1. The eigen decompositions are C S (s, u) = j=1 λ jψ j (s)ψ j (u) and C T (t, v) = k=1 γ kφ k (t)φ k (v), where λ 1 λ 2..., and γ 1 γ 2..., are the marginal eigenvalues.

11 A Tensor Product Representation The product of marginal eigen functions φ k ψ j forms an orthogonal basis on L 2 (S T ). We have the representation of X(s, t), Product FPCA X(s, t) = µ(s, t) + X(s, t) µ(s, t) + j=1 k=1 P j=1 k=1 χ jk φ k (t)ψ j (s). K χ jk φ k (t)ψ j (s).

12 Loss bound on product FPCA For P 1 and K 1, consider the following loss minimization P K E {X(s, t) X, φ k ψ j φ k (t)ψ j (s)} 2 dsdt. S,T j=1 k=1 Let Q be the minimum unexplained variance that can be achieved using any basis f k (t)g j (s), we have Q < Q +ae X 2 where a = min(a T, a S ), with (1 a T ) denoting the fraction of variance explained by K terms for G T (t, v) and analogously for a S.

13 The product FPCA under strong separability Strong separability: C(s, t; u, v) = ac S (s, u)c T (t, v), with s C S(s, s)ds = 1 and t C T (t, t)dt = 1. We can show that C S and C T are the same as the marginal kernels. Under strong separability, the eigenfunctions of C(s, t; u, v) is in the form of the product of marginal eigenfunctions.

14 Strong separability Under strong separability: X(s, t) = µ(s, t) + j=1 k=1 is the same as the Multi-Way KL expansion. The scores χ jk are mutually uncorrelated. χ jk φ k (t)ψ j (s), However, the strong separability assumption of a covariance is often too restrictive.

15 Brian & Chen (2016+) Concept of weak separability For any orthogonal bases {f j, j 1} in L 2 (S) and {g k, k 1} in L 2 (T ), we can have X(s, t) = µ(s, t) + j=1 k=1 χ jk f j (s)g k (t). X(s, t) is weakly separable if there exist orthogonal bases {f j, j 1} and {g k, k 1} such that cov( χ jk, χ j k ) = 0 for j j or k k., i.e., the scores { χ jk, j 1, k 1} are uncorrelated to each other.

16 Unique bases for weak separability Brian & Chen (2016+) If X is weakly separable, the pair of bases {f j, j 1} and {g k, k 1} that satisfies weak separability is unique: f j (s) ψ j (s), g k (t) φ k (t), where ψ j (s) and φ k (t) are the eigenfunctions of the marginal kernels. Weak separability is testable.

17 Weak separabile covariance structure Under weak separability, the covariance becomes C(s, t; u, v) = η jk ψ j (s)ψ j (u)φ k (t)φ k (v), (2) j=1 k=1 where η jk = var(χ jk ). Strong separability is a special case of weak separability: Strong separability is equivalent to the condition that η jk = aλ j γ k.

18 Weak separable covariance structure cont. Define the two-way array V = (η jk, j 1, k 1), where η jk = var(χ jk ). rank + (V) = min{l : V = V V l, V i 0, rank(v i ) = 1, i}, where V i 0 means that V i is entrywise nonnegative. If V has rank + (V) = L, then V has nonnegative decomposition V = L l=1 al Λ l (Γ l ) T, where Λ l = (λ l j, j 1) and Γ l = (γk l, k 1) are all nonnegative for l = 1,..., L.

19 Weak separable covariance structure cont. Brian & Chen (2016+): a notion of L-separability, and the strong separability corresponds to 1-separability. We have C(s, t; u, v) = L a l CS l (s, u)cl T (t, v), l=1 where C l S (s, u) = j λl j ψ j(s)ψ j (u) and C l T (t, v) = k γl k φ k(t)φ k (v).

20 A variation: Marginal FPCA Using the marginal eigenfunctions {ψ j : j 1} of L 2 (S), X(s, t) = µ(s, t) + ξ j (t)ψ j (s) with random coefficient functions {ξ j : j 1}. Applying the KL representations of ξ j (t), ξ j (t) = k=1 χ jkφ jk (t), with eigenfunctions φ jk and FPCs χ jk, leads to X(s, t) = µ(s, t) + j=1 j=1 k=1 χ jk φ jk (t)ψ j (s).

21 Fertility Data -Specific Fertility Rate (AFSR) for 17 countries, 1951 to s of mothers s range from 12 to 55 years old. (Human Fertility Database 2013 (HFD-2013)) Data: 17 independent units (countries), corresponding to a realization of the function valued stochastic process ASFR(, t) at each year t. Observation grid (age, calendar-year) has equidistant points. AFSR for age s (expressed in years) and calendar year t: Births during the year t to women of age s Person-years lived for the year t by women of age s.

22 AUT BGR CAN CZE FIN FRA HUN JPN NLD PRT SVK SWE

23 PRT SVK SWE CHE GBRTENW GBR_SCO USA ESP

24 Mean function µ(s, t) ASFR sample mean ASFR sample mean

25 Comparing 2-d FPCA with Marginal FPCA and Product FPCA (1) As expected, standard FPCA based on the two-dimensional Karhunen-Loève expansion needs less components to explain a certain amount of variance, as 4 eigenfunctions lead to a FVE of 89.74%, while marginal FPCA representation achieves a FVE of 87.51% with 6 terms, and product FPCA needs 7 terms to explain 87.42%.

26 Table: Fraction of Variance Explained (FVE) of ASFR(s, t) for the leading terms in the proposed marginal FPCA, product FPCA and 2d FPCA. Number of terms are selected to have accumulative FVE more than 85%. marginalfpca FVE productfpca FVE 2d FPCA FVE Six Seven Four ˆφ 11 (t) ˆψ 1 (s) ˆφ1 (t) ˆψ 1 (s) 53.7 ˆγ 1 (s, t) ˆφ 21 (t) ˆψ 2 (s) ˆφ2 (t) ˆψ 2 (s) 8.18 ˆγ 2 (s, t) ˆφ 22 (t) ˆψ 2 (s) 6.88 ˆφ1 (t) ˆψ 2 (s) 8.06 ˆγ 3 (s, t) ˆφ 12 (t) ˆψ 1 (s) 4.63 ˆφ3 (t) ˆψ 2 (s) 5.54 ˆγ 4 (s, t) 6.05 ˆφ 23 (t) ˆψ 2 (s) 4.41 ˆφ2 (t) ˆψ 1 (s) 4.4 ˆφ 31 (t) ˆψ 3 (s) 4.22 ˆφ4 (t) ˆψ 2 (s) 3.86 ˆφ 1 (t) ˆψ 3 (s) 3.68

27 Comparison Cont. (2) Product FPCA and Marginal FPCA represent the functional data as a sum of terms that are products of two functions, each depending on only one argument. This provides for much better interpretability and feature discovery. (3) Marginal FPCA makes it much easier than standard FPCA to analyze the time dynamics of the fertility process.

28 Eigen Functions and Score Functions Eigenfunction 1 (FVE: 61.16%) Eigenfunction 2 (FVE: 27.72%) Eigenfunction 3 (FVE: 6.93%) Eigenfunction Eigenfunction Eigenfunction Functional scores at eigenfunction 1 Functional scores at eigenfunction 2 Functional scores at eigenfunction 3 Scores at eigenfunction USA BGR CZE HUN SVK AUT CAN FRA SWE GBRTENW FIN GBR_SCO PRT CHE JPN ESP NLD BGR USA SVK GBRTENW GBR_SCO HUN CZE PRT AUT CAN ESP CHE JPN FRA FIN SWE NLD Scores at eigenfunction SVK JPN CAN USA FIN PRT NLD CZE FRA BGR GBR_SCO HUN CHE ESP SWE GBRTENW AUT USA FRA GBRTENW SWE FIN NLD GBR_SCO CAN CHE AUT BGR HUN CZE JPN SVK ESP PRT Scores at eigenfunction NLD PRT ESP AUT FIN SWE CAN CHE HUN GBR_SCO SVK USA BGR FRA GBRTENW CZE JPN ESP GBRTENW GBR_SCO SWE CHE NLD CAN PRT JPN FIN HUN USA FRA AUT SVK CZE BGR

29 Track Plots For Selected Countries Scores at eigenfunctions 2 vs. 1 Scores at eigenfunction JPN 1966.PRT 1966.ESP 1966.NLD 1951.PRT 1951.NLD 1981.PRT 2006.NLD 1981.ESP 1981.JPN 1996.NLD 1996.JPN 1996.PRT 1981.USA 1981.NLD 2006.CZE 2006.JPN 2006.ESP 2006.PRT 1951.ESP 1996.ESP 1966.JPN 2006.USA 1951.USA 1996.USA 1981.CZE 1966.USA 1951.CZE 1996.CZE 1966.CZE Scores at eigenfunction 1 Scores at eigenfunctions 3 vs NLD 2006.ESP Scores at eigenfunction PRT 1996.ESP 1996.NLD 1966.PRT 1951.ESP 1981.USA 2006.NLD 1981.PRT 1966.ESP 1981.ESP 2006.PRT 1996.JPN 2006.JPN 1996.PRT 1966.NLD 1981.NLD 1951.JPN 2006.CZE 1966.JPN 1981.JPN 1996.USA 1966.USA 1981.CZE 1966.CZE 2006.USA 1951.USA 1951.CZE 1996.CZE Scores at eigenfunction 1

30 Summary Observe a sample of X i (s, t), we want to model the variability in the random process and perform dimension reduction. Tensor product representation: allows for analyzing the separate (possibly asymmetric) effects of s and t. Optimal representation based on marginal kernels: Marginal FPCA and product FPCA. The new concept of weak separability: much more flexible than strong separable covariance assumption.

31 Thank You! Questions?

Modeling Function-Valued Stochastic Processes, With Applications to Fertility Dynamics

Modeling Function-Valued Stochastic Processes, With Applications to Fertility Dynamics Modeling Function-Valued Stochastic Processes, With Applications to Fertility Dynamics Kehui Chen 1, Pedro Delicado 2 and Hans-Georg Müller 3 1 Dept. of Statistics, University of Pittsburgh, Pittsburgh,

More information

Modeling Function-Valued Stochastic Processes, With Applications to Fertility Dynamics

Modeling Function-Valued Stochastic Processes, With Applications to Fertility Dynamics Modeling Function-Valued Stochastic Processes, With Applications to Fertility Dynamics Kehui Chen 1, Pedro Delicado 2 and Hans-Georg Müller 3 1 Dept. of Statistics, University of Pittsburgh, Pittsburgh,

More information

Weak Separablility for Two-way Functional Data: Concept and Test

Weak Separablility for Two-way Functional Data: Concept and Test Weak Separablility for Two-way Functional Data: Concept and Test arxiv:703.020v2 [stat.me] 3 Mar 207 Kehui Chen, Brian Lynch University of Pittsburgh April 3, 207 Abstract: This paper concerns multi-way

More information

Modeling Function-Valued Stochastic Processes, With Applications to Fertility Dynamics

Modeling Function-Valued Stochastic Processes, With Applications to Fertility Dynamics Modeling Function-Valued Stochastic Processes, With Applications to Fertility Dynamics Kehui Chen 1,2,PedroDelicado 3 and Hans-Georg Müller 4 1 Dept. of Statistics, University of Pittsburgh, Pittsburgh,

More information

Modeling Repeated Functional Observations

Modeling Repeated Functional Observations Modeling Repeated Functional Observations Kehui Chen Department of Statistics, University of Pittsburgh, Hans-Georg Müller Department of Statistics, University of California, Davis Supplemental Material

More information

A test of weak separability for multi-way functional data, with application to brain connectivity studies

A test of weak separability for multi-way functional data, with application to brain connectivity studies Biometrika (-), -, -, pp. 1 16 Printed in Great Britain - A test of weak separability for multi-way functional data, with application to brain connectivity studies BY BRIAN LYNCH AND KEHUI CHEN Department

More information

FUNCTIONAL DATA ANALYSIS FOR VOLATILITY PROCESS

FUNCTIONAL DATA ANALYSIS FOR VOLATILITY PROCESS FUNCTIONAL DATA ANALYSIS FOR VOLATILITY PROCESS Rituparna Sen Monday, July 31 10:45am-12:30pm Classroom 228 St-C5 Financial Models Joint work with Hans-Georg Müller and Ulrich Stadtmüller 1. INTRODUCTION

More information

FUNCTIONAL DATA ANALYSIS. Contribution to the. International Handbook (Encyclopedia) of Statistical Sciences. July 28, Hans-Georg Müller 1

FUNCTIONAL DATA ANALYSIS. Contribution to the. International Handbook (Encyclopedia) of Statistical Sciences. July 28, Hans-Georg Müller 1 FUNCTIONAL DATA ANALYSIS Contribution to the International Handbook (Encyclopedia) of Statistical Sciences July 28, 2009 Hans-Georg Müller 1 Department of Statistics University of California, Davis One

More information

Tests for separability in nonparametric covariance operators of random surfaces

Tests for separability in nonparametric covariance operators of random surfaces Tests for separability in nonparametric covariance operators of random surfaces Shahin Tavakoli (joint with John Aston and Davide Pigoli) April 19, 2016 Analysis of Multidimensional Functional Data Shahin

More information

Second-Order Inference for Gaussian Random Curves

Second-Order Inference for Gaussian Random Curves Second-Order Inference for Gaussian Random Curves With Application to DNA Minicircles Victor Panaretos David Kraus John Maddocks Ecole Polytechnique Fédérale de Lausanne Panaretos, Kraus, Maddocks (EPFL)

More information

Functional Latent Feature Models. With Single-Index Interaction

Functional Latent Feature Models. With Single-Index Interaction Generalized With Single-Index Interaction Department of Statistics Center for Statistical Bioinformatics Institute for Applied Mathematics and Computational Science Texas A&M University Naisyin Wang and

More information

Weakly dependent functional data. Piotr Kokoszka. Utah State University. Siegfried Hörmann. University of Utah

Weakly dependent functional data. Piotr Kokoszka. Utah State University. Siegfried Hörmann. University of Utah Weakly dependent functional data Piotr Kokoszka Utah State University Joint work with Siegfried Hörmann University of Utah Outline Examples of functional time series L 4 m approximability Convergence of

More information

Tutorial on Functional Data Analysis

Tutorial on Functional Data Analysis Tutorial on Functional Data Analysis Ana-Maria Staicu Department of Statistics, North Carolina State University astaicu@ncsu.edu SAMSI April 5, 2017 A-M Staicu Tutorial on Functional Data Analysis April

More information

Modeling Repeated Functional Observations

Modeling Repeated Functional Observations Modeling Repeated Functional Observations Kehui Chen Department of Statistics, University of Pittsburgh, Hans-Georg Müller Department of Statistics, University of California, Davis ABSTRACT We introduce

More information

CONCEPT OF DENSITY FOR FUNCTIONAL DATA

CONCEPT OF DENSITY FOR FUNCTIONAL DATA CONCEPT OF DENSITY FOR FUNCTIONAL DATA AURORE DELAIGLE U MELBOURNE & U BRISTOL PETER HALL U MELBOURNE & UC DAVIS 1 CONCEPT OF DENSITY IN FUNCTIONAL DATA ANALYSIS The notion of probability density for a

More information

Fractal functional regression for classification of gene expression data by wavelets

Fractal functional regression for classification of gene expression data by wavelets Fractal functional regression for classification of gene expression data by wavelets Margarita María Rincón 1 and María Dolores Ruiz-Medina 2 1 University of Granada Campus Fuente Nueva 18071 Granada,

More information

Nonparametric Inference In Functional Data

Nonparametric Inference In Functional Data Nonparametric Inference In Functional Data Zuofeng Shang Purdue University Joint work with Guang Cheng from Purdue Univ. An Example Consider the functional linear model: Y = α + where 1 0 X(t)β(t)dt +

More information

Karhunen-Loeve Expansion and Optimal Low-Rank Model for Spatial Processes

Karhunen-Loeve Expansion and Optimal Low-Rank Model for Spatial Processes TTU, October 26, 2012 p. 1/3 Karhunen-Loeve Expansion and Optimal Low-Rank Model for Spatial Processes Hao Zhang Department of Statistics Department of Forestry and Natural Resources Purdue University

More information

Polynomial Chaos and Karhunen-Loeve Expansion

Polynomial Chaos and Karhunen-Loeve Expansion Polynomial Chaos and Karhunen-Loeve Expansion 1) Random Variables Consider a system that is modeled by R = M(x, t, X) where X is a random variable. We are interested in determining the probability of the

More information

Derivative Principal Component Analysis for Representing the Time Dynamics of Longitudinal and Functional Data 1

Derivative Principal Component Analysis for Representing the Time Dynamics of Longitudinal and Functional Data 1 Derivative Principal Component Analysis for Representing the Time Dynamics of Longitudinal and Functional Data 1 Short title: Derivative Principal Component Analysis Xiongtao Dai, Hans-Georg Müller Department

More information

A Statistical Look at Spectral Graph Analysis. Deep Mukhopadhyay

A Statistical Look at Spectral Graph Analysis. Deep Mukhopadhyay A Statistical Look at Spectral Graph Analysis Deep Mukhopadhyay Department of Statistics, Temple University Office: Speakman 335 deep@temple.edu http://sites.temple.edu/deepstat/ Graph Signal Processing

More information

Fundamental concepts of functional data analysis

Fundamental concepts of functional data analysis Fundamental concepts of functional data analysis Department of Statistics, Colorado State University Examples of functional data 0 1440 2880 4320 5760 7200 8640 10080 Time in minutes The horizontal component

More information

Stochastic Spectral Approaches to Bayesian Inference

Stochastic Spectral Approaches to Bayesian Inference Stochastic Spectral Approaches to Bayesian Inference Prof. Nathan L. Gibson Department of Mathematics Applied Mathematics and Computation Seminar March 4, 2011 Prof. Gibson (OSU) Spectral Approaches to

More information

Geostatistical Modeling for Large Data Sets: Low-rank methods

Geostatistical Modeling for Large Data Sets: Low-rank methods Geostatistical Modeling for Large Data Sets: Low-rank methods Whitney Huang, Kelly-Ann Dixon Hamil, and Zizhuang Wu Department of Statistics Purdue University February 22, 2016 Outline Motivation Low-rank

More information

Efficient Observation of Random Phenomena

Efficient Observation of Random Phenomena Lecture 9 Efficient Observation of Random Phenomena Tokyo Polytechnic University The 1st Century Center of Excellence Program Yukio Tamura POD Proper Orthogonal Decomposition Stochastic Representation

More information

An Efficient Estimation Method for Longitudinal Surveys with Monotone Missing Data

An Efficient Estimation Method for Longitudinal Surveys with Monotone Missing Data An Efficient Estimation Method for Longitudinal Surveys with Monotone Missing Data Jae-Kwang Kim 1 Iowa State University June 28, 2012 1 Joint work with Dr. Ming Zhou (when he was a PhD student at ISU)

More information

Singular Additive Models for Function to. Function Regression

Singular Additive Models for Function to. Function Regression Statistica Sinica Singular Additive Models for Function to Function Regression Byeong U. Park, Chun-Jui Chen, Wenwen Tao and Hans-Georg Müller Seoul National University and University of California at

More information

Independent component analysis for functional data

Independent component analysis for functional data Independent component analysis for functional data Hannu Oja Department of Mathematics and Statistics University of Turku Version 12.8.216 August 216 Oja (UTU) FICA Date bottom 1 / 38 Outline 1 Probability

More information

Empirical Dynamics for Longitudinal Data

Empirical Dynamics for Longitudinal Data Empirical Dynamics for Longitudinal Data December 2009 Short title: Empirical Dynamics Hans-Georg Müller 1 Department of Statistics University of California, Davis Davis, CA 95616 U.S.A. Email: mueller@wald.ucdavis.edu

More information

Karhunen-Loève Expansions of Lévy Processes

Karhunen-Loève Expansions of Lévy Processes Karhunen-Loève Expansions of Lévy Processes Daniel Hackmann June 2016, Barcelona supported by the Austrian Science Fund (FWF), Project F5509-N26 Daniel Hackmann (JKU Linz) KLE s of Lévy Processes 0 / 40

More information

Chapter 2: Fundamentals of Statistics Lecture 15: Models and statistics

Chapter 2: Fundamentals of Statistics Lecture 15: Models and statistics Chapter 2: Fundamentals of Statistics Lecture 15: Models and statistics Data from one or a series of random experiments are collected. Planning experiments and collecting data (not discussed here). Analysis:

More information

Shrinkage Estimation for Functional Principal Component Scores, with Application to the Population Kinetics of Plasma Folate

Shrinkage Estimation for Functional Principal Component Scores, with Application to the Population Kinetics of Plasma Folate Shrinkage Estimation for Functional Principal Component Scores, with Application to the Population Kinetics of Plasma Folate Fang Yao, Hans-Georg Müller,, Andrew J. Clifford, Steven R. Dueker, Jennifer

More information

for Sparsely Observed Functional Data University of Glasgow

for Sparsely Observed Functional Data University of Glasgow Spatial Correlation Estimation for Sparsely Observed Functional Data Workshop on High dimensional and dependent functional data, Bristol, Sep 2012 Surajit Ray University of Glasgow Contributors t Chong

More information

Karhunen-Loève Approximation of Random Fields Using Hierarchical Matrix Techniques

Karhunen-Loève Approximation of Random Fields Using Hierarchical Matrix Techniques Institut für Numerische Mathematik und Optimierung Karhunen-Loève Approximation of Random Fields Using Hierarchical Matrix Techniques Oliver Ernst Computational Methods with Applications Harrachov, CR,

More information

LECTURES 2-3 : Stochastic Processes, Autocorrelation function. Stationarity.

LECTURES 2-3 : Stochastic Processes, Autocorrelation function. Stationarity. LECTURES 2-3 : Stochastic Processes, Autocorrelation function. Stationarity. Important points of Lecture 1: A time series {X t } is a series of observations taken sequentially over time: x t is an observation

More information

A Vector-Space Approach for Stochastic Finite Element Analysis

A Vector-Space Approach for Stochastic Finite Element Analysis A Vector-Space Approach for Stochastic Finite Element Analysis S Adhikari 1 1 Swansea University, UK CST2010: Valencia, Spain Adhikari (Swansea) Vector-Space Approach for SFEM 14-17 September, 2010 1 /

More information

Australia. Accepted author version posted online: 08 Aug 2014.

Australia. Accepted author version posted online: 08 Aug 2014. This article was downloaded by: [University of Toronto Libraries] On: 17 November 2014, At: 19:34 Publisher: Taylor & Francis Informa Ltd Registered in England and Wales Registered Number: 1072954 Registered

More information

Smooth Common Principal Component Analysis

Smooth Common Principal Component Analysis 1 Smooth Common Principal Component Analysis Michal Benko Wolfgang Härdle Center for Applied Statistics and Economics benko@wiwi.hu-berlin.de Humboldt-Universität zu Berlin Motivation 1-1 Volatility Surface

More information

Modeling Sparse Generalized Longitudinal Observations With Latent Gaussian Processes

Modeling Sparse Generalized Longitudinal Observations With Latent Gaussian Processes Modeling Sparse Generalized Longitudinal Observations With Latent Gaussian Processes Peter Hall,2, Hans-Georg Müller,3 and Fang Yao 4 December 27 SUMMARY. In longitudinal data analysis one frequently encounters

More information

An Introduction to Functional Data Analysis

An Introduction to Functional Data Analysis An Introduction to Functional Data Analysis Chongzhi Di Fred Hutchinson Cancer Research Center cdi@fredhutch.org Biotat 578A: Special Topics in (Genetic) Epidemiology November 10, 2015 Textbook Ramsay

More information

Nonparametric Covariance Function Estimation for Functional and Longitudinal Data

Nonparametric Covariance Function Estimation for Functional and Longitudinal Data Nonparametric Covariance Function Estimation for Functional and Longitudinal Data. ony Cai 1 and Ming Yuan University of Pennsylvania and Georgia Institute of echnology (September 4, 010 Abstract Covariance

More information

Platzhalter für Bild, Bild auf Titelfolie hinter das Logo einsetzen

Platzhalter für Bild, Bild auf Titelfolie hinter das Logo einsetzen Platzhalter für Bild, Bild auf Titelfolie hinter das Logo einsetzen PARAMetric UNCertainties, Budapest STOCHASTIC PROCESSES AND FIELDS Noémi Friedman Institut für Wissenschaftliches Rechnen, wire@tu-bs.de

More information

Dynamic Relations for Sparsely Sampled Gaussian Processes

Dynamic Relations for Sparsely Sampled Gaussian Processes TEST manuscript No. (will be inserted by the editor) Dynamic Relations for Sparsely Sampled Gaussian Processes Hans-Georg Müller Wenjing Yang Received: date / Accepted: date Abstract In longitudinal studies,

More information

Lecture 1: Center for Uncertainty Quantification. Alexander Litvinenko. Computation of Karhunen-Loeve Expansion:

Lecture 1: Center for Uncertainty Quantification. Alexander Litvinenko. Computation of Karhunen-Loeve Expansion: tifica Lecture 1: Computation of Karhunen-Loeve Expansion: Alexander Litvinenko http://sri-uq.kaust.edu.sa/ Stochastic PDEs We consider div(κ(x, ω) u) = f (x, ω) in G, u = 0 on G, with stochastic coefficients

More information

Package denseflmm. R topics documented: March 17, 2017

Package denseflmm. R topics documented: March 17, 2017 Package denseflmm March 17, 2017 Type Package Title Functional Linear Mixed Models for Densely Sampled Data Version 0.1.0 Author Sonja Greven, Jona Cederbaum Maintainer Jona Cederbaum

More information

Analysis of Fractals, Image Compression and Entropy Encoding

Analysis of Fractals, Image Compression and Entropy Encoding Analysis of Fractals, Image Compression and Entropy Encoding Myung-Sin Song Southern Illinois University Edwardsville Jul 10, 2009 Joint work with Palle Jorgensen. Outline 1. Signal and Image processing,

More information

Solving the steady state diffusion equation with uncertainty Final Presentation

Solving the steady state diffusion equation with uncertainty Final Presentation Solving the steady state diffusion equation with uncertainty Final Presentation Virginia Forstall vhfors@gmail.com Advisor: Howard Elman elman@cs.umd.edu Department of Computer Science May 6, 2012 Problem

More information

Structure in Data. A major objective in data analysis is to identify interesting features or structure in the data.

Structure in Data. A major objective in data analysis is to identify interesting features or structure in the data. Structure in Data A major objective in data analysis is to identify interesting features or structure in the data. The graphical methods are very useful in discovering structure. There are basically two

More information

Functional Data Analysis for Big Data: A case study on California temperature trends

Functional Data Analysis for Big Data: A case study on California temperature trends Functional Data Analysis for Big Data: A case study on California temperature trends Pantelis Zenon Hadjipantelis and Hans-Georg Müller Abstract In recent years, detailed historical records, remote sensing,

More information

Functional modeling of longitudinal data

Functional modeling of longitudinal data CHAPTER 1 Functional modeling of longitudinal data 1.1 Introduction Hans-Georg Müller Longitudinal studies are characterized by data records containing repeated measurements per subject, measured at various

More information

A Stochastic Collocation based. for Data Assimilation

A Stochastic Collocation based. for Data Assimilation A Stochastic Collocation based Kalman Filter (SCKF) for Data Assimilation Lingzao Zeng and Dongxiao Zhang University of Southern California August 11, 2009 Los Angeles Outline Introduction SCKF Algorithm

More information

Functional Linear Mixed Models for. Irregularly or Sparsely Sampled Data

Functional Linear Mixed Models for. Irregularly or Sparsely Sampled Data Functional Linear Mixed Models for Irregularly or Sparsely Sampled Data Jona Cederbaum 1, Marianne Pouplier 2, Phil Hoole 2, and Sonja Greven 1 arxiv:150801686v1 [statme] 7 Aug 2015 1 Department of Statistics,

More information

Basic Operator Theory with KL Expansion

Basic Operator Theory with KL Expansion Basic Operator Theory with Karhunen Loéve Expansion Wafa Abu Zarqa, Maryam Abu Shamala, Samiha Al Aga, Khould Al Qitieti, Reem Al Rashedi Department of Mathematical Sciences College of Science United Arab

More information

Conditional Autoregressif Hilbertian process

Conditional Autoregressif Hilbertian process Application to the electricity demand Jairo Cugliari SELECT Research Team Sustain workshop, Bristol 11 September, 2012 Joint work with Anestis ANTONIADIS (Univ. Grenoble) Jean-Michel POGGI (Univ. Paris

More information

Missing Data Issues in the Studies of Neurodegenerative Disorders: the Methodology

Missing Data Issues in the Studies of Neurodegenerative Disorders: the Methodology Missing Data Issues in the Studies of Neurodegenerative Disorders: the Methodology Sheng Luo, PhD Associate Professor Department of Biostatistics & Bioinformatics Duke University Medical Center sheng.luo@duke.edu

More information

Collocation based high dimensional model representation for stochastic partial differential equations

Collocation based high dimensional model representation for stochastic partial differential equations Collocation based high dimensional model representation for stochastic partial differential equations S Adhikari 1 1 Swansea University, UK ECCM 2010: IV European Conference on Computational Mechanics,

More information

On prediction and density estimation Peter McCullagh University of Chicago December 2004

On prediction and density estimation Peter McCullagh University of Chicago December 2004 On prediction and density estimation Peter McCullagh University of Chicago December 2004 Summary Having observed the initial segment of a random sequence, subsequent values may be predicted by calculating

More information

Groundwater permeability

Groundwater permeability Groundwater permeability Easy to solve the forward problem: flow of groundwater given permeability of aquifer Inverse problem: determine permeability from flow (usually of tracers With some models enough

More information

PCA & ICA. CE-717: Machine Learning Sharif University of Technology Spring Soleymani

PCA & ICA. CE-717: Machine Learning Sharif University of Technology Spring Soleymani PCA & ICA CE-717: Machine Learning Sharif University of Technology Spring 2015 Soleymani Dimensionality Reduction: Feature Selection vs. Feature Extraction Feature selection Select a subset of a given

More information

A Note on Hilbertian Elliptically Contoured Distributions

A Note on Hilbertian Elliptically Contoured Distributions A Note on Hilbertian Elliptically Contoured Distributions Yehua Li Department of Statistics, University of Georgia, Athens, GA 30602, USA Abstract. In this paper, we discuss elliptically contoured distribution

More information

Joint Factor Analysis for Speaker Verification

Joint Factor Analysis for Speaker Verification Joint Factor Analysis for Speaker Verification Mengke HU ASPITRG Group, ECE Department Drexel University mengke.hu@gmail.com October 12, 2012 1/37 Outline 1 Speaker Verification Baseline System Session

More information

New Developments in LS-OPT - Robustness Studies

New Developments in LS-OPT - Robustness Studies 5. LS-DYNA - Anwenderforum, Ulm 2006 Robustheit / Optimierung I New Developments in LS-OPT - Robustness Studies Ken Craig, Willem Roux 2, Nielen Stander 2 Multidisciplinary Design Optimization Group University

More information

On the Karhunen-Loève basis for continuous mechanical systems

On the Karhunen-Loève basis for continuous mechanical systems for continuous mechanical systems 1 1 Department of Mechanical Engineering Pontifícia Universidade Católica do Rio de Janeiro, Brazil e-mail: rsampaio@mec.puc-rio.br Some perspective to introduce the talk

More information

Entropy Encoding Using Karhunen-Loève Transform

Entropy Encoding Using Karhunen-Loève Transform Entropy Encoding Using Karhunen-Loève Transform Myung-Sin Song Southern Illinois University Edwardsville Sept 17, 2007 Joint work with Palle Jorgensen. Introduction In most images their neighboring pixels

More information

AN INTRODUCTION TO THEORETICAL PROPERTIES OF FUNCTIONAL PRINCIPAL COMPONENT ANALYSIS. Ngoc Mai Tran Supervisor: Professor Peter G.

AN INTRODUCTION TO THEORETICAL PROPERTIES OF FUNCTIONAL PRINCIPAL COMPONENT ANALYSIS. Ngoc Mai Tran Supervisor: Professor Peter G. AN INTRODUCTION TO THEORETICAL PROPERTIES OF FUNCTIONAL PRINCIPAL COMPONENT ANALYSIS Ngoc Mai Tran Supervisor: Professor Peter G. Hall Department of Mathematics and Statistics, The University of Melbourne.

More information

A Stickiness Coefficient for Longitudinal Data

A Stickiness Coefficient for Longitudinal Data A Stickiness Coefficient for Longitudinal Data June 2011 Andrea Gottlieb 1 Graduate Group in Biostatistics University of California, Davis 1 Shields Avenue Davis, CA 95616 U.S.A. Phone: 1 (530) 752-2361

More information

Module 9: Stationary Processes

Module 9: Stationary Processes Module 9: Stationary Processes Lecture 1 Stationary Processes 1 Introduction A stationary process is a stochastic process whose joint probability distribution does not change when shifted in time or space.

More information

MIXTURE INNER PRODUCT SPACES AND THEIR APPLICATION TO FUNCTIONAL DATA ANALYSIS

MIXTURE INNER PRODUCT SPACES AND THEIR APPLICATION TO FUNCTIONAL DATA ANALYSIS MIXTURE INNER PRODUCT SPACES AND THEIR APPLICATION TO FUNCTIONAL DATA ANALYSIS Zhenhua Lin 1, Hans-Georg Müller 2 and Fang Yao 1,3 Abstract We introduce the concept of mixture inner product spaces associated

More information

Analysing geoadditive regression data: a mixed model approach

Analysing geoadditive regression data: a mixed model approach Analysing geoadditive regression data: a mixed model approach Institut für Statistik, Ludwig-Maximilians-Universität München Joint work with Ludwig Fahrmeir & Stefan Lang 25.11.2005 Spatio-temporal regression

More information

Random Vectors and Multivariate Normal Distributions

Random Vectors and Multivariate Normal Distributions Chapter 3 Random Vectors and Multivariate Normal Distributions 3.1 Random vectors Definition 3.1.1. Random vector. Random vectors are vectors of random 75 variables. For instance, X = X 1 X 2., where each

More information

ftsa: An R Package for Analyzing Functional Time Series by Han Lin Shang

ftsa: An R Package for Analyzing Functional Time Series by Han Lin Shang CONTRIBUTED RESEARCH ARTICLES 64 ftsa: An R Package for Analyzing Functional Time Series by Han Lin Shang Abstract Recent advances in computer recording and storing technology have tremendously increased

More information

A. Cuñat 1 R. Zymek 2

A. Cuñat 1 R. Zymek 2 1 2 1 University of Vienna and CESifo, alejandro.cunat@univie.ac.at 2 University of Edinburgh and CESifo, robert.zymek@ed.ac.uk and Outline accounting: quantitative assessment of the contribution of measurable

More information

Lecture Note 1: Probability Theory and Statistics

Lecture Note 1: Probability Theory and Statistics Univ. of Michigan - NAME 568/EECS 568/ROB 530 Winter 2018 Lecture Note 1: Probability Theory and Statistics Lecturer: Maani Ghaffari Jadidi Date: April 6, 2018 For this and all future notes, if you would

More information

Testing the Equality of Covariance Operators in Functional Samples

Testing the Equality of Covariance Operators in Functional Samples Scandinavian Journal of Statistics, Vol. 4: 38 5, 3 doi:./j.467-9469..796.x Board of the Foundation of the Scandinavian Journal of Statistics. Published by Blackwell Publishing Ltd. Testing the Equality

More information

Curve alignment and functional PCA

Curve alignment and functional PCA Curve alignment and functional PCA Juhyun Par* Department of Mathematics and Statistics, Lancaster University, Lancaster, U.K. juhyun.par@lancaster.ac.u Abstract When dealing with multiple curves as functional

More information

Fully Bayesian Deep Gaussian Processes for Uncertainty Quantification

Fully Bayesian Deep Gaussian Processes for Uncertainty Quantification Fully Bayesian Deep Gaussian Processes for Uncertainty Quantification N. Zabaras 1 S. Atkinson 1 Center for Informatics and Computational Science Department of Aerospace and Mechanical Engineering University

More information

Properties of Principal Component Methods for Functional and Longitudinal Data Analysis 1

Properties of Principal Component Methods for Functional and Longitudinal Data Analysis 1 Properties of Principal Component Methods for Functional and Longitudinal Data Analysis 1 Peter Hall 2 Hans-Georg Müller 3,4 Jane-Ling Wang 3,5 Abstract The use of principal components methods to analyse

More information

Curves clustering with approximation of the density of functional random variables

Curves clustering with approximation of the density of functional random variables Curves clustering with approximation of the density of functional random variables Julien Jacques and Cristian Preda Laboratoire Paul Painlevé, UMR CNRS 8524, University Lille I, Lille, France INRIA Lille-Nord

More information

Covariance and Correlation

Covariance and Correlation Covariance and Correlation ST 370 The probability distribution of a random variable gives complete information about its behavior, but its mean and variance are useful summaries. Similarly, the joint probability

More information

Principal Components Theory Notes

Principal Components Theory Notes Principal Components Theory Notes Charles J. Geyer August 29, 2007 1 Introduction These are class notes for Stat 5601 (nonparametrics) taught at the University of Minnesota, Spring 2006. This not a theory

More information

Introduction to Functional Data Analysis A CSCU Workshop. Giles Hooker Biological Statistics and Computational Biology

Introduction to Functional Data Analysis A CSCU Workshop. Giles Hooker Biological Statistics and Computational Biology Introduction to Functional Data Analysis A CSCU Workshop Giles Hooker Biological Statistics and Computational Biology gjh27@cornell.edu www.bscb.cornell.edu/ hooker/fdaworkshop 1 / 26 Agenda What is Functional

More information

01 Probability Theory and Statistics Review

01 Probability Theory and Statistics Review NAVARCH/EECS 568, ROB 530 - Winter 2018 01 Probability Theory and Statistics Review Maani Ghaffari January 08, 2018 Last Time: Bayes Filters Given: Stream of observations z 1:t and action data u 1:t Sensor/measurement

More information

A Stickiness Coefficient for Longitudinal Data

A Stickiness Coefficient for Longitudinal Data A Stickiness Coefficient for Longitudinal Data Revised Version November 2011 Andrea Gottlieb 1 Department of Statistics University of California, Davis 1 Shields Avenue Davis, CA 95616 U.S.A. Phone: 1

More information

Principal Component Analysis-I Geog 210C Introduction to Spatial Data Analysis. Chris Funk. Lecture 17

Principal Component Analysis-I Geog 210C Introduction to Spatial Data Analysis. Chris Funk. Lecture 17 Principal Component Analysis-I Geog 210C Introduction to Spatial Data Analysis Chris Funk Lecture 17 Outline Filters and Rotations Generating co-varying random fields Translating co-varying fields into

More information

arxiv: v2 [math.pr] 27 Oct 2015

arxiv: v2 [math.pr] 27 Oct 2015 A brief note on the Karhunen-Loève expansion Alen Alexanderian arxiv:1509.07526v2 [math.pr] 27 Oct 2015 October 28, 2015 Abstract We provide a detailed derivation of the Karhunen Loève expansion of a stochastic

More information

REGRESSING LONGITUDINAL RESPONSE TRAJECTORIES ON A COVARIATE

REGRESSING LONGITUDINAL RESPONSE TRAJECTORIES ON A COVARIATE REGRESSING LONGITUDINAL RESPONSE TRAJECTORIES ON A COVARIATE Hans-Georg Müller 1 and Fang Yao 2 1 Department of Statistics, UC Davis, One Shields Ave., Davis, CA 95616 E-mail: mueller@wald.ucdavis.edu

More information

9.1 Orthogonal factor model.

9.1 Orthogonal factor model. 36 Chapter 9 Factor Analysis Factor analysis may be viewed as a refinement of the principal component analysis The objective is, like the PC analysis, to describe the relevant variables in study in terms

More information

Statistics for Applications. Chapter 9: Principal Component Analysis (PCA) 1/16

Statistics for Applications. Chapter 9: Principal Component Analysis (PCA) 1/16 Statistics for Applications Chapter 9: Principal Component Analysis (PCA) 1/16 Multivariate statistics and review of linear algebra (1) Let X be a d-dimensional random vector and X 1,..., X n be n independent

More information

Statistical signal processing

Statistical signal processing Statistical signal processing Short overview of the fundamentals Outline Random variables Random processes Stationarity Ergodicity Spectral analysis Random variable and processes Intuition: A random variable

More information

Network Representation Using Graph Root Distributions

Network Representation Using Graph Root Distributions Network Representation Using Graph Root Distributions Jing Lei Department of Statistics and Data Science Carnegie Mellon University 2018.04 Network Data Network data record interactions (edges) between

More information

Machine Learning. Dimensionality reduction. Hamid Beigy. Sharif University of Technology. Fall 1395

Machine Learning. Dimensionality reduction. Hamid Beigy. Sharif University of Technology. Fall 1395 Machine Learning Dimensionality reduction Hamid Beigy Sharif University of Technology Fall 1395 Hamid Beigy (Sharif University of Technology) Machine Learning Fall 1395 1 / 47 Table of contents 1 Introduction

More information

1. Fundamental concepts

1. Fundamental concepts . Fundamental concepts A time series is a sequence of data points, measured typically at successive times spaced at uniform intervals. Time series are used in such fields as statistics, signal processing

More information

Multilevel accelerated quadrature for elliptic PDEs with random diffusion. Helmut Harbrecht Mathematisches Institut Universität Basel Switzerland

Multilevel accelerated quadrature for elliptic PDEs with random diffusion. Helmut Harbrecht Mathematisches Institut Universität Basel Switzerland Multilevel accelerated quadrature for elliptic PDEs with random diffusion Mathematisches Institut Universität Basel Switzerland Overview Computation of the Karhunen-Loéve expansion Elliptic PDE with uniformly

More information

Chapter 4: Factor Analysis

Chapter 4: Factor Analysis Chapter 4: Factor Analysis In many studies, we may not be able to measure directly the variables of interest. We can merely collect data on other variables which may be related to the variables of interest.

More information

Solving the Stochastic Steady-State Diffusion Problem Using Multigrid

Solving the Stochastic Steady-State Diffusion Problem Using Multigrid Solving the Stochastic Steady-State Diffusion Problem Using Multigrid Tengfei Su Applied Mathematics and Scientific Computing Advisor: Howard Elman Department of Computer Science Sept. 29, 2015 Tengfei

More information

Finance and Synchronization

Finance and Synchronization Finance and Synchronization A. Cesa-Bianchi 1 J. Imbs 2 J. Saleheen 3 1 Bank of England & Centre for Macroeconomics 2 Paris School of Economics (CNRS), and CEPR 3 Bank of England ECB/BoE/IMF Workshop Global

More information

Alternatives to Basis Expansions. Kernels in Density Estimation. Kernels and Bandwidth. Idea Behind Kernel Methods

Alternatives to Basis Expansions. Kernels in Density Estimation. Kernels and Bandwidth. Idea Behind Kernel Methods Alternatives to Basis Expansions Basis expansions require either choice of a discrete set of basis or choice of smoothing penalty and smoothing parameter Both of which impose prior beliefs on data. Alternatives

More information

(1) (2) (3) Baseline (replicated)

(1) (2) (3) Baseline (replicated) Table A1: Regressions with nonlinear term and fixed effects (based on baseline specification of Table IV, column 4) (refers to footnotes 42 and 45 in the published paper) (1) (2) (3) Baseline (replicated)

More information

A Selective Review of Sufficient Dimension Reduction

A Selective Review of Sufficient Dimension Reduction A Selective Review of Sufficient Dimension Reduction Lexin Li Department of Statistics North Carolina State University Lexin Li (NCSU) Sufficient Dimension Reduction 1 / 19 Outline 1 General Framework

More information

Mathematische Methoden der Unsicherheitsquantifizierung

Mathematische Methoden der Unsicherheitsquantifizierung Mathematische Methoden der Unsicherheitsquantifizierung Oliver Ernst Professur Numerische Mathematik Sommersemester 2014 Contents 1 Introduction 1.1 What is Uncertainty Quantification? 1.2 A Case Study:

More information