Georgia State University. Georgia State University. Jing Wang Georgia State University. Fall

Size: px
Start display at page:

Download "Georgia State University. Georgia State University. Jing Wang Georgia State University. Fall"

Transcription

1 Georgi Stte University Georgi Stte University Mthemtics Disserttions Deprtment of Mthemtics nd Sttistics Fll Functionl Principl Component Anlysis for Discretely Observed Functionl Dt nd Sprse Fisher s Discriminnt Anlysis with Thresholded Liner Constrints Jing Wng Georgi Stte University Follow this nd dditionl works t: Recommended Cittion Wng, Jing, "Functionl Principl Component Anlysis for Discretely Observed Functionl Dt nd Sprse Fisher s Discriminnt Anlysis with Thresholded Liner Constrints." Disserttion, Georgi Stte University, This Disserttion is brought to you for free nd open ccess by the Deprtment of Mthemtics nd Sttistics t Georgi Stte University. It hs been ccepted for inclusion in Mthemtics Disserttions by n uthorized dministrtor of Georgi Stte University. For more informtion, plese contct scholrworks@gsu.edu.

2 Functionl Principl Component Anlysis for Discretely Observed Functionl Dt nd Sprse Fisher s Discriminnt Anlysis with Thresholded Liner Constrints by Jing Wng Under the Direction of Xin Qi, PhD ABSTRACT We propose new method to perform functionl principl component nlysis (FPCA) for discretely observed functionl dt by solving successive optimiztion problems. The new frmework cn be pplied to both regulrly nd irregulrly observed dt, nd to both dense nd sprse dt. Our method does not require estimtes of the individul smple functions or the covrince functions. Hence, it cn be used to nlyze functionl dt

3 with multidimensionl rguments (e.g. rndom surfces). Furthermore, it cn be pplied to mny processes nd models with complicted or nonsmooth covrince functions. In our method, smoothness of eigenfunctions is controlled by directly imposing roughness penlties on eigenfunctions, which mkes it more efficient nd flexible to tune the smoothness. Efficient lgorithms for solving the successive optimiztion problems re proposed. We provide the existence nd chrcteriztion of the solutions to the successive optimiztion problems. The consistency of our method is lso proved. Through simultions, we demonstrte tht our method performs well in the cses with smooth smples curves, with discontinuous smple curves nd nonsmooth covrince nd with smple functions hving two dimensionl rguments (rndom surfces), repectively. We pply our method to clssifiction problems of retinl pigment epithelil cells in eyes of mice nd to longitudinl CD4 counts dt. In the second prt of this disserttion, we propose sprse Fisher s discriminnt nlysis method with thresholded liner constrints. Vrious regulrized liner discriminnt nlysis (LDA) methods hve been proposed to ddress the problems of the LDA in high-dimensionl settings. Asymptotic optimlity hs been estblished for some of these methods when there re only two clsses. A difficulty in the symptotic study for the multiclss clssifiction is tht for the two-clss clssifiction, the clssifiction boundry is hyperplne nd n explicit formul for the clssifiction error exists, however, in the cse of multiclss, the boundry is usully complicted nd no explicit formul for the error generlly exist. Another difficulty in proving the symptotic consistency nd optimlity for sprse Fisher s discriminnt nlysis is tht the covrince mtrix is involved in the constrints of the optimiztion problems for high order components. It is not esy to estimte generl high-dimensionl covrince mtrix. Thus, we propose sprse Fisher s discriminnt nlysis method which voids the estimtion of the

4 covrince mtrix, provide symptotic consistency results nd the corresponding convergence rtes for ll components. To prove the symptotic optimlity, we provide n symptotic upper bound for generl liner clssifiction rule in the cse of muticlss which is pplied to our method to obtin the symptotic optimlity nd the corresponding convergence rte. In the specil cse of two clsses, our method chieves the sme s or better convergence rtes compred to the existing method. The proposed method is pplied to multivrite functionl dt with wvelet trnsformtions. INDEX WORDS: Functionl PCA, discretely observed functionl dt, successive optimiztion problems, roughness penlty, consistency, sprse Fisher s discriminnt nlysis, thresholded liner constrints, symptotic consistency, symptotic optimlity, convergence rte

5 Functionl Principl Component Anlysis for Discretely Observed Functionl Dt nd Sprse Fisher s Discriminnt Anlysis with Thresholded Liner Constrints by Jing Wng A Disserttion Submitted in Prtil Fulfillment of the Requirements for the Degree of Doctor of Philosophy in the College of Arts nd Sciences Georgi Stte University 2016

6 Copyright by Jing Wng 2016

7 Functionl Principl Component Anlysis for Discretely Observed Functionl Dt nd Sprse Fisher s Discriminnt Anlysis with Thresholded Liner Constrints by Jing Wng Committee Chir: Xin Qi Committee: Gengsheng Qin Yichun Zho Ruiyn Luo Electronic Version Approved: Office of Grdute Studies College of Arts nd Sciences Georgi Stte University October 2016

8 iv DEDICATION This disserttion is dedicted to my der prents, my dvisor, nd ll my der friends.

9 v ACKNOWLEDGEMENTS I would like to give my deep thnks nd sincere grtitude to everyone for helping me to complete this disserttion. First nd foremost I wnt to express my sincere grtitude to my thesis dvisor, Xin Qi, for the continuous support of my Ph.D study nd relted reserch, for his ptience, motivtion, nd immense knowledge. I hve been mzingly fortunte to hve n dvisor who gve me the freedom to explore on my own, nd t the sme time the guidnce to recover when my steps fltered. His guidnce helped me in ll the time of reserch nd writing of this disserttion. I could not hve imgined hving better dvisor nd mentor for my Ph.D study. He hs been my dvisor, mentor, collbortor nd friend. I pprecite Professors, Gengsheng Qin, Yichun Zho nd Ruiyn Luo, who kindly greed to serve on my disserttion committee nd hve given me continuous help throughout my grdute study. I m lso thnkful to them for reding my reports, commenting on my views nd helping me understnd nd enrich my ides. Without their precious support it would not be possible to conduct this reserch. Besides, I would like to thnk other professors from our deprtment for educting me in vrious courses. At lst, I wnt to thnk everyone who hs helped me throughout my grdute life, especilly my immedite fmily to whom this disserttion is dedicted to. None of this would hve been possible without the love nd ptience of my fmily. I m lso grteful to friends. Their support nd cre helped me overcome setbcks nd sty focused on my grdute study. I gretly vlue their friendship nd I deeply pprecite their belief in me.

10 vi TABLE OF CONTENTS ACKNOWLEDGEMENTS v LIST OF TABLES ix LIST OF FIGURES x LIST OF ABBREVIATIONS xi Chpter 1 INTRODUCTION Chpter 2 FUNCTIONAL PRINCIPAL COMPONENT ANALYSIS FOR DISCRETELY OBSERVED FUNCTIONAL DATA Bckground nd Nottions Silvermn s pproch to smoothed functionl PCA Functionl PCA for discretely observed functionl dt Regulr cse Irregulr cse Computtionl issues Consistency Extensions to FPCA for functionl dt with multidimensionl rguments Simultion studies Smooth rndom curves with 2 PC curves Smooth rndom curves with 3 PC curves Rndom surfce Applictions Retinl pigment epithelium (RPE) dt Longitudinl CD4 counts dt

11 vii Chpter 3 SPARSE FISHER S DISCRIMINANT ANALYSIS WITH THRESHOLDED LINEAR CONSTRAINTS Fisher s discriminnt nlysis Sprse Fisher s discriminnt nlysis with thresholded liner constrints The cse of K = The cse of K > Computtion Asymptotic consistency nd symptotic optimlity The cse of K = The cse of K > Simultion studies Appliction to multivrite functionl dt Dily nd sports ctivities dt Austrlin sign lnguge dt Chpter 4 PROOFS OF THEOREMS Proof of Theorem Proof of Theorem Proof of Theorem Proof of Theorem Proof of Theorem Proof of Theorem Proof of Theorem Proof of Theorem Proof of Theorem Proof of Theorem Proof of Lemms Proof of Lemm Proof of Lemm

12 viii Proof of Lemm Proof of Lemm Proof of Lemm Proof of Lemm Proof of Lemm Proof of Lemm Proof of Lemm Proof of Lemm Proof of Lemm Bibliogrphy

13 ix LIST OF TABLES Tble 2.1 The verges nd stndrd devitions of cumultive vrince of selected principl component scores for the simultions in Section 2.4.1: Regulr Cse Tble 2.2 The verges nd stndrd devitions of cumultive vrince of selected principl component scores for the simultions in Section 2.4.1: Irregulr Cse Tble 2.3 Selected smoothing prmeter with the usul cross-vlidtion procedure for the simultions in Section 2.4.2: Regulr Cse Tble 2.4 The verges nd stndrd devitions of cumultive vrince of selected principl component scores for the simultions in Section 2.4.2: Regulr Cse Tble 2.5 Selected smoothing prmeter with the usul cross-vlidtion procedure for the simultions in Section 2.4.2: Irregulr Cse Tble 2.6 The verges nd stndrd devitions of cumultive vrince of selected principl component scores for the simultions in Section 2.4.2: Irregulr Cse Tble 3.1 The verges nd stndrd devitions of misclssifiction rtes (%) for the simultions in Section Tble 3.2 The verges nd stndrd devitions of the misclssifiction rtes (%) for the dily nd sports ctivities dt Tble 3.3 The verges nd stndrd devitions of the misclssifiction rtes (%) for the Austrlin sign lnguge dt

14 x LIST OF FIGURES Figure 2.1 The First Two Principl Component Curves Figure 2.2 Simulted Smple Curves for the simultions in Section Figure 2.3 The First Three Principl Component Curves Figure 2.4 Simulted Smple Curves for the simultions in Section Figure 2.5 Eigenfunctions nd their estimtes in one simultion Figure 2.6 Locl regions of two RPE cells Figure 2.7 Men joint densities of four ctegories Figure 2.8 Plots of the first four estimted principl component functions for RPE dt Figure 2.9 Smple curves from CD4 dt Figure 2.10 Estimtes of the first three eigenfunctions for CD4 dt

15 xi LIST OF ABBREVIATIONS GSU - Georgi Stte University PCA - Principl Component Anlysis FPCA - Functionl Principl Component Anlysis LDA - Liner Discriminnt Anlysis CV - Cross-vlidtion

16 1 Chpter 1 INTRODUCTION Principl component nlysis (PCA) is one of the best known techniques in both multivrite nlysis nd functionl dt nlysis. Different from clssicl PCA, functionl principl component nlysis (PCA) requires smoothing or regulrizing of the estimted principl component curves (see Chpter 9 in Rmsy nd Silvermn [33]). Reders cn find generl overview of mny methods for computing the smoothed functionl principl components when the smple curves re fully observed in Rmsy nd Silvermn [33]. Ferrty nd Vieu [16] provides more discussions on nonprmetric methods nd developments for functionl dt nlysis. However, in prctice, the smple functions re usully observed t discrete points with mesurement errors. The observtion points might be irregulr or sprse. Severl FPCA methods for discretely smpled functionl dt or longitudinl dt hve been developed. Shi, Weiss nd Tylor [38], Rice nd Wu [35] nd Jmes, Hstie nd Sugr [23] proposed mixed effects pproches in which individul smple curves or eigenfunctions of the covrince function re represented by bsis function expnsions. Stniswlis nd Lee [44] nd Yo, Müller nd Wng [54] used nonprmetric methods to estimte covrince functions nd then obtined the eigenfunctions. Hung, Shen nd Buj [22] proposed n FPCA method for regulrly observed discrete functionl dt bsed on penlized rnk one pproximtion to the dt mtrix. Peng nd Pul [29] ssumed finite rnk model for the covrince function, represented the eigenfunctions s bsis function expnsions, nd proposed the restricted mximum likelihood method to estimte the prmeters. Other nonprmetric pproches to this problem tend to fll into two clsses. The pproches in the first clss smooth ech individul curve by the smoothing spline method or other methods (see section 9.5 in Rmsy nd Silvermn [33]). Then the smoothed principl component curves cn be obtined by the usul functionl PCA or other methods. For exmple, motivted by the dulity reltion between row nd column spces of dt mtrix, Benko et l. [6] proposed n FPCA method for regulrly observed discrete functionl dt. The methods

17 2 in the second clss ssume tht covrince functions re smooth. Smoothing methods such s kernel methods nd free-knot spline smoothing re used to obtin smoothed estimtes of men functions nd covrince functions. Then the principl components curves cn be estimted by the eigenfunctions of the smoothed covrince function. However, some of these methods (Hung et l. [22] nd Benko et l. [6] ) cnnot be pplied to the discrete functionl dt in which the observtion points re irregulr nd sprse. Some of them (Stniswlis nd Lee [44] nd Yo et l.[54]) need to estimte the covrince functions, hence it is hrd to pply these methods to functionl dt with two or three dimensionl rguments since we hve to estimte four or six dimensionl covrince functions. In this disserttion, we first propose new method to perform FPCA for discretely observed functionl dt by solving successive optimiztion problems. The new frmework cn be pplied to both regulrly nd irregulrly observed dt, nd to both dense nd sprse dt. First, our method does not need to estimte the individul smple functions or the covrince functions nd we do not ssume tht they re smooth. Hence, it cn be esily pplied to discretely observed functionl dt with two or three dimensionl rguements nd to processes nd models with complicted or nonsmooth covrince functions. Most of the current methods ssume tht either the smple functions or the covrince functions re smooth explicitly or implicitly. Some of them need to obtin the smoothed estimtions of the smple functions or the covrince functions. However, there re mny importnt processes nd models with nonsmooth smple functions nd nonsmooth covrince functions but with smooth eigenfunctions. Our methods cn be pplied to these processes nd models. Some rel functionl dt hve complicted covrince functions in which we re not interested. In this cse, our methods void estimting the complicted covrince functions. Second, our method controls the smoothness of eigenfunctions by directly imposing roughness penlties on eigenfunctions nd cn use different smoothing prmeters for different eigenfunctions. Hence, it is efficient nd flexible to tune the smoothness of eigenfunctions in this method. Our methods cn lso be esily extended to nlyze the discretely observed functions defined on high-dimensionl spces, e.g. rndom surfces. Section 5 in Müller (2005) listed some open problems concern the ppliction of FDA methods including nlysis of rndom surfces nd higher-dimensionl functions. We pplied our methods to simulted discretely observed

18 3 rndom surfce dt. For high-dimensionl dt, the covrince functions re defined on higher dimension spce with dimensions equl to two times of the dimensions of the smple functions. Hence, it is very hrd to obtin good estimtions of covrince functions in this cse. Efficient lgorithms for solving the successive optimiztion problems re proposed. We provide the existence nd chrcteriztion of the solutions to the successive optimiztion problems. The consistency of our method is lso proved. The following rel exmple is used to motivte nd illustrte the method developed in this disserttion. The liner discriminnt nlysis (LDA) hs been fvored tool for supervised clssifiction in the settings of smll p nd lrge n. However, it fces mjor problems for high-dimensionl dt. In theory, Bickel nd Levin [7] nd Sho et l. [37] showed tht the usul LDA cn be s bd s the rndom guessing when p > n. In prctice, the clssic LDA methods hve bd predictive performnce in high-dimensionl settings. To ddress these problems, vrious regulrized discriminnt nlysis methods hve been proposed, including Friedmn [17], Krznowski et l. [26], Dudoit et l. [13], Bickel nd Levin [7], Guo et l. [19], Xu et l. [53], Tibshirni et l. [47], Witten nd Tibshirni [52], Clemmensen et l. [11], Sho et l. [37], Ci nd Liu [9], Fn et l. [15], Qi et l. [32] nd mny others. Asymptotic optimlity hs been estblished in some of these ppers when there re two clsses. Sho et l. [37] mde sprsity ssumptions on both the difference δ = µ 2 µ 1, where µ 1 nd µ 2 re the popultion mens of the two clss, nd the within-clss covrince mtrix Σ. Then thresholding procedures were pplied to both the difference between the two smple clss mens nd the smple within-clss covrince mtrix Σ. The symptotic optimlity nd the corresponding convergence rte for their clssifiction rule were obtined. Ci nd Liu [9] observed tht in the cse of two clsses, the optiml clssifiction rule depends on Σ only through Σ 1 δ. Hence, they ssumed l 1 sprsity for Σ 1 δ, proposed sprse estimte of it through minimizing its l 1 norm with n l constrint, nd provided symptotic optimlity of their clssifiction rule. Fn et l. [15] imposed l 0 sprsity ssumption on Σ 1 δ, estimted it through minimiztion problem with n l 1 constrint nd derived the symptotic optimlity. A mjor difficulty preventing the derivtion of symptotic optimlity of the liner clssifiction rules for multiple clsses is tht for the two-clss clssifiction, the clssifiction boundry of LDA is hyperplne nd n explicit formul for the clssifiction error exists, however,

19 4 for the multiclss clssifiction, the clssifiction boundry is usully complicted nd no explicit formul for the clssifiction error generlly exist. As specil cse of LDA, the Fisher s discriminnt nlysis projects the originl vribles X to low dimensionl subspce to generte new predictor vribles, Xα 1, Xα 2,..., Xα K 1, where the coefficient vectors α 1, α 2,..., α K 1 re sequentilly clculted nd K is the number of clsses. The coefficient vectors re found by mximizing the between clss vrition of the new predictor vribles reltive to their within clss vrition nd the new predictors re orthogonl to ech other, tht is, the liner constrints α T i Σα j = 0 for ny 1 j < i < K re stisfied. Once the coefficients re determined nd the clssifiction rule is to ssign new observtion to the clss with the smple clss men closest to this observtion in the projection subspce. Besides the complicted clssifiction boundry for multiclss, the liner constrint α T i Σα j = 0 poses dditionl difficulty in studying the symptotic consistency nd optimlity for the Fisher s discriminnt nlysis in high dimensionl setting for K > 2 becuse the covrince mtrix Σ is involved. It is not esy to find consistent estimte for generl Σ in the high-dimensionl settings. Qi et l. [32] introduced sprse Fisher s discriminnt nlysis method, n dvntge of which is tht the proposed lgorithm is pplicble to ny liner constrints imposed on the higher order components. In the second prt of this disserttion, insted of iming to find consistent estimte of Σ, we pply soft-thresholding procedure to obtin consistent estimte of the subspce {Σα 1,, Σα i 1 } which defines the liner constrints for α i, for ny 1 < i K 1. Then tking dvntge of the lgorithm in the pper bove, we propose the estimtes of α i, for ll 1 i K 1, nd n clssifiction rule. We study the theoreticl properties of this method in high dimensionl settings, including the symptotic consistency of the estimte of α i nd the subspces defining the orthogonl constrints, the symptotic optimlity, nd the corresponding convergence rtes, where the number K of clsses cn be ny fixed positive integer. In the specil cse of K = 2, the symptotic optimlity of the our method is compred to the existing method nd our method hs the sme or better convergence rte. We pply our method to the clssifiction problems for multivrite functionl dt through the wvelet trnsformtions. The reminder of the disserttion is orgnized s follow. In Chpter 2, we present our new method to perform FPCA for discretely observed functionl dt by solving successive

20 5 optimiztion problems. We first give some bckground, bsic nottions nd our min ssumptions. The clssic Silvermn s method to perform smoothed FPCA is lso introduced. We then present our method long with its theoreticl properties, nd n lgorithm for solving the successive optimiztion problems in prctice. Simultion results with comprison to other estblished method re reported to illustrte the effectiveness of our method. At lst, we pply our method on 2 rel dt sets: the RPE dt set nd the Longitudinl CD4 counts dt set. In Chpter 3, we propose sprse Fisher s discriminnt nlysis method with thresholded liner constrints which voids the estimtion of the covrince mtrix. We first introduce nottions nd briefly review the clssic Fisher s discriminnt nlysis. Then our sprse Fisher s LDA method with thresholded liner constrints re introduced. We lso present the min theoreticl results long with simultion studies nd pplictions. All proofs of our theorems cn be found in the Chpter 4 of the disserttion.

21 6 Chpter 2 FUNCTIONAL PRINCIPAL COMPONENT ANALYSIS FOR DISCRETELY OBSERVED FUNCTIONAL DATA In this chpter, we present our new method to perform function principl nlysis (FPCA) for discretely observed functionl dt by solving successive optimiztion problems. We first give some bckground, bsic nottions nd our min ssumptions. The clssic Silvermn s method to perform smoothed FPCA is introduced in Section 2.2. We then present our method long with its theoreticl properties, nd n lgorithm for solving the successive optimiztion problems in prctice. Simultion results re reported to illustrte the effectiveness of our method. At lst, we pply our method on the RPE dt set nd the Longitudinl CD4 counts dt set in Section 2.5. The proofs of ll theorems re provided in Section 4.11 of Chpter Bckground nd Nottions First, we introduce nottions nd definitions used in this chpter. Let N denote the collection of ll the positive integers. In this chpter, we will minly consider functions defined in finite intervl [, b] in the following two spces, the spce L 2 ([, b]) of squre integrble functions b L 2 ([, b]) = {f : f is mesurble function on [, b] nd f(t) 2 dt < } nd the Sobolev spce W 2 2 ([, b]) of functions with squre integrble second derivtives, W 2 ([, b]) = {f : f, f re bsolutely continuous on[, b] ndf L 2 ([, b])}

22 7 where f nd f denote the first nd second derivtives of f, respectively. For ny f, g L 2 ([, b]), define the usul inner product f, g = b f(t)g(t)dt with corresponding squred norm f 2 = f, f. Given smoothing prmeter α 0, for ny f, g W2 2 ([, b]), define [f, g] = b f (t)g (t)dt nd the inner product f, g α = f, g + α[f, g] with corresponding squred norm f 2 α = f, f α. Here we use the sme nottions s those in Silvermn [41]. Let X(t), t b be mesurble stochstic process (rndom function) on [, b] nd X 1 (t), X 2 (t),, X n (t) be i.i.d smple functions from the distribution of X(t). Below we give three bsic ssumptions on X(t) which re essentilly the sme s those in Silvermn [41]. Assumption 1. E [ X 4 ] = E [ ( b X(t) 2 dt ) 2 ] <. Under Assumption 1, X(t) L 2 ([, b]).s.. Assume tht men function EX(t) = µ(t). Define the covrince function of X(t) Γ(s, t) = E [(X(s) µ(s)) (X(t) µ(t))], s, t [, b], (2.1) Under Assumption 1, Γ hs sequence of nonnegtive eigenvlues λ 1 λ 2 0 nd the corresponding eigenfunctions γ 1, γ 2,. Every eigenfunction hs been scled to hve L 2 -norm 1. The set of ll the eigenfunctions forms n orthonorml bsis of L 2 ([, b]). Furthermore, we hve decomposition Γ(s, t) = λ j γ j (s)γ j (t) j=1. Suppose tht we re interested in estimting the first K eigenvlues nd eigenfunctions of the covrince function Γ.

23 8 Assumption 2. Any eigenvlue λ j, 1 j K hs multiplicity 1, so tht λ 1 > λ 2 > > λ K > λ K+1. This ssumption is just the third ssumption in Section 5.2 of Silvermn [41]. If n eigenvlue hs multiplicity 1, then the corresponding eigenfunction is uniquely determined up to sign. If the multiplicity is lrger thn 1, the eigenfunctions cn not be uniquely determined up to sign (Qi nd Zho [31]). Assumption 3. The eigenfunctions γ j, 1 j K belong to W 2 2 ([, b]). If the covrince function Γ is smooth, then Assumption 3 holds. However, there re mny importnt rndom processes whose covrince mtrices re nonsmooth, but the eigenfunctions belong to W 2 2 ([, b]) (Qi nd Zho [31]). For exmple, the continuous prmeter AR(1) model in time series, Brownin motion, Poisson process nd the stochstic differentil eqution models driven by them. Exmple 1. (Brownin motion nd Poisson process). Consider the stndrd Brownin motion nd the Poisson process with rte 1 in time intervl [0, 1]. Their covrince functions re the sme nd equl to min(s, t), 0 s, t 1 (see Pge 89 in the book Glssermn [18]) which is nonsmooth. The eigenvlues nd eigenfunctions re ( ) 2 2 λ j =, γ j = ( ) (2j 1)πt 2 sin, j = 1, 2, (2.2) (2j 1)π 2 Exmple 2. (Stochstic differentil eqution models). SDE re widely used to model rndom processes in my res. One exmple is the fmous Blck-Scholes Model in finnce. Let S t denote the price of stock t time t. Then S t stisfies the following SDE, ds t = νs t dt + σs t dw t where ν is the instntneous men return, σ is the instntneous return voltility nd W t is Brownin motion. Another exmple is the counting processes model in survivl nlysis. Let N t be the number

24 9 of the occurrences of the event in [0, t]. Then N t stisfies dn t = λ(t)dt + dm t where λ(t) is smooth intensity function nd M t is mrtingle (Qi nd Zho [31]). Exmple 3. (Continuous prmeter models in time series). Consider the con-tinuous prmeter AR(1) model in time series. Its covrince function is Γ(s, t) = e α s t α where α is positive number (see Section 3.7 in Priestley [30]). This covrince function is nonsmooth. For these models, the covrince functions re nonsmooth but the eigenfunctions re smooth. In ddition to these processes nd models, some rel functionl dt hve covrince functions with complicted ptterns. 2.2 Silvermn s pproch to smoothed functionl PCA In this section, the independent smple curves from the distribution of X(t), {X 1 (t), X 2 (t),, X n (t) : t b} re ssumed to be entirely observed nd t could be continuum in [, b], or in twodimensionl region [, b] [c, d], or in higher-dimensionl regions.. The covrince function is defined s 2.1 nd covrince opertor (Γγ)(t) = b Γ(t, s)γ(s)ds. For ny β, γ L 2 ([, b]), Cov [ β, X, γ, X ] = β, Γγ

25 10 FPCA is one of the key techniques in functionl dt nlysis for ptterns discovery nd dimension reduction in dt sets. The first popultion functionl principl component s one-dimensionl projection of X γ 1, X = b γ 1 (t)x(t)dt, γ 1 = 1 which mximizes the vrince of principl component scores γ, Γγ V r( γ 1, X ) = mx V r( γ, X ) = mx γ, Γγ = mx (2.3) γ =1 γ =1 γ =1 γ 2 for ll nonzero liner functionls l in L 2 ([, b]) with the norm l = 1. γ 1 is clled the first principl component weight function or the first PC curve. Let λ 1 be the mximum vlue of (2.3). The pir (λ 1, γ 1 ) re the first eigenvlue nd eigenfunction of Γ (see Section 2, Chpter 3 in Weinberger [51]), Γγ 1 = λ 1 γ 1 The second functionl principl component (γ 2, X): γ 2 is the solution to γ, Γγ mx (2.4) γ =1, γ,γ 1 =0 γ 2 Let λ 2 be the mximum vlue of (2.4). The pir (λ 2, γ 2 ) re the second eigenvlue nd eigenfunction of Γ, Γγ 2 = λ 2 γ 2 Similry, the successive popultion functionl principl components re defined. However, we usully do not know the true covrince function Γ nd the popultion principl component weight functions cn not be obtined directly. We cn use the smple covrince function ˆΓ n to estimte Γ nd use the eigenvlues nd eigenfunctions of ˆΓ n to estimte the eigenvlues nd eigenfunctions of Γ, which re clled non-smooth estimtors. It ws pointed out tht the non-smooth principl component curves cn show substntil vribility (see Chpter 9 in Rmsy nd Silvermn [33]). Therefore, smoothing of the estimted principl component weight functions is necessry.

26 11 Silvermn [41] (see lso Chpter 9 in Rmsy nd Silvermn [33]) proposed n importnt method which incorportes smoothing by replcing the usul L 2 norm with norm tht tkes the roughness of the functions into ccount. Qi nd Zho [31] summrizes the theoreticl nd prcticl dvntges of Silvermn s pproch s follows: First, the wek ssumptions underlying this method mke it pplicble to dt from mny fields. Silvermn [41] did not mke ny ssumptions on the men curves nd smple curves. Hence, in ddition to dt with smooth rndom curves, this method cn be pplied to nlyze dt where the smple curves cn be unsmooth or even discontinuous, such s those encountered in finncil engineering, survivl nlysis nd other fields. For covrince functions, Silvermn [41] only ssumed tht they hve series expnsions by their eigenfunctions without imposing smoothing constrint. This is ttrctive becuse the covrince functions re continuous but unsmooth in mny importnt models such s stochstic differentil eqution models in finncil engineering nd counting process models in survivl nlysis. Second, Silvermn s method controls the smoothness of eigenfunction curves by directly imposing roughness penlties on these functions insted of on smple curves or covrince functions. Furthermore, this pproch chnges the eigenvlue nd eigenfunction problems in the usul L 2 spce to problems in nother Hilbert spce, the Sobolev spce (with norm different from the usul norm in the Sobolev spce). Therefore, mny powerful tools from the theory of Hilbert spce cn be employed to study the properties of this method. Third, this pproch incorportes the smoothing step into the step for computing eigenvlues nd eigenfunctions. Therefore, this method is computtionlly efficient with the sme computtionl lod s the usul unsmoothed functionl PCA. Fourth, the estimtes produced by this method re invrint under scle trnsformtions. As pointed out by Hung et l. [22], the invrince property under scle trnsformtions should be guiding principle in introducing roughness penlties to functionl PCA. Let α be nonnegtive smoothing prmeter. Silvermn defines the smoothed estimtors

27 12 {(ˆλ [α] j, ˆγ [α] ) : j N}of {(λ j, γ j ) : j N} to be the solutions of the following successive j optimiztion problems: First,ˆγ [α] 1 is the solution of the optimiztion problem mx γ =1 γ, ˆΓ n γ γ, γ + α[γ, γ] = mx γ =1 γ, ˆΓ n γ. (2.5) γ 2 α Let ˆγ [α] 1 be the mximum vlue of (2.5). For ny k N, if we hve obtined {ˆγ [α] j, j = 1, 2,, k 1} nd {ˆλ [α] j, j = 1, 2,, k 1},ˆγ [α] k is the solution of the optimiztion problem mx γ =1, γ,ˆγ [α] j α=0, 1 j k 1 γ, ˆΓ n γ γ 2 α (2.6) nd ˆλ [α] k is the mximum vlue of (2.6). Note tht {(ˆλ [α] j smple size n nd the smoothing prmeter α (Qi nd Zho [31])., ˆγ [α] ) : j N} depends on both the j 2.3 Functionl PCA for discretely observed functionl dt We consider two smple scenrios for smple functions observed t discrete points, regulr cse nd irregulr cse, respectively Regulr cse In this cse, we ssume tht the smple functions re observed t the sme set {t 1, t 2, t m } of discrete observtion points cross ll the subjects with mesurement errors, where m is the totl number of observtion points for ech smple function. After sorting the observtion points from the smllest to the lrgest, we get = t (1) < t (2) < t (m 1) < t (m) = b. Let us consider the following model: Y pq = X p (t (q) ) + ɛ pq, p = 1,, n, q = 1,, m, (2.7) where Y pq is the observtion of the smple function X p t point t (q) with mesurement error ɛ pq nd n is the totl number of smple curves. Our estimtes {ˆλ k, ˆγ k } k 1 of {λ k, γ k } k 1 re the solutions to the following successive optimiztion problems. The first pir of estimtes

28 13 {ˆλ 1, ˆγ 1 } re the mximum vlue nd the solution to the following optimiztion problem: mq=1 ml=1 ˆΣql γ(t (q) )γ(t (l) )w q w l mx, (2.8) γ W2 2([,b]), γ =1 γ 2 + α 1 [γ, γ] where α 1 > 0 is smoothing prmeter, ˆΣ ql = 1 n np=1 ( Ypq Ȳ q) ( Ypl Ȳ l), 1 q, l m, Ȳ q = 1 n (Y 1q + + Y nq ), nd ( t(2) t (1) ) /2 q = 1 w q = ( t(q+1) t (q 1) ) /2 1 < q < m ( t(m) t (m 1) ) /2 q = m. (2.9) The higher order estimtes {ˆλ k, ˆγ k }, k 2 re the solutions to the following optimiztion problems: mx γ = 1, γ, ˆγ j = 0, j = 1,, k 1 mq=1 ml=1 ˆΣql γ(t (q) )γ(t (l) )w q w l, (2.10) γ 2 + α k [γ, γ] where α k is the smoothing prmeter for the k-th estimtes nd the estimtes of eigenfunctions re orthogonl to ech other. We cn choose different smoothing prmeters for different principl components. The ide behind our method is s follows. The true eigenvlues nd eigenfunctions re the solutions to the following successive optimiztion problems: mx γ = 1, γ, γ j = 0, j = 1,, k 1 γ, Γγ γ 2. where Γγ is the function defined by (Γγ)(t) = b Γ(t, s)γ(s)ds. These optimiztion problems depend on the covrince function Γ only through the inner product γ, Γγ. Hence, we use the numertors in (4.95) nd (2.10) to pproximte γ, Γγ. However, if there re no penlty terms in the denomintors in (4.95) nd (2.10), the mximum vlues of (4.95) nd (2.10) re infinities. Since tuning α k does not ffect the first k 1 estimtes, we cn tune

29 14 the prmeters one by one. We give theorem on the existence nd chrcteriztion of solutions of the successive optimiztion problems (4.95) nd (2.10). Our methods solve the optimiztion problems in the Sobolev spce, hence mny powerful tools from the theory of Hilbert spce cn be used to study the symptotic consistency of our method. We give theorem on the existence nd chrcteriztion of solutions of the successive optimiztion problems (4.95) nd (2.10). Theorem The solutions {ˆλ k, ˆγ k : k 1} of the successive optimiztion problems (4.95) nd (2.10) exist for ny {α k > 0, k 1}. Moreover, for ech k, ˆγ k hs continuous second derivtives on [, b] nd, on ny subintervl {[t (q 1), t (q) ], 1 q m 1}, it cn be written s liner combintion of the following t most 4k functions, where 1 j k. exp t 1 sin t 1, exp t 1 cos t 1 2α 2α 2α 2α 4 j 4 j 4 j exp t 1 sin t 1, exp t 1 cos t 1, 2α 2α 2α 2α 4 j Hence, the first solution ˆγ 1 is similr to smoothing splines except tht the solutions to the optimiztion problems in smoothing spline methods re cubic polynomils between ny two djcent observtion points. 4 j 4 j 4 j 4 j Irregulr cse In this cse, we ssume the observtion time points re {t pq : p = 1,, n, q = 1,, N p }, where n is the number of smple curves nd N p is the number of the observtion points of the p-th smple function X p. The model is Y pq = X p (t pq ) + ɛ pq, p = 1,, n, q = 1,, N p, (2.11)

30 15 where Y pq is the observtion of the rndom function X p t time t pq nd ɛ pq is the mesurement error. For irregulr cse, we ssume tht the men function µ(t) is smooth nd the observtion points t pq re rndom vribles with density function h(t) which is bounded below wy from zero on [, b]. Our FPCA procedure for irregulr cse hs three steps. In the first step, we estimte the men function µ(t) bsed on the pooled dt from ll individuls by locl liner smoother. This step is the sme s the first step of the procedure in Yo et l. [54]. We define the estimte ˆµ(t) of µ(t) by solving the following optimiztion problem min,b N n p ( ) tpq t κ {Y pq b(t t pq )} 2, (2.12) p=1 q=1 η µ where κ is the kernel, nd η µ is the bndwidth. Let â(t) nd ˆb(t) be the minimizers, then ˆµ(t) = â(t). In the second step, we estimte the density function h(t) bsed on pooled observtion time points by the mximum penlized likelihood estimtion method (see Silvermn [40], Silvermn [42] nd Chpter 6 in Rmsy nd Silvermn [33]). Let ĝ(t) be the minimizer of the functionl 1 N n N p b g(t pq ) + e g(t) dt + η g [g, g], (2.13) p=1 q=1 where N = n p=1 N p nd η g is smoothing prmeter, then the estimte ĥ(t) = eĝ(t). Here we use the mximum penlized likelihood estimtion method insted of the kernel density estimtion method becuse the density estimte in this step will pper in the denomintors in the third step. Hence, the density estimte must be positive. In the mximum penlized likelihood estimtion, the log density is first estimted, then its exponentil is clculted s the density estimte. Hence, the mximum penlized likelihood density estimte is strictly positive. The third step is to solve the following successive optimiztion problems. The first pir of estimtes {ˆλ 1, ˆγ 1 } of {λ 1, γ 1 } re the mximum vlue nd the solution to the optimiztion

31 16 problem: mx γ W 2 2 ([, b]), γ = 1 1 n n 1 N p χ [Np>1] p=1 N p (N p 1) l q:1 γ 2 + α 1 [γ, γ] U (p) ql, (2.14) where α 1 > 0 is smoothing prmeter, χ [Np>1] is the indictor function of N p > 1, n = n p=1 χ [Np>1] is the totl number of the smple functions with t lest two observtion points nd U (p) ql = γ(t pq)(y pq ˆµ(t pq )) ĥ(t pq ) γ(t pl)(y pl ˆµ(t pl )). ĥ(t pl ) The higher order estimtes {ˆλ k, ˆγ k }, k 2 re the solutions to the following optimiztion problems: mx γ = 1, γ, ˆγ j = 0, j = 1,, k 1 1 n n 1 N p χ [Np>1] p=1 N p (N p 1) l q:1 γ 2 + α k [γ, γ] U (p) ql, (2.15) where α k is the positive smoothing prmeter for the k-th estimtes. Now we intuitively explin (2.14) nd (2.15). For ech 1 p n, if N p > 1, then is n pproximtion to the U-sttistic 1 N p (N p 1) N p l q:1 U (p) ql 1 N p (N p 1) N p l q:1 γ(t pq )(Y pq µ(t pq )) h(t pq ) γ(t pl)(y pl µ(t pl )). (2.16) h(t pl ) For different p s with N p > 1, (2.16) re independently nd identiclly distributed rndom vribles if we ssume tht N p is rndom vrible independent of t pq nd the rndom

32 17 function X. Therefore, by the lw of lrge numbers, the numertors in (2.14) nd (2.15) re pproximtion to γ, Γγ. We give similr theorem s Theorem for the irregulr cse. Theorem The solutions {(ˆλ k, ˆγ k ) : k 1} of the successive optimiztion problems (2.14) nd (2.15) exist for ny {α k > 0, k 1}. Moreover, for ech k, ˆγ k hs continuous second derivtives on [, b] nd on the subintervl between ny two djcent pooled observtion points, it cn be written s liner combintion of the following t most 4k functions, where 1 j k. exp t 1 sin t 1, exp t 1 cos t 1 2α 2α 2α 2α 4 j 4 j 4 j exp t 1 sin t 1, exp t 1 cos t 1, 2α 2α 2α 2α 4 j 4 j 4 j 4 j 4 j Computtionl issues Although Theorems nd give the forms of the solutions to the successive optimiztion problems in our FPCA procedure, it is not convenient to compute the exct solutions in prctice. Insted, we choose n pproprite bsis nd use the bsis expnsions to pproximte the solutions to the successive optimiztion problems s did [33] in Section 9.4. We develop similr lgorithms for computing the solutions to the successive optimiztion problems in our method s those in Section 9.4 of [33]. We first choose n pproprite bsis {φ ν } M ν=1, where M is the number of bsis functions. For exmple, we cn choose the Fourier series s our bsis for the periodic cse nd the B-spline bsis for the nonperiodic cse. Let γ k = M ν=1 c kν φ ν, k 1, be the solutions to (4.2) or (2.15) restricted to the liner spce spnned by the bsis functions. They re the pproximtions to {ˆγ k } k 1. The coefficients c k = (c k1,, c km ) T re solutions to the following successive optimiztion problems, mx c R M, c T Jc = 1 c T j Jc = 0, j = 1,, k 1 c T Vc c T Jc + α k c T Kc. (2.17)

33 18 J nd K re M M mtrices with elements J νν = b φ ν(t)φ ν (t)dt nd K νν = b φ ν(t)φ ν (t)dt, ν, ν = 1,, M, where φ ν is the second derivtive of φ ν. V is M M mtrix with elements in regulr cse nd V νν = m q=1 l=1 m ˆΣ ql φ ν (t (q) )φ ν (t (l) )w q w l, (2.18) V νν = 1 n in irregulr cse. n p=1 χ [Np>1] N p (N p 1) N p l q:1 φ ν (t pq )(Y pq ˆµ(t pq )) ĥ(t pq ) φν (t pl)(y pl ˆµ(t pl )), (2.19) ĥ(t pl ) The lgorithm for solving (2.17) is s follows: Perform Cholesky fctoriztion L T 1 L 1 = J + α 1 K nd clculte the inverse mtrix L 1 1 of L 1. Let B 1 = (L 1 1 ) T VL 1 1 nd compute the first eigenvector d 1 of B 1. Then c 1 = L 1 where r 1 is rel number chosen such tht c T 1 Jc 1 = 1. 1 d 1 r 1, For k > 1, suppose tht we hve obtined c 1, c k 1. Perform the Cholesky fctoriztion L T k L k = J + α k K nd clculte the inverse mtrix L 1 k of L k. Let C k 1 = [c 1, c k 1 ], tht is, C k 1 is n M (k 1) mtrix with the j-th column equl to c j. Perform QR-decomposition Q k R k = (L 1 k )T JC k 1, where Q k is M (k 1) mtrix with columns hve norm 1 nd orthogonl to ech other nd R k is n upper tringulr mtrix. Clculte the projection mtrix P k = I Q k Q T k onto the liner spce orthogonl to the liner spce spnned by the columns of (L 1 k )T JC k 1, where I is the identity mtrix of M dimension. Let B k = P k (L 1 k )T VL 1 k P k nd compute the first eigenvector d k of B k. Then c k = L 1 k d k r k, where r k is rel number chosen such tht c T k Jc k = 1.

34 Consistency We ssume throughout this section tht we wnt to estimtes the first K principl component curves, where K is ny fixed positive integer number. First, we consider the regulr model (2.7). For this model, we consider the following two cses for the distributions of t q : Cse 1 (Nonrndom Cse). {t q, 1 q m} re nonrndom. Define ( ) δ m = mx t(q) t (q 1). (2.20) 2 q m Cse 2 (Rndom Cse). {t q, 1 q m} re i.i.d. rndom vribles hving density functions h(t) in [, b] with respect to Lebesgue mesure nd re independent of the rndom functions X p, 1 p n. Furthermore, h(t) hs positive lower bound c. In order to give the consistency result for the regulr model (2.7), we need the following two more ssumptions: Assumption 4. The mesurement errors ɛ pq, 1 p n, 1 q m re independent rndom vribles nd re independent of the rndom functions X p, 1 p n nd the observtion times t q, 1 q m. For ech q, {ɛ 1q,, ɛ nq } hve the sme distribution with men 0 nd vrince σq. 2 Furthermore, sup σq 2 σ 2, q sup E ɛ ql 3 ρ, q,l where σ nd ρ re some positive numbers nd do not depend on m. Remrk We do not ssume tht ll the mesurement errors hve the sme distributions. Insted we only ssume tht the errors rising t the sme observtion time hve the sme distribution, which is more generl thn the former. Assumption 5. The covrince function Γ(s, t) is continuous function in [, b] [, b]. Define function ϖ(δ) = sup [Γ(t, t) 2Γ(s, t) + Γ(s, s)], (2.21) s,t [,b], s t δ

35 20 where 0 < δ b. Note tht Γ(t, t) 2Γ(s, t) + Γ(s, s) = E [ ((X(s) µ(s)) (X(t) µ(t))) 2]. Under Assumption 5, we hve lim δ 0 ϖ(δ) = 0 nd Γ is bounded. If G is smooth, then ϖ(δ) = O(δ). Although the covrince functions of Brownin motion nd Poisson process with rte 1 re not smooth, for both of them, we hve E [ ((X(s) µ(s)) (X(t) µ(t))) 2] = t s, nd therefore, ϖ(δ) = δ Theorem Under Assumptions 1 5, suppose tht m, n, mx 1 k K α k 0 nd If the following is stisfied tht for Cse 1, mx 1 k K α k min 1 k K α k = O p (1). (2.22) 1 δm ϖ(δ m ) + δ m + 0 min 1 k K α k n nd for Cse 2, 1 min 1 k K α k ϖ( 3 log m cm ) + log m m + log m 0, nm then the estimtors {(ˆλ k, ˆγ k ) : 1 k K} re consistent. Second, we consider the irregulr model (4.130). For this model, we mke the following ssumptions on the number of observtion points, mesurement errors, men functions nd density functions. They re ctully prts of ssumptions in Yo et l. [54] nd Hll et l. [20]. Assumption 6. The numbers of the observtion points N p, 1 p n, re i.i.d rndom vribles tking positive integer vlues with EN p < nd P (N p > 1) > 0. The mesurement

36 21 errors ɛ pq, 1 p n, 1 q m re i.i.d rndom vribles with men zero nd finite vrince. The rndom functions, the observtion points, the number of the observtion points nd the mesurement errors re independent. Assumption 7. Both the men function nd the density function hve squre integrble second derivtives, tht is, µ(t), h(t) W2 2 ([, b]). The kernel κ in (4.91) is compctly supported, symmetric nd Hölder continuous. The smoothing prmeter η µ in (4.91) stisfies n ρ ηµ = o(1), η µ = o(n 1 4 ), where ρ 1 > 0 is some constnt. There re two positive constnts c < C such tht c h(t) C t b nd the smoothing prmeter η g stisfies η g 0, n 1 ρ 2 η g, where ρ 2 > 0 is some constnt. Now we present the consistency result for the irregulr model. Theorem Under Assumptions 1 3 nd 6 7, suppose tht n, mx 1 k K α k 0 nd If the following is stisfied mx 1 k K α k min 1 k K α k = O p (1). [ ] 1 n 1 2 (η 1 + η 1 2 ɛ g ) + η 3 4 ɛ g 0, min 1 k K α k for some ɛ > 0, then the estimtors {(ˆλ k, ˆγ k ) : 1 k K} re consistent Extensions to FPCA for functionl dt with multidimensionl rguments Functionl dt with multidimensionl rguments re collected in growing number of fields. For exmple, in sptil dt nlysis, dt re collected from different plces

37 22 nd t different times. Such dt cn be view s discretely observed functionl dt which re functions of both spce nd time. Anlysis of such dt is considered n importnt direction of functionl dt nlysis (see Section 22.2 in Rmsy nd Silvermn [33] nd Section 5 in Müller [28]). Our method cn be esily extended in this context by defining similr successive optimiztion problems in multidimensionl spces. The numertors in the successive optimiztion problems (2.14) nd (2.15) cn be strightforwrdly extended to the multidimensionl cse nd the penlty terms in the denomintors cn be replced with J d m(γ) = j 1 + +j d =m m! j 1! j d! Ω m γ t j 1 1 t j d j 2 dt 1 dt d where d is the dimension of the spce of the rguments, Ω R d is the region in which the function is defined nd we ssume tht the eigenfunctions hve squre integrble m-th derivtives. Our method voids the estimtes of covrince functions which hve 2d rguments nd re not very esy to estimte when d Simultion studies To illustrte the performnce of our method, we conduct three simultion studies. In the first study, the smple curves re smooth with both eqully nd uneqully spced observtion time points, nd we will compre our method with n lterntive method (Method II) which first obtins the smooth estimte of men curve nd covrince functions, nd then compute the eigenfunctions of the smoothed covrince function s the estimtions of the PC curves. We use the softwre pckge PACE for the second method, which ws developed by Yo et l, nd downloded the softwre from In the second study, the smple curves re simulted with 3 true principle curves nd we will compre our mthod with Method II. In the third study, we simulte rndom surfces nd perform FPCA in two-dimensionl spce with our method.

38 Smooth rndom curves with 2 PC curves We first simulte 200 curves from the following rndom curve on [0, 1], X(t) = 2 sin(2πu) sin( πt ) + cos(2πu) sin(3πt 2 2 ), where U is rndom vrible with uniform distribution on [0, 1]. The covrince function of X(t) hs two nonzero eigenvlues nd the corresponding eigenfunctions re 2 sin( πt) nd 2 2 sin( 3πt). Figure 2.1 shows the plot of the first two principl component curves. 2 The first two principl component curves γ(t) The 1st PC curves The 2nd PC curves t Figure 2.1 The First Two Principl Component Curves

39 24 one smple curve with discrete nd noise observtions X(t) o one smple curve 101 noise observtion points t 200 smple curves without noise 200 discretely observed smple curve with noise X(t) X(t) t t Figure 2.2 Simulted Smple Curves for the simultions in Section For discrete observtions, we will consider two cses. Regulr cse The observed dt re generted from the following model Y pq = X p (t q ) + ɛ pq, t q = q 1, q = 1, n, p = 1,,

40 25 where the mesurement errors ɛ pq N(0, 3), the observtion points {t pq } re eqully spced on [0, 1]. We consider n = 101, 51 nd 21, tht is, we smple different number of observtion points: 101 eqully spced with mesurement errors; 51 eqully spced with mesurement errors; 21 eqully spced with mesurement errors. Plots of the simulted smple curves re shown in 2.2. We will estimte the first two eigenfunctions by our method nd Method II respectively. We conducted 200 simultions nd in ech simultion, 200 observtions re generted s trining smple nd 500 observtions s test smple. For our method, use the usul crossvlidtion procedure to select the smoothing prmeter α from {1e 10, 1e 09, 1e 08, 1e 07, 1e 06, 1e 05, 1e 04, 1e 03, 1e 02, 1e 01}, such tht the totl vrince ccounted for by ll the principl components on the test dt is mximized. We obtin the smoothing prmeter α = 1e 04. The prmeters for Method II cn be chosen by generlized crossvlidtion (GCV) method. Tble 2.1 lists the cumultive vrince of selected principl component scores. Under different settings, the first two estimted principle component curves obtined by our method explin lrger totl vrition in the dt. Tble 2.1 The verges nd stndrd devitions of cumultive vrince of selected principl component scores for the simultions in Section 2.4.1: Regulr Cse. For ech smpling strtegy shown in column 1, the first row is the verge nd stndrd devition of the first estimted PC score vrince; the second row is for the second esitmted PC score vrince. Our Method Method II (PACE) Selected PCs Vr.PC.Score Vr.PC.Score Vr.PC.Score Vr.PC.Score (Men) (Vrince) (Men) (Vrince) 101 Eqully spced 1st PC nd PC Eqully spced 1st PC nd PC Eqully spced 1st PC nd PC

41 26 Irregulr cse In this cse, the observed dt re generted from the following model Y pq = X p (t q ) + ɛ pq, q = 1, n, p = 1,, 200. where the mesurement errors ɛ pq N(0, 3), the observtion points {t pq } re i.i.d rndom vribles from Uniform[0, 1]. Tht is, we smple 200 curves nd mke different number of observtion points: 101 uneqully spced with mesurement errors; 51 uneqully spced with mesurement errors; 21 uneqully spced with mesurement errors. We use the principl component curves estimted from both methods to pproximte the true principl component curves. We conducted 200 simultions. Similrly to the regulr cse, we use the usul cross-vlidtion procedure to obtin the prmeters for our method such tht the totl vrince ccounted for by ll the principl components on the test dt is mximized. We obtin the smoothing prmeter α = 1e 04 nd the cumultive vrince of selected PC scores re listed in Tble 2.2. Under different settings, the estimted PC curves obtined by our method explin lrger totl vrition in the dt. Tble 2.2 The verges nd stndrd devitions of cumultive vrince of selected PC scores for the simultions in Section 2.4.1: Irregulr Cse. For ech smpling strtegy shown in column 1, the first row is the verge nd stndrd devition of the first estimted PC score vrince; the second row is for the second esitmted PC score vrince. Our Method Method II (PACE) Selected PCs Vr.PC.Score Vr.PC.Score Vr.PC.Score Vr.PC.Score (Avg.) (Std.) (Avg.) (Std.) 101 Uneqully spced 1st PC nd PC Uneqully spced 1st PC nd PC Uneqully spced 1st PC nd PC

42 Smooth rndom curves with 3 PC curves We consider the following rndom curve on [0, 1], X(t) = 3β 1 sin( πt 2 ) + 2β 2 sin( 3πt 2 ) + β 3 sin( 5πt 2 ), where β i, i = 1, 2, 3 re rndom vribles with norml distribution on [0, 1]. The covrince function of X(t) hs three nonzero eigenvlues nd the corresponding eigenfunctions re 2 sin( πt ), 2 sin( 3πt) nd 2 sin( 5πt ). Figure 2.3 shows the plot of the first three principl component curves. The first three principl component curves γ(t) The 1st PC curves The 2nd PC curves The 3rd PC curves t Figure 2.3 The First Three Principl Component Curves

43 28 one smple curve with discrete nd noise observtions X(t) t 200 smple curves without noise 200 discretely observed smple curve with noise X(t) X(t) t t Figure 2.4 Simulted Smple Curves for the simultions in Section 2.4.2

44 29 Similr to the previous simultion, we will consider two cses. Regulr cse The observed dt re generted from the following model Y pq = X p (t q ) + ɛ pq, t q = q 1, q = 1, n, p = 1,, where the mesurement errors ɛ pq N(0, 3), the observtion points {t pq } re eqully spced on [0, 1]. Plots of the simulted smple curves from one simultion re shown in 2.4. We conducted 300 simultions with different number of observtion points smpled: 101 eqully spced mesurements; 51 eqully spced mesurements; 21 eqully spced mesurements. In ech simultion,200 observtions re generted s the trining smple nd 500 observtions s the test smple. For our method, we use the usul cross-vlidtion to obtin the smoothing prmeter from {1e 10, 1e 09, 1e 08, 1e 07, 1e 06, 1e 05, 1e 04, 1e 03, 1e 02, 1e 01}, such tht the totl vrince ccounted for by ll the principl components on the test dt is mximized. The smoothing prmeter chosen under different smpling strtegies re listed in 2.3. We use the principl component curves estimted from both methods to pproximte the true principl component curves. The cumultive vrince of selected PC scores re listed in Tble 2.4. We cn see tht the first three estimted principle component curves obtined by our method cccount for lrger vrition in the dt. Tble 2.3 Selected smoothing prmeter with the usul cross-vlidtion procedure for the simultions in Section 2.4.2: Regulr Cse Smoothing Prmeter 101 Eqully Spced 51 Eqully Spced 21 Eqully Spced α 1e-09 1e-06 1e-04 Irregulr cse In this cse, the observted dt re generted from the following model Y pq = X p (t q ) + ɛ pq, q = 1, n, p = 1,, 200. where the mesurement errors ɛ pq N(0, 3), the observtion points {t pq } re i.i.d rndom vribles from Uniform[0, 1]. We use the principl component curves estimted from both

45 30 Tble 2.4 The verges nd stndrd devitions of cumultive vrince of selected PC scores for the simultions in Section 2.4.2: Regulr Cse. For ech smpling strtegy shown in column 1, the first row is the verge nd stndrd devition of the first estimted PC score vrince; the second row is for the second esitmted PC score vrince. Our Method Method II (PACE) Selected PCs Vr.PC.Score Vr.PC.Score Vr.PC.Score Vr.PC.Score (Avg.) (Std.) (Avg.) (Std.) 101 Eqully spced 1st PC nd PC Eqully spced 1st PC nd PC Eqully spced 1st PC nd PC rd PC methods to pproximte the true principl component curves with our method nd Method II. Similrly to the regulr cse, we conducted 300 simultions with different number of observtion points smpled: 101 uneqully spced mesurements; 51 uneqully spced mesurements; 21 uneqully spced mesurements. We use the usul cross-vlidtion procedure to obtin the prmeters from for our method, such tht the totl vrince ccounted for by ll the principl components on the test dt is mximized. The smoothing prmeter chosen under different smpling strtegies re listed in 2.5 nd the cumultive vrince of selected PC scores re listed in Tble 2.6. Tble 2.5 Selected smoothing prmeter with the usul cross-vlidtion procedure for the simultions in Section 2.4.2: Irregulr Cse 101 Unqully Spced 51 Unqully Spced 21 Unqully Spced α 1e-06 1e-04 1e-03

46 31 Tble 2.6 The verges nd stndrd devitions of cumultive vrince of selected PC scores for the simultions in Section 2.4.2: Irregulr Cse. For ech smpling strtegy shown in column 1, the first row is the verge nd stndrd devition of the first estimted PC score vrince; the second row is for the second esitmted PC score vrince. Our Method Method II (PACE) Selected PCs Vr.PC.Score Vr.PC.Score Vr.PC.Score Vr.PC.Score (Avg.) (Std.) (Avg.) (Std.) 101 Uneqully spced 1st PC nd PC rd PC Uneqully spced 1st PC nd PC rd PC Uneqully spced 1st PC nd PC rd PC Rndom surfce We smple 200 surfces from the distribution of ( ) ( ) π(s t) π(s + t) X(s, t) = 1 + e s cos (t) + (s 1) 2 t + ξ 1 sin + ξ 2 sin, 2 2 where 0 s, t 1, ξ 1 N(0, 2), ξ 2 N(0, 1). The summtion of the first three terms is the men function. The covrince function of X(s, t) hs two nonzero eigenvlues with eigenfunctions 1 (1 2 ( ) π(s t) 2 π ) sin, 2 1 (1 + 2 ( ) π(s + t) 2 π ) sin, 0 s, t 1. 2 For ech smpled surfce, we mke 10 to 30 observtions (irregulr cse) from distribution with truncted bivrite norml density with men (0.4, 0.6) nd covrince mtrix I restricted to the region [0, 1] [0, 1]. The number of the observtions for ech surfce is rndom vrible with discrete uniform distribution on {10, 11,, 30}. The

47 32 mesurement error ɛ N(0, ). The eigenfunctions nd their estimtes in one simultion re plotted in Figure 2.5. The smllest MISE of our method re nd for the first two eigenfunctions respectively f(s,t) 0 f(s,t) s t s t f(s,t) 0 f(s,t) t s t s Figure 2.5 Eigenfunctions nd their estimtes in one simultion: The top left is the first true eigenfunction; the top right is the estimte of the first eigenfunction; the bottom left is the second eigenfunction; the bottom right is the estimte of the second eigenfunction.

48 Applictions Retinl pigment epithelium (RPE) dt The retinl pigment epithelium (RPE) is the pigmented cell lyer between the choroid nd the photoreceptor cell lyer of eye. RPE is essentil for visul function (see Struss Struss [45]). It provides multiple functions tht support norml photoreceptor function, such s shielding the retin from excess incoming light, trnsporting wter, nutrients nd metbolic end products between the subretinl spce nd the blood, s well s secreting vriety of growth fctors nd signling molecules (Zinn nd Mrmor Zinn nd Mrmor [55]). RPE is key site of pthogenesis of ge-relted mculr degenertion (AMD) which is min source of vision loss even blindness in the elderly (Spide nd et l. Spide nd et l. [43]). The dt is the collection of imges of RPE cells of 88 mouse eyes provided in Emory Eye Center s L. F. Montgomery Lb t Emory University (Jing nd et l. Jing nd et l. [24]). The purpose of the study is to exmine the reltionship between the morphology of RPE lyer nd the ge nd disese sttus of the eye. Specificlly, it is desirble to construct clssifiction rule bsed on the dt so tht the morphology of RPE of the eyes with different genotypes nd in different ge groups cn be seprted. There re two genotypes: wild nd mutted, nd two ge groups: young (ge 60 dys) nd elderly (ge > 60) groups in the dt. Hence, we hve four clsses (tht is, four combintions of genotypes nd ge groups). In ech imge, there re severl thousnds of cells. Severl chrcteristics of ech cell were mesured including re, perimeter, spect rtio, nd so on. Locl regions of two imges with different genotypes, but hving the sme ge equl to 60 dys, re shown in Figure 2.6 (Chrenek nd et l. Chrenek nd et l. [10]). It cn be seen tht the distributions of the re nd the shpe of cells in the two imges re quite different. Hence, we use the distributions of the re nd the spect rtio ( mesure of shpes) of cells s clssifiers respectively. The density curves of the re nd the spect rtio for ech eye re estimted using the penlized likelihood method (see Section in Rmsy, Hooker nd Grves Rmsy et l. [34]), respectively, nd the principl component scores re clculted nd used to construct the clssifiction rules. The eyes in different ge groups cn be seprted using the distribution of the re of cells which cnnot distinguish the eyes with different genotypes in the sme

49 34 Figure 2.6 Locl regions of two imges with different genotypes, but sme ge equl to 60 dys: (Left) RPE cells of the wild type nd ge 60 dys; (Right) RPE cells of the mutted type nd ge 60 dys. ge groups. Conversely, the distribution of the spect rtio of cells cn seprte the eyes with different genotypes, but cnnot distinguish those with the sme genotypes in different ge groups. Hence, we will combine the informtion of the re nd the spect rtio of cells together nd pply our method to the joint density functions. The dt contins 88 imges of mouse eyes. 27 re in the young ge group with the wild genotype, 13 re in the elderly ge group with the wild genotype, 27 re in the young ge group with the mutted genotype nd 21 re in the elderly ge group with the mutted genotype. We first estimte the joint density function of the re nd the spect rtio of cells in ech imge using the kernel method (see Section 5.6 in Venbles nd Ripley [49]). The vlues of the density functions re clculted on grid of eqully spced points in the two-dimensionl spce (the re of cells re distributed between 0 nd 730 µm 2 nd the spect rtio between 0 nd 1). The men joint densities of the four ctegories re plotted in Figure 2.7 which indictes the joint density curve is good clssifier of the genotype nd the ge group. We pply our method to the 88 joint density functions. Most of vritions in the dt re ccounted for by the first four principl components which re plotted in Figure 2.8. Then we clculte four PC scores for ech eye imge, hence ll the PC scores form 88 4 mtrix which is used to construct clssifiction rules. We pply three clssifiction methods, LDA (liner discriminnt nlysis), QDA (qudrtic discriminnt nlysis) nd SVM (support vector mchine), to the mtrix. Leve-one-out cross vlidtion is used to

Review of Calculus, cont d

Review of Calculus, cont d Jim Lmbers MAT 460 Fll Semester 2009-10 Lecture 3 Notes These notes correspond to Section 1.1 in the text. Review of Clculus, cont d Riemnn Sums nd the Definite Integrl There re mny cses in which some

More information

Math 270A: Numerical Linear Algebra

Math 270A: Numerical Linear Algebra Mth 70A: Numericl Liner Algebr Instructor: Michel Holst Fll Qurter 014 Homework Assignment #3 Due Give to TA t lest few dys before finl if you wnt feedbck. Exercise 3.1. (The Bsic Liner Method for Liner

More information

MATH34032: Green s Functions, Integral Equations and the Calculus of Variations 1

MATH34032: Green s Functions, Integral Equations and the Calculus of Variations 1 MATH34032: Green s Functions, Integrl Equtions nd the Clculus of Vritions 1 Section 1 Function spces nd opertors Here we gives some brief detils nd definitions, prticulrly relting to opertors. For further

More information

Continuous Random Variables

Continuous Random Variables STAT/MATH 395 A - PROBABILITY II UW Winter Qurter 217 Néhémy Lim Continuous Rndom Vribles Nottion. The indictor function of set S is rel-vlued function defined by : { 1 if x S 1 S (x) if x S Suppose tht

More information

8 Laplace s Method and Local Limit Theorems

8 Laplace s Method and Local Limit Theorems 8 Lplce s Method nd Locl Limit Theorems 8. Fourier Anlysis in Higher DImensions Most of the theorems of Fourier nlysis tht we hve proved hve nturl generliztions to higher dimensions, nd these cn be proved

More information

Chapter 5 : Continuous Random Variables

Chapter 5 : Continuous Random Variables STAT/MATH 395 A - PROBABILITY II UW Winter Qurter 216 Néhémy Lim Chpter 5 : Continuous Rndom Vribles Nottions. N {, 1, 2,...}, set of nturl numbers (i.e. ll nonnegtive integers); N {1, 2,...}, set of ll

More information

NUMERICAL INTEGRATION. The inverse process to differentiation in calculus is integration. Mathematically, integration is represented by.

NUMERICAL INTEGRATION. The inverse process to differentiation in calculus is integration. Mathematically, integration is represented by. NUMERICAL INTEGRATION 1 Introduction The inverse process to differentition in clculus is integrtion. Mthemticlly, integrtion is represented by f(x) dx which stnds for the integrl of the function f(x) with

More information

New Expansion and Infinite Series

New Expansion and Infinite Series Interntionl Mthemticl Forum, Vol. 9, 204, no. 22, 06-073 HIKARI Ltd, www.m-hikri.com http://dx.doi.org/0.2988/imf.204.4502 New Expnsion nd Infinite Series Diyun Zhng College of Computer Nnjing University

More information

Best Approximation. Chapter The General Case

Best Approximation. Chapter The General Case Chpter 4 Best Approximtion 4.1 The Generl Cse In the previous chpter, we hve seen how n interpolting polynomil cn be used s n pproximtion to given function. We now wnt to find the best pproximtion to given

More information

The steps of the hypothesis test

The steps of the hypothesis test ttisticl Methods I (EXT 7005) Pge 78 Mosquito species Time of dy A B C Mid morning 0.0088 5.4900 5.5000 Mid Afternoon.3400 0.0300 0.8700 Dusk 0.600 5.400 3.000 The Chi squre test sttistic is the sum of

More information

Orthogonal Polynomials and Least-Squares Approximations to Functions

Orthogonal Polynomials and Least-Squares Approximations to Functions Chpter Orthogonl Polynomils nd Lest-Squres Approximtions to Functions **4/5/3 ET. Discrete Lest-Squres Approximtions Given set of dt points (x,y ), (x,y ),..., (x m,y m ), norml nd useful prctice in mny

More information

Tests for the Ratio of Two Poisson Rates

Tests for the Ratio of Two Poisson Rates Chpter 437 Tests for the Rtio of Two Poisson Rtes Introduction The Poisson probbility lw gives the probbility distribution of the number of events occurring in specified intervl of time or spce. The Poisson

More information

CMDA 4604: Intermediate Topics in Mathematical Modeling Lecture 19: Interpolation and Quadrature

CMDA 4604: Intermediate Topics in Mathematical Modeling Lecture 19: Interpolation and Quadrature CMDA 4604: Intermedite Topics in Mthemticl Modeling Lecture 19: Interpoltion nd Qudrture In this lecture we mke brief diversion into the res of interpoltion nd qudrture. Given function f C[, b], we sy

More information

1 The Lagrange interpolation formula

1 The Lagrange interpolation formula Notes on Qudrture 1 The Lgrnge interpoltion formul We briefly recll the Lgrnge interpoltion formul. The strting point is collection of N + 1 rel points (x 0, y 0 ), (x 1, y 1 ),..., (x N, y N ), with x

More information

Numerical integration

Numerical integration 2 Numericl integrtion This is pge i Printer: Opque this 2. Introduction Numericl integrtion is problem tht is prt of mny problems in the economics nd econometrics literture. The orgniztion of this chpter

More information

Numerical Analysis: Trapezoidal and Simpson s Rule

Numerical Analysis: Trapezoidal and Simpson s Rule nd Simpson s Mthemticl question we re interested in numericlly nswering How to we evlute I = f (x) dx? Clculus tells us tht if F(x) is the ntiderivtive of function f (x) on the intervl [, b], then I =

More information

Discrete Least-squares Approximations

Discrete Least-squares Approximations Discrete Lest-squres Approximtions Given set of dt points (x, y ), (x, y ),, (x m, y m ), norml nd useful prctice in mny pplictions in sttistics, engineering nd other pplied sciences is to construct curve

More information

Theoretical foundations of Gaussian quadrature

Theoretical foundations of Gaussian quadrature Theoreticl foundtions of Gussin qudrture 1 Inner product vector spce Definition 1. A vector spce (or liner spce) is set V = {u, v, w,...} in which the following two opertions re defined: (A) Addition of

More information

Chapter 4 Contravariance, Covariance, and Spacetime Diagrams

Chapter 4 Contravariance, Covariance, and Spacetime Diagrams Chpter 4 Contrvrince, Covrince, nd Spcetime Digrms 4. The Components of Vector in Skewed Coordintes We hve seen in Chpter 3; figure 3.9, tht in order to show inertil motion tht is consistent with the Lorentz

More information

Estimation of Binomial Distribution in the Light of Future Data

Estimation of Binomial Distribution in the Light of Future Data British Journl of Mthemtics & Computer Science 102: 1-7, 2015, Article no.bjmcs.19191 ISSN: 2231-0851 SCIENCEDOMAIN interntionl www.sciencedomin.org Estimtion of Binomil Distribution in the Light of Future

More information

Monte Carlo method in solving numerical integration and differential equation

Monte Carlo method in solving numerical integration and differential equation Monte Crlo method in solving numericl integrtion nd differentil eqution Ye Jin Chemistry Deprtment Duke University yj66@duke.edu Abstrct: Monte Crlo method is commonly used in rel physics problem. The

More information

Data Assimilation. Alan O Neill Data Assimilation Research Centre University of Reading

Data Assimilation. Alan O Neill Data Assimilation Research Centre University of Reading Dt Assimiltion Aln O Neill Dt Assimiltion Reserch Centre University of Reding Contents Motivtion Univrite sclr dt ssimiltion Multivrite vector dt ssimiltion Optiml Interpoltion BLUE 3d-Vritionl Method

More information

The Regulated and Riemann Integrals

The Regulated and Riemann Integrals Chpter 1 The Regulted nd Riemnn Integrls 1.1 Introduction We will consider severl different pproches to defining the definite integrl f(x) dx of function f(x). These definitions will ll ssign the sme vlue

More information

Advanced Calculus: MATH 410 Notes on Integrals and Integrability Professor David Levermore 17 October 2004

Advanced Calculus: MATH 410 Notes on Integrals and Integrability Professor David Levermore 17 October 2004 Advnced Clculus: MATH 410 Notes on Integrls nd Integrbility Professor Dvid Levermore 17 October 2004 1. Definite Integrls In this section we revisit the definite integrl tht you were introduced to when

More information

Estimation on Monotone Partial Functional Linear Regression

Estimation on Monotone Partial Functional Linear Regression A^VÇÚO 1 33 ò 1 4 Ï 217 c 8 Chinese Journl of Applied Probbility nd Sttistics Aug., 217, Vol. 33, No. 4, pp. 433-44 doi: 1.3969/j.issn.11-4268.217.4.8 Estimtion on Monotone Prtil Functionl Liner Regression

More information

Student Activity 3: Single Factor ANOVA

Student Activity 3: Single Factor ANOVA MATH 40 Student Activity 3: Single Fctor ANOVA Some Bsic Concepts In designed experiment, two or more tretments, or combintions of tretments, is pplied to experimentl units The number of tretments, whether

More information

5.7 Improper Integrals

5.7 Improper Integrals 458 pplictions of definite integrls 5.7 Improper Integrls In Section 5.4, we computed the work required to lift pylod of mss m from the surfce of moon of mss nd rdius R to height H bove the surfce of the

More information

MATH 144: Business Calculus Final Review

MATH 144: Business Calculus Final Review MATH 144: Business Clculus Finl Review 1 Skills 1. Clculte severl limits. 2. Find verticl nd horizontl symptotes for given rtionl function. 3. Clculte derivtive by definition. 4. Clculte severl derivtives

More information

Numerical Integration

Numerical Integration Chpter 5 Numericl Integrtion Numericl integrtion is the study of how the numericl vlue of n integrl cn be found. Methods of function pproximtion discussed in Chpter??, i.e., function pproximtion vi the

More information

Math 1B, lecture 4: Error bounds for numerical methods

Math 1B, lecture 4: Error bounds for numerical methods Mth B, lecture 4: Error bounds for numericl methods Nthn Pflueger 4 September 0 Introduction The five numericl methods descried in the previous lecture ll operte by the sme principle: they pproximte the

More information

Lecture 14: Quadrature

Lecture 14: Quadrature Lecture 14: Qudrture This lecture is concerned with the evlution of integrls fx)dx 1) over finite intervl [, b] The integrnd fx) is ssumed to be rel-vlues nd smooth The pproximtion of n integrl by numericl

More information

THE EXISTENCE-UNIQUENESS THEOREM FOR FIRST-ORDER DIFFERENTIAL EQUATIONS.

THE EXISTENCE-UNIQUENESS THEOREM FOR FIRST-ORDER DIFFERENTIAL EQUATIONS. THE EXISTENCE-UNIQUENESS THEOREM FOR FIRST-ORDER DIFFERENTIAL EQUATIONS RADON ROSBOROUGH https://intuitiveexplntionscom/picrd-lindelof-theorem/ This document is proof of the existence-uniqueness theorem

More information

Best Approximation in the 2-norm

Best Approximation in the 2-norm Jim Lmbers MAT 77 Fll Semester 1-11 Lecture 1 Notes These notes correspond to Sections 9. nd 9.3 in the text. Best Approximtion in the -norm Suppose tht we wish to obtin function f n (x) tht is liner combintion

More information

Calculus of Variations

Calculus of Variations Clculus of Vritions Com S 477/577 Notes) Yn-Bin Ji Dec 4, 2017 1 Introduction A functionl ssigns rel number to ech function or curve) in some clss. One might sy tht functionl is function of nother function

More information

Approximation of functions belonging to the class L p (ω) β by linear operators

Approximation of functions belonging to the class L p (ω) β by linear operators ACTA ET COMMENTATIONES UNIVERSITATIS TARTUENSIS DE MATHEMATICA Volume 3, 9, Approximtion of functions belonging to the clss L p ω) β by liner opertors W lodzimierz Lenski nd Bogdn Szl Abstrct. We prove

More information

1.9 C 2 inner variations

1.9 C 2 inner variations 46 CHAPTER 1. INDIRECT METHODS 1.9 C 2 inner vritions So fr, we hve restricted ttention to liner vritions. These re vritions of the form vx; ǫ = ux + ǫφx where φ is in some liner perturbtion clss P, for

More information

1B40 Practical Skills

1B40 Practical Skills B40 Prcticl Skills Comining uncertinties from severl quntities error propgtion We usully encounter situtions where the result of n experiment is given in terms of two (or more) quntities. We then need

More information

Variational Techniques for Sturm-Liouville Eigenvalue Problems

Variational Techniques for Sturm-Liouville Eigenvalue Problems Vritionl Techniques for Sturm-Liouville Eigenvlue Problems Vlerie Cormni Deprtment of Mthemtics nd Sttistics University of Nebrsk, Lincoln Lincoln, NE 68588 Emil: vcormni@mth.unl.edu Rolf Ryhm Deprtment

More information

Physics 116C Solution of inhomogeneous ordinary differential equations using Green s functions

Physics 116C Solution of inhomogeneous ordinary differential equations using Green s functions Physics 6C Solution of inhomogeneous ordinry differentil equtions using Green s functions Peter Young November 5, 29 Homogeneous Equtions We hve studied, especilly in long HW problem, second order liner

More information

Predict Global Earth Temperature using Linier Regression

Predict Global Earth Temperature using Linier Regression Predict Globl Erth Temperture using Linier Regression Edwin Swndi Sijbt (23516012) Progrm Studi Mgister Informtik Sekolh Teknik Elektro dn Informtik ITB Jl. Gnesh 10 Bndung 40132, Indonesi 23516012@std.stei.itb.c.id

More information

Lecture 19: Continuous Least Squares Approximation

Lecture 19: Continuous Least Squares Approximation Lecture 19: Continuous Lest Squres Approximtion 33 Continuous lest squres pproximtion We begn 31 with the problem of pproximting some f C[, b] with polynomil p P n t the discrete points x, x 1,, x m for

More information

STURM-LIOUVILLE BOUNDARY VALUE PROBLEMS

STURM-LIOUVILLE BOUNDARY VALUE PROBLEMS STURM-LIOUVILLE BOUNDARY VALUE PROBLEMS Throughout, we let [, b] be bounded intervl in R. C 2 ([, b]) denotes the spce of functions with derivtives of second order continuous up to the endpoints. Cc 2

More information

Self-similarity and symmetries of Pascal s triangles and simplices mod p

Self-similarity and symmetries of Pascal s triangles and simplices mod p Sn Jose Stte University SJSU ScholrWorks Fculty Publictions Mthemtics nd Sttistics Februry 2004 Self-similrity nd symmetries of Pscl s tringles nd simplices mod p Richrd P. Kubelk Sn Jose Stte University,

More information

Research Article Moment Inequalities and Complete Moment Convergence

Research Article Moment Inequalities and Complete Moment Convergence Hindwi Publishing Corportion Journl of Inequlities nd Applictions Volume 2009, Article ID 271265, 14 pges doi:10.1155/2009/271265 Reserch Article Moment Inequlities nd Complete Moment Convergence Soo Hk

More information

PENALIZED LEAST SQUARES FITTING. Manfred von Golitschek and Larry L. Schumaker

PENALIZED LEAST SQUARES FITTING. Manfred von Golitschek and Larry L. Schumaker Serdic Mth. J. 28 2002), 329-348 PENALIZED LEAST SQUARES FITTING Mnfred von Golitschek nd Lrry L. Schumker Communicted by P. P. Petrushev Dedicted to the memory of our collegue Vsil Popov Jnury 4, 942

More information

Integral points on the rational curve

Integral points on the rational curve Integrl points on the rtionl curve y x bx c x ;, b, c integers. Konstntine Zeltor Mthemtics University of Wisconsin - Mrinette 750 W. Byshore Street Mrinette, WI 5443-453 Also: Konstntine Zeltor P.O. Box

More information

Unit #9 : Definite Integral Properties; Fundamental Theorem of Calculus

Unit #9 : Definite Integral Properties; Fundamental Theorem of Calculus Unit #9 : Definite Integrl Properties; Fundmentl Theorem of Clculus Gols: Identify properties of definite integrls Define odd nd even functions, nd reltionship to integrl vlues Introduce the Fundmentl

More information

Numerical Methods I Orthogonal Polynomials

Numerical Methods I Orthogonal Polynomials Numericl Methods I Orthogonl Polynomils Aleksndr Donev Cournt Institute, NYU 1 donev@cournt.nyu.edu 1 MATH-GA 2011.003 / CSCI-GA 2945.003, Fll 2014 Nov 6th, 2014 A. Donev (Cournt Institute) Lecture IX

More information

Review of basic calculus

Review of basic calculus Review of bsic clculus This brief review reclls some of the most importnt concepts, definitions, nd theorems from bsic clculus. It is not intended to tech bsic clculus from scrtch. If ny of the items below

More information

Non-Linear & Logistic Regression

Non-Linear & Logistic Regression Non-Liner & Logistic Regression If the sttistics re boring, then you've got the wrong numbers. Edwrd R. Tufte (Sttistics Professor, Yle University) Regression Anlyses When do we use these? PART 1: find

More information

MAC-solutions of the nonexistent solutions of mathematical physics

MAC-solutions of the nonexistent solutions of mathematical physics Proceedings of the 4th WSEAS Interntionl Conference on Finite Differences - Finite Elements - Finite Volumes - Boundry Elements MAC-solutions of the nonexistent solutions of mthemticl physics IGO NEYGEBAUE

More information

SOLUTIONS FOR ANALYSIS QUALIFYING EXAM, FALL (1 + µ(f n )) f(x) =. But we don t need the exact bound.) Set

SOLUTIONS FOR ANALYSIS QUALIFYING EXAM, FALL (1 + µ(f n )) f(x) =. But we don t need the exact bound.) Set SOLUTIONS FOR ANALYSIS QUALIFYING EXAM, FALL 28 Nottion: N {, 2, 3,...}. (Tht is, N.. Let (X, M be mesurble spce with σ-finite positive mesure µ. Prove tht there is finite positive mesure ν on (X, M such

More information

Chapters 4 & 5 Integrals & Applications

Chapters 4 & 5 Integrals & Applications Contents Chpters 4 & 5 Integrls & Applictions Motivtion to Chpters 4 & 5 2 Chpter 4 3 Ares nd Distnces 3. VIDEO - Ares Under Functions............................................ 3.2 VIDEO - Applictions

More information

Research Article On Existence and Uniqueness of Solutions of a Nonlinear Integral Equation

Research Article On Existence and Uniqueness of Solutions of a Nonlinear Integral Equation Journl of Applied Mthemtics Volume 2011, Article ID 743923, 7 pges doi:10.1155/2011/743923 Reserch Article On Existence nd Uniqueness of Solutions of Nonliner Integrl Eqution M. Eshghi Gordji, 1 H. Bghni,

More information

Orthogonal Polynomials

Orthogonal Polynomials Mth 4401 Gussin Qudrture Pge 1 Orthogonl Polynomils Orthogonl polynomils rise from series solutions to differentil equtions, lthough they cn be rrived t in vriety of different mnners. Orthogonl polynomils

More information

Chapter 0. What is the Lebesgue integral about?

Chapter 0. What is the Lebesgue integral about? Chpter 0. Wht is the Lebesgue integrl bout? The pln is to hve tutoril sheet ech week, most often on Fridy, (to be done during the clss) where you will try to get used to the ides introduced in the previous

More information

State space systems analysis (continued) Stability. A. Definitions A system is said to be Asymptotically Stable (AS) when it satisfies

State space systems analysis (continued) Stability. A. Definitions A system is said to be Asymptotically Stable (AS) when it satisfies Stte spce systems nlysis (continued) Stbility A. Definitions A system is sid to be Asymptoticlly Stble (AS) when it stisfies ut () = 0, t > 0 lim xt () 0. t A system is AS if nd only if the impulse response

More information

The First Fundamental Theorem of Calculus. If f(x) is continuous on [a, b] and F (x) is any antiderivative. f(x) dx = F (b) F (a).

The First Fundamental Theorem of Calculus. If f(x) is continuous on [a, b] and F (x) is any antiderivative. f(x) dx = F (b) F (a). The Fundmentl Theorems of Clculus Mth 4, Section 0, Spring 009 We now know enough bout definite integrls to give precise formultions of the Fundmentl Theorems of Clculus. We will lso look t some bsic emples

More information

MATH 174A: PROBLEM SET 5. Suggested Solution

MATH 174A: PROBLEM SET 5. Suggested Solution MATH 174A: PROBLEM SET 5 Suggested Solution Problem 1. Suppose tht I [, b] is n intervl. Let f 1 b f() d for f C(I; R) (i.e. f is continuous rel-vlued function on I), nd let L 1 (I) denote the completion

More information

Lecture 1. Functional series. Pointwise and uniform convergence.

Lecture 1. Functional series. Pointwise and uniform convergence. 1 Introduction. Lecture 1. Functionl series. Pointwise nd uniform convergence. In this course we study mongst other things Fourier series. The Fourier series for periodic function f(x) with period 2π is

More information

Credibility Hypothesis Testing of Fuzzy Triangular Distributions

Credibility Hypothesis Testing of Fuzzy Triangular Distributions 666663 Journl of Uncertin Systems Vol.9, No., pp.6-74, 5 Online t: www.jus.org.uk Credibility Hypothesis Testing of Fuzzy Tringulr Distributions S. Smpth, B. Rmy Received April 3; Revised 4 April 4 Abstrct

More information

1 Online Learning and Regret Minimization

1 Online Learning and Regret Minimization 2.997 Decision-Mking in Lrge-Scle Systems My 10 MIT, Spring 2004 Hndout #29 Lecture Note 24 1 Online Lerning nd Regret Minimiztion In this lecture, we consider the problem of sequentil decision mking in

More information

Chapter 3 Polynomials

Chapter 3 Polynomials Dr M DRAIEF As described in the introduction of Chpter 1, pplictions of solving liner equtions rise in number of different settings In prticulr, we will in this chpter focus on the problem of modelling

More information

QUADRATURE is an old-fashioned word that refers to

QUADRATURE is an old-fashioned word that refers to World Acdemy of Science Engineering nd Technology Interntionl Journl of Mthemticl nd Computtionl Sciences Vol:5 No:7 011 A New Qudrture Rule Derived from Spline Interpoltion with Error Anlysis Hdi Tghvfrd

More information

Recitation 3: More Applications of the Derivative

Recitation 3: More Applications of the Derivative Mth 1c TA: Pdric Brtlett Recittion 3: More Applictions of the Derivtive Week 3 Cltech 2012 1 Rndom Question Question 1 A grph consists of the following: A set V of vertices. A set E of edges where ech

More information

A NOTE ON ESTIMATION OF THE GLOBAL INTENSITY OF A CYCLIC POISSON PROCESS IN THE PRESENCE OF LINEAR TREND

A NOTE ON ESTIMATION OF THE GLOBAL INTENSITY OF A CYCLIC POISSON PROCESS IN THE PRESENCE OF LINEAR TREND A NOTE ON ESTIMATION OF THE GLOBAL INTENSITY OF A CYCLIC POISSON PROCESS IN THE PRESENCE OF LINEAR TREND I WAYAN MANGKU Deprtment of Mthemtics, Fculty of Mthemtics nd Nturl Sciences, Bogor Agriculturl

More information

1. Gauss-Jacobi quadrature and Legendre polynomials. p(t)w(t)dt, p {p(x 0 ),...p(x n )} p(t)w(t)dt = w k p(x k ),

1. Gauss-Jacobi quadrature and Legendre polynomials. p(t)w(t)dt, p {p(x 0 ),...p(x n )} p(t)w(t)dt = w k p(x k ), 1. Guss-Jcobi qudrture nd Legendre polynomils Simpson s rule for evluting n integrl f(t)dt gives the correct nswer with error of bout O(n 4 ) (with constnt tht depends on f, in prticulr, it depends on

More information

Calculus of Variations: The Direct Approach

Calculus of Variations: The Direct Approach Clculus of Vritions: The Direct Approch Lecture by Andrejs Treibergs, Notes by Bryn Wilson June 7, 2010 The originl lecture slides re vilble online t: http://www.mth.uth.edu/~treiberg/directmethodslides.pdf

More information

How do we solve these things, especially when they get complicated? How do we know when a system has a solution, and when is it unique?

How do we solve these things, especially when they get complicated? How do we know when a system has a solution, and when is it unique? XII. LINEAR ALGEBRA: SOLVING SYSTEMS OF EQUATIONS Tody we re going to tlk bout solving systems of liner equtions. These re problems tht give couple of equtions with couple of unknowns, like: 6 2 3 7 4

More information

Here we study square linear systems and properties of their coefficient matrices as they relate to the solution set of the linear system.

Here we study square linear systems and properties of their coefficient matrices as they relate to the solution set of the linear system. Section 24 Nonsingulr Liner Systems Here we study squre liner systems nd properties of their coefficient mtrices s they relte to the solution set of the liner system Let A be n n Then we know from previous

More information

NOTES ON HILBERT SPACE

NOTES ON HILBERT SPACE NOTES ON HILBERT SPACE 1 DEFINITION: by Prof C-I Tn Deprtment of Physics Brown University A Hilbert spce is n inner product spce which, s metric spce, is complete We will not present n exhustive mthemticl

More information

P 3 (x) = f(0) + f (0)x + f (0) 2. x 2 + f (0) . In the problem set, you are asked to show, in general, the n th order term is a n = f (n) (0)

P 3 (x) = f(0) + f (0)x + f (0) 2. x 2 + f (0) . In the problem set, you are asked to show, in general, the n th order term is a n = f (n) (0) 1 Tylor polynomils In Section 3.5, we discussed how to pproximte function f(x) round point in terms of its first derivtive f (x) evluted t, tht is using the liner pproximtion f() + f ()(x ). We clled this

More information

Spanning tree congestion of some product graphs

Spanning tree congestion of some product graphs Spnning tree congestion of some product grphs Hiu-Fi Lw Mthemticl Institute Oxford University 4-9 St Giles Oxford, OX1 3LB, United Kingdom e-mil: lwh@mths.ox.c.uk nd Mikhil I. Ostrovskii Deprtment of Mthemtics

More information

S. S. Dragomir. 2, we have the inequality. b a

S. S. Dragomir. 2, we have the inequality. b a Bull Koren Mth Soc 005 No pp 3 30 SOME COMPANIONS OF OSTROWSKI S INEQUALITY FOR ABSOLUTELY CONTINUOUS FUNCTIONS AND APPLICATIONS S S Drgomir Abstrct Compnions of Ostrowski s integrl ineulity for bsolutely

More information

Math 8 Winter 2015 Applications of Integration

Math 8 Winter 2015 Applications of Integration Mth 8 Winter 205 Applictions of Integrtion Here re few importnt pplictions of integrtion. The pplictions you my see on n exm in this course include only the Net Chnge Theorem (which is relly just the Fundmentl

More information

g i fφdx dx = x i i=1 is a Hilbert space. We shall, henceforth, abuse notation and write g i f(x) = f

g i fφdx dx = x i i=1 is a Hilbert space. We shall, henceforth, abuse notation and write g i f(x) = f 1. Appliction of functionl nlysis to PEs 1.1. Introduction. In this section we give little introduction to prtil differentil equtions. In prticulr we consider the problem u(x) = f(x) x, u(x) = x (1) where

More information

Numerical Linear Algebra Assignment 008

Numerical Linear Algebra Assignment 008 Numericl Liner Algebr Assignment 008 Nguyen Qun B Hong Students t Fculty of Mth nd Computer Science, Ho Chi Minh University of Science, Vietnm emil. nguyenqunbhong@gmil.com blog. http://hongnguyenqunb.wordpress.com

More information

Partial Derivatives. Limits. For a single variable function f (x), the limit lim

Partial Derivatives. Limits. For a single variable function f (x), the limit lim Limits Prtil Derivtives For single vrible function f (x), the limit lim x f (x) exists only if the right-hnd side limit equls to the left-hnd side limit, i.e., lim f (x) = lim f (x). x x + For two vribles

More information

An instructive toy model: two paradoxes

An instructive toy model: two paradoxes Tel Aviv University, 2006 Gussin rndom vectors 27 3 Level crossings... the fmous ice formul, undoubtedly one of the most importnt results in the ppliction of smooth stochstic processes..j. Adler nd J.E.

More information

New Integral Inequalities for n-time Differentiable Functions with Applications for pdfs

New Integral Inequalities for n-time Differentiable Functions with Applications for pdfs Applied Mthemticl Sciences, Vol. 2, 2008, no. 8, 353-362 New Integrl Inequlities for n-time Differentible Functions with Applictions for pdfs Aristides I. Kechriniotis Technologicl Eductionl Institute

More information

Abstract inner product spaces

Abstract inner product spaces WEEK 4 Abstrct inner product spces Definition An inner product spce is vector spce V over the rel field R equipped with rule for multiplying vectors, such tht the product of two vectors is sclr, nd the

More information

Positive Solutions of Operator Equations on Half-Line

Positive Solutions of Operator Equations on Half-Line Int. Journl of Mth. Anlysis, Vol. 3, 29, no. 5, 211-22 Positive Solutions of Opertor Equtions on Hlf-Line Bohe Wng 1 School of Mthemtics Shndong Administrtion Institute Jinn, 2514, P.R. Chin sdusuh@163.com

More information

Songklanakarin Journal of Science and Technology SJST R1 Thongchan. A Modified Hyperbolic Secant Distribution

Songklanakarin Journal of Science and Technology SJST R1 Thongchan. A Modified Hyperbolic Secant Distribution A Modified Hyperbolic Secnt Distribution Journl: Songklnkrin Journl of Science nd Technology Mnuscript ID SJST-0-0.R Mnuscript Type: Originl Article Dte Submitted by the Author: 0-Mr-0 Complete List of

More information

Czechoslovak Mathematical Journal, 55 (130) (2005), , Abbotsford. 1. Introduction

Czechoslovak Mathematical Journal, 55 (130) (2005), , Abbotsford. 1. Introduction Czechoslovk Mthemticl Journl, 55 (130) (2005), 933 940 ESTIMATES OF THE REMAINDER IN TAYLOR S THEOREM USING THE HENSTOCK-KURZWEIL INTEGRAL, Abbotsford (Received Jnury 22, 2003) Abstrct. When rel-vlued

More information

Penalized least squares smoothing of two-dimensional mortality tables with imposed smoothness

Penalized least squares smoothing of two-dimensional mortality tables with imposed smoothness Penlized lest squres smoothing of two-dimensionl mortlit tbles with imposed smoothness Eliud Silv 1 Víctor M. Guerrero (speker) 1 Acturil School, Universidd Anáhuc, México Deprtment of Sttistics, Instituto

More information

Main topics for the First Midterm

Main topics for the First Midterm Min topics for the First Midterm The Midterm will cover Section 1.8, Chpters 2-3, Sections 4.1-4.8, nd Sections 5.1-5.3 (essentilly ll of the mteril covered in clss). Be sure to know the results of the

More information

1 2-D Second Order Equations: Separation of Variables

1 2-D Second Order Equations: Separation of Variables Chpter 12 PDEs in Rectngles 1 2-D Second Order Equtions: Seprtion of Vribles 1. A second order liner prtil differentil eqution in two vribles x nd y is A 2 u x + B 2 u 2 x y + C 2 u y + D u 2 x + E u +

More information

Entropy and Ergodic Theory Notes 10: Large Deviations I

Entropy and Ergodic Theory Notes 10: Large Deviations I Entropy nd Ergodic Theory Notes 10: Lrge Devitions I 1 A chnge of convention This is our first lecture on pplictions of entropy in probbility theory. In probbility theory, the convention is tht ll logrithms

More information

Numerical Integration

Numerical Integration Chpter 1 Numericl Integrtion Numericl differentition methods compute pproximtions to the derivtive of function from known vlues of the function. Numericl integrtion uses the sme informtion to compute numericl

More information

Chapter 3. Vector Spaces

Chapter 3. Vector Spaces 3.4 Liner Trnsformtions 1 Chpter 3. Vector Spces 3.4 Liner Trnsformtions Note. We hve lredy studied liner trnsformtions from R n into R m. Now we look t liner trnsformtions from one generl vector spce

More information

Definition of Continuity: The function f(x) is continuous at x = a if f(a) exists and lim

Definition of Continuity: The function f(x) is continuous at x = a if f(a) exists and lim Mth 9 Course Summry/Study Guide Fll, 2005 [1] Limits Definition of Limit: We sy tht L is the limit of f(x) s x pproches if f(x) gets closer nd closer to L s x gets closer nd closer to. We write lim f(x)

More information

Lecture 21: Order statistics

Lecture 21: Order statistics Lecture : Order sttistics Suppose we hve N mesurements of sclr, x i =, N Tke ll mesurements nd sort them into scending order x x x 3 x N Define the mesured running integrl S N (x) = 0 for x < x = i/n for

More information

Energy Bands Energy Bands and Band Gap. Phys463.nb Phenomenon

Energy Bands Energy Bands and Band Gap. Phys463.nb Phenomenon Phys463.nb 49 7 Energy Bnds Ref: textbook, Chpter 7 Q: Why re there insultors nd conductors? Q: Wht will hppen when n electron moves in crystl? In the previous chpter, we discussed free electron gses,

More information

Lecture 6: Singular Integrals, Open Quadrature rules, and Gauss Quadrature

Lecture 6: Singular Integrals, Open Quadrature rules, and Gauss Quadrature Lecture notes on Vritionl nd Approximte Methods in Applied Mthemtics - A Peirce UBC Lecture 6: Singulr Integrls, Open Qudrture rules, nd Guss Qudrture (Compiled 6 August 7) In this lecture we discuss the

More information

NUMERICAL INTEGRATION

NUMERICAL INTEGRATION NUMERICAL INTEGRATION How do we evlute I = f (x) dx By the fundmentl theorem of clculus, if F (x) is n ntiderivtive of f (x), then I = f (x) dx = F (x) b = F (b) F () However, in prctice most integrls

More information

p-adic Egyptian Fractions

p-adic Egyptian Fractions p-adic Egyptin Frctions Contents 1 Introduction 1 2 Trditionl Egyptin Frctions nd Greedy Algorithm 2 3 Set-up 3 4 p-greedy Algorithm 5 5 p-egyptin Trditionl 10 6 Conclusion 1 Introduction An Egyptin frction

More information

ODE: Existence and Uniqueness of a Solution

ODE: Existence and Uniqueness of a Solution Mth 22 Fll 213 Jerry Kzdn ODE: Existence nd Uniqueness of Solution The Fundmentl Theorem of Clculus tells us how to solve the ordinry differentil eqution (ODE) du = f(t) dt with initil condition u() =

More information

New data structures to reduce data size and search time

New data structures to reduce data size and search time New dt structures to reduce dt size nd serch time Tsuneo Kuwbr Deprtment of Informtion Sciences, Fculty of Science, Kngw University, Hirtsuk-shi, Jpn FIT2018 1D-1, No2, pp1-4 Copyright (c)2018 by The Institute

More information

A Bernstein polynomial approach for solution of nonlinear integral equations

A Bernstein polynomial approach for solution of nonlinear integral equations Avilble online t wwwisr-publictionscom/jns J Nonliner Sci Appl, 10 (2017), 4638 4647 Reserch Article Journl Homepge: wwwtjnscom - wwwisr-publictionscom/jns A Bernstein polynomil pproch for solution of

More information

Lecture Note 9: Orthogonal Reduction

Lecture Note 9: Orthogonal Reduction MATH : Computtionl Methods of Liner Algebr 1 The Row Echelon Form Lecture Note 9: Orthogonl Reduction Our trget is to solve the norml eution: Xinyi Zeng Deprtment of Mthemticl Sciences, UTEP A t Ax = A

More information