arxiv: v1 [math.st] 5 Oct 2009

Size: px
Start display at page:

Download "arxiv: v1 [math.st] 5 Oct 2009"

Transcription

1 Nonparametric estimation of an extreme-value copula in arbitrary dimensions arxiv: v1 [math.st] 5 Oct 2009 Gordon Gudendorf 1, Johan Segers 1, Institut de Statistique, Université catholique de Louvain, Voie du Roman Pays 20, B-1348 Louvain-la-Neuve, Belgium Abstract Inference on an extreme-value copula usually proceeds via its Pickands dependence function, hich is a convex function on the unit simplex satisfying certain inequality constraints. In the setting of an iid random sample from a multivariate distribution ith knon margins and unknon extreme-value copula, an extension of the Capéraà Fougères Genest estimator as introduced by D. Zhang, M. T. Wells and L. Peng [Journal of Multivariate Analysis 99 (2008) ]. The joint asymptotic distribution of the estimator as a random function on the simplex as not provided. Moreover, implementation of the estimator requires the choice of a number of eight functions on the simplex, the issue of their optimal selection being left unresolved. A ne, simplified representation of the CFG-estimator combined ith standard empirical process theory provides the means to uncover its asymptotic distribution in the space of continuous, real-valued functions on the simplex. Moreover, the ordinary least-squares estimator of the intercept in a certain linear regression model provides an adaptive version of the CFG-estimator hose asymptotic behavior is the same as if the variance-minimizing eight functions ere used. As illustrated in a simulation study, the gain in efficiency can be quite sizeable. Key ords: empirical process, linear regression, minimum-variance estimator, multivariate extreme-value distribution, ordinary least squares, Pickands dependence function, unit simplex 2010 MSC: 60F17, 62G32, 62H20 Corresponding author addresses: gordon.gudendorf@uclouvain.be (Gordon Gudendorf), johan.segers@uclouvain.be (Johan Segers) 1 Research supported by IAP research netork grant nr. P6/03 of the Belgian government (Belgian Science Policy) and by contract nr. 07/12/002 of the Projet d Actions de Recherche Concertées of the Communauté française de Belgique, granted by the Académie universitaire Louvain. Preprint submitted to Elsevier January 25, 2018

2 1. Introduction Let X i = (X i1,...,x ip ), i {1,...,n}, be iid random vectors from a p-variate, continuous distribution function F ith multivariate extreme-value copula C: for u (0,1] p \{(1,...,1)}, denoting the margins of F by F 1,...,F p, C(u) = P ( F 1 (X i1 ) u 1,...,F p (X ip ) u p ) = exp{ y A(y/ y )} here y j = logu j and y = y y p. (1.1) The function A, hose domain is p = { [0,1] p : p = 1}, is called the Pickands dependence function of C, after Pickands (1981). Multivariate extreme-value copulas arise as the limits of copulas of vectors of component-ise maxima of independent random samples (Deheuvels, 1984; Galambos, 1987). As a consequence, they coincide ith the class of copulas of multivariate extreme-value or max-stable distributions. Therefore, they provide models for dependence beteen extreme values that allo extrapolation beyond the support of the sample. It is then of interest to estimate the Pickands dependence function A. A necessary condition for C in (1.1) to be a copula is that A is convex and satisfies max( 1,..., p ) A() 1 for all p ; in the bivariate case, this is also sufficient. In general, A should admit an integral representation in terms of a spectral measure. Some other properties of Pickands dependence functions are studied in Obretenov (1991) and Falk and Reiss (2008). The upshot of all this is that the class of Pickands dependence functions is infinite-dimensional. This arrants the use of nonparametric methods. Whereas most papers hitherto concentrated on the bivariate case, a nonparametric estimator for general multivariate Pickands dependence functions as introduced in Zhang et al. (2008). This estimator is in fact a multivariate generalization of the one by Capéraà Fougères Genest (Capéraà et al., 1997). The estimator as shon to be uniformly consistent and pointise asymptotically normal. Hoever, the joint asymptotic distribution of the estimator as a random function on p as not provided. Moreover, implementation of the estimator requires the choice of p eight functions λ j on p, the issue of their optimal selection being left unresolved. Using a simplified representation of the above-mentioned estimator, e are able to uncover its asymptotic distribution in the space C( p ) of continuous, real-valued functions on p. Moreover, e give explicit expressions for the eight functions λ j that minimize the pointise asymptotic variance of the estimator. These optimal eight functions depend on the unknon distribution. We sho that the CFG-estimator ith estimated variance-minimizing eight functions can be implemented as the intercept estimator in a certain linear regression model via ordinary least squares. The OLS-estimator is data-adaptive in the sense that the asymptotic distribution is the same as if the optimal eight functions ere used. In a simulation study, the gain in efficiency is shon to be quite sizeable. As in Zhang et al. (2008), the setting here is that of a random sample from a distribution hose margins are knon and hose copula is an extreme-value 2

3 copula. It ould be orthhile to extend this to the case of unknon margins (Guillotte and Perron, 2008; Genest and Segers, 2009) and the case that the copula of F is merely in the domain of attraction of an extreme-value copula (Capéraà and Fougères, 2000; Einmahl and Segers, 2009). The outline of our paper is as follos. The CFG-estimator is introduced in the next section, including its simplified representation and asymptotic distribution. The variance-minimizing eight functions are computed in Section 3 together ith an adaptive estimator based on ordinary least squares in a linear regression frameork. Section 4 reports on a simulation study. The proofs of the results in Sections 2 and 3 are deferred to Appendices A and B, respectively. 2. CFG-estimator and variants Let X i = (X i1,...,x ip ), i {1,...,n}, be iid random vectors from a p-variate, continuous distribution function F ith multivariate extreme-value copula C and Pickands dependence function A as in (1.1). Let F 1,...,F p be the marginal distribution functions of F. Put Y i = (Y i1,...,y ip ) here Y ij = logf j (X ij ) (2.1) for i {1,...,n} and j {1,...,p}. The marginal distributions of the random variables Y ij are standard exponential. The random vectors Y 1,...,Y p are iid ith common joint survivor function P(Y i1 > y 1,...,Y ip > y p ) = C(e y1,...,e yp ) = exp{ y A(y/ y )}, for y [0, ) p \{(0,...,0)}, here y = y y p. Put ξ i () = p Y ij j, p, i {1,...,n}, (2.2) ith denoting minimum and ith the obvious convention for division by zero; in particular, ξ i (e j ) = Y ij for the p standard unit vectors e 1,...,e p in R p. For p and x > 0, e have P ( ξ i () > x ) = P(Y i1 > 1 x,...,y ip > p x) = exp{ xa()}. (2.3) Hencetherandomvariablesξ 1 (),...,ξ n ()constituteanindependentrandom sample from the exponential distribution ith mean 1/A(). It follos that the distribution of logξ i () is Gumbel ith location parameter loga(), hence E[ logξ i ()] = loga()+γ, (2.4) the Euler Mascheroni constant γ = Γ (1) = being the mean of the standard Gumbel distribution. This suggests the naive estimator logân() = 1 n n logξ i () γ, p. (2.5) i=1 3

4 The naive estimator is itself not a valid Pickands dependence function. For instance,itdoesnotverifythevertexconstraintsa(e j ) = 1forallj {1,...,p}. A simple ay to at least remedy this defect is by putting logâcfg n () = logân() λ j () logân(e j ), p, (2.6) here λ 1,...,λ p : p R are continuous functions verifying λ j (e k ) = δ jk for all j,k {1,...,p}. Continuity of the functions λ j is assumed merely to ensure that the resulting estimator is a continuous function of as ell. The superscript CFG refers to the bivarate estimator by Capéraà Fougères Genest in Capéraà et al.(1997), generalized to the multivariate case in Zhang et al. (2008). Actually, the original definition in Zhang et al. (2008) is logâzwp n () = λ j () 1 j 0 n 1 n i=1 1{Z ij() z} z dz, (2.7) z(1 z) here, ith Y ij as in (2.1), Z ij () = Y ik k:k j k Y ij 1 j +, Y p. ik k:k j k Moreover, in (2.7), the eight functions λ j are supposed to be nonnegative and to satisfy the additional constraint λ j () = 1, p. (2.8) Hoever, if (2.8) holds, then actually the to estimators coincide, that is, Â ZWP n () = ÂCFG n (), p. (2.9) The proof of (2.9) is essentially the same as the one in Segers (2007) for the bivariate case, the key being that the integrals in (2.7) can be solved: 1 j 0 1{Z ij () z} z z(1 z) dz = log[1 {(1 j ) Z ij ()}]+log(1 j ) log{(1 j ) Z ij ()} = logy ij logξ i (). In our representation (2.6), hoever, there is no reason hatsoever to restrict the eight functions to satisfy (2.8). The asymptotics of the naive estimator and the CFG-estimator follo from standard empirical process theory as presented for instance in van der Vaart and Wellner (1996) andvan der Vaart (1998). Let C( p ) denote the Banachspace ofcontinuousfunctions from p intorequipped ith the supremumnorm. Convergence in distribution is denoted by the arro. 4

5 Proposition 2.1 (Naive estimator). Let X i = (X i1,...,x ip ), i {1,...,n}, be iid random variables from a p-variate, continuous distribution function F ith multivariate extreme-value copula C and Pickands dependence function A. The naive estimator Ân in (2.5) satisfies sup p Ân() A() 0, n, almost surely, (2.10) and in C( p ), n( Â n A) Aζ, n, (2.11) here ζ is a centered Gaussian process ith covariance function cov ( ζ(v),ζ() ) = cov ( logξ i (v), logξ i () ), v, p, (2.12) ith ξ i ( ) as in (2.2). Theorem 2.2 (CFG-estimator). If, in addition to the assumptions in Proposition 2.1, the functions λ 1,...,λ p : p R are continuous, then sup p ÂCFG n () A() 0, n, almost surely, (2.13) and in C( p ), n( Â CFG n A) Aη, n, (2.14) here η is a centered Gaussian process defined by η() = ζ() ith ζ as in Proposition 2.1. λ j ()ζ(e j ), p, (2.15) Remark 2.3 (Covariance function). The covariance function (2.12) can be expressed in terms of A as follos. An application of the identity log(x) = 0 {1(s x) 1(s 1)}s 1 ds for x (0, ) yields, by Fubini s theorem, cov ( logξ i (v), logξ i () ) ( = P ( ξ i (v) s, ξ i () t ) P ( ξ i (v) s ) P ( ξ i () t )) ds s = [exp{ l(( 1 s) (v 1 t),...,( p s) (v p t))} exp{ sa(v)}exp{ ta()}] ds s here l(y) = y A(y/ y ) and y = y y p. Replacing A by any estimator of it results in an estimator of the covariance function. Hoever, a more practical ay to estimate this function is by the sample covariance of the pairs ( logξ i (v), logξ i ()); see also (the proof of) Theorem 3.2. dt t dt t. 5

6 Remark 2.4 (Shape constraints). A further enhancement to the CFG-estimator is to replace it by the convex minorant of the function min[max{âcfg n (), 1,..., p },1], p, as in Deheuvels (1991) and Jiménez et al. (2001) for the bivariate case. Although the resulting estimator ould be a convex function respecting the bounds max( 1,..., p ) A() 1, in case p 3 this ould still not guarantee it to be a genuine Pickands dependence function. Still other ays to impose(some of) the shape restrictions are spline smoothing under constraints (Hall and Tajvidi, 2000), orthogonal projection (Fils-Villetard et al., 2008), or Bayesian nonparametrics (Guillotte and Perron, 2008). Remark 2.5 (Pickands estimator). A different ay to exploit the exponentiality of the random variables ξ i () in (2.3) ould be via the Pickands estimator 1 Â P n () = 1 n n ξ i () as in Pickands (1981). To impose the vertex constraints A(e j ) = 1, the techniquesofdeheuvels(1991)orhall and Tajvidi(2000)canbeused,seeZhang et al. (2008, p. 578). In the bivariate case hoever, it is knon that the resulting estimators are outperformed by the CFG-estimator ÂCFG n (Segers, 2007; Genest and Segers,2009). ThisisconfirmedinthesimulationstudyinZhang et al. (2008, Section 3), asellasbyouronsimulationsinsection 4. Forthis reason, e restrict attention here to the family of CFG-estimators. i=1 3. The OLS-estimator The question remains hich eight functions λ j to choose in the CFGestimator (2.6). In Zhang et al. (2008), the choice λ j () = j as recommended as a pragmatic one. The option of using variance-minimizing functions λ j as mentioned but not carried out. By casting the estimation problem in a linear regression frameork, e ill obtain an estimator ith the same asymptotic performance as the CFG-estimator ith those optimal eights. In this section, e define the estimator and prove its consistency and asymptotic normality, both in the functional sense. In the next section, the gain in efficiency is assessed by means of simulations. In vie of Theorem 2.2, for each p e have n (ÂCFG n () A() ) A()η(), n, here η() is a zero-mean normal random variable. We ill look for those λ j () that minimise the variance of η(). Let ζ be the Gaussian process on C( p ) in Proposition 2.1. For ease of notation, put λ() = ( λ 1 (),...,λ p () ), ζ(e) = ( ζ(e1 ),...,ζ(e p ) ), 6

7 the symbol denoting matrix transposition. Then varη() = var ( ζ() λ() ζ(e) ) Note that = varζ() 2λ() E[ζ(e)ζ()]+λ() E[ζ(e)ζ(e) ]λ(). Σ = E[ζ(e)ζ(e) ] (3.1) is the covariance matrix of ( logξ(e 1 ),..., logξ(e p )). Provided this matrix is non-singular, var η() attains a unique global minimum for λ() equal to λ opt () = Σ 1 E[ζ(e)ζ()]. (3.2) With this choice of the eight functions, the variance of is equal to η opt () = ζ() λ opt () ζ(e) (3.3) varη opt () = varζ() E[ζ()ζ(e) ]Σ 1 E[ζ(e)ζ()]. (3.4) This variance is minimal over all possible choices of eight functions λ j. The optimal eight functions λ opt j in (3.2) depend on the unknon Pickands dependence function A. Fortunately, replacing these eight functions by uniformly consistent estimators ˆλ n,j is just as good asymptotically. For such estimated eight functions, define the adaptive CFG-estimator by logâcfg n,ad() = logân() ˆλ n,j () logân(e j ), (3.5) Proposition 3.1 (Adaptive CFG-estimator). Assume that, in addition to the assumptions in Proposition 2.1, the matrix Σ in (3.1) is non-singular and ˆλ n,j are random elements in C( p ) such that, for every j {1,...,p}, sup ˆλ n,j () λ opt j () 0, n, almost surely, p ith λ opt j as in (3.2). Then the adaptive CFG-estimator in (3.5) satisfies sup ÂCFG n,ad p and in C( p ), () A() 0, n, almost surely, (3.6) n( Â CFG n,ad A) Aη opt, n, (3.7) here η opt is the zero-mean Gaussian process defined in (3.3). 7

8 Finally e propose a particularly convenient ay to implement the adaptive CFG-estimator in (3.5). For p, let ˆβ n () = (ˆβ n,0 (),..., ˆβ n,p ()) be the minimizer in (b 0,...,b p ) of n ( ( logξi () γ ) b 0 i=1 ( b j logξi (e j ) γ )) 2. (3.8) In ords, ˆβn () is the ordinary least-squares (OLS) estimator of the vector of regression coefficients in a linear regression of the dependent variable logξ i () γ upon the explanatory variables logξ i (e j ) γ, j {1,...,p}. Define the OLS-estimator of A via the estimated intercept by Since the residuals logâols n () = ˆβ n,0 (), p. ˆǫ n,i () = ( logξ i () γ ) ˆβ n,0 () verify n i=1ˆǫ n,i() = 0, e have logâols n () = ˆβ n,0 () = logân() ˆβ n,j () ( logξ i (e j ) γ ) ˆβ n,p () logân(e j ), (3.9) that is, the OLS-estimator is equal to the adaptive CFG-estimator ith estimated eights ˆλ n,j () = ˆβ n,j (). The variance of the (logarithm of the) OLSestimator can be estimated by the sample variance of the residuals, properly corrected for the loss in number of degrees of freedom, ˆσ 2 n,ols() = 1 n p 1 n ˆǫ 2 n,i(), p. (3.10) i=1 Theorem 3.2 (OLS-estimator). Assume that, in addition to the assumptions in Proposition 2.1, the matrix Σ in (3.1) is non-singular. Then, ith probability tending to one, the minimizer ˆβ n () of (3.8) is uniquely defined and for j {1,...,p}, sup ˆβ n,j () λ opt j () 0, n, almost surely. (3.11) p As a consequence, the OLS-estimator in (3.9) is uniformly consistent, sup p ÂOLS n () A() 0, n, almost surely, (3.12) and in C( p ), n( Â OLS n A) Aη opt, n, (3.13) 8

9 here η opt is the zero-mean Gaussian process defined in (3.3). In addition, the variance estimator in (3.10) satisfies sup ˆσ n,ols 2 () varη opt() 0, n, almost surely. p Remark 3.3 (Non-singularity assumption). In the bivariate case, the assumption that the covariance matrix Σ in (3.1) is non-singular is equivalent to the assumption that the copula C is not the comonotone copula (Segers, 2007). We conjecture that in the general multivariate case, a necessary and sufficient condition for Σ to be non-singular is that none of the bivariate margins of C is equal to the comonotone copula. 4. Simulations In order to investigate the finite-sample properties of the estimators discussed in the previous sections, e generated pseudo-random samples from trivariate extreme-value copulas of logistic type as presented in Tan (1990): A() = (θ r r 1 +φ r r 2) 1/r +(θ r r 2 +φ r r 3) 1/r +(θ r r 3 +φ r r 1) 1/r +ψ( r 1 + r 2 + r 3) 1/r +1 θ φ ψ, p, (4.1) for (r,θ,φ,ψ) [1, ) [0,1] 3. To facilitate comparisons, e opted for the same parameter values as chosen in Zhang et al. (2008): a symmetric case, (r,θ,φ,ψ) = (3,0,0,1), and an asymmetric one, (r,θ,φ,ψ) = (6,0.6,0.3,0). For each case samples ere generated of size n {50,100,200} using the simulation algorithms in Stephenson (2003) and implemented in the R-package evd (Stephenson, 2002). Four estimators ere compared: the CFG-estimator ÂCFG n ith eight functions λ j () = j (as recommended in Zhang et al., 2008), the OLS-estimator in (3.9), and the enhanced versions of the original Pickands estimator due to Deheuvels (1991) and Hall and Tajvidi (2000) as presented in Zhang et al. (2008). To visualize the performances of the estimators, e plotted their biases and mean squarederrorsalongthe line { p : 1 = 2 }; see Figures1and 2 for the symmetric and asymmetric logistic dependence functions respectively. In accordance to the theory, the OLS-estimator is in virtually all cases considered more efficient than the CFG-estimator. Moreover, our simulations confirm the findings in Zhang et al. (2008) that the CFG-estimator is typically more efficient than the ones of Deheuvels and Hall Tajvidi. Note that the finite-sample bias of the OLS-estimator is somehat larger than for the other estimators. Hoever, thanks to its minimum-variance property it ends up as an overall inner in terms of mean squared error. Â OLS n 9

10 Symmetric, n=50 Symmetric, n=50 Bias MSE Symmetric, n=100 Symmetric, n=100 Bias MSE 0e+00 2e 04 4e 04 6e 04 8e 04 Symmetric, n=200 Symmetric, n=200 Bias 1e 03 5e 04 0e+00 5e 04 1e 03 MSE 0e+00 1e 04 2e 04 3e 04 4e 04 Figure1: Biases(left)and meansquarederrors(right)ofâols n ()(solid), ÂCFG n ()(dashed), Â HT n () (dash-dotted) and ÂD n () (dotted) along the line 1 = 2 for samples of size n {50, 100, 200} from the trivariate extreme-value copula C ith symmetric logistic dependence function A() = (1 r +r 2 +r 3 )1/r at r = 3. 10

11 Asymmetric, n=50 Asymmetric, n=50 Bias MSE Asymmetric, n=100 Asymmetric, n=100 Bias MSE Asymmetric, n=200 Asymmetric, n=200 Bias MSE Figure2: Biases(left)and meansquarederrors(right)ofâols n ()(solid), ÂCFG n ()(dashed), Â HT n () (dash-dotted) and ÂD n() (dotted) along the line 1 = 2 for samples of size n {50, 100, 200} from the trivariate extreme-value copula C ith asymmetric logistic dependence function A in (4.1) for (r,θ,φ,ψ) = (6,0.6,0.3,0). 11

12 Appendix A. Proofs for Section 2 Proof of Proposition 2.1. For p, define f : (0, ) p R by We can rite ( p f (y) = log y j j logân() = 1 n ) γ, y (0, ) p. (A.1) n f (Y i ). Consider the function class F = {f : p }. We ill sho that F is P- Donsker and therefore also P-Glivenko Cantelli, here P denotes the common probability distribution on (0, ) p of the random vectors Y i. According to Theorem in van der Vaart and Wellner (1996) and the proof thereof, e need to verify that F is a pointise separable Vapnik Cervonenkis-class (VCclass) that admits an envelope function ith a finite second moment under P. Pointise separability follos from the fact that the map f (y) is continuousin p foreachy (0, ) p. TheVC-propertycanbeestablished by repeated applications of Lemmas and , items (i) and (viii), in van der Vaart and Wellner (1996). Finally, the readily established bound p log { y j p } max log y j,log(p)+ logy j (A.2) j yields an envelope function of F all of hose moments are finite under P. Observe that the distribution of p Y ij is Exponential ith mean equal to {pa(1/p,...,1/p)} 1 [1/p,1]. From the fact that F is P-Glivenko Cantelli it follos that sup logân() loga() p = sup 1 n f (Y i ) E[f (Y )] n 0, n, almost surely. p i=1 (Here, e dropped a subscript i for convenience.) Continuity of the map exp : C( p ) C( p ) : f exp(f) yields uniform consistency as in (2.10). Moreover, the P-Donsker property entails n(log  CFG n loga) ζ, n, (A.3) in the space l ( p ) of bounded functions from p into R equipped ith the topology of uniform convergence, here e identified F ith p. The process ζ is zero-mean Gaussian ith covariance function given in (2.12). The sample paths of the limit process ζ are continuous ith respect to the standard deviation (semi-)metric ρ on p defined by i=1 ρ(v,) = [var{f v (Y ) f (Y )}] 1/2, v, p. 12

13 If lim n v n = v in p according to the Euclidean metric, then by continuity of f (y) in and by uniform integrability, also lim n ρ(v n,v) = 0. (Uniform integrability is checked by using the bound in (A.2).) It follos that the trajectories of ζ are also continuous ith respect to the Euclidean metric on p, that is, ζ actually takes its values in C( p ). As the trajectories of the left-hand side in (A.3) are continuous too, the convergence in (A.3) takes place not only l ( p ) but also in C( p ). The convergence in (2.11) follos from the Hadamard-differentiability of the map exp : C( p ) C( p ) : f expf and the functional delta-method (van der Vaart and Wellner, 1996, Section 3.9). Proof of Theorem 2.2. Uniform consistency of ÂCFG n in (2.13) follos from uniform consistency of Ân in (2.10) and the fact that the functions λ j are continuous, hence bounded. To sho (2.14), define L : C( p ) C( p ) by Lf() = f() λ j ()f(e j ) for f C( p ) and p. The operator L is linear and bounded. We have logâcfg n = L(logÂn). Moreover, as A(e j ) = 1 for all j {1,...,p}, also L(logA) = loga. We find n(log  CFG n loga) = L ( n(logân loga) ) Lζ = η, n. The eak convergence in(2.14) follos from the functional delta-method(van der Vaart and Wellner, 1996, Section 3.9). The representation η = Lζ coincides ith (2.15). Appendix B. Proofs for Section 3 Proof of Proposition 3.1. If the optimal eight functions λ opt j ere knon, e could consider the optimal CFG-estimator logâcfg n,opt() = logân() λ opt j () logân(e j ), p. By Theorem 2.2, the optimal CFG-estimator is uniformly consistent (2.13) and is asymptotically normal in the sense of (2.14) ith η = η opt. No logâcfg n,opt () logâcfg n,ad () ˆλ n,j () λ opt j () logân(e j ). By uniform consistency of ˆλ n,j and asymptotic normality of n logân(e j ), e obtain, as n, sup p logâcfg n,opt() logâcfg n,ad() 0, sup n log  CFG n,opt() logâcfg n,ad() 0. p almost surely, 13

14 As a consequence, the adaptive CFG-estimator is uniformly consistent (3.6) and asymptotically normal (3.7). Proof of Theorem 3.2. In analogy to the linear regression frameork, define the n (p+1) matrix X = 1 logξ 1(e 1 ) γ... logξ 1 (e p ) γ logξ n (e 1 ) γ... logξ n (e p ) γ and the n 1 vector Y () = ( logξ 1 () γ,..., logξ n () γ ), p. (No confusion should arise beteen this Y () and the random vectors Y i in (2.1).) Provided the matrix X X is non-singular, the OLS-estimator ˆβ n () is given by ˆβ n () = (X X) 1 X Y (). Recall the functions f in (A.1). For v, p, define g v, : (0, ) p R by g v, (y) = f v (y)f (y), y (0, ) p. By (A.2) and by Example in van der Vaart and Wellner (1996), the function class {g v, : v, p } is P-Donsker and thus P-Glivenko Cantelli, here P is the common distribution on (0, ) p of the random vectors Y i. It follos that, almost surely as n, ( ) n X X, (B.1) 0 Σ ( ) sup 1 loga() n X Y () 0, (B.2) E[ζ(e)ζ()] p As Σ is non-singular, e have ( ) 1 ( ) = 0 Σ 0 Σ 1, hile 1 n X X is ith probability tending to one a non-singular matrix too. We find, almost surely and uniformly in p, ( ) ˆβ n () = n X X n X Y () ( )( ) 1 0 loga() 0 Σ 1 = E[ζ(e)ζ()] ( ) loga() λ opt, n. () Equation (3.11) follos. Proposition 3.1 and equation (3.9) then yield equations (3.12) and (3.13). 14

15 Finally, for the estimation of the variance, note that it does not matter asymptotically if e divide by n or by n p 1. Elementary calculations yield 1 n n ˆǫ 2 n,i() = 1 ( Y () Xˆβn () ) ( Y () Xˆβn () ) n i=1 = 1 ( ) ( ) n Y 1 () Y () n X Y () n X X n X Y (). The Glivenko Cantelli property yields, almost surely and uniformly in p, 1 n Y () Y () = 1 n n ( logξi () γ ) 2 i=1 E [( logξ i () γ ) 2] = varζ()+ ( loga() ) 2, n. Incombinationith (B.1)and(B.2), eobtainthat n 1 n almost surely and uniformly in p to i=1ˆǫ2 n,i () converges varζ()+ ( loga() ) ( ) ( )( ) 2 loga() 1 0 loga() E[ζ(e)ζ()] 0 Σ 1 E[ζ(e)ζ()] hich by (3.4) is equal to varη opt (). = varζ() E[ζ(e) ζ()]σ 1 E[ζ(e)ζ()], References J. Pickands, Multivariate extreme value distributions, in: Proceedings of the 43rd session of the International Statistical Institute, Vol. 2 (Buenos Aires, 1981), vol. 49, , , ith a discussion, P. Deheuvels, Probabilistic aspects of multivariate extremes, in: J. Tiago de Oliveira (Ed.), Statistical extremes and applications, Reidel, , J. Galambos, The asymptotic theory of extreme order statistics, Krieger, Melbourne, Florida, 2nd edn., A. Obretenov, On the dependence function of Sibuya in multivariate extreme value theory, Journal of Multivariate Analysis 36 (1) (1991) M. Falk, R. Reiss, On Pickands coordinates in arbitrary dimensions, Journal of Multivariate Analysis 92 (2) (2008) D. Zhang, M. T. Wells, L. Peng, Nonparametric estimation of the dependence function for a multivariate extreme value distribution, Journal of Multivariate Analysis 99 (4) (2008) P. Capéraà, A.-L. Fougères, C. Genest, A nonparametric estimation procedure for bivariate extreme value copulas, Biometrika 84 (1997)

16 S. Guillotte, F. Perron, A Bayesian estimator for the dependence function of a bivariate extreme-value distribution, The Canadian Journal of Statistics 36 (3) (2008) C. Genest, J. Segers, Rank-based inference for bivariate extreme-value copulas, Annals of Statistics 37 (5B) (2009) P. Capéraà, A.-L. Fougères, Estimation of a bivariate extreme value distribution, Extremes 3 (2000) J. H. J. Einmahl, J. Segers, Maximum empirical likelihood estimation of the spectral measure of an extreme-value distribution, The Annals of Statistics 37 (5B) (2009) J. Segers, Non-parametric inference for bivariate extreme-value copulas, in: M. Ahsanulah, S. Kirmani (Eds.), Extreme Value Distributions, chap. 9, Nova Science Publishers, Inc., , older version available as CentER DP , Tilburg University, A. W. van der Vaart, J. A. Wellner, Weak Convergence and Empirical Processes, Cambridge Series in Statistical and Probabilistic Mathematics, Springer, Ne York, A. W. van der Vaart, Asymptotic statistics, vol. 3 of Cambridge Series in Statistical and Probabilistic Mathematics, Cambridge University Press, Cambridge, P. Deheuvels, On the limiting behavior of the Pickands estimator for bivariate extreme-value distributions, Statistics & Probability Letters 12 (5) (1991) J. R. Jiménez, E. Villa-Diharce, M. Flores, Nonparametric estimation of the dependence function in bivariate extreme value distributions, Journal of Multivariate Analysis 76 (2) (2001) P. Hall, N. Tajvidi, Distribution and dependence-function estimation for bivariate extreme-value distributions, Bernoulli 6 (5) (2000) A. Fils-Villetard, A. Guillou, J. Segers, Projection estimators of Pickands dependence functions, The Canadian Journal of Statistics 36 (3) (2008) J. Tan, Modelling Multivariate Extreme Vlaue Distributions, Biometrika 77 (2) (1990) A. G. Stephenson, Simulating multivariate extrme value analysis of logistic type, Extremes 6 (2003) A. G. Stephenson, evd: Extreme Value Distributions, R Nes 2(2) (2002) June, URL 16

Nonparametric Estimation of the Dependence Function for a Multivariate Extreme Value Distribution

Nonparametric Estimation of the Dependence Function for a Multivariate Extreme Value Distribution Nonparametric Estimation of the Dependence Function for a Multivariate Extreme Value Distribution p. /2 Nonparametric Estimation of the Dependence Function for a Multivariate Extreme Value Distribution

More information

Accounting for extreme-value dependence in multivariate data

Accounting for extreme-value dependence in multivariate data Accounting for extreme-value dependence in multivariate data 38th ASTIN Colloquium Manchester, July 15, 2008 Outline 1. Dependence modeling through copulas 2. Rank-based inference 3. Extreme-value dependence

More information

Bivariate Uniqueness in the Logistic Recursive Distributional Equation

Bivariate Uniqueness in the Logistic Recursive Distributional Equation Bivariate Uniqueness in the Logistic Recursive Distributional Equation Antar Bandyopadhyay Technical Report # 629 University of California Department of Statistics 367 Evans Hall # 3860 Berkeley CA 94720-3860

More information

Semi-parametric estimation of non-stationary Pickands functions

Semi-parametric estimation of non-stationary Pickands functions Semi-parametric estimation of non-stationary Pickands functions Linda Mhalla 1 Joint work with: Valérie Chavez-Demoulin 2 and Philippe Naveau 3 1 Geneva School of Economics and Management, University of

More information

Introduction to Empirical Processes and Semiparametric Inference Lecture 02: Overview Continued

Introduction to Empirical Processes and Semiparametric Inference Lecture 02: Overview Continued Introduction to Empirical Processes and Semiparametric Inference Lecture 02: Overview Continued Michael R. Kosorok, Ph.D. Professor and Chair of Biostatistics Professor of Statistics and Operations Research

More information

T E C H N I C A L R E P O R T PROJECTION ESTIMATORS OF PICKANDS DEPENDENCE FUNCTIONS. FILS-VILLETARD, A., GUILLOU, A. and J.

T E C H N I C A L R E P O R T PROJECTION ESTIMATORS OF PICKANDS DEPENDENCE FUNCTIONS. FILS-VILLETARD, A., GUILLOU, A. and J. T E C H N I C A L R E P O R T 08014 PROJECTION ESTIMATORS OF PICKANDS DEPENDENCE FUNCTIONS FILS-VILLETARD, A., GUILLOU, A. and J. SEGERS * I A P S T A T I S T I C S N E T W O R K INTERUNIVERSITY ATTRACTION

More information

Endogeny for the Logistic Recursive Distributional Equation

Endogeny for the Logistic Recursive Distributional Equation Zeitschrift für Analysis und ihre Anendungen Journal for Analysis and its Applications Volume 30 20, 237 25 DOI: 0.47/ZAA/433 c European Mathematical Society Endogeny for the Logistic Recursive Distributional

More information

y(x) = x w + ε(x), (1)

y(x) = x w + ε(x), (1) Linear regression We are ready to consider our first machine-learning problem: linear regression. Suppose that e are interested in the values of a function y(x): R d R, here x is a d-dimensional vector-valued

More information

A Conditional Approach to Modeling Multivariate Extremes

A Conditional Approach to Modeling Multivariate Extremes A Approach to ing Multivariate Extremes By Heffernan & Tawn Department of Statistics Purdue University s April 30, 2014 Outline s s Multivariate Extremes s A central aim of multivariate extremes is trying

More information

Introduction A basic result from classical univariate extreme value theory is expressed by the Fisher-Tippett theorem. It states that the limit distri

Introduction A basic result from classical univariate extreme value theory is expressed by the Fisher-Tippett theorem. It states that the limit distri The dependence function for bivariate extreme value distributions { a systematic approach Claudia Kluppelberg Angelika May October 2, 998 Abstract In this paper e classify the existing bivariate models

More information

Efficiency of Profile/Partial Likelihood in the Cox Model

Efficiency of Profile/Partial Likelihood in the Cox Model Efficiency of Profile/Partial Likelihood in the Cox Model Yuichi Hirose School of Mathematics, Statistics and Operations Research, Victoria University of Wellington, New Zealand Summary. This paper shows

More information

D I S C U S S I O N P A P E R

D I S C U S S I O N P A P E R I N S T I T U T D E S T A T I S T I Q U E B I O S T A T I S T I Q U E E T S C I E N C E S A C T U A R I E L L E S ( I S B A ) UNIVERSITÉ CATHOLIQUE DE LOUVAIN D I S C U S S I O N P A P E R 2014/06 Adaptive

More information

RANK-BASED INFERENCE FOR BIVARIATE EXTREME-VALUE COPULAS

RANK-BASED INFERENCE FOR BIVARIATE EXTREME-VALUE COPULAS The Annals of Statistics 29, Vol. 37, No. 5B, 299 322 DOI: 1.1214/8-AOS672 Institute of Mathematical Statistics, 29 RANK-BASED INFERENCE FOR BIVARIATE EXTREME-VALUE COPULAS BY CHRISTIAN GENEST 1 AND JOHAN

More information

Modelling Multivariate Peaks-over-Thresholds using Generalized Pareto Distributions

Modelling Multivariate Peaks-over-Thresholds using Generalized Pareto Distributions Modelling Multivariate Peaks-over-Thresholds using Generalized Pareto Distributions Anna Kiriliouk 1 Holger Rootzén 2 Johan Segers 1 Jennifer L. Wadsworth 3 1 Université catholique de Louvain (BE) 2 Chalmers

More information

ESTIMATING BIVARIATE TAIL

ESTIMATING BIVARIATE TAIL Elena DI BERNARDINO b joint work with Clémentine PRIEUR a and Véronique MAUME-DESCHAMPS b a LJK, Université Joseph Fourier, Grenoble 1 b Laboratoire SAF, ISFA, Université Lyon 1 Framework Goal: estimating

More information

DISCUSSION PAPER 2016/43

DISCUSSION PAPER 2016/43 I N S T I T U T D E S T A T I S T I Q U E B I O S T A T I S T I Q U E E T S C I E N C E S A C T U A R I E L L E S ( I S B A ) DISCUSSION PAPER 2016/43 Bounds on Kendall s Tau for Zero-Inflated Continuous

More information

Concentration behavior of the penalized least squares estimator

Concentration behavior of the penalized least squares estimator Concentration behavior of the penalized least squares estimator Penalized least squares behavior arxiv:1511.08698v2 [math.st] 19 Oct 2016 Alan Muro and Sara van de Geer {muro,geer}@stat.math.ethz.ch Seminar

More information

CHAPTER 3 THE COMMON FACTOR MODEL IN THE POPULATION. From Exploratory Factor Analysis Ledyard R Tucker and Robert C. MacCallum

CHAPTER 3 THE COMMON FACTOR MODEL IN THE POPULATION. From Exploratory Factor Analysis Ledyard R Tucker and Robert C. MacCallum CHAPTER 3 THE COMMON FACTOR MODEL IN THE POPULATION From Exploratory Factor Analysis Ledyard R Tucker and Robert C. MacCallum 1997 19 CHAPTER 3 THE COMMON FACTOR MODEL IN THE POPULATION 3.0. Introduction

More information

Estimation of the Bivariate and Marginal Distributions with Censored Data

Estimation of the Bivariate and Marginal Distributions with Censored Data Estimation of the Bivariate and Marginal Distributions with Censored Data Michael Akritas and Ingrid Van Keilegom Penn State University and Eindhoven University of Technology May 22, 2 Abstract Two new

More information

Exercises Measure Theoretic Probability

Exercises Measure Theoretic Probability Exercises Measure Theoretic Probability 2002-2003 Week 1 1. Prove the folloing statements. (a) The intersection of an arbitrary family of d-systems is again a d- system. (b) The intersection of an arbitrary

More information

Estimation and Inference of Quantile Regression. for Survival Data under Biased Sampling

Estimation and Inference of Quantile Regression. for Survival Data under Biased Sampling Estimation and Inference of Quantile Regression for Survival Data under Biased Sampling Supplementary Materials: Proofs of the Main Results S1 Verification of the weight function v i (t) for the lengthbiased

More information

Nonconcave Penalized Likelihood with A Diverging Number of Parameters

Nonconcave Penalized Likelihood with A Diverging Number of Parameters Nonconcave Penalized Likelihood with A Diverging Number of Parameters Jianqing Fan and Heng Peng Presenter: Jiale Xu March 12, 2010 Jianqing Fan and Heng Peng Presenter: JialeNonconcave Xu () Penalized

More information

arxiv: v1 [math.pr] 24 Aug 2018

arxiv: v1 [math.pr] 24 Aug 2018 On a class of norms generated by nonnegative integrable distributions arxiv:180808200v1 [mathpr] 24 Aug 2018 Michael Falk a and Gilles Stupfler b a Institute of Mathematics, University of Würzburg, Würzburg,

More information

Verifying Regularity Conditions for Logit-Normal GLMM

Verifying Regularity Conditions for Logit-Normal GLMM Verifying Regularity Conditions for Logit-Normal GLMM Yun Ju Sung Charles J. Geyer January 10, 2006 In this note we verify the conditions of the theorems in Sung and Geyer (submitted) for the Logit-Normal

More information

On the Estimation and Application of Max-Stable Processes

On the Estimation and Application of Max-Stable Processes On the Estimation and Application of Max-Stable Processes Zhengjun Zhang Department of Statistics University of Wisconsin Madison, WI 53706, USA Co-author: Richard Smith EVA 2009, Fort Collins, CO Z. Zhang

More information

STAT 992 Paper Review: Sure Independence Screening in Generalized Linear Models with NP-Dimensionality J.Fan and R.Song

STAT 992 Paper Review: Sure Independence Screening in Generalized Linear Models with NP-Dimensionality J.Fan and R.Song STAT 992 Paper Review: Sure Independence Screening in Generalized Linear Models with NP-Dimensionality J.Fan and R.Song Presenter: Jiwei Zhao Department of Statistics University of Wisconsin Madison April

More information

(Part 1) High-dimensional statistics May / 41

(Part 1) High-dimensional statistics May / 41 Theory for the Lasso Recall the linear model Y i = p j=1 β j X (j) i + ɛ i, i = 1,..., n, or, in matrix notation, Y = Xβ + ɛ, To simplify, we assume that the design X is fixed, and that ɛ is N (0, σ 2

More information

Ph.D. Qualifying Exam Friday Saturday, January 6 7, 2017

Ph.D. Qualifying Exam Friday Saturday, January 6 7, 2017 Ph.D. Qualifying Exam Friday Saturday, January 6 7, 2017 Put your solution to each problem on a separate sheet of paper. Problem 1. (5106) Let X 1, X 2,, X n be a sequence of i.i.d. observations from a

More information

Notes, March 4, 2013, R. Dudley Maximum likelihood estimation: actual or supposed

Notes, March 4, 2013, R. Dudley Maximum likelihood estimation: actual or supposed 18.466 Notes, March 4, 2013, R. Dudley Maximum likelihood estimation: actual or supposed 1. MLEs in exponential families Let f(x,θ) for x X and θ Θ be a likelihood function, that is, for present purposes,

More information

Product-limit estimators of the survival function with left or right censored data

Product-limit estimators of the survival function with left or right censored data Product-limit estimators of the survival function with left or right censored data 1 CREST-ENSAI Campus de Ker-Lann Rue Blaise Pascal - BP 37203 35172 Bruz cedex, France (e-mail: patilea@ensai.fr) 2 Institut

More information

Stat 8112 Lecture Notes Weak Convergence in Metric Spaces Charles J. Geyer January 23, Metric Spaces

Stat 8112 Lecture Notes Weak Convergence in Metric Spaces Charles J. Geyer January 23, Metric Spaces Stat 8112 Lecture Notes Weak Convergence in Metric Spaces Charles J. Geyer January 23, 2013 1 Metric Spaces Let X be an arbitrary set. A function d : X X R is called a metric if it satisfies the folloing

More information

Linear Regression Linear Regression with Shrinkage

Linear Regression Linear Regression with Shrinkage Linear Regression Linear Regression ith Shrinkage Introduction Regression means predicting a continuous (usually scalar) output y from a vector of continuous inputs (features) x. Example: Predicting vehicle

More information

Restricted Maximum Likelihood in Linear Regression and Linear Mixed-Effects Model

Restricted Maximum Likelihood in Linear Regression and Linear Mixed-Effects Model Restricted Maximum Likelihood in Linear Regression and Linear Mixed-Effects Model Xiuming Zhang zhangxiuming@u.nus.edu A*STAR-NUS Clinical Imaging Research Center October, 015 Summary This report derives

More information

Calibration Estimation of Semiparametric Copula Models with Data Missing at Random

Calibration Estimation of Semiparametric Copula Models with Data Missing at Random Calibration Estimation of Semiparametric Copula Models with Data Missing at Random Shigeyuki Hamori 1 Kaiji Motegi 1 Zheng Zhang 2 1 Kobe University 2 Renmin University of China Econometrics Workshop UNC

More information

Calibration Estimation for Semiparametric Copula Models under Missing Data

Calibration Estimation for Semiparametric Copula Models under Missing Data Calibration Estimation for Semiparametric Copula Models under Missing Data Shigeyuki Hamori 1 Kaiji Motegi 1 Zheng Zhang 2 1 Kobe University 2 Renmin University of China Economics and Economic Growth Centre

More information

Some Theories about Backfitting Algorithm for Varying Coefficient Partially Linear Model

Some Theories about Backfitting Algorithm for Varying Coefficient Partially Linear Model Some Theories about Backfitting Algorithm for Varying Coefficient Partially Linear Model 1. Introduction Varying-coefficient partially linear model (Zhang, Lee, and Song, 2002; Xia, Zhang, and Tong, 2004;

More information

Notes on Asymptotic Theory: Convergence in Probability and Distribution Introduction to Econometric Theory Econ. 770

Notes on Asymptotic Theory: Convergence in Probability and Distribution Introduction to Econometric Theory Econ. 770 Notes on Asymptotic Theory: Convergence in Probability and Distribution Introduction to Econometric Theory Econ. 770 Jonathan B. Hill Dept. of Economics University of North Carolina - Chapel Hill November

More information

Modèles stochastiques II

Modèles stochastiques II Modèles stochastiques II INFO 154 Gianluca Bontempi Département d Informatique Boulevard de Triomphe - CP 1 http://ulbacbe/di Modéles stochastiques II p1/50 The basics of statistics Statistics starts ith

More information

Group-invariant solutions of nonlinear elastodynamic problems of plates and shells *

Group-invariant solutions of nonlinear elastodynamic problems of plates and shells * Group-invariant solutions of nonlinear elastodynamic problems of plates and shells * V. A. Dzhupanov, V. M. Vassilev, P. A. Dzhondzhorov Institute of mechanics, Bulgarian Academy of Sciences, Acad. G.

More information

Uncertainty Quantification for Inverse Problems. November 7, 2011

Uncertainty Quantification for Inverse Problems. November 7, 2011 Uncertainty Quantification for Inverse Problems November 7, 2011 Outline UQ and inverse problems Review: least-squares Review: Gaussian Bayesian linear model Parametric reductions for IP Bias, variance

More information

A PRACTICAL WAY FOR ESTIMATING TAIL DEPENDENCE FUNCTIONS

A PRACTICAL WAY FOR ESTIMATING TAIL DEPENDENCE FUNCTIONS Statistica Sinica 20 2010, 365-378 A PRACTICAL WAY FOR ESTIMATING TAIL DEPENDENCE FUNCTIONS Liang Peng Georgia Institute of Technology Abstract: Estimating tail dependence functions is important for applications

More information

arxiv: v2 [math.st] 8 May 2014

arxiv: v2 [math.st] 8 May 2014 Extreme value copula estimation based on block maxima of a multivariate stationary time series arxiv:1311.3060v2 [math.st] 8 May 2014 Axel Bücher Universität Heidelberg Institut für Angewandte Mathematik

More information

Econ 2148, fall 2017 Gaussian process priors, reproducing kernel Hilbert spaces, and Splines

Econ 2148, fall 2017 Gaussian process priors, reproducing kernel Hilbert spaces, and Splines Econ 2148, fall 2017 Gaussian process priors, reproducing kernel Hilbert spaces, and Splines Maximilian Kasy Department of Economics, Harvard University 1 / 37 Agenda 6 equivalent representations of the

More information

Multivariate Distributions

Multivariate Distributions IEOR E4602: Quantitative Risk Management Spring 2016 c 2016 by Martin Haugh Multivariate Distributions We will study multivariate distributions in these notes, focusing 1 in particular on multivariate

More information

CURRENT STATUS LINEAR REGRESSION. By Piet Groeneboom and Kim Hendrickx Delft University of Technology and Hasselt University

CURRENT STATUS LINEAR REGRESSION. By Piet Groeneboom and Kim Hendrickx Delft University of Technology and Hasselt University CURRENT STATUS LINEAR REGRESSION By Piet Groeneboom and Kim Hendrickx Delft University of Technology and Hasselt University We construct n-consistent and asymptotically normal estimates for the finite

More information

COS 424: Interacting with Data. Lecturer: Rob Schapire Lecture #15 Scribe: Haipeng Zheng April 5, 2007

COS 424: Interacting with Data. Lecturer: Rob Schapire Lecture #15 Scribe: Haipeng Zheng April 5, 2007 COS 424: Interacting ith Data Lecturer: Rob Schapire Lecture #15 Scribe: Haipeng Zheng April 5, 2007 Recapitulation of Last Lecture In linear regression, e need to avoid adding too much richness to the

More information

Statistical inference on Lévy processes

Statistical inference on Lévy processes Alberto Coca Cabrero University of Cambridge - CCA Supervisors: Dr. Richard Nickl and Professor L.C.G.Rogers Funded by Fundación Mutua Madrileña and EPSRC MASDOC/CCA student workshop 2013 26th March Outline

More information

Introduction to Empirical Processes and Semiparametric Inference Lecture 12: Glivenko-Cantelli and Donsker Results

Introduction to Empirical Processes and Semiparametric Inference Lecture 12: Glivenko-Cantelli and Donsker Results Introduction to Empirical Processes and Semiparametric Inference Lecture 12: Glivenko-Cantelli and Donsker Results Michael R. Kosorok, Ph.D. Professor and Chair of Biostatistics Professor of Statistics

More information

Bivariate generalized Pareto distribution

Bivariate generalized Pareto distribution Bivariate generalized Pareto distribution in practice Eötvös Loránd University, Budapest, Hungary Minisymposium on Uncertainty Modelling 27 September 2011, CSASC 2011, Krems, Austria Outline Short summary

More information

Quantile Processes for Semi and Nonparametric Regression

Quantile Processes for Semi and Nonparametric Regression Quantile Processes for Semi and Nonparametric Regression Shih-Kang Chao Department of Statistics Purdue University IMS-APRM 2016 A joint work with Stanislav Volgushev and Guang Cheng Quantile Response

More information

Linear Regression Linear Regression with Shrinkage

Linear Regression Linear Regression with Shrinkage Linear Regression Linear Regression ith Shrinkage Introduction Regression means predicting a continuous (usually scalar) output y from a vector of continuous inputs (features) x. Example: Predicting vehicle

More information

Estimating Bivariate Tail: a copula based approach

Estimating Bivariate Tail: a copula based approach Estimating Bivariate Tail: a copula based approach Elena Di Bernardino, Université Lyon 1 - ISFA, Institut de Science Financiere et d'assurances - AST&Risk (ANR Project) Joint work with Véronique Maume-Deschamps

More information

Empirical Processes: General Weak Convergence Theory

Empirical Processes: General Weak Convergence Theory Empirical Processes: General Weak Convergence Theory Moulinath Banerjee May 18, 2010 1 Extended Weak Convergence The lack of measurability of the empirical process with respect to the sigma-field generated

More information

Review (Probability & Linear Algebra)

Review (Probability & Linear Algebra) Review (Probability & Linear Algebra) CE-725 : Statistical Pattern Recognition Sharif University of Technology Spring 2013 M. Soleymani Outline Axioms of probability theory Conditional probability, Joint

More information

Calibration Estimation of Semiparametric Copula Models with Data Missing at Random

Calibration Estimation of Semiparametric Copula Models with Data Missing at Random Calibration Estimation of Semiparametric Copula Models with Data Missing at Random Shigeyuki Hamori 1 Kaiji Motegi 1 Zheng Zhang 2 1 Kobe University 2 Renmin University of China Institute of Statistics

More information

Contributions to extreme-value analysis

Contributions to extreme-value analysis Contributions to extreme-value analysis Stéphane Girard INRIA Rhône-Alpes & LJK (team MISTIS). 655, avenue de l Europe, Montbonnot. 38334 Saint-Ismier Cedex, France Stephane.Girard@inria.fr February 6,

More information

Manifold Learning for Subsequent Inference

Manifold Learning for Subsequent Inference Manifold Learning for Subsequent Inference Carey E. Priebe Johns Hopkins University June 20, 2018 DARPA Fundamental Limits of Learning (FunLoL) Los Angeles, California http://arxiv.org/abs/1806.01401 Key

More information

MMSE Equalizer Design

MMSE Equalizer Design MMSE Equalizer Design Phil Schniter March 6, 2008 [k] a[m] P a [k] g[k] m[k] h[k] + ṽ[k] q[k] y [k] P y[m] For a trivial channel (i.e., h[k] = δ[k]), e kno that the use of square-root raisedcosine (SRRC)

More information

Logistic Regression. Machine Learning Fall 2018

Logistic Regression. Machine Learning Fall 2018 Logistic Regression Machine Learning Fall 2018 1 Where are e? We have seen the folloing ideas Linear models Learning as loss minimization Bayesian learning criteria (MAP and MLE estimation) The Naïve Bayes

More information

f-divergence Estimation and Two-Sample Homogeneity Test under Semiparametric Density-Ratio Models

f-divergence Estimation and Two-Sample Homogeneity Test under Semiparametric Density-Ratio Models IEEE Transactions on Information Theory, vol.58, no.2, pp.708 720, 2012. 1 f-divergence Estimation and Two-Sample Homogeneity Test under Semiparametric Density-Ratio Models Takafumi Kanamori Nagoya University,

More information

FULL LIKELIHOOD INFERENCES IN THE COX MODEL

FULL LIKELIHOOD INFERENCES IN THE COX MODEL October 20, 2007 FULL LIKELIHOOD INFERENCES IN THE COX MODEL BY JIAN-JIAN REN 1 AND MAI ZHOU 2 University of Central Florida and University of Kentucky Abstract We use the empirical likelihood approach

More information

Model Selection and Geometry

Model Selection and Geometry Model Selection and Geometry Pascal Massart Université Paris-Sud, Orsay Leipzig, February Purpose of the talk! Concentration of measure plays a fundamental role in the theory of model selection! Model

More information

DISCUSSION PAPER 2016/46

DISCUSSION PAPER 2016/46 I N S T I T U T D E S T A T I S T I Q U E B I O S T A T I S T I Q U E E T S C I E N C E S A C T U A R I E L L E S ( I S B A ) DISCUSSION PAPER 2016/46 Bounds on Concordance-Based Validation Statistics

More information

1. The Multivariate Classical Linear Regression Model

1. The Multivariate Classical Linear Regression Model Business School, Brunel University MSc. EC550/5509 Modelling Financial Decisions and Markets/Introduction to Quantitative Methods Prof. Menelaos Karanasos (Room SS69, Tel. 08956584) Lecture Notes 5. The

More information

Estimation of the functional Weibull-tail coefficient

Estimation of the functional Weibull-tail coefficient 1/ 29 Estimation of the functional Weibull-tail coefficient Stéphane Girard Inria Grenoble Rhône-Alpes & LJK, France http://mistis.inrialpes.fr/people/girard/ June 2016 joint work with Laurent Gardes,

More information

Linear Regression. In this problem sheet, we consider the problem of linear regression with p predictors and one intercept,

Linear Regression. In this problem sheet, we consider the problem of linear regression with p predictors and one intercept, Linear Regression In this problem sheet, we consider the problem of linear regression with p predictors and one intercept, y = Xβ + ɛ, where y t = (y 1,..., y n ) is the column vector of target values,

More information

EXAM IN STATISTICAL MACHINE LEARNING STATISTISK MASKININLÄRNING

EXAM IN STATISTICAL MACHINE LEARNING STATISTISK MASKININLÄRNING EXAM IN STATISTICAL MACHINE LEARNING STATISTISK MASKININLÄRNING DATE AND TIME: August 30, 2018, 14.00 19.00 RESPONSIBLE TEACHER: Niklas Wahlström NUMBER OF PROBLEMS: 5 AIDING MATERIAL: Calculator, mathematical

More information

Frailty Models and Copulas: Similarities and Differences

Frailty Models and Copulas: Similarities and Differences Frailty Models and Copulas: Similarities and Differences KLARA GOETHALS, PAUL JANSSEN & LUC DUCHATEAU Department of Physiology and Biometrics, Ghent University, Belgium; Center for Statistics, Hasselt

More information

Mathematical Institute, University of Utrecht. The problem of estimating the mean of an observed Gaussian innite-dimensional vector

Mathematical Institute, University of Utrecht. The problem of estimating the mean of an observed Gaussian innite-dimensional vector On Minimax Filtering over Ellipsoids Eduard N. Belitser and Boris Y. Levit Mathematical Institute, University of Utrecht Budapestlaan 6, 3584 CD Utrecht, The Netherlands The problem of estimating the mean

More information

Statistics: Learning models from data

Statistics: Learning models from data DS-GA 1002 Lecture notes 5 October 19, 2015 Statistics: Learning models from data Learning models from data that are assumed to be generated probabilistically from a certain unknown distribution is a crucial

More information

Exercises Measure Theoretic Probability

Exercises Measure Theoretic Probability Exercises Measure Theoretic Probability Chapter 1 1. Prove the folloing statements. (a) The intersection of an arbitrary family of d-systems is again a d- system. (b) The intersection of an arbitrary family

More information

A measure of radial asymmetry for bivariate copulas based on Sobolev norm

A measure of radial asymmetry for bivariate copulas based on Sobolev norm A measure of radial asymmetry for bivariate copulas based on Sobolev norm Ahmad Alikhani-Vafa Ali Dolati Abstract The modified Sobolev norm is used to construct an index for measuring the degree of radial

More information

Time Series Analysis. James D. Hamilton PRINCETON UNIVERSITY PRESS PRINCETON, NEW JERSEY

Time Series Analysis. James D. Hamilton PRINCETON UNIVERSITY PRESS PRINCETON, NEW JERSEY Time Series Analysis James D. Hamilton PRINCETON UNIVERSITY PRESS PRINCETON, NEW JERSEY PREFACE xiii 1 Difference Equations 1.1. First-Order Difference Equations 1 1.2. pth-order Difference Equations 7

More information

SOME CONVERSE LIMIT THEOREMS FOR EXCHANGEABLE BOOTSTRAPS

SOME CONVERSE LIMIT THEOREMS FOR EXCHANGEABLE BOOTSTRAPS SOME CONVERSE LIMIT THEOREMS OR EXCHANGEABLE BOOTSTRAPS Jon A. Wellner University of Washington The bootstrap Glivenko-Cantelli and bootstrap Donsker theorems of Giné and Zinn (990) contain both necessary

More information

Max-Margin Ratio Machine

Max-Margin Ratio Machine JMLR: Workshop and Conference Proceedings 25:1 13, 2012 Asian Conference on Machine Learning Max-Margin Ratio Machine Suicheng Gu and Yuhong Guo Department of Computer and Information Sciences Temple University,

More information

Testing Some Covariance Structures under a Growth Curve Model in High Dimension

Testing Some Covariance Structures under a Growth Curve Model in High Dimension Department of Mathematics Testing Some Covariance Structures under a Growth Curve Model in High Dimension Muni S. Srivastava and Martin Singull LiTH-MAT-R--2015/03--SE Department of Mathematics Linköping

More information

Improving linear quantile regression for

Improving linear quantile regression for Improving linear quantile regression for replicated data arxiv:1901.0369v1 [stat.ap] 16 Jan 2019 Kaushik Jana 1 and Debasis Sengupta 2 1 Imperial College London, UK 2 Indian Statistical Institute, Kolkata,

More information

The extremal elliptical model: Theoretical properties and statistical inference

The extremal elliptical model: Theoretical properties and statistical inference 1/25 The extremal elliptical model: Theoretical properties and statistical inference Thomas OPITZ Supervisors: Jean-Noel Bacro, Pierre Ribereau Institute of Mathematics and Modeling in Montpellier (I3M)

More information

Overview of Extreme Value Theory. Dr. Sawsan Hilal space

Overview of Extreme Value Theory. Dr. Sawsan Hilal space Overview of Extreme Value Theory Dr. Sawsan Hilal space Maths Department - University of Bahrain space November 2010 Outline Part-1: Univariate Extremes Motivation Threshold Exceedances Part-2: Bivariate

More information

YAN GUO, JUHI JANG, AND NING JIANG

YAN GUO, JUHI JANG, AND NING JIANG LOCAL HILBERT EXPANSION FOR THE BOLTZMANN EQUATION YAN GUO, JUHI JANG, AND NING JIANG Abstract. We revisit the classical ork of Caflisch [C] for compressible Euler limit of the Boltzmann equation. By using

More information

Stochastic Optimization with Inequality Constraints Using Simultaneous Perturbations and Penalty Functions

Stochastic Optimization with Inequality Constraints Using Simultaneous Perturbations and Penalty Functions International Journal of Control Vol. 00, No. 00, January 2007, 1 10 Stochastic Optimization with Inequality Constraints Using Simultaneous Perturbations and Penalty Functions I-JENG WANG and JAMES C.

More information

ECON The Simple Regression Model

ECON The Simple Regression Model ECON 351 - The Simple Regression Model Maggie Jones 1 / 41 The Simple Regression Model Our starting point will be the simple regression model where we look at the relationship between two variables In

More information

Large Sample Properties of Estimators in the Classical Linear Regression Model

Large Sample Properties of Estimators in the Classical Linear Regression Model Large Sample Properties of Estimators in the Classical Linear Regression Model 7 October 004 A. Statement of the classical linear regression model The classical linear regression model can be written in

More information

Linear models and their mathematical foundations: Simple linear regression

Linear models and their mathematical foundations: Simple linear regression Linear models and their mathematical foundations: Simple linear regression Steffen Unkel Department of Medical Statistics University Medical Center Göttingen, Germany Winter term 2018/19 1/21 Introduction

More information

Time Series Analysis. James D. Hamilton PRINCETON UNIVERSITY PRESS PRINCETON, NEW JERSEY

Time Series Analysis. James D. Hamilton PRINCETON UNIVERSITY PRESS PRINCETON, NEW JERSEY Time Series Analysis James D. Hamilton PRINCETON UNIVERSITY PRESS PRINCETON, NEW JERSEY & Contents PREFACE xiii 1 1.1. 1.2. Difference Equations First-Order Difference Equations 1 /?th-order Difference

More information

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A. 1. Let P be a probability measure on a collection of sets A. (a) For each n N, let H n be a set in A such that H n H n+1. Show that P (H n ) monotonically converges to P ( k=1 H k) as n. (b) For each n

More information

Nonlinear error dynamics for cycled data assimilation methods

Nonlinear error dynamics for cycled data assimilation methods Nonlinear error dynamics for cycled data assimilation methods A J F Moodey 1, A S Lawless 1,2, P J van Leeuwen 2, R W E Potthast 1,3 1 Department of Mathematics and Statistics, University of Reading, UK.

More information

Simulation of multivariate distributions with fixed marginals and correlations

Simulation of multivariate distributions with fixed marginals and correlations Simulation of multivariate distributions with fixed marginals and correlations Mark Huber and Nevena Marić June 24, 2013 Abstract Consider the problem of drawing random variates (X 1,..., X n ) from a

More information

Semiparametric posterior limits

Semiparametric posterior limits Statistics Department, Seoul National University, Korea, 2012 Semiparametric posterior limits for regular and some irregular problems Bas Kleijn, KdV Institute, University of Amsterdam Based on collaborations

More information

Artificial Neural Networks. Part 2

Artificial Neural Networks. Part 2 Artificial Neural Netorks Part Artificial Neuron Model Folloing simplified model of real neurons is also knon as a Threshold Logic Unit x McCullouch-Pitts neuron (943) x x n n Body of neuron f out Biological

More information

Modelling Dependence with Copulas and Applications to Risk Management. Filip Lindskog, RiskLab, ETH Zürich

Modelling Dependence with Copulas and Applications to Risk Management. Filip Lindskog, RiskLab, ETH Zürich Modelling Dependence with Copulas and Applications to Risk Management Filip Lindskog, RiskLab, ETH Zürich 02-07-2000 Home page: http://www.math.ethz.ch/ lindskog E-mail: lindskog@math.ethz.ch RiskLab:

More information

On A Comparison between Two Measures of Spatial Association

On A Comparison between Two Measures of Spatial Association Journal of Modern Applied Statistical Methods Volume 9 Issue Article 3 5--00 On A Comparison beteen To Measures of Spatial Association Faisal G. Khamis Al-Zaytoonah University of Jordan, faisal_alshamari@yahoo.com

More information

Modelling Dependent Credit Risks

Modelling Dependent Credit Risks Modelling Dependent Credit Risks Filip Lindskog, RiskLab, ETH Zürich 30 November 2000 Home page:http://www.math.ethz.ch/ lindskog E-mail:lindskog@math.ethz.ch RiskLab:http://www.risklab.ch Modelling Dependent

More information

Tail negative dependence and its applications for aggregate loss modeling

Tail negative dependence and its applications for aggregate loss modeling Tail negative dependence and its applications for aggregate loss modeling Lei Hua Division of Statistics Oct 20, 2014, ISU L. Hua (NIU) 1/35 1 Motivation 2 Tail order Elliptical copula Extreme value copula

More information

1 Effects of Regularization For this problem, you are required to implement everything by yourself and submit code.

1 Effects of Regularization For this problem, you are required to implement everything by yourself and submit code. This set is due pm, January 9 th, via Moodle. You are free to collaborate on all of the problems, subject to the collaboration policy stated in the syllabus. Please include any code ith your submission.

More information

Simulation of Max Stable Processes

Simulation of Max Stable Processes Simulation of Max Stable Processes Whitney Huang Department of Statistics Purdue University November 6, 2013 1 / 30 Outline 1 Max-Stable Processes 2 Unconditional Simulations 3 Conditional Simulations

More information

arxiv: v4 [math.pr] 7 Feb 2012

arxiv: v4 [math.pr] 7 Feb 2012 THE MULTIVARIATE PIECING-TOGETHER APPROACH REVISITED STEFAN AULBACH, MICHAEL FALK, AND MARTIN HOFMANN arxiv:1108.0920v4 math.pr] 7 Feb 2012 Abstract. The univariate Piecing-Together approach (PT) fits

More information

Lecture Quantitative Finance Spring Term 2015

Lecture Quantitative Finance Spring Term 2015 on bivariate Lecture Quantitative Finance Spring Term 2015 Prof. Dr. Erich Walter Farkas Lecture 07: April 2, 2015 1 / 54 Outline on bivariate 1 2 bivariate 3 Distribution 4 5 6 7 8 Comments and conclusions

More information

Statement: With my signature I confirm that the solutions are the product of my own work. Name: Signature:.

Statement: With my signature I confirm that the solutions are the product of my own work. Name: Signature:. MATHEMATICAL STATISTICS Homework assignment Instructions Please turn in the homework with this cover page. You do not need to edit the solutions. Just make sure the handwriting is legible. You may discuss

More information

On the approximation of real powers of sparse, infinite, bounded and Hermitian matrices

On the approximation of real powers of sparse, infinite, bounded and Hermitian matrices On the approximation of real poers of sparse, infinite, bounded and Hermitian matrices Roman Werpachoski Center for Theoretical Physics, Al. Lotnikó 32/46 02-668 Warszaa, Poland Abstract We describe a

More information