TEST D'HOMOGÉNÉITÉ AVEC LES L-MOMENTS MULTIVARIÉS. Rapport de recherche No R-933 Mai 2007



by F. Chebana* and T.B.M.J. Ouarda
Chaire en hydrologie statistique (Hydro-Québec / CRSNG)
Chaire du Canada en estimation des variables hydrologiques
INRS-ETE, Université du Québec
490, rue de la Couronne (Québec), Canada G1K 9A9

4 ISBN :

TABLE OF CONTENTS

LIST OF FIGURES
LIST OF TABLES
ABSTRACT
1 INTRODUCTION AND LITERATURE REVIEW
2 THEORETICAL BACKGROUND
2.1 Multivariate L-moments
2.2 Copulas
3 DEVELOPMENT OF THE PROPOSED MULTIVARIATE STATISTICS
3.1 Discordancy test
3.2 Homogeneity test
4 ADAPTATION TO FLOODS
5 SIMULATION STUDY
6 SIMULATION RESULTS
7 CONCLUSIONS
8 ACKNOWLEDGMENTS
9 REFERENCES
NOTATIONS


LIST OF FIGURES

Figure 1 Typical flood hydrograph
Figure 2 Geographical location of the Skootamatta basin in Ontario, Canada
Figure 3a Test power for completely heterogeneous region with n =
Figure 3b Test power for marginally heterogeneous region with n =
Figure 3c Test power for dependence heterogeneous region with n =
Figure 4 Rates of heterogeneity measure for dependence heterogeneous region with n = 30


LIST OF TABLES

Table 1 Flood volume and peak models for some basins considered in the literature
Table 2 Discordancy values of sites in the described regions, n = 30, N =
Table 3 Simulation results for homogeneity test when regions are homogeneous, n =
Table 4 Simulation results for homogeneity test when regions are 50% heterogeneous with n =
Table 5 Simulation results for homogeneity test when regions are 30% bimodal with n =
Table 6 Simulation results for homogeneity test when regions are 50% heterogeneous and variable n from site to site and N =
Table 7 Simulation results for homogeneity test when n = 60 and N =


ABSTRACT

Several types of hydrological events are described with multivariate characteristics (droughts, floods, rain storms, etc.). When carrying out a multivariate regional frequency analysis for these events, it is important to consider all these characteristics jointly. The aim of this paper is to extend the statistical homogeneity test of Hosking and Wallis (1993) to the multivariate case. As tools, multivariate L-moments are used to define the statistics and general copula models are used to describe the statistical behaviour of dependent variables. The usefulness of the methodology is illustrated on flood events. Monte Carlo simulations are also performed for a bivariate Gumbel logistic model with GEV marginal distributions. Results illustrate the power of the proposed multivariate L-moment homogeneity test to detect heterogeneity on the whole structure of the model and on the marginal distributions. In a bivariate flood setting, a comparison is carried out with the classical homogeneity test of Hosking and Wallis based on several types of regions.


1 INTRODUCTION AND LITERATURE REVIEW

Hydrologic events are complex and often characterized by the joint behaviour of several random variables, which are usually not independent. Examples of multivariate representation of hydrologic phenomena include storm duration and intensity (Yue, 2001a; Salvadori and De Michele, 2004b); flood peak, volume and duration (Ashkar, 1980; Yue et al., 1999; Ouarda et al., 2000; Yue, 2001b; Yue and Rasmussen, 2002; Shiau, 2003; De Michele et al., 2005; Zhang and Singh, 2006); and drought volume, duration and magnitude (Kim et al., 2003; Ashkar et al., 1998). This multivariate understanding of flood, storm or drought event characteristics is essential for several engineering planning, design and management activities. Multivariate approaches represent these events better than classical univariate tools. Snyder (1962) and Wong (1963) carried out the first applications of multivariate analysis tools in hydrological analysis. A thorough understanding of multivariate hydrological events requires the study of the joint probabilistic behaviour of two or more correlated random variables that characterize the events (Yue et al., 2001). For investigating the statistical behaviour of dependent variables, copulas have recently been shown to be a useful mathematical tool for hydrological applications (El Adlouni et al., 2004; Salvadori and De Michele, 2004a).

Several bivariate distributions have been considered in the literature for local multivariate studies. For instance, an expression of the joint distribution function of the largest flood peak and its time of occurrence was developed by Gupta et al. (1976). To study rainfall intensity and the corresponding depth, Singh and Singh (1991) derived a bivariate probability density with marginal exponential distributions. Bacchi et al. (1994) modeled extreme rainfall duration and severity by using a bivariate distribution with marginal exponential distributions. The bivariate normal distribution was investigated by Goel et al. (1998) to represent the joint distribution of flood peaks and volumes based on a partial duration series. To represent the joint probability distribution of flood peaks and volumes and the joint probability distribution of flood volumes and durations, Yue et al. (1999) used the Gumbel mixed model with standard Gumbel marginal distributions. Yue (2001b), Yue and Rasmussen (2002) and Shiau (2003) used the Gumbel logistic model with standard Gumbel marginal distributions to model flood volume and peak for different basins. Salvadori and De Michele (2004b) considered storm duration-intensity using the generalized Pareto distribution with a suitable 2-copula from Frank's family. In the study by El Adlouni et al. (2004), several copulas are considered to model flood peak and volume with, respectively, Gumbel and Gamma marginal distributions.

Regional frequency analysis is commonly used for the estimation of extreme hydrological events, such as floods, at sites where little or no data are available. It makes it possible to use data available from other stations in the same hydrologic region. In general, a regional flood frequency procedure consists of two steps: delineation of hydrologically homogeneous regions and regional estimation. This subject was investigated in several studies, including Stedinger and Tasker (1986), Burn (1990), Hosking and Wallis (1993), Durrans and Tomic (1996), Nguyen and Pendey (1996) and Alila (1999, 2000). GREHYS (1996a,b) presented the results of an intercomparison of various regional flood estimation procedures obtained by coupling four methods for delineating homogeneous regions and seven regional estimation methods.

The size of a region is a factor that is closely related to the notion of degree of homogeneity. Indeed, considering a region with few sites guarantees a high degree of homogeneity. However, in such a situation, the available data may not be sufficient to carry out a suitable regional estimation. On the other hand, large regions may contain some sites that are dissimilar to the target one. In the region of influence (ROI) approach, Burn (1990) proposed three options to select a threshold value combined with a weight function in order to define a convenient homogeneous region for a given site. Using a jackknife resampling procedure, Ouarda et al. (2001) proposed to address this problem by optimizing the relative bias and the relative mean square error of quantile estimates.

Hosking and Wallis (1993) proposed a procedure to deal with the issue of homogeneity testing in the univariate framework. Their procedure consists of three statistics that measure the discordancy of sites, the heterogeneity of the region and the goodness-of-fit of the regional distribution. The proposed statistics are defined on the basis of the L-moments of local data (Hosking, 1990).

Most of the literature related to the multivariate representation of hydrological phenomena deals with at-site (local) multivariate frequency analysis of hydrological events. Very little effort has been devoted to the joint representation of the characteristics of hydrological events in regional hydrological modeling at ungauged sites. A joint regional study of flood peaks and volumes using a canonical correlation analysis procedure was carried out by Ouarda et al. (2000) in the province of Quebec, Canada. It is possible to carry out a regional frequency analysis for each variable of the event, each having its own homogeneous region. However, it is of interest to identify a single homogeneous region for which the given variables have approximately the same joint distribution across sites. Univariate regional hydrological frequency analysis can only provide a limited evaluation of these events at ungauged sites and is not sufficient to fully represent multivariate hydrological phenomena.

From a statistical point of view, the homogeneity of a region with several characteristics can be tested by using either one multivariate test or a series of univariate tests. One important aspect of the use of a multivariate test, as opposed to a series of univariate tests, concerns the control of first kind errors. If $p$ independent univariate tests are carried out, each at the 5% significance level, then the probability of getting no significant result is $0.95^p$. Therefore, the probability of getting at least one significant result is $1 - 0.95^p$, which may be unacceptably large. On the other hand, a multivariate test using the 5% level of significance gives a 0.05 probability of first kind error, independently of the number of variables involved. This is a distinct advantage over a series of univariate tests, particularly when the number of variables is large. It can also be argued that the use of a single multivariate test provides a better procedure in many cases than making a large number of univariate tests. A multivariate test has the additional advantage of taking proper account of the correlation between variables (see Manly, 2005).

The aim of the present paper is to extend the discordancy statistic and the homogeneity test of Hosking and Wallis (1993) to the multivariate case. This is considered an important step in regional frequency analysis of multivariate events, and consists in testing whether a region is homogeneous or heterogeneous. A multivariate version of L-moments, defined by Serfling and Xiao (2006), is used to develop the multivariate discordancy and homogeneity statistics. The multivariate nature of hydrological events is modeled using copulas.

In the univariate context, Hosking and Wallis (1993) consider the index flood model originally proposed by Dalrymple (1960) for regional estimation. It is based on the assumption that floods at different sites within a region are identically distributed except for a scale factor. Hosking and Wallis (1993) treat their statistic as a heterogeneity measure. Therefore, they judge its performance by evaluating the relative root mean square error (RRMSE) of quantile estimates in an index flood model. However, Fill and Stedinger (1995) used the Hosking-Wallis statistic as a statistical test to examine homogeneity and evaluated its power for comparison with other tests.

In this paper, both potential uses of the statistics are considered: homogeneity test and heterogeneity measure. Hence, the evaluation of the performance of the proposed tests is twofold. On the one hand, the power is used as a criterion when the statistic is treated as a statistical homogeneity test. On the other hand, the occurrence rates of three kinds of regions (homogeneous, acceptably homogeneous and definitely heterogeneous) are used when it is treated as a heterogeneity measure. Recall that the power of a statistical test is the probability that a sample falls in the critical region when the alternative hypothesis is true. The reasons for this choice are: (a) a multivariate version of the index flood model has yet to be developed to allow the computation of quantiles and consequently their RRMSE, and (b) the evaluation based on quantile RRMSE evaluates the whole regional frequency analysis procedure (the homogeneity test as well as the parameter and quantile estimation method). Consequently, when a result is not satisfactory in the RRMSE sense, one cannot identify the source of the poor performance, which may be the homogeneity test, the model or the parameter estimation method.

Monte Carlo simulations are carried out to validate and evaluate the results. The selected bivariate model for the local joint variable volume-peak is the Gumbel logistic model with Gumbel marginal distributions. Several kinds of heterogeneous regions are generated. When the generated regions are homogeneous, the parameters of the model are approximately those of the Skootamatta basin in Ontario, Canada (Yue and Rasmussen, 2002).

The paper is organized as follows. In Section 2, a short discussion of the theoretical background is presented: multivariate L-moments and copulas. The theoretical developments of the discordancy and homogeneity tests are presented in Section 3. Section 4 deals with the adaptation of the approach to flood events. The simulation experiment is presented in Section 5, and Section 6 deals with the discussion and interpretation of results. Concluding remarks are presented in the last section.
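The first kind error argument made earlier in this section (p independent univariate tests, each at the 5% level) can be checked numerically. The short Python sketch below is an illustration only: the function name `familywise_error` is hypothetical, and it assumes that the p tests are independent and that the region is truly homogeneous.

```python
def familywise_error(p, alpha=0.05):
    """Probability of at least one significant result among p independent
    tests, each carried out at level alpha, when the null hypothesis holds."""
    return 1.0 - (1.0 - alpha) ** p


# Error rate for one to five variables tested separately at the 5% level.
for p in (1, 2, 3, 5):
    print(p, round(familywise_error(p), 4))
```

For three flood characteristics tested separately (peak, volume and duration), the rate is about 0.14 instead of the nominal 0.05.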

2 THEORETICAL BACKGROUND

In this section the mathematical tools needed for the development of the multivariate homogeneity and discordancy tests are briefly presented. All notations used in the following are reported in Section [...].

2.1 Multivariate L-moments

The L-moment approach offers strong advantages for the modeling of heavy-tailed distributions such as some of the distributions used in hydrology. The properties and advantages of L-moments are presented in Hosking and Wallis (1997). Multivariate L-moments were principally developed by Serfling and Xiao (2006). In the following, the bivariate L-moment case is briefly presented.

Let $X^{(j)}$ be a random variable with distribution $F_j$, for $j = 1, 2$. By analogy with a covariance representation of L-moments of order $k > 1$, multivariate L-moments are matrices $\Lambda_k$ with L-comoment elements defined by:

$$\lambda_{k[ij]} = \mathrm{Cov}\left(X^{(i)},\, P^*_{k-1}\left(F_j(X^{(j)})\right)\right), \quad i, j = 1, 2 \text{ and } k = 2, 3, \ldots \tag{1}$$

where $P^*_k$ is the so-called shifted Legendre polynomial. Note that the elements $\lambda_{k[ij]}$ and $\lambda_{k[ji]}$ are not necessarily equal. Particularly, the first L-comoment elements are:

$$\begin{aligned}
\lambda_{2[12]} &= 2\,\mathrm{Cov}\left(X^{(1)},\, F_2(X^{(2)})\right) \\
\lambda_{3[12]} &= 6\,\mathrm{Cov}\left(X^{(1)},\, \left(F_2(X^{(2)}) - 1/2\right)^2\right) \\
\lambda_{4[12]} &= \mathrm{Cov}\left(X^{(1)},\, 20\left(F_2(X^{(2)}) - 1/2\right)^3 - 3\left(F_2(X^{(2)}) - 1/2\right)\right)
\end{aligned} \tag{2}$$

which are respectively the L-covariance, L-coskewness and L-cokurtosis. Note that the kth L-comoment of $X^{(1)}$ with respect to $X^{(2)}$ is translation invariant and scale equivariant with respect to transformations of $X^{(1)}$, and translation and scale invariant with respect to transformations of $X^{(2)}$; that is, for positive $b$ and $d$, and arbitrary $a$ and $c$, it satisfies:

$$\lambda_{k[12]}\left(a + bX^{(1)},\, c + dX^{(2)}\right) = b\,\lambda_{k[12]}\left(X^{(1)}, X^{(2)}\right) \tag{3}$$

The L-comoment coefficients are given by

$$\tau_{k[12]} = \frac{\lambda_{k[12]}}{\lambda_2^{(1)}} \ \text{ for } k \ge 3, \quad \text{and} \quad \tau_{2[12]} = \frac{\lambda_{2[12]}}{\lambda_1^{(1)}} \tag{4}$$

where $\lambda_k^{(j)} = \lambda_{k[jj]}$ is the classical kth L-moment of the variable $X^{(j)}$, $j = 1, 2$, as defined by Hosking (1990). A hierarchy of intuitively appealing analogues of the classical covariance and the central comoments is thus provided by L-comoments. Their interpretations and comparisons are facilitated by the fact that they are defined in terms of the classical covariance operator. The matrix of the L-comoment coefficients is written as

$$\Lambda^*_k = \left(\tau_{k[ij]}\right)_{i,j=1,2} = \begin{pmatrix} \tau_{k[11]} & \tau_{k[12]} \\ \tau_{k[21]} & \tau_{k[22]} \end{pmatrix} \tag{5}$$
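To make definition (1) and property (3) concrete, the following Python sketch computes a simple plug-in estimate of an L-comoment. It is only an illustration under stated assumptions: the distribution function of the second variable is replaced by the plotting positions (rank − 0.5)/n, ties are assumed absent, and the data are made up. It is not the unbiased sample version of Serfling and Xiao (2006), and the function names are hypothetical.

```python
def shifted_legendre(k, u):
    """Shifted Legendre polynomial P*_k(u) on [0, 1], for k = 1, 2, 3."""
    if k == 1:
        return 2 * u - 1
    if k == 2:
        return 6 * u * u - 6 * u + 1
    if k == 3:
        return 20 * u ** 3 - 30 * u * u + 12 * u - 1
    raise ValueError("only k = 1, 2, 3 are implemented")


def plugin_l_comoment(x1, x2, k):
    """Plug-in estimate of lambda_k[12] = Cov(X1, P*_{k-1}(F_2(X2))), k = 2, 3, 4.

    Assumption: F_2 is approximated by the plotting positions (rank - 0.5) / n
    of x2 (no ties). This is a sketch, not the Serfling-Xiao estimator.
    """
    n = len(x1)
    order = sorted(range(n), key=lambda i: x2[i])
    u = [0.0] * n
    for rank, i in enumerate(order):  # rank is 0-based
        u[i] = (rank + 0.5) / n
    g = [shifted_legendre(k - 1, ui) for ui in u]
    mean_x = sum(x1) / n
    mean_g = sum(g) / n
    return sum((a - mean_x) * (b - mean_g) for a, b in zip(x1, g)) / n


# Made-up flood volumes and peaks, for illustration only.
volume = [310.0, 150.0, 420.0, 275.0, 198.0, 505.0, 330.0, 240.0]
peak = [21.0, 9.5, 30.2, 18.4, 15.1, 28.0, 25.3, 12.7]
l2_12 = plugin_l_comoment(volume, peak, 2)  # L-covariance of volume w.r.t. peak
```

Because the estimate depends on the second variable only through its ranks, rescaling the peaks leaves it unchanged, while rescaling the volumes by b multiplies it by b, in line with (3).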

Particularly, for $k = 2$ the L-covariation matrix is given by:

$$\Lambda^*_2 = \begin{pmatrix} \tau_{2[11]} & \tau_{2[12]} \\ \tau_{2[21]} & \tau_{2[22]} \end{pmatrix} \tag{6}$$

and for $k = 1$, the first-order bivariate L-moment corresponds to the mean vector $\lambda_1 = E\left(X^{(1)}, X^{(2)}\right)^t$. As indicated by Serfling and Xiao (2006), the L-comoments are similar in structure and behavior to the univariate L-moments and capture their attractive properties.

The multivariate L-moments defined previously are based on a theoretical population distribution; however, their finite sample versions are useful to define statistical tests and also to estimate multivariate distribution parameters. Their formulas and properties are presented in Serfling and Xiao (2006). In the R software package, William (2006) proposes an implementation of these finite sample versions.

2.2 Copulas

To overcome the limitations of classical dependence measures, copulas have recently received increasing attention in various fields of science (see for instance Nelsen, 1999). A copula is a description and a model of the dependence structure between random variables, independently of the marginal laws. The general development of copula theory can be found in Nelsen (1999).

A copula is a function $C: I \times I \to I$ ($I = [0, 1]$) such that:

for all $u, v \in I$: $C(u, 0) = 0$, $C(u, 1) = u$, $C(0, v) = 0$ and $C(1, v) = v$;

for all $u_1, u_2, v_1, v_2 \in I$ such that $u_1 \le u_2$ and $v_1 \le v_2$:

$$C(u_2, v_2) - C(u_2, v_1) - C(u_1, v_2) + C(u_1, v_1) \ge 0 \tag{7}$$

The link between copulas and bivariate distributions is provided by the following result of Sklar (1959), which states that the most general marginal-free description of the dependence structure of multivariate distributions is through its copula. Let $F_1$ and $F_2$ denote the marginal distribution functions of the random variables $X^{(1)}$ and $X^{(2)}$, and let $F_{1,2}$ be a joint distribution function with marginals $F_1$ and $F_2$. Then there exists a copula $C$ such that, for all real $x_1$ and $x_2$, we have:

$$F_{1,2}(x_1, x_2) = C\left(F_1(x_1), F_2(x_2)\right) \tag{8}$$

If $F_1$ and $F_2$ are continuous, then $C$ is unique.

Archimedean and extreme-value copulas represent classes of particular interest. The class of extreme-value copulas arises as the possible limit of copulas of the variable $\left(M_n^{(1)}, M_n^{(2)}\right)$, where $M_n^{(1)} = \max_{1 \le i \le n} X_i^{(1)}$, $M_n^{(2)} = \max_{1 \le i \le n} X_i^{(2)}$ and $\left(X_i^{(1)}, X_i^{(2)}\right)_{1 \le i \le n}$ is a bivariate sample of independent and identically distributed random variables. A useful representation proposed by Pickands (1981) facilitates the use of bivariate extreme-value copulas. Formally, a copula $C$ is an extreme-value copula if and only if there exists a real-valued function $A$ on the interval $[0, 1]$ such that:

$$C(u, v) = \exp\left\{ (\log u + \log v)\, A\left( \frac{\log u}{\log u + \log v} \right) \right\}, \quad 0 < u, v < 1 \tag{9}$$

where $A$ is a convex function defined on $[0, 1]$ with $\max\{t, 1 - t\} \le A(t) \le 1$.

The case $A \equiv 1$ corresponds to independence, and $A(t) = \max\{t, 1 - t\}$ corresponds to the copula $C(u, v) = \min(u, v)$. Statistical inference on Pickands' function $A$ can summarize the inference on its bivariate extreme-value copula $C$.

A bivariate Archimedean copula is characterized by a generator $\psi(\cdot)$ that is a convex decreasing function satisfying $\psi(1) = 0$, where:

$$C(u, v) = \psi^{-1}\left(\psi(u) + \psi(v)\right), \quad 0 < u, v < 1 \tag{10}$$

Copulas that belong to this class are symmetric and associative.

A simple and popular model is the Gumbel logistic model, whose copula is the only one to meet at the same time the conditions of the extreme-value copula with:

$$A(t) = \left(t^m + (1 - t)^m\right)^{1/m}, \quad m \ge 1 \tag{11}$$

and the conditions of Archimedean copulas with:

$$\psi(t) = (-\log t)^m, \quad m \ge 1 \tag{12}$$
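As a quick numerical check of the statement above, the Python sketch below evaluates the Gumbel logistic copula once through the extreme-value representation (9) with the dependence function (11), and once through the Archimedean representation (10) with the generator (12). The function names are hypothetical and the evaluation points are arbitrary.

```python
import math


def pickands_a(t, m):
    """Dependence function (11) of the Gumbel logistic model."""
    return (t ** m + (1 - t) ** m) ** (1 / m)


def copula_extreme_value(u, v, m):
    """Copula value from representation (9), for 0 < u, v < 1."""
    s = math.log(u) + math.log(v)
    return math.exp(s * pickands_a(math.log(u) / s, m))


def copula_archimedean(u, v, m):
    """Copula value from representation (10) with generator (12)."""
    psi_u = (-math.log(u)) ** m
    psi_v = (-math.log(v)) ** m
    return math.exp(-((psi_u + psi_v) ** (1 / m)))  # inverse generator


u, v, m = 0.3, 0.7, 1.41
print(copula_extreme_value(u, v, m), copula_archimedean(u, v, m))
```

With m = 1 both functions return the product uv, which is the independence case A ≡ 1.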


3 DEVELOPMENT OF THE PROPOSED MULTIVARIATE STATISTICS

In this section, the proposed multivariate discordancy and homogeneity tests are presented in their general forms. It is important to note that, before proceeding with the multivariate regional procedure, it is advisable to test the independence between variables. The reader is referred to Ondo et al. (1997) for a review of such tests. In the case of independence, it is sufficient to use several univariate Hosking and Wallis (1993) tests, according to the number of variables.

3.1 Discordancy test

The assessment of the discordancy measure of a site $i$ among a set of $N$ sites is a preliminary step before proceeding with the homogeneity analysis. In the following, the discordancy test proposed by Hosking and Wallis (1993) is extended to the multivariate framework. For this purpose, the matrix $U_i = \left[\Lambda_2^{*(i)}\ \Lambda_3^{*(i)}\ \Lambda_4^{*(i)}\right]^t$ is considered for each site $i$. It contains the three matrices $\Lambda_2^{*(i)}$, $\Lambda_3^{*(i)}$ and $\Lambda_4^{*(i)}$ defined in equations (5) and (6). The following matrix $D_i$ is defined by:

$$D_i = \frac{1}{3}\left(U_i - \bar{U}\right)^t S^{-1} \left(U_i - \bar{U}\right) \tag{13}$$

where

$$S = (N - 1)^{-1} \sum_{i=1}^{N} \left(U_i - \bar{U}\right)\left(U_i - \bar{U}\right)^t \tag{14}$$

$$\bar{U} = N^{-1} \sum_{i=1}^{N} U_i \tag{15}$$

and $A^t$ is the transpose of a matrix or a vector $A$.

The number 3 appearing in the denominator of expression (13) can be replaced by the value 12, which represents the number of elements in the three matrices $\Lambda_2^{*(i)}$, $\Lambda_3^{*(i)}$ and $\Lambda_4^{*(i)}$, and which can be seen as the number of degrees of freedom of the chi-square distribution. However, simulations show that the use of the value 12 reduces the discordancy statistic values and prevents the statistic from making discordant sites more evident.

In order to evaluate the discordancy of a site $i$, it is possible to use a norm $\|D_i\|$ of the matrix $D_i$. This transformation from the multidimensional space to the real line has the advantage of defining an intuitive distance in the vector space and of reducing exactly to the usual univariate case. Several matrix norms can be used for this purpose. For instance, the maximum absolute column sum norm $\|A\|_1$ of a matrix $A$ with elements $a_{ij}$ is defined as:

$$\|A\|_1 = \max_j \sum_i |a_{ij}| \tag{16}$$

The spectral norm $\|A\|_2$ is the square root of the maximum eigenvalue of $A^t A$:

$$\|A\|_2 = \sqrt{\text{maximum eigenvalue of } A^t A} \tag{17}$$

The maximum absolute row sum norm is defined by:

$$\|A\|_\infty = \max_i \sum_j |a_{ij}| \tag{18}$$
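The three norms (16) to (18) can be illustrated on a small matrix. The Python sketch below is a minimal example, not part of the report: the function names are hypothetical, and the spectral norm is only implemented for 2 × 2 matrices (the size of D_i implied by (13) to (15) in the bivariate case), using the closed-form largest eigenvalue of the symmetric matrix A^t A.

```python
import math


def norm_1(a):
    """Maximum absolute column sum, equation (16)."""
    return max(sum(abs(row[j]) for row in a) for j in range(len(a[0])))


def norm_inf(a):
    """Maximum absolute row sum, equation (18)."""
    return max(sum(abs(x) for x in row) for row in a)


def norm_2_for_2x2(a):
    """Spectral norm, equation (17), restricted to 2 x 2 matrices."""
    (a11, a12), (a21, a22) = a
    p = a11 * a11 + a21 * a21  # element (1,1) of A^t A
    q = a11 * a12 + a21 * a22  # off-diagonal element of A^t A
    r = a12 * a12 + a22 * a22  # element (2,2) of A^t A
    largest = (p + r) / 2 + math.sqrt(((p - r) / 2) ** 2 + q * q)
    return math.sqrt(largest)


example = [[1.0, -2.0], [3.0, 4.0]]  # arbitrary matrix, for illustration only
print(norm_1(example), norm_2_for_2x2(example), norm_inf(example))
```

For a diagonal matrix all three norms coincide with the largest absolute diagonal element.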

Finally, the Frobenius norm, sometimes called the Euclidean norm, is given by:

$$\|A\|_F = \sqrt{\sum_{i,j} |a_{ij}|^2} = \sqrt{\text{trace of } A^t A} \tag{19}$$

The reader can consult Horn and Johnson (1990) for more details about matrix norms. These various matrix norms are tested in the simulation study of the present work.

A site $i$ is discordant, with respect to the considered set of sites, if $\|D_i\|$ takes large values, where $\|\cdot\|$ is one of the above-mentioned norms. As a critical value for $\|D_i\|$, the constant $c = \chi^2_\alpha(3)/3$ may be considered for large regions, where $\chi^2_\alpha(d)$ is the quantile of order $\alpha$ of a chi-square distribution with $d$ degrees of freedom. As indicated in Hosking and Wallis (1997), it is advisable to examine the data for the sites with the largest $\|D_i\|$ values, regardless of the magnitude of these values. Special attention should be given to the definition of the critical value taken for small regions. For instance, these constants may be obtained for finite sample sizes by the use of the bootstrap technique; see e.g. Efron and Tibshirani (1994).

3.2 Homogeneity test

The proposed homogeneity test is described as a multivariate analogue of the statistic given by Hosking and Wallis (1993). Following the same logic as in the above section, the statistic $V$ is defined as:

$$V = \left( \left(\sum_{i=1}^{N} n_i\right)^{-1} \sum_{i=1}^{N} n_i \left\| \Lambda_2^{*(i)} - \bar{\Lambda}_2^{*} \right\|^2 \right)^{1/2} \tag{20}$$

where $\|\cdot\|$ is one of the norms defined above, $\bar{\Lambda}_2^{*} = \left(\sum_{i=1}^{N} n_i\right)^{-1} \sum_{i=1}^{N} n_i \Lambda_2^{*(i)}$, and $\Lambda_2^{*(i)}$, defined in (6), is the L-covariation coefficient matrix for site $i$, with record length $n_i$, $i = 1, \ldots, N$. When handling only one variable, the statistic $V$ reduces to the $V$ statistic of Hosking and Wallis (1993), whatever the norm taken.

Similarly to the univariate case, the observed value of the statistic $V$ is standardized using the mean and standard deviation of $V$ computed on the basis of a large number of simulated homogeneous regions. Hence, the statistic that measures the heterogeneity of a set of sites is given by:

$$H = \frac{V - \mu_{V_{sim}}}{\sigma_{V_{sim}}} \tag{21}$$

where $\mu_{V_{sim}}$ and $\sigma_{V_{sim}}$ are the mean and standard deviation of the $N_{sim}$ values of $V$ of simulated regions. The simulated regions are homogeneous, with sites having the same record lengths as their observed counterparts.

To avoid any subjective choice of the bivariate distribution on which the simulations are carried out to compute $\mu_{V_{sim}}$ and $\sigma_{V_{sim}}$, this bivariate distribution should be as general as possible and include most distributions commonly used in hydrology. Recall that in the univariate setting, a kappa distribution with 4 parameters is simulated by Hosking and Wallis (1993). If it exists, the extension of this distribution to the multivariate case requires a large number of parameters. Indeed, this distribution would possess at least 4 parameters for each variable along with the covariance parameters. To overcome these difficulties related to classical dependence measures, copulas are used at this level. In hydrology, particular classes of copulas are defined: the extreme-value copulas, characterized by a dependence function $A$, and the Archimedean copulas, determined by a generator function $\psi$. The dependence function $A$ may be estimated by several nonparametric methods existing in the literature. The reader is referred to Segers (2004) for a review of these methods. On the other hand, a convenient choice behind the bivariate extreme-value copula is the four-parameter kappa distribution for the marginals. These marginal distributions do not necessarily need to be from the same family. Aside from avoiding the subjective choice of a distribution, this avoids committing errors in the goodness-of-fit test along with errors of parameter estimation of the fitted distribution.

In order to generate samples from the variables $\left(X^{(1)}, X^{(2)}\right)$ according to an extreme-value copula with a general setting of the function $A$, Ghoudi et al. (1998) developed an algorithm of special interest when performing simulations. To summarize this algorithm, let $U_1$, $U_2$ be independent uniform random variables on $[0, 1]$ and $Z$ be a random variable with cumulative distribution function $G_Z$ and probability density function $g_Z$, where $G_Z(z) = z + z(1 - z) A'(z)/A(z)$, $0 \le z \le 1$. This algorithm consists of the following steps:

1. Simulate $Z$.
2. Given $Z$, take $W = U_1$ with probability $p(Z)$ and $W = U_1 U_2$ with probability $1 - p(Z)$, where $p(z) = z(1 - z) A''(z) / \left(A(z)\, g_Z(z)\right)$.
3. Set $X^{(1)} = W^{Z/A(Z)}$ and $X^{(2)} = W^{(1 - Z)/A(Z)}$.
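The three steps above can be sketched in Python for the Gumbel logistic dependence function (11). The closed forms used in the code are derived here by substituting (11) into the general formulas and are not quoted from the report: G_Z(z) = z^m / (z^m + (1 − z)^m), which can be inverted explicitly, and a constant probability p(z) = (m − 1)/m. The function name and the use of Python's `random.Random` generator are illustrative choices.

```python
import random


def sample_gumbel_logistic_copula(m, rng):
    """One pair (X1, X2) with uniform margins from the Gumbel logistic
    copula with dependence parameter m >= 1 (sketch under the closed
    forms stated in the lead-in, derived from A(t) in equation (11))."""
    # Step 1: simulate Z by inverting G_Z(z) = z^m / (z^m + (1 - z)^m).
    u = rng.random()
    a, b = u ** (1 / m), (1 - u) ** (1 / m)
    z = a / (a + b)
    # Step 2: W = U1 with probability (m - 1) / m, otherwise W = U1 * U2.
    u1, u2 = rng.random(), rng.random()
    w = u1 if rng.random() < (m - 1) / m else u1 * u2
    # Step 3: transform W with the exponents Z / A(Z) and (1 - Z) / A(Z).
    a_z = (z ** m + (1 - z) ** m) ** (1 / m)
    return w ** (z / a_z), w ** ((1 - z) / a_z)


rng = random.Random(2007)  # arbitrary seed, for reproducibility only
sample = [sample_gumbel_logistic_copula(1.41, rng) for _ in range(5)]
```

For m = 1 the probability in step 2 is zero and the two components are independent; as m grows, W equals U1 more and more often and the two components become nearly identical.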

It is important to take the numerical nonparametric smoothing into consideration when using this algorithm in practice, since it is also based on the first and second derivatives of the function $A$. Despite the general validity of this procedure, extra information about the model, e.g. a parametric form of $A$, can be useful to increase the speed and accuracy of the generation algorithm.

Depending on the value of $H$, a decision concerning the homogeneity of the observed region can be taken. Two scenarios can then be considered. In the first scenario, $H$ is considered as a statistical homogeneity test with a first kind error of 5%, as in Fill and Stedinger (2004) for univariate distributions. The rejection region of homogeneity is then $H > 1.64$. In the second scenario, $H$ is considered as a measure of heterogeneity, as in Hosking and Wallis (1993). In this case, a region is declared to be homogeneous if $H < 1$, acceptably homogeneous if $1 < H < 2$ and definitely heterogeneous if $H > 2$.

The extension of the univariate heterogeneity measure considered here concerns only the L-CV measure of variation. Other measures used in Hosking and Wallis (1993) can also be extended by following the same procedure and using the same tools.
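The computation of V in (20), its standardization in (21) and the two decision rules can be sketched as follows in Python. This is a minimal illustration under explicit assumptions: the Frobenius norm is used, the L-covariation matrices are supplied as nested lists, the standard deviation of the simulated V values uses the n − 1 divisor, and the boundary values H = 1 and H = 2, which the text leaves open, are assigned to the higher class. All function names are hypothetical.

```python
import math


def frobenius(a):
    """Frobenius norm, equation (19)."""
    return math.sqrt(sum(x * x for row in a for x in row))


def v_statistic(matrices, lengths):
    """Statistic V of equation (20) with the Frobenius norm."""
    total = sum(lengths)
    rows, cols = len(matrices[0]), len(matrices[0][0])
    # Record-length weighted regional mean of the site matrices.
    mean = [[sum(n * mat[r][c] for mat, n in zip(matrices, lengths)) / total
             for c in range(cols)] for r in range(rows)]
    acc = 0.0
    for mat, n in zip(matrices, lengths):
        diff = [[mat[r][c] - mean[r][c] for c in range(cols)] for r in range(rows)]
        acc += n * frobenius(diff) ** 2
    return math.sqrt(acc / total)


def heterogeneity(v_obs, v_sim):
    """Statistic H of equation (21) from simulated homogeneous regions."""
    mu = sum(v_sim) / len(v_sim)
    sigma = math.sqrt(sum((v - mu) ** 2 for v in v_sim) / (len(v_sim) - 1))
    return (v_obs - mu) / sigma


def rejects_homogeneity(h):
    """First scenario: homogeneity test at the 5% level."""
    return h > 1.64


def classify(h):
    """Second scenario: heterogeneity measure."""
    if h < 1:
        return "homogeneous"
    if h < 2:
        return "acceptably homogeneous"
    return "definitely heterogeneous"
```

With 1 × 1 matrices the function `v_statistic` returns the record-length weighted standard deviation of the at-site values, which is the univariate V of Hosking and Wallis (1993).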

4 ADAPTATION TO FLOODS

The discordancy and homogeneity tests defined in the previous section are general and can be applied to several hydrological phenomena such as floods, droughts and rain storms. In this section, the focus is on flood events as multivariate random events that are characterized by their peak $Q$, volume $V$ and duration $D$. These combined characteristics determine the severity of a flood and can be correlated. Figure 1 illustrates a typical hydrograph with these characteristics. In general, there exists a close correlation between flood peak and volume and between flood volume and duration, but there is little significant correlation between flood peak and duration, as noticed by Yue (2001b). Without loss of generality, the bivariate vector $(V, Q)$ is considered in this study.

In order to proceed to simulations, the joint distribution of the random vector $(V, Q)$ must be determined. In the remainder of the study, the univariate homogeneity tests for flood volumes and peaks are respectively denoted by $H_V$ and $H_Q$, and the corresponding discordancy statistics by $D_{i,V}$ and $D_{i,Q}$ for a site $i$.

In practice, flood peaks and flood volumes may be marginally represented by the Gumbel distribution, as illustrated by previous studies of real data from several sites. In Table 1, the cases studied by Yue et al. (1999), Yue (2001b), Yue and Rasmussen (2002) and Shiau (2003) are summarized. In the present study, both volume and peak variables are considered to be marginally Gumbel distributed. The Gumbel cumulative distribution function is given by:

$$F(x) = \exp\left\{ -\exp\left( -\frac{x - \beta}{\alpha} \right) \right\}, \quad x \text{ real},\ \alpha > 0 \text{ and } \beta \text{ real} \tag{22}$$
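Equation (22) and its inverse, which is needed to transform uniform variates into Gumbel variates, can be written in a few lines of Python. The sketch is illustrative; the function names are hypothetical, and the parameter values in the example are the volume parameters α = 300 and β = 1240 used later in the simulation study.

```python
import math


def gumbel_cdf(x, alpha, beta):
    """Gumbel cumulative distribution function, equation (22)."""
    return math.exp(-math.exp(-(x - beta) / alpha))


def gumbel_quantile(u, alpha, beta):
    """Inverse of equation (22), for 0 < u < 1."""
    return beta - alpha * math.log(-math.log(u))


median_volume = gumbel_quantile(0.5, 300.0, 1240.0)
print(median_volume, gumbel_cdf(median_volume, 300.0, 1240.0))
```

At x = β the distribution function equals exp(−1), so the location parameter is the quantile of order about 0.37.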

Figure 1 Typical flood hydrograph (flow against time, with flood peak Q, flood volume V and flood duration D indicated)

Table 1 Flood volume and peak models for some basins considered in the literature

Basin | n | Variables | α | β | Model | m | ρ | Reference
Pachang river, Taiwan (small basin) | 39 | Q (m³/s), V (day·m³/s) | | | Gumbel logistic | 1.294 | 0.403 | Shiau (2003)
Skootamatta, ON, CA | 41 | Q (m³/s), V (day·m³/s) | | | Gumbel logistic | 1.414 | 0.5 | Yue and Rasmussen (2002)
Harricana, QC, CA | 63 | Q (m³/s), V (day·m³/s) | | | Gumbel logistic | 1.969 | 0.742 | Yue (2001b)
Ashuapmushuan, QC, CA | 33 | Q (m³/s), V (day·m³/s) | | | Gumbel mixed | * | 0.596 | Yue et al. (1999)

* This model is not defined with a parameter m.

The dependence between $V$ and $Q$ may be modeled by the so-called Gumbel logistic model, expressed according to the following copula:

$$C_m(x, y) = \exp\left\{ -\left[ (-\log x)^m + (-\log y)^m \right]^{1/m} \right\}, \quad m \ge 1 \text{ and } 0 \le x, y \le 1 \tag{23}$$

where $m$ is the dependence parameter, which is related to the correlation coefficient $\rho$ by the relationship $m = 1/\sqrt{1 - \rho}$, $0 \le \rho < 1$. Fortunately, the corresponding Gumbel logistic copula $C_m$ is an extreme-value copula with dependence function $A(t) = \left(t^m + (1 - t)^m\right)^{1/m}$, and it is also an Archimedean copula with generator function $\psi(t) = (-\log t)^m$.

Zhang and Singh (2006) showed the superiority of the Gumbel logistic copula for modeling the flood volume and peak distribution, using the Akaike Information Criterion (AIC). They compared its performance with four other kinds of copulas and also with non-copula models such as the mixed Gumbel model and the Box-Cox transformation. The Gumbel logistic model is also considered by De Michele et al. (2005) with Fréchet extreme-value marginal distributions. The marginal distributions are not necessarily the same, which is one of the advantages of copula modeling. An example of such a situation is treated by El Adlouni et al. (2004).
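The relationship between m and ρ can be checked against the pairs listed in Table 1, read with their decimal points as ρ = 0.403, 0.5 and 0.742 for m = 1.294, 1.414 and 1.969. The Python lines below are only an illustrative check; the function name is hypothetical.

```python
import math


def dependence_parameter(rho):
    """Gumbel logistic dependence parameter m = 1 / sqrt(1 - rho), 0 <= rho < 1."""
    if not 0 <= rho < 1:
        raise ValueError("rho must satisfy 0 <= rho < 1")
    return 1 / math.sqrt(1 - rho)


for rho in (0.403, 0.5, 0.742):
    print(rho, round(dependence_parameter(rho), 3))
```

The value ρ = 0 gives m = 1, the independence case of the copula (23).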


5 SIMULATION STUDY

The objective of the simulation study is to evaluate the performance of the proposed homogeneity test $H$ defined by equation (21), as well as to illustrate the use of the discordancy statistic $\|D_i\|$, where $D_i$ is given by (13). Once the model is defined by the marginal distributions (22) and the copula (23), the corresponding parameters must be specified. The parameters selected for the simulation study are those of the Skootamatta basin in Ontario, Canada (Yue and Rasmussen, 2002). Figure 2 illustrates the geographical location of the Skootamatta basin in the province of Ontario, Canada. The gauging station 02HL004 is near the outlet of the basin at latitude [...] N and longitude [...] W. A homogeneous region with parameters $\alpha_V = 300$, $\beta_V = 1240$, $\alpha_Q = 16$, $\beta_Q = 52$ and $m = 1.41$ (which is equivalent to $\rho = 0.5$) is hence defined.

The steps of the simulation concerning the homogeneity tests are presented herein. Several elements of the procedure are inspired by Hosking and Wallis (1993):

1. Simulation of regions: Generate a region for both variables volume $V$ and peak $Q$ with $N$ sites ($N$ = 10, 15, 20 and 30) and a fixed record length $n_i = n = 30$ for each site $i$. In order to study the impact of the record length on the performance of the tests, the case $n_i = n = 60$ is investigated for $N = 30$. Four cases are also considered when the record length varies from site to site with $N = 21$: $n_i = 10, 12, \ldots, 50$; $n_i = 50, 48, \ldots, 10$; $n_i = 10, 14, \ldots, 50, 46, \ldots, 10$ and $n_i = 50, 46, \ldots, 10, 14, \ldots, 50$.

Figure 2 Geographical location of the Skootamatta basin in Ontario, Canada

To generate heterogeneous regions, without loss of generality, the location parameter $\beta$ in the Gumbel marginal distributions (22) is fixed since V is location invariant, and then different values of the parameters $\alpha_V$, $\alpha_Q$ and m are considered. The variation of these parameters in the same region suggests several types of regions to be generated, and the following are selected:

a. Homogeneous: All parameters of the model are the same for all sites in the region.

b. Completely heterogeneous: All parameters increase linearly from the first to the N-th site in the 50% range centered around the homogeneous region parameters, e.g., for $\alpha_V = 300$ the variation is in the range $[300(1 - 0.5/2),\ 300(1 + 0.5/2)] = [225, 375]$.

c. Heterogeneous on the marginal parameters: It is the same as the above region but the dependence parameter m is fixed and the variation is on the parameters $\alpha_V$ and $\alpha_Q$.

d. Heterogeneous on the dependence parameter: The marginal parameters $\alpha_V$ and $\alpha_Q$ are fixed and the variation is on the dependence parameter m. This region is marginally homogeneous.

e. Completely bimodal: Two groups of parameters are defined by the two limits of a 30% range centered on the parameters of the homogeneous region; half the sites have high values of all parameters $\alpha_V$, $\alpha_Q$ and m while the other half have low values of these parameters, e.g., for $\alpha_V = 300$ the lower value is $300(1 - 0.3/2) = 255$ and the higher one is $300(1 + 0.3/2) = 345$.

f. Bimodal on the marginal parameters: The dependence parameter m is fixed and the marginal parameters $\alpha_V$ and $\alpha_Q$ take the two-sided values as in the above region.

g. Bimodal on the dependence parameter: It is the same as the completely bimodal region with fixed marginal parameters $\alpha_V$ and $\alpha_Q$, and the dependence parameter m takes the two-sided values.
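The region types a to g can be sketched as per-site parameter tables. The sketch below is illustrative only: the helper names are hypothetical, and the ordering of the two groups in the bimodal case (low values first, high values last) is an assumption, since the text only states that half the sites are high and half are low.

```python
# Scale and dependence parameters of the homogeneous region quoted in the text.
BASE = {"alpha_V": 300.0, "alpha_Q": 16.0, "m": 1.41}
MARGINAL = ("alpha_V", "alpha_Q")
DEPENDENCE = ("m",)


def linear_values(center, rel_range, n_sites):
    """Increase linearly from center*(1 - r/2) at site 1 to center*(1 + r/2) at site N."""
    lo = center * (1.0 - rel_range / 2.0)
    hi = center * (1.0 + rel_range / 2.0)
    if n_sites == 1:
        return [center]
    return [lo + (hi - lo) * k / (n_sites - 1) for k in range(n_sites)]


def bimodal_values(center, rel_range, n_sites):
    """Two groups at the limits of the range (assumed order: low group first)."""
    lo = center * (1.0 - rel_range / 2.0)
    hi = center * (1.0 + rel_range / 2.0)
    half = n_sites // 2
    return [lo] * half + [hi] * (n_sites - half)


def region_parameters(n_sites, varying, rel_range, bimodal=False):
    """Return one parameter dict per site; names in `varying` change, others stay at BASE."""
    spread = bimodal_values if bimodal else linear_values
    columns = {
        name: spread(value, rel_range, n_sites) if name in varying else [value] * n_sites
        for name, value in BASE.items()
    }
    return [{name: columns[name][i] for name in BASE} for i in range(n_sites)]


if __name__ == "__main__":
    region_b = region_parameters(10, BASE.keys(), 0.5)                # completely heterogeneous
    region_d = region_parameters(10, DEPENDENCE, 0.5)                 # heterogeneous on m only
    region_f = region_parameters(10, MARGINAL, 0.3, bimodal=True)     # bimodal on the marginals
    print(region_b[0], region_b[-1])
    print(region_d[0], region_f[0])
```

For region b with N = 10 the first and last sites receive $\alpha_V$ = 225 and 375, matching the range given in the text.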

In order to generate a bivariate Gumbel logistic sample with uniform marginal distributions, the algorithm of Ghoudi et al. (1998) described in Section 3.2 is used. Then, the desired sample is obtained with the quantile transformation $F^{-1}$ of the marginal distributions.

2. Computation of the homogeneity statistics: Compute the H statistics $H_V$, $H_Q$ and $H_{\|\cdot\|}$ with each norm, on P = 100 generated regions as in step 1. Note that no significant differences have been observed with other values of P, such as P = 150.

3. Performance evaluation: The rates of the values of each statistic $H$ ($H_V$, $H_Q$ or $H_{\|\cdot\|}$) found in step 2 are computed according to the corresponding conditions to be satisfied. The rate is the ratio of the number of samples where the H value satisfies the desired condition to the total number of generated samples. That is, when the statistic is considered as a homogeneity test, its power is computed, i.e., the rate of H > 1.64; and when it is considered as a heterogeneity measure, the rates of the occurrences H < 1, 1 < H < 2 and H > 2 are computed.

It is important to note that it is advised to use a kappa distribution to simulate the marginals. However, in the simulation study to evaluate the performances of the tests, a Gumbel distribution is used. This is justified by the fact that the treated characteristics are specific variables and the "real regions to be tested" are simulated. Hence, sources of error are reduced since there is no goodness-of-fit testing or parameter estimation. Note that Hosking and Wallis (1993) used the GEV distribution rather than kappa to evaluate the performance of their test in a general univariate framework.
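The sampling part of step 1 can be sketched as follows. This is not the algorithm of Ghoudi et al. (1998); as a stand-in, the sketch uses the standard construction for Archimedean copulas based on the Kendall distribution function, which for the generator $\psi(t) = (-\log t)^m$ is $K(w) = w(1 - \log w / m)$. The Gumbel marginal is assumed to have the form $F(x) = \exp\{-\exp[-(x - \beta)/\alpha]\}$ with scale $\alpha$ and location $\beta$; all function names are hypothetical.

```python
import math
import random


def kendall_inverse(q, m, tol=1e-12):
    """Solve K(w) = w * (1 - log(w) / m) = q for w in (0, 1) by bisection."""
    lo, hi = 0.0, 1.0
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if mid * (1.0 - math.log(mid) / m) < q:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def sample_gumbel_logistic(n, m, rng):
    """Draw n pairs (u, v) with uniform margins and Gumbel logistic dependence."""
    eps = 1e-12  # keeps u and v strictly inside (0, 1) for the quantile step
    pairs = []
    for _ in range(n):
        s = rng.random()                       # uniform split of psi(w) between u and v
        w = kendall_inverse(rng.random(), m)   # w = C(u, v), distributed as K
        u = w ** (s ** (1.0 / m))              # psi^{-1}(s * psi(w))
        v = w ** ((1.0 - s) ** (1.0 / m))      # psi^{-1}((1 - s) * psi(w))
        pairs.append((min(max(u, eps), 1 - eps), min(max(v, eps), 1 - eps)))
    return pairs


def gumbel_quantile(u, alpha, beta):
    """Inverse of the assumed Gumbel cdf F(x) = exp(-exp(-(x - beta) / alpha))."""
    return beta - alpha * math.log(-math.log(u))


def simulate_site(n, alpha_v, beta_v, alpha_q, beta_q, m, rng):
    """Return n (volume, peak) pairs for one site."""
    return [
        (gumbel_quantile(u, alpha_v, beta_v), gumbel_quantile(v, alpha_q, beta_q))
        for u, v in sample_gumbel_logistic(n, m, rng)
    ]


if __name__ == "__main__":
    rng = random.Random(2007)
    site = simulate_site(30, 300.0, 1240.0, 16.0, 52.0, 1.41, rng)
    print(site[:3])
```

A simple check of the sampler is to compare the empirical frequency of the event {u ≤ 0.5, v ≤ 0.5} with the copula value $C_m(0.5, 0.5) = 0.5^{2^{1/m}}$.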

In order to illustrate the discordancy measure $D_i$, regions are simulated which contain N = 20 sites, each with record length n = 30. Due to space limitations, the exercise was not carried out for all cases of regions and values of n and N. On the basis of P = 100 generated regions for each case, the mean values of $D_{i,V}$, $D_{i,Q}$ and $D_i$ for each site are computed. In the generated regions, all sites have the same parameters as the homogeneous region a in the previous procedure, except some discordant sites (sites number 1, 6, 11 and 16). The parameter values of these sites equal ten times the values of the other sites. The difference in these sites concerns all parameters (complete discordancy), the marginal parameters (marginal discordancy) or only the dependence parameter (dependence discordancy). A set of P = 100 homogeneous regions is also generated for comparison.


6 SIMULATION RESULTS

The discordancy measure indicates the sites with gross errors that may cause the heterogeneity in the generated regions. Table 2 gives the discordancy values $D_{i,V}$, $D_{i,Q}$ and $D_i$ of the sites in the described regions for n = 30 and N = 20. Only results with the spectral norm $\|\cdot\|_2$ are presented since no significant difference with the other norms is observed.

Results in Table 2 show that, for the homogeneous regions, all sites have approximately the same values of the discordancy statistics in both the univariate and bivariate cases. When the discordant sites are affected on all parameters, the four supposed discordant sites (1, 6, 11 and 16) are detected, with higher values of the bivariate statistic than of the univariate statistics, which have close values. This is due to the fact that, in the bivariate samples, the dependence parameter m causes more discordancy. When the discordancy affects the marginal parameters, the four sites are also detected, with slightly lower values in the bivariate case compared to those of completely discordant sites. Finally, when the discordancy affects only the dependence parameter, none of the four sites is detected by the univariate statistics, whose values are similar to those of homogeneous regions, whereas the bivariate statistic detects only one of the sites supposed to be discordant. This can be explained by the fact that there are no differences between sites on the marginals, and the discordancy caused by the dependence parameter alone is not enough to be detected by the discordancy test since it represents little information compared to all the information given by the remaining unaffected parameters.

Table 2 Discordancy values of sites in the described regions, n = 30, N = 20

Column groups: Hom = homogeneous regions; Co = regions with completely discordant sites; Ma = regions with marginally discordant sites; De = regions with dependence discordant sites. $D_i$ is computed with the spectral norm.

| Site i | Hom $D_{i,V}$ | Hom $D_{i,Q}$ | Hom $D_i$ | Co $D_{i,V}$ | Co $D_{i,Q}$ | Co $D_i$ | Ma $D_{i,V}$ | Ma $D_{i,Q}$ | Ma $D_i$ | De $D_{i,V}$ | De $D_{i,Q}$ | De $D_i$ |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 1.03 | 1.04 | 1.54 | 2.36 | 2.51 | 3.92 | 2.46 | 2.22 | 3.57 | 0.92 | 0.90 | 1.26 |
| 2 | 0.96 | 1.01 | 1.37 | 0.63 | 0.79 | 1.16 | 0.61 | 0.66 | 1.05 | 0.92 | 0.97 | 1.39 |
| 3 | 0.95 | 0.91 | 1.37 | 0.69 | 0.68 | 1.07 | 0.80 | 0.72 | 1.07 | 0.99 | 0.76 | 1.32 |
| 4 | 0.86 | 0.93 | 1.43 | 0.67 | 0.71 | 1.05 | 0.69 | 0.67 | 1.01 | 0.97 | 0.97 | 1.41 |
| 5 | 0.95 | 0.89 | 1.48 | 0.69 | 0.76 | 1.06 | 0.71 | 0.80 | 1.01 | 0.83 | 0.94 | 1.33 |
| 6 | 0.91 | 0.91 | 1.29 | 1.58 | 1.55 | 2.62 | 1.42 | 1.51 | 2.36 | 0.92 | 0.86 | 1.35 |
| 7 | 1.10 | 0.95 | 1.46 | 0.67 | 0.70 | 0.98 | 0.70 | 0.64 | 0.98 | 0.95 | 1.00 | 1.37 |
| 8 | 0.84 | 1.06 | 1.42 | 0.78 | 0.62 | 1.13 | 0.63 | 0.64 | 1.03 | 1.08 | 0.91 | 1.37 |
| 9 | 0.87 | 0.92 | 1.34 | 0.63 | 0.75 | 1.06 | 0.69 | 0.68 | 1.10 | 0.93 | 0.88 | 1.… |
| 10 | ….08 | 1.05 | 1.45 | 0.69 | 0.67 | 1.14 | 0.68 | 0.71 | 1.03 | 0.97 | 0.96 | 1.… |
| 11 | ….93 | 0.95 | 1.35 | 2.25 | 2.20 | 3.90 | 2.43 | 2.49 | 3.75 | 1.04 | 1.09 | 1.… |
| 12 | ….90 | 0.86 | 1.32 | 0.77 | 0.81 | 1.23 | 0.85 | 0.70 | 1.07 | 0.94 | 1.03 | 1.… |
| 13 | ….96 | 0.87 | 1.31 | 0.68 | 0.61 | 1.10 | 0.65 | 0.72 | 1.06 | 1.00 | 0.91 | 1.… |
| 14 | ….99 | 0.89 | 1.35 | 0.59 | 0.69 | 1.09 | 0.64 | 0.77 | 1.06 | 0.93 | 0.88 | 1.… |
| 15 | ….02 | 0.95 | 1.37 | 0.80 | 0.56 | 1.02 | 0.68 | 0.71 | 1.09 | 0.87 | 0.97 | 1.… |
| 16 | ….85 | 0.92 | 1.42 | 1.79 | 1.69 | 2.84 | 1.73 | 1.66 | 2.76 | 1.07 | 1.02 | 1.… |
| 17 | ….94 | 0.94 | 1.34 | 0.60 | 0.62 | 1.03 | 0.65 | 0.62 | 1.09 | 0.91 | 0.98 | 1.… |
| 18 | ….90 | 0.95 | 1.41 | 0.71 | 0.70 | 1.09 | 0.65 | 0.73 | 1.03 | 0.93 | 0.92 | 1.… |
| 19 | ….99 | 1.00 | 1.43 | 0.66 | 0.65 | 1.18 | 0.58 | 0.65 | 0.99 | 0.89 | 1.09 | 1.… |
| 20 | ….98 | 1.00 | 1.42 | 0.75 | 0.76 | 1.11 | 0.76 | 0.72 | 1.08 | 0.96 | 0.98 | 1.35 |

Numbers written in bold character in the first column indicate prior discordant sites; in the other columns they represent the detected discordant sites.

Tables 3 to 7 present the simulation results for homogeneity testing for all configurations of regions and for the various values of n and N. Selected illustrations of the test power and the rates of the heterogeneity measure are presented respectively in Figures 3a to 3c and Figure 4 for the considered regions and for n = 30.

The statistic $H$ based on the norm other than $\|\cdot\|_1$, $\|\cdot\|_2$ and $\|\cdot\|_F$ leads almost always to the lowest power value among the various statistics $H$. The corresponding loss of power may reach 20%; nevertheless, this statistic leads to relatively good performances under homogeneity. Due to space limitations, results for this statistic are not presented in Figures 3a-3c. Given the similarity in the results of $H_{\|\cdot\|_1}$, $H_{\|\cdot\|_2}$ and $H_{\|\cdot\|_F}$, only results of $H_{\|\cdot\|_2}$ are presented in Tables 3-7.

In the case of generated homogeneous regions, the levels (first kind errors) of the H statistics are presented in Table 3. The mean values of the H statistics are close to zero even for small regions (N = 10) and the regions are actually identified to be homogeneous. More than 81% of the samples are identified to be homogeneous with both univariate and bivariate statistics. Therefore, there are no significant differences between the results of the univariate and bivariate tests. Note that, occasionally, small negative values of the H statistics are obtained for homogeneous regions. From relation (21), this may happen when the region is less dispersed, in the sense of V, than what would be expected from the simulated homogeneous regions that serve to compute $\mu_{V_{\mathrm{sim}}}$.

Results concerning the three 50% heterogeneous regions with site record length n = 30 are presented in Table 4. For a given region with size N, the power, the rates and the statistic mean values are presented for each statistical test H. The corresponding powers are illustrated in Figures 3a-3c and the occurrence rates of the heterogeneity on the dependence are presented in Figure 4. Similar figures can be produced from Table 4.
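The quantities reported in Tables 3 to 7 can be sketched in a few lines. Relation (21) is not reproduced in this section, so the sketch assumes the usual Hosking and Wallis form $H = (V_{\mathrm{obs}} - \mu_{V_{\mathrm{sim}}}) / \sigma_{V_{\mathrm{sim}}}$, where the mean and standard deviation come from simulated homogeneous regions. The function names and the toy values are hypothetical.

```python
import statistics


def heterogeneity_statistic(v_obs, v_sim):
    """Assumed form of relation (21): standardize the observed dispersion V
    by the mean and standard deviation of V over simulated homogeneous regions."""
    mu_sim = statistics.mean(v_sim)
    sigma_sim = statistics.stdev(v_sim)
    return (v_obs - mu_sim) / sigma_sim


def summarize(h_values, critical=1.64):
    """Rates used in the tables: power (or level) and the three heterogeneity classes."""
    n = len(h_values)
    return {
        "power": sum(h > critical for h in h_values) / n,    # rate of H > 1.64
        "hom": sum(h < 1.0 for h in h_values) / n,            # H < 1
        "poss_hom": sum(1.0 <= h < 2.0 for h in h_values) / n,  # 1 <= H < 2
        "not_hom": sum(h >= 2.0 for h in h_values) / n,       # H >= 2
        "mean": statistics.mean(h_values),
    }


if __name__ == "__main__":
    # A region less dispersed than the simulated ones gives a negative H.
    print(heterogeneity_statistic(1.0, [2.0, 3.0, 4.0]))
    print(summarize([0.5, 1.5, 2.5, 1.7]))
```

Under homogeneity the "power" entry is the empirical first kind error, which is why the same rate is labeled Level in Table 3 and Power in Tables 4 and 5.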

Table 3 Simulation results for homogeneity test when regions are homogeneous, n = 30

Columns: Region type; Region size N; Stat; Level* (%); Hom (%); Poss Hom (%); Not Hom (%); Stat Mean Value
Rows: region type Hom; for each region size N, one line per statistic $H_{\|\cdot\|_2}$, $H_V$ and $H_Q$.

Stat: Statistic; Hom: Homogeneous; Poss Hom: Possibly Homogeneous
*Under homogeneity, the computed probability corresponds to the empirical first kind error.

For the completely heterogeneous regions and for any fixed N, the mean value of $H_{\|\cdot\|_2}$ is larger than those of $H_V$ and $H_Q$, as can be seen in the first part of Table 4. Using $H_{\|\cdot\|_2}$, with mean values larger than 2, the region is correctly indicated to be definitely heterogeneous even for small values of N. In contrast, the univariate statistics $H_V$ and $H_Q$, with mean values between 1.36 and 2, wrongly indicate that the region is possibly homogeneous. From Figure 3a, the bivariate tests $H$ lead to equivalent powers which increase from 70% to 90% with respect to N. The power of both $H_V$ and $H_Q$ also increases, but in the range 30%-60%. This indicates the superiority of $H_{\|\cdot\|_2}$ in comparison to $H_V$ and $H_Q$.

Table 4 Simulation results for homogeneity test when regions are 50% heterogeneous with n = 30

Columns: Region type; Region size N; Stat; Power (%); Hom (%); Poss Hom (%); Not Hom (%); Stat Mean Value
Rows: region types He Co, He Ma and He De; for each region size N, one line per statistic $H_{\|\cdot\|_2}$, $H_V$ and $H_Q$.

Stat: Statistic; Hom: Homogeneous; Poss Hom: Possibly Homogeneous; He: Heterogeneous; Co: Completely; Ma: Marginally; De: Dependence

Figure 3a Test power for completely heterogeneous region with n = 30 (power in % versus N; curves for $H_{\|\cdot\|_1}$, $H_{\|\cdot\|_2}$, $H_{\|\cdot\|_F}$, $H_V$ and $H_Q$)

In the second part of Table 4, results about marginal heterogeneity are presented. For a fixed N, the H mean values are approximately of the same order for all statistics. On the other hand, for each H statistic, the corresponding mean value increases with respect to N. Since all mean values range between 1 and 2, all procedures partially fail to indicate the right kind of heterogeneity. In terms of power, Figure 3b shows that all the H homogeneity tests indicate the heterogeneity on the marginals with approximately the same power. Generally, powers increase with N in the range 35%-60%.

Figure 3b Test power for marginally heterogeneous region with n = 30 (power in % versus N; curves for $H_{\|\cdot\|_1}$, $H_{\|\cdot\|_2}$, $H_{\|\cdot\|_F}$, $H_V$ and $H_Q$)

The simulation results, when the generated regions are heterogeneous on the dependence parameter m, are presented in the last part of Table 4. The $H_{\|\cdot\|_2}$ mean values increase slightly with N whereas the mean values of $H_V$ and $H_Q$ are almost constant. With mean values of H less than 1, the bivariate and univariate approaches fail to indicate the right kind of heterogeneity. In Figure 3c, we observe that, despite the low power of the bivariate tests, they perform clearly better than the univariate tests, which fail to indicate any heterogeneity in that region. The power values of the univariate tests are around 5%, which corresponds to the first kind error. Therefore, these results are not necessarily expected to increase with respect to N, since under the univariate tests the regions are viewed as homogeneous. Figure 4 illustrates clearly the gain of the bivariate tests compared to the univariate tests, in terms of the occurrence rates.

Indeed, the first bar in each cell of Figure 4 is very high for the univariate statistics in comparison with those of the bivariate statistics. This means that the univariate tests lead to false conclusions concerning the heterogeneity on the dependence parameter.

Figure 3c Test power for dependence heterogeneous region with n = 30 (power in % versus N; curves for $H_{\|\cdot\|_1}$, $H_{\|\cdot\|_2}$, $H_{\|\cdot\|_F}$, $H_V$ and $H_Q$)

Figure 4 Rates of heterogeneity measure for dependence heterogeneous region with n = 30 (one panel per region size N; statistics $H_{\|\cdot\|_1}$, $H_{\|\cdot\|_2}$, $H_{\|\cdot\|_F}$, $H_V$ and $H_Q$)

From the previous remarks, it is important to make the following observation: when the heterogeneity is on the two marginal parameters $\alpha_V$ and $\alpha_Q$, the statistical test H has a lower power than in complete heterogeneity on all three parameters $\alpha_V$, $\alpha_Q$ and m. However, powers in both of the above cases are higher than in heterogeneity on the dependence parameter m alone. This can be explained by the fact that the higher the heterogeneity in terms of the number of parameters to be varied, the higher the power will be.

The arguments previously developed for heterogeneous regions can also be presented when the regions are bimodal. Hence, the following discussion will be brief. The results of the homogeneity tests for the various 30% bimodal regions with site record length n = 30 are presented in Table 5, from which illustrations similar to Figures 3a-3c can be derived for powers. As shown in the first part of Table 5, for the completely bimodal regions, the power of the H statistics ranges between 70% and 90%. These values are considerably higher than the power of $H_V$ and $H_Q$. The second part of Table 5 shows that, when the bimodality is on the marginals, both univariate and bivariate tests have similar power values, generally less than 55%. The last part of Table 5 indicates that the bimodality on the dependence parameter m cannot be detected by the univariate tests. On the other hand, bivariate tests identify the heterogeneity on the dependence parameter m but with a low power. This power ranges between 15% and 30% for n = 30.

The total record length of sites in the region has an effect on the performance of homogeneity tests. That is, the larger the total record length in the region, the higher the power will be. Indeed, from Table 6, it can be seen that the power of $H_{\|\cdot\|_2}$ is very high (96%) when $n_i = 50, 46, \ldots, 10, 14, \ldots, 50$, where the total record length is 650. Both regions with $n_i = 10, 12, \ldots, 50$ and $n_i = 50, 48, \ldots, 10$ correspond to moderate powers (69% and 78% respectively), where the total record length is 630. Finally, when $n_i = 10, 14, \ldots, 50, 46, \ldots, 10$, the power is low (57%) as this corresponds to the shortest total record length of 610. In all these variations of n, the power of $H_{\|\cdot\|_2}$ is always higher than the power of both $H_V$ and $H_Q$.

Table 7 shows that when site record lengths are high (n = 60), except for the case of heterogeneity on the dependence parameter m, the power of all tests is very high and can reach 100% for $H_{\|\cdot\|_2}$. The first kind error is then very close to the theoretical value of 5% for both univariate and bivariate tests. For dependence heterogeneous or dependence bimodal regions, the power of $H_{\|\cdot\|_2}$ increases considerably, from 20% when n = 30 to 50% for n = 60. On the other hand, for such regions, $H_V$ and $H_Q$ always give false results.

Finally, if a choice has to be made among the various norms, the authors suggest the adoption of the norm $\|\cdot\|_2$ for both the homogeneity and discordancy tests. The reason for this choice is that, statistically, the norm $\|\cdot\|_2$ is the most appropriate to properly quantify the variability.
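The total record lengths quoted for the four variable-length configurations (N = 21) can be checked directly. The sketch below is only an arithmetic illustration; the dictionary keys are hypothetical labels, not names used in the report.

```python
# Site record lengths n_i for the four variable-length cases with N = 21 sites.
patterns = {
    "increasing": list(range(10, 51, 2)),                                   # 10, 12, ..., 50
    "decreasing": list(range(50, 9, -2)),                                   # 50, 48, ..., 10
    "peak_in_middle": list(range(10, 51, 4)) + list(range(46, 9, -4)),      # 10, 14, ..., 50, 46, ..., 10
    "valley_in_middle": list(range(50, 9, -4)) + list(range(14, 51, 4)),    # 50, 46, ..., 10, 14, ..., 50
}

for label, lengths in patterns.items():
    # Each configuration has 21 sites; only the total record length differs.
    print(f"{label}: N = {len(lengths)}, total record length = {sum(lengths)}")
```

The totals are 630, 630, 610 and 650, in agreement with the values discussed above.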

Table 5 Simulation results for homogeneity test when regions are 30% bimodal with n = 30

Columns: Region type; Region size; Stat; Power; Hom; Poss Hom; Not Hom; Stat Mean
Rows: region types Bi Co, Bi Ma and Bi De; for each region size N, one line per statistic $H_{\|\cdot\|_2}$, $H_V$ and $H_Q$.

Stat: Statistic; Hom: Homogeneous; Poss Hom: Possibly Homogeneous; Bi: Bimodal; Co: Completely; Ma: Marginally; De: Dependence


More information

How Significant is the BIAS in Low Flow Quantiles Estimated by L- and LH-Moments?

How Significant is the BIAS in Low Flow Quantiles Estimated by L- and LH-Moments? How Significant is the BIAS in Low Flow Quantiles Estimated by L- and LH-Moments? Hewa, G. A. 1, Wang, Q. J. 2, Peel, M. C. 3, McMahon, T. A. 3 and Nathan, R. J. 4 1 University of South Australia, Mawson

More information

Calibration Estimation of Semiparametric Copula Models with Data Missing at Random

Calibration Estimation of Semiparametric Copula Models with Data Missing at Random Calibration Estimation of Semiparametric Copula Models with Data Missing at Random Shigeyuki Hamori 1 Kaiji Motegi 1 Zheng Zhang 2 1 Kobe University 2 Renmin University of China Econometrics Workshop UNC

More information

Multivariate Time Series Analysis and Its Applications [Tsay (2005), chapter 8]

Multivariate Time Series Analysis and Its Applications [Tsay (2005), chapter 8] 1 Multivariate Time Series Analysis and Its Applications [Tsay (2005), chapter 8] Insights: Price movements in one market can spread easily and instantly to another market [economic globalization and internet

More information

Bivariate generalized Pareto distribution

Bivariate generalized Pareto distribution Bivariate generalized Pareto distribution in practice Eötvös Loránd University, Budapest, Hungary Minisymposium on Uncertainty Modelling 27 September 2011, CSASC 2011, Krems, Austria Outline Short summary

More information

Imputation Algorithm Using Copulas

Imputation Algorithm Using Copulas Metodološki zvezki, Vol. 3, No. 1, 2006, 109-120 Imputation Algorithm Using Copulas Ene Käärik 1 Abstract In this paper the author demonstrates how the copulas approach can be used to find algorithms for

More information
