Knowable moments and K-climacogram

Similar documents
Scaling properties of fine resolution point rainfall and inferences for its stochastic modelling

Scaling properties of fine resolution point rainfall and inferences for its stochastic modelling

Parsimonious entropy-based stochastic modelling for changing hydroclimatic processes

Stochastic disaggregation of spatial-temporal rainfall with limited data

Stochastic similarities between the microscale of turbulence and hydrometeorological processes

Spatial and temporal variability of wind speed and energy over Greece

Review of existing statistical methods for flood frequency estimation in Greece

Basic Probability space, sample space concepts and order of a Stochastic Process

A world-wide investigation of the probability distribution of daily rainfall

Memory in climate and things not to be forgotten

Long term properties of monthly atmospheric pressure fields

Stochastic investigation of wind process for climatic variability identification

Long-term properties of annual maximum daily rainfall worldwide

Special concepts of probability theory in geophysical applications

Rainfall Disaggregation Methods: Theory and Applications

Random musings on stochastics (Lorenz Lecture)

Rainfall Disaggregation Methods: Theory and Applications

Estimators for the binomial distribution that dominate the MLE in terms of Kullback Leibler risk

TEMPORAL RAINFALL DISAGGREGATION AT THE TIBER RIVER, ITALY

From fractals to stochastics: Seeking theoretical consistency in analysis of geophysical data

p(z)

Multivariate rainfall disaggregation at a fine time scale

INDIAN INSTITUTE OF SCIENCE STOCHASTIC HYDROLOGY. Lecture -27 Course Instructor : Prof. P. P. MUJUMDAR Department of Civil Engg., IISc.

Hydrological statistics for engineering design in a varying climate

Fourier and Stats / Astro Stats and Measurement : Stats Notes

If we want to analyze experimental or simulated data we might encounter the following tasks:

Multivariate Normal-Laplace Distribution and Processes

Predictability in dice motion: how does it differ from hydrometeorological processes?

Chapter 4 Sequences and Series

AP Calculus (BC) Chapter 9 Test No Calculator Section Name: Date: Period:

Hurst-Kolomogorov Dynamics and Uncertainty Demetris Koutsoyiannis, National Technical University, Athens

Simple IDF Estimation Under Multifractality

Stationary independent increments. 1. Random changes of the form X t+h X t fixed h > 0 are called increments of the process.

Why is the field of statistics still an active one?

Uses of Asymptotic Distributions: In order to get distribution theory, we need to norm the random variable; we usually look at n 1=2 ( X n ).

In defence of stationarity

Introduction to Statistical Analysis

SYSM 6303: Quantitative Introduction to Risk and Uncertainty in Business Lecture 4: Fitting Data to Distributions

Bivariate Rainfall and Runoff Analysis Using Entropy and Copula Theories

Long-range dependence

Spring 2012 Math 541B Exam 1

Week 1 Quantitative Analysis of Financial Markets Distributions A

12.1 Arithmetic Progression Geometric Progression General things about sequences

Multivariate Distribution Models

Section 9.7 and 9.10: Taylor Polynomials and Approximations/Taylor and Maclaurin Series

PROBABILITY DISTRIBUTION

Financial Econometrics and Quantitative Risk Managenent Return Properties

Concentration of Measures by Bounded Couplings

Lecture 4: Probability and Discrete Random Variables

CS242: Probabilistic Graphical Models Lecture 7B: Markov Chain Monte Carlo & Gibbs Sampling

f (1 0.5)/n Z =

Journal of Hydrology

Modern Methods of Data Analysis - WS 07/08

Statistics GIDP Ph.D. Qualifying Exam Theory Jan 11, 2016, 9:00am-1:00pm

where r n = dn+1 x(t)

Using R in Undergraduate and Graduate Probability and Mathematical Statistics Courses*

Statistical Methods in HYDROLOGY CHARLES T. HAAN. The Iowa State University Press / Ames

Battle of extreme value distributions: A global survey on extreme daily rainfall

Math-Stat-491-Fall2014-Notes-V

Risks from dismissing stationarity

Limits at Infinity. Horizontal Asymptotes. Definition (Limits at Infinity) Horizontal Asymptotes

Extreme Value Analysis and Spatial Extremes

4 Moment generating functions

Econ 508B: Lecture 5

Lecture 1 Measure concentration

Nonlinear Time Series Modeling

Lectures on Elementary Probability. William G. Faris

What is MuDRain (Multivariate disaggregation of rainfall)? # $ K

Some Concepts of Probability (Review) Volker Tresp Summer 2018

Lecture 1: Overview of percolation and foundational results from probability theory 30th July, 2nd August and 6th August 2007

1. Stochastic Processes and Stationarity

Deccan Education Society s FERGUSSON COLLEGE, PUNE (AUTONOMOUS) SYLLABUS UNDER AUTOMONY. SECOND YEAR B.Sc. SEMESTER - III

LQ-Moments for Statistical Analysis of Extreme Events

Probability Methods in Civil Engineering Prof. Rajib Maity Department of Civil Engineering Indian Institute of Technology, Kharagpur

Infinite Continued Fractions

The battle of extreme value distributions: A global survey on the extreme

A Framework for Daily Spatio-Temporal Stochastic Weather Simulation

Final exam (practice) UCLA: Math 31B, Spring 2017

Lecture 11. Multivariate Normal theory

Estimation of risk measures for extreme pluviometrical measurements

Gaussian distributions and processes have long been accepted as useful tools for stochastic

Economics 573 Problem Set 4 Fall 2002 Due: 20 September

Rearrangement Algorithm and Maximum Entropy

A generalized mathematical framework for stochastic simulation and

5.6 The Normal Distributions

Estimation and Confidence Intervals for Parameters of a Cumulative Damage Model

Random Process. Random Process. Random Process. Introduction to Random Processes

Geometric Skew-Normal Distribution

Physics 509: Bootstrap and Robust Parameter Estimation

Monte Carlo Studies. The response in a Monte Carlo study is a random variable.

Part IA Probability. Definitions. Based on lectures by R. Weber Notes taken by Dexter Chua. Lent 2015

TAMS39 Lecture 2 Multivariate normal distribution

Multivariate Distributions

Correlation: Copulas and Conditioning

Intermittency, Fractals, and β-model

Solutions to Problem Set 5

AP Calculus Testbank (Chapter 9) (Mr. Surowski)

1. Fundamental concepts

Chapter 9. Non-Parametric Density Function Estimation

Stat 710: Mathematical Statistics Lecture 31

Transcription:

Lecture notes on stochastics Part 3 Rome, 3-4 May 208 Knowable moments and K-climacogram Demetris Koutsoyiannis Department of Water Resources and Environmental Engineering School of Civil Engineering National Technical University of Athens, Greece (dk@itia.ntua.gr, http://www.itia.ntua.gr/dk/) Presentation available online: http://www.itia.ntua.gr/835/

Introduction Classical moments, raw or central, express important theoretical properties of probability distributions but cannot be estimated from typical samples for order beyond 2 cf. Lombardo et al. (204): Just two moments!. L-moments are better estimated but they are all of first order in terms of the random variable of interest. They are good to characterize independent series or to infer the marginal distribution of stochastic processes, but they cannot characterize even second order dependence of processes. Picking from both categories, we introduce K-moments, which combine advantages of both classical and L moments. They enable reliable estimation from samples (in some cases even more reliable than L moments) and effective description of high order statistics, useful for marginal and joint distributions of stochastic processes. High-order joint statistics of stochastic properties involve multivariate functions expressing joint high-order moments. Here, by extending the notion of climacogram (Koutsoyiannis, 20, 206) and climacospectrum (Koutsoyiannis, 207) we introduce the K-climacogram and the K- climacospectrum, which enable characterization of high-order properties of stochastic processes, as well as preservation thereof in simulations, in terms of univariate functions. D. Koutsoyiannis, Knowable moments and K-climacogram

A note on classical moments The classical definitions of raw and central moments of order p are: μ p E[x p ], μ p E[(x μ) p ] () respectively, where μ μ = E[x] is the mean of the random variable x. Their standard estimators from a sample xi, i =,, n, are μ p = n x p n i= i, μ p = b(n,p) n (x n i μ ) p i= (2) where a(n, p) is a bias correction factor (e.g. for the variance μ2 =: σ 2, b(n, 2) = n/(n )). The estimators of the raw moments μ p are in theory unbiased, but it is practically impossible to use them in estimation if p > 2 cf. Lombardo et al. (204), Just two moments. In fact, because for large p, it holds that ( x n i= i ) /p max i n (x i ) *, we can conclude that, for an unbounded variable x, asymptotically μ p is not an estimator of μ p but one of an extreme quantity, i.e., the nth order statistic raised to power p. Thus, unless p is very small, μ p is not a knowable quantity: we cannot infer its value from a sample. This is the case even if n is very large! n p * This is precise if xi are positive; see also p. 5. D. Koutsoyiannis, Knowable moments and K-climacogram 2

Definition of K-moments To derive knowable moments for high orders p, in the expectation defining the pth moment we raise x μ to a lower power q < p and for the remaining p q terms we replace x μ with 2F(x), where F(x) is the distribution function. This leads to the following (central) K-moment definition: K pq (p q + )E[(2F(x) ) p q (x μ) q ] (3) Likewise, we define non-central K-moments as: K pq (p q + )E [(F(x)) p q x q ] (4) The quantity (2F(x) ) p q is estimated from a sample without using powers of x. Specifically, for the ith element of a sample x(i) of size n, sorted in ascending order, F(x(i)), is estimated as F (x (i) ) = (i )/(n ), thus taking values from 0 to precisely and irrespective of the values x(i); likewise, 2F(x(i)) is estimated as 2F (x (i) ) = (2i n + )/(n ), taking values from to precisely and irrespective of the values x(i). Hence, the estimators are: = n ( i q n i=, K pq = n n (2i n+)p q (x (i) μ ) q K pq n )p q x (i) i= (5) n D. Koutsoyiannis, Knowable moments and K-climacogram 3

Rationale of the definition. Assuming that the distribution mean is close to the median, so that F(μ) /2 (this is precisely true for a symmetric distribution), the quantity whose expectation is taken in (3) is A(x) (2F(x) ) p q (x μ) q and its Taylor expansion is A(x) = (2f(μ)) p q (x μ) p + (p q)(2f(μ)) p q f (μ)(x μ) p+ + O((x μ) p+2 ) (6) where f(x) is the probability density function of x. Clearly then, K pq depends on μ p as well as classical moments of x of order higher than p. The independence of K pq from classical moments of order < p makes it a good knowable surrogate of the unknowable μ p. 2. As p becomes large, by virtue of the multiplicative term (p q + ) in definition (3), K pq shares similar asymptotic properties with μ p q/p (the estimate, not the true μ q/p p ). To illustrate this for q =, we consider the variable z max i p x i and denote f( ) and h( ) the probability densities of x i and z, respectively. Then (Papoulis, 990, p. 209): and thus, by virtue of (4), h(z) = pf(z)(f(z)) p (7) E[z] = pe [(F(x)) p x] = K p (8) On the other hand, as seen in p. 2, for positive x and large p n, E[μ p /p ] = E [( n n x p i= i ) /p ] E [ max x i] = E[z] = K p (9) i n Note also that the multiplicative term (p q + ) in definition (3) and (4) makes K-moments increasing functions of p. D. Koutsoyiannis, Knowable moments and K-climacogram 4

Asymptotic properties of moment estimates Generally, as p becomes large (approaching n), estimates of both classical and K moments, central or non-central, become estimates of expressions involving extremes such as (max i p x i ) q or max i p (x i μ) q. For negatively skewed distributions these quantities can also involve minimum, instead of maximum quantities. For the K-moments this is consistent with their theoretical definition. For the classical moments this is an inconsistency. A common property of both classical and K moments is that symmetrical distributions have all their odd moments equal to zero. Both classical and K moments are non-decreasing functions of p, separately for odd and even p. In geophysical processes we can justifiably assume that the variance μ2 σ 2 K22 is finite (an infinite variance would presuppose infinite energy to materialize, which is absurd). Hence, high order K-moments Kp2 will be finite too, even if classical moments μp diverge to infinity beyond a certain p (i.e., in heavy tailed distributions). D. Koutsoyiannis, Knowable moments and K-climacogram 5

Moment value Moment value Moment value Moment value Justification of the notion of unknowable vs. knowable Theoretical μ'ₚ¹ᵖ μₚ¹ᵖ K'ₚ₁ Kₚ₁ Sample Estimates μ'ₚ¹ᵖ μₚ¹ᵖ K'ₚ₁ Kₚ₁ 00 0 Theoretical μ'ₚ¹ᵖ μₚ¹ᵖ K'ₚ₁ Kₚ₁ Sample Estimates μ'ₚ¹ᵖ μₚ¹ᵖ K'ₚ₁ Kₚ₁ These are supposed to describe the same thing. Do they? 0. Normal, q = even Note: odd moments are zero Lognormal, q = 0. 0 0 odd 0 Theoretical μ'ₚ²ᵖ μₚ²ᵖ K'ₚ₂ Kₚ₂ Sample Estimates μ'ₚ²ᵖ μₚ²ᵖ K'ₚ₂ Kₚ₂ 0000 000 00 Theoretical μ'ₚ²ᵖ μₚ²ᵖ K'ₚ₂ Kₚ₂ Sample Estimates μ'ₚ²ᵖ μₚ²ᵖ K'ₚ₂ Kₚ₂ 0 Normal, q = 2 0. 0 Lognormal, q = 2 0 Note: Sample sizes are ten times higher than the maximum p shown in graphs, i.e., 00. D. Koutsoyiannis, Knowable moments and K-climacogram 6

Relationship among different moment types The classical moments can be recovered as a special case of K moments: M p K pp. In particular, in uniform distribution, classical and K moments are proportional to each other: K pq (p q + )μ p, K pq (p q + )μ p () The probability weighted moments (PWM), defined as β p E [x (F(x)) p ], are a special case of K- moments corresponding to q = : K p = pβ p () The L-moments defined as λ p p p ( )k ( p k=0 k ) E[x (p k):p], where x k:p denotes the kth order statistic in an independent sample of size p. L-moments are also related to PWM and through them to K moments. In particular, the relationships for the different types of moments for the first four orders are: K = μ = β 0, K = 0 K 2 = 2β, K 2 = 2(K 2 μ) = 4β 2β 0 = 2λ 2 K 3 = 3β 2, K 3 = 4(K 3 μ) 6(K 2 μ) = 2β 2 2β + 2β 0 = 2λ 3 K 4 = 4β 3, K 4 = 8(K 4 μ) 6(K 3 μ) + 2(K 2 μ) = 32β 3 48β 2 + 24β 4β 0 = 8 λ 5 4 + 2 λ 5 2 (2) D. Koutsoyiannis, Knowable moments and K-climacogram 7

Basic characteristics of marginal distribution Within the framework of K-moments, we can (and should) use Just two moments in terms of the power of x, i.e. q = or 2, but we can obtain knowable statistical characteristics for much higher order p. In this manner, for p > we have two alternative options to define statistical characteristics related to moments of the distribution, as in the table below. (Which of the two is preferable depends on the statistical behaviour and in particular, mean, mode and variance of the estimator.) Characteristic Order p Option Option 2 Location K = μ Variability 2 K 2 = 2(K 2 μ) = 2λ 2 K 22 = σ 2 (the classical variance) Skewness 3 K 3 = λ 3 K 32 (dimensionless) K 2 λ 2 K 22 Kurtosis 4 K 4 = 4 λ 4 + 6 K 42 (dimensionless) K 2 5 λ 2 5 K 22 D. Koutsoyiannis, Knowable moments and K-climacogram 8

High order moments for stochastic processes: the K-climacogram and the K-climacospectrum Second order properties of stochastic properties are typically expressed by the autocovariance function, c(h) := cov[x(t), x(t + h)]. An equivalent description is by the power spectrum, which is the Fourier transform of the autocovariance, s(w) 4 c(h) cos(2πwh) dh. 0 Another fully equivalent description with many advantages (Dimitriadis and Koutsoyiannis 205, Koutsoyiannis 206) is through the climacogram, the variance of the averaged process, i.e., t γ(k) var[x(k)/k], where X(t) x(ξ)dξ. The climacogram is connected to autocovariance by 0 d 2 (h 2 γ(h)) γ(k) = 2 ( χ)c(χk)dχ and c(h) =. A surrogate of the power spectrum with several 0 2 dh 2 advantages over it is the climacospectrum (Koutsoyiannis, 207) defined as ζ(k) k(γ(k) γ(2k)). Full description of the third-order, fourth-order, etc., properties of a stochastic process requires functions of 2, 3,, variables. For example, the third order properties are expressed in terms of c3(h, h2) := E[(x(t) μ) (x(t + h) μ) (x(t + h2) μ)]. Such a description is not parsimonious and its accuracy holds only in theory, because sample estimates are not reliable. Therefore we introduce single-variable descriptions for any order p, expanding the idea of the climacogram and climacospectrum based on K-moments. K-climacogram: γ pq (k) = (p q + )E[(2F(X(k)/k) ) p q (X(k)/k μ) q ] (3) K-climacospectrum: ζ pq (k) = k(γ pq(k) γ pq (2k)) (4) ln 2 where γ 22 (k) γ(k) and ζ 22 (k) ζ(k). Even though the K-moment description is not equivalent to the multivariate high-order one, it suffices to define the marginal distribution at any scale k. ln 2 D. Koutsoyiannis, Knowable moments and K-climacogram 9

K-moment value K-climacospectrum value K-moment value K-climacospectrum value Example : Turbulent velocity 00 0 0. 0.0 6 9 20 (2**) 0 00 000 0. 0.0 6 9 20 (2*) 0 00 000 0 6 7 8 9 20 00 0 0. 0 00 000 Data: 60 000 values of turbulent velocity along the flow direction (Kang, 2003; Koutsoyiannis 207, Dimitriadis and Koutsoyiannis, 208); the original series was averaged so that time scale corresponds to 0.5 s. Note: Plot (2*) is constructed from the variance and (2**) corresponds to standard deviation. 0. 6 7 8 9 20 0 00 000 D. Koutsoyiannis, Knowable moments and K-climacogram

K-moment value K-climacospectrum value K-moment value K-climacosectrum value Example 2: Rainfall rate at Iowa measured every s 0 6 9 20 (2**) 000 00 0 0 00 000 0. 0.0 6 9 20 (2*) 0 00 000 00 0 6 7 8 9 20 000 00 0 0 00 000 Data: 29542 values of rainfall at Iowa measured at temporal resolution of s (merger of seven events from Georgakakos et al. 994; see also Lombardo et al. 202). Plot (2*) is constructed from the variance and (2**) corresponds to standard deviation. 6 7 8 9 20 0 00 000 D. Koutsoyiannis, Knowable moments and K-climacogram

Moment value K-moment value K-climacospectrum value Example 3: Daily rainfall at Padova 00 0 6 7 8 9 20 00 0 0. 0.0 0 00 000 00 μ'ₚ¹ᵖ K'ₚ₁ K'ₚ₂¹² Kₚ₂¹² E[max(x₁,..,xₚ)] 0 0 00 000 Order p 6 7 8 9 20 0 00 000 Data: 0 442 values of daily rainfall at Padova (the longest rainfall record existing worldwide; Marani and Zanetti, 205). Note about the graph on the left: Notice that moments are plotted against order p and thus approximately represent maxima for a time window of length p. For independent processes E[max(x,, x p )] should be equal to K p, but when there is dependence the two quantities slightly differ; the former reflects the joint distribution and the latter the marginal one. D. Koutsoyiannis, Knowable moments and K-climacogram 2

References Dimitriadis, P., and Koutsoyiannis, D. (205), Climacogram versus autocovariance and power spectrum in stochastic modelling for Markovian and Hurst Kolmogorov processes, Stochastic Environmental Research & Risk Assessment, 29 (6), 649 669, doi:.07/s00477-05-23-7. Dimitriadis, P., and Koutsoyiannis, D. (208), Stochastic synthesis approximating any process dependence and distribution, Stochastic Environmental Research & Risk Assessment, doi:.07/s00477-08-540-2. Georgakakos, K. P., Cârsteanu, A. A., Sturdevant, P. L. and Cramer, J. A. (994), Observation and Analysis of Midwestern Rain Rates, Journal of Applied Meteorology, 33, 433-444. Kang, H.S., Chester, S., and Meneveau, C. (2003). Decaying turbulence in an active-grid-generated flow and comparisons with large-eddy simulation, Journal of Fluid Mechanics, 480, 29 60. Koutsoyiannis, D. (20), A random walk on water, Hydrology and Earth System Sciences, 4, 585 60, doi:.594/hess-4-585-20. Koutsoyiannis, D. (206), Generic and parsimonious stochastic modelling for hydrology and beyond, Hydrological Sciences Journal, 6 (2), 225 244, doi:.80/02626667.205.6950. Koutsoyiannis, D. (207), Entropy production in stochastics, Entropy, 9 (), 58, doi:.3390/e958. Lombardo, F., Volpi, E., and Koutsoyiannis, D. (202), Rainfall downscaling in time: Theoretical and empirical comparison between multifractal and Hurst-Kolmogorov discrete random cascades, Hydrological Sciences Journal, 57 (6), 52 66. Lombardo, F., Volpi, E., Koutsoyiannis, D., and Papalexiou, S.M. (204), Just two moments! A cautionary note against use of high-order moments in multifractal models in hydrology, Hydrology and Earth System Sciences, 8, 243 255, doi:.594/hess-8-243-204. Marani, M., and Zanetti, S. (205), Long-term oscillations in rainfall extremes in a 268 year daily time series, Water Resources Research, 5, 639 647. D. Koutsoyiannis, Knowable moments and K-climacogram 3