Analysis of non-stationary Data. Illia Horenko

Similar documents
On Robust Estimation of Low-Frequency Variability Trends in Discrete Markovian Sequences of Atmospheric Circulation Patterns

On robust estimation of low-frequency. variability trends in discrete Markovian. sequences of atmospheric circulation patterns

Computational Time Series Analysis (IPAM, Part I, stationary methods) Illia Horenko

Persistent Regime Modes Of Mid Latitude

Numerics of Stochastic Processes II ( ) Illia Horenko

On clustering of non-stationary meteorological time series

Introduction to Probabilistic Graphical Models: Exercises

A Framework for Daily Spatio-Temporal Stochastic Weather Simulation

Models for models. Douglas Nychka Geophysical Statistics Project National Center for Atmospheric Research

Hidden Markov Models for precipitation

COURSE INTRODUCTION. J. Elder CSE 6390/PSYC 6225 Computational Modeling of Visual Perception

On simultaneous data based dimension reduction and hidden phase identification

Modeling conditional distributions with mixture models: Applications in finance and financial decision-making

Seasonal forecasting of climate anomalies for agriculture in Italy: the TEMPIO Project

Nonlinear atmospheric response to Arctic sea-ice loss under different sea ice scenarios

A spatio-temporal model for extreme precipitation simulated by a climate model

Bayesian Estimation with Sparse Grids

Autonomous Navigation for Flying Robots

Likelihood-Based Methods

STOCHASTIC MODELING OF ENVIRONMENTAL TIME SERIES. Richard W. Katz LECTURE 5

Communications in Applied Mathematics and Computational Science

The priority program SPP1167 Quantitative Precipitation Forecast PQP and the stochastic view of weather forecasting

Pattern Recognition and Machine Learning

Time Series Analysis. James D. Hamilton PRINCETON UNIVERSITY PRESS PRINCETON, NEW JERSEY

Sparse and Robust Optimization and Applications

Multi-resolution models for large data sets

Markov Switching Regular Vine Copulas

Time Series: Theory and Methods

Threshold estimation in marginal modelling of spatially-dependent non-stationary extremes

Iterative Gaussian Process Regression for Potential Energy Surfaces. Matthew Shelley University of York ISNET-5 Workshop 6th November 2017

Lecture 9: Markov Switching Models

STOCHASTIC MODELS LECTURE 1 MARKOV CHAINS. Nan Chen MSc Program in Financial Engineering The Chinese University of Hong Kong (ShenZhen) Sept.

A Gaussian state-space model for wind fields in the North-East Atlantic

Semi-parametric estimation of non-stationary Pickands functions

CPSC 540: Machine Learning

Machine Learning. Gaussian Mixture Models. Zhiyao Duan & Bryan Pardo, Machine Learning: EECS 349 Fall

Probabilistic Graphical Models

Statistical analysis of regional climate models. Douglas Nychka, National Center for Atmospheric Research

Nonlinear and/or Non-normal Filtering. Jesús Fernández-Villaverde University of Pennsylvania

Recent Advances in Bayesian Inference Techniques

Massive MIMO: Signal Structure, Efficient Processing, and Open Problems II

STA 4273H: Statistical Machine Learning

5.5.3 Statistical Innovative Trend Test Application Crossing Trend Analysis Methodology Rational Concept...

Cheng Soon Ong & Christian Walder. Canberra February June 2018

Nonstationary spatial process modeling Part II Paul D. Sampson --- Catherine Calder Univ of Washington --- Ohio State University

Markov Chain Monte Carlo methods

Temporal Wind Variability and Uncertainty

GENERALIZED LINEAR MODELING APPROACH TO STOCHASTIC WEATHER GENERATORS

Reduced Overdispersion in Stochastic Weather Generators for Statistical Downscaling of Seasonal Forecasts and Climate Change Scenarios

EM-algorithm for Training of State-space Models with Application to Time Series Prediction

Linear Regression. Aarti Singh. Machine Learning / Sept 27, 2010

Extending clustered point process-based rainfall models to a non-stationary climate

Introduction to Spatial Data and Models

Linear Dynamical Systems

Localization of Radioactive Sources Zhifei Zhang

Radar data assimilation using a modular programming approach with the Ensemble Kalman Filter: preliminary results

Hidden Markov Models (HMM) and Support Vector Machine (SVM)

Ergodicity in data assimilation methods

Conditioning stochastic properties of daily precipitation on indices of atmospheric circulation

Introduction to Spatial Data and Models

The analog data assimilation: method, applications and implementation

STA414/2104 Statistical Methods for Machine Learning II

Human Mobility Pattern Prediction Algorithm using Mobile Device Location and Time Data

ECONOMICS 7200 MODERN TIME SERIES ANALYSIS Econometric Theory and Applications

Multivariate Non-Normally Distributed Random Variables

Advanced Introduction to Machine Learning

Statistical modelling in climate science

Modeling and Simulating Rainfall

Gaussian with mean ( µ ) and standard deviation ( σ)

Assessment of uncertainty in computer experiments: from Universal Kriging to Bayesian Kriging. Céline Helbert, Delphine Dupuy and Laurent Carraro

Lecture 2 APPLICATION OF EXREME VALUE THEORY TO CLIMATE CHANGE. Rick Katz

Advanced uncertainty evaluation of climate models by Monte Carlo methods

FORECAST VERIFICATION OF EXTREMES: USE OF EXTREME VALUE THEORY

Cross-covariance Functions for Tangent Vector Fields on the Sphere

Modeling conditional distributions with mixture models: Theory and Inference

An Application of Graph Theory in Markov Chains Reliability Analysis

Switching Regime Estimation

Gaussian Processes 1. Schedule

An Introduction to Nonstationary Time Series Analysis

Condensed Table of Contents for Introduction to Stochastic Search and Optimization: Estimation, Simulation, and Control by J. C.

3. ESTIMATION OF SIGNALS USING A LEAST SQUARES TECHNIQUE

Review. DS GA 1002 Statistical and Mathematical Models. Carlos Fernandez-Granda

Modelling trends in the ocean wave climate for dimensioning of ships

Detection and Attribution of Climate Change. ... in Indices of Extremes?

Finding Robust Solutions to Dynamic Optimization Problems

A Bayesian Perspective on Residential Demand Response Using Smart Meter Data

Kriging models with Gaussian processes - covariance function estimation and impact of spatial sampling

STATISTICAL METHODS FOR RELATING TEMPERATURE EXTREMES TO LARGE-SCALE METEOROLOGICAL PATTERNS. Rick Katz

R&D Research Project: Scaling analysis of hydrometeorological time series data

Ensemble Data Assimilation and Uncertainty Quantification

Linear Dynamical Systems (Kalman filter)

Sparsity in system identification and data-driven control

Clustering Techniques and their applications at ECMWF

DATA-DRIVEN TECHNIQUES FOR ESTIMATION AND STOCHASTIC REDUCTION OF MULTI-SCALE SYSTEMS

MS&E 226: Small Data. Lecture 11: Maximum likelihood (v2) Ramesh Johari

STA 414/2104: Machine Learning

Preface to Second Edition... vii. Preface to First Edition...

Identification in closed-loop, MISO identification, practical issues of identification

Data. Climate model data from CMIP3

ALMA MEMO : the driest and coldest summer. Ricardo Bustos CBI Project SEP 06

Transcription:

Analysis of non-stationary Data Illia Horenko Scientific Computing Institute of Mathematics Freie Universität Berlin (FU) DFG Research Center MATHEON Mathematics in key technologies

Motivation global stationarity, local stationarity, examples Non-Stationarity for continuous state space processes: Fuzzy Clustering with Regression Models (FCRM) Finite Element Clustering for continuos state space processes Non-Stationarity for discrete state space processes: Kernel Filtering Methods Single Trend Model FEM-Clustering of Markov-chain output Example I: analysis of historical weather patterns Example II: analysis of the historical temperatures (1947-2007)

Non-Stationarity Market Phase 2 Fluid Mechanics Market Phase 1 Meteorology/Climate Computational Finance Biophysics/Drug Design (Local) weak stationarity: (local) time independence of mean values and covariances Not fulfilled in many cases

Non-Stationarity in Continuous State Space Time Series

Example: weather data analysis Global Historical Data Local Historical Data Non-stationarity because of the climate change => standard data-analysis methods are not applicable

Memo I: K-Means clustering Assumption: time-independence of cluster centers local stationarity

Toy Example: Non-stationary Extension of K-Means: FCRM

Non-stationary Extension of K-Means: FCRM Exercise 1: Derive an expression for the optimal estimator of the regression parameters θ (x and γ are fixed). What happens to this estimate if R is growing? How to define the optimal R? Suggest and discuss the possible numerical solutions.

Non-stationary Extension of K-Means: FCRM Toy Example: Proper identification of persistent clusters impossible (try the FEM-Clustering?)

FEM: Regularized Clustering Functional Regularized clustering functional:

FEM: Regularized Clustering Functional Iterative Subspace Minimization: sparse QP canbeused

Toy Example: FEM: Regularized Clustering Functional

FEM: Regularized Clustering Functional

FEM: Regularized Clustering Functional

Non-Stationarity in Discrete State Space Time Series

Example:Weather in UK(1945-2007) Historical Circulation Data: weather regimes (Data from the Univ. of East Anglia) 3 atmospherical states considered

Memo II: Markov Chain Log-Likelihood Observed Time Series:, Markov-Property: Log-Likelihood: Maximization problem is ill-posed => regularization necessary

Circulation Patterns for UK(1945-2007) Historical Circulation Data: 27 Lamb regimes (Data from the Univ. of East Anglia) 3 atmospherical states considered Regularization: global stationarity assumption anticyclonic anticyclonic other cyclonic 0.34 0.40 0.26 other 0.35 0.41 0.24 cyclonic 0.34 0.41 0.25

Locally Stationary Markov Chain Regularization: local stationarity assumption inside of the window Gaussian Window: Approximate Log-Likelihood: Gaussian Kernel Filtering: Standard statistical methods applicable to calculate the conf. intervals

Locally Stationary Markov Chain Regularization: local stationarity assumption inside of the window Gaussian Window: Approximate Log-Likelihood: Gaussian Kernel Filtering: Exercise 2: proof that this formula is true

Circulation Patterns for UK(1945-2007) Historical Circulation Data: 27 Lamb regimes (Data from the Univ. of East Anglia) 3 atmospherical states considered Gaussian Window vs. Homogenous Estimator anticyclonic anticyclonic other cyclonic cyclonic other

Circulation Patterns for UK(1945-2007) Historical Circulation Data: 27 Lamb regimes (Data from the Univ. of East Anglia) 3 atmospherical states considered

Parametric Estimation (H. 08, to appear in J. of Atmos. Sci.) Markovian Trend Model: Log-Likelihood: Numerics: Nelder-Mead Optimization Algorithm

Circulation Patterns for UK(1945-2007) Historical Circulation Data: 27 Lamb regimes (Data from the Univ. of East Anglia) 3 atmospherical states considered Polynomial Trend Model anticyclonic anticyclonic other cyclonic cyclonic other

Circulation Patterns for UK(1945-2007) Historical Circulation Data: 27 Lamb regimes (Data from the Univ. of East Anglia) 3 atmospherical states considered Polynomial Trend Model

FEM: Regularized Clustering Functional Regularized clustering functional:

FEM: Regularized Clustering Functional Regularized clustering functional: (H. 08, to appear in J. of Atmos. Sci.)

FEM: Regularized Clustering Functional Regularized clustering functional: (H. 08, to appear in J. of Atmos. Sci.) Iterative Subspace Minimization: sparse QP canbeused

Circulation Patterns for UK(1945-2007) Historical Circulation Data: 28 Lamb regimes (Data from the Univ. of East Anglia) 3 atmospherical states considered anticyclonic anticyclonic other cyclonic cyclonic other

Circulation Patterns for UK(1945-2007) Historical Circulation Data: 28 Lamb regimes (Data from the Univ. of East Anglia) 3 atmospherical states considered Gaussian Kernel Estimator FEM-Clustering vs. Single Trend

Example: analysis of hystorical temperatures (1947-2007)

Temperature Data Analysis Global Historical Temperatures Historical Temperatures in Europe Data from Boulder Center, USA Data from Ch. Franzke, BAS Cambridge Temperature Data on the 2D-Grid (1947-2007)

European Data Analysis

European Data Analysis

European Data Analysis Cluster 1 Cluster 2 Cluster 3

European Data Analysis Cluster 1 Cluster 2 Cluster 3

European Data Analysis Cluster 1 Cluster 2 Cluster 3

European Data Analysis Cluster 1 Cluster 2 Cluster 3

European Data Analysis

Global Data Analysis Global Historical Temperature Data (80x120 grid, daily values 1947-2007) (Data from the NCAR, Boulder, US )

Global Data Analysis Global Historical Temperature Data (80x120 grid, daily values 1947-2007) (Data from the NCAR, Boulder, US )

Take-Home-Messages 1. Different methods for analysis of non-stationary data were presented 2. One can use similar approaches in both continuous and discrete data analysis 3. Issue of non-stationarity is important in analysis of historical time series

Thank you for attention!