Short tutorial on data assimilation

Similar documents
TerrSysMP-PDAF: Technical concepts and application examples

Smoothers: Types and Benchmarks

Lecture 2: From Linear Regression to Kalman Filter and Beyond

Lecture 2: From Linear Regression to Kalman Filter and Beyond

Fundamentals of Data Assimilation

Multivariate and Multiscale Data Assimilation in Terrestrial Systems: A Review

A new Hierarchical Bayes approach to ensemble-variational data assimilation

Fundamentals of Data Assimila1on

Fundamentals of Data Assimila1on

The Kalman Filter ImPr Talk

1. Current atmospheric DA systems 2. Coupling surface/atmospheric DA 3. Trends & ideas

Dynamic System Identification using HDMR-Bayesian Technique

Data Assimilation Research Testbed Tutorial

Towards a probabilistic hydrological forecasting and data assimilation system. Henrik Madsen DHI, Denmark

An Efficient Ensemble Data Assimilation Approach To Deal With Range Limited Observation

Lagrangian Data Assimilation and Its Application to Geophysical Fluid Flows

State and Parameter Estimation in Stochastic Dynamical Models

Estimation of State Noise for the Ensemble Kalman filter algorithm for 2D shallow water equations.

Convective-scale data assimilation in the Weather Research and Forecasting model using a nonlinear ensemble filter

4. DATA ASSIMILATION FUNDAMENTALS

The Ensemble Kalman Filter:

Introduction to Ensemble Kalman Filters and the Data Assimilation Research Testbed

Introduction to Particle Filters for Data Assimilation

A Note on the Particle Filter with Posterior Gaussian Resampling

(Extended) Kalman Filter

Bayesian Calibration of Simulators with Structured Discretization Uncertainty

Correcting biased observation model error in data assimilation

Constrained data assimilation. W. Carlisle Thacker Atlantic Oceanographic and Meteorological Laboratory Miami, Florida USA

DRAGON ADVANCED TRAINING COURSE IN ATMOSPHERE REMOTE SENSING. Inversion basics. Erkki Kyrölä Finnish Meteorological Institute

DART_LAB Tutorial Section 2: How should observations impact an unobserved state variable? Multivariate assimilation.

Ergodicity in data assimilation methods

Using Data Assimilation to Explore Precipitation - Cloud System - Environment Interactions

EnKF-based particle filters

Enhancing information transfer from observations to unobserved state variables for mesoscale radar data assimilation

Organization. I MCMC discussion. I project talks. I Lecture.

Methods of Data Assimilation and Comparisons for Lagrangian Data

Ensemble Data Assimilation and Uncertainty Quantification

Forecasting and data assimilation

UNDERSTANDING DATA ASSIMILATION APPLICATIONS TO HIGH-LATITUDE IONOSPHERIC ELECTRODYNAMICS

Data assimilation in high dimensions

Par$cle Filters Part I: Theory. Peter Jan van Leeuwen Data- Assimila$on Research Centre DARC University of Reading

Sequential Monte Carlo Methods for Bayesian Computation

The Ensemble Kalman Filter:

Bayesian Inverse problem, Data assimilation and Localization

Bayesian Statistics and Data Assimilation. Jonathan Stroud. Department of Statistics The George Washington University

Adaptive ensemble Kalman filtering of nonlinear systems

Inverse problems and uncertainty quantification in remote sensing

A Stochastic Collocation based. for Data Assimilation

Data Assimilation Research Testbed Tutorial

Data Analysis I. Dr Martin Hendry, Dept of Physics and Astronomy University of Glasgow, UK. 10 lectures, beginning October 2006

Multi-variate hydrological data assimilation opportunities and challenges. Henrik Madsen DHI, Denmark

A data-driven method for improving the correlation estimation in serial ensemble Kalman filter

Lagrangian Data Assimilation and Manifold Detection for a Point-Vortex Model. David Darmon, AMSC Kayo Ide, AOSC, IPST, CSCAMM, ESSIC

Gaussian Process Approximations of Stochastic Differential Equations

Ensemble Kalman Filters for WRF-ARW. Chris Snyder MMM and IMAGe National Center for Atmospheric Research

Stochastic Collocation Methods for Polynomial Chaos: Analysis and Applications

Assessing the Value of Information from Inverse Modelling for Optimising Long-Term Oil Reservoir Performance

Ensemble Kalman Filter

Strong Lens Modeling (II): Statistical Methods

Soil Moisture Prediction and Assimilation

Lecture 7: Optimal Smoothing

A Spectral Approach to Linear Bayesian Updating

Lecture 6: Bayesian Inference in SDE Models

Application of the Ensemble Kalman Filter to History Matching

Hierarchical Bayes Ensemble Kalman Filter

ECE295, Data Assimila0on and Inverse Problems, Spring 2015

On a Data Assimilation Method coupling Kalman Filtering, MCRE Concept and PGD Model Reduction for Real-Time Updating of Structural Mechanics Model

European High-Resolution Soil Moisture Analysis (EHRSOMA)

A nested sampling particle filter for nonlinear data assimilation

ECE276A: Sensing & Estimation in Robotics Lecture 10: Gaussian Mixture and Particle Filtering

Data assimilation : Basics and meteorology

Recap on Data Assimilation

Winter 2019 Math 106 Topics in Applied Mathematics. Lecture 1: Introduction

Markov Chain Monte Carlo

Building Ensemble-Based Data Assimilation Systems. for High-Dimensional Models

Gaussian Process Approximations of Stochastic Differential Equations

Performance of ensemble Kalman filters with small ensembles

DART Tutorial Part II: How should observa'ons impact an unobserved state variable? Mul'variate assimila'on.

Comparing the SEKF with the DEnKF on a land surface model

An evaluation of the nonlinear/non-gaussian filters for the sequential data assimilation

F denotes cumulative density. denotes probability density function; (.)

Seminar: Data Assimilation

Graphical Models for Statistical Inference and Data Assimilation

The Canadian approach to ensemble prediction

Data assimilation with and without a model

DART Tutorial Sec'on 5: Comprehensive Filtering Theory: Non-Iden'ty Observa'ons and the Joint Phase Space

Practical Bayesian Optimization of Machine Learning. Learning Algorithms

Towards a Bayesian model for Cyber Security

Ji-Sun Kang. Pr. Eugenia Kalnay (Chair/Advisor) Pr. Ning Zeng (Co-Chair) Pr. Brian Hunt (Dean s representative) Pr. Kayo Ide Pr.

Bayesian Inference for DSGE Models. Lawrence J. Christiano

Data assimilation; comparison of 4D-Var and LETKF smoothers

Parameter Estimation in the Spatio-Temporal Mixed Effects Model Analysis of Massive Spatio-Temporal Data Sets

Advanced uncertainty evaluation of climate models by Monte Carlo methods

Probabilistic Graphical Models

Data assimilation with and without a model

Relationship between Singular Vectors, Bred Vectors, 4D-Var and EnKF

Ensemble Kalman filter assimilation of transient groundwater flow data via stochastic moment equations

Robotics 2 Target Tracking. Kai Arras, Cyrill Stachniss, Maren Bennewitz, Wolfram Burgard

Bayesian Inference for DSGE Models. Lawrence J. Christiano

Coupled Ocean-Atmosphere Assimilation

Transcription:

Mitglied der Helmholtz-Gemeinschaft Short tutorial on data assimilation 23 June 2015 Wolfgang Kurtz & Harrie-Jan Hendricks Franssen Institute of Bio- and Geosciences IBG-3 (Agrosphere), Forschungszentrum Jülich GmbH Centre for High-Performance Computing in Terrestrial Systems, Geoverbund ABC/J, Jülich

Data assimilation Optimal merging of (uncertain) model predictions with (uncertain) measurements. 23 June 2015 IBG-3: Agrosphere 1

Data assimilation Optimal merging of (uncertain) model predictions with (uncertain) measurements. Models are applied in a variety of earth system disciplines: Atmosphere Oceanography Land surface Surface water/ groundwater Glaciology Radiative transfer models Vegetation dynamics/ Biogeochemistry 23 June 2015 IBG-3: Agrosphere 1

Why are model predictions uncertain? Model structural errors: Richards equation in land surface models Soil respiration in land surface models: simple black-box concept 23 June 2015 IBG-3: Agrosphere 2

Why are model predictions uncertain? Model structural errors: Richards equation in land surface models Soil respiration in land surface models: simple black-box concept Parameter errors: Soil hydraulic parameters like saturated conductivity or porosity Ecosystem parameters like rooting depth 23 June 2015 IBG-3: Agrosphere 2

Why are model predictions uncertain? Model structural errors: Richards equation in land surface models Soil respiration in land surface models: simple black-box concept Parameter errors: Soil hydraulic parameters like saturated conductivity or porosity Ecosystem parameters like rooting depth Errors in model forcings: Precipitation Short wave radiation 23 June 2015 IBG-3: Agrosphere 2

Why are model predictions uncertain? Model structural errors: Richards equation in land surface models Soil respiration in land surface models: simple black-box concept Parameter errors: Soil hydraulic parameters like saturated conductivity or porosity Ecosystem parameters like rooting depth Errors in model forcings: Precipitation Short wave radiation Errors in initial conditions: Initial soil moisture content Carbon pools 23 June 2015 IBG-3: Agrosphere 2

Measurement data Provide information on model states Provide (indirectly) information on parameters and model forcings Data are more valuable if they provide information over larger spatial and temporal scales 23 June 2015 IBG-3: Agrosphere 3

Measurement data Provide information on model states Provide (indirectly) information on parameters and model forcings Data are more valuable if they provide information over larger spatial and temporal scales Important limitations Information is always incomplete: not everywhere, not always Random measurement errors (e.g., instrument precision) Systematic measurement errors (e.g., LAI from SMOS) Complicated relationship between what is measured and the quantity of interest (e.g., brightness temperature from SMOS and soil moisture content) 23 June 2015 IBG-3: Agrosphere 3

Bayes Law Posterior {}}{ p(x y) = Likelihood {}}{ p(x) p(y x) p(y) }{{} Evidence Prior {}}{ 23 June 2015 IBG-3: Agrosphere 4

Bayes Law Posterior {}}{ p(x y) = Likelihood {}}{ p(x) p(y x) p(y) }{{} Evidence Prior {}}{ Often a Gaussian distribution is assumed as prior: p(x) exp ( 1 2 (x µ) C 1 (x µ) ) The likelihood in case of a Gaussian assumption is given by: p(y x) exp ( 1 2 (y Hx) R 1 (y Hx) ) 23 June 2015 IBG-3: Agrosphere 4

Bayes Law Posterior {}}{ p(x y) = Likelihood {}}{ p(x) p(y x) p(y) }{{} Evidence Prior {}}{ Often a Gaussian distribution is assumed as prior: p(x) exp ( 1 2 (x µ) C 1 (x µ) ) The likelihood in case of a Gaussian assumption is given by: p(y x) exp ( 1 2 (y Hx) R 1 (y Hx) ) Kalman filter and variational DA follow as solutions 23 June 2015 IBG-3: Agrosphere 4

DA methods Inverse modelling / Variational DA Markov Chain Monte Carlo (MCMC) Ensemble Kalman Filter (EnKF) Particle Filter (PF) Ensemble Kalman Smoother (EnKS)/ Particle Smoother (PS) 23 June 2015 IBG-3: Agrosphere 5

DA methods Inverse modelling / Variational DA Markov Chain Monte Carlo (MCMC) Ensemble Kalman Filter (EnKF) Particle Filter (PF) Ensemble Kalman Smoother (EnKS)/ Particle Smoother (PS) Differ with respect to e.g.: Temporal treatment of observations Intrinsic assumptions Computational cost Uncertainty quantification 23 June 2015 IBG-3: Agrosphere 5

Observations in variational DA/ MCMC t 1 t 2 t 3 t 4 t 5 Measurements are processed in batch to update states/parameters for all time steps. 23 June 2015 IBG-3: Agrosphere 6

Observations in filters (EnKF/ PF) t 1 t 2 t 3 t 4 t 5 Incoming measurements are only used to update states/parameters at current time step. 23 June 2015 IBG-3: Agrosphere 7

Observations in smoothers (EnKS/ PS) t 1 t 2 t 3 t 4 t 5 Incoming measurements are used to update states/parameters at current and previous time steps. 23 June 2015 IBG-3: Agrosphere 8

DA methods Markov Chain Monte Carlo very general, but expensive Inverse modelling/ Variational DA Gaussian assumption, uncertainty estimates relatively poor Particle Filter Markovian assumption (sequential), expensive, uncertainty estimates relatively poor related to filter collapse Ensemble Kalman Filter Marcovian assumption (sequential), Gaussian assumption, efficient, better uncertainty estimates than for gradient based inverse Ensemble Kalman Smoother Gaussian assumption, but data over longer time period 23 June 2015 IBG-3: Agrosphere 9

Ensemble Kalman Filter t-1 t t+1 23 June 2015 IBG-3: Agrosphere 10

Ensemble Kalman Filter t-1 t t+1 23 June 2015 IBG-3: Agrosphere 10

Ensemble Kalman Filter t-1 t t+1 23 June 2015 IBG-3: Agrosphere 10

Ensemble Kalman Filter t-1 t t+1 23 June 2015 IBG-3: Agrosphere 10

Ensemble Kalman Filter t-1 t t+1 23 June 2015 IBG-3: Agrosphere 10

Ensemble Kalman filter - Analysis scheme Prediction equation i = M(x t 1, p i, q i ) + ωi t x t x = model states p = model parameters q = model forcings ω = model errors M = (non-linear) forward model t = time i = model realization i 23 June 2015 IBG-3: Agrosphere 11

Ensemble Kalman filter - Analysis scheme Measurement equation ỹ i = Hx i + ɛ i x = model states ỹ = simulation at observation point H = measurement operator ɛ = measurement error i = model realization 23 June 2015 IBG-3: Agrosphere 12

Measurement operator 7 8 9 4 5 6 1 2 3 Model grid Measurement locations + + + X 1 x 2 x 3 ỹ 1 1 0 0 0 0 0 0 0 0 x 4 ỹ 2 = 0 0 0 0 1 0 0 0 0 x 5 ỹ 3 0 0 0 0 0 0 1 0 0 x 6 x 7 x 8 23 June 2015 IBG-3: Agrosphere x 9 13

Measurement operator 7 8 9 4 5 6 1 2 3 Model grid Measurement locations not at cell centers + + + X 1 x 2 x 3 ỹ 1 0.5 0.5 0 0 0 0 0 0 0 x 4 ỹ 2 = 0 0 1 0 0 0 0 0 0 x 5 ỹ 3 0 0 0 0 0.25 0.25 0 0.25 0.25 x 6 x 7 x 8 23 June 2015 IBG-3: Agrosphere x 9 14

Measurement operator 7 8 9 4 5 6 1 2 3 Model grid One remote sensing measurement X 1 x 2 x 3 ] [ [ỹ1 = 1 ] x 4 1 1 1 1 1 1 1 1 9 9 9 9 9 9 9 9 9 x 5 x 6 x 7 x 8 23 June 2015 IBG-3: Agrosphere x 9 15

Ensemble Kalman filter - Analysis scheme Updating equation x + i = x t i + K (y ỹ i ) x = model states (predicted) x = model states (updated) K = Kalman gain y = measurement ỹ = simulation at observation point i = model realization 23 June 2015 IBG-3: Agrosphere 16

Ensemble Kalman filter - Analysis scheme Kalman gain K = C xỹ (HC xỹ + R) 1 K = Kalman gain C xỹ = covariance matrix of states and simulated measurements R = measurement error covariance matrix 23 June 2015 IBG-3: Agrosphere 17

Covariance matrix 7 8 9 4 5 6 1 2 3 Model grid Measurement locations + + + C x1 ỹ 1 C x1 ỹ 2 C x1 ỹ 3 C x2 ỹ 1 C x2 ỹ 2 C x2 ỹ 3 C x3 ỹ 1 C x3 ỹ 2 C x3 ỹ 3 C x4 ỹ 1 C x4 ỹ 2 C x4 ỹ 3 C xỹ = C x5 ỹ 1 C x5 ỹ 2 C x5 ỹ 3 C x6 ỹ 1 C x6 ỹ 2 C x6 ỹ 3 C x7 ỹ 1 C x7 ỹ 2 C x7 ỹ 3 C x8 ỹ 1 C x8 ỹ 2 C x8 ỹ 3 C x9 ỹ 1 C x9 ỹ 2 C x9 ỹ 3 23 June 2015 IBG-3: Agrosphere 18

Covariance matrix 7 8 9 4 5 6 1 2 3 Model grid Measurement locations + + + C x1 ỹ 1 C x1 ỹ 2 C x1 ỹ 3 C x2 ỹ 1 C x2 ỹ 2 C x2 ỹ 3 C x3 ỹ 1 C x3 ỹ 2 C x3 ỹ 3 C x4 ỹ 1 C x4 ỹ 2 C x4 ỹ 3 C xỹ = C x5 ỹ 1 C x5 ỹ 2 C x5 ỹ 3 C x6 ỹ 1 C x6 ỹ 2 C x6 ỹ 3 C x7 ỹ 1 C x7 ỹ 2 C x7 ỹ 3 C x8 ỹ 1 C x8 ỹ 2 C x8 ỹ 3 HC xỹ = Cỹ1 ỹ 1 Cỹ1 ỹ 2 Cỹ1 ỹ 3 Cỹ2 ỹ 1 Cỹ2 ỹ 2 Cỹ2 ỹ 3 Cỹ3 ỹ 1 Cỹ3 ỹ 2 Cỹ3 ỹ 3 C ɛ1 ɛ 1 C ɛ1 ɛ 2 C ɛ1 ɛ 3 R = C ɛ2 ɛ 1 C ɛ2 ɛ 2 C ɛ2 ɛ 3 C ɛ3 ɛ 1 C ɛ3 ɛ 2 C ɛ3 ɛ 3 C x9 ỹ 1 C x9 ỹ 2 C x9 ỹ 3 23 June 2015 IBG-3: Agrosphere 18

Kalman gain Kalman gain weigths model prediciton uncertainty and measurement uncertainty. For a scalar (one point): K = σ2 sim σ 2 sim +σ2 obs Model prediction uncertainty estimated by covariance matrix C xỹ (from the ensemble) Kalman gain matrix also determines how a measurement affects the surroundings and corrects surrounding states: Depending on the strength of spatial correlation, a measurement might correct the states in the neighbourhood strongly, or only weakly Spatial correlations depend on model physics, but also correlations of static parameters like land use or soil properties 23 June 2015 IBG-3: Agrosphere 19

EnKF update Filter performance dependent on: Relation between model uncertainty and measurement uncertainty Update frequency Ensemble size Amount of observations 23 June 2015 IBG-3: Agrosphere 20

Joint state-parameter update Measurement data can also be used to update model parameters jointly with the states 23 June 2015 IBG-3: Agrosphere 21

Joint state-parameter update Measurement data can also be used to update model parameters jointly with the states Parameters [ ] are appended to the state vector: s x = p x = state-parameter vector s = state vector p = parameter vector 23 June 2015 IBG-3: Agrosphere 21

Joint state-parameter update Measurement data can also be used to update model parameters jointly with the states Parameters [ ] are appended to the state vector: s x = p x = state-parameter vector s = state vector p = parameter vector Covariance matrix C xỹ then also contains covariances between states and parameters 23 June 2015 IBG-3: Agrosphere 21

Joint state-parameter update Measurement data can also be used to update model parameters jointly with the states Parameters [ ] are appended to the state vector: s x = p x = state-parameter vector s = state vector p = parameter vector Covariance matrix C xỹ then also contains covariances between states and parameters Measurements are used to update parameters indirectly 23 June 2015 IBG-3: Agrosphere 21

Data assimilation software 23 June 2015 IBG-3: Agrosphere 22

Data assimilation software Data Assimilation Research Testbed (DART) https://www.image.ucar.edu/dares/dart/ Parallel Data Assimilation Framework (PDAF) http://pdaf.awi.de/trac/wiki OpenDA http://www.openda.org/joomla/index.php 23 June 2015 IBG-3: Agrosphere 22

Data assimilation software Data Assimilation Research Testbed (DART) https://www.image.ucar.edu/dares/dart/ Parallel Data Assimilation Framework (PDAF) http://pdaf.awi.de/trac/wiki OpenDA http://www.openda.org/joomla/index.php Differ with respect to e.g.: Implementation strategy (Fortran, Java,...) Available filter methods Model coupling Parallelism Additional utilities (localization, covariance inflation, measurement operators,...) 23 June 2015 IBG-3: Agrosphere 22

Coupling to DA software User needs to link model with DA and provide certain functionality Allow ensemble propagation of different model realizations Extract state(-parameter) vector from model output Provide measurements and measurement operators Redirect updated states vector (parameters) to model input for next time step 23 June 2015 IBG-3: Agrosphere 23

Coupling to DA software Offline coupling Data transfer between model and DA module via input/output files Requires utilties to read/write the required input/output files Program could be proprietary (no source code needed) Easy to implement Performance degradation due to high I/O 23 June 2015 IBG-3: Agrosphere 24

Coupling to DA software Online coupling Data transfer between model and DA module via main memory DA module is wrapped around program Source code required Programming effort depends on model Usually faster than offline coupling 23 June 2015 IBG-3: Agrosphere 25

Lorenz 63 system dx dt dy dt dz dt = σ(y x) (1) = x(ρ z) y (2) = xy βz (3) x : convective flow y : horizontal temperature distribution z : vertical temperature distribution σ : viscosity / thermal conductivity ρ : temperature difference top/bottom β : width to height ratio of cell 0 10 20 30 40 50 30 20 10 20 10 0 10 20 30 20 10 0 23 June 2015 IBG-3: Agrosphere 26 X Z Y