Practical Aspects of Ensemble-based Kalman Filters


Lars Nerger
Alfred Wegener Institute for Polar and Marine Research, Bremerhaven, Germany
and Bremen Supercomputing Competence Center BremHLR, Bremen, Germany
lars.nerger@awi.de

Outline
Aspects:
- Computing
- Analysis formulation
- Localization
Collaborations:
- AWI: W. Hiller, J. Schröter, S. Loza, P. Kirchgessner, T. Janjic (now DWD)
- BSH: F. Janssen, S. Massmann
- Bremen University: A. Bunse-Gerstner

The Problem

Application Example
Forecasting in North & Baltic Seas: combine model and observations for an optimal initial condition.
[Figure: maps of model surface temperature ("Information: Model") and satellite surface temperature ("Information: Observation"). Labels: Norway, Sweden, Finland, UK, Oberwolfach.]
- State vector size: 2.6 x 10^6 (4 fields 3D, 1 field 2D)
- Observations: 10000 - 37000 (surface temperature only)
- Ensemble size: 8
S. Loza et al., Journal of Marine Systems 105 (2012) 152-162

Forecast deviation from satellite data
[Figure: RMS and bias of the forecast deviation from satellite data; cases "No assimilation" and "Assimilation".]
Improvements also sub-surface and in other fields.

Data Assimilation
Problem: Estimate the model state (trajectory) from
- a guess at the initial time
- the model dynamics
- observational data
Characteristics of the system:
- approximated by discretized differential equations
- high dimension: O(10^7 - 10^9)
- sparse observations
- non-linear
Current standard methods: optimization algorithms ("4DVar")
This talk: ensemble-based estimation algorithms

Ensemble-based Kalman Filter
First formulated by G. Evensen (EnKF, 1994).
Kalman filter: express probability distributions by mean and covariance matrix.
EnKF: use ensembles to represent probability distributions.
[Figure: schematic of the filter cycle over time 0, time 1 and time 2. Labels: initial sampling, state estimate, ensemble forecast, forecast, observation, analysis, ensemble transformation.]
Looks trivial! BUT: There are many possible choices!

Computational and Practical Issues
Data assimilation with ensemble-based Kalman filters is costly!
- Memory: huge amount of memory required (model fields and ensemble matrix)
- Computing: huge requirement of computing time (ensemble integrations)
- Parallelism: natural parallelism of the ensemble integration exists (needs to be implemented)
- "Fixes": filter algorithms do not work in their pure form ("fixes" and tuning are needed), because the Kalman filter is optimal only in the linear case
Lars Nerger, Ensemble square-root KFs

What we are looking for
Goal: Find the assimilation method with
- smallest estimation error
- most accurate error estimate
- least computational cost
- least tuning
Want to understand and improve performance.
Difficulty:
- Optimality of the Kalman filter is well known for linear systems
- No optimality for non-linear systems
- Limited analytical possibilities, so apply methods to test problems

Computing

Logical separation of assimilation system
[Figure: components of the assimilation system within a single program.
Filter (core of PDAF): initialization, analysis, re-initialization.
Model: initialization, time integration, post processing.
Observations: obs. vector, obs. operator, obs. error.
Exchanged data labels: state, time, observations, mesh data. Legend: explicit interface; indirect exchange (module/common).]
Nerger, L., Hiller, W. (2012). Software for Ensemble-based DA Systems: Implementation and Scalability. Computers and Geosciences. In press. doi:10.1016/j.cageo.2012.03.026

PDAF: A tool for data assimilation
PDAF - Parallel Data Assimilation Framework
- a software to provide assimilation methods
- an environment for ensemble assimilation
- for testing algorithms and real applications
- usable with virtually any numerical model
- also: apply identical methods to different models; test the influence of different observations
- makes good use of supercomputers (Fortran and MPI; tested on up to 4800 processors)
More information and source code available at http://pdaf.awi.de

Analysis Formulations

Ensemble-based/error-subspace Kalman filters
A little "zoo" (not complete): EnKF(94/98), RRSQRT, ROEK, SEEK, SEIK, EnKF(2003), EnKF(2004), EAKF, EnSRF, ETKF, MLEF, SPKF, ESSE, RHF, anamorphosis, ESTKF
- Some of these were studied in Nerger et al. (2005)
- New study: ESTKF (Nerger et al. 2012)
- Properties and differences are hardly understood
- Learn from studying relations and differences
New filter formulation ESTKF: L. Nerger et al., Monthly Weather Review 140 (2012) 2335-2345

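Model Equations
Stochastic dynamic model:
$x^t_i = M_{i,i-1}[x^t_{i-1}] + \eta_i, \quad x^t_i, \eta_i \in \mathbb{R}^n$
Stochastic observation model:
$y_k = H_k[x^t_k] + \epsilon_k, \quad y_k, \epsilon_k \in \mathbb{R}^m$
Model error:
$\eta_i \sim N(0, Q_i); \quad \langle \eta_i \eta_j^T \rangle = Q_i \delta_{ij}$
Observation error:
$\epsilon_k \sim N(0, R_k); \quad \langle \epsilon_k \epsilon_l^T \rangle = R_k \delta_{kl}$
Assumptions:
$x^t_i \sim N(\bar{x}^t_i, P_i); \quad \langle \eta_k \epsilon_k^T \rangle = 0; \quad \langle \eta_i (x^t_i)^T \rangle = 0; \quad \langle \epsilon_k (x^t_k)^T \rangle = 0$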

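The Ensemble Kalman Filter (EnKF, Evensen 94)
Initialization: Generate a random ensemble; the ensemble statistics approximate the state estimate (mean) and covariance.
Forecast:
$x^{f(l)}_i = M_{i,i-1}[x^{a(l)}_{i-1}] + \eta^{(l)}_i$
Analysis:
$x^{a(l)}_k = x^{f(l)}_k + K_k \left( y^{(l)}_k - H_k x^{f(l)}_k \right)$
Kalman filter gain:
$K_k = P^f_k H_k^T \left( H_k P^f_k H_k^T + R_k \right)^{-1}$
Ensemble covariance and analysis mean:
$P^f_k := \frac{1}{N-1} \sum_{l=1}^{N} \left( x^{f(l)}_k - \bar{x}^f_k \right) \left( x^{f(l)}_k - \bar{x}^f_k \right)^T$
$\bar{x}^a_k := \frac{1}{N} \sum_{l=1}^{N} x^{a(l)}_k$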
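The analysis step above can be sketched in a few lines of NumPy. This is a minimal illustration, assuming a linear observation operator given as a matrix and a small state dimension so that the covariance can be formed explicitly; the function name `enkf_analysis` and its arguments are hypothetical and are not taken from PDAF.

```python
import numpy as np

def enkf_analysis(Xf, y, H, R, rng):
    """Stochastic EnKF analysis with perturbed observations (illustrative sketch).

    Xf: forecast ensemble (n x N), y: observation vector (m,),
    H: linear observation operator (m x n), R: observation error covariance (m x m).
    Returns the analysis ensemble (n x N) and the ensemble of observations (m x N).
    """
    N = Xf.shape[1]
    # Sample covariance P^f from the ensemble perturbations
    Xp = Xf - Xf.mean(axis=1, keepdims=True)
    Pf = Xp @ Xp.T / (N - 1)
    # Kalman gain K = P^f H^T (H P^f H^T + R)^{-1}; S and P^f are symmetric
    S = H @ Pf @ H.T + R
    K = np.linalg.solve(S, H @ Pf).T
    # Ensemble of observations: y^(l) = y + eps^(l), eps^(l) ~ N(0, R)
    Y = y[:, None] + rng.multivariate_normal(np.zeros(len(y)), R, size=N).T
    # Update every member with its own perturbed observation
    Xa = Xf + K @ (Y - H @ Xf)
    return Xa, Y

# Usage with made-up numbers
rng = np.random.default_rng(0)
Xf = rng.normal(size=(3, 50))
Xa, Y = enkf_analysis(Xf, np.array([1.0, 2.0, 3.0]), np.eye(3), 0.1 * np.eye(3), rng)
```

In a high-dimensional application the matrix P^f would never be formed explicitly; the sketch does so only to stay close to the formulas.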

Issues of the EnKF94
Monte Carlo method:
- ensemble of observations required (samples matrix R; introduces sampling error)
Inversion of a large matrix:
- $H_k P^f_k H_k^T + R_k \in \mathbb{R}^{m \times m}$ (can be singular, possibly large differences in eigenvalues > 0)
Alternative: compute the analysis in the space spanned by the ensemble.
Methods: Ensemble Square-Root Kalman Filters, e.g. SEIK (Pham et al., 1998), ETKF (Bishop et al., 2001)

Ensemble Transform Kalman Filter - ETKF
Ensemble perturbation matrix, size (n x N):
$X'_k := X_k - \bar{X}_k$
Analysis covariance matrix (n x n):
$P^a = X'^f A (X'^f)^T$
Transform matrix in ensemble space (N x N):
$A^{-1} := (N-1) I + (H X'^f)^T R^{-1} H X'^f$
Ensemble transformation (n x N):
$X'^a = X'^f W^{ETKF}$
Ensemble weight matrix (N x N):
$W^{ETKF} := \sqrt{N-1}\, C \Lambda$
- $C C^T = A$ (symmetric square root)
- $\Lambda$ is the identity or a random orthogonal matrix with eigenvector $(1, \ldots, 1)^T$
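The ETKF formulas can be sketched as follows. The perturbation update follows the slide with Λ = I; the update of the ensemble mean is not shown on the slide and is written here in the usual Kalman form expressed in ensemble space, which is an assumption of this sketch. The function name `etkf_analysis`, a linear operator `H` and a symmetric `R` are also assumptions.

```python
import numpy as np

def etkf_analysis(Xf, y, H, R):
    """ETKF analysis with symmetric square root and Lambda = I (illustrative sketch)."""
    N = Xf.shape[1]
    xf_mean = Xf.mean(axis=1)
    Xp = Xf - xf_mean[:, None]                 # X'^f, size n x N
    HXp = H @ Xp                               # observed perturbations, m x N
    RinvHXp = np.linalg.solve(R, HXp)          # R^{-1} H X'^f
    # A^{-1} = (N-1) I + (H X'^f)^T R^{-1} H X'^f, size N x N
    Ainv = (N - 1) * np.eye(N) + HXp.T @ RinvHXp
    # Eigendecomposition A^{-1} = U diag(s) U^T gives A and its symmetric square root
    s, U = np.linalg.eigh(Ainv)
    A = (U / s) @ U.T
    C = (U / np.sqrt(s)) @ U.T                 # C C^T = A
    # Mean update in ensemble space (assumed standard form, R symmetric)
    xa_mean = xf_mean + Xp @ (A @ RinvHXp.T @ (y - H @ xf_mean))
    # Perturbation update X'^a = X'^f W with W = sqrt(N-1) C
    W = np.sqrt(N - 1) * C
    return xa_mean[:, None] + Xp @ W
```

Only an N x N eigenproblem is solved, which is the point of computing the analysis in the space spanned by the ensemble instead of inverting an m x m matrix.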

SEIK Filter
Error-subspace basis matrix, size (n x N-1):
$L := X^f T$
(T subtracts the ensemble mean and removes the last column)
Analysis covariance matrix (n x n):
$P^a = L \tilde{A} L^T$
Transform matrix in error subspace (N-1 x N-1):
$\tilde{A}^{-1} := (N-1) T^T T + (H L)^T R^{-1} H L$
Ensemble transformation (n x N):
$X'^a = L W^{SEIK}$
Ensemble weight matrix (N-1 x N):
$W^{SEIK} := \sqrt{N-1}\, C \Omega^T$
- $C$ is a square root of $\tilde{A}$ (originally Cholesky decomposition)
- $\Omega^T$ is the transformation from N-1 to N (random or deterministic)

Weight Matrices (W in $X'^a = X'^f W$)
[Figure: transformation matrices for ETKF and for SEIK with Cholesky square root ("SEIK chol").]
ETKF:
- main contribution from the diagonal (minimum transformation)
- off-diagonals of similar weight
- minimum change in the distribution of ensemble variance
SEIK with Cholesky sqrt:
- main contribution from the diagonal
- off-diagonals with strongly varying weights
- changes the distribution of variance in the ensemble

Transformation Matrix of SEIK/symmetric sqrt
[Figure: transformation matrix for SEIK with symmetric square root ("SEIK sym"), and the difference of the SEIK and ETKF transformation matrices (color scale x 10^-3).]
- Transformation matrices of ETKF and SEIK-sym are very similar
- Largest difference for the last ensemble member
(Experiments with the Lorenz96 model: this can lead to a smaller ensemble variance of this member)

SEIK depends on ensemble order
Switch the last two ensemble members.
[Figure: "SEIK sym: Difference of transformation matrices" (color scale x 10^-3); the last two columns and rows are switched back for comparison.]
The ensemble transformation depends on the order of the ensemble members (for ETKF the difference is 10^-15).
Statistically fine, but not desirable!

Revised T matrix
Identical transformations require a different projection matrix for SEIK.
For SEIK: T subtracts the ensemble mean and drops the last column. This causes the dependence on the order of ensemble members!
Solution:
- Redefine T: distribute the last member over the first N-1 columns, giving a new $\hat{T}$ with $L := X^f \hat{T}$
- Also replace $\Omega$ by the new $\hat{T}$
New filter formulation: Error Subspace Transform Kalman Filter (ESTKF)

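T-matrix in SEIK and ESTKF
SEIK:
$T_{i,j} = \begin{cases} 1 - \frac{1}{N} & \text{for } i = j,\ i < N \\ -\frac{1}{N} & \text{for } i \neq j,\ i < N \\ -\frac{1}{N} & \text{for } i = N \end{cases}$
ESTKF:
$\hat{T}_{i,j} = \begin{cases} 1 - \frac{1}{N} \frac{1}{\frac{1}{\sqrt{N}} + 1} & \text{for } i = j,\ i < N \\ -\frac{1}{N} \frac{1}{\frac{1}{\sqrt{N}} + 1} & \text{for } i \neq j,\ i < N \\ -\frac{1}{\sqrt{N}} & \text{for } i = N \end{cases}$
Efficient implementation as subtraction of means & last column.
ETKF: improve compute performance using a matrix T.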
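The two projection matrices can be built explicitly to check their properties. The sketch below forms them as dense N x (N-1) arrays for illustration only (the slide notes that an efficient implementation subtracts means instead); the function names `seik_T` and `estkf_T` are hypothetical.

```python
import numpy as np

def seik_T(N):
    """SEIK matrix T (N x N-1): subtracts the ensemble mean and drops the last column."""
    T = -np.ones((N, N - 1)) / N
    T[:N - 1, :] += np.eye(N - 1)       # diagonal entries become 1 - 1/N
    return T

def estkf_T(N):
    """ESTKF matrix T-hat (N x N-1): distributes the last member over the first N-1 columns."""
    a = 1.0 / (N * (1.0 / np.sqrt(N) + 1.0))
    T = -a * np.ones((N, N - 1))
    T[:N - 1, :] += np.eye(N - 1)       # diagonal entries become 1 - a
    T[N - 1, :] = -1.0 / np.sqrt(N)     # last row
    return T

# Usage: X @ seik_T(N) gives the perturbations of the first N-1 members
N = 5
X = np.random.default_rng(0).normal(size=(3, N))
L_seik = X @ seik_T(N)
L_estkf = X @ estkf_T(N)
```

With these definitions the columns of T-hat are orthonormal and orthogonal to the vector of ones, so no single ensemble member is treated differently from the others.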

ESTKF: New filter with identical transformation as ETKF
New filter ESTKF, properties like ETKF:
- Minimum transformation
- Transformation independent of ensemble order
But:
- analysis computed in dimension N-1
- direct access to error subspace
- smaller condition number of A
L. Nerger et al., Monthly Weather Review 140 (2012) 2335-2345
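The claim that ESTKF and ETKF apply the same transformation can be checked numerically. The sketch below assumes a linear observation operator, a symmetric square root in both filters, Λ = I in the ETKF, and the ESTKF transform matrix written as (N-1) I + (HL)^T R^{-1} HL, which uses the orthonormal columns of T-hat. All function names are hypothetical.

```python
import numpy as np

def sym_sqrt_of_inverse(M):
    """Symmetric C with C C^T = M^{-1} for a symmetric positive definite M."""
    s, U = np.linalg.eigh(M)
    return (U / np.sqrt(s)) @ U.T

def etkf_perturbations(Xf, H, R):
    """Analysis perturbations X'^a of the ETKF (computed in dimension N)."""
    N = Xf.shape[1]
    Xp = Xf - Xf.mean(axis=1, keepdims=True)
    HXp = H @ Xp
    Ainv = (N - 1) * np.eye(N) + HXp.T @ np.linalg.solve(R, HXp)
    return Xp @ (np.sqrt(N - 1) * sym_sqrt_of_inverse(Ainv))

def estkf_perturbations(Xf, H, R):
    """Analysis perturbations X'^a of the ESTKF (computed in dimension N-1)."""
    N = Xf.shape[1]
    a = 1.0 / (N * (1.0 / np.sqrt(N) + 1.0))
    That = -a * np.ones((N, N - 1))
    That[:N - 1, :] += np.eye(N - 1)
    That[N - 1, :] = -1.0 / np.sqrt(N)
    L = Xf @ That                                   # error-subspace basis, n x (N-1)
    HL = H @ L
    Ainv = (N - 1) * np.eye(N - 1) + HL.T @ np.linalg.solve(R, HL)
    W = np.sqrt(N - 1) * sym_sqrt_of_inverse(Ainv) @ That.T
    return L @ W

# Usage with made-up numbers: both filters give the same perturbations
rng = np.random.default_rng(0)
Xf, H, R = rng.normal(size=(5, 7)), rng.normal(size=(3, 5)), np.diag([0.5, 1.0, 2.0])
same = np.allclose(etkf_perturbations(Xf, H, R), estkf_perturbations(Xf, H, R))
```

The ESTKF solves an eigenproblem of size N-1 instead of N, which corresponds to the "analysis computed in dimension N-1" point above.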

Localization

Localization: Why and how?
The combination of observations and model state is based on estimated error covariance matrices.
Finite ensemble size leads to significant sampling errors, particularly for small covariances!
[Figure: Example "Sampling error and localization": true, sampled and localized covariance as a function of distance.]
Localization:
- removes estimated long-range correlations
- increases the degrees of freedom for the analysis (globally, not locally!)
- increases the size of the analysis correction

Global vs. local SEIK, N=32 (March 1993)
[Figure: maps of the error reduction by assimilation. Global SEIK: error reduced to 83.6%. Local SEIK: error reduced to 31.7%.]
- Improvement is the error reduction by assimilation
- Localization extends improvements into regions not improved by the global SEIK
- Regions with error increase are diminished for the local SEIK
- Underestimation of errors is reduced by localization
L. Nerger et al., Ocean Dynamics 56 (2006) 634

Localization Types
Simplified analysis equation:
$x^a = x^f + \frac{P^f}{P^f + R} \left( y - x^f \right)$
Covariance localization:
- Modify covariances in the forecast covariance matrix $P^f$
- Element-wise product with a correlation matrix of compact support
- Requires that $P^f$ is computed (not in ETKF or SEIK)
- E.g.: Houtekamer/Mitchell (1998, 2001), Whitaker/Hamill (2002), Keppenne/Rienecker (2002)
Observation localization:
- Modify the observation error covariance matrix R
- Needs the distance of the observation (achieved by local analysis or domain localization)
- Possible in all filter formulations
- E.g.: Evensen (2003), Ott et al. (2004), Nerger/Gregg (2007), Hunt et al. (2007)
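Both localization types can be illustrated with one weight function. The slides do not name a specific function; the sketch below assumes the fifth-order piecewise rational function of Gaspari and Cohn (1999), a common compactly supported choice, and assumes that observation localization divides the observation error variances by the weight. The function names are hypothetical.

```python
import numpy as np

def gaspari_cohn(d, c):
    """Compactly supported weight: 1 at distance 0, exactly 0 for distances >= 2c."""
    r = np.abs(np.asarray(d, dtype=float)) / c
    inner = -0.25 * r**5 + 0.5 * r**4 + 0.625 * r**3 - (5.0 / 3.0) * r**2 + 1.0
    # np.maximum avoids a division by zero; this branch is only used for r > 1
    outer = (r**5 / 12.0 - 0.5 * r**4 + 0.625 * r**3 + (5.0 / 3.0) * r**2
             - 5.0 * r + 4.0 - 2.0 / (3.0 * np.maximum(r, 1.0)))
    return np.where(r <= 1.0, inner, np.where(r < 2.0, outer, 0.0))

def localize_covariance(P, dist, c):
    """Covariance localization: element-wise product of P with the weight matrix."""
    return P * gaspari_cohn(dist, c)

def localize_observations(r_var, obs_dist, c):
    """Observation localization: keep observations with weight > 0 and
    inflate their error variances by 1/weight."""
    w = gaspari_cohn(obs_dist, c)
    keep = w > 0.0
    return keep, np.asarray(r_var, dtype=float)[keep] / w[keep]

# Usage on a 1D grid with made-up numbers
idx = np.arange(10)
dist = np.abs(idx[:, None] - idx[None, :])
P_loc = localize_covariance(np.ones((10, 10)), dist, c=2.0)
keep, r_loc = localize_observations([1.0, 1.0, 1.0], [0.0, 5.0, 20.0], c=5.0)
```

Covariance localization needs the matrix P explicitly, whereas observation localization only needs the distance of each observation from the analysis location, which is why it fits filters such as ETKF and SEIK that never form P.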

Local SEIK filter: domain & observation localization
Local analysis:
- Update small regions (like single vertical columns)
- Observation localization: observations are weighted according to distance
- Consider only observations with weight > 0
- State update and ensemble transformation are fully local
[Figure: S: analysis region; D: corresponding data region.]
Similar to localization in the LETKF (e.g. Hunt et al., 2007)
L. Nerger et al., Ocean Dynamics 56 (2006) 634
L. Nerger & W.W. Gregg, J. Mar. Syst. 68 (2007) 237

Different effect of localization methods
Experimental result: twin experiment with the simple Lorenz96 model.
Covariance localization is better than observation localization (also reported by Greybush et al. (2011) with another model).
[Figure: time-mean RMS errors as a function of forgetting factor (inflation) and support radius (localization radius). Panels: covariance localization ("EnKF sqrt, obs. error=0.5") and observation localization ("LSEIK fix, obs. error=0.5").]
T. Janjic et al., Mon. Wea. Rev. 139 (2011) 2046-2060

Different effect of localization methods (cont.)
Larger differences for smaller observation errors.
[Figure: time-mean RMS errors as a function of forgetting factor (inflation) and support radius (localization radius). Panels: covariance localization ("EnKF sqrt, obs. error=0.1") and observation localization ("LSEIK fix, obs. error=0.1").]

Covariance vs. Observation Localization
Some published findings:
- Both methods are similar
- Slightly smaller width required for observation localization
But note for observation localization:
- The effective localization length depends on the errors of state and observations
- Small observation error leads to wide localization
- Possibly problematic: in the initial transient phase of the assimilation; if large state errors are estimated locally
[Figure: prescribed weight and effective weight as functions of distance, for P=1, R=1 and for P=1, R=0.1. P: state error variance; R: observation error variance.]
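The dependence of the effective localization on P and R can be made concrete for the simplest case. The sketch below assumes a single observation and a scalar state: observation localization with prescribed weight w replaces R by R/w, so the gain becomes wP/(wP + R), and the effective weight is taken as the ratio of this gain to the unlocalized gain P/(P + R). The function name `effective_weight` and this definition are assumptions of the sketch.

```python
import numpy as np

def effective_weight(w, P, R):
    """Effective weight of observation localization for a single observation.

    w: prescribed localization weight in [0, 1]
    P: state error variance, R: observation error variance
    """
    w = np.asarray(w, dtype=float)
    gain_localized = w * P / (w * P + R)   # gain with R replaced by R / w
    gain_full = P / (P + R)                # gain without localization
    return gain_localized / gain_full

# Usage: the same prescribed weight acts more weakly when R is small
w_small_R = effective_weight(0.5, P=1.0, R=0.1)
w_large_R = effective_weight(0.5, P=1.0, R=1.0)
```

For R much smaller than P the effective weight stays close to 1 over most of the support, which is the "wide localization" noted above.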

Regulated Localization
New localization function for observation localization, formulated to keep the effective length constant (exact for a single observation):
$w^{OLR} = \frac{w^{CL} \sigma_R^2}{H P H^T + \sigma_R^2} \left( 1 - \frac{w^{CL} H P H^T}{H P H^T + \sigma_R^2} \right)^{-1}$
- depends on state and observation errors
- depends on a fixed localization function
- cheap to compute for each observation
- only exact for a single observation, but works for multiple
[Figure: weight as a function of distance for w fixed and for w reg with R=10, R=1.0 and R=0.1 (P=1).]
L. Nerger et al., Q. J. Royal Meteorol. Soc. 138 (2012) 802-812
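The regulated weight can be evaluated directly from the formula. The sketch below assumes a single observation with scalar HPH^T and observation error variance σ_R²; the function name `regulated_weight` and the argument names are hypothetical.

```python
import numpy as np

def regulated_weight(w_cl, hph, r):
    """Regulated observation-localization weight w^OLR for a single observation.

    w_cl: fixed (covariance-localization) weight in [0, 1]
    hph:  state error variance in observation space, H P H^T
    r:    observation error variance, sigma_R^2
    """
    w_cl = np.asarray(w_cl, dtype=float)
    return (w_cl * r / (hph + r)) / (1.0 - w_cl * hph / (hph + r))

# Usage with made-up numbers: observation localization with the regulated
# weight reproduces the gain of covariance localization with weight w_cl
w_cl, hph, r = 0.5, 1.0, 0.1
w_reg = regulated_weight(w_cl, hph, r)
gain_obs_loc = w_reg * hph / (w_reg * hph + r)   # R replaced by R / w_reg
gain_cov_loc = w_cl * hph / (hph + r)            # gain scaled by w_cl
```

Each weight costs only a few scalar operations per observation, in line with the "cheap to compute" remark above.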

Lorenz96 Experiment: Regulated Localization
[Figure: time-mean RMS errors as a function of forgetting factor and support radius. Panels: "Fixed loc., N=10, R=0.5" ("LSEIK fix, obs. error=0.5"), "Covariance localization" ("EnKF sqrt") and "Regulated localization, N=10, R=0.5" ("LSEIK reg, obs. error=0.5").]
- Reduced minimum rms errors
- Increased stability region
- Still need to test in a real application
- The description of the effective localization length explains the findings of other studies!

Summary
- Ensemble-based KFs are not exact. But they work!
- Improve methods: least cost; least tuning; best state and error estimates
- Study relations for improvements: efficient analysis formulations; efficient localization
Thank you! Lars.Nerger@awi.de