R. Wójcik, P.A. Troch, H. Stricker, P.J.J.F. Torfs, E.F. Wood, H. Su, and Z. Su (2004), Mixtures of Gaussians for uncertainty description in latent

Similar documents
D. Lohmann, Pablo Grunman, and Kenneth Mitchell (2004), Land data assimilation at NOAA/NCEP/EMC, in Proceedings of the 2 nd international CAHMDA

Ate Visser, Roelof Stuurman, and Marc F.P. Bierkens (2004), Real time forecasting of water table depth and soil moisture profiles, in Proceedings of

Land Data Assimilation at NCEP NLDAS Project Overview, ECMWF HEPEX 2004

K. Schulz, I. Andrä, V. Stauch, and A.J. Jarvis (2004), Scale dependent SVAT-model development towards assimilation of remotely sensed information

Performance of Radar Wind Profilers, Radiosondes, and Surface Flux Stations at the Southern Great Plains (SGP) Cloud and Radiation Testbed (CART) Site

Evaporative Fraction and Bulk Transfer Coefficients Estimate through Radiometric Surface Temperature Assimilation

P1.20 MICROWAVE LAND EMISSIVITY OVER COMPLEX TERRAIN: APPLIED TO TEMPERATURE PROFILING WITH NOGAPS ABSTRACT

Gaussian Process Approximations of Stochastic Differential Equations

A data-driven method for improving the correlation estimation in serial ensemble Kalman filter

Dynamic Inference of Background Error Correlation between Surface Skin and Air Temperature

1. Current atmospheric DA systems 2. Coupling surface/atmospheric DA 3. Trends & ideas

GRID RAINFALL DISAGGREGATION TOWARD A PATCH-BASED ENSEMBLE KALMAN FILTER FOR SOIL MOISTURE DATA ASSIMILATION

Polar Weather Prediction

EUMETSAT LSA-SAF EVAPOTRANSPIRATION PRODUCTS STATUS AND PERSPECTIVES

The Planetary Boundary Layer and Uncertainty in Lower Boundary Conditions

An Intercomparison of Single-Column Model Simulations of Summertime Midlatitude Continental Convection

Towards a probabilistic hydrological forecasting and data assimilation system. Henrik Madsen DHI, Denmark

Soil Moisture Prediction and Assimilation

Stochastic methods for representing atmospheric model uncertainties in ECMWF's IFS model

May 3, :41 AOGS - AS 9in x 6in b951-v16-ch13 LAND SURFACE ENERGY BUDGET OVER THE TIBETAN PLATEAU BASED ON SATELLITE REMOTE SENSING DATA

J5.8 ESTIMATES OF BOUNDARY LAYER PROFILES BY MEANS OF ENSEMBLE-FILTER ASSIMILATION OF NEAR SURFACE OBSERVATIONS IN A PARAMETERIZED PBL

Density Propagation for Continuous Temporal Chains Generative and Discriminative Models

Density Estimation. Seungjin Choi

Data Assimilation Research Testbed Tutorial

Apport altimétrie pour modélisation hydrologique des grands fleuves

The Two Source Energy Balance model using satellite, airborne and proximal remote sensing

John R. Mecikalski #1, Martha C. Anderson*, Ryan D. Torn #, John M. Norman*, George R. Diak #

At A Glance. UQ16 Mobile App.

Relative Merits of 4D-Var and Ensemble Kalman Filter

A methodology for estimation of surface evapotranspiration

Dynamic System Identification using HDMR-Bayesian Technique

Air sea satellite flux datasets and what they do (and don't) tell us about the air sea interface in the Southern Ocean

PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 13: SEQUENTIAL DATA

Using satellite-based remotely-sensed data to determine tropical cyclone size and structure characteristics

Multivariate verification: Motivation, Complexity, Examples

Estimation of Surface Fluxes of Carbon, Heat, Moisture and Momentum from Atmospheric Data Assimilation

Earth s Energy Budget: How Is the Temperature of Earth Controlled?

Regional dry-season climate changes due to three decades of Amazonian deforestation

Autonomous Navigation for Flying Robots

Name(s) Period Date. Earth s Energy Budget: How Is the Temperature of Earth Controlled?

ATMOS 5140 Lecture 1 Chapter 1

Using Data Assimilation to Explore Precipitation - Cloud System - Environment Interactions

Seasonal Variations of the Urban Heat Island Effect:

At-A-Glance SIAM Events Mobile App

Assessing the Radiative Impact of Clouds of Low Optical Depth

IMPACT EXPERIMENTS ON GMAO DATA ASSIMILATION AND FORECAST SYSTEMS WITH MODIS WINDS DURING MOWSAP. Lars Peter Riishojgaard and Yanqiu Zhu

Gaussian Process Approximations of Stochastic Differential Equations

Cross-validation methods for quality control, cloud screening, etc.

Motivation & Goal. We investigate a way to generate PDFs from a single deterministic run

REVISION OF THE STATEMENT OF GUIDANCE FOR GLOBAL NUMERICAL WEATHER PREDICTION. (Submitted by Dr. J. Eyre)

5. General Circulation Models

Ensembles and Particle Filters for Ocean Data Assimilation

) was measured using net radiometers. Soil heat flux (Q g

An Ensemble Land Surface Modeling and Assimilation Testbed for HEPEX

Lagrangian data assimilation for point vortex systems

Short tutorial on data assimilation

Drought Monitoring with Hydrological Modelling

Evaluation of a New Land Surface Model for JMA-GSM

Snow Data Assimilation

ECMWF global reanalyses: Resources for the wind energy community

Applications of an ensemble Kalman Filter to regional ocean modeling associated with the western boundary currents variations

Land Surface Temperature in the EUMETSAT LSA SAF: Current Service and Perspectives. Isabel Trigo

Forecasting and data assimilation

Sensor Tasking and Control

The Canadian approach to ensemble prediction

Lecture 2: From Linear Regression to Kalman Filter and Beyond

EnKF Review. P.L. Houtekamer 7th EnKF workshop Introduction to the EnKF. Challenges. The ultimate global EnKF algorithm

Consider the joint probability, P(x,y), shown as the contours in the figure above. P(x) is given by the integral of P(x,y) over all values of y.

11280 Electrical Resistivity Tomography Time-lapse Monitoring of Three-dimensional Synthetic Tracer Test Experiments

Radar data assimilation using a modular programming approach with the Ensemble Kalman Filter: preliminary results

Probabilistic fog forecasting with COSMO model

Lecture 2: From Linear Regression to Kalman Filter and Beyond

Assimilating terrestrial remote sensing data into carbon models: Some issues

Professor David B. Stephenson

Efficient Likelihood-Free Inference

Investigating the urban climate characteristics of two Hungarian cities with SURFEX/TEB land surface model

Local Ensemble Transform Kalman Filter

ESTIMATING CORRELATIONS FROM A COASTAL OCEAN MODEL FOR LOCALIZING AN ENSEMBLE TRANSFORM KALMAN FILTER

Gaussian process for nonstationary time series prediction

Data Assimilation Working Group

Predicting Climate Change

Remote Sensing Data Assimilation for a Prognostic Phenology Model

2234 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 52, NO. 8, AUGUST Nonparametric Hypothesis Tests for Statistical Dependency

Aircraft Validation of Infrared Emissivity derived from Advanced InfraRed Sounder Satellite Observations

The Kalman Filter. Data Assimilation & Inverse Problems from Weather Forecasting to Neuroscience. Sarah Dance

The National Operational Hydrologic Remote Sensing Center Operational Snow Analysis

The Global Flood Awareness System

DART_LAB Tutorial Section 2: How should observations impact an unobserved state variable? Multivariate assimilation.

Uncertainty propagation in a sequential model for flood forecasting

Data assimilation in the MIKE 11 Flood Forecasting system using Kalman filtering

Lecture: Gaussian Process Regression. STAT 6474 Instructor: Hongxiao Zhu

Convective-scale data assimilation in the Weather Research and Forecasting model using a nonlinear ensemble filter

Land data assimilation in the NASA GEOS-5 system: Status and challenges

Mobile Robot Localization

Smoothers: Types and Benchmarks

WATER AND ENERGY BALANCE ESTIMATION IN PUERTO RICO USING SATELLITE REMOTE SENSING

A PARTICLE FILTER APPROACH FOR SOLAR RADIATION ESTIMATE USING SATELLITE IMAGE AND IN SITU DATA

Remote sensing estimation of land surface evapotranspiration of typical river basins in China

On the Nature of Random System Matrices in Structural Dynamics

An ARM SCM Intercomparison Study-Overview and Preliminary Results for Case 1

Transcription:

R. Wójcik, P.A. Troch, H. Stricker, P.J.J.F. Torfs, E.F. Wood, H. Su, and Z. Su (4), Mixtures of Gaussians for uncertainty description in latent heat flux estimates from remotely sensed information, in Proceedings of the 2 nd international CAHMDA workshop on: The Terrestrial Water Cycle: Modelling and Data Assimilation Across Catchment Scales, edited by A.J. Teuling, H. Leijnse, P.A. Troch, J. Sheffield and E.F. Wood, pp. 41 44, Princeton, NJ, October 25 27

CAHMDA-II workshop Session 1 Wójcik Mixtures of Gaussians for uncertainty description in latent heat flux estimates from remotely sensed information R. Wójcik 1, P.A. Troch 1, H. Stricker 1, P.J.J.F. Torfs 1, E.F. Wood 2, H. Su 3, and Z. Su 3 1 Hydrology and Quantitative Water Management Group, Wageningen University, Wageningen, The Netherlands 2 Princeton University, Princeton, New Jersey, USA 3 Alterra Green World Research, Wageningen, The Netherlands Latent heat flux (LE) is the key variable that provides a link between energy and water budgets at the land surface. The conventional methods to estimate LE are based on point measurements of energy balance components and are representative only for very local scales. Recently a new class of techniques based on remotely sensed (RS) information has been developed to compute LE at scales from a point to a continent. Despite their potential, especially for regional and global hydrological applications, satellite-derived LE sat usually does not compare well with in-situ measured LE is. Both proxies of LE, however, contain the information about the true value of this quantity. The difficulty in inferring this information from data is due to different sources of uncertainty involved (e.g., measurement errors, scale problems, inadequacies in physical models that transform satellite observations into LE estimates). In this work we seek to investigate the use of non-parametric Gaussian mixture density models (GMDM s) to describe the conditional uncertainty of LE sat given LE is. This approach does not require any a priori assumptions on the form of the conditional density i.e. the algorithms we use in this study are completely data driven. An extra benefit from having the conditionals described by GMDM s is that they can further be applied to identify the recently developed non-linear Kalman filter for ensemble data assimilation (see Anderson and Anderson, 1999; Torfs et al., 2). This is the long run objective of this research. Data and methods LE is estimates used in this study come from seven Energy Balance Bowen Ratio (EBBR) ARM/CART stations (E15, E4, E9, E2, E7, E25, E8) distributed across the Southern Great Planes (SGP) region of the United States. These estimates are based on 3-min averaged observations. The LE sat estimates were obtained using SEBS (Surface Energy Balance System) developed by Su (2) and are based on instantaneous observations. The both types of LE proxies were obtained at 1 hourly resolution in the period of 1 July 1 3 September 1. To describe the conditional uncertainty of LE is given LE sat a joint probability density function (pdf) f needs first to be fitted to bivariate sample {LE sat,k ;LE is,k } K k=1. In this work the focus is on the use of GMDM s (see e.g. McLachlan and Peel, ) which are defined as linear combinations of Gaussian densities (see Figure 1.14), called components: f (x) = N c n=1 w n g (mn,c n )(x) (1.1) where x is a vector of variables, N c the number of components, g (mn,c n ) stands for the Gaussian density with mean m n and covariance C n. Here x = [LE sat LE is ] T. The w n s are the component weights and satisfy w n and w n = 1. Note that the conditional density f (LE is LE sat ) that is 41

Wójcik Session 1 CAHMDA-II workshop Figure 1.14: 1D and 2D example of GMDM (in both cases as a linear combination of 3 components). calculated from (1.1) is also a GMDM. To fit (1.1) to the data the procedure of Figueiredo and Jain (2) was applied. First attempt To apply the above described methodology we derived the estimates of LE sat by forcing SEBS with the following instantaneous RS inputs: short wave radiation derived from a 5 km GOES product and 1/8 degree GOES surface temperature. The rest of the input variables that were needed to run SEBS (see Su, 2) was either measured or taken from LDAS database. Next we grouped LE sat and LE is data according to landuse. Our hypothesis here is that at regional scale the bivariate dependency structure should be invariant within a particular landuse class. Moreover, this step is intended to tackle the dimensionality reduction issue in non-linear ensemble Kalman filters as described by Anderson and Anderson (1999) and Torfs et al. (2). Figure 1.15 shows the result of this operation. It can be seen in the figure that the dependency pattern between LE is and LE sat is not really visible. Thus, the data in Figure 1.15 would be of little use for data assimilation purpose. The blurring effect might be due to undersampling which stems from the fact that data availability of GOES temperature is greatly affected by the cloud cover and the algorithm that is used to retrieve the surface temperature. Moreover, there is a spatial and temporal scaling problem involved (we compare point values with 1/8 decimal degree values), there is a measurement error in LE is values and there is an error in LE sat values. The latter might be a combination of errors in RS inputs to SEBS and limitations of SEBS itself to reflect the complicated physical situation in the near-surface layer of air. In what follows we address this issue by performing Monte-Carlo sensitivity analysis of SEBS to two RS inputs that in our view greatly influence the quality of LE sat estimates: net radiation and surface temperature. E15_E9 ( cropland ) E7_E25_E2 ( open shrubland ) E8_E4 ( grassland ) SEBS LE [W/m2] SEBS LE [W/m2] SEBS LE [W/m2] OBSERVED LE [W/m2] OBSERVED LE [W/m2] OBSERVED LE [W/m2] Figure 1.15: Bivariate 1-hourly LE data grouped according to landuse for the period 1 July 1 3 September 1. LE sat estimates are derived from GOES products. 42

CAHMDA-II workshop Session 1 1 1 E15_E9 ( cropland ) E7_E25_E2 ( open shrubland ) E8_E4 ( grassland ) 1 1 1 1 1 1 1 1 E8_E4 (grassland) E7_E25_E2 (open shrubland) E15_E9 (cropland) Wójcik 1 1 Figure 1.16: Bivariate 1-hourly LE data grouped according to landuse for the period 1 July 1 3 September 1. Upper panel: control run. Lower panel: surrogates. Control run and surrogate RS data Accordingly, SEBS was forced with net radiation calculated from measured radiation components. Surface temperature was derived from outgoing long wave radiation. The rest of the inputs remained the same as mentioned in the previous section. In this way we obtained somewhat idealized LEsat data (read: no RS error involved) which is reffered to as the control run (see upper panel of Figure 1.16). Note the transparent non-gaussian dependency structure of bivariate LE data. Next, surrogate LEsat data was created by perturbing the control run with percentual error in the net radiation (by comparing RS derived net radiation with measured net radiation we estimated this error as 15%). Technically, each net radiation measurement in the control run was treated as a mode of log-normal distribution and the 15% error as its coefficient of variation. From each distribution 3 points were drawn at random and propagated through SEBS to obtain LEsat surrogates. Those are shown in lower panel of Figure 1.16. Then, bivariate GMDM s were fitted to both control run and surrogate data from Figure 1.16 (for an example of fitted pdf s see upper panel of Figure 1.17). To determine to which extent the bivariate structure in control run was deteriorated due to sattelite error in net radiation we compared the fitted pdf s in terms of probabilistic similarity measure introduced by Scott and Szewczyk (1): R f1 (x) f2 (x)dx sim( f1 ; f2 ) = R R 1 ( f1 (x)2 dx f2 (x)2 dx) 2 (1.2) This measure is if two pdf s show no similarity and 1 if two pdf s are just the same. For cropland, open shrubland and grassland sim( f1 ; f2 ) was.96,.96 and.98 respectively. This implies that 43

Wójcik Session 1 CAHMDA-II workshop E7_E25_E2 (open shrubland) 1.8e 5 1.6e 5 1.4e 5 1.2e 5 1e 5 8e 6 6e 6 4e 6 2e 6 2e 5 1.8e 5 1.6e 5 1.4e 5 1.2e 5 1e 5 8e 6 6e 6 4e 6 2e 6.16 1 1.25 1 1.14.12.2.1.8.6.15.1.4.2 1 1.5 1 1 Figure 1.17: An example of MDGM s fitted to open shrubland LE data. Upper panel: joint pdf s f (LE is,le sat ) for control run and surrogates respectively (the similarity between the two pdf s is.96). Lower panel: conditional pdf s f (LE is LE sat ) for control run and surrogates. The solid line in X-Y plane represents conditional expectation and dashed lines represent standard deviation bands. the error in net radiation has negligible effect on probability structure in control run data. Continuation The same analysis will be performed for the surface temperature. The results will be shown during the poster session. In parallel we work on an uncertainty analysis of LE measurements from EBBR ARM/CART stations. 44

CAHMDA-II workshop Bibliography Wójcik Bibliography Anderson, J., and S. Anderson, A Monte Carlo implementation of the nonlinear filtering problem to produce ensemble assimilations and forecasts., Mon. Weather Rev., 128, 1971 1981, 1999. Figueiredo, M., and A. Jain, Unsupervised learning of finite mixture models, IEEE T. Pattern Anal., 24(3), 381 396, 2. McLachlan, G., and D. Peel, Finite Mixture Models, Wiley Iterscience, New York,. Scott, D., and W. Szewczyk, From kernels to mixtures, Technometrics, 43, 323 335, 1. Su, Z., The surface energy balance system (SEBS) for estimation of turbulent heat fluxes, Hydrol. Earth Syst. Sci., 6(1), 85 99, 2. Torfs, P., E. van Loon, R. Wójcik, and P. Troch, Data assimilation by non-parametric local density estimation, in Computational Methods in Water Resources, edited by S. Hassanizadeh, R. Schotting, W. Gray, and G. Pinder, pp. 1355 1362, Elsevier, Amsterdam, 2.