Assessment of Ensemble Forecasts

Similar documents
The Impact of Horizontal Resolution and Ensemble Size on Probabilistic Forecasts of Precipitation by the ECMWF EPS

Feature-specific verification of ensemble forecasts

TC/PR/RB Lecture 3 - Simulation of Random Model Errors

Adaptation for global application of calibration and downscaling methods of medium range ensemble weather forecasts

Precipitation verification. Thanks to CMC, CPTEC, DWD, ECMWF, JMA, MF, NCEP, NRL, RHMC, UKMO

Five years of limited-area ensemble activities at ARPA-SIM: the COSMO-LEPS system

Seasonal Hydrometeorological Ensemble Prediction System: Forecast of Irrigation Potentials in Denmark

Ensemble Verification Metrics

Verification of Probability Forecasts

Quantitative Precipitation Forecasts over the United States by the ECMWF Ensemble Prediction System

Hydrologic Ensemble Prediction: Challenges and Opportunities

Basic Verification Concepts

Model error and seasonal forecasting

Main characteristics and performance of COSMO LEPS

Implementation and Evaluation of a Mesoscale Short-Range Ensemble Forecasting System Over the Pacific Northwest

EMC Probabilistic Forecast Verification for Sub-season Scales

Basic Verification Concepts

Implementation and Evaluation of a Mesoscale Short-Range Ensemble Forecasting System Over the Pacific Northwest

Drought forecasting methods Blaz Kurnik DESERT Action JRC

Standardized Anomaly Model Output Statistics Over Complex Terrain.

Application and verification of ECMWF products 2016

operational status and developments

Seasonal Forecasts of River Flow in France

Verification of ensemble and probability forecasts

Operational Hydrologic Ensemble Forecasting. Rob Hartman Hydrologist in Charge NWS / California-Nevada River Forecast Center

Precipitation Calibration for the NCEP Global Ensemble Forecast System

Application of Radar QPE. Jack McKee December 3, 2014

Probabilistic seasonal forecast verification

J11.5 HYDROLOGIC APPLICATIONS OF SHORT AND MEDIUM RANGE ENSEMBLE FORECASTS IN THE NWS ADVANCED HYDROLOGIC PREDICTION SERVICES (AHPS)

Exploring ensemble forecast calibration issues using reforecast data sets

Strategy for Using CPC Precipitation and Temperature Forecasts to Create Ensemble Forcing for NWS Ensemble Streamflow Prediction (ESP)

Medium-range Ensemble Forecasts at the Met Office

Near-surface weather prediction and surface data assimilation: challenges, development, and potential data needs

ICRC-CORDEX Sessions A: Benefits of Downscaling Session A1: Added value of downscaling Stockholm, Sweden, 18 May 2016

The Texas drought. Kingtse Mo Climate Prediction Center NWS/NCEP/NOAA

Uncertainty assessment for short-term flood forecasts in Central Vietnam

István Ihász, Máté Mile and Zoltán Üveges Hungarian Meteorological Service, Budapest, Hungary

FLORA: FLood estimation and forecast in complex Orographic areas for Risk mitigation in the Alpine space

Application and verification of ECMWF products at the Finnish Meteorological Institute

Application and verification of ECMWF products 2010

Application and verification of ECMWF products 2009

Verification of Probabilistic Quantitative Precipitation Forecasts over the Southwest United States during Winter 2002/03 by the RSM Ensemble System

Jaime Ribalaygua 1,2, Robert Monjo 1,2, Javier Pórtoles 2, Emma Gaitán 2, Ricardo Trigo 3, Luis Torres 1. Using ECMWF s Forecasts (UEF2017)

QPF? So it s a Land-falling Atmospheric River, Can That Help the Forecaster Make a Better. David W. Reynolds

Understanding Weather and Climate Risk. Matthew Perry Sharing an Uncertain World Conference The Geological Society, 13 July 2017

Introduction to NCEP's time lagged North American Rapid Refresh Ensemble Forecast System (NARRE-TL)

Talk Overview. Concepts. Climatology. Monitoring. Applications

Verification of ECMWF products at the Finnish Meteorological Institute

Application and verification of ECMWF products 2014

Focus on parameter variation results

Climate Change and Runoff Statistics in the Rhine Basin: A Process Study with a Coupled Climate-Runoff Model

JP3.7 SHORT-RANGE ENSEMBLE PRECIPITATION FORECASTS FOR NWS ADVANCED HYDROLOGIC PREDICTION SERVICES (AHPS): PARAMETER ESTIMATION ISSUES

Diagnosing the Climatology and Interannual Variability of North American Summer Climate with the Regional Atmospheric Modeling System (RAMS)

Application and verification of ECMWF products 2009

Application and verification of ECMWF products 2015

Forecasting Extreme Events

Recent ECMWF Developments

TRMM Multi-satellite Precipitation Analysis (TMPA)

Added Value of Convection Resolving Climate Simulations (CRCS)

Southern California Storm of July 2015

Verification of the Seasonal Forecast for the 2005/06 Winter

Linking the Hydrologic and Atmospheric Communities Through Probabilistic Flash Flood Forecasting

Implementation and Evaluation of a Mesoscale Short-Range Ensemble Forecasting System Over the. Pacific Northwest. Eric P. Grimit

Conditional weather resampling for ensemble streamflow forecasting

Methods of forecast verification

Probabilistic Weather Forecasting and the EPS at ECMWF

Sensitivity of COSMO-LEPS forecast skill to the verification network: application to MesoVICT cases Andrea Montani, C. Marsigli, T.

IAP Dynamical Seasonal Prediction System and its applications

A Report on a Statistical Model to Forecast Seasonal Inflows to Cowichan Lake

Seasonal Prediction in France : Application to Hydrology

Global Flash Flood Forecasting from the ECMWF Ensemble

LATE REQUEST FOR A SPECIAL PROJECT

Studying 2006 dry and 2007 wet events using surface observations and NCEP Reanalysis

Minor Winter Flooding Event in northwestern Pennsylvania January 2017

Three main areas of work:

El Niño, Climate Change and Water Supply Variability

The ECMWF Extended range forecasts

Application and verification of ECMWF products 2015

Climate Change Impact on Drought Risk and Uncertainty in the Willamette River Basin

Impacts of climate change on flooding in the river Meuse

Operational Perspectives on Hydrologic Model Data Assimilation

Improvement and Ensemble Strategy of Heavy-Rainfall Quantitative Precipitation Forecasts using a Cloud-Resolving Model in Taiwan

Concepts of Forecast Verification FROST-14

FPAW October Pat Murphy & David Bright NWS Aviation Weather Center

Use of extended range and seasonal forecasts at MeteoSwiss

Application and verification of ECMWF products 2008

HPC Ensemble Uses and Needs

NCEP ENSEMBLE FORECAST SYSTEMS

Hydrologic Research Laboratory Department of Civil and Environmental Engineering University of California, Davis

Verification at JMA on Ensemble Prediction

Evaluation of Satellite Precipitation Products over the Central of Vietnam

Sanjeev Kumar Jha Assistant Professor Earth and Environmental Sciences Indian Institute of Science Education and Research Bhopal

CAPS Storm-Scale Ensemble Forecasting (SSEF) System

Application and verification of ECMWF products 2017

Application and verification of ECMWF products 2009

Land Data Assimilation at NCEP NLDAS Project Overview, ECMWF HEPEX 2004

Weather Forecasting: Lecture 2

DEPARTMENT OF EARTH & CLIMATE SCIENCES Name SAN FRANCISCO STATE UNIVERSITY Nov 29, ERTH 360 Test #2 200 pts

The Hungarian Meteorological Service has made

The Canadian Regional Ensemble Prediction System (REPS)

Transcription:

Assessment of Ensemble Forecasts S. L. Mullen Univ. of Arizona HEPEX Workshop, 7 March 2004

Talk Overview Ensemble Performance for Precipitation Global EPS and Mesoscale 12 km RSM Biases, Event Discrimination Regional Assessment Calibration of Ensemble Output What it can and can t do well Analysis Uncertainty Effect on Verification Scores Fields Needed by Hydro. Runoff Model Ensemble Validation Issues What remains to be done?

RMSE Deposition Murphy (1988) Clim Bias 2 MSE= f o + s + s 2s s 2 2 f o f o fo Model Observed Variance Variance Variance Weighted Correlation 2 s ρ2 ρ f f fo fo o o Phase Reliability Bias o SS = s s ρ 2

Root Mean Square Error ECMWF Quantitative Precipitation Cool + Warm Seasons 12 Error (mm/day) 10 8 6 4 2 2σ σ RMSE T319 BIAS T319 RMSE T159 BIAS T159 Forecast error growth Bias-Spread Decomposition 0 1 2 3 4 5 6 7 8 9 10 Fcst Day

0.25 0.20 0.15 0.10 0.05 0.00 0.25 0.20 0.15 0.10 0.05 0.00 0.25 0.20 0.15 0.10 0.05 0.00 0.25 0.20 0.15 0.10 0.05 0.00 a) 24 h 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 b) 48 h 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 c) 120 h 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 d) 240 h 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 T159 Rank Histograms Under Dispersion Verification lies outside envelope of ensemble too frequently Related to weak model variance

Brier Score Decomposition Murphy (1973) BS = BS BS + BS where rel 1 I = [ ] 2 rel N i i i i= 1 1 I [ ] 2 res = N i i i= 1 BS = o[1 o] unc Skill Score res BS N f o BS N o o BSS = cli = BS cli unc BS BS BSres BS BS unc rel

Reliability for Old T159 Cool Season 10 mm/day Obs Freq 1.0 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 Day 1 Day 4 Day 7 Day 10 Want Forecasts that are Reliable Discriminating Sharp T159 EPS over forecasts likelihood of rain 0.0 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0 Fcst Prob Mullen and Buizza (2001)

Ranked Probability Skill Score T159 Mullen and Buizza (2001)

Ranked Probability Skill Score T159 Mullen and Buizza (2001)

24 h RPSS, 12 km RSM, 4 km grid Overall RPSS Gauge stations Average monthly precip RPSS > 0.5 are mainly located along the Pacific Coast, and the windward slopes of Sierra Nevada Mountains and Mogollon Rim of the central Arizona. Spatial Correlations RPSS and Precipitation: ~0. 60 RPSS and Gauge Density: ~0.30 (Yuan et al. 2004, in progress)

24 h Bias for 12 km RSM Ranked Histogram Reliability Diagram L shape of RH denotes large wet bias Wet bias reflected in Reliability Curves 1200 UTC shows stronger wet bias! (Yuan et al. 2004, in progress) UTC Reliability Resolution BSS 0000 0.0035 0.0049 0.1127 1200 0.0057 0.0042-0.1250

Personal Anecdotal Observation Ensemble forecast systems, both global and limit-area, seem to have very similar error characteristics for precipitation Wet conditional bias for 24 h thresholds of 50 mm and lower Under dispersion

+5 Day T255 +5 Day T319 Hurricane Georges - Day 5 Forecast 29 SEP 98: Pr > 50 mm 990130/1200 verification (mm) +5 Day T159 Mullen and Buizza (2002)

30 JAN 99: Pr > 50 mm 990130/1200 verification (mm) Mullen and Buizza (2001) +5 Day T159 +5 Day T255 +5 Day T319 Wintertime Severe Thunderstorm Outbreak

Forecast Variations Skillful ensemble forecast systems might always yield a few busts What are sensitivities of user hydro user community and how do they deal with this situation?

Forecast Discrimination How well do ensembles discern precipitation events if biases are removed/ignored?

EPS ROC Areas for Summer-Winter 20 mm Threshold, Model Grid 1.0 0.9 T255 EPS Summer Winter Summer precipitation is tougher to discern than winter ones ROC Area 0.8 0.7 Why? CONVECTION! 0.6 Small sub-grid scale 0.5 1 2 3 4 5 6 7 8 9 10 Fcst Day Intermittency Weak synoptic forcing

24 h ROC Areas for 12 km RSM (Yuan et al. 2004, in progress) Outstanding ability to discriminate precipitation events ROC areas ~0.90 Local regions can show better performance e.g. Sierra Nevada Implication: ensembles contain valuable predictive input to drive runoff models

RSM Verification for River Basins

Regional Variations in 12 km RSM Skill (Yuan et al. 2004, in progress)

Regional Variations in 12 km RSM Skill (Yuan et al. 2004, in progress)

12 km RSM ROC Central Valley

Optimal Potential Economic Value (PEV)

Calibration Questions Calibration of EPS Ensemble Output by Artificial Neural Networks (ANN) How much can calibration improve medium-range QPF skill? Can post-processing of just precipitation output from EPS significantly improve more than reliability term?

Brier Skill Score (4 Summers---DCA, OKC, FL, PNW) 0.6 0.5 0.4 0.3 0.2 NET 1mm NET 10mm NET 25mm EPS 1mm EPS 10mm EPS 25mm Skill Increases for 1, 10 and 25 mm but not 50 mm Largest Improvement Early in Forecast 0.1 0.0 1 2 3 4 5 6 7 8 9 10 Fcst Day

Obs Prob 1.0 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0.0 Attributes Diagram Day 2 (4 Summers---DCA, OKC, FL, PNW) NET EPS 10 mm 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0 Fcst Prob 1 0.1 0.01 0.001 0.0001 0.00001 Excellent Calibration Every Year-Season No High Probabilities (e.g. No Probs 90% for 10 mm at D+2) NET not as Sharp Note Differences in Forecast Frequencies (logarithmic scale)

Summer Brier Decomposition Where Does Improvement Come? 0.05 0.04 0.03 0.02 0.01 0.00 10 mm REL NET REL EPS RES NET RES EPS UNC.088 1 2 3 4 5 Fcst Day REL (Reliability) NET Increases Skill Through D+5 RES (Resolution) Slight Increase @ D+1 Calibration slightly improves ability to discriminate events early in forecast

Average Regional Skill (Averaged over all forecast projections) 0.25 0.20.249-.213-.084-.018 0.15 0.10.168-.166-.057-.002 0.05 NET EP S 0.00-0.05 1 mm 10 mm 25 mm 50 mm 0.15 0.10 0.05 0.00-0.05 NET EP S.125-.094-.049-.014.010-.023-.004-.009 1 mm 10 mm 25 mm 50 mm 0.15 0.10 0.05 0.00-0.05.151-.107-.049-.035.088-.073-.052-.005 NET EP S 1 mm 10 mm 25 mm 50 mm

General Conclusions Calibration of Only QPF Output -Improves Brier Skill Score by ~20% -Improves Reliability -Improves Resolution only small amount -Calibrated forecasts lack sharpness few or no extreme probability forecasts

Somewhat Related Questions How should ensemble output be calibrated prior to input into hydrological models? In what form (state vector for single forecast or state matrix for entire ensemble) should forecast fields from ensembles be input into hydro models?

Inclusion of Analysis Uncertainty in Verification Thorough Verification Should include estimate of observational or analyses uncertainty Inclusion can lead to markedly different - values for accuracy measures - conclusions Rainfall marked by LARGE uncertainty QPE differences can be comparable to spread at 24-48 h for QPF in localized regions!

Uncertainty in Verification Analyses The NCEP precipitation analyses Resolution Data source QC Interval Time (UTC) mask Gauge RFC8 1/8 th (14km) Radar+Gauge Yes 24 h 1200 Yes 7~8000 RFC4 4 km Gauge only No 24 h 1200 Yes 7~8000 Stage4 4 km Radar+Gauge Yes 6 h/24 h 0000&1200 No 3000 QC: Quality Control done at RFCs Accumulated 24-h precipitation for 1200 UTC 8 Nov-9 Nov, 2002 RFC8 RFC4 Stage4

Different Verifying Analyses 4 km grid for CONUS region NDJFM 2002-03 Red Green different QPE on same ~4 km grid Black Blue same QPE on different 14 vs. 4 km grid Yuan et al. (2004, in progress)

Analysis-Observational Uncertainty What is the impact from uncertainty associate with other variables that might be needed as input by hydrological runoff models?

Driving Hydro Runoff Models with EPS/Ensemble Output QPF/NWP Forecasts are NOW Sufficiently Accurate to Use as Forcing for Predictive Hydrological Runoff Models may require DOWNSCALING Time-Space Scales Need to Verify Additional Atmospheric Parameters Not Commonly Examined. SFC Fluxes, Radiation, H 2 O Vapor, Cloud

Hydrologic Models KINEROS Some Basically Need Precipitation from Atmospheric Ensembles

Hydrologic Models (from Marchand 1999) Others Require Atmospheric Input Fields of Precipitation (amount, type, intensity) Wind, T, Q, Clouds, Fluxes, Radiation, BC s

Driving Hydro Runoff Models with EPS/Ensemble Output What is Ensemble Performance for Other Parameters Needed by Hydro Models? Fluxes, Radiation, Water Vapor, Cloud How Well Are These Fields Observed? What is the uncertainty? How Does it Affect Estimates of Skill?

Time-Space Spectra, Intermittency Runoff is Sensitive to Precipitation Intensity 25 mm/1 hr runoff 25 mm/24 hr none 500 mb H Spectra Hacker and Baumhefner (2004) Intermittency Issue of Heavy Precipitation Better Documentation and Understanding of Ensemble Variances

Feature Based Verification Moncrieff and collaborators (2002 ongoing) Carbone et al (2002)

Feature Based Verification Moncrieff and collaborators (2002) Mucho Better Climate! Simulates Aspect of Feature Carbone et al (2002)

Feature Based Verification Heavy Precipitation/Flash Flooding is Often Associated with Features Like: Quasi-Stationary Convection Training Convection Topographic Interactions Warm Clouds Land Falling Hurricanes, Tropical Systems Warm Rain over Snow How well do ensemble systems perform?

Closing Thoughts Data/Com Requirements Analyses/Observations (atmospheric and hydrologic) at requisite time/space scales with uncertainty estimates Frequent, full resolution ensemble fields for parameters of relevance to hydrological models for calibration of ensemble and hydrological models What does Hydro Community need?