J th Conference on Aviation, Range, and Aerospace Meteorology, 90th AMS Annual Meeting, Atlanta, GA, Jan, 2010

Similar documents
Introduction to NCEP's time lagged North American Rapid Refresh Ensemble Forecast System (NARRE-TL)

Evaluation of Ensemble Icing Probability Forecasts in NCEP s SREF, VSREF and NARRE-TL Systems

Ensemble forecast and verification of low level wind shear by the NCEP SREF system

3.6 NCEP s Global Icing Ensemble Prediction and Evaluation

FAA Weather Research Plans

Canadian Meteorological and Oceanographic Society and American Meteorological Society 21 st Conference on Numerical Weather Prediction 31 May 2012

The Impact of Horizontal Resolution and Ensemble Size on Probabilistic Forecasts of Precipitation by the ECMWF EPS

NCEP Short Range Ensemble Forecast (SREF) System: what we have and what we need? Jun Du. NOAA/NWS/NCEP Environmental Modeling Center

2012 and changes to the Rapid Refresh and HRRR weather forecast models

Transitioning NCAR s Aviation Algorithms into NCEP s Operations

Verifying Ensemble Forecasts Using A Neighborhood Approach

Weather Hazard Modeling Research and development: Recent Advances and Plans

Update on CoSPA Storm Forecasts

11.1 CONVECTION FORECASTS FROM THE HOURLY UPDATED, 3-KM HIGH RESOLUTION RAPID REFRESH (HRRR) MODEL

4A.4 NCEP Short Range Ensemble Forecast (SREF) System Upgrade in 2009

An Investigation of Reforecasting Applications for NGGPS Aviation Weather Prediction: An Initial Study of Ceiling and Visibility Prediction

Figure 1. Observations assimilated into RUC as of 2008 after upgrade, new observations in bold and with leading arrows.

VERFICATION OF OCEAN WAVE ENSEMBLE FORECAST AT NCEP 1. Degui Cao, H.S. Chen and Hendrik Tolman

Richard L. Bankert* and Michael Hadjimichael Naval Research Laboratory, Monterey, CA

J11.5 HYDROLOGIC APPLICATIONS OF SHORT AND MEDIUM RANGE ENSEMBLE FORECASTS IN THE NWS ADVANCED HYDROLOGIC PREDICTION SERVICES (AHPS)

Consolidated Storm Prediction for Aviation (CoSPA) Overview

Numerical Forecast of Fog Central Solutions

Methods of forecast verification

3.1 TAMDAR JET FLEETS AND THEIR IMPACT ON RAPID UPDATE CYCLE (RUC) FORECASTS

4.3.2 Configuration. 4.3 Ensemble Prediction System Introduction

Amy Harless. Jason Levit, David Bright, Clinton Wallace, Bob Maxson. Aviation Weather Center Kansas City, MO

12.2 PROBABILISTIC GUIDANCE OF AVIATION HAZARDS FOR TRANSOCEANIC FLIGHTS

ASSIMILATION OF METAR CLOUD AND VISIBILITY OBSERVATIONS IN THE RUC

Towards Operational Probabilistic Precipitation Forecast

8A.6 AN OVERVIEW OF PRECIPITATION TYPE FORECASTING USING NAM AND SREF DATA

Aircraft-based Observations: Impact on weather forecast model performance

Basic Verification Concepts

P3.1 Development of MOS Thunderstorm and Severe Thunderstorm Forecast Equations with Multiple Data Sources

Development of High-Resolution Rapid Refresh (HRRR) Ensemble Data Assimilation, Forecasts and Post Processing

FPAW October Pat Murphy & David Bright NWS Aviation Weather Center

RTMA Flowchart (CONUS)

Peter J. Sousounis*, James L. Menard, and Roy Strasser Weather Services International Corporation Andover, MA

Global aviation turbulence forecasting using the Graphical Turbulence Guidance (GTG) for the WAFS Block Upgrades

University of Oklahoma, Norman, OK INTRODUCTION

AN ENSEMBLE STRATEGY FOR ROAD WEATHER APPLICATIONS

Allison Monarski, University of Maryland Masters Scholarly Paper, December 6, Department of Atmospheric and Oceanic Science

Combining Deterministic and Probabilistic Methods to Produce Gridded Climatologies

Denver International Airport MDSS Demonstration Verification Report for the Season

RSMC WASHINGTON USER'S INTERPRETATION GUIDELINES ATMOSPHERIC TRANSPORT MODEL OUTPUTS

Panel Session 8: Current Capabilities and Future Plans for Surface Transportation Weather Support

J3.4 FROM THE RADAR-ENHANCED RUC TO THE WRF-BASED RAPID REFRESH. NOAA Earth System Research Laboratory, Boulder, CO

CAS & CAeM Aviation Research and Demonstration Project Paris-CDG airport

1.2 DEVELOPMENT OF THE NWS PROBABILISTIC EXTRA-TROPICAL STORM SURGE MODEL AND POST PROCESSING METHODOLOGY

Verification and performance measures of Meteorological Services to Air Traffic Management (MSTA)

5.4 USING PROBABILISTIC FORECAST GUIDANCE AND AN UPDATE TECHNIQUE TO GENERATE TERMINAL AERODROME FORECASTS

22 nd Conference on Weather Analysis and Forecasting / 18 th Conference on Numerical Weather Prediction June 2007, Park City, UT, Amer. Meteor. Soc.

Justin Arnott and Michael Evans NOAA National Weather Service, Binghamton, NY. Richard Grumm NOAA National Weather Service, State College, PA

COMPOSITE-BASED VERIFICATION OF PRECIPITATION FORECASTS FROM A MESOSCALE MODEL

Methodology for the creation of meteorological datasets for Local Air Quality modelling at airports

HPC Ensemble Uses and Needs

An Initial Assessment of a Clear Air Turbulence Forecasting Product. Ankita Nagirimadugu. Thomas Jefferson High School for Science and Technology

Extracting probabilistic severe weather guidance from convection-allowing model forecasts. Ryan Sobash 4 December 2009 Convection/NWP Seminar Series

AMPS Update June 2016

INTEGRATED TURBULENCE FORECASTING ALGORITHM 2001 METEOROLOGICAL EVALUATION

New 16km NCEP Short-Range Ensemble Forecast (SREF) system: what we have and what we need? (SREF.v6.0.0, implementation date: Aug. 21.

Aaron Johnson* and Xuguang Wang. University of Oklahoma School of Meteorology and Center for Analysis and Prediction of Storms (CAPS), Norman, OK

Plans for NOAA s regional ensemble systems: NARRE, HRRRE, and a regional hybrid assimilation

Localized Aviation Model Output Statistics Program (LAMP): Improvements to convective forecasts in response to user feedback

Mark Miller NOAA NextGen Weather Program Manager July 21, 2010

Welcome to the 5 th NCEP Ensemble Users Workshop William. M. Lapenta Acting Director Environmental Modeling Center NOAA/NWS/NCEP

Report on EN6 DTC Ensemble Task 2014: Preliminary Configuration of North American Rapid Refresh Ensemble (NARRE)

Probabilistic weather hazard forecast guidance for transoceanic flights based on merged global ensemble forecasts

Implementation and Evaluation of a Mesoscale Short-Range Ensemble Forecasting System Over the Pacific Northwest

THE REAL-TIME VERIFICATION SYSTEM (RTVS) AND ITS APPLICATION TO AVIATION WEATHER FORECASTS

Implementation and Evaluation of a Mesoscale Short-Range Ensemble Forecasting System Over the Pacific Northwest

Assessment of Ensemble Forecasts

Concepts of Forecast Verification FROST-14

Comparison of the NCEP and DTC Verification Software Packages

Quantifying Uncertainty through Global and Mesoscale Ensembles

A new mesoscale NWP system for Australia

NOTES AND CORRESPONDENCE. Improving Week-2 Forecasts with Multimodel Reforecast Ensembles

Southern United States Heavy rain and flood event 6-8 April 2014

HPC Winter Weather Desk Operations and 2011 Winter Weather Experiment Dan Petersen Winter weather focal point

Motivation & Goal. We investigate a way to generate PDFs from a single deterministic run

Exploring ensemble forecast calibration issues using reforecast data sets

CMO Terminal Aerodrome Forecast (TAF) Verification Programme (CMOTafV)

Evaluating Forecast Quality

Forecasting of Optical Turbulence in Support of Realtime Optical Imaging and Communication Systems

Severe storm forecast guidance based on explicit identification of convective phenomena in WRF-model forecasts

Verification of nowcasts and short-range forecasts, including aviation weather

Verification of Probability Forecasts

Southern Heavy rain and floods of 8-10 March 2016 by Richard H. Grumm National Weather Service State College, PA 16803

Verification and Calibration of Simulated Reflectivity Products. Mark T. Stoelinga University of Washington

STUDY UNIT SEVENTEEN GRAPHICAL AIRMAN S METEOROLOGICAL ADVISORY (G-AIRMET)

TC/PR/RB Lecture 3 - Simulation of Random Model Errors

Five years of limited-area ensemble activities at ARPA-SIM: the COSMO-LEPS system

Measuring In-cloud Turbulence: The NEXRAD Turbulence Detection Algorithm

A 21st century HRRR-based approach to estimating probable maximum precipitation to enhance dam safety and community resilience

Status of Atmospheric Winds in Relation to Infrasound. Douglas P. Drob Space Science Division Naval Research Laboratory Washington, DC 20375

Emerging Aviation Weather Research at MIT Lincoln Laboratory*

eccharts Ensemble data and recent updates Cihan Sahin, Sylvie Lamy-Thepaut, Baudouin Raoult Web development team, ECMWF

SPC Ensemble Applications: Current Status and Future Plans

weather forecast Improving NOAA models for energy Stan Benjamin, Boulder, CO USA Joseph Olson, Curtis Alexander, Eric James,

operational status and developments

Precipitation verification. Thanks to CMC, CPTEC, DWD, ECMWF, JMA, MF, NCEP, NRL, RHMC, UKMO

Transcription:

J12.3 14th Conference on Aviation, Range, and Aerospace Meteorology, 90th AMS Annual Meeting, Atlanta, GA, 17-21 Jan, 20 Introduction to the NCEP Very Short Range Ensemble Forecast System (VSREF) Binbin Zhou*, Jun Du and Geoff DiMego Environmental Model Center/NCEP/NOAA, Camp Springs, MD Abstract The NCEP Very Short Range Ensemble Forecast (VSREF) System is specially developed for aviation weather and potential NextGen applications. Its development was initiated in 2009. The VSREF system is based on NCEP's operational Rapid Update Cycle (RUC) and North American Mesoscale (NAM) model runs using a time-lag ensembling technique. The current domain is over the contiguous US (CONUS). The VSREF is routinely updated every hour with hourly output through 6 hr forecast length. In the current developing stage, there are 11 aviation-weather related probabilistic products in the VSREF system, including icing, clear air turbulence (CAT), visibility, fog, ceiling, low level wind shear, jet stream, surface wind gust, simulated reflectivity, convection, and freezing height. There is an experimental web page to display these products for demonstration. In this paper, more detailed information about the NCEP VSREF including its system configuration, product generation, verification and future plan will be described. 1. Introduction Ensemble forecasting is a new modeling technique to deal with forecast uncertainties, steming from either initial conditions or models. The National Centers for Environmental Prediction (NCEP) of NOAA has developed ensemble forecast systems at both global scale (Toth and Kalnay 1993) and regional range scale (Du and Tracton 2001, Du et al. 2006) over Contiguous US (CONUS), Alaska and Hawaii. In recent years, NCEP has been making efforts to apply its Short Range Ensemble Forecast (SREF) System to aviation weather forecast (Zhou et al, 2004), including system configuration, postprocessing, preliminary aviation-related ensemble products. However, the current system configuration for the NCEP SREF still dose not meet very short range requirement of NextGen, the FAA s new Air Traffic Management System (ATM). Currently, NCEP has a Rapid Update Cycle model (RUC: http://ruc.noaa.gov) that is hourly run and specially serves deterministic aviation weather prediction. However, due to Corresponding author address: Binbin Zhou, NOAA/NWS/NCEP/EMC, 5200 Auth Rd. Camp Springs, MD 20746. Binbin.Zhou@noaa.gov uncertainties of model prediction and its critic impacts on aviation traffic decision making procedure, NextGen will heavily rely on ensemble-based probabilistic forecast data as input (Benjamin and DiMego 20; Souders et al. 20). As a promise to support and comply with the NextGen requirements, NCEP and Global Systems Division (GSD) of Earth System Research Laboratory (ESRL) of NOAA will cooperate to develop NCAR ARW-WRF based Rapid Refresh (RR) and High-Resolution RR (HRRR) systems. The RR or HRRR based ensemble forecast system (NARRE North American Rapid Refresh Ensemble and HRRRE High-Resolution Rapid Refresh Ensemble) will be established in 2014 as planed. As a prelude of the RR/HRRR ensemble development stages, a RUC- NAM based Very Short Range Ensemble Forecast System (VSREF) has been first suggested and developed at NCEP recently. The basic idea of the VSREF is time-lagging the forecasts from existing RUC and NAM cycles. An obvious advantage of the time-lagged technique is its low computational cost since it uses existing model output data without large amount of computational resources including CPU and memory space as requested mainly by, for instance, integrating a model and 1

generating initial condition perturbations such as Ensemble Transform used by GEFS and breeding technique used by the SREF ( Du et al. 2006 ). The only cost for the time-lagged ensemble forecast system is on the procedures to collect the existing model data and then apply a post processor to generate required ensemble products. For the VSREF, a rapid refreshed ensemble forecast system, employing the time-lagged ensembling technique is an appropriate choice for current computational condition at NCEP. The current VSREF system consists of three procedures, including the ensemble member creation, the aviation ensemble product generation and the ensemble product visualization. This paper will give a brief description of each of these procedures. 2. System configuration There are two types of approaches for the time lagged technique: Direct Time-Lagged and Scaled Time-Lagged. The Direct Time-Lagged approach (Hoffman and Kalnay 1983) directly pulls multiple forecasts which are initiated from different past times but verified at a same time together as an ensemble (a mixture of old and young forecasts). This method views the error of a past forecast at t=0 (current initial time) directly as initial condition perturbation which should reflect error of the day and has dynamically growing structure leading to larger ensemble spread than random perturbation. The advantage of the method is that the generation of perturbation is absolutely free. However, a main concern is that the quality (magnitude) of perturbation depends on the age of a forecast since forecast quality usually decreases with lead time. To avoid this weakness, past forecast errors are first scaled down by their ages (assuming error growth is quasi-linear) at t=0 to have similar magnitude in all perturbations and then added to or subtracted from the current control analysis to create multiple analyses to initiate an ensemble of forecasts (Ebisusaki and Kalnay 1983). The Time-Lagged approach has been used in many ensemble research and operations (e.g. Saha et. al. 2006; Hou et. al. 2001; Lu et al. 2006; Brankovic et al. 2006). A limitation of the Time-Lagged method is that it cannot create an ensemble with large enough member size since the number of good old forecasts available is limited in reality. Otherwise, the ensemble quality will be severely contaminated if too old forecasts are included to have a large size ensemble. To keep more good members and emphasize on more recently finished model cycles in the VSREF member group, a set of time-decaying weights are assigned to each VSREF member. The details of weighting will be presented in the next discussions. Before the VSREF ensemble members are generated, the VSREF configuration procedure first determines which previously finished NAM (North American Mesoscale model http://www.emc.ncep.noaa.gov) and RUC (Rapid Update Cycle: http://ruc.noaa.gova) model cycles are available for building the VSREF system. Currently at NCEP, the operational NAM is run four cycles per day at 00Z, 06Z, 12Z and 18Z with hourly output out to 87 forecast hours while the operational RUC is run every one hour with hourly output out to 12 forecast hours for 00, 03, 06, 09, 12, 15, 18, 21Z cycles and out to 9 forecast hours for other cycles. In order to create a rapid update ensemble forecast system, each VSREF cycle is launched right after each cycle of RUC is finished. That is, the VSREF also is hourly updated, or 24 cycles per day. Since the ensemble size of a time-lagged ensemble can not be too large as discussed, the membership of the current VSREF is designed as members. To coordinate current NAM and RUC models in running and output configurations, the VSREF will use four previously finished NAM cycles, five previously finished and one most recently finished RUC cycles. To better understand the VSREF system configuration, take a look at a specific VSREF cycle, e.g. 06Z run as shown in Fig. 1. Figure 1 shows the configuration of (today s) 06Z cycle of VSREF membership, which consists of 4 NAM cycles of 00Z, 06Z, 12Z and 18Z yesterday (finished 30, 24, 18 and 12 hours ago), and 6 RUC cycles of 01Z,02Z, 03Z, 04Z, 05Z and 06Z today (finished 5, 4, 3, 2, 1 and 0 hour ago). The total forecast and output time, as shown in Fig.1, is at 07, 08, 09,, 11 and 12 Z, respectively. Please note that 11Z and 12Z only have and 11 members, respectively, because 01Z and 02Z RUC cycles only have 9 forecast hours. 2

NAM RUC 00Z 06Z 12Z 18Z 01Z 02 03 04 05 06Z 07 08 09 11 12Z Figure 1. VSREF membership configuration for today s 06Z cycle, which is composed of finished cycles, including 00Z, 06Z, 12Z and 18Z NAM cycles (blue dots) of yesterday, and today s 01Z, 02Z, 03Z, 04Z, 05Z and just finished 06Z RUC cycles (red dots), with hourly output at 07Z, 08Z, 09Z, Z, 11Z and 12Z (black dots), totally 6 forecast hours. The weights for each member in 06Z cycle VSREF are assigned as following order. The most recent RUC cycle (06Z) is assigned 1.0. The rest of members are assigned 0.9, 0.8,.0.1, according to the ages of the members. The older, the smaller. The oldest NAM always has smallest weight 0.1. This implies that the VSREF is more weighted on RUC than on NAM. As soon as all of the members are available right after 06Z, the VSREF system configuration procedure begins to collect member data to construct the ensemble. Another important notice is that the current operational RUC and NAM are output in different grid scales (http://www.nco. ncep.noaa.gov/pmb/docs/on388/tableb.html) although their resolutions are close (NAM 12 km and RUC 13 km). The NAM grid is on Grid-218 while RUC grid is on Grid-130. To make ensemble computation at same grid, the NAM model data are converted to RUC grid by using the NCEP s copygb utility (http://www.nco.ncep.noaa.gov/pmb/codes/nwprod/ util/sorc/copygb.fd) before conducting ensemble product generation. 3. Product generation After all of weighted member data are ready, the ensemble mean, spread and probability computations for each aviation product X i can be conducted as following: the ensemble mean wm X i, m m1 X i (1) w m1 and the ensemble spread w m m i, m i m1 1/ 2 S i [ ] (2) m1 X w m X 1 where w m is the weight for member m. Before computing the probability for product X i, first diagnose if it exceeds a specific threshold t i,j. In general a series of thresholds for this product are checked, giving yes (exceeding the threshold) 2 3

or no (not exceeding the threshold) results for all of thresholds (j =1, 2, ). Counting the yes members for a specific threshold t i,j, the yes ensemble probability for product X i can be computed as yes~ member m m1 P ( X i ti, j ) (3) m1 Obviously, if all weights are given equal, the above computations will reduce to regular equalweighting mean/spread/probability ensemble product computation. It is also implied that if having more recent members (more members are young), the probability computation will result in larger ensemble probability, or more confidence with the forecast. At current stage, 11 most concerned aviation weather related products are considered in the VSREF system, including icing, clear air turbulence (CAT), ceiling height, visibility, fog, jet stream, low level wind shear (LLWS), convection, simulated w w m reflectivity, freezing level and precipitation type. See Table 1, in which the computation/diagnosis methods for each product are also shown. From Table 1 we also can see that some products only have probability, such as icing, CAT, fog, convection, reflectivity, and precipitation type, some only have mean/spread such as freezing level, and some have both mean/spread and probability such as visibility and ceiling. The reason for only ensemble probability is that these product are event-driven and diagnosed as binary forecast (yes/no) in the post processor. Some products, e.g. the reflectivity has dbz value in model output. However, averaging of dbz is a tricky problem since it can not conduct simple average over reflectivity dbz value. So at current stage, only probability of reflection larger than a series of thresholds are presented in the VSREF system. The icing, CAT and jet stream are in-flight products. The others are TAF products (surface). The icing levels are FL000, FL030, FL060, FL090, FL120, FL150, FL180and FL240. The CAT levels are FL180, FL2, FL240, FL270, FL300, FL330, FL360, FL390, and FL 420.The jet stream levels are FL045, FL150 and FL350. Table 1. VSREF aviation ensemble products Product ensemble Method 1 Icing occurrence probability T and RH diagnosis (same as NCEP SREF) at 8 flight levels 2 Clear Air Turbulence probability of U, V, T diagnosis (Ellrod 1992, same as NCEP SREF) light, moderate, severe at 9 flight levels 3 Ceiling height mean/spread and prob of Cloud base height (RUC has no cloud coverage) <00, 2000, 3000, 6000 and 000 feet 4 Visibility range mean/spread and prob of < ¼, ½, 1, 2, and 4 miles NAM: Stoelinga and Warner (1999) RUC: Smirnova et al. (2000) 5 Jet stream at 3 flight levels, probability of U,V components diagnosis wind speed > 20, 40, 60, 80 0 knots 6 Low level wind shear mean/spread and prob of (wind shear > 20 knots/2000feet) Federal Meteor Handbook -1995 7 Fog occurrence probability Zhou and Du (20) 8 Convection occurrence probability Convective precipitation diagnosis (Weygandt et al. 2008) 9 Simulated reflectivity probability of Ferrier ( see Koch et al. 2005) >, 20, 30, 40 dbz Freezing level mean/spread Temperature-profile diagnosis 11 Precipitation type occurrence probability Rain, snow and freezing rain from model 4

ETS Precip threshold (mm/h Some explanations for the methods of listed products are briefly described here. For ceiling height, the definition is a combination of both cloud base height and cloud fraction percentage as was defined by NWS (FCM-1999). However current RUC has no cloud fraction percentage output. Therefore at current stage, the VSREF s ceiling is just cloud base height which is different from actual ceiling height. The visibility computation in NAM and RUC is different. NAM uses Stoelinga and Warner (1999) method while RUC uses Smirnova et al. (2000) method, an improved algorithm of Stoelinga and Warner. A limited verification over CONUS shows that the RUC visibility is more skillful than that of NAM, particularly at higher visibility range (see Fig. 2). The fog diagnosis in VSREF follows the method used in current SREF where the fog event at a grid point is diagnosed with so-called multiple-variable diagnostic method including 0.2 6 hour visibility fcst ETS from Oct 15 to Dec 1, 2009 surface cloud water, cloud base, cloud top, surface RH, and wind speed (Zhou and Du 20). Only all of these variables meet certain thresholds will fog be diagnosed instead of using visibility < 00m as a threshold. The verification has shown that the multi-variable diagnostic method significantly improves the forecast performance over the visibility-only method. The convection is diagnosed with the scheme of Weygandt et al. (2008). The method diagnoses the grid-filtered convection with hourly convective precipitation accumulation which varies with eastern US (Fig. 3), western US regions and diagonal times. The grid-filter performs a 7x7 grid average for eastern US region and a 9x9 grid average for western US region. Considering the less organization feature in the western US region, the convective precipitation threshold for western US regions is 0.6 factor of the eastern region. 1.2 0.175 0.15 0.125 RUC NAM 1.1 1 0.1 0.075 0.05 0.025 0.9 0.8 0.7 0 400 800 1600 3200 6400 000 0.6 0 3 6 9 12 15 18 21 Visibility threshold (< m) UTC Figure 2. Equitable threat score comparison between NAM and RUC s 6 hour visibility forecasts. 4. Visualization of the products After all of products are diagnosed or available directly from each members, the ensemble products are produced from the NCEP ensemble product generator and then display at the VSREF web page (http://wwwt.emc.ncep.noaa.gov/mmb/ SREF_avia/FCST/VSREF/web_site/html/refl.html ) for demonstration purpose. The VSREF web page is updated every hour but only today s forecast is present. The historical image will be Figure 3. Convective precipitation threshold (mm/hr) used for eastern US region. removed from the NCEP server to save space. If users request historical data, please send email to Binbin.Zhou@noaa. The interface of the VSREF web page looks like Fig. 3 where the probability distribution of reflectivity > 20 dbz over CONUS is shown. From the VSREF web page, different cycles, different products under which different thresholds can be selected and displayed. The animation through different forecast times can also be displayed but zooming capability is still not available. 5

Fig.3 VSREF web page interface where Dec. 22, 2009, cycle 14Z, 4-hour forecast probability of reflectivity > 20 dbz over CONUS, valid at 18Z on the same day, is shown. 5. Verification Verification of aviation weather forecast is difficult due to lack of sufficient and reliable truth data. GSD s Real Time Verification System (RTVS: http://rtvs.noaa.gov) is one that is a specific tool for evaluation of aviation weather products but the RTVS is for single model instead of probabilistic prediction verification. At NCEP, we have developed a grid to grid (g2g) forecast verification system (www.emc.ncep.noaa.gov /mmb/papers/zhou/ncepgrid2gridverificationsy stem- V2.doc) that can be used for verifying both single and ensemble forecasts, or deterministic and probabilistic verifications. The deterministic verification uses traditional evaluation measures like Bias, Hit rate (POD), FAR, Missing rate, Threat score (TS), and Equitable threat score (ETS). In general, an ensemble forecast can be converted to deterministic forecast through selecting a specific ensemble probability threshold. For instance, select 50% as an ensemble probability threshold. If the ensemble probability forecast for an event (such as visibility < 400m) at a grid point is more than 50% (or more than 5 of members predict true ), then issue the true forecast for this event at this grid point. Then we can use the traditional (deterministic) verification technique to verify the ensemble forecast system over various probability thresholds. The probabilistic verification uses Reliability, Resolution, Brier Skill Score (BSS), Relative Operational Curve (ROC), etc. Particularly, the Economic Value (EV) in a cost/loss (C/L) ratio can also be estimated (Zhu et al. 2002). The EV can be further applied in a decision making procedure and may be particularly useful in NextGen. Currently, some gridded data have available to us, including the Aviation Weather Center s ADDS (http://adds. aviationweather.gov) data and the National Radar Reflectivity Mosaic (http://www.emc.ncep.noaa. gov/mmb/wx22hl/ref) data. With the ADDS data, the VSREF s ensemble icing, CAT, ceiling 6

height, fog and visibility forecasts can be evaluated while with the radar mosaic data, the reflectivity or convection ensemble forecast can be verified. As a primary verification step, we have conducted some verifications of visibility ensemble probability ( of less than various visibility ranges) forecasts including deterministic and probabilistic aspects as shown in Fig. 4 and 5. Fig.4 is the accumulated deterministic verification scores over 3 weeks from Oct. to Nov. 2009. Fig. 5 is the probabilistic verification diagrams over the same period. a b Vis<6400 c Vis<3200 Vis<1600 Vis<800 m Vis<400 m d Fig. 4. Deterministic verification: VSREF s 3 week accumulated 6 hour forecast POD (a), Bias (b, bias=1means no bias), FAR (c) and ETS (d) of visibility range < selected values (400, 800, 1600, 3200 and 6400 m, respectively) for various selected probability thresholds (x-axis). Accumulation period is from Oct 12 to Nov. 8, 2009. 7

Ecoomic Valu POD 1 0.8 a 0.6 0.4 0.2 Vis<800m Vis<3200m Vis<6400m 0 0 0.2 0.4 0.6 0.8 1 False alarm rate 0.5 0.4 0.3 0.2 0.1 Vis< 800 m Vis< 3200 m Vis < 6400 m b 0 0-0.1 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9-0.2-0.3-0.4-0.5 Cost/Loss Ratio Fig. 5. Probabilistic verification: VSREF s ROC(a) and Economic Values(b) for 3 selected visibility range thresholds (<800, 3200 and 6400 m, respectively) 8

Fig. 4 shows that using a smaller ensemble probability as a threshold can get larger hit rate (POD) but also suffers from a higher false alarm ratio (FAR) and a larger bias while with a larger ensemble probability as a threshold, the situation is reversed. So there will be a trade-off in selecting a proper ensemble probability threshold. In general, selecting a mid-range (40-60%) probability as a threshold can obtain a better ETS, just as shown in Fig. 4 (d). The ETS actually is a overall score considering both POD and FAR. For a user s application, how to select an appropriate probability threshold depends on user s own situation. Some prefer a higher POD, some don t care about FAR, and some care about both FAR and POD. We can see that the ensemble forecast can meet a wide range of requirements of different users, which is an important advantage of an ensemble forecast over a single model forecast. Fig. 5 shows the probabilistic measures of the VSREF visibility forecast over same period as Fig. 4. Fig 5 (a) shows the ROC which represents the forecast skill in terms of resolution, or how a forecast is different from the climatological prediction (diagonal line). Only ROC with larger than 0.5 area (or ROC curve is above the diagonal line), the forecast is skillful. We can see that all of forecasts for selected visibility ranges are skillful, or much better than climatological prediction. The Fig.5 (b) presents the Economic Value (EV) in various coast/loss ratios for different visibility range forecasts. Only for a lower cost/loss (C/L) ratio, the 3 VSREF visibility forecasts have positive economic values. For too small or too large C/L ratios, the forecasts have no economic values. Here the cost means the coast of a protection action (such as airplane delay in case of lower airport visibility, e.g. < 800m) while the loss is the cost of property/life losses. A lower C/L ratio implies that the protection cost should not be very high compared to the property/life losses. User should have his/her own C/L ratio estimate before checking the EV plot to see if the VSREF visibility forecast has economic value to his/her application. Of cause, the EV shown here is over a short period. Only EV over a long term can the EV be useful for a practical application. Furthermore, improvements in the ensemble visibility forecast from VSREF, such as improved POD, FAR, ETS, and ROC, etc., can eventually improve the EV. In other words, we hope to have higher EV over a wider range of L/C ratios. 6. Summary and future plan To support the probabilistic requirement by NextGen, the Very Short Range Ensemble Forecast System (VSREF), as a prelude and a demonstration, has been experimentally developed with minimum computational resources through the time-lagged ensembling technique with 4 finished NAM and 6 finished RUC cycles as ensemble members weighted according to their ages. The VSREF is hourly updated out to 6 forecast hours and includes 11 aviation weather related ensemble products, including icing, clear air turbulence (CAT), ceiling height, visibility, fog, jet stream, low level wind shear (LLWS), convection, simulated reflectivity, freezing level and precipitation type. The products have been visualized and displayed at the NCEP web page for demonstration, which also can be accessed and evaluated by outside users. At this moment, only visibility product has been evaluated with limited data over a shorter period. But the deterministic verification has shown that the performance of the VSREF visibility ensemble forecast depends on selected visibility range thresholds and selected ensemble probability thresholds. The probabilistic verification indicated that the VSREF visibility ensemble forecast is skillful in terms of resolution and has economic values for lower cost/loss ratios. The VSREF products are still in development stage and will be further evaluated and improved by using better algorithms and adding new products in the future. Since RUC has been upgraded to 18 forecast hour length recently, the VSREF will be upgrade to 12 forecast hour length. To increase ensemble membership, the NCEP SREF members will also be tested as part of the VSREF in future. As has been planed, 6 NRRE members based on 3 NMM and 3 ARW cores, hourly updated with 24 forecast hour length, in -12km resolution over CONUS will be initiated followed by 6 HRRRE members, each nested within the six NRRE members, in 3km resolution, over CONUS and Alaska domain. 9

Acknowledgement We are grateful to Stan Benjamin and Steve Weygandt of GSD for their comments and suggestions. This work is funded by FAA. Reference Benjamin S., and G. DiMego, 20: Upcoming improvement in NOAA modeling toward meeting NextGen aviation requirements. Preprint, 14 th Conf. on Aviation, Range, and Aerospace Meteorology, Atlanta, GA, Amer. Meteor. Soc. Du, J., and M. S. Tracton, 2001: Implementation of a real-time short-range ensemble forecasting system at NCEP: an update. Preprints, 9th Conference on Mesoscale Processes, Ft. Lauderdale, Florida, Amer. Meteor. Soc., 355-356. Du, J., J. McQueen, G. DiMego, Z. Toth, D. Jovic, B. Zhou, and H. Chuang, 2006: New Dimension of NCEP Short-Range Ensemble Forecasting (SREF) System: Inclusion of WRF Members, Preprint, WMO Expert Team Meeting on Ensemble Prediction System, Exeter, UK, Feb. 6-, 2006. Ebisuzaki, W. and E. Kalnay, 1991: Ensemble experiments with a new lagged average forecasting scheme. WMO, Research activities in atmospheric and oceanic modeling. Report 15, 6.31-32. Ellrod G. P., and D. I. Knapp, 1992: An Objective Clear-Air Turbulence Forecasting Technique: Verification and Operational Use. Wea. and Forecasting, 7, 150-165 Federal Meteorological Handbook, No. 1, 1995 (FMH- 1 1995) Hou, D., E. Kalnay and K. K. Droegemeier, 2001: Objective verification of the SAMEX'98 ensemble forecasts. Mon. Wea. Rev., 129, 73-91. Koch, St., et al., 2005: The use of simulated radar reflectivity fields in the diagnosis of mesoscale phenomena from high-resolution WRF model forecasts. Preprint, 11 th Conf. on Mesoscale Processes and 32th Conf. on Radar Meteorology, Albuquerque, NM, Amer. Meteor. Soc. Office of the Federal Coordinator for Meteorology: National Aviation Weather Initiatives, FCM-P34-1999, Washington D.C. February 1999. Saha, S., et al, 2006: The NCEP Climate Forecast System. J. Climate, 19, 3483 3517. Smirnova, T. G., S. G. Benjamin and J. M. Brown, 2000: Case study verification of RUC/MAPS fog and visibility forecasts. Preprint, 9 th Conf. on Aviation, Range, and Aerospace Meteorology, Orlando, FL, Amer. Meteor. Soc. Souders, C. G., et al., 20: NextGen weather requirements: an update. Preprint, 14 th Conf. on Aviation, Range, and Aerospace Meteorology, Atlanta, GA, Amer. Meteor. Soc. Stoelinga M. T. and T. T. Warner, 1999: Nonhydrostatic, Mesobeta-scale model simulations of cloud ceiling and visibility for east coast winter precipitation event. J. Appl. Meteor. 38, 385-404. Toth, Z., and Kalnay, E., 1993. Ensemble Forecasting at the NMC: The generation of perturbations. Bull. Amer. Meteorol. Soc., 74, 2317-2330. Weygandt, S. S., et al, 2008: Hourly convective probability forecasts and experimental high-resolution predictions based on the radar reflectivity assimilating RUC model. Preprints, 13 th Conf. on Aviation, Range, and Aerospace Meteorology. New Orleans, LA, Amer. Meteor. Soc. Zhou B., et al, 2004: An Introduction to NCEP SREF Aviation Project. Preprint, 11th Conference on Aviation, Range, and Aerospace Meteorology, Hyannis, MA, Amer. Meteor. Soc. Zhou B. and J. Du: 20: Fog Prediction from a Multi- Model Mesoscale Ensemble Prediction System. Wea. Forecasting, (in press). Zhu Y. et al. 2002: The economic value of ensemblebased weather forecasts. Bull. of Amer. Meteor. Soc., January, 73-83. Lu, C., H. Yuan, B.E. Schwartz and S.G. Benjamin, 2007: Short-Range Numerical Weather Prediction Using Time-Lagged Ensembles. Wea. Forecasting, 22, 580 595.