The Calibration Simplex: A Generalization of the Reliability Diagram for Three-Category Probability Forecasts

Size: px
Start display at page:

Download "The Calibration Simplex: A Generalization of the Reliability Diagram for Three-Category Probability Forecasts"

Transcription

1 1210 W E A T H E R A N D F O R E C A S T I N G VOLUME 28 The Calibration Simplex: A Generalization of the Reliability Diagram for Three-Category Probability Forecasts DANIEL S. WILKS Department of Earth and Atmospheric Sciences, Cornell University, Ithaca, New York (Manuscript received 20 February 2013, in final form 10 May 2013) ABSTRACT Full exposition of the performance of a set of forecasts requires examination of the joint frequency distribution of those forecasts and their corresponding observations. In settings involving probability forecasts, this joint distribution has a high dimensionality, and communication of its information content is often best achieved graphically. This paper describes an extension of the well-known reliability diagram, which displays the joint distribution for probability forecasts of dichotomous events, to the case of probability forecasts for three disjoint events, such as below, near, and above normal. The resulting diagram, called the calibration simplex, involves a discretization of the 2-simplex, which is an equilateral triangle. Characteristics and interpretation of the calibration simplex are illustrated using both idealized verification datasets, and the and 8 14-day temperature and precipitation forecasts produced by the U.S. Climate Prediction Center. 1. Introduction Forecast verification is the process of systematically and quantitatively describing the relationships between forecasts and the events they are meant to predict. It is an essential process both for optimal use of forecasts in decision making (Katz and Murphy 1997), and as a part of the process of improving the forecasts themselves (e.g., Murphy 1997; Wilks 2011; Jolliffe and Stephenson 2012). Most often forecast quality is characterized using scalar (i.e., single number) verification statistics such as mean squared error, which has been referred to as the measures-oriented approach to verification (Murphy 1997). Although restricting attention to one or a small number of scalar statistics simplifies the verification process both computationally and conceptually, this practice inevitably masks some aspects of forecast performance even in the most restricted verification settings. The problem arises because the dimensionality (Murphy 1991) of the joint distribution of forecasts and observations (Murphy and Winkler 1987) is large, especially in settings involving probability forecasts. Distributions-oriented (Murphy 1997), or diagnostic Corresponding author address: Daniel S. Wilks, Dept. of Earth and Atmospheric Sciences, Bradfield Hall, Rm. 1113, Cornell University, Ithaca, NY dsw5@cornell.edu (Murphy et al. 1989; Murphy and Winkler 1992), verification methods communicate the richness of verification datasets by respecting their dimensionality, but at the same time must overcome the problem of how best to portray the high-dimensional information. Arguably the most effective approaches to communicating the full information content of a joint distribution of forecasts and observations have involved the use of well-designed graphics, although only a small number of these have yet been devised. Murphy et al. (1989) used bivariate histograms to convey the joint distribution of nonprobabilistic daily temperature forecasts and their corresponding observations, and also introduced the conditional quantile plot to portray the same information in terms of both the calibration refinement, and the likelihood-base rate factorizations (Murphy and Winkler 1987) of the joint distribution. By far the most commonly used diagnostic verification graphic is the reliability diagram (Murphy and Winkler 1977; Wilks 2011). Reliability diagrams show the calibration-refinement factorization of the joint distribution of probability forecasts for dichotomous (yes no) events, by plotting the subsample event relative frequency as a function of forecast probability (the calibration), and including also a plot of the frequency of use of each of the possible scalar probability forecasts (the refinement). Probability forecasts are most frequently issued for dichotomous events, so that the reliability diagram, or DOI: /WAF-D Ó 2013 American Meteorological Society

2 OCTOBER 2013 W I L K S 1211 the closely allied attributes diagram (Murphy and Hsu 1986), is appropriate and effective for graphically displaying the relevant joint distribution. However, probability forecasts may also be issued jointly for more than two predictand categories, notably including threecategory temperature or precipitation forecasts at lead times of a week and longer (e.g., Van den Dool 2007; Livezey and Timofeyeva 2008; Barnston et al. 2010). In this format the predictand categories are often defined in terms of the climatological terciles, so that the belownormal (cold or dry), near-normal, and above-normal (warm or wet) categories each have climatological occurrence probabilities of 1 /3. Diagnostic verification of such forecasts has been approached by reducing the three-element probability forecast vectors to collections of dichotomous probabilities (e.g., Wilks 2000; Wilks and Godfrey 2002; Barnston et al. 2010), but that approach is not fully satisfying because it neglects the relationships among probabilities assigned to the different events (Murphy and Hsu 1986). This paper proposes an extension of the well-known reliability diagram, to verification of probability forecast vectors pertaining to three distinct outcome categories using a two-dimensional graphic called the calibration simplex. The calibration simplex graphically represents the calibration-refinement factorization of the full joint distribution of these forecasts and their corresponding observations. Section 2 defines the structure of the calibration simplex in terms of the joint distribution of the forecasts and observations, and in relation to the conventional reliability diagram. Characteristic calibration simplex forms that are diagnostic for important aspects of forecast performance are illustrated in section 3, using artificial data. Section 4 shows example results using and 8 14-day forecasts for U.S. temperature and precipitation, and section 5 concludes the paper. 2. Structure of the calibration simplex in relation to the reliability diagram Diagnostic verification methods are those that communicate the joint frequency distribution of the forecasts and their corresponding observations (Murphy and Winkler 1987): Pr ff i, o j g 5 Prff i \ o j g; i 5 1,..., I; j 5 1,..., J. (1) Here, there are I distinct forecasts f i possible, each of which pertains to J possible observations or outcomes o j. In the case of forecasts for dichotomous outcomes, J 5 2. For the purposes of constructing a reliability diagram, the forecast probabilities are rounded to a discrete set of I values if they have not already been issued as such. For example, I 5 11 for probability forecasts rounded to 10ths, in which case the forecasts range from f through f It can be more informative to work with the joint distribution in terms of one of its factorizations (Murphy and Winkler 1987). The reliability diagram is based on the calibration-refinement factorization: Prff i, o j g 5 Prfo j j f i gprff i g; i 5 1,..., I; j 5 1,..., J. (2) Thus, the full joint distribution can be decomposed into a collection of I conditional distributions Prfo j j f i g of the observations given each of the possible forecasts f i, called the calibration distributions, and a single I-element frequency distribution Prff i g specifying the frequencies of use of the possible forecasts, called the refinement distribution. In the case of the reliability diagram, the calibration distributions are Bernoulli (i.e., binomial with N 5 1) distributions, defined by the probability distribution function: Prfo j f i g 5 p o i (1 2 p i )12o, o 5 0, 1, (3) where each p i is estimated by its empirical (in the verification dataset) conditional relative frequency ^p i of the yes event being forecast on occasions following the corresponding forecast f i. Each of the I calibration distributions is fully characterized by its estimated Bernoulli probability ^p i, and, collectively these define the vertical positions of the plotted points in the main portion of the reliability diagram. A histogram, or other quantitative representation of the refinement distribution, completes the graphical portrayal of Eq. (2). Probability vectors f i 5 [f B,i, f N,i, f A,i ] T pertaining to three mutually exclusive and collectively exhaustive outcomes (e.g., below, near, and above normal) can be plotted in two dimensions because the three forecast probabilities in each vector must sum to 1. The geometrically appropriate graph in this case is the regular 2- simplex (Epstein and Murphy 1965; Murphy 1972), which takes the shape of an equilateral triangle. Each of the corners of the 2-simplex corresponds to forecast certainty (i.e., 100% probability) for one of the three outcomes being forecast. The point within the simplex at which a three-element probability vector f i is plotted is located at distances proportional to the probabilities for each of the three outcomes, perpendicularly from the sides of the simplex opposite the respective corners. This barycentric plotting system (a plotting position is at the center of masses corresponding to the probabilities,

3 1212 W E A T H E R A N D F O R E C A S T I N G VOLUME 28 FIG. 1. Calibration simplexes illustrating well-calibrated forecasts exhibiting (a) lower and (b) higher sharpness. placed at the simplex vertices) generalizes the reliability diagram because the 1-simplex appropriate to two-element probability forecasts for dichotomous events is the unit interval on the real line, which is the horizontal axis of the reliability diagram. Figure 1 illustrates the plotting of discretized forecast vectors onto the simplex, which has been rendered as a tessellation of hexagons. Each scalar forecast probability has been rounded to one of the K 5 10 values 0/9, 1/9,..., 9/9, yielding I 5 K(K 1 1)/ distinct possible vector forecasts f i, each of which is represented by one of the hexagons. This choice for the discretization seems natural for forecasts pertaining to tercile-based outcome definitions, although any regular discretization (e.g., the K 5 11 scalar forecast elements 0.0, 0.1,...,1.0) may be used. The hexagons at the three vertices represent forecasts assigning all probability to the outcome labeled at that corner. Hexagons representing other forecast vectors are located at perpendicular distances from the respective opposite sides, which are indicated by the probability labels in the margins. For example, forecasts of equal probability for the middle category, f N, are located along the same horizontal row of hexagons (perpendicularly upward from the horizontal bottom edge of the simplex), as indicated by the probability labels increasing upward along the left edge of the figure. Probability labels for the above-normal category, f A, increase downward along the right edge of the simplex, and probability labels for the below-normal category, f B, increase to the left along the bottom edge of the simplex. The result, for example, is that the large dot in the center of Fig. 1a locates the climatological forecast vector f clim 5 [1/3, 1/3, 1/3] T. Similarly, the forecasts in Fig. 1a having the largest above-normal probabilities are [2/9, 2/9, 5/9] T and [1/9, 3/9, 5/9] T, which are represented by the glyphs located farthest to the right in that diagram. Any two of the three forecast vector elements are sufficient to locate that vector s position on the simplex. Other choices for arranging the labels on the simplex edges are possible and equally valid, but those in Fig. 1 have been used in order for the outcomes from below to above normal to increase from left to right. Figure 1 illustrates the graphical representation of refinement distributions Prff i g as glyph scatterplots (e.g., Wilks 2011), where the circle areas are proportional to the subsample sizes. Empty hexagons represent forecast vectors that were never used in the verification datasets under consideration. The two refinement distributions represented in Fig. 1 will be defined quantitatively in section 3. Generalizing Eq. (3) for the calibration distributions, in the case of three-element vector forecasts each of the I calibration distributions is multinomial, again with N 5 1: Prfo j f i g 5 p o B B,i po A A,i (1 2 p B,i 2 p A,i )12o 2o B A, o B, o A 5 0, 1, and o B 1 o A # 1. (4) Thus, each of the I calibration distributions are fully determined by any two of the three empirical (within the verification dataset) conditional relative frequencies ^p B,i, ^p A,i, and ^p N,i ^p B,i 2 ^p A,i of the three events being forecast by f i, and therefore can be represented by a two-dimensional vector defined by, for example, ^p B,i and ^p A,i within the ith hexagon of the simplex. Figure 2 illustrates schematically how these conditional relative frequencies for the below- and above-normal

4 OCTOBER 2013 W I L K S 1213 FIG. 2. Plotting schematic showing locations of refinement distribution glyphs for calibrated forecasts (zero miscalibration errors, gray dashed arrows and point), and miscalibrated forecasts (nonzero calibration errors, black solid arrows and point). toward the A vertex of the simplex, consistent with the conditional outcome vector having a higher relative frequency than forecast for the above-normal category. It is necessary to choose a scale for these refinement distribution glyph displacements, which should be indicated in a figure s legend. Here, this scale has been chosen as 2 /3 of a probability unit corresponding to a long (vertex to vertex) diameter of the hexagons. With this choice, displacement of a plotting glyph of 1 /3 the distance from the hexagon center to one of the corners equates to a miscalibration of 1 /9, which corresponds to one discrete probability category when K Of course the scale in Fig. 2 could be increased to one or as much as two probability units in order to deemphasize random variations due to small subsample sizes, or to accommodate large conditional miscalibration errors. Similarly, the scale could be decreased in order to better discern subtle miscalibrations exhibited by a generally well-calibrated set of forecasts with large subsample sizes. It may be more convenient when programming plotting software to use the horizontal and vertical Cartesian displacements from the hexagon centers, which are proportional (depending on the chosen scaling) to x 52tan(308)e B 1 tan(308)e A e B e A, and y 52e B 2 e A, respectively. 3. Idealized forecast examples outcomes are represented within each hexagon, using the corresponding miscalibration errors, or differences between the conditional average observations ^p B,i and ^p A,i, and the respective forecast vector elements: " eb,i e A,i # ^p B,i 2 f B,i 5. (5) ^p A,i 2 f A,i These vector conditional miscalibration errors are plotted within their respective hexagons, using barycentric coordinates similar to those in the overall simplex, but with the origins at the centers of the hexagons. For a well-calibrated forecast subsample f i, both elements of Eq. (5) will be zero, and the dot representing the corresponding subsample size will be plotted at the center of the ith hexagon, as illustrated by the gray dot in Fig. 2, and by all subsamples in both panels of Fig. 1. In contrast, the solid dot in Fig. 2 shows the plotting position when the above-normal category has been conditionally underforecast by 0.2 probability units, together with overforecasting of the below- and near-normal categories by 0.1 probability units each. In this case the location for the glyph representing the corresponding element of the refinement distribution will be displaced Characteristic patterns of simplex glyph sizes and placements are diagnostic for different aspects of overall forecast performance, analogously to the situation for the reliability diagram (Wilks 2011, p. 335). These are illustrated in this section using simple idealized statistical models for the calibration and refinement distributions, designed to demonstrate results for a range of cases with known expected behaviors. a. Sharpness: Refinement distributions Sharpness is a characteristic of the refinement distribution, independently of the relationship between the forecasts and observations. Forecast sharpness is commonly characterized using a measure of the dispersion of the refinement distribution, such as the standard deviation or variance (e.g., Wilks 2001). Forecasts that deviate frequently, including relatively large differences from, the climatological forecast are referred to as sharp. Forecasts exhibiting poor sharpness deviate rarely and relatively little from the climatological forecast. Although in the present setting the refinement distribution is trivariate, it is sufficient to define it in terms of the joint distribution of two of the three forecast probabilities because they must all sum to 1. Here, joint distributions of the log-odds transformations of f B and f A,

5 1214 W E A T H E R A N D F O R E C A S T I N G VOLUME 28 (f ) 5 ln f 1 2 f, (6) will be modeled using bivariate normal distributions, in order to ensure 0, f B, 1 and 0, f A, 1. This is an arbitrary but convenient assumption that allows an easy exposition of the characteristic diagnostics for the diagram. The climatological forecast of f B 5 f A 5 1/3 maps to E[ (f B )] 5 E[ (f A )] for the log-odds bivariate normal distributions. In addition, a strong negative correlation of yields vanishingly small probabilities for the impossible f B 1 f A. 1, as well as relatively small deviations of the near-normal forecast f N from its mean value of 1 /3. Within this framework, relatively sharp forecasts are generated using var[ (f B )] 5 var[ (f A )] 5 1, and forecasts with low sharpness are generated using var[ (f B )] 5 var[ (f A )] Some 10 6 independent bivariate normal realizations were generated to construct each of the two refinement distributions. Figure 1 shows these two refinement distributions, plotted as glyph histogram scatterplots on the simplex. In both cases the resulting forecasts are shown as being perfectly calibrated, so all of the circles representing subsample relative frequencies are plotted at the centers of their hexagons. For the low-sharpness forecasts in Fig. 1a, it is clear that the most common forecast is f clim 5 ( 1 /3, 1 /3, 1 /3) T, which accounts for 68.3% of the 10 6 realizations. Furthermore, the individual forecast elements deviate no more than 2 /9 from f clim, and only very rarely are those deviations larger than 1 /9. In contrast, the sharper forecasts in Fig. 1b often deviate quite strongly from f clim (which accounts for only 10.7% of the 10 6 realizations), and take on values throughout the unit interval for the extreme-category probabilities f B and f A. b. Conditional and unconditional biases: Calibration distributions Figure 3 shows calibration simplexes illustrating overconfident forecasts (Fig. 3a), underconfident forecasts (Fig. 3a), and forecasts exhibiting an unconditional overforecasting bias (Fig. 3c)for the above-normal category. Each of these cases is illustrated using the highersharpness refinement distribution shown in Fig. 1b. For the overconfident forecasts shown in Fig. 3a, the parameters of the multinomial calibration distributions [Eq. (4)] are based on those used to define overconfident idealized reliability diagrams in Wilks (2001): ^p B,i (f B,i ) 5 (f B,i 1 1/3)/2 ^p N,i (f N,i ) 5 2/3 2 (f B,i 1 f A,i )/2 ^p A,i (f A,i ) 5 (f A,i 1 1/3)/2: (7) FIG. 3. Calibration simplexes illustrating (a) overconfident [Eq. (7)], (b) underconfident [Eq. (8)], and (c) unconditionally biased [Eq. (9)] forecasts. In each case the refinement distribution is the same as in Fig. 1b.

6 OCTOBER 2013 W I L K S 1215 These functional relationships describe conditionally biased forecasts, which exhibit overforecasting for forecast probabilities above the climatological 1 /3, andunderforecasting for forecast probabilities below 1 /3. These were chosen in Wilks (2001) to correspond to the no skill line (Murphy and Hsu 1986) on the attributes diagram. If such forecasts had been generated by a dynamical ensemble forecasting system, the overconfidence would be diagnostic for underdispersion (Wilks 2011, p. 372). Figure 3a shows that the signature for overconfidence in the calibration simplex is a displacement of the relative frequency glyphs toward the center of the diagram, analogously to the calibration function in conventional reliability or attributes diagrams being tilted from the ideal 458 diagonal toward the horizontal, climatological, no resolution (Murphy and Hsu 1986) line. It can be seen in Fig. 3a that the degree of miscalibration increases as the forecast probabilities become more extreme, so that the glyphs for the most extreme probabilities in the two lower corners of the diagram (f B 5 9/9 and f A 5 9/9) have been plotted at the hexagon vertices nearest the center of the diagram, indicating conditional miscalibrations of 1 /3 of a probability unit. Figure 3b shows the calibration simplex for conditionally biased forecasts that are underconfident, in the sense that probabilities smaller than 1 /3 are overforecast and probabilities larger than 1 /3 are underforecast, according to ^p B,i (f B,i ) 5 2f B,i 2 1/3, ^p N,i (f N,i ) 5 2(1 2 f B,i 2 f A,i ) 2 1/3, and ^p A,i (f A,i ) 5 2f A,i 2 1/3: (8) Here, conditional average observations more extreme than the unit interval are truncated at 0 or 1, and the values for the three outcomes are renormalized to sum to 1 if necessary. The relationships in Eq. (8) also generalize the underconfident calibration function used in Wilks (2001), and would be diagnostic for overdispersion of a dynamical ensemble. Figure 3b shows that the signature for underconfidence in the calibration simplex is the displacement of the relative frequency glyphs away from the climatological forecast at the center of the diagram, and toward the corners of the simplex. Again, this is analogous to the signature for underconfidence in the reliability diagram, in which the calibration function is tilted at an angle steeper than 458, and away from the climatological horizontal no-resolution line. Finally, Fig. 3c shows the calibration simplex for unconditionally biased forecasts, exhibiting uniform overforecasting of the above-normal category equally at the expense of both the below- and near-normal categories: ^p B,i (f B,i ) 5 f B,i 1 1/9, ^p N,i (f N,i ) 5 f N,i 1 1/9, and ^p A,i (f A,i ) 5 f A,i 2 2/9. (9) Again, results outside the unit interval are truncated and renormalized if necessary. In Fig. 3c the relative frequency glyphs are displaced upward and to the left, away from the A corner of the simplex. Equivalently, the below- and near-normal categories have been uniformly and equally underforecast, at the expense of the abovenormal category, and the result is that the glyphs are displaced toward the simplex edge connecting the B and N corners. 4. Real forecast examples This section illustrates the use of the calibration simplex to understand the performance of the Climate Prediction Center (CPC) extended range forecasts for average temperature and accumulated precipitation, at lead times of 6 10 and 8 14 days. These are subjective probability forecasts generated on weekdays, during for the 6 10-day forecasts, and for the 8 14-day forecasts, and interpolated to station locations from the graphical map products posted operationally online ( The text-format versions and general information on these forecasts are also available online ( products/archives/short_range/ and noaa.gov/products/archives/short_range/readme.6-10day.txt, respectively). Figure 4 shows the calibration simplexes for the temperature forecasts, which include n six-toten-day forecasts (Fig. 4a) and n eight-tofourteen-day forecasts (Fig. 4b). Glyphs are plotted only for forecasts having subsample sizes of 20 or more, and verifying categories have been determined relative to the normals for forecasts made through April 2011, and using the normals thereafter. At each of the lead times, the overwhelming majority of forecasts include the climatological or near-climatological near-normal forecast f N 5 1 /3, consistent with the conventional expectation that forecasts of the near-normal category will exhibit intrinsically weak skill (Van den Dool and Toth 1991). Thus, the sharpness for the nearnormal probabilities is quite low. The most frequent forecast for each of the lead times is the climatological

7 1216 W E A T H E R A N D F O R E C A S T I N G VOLUME 28 FIG. 4. Calibration simplexes for (a) and (b) 8 14-day temperature forecasts. probability vector f clim, which was issued for 34.9% of the 6 10-day forecasts and 35.4% of the 8 14-day forecasts. These percentages correspond to n clim and for the climatological forecast f clim (largest dots in the middles of the plots) in Figs. 4a and 4b, respectively (cf. glyph sizes in the legend). The corresponding error vectors [e B, e N, e A ] T are [20.085, 0.056, 0.029] T (Fig. 4a) and [20.88, 0.036, 0.052] T (Fig. 4b). Accordingly, both of these glyphs have been displaced away from the B vertices, indicating too few belownormal verifications when the climatological temperature forecast was issued. The more extreme probability vectors were used correspondingly less frequently at the 8 14-day lead time, but overall the 6 10-day forecasts are only slightly sharper. For both lead times, the temperature forecasts are only moderately well calibrated, with typical miscalibration errors in the range of 1/9 2/9. The abovenormal outcome is underforecast for the larger ($4/9) above-normal and near-normal probabilities, as these subsample-size glyphs are displaced toward the A corner of the simplex. The glyph representing the very large subsample of f clim forecasts is displaced toward the edge connecting the N and A vertices, indicating a smaller fraction of below-normal outcomes than the climatologically expected 1 /3. Both of these results are consistent with the baseline 30-yr normals lagging the quasi-linear warming trend that has been evident in U.S. temperature data since the mid-1970s (e.g., Livezey et al. 2007; Wilks 2013), together with the mean forecasts not fully tracking the warming, as has also been observed for seasonal tercile forecasts (Wilks 2000; Wilks and Godfrey 2002; Barnston et al. 2010). On the other hand, displacement of glyphs away from the simplex center for below-normal probabilities of 4/9 and larger indicates an overall underconfidence (consistent with the pattern of glyph dispersion away from the center of Fig. 3b), suggesting that these forecasts could be somewhat sharper without degrading their skill. Calibration simplexes for the precipitation forecasts are shown in Fig. 5, which include n six-to-tenday forecasts (Fig. 5a) and n eight-to-fourteenday forecasts (Fig. 5b). The precipitation forecasts exhibit notably less sharpness than their temperature counterparts in Fig. 4, with 44.9% of the 6 10-day forecasts and 44.7% of the 8 14-day forecasts using the climatological probabilities f clim. As was the case for the temperature forecasts, f N 5 1 /3 is overwhelmingly the most common near-normal forecast. The most extreme probabilities have been used somewhat less frequently in the 8 14-day precipitation forecasts, but overall these are only slightly less sharp that the 6 10-day forecasts. At both lead times there is a clear tendency for the subsample-size glyphs to be displaced toward the B vertex, indicating underforecasting of the below-normal category, and corresponding overforecasting in roughly equal proportions of the near-normal and above-normal outcomes. Figure 5 also indicates strong overconfidence in the larger ($4/9) probabilities for the near-normal category, for both lead times. 5. Summary and conclusions Gaining a full appreciation of the performance of a set of forecasts requires investigation of the joint distribution of the forecasts and corresponding observations (Murphy and Winkler 1987), and a graphical approach to this exposition will often be the most immediately

8 OCTOBER 2013 W I L K S 1217 FIG. 5. As in Fig. 4, but for precipitation forecasts. informative. The well-known reliability diagram for probability forecasts of dichotomous events is the most commonly encountered such graphical device. This paper has defined and illustrated a natural extension of the reliability diagram to probability forecasts for three disjoint events, called the calibration simplex. It displays the refinement distribution for such forecasts using a glyph scatterplot histogram of the K(K 1 1)/2 possible vector forecasts that result when each probability element has been rounded to one of K discrete values. Simultaneously, it shows the two empirical outcome relative frequencies conditional on each forecast vector that are necessary to fully characterize the corresponding refinement distributions, using displacements of the glyphs from central plotting locations. Two somewhat similar approaches to this same problem have been proposed previously, although apparently neither has been used subsequently. Epstein and Murphy (1965) suggested that the information in the calibration distributions (although not using this terminology) could be sketched by printing numerical values of the scalar lengths of the calibration error vectors [Eq. (5)] inside each of a tessellation of equilateral triangles within the 2-simpex. However, this approach neither expressed the vector nature of the miscalibration errors, nor did it communicate the refinement distribution in any way. Murphy and Daan (1985, p. 418) proposed to draw line segments on the 2-simplex connecting points locating each distinct forecast vector f i with its corresponding average outcome vector ^p i, and indicating the corresponding subsample sizes with numerals printed at the midpoints of these line segments. This approach does indeed express the full information content of the joint distribution, but overall the diagram is somewhat difficult to read. Recently, this latter idea has been independently rediscovered by Jupp et al. (2012), who call it the ternary reliability diagram. They also propose a color scheme for representing the threeelement probability vectors. In principle, the calibration simplex could be extended to settings having more than three predictand categories, although this would be difficult or impossible to realize in practice because of graphical and cognitive limitations with respect to visually rendering the highdimensional geometries. For example the 3-simplex, which would be appropriate for probability forecasts of four distinct events, is a three-dimensional regular tetrahedron, within the volume of which the spherical glyphs corresponding to possible four-element forecast vectors would reside. It is difficult to imagine how such a geometrical object could be presented graphically in an understandable way. Use of the calibration simplex has been illustrated using the CPC and 8 14-day subjective temperature and precipitation forecasts. These temperature forecasts exhibit an unconditional bias consistent with the average forecasts lagging the ongoing climate warming, as has been observed also for seasonal forecasts, but these graphs also indicate that greater sharpness for the below- and above-normal elements of the forecast vectors could be employed without degrading the overall accuracy or skill. The near-normal element of the precipitation forecast vectors were seen to be strongly overconfident, with overall underforecasting of the below-normal category, and corresponding overforecasting in roughly equal proportions of the near-normal and above-normal outcomes.

9 1218 W E A T H E R A N D F O R E C A S T I N G VOLUME 28 Acknowledgments. I thank Scott Handel for supplying the CPC forecasts and corresponding verifications. This research was supported by the National Science Foundation under Grant AGS REFERENCES Barnston, A. G., S. Li, S. J. Mason, D. G. DeWitt, L. Goddard, and X. Gong, 2010: Verification of the first 11 years of IRI s seasonal climate forecasts. J. Appl. Climatol. Meteor., 49, Epstein, E. S., and A. H. Murphy, 1965: A note on the attributes of probabilistic predictions and the probability score. J. Appl. Meteor., 4, Jolliffe, I. T., and D. B. Stephenson, 2012: Introduction. Forecast Verification: A Practitioner s Guide in Atmospheric Sciences, I. T. Jolliffe and D. B. Stephenson, Eds., Wiley-Blackwell, 1 9. Jupp, T. E., R. Lowe, C. A. S. Coelho, and D. B. Stephenson, 2012: On the visualization, verification and recalibration of ternary probabilistic forecasts. Philos. Trans. Roy. Soc. London, A370, Katz, R. W., and A. H. Murphy, Eds., 1997: Economic Value of Weather and Climate Forecasts. Cambridge University Press, 222 pp. Livezey, R. E., and M. M. Timofeyeva, 2008: The first decade of long-lead U.S. seasonal forecasts Insights from a skill analysis. Bull. Amer. Meteor. Soc., 89, , K. Y. Vinnikov, M. M. Timofeyeva, R. Tinker, and H. M. Van den Dool, 2007: Estimation and extrapolation of climate normals and climatic trends. J. Appl. Meteor. Climatol., 46, Murphy, A. H., 1972: Scalar and vector partitions of the probability score: Part II. N-state situation. J. Appl. Meteor., 11, , 1991: Forecast verification: Its complexity and dimensionality. Mon. Wea. Rev., 119, , 1997: Forecast verification. Economic Value of Weather and Climate Forecasts, R.W. Katz and A.H. Murphy, Eds., Cambridge University Press, , and R. L. Winkler, 1977: Reliability of subjective probability forecasts of precipitation and temperature. Appl. Stat., 26, , and H. Daan, 1985: Forecast evaluation. Probability, Statistics, and Decision Making in the Atmospheric Sciences, A. H. Murphy and R. W. Katz, Eds., Westview, , and W.-R. Hsu, 1986: The attributes diagram: A geometrical framework for assessing the quality of probability forecasts. Int. J. Forecasting, 2, , and R. L. Winkler, 1987: A general framework for forecast verification. Mon. Wea. Rev., 115, , and, 1992: Diagnostic verification of probability forecasts. Int. J. Forecasting, 7, , B. G. Brown, and Y.-S. Chen, 1989: Diagnostic verification of temperature forecasts. Wea. Forecasting, 4, Van den Dool, H., 2007: Empirical Methods in Short-Term Climate Prediction. Oxford University Press, 215 pp., and Z. Toth, 1991: Why do forecasts for near normal often fail? Wea. Forecasting, 6, Wilks, D. S., 2000: Diagnostic verification of the Climate Prediction Center long-lead outlooks, J. Climate, 13, , 2001: A skill score based on economic value for probability forecasts. Meteor. Appl., 8, , 2011: Statistical Methods in the Atmospheric Sciences. 3rd ed. Academic Press, 676 pp., 2013: Projecting normals in a nonstationary climate. J. Appl. Meteor. Climatol., 52, , and C. M. Godfrey, 2002: Diagnostic verification of the IRI net assessment forecasts, J. Climate, 15,

Ensemble Verification Metrics

Ensemble Verification Metrics Ensemble Verification Metrics Debbie Hudson (Bureau of Meteorology, Australia) ECMWF Annual Seminar 207 Acknowledgements: Beth Ebert Overview. Introduction 2. Attributes of forecast quality 3. Metrics:

More information

Enhancing Weather Information with Probability Forecasts. An Information Statement of the American Meteorological Society

Enhancing Weather Information with Probability Forecasts. An Information Statement of the American Meteorological Society Enhancing Weather Information with Probability Forecasts An Information Statement of the American Meteorological Society (Adopted by AMS Council on 12 May 2008) Bull. Amer. Meteor. Soc., 89 Summary This

More information

Proper Scores for Probability Forecasts Can Never Be Equitable

Proper Scores for Probability Forecasts Can Never Be Equitable APRIL 2008 J O L LIFFE AND STEPHENSON 1505 Proper Scores for Probability Forecasts Can Never Be Equitable IAN T. JOLLIFFE AND DAVID B. STEPHENSON School of Engineering, Computing, and Mathematics, University

More information

Model verification / validation A distributions-oriented approach

Model verification / validation A distributions-oriented approach Model verification / validation A distributions-oriented approach Dr. Christian Ohlwein Hans-Ertel-Centre for Weather Research Meteorological Institute, University of Bonn, Germany Ringvorlesung: Quantitative

More information

NOTES AND CORRESPONDENCE. Improving Week-2 Forecasts with Multimodel Reforecast Ensembles

NOTES AND CORRESPONDENCE. Improving Week-2 Forecasts with Multimodel Reforecast Ensembles AUGUST 2006 N O T E S A N D C O R R E S P O N D E N C E 2279 NOTES AND CORRESPONDENCE Improving Week-2 Forecasts with Multimodel Reforecast Ensembles JEFFREY S. WHITAKER AND XUE WEI NOAA CIRES Climate

More information

Realizations of Daily Weather in Forecast Seasonal Climate

Realizations of Daily Weather in Forecast Seasonal Climate APRIL 00 WILKS 195 Realizations of Daily Weather in Forecast Seasonal Climate D. S. WILKS Department of Earth and Atmospheric Sciences, Cornell University, Ithaca, New York (Manuscript received 6 March

More information

A skill score based on economic value for probability forecasts

A skill score based on economic value for probability forecasts Meteorol. Appl. 8, 209 219 (2001) A skill score based on economic value for probability forecasts D S Wilks, Department of Earth and Atmospheric Sciences, Cornell University, Ithaca, New York, USA An approach

More information

PRICING AND PROBABILITY DISTRIBUTIONS OF ATMOSPHERIC VARIABLES

PRICING AND PROBABILITY DISTRIBUTIONS OF ATMOSPHERIC VARIABLES PRICING AND PROBABILITY DISTRIBUTIONS OF ATMOSPHERIC VARIABLES TECHNICAL WHITE PAPER WILLIAM M. BRIGGS Abstract. Current methods of assessing the probability distributions of atmospheric variables are

More information

Distributions-Oriented Verification of Probability Forecasts for Small Data Samples

Distributions-Oriented Verification of Probability Forecasts for Small Data Samples 903 Distributions-Oriented Verification of Probability Forecasts for Small Data Samples A. ALLE BRADLEY AD TEMPEI HASHIO IIHR Hydroscience and Engineering, and Department of Civil and Environmental Engineering,

More information

Basic Verification Concepts

Basic Verification Concepts Basic Verification Concepts Barbara Brown National Center for Atmospheric Research Boulder Colorado USA bgb@ucar.edu Basic concepts - outline What is verification? Why verify? Identifying verification

More information

Probabilistic verification

Probabilistic verification Probabilistic verification Chiara Marsigli with the help of the WG and Laurie Wilson in particular Goals of this session Increase understanding of scores used for probability forecast verification Characteristics,

More information

Methods of forecast verification

Methods of forecast verification Methods of forecast verification Kiyotoshi Takahashi Climate Prediction Division Japan Meteorological Agency 1 Outline 1. Purposes of verification 2. Verification methods For deterministic forecasts For

More information

The economic value of weather forecasts for decision-making problems in the profit/loss situation

The economic value of weather forecasts for decision-making problems in the profit/loss situation METEOROLOGICAL APPLICATIONS Meteorol. Appl. 4: 455 463 (27) Published online in Wiley InterScience (www.interscience.wiley.com) DOI:.2/met.44 The economic value of weather forecasts for decision-making

More information

Coefficients for Debiasing Forecasts

Coefficients for Debiasing Forecasts Reprinted from MONTHLY WEATHER REVIEW, VOI. 119, NO. 8, August 1991 American Meteorological Society Coefficients for Debiasing Forecasts University Center for Policy Research, The University at Albany,

More information

Validation of Forecasts (Forecast Verification) Overview. Ian Jolliffe

Validation of Forecasts (Forecast Verification) Overview. Ian Jolliffe Validation of Forecasts (Forecast Verification) Overview Ian Jolliffe 1 Outline 1. Introduction and history (4) 2. Types of forecast (2) 3. Properties of forecasts (3) verification measures (2) 4. Terminology

More information

Evaluating Forecast Quality

Evaluating Forecast Quality Evaluating Forecast Quality Simon J. Mason International Research Institute for Climate Prediction Questions How do we decide whether a forecast was correct? How do we decide whether a set of forecasts

More information

Verification of Continuous Forecasts

Verification of Continuous Forecasts Verification of Continuous Forecasts Presented by Barbara Brown Including contributions by Tressa Fowler, Barbara Casati, Laurence Wilson, and others Exploratory methods Scatter plots Discrimination plots

More information

Developing Operational MME Forecasts for Subseasonal Timescales

Developing Operational MME Forecasts for Subseasonal Timescales Developing Operational MME Forecasts for Subseasonal Timescales Dan C. Collins NOAA Climate Prediction Center (CPC) Acknowledgements: Stephen Baxter and Augustin Vintzileos (CPC and UMD) 1 Outline I. Operational

More information

J11.5 HYDROLOGIC APPLICATIONS OF SHORT AND MEDIUM RANGE ENSEMBLE FORECASTS IN THE NWS ADVANCED HYDROLOGIC PREDICTION SERVICES (AHPS)

J11.5 HYDROLOGIC APPLICATIONS OF SHORT AND MEDIUM RANGE ENSEMBLE FORECASTS IN THE NWS ADVANCED HYDROLOGIC PREDICTION SERVICES (AHPS) J11.5 HYDROLOGIC APPLICATIONS OF SHORT AND MEDIUM RANGE ENSEMBLE FORECASTS IN THE NWS ADVANCED HYDROLOGIC PREDICTION SERVICES (AHPS) Mary Mullusky*, Julie Demargne, Edwin Welles, Limin Wu and John Schaake

More information

Predicting uncertainty in forecasts of weather and climate (Also published as ECMWF Technical Memorandum No. 294)

Predicting uncertainty in forecasts of weather and climate (Also published as ECMWF Technical Memorandum No. 294) Predicting uncertainty in forecasts of weather and climate (Also published as ECMWF Technical Memorandum No. 294) By T.N. Palmer Research Department November 999 Abstract The predictability of weather

More information

Reduced Overdispersion in Stochastic Weather Generators for Statistical Downscaling of Seasonal Forecasts and Climate Change Scenarios

Reduced Overdispersion in Stochastic Weather Generators for Statistical Downscaling of Seasonal Forecasts and Climate Change Scenarios Reduced Overdispersion in Stochastic Weather Generators for Statistical Downscaling of Seasonal Forecasts and Climate Change Scenarios Yongku Kim Institute for Mathematics Applied to Geosciences National

More information

Verification of African RCOF Forecasts

Verification of African RCOF Forecasts Technical Report #09-02 RCOF Review 2008 Position Paper: Verification of African RCOF Forecasts by Dr. Simon Mason and Mr. Simbarashe Chidzambwa Dr Simon J. Mason International Research Institute for Climate

More information

The Impact of Horizontal Resolution and Ensemble Size on Probabilistic Forecasts of Precipitation by the ECMWF EPS

The Impact of Horizontal Resolution and Ensemble Size on Probabilistic Forecasts of Precipitation by the ECMWF EPS The Impact of Horizontal Resolution and Ensemble Size on Probabilistic Forecasts of Precipitation by the ECMWF EPS S. L. Mullen Univ. of Arizona R. Buizza ECMWF University of Wisconsin Predictability Workshop,

More information

Assessing the skill of yes/no forecasts for Markov observations

Assessing the skill of yes/no forecasts for Markov observations Assessing the skill of yes/no forecasts for Markov observations William Briggs General Internal Medicine, Weill Cornell Medical College 525 E. 68th, Box 46, New York, NY 10021 email: wib2004@med.cornell.edu

More information

Upgrade of JMA s Typhoon Ensemble Prediction System

Upgrade of JMA s Typhoon Ensemble Prediction System Upgrade of JMA s Typhoon Ensemble Prediction System Masayuki Kyouda Numerical Prediction Division, Japan Meteorological Agency and Masakazu Higaki Office of Marine Prediction, Japan Meteorological Agency

More information

Estimation of Seasonal Precipitation Tercile-Based Categorical Probabilities from Ensembles

Estimation of Seasonal Precipitation Tercile-Based Categorical Probabilities from Ensembles 10 J O U R A L O F C L I M A T E VOLUME 0 Estimation of Seasonal Precipitation Tercile-Based Categorical Probabilities from Ensembles MICHAEL K. TIPPETT, ATHOY G. BARSTO, AD ADREW W. ROBERTSO International

More information

Estimation of seasonal precipitation tercile-based categorical probabilities. from ensembles. April 27, 2006

Estimation of seasonal precipitation tercile-based categorical probabilities. from ensembles. April 27, 2006 Estimation of seasonal precipitation tercile-based categorical probabilities from ensembles MICHAEL K. TIPPETT, ANTHONY G. BARNSTON AND ANDREW W. ROBERTSON International Research Institute for Climate

More information

JP3.7 SHORT-RANGE ENSEMBLE PRECIPITATION FORECASTS FOR NWS ADVANCED HYDROLOGIC PREDICTION SERVICES (AHPS): PARAMETER ESTIMATION ISSUES

JP3.7 SHORT-RANGE ENSEMBLE PRECIPITATION FORECASTS FOR NWS ADVANCED HYDROLOGIC PREDICTION SERVICES (AHPS): PARAMETER ESTIMATION ISSUES JP3.7 SHORT-RANGE ENSEMBLE PRECIPITATION FORECASTS FOR NWS ADVANCED HYDROLOGIC PREDICTION SERVICES (AHPS): PARAMETER ESTIMATION ISSUES John Schaake*, Mary Mullusky, Edwin Welles and Limin Wu Hydrology

More information

12.2 PROBABILISTIC GUIDANCE OF AVIATION HAZARDS FOR TRANSOCEANIC FLIGHTS

12.2 PROBABILISTIC GUIDANCE OF AVIATION HAZARDS FOR TRANSOCEANIC FLIGHTS 12.2 PROBABILISTIC GUIDANCE OF AVIATION HAZARDS FOR TRANSOCEANIC FLIGHTS K. A. Stone, M. Steiner, J. O. Pinto, C. P. Kalb, C. J. Kessinger NCAR, Boulder, CO M. Strahan Aviation Weather Center, Kansas City,

More information

The U. S. Winter Outlook

The U. S. Winter Outlook The 2017-2018 U. S. Winter Outlook Michael Halpert Deputy Director Climate Prediction Center Mike.Halpert@noaa.gov http://www.cpc.ncep.noaa.gov Outline About the Seasonal Outlook Review of 2016-17 U. S.

More information

Verification of Probability Forecasts

Verification of Probability Forecasts Verification of Probability Forecasts Beth Ebert Bureau of Meteorology Research Centre (BMRC) Melbourne, Australia 3rd International Verification Methods Workshop, 29 January 2 February 27 Topics Verification

More information

Standardized Verification System for Long-Range Forecasts. Simon Mason

Standardized Verification System for Long-Range Forecasts. Simon Mason Standardized Verification System for Long-Range Forecasts Simon Mason simon@iri.columbia.edu MedCOF 2015 Training Workshop Madrid, Spain, 26 30 October 2015 SVS for LRF: Goal Provide verification information

More information

BAYESIAN PROCESSOR OF OUTPUT FOR PROBABILISTIC FORECASTING OF PRECIPITATION OCCURRENCE. Coire J. Maranzano. Department of Systems Engineering

BAYESIAN PROCESSOR OF OUTPUT FOR PROBABILISTIC FORECASTING OF PRECIPITATION OCCURRENCE. Coire J. Maranzano. Department of Systems Engineering BAYESIAN PROCESSOR OF OUTPUT FOR PROBABILISTIC FORECASTING OF PRECIPITATION OCCURRENCE By Coire J. Maranzano Department of Systems Engineering University of Virginia P.O. Box 400747 Charlottesville, VA

More information

Combining Deterministic and Probabilistic Methods to Produce Gridded Climatologies

Combining Deterministic and Probabilistic Methods to Produce Gridded Climatologies Combining Deterministic and Probabilistic Methods to Produce Gridded Climatologies Michael Squires Alan McNab National Climatic Data Center (NCDC - NOAA) Asheville, NC Abstract There are nearly 8,000 sites

More information

Evaluating weather and climate forecasts

Evaluating weather and climate forecasts Evaluating weather and climate forecasts Chris Ferro Department of Mathematics University of Exeter, UK RSS Highlands Local Group and University of St Andrews (St Andrews, 1 September 2016) Monitoring

More information

NIWA Outlook: September October November 2013

NIWA Outlook: September October November 2013 September-November 2013 Issued: 30 August 2013 Hold mouse over links and press ctrl + left click to jump to the information you require: Overview Regional predictions for the next three months: Northland,

More information

Upscaled and fuzzy probabilistic forecasts: verification results

Upscaled and fuzzy probabilistic forecasts: verification results 4 Predictability and Ensemble Methods 124 Upscaled and fuzzy probabilistic forecasts: verification results Zied Ben Bouallègue Deutscher Wetterdienst (DWD), Frankfurter Str. 135, 63067 Offenbach, Germany

More information

Seasonal forecasts presented by:

Seasonal forecasts presented by: Seasonal forecasts presented by: Latest Update: 9 February 2019 The seasonal forecasts presented here by Seasonal Forecast Worx are based on forecast output of the coupled ocean-atmosphere models administered

More information

Assessment of Ensemble Forecasts

Assessment of Ensemble Forecasts Assessment of Ensemble Forecasts S. L. Mullen Univ. of Arizona HEPEX Workshop, 7 March 2004 Talk Overview Ensemble Performance for Precipitation Global EPS and Mesoscale 12 km RSM Biases, Event Discrimination

More information

4.3.2 Configuration. 4.3 Ensemble Prediction System Introduction

4.3.2 Configuration. 4.3 Ensemble Prediction System Introduction 4.3 Ensemble Prediction System 4.3.1 Introduction JMA launched its operational ensemble prediction systems (EPSs) for one-month forecasting, one-week forecasting, and seasonal forecasting in March of 1996,

More information

Deterministic and Probabilistic prediction approaches in Seasonal to Inter-annual climate forecasting

Deterministic and Probabilistic prediction approaches in Seasonal to Inter-annual climate forecasting RA 1 EXPERT MEETING ON THE APPLICATION OF CLIMATE FORECASTS FOR AGRICULTURE Banjul, Gambia, 9-13 December 2002 Deterministic and Probabilistic prediction approaches in Seasonal to Inter-annual climate

More information

Elements of probability theory

Elements of probability theory The role of probability theory in statistics We collect data so as to provide evidentiary support for answers we give to our many questions about the world (and in our particular case, about the business

More information

Will it rain? Predictability, risk assessment and the need for ensemble forecasts

Will it rain? Predictability, risk assessment and the need for ensemble forecasts Will it rain? Predictability, risk assessment and the need for ensemble forecasts David Richardson European Centre for Medium-Range Weather Forecasts Shinfield Park, Reading, RG2 9AX, UK Tel. +44 118 949

More information

Peter P. Neilley. And. Kurt A. Hanson. Weather Services International, Inc. 400 Minuteman Road Andover, MA 01810

Peter P. Neilley. And. Kurt A. Hanson. Weather Services International, Inc. 400 Minuteman Road Andover, MA 01810 6.4 ARE MODEL OUTPUT STATISTICS STILL NEEDED? Peter P. Neilley And Kurt A. Hanson Weather Services International, Inc. 400 Minuteman Road Andover, MA 01810 1. Introduction. Model Output Statistics (MOS)

More information

BAYESIAN APPROACH TO DECISION MAKING USING ENSEMBLE WEATHER FORECASTS

BAYESIAN APPROACH TO DECISION MAKING USING ENSEMBLE WEATHER FORECASTS BAYESIAN APPROACH TO DECISION MAKING USING ENSEMBLE WEATHER FORECASTS Rick Katz Institute for Study of Society and Environment National Center for Atmospheric Research Email: rwk@ucar.edu Paper: R.W. Katz

More information

Overview of the TAMSAT drought forecasting system

Overview of the TAMSAT drought forecasting system Overview of the TAMSAT drought forecasting system The TAMSAT drought forecasting system produces probabilistic forecasts of drought by combining information on the contemporaneous condition of the land

More information

Multivariate verification: Motivation, Complexity, Examples

Multivariate verification: Motivation, Complexity, Examples Multivariate verification: Motivation, Complexity, Examples A.Hense, A. Röpnack, J. Keune, R. Glowienka-Hense, S. Stolzenberger, H. Weinert Berlin, May, 5th 2017 Motivations for MV verification Data assimilation

More information

Fairfield Public Schools

Fairfield Public Schools Mathematics Fairfield Public Schools PRE-CALCULUS 40 Pre-Calculus 40 BOE Approved 04/08/2014 1 PRE-CALCULUS 40 Critical Areas of Focus Pre-calculus combines the trigonometric, geometric, and algebraic

More information

presented by: Latest update: 11 January 2018

presented by: Latest update: 11 January 2018 Seasonal forecasts presented by: Latest update: 11 January 2018 The seasonal forecasts presented here by Seasonal Forecast Worx are based on forecast output of the coupled ocean-atmosphere models administered

More information

Downscaling in Time. Andrew W. Robertson, IRI. Advanced Training Institute on Climate Variability and Food Security, 12 July 2002

Downscaling in Time. Andrew W. Robertson, IRI. Advanced Training Institute on Climate Variability and Food Security, 12 July 2002 Downscaling in Time Andrew W. Robertson, IRI Advanced Training Institute on Climate Variability and Food Security, 12 July 2002 Preliminaries Crop yields are driven by daily weather variations! Current

More information

On Sampling Errors in Empirical Orthogonal Functions

On Sampling Errors in Empirical Orthogonal Functions 3704 J O U R N A L O F C L I M A T E VOLUME 18 On Sampling Errors in Empirical Orthogonal Functions ROBERTA QUADRELLI, CHRISTOPHER S. BRETHERTON, AND JOHN M. WALLACE University of Washington, Seattle,

More information

statistical methods for tailoring seasonal climate forecasts Andrew W. Robertson, IRI

statistical methods for tailoring seasonal climate forecasts Andrew W. Robertson, IRI statistical methods for tailoring seasonal climate forecasts Andrew W. Robertson, IRI tailored seasonal forecasts why do we make probabilistic forecasts? to reduce our uncertainty about the (unknown) future

More information

Extending logistic regression to provide full-probability-distribution MOS forecasts

Extending logistic regression to provide full-probability-distribution MOS forecasts METEOROLOGICAL APPLICATIONS Meteorol. Appl. 16: 361 368 (29) Published online 9 March 29 in Wiley InterScience (www.interscience.wiley.com).134 Extending logistic regression to provide full-probability-distribution

More information

Calibration of extreme temperature forecasts of MOS_EPS model over Romania with the Bayesian Model Averaging

Calibration of extreme temperature forecasts of MOS_EPS model over Romania with the Bayesian Model Averaging Volume 11 Issues 1-2 2014 Calibration of extreme temperature forecasts of MOS_EPS model over Romania with the Bayesian Model Averaging Mihaela-Silvana NEACSU National Meteorological Administration, Bucharest

More information

VERFICATION OF OCEAN WAVE ENSEMBLE FORECAST AT NCEP 1. Degui Cao, H.S. Chen and Hendrik Tolman

VERFICATION OF OCEAN WAVE ENSEMBLE FORECAST AT NCEP 1. Degui Cao, H.S. Chen and Hendrik Tolman VERFICATION OF OCEAN WAVE ENSEMBLE FORECAST AT NCEP Degui Cao, H.S. Chen and Hendrik Tolman NOAA /National Centers for Environmental Prediction Environmental Modeling Center Marine Modeling and Analysis

More information

Seasonal forecasts presented by:

Seasonal forecasts presented by: Seasonal forecasts presented by: Latest Update: 10 November 2018 The seasonal forecasts presented here by Seasonal Forecast Worx are based on forecast output of the coupled ocean-atmosphere models administered

More information

Application and verification of ECMWF products in Croatia - July 2007

Application and verification of ECMWF products in Croatia - July 2007 Application and verification of ECMWF products in Croatia - July 2007 By Lovro Kalin, Zoran Vakula and Josip Juras (Hydrological and Meteorological Service) 1. Summary of major highlights At Croatian Met

More information

COMPARING NWS POP FORECASTS TO THIRD-PARTY PROVIDERS

COMPARING NWS POP FORECASTS TO THIRD-PARTY PROVIDERS COMPARING NWS POP FORECASTS TO THIRD-PARTY PROVIDERS P.3 J. Eric Bickel (The University of Texas at Austin)*, Eric Floehr (Intellovations, LLC), and Seong Dae Kim (University of Alaska Anchorage) 1. INTRODUCTION

More information

Prediction of Snow Water Equivalent in the Snake River Basin

Prediction of Snow Water Equivalent in the Snake River Basin Hobbs et al. Seasonal Forecasting 1 Jon Hobbs Steve Guimond Nate Snook Meteorology 455 Seasonal Forecasting Prediction of Snow Water Equivalent in the Snake River Basin Abstract Mountainous regions of

More information

NOTES AND CORRESPONDENCE. A Quantitative Estimate of the Effect of Aliasing in Climatological Time Series

NOTES AND CORRESPONDENCE. A Quantitative Estimate of the Effect of Aliasing in Climatological Time Series 3987 NOTES AND CORRESPONDENCE A Quantitative Estimate of the Effect of Aliasing in Climatological Time Series ROLAND A. MADDEN National Center for Atmospheric Research,* Boulder, Colorado RICHARD H. JONES

More information

Probabilistic Weather Prediction with an Analog Ensemble

Probabilistic Weather Prediction with an Analog Ensemble 3498 M O N T H L Y W E A T H E R R E V I E W VOLUME 141 Probabilistic Weather Prediction with an Analog Ensemble LUCA DELLE MONACHE National Center for Atmospheric Research, Boulder, Colorado F. ANTHONY

More information

NIWA Outlook: October - December 2015

NIWA Outlook: October - December 2015 October December 2015 Issued: 1 October 2015 Hold mouse over links and press ctrl + left click to jump to the information you require: Overview Regional predictions for the next three months: Northland,

More information

Decomposition and graphical portrayal of the quantile score

Decomposition and graphical portrayal of the quantile score Decomposition and graphical portrayal of the quantile score Sabrina Bentzien and Petra Friederichs Meteorological Institute, University of Bonn, and Hans-Ertel-Centre for Weather Research Abstract This

More information

Adaptation for global application of calibration and downscaling methods of medium range ensemble weather forecasts

Adaptation for global application of calibration and downscaling methods of medium range ensemble weather forecasts Adaptation for global application of calibration and downscaling methods of medium range ensemble weather forecasts Nathalie Voisin Hydrology Group Seminar UW 11/18/2009 Objective Develop a medium range

More information

Figure ES1 demonstrates that along the sledging

Figure ES1 demonstrates that along the sledging UPPLEMENT AN EXCEPTIONAL SUMMER DURING THE SOUTH POLE RACE OF 1911/12 Ryan L. Fogt, Megan E. Jones, Susan Solomon, Julie M. Jones, and Chad A. Goergens This document is a supplement to An Exceptional Summer

More information

Statistical downscaling daily rainfall statistics from seasonal forecasts using canonical correlation analysis or a hidden Markov model?

Statistical downscaling daily rainfall statistics from seasonal forecasts using canonical correlation analysis or a hidden Markov model? Statistical downscaling daily rainfall statistics from seasonal forecasts using canonical correlation analysis or a hidden Markov model? Andrew W. Robertson International Research Institute for Climate

More information

Climate Forecasts and Forecast Uncertainty

Climate Forecasts and Forecast Uncertainty Climate Forecasts and Forecast Uncertainty Holly Hartmann Department of Hydrology and Water Resources University of Arizona, Tucson 520-626-8523 hollyh@hwr.arizona.edu CLIMAS-SAHRA press briefing August

More information

A comparison of ensemble post-processing methods for extreme events

A comparison of ensemble post-processing methods for extreme events QuarterlyJournalof theroyalmeteorologicalsociety Q. J. R. Meteorol. Soc. 140: 1112 1120, April 2014 DOI:10.1002/qj.2198 A comparison of ensemble post-processing methods for extreme events R. M. Williams,*

More information

A Probability Model for Verifying Deterministic Forecasts of Extreme Events

A Probability Model for Verifying Deterministic Forecasts of Extreme Events OCTOBER 2007 F E R R O 1089 A Probability Model for Verifying Deterministic Forecasts of Extreme Events CHRISTOPHER A. T. FERRO School of Engineering, Computing and Mathematics, University of Exeter, Exeter,

More information

The U. S. Winter Outlook

The U. S. Winter Outlook The 2018-2019 U. S. Winter Outlook Michael Halpert Deputy Director Climate Prediction Center Mike.Halpert@noaa.gov http://www.cpc.ncep.noaa.gov Outline About the Seasonal Outlook Review of 2017-18 U. S.

More information

EMC Probabilistic Forecast Verification for Sub-season Scales

EMC Probabilistic Forecast Verification for Sub-season Scales EMC Probabilistic Forecast Verification for Sub-season Scales Yuejian Zhu Environmental Modeling Center NCEP/NWS/NOAA Acknowledgement: Wei Li, Hong Guan and Eric Sinsky Present for the DTC Test Plan and

More information

Application and verification of the ECMWF products Report 2007

Application and verification of the ECMWF products Report 2007 Application and verification of the ECMWF products Report 2007 National Meteorological Administration Romania 1. Summary of major highlights The medium range forecast activity within the National Meteorological

More information

SHORT COMMUNICATION EXPLORING THE RELATIONSHIP BETWEEN THE NORTH ATLANTIC OSCILLATION AND RAINFALL PATTERNS IN BARBADOS

SHORT COMMUNICATION EXPLORING THE RELATIONSHIP BETWEEN THE NORTH ATLANTIC OSCILLATION AND RAINFALL PATTERNS IN BARBADOS INTERNATIONAL JOURNAL OF CLIMATOLOGY Int. J. Climatol. 6: 89 87 (6) Published online in Wiley InterScience (www.interscience.wiley.com). DOI:./joc. SHORT COMMUNICATION EXPLORING THE RELATIONSHIP BETWEEN

More information

Basic Verification Concepts

Basic Verification Concepts Basic Verification Concepts Barbara Brown National Center for Atmospheric Research Boulder Colorado USA bgb@ucar.edu May 2017 Berlin, Germany Basic concepts - outline What is verification? Why verify?

More information

FORECASTING poor air quality events associated with

FORECASTING poor air quality events associated with A Comparison of Bayesian and Conditional Density Models in Probabilistic Ozone Forecasting Song Cai, William W. Hsieh, and Alex J. Cannon Member, INNS Abstract Probabilistic models were developed to provide

More information

Statistical Analysis of Climatological Data to Characterize Erosion Potential: 2. Precipitation Events in Eastern Oregon/Washington

Statistical Analysis of Climatological Data to Characterize Erosion Potential: 2. Precipitation Events in Eastern Oregon/Washington E jiyu), Statistical Analysis of Climatological Data to Characterize Erosion Potential:. Precipitation Events in Eastern Oregon/Washington Special Report September Agricultural Experiment Station Oregon

More information

Standardized Anomaly Model Output Statistics Over Complex Terrain.

Standardized Anomaly Model Output Statistics Over Complex Terrain. Standardized Anomaly Model Output Statistics Over Complex Terrain Reto.Stauffer@uibk.ac.at Outline statistical ensemble postprocessing introduction to SAMOS new snow amount forecasts in Tyrol sub-seasonal

More information

LINEAR ALGEBRA - CHAPTER 1: VECTORS

LINEAR ALGEBRA - CHAPTER 1: VECTORS LINEAR ALGEBRA - CHAPTER 1: VECTORS A game to introduce Linear Algebra In measurement, there are many quantities whose description entirely rely on magnitude, i.e., length, area, volume, mass and temperature.

More information

Towards Operational Probabilistic Precipitation Forecast

Towards Operational Probabilistic Precipitation Forecast 5 Working Group on Verification and Case Studies 56 Towards Operational Probabilistic Precipitation Forecast Marco Turco, Massimo Milelli ARPA Piemonte, Via Pio VII 9, I-10135 Torino, Italy 1 Aim of the

More information

Three-Dimensional Coordinate Systems. Three-Dimensional Coordinate Systems. Three-Dimensional Coordinate Systems. Three-Dimensional Coordinate Systems

Three-Dimensional Coordinate Systems. Three-Dimensional Coordinate Systems. Three-Dimensional Coordinate Systems. Three-Dimensional Coordinate Systems To locate a point in a plane, two numbers are necessary. We know that any point in the plane can be represented as an ordered pair (a, b) of real numbers, where a is the x-coordinate and b is the y-coordinate.

More information

BUREAU OF METEOROLOGY

BUREAU OF METEOROLOGY BUREAU OF METEOROLOGY Building an Operational National Seasonal Streamflow Forecasting Service for Australia progress to-date and future plans Dr Narendra Kumar Tuteja Manager Extended Hydrological Prediction

More information

Did we see the 2011 summer heat wave coming?

Did we see the 2011 summer heat wave coming? GEOPHYSICAL RESEARCH LETTERS, VOL. 39,, doi:10.1029/2012gl051383, 2012 Did we see the 2011 summer heat wave coming? Lifeng Luo 1 and Yan Zhang 2 Received 16 February 2012; revised 15 March 2012; accepted

More information

JMA s Seasonal Prediction of South Asian Climate for Summer 2018

JMA s Seasonal Prediction of South Asian Climate for Summer 2018 JMA s Seasonal Prediction of South Asian Climate for Summer 2018 Atsushi Minami Tokyo Climate Center (TCC) Japan Meteorological Agency (JMA) Contents Outline of JMA s Seasonal Ensemble Prediction System

More information

FORECAST VERIFICATION OF EXTREMES: USE OF EXTREME VALUE THEORY

FORECAST VERIFICATION OF EXTREMES: USE OF EXTREME VALUE THEORY 1 FORECAST VERIFICATION OF EXTREMES: USE OF EXTREME VALUE THEORY Rick Katz Institute for Study of Society and Environment National Center for Atmospheric Research Boulder, CO USA Email: rwk@ucar.edu Web

More information

NIWA Outlook: April June 2019

NIWA Outlook: April June 2019 April June 2019 Issued: 28 March 2019 Hold mouse over links and press ctrl + left click to jump to the information you require: Outlook Summary Regional predictions for the next three months Northland,

More information

Overview of Verification Methods

Overview of Verification Methods Overview of Verification Methods Joint Working Group on Forecast Verification Research (JWGFVR) Greg Smith on behalf of Barbara Casati, ECCC Existing Verification Techniques Traditional (point-by-point)

More information

Year 10 Mathematics Semester 2 Bivariate Data Chapter 13

Year 10 Mathematics Semester 2 Bivariate Data Chapter 13 Year 10 Mathematics Semester 2 Bivariate Data Chapter 13 Why learn this? Observations of two or more variables are often recorded, for example, the heights and weights of individuals. Studying the data

More information

Application and verification of ECMWF products 2015

Application and verification of ECMWF products 2015 Application and verification of ECMWF products 2015 Hungarian Meteorological Service 1. Summary of major highlights The objective verification of ECMWF forecasts have been continued on all the time ranges

More information

Comparing NWS PoP Forecasts to Third-Party Providers

Comparing NWS PoP Forecasts to Third-Party Providers 3304 M O N T H L Y W E A T H E R R E V I E W VOLUME 139 Comparing NWS PoP Forecasts to Third-Party Providers J. ERIC BICKEL The University of Texas at Austin, Austin, Texas ERIC FLOEHR Intellovations,

More information

Use of extended range and seasonal forecasts at MeteoSwiss

Use of extended range and seasonal forecasts at MeteoSwiss Federal Department of Home Affairs FDHA Federal Office of Meteorology and Climatology MeteoSwiss Use of extended range and seasonal forecasts at MeteoSwiss Current use and ongoing developments Christoph

More information

Week Topics of study Home/Independent Learning Assessment (If in addition to homework) 7 th September 2015

Week Topics of study Home/Independent Learning Assessment (If in addition to homework) 7 th September 2015 Week Topics of study Home/Independent Learning Assessment (If in addition to homework) 7 th September Functions: define the terms range and domain (PLC 1A) and identify the range and domain of given functions

More information

COMPOSITE-BASED VERIFICATION OF PRECIPITATION FORECASTS FROM A MESOSCALE MODEL

COMPOSITE-BASED VERIFICATION OF PRECIPITATION FORECASTS FROM A MESOSCALE MODEL J13.5 COMPOSITE-BASED VERIFICATION OF PRECIPITATION FORECASTS FROM A MESOSCALE MODEL Jason E. Nachamkin, Sue Chen, and Jerome M. Schmidt Naval Research Laboratory, Monterey, CA 1. INTRODUCTION Mesoscale

More information

4.3. David E. Rudack*, Meteorological Development Laboratory Office of Science and Technology National Weather Service, NOAA 1.

4.3. David E. Rudack*, Meteorological Development Laboratory Office of Science and Technology National Weather Service, NOAA 1. 43 RESULTS OF SENSITIVITY TESTING OF MOS WIND SPEED AND DIRECTION GUIDANCE USING VARIOUS SAMPLE SIZES FROM THE GLOBAL ENSEMBLE FORECAST SYSTEM (GEFS) RE- FORECASTS David E Rudack*, Meteorological Development

More information

Categorical Verification

Categorical Verification Forecast M H F Observation Categorical Verification Tina Kalb Contributions from Tara Jensen, Matt Pocernich, Eric Gilleland, Tressa Fowler, Barbara Brown and others Finley Tornado Data (1884) Forecast

More information

Learning Objectives for Stat 225

Learning Objectives for Stat 225 Learning Objectives for Stat 225 08/20/12 Introduction to Probability: Get some general ideas about probability, and learn how to use sample space to compute the probability of a specific event. Set Theory:

More information

Application and verification of ECMWF products 2010

Application and verification of ECMWF products 2010 Application and verification of ECMWF products Hydrological and meteorological service of Croatia (DHMZ) Lovro Kalin. Summary of major highlights At DHMZ, ECMWF products are regarded as the major source

More information

Guidance on Verification of Operational Seasonal Climate Forecasts

Guidance on Verification of Operational Seasonal Climate Forecasts WORLD METEOROLOGICAL ORGANIZATION Guidance on Verification of Operational Seasonal Climate Forecasts Simon J. Mason Final revision: 4 March 206 Prepared under the auspices of World Meteorological Organization,

More information

Reprinted from MONTHLY WEATHER REVIEW, Vol. 109, No. 12, December 1981 American Meteorological Society Printed in I'. S. A.

Reprinted from MONTHLY WEATHER REVIEW, Vol. 109, No. 12, December 1981 American Meteorological Society Printed in I'. S. A. Reprinted from MONTHLY WEATHER REVIEW, Vol. 109, No. 12, December 1981 American Meteorological Society Printed in I'. S. A. Fitting Daily Precipitation Amounts Using the S B Distribution LLOYD W. SWIFT,

More information

Investigation of the January Thaw in the Blue Hill Temperature Record

Investigation of the January Thaw in the Blue Hill Temperature Record Investigation of the January Thaw in the Blue Hill Temperature Record Michael J. Iacono Blue Hill Meteorological Observatory January 2010 One of the most familiar and widely studied features of winter

More information

NOTES AND CORRESPONDENCE. On the Seasonality of the Hadley Cell

NOTES AND CORRESPONDENCE. On the Seasonality of the Hadley Cell 1522 JOURNAL OF THE ATMOSPHERIC SCIENCES VOLUME 60 NOTES AND CORRESPONDENCE On the Seasonality of the Hadley Cell IOANA M. DIMA AND JOHN M. WALLACE Department of Atmospheric Sciences, University of Washington,

More information