Selection of Best Fit Probability Distribution for Flood Frequency Analysis in South West Western Australia

Similar documents
A New Probabilistic Rational Method for design flood estimation in ungauged catchments for the State of New South Wales in Australia

Regional Flood Estimation for NSW: Comparison of Quantile Regression and Parameter Regression Techniques

for explaining hydrological losses in South Australian catchments by S. H. P. W. Gamage The Cryosphere

Development of a regional flood frequency estimation model for Pilbara, Australia

Design Flood Estimation in Ungauged Catchments: Quantile Regression Technique And Probabilistic Rational Method Compared

NEW REGIONAL FLOOD FREQUENCY ESTIMATION (RFFE) METHOD FOR THE WHOLE OF AUSTRALIA: OVERVIEW OF PROGRESS

New design rainfalls. Janice Green, Project Director IFD Revision Project, Bureau of Meteorology

Regional Flood Modelling in the New Australian Rainfall and Runoff

BUREAU OF METEOROLOGY

STUDY OF FIXED-DURATION DESIGN RAINFALLS IN AUSTRALIAN RAINFALL RUNOFF AND JOINT PROBABILITY BASED DESIGN RAINFALLS

IT S TIME FOR AN UPDATE EXTREME WAVES AND DIRECTIONAL DISTRIBUTIONS ALONG THE NEW SOUTH WALES COASTLINE

NRC Workshop - Probabilistic Flood Hazard Assessment Jan 2013

Secular variation in rainfall and intensity-frequency-duration curves in Eastern Australia

Selecting parameter values for the AWBM daily rainfall-runoff model for use on ungauged catchments by Walter Boughton 1

Australian Rainfall & Runoff

Stochastic Hydrology. a) Data Mining for Evolution of Association Rules for Droughts and Floods in India using Climate Inputs

A PROBABILITY DISTRIBUTION OF ANNUAL MAXIMUM DAILY RAINFALL OF TEN METEOROLOGICAL STATIONS OF EVROS RIVER TRANSBOUNDARY BASIN, GREECE

Regional Flood Frequency Analysis in the Range of Small to Large Floods: Development and Testing of Bayesian Regression-based Approaches

CATCHMENT DESCRIPTION. Little River Catchment Management Plan Stage I Report Climate 4.0

How Significant is the BIAS in Low Flow Quantiles Estimated by L- and LH-Moments?

New Intensity-Frequency- Duration (IFD) Design Rainfalls Estimates

Results of Intensity-Duration- Frequency Analysis for Precipitation and Runoff under Changing Climate

INDIAN INSTITUTE OF SCIENCE STOCHASTIC HYDROLOGY. Lecture -27 Course Instructor : Prof. P. P. MUJUMDAR Department of Civil Engg., IISc.

Best Fit Probability Distributions for Monthly Radiosonde Weather Data

Impact of climate change on Australian flood risk: A review of recent evidence

Probability distributions for explaining hydrological losses in South Australian catchments

Reliability of Daily and Annual Stochastic Rainfall Data Generated from Different Data Lengths and Data Characteristics

RADAR Rainfall Calibration of Flood Models The Future for Catchment Hydrology? A Case Study of the Stanley River catchment in Moreton Bay, Qld

South West Queensland Region

The South Eastern Australian Climate Initiative

High resolution rainfall projections for the Greater Sydney Region

Keywords: rainfall variation, extreme rainfall, intensity frequency duration, Eastern Australia

ENGINEERING HYDROLOGY

Global Climate Change, Weather, and Disasters

AFAC 2006 page 536. Climate-Change Impacts on fire-weather in SE Australia Kevin Hennessy, Principal Research Scientist, CSIRO

NRC Workshop Probabilistic Flood Hazard Assessment (PFHA) Jan 29-31, Mel Schaefer Ph.D. P.E. MGS Engineering Consultants, Inc.

PLANNED UPGRADE OF NIWA S HIGH INTENSITY RAINFALL DESIGN SYSTEM (HIRDS)

Chapter 5 Identifying hydrological persistence

University of Wollongong. Research Online

International Journal of World Research, Vol - 1, Issue - XVI, April 2015 Print ISSN: X

Flood Inundation Analysis by Using RRI Model For Chindwin River Basin, Myanmar

Technology Madras, Chennai

A DAILY RAINFALL GENERATING MODEL FOR WATER YIELD AND FLOOD STUDIES

Hydrological forecasting and decision making in Australia

MONTE CARLO SIMULATION RISK ANALYSIS AND ITS APPLICATION IN MODELLING THE INCLEMENT WEATHER FOR PROGRAMMING CIVIL ENGINEERING PROJECTS

On the use of radar rainfall estimates and nowcasts in an operational heavy rainfall warning service

Quantifying Weather Risk Analysis

Background and History

FLORA: FLood estimation and forecast in complex Orographic areas for Risk mitigation in the Alpine space

A Comparison of Rainfall Estimation Techniques

LQ-Moments for Statistical Analysis of Extreme Events

The Goodness-of-fit Test for Gumbel Distribution: A Comparative Study

Chapter 6 Problems with the calibration of Gaussian HMMs to annual rainfall

An Estimation of Probable Maximum Precipitation for Trinidad

Interdecadal variation in rainfall patterns in South West of Western Australia

Effect of land use/land cover changes on runoff in a river basin: a case study

Interrelationship between Indian Ocean Dipole (IOD) and Australian Tropical Cyclones

Extreme Rain all Frequency Analysis for Louisiana

Regional climate-change downscaling for hydrological applications using a nonhomogeneous hidden Markov model

Bivariate Rainfall and Runoff Analysis Using Entropy and Copula Theories

CONCEPTUAL AND TECHNICAL CHALLENGES IN DEFINING FLOOD PLANNING AREAS IN URBAN CATCHMENTS

(Towards) using subseasonal-to-seasonal (S2S) extreme rainfall forecasts for extendedrange flood prediction in Australia

Estimation Of SIMHYD Parameter Values For Application In Ungauged Catchments

Regionalization for one to seven day design rainfall estimation in South Africa

Australian Rainfall & Runoff

INDIAN INSTITUTE OF SCIENCE STOCHASTIC HYDROLOGY. Course Instructor : Prof. P. P. MUJUMDAR Department of Civil Engg., IISc.

From Climate Science to Climate Services

Design Storms for Hydrologic Analysis

Floodplain Risk Management in Dungog after the April 2015 Super Storm. Presentation Outline

Fire Weather Drivers, Seasonal Outlook and Climate Change. Steven McGibbony, Severe Weather Manager Victoria Region Friday 9 October 2015

A review: regional frequency analysis of annual maximum rainfall in monsoon region of Pakistan using L-moments

Flash Flood Guidance System On-going Enhancements

investment decisions

Regionalization Approach for Extreme Flood Analysis Using L-moments

Monte Carlo Simulations for Probabilistic Flood Hazard Assessment

Long-term streamflow projections: A summary of recent results from the Millennium Drought

High intensity rainfall estimation in New Zealand

New approach to calibration of the AWBM for use on ungauged catchments

Modeling Rainfall Intensity Duration Frequency (R-IDF) Relationship for Seven Divisions of Bangladesh

CURRENT AND FUTURE TROPICAL CYCLONE RISK IN THE SOUTH PACIFIC

Analysis and mapping of spatio-temporal hydrologic data: HYDROSTOCHASTICS

1990 Intergovernmental Panel on Climate Change Impacts Assessment

Rainfall Analysis. Prof. M.M.M. Najim

ANALYSIS OF RAINFALL DATA FROM EASTERN IRAN ABSTRACT

Review of existing statistical methods for flood frequency estimation in Greece

The elevations on the interior plateau generally vary between 300 and 650 meters with

Short-term Stream Flow Forecasting at Australian River Sites using Data-driven Regression Techniques

TABLE OF CONTENTS. 3.1 Synoptic Patterns Precipitation and Topography Precipitation Regionalization... 11

Extreme value analysis for gridded data

8 Current Issues and Research on Sediment Movement in the River Catchments of Japan

The Analysis of Uncertainty of Climate Change by Means of SDSM Model Case Study: Kermanshah

Badger Flood Situation Report. Robert Picco Amir Ali Khan Ken Rollings. Department of Environment Water Resources Management Division

MODEL FOR DISTRIBUTION OF WAGES

peak half-hourly New South Wales

Aurora Bell*, Alan Seed, Ross Bunn, Bureau of Meteorology, Melbourne, Australia

20. EXTREME RAINFALL (R20MM, RX5DAY) IN YANGTZE HUAI, CHINA, IN JUNE JULY 2016: THE ROLE OF ENSO AND ANTHROPOGENIC CLIMATE CHANGE

Predicting present and future coastal flood risk in North East Wales

Hydrological extremes. Hydrology Flood Estimation Methods Autumn Semester

RISK ASSESSMENT COMMUNITY PROFILE NATURAL HAZARDS COMMUNITY RISK PROFILES. Page 13 of 524

EL NINO-SOUTHERN OSCILLATION (ENSO): RECENT EVOLUTION AND POSSIBILITIES FOR LONG RANGE FLOW FORECASTING IN THE BRAHMAPUTRA-JAMUNA RIVER

Transcription:

Abstract Selection of Best Fit Probability Distribution for Flood Frequency Analysis in South West Western Australia Benjamin P 1 and Ataur Rahman 2 1 Student, Western Sydney University, NSW, Australia 2 Associate Professor, Western Sydney University, NSW, Australia Corresponding author s E-mail: 17882910@student.westernsydney.edu.au Design floods are needed to design safe and adequate infrastructure such as bridges, levees and dams. The estimation of design floods is generally performed by at-site flood frequency analysis (FFA) when adequate historical streamflow data is available at the site of interest. The 3 rd edition of Australian Rainfall & Runoff (ARR) 1987 recommended the use of the log-pearson type 3 (LP3) distribution to estimate design floods for all Australian catchments including South West Western Australia. The 4 th edition of ARR no longer prescribes a particular probability distribution for FFA in Australia. In this study six different probability distributions are used to assess which probability distribution could be recommended when performing FFA for catchments located in South West Western Australia. The distributions that are assessed in this study are: normal distribution, log-normal (LN) distribution, LP3 distribution, generalised extreme value (GEV) distribution, extreme value type 1 (EV1) distribution and generalized Pareto (GP) distribution. Use of the software package TUFLOW Flike, recommended in the 4 th edition of ARR, is made to fit the LN, LP3, GEV, EV1 & GP distributions, while the R software package is used to fit the normal distribution. Goodness of fit testing is performed to assist in the ranking of the various distributions for the selected catchments. From this study of the six distributions on 103 catchments located in South West Western Australia we find that the GEV distribution provides the best estimation of design floods from FFA. However, some catchments in South West Western Australia do not follow the GEV distribution where other distribution should be applied. This study highlights the difficulties in recommending a single probability distribution for FFA in a given region. Keywords: Floods, LP3 distribution, GEV distribution, flood frequency analysis, ARR 1. INTRODUCTION Floods are one of the most devastating natural disasters which may cause significant damage to property and crop, and loss of life. For example, 2010-11 Brisbane flood caused economic damage over $10 billion and loss of over 30 human lives. Estimation of design flood is one of the most researched topics in hydrology. One of the most effective methods of design flood estimation is through the application of flood frequency analysis (FFA). Haddad & Rahman (2010) noted that FFA is one of the most challenging topics in hydrology as there are many probability distributions and parameter estimation methods to choose for a given application. FFA assumes that the observed flood discharges can be described by a selected probability distribution (Merz & Thieken, 2009). The aim of FFA is to estimate the flood discharge for a given annual exceedance probability (AEP) through the application of a selected probability distribution to a given flood data set. A design flood is defined as the peak flood discharge for an associated AEP (Caballero & Rahman, 2014). Wang (2015) noted that FFA is one of the most efficient methods of design flood estimation when there is adequate flood data available at the location of interest. Schendel & Thongwichian (2015) noted that estimated design floods may not perfectly match the recorded flood data due to uncertainty introduced by short data length and hence understanding of inherent uncertainty in FFA is important (Mirzaei et al., 2015; Yan & Moradkhani, 2015). 1 st International Conference on Water and Environmental Engineering, 20-22 Nov 2017, Sydney, Australia 108

TUFLOW Flike is a software that has been specifically developed for FFA in Australia as a part of ARR (Kuczera and Franks, 2016) and has been adopted in this study. Determination of the probability distribution that best suits a given dataset is important so that informed decisions can be made as to the most appropriate probability distribution to apply. This is commonly performed through the application of goodness of fit (GoF) tests. A GoF test describes how well a probability distribution fits a given flood dataset through analytical and graphical methods (Heo et al., 2013). 2. STUDY AREA This study focuses on South West (SW) of Western Australia (WA) located within Drainage Division 6 (BOM, 2012). The study area is presented in Figure 1. Drainage Division 6 has an area of approximately 326,000 km 2, which is bordered by the Indian Ocean to the West, the Great Australian Bight to the South, the Pilbara-Gascoyne region to the North and the South-Western Plateau region to the East. Drainage Division 6 is generally considered to have a flat topography, with a maximum elevation of 1000 m above the mean sea level. Drainage Division 6 is generally found to be made up of sandy soil and dune systems along the coastal regions. The climate of this region is considered to be temperate with warm summers and cool winters. A total of 103 gauged catchments from SW WA are used in this study. These data were collated as a part of ARR Revision Project 5 Regional flood methods (Rahman et al., 2016). Figure 1 Drainage Division 6 in South West of Western Australia 3. SELECTION OF CANDIDATE PROBABILITY DISTRIBUTIONS The principal aim of FFA is determination of the relationship between flood magnitude and the return period (Cunnane, 1978; Cunnane, 1985). The probability distributions that are applied in this study are as follows; normal (Norm), log-normal (LN), log-pearson type 3 (LP3), generalised extreme value (GEV), extreme value type 1 (EV1) and generalised Pareto (GP) distributions. The non-parametric method (Bardsley, 1989) is also used in this study to visualise the goodness-of-fit for a given distribution. Cunnane s plotting position formula is used here, which can be described by: T = (n+0.2)/(m-0.4) (1) where T is the return period in years, n is the sample size and m is the rank of the data. 1 st International Conference on Water and Environmental Engineering, 20-22 Nov 2017, Sydney, Australia 109

4. GOODNESS OF FIT TESTING In this study, the Easyfit software is adopted to assess the goodness of fit (GoF). Three GoF tests are built into this software; the Kolmogorov-Smirnov (KS) test, Anderson-Darling (AD) test and the Chisquared (Chi) test. All the three tests are applied to the AM flood data to select the best fit probability distribution for each of the selected catchments. The Kolmogorov-Smirnov (KS) test is based on the largest difference between the sample cumulative distribution function and the hypothesized cumulative distribution function. The sample cumulative distribution function is described as: F n (x) = 1. (Number of observation x) (2) n where n is the sample size and x is a random sample of the dataset being assessed. The AD test compares the fit of an observed cumulative distribution function to that of an expected cumulative distribution function. This test gives more weight to the extreme data values. To perform an AD test the Anderson-Darling statistic (A 2 ) is determined as below: A 2 = n 1 n (2i 1) [lnf(x n i=1 n) + (ln(1 F(X n 1+i )))] (3) where n is the sample size and X n is a sample from the dataset to be assessed. The Chi-squared test is a simple method to determine if a sample comes from a population that belongs to a specific distribution. This GoF test is performed on binned data, where binned data divides the entire range of the dataset into specific intervals for analysis. Each of these bins must contain a minimum of 5 data points for the Chi-squared test to be valid. After the data has been binned the Chi-squared statistic (χ 2 ) is determined by the following equation: χ 2 k = (O i E i ) 2 i=1 (4) E i where O i is the observed frequency for a bin i and E i is the expected frequency for a bin i which is calculated by: E i = F(x 2 ) F(x 1 ) (5) where F is the cumulative distribution function of the probability distribution being assessed and x 1, x 2 are the limits for bin i. 5. RESULTS AND DISCUSSION Estimation of the AEP flood quantiles Q 50, Q 20, Q 10, Q 5, Q 2 & Q 1 for the 103 catchments located in SW WA was performed using the R software package for normal distribution, and TUFLOW Flike, for the log normal (LN), log-pearson type 3 (LP3), generalised extreme value (GEV), extreme value type 1 (EV1) & generalised pareto (GP) distributions. The Kolmogorov-Smirnov (KS), Anderson- Darling (AD) and Chi-squared goodness of fit (GoF) tests were applied to each catchment to determine the best fit distributions for each catchment. The probability distributions were then ranked according to the GoF test results. On review of the GoF test results, it was decided to exclude the results obtained from the Chi-squared test as it was inconsistent. Ranking of the distributions based on the GoF test results was performed through the use of a consistent ranking system. Liu (2011) describes that there are many ranking systems available but to ensure consistency it is best to maintain one ranking system for a project. For this study, the 1 st International Conference on Water and Environmental Engineering, 20-22 Nov 2017, Sydney, Australia 110

competitive ranking system has been used for the ranking of the GoF test results. The competitive ranking system ranks results numerically in descending order with ties in results being given the same ranking but any results that follow these tied results receiving a skipped ranking value. For example, in a ranking system of 5, 6, 6 and 7, the first result would receive a rank of 1, the next two results would receive a rank of 2 and the final result would receive a rank of 4, skipping rank 3. This system has been chosen due to its simplicity and that there are only 6 distributions for ranking. If more distribution were to be used then a more in-depth ranking system may be required. For the ranking scores found within this system a rank of 1 refers to the best fitting distribution result and a rank of 6 refers to the worst fitting distribution result. The overall ranking for the six probability distributions for the 103 catchments is summarised in Table 1. From these rankings, it can be seen that the GEV provides the overall best choice of probability distribution for FFA in the SW of WA. This is followed by the LP3, the LN, the GP, the EV1 and normal distribution. Table 1. Overall ranking of probability distributions Distribution Average Ranking Number of Catchments to Achieve Rank 1 with Associated Distribution GEV 1.7 63 LP3 2.4 33 LN 3.1 7 GP 3.71 16 EV1 3.73 9 Norm 5.3 1 From the results presented in Table 1 it can be seen that GEV is the best fit distribution for 63 out of the 103 sites, i.e. for 61% of the sites. This finding is consistent with Vogel, McMahon & Chiew (1993) who found (from their study of 7 catchments located in Drainage Division 6) that the GEV was the best-fit distribution. In sub-region 2 (Albany coast) we find that the LP3 distribution performs the best with a rank of 1 for the three sites, and GEV is ranked 2 nd in this sub-region. Sub-regions 6 (Shannon River), 11 (Preston River) and 13 (Harvey River) have GEV as the rank 1 distribution. Subregions 6 and 11 have EV1 as the 2 nd ranking distribution, and sub-region 13 has LP3 as the 2 nd ranking distribution. We can see from Table 1 that the GP distribution is the best fit distribution for 16 sites, and all of these are located along the South West coastline of Drainage Division 6. Only one site shows normal distribution as the best-fit distribution, which is as expected as normal distribution is generally a poor descriptor of annual maximum flood data. 6. CONCLUSION A study of the selection of the best fit probability distributions for flood frequency analysis is performed for South West of Western Australia (Drainage Division 6). For this study, the observed annual maximum (AM) streamflow data are analysed through the use of the R software and Tuflow FLIKE to estimate flood quantiles. A total of six probability distributions are examined: normal (Norm), log-normal (LN), log-pearson type 3 (LP3), generalised extreme value (GEV), extreme value type 1 (EV1) and generalised Pareto (GP) distributions. The Kolmogorov-Smirnov (KS) and Anderson-Darling (AD) goodness of fit (GoF) tests are applied at each of the study catchments to assess the goodness of fit of the selected distributions. It is found that the GEV distribution provides the best outcome where 63 out of the 103 sites, i.e. for 61% of the sites have GEV as the best-fit distribution. The second best performer is the LP3 distribution, which is the best-fit distribution for 33 sites (i.e. for 32% of the sites). The results of this study should be used with caution as there are 39% of the catchments where GEV is not the best choice in South West of Western Australia and hence an alternative distribution like LP3 may provide more accurate design flood estimates. 1 st International Conference on Water and Environmental Engineering, 20-22 Nov 2017, Sydney, Australia 111

REFERENCES Bardsley WE (1989). Using historical data in nonparametric flood estimation, Journal of Hydrology, 108, 249-255. Bureau of Meteorology (BOM) (2012). Australian Water Resources Assessment 2012, <http://www.bom.gov.au/water/awra/2012/swcoast.shtml>. Cunnane C (1978). Unbiased plotting position - a review, Journal of Hydrology, 37, 205-222. Cunnane C (1985). Factors affecting choice of distribution for flood series, Hydrological Sciences Journal, 30, 25-36. Caballero WL, Rahman A (2014). Application of Monte Carlo simulation technique for flood estimation for two catchments in New South Wales, Australia, Natural Hazards, 74, 1475-1488. Haddad K, Rahman A (2010). Selection of the best fit flood frequency distribution and parameter estimation procedure: a case study for Tasmania in Australia, Stochastic Environmental Research and Risk Assessment, 25, 415-428. Heo J-H, Shin H, Nama W, Oma J, Jeong C (2013). Approximation of modified Anderson Darling test statistics for extreme value distributions with unknown shape parameter, Journal of Hydrology, 499, 41-49. Institution of Engineers Australia (I. E. Aust.) (1987). Australian Rainfall and Runoff: A Guide to Flood Estimation, Editor: Pilgrim, D.H., Engineers Australia, Canberra. Kuczera G, Franks S (2016). At-site flood frequency analysis. In: Australian Rainfall & Runoff, Chapter 2, Book 2, edited by Ball et al., Commonwealth of Australia. Liu TY (2011). Statistical Learning Theory for Ranking, Learning to Rank for Information Retrieval, Springer-Verlag, Berlin, pp. 195-200. Merz B, Thieken AH (2009). Flood risk curves and uncertainty bounds, Natural Hazards, 51, 437-458. Mirzaei M, Huang YF, El-Shafie A, Chimeh T, Lee J, Vaizadeh N, Adamowski J (2015). Uncertainty analysis for extreme flood events in a semi-arid region, Natural Hazards, 78, 1947-1960. Rahman AS, Rahman A, Zaman MA, Haddad K, Ahsan A, Imteaz M (2013). A study on selection of probability distributions for at-site flood frequency analysis in Australia, Natural Hazards, 69, 1803-1813. Rahman A, Haddad K, Kuczera G, Weinmann PE (2016). Regional flood methods. In: Australian Rainfall & Runoff, Chapter 3, Book 3, edited by Ball et al., Commonwealth of Australia. Schendel S, Thongwichian R (2015). Flood frequency analysis: confidence interval estimation by test inversion bootstrapping, Advances in Water Resources, 83, pp. 1-9, DOI 10.1016/j.advwatres.2015.05.004. Vogel RM, McMahon TA, Chiew FHS (1993). Floodflow frequency model selection in Australia, Journal of Hydrology, 146, 421-49. Wang C (2015). A joint probability approach for coincidental flood frequency analysis at ungauged basins confluences, Natural Hazards, 82, 1727-1741. Yan H, Moradkhani H (2015). Toward more robust extreme flood prediction by Bayesian hierarchical and multimodeling, Natural Hazards, 81, 203-225. 1 st International Conference on Water and Environmental Engineering, 20-22 Nov 2017, Sydney, Australia 112