Spatial discrete hazards using Hierarchical Bayesian Modeling

Similar documents
Bayesian Hierarchical Models

Markov chain Monte Carlo

Bayesian modelling. Hans-Peter Helfrich. University of Bonn. Theodor-Brinkmann-Graduate School

Matthias Schubert. Jochen Köhler. Michael H. Faber

Spatial Statistics with Image Analysis. Outline. A Statistical Approach. Johan Lindström 1. Lund October 6, 2016

Freeway rear-end collision risk for Italian freeways. An extreme value theory approach

BAYESIAN METHODS FOR VARIABLE SELECTION WITH APPLICATIONS TO HIGH-DIMENSIONAL DATA

A Full Bayes Approach to Road Safety: Hierarchical Poisson. Mixture Models, Variance Function Characterization, and. Prior Specification

Contents. Part I: Fundamentals of Bayesian Inference 1

Evaluation of Road Safety in Portugal: A Case Study Analysis. Instituto Superior Técnico

Variability within multi-component systems. Bayesian inference in probabilistic risk assessment The current state of the art

Petr Volf. Model for Difference of Two Series of Poisson-like Count Data

Modelling trends in the ocean wave climate for dimensioning of ships

Dynamic System Identification using HDMR-Bayesian Technique

A Probabilistic Framework for solving Inverse Problems. Lambros S. Katafygiotis, Ph.D.

How to Incorporate Accident Severity and Vehicle Occupancy into the Hot Spot Identification Process?

Image segmentation combining Markov Random Fields and Dirichlet Processes

Bayesian Inference. Chapter 4: Regression and Hierarchical Models

Principles of Bayesian Inference

TRB Paper # Examining the Crash Variances Estimated by the Poisson-Gamma and Conway-Maxwell-Poisson Models

Practical Bayesian Optimization of Machine Learning. Learning Algorithms

Bayesian inference J. Daunizeau

Density Estimation. Seungjin Choi

Lecture 2: From Linear Regression to Kalman Filter and Beyond

Bayesian Inference. Chapter 4: Regression and Hierarchical Models

Bayesian spatial hierarchical modeling for temperature extremes

Lecture 2 APPLICATION OF EXREME VALUE THEORY TO CLIMATE CHANGE. Rick Katz

The STS Surgeon Composite Technical Appendix

Community Health Needs Assessment through Spatial Regression Modeling

Multivariate Normal & Wishart

Markov Chain Monte Carlo

DEVELOPMENT OF TRAFFIC ACCIDENT ANALYSIS SYSTEM USING GIS

DEVELOPING DECISION SUPPORT TOOLS FOR THE IMPLEMENTATION OF BICYCLE AND PEDESTRIAN SAFETY STRATEGIES

Lecture 2: From Linear Regression to Kalman Filter and Beyond

Statistics for extreme & sparse data

Bayesian Regression Linear and Logistic Regression

Evaluating the value of structural heath monitoring with longitudinal performance indicators and hazard functions using Bayesian dynamic predictions

Statistics & Data Sciences: First Year Prelim Exam May 2018

Finite Mixture and Markov Switching Models

Part 8: GLMs and Hierarchical LMs and GLMs

Review. DS GA 1002 Statistical and Mathematical Models. Carlos Fernandez-Granda

Bayesian Analysis of Multivariate Normal Models when Dimensions are Absent

Bayesian Methods for Machine Learning

Markov Chain Monte Carlo

STAT 518 Intro Student Presentation

Bayesian Methods in Multilevel Regression

Bayesian Nonparametric Regression for Diabetes Deaths

Probabilistic Machine Learning

Report and Opinion 2016;8(6) Analysis of bivariate correlated data under the Poisson-gamma model

Longitudinal breast density as a marker of breast cancer risk

STAT 499/962 Topics in Statistics Bayesian Inference and Decision Theory Jan 2018, Handout 01

Introduction to Bayesian Inference

New Achievement in the Prediction of Highway Accidents

Models for spatial data (cont d) Types of spatial data. Types of spatial data (cont d) Hierarchical models for spatial data

Bayesian Dynamic Linear Modelling for. Complex Computer Models

Monte Carlo in Bayesian Statistics

The Bayesian Choice. Christian P. Robert. From Decision-Theoretic Foundations to Computational Implementation. Second Edition.

Quantile POD for Hit-Miss Data

Riemann Manifold Methods in Bayesian Statistics

arxiv: v1 [stat.ap] 11 Aug 2008

Use of Bayesian multivariate prediction models to optimize chromatographic methods

Harrison B. Prosper. Bari Lectures

STAT Advanced Bayesian Inference

Bayesian tsunami fragility modeling considering input data uncertainty

Bayesian decision procedures for dose escalation - a re-analysis

Summary STK 4150/9150

Indirect Clinical Evidence of Driver Inattention as a Cause of Crashes

A Bayesian Nonparametric Model for Predicting Disease Status Using Longitudinal Profiles

Parameter Estimation. William H. Jefferys University of Texas at Austin Parameter Estimation 7/26/05 1

Statistical Rock Physics

Cluster investigations using Disease mapping methods International workshop on Risk Factors for Childhood Leukemia Berlin May

Bayesian Estimation of Input Output Tables for Russia

Principles of Bayesian Inference

Coresets for Bayesian Logistic Regression

ST 740: Markov Chain Monte Carlo

Generalized common spatial factor model

Ronald Christensen. University of New Mexico. Albuquerque, New Mexico. Wesley Johnson. University of California, Irvine. Irvine, California

Spatial Analysis of Incidence Rates: A Bayesian Approach

An introduction to Bayesian inference and model comparison J. Daunizeau

Hierarchical Modeling and Analysis for Spatial Data

A short introduction to INLA and R-INLA

Risk Estimation and Uncertainty Quantification by Markov Chain Monte Carlo Methods

VCMC: Variational Consensus Monte Carlo

Gaussian kernel GARCH models

Bayesian GLMs and Metropolis-Hastings Algorithm

November 2002 STA Random Effects Selection in Linear Mixed Models

Spatial Variation in Local Road Pedestrian and Bicycle Crashes

David Giles Bayesian Econometrics

Bayesian Inference for DSGE Models. Lawrence J. Christiano

Example: Ground Motion Attenuation

Advanced Statistical Modelling

TRB Paper Hot Spot Identification by Modeling Single-Vehicle and Multi-Vehicle Crashes Separately

Outline. Clustering. Capturing Unobserved Heterogeneity in the Austrian Labor Market Using Finite Mixtures of Markov Chain Models

Linear, Generalized Linear, and Mixed-Effects Models in R. Linear and Generalized Linear Models in R Topics

Artificial Intelligence Uncertainty

Case Study in the Use of Bayesian Hierarchical Modeling and Simulation for Design and Analysis of a Clinical Trial

Expression Data Exploration: Association, Patterns, Factors & Regression Modelling

Bayesian course - problem set 5 (lecture 6)

MULTILEVEL IMPUTATION 1

Bayesian Networks BY: MOHAMAD ALSABBAGH

Transcription:

Spatial discrete hazards using Hierarchical Bayesian Modeling Mathias Graf ETH Zurich, Institute for Structural Engineering, Group Risk & Safety 1

Papers -Maes, M.A., Dann M., Sarkar S., and Midtgaard, A.K., (2007) Fatality rate modeling within a spatial network using hierarchical Bayes methods, Webpublished in the Proceedings International Forum on Engineering Decision Making, IFED2007, Port Stephens, December. - Ng, K., Hung, W. and Wong, W. (2002) Algorithm for assessing the risk of traffic accident. Journal of Safety Research, 33, pp. 387-410. 2

Content Risk analysis of a road network - Establish a model using a classical approach - Establish a model using a hierarchical Bayesian modeling approach - Using WinBUGS to establish a model and estimate model parameters - Using the model for risk analysis and decision making 3

Risk analysis of a road network - Establishing an accident event estimation model by means of potential causal factors -Identifying significant causal factors -Identifying high risk zones 4

Classical approach Example Hong Kong (Ng, K. et all) -Data merging -Data clustering -Model calibration -Black site identification 5

Data merging Combining accident data with: -Digital map -Zones -Potential causal factors -Land use information -Road information 6

Data merging -274 traffic analysis zone -Land use data 7

Data merging -274 traffic analysis zone -Accident locations 8

Data merging - 274 traffic analysis zone - Fatality locations 9

Data clustering -Internal homogenous & external heterogeneous 10

Defining model - Literature search for existing models for similar problems. - Appling physical models. - Establishing a statistical model using availible data. 11

Model calibration - Choosing model - Estimating model parameters for each zone separately ( y) PY ( y α ) ( 1! α ) Γ + αλ 1 = = y Γ 1+ αλ 1+ αλ 1 y α 1 α α 0 y λ = overdispersion parameter = number of accident events = expected mean number of accident events. 12

Model calibration - Choosing model - Estimating model parameters for each zone separately λ = ( y) PY λ x β = expected mean number of accident events exp x ( β ) ( y α ) ( 1! α ) Γ + αλ 1 = = y Γ 1+ αλ 1+ αλ 1 y α 1 = vector with indicators = vector with regression coefficients 13

Model calibration ( y) PY ( y α ) ( 1! α ) Γ + αλ 1 = = y Γ 1+ αλ 1+ αλ λ = exp x ( β ) 1 y α 1 14

Establishing a model using a hierarchical Bayesian modeling approach Example Norway costal road E18 (Maes, M.A et all) 15

Segmentation of the road - Different road characteristics - Accident data 16

Accident data Accident outcomes Random variables describing the accident outcome 17

Adding spatial reference to accident data Number of events Y observed in 5 years and empirical estimates of annual accident and fatality rates θ 18

Available Indicators - Annual average daily traffic (AADT) - Speed limit - Number of crossings - Number of lanes - Curviness - Bridges - Tunnels 19

Available Indicators For each segment 20

Defining model Y i - The occurrence of within a segment with the length L during a period with a time t is modeled i by Y Poisson θ L t ( ) i i i 21

Defining model - The occurrence of within a segment with the length L during a period with a time t is modeled i by Y Poisson θ L t -Whereby Y i ( ) i i i θ = exp( μ + ε + φ ) i i i i 22

Defining model - The occurrence of within a segment with the length L during a period with a time t is modeled i by Y Poisson θ L t -Whereby Y i μ i = meanlog-risk μi = xiβ x = 1, x,..., x = spatial covariates characterizing the segment i } } { i 1i mi { i 0 1 m ( ) i i i θ = exp( μ + ε + φ ) i i i i β = β, β,..., β = regression coefficients m = number of indicators 23

Defining model - The occurrence of within a segment with the length L during a period with a time t is modeled i by Y Poisson θ L t -Whereby Y i ( ) i i i θ = exp( μ + ε + φ ) i i i i ε ( 2 0, σ ) i N ε σ 2 ε system wide variability (nugget effect) 24

Defining model - The occurrence of within a segment with the length L during a period with a time t is modeled i by Y Poisson θ L t -Whereby Y i ( ) i i i θ = exp( μ + ε + φ ) i i i i φ i = Zero-mean random field capturing any cluster effects { φ } 2 ( 2 ) 1,..., φn σφ, rφ Multivariate normal 0, Σ σφ, rφ d ( ( 2, )) 2 ij Σ σφ rφ = σφ exp r φ = correlation lenght ij r φ d = distance between centroids of the segment i and segment j ij 25

Using WinBUGS for parameter estimation - Using doodle to scratch model - Generating code and defining model - Preparing input data - Updating the model to estimate parameters - Using WinBUGS scripts to easily perform several parameter estimations in a batch mode 26

Using WinBUGS for parameter estimation - Using doodle to scratch model 27

Using WinBUGS for parameter estimation - Generate code and defining model - Check definition of parameters in WinBUGS 28

Using WinBUGS for parameter estimation - Preparing input data and initial values WinBUGS format: Start with list( ) - Single values: n = 17 - Vectors: Y = c(0,1,2,3,3,0,3,0,5,0,4,7,2,8,1,0,1) - Matrix: - x=structure(.data=c(6.92,1.56,,3.14),.dim = c(17,8)) Using matlab to generate input files 29

Using WinBUGS for parameter estimation - Checking model and loading data 30

Using WinBUGS for parameter estimation - Defining which variables will be monitored - Updating: using Markov chain Monte Carlo (MCMC) methods to sample posterior distribution. 31

Using WinBUGS for parameter estimation - Using WinBUGS scripts to easily perform several parameter estimations in a batch mode - Sample script code: display('log') #progress is show in a log window check('model.odc') #load and check the model data('data.txt') #load data file compile(1) #compile model gen.inits() #generate initial values #define which variable will be monitored set(lambda) set(theta) 32

Using WinBUGS for parameter estimation - Using WinBUGS scripts to easily perform several parameter estimations in a batch mode beg(1000) end(1000000) update(10000) refresh(1000) stats(*) save('stats.txt') # select a subset of the stored sample for analysis. #set the number of MCMC updates #start updates and refresh screen after 1000 updates #show statistics of all monitored variables #save the statistics in a file 33

Estimated Parameters - Check influence of indicators - Use estimated parameters to estimate risk for road segments 34

Checking model Cross validation Establish model using 16 segments and test it on the 17 th segment. Estimate the prediction error of the model. 35

Difference between hierarchical and nonhierarchical approach. 36

Decision making - Estimating risk for the selected road segments - Estimating risk for new road segments - Using the risk analysis to allocate resources to improve high risk areas 37

Real-time risk analysis - Conditioning network on road conditions - Lanes closed, construction site - Including other indicators - Traffic news -Day time - Weather (forecast) - Real time monitoring system 38