THE development of real-time crash prediction models

Size: px
Start display at page:

Download "THE development of real-time crash prediction models"

Transcription

1 574 IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, VOL. 14, NO. 2, JUNE 2013 A Genetic Programming Model for Real-Time Crash Prediction on Freeways Chengcheng Xu, Wei Wang, and Pan Liu Abstract This paper aimed at evaluating the application of the genetic programming (GP) model for real-time crash prediction on freeways. Traffic, weather, and crash data used in this paper were obtained from the I-880N freeway in California, United States. The random forest (RF) technique was conducted to select the variables that affect crash risk under uncongested and congested traffic conditions. The GP model was developed for each traffic state based on the candidate variables that were selected by the RF technique. The traffic flow characteristics that contribute to crash risk were found to be quite different between congested and uncongested traffic conditions. This paper applied the receiver operating characteristic (ROC) curve to evaluate the prediction performance of the developed GP model for each traffic state. The validation results showed that the prediction performance of the GP models were satisfactory. The binary logit model was also developed for each traffic state using the same training data set. The authors compared the ROC curve of the GP model and the binary logit model for each traffic state. The GP model produced better prediction performance than did the binary logit model for each traffic state. The GP model was found to increase the crash prediction accuracy under uncongested traffic conditions by an average of 8.2% and to increase the crash prediction accuracy under congested traffic conditions by an average of 4.9%. Index Terms Binary logit model, freeway, genetic programming (GP), real-time crash prediction, traffic safety. I. INTRODUCTION THE development of real-time crash prediction models for freeways has recently received much attention from transportation professionals. One of the important practical applications of real-time crash prediction models is to identify hazardous traffic conditions that lead to crash occurrences in advanced traffic management systems (ATMSs) on freeways. In real-time crash prediction models, the likelihood of crash occurrences was related to freeway geometric characteristics and various real-time traffic flow variables such as vehicle speed, traffic occupancy, and the coefficient of the variation of Manuscript received March 15, 2012; revised July 28, 2012 and October 3, 2012; accepted October 19, Date of publication November 20, 2012; date of current version May 29, This work was supported in part by the National Key Basic Research Program of China under Grant 2012CB725400, the National High-Tech R&D Program of China under Grant 2012AA112304, the Ministry of Education of China through the Scholarship Award for Excellent Doctoral Students, and the Graduate Scientific Innovation Research Foundation of Jiangsu, China, under Grant CXZZ0164. The Associate Editor for this paper was S. C. Wong. The authors are with the Key Laboratory of Traffic Planning and Management, School of Transportation, Southeast University, Nanjing , China ( iamxcc1@gmail.com; wangwei@seu.edu.cn; liupan@seu.edu.cn). Color versions of one or more of the figures in this paper are available online at Digital Object Identifier /TITS vehicle speed. Real-time crash prediction models can be used to develop proactive traffic management strategies in ATMSs to improve traffic safety on freeways [1] [3]. Previous studies that documented the development and application of real-time crash prediction models have usually focused on the statistical regression techniques, including the conditional logit model [4] [11], the log-linear model [12], [13], the nonparametric Bayesian model [14], logistic regression [15] [19], discriminate analysis [20], and the multivariate probit model [21]. Among these models, the conditional logit model and their variations are probably the most commonly used modeling techniques. In these studies, real-time crash prediction models were developed based on case-controlled data, in which traffic data before crash occurrence were used as cases, whereas the matched traffic data under crash-free conditions were used as controls. For example, Abdel-Aty et al. applied the conditional logit model to develop a real-time crash prediction model based on the matched-case-controlled data, in which each crash case was matched with a number of noncrash cases [4]. The results demonstrated that the likelihood of crash occurrence was affected by the average occupancy at the upstream station and by speed variance at the downstream station. Zheng et al. used the conditional logit model to evaluate the impacts of the standard deviation of speed that results from the oscillating traffic conditions on the likelihood of crash occurrence based on the case-controlled data [6]. Traditional statistical regression models usually require assumptions about the distribution of data and a well-defined functional form such as a linear functional form between dependent variable and independent variables. When the basic assumptions of the traditional statistical regression models were violated, inefficient estimations and incorrect inferences would be produced [22] [24]. In response to the limitations associated with the statistical regression models, a few researchers have proposed nonparametric methods and artificial intelligence models for developing real-time freeway crash prediction models. These models include probabilistic neural networks [25], [26], Bayesian networks [27], [28], artificial neural networks [29] [31], the classification and regression tree model [32], and the support vector machine (SVM) model [33]. The major limitation associated with the aforementioned artificial intelligence models is that these models work as black boxes, which cannot directly be used to identify the relationships between crash likelihood and various traffic flow variables. Thus, most times, these models are difficult for practical implementation. The genetic programming (GP) model is a relatively new modeling technique that was proposed to solve classification and regression problems [34]. The GP model is rooted in the /$ IEEE

2 XU et al.: GENETIC PROGRAMMING MODEL FOR REAL-TIME CRASH PREDICTION ON FREEWAYS 575 Fig. 1. Location of the loop detector and weather stations along the selected I-880N freeway section. evolutionary theory and has recently been applied to classifications [35] and regression analyses [36] in transportation engineering. Compared with the traditional statistical regression models and artificial intelligence models, the GP model has two major advantages. First, the GP model can find a solution to a problem without any prespecified functional forms. The solutions of the GP model can be any functional forms describable by mathematics. In addition, the GP model could select the best functional form for the solution to the problem based on the training data. Second, in contrast to the artificial intelligence models, the GP models could remove the black box effect and make the model understandable. The output of a GP model is a readable mathematical model, which defines tangible relationships between dependent and independent variables. This allows the results to easily be applied in practical engineering applications. In addition, previous studies demonstrated that the GP model has better prediction performance over traditional modeling techniques [37] [39]. So far, no applications of the GP model for real-time freeway crash prediction have been identified. The primary objective of this paper is to investigate the applications of the GP model for real-time crash prediction on freeways. The random forest (RF) modeling technique was applied to select the contributing factors to crash risk under congested and uncongested traffic states. The RF model is a machine-learning method that consists of an ensemble of randomized classification and regression trees. It is one of the most efficient methods in evaluating variable importance. Based on the candidate variables selected by RF, the GP models were developed to identify the hazardous conditions that lead to crash occurrence under each traffic state. The prediction performance of the GP models would be compared with that of the binary logit models developed using the same training data set. In the rest of this paper, a brief description of the data used in this paper is presented, followed by the theoretical background of the RF and GP model. Then, the development and evaluation of the GP model for each traffic state are discussed. Finally, the prediction performance of the GP models is compared with to the binary logit model. II. DATA SOURCES To accomplish the research objective, traffic, weather, geometry, and crash data were obtained from a 21-mi freeway section on the I-880N freeway in the United States. As shown in Fig. 1, a total of 40 loop detectors stations and 3 weather stations are located along the selected freeway section. The average spacing between detector stations was about 0.5 mi, and the average spacing between weather stations was about 7 mi. All the three weather stations were located within 1 mi from the I-880N freeway. Crash, traffic, and weather data were collected from January 1, 2008 to December 31, 2008 and from January 1, 2010 to December 31, A total of 807 crashes were identified and used for further data analysis. The crash data reported at the selected freeway segment during the 24-month study period were obtained from the Statewide Integrated Traffic Records System (SWITRS) maintained by the California Department of Transportation (Caltrans). The actual location for each crash was identified by checking the variations in speed and occupancy near the reported location of crash. If there was no easily identifiable abrupt change in speed or occupancy, the reported crash location were used for further data analysis. The traffic data were obtained from the Highway Performance Measurement System (PeMS) maintained by Caltrans. The PeMS database provided 30-s raw loop detector data, including vehicle count, vehicle speed, and traffic occupancy. The loop detectors sometimes suffer from hardware problems and random errors, which might result in invalid traffic data. Traffic data were excluded as invalid or not usable under one or more of the following conditions: 1) the average speed was greater than 100 mi/h; 2) the average occupancy was greater than 100%; 3) the flow rate was greater than 0 vph, whereas the occupancy was equal to 0%; 4) the average speed was greater than 0 mi/h, whereas the flow rate was equal to 0 vph; and 5) the occupancy was greater than 0%, whereas the flow rate was equal to 0 vph. Traffic data were collected from the nearest upstream and downstream stations to each crash location, as shown in Fig. 1. The research team extracted 30-s raw traffic data in the time interval between 10 and 15 min prior to crash occurrence. The purpose of doing so was to identify hazardous traffic conditions ahead of the crash occurrence time to make preemptive measures possible [18], [31]. For example, if a crash occurred at 9:00 A.M., traffic data were extracted from 8:45 8:50 A.M. A similar time lag was also adopted in previous studies to develop real-time crash prediction models [26], [40]. The 30-s raw traffic data that were collected from upstream and downstream

3 576 IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, VOL. 14, NO. 2, JUNE 2013 TABLE I CANDIDATE VARIABLES FOR THE GP MODEL TABLE II SAMPLE SIZE FOR EACH TRAFFIC STATE A total of 807 crash cases and 8070 noncrash cases were included in the data set. Because the traffic flow characteristics that contribute to the crash likelihood would be different between congested traffic and uncongested traffic [5], [8], the authors separately developed the GP models for congested and uncongested traffic states. The critical occupancy is often used to classify traffic flow conditions into congested and uncongested states. Based on the visual inspection of a flowoccupancy diagram that was developed using one-month traffic data, the critical occupancy was found to be 15%. Therefore, the traffic data for crash and noncrash cases were classified into congested and uncongested traffic states based on the mean value of traffic occupancy at upstream and downstream stations. If the average occupancy was larger than 15%, the traffic data were identified as a congested traffic state; otherwise, they were identified as an uncongested traffic state. Based on this rule, the original data set was separated into two subsamples: one subsample was for the congested traffic state, and the other subsample was for the uncongested traffic state. The distributions of crash and noncrash cases under different traffic states were summarized in Table II. The subsample for each traffic state was further randomly separated into a training data set and a validation data set with a ratio of 3 : 2. The training data sets were used to develop GP models under different traffic states, and the validation data sets were used to test the prediction performance of the developed GP models. stations were further aggregated to a 5-min station level and converted into the 21 traffic flow variables, as shown in Table I. Weather conditions for each crash were extracted based on the time of the crash from the weather station that is nearest its location. Considering the sample size in each category, rain and fog were combined as adverse weather conditions. As a result, this paper considered the following two different weather conditions: 1) clear weather and 2) adverse weather. Traffic and weather data for noncrash cases were randomly selected from crash-free days. For each selected crash case in the data set, the authors randomly selected m observations of noncrash cases. Different m : 1 ratios between noncrash and crash cases were applied to develop the following GP models. The number of m was set from 1 to 10. The prediction accuracy was found to reach a maximum when m was set to be 10. III. RESEARCH METHODOLOGY A. RF for Variable Selection RF is a machine-learning method that consists of an ensemble of randomized classification and regression trees [41]. In the RF model, a predetermined number of classification and regression trees are randomly generated and finally aggregated to give one single prediction. When solving classification problems, the RF model chooses the classification with the most votes from all the trees in the forest. In the training procedure of an RF model, each classification and regression tree is developed based on a bootstrap sample that is created by randomly selecting a number of samples with replacement from the original training data set. When building a classification and regression tree, the best split at each node is searched from a randomly selected subset of the whole predictors. Currently, RF analysis is considered one of the most efficient methods in evaluating variable importance [42]. In the RF model, two measures, based on the Gini index and classification accuracy of out-of-bag (OOB) data, are usually used to evaluate the variable importance. In this paper, the measure based on

4 XU et al.: GENETIC PROGRAMMING MODEL FOR REAL-TIME CRASH PREDICTION ON FREEWAYS 577 Fig. 2. Example of a function tree in GP. the Gini index was used to evaluate the variable importance and to select the candidate variables for developing GP models. In this measure, the decrease in the Gini index at each node is calculated for the variable that is used to make the split. Then, the Gini-index-based variable importance measure for this variable is given as the average decrease in the Gini index over all trees in the forest. B. GP The GP model is an evolutionary computation method that was introduced by Koza [34]. The GP model can be used to generate mathematical models that represent approximate or exact solutions to a problem [34]. It can be considered an extension of the genetic algorithm (GA). The main difference between GP and GA is the representation of individuals. The individuals in a GA model are numbers that were coded as fixedlength binary strings, whereas the individuals in a GP model are mathematical models that were coded as function trees. Fig. 2 illustrates an example of a GP function tree. As shown in Fig. 2, the inner nodes represent mathematical functions such as + and, and the leaf nodes represent predictors and constants. The mathematical model that is represented by the function tree in Fig. 2 is f(x, Y, Z) =[(X/0.2)+(Y X)] [log(z)+ (Y/0.45)]. In a particular problem, the set of functions and predictors should be prespecified. The mathematical models in GP are generated from a prespecified set of functions and predictors. In general, GP works on a population of mathematical models (individuals) based on the evolution theory. In each generation, multiple models are stochastically selected based on their fitness and modified to form a new population of models by crossover, selection, and mutation operations. The new population of models is then used in the next iteration of the algorithm. A GP model will stop when the predetermined maximum number of generations has been produced or the predetermined fitness level has been reached for the population. Therefore, the evolution process is expected to continuously produce a better model for a problem that is intended to be solved. 1) Genetic Operation: The new models in a GP model are usually created by the following three genetic operators: 1) crossover; 2) mutation; and 3) reproduction. The reproduction operator simply selects a proportion of models and includes them into the next generation without any alterations. The crossover operator creates new or offspring models by combining information that was extracted from selected parents. Fig. 3. Fig. 4. Crossover operation in GP. Mutation operation in GP. Two parent models are randomly selected based on their fitness level, and subtrees are chosen from both parent models. Then, the crossover operator swaps the subtrees from the two parent models. Fig. 3 illustrates an example of the crossover operation. The purpose of the mutation operation is to introduce new information into the population and avoid the premature convergence of a GP model. In mutation, a single parent is randomly selected based on its fitness level. A random subtree on the parent model is selected and replaced with a new random tree created from the prespecified set of predictors and functions. This process is illustrated in Fig. 4. A new model is created by replacing the subtree on the left tree with a new randomly generated subtree. 2) Fitness Function: One of the most important components of a GP model is the fitness function, which determines how well a model in the population can solve the problem. The fitness function greatly varies across different types of problems. The fitness function is developed based on the error between the values predicted by the model and the actual data. For example, when a GP model is developed to set the time of a clock, the fitness function could be the summation of time that the clock is wrong.

5 578 IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, VOL. 14, NO. 2, JUNE 2013 The most commonly used fitness functions for classification problems include the number of hits, sensitivity or specificity, relative square error, and mean square error. In this paper, a fitness function for real-time freeway crash prediction was developed based on the number of hits and square errors. Assuming a data set S = {(y 1,x 1 ), (y 2,x 2 ),...,(y n,x n )} of data (x i ) for crash (y i = 1) and noncrash (y i = 0), the functional form of the fitness function is expressed as follows: F (B j )= n i=1 ( β y i y i C (B j (x i )) +(y i B j (x i )) 2) (1) where F (B j ) denotes the fitness of the jth model B j in the population, B j (x i ) is the value calculated by the jth model B j in the population, and C(B j (x i )) represents the function that converts the value calculated by the model B j into 1 or 0. The functional form of this C(B j (x i )) is expressed as follows: C (B j (x i )) = { 1 ifbj (x i ) >c 0 ifb j (x i ) <c (2) where c is the cut value. Note that the selection of cut value c in (2) does not change the prediction performance of the evolved models, because the GP model has the ability to compensate for any change in the cut value c in (2) by corresponding changes in the generated models. Previous studies also confirmed that the cut value in the fitness function does not change the distribution of types I and II errors of the evolved GP model [38], [39]. Therefore, the cut value in (2) was arbitrarily set to 0.5 in the following analysis. Because the number of noncrash cases is much greater than that of crash cases in the training data set, the GP model might ignore the information from crash cases and classify all the observations as noncrash cases to improve the overall classification accuracy. To account for this problem, the weighting factor β was introduced in the fitness function. The weighting factor β was set to the ratio between the number of noncrash and crash cases in each training data set. As shown in (1), β y i is equal to 1 if y i = 0 (noncrash), and β when if y i = 1 (crash). Hence, in (1), correctly classifying a crash case will contribute more to the fitness than correctly classifying a noncrash case. The fitness function based on the number of hits only is not sensitive to the marginal improvements in deviations from the target. Therefore, the square error was introduced in the fitness function to measure how closely the evolved outputs and the target outputs in the training data match up. 3) Procedure of GP: The GP model is based on a repetitive computational process. As shown in Fig. 5, the GP model uses the following steps to solve problems. a) Initialization. Create at random an initial population of M models that represent potential solutions to the prediction of crash occurrence on freeways. b) Execute each model in the current population on the training data set and evaluate the fitness of each model in the current population. c) Select the parent models that will be used to produce offspring models. Fig. 5. Flowchart of GP. d) Probabilistically select the reproduction, crossover, and mutation operators. e) Generate a new model by performing one of the three genetic operators. f) Repeat steps c e until the predetermined population size M has been reached. g) Replace the M old models by new generated M models. h) Repeat steps b g until the predetermined maximum generation N has been reached. i) The model with the best fitness level in any generation is designated as the result of GPs. C. Binary Logit Model This paper aimed at developing a method of predicting crash occurrence on freeways based on traffic data that were collected from loop detector stations on freeways. This gave a binary outcome that can be coded as one if a crash occurs and zero if no crashes occur. Hence, the binary logit model was used to benchmark the GP model. The binary logit model has been widely used for predicting a binary-dependent variable as a function of predictor variables in transportation engineering [43] [45]. Using a binary logit model, the probability of crash occurrence can be estimated using the following equation [46]: P (x i )= e g(x i) (i = 1, 2,...,n) (3) where P (x i ) denotes the probability that the certain traffic flow conditions lead to crash occurrence, and g(x) is the multiple

6 XU et al.: GENETIC PROGRAMMING MODEL FOR REAL-TIME CRASH PREDICTION ON FREEWAYS 579 TABLE III DISTRIBUTION OF CRASHES IN THIS PAPER Moreover, 71.2% of the crashes occurred during daylight, and 13.2% crashes occurred at night without street light. linear combination of explanatory variables, which can be expressed as P (x i ) g(x) =ln 1 P (x i ) = β 0 + β 1 x 1i + + β k x ki (4) where x ki denotes the value of a variable k for sample i, and β k is the coefficient of variable k. The parameters β 0,β 1,β 2,...,β k can be estimated based on the log-likelihood function for (3), which is given by n ln L(β,x i )= [β 0 + β 1 x 1i + + β ki x ki i=1 ln(1 + e β 0+β 1 x 1i + +β ki x ki ) ]. (5) IV. DATA ANALYSIS AND RESULTS A. Preliminary Analysis A preliminary analysis of the crashes used in this paper was conducted in this section. The distributions of the crashes used in this paper were summarized in Table III. Most of the crashes (55.6%) in the data set were rear-end crashes, followed by sideswipe crashes of about 23.9%. With regard to the severity outcome, 71.5% of the crashes in the data set were propertydamage-only crashes, and 28.5% were injury crashes. With regard to the crash occurrence time, 35.5% of the crashes occurred during peak hours, and 64.5% occurred during offpeak hours. As aforementioned, a total of 40 loop detector stations are located along the selected 21-mi freeway section. The 40 loop detector stations divided the 21-mi freeway section into 39 freeway segments. Considering the crash location, 42.3% of the crashes occurred in a segment with a length larger than 0.6 mi. Among the 39 segments, only 7 segments are larger than 0.6 mi. This may imply that the geometric characteristic (segments length) was a significant variable that affects crash likelihood. B. Variable Selection Using RF As shown in Table I, 28 candidate variables were obtained in this paper. The RF model was used to select the significant variables that affect crash risk under each traffic state. The RF package in the MATLAB software was used to develop the RF models [47]. When using the RF model, the number of trees in the forest and the number of variables tried at each node first need to be specified. To obtain stable estimations of variable importance, we conducted the RF model and calculated OOB error rates for different numbers of trees. It was found that 1000 trees were sufficient to obtain a constant minimum error rate for both congested and uncongested traffic states. After the number of trees has been determined, the RF models were conducted with different numbers of variables tried at each node. The OOB error rate of the RF model for uncongested traffic state reached a minimum when the number of variables tried at each node was equal to 9. In addition, six variables tried at each node produced the minimum OOB error rate of the RF model for the congested traffic state. The RF model for each traffic state was first conducted using all the 28 variables, as shown in Table I. The variable importance of the 28 candidate variables for congested and uncongested traffic states was illustrated in Fig. 6. As expected, the traffic flow variables that contribute to crash risk were quite different between uncongested and congested traffic states. The average speed, occupancy difference between adjacent lanes, speed difference between adjacent lanes, and speed variance were the main contributing factors to crash risk in uncongested traffic conditions, whereas the average occupancy, standard deviation of occupancy, average speed, and speed difference between upstream and downstream stations were the main contributing factors to crash risk in congested traffic conditions. To select the number of important variables for developing GP models, the RF models were further conducted in a successive phase in which the number of input variables was set from 1 to 28. To be more specific, we first conducted the RF model using the top 1 variable in Fig. 6, then conducted the RF model using the top 2 variables in Fig. 6, and so on. The OOB error rate of the RF model for the uncongested traffic state reached a minimum when the top 12 important variables in Fig. 6(a) were used to develop the RF model. Therefore, the top 12 important variables in Fig. 6(a) were selected to develop the GP model for predicting the crash risk under the uncongested traffic state. In addition, the top 8 variables in Fig. 6(b) produced the minimum OOB error rate of the RF model for the congested traffic state. The GP model for the congested traffic state was developed based on these top 8 variables in Fig. 6(b). C. GP Model In this paper, two GP models were developed to separately predict the crash occurrence under congested and uncongested traffic states based on the traffic data collected from the loop detector stations on freeways. The research team developed the

7 580 IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, VOL. 14, NO. 2, JUNE 2013 Fig. 6. Variable importance based on the normalized Gini index for each traffic state. TABLE IV SUMMARY OF THE CONFIGURATION PARAMETERS OF THE GP MODEL GP model using GPLab toolbox 3.0 [48], which is a popular GP software coded in MATLAB. The various parameters used in the GP models were given in Table IV. As shown in Table IV, the function set that was used to build the GP model contained six standard arithmetic operators, i.e., +,,,, protected square root, and protected natural logarithm. The terminal set included the variables selected by RF and constant terminals (randomly generated floating-point numbers between 0 and 10). The population size was set to 1000, and the maximum number of generations was 100. With regard to the probability of operators, the reproduction probability was 0. The purpose of doing so was to let the crossover and mutation operation govern the evolutionary process. The probability of crossover and mutation was optimized during GP running by the automatic adaptation procedure [49]. In the automatic adaptation procedure, the probability values of operators will be increased if they have been producing models with better fitness. Otherwise, their probability values will be decreased. The initial population of individuals was generated by a ramped half-and-half method [34], and the depth for trees in the initial population was limited to 6. The maximum depth for generating trees in other populations was 30. The selection process was used to choose the parent models for producing offspring models. In this paper, the lexictour method was chosen as the selection process. In the lexictour method, a random number of models are chosen from the population, and the model with the best fitness among the chosen models is selected. In addition, if two models have the same fitness, the model with fewer nodes will be chosen as the best [50]. 1) GP Model Under the Uncongested Traffic State: Fig. 7 illustrates the crash prediction model under uncongested traffic conditions. Six traffic flow variables, weather conditions, and the spacing between upstream and downstream stations were found to be correlated with the crash risk in the GP model under uncongested traffic conditions. The GP model indicates a complex relationship of crash risk with traffic flow conditions, weather conditions, and geometry characteristics. Accordingly, Fig. 8 was developed to indicate the change in crash risk as the value of each continuous variable changes. Each continuous variable changed over a normal range of values when the other variables were kept at their sample mean.

8 XU et al.: GENETIC PROGRAMMING MODEL FOR REAL-TIME CRASH PREDICTION ON FREEWAYS 581 2) GP Model Under the Congested Traffic State: The crash prediction model under congested traffic conditions is illustrated in Fig. 9. The occupancy at downstream stations, the standard deviation of occupancy at the upstream station, the speed difference between the upstream and downstream stations, weather conditions, and the spacing between the upstream and downstream stations were found to be correlated with the crash risk in the GP model under congested traffic conditions. Fig. 10 was developed to explore the relationship of crash risk with different predictors in the GP model. As shown in Fig. 10(a), crash risk decreases with the increase in occupancy at the downstream station under congested traffic conditions. This is consistent with the findings in previous studies that drivers behave differently in spare and heavy traffic and crash risk may decrease with the increases in traffic density under congested traffic conditions [8], [52], [53]. As shown in Fig. 10(b) and (c), crash risk under congested traffic conditions tends to be high when the speed difference between the upstream and downstream stations and the standard deviation of occupancy at the upstream station are high. In addition, the spacing between the upstream and downstream stations was found to increase crash risk under congested traffic conditions. Fig. 7. GP model for the uncongested traffic state. As shown in Figs. 8(a) (d), the high occupancy and standard deviation of speed at the downstream station were found to be associated with the increase in crash risk. In addition, crash risk increases as the upstream and downstream speed decrease. The decreasing speed might represent the increase in traffic density and queue formations. These results are consistent with the findings of previous studies [4], [8], [10], [25]. As shown in Fig. 8(e), high occupancy difference between adjacent lanes at the downstream station results in an increase in crash risk. The large occupancy difference between adjacent lanes was found to be associated with high lane-change frequency [51]. Thus, high lane-change frequency increases the crash risk in uncongested traffic conditions. The occupancy difference between upstream and downstream stations was found to be associated with the increase in crash risk [Fig. 8(f)]. The findings of a previous study also demonstrated that crashes were more likely to occur under traffic flow states with a large occupancy difference between upstream and downstream [8]. Moreover, as shown in Fig. 8(g), the spacing between upstream and downstream stations increases the crash risk between them, indicating the fact that crashes are more likely to occur in a freeway segment with a large length. This is consistent with the findings shown in the preliminary analysis. D. Prediction Performance The prediction performance of the GP model under each traffic state was tested. The authors developed receiver operating characteristic (ROC) curves to evaluate the prediction performance of the GP models with different thresholds. The ROC curve is a graphical plot of the sensitivity on the y-axis against (1 specificity) on the x-axis for different thresholds. In this paper, the sensitivity represents the proportion of crash cases predicted as a crash, and (1 specificity) represents one minus the proportion of noncrash cases predicted as a noncrash. To develop the ROC curve of the GP model for each traffic state, we calculated the sensitivity and (1 specificity) for multiple thresholds by using the validation data sets. To test the relative prediction performance of the GP model for each traffic state, the research team benchmarked it against the binary logit model, which is a more prevalent methodology. For comparison, the authors developed the binary logit model for each traffic state using the same training data set. In addition, the validation data set for each traffic state was used to test the prediction performance of the binary logit models developed. The ROC curves for GP and the binary logit model under uncongested traffic conditions on the validation data set are illustrated in Fig. 11. As shown in Fig. 11, the ROC curve for the GP model is always to the left of the curve for the binary logit model, indicating that the prediction accuracy of the GP model under uncongested traffic conditions is always greater than that of the binary logit model, no matter what threshold is selected. Fig. 12 presents the ROC curves for GP and the binary logit model under congested traffic conditions. The prediction accuracy of the GP model under congested traffic conditions was also found to be greater than that of the binary logit model.

9 582 IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, VOL. 14, NO. 2, JUNE 2013 Fig. 8. Relationship between crash risk and different predictors in the GP model under uncongested traffic conditions. Tables V and VI summarize the prediction performance of the GP model under uncongested and congested traffic conditions, respectively. The prediction performance is measured by the percentage of the correctly predicted crash cases for different false-alarm rates (1 specificity). As shown in Tables V and VI, there is a tradeoff between crash prediction accuracy and false-alarm rate. The prediction accuracy of crash cases increased as the false-alarm rate was increased. Thus, the threshold needs to carefully be determined to meet the requirement of the practical implementation or the preference of a specific traffic agency. After determining the threshold, the prediction performance of the GP model could easily be evaluated using the aforementioned ROC curves. For example, if a threshold value is selected to accept a 20% false-alarm rate for identifying crash occurrence under uncongested traffic conditions, the crash prediction accuracy of the GP model is found to be 61.0%. For comparison, Tables V and VI also summarize the crash prediction accuracy of the binary logit model under each traffic state at the same false-alarm rate. As shown in Table V, the crash prediction accuracy of the GP model is much better than that of the binary logit model. The average difference in crash prediction accuracy between GP and the binary logit model is 8.2%, indicating that the GP model could increase the crash prediction accuracy under uncongested traffic conditions by an average of 8.2% compared with the binary logit model. For congested traffic flow conditions, the average difference in crash prediction accuracy between the two models was found to be 4.9%. Thus, the GP model could increase the crash prediction accuracy under congested traffic conditions by an average of 4.9%. V. C ONCLUSION This paper investigated the application of the GP model for real-time crash prediction on freeways. Traffic, weather, geometry, and crash data were collected from the I-880N

10 XU et al.: GENETIC PROGRAMMING MODEL FOR REAL-TIME CRASH PREDICTION ON FREEWAYS 583 Fig. 9. GP model for the congested traffic state. Fig. 10. Relationship between crash risk and different predictors in the GP model under congested traffic conditions. freeway located in the United States, in 2008 and RF was applied to select the contributing factors to crash risk under uncongested and congested traffic conditions. Based on the candidate variables selected by RF, the GP model was conducted to develop the crash prediction model for each traffic state. The traffic flow characteristics that affect crash risk were found to be quite different between congested and uncongested traffic conditions. In general, the traffic density, speed variance, lane-change frequency, and occupancy difference between upstream and downstream were the main contributing factors to crash risk under the uncongested traffic conditions. Under the congested traffic conditions, crash risk decreases with the increase in traffic density. The speed difference between upstream and downstream stations and the standard deviation of occupancy also affected crash risk under the congested traffic conditions.

11 584 IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, VOL. 14, NO. 2, JUNE 2013 TABLE VI PREDICTION PERFORMANCE OF THE GP MODEL FOR THE CONGESTED TRAFFIC STATE Fig. 11. ROC curves of GP and the binary logit model for the uncongested traffic state. The real-time crash prediction model developed by the GP algorithm can be used in ATMSs such as variable-speed-limit systems [54] and ramp metering [55], [56] to improve traffic safety on freeways. For example, when a freeway segment is found to be susceptible to crash occurrence, the variable speed limit system could be activated to reduce crash risk by predetermined management strategies. However, before the findings in this paper are used in practical engineering applications, additional research is still needed to test the transferability of the GP model using data that were collected from other freeways. In addition, this paper partly demonstrated the benefits of using the GP model in real-time crash prediction over the traditional logit model. However, the authors did not compare the GP model to other artificial intelligence models such as SVM and artificial neural network models. The authors recommend that future studies may focus on these issues. ACKNOWLEDGMENT Fig. 12. ROC curves of GP and the binary logit model for the congested traffic state. TABLE V PREDICTION PERFORMANCE OF THE GP MODEL FOR THE UNCONGESTED TRAFFIC STATE The ROC curves were developed to evaluate the prediction performance of the GP model under each traffic state. The validation results demonstrated that the prediction performance of the GP models was deemed satisfactory. For comparison, the binary logit model was also developed for each traffic state using the same training data set. In general, the prediction performance of the GP models was better than that of the binary logit models, no matter what threshold is selected. Compared with the binary logit model, the GP model could increase the crash prediction accuracy under uncongested traffic conditions by an average of 8.2% and increase the crash prediction accuracy under congested traffic conditions by an average of 4.9%. The authors would like to thank the Highway Performance Measurement System (PeMS) and California Department of Transportation (Caltrans) for providing the crash and traffic data that were used in this paper and X. Jiang for her help in the data collection. REFERENCES [1] C. Lee, B. Hellinga, and K. Ozbay, Quantifying effects of ramp metering on freeway safety, Accid. Anal. Prev., vol. 38, no. 2, pp , Mar [2] M. Abdel-Aty, J. Dilmore, and A. Dhindsa, Evaluation of variable speed limits for real-time freeway safety improvement, Accid. Anal. Prev., vol. 38, no. 2, pp , Mar [3] M. Abdel-Aty, J. Dilmore, and V. Gayah, Considering various ALINEA ramp metering strategies for crash risk mitigation on freeways under congested regime, Transp. Res. C: Emerging Technol., vol. 15, no. 2, pp , Apr [4] M. Abdel-Aty, N. Uddin, F. Abdalla, A. Pande, and L. Hsia, Predicting freeway crashes from loop detector data using matched-case-control logistic regression, Transp. Res. Rec., vol. 1897, no. 1, pp , [5] M. Abdel-Aty, N. Uddin, and A. Pande, Split models for predicting multivehicle crashes during high-speed and low-speed operating conditions on freeways, Transp. Res. Rec., vol. 1908, no. 1, pp , [6] Z. Zheng, S. Ahna, and C. Monsere, Impact of traffic oscillations on freeway crash occurrences, Accid. Anal. Prev., vol. 42, no. 2, pp , Mar [7] M. Abdel-Aty and P. Rajashekar, Calibrating a real-time traffic crashprediction model using archived weather and ITS traffic data, IEEE Trans. Intell. Transp. Syst., vol. 7, no. 2, pp , Jun [8] C. Xu, P. Liu, W. Wang, and Z. Li, Evaluation of the impacts of traffic states on crash risks on freeways, Accid. Anal. Prev., vol. 47, pp , Jul [9] M. Abdel-Aty, H. Hassan, M. Ahmed, and A. Al-Ghamdi, Real-time prediction of visibility-related crashes, Transp. Res. C, vol. 24, pp , Oct

12 XU et al.: GENETIC PROGRAMMING MODEL FOR REAL-TIME CRASH PREDICTION ON FREEWAYS 585 [10] M. Ahmed and M. Abdel-Aty, The viability of using automatic vehicle identification data for real-time crash prediction, IEEE Trans. Intell. Transp. Syst., vol. 13, no. 2, pp , Jun [11] M. Ahmed, M. Abdel-Aty, and R. Yu, A Bayesian updating approach for real-time safety evaluation using AVI data, presented at the Annu. Meeting Transport. Res. Board, Washington, DC, [12] C. Lee, F. Saccomanno, and B. Hellinga, Analysis of crash precursors on instrumented freeways, Transp. Res. Rec., vol. 1784, pp. 1 8, [13] C. Lee, B. Hellinga, and F. Saccomanno, Real-time crash prediction model for application to crash prevention in freeway traffic, Transp. Res. Rec., vol. 1840, pp , [14] J. Oh, C. Oh, S. Ritchie, and M. Chang, Real-time estimation of accident likelihood for safety enhancement, J. Transp. Eng., vol. 131, no. 5, pp , May [15] M. Ahmed, M. Abdel-Aty, and R. Yu, Assessment of the interaction between crash occurrence, mountainous freeway geometry, real-time weather and AVI traffic data, presented at the Annu. Meeting Transport. Res. Board, Washington, DC, [16] C. Lee, M. Abdel-Aty, and L. Hsia, Potential real-time indicators of sideswipe crashes on freeways, Transp. Res. Rec., vol. 1953, no. 1, pp , [17] T. Golob, W. Recker, and Y. Pavlis, Probabilistic models of freeway safety performance using traffic flow data as predictors, Safety Sci., vol. 46, no. 9, pp , Nov [18] C. Lee, P. Park, and M. Abdel-Aty, Lane-by-lane analysis of crash occurrence based on driver s lane-changing and car-following behavior, J. Transp. Safety Security, vol. 3, no. 2, pp , [19] C. Xu, P. Liu, W. Wang, and C. Yu, Exploration and identification of hazardous traffic flow states before crash occurrences on freeways, presented at the Annu. Meeting Transport. Res. Board, Washington, DC, [20] C. Xu, P. Liu, W. Wang, and Z. Li, Development of a crash risk index to identify real-time crash risks on freeways, presented at the Annu. Meeting Transport. Res. Board, Washington, DC, [21] Z. Christoforou, S. Cohen, and M. Karlaftis, Identifying crash type propensity using real-time traffic data on freeways, J. Safety Res.,vol.42, no. 1, pp , Feb [22] L. Mussone, A. Ferrari, and M. Oneta, An analysis of urban collisions using an artificial intelligence model, Accid. Anal. Prev., vol. 31, no. 6, pp , Nov [23] D. Delen, R. Sharda, and M. Bessonov, Identifying significant predictors of injury severity in traffic accidents using a series of artificial neural networks, Accid. Anal. Prev., vol. 38, no. 3, pp , May [24] P. Miaou and D. Lord, Modeling traffic crash-flow relationships for intersections: Dispersion parameter, functional form, and Bayes versus empirical Bayes, Transp. Res. Rec., vol. 1840, pp , [25] C. Oh, J. Oh, and S. Ritchie, Real-time hazardous traffic condition warning system: Framework and evaluation, IEEE Trans. Intell. Transp. Syst., vol. 6, no. 3, pp , Sep [26] M. Abdel-Aty and A. Pande, Identifying crash propensity using specific traffic speed conditions, J. Safety Res., vol. 36, no. 1, pp , Jan [27] M. Hossain and Y. Muromachi, Development of a real-time crash prediction model for urban expressway, J. Eastern Asia Soc. Transp. Stud., vol. 8, pp , [28] M. Hossain and Y. Muromachi, Evaluating location of placement and spacing of detectors for real-time crash prediction on urban expressways, presented at the Annu. Meeting Transport. Res. Board, Washington, DC, [29] A. Pande and M. Abdel-Aty, Assessment of freeway traffic parameters leading to lane-change-related collisions, Accid. Anal. Prev., vol. 38, no. 5, pp , [30] A. Pande and M. Abdel-Aty, Comprehensive analysis of the relationship between real-time traffic surveillance data and rear-end crashes on freeways, Transp. Res. Rec., vol. 1953, pp , [31] A. Pande, A. Das, M. Abdel-Aty, and H. Hassan, Real-time crash risk estimation: Are all freeways created equal? Transp. Res. Rec., vol. 2237, pp , [32] M. Hossain and Y. Muromachi, Understanding crash mechanism and selecting appropriate interventions for real-time hazard mitigation on urban expressways, Transp. Res. Rec., vol. 2213, pp , [33] X. Qu, W. Wang, P. Liu, and D. Noyce, Real-time prediction of freeway rear-end crash potential by support vector machine, presented at the Annu. Meeting Transport. Res. Board, Washington, DC, [34] R. Koza, Genetic Programming: On the Programming of Computers by Means of Natural Selection. Cambridge, MA: MIT Press, [35] A. Das and M. Abdel-Aty, A genetic programming approach to explore the crash severity on multilane roads, Accid. Anal. Prev., vol. 42, no. 2, pp , Mar [36] A. Das, M. Abdel-Aty, and A. Pande, Genetic programming to investigate design parameters contributing to crash occurrence on urban arterials, Transp. Res. Rec., vol. 2147, pp , [37] H. Etemadi, A. Rostamy, and H. Dehkordi, A genetic programming model for bankruptcy prediction: Empirical evidence from Iran, Expert Syst. Appl., vol. 36, no. 2, pp , Mar [38] E. McKee and T. Lensberg, Genetic programming and rough sets: A hybrid approach to bankruptcy classification, Eur. J. Oper. Res., vol. 138, no. 2, pp , Apr [39] T. Lensberg, A. Eilifsen, and E. McKee, Bankruptcy theory development and classification via genetic programming, Eur. J. Oper. Res., vol. 169, no. 2, pp , Mar [40] M. Abdel-Aty, A. Pande, C. Lee, V. Gayah, R. Cunningham, A. Dhindsa, and J. Dilmore, Linking crash patterns to ITS-related archived data Phase II, Volume I: Real-time crash risk assessment models BD-550-5, Florida Dept. Transp., Tallahassee, FL, [41] L. Breiman, Random forests, Mach. Learn., vol. 45, no. 1, pp. 5 32, [42] R. Harb, X. Yan, E. Radwan, and X. Su, Exploring precrash maneuvers using classification trees and random forests, Accid. Anal. Prev., vol. 41, no. 1, pp , Jan [43] F. Xu and Z. Tian, Driver behavior and gap-acceptance characteristics at roundabouts in California, Transp. Res. Rec., vol. 2071, pp , [44] P. Liu, J. Lu, H. Zhou, and G. Sokolow, Operational effects of U-turns as alternatives to direct left-turns, J. Transp. Eng., vol. 133, no. 5, pp , May [45] S. Hubbard, D. Bullock, and F. Mannering, Right turns on green and pedestrian level of service: Statistical assessment, J. Transp. Eng., vol. 135, no. 4, pp , Apr [46] S. Washington, M. Karlaftis, and F. Mannering, Statistical and Econometric Methods for Transportation Data Analysis. Boca Raton, FL: CRC Press, 2003, pp [47] RandomForest: Breiman and Cutler s Random Forests for Classification and Regression. [Online]. Available: randomforest-matlab/ [48] S. Silva, GPLAB A Genetic Programming Toolbox for MATLAB, MathWorks, Natick, MA. [Online]. Available: net/ [49] L. Davis, Adapting operator probabilities in genetic algorithms, in Proc. 3rd Int. Conf. Genetic Algorithms, 1989, pp [50] S. Luke and L. Panait, Lexicographic parsimony pressure, in Proc. GECCO, 2002, pp [51] D. Gazis, R. Herman, and G. H. Weiss, Density oscillations between lanes of a multilane highway, Oper. Res., vol. 10, no. 5, pp , Sep./Oct [52] M. Lord, A. Manar, and A. Vizioli, Modeling crash-flow-density and crash-flow-v/c ratio relationships for rural and urban freeway segments, Accid. Anal. Prev., vol. 37, no. 1, pp , [53] E. Hauer, Observational Before After Studies in Road Safety: Estimating the Effect of Highway and Traffic Engineering Measures on Road Safety. New York: Pergamon, [54] R. Carlson, I. Papamichail, and M. Papageorgiou, Local feedback-based mainstream traffic flow control on motorways using variable speed limits, IEEE Trans. Intell. Transp. Syst., vol. 12, no. 4, pp , Dec [55] J. Frejo and E. Camacho, Global versus local MPC algorithms in freeway traffic control with ramp metering and variable speed limits, IEEE Trans. Intell. Transp. Syst., vol. 13, no. 4, pp , Dec [56] D. Zhao, X. Bai, F.-Y. Wang, J. Xu, and W. Yu, DHP method for ramp metering of freeway traffic, IEEE Trans. Intell. Transp. Syst., vol. 12, no. 4, pp , Dec Chengcheng Xu received the B.Eng. and M.Eng. degrees in transportation planning and management from Southeast University, Nanjing, China, in 2008 and 2010, respectively. He is currently working toward the Ph.D. degree with the Key Laboratory of Traffic Planning and Management, School of Transportation, Southeast University. His research interests include traffic safety and intelligent transportation systems.

13 586 IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, VOL. 14, NO. 2, JUNE 2013 Wei Wang received the M.Eng. and Ph.D. degrees in civil engineering from Southeast University, Nanjing, China, in 1985 and 1989, respectively. He is currently a Professor with the Key Laboratory of Traffic Planning and Management, School of Transportation, Southeast University. His research interests include urban transportation and intelligent transportation systems. Dr. Wang is a Member of the Model Traffic Technology Panel of the National High-Tech R&D Program of China (863 Program) and the panel of the National Natural Science Foundation of China. He received the National Distinguished Teacher Award of China in Pan Liu received the Ph.D. degree in civil engineering from the University of South Florida, Tampa, in He is currently a Professor with the Key Laboratory of Traffic Planning and Management, School of Transportation, Southeast University, Nanjing, China. His research interests include traffic design, traffic safety, and intelligent transportation systems.

Accident Analysis and Prevention

Accident Analysis and Prevention Accident Analysis and Prevention 94 (2016) 59 64 Contents lists available at ScienceDirect Accident Analysis and Prevention journal homepage: www.elsevier.com/locate/aap Utilizing the eigenvectors of freeway

More information

Traffic Surveillance from a Safety Perspective: An ITS Data Application

Traffic Surveillance from a Safety Perspective: An ITS Data Application Proceedings of the 8th International IEEE Conference on Intelligent Transportation Systems Vienna, Austria, September 13-16, 2005 WB4.2 Traffic Surveillance from a Safety Perspective: An ITS Data Application

More information

Assessment of Interaction of Crash Occurrence, Mountainous Freeway Geometry, Real-Time Weather, and Traffic Data

Assessment of Interaction of Crash Occurrence, Mountainous Freeway Geometry, Real-Time Weather, and Traffic Data Assessment of Interaction of Crash Occurrence, Mountainous Freeway Geometry, Real-Time Weather, and Traffic Data Mohamed M. Ahmed, Mohamed Abdel-Aty, and Rongjie Yu This study investigated the effect of

More information

Bayesian Updating Approach for Real-Time Safety Evaluation with Automatic Vehicle Identification Data

Bayesian Updating Approach for Real-Time Safety Evaluation with Automatic Vehicle Identification Data Bayesian Updating Approach for Real-Time Safety Evaluation with Automatic Vehicle Identification Data Mohamed M. Ahmed, Mohamed Abdel-Aty, and Rongjie Yu Although numerous studies have attempted to use

More information

Accident Analysis and Prevention

Accident Analysis and Prevention Accident Analysis and Prevention 64 (2014) 52 61 Contents lists available at ScienceDirect Accident Analysis and Prevention j ourna l h om epage: www.elsevier.com/locate/aap Surrogate safety measure for

More information

University of Central Florida, USA

University of Central Florida, USA Dr. Mohamed Abdel-Aty, PE Trustee Chair Pegasus Professor and Chair Department of Civil, Environmental & Construction Engineering University of Central Florida, USA Editor-in-Chief, Accident Analysis &

More information

Responsive Traffic Management Through Short-Term Weather and Collision Prediction

Responsive Traffic Management Through Short-Term Weather and Collision Prediction Responsive Traffic Management Through Short-Term Weather and Collision Prediction Presenter: Stevanus A. Tjandra, Ph.D. City of Edmonton Office of Traffic Safety (OTS) Co-authors: Yongsheng Chen, Ph.D.,

More information

Transportation Research Part C

Transportation Research Part C Transportation Research Part C 26 (2013) 203 213 Contents lists available at SciVerse ScienceDirect Transportation Research Part C journal homepage: www.elsevier.com/locate/trc A data fusion framework

More information

Evaluation of fog-detection and advisory-speed system

Evaluation of fog-detection and advisory-speed system Evaluation of fog-detection and advisory-speed system A. S. Al-Ghamdi College of Engineering, King Saud University, P. O. Box 800, Riyadh 11421, Saudi Arabia Abstract Highway safety is a major concern

More information

Exploring Crash Injury Severity on Urban Motorways by Applying Finite Mixture Models

Exploring Crash Injury Severity on Urban Motorways by Applying Finite Mixture Models Available online at www.sciencedirect.com ScienceDirect Transportation Research Procedia 00 (2018) 000 000 www.elsevier.com/locate/procedia mobil.tum 2018 "Urban Mobility Shaping the Future Together" -

More information

Bayesian Network-Based Road Traffic Accident Causality Analysis

Bayesian Network-Based Road Traffic Accident Causality Analysis 2010 WASE International Conference on Information Engineering Bayesian Networ-Based Road Traffic Accident Causality Analysis Xu Hongguo e-mail: xhg335@163.com Zong Fang (Contact Author) Abstract Traffic

More information

Capacity Drop. Relationship Between Speed in Congestion and the Queue Discharge Rate. Kai Yuan, Victor L. Knoop, and Serge P.

Capacity Drop. Relationship Between Speed in Congestion and the Queue Discharge Rate. Kai Yuan, Victor L. Knoop, and Serge P. Capacity Drop Relationship Between in Congestion and the Queue Discharge Rate Kai Yuan, Victor L. Knoop, and Serge P. Hoogendoorn It has been empirically observed for years that the queue discharge rate

More information

Risk Assessment of Pedestrian Accident Area Using Spatial Analysis and Deep Learning

Risk Assessment of Pedestrian Accident Area Using Spatial Analysis and Deep Learning Risk Assessment of Pedestrian Accident Area Using Spatial Analysis and Deep Learning Ilyoung Hong*, Hanseung Choi, Songpyo Hong Department of GIS Engineering, Namseoul University, Republic of Korea. *

More information

Local Calibration Factors for Implementing the Highway Safety Manual in Maine

Local Calibration Factors for Implementing the Highway Safety Manual in Maine Local Calibration Factors for Implementing the Highway Safety Manual in Maine 2017 Northeast Transportation Safety Conference Cromwell, Connecticut October 24-25, 2017 MAINE Darryl Belz, P.E. Maine Department

More information

EXAMINATION OF THE SAFETY IMPACTS OF VARYING FOG DENSITIES: A CASE STUDY OF I-77 IN VIRGINIA

EXAMINATION OF THE SAFETY IMPACTS OF VARYING FOG DENSITIES: A CASE STUDY OF I-77 IN VIRGINIA 0 0 0 EXAMINATION OF THE SAFETY IMPACTS OF VARYING FOG DENSITIES: A CASE STUDY OF I- IN VIRGINIA Katie McCann Graduate Research Assistant University of Virginia 0 Edgemont Road Charlottesville, VA 0 --

More information

LEVERAGING HIGH-RESOLUTION TRAFFIC DATA TO UNDERSTAND THE IMPACTS OF CONGESTION ON SAFETY

LEVERAGING HIGH-RESOLUTION TRAFFIC DATA TO UNDERSTAND THE IMPACTS OF CONGESTION ON SAFETY LEVERAGING HIGH-RESOLUTION TRAFFIC DATA TO UNDERSTAND THE IMPACTS OF CONGESTION ON SAFETY Tingting Huang 1, Shuo Wang 2, Anuj Sharma 3 1,2,3 Department of Civil, Construction and Environmental Engineering,

More information

Estimation Of Hybrid Models For Real-time Crash Risk Assessment On Freeways

Estimation Of Hybrid Models For Real-time Crash Risk Assessment On Freeways University of Central Florida Electronic Theses and Dissertations Doctoral Dissertation (Open Access) Estimation Of Hybrid Models For Real-time Crash Risk Assessment On Freeways 2005 anurag pande University

More information

Accident Prediction Models for Freeways

Accident Prediction Models for Freeways TRANSPORTATION RESEARCH RECORD 1401 55 Accident Prediction Models for Freeways BHAGWANT PERSAUD AND LESZEK DZBIK The modeling of freeway accidents continues to be of interest because of the frequency and

More information

Effect of Environmental Factors on Free-Flow Speed

Effect of Environmental Factors on Free-Flow Speed Effect of Environmental Factors on Free-Flow Speed MICHAEL KYTE ZAHER KHATIB University of Idaho, USA PATRICK SHANNON Boise State University, USA FRED KITCHENER Meyer Mohaddes Associates, USA ABSTRACT

More information

Short-term traffic volume prediction using neural networks

Short-term traffic volume prediction using neural networks Short-term traffic volume prediction using neural networks Nassim Sohaee Hiranya Garbha Kumar Aswin Sreeram Abstract There are many modeling techniques that can predict the behavior of complex systems,

More information

Classification using stochastic ensembles

Classification using stochastic ensembles July 31, 2014 Topics Introduction Topics Classification Application and classfication Classification and Regression Trees Stochastic ensemble methods Our application: USAID Poverty Assessment Tools Topics

More information

Real-Time Crash Prediction Model for the Application to Crash Prevention in Freeway Traffic

Real-Time Crash Prediction Model for the Application to Crash Prevention in Freeway Traffic Paper No. 032749 RealTime Crash Prediction Model for the Application to Crash Prevention in Freeway Traffic Chris Lee Department of Civil Engineering University of Waterloo Waterloo, Ontario, Canada N2L

More information

FREEWAY INCIDENT FREQUENCY ANALYSIS BASED ON CART METHOD

FREEWAY INCIDENT FREQUENCY ANALYSIS BASED ON CART METHOD XUECAI XU, Ph.D. E-mail: xuecai_xu@hust.edu.cn Huazhong University of Science and Technology & University of Hong Kong China ŽELJKO ŠARIĆ, Ph.D. Candidate E-mail: zeljko.saric@fpz.hr Faculty of Transport

More information

TRB Paper Examining Methods for Estimating Crash Counts According to Their Collision Type

TRB Paper Examining Methods for Estimating Crash Counts According to Their Collision Type TRB Paper 10-2572 Examining Methods for Estimating Crash Counts According to Their Collision Type Srinivas Reddy Geedipally 1 Engineering Research Associate Texas Transportation Institute Texas A&M University

More information

AN INVESTIGATION INTO THE IMPACT OF RAINFALL ON FREEWAY TRAFFIC FLOW

AN INVESTIGATION INTO THE IMPACT OF RAINFALL ON FREEWAY TRAFFIC FLOW AN INVESTIGATION INTO THE IMPACT OF RAINFALL ON FREEWAY TRAFFIC FLOW Brian L. Smith Assistant Professor Department of Civil Engineering University of Virginia PO Box 400742 Charlottesville, VA 22904-4742

More information

PREDICTING TRAFFIC CRASHES USING REAL-TIME TRAFFIC SPEED PATTERNS

PREDICTING TRAFFIC CRASHES USING REAL-TIME TRAFFIC SPEED PATTERNS PREDICTING TRAFFIC CRASHES USING REAL-TIME TRAFFIC SPEED PATTERNS Mohamed Abdel-Aty", Anurag Pande Abstract of the recent advances in traffic surveillance technology and concern over there have been very

More information

Real-Time Crash Prediction Model for Application to Crash Prevention in Freeway Traffic

Real-Time Crash Prediction Model for Application to Crash Prevention in Freeway Traffic Paper No. 032749 RealTime Crash Prediction Model for Application to Crash Prevention in Freeway Traffic Chris Lee Department of Civil Engineering University of Waterloo Waterloo, Ontario, Canada N2L 3G1

More information

Relationship between Traffic Density, Speed and Safety and Its Implication on Setting Variable Speed Limits on Freeways

Relationship between Traffic Density, Speed and Safety and Its Implication on Setting Variable Speed Limits on Freeways 1 1 1 1 1 1 1 1 0 1 0 1 0 1 Relationship between Traffic Density, Speed and Safety and Its Implication on Setting Variable Speed Limits on Freeways Jake Kononov, Ph.D., P.E. Colorado Department of Transportation

More information

Active Traffic Management Case Study: Phase 1

Active Traffic Management Case Study: Phase 1 University of Nebraska - Lincoln DigitalCommons@University of Nebraska - Lincoln Nebraska Department of Transportation Research Reports Nebraska LTAP 3-2016 Active Traffic Management Case Study: Phase

More information

Chapter 5 Traffic Flow Characteristics

Chapter 5 Traffic Flow Characteristics Chapter 5 Traffic Flow Characteristics 1 Contents 2 Introduction The Nature of Traffic Flow Approaches to Understanding Traffic Flow Parameters Connected with Traffic Flow Categories of Traffic Flow The

More information

The relationship between urban accidents, traffic and geometric design in Tehran

The relationship between urban accidents, traffic and geometric design in Tehran Urban Transport XVIII 575 The relationship between urban accidents, traffic and geometric design in Tehran S. Aftabi Hossein 1 & M. Arabani 2 1 Bandar Anzali Branch, Islamic Azad University, Iran 2 Department

More information

A MODIFIED CELLULAR AUTOMATON MODEL FOR RING ROAD TRAFFIC WITH VELOCITY GUIDANCE

A MODIFIED CELLULAR AUTOMATON MODEL FOR RING ROAD TRAFFIC WITH VELOCITY GUIDANCE International Journal of Modern Physics C Vol. 20, No. 5 (2009) 711 719 c World Scientific Publishing Company A MODIFIED CELLULAR AUTOMATON MODEL FOR RING ROAD TRAFFIC WITH VELOCITY GUIDANCE C. Q. MEI,,

More information

STOCHASTIC CUSP CATASTROPHE MODELS WITH TRAFFIC AND WEATHER DATA FOR CRASH SEVERITY ANALYSIS ON URBAN ARTERIALS

STOCHASTIC CUSP CATASTROPHE MODELS WITH TRAFFIC AND WEATHER DATA FOR CRASH SEVERITY ANALYSIS ON URBAN ARTERIALS STOCHASTIC CUSP CATASTROPHE MODELS WITH TRAFFIC AND WEATHER DATA FOR CRASH SEVERITY ANALYSIS ON URBAN ARTERIALS Athanasios Theofilatos*, PhD Research Associate National Technical University of Athens,

More information

VIRGINIA S I-77 VARIABLE SPEED LIMIT SYSTEM FOR LOW VISIBILITY CONDITIONS

VIRGINIA S I-77 VARIABLE SPEED LIMIT SYSTEM FOR LOW VISIBILITY CONDITIONS VIRGINIA S I-77 VARIABLE SPEED LIMIT SYSTEM FOR LOW VISIBILITY CONDITIONS Christopher D. McDonald, PE, PTOE Regional Operations Director, Southwest Region NRITS and ITS Arizona Annual Conference October

More information

Learning theory. Ensemble methods. Boosting. Boosting: history

Learning theory. Ensemble methods. Boosting. Boosting: history Learning theory Probability distribution P over X {0, 1}; let (X, Y ) P. We get S := {(x i, y i )} n i=1, an iid sample from P. Ensemble methods Goal: Fix ɛ, δ (0, 1). With probability at least 1 δ (over

More information

Real-time Traffic Safety Evaluation Models And Their Application For Variable Speed Limits

Real-time Traffic Safety Evaluation Models And Their Application For Variable Speed Limits University of Central Florida Electronic Theses and Dissertations Doctoral Dissertation (Open Access) Real-time Traffic Safety Evaluation Models And Their Application For Variable Speed Limits 2013 Rongjie

More information

New Achievement in the Prediction of Highway Accidents

New Achievement in the Prediction of Highway Accidents Article New Achievement in the Prediction of Highway Accidents Gholamali Shafabakhsh a, * and Yousef Sajed b Faculty of Civil Engineering, Semnan University, University Sq., P.O. Box 35196-45399, Semnan,

More information

Optimization of Short-Term Traffic Count Plan to Improve AADT Estimation Error

Optimization of Short-Term Traffic Count Plan to Improve AADT Estimation Error International Journal Of Engineering Research And Development e-issn: 2278-067X, p-issn: 2278-800X, www.ijerd.com Volume 13, Issue 10 (October 2017), PP.71-79 Optimization of Short-Term Traffic Count Plan

More information

Electric Load Forecasting Using Wavelet Transform and Extreme Learning Machine

Electric Load Forecasting Using Wavelet Transform and Extreme Learning Machine Electric Load Forecasting Using Wavelet Transform and Extreme Learning Machine Song Li 1, Peng Wang 1 and Lalit Goel 1 1 School of Electrical and Electronic Engineering Nanyang Technological University

More information

Lossless Online Bayesian Bagging

Lossless Online Bayesian Bagging Lossless Online Bayesian Bagging Herbert K. H. Lee ISDS Duke University Box 90251 Durham, NC 27708 herbie@isds.duke.edu Merlise A. Clyde ISDS Duke University Box 90251 Durham, NC 27708 clyde@isds.duke.edu

More information

A Simple Algorithm for Learning Stable Machines

A Simple Algorithm for Learning Stable Machines A Simple Algorithm for Learning Stable Machines Savina Andonova and Andre Elisseeff and Theodoros Evgeniou and Massimiliano ontil Abstract. We present an algorithm for learning stable machines which is

More information

Data Warehousing & Data Mining

Data Warehousing & Data Mining 13. Meta-Algorithms for Classification Data Warehousing & Data Mining Wolf-Tilo Balke Silviu Homoceanu Institut für Informationssysteme Technische Universität Braunschweig http://www.ifis.cs.tu-bs.de 13.

More information

The Research of Urban Rail Transit Sectional Passenger Flow Prediction Method

The Research of Urban Rail Transit Sectional Passenger Flow Prediction Method Journal of Intelligent Learning Systems and Applications, 2013, 5, 227-231 Published Online November 2013 (http://www.scirp.org/journal/jilsa) http://dx.doi.org/10.4236/jilsa.2013.54026 227 The Research

More information

P7.7 A CLIMATOLOGICAL STUDY OF CLOUD TO GROUND LIGHTNING STRIKES IN THE VICINITY OF KENNEDY SPACE CENTER, FLORIDA

P7.7 A CLIMATOLOGICAL STUDY OF CLOUD TO GROUND LIGHTNING STRIKES IN THE VICINITY OF KENNEDY SPACE CENTER, FLORIDA P7.7 A CLIMATOLOGICAL STUDY OF CLOUD TO GROUND LIGHTNING STRIKES IN THE VICINITY OF KENNEDY SPACE CENTER, FLORIDA K. Lee Burns* Raytheon, Huntsville, Alabama Ryan K. Decker NASA, Marshall Space Flight

More information

Effectiveness of Experimental Transverse- Bar Pavement Marking as Speed-Reduction Treatment on Freeway Curves

Effectiveness of Experimental Transverse- Bar Pavement Marking as Speed-Reduction Treatment on Freeway Curves Effectiveness of Experimental Transverse- Bar Pavement Marking as Speed-Reduction Treatment on Freeway Curves Timothy J. Gates, Xiao Qin, and David A. Noyce Researchers performed a before-and-after analysis

More information

Tornado Hazard Risk Analysis: A Report for Rutherford County Emergency Management Agency

Tornado Hazard Risk Analysis: A Report for Rutherford County Emergency Management Agency Tornado Hazard Risk Analysis: A Report for Rutherford County Emergency Management Agency by Middle Tennessee State University Faculty Lisa Bloomer, Curtis Church, James Henry, Ahmad Khansari, Tom Nolan,

More information

Active Sonar Target Classification Using Classifier Ensembles

Active Sonar Target Classification Using Classifier Ensembles International Journal of Engineering Research and Technology. ISSN 0974-3154 Volume 11, Number 12 (2018), pp. 2125-2133 International Research Publication House http://www.irphouse.com Active Sonar Target

More information

Statistics and learning: Big Data

Statistics and learning: Big Data Statistics and learning: Big Data Learning Decision Trees and an Introduction to Boosting Sébastien Gadat Toulouse School of Economics February 2017 S. Gadat (TSE) SAD 2013 1 / 30 Keywords Decision trees

More information

TRAFFIC FLOW MODELING AND FORECASTING THROUGH VECTOR AUTOREGRESSIVE AND DYNAMIC SPACE TIME MODELS

TRAFFIC FLOW MODELING AND FORECASTING THROUGH VECTOR AUTOREGRESSIVE AND DYNAMIC SPACE TIME MODELS TRAFFIC FLOW MODELING AND FORECASTING THROUGH VECTOR AUTOREGRESSIVE AND DYNAMIC SPACE TIME MODELS Kamarianakis Ioannis*, Prastacos Poulicos Foundation for Research and Technology, Institute of Applied

More information

CAPACITY DROP: A RELATION BETWEEN THE SPEED IN CONGESTION AND THE QUEUE DISCHARGE RATE

CAPACITY DROP: A RELATION BETWEEN THE SPEED IN CONGESTION AND THE QUEUE DISCHARGE RATE CAPACITY DROP: A RELATION BETWEEN THE SPEED IN CONGESTION AND THE QUEUE DISCHARGE RATE Kai Yuan, PhD candidate TRAIL research school Department of Transport and Planning Faculty of Civil Engineering and

More information

Safety effects of lower speed limits during winter months

Safety effects of lower speed limits during winter months Safety effects of lower speed limits during winter months Jaakko Klang Traffic Safety Engineer Centres for Economic Development, Transport and the Environment Finland 13.9.2018 1 1. Finland 13.9.2018 2

More information

Unobserved Heterogeneity and the Statistical Analysis of Highway Accident Data. Fred Mannering University of South Florida

Unobserved Heterogeneity and the Statistical Analysis of Highway Accident Data. Fred Mannering University of South Florida Unobserved Heterogeneity and the Statistical Analysis of Highway Accident Data Fred Mannering University of South Florida Highway Accidents Cost the lives of 1.25 million people per year Leading cause

More information

Modifications of asymmetric cell transmission model for modeling variable speed limit strategies

Modifications of asymmetric cell transmission model for modeling variable speed limit strategies Modifications of asymmetric cell transmission model for modeling variable speed limit strategies Josep Maria Torné* CENIT Center for Innovation in Transport, Technical University of Catalonia (UPC) Barcelona,

More information

ANALYSIS OF INTRINSIC FACTORS CONTRIBUTING TO URBAN ROAD CRASHES

ANALYSIS OF INTRINSIC FACTORS CONTRIBUTING TO URBAN ROAD CRASHES S. Raicu, et al., Int. J. of Safety and Security Eng., Vol. 7, No. 1 (2017) 1 9 ANALYSIS OF INTRINSIC FACTORS CONTRIBUTING TO URBAN ROAD CRASHES S. RAICU, D. COSTESCU & S. BURCIU Politehnica University

More information

Measuring Wave Velocities on Highways during Congestion using Cross Spectral Analysis

Measuring Wave Velocities on Highways during Congestion using Cross Spectral Analysis Measuring Wave Velocities on Highways during Congestion using Cross Spectral Analysis Yun Wang, Member, IEEE, Diane Foster, and Benjamin Coifman, Member, IEEE Abstract Previous research has shown that

More information

Research Article Weather Forecasting Using Sliding Window Algorithm

Research Article Weather Forecasting Using Sliding Window Algorithm ISRN Signal Processing Volume 23, Article ID 5654, 5 pages http://dx.doi.org/.55/23/5654 Research Article Weather Forecasting Using Sliding Window Algorithm Piyush Kapoor and Sarabjeet Singh Bedi 2 KvantumInc.,Gurgaon22,India

More information

Statistical Machine Learning from Data

Statistical Machine Learning from Data Samy Bengio Statistical Machine Learning from Data 1 Statistical Machine Learning from Data Ensembles Samy Bengio IDIAP Research Institute, Martigny, Switzerland, and Ecole Polytechnique Fédérale de Lausanne

More information

Estimating the vehicle accumulation: Data-fusion of loop-detector flow and floating car speed data

Estimating the vehicle accumulation: Data-fusion of loop-detector flow and floating car speed data Estimating the vehicle accumulation: Data-fusion of loop-detector flow and floating car speed data Paul B.C. van Erp (corresponding author) Victor L. Knoop Serge P. Hoogendoorn Department of Transport

More information

Modeling Simple and Combination Effects of Road Geometry and Cross Section Variables on Traffic Accidents

Modeling Simple and Combination Effects of Road Geometry and Cross Section Variables on Traffic Accidents Modeling Simple and Combination Effects of Road Geometry and Cross Section Variables on Traffic Accidents Terrance M. RENGARASU MS., Doctoral Degree candidate Graduate School of Engineering, Hokkaido University

More information

Predicting flight on-time performance

Predicting flight on-time performance 1 Predicting flight on-time performance Arjun Mathur, Aaron Nagao, Kenny Ng I. INTRODUCTION Time is money, and delayed flights are a frequent cause of frustration for both travellers and airline companies.

More information

A RAIN PIXEL RESTORATION ALGORITHM FOR VIDEOS WITH DYNAMIC SCENES

A RAIN PIXEL RESTORATION ALGORITHM FOR VIDEOS WITH DYNAMIC SCENES A RAIN PIXEL RESTORATION ALGORITHM FOR VIDEOS WITH DYNAMIC SCENES V.Sridevi, P.Malarvizhi, P.Mathivannan Abstract Rain removal from a video is a challenging problem due to random spatial distribution and

More information

Texas Transportation Institute The Texas A&M University System College Station, Texas

Texas Transportation Institute The Texas A&M University System College Station, Texas 1. Report No. FHWA/TX-06/0-4946-1 2. Government Accession No. 3. Recipient's Catalog No. 4. Title and Subtitle DYNAMIC TRAFFIC FLOW MODELING FOR INCIDENT DETECTION AND SHORT-TERM CONGESTION PREDICTION:

More information

Traffic Flow Forecasting for Urban Work Zones

Traffic Flow Forecasting for Urban Work Zones TRAFFIC FLOW FORECASTING FOR URBAN WORK ZONES 1 Traffic Flow Forecasting for Urban Work Zones Yi Hou, Praveen Edara, Member, IEEE and Carlos Sun Abstract None of numerous existing traffic flow forecasting

More information

TRB Paper Hot Spot Identification by Modeling Single-Vehicle and Multi-Vehicle Crashes Separately

TRB Paper Hot Spot Identification by Modeling Single-Vehicle and Multi-Vehicle Crashes Separately TRB Paper 10-2563 Hot Spot Identification by Modeling Single-Vehicle and Multi-Vehicle Crashes Separately Srinivas Reddy Geedipally 1 Engineering Research Associate Texas Transportation Institute Texas

More information

PLANNING TRAFFIC SAFETY IN URBAN TRANSPORTATION NETWORKS: A SIMULATION-BASED EVALUATION PROCEDURE

PLANNING TRAFFIC SAFETY IN URBAN TRANSPORTATION NETWORKS: A SIMULATION-BASED EVALUATION PROCEDURE PLANNING TRAFFIC SAFETY IN URBAN TRANSPORTATION NETWORKS: A SIMULATION-BASED EVALUATION PROCEDURE Michele Ottomanelli and Domenico Sassanelli Polytechnic of Bari Dept. of Highways and Transportation EU

More information

DAYLIGHT, TWILIGHT, AND NIGHT VARIATION IN ROAD ENVIRONMENT-RELATED FREEWAY TRAFFIC CRASHES IN KOREA

DAYLIGHT, TWILIGHT, AND NIGHT VARIATION IN ROAD ENVIRONMENT-RELATED FREEWAY TRAFFIC CRASHES IN KOREA DAYLIGHT, TWILIGHT, AND NIGHT VARIATION IN ROAD ENVIRONMENT-RELATED FREEWAY TRAFFIC CRASHES IN KOREA Sungmin Hong, Ph.D. Korea Transportation Safety Authority 17, Hyeoksin 6-ro, Gimcheon-si, Gyeongsangbuk-do,

More information

Freeway Travel Time Forecast Using Artifical Neural Networks With Cluster Method

Freeway Travel Time Forecast Using Artifical Neural Networks With Cluster Method 12th International Conference on Information Fusion Seattle, WA, USA, July 6-9, 2009 Freeway Travel Time Forecast Using Artifical Neural Networks With Cluster Method Ying LEE Department of Hospitality

More information

Estimating Traffic Accidents in Turkey Using Differential Evolution Algorithm

Estimating Traffic Accidents in Turkey Using Differential Evolution Algorithm SSP - JOURNAL OF CIVIL ENGINEERING Vol. 12, Issue 1, 2017 DOI: 10.1515/sspjce-2017-0008 Estimating Traffic Accidents in Turkey Using Differential Evolution Algorithm Ali Payıdar Akgüngör, Ersin Korkmaz

More information

Yu (Marco) Nie. Appointment Northwestern University Assistant Professor, Department of Civil and Environmental Engineering, Fall present.

Yu (Marco) Nie. Appointment Northwestern University Assistant Professor, Department of Civil and Environmental Engineering, Fall present. Yu (Marco) Nie A328 Technological Institute Civil and Environmental Engineering 2145 Sheridan Road, Evanston, IL 60202-3129 Phone: (847) 467-0502 Fax: (847) 491-4011 Email: y-nie@northwestern.edu Appointment

More information

A Cellular Automaton Model for Heterogeneous and Incosistent Driver Behavior in Urban Traffic

A Cellular Automaton Model for Heterogeneous and Incosistent Driver Behavior in Urban Traffic Commun. Theor. Phys. 58 (202) 744 748 Vol. 58, No. 5, November 5, 202 A Cellular Automaton Model for Heterogeneous and Incosistent Driver Behavior in Urban Traffic LIU Ming-Zhe ( ), ZHAO Shi-Bo ( ô ),,

More information

Univariate Short-Term Prediction of Road Travel Times

Univariate Short-Term Prediction of Road Travel Times MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com Univariate Short-Term Prediction of Road Travel Times D. Nikovski N. Nishiuma Y. Goto H. Kumazawa TR2005-086 July 2005 Abstract This paper

More information

Improving the travel time prediction by using the real-time floating car data

Improving the travel time prediction by using the real-time floating car data Improving the travel time prediction by using the real-time floating car data Krzysztof Dembczyński Przemys law Gawe l Andrzej Jaszkiewicz Wojciech Kot lowski Adam Szarecki Institute of Computing Science,

More information

FIELD EVALUATION OF SMART SENSOR VEHICLE DETECTORS AT RAILROAD GRADE CROSSINGS VOLUME 4: PERFORMANCE IN ADVERSE WEATHER CONDITIONS

FIELD EVALUATION OF SMART SENSOR VEHICLE DETECTORS AT RAILROAD GRADE CROSSINGS VOLUME 4: PERFORMANCE IN ADVERSE WEATHER CONDITIONS CIVIL ENGINEERING STUDIES Illinois Center for Transportation Series No. 15-002 UILU-ENG-2015-2002 ISSN: 0197-9191 FIELD EVALUATION OF SMART SENSOR VEHICLE DETECTORS AT RAILROAD GRADE CROSSINGS VOLUME 4:

More information

Predicting freeway traffic in the Bay Area

Predicting freeway traffic in the Bay Area Predicting freeway traffic in the Bay Area Jacob Baldwin Email: jtb5np@stanford.edu Chen-Hsuan Sun Email: chsun@stanford.edu Ya-Ting Wang Email: yatingw@stanford.edu Abstract The hourly occupancy rate

More information

Gradient Boosting, Continued

Gradient Boosting, Continued Gradient Boosting, Continued David Rosenberg New York University December 26, 2016 David Rosenberg (New York University) DS-GA 1003 December 26, 2016 1 / 16 Review: Gradient Boosting Review: Gradient Boosting

More information

Classification of Hand-Written Digits Using Scattering Convolutional Network

Classification of Hand-Written Digits Using Scattering Convolutional Network Mid-year Progress Report Classification of Hand-Written Digits Using Scattering Convolutional Network Dongmian Zou Advisor: Professor Radu Balan Co-Advisor: Dr. Maneesh Singh (SRI) Background Overview

More information

day month year documentname/initials 1

day month year documentname/initials 1 ECE471-571 Pattern Recognition Lecture 13 Decision Tree Hairong Qi, Gonzalez Family Professor Electrical Engineering and Computer Science University of Tennessee, Knoxville http://www.eecs.utk.edu/faculty/qi

More information

Numerical Learning Algorithms

Numerical Learning Algorithms Numerical Learning Algorithms Example SVM for Separable Examples.......................... Example SVM for Nonseparable Examples....................... 4 Example Gaussian Kernel SVM...............................

More information

A Proposed Driver Assistance System in Adverse Weather Conditions

A Proposed Driver Assistance System in Adverse Weather Conditions 1 A Proposed Driver Assistance System in Adverse Weather Conditions National Rural ITS Conference Student Paper Competition Second runner-up Ismail Zohdy Ph.D. Student, Department of Civil & Environmental

More information

STATISTICAL ANALYSIS OF LAW ENFORCEMENT SURVEILLANCE IMPACT ON SAMPLE CONSTRUCTION ZONES IN MISSISSIPPI (Part 1: DESCRIPTIVE)

STATISTICAL ANALYSIS OF LAW ENFORCEMENT SURVEILLANCE IMPACT ON SAMPLE CONSTRUCTION ZONES IN MISSISSIPPI (Part 1: DESCRIPTIVE) STATISTICAL ANALYSIS OF LAW ENFORCEMENT SURVEILLANCE IMPACT ON SAMPLE CONSTRUCTION ZONES IN MISSISSIPPI (Part 1: DESCRIPTIVE) Tulio Sulbaran, Ph.D 1, David Marchman 2 Abstract It is estimated that every

More information

Real Time Voltage Control using Genetic Algorithm

Real Time Voltage Control using Genetic Algorithm Real Time Voltage Control using Genetic Algorithm P. Thirusenthil kumaran, C. Kamalakannan Department of EEE, Rajalakshmi Engineering College, Chennai, India Abstract An algorithm for control action selection

More information

Analytical investigation on the minimum traffic delay at a two-phase. intersection considering the dynamical evolution process of queues

Analytical investigation on the minimum traffic delay at a two-phase. intersection considering the dynamical evolution process of queues Analytical investigation on the minimum traffic delay at a two-phase intersection considering the dynamical evolution process of queues Hong-Ze Zhang 1, Rui Jiang 1,2, Mao-Bin Hu 1, Bin Jia 2 1 School

More information

HOPFIELD neural networks (HNNs) are a class of nonlinear

HOPFIELD neural networks (HNNs) are a class of nonlinear IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II: EXPRESS BRIEFS, VOL. 52, NO. 4, APRIL 2005 213 Stochastic Noise Process Enhancement of Hopfield Neural Networks Vladimir Pavlović, Member, IEEE, Dan Schonfeld,

More information

DEVELOPMENT OF ROAD SURFACE TEMPERATURE PREDICTION MODEL

DEVELOPMENT OF ROAD SURFACE TEMPERATURE PREDICTION MODEL International Journal of Civil, Structural, Environmental and Infrastructure Engineering Research and Development (IJCSEIERD) ISSN(P): 2249-6866; ISSN(E): 2249-7978 Vol. 6, Issue 6, Dec 2016, 27-34 TJPRC

More information

Machine learning comes from Bayesian decision theory in statistics. There we want to minimize the expected value of the loss function.

Machine learning comes from Bayesian decision theory in statistics. There we want to minimize the expected value of the loss function. Bayesian learning: Machine learning comes from Bayesian decision theory in statistics. There we want to minimize the expected value of the loss function. Let y be the true label and y be the predicted

More information

FINAL REPORT. City of Toronto. Contract Project No: B

FINAL REPORT. City of Toronto. Contract Project No: B City of Toronto SAFETY IMPACTS AND REGULATIONS OF ELECTRONIC STATIC ROADSIDE ADVERTISING SIGNS TECHNICAL MEMORANDUM #2B BEFORE/AFTER COLLISION ANALYSIS AT MID-BLOCK LOCATIONS FINAL REPORT 3027 Harvester

More information

Available online at ScienceDirect. Procedia Engineering 119 (2015 ) 13 18

Available online at   ScienceDirect. Procedia Engineering 119 (2015 ) 13 18 Available online at www.sciencedirect.com ScienceDirect Procedia Engineering 119 (2015 ) 13 18 13th Computer Control for Water Industry Conference, CCWI 2015 Real-time burst detection in water distribution

More information

Modeling Crash Frequency of Heavy Vehicles in Rural Freeways

Modeling Crash Frequency of Heavy Vehicles in Rural Freeways Journal of Traffic and Logistics Engineering Vol. 4, No. 2, December 2016 Modeling Crash Frequency of Heavy Vehicles in Rural Freeways Reza Imaninasab School of Civil Engineering, Iran University of Science

More information

Individual speed variance in traffic flow: analysis of Bay Area radar measurements

Individual speed variance in traffic flow: analysis of Bay Area radar measurements Individual speed variance in traffic flow: analysis of Bay Area radar measurements Sebastien Blandin 1, Amir Salam, Alex Bayen Submitted For Publication 1 th Annual Meeting of the Transportation Research

More information

arxiv: v1 [cs.cv] 28 Nov 2017

arxiv: v1 [cs.cv] 28 Nov 2017 A fatal point concept and a low-sensitivity quantitative measure for traffic safety analytics arxiv:1711.10131v1 [cs.cv] 28 Nov 2017 Shan Suthaharan Department of Computer Science University of North Carolina

More information

Evaluation of Road Safety in Portugal: A Case Study Analysis. Instituto Superior Técnico

Evaluation of Road Safety in Portugal: A Case Study Analysis. Instituto Superior Técnico Evaluation of Road Safety in Portugal: A Case Study Analysis Ana Fernandes José Neves Instituto Superior Técnico OUTLINE Objectives Methodology Results Road environments Expected number of road accidents

More information

AN ARTIFICIAL NEURAL NETWORK MODEL FOR ROAD ACCIDENT PREDICTION: A CASE STUDY OF KHULNA METROPOLITAN CITY

AN ARTIFICIAL NEURAL NETWORK MODEL FOR ROAD ACCIDENT PREDICTION: A CASE STUDY OF KHULNA METROPOLITAN CITY Proceedings of the 4 th International Conference on Civil Engineering for Sustainable Development (ICCESD 2018), 9~11 February 2018, KUET, Khulna, Bangladesh (ISBN-978-984-34-3502-6) AN ARTIFICIAL NEURAL

More information

Assessing the uncertainty in micro-simulation model outputs

Assessing the uncertainty in micro-simulation model outputs Assessing the uncertainty in micro-simulation model outputs S. Zhu 1 and L. Ferreira 2 1 School of Civil Engineering, Faculty of Engineering, Architecture and Information Technology, The University of

More information

DEVELOPMENT OF CRASH PREDICTION MODEL USING MULTIPLE REGRESSION ANALYSIS Harshit Gupta 1, Dr. Siddhartha Rokade 2 1

DEVELOPMENT OF CRASH PREDICTION MODEL USING MULTIPLE REGRESSION ANALYSIS Harshit Gupta 1, Dr. Siddhartha Rokade 2 1 DEVELOPMENT OF CRASH PREDICTION MODEL USING MULTIPLE REGRESSION ANALYSIS Harshit Gupta 1, Dr. Siddhartha Rokade 2 1 PG Student, 2 Assistant Professor, Department of Civil Engineering, Maulana Azad National

More information

Feature Selection by Genetic Programming, And Artificial Neural Network-based Machine Condition Monitoring

Feature Selection by Genetic Programming, And Artificial Neural Network-based Machine Condition Monitoring Feature Selection by Genetic Programming, And Artificial Neural Network-based Machine Condition Monitoring Pooja Gupta, Dr. Sulochana Wadhwani Abstract This Paper presents the performance of bearing fault

More information

Two-stage Pedestrian Detection Based on Multiple Features and Machine Learning

Two-stage Pedestrian Detection Based on Multiple Features and Machine Learning 38 3 Vol. 38, No. 3 2012 3 ACTA AUTOMATICA SINICA March, 2012 1 1 1, (Adaboost) (Support vector machine, SVM). (Four direction features, FDF) GAB (Gentle Adaboost) (Entropy-histograms of oriented gradients,

More information

Freeway rear-end collision risk for Italian freeways. An extreme value theory approach

Freeway rear-end collision risk for Italian freeways. An extreme value theory approach XXII SIDT National Scientific Seminar Politecnico di Bari 14 15 SETTEMBRE 2017 Freeway rear-end collision risk for Italian freeways. An extreme value theory approach Gregorio Gecchele Federico Orsini University

More information

Real-Time Travel Time Prediction Using Multi-level k-nearest Neighbor Algorithm and Data Fusion Method

Real-Time Travel Time Prediction Using Multi-level k-nearest Neighbor Algorithm and Data Fusion Method 1861 Real-Time Travel Time Prediction Using Multi-level k-nearest Neighbor Algorithm and Data Fusion Method Sehyun Tak 1, Sunghoon Kim 2, Kiate Jang 3 and Hwasoo Yeo 4 1 Smart Transportation System Laboratory,

More information

IT S TIME FOR AN UPDATE EXTREME WAVES AND DIRECTIONAL DISTRIBUTIONS ALONG THE NEW SOUTH WALES COASTLINE

IT S TIME FOR AN UPDATE EXTREME WAVES AND DIRECTIONAL DISTRIBUTIONS ALONG THE NEW SOUTH WALES COASTLINE IT S TIME FOR AN UPDATE EXTREME WAVES AND DIRECTIONAL DISTRIBUTIONS ALONG THE NEW SOUTH WALES COASTLINE M Glatz 1, M Fitzhenry 2, M Kulmar 1 1 Manly Hydraulics Laboratory, Department of Finance, Services

More information

Queuing and Statistical Analysis of Freeway Bottleneck Formation

Queuing and Statistical Analysis of Freeway Bottleneck Formation Queuing and Statistical Analysis of Freeway Bottleneck Formation Shantanu Das 1 and David Levinson 2 Abstract: A modified approach to treat traffic flow parameters (flow, density, and speed) has been introduced

More information