ACCOUNTING FOR SERIAL CORRELATION IN COUNT MODELS OF TRAFFIC SAFETY

Size: px
Start display at page:

Download "ACCOUNTING FOR SERIAL CORRELATION IN COUNT MODELS OF TRAFFIC SAFETY"

Transcription

1 Journal of the Eastern Asa Socety for Transportaton Studes, Vol. 6, pp , 2005 ACCOUNTING FOR SERIAL CORRELATION IN COUNT MODELS OF TRAFFIC SAFETY Sttpan SITTIKARIYA PhD Canddate Department of Cvl and Envronmental Engneerng Pennsylvana State Unversty 104 Transportaton Research Buldng, Unversty Park, PA USA Fax: Emal: Mng-Bang SHYU PhD Canddate Department of Cvl and Envronmental Engneerng Pennsylvana State Unversty 104 Transportaton Research Buldng, Unversty Park, PA USA Fax: Emal: Venky N. SHANKAR, PhD, PE Assocate Professor Department of Cvl and Envronmental Engneerng Pennsylvana State Unversty 226C Sackett Buldng, Unversty Park, PA USA Fax: Emal: Songrt CHAYANAN, PhD DKS Assocates, Inc. 719 Second Avenue, Sute 1250 Seattle, WA 98104, USA Emal: Abstract: Ths research explores the mpact of nflated varances on statstcal sgnfcance and dentfcaton of varables correlated wth medan crossover accdent frequences. In roadway and accdent data that s comprsed of cross-sectonal panels, seral-correlaton effects across tme arse due to shared unobserved effects through random error terms. We use an emprcal technque to adjust for downward bas n standard errors n count models of traffc accdents, so that proper dentfcaton of varables s ensured. The dentfed model structures reflect the mpact of nflated sgnfcance values, and provde specfcatons that nclude truly sgnfcant geometrc, traffc and envronmental effects contrbutng to medan crossover accdents. It should be noted that the results from ths study are localzed and hence lmted to nferences from the medan crossover context n Washngton State n the Unted States. Further study s requred to ensure the transferablty of these fndngs to other contexts. Key Words: Medan crossover accdents, Correlated data, Negatve bnomal, Zero-nflated count models, Negatve multnomal, 1. INTRODUCTION There are some common modelng problems n estmaton of count models. Unobserved heterogenety s a common occurrence n accdent occurrence, leadng to the well-known overdsperson problem (Shankar et al., 1995). The negatve bnomal (NB) model s sutable for overdspersed accdent frequences, as show n several pror works (see for example Poch and Mannerng, 1996; Mlton and Mannerng, 1996). 3645

2 Journal of the Eastern Asa Socety for Transportaton Studes, Vol. 6, pp , 2005 Another common ssue that arses n accdent data contexts s the ssue of seral correlaton. Multple years of cross-sectonal data on hghway accdent occurrences are often avalable from publc domans, ncludng tme seres nformaton on traffc volumes, accdent counts and roadway geometrcs. In addton, weather nformaton s also avalable from natonal databases (see for example Shankar et al., 2004.) In the presence of seral correlaton, the effcency of parameter estmates comes nto queston. Smlar to the classcal lnear regresson model, one can expect parameter estmates to be neffcent n the presence of seral correlaton n count models (Greene, 2000.) A method to adjust for repeated observaton effects on parameter estmates s necessary to adjust for seral correlaton effects. One such method relates to the use of the negatve multnomal (NM) model (Ulfarsson and Shankar, 2003.) In that model, a jont lkelhood based on repeated observatons s constructed to modfy the tradtonal negatve bnomal lkelhood. As a result, parameter estmates reflect an adjustment n ther standard errors, wth much of the adjustment resultng from proxmate years. The second type of adjustment for seral correlaton nvolves the development of a random effects or fxed effects specfcaton, where segment-specfc correlaton s accounted for. In the traffc accdent context, the random effects approach has been shown to be reasonable (See for example, Shankar et al., 1998). The thrd common ssue that arses n traffc accdent contexts s one that pertans to partal observablty and the resultng problem of excess zero counts. For example, especally n events of potentally hgh severty such medan crossover accdents, the problem of excess zeros s a sgnfcant one. A hghway segment wth hstorcally excess zeros may appear to be a safe locaton, but there s no guarantee that no medan crossover accdents wll occur n the future. As nteractons between traffc volumes and geometrcs, and geometrcs and weather condtons exacerbate accdent proneness, one would expect that over tme, wth ncrease n traffc volumes and adverse weather condtons, non-zero count probabltes would be expected to rse. Lmted hstory of observaton may not adequately capture the accdent proneness of the hghway segment, as nteractons ncrease over tme. It s the combnaton of the second and thrd common problems, namely, seral correlaton and excess zeros, whch we address n ths paper. We choose the emprcal context of medan crossover accdent settngs on the hghway network n Washngton State n the USA. Overdsperson, the frst problem dscussed s also mplctly dscussed n ths study; although n ths partcular case, overdsperson arses prmarly through the occurrence of excess zeros, and not necessarly through hgh-valued counts that are best modeled by a negatve bnomal dstrbuton. The rest of ths paper s organzed as follows. Model prelmnares are presented frst, where a bref dscusson of the negatve multnomal s presented, along wth the preferred model of excess zeros, namely, the zero-nflated Posson model for medan crossover accdents. The emprcal settng s then dscussed, followed by a dscusson of model results, along wth adjustments of standard errors for the zero-nflated model, due to seral correlaton. Fnally, conclusons and recommendatons are offered. 2. MODELING PRELIMINARIES 2.1 The Negatve Multnomal Model for Seral Correlaton n Accdent Counts The negatve multnomal (NM) model s suggested as a proper estmator when seral correlatons are present n the dataset (Ulfarsson and Shankar, 2003). The dervaton of the 3646

3 Journal of the Eastern Asa Socety for Transportaton Studes, Vol. 6, pp , 2005 NM model begns wth the Posson probablty densty functon, whch s an expresson for the probablty of the frequency equalng a partcular count (Guo, 1996): P(Y t λt yt e λ t = yt ) = ; y t = 0,1,2,..., (1) y! t where Yt s the observed frequency of medan crossover accdents n secton at tme t, and λ t s the mean of the number of medan crossover accdents. The Posson model s estmated by defnng: where xt ln λ = x β, (2) t s a vector of secton- and tme-specfc explanatory varables; and β s a vector of coeffcents to be estmated. To account for the secton-specfc varaton, we proceed as s done for the NB model. A random error term s added to the expresson for the mean, t ln λ +, (3) t = xt β ε where ε s a secton-specfc (not observaton-specfc as n the NB model) random error term, exp( ε ) s assumed to be ndependently and dentcally dstrbuted gamma wth mean 1 and varance α = 1/ θ. The assumpton of mean 1 does not cause loss of generalty f (3) ncludes an ntercept term. The condtonal jont densty functon of all ndvdual counts for a partcular secton, gven that the ndvdual counts are dstrbuted by (1) and condtoned on ε, can now be wrtten as: t P( Y 1 = y 1,..., Yt = yt ε ) = P( Yt' = yt' ε ), (4) where t denotes the number of tme perods observed for secton. Ths assumes the accdent counts n dfferent sectons are ndependent. Ths s not unreasonable because these sectons are generally not next to each other and wll therefore only share mnmal unobserved effects. The uncondtonal jont densty functon for the negatve multnomal dstrbuton can now be derved by ntegratng (4) and by usng the assumed dstrbuton of exp( ε ) to gve: t' = 1 P( Y Γ( y + θ ) Γ( θ ) y!... y 1 t 1 t y Yt y 1 = 1,..., = t ) =..., (5) 1 t θ θ η! η + θ η + θ y η η + θ y x β where Γ s the gamma functon, ηt = e t, η = η ηt, and y = y yt. Recall that the varance of exp( ε ) s α = 1/ θ. The degenerate case, when each secton has only one observaton, (.e. there s no secton-specfc correlaton) yelds the negatve bnomal dstrbuton. The expected value, the varance and covarance for the NM model are: E( Yt t ) = η, Var( Yt ) = E( Yt )[1 + α E( Yt )], Cov( Y, ) t Y t ' = αηtηt'. (6) 3647

4 Journal of the Eastern Asa Socety for Transportaton Studes, Vol. 6, pp , 2005 A lkelhood functon s wrtten usng (5) to gve (7): L( β, θ ) = n = 1 θ 1 Γ( y + θ ) θ η η 1 t... Γ( θ ) η + θ η + θ η + θ y yt, (7) where n s the total number of sectons. The estmable coeffcents β and α = 1/ θ are estmated by maxmzng the lkelhood functon (7), (Green, 2003). In medan crossover accdent contexts, t s mportant to keep n mnd that there are the possbltes of near-crossover cases n the roadway secton. For example, as medan wdth ncreases, the probablty that a potental crossover wll occur decreases, but may be enhanced wth ncreasng volumes and nteractons wth adverse weather condtons over tme. A nearcrossover roadsde encroachment usually takes the form of a rollover, fxed object or other smlar types of roadsde accdents that do not result n the vehcle crossng the medan and enterng opposng traffc. The segment hence may be characterzed as operatng n multple states. For smplcty, we can consder two states, ncludng a zero crossover state and a nonzero crossover state. Dependng on how nteractons vary over tme, the segment may swtch to a non-zero state. In such a case, a process that adequately captures the probablty of the two states s necessary. The tradtonal count models, such as the Posson or negatve bnomal, only address the non-zero state, thus gnorng the potental shft of excess zeros nto non-zero counts. A plausble model presented below to address the excess zero problem s the zero-nflated Posson or the zero-nflated negatve bnomal model. 2.2 Zero Inflated Posson Model of Medan Crossover Accdents Let Z represents the zero-crash count state of the medan crossover accdent ste, and Y * denote the crash count state for that roadway secton. Nether Z nor Y * s observed, but only the observed medan crossover count Y s, such that Y=Z*Y *. Determnng the latent components can then be vewed as a mxng dstrbuton problem, wth Z beng modeled as a dchotomous outcome probablty and Y * modeled as a count probablty. In vehcular accdent contexts, such dstrbutons have been found to be approprate (Shankar et al., 1997). In partcular these studes have hghlghted the mportance of roadway desgn devatons as a motvator for partal observablty effects. The effect of such devatons has been found to, at the least, cause partal observablty, and n certan desgn stuatons, overdsperson as well. In the medan crossover accdent context, desgn devatons are a sgnfcant ssue. To hghlght ths ssue, a bref dscusson of medan barrer warrants s necessary. Medan barrer warrants n Washngton State, and for that matter n the entre Unted States, are based on two smplfyng factors, namely, average daly traffc (ADT) and medan wdth. No account of nteractons between traffc volumes and geometrcs, or weather and geometrcs s consdered n the decson to nstall a medan barrer. Hence, potental nteractons arsng due to the presence of devatons from preferred geometrc desgn are gnored, leadng to the potental for the excess zero medan crossover problem. To formally address the excess zero problem, let Y be the annual number of accdent counts reported for secton, and let be the probablty that secton wll exst n the zero-count p state over ts lfetme. Thus 1- p s the probablty that secton wll operate n the non-zero accdent state. It s to be noted that n the non-zero count state, the probablty of zero 3648

5 Journal of the Eastern Asa Socety for Transportaton Studes, Vol. 6, pp , 2005 accdent counts stll exsts. For our mmedate purposes, we assume that ths count state follows a Posson dstrbuton. Gven ths, and, = 0 λ Y wth probablty ( ) p + 1 p e, (8) λ k e λ Y = k wth probablty (1 p )( ), (9) k! In (8) and (9), the probablty of beng n the zero-accdent state p s formulated as a logstc p dstrbuton such that log = G γ and λ s defned by log( λ ) = H β, where G and H 1 p are covarate vectors, and γ and β are coeffcent vectors. The covarates that affect the mean λ of the Posson state may or may not be the same as the covarates that affect the zeroaccdent state probablty (.e., p ). Alternatvely, vectors G and H may be related to each other by a sngle parameter τ. Ths s essentally a parametrc constrant n the sense that the explanatory varables are forced to be the same n both the zero and count states. In such a case, a natural parameterzaton s log(p /1 p ) = τ H β. It s useful n the accdent context to begn wth unconstraned parameter vectors γ and β. Ths allows dfferent effects to be correlated wth the zero state and count state respectvely. For example, n the medan crossover accdent context, t may be useful to consder the mpact of medan wdths alone n the specfcaton of the zero-state probablty functon, whle ncludng desgn, traffc and envronmental nteractons as well n the count state functon. Equatons (8) and (9) combned provde the zero-nflated Posson (ZIP) model. We refer to a model of unconstraned parametrc vectors such as the one dscussed above s referred to as the ZIP-Full model. The maxmum lkelhood estmaton usng the gradent/lne search approach proposed by Green (1996) s performed to estmate the β and γ. A lkelhood functon s gven by (10): L(β, γ,) = n = 1 (1 p λ yt e λ ) y! + Z p, (10) where Z = 1 when Y = s observed to be zero and 0 otherwse. An alternate method to estmate ZIP parameters s to employ the expectatons maxmzaton technque. Greene (1996) argues that the method shown n (10) s relable. 2.3 Statstcal Valdaton of the ZIP Medan Crossover Model In statstcally valdatng the ZIP model, one has to dstngush the base count model (such as the Posson model for medan crossover counts) from the zero-nflated probablty model (such as the ZIP). The reasonng behnd ths test s that the Posson model does not adequately capture the entre zero densty, and that the ZIP structure s more relable. A statstcal test for ths has been proposed by Vuong n The Vuong test s a t-statstcbased test wth reasonable power n count-data applcatons (Green, 1994). The Vuong statstc (V-statstc) s computed as: 3649

6 Journal of the Eastern Asa Socety for Transportaton Studes, Vol. 6, pp , 2005 V = m N, (11) S m where m s the mean wth m = log [f 1 (.) /f 2 (.)], (wth f 1 (.) beng the densty functon of the ZIP dstrbuton and f 2 (.) s the densty functon of the parent-posson dstrbuton), and S m and N are the standard devaton and sample sze respectvely. The advantage of usng the Vuong test s that the entre dstrbuton s used for comparson of the means, as opposed to just the excess zero mass. A value greater than 1.96 (the 95 percent confdence level for the t- test) for the V-statstc favors the ZIP-Full model whle a value less than favors the parent-posson (values n between 1.96 and mean that the test s ndecsve). The ntutve reasonng behnd ths test s that f the processes are statstcally not dfferent, the mean rato of ther denstes should equal one. To carry out the test, both the parent and zeronflated dstrbutons need to be estmated and tested usng a t-statstc. Studes (Greene 1994) have shown that a Vuong statstc has reasonable power and hence s qute relable. Greene ndcates that a sgnfcant Vuong statstc n favor the ZIP model wll also favor the ZIP model over the negatve bnomal model, whch would otherwse be the default model for overdsperson from excess zeros. 2.4 The Relevance of the Negatve Multnomal to Medan Crossover Accdents It s mportant to keep n mnd that the estmated parameters from the ZIP-Full model would be effcent f seral correlaton dd not exst. Such s not the case n medan crossover contexts where multple years of observatons are avalable. In fact, as n the classcal lnear regresson model, one would expect standard errors to be downward based f seral correlaton s not accounted for. To correct the varances n a dual-state dstrbuton (such as the ZIP) wth temporal correlaton, the negatve multnomal model offers an ntutve approach. We showed that the negatve multnomal captures seral correlaton across years wthout basng the parameters of the negatve bnomal dstrbuton. Emprcally, ths characterstc s of some use n our current ZIP-Full context. We can benchmark standard errors from a negatve multnomal model aganst a naïve negatve bnomal model frst, when repeated observatons are the prmary source of seral correlaton. The pont of mportance to note here s that the negatve multnomal model adjusts the standard errors upward to account for seral correlaton. We call ths upward adjustment the loadng factor. In other words, the standard errors of the negatve bnomal model are factored up by loadng factors derved from the negatve multnomal model. As an example, consder the vectors of parameters estmated as β by a negatve bnomal model. We assume here that β has dmensonalty k. Consder the negatve multnomal model vector of parameters β ~ also of dmensonalty k, wth exactly the same set of regressors as those used for β. Then, the ~ loadng factor for β s a vector µ of wth dmensonalty k, where µ k = s.e.( βk )/s.e.(β k ). The loadng factor for any gven parameter β k vares dependng on the effect of seral correlaton n the varance-covarance matrx. The loadng factors are then used to adjust upward the standard errors obtaned for a naïve ZIP-Full model. The adjusted standard errors are then used to determne parameter sgnfcance under seral correlaton. To llustrate the development of loadng factors for standard errors emprcally, we present fndngs from a negatve bnomal/negatve multnomal analyss of medan crossover accdents n Washngton State. Pror to presentng these fndngs, we dscuss the emprcal settng used for the development of the negatve bnomal and negatve multnomal models. 3650

7 Journal of the Eastern Asa Socety for Transportaton Studes, Vol. 6, pp , EMPIRICAL SETTING OF MEDIAN CROSSOVER ACCIDENTS IN WASHINGTON STATE In ths dscusson, nformaton regardng the data, framework, and crtera for model analyss and evaluatons s presented. The Washngton State hghway system contans over 7,000 centerlne mles of state hghways. The annual records n the database developed by the Washngton State Department of Transportaton (WSDOT) contan almost every roadway and roadsde ncdent that occurs durng the year. The accdent records from ths database are the prmary source of data for the current medan crossover accdents study. The panel data n ths research conssts of fve years ( nclusve) of annual accdent counts for 275 roadway sectons n Washngton State. The panel s balanced, wth all sectons havng a full fve-year hstory. Ths panel represents all sectons (longer than 2,624 ft) wthout medan barrers on dvded state hghways. The reasons why only sectons longer than 2,624 ft are selected are that about 95% of shorter sectons on dvded hghways have barrers, and that the shorter sectons are more affected by access controls and ntersectons (Ulfarsson and Shankar, 2003). In addton to medan crossover accdent nformaton, other components of data extracted from the database ncluded roadway geometrcs, medan characterstcs and traffc volumes. The database dd not provde any weather nformaton requred for the study. The GIS program ArcVew 3.2 was used to match the roadway sectons to ther weather attrbutes stored n the hstorcal weather database provded by the Western Regonal Clmate Center. The mappng crtera nvolved lnkng the non-medan barrer roadway sectons to the nearest correspondng weather statons. Each weather staton provded clmate data ncludng daly, monthly and annual measurements of temperature, precptaton and snowfall ncludng snow depth, wth the records datng back to The selected roadway sectons and all weather statons n Washngton State are llustrated n fgure 1. Table 1 provdes descrptve statstcs of key varables n the medan crossover accdent dataset. * Blue (darker color) = The Selected Roadway Sectons, ** Green (lghter color) = The Weather Statons Fgure 1. Selected Roadway Sectons and Weather Statons n Washngton State 3651

8 Journal of the Eastern Asa Socety for Transportaton Studes, Vol. 6, pp , 2005 Table 1. Descrptve Statstcs of Key Medan Crossover Accdent Related Varables Varable Mean Std. Dev. Mn Max Number of crossover accdents n secton AADT 37,355 36,975 3, ,560 AADT per lane 7,445 5, ,690 Sngle truck percentage Double truck percentage Truck-tran percentage Percentage of AADT n the peak hour Speed lmt Maxmum medan shoulder wdth n feet Mnmum medan shoulder wdth n feet Percent medans narrower than 40 feet Percent medans between 40 feet and 50 feet n wdth Percent medans between 50 feet and 60 feet n wdth Percent medans wder than 60 feet n wdth Percent medans that are paved Length of the roadway secton n mles The number of nterchanges n secton The number of horzontal curves n secton The number of horzontal curves per mle Maxmum horzontal central angle n degrees Mnmum radus of horzontal curve n feet ,400 Average annual snow depth n nches * Average annual precptaton n nches Number of grade changes Average roadway wdth * Weather staton data for mountanous secton 3.1. Development of Loadng Factors for Standard Error Adjustment Table 2 presents a comparatve analyss of standard errors for sgnfcant parameters n the negatve bnomal and negatve multnomal models of medan crossover accdents. It s to be noted here that only sgnfcant effects appearng n the negatve multnomal model are used to develop the loadng factors. It may be that varables sgnfcant n the negatve bnomal may not be sgnfcant n the negatve multnomal due to nflaton n the standard error. As s noted n table 2, the loadng factor vares by parameter, from a mnmum adjustment of for the precptaton-horzontal curves nteracton varable to for the length varable for medans narrower than or equal to 40 feet. The standard error adjustment for the alpha parameter (overdsperson effect) s gnored n our dscusson, snce the alpha parameter does not appear n the ZIP model of medan crossover accdents. The ZIP model s dscussed next, consderng the loadng factors developed n table

9 Journal of the Eastern Asa Socety for Transportaton Studes, Vol. 6, pp , 2005 Table 2. Load Factors Developed from NB and NM Models for Adjustng Standard Errors n Zero-Inflated Posson Model of Medan Crossover Accdents Negatve Bnomal Negatve Multnomal Varable Model Model Load Estmated Standard. Estmated Standard Factor * Coeffcent Error Coeffcent Error Constant Per-lane AADT ndcator (1 f perlane AADT <= 5000 vehcles, 0 otherwse) Length of the roadway secton on medan wdths less than or equal to 40 feet Length of the roadway secton on medan wdths greater than or equal to 41 feet and less than or equal to 60 feet Length of the roadway secton on medan wdths greater than 60 feet The number of nterchanges n the secton Interacton 1 between average monthly precptaton ndcator and the number of horzontal curves per mle ndcator (1 f average monthly precptaton <= 1.5 nches and the number of horzontal curves per mle <= 0.5, 0 otherwse) Interacton 2 between average monthly precptaton ndcator and the number of horzontal curves per mle ndcator (1 f average monthly precptaton > 4.0 nches and the number of horzontal curves per mle > 0.5, 0 otherwse) α N.A. θ N.A. Restrcted log-lkelhood (All -1, , parameters = 0, α = 0) Log-lkelhood at convergence Adjusted ρ Number of observatons 1,375 *** * The load factor was the proporton of the S.E. of NM to the S.E. of NB ** θ = 1/α *** 5 years of data for 275 separate non-medan barrer sectons 3653

10 Journal of the Eastern Asa Socety for Transportaton Studes, Vol. 6, pp , AN EMPIRICALLY ADJUSTED SERIAL CORRELATION ZERO INFLATED POISSON MODEL OF MEDIAN CROSSOVER ACCIDENTS A ZIP-Full model of medan crossover accdents was estmated and loadng factors from table 2 were used to adjust for seral correlaton n the fve-year medan crossover dataset. Medan crossover accdent counts were the dependent varable n ths estmaton. The ZIP-Full model was estmated and valdated by the Vuong statstc. The result revealed a Vuong statstc of suggestng that the ZIP-Full as the favorable model compared to the basc Posson model. In estmatng the sgnfcance of key varables n the ZIP-Full model, standard errors estmated usng the lkelhood functon shown n equaton 10 were adjusted through multplcaton wth the loadng factors developed n table 2. By so dong, we emprcally account for seral correlaton present n the dataset. Table 3 shows the emprcally adjusted seral correlaton ZIP-Full model of medan crossover accdents. The results show that before accountng for seral correlaton, all the varables ncluded n the ZIP-Full model were hghly sgnfcant and thus have a hgh level of confdence (exceedng 95%). However, after accountng for seral correlaton, the level of sgnfcance for all varables decreased. The length varable for medans wder than 60 feet had the lowest t-statstc of It s also noted that the parameters n non-zero accdent state always negatvely correlate wth the same parameters n zero accdent state (e.g. opposte sgns). For example, the length varable for medans narrower than 40 feet postvely correlates wth medan crossover accdents, whle t negatvely correlates wth the probablty of a secton beng n a zero count state n ts lfetme. In the non-zero count state, factors postvely correlatng wth medan crossover accdents ncluded the nteracton varables between length and three categores of medan wdths,, as well as the number of nterchanges n the secton. The three categores of the medan wdth nteracted wth the length of the secton were 1) less than or equal to 40-foot medan wdth, 2) between 40 to 60-foot medan wdth, and 3) greater than 60-foot medan wdth. Dfferent ranges of the medan wdth were expermented but the result showed that these three categores, when nterrelated wth the secton length, had the greatest mpact on the crossover accdent lkelhood. The magntudes of the coeffcents of the length varables suggest that all for a gven length, the lkelhood of medan crossover accdents ncreases the greatest on sectons wth medan wdths between 40 and 60 feet wde n comparson to the other wdth categores. On the other hand, sectons wth medan wdth wder than 60 feet have the least contrbuton on the lkelhood of medan crossover frequency. In fact, n the presence of seral correlaton, ther mpact s statstcally margnal. Three factors negatvely correlate wth medan crossover accdent frequences. These ncluded the ndcator varable f average annual daly traffc was less than 5,000, the nteracton varable between the number of horzontal curves less than or equal to 0.5 per mle n the secton and the average monthly precptaton beng less than or equal 1.5 nches, and the nteracton varable between the number of horzontal curves greater than 0.5 per mle n the secton and the average monthly precptaton beng greater than 4 nches. 3654

11 Journal of the Eastern Asa Socety for Transportaton Studes, Vol. 6, pp , 2005 Table 3. Zero-Inflated Posson Model of Medan Crossover Accdents wth Seral Correlaton Adjusted Standard Errors Varables Zero-Inflated Posson Model Adjusted Adjusted Load Estmated Standard. T- Standard T- Factor Coeffcent Error statstc Error Statstc Non-zero accdent Posson probablty state Constant Per-lane AADT ndcator (1 f per-lane AADT <= 5000 vehcles, 0 otherwse) Length of secton where medans are less than 40 feet wde Length of secton where medans are between 40 feet and 60 feet wde Length of secton where medans are wder than 60 feet Number of nterchanges n secton Interacton 1 between average monthly precptaton ndcator and the number of horzontal curves per mle ndcator (1 f average monthly precptaton <= 1.5 nches and the number of horzontal curves per mle <= 0.5, 0 otherwse) Interacton 2 between average monthly precptaton ndcator and the number of horzontal curves per mle ndcator (1 f average monthly precptaton > 4.0 nches and the number of horzontal curves per mle > 0.5, 0 otherwse) Zero accdent probablty state as logstc functon Constant Length of secton where medans are less than 40 feet wde Length of secton where medans are between 40 feet and 60 feet wde Length of secton where medans are wder than 60 feet Restrcted log-lkelhood * (constant only) Log-lkelhood at convergence Vuong statstc * Ths restrcted log-lkelhood was obtaned from the restrcted log-lkelhood computed by the Posson model. 3655

12 Journal of the Eastern Asa Socety for Transportaton Studes, Vol. 6, pp , 2005 The weather effect appearng n the form of the nteracton varables played a sgnfcant role n medan crossover lkelhood. In a secton where the average number of horzontal curves was less than or equal to 0.5 per mle, the medan crossover counts were expected to decrease f the average monthly precptaton was less than or equal to 1.5 nches. Crossover accdent frequency also decreases when average monthly precptaton exceeds 4 nches on sectons wth greater than 0.5 horzontal curves per mle. Both effects pont to the range of nteractons between precptaton and horzontal curves on medan crossover frequences. As a fnal thought, t should be noted that the mpact of medan wdths on the probablty of a medan crossover accdent occurrng n a secton s lfetme (zero versus non-zero state) follows a trend that s dfferent from ther mpacts on postve crossover frequences (the Posson state.) In the zero state, for a gven length, medan wdths n excess of 60 feet have the hghest odds rato of beng n a non-zero accdent state, whle the count state effects suggests they wll have the least number of postve crossover counts. 5. CONCLUSIONS AND RECOMMENDATIONS Ths paper presents an emprcal technque to adjust for seral correlaton effects on standard errors n medan crossover accdent models. Smlar to the classcal lnear regresson model, standard errors n count models are downward based as observed n emprcal analyses on medan crossover accdents n ths study. To develop a technque for adjustng standard errors, we used the negatve multnomal specfcaton to account for correlaton across tme. Usng the rato of standard errors from the NM model to the nave NB model, we developed loadng factors whch represented the level of nflaton n standard errors requred to account for seral correlaton. We then developed a ZIP-Full model of medan crossover accdent counts to account for the presence of excess zeros n the dataset. Ths excess zero s unque to medan crossover accdents, especally due to the fact that only fve years of observaton are avalable. Usng the loadng factors developed on the bass of the NM model, we adjusted the standard errors of the ZIP-Full model to dentfy key factors affectng medan crossover accdent frequences. It s our belef that the emprcally standard errors n the ZIP-Full model are closer to effcent estmates that would be theoretcally derved. An obvous recommendaton from ths study s to compare theoretcally derved standard errors of a ZIP- Full model wth emprcal fndngs from ths study. Such a study would shed sgnfcant lght on the mpact of seral correlaton n a varety of count contexts. To summarze our emprcal fndngs, we conclude that the ZIP-Full model s the most approprate model among zero-altered probablty processes for predctng the number of medan crossover accdents. The man effects such as average daly traffc, and the number of nterchanges were nvestgated to have statstcally sgnfcant mpact on the frequency of medan crossover accdents. The nteracton varables of length of the roadway secton and the medan wdth as well as the nteracton between monthly precptaton and the number of horzontal curves per mle are also presented as mportant contrbutors to medan crossover lkelhoods. REFERENCES Chayanan, S., Shankar, V.N., Sttkarya, S., Ulfarsson, G.F., Shyu, M.B., and Juvva, N.K. (2004) Medan crossover accdent analyses and the effectveness of medan barrers. Fnal 3656

13 Journal of the Eastern Asa Socety for Transportaton Studes, Vol. 6, pp , 2005 Research Report, WA-RD 580.1, Washngton State Department of Transportaton, Washngton. ESRI (2002) ArcVew GIS Verson 3.2, Envronmental Systems Research Insttute, Inc., Calforna. Greene, W.H. (1994) Accountng for excess zeros and sample selecton n Posson and negatve bnomal regresson models. Workng papers EC-94-10, Stern School of Busness, New York Unversty, New York. Greene, W.H. (2004) LIMDEP, Verson 8.0, User s Manual. Econometrc Software Inc., New York. Greene, W.H. (2003) Econometrc Analyss. Ffth Edton, Prentce Hall Inc., New Jersey. Guo, G. (1996) Negatve multnomal regresson models for clustered event counts, Socologcal Methodology, Vol. 26, Mlton, J.C. and Mannerng, F.L., (1996) The relatonshp between hghway geometrcs, traffc related elements and motor vehcle accdent. Fnal research report, WA-RD 403.1, Washngton State Department of Transportaton, Washngton. Poch, M. and Mannerng, F.L. (1996) Negatve bnomal analyss of ntersecton-accdent frequences, Journal of Transportaton Engneerng, Vol. 122, No. 3, Shankar, V.N., Albn, R.B., Mlton, J.C. and Mannerng, F.L. (1998) Evaluatng medan cross-over lkelhoods wth clustered accdent counts: an emprcal nqury usng the random effects negatve bnomal model, Transportaton Research Record, Vol. 1635, Shankar, V.N., Chayanan, S., Sttkarya, S., Shyu, M.B., Juvva, N.K. and Mlton J.C. (2004) The margnal mpacts of desgn, traffc, weather, and related nteractons on roadsde crashes, Transportaton Research Record, Vol. 1897, Shankar, V.N., Mannerng, F.L. and Barfeld, W. (1995) Effect of roadway geometrcs and envronmental condtons on rural accdent frequences, Accdent Analyss and Preventon, Vol. 27, No.3, Shankar, V.N., Mlton, J.C. and Mannerng, F.L. (1997) Modelng statewde accdent frequences as zero-altered probablty processes: an emprcal nqury, Accdent Analyss and Preventon, Vol. 29, No.6, Ulfarsson, G. F. and Shankar V.N. (2003) An accdent count model based on mult-year cross-sectonal roadway data wth seral correlaton, Transportaton Research Record, Vol. 1840, Voung, Q. (1989) Lkelhood rato tests for model selecton and non-nested hypotheses, Econometrca, Vol. 57, No. 2, Western Regonal Clmate Center (2002) Hstorcal Clmate Informaton, Desert Research Insttute, Retreved from

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition)

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition) Count Data Models See Book Chapter 11 2 nd Edton (Chapter 10 1 st Edton) Count data consst of non-negatve nteger values Examples: number of drver route changes per week, the number of trp departure changes

More information

Negative Binomial Regression

Negative Binomial Regression STATGRAPHICS Rev. 9/16/2013 Negatve Bnomal Regresson Summary... 1 Data Input... 3 Statstcal Model... 3 Analyss Summary... 4 Analyss Optons... 7 Plot of Ftted Model... 8 Observed Versus Predcted... 10 Predctons...

More information

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4)

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4) I. Classcal Assumptons Econ7 Appled Econometrcs Topc 3: Classcal Model (Studenmund, Chapter 4) We have defned OLS and studed some algebrac propertes of OLS. In ths topc we wll study statstcal propertes

More information

Comparison of Regression Lines

Comparison of Regression Lines STATGRAPHICS Rev. 9/13/2013 Comparson of Regresson Lnes Summary... 1 Data Input... 3 Analyss Summary... 4 Plot of Ftted Model... 6 Condtonal Sums of Squares... 6 Analyss Optons... 7 Forecasts... 8 Confdence

More information

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity LINEAR REGRESSION ANALYSIS MODULE IX Lecture - 30 Multcollnearty Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur 2 Remedes for multcollnearty Varous technques have

More information

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction ECONOMICS 5* -- NOTE (Summary) ECON 5* -- NOTE The Multple Classcal Lnear Regresson Model (CLRM): Specfcaton and Assumptons. Introducton CLRM stands for the Classcal Lnear Regresson Model. The CLRM s also

More information

Basically, if you have a dummy dependent variable you will be estimating a probability.

Basically, if you have a dummy dependent variable you will be estimating a probability. ECON 497: Lecture Notes 13 Page 1 of 1 Metropoltan State Unversty ECON 497: Research and Forecastng Lecture Notes 13 Dummy Dependent Varable Technques Studenmund Chapter 13 Bascally, f you have a dummy

More information

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X Statstcs 1: Probablty Theory II 37 3 EPECTATION OF SEVERAL RANDOM VARIABLES As n Probablty Theory I, the nterest n most stuatons les not on the actual dstrbuton of a random vector, but rather on a number

More information

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers Psychology 282 Lecture #24 Outlne Regresson Dagnostcs: Outlers In an earler lecture we studed the statstcal assumptons underlyng the regresson model, ncludng the followng ponts: Formal statement of assumptons.

More information

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore Sesson Outlne Introducton to classfcaton problems and dscrete choce models. Introducton to Logstcs Regresson. Logstc functon and Logt functon. Maxmum Lkelhood Estmator (MLE) for estmaton of LR parameters.

More information

2016 Wiley. Study Session 2: Ethical and Professional Standards Application

2016 Wiley. Study Session 2: Ethical and Professional Standards Application 6 Wley Study Sesson : Ethcal and Professonal Standards Applcaton LESSON : CORRECTION ANALYSIS Readng 9: Correlaton and Regresson LOS 9a: Calculate and nterpret a sample covarance and a sample correlaton

More information

Computation of Higher Order Moments from Two Multinomial Overdispersion Likelihood Models

Computation of Higher Order Moments from Two Multinomial Overdispersion Likelihood Models Computaton of Hgher Order Moments from Two Multnomal Overdsperson Lkelhood Models BY J. T. NEWCOMER, N. K. NEERCHAL Department of Mathematcs and Statstcs, Unversty of Maryland, Baltmore County, Baltmore,

More information

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation Statstcs for Managers Usng Mcrosoft Excel/SPSS Chapter 13 The Smple Lnear Regresson Model and Correlaton 1999 Prentce-Hall, Inc. Chap. 13-1 Chapter Topcs Types of Regresson Models Determnng the Smple Lnear

More information

Chapter 13: Multiple Regression

Chapter 13: Multiple Regression Chapter 13: Multple Regresson 13.1 Developng the multple-regresson Model The general model can be descrbed as: It smplfes for two ndependent varables: The sample ft parameter b 0, b 1, and b are used to

More information

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics ECOOMICS 35*-A Md-Term Exam -- Fall Term 000 Page of 3 pages QUEE'S UIVERSITY AT KIGSTO Department of Economcs ECOOMICS 35* - Secton A Introductory Econometrcs Fall Term 000 MID-TERM EAM ASWERS MG Abbott

More information

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analyss of Varance and Desgn of Exerments-I MODULE III LECTURE - 2 EXPERIMENTAL DESIGN MODELS Dr. Shalabh Deartment of Mathematcs and Statstcs Indan Insttute of Technology Kanur 2 We consder the models

More information

BIO Lab 2: TWO-LEVEL NORMAL MODELS with school children popularity data

BIO Lab 2: TWO-LEVEL NORMAL MODELS with school children popularity data Lab : TWO-LEVEL NORMAL MODELS wth school chldren popularty data Purpose: Introduce basc two-level models for normally dstrbuted responses usng STATA. In partcular, we dscuss Random ntercept models wthout

More information

Maximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models

Maximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models ECO 452 -- OE 4: Probt and Logt Models ECO 452 -- OE 4 Maxmum Lkelhood Estmaton of Bnary Dependent Varables Models: Probt and Logt hs note demonstrates how to formulate bnary dependent varables models

More information

THE ROYAL STATISTICAL SOCIETY 2006 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE

THE ROYAL STATISTICAL SOCIETY 2006 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE THE ROYAL STATISTICAL SOCIETY 6 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE PAPER I STATISTICAL THEORY The Socety provdes these solutons to assst canddates preparng for the eamnatons n future years and for

More information

x = , so that calculated

x = , so that calculated Stat 4, secton Sngle Factor ANOVA notes by Tm Plachowsk n chapter 8 we conducted hypothess tests n whch we compared a sngle sample s mean or proporton to some hypotheszed value Chapter 9 expanded ths to

More information

Analyzing Different Functional Forms of the Varying Weight Parameter for Finite Mixture of Negative Binomial Regression Models

Analyzing Different Functional Forms of the Varying Weight Parameter for Finite Mixture of Negative Binomial Regression Models Analyzng Dfferent Functonal Forms of the Varyng Weght Parameter for Fnte Mxture of Negatve Bnomal Regresson Models By Yae Zou* Ph.D. Canddate, Zachry Department of Cvl Engneerng Texas A&M Unversty, 336

More information

Uncertainty as the Overlap of Alternate Conditional Distributions

Uncertainty as the Overlap of Alternate Conditional Distributions Uncertanty as the Overlap of Alternate Condtonal Dstrbutons Olena Babak and Clayton V. Deutsch Centre for Computatonal Geostatstcs Department of Cvl & Envronmental Engneerng Unversty of Alberta An mportant

More information

Turbulence classification of load data by the frequency and severity of wind gusts. Oscar Moñux, DEWI GmbH Kevin Bleibler, DEWI GmbH

Turbulence classification of load data by the frequency and severity of wind gusts. Oscar Moñux, DEWI GmbH Kevin Bleibler, DEWI GmbH Turbulence classfcaton of load data by the frequency and severty of wnd gusts Introducton Oscar Moñux, DEWI GmbH Kevn Blebler, DEWI GmbH Durng the wnd turbne developng process, one of the most mportant

More information

Statistics for Economics & Business

Statistics for Economics & Business Statstcs for Economcs & Busness Smple Lnear Regresson Learnng Objectves In ths chapter, you learn: How to use regresson analyss to predct the value of a dependent varable based on an ndependent varable

More information

Lecture 6: Introduction to Linear Regression

Lecture 6: Introduction to Linear Regression Lecture 6: Introducton to Lnear Regresson An Manchakul amancha@jhsph.edu 24 Aprl 27 Lnear regresson: man dea Lnear regresson can be used to study an outcome as a lnear functon of a predctor Example: 6

More information

Module 3 LOSSY IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur

Module 3 LOSSY IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur Module 3 LOSSY IMAGE COMPRESSION SYSTEMS Verson ECE IIT, Kharagpur Lesson 6 Theory of Quantzaton Verson ECE IIT, Kharagpur Instructonal Objectves At the end of ths lesson, the students should be able to:

More information

Linear Regression Analysis: Terminology and Notation

Linear Regression Analysis: Terminology and Notation ECON 35* -- Secton : Basc Concepts of Regresson Analyss (Page ) Lnear Regresson Analyss: Termnology and Notaton Consder the generc verson of the smple (two-varable) lnear regresson model. It s represented

More information

The Geometry of Logit and Probit

The Geometry of Logit and Probit The Geometry of Logt and Probt Ths short note s meant as a supplement to Chapters and 3 of Spatal Models of Parlamentary Votng and the notaton and reference to fgures n the text below s to those two chapters.

More information

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6 Department of Quanttatve Methods & Informaton Systems Tme Seres and Ther Components QMIS 30 Chapter 6 Fall 00 Dr. Mohammad Zanal These sldes were modfed from ther orgnal source for educatonal purpose only.

More information

/ n ) are compared. The logic is: if the two

/ n ) are compared. The logic is: if the two STAT C141, Sprng 2005 Lecture 13 Two sample tests One sample tests: examples of goodness of ft tests, where we are testng whether our data supports predctons. Two sample tests: called as tests of ndependence

More information

Econ107 Applied Econometrics Topic 9: Heteroskedasticity (Studenmund, Chapter 10)

Econ107 Applied Econometrics Topic 9: Heteroskedasticity (Studenmund, Chapter 10) I. Defnton and Problems Econ7 Appled Econometrcs Topc 9: Heteroskedastcty (Studenmund, Chapter ) We now relax another classcal assumpton. Ths s a problem that arses often wth cross sectons of ndvduals,

More information

Chapter 5 Multilevel Models

Chapter 5 Multilevel Models Chapter 5 Multlevel Models 5.1 Cross-sectonal multlevel models 5.1.1 Two-level models 5.1.2 Multple level models 5.1.3 Multple level modelng n other felds 5.2 Longtudnal multlevel models 5.2.1 Two-level

More information

Department of Statistics University of Toronto STA305H1S / 1004 HS Design and Analysis of Experiments Term Test - Winter Solution

Department of Statistics University of Toronto STA305H1S / 1004 HS Design and Analysis of Experiments Term Test - Winter Solution Department of Statstcs Unversty of Toronto STA35HS / HS Desgn and Analyss of Experments Term Test - Wnter - Soluton February, Last Name: Frst Name: Student Number: Instructons: Tme: hours. Ads: a non-programmable

More information

Chapter 11: Simple Linear Regression and Correlation

Chapter 11: Simple Linear Regression and Correlation Chapter 11: Smple Lnear Regresson and Correlaton 11-1 Emprcal Models 11-2 Smple Lnear Regresson 11-3 Propertes of the Least Squares Estmators 11-4 Hypothess Test n Smple Lnear Regresson 11-4.1 Use of t-tests

More information

Simulated Power of the Discrete Cramér-von Mises Goodness-of-Fit Tests

Simulated Power of the Discrete Cramér-von Mises Goodness-of-Fit Tests Smulated of the Cramér-von Mses Goodness-of-Ft Tests Steele, M., Chaselng, J. and 3 Hurst, C. School of Mathematcal and Physcal Scences, James Cook Unversty, Australan School of Envronmental Studes, Grffth

More information

x i1 =1 for all i (the constant ).

x i1 =1 for all i (the constant ). Chapter 5 The Multple Regresson Model Consder an economc model where the dependent varable s a functon of K explanatory varables. The economc model has the form: y = f ( x,x,..., ) xk Approxmate ths by

More information

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analyss of Varance and Desgn of Experment-I MODULE VII LECTURE - 3 ANALYSIS OF COVARIANCE Dr Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur Any scentfc experment s performed

More information

since [1-( 0+ 1x1i+ 2x2 i)] [ 0+ 1x1i+ assumed to be a reasonable approximation

since [1-( 0+ 1x1i+ 2x2 i)] [ 0+ 1x1i+ assumed to be a reasonable approximation Econ 388 R. Butler 204 revsons Lecture 4 Dummy Dependent Varables I. Lnear Probablty Model: the Regresson model wth a dummy varables as the dependent varable assumpton, mplcaton regular multple regresson

More information

j) = 1 (note sigma notation) ii. Continuous random variable (e.g. Normal distribution) 1. density function: f ( x) 0 and f ( x) dx = 1

j) = 1 (note sigma notation) ii. Continuous random variable (e.g. Normal distribution) 1. density function: f ( x) 0 and f ( x) dx = 1 Random varables Measure of central tendences and varablty (means and varances) Jont densty functons and ndependence Measures of assocaton (covarance and correlaton) Interestng result Condtonal dstrbutons

More information

Andreas C. Drichoutis Agriculural University of Athens. Abstract

Andreas C. Drichoutis Agriculural University of Athens. Abstract Heteroskedastcty, the sngle crossng property and ordered response models Andreas C. Drchouts Agrculural Unversty of Athens Panagots Lazards Agrculural Unversty of Athens Rodolfo M. Nayga, Jr. Texas AMUnversty

More information

9. Binary Dependent Variables

9. Binary Dependent Variables 9. Bnar Dependent Varables 9. Homogeneous models Log, prob models Inference Tax preparers 9.2 Random effects models 9.3 Fxed effects models 9.4 Margnal models and GEE Appendx 9A - Lkelhood calculatons

More information

Kernel Methods and SVMs Extension

Kernel Methods and SVMs Extension Kernel Methods and SVMs Extenson The purpose of ths document s to revew materal covered n Machne Learnng 1 Supervsed Learnng regardng support vector machnes (SVMs). Ths document also provdes a general

More information

Chapter 15 Student Lecture Notes 15-1

Chapter 15 Student Lecture Notes 15-1 Chapter 15 Student Lecture Notes 15-1 Basc Busness Statstcs (9 th Edton) Chapter 15 Multple Regresson Model Buldng 004 Prentce-Hall, Inc. Chap 15-1 Chapter Topcs The Quadratc Regresson Model Usng Transformatons

More information

Lecture 3 Stat102, Spring 2007

Lecture 3 Stat102, Spring 2007 Lecture 3 Stat0, Sprng 007 Chapter 3. 3.: Introducton to regresson analyss Lnear regresson as a descrptve technque The least-squares equatons Chapter 3.3 Samplng dstrbuton of b 0, b. Contnued n net lecture

More information

Alternative Risk Models for Ranking Locations for Safety Improvement

Alternative Risk Models for Ranking Locations for Safety Improvement Alternatve Rsk Models for Rankng Locatons for Safety Improvement Lus F. Mranda-Moreno, Lpng Fu, Frank F. Saccomanno, and Aurele Labbe Many types of statstcal models have been proposed for estmatng accdent

More information

Chapter 8 Indicator Variables

Chapter 8 Indicator Variables Chapter 8 Indcator Varables In general, e explanatory varables n any regresson analyss are assumed to be quanttatve n nature. For example, e varables lke temperature, dstance, age etc. are quanttatve n

More information

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands Content. Inference on Regresson Parameters a. Fndng Mean, s.d and covarance amongst estmates.. Confdence Intervals and Workng Hotellng Bands 3. Cochran s Theorem 4. General Lnear Testng 5. Measures of

More information

Statistics for Managers Using Microsoft Excel/SPSS Chapter 14 Multiple Regression Models

Statistics for Managers Using Microsoft Excel/SPSS Chapter 14 Multiple Regression Models Statstcs for Managers Usng Mcrosoft Excel/SPSS Chapter 14 Multple Regresson Models 1999 Prentce-Hall, Inc. Chap. 14-1 Chapter Topcs The Multple Regresson Model Contrbuton of Indvdual Independent Varables

More information

Composite Hypotheses testing

Composite Hypotheses testing Composte ypotheses testng In many hypothess testng problems there are many possble dstrbutons that can occur under each of the hypotheses. The output of the source s a set of parameters (ponts n a parameter

More information

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA 4 Analyss of Varance (ANOVA) 5 ANOVA 51 Introducton ANOVA ANOVA s a way to estmate and test the means of multple populatons We wll start wth one-way ANOVA If the populatons ncluded n the study are selected

More information

DERIVATION OF THE PROBABILITY PLOT CORRELATION COEFFICIENT TEST STATISTICS FOR THE GENERALIZED LOGISTIC DISTRIBUTION

DERIVATION OF THE PROBABILITY PLOT CORRELATION COEFFICIENT TEST STATISTICS FOR THE GENERALIZED LOGISTIC DISTRIBUTION Internatonal Worshop ADVANCES IN STATISTICAL HYDROLOGY May 3-5, Taormna, Italy DERIVATION OF THE PROBABILITY PLOT CORRELATION COEFFICIENT TEST STATISTICS FOR THE GENERALIZED LOGISTIC DISTRIBUTION by Sooyoung

More information

Statistics for Business and Economics

Statistics for Business and Economics Statstcs for Busness and Economcs Chapter 11 Smple Regresson Copyrght 010 Pearson Educaton, Inc. Publshng as Prentce Hall Ch. 11-1 11.1 Overvew of Lnear Models n An equaton can be ft to show the best lnear

More information

First Year Examination Department of Statistics, University of Florida

First Year Examination Department of Statistics, University of Florida Frst Year Examnaton Department of Statstcs, Unversty of Florda May 7, 010, 8:00 am - 1:00 noon Instructons: 1. You have four hours to answer questons n ths examnaton.. You must show your work to receve

More information

[ ] λ λ λ. Multicollinearity. multicollinearity Ragnar Frisch (1934) perfect exact. collinearity. multicollinearity. exact

[ ] λ λ λ. Multicollinearity. multicollinearity Ragnar Frisch (1934) perfect exact. collinearity. multicollinearity. exact Multcollnearty multcollnearty Ragnar Frsch (934 perfect exact collnearty multcollnearty K exact λ λ λ K K x+ x+ + x 0 0.. λ, λ, λk 0 0.. x perfect ntercorrelated λ λ λ x+ x+ + KxK + v 0 0.. v 3 y β + β

More information

Durban Watson for Testing the Lack-of-Fit of Polynomial Regression Models without Replications

Durban Watson for Testing the Lack-of-Fit of Polynomial Regression Models without Replications Durban Watson for Testng the Lack-of-Ft of Polynomal Regresson Models wthout Replcatons Ruba A. Alyaf, Maha A. Omar, Abdullah A. Al-Shha ralyaf@ksu.edu.sa, maomar@ksu.edu.sa, aalshha@ksu.edu.sa Department

More information

A Comparative Study for Estimation Parameters in Panel Data Model

A Comparative Study for Estimation Parameters in Panel Data Model A Comparatve Study for Estmaton Parameters n Panel Data Model Ahmed H. Youssef and Mohamed R. Abonazel hs paper examnes the panel data models when the regresson coeffcents are fxed random and mxed and

More information

Testing for seasonal unit roots in heterogeneous panels

Testing for seasonal unit roots in heterogeneous panels Testng for seasonal unt roots n heterogeneous panels Jesus Otero * Facultad de Economía Unversdad del Rosaro, Colomba Jeremy Smth Department of Economcs Unversty of arwck Monca Gulett Aston Busness School

More information

Factor models with many assets: strong factors, weak factors, and the two-pass procedure

Factor models with many assets: strong factors, weak factors, and the two-pass procedure Factor models wth many assets: strong factors, weak factors, and the two-pass procedure Stanslav Anatolyev 1 Anna Mkusheva 2 1 CERGE-EI and NES 2 MIT December 2017 Stanslav Anatolyev and Anna Mkusheva

More information

Chapter 20 Duration Analysis

Chapter 20 Duration Analysis Chapter 20 Duraton Analyss Duraton: tme elapsed untl a certan event occurs (weeks unemployed, months spent on welfare). Survval analyss: duraton of nterest s survval tme of a subject, begn n an ntal state

More information

Limited Dependent Variables

Limited Dependent Variables Lmted Dependent Varables. What f the left-hand sde varable s not a contnuous thng spread from mnus nfnty to plus nfnty? That s, gven a model = f (, β, ε, where a. s bounded below at zero, such as wages

More information

Correlation and Regression. Correlation 9.1. Correlation. Chapter 9

Correlation and Regression. Correlation 9.1. Correlation. Chapter 9 Chapter 9 Correlaton and Regresson 9. Correlaton Correlaton A correlaton s a relatonshp between two varables. The data can be represented b the ordered pars (, ) where s the ndependent (or eplanator) varable,

More information

Continuous vs. Discrete Goods

Continuous vs. Discrete Goods CE 651 Transportaton Economcs Charsma Choudhury Lecture 3-4 Analyss of Demand Contnuous vs. Dscrete Goods Contnuous Goods Dscrete Goods x auto 1 Indfference u curves 3 u u 1 x 1 0 1 bus Outlne Data Modelng

More information

Online Appendix to: Axiomatization and measurement of Quasi-hyperbolic Discounting

Online Appendix to: Axiomatization and measurement of Quasi-hyperbolic Discounting Onlne Appendx to: Axomatzaton and measurement of Quas-hyperbolc Dscountng José Lus Montel Olea Tomasz Strzaleck 1 Sample Selecton As dscussed before our ntal sample conssts of two groups of subjects. Group

More information

Primer on High-Order Moment Estimators

Primer on High-Order Moment Estimators Prmer on Hgh-Order Moment Estmators Ton M. Whted July 2007 The Errors-n-Varables Model We wll start wth the classcal EIV for one msmeasured regressor. The general case s n Erckson and Whted Econometrc

More information

Statistical Evaluation of WATFLOOD

Statistical Evaluation of WATFLOOD tatstcal Evaluaton of WATFLD By: Angela MacLean, Dept. of Cvl & Envronmental Engneerng, Unversty of Waterloo, n. ctober, 005 The statstcs program assocated wth WATFLD uses spl.csv fle that s produced wth

More information

COMPARISON OF SOME RELIABILITY CHARACTERISTICS BETWEEN REDUNDANT SYSTEMS REQUIRING SUPPORTING UNITS FOR THEIR OPERATIONS

COMPARISON OF SOME RELIABILITY CHARACTERISTICS BETWEEN REDUNDANT SYSTEMS REQUIRING SUPPORTING UNITS FOR THEIR OPERATIONS Avalable onlne at http://sck.org J. Math. Comput. Sc. 3 (3), No., 6-3 ISSN: 97-537 COMPARISON OF SOME RELIABILITY CHARACTERISTICS BETWEEN REDUNDANT SYSTEMS REQUIRING SUPPORTING UNITS FOR THEIR OPERATIONS

More information

Economics 130. Lecture 4 Simple Linear Regression Continued

Economics 130. Lecture 4 Simple Linear Regression Continued Economcs 130 Lecture 4 Contnued Readngs for Week 4 Text, Chapter and 3. We contnue wth addressng our second ssue + add n how we evaluate these relatonshps: Where do we get data to do ths analyss? How do

More information

STAT 511 FINAL EXAM NAME Spring 2001

STAT 511 FINAL EXAM NAME Spring 2001 STAT 5 FINAL EXAM NAME Sprng Instructons: Ths s a closed book exam. No notes or books are allowed. ou may use a calculator but you are not allowed to store notes or formulas n the calculator. Please wrte

More information

A Robust Method for Calculating the Correlation Coefficient

A Robust Method for Calculating the Correlation Coefficient A Robust Method for Calculatng the Correlaton Coeffcent E.B. Nven and C. V. Deutsch Relatonshps between prmary and secondary data are frequently quantfed usng the correlaton coeffcent; however, the tradtonal

More information

Estimation of the Probability of Success Based on Communication History

Estimation of the Probability of Success Based on Communication History Workng paper, presented at 7-th Valenca Meetng n Bayesan Statstcs, June 22 Estmaton of the Probablty of Success Based on Communcaton Hstory Arkady E Shemyakn Unversty of St Thomas, Sant Paul, Mnnesota,

More information

Simulation and Probability Distribution

Simulation and Probability Distribution CHAPTER Probablty, Statstcs, and Relablty for Engneers and Scentsts Second Edton PROBABILIT DISTRIBUTION FOR CONTINUOUS RANDOM VARIABLES A. J. Clark School of Engneerng Department of Cvl and Envronmental

More information

Maximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models

Maximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models ECO 452 -- OE 4: Probt and Logt Models ECO 452 -- OE 4 Mamum Lkelhood Estmaton of Bnary Dependent Varables Models: Probt and Logt hs note demonstrates how to formulate bnary dependent varables models for

More information

Resource Allocation and Decision Analysis (ECON 8010) Spring 2014 Foundations of Regression Analysis

Resource Allocation and Decision Analysis (ECON 8010) Spring 2014 Foundations of Regression Analysis Resource Allocaton and Decson Analss (ECON 800) Sprng 04 Foundatons of Regresson Analss Readng: Regresson Analss (ECON 800 Coursepak, Page 3) Defntons and Concepts: Regresson Analss statstcal technques

More information

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding Recall: man dea of lnear regresson Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 8 Lnear regresson can be used to study an

More information

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 008 Recall: man dea of lnear regresson Lnear regresson can be used to study

More information

Comparison of the Population Variance Estimators. of 2-Parameter Exponential Distribution Based on. Multiple Criteria Decision Making Method

Comparison of the Population Variance Estimators. of 2-Parameter Exponential Distribution Based on. Multiple Criteria Decision Making Method Appled Mathematcal Scences, Vol. 7, 0, no. 47, 07-0 HIARI Ltd, www.m-hkar.com Comparson of the Populaton Varance Estmators of -Parameter Exponental Dstrbuton Based on Multple Crtera Decson Makng Method

More information

Financing Innovation: Evidence from R&D Grants

Financing Innovation: Evidence from R&D Grants Fnancng Innovaton: Evdence from R&D Grants Sabrna T. Howell Onlne Appendx Fgure 1: Number of Applcants Note: Ths fgure shows the number of losng and wnnng Phase 1 grant applcants over tme by offce (Energy

More information

The Bootstrap Maximum Likelihood Method for Estimation of Dispersion Parameter in the Negative Binomial Regression Model

The Bootstrap Maximum Likelihood Method for Estimation of Dispersion Parameter in the Negative Binomial Regression Model The Bootstrap Maxmum Lkelhood Method for Estmaton of Dsperson Parameter n the Negatve Bnomal Regresson Model Huang Jewu 1, Xaychalern Kongkeo 1,2,*, L Rong 1 1 College of Data Scence and Informaton Engneerng,

More information

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U)

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U) Econ 413 Exam 13 H ANSWERS Settet er nndelt 9 deloppgaver, A,B,C, som alle anbefales å telle lkt for å gøre det ltt lettere å stå. Svar er gtt . Unfortunately, there s a prntng error n the hnt of

More information

Limited Dependent Variables and Panel Data. Tibor Hanappi

Limited Dependent Variables and Panel Data. Tibor Hanappi Lmted Dependent Varables and Panel Data Tbor Hanapp 30.06.2010 Lmted Dependent Varables Dscrete: Varables that can take onl a countable number of values Censored/Truncated: Data ponts n some specfc range

More information

Linear Approximation with Regularization and Moving Least Squares

Linear Approximation with Regularization and Moving Least Squares Lnear Approxmaton wth Regularzaton and Movng Least Squares Igor Grešovn May 007 Revson 4.6 (Revson : March 004). 5 4 3 0.5 3 3.5 4 Contents: Lnear Fttng...4. Weghted Least Squares n Functon Approxmaton...

More information

ECONOMETRICS - FINAL EXAM, 3rd YEAR (GECO & GADE)

ECONOMETRICS - FINAL EXAM, 3rd YEAR (GECO & GADE) ECONOMETRICS - FINAL EXAM, 3rd YEAR (GECO & GADE) June 7, 016 15:30 Frst famly name: Name: DNI/ID: Moble: Second famly Name: GECO/GADE: Instructor: E-mal: Queston 1 A B C Blank Queston A B C Blank Queston

More information

Copyright 2017 by Taylor Enterprises, Inc., All Rights Reserved. Adjusted Control Limits for P Charts. Dr. Wayne A. Taylor

Copyright 2017 by Taylor Enterprises, Inc., All Rights Reserved. Adjusted Control Limits for P Charts. Dr. Wayne A. Taylor Taylor Enterprses, Inc. Control Lmts for P Charts Copyrght 2017 by Taylor Enterprses, Inc., All Rghts Reserved. Control Lmts for P Charts Dr. Wayne A. Taylor Abstract: P charts are used for count data

More information

UNIVERSITY OF TORONTO Faculty of Arts and Science. December 2005 Examinations STA437H1F/STA1005HF. Duration - 3 hours

UNIVERSITY OF TORONTO Faculty of Arts and Science. December 2005 Examinations STA437H1F/STA1005HF. Duration - 3 hours UNIVERSITY OF TORONTO Faculty of Arts and Scence December 005 Examnatons STA47HF/STA005HF Duraton - hours AIDS ALLOWED: (to be suppled by the student) Non-programmable calculator One handwrtten 8.5'' x

More information

Semiparametric geographically weighted generalised linear modelling in GWR 4.0

Semiparametric geographically weighted generalised linear modelling in GWR 4.0 Semparametrc geographcally weghted generalsed lnear modellng n GWR 4.0 T. Nakaya 1, A. S. Fotherngham 2, M. Charlton 2, C. Brunsdon 3 1 Department of Geography, Rtsumekan Unversty, 56-1 Tojn-kta-mach,

More information

Lecture 12: Classification

Lecture 12: Classification Lecture : Classfcaton g Dscrmnant functons g The optmal Bayes classfer g Quadratc classfers g Eucldean and Mahalanobs metrcs g K Nearest Neghbor Classfers Intellgent Sensor Systems Rcardo Guterrez-Osuna

More information

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analyss of Varance and Desgn of Experment-I MODULE VIII LECTURE - 34 ANALYSIS OF VARIANCE IN RANDOM-EFFECTS MODEL AND MIXED-EFFECTS EFFECTS MODEL Dr Shalabh Department of Mathematcs and Statstcs Indan

More information

LINEAR REGRESSION ANALYSIS. MODULE VIII Lecture Indicator Variables

LINEAR REGRESSION ANALYSIS. MODULE VIII Lecture Indicator Variables LINEAR REGRESSION ANALYSIS MODULE VIII Lecture - 7 Indcator Varables Dr. Shalabh Department of Maematcs and Statstcs Indan Insttute of Technology Kanpur Indcator varables versus quanttatve explanatory

More information

Rockefeller College University at Albany

Rockefeller College University at Albany Rockefeller College Unverst at Alban PAD 705 Handout: Maxmum Lkelhood Estmaton Orgnal b Davd A. Wse John F. Kenned School of Government, Harvard Unverst Modfcatons b R. Karl Rethemeer Up to ths pont n

More information

Basic Business Statistics, 10/e

Basic Business Statistics, 10/e Chapter 13 13-1 Basc Busness Statstcs 11 th Edton Chapter 13 Smple Lnear Regresson Basc Busness Statstcs, 11e 009 Prentce-Hall, Inc. Chap 13-1 Learnng Objectves In ths chapter, you learn: How to use regresson

More information

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity LINEAR REGRESSION ANALYSIS MODULE IX Lecture - 31 Multcollnearty Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur 6. Rdge regresson The OLSE s the best lnear unbased

More information

Chapter 14 Simple Linear Regression

Chapter 14 Simple Linear Regression Chapter 4 Smple Lnear Regresson Chapter 4 - Smple Lnear Regresson Manageral decsons often are based on the relatonshp between two or more varables. Regresson analss can be used to develop an equaton showng

More information

This column is a continuation of our previous column

This column is a continuation of our previous column Comparson of Goodness of Ft Statstcs for Lnear Regresson, Part II The authors contnue ther dscusson of the correlaton coeffcent n developng a calbraton for quanttatve analyss. Jerome Workman Jr. and Howard

More information

Lecture 3: Probability Distributions

Lecture 3: Probability Distributions Lecture 3: Probablty Dstrbutons Random Varables Let us begn by defnng a sample space as a set of outcomes from an experment. We denote ths by S. A random varable s a functon whch maps outcomes nto the

More information

Empirical Methods for Corporate Finance. Identification

Empirical Methods for Corporate Finance. Identification mprcal Methods for Corporate Fnance Identfcaton Causalt Ultmate goal of emprcal research n fnance s to establsh a causal relatonshp between varables.g. What s the mpact of tangblt on leverage?.g. What

More information

UNR Joint Economics Working Paper Series Working Paper No Further Analysis of the Zipf Law: Does the Rank-Size Rule Really Exist?

UNR Joint Economics Working Paper Series Working Paper No Further Analysis of the Zipf Law: Does the Rank-Size Rule Really Exist? UNR Jont Economcs Workng Paper Seres Workng Paper No. 08-005 Further Analyss of the Zpf Law: Does the Rank-Sze Rule Really Exst? Fungsa Nota and Shunfeng Song Department of Economcs /030 Unversty of Nevada,

More information

PBAF 528 Week Theory Is the variable s place in the equation certain and theoretically sound? Most important! 2. T-test

PBAF 528 Week Theory Is the variable s place in the equation certain and theoretically sound? Most important! 2. T-test PBAF 528 Week 6 How do we choose our model? How do you decde whch ndependent varables? If you want to read more about ths, try Studenmund, A.H. Usng Econometrcs Chapter 7. (ether 3 rd or 4 th Edtons) 1.

More information

DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR. Introductory Econometrics 1 hour 30 minutes

DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR. Introductory Econometrics 1 hour 30 minutes 25/6 Canddates Only January Examnatons 26 Student Number: Desk Number:...... DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR Department Module Code Module Ttle Exam Duraton

More information

RELIABILITY ASSESSMENT

RELIABILITY ASSESSMENT CHAPTER Rsk Analyss n Engneerng and Economcs RELIABILITY ASSESSMENT A. J. Clark School of Engneerng Department of Cvl and Envronmental Engneerng 4a CHAPMAN HALL/CRC Rsk Analyss for Engneerng Department

More information

BOOTSTRAP METHOD FOR TESTING OF EQUALITY OF SEVERAL MEANS. M. Krishna Reddy, B. Naveen Kumar and Y. Ramu

BOOTSTRAP METHOD FOR TESTING OF EQUALITY OF SEVERAL MEANS. M. Krishna Reddy, B. Naveen Kumar and Y. Ramu BOOTSTRAP METHOD FOR TESTING OF EQUALITY OF SEVERAL MEANS M. Krshna Reddy, B. Naveen Kumar and Y. Ramu Department of Statstcs, Osmana Unversty, Hyderabad -500 007, Inda. nanbyrozu@gmal.com, ramu0@gmal.com

More information