Identifying Vehicular Crash High Risk Locations along Highways via Spatial Autocorrelation Indices and Kernel Density Estimation

Size: px
Start display at page:

Download "Identifying Vehicular Crash High Risk Locations along Highways via Spatial Autocorrelation Indices and Kernel Density Estimation"

Transcription

1 World Journal of Engneerng and Technology, 2017, 5, ISSN Onlne: ISSN Prnt: Identfyng Vehcular Crash Hgh Rsk Locatons along Hghways va Spatal Autocorrelaton Indces and Kernel Densty Estmaton Azad Abdulhafedh 1 Department of Cvl and Envronmental Engneerng, Unversty of Mssour-Columba, MO, USA How to cte ths paper: Abdulhafedh, A. (2017) Identfyng Vehcular Crash Hgh Rsk Locatons along Hghways va Spatal Autocorrelaton Indces and Kernel Densty Estmaton. World Journal of Engneerng and Technology, 5, Receved: February 27, 2017 Accepted: May 7, 2017 Publshed: May 10, 2017 Copyrght 2017 by author and Scentfc Research Publshng Inc. Ths work s lcensed under the Creatve Commons Attrbuton Internatonal Lcense (CC BY 4.0). Open Access Abstract Identfyng vehcular crash hgh rsk locatons along hghways s mportant for understandng the causes of vehcle crashes and to determne effectve countermeasures based on the analyss. Ths paper presents a GIS approach to examne the spatal patterns of vehcle crashes and determnes f they are spatally clustered, dspersed, or random. Moran s I and Gets-Ord G* statstc are employed to examne spatal patterns, clusters mappng of vehcle crash data, and to generate hgh rsk locatons along hghways. Kernel Densty Estmaton (KDE) s used to generate crash concentraton maps that show the road densty of crashes. The proposed approach s evaluated usng the 2013 vehcle crash data n the state of Indana. Results show that the approach s effcent and relable n dentfyng vehcle crash hot spots and unsafe road locatons. Keywords Spatal Autocorrelaton, Kernel Densty, Moran s I, G* statstc, Hot Spots Analyss 1. Introducton Identfyng vehcular crash hgh rsk locatons along hghways s a useful tool that can help transportaton agences allocate lmted resources more effcently, and fnd effectve countermeasures. A crash hot spot s a locaton showng concentraton of ncdents, and hot spot analyss s a method for analyzng the spatal tendency between ponts or events wthn ths locaton [1]. If a feature s spatal tendency s hgh, and the values of ts neghborng features s also hgh, t s a part of a hot spot, and f the tendency of a feature and ts neghborhoods s low, t s a part of a cold spot. Spatal patterns of traffc crash data can be analyzed by 1 PhD n Cvl Engneerng DOI: /wjet May 10, 2017

2 spatal autocorrelaton, whch s a measure of the correlaton of an observaton wth other observatons through space. The spatal autocorrelaton phenomenon can be summarzed by the Tobler s frst law of Geography that everythng s usually related to all else but those whch are near to each other are more related when compared to those that are further away [2]. Most statstcal analyses are based on the assumpton that the values of observatons n each sample are ndependent of one another. Spatal autocorrelaton volates ths assumpton, because samples taken from nearby locatons are related to each other, and hence, they are statstcally not ndependent of one another [1] [3]. To assess spatal autocorrelaton, a dstance measure must be specfed n order to defne what s meant by two observatons beng close together. These dstances are usually presented n the form of a weght matrx, whch defnes the relatonshps between locatons at whch the observatons occur [4]. If data were collected at n locatons, then the weght matrx wll be n x n wth zeroes on the dagonal. The weght matrx s often row-standardzed, (.e. all weghts n a row sum to one), and can be constructed gven a varety of assumptons, such as [1]: A constant dstance that represents the weght for any two dfferent locatons. A fxed weght for all observatons wthn a specfed dstance. k nearest neghbors that represents a fxed weght, and all others non-neghbors are zero. Weght could be proportonal to the nverse dstance, or nverse dstance squared. There are a number of ndces or statstcs that attempt to measure spatal autocorrelaton for contnuous data, such as Moran s I, Geary s C, and Gets-Ord G statstc [5]. These ndces can be used as Global or Local measures dependng on the scope of the analyss. Global mples that all elements n the weght matrx are ncluded n the calculaton of spatal autocorrelaton provdng a sngle measurement of spatal autocorrelaton for an entre data set. Local ndces calculate spatal autocorrelaton for all areal unts of analyss. In other words, the global autocorrelaton s the extent to whch ponts that are close together n space have smlar values, and the local autocorrelaton s the extent to whch ponts that are close to a gven pont or area have smlar values. Anseln [6] outlned a general class of local ndcators of spatal autocorrelaton termed the Local Indcator of Spatal Autocorrelaton (LISA) statstc, whch mples that the LISA statstc decomposes global results nto ther local parts. For example, a sgnfcant global ndex at a gven spatal pont or secton may hde large spatal patches of no autocorrelaton, and LISA can detect ths and show us the locaton of these nsgnfcant patches n space. Conversely, an nsgnfcant global result may hde patches of strong autocorrelaton, and LISA can detect ths agan. Generally, both hot spots and cold spots can be dentfed as locatons for whch the LISA statstc s sgnfcant [6]. Hgh and sgnfcant values of Moran s I and G statstc n a spot ndcate a hgh spatal clusterng (hot spot), whereas low and sgnfcant values ndcate a low spatal clusterng (cold spot). The type of clus- 199

3 terng and ts statstcal sgnfcance s evaluated based on a confdence level and on the output z-scores and the correspondent p-values. These wll determne whether a data pont or a locaton belongs to a hot spot (denoted by Hgh-Hgh, HH), cold spot (denoted by Low-Low, LL) or an outler (a hgh data value surrounded by low data values or vce versa, denoted by Hgh-Low, HL or Low-Hgh, LH). Other methods for studyng spatal patterns of crash data as pont events have recently been developed. One of the most wdely used s the Kernel Densty Estmaton (KDE). The goal of KDE s to develop a contnuous surface of densty estmates of dscrete events such as road crashes by summng the number of events wthn a search bandwdth. Many recent studes have used the 2-D planar KDE for hot spot analyss. However, ths method has been crtczed n relaton to the fact that road crashes usually happen on the road lnks and need to be consdered n a road network space represented by 1-D dmenson. Therefore, some studes have extended the planar KDE to network spaces, whch estmates the crash densty over a dstance unt n a 1-D measurement nstead of an area unt [7] [8] [9]. However, a major weakness of the KDE methods s that t cannot be tested for statstcal sgnfcance [8] [10]. 2. Lterature Revew Vehcle crashes have been nvestgated from dfferent spatal and temporal perspectves by dfferent researchers usng vared procedures. The Black and Thomas [11] paper was the frst major work that clearly dstngushes traffc crash hotspots. Ther work ndcated that a postve network autocorrelaton of road crashes can cause the spatal clusterng of traffc hotspots. However, the analyss focused on network autocorrelatons at a global level wthn an entre dataset by usng Moran s I and the assocated z-score tests. Flahaut [12] ntroduced the use of the local ndces of spatal autocorrelaton (LISA) to examne the crash patterns of road networks n a Belgan provnce. Ths work explaned the advantages of hotspots and further developed logstc regresson models to explan traffc crash hotspots wth road characterstcs and local envronmental condtons. Yamada and Thll [7] explaned and compared hotspot methods by Kernel Densty Estmaton (KDE) at the planar and network-constraned. Ther work ndcated that the planar KDE analyss can produce over-detectng cluster patterns. More recently, Yamada and Thll [13] ntroduced a method called local ndcators of network-constraned clusters (LINCS) to dentfy hotspots by usng the network-based approach. Theoretcally, traffc crashes can occur at every possble locaton over the entre road network. However, t s mpractcal to examne the clusterng pattern at every possble pont usng the network-based approach. Hence, they suggested usng reference ponts along the network wth an equal nterval dstances. Schwetzer [14] used a process called kernel smoothng for hot spot analyss. Ths process creates local estmates of the measure of the spatal ntensty usng the count of frequency of ponts wthn a gven dstance of each pont, relatve to symmetrc dstrbuton. Xe and Yan [8] used a planar and a network-based KDE approach to examne traffc crashes n the state of Ken- 200

4 tucky. In mplementng the network KDE, they suggested usng a lnear segment of roads as the basc unt for aggregatng crashes, calculatng densty, and for vsualzaton. They found that segments of shorter length are more capable of showng the local varatons of the segments, and concluded that the networkconstraned KDE s more approprate than the planar KDE for traffc crash analyss. Erdogan et al. [15] used a repeatablty analyss to dentfy hotspots wth the hghest 5% and 1% area of the Posson dstrbuton over ten years, usng a bandwdth of 500 m. They concluded that repeatablty analyss determned more hot spot locatons than the Kernel Densty analyss. Yamada and Thll [16] appled the network-based framework to analyze hghway crashes that occurred on a small hghway network n Buffalo, New York. The method was mplemented n conjuncton wth Monte Carlo smulaton to obtan crtera aganst whch statstcal nferences from the observed patterns can be made. They found that ncorporatng GIS and spatal statstcal approaches can effectvely detect crash hotspots. 3. Moran s I Moran s I [17] s one of the oldest ndces of spatal autocorrelaton and can be used to test for global and local spatal autocorrelaton among contnuous data. For any contnuous varable, x a mean x, can be calculated and the devaton of any observaton from that mean can be calculated based on the cross products of the devatons from the mean. The statstc then compares the value of the varable at any one locaton wth the values at all other locatons [18] [19] [20]. For n observatons on a varable x at locatons, j, Moran s I s calculated by Equaton (1) as follows: ( )( ) n n j j j 0 n ( )) 2 n w x x x x I = S x x where, x : s the mean of the varable x; w j : are the elements of the weght matrx; S 0 : s the sum of the elements of the weght matrx: S0 n n = j Values for ths ndex typcally, range from 1.0 to +1.0, where a value of -1.0 ndcates negatve spatal autocorrelaton, and a value of +1.0 ndcates postve spatal autocorrelaton. When nearby ponts or segments have smlar values, ther cross product s hgh. Conversely, when nearby ponts or segments have dssmlar values, ther cross-product s low. The expectaton of Moran s I s: ( ) E I 1 = n 1 wth a Moran s I value larger than E(I), ndcates postve spatal autocorrelaton, and a Moran s I less than E(I), ndcates negatve spatal autocorrelaton. In Moran s formulaton, the weght varable, w j, s a contguty matrx. If zone j s adjacent to zone, the product receves a weght of 1.0. Otherwse, the product w j (1) (2) 201

5 receves a weght of 0.0. The z-scores of Moran s I can be computed by Equaton (3): ( ) ( ) I E I Z = (3) V I where E(I) s the expected value of I, and V(I) s the varance of I, as shown n Equaton (4): 2 2 ( ) ( ) ( ) V I = E I E I (4) The dstrbuton of the z-scores s assumed to be approxmately normal wth a mean of 0.0 and a varance of 1.0 (Clff and Ord 1981). A statstcally sgnfcant postve z-score ndcates that the dstrbuton of the observatons s spatally autocorrelated producng Hgh-Hgh (HH) clusters, whereas a negatve z-score ndcates that the observatons tend to be more dssmlar producng Low-Low (LL) clusters. A z-score close to zero ndcates that observatons are randomly and ndependently dstrbuted n space. By assumng a z-score s from a standard normal dstrbuton, ther assocated p-value can be obtaned, and can be used to determne the sgnfcance of the ndex at each locaton [4]. To determne f the z- score s statstcally sgnfcant, t should be compared to the range of values for a partcular confdence level. For example, at a sgnfcance level of 95%, a z-score would have to be less than 1.96 or greater than to be statstcally sgnfcant. The null hypothess H 0 s that there s no spatal autocorrelaton among the observatons. The null hypothess can be rejected, f the p-value shows that the z-score s sgnfcant. 4. Gets-Ord G Statstc The Gets-Ord G statstc s another ndex of spatal autocorrelaton [21] that can dstngush between postve spatal autocorrelaton wth hgh values from postve spatal autocorrelaton wth low values. The General (Global) G statstc computes a sngle statstc for the entre study area, whle the local G statstc s an ndcator for local autocorrelaton for each data pont. There are two types of G statstcs, although almost the two types produce dentcal results [22] [23]. The frst one, G, does not nclude the autocorrelaton of a zone wth tself, whereas the ncludes the nteracton of a zone wth tself (.e. the G statstc does not nclude the value of X tself, but only the neghborhood values, but ncludes X as well as the neghborhood values), and formally both can be computed by the formulae [5]: G G n ( ) j j ( ) w d = n x j j d x n ( ) j 1 j ( ) w d d = n = x j = where, d s the neghborhood (threshold) dstance, and w j s the weght matrx j 1 j x j (5) (6) 202

6 that has only 1.0 or 0.0 values, 1.0 f j s wthn d dstance of, and 0.0 f ts beyond that dstance. These formulae ndcate that the cross-product of the value of X at locaton and at another locaton j s weghted by a dstance weght, w j whch s defned by ether a 1.0 f the two locatons are equal to or closer than a threshold dstance, d, or a 0.0 otherwse. The G statstc can vary between 0.0 and 1.0. The statstcal sgnfcance of the local autocorrelaton between each pont and ts neghbors s assessed by the z-score test and the p-value. The expected G value for a threshold dstance, d, s defned as: W E G( d ) = (7) n n 1 ( ) where, W s the sum of weghts for all pars of locatons ( W n = n j w j ), and n s the number of observatons. Assumng normal dstrbuton, the varance of G(d) s defned as [24]: 2 2 ( ) = ( ) ( ) Var G d E G E G (8) The standard error of G(d) s the square root of the varance of G. Therefore, a z-test can be computed by: ( ) = ( ) SE.. G d Var G d (9) G d Z G( d) = ( ) E G( d) SE.. G( d) Where, a postve z-value ndcates spatal clusterng of hgh values, whle a negatve z-value ndcates spatal clusterng of low values. Sometmes, the G statstc may not follow a normal standard error, and the dstrbuton of the statstc may not be normally dstrbuted, such as the case of a skewed varable wth some ponts havng very hgh values whle the majorty of other ponts havng low values. In ths case, a permutaton type smulaton should be used [6] [25], wth a randomzaton dstrbuton to test the null hypothess of no local autocorrelaton (H 0 ). Ths wll mantan the dstrbuton of the varable z but wll estmate the value of G under random assgnment of ths varable, and the user can take the usual 95% or 99% confdence ntervals based on the level used. 5. Planar Kernel Densty Estmaton Kernel Densty Estmaton s a non-parametrc method to estmate the probablty densty functon of a varable that produces a smooth densty surface of pont events over a 2-D geographc space (.e. planar space). Kernel densty estmatons are closely related to hstograms, but can be constructed wth propertes such as smoothness or contnuty by usng a sutable kernel. The dsadvantages of hstograms provde the motvaton for kernel estmaton. When we construct a hstogram, we need to consder the wdth of the bns n whch the whole data nterval s dvded by, and the end ponts of the bns. As a result, the problems wth hstograms are that they are not smooth, and therefore we can allevate these problems by usng kernel densty estmaton that centers a kernel functon (10) 203

7 at each data pont [26]. KDE tends to produce a smooth densty surface of pont events over space by computng event ntensty as densty estmaton. The general form of a KDE n a 2-D space s gven by [8]: n 1 ds λ ( s) = k (11) 1 2 π r r Where λ(s) s the densty at locaton s, r s the search radus (bandwdth) of the KDE, k s the weght of a pont at dstance d s to locaton s. The kernel functon k s usually consdered as a functon of the rato between d s and r. As a result, the longer the dstance between a pont and locaton s, the less that pont s weghted for calculatng the overall densty. All ponts wthn the bandwdth r of locaton s are summed for calculatng the densty at s. A number of dstrbutons can be used to measure the spatal weghts k, such as Gaussan, Quartc, Conc, Mnmum varance functon, negatve exponental, and epanchnekov [27] [28]. Some of the mostly used forms of kernel functons are [29]: The Gaussan functon: d k r The Quartc functon: d k r s 1 d = EXP 2π 2r s d = K 1 r 2 s 2 2 s 2 (12) (13) Where K s a scalng factor to ensure the total volume under Quartc curve s 1.0, and usually used as ¾. The mnmum varance functon: d k r s 3 d = r To fnd the KDE value, two key parameters must be chosen: the kernel functon k; and the search radus (bandwdth) r. Many studes have found that the type of the dstrbuton of the kernel functon k has a very lttle effect on the results compared to the choce of search bandwdth r [1] [26] [29] [30] [31]. The value of search bandwdth r s very mportant because t usually determnes the smoothness of the estmated densty and can affect the outcome. If t s too small, t wll not produce a contnuous smooth surface, and too large bandwdth wll suppress spatal varaton of events. Therefore, the bandwdth of the kernel densty estmaton often proves to be more nfluental to outcomes than the kernel shape dstrbuton. Hence, an optmal value of r must be chosen that mnmzes the sum of the squared errors of the kernel estmaton [32]. There s no unque defnton of the optmal search radus, and dfferent optmalty crtera have been used. For example, ESRI ArcGIS 10.2 uses the followng formula as a default optmal search radus [33]: r = 0.9 mn SD, Dm n ln ( 2) (15) 2 s 2 (14) 204

8 where, SD: the standard dstance D m : the medan dstance n: the number of ponts f no populaton feld s used, or f a populaton feld s suppled, n s the sum of the populaton feld values, and mn: means that whchever of the two optons that results n a smaller value wll be used. Slverman [26] suggested a rule of thumb for calculatng the optmal search radus as follows: = 4 (16) r SD 5 3 n where, the SD s the standard devaton of the samples provded that the kernel functon s Gaussan type and that samples follow normal dstrbuton. The cell sze depends on the user choce and the dataset. Okabe et al. [9] suggested usng a cell sze of (r/10) as a rule of thumb. 6. Network Kernel Densty Estmaton In the real world, there are many knds of network-constraned events, such as traffc crashes, street crmes, leakages n gas ppe lnes along roadways, and rver contamnaton. In planar KDE, the space s characterzed as a 2-D homogeneous Eucldan space and densty s usually estmated at a large number of locatons that are regularly spaced over a grd. However, n analyzng the hot spots of network-constrant events, the assumpton of homogenety of 2-D space does not hold and the relevant KDE methods may produce based results [9]. Therefore, the planar KDE has been extended to the network KDE, whch dffers from the planar KDE n several aspects: () the network space s used as the pont event context; () both search bandwdth r and kernel functon k are based on network dstance (calculated as the shortest path dstance n a network) nstead of straght-lne Eucldean dstance; and () densty s measured per lnear unt nstead of area unt. The network KDE s a 1-D measurement whle the planar KDE s a 2-D measurement [9]. The network KDE s an extenson of the planar 2-D KDE and t uses the followng equaton for the densty estmaton of network-constraned pont events n a network space [8]: λ ( s) n 1 ds = k r r (17) Instead of calculatng the kernel densty over an area unt, the equaton estmates the densty over a lnear unt, and any of the dfferent forms of kernel functons k may be used. 7. Data The analyss s conducted on a dataset that presents a road network n the state of Indana as shown n Fgure 1. The data ncludes a crash pont layer and a road layer wth crash records of the year 2013 that ncludes all types of crashes (.e. fatal, njury, and property damage). The dataset ncludes 2983 crash pont 205

9 Fgure 1. Roads and Crashes of Indana network used n the analyss. events on the network. 8. Results and Dscusson The Global Moran s I evaluates whether the overall network crashes are clustered, dspersed, or random, and assesses the overall spatal pattern of the crash data. The GIS spatal statstcs tool s used to compute the Global Moran s I, and fve values are generated from runnng ths tool: The Moran s Index, the Expected Index, the Varance, the z-score, and the p-value as shown n Table 1. The results of the analyss are nterpreted wthn the context of the null hypothess. For the Global Moran s I statstc, the null hypothess states that the attrbutes (.e. crashes) beng analyzed are randomly dstrbuted among the features n the study area (.e. no global spatal autocorrelaton exsts for the entre network). However, snce the p-value beng generated s less than 0.01 (usng a confdence level of 99%), then ths ndcates that the Global Moran s I spatal autocorrelaton s sgnfcant, and hence, we can reject the null hypothess, and state that t s qute possble that the spatal dstrbuton of the overall network crashes s the result of clusterng pattern, and there s less than 1% probablty that ths pattern could be the result of random process. Smlarly, the Global (General) Gets-Ord statstc evaluates whether the overall network crashes are clustered, dspersed, or random, and assesses the overall pattern and trend of the crash data, and fve values are generated from runnng the ArcMap spatal statstc tool: The General G statstc, the Expected Index, the Varance, the z-score, and the p-value as shown n Table 2. Snce the p-value beng generated s less than 0.01 (usng a confdence level of 99%), then ths ndcates that the General G statstc spatal autocorrelaton s sgnfcant, and hence, we can reject the null hypothess, and state that t s qute possble that the spatal dstrbuton of the overall network crashes s the result of clusterng patterns, and there s less than 1% probablty that ths pattern 206

10 Table 1. Global Moran s I Summery for Indana road network. Global Moran s I Expected Index Varance z-score p-value Decson sgnfcant Table 2. General G statstc Summery for Indana road network General G Expected Index Varance z-score p-value Decson sgnfcant could be the result of random process. Ths result s analogous to the Global Moran s I n determnng the overall clusterng pattern of the crashes. Next, the statstcally sgnfcant hot spots, cold spots, and spatal outlers are dentfed usng the Anseln Local Moran s I, and the local statstc. The z-scores and p-values can be used to evaluate the statstcal sgnfcance of the computed ndex values. Ths method can dstngush between a statstcally sgnfcant cluster of hgh values (HH), cluster of low values (LL), and outlers n whch a hgh value s surrounded by low values (HL), and outlers n whch a low value s surrounded by hgh values (LH). Table 3 shows the HH, LL, HL, LH dentfed by both Moran s I and statstc. Fgure 2 shows the HH, LL, HL, LH dentfed by Moran s I. Fgure 3 shows the HH, LL, HL, LH of Moran s I wth renderng that clearly llustrates the range of the z-scores of the dentfed clusters between the range LL < - 2.0, and the HH > 2.0. It can be seen that the statstc has dentfed a larger number of sgnfcant hot spots (157 HHs) and sgnfcant cold spots (307 LLs) than the Moran s I (102 HHs, 287 LLs). However, Moran s I has dentfed a larger number of sgnfcant outlers (79 HLs, 82 LHs) than the (48 HLs, 0.0 LHs). In addton, the extent and locatons of hot spots, cold spots and outlers dffer from one method to the other. For example, cluster # 1 s dentfed by the statstc as purely HH hot spot, whle t has been dentfed as a mxed HH and LH hot spot by Moran s I. Clusters # 3, 4, and 5 are dentfed by the as purely cold spots, whle they have been dentfed as mxed HH, LH, and non-sgnfcant hot spots by Moran s I. Clusters # 2 and 6 are dentfed as non-sgnfcant spots by both methods. Fgure 4 shows the HH, LL, HL, LH dentfed by the statstc. Fgure 5 shows the HH, LL, HL, LH of wth renderng that clearly llustrates the range of the z-scores of the dentfed clusters between the range of LL < - 2.0, and the HH > 2.0. The planar Kernel Densty Estmaton s determned by the ArcMap spatal analyst tools. Kernel Densty calculates the densty of a pont around each output raster cell, and a smoothly curved surface s ftted over each pont. Densty surfaces show where pont features are concentrated. The surface value s hghest at the locaton of the pont beng analyzed and decreases wth ncreasng dstance from the pont, reachng zero at the search radus dstance from the pont. Fgure 6 shows the hot spots dentfed by the planar KDE, and ther average dens- 207

11 Fgure 2. Hot spots and outlers by Anseln Moran s I. Fgure 3. Hot Spots by Moran s I wth z-scores renderng. Fgure 4. Hot Spots and Outlers by G* statstc. tes per km 2. It can be seen that the planar KDE has dentfed seven clusters wth dfferent crash denstes. For example, cluster # 1 contans 8 densty levels rang- 208

12 Fgure 5. Hot Spots by G* statstc wth z-scores renderng. Fgure 6. Hot Spots by Planar Kernel Densty Estmaton. Table 3. Hot spots, Cold Spots, and outlers of Indana network by Moran s I and G *. Method Hot Spot HH Cold Spot LL Outler HL Outler LH Anseln Moran s I G * statstc ng from the hghest densty value of 147 crashes/km 2 to the lowest densty value of 16 crashes/km 2. Cluster # 2 and # 3 contan 3 densty levels rangng from the hghest densty of 65 crashes/km 2 to the lowest densty of 16 crashes/km 2. Cluster # 4 and # 5 contan 2 densty levels that decreases from the hghest value of 48 crashes/km 2 to the lowest value of 16 crashes/km 2. Cluster # 6 and # 7 contan only one densty level of at least 16 crashes/km 2. The remanng whte raster area contans the mnmum densty (between 0 15 crashes/km 2 ). The network-constraned Kernel Densty Estmaton s determned usng the SANET V4.1 software [34]. Tradtonally, network events are analyzed wth spatal methods assumng Eucldean dstance on a 2-D plane, however, ths assumpton does not hold n practce when analyzng network events, because Eucldean 209

13 dstances and ther correspondng network shortest-path dstances are sgnfcantly dfferent. Alternatvely, network spatal analyss assumes the shortest-path dstance on networks that enables more practcal nvestgaton of network events than planar spatal analyss. Hence, planar KDE s lkely to lead to false conclusons when appled to network events [35]. A clear example s provded n Fgure 7, whch shows that for a road segment AB, the planar KDE consders 14 crashes wthn the crcular area surroundng the segment, whle the network KDE consders only 11 crashes on that segment. Fgure 8 shows the hot spots dentfed by the network KDE, and ther average denstes per lnear km. Fgure 9 shows the hot spots by the network KDE wth renderng of ther z-scores. It can be notced from Fgure 8 that the network KDE has dentfed dfferent clusterng patterns. For example, cluster # 1 s dentfed by the network KDE as purely hgh densty hot spot smlar to the pattern. The average densty at ths juncton s (between 4.8 to 5.7 crashes/km). Clusters # 3, 4, and 5 are dentfed by the network KDE as purely hgh densty hot spots (ther densty between 4.8 to 5.7 crashes/km), whle they have been dentfed as mxed HH, LH, and non-sgnfcant hot spots by Moran s I, and purely LL cold spots by. Clusters # 2 and 6 are dentfed by the network KDE as mxed hgh densty hot spots and low densty spots (densty for the low spots s between 1.8 to 2.7 crashes/km, and densty for the hgh spots s between 4.8 to 5.8 crashes/km), whle they are dentfed as non-sgnfcant spots by both Moran s I and statstc. The crash densty for the cold spots s between 2.8 to 4.7 crashes/km as shown n Fgure 8. Snce each method has dentfed dfferent clusterng patterns, therefore we recommend usng a combnaton of these methods n hot spot analyss. Comparable results can show more dverse and flexble nterpretatons among the clusterng patterns. Usng only one method can result n msleadng conclusons. For example, as we saw above, cluster # 1 s dentfed as a pure HH hot spot n Fgure 7. An example of dfferent outcomes between the planar KDE and the network KDE. 210

14 Fgure 8. Hot Spots by Network Kernel Densty Estmaton. Fgure 9. Hot Spots by network KDE wth z-score renderng. and hgh densty spot n network KDE (densty from 4.8 to 5.7 crashes/km), whle t s dentfed as a mxed HH and LH n Moran s I. Lkewse, cluster # 2 s dentfed as a pure non-sgnfcant spot n both Moran s I and, whle t s a mxed HH densty spot (wth densty from 4.8 to 5.7 crashes/km) and low densty spot (from 1.8 to 2.7 crashes/km) n network KDE. Hence, usng a combnaton of these methods would probably produce more relable results than usng one method alone. Table 4 shows a comparson between some characterstcs of Moran s I, 9. Concluson statstc, and network KDE n dentfyng hot spots. Hot pot analyss focuses on hghlghtng areas whch have hgher than average ncdence of events, and t s a valuable technque for vsualzng the concentraton of events on networks. Ths paper presented two methods: Moran s I and Gets-Ord statstc based on network spatal autocorrelaton and another thrd method, kernel densty estmaton (KDE) to examne the spatal patterns of 211

15 Table 4. Comparson between Moran s I, G* statstc, and network KDE. Moran s I G* statstc Network KDE measures spatal correlaton, dentfes hot spots wth hgh-hgh values, and cold spots wth low-low values Identfes outlers (dspersed ncdents) wth hgh-low values and low-hgh values Lookng at the value n the context of ts neghbors values wthn the nverse dstance between locatons does not nclude the nteracton of a zone wth tself but only wth ts neghborhoods n measurng spatal correlaton reports an ndex-value, and a z-score measures spatal correlaton, dentfes hot spots wth hgh-hgh values, and cold spots wth low-low values does not dentfy outlers Lookng at the value n the context of ts neghbors values that fall wthn a specfed dstance of each other ncludes the nteracton of a zone wth tself n addton to ts neghborhoods n measurng spatal correlaton reports a combned ndex-value and a z-score measures probablty densty functon, dentfes only hot spots n term of densty per lnear dstance unt does not dentfy outlers conduct densty calculaton based on the user-specfed search radus and raster cell sze does not nclude the nteracton of a zone wth tself but only wth ts neghborhoods n measurng kernel densty reports a lnear densty value, and a z-score reports a p-value reports a p-value does not report a p-value presents the statstcal sgnfcance of clusterng presents the statstcal sgnfcance of clusterng does not present the statstcal sgnfcance of clusterng ranges from -1.0 to ranges from 0.0 to any postve value vehcle crashes and determnes f they are spatally clustered, dspersed, or random usng the 2013 vehcle crash data n the state of Indana. The Global values of both Moran s I and showed that t s qute possble that the spatal dstrbuton of the overall network crashes s the result of clusterng patterns, and there s less than 1% probablty that ths pattern could be the result of random process. The local Moran s I and the road network. The dentfed dfferent clusterng patterns on statstc has dentfed a larger number of sgnfcant hot spots (157 HHs) and sgnfcant cold spots (307 LLs) than the Moran s I (102 HHs, 287 LLs). However, Moran s I has dentfed a larger number of sgnfcant outlers (79 HLs, 82 LHs) than the (48 HLs, 0.0 LHs). The kernel densty estmaton s evaluated as planar KDE and network KDE. In planar KDE, the space s characterzed as a 2-D homogeneous Eucldan space that does not hold when analyzng network events. Therefore, the planar KDE has been extended to the network KDE, whch s a 1-D measurement whle the planar KDE s a 2-D measurement. In applyng the planar KDE to the network, t dentfed seven clusters wth dfferent crash denstes. Cluster # 1 contaned 8 densty levels, cluster # 2 and # 3 contaned 3 densty levels, cluster # 4 and # 5 contaned 2 densty levels, and cluster # 6 and # 7 contaned only one densty level. The network KDE dentfed dfferent patterns of clusters than what Moran s I and statstc have dentfed. For example, cluster # 1 s dentfed as a pure HH hot spot n both and network KDE (densty from 4.8 to 5.7 crashes/km), whle t s dentfed as a mxed HH and LH n Moran s I. Lkewse, cluster # 2 s den- 212

16 tfed as a pure non-sgnfcant spot n both Moran s I and, whle t s a mxed HH densty spot (wth densty from 4.8 to 5.7 crashes/km) and low densty spot (from 1.8 to 2.7 crashes/km) n network KDE. Snce each method has dentfed dfferent clusterng patterns, therefore ths paper recommends usng a combnaton of these methods n hot spot analyss. Comparable results from these methods can produce more relable nterpretatons among the clusterng patterns. Usng only one method can probably produce msleadng results. References [1] Baley, T.C. and Gatrell, A.C. (1995) Interactve Spatal Data Analyss. Addson Wesley Longman Ltd., Harlow, England. [2] Tobler, W.R. (1970) A Computer Move Smulatng Urban Growth n the Detrot Regon. Economc Geography, 46, [3] Black, W. (1992) Network Autocorrelaton n Transport Network and Flow Systems. Geographcal Analyss, 24, [4] Clff, A.D. and Ord, J.K. (1981) Spatal Processes: Models and Applcatons. Pon, London, UK. [5] Fscher, M. and Wang, J. (2011) Spatal Data Analyss: Models, Methods, and Technques. Sprnger, New York. [6] Anseln, L. (1995) Local Indcators of Spatal Assocaton LISA. Geographcal Analyss, 27, [7] Yamada, I. and Thll, J.C. (2004) Comparson of Planar and Network K-Functons n Traffc Accdent Analyss. Journal of Transport Geography, 12, [8] Xe, Z. and Yan, J. (2008) Kernel Densty Estmaton of Traffc Accdents n a Network Space. Computers, Envronment and Urban Systems, 32, [9] Okabe, A., Satoh, T. and Sughara, K. (2009) A Kernel Densty Estmaton Method for Networks, Its Computatonal Method and a GIS-Based Tool. Internatonal Journal of Geographcal Informaton Scence, 23, [10] Anderson, T.K. (2009) Kernel Densty Estmaton and K-Means Clusterng to Profle Road Accdent Hot Spots. Accdent Analyss & Preventon, 41, [11] Black, W. and Thomas, I. (1998) Accdents on Belgum s Motorway: A Network Autocorrelaton Analyss. Journal of Transport Geography, 6, [12] Flahaut, B. (2004) Impact of Infrastructure and Local Envronment on Road Unsafety: Logstc Modelng wth Spatal Autocorrelaton. Accdent Analyss & Preventon, 36, [13] Yamada, I. and Thll, J.C. (2007) Local Indcators of Network-Constraned Clusters n Spatal Pont Patterns. Geographcal Analyss, 39, [14] Schwetzer, L. (2006) Envronmental Justce and Hazmat Transport: A Spatal Analyss n Southern Calforna. Transportaton Research Part D: Transport and Envronment, 11, [15] Erdogan, S., Ylmaz, I., Baybura, T. and Gullu, M. (2008) Geographcal Informaton Systems Aded Traffc Accdent Analyss System Case Study: Cty of Afyonkarahsar. Accdent Analyss & Preventon, 40,

17 [16] Yamada, I. and Thll, J.C. (2010) Local Indcators of Network-Constraned Clusters n Spatal Patterns Represented by a Lnk Attrbute. Annals of the Assocaton of Amercan Geographers, 100, [17] Moran, P. (1948) The Interpretaton of Statstcal Maps. Journal of the Royal Statstcal Socety, 10, [18] Anseln, L. (1992) Space Stat: A Program for the Statstcal Analyss of Spatal Data. Natonal Center for Geographc Informaton and Analyss, Unversty of Calforna, Santa Barbara, CA. [19] Goodchld, M.F. (1987) Spatal Autocorrelaton. Concepts and Technques n Modern Geography, 48, [20] Grffth, D.A. (1987) Spatal Autocorrelaton: A Prmer. Resource Publcatons n Geography, The Assocaton of Amercan Geographers, Washngton DC. [21] Gets, A. and Ord, J.K. (1992) The Analyss of Spatal Assocaton by Use of Dstance Statstcs. Geographcal Analyss, 24, [22] Gets, A. and Ord, J.K. (1996) Local Spatal Statstcs: An Overvew. Geo Informaton Internatonal, Cambrdge, England. [23] Berglund, S. and Karlstrom, A. (1999) Identfyng Local Spatal Assocaton n Flow Data Geographcal Systems. Transportaton Geography, 1, [24] Lee, J. and Wong, D.W.S. (2005) Statstcal Analyss wth ArcVew GIS and ArcGIS. Wley & Sons, Inc., New York. [25] Mobley, L.R., Kuo, T.M., Drscoll, D., Clayton, L. and Anseln, L. (2008) Heterogenety n Mammography Use across the Naton: Separatng Evdence of Dspartes from the Dsproportonate Effects of Geography. Internatonal Journal of Health Geographcs, 7, [26] Slverman, B.W. (1986) Densty Estmaton for Statstcs and Data Analyss. Chapman Hall, London. [27] Levne, N. (2004) CrmeStat III: A Spatal Statstcs Program for the Analyss of Crme Incdent Locatons. Ned Levne & Assocates The Natonal Insttute of Justce, Houston, TX; Washngton DC. [28] Gbn, M., Longley, P. and Atknson, P. (2007) Kernel Densty Estmaton and Percent Volume Contours n General Practce Catchment Area Analyss n Urban Areas. Proceedngs of the GIScence Research UK Conference (GISRUK) 2007, Maynooth, Ireland. [29] Schabenberger, O. and Gotway, C.A. (2005) Statstcal Methods for Spatal Data Analyss. Chapman & Hall/CRC, Boca Raton, FL. [30] O Sullvan, D. and Unwn, D.J. (2002) Geographc Informaton Analyss. John Wley, Hoboken, NJ. [31] O Sullvan, D. and Wong, D.W.S. (2007) A Surface-Based Approach to Measurng Spatal Segregaton. Geographc Analyss, 39, [32] Scott, D.W. (1992) Multvarate Densty Estmaton, Theory, Practce, and Vsualzaton. John Wley & Sons, New York. [33] ArcGIS Resources. [34] Okabe, A., Okunuk, K. and Shode, S. (2006) The SANET Toolbox: New Methods for Network Spatal Analyss. Transactons n GIS, 10,

18 [35] Okabe, A. and Sughara, K. (2012) Spatal Analyss along Networks: Statstcal and Computatonal Methods. John Wley & Sons, New York. Submt or recommend next manuscrpt to SCIRP and we wll provde best servce for you: Acceptng pre-submsson nqures through Emal, Facebook, LnkedIn, Twtter, etc. A wde selecton of journals (nclusve of 9 subjects, more than 200 journals) Provdng 24-hour hgh-qualty servce User-frendly onlne submsson system Far and swft peer-revew system Effcent typesettng and proofreadng procedure Dsplay of the result of downloads and vsts, as well as the number of cted artcles Maxmum dssemnaton of your research work Submt your manuscrpt at: Or contact wjet@scrp.org 215

Comparison of Regression Lines

Comparison of Regression Lines STATGRAPHICS Rev. 9/13/2013 Comparson of Regresson Lnes Summary... 1 Data Input... 3 Analyss Summary... 4 Plot of Ftted Model... 6 Condtonal Sums of Squares... 6 Analyss Optons... 7 Forecasts... 8 Confdence

More information

Spatial Statistics and Analysis Methods (for GEOG 104 class).

Spatial Statistics and Analysis Methods (for GEOG 104 class). Spatal Statstcs and Analyss Methods (for GEOG 104 class). Provded by Dr. An L, San Dego State Unversty. 1 Ponts Types of spatal data Pont pattern analyss (PPA; such as nearest neghbor dstance, quadrat

More information

Comparison of the Population Variance Estimators. of 2-Parameter Exponential Distribution Based on. Multiple Criteria Decision Making Method

Comparison of the Population Variance Estimators. of 2-Parameter Exponential Distribution Based on. Multiple Criteria Decision Making Method Appled Mathematcal Scences, Vol. 7, 0, no. 47, 07-0 HIARI Ltd, www.m-hkar.com Comparson of the Populaton Varance Estmators of -Parameter Exponental Dstrbuton Based on Multple Crtera Decson Makng Method

More information

2016 Wiley. Study Session 2: Ethical and Professional Standards Application

2016 Wiley. Study Session 2: Ethical and Professional Standards Application 6 Wley Study Sesson : Ethcal and Professonal Standards Applcaton LESSON : CORRECTION ANALYSIS Readng 9: Correlaton and Regresson LOS 9a: Calculate and nterpret a sample covarance and a sample correlaton

More information

Here is the rationale: If X and y have a strong positive relationship to one another, then ( x x) will tend to be positive when ( y y)

Here is the rationale: If X and y have a strong positive relationship to one another, then ( x x) will tend to be positive when ( y y) Secton 1.5 Correlaton In the prevous sectons, we looked at regresson and the value r was a measurement of how much of the varaton n y can be attrbuted to the lnear relatonshp between y and x. In ths secton,

More information

Statistics for Economics & Business

Statistics for Economics & Business Statstcs for Economcs & Busness Smple Lnear Regresson Learnng Objectves In ths chapter, you learn: How to use regresson analyss to predct the value of a dependent varable based on an ndependent varable

More information

Negative Binomial Regression

Negative Binomial Regression STATGRAPHICS Rev. 9/16/2013 Negatve Bnomal Regresson Summary... 1 Data Input... 3 Statstcal Model... 3 Analyss Summary... 4 Analyss Optons... 7 Plot of Ftted Model... 8 Observed Versus Predcted... 10 Predctons...

More information

x = , so that calculated

x = , so that calculated Stat 4, secton Sngle Factor ANOVA notes by Tm Plachowsk n chapter 8 we conducted hypothess tests n whch we compared a sngle sample s mean or proporton to some hypotheszed value Chapter 9 expanded ths to

More information

/ n ) are compared. The logic is: if the two

/ n ) are compared. The logic is: if the two STAT C141, Sprng 2005 Lecture 13 Two sample tests One sample tests: examples of goodness of ft tests, where we are testng whether our data supports predctons. Two sample tests: called as tests of ndependence

More information

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6 Department of Quanttatve Methods & Informaton Systems Tme Seres and Ther Components QMIS 30 Chapter 6 Fall 00 Dr. Mohammad Zanal These sldes were modfed from ther orgnal source for educatonal purpose only.

More information

On Graphs with Same Distance Distribution

On Graphs with Same Distance Distribution Appled Mathematcs, 07, 8, 799-807 http://wwwscrporg/journal/am ISSN Onlne: 5-7393 ISSN Prnt: 5-7385 On Graphs wth Same Dstance Dstrbuton Xulang Qu, Xaofeng Guo,3 Chengy Unversty College, Jme Unversty,

More information

Simulated Power of the Discrete Cramér-von Mises Goodness-of-Fit Tests

Simulated Power of the Discrete Cramér-von Mises Goodness-of-Fit Tests Smulated of the Cramér-von Mses Goodness-of-Ft Tests Steele, M., Chaselng, J. and 3 Hurst, C. School of Mathematcal and Physcal Scences, James Cook Unversty, Australan School of Envronmental Studes, Grffth

More information

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation Statstcs for Managers Usng Mcrosoft Excel/SPSS Chapter 13 The Smple Lnear Regresson Model and Correlaton 1999 Prentce-Hall, Inc. Chap. 13-1 Chapter Topcs Types of Regresson Models Determnng the Smple Lnear

More information

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore Sesson Outlne Introducton to classfcaton problems and dscrete choce models. Introducton to Logstcs Regresson. Logstc functon and Logt functon. Maxmum Lkelhood Estmator (MLE) for estmaton of LR parameters.

More information

Uncertainty as the Overlap of Alternate Conditional Distributions

Uncertainty as the Overlap of Alternate Conditional Distributions Uncertanty as the Overlap of Alternate Condtonal Dstrbutons Olena Babak and Clayton V. Deutsch Centre for Computatonal Geostatstcs Department of Cvl & Envronmental Engneerng Unversty of Alberta An mportant

More information

Statistics for Business and Economics

Statistics for Business and Economics Statstcs for Busness and Economcs Chapter 11 Smple Regresson Copyrght 010 Pearson Educaton, Inc. Publshng as Prentce Hall Ch. 11-1 11.1 Overvew of Lnear Models n An equaton can be ft to show the best lnear

More information

Chapter 11: Simple Linear Regression and Correlation

Chapter 11: Simple Linear Regression and Correlation Chapter 11: Smple Lnear Regresson and Correlaton 11-1 Emprcal Models 11-2 Smple Lnear Regresson 11-3 Propertes of the Least Squares Estmators 11-4 Hypothess Test n Smple Lnear Regresson 11-4.1 Use of t-tests

More information

Module 3 LOSSY IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur

Module 3 LOSSY IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur Module 3 LOSSY IMAGE COMPRESSION SYSTEMS Verson ECE IIT, Kharagpur Lesson 6 Theory of Quantzaton Verson ECE IIT, Kharagpur Instructonal Objectves At the end of ths lesson, the students should be able to:

More information

BOOTSTRAP METHOD FOR TESTING OF EQUALITY OF SEVERAL MEANS. M. Krishna Reddy, B. Naveen Kumar and Y. Ramu

BOOTSTRAP METHOD FOR TESTING OF EQUALITY OF SEVERAL MEANS. M. Krishna Reddy, B. Naveen Kumar and Y. Ramu BOOTSTRAP METHOD FOR TESTING OF EQUALITY OF SEVERAL MEANS M. Krshna Reddy, B. Naveen Kumar and Y. Ramu Department of Statstcs, Osmana Unversty, Hyderabad -500 007, Inda. nanbyrozu@gmal.com, ramu0@gmal.com

More information

Chapter 13: Multiple Regression

Chapter 13: Multiple Regression Chapter 13: Multple Regresson 13.1 Developng the multple-regresson Model The general model can be descrbed as: It smplfes for two ndependent varables: The sample ft parameter b 0, b 1, and b are used to

More information

Statistical Inference. 2.3 Summary Statistics Measures of Center and Spread. parameters ( population characteristics )

Statistical Inference. 2.3 Summary Statistics Measures of Center and Spread. parameters ( population characteristics ) Ismor Fscher, 8//008 Stat 54 / -8.3 Summary Statstcs Measures of Center and Spread Dstrbuton of dscrete contnuous POPULATION Random Varable, numercal True center =??? True spread =???? parameters ( populaton

More information

STAT 511 FINAL EXAM NAME Spring 2001

STAT 511 FINAL EXAM NAME Spring 2001 STAT 5 FINAL EXAM NAME Sprng Instructons: Ths s a closed book exam. No notes or books are allowed. ou may use a calculator but you are not allowed to store notes or formulas n the calculator. Please wrte

More information

Lecture 6: Introduction to Linear Regression

Lecture 6: Introduction to Linear Regression Lecture 6: Introducton to Lnear Regresson An Manchakul amancha@jhsph.edu 24 Aprl 27 Lnear regresson: man dea Lnear regresson can be used to study an outcome as a lnear functon of a predctor Example: 6

More information

Kernel Methods and SVMs Extension

Kernel Methods and SVMs Extension Kernel Methods and SVMs Extenson The purpose of ths document s to revew materal covered n Machne Learnng 1 Supervsed Learnng regardng support vector machnes (SVMs). Ths document also provdes a general

More information

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands Content. Inference on Regresson Parameters a. Fndng Mean, s.d and covarance amongst estmates.. Confdence Intervals and Workng Hotellng Bands 3. Cochran s Theorem 4. General Lnear Testng 5. Measures of

More information

x i1 =1 for all i (the constant ).

x i1 =1 for all i (the constant ). Chapter 5 The Multple Regresson Model Consder an economc model where the dependent varable s a functon of K explanatory varables. The economc model has the form: y = f ( x,x,..., ) xk Approxmate ths by

More information

Chapter 3 Describing Data Using Numerical Measures

Chapter 3 Describing Data Using Numerical Measures Chapter 3 Student Lecture Notes 3-1 Chapter 3 Descrbng Data Usng Numercal Measures Fall 2006 Fundamentals of Busness Statstcs 1 Chapter Goals To establsh the usefulness of summary measures of data. The

More information

Statistics Chapter 4

Statistics Chapter 4 Statstcs Chapter 4 "There are three knds of les: les, damned les, and statstcs." Benjamn Dsrael, 1895 (Brtsh statesman) Gaussan Dstrbuton, 4-1 If a measurement s repeated many tmes a statstcal treatment

More information

BIO Lab 2: TWO-LEVEL NORMAL MODELS with school children popularity data

BIO Lab 2: TWO-LEVEL NORMAL MODELS with school children popularity data Lab : TWO-LEVEL NORMAL MODELS wth school chldren popularty data Purpose: Introduce basc two-level models for normally dstrbuted responses usng STATA. In partcular, we dscuss Random ntercept models wthout

More information

Chapter 8 Indicator Variables

Chapter 8 Indicator Variables Chapter 8 Indcator Varables In general, e explanatory varables n any regresson analyss are assumed to be quanttatve n nature. For example, e varables lke temperature, dstance, age etc. are quanttatve n

More information

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4)

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4) I. Classcal Assumptons Econ7 Appled Econometrcs Topc 3: Classcal Model (Studenmund, Chapter 4) We have defned OLS and studed some algebrac propertes of OLS. In ths topc we wll study statstcal propertes

More information

UNIVERSITY OF TORONTO Faculty of Arts and Science. December 2005 Examinations STA437H1F/STA1005HF. Duration - 3 hours

UNIVERSITY OF TORONTO Faculty of Arts and Science. December 2005 Examinations STA437H1F/STA1005HF. Duration - 3 hours UNIVERSITY OF TORONTO Faculty of Arts and Scence December 005 Examnatons STA47HF/STA005HF Duraton - hours AIDS ALLOWED: (to be suppled by the student) Non-programmable calculator One handwrtten 8.5'' x

More information

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U)

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U) Econ 413 Exam 13 H ANSWERS Settet er nndelt 9 deloppgaver, A,B,C, som alle anbefales å telle lkt for å gøre det ltt lettere å stå. Svar er gtt . Unfortunately, there s a prntng error n the hnt of

More information

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA 4 Analyss of Varance (ANOVA) 5 ANOVA 51 Introducton ANOVA ANOVA s a way to estmate and test the means of multple populatons We wll start wth one-way ANOVA If the populatons ncluded n the study are selected

More information

Basic Business Statistics, 10/e

Basic Business Statistics, 10/e Chapter 13 13-1 Basc Busness Statstcs 11 th Edton Chapter 13 Smple Lnear Regresson Basc Busness Statstcs, 11e 009 Prentce-Hall, Inc. Chap 13-1 Learnng Objectves In ths chapter, you learn: How to use regresson

More information

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding Recall: man dea of lnear regresson Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 8 Lnear regresson can be used to study an

More information

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 008 Recall: man dea of lnear regresson Lnear regresson can be used to study

More information

Correlation and Regression. Correlation 9.1. Correlation. Chapter 9

Correlation and Regression. Correlation 9.1. Correlation. Chapter 9 Chapter 9 Correlaton and Regresson 9. Correlaton Correlaton A correlaton s a relatonshp between two varables. The data can be represented b the ordered pars (, ) where s the ndependent (or eplanator) varable,

More information

Copyright 2017 by Taylor Enterprises, Inc., All Rights Reserved. Adjusted Control Limits for P Charts. Dr. Wayne A. Taylor

Copyright 2017 by Taylor Enterprises, Inc., All Rights Reserved. Adjusted Control Limits for P Charts. Dr. Wayne A. Taylor Taylor Enterprses, Inc. Control Lmts for P Charts Copyrght 2017 by Taylor Enterprses, Inc., All Rghts Reserved. Control Lmts for P Charts Dr. Wayne A. Taylor Abstract: P charts are used for count data

More information

The Order Relation and Trace Inequalities for. Hermitian Operators

The Order Relation and Trace Inequalities for. Hermitian Operators Internatonal Mathematcal Forum, Vol 3, 08, no, 507-57 HIKARI Ltd, wwwm-hkarcom https://doorg/0988/mf088055 The Order Relaton and Trace Inequaltes for Hermtan Operators Y Huang School of Informaton Scence

More information

Chapter 9: Statistical Inference and the Relationship between Two Variables

Chapter 9: Statistical Inference and the Relationship between Two Variables Chapter 9: Statstcal Inference and the Relatonshp between Two Varables Key Words The Regresson Model The Sample Regresson Equaton The Pearson Correlaton Coeffcent Learnng Outcomes After studyng ths chapter,

More information

Chapter 5 Multilevel Models

Chapter 5 Multilevel Models Chapter 5 Multlevel Models 5.1 Cross-sectonal multlevel models 5.1.1 Two-level models 5.1.2 Multple level models 5.1.3 Multple level modelng n other felds 5.2 Longtudnal multlevel models 5.2.1 Two-level

More information

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers Psychology 282 Lecture #24 Outlne Regresson Dagnostcs: Outlers In an earler lecture we studed the statstcal assumptons underlyng the regresson model, ncludng the followng ponts: Formal statement of assumptons.

More information

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition)

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition) Count Data Models See Book Chapter 11 2 nd Edton (Chapter 10 1 st Edton) Count data consst of non-negatve nteger values Examples: number of drver route changes per week, the number of trp departure changes

More information

Semiparametric geographically weighted generalised linear modelling in GWR 4.0

Semiparametric geographically weighted generalised linear modelling in GWR 4.0 Semparametrc geographcally weghted generalsed lnear modellng n GWR 4.0 T. Nakaya 1, A. S. Fotherngham 2, M. Charlton 2, C. Brunsdon 3 1 Department of Geography, Rtsumekan Unversty, 56-1 Tojn-kta-mach,

More information

Linear Approximation with Regularization and Moving Least Squares

Linear Approximation with Regularization and Moving Least Squares Lnear Approxmaton wth Regularzaton and Movng Least Squares Igor Grešovn May 007 Revson 4.6 (Revson : March 004). 5 4 3 0.5 3 3.5 4 Contents: Lnear Fttng...4. Weghted Least Squares n Functon Approxmaton...

More information

Structure and Drive Paul A. Jensen Copyright July 20, 2003

Structure and Drive Paul A. Jensen Copyright July 20, 2003 Structure and Drve Paul A. Jensen Copyrght July 20, 2003 A system s made up of several operatons wth flow passng between them. The structure of the system descrbes the flow paths from nputs to outputs.

More information

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction ECONOMICS 5* -- NOTE (Summary) ECON 5* -- NOTE The Multple Classcal Lnear Regresson Model (CLRM): Specfcaton and Assumptons. Introducton CLRM stands for the Classcal Lnear Regresson Model. The CLRM s also

More information

The Geometry of Logit and Probit

The Geometry of Logit and Probit The Geometry of Logt and Probt Ths short note s meant as a supplement to Chapters and 3 of Spatal Models of Parlamentary Votng and the notaton and reference to fgures n the text below s to those two chapters.

More information

A Robust Method for Calculating the Correlation Coefficient

A Robust Method for Calculating the Correlation Coefficient A Robust Method for Calculatng the Correlaton Coeffcent E.B. Nven and C. V. Deutsch Relatonshps between prmary and secondary data are frequently quantfed usng the correlaton coeffcent; however, the tradtonal

More information

j) = 1 (note sigma notation) ii. Continuous random variable (e.g. Normal distribution) 1. density function: f ( x) 0 and f ( x) dx = 1

j) = 1 (note sigma notation) ii. Continuous random variable (e.g. Normal distribution) 1. density function: f ( x) 0 and f ( x) dx = 1 Random varables Measure of central tendences and varablty (means and varances) Jont densty functons and ndependence Measures of assocaton (covarance and correlaton) Interestng result Condtonal dstrbutons

More information

Economics 130. Lecture 4 Simple Linear Regression Continued

Economics 130. Lecture 4 Simple Linear Regression Continued Economcs 130 Lecture 4 Contnued Readngs for Week 4 Text, Chapter and 3. We contnue wth addressng our second ssue + add n how we evaluate these relatonshps: Where do we get data to do ths analyss? How do

More information

ANOMALIES OF THE MAGNITUDE OF THE BIAS OF THE MAXIMUM LIKELIHOOD ESTIMATOR OF THE REGRESSION SLOPE

ANOMALIES OF THE MAGNITUDE OF THE BIAS OF THE MAXIMUM LIKELIHOOD ESTIMATOR OF THE REGRESSION SLOPE P a g e ANOMALIES OF THE MAGNITUDE OF THE BIAS OF THE MAXIMUM LIKELIHOOD ESTIMATOR OF THE REGRESSION SLOPE Darmud O Drscoll ¹, Donald E. Ramrez ² ¹ Head of Department of Mathematcs and Computer Studes

More information

Resource Allocation and Decision Analysis (ECON 8010) Spring 2014 Foundations of Regression Analysis

Resource Allocation and Decision Analysis (ECON 8010) Spring 2014 Foundations of Regression Analysis Resource Allocaton and Decson Analss (ECON 800) Sprng 04 Foundatons of Regresson Analss Readng: Regresson Analss (ECON 800 Coursepak, Page 3) Defntons and Concepts: Regresson Analss statstcal technques

More information

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics ECOOMICS 35*-A Md-Term Exam -- Fall Term 000 Page of 3 pages QUEE'S UIVERSITY AT KIGSTO Department of Economcs ECOOMICS 35* - Secton A Introductory Econometrcs Fall Term 000 MID-TERM EAM ASWERS MG Abbott

More information

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X Statstcs 1: Probablty Theory II 37 3 EPECTATION OF SEVERAL RANDOM VARIABLES As n Probablty Theory I, the nterest n most stuatons les not on the actual dstrbuton of a random vector, but rather on a number

More information

Statistics II Final Exam 26/6/18

Statistics II Final Exam 26/6/18 Statstcs II Fnal Exam 26/6/18 Academc Year 2017/18 Solutons Exam duraton: 2 h 30 mn 1. (3 ponts) A town hall s conductng a study to determne the amount of leftover food produced by the restaurants n the

More information

Spatial Modelling of Peak Frequencies of Brain Signals

Spatial Modelling of Peak Frequencies of Brain Signals Malaysan Journal of Mathematcal Scences 3(1): 13-6 (9) Spatal Modellng of Peak Frequences of Bran Sgnals 1 Mahendran Shtan, Hernando Ombao, 1 Kok We Lng 1 Department of Mathematcs, Faculty of Scence, and

More information

Lecture Notes on Linear Regression

Lecture Notes on Linear Regression Lecture Notes on Lnear Regresson Feng L fl@sdueducn Shandong Unversty, Chna Lnear Regresson Problem In regresson problem, we am at predct a contnuous target value gven an nput feature vector We assume

More information

Computation of Higher Order Moments from Two Multinomial Overdispersion Likelihood Models

Computation of Higher Order Moments from Two Multinomial Overdispersion Likelihood Models Computaton of Hgher Order Moments from Two Multnomal Overdsperson Lkelhood Models BY J. T. NEWCOMER, N. K. NEERCHAL Department of Mathematcs and Statstcs, Unversty of Maryland, Baltmore County, Baltmore,

More information

Linear Regression Analysis: Terminology and Notation

Linear Regression Analysis: Terminology and Notation ECON 35* -- Secton : Basc Concepts of Regresson Analyss (Page ) Lnear Regresson Analyss: Termnology and Notaton Consder the generc verson of the smple (two-varable) lnear regresson model. It s represented

More information

Lecture 12: Classification

Lecture 12: Classification Lecture : Classfcaton g Dscrmnant functons g The optmal Bayes classfer g Quadratc classfers g Eucldean and Mahalanobs metrcs g K Nearest Neghbor Classfers Intellgent Sensor Systems Rcardo Guterrez-Osuna

More information

Composite Hypotheses testing

Composite Hypotheses testing Composte ypotheses testng In many hypothess testng problems there are many possble dstrbutons that can occur under each of the hypotheses. The output of the source s a set of parameters (ponts n a parameter

More information

Topic 23 - Randomized Complete Block Designs (RCBD)

Topic 23 - Randomized Complete Block Designs (RCBD) Topc 3 ANOVA (III) 3-1 Topc 3 - Randomzed Complete Block Desgns (RCBD) Defn: A Randomzed Complete Block Desgn s a varant of the completely randomzed desgn (CRD) that we recently learned. In ths desgn,

More information

Research Article Green s Theorem for Sign Data

Research Article Green s Theorem for Sign Data Internatonal Scholarly Research Network ISRN Appled Mathematcs Volume 2012, Artcle ID 539359, 10 pages do:10.5402/2012/539359 Research Artcle Green s Theorem for Sgn Data Lous M. Houston The Unversty of

More information

ANOVA. The Observations y ij

ANOVA. The Observations y ij ANOVA Stands for ANalyss Of VArance But t s a test of dfferences n means The dea: The Observatons y j Treatment group = 1 = 2 = k y 11 y 21 y k,1 y 12 y 22 y k,2 y 1, n1 y 2, n2 y k, nk means: m 1 m 2

More information

Cathy Walker March 5, 2010

Cathy Walker March 5, 2010 Cathy Walker March 5, 010 Part : Problem Set 1. What s the level of measurement for the followng varables? a) SAT scores b) Number of tests or quzzes n statstcal course c) Acres of land devoted to corn

More information

Chapter 15 Student Lecture Notes 15-1

Chapter 15 Student Lecture Notes 15-1 Chapter 15 Student Lecture Notes 15-1 Basc Busness Statstcs (9 th Edton) Chapter 15 Multple Regresson Model Buldng 004 Prentce-Hall, Inc. Chap 15-1 Chapter Topcs The Quadratc Regresson Model Usng Transformatons

More information

CHAPTER 6 GOODNESS OF FIT AND CONTINGENCY TABLE PREPARED BY: DR SITI ZANARIAH SATARI & FARAHANIM MISNI

CHAPTER 6 GOODNESS OF FIT AND CONTINGENCY TABLE PREPARED BY: DR SITI ZANARIAH SATARI & FARAHANIM MISNI CHAPTER 6 GOODNESS OF FIT AND CONTINGENCY TABLE Expected Outcomes Able to test the goodness of ft for categorcal data. Able to test whether the categorcal data ft to the certan dstrbuton such as Bnomal,

More information

STAT 3008 Applied Regression Analysis

STAT 3008 Applied Regression Analysis STAT 3008 Appled Regresson Analyss Tutoral : Smple Lnear Regresson LAI Chun He Department of Statstcs, The Chnese Unversty of Hong Kong 1 Model Assumpton To quantfy the relatonshp between two factors,

More information

Durban Watson for Testing the Lack-of-Fit of Polynomial Regression Models without Replications

Durban Watson for Testing the Lack-of-Fit of Polynomial Regression Models without Replications Durban Watson for Testng the Lack-of-Ft of Polynomal Regresson Models wthout Replcatons Ruba A. Alyaf, Maha A. Omar, Abdullah A. Al-Shha ralyaf@ksu.edu.sa, maomar@ksu.edu.sa, aalshha@ksu.edu.sa Department

More information

Answers Problem Set 2 Chem 314A Williamsen Spring 2000

Answers Problem Set 2 Chem 314A Williamsen Spring 2000 Answers Problem Set Chem 314A Wllamsen Sprng 000 1) Gve me the followng crtcal values from the statstcal tables. a) z-statstc,-sded test, 99.7% confdence lmt ±3 b) t-statstc (Case I), 1-sded test, 95%

More information

Using the estimated penetrances to determine the range of the underlying genetic model in casecontrol

Using the estimated penetrances to determine the range of the underlying genetic model in casecontrol Georgetown Unversty From the SelectedWorks of Mark J Meyer 8 Usng the estmated penetrances to determne the range of the underlyng genetc model n casecontrol desgn Mark J Meyer Neal Jeffres Gang Zheng Avalable

More information

DERIVATION OF THE PROBABILITY PLOT CORRELATION COEFFICIENT TEST STATISTICS FOR THE GENERALIZED LOGISTIC DISTRIBUTION

DERIVATION OF THE PROBABILITY PLOT CORRELATION COEFFICIENT TEST STATISTICS FOR THE GENERALIZED LOGISTIC DISTRIBUTION Internatonal Worshop ADVANCES IN STATISTICAL HYDROLOGY May 3-5, Taormna, Italy DERIVATION OF THE PROBABILITY PLOT CORRELATION COEFFICIENT TEST STATISTICS FOR THE GENERALIZED LOGISTIC DISTRIBUTION by Sooyoung

More information

Using T.O.M to Estimate Parameter of distributions that have not Single Exponential Family

Using T.O.M to Estimate Parameter of distributions that have not Single Exponential Family IOSR Journal of Mathematcs IOSR-JM) ISSN: 2278-5728. Volume 3, Issue 3 Sep-Oct. 202), PP 44-48 www.osrjournals.org Usng T.O.M to Estmate Parameter of dstrbutons that have not Sngle Exponental Famly Jubran

More information

Introduction to Regression

Introduction to Regression Introducton to Regresson Dr Tom Ilvento Department of Food and Resource Economcs Overvew The last part of the course wll focus on Regresson Analyss Ths s one of the more powerful statstcal technques Provdes

More information

Homework Assignment 3 Due in class, Thursday October 15

Homework Assignment 3 Due in class, Thursday October 15 Homework Assgnment 3 Due n class, Thursday October 15 SDS 383C Statstcal Modelng I 1 Rdge regresson and Lasso 1. Get the Prostrate cancer data from http://statweb.stanford.edu/~tbs/elemstatlearn/ datasets/prostate.data.

More information

2E Pattern Recognition Solutions to Introduction to Pattern Recognition, Chapter 2: Bayesian pattern classification

2E Pattern Recognition Solutions to Introduction to Pattern Recognition, Chapter 2: Bayesian pattern classification E395 - Pattern Recognton Solutons to Introducton to Pattern Recognton, Chapter : Bayesan pattern classfcaton Preface Ths document s a soluton manual for selected exercses from Introducton to Pattern Recognton

More information

A Comparative Study for Estimation Parameters in Panel Data Model

A Comparative Study for Estimation Parameters in Panel Data Model A Comparatve Study for Estmaton Parameters n Panel Data Model Ahmed H. Youssef and Mohamed R. Abonazel hs paper examnes the panel data models when the regresson coeffcents are fxed random and mxed and

More information

January Examinations 2015

January Examinations 2015 24/5 Canddates Only January Examnatons 25 DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR STUDENT CANDIDATE NO.. Department Module Code Module Ttle Exam Duraton (n words)

More information

Chat eld, C. and A.J.Collins, Introduction to multivariate analysis. Chapman & Hall, 1980

Chat eld, C. and A.J.Collins, Introduction to multivariate analysis. Chapman & Hall, 1980 MT07: Multvarate Statstcal Methods Mke Tso: emal mke.tso@manchester.ac.uk Webpage for notes: http://www.maths.manchester.ac.uk/~mkt/new_teachng.htm. Introducton to multvarate data. Books Chat eld, C. and

More information

ANSWERS CHAPTER 9. TIO 9.2: If the values are the same, the difference is 0, therefore the null hypothesis cannot be rejected.

ANSWERS CHAPTER 9. TIO 9.2: If the values are the same, the difference is 0, therefore the null hypothesis cannot be rejected. ANSWERS CHAPTER 9 THINK IT OVER thnk t over TIO 9.: χ 2 k = ( f e ) = 0 e Breakng the equaton down: the test statstc for the ch-squared dstrbuton s equal to the sum over all categores of the expected frequency

More information

LINEAR REGRESSION ANALYSIS. MODULE VIII Lecture Indicator Variables

LINEAR REGRESSION ANALYSIS. MODULE VIII Lecture Indicator Variables LINEAR REGRESSION ANALYSIS MODULE VIII Lecture - 7 Indcator Varables Dr. Shalabh Department of Maematcs and Statstcs Indan Insttute of Technology Kanpur Indcator varables versus quanttatve explanatory

More information

A Note on Test of Homogeneity Against Umbrella Scale Alternative Based on U-Statistics

A Note on Test of Homogeneity Against Umbrella Scale Alternative Based on U-Statistics J Stat Appl Pro No 3 93- () 93 NSP Journal of Statstcs Applcatons & Probablty --- An Internatonal Journal @ NSP Natural Scences Publshng Cor A Note on Test of Homogenety Aganst Umbrella Scale Alternatve

More information

PHYS 450 Spring semester Lecture 02: Dealing with Experimental Uncertainties. Ron Reifenberger Birck Nanotechnology Center Purdue University

PHYS 450 Spring semester Lecture 02: Dealing with Experimental Uncertainties. Ron Reifenberger Birck Nanotechnology Center Purdue University PHYS 45 Sprng semester 7 Lecture : Dealng wth Expermental Uncertantes Ron Refenberger Brck anotechnology Center Purdue Unversty Lecture Introductory Comments Expermental errors (really expermental uncertantes)

More information

Modeling and Simulation NETW 707

Modeling and Simulation NETW 707 Modelng and Smulaton NETW 707 Lecture 5 Tests for Random Numbers Course Instructor: Dr.-Ing. Magge Mashaly magge.ezzat@guc.edu.eg C3.220 1 Propertes of Random Numbers Random Number Generators (RNGs) must

More information

1 Binary Response Models

1 Binary Response Models Bnary and Ordered Multnomal Response Models Dscrete qualtatve response models deal wth dscrete dependent varables. bnary: yes/no, partcpaton/non-partcpaton lnear probablty model LPM, probt or logt models

More information

[The following data appear in Wooldridge Q2.3.] The table below contains the ACT score and college GPA for eight college students.

[The following data appear in Wooldridge Q2.3.] The table below contains the ACT score and college GPA for eight college students. PPOL 59-3 Problem Set Exercses n Smple Regresson Due n class /8/7 In ths problem set, you are asked to compute varous statstcs by hand to gve you a better sense of the mechancs of the Pearson correlaton

More information

SIMPLE LINEAR REGRESSION

SIMPLE LINEAR REGRESSION Smple Lnear Regresson and Correlaton Introducton Prevousl, our attenton has been focused on one varable whch we desgnated b x. Frequentl, t s desrable to learn somethng about the relatonshp between two

More information

since [1-( 0+ 1x1i+ 2x2 i)] [ 0+ 1x1i+ assumed to be a reasonable approximation

since [1-( 0+ 1x1i+ 2x2 i)] [ 0+ 1x1i+ assumed to be a reasonable approximation Econ 388 R. Butler 204 revsons Lecture 4 Dummy Dependent Varables I. Lnear Probablty Model: the Regresson model wth a dummy varables as the dependent varable assumpton, mplcaton regular multple regresson

More information

Statistics for Managers Using Microsoft Excel/SPSS Chapter 14 Multiple Regression Models

Statistics for Managers Using Microsoft Excel/SPSS Chapter 14 Multiple Regression Models Statstcs for Managers Usng Mcrosoft Excel/SPSS Chapter 14 Multple Regresson Models 1999 Prentce-Hall, Inc. Chap. 14-1 Chapter Topcs The Multple Regresson Model Contrbuton of Indvdual Independent Varables

More information

Lecture 10 Support Vector Machines II

Lecture 10 Support Vector Machines II Lecture 10 Support Vector Machnes II 22 February 2016 Taylor B. Arnold Yale Statstcs STAT 365/665 1/28 Notes: Problem 3 s posted and due ths upcomng Frday There was an early bug n the fake-test data; fxed

More information

4.3 Poisson Regression

4.3 Poisson Regression of teratvely reweghted least squares regressons (the IRLS algorthm). We do wthout gvng further detals, but nstead focus on the practcal applcaton. > glm(survval~log(weght)+age, famly="bnomal", data=baby)

More information

On the correction of the h-index for career length

On the correction of the h-index for career length 1 On the correcton of the h-ndex for career length by L. Egghe Unverstet Hasselt (UHasselt), Campus Depenbeek, Agoralaan, B-3590 Depenbeek, Belgum 1 and Unverstet Antwerpen (UA), IBW, Stadscampus, Venusstraat

More information

on the improved Partial Least Squares regression

on the improved Partial Least Squares regression Internatonal Conference on Manufacturng Scence and Engneerng (ICMSE 05) Identfcaton of the multvarable outlers usng T eclpse chart based on the mproved Partal Least Squares regresson Lu Yunlan,a X Yanhu,b

More information

Statistics MINITAB - Lab 2

Statistics MINITAB - Lab 2 Statstcs 20080 MINITAB - Lab 2 1. Smple Lnear Regresson In smple lnear regresson we attempt to model a lnear relatonshp between two varables wth a straght lne and make statstcal nferences concernng that

More information

UCLA STAT 13 Introduction to Statistical Methods for the Life and Health Sciences. Chapter 11 Analysis of Variance - ANOVA. Instructor: Ivo Dinov,

UCLA STAT 13 Introduction to Statistical Methods for the Life and Health Sciences. Chapter 11 Analysis of Variance - ANOVA. Instructor: Ivo Dinov, UCLA STAT 3 ntroducton to Statstcal Methods for the Lfe and Health Scences nstructor: vo Dnov, Asst. Prof. of Statstcs and Neurology Chapter Analyss of Varance - ANOVA Teachng Assstants: Fred Phoa, Anwer

More information

Chapter 14 Simple Linear Regression

Chapter 14 Simple Linear Regression Chapter 4 Smple Lnear Regresson Chapter 4 - Smple Lnear Regresson Manageral decsons often are based on the relatonshp between two or more varables. Regresson analss can be used to develop an equaton showng

More information

Global Sensitivity. Tuesday 20 th February, 2018

Global Sensitivity. Tuesday 20 th February, 2018 Global Senstvty Tuesday 2 th February, 28 ) Local Senstvty Most senstvty analyses [] are based on local estmates of senstvty, typcally by expandng the response n a Taylor seres about some specfc values

More information

Appendix B: Resampling Algorithms

Appendix B: Resampling Algorithms 407 Appendx B: Resamplng Algorthms A common problem of all partcle flters s the degeneracy of weghts, whch conssts of the unbounded ncrease of the varance of the mportance weghts ω [ ] of the partcles

More information