Measuring Association between Spatially Defined Variables: An Alternative Procedure
|
|
- Paul Richards
- 5 years ago
- Views:
Transcription
1 Lawrence 1. Hubert, Reginald G. Golledge C. Michael Costanxo, and Nathan Gale Measuring Association between Spatially Defined Variables: An Alternative Procedure INTRODUCTION An earlier paper (Hubert and Golledge 1982) developed a measure of association between two spatially defined variables, generalizing some earlier work of Tjestheim (1978). As a very brief review, there are two variables, F and G, with n realizations of each. The values of F are denoted by f,,..., f, and are observed in locations 0,,..., 0,; those on G are denoted by g,,..., g, and are observed in locations p,,..., p,. In the applications we discuss, the sets of unordered locations {O,,...,On} and { p,,..., p, } are exactly the same, though the indexing of the locations may differ for the two variables. Typically, F and G are commensurable with the observations being ranks or possibly z-scores generated for each separate variable. Throughout the paper, our major concern will be with the measurement of association in a spatial sense, as distinct from what may be best termed point-topoint association. The latter is usually approached by some well-known index of correlation, such as those attributed to Pearson, Spearman, or Kendall, and refers to the correspondence between F and G over identical spatial locations. Spatial association, on the other hand, deals with the degree to which the values on the two variables are reflected by locations that are spatially close or distant. Note that this distinction between point-to-point and spatial association does not preclude a single spatial process being responsible for producing both. Rather, our overall interest is in a generic notion of association between two spatially distributed variables that might be reflected as a point-to-point correspondence and/or as one that is best characterized in terms of the similarity of the values on two variables observed in different locations, with weights proportional to their spatial separation. Somewhat more formally, to obtain a measure of association between the variables F and G in relation to the available spatial information, two n x n matrices are defined: { dij}, where dij is some spatial separation measure between Oi and pj, and { cij}, where cij = h(f;, gj) for some particular bivariate function. This project was funded by the National Institute of Justice Grant 82-IJ-CX4019. Lawrence J. Hubert is professor of education, Reginald G. Golledge is professor of geography, C. Michael Costanu, and Nathan Gale are doctoral students in geography, University of Califmia, Santa Barbara. Geographical Analysis, Vol. 17, No. 1 (Jan. 1985) Ohio State University Press Submitted 5/83. Revised version accepted 1/84.
2 Hubert, Golledge, Costanzo, and Gale / 37 Both d j and ci j characterize two different notions of similarity between the spatial locations Oi and pi. The former is based on spatial separation and the latter on the observed values of the variables through the chosen function h(.,.), for example, h(x, gj) = If; - gjl. For us, the task of assessing association reduces to comparing the correspondence between the two matrices { dij} and { cij} through a raw cross-product statistic i,j ci jd i. For example, suppose F and G denote the same variable but measured at times t and t + 1, respectively. If h(f;, gi) is specified as, say, lx - gjl, then the raw cross-product measure should be relatively extreme in size if the common variable reflects a process that is partially determined spatially (e.g., see the various models described by Cliff and Ord 1981). If p is some permutation on the first n integers, a general index of the form = CCijdip(j) i, j can be defined; the observed index of association is then obtained through an identity permutation pi: '(PI) = Ccijdij. i.j Significance testing proceeds by assuming that all n! possible permutations for the index A( p) are equally likely and by comparing A( p r) to the resulting distribution. This is equivalent to assuming that the observations on F are fixed in their respective locations and all permutations of g,,..., g, among their corresponding n locations are equally likely. Monte Carlo sampling can be used to obtain significance levels or possibly a Pearson Type I11 approximation based on the mean and variance of A(p) given in Hubert and GoUedge (1982) plus the skewness parameter y: - where a, = d G - - a: + d *, and where "j = (l/n)cdg i a:= (l/n)cdg i a* = (l/n2)cdg i,j This process of fitting a Type I11 approximation is discussed by Mielke, Berry, and Brier (1981) and Costanzo, Hubert, and Golledge (1983). Normalized indexes that correspond to the raw index A(p) are discussed by Hubert and Golledge (1982).
3 38 / Geographical Analysis From a broader historical perspective, the reader may also wish to consult Bachi (1962), whose descriptive work on measuring the spatial association between two variables predates that of Tjestheim (1978). Bachi was concerned, in part, with evaluating a one-to-one correspondence between two spatial sets using the concept of standard distance, that is, the function h(., - ) is (0-1) dichotomous and defines a pairing of the locations in {O,,...,On} and { p,,..., p,}; the measure of spatial separation between two locations is their squared Euclidean distance apart. Although no inference model was proposed by Bachi, he did offer several tentative suggestions at a descriptive level for generalizing the (0-1) mapping to one that uses more complete information from the two variables. This earlier work can now be incorporated within our framework through the use of the cross-product measure between the two matrices { cij} and { dij}. Reference might also be made to Tobler (1965), who incorporates Bachi s ideas in discussing the correspondence between two geographical patterns, and to Glick (1982), who evaluates Tjestheim s original proposal through a small Monte Carlo simulation. Finally, for a general discussion of the importance of assessing association between two spatially distributed variables, we refer the reader to the classic empirical work of McCarty and Salisbury (1962). AN ALTERNATIVE INFERENCE STRATEGY FOR ASSESSING THE ASSOCIATION BETWEEN TWO VARIABLES Although the generalized Tjestheim approach to measuring association between variables is a natural strategy to follow and one that is capable of a number of variations, it has at least one anomaly because the index A(p) does not take advantage of the fact that the row and column objects are the same. To be more explicit, suppose we consider Tjestheim s original index Ec,ld,JccI, where c = 1 if 1. rank f; = rank g,, and 0 otherwise. In constructing the reference distnbution against which to compare an observed index, different permutations lead to a range of values for the index A(p). The ranks for variable F are fixed in given spatial locations and those for G are redistributed according to the particular permutation being considered. Each permutation of the ranks on G results in some pairing of integers at each of the n spatial locations; therefore, in addition to A( p), we could compute a Spearman rank order correlation measure for the given pairing. In general, there is a nonzero covariance between these two statistics suggesting that the inference model for Tjestheim s measure is spatially contaminated with an association statistic that is aspatial in design. Following Fraser (1956), we could, in fact, go further and provide an explicit formula for this covariance for any arbitrary spatial separation matrix { d,, }. The important point, however, is that the covariance is typically nonzero, and, thus, we do not need to go into the details. An alternate way of examining the difficulty of assessing the significance of the Tjestheim index relies on emphasizing the conceptual distinction between two types of correspondence that may be present for spatially distributed variables. As mentioned earlier, one of these is a point-tepoint notion that does not depend on the locational arrangement of the observations, but only on the relationship within the pairs (A, gt). This latter relationship may OT may not be due to some spatial process and there is no way of making this determination by relying solely on the point-tepoint framework. The other notion of spatial association depends on the relationship between distinct pairs treated as indivisible units. In this case, the relationship can be attributed unambiguously to spatial process. It may be true (and in most geographically relevant instances we would expect it to be true) that some spatial process is responsible for both point-tepoint and spatial association; thus, reasonable measures of each should be positively related over most data sets. In fact, given two such measures, we might even wish to aggregate them into one final
4 Hubert, Golledge, Costanzo, and Gale / 39 measure of association. Nevertheless, spatial association, as we have characterized it, is conceptually distinct from a point-tepoint relationship because the latter cannot be attributed unambiguously to some spatially determined process. Consequently, it would be convenient to have a strategy for evaluating the relative size of spatial association per se, conditional on the fixed level of point-tepoint association that is present in the data and without any commitment to the point-tepoint relationship itself being spatially determined. Staying within the framework we have set out, there does not seem to be any obvious solution to the problem. For example, given the symmetry in the use of variables F and G, fixing the second variable G and permuting the values of F does not help. Nor does a model discussed by Klauber (1971) offer a solution in which both the sets of values on F and G would be permuted at random among the n locations. Taking a somewhat different tactic, however, there is a way of resolving this difficulty as long as the realizations on the two variables F and G are not all identical. What is required is to treat the association task as a special case of the spatial autocorrelation index discussed by Hubert, Golledge, and Costanzo (1981). In our notation, this would take the form The observed statistics A(pl) and T(p,) are numerically the same (assuming zero diagonals for { d } and { ci }), but we would use T( p) to develop the appropriate reference distribution. Because the inference model permutes rows and columns simultaneously, all permutations would lead to varying values of the association statistic but exactly to the same value of Spearman s index or, for that matter, to any of the usual correlation coefficients, for example, Kendall s tau. Pairs of observations on F and G, (fl, g,), are permuted as indivisible units among the n locations; that is, conditional on the observed pairs of observations, the null assumption is that the n pairs represent n independent observations on a common underlying bivariate distribution; the alternative hypothesis is one of nonindependence. Viewed slightly differently, suppose the spatial separation matrix { d } is symmetric and cij = If; - gjl, i # j. Then, without loss of generality, each offdiagonal value ci could be redefined as and T(p) is a generalized spatial autocorrelation statistic designed to assess the correspondence between two square symmetric matrices, and, in turn, the degree to which the association between F and G can be explained by their spatial realizations. In short, a seemingly minor change in the usual measure of spatial autocorrelation for a single variable leads to a strategy for assessing the association between two variables in a spatial sense. Compared to Tjestheim s alternative, the inference strategy based on T does not have the same contamination with an aspatial measure. Practical significance testing procedures for T(p,) would be the same as for A(p,), for example, Monte Carlo methods or Type I11 approximations based on the first three moments of r(p) given, say, by Mielke (1979). EXAMPLE As an illustration of the difference between a measure of spatial association and a traditional correlation coefficient based on point-tepoint correspondence, consider the data displayed in Figures la and lb. Using Tobler s unclassed choropleth
5 40 / Geographical Analysis FIG. la (bottoln). Rank Order of Per Capita Residential Burglary, lb (top). Rank Order of Per Capita Residential Burglary, FIG lc City of Santa Barbara Police Reporting Distncts
6 -- Y TABLE 1 ~ Rt?poltlllg Distnct _ ~~ Hubert, Golledge, Costanzo, and Gale / 41 -~ ~ - _ ~ Reporting - - ~ - Bstnct 19x0 ~ ~ - _-~- ~ ~ - ~ 19x mapping procedure (see Tobler 1973; Gale and Halperin 1982), these figures portray the rank order of per capita residential burglary for 1980 and 1981, respectively, over the sixty-four police reporting districts used by the city of Santa Barbara, California. Figure lc merely indicates the arbitrary numbering of the sixty-four districts for the actual data given in Table 1. Spearman s rank order correlation between these two sets is quite high. At 0.66, it is significant at less than the 0.01 level based on the usual normal approximation discussed in Kendall(l970). In addition to a significant Spearman correlation across the two data sets, Figures la and lb also display a significant amount of spatial association using the average of the rank differences as given in (1) and the Euclidean distances between district centroids. In particular, the raw index r has a value of x lo5, with E(T) = x lo5, V(T) = x lo6, and a skewness parameter of Thus, based on either a Type I11 approximation or the Monte Carlo distribution of Table 2 for a random sample size of 99, this latter raw index would be assigned similar significance levels of less than 0.01 (see Costanzo et al. 1983). Finally, as a more convenient normalized index, we might note that the Pearson product-moment correlation between the offdiagonal entries of the distance matrix and the matrix of rank differences from (1) is 0.20.
7 ~ ~ ~ ~~ ~~ 42 / Geographical Analysis ~- TABLE 2 Approximate Monte Carlo Distnbution for the Index r, Using Rank Differences of Data in Table 1 and Euclidean Distances between Distncts (sample size = 99) -~ (,miidatwe Frequency ~ FIG. 2. Maximized Spatid Association, Residential Burglary, 1980 (bottom), 1981 (top)
8 Hubert, Golledge, Costanzo, and Gale / 43 To show several possible patternings of spatid association that might conceivably be present in data of this same form, a simple optimization algorithm was applied to the data as originally given in Table 1. Specifically, the pairs of ranks attached to each of the sixty-four districts were treated as intact units but were reauocated to maximize (or minimize) the raw spatial association index r. Starting with the data as listed, the pairs of ranks between two districts were interchanged to increase (or decrease) r the most. Starting at this point, another interchange was implemented in the same way, and so on until no painvise interchange could be found that would increase (or decrease) r. These final allocations are not necessarily globally optimal but the mappings they produce, given in Figure 2 (Fig. 3), at least show rather clearly how strong positive (or negative) spatial association might look graphically. The point-tepoint Spearman correlation for Figures la, lb, 2, and 3 are all exactly the same, that is, a value of What has changed is the degree of spatial association. For Figures la and lb we obtained a normalized index of 0.20 (the correlation between the offdiagonal entries of the distance matrix and a second based on rank differences); the index is 0.84 for Figure 2 and for Figure 3. Although the pairs of maps in Figures 1, 2, and 3 were concerned with spatial association, note the degree of spatial autocorrelation exhibited by the single variables mapped in each of the six representations. As given in Table 3, we use a FIG. 3. Minimized Spatial Association, Residential Burglary, 1980 (bottom), 1981 (top)
9 ~~ -- ~ ~ ~~~ ~~~ ~ ~ - ~~ 44 / Geographical Analysis TABLE3 ~ Spatial Autocorrelation Analysis, Using Data from Table 1 in Initial and Optimized Fonn ~ -- ~ ~~~~~ ~~~~- ~ Data 1 (X105) ~~ ~- Initial Initial Maximized Maxlrnized Minimized s Minimized ~ ~~ - ~ ~~ - ~~ ~ ~ ~ NOTES The CxpeCtdtlOll, VdndllLe dlld 5keWle5S Of 1 dre LOnltdllt dcio1s the 5IX CdSrS h ( 1 ) = X in5 v( 1 ) = skewers = Ba\d 011 Type I11 approxmatlons all value5 of 1 xt. slgnlftc.+rrt at less tllall the 0 01 level raw index of the same form as r, that is, or a normalized counterpart based on the correlation between the distances and the rank differences. As apparent from Table 3, there is a significant degree of spatial autocorrelation in the original variables (Figs. la and lb). The normalization represented in Figure 2 leads to a much stronger positive spatial autocorrelation for the two mapped variables; conversely, Figure 3 leads to a strong negative spatial autocorrelation. In short, maximization (or minimization) of spatial association brings an increased positive (or negative) spatial autocorrelation in the two variables. When individuals subjectively compare maps for correspondence, it appears that at least two rather distinct sources of information are being used. The first is the obvious point-to-point relationship, which can be measured by Spearman s index or one of its usual competitors. Second, there is an additional component that we have discussed as spatial association. To show this distinction in a very controlled way, Figure 4 displays hypothetical data constructed so that the rank order correlation between the sixty-four cells of the checkerboard is nearly zero (a value of ). In addition, almost no spatial association exists (the correlation between the FIG. 4. Hypothetical Data, Variable F (left), Variable G (Tight)
10 Hubert, Golledge, Costanzo, and Gale / 45 FIG. 5. Maximized Spatial Association, Variable F (left), Variable G (right) distance matrix and rank differences from (1) is 0.0oO'). Reorganizing these data in the same manner as before, a local maximum shown in Figure 5 was identified for a normalized index of +0.58; a local minimum of is displayed in Figure 6. The positive correspondence of the pair of figures in Figure 5 and the negative association of the pair of figures in Figure 6 should be apparent even though the point-tepoint correlation between each of these pairs of figures is essentially zero. What the observer is recognizing is spatial association and not a point-to-point correspondence. SOME FINAL COMMENTS This paper points to the need for renewed interest in the concept of spatial association and the way in which it might be evaluated. The ideas outlined in the FIG. 6. Minimized Spatial Association, Variable F (left), Variable G (right)
11 46 / Geographical Analysis paper are being pursued in several directions, especially in relating the proposed formal measures introduced above to alternative strategies for assessing the correspondence between maps (e.g., spectral methods) and to other ways of defining the measure of proximity between the variables F and G. A particularly interesting problem relates to the interface between the normative discussion of correspondence, as presented here, and a descriptive notion based on human perception of correspondence (cf. McCarty and Salisbury 1962; Lloyd and Steinke 1977). This latter research area obviously overlaps behavioral and quantitative research, and, given the importance of these problems to geographical data analysis, we anticipate more widespread interest in this topic in the future. LITERATURE CITED Bachi, R. (1962). Standard Distance Measures and Related Methods for Spatial Analysis. Pupms, Regiunul Science Association, 10, Cliff, A. D., and J. K. Ord (1981). Sputial Processes: Models and Applications. London: Pion. Costanzo, C. M., L. J. Hubert, and R. G. Golledge (1983). A Higher Moment for Spatial Statistics. Geographical Analysis, 15, Fraser, D. A. S. (1956). A Vector Form of the Wald-Wolfowitz-Hoeffding Theorem. Annuls of Mathmatical Statistics, 27, Gale, N., and W. C. Halperin (1982). A Case for Better Graphics: The Unclassed Choropleth Map. The Ammican Statistician, 36, Glick, B. J. (1982). A Spatial Rank-Order Correlation Measure. Geogruphicul Anulysis, 14, Hubert, L. J., and R. G. Golledge (1982). Measuring Association between Spatially Defined Variables: Tjostheim s Index and Some Generalizations. Geographical Analysis, 14, Hubert, L. J., R. G. Golledge, and C. M. Costanzo (1981). Generalized Procedures for Evaluating Spatial Autocorrelation. Geographical Analysis, 13, Kendall, M. G. (1970). Rank Correlation Methods. London: Griffin. Klauber, M. R. (1971). Two Sample Randomization Tests for Space-Time Clustering. Biometries, 27, Lloyd, R. E., and T. R. Steinke (1977). Visual and Statistical Comparison of Choropleth Maps. Annuls, Association of American Geogruphers, 67, McCarty, H. H., and N. E. Salisbury (1962). Visual Corrparison of Isopletli Maps us (I Meuns of Determining Correlations between Spatially Distributed Phenomena. Iowa City: Department of Geography, University of Iowa. Mielke, P. W. (1979). On Asymptotic Non-Normality of Null Distributions of MRPP Statistics. Cummunicutions in Statistics, Theory and Methods, A8, Mielke, P. W., K. J. Berry, and G. W. Brier (1981). Application of Multi-Response; Permutation Procedures for Examining Seasonal Changes in Monthly Mean Sea-Level Pressure Patterns. Monthly Weather Reoiew, 109, Tjostheiiu, D. (1978). A Measure of Association for Spatial Variables. Biometrika, 65, Tobler, W. R. (1965). Computing the Correspondence of Geographical Patterns. Pupers, Regional Science Association, (1973). Choropleth Maps without Class Intervals? Geographical Analysis, 3,
Research Notes and Comments I 347
Research Notes and Comments I 347 mum-likelihood estimation of the constant and does not need to be applied a posteriori. Overall, the replacement of the lognormal model by the Poisson model provides a
More informationCHAPTER 4 THE COMMON FACTOR MODEL IN THE SAMPLE. From Exploratory Factor Analysis Ledyard R Tucker and Robert C. MacCallum
CHAPTER 4 THE COMMON FACTOR MODEL IN THE SAMPLE From Exploratory Factor Analysis Ledyard R Tucker and Robert C. MacCallum 1997 65 CHAPTER 4 THE COMMON FACTOR MODEL IN THE SAMPLE 4.0. Introduction In Chapter
More informationBivariate Paired Numerical Data
Bivariate Paired Numerical Data Pearson s correlation, Spearman s ρ and Kendall s τ, tests of independence University of California, San Diego Instructor: Ery Arias-Castro http://math.ucsd.edu/~eariasca/teaching.html
More informationThe Nature of Geographic Data
4 The Nature of Geographic Data OVERVIEW Elaborates on the spatial is special theme Focuses on how phenomena vary across space and the general nature of geographic variation Describes the main principles
More informationStructure in Data. A major objective in data analysis is to identify interesting features or structure in the data.
Structure in Data A major objective in data analysis is to identify interesting features or structure in the data. The graphical methods are very useful in discovering structure. There are basically two
More informationA Monte-Carlo study of asymptotically robust tests for correlation coefficients
Biometrika (1973), 6, 3, p. 661 551 Printed in Great Britain A Monte-Carlo study of asymptotically robust tests for correlation coefficients BY G. T. DUNCAN AND M. W. J. LAYAKD University of California,
More informationUsing AMOEBA to Create a Spatial Weights Matrix and Identify Spatial Clusters, and a Comparison to Other Clustering Algorithms
Using AMOEBA to Create a Spatial Weights Matrix and Identify Spatial Clusters, and a Comparison to Other Clustering Algorithms Arthur Getis* and Jared Aldstadt** *San Diego State University **SDSU/UCSB
More informationLecture 3: Exploratory Spatial Data Analysis (ESDA) Prof. Eduardo A. Haddad
Lecture 3: Exploratory Spatial Data Analysis (ESDA) Prof. Eduardo A. Haddad Key message Spatial dependence First Law of Geography (Waldo Tobler): Everything is related to everything else, but near things
More informationStudies in Nonlinear Dynamics & Econometrics
Studies in Nonlinear Dynamics & Econometrics Volume 9, Issue 2 2005 Article 4 A Note on the Hiemstra-Jones Test for Granger Non-causality Cees Diks Valentyn Panchenko University of Amsterdam, C.G.H.Diks@uva.nl
More informationLecture 3: Exploratory Spatial Data Analysis (ESDA) Prof. Eduardo A. Haddad
Lecture 3: Exploratory Spatial Data Analysis (ESDA) Prof. Eduardo A. Haddad Key message Spatial dependence First Law of Geography (Waldo Tobler): Everything is related to everything else, but near things
More informationGeography 360 Principles of Cartography. April 17, 2006
Geography 360 Principles of Cartography April 17, 2006 Outlines Cartographic design principles Enhancing visual hierarchies on maps Kinds of numerical data and color scheme What is proportional symbol
More informationUnsupervised machine learning
Chapter 9 Unsupervised machine learning Unsupervised machine learning (a.k.a. cluster analysis) is a set of methods to assign objects into clusters under a predefined distance measure when class labels
More informationRigid Geometric Transformations
Rigid Geometric Transformations Carlo Tomasi This note is a quick refresher of the geometry of rigid transformations in three-dimensional space, expressed in Cartesian coordinates. 1 Cartesian Coordinates
More informationEXAM PRACTICE. 12 questions * 4 categories: Statistics Background Multivariate Statistics Interpret True / False
EXAM PRACTICE 12 questions * 4 categories: Statistics Background Multivariate Statistics Interpret True / False Stats 1: What is a Hypothesis? A testable assertion about how the world works Hypothesis
More informationCHAPTER 3 THE COMMON FACTOR MODEL IN THE POPULATION. From Exploratory Factor Analysis Ledyard R Tucker and Robert C. MacCallum
CHAPTER 3 THE COMMON FACTOR MODEL IN THE POPULATION From Exploratory Factor Analysis Ledyard R Tucker and Robert C. MacCallum 1997 19 CHAPTER 3 THE COMMON FACTOR MODEL IN THE POPULATION 3.0. Introduction
More informationMonte Carlo Studies. The response in a Monte Carlo study is a random variable.
Monte Carlo Studies The response in a Monte Carlo study is a random variable. The response in a Monte Carlo study has a variance that comes from the variance of the stochastic elements in the data-generating
More informationAlexander Klippel and Chris Weaver. GeoVISTA Center, Department of Geography The Pennsylvania State University, PA, USA
Analyzing Behavioral Similarity Measures in Linguistic and Non-linguistic Conceptualization of Spatial Information and the Question of Individual Differences Alexander Klippel and Chris Weaver GeoVISTA
More informationTrip Distribution Modeling Milos N. Mladenovic Assistant Professor Department of Built Environment
Trip Distribution Modeling Milos N. Mladenovic Assistant Professor Department of Built Environment 25.04.2017 Course Outline Forecasting overview and data management Trip generation modeling Trip distribution
More informationANOVA approach. Investigates interaction terms. Disadvantages: Requires careful sampling design with replication
ANOVA approach Advantages: Ideal for evaluating hypotheses Ideal to quantify effect size (e.g., differences between groups) Address multiple factors at once Investigates interaction terms Disadvantages:
More informationThe Matrix Algebra of Sample Statistics
The Matrix Algebra of Sample Statistics James H. Steiger Department of Psychology and Human Development Vanderbilt University James H. Steiger (Vanderbilt University) The Matrix Algebra of Sample Statistics
More informationReview of Matrices and Block Structures
CHAPTER 2 Review of Matrices and Block Structures Numerical linear algebra lies at the heart of modern scientific computing and computational science. Today it is not uncommon to perform numerical computations
More informationEXPLORATORY SPATIAL DATA ANALYSIS OF BUILDING ENERGY IN URBAN ENVIRONMENTS. Food Machinery and Equipment, Tianjin , China
EXPLORATORY SPATIAL DATA ANALYSIS OF BUILDING ENERGY IN URBAN ENVIRONMENTS Wei Tian 1,2, Lai Wei 1,2, Pieter de Wilde 3, Song Yang 1,2, QingXin Meng 1 1 College of Mechanical Engineering, Tianjin University
More informationGIS and Spatial Statistics: One World View or Two? Michael F. Goodchild University of California Santa Barbara
GIS and Spatial Statistics: One World View or Two? Michael F. Goodchild University of California Santa Barbara Location as attribute The data table Census summary table What value is location as an explanatory
More informationSubject CS1 Actuarial Statistics 1 Core Principles
Institute of Actuaries of India Subject CS1 Actuarial Statistics 1 Core Principles For 2019 Examinations Aim The aim of the Actuarial Statistics 1 subject is to provide a grounding in mathematical and
More informationSpatial Regression. 1. Introduction and Review. Luc Anselin. Copyright 2017 by Luc Anselin, All Rights Reserved
Spatial Regression 1. Introduction and Review Luc Anselin http://spatial.uchicago.edu matrix algebra basics spatial econometrics - definitions pitfalls of spatial analysis spatial autocorrelation spatial
More informationModel Estimation Example
Ronald H. Heck 1 EDEP 606: Multivariate Methods (S2013) April 7, 2013 Model Estimation Example As we have moved through the course this semester, we have encountered the concept of model estimation. Discussions
More informationSpatial Analysis I. Spatial data analysis Spatial analysis and inference
Spatial Analysis I Spatial data analysis Spatial analysis and inference Roadmap Outline: What is spatial analysis? Spatial Joins Step 1: Analysis of attributes Step 2: Preparing for analyses: working with
More informationOn the concordance among empirical confusion matrices for visual and tactual letter recognition
Perception & Psychophysics 2004, 66 (3), 392-397 On the concordance among empirical confusion matrices for visual and tactual letter recognition MICHAEL J. BRUSCO Florida State University, Tallahassee,
More informationMA 575 Linear Models: Cedric E. Ginestet, Boston University Non-parametric Inference, Polynomial Regression Week 9, Lecture 2
MA 575 Linear Models: Cedric E. Ginestet, Boston University Non-parametric Inference, Polynomial Regression Week 9, Lecture 2 1 Bootstrapped Bias and CIs Given a multiple regression model with mean and
More informationEE731 Lecture Notes: Matrix Computations for Signal Processing
EE731 Lecture Notes: Matrix Computations for Signal Processing James P. Reilly c Department of Electrical and Computer Engineering McMaster University September 22, 2005 0 Preface This collection of ten
More informationBackground on Chevalley Groups Constructed from a Root System
Background on Chevalley Groups Constructed from a Root System Paul Tokorcheck Department of Mathematics University of California, Santa Cruz 10 October 2011 Abstract In 1955, Claude Chevalley described
More informationCHAPTER 4 VARIABILITY ANALYSES. Chapter 3 introduced the mode, median, and mean as tools for summarizing the
CHAPTER 4 VARIABILITY ANALYSES Chapter 3 introduced the mode, median, and mean as tools for summarizing the information provided in an distribution of data. Measures of central tendency are often useful
More informationMeasuring relationships among multiple responses
Measuring relationships among multiple responses Linear association (correlation, relatedness, shared information) between pair-wise responses is an important property used in almost all multivariate analyses.
More informationSpatial Autocorrelation
Spatial Autocorrelation Luc Anselin http://spatial.uchicago.edu spatial randomness positive and negative spatial autocorrelation spatial autocorrelation statistics spatial weights Spatial Randomness The
More informationSTATISTICS 174: APPLIED STATISTICS FINAL EXAM DECEMBER 10, 2002
Time allowed: 3 HOURS. STATISTICS 174: APPLIED STATISTICS FINAL EXAM DECEMBER 10, 2002 This is an open book exam: all course notes and the text are allowed, and you are expected to use your own calculator.
More informationRigid Geometric Transformations
Rigid Geometric Transformations Carlo Tomasi This note is a quick refresher of the geometry of rigid transformations in three-dimensional space, expressed in Cartesian coordinates. 1 Cartesian Coordinates
More informationTESTING FOR EQUAL DISTRIBUTIONS IN HIGH DIMENSION
TESTING FOR EQUAL DISTRIBUTIONS IN HIGH DIMENSION Gábor J. Székely Bowling Green State University Maria L. Rizzo Ohio University October 30, 2004 Abstract We propose a new nonparametric test for equality
More informationSpatial Clusters of Rates
Spatial Clusters of Rates Luc Anselin http://spatial.uchicago.edu concepts EBI local Moran scan statistics Concepts Rates as Risk from counts (spatially extensive) to rates (spatially intensive) rate =
More informationGIST 4302/5302: Spatial Analysis and Modeling
GIST 4302/5302: Spatial Analysis and Modeling Basics of Statistics Guofeng Cao www.myweb.ttu.edu/gucao Department of Geosciences Texas Tech University guofeng.cao@ttu.edu Spring 2015 Outline of This Week
More informationarxiv: v1 [math.st] 26 Jan 2015
An alternative to Moran s I for spatial autocorrelation arxiv:1501.06260v1 [math.st] 26 Jan 2015 Yuzo Maruyama Center for Spatial Information Science, University of Tokyo e-mail: maruyama@csis.u-tokyo.ac.jp
More informationThe Use of Spatial Weights Matrices and the Effect of Geometry and Geographical Scale
The Use of Spatial Weights Matrices and the Effect of Geometry and Geographical Scale António Manuel RODRIGUES 1, José António TENEDÓRIO 2 1 Research fellow, e-geo Centre for Geography and Regional Planning,
More information22 : Hilbert Space Embeddings of Distributions
10-708: Probabilistic Graphical Models 10-708, Spring 2014 22 : Hilbert Space Embeddings of Distributions Lecturer: Eric P. Xing Scribes: Sujay Kumar Jauhar and Zhiguang Huo 1 Introduction and Motivation
More informationHypothesis Testing for Var-Cov Components
Hypothesis Testing for Var-Cov Components When the specification of coefficients as fixed, random or non-randomly varying is considered, a null hypothesis of the form is considered, where Additional output
More informationA spatial literacy initiative for undergraduate education at UCSB
A spatial literacy initiative for undergraduate education at UCSB Mike Goodchild & Don Janelle Department of Geography / spatial@ucsb University of California, Santa Barbara ThinkSpatial Brown bag forum
More informationPARAMETRIC TESTS OF PERFECT JUDGMENT RANKING BASED ON ORDERED RANKED SET SAMPLES
REVSTAT Statistical Journal Volume 16, Number 4, October 2018, 463 474 PARAMETRIC TESTS OF PERFECT JUDGMENT RANKING BASED ON ORDERED RANKED SET SAMPLES Authors: Ehsan Zamanzade Department of Statistics,
More informationSeparation of a Signal of Interest from a Seasonal Effect in Geophysical Data: I. El Niño/La Niña Phenomenon
International Journal of Geosciences, 2011, 2, **-** Published Online November 2011 (http://www.scirp.org/journal/ijg) Separation of a Signal of Interest from a Seasonal Effect in Geophysical Data: I.
More informationSpatial autocorrelation: robustness of measures and tests
Spatial autocorrelation: robustness of measures and tests Marie Ernst and Gentiane Haesbroeck University of Liege London, December 14, 2015 Spatial Data Spatial data : geographical positions non spatial
More informationSignal Modeling Techniques in Speech Recognition. Hassan A. Kingravi
Signal Modeling Techniques in Speech Recognition Hassan A. Kingravi Outline Introduction Spectral Shaping Spectral Analysis Parameter Transforms Statistical Modeling Discussion Conclusions 1: Introduction
More informationStatistics and Measurement Concepts with OpenStat
Statistics and Measurement Concepts with OpenStat William Miller Statistics and Measurement Concepts with OpenStat William Miller Urbandale, Iowa USA ISBN 978-1-4614-5742-8 ISBN 978-1-4614-5743-5 (ebook)
More informationStatistical Inference: Estimation and Confidence Intervals Hypothesis Testing
Statistical Inference: Estimation and Confidence Intervals Hypothesis Testing 1 In most statistics problems, we assume that the data have been generated from some unknown probability distribution. We desire
More informationReview of Linear Algebra
Review of Linear Algebra Definitions An m n (read "m by n") matrix, is a rectangular array of entries, where m is the number of rows and n the number of columns. 2 Definitions (Con t) A is square if m=
More informationMultivariate Distribution Models
Multivariate Distribution Models Model Description While the probability distribution for an individual random variable is called marginal, the probability distribution for multiple random variables is
More informationLogistic Regression: Regression with a Binary Dependent Variable
Logistic Regression: Regression with a Binary Dependent Variable LEARNING OBJECTIVES Upon completing this chapter, you should be able to do the following: State the circumstances under which logistic regression
More informationReview Questions REVIEW QUESTIONS 71
REVIEW QUESTIONS 71 MATLAB, is [42]. For a comprehensive treatment of error analysis and perturbation theory for linear systems and many other problems in linear algebra, see [126, 241]. An overview of
More informationBivariate Relationships Between Variables
Bivariate Relationships Between Variables BUS 735: Business Decision Making and Research 1 Goals Specific goals: Detect relationships between variables. Be able to prescribe appropriate statistical methods
More informationPlace Syntax Tool (PST)
Place Syntax Tool (PST) Alexander Ståhle To cite this report: Alexander Ståhle (2012) Place Syntax Tool (PST), in Angela Hull, Cecília Silva and Luca Bertolini (Eds.) Accessibility Instruments for Planning
More informationLinear Algebra. The analysis of many models in the social sciences reduces to the study of systems of equations.
POLI 7 - Mathematical and Statistical Foundations Prof S Saiegh Fall Lecture Notes - Class 4 October 4, Linear Algebra The analysis of many models in the social sciences reduces to the study of systems
More informationBiometrika Trust. Biometrika Trust is collaborating with JSTOR to digitize, preserve and extend access to Biometrika.
Biometrika Trust Discrete Sequential Boundaries for Clinical Trials Author(s): K. K. Gordon Lan and David L. DeMets Reviewed work(s): Source: Biometrika, Vol. 70, No. 3 (Dec., 1983), pp. 659-663 Published
More informationA correlation coefficient for circular data
BiomelriL-a (1983). 70. 2, pp. 327-32 327 Prinltd in Great Britain A correlation coefficient for circular data BY N. I. FISHER CSIRO Division of Mathematics and Statistics, Lindfield, N.S.W., Australia
More informationNemours Biomedical Research Biostatistics Core Statistics Course Session 4. Li Xie March 4, 2015
Nemours Biomedical Research Biostatistics Core Statistics Course Session 4 Li Xie March 4, 2015 Outline Recap: Pairwise analysis with example of twosample unpaired t-test Today: More on t-tests; Introduction
More informationBIG IDEAS. Area of Learning: SOCIAL STUDIES Urban Studies Grade 12. Learning Standards. Curricular Competencies
Area of Learning: SOCIAL STUDIES Urban Studies Grade 12 BIG IDEAS Urbanization is a critical force that shapes both human life and the planet. The historical development of cities has been shaped by geographic,
More informationTHE JORDAN-FORM PROOF MADE EASY
THE JORDAN-FORM PROOF MADE EASY LEO LIVSHITS, GORDON MACDONALD, BEN MATHES, AND HEYDAR RADJAVI Abstract A derivation of the Jordan Canonical Form for linear transformations acting on finite dimensional
More informationThe Study on Trinary Join-Counts for Spatial Autocorrelation
Proceedings of the 8th International Symposium on Spatial Accuracy Assessment in Natural Resources and Environmental Sciences Shanghai, P. R. China, June 5-7, 008, pp. -8 The Study on Trinary Join-Counts
More informationReal Analysis Prof. S.H. Kulkarni Department of Mathematics Indian Institute of Technology, Madras. Lecture - 13 Conditional Convergence
Real Analysis Prof. S.H. Kulkarni Department of Mathematics Indian Institute of Technology, Madras Lecture - 13 Conditional Convergence Now, there are a few things that are remaining in the discussion
More informationOn block bootstrapping areal data Introduction
On block bootstrapping areal data Nicholas Nagle Department of Geography University of Colorado UCB 260 Boulder, CO 80309-0260 Telephone: 303-492-4794 Email: nicholas.nagle@colorado.edu Introduction Inference
More informationON SMALL SAMPLE PROPERTIES OF PERMUTATION TESTS: INDEPENDENCE BETWEEN TWO SAMPLES
ON SMALL SAMPLE PROPERTIES OF PERMUTATION TESTS: INDEPENDENCE BETWEEN TWO SAMPLES Hisashi Tanizaki Graduate School of Economics, Kobe University, Kobe 657-8501, Japan e-mail: tanizaki@kobe-u.ac.jp Abstract:
More informationSAMPLE OF THE STUDY MATERIAL PART OF CHAPTER 1 Introduction to Linear Algebra
1.1. Introduction SAMPLE OF THE STUDY MATERIAL PART OF CHAPTER 1 Introduction to Linear algebra is a specific branch of mathematics dealing with the study of vectors, vector spaces with functions that
More informationNature of Spatial Data. Outline. Spatial Is Special
Nature of Spatial Data Outline Spatial is special Bad news: the pitfalls of spatial data Good news: the potentials of spatial data Spatial Is Special Are spatial data special? Why spatial data require
More informationInteraction Analysis of Spatial Point Patterns
Interaction Analysis of Spatial Point Patterns Geog 2C Introduction to Spatial Data Analysis Phaedon C Kyriakidis wwwgeogucsbedu/ phaedon Department of Geography University of California Santa Barbara
More informationRepresentation of Geographic Data
GIS 5210 Week 2 The Nature of Spatial Variation Three principles of the nature of spatial variation: proximity effects are key to understanding spatial variation issues of geographic scale and level of
More informationRobustness of Principal Components
PCA for Clustering An objective of principal components analysis is to identify linear combinations of the original variables that are useful in accounting for the variation in those original variables.
More informationDefinition 2.3. We define addition and multiplication of matrices as follows.
14 Chapter 2 Matrices In this chapter, we review matrix algebra from Linear Algebra I, consider row and column operations on matrices, and define the rank of a matrix. Along the way prove that the row
More informationStatistical Perspectives on Geographic Information Science. Michael F. Goodchild University of California Santa Barbara
Statistical Perspectives on Geographic Information Science Michael F. Goodchild University of California Santa Barbara Statistical geometry Geometric phenomena subject to chance spatial phenomena emphasis
More informationFundamentals of Linear Algebra. Marcel B. Finan Arkansas Tech University c All Rights Reserved
Fundamentals of Linear Algebra Marcel B. Finan Arkansas Tech University c All Rights Reserved 2 PREFACE Linear algebra has evolved as a branch of mathematics with wide range of applications to the natural
More information5.1 Vectors A vector is a one-dimensional array of numbers. A column vector has its m elements arranged from top to bottom.
Youngstown State University Industrial and Systems Engineering ISEGR Systems Analysis and Design Session - Vector and Matrix Operations. Vectors A vector is a one-dimensional array of numbers. A column
More informationExpressions for the covariance matrix of covariance data
Expressions for the covariance matrix of covariance data Torsten Söderström Division of Systems and Control, Department of Information Technology, Uppsala University, P O Box 337, SE-7505 Uppsala, Sweden
More informationStatistical Significance of Ranking Paradoxes
Statistical Significance of Ranking Paradoxes Anna E. Bargagliotti and Raymond N. Greenwell 1 February 28, 2009 1 Anna E. Bargagliotti is an Assistant Professor in the Department of Mathematical Sciences
More informationFINITE ABELIAN GROUPS Amin Witno
WON Series in Discrete Mathematics and Modern Algebra Volume 7 FINITE ABELIAN GROUPS Amin Witno Abstract We detail the proof of the fundamental theorem of finite abelian groups, which states that every
More informationContents Kruskal-Wallis Test Friedman s Two-way Analysis of Variance by Ranks... 47
Contents 1 Non-parametric Tests 3 1.1 Introduction....................................... 3 1.2 Advantages of Non-parametric Tests......................... 4 1.3 Disadvantages of Non-parametric Tests........................
More informationTexas A&M University
Texas A&M University CVEN 658 Civil Engineering Applications of GIS Hotspot Analysis of Highway Accident Spatial Pattern Based on Network Spatial Weights Instructor: Dr. Francisco Olivera Author: Zachry
More informationChapter III. Stability of Linear Systems
1 Chapter III Stability of Linear Systems 1. Stability and state transition matrix 2. Time-varying (non-autonomous) systems 3. Time-invariant systems 1 STABILITY AND STATE TRANSITION MATRIX 2 In this chapter,
More informationQueens College, CUNY, Department of Computer Science Numerical Methods CSCI 361 / 761 Spring 2018 Instructor: Dr. Sateesh Mane.
Queens College, CUNY, Department of Computer Science Numerical Methods CSCI 361 / 761 Spring 2018 Instructor: Dr. Sateesh Mane c Sateesh R. Mane 2018 8 Lecture 8 8.1 Matrices July 22, 2018 We shall study
More informationPasinetti s notion of a unit of vertically integrated productive capacity : a comment
Pasinetti s notion of a unit of vertically integrated productive capacity : a comment Jaime L. del Valle Caballero Ensayos y Monografías Numero 87 marzo 1998 Pasinetti s notion of a unit of vertically
More informationComputational Data Analysis!
12.714 Computational Data Analysis! Alan Chave (alan@whoi.edu)! Thomas Herring (tah@mit.edu),! http://geoweb.mit.edu/~tah/12.714! Introduction to Spectral Analysis! Topics Today! Aspects of Time series
More informationFinding normalized and modularity cuts by spectral clustering. Ljubjana 2010, October
Finding normalized and modularity cuts by spectral clustering Marianna Bolla Institute of Mathematics Budapest University of Technology and Economics marib@math.bme.hu Ljubjana 2010, October Outline Find
More informationDiscrimination Among Groups. Discrimination Among Groups
Discrimination Among Groups Id Species Canopy Snag Canopy Cover Density Height 1 A 80 1.2 35 2 A 75 0.5 32 3 A 72 2.8 28..... 31 B 35 3.3 15 32 B 75 4.1 25 60 B 15 5.0 3..... 61 C 5 2.1 5 62 C 8 3.4 2
More informationTextbook Examples of. SPSS Procedure
Textbook s of IBM SPSS Procedures Each SPSS procedure listed below has its own section in the textbook. These sections include a purpose statement that describes the statistical test, identification of
More information1 Matrices and Systems of Linear Equations
March 3, 203 6-6. Systems of Linear Equations Matrices and Systems of Linear Equations An m n matrix is an array A = a ij of the form a a n a 2 a 2n... a m a mn where each a ij is a real or complex number.
More informationTHE N-VALUE GAME OVER Z AND R
THE N-VALUE GAME OVER Z AND R YIDA GAO, MATT REDMOND, ZACH STEWARD Abstract. The n-value game is an easily described mathematical diversion with deep underpinnings in dynamical systems analysis. We examine
More informationA3. Statistical Inference
Appendi / A3. Statistical Inference / Mean, One Sample-1 A3. Statistical Inference Population Mean μ of a Random Variable with known standard deviation σ, and random sample of size n 1 Before selecting
More informationA TIME SERIES PARADOX: UNIT ROOT TESTS PERFORM POORLY WHEN DATA ARE COINTEGRATED
A TIME SERIES PARADOX: UNIT ROOT TESTS PERFORM POORLY WHEN DATA ARE COINTEGRATED by W. Robert Reed Department of Economics and Finance University of Canterbury, New Zealand Email: bob.reed@canterbury.ac.nz
More informationLeast Squares Optimization
Least Squares Optimization The following is a brief review of least squares optimization and constrained optimization techniques. Broadly, these techniques can be used in data analysis and visualization
More informationGIS AND TERRITORIAL INTELLIGENCE. Using Microdata. Jean Dubé and Diègo Legros
GIS AND TERRITORIAL INTELLIGENCE Spatial Econometrics Using Microdata Jean Dubé and Diègo Legros Spatial Econometrics Using Microdata To the memory of Gilles Dubé. For Mélanie, Karine, Philippe, Vincent
More informationOverview of Statistical Analysis of Spatial Data
Overview of Statistical Analysis of Spatial Data Geog 2C Introduction to Spatial Data Analysis Phaedon C. Kyriakidis www.geog.ucsb.edu/ phaedon Department of Geography University of California Santa Barbara
More information5.3 Determinants and Cramer s Rule
304 53 Determinants and Cramer s Rule Unique Solution of a 2 2 System The 2 2 system (1) ax + by = e, cx + dy = f, has a unique solution provided = ad bc is nonzero, in which case the solution is given
More informationGenerating Valid 4 4 Correlation Matrices
Generating Valid 4 4 Correlation Matrices Mark Budden, Paul Hadavas, Lorrie Hoffman, Chris Pretz Date submitted: 10 March 2006 Abstract In this article, we provide an algorithm for generating valid 4 4
More informationCSL361 Problem set 4: Basic linear algebra
CSL361 Problem set 4: Basic linear algebra February 21, 2017 [Note:] If the numerical matrix computations turn out to be tedious, you may use the function rref in Matlab. 1 Row-reduced echelon matrices
More informationROBUST ESTIMATION OF A CORRELATION COEFFICIENT: AN ATTEMPT OF SURVEY
ROBUST ESTIMATION OF A CORRELATION COEFFICIENT: AN ATTEMPT OF SURVEY G.L. Shevlyakov, P.O. Smirnov St. Petersburg State Polytechnic University St.Petersburg, RUSSIA E-mail: Georgy.Shevlyakov@gmail.com
More informationSAMPLE OF THE STUDY MATERIAL PART OF CHAPTER 1 Introduction to Linear Algebra
SAMPLE OF THE STUDY MATERIAL PART OF CHAPTER 1 Introduction to 1.1. Introduction Linear algebra is a specific branch of mathematics dealing with the study of vectors, vector spaces with functions that
More informationWorking with Block Structured Matrices
Working with Block Structured Matrices Numerical linear algebra lies at the heart of modern scientific computing and computational science. Today it is not uncommon to perform numerical computations with
More information