The Study on Trinary Join-Counts for Spatial Autocorrelation

Similar documents
Basics of Geographic Analysis in R

Global Spatial Autocorrelation Clustering

Types of spatial data. The Nature of Geographic Data. Types of spatial data. Spatial Autocorrelation. Continuous spatial data: geostatistics

Spatial Autocorrelation (2) Spatial Weights

2/7/2018. Module 4. Spatial Statistics. Point Patterns: Nearest Neighbor. Spatial Statistics. Point Patterns: Nearest Neighbor

Areal data. Infant mortality, Auckland NZ districts. Number of plant species in 20cm x 20 cm patches of alpine tundra. Wheat yield

Spatial Analysis I. Spatial data analysis Spatial analysis and inference

Contents. Learning Outcomes 2012/2/26. Lecture 6: Area Pattern and Spatial Autocorrelation. Dr. Bo Wu

Representation of Geographic Data

Construction Engineering. Research Laboratory. Approaches Towards the Identification of Patterns in Violent Events, Baghdad, Iraq ERDC/CERL CR-09-1

Spatial Autocorrelation

Chapter 6 Spatial Analysis

Local Spatial Autocorrelation Clusters

EXPLORATORY SPATIAL DATA ANALYSIS OF BUILDING ENERGY IN URBAN ENVIRONMENTS. Food Machinery and Equipment, Tianjin , China

The Use of Spatial Weights Matrices and the Effect of Geometry and Geographical Scale

Nature of Spatial Data. Outline. Spatial Is Special

What s special about spatial data?

B.N.Bandodkar College of Science, Thane. Random-Number Generation. Mrs M.J.Gholba

Using AMOEBA to Create a Spatial Weights Matrix and Identify Spatial Clusters, and a Comparison to Other Clustering Algorithms

Outline. 15. Descriptive Summary, Design, and Inference. Descriptive summaries. Data mining. The centroid

Where to Invest Affordable Housing Dollars in Polk County?: A Spatial Analysis of Opportunity Areas

Quiz 1. Quiz Instruc ons. Question 1. Question 2. 2 pts. 3 pts. This is a preview of the published version of the quiz. Started: Jul 3 at 4:29pm

Spatial Regression. 1. Introduction and Review. Luc Anselin. Copyright 2017 by Luc Anselin, All Rights Reserved

Hypothesis Testing hypothesis testing approach

Comparison of MLC and FCM Techniques with Satellite Imagery in A Part of Narmada River Basin of Madhya Pradesh, India

SASI Spatial Analysis SSC Meeting Aug 2010 Habitat Document 5

Creating and Managing a W Matrix

GIS and Spatial Statistics: One World View or Two? Michael F. Goodchild University of California Santa Barbara

In matrix algebra notation, a linear model is written as

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

An improved Fuzzy Kappa statistic that accounts for spatial autocorrelation

Spatial Data Mining. Regression and Classification Techniques

Introduction to Spatial Statistics and Modeling for Regional Analysis

Discrete Multivariate Statistics

Part III: Unstructured Data. Lecture timetable. Analysis of data. Data Retrieval: III.1 Unstructured data and data retrieval

Deriving Uncertainty of Area Estimates from Satellite Imagery using Fuzzy Land-cover Classification

Link to related Solstice article: Sum-graph article, Arlinghaus, Arlinghaus, and Harary.

Spatial Analysis 2. Spatial Autocorrelation

Lecture 1 Maths for Computer Science. Denis TRYSTRAM Lecture notes MoSIG1. sept. 2017

High-Throughput Sequencing Course

Approximate Test for Comparing Parameters of Several Inverse Hypergeometric Distributions

Measures of Spatial Dependence

Chapter 5. GIS The Global Information System

Raster Spatial Analysis Specific Theory

BAYESIAN MODEL FOR SPATIAL DEPENDANCE AND PREDICTION OF TUBERCULOSIS

GEO-DATA INPUT AND CONVERSION. Christos G. Karydas,, Dr. Lab of Remote Sensing and GIS Director: Prof. N. Silleos

Σ x i. Sigma Notation

GIS Data Structure: Raster vs. Vector RS & GIS XXIII

The Nature of Geographic Data

Discrete Spatial Distributions Responsible persons: Claude Collet, Dominique Schneuwly, Regis Caloz

Auto-correlation of retinal ganglion cell mosaics shows hexagonal structure

On A Comparison between Two Measures of Spatial Association

Digital Change Detection Using Remotely Sensed Data for Monitoring Green Space Destruction in Tabriz

Spatial autocorrelation: robustness of measures and tests

Lecture 3: Exploratory Spatial Data Analysis (ESDA) Prof. Eduardo A. Haddad

Lecture 3: Exploratory Spatial Data Analysis (ESDA) Prof. Eduardo A. Haddad

Sensitivity Analysis of Boundary Detection on Spatial Features of Heterogeneous Landscape

Lecture Slides. Elementary Statistics. by Mario F. Triola. and the Triola Statistics Series

Application of the Getis-Ord Gi* statistic (Hot Spot Analysis) to seafloor organisms

Lecture Slides. Section 13-1 Overview. Elementary Statistics Tenth Edition. Chapter 13 Nonparametric Statistics. by Mario F.

Spatial Analysis II. Spatial data analysis Spatial analysis and inference

Statistical Perspectives on Geographic Information Science. Michael F. Goodchild University of California Santa Barbara

Quality and Coverage of Data Sources

KAAF- GE_Notes GIS APPLICATIONS LECTURE 3

Multiple Sample Categorical Data

International Journal of Remote Sensing, in press, 2006.

Relational Nonlinear FIR Filters. Ronald K. Pearson

Finding Hot Spots in ArcGIS Online: Minimizing the Subjectivity of Visual Analysis. Nicholas M. Giner Esri Parrish S.

Measuring Software Coupling

Spatial interaction and spatial autocorrelation: a crossproduct

Probability and Statistics

Overview key concepts and terms (based on the textbook Chang 2006 and the practical manual)

Contingency Tables. Safety equipment in use Fatal Non-fatal Total. None 1, , ,128 Seat belt , ,878

Areal Unit Data Regular or Irregular Grids or Lattices Large Point-referenced Datasets

Relate Attributes and Counts

Hypothesis Testing for Var-Cov Components

Chapter 10. Chapter 10. Multinomial Experiments and. Multinomial Experiments and Contingency Tables. Contingency Tables.

SPACE Workshop NSF NCGIA CSISS UCGIS SDSU. Aldstadt, Getis, Jankowski, Rey, Weeks SDSU F. Goodchild, M. Goodchild, Janelle, Rebich UCSB

Lesson 6: Accuracy Assessment

Lecture 4. Spatial Statistics

Cluster investigations using Disease mapping methods International workshop on Risk Factors for Childhood Leukemia Berlin May

A DATA FIELD METHOD FOR URBAN REMOTELY SENSED IMAGERY CLASSIFICATION CONSIDERING SPATIAL CORRELATION

11-2 Multinomial Experiment

Spatial analysis. Spatial descriptive analysis. Spatial inferential analysis:

Review of Statistics 101

Covariance function estimation in Gaussian process regression

Tutorial 8 Raster Data Analysis

A Spatially Adjusted ANOVA Model, by Daniel A. GrifSzth. yi,( i = 1,2,...,q;

An Introduction to Geographic Information System

Assignment Problems with Weighted and Nonweighted Neighborhood Constraints in 3 6, 4 4 and 6 3 Tilings

Statistics for Managers Using Microsoft Excel (3 rd Edition)

Workbook Exercises for Statistical Problem Solving in Geography

Geographers Perspectives on the World

Moran s Autocorrelation Coefficient in Comparative Methods

Land Use/Cover Change Detection Using Feature Database Based on Vector-Image Data Conflation

Exploratory Spatial Data Analysis (ESDA)

Luc Anselin Spatial Analysis Laboratory Dept. Agricultural and Consumer Economics University of Illinois, Urbana-Champaign

Spatial Trends of unpaid caregiving in Ireland

Identification of Local Clusters for Count Data: A. Model-Based Moran s I Test

ARIC Manuscript Proposal # PC Reviewed: _9/_25_/06 Status: A Priority: _2 SC Reviewed: _9/_25_/06 Status: A Priority: _2

Transcription:

Proceedings of the 8th International Symposium on Spatial Accuracy Assessment in Natural Resources and Environmental Sciences Shanghai, P. R. China, June 5-7, 008, pp. -8 The Study on Trinary Join-Counts for Spatial Autocorrelation Songlin Zhang, Kun Zhang Department of Surveying and Geomatics, Tongji University, Shanghai, 0009, China Lab. of Geographic Information Science, East China Normal University, Shanghai 0006, China Abstract. Spatial autocorrelation must handle two kinds of geographic data. One is continuous valued variables in which the observations are real numbers. Another is nominal variables which consist of a set of discrete categories. The frequently used spatial autocorrelation statistic for nominal variable is join-counts, which deals with two categories that are often referred to as black and white. However, three categories are also common case in present world. For example, as to land cover, the attribute of each parcel may be changed, unchanged or uncertain which represents parcels not belonging to the first two categories. In three valued logic, values could be true, false or unknown. This paper extended join-counts to trinary join-count. The trinary categories are referred as black, white and gray in this paper and the possible type of joins are limited to black-black (BB), white-white (WW), gray-gray (GG), black-white (BW), black-gray (BG) and white-gray (WG). In order to calculating joins, we assign a trinary variable xi to each region with xi= if region is black, xi=0 if region is white, and xi=- if region is gray. Formulas for counting six kinds of joins are deduced. The aim of trinary join-counts is to test the null hypothesis that the values are assigned to the regions randomly and independently. Means and variance of trinary join-counts are calculated under sampling without replacement assumption. The tests statistic is computed by transforming the join-counts into standardized value. At last, two kinds of examples are designed. The first one is regular grid with trinary values, the selection of regular grid is particularly motivated by widely use of remote sensing data. The second one is irregular spatial regions, such as land cover statues. The results suggest that the trinary joincount is useful and could be used in spatial autocorrelation analysis. Keywords: trinary variable, join-counts statistics, spatial autocorrelation. Introduction Spatial autocorrelation must handle two kinds of geographic data--continuous valued variables and nominal variables (Wrigley 985). Spatial statistics is a value that tells you something about your data. Descriptive statistics can be displayed on a map (e.g., mean core, standard distance), and inferential statistics are values that tell you whether there is a pattern or a relationship within you data. Different statistical methods are generally required for each of these types of data, so it is not surprising that different spatial autocorrelation statistics are required as well(john Odland 988). The frequently used spatial autocorrelation statistic for nominal variable is binary join-counts. Join-count statistics are the simplest measure of spatial autocorrelation which were put forward by Moran in 948. They are used for a binary variable, "0" and "", which are often used referred to as white and black. A join links two neighboring areas, the possible types of joins are limited to white-white(ww), black-white(bw), and black-black(bb). Join-counts are frequencies or counts of the numbers of (0,0), (0,), and (,) joins in the study area, and these numbers are compared to the expected numbers of them under the null hypothesis of no spatial autocorrelation. The observed number of (0,0), (0,) and (,) joins are given by n n n n n n C x x C ( x )( x ) C xx (0,): (0,0): (,): ij i j ij i j ij i j i= j= i= j= i= j= () ISBN: -8466-70-8, ISBN3: 978--8466-70-

Where x i is the binary attribute value related to region i, or 0, C ij is a value assigned to region i and region j by a spatial weighting function, which will be explained in section. Join-count statistics have been widely applied to analysis spatial autocorrelation in remote sensing data (e.g., Congalton, 988; Chuang and Huang, 99a; Pugh and Congalton, 00). Most of these applications on join-count are restrained on binary variables, the values of each spatial unit are either (black) or 0 (white). The third and forth moments for the k-colour join counts are given by Cliff (969) for queen's, bishop's, row-only, and cilumn-only cases. However, three categories are also common case in present world. For example, as to land cover, the attribute of each parcel may be changed, unchanged or uncertain which represents parcels not belonging to the first two categories. In three valued logic, values could be true, false or unknown. This paper extended join-counts to trinary join-count. The trinary categories are referred as black, white and gray in this paper and the possible type of joins are limited to black-black (BB), white-white (WW), gray-gray (GG), black-white (BW), black-gray (BG) and white-gray (WG). In order to calculating joins, we assign a trinary variable x i to each region with x i = if region is black, x i =0 if region is white, and x i =- if region is gray. Six types exhaust sample are (,), (0,0), (-,-), (0,), (0,-), (,-). The rest of paper is organized as follows: the following section explained the three common contiguity patterns - rooks case, bishops case and queens case for weight matrixes. In section 3, formulas for counting six kinds of joins are deduced. In section 4, expected means and variances for the statistics are deduced. And in section 5, two kinds of examples are designed. The first one is regular grid with trinary values, the selection of regular grid is particularly motivated by widely use of remote sensing data. The second one is irregular spatial regions, such as census blocks.. Weight Matrices Weight matrixes provide the means of specifying spatial dependence in spatial lattice models. A weight matrixes C is a non-negative, n by n matrix, with zeros on the diagonal and C ij >0, if observation j influences observation i. It is always symmetric. Various weight matrixes that meet these requirements can be constructed using contiguity relation, nearing neighbors, Delaney triangles, or decreasing functions of distance, etc.. In spatial autocorrelation analysis considering of continuous data in a raster format, some measure of contiguity is required. Adjacency matrix is the simplest one. if two areas are contiguous, 0 otherwise (John 988). Contiguity has a rather broad definition depending on the theme, however, most analyses in spatial autocorrelation adhere to a common definition of neighborhood relations. Namely, neighborhood relations are defined as either rooks case, bishops case or queens case(sawada 007). Rooks case contiguity is by a neighborhood of 4 locations adjacent to each cell, Bishops only considers the diagonals of the relation and Queens case considers a neighborhood of eight cells. Of these three, the Rooks case is the most commonly used and most programs only will compute this particular case. When talking about the contiguity matrix or weight matrix below, only those cells that have a Rooks case contiguous relation are given a value of. 3. Trinary Variables Join Counts In order to get a set of simple calculating formulas to count the 6 kinds of joins, as mentioned before, we assign, or 0, or - to each region with three different attribute vale (which displayed in different colors), such as if region i is black, 0 if region i is white and - if region i is gray. The possible types of joins are (,), (0,-), (-,-), (,-).Let n is the total number of spatial units, the join counts calculating formulas of (0,), (0,0), (,), (0,-), (-,-), (,-) are deduced as:

n n n n Cij xi xj ( + xi + xj)( xi + xj) Cij( xi)( xj)( + xi + xj)( + xi + xj) i= j= i= j= (0,) : (0,0) : 4 4 n n n n Cijxx i j( xi + xj)( + xi + xj) Cij( xi + xj)( xi + xj )( + xi + xj)( xi xj) i= j= i= j= (,): (0, : 6 n n n n Cx ij ixj( xi + xj)( xi + xj ) Cijxx i j( xi + xj )( + xi + xj) i= j= i= j= (, : (, : 6 8 with x i as the feature value of unit i, and C ij as the element in the spatial weights matrix corresponding to the observation pair of i and j. In general, the tests statistic is computed by transforming the join-counts into standardized value Count Z, which is calculated by dividing the deviation by variance: Count E( Count) Count Z = Var( Count) (3) where Count is the join count, E(Count) is the theoretical mean and Var(Count) is the theoretical variation. As to binary variables, if Count Z of joins are all approaching zero, the values in the study area are randomly distributed. A negative and significant (0,) Count Z indicates the count of (0,) join is less than that under random distribution, 0 and are more spatially clustered than could be caused by chance and (0,0) and (,) joins are dominant;a positive Count Z of (0,) indicates the join counts of (0,) are more than that under spatial random distribution, 0 and are dispersion, positive and significant (0,) Count Z indicates the join of (0,0) and (,) are infrequent(chuang and Huang, 99). As to trinary variable join-counts, the standard Count Z of six kinds of joins can be compared with the theoretical value under random hypothesis. The standardizing formulas are similar to formula (3). Analysis the character of Count Z of (0,), (0,-) and (-,) joins, assisting with the Count Z of (0,0), (-,-) and (,) joins, we can interpret the spatial clustering of units with 0, and -. 4. Expected Means and Variances for Trinary Join-count Statistics The tests statistic is computed by transforming the join-counts into standardized value, which is calculated by dividing the deviation by variance. In this section, means and variance of trinary join-counts are calculated under two sampling assumptions. For different assumptions about the data, the theoretical expressions for E(0,0), E(,), E(-,-), E(0,), E(0,-)and E(,-) will vary, because different sampling process are associated with slightly different rules for assigning values to regions and they lead to different variances for the expected join-counts. Two sampling assumptions of simple random sampling are available for categorical data. They amount to sampling with replacement (free sampling) and sampling without replacement (nonfree sampling). Sampling without replacement imposes fixed numbers of black, white and gray regions on the study area. The spatial arrangement of black, white and gray regions will vary in repeated trials but the numbers of these three kinds of regions always matches the number observed for the data. Sampling without replacement corresponds to sampling from a hypergeometric distribution. The choice between the two sampling assumptions should be based on some knowledge about the underlying process that assign values to regions. As to remotely sensed data, an assumption of sampling without replacement is probably more appropriate and the statistical hypothesis is that the number of joins for each combination follows a normal random distribution, because the numbers of black, white and gray units are limited to the observed value. The numbers of black, white and gray regions are fixed under sampling without replacement, so the assumption of random assignments means that the probability that any region is black is n b /n, where nb is the number of black regions and n is the total number of regions. The probability that the members of any pair of regions are both black is n b (n b -)/n(n-), and the probability that one member is black and the other () 3

white is n b (n w -)/n(n-). The same is true for other pairs. So, in the case of sampling without replacement, we deduced the expected counts and variances of all joins as formula (4) and (5): nn b w nw( nw S0 nb( nb S0 E(0,) = S0 E(0,0) = E(,) = nn ( nn ( nn ( nn w g nn b g ng( ng S0 E(0, ) = S0 E(, ) = S0 E(, ) = nn ( nn ( nn ( S 0 = n n C ij i= j= Where n w is the number of white regions, n g is the number of gray regions, and S 0 indicate summation over pairs of regions under the taken weighting function. And their variances are Snb( nb ( S S) nb( nb ( nb ) ( S0 + S S) nb( nb ( nb )( nb 3) S0nb( nb Var(), = + + 4 nn ( nn ( ( n ) nn ( ( n )( n 3) nn ( Snn b w ( S S) nn b w( nb + nw ) 4( S0 + S S) nb( nb nw( nw S0nb( nw Var(0, ) = + + nn ( nn ( ( n ) nn ( ( n )( n 3) nn ( Sn w( nw ( S S) nw( nb ( nw ) ( S0 + S S) nw( nw ( nw )( nw 3) Sn 0 w( nw Var(0, 0) = + + 4 nn ( nn ( ( n ) nn ( ( n )( n 3) nn ( Sn wng ( S S) nwng( nw + ng ) 4( S0 + S S) nw( nw ng( ng S0nw( ng Var(0, ) = + + nn ( nn ( ( n ) nn ( ( n )( n 3) nn ( Snbng ( S S) nbng( nb + ng ) 4( S0 + S S) nb( nb ng( ng S0nb( ng Var(, ) = + + nn ( nn ( ( n ) nn ( ( n )( n 3) nn ( Sn g( ng ( S S) ng( ng ( ng ) ( S0 + S S) ng( ng ( ng )( ng 3) Sn 0 g( ng Var(, ) = + + 4 nn ( nn ( ( n ) nn ( ( n )( n 3) nn ( n n n in which n n S = ( Cij + Cji ) and S = Cij + Cji i= j= i= j= j= 5. Examples Cliff and Ord (975) have examined the efficiency of the 3 test statistics of binary variable by calculating the asymptotic relative efficiencies for the BB and BW statistics under a variety of circumstances. The asymptotic relative efficiency is related to the power of a test. They conclude that test of the numbers of (BW) joins are preferable to tests of the number of (BB) or (WW) joins (Wrigley 985). As to trinary variables, the (BW), (BG) and (WG) statistics are probably the most reliable ones, it may be useful to calculate all statistics for some problems. In the examples, we calculate all the statistics to catch some conclusions. Two kinds of examples are designed. The first one is regular grid with trinary values, the selection of regular grid is particularly motivated by widely use of remote sensing data, the main aim of the first example is to check the correctness of aforementioned formulas in which the values (or image) are based on simulated values. The second one is irregular spatial regions, such as census blocks. 5.. Example There three kinds of spatial regular planar tessellations are often used in the research of spatial autocorrelation, which are triangles, squares, and hexagons (Barry and Michael, 000). A regular tessellation consists of squares are designed to check the correctness of the Join-count statistics about trinary variables. The selection of squares is particularly motivated by their use in raster based GIS and remote sensing. In the study area, there three kinds of possible status of each unit, black, white, and gray, with corresponding values of, 0, and -. Six schemes are elaborately designed in the study area, see Fig.. One is spatial random distribution, see scheme (a). Scheme (b) is an extreme clustering structure with just one clustering core, there one big black (4) (5) 4

clustering block in the study area have value to each unit. This is a typical structure of spatial autocorrelation with one clustering core. Scheme (c) is an clustering structure with two clustering cores, there one black clustering block in the bottom-left have value to each unit, and one white clustering block in the top-right have value to each unit. This is a typical structure of spatial autocorrelation with one clustering core. Scheme (d) is an extreme clustering structure with three clustering cores, there three clustering zones in the study area with different value, it is a representation of multi-core spatial clustering, and scheme (e) and (f) are two extreme dispersion structures in part of the study area respectively with unitpairs have the value of (0,) and (0,-). (g) in Fig. is the legend of scheme (a)-(f). The six kinds of structures are typical, the join counts can be counted by manual to check the correctness, and the spatial clustering or dispersion can be recognized directly by eyes to check the results from the aforementioned formulas. In this section, the simplest weight, contiguity weight matrix, are used. We get the first order rook contiguity weights as C ij equal to if unit i and unit j are adjacent by a common side, 0 otherwise. (a) (b) (c) (d) (e) (f) (g) Fig.: The six simulated distribution schemes We calculate the Moran s I by software GeoDa of the six structures.the Moran s I are listed in Table. As to the scheme (a), the standardized Moran s I is -0.0038, which means there no significant spatial autocorrelation in the study area. It is consistent with the intuitionistic impress. There are 7 black units, 59 white units, and 35 gray units. As to the scheme (b) and (c), the standardized Moran s I are 0.3944 and 0.4370 respectively, which means there are very significant spatial clustering in the study area. There are 8 black units, 5 white units, and 5 gray units in (b), and 50 black units, 54 white units, and 7 gray units in (c). As to the scheme (d), the standardized Moran s I is 0.9337, which means there are very notable spatial autocorrelation in the study area. There are 44 black units, 33 white units, and 44 gray units. As to the scheme (e) and (f), the standardized Moran s I is -0.703 and -0.364 respectively, which means there are Table : Join-count statistics of (a) in Fig. scheme a b c d e f Moran s I -0.0038 0.3944 0.4370 0.9337-0.703-0.364 significant and negative spatial autocorrelation in the study area. There are 47 black units, 60 white units, and 4 gray units in scheme (e), 7 black units, 60 white units, and 54 gray units in scheme (f). The Join-count statistics are listed in Table -7. Table : Join-count statistics of (a) in Fig. (0,0) 54 5.306 0.53 0.6 (0,) 53 47.874 0.533 0.487 (0,-) 57 6.059 0.866-0.466 (,) 9 0.954 4.553-0.49 (,-) 8 8.400 9.055-0.044 (-,-) 9 8.407 6.9 0.097 Choosing 0.05 (or equivalently, 5%) as the significance level, the threshold for significance is ±.96. In Table, the join counts of (0,), (0,0), (,), (0,-), (-,-), (,-) are 53, 54, 9, 57, 9 and 8, which can be verified by manual count. This denotes that formular () which calculating the join counts are correct. The standard join counts of (0,), (0,-) and (,-) in Table are 0.487, -0.466 and -0.044 respectively. All of 5

them are approaching zero, which means that in the whole study area, all of the three kinds of units are randomly distributed, there are no significant spatial clustering. 6 Table 3: Join-count statistics of (b) in Fig. (0,0) 0 9.394 4.665.546 (0,) 3 60.8565 0.8593 -.7494 (0,-) 8.697 6.796.089 (,) 5 98.5875 3.535.0080 (,-) 9 36.539 9.8366 -.7805 (-,-) 7 3.3809.3076.5683 In Table 3, the standard join counts of (,) is significant and positive, and the standard join counts of (0,) and (,-), the unlike joins have relation with units, are significant and negative. We can get that the units with value are spatial clustering. Although the standard join count of (0,0) is also significant and positive, and the relative count of join (0,) is significant and negative, but the standard join counts of (0,-) is neither significant and positive nor significant and negative(0.930). Then we can get that the units with value 0 are not spatial clustering. Similar conclusion can be deduced about the units with value -. Together with the aforementioned conclusion, we can get that the black units with value have spatial clustering effect. In Table 4, the standard join counts of (0,) is -4.7053, significant and negative, while.398 and 3.4999 for (0,0) and (,) respectively, significant and positive, and the others are not significant. It means the units with value of 0 is clustering. In Table 5, the standard join counts of (0,) (0,-) and (,-) are -3.6, -3.6 and -5.37 respectively. All of them are significant and negative, which means the counts of (0,) (0,-) and (,-) are significantly less than that under random distribution. Significant and positive standard counts of (0,0) (,) and (-,-) indicate the counts of (0,0) (,) and (-,-) are significantly more than that under random distribution, and means the units with value of 0, and - are clustering. Table 4: Join-count statistics of simulating structure (c) in Fig. (0,0) 67 43.867 9.6930.398 (0,) 3 8.40 0.4440-4.7053 (0,-) 3 7.5883 8.966 0.493 (,) 69 37.5657 8.986 3.4999 (,-) 3 5.5447 8.74 -.4400 (-,-) 7 4.346.6660 0.9968 Table 5: Join-count statistics of (d) in Fig. (0,0) 5 6.364 5.76 6.3 (0,) 43.636 0.3-3.6 (0,-) 43.636 0.3-3.6 (,) 73 9.09 7.864 5.584 (,-) 0 58.8 0.830-5.37 (-,-) 73 9.09 7.864 5.584 In Table 6, the standard join counts of (0,) is significant and positive, this means that the units with value and 0, are not spatial clustering. Together with the standard join counts of (0,0) and (, ) are all significant and negative (-3.890 and -3.9379), we can get that the units with value, 0 are spatial dispersing. The conclusion is consistent with the anticipative results of the structure in Fig.(e). As to the (0,- ) dispersing scheme, the result in Table 7 is similar to Table 6, the standard join counts of (0,-) is significant and positive, and the standard join counts of (0,0) and (-,-) are all significant and negative.

Table 6: Join-count statistics of (e) in Fig. (0,0) 0 54.0947 0.695-3.890 (0,) 59 84.7483 0.46 7.468 (0,-) 5.44 8.679-0.4894 (,) 0 33.93 8.49-3.9379 (,-) 4 9.7746 7.8794-0.739 (-,-) 6.945.36.433 Table 7: Join-count statistics of (f) in Fig. (0,0) 0 54.0947 0.695-3.890 (0,) 8.6 6.506 0.87 (0,-) 6 97.3704 9.0 7.040 (,) 0 0.7363 0.9757-0.7546 (,-) 8.3599 6.08-0.548 (-,-) 43.867 9.6930-3.85 The calculating results of the six kinds of simulated distribution are all consistent with the anticipative impress of the schemes and the Moran s indices, we can get that the Formula () and (4) are correct. And we can get some rules to determine the spatial distribution model of trinary variable: () Spatial random distribution. When all the standardized join-count of (0,), (0,-) and (,-) are not significant and positive or negative under certain level of significance. Conversely, if all the standardized join-count of between different attributes are not significant, we can determine a spatial random distribution. () Spatial clustering. The standardized join-count of the like joins of clustering attribute is significant and positive, and the standardized join-count of the unlike joins about the clustering attribute is significant and negative. One-attribute and multi-attribute spatial clustering are in a similar way. It means that if the standardized join-count of the like join of one attribute is significant and positive, and the standardized joincount of unlike joins about this attribute are significant and negative, we can determine a spatial clustering of this attribute. (3) Spatial dispersion: The standardized join-counts of like joins of dispersion attribute is significant and positive, while the standardized join-count of unlike join is significant and negative. On the other hand, if the standardized join-count of unlike joins of certain attribute is significant and negative, and the standardized join-count of like joins about that attributes are significant and positive, we can determine a spatial dispersion of this attribute. 5.. Example As to land cover, the character of each spatial units may be changed, unchanged or unsure, Fig is a land cover statues of a region of China in a certain period, in which are unchanged blocks, 0 are changed ones, and - are unsure ones. We calculated the standardized join-counts which are listed in Table 8. Fig.: Land cover statues Choosing 0.05 (or equivalently, 5%) as the significance level, the standard join counts of (,-) is significant and negative (-3.708), while 3.635 and.40 for (,) and (-,-) respectively, significant and positive, and the others are not significant. It means the unchanged land cover blocks are spatial clustering. 7

6. Conclusion 8 Table 8: Join-count statistics of land cover statues in Fig. (0,0) 3 6.44 5.749-0.5995 (0,) 5 57.9804 0.873-0.6447 (0,-) 5 9.49 9.69-0.484 (,) 89 5.43 0.40 3.635 (,-) 8 5.877 0.6843-3.708 (-,-) 64 3.69 5.0594.40 Based on the theory of binary Join-count statistics, this paper deduced the formulas to calculate the trinary Join-count statistics, including frequency counts of all joins, their expectations and variations. The simulating calculations have proved the formulas are correct and the explanations are reasonable. The practical example has proved the method is useful in image spatial autocorrelation analysis. From the results of Example, we can get that, six typical spatial distributions can be detected by the Join-count statistics of trinary variables which are produced in Section 3 and Section 4. It is in accordance with the Moran s indices and manual counted joins. If all the standardized join-counts of between different attributes are not significant, we can determine a spatial random distribution, if the standardized join-count of the like join of one attribute is significant and positive, and the standardized join-count of unlike joins about this attribute are significant and negative, we can determine a spatial clustering of this attribute, and if the standardized join-count of unlike joins of certain attribute is significant and negative, and the standardized join-count of like joins about that attributes are significant and positive, we can determine a spatial dispersion of this attribute. Based on these criterias and the calculating results of Example, we get that the unchanged land cover blocks are spatial clustering in the study area. The results suggest that the trinary join-count is useful and could be used in spatial autocorrelation analysis. 7. Acknowledgements This study was funded by the National Natural Science Foundation of China (4060400) and the Key Subjects of Natural Science Foundation of China (4073056). 8. References [] A. Cliff and J. K. Ord. Spatial Processes, Models and Applications, London: Pion, 98. [] B. Barry and T. Michael. Global and local spatial autocorrelation in bounded regular tessellations. Geograpgical Systems, 000, 4: 39-348. [3] J. Arthur and J. Lembo. Spatial Autocorrelation, Join Count Analysis, [007-05-6], http://www.css. cornell.edu /courses/ 60/lecture0.ppt. [4] K. Chuang and H. Huang, Comparison of chi-square and join-count methods for evaluating digital image data, IEEE Trans. Medical Image. 99, : 8-33. [5] K. Chuang and H. Huang,. Assessment of noise in a digital image using the Join-count statistic and the Moran test, Phys. Med. Biol. 99, : 357-369. [6] M. F. Dacey. A review of measures of continguity for two- and k-color maps. Prentice-Hall, Engle wood Cliffs, 968. [7] M. Sawada. Global Spatial Autocorrelation Indices - Moran's I, Geary's C and the General Cross-Product Statistic. [007-06-6], http://www.lpc.uottawa.ca/publications/moransi/moran.htm. [8] N.A. Cressie. Statistics for Spatial Data, John Wiley: Chichester, 995. [9] N. Wrigley. Caregorical data analysis for geographers and environmental scientists. London: Longman, 985. [0] O. John. Spatial autocorrelation. Newbury Park: Sage Pub, 988. [] R. Congalton. Using spatial autocorrelation analysis to explore the errors in maps generated from remotely sensed data. Photogrammetric Engineering and Remote Sensing, 988, 5: 587-59.