AUTOMATIC CATEGORIZATION OF CLUSTERS IN UNSUPERVISED CLASSIFICATION

Similar documents
LAND COVER CATEGORY DEFINITION BY IMAGE INVARIANTS FOR AUTOMATED CLASSIFICATION

Land cover/land use mapping and cha Mongolian plateau using remote sens. Title. Author(s) Bagan, Hasi; Yamagata, Yoshiki. Citation Japan.

IMPROVING REMOTE SENSING-DERIVED LAND USE/LAND COVER CLASSIFICATION WITH THE AID OF SPATIAL INFORMATION

Geospatial technology for land cover analysis

International Journal of Scientific & Engineering Research, Volume 6, Issue 7, July ISSN

Progress and Land-Use Characteristics of Urban Sprawl in Busan Metropolitan City using Remote sensing and GIS

A RESEARCH ON THE URBAN DISASTER PREVENTION PLAN CONCERNIN G EARTHQUAKE RISK FORECAST BY REMOTO SENSING TN THE TOKYO BAY AREA

Landuse and Landcover change analysis in Selaiyur village, Tambaram taluk, Chennai

Information theoretical approach for domain ontology exploration in large Earth observation image archives

DAMAGE DETECTION OF THE 2008 SICHUAN, CHINA EARTHQUAKE FROM ALOS OPTICAL IMAGES

M.C.PALIWAL. Department of Civil Engineering NATIONAL INSTITUTE OF TECHNICAL TEACHERS TRAINING & RESEARCH, BHOPAL (M.P.), INDIA

1. Introduction. S.S. Patil 1, Sachidananda 1, U.B. Angadi 2, and D.K. Prabhuraj 3

Deriving Uncertainty of Area Estimates from Satellite Imagery using Fuzzy Land-cover Classification

AUTOMATED BUILDING DETECTION FROM HIGH-RESOLUTION SATELLITE IMAGE FOR UPDATING GIS BUILDING INVENTORY DATA

The Self-adaptive Adjustment Method of Clustering Center in Multi-spectral Remote Sensing Image Classification of Land Use

Yanbo Huang and Guy Fipps, P.E. 2. August 25, 2006

AN INVESTIGATION OF AUTOMATIC CHANGE DETECTION FOR TOPOGRAPHIC MAP UPDATING

A COMPARISON BETWEEN DIFFERENT PIXEL-BASED CLASSIFICATION METHODS OVER URBAN AREA USING VERY HIGH RESOLUTION DATA INTRODUCTION

CSC Neural Networks. Perceptron Learning Rule

o 3000 Hannover, Fed. Rep. of Germany

CLUe Training An Introduction to Machine Learning in R with an example from handwritten digit recognition

COMPARISON OF PIXEL-BASED AND OBJECT-BASED CLASSIFICATION METHODS FOR SEPARATION OF CROP PATTERNS

High resolution wetland mapping I.

Module 2.1 Monitoring activity data for forests using remote sensing

Comparison of MLC and FCM Techniques with Satellite Imagery in A Part of Narmada River Basin of Madhya Pradesh, India

ESTIMATION OF LANDFORM CLASSIFICATION BASED ON LAND USE AND ITS CHANGE - Use of Object-based Classification and Altitude Data -

EFFECT OF ANCILLARY DATA ON THE PERFORMANCE OF LAND COVER CLASSIFICATION USING A NEURAL NETWORK MODEL. Duong Dang KHOI.

IMAGE CLASSIFICATION TOOL FOR LAND USE / LAND COVER ANALYSIS: A COMPARATIVE STUDY OF MAXIMUM LIKELIHOOD AND MINIMUM DISTANCE METHOD

ASSESSING THEMATIC MAP USING SAMPLING TECHNIQUE

A Method to Improve the Accuracy of Remote Sensing Data Classification by Exploiting the Multi-Scale Properties in the Scene

Classification trees for improving the accuracy of land use urban data from remotely sensed images

Principals and Elements of Image Interpretation

SATELLITE REMOTE SENSING

LAND USE MAPPING FOR CONSTRUCTION SITES

Land Cover Classification Over Penang Island, Malaysia Using SPOT Data

2.2 Geographic phenomena

PRINCIPLES OF PHOTO INTERPRETATION

Spatial Planning Platform Meeting

Sparse Representation-based Analysis of Hyperspectral Remote Sensing Data

Overview on Land Cover and Land Use Monitoring in Russia

Molinas. June 15, 2018

Preparation of LULC map from GE images for GIS based Urban Hydrological Modeling

9/26/17. Ridge regression. What our model needs to do. Ridge Regression: L2 penalty. Ridge coefficients. Ridge coefficients

URBAN LAND COVER AND LAND USE CLASSIFICATION USING HIGH SPATIAL RESOLUTION IMAGES AND SPATIAL METRICS

GLOBAL/CONTINENTAL LAND COVER MAPPING AND MONITORING

International Journal of Intellectual Advancements and Research in Engineering Computations

Impacts of sensor noise on land cover classifications: sensitivity analysis using simulated noise

KNOWLEDGE-BASED CLASSIFICATION OF LAND COVER FOR THE QUALITY ASSESSEMENT OF GIS DATABASE. Israel -

Urban land cover and land use extraction from Very High Resolution remote sensing imagery

Land cover classification methods

1. Introduction. Chaithanya, V.V. 1, Binoy, B.V. 2, Vinod, T.R. 2. Publication Date: 8 April DOI: /cloud.ijarsg.

DEPENDENCE OF URBAN TEMPERATURE ELEVATION ON LAND COVER TYPES. Ping CHEN, Soo Chin LIEW and Leong Keong KWOH

GEOG 4110/5100 Advanced Remote Sensing Lecture 12. Classification (Supervised and Unsupervised) Richards: 6.1, ,

Land Cover Project ESA Climate Change Initiative. Processing chain for land cover maps dedicated to climate modellers.

STUDY OF NORMALIZED DIFFERENCE BUILT-UP (NDBI) INDEX IN AUTOMATICALLY MAPPING URBAN AREAS FROM LANDSAT TM IMAGERY

Network optimization of urban heat island measurements -Effect of reduction of observation points-

7.1 INTRODUCTION 7.2 OBJECTIVE

DIFFERENTIATING A SMALL URBAN AREA FROM OTHER LAND COVER CLASSES EMPLOYING LANDSAT MSS

OBJECT BASED IMAGE ANALYSIS FOR URBAN MAPPING AND CITY PLANNING IN BELGIUM. P. Lemenkova

Spatio-Temporal Analysis of Land Cover/Land Use Changes Using Geoinformatics (A Case Study of Margallah Hills National Park)

Human Activities and Environmental Risks Natural Hazards and Urban Development Issues Vallée de la Bruche, Alsace

The Positional and Thematic Accuracy for Analysis of Multi-Temporal Satellite Images on Mangrove Areas

6. Image Classification

Object-based feature extraction of Google Earth Imagery for mapping termite mounds in Bahia, Brazil

LAND USE MAPPING AND MONITORING IN THE NETHERLANDS (LGN5)

Developments in Remote Sensing and Object-Based Image Classification for Urban Applications at University of Sao Paulo

URBAN MAPPING AND CHANGE DETECTION

Mitsuhiko Kawakami and Zhenjiang Shen Department of Civil Engineering Faculty of Engineering Kanazawa University Japan ABSTRACT

Fundamentals of Photographic Interpretation

Table 1 PRISM/OB1 data summary in this study. PI400-2 Table 2 PRISM/OB1 data summary in this study. 2'#7-#/ :> 8>?= B-/"# 899:;98;<= :> 8A89

IMPACTS OF SEA LEVEL RISE in Bangkok, Thailand

An Automated Object-Oriented Satellite Image Classification Method Integrating the FAO Land Cover Classification System (LCCS).

CORRELATION BETWEEN URBAN HEAT ISLAND EFFECT AND THE THERMAL INERTIA USING ASTER DATA IN BEIJING, CHINA

GEOGRAPHICAL DATABASES FOR THE USE OF RADIO NETWORK PLANNING

Application of Goodall's Affinity Index in Remote Sensing Image Classification

9/12/17. Types of learning. Modeling data. Supervised learning: Classification. Supervised learning: Regression. Unsupervised learning: Clustering

URBAN LAND COVER DETECTION EMPLOYING A WINTER LANDSAT MSS DATA SET

Comparison between Land Surface Temperature Retrieval Using Classification Based Emissivity and NDVI Based Emissivity

Contents. Introduction Study area Data and Methodology Results Conclusions

WGIA7 9th July, Noriko KISHIMOTO

A COMPARISON OF INTER-ANALYST DIFFERENCES IN THE CLASSIFICATION OF A LANDSAT ETM+ SCENE IN SOUTH-CENTRAL VIRGINIA

Digital Change Detection Using Remotely Sensed Data for Monitoring Green Space Destruction in Tabriz

Monitoring of Forest Cover Change in Sundarban mangrove forest using Remote sensing and GIS

Land cover classification methods. Ned Horning

CHAPTER 2 REMOTE SENSING IN URBAN SPRAWL ANALYSIS

Dr.N.Chandrasekar Professor and Head Centre for GeoTechnology Manonmaniam Sundaranar University Tirunelveli

Outline: Introduction - Data used - Methods - Results

Introduction to Machine Learning

ISO Land Cover for Agricultural Regions of Canada, Circa 2000 Data Product Specification. Revision: A

Evaluating Urban Vegetation Cover Using LiDAR and High Resolution Imagery

Development and Implementation of the Japanese Enhanced Fujita Scale

DETECTION AND ANALYSIS OF LAND-USE/LAND-COVER CHANGES IN NAY PYI TAW, MYANMAR USING SATELLITE REMOTE SENSING IMAGES

Urban Growth Analysis: Calculating Metrics to Quantify Urban Sprawl

the Clinch/Hidden Valley study site were used in this mountainous classification.

Land Cover and Asset Mapping Operational Change Detection

Monitoring and Change Detection along the Eastern Side of Qena Bend, Nile Valley, Egypt Using GIS and Remote Sensing

Modeling Complex Temporal Composition of Actionlets for Activity Prediction

Trimble s ecognition Product Suite

Table 4.1 Images used for the Study. Spatial Resolution (Meter) Resolution (Bits) 30 (For 6th band 120)

The Road to Data in Baltimore

Transcription:

UTOMTIC CTEGORIZTION OF CLUSTERS IN UNSUPERVISED CLSSIFICTION Sunpyo HONG. Kiyonari FUKUE. Haruhisa SHIMOD, Toshibumi SKT Tokai University Research and Information Center 2-28-4 Tomigaya. Shibuya-ku. Tokyo 151. JPN BSTRCT: cluster categorization used for remote sensing performed automatically. process. method is necessary when an unsupervised classification is image classification. It is desirable that this method is because manual categorization is a highly time consuming In this paper. several automatic determination methods were proposed and evaluated. They are 1) maximum number method, which assigns the target cluster to the category which occupies the largest area of that cluster; 2) maximum percentage method, which assigns the target cluster to the category which shows the maximum percentage within the category in that cluster; 3) minimum distance method, which assigns the target cluster to the category having minimum distance with that cluster; 4) element ratio matching method. which assigns the local region to the category having the most similar element ratio of that region. From the results of experiments. it was certified that the result by the minimum distance method was almost the same as the result made by a human operator. Kef Words: unsupervised classification, post processing, categorization, clustering. 1. INTRODUCTION With the launch of second generation high resolution sensors like LNDST TM and SPOT HRV, clustering method has been revaluated recently. However, the main problem of clustering for practical use is that clustering is an unsupervised classification. That is. clusters generated by clustering are defined in feature vector space. not in image data. Therefore. in order to use the classified result for a meaningful reference map. it is necessary to determine the relation of clusters and categories, and to label the classified result with the categories. Conventionally. this relation has been determined mainly by interpretation of an operator. However. this process is time consuming and is not objective. The purpose of this research is to try several methods of automatic categorization and find out the most useful method. In this paper, 4 methods have been exami ned. 2. PROBLEMS OF CONVENTIONL METHOD In this method, each classified cluster is overlaid with the target image data on the display, and that cluster is interpreted by an operator to determine the category. Therefore, it can be thought that the obtained result is natural and reliable. However. since everything is deterlined by an operator in this lethod, there are many problems as follows. (1) The result depends on the skill of an operator. (2) Objective and quantitative evaluation is difficult. (3) It is time consuming when the number of cluster is large or there are many small clusters. 3. UTOMTIC CTEGORIZTION METHOD To solve the above problems. several automatic categorization methods are considered as follows. In all methods, training category areas(tc) are first extracted from the target image similar to supervised trainings. (1) Maximum Number Method In this method, the number of pixels in each TC for each cluster is calculated. Then the category having the maximum number is assigned to that cluster. (2) Maximum Percentage Method In this method, for each cluster. the percentage(occupation rate) of that cluster in each TC is calculated. Then the category having the maximum percentage is assigned to that cluster. Fig. 1 shows a comparison of these two methods in a simple case. Suppose that cluster k is composed of three categories, Band C. s shown in Fig. 1(a), category occupies the largest area in cluster k and C occupies the minimum area. In the maximum number method. cluster k is always assigned to category. However, this figure does not show the difference of areas of each category. Fig. l(b) shows the case that the total area of each category is the same and (c) shows the case that the total area of each category is different. s shown from this figure. categories which occupy small areas in the image tends to be neglected in the maximum number method. On the contrary. small area categories 139

are treated favorably in the maximum percentage method as shown in Fig. 1. (3) Minimum Distance Method In this method, first, TC is overlaid with the target image and the centroid of ecah category is calculated. Secondly, the distance between each category and each cluster is calculated. Then the category having the minimum distance is assigned to that cluster. For the distance, Eucl idian distance was used in this experiment. In the case of the maximum number method and the maximum percentage method, the result is dependent upon the size and location of training area. In the minimum distance method, it needs less number of pixels for TC compared to other 3 methods but needs caution on spectral featute of TC. (4) Element Ratio Matching Method In this method, based on the idea that the category can be represented by the set of clusters, the relation of categories and cluster is determined by the ratio of clusters which compose a category. For defining the element ratio. area is necessary. Therefore, the process is performed by local region as follows. t first, TC is overlaid with the result of clustering, which is classified by the clusters. and the element ratio of each category is calculated. Secondly, for each local region that is fixed on the result of clustering, element ratio is calculated and matched with the element ratio of each category precalculated. Then the category having the most sililar element ratio of concerned region is assigned to that region. 4. EXPERIMENTS ND RESULT 4.1 FLOW.Q EXPERIMENTS In order to evaluate the proposed methods described in chapter 3. following LND ST TM data was used in the experilent. Sagami River basin was seleted "for target area to pereform the quantitative evaluation. This area includes the test site area which landcover is already investigated and categorized in 52 categories. sensor LNDST TM date Nov. 4, 1984 path-row 107-35 area Sagami River basin in Japan pi xe I size: 25m X 25m image size: 512 X 480 pixels t first. clusters were generated by a hierarchical clustering using Ward lethod. Since the image data in remote sensing is very large. usually clustering is performed with sampled data. In this experiment, 2500 samples(about 1% of entire image data) were used to generate 66 clusters. Based on the 66 clusters, the target image data was classified by a maximum likelihood method. Secondly, representative area of each category in the target image(training category area) was selected. 14 categories were selected as shown in Table 1. Finally, the relations of clusters with categories were determined by 4 methods described in chapter 3. To evaluate quantitatively, classification accuracy was estimated based on the test site area. The classification accuracy was ca I cu I a ted over 5 maj or ca tegories as shown in table 1 to adjust the selected 14 categories in target image to 52 test site categories. 4.2 Results Qf Experiments (1) Maximum Number Method Fig. 2 shows the result by this method. data. s expected, this result shows the tendency that clusters were labeled with the categories having large area such as urban. (2) Maximum Percentage Method Fig. 3 shows the result by this method, data. s expected, this result shows the tendency that clusters were labeled with the categories having small area such as other. (3) Minimum Distance Method Fig. 4 shows the result by this method, area. This result is very similar to the result(fig. 2) by a human operator. In other words, a good result was obtained with roughly selected TC. (4) Element Ratio Matching Method Fig. 5 shows the result by this method, area. In this method. 5 X 5 sized local region was used. (5) Conventional Supervised Method For the purpose of comparison, conventional supervised method was executed. In this method, clusters were categorized by a human operator. 66 clusters were labeled with 14 categories (Table 1) as shown in Fig. 6. Table 2 shows the classification accuracy by this method with the test site area. 5. CONCLUSIONS Four categorization methods for clusters were evaluated by experiments using LND ST TM data. From the results of experiments, following conclusions were obtained. (1) In method, the the case of the maximul clusters are hardly number labeled 140

with the small area categories. On the contrary, in the case of the maximum percentage method, the clusters are hardly labeled with large area categories. These are general problems for these two methods, because usually the area of each category is not the same. (2) The classification accuracies(area weighted mean) were the hightest for the element ratio matching method but detail information such as river or road was lost as shown in Fig. 6 because that cateogorization was performed on each local region. (3) The classification accuracies(simple mean) were the hightest for supervised method, second, the minimum distance method. the lowest for the elemene ratio matching method. (4) From the view point of practical use, the maximum number method,the maximum percentage method and the element ratio matching method are easy to execute because they don't need the spectral feature, but the classification accuracies (simple mean) of three methods are lower than that of the minimum distance method. (5) mong the methods which were evaluated, the minimum distance method showed best result. In this method, obtained result is almost the same with a supervised method. Theoretically. this method also needs less number of pixels for TC compared to other 3 methods because the geometrical information of TC is not used. category area and occupation rate of cluster k in that category area categorized by maximum number method categorized by max. percentage method Class k BC IJ[]D 50% 30% 20% (b) case of salle size of category ar.ea BC (a) cluster k. composed of category.b,c I B [] C c 22% 30% 80% (c) case of different size of area Fig. 1 Cluster and Categories 141

Fig. 2 Result by Maximum Number Method Fig. 3 Result by Maximum Percentage Method Fig. 4 Result by Minimum Distance Method Fig. 5 Result by Element Ratio Matching Fig. 6 Result by Supervised Method 142

Table 1 Categories and Clusters categories clusters Major selected category test site category Supervised Number Percen t. D ist. coniferous tree coniferous tree. 3 7 5 4 TREE broad leaved tree broad leaved tree, 8 4 4 3 mixed tree mixed tree, bamboo 4 3 5 7 shadow orchard. etc, 2 2 2 2 PDDY paddy paddy 1 0 1 1 1 0 8 city area concrete, factory, 6 4 2 3 URBN house area building, railway, 6 1 1 1 3 1 1 factory house, urban, etc 7 8 8 7 WTER sea sea, river, 2 2 2 2 river poo I, pond 1 1 1 2 farm farlll, vinyl house, 1 6 6 6 OTHER grass land ground. grassland. 1 0 4 4 3 ground lawn, gravel,sand, 5 3 3 8 sand area ordered land, etc 0 0 1 0 5 1 4 44 6 6 6 6 6 6 6 6 Table 2 Classification ccuracies un it: % CTEGORY TREE PDDY URBN WTER OTHER Weighted Simple (occupation Rate) (6.8%) (9.6%) (49.9%) (5.0%) (28.7%) Mean Mean Max. Number 55.9 58.3 84.4 51.7 26.6 61.7 55.4 Max. Percentage 48. 1 61.7 75.9 51.7 35.6 59.9 54.6 Min. Distance 47.5 48.9 85.6 67.8 32.7 63.4 55.5 Element Ratio 33.8 51.5 91.2 58.3 33.6 65.4 53.7 -------------------------- ------------------------------------------------------ Supervised 46.2 56.3 78.0 67.8 46.4 64.2 58.9 ------------ ---------- 143