SECOND ORDER STATISTICS FOR HYPERSPECTRAL DATA CLASSIFICATION. Saoussen Bahria and Mohamed Limam

Similar documents
RVM-based multi-class classification of remotely sensed data

Deep Learning. Convolutional Neural Networks Applications

A New Semi-Supervised Classification Strategy Combining Active Learning and Spectral Unmixing of Hyperspectral Data

Proceedings - AutoCarto Columbus, Ohio, USA - September 16-18, Dee Shi and Xiaojun Yang

Mahalanobis kernel for the classification of hyperspectral images

Classification of High Spatial Resolution Remote Sensing Images Based on Decision Fusion

Hyperspectral image classification using Support Vector Machine

Classification of Pavia University Hyperspectral Image using Gabor and SVM Classifier

Spectral and Spatial Methods for the Classification of Urban Remote Sensing Data

Performance measures for classification systems with rejection

Kernel-Based Retrieval of Atmospheric Profiles from IASI Data

Deriving Uncertainty of Area Estimates from Satellite Imagery using Fuzzy Land-cover Classification

Object-based land use/cover extraction from QuickBird image using Decision tree

1. Introduction. S.S. Patil 1, Sachidananda 1, U.B. Angadi 2, and D.K. Prabhuraj 3

Artificial Neural Network Approach for Land Cover Classification of Fused Hyperspectral and Lidar Data

Analyzing High Dimensional Multispectral Data

Hyperspectral Image Processing: Methods and Approaches

Extreme Sparse Multinomial Logistic Regression: A Fast and Robust Framework for Hyperspectral Image Classification

Adaptive Sampling Under Low Noise Conditions 1

Spatially Adaptive Semi-supervised Learning with Gaussian Processes for Hyperspectral Data Analysis

Performance measures for classification systems with rejection

Testing the Sensitivity of Vegetation Indices for Crop Type Classification Using Rapideye Imagery

AN INTEGRATED METHOD FOR FOREST CANOPY COVER MAPPING USING LANDSAT ETM+ IMAGERY INTRODUCTION

DETECTING HUMAN ACTIVITIES IN THE ARCTIC OCEAN BY CONSTRUCTING AND ANALYZING SUPER-RESOLUTION IMAGES FROM MODIS DATA INTRODUCTION

A Method to Improve the Accuracy of Remote Sensing Data Classification by Exploiting the Multi-Scale Properties in the Scene

HYPERSPECTRAL imaging sensors collect hyperspectral

Robust Multiple Estimator Systems for the Analysis of Biophysical Parameters From Remotely Sensed Data

Support Vector Machine (SVM) & Kernel CE-717: Machine Learning Sharif University of Technology. M. Soleymani Fall 2012

Performance Analysis of Some Machine Learning Algorithms for Regression Under Varying Spatial Autocorrelation

REMOTE sensing data provide synoptic and timely information

Observing Dark Worlds (Final Report)

A DATA FIELD METHOD FOR URBAN REMOTELY SENSED IMAGERY CLASSIFICATION CONSIDERING SPATIAL CORRELATION

Comparison of MLC and FCM Techniques with Satellite Imagery in A Part of Narmada River Basin of Madhya Pradesh, India

GEOG 4110/5100 Advanced Remote Sensing Lecture 12. Classification (Supervised and Unsupervised) Richards: 6.1, ,

Learning the Semantic Correlation: An Alternative Way to Gain from Unlabeled Text

Jeff Howbert Introduction to Machine Learning Winter

A COMPARISON BETWEEN DIFFERENT PIXEL-BASED CLASSIFICATION METHODS OVER URBAN AREA USING VERY HIGH RESOLUTION DATA INTRODUCTION

Dr.N.Chandrasekar Professor and Head Centre for GeoTechnology Manonmaniam Sundaranar University Tirunelveli

CLASSIFICATION OF MULTI-SPECTRAL DATA BY JOINT SUPERVISED-UNSUPERVISED LEARNING

Learning with kernels and SVM

Support Vector Regression (SVR) Descriptions of SVR in this discussion follow that in Refs. (2, 6, 7, 8, 9). The literature

Support vector machines Lecture 4

CS534 Machine Learning - Spring Final Exam

Machine learning comes from Bayesian decision theory in statistics. There we want to minimize the expected value of the loss function.

DETERMINING HIGH VOLTAGE CABLE CONDUCTOR TEMPERATURES. Guy Van der Veken. Euromold, Belgium. INVESTIGATIONS. INTRODUCTION.

Bagging and Other Ensemble Methods

Deep Spatio-Temporal Time Series Land Cover Classification

Holdout and Cross-Validation Methods Overfitting Avoidance

Classification of Hand-Written Digits Using Scattering Convolutional Network

Land Cover Classification via Multitemporal Spatial Data by Deep Recurrent Neural Networks [Ienco17]

ECE662: Pattern Recognition and Decision Making Processes: HW TWO

Using geographically weighted variables for image classification

UNIVERSITY of PENNSYLVANIA CIS 520: Machine Learning Final, Fall 2013

Best linear unbiased prediction when error vector is correlated with other random vectors in the model

Advanced analysis and modelling tools for spatial environmental data. Case study: indoor radon data in Switzerland

Discriminative Learning and Big Data

Reduced Order Greenhouse Gas Flaring Estimation

Multiscale and supervised image analysis in earth observation and remote sensing

ECS289: Scalable Machine Learning

Identifying Volcanoes From Elevation Profile

Polychotomous regression : application to landcover prediction

AUTOMATED DETECTION OF SMALL AND SHALLOW LANDSLIDES AFTER THE 2010 MADEIRA ISLAND FLASH-FLOODS IN VHR IMAGERY

Reading, UK 1 2 Abstract

BAYESIAN CLASSIFICATION OF HIGH DIMENSIONAL DATA WITH GAUSSIAN PROCESS USING DIFFERENT KERNELS

Midterm Review CS 7301: Advanced Machine Learning. Vibhav Gogate The University of Texas at Dallas

Machine Learning Methods for Radio Host Cross-Identification with Crowdsourced Labels

MULTI-SOURCE IMAGE CLASSIFICATION

Kernel-based Approximation. Methods using MATLAB. Gregory Fasshauer. Interdisciplinary Mathematical Sciences. Michael McCourt.

Comparison of Feature Selection Techniques for SVM Classification

Learning SVM Classifiers with Indefinite Kernels

10-701/ Machine Learning, Fall

DYNAMIC TEXTURE RECOGNITION USING ENHANCED LBP FEATURES

A Novel Rejection Measurement in Handwritten Numeral Recognition Based on Linear Discriminant Analysis

Stat 502X Exam 2 Spring 2014

FACTORIZATION MACHINES AS A TOOL FOR HEALTHCARE CASE STUDY ON TYPE 2 DIABETES DETECTION

Identifying optimal classification rules for geographic object-based image analysis

KNOWLEDGE-BASED CLASSIFICATION OF LAND COVER FOR THE QUALITY ASSESSEMENT OF GIS DATABASE. Israel -

A Support Vector Regression Model for Forecasting Rainfall

A new lack-of-fit test for quantile regression models using logistic regression

Reducing Multiclass to Binary: A Unifying Approach for Margin Classifiers

6.867 Machine Learning

Machine Learning to Automatically Detect Human Development from Satellite Imagery

PAC-learning, VC Dimension and Margin-based Bounds

One-class Label Propagation Using Local Cone Based Similarity

A Deep Interpretation of Classifier Chains

On the Classification of Classes with Nearly Equal Spectral Response in Remote Sensing Hyperspectral Image Data

School of Information Engineering, Guangdong University of Technology, Guangzhou, , China; 2

Learning with multiple models. Boosting.

Urban land use information retrieval based on scene classification of Google Street View images

MACHINE LEARNING IN REMOTE SENSING DATA PROCESSING. Gustavo Camps-Valls

EFFECT OF ANCILLARY DATA ON THE PERFORMANCE OF LAND COVER CLASSIFICATION USING A NEURAL NETWORK MODEL. Duong Dang KHOI.

CS4495/6495 Introduction to Computer Vision. 8C-L3 Support Vector Machines

Land cover/land use mapping and cha Mongolian plateau using remote sens. Title. Author(s) Bagan, Hasi; Yamagata, Yoshiki. Citation Japan.

Midterm. Introduction to Machine Learning. CS 189 Spring You have 1 hour 20 minutes for the exam.

Quantum machine learning for quantum anomaly detection CQT AND SUTD, SINGAPORE ARXIV:

Modeling Data with Linear Combinations of Basis Functions. Read Chapter 3 in the text by Bishop

TDT4173 Machine Learning

A note on top-k lists: average distance between two top-k lists

Support Vector Machine (continued)

A SURVEY OF REMOTE SENSING IMAGE CLASSIFICATION APPROACHES

Transcription:

Manuscrit auteur, publié dans "42èmes Journées de Statistique (2010)" SECOND ORDER STATISTICS FOR HYPERSPECTRAL DATA CLASSIFICATION Saoussen Bahria and Mohamed Limam LARODEC laboratory- High Institute of Management- University of Tunis- 1007 Tunis- Tunisia Abstract. Hyperspectral imagery classification, taking into account spectral and spatial features, is a promising task in remote sensing field. In this paper, the incorporation of second order statistics in hyperspectral data classification using support vector machines is proposed. The effect of using the semivariance geostatistic as a spatial feature rather than first order statistics (mean and standard deviation) is tested. The overall classification accuracy is evaluated for the AVIRIS Indian Pines-1992 benchmark data set. Empirical results show that the proposed approach gives better performance than the method based on first order statistics. Keywords: Classification, spatial statistic. Résumé. La classification des données hyperspectrales basée sur la combinaison des attributs spectraux et spatiaux, est une question prometteuse dans la télédétection spatiale. Ce papier propose l incorporation des statistiques de second ordre dans la classification des données hyperspectrales en utilisant la technique des séparateurs à vaste marge ou support vector machines. L effet de la considération de la semivariance comme attribut spatial au lieu des statistiques de premier ordre (moyenne et écart type) a été testé. La performance de classification a été évaluée pour l mage hyperspectrale benchmark «AVIRIS Indian Pines-1992». les résultats empiriques montrent que la méthode proposée fournit une classification meilleure par rapport à la méthode considérant les statistiques de premier ordre. Mots-clés. Apprentissage et classification, statistique spatiale. Bibliographie [1] CAMPS-VALLS, G., GOMEZ-CHOVA, L., MUNOZ-MARI, J., VILA- FRANCES, J., CALPE- MARAVILLA, J., 2006, Composite kernels for hyperspectral data classification. IEEE Geosciences and Remote Sensing Letters, 3, pp. 93-97. [2] GUALTIERI, J. A., and CHETTRI, S., 2000, Support vector machines for classification of hyperspectral data. Geoscience and Remote Sensing Symposium, 2000. Proceedings. IGARSS 2000. IEEE 2000 International. 2, pp. 813-815. 1

[3] GOOVAERTS, P. 2002, Geostatistical incorporation of spatial coordinates into supervised classification of hyperspectral data. Journal of Geographical Systems, 4, pp. 99-111. [4] YANG, F., XIATAO, L., GUANGZHOU, Z., CUIYU, S., XIAONING, S., 2006, Remote sensing image classification based on geostatistics and ANN, Proceedings of the SPIE, 6405, pp. 64052C. 1. Introduction In the remote sensing literature, many learning methods have been developed to tackle the problem of hyperspectral data classification. Most common approaches are based on spectral properties of the data. Gualtieri and Chettri (2000) discuss the application of support vector machines (SVM) for hyperspectral remote sensing data classification. They demonstrate the effectiveness of the SVM as a new machine learning tool suitable to handle high dimensional data sets, such as hyperspectral images. Kuo and Landgrebe (2001) discuss the problem of high estimation error in hyperspectral data classification due to limited training data. To avoid this problem, they propose a pair of covariance estimators called Mixed- Leave-One-Out Covariance. Liu et al. (2006) undertook a broad study of Purdue s Indian Pines test site. They compare classification performance of many supervised approaches including SVM. Recently, a number of studies focusing on the improvement of hyperspectral data classification by including both spectral and spatial features have been conducted. Goovaerts (2002) presents a methodology to incorporate spatial coordinates of pixels and spectral properties in maximum likelihood classification. Melgani and Serpico (2002) describe a statistical approach for the combination of spectral and spatiotemporal contextual information for remote sensing data classification. Camps-Valls et al. (2006) propose a family of composite kernels that combine spatial and spectral information based on first order statistics (mean and standard deviation) as spatial features. Yang et al. (2006) combine textural and spectral information using the semivariogram and a back propagation neural network for remote sensing data classification. Huang et al. (2008) compare the accuracy of semivariogram geostatistical analysis with the co-occurrence matrix for spatial features incorporation in the remote sensing data classification. The objective of this paper is to investigate hyperspectral data classification based on the incorporation of spatial features using SVM framework. In previous studies, only first order statistics are used as spatial features to improve traditional spectral classification. However, first order statistics, the mean and the standard deviation, provide simple local statistical information and ignore the spatial 2

arrangement structure of the data within the moving window. Therefore, in this paper the incorporation of second order statistics in hyperspectral data classification using support vector machines is proposed. The effect of using the semivariance geostatistic as a spatial feature rather than first order statistics is tested. The paper is organized as follows. Section 2 presents the data and the methodology used in this work. Empirical results are discussed in Section 3. Advantages and limitations of the proposed approach compared to previous classification methods are discussed in Section 4. 2. Data and Methodology 2.1. Dataset: The AVIRIS Indian Pines-1992 The AVIRIS Indian Pines hyperspectral benchmark dataset is available online at http://aviris.jpl.nasa.gov/html/aviris.freedata.html. Ground reference data is available atftp.ecn.purdue.edu/pub/~biehl/pc_multispec/thyfiles.zip. The scene used for the experiment was acquired by NASA on 12 June 1992. It consists of 145 by 145 pixels (20 m spatial resolution) and 220 spectral bands, with about two-thirds agriculture, and one-third forest or other perennial vegetation. A total of 16 cover classes are identified in the ground reference data set. The scene is termed Indian Pines 1 in the AVIRIS JPL repository. A total of 20 bands [104-108, 150-163, 220] were removed to exclude the water absorption region. The remaining 200 spectral channels were used for the experimental study. A total of 10148 and 2031 pixels were used for training and testing, respectively. The numbers of training and testing samples for the 16 classes are given in table 1. 3

Table 1. Numbers of training samples for the 16 classes in the AVIRIS Indian Pines data set Class Names Training samples Testing samples 1-Alfalfa 54 10 2-Corn-notill 1423 286 3-Corn-min 834 166 4-Corn 234 46 5-Grass/Pasture 497 99 6-Grass/Trees 747 149 7-Grass/Pasture-mowed 26 5 8-Hay-windrowed 489 97 9-Oats 20 4 10-Soybeans-notill 797 159 11-Soybeans-min 2468 493 12-Soybeans-clean 614 122 13-Wheat 212 42 14-Woods 1294 258 15-Bldg-Grass-Trees 380 76 16-Stone-steel-towers 95 19 Total samples 10184 2031 2.2. Methodology A three-stage hyperspectral data classification process is proposed. First, the semivariogram is used to measure the spatial correlation which is integrated as spatial features. Second, a stacked kernel matrix is constructed based on spectral and spatial features vectors. Third, the one-against-one multi-class SVM classification is carried out using the stacked kernel. According to Mather and Pal (2005), the one-against-one approach is the most efficient among the SVM classification methods for remote sensing data. It is called pair-wise classification since it creates an SVM classifier for each possible pair of classes. As a consequence, for an m-class problem, m (m-1)/2 binary classifiers were computed. Two comparative studies were conducted. First, SVM classifications carried with and without spatial features were compared. Second, the effect of using the semivariogram values as spatial features, rather than the first order statistics was tested. The proposed approach was compared to the method of Camps-Valls et al. (2006) based only on incorporating the first order statistics, the mean and the standard deviation as spatial features. In addition, experimental results were also compared with those reported in previous works dealing with the SVM classification of the same benchmark data set (Gualtieri and Chettri 2000, Liu et al.2006, Camps-Valls et al. 2006). For this purpose, the one-against-one multiclass SVM was carried out based on polynomial and Radial Basis Function (RBF) kernels. The 10-fold cross-validation method was used for the selection of the penalty parameter C, the optimal degree d for the polynomial kernel, and the Gaussian parameter σ for the RBF kernel. The cross-validation procedure provides the following optimal parameters C = 100, d = 2 for the polynomial 4

kernel, and C = 5000, σ = 40 for the RBF kernel. Then, the optimal selected parameters were introduced to the SVM classifiers. SVM classifications with and without spatial features were compared. Then, the effect of using second order statistics rather than first order statistics was tested. The polynomial kernel was considered for all comparisons in the second experiment since it outperforms the RBF kernel in the first study (results section, table 2). To measure the spatial variation in the hyperspectral image a variogram geostatistic is used. In a regionalized variable, the variogram is used to quantify spatial correlation. A regionalized variable is a random variable, where location in space or time is known. In this formulation, variables are indexed by their location. In a remote sensing image, the variogram is experimentally computed as the mean sum of squares of the differences between pairs of values separated by a distance h as follows γ (h) = 1/2n n i=1 [Y (x i + h) - Y (x i ) ] 2 (1) Where γ is the estimated semivariance, n is the number of pairs of observations separated by h, x i is a location, Y(x) is the data value at the location x, Y (x i +h) is the data value at the location x i +h. The estimation of the semivariogram will be less precise at larger lag distance, since in that case, there are fewer pairs of observations. The common recommendation in geostatistical literature is to use small lag distance, in particular 1 pixel in remote sensing imagery. Consequently, a 5*5 moving window was used to calculate texture semi-variances at a lag of 1 pixel and angle of 0 in the vertical direction. The semivariance values are then combined with spectral features in the stacked composite kernel. Errors matrices are determined for all classifiers. The overall classification accuracy and the concordance coefficient (Kappa) of all classifiers are derived from error matrices. Moreover, statistical significance of the differences between the different classifiers is assessed based on the Z statistics at the 95 percent confidence level. For the experimental implementation, the OSU-SVM toolbox (http://www.eleceng.ohio-state.edu /~maj/osu_svm) is used with MATLAB 6.5 numerical workstation on a Pentium Ш 2.1 GHz personal computer with 3 GB of memory. 3. Results The performance of SVM classification with and without spatial features is investigated. As shown in table 2, spatial features improve SVM hyperspectral data accuracy for the 5

two tested kernels. However, in our study the polynomial kernel performs better than the RBF kernel. The overall classification accuracies and kappa coefficients for the different classifiers are reported in table 3. Numerically compared, the proposed approach gives superior classification accuracy than previous method using first order statistics as spatial features, reported by Camps-Valls et al. (2006). As a consequence, the semivariogram geostatistical measure is more informative than the first order statistics and leads to better classification results with 96.55 % overall accuracy. Statistically, the proposed approach gives the highest kappa value 0.96 rather than 0.93 for the spatialspectral SVM classifier with first statistics as spatial features and 0.87 for the spectral- SVM classifier. Table 2. Overall accuracy (%) and kappa statistics (in parentheses) of the SVM classification carried out with and without spatial features on the AVIRIS Indian Pines data set Classification SVM-Polynomial kernel (C = 100, d = 2) SVM-RBF kernel (C = 5000, σ = 40) SVM without spatial features 87.54 (0.86) 86.10 (0.85) SVM with spatial features 96.55 (0.96) 95.80 (0.95) 6

Table 3. Overall classification accuracy (%) and Kappa coefficient for different classifiers Classifications Overall accuracy Kappa statistic Spectral SVM Gualtieri and Chettri (2000) 87.30 0.86 Camps - Valls et al. (2006) 88.55 0.87 Liu et al. (2006) Spatial-Spectral SVM Camps - Valls et al. (2006): mean and standard deviation as spatial features Proposed Work: semivariance geostatistic as spatial features 93.77 0.93 94.21 0.93 96.55 0.96 7

4. Conclusion and discussion In this work the incorporation of spatial arrangement structure of hyperspectral data in the stacked kernel based SVM classification is tested. The positive effect of using the semivariance values as spatial features rather than first order statistics is illustrated. The experimental study carried out with the AVIRIS Indian Pines-1992 benchmark data set shows that the semivariogram is more informative than first order statistics as spatial features and allows better classification accuracy. Further improvements of hyperspectral data classification could be investigated by studying the effect of introducing more than one spatial feature. Moreover, applying multiple window sizes for the extraction of texture features could improve the classification accuracy. 8

9