Spectral and Spatial Methods for the Classification of Urban Remote Sensing Data

Similar documents
Hyperspectral Image Processing: Methods and Approaches

This is an author-deposited version published in : Eprints ID : 17710

Classification of Pavia University Hyperspectral Image using Gabor and SVM Classifier

Mahalanobis kernel for the classification of hyperspectral images

Outline. Basic concepts: SVM and kernels SVM primal/dual problems. Chih-Jen Lin (National Taiwan Univ.) 1 / 22

Introduction to Support Vector Machines

Support Vector Machines (SVM) in bioinformatics. Day 1: Introduction to SVM

Support Vector Machines. Introduction to Data Mining, 2 nd Edition by Tan, Steinbach, Karpatne, Kumar

Jeff Howbert Introduction to Machine Learning Winter

Object Recognition Using Local Characterisation and Zernike Moments

Hyperspectral image classification using Support Vector Machine

Microarray Data Analysis: Discovery

Statistical learning theory, Support vector machines, and Bioinformatics

Statistical Learning Reading Assignments

Pattern Recognition and Machine Learning

Non-Bayesian Classifiers Part II: Linear Discriminants and Support Vector Machines

Data Mining. Linear & nonlinear classifiers. Hamid Beigy. Sharif University of Technology. Fall 1396

Support Vector Machines

Classification of High Spatial Resolution Remote Sensing Images Based on Decision Fusion

Chapter 9. Support Vector Machine. Yongdai Kim Seoul National University

A short introduction to supervised learning, with applications to cancer pathway analysis Dr. Christina Leslie

NONLINEAR CLASSIFICATION AND REGRESSION. J. Elder CSE 4404/5327 Introduction to Machine Learning and Pattern Recognition

UNIVERSITY of PENNSYLVANIA CIS 520: Machine Learning Final, Fall 2013

Pattern Recognition and Machine Learning. Perceptrons and Support Vector machines

PAC-learning, VC Dimension and Margin-based Bounds

Discriminative Direction for Kernel Classifiers

10/05/2016. Computational Methods for Data Analysis. Massimo Poesio SUPPORT VECTOR MACHINES. Support Vector Machines Linear classifiers

Contents Lecture 4. Lecture 4 Linear Discriminant Analysis. Summary of Lecture 3 (II/II) Summary of Lecture 3 (I/II)

Ch 4. Linear Models for Classification

Change Detection in Optical Aerial Images by a Multi-Layer Conditional Mixed Markov Model

Last updated: Oct 22, 2012 LINEAR CLASSIFIERS. J. Elder CSE 4404/5327 Introduction to Machine Learning and Pattern Recognition

Linear & nonlinear classifiers

MIRA, SVM, k-nn. Lirong Xia

OBJECT DETECTION AND RECOGNITION IN DIGITAL IMAGES

Support Vector Machines

Classification and Support Vector Machine

Machine learning comes from Bayesian decision theory in statistics. There we want to minimize the expected value of the loss function.

CIS 520: Machine Learning Oct 09, Kernel Methods

Linear & nonlinear classifiers

Parameter learning in CRF s

LINEAR CLASSIFIERS. J. Elder CSE 4404/5327 Introduction to Machine Learning and Pattern Recognition

CS4495/6495 Introduction to Computer Vision. 8C-L3 Support Vector Machines

Discriminative Learning and Big Data

COMS 4721: Machine Learning for Data Science Lecture 10, 2/21/2017

Neural networks and support vector machines

Kernel Methods and Support Vector Machines

LMS Algorithm Summary

Low Bias Bagged Support Vector Machines

Sparse Kernel Machines - SVM

Classifier Complexity and Support Vector Classifiers

Neural Networks. Prof. Dr. Rudolf Kruse. Computational Intelligence Group Faculty for Computer Science

Overview of Statistical Tools. Statistical Inference. Bayesian Framework. Modeling. Very simple case. Things are usually more complicated

Chap 2. Linear Classifiers (FTH, ) Yongdai Kim Seoul National University

Brief Introduction of Machine Learning Techniques for Content Analysis

Learning with kernels and SVM

Support Vector Machines

CS534 Machine Learning - Spring Final Exam

Lecture 4 Discriminant Analysis, k-nearest Neighbors

Linear Models for Classification

Indirect Rule Learning: Support Vector Machines. Donglin Zeng, Department of Biostatistics, University of North Carolina

UNIVERSITY of PENNSYLVANIA CIS 520: Machine Learning Final, Fall 2014

Machine Learning Lecture 7

Machine Learning, Midterm Exam

LINEAR MODELS FOR CLASSIFICATION. J. Elder CSE 6390/PSYC 6225 Computational Modeling of Visual Perception

Learning Methods for Linear Detectors

Infinite Ensemble Learning with Support Vector Machinery

Support Vector Machine (continued)

COMS 4771 Introduction to Machine Learning. Nakul Verma

CSE 151 Machine Learning. Instructor: Kamalika Chaudhuri

CS 6375 Machine Learning

An Introduction to Statistical and Probabilistic Linear Models

Brief Introduction to Machine Learning

Support Vector Machine

Support Vector Machine. Industrial AI Lab.

Learning with Noisy Labels. Kate Niehaus Reading group 11-Feb-2014

Support Vector Machines for Classification: A Statistical Portrait

Midterm Review CS 6375: Machine Learning. Vibhav Gogate The University of Texas at Dallas

Machine Learning Lecture 5

Machine learning for pervasive systems Classification in high-dimensional spaces

Support Vector Machines. CAP 5610: Machine Learning Instructor: Guo-Jun QI

Cheng Soon Ong & Christian Walder. Canberra February June 2018

The Application of Extreme Learning Machine based on Gaussian Kernel in Image Classification

Support Vector Machine (SVM) & Kernel CE-717: Machine Learning Sharif University of Technology. M. Soleymani Fall 2012

Support Vector Machine & Its Applications

Computer Vision Group Prof. Daniel Cremers. 2. Regression (cont.)

High Dimensional Kullback-Leibler divergence for grassland classification using satellite image time series with high spatial resolution

Reducing Multiclass to Binary: A Unifying Approach for Margin Classifiers

Pattern Recognition 2018 Support Vector Machines

Lecture 10: A brief introduction to Support Vector Machine

Midterm Review CS 7301: Advanced Machine Learning. Vibhav Gogate The University of Texas at Dallas

Remote sensing of sealed surfaces and its potential for monitoring and modeling of urban dynamics

Introduction to Support Vector Machines

Chemometrics: Classification of spectra

Support Vector Machines II. CAP 5610: Machine Learning Instructor: Guo-Jun QI

EXAM IN STATISTICAL MACHINE LEARNING STATISTISK MASKININLÄRNING

Support Vector Machine (SVM) and Kernel Methods

URBAN LAND COVER AND LAND USE CLASSIFICATION USING HIGH SPATIAL RESOLUTION IMAGES AND SPATIAL METRICS

Polyhedral Computation. Linear Classifiers & the SVM

A Method to Improve the Accuracy of Remote Sensing Data Classification by Exploiting the Multi-Scale Properties in the Scene

Transcription:

Spectral and Spatial Methods for the Classification of Urban Remote Sensing Data Mathieu Fauvel gipsa-lab/dis, Grenoble Institute of Technology - INPG - FRANCE Department of Electrical and Computer Engineering, University of Iceland - ICELAND Membres du Jury Monsieur Henri MAÎTRE Président Monsieur Grégoire MERCIER Rapporteur Monsieur Sebastiano B. SERPICO Rapporteur Monsieur Albert BIJAOUI Examinateur Monsieur Jordi INGLADA Examinateur Monsieur Johannes R. SVEINSSON Examinateur Monsieur Jocelyn CHANUSSOT Directeur de thèse Monsieur Jon A. BENEDIKTSSON Co-Directeur

Context Data characteristics Objectives of the work Urban remote sensing data: PLEIADES (0.75m/pix) IRS-1C (5m/pix) Spatial resolution: 0.75 to 2.5 meter by pixel Spectral resolution: 1 to more than 200 spectral bands Mathieu Fauvel Classification of Urban Remote Sensing Data 2/37

Context Data characteristics Objectives of the work Very High resolution urban remote sensing data: Panchromatic Multispectral Classification: a pattern recognition approach 1 Feature extraction: one vector of attributes extracted for every pixel 2 Pattern recognition algorithms: Maximum Likelihood, Neural Network... Mathieu Fauvel Classification of Urban Remote Sensing Data 3/37

Context Data characteristics Objectives of the work Original Data Ground-truth Thematic Map Experimental Data Set: University Area, Pavia, Italy. [610 340 103], 1.5 m/pixel, 9 classes. Mathieu Fauvel Classification of Urban Remote Sensing Data 4/37

Context Data characteristics Objectives of the work High resolution spectral information: Fine physical description Directly accessible Curse of dimensionality No contextual information Reflectance 8000 7000 6000 5000 4000 3000 2000 1000 0 0 50 100 150 200 Wavelength High resolution spatial information: Fine description of structure Not Directly accessible No spectral information Mathieu Fauvel Classification of Urban Remote Sensing Data 5/37

Introduction Context Data characteristics Objectives of the work Combine the different types of information for the classification Why? / Random permutation Spectral classification / Mathieu Fauvel o / Spectral classification Classification of Urban Remote Sensing Data 6/37

Introduction Context Data characteristics Objectives of the work Combine the different types of information for the classification Why? Spectral classification / Random permutation / / Same classification!! o Spectral classification Need to incorporate information from the spatial domain Mathieu Fauvel Classification of Urban Remote Sensing Data 6/37

Context Data characteristics Objectives of the work Combine the different types of information for the classification Prior studies: 2 steps approach 1 Feature extraction: Morphological processing 2 Classification: Neural Network, Fuzzy Logic Contribution: 1 Feature extraction: Extraction of Contextual information (self-complementary filter) Spectral feature (KPCA) 2 Classification: Support Vector Machines Transferability of the hyperplane 3 : at data level at decision level Mathieu Fauvel Classification of Urban Remote Sensing Data 7/37

Context Data characteristics Objectives of the work Combine the different types of information for the classification Prior studies: 2 steps approach 1 Feature extraction: Morphological processing 2 Classification: Neural Network, Fuzzy Logic Contribution: 1 Feature extraction: Extraction of Contextual information (self-complementary filter) Spectral feature (KPCA) 2 Classification: Support Vector Machines Transferability of the hyperplane 3 : at data level at decision level Mathieu Fauvel Classification of Urban Remote Sensing Data 7/37

Context Data characteristics Objectives of the work Outline: 1 2 Motivation Decision Fusion 3 Conclusions Perspectives Mathieu Fauvel Classification of Urban Remote Sensing Data 8/37

1 2 Motivation Decision Fusion 3 Conclusions Perspectives Mathieu Fauvel Classification of Urban Remote Sensing Data 9/37

S = {(x 1, y1),..., (x l, yl)} R n { 1, 1} yi = 1 H (w, b) : {x w, x + b = 0} 2 w w yi = 1 Optimal separating hyperplane [Vapnik-98]: Minimize training errors over S Maximize the margin minimize w 2 Mathieu Fauvel Classification of Urban Remote Sensing Data 10/37

S = {(x 1, y1),..., (x l, yl)} R n { 1, 1} yi = 1 H (w, b) : {x w, x + b = 0} 2 w w f (x) = w, x + b yi = 1 Optimal separating hyperplane [Vapnik-98]: Minimize training errors over S Maximize the margin minimize w 2 ( l ) Decision: g(x) = sgn (f (x)) = sgn α iy i x i, x + b i=1 Mathieu Fauvel Classification of Urban Remote Sensing Data 10/37

S = {(x 1, y1),..., (x l, yl)} R n { 1, 1} yi = 1 H (w, b) : {x w, x + b = 0} 2 w w f (x) = w, x + b yi = 1 Optimal separating hyperplane [Vapnik-98]: Minimize training errors over S Maximize the margin minimize w 2 ( l ) Decision: g(x) = sgn (f (x)) = sgn α iy i x i, x + b i=1 Mathieu Fauvel Classification of Urban Remote Sensing Data 10/37

1 0.8 0.6 0.4 0.2 0 0.2 0.4 0.6 0.8 1 1 0.8 0.6 0.4 0.2 0 0.2 0.4 0.6 0.8 1 0.8 0.6 0.4 0.2 0 0.2 0.4 0.6 0.8 0 0.2 0.4 0.6 0.8 1 0 0.2 0.4 0.6 0.8 1 1 0.8 0.6 0.4 0.2 0 0.2 0.4 0.6 0.8 1 1 0.8 0.6 0.4 0.2 0 0.2 0.4 0.6 0.8 1 Introduction Kernel methods: Use kernel function k (positive semi-definite) k(x i, x j ) = Φ(x i ), Φ(x j ) H Φ Φ(x i ), Φ(x j ) k(x i, x j ) Some kernels: Polynomial kernel: k(x i, x j ) = ( x i, x j + q ) p Gaussian kernel: k(x i, x j ) = exp ( xi x j 2 γ 2 ) Mathieu Fauvel Classification of Urban Remote Sensing Data 11/37

Multiclass problem: m classes 1 One versus All: m binary classifiers 2 One versus One: m(m-1)/2 classifiers Mathieu Fauvel Classification of Urban Remote Sensing Data 12/37

Multiclass problem: m classes 1 One versus All: m binary classifiers 2 One versus One: m(m-1)/2 classifiers Mathieu Fauvel Classification of Urban Remote Sensing Data 12/37

Classification in the spectral space: spatial & spectral Gaussian ML (78%) Neural Network (67%) SVM (80%) Classes: asphalt, meadow, gravel, tree, metal sheet, bare soil, bitumen, brick and shadow. Mathieu Fauvel Classification of Urban Remote Sensing Data 13/37

Spatial information: Statistical Information Inter-pixels dependency Texture Gray level distribution Geometrical Information Shape Area Orientation Histogram Mean 107 136 Variance 35 48 Shape Rectangular Rectangular Size 8049 9741 Mathieu Fauvel Classification of Urban Remote Sensing Data 14/37

Spatial information: Statistical Information Inter-pixels dependency Texture Gray level distribution Geometrical Information Shape Area Orientation Histogram Mean 107 136 Variance 35 48 Shape Rectangular Rectangular Size 8049 9741 Mathieu Fauvel Classification of Urban Remote Sensing Data 14/37

Spatial information: Statistical Information Inter-pixels dependency Texture Gray level distribution Geometrical Information Shape Area Orientation Histogram Mean 107 136 Variance 35 48 Shape Rectangular Rectangular Size 8049 9741 Mathieu Fauvel Classification of Urban Remote Sensing Data 14/37

Markov Random Field [Lafarge-05]: fixed neighborhood (cliques) Contextual features [Camps-Valls-06; Bruzzone-06]: fixed neighborhood (p p square) Texture features [Mercier-06] : two 1-D wavelets (on x and y) Morphological processing [Benediktsson-03]: structures Mathieu Fauvel Classification of Urban Remote Sensing Data 15/37

Not well suited to urban area data: discontinuity Morphological Neighborhood: adapt the neighborhood to the structures Previous works: Granulometry with geodesic filters (Morphological profile and its derivative). Acts only on extrema structures Proposed works: Self-complementary filters. Acts on structures whatever their gray-level Mathieu Fauvel Classification of Urban Remote Sensing Data 16/37

Self-complementary area filter [Soille-05]: Remove all the structures that are smaller than a given area threshold 1 Labelling all the flat zones that satisfy the area criterion λ, 2 Growing the labelled flat zones until an image partition is reached. Mathieu Fauvel Classification of Urban Remote Sensing Data 17/37

Self-complementary area filter [Soille-05]: Remove all the structures that are smaller than a given area threshold 1 Labelling all the flat zones that satisfy the area criterion λ, 2 Growing the labelled flat zones until an image partition is reached. Mathieu Fauvel Classification of Urban Remote Sensing Data 17/37

Self-complementary area filter [Soille-05]: Remove all the structures that are smaller than a given area threshold 1 Labelling all the flat zones that satisfy the area criterion λ, 2 Growing the labelled flat zones until an image partition is reached. Mathieu Fauvel Classification of Urban Remote Sensing Data 17/37

Self-complementary area filter [Soille-05]: Remove all the structures that are smaller than a given area threshold 1 Labelling all the flat zones that satisfy the area criterion λ, 2 Growing the labelled flat zones until an image partition is reached. Extracting the inter-pixel dependency [Fauvel-07a]: Υ x = med(ω x ) Mathieu Fauvel Classification of Urban Remote Sensing Data 17/37

How to use conjointly the spatial and the spectral information? Kernel approach: mixture of kernels SVM classifier Combination of kernels: Spatio-spectral kernel (SS kernel) µk spect + (1 µ)k spat k spect acts on the spectral information k spat acts on the spatial information µ control the amount of each type of information Mathieu Fauvel Classification of Urban Remote Sensing Data 18/37

Extension to hyperspectral data? Ordering relation for pixels is needed: does not exit. Marginal ordering by band filtering Total Pre-ordering h : R n R 1 Our approach: Dimension reduction with PCA. 1 Projection on the first PC 2 Area filtering and generalization of the neighborhood mask for each band 3 Spatial feature extraction 4 Classification with SS SVM h : R n R 1 x x = x, v 1 p R n Mathieu Fauvel Classification of Urban Remote Sensing Data 19/37

Experiments: EMP: Morphological approach (EMP): 3 PCs and 4 opening/closing. MRF: Markov Random Field (Potts model): SS SVM: Spatio-Spectral SVM Parameters setting: Cross-validation Multiclass strategy: One versus all. Mathieu Fauvel Classification of Urban Remote Sensing Data 20/37

Introduction Classification using spectral and spatial information: EMP MRF spectral only SS SVM Classes: asphalt, meadow, gravel, tree, metal sheet, bare soil, bitumen, brick and shadow. Mathieu Fauvel Classification of Urban Remote Sensing Data 21/37

Classification accuracies: % SVM EMP MRF SS SVM Overall Accuracy 80 80 83 86 Average Accuracy 88 85 89 92 Kappa coefficient 75 74 79 82 Asphalt 80 93 92 84 Meadow 68 73 70 78 Gravel 74 52 62 84 Bare soil 95 62 98 95 Mathieu Fauvel Classification of Urban Remote Sensing Data 22/37

Classification accuracies: % SVM EMP MRF SS SVM Overall Accuracy 80 80 83 86 Average Accuracy 88 85 89 92 Kappa coefficient 75 74 79 82 Asphalt 80 93 92 84 Meadow 68 73 70 78 Gravel 74 52 62 84 Bare soil 95 62 98 95 Mathieu Fauvel Classification of Urban Remote Sensing Data 22/37

Spatial information is helpful Benefits Adaptive neighborhood definition Kernel formulation Performance for peri-urban area Drawbacks Parameters λ (area threshold) and µ (weight in SS kernel) Statistical spatial information only Extension to multi- or hyperspectral data [Soille-07] Mathieu Fauvel Classification of Urban Remote Sensing Data 23/37

Motivation Decision Fusion 1 2 Motivation Decision Fusion 3 Conclusions Perspectives Mathieu Fauvel Classification of Urban Remote Sensing Data 24/37

Motivation Decision Fusion Different classifiers: Statistic Theory: Gaussian Maximum Likelihood, Fisher Discriminant Analysis... Machine learning Theory: Neural Network, Support Vector Machines... Fuzzy Set Theory: Fuzzy NN, Fuzzy model... Different sources: Panchromatic and multispectral data Multi-valued data Multi-temporal data Extracted data: texture or geometrical features Mathieu Fauvel Classification of Urban Remote Sensing Data 25/37

Motivation Decision Fusion Different classifiers: Statistic Theory: Gaussian Maximum Likelihood, Fisher Discriminant Analysis... Machine learning Theory: Neural Network, Support Vector Machines... Fuzzy Set Theory: Fuzzy NN, Fuzzy model... Different sources: Panchromatic and multispectral data Multi-valued data Multi-temporal data Extracted data: texture or geometrical features Complementary information Mathieu Fauvel Classification of Urban Remote Sensing Data 25/37

Motivation Decision Fusion Decision fusion scheme (for classification): S 1 S 2 FUSION PROCESS DECISION S m Problems: Sources of different natures Sources of different reliabilities Conflicting situations Mathieu Fauvel Classification of Urban Remote Sensing Data 26/37

Motivation Decision Fusion Requirement: At least one accurate classifier Variable reliability of classifier: for each pixel for each class General framework [Fauvel-06]: Modeling the classifier s output with fuzzy set Reliability is estimated by fuzziness Mathieu Fauvel Classification of Urban Remote Sensing Data 27/37

Motivation Decision Fusion Requirement: At least one accurate classifier Variable reliability of classifier: for each pixel for each class General framework [Fauvel-06]: Modeling the classifier s output with fuzzy set Reliability is estimated by fuzziness Application to SVM classifiers: needs specific derivations Mathieu Fauvel Classification of Urban Remote Sensing Data 27/37

Motivation Decision Fusion SVM fusion scheme: Original data 2 SVM m(m 1) binary classifiers 2 PCA EMP SVM fusion fusion df 12 (x) m(m 1) binary classifiers 2 d (m 1)m f (x) Majority voting decision Mathieu Fauvel Classification of Urban Remote Sensing Data 28/37

Motivation Decision Fusion SVM fusion scheme: Original data 2 SVM m(m 1) binary classifiers 2 PCA EMP SVM fusion fusion df 12 (x) m(m 1) binary classifiers 2 d (m 1)m f (x) Majority voting decision Fusion rule [Fauvel-07b]: The greater the distance to the hyperplane, the more reliable the source d ij f (x) = maxabs ( d ij 1 (x), d ij 2 (x) ) d 12 1 (x) 0 d2 12 (x) Mathieu Fauvel Classification of Urban Remote Sensing Data 28/37

Motivation Decision Fusion Same classifier with different inputs: A classifier based on the spectral information: SVM A classifier based on the spatial information: EMP + SVM Experiments: spectral features: 103 spatial features: 33 parameters fit with CV comparison with simple majority vote rule Mathieu Fauvel Classification of Urban Remote Sensing Data 29/37

Motivation Decision Fusion Classification results: SVM EMP Desision fusion Majority Voting Overall Accuracy 81 85 90 86 Average Accuracy 88 91 94 88 Kappa coefficient 76 81 87 82 Asphalt 84 95 93 94 Meadow 70 80 84 85 Gravel 70 88 82 65 Bare soil 92 64 91 62 Mathieu Fauvel Classification of Urban Remote Sensing Data 30/37

Introduction Motivation Decision Fusion Classification using decision fusion: SVM EMP Mathieu Fauvel Decision fusion Classification of Urban Remote Sensing Data 31/37

Motivation Decision Fusion Classification accuracies: SVM EMP SS SVM Decision Fusion Overall Accuracy 80 85 86 90 Average Accuracy 88 90 92 94 Kappa coefficient 75 80 83 87 Processing complexity: SVM EMP SS SVM Decision Fusion Pre-processing + Training + + Classification + + + Mathieu Fauvel Classification of Urban Remote Sensing Data 32/37

Introduction Motivation Decision Fusion Visual inspection: Original SVM EMP SS SVM Decision Fusion MRF Classes: asphalt, meadow, gravel, tree, metal sheet, bare soil, bitumen, brick and shadow. Mathieu Fauvel Classification of Urban Remote Sensing Data 33/37

Conclusions Perspectives The END! 1 2 Motivation Decision Fusion 3 Conclusions Perspectives Mathieu Fauvel Classification of Urban Remote Sensing Data 34/37

Conclusions Perspectives The END! Classification of remote sensing data of urban area 1 Include spatial information: Self-complementary filter 2 Support Vector Machines: kernel approach to include spatial information 3 Decision fusion: adaptive approach based on reliability estimation Mathieu Fauvel Classification of Urban Remote Sensing Data 35/37

Conclusions Perspectives The END! Spatial information: Extract other spatial information: geometric... Classification: Kernel for spectral data Data fusion Estimation of global accuracy: bound of performances Mathieu Fauvel Classification of Urban Remote Sensing Data 36/37

Conclusions Perspectives The END! Spectral and Spatial Methods for the Classification of Urban Remote Sensing Data Mathieu Fauvel gipsa-lab/dis, Grenoble Institute of Technology - INPG - FRANCE Department of Electrical and Computer Engineering, University of Iceland - ICELAND Mathieu Fauvel Classification of Urban Remote Sensing Data 37/37