Information theoretical approach for domain ontology exploration in large Earth observation image archives

Similar documents
KNOWLEDGE-BASED CLASSIFICATION OF LAND COVER FOR THE QUALITY ASSESSEMENT OF GIS DATABASE. Israel -

SPATIAL DATA MINING. Ms. S. Malathi, Lecturer in Computer Applications, KGiSL - IIM

Machine Learning and Bayesian Inference. Unsupervised learning. Can we find regularity in data without the aid of labels?

Scalable robust hypothesis tests using graphical models

Modeling of Growing Networks with Directional Attachment and Communities

Active learning in sequence labeling

6.867 Machine learning, lecture 23 (Jaakkola)

Techniques for Science Teachers: Using GIS in Science Classrooms.

Star-Structured High-Order Heterogeneous Data Co-clustering based on Consistent Information Theory

Development of a GIS Based Information and Management System for Cultural Heritage Site; Case Study of Safranbolu

A Method to Improve the Accuracy of Remote Sensing Data Classification by Exploiting the Multi-Scale Properties in the Scene

EFFECT OF ANCILLARY DATA ON THE PERFORMANCE OF LAND COVER CLASSIFICATION USING A NEURAL NETWORK MODEL. Duong Dang KHOI.

CS570 Introduction to Data Mining

Parametric Models. Dr. Shuang LIANG. School of Software Engineering TongJi University Fall, 2012

Adaptive Multi-Modal Sensing of General Concealed Targets

Exploring Spatial Relationships for Knowledge Discovery in Spatial Data

Object-Oriented Oriented Method to Classify the Land Use and Land Cover in San Antonio using ecognition Object-Oriented Oriented Image Analysis

GIS (GEOGRAPHIC INFORMATION SYSTEMS)

Feature selection and extraction Spectral domain quality estimation Alternatives

PRINCIPLES OF PHOTO INTERPRETATION

Urban Tree Canopy Assessment Purcellville, Virginia

Integrating Correlated Bayesian Networks Using Maximum Entropy

Text Mining. Dr. Yanjun Li. Associate Professor. Department of Computer and Information Sciences Fordham University

ECE521 Lectures 9 Fully Connected Neural Networks

Exploring Large Digital Library Collections using a Map-based Visualisation

PATTERN RECOGNITION AND MACHINE LEARNING

Collaborative topic models: motivations cont

DEVELOPMENT OF DIGITAL CARTOGRAPHIC DATABASE FOR MANAGING THE ENVIRONMENT AND NATURAL RESOURCES IN THE REPUBLIC OF SERBIA

Discriminative Learning of Sum-Product Networks. Robert Gens Pedro Domingos

Urban land cover and land use extraction from Very High Resolution remote sensing imagery

Transactions on Information and Communications Technologies vol 18, 1998 WIT Press, ISSN

High resolution wetland mapping I.

EEL 851: Biometrics. An Overview of Statistical Pattern Recognition EEL 851 1

Nonnegative Matrix Factorization

Complex statistics of Synthetic Aperture Radar data using high resolution TerraSAR-X images Anca Popescu, Mihai Datcu, Madalin Frunzete

AUTOMATIC CATEGORIZATION OF CLUSTERS IN UNSUPERVISED CLASSIFICATION

Urban remote sensing: from local to global and back

Computer Vision Group Prof. Daniel Cremers. 14. Clustering

CS6220: DATA MINING TECHNIQUES

Modern Information Retrieval

arxiv: v2 [cs.cv] 15 Jul 2018

From statistics to data science. BAE 815 (Fall 2017) Dr. Zifei Liu

URBAN MAPPING AND CHANGE DETECTION

Mihai Boicu, Gheorghe Tecuci, Dorin Marcu Learning Agent Center, George Mason University

Unsupervised Learning. k-means Algorithm

Bayesian Networks Inference with Probabilistic Graphical Models

HCOC: hierarchical classifier with overlapping class groups

STRUCTURAL BIOINFORMATICS I. Fall 2015

A Unified Posterior Regularized Topic Model with Maximum Margin for Learning-to-Rank

13: Variational inference II

Principles of Pattern Recognition. C. A. Murthy Machine Intelligence Unit Indian Statistical Institute Kolkata

Making a case for full-polarimetric radar remote sensing

Distributed ML for DOSNs: giving power back to users

Sum-Product Networks: A New Deep Architecture

Reconstruction. Reading for this lecture: Lecture Notes.

Assessment of spatial analysis techniques for estimating impervious cover

Object-based land use/cover extraction from QuickBird image using Decision tree

Identifying Audit, Evidence Methodology and Audit Design Matrix (ADM)

Spatial Data Science. Soumya K Ghosh

RADAR BACKSCATTER AND COHERENCE INFORMATION SUPPORTING HIGH QUALITY URBAN MAPPING

7.1 INTRODUCTION 7.2 OBJECTIVE

Chapter 2. Semantic Image Representation

Classification for High Dimensional Problems Using Bayesian Neural Networks and Dirichlet Diffusion Trees

Land Accounts - The Canadian Experience

Collaborative Topic Modeling for Recommending Scientific Articles

Clustering using Mixture Models

Real-Time Computerized Annotation of Pictures

ACCURACY ASSESSMENT OF ASTER GLOBAL DEM OVER TURKEY

Mapping Earth. How are Earth s surface features measured and modeled?

Recent Advances in Bayesian Inference Techniques

Quality and Coverage of Data Sources

Investigation of the Effect of Transportation Network on Urban Growth by Using Satellite Images and Geographic Information Systems

RETRIEVAL MODELS. Dr. Gjergji Kasneci Introduction to Information Retrieval WS

An Automated Object-Oriented Satellite Image Classification Method Integrating the FAO Land Cover Classification System (LCCS).

THE DEVELOPMENT OF THE METHOD FOR UPDATING LAND SURFACE DATA BY USING MULTI-TEMPORALLY ARCHIVED SATELLITE IMAGES

USING GIS CARTOGRAPHIC MODELING TO ANALYSIS SPATIAL DISTRIBUTION OF LANDSLIDE SENSITIVE AREAS IN YANGMINGSHAN NATIONAL PARK, TAIWAN

13 : Variational Inference: Loopy Belief Propagation and Mean Field

Spatial Information Retrieval

Introduction: MLE, MAP, Bayesian reasoning (28/8/13)

Impacts of sensor noise on land cover classifications: sensitivity analysis using simulated noise

Variational algorithms for marginal MAP

Cluster Analysis (Sect. 9.6/Chap. 14 of Wilks) Notes by Hong Li

Machine Learning to Automatically Detect Human Development from Satellite Imagery

Recognition Performance from SAR Imagery Subject to System Resource Constraints

Lecture 3: Pattern Classification. Pattern classification

Diversity of Settlement Categories in Very High Resolution SAR Images

LAND USE MAPPING AND MONITORING IN THE NETHERLANDS (LGN5)

Integrating Ontological Prior Knowledge into Relational Learning

Normalised mutual information-based ranking of spatio-temporal localisation maps

15 Introduction to Data Mining

Module 2.1 Monitoring activity data for forests using remote sensing

PROBABILITY AND INFORMATION THEORY. Dr. Gjergji Kasneci Introduction to Information Retrieval WS

A Machine Learning Approach for Knowledge Base Construction Incorporating GIS Data for Land Cover Classification of Landsat ETM+ Image

Progress and Land-Use Characteristics of Urban Sprawl in Busan Metropolitan City using Remote sensing and GIS

CS6220: DATA MINING TECHNIQUES

Supervisor: Prof. Stefano Spaccapietra Dr. Fabio Porto Student: Yuanjian Wang Zufferey. EPFL - Computer Science - LBD 1

Generative and Discriminative Approaches to Graphical Models CMSC Topics in AI

A COMPARISON BETWEEN DIFFERENT PIXEL-BASED CLASSIFICATION METHODS OVER URBAN AREA USING VERY HIGH RESOLUTION DATA INTRODUCTION

Clustering with k-means and Gaussian mixture distributions

Deep Poisson Factorization Machines: a factor analysis model for mapping behaviors in journalist ecosystem

Transcription:

Information theoretical approach for domain ontology exploration in large Earth observation image archives Mihai Datcu, Mariana Ciucu DLR Oberpfaffenhofen IGARSS 2004

KIM - Knowledge driven Information Mining in remote sensing image archives Semantic Definition Objectives: a prototype system of a next generation architecture to help the users to gather relevant information rapidly, manage and add value to the huge amounts of historical and newly acquired satellite data-sets Image Information Mode Learn Mode Label Information Name Description Label_1 Pan Mode Reset Similar labels View Order Gallery Store A posteriori Map Models contribution Model1 Model2 Search Search parameters Set label Set a label Probability Separability Coverage Set collections Set Satellites and/or Sensors Collection_1 Satellite_1 Collection_2 Satellite_2 Collection_3 Collection_4 Sensor_1 Collection_5 Sensor_2 Date Area Upper bound(lat/lon) From: yyyy/mm/dd Lower bound(lat/lon) To: yyyy/mm/dd Textual annotation Stored Queries Set a stored query Details Query_1 Query_2 Apply Query_3 Result Data: ERS, Landsat, Ikonos, MERIS, E-SAR, DEDALUS Search Store Query Parameters Confirm Selection Evaluators and users: ESRIN, EUSC, NERSC, DLR Output: prototype system

The paradox We have to design systems to enable people to analyse TeraBytes of data! People have normally trouble in caching more than 7 items at a time

HIERARCHY OF INFORMATION REPRESENTATION un- supervised clustering un- supervised clustering

Clustering and coding Morse alphabet E I A _ O _

Semantic coding texture spectral

Image semantics

Advanced communication Image s I k Cluster s? 1j Model p(? I ) p(l? ) Semantic labels CITY VILAGE ROAD? k Model 2 RIVER FOREST

Data-Information-Knowledge -applicable to a number of fields Geology Hydrology Domain Ontology -knowledge acquisition and sharing within user communities Water Drainage Semantics -multi - type data handling Mountain River Glacier Sea Grassland Agriculture Labels -knowledge communication Spectral Texture Spectral SAR Features Images

Semantic grouping - method Supervised Bayesian classification: p ( L ω i µ i ) > p( L ω ), µ Inference from data to aggregated labels: p( Λ τ D) = p( Λ ) τ p( L Λτ ) p( L p( L ) D) p( L Λτ ) Interactive learning the probabilistic link : p (L Λ, ψ ) = ψ ψ = ψ, K, ψ } τ p( ψ ) = Γ( c) = ( c 1)! p( ψ T) = Dir ( ψ α) p(l Λ τ ) = α α { 1 c

Controlling the Dynamic Indexing Problem In the system there may be labels with different names and with the same information content (the same meaning) Table LABEL from DataBase Labels L1 L2 pω L 11 ) ( 1 pω L 12 ) ( 1 pω ( L 11 2) pω ( L 12 2) p( ij 1 ω L) p( ij 2 ω L) Signal classes ω 11 ω 12 ω ij

Controlling the Dynamic Indexing Solution : similarity measure, Kullback-Leibler divergence KL( L 1 L 2 ) = i j p ( ω L ) ij 1 ( ω L ) p ( ) ij 1 *log q ωij L2 p q ( ωij L1 ) = p1 ( ωi L1 )* p2( ω j L1 ) ( ω L ) = q ( ω L )* q ( ω L ) ij 2 1 i 2 2 j 2 Example The initial label for two classfiles 1 4 7 2 5 8 3 6 9 The labels from database for the same classfiles After Kullback-Leibler procedure the sorted labels list is : 2, 1, 6, 4, 5, 9,7, 8 3

Domain Ontology Search Knowledge Graph Hierarchical structure Union of trees Graph more abstract concept domain Sub-domain Aggregated label aggregation algorithm For the same label we can have many realisations, with the same features ( like L2 ) L1 L2 L3 L4 L5 feature feature feature feature feature classes classes classes classes classes image image image image image classes lavel features lavel classes lavel image lavel more concrete concept When a user is training a new label on an image, the system may try to identify the other labels closest to it ( KL between 0 and 1) and give the user the possibility to choose an old label or to define a new one

Domain Ontology Search Problem Solution User needs to learn from another user We can predict what the user wants similarity measure ->Kullback-Leibler divergence between two ontologies r 1 P(k 1 r 1 ) P(k 2 r 1 ) P(k n r 1 ) k 1 k 2 k n r2 KL' ( r n 1, r2 ) = KL( r1, r2 ) + p( ki r1 )* KL' ( ki, pi ) i= 1 Where, r1 root node of sub-trees for one of users r2 root node of sub-trees for other user k i child nodes of r 1 p i child nodes of r 2 P(p 1 r 2 ) P(p 2 r 2 ) P(p n r 2 ) p 1 p 2 p n

Domain Ontology Search Examples 11178 hd_cloud6 md_cloud the entropy from the root node = hd_cloud6 is 7232032711267443 the entropy from the root node = md_cloud is 875741191905023 KL (hd_cloud6,md_cloud) -----------------------------07136991789787241 11190 ac_acqua_veloce hd_water2 the entropy from the root node = ac_acqua_veloce is 8134289966218503 the entropy from the root node = hd_water2 is 61539319369981 KL (ac_acqua_veloce,hd_water2) -----------------------------06401859742548874

Conclusions The system helps users by cumulative learning incremental acquisition of knowledge building on knowledge acquired in earlier steps take advantage of what was learned before share knowledge IGARSS 2004