

Pramod K. Varshney
EECS Department, Syracuse University
varshney@syr.edu
This research was sponsored by ARO grant W911NF-09-1-0244.


Overview of Distributed Inference
The U_i's may be:
1. Local decisions
2. Features
3. Raw signal

- Introduction
- Heterogeneity and Dependence
- Copula theory
- Signal Detection Using Copulas
- Copula-based Parameter Estimation (Localization)
- Classification using copulas
- Applications in finance are not considered!
- Conclusion

Different characterizations of dependence exist, e.g.,
- Correlation coefficient
  - Linear measure of dependence
- Information theoretic, e.g., mutual information
  - Computational difficulties
Initial work on distributed inference assumed independence for tractability
- Distributed detection with dependent observations is an NP-complete problem [Tsitsiklis & Athans, 1985]
Decision fusion strategies to incorporate correlation among sensor decisions
- [Drakopoulos & Lee, 1991]: assumes correlation coefficients are known
- [Kam et al., 1992]: Bahadur-Lazarsfeld expansion of PDFs
- Both approaches assume prior knowledge of joint statistics

- Inference with dependent observations: difficult problem
- Proposed solutions: largely problem specific

Non-parametric, learning-based
- HMMs & other graphical models
  - M. J. Beal et al., A Graphical Model for Audio-Visual Object Tracking, Trans. PAMI, vol. 25, no. 7, pp. 828-836, July 2003.
  - M. R. Siracusa and J. Fisher III, Dynamic dependency tests: analysis and applications to multi-modal data association, in Proc. AI Stats, 2007.
- Manifold learning
  - S. Lafon, Y. Keller, R. R. Coifman, Data fusion and multicue data matching by diffusion maps, IEEE Trans. PAMI, vol. 28, no. 11, pp. 1784-1797, Nov. 2006.
- General information theoretic framework for multimodal signal processing
  - T. Butz and J. Thiran, From error probability to information theoretic (multi-modal) signal processing, Elsevier: Signal Processing, vol. 85, May 2005.


- For example, Z_1 and Z_2 may represent acoustic and video signals/features, respectively
- Definition is general
  - Includes independent and identically distributed (iid) marginals


- Dependence between X and Y is evident from the scatter plot
- The correlation coefficient is unable to capture this: r = 0
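The point about zero correlation can be reproduced with a small sketch. The data here are an assumption made for illustration (Y = X^2 on a symmetric grid), not the scatter plot from the slide; `pearson_r` is a hypothetical helper name.

```python
from statistics import mean


def pearson_r(xs, ys):
    """Sample Pearson correlation coefficient of two equal-length sequences."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    var_x = sum((x - mx) ** 2 for x in xs)
    var_y = sum((y - my) ** 2 for y in ys)
    return cov / (var_x * var_y) ** 0.5


# Hypothetical data: X on a symmetric grid, Y a deterministic function of X.
xs = [i / 100 for i in range(-100, 101)]
ys = [x * x for x in xs]

# Y is fully determined by X, yet the linear correlation vanishes
# (up to floating-point rounding) because the relationship is symmetric.
r = pearson_r(xs, ys)
print(f"r = {r:.6f}")
```

Any dependence that is symmetric about the mean of X behaves the same way, which is why a linear measure alone can miss it.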


- Relative entropy: distance from product distribution
- Multi-information is the multivariate extension of mutual information
- Normalized measure
H. Joe, Relative entropy measures of multivariate dependence, Journal of the American Statistical Association, vol. 84, no. 405, pp. 157-164, 1989.
M. Studeny and J. Vejnarova, The multiinformation function as a tool for measuring stochastic dependence, in Learning in Graphical Models (M. I. Jordan, ed.), Kluwer, Dordrecht, 1998, pp. 261-298.
Pramod K. Varshney, Sensor Fusion Lab

A. D. Wyner, The common information of two dependent random variables, IEEE Trans. Inf. Theory, vol. 21, no. 2, pp. 163-179, March 1975.

- Motivation
- Concepts

The copula-based approach attempts to address these issues.

- Copulas are functions that couple marginals to form a joint distribution
- Sklar's Theorem is the key result (an existence theorem)

Differentiate the joint CDF to get the joint PDF
- Product density of the N marginals (e.g., from N sensors): independence
- Copula density: its arguments are uniform random variables!
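A minimal sketch of this factorization, assuming the standard density form of Sklar's theorem, f(z_1, ..., z_N) = [f_1(z_1) ... f_N(z_N)] c(F_1(z_1), ..., F_N(z_N)), for N = 2. The choice of a Gaussian copula, a standard normal marginal and an exponential marginal is an illustrative assumption, and the function names are hypothetical.

```python
import math
from statistics import NormalDist

STD_NORMAL = NormalDist()  # standard normal: mean 0, standard deviation 1


def gaussian_copula_density(u, v, rho):
    """Bivariate Gaussian copula density c(u, v; rho), for 0 < u, v < 1."""
    a = STD_NORMAL.inv_cdf(u)
    b = STD_NORMAL.inv_cdf(v)
    quad = (2.0 * rho * a * b - rho * rho * (a * a + b * b)) / (
        2.0 * (1.0 - rho * rho)
    )
    return math.exp(quad) / math.sqrt(1.0 - rho * rho)


def joint_pdf(z1, z2, rho, rate=1.0):
    """Joint density = product of marginal densities x copula density.

    Assumed marginals: z1 ~ standard normal, z2 ~ exponential(rate).
    Arguments far in the tails can push a CDF value to exactly 0 or 1,
    where inv_cdf raises an error; this sketch does not guard against that.
    """
    if z2 <= 0.0:
        return 0.0
    f1, big_f1 = STD_NORMAL.pdf(z1), STD_NORMAL.cdf(z1)
    f2 = rate * math.exp(-rate * z2)
    big_f2 = 1.0 - math.exp(-rate * z2)
    # The copula only ever sees the uniform variables F1(z1) and F2(z2).
    return f1 * f2 * gaussian_copula_density(big_f1, big_f2, rho)


# rho = 0 gives copula density 1, so the joint PDF is the product density.
print(joint_pdf(0.2, 1.5, rho=0.0), joint_pdf(0.2, 1.5, rho=0.5))
```

Swapping in a different marginal only changes the `f`/`F` pair; the copula term is untouched, which is the sense in which the marginals and the dependence structure are modeled separately.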

- Several copulas have been proposed
  - R. Nelsen, An Introduction to Copulas, Springer, 1999
  - Archimedean copulas & elliptical copulas
- Widely used in econometrics
  - David Li pioneered the use of the Gaussian copula
  - Blamed for the meltdown on Wall Street
  - Highlights dangers of applying theory without understanding the implications
- A pictorial example
  - Copulas can characterize skewed dependencies
  - Copulas can express dependency between marginals that do not share the same support (e.g., Normal and Gamma)

[Figure: two bivariate densities compared. Left: bivariate normal, r = 0.5. Right: normal marginals with a Gumbel copula (parameter = 2).]

[Figure: two bivariate densities compared. Left: bivariate normal, r = 0.5. Right: normal and gamma marginals with a Gumbel copula (parameter = 2).]

Copulas are typically defined as a CDF
- Elliptical copulas: derived from multivariate distributions
  - Gaussian copula
  - t-copula
- Archimedean copulas
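As a sketch of the Archimedean family, a bivariate Archimedean copula CDF can be built from a generator phi as C(u, v) = phi^{-1}(phi(u) + phi(v)). The Gumbel and Clayton generators below are the textbook forms; the function names are hypothetical, and the Clayton parameter is restricted to theta > 0 for simplicity.

```python
import math


def archimedean_cdf(u, v, generator, inverse_generator):
    """Bivariate Archimedean copula CDF for 0 < u, v <= 1."""
    return inverse_generator(generator(u) + generator(v))


def gumbel_generators(theta):
    """Generator pair for the Gumbel copula (theta >= 1)."""
    if theta < 1.0:
        raise ValueError("Gumbel copula requires theta >= 1")

    def phi(t):
        return (-math.log(t)) ** theta

    def phi_inv(s):
        return math.exp(-(s ** (1.0 / theta)))

    return phi, phi_inv


def clayton_generators(theta):
    """Generator pair for the Clayton copula (restricted here to theta > 0)."""
    if theta <= 0.0:
        raise ValueError("this sketch only handles theta > 0")

    def phi(t):
        return (t ** -theta - 1.0) / theta

    def phi_inv(s):
        return (1.0 + theta * s) ** (-1.0 / theta)

    return phi, phi_inv


# Gumbel with theta = 1 reduces to the independence copula C(u, v) = u * v.
print(archimedean_cdf(0.3, 0.6, *gumbel_generators(1.0)))
print(archimedean_cdf(0.5, 0.5, *gumbel_generators(2.0)))
print(archimedean_cdf(0.5, 0.5, *clayton_generators(1.0)))
```

A single scalar parameter controls the strength of dependence in each case, which keeps the ML estimation step for these families one-dimensional.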


Limitations
A. Subramanian, A. Sundaresan and P. K. Varshney, Fusion for the detection of dependent signals using multivariate copulas, in Proc. 14th International Conf. on Information Fusion, to be published.


Criteria based on Minimum Description Length principles
- Akaike Information Criterion (AIC)
- Bayesian Information Criterion (BIC)
- Stochastic Information Criterion (SIC)
- Normalized Maximum Likelihood (NML)
Online approach
Terms in the selection criterion: model index, joint likelihood, penalty term proportional to model complexity, ML estimate of copula parameter
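A minimal sketch of penalized-likelihood copula selection, using the standard definitions AIC = 2k - 2 ln L and BIC = k ln n - 2 ln L. The candidate names, log-likelihood values and sample size are made up for illustration; in practice each log-likelihood would be evaluated at the ML estimate of that copula's parameter.

```python
import math


def aic(log_likelihood, num_params):
    """Akaike Information Criterion (smaller is better)."""
    return 2.0 * num_params - 2.0 * log_likelihood


def bic(log_likelihood, num_params, num_samples):
    """Bayesian Information Criterion (smaller is better)."""
    return num_params * math.log(num_samples) - 2.0 * log_likelihood


def select_copula(candidates, num_samples, criterion="aic"):
    """Return (best model name, all scores) for a dict of fitted candidates.

    candidates maps a model name to (maximized log-likelihood, parameter count).
    """
    scores = {}
    for name, (log_lik, k) in candidates.items():
        if criterion == "aic":
            scores[name] = aic(log_lik, k)
        else:
            scores[name] = bic(log_lik, k, num_samples)
    return min(scores, key=scores.get), scores


# Hypothetical fitted values, not results from the talk.
fitted = {
    "gaussian": (-120.4, 1),
    "gumbel": (-118.9, 1),
    "student_t": (-118.5, 2),
}
best_aic, aic_scores = select_copula(fitted, num_samples=200, criterion="aic")
best_bic, bic_scores = select_copula(fitted, num_samples=200, criterion="bic")
print(best_aic, best_bic)
```

With these made-up numbers the two-parameter model has the highest likelihood but loses once its extra parameter is penalized.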

Area Under (receiver operating) Curve
- Application specific approach
  - Best possible detector from the available library of models
- ROC is best for assessing detector performance
  - AUC is easier to evaluate
- Offline approach: training/testing paradigm
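One common way to evaluate AUC without tracing the ROC curve is the rank (Mann-Whitney) form: the fraction of (H1, H0) score pairs that the detector orders correctly, counting ties as one half. The sketch below assumes that estimator; the scores are hypothetical test statistics, not data from the talk.

```python
def auc(scores_h1, scores_h0):
    """Empirical AUC: P(score under H1 > score under H0), ties count 1/2."""
    wins = 0.0
    for s1 in scores_h1:
        for s0 in scores_h0:
            if s1 > s0:
                wins += 1.0
            elif s1 == s0:
                wins += 0.5
    return wins / (len(scores_h1) * len(scores_h0))


def select_by_auc(detector_scores):
    """Pick the model whose held-out test statistics give the largest AUC.

    detector_scores maps a model name to (scores under H1, scores under H0).
    """
    aucs = {name: auc(h1, h0) for name, (h1, h0) in detector_scores.items()}
    return max(aucs, key=aucs.get), aucs


# Hypothetical held-out scores for two candidate copula-based detectors.
held_out = {
    "gaussian": ([0.9, 0.8, 0.4], [0.5, 0.3, 0.2]),
    "gumbel": ([0.9, 0.8, 0.6], [0.5, 0.3, 0.2]),
}
best_model, auc_values = select_by_auc(held_out)
print(best_model, auc_values)
```

Because the scores must come from data not used for fitting, this selection runs offline on a train/test split, in contrast to the likelihood-based criteria.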

- Signal Detection
- Localization / Estimation
- Classification

- Binary hypothesis testing problem
- General formulation
  - All distribution parameters are unknown
  - Estimated using MLE

- GLR under independence
- Dependence term
Copula-based test statistic decouples marginal and dependency information
Information theoretic analysis of copula mismatch and AUC-based results *
* S. Iyengar, P. K. Varshney, and T. Damarla, A parametric copula based framework for hypotheses testing using heterogeneous data, IEEE Trans. Signal Process., vol. 59, no. 5, pp. 2308-2319, May 2011.
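The decoupling can be written out from the Sklar density factorization. The following is a sketch under stated assumptions, not necessarily the notation of the cited paper: M sensors, N temporally independent samples z_{mn}, marginal densities f_m with CDFs F_m, copula densities c_i under hypothesis H_i, and ML estimates \hat{\theta}_{m,i} and \hat{\phi}_i of the marginal and copula parameters.

```latex
\log \Lambda(\mathbf{z})
  = \underbrace{\sum_{n=1}^{N} \sum_{m=1}^{M}
      \log \frac{f_m\!\left(z_{mn}; \hat{\theta}_{m,1}\right)}
                {f_m\!\left(z_{mn}; \hat{\theta}_{m,0}\right)}}_{\text{GLR under independence}}
  + \underbrace{\sum_{n=1}^{N}
      \log \frac{c_1\!\left(F_1(z_{1n}; \hat{\theta}_{1,1}), \ldots, F_M(z_{Mn}; \hat{\theta}_{M,1}); \hat{\phi}_1\right)}
                {c_0\!\left(F_1(z_{1n}; \hat{\theta}_{1,0}), \ldots, F_M(z_{Mn}; \hat{\theta}_{M,0}); \hat{\phi}_0\right)}}_{\text{dependence term}}
```

When both copula densities equal one, the second sum vanishes and the statistic reduces to the independence GLR.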


- Signals are preprocessed using the short-time Fourier Transform (STFT)
- Canonical Correlation Analysis (CCA) on STFT coefficients
  - Inter-modal correlation is emphasized
  - Dimensionality reduction: argmax_{a,b} Corr(u = a^T X, v = b^T Y)
- Marginal distributions fitted using generalized Gaussian
  - Marginal parameters under H_0 assumed known
- Dependence under H_0 modeled using Gaussian copula


- Binary hypothesis testing problem
  - Sensors make local decisions
  - Local decisions are fused at a fusion center
- No prior knowledge of joint distribution of sensor observations
- Design problem
  - Find individual sensor thresholds
  - Design optimal fusion rule
- Neyman-Pearson (N-P) framework
- Temporal independence assumed

2 sensor case
Over N time instants from sensors 1 and 2 respectively,
- Sensor observations
- Local sensor decisions
- Sensor threshold

- Observations may not be conditionally independent
- Copula term

Chair-Varshney Fusion Statistic
- Conditional independence term
- Cross-product term: accounts for correlated observations
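A minimal sketch of the conditional-independence term only, in the usual Chair-Varshney log-likelihood-ratio form for binary local decisions. The copula-based cross-product term is not reproduced here because its expression does not survive in the transcript. The detection and false-alarm probabilities and the threshold are hypothetical values.

```python
import math


def chair_varshney_statistic(decisions, p_detect, p_false_alarm):
    """Log-likelihood ratio of binary decisions, assuming conditional independence.

    decisions[i] is 1 if sensor i declared H1, else 0; p_detect[i] and
    p_false_alarm[i] are that sensor's operating point (both strictly in (0, 1)).
    """
    total = 0.0
    for u, pd, pf in zip(decisions, p_detect, p_false_alarm):
        if u == 1:
            total += math.log(pd / pf)
        else:
            total += math.log((1.0 - pd) / (1.0 - pf))
    return total


def fuse(decisions, p_detect, p_false_alarm, threshold=0.0):
    """Global decision: 1 (H1) if the fusion statistic reaches the threshold."""
    stat = chair_varshney_statistic(decisions, p_detect, p_false_alarm)
    return 1 if stat >= threshold else 0


# Two hypothetical sensors with identical operating points.
pd, pf = [0.9, 0.9], [0.1, 0.1]
print(chair_varshney_statistic([1, 1], pd, pf))  # both sensors vote H1
print(chair_varshney_statistic([1, 0], pd, pf))  # votes cancel for equal sensors
```

Under a Neyman-Pearson design the threshold would be chosen to meet a false-alarm constraint rather than fixed at zero as in this sketch.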

* A. Sundaresan, P. K. Varshney, and N. S. V. Rao, Copula-based fusion of correlated decisions, IEEE Trans. Aerosp. Electron. Syst., vol. 47, no. 1, pp. 454-471, 2011.

- Background radiation
- Signal from radioactive material


Given the intensity of a radiating source A_0 and its location (x_0, y_0),
Find,
- Copula parameter is estimated as a nuisance parameter
A. Sundaresan and P. K. Varshney, Location estimation of a random signal source based on correlated sensor observations, IEEE Trans. Signal Process., vol. 59, no. 2, pp. 787-799, 2011.


Information Fusion
- Genuine or Imposter
- Accept or Reject
Data from NIST
- Two face-matchers with different performance and statistical properties
- Data partitioning: randomized test/train partitions
Fusion of algorithms
Iyengar et al., IEEE Trans. Signal Process., vol. 59, no. 5, pp. 2308-2319, 2011.
Also see S. G. Iyengar, P. K. Varshney and T. Damarla, Biometric Authentication: A Copula Based Approach, in Multibiometrics for Human Identification, B. Bhanu and V. Govindaraju, Eds., Cambridge Univ. Press.

Copula selected using AUC based methodology
[Figure: Receiver Operating Characteristic (ROC)]

Neural synchrony: co-movement of neural activity
- Why do we care?
  - Suggestive of neurophysiological disorders such as Alzheimer's disease and epileptic seizures
  - Useful for studying brain connectivity and neural coding
- How do we quantify synchrony?
- Limitations of existing measures
  - Existing measures such as Granger causality measure only the linear relationship
  - Information theoretic measures such as mutual information are constrained to be bivariate
- Copula based multi-information developed to alleviate both limitations
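As a sketch of how a copula yields a multivariate dependence measure: multi-information equals the expected log copula density, E[log c(U_1, ..., U_N)], and for a Gaussian copula with correlation matrix R this has the closed form -(1/2) ln det R (in nats). The Gaussian copula is an assumption chosen for its closed form and is not necessarily the estimator used in the cited EEG work; the helper names are hypothetical.

```python
import math


def determinant(matrix):
    """Determinant of a square matrix via Gaussian elimination with pivoting."""
    a = [row[:] for row in matrix]  # work on a copy
    n = len(a)
    det = 1.0
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(a[r][col]))
        if a[pivot][col] == 0.0:
            return 0.0
        if pivot != col:
            a[col], a[pivot] = a[pivot], a[col]
            det = -det  # a row swap flips the sign
        det *= a[col][col]
        for r in range(col + 1, n):
            factor = a[r][col] / a[col][col]
            for c in range(col, n):
                a[r][c] -= factor * a[col][c]
    return det


def gaussian_copula_multi_information(corr):
    """Multi-information (nats) implied by a Gaussian copula.

    corr must be a valid (positive definite) correlation matrix.
    """
    return -0.5 * math.log(determinant(corr))


# Hypothetical correlation matrix for three channels.
corr = [
    [1.0, 0.5, 0.2],
    [0.5, 1.0, 0.3],
    [0.2, 0.3, 1.0],
]
print(gaussian_copula_multi_information(corr))
```

The measure is zero for an identity correlation matrix and grows as the channels become more strongly coupled, and it applies to any number of channels rather than only to pairs.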

S. G. Iyengar, J. Dauwels, P. K. Varshney and A. Cichocki, EEG Synchrony quantification using Copulas, Proc. IEEE ICASSP, 2010.

- Neural synchrony can be used as a feature for classification
  - Drop in neural synchrony indicates possibility of Alzheimer's disease (AD)
  - Increased neural synchrony: epilepsy
- 25 patients with Mild Cognitive Impairment (MCI) vs. 38 age-matched control subjects
  - All 25 patients developed mild AD later
- Inclusion of copula-based feature improves classification performance

- Summary
- References

- Copula based inference has diverse applicability
  - Fusion of multimodal sensors and homogeneous sensors
  - Multi-algorithm fusion
    - Approach discussed for multibiometrics falls under this category
  - Multi-classifier fusion
    - Fusing different classifiers
- A theory for signal inference from dependent observations
  - Inclusive theory: independence is a limiting case
  - Signal detection
  - Signal classification
  - Parameter estimation
- Copula-based approach shows significant improvement over previously proposed techniques on real datasets

1. S. Iyengar, P. K. Varshney, and T. Damarla, A parametric copula based framework for hypotheses testing using heterogeneous data, IEEE Trans. Signal Process., vol. 59, no. 5, pp. 2308-2319, 2011.
2. A. Sundaresan, P. K. Varshney, and N. S. V. Rao, Copula-based fusion of correlated decisions, IEEE Trans. Aerosp. Electron. Syst., vol. 47, no. 1, pp. 454-471, 2011.
3. A. Sundaresan and P. K. Varshney, Location estimation of a random signal source based on correlated sensor observations, IEEE Trans. Signal Process., vol. 59, no. 2, pp. 787-799, 2011.
4. S. G. Iyengar, J. Dauwels, P. K. Varshney and A. Cichocki, EEG Synchrony quantification using Copulas, Proc. IEEE ICASSP, 2010.
5. S. G. Iyengar, P. K. Varshney and T. Damarla, On the detection of footsteps using heterogeneous data, in Proc. ACSSC 2007, 4-7 Nov. 2007, pp. 2248-2252.
6. A. Sundaresan, A. Subramanian, P. K. Varshney and T. Damarla, A copula-based semi-parametric approach for footstep detection using seismic sensor networks, in Proc. SPIE, vol. 7710, 2010.
7. A. Subramanian, A. Sundaresan and P. K. Varshney, Fusion for the detection of dependent signals using multivariate copulas, in Proc. 14th International Conf. on Information Fusion, to be published.