Kernel-Based Principal Component Analysis (KPCA) and Its Applications. Nonlinear PCA

Kernel-Based Principal Component Analysis (KPCA) and Its Applications
4//009. Based on slides originally from Dr. John Tan.

Nonlinear PCA
Natural phenomena are usually nonlinear, whereas standard PCA is intrinsically a linear technique.
Nonlinear PCA methods:
- Principal curves
- Nonlinear PCA by neural network
- Kernel PCA

Principal Curves (PCS)
Trevor Hastie and Werner Stuetzle, "Principal Curves," Journal of the American Statistical Association, Vol. 84 (Jun. 1989).

NLPCA-NN
William W. Hsieh and Benyang Tang, 1998: "Applying Neural Network Models to Prediction and Data Analysis in Meteorology and Oceanography," Bulletin of the American Meteorological Society, Vol. 79, No. 9.

Limitations of PCS and NNPCA
- Both are practical only for small input dimensions
- NNPCA used PCA to reduce the dimensions of spatio-temporal data

Kernel PCA
B. Schölkopf, A. Smola, and K.-R. Müller, "Nonlinear component analysis as a kernel eigenvalue problem," Neural Computation, 10.
[Figure: the map Φ from the input space X to the feature space F]
- Nonlinear map from the input space to a richer feature space
- PCA in the feature space
- Preimage: map back to the input space

Major Steps
Original spatio-temporal data $\psi(x, t)$: M points (dimensions) and N observations, that is, N M-dimensional vectors.
Nonlinear mapping: each M-dimensional vector $x$ is transformed into an $M_F$-dimensional vector $\Phi(x)$.

Feature-space data matrix: $D_\Phi$, whose columns are the mapped observations $\Phi(x_1), \ldots, \Phi(x_N)$.
Matrix for the eigenvalue problem: the N × N kernel matrix $K$.

Mapping Complexity
Dot products give the elements of the covariance matrix. For $x = (x_1, x_2)$ and the polynomial kernel with $d = 2$:
$K(x, y) = (x \cdot y)^2 = (x_1 y_1 + x_2 y_2)^2$, with $\Phi(x) = (x_1^2, x_2^2, \sqrt{2}\, x_1 x_2)$.
More general case: $K(x, y) = (x \cdot y + c)^d$, where $d$ is a positive integer and $c$ is a constant. For $d = 2$ and $c = 1$, $\Phi(x) = (1, \sqrt{2}\, x_1, \sqrt{2}\, x_2, x_1^2, x_2^2, \sqrt{2}\, x_1 x_2)$.
The number of dimensions may be high: the complexity for the polynomial kernel is ~$M^d$, where $M$ is the dimensionality of the input space. For example, the polynomial kernel with $d = 5$ and a 16 × 16 pixel image ($M = 256$) would yield a dimensionality of about $10^{10}$.

Kernel Trick
The elements of the matrix $K$ are evaluated by a kernel function. Because the kernel is defined on vectors in the input space, we can compute the elements of $D_\Phi^T D_\Phi$ (with the mapped observations as the columns of $D_\Phi$) but not those of $D_\Phi D_\Phi^T$. Therefore the size of $K$ is always N × N, where N is the total number of observations.
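As a quick numerical check of the kernel trick, the minimal sketch below compares the explicit degree-2 feature map with the kernel evaluated directly in the input space. It also counts the distinct degree-5 monomials for a 256-dimensional input, which is where the figure of roughly $10^{10}$ comes from. The function names and sample vectors are illustrative assumptions and do not come from the slides.

```python
import math
import numpy as np

def poly2_features(x):
    """Explicit feature map for K(x, y) = (x . y)^2 with a 2-D input."""
    x1, x2 = x
    return np.array([x1**2, x2**2, math.sqrt(2) * x1 * x2])

def poly_kernel(x, y, d=2, c=0.0):
    """Polynomial kernel (x . y + c)^d, evaluated in the input space."""
    return (np.dot(x, y) + c) ** d

# Hypothetical sample vectors, chosen only for illustration.
x = np.array([1.0, 2.0])
y = np.array([3.0, -1.0])

explicit = np.dot(poly2_features(x), poly2_features(y))  # dot product in feature space
implicit = poly_kernel(x, y, d=2)                        # same value, no mapping needed

# Number of distinct degree-d monomials in M variables: C(M + d - 1, d).
M, d = 256, 5
n_monomials = math.comb(M + d - 1, d)
print(explicit, implicit, n_monomials)
```

The kernel evaluation costs one M-dimensional dot product, whereas the explicit map for M = 256 and d = 5 would need billions of coordinates per observation.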

What is a valid kernel?
Mercer's Theorem: any symmetric, positive (semi)definite function is a kernel.
- $K$ corresponds to the dot product of two mapped data points in the feature space: $K(x, y) = (\Phi(x) \cdot \Phi(y)) = f(x, y)$
- The matrix generated from $K$ is symmetric and positive (semi)definite
- Linear kernel: $K(x, y) = x \cdot y$
- Gaussian RBF kernel: $K(x, y) = e^{-\|x - y\|^2 / (2\sigma^2)}$

Kernels
New kernels can be constructed from existing kernels, because the set of kernels is closed under some operations. Assuming that $K_1$ and $K_2$ are valid kernels:
- $K_1 + K_2$ is a kernel
- $C K_1$ is a kernel for $C > 0$
- $A K_1 + B K_2$ is a kernel for $A, B > 0$
There are more properties that can be exploited; see Schölkopf and Smola (2002). Complex kernels can be generated from known ones, and these can in turn be used to generate additional kernels.
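The closure properties can be checked numerically on a sample. The sketch below builds linear and Gaussian Gram matrices for random data and confirms that a positive combination is still symmetric and positive semidefinite. The helper names, the random data, and the Gaussian form $e^{-\|x-y\|^2/(2\sigma^2)}$ are assumptions made for illustration; a numerical check on one sample is not a proof.

```python
import numpy as np

def linear_gram(X):
    """Gram matrix of the linear kernel K(x, y) = x . y."""
    return X @ X.T

def gaussian_gram(X, sigma=1.0):
    """Gram matrix of the Gaussian kernel exp(-||x - y||^2 / (2 sigma^2))."""
    sq = np.sum(X**2, axis=1)
    d2 = sq[:, None] + sq[None, :] - 2.0 * (X @ X.T)
    return np.exp(-np.maximum(d2, 0.0) / (2.0 * sigma**2))

def is_psd(K, tol=1e-8):
    """True if the symmetrized matrix has no eigenvalue below -tol."""
    K = 0.5 * (K + K.T)
    return bool(np.linalg.eigvalsh(K).min() >= -tol)

rng = np.random.default_rng(0)
X = rng.normal(size=(30, 4))      # 30 hypothetical observations, 4 dimensions

K1 = linear_gram(X)
K2 = gaussian_gram(X, sigma=1.5)
combined = 2.0 * K1 + 0.5 * K2    # A*K1 + B*K2 with A, B > 0

print(is_psd(K1), is_psd(K2), is_psd(combined))
```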

Physical Components
Eigenvalue problem: solved for the N × N matrix $K$, with eigenvectors $\alpha_i$ ($i = 1, \ldots, N$) representing the time series (temporal components).
Spatial pattern: the corresponding eigenvector in the feature space (spatial component).

New Issues in Earth Science Applications
- Preimage: transformation back to the input space
- Mapping may be expensive because the feature space can be high-dimensional; mapping to this space is performed using a Mercer kernel
- Higher dimensionalities in feature space: more comparable eigenvalues, and variance is not conserved
- A new algorithm for pattern selection
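The eigenvalue problem can be sketched in a few lines of NumPy. The slide's own equation is not available here, so this follows common kernel PCA practice: the kernel matrix is centered in feature space and the eigenvectors are scaled so that the implicit feature-space directions have unit norm. Both choices, and all names, are assumptions for this sketch.

```python
import numpy as np

def kernel_pca(K, n_components):
    """Solve the N x N kernel eigenvalue problem.

    Returns eigenvalues, scaled eigenvectors (columns are the alpha vectors),
    and the projections of the observations onto the components.
    """
    N = K.shape[0]
    ones = np.full((N, N), 1.0 / N)
    # Center the mapped data in feature space without computing Phi explicitly.
    Kc = K - ones @ K - K @ ones + ones @ K @ ones
    eigvals, eigvecs = np.linalg.eigh(Kc)              # ascending order
    order = np.argsort(eigvals)[::-1][:n_components]   # largest first
    eigvals, alphas = eigvals[order], eigvecs[:, order]
    # Scale so the implicit feature-space eigenvectors have unit norm.
    alphas = alphas / np.sqrt(eigvals)
    projections = Kc @ alphas
    return eigvals, alphas, projections

rng = np.random.default_rng(1)
X = rng.normal(size=(50, 3))     # 50 hypothetical observations, 3 dimensions
K = X @ X.T                      # linear kernel, so the result should match PCA
eigvals, alphas, projections = kernel_pca(K, n_components=2)
```

With the linear kernel the projections agree with ordinary PCA scores up to sign, which is a convenient sanity check before switching to a nonlinear kernel.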

Preimage Problem
The mapping goes from X to F. The preimage problem is illustrated by the point Ψ in F, which has no counterpart in X. Preimages are required for earth science applications because the spatial patterns are interpreted in the input space.
[Figure: the image Φ(X) inside F, a point Ψ outside Φ(X), and distance measures ρ between Ψ and points of Φ(X)]

Preimages
The mapping from input space to feature space is performed implicitly through the use of a kernel. This makes the calculation of the inverse transform, the preimage, nontrivial. In many cases an exact preimage does not exist; however, a good approximate preimage can be calculated [SMB99].

Examples
Data: Lorenz attractor
PCA: linear kernel $K(x, y) = (x \cdot y)$
KPCA: Gaussian kernel $K(x, y) = e^{-\|x - y\|^2 / (2\sigma^2)}$
Iteration for the approximate preimage $z$:
$z_{n+1} = \dfrac{\sum_{i=1}^{P} \alpha_i \, e^{-\|x_i - z_n\|^2 / (2\sigma^2)} \, x_i}{\sum_{i=1}^{P} \alpha_i \, e^{-\|x_i - z_n\|^2 / (2\sigma^2)}}$

KPCA for Spatial-Temporal Data
New methodology for KPCA: determine (spatial) patterns correlated with a (temporal) signal.
Recall that α represents the temporal principal components and is related to the implicitly defined spatial components $v$ by:
$v^p = \dfrac{1}{\sqrt{M \lambda_p}} \sum_{\nu} \alpha^p_\nu \, \Phi(x_\nu)$
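A fixed-point iteration of this kind can be sketched as follows for a Gaussian kernel. This is a minimal illustration under stated assumptions: the kernel is taken as $e^{-\|x-z\|^2/(2\sigma^2)}$, the expansion coefficients are passed in directly, and the function name, stopping rule, and toy data are hypothetical rather than the algorithm used for the results in the slides.

```python
import numpy as np

def gaussian_preimage(X, coeffs, z0, sigma, n_iter=100, tol=1e-10):
    """Approximate preimage by fixed-point iteration for a Gaussian kernel.

    X      : (P, M) array of input-space points x_i
    coeffs : (P,) expansion coefficients of the feature-space point
    z0     : (M,) starting guess in the input space
    """
    z = np.asarray(z0, dtype=float)
    for _ in range(n_iter):
        # Kernel-weighted coefficients for the current estimate z_n.
        w = coeffs * np.exp(-np.sum((X - z) ** 2, axis=1) / (2.0 * sigma**2))
        denom = w.sum()
        if abs(denom) < 1e-12:
            break                      # the update is undefined; keep the last estimate
        z_new = (w[:, None] * X).sum(axis=0) / denom
        if np.linalg.norm(z_new - z) < tol:
            z = z_new
            break
        z = z_new
    return z

# Toy check: if the feature-space point is exactly Phi(x_1), the preimage is x_1.
X = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 1.0]])
coeffs = np.array([0.0, 1.0, 0.0])
z_hat = gaussian_preimage(X, coeffs, z0=np.array([0.3, 0.3]), sigma=1.0)
```

The iteration can stall when the denominator approaches zero, so in practice the starting guess matters; trying several starting points is a common precaution.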

Finding the Temporal Signal
The temporal principal components α can be combined to determine a direction that correlates better with the signal of interest. Instead of searching for the underlying source of a principal component direction, the source is specified and a preferred direction is determined from the principal components.
- This allows us to specify the corresponding spatial components v.

Procedures in PCA & KPCA
Standard PCA analysis:
- Examine the principal components that contain the highest variance for inherent low-dimensional structure
- Correlate temporal components (loadings) with the signal of interest
- Determine the component with the highest correlation
- Examine the associated spatial principal component for new patterns
KPCA analysis for spatial-temporal data:
- Use the α vectors as temporal components of the mapped data
- Determine temporal components and correlate them with the signal of interest
- Determine the set of α vectors that have the highest correlation with the signal of interest
- Calculate the preimage of the associated spatial components and examine it for new patterns
Linear combination of α vectors:
- Sort vectors from highest to lowest correlation score
- Combine vectors in descending order, keeping vectors that increase the score
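The sort-and-combine step can be sketched as a greedy loop. The slides do not state how the vectors are weighted, so this sketch assumes an equal-weight sum with each vector sign-flipped to correlate positively with the signal. The function names and the toy components are hypothetical.

```python
import numpy as np

def corr(a, b):
    """Pearson correlation between two 1-D series."""
    return float(np.corrcoef(a, b)[0, 1])

def greedy_combine(alphas, signal):
    """Greedily combine temporal components to maximize correlation with a signal.

    alphas : (N, P) array whose columns are candidate temporal components
    signal : (N,) signal of interest
    """
    scores = np.array([corr(alphas[:, j], signal) for j in range(alphas.shape[1])])
    order = np.argsort(-np.abs(scores))          # highest |correlation| first
    combined = np.zeros(alphas.shape[0])
    best, kept = 0.0, []
    for j in order:
        # Assumed weighting: add the vector with the sign of its own correlation.
        candidate = combined + np.sign(scores[j]) * alphas[:, j]
        score = corr(candidate, signal)
        if score > best:                         # keep only vectors that raise the score
            combined, best = candidate, score
            kept.append(int(j))
    return combined, best, kept

# Toy data: the signal is the sum of two of the three components.
t = np.linspace(0.0, 2.0 * np.pi, 200, endpoint=False)
components = np.column_stack([np.sin(t), np.sin(2 * t), np.cos(5 * t)])
signal = components[:, 0] + components[:, 1]
combined, best, kept = greedy_combine(components, signal)
```

In this toy case each of the first two components correlates at about 0.71 with the signal on its own, while their combination reaches a correlation of essentially 1; the third component is rejected because adding it lowers the score.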

Selection Algorithm
[Figure: selection algorithm]

Signal Detection
pixel_value = 80*noise + 20*signal at specific regions
pixel_value = 100*noise otherwise
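A synthetic test set of this form can be generated as below. The noise distribution (standard normal), the sinusoidal signal, the grid size, and the choice of region are all assumptions for illustration, since the slide specifies only the 80/20 and 100 weightings.

```python
import numpy as np

def make_synthetic(n_time, n_pixels, region, signal, rng):
    """Build an (n_time, n_pixels) data set with a signal hidden in one region.

    region : indices of the pixels that carry the signal
    signal : (n_time,) signal of interest
    """
    noise = rng.normal(size=(n_time, n_pixels))
    data = 100.0 * noise                                   # pixel_value = 100*noise
    # Inside the region: pixel_value = 80*noise + 20*signal
    data[:, region] = 80.0 * noise[:, region] + 20.0 * signal[:, None]
    return data

rng = np.random.default_rng(0)
n_time, n_pixels = 240, 200
signal = np.sin(2.0 * np.pi * np.arange(n_time) / 48.0)    # hypothetical signal
region = np.arange(50)                                     # hypothetical region
data = make_synthetic(n_time, n_pixels, region, signal, rng)

# The regional mean should track the signal; the rest of the field should not.
corr_inside = np.corrcoef(data[:, region].mean(axis=1), signal)[0, 1]
corr_outside = np.corrcoef(data[:, 50:].mean(axis=1), signal)[0, 1]
```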

Data Sets
Normalized Difference Vegetation Index (NDVI)
- NDVI = (near IR band - red band) / (near IR band + red band)
- NASA GES Distributed Active Archive Center
- 1° × 1° Level 3 data set covering the globe
- January 1982 to December 2001
Sea Surface Temperature (SST)
- International Comprehensive Ocean-Atmosphere Data Set (ICOADS)
- Latitude 18°S to 18°N and longitude 10°E to 75°W, at 2° × 2° resolution
- January 1951 to December 2004
Southern Oscillation Index (SOI)
- NOAA National Weather Service, Climate Prediction Center
- January 1951 to December 2004

Masking
[Figures: NDVI mask (red = valid, blue = invalid); SST masks for 1951 and for all available data (red = invalid, blue = valid)]
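The NDVI definition translates directly into array code. In this sketch, pixels where both bands are zero are returned as NaN so they can be masked as invalid; that convention and the function name are assumptions, not something the slides specify.

```python
import numpy as np

def ndvi(nir, red):
    """NDVI = (near IR band - red band) / (near IR band + red band)."""
    nir = np.asarray(nir, dtype=float)
    red = np.asarray(red, dtype=float)
    total = nir + red
    out = np.full(total.shape, np.nan)       # NaN marks pixels with no valid ratio
    np.divide(nir - red, total, out=out, where=total != 0)
    return out

# Hypothetical reflectance values for three pixels.
values = ndvi([0.5, 0.2, 0.0], [0.1, 0.2, 0.0])
valid_mask = ~np.isnan(values)
```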

Sea Surface Temperature Anomaly I
Positive anomalies appear in several periods, which correspond to El Niño years. The two periods with the highest positive anomaly are also strong El Niño years.
[Figure: raw and deseasoned SST; axes: pixels, month]

Global NDVI Anomaly I
Like the SSTA, the deseasoned NDVI data show anomalies around the El Niño years. However, the anomaly patterns are less distinct: SST has a direct link with El Niño, whereas vegetation has only an indirect connection.
[Figure: raw and deseasoned NDVI, plotted by month]
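The slides do not say how the data were deseasoned. One common approach, assumed here purely for illustration, is to subtract from each monthly value the long-term mean of that calendar month at the same pixel. The function name and toy series are hypothetical.

```python
import numpy as np

def deseason(data, period=12):
    """Remove the mean seasonal cycle from a (time, pixels) monthly array."""
    data = np.asarray(data, dtype=float)
    anomalies = np.empty_like(data)
    for m in range(period):
        # Subtract the per-pixel mean of calendar month m from all of its occurrences.
        anomalies[m::period] = data[m::period] - data[m::period].mean(axis=0)
    return anomalies

# Toy series: four years, three pixels, a pure seasonal cycle plus pixel offsets.
months = np.arange(48)
seasonal = 10.0 * np.sin(2.0 * np.pi * months / 12.0)
series = np.tile(seasonal[:, None], (1, 3)) + np.array([5.0, 0.0, -5.0])
series[30, 1] += 4.0                          # one anomalous month at pixel 1
anomalies = deseason(series)
```

Only the injected departure survives in the anomalies; the seasonal cycle and the constant pixel offsets are removed.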

A Specific Example
Data sets:
- Normalized Difference Vegetation Index (NDVI): NDVI = (near IR band - red band) / (near IR band + red band); NASA GES Distributed Active Archive Center; 1° × 1° Level 3 data set covering the globe; January 1982 to December 2001
- Southern Oscillation Index (SOI): NOAA National Weather Service, Climate Prediction Center; January 1951 to December 2004
Gaussian kernel: $K(x, y) = e^{-\|x - y\|^2 / (2\sigma^2)}$, with σ = 6% of SD (empirical optimal value)

Results (PCA, One Component)
Spatial pattern from PC 4 of standard PCA. (r = ; percent of variance explained by PC 4 = 3.8%)

Results (Combined KPCA)
Combined spatial pattern from KPCA (r = 0.68 with SOI).

Results (Combined PCA)
Combined spatial pattern from standard PCA (r = 0.56 with SOI).

Comparisons
The drought patterns from El Niño, taken from the National Drought Mitigation Center, show that the KPCA pattern matches the drought patterns to a higher degree than the standard PCA pattern.
[Figure panels: PCA, KPCA]

Conclusions
- First known application of KPCA to Earth science data
- KPCA yields higher correlations than PCA (r = 0.68 versus r = 0.56 with SOI)
- The results show new spatial patterns and have a more refined regional structure
- The complexity is of the same order as standard PCA in operations and memory requirements
- The results depend on the choice of kernel
- Eigenvalue problem of size N (number of observations)

Future Work
- Improve the efficiency of preimage algorithms
- Test different kernels for earth science applications
- Study the atmospheric circulation regimes

PCA Applications: Atmospheric Regimes
Corti, S., F. Molteni, and T. Palmer, 1999: "Signature of recent climate change in frequencies of natural atmospheric circulation regimes." Nature, 398.

KPCA vs. PCS/NNPCA/PCA
Criterion | KPCA | NNPCA/PCS | PCA
Flexibility | Good: factor analysis, dimension reduction, nonlinear analysis | Primarily dimension reduction, nonlinear analysis | OK: linear characterization only
Large training set (M) | Bad, but greedy methods are available | OK | OK if the input dimension is small
Large input dimension (d) | OK | Bad | OK
Algorithm | Symmetric eigenvalue problem | Nonlinear variational optimization problem | Symmetric eigenvalue problem
Variance | Kernel dependent | Well understood | Well understood
Inverse transform | Preimage | Given | Matrix multiplication

SVM (Support Vector Machines)
SVM is a widely used tool for classification problems in data mining. It is a statistical learning system for predictive data mining, including the estimation of regression functions.

SVM Classification
SVM attempts to find an optimal separating hyperplane between the members of two classes.
[Figure: Class A and Class B divided by a separating hyperplane]

SVM Kernel Construction
The data attributes can be transformed to a higher-dimensional space (feature space) by applying a kernel function. This transformation can make it possible to find a separating hyperplane.
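A small example shows why moving to a higher-dimensional space can help. The toy data below cannot be split by any single threshold on the original feature, but after adding the squared feature a hyperplane separates the classes. The data, the explicit map $x \mapsto (x, x^2)$, and the hand-picked weights are assumptions for illustration; an SVM would learn the hyperplane and would use a kernel rather than an explicit map.

```python
import numpy as np

def separable_by_threshold(values, labels):
    """True if a single threshold on a 1-D feature separates the two classes."""
    sorted_labels = labels[np.argsort(values)]
    # A threshold works only if the sorted labels switch class at most once.
    return bool(np.count_nonzero(np.diff(sorted_labels)) <= 1)

# Class +1 sits in the middle of the line, class -1 on both sides.
x = np.array([-3.0, -2.0, -1.0, 0.0, 1.0, 2.0, 3.0])
labels = np.where(np.abs(x) < 1.5, 1, -1)

in_input_space = separable_by_threshold(x, labels)

# Map each point to (x, x^2); the hyperplane -x^2 + 2.25 = 0 now splits the classes.
features = np.column_stack([x, x**2])
w = np.array([0.0, -1.0])
b = 2.25
predictions = np.where(features @ w + b > 0, 1, -1)
```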


More information

Statistical Machine Learning from Data

Statistical Machine Learning from Data Samy Bengio Statistical Machine Learning from Data 1 Statistical Machine Learning from Data Support Vector Machines Samy Bengio IDIAP Research Institute, Martigny, Switzerland, and Ecole Polytechnique

More information

Learning Eigenfunctions: Links with Spectral Clustering and Kernel PCA

Learning Eigenfunctions: Links with Spectral Clustering and Kernel PCA Learning Eigenfunctions: Links with Spectral Clustering and Kernel PCA Yoshua Bengio Pascal Vincent Jean-François Paiement University of Montreal April 2, Snowbird Learning 2003 Learning Modal Structures

More information

statistical methods for tailoring seasonal climate forecasts Andrew W. Robertson, IRI

statistical methods for tailoring seasonal climate forecasts Andrew W. Robertson, IRI statistical methods for tailoring seasonal climate forecasts Andrew W. Robertson, IRI tailored seasonal forecasts why do we make probabilistic forecasts? to reduce our uncertainty about the (unknown) future

More information

Machine Learning (BSMC-GA 4439) Wenke Liu

Machine Learning (BSMC-GA 4439) Wenke Liu Machine Learning (BSMC-GA 4439) Wenke Liu 02-01-2018 Biomedical data are usually high-dimensional Number of samples (n) is relatively small whereas number of features (p) can be large Sometimes p>>n Problems

More information

Immediate Reward Reinforcement Learning for Projective Kernel Methods

Immediate Reward Reinforcement Learning for Projective Kernel Methods ESANN'27 proceedings - European Symposium on Artificial Neural Networks Bruges (Belgium), 25-27 April 27, d-side publi., ISBN 2-9337-7-2. Immediate Reward Reinforcement Learning for Projective Kernel Methods

More information

Principal Dynamical Components

Principal Dynamical Components Principal Dynamical Components Manuel D. de la Iglesia* Departamento de Análisis Matemático, Universidad de Sevilla Instituto Nacional de Matemática Pura e Aplicada (IMPA) Rio de Janeiro, May 14, 2013

More information

Support Vector Machines. Maximizing the Margin

Support Vector Machines. Maximizing the Margin Support Vector Machines Support vector achines (SVMs) learn a hypothesis: h(x) = b + Σ i= y i α i k(x, x i ) (x, y ),..., (x, y ) are the training exs., y i {, } b is the bias weight. α,..., α are the

More information

Meteorology B Wright State Invite Team Name Team # Student Members: &

Meteorology B Wright State Invite Team Name Team # Student Members: & 1 Meteorology B Team Name Team # Student Members: & Raw Score: / 126 Rank: Part I. Multiple Choice. Answer the following questions by selecting the best answer. 2 points each. 1. All of the following are

More information

La Niña impacts on global seasonal weather anomalies: The OLR perspective. Andrew Chiodi and Ed Harrison

La Niña impacts on global seasonal weather anomalies: The OLR perspective. Andrew Chiodi and Ed Harrison La Niña impacts on global seasonal weather anomalies: The OLR perspective Andrew Chiodi and Ed Harrison Outline Motivation Impacts of the El Nino- Southern Oscillation (ENSO) on seasonal weather anomalies

More information

AnuMS 2018 Atlantic Hurricane Season Forecast

AnuMS 2018 Atlantic Hurricane Season Forecast AnuMS 2018 Atlantic Hurricane Season Forecast Issued: April 10, 2018 by Dale C. S. Destin (follow @anumetservice) Director (Ag), Antigua and Barbuda Meteorological Service (ABMS) The *AnuMS (Antigua Met

More information

Delayed Response of the Extratropical Northern Atmosphere to ENSO: A Revisit *

Delayed Response of the Extratropical Northern Atmosphere to ENSO: A Revisit * Delayed Response of the Extratropical Northern Atmosphere to ENSO: A Revisit * Ruping Mo Pacific Storm Prediction Centre, Environment Canada, Vancouver, BC, Canada Corresponding author s address: Ruping

More information

Linear and Non-Linear Dimensionality Reduction

Linear and Non-Linear Dimensionality Reduction Linear and Non-Linear Dimensionality Reduction Alexander Schulz aschulz(at)techfak.uni-bielefeld.de University of Pisa, Pisa 4.5.215 and 7.5.215 Overview Dimensionality Reduction Motivation Linear Projections

More information

Lecture 1c: Gaussian Processes for Regression

Lecture 1c: Gaussian Processes for Regression Lecture c: Gaussian Processes for Regression Cédric Archambeau Centre for Computational Statistics and Machine Learning Department of Computer Science University College London c.archambeau@cs.ucl.ac.uk

More information

Stefan Liess University of Minnesota Saurabh Agrawal, Snigdhansu Chatterjee, Vipin Kumar University of Minnesota

Stefan Liess University of Minnesota Saurabh Agrawal, Snigdhansu Chatterjee, Vipin Kumar University of Minnesota Introducing and Finding Tripoles: A Connection Between Central Asia and the Tropical Pacific Stefan Liess University of Minnesota liess@umn.edu Saurabh Agrawal, Snigdhansu Chatterjee, Vipin Kumar University

More information

Kernel-Based Retrieval of Atmospheric Profiles from IASI Data

Kernel-Based Retrieval of Atmospheric Profiles from IASI Data Kernel-Based Retrieval of Atmospheric Profiles from IASI Data Gustavo Camps-Valls, Valero Laparra, Jordi Muñoz-Marí, Luis Gómez-Chova, Xavier Calbet Image Processing Laboratory (IPL), Universitat de València.

More information

Kernel methods for comparing distributions, measuring dependence

Kernel methods for comparing distributions, measuring dependence Kernel methods for comparing distributions, measuring dependence Le Song Machine Learning II: Advanced Topics CSE 8803ML, Spring 2012 Principal component analysis Given a set of M centered observations

More information

Machine Learning. Regression basics. Marc Toussaint University of Stuttgart Summer 2015

Machine Learning. Regression basics. Marc Toussaint University of Stuttgart Summer 2015 Machine Learning Regression basics Linear regression, non-linear features (polynomial, RBFs, piece-wise), regularization, cross validation, Ridge/Lasso, kernel trick Marc Toussaint University of Stuttgart

More information

Jeff Howbert Introduction to Machine Learning Winter

Jeff Howbert Introduction to Machine Learning Winter Classification / Regression Support Vector Machines Jeff Howbert Introduction to Machine Learning Winter 2012 1 Topics SVM classifiers for linearly separable classes SVM classifiers for non-linearly separable

More information

Construction and Analysis of Climate Networks

Construction and Analysis of Climate Networks Construction and Analysis of Climate Networks Karsten Steinhaeuser University of Minnesota Workshop on Understanding Climate Change from Data Minneapolis, MN August 15, 2011 Working Definitions Knowledge

More information

Kernel Methods. Machine Learning A W VO

Kernel Methods. Machine Learning A W VO Kernel Methods Machine Learning A 708.063 07W VO Outline 1. Dual representation 2. The kernel concept 3. Properties of kernels 4. Examples of kernel machines Kernel PCA Support vector regression (Relevance

More information

1. INTRODUCTION 2. HIGHLIGHTS

1. INTRODUCTION 2. HIGHLIGHTS Bulletin Issue January 2017 Issue Number: ICPAC/03/44 IGAD Climate Prediction and Applications Centre Seasonal Bulletin, Review for October to December (OND) Season 2016 For referencing within this bulletin,

More information

CIS 520: Machine Learning Oct 09, Kernel Methods

CIS 520: Machine Learning Oct 09, Kernel Methods CIS 520: Machine Learning Oct 09, 207 Kernel Methods Lecturer: Shivani Agarwal Disclaimer: These notes are designed to be a supplement to the lecture They may or may not cover all the material discussed

More information

Neural networks and support vector machines

Neural networks and support vector machines Neural netorks and support vector machines Perceptron Input x 1 Weights 1 x 2 x 3... x D 2 3 D Output: sgn( x + b) Can incorporate bias as component of the eight vector by alays including a feature ith

More information

Support Vector Machine & Its Applications

Support Vector Machine & Its Applications Support Vector Machine & Its Applications A portion (1/3) of the slides are taken from Prof. Andrew Moore s SVM tutorial at http://www.cs.cmu.edu/~awm/tutorials Mingyue Tan The University of British Columbia

More information

Advanced Methods for Fault Detection

Advanced Methods for Fault Detection Advanced Methods for Fault Detection Piero Baraldi Agip KCO Introduction Piping and long to eploration distance pipelines activities Piero Baraldi Maintenance Intervention Approaches & PHM Maintenance

More information

Ridge Regression 1. to which some random noise is added. So that the training labels can be represented as:

Ridge Regression 1. to which some random noise is added. So that the training labels can be represented as: CS 1: Machine Learning Spring 15 College of Computer and Information Science Northeastern University Lecture 3 February, 3 Instructor: Bilal Ahmed Scribe: Bilal Ahmed & Virgil Pavlu 1 Introduction Ridge

More information

Online Mode Shape Estimation using Complex Principal Component Analysis and Clustering, Hallvar Haugdal (NTMU Norway)

Online Mode Shape Estimation using Complex Principal Component Analysis and Clustering, Hallvar Haugdal (NTMU Norway) Online Mode Shape Estimation using Complex Principal Component Analysis and Clustering, Hallvar Haugdal (NTMU Norway) Mr. Hallvar Haugdal, finished his MSc. in Electrical Engineering at NTNU, Norway in

More information

Denoising and Dimension Reduction in Feature Space

Denoising and Dimension Reduction in Feature Space Denoising and Dimension Reduction in Feature Space Mikio L. Braun Fraunhofer Institute FIRST.IDA Kekuléstr. 7, 2489 Berlin mikio@first.fhg.de Joachim Buhmann Inst. of Computational Science ETH Zurich CH-8092

More information

Learning sets and subspaces: a spectral approach

Learning sets and subspaces: a spectral approach Learning sets and subspaces: a spectral approach Alessandro Rudi DIBRIS, Università di Genova Optimization and dynamical processes in Statistical learning and inverse problems Sept 8-12, 2014 A world of

More information

Statistical foundations

Statistical foundations Statistical foundations Michael K. Tippett International Research Institute for Climate and Societ The Earth Institute, Columbia Universit ERFS Climate Predictabilit Tool Training Workshop Ma 4-9, 29 Ideas

More information

I C P A C. IGAD Climate Prediction and Applications Centre Monthly Climate Bulletin, Climate Review for April 2018

I C P A C. IGAD Climate Prediction and Applications Centre Monthly Climate Bulletin, Climate Review for April 2018 No. ICPAC/02/312 Bulletin Issue May 2018 I C P A C IGAD Climate Prediction and Applications Centre Monthly Climate Bulletin, Climate Review for April 2018 1. INTRODUCTION This bulletin reviews the April

More information

I C P A C. IGAD Climate Prediction and Applications Centre Monthly Climate Bulletin, Climate Review for September 2017

I C P A C. IGAD Climate Prediction and Applications Centre Monthly Climate Bulletin, Climate Review for September 2017 Bulletin Issue October 2017 I C P A C IGAD Climate Prediction and Applications Centre Monthly Climate Bulletin, Climate Review for September 2017 1. INTRODUCTION This bulletin reviews the September 2017

More information

Kernel Partial Least Squares for Nonlinear Regression and Discrimination

Kernel Partial Least Squares for Nonlinear Regression and Discrimination Kernel Partial Least Squares for Nonlinear Regression and Discrimination Roman Rosipal Abstract This paper summarizes recent results on applying the method of partial least squares (PLS) in a reproducing

More information

Machine Learning : Support Vector Machines

Machine Learning : Support Vector Machines Machine Learning Support Vector Machines 05/01/2014 Machine Learning : Support Vector Machines Linear Classifiers (recap) A building block for almost all a mapping, a partitioning of the input space into

More information

Nonlinear principal component analysis of noisy data

Nonlinear principal component analysis of noisy data Nonlinear principal component 1 Nonlinear principal component analysis of noisy data William W. Hsieh Dept. of Earth and Ocean Sciences, University of British Columbia Vancouver, BC V6T 1Z4, Canada Abstract

More information

Nonlinear Projection Trick in kernel methods: An alternative to the Kernel Trick

Nonlinear Projection Trick in kernel methods: An alternative to the Kernel Trick Nonlinear Projection Trick in kernel methods: An alternative to the Kernel Trick Nojun Kak, Member, IEEE, Abstract In kernel methods such as kernel PCA and support vector machines, the so called kernel

More information

Reconstruction-based Contribution for Process Monitoring with Kernel Principal Component Analysis.

Reconstruction-based Contribution for Process Monitoring with Kernel Principal Component Analysis. American Control Conference Marriott Waterfront, Baltimore, MD, USA June 3-July, FrC.6 Reconstruction-based Contribution for Process Monitoring with Kernel Principal Component Analysis. Carlos F. Alcala

More information

AnuMS 2018 Atlantic Hurricane Season Forecast

AnuMS 2018 Atlantic Hurricane Season Forecast AnuMS 2018 Atlantic Hurricane Season Forecast Issued: May 10, 2018 by Dale C. S. Destin (follow @anumetservice) Director (Ag), Antigua and Barbuda Meteorological Service (ABMS) The *AnuMS (Antigua Met

More information

Chemometrics: Classification of spectra

Chemometrics: Classification of spectra Chemometrics: Classification of spectra Vladimir Bochko Jarmo Alander University of Vaasa November 1, 2010 Vladimir Bochko Chemometrics: Classification 1/36 Contents Terminology Introduction Big picture

More information

Kernel Methods in Machine Learning

Kernel Methods in Machine Learning Kernel Methods in Machine Learning Autumn 2015 Lecture 1: Introduction Juho Rousu ICS-E4030 Kernel Methods in Machine Learning 9. September, 2015 uho Rousu (ICS-E4030 Kernel Methods in Machine Learning)

More information

3. Carbon Dioxide (CO 2 )

3. Carbon Dioxide (CO 2 ) 3. Carbon Dioxide (CO 2 ) Basic information on CO 2 with regard to environmental issues Carbon dioxide (CO 2 ) is a significant greenhouse gas that has strong absorption bands in the infrared region and

More information