Jae-Bong Lee 1 and Bernard A. Megrey 2. International Symposium on Climate Change Effects on Fish and Fisheries

Similar documents
ECE 521. Lecture 11 (not on midterm material) 13 February K-means clustering, Dimensionality reduction

ARTIFICIAL NEURAL NETWORKS گروه مطالعاتي 17 بهار 92

Why Forecast Recruitment?

Dimension Reduction Techniques. Presented by Jie (Jerry) Yu

Comparing walleye pollock dynamics across the Bering Sea and adjacent areas

Analysis of Interest Rate Curves Clustering Using Self-Organising Maps

Scuola di Calcolo Scientifico con MATLAB (SCSM) 2017 Palermo 31 Luglio - 4 Agosto 2017

Statistical Pattern Recognition

Neural Networks and Machine Learning research at the Laboratory of Computer and Information Science, Helsinki University of Technology

Multivariate Analysis of Ecological Data

Jacqueline M. Grebmeier Chesapeake Biological Laboratory University of Maryland Center for Environmental Science, Solomons, MD, USA

Statistical Machine Learning

STA 414/2104: Lecture 8

Application of SOM neural network in clustering

Face Recognition Using Laplacianfaces He et al. (IEEE Trans PAMI, 2005) presented by Hassan A. Kingravi

Introduction to multivariate analysis Outline

Spatial dynamics of small pelagic fish in the California Current system on the regime time-scale. Parallel processes in other species-ecosystems.

Neural Networks Lecture 4: Radial Bases Function Networks

Climate Change and Arctic Ecosystems

by co-chairs Lisa Hendrickson, Hassan Moustahfid and Alexander Arkhipkin

Machine learning for pervasive systems Classification in high-dimensional spaces

Introduction to Machine Learning. PCA and Spectral Clustering. Introduction to Machine Learning, Slides: Eran Halperin

Matching the dimensionality of maps with that of the data

Clustering. CSL465/603 - Fall 2016 Narayanan C Krishnan

Department of Computer Science and Engineering

Climate Outlook for October 2017 March 2018

A Synthesis of Results from the Norwegian ESSAS (N-ESSAS) Project

Unsupervised machine learning

Application of Clustering to Earth Science Data: Progress and Challenges

Part I. Linear regression & LASSO. Linear Regression. Linear Regression. Week 10 Based in part on slides from textbook, slides of Susan Holmes

University of Florida CISE department Gator Engineering. Clustering Part 1

Lecture 7: Con3nuous Latent Variable Models

Learning Vector Quantization

Focus was on solving matrix inversion problems Now we look at other properties of matrices Useful when A represents a transformations.

Unsupervised Learning Techniques Class 07, 1 March 2006 Andrea Caponnetto

Artificial Neural Networks. Edward Gatt

Self Organizing Maps. We are drowning in information and starving for knowledge. A New Approach for Integrated Analysis of Geological Data.

Kevin Friedland National Marine Fisheries Service Narragansett, Rhode Island

How will climate change affect the Cold Water Prawn?

Entropy Manipulation of Arbitrary Non I inear Map pings

DIMENSION REDUCTION AND CLUSTER ANALYSIS

Trites and Larkin, 1996). The dashed line shows the division between the declining

CLIMAR-III Third JCOMM Workshop on Advances in Marine Climatology 6-9 May Gdynia, Poland

Climate Outlook for March August 2018

December 20, MAA704, Multivariate analysis. Christopher Engström. Multivariate. analysis. Principal component analysis

Reduction of complex models using data-mining and nonlinear projection techniques

Unsupervised learning: beyond simple clustering and PCA

An Introduction to Nonlinear Principal Component Analysis

Motivating the Covariance Matrix

Unsupervised Learning: K- Means & PCA

Data Exploration and Unsupervised Learning with Clustering

EEL 851: Biometrics. An Overview of Statistical Pattern Recognition EEL 851 1

Applying Visual Analytics Methods to Spatial Time Series Data: Forest Fires, Phone Calls,

Machine Learning. CUNY Graduate Center, Spring Lectures 11-12: Unsupervised Learning 1. Professor Liang Huang.

Self-Organization by Optimizing Free-Energy

Clustering VS Classification

Principal Component Analysis of Sea Surface Temperature via Singular Value Decomposition

Dimension Reduction (PCA, ICA, CCA, FLD,

Q & A on Trade-off between intensity and frequency of global tropical cyclones

Machine Learning of Environmental Spatial Data Mikhail Kanevski 1, Alexei Pozdnoukhov 2, Vasily Demyanov 3

ECE662: Pattern Recognition and Decision Making Processes: HW TWO

Data Mining Techniques

Global (ISOMAP) versus Local (LLE) Methods in Nonlinear Dimensionality Reduction

Using GIS to locate hotspots for bluefin tuna

ROBERTO BATTITI, MAURO BRUNATO. The LION Way: Machine Learning plus Intelligent Optimization. LIONlab, University of Trento, Italy, Apr 2015

Learning sets and subspaces: a spectral approach

Covariance and Correlation Matrix

Applying Multi-Model Superensemble Methods to Global Ocean Operational Systems

Learning Vector Quantization (LVQ)

Applying cluster analysis to 2011 Census local authority data

CMIP3/CMIP5 differences: Scenario (SRESA1B vs RCP4.5) Ensemble mean Tas responses: CMIP3 = 2.8 K CMIP5 = 1.9 K CMIP5 higher average resolution

The Impact of Changing Sea Ice and Hydrographic Conditions on Biological Communities in the Northern Bering and Chukchi Seas

Cheng Soon Ong & Christian Walder. Canberra February June 2018

STA 414/2104: Lecture 8

Feature Extraction with Weighted Samples Based on Independent Component Analysis

Clustering by Mixture Models. General background on clustering Example method: k-means Mixture model based clustering Model estimation

AnuMS 2018 Atlantic Hurricane Season Forecast

53 contributors for 35 individual reports in 2009 show 5% of figures today

Dimensionality Reduction

CSC321 Lecture 20: Autoencoders

Efficient unsupervised clustering for spatial bird population analysis along the Loire river

Discriminant Uncorrelated Neighborhood Preserving Projections

Predictive analysis on Multivariate, Time Series datasets using Shapelets

Major human activities affecting Norwegian coastal marine ecosystems; present status and challenges

Expectation-maximization analysis of spatial time series

Chap.11 Nonlinear principal component analysis [Book, Chap. 10]

Critical Issues in Assessment of Offshore Wind Farm Development on Dispersion and Settling of Scallop Larvae in the Northeast U.S.

7. Variable extraction and dimensionality reduction

Algebraic and topological perspectives on semi-supervised learning

Principles of Pattern Recognition. C. A. Murthy Machine Intelligence Unit Indian Statistical Institute Kolkata

Unsupervised Learning

Classification for High Dimensional Problems Using Bayesian Neural Networks and Dirichlet Diffusion Trees

The North Atlantic Oscillation: Climatic Significance and Environmental Impact

Last updated: Oct 22, 2012 LINEAR CLASSIFIERS. J. Elder CSE 4404/5327 Introduction to Machine Learning and Pattern Recognition

PCA, Kernel PCA, ICA

Clusters. Unsupervised Learning. Luc Anselin. Copyright 2017 by Luc Anselin, All Rights Reserved

Species specific geographical distribution patterns in a warm Barents Sea: haddock vs. cod

Preprocessing & dimensionality reduction

Nutrient fields reveal identity of ecosystems: A case study from the Bering Sea Kirill Kivva

Transcription:

International Symposium on Climate Change Effects on Fish and Fisheries On the utility of self-organizing maps (SOM) and k-means clustering to characterize and compare low frequency spatial and temporal climate impacts on marine ecosystem productivity Jae-Bong Lee 1 and Bernard A Megrey 2 1 National Fisheries Research and Development Institute, Busan, Korea 2 Alaska Fisheries Science Center, Seattle, USA

Outline Motivation and rational: temporal and spatial of R and S variability compared Describe Self organizing maps (SOM) Apply to time series trends in the ln(r/s) response variable Discuss the utility of SOM May 7, 21 2

Russia Alaska Greenland Eastern Bering Sea (EBS) (BNS) Barents Sea Gulf of Alaska (GOA) Norwegian Sea Norway 4 Major Canada Ecosystems USA GOM GB Gulf of Maine / Georges Bank (GOM/GB)

Trends in Recruitment Time Series Anomalies Temporal and spatial patterns of R and S variability from 17 stocks were compared among functionally analogous species and similar feeding guilds from four marine ecosystems Calculate the anomaly in ln(r/s), Pool by feeding guild (benthic vs pelagic) or ecosystem (EBS, GOA, GB/GOM, BNS) by calculating the average anomaly per year Look for within and cross ecosystem trends

1 ln(r/s) Response Variable EBS and GOA follow same trend (r=64, p<1) GB and BNS follow same trend (r=89, p<1) ln(r /S) 8 EBS GOA 6 GB 4 BNS 2-2 -4-6 195 196 197 198 199 2 21 EBS/GOA out of phase with GB/BNS (r=-67, p<1) Declining survival since mid-9 s for all but EBS Declining pelagic survival for GB and BNS ln(r/s) 12 1 8 6 4 2-2 -4 EBSpelagic GOApelagic GBpelagic BNSpelagic Improving benthic survival for GB and BNS since late 197 s Declining benthic survival for EBS and GOA over same period Recent upturn in benthic survival in EBS since late 199 s and in pelagic survival since mid 199 s log(r/s) -6 195 196 197 198 199 2 21 1 EBSbenthic 5 GOAbenthic GBbenthic BNSbenthic -5-1 -15 195 196 197 198 199 2 21 Year

Dynamic Factor Analysis ln(r/s) Pelagic Species Benthic Species

Survival Changes all Four Ecosystems: Benthic Regime Shift ~ 1989? EBSBenthic, 196-25 Probability = 1, cutoff length = 1, Huber parameter = 1 BNSBenthic 1964-22 Probability = 1, cutoff length = 1, Huber parameter = 1 3 2 1 15 1 5-5 -1-15 -2-25 -3 ln(r/s) -1-2 -3 196 1963 1966 1969 1972 1975 1978 1981 1984 1987 199 1993 1996 1999 22 25 196 1963 1966 1969 1972 1975 1978 1981 1984 1987 199 1993 1996 1999 22 25 Year Year GBBenthic, 1963-24 Probability = 1, cutoff length = 1, Huber parameter = 1 GOABenthic, 1961-24 Probability = 1, cutoff length = 1, Huber parameter = 1 25 2 15 1 5-5 -1-15 -2 15 1 5-5 -1-15 -2-25 196 1963 1966 1969 1972 1975 1978 1981 1984 1987 199 1993 1996 1999 22 25 196 1963 1966 1969 1972 1975 1978 1981 1984 1987 199 1993 1996 1999 22 25 Year Year ln(r/s) ln(r/s) ln(r/s)

Climate forcing of the Barents/Norwegian Sea? BNSBenthic 1964-22 Probability = 1, cutoff length = 1, Huber parameter = 1 ln(r/s) 15 1 5-5 -1-15 -2-25 -3 BNS Pelagic ln(r/s) 196 1963 1966 1969 1972 1975 1978 1981 1984 1987 199 1993 1996 1999 22 25 Year BNS SST, 1948-26 Probability = 1, cutoff length = 1, Huber parameter = 1 SST 29 27 25 23 21 19 17 15 1948 1952 1956 196 1964 1968 1972 1976 Year 198 1984 1988 1992 1996 2 24 BNS SST

Purpose Motivation and rational: temporal and spatial of R and S variability compared Describe Self organizing maps (SOM) Apply to time series trends in the ln(r/s) response variable Discuss the utility of SOM May 7, 21 9

Methods Artificial neural network (ANN) -- Self organizing mapping (SOM) May 7, 21 1

Wikipedia http://enwikipediaorg/wiki/self-organizing_map A self-organizing map (SOM) or self-organizing feature map (SOFM) is a type of artificial neural network that is trained using unsupervised learning to produce a low-dimensional (typically twodimensional) map Self-organizing maps are different from other artificial neural networks in the sense that they use a neighborhood function to preserve the topological properties of the input space This makes SOM useful for visualizing low-dimensional views of high-dimensional data, akin to multidimensional scaling The model was first described as an artificial neural network by the Finnish professor Teuvo Kohonen, and is sometimes called a Kohonen map Therefore, SOM forms a semantic map where similar samples are mapped close together and dissimilar apart This may be visualized by a U-Matrix (Euclidean distance between weight vectors of neighboring cells) of the SOM SOM may be considered a nonlinear generalization of Principal components analysis (PCA) SOM has many advantages over the conventional feature extraction methods such as Empirical Orthogonal Functions (EOF) or PCA May 7, 21 11

Sample units SU 1 SU 2 SU 3 SU p species sp 1 sp 2 sp n Complex data set Ordination Polar Ordination (PO) Principal Component Analysis (PCA) Correspondence Analysis (CoA) Nonlinear Multidimensional Scaling (NMDS) Classification Artificial Neural Network (ANN) SOM BP Interpretation Studying Large Ordination Limitations: Self Organizing sample to ecological strong display of Maps, plant distortions statistical a community good animal tool with sample abundance to non-linear simplify units drawn high species dimensional from abundance an ecological dataset community relations, Bacpropagation horseshoe A huge neural matrix effect, network, arch often effect, difficult for prediction outliers, to analyse and missing discrimination and data, interpret disjointed data matrix,

sp 1 SU 1 SU 2 SU 3 Sample units SU i SU p Species sp 2 sp n Initialization Learning of the reference vectors (iterative) - random choice of a sample unit - Locating the best matching unit (BMU) - learning of the BMU and its neighbors Put the SUs on the map SU SU SU SU SU SU SU SU SU SU SU SU SU Reference vectors (virtual units) The In each dataset SOM hexagon, method is projected a virtual in a unit non-linear (VU) will way be considered onto a rectangular The virtual grid units laid out are on virtual a hexagonal sites with lattice: species the abundance Kohonen map to be computed The modifications of the VUs are made through an ANN

Clustering with Self-Organizing Maps Teaching the SOM: the species abundance is computed for each virtual units Computing the U-matrix Mapping the Sample Units onto the U-matrix Making the clustering structure apparent for the human expert of the dataset by selecting the brightness of the display

Clustering with Self-Organizing Maps With a large dataset, when dendrograms become very difficult to read, the SOM and the U-matrix are able to provide a very convenient visualisation U-matrix is not a "ready made" clustering algorithm but rather a tool for the inspection of high dimensional data The clusters have to be «seen» on the map by the human dataset expert The expert can define all types of clusters including the non-convex ones

K-means Clustering k-means clustering is a method of cluster analysis which aims to partition n observations into k clusters in which each observation belongs to the cluster with the nearest mean It is similar to the expectation-maximization algorithm for mixtures of Gaussians in that they both attempt to find the centers of natural clusters in the data It has been shown that the relaxed solution of k-means clustering, specified by the cluster indicators, is given by the PCA (principal component analysis) principal components, and the PCA subspace spanned by the principal directions is identical to the cluster centroid subspace specified by the between-class scatter matrix May 7, 21 16

Apply model to four ecosystem data Ln(R/S) of pelagic Ln(R/S) of benthic Ln(R/S) of the pooled for ecosystems May 7, 21 17

Smoothed Anomalie 12 1 8 6 4 2-2 EBSpelagic GOApelagic GBpelagic BNSpelagic -4-6 1965 197 1975 198 1985 199 1995 2 25 21 Year 1 4 8 3 6 2 GOAPelagic 4 2 BNSPelagic 1-1 -2-2 -4-3 -1-5 5 1 15-4 -3-2 -1 1 2 3 4 May 7, 21 18 EBSPelagic GBPelagic

Smoothed Anomalie 12 1 8 6 4 2-2 -4 EBSpelagic GOApelagic GBpelagic BNSpelagic -6 1965 197 1975 198 1985 199 1995 2 25 21 Year May 7, 21 19

May 7, 21 2 U-matrix 111 127 244 d 267 482 938 EBSPelagic d -23 114 459 GOAPelagic d 121 267 413 GBPelagic d -112-854 945 BNSPelagic ic-ca Labels 79 8 81 82 83 78 77 98 67 68 5 69 7 71 84 97 85 86 87 66 96 72 89 88 99 73 9 74 91 76 75 92 93 94 95 1 2 3 4 64 65

logr/s Pelagic EBSpelagic GOApelagic GBpelagic BNSpelagic 172 174 176 178 18 182 184 186 188 19 Linkage Distance May 7, 21 21

1 5 Smoothed Anomalie -5 EBSbenthic -1 GOAbenthic GBbenthic -15 BNSbenthic 1965 197 1975 198 1985 199 1995 2 25 21 Year 1 8 6 4 8 6 4 2 GOABenthic 2-2 -4-6 -8-1 -12-1 -5 5 1 EBSBenthic May 7, 21 22 BNSBenthic -2-4 -6-8 -1-12 -14-8 -6-4 -2 2 4 6 8 GBBenthic

1 5 Smoothed Anomalie -5 EBSbenthic -1 GOAbenthic GBbenthic -15 BNSbenthic 1965 197 1975 198 1985 199 1995 2 25 21 Year May 7, 21 23

U-matrix 29 2 378 d -859-597 -335 EBSBenthic d -677-273 131 GOABenthic d -823-476 -129 GBBenthic d -136-671 175 BNSBenthic ic-ca Labels 87 88 89 5 9 92 94 95 96 91 93 97 98 86 99 84 85 83 1 82 8 81 2 3 4 64 65 66 78 79 67 77 68 76 75 73 74 72 71 7 69 May 7, 21 24

lnr/s Benthic EBSbenthic GBbenthic GOAbenthic BNSbenthic 22 24 26 28 3 32 34 Linkage Distance May 7, 21 25

Smoothed Anomalie 1 8 6 4 2-2 -4 EBS GOA GB BNS -6 1965 197 1975 198 1985 199 1995 2 25 21 Year 8 3 6 2 4 1 GOA 2 BNS -1-2 -2-4 -3-6 -6-4 -2 2 4 6 8 1 EBS -4-4 -3-2 -1 1 2 3 4 GB

Smoothed Anomalie 1 8 6 4 2-2 -4 EBS GOA GB BNS -6 1965 197 1975 198 1985 199 1995 2 25 21 Year

lnr/s Ecosystem EBS GOA GB BNS 5 1 15 2 25 Linkage Distance May 7, 21 29

Conclusions SOMS and k-means clustering provide a highly visual tool to easily identify patterns in the timing of high or low productivity years across both species and ecosystems Many of the peaks in the time series were synchronous within an ocean basin and opposing alternations in patterns of productivity were observed in ecosystems in between the Atlantic and Pacific Ocean basins Basin-scale results (similar within but different between) suggest that productivity in the two geographically broad areas are connected by unknown climatic mechanisms that, depending on the year, generate low frequency opposing alternations after 1976 and 1988 in the two basins May 7, 21 3

Thank you for paying attention Presented by Jae Bong Lee 1 and Bernard A Megrey 2 1 NFRDI, (Leejb@nfrdigokr) 2 Alaska Fishery Science Center NOAA Fisheries (BernMegrey@noaagov) at Sendai, Japan (28 April 21)