Intelligence and statistics for rapid and robust earthquake detection, association and location

Similar documents
Tsunami early warning using earthquake rupture duration

Machine learning comes from Bayesian decision theory in statistics. There we want to minimize the expected value of the loss function.

A New Automatic Pattern Recognition Approach for the Classification of Volcanic Tremor at Mount Etna, Italy

Issues and Techniques in Pattern Classification

9/26/17. Ridge regression. What our model needs to do. Ridge Regression: L2 penalty. Ridge coefficients. Ridge coefficients

ShakeMaps during the Emilia sequence

6.036 midterm review. Wednesday, March 18, 15

Course in Data Science

Tsunami early warning within 5 minutes

Logic and machine learning review. CS 540 Yingyu Liang

Introduction to Machine Learning Midterm Exam

Machine Learning Practice Page 2 of 2 10/28/13

Neural Networks. Single-layer neural network. CSE 446: Machine Learning Emily Fox University of Washington March 10, /9/17

Loss Functions, Decision Theory, and Linear Models

Algebraic and topological perspectives on semi-supervised learning

Linear Models for Classification

What is semi-supervised learning?

Machine Learning Basics

Microarray Data Analysis: Discovery

Discovery Through Situational Awareness

Final Overview. Introduction to ML. Marek Petrik 4/25/2017

Machine Learning and Deep Learning! Vincent Lepetit!

Holdout and Cross-Validation Methods Overfitting Avoidance

Introduction to Support Vector Machines

CS7267 MACHINE LEARNING

Twitter s Effectiveness on Blackout Detection during Hurricane Sandy

Robotics 2. AdaBoost for People and Place Detection. Kai Arras, Cyrill Stachniss, Maren Bennewitz, Wolfram Burgard

Machine Learning. Boris

W vs. QCD Jet Tagging at the Large Hadron Collider

Lecture 3: Statistical Decision Theory (Part II)

Finding Multiple Outliers from Multidimensional Data using Multiple Regression

Image classification via Scattering Transform

Introduction to Machine Learning

PAC Learning Introduction to Machine Learning. Matt Gormley Lecture 14 March 5, 2018

Machine Learning. Lecture 9: Learning Theory. Feng Li.

Brief Introduction of Machine Learning Techniques for Content Analysis

Manual for a computer class in ML

Integrating ARCGIS with Datamining Software to Predict Habitat for Red Sea Urchins on the Coast of British Columbia.

Improvements to Seismic Monitoring of the European Arctic Using Three-Component Array Processing at SPITS

Rapid estimation of rupture extent for large earthquakes: application to the 2004, M9 Sumatra- Andaman mega-thrust

Machine Learning And Applications: Supervised Learning-SVM

Hurricane Prediction with Python

Support Vector Machines. CAP 5610: Machine Learning Instructor: Guo-Jun QI

Graph-Based Anomaly Detection with Soft Harmonic Functions

Learning From Data Lecture 15 Reflecting on Our Path - Epilogue to Part I

INTRODUCTION TO DATA SCIENCE

Pattern Recognition 2018 Support Vector Machines

EEE 241: Linear Systems

An Overview of Outlier Detection Techniques and Applications

Pattern Recognition and Machine Learning. Learning and Evaluation of Pattern Recognition Processes

Data Mining. 3.6 Regression Analysis. Fall Instructor: Dr. Masoud Yaghini. Numeric Prediction

Modern Information Retrieval

Support Vector Machine. Industrial AI Lab.

The Changing Landscape of Land Administration

CPSC 540: Machine Learning

Jeff Howbert Introduction to Machine Learning Winter

Support Vector Machines (SVM) in bioinformatics. Day 1: Introduction to SVM

Machine Learning (CS 567) Lecture 2

Logistic Regression. COMP 527 Danushka Bollegala

UNIVERSITY of PENNSYLVANIA CIS 520: Machine Learning Final, Fall 2013

CSC Neural Networks. Perceptron Learning Rule

EEL 851: Biometrics. An Overview of Statistical Pattern Recognition EEL 851 1

SVM TRADE-OFF BETWEEN MAXIMIZE THE MARGIN AND MINIMIZE THE VARIABLES USED FOR REGRESSION

Learning with multiple models. Boosting.

OGS (Udine, Italy)

Bayesian Networks Inference with Probabilistic Graphical Models

Comparison of Shannon, Renyi and Tsallis Entropy used in Decision Trees

Support Vector Machines: Kernels

OVERLAPPING ANIMAL SOUND CLASSIFICATION USING SPARSE REPRESENTATION

Classifier Selection. Nicholas Ver Hoeve Craig Martek Ben Gardner

Seismic source modeling by clustering earthquakes and predicting earthquake magnitudes

Statistical Machine Learning Theory. From Multi-class Classification to Structured Output Prediction. Hisashi Kashima.

Semi-supervised Learning

Boosting: Algorithms and Applications

CLUe Training An Introduction to Machine Learning in R with an example from handwritten digit recognition

Learning Methods for Linear Detectors

Statistical Learning Reading Assignments

SVAN 2016 Mini Course: Stochastic Convex Optimization Methods in Machine Learning

DM534 - Introduction to Computer Science

Nonlinear Classification

PAC-learning, VC Dimension and Margin-based Bounds

CS570 Data Mining. Anomaly Detection. Li Xiong. Slide credits: Tan, Steinbach, Kumar Jiawei Han and Micheline Kamber.

WMO Aeronautical Meteorology Scientific Conference 2017

CC283 Intelligent Problem Solving 28/10/2013

How to do backpropagation in a brain

DANIEL WILSON AND BEN CONKLIN. Integrating AI with Foundation Intelligence for Actionable Intelligence

A Statistical Framework for Analysing Big Data Global Conference on Big Data for Official Statistics October, 2015 by S Tam, Chief

Machine Learning Methods for Radio Host Cross-Identification with Crowdsourced Labels

Notation. Pattern Recognition II. Michal Haindl. Outline - PR Basic Concepts. Pattern Recognition Notions

CS 6375 Machine Learning

Chap 1. Overview of Statistical Learning (HTF, , 2.9) Yongdai Kim Seoul National University

Support Vector Machine. Industrial AI Lab. Prof. Seungchul Lee

Introduction to Machine Learning Midterm Exam Solutions

Learning Linear Detectors

Optimizing Model Development and Validation Procedures of Partial Least Squares for Spectral Based Prediction of Soil Properties

Application of Support Vector Machine to the classification of volcanic tremor at Etna, Italy

Machine Learning Applications in Astronomy

Quantum machine learning for quantum anomaly detection CQT AND SUTD, SINGAPORE ARXIV:

How Do We Know Where an Earthquake Originated? Teacher's Guide

Does Unlabeled Data Help?

Transcription:

Intelligence and statistics for rapid and robust earthquake detection, association and location Anthony Lomax ALomax Scientific, Mouans-Sartoux, France anthony@alomax.net www.alomax.net @ALomaxNet Alberto Michelini, Fabrizio Bernardi, and Valentino Lauciani Istituto Nazionale di Geofisica e Vulcanologia, Roma, Italy

Early-est: rapid, fully automatic determination of the location, depth, magnitude, mechanism and tsunami potential of an earthquake For effective earthquake and tsunami early-warning it is crucial that key earthquake parameters are determined as rapidly and reliably as possible. EarlyEst: Rapid earthquake analysis module at INGV CAT tsunami alert center: Realtime display OT+8min ee

Rapid, early results use minimal data: prone to bias & errors, magnitudes, false events,... Example: False events Causes: M3.5 Greece FALSE: M6 Mali FALSE: M6 South Atlantic Ocean M7 Mid-Atlantic Ridge X X Poor station distribution 3D structure but 1D velocity model Mis-picked phases Poor pick/travel-time error model... Use statistics and machine learning to identify problems.

Identification of false events: apply statistics and machine learning to past true and (few) false events Data Frame (2D array) of training data: possible important attributes to discriminate true or false events labelled - identified as true or false event Problem: many true and few false events!

First step: basic statistical & expert analysis of past true and (few) false events Statistics: scatter matrix: examine pairs of attributes N phs N phs secondary gap outliers false events located by compact, distant clusters of stations gap2 gap2 outliers in origin-time error large pick residuals for false events σot σot depth outliers in longitude false events sometimes deep and in a-seismic regions longitude depth false events true events mb mb depth longitude latitude latitude N mb N mb possible important attributes to discriminate true or false events dmin dmin pandas.tools.plotting.scatter_matrix

Second step: Machine learning (classification, regression, ) for identifying outliers, making decisions, finding patterns... Examine semi-automatically in high-dimension many attributes What is machine learning? Applications: Given training data, construct an algorithm to make predications on new data. Decision / classification (e.g. False event? Tsunamigenic earthquake?) 1. Learn (select and tune algorithms) using training data. 2. Test algorithm on testing data. 3. Apply algorithm to new data. Supervised learning: predict attributes of data: - Classification: learn from labeled, xy training data how to predict the (discrete) class y of new, unlabeled data x. - Regression: learn from xy training data how to predict the (continuous) values of y variables in new data x. Outlier detection (e.g. False event? Unusual event?) Unsupervised learning: No target attributes, try to discover clustering or distribution of the data, or reduce the dimensionality of the data. and many more... http://scikit-learn.org

Machine learning: multitude of methods depending on goals and characteristics of data set Identify false events http://scikit-learn.org

Multiple machine learning algorithms: train and test with past true and (few) false events Classifier Algorithms: Support vector machines (SVMs) Data Frame (2D array) of training data: possible important attributes to discriminate true or false events Nearest Neighbors Classification labelled true or false event Problem: many true and few false events! and many more...

Multiple machine learning algorithms: train and test with past true and (few) false events False & true events promising good good promising Algorithms act in high-dimension using many attributes may discover complex relationships between attributes, may be difficult to understand in terms of expert knowledge & scientific theory. Many algorithms to select and parameters to tune great open software helps.

Intelligence and statistics for rapid and robust earthquake analysis, identification of false events: Conclusions Statistical analysis aids in acting on individual or few attributes, (e.g. stronger filtering on azimuth gaps) Direct use of expert knowledge & scientific theory Machine learning acts in high-dimension using many attributes: Powerful and shows much promise for improving early warning reliability, Many machine learning algorithms are very familiar in geophysics, and powerful new algorithms for big data, image recognition, BUT, automated, not theory based False events: Include past event history? Recursive Neural Networks? Easy to use with well documented, open tools in Python, R, Java, What advantages and trade-offs for science? Machine learning, Automation Expert knowledge, Scientific Theory Support: Centro Nazionale Terremoti, INGV Data: ingv.it, geofon.gfz-potsdam.de, geosbud.ipgp.fr, resif.fr, ird.nc, iris.washington.edu, usgs.gov Software: Python: scikit-learn.org, pandas.pydata.org, matplotlib.org; R statistics language Anthony Lomax ALomax Scientific, Mouans-Sartoux, France anthony@alomax.net www.alomax.net @ALomaxNet Alberto Michelini, Fabrizio Bernardi, and Valentino Lauciani Istituto Nazionale di Geofisica e Vulcanologia, Roma, Italy