Interaction Analysis of Spatial Point Patterns

Size: px
Start display at page:

Download "Interaction Analysis of Spatial Point Patterns"

Transcription

1 Interaction Analysis of Spatial Point Patterns Geog 2C Introduction to Spatial Data Analysis Phaedon C Kyriakidis wwwgeogucsbedu/ phaedon Department of Geography University of California Santa Barbara Santa Barbara, CA 936- phaedon@geogucsbedu Spring Quarter 9 Spatial Point Patterns Definition Set of point locations with recorded events within study region, eg, locations of trees, disease or crime incidents N= clustered events in a study region N= random events in a study region point locations could correspond to all possible events or to subsets of them (mapped versus sampled point pattern) attribute values could have also been measured at event locations, eg, tree diameter (marked point pattern) not considered in this handout Objective of this handout Introduce statistical tools for quantifying spatial interaction of events, eg, clustering versus randomness or regularity Ph Kyriakidis (UCSB) Geog 2C Spring 9 2 / 27

2 Outline Concepts & Notation Distance & Distance Matrices Distances Involved in Spatial Point Patterns Quantifying Spatial Interaction: G Function Quantifying Spatial Interaction: F Function Quantifying Spatial Interaction: K Function Points To Remember Ph Kyriakidis (UCSB) Geog 2C Spring 9 3 / 27 Some Notation Concepts & Notation Point events Set of N locations of events occurring in a study area: {u i, i =,, N}, u i D R K u i = coordinate vector of i-th event location, eg, in 2D u i = {x i y i }, = belongs to, D = study domain, a subset of a K-dimensional space R K Variable of interest y(s) = number of events (a count) within arbitrary domain or support s with measure (length, area, volume) s ; support s is centered at an arbitrary location u and can also be denoted as s(u); in statistics, y(s) is treated as a realization of a random variable (RV) Y (s) Objective Quantify interaction, eg, covariation, between outcomes of any two RVs Y (s) and Y (s ) To do so, all RVs must lie in the same environment ; in other words, the long-term average (expectation) of RV Y (s) should be similar to that of Y (s ) Ph Kyriakidis (UCSB) Geog 2C Spring 9 4 / 27

3 Concepts & Notation Intensity of Events Local intensity λ(u) Mean number of events per unit area at an arbitrary location or point u, formally defined as: { } E{Y (s)} λ(u) = lim, u D s s where E{Y (s)} denotes the expectation (mean) of RV Y (s) within region s(u) centered at u and s is the area of that region Overall intensity λ Estimated as: ˆλ = n D, where D = measure (area) of study region D First-order stationarity Any RV Y (s) should have the same long-term average, for a fixed areal unit s This implies a constant intensity: λ(u) = λ, u D, and the expected number of events with a region s is just a function of s : E{Y (s)} = λ s, s D Ph Kyriakidis (UCSB) Geog 2C Spring 9 5 / 27 Concepts & Notation Interaction Between Count RVs Second-order intensity Long-term average (expectation) of products of counts per unit areas at any two arbitrary points u and u, formally defined as: { E{Y (s)y (s σ(u, u )} ) = lim s, s s s Some terminology }, u, u D not the same as E{Y(s)}*E{Y(s')}, unless variables are independent second-order stationarity: expectation of all RVs is constant (first-order stationarity), and second-order intensity is a function of separation vector between any two locations u and u isotropy: only distance (not orientation) of separation vector matters Outlook Quantifying interaction in spatial point patterns within the above assumptions or working hypotheses amounts to studying distances between events Ph Kyriakidis (UCSB) Geog 2C Spring 9 6 / 27

4 Distance & Distance Matrices Distance A measure of proximity (typically along a crow s flight path) between any two locations or spatial entities Euclidean distance Consider two points in a 2D (geographical or other) space with coordinates u i = (x i, y i ) and u j = (x j, y j ) The Euclidean distance d ij between points u i and u j is computed via Pythagoras s theorem as: d ij = d(u i, u j ) = u i u j = (x i x j ) 2 + (y i y j ) 2 u i u j is called the 2-norm of vector h ij = u i u j locations u i and u j are called, respectively, the tail and head of vector h ij y i y u i d ij y i y j y j u j x i x j x j Ph Kyriakidis (UCSB) Geog 2C Spring 9 7 / 27 x i x Distance Metric Distance & Distance Matrices Formal characteristics of a distance metric A measure d ij of proximity between locations u i and u j is a valid distance metric if it satisfies the following requirements: distance between a point and itself is always zero: d ii = distance between a point and another one is always positive: d ij > distance between two points is the same no matter which point you consider first: d ij = d ji the triangular inequality holds: sum of length of two sides of a triangle cannot be smaller than length of third side: d ij d il + d lj A metric d ij need not always be Euclidean, hence should checked to ensure that it is a valid distance metric Ph Kyriakidis (UCSB) Geog 2C Spring 9 8 / 27

5 Distance & Distance Matrices Non-Euclidean Distances Alternative distance measures (i) over a road, or railway, (ii) along a river, (ii) over a network u 2 u u3 u 4 u 5 Euclidean distance between locations network distance between locations Even more exotic distance measures (i) travel time over a network, (ii) perceived travel time between urban landmarks, (iii) volume of exports/imports Euclidean distances between network nodes actual or perceived distances on the network the latter might not even be formal distance metrics, ie: d ij d ji Ph Kyriakidis (UCSB) Geog 2C Spring 9 9 / 27 Distance & Distance Matrices Minkowski s Generalized Distance Definition Consider two points in a K-dimensional (geographical or other) space R K with coordinate vectors u i = [u i,, u ik,, u ik ] and u j = [u j,, u jk,, u jk ] The Minkowski distance of order p (with p > ), denoted as d (p) ij, between points u i and u j is computed as: ( K ) /p d (p) ij = u ik u jk p Particular cases k= Manhattan or city-block distance: d () K ij = Euclidean distance: d (2) k= u ik u jk 2 infinity norm or Chebyshev distance, as p : max( u i u j,, u ik u jk,, u ik u jk ) ij = K k= u ik u jk Distances computed from points in multidimensional spaces are routinely used in statistical pattern recognition; points represent objects or cases, each described by K attribute values Ph Kyriakidis (UCSB) Geog 2C Spring 9 / 27

6 Distance & Distance Matrices Euclidean Distance Matrix: Single Set of Points Definition Consider a set of N points {u,, u i,, u N } in a K-dimensional (geographical or other) space The distance matrix D is square (N N) matrix containing the distances {d(u i, u j ), i =,, N, j =,, N} between all N N possible pairs of points in the set u i u u 2 u 3 u 4 u 5 x i x x 2 x 3 x 4 x 5 y i y y 2 y 3 y 4 y 5 by convention, u is the coordinate vector of the st point in the set (st entry in data file) D = d d 2 d 3 d 4 d 5 d 2 d 22 d 23 d 24 d 25 d 3 d 32 d 33 d 34 d 35 d 4 d 42 d 43 d 44 d 45 d 5 d 52 d 53 d 54 d 55 = d 2 d 3 d 4 d 5 d 2 d 23 d 24 d 25 d 3 d 23 d 34 d 35 d 4 d 24 d 34 d 45 d 5 d 25 d 35 d 45 = [d ij] i-th row (or column) contains distances between i-th point u i and all others (including itself) D is symmetric with zeros along its diagonal Ph Kyriakidis (UCSB) Geog 2C Spring 9 / 27 Distance & Distance Matrices Euclidean Distance Matrix: Two Sets of Points Definition Consider 2 sets of points {u,, u i,, u N } and {t,, t j,, t M } in a K-dimensional (geographical or other) space The distance matrix D is a (N M) matrix containing the Euclidean distances {d(u i, t j ), i =,, N, j =,, M} between all N M possible pairs formed by these two sets of points u i u u 2 u 3 u 4 u 5 x i x x 2 x 3 x 4 x 5 y i y y 2 y 3 y 4 y 5 t j t t 2 t 3 t 4 t 5 t 6 t 7 x j x x 2 x 3 x 4 x 5 x 6 x 7 y j y y 2 y 3 y 4 y 5 y 6 y 7 by convention, u is the coordinate vector of the st datum in the data set #, and similarly for t D = d d 2 d 3 d 4 d 5 d 6 d 7 d 2 d 22 d 23 d 24 d 25 d 26 d 27 d 3 d 32 d 33 d 34 d 35 d 36 d 37 d 4 d 42 d 43 d 44 d 45 d 46 d 47 d 5 d 52 d 53 d 54 d 55 d 56 d 57 = [d ij] i-th row contains distances between i-th point u i in set # and all points in set #2 j-th column contains distances between j-th point t j in set #2 and all points in set # D is not symmetric, ie, d 2 d 2 : pair {u, t 2 } is not the same as pair {u 2, t } Ph Kyriakidis (UCSB) Geog 2C Spring 9 2 / 27

7 Distances Involved in Spatial Point Patterns Distances Between Events in A Point Pattern Event-to-event distance Distance d ij between event at location u i and another event at location u j : d ij = (x i x j ) 2 + (y i y j ) 2 Point-to-event distance Distance d pj between a randomly chosen point at location t p and an event at location u j : d pj = ( x p x j ) 2 + (ỹ p y j ) 2 Event-to-nearest-event distance Distance d min (u i ) between an event at location u i and its nearest neighbor event: d min (u i ) = min{d ij, j =,, N} j i Point-to-nearest-event distance Distance d min (t p ) between a randomly chosen point at location t p and its nearest neighbor event: d min (t p ) = min{ d pj, j =,, N} Ph Kyriakidis (UCSB) Geog 2C Spring 9 3 / 27 Distances Involved in Spatial Point Patterns Event-to-Nearest-Event Distances u Pattern with N=5 events u 2 u u 5 3 u Distance matrix eg, 598 = d min (u ), 762 = d min (u 2 ) Some events might be nearest neighbors of each other: eg, u 4, u 5, or have same nearest neighbor: eg, u 2, u 3, u 4 are nearest neighbors of u 5 Mean nearest neighbor distance Average of all d min (u i ) values: d min = N d min (u i ) N i= Drawback: single number does not suffice to describe point pattern Ph Kyriakidis (UCSB) Geog 2C Spring 9 4 / 27

8 The G Function Quantifying Spatial Interaction: G Function Definition Proportion of event-to-nearest-event distances d min (u i ) no greater than given distance cutoff d, estimated as: Ĝ(d) = #{d min(u i ) d, i =,, N} N Cumulative distribution function (CDF) of all N event-to-nearest-event distances; instead of computing average d min of d min values, compute their CDF For point pattern in previous page Sample histogram of event nearest neighbor distances Ĝ(d) Sample G function event-to-nearest neighbor distance, d event-to-nearest neighbor distance, d for larger number of events N, Ĝ(d) becomes smoother Ph Kyriakidis (UCSB) Geog 2C Spring 9 5 / 27 Quantifying Spatial Interaction: G Function Event-to-Nearest-Event (E2NE) Distance Histograms N= random stratified events in a study region N= clustered events in a study region Histogram of E2NE distances (evenly spaced events) Histogram of E2NE distances (clustered events) event-to-nearest neighbor distance, d event-to-nearest neighbor distance, d for evenly-spaced events, more E2NE distances similar to spacing of events for clustered events, more small E2NE distances and fewer large such distances Ph Kyriakidis (UCSB) Geog 2C Spring 9 6 / 27

9 Quantifying Spatial Interaction: G Function Sample G Function Examples N= random stratified events in a study region N= clustered events in a study region Sample G function (evenly spaced events) Sample G function (clustered events) Ĝ(d) 5 Ĝ(d) event-to-nearest neighbor distance, d event-to-nearest neighbor distance, d for evenly-spaced events, Ĝ(d) rises gradually up to the distance at which most events are spaced, and then increases rapidly for clustered events, Ĝ(d) rises rapidly at short distances, and then levels off at larger d-values Ph Kyriakidis (UCSB) Geog 2C Spring 9 7 / 27 The F Function Quantifying Spatial Interaction: F Function Definition Proportion of point-to-nearest-event distances d min (t j ) no greater than given distance cutoff d, estimated as: ˆF (d) = #{ d min (t j ) d, j =,, M} M Cumulative distribution function (CDF) of all M point-to-nearest-event distances Pattern with N=5 events and M= random points Sample F function ˆF (d) point-to-nearest neighbor distance, d for larger number M of random points, ˆF (d) becomes even smoother Note: The F function provides information on event proximity to voids Ph Kyriakidis (UCSB) Geog 2C Spring 9 8 / 27

10 Quantifying Spatial Interaction: F Function Point-to-Nearest-Event (P2NE) Distance Histograms N= random stratified events in a study region N= clustered events in a study region Histogram of P2NE distances (evenly spaced events) 3 Histogram of P2NE distances (clustered events) point-to-nearest neighbor distance, d point-to-nearest neighbor distance, d for evenly-spaced events, there are more nearest events at small distances from randomly placed points for clustered events, P2NE distances are generally larger than the previous case, and there are a few large such distances Ph Kyriakidis (UCSB) Geog 2C Spring 9 9 / 27 Quantifying Spatial Interaction: F Function Sample F Function Examples N= random stratified events in a study region N= clustered events in a study region Sample F function (evenly spaced events) Sample F function (clustered events) ˆF (d) 5 ˆF (d) point-to-nearest neighbor distance, d point-to-nearest neighbor distance, d for evenly-spaced events, ˆF (d) rises rapidly up to the distance at which most events are spaced, and then levels off (more nearest neighbors at small distances from randomly placed points) for clustered events, ˆF (d) rises rapidly at short distances, and then levels off at larger d-values Ph Kyriakidis (UCSB) Geog 2C Spring 9 / 27

11 Quantifying Spatial Interaction: F Function Comparing Sample G and F Functions N= random stratified events in a study region N= clustered events in a study region proportion Sample G and F functions (evenly spaced events) proportion Sample G and F functions (clustered events) Ĝ(d) ˆF (d) distance, d Ĝ(d) ˆF (d) distance, d for evenly-spaced events, there is more open space (smaller point-to-event distances), hence ˆF (d) rises faster than Ĝ(d) for clustered events, the reverse is true Ph Kyriakidis (UCSB) Geog 2C Spring 9 2 / 27 Quantifying Spatial Interaction: K Function The Sample K Function Concept building construct set of concentric circles (of increasing radius d) around each event 2 count number of events in each distance band 3 cumulative number of events up to radius d around all events = sample K function ˆK(d) Formal definition K(d) = u 3 u Example of K function estimation 6 events within distance h=6 units from event at location 3 events within distance h=6 units from event at location u 2 4 events within distance h=6 units from event at location E{# of events within distance d of any arbitrary event } E{# of events within study domain } λ N #{d ij d, i =,, N, j( i) =,, N} = ˆK(d) Ph Kyriakidis (UCSB) Geog 2C Spring 9 22 / 27

12 Quantifying Spatial Interaction: K Function Interpreting The Sample K Function Re-expressing ˆK(d) = λ N #{d ij d, i =,, N, j( i) =,, N} = D N N #{d ij d, i =,, N, j( i) =,, N} = D (proportion of event-to-event distances d) In other words: Function ˆK(d) is the sample cumulative distribution function (CDF) of all N 2 N event-to-event distances, scaled by D u Pattern with N=5 events u 2 u u 5 3 u 4 Sample histogram of event to event distances event-to-event distance, d ˆK(d)/ A Sample K function (/) event-to-event distance, d Note: Ignore bin at d = (center plot) and point at d = (right plot) Ph Kyriakidis (UCSB) Geog 2C Spring 9 23 / 27 Quantifying Spatial Interaction: K Function Event-to-Event Distance Histograms N= random stratified events in a study region N= clustered events in a study region Histogram of event to event distances (evenly spaced) 45 Histogram of event to event distances (clustered) event-to-event distance event-to-event distance for evenly-spaced events, there are more medium-sized E2E distances than small or large such distances for clustered events, the distribution of E2E distances is multi-modal Ph Kyriakidis (UCSB) Geog 2C Spring 9 24 / 27

13 Quantifying Spatial Interaction: K Function Event-to-Event Distance CDFs N= random stratified events in a study region N= clustered events in a study region cumulative CDF of event to event distances (evenly spaced) event-to-event distance cumulative CDF of event to event distances (clustered) event-to-event distance for clustered events, there are multiple bumps in the CDF of E2E distances due to the grouping of events in space Ph Kyriakidis (UCSB) Geog 2C Spring 9 25 / 27 Quantifying Spatial Interaction: K Function Sample K Function Examples N= random stratified events in a study region N= clustered events in a study region Sample K function (evenly spaced events) Sample K function (clustered events) Area proportion, ˆK(d) Area proportion, ˆK(d) event-to-event distance, d event-to-event distance, d sample K function ˆK(d) is monotonically increasing and is a scaled (by domain measure D ) version of the CDF of E2E distances Ph Kyriakidis (UCSB) Geog 2C Spring 9 26 / 27

14 Points To Remember Recap Quantifying interaction in spatial point patterns event-to-nearest-event distances use the sample G function Ĝ(d) point-to-nearest-event distances use the sample F function ˆF (d) event-to-event distances use the sample K function ˆK(d) K function looks at information beyond nearest neighbors Caveats clustering is always a function of the overall intensity of a point pattern clustering might occur due to local intensity variations or due to interaction; it is very difficult to disentangle each contribution Watch out for boundaries and edge effects distance distortions due to map projections sampled versus mapped point patterns Ph Kyriakidis (UCSB) Geog 2C Spring 9 27 / 27

Intensity Analysis of Spatial Point Patterns Geog 210C Introduction to Spatial Data Analysis

Intensity Analysis of Spatial Point Patterns Geog 210C Introduction to Spatial Data Analysis Intensity Analysis of Spatial Point Patterns Geog 210C Introduction to Spatial Data Analysis Chris Funk Lecture 5 Topic Overview 1) Introduction/Unvariate Statistics 2) Bootstrapping/Monte Carlo Simulation/Kernel

More information

Intensity Analysis of Spatial Point Patterns Geog 210C Introduction to Spatial Data Analysis

Intensity Analysis of Spatial Point Patterns Geog 210C Introduction to Spatial Data Analysis Intensity Analysis of Spatial Point Patterns Geog 210C Introduction to Spatial Data Analysis Chris Funk Lecture 4 Spatial Point Patterns Definition Set of point locations with recorded events" within study

More information

Overview of Statistical Analysis of Spatial Data

Overview of Statistical Analysis of Spatial Data Overview of Statistical Analysis of Spatial Data Geog 2C Introduction to Spatial Data Analysis Phaedon C. Kyriakidis www.geog.ucsb.edu/ phaedon Department of Geography University of California Santa Barbara

More information

GIST 4302/5302: Spatial Analysis and Modeling Point Pattern Analysis

GIST 4302/5302: Spatial Analysis and Modeling Point Pattern Analysis GIST 4302/5302: Spatial Analysis and Modeling Point Pattern Analysis Guofeng Cao www.spatial.ttu.edu Department of Geosciences Texas Tech University guofeng.cao@ttu.edu Fall 2018 Spatial Point Patterns

More information

Overview of Spatial analysis in ecology

Overview of Spatial analysis in ecology Spatial Point Patterns & Complete Spatial Randomness - II Geog 0C Introduction to Spatial Data Analysis Chris Funk Lecture 8 Overview of Spatial analysis in ecology st step in understanding ecological

More information

Introduction. Spatial Processes & Spatial Patterns

Introduction. Spatial Processes & Spatial Patterns Introduction Spatial data: set of geo-referenced attribute measurements: each measurement is associated with a location (point) or an entity (area/region/object) in geographical (or other) space; the domain

More information

Data Mining: Data. Lecture Notes for Chapter 2. Introduction to Data Mining

Data Mining: Data. Lecture Notes for Chapter 2. Introduction to Data Mining Data Mining: Data Lecture Notes for Chapter 2 Introduction to Data Mining by Tan, Steinbach, Kumar Similarity and Dissimilarity Similarity Numerical measure of how alike two data objects are. Is higher

More information

Proximity data visualization with h-plots

Proximity data visualization with h-plots The fifth international conference user! 2009 Proximity data visualization with h-plots Irene Epifanio Dpt. Matemàtiques, Univ. Jaume I (SPAIN) epifanio@uji.es; http://www3.uji.es/~epifanio Outline Motivating

More information

Bivariate Distributions. Discrete Bivariate Distribution Example

Bivariate Distributions. Discrete Bivariate Distribution Example Spring 7 Geog C: Phaedon C. Kyriakidis Bivariate Distributions Definition: class of multivariate probability distributions describing joint variation of outcomes of two random variables (discrete or continuous),

More information

Spatial Analysis I. Spatial data analysis Spatial analysis and inference

Spatial Analysis I. Spatial data analysis Spatial analysis and inference Spatial Analysis I Spatial data analysis Spatial analysis and inference Roadmap Outline: What is spatial analysis? Spatial Joins Step 1: Analysis of attributes Step 2: Preparing for analyses: working with

More information

Lecture 20 : Markov Chains

Lecture 20 : Markov Chains CSCI 3560 Probability and Computing Instructor: Bogdan Chlebus Lecture 0 : Markov Chains We consider stochastic processes. A process represents a system that evolves through incremental changes called

More information

Clustering Lecture 1: Basics. Jing Gao SUNY Buffalo

Clustering Lecture 1: Basics. Jing Gao SUNY Buffalo Clustering Lecture 1: Basics Jing Gao SUNY Buffalo 1 Outline Basics Motivation, definition, evaluation Methods Partitional Hierarchical Density-based Mixture model Spectral methods Advanced topics Clustering

More information

Algorithms for Picture Analysis. Lecture 07: Metrics. Axioms of a Metric

Algorithms for Picture Analysis. Lecture 07: Metrics. Axioms of a Metric Axioms of a Metric Picture analysis always assumes that pictures are defined in coordinates, and we apply the Euclidean metric as the golden standard for distance (or derived, such as area) measurements.

More information

Lecture 2: Review of Basic Probability Theory

Lecture 2: Review of Basic Probability Theory ECE 830 Fall 2010 Statistical Signal Processing instructor: R. Nowak, scribe: R. Nowak Lecture 2: Review of Basic Probability Theory Probabilistic models will be used throughout the course to represent

More information

Nature of Spatial Data. Outline. Spatial Is Special

Nature of Spatial Data. Outline. Spatial Is Special Nature of Spatial Data Outline Spatial is special Bad news: the pitfalls of spatial data Good news: the potentials of spatial data Spatial Is Special Are spatial data special? Why spatial data require

More information

Basic Properties of Metric and Normed Spaces

Basic Properties of Metric and Normed Spaces Basic Properties of Metric and Normed Spaces Computational and Metric Geometry Instructor: Yury Makarychev The second part of this course is about metric geometry. We will study metric spaces, low distortion

More information

DS-GA 1002 Lecture notes 0 Fall Linear Algebra. These notes provide a review of basic concepts in linear algebra.

DS-GA 1002 Lecture notes 0 Fall Linear Algebra. These notes provide a review of basic concepts in linear algebra. DS-GA 1002 Lecture notes 0 Fall 2016 Linear Algebra These notes provide a review of basic concepts in linear algebra. 1 Vector spaces You are no doubt familiar with vectors in R 2 or R 3, i.e. [ ] 1.1

More information

Notion of Distance. Metric Distance Binary Vector Distances Tangent Distance

Notion of Distance. Metric Distance Binary Vector Distances Tangent Distance Notion of Distance Metric Distance Binary Vector Distances Tangent Distance Distance Measures Many pattern recognition/data mining techniques are based on similarity measures between objects e.g., nearest-neighbor

More information

Lecture 7. Econ August 18

Lecture 7. Econ August 18 Lecture 7 Econ 2001 2015 August 18 Lecture 7 Outline First, the theorem of the maximum, an amazing result about continuity in optimization problems. Then, we start linear algebra, mostly looking at familiar

More information

Distances and similarities Based in part on slides from textbook, slides of Susan Holmes. October 3, Statistics 202: Data Mining

Distances and similarities Based in part on slides from textbook, slides of Susan Holmes. October 3, Statistics 202: Data Mining Distances and similarities Based in part on slides from textbook, slides of Susan Holmes October 3, 2012 1 / 1 Similarities Start with X which we assume is centered and standardized. The PCA loadings were

More information

Let x be an approximate solution for Ax = b, e.g., obtained by Gaussian elimination. Let x denote the exact solution. Call. r := b A x.

Let x be an approximate solution for Ax = b, e.g., obtained by Gaussian elimination. Let x denote the exact solution. Call. r := b A x. ESTIMATION OF ERROR Let x be an approximate solution for Ax = b, e.g., obtained by Gaussian elimination. Let x denote the exact solution. Call the residual for x. Then r := b A x r = b A x = Ax A x = A

More information

ROBERTO BATTITI, MAURO BRUNATO. The LION Way: Machine Learning plus Intelligent Optimization. LIONlab, University of Trento, Italy, Apr 2015

ROBERTO BATTITI, MAURO BRUNATO. The LION Way: Machine Learning plus Intelligent Optimization. LIONlab, University of Trento, Italy, Apr 2015 ROBERTO BATTITI, MAURO BRUNATO. The LION Way: Machine Learning plus Intelligent Optimization. LIONlab, University of Trento, Italy, Apr 2015 http://intelligentoptimization.org/lionbook Roberto Battiti

More information

Points. Luc Anselin. Copyright 2017 by Luc Anselin, All Rights Reserved

Points. Luc Anselin.   Copyright 2017 by Luc Anselin, All Rights Reserved Points Luc Anselin http://spatial.uchicago.edu 1 classic point pattern analysis spatial randomness intensity distance-based statistics points on networks 2 Classic Point Pattern Analysis 3 Classic Examples

More information

Permutations and Combinations

Permutations and Combinations Permutations and Combinations Permutations Definition: Let S be a set with n elements A permutation of S is an ordered list (arrangement) of its elements For r = 1,..., n an r-permutation of S is an ordered

More information

University of Florida CISE department Gator Engineering. Clustering Part 1

University of Florida CISE department Gator Engineering. Clustering Part 1 Clustering Part 1 Dr. Sanjay Ranka Professor Computer and Information Science and Engineering University of Florida, Gainesville What is Cluster Analysis? Finding groups of objects such that the objects

More information

Jim Lambers MAT 610 Summer Session Lecture 2 Notes

Jim Lambers MAT 610 Summer Session Lecture 2 Notes Jim Lambers MAT 610 Summer Session 2009-10 Lecture 2 Notes These notes correspond to Sections 2.2-2.4 in the text. Vector Norms Given vectors x and y of length one, which are simply scalars x and y, the

More information

Uncertainty Quantification and Validation Using RAVEN. A. Alfonsi, C. Rabiti. Risk-Informed Safety Margin Characterization. https://lwrs.inl.

Uncertainty Quantification and Validation Using RAVEN. A. Alfonsi, C. Rabiti. Risk-Informed Safety Margin Characterization. https://lwrs.inl. Risk-Informed Safety Margin Characterization Uncertainty Quantification and Validation Using RAVEN https://lwrs.inl.gov A. Alfonsi, C. Rabiti North Carolina State University, Raleigh 06/28/2017 Assumptions

More information

GIST 4302/5302: Spatial Analysis and Modeling

GIST 4302/5302: Spatial Analysis and Modeling GIST 4302/5302: Spatial Analysis and Modeling Lecture 2: Review of Map Projections and Intro to Spatial Analysis Guofeng Cao http://thestarlab.github.io Department of Geosciences Texas Tech University

More information

Chapter 7 Network Flow Problems, I

Chapter 7 Network Flow Problems, I Chapter 7 Network Flow Problems, I Network flow problems are the most frequently solved linear programming problems. They include as special cases, the assignment, transportation, maximum flow, and shortest

More information

AN ELEMENTARY PROOF OF THE SPECTRAL RADIUS FORMULA FOR MATRICES

AN ELEMENTARY PROOF OF THE SPECTRAL RADIUS FORMULA FOR MATRICES AN ELEMENTARY PROOF OF THE SPECTRAL RADIUS FORMULA FOR MATRICES JOEL A. TROPP Abstract. We present an elementary proof that the spectral radius of a matrix A may be obtained using the formula ρ(a) lim

More information

DEN: Linear algebra numerical view (GEM: Gauss elimination method for reducing a full rank matrix to upper-triangular

DEN: Linear algebra numerical view (GEM: Gauss elimination method for reducing a full rank matrix to upper-triangular form) Given: matrix C = (c i,j ) n,m i,j=1 ODE and num math: Linear algebra (N) [lectures] c phabala 2016 DEN: Linear algebra numerical view (GEM: Gauss elimination method for reducing a full rank matrix

More information

Motivating the Covariance Matrix

Motivating the Covariance Matrix Motivating the Covariance Matrix Raúl Rojas Computer Science Department Freie Universität Berlin January 2009 Abstract This note reviews some interesting properties of the covariance matrix and its role

More information

MTAEA Vectors in Euclidean Spaces

MTAEA Vectors in Euclidean Spaces School of Economics, Australian National University January 25, 2010 Vectors. Economists usually work in the vector space R n. A point in this space is called a vector, and is typically defined by its

More information

L2: Review of probability and statistics

L2: Review of probability and statistics Probability L2: Review of probability and statistics Definition of probability Axioms and properties Conditional probability Bayes theorem Random variables Definition of a random variable Cumulative distribution

More information

5. Discriminant analysis

5. Discriminant analysis 5. Discriminant analysis We continue from Bayes s rule presented in Section 3 on p. 85 (5.1) where c i is a class, x isap-dimensional vector (data case) and we use class conditional probability (density

More information

Linear Algebra Review

Linear Algebra Review Chapter 1 Linear Algebra Review It is assumed that you have had a course in linear algebra, and are familiar with matrix multiplication, eigenvectors, etc. I will review some of these terms here, but quite

More information

MA677 Assignment #3 Morgan Schreffler Due 09/19/12 Exercise 1 Using Hölder s inequality, prove Minkowski s inequality for f, g L p (R d ), p 1:

MA677 Assignment #3 Morgan Schreffler Due 09/19/12 Exercise 1 Using Hölder s inequality, prove Minkowski s inequality for f, g L p (R d ), p 1: Exercise 1 Using Hölder s inequality, prove Minkowski s inequality for f, g L p (R d ), p 1: f + g p f p + g p. Proof. If f, g L p (R d ), then since f(x) + g(x) max {f(x), g(x)}, we have f(x) + g(x) p

More information

Random variables. DS GA 1002 Probability and Statistics for Data Science.

Random variables. DS GA 1002 Probability and Statistics for Data Science. Random variables DS GA 1002 Probability and Statistics for Data Science http://www.cims.nyu.edu/~cfgranda/pages/dsga1002_fall17 Carlos Fernandez-Granda Motivation Random variables model numerical quantities

More information

GIST 4302/5302: Spatial Analysis and Modeling Lecture 2: Review of Map Projections and Intro to Spatial Analysis

GIST 4302/5302: Spatial Analysis and Modeling Lecture 2: Review of Map Projections and Intro to Spatial Analysis GIST 4302/5302: Spatial Analysis and Modeling Lecture 2: Review of Map Projections and Intro to Spatial Analysis Guofeng Cao http://www.spatial.ttu.edu Department of Geosciences Texas Tech University guofeng.cao@ttu.edu

More information

Lecture 10: Dimension Reduction Techniques

Lecture 10: Dimension Reduction Techniques Lecture 10: Dimension Reduction Techniques Radu Balan Department of Mathematics, AMSC, CSCAMM and NWC University of Maryland, College Park, MD April 17, 2018 Input Data It is assumed that there is a set

More information

MA 575 Linear Models: Cedric E. Ginestet, Boston University Revision: Probability and Linear Algebra Week 1, Lecture 2

MA 575 Linear Models: Cedric E. Ginestet, Boston University Revision: Probability and Linear Algebra Week 1, Lecture 2 MA 575 Linear Models: Cedric E Ginestet, Boston University Revision: Probability and Linear Algebra Week 1, Lecture 2 1 Revision: Probability Theory 11 Random Variables A real-valued random variable is

More information

Michael Harrigan Office hours: Fridays 2:00-4:00pm Holden Hall

Michael Harrigan Office hours: Fridays 2:00-4:00pm Holden Hall Announcement New Teaching Assistant Michael Harrigan Office hours: Fridays 2:00-4:00pm Holden Hall 209 Email: michael.harrigan@ttu.edu Guofeng Cao, Texas Tech GIST4302/5302, Lecture 2: Review of Map Projection

More information

Lecture 2: Linear Algebra Review

Lecture 2: Linear Algebra Review CS 4980/6980: Introduction to Data Science c Spring 2018 Lecture 2: Linear Algebra Review Instructor: Daniel L. Pimentel-Alarcón Scribed by: Anh Nguyen and Kira Jordan This is preliminary work and has

More information

Lecture 1 and 2: Introduction and Graph theory basics. Spring EE 194, Networked estimation and control (Prof. Khan) January 23, 2012

Lecture 1 and 2: Introduction and Graph theory basics. Spring EE 194, Networked estimation and control (Prof. Khan) January 23, 2012 Lecture 1 and 2: Introduction and Graph theory basics Spring 2012 - EE 194, Networked estimation and control (Prof. Khan) January 23, 2012 Spring 2012: EE-194-02 Networked estimation and control Schedule

More information

Statistical Pattern Recognition

Statistical Pattern Recognition Statistical Pattern Recognition A Brief Mathematical Review Hamid R. Rabiee Jafar Muhammadi, Ali Jalali, Alireza Ghasemi Spring 2012 http://ce.sharif.edu/courses/90-91/2/ce725-1/ Agenda Probability theory

More information

Vectors To begin, let us describe an element of the state space as a point with numerical coordinates, that is x 1. x 2. x =

Vectors To begin, let us describe an element of the state space as a point with numerical coordinates, that is x 1. x 2. x = Linear Algebra Review Vectors To begin, let us describe an element of the state space as a point with numerical coordinates, that is x 1 x x = 2. x n Vectors of up to three dimensions are easy to diagram.

More information

Data Preprocessing. Cluster Similarity

Data Preprocessing. Cluster Similarity 1 Cluster Similarity Similarity is most often measured with the help of a distance function. The smaller the distance, the more similar the data objects (points). A function d: M M R is a distance on M

More information

PARAMETERIZATION OF NON-LINEAR MANIFOLDS

PARAMETERIZATION OF NON-LINEAR MANIFOLDS PARAMETERIZATION OF NON-LINEAR MANIFOLDS C. W. GEAR DEPARTMENT OF CHEMICAL AND BIOLOGICAL ENGINEERING PRINCETON UNIVERSITY, PRINCETON, NJ E-MAIL:WGEAR@PRINCETON.EDU Abstract. In this report we consider

More information

Vector spaces. DS-GA 1013 / MATH-GA 2824 Optimization-based Data Analysis.

Vector spaces. DS-GA 1013 / MATH-GA 2824 Optimization-based Data Analysis. Vector spaces DS-GA 1013 / MATH-GA 2824 Optimization-based Data Analysis http://www.cims.nyu.edu/~cfgranda/pages/obda_fall17/index.html Carlos Fernandez-Granda Vector space Consists of: A set V A scalar

More information

Matrix Factorization and Analysis

Matrix Factorization and Analysis Chapter 7 Matrix Factorization and Analysis Matrix factorizations are an important part of the practice and analysis of signal processing. They are at the heart of many signal-processing algorithms. Their

More information

The Fundamental Insight

The Fundamental Insight The Fundamental Insight We will begin with a review of matrix multiplication which is needed for the development of the fundamental insight A matrix is simply an array of numbers If a given array has m

More information

Multimedia Retrieval Distance. Egon L. van den Broek

Multimedia Retrieval Distance. Egon L. van den Broek Multimedia Retrieval 2018-1019 Distance Egon L. van den Broek 1 The project: Two perspectives Man Machine or? Objective Subjective 2 The default Default: distance = Euclidean distance This is how it is

More information

Definition A finite Markov chain is a memoryless homogeneous discrete stochastic process with a finite number of states.

Definition A finite Markov chain is a memoryless homogeneous discrete stochastic process with a finite number of states. Chapter 8 Finite Markov Chains A discrete system is characterized by a set V of states and transitions between the states. V is referred to as the state space. We think of the transitions as occurring

More information

5.6. PSEUDOINVERSES 101. A H w.

5.6. PSEUDOINVERSES 101. A H w. 5.6. PSEUDOINVERSES 0 Corollary 5.6.4. If A is a matrix such that A H A is invertible, then the least-squares solution to Av = w is v = A H A ) A H w. The matrix A H A ) A H is the left inverse of A and

More information

Statistics 202: Data Mining. c Jonathan Taylor. Week 2 Based in part on slides from textbook, slides of Susan Holmes. October 3, / 1

Statistics 202: Data Mining. c Jonathan Taylor. Week 2 Based in part on slides from textbook, slides of Susan Holmes. October 3, / 1 Week 2 Based in part on slides from textbook, slides of Susan Holmes October 3, 2012 1 / 1 Part I Other datatypes, preprocessing 2 / 1 Other datatypes Document data You might start with a collection of

More information

The University of Texas at Austin Department of Electrical and Computer Engineering. EE381V: Large Scale Learning Spring 2013.

The University of Texas at Austin Department of Electrical and Computer Engineering. EE381V: Large Scale Learning Spring 2013. The University of Texas at Austin Department of Electrical and Computer Engineering EE381V: Large Scale Learning Spring 2013 Assignment 1 Caramanis/Sanghavi Due: Thursday, Feb. 7, 2013. (Problems 1 and

More information

Review (Probability & Linear Algebra)

Review (Probability & Linear Algebra) Review (Probability & Linear Algebra) CE-725 : Statistical Pattern Recognition Sharif University of Technology Spring 2013 M. Soleymani Outline Axioms of probability theory Conditional probability, Joint

More information

Part I. Other datatypes, preprocessing. Other datatypes. Other datatypes. Week 2 Based in part on slides from textbook, slides of Susan Holmes

Part I. Other datatypes, preprocessing. Other datatypes. Other datatypes. Week 2 Based in part on slides from textbook, slides of Susan Holmes Week 2 Based in part on slides from textbook, slides of Susan Holmes Part I Other datatypes, preprocessing October 3, 2012 1 / 1 2 / 1 Other datatypes Other datatypes Document data You might start with

More information

Lecture 2: Linear Algebra Review

Lecture 2: Linear Algebra Review EE 227A: Convex Optimization and Applications January 19 Lecture 2: Linear Algebra Review Lecturer: Mert Pilanci Reading assignment: Appendix C of BV. Sections 2-6 of the web textbook 1 2.1 Vectors 2.1.1

More information

ACO Comprehensive Exam October 14 and 15, 2013

ACO Comprehensive Exam October 14 and 15, 2013 1. Computability, Complexity and Algorithms (a) Let G be the complete graph on n vertices, and let c : V (G) V (G) [0, ) be a symmetric cost function. Consider the following closest point heuristic for

More information

Technische Universität Dresden Institute of Numerical Mathematics

Technische Universität Dresden Institute of Numerical Mathematics Technische Universität Dresden Institute of Numerical Mathematics An Improved Flow-based Formulation and Reduction Principles for the Minimum Connectivity Inference Problem Muhammad Abid Dar Andreas Fischer

More information

Basic Concepts in Linear Algebra

Basic Concepts in Linear Algebra Basic Concepts in Linear Algebra Grady B Wright Department of Mathematics Boise State University February 2, 2015 Grady B Wright Linear Algebra Basics February 2, 2015 1 / 39 Numerical Linear Algebra Linear

More information

Data dependent operators for the spatial-spectral fusion problem

Data dependent operators for the spatial-spectral fusion problem Data dependent operators for the spatial-spectral fusion problem Wien, December 3, 2012 Joint work with: University of Maryland: J. J. Benedetto, J. A. Dobrosotskaya, T. Doster, K. W. Duke, M. Ehler, A.

More information

Matrices and Vectors

Matrices and Vectors Matrices and Vectors James K. Peterson Department of Biological Sciences and Department of Mathematical Sciences Clemson University November 11, 2013 Outline 1 Matrices and Vectors 2 Vector Details 3 Matrix

More information

GIST 4302/5302: Spatial Analysis and Modeling

GIST 4302/5302: Spatial Analysis and Modeling GIST 4302/5302: Spatial Analysis and Modeling Basics of Statistics Guofeng Cao www.myweb.ttu.edu/gucao Department of Geosciences Texas Tech University guofeng.cao@ttu.edu Spring 2015 Outline of This Week

More information

Metric-based classifiers. Nuno Vasconcelos UCSD

Metric-based classifiers. Nuno Vasconcelos UCSD Metric-based classifiers Nuno Vasconcelos UCSD Statistical learning goal: given a function f. y f and a collection of eample data-points, learn what the function f. is. this is called training. two major

More information

Geometric Constraints II

Geometric Constraints II Geometric Constraints II Realizability, Rigidity and Related theorems. Embeddability of Metric Spaces Section 1 Given the matrix D d i,j 1 i,j n corresponding to a metric space, give conditions under which

More information

CS 246 Review of Linear Algebra 01/17/19

CS 246 Review of Linear Algebra 01/17/19 1 Linear algebra In this section we will discuss vectors and matrices. We denote the (i, j)th entry of a matrix A as A ij, and the ith entry of a vector as v i. 1.1 Vectors and vector operations A vector

More information

Max-plus algebra. Max-plus algebra. Monika Molnárová. Technická univerzita Košice. Max-plus algebra.

Max-plus algebra. Max-plus algebra. Monika Molnárová. Technická univerzita Košice. Max-plus algebra. Technická univerzita Košice monika.molnarova@tuke.sk Outline 1 Digraphs Maximum cycle-mean and transitive closures of a matrix Reducible and irreducible matrices Definite matrices Digraphs Complete digraph

More information

Similarity and Dissimilarity

Similarity and Dissimilarity 1//015 Similarity and Dissimilarity COMP 465 Data Mining Similarity of Data Data Preprocessing Slides Adapted From : Jiawei Han, Micheline Kamber & Jian Pei Data Mining: Concepts and Techniques, 3 rd ed.

More information

Numerical Analysis: Solving Systems of Linear Equations

Numerical Analysis: Solving Systems of Linear Equations Numerical Analysis: Solving Systems of Linear Equations Mirko Navara http://cmpfelkcvutcz/ navara/ Center for Machine Perception, Department of Cybernetics, FEE, CTU Karlovo náměstí, building G, office

More information

GROUP THEORY PRIMER. New terms: so(2n), so(2n+1), symplectic algebra sp(2n)

GROUP THEORY PRIMER. New terms: so(2n), so(2n+1), symplectic algebra sp(2n) GROUP THEORY PRIMER New terms: so(2n), so(2n+1), symplectic algebra sp(2n) 1. Some examples of semi-simple Lie algebras In the previous chapter, we developed the idea of understanding semi-simple Lie algebras

More information

Review of Basic Concepts in Linear Algebra

Review of Basic Concepts in Linear Algebra Review of Basic Concepts in Linear Algebra Grady B Wright Department of Mathematics Boise State University September 7, 2017 Math 565 Linear Algebra Review September 7, 2017 1 / 40 Numerical Linear Algebra

More information

A fast randomized algorithm for approximating an SVD of a matrix

A fast randomized algorithm for approximating an SVD of a matrix A fast randomized algorithm for approximating an SVD of a matrix Joint work with Franco Woolfe, Edo Liberty, and Vladimir Rokhlin Mark Tygert Program in Applied Mathematics Yale University Place July 17,

More information

2 Notation and Preliminaries

2 Notation and Preliminaries On Asymmetric TSP: Transformation to Symmetric TSP and Performance Bound Ratnesh Kumar Haomin Li epartment of Electrical Engineering University of Kentucky Lexington, KY 40506-0046 Abstract We show that

More information

In the Name of God. Lectures 15&16: Radial Basis Function Networks

In the Name of God. Lectures 15&16: Radial Basis Function Networks 1 In the Name of God Lectures 15&16: Radial Basis Function Networks Some Historical Notes Learning is equivalent to finding a surface in a multidimensional space that provides a best fit to the training

More information

Lecture 3: Exploratory Spatial Data Analysis (ESDA) Prof. Eduardo A. Haddad

Lecture 3: Exploratory Spatial Data Analysis (ESDA) Prof. Eduardo A. Haddad Lecture 3: Exploratory Spatial Data Analysis (ESDA) Prof. Eduardo A. Haddad Key message Spatial dependence First Law of Geography (Waldo Tobler): Everything is related to everything else, but near things

More information

CS 664 Segmentation (2) Daniel Huttenlocher

CS 664 Segmentation (2) Daniel Huttenlocher CS 664 Segmentation (2) Daniel Huttenlocher Recap Last time covered perceptual organization more broadly, focused in on pixel-wise segmentation Covered local graph-based methods such as MST and Felzenszwalb-Huttenlocher

More information

Point Pattern Analysis

Point Pattern Analysis Point Pattern Analysis Nearest Neighbor Statistics Luc Anselin http://spatial.uchicago.edu principle G function F function J function Principle Terminology events and points event: observed location of

More information

Multivariate Statistics: Hierarchical and k-means cluster analysis

Multivariate Statistics: Hierarchical and k-means cluster analysis Multivariate Statistics: Hierarchical and k-means cluster analysis Steffen Unkel Department of Medical Statistics University Medical Center Goettingen, Germany Summer term 217 1/43 What is a cluster? Proximity

More information

Probability theory for Networks (Part 1) CS 249B: Science of Networks Week 02: Monday, 02/04/08 Daniel Bilar Wellesley College Spring 2008

Probability theory for Networks (Part 1) CS 249B: Science of Networks Week 02: Monday, 02/04/08 Daniel Bilar Wellesley College Spring 2008 Probability theory for Networks (Part 1) CS 249B: Science of Networks Week 02: Monday, 02/04/08 Daniel Bilar Wellesley College Spring 2008 1 Review We saw some basic metrics that helped us characterize

More information

Spectral Graph Theory and You: Matrix Tree Theorem and Centrality Metrics

Spectral Graph Theory and You: Matrix Tree Theorem and Centrality Metrics Spectral Graph Theory and You: and Centrality Metrics Jonathan Gootenberg March 11, 2013 1 / 19 Outline of Topics 1 Motivation Basics of Spectral Graph Theory Understanding the characteristic polynomial

More information

Stochastic modelling of epidemic spread

Stochastic modelling of epidemic spread Stochastic modelling of epidemic spread Julien Arino Centre for Research on Inner City Health St Michael s Hospital Toronto On leave from Department of Mathematics University of Manitoba Julien Arino@umanitoba.ca

More information

Lecture Notes 1: Vector spaces

Lecture Notes 1: Vector spaces Optimization-based data analysis Fall 2017 Lecture Notes 1: Vector spaces In this chapter we review certain basic concepts of linear algebra, highlighting their application to signal processing. 1 Vector

More information

13 Spherical geometry

13 Spherical geometry 13 Spherical geometry Let ABC be a triangle in the Euclidean plane. From now on, we indicate the interior angles A = CAB, B = ABC, C = BCA at the vertices merely by A, B, C. The sides of length a = BC

More information

Preprocessing & dimensionality reduction

Preprocessing & dimensionality reduction Introduction to Data Mining Preprocessing & dimensionality reduction CPSC/AMTH 445a/545a Guy Wolf guy.wolf@yale.edu Yale University Fall 2016 CPSC 445 (Guy Wolf) Dimensionality reduction Yale - Fall 2016

More information

401 Review. 6. Power analysis for one/two-sample hypothesis tests and for correlation analysis.

401 Review. 6. Power analysis for one/two-sample hypothesis tests and for correlation analysis. 401 Review Major topics of the course 1. Univariate analysis 2. Bivariate analysis 3. Simple linear regression 4. Linear algebra 5. Multiple regression analysis Major analysis methods 1. Graphical analysis

More information

MATH 117 LECTURE NOTES

MATH 117 LECTURE NOTES MATH 117 LECTURE NOTES XIN ZHOU Abstract. This is the set of lecture notes for Math 117 during Fall quarter of 2017 at UC Santa Barbara. The lectures follow closely the textbook [1]. Contents 1. The set

More information

Confidence Intervals, Testing and ANOVA Summary

Confidence Intervals, Testing and ANOVA Summary Confidence Intervals, Testing and ANOVA Summary 1 One Sample Tests 1.1 One Sample z test: Mean (σ known) Let X 1,, X n a r.s. from N(µ, σ) or n > 30. Let The test statistic is H 0 : µ = µ 0. z = x µ 0

More information

Tail Inequalities. The Chernoff bound works for random variables that are a sum of indicator variables with the same distribution (Bernoulli trials).

Tail Inequalities. The Chernoff bound works for random variables that are a sum of indicator variables with the same distribution (Bernoulli trials). Tail Inequalities William Hunt Lane Department of Computer Science and Electrical Engineering, West Virginia University, Morgantown, WV William.Hunt@mail.wvu.edu Introduction In this chapter, we are interested

More information

Measurement and Data

Measurement and Data Measurement and Data Data describes the real world Data maps entities in the domain of interest to symbolic representation by means of a measurement procedure Numerical relationships between variables

More information

3. Review of Probability and Statistics

3. Review of Probability and Statistics 3. Review of Probability and Statistics ECE 830, Spring 2014 Probabilistic models will be used throughout the course to represent noise, errors, and uncertainty in signal processing problems. This lecture

More information

EE263 Review Session 1

EE263 Review Session 1 EE263 Review Session 1 October 5, 2018 0.1 Importing Variables from a MALAB.m file If you are importing variables given in file vars.m, use the following code at the beginning of your script. close a l

More information

Unconstrained Ordination

Unconstrained Ordination Unconstrained Ordination Sites Species A Species B Species C Species D Species E 1 0 (1) 5 (1) 1 (1) 10 (4) 10 (4) 2 2 (3) 8 (3) 4 (3) 12 (6) 20 (6) 3 8 (6) 20 (6) 10 (6) 1 (2) 3 (2) 4 4 (5) 11 (5) 8 (5)

More information

Norm and Distance. Stephen Boyd. EE103 Stanford University. September 27, 2017

Norm and Distance. Stephen Boyd. EE103 Stanford University. September 27, 2017 Norm and Distance Stephen Boyd EE103 Stanford University September 27, 2017 Outline Norm and distance Distance Standard deviation Angle Norm and distance 2 Norm the Euclidean norm (or just norm) of an

More information

proximity similarity dissimilarity distance Proximity Measures:

proximity similarity dissimilarity distance Proximity Measures: Similarity Measures Similarity and dissimilarity are important because they are used by a number of data mining techniques, such as clustering nearest neighbor classification and anomaly detection. The

More information

The Solution of Linear Systems AX = B

The Solution of Linear Systems AX = B Chapter 2 The Solution of Linear Systems AX = B 21 Upper-triangular Linear Systems We will now develop the back-substitution algorithm, which is useful for solving a linear system of equations that has

More information

CS168: The Modern Algorithmic Toolbox Lecture #7: Understanding Principal Component Analysis (PCA)

CS168: The Modern Algorithmic Toolbox Lecture #7: Understanding Principal Component Analysis (PCA) CS68: The Modern Algorithmic Toolbox Lecture #7: Understanding Principal Component Analysis (PCA) Tim Roughgarden & Gregory Valiant April 0, 05 Introduction. Lecture Goal Principal components analysis

More information

EEC 686/785 Modeling & Performance Evaluation of Computer Systems. Lecture 19

EEC 686/785 Modeling & Performance Evaluation of Computer Systems. Lecture 19 EEC 686/785 Modeling & Performance Evaluation of Computer Systems Lecture 19 Department of Electrical and Computer Engineering Cleveland State University wenbing@ieee.org (based on Dr. Raj Jain s lecture

More information

An Algorithmist s Toolkit Nov. 10, Lecture 17

An Algorithmist s Toolkit Nov. 10, Lecture 17 8.409 An Algorithmist s Toolkit Nov. 0, 009 Lecturer: Jonathan Kelner Lecture 7 Johnson-Lindenstrauss Theorem. Recap We first recap a theorem (isoperimetric inequality) and a lemma (concentration) from

More information