Interaction Analysis of Spatial Point Patterns
Geog 2C Introduction to Spatial Data Analysis
Phaedon C Kyriakidis, www.geog.ucsb.edu/ phaedon
Department of Geography, University of California Santa Barbara, Santa Barbara, CA 936-
phaedon@geog.ucsb.edu
Spring Quarter 9

Spatial Point Patterns

Definition: Set of point locations with recorded "events" within a study region, e.g., locations of trees, disease or crime incidents.

[Figure: two example patterns, clustered events in a study region and random events in a study region]

- point locations could correspond to all possible events or to subsets of them (mapped versus sampled point pattern)
- attribute values could also have been measured at event locations, e.g., tree diameter (marked point pattern); not considered in this handout

Objective of this handout: Introduce statistical tools for quantifying spatial interaction of events, e.g., clustering versus randomness or regularity.

Ph Kyriakidis (UCSB), Geog 2C, Spring 9
Outline

- Concepts & Notation
- Distance & Distance Matrices
- Distances Involved in Spatial Point Patterns
- Quantifying Spatial Interaction: G Function
- Quantifying Spatial Interaction: F Function
- Quantifying Spatial Interaction: K Function
- Points To Remember

Concepts & Notation: Some Notation

Point events: Set of N locations of events occurring in a study area: $\{\mathbf{u}_i, i = 1, \ldots, N\}$, $\mathbf{u}_i \in D \subset \mathbb{R}^K$

- $\mathbf{u}_i$ = coordinate vector of the i-th event location, e.g., in 2D $\mathbf{u}_i = \{x_i, y_i\}$
- $\in$ = belongs to
- $D$ = study domain, a subset of a K-dimensional space $\mathbb{R}^K$

Variable of interest: $y(s)$ = number of events (a count) within an arbitrary domain or support $s$ with measure (length, area, volume) $|s|$; support $s$ is centered at an arbitrary location $\mathbf{u}$ and can also be denoted as $s(\mathbf{u})$; in statistics, $y(s)$ is treated as a realization of a random variable (RV) $Y(s)$.

Objective: Quantify interaction, e.g., covariation, between outcomes of any two RVs $Y(s)$ and $Y(s')$. To do so, all RVs must "lie in the same environment"; in other words, the long-term average (expectation) of RV $Y(s)$ should be similar to that of $Y(s')$.
Concepts & Notation: Intensity of Events

Local intensity $\lambda(\mathbf{u})$: Mean number of events per unit area at an arbitrary location or point $\mathbf{u}$, formally defined as:

$$\lambda(\mathbf{u}) = \lim_{|s| \to 0} \left\{ \frac{E\{Y(s)\}}{|s|} \right\}, \quad \mathbf{u} \in D$$

where $E\{Y(s)\}$ denotes the expectation (mean) of RV $Y(s)$ within region $s(\mathbf{u})$ centered at $\mathbf{u}$, and $|s|$ is the area of that region.

Overall intensity $\lambda$: Estimated as $\hat{\lambda} = N / |D|$, where $N$ is the number of events and $|D|$ = measure (area) of study region $D$.

First-order stationarity: Any RV $Y(s)$ should have the same long-term average, for a fixed areal unit $s$. This implies a constant intensity, $\lambda(\mathbf{u}) = \lambda, \ \forall \mathbf{u} \in D$, and the expected number of events within a region $s$ is just a function of $|s|$:

$$E\{Y(s)\} = \lambda |s|, \quad \forall s \subset D$$

Concepts & Notation: Interaction Between Count RVs

Second-order intensity: Long-term average (expectation) of products of counts per unit area at any two arbitrary points $\mathbf{u}$ and $\mathbf{u}'$, formally defined as:

$$\sigma(\mathbf{u}, \mathbf{u}') = \lim_{|s|, |s'| \to 0} \left\{ \frac{E\{Y(s)\,Y(s')\}}{|s|\,|s'|} \right\}, \quad \mathbf{u}, \mathbf{u}' \in D$$

Note that $E\{Y(s)\,Y(s')\}$ is not the same as $E\{Y(s)\} \cdot E\{Y(s')\}$, unless the variables are independent.

Some terminology:

- second-order stationarity: expectation of all RVs is constant (first-order stationarity), and second-order intensity is a function of the separation vector between any two locations $\mathbf{u}$ and $\mathbf{u}'$
- isotropy: only the distance (not the orientation) of the separation vector matters

Outlook: Quantifying interaction in spatial point patterns within the above assumptions or working hypotheses amounts to studying distances between events.
Distance & Distance Matrices: Distance

A measure of proximity (typically along a crow's flight path) between any two locations or spatial entities.

Euclidean distance: Consider two points in a 2D (geographical or other) space with coordinates $\mathbf{u}_i = (x_i, y_i)$ and $\mathbf{u}_j = (x_j, y_j)$. The Euclidean distance $d_{ij}$ between points $\mathbf{u}_i$ and $\mathbf{u}_j$ is computed via Pythagoras's theorem as:

$$d_{ij} = d(\mathbf{u}_i, \mathbf{u}_j) = \|\mathbf{u}_i - \mathbf{u}_j\| = \sqrt{(x_i - x_j)^2 + (y_i - y_j)^2}$$

- $\|\mathbf{u}_i - \mathbf{u}_j\|$ is called the 2-norm of vector $\mathbf{h}_{ij} = \mathbf{u}_i - \mathbf{u}_j$
- locations $\mathbf{u}_j$ and $\mathbf{u}_i$ are called, respectively, the tail and head of vector $\mathbf{h}_{ij}$

[Figure: points $\mathbf{u}_i$ and $\mathbf{u}_j$ in the x-y plane; $d_{ij}$ is the hypotenuse of the right triangle with legs $|x_i - x_j|$ and $|y_i - y_j|$]

Distance & Distance Matrices: Distance Metric

Formal characteristics of a distance metric: A measure $d_{ij}$ of proximity between locations $\mathbf{u}_i$ and $\mathbf{u}_j$ is a valid distance metric if it satisfies the following requirements:

- the distance between a point and itself is always zero: $d_{ii} = 0$
- the distance between a point and a different one is always positive: $d_{ij} > 0$ for $i \neq j$
- the distance between two points is the same no matter which point you consider first: $d_{ij} = d_{ji}$
- the triangle inequality holds, i.e., the sum of the lengths of two sides of a triangle cannot be smaller than the length of the third side: $d_{ij} \leq d_{il} + d_{lj}$

A proximity measure $d_{ij}$ need not always be Euclidean, hence it should be checked to ensure that it is a valid distance metric.
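The four metric requirements above can be checked numerically on a finite set of points. The following is a minimal sketch, not part of the original handout; the helper names `euclidean` and `is_valid_metric` and the asymmetric `one_way` measure are hypothetical and chosen only for illustration. Passing the check on a sample of points does not prove that a measure is a metric in general; failing it does show that it is not.

```python
import math
from itertools import product

def euclidean(u, v):
    """Euclidean distance between two 2D points u = (x, y) and v = (x, y)."""
    return math.hypot(u[0] - v[0], u[1] - v[1])

def is_valid_metric(points, dist, tol=1e-12):
    """Check the four metric requirements on a finite set of distinct points."""
    n = len(points)
    d = [[dist(points[i], points[j]) for j in range(n)] for i in range(n)]
    for i in range(n):
        if abs(d[i][i]) > tol:              # d_ii = 0
            return False
    for i, j in product(range(n), repeat=2):
        if i != j and d[i][j] <= 0:         # d_ij > 0 (assumes distinct points)
            return False
        if abs(d[i][j] - d[j][i]) > tol:    # d_ij = d_ji
            return False
    for i, j, l in product(range(n), repeat=3):
        if d[i][j] > d[i][l] + d[l][j] + tol:   # triangle inequality
            return False
    return True

# Hypothetical asymmetric "travel time": moving towards larger x costs double.
def one_way(u, v):
    return euclidean(u, v) * (2.0 if v[0] > u[0] else 1.0)

pts = [(0.0, 0.0), (3.0, 4.0), (6.0, 0.0)]
print(is_valid_metric(pts, euclidean))  # Euclidean distance passes
print(is_valid_metric(pts, one_way))    # asymmetric measure fails symmetry
```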
Distance & Distance Matrices: Non-Euclidean Distances

Alternative distance measures: (i) over a road or railway, (ii) along a river, (iii) over a network.

[Figure: five locations $\mathbf{u}_1, \ldots, \mathbf{u}_5$; Euclidean distance between locations versus network distance between locations]

Even more exotic distance measures: (i) travel time over a network, (ii) perceived travel time between urban landmarks, (iii) volume of exports/imports.

- Euclidean distances between network nodes $\neq$ actual or perceived distances on the network
- the latter might not even be formal distance metrics, i.e., $d_{ij} \neq d_{ji}$

Distance & Distance Matrices: Minkowski's Generalized Distance

Definition: Consider two points in a K-dimensional (geographical or other) space $\mathbb{R}^K$ with coordinate vectors $\mathbf{u}_i = [u_{i1}, \ldots, u_{ik}, \ldots, u_{iK}]'$ and $\mathbf{u}_j = [u_{j1}, \ldots, u_{jk}, \ldots, u_{jK}]'$. The Minkowski distance of order $p$ (with $p > 0$), denoted as $d^{(p)}_{ij}$, between points $\mathbf{u}_i$ and $\mathbf{u}_j$ is computed as:

$$d^{(p)}_{ij} = \left( \sum_{k=1}^{K} |u_{ik} - u_{jk}|^p \right)^{1/p}$$

Particular cases:

- Manhattan or city-block distance: $d^{(1)}_{ij} = \sum_{k=1}^{K} |u_{ik} - u_{jk}|$
- Euclidean distance: $d^{(2)}_{ij} = \sqrt{\sum_{k=1}^{K} |u_{ik} - u_{jk}|^2}$
- infinity norm or Chebyshev distance, as $p \to \infty$: $d^{(\infty)}_{ij} = \max(|u_{i1} - u_{j1}|, \ldots, |u_{ik} - u_{jk}|, \ldots, |u_{iK} - u_{jK}|)$

Distances computed from points in multidimensional spaces are routinely used in statistical pattern recognition; points represent objects or cases, each described by K attribute values.
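The Minkowski distance and its three particular cases can be sketched as below. This is an illustrative example rather than the handout's own code; the function name `minkowski` is an assumption, and the Chebyshev case is handled explicitly as the limit $p \to \infty$.

```python
import math

def minkowski(u, v, p):
    """Minkowski distance of order p between two K-dimensional points.

    p = 1 gives the Manhattan distance, p = 2 the Euclidean distance,
    and p = math.inf the Chebyshev (infinity-norm) distance.
    """
    if len(u) != len(v):
        raise ValueError("points must have the same number of coordinates")
    diffs = [abs(a - b) for a, b in zip(u, v)]
    if p == math.inf:
        return max(diffs)               # limit of the formula as p grows
    if p <= 0:
        raise ValueError("order p must be positive")
    return sum(x ** p for x in diffs) ** (1.0 / p)

u_i, u_j = (0.0, 0.0), (3.0, 4.0)
for order in (1, 2, math.inf):
    print(order, minkowski(u_i, u_j, order))
```

For the two points used here the coordinate differences are 3 and 4, so the Manhattan distance is their sum, the Euclidean distance is the hypotenuse, and the Chebyshev distance is the larger of the two.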
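Distance & Distance Matrices: Euclidean Distance Matrix, Single Set of Points

Definition: Consider a set of N points $\{\mathbf{u}_1, \ldots, \mathbf{u}_i, \ldots, \mathbf{u}_N\}$ in a K-dimensional (geographical or other) space. The distance matrix $\mathbf{D}$ is a square $(N \times N)$ matrix containing the distances $\{d(\mathbf{u}_i, \mathbf{u}_j), i = 1, \ldots, N, j = 1, \ldots, N\}$ between all $N \times N$ possible pairs of points in the set.

$$\begin{array}{c|ccccc}
\mathbf{u}_i & \mathbf{u}_1 & \mathbf{u}_2 & \mathbf{u}_3 & \mathbf{u}_4 & \mathbf{u}_5 \\ \hline
x_i & x_1 & x_2 & x_3 & x_4 & x_5 \\
y_i & y_1 & y_2 & y_3 & y_4 & y_5
\end{array}$$

By convention, $\mathbf{u}_1$ is the coordinate vector of the 1st point in the set (1st entry in the data file).

$$\mathbf{D} =
\begin{bmatrix}
d_{11} & d_{12} & d_{13} & d_{14} & d_{15} \\
d_{21} & d_{22} & d_{23} & d_{24} & d_{25} \\
d_{31} & d_{32} & d_{33} & d_{34} & d_{35} \\
d_{41} & d_{42} & d_{43} & d_{44} & d_{45} \\
d_{51} & d_{52} & d_{53} & d_{54} & d_{55}
\end{bmatrix}
=
\begin{bmatrix}
0 & d_{12} & d_{13} & d_{14} & d_{15} \\
d_{12} & 0 & d_{23} & d_{24} & d_{25} \\
d_{13} & d_{23} & 0 & d_{34} & d_{35} \\
d_{14} & d_{24} & d_{34} & 0 & d_{45} \\
d_{15} & d_{25} & d_{35} & d_{45} & 0
\end{bmatrix}
= [d_{ij}]$$

- the i-th row (or column) contains distances between the i-th point $\mathbf{u}_i$ and all others (including itself)
- $\mathbf{D}$ is symmetric with zeros along its diagonal

Distance & Distance Matrices: Euclidean Distance Matrix, Two Sets of Points

Definition: Consider 2 sets of points $\{\mathbf{u}_1, \ldots, \mathbf{u}_i, \ldots, \mathbf{u}_N\}$ and $\{\mathbf{t}_1, \ldots, \mathbf{t}_j, \ldots, \mathbf{t}_M\}$ in a K-dimensional (geographical or other) space. The distance matrix $\mathbf{D}$ is an $(N \times M)$ matrix containing the Euclidean distances $\{d(\mathbf{u}_i, \mathbf{t}_j), i = 1, \ldots, N, j = 1, \ldots, M\}$ between all $N \times M$ possible pairs formed by these two sets of points.

$$\begin{array}{c|ccccc}
\mathbf{u}_i & \mathbf{u}_1 & \mathbf{u}_2 & \mathbf{u}_3 & \mathbf{u}_4 & \mathbf{u}_5 \\ \hline
x_i & x_1 & x_2 & x_3 & x_4 & x_5 \\
y_i & y_1 & y_2 & y_3 & y_4 & y_5
\end{array}
\qquad
\begin{array}{c|ccccccc}
\mathbf{t}_j & \mathbf{t}_1 & \mathbf{t}_2 & \mathbf{t}_3 & \mathbf{t}_4 & \mathbf{t}_5 & \mathbf{t}_6 & \mathbf{t}_7 \\ \hline
\tilde{x}_j & \tilde{x}_1 & \tilde{x}_2 & \tilde{x}_3 & \tilde{x}_4 & \tilde{x}_5 & \tilde{x}_6 & \tilde{x}_7 \\
\tilde{y}_j & \tilde{y}_1 & \tilde{y}_2 & \tilde{y}_3 & \tilde{y}_4 & \tilde{y}_5 & \tilde{y}_6 & \tilde{y}_7
\end{array}$$

By convention, $\mathbf{u}_1$ is the coordinate vector of the 1st datum in data set #1, and similarly for $\mathbf{t}_1$.

$$\mathbf{D} =
\begin{bmatrix}
d_{11} & d_{12} & d_{13} & d_{14} & d_{15} & d_{16} & d_{17} \\
d_{21} & d_{22} & d_{23} & d_{24} & d_{25} & d_{26} & d_{27} \\
d_{31} & d_{32} & d_{33} & d_{34} & d_{35} & d_{36} & d_{37} \\
d_{41} & d_{42} & d_{43} & d_{44} & d_{45} & d_{46} & d_{47} \\
d_{51} & d_{52} & d_{53} & d_{54} & d_{55} & d_{56} & d_{57}
\end{bmatrix}
= [d_{ij}]$$

- the i-th row contains distances between the i-th point $\mathbf{u}_i$ in set #1 and all points in set #2
- the j-th column contains distances between the j-th point $\mathbf{t}_j$ in set #2 and all points in set #1
- $\mathbf{D}$ is not symmetric, i.e., $d_{12} \neq d_{21}$: pair $\{\mathbf{u}_1, \mathbf{t}_2\}$ is not the same as pair $\{\mathbf{u}_2, \mathbf{t}_1\}$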
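Both kinds of distance matrix can be built with one small routine. The sketch below is only an illustration under the definitions above; the function name `distance_matrix` and the sample coordinates are made up for the example, and `math.dist` requires Python 3.8 or later.

```python
import math

def distance_matrix(us, ts=None):
    """Euclidean distance matrix as a list of rows.

    With one set of points the result is square (N x N), symmetric,
    with zeros on the diagonal. With two sets it is N x M and in
    general not symmetric.
    """
    if ts is None:
        ts = us
    return [[math.dist(u, t) for t in ts] for u in us]

# Hypothetical coordinates, for illustration only.
events = [(0.0, 0.0), (3.0, 4.0)]
points = [(0.0, 0.0), (0.0, 1.0), (3.0, 0.0)]

D_single = distance_matrix(events)        # 2 x 2
D_two = distance_matrix(events, points)   # 2 x 3
for row in D_two:
    print(row)
```

Row i of `D_two` holds the distances from event i to every point of the second set, matching the row/column reading described above.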
Distances Involved in Spatial Point Patterns: Distances Between Events in A Point Pattern

Event-to-event distance: Distance $d_{ij}$ between an event at location $\mathbf{u}_i$ and another event at location $\mathbf{u}_j$:

$$d_{ij} = \sqrt{(x_i - x_j)^2 + (y_i - y_j)^2}$$

Point-to-event distance: Distance $\tilde{d}_{pj}$ between a randomly chosen point at location $\mathbf{t}_p$ and an event at location $\mathbf{u}_j$:

$$\tilde{d}_{pj} = \sqrt{(\tilde{x}_p - x_j)^2 + (\tilde{y}_p - y_j)^2}$$

Event-to-nearest-event distance: Distance $d_{min}(\mathbf{u}_i)$ between an event at location $\mathbf{u}_i$ and its nearest neighbor event:

$$d_{min}(\mathbf{u}_i) = \min\{d_{ij}, \ j = 1, \ldots, N, \ j \neq i\}$$

Point-to-nearest-event distance: Distance $d_{min}(\mathbf{t}_p)$ between a randomly chosen point at location $\mathbf{t}_p$ and its nearest neighbor event:

$$d_{min}(\mathbf{t}_p) = \min\{\tilde{d}_{pj}, \ j = 1, \ldots, N\}$$

Distances Involved in Spatial Point Patterns: Event-to-Nearest-Event Distances

[Figure: pattern with N=5 events $\mathbf{u}_1, \ldots, \mathbf{u}_5$ and its distance matrix, e.g., 598 = $d_{min}(\mathbf{u}_1)$, 762 = $d_{min}(\mathbf{u}_2)$]

Some events might be nearest neighbors of each other, e.g., $\mathbf{u}_4$ and $\mathbf{u}_5$, or share the same nearest neighbor, e.g., $\mathbf{u}_2$, $\mathbf{u}_3$ and $\mathbf{u}_4$ all have $\mathbf{u}_5$ as their nearest neighbor.

Mean nearest neighbor distance: Average of all $d_{min}(\mathbf{u}_i)$ values:

$$\bar{d}_{min} = \frac{1}{N} \sum_{i=1}^{N} d_{min}(\mathbf{u}_i)$$

Drawback: a single number does not suffice to describe a point pattern.
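The nearest-neighbor distances defined above can be computed directly from their definitions. The following sketch uses made-up coordinates (not the N=5 pattern of the handout) and hypothetical function names; it loops over all pairs, which is fine for small N but slow for large patterns.

```python
import math

def event_to_nearest_event(events):
    """d_min(u_i) for every event: distance to its nearest *other* event.

    Requires at least two events.
    """
    return [
        min(math.dist(u, v) for j, v in enumerate(events) if j != i)
        for i, u in enumerate(events)
    ]

def point_to_nearest_event(point, events):
    """d_min(t_p): distance from an arbitrary point to its nearest event."""
    return min(math.dist(point, v) for v in events)

# Hypothetical pattern with N = 3 events.
events = [(0.0, 0.0), (1.0, 0.0), (0.0, 2.0)]

d_min = event_to_nearest_event(events)
mean_d_min = sum(d_min) / len(d_min)   # mean nearest neighbor distance
print(d_min, mean_d_min)
print(point_to_nearest_event((0.5, 0.0), events))
```

In this small pattern the first two events are each other's nearest neighbor, and the third event has the first as its nearest neighbor, which mirrors the remark above about mutual and shared nearest neighbors.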
Quantifying Spatial Interaction: The G Function

Definition: Proportion of event-to-nearest-event distances $d_{min}(\mathbf{u}_i)$ no greater than a given distance cutoff $d$, estimated as:

$$\hat{G}(d) = \frac{\#\{d_{min}(\mathbf{u}_i) \leq d, \ i = 1, \ldots, N\}}{N}$$

Cumulative distribution function (CDF) of all N event-to-nearest-event distances; instead of computing the average $\bar{d}_{min}$ of the $d_{min}$ values, compute their CDF.

[Figure, for the point pattern on the previous page: sample histogram of event nearest neighbor distances; sample G function $\hat{G}(d)$ versus event-to-nearest-neighbor distance $d$]

For a larger number of events N, $\hat{G}(d)$ becomes smoother.

Quantifying Spatial Interaction: Event-to-Nearest-Event (E2NE) Distance Histograms

[Figure: random stratified events in a study region and clustered events in a study region; histogram of E2NE distances (evenly spaced events) and histogram of E2NE distances (clustered events), each versus event-to-nearest-neighbor distance $d$]

- for evenly-spaced events, more E2NE distances are similar to the spacing of events
- for clustered events, there are more small E2NE distances and fewer large such distances
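As a rough illustration of the estimator $\hat{G}(d)$, the sketch below counts the event-to-nearest-event distances that do not exceed a cutoff. The function name `g_hat` and the coordinates are assumptions for the example, and no edge correction is applied.

```python
import math

def g_hat(events, d):
    """Sample G function: proportion of events whose nearest other
    event lies within distance d (no edge correction)."""
    d_min = [
        min(math.dist(u, v) for j, v in enumerate(events) if j != i)
        for i, u in enumerate(events)
    ]
    return sum(1 for x in d_min if x <= d) / len(d_min)

# Hypothetical pattern; its nearest neighbor distances are 1, 1 and 2.
events = [(0.0, 0.0), (1.0, 0.0), (0.0, 2.0)]

for cutoff in (0.5, 1.0, 2.0):
    print(cutoff, g_hat(events, cutoff))
```

Evaluating `g_hat` on a grid of cutoffs traces out the step-shaped CDF; with only three events the steps are coarse, which is the reason the curve becomes smoother for larger N.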
Quantifying Spatial Interaction: Sample G Function Examples

[Figure: random stratified events in a study region and clustered events in a study region; sample G function (evenly spaced events) and sample G function (clustered events), $\hat{G}(d)$ versus event-to-nearest-neighbor distance $d$]

- for evenly-spaced events, $\hat{G}(d)$ rises gradually up to the distance at which most events are spaced, and then increases rapidly
- for clustered events, $\hat{G}(d)$ rises rapidly at short distances, and then levels off at larger d-values

Quantifying Spatial Interaction: The F Function

Definition: Proportion of point-to-nearest-event distances $d_{min}(\mathbf{t}_j)$ no greater than a given distance cutoff $d$, estimated as:

$$\hat{F}(d) = \frac{\#\{d_{min}(\mathbf{t}_j) \leq d, \ j = 1, \ldots, M\}}{M}$$

Cumulative distribution function (CDF) of all M point-to-nearest-event distances.

[Figure: pattern with N=5 events and M random points; sample F function $\hat{F}(d)$ versus point-to-nearest-neighbor distance $d$]

For a larger number M of random points, $\hat{F}(d)$ becomes even smoother.

Note: The F function provides information on event proximity to voids.
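A minimal sketch of $\hat{F}(d)$ follows. It assumes a rectangular study region and uniformly distributed random points, neither of which is specified in the handout; the names `f_hat` and `random_points` are hypothetical, and no edge correction is applied.

```python
import math
import random

def f_hat(events, points, d):
    """Sample F function: proportion of the M points whose nearest
    event lies within distance d (no edge correction)."""
    d_min = [min(math.dist(t, u) for u in events) for t in points]
    return sum(1 for x in d_min if x <= d) / len(d_min)

def random_points(m, xmin, xmax, ymin, ymax, seed=0):
    """M uniformly random points in a rectangle (assumed region shape)."""
    rng = random.Random(seed)
    return [(rng.uniform(xmin, xmax), rng.uniform(ymin, ymax)) for _ in range(m)]

# Hand-checkable case: one event, three points at distances 1, 2 and 5.
one_event = [(0.0, 0.0)]
fixed_points = [(1.0, 0.0), (0.0, 2.0), (3.0, 4.0)]
print([f_hat(one_event, fixed_points, d) for d in (1.0, 2.0, 5.0)])

# Larger case with random points in an assumed 10 x 10 region.
events = [(2.0, 2.0), (2.5, 2.0), (8.0, 7.0)]
pts = random_points(200, 0.0, 10.0, 0.0, 10.0, seed=42)
curve = [f_hat(events, pts, d) for d in range(0, 16)]
print(curve)
```

Increasing the number of random points M refines the steps of the curve, but it does not add information about the events themselves; only the sampling of the voids becomes denser.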
Quantifying Spatial Interaction: Point-to-Nearest-Event (P2NE) Distance Histograms

[Figure: random stratified events in a study region and clustered events in a study region; histogram of P2NE distances (evenly spaced events) and histogram of P2NE distances (clustered events), each versus point-to-nearest-neighbor distance $d$]

- for evenly-spaced events, there are more nearest events at small distances from randomly placed points
- for clustered events, P2NE distances are generally larger than in the previous case, and there are a few large such distances

Quantifying Spatial Interaction: Sample F Function Examples

[Figure: random stratified events in a study region and clustered events in a study region; sample F function (evenly spaced events) and sample F function (clustered events), $\hat{F}(d)$ versus point-to-nearest-neighbor distance $d$]

- for evenly-spaced events, $\hat{F}(d)$ rises rapidly up to the distance at which most events are spaced, and then levels off (more nearest neighbors at small distances from randomly placed points)
- for clustered events, $\hat{F}(d)$ rises rapidly at short distances, and then levels off at larger d-values
Quantifying Spatial Interaction: Comparing Sample G and F Functions

[Figure: random stratified events in a study region and clustered events in a study region; sample G and F functions (evenly spaced events) and sample G and F functions (clustered events), proportion versus distance $d$, with curves $\hat{G}(d)$ and $\hat{F}(d)$]

- for evenly-spaced events, there is more open space (smaller point-to-event distances), hence $\hat{F}(d)$ rises faster than $\hat{G}(d)$
- for clustered events, the reverse is true

Quantifying Spatial Interaction: The Sample K Function

Concept building:

1. construct a set of concentric circles (of increasing radius $d$) around each event
2. count the number of events in each distance band
3. the cumulative number of events up to radius $d$ around all events = sample K function $\hat{K}(d)$

[Figure: example of K function estimation, with events labeled $\mathbf{u}_1$, $\mathbf{u}_2$, $\mathbf{u}_3$; counts of 6, 3 and 4 events within distance h=6 units from three of the events, the count of 3 being at location $\mathbf{u}_2$]

Formal definition:

$$K(d) = \frac{1}{\lambda} E\{\#\text{ of events within distance } d \text{ of any arbitrary event}\}$$

where, under first-order stationarity, $\lambda |D| = E\{\#\text{ of events within study domain}\}$. It is estimated as:

$$\hat{K}(d) = \frac{1}{\hat{\lambda} N} \#\{d_{ij} \leq d, \ i = 1, \ldots, N, \ j (\neq i) = 1, \ldots, N\}$$
Quantifying Spatial Interaction: Interpreting The Sample K Function

Re-expressing, with $\hat{\lambda} = N / |D|$:

$$\hat{K}(d) = \frac{1}{\hat{\lambda} N} \#\{d_{ij} \leq d, \ i = 1, \ldots, N, \ j (\neq i) = 1, \ldots, N\} = \frac{|D|}{N} \frac{1}{N} \#\{d_{ij} \leq d, \ i = 1, \ldots, N, \ j (\neq i) = 1, \ldots, N\} \approx |D| \cdot (\text{proportion of event-to-event distances} \leq d)$$

In other words: Function $\hat{K}(d)$ is the sample cumulative distribution function (CDF) of all $N^2 - N$ event-to-event distances, scaled by $|D|$. The scaling is exact up to a factor $(N-1)/N$, because the proportion is taken over $N^2 - N$ pairs while the estimator divides by $N^2$.

[Figure: pattern with N=5 events $\mathbf{u}_1, \ldots, \mathbf{u}_5$; sample histogram of event-to-event distances; sample K function scaled by area, $\hat{K}(d)/A$, versus event-to-event distance $d$]

Note: Ignore the bin at $d = 0$ (center plot) and the point at $d = 0$ (right plot).

Quantifying Spatial Interaction: Event-to-Event Distance Histograms

[Figure: random stratified events in a study region and clustered events in a study region; histogram of event-to-event distances (evenly spaced) and histogram of event-to-event distances (clustered)]

- for evenly-spaced events, there are more medium-sized E2E distances than small or large such distances
- for clustered events, the distribution of E2E distances is multi-modal
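The estimator $\hat{K}(d) = \frac{1}{\hat{\lambda} N} \#\{d_{ij} \leq d\}$ can be sketched as follows. The function name `k_hat`, the coordinates, and the study-region area are assumptions made for the example, and the sketch applies no edge correction, so counts near the boundary are biased downwards.

```python
import math

def k_hat(events, d, area):
    """Sample K function without edge correction.

    Counts ordered pairs (i, j), i != j, with d_ij <= d and divides
    by lambda_hat * N, where lambda_hat = N / area.
    """
    n = len(events)
    count = sum(
        1
        for i, u in enumerate(events)
        for j, v in enumerate(events)
        if i != j and math.dist(u, v) <= d
    )
    lam_hat = n / area
    return count / (lam_hat * n)

# Hypothetical pattern in an assumed study region of area 9.
events = [(0.0, 0.0), (1.0, 0.0), (0.0, 2.0)]
area = 9.0

for cutoff in (1.0, 2.0, 3.0):
    print(cutoff, k_hat(events, cutoff, area))
```

Each unordered pair is counted twice (once from each of its events), which matches the double index $i$, $j (\neq i)$ in the definition. Unlike the G function, every pairwise distance contributes, not only the nearest one.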
Quantifying Spatial Interaction: Event-to-Event Distance CDFs

[Figure: random stratified events in a study region and clustered events in a study region; cumulative CDF of event-to-event distances (evenly spaced) and cumulative CDF of event-to-event distances (clustered), each versus event-to-event distance]

For clustered events, there are multiple bumps in the CDF of E2E distances due to the grouping of events in space.

Quantifying Spatial Interaction: Sample K Function Examples

[Figure: random stratified events in a study region and clustered events in a study region; sample K function (evenly spaced events) and sample K function (clustered events), area proportion $\hat{K}(d)$ versus event-to-event distance $d$]

The sample K function $\hat{K}(d)$ is monotonically increasing and is a scaled (by domain measure $|D|$) version of the CDF of E2E distances.
Points To Remember: Recap

Quantifying interaction in spatial point patterns:

- event-to-nearest-event distances: use the sample G function $\hat{G}(d)$
- point-to-nearest-event distances: use the sample F function $\hat{F}(d)$
- event-to-event distances: use the sample K function $\hat{K}(d)$
- the K function looks at information beyond nearest neighbors

Caveats:

- clustering is always a function of the overall intensity of a point pattern
- clustering might occur due to local intensity variations or due to interaction; it is very difficult to disentangle each contribution

Watch out for:

- boundaries and edge effects
- distance distortions due to map projections
- sampled versus mapped point patterns
Review (Probability & Linear Algebra) CE-725 : Statistical Pattern Recognition Sharif University of Technology Spring 2013 M. Soleymani Outline Axioms of probability theory Conditional probability, Joint
More informationPart I. Other datatypes, preprocessing. Other datatypes. Other datatypes. Week 2 Based in part on slides from textbook, slides of Susan Holmes
Week 2 Based in part on slides from textbook, slides of Susan Holmes Part I Other datatypes, preprocessing October 3, 2012 1 / 1 2 / 1 Other datatypes Other datatypes Document data You might start with
More informationLecture 2: Linear Algebra Review
EE 227A: Convex Optimization and Applications January 19 Lecture 2: Linear Algebra Review Lecturer: Mert Pilanci Reading assignment: Appendix C of BV. Sections 2-6 of the web textbook 1 2.1 Vectors 2.1.1
More informationACO Comprehensive Exam October 14 and 15, 2013
1. Computability, Complexity and Algorithms (a) Let G be the complete graph on n vertices, and let c : V (G) V (G) [0, ) be a symmetric cost function. Consider the following closest point heuristic for
More informationTechnische Universität Dresden Institute of Numerical Mathematics
Technische Universität Dresden Institute of Numerical Mathematics An Improved Flow-based Formulation and Reduction Principles for the Minimum Connectivity Inference Problem Muhammad Abid Dar Andreas Fischer
More informationBasic Concepts in Linear Algebra
Basic Concepts in Linear Algebra Grady B Wright Department of Mathematics Boise State University February 2, 2015 Grady B Wright Linear Algebra Basics February 2, 2015 1 / 39 Numerical Linear Algebra Linear
More informationData dependent operators for the spatial-spectral fusion problem
Data dependent operators for the spatial-spectral fusion problem Wien, December 3, 2012 Joint work with: University of Maryland: J. J. Benedetto, J. A. Dobrosotskaya, T. Doster, K. W. Duke, M. Ehler, A.
More informationMatrices and Vectors
Matrices and Vectors James K. Peterson Department of Biological Sciences and Department of Mathematical Sciences Clemson University November 11, 2013 Outline 1 Matrices and Vectors 2 Vector Details 3 Matrix
More informationGIST 4302/5302: Spatial Analysis and Modeling
GIST 4302/5302: Spatial Analysis and Modeling Basics of Statistics Guofeng Cao www.myweb.ttu.edu/gucao Department of Geosciences Texas Tech University guofeng.cao@ttu.edu Spring 2015 Outline of This Week
More informationMetric-based classifiers. Nuno Vasconcelos UCSD
Metric-based classifiers Nuno Vasconcelos UCSD Statistical learning goal: given a function f. y f and a collection of eample data-points, learn what the function f. is. this is called training. two major
More informationGeometric Constraints II
Geometric Constraints II Realizability, Rigidity and Related theorems. Embeddability of Metric Spaces Section 1 Given the matrix D d i,j 1 i,j n corresponding to a metric space, give conditions under which
More informationCS 246 Review of Linear Algebra 01/17/19
1 Linear algebra In this section we will discuss vectors and matrices. We denote the (i, j)th entry of a matrix A as A ij, and the ith entry of a vector as v i. 1.1 Vectors and vector operations A vector
More informationMax-plus algebra. Max-plus algebra. Monika Molnárová. Technická univerzita Košice. Max-plus algebra.
Technická univerzita Košice monika.molnarova@tuke.sk Outline 1 Digraphs Maximum cycle-mean and transitive closures of a matrix Reducible and irreducible matrices Definite matrices Digraphs Complete digraph
More informationSimilarity and Dissimilarity
1//015 Similarity and Dissimilarity COMP 465 Data Mining Similarity of Data Data Preprocessing Slides Adapted From : Jiawei Han, Micheline Kamber & Jian Pei Data Mining: Concepts and Techniques, 3 rd ed.
More informationNumerical Analysis: Solving Systems of Linear Equations
Numerical Analysis: Solving Systems of Linear Equations Mirko Navara http://cmpfelkcvutcz/ navara/ Center for Machine Perception, Department of Cybernetics, FEE, CTU Karlovo náměstí, building G, office
More informationGROUP THEORY PRIMER. New terms: so(2n), so(2n+1), symplectic algebra sp(2n)
GROUP THEORY PRIMER New terms: so(2n), so(2n+1), symplectic algebra sp(2n) 1. Some examples of semi-simple Lie algebras In the previous chapter, we developed the idea of understanding semi-simple Lie algebras
More informationReview of Basic Concepts in Linear Algebra
Review of Basic Concepts in Linear Algebra Grady B Wright Department of Mathematics Boise State University September 7, 2017 Math 565 Linear Algebra Review September 7, 2017 1 / 40 Numerical Linear Algebra
More informationA fast randomized algorithm for approximating an SVD of a matrix
A fast randomized algorithm for approximating an SVD of a matrix Joint work with Franco Woolfe, Edo Liberty, and Vladimir Rokhlin Mark Tygert Program in Applied Mathematics Yale University Place July 17,
More information2 Notation and Preliminaries
On Asymmetric TSP: Transformation to Symmetric TSP and Performance Bound Ratnesh Kumar Haomin Li epartment of Electrical Engineering University of Kentucky Lexington, KY 40506-0046 Abstract We show that
More informationIn the Name of God. Lectures 15&16: Radial Basis Function Networks
1 In the Name of God Lectures 15&16: Radial Basis Function Networks Some Historical Notes Learning is equivalent to finding a surface in a multidimensional space that provides a best fit to the training
More informationLecture 3: Exploratory Spatial Data Analysis (ESDA) Prof. Eduardo A. Haddad
Lecture 3: Exploratory Spatial Data Analysis (ESDA) Prof. Eduardo A. Haddad Key message Spatial dependence First Law of Geography (Waldo Tobler): Everything is related to everything else, but near things
More informationCS 664 Segmentation (2) Daniel Huttenlocher
CS 664 Segmentation (2) Daniel Huttenlocher Recap Last time covered perceptual organization more broadly, focused in on pixel-wise segmentation Covered local graph-based methods such as MST and Felzenszwalb-Huttenlocher
More informationPoint Pattern Analysis
Point Pattern Analysis Nearest Neighbor Statistics Luc Anselin http://spatial.uchicago.edu principle G function F function J function Principle Terminology events and points event: observed location of
More informationMultivariate Statistics: Hierarchical and k-means cluster analysis
Multivariate Statistics: Hierarchical and k-means cluster analysis Steffen Unkel Department of Medical Statistics University Medical Center Goettingen, Germany Summer term 217 1/43 What is a cluster? Proximity
More informationProbability theory for Networks (Part 1) CS 249B: Science of Networks Week 02: Monday, 02/04/08 Daniel Bilar Wellesley College Spring 2008
Probability theory for Networks (Part 1) CS 249B: Science of Networks Week 02: Monday, 02/04/08 Daniel Bilar Wellesley College Spring 2008 1 Review We saw some basic metrics that helped us characterize
More informationSpectral Graph Theory and You: Matrix Tree Theorem and Centrality Metrics
Spectral Graph Theory and You: and Centrality Metrics Jonathan Gootenberg March 11, 2013 1 / 19 Outline of Topics 1 Motivation Basics of Spectral Graph Theory Understanding the characteristic polynomial
More informationStochastic modelling of epidemic spread
Stochastic modelling of epidemic spread Julien Arino Centre for Research on Inner City Health St Michael s Hospital Toronto On leave from Department of Mathematics University of Manitoba Julien Arino@umanitoba.ca
More informationLecture Notes 1: Vector spaces
Optimization-based data analysis Fall 2017 Lecture Notes 1: Vector spaces In this chapter we review certain basic concepts of linear algebra, highlighting their application to signal processing. 1 Vector
More information13 Spherical geometry
13 Spherical geometry Let ABC be a triangle in the Euclidean plane. From now on, we indicate the interior angles A = CAB, B = ABC, C = BCA at the vertices merely by A, B, C. The sides of length a = BC
More informationPreprocessing & dimensionality reduction
Introduction to Data Mining Preprocessing & dimensionality reduction CPSC/AMTH 445a/545a Guy Wolf guy.wolf@yale.edu Yale University Fall 2016 CPSC 445 (Guy Wolf) Dimensionality reduction Yale - Fall 2016
More information401 Review. 6. Power analysis for one/two-sample hypothesis tests and for correlation analysis.
401 Review Major topics of the course 1. Univariate analysis 2. Bivariate analysis 3. Simple linear regression 4. Linear algebra 5. Multiple regression analysis Major analysis methods 1. Graphical analysis
More informationMATH 117 LECTURE NOTES
MATH 117 LECTURE NOTES XIN ZHOU Abstract. This is the set of lecture notes for Math 117 during Fall quarter of 2017 at UC Santa Barbara. The lectures follow closely the textbook [1]. Contents 1. The set
More informationConfidence Intervals, Testing and ANOVA Summary
Confidence Intervals, Testing and ANOVA Summary 1 One Sample Tests 1.1 One Sample z test: Mean (σ known) Let X 1,, X n a r.s. from N(µ, σ) or n > 30. Let The test statistic is H 0 : µ = µ 0. z = x µ 0
More informationTail Inequalities. The Chernoff bound works for random variables that are a sum of indicator variables with the same distribution (Bernoulli trials).
Tail Inequalities William Hunt Lane Department of Computer Science and Electrical Engineering, West Virginia University, Morgantown, WV William.Hunt@mail.wvu.edu Introduction In this chapter, we are interested
More informationMeasurement and Data
Measurement and Data Data describes the real world Data maps entities in the domain of interest to symbolic representation by means of a measurement procedure Numerical relationships between variables
More information3. Review of Probability and Statistics
3. Review of Probability and Statistics ECE 830, Spring 2014 Probabilistic models will be used throughout the course to represent noise, errors, and uncertainty in signal processing problems. This lecture
More informationEE263 Review Session 1
EE263 Review Session 1 October 5, 2018 0.1 Importing Variables from a MALAB.m file If you are importing variables given in file vars.m, use the following code at the beginning of your script. close a l
More informationUnconstrained Ordination
Unconstrained Ordination Sites Species A Species B Species C Species D Species E 1 0 (1) 5 (1) 1 (1) 10 (4) 10 (4) 2 2 (3) 8 (3) 4 (3) 12 (6) 20 (6) 3 8 (6) 20 (6) 10 (6) 1 (2) 3 (2) 4 4 (5) 11 (5) 8 (5)
More informationNorm and Distance. Stephen Boyd. EE103 Stanford University. September 27, 2017
Norm and Distance Stephen Boyd EE103 Stanford University September 27, 2017 Outline Norm and distance Distance Standard deviation Angle Norm and distance 2 Norm the Euclidean norm (or just norm) of an
More informationproximity similarity dissimilarity distance Proximity Measures:
Similarity Measures Similarity and dissimilarity are important because they are used by a number of data mining techniques, such as clustering nearest neighbor classification and anomaly detection. The
More informationThe Solution of Linear Systems AX = B
Chapter 2 The Solution of Linear Systems AX = B 21 Upper-triangular Linear Systems We will now develop the back-substitution algorithm, which is useful for solving a linear system of equations that has
More informationCS168: The Modern Algorithmic Toolbox Lecture #7: Understanding Principal Component Analysis (PCA)
CS68: The Modern Algorithmic Toolbox Lecture #7: Understanding Principal Component Analysis (PCA) Tim Roughgarden & Gregory Valiant April 0, 05 Introduction. Lecture Goal Principal components analysis
More informationEEC 686/785 Modeling & Performance Evaluation of Computer Systems. Lecture 19
EEC 686/785 Modeling & Performance Evaluation of Computer Systems Lecture 19 Department of Electrical and Computer Engineering Cleveland State University wenbing@ieee.org (based on Dr. Raj Jain s lecture
More informationAn Algorithmist s Toolkit Nov. 10, Lecture 17
8.409 An Algorithmist s Toolkit Nov. 0, 009 Lecturer: Jonathan Kelner Lecture 7 Johnson-Lindenstrauss Theorem. Recap We first recap a theorem (isoperimetric inequality) and a lemma (concentration) from
More information