Lecture 18. Models for areal data. Colin Rundel 03/22/2017

Size: px
Start display at page:

Download "Lecture 18. Models for areal data. Colin Rundel 03/22/2017"

Transcription

1 Lecture 18 Models for areal data Colin Rundel 03/22/2017 1

2 areal / lattice data 2

3 Example - NC SIDS SID79 3

4 EDA - Moran s I If we have observations at n spatial locations s 1,... s n ) I = n i=1 n n j=1 w ij n i=1 n j=1 w ij ysi ) ȳ ) ys j ) ȳ ) n i=1 ysi ) ȳ ) where w is a spatial weights matrix. 4

5 EDA - Moran s I If we have observations at n spatial locations s 1,... s n ) I = n i=1 n n j=1 w ij n i=1 n j=1 w ij ysi ) ȳ ) ys j ) ȳ ) n i=1 ysi ) ȳ ) where w is a spatial weights matrix. Some properties of Moran s I when there is no spatial autocorrelation): EI) = 1/n 1) VarI) = EI 2 ) EI) 2 = Something ugly but closed form Asymptotically, I EI) N 0, 1) VarI) 4

6 NC SIDS & Moran s I Lets start by using an adjacency matrix for w shared county borders). morans_i = functiony, w) { n = lengthy) y_bar = meany) num = sumw * y-y_bar) %*% ty-y_bar)) denom = sum y-y_bar)^2 ) n/sumw)) * num/denom) } morans_iy = nc$sid74, w = 1*st_touchesnc, sparse=false)) ## [1] libraryape) Moran.Inc$SID74, weight = 1*st_touchesnc, sparse=false)) %>% str) ## List of 4 ## $ observed: num ## $ expected: num ## $ sd : num ## $ p.value : num

7 EDA - Geary s C Like Moran s I, if we have observations at n spatial locations s 1,... s n ) C = n 1 2 n i=1 n j=1 w ij n i=1 n j=1 w ij ysi ) ys j ) ) 2 n i=1 ysi ) ȳ ) where w is a spatial weights matrix. 6

8 EDA - Geary s C Like Moran s I, if we have observations at n spatial locations s 1,... s n ) C = n 1 2 n i=1 n j=1 w ij n i=1 n j=1 w ij ysi ) ys j ) ) 2 n i=1 ysi ) ȳ ) where w is a spatial weights matrix. Some properties of Geary s C: 0 < C < 2 If C 1 then no spatial autocorrelation If C > 1 then negative spatial autocorrelation If C < 1 then positive spatial autocorrelation Geary s C is inversely related to Moran s I 6

9 NC SIDS & Geary s C Again using an adjacency matrix for w shared county borders). gearys_c = functiony, w) { n = lengthy) y_bar = meany) y_i = y %*% trep1,n)) y_j = ty_i) num = sumw * y_i-y_j)^2) denom = sum y-y_bar)^2 ) n-1)/2*sumw))) * num/denom) } gearys_cy = nc$sid74, w = 1*st_touchesnc, sparse=false)) ## [1]

10 Spatial Correlogram d = nc %>% st_centroid) %>% st_distance) %>% strip_class) breaks = seq0, maxd), length.out = 21) d_cut = cutd, breaks) adj_mats = map levelsd_cut), functionl) { d_cut == l) %>% matrixncol=100) %>% diag<- 0) } ) d = data_frame dist = breaks[-1], morans = map_dbladj_mats, morans_i, y = nc$sid74), gearys = map_dbladj_mats, gearys_c, y = nc$sid74) ) 8

11 0.6 morans gearys value 0.2 var morans gearys e+05 4e+05 6e+05 2e+05 4e+05 6e+05 dist 9

12 1.0 gearys morans 10

13 Autoregressive Models 11

14 AR Models - Time Lets just focus on the simplest case, an AR1) process y t = δ + ϕ y t 1 + w t where w t N 0, σ 2 ) and ϕ < 1, then Ey t ) = Vary t ) = δ 1 ϕ σ2 1 ϕ 12

15 AR Models - Time - Joint Distribution Previously we saw that an AR1) model can be represented using a multivariate normal distribution y 1 1 y 2 N 1 δ 1 ϕ.,. y n 1 σ2 1 ϕ 1 ϕ ϕ n 1 ϕ 1 ϕ n ϕ n 1 ϕ n

16 AR Models - Time - Joint Distribution Previously we saw that an AR1) model can be represented using a multivariate normal distribution y 1 1 y 2 N 1 δ 1 ϕ.,. y n 1 σ2 1 ϕ 1 ϕ ϕ n 1 ϕ 1 ϕ n ϕ n 1 ϕ n 2 1 In writing down the likelihood we also saw that an AR1) is 1st order Markovian, fy 1,..., y n ) = fy 1 ) fy 2 y 1 ) fy 3 y 2, y 1 ) fy n y n 1, y n 2,..., y 1 ) = fy 1 ) fy 2 y 1 ) fy 3 y 2 ) fy n y n 1 ) 13

17 Competing Definitions for y t y t = δ + ϕ y t 1 + w t vs. y t y t 1 N δ + ϕ y t 1, σ 2 ) 14

18 Competing Definitions for y t y t = δ + ϕ y t 1 + w t vs. y t y t 1 N δ + ϕ y t 1, σ 2 ) In the case of time, both of these definitions result in the same multivariate distribution for y. 14

19 AR in Space s1 s2 s3 s4 s5 s6 s7 s8 s9 s10 15

20 AR in Space s1 s2 s3 s4 s5 s6 s7 s8 s9 s10 Even in the simplest spatial case there is no clear / unique ordering, f ys 1),..., ) ys 10) = ) ) f ys 1) f ys 2) ys 1) f ys 10 ys 9), ys 8),..., ) ys 1) = ) ) f ys 10) f ys 9) ys 10) f ys 1 ys 2), ys 3),..., ) ys 10) =? 15

21 AR in Space s1 s2 s3 s4 s5 s6 s7 s8 s9 s10 Even in the simplest spatial case there is no clear / unique ordering, f ys 1),..., ) ys 10) = ) ) f ys 1) f ys 2) ys 1) f ys 10 ys 9), ys 8),..., ) ys 1) = ) ) f ys 10) f ys 9) ys 10) f ys 1 ys 2), ys 3),..., ) ys 10) =? Instead we need to think about things in terms of their neighbors / neighborhoods. We will define Ns i ) to be the set of neighbors of location s i. If we define the neighborhood based on touching then Ns 3 ) = {s 2, s 4 } If we use distance within 2 units then Ns 3 ) = {s 1, s 2, s 3, s 4 } etc. 15

22 Defining the Spatial AR model Here we will consider a simple average of neighboring observations, just like with the temporal AR model we have two options in terms of defining the autoregressive process, Simultaneous Autogressve SAR) ys) = δ + ϕ 1 Ns) s Ns) ys ) + N 0, σ 2 ) Conditional Autoregressive CAR) ys) y s N δ + ϕ 1 Ns) s Ns) ys ), σ 2 16

23 Simultaneous Autogressve SAR) Using ys) = δ + ϕ 1 Ns) we want to find the distribution of y = s Ns) ys ) + N 0, σ 2 ) ys 1 ), ys 2 ),..., ys n )) t. 17

24 Simultaneous Autogressve SAR) Using ys) = δ + ϕ 1 Ns) we want to find the distribution of y = s Ns) ys ) + N 0, σ 2 ) ys 1 ), ys 2 ),..., ys n )) t. First we need to define a weight matrix W where {W} ij = { 1/ Ns i ) if j Ns i ) 0 otherwise 17

25 Simultaneous Autogressve SAR) Using ys) = δ + ϕ 1 Ns) we want to find the distribution of y = s Ns) ys ) + N 0, σ 2 ) ys 1 ), ys 2 ),..., ys n )) t. First we need to define a weight matrix W where {W} ij = { 1/ Ns i ) if j Ns i ) 0 otherwise then we can write y as follows, y = δ + ϕ W y + ϵ where ϵ N 0, σ 2 I) 17

26 A toy example 0 1/2 1/2 W = 1/2 0 1/2 1/2 1/2 0 18

27 Back to SAR y = δ + ϕ W y + ϵ 19

28 Conditional Autogressve CAR) This is a bit trickier, in the case of the temporal AR process we actually went from joint distribution conditional distributions which we were then able to simplify). Since we don t have a natural ordering we can t get away with this at least not easily). Going the other way, conditional distributions joint distribution is difficult because it is possible to specify conditional distributions that lead to an improper joint distribution. 20

29 Brook s Lemma For sets of observations x and y where px) > 0 x x and py) > 0 y y then py) n px) = py i y 1,..., y i 1, x i+1,..., x n ) px i=1 i x 1,..., x i 1, y i+1,..., y n ) n py i x 1,..., x i 1, y i+1,..., y n ) = px i y 1,..., y i 1, x i+1,..., x n ) i=1 21

30 A simplified example Let y = y 1, y 2 ) and x = x 1, x 2 ) then we can derive Brook s Lemma for this case, py 1, y 2 ) = py 1 y 2 )py 2 ) = py 1 y 2 ) py 2 x 1 ) px 1 ) px 1 y 2 ) = py 1 y 2 ) px 1 y 2 ) py 2 x 1 ) px 1 ) = py 1 y 2 ) py 2 x 1 ) px 1 y 2 ) px 2 x 1 ) px 1, x 2 ) = py 1 y 2 ) px 1 y 2 ) py 2 x 1 ) px 1 ) ) px2 x 1 ) px 2 x 1 ) py 1, y 2 ) px 1, x 2 ) = py 1 y 2 ) py 2 x 1 ) px 1 y 2 ) px 2 x 1 ) 22

31 Utility? Lets repeat that last example but consider the case where y = y 1, y 2 ) but now we let x = y 1 = 0, y 2 = 0) py 1, y 2 ) px 1, x 2 ) = py 1, y 2 ) py 1 = 0, y 2 = 0) py 1, y 2 ) = py 1 y 2 ) py 2 y 1 = 0) py 1 = 0 y 2 ) py 2 = 0 y 1 = 0) py 1 = 0, y 2 = 0) py 1, y 2 ) py 1 y 2 ) py 2 y 1 = 0) py 1 = 0 y 2 ) py 2 y 1 ) py 1 y 2 = 0) py 2 = 0 y 1 ) 23

32 As applied to a simple CAR s1 s2 ys 1) ys 2) N ϕw 12 ys 2), σ 2 ) ys 2) ys 1) N ϕw 21 ys 1), σ 2 ) 24

33 As applied to a simple CAR s1 s2 ys 1) ys 2) N ϕw 12 ys 2), σ 2 ) ys 2) ys 1) N ϕw 21 ys 1), σ 2 ) ) ) ) p ys 1), ys 2) p ys 1) ys 2) p ys 2) ys 1) = 0 ys ) 1) = 0 ys 2) p exp 1 2σ 2 ys 1) ϕ 2) W 12 ys 2)) exp exp 1 2σ 2 0 2) ϕw 12ys 2)) exp exp exp 1 1 2σ 2 ys 2) ϕ W 21 0) 2) ys1) ϕ W 12 ys 2)) 2 + ys 2) 2 2)) ϕw 12ys 2)) 2σ 2 1 ys 1) 2 2ϕ W 12 ys 1) ys 2) + 2)) ys 2) 2σ 2 ) ) 1 2σ y 0) 1 ϕw12 y 0) t 2 ϕw

34 Implicatiomns for y µ = 0 ) Σ 1 = 1 1 ϕw 12 σ 2 ϕw 12 1 = 1 I ϕ W) σ2 Σ = σ 2 I ϕ W) 1 25

35 Implicatiomns for y µ = 0 ) Σ 1 = 1 1 ϕw 12 σ 2 ϕw 12 1 = 1 I ϕ W) σ2 Σ = σ 2 I ϕ W) 1 we can then conclude that for y = ys 1 ), ys 2 )) t, y N 0, σ 2 I ϕ W) 1) which generalizes for all mean 0 CAR models. 25

36 General Proof Let y = ys 1 ),..., ys n )) and 0 = ys 1 ) = 0,..., ys n ) = 0) then by Brook s lemma, py) p0) = = n i=1 n py i y 1,..., y i 1, 0 i+1,..., 0 n) p0 i y 1,..., y i 1, 0 i+1,..., 0 n) exp i=1 exp = exp = exp 1 2σ 2 1 2σ 2 1 2σ 2 1 2σ 2 y i ϕ j<i Wij yj ϕ j>i 0j ) 2 ) ) ) 0 i ϕ j<i Wij yj ϕ 2 j>i 0j ) 2 ) 2 ) n n y i ϕ W ij y j + 1 ϕ W ij y j 2σ 2 i=1 j<i i=1 j<i ) n y 2 i 2ϕy i W ij y j i=1 j<i n n = exp 1 y 2 2σ 2 i ϕ i=1 i=1 j=1 ) = exp 1 2σ 2 yt I ϕw)y ) n ) y i W ij y j if W ij = W ji 26

37 CAR vs SAR Simultaneous Autogressve SAR) ys) = ϕ s W s s W s ys ) + ϵ y N 0, σ 2 I ϕw) 1 )I ϕw) 1 ) t ) Conditional Autoregressive CAR) ) W s s ys) y s N ys ), σ 2 W s s y N 0, σ 2 I ϕw) 1 ) 27

38 Generalization Adopting different weight matrices, W Between SAR and CAR model we move to a generic weight matrix definition beyond average of nearest neighbors) In time we varied p in the ARp) model, in space we adjust the weight matrix. In general having a symmetric W is helpful, but not required More complex Variance beyond σ 2 I) σ 2 can be a vector differences between areal locations) E.g. since areal data tends to be aggregated - adjust variance based on sample size 28

Lecture 7 Autoregressive Processes in Space

Lecture 7 Autoregressive Processes in Space Lecture 7 Autoregressive Processes in Space Dennis Sun Stanford University Stats 253 July 8, 2015 1 Last Time 2 Autoregressive Processes in Space 3 Estimating Parameters 4 Testing for Spatial Autocorrelation

More information

Areal Unit Data Regular or Irregular Grids or Lattices Large Point-referenced Datasets

Areal Unit Data Regular or Irregular Grids or Lattices Large Point-referenced Datasets Areal Unit Data Regular or Irregular Grids or Lattices Large Point-referenced Datasets Is there spatial pattern? Chapter 3: Basics of Areal Data Models p. 1/18 Areal Unit Data Regular or Irregular Grids

More information

Areal data models. Spatial smoothers. Brook s Lemma and Gibbs distribution. CAR models Gaussian case Non-Gaussian case

Areal data models. Spatial smoothers. Brook s Lemma and Gibbs distribution. CAR models Gaussian case Non-Gaussian case Areal data models Spatial smoothers Brook s Lemma and Gibbs distribution CAR models Gaussian case Non-Gaussian case SAR models Gaussian case Non-Gaussian case CAR vs. SAR STAR models Inference for areal

More information

Lattice Data. Tonglin Zhang. Spatial Statistics for Point and Lattice Data (Part III)

Lattice Data. Tonglin Zhang. Spatial Statistics for Point and Lattice Data (Part III) Title: Spatial Statistics for Point Processes and Lattice Data (Part III) Lattice Data Tonglin Zhang Outline Description Research Problems Global Clustering and Local Clusters Permutation Test Spatial

More information

Asymptotic standard errors of MLE

Asymptotic standard errors of MLE Asymptotic standard errors of MLE Suppose, in the previous example of Carbon and Nitrogen in soil data, that we get the parameter estimates For maximum likelihood estimation, we can use Hessian matrix

More information

Areal data. Infant mortality, Auckland NZ districts. Number of plant species in 20cm x 20 cm patches of alpine tundra. Wheat yield

Areal data. Infant mortality, Auckland NZ districts. Number of plant species in 20cm x 20 cm patches of alpine tundra. Wheat yield Areal data Reminder about types of data Geostatistical data: Z(s) exists everyhere, varies continuously Can accommodate sudden changes by a model for the mean E.g., soil ph, two soil types with different

More information

Spatial Regression. 1. Introduction and Review. Luc Anselin. Copyright 2017 by Luc Anselin, All Rights Reserved

Spatial Regression. 1. Introduction and Review. Luc Anselin.  Copyright 2017 by Luc Anselin, All Rights Reserved Spatial Regression 1. Introduction and Review Luc Anselin http://spatial.uchicago.edu matrix algebra basics spatial econometrics - definitions pitfalls of spatial analysis spatial autocorrelation spatial

More information

Classic Time Series Analysis

Classic Time Series Analysis Classic Time Series Analysis Concepts and Definitions Let Y be a random number with PDF f Y t ~f,t Define t =E[Y t ] m(t) is known as the trend Define the autocovariance t, s =COV [Y t,y s ] =E[ Y t t

More information

Technical Vignette 5: Understanding intrinsic Gaussian Markov random field spatial models, including intrinsic conditional autoregressive models

Technical Vignette 5: Understanding intrinsic Gaussian Markov random field spatial models, including intrinsic conditional autoregressive models Technical Vignette 5: Understanding intrinsic Gaussian Markov random field spatial models, including intrinsic conditional autoregressive models Christopher Paciorek, Department of Statistics, University

More information

Covariance. Lecture 20: Covariance / Correlation & General Bivariate Normal. Covariance, cont. Properties of Covariance

Covariance. Lecture 20: Covariance / Correlation & General Bivariate Normal. Covariance, cont. Properties of Covariance Covariance Lecture 0: Covariance / Correlation & General Bivariate Normal Sta30 / Mth 30 We have previously discussed Covariance in relation to the variance of the sum of two random variables Review Lecture

More information

2. An Introduction to Moving Average Models and ARMA Models

2. An Introduction to Moving Average Models and ARMA Models . An Introduction to Moving Average Models and ARMA Models.1 White Noise. The MA(1) model.3 The MA(q) model..4 Estimation and forecasting of MA models..5 ARMA(p,q) models. The Moving Average (MA) models

More information

Global Spatial Autocorrelation Clustering

Global Spatial Autocorrelation Clustering Global Spatial Autocorrelation Clustering Luc Anselin http://spatial.uchicago.edu join count statistics Moran s I Moran scatter plot non-parametric spatial autocorrelation Join Count Statistics Recap -

More information

Lecture 23. Spatio-temporal Models. Colin Rundel 04/17/2017

Lecture 23. Spatio-temporal Models. Colin Rundel 04/17/2017 Lecture 23 Spatio-temporal Models Colin Rundel 04/17/2017 1 Spatial Models with AR time dependence 2 Example - Weather station data Based on Andrew Finley and Sudipto Banerjee s notes from National Ecological

More information

Markov Random Fields

Markov Random Fields Markov Random Fields 1. Markov property The Markov property of a stochastic sequence {X n } n 0 implies that for all n 1, X n is independent of (X k : k / {n 1, n, n + 1}), given (X n 1, X n+1 ). Another

More information

Lecture 15. Hypothesis testing in the linear model

Lecture 15. Hypothesis testing in the linear model 14. Lecture 15. Hypothesis testing in the linear model Lecture 15. Hypothesis testing in the linear model 1 (1 1) Preliminary lemma 15. Hypothesis testing in the linear model 15.1. Preliminary lemma Lemma

More information

Probabilistic Graphical Models Lecture Notes Fall 2009

Probabilistic Graphical Models Lecture Notes Fall 2009 Probabilistic Graphical Models Lecture Notes Fall 2009 October 28, 2009 Byoung-Tak Zhang School of omputer Science and Engineering & ognitive Science, Brain Science, and Bioinformatics Seoul National University

More information

Using Estimating Equations for Spatially Correlated A

Using Estimating Equations for Spatially Correlated A Using Estimating Equations for Spatially Correlated Areal Data December 8, 2009 Introduction GEEs Spatial Estimating Equations Implementation Simulation Conclusion Typical Problem Assess the relationship

More information

Lab #3 Background Material Quantifying Point and Gradient Patterns

Lab #3 Background Material Quantifying Point and Gradient Patterns Lab #3 Background Material Quantifying Point and Gradient Patterns Dispersion metrics Dispersion indices that measure the degree of non-randomness Plot-based metrics Distance-based metrics First-order

More information

Temporal vs. Spatial Data

Temporal vs. Spatial Data Temporal vs. Spatial Data Temporal 1 dimensional Units: day, week, month Lag: t, t-1, t-2 Durbin-Watson Spatial 2-3 dimensional Units: county, mile, region Lag: near neighbor, networks (?) Moran s I Differencing

More information

Bayesian Analysis (Optional)

Bayesian Analysis (Optional) Bayesian Analysis (Optional) 1 2 Big Picture There are two ways to conduct statistical inference 1. Classical method (frequentist), which postulates (a) Probability refers to limiting relative frequencies

More information

Basics of Geographic Analysis in R

Basics of Geographic Analysis in R Basics of Geographic Analysis in R Spatial Autocorrelation and Spatial Weights Yuri M. Zhukov GOV 2525: Political Geography February 25, 2013 Outline 1. Introduction 2. Spatial Data and Basic Visualization

More information

Spatial Autocorrelation

Spatial Autocorrelation Spatial Autocorrelation Luc Anselin http://spatial.uchicago.edu spatial randomness positive and negative spatial autocorrelation spatial autocorrelation statistics spatial weights Spatial Randomness The

More information

Community Health Needs Assessment through Spatial Regression Modeling

Community Health Needs Assessment through Spatial Regression Modeling Community Health Needs Assessment through Spatial Regression Modeling Glen D. Johnson, PhD CUNY School of Public Health glen.johnson@lehman.cuny.edu Objectives: Assess community needs with respect to particular

More information

Univariate linear models

Univariate linear models Univariate linear models The specification process of an univariate ARIMA model is based on the theoretical properties of the different processes and it is also important the observation and interpretation

More information

A time series is called strictly stationary if the joint distribution of every collection (Y t

A time series is called strictly stationary if the joint distribution of every collection (Y t 5 Time series A time series is a set of observations recorded over time. You can think for example at the GDP of a country over the years (or quarters) or the hourly measurements of temperature over a

More information

L6: Regression II. JJ Chen. July 2, 2015

L6: Regression II. JJ Chen. July 2, 2015 L6: Regression II JJ Chen July 2, 2015 Today s Plan Review basic inference based on Sample average Difference in sample average Extrapolate the knowledge to sample regression coefficients Standard error,

More information

KAAF- GE_Notes GIS APPLICATIONS LECTURE 3

KAAF- GE_Notes GIS APPLICATIONS LECTURE 3 GIS APPLICATIONS LECTURE 3 SPATIAL AUTOCORRELATION. First law of geography: everything is related to everything else, but near things are more related than distant things Waldo Tobler Check who is sitting

More information

Multivariate Statistical Analysis

Multivariate Statistical Analysis Multivariate Statistical Analysis Fall 2011 C. L. Williams, Ph.D. Lecture 4 for Applied Multivariate Analysis Outline 1 Eigen values and eigen vectors Characteristic equation Some properties of eigendecompositions

More information

Spatial inference. Spatial inference. Accounting for spatial correlation. Multivariate normal distributions

Spatial inference. Spatial inference. Accounting for spatial correlation. Multivariate normal distributions Spatial inference I will start with a simple model, using species diversity data Strong spatial dependence, Î = 0.79 what is the mean diversity? How precise is our estimate? Sampling discussion: The 64

More information

Chapter 3 - Temporal processes

Chapter 3 - Temporal processes STK4150 - Intro 1 Chapter 3 - Temporal processes Odd Kolbjørnsen and Geir Storvik January 23 2017 STK4150 - Intro 2 Temporal processes Data collected over time Past, present, future, change Temporal aspect

More information

Probabilistic Graphical Models

Probabilistic Graphical Models Probabilistic Graphical Models Brown University CSCI 2950-P, Spring 2013 Prof. Erik Sudderth Lecture 12: Gaussian Belief Propagation, State Space Models and Kalman Filters Guest Kalman Filter Lecture by

More information

Regression with correlation for the Sales Data

Regression with correlation for the Sales Data Regression with correlation for the Sales Data Scatter with Loess Curve Time Series Plot Sales 30 35 40 45 Sales 30 35 40 45 0 10 20 30 40 50 Week 0 10 20 30 40 50 Week Sales Data What is our goal with

More information

An Introduction to Spatial Statistics. Chunfeng Huang Department of Statistics, Indiana University

An Introduction to Spatial Statistics. Chunfeng Huang Department of Statistics, Indiana University An Introduction to Spatial Statistics Chunfeng Huang Department of Statistics, Indiana University Microwave Sounding Unit (MSU) Anomalies (Monthly): 1979-2006. Iron Ore (Cressie, 1986) Raw percent data

More information

3d scatterplots. You can also make 3d scatterplots, although these are less common than scatterplot matrices.

3d scatterplots. You can also make 3d scatterplots, although these are less common than scatterplot matrices. 3d scatterplots You can also make 3d scatterplots, although these are less common than scatterplot matrices. > library(scatterplot3d) > y par(mfrow=c(2,2)) > scatterplot3d(y,highlight.3d=t,angle=20)

More information

Probabilistic Graphical Models

Probabilistic Graphical Models School of Computer Science Probabilistic Graphical Models Gaussian graphical models and Ising models: modeling networks Eric Xing Lecture 0, February 7, 04 Reading: See class website Eric Xing @ CMU, 005-04

More information

Part 4: Multi-parameter and normal models

Part 4: Multi-parameter and normal models Part 4: Multi-parameter and normal models 1 The normal model Perhaps the most useful (or utilized) probability model for data analysis is the normal distribution There are several reasons for this, e.g.,

More information

Gaussian Processes 1. Schedule

Gaussian Processes 1. Schedule 1 Schedule 17 Jan: Gaussian processes (Jo Eidsvik) 24 Jan: Hands-on project on Gaussian processes (Team effort, work in groups) 31 Jan: Latent Gaussian models and INLA (Jo Eidsvik) 7 Feb: Hands-on project

More information

Non-Stationary Time Series and Unit Root Testing

Non-Stationary Time Series and Unit Root Testing Econometrics II Non-Stationary Time Series and Unit Root Testing Morten Nyboe Tabor Course Outline: Non-Stationary Time Series and Unit Root Testing 1 Stationarity and Deviation from Stationarity Trend-Stationarity

More information

2/7/2018. Module 4. Spatial Statistics. Point Patterns: Nearest Neighbor. Spatial Statistics. Point Patterns: Nearest Neighbor

2/7/2018. Module 4. Spatial Statistics. Point Patterns: Nearest Neighbor. Spatial Statistics. Point Patterns: Nearest Neighbor Spatial Statistics Module 4 Geographers are very interested in studying, understanding, and quantifying the patterns we can see on maps Q: What kinds of map patterns can you think of? There are so many

More information

Next tool is Partial ACF; mathematical tools first. The Multivariate Normal Distribution. e z2 /2. f Z (z) = 1 2π. e z2 i /2

Next tool is Partial ACF; mathematical tools first. The Multivariate Normal Distribution. e z2 /2. f Z (z) = 1 2π. e z2 i /2 Next tool is Partial ACF; mathematical tools first. The Multivariate Normal Distribution Defn: Z R 1 N(0,1) iff f Z (z) = 1 2π e z2 /2 Defn: Z R p MV N p (0, I) if and only if Z = (Z 1,..., Z p ) (a column

More information

Discrete time processes

Discrete time processes Discrete time processes Predictions are difficult. Especially about the future Mark Twain. Florian Herzog 2013 Modeling observed data When we model observed (realized) data, we encounter usually the following

More information

Modeling Data with Linear Combinations of Basis Functions. Read Chapter 3 in the text by Bishop

Modeling Data with Linear Combinations of Basis Functions. Read Chapter 3 in the text by Bishop Modeling Data with Linear Combinations of Basis Functions Read Chapter 3 in the text by Bishop A Type of Supervised Learning Problem We want to model data (x 1, t 1 ),..., (x N, t N ), where x i is a vector

More information

Undirected graphical models

Undirected graphical models Undirected graphical models Semantics of probabilistic models over undirected graphs Parameters of undirected models Example applications COMP-652 and ECSE-608, February 16, 2017 1 Undirected graphical

More information

1 Quantitative Techniques in Practice

1 Quantitative Techniques in Practice 1 Quantitative Techniques in Practice 1.1 Lecture 2: Stationarity, spurious regression, etc. 1.1.1 Overview In the rst part we shall look at some issues in time series economics. In the second part we

More information

Switching Regime Estimation

Switching Regime Estimation Switching Regime Estimation Series de Tiempo BIrkbeck March 2013 Martin Sola (FE) Markov Switching models 01/13 1 / 52 The economy (the time series) often behaves very different in periods such as booms

More information

Machine Learning (CS 567) Lecture 5

Machine Learning (CS 567) Lecture 5 Machine Learning (CS 567) Lecture 5 Time: T-Th 5:00pm - 6:20pm Location: GFS 118 Instructor: Sofus A. Macskassy (macskass@usc.edu) Office: SAL 216 Office hours: by appointment Teaching assistant: Cheol

More information

University of Oxford. Statistical Methods Autocorrelation. Identification and Estimation

University of Oxford. Statistical Methods Autocorrelation. Identification and Estimation University of Oxford Statistical Methods Autocorrelation Identification and Estimation Dr. Órlaith Burke Michaelmas Term, 2011 Department of Statistics, 1 South Parks Road, Oxford OX1 3TG Contents 1 Model

More information

Spatial Effects and Externalities

Spatial Effects and Externalities Spatial Effects and Externalities Philip A. Viton November 5, Philip A. Viton CRP 66 Spatial () Externalities November 5, / 5 Introduction If you do certain things to your property for example, paint your

More information

CSE446: Clustering and EM Spring 2017

CSE446: Clustering and EM Spring 2017 CSE446: Clustering and EM Spring 2017 Ali Farhadi Slides adapted from Carlos Guestrin, Dan Klein, and Luke Zettlemoyer Clustering systems: Unsupervised learning Clustering Detect patterns in unlabeled

More information

Spatio-Temporal Modelling of Credit Default Data

Spatio-Temporal Modelling of Credit Default Data 1/20 Spatio-Temporal Modelling of Credit Default Data Sathyanarayan Anand Advisor: Prof. Robert Stine The Wharton School, University of Pennsylvania April 29, 2011 2/20 Outline 1 Background 2 Conditional

More information

Lecture 19. Spatial GLM + Point Reference Spatial Data. Colin Rundel 04/03/2017

Lecture 19. Spatial GLM + Point Reference Spatial Data. Colin Rundel 04/03/2017 Lecture 19 Spatial GLM + Point Reference Spatial Data Colin Rundel 04/03/2017 1 Spatial GLM Models 2 Scottish Lip Cancer Data Observed Expected 60 N 59 N 58 N 57 N 56 N value 80 60 40 20 0 55 N 8 W 6 W

More information

I don t have much to say here: data are often sampled this way but we more typically model them in continuous space, or on a graph

I don t have much to say here: data are often sampled this way but we more typically model them in continuous space, or on a graph Spatial analysis Huge topic! Key references Diggle (point patterns); Cressie (everything); Diggle and Ribeiro (geostatistics); Dormann et al (GLMMs for species presence/abundance); Haining; (Pinheiro and

More information

MAP Examples. Sargur Srihari

MAP Examples. Sargur Srihari MAP Examples Sargur srihari@cedar.buffalo.edu 1 Potts Model CRF for OCR Topics Image segmentation based on energy minimization 2 Examples of MAP Many interesting examples of MAP inference are instances

More information

Approaches for Multiple Disease Mapping: MCAR and SANOVA

Approaches for Multiple Disease Mapping: MCAR and SANOVA Approaches for Multiple Disease Mapping: MCAR and SANOVA Dipankar Bandyopadhyay Division of Biostatistics, University of Minnesota SPH April 22, 2015 1 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

More information

Lecture 25: Review. Statistics 104. April 23, Colin Rundel

Lecture 25: Review. Statistics 104. April 23, Colin Rundel Lecture 25: Review Statistics 104 Colin Rundel April 23, 2012 Joint CDF F (x, y) = P [X x, Y y] = P [(X, Y ) lies south-west of the point (x, y)] Y (x,y) X Statistics 104 (Colin Rundel) Lecture 25 April

More information

Spatial Statistics with Image Analysis. Lecture L02. Computer exercise 0 Daily Temperature. Lecture 2. Johan Lindström.

Spatial Statistics with Image Analysis. Lecture L02. Computer exercise 0 Daily Temperature. Lecture 2. Johan Lindström. C Stochastic fields Covariance Spatial Statistics with Image Analysis Lecture 2 Johan Lindström November 4, 26 Lecture L2 Johan Lindström - johanl@maths.lth.se FMSN2/MASM2 L /2 C Stochastic fields Covariance

More information

i=1 h n (ˆθ n ) = 0. (2)

i=1 h n (ˆθ n ) = 0. (2) Stat 8112 Lecture Notes Unbiased Estimating Equations Charles J. Geyer April 29, 2012 1 Introduction In this handout we generalize the notion of maximum likelihood estimation to solution of unbiased estimating

More information

Summer School in Statistics for Astronomers V June 1 - June 6, Regression. Mosuk Chow Statistics Department Penn State University.

Summer School in Statistics for Astronomers V June 1 - June 6, Regression. Mosuk Chow Statistics Department Penn State University. Summer School in Statistics for Astronomers V June 1 - June 6, 2009 Regression Mosuk Chow Statistics Department Penn State University. Adapted from notes prepared by RL Karandikar Mean and variance Recall

More information

Moments. Raw moment: February 25, 2014 Normalized / Standardized moment:

Moments. Raw moment: February 25, 2014 Normalized / Standardized moment: Moments Lecture 10: Central Limit Theorem and CDFs Sta230 / Mth 230 Colin Rundel Raw moment: Central moment: µ n = EX n ) µ n = E[X µ) 2 ] February 25, 2014 Normalized / Standardized moment: µ n σ n Sta230

More information

Generalized Linear Models for Non-Normal Data

Generalized Linear Models for Non-Normal Data Generalized Linear Models for Non-Normal Data Today s Class: 3 parts of a generalized model Models for binary outcomes Complications for generalized multivariate or multilevel models SPLH 861: Lecture

More information

Supervised Learning. Regression Example: Boston Housing. Regression Example: Boston Housing

Supervised Learning. Regression Example: Boston Housing. Regression Example: Boston Housing Supervised Learning Unsupervised learning: To extract structure and postulate hypotheses about data generating process from observations x 1,...,x n. Visualize, summarize and compress data. We have seen

More information

POLI 8501 Introduction to Maximum Likelihood Estimation

POLI 8501 Introduction to Maximum Likelihood Estimation POLI 8501 Introduction to Maximum Likelihood Estimation Maximum Likelihood Intuition Consider a model that looks like this: Y i N(µ, σ 2 ) So: E(Y ) = µ V ar(y ) = σ 2 Suppose you have some data on Y,

More information

Statistics 910, #5 1. Regression Methods

Statistics 910, #5 1. Regression Methods Statistics 910, #5 1 Overview Regression Methods 1. Idea: effects of dependence 2. Examples of estimation (in R) 3. Review of regression 4. Comparisons and relative efficiencies Idea Decomposition Well-known

More information

Computer Vision Group Prof. Daniel Cremers. 9. Gaussian Processes - Regression

Computer Vision Group Prof. Daniel Cremers. 9. Gaussian Processes - Regression Group Prof. Daniel Cremers 9. Gaussian Processes - Regression Repetition: Regularized Regression Before, we solved for w using the pseudoinverse. But: we can kernelize this problem as well! First step:

More information

Dale L. Zimmerman Department of Statistics and Actuarial Science, University of Iowa, USA

Dale L. Zimmerman Department of Statistics and Actuarial Science, University of Iowa, USA SPATIAL STATISTICS Dale L. Zimmerman Department of Statistics and Actuarial Science, University of Iowa, USA Keywords: Geostatistics, Isotropy, Kriging, Lattice Data, Spatial point patterns, Stationarity

More information

Time Series Analysis

Time Series Analysis Time Series Analysis hm@imm.dtu.dk Informatics and Mathematical Modelling Technical University of Denmark DK-2800 Kgs. Lyngby 1 Outline of the lecture Chapter 9 Multivariate time series 2 Transfer function

More information

Fractional Hot Deck Imputation for Robust Inference Under Item Nonresponse in Survey Sampling

Fractional Hot Deck Imputation for Robust Inference Under Item Nonresponse in Survey Sampling Fractional Hot Deck Imputation for Robust Inference Under Item Nonresponse in Survey Sampling Jae-Kwang Kim 1 Iowa State University June 26, 2013 1 Joint work with Shu Yang Introduction 1 Introduction

More information

Business Statistics. Lecture 9: Simple Regression

Business Statistics. Lecture 9: Simple Regression Business Statistics Lecture 9: Simple Regression 1 On to Model Building! Up to now, class was about descriptive and inferential statistics Numerical and graphical summaries of data Confidence intervals

More information

Dimension Reduction Techniques. Presented by Jie (Jerry) Yu

Dimension Reduction Techniques. Presented by Jie (Jerry) Yu Dimension Reduction Techniques Presented by Jie (Jerry) Yu Outline Problem Modeling Review of PCA and MDS Isomap Local Linear Embedding (LLE) Charting Background Advances in data collection and storage

More information

Models for spatial data (cont d) Types of spatial data. Types of spatial data (cont d) Hierarchical models for spatial data

Models for spatial data (cont d) Types of spatial data. Types of spatial data (cont d) Hierarchical models for spatial data Hierarchical models for spatial data Based on the book by Banerjee, Carlin and Gelfand Hierarchical Modeling and Analysis for Spatial Data, 2004. We focus on Chapters 1, 2 and 5. Geo-referenced data arise

More information

Problem Set 2: Box-Jenkins methodology

Problem Set 2: Box-Jenkins methodology Problem Set : Box-Jenkins methodology 1) For an AR1) process we have: γ0) = σ ε 1 φ σ ε γ0) = 1 φ Hence, For a MA1) process, p lim R = φ γ0) = 1 + θ )σ ε σ ε 1 = γ0) 1 + θ Therefore, p lim R = 1 1 1 +

More information

Properties of Summation Operator

Properties of Summation Operator Econ 325 Section 003/004 Notes on Variance, Covariance, and Summation Operator By Hiro Kasahara Properties of Summation Operator For a sequence of the values {x 1, x 2,..., x n, we write the sum of x 1,

More information

Nature of Spatial Data. Outline. Spatial Is Special

Nature of Spatial Data. Outline. Spatial Is Special Nature of Spatial Data Outline Spatial is special Bad news: the pitfalls of spatial data Good news: the potentials of spatial data Spatial Is Special Are spatial data special? Why spatial data require

More information

COMS 4721: Machine Learning for Data Science Lecture 10, 2/21/2017

COMS 4721: Machine Learning for Data Science Lecture 10, 2/21/2017 COMS 4721: Machine Learning for Data Science Lecture 10, 2/21/2017 Prof. John Paisley Department of Electrical Engineering & Data Science Institute Columbia University FEATURE EXPANSIONS FEATURE EXPANSIONS

More information

x. Figure 1: Examples of univariate Gaussian pdfs N (x; µ, σ 2 ).

x. Figure 1: Examples of univariate Gaussian pdfs N (x; µ, σ 2 ). .8.6 µ =, σ = 1 µ = 1, σ = 1 / µ =, σ =.. 3 1 1 3 x Figure 1: Examples of univariate Gaussian pdfs N (x; µ, σ ). The Gaussian distribution Probably the most-important distribution in all of statistics

More information

Metric-based classifiers. Nuno Vasconcelos UCSD

Metric-based classifiers. Nuno Vasconcelos UCSD Metric-based classifiers Nuno Vasconcelos UCSD Statistical learning goal: given a function f. y f and a collection of eample data-points, learn what the function f. is. this is called training. two major

More information

1 Class Organization. 2 Introduction

1 Class Organization. 2 Introduction Time Series Analysis, Lecture 1, 2018 1 1 Class Organization Course Description Prerequisite Homework and Grading Readings and Lecture Notes Course Website: http://www.nanlifinance.org/teaching.html wechat

More information

Univariate ARIMA Models

Univariate ARIMA Models Univariate ARIMA Models ARIMA Model Building Steps: Identification: Using graphs, statistics, ACFs and PACFs, transformations, etc. to achieve stationary and tentatively identify patterns and model components.

More information

Non-Stationary Time Series and Unit Root Testing

Non-Stationary Time Series and Unit Root Testing Econometrics II Non-Stationary Time Series and Unit Root Testing Morten Nyboe Tabor Course Outline: Non-Stationary Time Series and Unit Root Testing 1 Stationarity and Deviation from Stationarity Trend-Stationarity

More information

Bayesian Inference in a Normal Population

Bayesian Inference in a Normal Population Bayesian Inference in a Normal Population September 17, 2008 Gill Chapter 3. Sections 1-4, 7-8 Bayesian Inference in a Normal Population p.1/18 Normal Model IID observations Y = (Y 1,Y 2,...Y n ) Y i N(µ,σ

More information

Matrix Approach to Simple Linear Regression: An Overview

Matrix Approach to Simple Linear Regression: An Overview Matrix Approach to Simple Linear Regression: An Overview Aspects of matrices that you should know: Definition of a matrix Addition/subtraction/multiplication of matrices Symmetric/diagonal/identity matrix

More information

TIME SERIES DATA ANALYSIS USING EVIEWS

TIME SERIES DATA ANALYSIS USING EVIEWS TIME SERIES DATA ANALYSIS USING EVIEWS I Gusti Ngurah Agung Graduate School Of Management Faculty Of Economics University Of Indonesia Ph.D. in Biostatistics and MSc. in Mathematical Statistics from University

More information

Inference and estimation in probabilistic time series models

Inference and estimation in probabilistic time series models 1 Inference and estimation in probabilistic time series models David Barber, A Taylan Cemgil and Silvia Chiappa 11 Time series The term time series refers to data that can be represented as a sequence

More information

[y i α βx i ] 2 (2) Q = i=1

[y i α βx i ] 2 (2) Q = i=1 Least squares fits This section has no probability in it. There are no random variables. We are given n points (x i, y i ) and want to find the equation of the line that best fits them. We take the equation

More information

Reminders. Thought questions should be submitted on eclass. Please list the section related to the thought question

Reminders. Thought questions should be submitted on eclass. Please list the section related to the thought question Linear regression Reminders Thought questions should be submitted on eclass Please list the section related to the thought question If it is a more general, open-ended question not exactly related to a

More information

Spatial Statistics with Image Analysis. Lecture L08. Computer exercise 3. Lecture 8. Johan Lindström. November 25, 2016

Spatial Statistics with Image Analysis. Lecture L08. Computer exercise 3. Lecture 8. Johan Lindström. November 25, 2016 C3 Repetition Creating Q Spectral Non-grid Spatial Statistics with Image Analysis Lecture 8 Johan Lindström November 25, 216 Johan Lindström - johanl@maths.lth.se FMSN2/MASM25L8 1/39 Lecture L8 C3 Repetition

More information

Testing for global spatial autocorrelation in Stata

Testing for global spatial autocorrelation in Stata Testing for global spatial autocorrelation in Stata Keisuke Kondo March 31, 2018 (moransi: version 1.00) Abstract This paper introduces the new Stata command moransi, which computes Moran s I statistic

More information

Lecture: Some Practical Considerations (3 of 4)

Lecture: Some Practical Considerations (3 of 4) Stat260/CS294: Spectral Graph Methods Lecture 14-03/10/2015 Lecture: Some Practical Considerations (3 of 4) Lecturer: Michael Mahoney Scribe: Michael Mahoney Warning: these notes are still very rough.

More information

41903: Introduction to Nonparametrics

41903: Introduction to Nonparametrics 41903: Notes 5 Introduction Nonparametrics fundamentally about fitting flexible models: want model that is flexible enough to accommodate important patterns but not so flexible it overspecializes to specific

More information

1. How can you tell if there is serial correlation? 2. AR to model serial correlation. 3. Ignoring serial correlation. 4. GLS. 5. Projects.

1. How can you tell if there is serial correlation? 2. AR to model serial correlation. 3. Ignoring serial correlation. 4. GLS. 5. Projects. 1. How can you tell if there is serial correlation? 2. AR to model serial correlation. 3. Ignoring serial correlation. 4. GLS. 5. Projects. 1) Identifying serial correlation. Plot Y t versus Y t 1. See

More information

MAS3301 Bayesian Statistics

MAS3301 Bayesian Statistics MAS3301 Bayesian Statistics M. Farrow School of Mathematics and Statistics Newcastle University Semester 2, 2008-9 1 15 Inference for Normal Distributions II 15.1 Student s t-distribution When we look

More information

Stat260: Bayesian Modeling and Inference Lecture Date: February 10th, Jeffreys priors. exp 1 ) p 2

Stat260: Bayesian Modeling and Inference Lecture Date: February 10th, Jeffreys priors. exp 1 ) p 2 Stat260: Bayesian Modeling and Inference Lecture Date: February 10th, 2010 Jeffreys priors Lecturer: Michael I. Jordan Scribe: Timothy Hunter 1 Priors for the multivariate Gaussian Consider a multivariate

More information

Chance Constrained Data Envelopment Analysis The Productive Efficiency of Units with Stochastic Outputs

Chance Constrained Data Envelopment Analysis The Productive Efficiency of Units with Stochastic Outputs Chance Constrained Data Envelopment Analysis The Productive Efficiency of Units with Stochastic Outputs Michal Houda Department of Applied Mathematics and Informatics ROBUST 2016, September 11 16, 2016

More information

ECON/FIN 250: Forecasting in Finance and Economics: Section 6: Standard Univariate Models

ECON/FIN 250: Forecasting in Finance and Economics: Section 6: Standard Univariate Models ECON/FIN 250: Forecasting in Finance and Economics: Section 6: Standard Univariate Models Patrick Herb Brandeis University Spring 2016 Patrick Herb (Brandeis University) Standard Univariate Models ECON/FIN

More information

Lecture 13: Simple Linear Regression in Matrix Format. 1 Expectations and Variances with Vectors and Matrices

Lecture 13: Simple Linear Regression in Matrix Format. 1 Expectations and Variances with Vectors and Matrices Lecture 3: Simple Linear Regression in Matrix Format To move beyond simple regression we need to use matrix algebra We ll start by re-expressing simple linear regression in matrix form Linear algebra is

More information

The Multivariate Normal Distribution 1

The Multivariate Normal Distribution 1 The Multivariate Normal Distribution 1 STA 302 Fall 2014 1 See last slide for copyright information. 1 / 37 Overview 1 Moment-generating Functions 2 Definition 3 Properties 4 χ 2 and t distributions 2

More information

Checking the Poisson assumption in the Poisson generalized linear model

Checking the Poisson assumption in the Poisson generalized linear model Checking the Poisson assumption in the Poisson generalized linear model The Poisson regression model is a generalized linear model (glm) satisfying the following assumptions: The responses y i are independent

More information

Conditional distributions (discrete case)

Conditional distributions (discrete case) Conditional distributions (discrete case) The basic idea behind conditional distributions is simple: Suppose (XY) is a jointly-distributed random vector with a discrete joint distribution. Then we can

More information

Probabilistic Graphical Models

Probabilistic Graphical Models Probabilistic Graphical Models Lecture 12 Dynamical Models CS/CNS/EE 155 Andreas Krause Homework 3 out tonight Start early!! Announcements Project milestones due today Please email to TAs 2 Parameter learning

More information

Monte Carlo Studies. The response in a Monte Carlo study is a random variable.

Monte Carlo Studies. The response in a Monte Carlo study is a random variable. Monte Carlo Studies The response in a Monte Carlo study is a random variable. The response in a Monte Carlo study has a variance that comes from the variance of the stochastic elements in the data-generating

More information