Asymptotic standard errors of MLE

Similar documents
What s for today. Introduction to Space-time models. c Mikyoung Jun (Texas A&M) Stat647 Lecture 14 October 16, / 19

Simple example of analysis on spatial-temporal data set

What s for today. Continue to discuss about nonstationary models Moving windows Convolution model Weighted stationary model

Lecture 7 Autoregressive Processes in Space

What s for today. Random Fields Autocovariance Stationarity, Isotropy. c Mikyoung Jun (Texas A&M) stat647 Lecture 2 August 30, / 13

MS&E 226: Small Data. Lecture 11: Maximum likelihood (v2) Ramesh Johari

Spatial inference. Spatial inference. Accounting for spatial correlation. Multivariate normal distributions

What s for today. All about Variogram Nugget effect. Mikyoung Jun (Texas A&M) stat647 lecture 4 September 6, / 17

Areal Unit Data Regular or Irregular Grids or Lattices Large Point-referenced Datasets

An Introduction to Spatial Statistics. Chunfeng Huang Department of Statistics, Indiana University

Lattice Data. Tonglin Zhang. Spatial Statistics for Point and Lattice Data (Part III)

Lecture 16 Solving GLMs via IRWLS

Areal data models. Spatial smoothers. Brook s Lemma and Gibbs distribution. CAR models Gaussian case Non-Gaussian case

Gauge Plots. Gauge Plots JAPANESE BEETLE DATA MAXIMUM LIKELIHOOD FOR SPATIALLY CORRELATED DISCRETE DATA JAPANESE BEETLE DATA

Problems. Suppose both models are fitted to the same data. Show that SS Res, A SS Res, B

Nonparametric Bayesian Methods (Gaussian Processes)

CS281 Section 4: Factor Analysis and PCA

Peter Hoff Linear and multilinear models April 3, GLS for multivariate regression 5. 3 Covariance estimation for the GLM 8

Saddlepoint-Based Bootstrap Inference in Dependent Data Settings

Stat 5101 Lecture Notes

Linear models. Rasmus Waagepetersen Department of Mathematics Aalborg University Denmark. October 5, 2016

Matrices and Multivariate Statistics - II

STAT5044: Regression and Anova

Weighted Least Squares

De-mystifying random effects models

Outline. Overview of Issues. Spatial Regression. Luc Anselin

1 Mixed effect models and longitudinal data analysis

For more information about how to cite these materials visit

11 : Gaussian Graphic Models and Ising Models

Decomposable and Directed Graphical Gaussian Models

Non-stationary Cross-Covariance Models for Multivariate Processes on a Globe

Regression, Ridge Regression, Lasso

Massachusetts Institute of Technology Department of Electrical Engineering and Computer Science Algorithms For Inference Fall 2014

Nonstationary cross-covariance models for multivariate processes on a globe

Lecture 3 September 1

Markov random fields. The Markov property

[y i α βx i ] 2 (2) Q = i=1

10. Time series regression and forecasting

STA442/2101: Assignment 5

ECE521 week 3: 23/26 January 2017

UCSD ECE153 Handout #30 Prof. Young-Han Kim Thursday, May 15, Homework Set #6 Due: Thursday, May 22, 2011

Stat260: Bayesian Modeling and Inference Lecture Date: February 10th, Jeffreys priors. exp 1 ) p 2

Regression Models - Introduction

Econometrics. Week 4. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

STAT5044: Regression and Anova. Inyoung Kim

MS&E 226: Small Data

STAT Financial Time Series

MATH 829: Introduction to Data Mining and Analysis Graphical Models I

STAT 135 Lab 13 (Review) Linear Regression, Multivariate Random Variables, Prediction, Logistic Regression and the δ-method.

Gaussian Processes 1. Schedule

Linear Methods for Prediction

X t = a t + r t, (7.1)

Bayesian spatial hierarchical modeling for temperature extremes

Lecture 34: Properties of the LSE

STAT 730 Chapter 4: Estimation

Multivariate Bayesian Linear Regression MLAI Lecture 11

ivporbit:an R package to estimate the probit model with continuous endogenous regressors

STAT 100C: Linear models

Chapter 3 - Estimation by direct maximization of the likelihood

Lecture 6: Hypothesis Testing

σ(a) = a N (x; 0, 1 2 ) dx. σ(a) = Φ(a) =

LECTURE 2 LINEAR REGRESSION MODEL AND OLS

Weighted Least Squares

Asymptotic Multivariate Kriging Using Estimated Parameters with Bayesian Prediction Methods for Non-linear Predictands

A time series is called strictly stationary if the joint distribution of every collection (Y t

MIT Spring 2015

Lecture Notes 5 Convergence and Limit Theorems. Convergence with Probability 1. Convergence in Mean Square. Convergence in Probability, WLLN

What s for today. More on Binomial distribution Poisson distribution. c Mikyoung Jun (Texas A&M) stat211 lecture 7 February 8, / 16

Statistics for analyzing and modeling precipitation isotope ratios in IsoMAP

Economics 536 Lecture 7. Introduction to Specification Testing in Dynamic Econometric Models

Generalized Linear Models. Kurt Hornik

BIO5312 Biostatistics Lecture 13: Maximum Likelihood Estimation

Multivariate Regression Analysis

Chapter 12: An introduction to Time Series Analysis. Chapter 12: An introduction to Time Series Analysis

Regression diagnostics

Mean square continuity

Problem Selected Scores

Lecture 20: Linear model, the LSE, and UMVUE

Linear Regression. Junhui Qian. October 27, 2014

Regression Models - Introduction

Cross-covariance Functions for Tangent Vector Fields on the Sphere

Heteroskedasticity. Part VII. Heteroskedasticity

GLS and FGLS. Econ 671. Purdue University. Justin L. Tobias (Purdue) GLS and FGLS 1 / 22

Open Problems in Mixed Models

Spatial Process Estimates as Smoothers: A Review

Variations. ECE 6540, Lecture 10 Maximum Likelihood Estimation

Spatio-Temporal Modelling of Credit Default Data

Properties of the least squares estimates

Lecture 18. Models for areal data. Colin Rundel 03/22/2017

Bayesian Linear Regression

Parameter estimation: ACVF of AR processes

Association studies and regression

Specification Errors, Measurement Errors, Confounding

Technical Vignette 5: Understanding intrinsic Gaussian Markov random field spatial models, including intrinsic conditional autoregressive models

Summer School in Statistics for Astronomers V June 1 - June 6, Regression. Mosuk Chow Statistics Department Penn State University.

Probabilistic Graphical Models

MCMC algorithms for fitting Bayesian models

STAT 518 Intro Student Presentation

Covariance and Correlation

Cox s proportional hazards model and Cox s partial likelihood

Transcription:

Asymptotic standard errors of MLE Suppose, in the previous example of Carbon and Nitrogen in soil data, that we get the parameter estimates For maximum likelihood estimation, we can use Hessian matrix of the loglikelihood function to get the asymptotic standard errors of the maximum likelihood estimates Hessian matrix is the matrix of the the second-order partial derivatives of a function The observed information matrix is the negative of the Hessian matrix of the loglikelihood function evaluated at the maximum likelihood estimators Mikyoung Jun (Texas A&M) stat647 lecture 11 October 8, 2009 1 / 9

Asymptotic standard errors of MLE It is known in statistics theory that maximum likelihood estimators are asymptotically normal with the mean being the true parameter values and the covariance matrix being the inverse of the observed information matrix In particular, the square root of the diagonal entries of the inverse of the observed information matrix are asymptotic standard errors of the parameter estimates Mikyoung Jun (Texas A&M) stat647 lecture 11 October 8, 2009 2 / 9

Asymptotic standard errors of MLE In practice, how do we get the asymptotic standard errors of the parameter estimates? Let us start by the Carbon-Nitrogen example Suppose we are doing maximum likelihood estimation to estimate the parameters In R, you can ask nlm or optim functions to return Hessian matrix Once you get the Hessian matrix, you need to be careful because of the parameter transformation For Carbon-Nitrogen data, we get ˆα = 0.040(0.021), ˆβ = 50.73(33.10), ˆν = 0.52(0.25), ˆδ = 0.029(0.0038), ˆβ 0 = 3.12(0.14), ˆβ 1 = 0.80(0.019) Mikyoung Jun (Texas A&M) stat647 lecture 11 October 8, 2009 3 / 9

Spatial autoregressive models In time series, autoregressive models express the data at time t as a linear combination of the values in the past For example, if Z(t) is a time series of interest, AR(p) model is in the form p Z(t) = c + φ i Z(t i) + ɛ(t) i=1 We can do similar things with spatial data We will see SAR models and CAR models today Mikyoung Jun (Texas A&M) stat647 lecture 11 October 8, 2009 4 / 9

Simultaneous Autoregressive (SAR) Model Consider a spatial regression problem with Gaussian data If B is a matrix of spatial dependence parameters with b i i = 0, Z(s) = m(s)β + e(s) e(s) = B e(s) + v The residuals v i, i = 1,, n, have mean zero and a diagonal covariance matrix Mikyoung Jun (Texas A&M) stat647 lecture 11 October 8, 2009 5 / 9

Simultaneous Autoregressive (SAR) Model We can also express this as (I B)(Z(s) m(s)β) = v Then the covariance matrix of Z(s), Σ SAR = (I B) 1 Σ v (I B T ) 1 if Σ v is the diagonal covariance matrix of v The covariance structure of Z(s) is completely determined by B and Σ v Mikyoung Jun (Texas A&M) stat647 lecture 11 October 8, 2009 6 / 9

Simultaneous Autoregressive (SAR) Model In practice, we may need to model B using a parametric model for b ij We may let B = ρw where W is a proximity matrix that consists of 0 s and 1 s Under this setting, we can easily see how SAR model reduces to a spatial model with uncorrelated errors Mikyoung Jun (Texas A&M) stat647 lecture 11 October 8, 2009 7 / 9

Markov property In time series, if the conditional distribution of Z(t + 1) given Z(s), s = 1,, t is the same as that of Z(t + 1) given Z(t), we say the process has Markov property We can extend this to spatial data We may say a spatial random field Z(s) is a Markov random field if Z(s i ) only depends on its neighbors N i Mikyoung Jun (Texas A&M) stat647 lecture 11 October 8, 2009 8 / 9

Conditional Autoregressive (CAR) Model This model uses the concept of Markov property We consider f (Z(s i ) Z(s) i ) where Z(s) i denotes the vector of all the data except Z(s i ) Specifically we assume each of the conditional distributions is Gaussian and we let E(Z(s i ) Z(s) i ) = m(s i )β + n c ij (Z(s j ) m(s i )β) j=1 Var(Z(s i ) Z(s) i ) = σ 2 i, i = 1,, n Here c ij is nonzero only if s i N i and c ii = 0 Mikyoung Jun (Texas A&M) stat647 lecture 11 October 8, 2009 9 / 9

Conditional Autoregressive (CAR) Model Given conditional distributions, it is not easy to construct joint distributions to do estimation and inference SAR model is only defined for multivariate Gaussian distribution while CAR model may not Mikyoung Jun (Texas A&M) stat647 lecture 11 October 8, 2009 10 / 9