A general mixed model approach for spatio-temporal regression data

Similar documents
Analysing geoadditive regression data: a mixed model approach

Modelling geoadditive survival data

Kneib, Fahrmeir: Supplement to "Structured additive regression for categorical space-time data: A mixed model approach"

mboost - Componentwise Boosting for Generalised Regression Models

Estimating Timber Volume using Airborne Laser Scanning Data based on Bayesian Methods J. Breidenbach 1 and E. Kublin 2

Variable Selection and Model Choice in Survival Models with Time-Varying Effects

Ronald Christensen. University of New Mexico. Albuquerque, New Mexico. Wesley Johnson. University of California, Irvine. Irvine, California

Aggregated cancer incidence data: spatial models

for function values or parameters, which are neighbours in the domain of a metrical covariate or a time scale, or in space. These concepts have been u

Beyond Mean Regression

Regularization in Cox Frailty Models

Stat 5101 Lecture Notes

Working Papers in Economics and Statistics

Modelling spatial patterns of distribution and abundance of mussel seed using Structured Additive Regression models

Multivariate Survival Analysis

Structured Additive Regression Models: An R Interface to BayesX

Latent Variable Models for Binary Data. Suppose that for a given vector of explanatory variables x, the latent

Bayesian linear regression

Bayesian Linear Regression

Multilevel Statistical Models: 3 rd edition, 2003 Contents

LASSO-Type Penalization in the Framework of Generalized Additive Models for Location, Scale and Shape

Disease mapping with Gaussian processes

Modeling Real Estate Data using Quantile Regression

Review. DS GA 1002 Statistical and Mathematical Models. Carlos Fernandez-Granda

PENALIZED LIKELIHOOD PARAMETER ESTIMATION FOR ADDITIVE HAZARD MODELS WITH INTERVAL CENSORED DATA

Generalized Linear Models for Non-Normal Data

Density Estimation. Seungjin Choi

Frailty Modeling for clustered survival data: a simulation study

Estimating prediction error in mixed models

Bayesian non-parametric model to longitudinally predict churn

STAT Section 2.1: Basic Inference. Basic Definitions

PENALIZED STRUCTURED ADDITIVE REGRESSION FOR SPACE-TIME DATA: A BAYESIAN PERSPECTIVE

Lecture 5 Models and methods for recurrent event data

Disease mapping with Gaussian processes

[Part 2] Model Development for the Prediction of Survival Times using Longitudinal Measurements

Luke B Smith and Brian J Reich North Carolina State University May 21, 2013

A short introduction to INLA and R-INLA

Multistate Modeling and Applications

Bayesian Linear Models

Integrated Likelihood Estimation in Semiparametric Regression Models. Thomas A. Severini Department of Statistics Northwestern University

Prerequisite: STATS 7 or STATS 8 or AP90 or (STATS 120A and STATS 120B and STATS 120C). AP90 with a minimum score of 3

Geoadditive Latent Variable Modelling of Child Morbidity and Malnutrition in Nigeria

Beyond MCMC in fitting complex Bayesian models: The INLA method

Variable Selection for Generalized Additive Mixed Models by Likelihood-based Boosting

STAT 518 Intro Student Presentation

Contents. Part I: Fundamentals of Bayesian Inference 1

Fahrmeir: Discrete failure time models

Introduction to Statistical Analysis

The linear model is the most fundamental of all serious statistical models encompassing:

Hierarchical Modeling for Multivariate Spatial Data

Non-Parametric Bayes

General Regression Model

Continuous Time Survival in Latent Variable Models

Semiparametric Generalized Linear Models

Bayesian Inference on Joint Mixture Models for Survival-Longitudinal Data with Multiple Features. Yangxin Huang

Longitudinal breast density as a marker of breast cancer risk

Scaling intrinsic Gaussian Markov random field priors in spatial modelling

Bayesian Networks in Educational Assessment

Wrapped Gaussian processes: a short review and some new results

Hierarchical Modeling for Univariate Spatial Data

Package threg. August 10, 2015

Machine learning: Hypothesis testing. Anders Hildeman

Gaussian processes for spatial modelling in environmental health: parameterizing for flexibility vs. computational efficiency

Other Survival Models. (1) Non-PH models. We briefly discussed the non-proportional hazards (non-ph) model

Bayesian Inference in GLMs. Frequentists typically base inferences on MLEs, asymptotic confidence

CTDL-Positive Stable Frailty Model

Survival Analysis Math 434 Fall 2011

Default Priors and Effcient Posterior Computation in Bayesian

Stat 542: Item Response Theory Modeling Using The Extended Rank Likelihood

Subject CS1 Actuarial Statistics 1 Core Principles

Practical Bayesian Quantile Regression. Keming Yu University of Plymouth, UK

Bayesian Multivariate Logistic Regression

Discussion of Missing Data Methods in Longitudinal Studies: A Review by Ibrahim and Molenberghs

Frailty Modeling for Spatially Correlated Survival Data, with Application to Infant Mortality in Minnesota By: Sudipto Banerjee, Mela. P.

NORGES TEKNISK-NATURVITENSKAPELIGE UNIVERSITET

Bayesian Methods in Multilevel Regression

Individualized Treatment Effects with Censored Data via Nonparametric Accelerated Failure Time Models

Hierarchical Modelling for Univariate Spatial Data

Bayesian Linear Models

CASE STUDY: Bayesian Incidence Analyses from Cross-Sectional Data with Multiple Markers of Disease Severity. Outline:

Multivariate Bayesian Linear Regression MLAI Lecture 11

STA 216: GENERALIZED LINEAR MODELS. Lecture 1. Review and Introduction. Much of statistics is based on the assumption that random

Biost 518 Applied Biostatistics II. Purpose of Statistics. First Stage of Scientific Investigation. Further Stages of Scientific Investigation

Bayesian Regression Linear and Logistic Regression

Sample Size Calculations for ROC Studies: Parametric Robustness and Bayesian Nonparametrics

The sbgcop Package. March 9, 2007

Technical Vignette 5: Understanding intrinsic Gaussian Markov random field spatial models, including intrinsic conditional autoregressive models

Bayesian spatial quantile regression

Package SimSCRPiecewise

Bayesian Semiparametric Methods for Joint Modeling Longitudinal and Survival Data

Power and Sample Size Calculations with the Additive Hazards Model

BIAS OF MAXIMUM-LIKELIHOOD ESTIMATES IN LOGISTIC AND COX REGRESSION MODELS: A COMPARATIVE SIMULATION STUDY

Regression models for multivariate ordered responses via the Plackett distribution

A Framework for Unbiased Model Selection Based on Boosting

On Bayesian Computation

Data-analysis and Retrieval Ordinal Classification

gamboostlss: boosting generalized additive models for location, scale and shape

A Bayesian Perspective on Residential Demand Response Using Smart Meter Data

CIMAT Taller de Modelos de Capture y Recaptura Known Fate Survival Analysis

You know I m not goin diss you on the internet Cause my mama taught me better than that I m a survivor (What?) I m not goin give up (What?

Transcription:

A general mixed model approach for spatio-temporal regression data Thomas Kneib, Ludwig Fahrmeir & Stefan Lang Department of Statistics, Ludwig-Maximilians-University Munich 1. Spatio-temporal regression data 2. Examples I: Categorical forest health data 3. Structured additive regression models 4. Mixed model based inference 5. Software 6. Examples II: Leukemia survival times 16.12.2004

Spatio-temporal regression data Spatio-temporal regression data Regression in a general sense: Generalized linear models, multicategorical models, survival models. Usual regression data: Some response with categorical and continuous covariates. Spatio-temporal regression data: in addition spatial and temporal information. Special issues: Spatial and spatio-temporal correlations, Time- or space-varying effects, Non-linear effects, Complex interactions, Unobserved heterogeneity. A general mixed model approach for spatio-temporal regression data 1

Example I: Categorical forest health data Example I: Categorical forest health data Yearly forest health inventories carried out from 1983 to 2001. 83 beeches within a 15 km times 10 km area. Response: defoliation degree of beech i in year t, measured in three ordered categories: y it = 1 no defoliation, y it = 2 defoliation 25% or less, y it = 3 defoliation above 25%. Cumulative probit model: P (y it r) = Φ [θ r f 1 (t) f 2 (age it ) f 3 (t, age it ) f spat (s i ) u itγ] with standard normal cdf Φ and thresholds = θ 0 < θ 1 < θ 2 < θ 3 = A general mixed model approach for spatio-temporal regression data 2

Example I: Categorical forest health data 2.5 Time trend. 0-2.5 1983 1989 1995 2001 calendar time 3 0-3 Age effect. -6 7 39 71 103 135 167 199 231 age in years A general mixed model approach for spatio-temporal regression data 3

-1.5-1 -0.5 0 0.5 1 1.5 Thomas Kneib Example I: Categorical forest health data Spatial effect. -2.5 0 2.5 200 Interaction effect. 150 100 age of the tree 50 1985 1990 1995 2000 calendar time A general mixed model approach for spatio-temporal regression data 4

Structured additive regression Structured additive regression General Idea: Replace usual parametric predictor with a flexible semiparametric predictor containing Nonparametric effects of time scales and continuous covariates (P-splines, random walks), Spatial effects (Markov random fields, stationary Gaussian random fields), Interaction surfaces (2d P-splines), Varying coefficient terms (continuous and spatial effect modifiers), Random intercepts and random slopes. In a Bayesian context, all effects can be cast into one general framework. A general mixed model approach for spatio-temporal regression data 5

Mixed model based inference Mixed model based inference Each term in the predictor is associated with a vector of regression coefficients with improper multivariate Gaussian prior: p(β j τ 2 j ) exp ( 1 2τ 2 j β jk j β j ) Reparametrize the model to a proper mixed model. Obtain empirical Bayes estimates via iterating Penalized maximum likelihood for regression coefficients. Restricted Maximum / Marginal likelihood for variance parameters. A general mixed model approach for spatio-temporal regression data 6

Software Software Implemented in the public domain software package BayesX. Allows for spatio-temporal regression in the context of Univariate responses from exponential families, Multicategorical responses with ordered and unordered categories, Continuous time survival analysis. Available from http://www.stat.uni-muenchen.de/~lang/bayesx A general mixed model approach for spatio-temporal regression data 7

Examples II: Leukemia survival times Examples II: Leukemia survival times Survival time of adults after diagnosis of acute myeloid leukemia. 1,043 cases diagnosed between 1982 and 1998 in Northwest England. Spatial information in different resolutions. Model: λ(t; ) = λ 0 (t) exp[f 1 (age) + f 2 (wbc) + f 3 (tpi) + f spat (s) + γsex] A general mixed model approach for spatio-temporal regression data 8

Examples II: Leukemia survival times 4 log(baseline) Log-baseline hazard. 1-2 -5 0 7 14 time in years.5 effect of townsend deprivation index.25 0 -.25 Effect of deprivation. -.5-6 -2 2 6 10 townsend deprivation index A general mixed model approach for spatio-temporal regression data 9

Examples II: Leukemia survival times -0.44 0 0.3 District-level analysis 0.44 0.3 Individual-level analysis A general mixed model approach for spatio-temporal regression data 10

Discussion Discussion Bayesian treatment of complex regression models for spatio-temporal data without relying on MCMC simulation techniques. Closely related to penalized likelihood in a frequentist setting. Future work: Extended modelling for categorical responses, e.g. with correlated latent utilities. More general censoring schemes for survival times, e.g. interval censoring. Anisotropic spatial effects. 3d extensions of P-splines. A general mixed model approach for spatio-temporal regression data 11

References References Kneib, T. and Fahrmeir, L. (2004): A mixed model approach for structured hazard regression. SFB 386 Discussion Paper 400, University of Munich. Kneib, T. and Fahrmeir, L. (2004): Structured additive regression for categorical space-time data: A mixed model approach. Under revision for Biometrics. Fahrmeir, L., Kneib, T. and Lang, S. (2004): Penalized structured additive regression for space-time data: A Bayesian perspective. Statistica Sinica, 14, 715-745. Available from http://www.stat.uni-muenchen.de/~kneib A general mixed model approach for spatio-temporal regression data 12