Spatio-temporal prediction of site index based on forest inventories and climate change scenarios

Similar documents
Models for spatial data (cont d) Types of spatial data. Types of spatial data (cont d) Hierarchical models for spatial data

Bruno Sansó. Department of Applied Mathematics and Statistics University of California Santa Cruz bruno

Hierarchical Modeling for Univariate Spatial Data

Hierarchical Modeling for Multivariate Spatial Data

Point-Referenced Data Models

A Framework for Daily Spatio-Temporal Stochastic Weather Simulation

Spatial Lasso with Application to GIS Model Selection. F. Jay Breidt Colorado State University

Using Estimating Equations for Spatially Correlated A

Gaussian predictive process models for large spatial data sets.

ESTIMATING THE MEAN LEVEL OF FINE PARTICULATE MATTER: AN APPLICATION OF SPATIAL STATISTICS

Hierarchical Modelling for Univariate Spatial Data

REGRESSION WITH SPATIALLY MISALIGNED DATA. Lisa Madsen Oregon State University David Ruppert Cornell University

Hierarchical Modelling for Multivariate Spatial Data

Multi-resolution models for large data sets

Introduction to Geostatistics

Chapter 4 - Fundamentals of spatial processes Lecture notes

Regression #3: Properties of OLS Estimator

Bayesian spatial quantile regression

Econometrics of Panel Data

Applied Regression. Applied Regression. Chapter 2 Simple Linear Regression. Hongcheng Li. April, 6, 2013

Modelling of tree height growth

Statistics 910, #5 1. Regression Methods

Statistics for analyzing and modeling precipitation isotope ratios in IsoMAP

Gaussian Processes 1. Schedule

Spatial statistics, addition to Part I. Parameter estimation and kriging for Gaussian random fields

1 Introduction to Generalized Least Squares

Asymptotic Multivariate Kriging Using Estimated Parameters with Bayesian Prediction Methods for Non-linear Predictands

Multi-resolution models for large data sets

A time series is called strictly stationary if the joint distribution of every collection (Y t

Hierarchical Modelling for Univariate Spatial Data

Analysing geoadditive regression data: a mixed model approach

Gridded monthly temperature fields for Croatia for the period

Hierarchical Modeling for Spatio-temporal Data

Regionalización dinámica: la experiencia española

Hierarchical Nearest-Neighbor Gaussian Process Models for Large Geo-statistical Datasets

Nonparametric Small Area Estimation Using Penalized Spline Regression

A general mixed model approach for spatio-temporal regression data

Estimating Timber Volume using Airborne Laser Scanning Data based on Bayesian Methods J. Breidenbach 1 and E. Kublin 2

Chapter 4 - Fundamentals of spatial processes Lecture notes

Climate Change: the Uncertainty of Certainty

Regression, Ridge Regression, Lasso

MS&E 226: Small Data. Lecture 11: Maximum likelihood (v2) Ramesh Johari

Kriging Luc Anselin, All Rights Reserved

ECON 3150/4150, Spring term Lecture 6

Spatial bias modeling with application to assessing remotely-sensed aerosol as a proxy for particulate matter

Multivariate spatial models and the multikrig class

Quasi-likelihood Scan Statistics for Detection of

Introduction. Semivariogram Cloud

Future Climate Projection in Mainland Southeast Asia: Climate change scenario for 21 st century. Suppakorn Chinvanno

Weighted Least Squares

Peter Hoff Linear and multilinear models April 3, GLS for multivariate regression 5. 3 Covariance estimation for the GLM 8

Climate Change Impact Analysis

Linear Regression. In this problem sheet, we consider the problem of linear regression with p predictors and one intercept,

Modeling Real Estate Data using Quantile Regression

Chapter 1. Summer School GEOSTAT 2014, Spatio-Temporal Geostatistics,

Restricted Maximum Likelihood in Linear Regression and Linear Mixed-Effects Model

Gridded observation data for Climate Services

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes

Regression #4: Properties of OLS Estimator (Part 2)

The regression model with one fixed regressor cont d

Gridding of precipitation and air temperature observations in Belgium. Michel Journée Royal Meteorological Institute of Belgium (RMI)

Robust Inference for Regression with Spatially Correlated Errors

Nearest Neighbor Gaussian Processes for Large Spatial Data

Spatial analysis is the quantitative study of phenomena that are located in space.

Better Simulation Metamodeling: The Why, What and How of Stochastic Kriging

12 - Nonparametric Density Estimation

IEOR 165 Lecture 7 1 Bias-Variance Tradeoff

A SARIMAX coupled modelling applied to individual load curves intraday forecasting

CGE TRAINING MATERIALS ON VULNERABILITY AND ADAPTATION ASSESSMENT. Climate change scenarios

Ph.D. Qualifying Exam Friday Saturday, January 6 7, 2017

ECON 5350 Class Notes Functional Form and Structural Change

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes

Parameter Estimation

Introduction to Weather Data Cleaning

Statistics 910, #15 1. Kalman Filter

Modeling the Covariance

Training: Climate Change Scenarios for PEI. Training Session April Neil Comer Research Climatologist

Model Selection for Geostatistical Models

BAYESIAN KRIGING AND BAYESIAN NETWORK DESIGN

A Bayesian Perspective on Residential Demand Response Using Smart Meter Data

Subproject 01 Climate Change in the Okavango region

performance EARTH SCIENCE & CLIMATE CHANGE Mujtaba Hassan PhD Scholar Tsinghua University Beijing, P.R. C

Linear Models in Machine Learning

CBMS Lecture 1. Alan E. Gelfand Duke University

Probabilistic forecasting of solar radiation

10. Time series regression and forecasting

Weighted Least Squares

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes

Econometrics II - EXAM Answer each question in separate sheets in three hours

Panel Data Models. Chapter 5. Financial Econometrics. Michael Hauser WS17/18 1 / 63

Generative Models and Stochastic Algorithms for Population Average Estimation and Image Analysis

Covariance function estimation in Gaussian process regression

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes

Regression with correlation for the Sales Data

Machine Learning Approaches to Crop Yield Prediction and Climate Change Impact Assessment

Economics 582 Random Effects Estimation

Spatial Interpolation of daily Temperature and Precipitation for the Fennoscandia

Single Equation Linear GMM with Serially Correlated Moment Conditions

mboost - Componentwise Boosting for Generalised Regression Models

Statistical downscaling daily rainfall statistics from seasonal forecasts using canonical correlation analysis or a hidden Markov model?

Transcription:

Forest Research Institute Spatio-temporal prediction of site index based on forest inventories and climate change scenarios Arne Nothdurft 1, Thilo Wolf 1, Andre Ringeler 2, Jürgen Böhner 2, Joachim Saborowski 3 1 Forest Research Institute Baden-Württemberg, Department of Biometrics and Informatics 2 University of Hamburg, Institute of Geography 3 University of Göttingen, Department of Ecoinformatics, Biometrics and Forest Growth and Department of Ecosystem Modelling 7-9 November 2012, Freiburg Baden-Württemberg

Motivation: Spatio-temporal prediction of site index Definition (Site index) A relative measure of forest site quality based on the height of the dominant trees at a reference age. Goals: Reformulation of the existing site-productivity model of Schöpfer & Moosmayer (1972). Application of comprehensive data from regional and national forest inventories. To provide site-index predictions based on climate scenario data.

Dominant height measurements on forest inventory sample plots Species Sample plots Spruce 91,642 Fir 33,987 Douglas-fir 9,356 Pine 31,559 Oak 28,230 Beech 81,241

Climate data We use 1 retrospective climate data for regression modeling, 2 climate scenario projections for spatio-temporal predictions.

Dominant height growth: Sloboda s (1971) differential equation h (t o h(t), t) = ψ 1 ( h(t) ψ 1 [ ] ) ψ exp 2 ψ (ψ 3 1)t (ψ 3 o 1) 2 (ψ 3 1)t (ψ 3 1) 40 Height [m] 30 20 10 0 0 20 40 60 80 100 120 Age [Years]

The spatial process of site index The spatial process of site index is defined for any arbitrary location s as y(s) = x(s) β + b(s) + ɛ(s), and is composed of (i) a fixed linear trend dependent on site and climate variables in x(s), (ii) a spatially correlated error in terms of a Gaussian process b(s), with 0 mean, variance θ 2, and spatial autocovariance function c(h), h being the distance between two locations s i and s j, (iii) and an unstructured error ɛ(s) N ( 0, σ 2). universal kriging (UK) model

Reformulation of the UK model as mixed model The UK model can be reformulated as mixed model y = Xβ + Ub + ɛ having covariance matrix Cov(y) = Σ = θ 2 URU + σ 2 I n and correlation function Corr [b(s i ), b(s j )] = R = ρ(s i s j ).

Spatial autocorrelation The exponential correlation function ρ (h) = exp ( h/α) leads to the covariance function { θ 2 exp ( h c (h) = α), for h > 0 σ 2 + θ 2, for h = 0 and to the parametric semi-variogram model { σ 2 + θ 2 [ 1 exp ( h γ (h) = α)], for h > 0 0, for h = 0.

Estimation: iteratively re-weighted generalized least squares (IRWGLS) (I) Perform prior OLS ˆβ OLS = (X X) 1 X y to estimate the mean model (II) Iterate between (i) and (ii) until convergence is achieved (i) Fit the parametric semi-variogram model to the empirical counterpart min arg ˆγ (h) = K { 2γ (h(u)) 2γ (h(u); θ, σ, α) } 2 u=1 1 2 N(h) [e(s i ) e(s j )] 2 N(h) N(h) {(s i ; s j ) : s i s j = h; i, j = 1,..., n} { y X with e = ˆβ OLS, in the first iteration y X ˆβ GLS, otherwise (ii) Perform GLS ˆβ GLS = ( X ˆΣ 1 X) 1 X ˆΣ 1 y

Estimation Dilemma of large n Autocovariance matrix is high-dimensional and becomes intractable. Iteratively re-weighted generalized least squares (IRWGLS) is not feasible. Hypotheses (given large data exists) Kriging of OLS residuals in moving 100 point neighborhoods leads to an unbiased spatial predictor. The error of the mean model can be neglected for constructions of prediction intervals. B-spline regression techniques provide robust estimates of biologically plausible cause-and-effect curves.

Prediction Site-index predictions are spatially kriged at any location s 0 in moving 100 point neighborhoods by ) ŷ 0 = x(s 0 ) ˆβOLS + ĉ 1 ˆΣ (y X ˆβOLS, indicates the simplification of the universal kriging predictor in terms of a local predictor and ˆΣ has dimension 100 100. For interval prediction, the universal kriging variance E (y 0 ŷ 0 ) 2 =σ 2 + θ 2 c Σ 1 c is approximated by + [ x(s 0 ) c Σ 1 X ] [ X Σ 1 X ] 1 [ x(s0 ) c Σ 1 X ] Var (ŷ 0 ) =ˆσ 2 + ˆθ 2 ĉ 1 ˆΣ, in which the error of the estimator of β is neglected. Constructions of 95%-intervals are obatained by ŷ 0 ± 1.96 Var (ŷ 0 ).

Mean model based on B-Spline regression techniques A B-spline function is defined for the range of a vector of knots κ = (κ 1,..., κ l+1,...,..., κ d l,..., κ d ) T, and consists of d basis functions ( B1(x), l..., Bd l (x)) of degree l. A single basis functions is defined for the range between l + 2 consecutive knots and overlaps the 2l neighboring basis functions. With given κ a basis function of degree l = 0 is defined by { Bv 0 1 for κ v x < κ v+1 = 1 [κv,κ v+1)(x) = v = 1,..., d 1, 0 otherwise, and basis functions of higher order are recursively defined by B l v(x) = x κ v κ v+l κ v B l 1 v (x) + κ v+l+1 x κ v+l+1 κ v+1 B l 1 v+1 (x). To guarantee that d v=1 Bl v(x) = 1 holds for every x [κ 1, κ d ], boundary knots are replicated l times, so that κ 1 =... = κ l+1 and κ d l =... = κ d.

Mean model based on B-Spline regression techniques The entire regression matrix X = (X 1,, X p ) is composed of the p sub-matrices B1(x l z1 )... Bd l (x z1) X z =... B1(x l zn )... Bd l (x zn) Note: B-spline regression technique is nothing else than a multiple linear regression model!

Mean model based on B-Spline regression techniques Mean model based on B-Spline regression techniques and with natural boundary constraints. 0.0 0.2 0.4 0.6 0.8 1.0 1.2 2 1 0 1 X Variable Y Variable Degree 3 0.0 0.2 0.4 0.6 0.8 1.0 1.2 2 1 0 1 X Variable Y Variable Degree 3

Mean model based on B-Spline regression techniques 45 Dgl.Fir Site index [m] 40 35 30 25 300 500 700 900 Precipitation [mm] Spruce Beech Sil.Fir Pine Oak

Mean model based on B-Spline regression techniques 45 40 Site index [m] 35 30 25 20 1500 2500 3500 Temperature [ C] Dgl.Fir Sil.Fir Spruce Pine Oak Beech

Motivation Data Predictions of relative changes of site index Methods Results

Predictions of relative changes of site index

Motivation Data Predictions of relative changes of site index Methods Results

Validation based on 1000 simulations of Gaussian random fields Bias Coverage rate Density 0.0 0.4 0.8 1.2 1.0 0.5 0.0 0.5 1.0 Density 0 10 20 30 40 50 0.925 0.95 0.975

Validation based on 1000 simulations of Gaussian random fields 14 12 Semivariance 10 8 6 4 2 0 Truth Mean 95% Interv. 0 2000 4000 6000 8000 Distance [m]

Résumé Site index decreases in low elevation areas and increases in mountainous regions. Silver fir and oak stands show increased site index also on lower elevation sites. Site index of Scots pine is less affected by a changing climate. Site conditions in the Alpine foothills region remain highly productive for growth of Norway spruce.

Résumé If a universal kriging model is applied to a large forest inventory data set, then OLS is sufficient for the estimation of the mean and of the spatial covariance, spatial mean plus kriged OLS-residuals in moving 100 point neighborhoods is a practically unbiased predictor, and approximating the UK variance by neglecting the error of the mean yields a quasi-exact interval predictor.

Forest Research Institute Ende Nothdurft, A., Wolf, T., Ringeler, A., Böhner, J. & Saborowski, J. 2012. Spatio-temporal prediction of site index based on forest inventories and climate change scenarios. Forest Ecology and Management, 279, 97 111. http://dx.doi.org/10.1016/j.foreco.2012.05.018. Thank you for your attentation! Baden-Württemberg

Retrospective climate data 1 Lateral boundary conditions: NCAR/NCEP reanalysis series in resolution 2.5 2.5 (lat./long.). 2 Dynamical downscaling in two steps: non-hydrostatic regional climate model (RCM) Weather Research and Forecasting (WRF) 5 km 5 km resolution. 3 Statistical downscaling: observations from the met-station network of the German National Meteorological Service 50 m 50 m resolution. 4 Bias correction and smoothing. 30-year long-term means (between 1978 and 2007) as regressor covariates: total precipitation during the growing season (PGS) total of average daily temperatures during the growing season (TGS)

Climate projection data 1 Existing runs of the hydrostatic RCM REMO on initiative of the German Federal Environment Agency Lateral boundary conditions: general circulation model (GCM) ECHAM5-MPIOM. 10 km 10 km horizontal grid resolution. Scenarios A1B, A2 and B1 from Special Report on Emissions Scenarios (SRES). 2 Statistical downscaling 1 km 1 km: see retrospective climate data. 3 Spatio-temporal bias correction.