Fusing point and areal level space-time data. data with application to wet deposition

Size: px
Start display at page:

Download "Fusing point and areal level space-time data. data with application to wet deposition"

Transcription

1 Fusing point and areal level space-time data with application to wet deposition Alan Gelfand Duke University Joint work with Sujit Sahu and David Holland

2 Chemical Deposition Combustion of fossil fuel produces various chemicals including sulfate and nitrate gases. In the eastern U.S., most SO 2, and NO x release attributed to power plants. Emitted to the air; wet deposition and dry deposition; interest in total deposition. Deposition means return to the earth s surface by means of precipitation (rain or snow) for example. Wet Deposition = Precipitation Concentration. Wet deposition is responsible for damage to lakes, forests, and streams.

3 Chemical Deposition Combustion of fossil fuel produces various chemicals including sulfate and nitrate gases. In the eastern U.S., most SO 2, and NO x release attributed to power plants. Emitted to the air; wet deposition and dry deposition; interest in total deposition. Deposition means return to the earth s surface by means of precipitation (rain or snow) for example. Wet Deposition = Precipitation Concentration. Wet deposition is responsible for damage to lakes, forests, and streams.

4 Chemical Deposition Combustion of fossil fuel produces various chemicals including sulfate and nitrate gases. In the eastern U.S., most SO 2, and NO x release attributed to power plants. Emitted to the air; wet deposition and dry deposition; interest in total deposition. Deposition means return to the earth s surface by means of precipitation (rain or snow) for example. Wet Deposition = Precipitation Concentration. Wet deposition is responsible for damage to lakes, forests, and streams.

5 Chemical Deposition Combustion of fossil fuel produces various chemicals including sulfate and nitrate gases. In the eastern U.S., most SO 2, and NO x release attributed to power plants. Emitted to the air; wet deposition and dry deposition; interest in total deposition. Deposition means return to the earth s surface by means of precipitation (rain or snow) for example. Wet Deposition = Precipitation Concentration. Wet deposition is responsible for damage to lakes, forests, and streams.

6 Chemical Deposition Combustion of fossil fuel produces various chemicals including sulfate and nitrate gases. In the eastern U.S., most SO 2, and NO x release attributed to power plants. Emitted to the air; wet deposition and dry deposition; interest in total deposition. Deposition means return to the earth s surface by means of precipitation (rain or snow) for example. Wet Deposition = Precipitation Concentration. Wet deposition is responsible for damage to lakes, forests, and streams.

7 Chemical Deposition Combustion of fossil fuel produces various chemicals including sulfate and nitrate gases. In the eastern U.S., most SO 2, and NO x release attributed to power plants. Emitted to the air; wet deposition and dry deposition; interest in total deposition. Deposition means return to the earth s surface by means of precipitation (rain or snow) for example. Wet Deposition = Precipitation Concentration. Wet deposition is responsible for damage to lakes, forests, and streams.

8 The first stage likelihood f (P, Z, Q U, Y, X, V, Ṽ) = f (P U, V) f (Z Y, V) f (Q X, Ṽ) which takes the form [ { } T n t=1 i=1 1 exp u(si, t) 1 exp y(si, t) I (v(s i, t) > 0) { }] 1 exp x(aj, t) I (ṽ(a j, t) > 0) J j=1 where 1 x denotes a degenerate distribution with point mass at x and I( ) is the indicator function.

9 Deposition Model Y (s i, t) = β 0 + β 1 U(s i, t) + β 2 V (s i, t) + (b 0 + b(s i )) X(A ki, t) +η(s i, t) + ɛ(s i, t). Spatially varying coefficients, b = (b(s 1 ),..., b(s n )) is a Gaussian process (GP). Spatio-temporal intercept η t = (η(s 1, t),..., η(s n, t)) is a GP independent in time. Allow for spatially varying calibration of CMAQ. Could imagine common η(s i ). ɛ(s i, t) N(0, σ 2 ɛ ), provides the nugget effect.

10 The second stage models... Precipitation U(s i, t) = α 0 + α 1 V (s i, t) + δ(s i, t), where δ t = (δ(s 1, t),..., δ(s n, t)) is a GP independent in time. CMAQ output X(A j, t) = γ 0 + γ 1 Ṽ (A j, t) + ψ(a j, t), j = 1,..., J. Assume ψ(a j, t) N(0, σψ 2 ), independently.

11 Specification of latent processes Measurement Error Model (MEM) V (s i, t) N(Ṽ (A k i, t), σv 2 ), i = 1,..., n, t = 1,..., T. The process Ṽ (A j, t) is AR in time and CAR in space Ṽ (A j, t) = ρṽ (A j, t 1) + ζ(a j, t), ( J ) ζ(a j, t) N h ji ζ(a i, t), σ2 ζ, m j i=1 Let j define the m j neighboring grid cells of the cell A j. h ji = { 1 m j if i j 0 otherwise.

12 AR in time and CAR in Space Assume the initial condition for Ṽ0: Ṽ (A j, 0) = 1 T T X(A j, t), t=1 giving Ṽ0. Now we can write the CAR in closed form: f (Ṽt Ṽt 1, ρ, σ 2 ζ ) { exp 1 (Ṽt ρṽt 1) D 1 (I H) (Ṽt ρṽt 1) }, 2 D is a diagonal matrix with entries σ 2 ζ /m j. Note that this is an improper CAR.

13 National Atmospheric Deposition Program (NADP) NADP collects point-referenced data at several sites. They then use simple interpolation to produce maps

14 The CMAQ model Community Multi-scale Air Quality Model (CMAQ) A computer simulation model which produces averaged output on 36km, 12 km(used here), and now 4 km grid cells Uses variables such as power station emission volumes, meteorological data, land-use, etc. with atmospheric science (appropriate differential equations) to predict deposition levels. Not driven by monitoring station data. Predictions are biased but no missing data; monitoring data provide more accurate deposition but missingness

15 The CMAQ model Community Multi-scale Air Quality Model (CMAQ) A computer simulation model which produces averaged output on 36km, 12 km(used here), and now 4 km grid cells Uses variables such as power station emission volumes, meteorological data, land-use, etc. with atmospheric science (appropriate differential equations) to predict deposition levels. Not driven by monitoring station data. Predictions are biased but no missing data; monitoring data provide more accurate deposition but missingness

16 The CMAQ model Community Multi-scale Air Quality Model (CMAQ) A computer simulation model which produces averaged output on 36km, 12 km(used here), and now 4 km grid cells Uses variables such as power station emission volumes, meteorological data, land-use, etc. with atmospheric science (appropriate differential equations) to predict deposition levels. Not driven by monitoring station data. Predictions are biased but no missing data; monitoring data provide more accurate deposition but missingness

17 The CMAQ model Community Multi-scale Air Quality Model (CMAQ) A computer simulation model which produces averaged output on 36km, 12 km(used here), and now 4 km grid cells Uses variables such as power station emission volumes, meteorological data, land-use, etc. with atmospheric science (appropriate differential equations) to predict deposition levels. Not driven by monitoring station data. Predictions are biased but no missing data; monitoring data provide more accurate deposition but missingness

18 Our Contribution A fully model-based framework for fusing the NADP and CMAQ wet deposition data To accommodate the point masses at 0, i.e., no wet deposition if no precipitation To accommodate misalignment between NADP data at points and CMAQ data at grid cells in a computational feasible way across space and time To provide spatial interpolation and temporal aggregation Both sulfate and nitrate deposition

19 Our Contribution A fully model-based framework for fusing the NADP and CMAQ wet deposition data To accommodate the point masses at 0, i.e., no wet deposition if no precipitation To accommodate misalignment between NADP data at points and CMAQ data at grid cells in a computational feasible way across space and time To provide spatial interpolation and temporal aggregation Both sulfate and nitrate deposition

20 Our Contribution A fully model-based framework for fusing the NADP and CMAQ wet deposition data To accommodate the point masses at 0, i.e., no wet deposition if no precipitation To accommodate misalignment between NADP data at points and CMAQ data at grid cells in a computational feasible way across space and time To provide spatial interpolation and temporal aggregation Both sulfate and nitrate deposition

21 Our Contribution A fully model-based framework for fusing the NADP and CMAQ wet deposition data To accommodate the point masses at 0, i.e., no wet deposition if no precipitation To accommodate misalignment between NADP data at points and CMAQ data at grid cells in a computational feasible way across space and time To provide spatial interpolation and temporal aggregation Both sulfate and nitrate deposition

22 Our Contribution A fully model-based framework for fusing the NADP and CMAQ wet deposition data To accommodate the point masses at 0, i.e., no wet deposition if no precipitation To accommodate misalignment between NADP data at points and CMAQ data at grid cells in a computational feasible way across space and time To provide spatial interpolation and temporal aggregation Both sulfate and nitrate deposition

23 Inverse Distance weighting (IDW) Poor person s methodology Value at a new site = weighted mean of observations, Weights inversely proportional to the square of the distance. Problems Not model based! Unable to accommodate known covariate - precipitation! Unable to handle 0 s, unable to handle missing data Can t fuse with model output data With dynamic data, can only do independent weekly or aggregated annually No associated uncertainty maps!

24 Inverse Distance weighting (IDW) Poor person s methodology Value at a new site = weighted mean of observations, Weights inversely proportional to the square of the distance. Problems Not model based! Unable to accommodate known covariate - precipitation! Unable to handle 0 s, unable to handle missing data Can t fuse with model output data With dynamic data, can only do independent weekly or aggregated annually No associated uncertainty maps!

25 Inverse Distance weighting (IDW) Poor person s methodology Value at a new site = weighted mean of observations, Weights inversely proportional to the square of the distance. Problems Not model based! Unable to accommodate known covariate - precipitation! Unable to handle 0 s, unable to handle missing data Can t fuse with model output data With dynamic data, can only do independent weekly or aggregated annually No associated uncertainty maps!

26 Change of support problem Fuentes and Raftery, 2005 Need to upscale (block-average) point level Z (s, t) to obtain grid level Z (A j, t). Z (A j, t) = 1 Z (s, t) ds, (1) A j A j PSfrag replacements A ki si Many more A s than s s Use MEM (measurement error model) at point level centred around grid level values Make inference at the point level by downscaling Huge computational advantages, NO integration (1).

27 Change of support problem Fuentes and Raftery, 2005 Need to upscale (block-average) point level Z (s, t) to obtain grid level Z (A j, t). Z (A j, t) = 1 Z (s, t) ds, (1) A j A j PSfrag replacements A ki si Many more A s than s s Use MEM (measurement error model) at point level centred around grid level values Make inference at the point level by downscaling Huge computational advantages, NO integration (1).

28 Hierarchical Bayesian Model Model data weekly. Fuse with gridded weekly CMAQ output. Use weekly precipitation information, available from other monitoring networks. Interpolate in space, predict in time Obtain quarterly and annual maps. Reveal spatial pattern in deposition. Results are illustrative, not definitive.

29 Our data set Data Weekly deposition data from 128 sites in the eastern U.S. for the year Use 120 sites to estimate, remaining 8 to validate. Weekly CMAQ output from J = 33, 390 grid cells (about 1.8 million values!) Weekly precipitation data from 2827 predictive sites.

30 Location of the NADP A B C D E F G H Figure: A map of the study region; points denote the NADP sites for fitting and A-H denote the eight validation sites.

31 Annual Precipitation Figure: Map of annual total precipitation in 2001.

32 Exploratory Analyses wetso4 2 wetno (a) (b) Figure: Boxplot of weekly depositions: (a) sulfate and (b) nitrate.

33 Exploratory Analyses... log precipitation log sulfate (a) log precipitation log nitrate (b) Figure: Deposition against precipitation (both on the log scale): (a) sulfate and (b) nitrate.

34 Exploratory Analyses... log cmaq vals log sulfate (a) log cmaq vals log nitrate (b) Figure: Deposition at the NADP sites against the CMAQ values in the grid cell covering the corresponding NADP site on the log scale: (a) sulfate and (b) nitrate.

35 Modeling Requirements No deposition, Z (s i, t), without precipitation, P(s i, t); enforced by a latent atmospheric space-time process V (s i, t) below, i = 1,..., n = 120, and for each week t, t = 1,..., 52. Similarly, model CMAQ output, Q(A j, t) for each grid cell A j, j = 1,..., J = 33, 390 and for each week t, modeled using a latent atmospheric areal process Ṽ (A j, t). Model everything on the log-scale; latent processes take care of point masses at zero. Avoid log(0) problems.

36 First stage models Precipitation model { exp (U(si, t)) if V (s P(s i, t) = i, t) > 0 0 otherwise, Deposition model Z (s i, t) = { exp (Y (si, t)) if V (s i, t) > 0 0 otherwise. Model for CMAQ output { exp (X(Aj, t)) if Q(A j, t) = Ṽ (A j, t) > 0 0 otherwise.

37 Clarification P s, Z s, and Q s are the observed precipitation, NADP deposition, and CMAQ deposition, respectively. V (s, t) is a conceptual point level latent atmospheric process which drives P(s, t) and Z (s, t). P(s, t) and Z (s, t) = 0 if V (s, t) 0. U(s, t) and Y (s, t) are log precipitation and deposition, respectively. Models below will specify their values when V (s, t) 0 or if P(s, t) or Z (s, t) are missing. Ṽ (A j, t) is a conceptual areal level latent atmospheric process which drives Q(A j, t). X(A j, t) is log CMAQ output where modeling below will specify its values when Ṽ (A j, t) 0.

38 Clarification P s, Z s, and Q s are the observed precipitation, NADP deposition, and CMAQ deposition, respectively. V (s, t) is a conceptual point level latent atmospheric process which drives P(s, t) and Z (s, t). P(s, t) and Z (s, t) = 0 if V (s, t) 0. U(s, t) and Y (s, t) are log precipitation and deposition, respectively. Models below will specify their values when V (s, t) 0 or if P(s, t) or Z (s, t) are missing. Ṽ (A j, t) is a conceptual areal level latent atmospheric process which drives Q(A j, t). X(A j, t) is log CMAQ output where modeling below will specify its values when Ṽ (A j, t) 0.

39 Clarification P s, Z s, and Q s are the observed precipitation, NADP deposition, and CMAQ deposition, respectively. V (s, t) is a conceptual point level latent atmospheric process which drives P(s, t) and Z (s, t). P(s, t) and Z (s, t) = 0 if V (s, t) 0. U(s, t) and Y (s, t) are log precipitation and deposition, respectively. Models below will specify their values when V (s, t) 0 or if P(s, t) or Z (s, t) are missing. Ṽ (A j, t) is a conceptual areal level latent atmospheric process which drives Q(A j, t). X(A j, t) is log CMAQ output where modeling below will specify its values when Ṽ (A j, t) 0.

40 Clarification P s, Z s, and Q s are the observed precipitation, NADP deposition, and CMAQ deposition, respectively. V (s, t) is a conceptual point level latent atmospheric process which drives P(s, t) and Z (s, t). P(s, t) and Z (s, t) = 0 if V (s, t) 0. U(s, t) and Y (s, t) are log precipitation and deposition, respectively. Models below will specify their values when V (s, t) 0 or if P(s, t) or Z (s, t) are missing. Ṽ (A j, t) is a conceptual areal level latent atmospheric process which drives Q(A j, t). X(A j, t) is log CMAQ output where modeling below will specify its values when Ṽ (A j, t) 0.

41 Clarification P s, Z s, and Q s are the observed precipitation, NADP deposition, and CMAQ deposition, respectively. V (s, t) is a conceptual point level latent atmospheric process which drives P(s, t) and Z (s, t). P(s, t) and Z (s, t) = 0 if V (s, t) 0. U(s, t) and Y (s, t) are log precipitation and deposition, respectively. Models below will specify their values when V (s, t) 0 or if P(s, t) or Z (s, t) are missing. Ṽ (A j, t) is a conceptual areal level latent atmospheric process which drives Q(A j, t). X(A j, t) is log CMAQ output where modeling below will specify its values when Ṽ (A j, t) 0.

42 Clarification P s, Z s, and Q s are the observed precipitation, NADP deposition, and CMAQ deposition, respectively. V (s, t) is a conceptual point level latent atmospheric process which drives P(s, t) and Z (s, t). P(s, t) and Z (s, t) = 0 if V (s, t) 0. U(s, t) and Y (s, t) are log precipitation and deposition, respectively. Models below will specify their values when V (s, t) 0 or if P(s, t) or Z (s, t) are missing. Ṽ (A j, t) is a conceptual areal level latent atmospheric process which drives Q(A j, t). X(A j, t) is log CMAQ output where modeling below will specify its values when Ṽ (A j, t) 0.

43 Clarification P s, Z s, and Q s are the observed precipitation, NADP deposition, and CMAQ deposition, respectively. V (s, t) is a conceptual point level latent atmospheric process which drives P(s, t) and Z (s, t). P(s, t) and Z (s, t) = 0 if V (s, t) 0. U(s, t) and Y (s, t) are log precipitation and deposition, respectively. Models below will specify their values when V (s, t) 0 or if P(s, t) or Z (s, t) are missing. Ṽ (A j, t) is a conceptual areal level latent atmospheric process which drives Q(A j, t). X(A j, t) is log CMAQ output where modeling below will specify its values when Ṽ (A j, t) 0.

44 The first stage likelihood f (P, Z, Q U, Y, X, V, Ṽ) = f (P U, V) f (Z Y, V) f (Q X, Ṽ) which takes the form [ { } T n t=1 i=1 1 exp u(si, t) 1 exp y(si, t) I (v(s i, t) > 0) { }] 1 exp x(aj, t) I (ṽ(a j, t) > 0) J j=1 where 1 x denotes a degenerate distribution with point mass at x and I( ) is the indicator function.

45 Deposition Model Y (s i, t) = β 0 + β 1 U(s i, t) + β 2 V (s i, t) + (b 0 + b(s i )) X(A ki, t) +η(s i, t) + ɛ(s i, t). Spatially varying coefficients, b = (b(s 1 ),..., b(s n )) is a Gaussian process (GP). Spatio-temporal intercept η t = (η(s 1, t),..., η(s n, t)) is a GP independent in time. Allow for spatially varying calibration of CMAQ. Could imagine common η(s i ). ɛ(s i, t) N(0, σ 2 ɛ ), provides the nugget effect.

46 The second stage models... Precipitation U(s i, t) = α 0 + α 1 V (s i, t) + δ(s i, t), where δ t = (δ(s 1, t),..., δ(s n, t)) is a GP independent in time. CMAQ output X(A j, t) = γ 0 + γ 1 Ṽ (A j, t) + ψ(a j, t), j = 1,..., J. Assume ψ(a j, t) N(0, σψ 2 ), independently.

47 Specification of latent processes Measurement Error Model (MEM) V (s i, t) N(Ṽ (A k i, t), σv 2 ), i = 1,..., n, t = 1,..., T. The process Ṽ (A j, t) is AR in time and CAR in space Ṽ (A j, t) = ρṽ (A j, t 1) + ζ(a j, t), ( J ) ζ(a j, t) N h ji ζ(a i, t), σ2 ζ, m j i=1 Let j define the m j neighboring grid cells of the cell A j. h ji = { 1 m j if i j 0 otherwise.

48 AR in time and CAR in Space Assume the initial condition for Ṽ0: Ṽ (A j, 0) = 1 T T X(A j, t), t=1 giving Ṽ0. Now we can write the CAR in closed form: f (Ṽt Ṽt 1, ρ, σ 2 ζ ) { exp 1 (Ṽt ρṽt 1) D 1 (I H) (Ṽt ρṽt 1) }, 2 D is a diagonal matrix with entries σ 2 ζ /m j. Note that this is an improper CAR.

49 Clarification Note that we can have Z > 0, Q = 0 and vice versa. Therefore V and Ṽ can have opposite signs This arises because we are modeling at two different scales - need processes at two different scales. We can view V (s, t) Ṽ (A, t)as a deviation from the areal average. We assume these realized deviations are independent across space and time We have a conditional model for V and X given Ṽ. The resulting marginal model for U and Y given Ṽ is multiscale - additive random effects at two scales

50 Clarification Note that we can have Z > 0, Q = 0 and vice versa. Therefore V and Ṽ can have opposite signs This arises because we are modeling at two different scales - need processes at two different scales. We can view V (s, t) Ṽ (A, t)as a deviation from the areal average. We assume these realized deviations are independent across space and time We have a conditional model for V and X given Ṽ. The resulting marginal model for U and Y given Ṽ is multiscale - additive random effects at two scales

51 Clarification Note that we can have Z > 0, Q = 0 and vice versa. Therefore V and Ṽ can have opposite signs This arises because we are modeling at two different scales - need processes at two different scales. We can view V (s, t) Ṽ (A, t)as a deviation from the areal average. We assume these realized deviations are independent across space and time We have a conditional model for V and X given Ṽ. The resulting marginal model for U and Y given Ṽ is multiscale - additive random effects at two scales

52 Clarification Note that we can have Z > 0, Q = 0 and vice versa. Therefore V and Ṽ can have opposite signs This arises because we are modeling at two different scales - need processes at two different scales. We can view V (s, t) Ṽ (A, t)as a deviation from the areal average. We assume these realized deviations are independent across space and time We have a conditional model for V and X given Ṽ. The resulting marginal model for U and Y given Ṽ is multiscale - additive random effects at two scales

53 The second stage specification Deposition model: f (Y t U t, V t, X t, η t, b, θ). Space-time intercept: f (η t θ). Precipitation model: f (U t V t, θ). Measurement Error model: f (V t Ṽ(1) t, θ) CMAQ Model: f (X t Ṽt, θ) AR and CAR Model: f (Ṽt Ṽt 1, θ). T t=1 [ f (Yt U t, V t, X t, η t, b, θ) f (η t θ) f (U t V ] t, θ) f (V t Ṽ(1) t, θ) f (X t Ṽt, θ)f (Ṽt Ṽt 1, θ) f (b θ). θ = (α 0, α 1, β 0, β 1, β 2, b 0, γ 0, γ 1, ρ, σ 2 δ, σ2 b, σ2 η, σ2 ɛ, σ2 ψ, σ2 v, σ 2 ζ ).

54 The second stage specification Deposition model: f (Y t U t, V t, X t, η t, b, θ). Space-time intercept: f (η t θ). Precipitation model: f (U t V t, θ). Measurement Error model: f (V t Ṽ(1) t, θ) CMAQ Model: f (X t Ṽt, θ) AR and CAR Model: f (Ṽt Ṽt 1, θ). T t=1 [ f (Yt U t, V t, X t, η t, b, θ) f (η t θ) f (U t V ] t, θ) f (V t Ṽ(1) t, θ) f (X t Ṽt, θ)f (Ṽt Ṽt 1, θ) f (b θ). θ = (α 0, α 1, β 0, β 1, β 2, b 0, γ 0, γ 1, ρ, σ 2 δ, σ2 b, σ2 η, σ2 ɛ, σ2 ψ, σ2 v, σ 2 ζ ).

55 The second stage specification Deposition model: f (Y t U t, V t, X t, η t, b, θ). Space-time intercept: f (η t θ). Precipitation model: f (U t V t, θ). Measurement Error model: f (V t Ṽ(1) t, θ) CMAQ Model: f (X t Ṽt, θ) AR and CAR Model: f (Ṽt Ṽt 1, θ). T t=1 [ f (Yt U t, V t, X t, η t, b, θ) f (η t θ) f (U t V ] t, θ) f (V t Ṽ(1) t, θ) f (X t Ṽt, θ)f (Ṽt Ṽt 1, θ) f (b θ). θ = (α 0, α 1, β 0, β 1, β 2, b 0, γ 0, γ 1, ρ, σ 2 δ, σ2 b, σ2 η, σ2 ɛ, σ2 ψ, σ2 v, σ 2 ζ ).

56 The second stage specification Deposition model: f (Y t U t, V t, X t, η t, b, θ). Space-time intercept: f (η t θ). Precipitation model: f (U t V t, θ). Measurement Error model: f (V t Ṽ(1) t, θ) CMAQ Model: f (X t Ṽt, θ) AR and CAR Model: f (Ṽt Ṽt 1, θ). T t=1 [ f (Yt U t, V t, X t, η t, b, θ) f (η t θ) f (U t V ] t, θ) f (V t Ṽ(1) t, θ) f (X t Ṽt, θ)f (Ṽt Ṽt 1, θ) f (b θ). θ = (α 0, α 1, β 0, β 1, β 2, b 0, γ 0, γ 1, ρ, σ 2 δ, σ2 b, σ2 η, σ2 ɛ, σ2 ψ, σ2 v, σ 2 ζ ).

57 The second stage specification Deposition model: f (Y t U t, V t, X t, η t, b, θ). Space-time intercept: f (η t θ). Precipitation model: f (U t V t, θ). Measurement Error model: f (V t Ṽ(1) t, θ) CMAQ Model: f (X t Ṽt, θ) AR and CAR Model: f (Ṽt Ṽt 1, θ). T t=1 [ f (Yt U t, V t, X t, η t, b, θ) f (η t θ) f (U t V ] t, θ) f (V t Ṽ(1) t, θ) f (X t Ṽt, θ)f (Ṽt Ṽt 1, θ) f (b θ). θ = (α 0, α 1, β 0, β 1, β 2, b 0, γ 0, γ 1, ρ, σ 2 δ, σ2 b, σ2 η, σ2 ɛ, σ2 ψ, σ2 v, σ 2 ζ ).

58 The second stage specification Deposition model: f (Y t U t, V t, X t, η t, b, θ). Space-time intercept: f (η t θ). Precipitation model: f (U t V t, θ). Measurement Error model: f (V t Ṽ(1) t, θ) CMAQ Model: f (X t Ṽt, θ) AR and CAR Model: f (Ṽt Ṽt 1, θ). T t=1 [ f (Yt U t, V t, X t, η t, b, θ) f (η t θ) f (U t V ] t, θ) f (V t Ṽ(1) t, θ) f (X t Ṽt, θ)f (Ṽt Ṽt 1, θ) f (b θ). θ = (α 0, α 1, β 0, β 1, β 2, b 0, γ 0, γ 1, ρ, σ 2 δ, σ2 b, σ2 η, σ2 ɛ, σ2 ψ, σ2 v, σ 2 ζ ).

59 The second stage specification Deposition model: f (Y t U t, V t, X t, η t, b, θ). Space-time intercept: f (η t θ). Precipitation model: f (U t V t, θ). Measurement Error model: f (V t Ṽ(1) t, θ) CMAQ Model: f (X t Ṽt, θ) AR and CAR Model: f (Ṽt Ṽt 1, θ). T t=1 [ f (Yt U t, V t, X t, η t, b, θ) f (η t θ) f (U t V ] t, θ) f (V t Ṽ(1) t, θ) f (X t Ṽt, θ)f (Ṽt Ṽt 1, θ) f (b θ). θ = (α 0, α 1, β 0, β 1, β 2, b 0, γ 0, γ 1, ρ, σ 2 δ, σ2 b, σ2 η, σ2 ɛ, σ2 ψ, σ2 v, σ 2 ζ ).

60 Graphical representation of our model. Regional atmospheric driver (point) Regional atmospheric centering process (areal) replacements δ(s i, t) V (s i, t) MEM Ṽ (A ki, t) b(s i ) η(s i, t) U(s i, t) Y (s i, t) X(A ki, t) Ṽ (A j, t 1) P(s i, t) (observed precipitation) Z(s i, t) (observed deposition) Q(A ki, t) (CMAQ model output)

61 Graphical representation of a fusion model. replacements Regional atmospheric driver (point) δ(s i, t) η(s i, t) V (s i, t) block average V (A ki, t) a(s i ) U(s i, t) Y (true) (s i, t) block average X(A ki, t) Q(A ki, t) Z (true) (s i, t) true P(s i, t) (observed precipitation) Y (s i, t) Q(A ki, t) Z(s i, t) (observed deposition) (CMAQ model output)

62 Predictions at new locations At a new site s and time t we need Z (s, t ) which depends on Y (s, t ). If P(s, t ) = 0 then Z (s, t ) = 0. Suppose otherwise. Bayesian predictive distributions: π(z pred z obs ) = π(z pred par) π(par z obs )dpar. Need to simulate Y (s, t ). V (s, t ) N(Ṽ (A, t ), σ 2 v). U(s, t ), η(s, t ) and b(s ) are simulated from the conditional distributions at s given s 1,..., s n. Kriging. X(A, t ) =logq(a, t ) if Q(A, t ) > 0, otherwise updated in the MCMC. More details in the paper.

63 Predictions at new locations At a new site s and time t we need Z (s, t ) which depends on Y (s, t ). If P(s, t ) = 0 then Z (s, t ) = 0. Suppose otherwise. Bayesian predictive distributions: π(z pred z obs ) = π(z pred par) π(par z obs )dpar. Need to simulate Y (s, t ). V (s, t ) N(Ṽ (A, t ), σ 2 v). U(s, t ), η(s, t ) and b(s ) are simulated from the conditional distributions at s given s 1,..., s n. Kriging. X(A, t ) =logq(a, t ) if Q(A, t ) > 0, otherwise updated in the MCMC. More details in the paper.

64 Predictions at new locations At a new site s and time t we need Z (s, t ) which depends on Y (s, t ). If P(s, t ) = 0 then Z (s, t ) = 0. Suppose otherwise. Bayesian predictive distributions: π(z pred z obs ) = π(z pred par) π(par z obs )dpar. Need to simulate Y (s, t ). V (s, t ) N(Ṽ (A, t ), σ 2 v). U(s, t ), η(s, t ) and b(s ) are simulated from the conditional distributions at s given s 1,..., s n. Kriging. X(A, t ) =logq(a, t ) if Q(A, t ) > 0, otherwise updated in the MCMC. More details in the paper.

65 Predictions at new locations At a new site s and time t we need Z (s, t ) which depends on Y (s, t ). If P(s, t ) = 0 then Z (s, t ) = 0. Suppose otherwise. Bayesian predictive distributions: π(z pred z obs ) = π(z pred par) π(par z obs )dpar. Need to simulate Y (s, t ). V (s, t ) N(Ṽ (A, t ), σ 2 v). U(s, t ), η(s, t ) and b(s ) are simulated from the conditional distributions at s given s 1,..., s n. Kriging. X(A, t ) =logq(a, t ) if Q(A, t ) > 0, otherwise updated in the MCMC. More details in the paper.

66 Choosing the spatial decay parameters Estimation is challenging due to weak identifiability of variances and ranges. Inconsistent see Stein (1999) and Zhang (2004). So, we fix φ η, φ δ and φ b Use validation mean-square error VMSE = 1 n v 8 52 i=1 t=1 ( Z (s i, t) Ẑ (s i, t) ) 2 I(observed) n v = total number of validation observed (=407, here). Optimal ranges were 500, 1000 and 500 kilometers. VMSE is not sensitive near these values.

67 Model Validation Prediction 3 Prediction 2 O O O O O O O O O O O O OOO OO OO OO OO OO OO O O O OO O O O O O O O OO O O O O O O O O O O O 1 0 O O O O O O O O O O O O O O O O O O OO OO OO OO OO OO OOO O O O OO O OO O O O OO O O OO O O O O O O O O O OO O O O O O O O O O O O O Sulfate Observation (a) Nitrate Observation (b) Figure: Validation versus the observed values at the 8 reserved sites. Validation prediction intervals are plotted as vertical lines. (a) sulfate and (b) nitrate.

68 b(s) Surfaces (a) (b) (c) (d) Figure: (a) Sulfate. (b) Nitrate. (c) The s.d. for sulfate. (d) The s.d. for nitrate.

69 Spatially varying slopes? Do we need the spatially varying b(s) s? Only a few of the b(s i ) are significant; they are small relative to their standard errors. Importance of precipitation and the spatially varying intercept makes it difficult to find spatially varying contribution of CMAQ Fusion approaches also have not found spatially varying intercepts Still can see space-time bias in CMAQ by comparing model predictions with CMAQ output.

70 Spatially varying slopes? Do we need the spatially varying b(s) s? Only a few of the b(s i ) are significant; they are small relative to their standard errors. Importance of precipitation and the spatially varying intercept makes it difficult to find spatially varying contribution of CMAQ Fusion approaches also have not found spatially varying intercepts Still can see space-time bias in CMAQ by comparing model predictions with CMAQ output.

71 Spatially varying slopes? Do we need the spatially varying b(s) s? Only a few of the b(s i ) are significant; they are small relative to their standard errors. Importance of precipitation and the spatially varying intercept makes it difficult to find spatially varying contribution of CMAQ Fusion approaches also have not found spatially varying intercepts Still can see space-time bias in CMAQ by comparing model predictions with CMAQ output.

72 Spatially varying slopes? Do we need the spatially varying b(s) s? Only a few of the b(s i ) are significant; they are small relative to their standard errors. Importance of precipitation and the spatially varying intercept makes it difficult to find spatially varying contribution of CMAQ Fusion approaches also have not found spatially varying intercepts Still can see space-time bias in CMAQ by comparing model predictions with CMAQ output.

73 Spatially varying slopes? Do we need the spatially varying b(s) s? Only a few of the b(s i ) are significant; they are small relative to their standard errors. Importance of precipitation and the spatially varying intercept makes it difficult to find spatially varying contribution of CMAQ Fusion approaches also have not found spatially varying intercepts Still can see space-time bias in CMAQ by comparing model predictions with CMAQ output.

74 Parameter Estimates Sulfate Nitrate mean sd 95%CI mean sd 95%CI α ( , ) ( , ) α (0.1017, ) (0.0843, ) β ( , ) ( , ) β (0.8972, ) (0.8274, ) β ( , ) ( , ) b (0.0386, ) (0.0409, ) γ ( , ) ( , ) γ (0.8891, ) (0.7303, ) ρ (0.7664, ) (0.7468, ) σδ (2.5254, ) (1.7942, ) ση (0.2616, ) (0.3149, ) σɛ (0.0607, ) (0.0588, ) σψ (2.4997, ) (2.2092, ) σv (0.7601, ) (0.7366, ) σζ (0.4322, ) (0.4316, )

75 IDW vs. our model Validation PMSE for the annual totals using the 8 holdout sites For sulfates, IDW PMSE is 20.4, our model PMSE is 8.1 For nitrates, IDW PMSE is 3.5, our model PMSE is 1.3 Would expect improvement given the complexity of our model. However 60% is substantial and perhaps justifies the effort

76 IDW vs. our model Validation PMSE for the annual totals using the 8 holdout sites For sulfates, IDW PMSE is 20.4, our model PMSE is 8.1 For nitrates, IDW PMSE is 3.5, our model PMSE is 1.3 Would expect improvement given the complexity of our model. However 60% is substantial and perhaps justifies the effort

77 IDW vs. our model Validation PMSE for the annual totals using the 8 holdout sites For sulfates, IDW PMSE is 20.4, our model PMSE is 8.1 For nitrates, IDW PMSE is 3.5, our model PMSE is 1.3 Would expect improvement given the complexity of our model. However 60% is substantial and perhaps justifies the effort

78 IDW vs. our model Validation PMSE for the annual totals using the 8 holdout sites For sulfates, IDW PMSE is 20.4, our model PMSE is 8.1 For nitrates, IDW PMSE is 3.5, our model PMSE is 1.3 Would expect improvement given the complexity of our model. However 60% is substantial and perhaps justifies the effort

79 Annual Sulfate Deposition Figure: Model predicted map of annual sulfate deposition in The observed annual totals are labeled; a larger font size is used for the validation sites.

80 Annual Nitrate Deposition Figure: Model predicted map of annual nitrate deposition in The observed annual totals are labeled; a larger font size is used for the validation sites.

81 Map of the length of 95% prediction intervals Figure: Uncertainty map of annual sulfate deposition Figure: Uncertainty map of annual nitrate deposition.

82 Quarterly Sulfate Deposition (a) (b) (c) (d) Figure: (a) Jan Mar, (b) Apr-Jun, (c) Jul Sep, (d) Oct-Dec.

83 Quarterly Nitrate Deposition (a) (b) (c) (d) Figure: (a) Jan Mar, (b) Apr-Jun,(c) Jul Sep, (d) Oct-Dec.

84 Discussion Novel spatio-temporal model for fusing point and areal data which validates well. Inference can be provided for any spatial or temporal aggregation. With yearly data can study trends in deposition with regard to regulatory assessment There are models in between IDW and ours but may sacrifice the features we accommodate. Preferable to fusion using block averaging since number of modeled grid cells much greater than number of monitoring sites, even worse if across time. Develop model for dry deposition, hence total deposition.

Fusing point and areal level space-time data with application

Fusing point and areal level space-time data with application Fusing point and areal level space-time data with application to wet deposition Sujit K. Sahu University of Southampton, UK Alan E. Gelfand Duke University, USA David M. Holland Environmental Protection

More information

Combining Monitoring Data and Computer model Output in Assessing Environmental Exposure

Combining Monitoring Data and Computer model Output in Assessing Environmental Exposure Combining Monitoring Data and Computer model Output in Assessing Environmental Exposure Alan E. GELFAND and Sujit K. SAHU December 21, 28 Abstract In the environmental exposure community numerical models

More information

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes Sudipto Banerjee 1 and Andrew O. Finley 2 1 Biostatistics, School of Public Health, University of Minnesota,

More information

Space-time downscaling under error in computer model output

Space-time downscaling under error in computer model output Space-time downscaling under error in computer model output University of Michigan Department of Biostatistics joint work with Alan E. Gelfand, David M. Holland, Peter Guttorp and Peter Craigmile Introduction

More information

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes Andrew O. Finley Department of Forestry & Department of Geography, Michigan State University, Lansing

More information

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes Andrew O. Finley 1 and Sudipto Banerjee 2 1 Department of Forestry & Department of Geography, Michigan

More information

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes Alan Gelfand 1 and Andrew O. Finley 2 1 Department of Statistical Science, Duke University, Durham, North

More information

Fusing space-time data under measurement error for computer model output

Fusing space-time data under measurement error for computer model output for computer model output (vjb2@stat.duke.edu) SAMSI joint work with Alan E. Gelfand and David M. Holland Introduction In many environmental disciplines data come from two sources: monitoring networks

More information

Bayesian hierarchical modelling for data assimilation of past observations and numerical model forecasts

Bayesian hierarchical modelling for data assimilation of past observations and numerical model forecasts Bayesian hierarchical modelling for data assimilation of past observations and numerical model forecasts Stan Yip Exeter Climate Systems, University of Exeter c.y.yip@ex.ac.uk Joint work with Sujit Sahu

More information

The Importance of Ammonia in Modeling Atmospheric Transport and Deposition of Air Pollution. Organization of Talk:

The Importance of Ammonia in Modeling Atmospheric Transport and Deposition of Air Pollution. Organization of Talk: The Importance of Ammonia in Modeling Atmospheric Transport and Deposition of Air Pollution Organization of Talk: What is modeled Importance of NH 3 emissions to deposition Status of NH 3 emissions (model-based)

More information

Hierarchical Modeling for Multivariate Spatial Data

Hierarchical Modeling for Multivariate Spatial Data Hierarchical Modeling for Multivariate Spatial Data Sudipto Banerjee 1 and Andrew O. Finley 2 1 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A. 2 Department

More information

A Spatio-Temporal Downscaler for Output From Numerical Models

A Spatio-Temporal Downscaler for Output From Numerical Models Supplementary materials for this article are available at 10.1007/s13253-009-0004-z. A Spatio-Temporal Downscaler for Output From Numerical Models Veronica J. BERROCAL,AlanE.GELFAND, and David M. HOLLAND

More information

NRCSE. Misalignment and use of deterministic models

NRCSE. Misalignment and use of deterministic models NRCSE Misalignment and use of deterministic models Work with Veronica Berrocal Peter Craigmile Wendy Meiring Paul Sampson Gregory Nikulin The choice of spatial scale some questions 1. Which spatial scale

More information

Hierarchical Modelling for Multivariate Spatial Data

Hierarchical Modelling for Multivariate Spatial Data Hierarchical Modelling for Multivariate Spatial Data Geography 890, Hierarchical Bayesian Models for Environmental Spatial Data Analysis February 15, 2011 1 Point-referenced spatial data often come as

More information

Spatio-temporal modeling of fine particulate matter

Spatio-temporal modeling of fine particulate matter Spatio-temporal modeling of fine particulate matter Sujit Sahu a University of Southampton http://www.maths.soton.ac.uk/staff/sahu/ Lisbon: March 2004 a Joint work with Alan Gelfand and David Holland Fine

More information

Spatial bias modeling with application to assessing remotely-sensed aerosol as a proxy for particulate matter

Spatial bias modeling with application to assessing remotely-sensed aerosol as a proxy for particulate matter Spatial bias modeling with application to assessing remotely-sensed aerosol as a proxy for particulate matter Chris Paciorek Department of Biostatistics Harvard School of Public Health application joint

More information

Statistics for extreme & sparse data

Statistics for extreme & sparse data Statistics for extreme & sparse data University of Bath December 6, 2018 Plan 1 2 3 4 5 6 The Problem Climate Change = Bad! 4 key problems Volcanic eruptions/catastrophic event prediction. Windstorms

More information

Nearest Neighbor Gaussian Processes for Large Spatial Data

Nearest Neighbor Gaussian Processes for Large Spatial Data Nearest Neighbor Gaussian Processes for Large Spatial Data Abhi Datta 1, Sudipto Banerjee 2 and Andrew O. Finley 3 July 31, 2017 1 Department of Biostatistics, Bloomberg School of Public Health, Johns

More information

Hierarchical Modeling for Spatio-temporal Data

Hierarchical Modeling for Spatio-temporal Data Hierarchical Modeling for Spatio-temporal Data Sudipto Banerjee 1 and Andrew O. Finley 2 1 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A. 2 Department of

More information

Bruno Sansó. Department of Applied Mathematics and Statistics University of California Santa Cruz bruno

Bruno Sansó. Department of Applied Mathematics and Statistics University of California Santa Cruz   bruno Bruno Sansó Department of Applied Mathematics and Statistics University of California Santa Cruz http://www.ams.ucsc.edu/ bruno Climate Models Climate Models use the equations of motion to simulate changes

More information

Standardized Anomaly Model Output Statistics Over Complex Terrain.

Standardized Anomaly Model Output Statistics Over Complex Terrain. Standardized Anomaly Model Output Statistics Over Complex Terrain Reto.Stauffer@uibk.ac.at Outline statistical ensemble postprocessing introduction to SAMOS new snow amount forecasts in Tyrol sub-seasonal

More information

Space-Time Data fusion Under Error in Computer Model Output: An Application to Modeling Air Quality

Space-Time Data fusion Under Error in Computer Model Output: An Application to Modeling Air Quality Biometrics 68, 837 848 September 2012 DOI: 10.1111/j.1541-0420.2011.01725.x Space-Time Data fusion Under Error in Computer Model Output: An Application to Modeling Air Quality Veronica J. Berrocal, 1,

More information

Models for spatial data (cont d) Types of spatial data. Types of spatial data (cont d) Hierarchical models for spatial data

Models for spatial data (cont d) Types of spatial data. Types of spatial data (cont d) Hierarchical models for spatial data Hierarchical models for spatial data Based on the book by Banerjee, Carlin and Gelfand Hierarchical Modeling and Analysis for Spatial Data, 2004. We focus on Chapters 1, 2 and 5. Geo-referenced data arise

More information

Climate also has a large influence on how local ecosystems have evolved and how we interact with them.

Climate also has a large influence on how local ecosystems have evolved and how we interact with them. The Mississippi River in a Changing Climate By Paul Lehman, P.Eng., General Manager Mississippi Valley Conservation (This article originally appeared in the Mississippi Lakes Association s 212 Mississippi

More information

Bayesian spatial quantile regression

Bayesian spatial quantile regression Brian J. Reich and Montserrat Fuentes North Carolina State University and David B. Dunson Duke University E-mail:reich@stat.ncsu.edu Tropospheric ozone Tropospheric ozone has been linked with several adverse

More information

Hierarchical Nearest-Neighbor Gaussian Process Models for Large Geo-statistical Datasets

Hierarchical Nearest-Neighbor Gaussian Process Models for Large Geo-statistical Datasets Hierarchical Nearest-Neighbor Gaussian Process Models for Large Geo-statistical Datasets Abhirup Datta 1 Sudipto Banerjee 1 Andrew O. Finley 2 Alan E. Gelfand 3 1 University of Minnesota, Minneapolis,

More information

Bayesian Approaches for Model Evaluation and Uncertainty Estimation

Bayesian Approaches for Model Evaluation and Uncertainty Estimation Bayesian Approaches for Model Evaluation and Uncertainty Estimation Tsuyoshi TADA National Defense Academy, Japan APHW 2006 Bangkok 1 Introduction In wellgauged basins Parameters can be obtained by calibrations.

More information

The Canadian ADAGIO Project for Mapping Total Atmospheric Deposition

The Canadian ADAGIO Project for Mapping Total Atmospheric Deposition The Canadian ADAGIO Project for Mapping Total Atmospheric Deposition Amanda S. Cole Environment & Climate Change Canada (ECCC) MMF-GTAD Workshop Geneva, Switzerland February 28, 2017 ADAGIO team Amanda

More information

Sub-kilometer-scale space-time stochastic rainfall simulation

Sub-kilometer-scale space-time stochastic rainfall simulation Picture: Huw Alexander Ogilvie Sub-kilometer-scale space-time stochastic rainfall simulation Lionel Benoit (University of Lausanne) Gregoire Mariethoz (University of Lausanne) Denis Allard (INRA Avignon)

More information

Bayesian SAE using Complex Survey Data Lecture 4A: Hierarchical Spatial Bayes Modeling

Bayesian SAE using Complex Survey Data Lecture 4A: Hierarchical Spatial Bayes Modeling Bayesian SAE using Complex Survey Data Lecture 4A: Hierarchical Spatial Bayes Modeling Jon Wakefield Departments of Statistics and Biostatistics University of Washington 1 / 37 Lecture Content Motivation

More information

Hierarchical Bayesian models for space-time air pollution data

Hierarchical Bayesian models for space-time air pollution data Hierarchical Bayesian models for space-time air pollution data Sujit K. Sahu June 14, 2011 Abstract Recent successful developments in Bayesian modeling and computation for large and complex data sets have

More information

A spatial causal analysis of wildfire-contributed PM 2.5 using numerical model output. Brian Reich, NC State

A spatial causal analysis of wildfire-contributed PM 2.5 using numerical model output. Brian Reich, NC State A spatial causal analysis of wildfire-contributed PM 2.5 using numerical model output Brian Reich, NC State Workshop on Causal Adjustment in the Presence of Spatial Dependence June 11-13, 2018 Brian Reich,

More information

Non-gaussian spatiotemporal modeling

Non-gaussian spatiotemporal modeling Dec, 2008 1/ 37 Non-gaussian spatiotemporal modeling Thais C O da Fonseca Joint work with Prof Mark F J Steel Department of Statistics University of Warwick Dec, 2008 Dec, 2008 2/ 37 1 Introduction Motivation

More information

Analysis of Radar-Rainfall Uncertainties and effects on Hydrologic Applications. Emad Habib, Ph.D., P.E. University of Louisiana at Lafayette

Analysis of Radar-Rainfall Uncertainties and effects on Hydrologic Applications. Emad Habib, Ph.D., P.E. University of Louisiana at Lafayette Analysis of Radar-Rainfall Uncertainties and effects on Hydrologic Applications Emad Habib, Ph.D., P.E. University of Louisiana at Lafayette Motivation Rainfall is a process with significant variability

More information

Impacts of climate change on flooding in the river Meuse

Impacts of climate change on flooding in the river Meuse Impacts of climate change on flooding in the river Meuse Martijn Booij University of Twente,, The Netherlands m.j.booij booij@utwente.nlnl 2003 in the Meuse basin Model appropriateness Appropriate model

More information

Hierarchical Modeling for Univariate Spatial Data

Hierarchical Modeling for Univariate Spatial Data Hierarchical Modeling for Univariate Spatial Data Geography 890, Hierarchical Bayesian Models for Environmental Spatial Data Analysis February 15, 2011 1 Spatial Domain 2 Geography 890 Spatial Domain This

More information

On Gaussian Process Models for High-Dimensional Geostatistical Datasets

On Gaussian Process Models for High-Dimensional Geostatistical Datasets On Gaussian Process Models for High-Dimensional Geostatistical Datasets Sudipto Banerjee Joint work with Abhirup Datta, Andrew O. Finley and Alan E. Gelfand University of California, Los Angeles, USA May

More information

Technical Vignette 5: Understanding intrinsic Gaussian Markov random field spatial models, including intrinsic conditional autoregressive models

Technical Vignette 5: Understanding intrinsic Gaussian Markov random field spatial models, including intrinsic conditional autoregressive models Technical Vignette 5: Understanding intrinsic Gaussian Markov random field spatial models, including intrinsic conditional autoregressive models Christopher Paciorek, Department of Statistics, University

More information

A Geostatistical Approach to Linking Geographically-Aggregated Data From Different Sources

A Geostatistical Approach to Linking Geographically-Aggregated Data From Different Sources A Geostatistical Approach to Linking Geographically-Aggregated Data From Different Sources Carol A. Gotway Crawford National Center for Environmental Health Centers for Disease Control and Prevention,

More information

Chapter 4 - Fundamentals of spatial processes Lecture notes

Chapter 4 - Fundamentals of spatial processes Lecture notes Chapter 4 - Fundamentals of spatial processes Lecture notes Geir Storvik January 21, 2013 STK4150 - Intro 2 Spatial processes Typically correlation between nearby sites Mostly positive correlation Negative

More information

Hierarchical Bayesian auto-regressive models for large space time data with applications to ozone concentration modelling

Hierarchical Bayesian auto-regressive models for large space time data with applications to ozone concentration modelling Hierarchical Bayesian auto-regressive models for large space time data with applications to ozone concentration modelling Sujit Kumar Sahu School of Mathematics, University of Southampton, Southampton,

More information

Spatial Inference of Nitrate Concentrations in Groundwater

Spatial Inference of Nitrate Concentrations in Groundwater Spatial Inference of Nitrate Concentrations in Groundwater Dawn Woodard Operations Research & Information Engineering Cornell University joint work with Robert Wolpert, Duke Univ. Dept. of Statistical

More information

Climate Change Impact Analysis

Climate Change Impact Analysis Climate Change Impact Analysis Patrick Breach M.E.Sc Candidate pbreach@uwo.ca Outline July 2, 2014 Global Climate Models (GCMs) Selecting GCMs Downscaling GCM Data KNN-CAD Weather Generator KNN-CADV4 Example

More information

Data Integration Model for Air Quality: A Hierarchical Approach to the Global Estimation of Exposures to Ambient Air Pollution

Data Integration Model for Air Quality: A Hierarchical Approach to the Global Estimation of Exposures to Ambient Air Pollution Data Integration Model for Air Quality: A Hierarchical Approach to the Global Estimation of Exposures to Ambient Air Pollution Matthew Thomas 9 th January 07 / 0 OUTLINE Introduction Previous methods for

More information

X t = a t + r t, (7.1)

X t = a t + r t, (7.1) Chapter 7 State Space Models 71 Introduction State Space models, developed over the past 10 20 years, are alternative models for time series They include both the ARIMA models of Chapters 3 6 and the Classical

More information

Modeling daily precipitation in Space and Time

Modeling daily precipitation in Space and Time Space and Time SWGen - Hydro Berlin 20 September 2017 temporal - dependence Outline temporal - dependence temporal - dependence Stochastic Weather Generator Stochastic Weather Generator (SWG) is a stochastic

More information

Hierarchical Modelling for Univariate Spatial Data

Hierarchical Modelling for Univariate Spatial Data Hierarchical Modelling for Univariate Spatial Data Sudipto Banerjee 1 and Andrew O. Finley 2 1 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A. 2 Department

More information

Bayesian Dynamic Linear Modelling for. Complex Computer Models

Bayesian Dynamic Linear Modelling for. Complex Computer Models Bayesian Dynamic Linear Modelling for Complex Computer Models Fei Liu, Liang Zhang, Mike West Abstract Computer models may have functional outputs. With no loss of generality, we assume that a single computer

More information

Spatial Misalignment

Spatial Misalignment Spatial Misalignment Jamie Monogan University of Georgia Spring 2013 Jamie Monogan (UGA) Spatial Misalignment Spring 2013 1 / 28 Objectives By the end of today s meeting, participants should be able to:

More information

GAMINGRE 8/1/ of 7

GAMINGRE 8/1/ of 7 FYE 09/30/92 JULY 92 0.00 254,550.00 0.00 0 0 0 0 0 0 0 0 0 254,550.00 0.00 0.00 0.00 0.00 254,550.00 AUG 10,616,710.31 5,299.95 845,656.83 84,565.68 61,084.86 23,480.82 339,734.73 135,893.89 67,946.95

More information

Current and Future Impacts of Wildfires on PM 2.5, Health, and Policy in the Rocky Mountains

Current and Future Impacts of Wildfires on PM 2.5, Health, and Policy in the Rocky Mountains Current and Future Impacts of Wildfires on PM 2.5, Health, and Policy in the Rocky Mountains Yang Liu, Ph.D. STAR Grants Kick-off Meeting Research Triangle Park, NC April 5, 2017 Motivation The Rocky Mountains

More information

Convective-scale NWP for Singapore

Convective-scale NWP for Singapore Convective-scale NWP for Singapore Hans Huang and the weather modelling and prediction section MSS, Singapore Dale Barker and the SINGV team Met Office, Exeter, UK ECMWF Symposium on Dynamical Meteorology

More information

Temporal and spatial variations in radiation and energy fluxes across Lake Taihu

Temporal and spatial variations in radiation and energy fluxes across Lake Taihu Temporal and spatial variations in radiation and energy fluxes across Lake Taihu Wang Wei YNCenter Video Conference May 10, 2012 Outline 1. Motivation 2. Hypothesis 3. Methodology 4. Preliminary results

More information

Summary STK 4150/9150

Summary STK 4150/9150 STK4150 - Intro 1 Summary STK 4150/9150 Odd Kolbjørnsen May 22 2017 Scope You are expected to know and be able to use basic concepts introduced in the book. You knowledge is expected to be larger than

More information

Bayesian Forecasting Using Spatio-temporal Models with Applications to Ozone Concentration Levels in the Eastern United States

Bayesian Forecasting Using Spatio-temporal Models with Applications to Ozone Concentration Levels in the Eastern United States 1 Bayesian Forecasting Using Spatio-temporal Models with Applications to Ozone Concentration Levels in the Eastern United States Sujit Kumar Sahu Southampton Statistical Sciences Research Institute, University

More information

Areal data models. Spatial smoothers. Brook s Lemma and Gibbs distribution. CAR models Gaussian case Non-Gaussian case

Areal data models. Spatial smoothers. Brook s Lemma and Gibbs distribution. CAR models Gaussian case Non-Gaussian case Areal data models Spatial smoothers Brook s Lemma and Gibbs distribution CAR models Gaussian case Non-Gaussian case SAR models Gaussian case Non-Gaussian case CAR vs. SAR STAR models Inference for areal

More information

Record date Payment date PID element Non-PID element. 08 Sep Oct p p. 01 Dec Jan p 9.85p

Record date Payment date PID element Non-PID element. 08 Sep Oct p p. 01 Dec Jan p 9.85p 2017/18 Record date Payment date PID element Non-PID element 08 Sep 17 06 Oct 17 9.85p - 9.85p 01 Dec 17 05 Jan 18-9.85p 9.85p 09 Mar 18 06 Apr 18 9.85p - 9.85p Final 22 Jun 18 27 Jul 18 14.65p - 14.65p

More information

Multivariate spatial modeling

Multivariate spatial modeling Multivariate spatial modeling Point-referenced spatial data often come as multivariate measurements at each location Chapter 7: Multivariate Spatial Modeling p. 1/21 Multivariate spatial modeling Point-referenced

More information

Comparing Non-informative Priors for Estimation and Prediction in Spatial Models

Comparing Non-informative Priors for Estimation and Prediction in Spatial Models Environmentrics 00, 1 12 DOI: 10.1002/env.XXXX Comparing Non-informative Priors for Estimation and Prediction in Spatial Models Regina Wu a and Cari G. Kaufman a Summary: Fitting a Bayesian model to spatial

More information

ESTIMATING THE MEAN LEVEL OF FINE PARTICULATE MATTER: AN APPLICATION OF SPATIAL STATISTICS

ESTIMATING THE MEAN LEVEL OF FINE PARTICULATE MATTER: AN APPLICATION OF SPATIAL STATISTICS ESTIMATING THE MEAN LEVEL OF FINE PARTICULATE MATTER: AN APPLICATION OF SPATIAL STATISTICS Richard L. Smith Department of Statistics and Operations Research University of North Carolina Chapel Hill, N.C.,

More information

2016 Meteorology Summary

2016 Meteorology Summary 2016 Meteorology Summary New Jersey Department of Environmental Protection AIR POLLUTION AND METEOROLOGY Meteorology plays an important role in the distribution of pollution throughout the troposphere,

More information

Contents 1 Introduction 2 Statistical Tools and Concepts

Contents 1 Introduction 2 Statistical Tools and Concepts 1 Introduction... 1 1.1 Objectives and Approach... 1 1.2 Scope of Resource Modeling... 2 1.3 Critical Aspects... 2 1.3.1 Data Assembly and Data Quality... 2 1.3.2 Geologic Model and Definition of Estimation

More information

Hierarchical Bayesian Modeling of Multisite Daily Rainfall Occurrence

Hierarchical Bayesian Modeling of Multisite Daily Rainfall Occurrence The First Henry Krumb Sustainable Engineering Symposium Hierarchical Bayesian Modeling of Multisite Daily Rainfall Occurrence Carlos Henrique Ribeiro Lima Prof. Upmanu Lall March 2009 Agenda 1) Motivation

More information

Statistical Models for Monitoring and Regulating Ground-level Ozone. Abstract

Statistical Models for Monitoring and Regulating Ground-level Ozone. Abstract Statistical Models for Monitoring and Regulating Ground-level Ozone Eric Gilleland 1 and Douglas Nychka 2 Abstract The application of statistical techniques to environmental problems often involves a tradeoff

More information

Parameter Estimation in the Spatio-Temporal Mixed Effects Model Analysis of Massive Spatio-Temporal Data Sets

Parameter Estimation in the Spatio-Temporal Mixed Effects Model Analysis of Massive Spatio-Temporal Data Sets Parameter Estimation in the Spatio-Temporal Mixed Effects Model Analysis of Massive Spatio-Temporal Data Sets Matthias Katzfuß Advisor: Dr. Noel Cressie Department of Statistics The Ohio State University

More information

A Bayesian Nonparametric Approach to Monotone Missing Data in Longitudinal Studies with Informative Missingness

A Bayesian Nonparametric Approach to Monotone Missing Data in Longitudinal Studies with Informative Missingness A Bayesian Nonparametric Approach to Monotone Missing Data in Longitudinal Studies with Informative Missingness A. Linero and M. Daniels UF, UT-Austin SRC 2014, Galveston, TX 1 Background 2 Working model

More information

Climate Change: the Uncertainty of Certainty

Climate Change: the Uncertainty of Certainty Climate Change: the Uncertainty of Certainty Reinhard Furrer, UZH JSS, Geneva Oct. 30, 2009 Collaboration with: Stephan Sain - NCAR Reto Knutti - ETHZ Claudia Tebaldi - Climate Central Ryan Ford, Doug

More information

Recovery Analysis Methods and Data Requirements Study

Recovery Analysis Methods and Data Requirements Study Elevation Recovery Analysis Methods and Data Requirements Study Update to NTB Phase II Local Technical Peer Review Group December 3, 2003 Time 1 Schedule for Pumpage Reductions 180 160 140 120 158 121

More information

Communicating Climate Change Consequences for Land Use

Communicating Climate Change Consequences for Land Use Communicating Climate Change Consequences for Land Use Site: Prabost, Skye. Event: Kyle of Lochalsh, 28 th February 28 Further information: http://www.macaulay.ac.uk/ladss/comm_cc_consequences.html Who

More information

Statistical Models for Rainfall with Applications to Index Insura

Statistical Models for Rainfall with Applications to Index Insura Statistical Models for Rainfall with Applications to April 21, 2008 Overview The idea: Insure farmers against the risk of crop failure, like drought, instead of crop failure itself. It reduces moral hazard

More information

Forecasting Wind Ramps

Forecasting Wind Ramps Forecasting Wind Ramps Erin Summers and Anand Subramanian Jan 5, 20 Introduction The recent increase in the number of wind power producers has necessitated changes in the methods power system operators

More information

Climatic and Ecological Conditions in the Klamath Basin of Southern Oregon and Northern California: Projections for the Future

Climatic and Ecological Conditions in the Klamath Basin of Southern Oregon and Northern California: Projections for the Future Climatic and Ecological Conditions in the Klamath Basin of Southern Oregon and Northern California: Projections for the Future A Collaborative Effort by: CLIMATE LEADERSHIP INITIATIVE INSTITUTE FOR A SUSTAINABLE

More information

Statistical downscaling daily rainfall statistics from seasonal forecasts using canonical correlation analysis or a hidden Markov model?

Statistical downscaling daily rainfall statistics from seasonal forecasts using canonical correlation analysis or a hidden Markov model? Statistical downscaling daily rainfall statistics from seasonal forecasts using canonical correlation analysis or a hidden Markov model? Andrew W. Robertson International Research Institute for Climate

More information

Daily Rainfall Disaggregation Using HYETOS Model for Peninsular Malaysia

Daily Rainfall Disaggregation Using HYETOS Model for Peninsular Malaysia Daily Rainfall Disaggregation Using HYETOS Model for Peninsular Malaysia Ibrahim Suliman Hanaish, Kamarulzaman Ibrahim, Abdul Aziz Jemain Abstract In this paper, we have examined the applicability of single

More information

Geostatistical Modeling for Large Data Sets: Low-rank methods

Geostatistical Modeling for Large Data Sets: Low-rank methods Geostatistical Modeling for Large Data Sets: Low-rank methods Whitney Huang, Kelly-Ann Dixon Hamil, and Zizhuang Wu Department of Statistics Purdue University February 22, 2016 Outline Motivation Low-rank

More information

Overview on the modelling setup Seven new features in the modelling framework First results for the Alpine Rhine and Engadin Conclusions and Outlook

Overview on the modelling setup Seven new features in the modelling framework First results for the Alpine Rhine and Engadin Conclusions and Outlook SUB PROJECT: Natural water balance of Switzerland and its most important large river basins Overview on the modelling setup Seven new features in the modelling framework First results for the Alpine Rhine

More information

SIS Meeting Oct AgriCLASS Agriculture CLimate Advisory ServiceS. Phil Beavis - Telespazio VEGA UK Indicators, Models and System Design

SIS Meeting Oct AgriCLASS Agriculture CLimate Advisory ServiceS. Phil Beavis - Telespazio VEGA UK Indicators, Models and System Design 1 SIS Meeting 17-19 Oct 2016 AgriCLASS Agriculture CLimate Advisory ServiceS Phil Beavis - Telespazio VEGA UK Indicators, Models and System Design Michael Sanderson - UK Met Office Climate Data and Weather

More information

Overview of Spatial Statistics with Applications to fmri

Overview of Spatial Statistics with Applications to fmri with Applications to fmri School of Mathematics & Statistics Newcastle University April 8 th, 2016 Outline Why spatial statistics? Basic results Nonstationary models Inference for large data sets An example

More information

Gridded observation data for Climate Services

Gridded observation data for Climate Services Gridded observation data for Climate Services Ole Einar Tveito, Inger Hanssen Bauer, Eirik J. Førland and Cristian Lussana Norwegian Meteorological Institute Norwegian annual temperatures Norwegian annual

More information

A spatio-temporal model for extreme precipitation simulated by a climate model

A spatio-temporal model for extreme precipitation simulated by a climate model A spatio-temporal model for extreme precipitation simulated by a climate model Jonathan Jalbert Postdoctoral fellow at McGill University, Montréal Anne-Catherine Favre, Claude Bélisle and Jean-François

More information

Challenges in modelling air pollution and understanding its impact on human health

Challenges in modelling air pollution and understanding its impact on human health Challenges in modelling air pollution and understanding its impact on human health Alastair Rushworth Joint Statistical Meeting, Seattle Wednesday August 12 th, 2015 Acknowledgements Work in this talk

More information

Weather generators for studying climate change

Weather generators for studying climate change Weather generators for studying climate change Assessing climate impacts Generating Weather (WGEN) Conditional models for precip Douglas Nychka, Sarah Streett Geophysical Statistics Project, National Center

More information

Hierarchical Modelling for Univariate Spatial Data

Hierarchical Modelling for Univariate Spatial Data Spatial omain Hierarchical Modelling for Univariate Spatial ata Sudipto Banerjee 1 and Andrew O. Finley 2 1 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A.

More information

Gaussian predictive process models for large spatial data sets.

Gaussian predictive process models for large spatial data sets. Gaussian predictive process models for large spatial data sets. Sudipto Banerjee, Alan E. Gelfand, Andrew O. Finley, and Huiyan Sang Presenters: Halley Brantley and Chris Krut September 28, 2015 Overview

More information

A STATISTICAL TECHNIQUE FOR MODELLING NON-STATIONARY SPATIAL PROCESSES

A STATISTICAL TECHNIQUE FOR MODELLING NON-STATIONARY SPATIAL PROCESSES A STATISTICAL TECHNIQUE FOR MODELLING NON-STATIONARY SPATIAL PROCESSES JOHN STEPHENSON 1, CHRIS HOLMES, KERRY GALLAGHER 1 and ALEXANDRE PINTORE 1 Dept. Earth Science and Engineering, Imperial College,

More information

Wrapped Gaussian processes: a short review and some new results

Wrapped Gaussian processes: a short review and some new results Wrapped Gaussian processes: a short review and some new results Giovanna Jona Lasinio 1, Gianluca Mastrantonio 2 and Alan Gelfand 3 1-Università Sapienza di Roma 2- Università RomaTRE 3- Duke University

More information

GIST 4302/5302: Spatial Analysis and Modeling

GIST 4302/5302: Spatial Analysis and Modeling GIST 4302/5302: Spatial Analysis and Modeling Basics of Statistics Guofeng Cao www.myweb.ttu.edu/gucao Department of Geosciences Texas Tech University guofeng.cao@ttu.edu Spring 2015 Outline of This Week

More information

Chapter 6 Problems with the calibration of Gaussian HMMs to annual rainfall

Chapter 6 Problems with the calibration of Gaussian HMMs to annual rainfall 115 Chapter 6 Problems with the calibration of Gaussian HMMs to annual rainfall Hidden Markov models (HMMs) were introduced in Section 3.3 as a method to incorporate climatic persistence into stochastic

More information

The Generalized Likelihood Uncertainty Estimation methodology

The Generalized Likelihood Uncertainty Estimation methodology CHAPTER 4 The Generalized Likelihood Uncertainty Estimation methodology Calibration and uncertainty estimation based upon a statistical framework is aimed at finding an optimal set of models, parameters

More information

Simulations of Lake Processes within a Regional Climate Model

Simulations of Lake Processes within a Regional Climate Model Simulations of Lake Processes within a Regional Climate Model Jiming Jin Departments of Watershed Sciences and Plants, Soils, and Climate Utah State University, Logan, Utah Utah State University 850 Faculty

More information

Spatial Misalignment

Spatial Misalignment November 4, 2009 Objectives Land use in Madagascar Example with WinBUGS code Objectives Learn how to realign nonnested block level data Become familiar with ways to deal with misaligned data which are

More information

The National Atmospheric Deposition Program (NADP)

The National Atmospheric Deposition Program (NADP) The National Atmospheric Deposition Program (NADP) Christopher Lehmann Director, Central Analytical Laboratory National Atmospheric Deposition Program Illinois State Water Survey - Prairie Research Institute

More information

Climatography of the United States No

Climatography of the United States No Climate Division: AK 5 NWS Call Sign: ANC Month (1) Min (2) Month(1) Extremes Lowest (2) Temperature ( F) Lowest Month(1) Degree s (1) Base Temp 65 Heating Cooling 90 Number of s (3) Jan 22.2 9.3 15.8

More information

Bayesian Modeling and Inference for High-Dimensional Spatiotemporal Datasets

Bayesian Modeling and Inference for High-Dimensional Spatiotemporal Datasets Bayesian Modeling and Inference for High-Dimensional Spatiotemporal Datasets Sudipto Banerjee University of California, Los Angeles, USA Based upon projects involving: Abhirup Datta (Johns Hopkins University)

More information

STATISTICS OF CLIMATE EXTREMES// TRENDS IN CLIMATE DATASETS

STATISTICS OF CLIMATE EXTREMES// TRENDS IN CLIMATE DATASETS STATISTICS OF CLIMATE EXTREMES// TRENDS IN CLIMATE DATASETS Richard L Smith Departments of STOR and Biostatistics, University of North Carolina at Chapel Hill and Statistical and Applied Mathematical Sciences

More information

IBM Research Report. Gust Speed Forecasting Using Weather Model Outputs and Meteorological Observations

IBM Research Report. Gust Speed Forecasting Using Weather Model Outputs and Meteorological Observations RC25087 (W1012-060) December 14, 2010 Mathematics IBM Research Report Gust Speed Forecasting Using Weather Model Outputs and Meteorological Observations Honggei Li, Fei Liu, Jonathan R. Hosking, Yasuo

More information

Application of Real-Time Rainfall Information System to CSO control. 2 October 2011 Naruhito Funatsu METAWATER Co., Ltd.

Application of Real-Time Rainfall Information System to CSO control. 2 October 2011 Naruhito Funatsu METAWATER Co., Ltd. Application of Real-Time Rainfall Information System to CSO control 2 October 2011 Naruhito Funatsu METAWATER Co., Ltd. Presentation Points Objectives To verify the applicability of the real-time rainfall

More information

A Note on the comparison of Nearest Neighbor Gaussian Process (NNGP) based models

A Note on the comparison of Nearest Neighbor Gaussian Process (NNGP) based models A Note on the comparison of Nearest Neighbor Gaussian Process (NNGP) based models arxiv:1811.03735v1 [math.st] 9 Nov 2018 Lu Zhang UCLA Department of Biostatistics Lu.Zhang@ucla.edu Sudipto Banerjee UCLA

More information

Spatial statistics, addition to Part I. Parameter estimation and kriging for Gaussian random fields

Spatial statistics, addition to Part I. Parameter estimation and kriging for Gaussian random fields Spatial statistics, addition to Part I. Parameter estimation and kriging for Gaussian random fields 1 Introduction Jo Eidsvik Department of Mathematical Sciences, NTNU, Norway. (joeid@math.ntnu.no) February

More information

Predicting Long-term Exposures for Health Effect Studies

Predicting Long-term Exposures for Health Effect Studies Predicting Long-term Exposures for Health Effect Studies Lianne Sheppard Adam A. Szpiro, Johan Lindström, Paul D. Sampson and the MESA Air team University of Washington CMAS Special Session, October 13,

More information