E(x i ) = µ i. 2 d. + sin 1 d θ 2. for d < θ 2 0 for d θ 2
|
|
- Oliver Dorsey
- 5 years ago
- Views:
Transcription
1 1 Gaussian Processes Definition 1.1 A Gaussian process { i } over sites i is defined by its mean function and its covariance function E( i ) = µ i c ij = Cov( i, j ) plus joint normality of the finite dimensional distributions. Hence restricted to the points labelled by 1,..., n is ( 1,..., n ) T and it has a n-variate Gaussian distribution N(µ, Σ), where µ = (µ 1,..., µ n ) T and Σ = (c ij ). Note that the covariance function c ij must be positive definite (ie. any covariance matri created from a finite dimensional set of i s must be positive definite: a T Σa > 0, for any non-ero vector a). Covariance functions The restriction that the function {c ij } be positive definite can make the search for valid covariance functions difficult. Most covariance functions model covariance between sites i and j as a function of between the two sites d ij = dist(i, j), where dist(i, j) is typically Euclidean, or a simple modification of it. Hence c ij = C(d ij ). It is standard to choose from a number of parameteried covariance functions, often called covariograms, listed below: Power family C(d θ, p) = θ 1 ep{ d/θ 2 p }, 0 < p 2 Two notable covariograms in this family are the eponential (p = 1) and the Gaussian (p = 2). Spherical C(d θ) = { [ ( )] θ1 1 2 d π θ2 1 d θ 2 + sin 1 d θ 2 for d < θ 2 0 for d θ 2 For the spherical covariogram, if i and j are separated by a greater than θ 2, i and j are independent. Matérn C(d θ) = θ θ 3 1 Γ(θ 3 ) ( ) θ3 ( ) 2 θ3 d 2 θ3 d K θ3 where θ 2 is a scale parameter and θ 3 is a shape parameter, and K() θ is a modified Bessel function of the third kind of order θ 3 (Abramowit and Stegun 1964, Chapter 9). θ 2 θ 2 1
2 Why positive definite? Consider the power covariogram for large p. This makes the covariogram look like a step function C(d) = I[0 d 1]. So if sites 1,2,3 lie on a line with spacing 1, then Cov( 2 1, 2 ) = Cov( 2, 3 ) = 1 1 = 2 = 3, but C(d = 1) requires that Cov( 1, 3 ) = 0, which is a contradiction. Such a difficulty can occur for any p > 2. Eample: Gaussian random walk Let = ( 0, 1, 2, 3,...) T be a Gaussian process defined on the integers {0, 1, 2, 3,...} such that = ( 0, 1, 2, 3, 4 ) T has density: 0 0, i i 1 N( i 1, 1), for i = 1, 2, 3,... } 4 π() = (2π) 4 2 ep { 1 2 ( i i 1 ) 2, 0 = 0 i=1 ( ) = (2π) 4 2 ep 1 2 = (2π) 4 2 ep { 1 2 T W }, 0 = , 0 = 0 Alternatively, we can compute the means, variances and covariances of using standard covariance formulas since i = i j=1 j where the i s are iid N(0, 1) random variables. Thus: N 0, N(0, Σ) Note that this distribution is degenerate since 0 = 0. So the rank of Σ is only 4. One can check that W is a generalied-inverse of Σ. The covariance function of this process is: Cov( i, j ) = min(i, j) = i j One can imagine defining a random walk on a grid with twice the density using increments with half the variance. The resulting covariogram on the integers would be eactly the same. The figure below shows a random walk on a successively finer grid. 2
3 The limiting process { t } is called Brownian motion. It is defined over t 0, it s continuous, and it is characteried by its mean function E t = 0 and its covariance function Cov( t, s ) = t s. Note that the covariance function cannot be written as a function of here. In fact, Var t = t, so the process { t } isn t even stationary. A process { t } is said to be stationary if the distribution of a finite dimensional restriction of { t } is unchanged if the process is translated over space (ie. ( t1,..., tn ) = d ( t1 +s,..., tn+s) for any s in the case of Brownian motion). The random walk is an eample of an intrinsicly stationary process because the increments are all iid. Such processes can be characteried by their variograms, which give the variance of all pairwise differences. When the variance of any pair ( i, j ) depends only on the d ij between sites i and j we can characterie the process {} with a variogram denoted by 2γ(d ij ). For historical reasons γ(d) is called the semi-variogram, and the variogram is defined to be 2γ(d). For the random walk Var( i j ) = 2γ(d ij ) = i j In the eample here, knowing the variogram doesn t completely specify the distribution of {}. For eample, the random walk process where 0 = 0 has eactly the same variogram as does the process with 0 = 1. However, as long as one of the i s (or more) are known, then the distribution for the remaining components of {} is completely specified. The Variogram Definition 1.2 A Gaussian process { i }, is said to have a variogram if Var( i j ) is a function of d ij between sites i and j. We use 2γ(d ij ) to denote the variogram. 2γ(d ij ) = Var( i j ) If a process { i } has a covariogram, then the two functions are related by γ(d) = C(0) C(d) C(d) = γ( ) γ(h) 3
4 If { i } has a variogram, but the covariogram doesn t eist, we can compute a covariance function by conditioning on the event 0 = 0. 2γ(d ij ) = Var( i j ) = Var( i j 0 = 0) = Var(( i 0 ) ( j 0 ) 0 = 0) = Var( i 0 ) + Var( j 0 )) 2Cov( i 0, j 0 0 = 0) = 2γ(d i0 ) + 2γ(d j0 ) 2Cov( i, j 0 = 0) Thus one gets Cov( i, j 0 = 0) = γ(d i0 ) + γ(d j0 ) γ(d ij ) This is one way of obtaining Σ in the random walk eample. In fact using Σ ij = c γ(d ij ) will give a valid covariance matri provided c is sufficiently large. One eample of a Gaussian process which has a variogram, but not a covariogram is a generalied Brownian motion process: Power variogram: γ(d) d p, 0 < p 2 For the case p = 1, this is general Brownian motion. 2-d Thin plate spline γ(d) d 2 log d Sample realiations: 4
5 C(d) ep{ d/σ}, here σ = 2 eponential covariogram C(d) ep{ (d/σ) 2 }, here σ = 6 random realiation - Gaussian covariogram
6 C(d) ep{ (d/σ) 1. }, here σ = 9 random realiation - C(d) = ep(-d^1.) C(d) 1 2 d π σ 1 d σ + d sin 1 here σ = 9 σ random realiation - spherical
7 γ(d) d/σ, here σ = random realiation - Brownian motion p= γ(d) (d/σ) 1.3, here σ = random realiation - Brownian motion p=
8 γ(d) (d/σ) 1.7, here σ = random realiation - Brownian motion p= γ(d) (d/σ) 2.0, here σ = random realiation - Brownian motion p= Interpolation Given a set of points {1, 2,..., n} on a line or in a plane and their values 1, 2,..., n one can fit an interpolating surface just by choosing a covariogram or variogram model. 1. Assume the surface is a realiation from a Gaussian field with a specified covariance function. 8
9 2. Compute the conditional mean of the process given the observed sites. This is the interpolating surface. Here s the recipe: Suppose the process { 1,..., n } is observed. Now suppose you wish to find the interpolating surface { n+1,..., n+m } at the sites {n + 1,..., n + m}. Assume = ( 1,..., n, n+1,..., n+m ) T N (( µ1 µ 2 ) ( )) Σ11 Σ, 12 Σ 21 Σ 22 where the elements of Σ are determined by the covariogram or variogram model. The conditional distribution of ( n+1,..., n+m ) given 1,..., n is: ( 2 1 ) = (( n+1,..., n+m ) ( 1,..., n ) T ) N ( µ 2 + Σ 21 Σ 1 11 ( 1 µ 1 ), Σ 21 Σ 1 11 Σ 12 ) Note: if we re-epress things in terms of precisions, then for we have = ( 1,..., n, n+1,..., n+m ) T N ( ( µ1 µ 2 ) ( ) W11 W 1 ), 12 W 21 W 22 ( 2 1 ) = (( n+1,..., n+m ) ( 1,..., n ) T ) N ( µ 2 + W22 1 W ) 21( 1 µ 1 ), W22 1. Define the interpolant to be the mean of 2 given the observations 1 : µ 22 1 = µ 2 + Σ 21 Σ 1 11 ( 1 µ 1 ) If { i } really did follow a Guassian process with the specified mean and covariance function, then the standard error of the enterpolant at j would be the jj component of Σ 21 Σ 1 11 Σ 12. We ll look at estimating parameters of the covariogram/variogram later. The conditional mean is the optimal interpolator under mean squared error loss. Some eamples: 9
10 Gaussian C(r), scale = 2 Eponential C(r), scale = 1 Brownian motion C(r), p = 1. scale = Gaussian C(r), scale = 3 0 Eponential C(r), scale = 0 Brownian motion C(r), p = 1. scale = Gaussian C(r), scale = 0 Eponential C(r), scale = 0 Brownian motion C(r), p = 1. scale =
11 Gaussian C(r), scale = 2 Eponential C(r), scale = 1 Brownian motion C(r), p = 1. scale = Gaussian C(r), scale = 3-0 Eponential C(r), scale = - 0 Brownian motion C(r), p = 1. scale = Gaussian C(r), scale = - 0 Eponential C(r), scale = - 0 Brownian motion C(r), p = 1. scale =
12 a realiation mean conditional on =1 points realiation conditional on =1 points realiation conditional on =1 points
13 a realiation mean conditional on =1 points realiation conditional on =1 points 2 realiation conditional on =1 points A closer look at the covariogram There are a number of features of a variogram/covariogram that are worth noting. It is also of use to understand how properties of the covariogram affect the resulting spatial process. range scale nugget 13
14 nugget sill covariance range Eamples: White noise process Effects of the nugget on the conditional distribution Effects of the range on the conditional distribution Effects of γ (0+) on the conditional distribution 4 Estimating the variogram Given an observed set of points 1,..., n at locations 1,..., n one may wish to estimate various properties of the variogram governing their covariance. This can be done graphically using the empirical variogram. Definition 4.1 The empirical variogram is determined by discretiing into a n d bins and then estimating ˆγ(d) = 1 ( i j ) 2 2N d (i,j) d where d is the set of all pairs (i, j) such that the between i and j is within of d and N d is the number of pairs in d. Note this estimate can be somewhat unreliable. What follows is a number of surfaces and their empirical variograms derrived from 1,..., n, which were sampled uniformly from the 400 points making up a lattice. 14
15 Realiations from a Gaussian process with eponential variogram
16 Realiations from a Gaussian process with Gaussian variogram
17 Realiations from a Gaussian process with power (p=1.) variogram What features of γ(d) are important to capture Fitting a variogram model to the empirical variogram 1. Least squares: Choose θ so that (γ(d k θ) ˆγ(d k )) 2 is minimied. 2. Weighted least squares: Choose θ so that w k (γ(d k θ) ˆγ(d k )) 2 k k is minimied. Weights w k may be chosen to be proportional to the number of pairs in each bin N d. This will put more weight on pairs that are closer together. 17
18 3. Maimum likelihood: Suppose y N(µ, Σ(θ)), then we can estimate µ and θ by the values ˆµ and ˆθ that maimie the likelihood: L(µ, θ y) Σ(θ) 1 2 ep { 1 } 2 (y µ)t Σ(θ)(y µ) A variant of maimum likelihood is restricted maimum likelihood (REML) which uses a slightly modified version of the likelihood. 4. Bayesian estimation. Specify prior distributions for µ and θ and use the posterior mean or posterior mode to estimate θ. Anisotropy d = ΛRd. Transform euclidean thru a rotation R(θ) and a stretching/shrinking of the principle aes via Λ. These additional parameters may be absorbed in θ for estimating the variogram parameters. Eample: Piaa Road Superfund site. log dioin concentrations from Pilot Road site ylim lim One can estimate the variogram parameters by eye, using ML or REML, or better yet, a Bayesian approach. Use the interpolation formulas of Section 2 to estimate the concentration at unobserved sites. 18
19 Modeling spatial data y = β + + e where β absorbs standard linear model terms, absorbs the spatial trend, and e is a white noise term. eamples Agricultural field trials. rainfall estimation environmental monitoring imaging Observed data: y = (y 1,..., y n ) T Unobserved spatial trend: = ( 1,..., n ) T Covariates: Some equivalent formulations: y N(β +, σ 2 y I) y = β + + e y N(β, σ2 y I + Σ(θ)) N(0, Σ(θ)) N(0, Σ(θ)) e N(0, σ 2 ei) An eample: Researchers want to know how carbon concentrations in a stream differ on two different sides of a culvert on I-40. Concentrations are measured at 9 upstream locations and 8 downstream locations and are given in the figure below. Is there evidence that the culvert is associated with differences in carbon concentration? I
20 Here we can model the 17 measurements by y N(µ + α i, σ 2 ei + Σ(θ)) where α i, i = 1, 2, denotes the upstream or downstream measurements. So the question can be put in statistical terms, does α 1 = α 2? It turns out it depends on what covariance function is specified. If an eponential covriance function is used, then the inference depends on the specified range: significance by range: dist corr of nearest p-value diff*^ One could rely on maimum likelihood to specify the parameters of the eponential distribution. In fact it fits a covariance that is nearly all nugget effect (ie. no spatial dependence). However the uncertainty about the covariance parameters are unaccounted for.
Spatial Statistics with Image Analysis. Lecture L02. Computer exercise 0 Daily Temperature. Lecture 2. Johan Lindström.
C Stochastic fields Covariance Spatial Statistics with Image Analysis Lecture 2 Johan Lindström November 4, 26 Lecture L2 Johan Lindström - johanl@maths.lth.se FMSN2/MASM2 L /2 C Stochastic fields Covariance
More informationBasics of Point-Referenced Data Models
Basics of Point-Referenced Data Models Basic tool is a spatial process, {Y (s), s D}, where D R r Chapter 2: Basics of Point-Referenced Data Models p. 1/45 Basics of Point-Referenced Data Models Basic
More informationGaussian Processes. Le Song. Machine Learning II: Advanced Topics CSE 8803ML, Spring 2012
Gaussian Processes Le Song Machine Learning II: Advanced Topics CSE 8803ML, Spring 01 Pictorial view of embedding distribution Transform the entire distribution to expected features Feature space Feature
More informationChapter 4 - Fundamentals of spatial processes Lecture notes
TK4150 - Intro 1 Chapter 4 - Fundamentals of spatial processes Lecture notes Odd Kolbjørnsen and Geir Storvik January 30, 2017 STK4150 - Intro 2 Spatial processes Typically correlation between nearby sites
More information2 Statistical Estimation: Basic Concepts
Technion Israel Institute of Technology, Department of Electrical Engineering Estimation and Identification in Dynamical Systems (048825) Lecture Notes, Fall 2009, Prof. N. Shimkin 2 Statistical Estimation:
More informationPoint-Referenced Data Models
Point-Referenced Data Models Jamie Monogan University of Georgia Spring 2013 Jamie Monogan (UGA) Point-Referenced Data Models Spring 2013 1 / 19 Objectives By the end of these meetings, participants should
More informationIntroduction to Spatial Data and Models
Introduction to Spatial Data and Models Sudipto Banerjee 1 and Andrew O. Finley 2 1 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A. 2 Department of Forestry
More informationIntroduction to Spatial Data and Models
Introduction to Spatial Data and Models Sudipto Banerjee 1 and Andrew O. Finley 2 1 Department of Forestry & Department of Geography, Michigan State University, Lansing Michigan, U.S.A. 2 Biostatistics,
More informationCBMS Lecture 1. Alan E. Gelfand Duke University
CBMS Lecture 1 Alan E. Gelfand Duke University Introduction to spatial data and models Researchers in diverse areas such as climatology, ecology, environmental exposure, public health, and real estate
More information1 Isotropic Covariance Functions
1 Isotropic Covariance Functions Let {Z(s)} be a Gaussian process on, ie, a collection of jointly normal random variables Z(s) associated with n-dimensional locations s The joint distribution of {Z(s)}
More informationSpatial statistics, addition to Part I. Parameter estimation and kriging for Gaussian random fields
Spatial statistics, addition to Part I. Parameter estimation and kriging for Gaussian random fields 1 Introduction Jo Eidsvik Department of Mathematical Sciences, NTNU, Norway. (joeid@math.ntnu.no) February
More informationModels for spatial data (cont d) Types of spatial data. Types of spatial data (cont d) Hierarchical models for spatial data
Hierarchical models for spatial data Based on the book by Banerjee, Carlin and Gelfand Hierarchical Modeling and Analysis for Spatial Data, 2004. We focus on Chapters 1, 2 and 5. Geo-referenced data arise
More informationSTA 414/2104, Spring 2014, Practice Problem Set #1
STA 44/4, Spring 4, Practice Problem Set # Note: these problems are not for credit, and not to be handed in Question : Consider a classification problem in which there are two real-valued inputs, and,
More informationIntroduction. Semivariogram Cloud
Introduction Data: set of n attribute measurements {z(s i ), i = 1,, n}, available at n sample locations {s i, i = 1,, n} Objectives: Slide 1 quantify spatial auto-correlation, or attribute dissimilarity
More informationSpace-time data. Simple space-time analyses. PM10 in space. PM10 in time
Space-time data Observations taken over space and over time Z(s, t): indexed by space, s, and time, t Here, consider geostatistical/time data Z(s, t) exists for all locations and all times May consider
More informationESTIMATING THE MEAN LEVEL OF FINE PARTICULATE MATTER: AN APPLICATION OF SPATIAL STATISTICS
ESTIMATING THE MEAN LEVEL OF FINE PARTICULATE MATTER: AN APPLICATION OF SPATIAL STATISTICS Richard L. Smith Department of Statistics and Operations Research University of North Carolina Chapel Hill, N.C.,
More informationRejection sampling - Acceptance probability. Review: How to sample from a multivariate normal in R. Review: Rejection sampling. Weighted resampling
Rejection sampling - Acceptance probability Review: How to sample from a multivariate normal in R Goal: Simulate from N d (µ,σ)? Note: For c to be small, g() must be similar to f(). The art of rejection
More informationLines and points. Lines and points
omogeneous coordinates in the plane Homogeneous coordinates in the plane A line in the plane a + by + c is represented as (a, b, c). A line is a subset of points in the plane. All vectors (ka, kb, kc)
More informationComputation fundamentals of discrete GMRF representations of continuous domain spatial models
Computation fundamentals of discrete GMRF representations of continuous domain spatial models Finn Lindgren September 23 2015 v0.2.2 Abstract The fundamental formulas and algorithms for Bayesian spatial
More informationIssues and Techniques in Pattern Classification
Issues and Techniques in Pattern Classification Carlotta Domeniconi www.ise.gmu.edu/~carlotta Machine Learning Given a collection of data, a machine learner eplains the underlying process that generated
More informationIntroduction to Geostatistics
Introduction to Geostatistics Abhi Datta 1, Sudipto Banerjee 2 and Andrew O. Finley 3 July 31, 2017 1 Department of Biostatistics, Bloomberg School of Public Health, Johns Hopkins University, Baltimore,
More informationStatistics & Data Sciences: First Year Prelim Exam May 2018
Statistics & Data Sciences: First Year Prelim Exam May 2018 Instructions: 1. Do not turn this page until instructed to do so. 2. Start each new question on a new sheet of paper. 3. This is a closed book
More informationChapter 4 - Fundamentals of spatial processes Lecture notes
Chapter 4 - Fundamentals of spatial processes Lecture notes Geir Storvik January 21, 2013 STK4150 - Intro 2 Spatial processes Typically correlation between nearby sites Mostly positive correlation Negative
More informationCovariance function estimation in Gaussian process regression
Covariance function estimation in Gaussian process regression François Bachoc Department of Statistics and Operations Research, University of Vienna WU Research Seminar - May 2015 François Bachoc Gaussian
More informationAn Introduction to GAMs based on penalized regression splines. Simon Wood Mathematical Sciences, University of Bath, U.K.
An Introduction to GAMs based on penalied regression splines Simon Wood Mathematical Sciences, University of Bath, U.K. Generalied Additive Models (GAM) A GAM has a form something like: g{e(y i )} = η
More informationEstimators as Random Variables
Estimation Theory Overview Properties Bias, Variance, and Mean Square Error Cramér-Rao lower bound Maimum likelihood Consistency Confidence intervals Properties of the mean estimator Introduction Up until
More informationSpatial analysis of a designed experiment. Uniformity trials. Blocking
Spatial analysis of a designed experiment RA Fisher s 3 principles of experimental design randomization unbiased estimate of treatment effect replication unbiased estimate of error variance blocking =
More informationebay/google short course: Problem set 2
18 Jan 013 ebay/google short course: Problem set 1. (the Echange Parado) You are playing the following game against an opponent, with a referee also taking part. The referee has two envelopes (numbered
More informationStatistics: Learning models from data
DS-GA 1002 Lecture notes 5 October 19, 2015 Statistics: Learning models from data Learning models from data that are assumed to be generated probabilistically from a certain unknown distribution is a crucial
More informationFundamental Issues in Bayesian Functional Data Analysis. Dennis D. Cox Rice University
Fundamental Issues in Bayesian Functional Data Analysis Dennis D. Cox Rice University 1 Introduction Question: What are functional data? Answer: Data that are functions of a continuous variable.... say
More informationWhere now? Machine Learning and Bayesian Inference
Machine Learning and Bayesian Inference Dr Sean Holden Computer Laboratory, Room FC6 Telephone etension 67 Email: sbh@clcamacuk wwwclcamacuk/ sbh/ Where now? There are some simple take-home messages from
More informationF & B Approaches to a simple model
A6523 Signal Modeling, Statistical Inference and Data Mining in Astrophysics Spring 215 http://www.astro.cornell.edu/~cordes/a6523 Lecture 11 Applications: Model comparison Challenges in large-scale surveys
More informationTheory of Maximum Likelihood Estimation. Konstantin Kashin
Gov 2001 Section 5: Theory of Maximum Likelihood Estimation Konstantin Kashin February 28, 2013 Outline Introduction Likelihood Examples of MLE Variance of MLE Asymptotic Properties What is Statistical
More informationEcon 424 Time Series Concepts
Econ 424 Time Series Concepts Eric Zivot January 20 2015 Time Series Processes Stochastic (Random) Process { 1 2 +1 } = { } = sequence of random variables indexed by time Observed time series of length
More informationRegression with correlation for the Sales Data
Regression with correlation for the Sales Data Scatter with Loess Curve Time Series Plot Sales 30 35 40 45 Sales 30 35 40 45 0 10 20 30 40 50 Week 0 10 20 30 40 50 Week Sales Data What is our goal with
More informationp(x ω i 0.4 ω 2 ω
p( ω i ). ω.3.. 9 3 FIGURE.. Hypothetical class-conditional probability density functions show the probability density of measuring a particular feature value given the pattern is in category ω i.if represents
More informationAdvanced Introduction to Machine Learning CMU-10715
Advanced Introduction to Machine Learning CMU-10715 Gaussian Processes Barnabás Póczos http://www.gaussianprocess.org/ 2 Some of these slides in the intro are taken from D. Lizotte, R. Parr, C. Guesterin
More informationFluvial Variography: Characterizing Spatial Dependence on Stream Networks. Dale Zimmerman University of Iowa (joint work with Jay Ver Hoef, NOAA)
Fluvial Variography: Characterizing Spatial Dependence on Stream Networks Dale Zimmerman University of Iowa (joint work with Jay Ver Hoef, NOAA) March 5, 2015 Stream network data Flow Legend o 4.40-5.80
More informationPracticum : Spatial Regression
: Alexandra M. Schmidt Instituto de Matemática UFRJ - www.dme.ufrj.br/ alex 2014 Búzios, RJ, www.dme.ufrj.br Exploratory (Spatial) Data Analysis 1. Non-spatial summaries Numerical summaries: Mean, median,
More informationLecture 3: Latent Variables Models and Learning with the EM Algorithm. Sam Roweis. Tuesday July25, 2006 Machine Learning Summer School, Taiwan
Lecture 3: Latent Variables Models and Learning with the EM Algorithm Sam Roweis Tuesday July25, 2006 Machine Learning Summer School, Taiwan Latent Variable Models What to do when a variable z is always
More information7 Gaussian Discriminant Analysis (including QDA and LDA)
36 Jonathan Richard Shewchuk 7 Gaussian Discriminant Analysis (including QDA and LDA) GAUSSIAN DISCRIMINANT ANALYSIS Fundamental assumption: each class comes from normal distribution (Gaussian). X N(µ,
More informationCOMS 4721: Machine Learning for Data Science Lecture 20, 4/11/2017
COMS 4721: Machine Learning for Data Science Lecture 20, 4/11/2017 Prof. John Paisley Department of Electrical Engineering & Data Science Institute Columbia University SEQUENTIAL DATA So far, when thinking
More informationNonparametric Bayesian Methods - Lecture I
Nonparametric Bayesian Methods - Lecture I Harry van Zanten Korteweg-de Vries Institute for Mathematics CRiSM Masterclass, April 4-6, 2016 Overview of the lectures I Intro to nonparametric Bayesian statistics
More informationTaylor Series and Series Convergence (Online)
7in 0in Felder c02_online.te V3 - February 9, 205 9:5 A.M. Page CHAPTER 2 Taylor Series and Series Convergence (Online) 2.8 Asymptotic Epansions In introductory calculus classes the statement this series
More informationGaussian Processes 1. Schedule
1 Schedule 17 Jan: Gaussian processes (Jo Eidsvik) 24 Jan: Hands-on project on Gaussian processes (Team effort, work in groups) 31 Jan: Latent Gaussian models and INLA (Jo Eidsvik) 7 Feb: Hands-on project
More informationMACHINE LEARNING ADVANCED MACHINE LEARNING
MACHINE LEARNING ADVANCED MACHINE LEARNING Recap of Important Notions on Estimation of Probability Density Functions 2 2 MACHINE LEARNING Overview Definition pdf Definition joint, condition, marginal,
More informationHandbook of Spatial Statistics Chapter 2: Continuous Parameter Stochastic Process Theory by Gneiting and Guttorp
Handbook of Spatial Statistics Chapter 2: Continuous Parameter Stochastic Process Theory by Gneiting and Guttorp Marcela Alfaro Córdoba August 25, 2016 NCSU Department of Statistics Continuous Parameter
More informationREML Estimation and Linear Mixed Models 4. Geostatistics and linear mixed models for spatial data
REML Estimation and Linear Mixed Models 4. Geostatistics and linear mixed models for spatial data Sue Welham Rothamsted Research Harpenden UK AL5 2JQ December 1, 2008 1 We will start by reviewing the principles
More informationFactor Analysis. Qian-Li Xue
Factor Analysis Qian-Li Xue Biostatistics Program Harvard Catalyst The Harvard Clinical & Translational Science Center Short course, October 7, 06 Well-used latent variable models Latent variable scale
More informationOptimum Ordering and Pole-Zero Pairing. Optimum Ordering and Pole-Zero Pairing Consider the scaled cascade structure shown below
Pole-Zero Pairing of the Cascade Form IIR Digital Filter There are many possible cascade realiations of a higher order IIR transfer function obtained by different pole-ero pairings and ordering Each one
More informationPRODUCING PROBABILITY MAPS TO ASSESS RISK OF EXCEEDING CRITICAL THRESHOLD VALUE OF SOIL EC USING GEOSTATISTICAL APPROACH
PRODUCING PROBABILITY MAPS TO ASSESS RISK OF EXCEEDING CRITICAL THRESHOLD VALUE OF SOIL EC USING GEOSTATISTICAL APPROACH SURESH TRIPATHI Geostatistical Society of India Assumptions and Geostatistical Variogram
More informationBayesian Decision and Bayesian Learning
Bayesian Decision and Bayesian Learning Ying Wu Electrical Engineering and Computer Science Northwestern University Evanston, IL 60208 http://www.eecs.northwestern.edu/~yingwu 1 / 30 Bayes Rule p(x ω i
More informationGauge Plots. Gauge Plots JAPANESE BEETLE DATA MAXIMUM LIKELIHOOD FOR SPATIALLY CORRELATED DISCRETE DATA JAPANESE BEETLE DATA
JAPANESE BEETLE DATA 6 MAXIMUM LIKELIHOOD FOR SPATIALLY CORRELATED DISCRETE DATA Gauge Plots TuscaroraLisa Central Madsen Fairways, 996 January 9, 7 Grubs Adult Activity Grub Counts 6 8 Organic Matter
More informationPhysics 403. Segev BenZvi. Parameter Estimation, Correlations, and Error Bars. Department of Physics and Astronomy University of Rochester
Physics 403 Parameter Estimation, Correlations, and Error Bars Segev BenZvi Department of Physics and Astronomy University of Rochester Table of Contents 1 Review of Last Class Best Estimates and Reliability
More informationPMR Learning as Inference
Outline PMR Learning as Inference Probabilistic Modelling and Reasoning Amos Storkey Modelling 2 The Exponential Family 3 Bayesian Sets School of Informatics, University of Edinburgh Amos Storkey PMR Learning
More informationCSC321 Lecture 18: Learning Probabilistic Models
CSC321 Lecture 18: Learning Probabilistic Models Roger Grosse Roger Grosse CSC321 Lecture 18: Learning Probabilistic Models 1 / 25 Overview So far in this course: mainly supervised learning Language modeling
More informationStatistical Geometry Processing Winter Semester 2011/2012
Statistical Geometry Processing Winter Semester 2011/2012 Linear Algebra, Function Spaces & Inverse Problems Vector and Function Spaces 3 Vectors vectors are arrows in space classically: 2 or 3 dim. Euclidian
More informationObtaining Critical Values for Test of Markov Regime Switching
University of California, Santa Barbara From the SelectedWorks of Douglas G. Steigerwald November 1, 01 Obtaining Critical Values for Test of Markov Regime Switching Douglas G Steigerwald, University of
More informationNonparametric Regression With Gaussian Processes
Nonparametric Regression With Gaussian Processes From Chap. 45, Information Theory, Inference and Learning Algorithms, D. J. C. McKay Presented by Micha Elsner Nonparametric Regression With Gaussian Processes
More informationStatistics for analyzing and modeling precipitation isotope ratios in IsoMAP
Statistics for analyzing and modeling precipitation isotope ratios in IsoMAP The IsoMAP uses the multiple linear regression and geostatistical methods to analyze isotope data Suppose the response variable
More informationLikelihood-Based Methods
Likelihood-Based Methods Handbook of Spatial Statistics, Chapter 4 Susheela Singh September 22, 2016 OVERVIEW INTRODUCTION MAXIMUM LIKELIHOOD ESTIMATION (ML) RESTRICTED MAXIMUM LIKELIHOOD ESTIMATION (REML)
More informationREGRESSION WITH SPATIALLY MISALIGNED DATA. Lisa Madsen Oregon State University David Ruppert Cornell University
REGRESSION ITH SPATIALL MISALIGNED DATA Lisa Madsen Oregon State University David Ruppert Cornell University SPATIALL MISALIGNED DATA 10 X X X X X X X X 5 X X X X X 0 X 0 5 10 OUTLINE 1. Introduction 2.
More informationComparing Non-informative Priors for Estimation and Prediction in Spatial Models
Environmentrics 00, 1 12 DOI: 10.1002/env.XXXX Comparing Non-informative Priors for Estimation and Prediction in Spatial Models Regina Wu a and Cari G. Kaufman a Summary: Fitting a Bayesian model to spatial
More informationMath 180A. Lecture 16 Friday May 7 th. Expectation. Recall the three main probability density functions so far (1) Uniform (2) Exponential.
Math 8A Lecture 6 Friday May 7 th Epectation Recall the three main probability density functions so far () Uniform () Eponential (3) Power Law e, ( ), Math 8A Lecture 6 Friday May 7 th Epectation Eample
More informationDealing with Rotating Coordinate Systems Physics 321. (Eq.1)
Dealing with Rotating Coordinate Systems Physics 321 The treatment of rotating coordinate frames can be very confusing because there are two different sets of aes, and one set of aes is not constant in
More informationSpatial Backfitting of Roller Measurement Values from a Florida Test Bed
Spatial Backfitting of Roller Measurement Values from a Florida Test Bed Daniel K. Heersink 1, Reinhard Furrer 1, and Mike A. Mooney 2 1 Institute of Mathematics, University of Zurich, CH-8057 Zurich 2
More informationBayesian Quadrature: Model-based Approximate Integration. David Duvenaud University of Cambridge
Bayesian Quadrature: Model-based Approimate Integration David Duvenaud University of Cambridge The Quadrature Problem ˆ We want to estimate an integral Z = f ()p()d ˆ Most computational problems in inference
More informationI don t have much to say here: data are often sampled this way but we more typically model them in continuous space, or on a graph
Spatial analysis Huge topic! Key references Diggle (point patterns); Cressie (everything); Diggle and Ribeiro (geostatistics); Dormann et al (GLMMs for species presence/abundance); Haining; (Pinheiro and
More informationParticle-Based Approximate Inference on Graphical Model
article-based Approimate Inference on Graphical Model Reference: robabilistic Graphical Model Ch. 2 Koller & Friedman CMU, 0-708, Fall 2009 robabilistic Graphical Models Lectures 8,9 Eric ing attern Recognition
More informationChapter 5. Statistical Models in Simulations 5.1. Prof. Dr. Mesut Güneş Ch. 5 Statistical Models in Simulations
Chapter 5 Statistical Models in Simulations 5.1 Contents Basic Probability Theory Concepts Discrete Distributions Continuous Distributions Poisson Process Empirical Distributions Useful Statistical Models
More informationBayesian Approach 2. CSC412 Probabilistic Learning & Reasoning
CSC412 Probabilistic Learning & Reasoning Lecture 12: Bayesian Parameter Estimation February 27, 2006 Sam Roweis Bayesian Approach 2 The Bayesian programme (after Rev. Thomas Bayes) treats all unnown quantities
More informationIndex. Geostatistics for Environmental Scientists, 2nd Edition R. Webster and M. A. Oliver 2007 John Wiley & Sons, Ltd. ISBN:
Index Akaike information criterion (AIC) 105, 290 analysis of variance 35, 44, 127 132 angular transformation 22 anisotropy 59, 99 affine or geometric 59, 100 101 anisotropy ratio 101 exploring and displaying
More informationBrief Review on Estimation Theory
Brief Review on Estimation Theory K. Abed-Meraim ENST PARIS, Signal and Image Processing Dept. abed@tsi.enst.fr This presentation is essentially based on the course BASTA by E. Moulines Brief review on
More informationGaussian Process Regression
Gaussian Process Regression 4F1 Pattern Recognition, 21 Carl Edward Rasmussen Department of Engineering, University of Cambridge November 11th - 16th, 21 Rasmussen (Engineering, Cambridge) Gaussian Process
More informationDS-GA 1002 Lecture notes 11 Fall Bayesian statistics
DS-GA 100 Lecture notes 11 Fall 016 Bayesian statistics In the frequentist paradigm we model the data as realizations from a distribution that depends on deterministic parameters. In contrast, in Bayesian
More informationMACHINE LEARNING ADVANCED MACHINE LEARNING
MACHINE LEARNING ADVANCED MACHINE LEARNING Recap of Important Notions on Estimation of Probability Density Functions 22 MACHINE LEARNING Discrete Probabilities Consider two variables and y taking discrete
More informationLecture 3: Pattern Classification. Pattern classification
EE E68: Speech & Audio Processing & Recognition Lecture 3: Pattern Classification 3 4 5 The problem of classification Linear and nonlinear classifiers Probabilistic classification Gaussians, mitures and
More informationConstrained Gaussian processes: methodology, theory and applications
Constrained Gaussian processes: methodology, theory and applications Hassan Maatouk hassan.maatouk@univ-rennes2.fr Workshop on Gaussian Processes, November 6-7, 2017, St-Etienne (France) Hassan Maatouk
More informationStatistícal Methods for Spatial Data Analysis
Texts in Statistícal Science Statistícal Methods for Spatial Data Analysis V- Oliver Schabenberger Carol A. Gotway PCT CHAPMAN & K Contents Preface xv 1 Introduction 1 1.1 The Need for Spatial Analysis
More informationMobile Robot Localization
Mobile Robot Localization 1 The Problem of Robot Localization Given a map of the environment, how can a robot determine its pose (planar coordinates + orientation)? Two sources of uncertainty: - observations
More informationA Complete Spatial Downscaler
A Complete Spatial Downscaler Yen-Ning Huang, Brian J Reich, Montserrat Fuentes 1 Sankar Arumugam 2 1 Department of Statistics, NC State University 2 Department of Civil, Construction, and Environmental
More informationExpectation Maximization Deconvolution Algorithm
Epectation Maimization Deconvolution Algorithm Miaomiao ZHANG March 30, 2011 Abstract In this paper, we use a general mathematical and eperimental methodology to analyze image deconvolution. The main procedure
More informationFor a stochastic process {Y t : t = 0, ±1, ±2, ±3, }, the mean function is defined by (2.2.1) ± 2..., γ t,
CHAPTER 2 FUNDAMENTAL CONCEPTS This chapter describes the fundamental concepts in the theory of time series models. In particular, we introduce the concepts of stochastic processes, mean and covariance
More informationReview. DS GA 1002 Statistical and Mathematical Models. Carlos Fernandez-Granda
Review DS GA 1002 Statistical and Mathematical Models http://www.cims.nyu.edu/~cfgranda/pages/dsga1002_fall16 Carlos Fernandez-Granda Probability and statistics Probability: Framework for dealing with
More informationSignals and Spectra - Review
Signals and Spectra - Review SIGNALS DETERMINISTIC No uncertainty w.r.t. the value of a signal at any time Modeled by mathematical epressions RANDOM some degree of uncertainty before the signal occurs
More informationStatistical Inference: Maximum Likelihood and Bayesian Approaches
Statistical Inference: Maximum Likelihood and Bayesian Approaches Surya Tokdar From model to inference So a statistical analysis begins by setting up a model {f (x θ) : θ Θ} for data X. Next we observe
More informationUseful Mathematics. 1. Multivariable Calculus. 1.1 Taylor s Theorem. Monday, 13 May 2013
Useful Mathematics Monday, 13 May 013 Physics 111 In recent years I have observed a reticence among a subpopulation of students to dive into mathematics when the occasion arises in theoretical mechanics
More informationKriging models with Gaussian processes - covariance function estimation and impact of spatial sampling
Kriging models with Gaussian processes - covariance function estimation and impact of spatial sampling François Bachoc former PhD advisor: Josselin Garnier former CEA advisor: Jean-Marc Martinez Department
More informationF denotes cumulative density. denotes probability density function; (.)
BAYESIAN ANALYSIS: FOREWORDS Notation. System means the real thing and a model is an assumed mathematical form for the system.. he probability model class M contains the set of the all admissible models
More informationGaussians. Hiroshi Shimodaira. January-March Continuous random variables
Cumulative Distribution Function Gaussians Hiroshi Shimodaira January-March 9 In this chapter we introduce the basics of how to build probabilistic models of continuous-valued data, including the most
More informationUnivariate ARIMA Forecasts (Theory)
Univariate ARIMA Forecasts (Theory) Al Nosedal University of Toronto April 10, 2016 Al Nosedal University of Toronto Univariate ARIMA Forecasts (Theory) April 10, 2016 1 / 23 All univariate forecasting
More informationProbabilistic Graphical Models
2016 Robert Nowak Probabilistic Graphical Models 1 Introduction We have focused mainly on linear models for signals, in particular the subspace model x = Uθ, where U is a n k matrix and θ R k is a vector
More informationPaper Review: NONSTATIONARY COVARIANCE MODELS FOR GLOBAL DATA
Paper Review: NONSTATIONARY COVARIANCE MODELS FOR GLOBAL DATA BY MIKYOUNG JUN AND MICHAEL L. STEIN Presented by Sungkyu Jung April, 2009 Outline 1 Introduction 2 Covariance Models 3 Application: Level
More informationRandom Fields in Bayesian Inference: Effects of the Random Field Discretization
Random Fields in Bayesian Inference: Effects of the Random Field Discretization Felipe Uribe a, Iason Papaioannou a, Wolfgang Betz a, Elisabeth Ullmann b, Daniel Straub a a Engineering Risk Analysis Group,
More informationPhysics 403. Segev BenZvi. Propagation of Uncertainties. Department of Physics and Astronomy University of Rochester
Physics 403 Propagation of Uncertainties Segev BenZvi Department of Physics and Astronomy University of Rochester Table of Contents 1 Maximum Likelihood and Minimum Least Squares Uncertainty Intervals
More informationParametric Techniques Lecture 3
Parametric Techniques Lecture 3 Jason Corso SUNY at Buffalo 22 January 2009 J. Corso (SUNY at Buffalo) Parametric Techniques Lecture 3 22 January 2009 1 / 39 Introduction In Lecture 2, we learned how to
More informationAssociation studies and regression
Association studies and regression CM226: Machine Learning for Bioinformatics. Fall 2016 Sriram Sankararaman Acknowledgments: Fei Sha, Ameet Talwalkar Association studies and regression 1 / 104 Administration
More informationProbability and Statistics
Probability and Statistics 1 Contents some stochastic processes Stationary Stochastic Processes 2 4. Some Stochastic Processes 4.1 Bernoulli process 4.2 Binomial process 4.3 Sine wave process 4.4 Random-telegraph
More informationStatistical Data Analysis Stat 3: p-values, parameter estimation
Statistical Data Analysis Stat 3: p-values, parameter estimation London Postgraduate Lectures on Particle Physics; University of London MSci course PH4515 Glen Cowan Physics Department Royal Holloway,
More informationMultidimensional partitions of unity and Gaussian terrains
and Gaussian terrains Richard A. Bale, Jeff P. Grossman, Gary F. Margrave, and Michael P. Lamoureu ABSTRACT Partitions of unity play an important rôle as amplitude-preserving windows for nonstationary
More information