arxiv: v1 [gr-qc] 26 Nov 2009

Size: px
Start display at page:

Download "arxiv: v1 [gr-qc] 26 Nov 2009"

Transcription

1 AEI Random template placement and prior information arxiv:09.505v [gr-qc] 26 Nov 2009 Christian Röver Max-Plack-Institut für Gravitationsphysik (Albert-Einstein-Institut), Callinstraße 38, 3067 Hannover, Germany. Abstract. In signal detection problems, one is usually faced with the task of searching a parameter space for peaks in the likelihood function which indicate the presence of a signal. Random searches have proven to be very efficient as well as easy to implement, compared e.g. to searches along regular grids in parameter space. Knowledge of the parameterised shape of the signal searched for adds structure to the parameter space, i.e., there are usually regions requiring to be densely searched while in other regions a coarser search is sufficient. On the other hand, prior information identifies the regions in which a search will actually be promising or may likely be in vain. Defining specific figures of merit allows one to combine both template metric and prior distribution and devise optimal sampling schemes over the parameter space. We show an example related to the gravitational wave signal from a binary inspiral event. Here the template metric and prior information are particularly contradictory, since signals from low-mass systems tolerate the least mismatch in parameter space while high-mass systems are far more likely, as they imply a greater signal-to-noise ratio (SNR) and hence are detectable to greater distances. The derived sampling strategy is implemented in a Markov chain Monte Carlo (MCMC) algorithm where it improves convergence.. Introduction Signal detection, in gravitational wave detection in particular, frequently entails the problem of performing a computationally expensive numerical search over a large parameter space. The search here means a search for a peak in the likelihood function, or another detection statistic, based on the data at hand and varying the unknown signal parameters. A peak or a threshold excess then indicates the presence of a signal [, 2]. Such brute-force searches may be implemented as grid searches, evaluating the detection statistic at regularly placed points in parameter space. Computing the detection statistic usually means evaluating the match between a signal template and the data; the spacing between evaluated points in parameter space is then usually based on a template metric which ensures that all possible signals (corresponding to points in parameter space) have at least a certain minimal match with one of the evaluated templates (corresponding to the grid points). Instead of using regularly spaced template banks, the use of random template banks has recently gained popularity, as these are often very easily implemented, and have also be shown to be very efficient, especially in higher dimensions [3]. Here the idea is to populate the parameter space randomly, but uniformly with respect to the template metric. These template placement strategies have by now usually been based on minimax reasoning, by aiming at minimizing the maximal (worst-case) mismatch across the whole parameter space. Once one takes prior information on the unknown parameters into consideration, by accounting for a priori probabilities attached to different regions of parameter space, a decision-theoretic

2 approach allows us to devise other strategies, effectively concentrating efforts on the more promising regions of parameter space in pursuit of a certain optimality criterion [4, 5]. In fact, a minimax strategy may often only exist once one imposes hard bounds on the parameter space (and by that ensuring the existence of an absolute worst case). Markov chain Monte Carlo (MCMC) methods are meanwhile widely used for (Bayesian) parameter estimation in the signal processing stage for gravitational-wave signals [6, 7]. MCMC algorithms are, first of all, methods for stochastic integration [8, 9], although by the way they work they often behave similarly to stochastic search algorithms as well. This is in fact a most welcome property, as part of the parameter estimation problem is usually also a search/optimization problem, as, besides integration over the parameters posterior distribution, it requires finding the global mode or secondary modes. Parallel tempering [0, ] is a variety of the Metropolis-Hastings MCMC algorithm (and a special case of Metropolis-coupled MCMC algorithm [2, 9]) aimed at enhancing these stochastic search capabilities. This is done by basically running several MCMC chains in parallel, where tempering at increasing temperature values is applied to subsequent chains (as in simulated annealing methods [3]), and additional steps are introduced to allow for communication between chains [4]. Parallel tempering methods have been applied to gravitational-wave data analysis for binary inspiral signals in the context of ground-based [5] and space-based (LISA) measurements [4], where they have proven advantageous especially in cases of high SNR and of posterior distributions exhibiting multiple modes or degeneracies [6, 7]. They have meanwhile also been adopted for the analysis of burst signals [8]. Among the parallel Markov chains being run at different temperatures within the parallel tempering implementation, the cool ones with no tempering applied produce samples from the posterior distribution for the stochastic integration part, while the high-temperature chains are producing samples for the stochastic search. The question now is how to set up the algorithm so that the search is most efficient, given our knowledge of prior and template metric, i.e., our knowledge of where the true parameters are (un-) likely to be, and how hard one needs to look across the parameter space. The problem is of special interest in the context of binary inspiral signals, as prior and template metric are particularly contradictory: a priori one is most likely do detect an inspiral involving high masses, as these result in a high-snr signal that is detectable to a greater distance. On the other hand, considering the template metric only, one might want to mostly try low-mass templates, since at low masses the template s and true signal parameters need to be in very close agreement in order for them to match, while at high masses greater discrepancies still yield a good match. What needs to be defined is the distribution to sample from in order to find the mode(s) fastest, which is very similar to setting up a random template bank, the difference being that one does not settle on some fixed number of templates, as the MCMC sampler in principle is thought to sample indefinitely. In the following Sec. 2, we will introduce the problem for the case of binary inspiral signals, and Sec. 3 briefly introduces the parallel tempering context. In Sec. 4 the general problem is formulated in decision-theoretic terms and solved for a particular optimality criterion. Sec. 5 shows some illustrative examples, and Sec. 6 eventually closes with conclusions and perspectives. 2. Binary inspiral parameters In the simplest description, a binary inspiral signal as measured by ground-based interferometers is determined by 9 parameters: sky location (declination δ, right ascension α), polaristion (ψ), companion masses (m, m 2 ), luminosity distance (d L ), time of arrival (t c ), phase (φ) and inclination angle (ι). Assuming some prior distribution for the masses (in the following simply defined to be uniform, m,m 2 [M,0M ]), and an isotropic distribution of events across space while folding in the detectability as a function of signal-to-noise ratio (SNR), one can derive a joint prior distribution whose marginal distribution of masses is shown in Fig.

3 prior: π(θ) minimax/equalizer rule: ρ^(θ) g(θ) optimal rule: p*(θ) π(θ) g(θ) m2 m2 m2 m m m Figure. (Marginal) densities of the distributions π, ˆρ and p for the two mass parameters (m, m 2 ) of a binary inspiral signal. The prior (left plot) indicates that high masses are most likely, which is because they result in stronger signals that are detectable to greater distances. The template metric on the other hand implies that low masses require a dense template spacing (middle plot). [9, 4]. A template metric may be defined following [20, 2], assuming the metric to be constant in the space of the Newtonian and.5 PN chirp times λ and λ 2, which are functions of the mass parameters. For the remaining parameters, for now, we again assume the metric to be uniform (t c, log(d L )) and isotropic (δ, α, ψ, ι, φ). The implied distribution in terms of (m,m 2 ) following from a uniform spacing in (λ,λ 2 ) may be derived using the reparametrisation explicated in [22]. This distribution is shown in Fig.. 3. Parallel tempering In the context of Monte Carlo integration, tempering is utilised to prevent the integration algorithm from getting stuck in local modes of the distribution from which it is sampling. A temperature parameter T is introduced, and instead of sampling from the distribution of actual interest, with density function f(θ), the modified distribution f (T) (θ) f(θ) T () is used. The introduced exponent is supposed to make the distribution more tractable, as it has a flattening effect on the density; the same effect is also taken advantage of in simulated annealing methods [3]. In the limit of T, the density f (T) (θ) then approaches a uniform distribution [9]. In the context of posterior inference, when the target distribution f(θ) is the product of prior π(θ) and likelihood L(θ), it may be more sensible to use a scheme only tempering the likelihood part: f (T) (θ) π(θ) L(θ) T, (2) in which case f (T) (θ) goes towards the prior π(θ) for T [4]. Both uniform distribution and prior distribution may in general not be the most sensible choice, as was pointed out above, since the tempering is also supposed to enhance the algorithm s stochastic search properties. Assume that one had a distribution p (θ) available, which leads to an optimal sampling (w.r.t. to some pre-specified criterion), and which is then the desired density for T. This suggests a generalized tempering parametrisation: ( ) f(θ) f (T) (θ) p T (θ) = p p (θ) T f(θ) T (3) (θ)

4 which in the special cases of p (θ) and p (θ) = π(θ) again yields the tempering schemes from () and (2) above. The question now is how to choose such a limiting distribution p (θ) based on given prior information and template metric. 4. The decision theoretic approach Let g(θ) be the determinant of the template metric as a function of the signal parameters. A large value of g means that that templates need to be densely spaced around θ, while a smaller g indicates that a coarser spacing is sufficient. The volume covered by a template placed at parameter θ is proportional to g(θ) 2, and hence the probability density to sample from for setting up a random template bank is given by ˆρ(θ) g(θ) [3]. Now consider the case of the true parameter value being θ 0 Θ. The actual value θ 0 is unknown, what is known is the prior probability density π(θ). Whenever a template θ is placed in parameter space, it is considered a match if it was sufficiently close to the true value θ 0. What exactly is sufficiently close is determined via mismatch considerations and is expressed through the template metric. Then the probability of a match is P(match θ ) = c g(θ ) π(θ ), (4) where c R + is a constant depending on how close a match actually is required to be. If one was to pick a single template θ, the chances for success would obviously be maximal where the above product reaches its maximum. Analogously, consider the case of a given true value θ 0 and repeated, independent guesses drawn from p (θ). Then for each single guess the probability of success is P(match θ 0 ) = c g(θ0 ) p (θ 0 ). (5) What is desired is a distribution p from which to generate independent draws so that the chances of getting a match are optimal. Whether or when one will get a match is a matter of chance, depending on both the true value θ 0 Θ and the choice of p P, where P is the space of probability distributions over Θ. Suppose we are interested in minimizing the expected number of trials T (or waiting time) until the first match. Any choice of p implies a probability distribution for T; for a given true value θ 0 and a sampling distribution p, T follows a geometric distribution with density and expectation: P(T =t θ 0 ) = ( c ) t ( g(θ0 ) p (θ 0 ) c ) g(θ0 ) p (θ 0 ), E[T θ 0 ] = g(θ0 ) p (θ 0 ). (6) c In decision theoretic terms, we are given a state-of-nature space Θ, an action space P, and a loss function L : Θ P R with L(θ 0,p ) = E p [T θ 0 ] [4, 5]. An optimal choice of p may now be determined by minimizing the expected loss; integrating over the possible values that θ 0 could take, that (prior) expectation is E[T] = c Θ π(θ)dθ, (7) g(θ0 ) p (θ 0 ) which is minimized by choosing p (θ) π(θ) g(θ) = π(θ) ˆρ(θ), (8) i.e., the optimal p here is proportional to the geometric mean of π and ˆρ, and independent of c.

5 π(θ) ρ^(θ) g(θ) p*(θ) π(θ) g(θ) Figure 2. Densities of the distributions π, ˆρ and p for the toy example discussed in Sec The distribution defined through the density ˆρ that is usually utilized for random template banks [3] plays a particular role in this context. From equation (5) one can see that by setting p := ˆρ the probability of a match (and with that also the waiting time) becomes independent of the actual parameter value θ 0, so that ˆρ constitutes an equalizer rule. From (8) it follows that ˆρ will be optimal in the case that the prior happens to be π = ˆρ. This implies that π = ˆρ defines the least favourable prior distribution for this case, and that p = ˆρ also constitutes the minimax strategy (independent from the particular prior π), as it minimizes the maximum of E[T θ 0 ] across all possible true values θ 0 [4]. Since p = ˆρ leads to a uniform match probability in (5), it actually constitutes the equalizer rule for the wider family of optimality criteria that are functions of P(match θ 0 ). 5. Examples 5.. Toy example : Gaussian prior Consider a parameter space Θ =Rwhere the prior is Gaussian with mean µ and variance σ 2 : π = N(µ,σ 2 ), and the template metric is flat, i.e., g(θ) = γ is independent of θ. Then the equalizer rule ˆρ does not exist, and the optimal rule would be p = N(µ,2σ 2 ) Toy example 2: Numerical simulation Consider a parameter space Θ = [0,], where the prior and template metric behave as shown in Fig. 2. For this simple case the behaviour of different sampling strategies can be simulated numerically, by drawing true parameter values θ 0 from the prior distribution and then drawing guesses θ from either ˆρ or p in order to see how the strategies differ P(T t) p* ρ^ waiting time t P(T t) waiting time t Figure 3. Cumulative distributions of the resulting waiting times T when using sampling strategies p and ˆρ in the toy example of Sec The right panel shows a zoom-in on the differing tail behaviour. Fig. 3 illustrates the distribution of the resulting times T, for both the minimax and optimal strategies ˆρ and p. As expected, the average waiting time is lower for p, and one can see that the minimax strategy performs better in the unlikely worst cases.

6 (unnormalized) log likelihood chain # (T=.00) chain #2 (T=.50) chain #3 (T=2.25) chain #4 (T=3.38) chain #5 (T=5.06) chain #6 (T=7.59) chain #7 (T=.4) chain #8 (T=7.) mass ratio (η) mass ratio (η) MCMC iteration 2 chirp mass (m c ) chirp mass (m c ) Figure 4. This plot illustrates the behaviour of a Parallel Tempering algorithm utilizing the distribution p when running on simulated data. The left panel shows how the algorithm s cool chains manage to ascend to greater likelihood values while the tempered chains keep sampling at lower likelihood values. The 2nd panel is a scatter plot of mass parameter samples from all the different chains (after the algorithm s burn-in phase). The right panel eventually shows the resulting mass parameters marginal posterior density derived from the cool chain # alone; the cross indicates the true parameter value Binary inspiral example The prior π and minimax sampling rule ˆρ for the mass parameters of a binary inspiral event were shown in Fig.. The right panel of the same figure also shows the resulting optimized sampling distribution p. The obvious discrepancy between least favourable (ˆρ) and actual prior (π) suggests that there actually is a gain in doing the optimization. Fig. 4 shows how a parallel tempering algorithm for parameter estimation behaves when utilizing the distribution p for high-temperature chains as described in Sec. 3 (3). The MCMC chains quickly converge to the true parameter values, while the higher-temperature chains keep scanning the parameter space efficiently. 6. Conclusions and outlook We have applied a decision-theoretic approach in order to derive an optimized sampling distribution to be used within a parallel tempering MCMC implementation. The optimization step here provides a natural link between the parameter space metric and the prior information about the parameter values. The particular optimality criterion chosen here (the expected time until a matching template is found, E[T θ 0 ]) turns out to be computationally convenient, as the resulting sampling distribution p is independent of the particular mismatch threshold c, and is almost trivial to implement within an MCMC application. Other criteria are conceivable though, like the probability of a missed detection within N samples P(T > N θ 0 ) for example, which may then lead to more complicated results. The general approach used here should also be useful in other contexts; it turns out that the distribution usually used for setting up random template banks here constitutes the special case of a minimax strategy, which implies that the explicit specification of particular figures-of-merit and the consideration of prior information may yield great efficiency improvements, especially in cases where the implicitly assumed least favourable prior greatly deviates from the actual prior information as in the binary inspiral case. In the framework discussed above, the resulting optimized sampling distribution p even exists for cases where the minimax rule does not (as in the example of Sec. 5. above). This suggests that a similar approach may also make other ad-hoc fixes like the mass parameter bounds in the binary inspiral example dispensable, as it would naturally focus in on the promising parameter range while ruling out too unlikely and

7 too costly regions of parameter space. Acknowledgments The author would like to thank Chris Messenger, Reinhard Prix and Graham Woan for helpful discussions. This work was supported by the Max-Planck-Society. References [] McDonough R N and Whalen A D 995 Detection of signals in noise 2nd ed (New York: Academic Press) [2] Wainstein L A and Zubakov V D 962 Extraction of signals from noise (Englewood Cliffs, NJ: Prentice-Hall) [3] Messenger C, Prix R and Papa M A 2009 Physical Review D [4] Berger J O 985 Statistical decision theory and Bayesian analysis 2nd ed (Springer-Verlag) [5] Ferguson T S 967 Mathematical Statistics: A Decision Theoretic Approach (New York: Academic Press) [6] Christensen N and Meyer R 998 Physical Review D [7] Umstätter R, Meyer R, Dupuis R, Veitch J, Woan G and Christensen N 2004 Classical and Quantum Gravity 2 S655 S665 [8] Metropolis N and Ulam S 949 Journal of the American Statistical Association [9] Gilks W R, Richardson S and Spiegelhalter D J 996 Markov chain Monte Carlo in practice (Boca Raton: Chapman & Hall / CRC) [0] Hukushima K and Nemoto K 996 Journal of the Physical Society of Japan [] Hansmann U H E 997 Chemical Physics Letters [2] Geyer C J 99 Computing Science and Statistics: Proceedings of the 23rd Symposium on the Interface ed Keramidas E M (Fairfax Station: Interface Foundation) pp [3] Press W H, Teukolsky S A, Vetterling W T and Flannery B P 992 Numerical recipes in C: The art of scientific computing (Cambridge: Cambridge University Press) [4] Röver C 2007 Bayesian inference on astrophysical binary inspirals based on gravitational-wave measurements Ph.D. thesis The University of Auckland URL [5] Röver C, Meyer R and Christensen N 2007 Physical Review D [6] van der Sluys M V, Röver C, Stroeer A, Christensen N, Kalogera V, Meyer R and Vecchio A 2008 The Astrophysical Journal Letters 688 L6 L64 [7] Raymond V, van der Sluys M V, Mandel I, Kalogera V, Röver C and Christensen N 2009 Classical and Quantum Gravity [8] Key J S and Cornish N J 2009 Physical Review D [9] Röver C, Meyer R, Guidi G M, Viceré A and Christensen N 2007 Classical and Quantum Gravity 24 S607 S65 [20] Owen B J and Sathyaprakash B S 999 Physical Review D [2] Chronopoulos A E and Apostolatos T A 200 Physical Review D [22] Umstätter R and Tinto M 2008 Physical Review D

Inference on inspiral signals using LISA MLDC data

Inference on inspiral signals using LISA MLDC data Inference on inspiral signals using LISA MLDC data Christian Röver 1, Alexander Stroeer 2,3, Ed Bloomer 4, Nelson Christensen 5, James Clark 4, Martin Hendry 4, Chris Messenger 4, Renate Meyer 1, Matt

More information

Parameter estimation for signals from compact binary inspirals injected into LIGO data

Parameter estimation for signals from compact binary inspirals injected into LIGO data IOP PUBLISHING Class. Quantum Grav. 26 (2009) 204010 (10pp) CLASSICAL AND QUANTUM GRAVITY doi:10.1088/0264-9381/26/20/204010 Parameter estimation for signals from compact binary inspirals injected into

More information

arxiv: v2 [gr-qc] 3 Apr 2007

arxiv: v2 [gr-qc] 3 Apr 2007 Inference on white dwarf binary systems using the first round Mock LISA Data Challenges data sets compiled: 25 October 218 arxiv:74.48v2 [gr-qc] 3 Apr 27 Alexander Stroeer 1,2, John Veitch 1, Christian

More information

Multimodal Nested Sampling

Multimodal Nested Sampling Multimodal Nested Sampling Farhan Feroz Astrophysics Group, Cavendish Lab, Cambridge Inverse Problems & Cosmology Most obvious example: standard CMB data analysis pipeline But many others: object detection,

More information

Markov Chain Monte Carlo methods

Markov Chain Monte Carlo methods Markov Chain Monte Carlo methods By Oleg Makhnin 1 Introduction a b c M = d e f g h i 0 f(x)dx 1.1 Motivation 1.1.1 Just here Supresses numbering 1.1.2 After this 1.2 Literature 2 Method 2.1 New math As

More information

Reminder of some Markov Chain properties:

Reminder of some Markov Chain properties: Reminder of some Markov Chain properties: 1. a transition from one state to another occurs probabilistically 2. only state that matters is where you currently are (i.e. given present, future is independent

More information

Degrees-of-freedom estimation in the Student-t noise model.

Degrees-of-freedom estimation in the Student-t noise model. LIGO-T0497 Degrees-of-freedom estimation in the Student-t noise model. Christian Röver September, 0 Introduction The Student-t noise model was introduced in [] as a robust alternative to the commonly used

More information

MCMC Sampling for Bayesian Inference using L1-type Priors

MCMC Sampling for Bayesian Inference using L1-type Priors MÜNSTER MCMC Sampling for Bayesian Inference using L1-type Priors (what I do whenever the ill-posedness of EEG/MEG is just not frustrating enough!) AG Imaging Seminar Felix Lucka 26.06.2012 , MÜNSTER Sampling

More information

Work of the LSC Pulsar Upper Limits Group (PULG) Graham Woan, University of Glasgow on behalf of the LIGO Scientific Collaboration

Work of the LSC Pulsar Upper Limits Group (PULG) Graham Woan, University of Glasgow on behalf of the LIGO Scientific Collaboration Work of the LSC Pulsar Upper Limits Group (PULG) Graham Woan, University of Glasgow on behalf of the LIGO Scientific Collaboration GWDAW 2003 1 Pulsar Upper Limits Group (PULG) Community of LSC members

More information

Computational statistics

Computational statistics Computational statistics Markov Chain Monte Carlo methods Thierry Denœux March 2017 Thierry Denœux Computational statistics March 2017 1 / 71 Contents of this chapter When a target density f can be evaluated

More information

Markov Chain Monte Carlo methods

Markov Chain Monte Carlo methods Markov Chain Monte Carlo methods Tomas McKelvey and Lennart Svensson Signal Processing Group Department of Signals and Systems Chalmers University of Technology, Sweden November 26, 2012 Today s learning

More information

On Markov chain Monte Carlo methods for tall data

On Markov chain Monte Carlo methods for tall data On Markov chain Monte Carlo methods for tall data Remi Bardenet, Arnaud Doucet, Chris Holmes Paper review by: David Carlson October 29, 2016 Introduction Many data sets in machine learning and computational

More information

Monte Carlo in Bayesian Statistics

Monte Carlo in Bayesian Statistics Monte Carlo in Bayesian Statistics Matthew Thomas SAMBa - University of Bath m.l.thomas@bath.ac.uk December 4, 2014 Matthew Thomas (SAMBa) Monte Carlo in Bayesian Statistics December 4, 2014 1 / 16 Overview

More information

Bayesian Inference for Discretely Sampled Diffusion Processes: A New MCMC Based Approach to Inference

Bayesian Inference for Discretely Sampled Diffusion Processes: A New MCMC Based Approach to Inference Bayesian Inference for Discretely Sampled Diffusion Processes: A New MCMC Based Approach to Inference Osnat Stramer 1 and Matthew Bognar 1 Department of Statistics and Actuarial Science, University of

More information

The Bias-Variance dilemma of the Monte Carlo. method. Technion - Israel Institute of Technology, Technion City, Haifa 32000, Israel

The Bias-Variance dilemma of the Monte Carlo. method. Technion - Israel Institute of Technology, Technion City, Haifa 32000, Israel The Bias-Variance dilemma of the Monte Carlo method Zlochin Mark 1 and Yoram Baram 1 Technion - Israel Institute of Technology, Technion City, Haifa 32000, Israel fzmark,baramg@cs.technion.ac.il Abstract.

More information

17 : Markov Chain Monte Carlo

17 : Markov Chain Monte Carlo 10-708: Probabilistic Graphical Models, Spring 2015 17 : Markov Chain Monte Carlo Lecturer: Eric P. Xing Scribes: Heran Lin, Bin Deng, Yun Huang 1 Review of Monte Carlo Methods 1.1 Overview Monte Carlo

More information

STA 4273H: Statistical Machine Learning

STA 4273H: Statistical Machine Learning STA 4273H: Statistical Machine Learning Russ Salakhutdinov Department of Computer Science! Department of Statistical Sciences! rsalakhu@cs.toronto.edu! h0p://www.cs.utoronto.ca/~rsalakhu/ Lecture 7 Approximate

More information

Monte Carlo Dynamically Weighted Importance Sampling for Spatial Models with Intractable Normalizing Constants

Monte Carlo Dynamically Weighted Importance Sampling for Spatial Models with Intractable Normalizing Constants Monte Carlo Dynamically Weighted Importance Sampling for Spatial Models with Intractable Normalizing Constants Faming Liang Texas A& University Sooyoung Cheon Korea University Spatial Model Introduction

More information

Bayesian methods in the search for gravitational waves

Bayesian methods in the search for gravitational waves Bayesian methods in the search for gravitational waves Reinhard Prix Albert-Einstein-Institut Hannover Bayes forum Garching, Oct 7 2016 Statistics as applied Probability Theory Probability Theory: extends

More information

Nested Sampling. Brendon J. Brewer. brewer/ Department of Statistics The University of Auckland

Nested Sampling. Brendon J. Brewer.   brewer/ Department of Statistics The University of Auckland Department of Statistics The University of Auckland https://www.stat.auckland.ac.nz/ brewer/ is a Monte Carlo method (not necessarily MCMC) that was introduced by John Skilling in 2004. It is very popular

More information

Bayesian search for other Earths

Bayesian search for other Earths Bayesian search for other Earths Low-mass planets orbiting nearby M dwarfs Mikko Tuomi University of Hertfordshire, Centre for Astrophysics Research Email: mikko.tuomi@utu.fi Presentation, 19.4.2013 1

More information

Detecting the next Galactic supernova

Detecting the next Galactic supernova Detecting the next Galactic supernova Nicolas Arnaud on behalf of the Virgo-LAL group Now fellow at LHCb-CERN Moriond Gravitation 2003 GW supernova amplitudes Burst online filter performances Comparison

More information

Bayesian Regression Linear and Logistic Regression

Bayesian Regression Linear and Logistic Regression When we want more than point estimates Bayesian Regression Linear and Logistic Regression Nicole Beckage Ordinary Least Squares Regression and Lasso Regression return only point estimates But what if we

More information

Learning the hyper-parameters. Luca Martino

Learning the hyper-parameters. Luca Martino Learning the hyper-parameters Luca Martino 2017 2017 1 / 28 Parameters and hyper-parameters 1. All the described methods depend on some choice of hyper-parameters... 2. For instance, do you recall λ (bandwidth

More information

Parameter Estimation. William H. Jefferys University of Texas at Austin Parameter Estimation 7/26/05 1

Parameter Estimation. William H. Jefferys University of Texas at Austin Parameter Estimation 7/26/05 1 Parameter Estimation William H. Jefferys University of Texas at Austin bill@bayesrules.net Parameter Estimation 7/26/05 1 Elements of Inference Inference problems contain two indispensable elements: Data

More information

eqr094: Hierarchical MCMC for Bayesian System Reliability

eqr094: Hierarchical MCMC for Bayesian System Reliability eqr094: Hierarchical MCMC for Bayesian System Reliability Alyson G. Wilson Statistical Sciences Group, Los Alamos National Laboratory P.O. Box 1663, MS F600 Los Alamos, NM 87545 USA Phone: 505-667-9167

More information

Estimating the parameters of gravitational waves from neutron stars using an adaptive MCMC method

Estimating the parameters of gravitational waves from neutron stars using an adaptive MCMC method INSTITUTE OF PHYSICS PUBLISHING Class. Quantum Grav. 1 (004) S1655 S1665 CLASSICAL AND QUANTUM GRAVITY PII: S064-9381(04)78801- Estimating the parameters of gravitational waves from neutron stars using

More information

CSC 2541: Bayesian Methods for Machine Learning

CSC 2541: Bayesian Methods for Machine Learning CSC 2541: Bayesian Methods for Machine Learning Radford M. Neal, University of Toronto, 2011 Lecture 3 More Markov Chain Monte Carlo Methods The Metropolis algorithm isn t the only way to do MCMC. We ll

More information

IN DETAIL 20 SIGNIFICANCE April 2016

IN DETAIL 20 SIGNIFICANCE April 2016 IN DETAIL 20 SIGNIFICANCE April 2016 IN DETAIL Gravitational waves: A statistical autopsy of a black hole merger Renate Meyer and Nelson Christensen explain how statistics and statisticians helped unravel

More information

LECTURE 15 Markov chain Monte Carlo

LECTURE 15 Markov chain Monte Carlo LECTURE 15 Markov chain Monte Carlo There are many settings when posterior computation is a challenge in that one does not have a closed form expression for the posterior distribution. Markov chain Monte

More information

Basic math for biology

Basic math for biology Basic math for biology Lei Li Florida State University, Feb 6, 2002 The EM algorithm: setup Parametric models: {P θ }. Data: full data (Y, X); partial data Y. Missing data: X. Likelihood and maximum likelihood

More information

Hastings-within-Gibbs Algorithm: Introduction and Application on Hierarchical Model

Hastings-within-Gibbs Algorithm: Introduction and Application on Hierarchical Model UNIVERSITY OF TEXAS AT SAN ANTONIO Hastings-within-Gibbs Algorithm: Introduction and Application on Hierarchical Model Liang Jing April 2010 1 1 ABSTRACT In this paper, common MCMC algorithms are introduced

More information

Doing Bayesian Integrals

Doing Bayesian Integrals ASTR509-13 Doing Bayesian Integrals The Reverend Thomas Bayes (c.1702 1761) Philosopher, theologian, mathematician Presbyterian (non-conformist) minister Tunbridge Wells, UK Elected FRS, perhaps due to

More information

Bayesian Estimation with Sparse Grids

Bayesian Estimation with Sparse Grids Bayesian Estimation with Sparse Grids Kenneth L. Judd and Thomas M. Mertens Institute on Computational Economics August 7, 27 / 48 Outline Introduction 2 Sparse grids Construction Integration with sparse

More information

Parameter estimation and forecasting. Cristiano Porciani AIfA, Uni-Bonn

Parameter estimation and forecasting. Cristiano Porciani AIfA, Uni-Bonn Parameter estimation and forecasting Cristiano Porciani AIfA, Uni-Bonn Questions? C. Porciani Estimation & forecasting 2 Temperature fluctuations Variance at multipole l (angle ~180o/l) C. Porciani Estimation

More information

A Convolution Method for Folding Systematic Uncertainties into Likelihood Functions

A Convolution Method for Folding Systematic Uncertainties into Likelihood Functions CDF/MEMO/STATISTICS/PUBLIC/5305 Version 1.00 June 24, 2005 A Convolution Method for Folding Systematic Uncertainties into Likelihood Functions Luc Demortier Laboratory of Experimental High-Energy Physics

More information

A Search and Jump Algorithm for Markov Chain Monte Carlo Sampling. Christopher Jennison. Adriana Ibrahim. Seminar at University of Kuwait

A Search and Jump Algorithm for Markov Chain Monte Carlo Sampling. Christopher Jennison. Adriana Ibrahim. Seminar at University of Kuwait A Search and Jump Algorithm for Markov Chain Monte Carlo Sampling Christopher Jennison Department of Mathematical Sciences, University of Bath, UK http://people.bath.ac.uk/mascj Adriana Ibrahim Institute

More information

Bayesian Inference and MCMC

Bayesian Inference and MCMC Bayesian Inference and MCMC Aryan Arbabi Partly based on MCMC slides from CSC412 Fall 2018 1 / 18 Bayesian Inference - Motivation Consider we have a data set D = {x 1,..., x n }. E.g each x i can be the

More information

Lecture 7 and 8: Markov Chain Monte Carlo

Lecture 7 and 8: Markov Chain Monte Carlo Lecture 7 and 8: Markov Chain Monte Carlo 4F13: Machine Learning Zoubin Ghahramani and Carl Edward Rasmussen Department of Engineering University of Cambridge http://mlg.eng.cam.ac.uk/teaching/4f13/ Ghahramani

More information

Eco517 Fall 2013 C. Sims MCMC. October 8, 2013

Eco517 Fall 2013 C. Sims MCMC. October 8, 2013 Eco517 Fall 2013 C. Sims MCMC October 8, 2013 c 2013 by Christopher A. Sims. This document may be reproduced for educational and research purposes, so long as the copies contain this notice and are retained

More information

Recursive Deviance Information Criterion for the Hidden Markov Model

Recursive Deviance Information Criterion for the Hidden Markov Model International Journal of Statistics and Probability; Vol. 5, No. 1; 2016 ISSN 1927-7032 E-ISSN 1927-7040 Published by Canadian Center of Science and Education Recursive Deviance Information Criterion for

More information

Bayesian data analysis in practice: Three simple examples

Bayesian data analysis in practice: Three simple examples Bayesian data analysis in practice: Three simple examples Martin P. Tingley Introduction These notes cover three examples I presented at Climatea on 5 October 0. Matlab code is available by request to

More information

A Bayesian perspective on GMM and IV

A Bayesian perspective on GMM and IV A Bayesian perspective on GMM and IV Christopher A. Sims Princeton University sims@princeton.edu November 26, 2013 What is a Bayesian perspective? A Bayesian perspective on scientific reporting views all

More information

Monte Carlo Methods for Computation and Optimization (048715)

Monte Carlo Methods for Computation and Optimization (048715) Technion Department of Electrical Engineering Monte Carlo Methods for Computation and Optimization (048715) Lecture Notes Prof. Nahum Shimkin Spring 2015 i PREFACE These lecture notes are intended for

More information

Introduction to Machine Learning CMU-10701

Introduction to Machine Learning CMU-10701 Introduction to Machine Learning CMU-10701 Markov Chain Monte Carlo Methods Barnabás Póczos & Aarti Singh Contents Markov Chain Monte Carlo Methods Goal & Motivation Sampling Rejection Importance Markov

More information

Bayesian Methods for Machine Learning

Bayesian Methods for Machine Learning Bayesian Methods for Machine Learning CS 584: Big Data Analytics Material adapted from Radford Neal s tutorial (http://ftp.cs.utoronto.ca/pub/radford/bayes-tut.pdf), Zoubin Ghahramni (http://hunch.net/~coms-4771/zoubin_ghahramani_bayesian_learning.pdf),

More information

Bayesian model selection for computer model validation via mixture model estimation

Bayesian model selection for computer model validation via mixture model estimation Bayesian model selection for computer model validation via mixture model estimation Kaniav Kamary ATER, CNAM Joint work with É. Parent, P. Barbillon, M. Keller and N. Bousquet Outline Computer model validation

More information

A Semi-parametric Bayesian Framework for Performance Analysis of Call Centers

A Semi-parametric Bayesian Framework for Performance Analysis of Call Centers Proceedings 59th ISI World Statistics Congress, 25-30 August 2013, Hong Kong (Session STS065) p.2345 A Semi-parametric Bayesian Framework for Performance Analysis of Call Centers Bangxian Wu and Xiaowei

More information

STA 4273H: Sta-s-cal Machine Learning

STA 4273H: Sta-s-cal Machine Learning STA 4273H: Sta-s-cal Machine Learning Russ Salakhutdinov Department of Computer Science! Department of Statistical Sciences! rsalakhu@cs.toronto.edu! h0p://www.cs.utoronto.ca/~rsalakhu/ Lecture 2 In our

More information

Approximate Bayesian Computation: a simulation based approach to inference

Approximate Bayesian Computation: a simulation based approach to inference Approximate Bayesian Computation: a simulation based approach to inference Richard Wilkinson Simon Tavaré 2 Department of Probability and Statistics University of Sheffield 2 Department of Applied Mathematics

More information

Introduction to Bayesian Statistics and Markov Chain Monte Carlo Estimation. EPSY 905: Multivariate Analysis Spring 2016 Lecture #10: April 6, 2016

Introduction to Bayesian Statistics and Markov Chain Monte Carlo Estimation. EPSY 905: Multivariate Analysis Spring 2016 Lecture #10: April 6, 2016 Introduction to Bayesian Statistics and Markov Chain Monte Carlo Estimation EPSY 905: Multivariate Analysis Spring 2016 Lecture #10: April 6, 2016 EPSY 905: Intro to Bayesian and MCMC Today s Class An

More information

Monte Carlo Methods. Leon Gu CSD, CMU

Monte Carlo Methods. Leon Gu CSD, CMU Monte Carlo Methods Leon Gu CSD, CMU Approximate Inference EM: y-observed variables; x-hidden variables; θ-parameters; E-step: q(x) = p(x y, θ t 1 ) M-step: θ t = arg max E q(x) [log p(y, x θ)] θ Monte

More information

Principles of Bayesian Inference

Principles of Bayesian Inference Principles of Bayesian Inference Sudipto Banerjee University of Minnesota July 20th, 2008 1 Bayesian Principles Classical statistics: model parameters are fixed and unknown. A Bayesian thinks of parameters

More information

Chapter 12 PAWL-Forced Simulated Tempering

Chapter 12 PAWL-Forced Simulated Tempering Chapter 12 PAWL-Forced Simulated Tempering Luke Bornn Abstract In this short note, we show how the parallel adaptive Wang Landau (PAWL) algorithm of Bornn et al. (J Comput Graph Stat, to appear) can be

More information

Prediction of Data with help of the Gaussian Process Method

Prediction of Data with help of the Gaussian Process Method of Data with help of the Gaussian Process Method R. Preuss, U. von Toussaint Max-Planck-Institute for Plasma Physics EURATOM Association 878 Garching, Germany March, Abstract The simulation of plasma-wall

More information

Introduction to Bayes

Introduction to Bayes Introduction to Bayes Alan Heavens September 3, 2018 ICIC Data Analysis Workshop Alan Heavens Introduction to Bayes September 3, 2018 1 / 35 Overview 1 Inverse Problems 2 The meaning of probability Probability

More information

A quick introduction to Markov chains and Markov chain Monte Carlo (revised version)

A quick introduction to Markov chains and Markov chain Monte Carlo (revised version) A quick introduction to Markov chains and Markov chain Monte Carlo (revised version) Rasmus Waagepetersen Institute of Mathematical Sciences Aalborg University 1 Introduction These notes are intended to

More information

(5) Multi-parameter models - Gibbs sampling. ST440/540: Applied Bayesian Analysis

(5) Multi-parameter models - Gibbs sampling. ST440/540: Applied Bayesian Analysis Summarizing a posterior Given the data and prior the posterior is determined Summarizing the posterior gives parameter estimates, intervals, and hypothesis tests Most of these computations are integrals

More information

An introduction to Markov Chain Monte Carlo techniques

An introduction to Markov Chain Monte Carlo techniques An introduction to Markov Chain Monte Carlo techniques G. J. A. Harker University of Colorado ASTR5550, 19th March 2012 Outline Introduction Bayesian inference: recap MCMC: when to use it and why A simple

More information

Forward Problems and their Inverse Solutions

Forward Problems and their Inverse Solutions Forward Problems and their Inverse Solutions Sarah Zedler 1,2 1 King Abdullah University of Science and Technology 2 University of Texas at Austin February, 2013 Outline 1 Forward Problem Example Weather

More information

Making rating curves - the Bayesian approach

Making rating curves - the Bayesian approach Making rating curves - the Bayesian approach Rating curves what is wanted? A best estimate of the relationship between stage and discharge at a given place in a river. The relationship should be on the

More information

Detection ASTR ASTR509 Jasper Wall Fall term. William Sealey Gosset

Detection ASTR ASTR509 Jasper Wall Fall term. William Sealey Gosset ASTR509-14 Detection William Sealey Gosset 1876-1937 Best known for his Student s t-test, devised for handling small samples for quality control in brewing. To many in the statistical world "Student" was

More information

STAT 425: Introduction to Bayesian Analysis

STAT 425: Introduction to Bayesian Analysis STAT 425: Introduction to Bayesian Analysis Marina Vannucci Rice University, USA Fall 2017 Marina Vannucci (Rice University, USA) Bayesian Analysis (Part 2) Fall 2017 1 / 19 Part 2: Markov chain Monte

More information

Adaptive Monte Carlo methods

Adaptive Monte Carlo methods Adaptive Monte Carlo methods Jean-Michel Marin Projet Select, INRIA Futurs, Université Paris-Sud joint with Randal Douc (École Polytechnique), Arnaud Guillin (Université de Marseille) and Christian Robert

More information

Markov chain Monte Carlo

Markov chain Monte Carlo Markov chain Monte Carlo Karl Oskar Ekvall Galin L. Jones University of Minnesota March 12, 2019 Abstract Practically relevant statistical models often give rise to probability distributions that are analytically

More information

The Bayesian Approach to Multi-equation Econometric Model Estimation

The Bayesian Approach to Multi-equation Econometric Model Estimation Journal of Statistical and Econometric Methods, vol.3, no.1, 2014, 85-96 ISSN: 2241-0384 (print), 2241-0376 (online) Scienpress Ltd, 2014 The Bayesian Approach to Multi-equation Econometric Model Estimation

More information

MCMC for big data. Geir Storvik. BigInsight lunch - May Geir Storvik MCMC for big data BigInsight lunch - May / 17

MCMC for big data. Geir Storvik. BigInsight lunch - May Geir Storvik MCMC for big data BigInsight lunch - May / 17 MCMC for big data Geir Storvik BigInsight lunch - May 2 2018 Geir Storvik MCMC for big data BigInsight lunch - May 2 2018 1 / 17 Outline Why ordinary MCMC is not scalable Different approaches for making

More information

MONTE CARLO METHODS. Hedibert Freitas Lopes

MONTE CARLO METHODS. Hedibert Freitas Lopes MONTE CARLO METHODS Hedibert Freitas Lopes The University of Chicago Booth School of Business 5807 South Woodlawn Avenue, Chicago, IL 60637 http://faculty.chicagobooth.edu/hedibert.lopes hlopes@chicagobooth.edu

More information

Bayesian Estimation of DSGE Models 1 Chapter 3: A Crash Course in Bayesian Inference

Bayesian Estimation of DSGE Models 1 Chapter 3: A Crash Course in Bayesian Inference 1 The views expressed in this paper are those of the authors and do not necessarily reflect the views of the Federal Reserve Board of Governors or the Federal Reserve System. Bayesian Estimation of DSGE

More information

Prequential Analysis

Prequential Analysis Prequential Analysis Philip Dawid University of Cambridge NIPS 2008 Tutorial Forecasting 2 Context and purpose...................................................... 3 One-step Forecasts.......................................................

More information

Advanced Introduction to Machine Learning

Advanced Introduction to Machine Learning 10-715 Advanced Introduction to Machine Learning Homework 3 Due Nov 12, 10.30 am Rules 1. Homework is due on the due date at 10.30 am. Please hand over your homework at the beginning of class. Please see

More information

On the Optimal Scaling of the Modified Metropolis-Hastings algorithm

On the Optimal Scaling of the Modified Metropolis-Hastings algorithm On the Optimal Scaling of the Modified Metropolis-Hastings algorithm K. M. Zuev & J. L. Beck Division of Engineering and Applied Science California Institute of Technology, MC 4-44, Pasadena, CA 925, USA

More information

arxiv:gr-qc/ v1 28 Sep 2006

arxiv:gr-qc/ v1 28 Sep 2006 Coherent Bayesian inference on compact binary inspirals using a network of interferometric gravitational wave detectors Christian Röver and Renate Meyer Department of Statistics, The University of Auckland,

More information

The Ising model and Markov chain Monte Carlo

The Ising model and Markov chain Monte Carlo The Ising model and Markov chain Monte Carlo Ramesh Sridharan These notes give a short description of the Ising model for images and an introduction to Metropolis-Hastings and Gibbs Markov Chain Monte

More information

Monte Carlo Integration using Importance Sampling and Gibbs Sampling

Monte Carlo Integration using Importance Sampling and Gibbs Sampling Monte Carlo Integration using Importance Sampling and Gibbs Sampling Wolfgang Hörmann and Josef Leydold Department of Statistics University of Economics and Business Administration Vienna Austria hormannw@boun.edu.tr

More information

27 : Distributed Monte Carlo Markov Chain. 1 Recap of MCMC and Naive Parallel Gibbs Sampling

27 : Distributed Monte Carlo Markov Chain. 1 Recap of MCMC and Naive Parallel Gibbs Sampling 10-708: Probabilistic Graphical Models 10-708, Spring 2014 27 : Distributed Monte Carlo Markov Chain Lecturer: Eric P. Xing Scribes: Pengtao Xie, Khoa Luu In this scribe, we are going to review the Parallel

More information

MCMC: Markov Chain Monte Carlo

MCMC: Markov Chain Monte Carlo I529: Machine Learning in Bioinformatics (Spring 2013) MCMC: Markov Chain Monte Carlo Yuzhen Ye School of Informatics and Computing Indiana University, Bloomington Spring 2013 Contents Review of Markov

More information

Bayesian Inference in Astronomy & Astrophysics A Short Course

Bayesian Inference in Astronomy & Astrophysics A Short Course Bayesian Inference in Astronomy & Astrophysics A Short Course Tom Loredo Dept. of Astronomy, Cornell University p.1/37 Five Lectures Overview of Bayesian Inference From Gaussians to Periodograms Learning

More information

Markov Chain Monte Carlo

Markov Chain Monte Carlo Markov Chain Monte Carlo Recall: To compute the expectation E ( h(y ) ) we use the approximation E(h(Y )) 1 n n h(y ) t=1 with Y (1),..., Y (n) h(y). Thus our aim is to sample Y (1),..., Y (n) from f(y).

More information

Markov Chain Monte Carlo (MCMC) and Model Evaluation. August 15, 2017

Markov Chain Monte Carlo (MCMC) and Model Evaluation. August 15, 2017 Markov Chain Monte Carlo (MCMC) and Model Evaluation August 15, 2017 Frequentist Linking Frequentist and Bayesian Statistics How can we estimate model parameters and what does it imply? Want to find the

More information

Introduction to Gaussian Processes

Introduction to Gaussian Processes Introduction to Gaussian Processes Iain Murray murray@cs.toronto.edu CSC255, Introduction to Machine Learning, Fall 28 Dept. Computer Science, University of Toronto The problem Learn scalar function of

More information

Introduction. Chapter 1

Introduction. Chapter 1 Chapter 1 Introduction In this book we will be concerned with supervised learning, which is the problem of learning input-output mappings from empirical data (the training dataset). Depending on the characteristics

More information

Searching for Gravitational Waves from Coalescing Binary Systems

Searching for Gravitational Waves from Coalescing Binary Systems Searching for Gravitational Waves from Coalescing Binary Systems Stephen Fairhurst Cardiff University and LIGO Scientific Collaboration 1 Outline Motivation Searching for Coalescing Binaries Latest Results

More information

arxiv: v2 [gr-qc] 14 Feb 2015

arxiv: v2 [gr-qc] 14 Feb 2015 Postprocessing methods used in the search for continuous gravitational-wave signals from the Galactic Center arxiv:141.5997v2 [gr-qc] 14 Feb 215 Berit Behnke, 1, a Maria Alessandra Papa, 1, 2, b and Reinhard

More information

Bayesian Statistical Methods. Jeff Gill. Department of Political Science, University of Florida

Bayesian Statistical Methods. Jeff Gill. Department of Political Science, University of Florida Bayesian Statistical Methods Jeff Gill Department of Political Science, University of Florida 234 Anderson Hall, PO Box 117325, Gainesville, FL 32611-7325 Voice: 352-392-0262x272, Fax: 352-392-8127, Email:

More information

A Review of Pseudo-Marginal Markov Chain Monte Carlo

A Review of Pseudo-Marginal Markov Chain Monte Carlo A Review of Pseudo-Marginal Markov Chain Monte Carlo Discussed by: Yizhe Zhang October 21, 2016 Outline 1 Overview 2 Paper review 3 experiment 4 conclusion Motivation & overview Notation: θ denotes the

More information

The Particle Filter. PD Dr. Rudolph Triebel Computer Vision Group. Machine Learning for Computer Vision

The Particle Filter. PD Dr. Rudolph Triebel Computer Vision Group. Machine Learning for Computer Vision The Particle Filter Non-parametric implementation of Bayes filter Represents the belief (posterior) random state samples. by a set of This representation is approximate. Can represent distributions that

More information

Tutorial on ABC Algorithms

Tutorial on ABC Algorithms Tutorial on ABC Algorithms Dr Chris Drovandi Queensland University of Technology, Australia c.drovandi@qut.edu.au July 3, 2014 Notation Model parameter θ with prior π(θ) Likelihood is f(ý θ) with observed

More information

A note on Reversible Jump Markov Chain Monte Carlo

A note on Reversible Jump Markov Chain Monte Carlo A note on Reversible Jump Markov Chain Monte Carlo Hedibert Freitas Lopes Graduate School of Business The University of Chicago 5807 South Woodlawn Avenue Chicago, Illinois 60637 February, 1st 2006 1 Introduction

More information

1 Using standard errors when comparing estimated values

1 Using standard errors when comparing estimated values MLPR Assignment Part : General comments Below are comments on some recurring issues I came across when marking the second part of the assignment, which I thought it would help to explain in more detail

More information

Verifying Regularity Conditions for Logit-Normal GLMM

Verifying Regularity Conditions for Logit-Normal GLMM Verifying Regularity Conditions for Logit-Normal GLMM Yun Ju Sung Charles J. Geyer January 10, 2006 In this note we verify the conditions of the theorems in Sung and Geyer (submitted) for the Logit-Normal

More information

1 Probabilities. 1.1 Basics 1 PROBABILITIES

1 Probabilities. 1.1 Basics 1 PROBABILITIES 1 PROBABILITIES 1 Probabilities Probability is a tricky word usually meaning the likelyhood of something occuring or how frequent something is. Obviously, if something happens frequently, then its probability

More information

Advanced Statistical Modelling

Advanced Statistical Modelling Markov chain Monte Carlo (MCMC) Methods and Their Applications in Bayesian Statistics School of Technology and Business Studies/Statistics Dalarna University Borlänge, Sweden. Feb. 05, 2014. Outlines 1

More information

Connections between score matching, contrastive divergence, and pseudolikelihood for continuous-valued variables. Revised submission to IEEE TNN

Connections between score matching, contrastive divergence, and pseudolikelihood for continuous-valued variables. Revised submission to IEEE TNN Connections between score matching, contrastive divergence, and pseudolikelihood for continuous-valued variables Revised submission to IEEE TNN Aapo Hyvärinen Dept of Computer Science and HIIT University

More information

ABC methods for phase-type distributions with applications in insurance risk problems

ABC methods for phase-type distributions with applications in insurance risk problems ABC methods for phase-type with applications problems Concepcion Ausin, Department of Statistics, Universidad Carlos III de Madrid Joint work with: Pedro Galeano, Universidad Carlos III de Madrid Simon

More information

Stat 535 C - Statistical Computing & Monte Carlo Methods. Lecture February Arnaud Doucet

Stat 535 C - Statistical Computing & Monte Carlo Methods. Lecture February Arnaud Doucet Stat 535 C - Statistical Computing & Monte Carlo Methods Lecture 13-28 February 2006 Arnaud Doucet Email: arnaud@cs.ubc.ca 1 1.1 Outline Limitations of Gibbs sampling. Metropolis-Hastings algorithm. Proof

More information

arxiv: v1 [gr-qc] 11 Aug 2014

arxiv: v1 [gr-qc] 11 Aug 2014 Testing general relativity with compact coalescing binaries: comparing exact and predictive methods to compute the Bayes factor arxiv:1408.2356v1 [gr-qc] 11 Aug 2014 Walter Del Pozzo, Katherine Grover,

More information

1 Probabilities. 1.1 Basics 1 PROBABILITIES

1 Probabilities. 1.1 Basics 1 PROBABILITIES 1 PROBABILITIES 1 Probabilities Probability is a tricky word usually meaning the likelyhood of something occuring or how frequent something is. Obviously, if something happens frequently, then its probability

More information

MCMC algorithms for fitting Bayesian models

MCMC algorithms for fitting Bayesian models MCMC algorithms for fitting Bayesian models p. 1/1 MCMC algorithms for fitting Bayesian models Sudipto Banerjee sudiptob@biostat.umn.edu University of Minnesota MCMC algorithms for fitting Bayesian models

More information

Who was Bayes? Bayesian Phylogenetics. What is Bayes Theorem?

Who was Bayes? Bayesian Phylogenetics. What is Bayes Theorem? Who was Bayes? Bayesian Phylogenetics Bret Larget Departments of Botany and of Statistics University of Wisconsin Madison October 6, 2011 The Reverand Thomas Bayes was born in London in 1702. He was the

More information