A Bayesian method for the analysis of deterministic and stochastic time series

Similar documents
Inference of cluster distance and geometry from astrometry

Patterns in astronomical impacts on the Earth: Testing the claims

Bayesian Regression Linear and Logistic Regression

Optical variability of quasars: damped random walk Željko Ivezić, University of Washington with Chelsea MacLeod, U.S.

Turbulent Flows. g u

Time Delay Estimation: Microlensing

Large Scale Bayesian Inference

arxiv:astro-ph/ v1 14 Sep 2005

Introduction to Systems Analysis and Decision Making Prepared by: Jakub Tomczak

Signal Modeling, Statistical Inference and Data Mining in Astrophysics

Combining probabilities

Ornstein-Uhlenbeck processes for geophysical data analysis

ENSC327 Communications Systems 19: Random Processes. Jie Liang School of Engineering Science Simon Fraser University

GENERATION OF COLORED NOISE

Stat 535 C - Statistical Computing & Monte Carlo Methods. Arnaud Doucet.

Nonparametric Bayesian Methods - Lecture I

Stochastic Processes. A stochastic process is a function of two variables:

Introduction to Bayesian Data Analysis

Physics 403. Segev BenZvi. Numerical Methods, Maximum Likelihood, and Least Squares. Department of Physics and Astronomy University of Rochester

The Gaia Mission. Coryn Bailer-Jones Max Planck Institute for Astronomy Heidelberg, Germany. ISYA 2016, Tehran

Covariance Matrix Simplification For Efficient Uncertainty Management

Sub-kilometer-scale space-time stochastic rainfall simulation

STA414/2104. Lecture 11: Gaussian Processes. Department of Statistics

5 Applying the Fokker-Planck equation

Kurume University Faculty of Economics Monograph Collection 18. Theoretical Advances and Applications in. Operations Research. Kyushu University Press

Lecture 1: Pragmatic Introduction to Stochastic Differential Equations

State Space Representation of Gaussian Processes

ELEG 3143 Probability & Stochastic Process Ch. 6 Stochastic Process

Infer relationships among three species: Outgroup:

X t = a t + r t, (7.1)

Lecture 6: Bayesian Inference in SDE Models

Asymptotics for posterior hazards

ECE531 Lecture 2b: Bayesian Hypothesis Testing

An Introduction to Bayesian Linear Regression

Miscellany : Long Run Behavior of Bayesian Methods; Bayesian Experimental Design (Lecture 4)

PART I INTRODUCTION The meaning of probability Basic definitions for frequentist statistics and Bayesian inference Bayesian inference Combinatorics

New Statistical Methods That Improve on MLE and GLM Including for Reserve Modeling GARY G VENTER

Gaussian processes for inference in stochastic differential equations

PROBABILITY: LIMIT THEOREMS II, SPRING HOMEWORK PROBLEMS

Monte Carlo Markov Chains: A Brief Introduction and Implementation. Jennifer Helsby Astro 321

Effects of Forcing Scheme on the Flow and the Relative Motion of Inertial Particles in DNS of Isotropic Turbulence

Dynamic Multipath Estimation by Sequential Monte Carlo Methods

STA 4273H: Sta-s-cal Machine Learning

PROBABILITY: LIMIT THEOREMS II, SPRING HOMEWORK PROBLEMS

Lecture 3. Linear Regression II Bastian Leibe RWTH Aachen

A6523 Modeling, Inference, and Mining Jim Cordes, Cornell University

Stochastic Spectral Approaches to Bayesian Inference

Bayesian Modeling of Conditional Distributions

Gaussian Process Approximations of Stochastic Differential Equations

SCUOLA DI SPECIALIZZAZIONE IN FISICA MEDICA. Sistemi di Elaborazione dell Informazione. Regressione. Ruggero Donida Labati

Stochastic Modeling of Astronomical Time Series

Contrasting measures of irreversibility in stochastic and deterministic dynamics

Handbook of Stochastic Methods

Introduction to machine learning and pattern recognition Lecture 2 Coryn Bailer-Jones

Bayesian methods in the search for gravitational waves

Model Selection for Gaussian Processes

Numerical Solutions of ODEs by Gaussian (Kalman) Filtering

Correlation integral likelihood for stochastic differential equations

Partial factor modeling: predictor-dependent shrinkage for linear regression

Stochastic Processes

Introduction to Stochastic SIR Model

Variations. ECE 6540, Lecture 10 Maximum Likelihood Estimation

Signals and Spectra - Review

Bayesian statistics. DS GA 1002 Statistical and Mathematical Models. Carlos Fernandez-Granda

08. Brownian Motion. University of Rhode Island. Gerhard Müller University of Rhode Island,

Robust and accurate inference via a mixture of Gaussian and terrors

Latent state estimation using control theory

Approximate Inference Part 1 of 2

Simulation of DART Buoy Data

Additional Keplerian Signals in the HARPS data for Gliese 667C from a Bayesian re-analysis

AE2160 Introduction to Experimental Methods in Aerospace

Approximate Inference Part 1 of 2

Bayesian model selection in graphs by using BDgraph package

Gaussian Process Approximations of Stochastic Differential Equations

Common noise vs Coupling in Oscillator Populations

Markov chain optimisation for energy systems (MC-ES)

Effects of Forcing Scheme on the Flow and the Relative Motion of Inertial Particles in DNS of Isotropic Turbulence

SDE Coefficients. March 4, 2008

1 Elementary probability

Information geometry for bivariate distribution control

STA414/2104 Statistical Methods for Machine Learning II

HMM part 1. Dr Philip Jackson

Local vs. Nonlocal Diffusions A Tale of Two Laplacians

Empirical Bayes interpretations of random point events

An example of Bayesian reasoning Consider the one-dimensional deconvolution problem with various degrees of prior information.

Consistent Downscaling of Seismic Inversions to Cornerpoint Flow Models SPE

System Identification, Lecture 4

Practical Bayesian Optimization of Machine Learning. Learning Algorithms

Some mathematical models from population genetics

Lagrangian Data Assimilation and Its Application to Geophysical Fluid Flows

Week 9 Generators, duality, change of measure

System Identification, Lecture 4

Probabilistic Models in Theoretical Neuroscience

Determining the GRB (Redshift, Luminosity)-Distribution Using Burst Variability

Models for models. Douglas Nychka Geophysical Statistics Project National Center for Atmospheric Research

Bayesian Mass Estimates of the Milky Way: incorporating incomplete data

Stochastic Processes. M. Sami Fadali Professor of Electrical Engineering University of Nevada, Reno

Computer Vision Group Prof. Daniel Cremers. 2. Regression (cont.)

Bayesian rules of probability as principles of logic [Cox] Notation: pr(x I) is the probability (or pdf) of x being true given information I

Probing the covariance matrix

Transcription:

A Bayesian method for the analysis of deterministic and stochastic time series Coryn Bailer-Jones Max Planck Institute for Astronomy, Heidelberg DPG, Berlin, March 2015

Time series modelling heteroscedastic, asymmetric noise on time and signal non-uniform time sampling measured signal, y 20 0 20 40 60 0 20 40 60 80 100 measured time, s Measured data D j =(s j,y j ) and uncertainties j =( sj, y j ) Model M with parameters Likelihood of single data point: integrate over unknown true time (t) and signal (z) Z P (D j j,,m)= P (D j t j,z j, j) t j,z j {z } Measurement model P (t j,z j,m) {z } Time series model dt j dz j

Model comparison Likelihood of all data points is P (D,,M)= Y j P (D j j,,m) Evidence is the likelihood marginalized over the parameter prior P (D,M)= Z P (D,,M) {z } likelihood P ( M) {z } prior More robust alternative is the leave-one-out cross validation likelihood P (D j D j,,m)= Z P (D j j,,m) {z } likelihood d P ( D j, j,m) d {z } posterior L CV = j=j Y j=1 P (D j D j,,m) Calculate integrals by MCMC sampling of posterior

Time series model Deterministic mean plus stochastic variation of constant variance P (z j t j,,m)= 1 p 2! e (z j (t j )) 2 /2! 2 Gaussian (t j )= a 2 cos[2 ( t + )] + b sinusoidal true signal z 0.04 0.00 0.02 0.04 0 20 40 60 80 100 true time t red solid: deterministic component red dashed: standard deviation of stochastic component black: true data

Time series model Ornstein-Uhlenbeck process A Stationary, Markov, Gaussian process dz(t) = 1 z(t)dt + c1/2 N (t;0,dt) c relaxation time di usion constant P (z j t j,,m)= 1 p 2 Vz e (z j µ z ) 2 /2V z with µ z = z 0 V z = c 2 (1 2 ) 9 = ; where = e (t t 0)/ for t>t 0

Examples of OU process realizations relaxation time, 1 10 100 1000 signal 15 5 0 5 10 15 5 0 5 10 15 5 0 5 10 0 20 60 100 0 20 60 100 0 20 60 100 0 20 60 100 time different randomisations

Luminosity variations in ultra cool dwarf stars 2m0345 2m0913 0.06 0.00 0.2 0.0 2m1145a 2m1145b 0.06 0.02 0 20 40 60 0.05 0.05 2m1146 2m1334 signal / mag 0.04 0.02 0.10 0.05 calar3 0 5 10 15 20 25 0.06 0.00 0.02 0.02 sdss0539 0 20 40 60 sori31 sori33 0.03 0.01 0.02 0.01 sori45 0.2 0.0 time / hrs

Luminosity variations in ultra cool dwarf stars Models compared: constant (variability just due to measurement noise) constant with Gaussian stochastic component sinusoid with Gaussian stochastic component OU process

Luminosity variations in ultra cool dwarf stars 2m0345 2m0913 0.06 0.00 0.2 0.0 2m1145a 2m1145b OU process 0.06 0.02 0 20 40 60 0.05 0.05 2m1146 2m1334 Sinusoid (8.3h, 13.3h) signal / mag 0.10 0.05 0.04 0.02 calar3 0 5 10 15 20 25 0.02 0.02 0.06 0.00 sdss0539 0 20 40 60 Sinusoid + stochastic sori31 sori33 0.03 0.01 0.02 0.01 sori45 0.2 0.0 time / hrs

Periodicity in biodiversity over past 550 Myr? Rohde & Muller 2005 no. genera (- cubic fit) 800 400 0 400 550 450 350 250 150 50 time BP / Myr periodic model with additional fitted Gaussian noise black = data red = model fit no. genera (- cubic fit) 800 400 0 400 550 450 350 250 150 50 stochastic process (OU process) CV likelihood is much higher for this model time BP / Myr

Summary a Bayesian method for modelling times series arbitrary time sampling and error models deterministic and stochastic times series use of cross-validation likelihood, a robust alternative to the evidence applications light curves of some very cool stars (and quasars) evolve stochastically no evidence for periodic variation of biodiversity over past 550 Myr more information and software: tinyurl.com/ctsmod

Ultra cool dwarf model comparison results Table 4. Log (base 10) LOO-CV likelihood of each model relative to that for the no-model for each light curve (log L LOO CV log L NM ). Light curve OUprocess Off+Stoch Sin Sin+Stoch Off+Sin+Stoch No-model p-value 2m0345 3.26 2.07 0.15 2.06 2.66 13.60 4e-4 2m0913 0.44 0.72 0.23 0.97 0.10 53.39 7e-4 2m1145a 15.23 8.59 3.01 12.26 11.70 63.83 <1e-9 2m1145b 0.73 1.96 2.00 2.69 2.95 39.71 1e-3 2m1146 0.67 0.56 0.08 0.21 1.17 26.83 3e-3 2m1334 14.95 12.82 4.06 16.86 16.12 65.88 1e-9 sdss0539 5.50 1.99 4.93 4.48 4.67 19.62 3e-5 calar3 3.60 1.43 5.65 5.11 4.28 28.06 6e-4 sori31 2.04 2.12 1.02 2.59 1.90 11.16 4e-5 sori33 1.49 0.66 2.14 1.85 2.12 8.39 2e-3 sori45 6.70 4.32 5.08 6.23 6.32 29.93 5e-9 Notes. The penultimate column gives the value of the log likelihood for the no-model, log L NM.Thelastcolumnisthep-valueforthehypothesis test from BJM.

Parameter posterior PDFs: 0 20 40 60 0 5 10 20 30 0 2 4 6 8 0.05 0.10 0.15 frequency, ν / hr 1 0.02 0.06 0.10 amplitude, a / mag 0.5 0.6 0.7 0.8 0.9 phase φ black = posterior red = prior

Parameter posterior PDFs: 2m1145a 0.00 0.02 0.04 0 10 20 30 40 0 5000 10000 0 50 100 150 0.06 0.02 0.02 0.06 0.00000 0.00015 0.00030 τ / hr b / mag c / mag 2 hr 1 0 50 100 150 black = posterior red = prior 0.04 0.00 0.04 0.00 0.04 0.08 µ[z 1 ] / mag V 1 2 [z 1 ] / mag

Parameter posterior PDFs: 2m1334 0 40 80 120 0 100 300 500 0.06 0.02 offset, b / mag 0.000 0.004 0.008 frequency, ν / hr 1 0.00 0.05 0.10 0.15 amplitude, a / mag 0 20 40 60 0 50 100 150 black = posterior red = prior 0.85 0.90 0.95 1.00 phase φ 0.005 0.015 0.025 standard deviation, ω / mag