Additional Keplerian Signals in the HARPS data for Gliese 667C from a Bayesian re-analysis

Similar documents
Detecting Extra-solar Planets with a Bayesian hybrid MCMC Kepler periodogram

Extra-solar Planets via Bayesian Fusion MCMC

arxiv:astro-ph/ v1 14 Sep 2005

SAMSI Astrostatistics Tutorial. More Markov chain Monte Carlo & Demo of Mathematica software

Bayesian re-analysis of the Gliese 581 exoplanet system

arxiv: v1 [astro-ph.sr] 2 Dec 2015

Bayesian Exoplanet tests of a new method for MCMC sampling in highly correlated model parameter spaces

SAMSI Astrostatistics Tutorial. Models with Gaussian Uncertainties (lecture 2)

arxiv: v1 [astro-ph.ep] 18 Feb 2009

Analysis of Radial Velocity Measurements

Date of delivery: 29 June 2011 Journal and vol/article ref: IAU Number of pages (not including this page): 5

CSC 2541: Bayesian Methods for Machine Learning

Bayesian search for other Earths

A Bayesian Analysis of Extrasolar Planet Data for HD 73526

A COMPARISON OF LEAST-SQUARES AND BAYESIAN FITTING TECHNIQUES TO RADIAL VELOCITY DATA SETS

Miscellany : Long Run Behavior of Bayesian Methods; Bayesian Experimental Design (Lecture 4)

Bayesian Model Selection & Extrasolar Planet Detection

EXONEST The Exoplanetary Explorer. Kevin H. Knuth and Ben Placek Department of Physics University at Albany (SUNY) Albany NY

Introduction to Bayesian Data Analysis

Statistical validation of PLATO 2.0 planet candidates

Convergence Diagnostics For Markov chain Monte Carlo. Eric B. Ford (Penn State) Bayesian Computing for Astronomical Data Analysis June 9, 2017

arxiv: v1 [astro-ph.ep] 5 Apr 2012

Hierarchical Bayesian Modeling

arxiv: v1 [astro-ph.ep] 21 Mar 2018

A BAYESIAN ANALYSIS OF EXTRASOLAR PLANET DATA FOR HD 73526

Indirect Methods: gravitational perturbation of the stellar motion. Exoplanets Doppler method

Markov chain Monte Carlo methods in atmospheric remote sensing

Frequency of Exoplanets Beyond the Snow Line from 6 Years of MOA Data Studying Exoplanets in Their Birthplace

Bayesian room-acoustic modal analysis

Evidence of a Neptune-sized Planet in the ρ 1 Cancri System

MASS DETERMINATIONS OF POPULATION II BINARY STARS

arxiv: v2 [astro-ph.ep] 8 Apr 2016

Astronomy 421. Lecture 8: Binary stars

Introduction to Bayesian Statistics and Markov Chain Monte Carlo Estimation. EPSY 905: Multivariate Analysis Spring 2016 Lecture #10: April 6, 2016

17 : Markov Chain Monte Carlo

Hierarchical Bayesian Modeling of Planet Populations

A Bayesian Approach to Phylogenetics

Markov Chain Monte Carlo methods

Outline. 1 Introduction. 2 Problem and solution. 3 Bayesian tracking model of group debris. 4 Simulation results. 5 Conclusions

A6523 Signal Modeling, Statistical Inference and Data Mining in Astrophysics Spring

Advanced Statistical Methods. Lecture 6

Monitoring the Behavior of Star Spots Using Photometric Data

Computational statistics

Delicious Diameters of Dwarfs (in particular, the juicy red ones)

Science Olympiad Astronomy C Division Event National Exam

QUANTIFYING THE UNCERTAINTY IN THE ORBITS OF EXTRASOLAR PLANETS

STA 4273H: Statistical Machine Learning

Statistical Methods in Particle Physics Lecture 1: Bayesian methods

27 : Distributed Monte Carlo Markov Chain. 1 Recap of MCMC and Naive Parallel Gibbs Sampling

Design reference mission construction for planet finders

Robotics. Lecture 4: Probabilistic Robotics. See course website for up to date information.

Infer relationships among three species: Outgroup:

Bayesian rules of probability as principles of logic [Cox] Notation: pr(x I) is the probability (or pdf) of x being true given information I

arxiv:astro-ph/ v1 10 Nov 2005

Introduction. 1. Processing chain

Bayesian Evidence and Model Selection: A Tutorial in Two Acts

A Beginner s Guide to MCMC

arxiv:astro-ph/ v2 4 May 2006

Monte Carlo in Bayesian Statistics

Copyright and Reuse: 2017 The Authors. Published by Oxford University Press on behalf of the Royal Astronomical Society.

A6523 Signal Modeling, Statistical Inference and Data Mining in Astrophysics Spring

Thomas Bayes versus the wedge model: An example inference using a geostatistical prior function

(5) Multi-parameter models - Gibbs sampling. ST440/540: Applied Bayesian Analysis

Probabilistic Graphical Models Lecture 17: Markov chain Monte Carlo

Published by University of Arizona Press. For additional information about this book. Sara Seager

Principles of Bayesian Inference

Observational Cosmology Journal Club

Learning in Bayesian Networks

Fast Likelihood-Free Inference via Bayesian Optimization

Large Scale Bayesian Inference

Hierarchical Bayesian Modeling

RV- method: disturbing oscilla8ons Example: F- star Procyon

Supplementary Note on Bayesian analysis

Reminder of some Markov Chain properties:

Signal Modeling, Statistical Inference and Data Mining in Astrophysics

Work of the LSC Pulsar Upper Limits Group (PULG) Graham Woan, University of Glasgow on behalf of the LIGO Scientific Collaboration

An Introduction to Reversible Jump MCMC for Bayesian Networks, with Application

The University of Auckland Applied Mathematics Bayesian Methods for Inverse Problems : why and how Colin Fox Tiangang Cui, Mike O Sullivan (Auckland),

PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 13: SEQUENTIAL DATA

Markov Chain Monte Carlo

Detection theory 101 ELEC-E5410 Signal Processing for Communications

Radial Velocity Detection of Planets: I. Techniques and Tools

Solving for the Orbital Elements of Binary Systems Using Markov Chain Monte Carlo Simulations

Computer Vision Group Prof. Daniel Cremers. 10a. Markov Chain Monte Carlo

Ways to make neural networks generalize better

Exoplanet Search Techniques: Overview. PHY 688, Lecture 28 April 3, 2009

MCMC Sampling for Bayesian Inference using L1-type Priors

CPSC 540: Machine Learning

Bayesian Methods and Uncertainty Quantification for Nonlinear Inverse Problems

Selection functions in doppler planet searches

arxiv: v3 [astro-ph] 16 Mar 2009

Dynamic Multipath Estimation by Sequential Monte Carlo Methods

imin...

Multimodal Nested Sampling

A6523 Signal Modeling, Statistical Inference and Data Mining in Astrophysics Spring

Rare Event Sampling using Multicanonical Monte Carlo

Approximate inference in Energy-Based Models

Bayesian Regression Linear and Logistic Regression

Approximate Bayesian Computation

Markov Chain Monte Carlo (MCMC) and Model Evaluation. August 15, 2017

Transcription:

Additional Keplerian Signals in the HARPS data for Gliese 667C from a Bayesian re-analysis Phil Gregory, Samantha Lawler, Brett Gladman Physics and Astronomy Univ. of British Columbia

Abstract A re-analysis of Gliese 667C HARPS radial velocity data was carried out with a Bayesian multi-planet Kepler periodogram (from 0 to 7 planets) based on a controlled fusion MCMC algorithm. The most probable number of signals detected is 6 with a Bayesian false alarm probability of 0.012. The 6 signals detected include two previously reported with periods of 7.2 and 28.1 days, plus additional periods of 30.8, 38.8, 53.2, and 91.3 days suggesting the possibility of further planets. Stellar diagnostic information suggest that the 53 day signal is the second harmonic of the stellar rotation period and is likely the result of surface activity. N-body simulations are underway to determine which of the remaining signals are consistent with a stable planetary system. At present, we have not found a long term (10 7 yr) stable 5 planet system consistent with the data. If we assume the 30.8 d period is a spectral artifact, we are able to identify a long term stable system with periods of 7.2, 28.1, 38.8, and 91.3 d. The corresponding M sin i values are 5.4, 4.8, 2.4, and 5.4 M E and the semi-major axes inferred for the 28 and 38.8 d signals place them in the central region of the habitable zone. Further analysis is underway to define the probability bubble of stable orbits corresponding to this candidate 4 planet system.

Prior information: the signals satisfy Kepler s laws The radial velocity equation, a nonlinear model f i = model prediction = V + K Hcos Hq @t i +cpd +wl + ecoswl outline V = systematic velocity K = velocity amplitude = 2 π asin i P 1 e 2 ; a = semi - major axis, i= inclination e = orbital eccentricity w = longitude of periastron c = fraction of orbit prior to data reference time that periastron occurred at q Ht i +cpl = true anomaly = angle of star in orbit at time t i relative to periastron q i and t i are related by the conservation of angular momentum equation 2 p t q @td - H1 + ecos@q P I1 - e 2 M 3ê2 DL2 = 0

Controlled Fusion MCMC The combination of nonlinear model, sparse sampling and huge prior period range of 0.5 d to 1000 yr, yields a highly multi-modal target distribution which is a problem for a straight Metropolis MCMC. To deal with this, Gregory has developed a new Markov chain Monte Carlo algorithm called controlled fusion MCMC. It incorporates: a) parallel tempering*, b) simulated annealing and, c) genetic crossover operations. Each of these features facilitate the detection of a global minimum in chi-squared in a multi-modal environment. By combining all three, the algorithm greatly increases the probability of realizing this goal. Fusion MCMC is a very general nonlinear model fitting method applicable to many other problems. * Also known as Exchange Monte Carlo (Hukushima & Nemoto 1996)

Controlled Fusion MCMC This fusion MCMC approach has been achieved through the development of a unique multi-stage adaptive control system, hence the term controlled fusion MCMC. Among other things the control system automates the tuning of MCMC proposal distributions for efficient exploration of the model parameter space even when the parameters are highly correlated. At each iteration, a single joint proposal to jump to a new location in the parameter space is generated. The algorithm is currently implemented in Mathematica using parallelized code and run on an 8 core PC.

Controlled Fusion MCMC 8 parallel tempering Metropolis chains Output at each iteration = 1.0 = 0.72 = 0.52 = 0.39 = 0.29 = 0.20 = 0.13 = 0.09 values Anneal independent Gaussian proposal s s Parallel tempering swap operations Refine & update Gaussian proposal s s parameters, logprior + b â loglike, logprior + loglike parameters, logprior + b â loglike, logprior + loglike parameters, logprior + b â loglike, logprior + loglike parameters, logprior + b â loglike, logprior + loglike parameters, logprior + b â loglike, logprior + loglike parameters, logprior + b â loglike, logprior + loglike parameters, logprior + b â loglike, logprior + loglike parameters, logprior + b â loglike, logprior + loglike Peak parameter set: If (logprior + loglike) > previous best by a threshold then update and reset burn-in Monitor for parameters with peak probability 2 stage proposal s control system error signal = (actual joint acceptance rate 0.25) Effectively defines burn-in interval MCMC adaptive control system Genetic algorithm Every 40 th iteration perform gene crossover operation to breed a more probable parameter set.

Controlled Fusion MCMC 8 parallel tempering Metropolis chains = 1.0 = 0.72 = 0.52 = 0.39 = 0.29 = 0.20 = 0.13 = 0.09 Parallel tempering swap operations Automatic proposal scheme that learns Output about parameter at each iteration correlations during burn-in (for each chain) parameters, logprior + b â loglike, logprior + loglike parameters, logprior + b â loglike, fromlogprior + loglike parameters, new logprior + b â loglike, proposal logprior C+ loglike parameters, parameter logprior + b â loglike, logprior + loglike parameters, valuelogprior + b â loglike, fromlogprior + loglike parameters, logprior + b â loglike, proposal logprior I + loglike parameters, repeat logprior + b â loglike, logprior + loglike parameters, logprior + b â loglike, logprior + loglike values Monitor for Add every 2 Peak parameter set: nd latest parameters to a buffer 300 values If (logprior + loglike) > with peak Refine & update probability Anneal I proposals Gaussian C proposals previous best by a Gaussian threshold then update Independent proposal s s Proposal distribution difference proposal s s and reset burn-in multiply by Gaussian proposal with built in parameter of random constant <1 scheme 2 stage employed proposal s control correlations system used pairs 50% of the time error signal 50% = of the time Genetic algorithm (actual joint acceptance rate 0.25) During Ever 10 th burn-in iteration control perform system gene Effectively defines burn-in interval adjusts crossover constant operation so to acceptance breed larger rate (logprior from + C loglike) proposals parameter = 25 set. % MCMC adaptive control system

Data Model Prior Bayes Theorem p X D, M, I Target posterior If we input a Kepler model the fusion MCMC becomes A Kepler periodogram Optimum for finding Kepler orbits and evaluating their probabilities. Capable of simultaneously fitting multiple planet models. A multi-planet Kepler periodogram Multiple Planets model = sum of multiple independent Keplerian orbits.

Gliese 667C 3 rd isolated member of triple star system distance = 22.1 ly HARPS data 1) 2011, History Planet b with M sin i = 5.9 M Earth + two other interesting periods at 90 & 28d (habitable zone orbit) Bonfils et al., arxiv:1111.5019v2 Velocity (ms -1 ) 2) 2012, Anglada et al. & Delfosse et al., Confirm planet b & report planet c 28d, 4.3 M Earth (HZ)

Gliese 667C 2 Keplerian FMCMC model fit 2 nd period exhibits multiple peaks Arrow indicates starting periods

Gliese 667C 2 Keplerian FMCMC model fit Arrow indicates starting periods

Gliese 667C 3 Keplerian signal FMCMC model fit Additional periods detected in other 3 signal FMCMC runs 38.8, 87, 106, & 128 d. In all cases signals were detected at 7.2 and 28.1 d.

Questions and complications How to decide when to stop adding more Keplerian signals? For a Bayesian -> use model selection and compute the Bayes factor. => Favors the 6 signal model over nearest rival by a factor of 137 Star spots can also give rise to a Keplerian like signal. Complication: If there are additional signals beyond what is assumed in the model, they will lead to correlated or colored noise. Can investigate this by computing the autocorrelation function of the model residuals. => 6 signal model residuals consistent with white noise while models with fewer signals exhibit correlated noise.

Autocorrelation function of residuals where x i is the i th residual, j is the lag and x is the mean of the samples in the overlap region. Because the data are not uniformly sampled, for each lag all sample pairs that differed in time by this lag ±0.1 d were utilized. 1 signal residuals P = 7.2 d 6 signal residuals P = 7.2, 28.1,30.8, 38.8, 53 91 d 3 signal residuals P = 7.2, 28.1, 184 d The solid red curve in the 1signal residuals is the average autocorrelation generated from 400 simulated data sets of a 5 signal model (28.1, 30.8, 38.8, 53.2, & 91 d periods) together with the quoted measurement errors.

Gliese 667C 6 Keplerian signal FMCMC model fit

Periodogram of the 6 signal residuals A generalized Lomb-Scargle (GLS) periodogram (Zechmeister & Kurster 2009) for the maximum a posteriori (MAP) parameter values of the 6 signal fit residuals. The GLS allows for a floating offset and weights. The dashed horizontal lines correspond to peak periodogram levels for which the frequentist false alarm probability (FAP) would = 0.1 & 0.01. No evidence of any significant peaks or red noise in these residuals.

Gl 667C Subset of HARPS data (1 st 143 velocities) Includes burn-in phase to show aliases Upper and lower one year alias of 38.8d.

Are any of our 6 signals spectral artifacts? Periodogram of a stellar activity diagnostic, the FWHM of the cross correlation function (Delfosse et al. 2012). Delfosse et al. (2012) conclude the star s rotation P ~105 days, from a periodogram of a stellar activity diagnostic. At various stages in our RV analysis a P = 106 d was detected. In our preferred 6 signal model, a P = 53 d (2 nd harmonic) signal was detected. Consider this signal an artifact of surface activity.

Further analysis in progress 1. Modified likelihood in Fusion MCMC to incorporate n-body simulation for an n planet model (for n = 3,4,5) plus m additional Keplerian-like artifact signals (m = 3,2,1) Uses a 4 th order Hermite n-body integrator with automatic step size. 2. Check the long term stability (10 7 yr) of n planet model (for n = 3,4,5) with FMCMC parameters that provides a good fit to the data. => 4 of the signals (7.2, 28.1, 38.8, 91 d) are consistent with a stable candidate 4 planet system.

Conclusions 1. Our results suggest there are 6 interesting signals in the HARPS data for Gliese 667C. One is likely an artifact of surface activity. 2. The 6 signal residuals are consistent with white noise but residuals with fewer than 6 signals are correlated (colored noise). 3. Further analysis is underway to determine which of the remaining signals are consistent with a stable planetary system. 4. Our Bayesian approach is helping to raise the bar on how much useful information can be extracted from the radial velocity data. 5. Experience gained while working with the exoplanet data sets has led to the development by Gregory of a Controlled Fusion MCMC nonlinear model fitting algorithm, which can be applied to many other problems.

Gliese 667C 4 planet FMCMC model fit outline Additional periods detected in other FMCMC runs 38.8 & 91 d.

Gliese 667C 5 planet FMCMC model fit outline Additional periods detected in other FMCMC runs were 30.8, 38.8, 87, 106, 128, & 184 d. In all 5 signal runs P = 7.2, 28.1, 53, & 91 d were detected.