Sequential Monte Carlo Algorithms


1 Bayesian Phylogenetic Inference using Sequential Monte Carlo Algorithms. Alexandre Bouchard-Côté*, Sriram Sankararaman*, and Michael I. Jordan*,†. *Computer Science Division, University of California Berkeley; †Department of Statistics, University of California Berkeley

2 Phylogenetic tree inference. Topic of this talk: integration over the space of trees using Sequential Monte Carlo (SMC). Motivation: the Bayesian approach to phylogenetic inference: put a prior on trees, use the posterior for reconstruction. Heavy use of integrals over the space of trees, e.g. for handling nuisance parameters, computing minimum-risk estimators, Bayes factors, etc. Prelude: a parallel with the simpler problem of maximization over the space of trees.

3 Maximization over phylogenies. Two strategies: local and sequential search. Key difference: representation. Local strategy: a state t is a tree over the observed species.

4 Maximization: local strategy. Meta-algorithm: 1. Start at an arbitrary state.

5 Maximization: local strategy. Meta-algorithm: 1. Start at an arbitrary state. 2. Iterate: i. Evaluate neighbors; ii. Move to a nearby tree.

6 Maximization: local strategy. Meta-algorithm: 1. Start at an arbitrary state. 2. Iterate: i. Evaluate neighbors; ii. Move to a nearby tree. 3. Return the best state visited. Example: stochastic annealing.
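A minimal Python sketch of this local-search meta-algorithm in the annealing style the slide names; the neighbors and score callbacks are hypothetical stand-ins (for phylogenies, e.g. NNI moves and log-likelihood), and the cooling schedule is an assumption:

    import math
    import random

    def local_search(init_state, neighbors, score, n_iters=1000, temp0=1.0):
        """Local-strategy meta-algorithm: start anywhere, repeatedly move
        to a nearby state, return the best state visited."""
        state = init_state
        best, best_score = state, score(state)
        for i in range(n_iters):
            temp = temp0 / (1.0 + i)                 # simple cooling schedule
            cand = random.choice(neighbors(state))   # evaluate a neighbor
            delta = score(cand) - score(state)
            # always accept improvements; accept worsenings with annealed prob.
            if delta >= 0 or random.random() < math.exp(delta / temp):
                state = cand
                if score(state) > best_score:
                    best, best_score = state, score(state)
        return best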

7 Maximization over phylogenies. Two strategies: local and sequential search. Key difference: representation. Local strategy: a state t is a tree over the observed species. Sequential strategy: a partial state p is a forest over the observed species.

9 Maximization: sequential strategy. Meta-algorithm: 1. Start at the initial, unconstrained partial state (a forest of disconnected leaves).

10 Maximization: sequential strategy. Meta-algorithm: 1. Start at the initial, unconstrained partial state. 2. Iterate: i. Extend the partial state; ii. Estimate the best successor.

11 Maximization: sequential strategy. Meta-algorithm: 1. Start at the initial, unconstrained partial state. 2. Iterate: i. Extend the partial state; ii. Estimate the best successor. 3. Return the best final state. Example: neighbor joining.
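For contrast, a minimal sketch of the sequential meta-algorithm under the same caveats; extend and best_successor are hypothetical callbacks (with pairwise-distance scoring, the greedy choice plays the role it does in neighbor joining):

    def sequential_search(leaves, extend, best_successor):
        """Sequential-strategy meta-algorithm: start from the unconstrained
        partial state (a forest of singleton trees) and greedily extend
        until a single tree remains."""
        forest = [(leaf,) for leaf in leaves]        # initial partial state
        while len(forest) > 1:
            candidates = extend(forest)              # all one-step successors
            forest = best_successor(candidates)      # estimate best successor
        return forest[0]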

12 Parallel: classification of phylogenetic algorithms by strategy. Local strategy vs. sequential strategy. Maximization: stochastic annealing (local), neighbor-joining (sequential). Integration: ?

13 Parallel: classification of phylogenetic algorithms by strategy. Local strategy vs. sequential strategy. Maximization: stochastic annealing (local), neighbor-joining (sequential). Integration: MCMC algorithms (local), ??? (sequential).

14 Parallel: classification of phylogenetic algorithms by strategy. Local strategy vs. sequential strategy. Maximization: stochastic annealing (local), neighbor-joining (sequential). Integration: MCMC algorithms (local), Sequential Monte Carlo (SMC) (sequential).

15 Outline. Background: importance sampling and Sequential Monte Carlo. SMC for phylogenetic inference. A framework for designing proposals. Experiments: comparisons with MCMC.

16 Preview: comparative advantages. SMC: + trivial to parallelize; + easier to get a data likelihood estimate; + no burn-in. MCMC: + easier to resample hyper-parameters; + easier to design proposal distributions.

17 Preview: comparative advantages. SMC: + trivial to parallelize; + easier to get a data likelihood estimate; + no burn-in. MCMC: + easier to resample hyper-parameters; + easier to design proposal distributions. Not exclusive: the two approaches can be combined.

18 Phylogenetic setup: ultrametric trees. [Figure: a rooted tree T; hidden sequences X sit at the root and internal nodes, observed sequences Y = y at the leaves.]

19 Phylogenetic setup: ultrametric trees. [Same figure, with the tree's height annotated; in an ultrametric tree all leaves are equidistant from the root.]

20 Phylogenetic setup: ultrametric trees. Target distribution: T | (Y = y) ~ π, with density π(t) = γ(t)/Z.

21 Phylogenetic setup: ultrametric trees. γ(t) is the joint density evaluated at (t, y), summing over the hidden x. Target distribution: T | (Y = y) ~ π, with density π(t) = γ(t)/Z.

22 Phylogenetic setup: ultrametric trees. γ(t) is the joint density evaluated at (t, y), summing over the hidden x. Z = P(Y = y) is the data likelihood (intractable). Target distribution: T | (Y = y) ~ π, with density π(t) = γ(t)/Z.
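Concretely, γ(t) multiplies the tree prior by the probability of the observed leaf sequences with the hidden X summed out. A minimal sketch of that marginalization via Felsenstein pruning, assuming a Jukes-Cantor substitution model for concreteness (the talk's real-data experiments use Brownian motion on frequencies instead):

    import numpy as np

    STATES = "ACGT"

    def jc_transition(t):
        """Jukes-Cantor transition matrix for a branch of length t."""
        same = 0.25 + 0.75 * np.exp(-4.0 * t / 3.0)
        diff = 0.25 - 0.25 * np.exp(-4.0 * t / 3.0)
        return np.where(np.eye(4, dtype=bool), same, diff)

    def pruning(node, site):
        """P(observed leaves below node | state at node), summing over
        hidden states. A node is ('leaf', sequence) or
        ('internal', (child, branch_length), (child, branch_length))."""
        if node[0] == "leaf":
            vec = np.zeros(4)
            vec[STATES.index(node[1][site])] = 1.0
            return vec
        _, (left, bl), (right, br) = node
        return (jc_transition(bl) @ pruning(left, site)) * \
               (jc_transition(br) @ pruning(right, site))

    def gamma(tree, n_sites, prior=1.0):
        """gamma(t) = prior(t) * P(Y = y | t), hidden X marginalized out."""
        lik = 1.0
        for s in range(n_sites):
            lik *= 0.25 * pruning(tree, s).sum()   # uniform root distribution
        return prior * lik

    # Example: two leaves joined at the root.
    t = ("internal", (("leaf", "AC"), 0.1), (("leaf", "AG"), 0.2))
    print(gamma(t, n_sites=2))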

23 Sequential Monte Carlo (SMC). Background: importance sampling (IS). IS approximation for π: 1. Sample trees from a proposal q: t_i ~ q.

24 Sequential Monte Carlo (SMC). Background: importance sampling (IS). IS approximation for π: 1. Sample trees from a proposal q: t_i ~ q. 2. Compute weights w_i = γ(t_i)/q(t_i). 3. Normalize the weights.

25 Sequential Monte Carlo (SMC). Background: importance sampling (IS). IS approximation for π: 1. Sample trees from a proposal q: t_i ~ q. 2. Compute weights w_i = γ(t_i)/q(t_i). 3. Normalize the weights. Each weighted tree (t_i, w_i) is a "particle".
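A minimal sketch of steps 1-3, with gamma, propose, and q_density as hypothetical callbacks over trees:

    def importance_sample(gamma, propose, q_density, n):
        """Plain importance sampling: draw t_i ~ q, weight by
        w_i = gamma(t_i) / q(t_i), then normalize. The mean of the raw
        weights is also an unbiased estimate of Z = P(Y = y)."""
        particles = [propose() for _ in range(n)]
        raw = [gamma(t) / q_density(t) for t in particles]
        total = sum(raw)
        weights = [w / total for w in raw]       # normalized weights
        return particles, weights, total / n     # particles, weights, Z-hat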

26 Sequential Monte Carlo (SMC). Background: the problem with importance sampling: π is high-dimensional, so most particles will have tiny weights.

27 Sequential Monte Carlo (SMC). Background: importance sampling goes in a single step from the proposal q to π = γ/Z.

28 Sequential Monte Carlo (SMC). Background: importance sampling goes in a single step from the proposal q to π = γ/Z. SMC instead uses a sequence of proposals through intermediate targets: q → π_1 → π_2 → ... → π_R = π = γ/Z.

29 Sequential Monte Carlo (SMC). Background: importance sampling goes in a single step from the proposal q to π = γ/Z. SMC instead uses a sequence of proposals through intermediate targets: q → π_1 → π_2 → ... → π_R = π = γ/Z. SMC for phylogenies: the π_r are distributions over partial states (forests).

30 Sequential Monte Carlo (SMC). 1. Initialize: the particle population starts at the initial (empty) partial state; the first target is π_1.

31 Sequential Monte Carlo (SMC). SMC approximation for π. 1. Initialize. 2. Iterate: i. Sample partial states p_i from the proposal q, moving from π_1 toward π_2.

32 Sequential Monte Carlo (SMC). [Build of the previous slide: the sampled partial states p_1, p_2, p_3 are shown as forests.]

33 Sequential Monte Carlo (SMC). [Build: the particle population p_1, p_2, p_3 after sampling.]

34 Sequential Monte Carlo (SMC). SMC approximation for π. 1. Initialize. 2. Iterate: i. Sample partial states p_i' ~ q(· | p_i); ii. Compute weights w_i = γ(p_i') / (γ(p_i) · q(p_i' | p_i)); iii. Normalize the weights.

35 Sequential Monte Carlo (SMC). SMC approximation for π. 1. Initialize. 2. Iterate: i. Sample partial states p_i' ~ q(· | p_i); ii. Compute weights w_i = γ(p_i') / (γ(p_i) · q(p_i' | p_i)); iii. Normalize the weights.
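Putting the loop together, a minimal generic SMC sketch; init, extend, q_density, and gamma are hypothetical callbacks (gamma of the initial state is 1 by convention), and the multinomial resampling at the end of each iteration is the standard SMC step that keeps the weights from degenerating, left implicit on the slides:

    import random

    def smc(n_particles, n_ranks, init, extend, q_density, gamma):
        """SMC over partial states: at each rank, extend every particle,
        reweight by w = gamma(p') / (gamma(p) * q(p' | p)), normalize,
        and resample."""
        particles = [init() for _ in range(n_particles)]
        for _ in range(n_ranks):
            proposed, raw = [], []
            for p in particles:
                p_new = extend(p)                # sample p' ~ q(. | p)
                raw.append(gamma(p_new) / (gamma(p) * q_density(p_new, p)))
                proposed.append(p_new)
            total = sum(raw)
            weights = [w / total for w in raw]
            # multinomial resampling keeps the population well spread
            particles = random.choices(proposed, weights=weights, k=n_particles)
        return particles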

36 Intuition: why it works. Basic result: SMC is asymptotically consistent. Along a path of partial states ∅ → p → p' → p'', the incremental weights are w = γ(p) / q(p | ∅), w' = γ(p') / (γ(p) · q(p' | p)), and w'' = γ(p'') / (γ(p') · q(p'' | p')).

37 Intuition: why it works. Basic result: SMC is asymptotically consistent. Along a path ∅ → p → p' → p'', the incremental weights w, w', w'' (as above) have a product that telescopes: w · w' · w'' = γ(p'') / [q(p | ∅) · q(p' | p) · q(p'' | p')].

39 Intuition: why it works. Compare the weight along an SMC path, w · w' · w'' = γ(p'') / q(∅ → p''), with the importance sampling weight, w = γ(t)/q(t).
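Written out, the telescoping behind this comparison (a math sketch in LaTeX, using the convention γ(∅) = 1):

    \begin{align*}
    w \cdot w' \cdot w''
      &= \frac{\gamma(p)}{q(p \mid \emptyset)}
         \cdot \frac{\gamma(p')}{\gamma(p)\, q(p' \mid p)}
         \cdot \frac{\gamma(p'')}{\gamma(p')\, q(p'' \mid p')} \\
      &= \frac{\gamma(p'')}{q(p \mid \emptyset)\, q(p' \mid p)\, q(p'' \mid p')}
    \end{align*}

so the product of incremental weights is exactly an importance-sampling weight γ(t)/q(t), with the proposal density read over the whole path.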

40 Designing a proposal q. Issue: over-counting. [Figure: two ways of building t1, but only one way of building t2.]

41 Designing a proposal q. Useful abstraction: q induces a partial order (poset) P.

42 Designing a proposal q. Useful abstraction: q induces a partial order (poset) P: p1 ≤ p2 if q can propose a path from p1 to p2.

43 Designing a proposal q. Useful abstraction: q induces a partial order (poset) P. [Figure: the poset's Hasse diagram.]

44 Designing a proposal q. Useful abstraction: q induces a partial order (poset) P. [Figure: the poset's Hasse diagram.] Use proposals that have tree-shaped Hasse diagrams.

45 Designing a proposal q. Example: a proposal that has a tree-shaped Hasse diagram. 1. Pick a pair of trees to merge, uniformly at random. 2. Pick a height for the new tree that is greater than the height of any tree built so far, so that merge heights increase along the proposal path.
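A minimal sketch of this merge proposal in Python; representing each tree as a (height, children) pair and drawing the height increment from an exponential are my assumptions, since the slide only constrains the ordering of heights:

    import random

    def propose_merge(forest, rate=1.0):
        """One proposal step: join two trees, picked uniformly at random,
        under a new root placed strictly above every tree built so far,
        so merge heights increase along the path and each forest can be
        built only one way. Trees are (height, children) pairs; leaves
        are (0.0, label)."""
        i, j = random.sample(range(len(forest)), 2)
        t1, t2 = forest[i], forest[j]
        floor = max(t[0] for t in forest)              # current tallest tree
        new_height = floor + random.expovariate(rate)  # assumed increment law
        merged = (new_height, (t1, t2))
        rest = [t for k, t in enumerate(forest) if k not in (i, j)]
        return rest + [merged]

    # Example: four singleton leaves, two merge steps.
    forest = [(0.0, "human"), (0.0, "chimp"), (0.0, "mouse"), (0.0, "rat")]
    forest = propose_merge(propose_merge(forest))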

46 Designing a proposal q. [Figure: the merge proposal applied to trees t1 and t2; the new tree's height exceeds the existing heights.]

47 Experiments: setup. Datasets: Synthetic-small, Synthetic-med, Real data. Source: generated from the model (synthetic) vs. a subset of HGDP (real). Likelihood model: Brownian motion on frequencies. Number of sites: ...,511 for the real data [leading digits missing]. Number of nodes and number of leaves: [values missing].

48 Synthetic experiments. Goal: comparison against MCMC. Competitor: a standard MCMC sampler with 4 tempering chains and a shared sum-product implementation. Metric: symmetric clade difference of the minimum Bayes risk reconstructed tree to the generating tree. Datapoints are computed by increasing the number of particles (for SMC) and the number of sampling steps (for MCMC).
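For reference, a minimal sketch of the symmetric clade difference used as the metric (trees as nested tuples of leaf names; singleton clades are ignored):

    def clades(tree):
        """Collect the non-trivial clades (sets of leaf names) of a tree
        given as nested tuples, e.g. ((("a", "b"), "c"), "d")."""
        found = set()
        def walk(node):
            if isinstance(node, str):
                return frozenset([node])
            leaves = frozenset().union(*(walk(child) for child in node))
            found.add(leaves)
            return leaves
        walk(tree)
        return found

    def symmetric_clade_difference(t1, t2):
        """Number of clades present in exactly one of the two trees."""
        return len(clades(t1) ^ clades(t2))

    # Example: distance between two four-leaf trees (prints 4).
    print(symmetric_clade_difference((("a", "b"), ("c", "d")),
                                     ((("a", "c"), "b"), "d")))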

49 Comparison with MCMC: Synthetic-small. [Plot: symmetric clade difference vs. wall-clock time (ms, log scale) for SMC and MCMC.]

50 Comparison with MCMC: Synthetic-medium. [Plot: symmetric clade difference vs. wall-clock time (ms, log scale) for SMC and MCMC.]

51 Experiments on real data. Goal: show that the method scales to a large number of sites. The number of particles (10,000) was determined using synthetic experiments. Timing experiments with different numbers of cores: [Plot: wall-clock time (ms) vs. number of cores.]

52 Conclusion. SMC can be applied to a wide range of phylogenetic models; previous work was limited to coalescent priors [Teh et al. 07]. An order-theoretic framework for designing proposals. Experiments: there are regimes where SMC outperforms MCMC. Promising applications of SMC in phylogenetic inference: 1. quickly analyzing large datasets; 2. initialization and large-step proposals for MCMC chains.
