Statistical Debugging with Latent Topic Models

Size: px

Start display at page:

Download "Statistical Debugging with Latent Topic Models"

Isabel Cook
5 years ago
Views:

1 Statistical Debugging with Latent Topic Models David Andrzejewski, Anne Mulhern, Ben Liblit, Xiaojin Zhu Department of Computer Sciences University of Wisconsin Madison European Conference on Machine Learning, 2007 (UW Madison) Topic-based statistical debugging ECML / 21

2 Outline 1 Statistical Debugging 2 Our Approach 3 Our Results (UW Madison) Topic-based statistical debugging ECML / 21

5 Debugging Debugging as machine learning (UW Madison) Topic-based statistical debugging ECML / 21

6 Program Runs as Documents Predicates Vocabulary Predicate counts Word counts Program run Bag-of-words document Debugging Latent topic analysis (UW Madison) Topic-based statistical debugging ECML / 21

7 Program Runs as Documents Predicates Vocabulary Predicate counts Word counts Program run Bag-of-words document Debugging Latent topic analysis (UW Madison) Topic-based statistical debugging ECML / 21

8 Program Runs as Documents Predicates Vocabulary Predicate counts Word counts Program run Bag-of-words document Debugging Latent topic analysis (UW Madison) Topic-based statistical debugging ECML / 21

9 Program Runs as Documents Predicates Vocabulary Predicate counts Word counts Program run Bag-of-words document Debugging Latent topic analysis (UW Madison) Topic-based statistical debugging ECML / 21

10 Program Runs as Documents Predicates Vocabulary Predicate counts Word counts Program run Bag-of-words document Debugging Latent topic analysis (UW Madison) Topic-based statistical debugging ECML / 21

11 Hidden topics Word counts (observed) Weighted latent topics (hidden) doc 1 doc 2 doc 3 θ 1 φ 1 + θ 2 φ 2 + θ 3 φ 3 + θ 4 φ 4 θ 1 φ 1 + θ 2 φ 2 + θ 3 φ 3 + θ 4 φ 4 θ 1 φ 1 + θ 2 φ 2 + θ 3 φ 3 + θ 4 φ 4 (UW Madison) Topic-based statistical debugging ECML / 21

12 Hidden topics Word counts (observed) doc 1 doc 2 doc 3 Weighted latent topics (hidden) φ 3, φ 4 = bug topics θ 1 φ 1 + θ 2 φ 2 + θ 3 φ 3 + θ 4 φ 4 θ 1 φ 1 + θ 2 φ 2 + θ 3 φ 3 + θ 4 φ 4 θ 1 φ 1 + θ 2 φ 2 + θ 3 φ 3 + θ 4 φ 4 (UW Madison) Topic-based statistical debugging ECML / 21

13 Uncover Hidden Bugs Goals Uncover hidden bugs NOT predicting whether a single run will fail or not It will do that all on its own... Assumptions Few hidden bugs many failing runs 1 bug per failing run (UW Madison) Topic-based statistical debugging ECML / 21

14 Uncover Hidden Bugs Goals Uncover hidden bugs NOT predicting whether a single run will fail or not It will do that all on its own... Assumptions Few hidden bugs many failing runs 1 bug per failing run (UW Madison) Topic-based statistical debugging ECML / 21

15 Latent Topic Modeling Debugging Predicates Program runs Bug patterns Active bug patterns Topic modeling Words Documents Topics Topic weights (UW Madison) Topic-based statistical debugging ECML / 21

16 Latent Topic Modeling Debugging Predicates Program runs Bug patterns Active bug patterns Topic modeling Words Documents Topics Topic weights (UW Madison) Topic-based statistical debugging ECML / 21

17 Latent Topic Modeling Debugging Predicates Program runs Bug patterns Active bug patterns Topic modeling Words Documents Topics Topic weights (UW Madison) Topic-based statistical debugging ECML / 21

18 Latent Dirichlet Allocation (LDA) β u φ u Nu w z Nd θ D α s For each topic t φ t Dir(β) For each doc d θ d Dir(α) For each word w Topic z θ d w φ z (UW Madison) Topic-based statistical debugging ECML / 21

19 Latent Dirichlet Allocation (LDA) β u φ u Nu w z Nd θ D α s For each topic t φ t Dir(β) For each doc d θ d Dir(α) For each word w Topic z θ d w φ z (UW Madison) Topic-based statistical debugging ECML / 21

20 Latent Dirichlet Allocation (LDA) β u φ u Nu w z Nd θ D α s For each topic t φ t Dir(β) For each doc d θ d Dir(α) For each word w Topic z θ d w φ z (UW Madison) Topic-based statistical debugging ECML / 21

21 Latent Dirichlet Allocation (LDA) β u φ u Nu w z Nd θ D α s For each topic t φ t Dir(β) For each doc d θ d Dir(α) For each word w Topic z θ d w φ z (UW Madison) Topic-based statistical debugging ECML / 21

22 Latent Dirichlet Allocation (LDA) β u φ u Nu w z Nd θ D α s For each topic t φ t Dir(β) For each doc d θ d Dir(α) For each word w Topic z θ d w φ z (UW Madison) Topic-based statistical debugging ECML / 21

23 LDA motivation LDA cannot recover bug patterns Strong non-bug patterns also present Most runs successful Usage patterns explain behavior Some runs fail Usage patterns mostly explain behavior Usage patterns overwhelm bug patterns (UW Madison) Topic-based statistical debugging ECML / 21

24 LDA motivation LDA cannot recover bug patterns Strong non-bug patterns also present Most runs successful Usage patterns explain behavior Some runs fail Usage patterns mostly explain behavior Usage patterns overwhelm bug patterns (UW Madison) Topic-based statistical debugging ECML / 21

25 LDA motivation LDA cannot recover bug patterns Strong non-bug patterns also present Most runs successful Usage patterns explain behavior Some runs fail Usage patterns mostly explain behavior Usage patterns overwhelm bug patterns (UW Madison) Topic-based statistical debugging ECML / 21

26 LDA motivation LDA cannot recover bug patterns Strong non-bug patterns also present Most runs successful Usage patterns explain behavior Some runs fail Usage patterns mostly explain behavior Usage patterns overwhelm bug patterns (UW Madison) Topic-based statistical debugging ECML / 21

27 Toy Dataset (UW Madison) Topic-based statistical debugging ECML / 21

28 Toy Dataset (UW Madison) Topic-based statistical debugging ECML / 21

29 Toy Dataset (UW Madison) Topic-based statistical debugging ECML / 21

30 Toy Dataset (UW Madison) Topic-based statistical debugging ECML / 21

31 LDA β u b β φ u Nu b φ N b w z Nd o θ D α s f α w z θ φ u β u, α s o φ b β b, α f words topics topic weights usage topic multinomials hyperparameters outcome flag bug topic multinomials hyperparameters (UW Madison) Topic-based statistical debugging ECML / 21

32 LDA β u b β φ u Nu b φ N b w z Nd o θ D α s f α w z θ φ u β u, α s o φ b β b, α f words topics topic weights usage topic multinomials hyperparameters outcome flag bug topic multinomials hyperparameters (UW Madison) Topic-based statistical debugging ECML / 21

33 LDA β u β b φ u Nu φ b N b w z Nd o θ D α s α f w z θ φ u β u, α s o φ b β b, α f words topics topic weights usage topic multinomials hyperparameters outcome flag bug topic multinomials hyperparameters (UW Madison) Topic-based statistical debugging ECML / 21

34 LDA β u β b φ u Nu φ b Nb w z Nd o θ D α s α f w z θ φ u β u, α s o φ b β b, α f words topics topic weights usage topic multinomials hyperparameters outcome flag bug topic multinomials hyperparameters (UW Madison) Topic-based statistical debugging ECML / 21

35 LDA Successful run α s = [ ] Failing run α f = [ ] (UW Madison) Topic-based statistical debugging ECML / 21

LDA Successful run α s = [1 1 1 0 0] Failing run α f = [1 1 1 1 1] Successful

36 LDA Successful run α s = [ ] Failing run α f = [ ] Successful run p(θ α s ) (UW Madison) Topic-based statistical debugging ECML / 21

37 LDA Successful run α s = [ ] Failing run α f = [ ] Failing run p(θ α f ) (UW Madison) Topic-based statistical debugging ECML / 21

38 Inference Need to calculate posterior p(z w, o) Use z to estimate Topic multinomials φ for each topic Topic weights θ for each document Collapsed Gibbs Sampling Uses easily obtainable counts Efficient p(z k = i z k, w, o) ( n i k,jk + β i j k n i k, + W j βi j ) ( n d k k,i + α o k i n d k k, + N u+n b i α o k i ) (UW Madison) Topic-based statistical debugging ECML / 21

39 Inference Need to calculate posterior p(z w, o) Use z to estimate Topic multinomials φ for each topic Topic weights θ for each document Collapsed Gibbs Sampling Uses easily obtainable counts Efficient p(z k = i z k, w, o) ( n i k,jk + β i j k n i k, + W j βi j ) ( n d k k,i + α o k i n d k k, + N u+n b i α o k i ) (UW Madison) Topic-based statistical debugging ECML / 21

40 Group failing runs by bug Run 1 θ.. Run n θ (UW Madison) Topic-based statistical debugging ECML / 21

41 Results grep moss bug1 runs 144 bug2 runs topic topic 6 PCA PCA PCA bug1 runs 106 bug3 runs 147 bug4 runs 329 bug5 runs 206 bug8 runs 186 other runs (UW Madison) Topic-based statistical debugging ECML / 21

42 Results Rand index wrt ground truth LDA [1] [2] exif grep gzip moss Baselines 1 Statistical Debugging: Simultaneous Isolation of Multiple Bugs, Zheng et al ICML 06 2 Scalable Statistical Bug Isolation, Liblit et al PLDI 05 (UW Madison) Topic-based statistical debugging ECML / 21

43 Top predicates for each bug 1 Use φ to calculate p(z w) 2 For each bug topic z, sort predicates w by p(z w) Top predicates for exif bug topic 9 Rank p(z w) Predicate w Expert opinion jpeg-data.c:434 Direct result jpeg_data_set_exif_data() jpeg-data.c:436 Direct result jpeg_data_set_exif_data() jpeg-data.c:207 Smoking gun jpeg_data_load_data() (UW Madison) Topic-based statistical debugging ECML / 21

44 Top predicates for each bug 1 Use φ to calculate p(z w) 2 For each bug topic z, sort predicates w by p(z w) Top predicates for exif bug topic 9 Rank p(z w) Predicate w Expert opinion jpeg-data.c:434 Direct result jpeg_data_set_exif_data() jpeg-data.c:436 Direct result jpeg_data_set_exif_data() jpeg-data.c:207 Smoking gun jpeg_data_load_data() (UW Madison) Topic-based statistical debugging ECML / 21

45 Conclusion Debugging can be a machine learning problem LDA overcomes problems with standard LDA LDA successful on real-world data Other possible uses of LDA (eg, sentiment analysis) Acknowledgements NLM Training Grant 5T15LM07359, NSF Grant CCF , AFOSR Grant FA (UW Madison) Topic-based statistical debugging ECML / 21

46 LDA Debugging LDA Words Documents Usage topics Bug topics Bug topic documents Topic weights Debugging Predicates Program runs Usage patterns Bug patterns Failing runs Active usage/bug patterns (UW Madison) Topic-based statistical debugging ECML / 21

47 Debugging Dataset Runs Program Lines of Code Bugs Successful Failing exif 10, grep 15, gzip 8, moss 35, Topics Program Word Types Usage Bug exif grep 2, gzip 3, moss 1, (UW Madison) Topic-based statistical debugging ECML / 21

48 Joint distribution p(w, z) = p(w z)p(z) p(w z) = p(z) = (UW Madison) Topic-based statistical debugging ECML / 21

49 Joint distribution p(w, z) = p(w z)p(z) N p(w z) = p(φ i β) i dφ i p(z) = (UW Madison) Topic-based statistical debugging ECML / 21

50 Joint distribution p(w, z) = p(w z)p(z) N p(w z) = p(φ i β) p(z) = i D d p(θ d α) dφ i dθ d (UW Madison) Topic-based statistical debugging ECML / 21

51 Joint distribution p(w, z) = p(w z)p(z) N p(w z) = p(φ i β) p(z) = i D d p(θ d α) N i θ di n d i dφ i dθ d (UW Madison) Topic-based statistical debugging ECML / 21

52 Joint distribution p(w, z) = p(w z)p(z) N W n p(w z) = p(φ i β) φ j i ij dφ i p(z) = i D p(θ d α) d i j N θ di n d i dθ d (UW Madison) Topic-based statistical debugging ECML / 21

Topic Models. Charles Elkan November 20, 2008

Topic Models. Charles Elkan November 20, 2008 Topic Models Charles Elan elan@cs.ucsd.edu November 20, 2008 Suppose that we have a collection of documents, and we want to find an organization for these, i.e. we want to do unsupervised learning. One