Bayesian Nonparametric Models for Ranking Data
|
|
- Cody Mosley
- 5 years ago
- Views:
Transcription
1 Bayesian Nonparametric Models for Ranking Data François Caron 1, Yee Whye Teh 1 and Brendan Murphy 2 1 Dept of Statistics, University of Oxford, UK 2 School of Mathematical Sciences, University College Dublin, Ireland BNPSki, January 2014 Caron, Teh & Murphy 1 / 23
2 Ranked Data Rank these 5 movies by preference Rank your 5 favourite movies among these 30 Rank your 5 favourite movies Aim Bayesian nonparametric model to model top-m partial rankings of a potentially infinite number of items. Each item is modelled using a positive rating parameter that is inferred from partial rankings. Develop efficient computational procedure for posterior simulation. Mixture generalizations. Application to preferences in college programmes Caron, Teh & Murphy 2 / 23
3 Preferences for college degree programmes Application for college degree programmes in Ireland in applicants provided top-10 rankings 533 degree programmes available for selection Heterogenity in the data Table: Two samples from the preference data. Rank CAO code College Degree Programme 1 DN002 University College Dublin Medicine 2 GY501 NUI - Galway Medicine 3 CK701 University College Cork Medicine 4 DN006 University College Dublin Physiotherapy 5 TR053 Trinity College Dublin Physiotherapy 6 DN004 University College Dublin Radiotherapy 7 TR007 Trinity College Dublin Clinical Speech 8 FT223 Dublin IT Human Nutrition 9 TR084 Trinity College Dublin Social Work 10 DN007 University College Dublin Social Science Rank CAO code College Degree Programme 1 MI005 Mary Immaculate Limerick Education - Primary Teaching 2 CK301 University College Cork Law 3 CK105 University College Cork European Studies 4 CK107 University College Cork Language - French 5 CK101 University College Cork Arts Caron, Teh & Murphy 3 / 23
4 Parametric Plackett-Luce Model Population of M items X 1,..., X M. To each item X k assign a rating parameter w k > 0. Stage-wise interpretation: Pick first item with probabilities proportional to w k s. Remove first item. Pick second item with probabilities proportional to w k s. Remove second item.... The probability of a ranking (X ρ1,..., X ρm ), with ρ = (ρ 1,..., ρ M ) a permutation, is P (ρ w) = M i=1 w ρi M k=1 w k i 1 j=1 w ρ j [Luce, 1959, Plackett, 1975], [Gormley and Murphy, 2008, Gormley and Murphy, 2009] Caron, Teh & Murphy 4 / 23
5 Nonparametric Plackett-Luce Model Population of items X 1, X 2,... of infinite size. Each item X k assigned a rating parameter w k > 0. Stage-wise interpretation: Pick first item with probabilities proportional to w k s. Remove first item. Pick second item with probabilities proportional to w k s. Remove second item.... The probability of a (finite, partial) ranking X ρ1,..., X ρm P (ρ w) = m i=1 w ρi k=1 w k i 1 j=1 w ρ j is Caron, Teh & Murphy 5 / 23
6 Thurstonian Interpretation For each item X k define an exponential random variate: z k Exp(w k ) Induces a partial permutation ρ such that z ρ1 < < z ρm < A race among items: z k is the time that item X k finished the race. ρ is the order of champion, runner-up, second runner-up... The probability of a partial permutation is: P (ρ w) =P (z ρ1 < z ρ2 < < z ρm < everything else) m w ρi = k=1 w k i 1 j=1 w ρ j i=1 Caron, Teh & Murphy 6 / 23
7 Thurstonian Interpretation w k z k ρ 4 ρ 3 ρ items 15 items 15 ρ ρ ratings time P (ρ w) =P (z ρ1 < z ρ2 < < z ρm < everything else) m w ρi = k=1 w k i 1 j=1 w ρ j i=1 Caron, Teh & Murphy 7 / 23
8 Data Augmentation Reparametrize in terms of inter-arrival durations: Z 1 = z ρ1, Z 2 = z ρ2 z ρ1,... Z m = z ρm z ρm 1 w k z k ρ 4 ρ 3 ρ items 15 items 15 ρ ρ ratings time Z 1 Z 2 Z 3 Z 4 Z 5 Caron, Teh & Murphy 8 / 23
9 Data Augmentation Augmented system with auxiliary variables Z 1,..., Z m : i 1 Z i ρ, w, X Exp w k Joint probability is: P (ρ, Z w, X) = k=1 j=1 m w ρi exp w k i=1 What prior to use for w, X? k=1 Independent Gamma s for w k s do not work. Gamma process. Better: completely random measures. w ρj i 1 j=1 w ρj Z i Caron, Teh & Murphy 9 / 23
10 Completely Random Measures Lévy intensity λ(w). Base distribution H with density h(x). Random atomic measure G = w k δ Xk k=1 Construction: two-dimensional Poisson process N = {w k, X k } with intensity λ(w)h(x): [Kingman, 1967, Lijoi and Prünster, 2010] Caron, Teh & Murphy 10 / 23
11 Completely Random Measures Conditions on Lévy intensity: 0 0 λ(w)dw = infinitely many items. (1 e w )λ(w)dw < finite total w k. k=1 Plackett-Luce sampling without replacement: Pick first item with probabilities proportional to w k s; remove. Pick second item with probabilities proportional to w k s; remove;... Size-biased sampling of the atoms in G. Caron, Teh & Murphy 11 / 23
12 Prior Draws Generalized Gamma process with λ(w) = α Γ(1 σ) w σ 1 e τ w, τ = (a) α = 0.1, σ = (b) α = 1, σ = (c) α = 3, σ = (d) α = 1, σ = 0.1 (e) α = 1, σ = 0.5 (f) α = 1, σ = 0.9 Caron, Teh & Murphy 12 / 23
13 Posterior Characterization Observe L partial rankings Y l = (Y l1,..., Y lml ) for l = 1,..., L. X1,..., X K are the K unique items among observations Associated auxiliary variables Z l1,..., Z lml. Theorem The posterior distribution given partial rankings Y and auxiliary variables Z is a CRM with fixed atoms: G Y, Z = G + K wk δ Xk k=1 where G and w 1,..., w K are mutually independent. The law of G is still CRM with Lévy intensity λ (w) = λ(w)e w( li Z li) while the masses have distributions, P (w k Y, Z) (w k )n ke w k ( li δ likz li ) λ(w k ). Characterization similar to that for normalized random measures. [Prünster, 2002, James, 2002, James et al., 2009] Caron, Teh & Murphy 13 / 23
14 Bayesian Inference via Gibbs Sampling Rating parameters wk of observed items. Latent variables Z li. CRM G containing unobserved items. Marginalize out G, keeping only its total mass w. Easy Gibbs sampler for generalized gamma process class of CRM Z li rest Exponential wk rest Gamma w rest Exponentially tilted stable Caron, Teh & Murphy 14 / 23
15 Nonparametric Plackett-Luce Mixture Model π Partial rankings reflecting the preferences of a heterogenous population. π GEM(θ) c l π Discrete(π) Y l c l, G cl PL(G cl ) c ρ G k C k Gamma(αH + C k, τ + φ) C k G 0 Poisson(φG 0 ) G 0 Gamma(αH, τ ) G k C k G k Gamma(αH, τ ) does not work. Caron, Teh & Murphy 15 / 23 G 0
16 Irish University Programme Applications Irish university applicants. Each applicant ranks their top-10 desired university programmes. Point estimate of the partition ĉ = arg min c (i) {c (1),...,c (N) } k l (δ (i) c k c(i) l where the coclustering matrix ζ is obtained with N ζ kl ) 2 ζ kl = 1 N i=1 δ (i) c k c(i) l c (i), i = 1,..., N are the Monte Carlo samples and δ kl = 1 if k = l, 0 otherwise. Caron, Teh & Murphy 16 / 23
17 Irish University Programme Applications Table: Description of the different clusters. The size of the clusters, the entropy and a cluster description are provided. Cluster Size Entropy Description Cluster Size Entropy Description Social Science/Tourism Engineering Science Teaching/Arts Business/Commerce Art/Music - Dublin Arts Engineering - Dublin Business/Marketing - Dublin Medicine Construction Arts/Religion/Theology CS - outside Dublin Arts/History - Dublin CS - Dublin Galway Arts/Social - outside Dublin Limerick Business/Finance - Dublin Law Arts/Psychology - Dublin Business - Dublin Cork Arts/Bus. - Dublin Comm./Journalism - Dublin Mixed Caron, Teh & Murphy 17 / 23
18 Irish University Programme Applications Table: Cluster 7: Computer Science - outside Dublin Rank Aver. Norm. Weight College Degree Programme Cork IT Computer Applications Limerick IT Software Development University of Limerick Computer Systems Waterford IT Applied Computing Cork IT Software Dev & Comp Net IT Carlow Computer Networking Athlone IT Computer and Software Engineering University College Cork Computer Science Dublin City University Computer Applications University of Limerick Information Technology Table: Cluster 8: Computer Science - Dublin Rank Aver. Norm. Weight College Degree Programme Dublin City University Computer Applications University College Dublin Computer Science NUI - Maynooth Computer Science Dublin IT Computer Science National College of Ireland Software Systems Dublin IT Business Info. Systems Dev Trinity College Dublin Computer Science Dublin IT Applied Sciences/Computing Trinity College Dublin Information & Comm. Tech University College Dublin B.A. (Computer Science) Caron, Teh & Murphy 18 / 23
19 Irish University Programme Applications Table: Cluster 12: Cork Rank Aver. Norm. Weight College Degree Programme University College Cork Arts University College Cork Computer Science University College Cork Commerce University College Cork Business Information Systems Cork IT Computer Applications Cork IT Software Dev & Comp Net University College Cork Finance University College Cork Law University College Cork Accounting University College Cork Biological and Chemical Sciences Caron, Teh & Murphy 19 / 23
20 Irish University Programme Applications Caron, Teh & Murphy 20 / 23
21 Summary A Bayesian nonparametric Plackett-Luce model for partial rankings. Completely random measures and posterior characterization. Easy Gibbs sampling for posterior simulation. Mixture generalization. Future: Dependent general CRM models Caron, Teh & Murphy 21 / 23
22 Bibliography I Caron, F., Teh, Y. W., and Murphy, B. (2013). Bayesian nonparametric Plackett-Luce models for the analysis of preferences for college degree programmes. To appear in Annals of Applied Statistics. Gormley, I. C. and Murphy, T. B. (2008). Exploring voting blocs with the Irish electorate: a mixture modeling approach. Journal of the American Statistical Association, 103(483): Gormley, I. C. and Murphy, T. B. (2009). A grade of membership model for rank data. Bayesian Analysis, 4(2): James, L., Lijoi, A., and Prünster, I. (2009). Posterior analysis for normalized random measures with independent increments. Scandinavian Journal of Statistics, 36(1): James, L. F. (2002). Poisson process partition calculus with applications to exchangeable models and bayesian nonparametrics. arxiv preprint math/ Caron, Teh & Murphy 22 / 23
23 Bibliography II Kingman, J. F. C. (1967). Completely random measures. Pacific Journal of Mathematics, 21(1): Lijoi, A. and Prünster, I. (2010). Models beyond the Dirichlet process. In Hjort, N. L., C. Holmes, P. M., and Walker, S. G., editors, Bayesian Nonparametrics. Cambridge University Press. Luce, R. D. (1959). Individual choice behavior: A theoretical analysis. Wiley. Plackett, R. (1975). The analysis of permutations. Journal of the Royal Statistical Society: Series C (Applied Statistics), 24(2): Prünster, I. (2002). Random probability measures derived from increasing additive processes and their application to Bayesian statistics. PhD thesis, University of Pavia. Caron, Teh & Murphy 23 / 23
Bayesian nonparametric models for bipartite graphs
Bayesian nonparametric models for bipartite graphs François Caron Department of Statistics, Oxford Statistics Colloquium, Harvard University November 11, 2013 F. Caron 1 / 27 Bipartite networks Readers/Customers
More informationBayesian nonparametric models for ranked data
Bayesian nonparametric models for ranked data François Caron INRIA IMB - University of Bordeaux Talence, France Francois.Caron@inria.fr Yee Whye Teh Department of Statistics University of Oxford Oxford,
More informationBayesian nonparametric latent feature models
Bayesian nonparametric latent feature models Indian Buffet process, beta process, and related models François Caron Department of Statistics, Oxford Applied Bayesian Statistics Summer School Como, Italy
More informationBayesian Nonparametrics: some contributions to construction and properties of prior distributions
Bayesian Nonparametrics: some contributions to construction and properties of prior distributions Annalisa Cerquetti Collegio Nuovo, University of Pavia, Italy Interview Day, CETL Lectureship in Statistics,
More informationBayesian nonparametric models for ranked data
arxiv:1211.4321v1 [stat.ml] 19 Nov 2012 Bayesian nonparametric models for ranked data François Caron, Yee Whye Teh RESEARCH REPORT N 8140 November 19, 2012 Project-Team ALEA ISSN 0249-6399 ISRN INRIA/RR--8140--FR+ENG
More informationA marginal sampler for σ-stable Poisson-Kingman mixture models
A marginal sampler for σ-stable Poisson-Kingman mixture models joint work with Yee Whye Teh and Stefano Favaro María Lomelí Gatsby Unit, University College London Talk at the BNP 10 Raleigh, North Carolina
More informationBayesian Nonparametrics: Dirichlet Process
Bayesian Nonparametrics: Dirichlet Process Yee Whye Teh Gatsby Computational Neuroscience Unit, UCL http://www.gatsby.ucl.ac.uk/~ywteh/teaching/npbayes2012 Dirichlet Process Cornerstone of modern Bayesian
More informationBayesian nonparametric models of sparse and exchangeable random graphs
Bayesian nonparametric models of sparse and exchangeable random graphs F. Caron & E. Fox Technical Report Discussion led by Esther Salazar Duke University May 16, 2014 (Reading group) May 16, 2014 1 /
More informationPreferences in college applications
Preferences in college applications A non-parametric Bayesian analysis of top-10 rankings Alnur Ali 1 Thomas Brendan Murphy 2 Marina Meilă 3 Harr Chen 4 1 Microsoft 2 University College Dublin 3 University
More informationBayesian nonparametric models for bipartite graphs
Bayesian nonparametric models for bipartite graphs François Caron INRIA IMB - University of Bordeaux Talence, France Francois.Caron@inria.fr Abstract We develop a novel Bayesian nonparametric model for
More informationTruncation error of a superposed gamma process in a decreasing order representation
Truncation error of a superposed gamma process in a decreasing order representation B julyan.arbel@inria.fr Í www.julyanarbel.com Inria, Mistis, Grenoble, France Joint work with Igor Pru nster (Bocconi
More informationBayesian nonparametric model for sparse dynamic networks
Bayesian nonparametric model for sparse dynamic networks Konstantina Palla, François Caron and Yee Whye Teh 23rd of February, 2018 University of Glasgow Konstantina Palla 1 / 38 BIG PICTURE Interested
More informationTruncation error of a superposed gamma process in a decreasing order representation
Truncation error of a superposed gamma process in a decreasing order representation Julyan Arbel Inria Grenoble, Université Grenoble Alpes julyan.arbel@inria.fr Igor Prünster Bocconi University, Milan
More informationCompound Random Measures
Compound Random Measures Jim Griffin (joint work with Fabrizio Leisen) University of Kent Introduction: Two clinical studies 3 CALGB8881 3 CALGB916 2 2 β 1 1 β 1 1 1 5 5 β 1 5 5 β Infinite mixture models
More informationOn the posterior structure of NRMI
On the posterior structure of NRMI Igor Prünster University of Turin, Collegio Carlo Alberto and ICER Joint work with L.F. James and A. Lijoi Isaac Newton Institute, BNR Programme, 8th August 2007 Outline
More informationSlice sampling σ stable Poisson Kingman mixture models
ISSN 2279-9362 Slice sampling σ stable Poisson Kingman mixture models Stefano Favaro S.G. Walker No. 324 December 2013 www.carloalberto.org/research/working-papers 2013 by Stefano Favaro and S.G. Walker.
More informationA Brief Overview of Nonparametric Bayesian Models
A Brief Overview of Nonparametric Bayesian Models Eurandom Zoubin Ghahramani Department of Engineering University of Cambridge, UK zoubin@eng.cam.ac.uk http://learning.eng.cam.ac.uk/zoubin Also at Machine
More informationPriors for Random Count Matrices with Random or Fixed Row Sums
Priors for Random Count Matrices with Random or Fixed Row Sums Mingyuan Zhou Joint work with Oscar Madrid and James Scott IROM Department, McCombs School of Business Department of Statistics and Data Sciences
More informationSpatial Normalized Gamma Process
Spatial Normalized Gamma Process Vinayak Rao Yee Whye Teh Presented at NIPS 2009 Discussion and Slides by Eric Wang June 23, 2010 Outline Introduction Motivation The Gamma Process Spatial Normalized Gamma
More informationDependent hierarchical processes for multi armed bandits
Dependent hierarchical processes for multi armed bandits Federico Camerlenghi University of Bologna, BIDSA & Collegio Carlo Alberto First Italian meeting on Probability and Mathematical Statistics, Torino
More informationSlice Sampling Mixture Models
Slice Sampling Mixture Models Maria Kalli, Jim E. Griffin & Stephen G. Walker Centre for Health Services Studies, University of Kent Institute of Mathematics, Statistics & Actuarial Science, University
More informationLecture 3a: Dirichlet processes
Lecture 3a: Dirichlet processes Cédric Archambeau Centre for Computational Statistics and Machine Learning Department of Computer Science University College London c.archambeau@cs.ucl.ac.uk Advanced Topics
More informationAsymptotics for posterior hazards
Asymptotics for posterior hazards Pierpaolo De Blasi University of Turin 10th August 2007, BNR Workshop, Isaac Newton Intitute, Cambridge, UK Joint work with Giovanni Peccati (Université Paris VI) and
More informationBayesian Nonparametrics for Speech and Signal Processing
Bayesian Nonparametrics for Speech and Signal Processing Michael I. Jordan University of California, Berkeley June 28, 2011 Acknowledgments: Emily Fox, Erik Sudderth, Yee Whye Teh, and Romain Thibaux Computer
More informationAdvanced Machine Learning
Advanced Machine Learning Nonparametric Bayesian Models --Learning/Reasoning in Open Possible Worlds Eric Xing Lecture 7, August 4, 2009 Reading: Eric Xing Eric Xing @ CMU, 2006-2009 Clustering Eric Xing
More informationDependent mixture models: clustering and borrowing information
ISSN 2279-9362 Dependent mixture models: clustering and borrowing information Antonio Lijoi Bernardo Nipoti Igor Pruenster No. 32 June 213 www.carloalberto.org/research/working-papers 213 by Antonio Lijoi,
More informationOn the Truncation Error of a Superposed Gamma Process
On the Truncation Error of a Superposed Gamma Process Julyan Arbel and Igor Prünster Abstract Completely random measures (CRMs) form a key ingredient of a wealth of stochastic models, in particular in
More informationBayesian Nonparametric Regression for Diabetes Deaths
Bayesian Nonparametric Regression for Diabetes Deaths Brian M. Hartman PhD Student, 2010 Texas A&M University College Station, TX, USA David B. Dahl Assistant Professor Texas A&M University College Station,
More informationInference in Explicit Duration Hidden Markov Models
Inference in Explicit Duration Hidden Markov Models Frank Wood Joint work with Chris Wiggins, Mike Dewar Columbia University November, 2011 Wood (Columbia University) EDHMM Inference November, 2011 1 /
More informationNonparametric Bayesian Methods: Models, Algorithms, and Applications (Day 5)
Nonparametric Bayesian Methods: Models, Algorithms, and Applications (Day 5) Tamara Broderick ITT Career Development Assistant Professor Electrical Engineering & Computer Science MIT Bayes Foundations
More informationDistance-Based Probability Distribution for Set Partitions with Applications to Bayesian Nonparametrics
Distance-Based Probability Distribution for Set Partitions with Applications to Bayesian Nonparametrics David B. Dahl August 5, 2008 Abstract Integration of several types of data is a burgeoning field.
More informationPrerequisite: STATS 7 or STATS 8 or AP90 or (STATS 120A and STATS 120B and STATS 120C). AP90 with a minimum score of 3
University of California, Irvine 2017-2018 1 Statistics (STATS) Courses STATS 5. Seminar in Data Science. 1 Unit. An introduction to the field of Data Science; intended for entering freshman and transfers.
More informationDirichlet Process. Yee Whye Teh, University College London
Dirichlet Process Yee Whye Teh, University College London Related keywords: Bayesian nonparametrics, stochastic processes, clustering, infinite mixture model, Blackwell-MacQueen urn scheme, Chinese restaurant
More informationBayesian nonparametric latent feature models
Bayesian nonparametric latent feature models François Caron UBC October 2, 2007 / MLRG François Caron (UBC) Bayes. nonparametric latent feature models October 2, 2007 / MLRG 1 / 29 Overview 1 Introduction
More informationBayesian Hidden Markov Models and Extensions
Bayesian Hidden Markov Models and Extensions Zoubin Ghahramani Department of Engineering University of Cambridge joint work with Matt Beal, Jurgen van Gael, Yunus Saatci, Tom Stepleton, Yee Whye Teh Modeling
More informationParsimonious Adaptive Rejection Sampling
Parsimonious Adaptive Rejection Sampling Luca Martino Image Processing Laboratory, Universitat de València (Spain). Abstract Monte Carlo (MC) methods have become very popular in signal processing during
More informationHaupthseminar: Machine Learning. Chinese Restaurant Process, Indian Buffet Process
Haupthseminar: Machine Learning Chinese Restaurant Process, Indian Buffet Process Agenda Motivation Chinese Restaurant Process- CRP Dirichlet Process Interlude on CRP Infinite and CRP mixture model Estimation
More informationBayesian Inference. Chapter 2: Conjugate models
Bayesian Inference Chapter 2: Conjugate models Conchi Ausín and Mike Wiper Department of Statistics Universidad Carlos III de Madrid Master in Business Administration and Quantitative Methods Master in
More informationOn Simulations form the Two-Parameter. Poisson-Dirichlet Process and the Normalized. Inverse-Gaussian Process
On Simulations form the Two-Parameter arxiv:1209.5359v1 [stat.co] 24 Sep 2012 Poisson-Dirichlet Process and the Normalized Inverse-Gaussian Process Luai Al Labadi and Mahmoud Zarepour May 8, 2018 ABSTRACT
More informationUnit-rate Poisson representations of completely random measures
Unit-rate Poisson representations of completely random measures Peter Orbanz and Sinead Williamson Abstract: Constructive definitions of discrete random measures, which specify a sampling procedure for
More informationMAD-Bayes: MAP-based Asymptotic Derivations from Bayes
MAD-Bayes: MAP-based Asymptotic Derivations from Bayes Tamara Broderick Brian Kulis Michael I. Jordan Cat Clusters Mouse clusters Dog 1 Cat Clusters Dog Mouse Lizard Sheep Picture 1 Picture 2 Picture 3
More informationBayesian Methods for Machine Learning
Bayesian Methods for Machine Learning CS 584: Big Data Analytics Material adapted from Radford Neal s tutorial (http://ftp.cs.utoronto.ca/pub/radford/bayes-tut.pdf), Zoubin Ghahramni (http://hunch.net/~coms-4771/zoubin_ghahramani_bayesian_learning.pdf),
More informationA Bayesian Nonparametric Model for Predicting Disease Status Using Longitudinal Profiles
A Bayesian Nonparametric Model for Predicting Disease Status Using Longitudinal Profiles Jeremy Gaskins Department of Bioinformatics & Biostatistics University of Louisville Joint work with Claudio Fuentes
More informationImage segmentation combining Markov Random Fields and Dirichlet Processes
Image segmentation combining Markov Random Fields and Dirichlet Processes Jessica SODJO IMS, Groupe Signal Image, Talence Encadrants : A. Giremus, J.-F. Giovannelli, F. Caron, N. Dobigeon Jessica SODJO
More informationA Nonparametric Approach for Multiple Change Point Analysis of Multivariate Data
A Nonparametric Approach for Multiple Change Point Analysis of Multivariate Data David S. Matteson Department of Statistical Science Cornell University matteson@cornell.edu www.stat.cornell.edu/~matteson
More information19 : Bayesian Nonparametrics: The Indian Buffet Process. 1 Latent Variable Models and the Indian Buffet Process
10-708: Probabilistic Graphical Models, Spring 2015 19 : Bayesian Nonparametrics: The Indian Buffet Process Lecturer: Avinava Dubey Scribes: Rishav Das, Adam Brodie, and Hemank Lamba 1 Latent Variable
More informationProbabilistic and Bayesian Machine Learning
Probabilistic and Bayesian Machine Learning Lecture 1: Introduction to Probabilistic Modelling Yee Whye Teh ywteh@gatsby.ucl.ac.uk Gatsby Computational Neuroscience Unit University College London Why a
More informationMaster of Science in Statistics A Proposal
1 Master of Science in Statistics A Proposal Rationale of the Program In order to cope up with the emerging complexity on the solutions of realistic problems involving several phenomena of nature it is
More informationCSC 2541: Bayesian Methods for Machine Learning
CSC 2541: Bayesian Methods for Machine Learning Radford M. Neal, University of Toronto, 2011 Lecture 4 Problem: Density Estimation We have observed data, y 1,..., y n, drawn independently from some unknown
More informationModelling Genetic Variations with Fragmentation-Coagulation Processes
Modelling Genetic Variations with Fragmentation-Coagulation Processes Yee Whye Teh, Charles Blundell, Lloyd Elliott Gatsby Computational Neuroscience Unit, UCL Genetic Variations in Populations Inferring
More informationVariable selection for model-based clustering of categorical data
Variable selection for model-based clustering of categorical data Brendan Murphy Wirtschaftsuniversität Wien Seminar, 2016 1 / 44 Alzheimer Dataset Data were collected on early onset Alzheimer patient
More informationGraphical Models for Query-driven Analysis of Multimodal Data
Graphical Models for Query-driven Analysis of Multimodal Data John Fisher Sensing, Learning, & Inference Group Computer Science & Artificial Intelligence Laboratory Massachusetts Institute of Technology
More informationBayesian Nonparametrics
Bayesian Nonparametrics Peter Orbanz Columbia University PARAMETERS AND PATTERNS Parameters P(X θ) = Probability[data pattern] 3 2 1 0 1 2 3 5 0 5 Inference idea data = underlying pattern + independent
More informationHyperparameter estimation in Dirichlet process mixture models
Hyperparameter estimation in Dirichlet process mixture models By MIKE WEST Institute of Statistics and Decision Sciences Duke University, Durham NC 27706, USA. SUMMARY In Bayesian density estimation and
More informationPriors for Random Count Matrices Derived from a Family of Negative Binomial Processes: Supplementary Material
Priors for Random Count Matrices Derived from a Family of Negative Binomial Processes: Supplementary Material A The Negative Binomial Process: Details A. Negative binomial process random count matrix To
More informationBayesian Nonparametric Autoregressive Models via Latent Variable Representation
Bayesian Nonparametric Autoregressive Models via Latent Variable Representation Maria De Iorio Yale-NUS College Dept of Statistical Science, University College London Collaborators: Lifeng Ye (UCL, London,
More informationHierarchical Bayesian Languge Model Based on Pitman-Yor Processes. Yee Whye Teh
Hierarchical Bayesian Languge Model Based on Pitman-Yor Processes Yee Whye Teh Probabilistic model of language n-gram model Utility i-1 P(word i word i-n+1 ) Typically, trigram model (n=3) e.g., speech,
More informationTwo Tales About Bayesian Nonparametric Modeling
Two Tales About Bayesian Nonparametric Modeling Pierpaolo De Blasi Stefano Favaro Antonio Lijoi Ramsés H. Mena Igor Prünster Abstract Gibbs type priors represent a natural generalization of the Dirichlet
More informationAdaptive Rejection Sampling with fixed number of nodes
Adaptive Rejection Sampling with fixed number of nodes L. Martino, F. Louzada Institute of Mathematical Sciences and Computing, Universidade de São Paulo, Brazil. Abstract The adaptive rejection sampling
More informationABC methods for phase-type distributions with applications in insurance risk problems
ABC methods for phase-type with applications problems Concepcion Ausin, Department of Statistics, Universidad Carlos III de Madrid Joint work with: Pedro Galeano, Universidad Carlos III de Madrid Simon
More informationNormalized Random Measure Mixture Models in Variational Autoencoders
JMLR: Workshop and Conference Proceedings 1 6 Normalized Random Measure Mixture Models in Variational Autoencoders Rahul Mehta mehta5@uic.edu and Hui Lu huilu@uic.edu Department of Bioengineering University
More informationRonald Christensen. University of New Mexico. Albuquerque, New Mexico. Wesley Johnson. University of California, Irvine. Irvine, California
Texts in Statistical Science Bayesian Ideas and Data Analysis An Introduction for Scientists and Statisticians Ronald Christensen University of New Mexico Albuquerque, New Mexico Wesley Johnson University
More informationCTDL-Positive Stable Frailty Model
CTDL-Positive Stable Frailty Model M. Blagojevic 1, G. MacKenzie 2 1 Department of Mathematics, Keele University, Staffordshire ST5 5BG,UK and 2 Centre of Biostatistics, University of Limerick, Ireland
More informationExchangeability. Peter Orbanz. Columbia University
Exchangeability Peter Orbanz Columbia University PARAMETERS AND PATTERNS Parameters P(X θ) = Probability[data pattern] 3 2 1 0 1 2 3 5 0 5 Inference idea data = underlying pattern + independent noise Peter
More informationFitting Multidimensional Latent Variable Models using an Efficient Laplace Approximation
Fitting Multidimensional Latent Variable Models using an Efficient Laplace Approximation Dimitris Rizopoulos Department of Biostatistics, Erasmus University Medical Center, the Netherlands d.rizopoulos@erasmusmc.nl
More informationProbabilistic Low-Rank Matrix Completion with Adaptive Spectral Regularization Algorithms
Probabilistic Low-Rank Matrix Completion with Adaptive Spectral Regularization Algorithms Adrien Todeschini Inria Bordeaux JdS 2014, Rennes Aug. 2014 Joint work with François Caron (Univ. Oxford), Marie
More informationDiscussion of On simulation and properties of the stable law by L. Devroye and L. James
Stat Methods Appl (2014) 23:371 377 DOI 10.1007/s10260-014-0269-4 Discussion of On simulation and properties of the stable law by L. Devroye and L. James Antonio Lijoi Igor Prünster Accepted: 16 May 2014
More informationSTAT 518 Intro Student Presentation
STAT 518 Intro Student Presentation Wen Wei Loh April 11, 2013 Title of paper Radford M. Neal [1999] Bayesian Statistics, 6: 475-501, 1999 What the paper is about Regression and Classification Flexible
More informationBayesian Nonparametric Models
Bayesian Nonparametric Models David M. Blei Columbia University December 15, 2015 Introduction We have been looking at models that posit latent structure in high dimensional data. We use the posterior
More informationBayesian nonparametrics
Bayesian nonparametrics 1 Some preliminaries 1.1 de Finetti s theorem We will start our discussion with this foundational theorem. We will assume throughout all variables are defined on the probability
More informationSTA 4273H: Statistical Machine Learning
STA 4273H: Statistical Machine Learning Russ Salakhutdinov Department of Computer Science! Department of Statistical Sciences! rsalakhu@cs.toronto.edu! h0p://www.cs.utoronto.ca/~rsalakhu/ Lecture 7 Approximate
More informationBayesian Nonparametrics: Models Based on the Dirichlet Process
Bayesian Nonparametrics: Models Based on the Dirichlet Process Alessandro Panella Department of Computer Science University of Illinois at Chicago Machine Learning Seminar Series February 18, 2013 Alessandro
More informationBayesian estimation of the discrepancy with misspecified parametric models
Bayesian estimation of the discrepancy with misspecified parametric models Pierpaolo De Blasi University of Torino & Collegio Carlo Alberto Bayesian Nonparametrics workshop ICERM, 17-21 September 2012
More informationNew Dirichlet Mean Identities
Hong Kong University of Science and Technology Isaac Newton Institute, August 10, 2007 Origins CIFARELLI, D. M. and REGAZZINI, E. (1979). Considerazioni generali sull impostazione bayesiana di problemi
More informationHidden Markov models: from the beginning to the state of the art
Hidden Markov models: from the beginning to the state of the art Frank Wood Columbia University November, 2011 Wood (Columbia University) HMMs November, 2011 1 / 44 Outline Overview of hidden Markov models
More informationOutline. Binomial, Multinomial, Normal, Beta, Dirichlet. Posterior mean, MAP, credible interval, posterior distribution
Outline A short review on Bayesian analysis. Binomial, Multinomial, Normal, Beta, Dirichlet Posterior mean, MAP, credible interval, posterior distribution Gibbs sampling Revisit the Gaussian mixture model
More informationGaussian Mixture Model
Case Study : Document Retrieval MAP EM, Latent Dirichlet Allocation, Gibbs Sampling Machine Learning/Statistics for Big Data CSE599C/STAT59, University of Washington Emily Fox 0 Emily Fox February 5 th,
More informationThe Mixture Approach for Simulating New Families of Bivariate Distributions with Specified Correlations
The Mixture Approach for Simulating New Families of Bivariate Distributions with Specified Correlations John R. Michael, Significance, Inc. and William R. Schucany, Southern Methodist University The mixture
More informationDefining Predictive Probability Functions for Species Sampling Models
Defining Predictive Probability Functions for Species Sampling Models Jaeyong Lee Department of Statistics, Seoul National University leejyc@gmail.com Fernando A. Quintana Departamento de Estadísica, Pontificia
More informationOn Consistency of Nonparametric Normal Mixtures for Bayesian Density Estimation
On Consistency of Nonparametric Normal Mixtures for Bayesian Density Estimation Antonio LIJOI, Igor PRÜNSTER, andstepheng.walker The past decade has seen a remarkable development in the area of Bayesian
More informationBayesian Nonparametric Models on Decomposable Graphs
Bayesian Nonparametric Models on Decomposable Graphs François Caron INRIA Bordeaux Sud Ouest Institut de Mathématiques de Bordeaux University of Bordeaux, France francois.caron@inria.fr Arnaud Doucet Departments
More informationBayesian Nonparametrics
Bayesian Nonparametrics Lorenzo Rosasco 9.520 Class 18 April 11, 2011 About this class Goal To give an overview of some of the basic concepts in Bayesian Nonparametrics. In particular, to discuss Dirichelet
More informationAsymptotics for posterior hazards
Asymptotics for posterior hazards Igor Prünster University of Turin, Collegio Carlo Alberto and ICER Joint work with P. Di Biasi and G. Peccati Workshop on Limit Theorems and Applications Paris, 16th January
More information27 : Distributed Monte Carlo Markov Chain. 1 Recap of MCMC and Naive Parallel Gibbs Sampling
10-708: Probabilistic Graphical Models 10-708, Spring 2014 27 : Distributed Monte Carlo Markov Chain Lecturer: Eric P. Xing Scribes: Pengtao Xie, Khoa Luu In this scribe, we are going to review the Parallel
More informationPart 1: Expectation Propagation
Chalmers Machine Learning Summer School Approximate message passing and biomedicine Part 1: Expectation Propagation Tom Heskes Machine Learning Group, Institute for Computing and Information Sciences Radboud
More informationCSCI 5822 Probabilistic Model of Human and Machine Learning. Mike Mozer University of Colorado
CSCI 5822 Probabilistic Model of Human and Machine Learning Mike Mozer University of Colorado Topics Language modeling Hierarchical processes Pitman-Yor processes Based on work of Teh (2006), A hierarchical
More informationAdaptive Rejection Sampling with fixed number of nodes
Adaptive Rejection Sampling with fixed number of nodes L. Martino, F. Louzada Institute of Mathematical Sciences and Computing, Universidade de São Paulo, São Carlos (São Paulo). Abstract The adaptive
More informationClustering bi-partite networks using collapsed latent block models
Clustering bi-partite networks using collapsed latent block models Jason Wyse, Nial Friel & Pierre Latouche Insight at UCD Laboratoire SAMM, Université Paris 1 Mail: jason.wyse@ucd.ie Insight Latent Space
More informationContents. Part I: Fundamentals of Bayesian Inference 1
Contents Preface xiii Part I: Fundamentals of Bayesian Inference 1 1 Probability and inference 3 1.1 The three steps of Bayesian data analysis 3 1.2 General notation for statistical inference 4 1.3 Bayesian
More informationarxiv: v1 [stat.ml] 6 Jul 2016
Bayesian nonparametrics for Sparse Dynamic Networks Konstantina Palla, François Caron, Yee Whye Teh Department of Statistics, Oxford University 24-29 St Giles, Oxford, OX1 3LB United Kingdom {palla,caron,
More informationTemporal point processes: the conditional intensity function
Temporal point processes: the conditional intensity function Jakob Gulddahl Rasmussen December 21, 2009 Contents 1 Introduction 2 2 Evolutionary point processes 2 2.1 Evolutionarity..............................
More informationAn Efficient Sequential Monte Carlo Algorithm for Coalescent Clustering
An Efficient Sequential Monte Carlo Algorithm for Coalescent Clustering Dilan Görür Gatsby Unit University College London dilan@gatsby.ucl.ac.uk Yee Whye Teh Gatsby Unit University College London ywteh@gatsby.ucl.ac.uk
More informationAn Overview of Choice Models
An Overview of Choice Models Dilan Görür Gatsby Computational Neuroscience Unit University College London May 08, 2009 Machine Learning II 1 / 31 Outline 1 Overview Terminology and Notation Economic vs
More informationInfinite-State Markov-switching for Dynamic. Volatility Models : Web Appendix
Infinite-State Markov-switching for Dynamic Volatility Models : Web Appendix Arnaud Dufays 1 Centre de Recherche en Economie et Statistique March 19, 2014 1 Comparison of the two MS-GARCH approximations
More informationBayesian Modeling of Conditional Distributions
Bayesian Modeling of Conditional Distributions John Geweke University of Iowa Indiana University Department of Economics February 27, 2007 Outline Motivation Model description Methods of inference Earnings
More informationLecture 1a: Basic Concepts and Recaps
Lecture 1a: Basic Concepts and Recaps Cédric Archambeau Centre for Computational Statistics and Machine Learning Department of Computer Science University College London c.archambeau@cs.ucl.ac.uk Advanced
More informationSTA 4273H: Statistical Machine Learning
STA 4273H: Statistical Machine Learning Russ Salakhutdinov Department of Statistics! rsalakhu@utstat.toronto.edu! http://www.utstat.utoronto.ca/~rsalakhu/ Sidney Smith Hall, Room 6002 Lecture 7 Approximate
More informationMarkov chain Monte Carlo methods for visual tracking
Markov chain Monte Carlo methods for visual tracking Ray Luo rluo@cory.eecs.berkeley.edu Department of Electrical Engineering and Computer Sciences University of California, Berkeley Berkeley, CA 94720
More informationModelling Dropouts by Conditional Distribution, a Copula-Based Approach
The 8th Tartu Conference on MULTIVARIATE STATISTICS, The 6th Conference on MULTIVARIATE DISTRIBUTIONS with Fixed Marginals Modelling Dropouts by Conditional Distribution, a Copula-Based Approach Ene Käärik
More informationDeep Variational Inference. FLARE Reading Group Presentation Wesley Tansey 9/28/2016
Deep Variational Inference FLARE Reading Group Presentation Wesley Tansey 9/28/2016 What is Variational Inference? What is Variational Inference? Want to estimate some distribution, p*(x) p*(x) What is
More information