The family of hierarchical classes models: A state-of-the-art overview. Iven Van Mechelen, Eva Ceulemans, Iwin Leenen, and Jan Schepers
|
|
- Adela Hopkins
- 6 years ago
- Views:
Transcription
1 The family of hierarchical classes models: state-of-the-art overview Iven Van Mechelen, Eva Ceulemans, Iwin Leenen, and Jan Schepers University of Leuven
2 Overview of the talk. data. models. context. hierarchical classes models.. basic model.. justification of the Max operator..3 the HICLS family 3. research topics 3. models 3. estimation 3.3 model selection and model checking 3.4 quantification of uncertainty 4. references
3 Overview of the talk. data. models. context. hierarchical classes models.. basic model.. justification of the Max operator..3 the HICLS family 3. research topics 3. models 3. estimation 3.3 model selection and model checking 3.4 quantification of uncertainty 4. references
4 . Data I I I N-way N-mode data array... N X with entries x i i... i N binary, rating-valued, real-valued array-conditionality Note: We leave aside: () structurally incomplete data () multiblock data
5 Overview of the talk. data. models. context. hierarchical classes models.. basic model.. justification of the Max operator..3 the HICLS family 3. research topics 3. models 3. estimation 3.3 model selection and model checking 3.4 quantification of uncertainty 4. references
6 . Models. Context N-way Tucker model: x a g e N N N n i i... i =... Π i p p p... p + i i... i p = p = p = n= N n n N N n In n with denoting the n th component matrix G and denoting the core array
7 x a g e N N N n i i... i =... Π i p p p... p + i i... i p = p = p = n= N n n N N options:. identity matrix for,,, N- component matrices Tucker N- N-way Tucker N- model Tucker G n. superdiagonal or superidentity N-way CNDECOM/RFC
8 models studied in research group: () primary constraint: n binary for,,, N component matrices overlapping clustering for n th mode () operator: Σ or Max (Min) x a g e N N N n i i... i =... Π i p p p... p + i i... i p = p = p = n= N n n N N x a g e... Max Max... Max N N n i i i = i p p p... p i i... i p = p = p = Π + n= N n n N N N unifying model: Van Mechelen & Schepers (submitted)
9 focus of present talk: n () binary for all component matrices overlapping clustering for all modes () operator: Max G (3) takes values in +
10 Overview of the talk. data. models. context. hierarchical classes models.. basic model.. justification of the Max operator..3 the HICLS family 3. research topics 3. models 3. estimation 3.3 model selection and model checking 3.4 quantification of uncertainty 4. references
11 . Models. Hierarchical classes models.. Basic model two-way two-mode ( ) x = Max Max a a g + e i i i p i p p p i i p = p = reconstructed data xˆi i
12 ( ) xˆ = Max Max a a g i i i p i p p p p = p = three cases: G. binary (/) is identity matrix G xˆ = Max a a ( ) i i i p i p p= ( ) i i = i p i p p= xˆ a a (Boolean sum) G. rating-valued (i.e., values in {,,, V} ) G 3. real-valued (i.e., values in ) +
13 . Models. Hierarchical classes models.. Justification of the Max operator Three (interrelated reasons):. representation by component matrices of quasi-orders that can be naturally defined on each of the modes. substantive interpretation of decomposition rule 3. decision rule for overlapping biclusters (triclusters etc.)
14 . Models. Hierarchical classes models.. Justification of the Max operator Three (interrelated reasons):. representation by component matrices of quasi-orders that can be naturally defined on each of the modes. substantive interpretation of decomposition rule 3. decision rule for overlapping biclusters (triclusters etc.)
15 binary case (G is /) illustrative example: 7 6 feature by object reconstructed data O O O 3 O 4 O 5 O 6 F F ˆ = X F 3 F 4 F 5 F 6 F 7 implication relation among features e.g., F F 5, F 3 F 4, F 5 F 7
16 similar implication relation among objects quasi-order: reflexive, transitive implies partial order on resulting equivalence classes graphical representation: (quasi-) Hasse diagram F 7 F 5, F 6 F F F 3, F 4 HIERRCHICL CLSSES!!!
17 representation of quasi-orders by component matrices: quasi-order on {F,, F 7 } quasi-order on {F,, F 7 } as implied by ˆX = as implied by O O O 3 O 4 O 5 O 6 3 F F F F ˆ = X F 3 F 4 F 3 F 4 = F 5 F 5 F 6 F 6 F 7 F 7
18 representation of quasi-orders by component matrices: quasi-order on {F,, F 7 } quasi order-on {F,, F 7 } as implied by ˆX = as implied by 3 F F 7 F F 5, F 6 F F F 3, F 4 F 3 F 4 F 5 F 6 F 7 =
19 representation of quasi-orders by component matrices: quasi-order on {O,, O 6 } quasi-order on {O,, O 6 } as implied by ˆX = as implied by O O O 3 O 4 O 5 O 6 33 F O F O ˆ = X F 3 F 4 O 3 O 4 = F 5 5 F 6 6 F 7
20 representation of quasi-orders by component matrices: quasi-order on {O,, O 6 } quasi-order on {O,, O 6 } as implied by ˆX = as implied by 33 O O O 5 O 6 O O, O 3 O 4 O 3 O 4 O 5 O 6 =
21 Note: (quasi) Hasse diagram as implied by F 7 F 5, F 6 F F F 3, F 4
22 Note: (quasi) Hasse diagram as implied by O 5 O 6 O O, O 3 O 4
23 Note: (quasi) Hasse diagrams as implied by and F 7 F 5, F 6 F F F 3, F 4 O 5 O 6 O O, O 3 O 4
24 Note: F 7 F 5, F 6 F F F 3, F 4 O O, O 3 O 4 O 5 O 6
25 Note: comprehensive graphical representation of HICLS model!!! F 7 F 5, F 6 F F F 3, F 4 O O, O 3 O 4 O 5 O 6
26 rating- or positively real-valued case illustrative example: 7 6 feature by object reconstructed data O O O 3 O 4 O 5 O 6 F 4 4 F ˆ = X F 3 F 4 F F F generalized implication relation among features e.g., F F 5, F 3 F 4, F 5 F 7
27 generalized implication relation is again a quasi-order similar quasi-order on objects quasi-orders can be graphically represented by (quasi-) Hasse diagrams e.g., F 7 F 5, F 6 F F F 3, F 4 quasi-orders are to be represented by component matrices
28 representation of quasi-orders by component matrices: quasi-order on {F,, F 7 } quasi-order on {F,, F 7 } as implied by Xˆ = as implied by O O O 3 O 4 O 5 O 6 3 F 4 4 F F F ˆ = X F 3 F 4 F 3 F 4 = F F 5 F F 6 F F 7
29 Note: G F O 4 F O 5 F 3 O 3 3 F 4 O 4 F 5 O 5 F 6 O 6 F 7
30 Note: comprehensive graphical representation of HICLS model F 7 F 5, F 6 F F F 3, F O O, O 3 O 4 O 5 O 6
31 Note: Max-operator can be shown to be only operator that allows representations of quasi-orders!!!
32 . Models. Hierarchical classes models.. Justification of the Max operator Three (interrelated reasons):. representation by component matrices of quasi-orders that can be naturally defined on each of the modes. substantive interpretation of decomposition rule 3. decision rule for overlapping biclusters (triclusters etc.)
33 consider binary (/) case assume (reconstructed) data pertain to person by problem failure/success: x ˆ = : i i x ˆ = : i i person i fails for problem i person i succeeds for problem i underlying mechanism: -(latent) solution strategies - person i may master strategy p or not: a = or - strategy p may be suitable for solving problem i or not: i p a = or i p
34 xˆ = Max a a ( ) i i i p i p p= ( ) xˆ = a a i i i p i p p= ( ) xˆ = iff p: a = and a = i i i p i p Note: person i succeeds iff there is at least one strategy p that for problem i person i masters and that is suitable for solving problem i existential quantifier disjunctive model
35 . Models. Hierarchical classes models.. Justification of the Max operator Three (interrelated reasons):. representation by component matrices of quasi-orders that can be naturally defined on each of the modes. substantive interpretation of decomposition rule 3. decision rule for overlapping biclusters (triclusters etc.)
36 consider positively real-valued (reconstructed) object by variable data consider the following two models: ( ) xˆ a a g = i i i p i p p p p = p = ( ) xˆ = Max Max a a g i i i p i p p p p = p =
37 ( ) xˆ a a g = i i i p i p p p p = p = ( ) xˆ = Max Max a a g i i i p i p p p p = p = O V O V 3 3 O 3 V 3 G O 4 V 4 O 5 V 5 O 6 V 6 V 7
38 O O ( ) xˆ a a g = ( xˆ ) i i = Max Max a i p a i p g p p p = p= i i i p i p p p p = p = O 3 O 4 O 5 O 6 G 3 V V V 3 V 4 V 5 V 6 V 7
39 ( ) xˆ a a g = i i i p i p p p p = p = ( ) xˆ = Max Max a a g i i i p i p p p p = p = V V V 3 V 4 V 5 V 6 V 7 O O O 3 O 4 O 5 O 6 O O O 3 O 4 O 5 O 6 G 3 V V V 3 V 4 V 5 V 6 V 7
40 ( ) xˆ a a g = i i i p i p p p p = p = ( ) xˆ = Max Max a a g i i i p i p p p p = p = V V V 3 V 4 V 5 V 6 V 7 O O O 3 O 4 O 5 O 6 O O O 3 O 4 O 5 O 6 G 3 V V V 3 V 4 V 5 V 6 V 7
41 ( ) xˆ a a g = i i i p i p p p p = p = ( ) xˆ = Max Max a a g i i i p i p p p p = p = V V V 3 V 4 V 5 V 6 V 7 O O O 3 O 4 O 5 O 6 O O O 3 O 4 O 5 O 6 G 3 V V V 3 V 4 V 5 V 6 V 7
42 ( ) xˆ a a g = i i i p i p p p p = p = ( ) xˆ = Max Max a a g i i i p i p p p p = p = V V V 3 V 4 V 5 V 6 V 7 O O O 3 O 4 O 5 O 6 O O O 3 O 4 O 5 O 6 G 3 V V V 3 V 4 V 5 V 6 V 7
43 ( ) xˆ a a g = i i i p i p p p p = p = ( ) xˆ = Max Max a a g i i i p i p p p p = p = V V V 3 V 4 V 5 V 6 V 7 O O O 3 O 4 O 5 O 6 O O O 3 O 4 O 5 O 6 G 3 V V V 3 V 4 V 5 V 6 V 7
44 ( ) xˆ a a g = ( xˆ ) i i = Max Max a i p a i p g p p p = p= i i i p i p p p p = p = V V V 3 V 4 V 5 V 6 V 7 O O O 3 O 4 O 5 O 6 O? O O 3 O 4 O 5 O 6 G 3 V V V 3 V 4 V 5 V 6 V 7
45 ( ) xˆ a a g = ( xˆ ) i i = Max Max a i p a i p g p p p = p= i i i p i p p p p = p = V V V 3 V 4 V 5 V 6 V 7 O O O 3 O 4 O 5 O 6 O O O 3 O 4 O 5 O 6 G 3 V V V 3 V 4 V 5 V 6 V 7
46 ( ) xˆ a a g = ( xˆ ) i i = Max Max a i p a i p g p p p = p= i i i p i p p p p = p = V V V 3 V 4 V 5 V 6 V 7 O O O 3 O 4 O 5 O 6 O O O 3 O 4 O 5 O 6 G 3 V V V 3 V 4 V 5 V 6 V 7
47 Note: taxonomic overview of biclustering: see Van Mechelen, Bock, & De Boeck (4)
48 . Models. Hierarchical classes models..3 The HICLS family key features:. operator = Max x a g e... Max Max... Max N N n i i i = i p p p... p i i... i p = p = p = Π + n= N n n N N N. n all / 3. entries of G 4. representation of quasi-orders by + n
49 unconstrained models: G / N = G =I (disjunctive) hierarchical classes model (De Boeck and Rosenberg, 988) Notes: * dual conjunctive model with Min-operator: Van Mechelen, De Boeck & Rosenberg, 995) * stochastic extension: Bayesian HICLS (Leenen, Van Mechelen, Gelman & De Knop, submitted)
50 N=3 * G superidentity (CNDECOM/RFC case): INDCLS (Leenen, Van Mechelen, De Boeck & Rosenberg, 999) * G general: - Tucker3 HICLS (Ceulemans, Van Mechelen & Leenen, 3) - Tucker HICLS (Ceulemans & Van Mechelen, 4) G rating-valued (N =) HICLS-R (Van Mechelen, Lombardi & Ceulemans, conditionally accepted) Note: option to put limit on number of distinct values in G ( optimal coarsening )
51 entries G + real-valued HICLS (Schepers & Van Mechelen, submitted) constrained models taxonomy of constraints (Ceulemans, Van Mechelen & Kuppens, 4), based on:. locus: component matrices core. nature: value (e.g., values from previous study) vs. structure (e.g., Guttman scale, consecutive ones, decomposability, lower bound on number of ones, etc.) 3. extent: full vs. partial 4. use of external information: no vs. yes Note: varying amount of required algorithmic adaptations
52 Overview of the talk. data. models. context. hierarchical classes models.. basic model.. justification of the Max operator..3 the HICLS family 3. research topics 3. models 3. estimation 3.3 model selection and model checking 3.4 quantification of uncertainty 4. references
53 3. Research topics 3. Models mathematical properties (e.g., rank) relevant structures: Boolean algebra and tropical semiring ( +,Max, ) Notes:. differences between these structures and common algebras over e.g., row, column, and decomposition (Schein) rank may differ e.g., only invertible Boolean matrices are permutation matrices. even with Σ-operator rank issues considerably change if one or n more of the are constrained to be / ( -/)
54 identifiability / uniqueness: - only invertible Boolean matrices are permutation matrices less identifiability problems than for common N-way Tucker models in general only permutational freedom rather than full rotational freedom - remaining amount of nonuniqueness pertains to particular decompositions - part of remaining nonuniqueness removed by requirement of representation of quasi-order - at this moment only sufficient condition for uniqueness in / case (Van Mechelen, De Boeck & Rosenberg, 995; Ceulemans & Van Mechelen, 3) HICLS/INDCLS: presence of all component patterns that contain single Tucker3 HICLS: idem + no core slice is subset of union of other core slices
55 model interrelations (Ceulemans & Van Mechelen, 5): Tucker () Tucker (B) Tucker (C) Tucker (B) Tucker (C) Tucker (BC) Tucker3 (BC) RFC (BC)
56 Overview of the talk. data. models. context. hierarchical classes models.. basic model.. justification of the Max operator..3 the HICLS family 3. research topics 3. models 3. estimation 3.3 model selection and model checking 3.4 quantification of uncertainty 4. references
57 3. Research topics 3. Estimation L = xˆ x i i... in i i... i i, i,..., i N ( ˆ ) i i i i i i L = x x... N... i, i,..., i N N N Note: in pure / case L = L evaluation criteria for algorithms: - primary: goodness of fit;?global optimum - secondary: goodness of recovery
58 types of algorithms under study: - alternating least L / L - simulated annealing (Ceulemans, Van Mechelen & Leenen, submitted) - MCMC (Metropolis) (Leenen, Van Mechelen, Gelman, & De Knop, submitted)
59 algorithmic issues - parametrization of solution space (and partitioning of parameter vector, if applicable) (Schepers, Van Mechelen & Ceulemans, in press) - starts * number * nature: rational from data / rational from data + noise / random from data / purely random (Ceulemans, Van Mechelen & Leenen, submitted) - iterative process - choice metaparameters in simulated annealing and MCMC - greedy vs. branch and bound in conditional updating of alternating least L p algorithms (Leenen & Van Mechelen, )
60 Overview of the talk. data. models. context. hierarchical classes models.. basic model.. justification of the Max operator..3 the HICLS family 3. research topics 3. models 3. estimation 3.3 model selection and model checking 3.4 quantification of uncertainty 4. references
61 3. Research topics 3. Model selection and model checking issues - model type - rank - correct representation quasi-orders - assumptions about error process types of methods under study - scree-test based approaches (including extended convex hull methods (Leenen & Van Mechelen, ; Ceulemans & Van Mechelen, 5; see also Ceulemans & Kiers, in press) - pseudo IC approach (Ceulemans & Van Mechelen, 5) - Bayesian approaches with posterior predictive checks (Leenen, Van Mechelen, Gelman & De Knop, submitted)
62 Overview of the talk. data. models. context. hierarchical classes models.. basic model.. justification of the Max operator..3 the HICLS family 3. research topics 3. models 3. estimation 3.3 model selection and model checking 3.4 quantification of uncertainty 4. references
63 3. Research topics 3.4 Quantification of uncertainty issues: - telling apart strong and weak parts in obtained representations; confidence intervals - not only single parameters; also aspects derived from multiple parameters (e.g., classes, hierarchical relations) - confidence intervals in / case are tricky! types of methods under study - nonparametric Bootstrap approach - Bayesian approach based on simulated posterior (Leenen, Van Mechelen, Gelman & De Knop, submitted)
64 Overview of the talk. data. models. context. hierarchical classes models.. basic model.. justification of the Max operator..3 the HICLS family 3. research topics 3. models 3. estimation 3.3 model selection and model checking 3.4 quantification of uncertainty 4. references
65 4. References Ceulemans, E., & Kiers, H..L. (in press). Selecting among three-mode principal component models of different types and complexities: numerical convex hull based method. British Journal of Mathematical and Statistical sychology. Ceulemans, E., & Van Mechelen, I. (3). Uniqueness of N-way N- mode hierarchical classes models. Journal of Mathematical sychology, 47, Ceulemans, E., Van Mechelen, I., & Leenen, I. (3). Tucker3 hierarchical classes analysis. sychometrika, 68, Ceulemans, E., & Van Mechelen, I. (4). Tucker hierarchical classes analysis. sychometrika, 69, Ceulemans, E., & Van Mechelen, I. (5). Hierarchical classes models for three-way three-mode binary data: Interrelations and model selection. sychometrika, 7,
66 Ceulemans, E., Van Mechelen, I., & Leenen, I. (submitted). The local minima problem in hierarchical classes analysis: n evaluation of a simulated annealing algorithm and various multistart procedures. De Boeck,., & Rosenberg, S. (988). Hierarchical classes: Model and data analysis. sychometrika, 53, Leenen, I., Van Mechelen, I., De Boeck,., & Rosenberg, S. (999). INDCLS: Individual differences hierarchical classes analysis. sychometrika, 64, 9 4. Leenen, I., Van Mechelen, I., & De Boeck,. (999). generic disjunctive/conjunctive decomposition model for n-ary relations. Journal of Mathematical sychology, 43, -. Leenen, I., & Van Mechelen, I. (). n evaluation of two algorithms for hierarchical classes analysis. Journal of Classification, 8, Leenen, I., Van Mechelen, I., Gelman,., & De Knop, S. (submitted). Bayesian hierarchical classes analysis. Schepers, J., & Van Mechelen, I. (submitted). The real-valued model of hierarchical classes.
67 Schepers, J., Van Mechelen, I., & Ceulemans, E. (in press). Three-mode partitioning. Computational Statistics and Data nalysis. Van Mechelen, I., De Boeck,., & Rosenberg, S. (995). The conjunctive model of hierarchical classes. sychometrika, 6, Van Mechelen, I., Bock, H.-H., & De Boeck,. (4). Two-mode clustering methods: structured overview. Statistical Methods in Medical Research, 3, Van Mechelen, I., Lombardi, L., & Ceulemans, E. (conditionally accepted). Hierarchical classes modeling of rating data. sychometrika. Van Mechelen, I., & Schepers, J. (submitted). unifying model involving a categorical and/or dimensional reduction for multimode data.
T E C H N I C A L R E P O R T K-CENTROIDS HIERARCHICAL CLASSES ANALYSIS. LOMBARDI, L., CEULEMANS, E. and I. VAN MECHELEN
T E C H N I C A L R E P O R T 0352 K-CENTROIDS HIERARCHICAL CLASSES ANALYSIS LOMBARDI, L., CEULEMANS, E. and I. VAN MECHELEN * I A P S T A T I S T I C S N E T W O R K INTERUNIVERSITY ATTRACTION POLE http://www.stat.ucl.ac.be/iap
More informationBayesian Methods for Machine Learning
Bayesian Methods for Machine Learning CS 584: Big Data Analytics Material adapted from Radford Neal s tutorial (http://ftp.cs.utoronto.ca/pub/radford/bayes-tut.pdf), Zoubin Ghahramni (http://hunch.net/~coms-4771/zoubin_ghahramani_bayesian_learning.pdf),
More informationTo cite this article:
To cite this article: Meulders, M., & De Bruecker, P. (in press). Latent class probabilistic latent feature analysis of three-way three-mode binary data. Journal of Statistical Software. JSS Journal of
More informationMULTILEVEL IMPUTATION 1
MULTILEVEL IMPUTATION 1 Supplement B: MCMC Sampling Steps and Distributions for Two-Level Imputation This document gives technical details of the full conditional distributions used to draw regression
More informationExperimental Design and Data Analysis for Biologists
Experimental Design and Data Analysis for Biologists Gerry P. Quinn Monash University Michael J. Keough University of Melbourne CAMBRIDGE UNIVERSITY PRESS Contents Preface page xv I I Introduction 1 1.1
More informationPreface Introduction to Statistics and Data Analysis Overview: Statistical Inference, Samples, Populations, and Experimental Design The Role of
Preface Introduction to Statistics and Data Analysis Overview: Statistical Inference, Samples, Populations, and Experimental Design The Role of Probability Sampling Procedures Collection of Data Measures
More informationChemometrics and Intelligent Laboratory Systems
Chemometrics and Intelligent Laboratory Systems () Contents lists available at ScienceDirect Chemometrics and Intelligent Laboratory Systems journal homepage: www.elsevier.com/locate/chemolab On the added
More informationPattern Recognition and Machine Learning
Christopher M. Bishop Pattern Recognition and Machine Learning ÖSpri inger Contents Preface Mathematical notation Contents vii xi xiii 1 Introduction 1 1.1 Example: Polynomial Curve Fitting 4 1.2 Probability
More informationBAYESIAN DECISION THEORY
Last updated: September 17, 2012 BAYESIAN DECISION THEORY Problems 2 The following problems from the textbook are relevant: 2.1 2.9, 2.11, 2.17 For this week, please at least solve Problem 2.3. We will
More informationFinal Exam, Machine Learning, Spring 2009
Name: Andrew ID: Final Exam, 10701 Machine Learning, Spring 2009 - The exam is open-book, open-notes, no electronics other than calculators. - The maximum possible score on this exam is 100. You have 3
More informationData Mining and Matrices
Data Mining and Matrices 08 Boolean Matrix Factorization Rainer Gemulla, Pauli Miettinen June 13, 2013 Outline 1 Warm-Up 2 What is BMF 3 BMF vs. other three-letter abbreviations 4 Binary matrices, tiles,
More informationInquiry Calculus and the Issue of Negative Higher Order Informations
Article Inquiry Calculus and the Issue of Negative Higher Order Informations H. R. Noel van Erp, *, Ronald O. Linger and Pieter H. A. J. M. van Gelder,2 ID Safety and Security Science Group, TU Delft,
More informationData Mining and Matrices
Data Mining and Matrices 05 Semi-Discrete Decomposition Rainer Gemulla, Pauli Miettinen May 16, 2013 Outline 1 Hunting the Bump 2 Semi-Discrete Decomposition 3 The Algorithm 4 Applications SDD alone SVD
More informationIntroduction to Kleene Algebras
Introduction to Kleene Algebras Riccardo Pucella Basic Notions Seminar December 1, 2005 Introduction to Kleene Algebras p.1 Idempotent Semirings An idempotent semiring is a structure S = (S, +,, 1, 0)
More informationUNIVERSITY OF NORTH ALABAMA MA 110 FINITE MATHEMATICS
MA 110 FINITE MATHEMATICS Course Description. This course is intended to give an overview of topics in finite mathematics together with their applications and is taken primarily by students who are not
More informationApplications. Stephen J. Stoyan, Maged M. Dessouky*, and Xiaoqing Wang
Introduction to Large-Scale Linear Programming and Applications Stephen J. Stoyan, Maged M. Dessouky*, and Xiaoqing Wang Daniel J. Epstein Department of Industrial and Systems Engineering, University of
More informationInterpretable Latent Variable Models
Interpretable Latent Variable Models Fernando Perez-Cruz Bell Labs (Nokia) Department of Signal Theory and Communications, University Carlos III in Madrid 1 / 24 Outline 1 Introduction to Machine Learning
More informationMulticlass Classification-1
CS 446 Machine Learning Fall 2016 Oct 27, 2016 Multiclass Classification Professor: Dan Roth Scribe: C. Cheng Overview Binary to multiclass Multiclass SVM Constraint classification 1 Introduction Multiclass
More informationMultimodal Nested Sampling
Multimodal Nested Sampling Farhan Feroz Astrophysics Group, Cavendish Lab, Cambridge Inverse Problems & Cosmology Most obvious example: standard CMB data analysis pipeline But many others: object detection,
More informationBayesian Networks: Construction, Inference, Learning and Causal Interpretation. Volker Tresp Summer 2016
Bayesian Networks: Construction, Inference, Learning and Causal Interpretation Volker Tresp Summer 2016 1 Introduction So far we were mostly concerned with supervised learning: we predicted one or several
More informationSTA 4273H: Statistical Machine Learning
STA 4273H: Statistical Machine Learning Russ Salakhutdinov Department of Statistics! rsalakhu@utstat.toronto.edu! http://www.utstat.utoronto.ca/~rsalakhu/ Sidney Smith Hall, Room 6002 Lecture 7 Approximate
More informationDRAFT CONCEPTUAL SOLUTION REPORT DRAFT
BASIC STRUCTURAL MODELING PROJECT Joseph J. Simpson Mary J. Simpson 08-12-2013 DRAFT CONCEPTUAL SOLUTION REPORT DRAFT Version 0.11 Page 1 of 18 Table of Contents Introduction Conceptual Solution Context
More informationPerformance Evaluation and Comparison
Outline Hong Chang Institute of Computing Technology, Chinese Academy of Sciences Machine Learning Methods (Fall 2012) Outline Outline I 1 Introduction 2 Cross Validation and Resampling 3 Interval Estimation
More information9/12/17. Types of learning. Modeling data. Supervised learning: Classification. Supervised learning: Regression. Unsupervised learning: Clustering
Types of learning Modeling data Supervised: we know input and targets Goal is to learn a model that, given input data, accurately predicts target data Unsupervised: we know the input only and want to make
More informationProbabilistic Latent Semantic Analysis
Probabilistic Latent Semantic Analysis Seungjin Choi Department of Computer Science and Engineering Pohang University of Science and Technology 77 Cheongam-ro, Nam-gu, Pohang 37673, Korea seungjin@postech.ac.kr
More informationBOOLEAN TENSOR FACTORIZATIONS. Pauli Miettinen 14 December 2011
BOOLEAN TENSOR FACTORIZATIONS Pauli Miettinen 14 December 2011 BACKGROUND: TENSORS AND TENSOR FACTORIZATIONS X BACKGROUND: TENSORS AND TENSOR FACTORIZATIONS C X A B BACKGROUND: BOOLEAN MATRIX FACTORIZATIONS
More informationMachine Learning Ensemble Learning I Hamid R. Rabiee Jafar Muhammadi, Alireza Ghasemi Spring /
Machine Learning Ensemble Learning I Hamid R. Rabiee Jafar Muhammadi, Alireza Ghasemi Spring 2015 http://ce.sharif.edu/courses/93-94/2/ce717-1 / Agenda Combining Classifiers Empirical view Theoretical
More informationMatrix Factorizations over Non-Conventional Algebras for Data Mining. Pauli Miettinen 28 April 2015
Matrix Factorizations over Non-Conventional Algebras for Data Mining Pauli Miettinen 28 April 2015 Chapter 1. A Bit of Background Data long-haired well-known male Data long-haired well-known male ( ) 1
More informationRobust Principal Component Analysis
ELE 538B: Mathematics of High-Dimensional Data Robust Principal Component Analysis Yuxin Chen Princeton University, Fall 2018 Disentangling sparse and low-rank matrices Suppose we are given a matrix M
More informationMATHEMATICS. Units Topics Marks I Relations and Functions 10
MATHEMATICS Course Structure Units Topics Marks I Relations and Functions 10 II Algebra 13 III Calculus 44 IV Vectors and 3-D Geometry 17 V Linear Programming 6 VI Probability 10 Total 100 Course Syllabus
More informationPerceptual Mapping Based on Three-Way Binary Data
Perceptual Mapping Based on Three-Way Binary Data 00000 0000 000 B. Kemal Büyükkurt, Concordia University, Montreal, Canada & Pieter M. Kroonenberg, Leiden University, The Netherlands Perceptual Mapping
More informationOn (Weighted) k-order Fuzzy Connectives
Author manuscript, published in "IEEE Int. Conf. on Fuzzy Systems, Spain 2010" On Weighted -Order Fuzzy Connectives Hoel Le Capitaine and Carl Frélicot Mathematics, Image and Applications MIA Université
More informationPrerequisite: STATS 7 or STATS 8 or AP90 or (STATS 120A and STATS 120B and STATS 120C). AP90 with a minimum score of 3
University of California, Irvine 2017-2018 1 Statistics (STATS) Courses STATS 5. Seminar in Data Science. 1 Unit. An introduction to the field of Data Science; intended for entering freshman and transfers.
More informationFinal Overview. Introduction to ML. Marek Petrik 4/25/2017
Final Overview Introduction to ML Marek Petrik 4/25/2017 This Course: Introduction to Machine Learning Build a foundation for practice and research in ML Basic machine learning concepts: max likelihood,
More information1 Column Generation and the Cutting Stock Problem
1 Column Generation and the Cutting Stock Problem In the linear programming approach to the traveling salesman problem we used the cutting plane approach. The cutting plane approach is appropriate when
More informationFoundations of Nonparametric Bayesian Methods
1 / 27 Foundations of Nonparametric Bayesian Methods Part II: Models on the Simplex Peter Orbanz http://mlg.eng.cam.ac.uk/porbanz/npb-tutorial.html 2 / 27 Tutorial Overview Part I: Basics Part II: Models
More informationSPARSE signal representations have gained popularity in recent
6958 IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 57, NO. 10, OCTOBER 2011 Blind Compressed Sensing Sivan Gleichman and Yonina C. Eldar, Senior Member, IEEE Abstract The fundamental principle underlying
More informationSparsity Models. Tong Zhang. Rutgers University. T. Zhang (Rutgers) Sparsity Models 1 / 28
Sparsity Models Tong Zhang Rutgers University T. Zhang (Rutgers) Sparsity Models 1 / 28 Topics Standard sparse regression model algorithms: convex relaxation and greedy algorithm sparse recovery analysis:
More informationECE521 week 3: 23/26 January 2017
ECE521 week 3: 23/26 January 2017 Outline Probabilistic interpretation of linear regression - Maximum likelihood estimation (MLE) - Maximum a posteriori (MAP) estimation Bias-variance trade-off Linear
More informationSelected Solutions to Even Problems, Part 3
Selected Solutions to Even Problems, Part 3 March 14, 005 Page 77 6. If one selects 101 integers from among {1,,..., 00}, then at least two of these numbers must be consecutive, and therefore coprime (which
More informationIntroduction to Machine Learning
Introduction to Machine Learning Brown University CSCI 1950-F, Spring 2012 Prof. Erik Sudderth Lecture 25: Markov Chain Monte Carlo (MCMC) Course Review and Advanced Topics Many figures courtesy Kevin
More informationCheng Soon Ong & Christian Walder. Canberra February June 2018
Cheng Soon Ong & Christian Walder Research Group and College of Engineering and Computer Science Canberra February June 218 Outlines Overview Introduction Linear Algebra Probability Linear Regression 1
More informationThe Missing-Index Problem in Process Control
The Missing-Index Problem in Process Control Joseph G. Voelkel CQAS, KGCOE, RIT October 2009 Voelkel (RIT) Missing-Index Problem 10/09 1 / 35 Topics 1 Examples Introduction to the Problem 2 Models, Inference,
More informationA randomized block sampling approach to the canonical polyadic decomposition of large-scale tensors
A randomized block sampling approach to the canonical polyadic decomposition of large-scale tensors Nico Vervliet Joint work with Lieven De Lathauwer SIAM AN17, July 13, 2017 2 Classification of hazardous
More informationSpreading of Activation
Spreading of Activation Feed forward networks: no loops. Example u 2 u 4.5 1.5 1-1 3-1.5.5.5 1 u 5 1 u 1 u 3 Algebra, Logic and Formal Methods in Computer Science 1 Reflexive Reasoning Humans are capable
More informationLearning Decision Trees
Learning Decision Trees Machine Learning Spring 2018 1 This lecture: Learning Decision Trees 1. Representation: What are decision trees? 2. Algorithm: Learning decision trees The ID3 algorithm: A greedy
More informationBindel, Fall 2016 Matrix Computations (CS 6210) Notes for
1 A cautionary tale Notes for 2016-10-05 You have been dropped on a desert island with a laptop with a magic battery of infinite life, a MATLAB license, and a complete lack of knowledge of basic geometry.
More informationCS6375: Machine Learning Gautam Kunapuli. Decision Trees
Gautam Kunapuli Example: Restaurant Recommendation Example: Develop a model to recommend restaurants to users depending on their past dining experiences. Here, the features are cost (x ) and the user s
More informationLow-rank Promoting Transformations and Tensor Interpolation - Applications to Seismic Data Denoising
Low-rank Promoting Transformations and Tensor Interpolation - Applications to Seismic Data Denoising Curt Da Silva and Felix J. Herrmann 2 Dept. of Mathematics 2 Dept. of Earth and Ocean Sciences, University
More informationBagging During Markov Chain Monte Carlo for Smoother Predictions
Bagging During Markov Chain Monte Carlo for Smoother Predictions Herbert K. H. Lee University of California, Santa Cruz Abstract: Making good predictions from noisy data is a challenging problem. Methods
More informationCHAPTER 1. Relations. 1. Relations and Their Properties. Discussion
CHAPTER 1 Relations 1. Relations and Their Properties 1.1. Definition of a Relation. Definition 1.1.1. A binary relation from a set A to a set B is a subset R A B. If (a, b) R we say a is Related to b
More informationMotivation. CS389L: Automated Logical Reasoning. Lecture 10: Overview of First-Order Theories. Signature and Axioms of First-Order Theory
Motivation CS389L: Automated Logical Reasoning Lecture 10: Overview of First-Order Theories Işıl Dillig Last few lectures: Full first-order logic In FOL, functions/predicates are uninterpreted (i.e., structure
More informationPAC Generalization Bounds for Co-training
PAC Generalization Bounds for Co-training Sanjoy Dasgupta AT&T Labs Research dasgupta@research.att.com Michael L. Littman AT&T Labs Research mlittman@research.att.com David McAllester AT&T Labs Research
More informationStructured matrix factorizations. Example: Eigenfaces
Structured matrix factorizations Example: Eigenfaces An extremely large variety of interesting and important problems in machine learning can be formulated as: Given a matrix, find a matrix and a matrix
More informationClustering VS Classification
MCQ Clustering VS Classification 1. What is the relation between the distance between clusters and the corresponding class discriminability? a. proportional b. inversely-proportional c. no-relation Ans:
More informationMaster of Science in Statistics A Proposal
1 Master of Science in Statistics A Proposal Rationale of the Program In order to cope up with the emerging complexity on the solutions of realistic problems involving several phenomena of nature it is
More informationCPSC 340: Machine Learning and Data Mining. More PCA Fall 2017
CPSC 340: Machine Learning and Data Mining More PCA Fall 2017 Admin Assignment 4: Due Friday of next week. No class Monday due to holiday. There will be tutorials next week on MAP/PCA (except Monday).
More informationIDL Advanced Math & Stats Module
IDL Advanced Math & Stats Module Regression List of Routines and Functions Multiple Linear Regression IMSL_REGRESSORS IMSL_MULTIREGRESS IMSL_MULTIPREDICT Generates regressors for a general linear model.
More information1-bit Matrix Completion. PAC-Bayes and Variational Approximation
: PAC-Bayes and Variational Approximation (with P. Alquier) PhD Supervisor: N. Chopin Bayes In Paris, 5 January 2017 (Happy New Year!) Various Topics covered Matrix Completion PAC-Bayesian Estimation Variational
More informationDiscrete Mathematics. Benny George K. September 22, 2011
Discrete Mathematics Benny George K Department of Computer Science and Engineering Indian Institute of Technology Guwahati ben@iitg.ernet.in September 22, 2011 Set Theory Elementary Concepts Let A and
More informationMath 42, Discrete Mathematics
c Fall 2018 last updated 12/05/2018 at 15:47:21 For use by students in this class only; all rights reserved. Note: some prose & some tables are taken directly from Kenneth R. Rosen, and Its Applications,
More informationAn introduction to Bayesian reasoning in particle physics
An introduction to Bayesian reasoning in particle physics Graduiertenkolleg seminar, May 15th 2013 Overview Overview A concrete example Scientific reasoning Probability and the Bayesian interpretation
More informationLarge Scale Data Analysis Using Deep Learning
Large Scale Data Analysis Using Deep Learning Linear Algebra U Kang Seoul National University U Kang 1 In This Lecture Overview of linear algebra (but, not a comprehensive survey) Focused on the subset
More informationLeast Squares Optimization
Least Squares Optimization The following is a brief review of least squares optimization and constrained optimization techniques. Broadly, these techniques can be used in data analysis and visualization
More informationStructure in Data. A major objective in data analysis is to identify interesting features or structure in the data.
Structure in Data A major objective in data analysis is to identify interesting features or structure in the data. The graphical methods are very useful in discovering structure. There are basically two
More information17 : Markov Chain Monte Carlo
10-708: Probabilistic Graphical Models, Spring 2015 17 : Markov Chain Monte Carlo Lecturer: Eric P. Xing Scribes: Heran Lin, Bin Deng, Yun Huang 1 Review of Monte Carlo Methods 1.1 Overview Monte Carlo
More informationMultilevel Statistical Models: 3 rd edition, 2003 Contents
Multilevel Statistical Models: 3 rd edition, 2003 Contents Preface Acknowledgements Notation Two and three level models. A general classification notation and diagram Glossary Chapter 1 An introduction
More informationFirst-Order Logic First-Order Theories. Roopsha Samanta. Partly based on slides by Aaron Bradley and Isil Dillig
First-Order Logic First-Order Theories Roopsha Samanta Partly based on slides by Aaron Bradley and Isil Dillig Roadmap Review: propositional logic Syntax and semantics of first-order logic (FOL) Semantic
More informationAMS526: Numerical Analysis I (Numerical Linear Algebra)
AMS526: Numerical Analysis I (Numerical Linear Algebra) Lecture 3: Positive-Definite Systems; Cholesky Factorization Xiangmin Jiao Stony Brook University Xiangmin Jiao Numerical Analysis I 1 / 11 Symmetric
More informationRonald Christensen. University of New Mexico. Albuquerque, New Mexico. Wesley Johnson. University of California, Irvine. Irvine, California
Texts in Statistical Science Bayesian Ideas and Data Analysis An Introduction for Scientists and Statisticians Ronald Christensen University of New Mexico Albuquerque, New Mexico Wesley Johnson University
More informationPAC Learning. prof. dr Arno Siebes. Algorithmic Data Analysis Group Department of Information and Computing Sciences Universiteit Utrecht
PAC Learning prof. dr Arno Siebes Algorithmic Data Analysis Group Department of Information and Computing Sciences Universiteit Utrecht Recall: PAC Learning (Version 1) A hypothesis class H is PAC learnable
More informationIntroduction to Information Retrieval
Introduction to Information Retrieval http://informationretrieval.org IIR 18: Latent Semantic Indexing Hinrich Schütze Center for Information and Language Processing, University of Munich 2013-07-10 1/43
More informationLattices and dimensional representations: matrix decompositions and ordering structures
Social Networks 24 (2002) 423 444 Lattices and dimensional representations: matrix decompositions and ordering structures Philippa E. Pattison a,, Ronald L. Breiger b a Department of Psychology, University
More information* 8 Groups, with Appendix containing Rings and Fields.
* 8 Groups, with Appendix containing Rings and Fields Binary Operations Definition We say that is a binary operation on a set S if, and only if, a, b, a b S Implicit in this definition is the idea that
More informationSection 5.6. LU and LDU Factorizations
5.6. LU and LDU Factorizations Section 5.6. LU and LDU Factorizations Note. We largely follow Fraleigh and Beauregard s approach to this topic from Linear Algebra, 3rd Edition, Addison-Wesley (995). See
More informationLMI MODELLING 4. CONVEX LMI MODELLING. Didier HENRION. LAAS-CNRS Toulouse, FR Czech Tech Univ Prague, CZ. Universidad de Valladolid, SP March 2009
LMI MODELLING 4. CONVEX LMI MODELLING Didier HENRION LAAS-CNRS Toulouse, FR Czech Tech Univ Prague, CZ Universidad de Valladolid, SP March 2009 Minors A minor of a matrix F is the determinant of a submatrix
More informationThree-group ROC predictive analysis for ordinal outcomes
Three-group ROC predictive analysis for ordinal outcomes Tahani Coolen-Maturi Durham University Business School Durham University, UK tahani.maturi@durham.ac.uk June 26, 2016 Abstract Measuring the accuracy
More informationUncertainty quantification for inverse problems with a weak wave-equation constraint
Uncertainty quantification for inverse problems with a weak wave-equation constraint Zhilong Fang*, Curt Da Silva*, Rachel Kuske** and Felix J. Herrmann* *Seismic Laboratory for Imaging and Modeling (SLIM),
More informationDecision Trees Part 1. Rao Vemuri University of California, Davis
Decision Trees Part 1 Rao Vemuri University of California, Davis Overview What is a Decision Tree Sample Decision Trees How to Construct a Decision Tree Problems with Decision Trees Classification Vs Regression
More informationMeasurement error as missing data: the case of epidemiologic assays. Roderick J. Little
Measurement error as missing data: the case of epidemiologic assays Roderick J. Little Outline Discuss two related calibration topics where classical methods are deficient (A) Limit of quantification methods
More informationLearning Decision Trees
Learning Decision Trees Machine Learning Fall 2018 Some slides from Tom Mitchell, Dan Roth and others 1 Key issues in machine learning Modeling How to formulate your problem as a machine learning problem?
More informationDimension Reduction and Iterative Consensus Clustering
Dimension Reduction and Iterative Consensus Clustering Southeastern Clustering and Ranking Workshop August 24, 2009 Dimension Reduction and Iterative 1 Document Clustering Geometry of the SVD Centered
More informationBayesian Nonparametric Regression for Diabetes Deaths
Bayesian Nonparametric Regression for Diabetes Deaths Brian M. Hartman PhD Student, 2010 Texas A&M University College Station, TX, USA David B. Dahl Assistant Professor Texas A&M University College Station,
More informationSubject CS1 Actuarial Statistics 1 Core Principles
Institute of Actuaries of India Subject CS1 Actuarial Statistics 1 Core Principles For 2019 Examinations Aim The aim of the Actuarial Statistics 1 subject is to provide a grounding in mathematical and
More informationThe Comparison of Stochastic and Deterministic DEA Models
The International Scientific Conference INPROFORUM 2015, November 5-6, 2015, České Budějovice, 140-145, ISBN 978-80-7394-536-7. The Comparison of Stochastic and Deterministic DEA Models Michal Houda, Jana
More informationMathematical Formulation of Our Example
Mathematical Formulation of Our Example We define two binary random variables: open and, where is light on or light off. Our question is: What is? Computer Vision 1 Combining Evidence Suppose our robot
More informationBayesian Nonparametrics for Speech and Signal Processing
Bayesian Nonparametrics for Speech and Signal Processing Michael I. Jordan University of California, Berkeley June 28, 2011 Acknowledgments: Emily Fox, Erik Sudderth, Yee Whye Teh, and Romain Thibaux Computer
More informationMATH 433 Applied Algebra Lecture 22: Review for Exam 2.
MATH 433 Applied Algebra Lecture 22: Review for Exam 2. Topics for Exam 2 Permutations Cycles, transpositions Cycle decomposition of a permutation Order of a permutation Sign of a permutation Symmetric
More informationDigitalteknik EIT020. Lecture 10: Karnaugh Maps
Digitalteknik EIT020 Lecture 10: Karnaugh Maps September 30, 2014 Idea of minimisation Theorem There exists a prime cover that is a minimal cover. Minimal prime cover Find all prime implicants. Find the
More informationFactor Analysis (10/2/13)
STA561: Probabilistic machine learning Factor Analysis (10/2/13) Lecturer: Barbara Engelhardt Scribes: Li Zhu, Fan Li, Ni Guan Factor Analysis Factor analysis is related to the mixture models we have studied.
More informationCh 4. Linear Models for Classification
Ch 4. Linear Models for Classification Pattern Recognition and Machine Learning, C. M. Bishop, 2006. Department of Computer Science and Engineering Pohang University of Science and echnology 77 Cheongam-ro,
More informationIntrinsic products and factorizations of matrices
Available online at www.sciencedirect.com Linear Algebra and its Applications 428 (2008) 5 3 www.elsevier.com/locate/laa Intrinsic products and factorizations of matrices Miroslav Fiedler Academy of Sciences
More informationClick Prediction and Preference Ranking of RSS Feeds
Click Prediction and Preference Ranking of RSS Feeds 1 Introduction December 11, 2009 Steven Wu RSS (Really Simple Syndication) is a family of data formats used to publish frequently updated works. RSS
More informationStat 5101 Lecture Notes
Stat 5101 Lecture Notes Charles J. Geyer Copyright 1998, 1999, 2000, 2001 by Charles J. Geyer May 7, 2001 ii Stat 5101 (Geyer) Course Notes Contents 1 Random Variables and Change of Variables 1 1.1 Random
More informationAssignment 1: From the Definition of Convexity to Helley Theorem
Assignment 1: From the Definition of Convexity to Helley Theorem Exercise 1 Mark in the following list the sets which are convex: 1. {x R 2 : x 1 + i 2 x 2 1, i = 1,..., 10} 2. {x R 2 : x 2 1 + 2ix 1x
More informationLecturer: David Blei Lecture #3 Scribes: Jordan Boyd-Graber and Francisco Pereira October 1, 2007
COS 597C: Bayesian Nonparametrics Lecturer: David Blei Lecture # Scribes: Jordan Boyd-Graber and Francisco Pereira October, 7 Gibbs Sampling with a DP First, let s recapitulate the model that we re using.
More informationCS 486: Applied Logic Lecture 7, February 11, Compactness. 7.1 Compactness why?
CS 486: Applied Logic Lecture 7, February 11, 2003 7 Compactness 7.1 Compactness why? So far, we have applied the tableau method to propositional formulas and proved that this method is sufficient and
More informationMath 671: Tensor Train decomposition methods
Math 671: Eduardo Corona 1 1 University of Michigan at Ann Arbor December 8, 2016 Table of Contents 1 Preliminaries and goal 2 Unfolding matrices for tensorized arrays The Tensor Train decomposition 3
More informationMatrix Decomposition in Privacy-Preserving Data Mining JUN ZHANG DEPARTMENT OF COMPUTER SCIENCE UNIVERSITY OF KENTUCKY
Matrix Decomposition in Privacy-Preserving Data Mining JUN ZHANG DEPARTMENT OF COMPUTER SCIENCE UNIVERSITY OF KENTUCKY OUTLINE Why We Need Matrix Decomposition SVD (Singular Value Decomposition) NMF (Nonnegative
More informationGraphical Models for Collaborative Filtering
Graphical Models for Collaborative Filtering Le Song Machine Learning II: Advanced Topics CSE 8803ML, Spring 2012 Sequence modeling HMM, Kalman Filter, etc.: Similarity: the same graphical model topology,
More information