Maximization of Multi - Information
|
|
- Nathaniel Stevenson
- 6 years ago
- Views:
Transcription
1 Maximization of Multi - Information Week of Doctoral Students 2007 Jozef Juríček Academy of Sciences of the Czech Republic Created by Image To PDF trial version, Institute to remove of Information this Theory mark, and please Automation register this soft Supervisor: Ing. František Matúš, CSc. Created by Image To PDF trial ver Prague, 05 th June 2007 Charles University in Prague Faculty of Mathematics and Physics Department of Probability and Mathematical Statistics
2 01 17 Contents Introduction Informally Formally Recent Results Dimensionality Set of Maximizers Exponential Families and Maximizers Examples and Special Cases The Case of N Equal Units The Case of 2 Units The Case of N Units
3 01 05 Introduction Contents Introduction Informally Formally Recent Results Dimensionality Set of Maximizers Exponential Families and Maximizers Examples and Special Cases The Case of N Equal Units The Case of 2 Units The Case of N Units
4 01 05 Introduction Informally 01/17 Interests of Study interdependence of stochastic units (random variables) - based on information theory the probability measures (pm) that represents maximal interdependence geometric interpretation of the problem and its solution description of the structure of maximizers
5 01 05 Introduction Formally 02/17 Notations = Ω finite set P(Ω) := {p = (p(ω)) ω Ω R Ω : ω Ω; p(ω) 0 & ω Ω p(ω) = 1} supp(p) := {ω Ω : p(ω) > 0} Kullback - Leibler divergence D : P(Ω) P(Ω) [0, ] defined as { p(ω) (p, q) D(p q) := ω supp(p) p(ω) ln, if supp(p) supp(q); q(ω), otherwise for E P(Ω) let us define D E : P(Ω) R + as p D E(p) := inf q E D(p q)
6 01 05 Introduction Formally 03/17 Notations Denote V := [N] := {1,..., N} the set of N 2 stochastic units (random variables X 1,..., X N ) for i [N]: Ω i the set of configurations of a unit i and Ω i =: n i for A [N]: Ω A := i A Ω i for p = p V P(Ω V ): p A denotes marginal pm and p i := p {i} P(Ω V ) := {p P(Ω V ) : supp(p) = Ω V } the set of factorizable pm F := F(Ω V ) := {p P(Ω V ) : p(ω 1,..., ω n) = p 1 (ω 1 )... p N (ω n); (ω 1,..., ω N ) Ω V } the set of strictly positive factorizable pm F := F P(Ω V )
7 01 05 Introduction Formally 04/17 Exponential Families define the function Exp : R Ω P(Ω) as ( R Ω (X(ω)) ω Ω = X Exp(X) = (Exp(X)(ω)) ω Ω := e X(ω) ω Ω ex(ω ) exponential family in P(Ω) is the image Exp(T ), where T R Ω is some linear (or affine) subspace of R Ω F is an exponential family in P(Ω) hierarchy of exponential families based on interactions for k [N]: I (k) := {f R Ω V : f is constant at least in N k variables} Ĩ (k) := I (k) (I (k 1) ) w.r.t. scalar product f, g := ω Ω V f (ω)g(ω); orthogonal complement is in R Ω V Exp(I (0) )... Exp(I (N) ) Exp(I (0) ) is center of ( i [N] n i 1) - dimensional simplex (uniform distribution on Ω V ) Exp(I (1) ) = F with dimension i [N] (n i 1) ) ω Ω
8 01 05 Introduction Formally 05/17 Multi - Information... Measure of Interdependence define I(p) := I p(x 1,..., X N ) := D F(p) the question is behavior of the I(p) as a function of pm p on P(Ω V ) interest in global and local maximizers of I(p) how many of them? any special structure of the set of maximizers? I(p) = I p(x 1,..., X n) = N i=1 Hp i (X i) H p(x 1,..., X N ), where marginal entropy H pi (X i ) = ω i Ω i p i (ω i ) ln p i (ω i ) global entropy H p(x 1,... X N ) = ω Ω V p(ω) ln p(ω) I(p) N 1 i=1 ln(n i) M(Ω V ) := M(n 1,..., n N ) := {p P(Ω V ) : I(p) = N 1 i=1 ln(n i)} for which cases of Ω V is M(Ω V )? (maximal global maximizers)
9 06 09 Recent Results Contents Introduction Informally Formally Recent Results Dimensionality Set of Maximizers Exponential Families and Maximizers Examples and Special Cases The Case of N Equal Units The Case of 2 Units The Case of N Units
10 06 09 Recent Results Dimensionality 06/17 Dimensions of Maximizers Theorem (Dimension of Maximizers of the Distance From an Arbitrary Exponential Family [AyKn05], [MaAy04]) Let E be an exponential family in P(Ω) with dim E = d. Then there exists an exponential family E ; E E with dim E = 3d + 2 such that cl(e ) contains all local maximizers of D E. By now let us assume (wlog) 2 n 1... n N. Corollary (Multi - Information Maximizers Dimension [AyKn05]) There exists an exponential family F in P(Ω V ) with dim F 3 (n i 1) + 2 3N(n N 1) + 2 i [N] such that cl(f ) contains all local maximizers of I(p). In the binary case, i [N] : n i = 2; dim F 3N + 2.
11 06 09 Recent Results Set of Maximizers 07/17 General Theorems About M(Ω V ) Theorems (The Essential Theorems About M(Ω V ) [AyKn05]) (1) Let p be a pm on Ω V. Then p M(Ω V ) if and only if there exist a pm p (N) P(Ω N ) and the functions π i : Ω N onto Ω i ; i [N 1] with p (N) [π i (ω N ) = ω i ] = 1 n i ( ω i Ω i ) (uniform distributions on 1-dimensional margins Ω 1,..., Ω N 1 ) and (ω 1,..., ω N ) Ω V : { p (N) (ω p(ω 1,..., ω N ) = N ), if ω i = π i (ω N ), i [N 1]; 0, otherwise (2) M(Ω V ) if and only if n N n min := A [N 1] ( 1) A 1 GCD((n i ) i A ) (GCD is greatest common divisor, LCM will be least common multiple)
12 06 09 Recent Results Set of Maximizers 08/17 Remarks on M(Ω V ) Remarks (Remarks on the Essential Theorems About M(Ω V ) [AyKn05]) (1) M(Ω V ) if (1a) N = 2 (1b) n 1 =... n N =: n (2) n N 1 n min 1 + i [N 1] (n i 1) and (2a) n min = n N 1 n N 1 = LCM((n i ) i [N 1] ) (2b) n min = 1 + i [N 1] (n i 1) LCM((n i ) i [N 1] ) = 1 (2c) n min LCM((n i ) i [N] ) (3) maximizers p M(Ω V ) simultaneously maximize the mutual information of the pairs (i, N) of units (i [N]) (3a) in the case of LCM((n i ) i [N 1] ) = n N maximizers p M(Ω V ) simultaneously maximize the mutual information of all pairs (i, j) of units (i, j [N])
13 06 09 Recent Results Exponential Families and Maximizers 09/17 Sufficiency of Low - Order Interactions for M(Ω V ) Theorem (The Lowest Order of Sufficient Interaction for M(Ω V ) is 2 [AyKn05]) There exists an exponential family F Exp(Ĩ(2) ) with dimf = (n N 1) i [N 1] (n i 1) such that M(Ω V ) cl(f ).
14 10 17 Examples and Special Cases Contents Introduction Informally Formally Recent Results Dimensionality Set of Maximizers Exponential Families and Maximizers Examples and Special Cases The Case of N Equal Units The Case of 2 Units The Case of N Units
15 10 17 Examples and Special Cases The Case of N Equal Units 10/17 (n,..., n) for Ω i = n; i [N]: M(Ω V ) = { 1 n ω N Ω δ (π1 (ω N N ),...,π N 1 (ω N ),ω N ) : π i : Ω N 1 1 Ω i ; i [N 1]} M(Ω V ) = (n!) N 1 p M(Ω V ); I(p) = (N 1) ln(n) there exists exponential family with dimension less than or equal to 3N(n 1) + 2 that contains M(Ω V ) in its closure there exists exponential family with dimension less than or equal to n2 +3n 2 that contains M(Ω V ) in its closure there exists exponential family F Exp(Ĩ(2) ) with dimension less than or equal to (N 1)(n 1) 2 such that M(Ω V ) cl(f )
16 10 17 Examples and Special Cases The Case of N Equal Units 11/17 (2, 2) In the case of two binary units, we have: M(Ω V ) = { 1 2 (δ (0,0) + δ (1,1) ), 1 2 (δ (1,0) + δ (0,1) )} P({0, 1} 2 ) Exp(Ĩ(2) ), dim(exp(ĩ(2) )) = 1 F = Exp(Ĩ(2) ) = {λ 1 2 (δ (0,0) + δ (1,1) ) + (1 λ) 1 2 (δ (1,0) + δ (0,1) ) : 0 < λ < 1}. Let us see the situation on the pictures 1 : 1 the most of the right picture is grabbed from [AyKn05]
17 10 17 Examples and Special Cases The Case of 2 Units 12/17 (n1, n2) M(n 1, n 2 ) = {p P(Ω 1 Ω 2 ) : I(p) = ln(n 1 )} Let Ω 1 := Ω 1 {0}, S := {π : Ω 2 Ω 1 : π(ω 2 ) Ω 1 }, define relation σ π σ 1 (ω 1 ) π 1 (ω 1 ); ω 1 Ω 1 is a partial order(reflexivity, anti-symmetry, transitivity) and makes S a poset and induces a cover graph of S, see the picture of cover graph (so-called Hasse diagram) and the structure of M(2, 3): cover graph of S and the set M(n 1, n 2 ) are connected if and only if n 1 < n 2
18 10 17 Examples and Special Cases The Case of 2 Units 13/17 (n 1, n 2 ) for a given π S we consider M π(ω 1, Ω 2 ) := {p P(Ω 1 Ω 2 ) : ω 1 Ω 1 : ω 2 π 1 (ω 1 ) p(ω 1, ω 2 ) = 1 n 1 and p(ω 1, ω 2 ) > 0 iff π(ω 2 ) = ω 1 } notice the picture: there must be at least one unit in each row (onto) and mostly one unit in each column (function) the set of global maximizers of the mutual information is a disjoint union of relatively open faces M π(ω 1, Ω 2 ) with dim M π(ω 1, Ω 2 ) = π 1 (Ω 1 ) Ω 1 : M(Ω 1, Ω 2 ) = π S Mπ(Ω 1, Ω 2 ) there are Fm n 1 := n 1! ( ) n 2 m Sm,n1 faces M π(ω 1, Ω 2 ) with dimension n1 i=0 ( 1)i( ) n 1 (n1 i i) m denotes Stirling m n 1, where S m,n1 := 1 n 1! number of the second kind and m = n 1,..., n 2 there are F n1,n2 := n 2 m=n 1 F m n 1 vertexes of cover graph (naturally different affine spaces/exponential families which in their closures contains maximizers) there exist an exponential family F with dim F = 2 such that cl(f ) M(2, 3)
19 10 17 Examples and Special Cases The Case of 2 Units 14/17 (2, n 2 ) let us consider number of vertexes F 2,n2 of the cover graph of S define the (code) function C : S {0, 1, 2} n 2 as this picture denotes: C(π) will be called code of π F 2,n2 = S = [all codes] [codes that do not contain 1 or 2] = (by De Morgan) = [all] [1 but not 2] [2 but not 1] [not 1 nor 2] formally: F 2,n = 3 n 2(2 n 1) 1 = 3 n 2 2 n + 1 n hence, we can represent the problem as a homogenous difference equation with characteristic polynomial P 2,n (λ) = (λ 3)(λ 2)(λ 1) = λ 3 6λ λ 6 Corresponding difference equation is a n+3 = 6a n a n+1 6a n with given values a 2 = F 2,2 = 2, a 3 = F 2,3 = 12, a 4 = F 2,4 = 50 there exists exponential family F with dim F = n 2 1 such that cl(f ) M(2, n 2 )
20 10 17 Examples and Special Cases The Case of N Units 15/17 (2, 2, n 3 ) Analogously, we define F 2,2,n3 as a number of naturally different exponential families which closures contain M(2, 2, n 3 ) we notice that the code of onto mappings contain at least on of pairs (1, 4), (2, 3) and we will see Created by Image To PDF trial version, to remove this mark, please register this F 2,2,n = 5 n 4 3 n n 1 P 2,2,n (λ) = (λ 5)P 2,n = (λ 5)(λ 3)(λ 2)(λ 1) with a 2 = 4, a 3 = F 2,2,3, a 4 = F 2,2,4, a 5 = F 2,2,5 dim F = 2(n 3 1)
21 10 17 Examples and Special Cases The Case of N Units 16/17 (2, 2, 2, n 4 ) F 2,2,2,n = 9 n 2 5 n 10 4 n n 6 2 n 1 P 2,2,2n (λ) = (λ 9)(λ 4)P 2,2,n = (λ 9)(λ 5)(λ 4)P 2,n (λ) dim F = 3(n 4 1)
22 10 17 Examples and Special Cases The Case of N Units 17/17 Questions for the Following Studies is ad-hoc searching of the difference equations representation useful/necessary? is there any (reasonable) mapping between the number of surjective mappings and the space of natural polynomials? is there any connection with the maximizers on the spaces of continuous distributions through the characteristic polynomial representation (difference to differential equations)?
23 Goodbye Bibliography Thank you for attention! This file is available on Ay, N., Knauf, A. (2005): Maximizing Multi-Information. Kybernetika. Matúš, F., Ay, N. (2004): On maximization of the information divergence from an exponential family. WUPES 03. University of Economics Prague. pp
Maximization of the information divergence from the multinomial distributions 1
aximization of the information divergence from the multinomial distributions Jozef Juríček Charles University in Prague Faculty of athematics and Physics Department of Probability and athematical Statistics
More informationThe Theory behind PageRank
The Theory behind PageRank Mauro Sozio Telecom ParisTech May 21, 2014 Mauro Sozio (LTCI TPT) The Theory behind PageRank May 21, 2014 1 / 19 A Crash Course on Discrete Probability Events and Probability
More informationMath 3C Lecture 25. John Douglas Moore
Math 3C Lecture 25 John Douglas Moore June 1, 2009 Let V be a vector space. A basis for V is a collection of vectors {v 1,..., v k } such that 1. V = Span{v 1,..., v k }, and 2. {v 1,..., v k } are linearly
More informationSets and Motivation for Boolean algebra
SET THEORY Basic concepts Notations Subset Algebra of sets The power set Ordered pairs and Cartesian product Relations on sets Types of relations and their properties Relational matrix and the graph of
More informationChapter 1: Introduction to Probability Theory
ECE5: Stochastic Signals and Systems Fall 8 Lecture - September 6, 8 Prof. Salim El Rouayheb Scribe: Peiwen Tian, Lu Liu, Ghadir Ayache Chapter : Introduction to Probability Theory Axioms of Probability
More informationSeries 7, May 22, 2018 (EM Convergence)
Exercises Introduction to Machine Learning SS 2018 Series 7, May 22, 2018 (EM Convergence) Institute for Machine Learning Dept. of Computer Science, ETH Zürich Prof. Dr. Andreas Krause Web: https://las.inf.ethz.ch/teaching/introml-s18
More informationLecture 5 - Information theory
Lecture 5 - Information theory Jan Bouda FI MU May 18, 2012 Jan Bouda (FI MU) Lecture 5 - Information theory May 18, 2012 1 / 42 Part I Uncertainty and entropy Jan Bouda (FI MU) Lecture 5 - Information
More informationLecture Notes on DISCRETE MATHEMATICS. Eusebius Doedel
Lecture Notes on DISCRETE MATHEMATICS Eusebius Doedel c Eusebius J. Doedel, 009 Contents Logic. Introduction............................................................................... Basic logical
More informationRandom Variables. Definition: A random variable (r.v.) X on the probability space (Ω, F, P) is a mapping
Random Variables Example: We roll a fair die 6 times. Suppose we are interested in the number of 5 s in the 6 rolls. Let X = number of 5 s. Then X could be 0, 1, 2, 3, 4, 5, 6. X = 0 corresponds to the
More informationTutorial Obtain the principal disjunctive normal form and principal conjunction form of the statement
Tutorial - 1 1. Obtain the principal disjunctive normal form and principal conjunction form of the statement Let S P P Q Q R P P Q Q R A: P Q Q R P Q R P Q Q R Q Q R A S Minterm Maxterm T T T F F T T T
More informationNotation Index. gcd(a, b) (greatest common divisor) NT-16
Notation Index (for all) B A (all functions) B A = B A (all functions) SF-18 (n) k (falling factorial) SF-9 a R b (binary relation) C(n,k) = n! k! (n k)! (binomial coefficient) SF-9 n! (n factorial) SF-9
More informationfür Mathematik in den Naturwissenschaften Leipzig
ŠܹÈÐ Ò ¹ÁÒ Ø ØÙØ für Mathematik in den Naturwissenschaften Leipzig Finding the Maximizers of the Information Divergence from an Exponential Family by Johannes Rauh Preprint no.: 82 2009 Finding the
More information2MA105 Algebraic Structures I
2MA105 Algebraic Structures I Per-Anders Svensson http://homepage.lnu.se/staff/psvmsi/2ma105.html Lecture 12 Partially Ordered Sets Lattices Bounded Lattices Distributive Lattices Complemented Lattices
More informationIntroduction to Information Entropy Adapted from Papoulis (1991)
Introduction to Information Entropy Adapted from Papoulis (1991) Federico Lombardo Papoulis, A., Probability, Random Variables and Stochastic Processes, 3rd edition, McGraw ill, 1991. 1 1. INTRODUCTION
More informationUnless otherwise specified, V denotes an arbitrary finite-dimensional vector space.
MAT 90 // 0 points Exam Solutions Unless otherwise specified, V denotes an arbitrary finite-dimensional vector space..(0) Prove: a central arrangement A in V is essential if and only if the dual projective
More informationAlgebraic matroids are almost entropic
accepted to Proceedings of the AMS June 28, 2017 Algebraic matroids are almost entropic František Matúš Abstract. Algebraic matroids capture properties of the algebraic dependence among elements of extension
More informationBayes spaces: use of improper priors and distances between densities
Bayes spaces: use of improper priors and distances between densities J. J. Egozcue 1, V. Pawlowsky-Glahn 2, R. Tolosana-Delgado 1, M. I. Ortego 1 and G. van den Boogaart 3 1 Universidad Politécnica de
More informationA PARAMETRIC MODEL FOR DISCRETE-VALUED TIME SERIES. 1. Introduction
tm Tatra Mt. Math. Publ. 00 (XXXX), 1 10 A PARAMETRIC MODEL FOR DISCRETE-VALUED TIME SERIES Martin Janžura and Lucie Fialová ABSTRACT. A parametric model for statistical analysis of Markov chains type
More informationMath 1553, Introduction to Linear Algebra
Learning goals articulate what students are expected to be able to do in a course that can be measured. This course has course-level learning goals that pertain to the entire course, and section-level
More informationCONVENIENT PRETOPOLOGIES ON Z 2
TWMS J. Pure Appl. Math., V.9, N.1, 2018, pp.40-51 CONVENIENT PRETOPOLOGIES ON Z 2 J. ŠLAPAL1 Abstract. We deal with pretopologies on the digital plane Z 2 convenient for studying and processing digital
More information0 Sets and Induction. Sets
0 Sets and Induction Sets A set is an unordered collection of objects, called elements or members of the set. A set is said to contain its elements. We write a A to denote that a is an element of the set
More informationLocal properties of plane algebraic curves
Chapter 7 Local properties of plane algebraic curves Throughout this chapter let K be an algebraically closed field of characteristic zero, and as usual let A (K) be embedded into P (K) by identifying
More informationSets. We discuss an informal (naive) set theory as needed in Computer Science. It was introduced by G. Cantor in the second half of the nineteenth
Sets We discuss an informal (naive) set theory as needed in Computer Science. It was introduced by G. Cantor in the second half of the nineteenth century. Most students have seen sets before. This is intended
More informationLecture Notes 1 Probability and Random Variables. Conditional Probability and Independence. Functions of a Random Variable
Lecture Notes 1 Probability and Random Variables Probability Spaces Conditional Probability and Independence Random Variables Functions of a Random Variable Generation of a Random Variable Jointly Distributed
More informationLECTURE 10: THE ATIYAH-GUILLEMIN-STERNBERG CONVEXITY THEOREM
LECTURE 10: THE ATIYAH-GUILLEMIN-STERNBERG CONVEXITY THEOREM Contents 1. The Atiyah-Guillemin-Sternberg Convexity Theorem 1 2. Proof of the Atiyah-Guillemin-Sternberg Convexity theorem 3 3. Morse theory
More informationLecture Notes 1 Basic Concepts of Mathematics MATH 352
Lecture Notes 1 Basic Concepts of Mathematics MATH 352 Ivan Avramidi New Mexico Institute of Mining and Technology Socorro, NM 87801 June 3, 2004 Author: Ivan Avramidi; File: absmath.tex; Date: June 11,
More informationSet, functions and Euclidean space. Seungjin Han
Set, functions and Euclidean space Seungjin Han September, 2018 1 Some Basics LOGIC A is necessary for B : If B holds, then A holds. B A A B is the contraposition of B A. A is sufficient for B: If A holds,
More informationNotation. Pattern Recognition II. Michal Haindl. Outline - PR Basic Concepts. Pattern Recognition Notions
Notation S pattern space X feature vector X = [x 1,...,x l ] l = dim{x} number of features X feature space K number of classes ω i class indicator Ω = {ω 1,...,ω K } g(x) discriminant function H decision
More informationSet Theory. Pattern Recognition III. Michal Haindl. Set Operations. Outline
Set Theory A, B sets e.g. A = {ζ 1,...,ζ n } A = { c x y d} S space (universe) A,B S Outline Pattern Recognition III Michal Haindl Faculty of Information Technology, KTI Czech Technical University in Prague
More informationCOUNTING INTEGER POINTS IN POLYTOPES ASSOCIATED WITH DIRECTED GRAPHS. Ilse Fischer
COUNTING INTEGER POINTS IN POLYTOPES ASSOCIATED WITH DIRECTED GRAPHS Ilse Fischer Fakultät für Mathematik, Universität Wien Oskar-Morgenstern-Platz 1, 1090 Wien, Austria ilse.fischer@univie.ac.at Tel:
More information2018 Fall 2210Q Section 013 Midterm Exam II Solution
08 Fall 0Q Section 0 Midterm Exam II Solution True or False questions points 0 0 points) ) Let A be an n n matrix. If the equation Ax b has at least one solution for each b R n, then the solution is unique
More information5 Mutual Information and Channel Capacity
5 Mutual Information and Channel Capacity In Section 2, we have seen the use of a quantity called entropy to measure the amount of randomness in a random variable. In this section, we introduce several
More informationLecture 6: Gaussian Channels. Copyright G. Caire (Sample Lectures) 157
Lecture 6: Gaussian Channels Copyright G. Caire (Sample Lectures) 157 Differential entropy (1) Definition 18. The (joint) differential entropy of a continuous random vector X n p X n(x) over R is: Z h(x
More informationSystems of Linear Equations
Systems of Linear Equations Math 108A: August 21, 2008 John Douglas Moore Our goal in these notes is to explain a few facts regarding linear systems of equations not included in the first few chapters
More informationLECTURE 25-26: CARTAN S THEOREM OF MAXIMAL TORI. 1. Maximal Tori
LECTURE 25-26: CARTAN S THEOREM OF MAXIMAL TORI 1. Maximal Tori By a torus we mean a compact connected abelian Lie group, so a torus is a Lie group that is isomorphic to T n = R n /Z n. Definition 1.1.
More informationProof: The coding of T (x) is the left shift of the coding of x. φ(t x) n = L if T n+1 (x) L
Lecture 24: Defn: Topological conjugacy: Given Z + d (resp, Zd ), actions T, S a topological conjugacy from T to S is a homeomorphism φ : M N s.t. φ T = S φ i.e., φ T n = S n φ for all n Z + d (resp, Zd
More informationApplication of Information Theory, Lecture 7. Relative Entropy. Handout Mode. Iftach Haitner. Tel Aviv University.
Application of Information Theory, Lecture 7 Relative Entropy Handout Mode Iftach Haitner Tel Aviv University. December 1, 2015 Iftach Haitner (TAU) Application of Information Theory, Lecture 7 December
More informationChapter 1. Sets and probability. 1.3 Probability space
Random processes - Chapter 1. Sets and probability 1 Random processes Chapter 1. Sets and probability 1.3 Probability space 1.3 Probability space Random processes - Chapter 1. Sets and probability 2 Probability
More informationIs g one-to-one? Is g onto? Why? Solution: g is not one-to-one, since for c A, g(b) = g(c) = c. g is not onto, since a / g(a).
Discrete Structures: Exam 2 Solutions to Sample Questions, 1. Let A = B = {a, b, c}. Consider the relation g = {(a, b), (b, c), (c, c)}. Is g one-to-one? Is g onto? Why? Solution: g is not one-to-one,
More informationLecture 2: August 31
0-704: Information Processing and Learning Fall 206 Lecturer: Aarti Singh Lecture 2: August 3 Note: These notes are based on scribed notes from Spring5 offering of this course. LaTeX template courtesy
More informationDo not open this exam until you are told to begin. You will have 75 minutes for the exam.
Math 2603 Midterm 1 Spring 2018 Your Name Student ID # Section Do not open this exam until you are told to begin. You will have 75 minutes for the exam. Check that you have a complete exam. There are 5
More informationPart IA Probability. Definitions. Based on lectures by R. Weber Notes taken by Dexter Chua. Lent 2015
Part IA Probability Definitions Based on lectures by R. Weber Notes taken by Dexter Chua Lent 2015 These notes are not endorsed by the lecturers, and I have modified them (often significantly) after lectures.
More informationLecture Notes 1 Probability and Random Variables. Conditional Probability and Independence. Functions of a Random Variable
Lecture Notes 1 Probability and Random Variables Probability Spaces Conditional Probability and Independence Random Variables Functions of a Random Variable Generation of a Random Variable Jointly Distributed
More informationNORTHERN INDIA ENGINEERING COLLEGE, LKO D E P A R T M E N T O F M A T H E M A T I C S. B.TECH IIIrd SEMESTER QUESTION BANK ACADEMIC SESSION
NORTHERN INDIA ENGINEERING COLLEGE, LKO D E P A R T M E N T O F M A T H E M A T I C S B.TECH IIIrd SEMESTER QUESTION BANK ACADEMIC SESSION 011-1 DISCRETE MATHEMATICS (EOE 038) 1. UNIT I (SET, RELATION,
More informationOn Markov Properties in Evidence Theory
On Markov Properties in Evidence Theory 131 On Markov Properties in Evidence Theory Jiřina Vejnarová Institute of Information Theory and Automation of the ASCR & University of Economics, Prague vejnar@utia.cas.cz
More informationMATH 20F: LINEAR ALGEBRA LECTURE B00 (T. KEMP)
MATH 20F: LINEAR ALGEBRA LECTURE B00 (T KEMP) Definition 01 If T (x) = Ax is a linear transformation from R n to R m then Nul (T ) = {x R n : T (x) = 0} = Nul (A) Ran (T ) = {Ax R m : x R n } = {b R m
More informationMARKOV CHAINS: STATIONARY DISTRIBUTIONS AND FUNCTIONS ON STATE SPACES. Contents
MARKOV CHAINS: STATIONARY DISTRIBUTIONS AND FUNCTIONS ON STATE SPACES JAMES READY Abstract. In this paper, we rst introduce the concepts of Markov Chains and their stationary distributions. We then discuss
More informationMATH 8253 ALGEBRAIC GEOMETRY WEEK 12
MATH 8253 ALGEBRAIC GEOMETRY WEEK 2 CİHAN BAHRAN 3.2.. Let Y be a Noetherian scheme. Show that any Y -scheme X of finite type is Noetherian. Moreover, if Y is of finite dimension, then so is X. Write f
More informationLinear maps. Matthew Macauley. Department of Mathematical Sciences Clemson University Math 8530, Spring 2017
Linear maps Matthew Macauley Department of Mathematical Sciences Clemson University http://www.math.clemson.edu/~macaule/ Math 8530, Spring 2017 M. Macauley (Clemson) Linear maps Math 8530, Spring 2017
More information2.1 Sets. Definition 1 A set is an unordered collection of objects. Important sets: N, Z, Z +, Q, R.
2. Basic Structures 2.1 Sets Definition 1 A set is an unordered collection of objects. Important sets: N, Z, Z +, Q, R. Definition 2 Objects in a set are called elements or members of the set. A set is
More information3. If a choice is broken down into two successive choices, the original H should be the weighted sum of the individual values of H.
Appendix A Information Theory A.1 Entropy Shannon (Shanon, 1948) developed the concept of entropy to measure the uncertainty of a discrete random variable. Suppose X is a discrete random variable that
More informationMATH 3300 Test 1. Name: Student Id:
Name: Student Id: There are nine problems (check that you have 9 pages). Solutions are expected to be short. In the case of proofs, one or two short paragraphs should be the average length. Write your
More informationMarkov Chains and Stochastic Sampling
Part I Markov Chains and Stochastic Sampling 1 Markov Chains and Random Walks on Graphs 1.1 Structure of Finite Markov Chains We shall only consider Markov chains with a finite, but usually very large,
More informationConvergence of generalized entropy minimizers in sequences of convex problems
Proceedings IEEE ISIT 206, Barcelona, Spain, 2609 263 Convergence of generalized entropy minimizers in sequences of convex problems Imre Csiszár A Rényi Institute of Mathematics Hungarian Academy of Sciences
More informationdiv(f ) = D and deg(d) = deg(f ) = d i deg(f i ) (compare this with the definitions for smooth curves). Let:
Algebraic Curves/Fall 015 Aaron Bertram 4. Projective Plane Curves are hypersurfaces in the plane CP. When nonsingular, they are Riemann surfaces, but we will also consider plane curves with singularities.
More informationVector Space Concepts
Vector Space Concepts ECE 174 Introduction to Linear & Nonlinear Optimization Ken Kreutz-Delgado ECE Department, UC San Diego Ken Kreutz-Delgado (UC San Diego) ECE 174 Fall 2016 1 / 25 Vector Space Theory
More informationCOMP9020 Lecture 3 Session 2, 2014 Sets, Functions, and Sequences. Revision: 1.3
1 COMP9020 Lecture 3 Session 2, 2014 Sets, Functions, and Sequences Revision: 1.3 2 Notation for Numbers Definition Integers Z = {... 2, 1, 0, 1, 2,...} Reals R. : R Z floor of x, the greatest integer
More information(a) For an accumulation point a of S, the number l is the limit of f(x) as x approaches a, or lim x a f(x) = l, iff
Chapter 4: Functional Limits and Continuity Definition. Let S R and f : S R. (a) For an accumulation point a of S, the number l is the limit of f(x) as x approaches a, or lim x a f(x) = l, iff ε > 0, δ
More informationIN POSITIVE CHARACTERISTICS: 3. Modular varieties with Hecke symmetries. 7. Foliation and a conjecture of Oort
FINE STRUCTURES OF MODULI SPACES IN POSITIVE CHARACTERISTICS: HECKE SYMMETRIES AND OORT FOLIATION 1. Elliptic curves and their moduli 2. Moduli of abelian varieties 3. Modular varieties with Hecke symmetries
More informationContents. O-minimal geometry. Tobias Kaiser. Universität Passau. 19. Juli O-minimal geometry
19. Juli 2016 1. Axiom of o-minimality 1.1 Semialgebraic sets 2.1 Structures 2.2 O-minimal structures 2. Tameness 2.1 Cell decomposition 2.2 Smoothness 2.3 Stratification 2.4 Triangulation 2.5 Trivialization
More information1 9/5 Matrices, vectors, and their applications
1 9/5 Matrices, vectors, and their applications Algebra: study of objects and operations on them. Linear algebra: object: matrices and vectors. operations: addition, multiplication etc. Algorithms/Geometric
More informationMATH 556: PROBABILITY PRIMER
MATH 6: PROBABILITY PRIMER 1 DEFINITIONS, TERMINOLOGY, NOTATION 1.1 EVENTS AND THE SAMPLE SPACE Definition 1.1 An experiment is a one-off or repeatable process or procedure for which (a there is a well-defined
More informationCorrections to Introduction to Topological Manifolds (First edition) by John M. Lee December 7, 2015
Corrections to Introduction to Topological Manifolds (First edition) by John M. Lee December 7, 2015 Changes or additions made in the past twelve months are dated. Page 29, statement of Lemma 2.11: The
More information3 Boolean Algebra 3.1 BOOLEAN ALGEBRA
3 Boolean Algebra 3.1 BOOLEAN ALGEBRA In 1854, George Boole introduced the following formalism which eventually became Boolean Algebra. Definition. An algebraic system consisting of a set B of elements
More informationVertex Routing Models and Polyhomeostatic Optimization. Claudius Gros. Institute for Theoretical Physics Goethe University Frankfurt, Germany
Vertex Routing Models and Polyhomeostatic Optimization Claudius Gros Institute for Theoretical Physics Goethe University Frankfurt, Germany 1 topics in complex system theory Vertex Routing Models modelling
More information1. To be a grandfather. Objects of our consideration are people; a person a is associated with a person b if a is a grandfather of b.
20 [161016-1020 ] 3.3 Binary relations In mathematics, as in everyday situations, we often speak about a relationship between objects, which means an idea of two objects being related or associated one
More informationMATH 326: RINGS AND MODULES STEFAN GILLE
MATH 326: RINGS AND MODULES STEFAN GILLE 1 2 STEFAN GILLE 1. Rings We recall first the definition of a group. 1.1. Definition. Let G be a non empty set. The set G is called a group if there is a map called
More informationINTRODUCTION TO REAL ANALYSIS II MATH 4332 BLECHER NOTES
INTRODUCTION TO REAL ANALYSIS II MATH 4332 BLECHER NOTES You will be expected to reread and digest these typed notes after class, line by line, trying to follow why the line is true, for example how it
More informationLinear Independence x
Linear Independence A consistent system of linear equations with matrix equation Ax = b, where A is an m n matrix, has a solution set whose graph in R n is a linear object, that is, has one of only n +
More informationProblem Set 1 Sept, 14
EE6: Random Processes in Systems Lecturer: Jean C. Walrand Problem Set Sept, 4 Fall 06 GSI: Assane Gueye This problem set essentially reviews notions of conditional expectation, conditional distribution,
More informationExpressive Power and Approximation Errors of Restricted Boltzmann Machines
Expressive Power and Approximation Errors of Restricted Boltzmann Machines Guido F. Montúfar, Johannes Rauh, and Nihat Ay, Max Planck Institute for Mathematics in the Sciences, Inselstraße 0403 Leipzig,
More informationTest One Mathematics Fall 2009
Test One Mathematics 35.2 Fall 29 TO GET FULL CREDIT YOU MUST SHOW ALL WORK! I have neither given nor received aid in the completion of this test. Signature: pts. 2 pts. 3 5 pts. 2 pts. 5 pts. 6(i) pts.
More informationFUNCTORS AND ADJUNCTIONS. 1. Functors
FUNCTORS AND ADJUNCTIONS Abstract. Graphs, quivers, natural transformations, adjunctions, Galois connections, Galois theory. 1.1. Graph maps. 1. Functors 1.1.1. Quivers. Quivers generalize directed graphs,
More informationLinear Algebra problems
Linear Algebra problems 1. Show that the set F = ({1, 0}, +,.) is a field where + and. are defined as 1+1=0, 0+0=0, 0+1=1+0=1, 0.0=0.1=1.0=0, 1.1=1.. Let X be a non-empty set and F be any field. Let X
More informationLaw of Cosines and Shannon-Pythagorean Theorem for Quantum Information
In Geometric Science of Information, 2013, Paris. Law of Cosines and Shannon-Pythagorean Theorem for Quantum Information Roman V. Belavkin 1 School of Engineering and Information Sciences Middlesex University,
More informationLecture 10: October 30, 2017
Information an Coing Theory Autumn 2017 Lecturer: Mahur Tulsiani Lecture 10: October 30, 2017 1 I-Projections an applications In this lecture, we will talk more about fining the istribution in a set Π
More informationMachine Learning. Lecture 02.2: Basics of Information Theory. Nevin L. Zhang
Machine Learning Lecture 02.2: Basics of Information Theory Nevin L. Zhang lzhang@cse.ust.hk Department of Computer Science and Engineering The Hong Kong University of Science and Technology Nevin L. Zhang
More informationMathematics Review for Business PhD Students
Mathematics Review for Business PhD Students Anthony M. Marino Department of Finance and Business Economics Marshall School of Business Lecture 1: Introductory Material Sets The Real Number System Functions,
More informationInformation Theory and Statistics Lecture 2: Source coding
Information Theory and Statistics Lecture 2: Source coding Łukasz Dębowski ldebowsk@ipipan.waw.pl Ph. D. Programme 2013/2014 Injections and codes Definition (injection) Function f is called an injection
More informationIntroduction to Arithmetic Geometry Fall 2013 Lecture #17 11/05/2013
18.782 Introduction to Arithmetic Geometry Fall 2013 Lecture #17 11/05/2013 Throughout this lecture k denotes an algebraically closed field. 17.1 Tangent spaces and hypersurfaces For any polynomial f k[x
More informationTopology. Xiaolong Han. Department of Mathematics, California State University, Northridge, CA 91330, USA address:
Topology Xiaolong Han Department of Mathematics, California State University, Northridge, CA 91330, USA E-mail address: Xiaolong.Han@csun.edu Remark. You are entitled to a reward of 1 point toward a homework
More informationChapter 2 Subspaces of R n and Their Dimensions
Chapter 2 Subspaces of R n and Their Dimensions Vector Space R n. R n Definition.. The vector space R n is a set of all n-tuples (called vectors) x x 2 x =., where x, x 2,, x n are real numbers, together
More informationChapter 1. Sets and Mappings
Chapter 1. Sets and Mappings 1. Sets A set is considered to be a collection of objects (elements). If A is a set and x is an element of the set A, we say x is a member of A or x belongs to A, and we write
More informationECE521 Lectures 9 Fully Connected Neural Networks
ECE521 Lectures 9 Fully Connected Neural Networks Outline Multi-class classification Learning multi-layer neural networks 2 Measuring distance in probability space We learnt that the squared L2 distance
More informationParameter Estimation of the Stable GARCH(1,1)-Model
WDS'09 Proceedings of Contributed Papers, Part I, 137 142, 2009. ISBN 978-80-7378-101-9 MATFYZPRESS Parameter Estimation of the Stable GARCH(1,1)-Model V. Omelchenko Charles University, Faculty of Mathematics
More informationSymmetric and self-adjoint matrices
Symmetric and self-adjoint matrices A matrix A in M n (F) is called symmetric if A T = A, ie A ij = A ji for each i, j; and self-adjoint if A = A, ie A ij = A ji or each i, j Note for A in M n (R) that
More informationAnna University, Chennai, November/December 2012
B.E./B.Tech. DEGREE EXAMINATION, NOVEMBER/DECEMBER 2012 Fifth Semester Computer Science and Engineering MA2265 DISCRETE MATHEMATICS (Regulation 2008) Part - A 1. Define Tautology with an example. A Statement
More informationMath 54. Selected Solutions for Week 5
Math 54. Selected Solutions for Week 5 Section 4. (Page 94) 8. Consider the following two systems of equations: 5x + x 3x 3 = 5x + x 3x 3 = 9x + x + 5x 3 = 4x + x 6x 3 = 9 9x + x + 5x 3 = 5 4x + x 6x 3
More informationCases Where Finding the Minimum Entropy Coloring of a Characteristic Graph is a Polynomial Time Problem
Cases Where Finding the Minimum Entropy Coloring of a Characteristic Graph is a Polynomial Time Problem Soheil Feizi, Muriel Médard RLE at MIT Emails: {sfeizi,medard}@mit.edu Abstract In this paper, we
More informationCutting planes from extended LP formulations
Cutting planes from extended LP formulations Merve Bodur University of Wisconsin-Madison mbodur@wisc.edu Sanjeeb Dash IBM Research sanjeebd@us.ibm.com March 7, 2016 Oktay Günlük IBM Research gunluk@us.ibm.com
More information4 Vector Spaces. 4.1 Basic Definition and Examples. Lecture 10
Lecture 10 4 Vector Spaces 4.1 Basic Definition and Examples Throughout mathematics we come across many types objects which can be added and multiplied by scalars to arrive at similar types of objects.
More informationKybernetika VOLUME 43 (2007), NUMBER 5. The Journal of the Czech Society for Cybernetics and Information Sciences
Kybernetika VOLUME 43 (2007), NUMBER 5 The Journal of the Czech Society for Cybernetics and Information Sciences Published by: Institute of Information Theory and Automation of the AS CR, v.v.i. Editor-in-Chief:
More informationQuizzes for Math 401
Quizzes for Math 401 QUIZ 1. a) Let a,b be integers such that λa+µb = 1 for some inetegrs λ,µ. Prove that gcd(a,b) = 1. b) Use Euclid s algorithm to compute gcd(803, 154) and find integers λ,µ such that
More informationRings With Topologies Induced by Spaces of Functions
Rings With Topologies Induced by Spaces of Functions Răzvan Gelca April 7, 2006 Abstract: By considering topologies on Noetherian rings that carry the properties of those induced by spaces of functions,
More informationMath 341: Convex Geometry. Xi Chen
Math 341: Convex Geometry Xi Chen 479 Central Academic Building, University of Alberta, Edmonton, Alberta T6G 2G1, CANADA E-mail address: xichen@math.ualberta.ca CHAPTER 1 Basics 1. Euclidean Geometry
More informationStatistics 612: L p spaces, metrics on spaces of probabilites, and connections to estimation
Statistics 62: L p spaces, metrics on spaces of probabilites, and connections to estimation Moulinath Banerjee December 6, 2006 L p spaces and Hilbert spaces We first formally define L p spaces. Consider
More informationMathematics Review for Business PhD Students Lecture Notes
Mathematics Review for Business PhD Students Lecture Notes Anthony M. Marino Department of Finance and Business Economics Marshall School of Business University of Southern California Los Angeles, CA 90089-0804
More informationChapter 1 Preliminaries
Chapter 1 Preliminaries 1.1 Conventions and Notations Throughout the book we use the following notations for standard sets of numbers: N the set {1, 2,...} of natural numbers Z the set of integers Q the
More informationIntroduction to Machine Learning CMU-10701
Introduction to Machine Learning CMU-10701 Markov Chain Monte Carlo Methods Barnabás Póczos & Aarti Singh Contents Markov Chain Monte Carlo Methods Goal & Motivation Sampling Rejection Importance Markov
More informationName (print): Question 4. exercise 1.24 (compute the union, then the intersection of two sets)
MTH299 - Homework 1 Question 1. exercise 1.10 (compute the cardinality of a handful of finite sets) Solution. Write your answer here. Question 2. exercise 1.20 (compute the union of two sets) Question
More information