AN ALGEBRAIC APPROACH TO GENERALIZED MEASURES OF INFORMATION

Similar documents
Boolean Algebras, Boolean Rings and Stone s Representation Theorem

Formal power series rings, inverse limits, and I-adic completions of rings

1 Notations and Statement of the Main Results

Part V. 17 Introduction: What are measures and why measurable sets. Lebesgue Integration Theory

Eilenberg-Steenrod properties. (Hatcher, 2.1, 2.3, 3.1; Conlon, 2.6, 8.1, )

Universal Algebra for Logics

Skew Boolean algebras

Measures and Measure Spaces

1.1. MEASURES AND INTEGRALS

Finite-dimensional spaces. C n is the space of n-tuples x = (x 1,..., x n ) of complex numbers. It is a Hilbert space with the inner product

RS Chapter 1 Random Variables 6/5/2017. Chapter 1. Probability Theory: Introduction

Category Theory. Categories. Definition.

Algebraic Varieties. Notes by Mateusz Micha lek for the lecture on April 17, 2018, in the IMPRS Ringvorlesung Introduction to Nonlinear Algebra

On minimal models of the Region Connection Calculus

INVERSE LIMITS AND PROFINITE GROUPS

ABSTRACT NONSINGULAR CURVES

MATH 205B NOTES 2010 COMMUTATIVE ALGEBRA 53

ALGEBRAIC GEOMETRY COURSE NOTES, LECTURE 9: SCHEMES AND THEIR MODULES.

Notes about Filters. Samuel Mimram. December 6, 2012

CHAPTER 1. AFFINE ALGEBRAIC VARIETIES

HILBERT SPACES AND THE RADON-NIKODYM THEOREM. where the bar in the first equation denotes complex conjugation. In either case, for any x V define

CHAPTER I THE RIESZ REPRESENTATION THEOREM

Direct Limits. Mathematics 683, Fall 2013

5 Set Operations, Functions, and Counting

Algebraic Geometry

Lattices, closure operators, and Galois connections.

CW-complexes. Stephen A. Mitchell. November 1997

Congruence Boolean Lifting Property

2.2 Annihilators, complemented subspaces

LECTURE 3: RELATIVE SINGULAR HOMOLOGY

Boolean Algebra and Propositional Logic

Math 145. Codimension

V (v i + W i ) (v i + W i ) is path-connected and hence is connected.

B 1 = {B(x, r) x = (x 1, x 2 ) H, 0 < r < x 2 }. (a) Show that B = B 1 B 2 is a basis for a topology on X.

Math 396. Quotient spaces

3. The Sheaf of Regular Functions

MATH 8253 ALGEBRAIC GEOMETRY WEEK 12

AHAHA: Preliminary results on p-adic groups and their representations.

MV-algebras and fuzzy topologies: Stone duality extended

CATEGORICAL GROTHENDIECK RINGS AND PICARD GROUPS. Contents. 1. The ring K(R) and the group Pic(R)

June 2014 Written Certification Exam. Algebra

Representations of quivers

MATH 101: ALGEBRA I WORKSHEET, DAY #1. We review the prerequisites for the course in set theory and beginning a first pass on group. 1.

FOUNDATIONS OF ALGEBRAIC GEOMETRY CLASS 25

Topological vectorspaces

Boolean Algebra and Propositional Logic

Math 210B. Artin Rees and completions

Section 2: Classes of Sets

INTRODUCTION TO FURSTENBERG S 2 3 CONJECTURE

The Measure Problem. Louis de Branges Department of Mathematics Purdue University West Lafayette, IN , USA

Lecture 6: Etale Fundamental Group

Category O and its basic properties

MATH & MATH FUNCTIONS OF A REAL VARIABLE EXERCISES FALL 2015 & SPRING Scientia Imperii Decus et Tutamen 1

38 Irreducibility criteria in rings of polynomials

Lie Algebra Cohomology

Notes on p-divisible Groups

Filters in Analysis and Topology

TROPICAL SCHEME THEORY

Product measures, Tonelli s and Fubini s theorems For use in MAT4410, autumn 2017 Nadia S. Larsen. 17 November 2017.

Math 396. Bijectivity vs. isomorphism

IndCoh Seminar: Ind-coherent sheaves I

Math 250A, Fall 2004 Problems due October 5, 2004 The problems this week were from Lang s Algebra, Chapter I.

A Note on the Inverse Limits of Linear Algebraic Groups

Measure and Integration: Concepts, Examples and Exercises. INDER K. RANA Indian Institute of Technology Bombay India

58 CHAPTER 2. COMPUTATIONAL METHODS

which element of ξ contains the outcome conveys all the available information.

THEOREMS, ETC., FOR MATH 515

3. Abstract Boolean Algebras

Fréchet algebras of finite type

II - REAL ANALYSIS. This property gives us a way to extend the notion of content to finite unions of rectangles: we define

Spectral Theory, with an Introduction to Operator Means. William L. Green

Tensor algebras and subproduct systems arising from stochastic matrices

5 Quiver Representations

CHAPTER 4. βs as a semigroup

CHAPTER VIII HILBERT SPACES

ALGEBRAIC GEOMETRY (NMAG401) Contents. 2. Polynomial and rational maps 9 3. Hilbert s Nullstellensatz and consequences 23 References 30

ABSTRACT DIFFERENTIAL GEOMETRY VIA SHEAF THEORY

THEOREMS, ETC., FOR MATH 516

W if p = 0; ; W ) if p 1. p times

Universität des Saarlandes. Fachrichtung 6.1 Mathematik

Part III. 10 Topological Space Basics. Topological Spaces

Löwenheim-Skolem Theorems, Countable Approximations, and L ω. David W. Kueker (Lecture Notes, Fall 2007)

Math 429/581 (Advanced) Group Theory. Summary of Definitions, Examples, and Theorems by Stefan Gille

The prime spectrum of MV-algebras based on a joint work with A. Di Nola and P. Belluce

MATH MEASURE THEORY AND FOURIER ANALYSIS. Contents

int cl int cl A = int cl A.

SUBLATTICES OF LATTICES OF ORDER-CONVEX SETS, III. THE CASE OF TOTALLY ORDERED SETS

(dim Z j dim Z j 1 ) 1 j i

Metric Spaces and Topology

and this makes M into an R-module by (1.2). 2

Real Analysis Notes. Thomas Goller

The Logic of Partitions

Integration on Measure Spaces

Measure Theory on Topological Spaces. Course: Prof. Tony Dorlas 2010 Typset: Cathal Ormond

Topological properties

MATH 221 NOTES BRENT HO. Date: January 3, 2009.

Lebesgue measure on R is just one of many important measures in mathematics. In these notes we introduce the general framework for measures.

Math 249B. Nilpotence of connected solvable groups

Indecomposable Quiver Representations

Duality, Residues, Fundamental class

Transcription:

AN ALGEBRAIC APPROACH TO GENERALIZED MEASURES OF INFORMATION Daniel Halpern-Leistner 6/20/08 Abstract. I propose an algebraic framework in which to study measures of information. One immediate consequence is a new proof of the classification of branching entropies on a Boolean algebra. Common constructions in the theory of entropy such as the conditional entropy and the mutual information arise naturally in this framework. A generalized theory of information can be interpreted in a category-theoretic context as a natural transformation between two functors. In 1948, Claude E. Shannon established the statistical theory of entropy (Shannon 1948), which offered deep insights into coding theory and compression. Since then researchers have been generalizing the axiomatic structure of the Shannon entropy so that they might bring it to bear on other mathematical questions. One such generalization is the notion of a branching inset entropy (Kannappan & Sander 2004). In 1983, J. Aczel named the problem of classifying inset information measures on a list of unsolved problems in the theory of functional equations at the Twenty-First Symposium on Functional Equations. The problem was solved (Ebanks 1986, B.R. Ebanks & Ng 1990), but the proof is 20 pages long and involves checking over a dozen special cases. Below we introduce an algebraic object, the partition algebra, and we identify the space of branching entropies as the dual of this object. In addition to producing a new and simple algebraic proof of the classification of branching entropies, this construction endows the space of branching entropies with a rich algebraic structure with information theoretic interpretations and leads to a notion of a generalized information theory. 1. Definition of the partition algebra We fix an arbitrary base ring k, but for most applications we can consider k = R. For any boolean algebra F, we consider the space of formal linear combinations of elements of F with coefficients in k, where we take the element F to be zero. We can extend the intersection operation bilinearly to define a multiplication of formal sums. This k-module, along with the multiplication, is a k-algebra denoted k 0 F. Define the k-algebra A k F to be the quotient of k 0 F by the ideal generated by expressions of the form a b + a b a b for all a, b F. In other words A k F is the space of formal linear combinations of elements of F, but where we consider a b = a + b whenever a, b F are disjoint. We refer to A k F as the algebra of simple functions on F, because if F is a boolean algebra of subsets of some set X, then A k F is just the algebra of k-valued functions on X whose range is a finite set in k and whose level sets are elements of F. We see this by identifying the formal element E F A k F with the indicator function χ E for all subsets E F. Denote the surjection onto the quotient algebra π F : k 0 F A k F. This surjection is natural in the sense that if φ : F G is a map of boolean algebras, then there is a 1

corresponding map of short exact sequences 0 ker π F k 0 F π F A k F 0 φ 0 ker π G k 0 G π G A k G 0 Finally we define the partition algebra as the smallest subalgebra of k 0 F containing ker π F. So we have P k F := ker π F k 1 k 0 F The projection π F induces an augmentation map π F : P k F k, and a map of boolean algebras F G naturally induces a map of augmented k-algebras P k F P k G. The space ker π F is a k 0 F-module and hence a P k F-module. 1.1. Calculation in P k F and A k F The main tool for calculation in the algebra of simple functions A k F is the existence of a canonical form Proposition 1..1. For any element f A k F there is a unique minimal (i.e. fewest terms) expression f = λ 1 a 1 + + λ n a n such that λ i k are non-zero and the elements a i F are pairwise disjoint. This proposition makes rigorous the analogy between elements of A k F and functions, and many arguments can be made using this analogy. Remark 1..2. Another way to make this analogy rigorous when k is a field is to consider the spectrum Spec A k F, then the boolean algebra F is canonically isomorphic to the lattice of open-and-closed subsets of this space. When k = Z/2 this is just Stone s theorem as presented in (M.F. Atiyah 1969) φ φ Next we introduce a notation for calculation in the algebra P k F. If ω = {ω 1,..., ω n } is an unordered n-tuple of pairwise disjoint elements of F (we ignore the zero element F), i.e. ω i ω j = for i j, then we define ω := ω 1 + + ω n k 0 F ω := (ω 1 ω n ) ω P k F We call such an n-tuple a partition of F and say that ω is full if ω 1 ω n = 1. If ω is full then ω P k F k 0 F and ω = 1 ω by definition. Furthermore in this case π F (ω ) = 1. Next if ω and ξ are two partitions, we can define their join to be the tuple of pairwise intersections ω ξ := {ω 1 ξ 1, ω 1 ξ 2,...,..., ω m ξ n } For any two partitions we have ω ξ = (ω ξ), and ω and ω are orthogonal idempotents, i.e. (ω ) 2 = ω, (ω ) 2 = ω, ω ω = 0 2

Finally if both ω and ξ are full partitions then ω ξ = ω + ξ (ω ξ). There is a correspondence between full partitions of F and finite boolean subalgebras of F. If G F is a finite boolean subalgebra, then it has a set of irreducible generators (sometimes called atoms ). The set of atoms is the unique collection a 1,..., a n G which are distinct, nonzero generators and such that a i x = a i or for all x G In particular a 1 a n = 1 and the a i are pairwise disjoint. We can associate to G the full partition {a 1,..., a n }, and this establishes a bijection between finite subalgebras and full partitions. Sometimes it is convenient to confuse ω with the subalgebra it generates and vice versa. In the context of subalgebras the join ω ξ is the smallest boolean subalgebra of F containing both ω and ξ, i.e. the subalgebra generated by the two subalgebras ω and ξ. One important fact about the partition algebra is the following Proposition 1..3. Let k be a ring and F a boolean algebra. The space ker π F is generated as a k-module by the set of ω, where ω = {ω 1, ω 2 } ranges over all partitions of F with just two elements. This is an immediate consequence of the Proposition 2..2 below, so we will postpone the proof until then. Corollary 1..4. The space ker π F is generated as a k-module by the set of ω, where ω = {ω 1, ω 2, ω 3 } ranges over all full partitions of F with three elements. The space P k F is generated as a k-modules by the set of ω, where ω ranges over all full partitions of F with three elements. 2. The relation to entropy The algebraic objects defined in the previous section capture many important structures in information theory and measure theory. Dualizing the relationship between the partition algebra and the algebra of simple functions will yield a relationship between finitely additive measures and so-called branching entropies on a boolean algebra. 2.1. Additive measures Given a boolean algebra F, the space M k (F) of additive measures on F is defined to be the k-module of functions µ : F k such that µ( ) = 0 and µ(a b) = µ(a) + µ(b) whenever a b = Note that these are finitely additive measures and need not be countably additive even when F is a σ-algebra. One can see that the space M k (F) is naturally identified with the k dual of the space A k F, i.e. the space (A k F) of k-linear maps A k F k. In the case k = R we can relate this definition to the usual notion of a measure on a measure space. First of all, a measure is usually considered to take nonnegative values, i.e. µ(a) 0 for all a F. We can express this condition in algebraic terms by saying that the symmetric bilinear form on A R F given by (f, g) µ := µ(f g) 3

is positive semi-definite. One can use Proposition 1..1 to show that µ nonnegative is equivalent to (, ) µ positive semi-definite. We denote by M + R (F) the space of nonnegative measures on F it is closed under addition and multiplication by positive scalars (i.e. it is a semigroup under addition with a compatible R + group action). Classically (, ) µ is just the L 2 inner product on the algebra of simple functions. From this algebraic perspective, the classical Jordan decomposition theorem which states that a signed measure µ can be written uniquely as the difference of two orthogonal nonnegative measures µ = µ 1 µ 2 can be seen as a decomposition of a symmetric bilinear form as the difference of two orthogonal nonnegative forms (i.e. such that the null space of µ is precisely the intersection of the null spaces of µ 1 and µ 2 ). The inner product perspective gives us yet another way to think about measures on the space A k F. It can be shown that µ (, ) µ gives a one-to-one onto correspondence between measures on F and inner products on A k F under which the action of k 0 F on A k F is self-adjoint. Remark 2..1. In order to discuss countably additive measures on a σ-algebra, we must introduce a suitable topology on the vector space A R F. Then the space of countably additive measures can be constructed as the space of continuous linear functionals on A R F. 2.2. Branching entropies The notion of a (symmetric) branching entropy is a common generalization of the Shannon entropy and several other information measures. For a particular boolean algebra F and base ring k, a branching entropy h is a k-valued function on the set of all partitions of F which satisfies the following axioms h(ω 1,..., ω n ) = h(, ω 1,..., ω n ) h(ω 1,..., ω n+1 ) = h(ω 1, ω 2 ) + h(ω 1 ω 2, ω 3,..., ω n+1 ) In particular these axioms imply that for a one-element partition we have h(ω 1 ) = 0. The space of all such branching entropies is a k-module which we will denote as Br k (F). A map of boolean algebras φ : F G induces a map of k-modules φ : Br k (G) Br k (F) and (φ ϕ) = ϕ φ. As with additive measures we can consider the k-module Rel k (F) := { formal sums of partitions λ 1 ω (1) + λ m ω (m)} / where {, ω 1,..., ω n } {ω 1,..., ω n } {ω 1,..., ω n+1 } {ω 1, ω 2 } + {ω 1 ω 2, ω 3,..., ω n+1 } and one can see that Br k (F) is naturally identified with the k dual of Rel k (F), i.e. Br k (F) = (Rel k (F)) naturally. We have the following theorem Proposition 2..2. The assignment ω ω induces a natural isomorphism of k-modules Rel k (F) = ker π F. Corollary 2..3. There is a natural isomorphism of k-modules Br k (F) = (ker π F ). 4

Proof. k 0 F is the union of the subalgebras k 0 ξ k 0 F where ξ ranges over all Boolean subalgebras of F. The map ω ω is natural with respect to these inclusions and the corresponding inclusions Rel k (ξ) Rel k (F), so it suffices to show that the map is an isomorphism for F = ξ finite. We equip Rel k (F) with a natural k 0 F-module structure (hence a P k F-module structure) via the action a {ω 1,..., ω n } := {a ω 1,..., a ω n } And for finite F = ξ the space ker π ξ is generated as a P k ξ-module by ξ because A k ξ can be naturally identified with ξ k 0 ξ and the projection k 0 ξ A k ξ is just multiplication by ξ. It follows that for finite ξ we can define an inverse explicitly by u ξ u ξ for u P k ξ. It can be shown that the spaces A k F and P k F are both projective as k-modules A, and thus taking the dual spaces gives a short exact sequence 0 M k (F) (k 0 F) Br k (F) 0 This short exact sequence classifies the space of branching entropies. Explicitly it states that any branching entropy is of the form h(ω 1,..., ω n ) = f(ω ) = f(ω 1 ω n ) f(ω 1 ) f(ω n ) for some function f : F k with f( ) = 0. Furthermore, the short exact sequence implies that two functions f, f induce the same branching entropy if and only if their difference f f is an additive measure on F. This gives a simple algebraic proof of the classification of branching entropies, and this proof works over a general base ring k and not just R. However, the isomorphism 2..3 does more than just classify the k-module of branching entropies. As we show in the next section, this isomorphism endows Br k (F) with a rich algebraic structure. 2.3. The algebraic structure of branching entropies From the proposition 2..2, the space Rel k F is naturally a P k (F)-module. This induces a P k F-module structure on Br k (F), given explicitly by (ω h)(ξ) = h(ω ξ ) = h((ω ξ) ω ) = h(ω ξ) h(ω) Here we have assumed that ω and ξ are both full partitions. In classical information theory the quantity H(ω ξ) H(ω) is defined to be the conditional entropy H(ξ ω), although here it arises as the natural action of the partition algebra on the space of branching entropies. In the classical notation we write this action as (ω h)( ) = h( ω). Standard formulas involving conditional entropy, such as the fact that h(ζ ω ξ) = h(ζ ξ ω) h(ξ ω) follow immediately from this interpretation of conditional entropy. Note: The space ker π F is a k 0 F-module in addition to being a P k F-module. Thus studying the action of all of k 0 F on Br k (F) provides a new more general notion of conditional entropy which does not seem to appear in the classical theory. 5

Given a branching entropy h (ker π F ) we define a symmetric bilinear form (u, v) h := h(u v) for u, v ker π F. As usual we can think of u 2 = (u, u) as a kind of pseudonorm on ker π F, although one should exercise caution as to when this norm is nonnegative (see below). This symmetric bilinear form gives rise to many of the classical quantities studied in information theory (ω, ξ ) = h(ω) + h(ξ) h(ω ξ) ω ξ 2 = h(ξ ω) (1) ω ξ 2 = 2h(ω ξ) h(ω) h(ξ) = h(ξ ω) + h(ω ξ) The quantity (ξ, ω ) is usually called the mutual information and denoted I(ω; ξ). The quantity ξ ω 2 is called the Rohklin metric and is sometimes denoted ρ(ω, ξ). The short exact sequence 2.2. allows us to define the notion of the complement of a branching entropy. We will need the following lemma Lemma 1. Fix a base ring k and let F be an arbitrary boolean algebra. There exists a finitely additive measure µ M k (F) with µ(1) = 1. If k = R, the measure can be chosen to be nonnegative. This is just a rephrasing of the existence of ultrafilters (). The ultrafilter lemma implies that there are boolean algebra homomorphisms ψ : F {, 1}. One can verify that if we consider 1 = 1 k and = 0 k then the boolean algebra homomorphism is precisely a finitely additive measure. We summarize the definition and properties of the complement map in the following proposition (we use [f(x)] Br k (F) to denote the branching entropy represented by the function f : F k) Proposition 2..4. Let µ M k (F) with µ(1) = 1, then the k-linear map ( ) c : Br k (F) Br k (F) : [f(x)] [f(x c ) f(1)µ(x c )] is well-defined and independent of the choice of µ. Also (h c ) c = h for any h Br k (F). If we think of Br k as a functor Br k : Bool op Mod k from the (opposite) category of boolean algebras to the category of k-modules, then the complement is a natural transformation of functors ( ) c : Br k Br k. All of the above structure on Br k (F) leads one to interpret a branching entropy h as a sort of state of knowledge in a whole configuration space Br k (F). Imagine h represents my knowledge of the outcome of a chemistry experiment. The relation (ω h)( ) = h( ω) allows us to interpret the action of the partition algebra on h as appending facts (i.e. additional data and measurements) to my current state of knowledge h. In the same vein, ω = h(ω ) represents the knowledge I stand to gain once I am given the data encoded by ω. Finally the complement h c represents complementary knowledge in the sense that if h were to encode which of several possible events occured in my experiment, the h c encodes a weighted average of the knowledge of whether each event failed to occur. 6

Example 2..5. For any nonnegative measure µ M + R (F) we can assign the Shannon entropy H µ Br R (F) induced by the function a η(µ(a)) := µ(a) log µ(a). Explicitly the Shannon entropy assigns H µ (ω) = η(µ(ω 1 ω n )) η(µ(ω 1 )) η(µ(ω n )) This assignment rule µ H µ Br R (F) is the entire content of the classical statistical definition of information. If we consider Br R and M R as contravariant functors M + R, Br R : Bool op Set, then the Shannon entropy is just a natural transformation of functors S : M + R Br R. 3. Generalized measures of information Motivated by the Shannon entropy we define Definition 3..1. A generalized information theory for the category C is a triple(f, Q, η), where F : C Set is a structure functor, Q : C Bool op associates a Boolean algebra to each object of C, and η : F Br R Q is a natural transformation of functors. C F Q η Bool op Br R Set The Shannon entropy is purely statistical, C = Bool op and F = M R so the entropy only depends on a measure on a Boolean algebra. However, we can imagine theories of information that depend on additional structures such as that of a metric space or a group action, etc.. This definition of a generalized information theory is a first step towards extending classical information theory uniformly to other branches of mathematics. 3.1. Nonnegativity of branching entropies We know from the classical theory that one of the most important properties of the Shannon entropy is the quantities H µ (ω), I µ (ω; ξ), ρ µ (ω, ξ), and H µ (ω ξ) are nonnegative and even obey some subadditivity laws. To generalize this call a branching entropy nonnegative if the form (, ) h is positive semi-definite, and denote by Br + R (F) the space of nonnegative entropies on F. As before, Br + R (F) is a semigroup under addition and has an R+ group action under scalar multiplication. Similarly we call an information theory nonnegative if the map η : F Br R Q lands in the subfunctor Br + R Q. Unfortunately the Shannon entropy is not nonnegative in this sense, but the complement of the Shannon entropy is. Define for any finite Boolean algebra ξ the special element e ξ := a F ( 1) a a c, where a denotes the number of atoms in a. We have the lemma Lemma 2. Let η : F Br R be an information theory on C = Bool op. Then η is nonnegative iff η F [x](e F ) 0 for all finite Boolean algebras F and every x F (F). Using this lemma 7

Proposition 3..2. The complement of the Shannon entropy H c : µ H c µ is nonnegative. These ideas are still in their infancy, but the great range of information-theoretic constructions captured by these algebraic structures suggests that they are a step in the right direction towards a general theory of mathematical complexity. In addition to developing the idea of nonnegativity further, we are studying specific examples of information theories on metric spaces and dynamical systems, and we are studying the relationship between entropy and coding in this general context. A Proof that A k F and P k F are projective k modules Proposition A.1. For any boolean algebra F and any ring R, the R-module A R F is projective. Proof. Any element of A R F lies in the image of some finite subalgebra A R ξ A R F, we have the isomorphism A R F = lim A R ξ ξ F ξ < To show A R F is projective, we must show that any diagram of the form M ψ lim A R ξ ϕ ξ N has a lift ψ. Any such ϕ is simply a family of homomorphisms ϕ ξ : A R ξ N that are compatible with the inclusion homomorphisms. A R ξ is free on the set of atoms {ξ 1,..., ξ n }, so we can think of ϕ as a compatible family ϕ ξ : {ξ 1,..., ξ n } N of maps of sets. The compatibility condition means that for all ξ ω we have ϕ ξ (ξ i ) = ϕ ω (ω j ) N (2) ω j ξ i for all atoms ξ 1,..., ξ n of ξ. We seek a family of lifts ψ ξ that satisfy the relation 2. Define the set S(ξ) for each finite ξ by S(ξ) = {f : {ξ 1,..., ξ n } M so that π f(ξ i ) = ϕ ξ (ξ i )} and for any ξ ω F we have the maps of sets S(ω) S(ξ) f f (ξ i ) = where f S(ξ) because π is linear, and thus π f (ξ i ) = ω j ξ i π f(ω j ) = 8 π ω j ξ i f(ω j ) ω j ξ i ϕ ω (ω j ) = ϕ ξ (ξ i )

Furthermore the maps S(ω) S(ξ) are surjective. To see this, take a particular f S(ξ), then we construct a lift f S(ω) as follows: For each ξ i, take the set of ω j ξ i. We call them ω 1,..., ω k for simplicity. Now assign the values of f(ω 2 ),..., f(ω k ) M arbitrarily so that π(f(ω j )) = φ ω (ω j ) N for j = 2,..., k. Then we finally assign the value of f(ω 1 ) = f (ξ i ) f(ω 2 ) f(ω k ) And it follows that π f(ω 1 ) = π f (ξ i ) k π f(ω j ) = ϕ ξ (ξ i ) j=2 k ϕ ω (ω j ) = ϕ ω (ω 1 ) And repeating this procedure for each ξ i gives a lift f S(ω), so in fact we have S(ω) S(ξ) Now a lift ψ of ϕ corresponds to a compatible choice of elements of S(ξ) for all ξ, hence an element of lim S(ξ). But the limit over a direct system of non-empty sets and ξ surjections is always non-empty, and hence there is a lift ψ : A R F M so that ϕ = π ψ. j=2 Corollary A.2. The k-module ker π F is projective Proof. This follows because k 0 F is a free k-module. Because A k F is projective, it follows that ker π F is a direct summand of k 0 F, hence projective. B Proof that the Shannon entropy is nonnegative For n 0 and a continuous map η : [0, ) R, let n F n [η](x 1,..., x n ) = ( 1) k η(x i1 + + x ik ) i 1 < <i k k=0 where the k = 0 contribution is just η(0). It will be convenient to introduce the notation t η(z) := η(z + t) for the action by translation of positive numbers on the space of functions [0, ) R. Observe that if η is differentiable then F n [η] is as well, and we can calculate F n [η] x i (x 1,..., x n ) = F n 1 [x i η ](x 1,..., ˆx i,..., x n ) t F n[t η](x 1,..., x n ) = F n [t η ](x 1,..., x n ) We also have that F n [η](0, x 2,..., x n ) = 0, because each term containing x i will cancel the corresponding term not containing x 1. Proposition B.1. Let η : [0, ) R be a smooth function such that d 2k η(z) dz 2k 0 for all z > 0 and k n. Then F n [η](x 1,..., x n ) is nonnegative for x 1,..., x n 0. 9

Proof. We prove this by induction on n. For the case n = 0, we have F 0 [η]() = η(0) 0 by continuity and the hypothesis that d0 η = η 0 for z > 0. dz 0 Now fix a choice of p 1,..., p n 0. Then we will show that F n [η](p 1 + t, p 2 t, p 3,..., p n ) (3) is concave as a function of t [ p 1, p 2 ]. Computing the second derivative gives 2 t 2 F n[η](p 1 + t, p 2 t, p 3,..., p n ) = t F n 1[(t + p 1 ) η ](p 2 t, p 3,..., p n ) + t F n 1[(p 2 t) η ](p 1 + t, p 3,..., p n ) = F n 1 [(p 1 + t) η ](p 2 t, p 3,..., p n ) F n 1 [(p 2 t) η ](p 1 + t, p 3,..., p 4 ) 2F n 2 [(p 1 + p 2 ) η ](p 3,..., p n ) And the new functions η = (p 1 + t) η, (p 2 t) η, and (p 1 + p 2 ) η all have the d property that 2k η 0 for k n 1. The inductive hypothesis implies that each term dz 2k above is non-positive and so the whole expression is 0. Thus we have verified that the function (3) is concave for t [ p 1, p 2 ]. In addition, the function is 0 when t = p 1 or t = p 2 because F n [η] vanishes when any one of its arguments vanishes. Concavity implies the function is nonnegative when t = 0 [ p 1, p 2 ]. The proposition now follows by induction on n. Remark B.2. Under the hypotheses of proposition B.1, the second derivative of F n [η](tp 1, (1 t)p 2, p 3,..., p n ) is nonnegative. It follows that F n [η] is a concave function. References P. K. B.R. Ebanks & C. Ng (1990). Recursive Inset Entropies of Multiplicative Type on Open Domains. Aequationes Math. 3p.(pp. 100). B. Ebanks (1986). Branching Inset Entropies on Open Domains. Aequationes Math. 30.(pp. 187). P. Kannappan & W. Sander (2004). Inset Information Measures on Open Domains. Aequationes Math. 68.(pp. 289). I. M. M.F. Atiyah (1969). Introduction to Commutative Algebra. Westview Press, Boulder,CO. C. Shannon (1948). A Mathematical Theory of Communication. Bell System Technical Journal 27.(pp. 379, 623). 10