DUAL CONNECTIONS IN NONPARAMETRIC CLASSICAL INFORMATION GEOMETRY

Size: px
Start display at page:

Download "DUAL CONNECTIONS IN NONPARAMETRIC CLASSICAL INFORMATION GEOMETRY"

Transcription

1 DUAL CONNECTIONS IN NONPARAMETRIC CLASSICAL INFORMATION GEOMETRY M. R. GRASSELLI Department of Mathematics and Statistics McMaster University 1280 Main Street West Hamilton ON L8S 4K1, Canada NONPARAMETRIC INFORMATION GEOMETRY Abstract. We show how to obtain the mixture connection in an infinite dimensional information manifold and prove that it is dual to the exponential connection with respect to the Fisher information. We also define the α-connections and prove that they are convex mixtures of the extremal (±1)-connections. Key words and phrases: Information geometry, statistical manifiold, Fisher metric, Orlicz spaces, Amari-Nagaoka duality 1. Introduction Information geometry is the branch of probability theory dedicated to provide families of probability distributions with differential geometrical structures. One then uses the tools of differential geometry in order to have a clear and intuitive picture, as well as rigor, in a variety of practical applications ranging from neural networks to statistical es- 1

2 timation, from mathematical finance to nonequilibrium statistical mechanics (see Sollich et al. (2001)). It was just over half a century ago that the Fisher information log p(x, θ) log p(x, θ) (1.1) g ij = p(x, θ)dx θ i θ j was independently suggested by Rao (1945) and Jeffreys (1946) as a Riemannian metric for a parametric statistical model {p(x, θ), θ = (θ 1,..., θ n )}. The Riemannian geometry of statistical models was then studied as a mathematical curiosity for some years, with an emphasis in the geodesic distances associated with the Levi-Civita connection for this metric. A greater amount of attention was devoted to the subject after Efron (1975) introduced the concept of statistical curvature, pointing out its importance to statistical inference, as well as implicitly using a new affine connection, which would be known as the exponential connection. This exponential connection, together with another connection, later to be called the mixture connection, were further investigated by Dawid (1975). The work of several years on the geometric aspects of parametric statistical models culminated with the Amari (1985) masterful account, where the whole finite dimensional differentialgeometric machinery is employed, including a one-parameter family of α-connections, the essential concept of duality and the notions of statistical divergence, projections and minimization procedures. Among the successes of the research at these early stages one could single out the rigidity of the geometric structures, like in the Čencov (1982) result concerning the uniqueness of the Fisher metric with respect to monotonicity and Amari s result concerning the uniqueness of the α-connections as introduced by invariant statistical divergences. The ideas were then extensively used in statistics, in particular higher order asymptotic inference and curved exponential models (see Kass and Vos (1997)). 2

3 A different line of investigation in Information Geometry took off in the nineties: the search for a fully-fledge infinite dimensional manifold of probability measures. As for motivations for this quest, one had, on the practical side, the need to deal with nonparametric models in statistics, where the shape of the underlying distribution is not assumed to be known. On a more fundamental level, there was the desire of having parametric statistical manifolds defined simply as finite dimensional submanifolds of a well defined manifold of all probability measures on a sample space. The motivating idea was already in Dawid (1975) and was also addressed by Amari (1985). The first sound mathematical construction, however, is due to Pistone and Sempi (1995). Given a probability space (, F, µ), they showed how to construct a Banach manifold M of all probability measures equivalent to µ. The Banach space used as generalised coordinates was the Orlicz space L Φ 1, where Φ 1 is an exponential Young function. In a subsequent work, Pistone and Rogantin (1999) analysed further properties of this manifold, in particular the concepts of orthogonality and submanifolds. In section 3, we review their construction and present an alternative proof of the main result in Pistone and Sempi (1995), namely that the collection of covering neighborhoods U p and charts e 1 p form an affine C atlas for M. The crux is proposition 3.1, where we show that the image of overlapping neighborhoods under any chart e 1 p is open in the topology of the target space L Φ 1. The next step in this development was the Gibilisco and Pistone (1998) definition of the exponential connection as the natural connection induced by the use of L Φ 1. These authors then propose a mixture connection acting on the pretangent bundle T M and prove that it is dual to the exponential connection, in the sense of duality for Banach spaces. They further define the α-connections through generalised α-embeddings and show that the formal relation between the exponential, mixture and α-connections are 3

4 the same as in the parametric case, that is (1.2) (α) = 1 + α 2 (e) + 1 α (m). 2 We argue, however, that neither of these two results (duality for the exponential and mixture connection and α-connections as convex mixture of them) is a proper generalisation of the corresponding parametric ones, the reason being twofold. First, Banach space duality is not Amari-Nagaoka duality. The latter refers to a metric being preseverd by the joint action of two parallel transports, which are then said to be dual (see (4.2)). Secondly, all the α-connections in the parametric case act on the tangent bundle, whereas in Gibilisco and Pistone (1998) each of them acts on its own bundle-connection pair, making a formula like (1.2) at least difficult to interpret. In order to solve these problems we adopted in Grasselli (2001) the smaller space M Φ as generalized coordinates (following Pistone (2001)) and showed how to meaningfully introduce the mixture connection on the tangent bundle T M. In the present paper we strengthen this result by showing that the same can be done using the original L Φ spaces as coordinates. This considerably enlarges its domain of applicability, since the space L Φ correspond to random variables with finite moment generating function on an interval around the origin, whereas M Φ correspond to random variables with finite moment generating function for all t R. We then prove that the mixture and exponential connections are dual in the sense of Amari and Nagaoka. In section 5 we show that these definitions reduce to the familiar ones for finite dimensional submanifolds and that exponential and mixture families are geodesic for the exponential and mixture connections respectively. We then move to the subject of α-connections in section 6, where we again rearrange the definitions of Gibilisco and Pistone (1998) in order to have them all acting on the same bundle and with the desired relation between them, the exponential and the mixture 4

5 connections still holding. We finalize the paper by showing that, like the ±1 geodesics, the α geodesics between any two points in the same connected component E(p) of M are also contained in E(p). 2. Orlicz Spaces We present here the aspects of the theory of Orlicz spaces that will be relevant for the construction of the information manifold. For more comprehensive accounts, as well as for the proofs of all statements in this section, the reader is referred to the monographs of Rao and Ren (1991) and Krasnosel skiĭ and Rutickiĭ (1961). The general theory of Orlicz spaces is developed around the concept of a Young function, that is, a convex function Φ : R R + satisfying (i) Φ(x) = Φ( x), x R, (ii) Φ(0) = 0, (iii) lim x Φ(x) = +. For applications in information geometry, it is enough to consider Young functions of the form (2.1) Φ(x) = x 0 φ(t)dt, x 0, where φ : [0, ) [0, ) is nondecreasing, continuous and such that φ(0) = 0 and lim φ(x) = +. Young functions of this type include the monomials x x r /r, for 1 < r <, and the following examples arising in information geometry: (2.2) (2.3) (2.4) Φ 1 (x) = cosh x 1, Φ 2 (x) = e x x 1, Φ 3 (x) = (1 + x ) log(1 + x ) x 5

6 (in the sequel, Φ 1,Φ 2 and Φ 3 will always refer to these three particular functions, with other symbols being used to denote generic Young functions). When a Young function Φ is given in the form (2.1) we can define its complementary (conjugate) function as the Young function Ψ given by (2.5) Ψ(y) = y 0 ψ(t)dt, y 0, where ψ is the inverse of φ. One can verify that (Φ 2, Φ 3 ) and ( x r /r, x s /s), with r 1 + s 1 = 1, are examples of complementary pairs. For a general Young function Φ, the complementary function Ψ is given less constructively by (2.6) Ψ(y) = sup{x 0 : x y Φ(x)}. There are many different ways of introducing a partial order on the class of Young functions. A particularly straightforward one is to say that a Young function Ψ 2 is stronger than another Young function Ψ 1, denoted by Ψ 1 Ψ 2, if there exist a constant a > 0 such that (2.7) Ψ 1 (x) Ψ 2 (ax), x x 0, for some x 0 0 (depending on a). For example, one can verify that (2.8) x Φ 3 x r r x s s Φ 2 whenever 1 < r s <. Two Young functions Ψ 1 and Ψ 2 are said to be equivalent if Ψ 1 Ψ 2 and Ψ 2 Ψ 1, that is, if there exist real numbers 0 < c 1 c 2 < and x 0 0 such that (2.9) Ψ 1 (c 1 x) Ψ 2 (x) Ψ 1 (c 2 x), x x 0. For example, the functions Φ 1 and Φ 2 are equivalent, both being of exponential type. 6

7 There are also several classifications of Young functions according to their growth properties. The only one we are going to need for the construction of the information manifold is the so called 2 -class. A Young function Φ : R R + satisfies the 2 - condition if (2.10) Φ(2x) KΦ(x), x x 0 0, for some constant K > 0. Examples of functions in this class are the monomials x r /r, r 1 and the function Φ 3. Now let (, Σ, P ) be a probability space. The Orlicz class associated with a Young function Φ is defined as (2.11) LΦ (P ) = { f : R, measurable : } Φ(f)dP <. Since P is a finite measure, the Banach space L (, Σ, P ) of essentially bounded random variables is easily seen to be a subset of L Φ (P ) for any Young function Φ. One can also immediately check that L Φ (P ) is a convex set and (with a bit more work) that it is a vector space when Φ satisfies the 2 -condition. The trouble is that, whenever the probability measure P is diffuse on at least one set D Σ of positive measure, the 2 - condition is also necessary for L Φ (P ) to be a vector space (see Rao and Ren (1991, page 46)). In contrast, the Orlicz space associated with a Young function Φ defined as (2.12) L Φ (P ) = { f : R, measurable : } Φ(αf)dP <, for some α > 0 is always vector space (see Rao and Ren (1991, page 49)). If Φ satisfies the 2 -condition then it coincides with its subset L Φ. Moreover, if we identify functions which differ only on sets of measure zero, then L Φ is a Banach space when furnished with the Luxembourg 7

8 norm (see Rao and Ren (1991, page 67)) (2.13) N Φ (f) = inf { k > 0 : Φ( f } k )dp 1, or with the equivalent Orlicz norm (see Rao and Ren (1991, page 61)) { } (2.14) f Φ = sup fg dp : g L Ψ (P ), Ψ(g)dP 1, where Ψ is the complementary Young function to Φ. Φ(f)dP 1 iff N Φ(f) 1 (see Rao and Ren (1991, page 54)). We observe for later use that A key ingredient in the analysis of Orlicz spaces is the generalised Hölder inequality (see Rao and Ren (1991, page 58)). If Φ and Ψ are complementary Young functions, f L Φ (P ), g L Ψ (P ), then (2.15) fg dp 2N Φ (f)n Ψ (g). It follows that each element f L Φ (P ) defines a continuous linear functional on L Ψ (P ), so that if we denote its topological dual by ( L Ψ) we obtain the continuous injection L Φ ( L Ψ) for any pair of complementary Young functions. If Ψ 2 Ψ 1 then there exist a constant k such that Ψ2 k Ψ1 and therefore L Ψ 1 (P ) L Ψ 2 (P ) (see Rao and Ren (1991, page 155)). For instance, due to (2.8) we obtain that for 1 < r s < (2.16) L Φ 2 L s L r L Φ 3 L 1, where L r, r 1 denote the usual Lebesgue spaces on (, Σ, P ), which coincide with the Orlicz space defined by the Young functions x r /r, r 1. If two Young functions are equivalent, the Banach spaces associated with them coincide as sets and have equivalent norms. For example, we denote this by L Φ 1 (P ) L Φ 2 (P ). 8

9 Now define (2.17) M Φ (P ) = { f L Φ : } Φ(kf)dP <, for all k > 0. In the next lemma, we collect important results concerning M Φ for the case of a continuous Young function vanishing only at the origin. Part (i) and the last statement of the lemma are adaptations of Rao and Ren (1991, proposition and corollary 3.4.4), while part (ii) is taken from Rao and Ren (1991, theorem 4.1.6). Lemma 2.1. Let (Φ, Ψ) be a complementary pair of Young functions, Φ continuous, Φ(x) = 0 iff x = 0. Then: (i) M Φ (P ) is the closure of L (, Σ, P ) in the L Φ (P ) norm. (ii) ( M Φ (P ) ) is isometrically isomorphic to L Ψ (P ). If, moreover, Φ satisfies the 2 -condition, then M Φ (P ) = L Φ (P ). Since Φ 3 satisfies the 2 condition, we have that M Φ 3 (P ) = L Φ 3 (P ) by the last statement in the lemma. But by part (ii) of the lemma (M Φ 3 (P )) is isometrically isomorphic to L Φ 2 (P ), since (Φ 2, Φ 3 ) is a complementary pair. Therefore, since Φ 1 and Φ 2 are equivalent, we have (2.18) (L Φ 3 (P )) = (M Φ 3 (P )) = L Φ 2 (P ) L Φ 1 (P ), where the first two equalities correspond to isometric isomorphisms and the third to Banach space equivalence. Similarly M Φ 1 (P ) M Φ 2 (P ) = L Φ 3 (P ). 3. The Pistone-Sempi Information Manifold We start by reviewing the construction of an infinite dimensional information manifold along the lines of Pistone and Sempi (1995); Pistone and Rogantin (1999); Gibilisco 9

10 and Pistone (1998). Consider the set M of all densities of probability measures equivalent to a reference measure µ, that is, M M(, Σ, µ) = {f : R, measurable : f > 0 a.e. and fdµ = 1}. For each point p M, let L Φ 1 (p) be the exponential Orlicz space over the probability space (Σ,, pdµ) and consider its closed subspace of p-centred random variables (3.1) B p = {u L Φ 1 (p) : updµ = 0} as the coordinate Banach space. For definiteness, we choose to work with the Orlicz norm Φ1, although everything could be done with the equivalent Luxemburg norm N Φ1, and use the notation Φ1,p when it is necessary to specify the base point p. In probabilistic terms, the set L Φ 1 (p) correspond to random variables whose moment generating function with respect to the probability pdµ is finite on a neighborhood of the origin (see Pistone and Sempi (1995, proposition 2.3)). In statistics this are exactly the random variables used to define the one dimensional exponential model p(t) associated with a point p M and a random variable u: (3.2) p(t) = etu Z p (tu) p, t ( ε, ε). In particular, if we denote by V p the unit ball in B p, then it follows that the moment generating functional Z p (u) = eu pdµ is finite on V p (see Pistone and Sempi (1995, proposition 2.4)). The underlying idea for the Pistone Sempi manifold is to parametrize the neighborhoods around points p M by all possible one dimensional exponential models passing through p. As a preliminary result, we mention that if two densities p and q are connected by a one dimensional exponential model, then L Φ 1 (p) L Φ 1 (q) (see Pistone and Rogantin (1999, proposition 5)). 10

11 Pistone and Sempi define the inverse of a local chart around p M as e p : V p M (3.3) u eu Z p (u) p. Denote by U p the image of V p under e p. We verify that e p is a bijection from V p to U p, since e u Z p (u) p = ev Z p (v) p implies that (u v) is a constant random variable, which must vanish, since both u, v have zero p expectation. Then let e 1 p be the inverse of e p on U p. One can check that (3.4) e 1 p : U p B p q log ( ) q log p ( ) q pdµ. p and also that, for any p 1, M, the transition functions are given by (3.5) e 1 e p1 : e 1 p 1 (U p1 U p2 ) e 1 (U p1 U p2 ) ( ) p1 u u + log ( u + log p ) 1 dµ. The main result of Pistone and Sempi (1995) is to show that the charts defined above lead to a well define infinite dimensional manifold. The crucial part of the proof is to show that, for any two points p 1, M, the image of the overlapping neighborhoods U p1 U p2 under e 1 p 1 is open in the topology of the model space B p1. To do so they introduce a topology induce by the notion of exponential convergence, with respect to which the sets U p1 U p2 are open, and then show that e 1 p 1 is sequentially continuous from exponential convergence to L Φ 1 convergence. In what follows, we bypass the use of exponential convergence and present a direct proof that Pistone and Sempi s construction yields a Banach manifold. We first need to establish the following proposition. 11

12 Proposition 3.1. For any p 1, M, the set e 1 p 1 (U p1 U p2 ) is open in the topology of B p1. Proof: Suppose that q U p1 U p2 for some p 1, M. Then we can write it as for some u V p1. Using (3.5), we find e 1 (q) = u + log q = ( p1 eu Z p1 (u) p 1, ) Since e 1 (q) V p2, we have that ( ) e 1 (q) Φ1, = u + log p1 ( u + log p ) 1 dµ. ( u + log p ) 1 dµ < 1. Φ1, Consider an open ball of radius r around u = e 1 p 1 (q) e 1 p 1 (U p1 U p2 ) in the topology of B p1, that is, consider the set A r = {v B p1 : v u Φ1,p 1 < r} and let r be small enough so that A r V p1. Then the image in M of each point v A r under e p1 is q = e p1 (v) = ev Z p1 (v) p 1. We claim that q U p1 U p2 if r is sufficiently small. Indeed, applying e 1 ) e 1 ( q) = v + log ( p1 ( v + log p ) 1 dµ, to it we find so ( ) ( ( q) Φ1, v u Φ1, + u + log p1 u + log p ) 1 dµ + (v u) dµ Φ1, v u Φ1, + e 1 (q) Φ1, + 1 Φ1, v u dµ e 1 = v u Φ1, + e 1 (q) Φ1, + v u 1,p2 K, Φ1, 12

13 where K = 1 Φ1, and we use the notation 1,p2 for the L 1 ( )-norm. As we have seen in the previous section, it follows from the growth properties of Φ 1 that there exists c 1 > 0 such that f 1,p2 c 1 f Φ1,. Moreover, since L Φ 1 (p 1 ) and L Φ 1 ( ) coincide as sets and have equivalent norms (since both p 1 and are connected to q by one dimensional exponential models) it follows that there exists a constant c 2 > 0 such that f Φ1, c 2 f Φ1,p 1. Therefore, the previous inequality becomes e 1 ( q) Φ1, c 2 v u Φ1,p 1 + e 1 (q) Φ1, + c 1 c 2 K v u Φ1,p 1 = c 2 (1 + c 1 K) v u Φ1,p 1 + e 1 (q) Φ1,. Thus, is we choose we will have that r < 1 e 1 (q) Φ1,, c 2 (1 + c 1 K) e 1 ( q) Φ1, < 1 which proves the claim. What we have just proved is that e 1 p 1 (U p1 U p2 ) consists entirely of interior points in the topology of B p1, so e 1 p 1 (U p1 U p2 ) is open in B p1. We then have that the collection {(U p, e 1 p ), p M} satisfies the three axioms for being a C atlas for M (see Lang (1995, 0)). Moreover, since all the spaces B p are isomorphic as topological vector spaces, we can say that M is a C manifold modeled on B p. As usual, the tangent space at each point p M can be abstractly identified with B p. A concrete realisation has been given in Pistone and Rogantin (1999, proposition 21), namely each curve through p M is tangent to a one-dimensional exponential model e tu Z p(tu) p, so we take u as the tangent vector representing the equivalence class of such a curve. 13

14 Finally, given a point p M, the connected component of M containing p coincides with the maximal exponential model obtained from p (see Pistone and Sempi (1995, theorem 4.1)): { } e u (3.6) E(p) = Z p (u) p, u B p Z p, where Z p = {f : Z p (f) < } The Fisher Information and Dual Connections In the parametric version of information geometry, Amari and Nagaoka have introduced the concept of dual connections with respect to a Riemannian metric (see Amari and Nagaoka (2000) and the references given therein to their earlier work). For finite dimensional manifolds, any continuous assignment of a positive definite symmetric bilinear form to each tangent space determines a Riemannian metric. In infinite dimensions, we need to impose that the tangent space be self-dual and that the bilinear form be bounded. Since our tangent spaces B p are not even reflexive, let alone self-dual, we abandon the idea of having a Riemannian structure on M and propose a weaker version of duality, the duality with respect to a continuous scalar product. When restricted to finite dimensional submanifolds, the scalar product becomes a Riemannian metric and the original definition of duality is recovered. Let, p be a continuous positive definite symmetric bilinear form assigned continuously to each B p T p M. A pair of connection (, ) are said to be dual with respect to, p if (4.1) τu, τ v q = u, v p for all u, v T p M and all smooth curves γ : [0, 1] M such that γ(0) = p,γ(1) = q, 14

15 where τ and τ denote the parallel transports associated with and, respectively. Equivalently, (, ) are dual with respect to, p if (4.2) v ( s 1, s 2 p ) = v s 1, s 2 p + s 1, vs 2 p for all v T p M and all smooth vector fields s 1 and s 2. We stress that this is not the kind of duality obtained when a conection on a bundle F is used to construct another connection on the dual bundle F as defined, for instance, in Gibilisco and Pistone (1998, definiton 6). The latter is a construction that does not involve any metric or scalar product and the two connections act on different bundles, while Amar-Nagaoka duality is a duality with respect to a specific scalar product (or metric, in the finite dimensional case) and the dual connections act on the same bundle, the tangent bundle. The infinite dimensional generalisation of the Fisher information is given by (4.3) u, v p = (uv)pdµ, u, v B p. This is clearly bilinear, symmetric and positive definite. Moreover, continuity follows from that fact that, since L Φ 1 (p) L Φ 2 (p) L Φ 3 (p), the generalised Hölder inequality gives (4.4) u, v p K u Φ1,p v Φ1,p, u, v B p. The use of exponential Orlicz space to model the manifold induces naturally a globally flat affine connection on the tangent bundle T M, called the exponential connection and denoted by (1). It is defined on each connected component of the manifold M, which is equivalent to saying that its parallel transport is defined between points connected by an exponential model. If p and q are two such points, then the exponential parallel transport 15

16 is given by (4.5) τ (1) pq : T p M T q M u u uqdµ. It is well defined, since T p M = B p and T q M = B q are subsets of the same set L Φ 1 (p) L Φ 1 (q), so the exponential parallel transport just subtracts a constant from u to make it centred around the right point. We now want to define the dual connection to (1) with respect to the Fisher information. In Grasselli (2001, lemma 2.3.1) we proved a result that allowed the definiton of the mixture parallel transport for tangent vectors with absolutely continuous Orlicz norm (see Rao and Ren (1991, page 84)). A modification of that argument is presented bellow and allows for the definition of the mixture parallel transport for all tangent vectors. Proposition 4.1. Let p and q be two points in the same connected component of M. Then p q u B q, for all u B p. Proof: From the hypothesis, p and q can be connected by a one dimensional exponential model, which implies that M Φ 1 (p) M Φ 1 (q), since they are the completion of the same set L under equivalent norms. Using lemma 2.1, we find that L Φ 3 (p) (M Φ 1 (p)) (M Φ 1 (q)) L Φ 3 (q), in the sense that they are the same set furnished with equivalent norms. Therefore, there 16

17 exist a constant k such that N Φ3,p( ) kn Φ3,q( ). We then obtain that p { q u Φ 1,q = sup { = sup { = k sup { k sup p q uv qdµ : v LΦ 3 (q), p q uv p ( v q u k = k u Φ1,p <, } Φ 3 (v)qdµ 1 } qdµ : v LΦ 3 (p), N Φ3,q(v) 1 ) qdµ : v L Φ 3 (q), N Φ3,q(v) 1} uf pdµ : f L Φ 3 (p), N Φ3,p(f) 1 } so that p q u LΦ 1 (q). But since p q u is centred around q we have that p q u B q. We can then define the mixture connection on T M, as τ ( 1) pq : T p M T q M (4.6) u p q u, for p and q in the same connected component of M. We notice that it is also globally flat and prove the following result. Theorem 4.1. The connections (1) and ( 1) are dual with respect to the Fisher information. Proof: We have that τ (1) u, τ ( 1) v q = = = u uqdµ, p q v u p q vqdµ ( uvpdµ = u, v p, u, v B p, q ) uqdµ p q vqdµ where, to go from the second to the third line above, we used that v is centred around p. 17

18 5. Covariant Derivatives and Geodesics We begin this section recalling that the covariant derivative for the exponential connection has been computed in Gibilisco and Pistone (1998, proposition 25) and found to be (5.1) ( (1) v s ) (p) = (d v s)(p) E p ((d v s)(p)), where s S(T M) is a differentiable vector field, v T p M is a tangent vector at p, E p ( ) denotes the expected value with respect to the measure pdµ and d v s denotes the directional derivative in L Φ 1 (p) of s composed with the patch e p as a map between Banach spaces. We first notice that this gives the usual covariant derivative for the exponential connection in parametric information geometry (see Murray and Rice (1993, pages )). For if {θ 1,..., θ n } is a coordinate system in a finite dimensional submanifold of M we can put v = θ i and s = log p θ j as the 1-representation (see Amari and Nagaoka (2000, page 41)) of the vector field θ j, then (5.1) reduces to ( ) (1) (p) = 2 log p θ i θ j θ i θ E j p ( 2 log p θ i θ j which is the classical finite dimensional result. Accordingly, parametric exponential models are flat submanifolds of M. We can also verify that one-dimensional exponential models of the form ), q(t) = are geodesics for (1), since if s(t) = d dt ( etu Z p (tu) p log q(t) p ) is the vector field tangent to q(t) at 18

19 each point t (according to Pistone and Rogantin (1999, proposition 21)), then (5.1) gives ( ) ( ) (1) q(t) s(t) (q(t)) = d2 d 2 dt (tu log Z p(tu)) E 2 q(t) dt (tu log Z p(tu)) ( 2 ) = d2 d 2 dt log Z p(tu) + E 2 q(t) dt log Z p(tu) 2 = 0. As we emphasised in the previous section, the definition given in Gibilisco and Pistone (1998, definition 22) for the mixture connection differs from ours (due to the different concepts of duality employed), so we have to compute its covariant derivative according to the definition given here, if only to have the notation right. Proposition 5.1. Let γ : ( ε, ε) M be a smooth curve such that p = γ(0) and v = (e 1 p γ) (0) and let s S(T M) be a differentiable vector field. Then (5.2) ( ( 1) v s ) (p) = (d v s)(p) + s(p)l (0), where l(t) = log(γ(t)). Proof: ( ( 1) v s ) ( (p) = ( 1) (e 1 1 = lim h 0 h 1 = lim h 0 h ) s (γ(0)) p γ) (0) [ ] τ ( 1) γ(h)γ(0) s(γ(h)) s(γ(0)) [ ] γ(h) s(γ(h)) s(γ(0)) γ(0) 1 = lim [s(γ(h)) s(γ(0))] + lim h 0 h h 0 = (d v s)(p) + s(p)l (0). [ ] 1 γ(h) γ(0) s(γ(h)) h γ(0) Again this reduces to the parametric result for the case of submanifolds of M. Put v = θ i and s = log p θ j in (5.2) to obtain ( θ j ( 1) θ i ) (p) = 2 log p θ i θ j + log p log p, θ i θ j 19

20 which is the classical finite dimensional result. The mixture connection owes its name to the fact that in the parametric version of information geometry a convex mixture of two densities describes a geodesic with respect to ( 1). To verify the same statement in the non-parametric case, we first need to check that a convex mixture of two points in a connected component of M remains in the same connected component. Proposition 5.2. If q 1 and q 2 are two points in E(p) for some p M, then q(t) = tq 1 + (1 t)q 2 belongs to E(p) for all t (0, 1). Proof: We begin by writing q 1 = eu 1 Z p (u 1 ) p and q 2 = eu2 Z p (u 2 ) p, for some u 1, u 2 B p L Φ 1 (p). Therefore, there exist constants α 1 and α 2 such that Φ 1(α 1 u 1 )pdµ < and Φ 1(α 2 u 2 )pdµ <, or in other words, α 1 u 1, α 2 u 2 L Φ 1 (p). But since L Φ 1 (p) is a circled solid set (see Rao and Ren (1991, page 49)), if we take α = min(α 1, α 2 ) we also have that αu 1, αu 2 L Φ 1 (p). To simplify the notation, let us define ũ 1 = u 1 log Z p (u 1 ) and ũ 2 = u 2 log Z p (u 2 ). We want to show that, if we write eũp = q(t) = teũ1 p + (1 t)eũ2 p, then ũ is an element of L Φ 1 (p), so that u = ũ ũpdµ B p 20

21 and q(t) = eu Z p (u) p E(p). We have that eũ = teũ1 + (1 t)eũ2 which implies e αũ = ( teũ1 + (1 t)eũ2 ) α 2 α ( t α e αũ 1 + (1 t) α e αũ 2 ). Thus (5.3) e αũ pdµ 2 α t α e αũ 1 pdµ + 2 α (1 t) α e αũ 2 pdµ < since both αũ 1 and αũ 2 are in L Φ 1 (p). Similarly, we observe that e αũ = ( teũ1 + (1 t)eũ2 ) α 1 = (teũ1 + (1 t)e ũ 2) α 1 t α e = αũ t α e αũ 1. 1 Therefore (5.4) e αũ pdµ t α e αũ 1 pdµ <, since αũ 1 L Φ 1 (p). But this completes the proof, since (5.3) and (5.4) together imply that ũ L Φ 1 (p). We can now verify that a family of the form q(t) = tq 1 + (1 t)q 2, t (0, 1) 21

22 is a geodesic for ( 1). Let s(t) = d dt ( log q(t) p ) be the vector field tangent to q(t) at each point t (see Pistone and Rogantin (1999, proposition 21)), then (5.2) gives ( ) [ ( 1) s(t) (q(t)) = d2 (e 1 p q) (t) dt 2 log tq ] 1 + (1 t)q 2 p [ log tq 1 + (1 t)q 2 p ] ] d dt [log tq 1 + (1 t)q 2 ] + d dt = d [ p (q 1 q 2 ) dt tq 1 + (1 t)q 2 p ( ) p q 1 q 2 (q1 q 2 ) + tq 1 + (1 t)q 2 p tq 1 + (1 t)q 2 ( ) 2 ( ) 2 (q1 q 2 ) (q1 q 2 ) = + tq 1 + (1 t)q 2 tq 1 + (1 t)q 2 = α-connections In this section, we define the infinite dimensional analogue of the α-connections introduced in the parametric case independently in Čencov (1982) and Amari (1985). We use the same technique proposed in Gibilisco and Pistone (1998), namely exploring the geometry of spheres in the Lebesgue spaces L r, but modified in such a way that the resulting connections all act on the tangent bundle T M. We begin with the Amari-Nagaoka α-embeddings l α : M L r (µ) (6.1) p 2 1 α p 1 α 2, α ( 1, 1), where r = 2 1 α. Observe that [ 1/r [ l α (p) r = l α (p) dµ] r = ( ) r ] 1/r 2 1 α p 1 α 2 dµ = r, so l α (p) S r (µ), the sphere of radius r in L r (µ) (we warn the reader that, throughout 22

23 this paper, the r in S r refers to the fact that this is a sphere of radius r, while the fact that it is a subset of L r is judiciously omitted from the notation). According to Gibilisco and Pistone (1998), the tangent space to S r (µ) at a point f is (6.2) T f S r (µ) = where f = sgn(f) f r 1. In our case, { } g L r (µ) : gf dµ = 0, (6.3) f = l α (p) = rp 1/r so that (6.4) f = ( rp 1/r) r 1 = r r 1 p 1 1/r. Therefore, the tangent space to S r (µ) at rp 1/r is (6.5) T rp 1/rS r (µ) = { } g L r (µ) : gp 1 1/r dµ = 0. We now look for a concrete realization of the push-forward of the map l α when the tangent space T p M is identified with B p as in the previous sections. Since ( ) d 2 dt 1 α p 1 α 2 = p 1 α 2 the push-forward can be formally implemented as d log p, dt (l α ) (p) : T p M = B p T rp 1/rS r (µ) (6.6) u p 1 α 2 u. For this to be well defined, we need to check that p 1 α 2 u is an element of T rp 1/rS r (µ). Indeed, since L Φ 1 (p) L s (p) for all s > 1, we have that p 1/r u r dµ = u r pdµ <, 23

24 so p 1 α 2 u L r (µ). Moreover which verifies that p 1/r u T rp 1/rS r (µ). p 1/r up 1 1/r dµ = updµ = 0, The sphere S r (µ) inherits a natural connection obtained by projecting the trivial connection on L r (µ) (the one where parallel transport is just the identity map) onto its tangent space at each point. For each f S r (µ), a canonical projection from the tangent space T f L r (µ) onto the tangent space T f S r (µ) can be uniquely defined, since the spaces L r (µ) are uniformly convex (see Gibilisco and Isola (1999)), and is given by (6.7) Π f : T f L r (µ) T f S r (µ) ) g g (r r gf dµ f. When f = rp 1/r and f = r r 1 p 1 1/r, the formula above gives (6.8) Π rp 1/r : T rp 1/rL r (µ) T rp 1/rS r (µ) ( ) g g gp 1 1/r dµ p 1/r. We are now ready to define the α-connections. In what follows, is used to denote the trivial connection on L r (µ). Definition 6.1. For α ( 1, 1), let γ : ( ε, ε) M be a smooth curve such that p = γ(0) and v = γ(0) and let s S(T M) be a differentiable vector field. The α-connection on T M is given by [ (6.9) ( α v s) (p) = (l α ) 1 (p) Π ] rp 1/r (lα) (p) v(l α ) (γ(t)) s. 24

25 A formula like (6.9) deserves a more wordy explanation. We take the vector field s and push it forward along the curve γ to obtain (l α ) (γ(t)) s. Then we take its covariant derivative with respect to the trival connection in the direction of (l α ) (p) v, the pushforward of the tangent vector v. The result is a vector in T rp 1/rL r (µ), so we use the canonical projection Π to obtain a vector in T rp 1/rS r (µ). Finally, since this belongs to the image of (l α ) (p) v (see the proof of the next theorem for a justification of this statement) we pull it back to T p M using (l α ) 1 (p). The next theorem shows that the relation between the exponential, the mixture and the α-connections just defined is the same as in the parametric case. Its proof resembles the calculation in the last pages of Gibilisco and Pistone (1998), except that all our connections act on the same bundle, whereas in Gibilisco and Pistone (1998) each one is defined on its own bundle-connection pair. Theorem 6.1. The exponential, mixture and α-covariant derivatives on T M satisfy (6.10) α = 1 + α 2 (1) + 1 α ( 1). 2 Proof: Let l(t) = log(γ(t)) with γ, s, p and v as in definition 6.1. Before explicitly computing the derivatives in (6.9), observe that since s(γ(t)) B γ(t) for each t ( ε, ε), we have d s(γ(t))γ(t)dµ = 0 dt ds(γ(t)) γ(t)dµ = dt ds(γ(t)) γ(t)dµ = dt s(γ(t)) dγ(t) dµ dt s(γ(t)) d log(γ(t)) γ(t)dµ. dt 25

26 In particular, for t = 0, we get (6.11) (d v s) (p)pdµ = s(p) l(0)pdµ We can now look more closely at (6.9). It reads ( α v s) (p) = p [Π 1/r ] rp 1/r p vγ(t) 1/r s(γ(t)) 1/r = p [Π 1/r d ( rp 1/r γ(t) 1/r s(γ(t)) ) ] dt t=0 ( )] 1 = p [Π 1/r d log(γ(t)) rp 1/r p1/r 1/r ds(γ(t)) r dt s(p) + p t=0 dt ( )] t=0 1 = p [Π 1/r rp 1/r r p1/r l (0)s(p) + p 1/r (d v s) (p) = 1 ( ) 1 r l (0)s(p) + (d v s)(p) r l (0)s(p) + (d v s)(p) pdµ. At this point we make use of (6.11) in the integrand above to obtain ( α v s) (p) = 1 r l (0)s(p) + (d v s)(p) + ( 1 + α = 2 = 1 + α ( (1) v s ) (p) + 1 α 2 2 ( ) 1 r 1 ) [(d v s)(p) E p ((d v s)(p))] + ( ( 1) v s ) (p). (d v s)(p)pdµ ( 1 α 2 ) [(d v s)(p) + s(p)l (0)] A direct application of (4.2) gives the following corollary. Corollary 6.1. The connections α and α are dual with respect to the Fisher information, p. The previous theorem shows that the α-connections are well defined objects at the level of covariant derivatives. We conclude this paper with an analogue of proposition 5.2, establishing that their parallel transport is well defined on M. Proposition 6.1. The connected component E(p) contains the α-geodesic between any two of its points. 26

27 Proof: Using the same notation as in proposition 5.2, we have that the α geodesic connecting q 1, q 2 E(p) is the pull back of the arc of great circle connecting their images f 1 = l α (q 1 ) and f 2 = l α (q 2 ) on the sphere S r (µ). Now if tf 1 + (1 t)f 2 is the straight line connecting f 1 and f 2 in L r (µ), then for each fixed t [0, 1] the corresponding point on the sphere of radius r is (6.12) f(t) = r k(t) [tf 1 + (1 t)f 2 ], where k(t) = tf 1 + (1 t)f 2 r. Let us write its inverse image with respect to the α embedding as eũp = (l α ) 1 (f(t)) = 1 k(t) r [tf 1 + (1 t)f 2 ] r, for some random variable ũ. Following the argument in proposition 5.2, we see that e αũ 1 = p α k(t) [tf αr 1 + (1 t)f 2 ] αr [ ] αr 2r [t αr e αu 1 + (1 t) αr e αu 2 ], k(t) so that (6.13) e αũ pdµ <, since both αu 1 and αu 2 are in L Φ 1 (p). Furthermore, e αũ = p α k(t) αr [tf 1 + (1 t)f 2 ] αr [ ] αr k(t) e αu 1, tr so that (6.14) e αũ pdµ <, which together with (6.13), imply that ũ L Φ 1. To complete the proof we can define (6.15) u = ũ ũpdµ, 27

28 to obtain that u B p and (6.16) (l α ) 1 (f(t)) = eu Z p (u) p E(p). 7. Further developments We have seen that using L Φ 1 as the coordinate space for the infinite dimensional information manifold leads to a well defined mixture connection which is dual to the exponential connection with respect to the generalised Fisher metric. Both connections are flat, and the next natural step in our programme is to prove that the Kullback- Leibler relative entropy is the statistical divergence associated with the dualistic triple ( (1), ( 1), g) (see Amari and Nagaoka (2000)). In the same vein, we have also shown how to obtain the interpolation family of α-covariant derivatives satisfying the same convex mixture structure as in finite dimensions, which prompts us to the study of the infinite dimensional analogues of the α-divergences. The completion of this circle of ideas would be an infinite dimensional generalisation of the projection theorems obtained by Amari in the finite dimensional case. Namely, one seeks to prove that, given a point p M and an α-flat submanifold S, then the point q S with minimal α-divergence from p is obtained by projecting p orthogonally (with respect to the Fisher metric) onto S following a α-geodesic. An equally ambitious result to be pursued is the infinite dimensional analogue of Centsov s theorem, which would characterise the generalised Fisher metric as the unique continuous scalar product on M which is reduced by Markov morphisms on the tangent space. 28

29 Acknowledgements I would like to acknowledge the financial suport from CAPES, Brazil, and of an ORS scholarship from the British government received while I was a PhD student at King s College London, where this research was initiated. It is a pleasure to thank R.F. Streater for suggesting the theme and thoroughly discussing the results obtained. I am also grateful to P. Gibilisco and G. Pistone for estimulating discussions during the Information Geometry and Applications conference held in Pescara, July References Amari, S.-i. (1985). Differential-geometrical Methods in Statistics. Springer-Verlag, New York. Amari, S.-i. and Nagaoka, H. (2000). Methods of Information Geometry. American Mathematical Society, Providence, RI. Translated from the 1993 Japanese original by Daishi Harada. Čencov, N. N. (1982). Statistical Decision Rules and Optimal Inference. American Mathematical Society, Providence, R.I. Translation from the Russian edited by Lev J. Leifman. Dawid, A. P. (1975). On the concepts of sufficiency and ancillarity in the presence of nuisance parameters. Journal of the Royal Statistical Society B Efron, B. (1975). Defining the curvature of a statistical problem (with applications to second order efficiency). Annals of Statistics With a discussion by C. R. Rao, Don A. Pierce, D. R. Cox, D. V. Lindley, Lucien LeCam, J. K. Ghosh, J. Pfanzagl, Neils Keiding, A. P. Dawid, Jim Reeds and with a reply by the author. 29

30 Gibilisco, P. and Isola, T. (1999). Connections on statistical manifolds of density operators by geometry of noncommutative L p -spaces. Infinite Dimensional Analysis Quantum Probability and Related Topics Gibilisco, P. and Pistone, G. (1998). Connections on non-parametric statistical manifolds by Orlicz space geometry. Infinite Dimensional Analysis Quantum Probability and Related Topics Grasselli, M. R. (2001). Classical and Quantum Information Geometry. Ph.D. thesis, King s College London. Jeffreys, H. (1946). An invariant form for the prior probability in estimation problems. Proceedings of Royal Society A Kass, R. E. and Vos, P. W. (1997). Geometrical Foundations of Asymptotic Inference. John Wiley & Sons Inc., New York. A Wiley-Interscience Publication. Krasnosel skiĭ, M. A. and Rutickiĭ, J. B. (1961). Convex Functions and Orlicz spaces. P. Noordhoff Ltd., Groningen. Lang, S. (1995). Differential and Riemannian Manifolds. 3rd ed. Springer-Verlag, New York. Murray, M. K. and Rice, J. W. (1993). Differential Geometry and Statistics. Chapman & Hall, London. Pistone, G. (2001). New ideas in nonparametric estimation. In Disordered and Complex Systems (P. Sollich et al., eds.). American Institute of Physics. AIP Conference Proceedings

31 Pistone, G. and Rogantin, M. P. (1999). The exponential statistical manifold: mean parameters, orthogonality and space transformations. Bernoulli Pistone, G. and Sempi, C. (1995). An infinite-dimensional geometric structure on the space of all the probability measures equivalent to a given one. Annals of Statistics Rao, C. R. (1945). Information and accuracy attainable in the estimation of statistical parameters. Bulletin of the Calcuta Mathematical Society Rao, M. M. and Ren, Z. D. (1991). Theory of Orlicz spaces. Marcel Dekker Inc., New York. Sollich, P. et al. (eds.) (2001). Disordered and Complex Systems. American Institute of Physics. AIP Conference Proceedings

Dual connections in nonparametric classical information geometry

Dual connections in nonparametric classical information geometry Ann Inst Stat Math 00 6:873 896 DOI 0.007/s0463-008-09-3 Dual connections in nonparametric classical information geometry M. R. Grasselli Received: 8 September 006 / Revised: 9 May 008 / Published online:

More information

Open Questions in Quantum Information Geometry

Open Questions in Quantum Information Geometry Open Questions in Quantum Information Geometry M. R. Grasselli Department of Mathematics and Statistics McMaster University Imperial College, June 15, 2005 1. Classical Parametric Information Geometry

More information

Classical and Quantum Information Geometry

Classical and Quantum Information Geometry Classical and Quantum Information Geometry Matheus da Rocha Grasselli A dissertation submitted to the University of London in partial fulfillment of the requirements for the degree of Doctor of Philosophy

More information

Applications of Algebraic Geometry IMA Annual Program Year Workshop Applications in Biology, Dynamics, and Statistics March 5-9, 2007

Applications of Algebraic Geometry IMA Annual Program Year Workshop Applications in Biology, Dynamics, and Statistics March 5-9, 2007 Applications of Algebraic Geometry IMA Annual Program Year Workshop Applications in Biology, Dynamics, and Statistics March 5-9, 2007 Information Geometry (IG) and Algebraic Statistics (AS) Giovanni Pistone

More information

CERTAIN PROPERTIES OF GENERALIZED ORLICZ SPACES

CERTAIN PROPERTIES OF GENERALIZED ORLICZ SPACES Volume 10 (2009), Issue 2, Article 37, 10 pp. CERTAIN PROPERTIES OF GENERALIZED ORLICZ SPACES PANKAJ JAIN AND PRITI UPRETI DEPARTMENT OF MATHEMATICS DESHBANDHU COLLEGE (UNIVERSITY OF DELHI) KALKAJI, NEW

More information

Chapter 3. Riemannian Manifolds - I. The subject of this thesis is to extend the combinatorial curve reconstruction approach to curves

Chapter 3. Riemannian Manifolds - I. The subject of this thesis is to extend the combinatorial curve reconstruction approach to curves Chapter 3 Riemannian Manifolds - I The subject of this thesis is to extend the combinatorial curve reconstruction approach to curves embedded in Riemannian manifolds. A Riemannian manifold is an abstraction

More information

Information Geometry and Algebraic Statistics

Information Geometry and Algebraic Statistics Matematikos ir informatikos institutas Vinius Information Geometry and Algebraic Statistics Giovanni Pistone and Eva Riccomagno POLITO, UNIGE Wednesday 30th July, 2008 G.Pistone, E.Riccomagno (POLITO,

More information

ON SUBSPACES OF NON-REFLEXIVE ORLICZ SPACES

ON SUBSPACES OF NON-REFLEXIVE ORLICZ SPACES ON SUBSPACES OF NON-REFLEXIVE ORLICZ SPACES J. ALEXOPOULOS May 28, 997 ABSTRACT. Kadec and Pelczýnski have shown that every non-reflexive subspace of L (µ) contains a copy of l complemented in L (µ). On

More information

Chapter 2 Exponential Families and Mixture Families of Probability Distributions

Chapter 2 Exponential Families and Mixture Families of Probability Distributions Chapter 2 Exponential Families and Mixture Families of Probability Distributions The present chapter studies the geometry of the exponential family of probability distributions. It is not only a typical

More information

CALCULUS ON MANIFOLDS. 1. Riemannian manifolds Recall that for any smooth manifold M, dim M = n, the union T M =

CALCULUS ON MANIFOLDS. 1. Riemannian manifolds Recall that for any smooth manifold M, dim M = n, the union T M = CALCULUS ON MANIFOLDS 1. Riemannian manifolds Recall that for any smooth manifold M, dim M = n, the union T M = a M T am, called the tangent bundle, is itself a smooth manifold, dim T M = 2n. Example 1.

More information

Statistics 612: L p spaces, metrics on spaces of probabilites, and connections to estimation

Statistics 612: L p spaces, metrics on spaces of probabilites, and connections to estimation Statistics 62: L p spaces, metrics on spaces of probabilites, and connections to estimation Moulinath Banerjee December 6, 2006 L p spaces and Hilbert spaces We first formally define L p spaces. Consider

More information

1.4 The Jacobian of a map

1.4 The Jacobian of a map 1.4 The Jacobian of a map Derivative of a differentiable map Let F : M n N m be a differentiable map between two C 1 manifolds. Given a point p M we define the derivative of F at p by df p df (p) : T p

More information

CHAPTER 3. Gauss map. In this chapter we will study the Gauss map of surfaces in R 3.

CHAPTER 3. Gauss map. In this chapter we will study the Gauss map of surfaces in R 3. CHAPTER 3 Gauss map In this chapter we will study the Gauss map of surfaces in R 3. 3.1. Surfaces in R 3 Let S R 3 be a submanifold of dimension 2. Let {U i, ϕ i } be a DS on S. For any p U i we have a

More information

F -Geometry and Amari s α Geometry on a Statistical Manifold

F -Geometry and Amari s α Geometry on a Statistical Manifold Entropy 014, 16, 47-487; doi:10.3390/e160547 OPEN ACCESS entropy ISSN 1099-4300 www.mdpi.com/journal/entropy Article F -Geometry and Amari s α Geometry on a Statistical Manifold Harsha K. V. * and Subrahamanian

More information

Lecture 4 Lebesgue spaces and inequalities

Lecture 4 Lebesgue spaces and inequalities Lecture 4: Lebesgue spaces and inequalities 1 of 10 Course: Theory of Probability I Term: Fall 2013 Instructor: Gordan Zitkovic Lecture 4 Lebesgue spaces and inequalities Lebesgue spaces We have seen how

More information

1 Differentiable manifolds and smooth maps

1 Differentiable manifolds and smooth maps 1 Differentiable manifolds and smooth maps Last updated: April 14, 2011. 1.1 Examples and definitions Roughly, manifolds are sets where one can introduce coordinates. An n-dimensional manifold is a set

More information

Math 6455 Nov 1, Differential Geometry I Fall 2006, Georgia Tech

Math 6455 Nov 1, Differential Geometry I Fall 2006, Georgia Tech Math 6455 Nov 1, 26 1 Differential Geometry I Fall 26, Georgia Tech Lecture Notes 14 Connections Suppose that we have a vector field X on a Riemannian manifold M. How can we measure how much X is changing

More information

DIFFERENTIAL GEOMETRY, LECTURE 16-17, JULY 14-17

DIFFERENTIAL GEOMETRY, LECTURE 16-17, JULY 14-17 DIFFERENTIAL GEOMETRY, LECTURE 16-17, JULY 14-17 6. Geodesics A parametrized line γ : [a, b] R n in R n is straight (and the parametrization is uniform) if the vector γ (t) does not depend on t. Thus,

More information

2 (Bonus). Let A X consist of points (x, y) such that either x or y is a rational number. Is A measurable? What is its Lebesgue measure?

2 (Bonus). Let A X consist of points (x, y) such that either x or y is a rational number. Is A measurable? What is its Lebesgue measure? MA 645-4A (Real Analysis), Dr. Chernov Homework assignment 1 (Due 9/5). Prove that every countable set A is measurable and µ(a) = 0. 2 (Bonus). Let A consist of points (x, y) such that either x or y is

More information

Information Geometry: Background and Applications in Machine Learning

Information Geometry: Background and Applications in Machine Learning Geometry and Computer Science Information Geometry: Background and Applications in Machine Learning Giovanni Pistone www.giannidiorestino.it Pescara IT), February 8 10, 2017 Abstract Information Geometry

More information

Functional Analysis. Franck Sueur Metric spaces Definitions Completeness Compactness Separability...

Functional Analysis. Franck Sueur Metric spaces Definitions Completeness Compactness Separability... Functional Analysis Franck Sueur 2018-2019 Contents 1 Metric spaces 1 1.1 Definitions........................................ 1 1.2 Completeness...................................... 3 1.3 Compactness......................................

More information

A brief introduction to N functions and Orlicz function spaces. John Alexopoulos Kent State University, Stark Campus

A brief introduction to N functions and Orlicz function spaces. John Alexopoulos Kent State University, Stark Campus A brief introduction to N functions and Orlicz function spaces John Alexopoulos Kent State University, Stark Campus March 12, 24 2 Contents 1 Introduction: Uniform integrability 5 1.1 Definitions.....................................

More information

Tangent bundles, vector fields

Tangent bundles, vector fields Location Department of Mathematical Sciences,, G5-109. Main Reference: [Lee]: J.M. Lee, Introduction to Smooth Manifolds, Graduate Texts in Mathematics 218, Springer-Verlag, 2002. Homepage for the book,

More information

Part V. 17 Introduction: What are measures and why measurable sets. Lebesgue Integration Theory

Part V. 17 Introduction: What are measures and why measurable sets. Lebesgue Integration Theory Part V 7 Introduction: What are measures and why measurable sets Lebesgue Integration Theory Definition 7. (Preliminary). A measure on a set is a function :2 [ ] such that. () = 2. If { } = is a finite

More information

fy (X(g)) Y (f)x(g) gy (X(f)) Y (g)x(f)) = fx(y (g)) + gx(y (f)) fy (X(g)) gy (X(f))

fy (X(g)) Y (f)x(g) gy (X(f)) Y (g)x(f)) = fx(y (g)) + gx(y (f)) fy (X(g)) gy (X(f)) 1. Basic algebra of vector fields Let V be a finite dimensional vector space over R. Recall that V = {L : V R} is defined to be the set of all linear maps to R. V is isomorphic to V, but there is no canonical

More information

June 21, Peking University. Dual Connections. Zhengchao Wan. Overview. Duality of connections. Divergence: general contrast functions

June 21, Peking University. Dual Connections. Zhengchao Wan. Overview. Duality of connections. Divergence: general contrast functions Dual Peking University June 21, 2016 Divergences: Riemannian connection Let M be a manifold on which there is given a Riemannian metric g =,. A connection satisfying Z X, Y = Z X, Y + X, Z Y (1) for all

More information

PICARD S THEOREM STEFAN FRIEDL

PICARD S THEOREM STEFAN FRIEDL PICARD S THEOREM STEFAN FRIEDL Abstract. We give a summary for the proof of Picard s Theorem. The proof is for the most part an excerpt of [F]. 1. Introduction Definition. Let U C be an open subset. A

More information

Affine Connections: Part 2

Affine Connections: Part 2 Affine Connections: Part 2 Manuscript for Machine Learning Reading Group Talk R. Simon Fong Abstract Note for online manuscript: This is the manuscript of a one hour introductory talk on (affine) connections.

More information

NONSINGULAR CURVES BRIAN OSSERMAN

NONSINGULAR CURVES BRIAN OSSERMAN NONSINGULAR CURVES BRIAN OSSERMAN The primary goal of this note is to prove that every abstract nonsingular curve can be realized as an open subset of a (unique) nonsingular projective curve. Note that

More information

3 (Due ). Let A X consist of points (x, y) such that either x or y is a rational number. Is A measurable? What is its Lebesgue measure?

3 (Due ). Let A X consist of points (x, y) such that either x or y is a rational number. Is A measurable? What is its Lebesgue measure? MA 645-4A (Real Analysis), Dr. Chernov Homework assignment 1 (Due ). Show that the open disk x 2 + y 2 < 1 is a countable union of planar elementary sets. Show that the closed disk x 2 + y 2 1 is a countable

More information

Introduction to Real Analysis Alternative Chapter 1

Introduction to Real Analysis Alternative Chapter 1 Christopher Heil Introduction to Real Analysis Alternative Chapter 1 A Primer on Norms and Banach Spaces Last Updated: March 10, 2018 c 2018 by Christopher Heil Chapter 1 A Primer on Norms and Banach Spaces

More information

AN AFFINE EMBEDDING OF THE GAMMA MANIFOLD

AN AFFINE EMBEDDING OF THE GAMMA MANIFOLD AN AFFINE EMBEDDING OF THE GAMMA MANIFOLD C.T.J. DODSON AND HIROSHI MATSUZOE Abstract. For the space of gamma distributions with Fisher metric and exponential connections, natural coordinate systems, potential

More information

1 Exponential manifold by reproducing kernel Hilbert spaces

1 Exponential manifold by reproducing kernel Hilbert spaces 1 Exponential manifold by reproducing kernel Hilbert spaces 1.1 Introduction The purpose of this paper is to propose a method of constructing exponential families of Hilbert manifold, on which estimation

More information

Introduction to Information Geometry

Introduction to Information Geometry Introduction to Information Geometry based on the book Methods of Information Geometry written by Shun-Ichi Amari and Hiroshi Nagaoka Yunshu Liu 2012-02-17 Outline 1 Introduction to differential geometry

More information

FUNCTIONAL ANALYSIS LECTURE NOTES: COMPACT SETS AND FINITE-DIMENSIONAL SPACES. 1. Compact Sets

FUNCTIONAL ANALYSIS LECTURE NOTES: COMPACT SETS AND FINITE-DIMENSIONAL SPACES. 1. Compact Sets FUNCTIONAL ANALYSIS LECTURE NOTES: COMPACT SETS AND FINITE-DIMENSIONAL SPACES CHRISTOPHER HEIL 1. Compact Sets Definition 1.1 (Compact and Totally Bounded Sets). Let X be a metric space, and let E X be

More information

Mean-field equations for higher-order quantum statistical models : an information geometric approach

Mean-field equations for higher-order quantum statistical models : an information geometric approach Mean-field equations for higher-order quantum statistical models : an information geometric approach N Yapage Department of Mathematics University of Ruhuna, Matara Sri Lanka. arxiv:1202.5726v1 [quant-ph]

More information

The small ball property in Banach spaces (quantitative results)

The small ball property in Banach spaces (quantitative results) The small ball property in Banach spaces (quantitative results) Ehrhard Behrends Abstract A metric space (M, d) is said to have the small ball property (sbp) if for every ε 0 > 0 there exists a sequence

More information

A crash course the geometry of hyperbolic surfaces

A crash course the geometry of hyperbolic surfaces Lecture 7 A crash course the geometry of hyperbolic surfaces 7.1 The hyperbolic plane Hyperbolic geometry originally developed in the early 19 th century to prove that the parallel postulate in Euclidean

More information

LECTURE 15: COMPLETENESS AND CONVEXITY

LECTURE 15: COMPLETENESS AND CONVEXITY LECTURE 15: COMPLETENESS AND CONVEXITY 1. The Hopf-Rinow Theorem Recall that a Riemannian manifold (M, g) is called geodesically complete if the maximal defining interval of any geodesic is R. On the other

More information

MAT 570 REAL ANALYSIS LECTURE NOTES. Contents. 1. Sets Functions Countability Axiom of choice Equivalence relations 9

MAT 570 REAL ANALYSIS LECTURE NOTES. Contents. 1. Sets Functions Countability Axiom of choice Equivalence relations 9 MAT 570 REAL ANALYSIS LECTURE NOTES PROFESSOR: JOHN QUIGG SEMESTER: FALL 204 Contents. Sets 2 2. Functions 5 3. Countability 7 4. Axiom of choice 8 5. Equivalence relations 9 6. Real numbers 9 7. Extended

More information

Implicit Functions, Curves and Surfaces

Implicit Functions, Curves and Surfaces Chapter 11 Implicit Functions, Curves and Surfaces 11.1 Implicit Function Theorem Motivation. In many problems, objects or quantities of interest can only be described indirectly or implicitly. It is then

More information

Spaces with Ricci curvature bounded from below

Spaces with Ricci curvature bounded from below Spaces with Ricci curvature bounded from below Nicola Gigli February 23, 2015 Topics 1) On the definition of spaces with Ricci curvature bounded from below 2) Analytic properties of RCD(K, N) spaces 3)

More information

ON A MAXIMAL OPERATOR IN REARRANGEMENT INVARIANT BANACH FUNCTION SPACES ON METRIC SPACES

ON A MAXIMAL OPERATOR IN REARRANGEMENT INVARIANT BANACH FUNCTION SPACES ON METRIC SPACES Vasile Alecsandri University of Bacău Faculty of Sciences Scientific Studies and Research Series Mathematics and Informatics Vol. 27207), No., 49-60 ON A MAXIMAL OPRATOR IN RARRANGMNT INVARIANT BANACH

More information

Chapter 16. Manifolds and Geodesics Manifold Theory. Reading: Osserman [7] Pg , 55, 63-65, Do Carmo [2] Pg ,

Chapter 16. Manifolds and Geodesics Manifold Theory. Reading: Osserman [7] Pg , 55, 63-65, Do Carmo [2] Pg , Chapter 16 Manifolds and Geodesics Reading: Osserman [7] Pg. 43-52, 55, 63-65, Do Carmo [2] Pg. 238-247, 325-335. 16.1 Manifold Theory Let us recall the definition of differentiable manifolds Definition

More information

CHAPTER 1 PRELIMINARIES

CHAPTER 1 PRELIMINARIES CHAPTER 1 PRELIMINARIES 1.1 Introduction The aim of this chapter is to give basic concepts, preliminary notions and some results which we shall use in the subsequent chapters of the thesis. 1.2 Differentiable

More information

PROBLEMS. (b) (Polarization Identity) Show that in any inner product space

PROBLEMS. (b) (Polarization Identity) Show that in any inner product space 1 Professor Carl Cowen Math 54600 Fall 09 PROBLEMS 1. (Geometry in Inner Product Spaces) (a) (Parallelogram Law) Show that in any inner product space x + y 2 + x y 2 = 2( x 2 + y 2 ). (b) (Polarization

More information

Welcome to Copenhagen!

Welcome to Copenhagen! Welcome to Copenhagen! Schedule: Monday Tuesday Wednesday Thursday Friday 8 Registration and welcome 9 Crash course on Crash course on Introduction to Differential and Differential and Information Geometry

More information

Loos Symmetric Cones. Jimmie Lawson Louisiana State University. July, 2018

Loos Symmetric Cones. Jimmie Lawson Louisiana State University. July, 2018 Louisiana State University July, 2018 Dedication I would like to dedicate this talk to Joachim Hilgert, whose 60th birthday we celebrate at this conference and with whom I researched and wrote a big blue

More information

Hyperplane Margin Classifiers on the Multinomial Manifold

Hyperplane Margin Classifiers on the Multinomial Manifold Hyperplane Margin Classifiers on the Multinomial Manifold Guy Lebanon LEBANON@CS.CMU.EDU John Lafferty LAFFERTY@CS.CMU.EDU School of Computer Science, Carnegie Mellon University, 5000 Forbes Ave., Pittsburgh

More information

Overview of normed linear spaces

Overview of normed linear spaces 20 Chapter 2 Overview of normed linear spaces Starting from this chapter, we begin examining linear spaces with at least one extra structure (topology or geometry). We assume linearity; this is a natural

More information

Information Geometry on Hierarchy of Probability Distributions

Information Geometry on Hierarchy of Probability Distributions IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 47, NO. 5, JULY 2001 1701 Information Geometry on Hierarchy of Probability Distributions Shun-ichi Amari, Fellow, IEEE Abstract An exponential family or mixture

More information

Differential Geometry MTG 6257 Spring 2018 Problem Set 4 Due-date: Wednesday, 4/25/18

Differential Geometry MTG 6257 Spring 2018 Problem Set 4 Due-date: Wednesday, 4/25/18 Differential Geometry MTG 6257 Spring 2018 Problem Set 4 Due-date: Wednesday, 4/25/18 Required problems (to be handed in): 2bc, 3, 5c, 5d(i). In doing any of these problems, you may assume the results

More information

(x, y) = d(x, y) = x y.

(x, y) = d(x, y) = x y. 1 Euclidean geometry 1.1 Euclidean space Our story begins with a geometry which will be familiar to all readers, namely the geometry of Euclidean space. In this first chapter we study the Euclidean distance

More information

5 Measure theory II. (or. lim. Prove the proposition. 5. For fixed F A and φ M define the restriction of φ on F by writing.

5 Measure theory II. (or. lim. Prove the proposition. 5. For fixed F A and φ M define the restriction of φ on F by writing. 5 Measure theory II 1. Charges (signed measures). Let (Ω, A) be a σ -algebra. A map φ: A R is called a charge, (or signed measure or σ -additive set function) if φ = φ(a j ) (5.1) A j for any disjoint

More information

Solutions to the Hamilton-Jacobi equation as Lagrangian submanifolds

Solutions to the Hamilton-Jacobi equation as Lagrangian submanifolds Solutions to the Hamilton-Jacobi equation as Lagrangian submanifolds Matias Dahl January 2004 1 Introduction In this essay we shall study the following problem: Suppose is a smooth -manifold, is a function,

More information

II. DIFFERENTIABLE MANIFOLDS. Washington Mio CENTER FOR APPLIED VISION AND IMAGING SCIENCES

II. DIFFERENTIABLE MANIFOLDS. Washington Mio CENTER FOR APPLIED VISION AND IMAGING SCIENCES II. DIFFERENTIABLE MANIFOLDS Washington Mio Anuj Srivastava and Xiuwen Liu (Illustrations by D. Badlyans) CENTER FOR APPLIED VISION AND IMAGING SCIENCES Florida State University WHY MANIFOLDS? Non-linearity

More information

Optimization and Optimal Control in Banach Spaces

Optimization and Optimal Control in Banach Spaces Optimization and Optimal Control in Banach Spaces Bernhard Schmitzer October 19, 2017 1 Convex non-smooth optimization with proximal operators Remark 1.1 (Motivation). Convex optimization: easier to solve,

More information

LECTURE: KOBORDISMENTHEORIE, WINTER TERM 2011/12; SUMMARY AND LITERATURE

LECTURE: KOBORDISMENTHEORIE, WINTER TERM 2011/12; SUMMARY AND LITERATURE LECTURE: KOBORDISMENTHEORIE, WINTER TERM 2011/12; SUMMARY AND LITERATURE JOHANNES EBERT 1.1. October 11th. 1. Recapitulation from differential topology Definition 1.1. Let M m, N n, be two smooth manifolds

More information

THE FUNDAMENTAL GROUP OF MANIFOLDS OF POSITIVE ISOTROPIC CURVATURE AND SURFACE GROUPS

THE FUNDAMENTAL GROUP OF MANIFOLDS OF POSITIVE ISOTROPIC CURVATURE AND SURFACE GROUPS THE FUNDAMENTAL GROUP OF MANIFOLDS OF POSITIVE ISOTROPIC CURVATURE AND SURFACE GROUPS AILANA FRASER AND JON WOLFSON Abstract. In this paper we study the topology of compact manifolds of positive isotropic

More information

HADAMARD FOLIATIONS OF H n. I

HADAMARD FOLIATIONS OF H n. I HADAMARD FOLIATIONS OF H n. I MACIEJ CZARNECKI Abstract. We introduce the notion of an Hadamard foliation as a foliation of Hadamard manifold which all leaves are Hadamard. We prove that a foliation of

More information

Vector fields Lecture 2

Vector fields Lecture 2 Vector fields Lecture 2 Let U be an open subset of R n and v a vector field on U. We ll say that v is complete if, for every p U, there exists an integral curve, γ : R U with γ(0) = p, i.e., for every

More information

CHAPTER 7. Connectedness

CHAPTER 7. Connectedness CHAPTER 7 Connectedness 7.1. Connected topological spaces Definition 7.1. A topological space (X, T X ) is said to be connected if there is no continuous surjection f : X {0, 1} where the two point set

More information

Terse Notes on Riemannian Geometry

Terse Notes on Riemannian Geometry Terse Notes on Riemannian Geometry Tom Fletcher January 26, 2010 These notes cover the basics of Riemannian geometry, Lie groups, and symmetric spaces. This is just a listing of the basic definitions and

More information

Risk Aversion over Incomes and Risk Aversion over Commodities. By Juan E. Martinez-Legaz and John K.-H. Quah 1

Risk Aversion over Incomes and Risk Aversion over Commodities. By Juan E. Martinez-Legaz and John K.-H. Quah 1 Risk Aversion over Incomes and Risk Aversion over Commodities By Juan E. Martinez-Legaz and John K.-H. Quah 1 Abstract: This note determines the precise connection between an agent s attitude towards income

More information

THE RADON NIKODYM PROPERTY AND LIPSCHITZ EMBEDDINGS

THE RADON NIKODYM PROPERTY AND LIPSCHITZ EMBEDDINGS THE RADON NIKODYM PROPERTY AND LIPSCHITZ EMBEDDINGS Motivation The idea here is simple. Suppose we have a Lipschitz homeomorphism f : X Y where X and Y are Banach spaces, namely c 1 x y f (x) f (y) c 2

More information

USING FUNCTIONAL ANALYSIS AND SOBOLEV SPACES TO SOLVE POISSON S EQUATION

USING FUNCTIONAL ANALYSIS AND SOBOLEV SPACES TO SOLVE POISSON S EQUATION USING FUNCTIONAL ANALYSIS AND SOBOLEV SPACES TO SOLVE POISSON S EQUATION YI WANG Abstract. We study Banach and Hilbert spaces with an eye towards defining weak solutions to elliptic PDE. Using Lax-Milgram

More information

The Skorokhod reflection problem for functions with discontinuities (contractive case)

The Skorokhod reflection problem for functions with discontinuities (contractive case) The Skorokhod reflection problem for functions with discontinuities (contractive case) TAKIS KONSTANTOPOULOS Univ. of Texas at Austin Revised March 1999 Abstract Basic properties of the Skorokhod reflection

More information

Structural and Multidisciplinary Optimization. P. Duysinx and P. Tossings

Structural and Multidisciplinary Optimization. P. Duysinx and P. Tossings Structural and Multidisciplinary Optimization P. Duysinx and P. Tossings 2018-2019 CONTACTS Pierre Duysinx Institut de Mécanique et du Génie Civil (B52/3) Phone number: 04/366.91.94 Email: P.Duysinx@uliege.be

More information

Convexity in R n. The following lemma will be needed in a while. Lemma 1 Let x E, u R n. If τ I(x, u), τ 0, define. f(x + τu) f(x). τ.

Convexity in R n. The following lemma will be needed in a while. Lemma 1 Let x E, u R n. If τ I(x, u), τ 0, define. f(x + τu) f(x). τ. Convexity in R n Let E be a convex subset of R n. A function f : E (, ] is convex iff f(tx + (1 t)y) (1 t)f(x) + tf(y) x, y E, t [0, 1]. A similar definition holds in any vector space. A topology is needed

More information

An introduction to Mathematical Theory of Control

An introduction to Mathematical Theory of Control An introduction to Mathematical Theory of Control Vasile Staicu University of Aveiro UNICA, May 2018 Vasile Staicu (University of Aveiro) An introduction to Mathematical Theory of Control UNICA, May 2018

More information

I teach myself... Hilbert spaces

I teach myself... Hilbert spaces I teach myself... Hilbert spaces by F.J.Sayas, for MATH 806 November 4, 2015 This document will be growing with the semester. Every in red is for you to justify. Even if we start with the basic definition

More information

Stabilization of Control-Affine Systems by Local Approximations of Trajectories

Stabilization of Control-Affine Systems by Local Approximations of Trajectories Stabilization of Control-Affine Systems by Local Approximations of Trajectories Raik Suttner arxiv:1805.05991v2 [math.ds] 9 Jun 2018 We study convergence and stability properties of control-affine systems.

More information

Generalized vector equilibrium problems on Hadamard manifolds

Generalized vector equilibrium problems on Hadamard manifolds Available online at www.tjnsa.com J. Nonlinear Sci. Appl. 9 (2016), 1402 1409 Research Article Generalized vector equilibrium problems on Hadamard manifolds Shreyasi Jana a, Chandal Nahak a, Cristiana

More information

GEOMETRIC QUANTIZATION

GEOMETRIC QUANTIZATION GEOMETRIC QUANTIZATION 1. The basic idea The setting of the Hamiltonian version of classical (Newtonian) mechanics is the phase space (position and momentum), which is a symplectic manifold. The typical

More information

Banach Spaces II: Elementary Banach Space Theory

Banach Spaces II: Elementary Banach Space Theory BS II c Gabriel Nagy Banach Spaces II: Elementary Banach Space Theory Notes from the Functional Analysis Course (Fall 07 - Spring 08) In this section we introduce Banach spaces and examine some of their

More information

EQUIVALENCE OF TOPOLOGIES AND BOREL FIELDS FOR COUNTABLY-HILBERT SPACES

EQUIVALENCE OF TOPOLOGIES AND BOREL FIELDS FOR COUNTABLY-HILBERT SPACES EQUIVALENCE OF TOPOLOGIES AND BOREL FIELDS FOR COUNTABLY-HILBERT SPACES JEREMY J. BECNEL Abstract. We examine the main topologies wea, strong, and inductive placed on the dual of a countably-normed space

More information

THE JORDAN-BROUWER SEPARATION THEOREM

THE JORDAN-BROUWER SEPARATION THEOREM THE JORDAN-BROUWER SEPARATION THEOREM WOLFGANG SCHMALTZ Abstract. The Classical Jordan Curve Theorem says that every simple closed curve in R 2 divides the plane into two pieces, an inside and an outside

More information

Contents: 1. Minimization. 2. The theorem of Lions-Stampacchia for variational inequalities. 3. Γ -Convergence. 4. Duality mapping.

Contents: 1. Minimization. 2. The theorem of Lions-Stampacchia for variational inequalities. 3. Γ -Convergence. 4. Duality mapping. Minimization Contents: 1. Minimization. 2. The theorem of Lions-Stampacchia for variational inequalities. 3. Γ -Convergence. 4. Duality mapping. 1 Minimization A Topological Result. Let S be a topological

More information

of conditional information geometry,

of conditional information geometry, An Extended Čencov-Campbell Characterization of Conditional Information Geometry Guy Lebanon School of Computer Science Carnegie ellon niversity lebanon@cs.cmu.edu Abstract We formulate and prove an axiomatic

More information

DIFFERENTIAL GEOMETRY. LECTURE 12-13,

DIFFERENTIAL GEOMETRY. LECTURE 12-13, DIFFERENTIAL GEOMETRY. LECTURE 12-13, 3.07.08 5. Riemannian metrics. Examples. Connections 5.1. Length of a curve. Let γ : [a, b] R n be a parametried curve. Its length can be calculated as the limit of

More information

On the distributional divergence of vector fields vanishing at infinity

On the distributional divergence of vector fields vanishing at infinity Proceedings of the Royal Society of Edinburgh, 141A, 65 76, 2011 On the distributional divergence of vector fields vanishing at infinity Thierry De Pauw Institut de Recherches en Mathématiques et Physique,

More information

Recall that if X is a compact metric space, C(X), the space of continuous (real-valued) functions on X, is a Banach space with the norm

Recall that if X is a compact metric space, C(X), the space of continuous (real-valued) functions on X, is a Banach space with the norm Chapter 13 Radon Measures Recall that if X is a compact metric space, C(X), the space of continuous (real-valued) functions on X, is a Banach space with the norm (13.1) f = sup x X f(x). We want to identify

More information

The local equicontinuity of a maximal monotone operator

The local equicontinuity of a maximal monotone operator arxiv:1410.3328v2 [math.fa] 3 Nov 2014 The local equicontinuity of a maximal monotone operator M.D. Voisei Abstract The local equicontinuity of an operator T : X X with proper Fitzpatrick function ϕ T

More information

Junior Seminar: Hyperbolic Geometry Lecture Notes

Junior Seminar: Hyperbolic Geometry Lecture Notes Junior Seminar: Hyperbolic Geometry Lecture Notes Tim Campion January 20, 2010 1 Motivation Our first construction is very similar in spirit to an analogous one in Euclidean space. The group of isometries

More information

An introduction to some aspects of functional analysis

An introduction to some aspects of functional analysis An introduction to some aspects of functional analysis Stephen Semmes Rice University Abstract These informal notes deal with some very basic objects in functional analysis, including norms and seminorms

More information

Functional Analysis HW #1

Functional Analysis HW #1 Functional Analysis HW #1 Sangchul Lee October 9, 2015 1 Solutions Solution of #1.1. Suppose that X

More information

Banach-Alaoglu theorems

Banach-Alaoglu theorems Banach-Alaoglu theorems László Erdős Jan 23, 2007 1 Compactness revisited In a topological space a fundamental property is the compactness (Kompaktheit). We recall the definition: Definition 1.1 A subset

More information

arxiv:math/ v1 [math.dg] 23 Dec 2001

arxiv:math/ v1 [math.dg] 23 Dec 2001 arxiv:math/0112260v1 [math.dg] 23 Dec 2001 VOLUME PRESERVING EMBEDDINGS OF OPEN SUBSETS OF Ê n INTO MANIFOLDS FELIX SCHLENK Abstract. We consider a connected smooth n-dimensional manifold M endowed with

More information

Symplectic and Kähler Structures on Statistical Manifolds Induced from Divergence Functions

Symplectic and Kähler Structures on Statistical Manifolds Induced from Divergence Functions Symplectic and Kähler Structures on Statistical Manifolds Induced from Divergence Functions Jun Zhang 1, and Fubo Li 1 University of Michigan, Ann Arbor, Michigan, USA Sichuan University, Chengdu, China

More information

Normed Vector Spaces and Double Duals

Normed Vector Spaces and Double Duals Normed Vector Spaces and Double Duals Mathematics 481/525 In this note we look at a number of infinite-dimensional R-vector spaces that arise in analysis, and we consider their dual and double dual spaces

More information

L p Functions. Given a measure space (X, µ) and a real number p [1, ), recall that the L p -norm of a measurable function f : X R is defined by

L p Functions. Given a measure space (X, µ) and a real number p [1, ), recall that the L p -norm of a measurable function f : X R is defined by L p Functions Given a measure space (, µ) and a real number p [, ), recall that the L p -norm of a measurable function f : R is defined by f p = ( ) /p f p dµ Note that the L p -norm of a function f may

More information

Monotone Point-to-Set Vector Fields

Monotone Point-to-Set Vector Fields Monotone Point-to-Set Vector Fields J.X. da Cruz Neto, O.P.Ferreira and L.R. Lucambio Pérez Dedicated to Prof.Dr. Constantin UDRIŞTE on the occasion of his sixtieth birthday Abstract We introduce the concept

More information

1. Geometry of the unit tangent bundle

1. Geometry of the unit tangent bundle 1 1. Geometry of the unit tangent bundle The main reference for this section is [8]. In the following, we consider (M, g) an n-dimensional smooth manifold endowed with a Riemannian metric g. 1.1. Notations

More information

10. Smooth Varieties. 82 Andreas Gathmann

10. Smooth Varieties. 82 Andreas Gathmann 82 Andreas Gathmann 10. Smooth Varieties Let a be a point on a variety X. In the last chapter we have introduced the tangent cone C a X as a way to study X locally around a (see Construction 9.20). It

More information

Combinatorics in Banach space theory Lecture 12

Combinatorics in Banach space theory Lecture 12 Combinatorics in Banach space theory Lecture The next lemma considerably strengthens the assertion of Lemma.6(b). Lemma.9. For every Banach space X and any n N, either all the numbers n b n (X), c n (X)

More information

The nonsmooth Newton method on Riemannian manifolds

The nonsmooth Newton method on Riemannian manifolds The nonsmooth Newton method on Riemannian manifolds C. Lageman, U. Helmke, J.H. Manton 1 Introduction Solving nonlinear equations in Euclidean space is a frequently occurring problem in optimization and

More information

Notions such as convergent sequence and Cauchy sequence make sense for any metric space. Convergent Sequences are Cauchy

Notions such as convergent sequence and Cauchy sequence make sense for any metric space. Convergent Sequences are Cauchy Banach Spaces These notes provide an introduction to Banach spaces, which are complete normed vector spaces. For the purposes of these notes, all vector spaces are assumed to be over the real numbers.

More information

Information Geometry and Sufficient Statistics

Information Geometry and Sufficient Statistics Information Geometry and Sufficient Statistics Nihat Ay Jürgen Jost Hong Van Le Lorenz Schwachhoefer SFI WORKING PAPER: 2012-11-020 SFI Working Papers contain accounts of scientific work of the author(s)

More information

Banach Spaces V: A Closer Look at the w- and the w -Topologies

Banach Spaces V: A Closer Look at the w- and the w -Topologies BS V c Gabriel Nagy Banach Spaces V: A Closer Look at the w- and the w -Topologies Notes from the Functional Analysis Course (Fall 07 - Spring 08) In this section we discuss two important, but highly non-trivial,

More information

SHARP BOUNDARY TRACE INEQUALITIES. 1. Introduction

SHARP BOUNDARY TRACE INEQUALITIES. 1. Introduction SHARP BOUNDARY TRACE INEQUALITIES GILES AUCHMUTY Abstract. This paper describes sharp inequalities for the trace of Sobolev functions on the boundary of a bounded region R N. The inequalities bound (semi-)norms

More information