Dual connections in nonparametric classical information geometry

Size: px
Start display at page:

Download "Dual connections in nonparametric classical information geometry"

Transcription

1 Ann Inst Stat Math 00 6: DOI 0.007/s Dual connections in nonparametric classical information geometry M. R. Grasselli Received: 8 September 006 / Revised: 9 May 008 / Published online: 5 August 008 The Institute of Statistical Mathematics, Tokyo 008 Abstract We construct an infinite-dimensional information manifold based on exponential Orlicz spaces without using the notion of exponential convergence. We then show that convex mixtures of probability densities lie on the same connected component of this manifold, and characterize the class of densities for which this mixture can be extended to an open segment containing the extreme points. For this class, we define an infinite-dimensional analogue of the mixture parallel transport and prove that it is dual to the exponential parallel transport with respect to the Fisher information. We also define α-derivatives and prove that they are convex mixtures of the extremal ±-derivatives. Keywords Information geometry Statistical manifiold Fisher metric Orlicz spaces Amari Nagaoka duality Introduction Information geometry is the branch of probability theory dedicated to provide families of probability distributions with differential geometrical structures. One then uses the tools of differential geometry in order to have a clear and intuitive picture, as well as rigor, in a variety of practical applications ranging from neural networks to statistical estimation, from mathematical finance to nonequilibrium statistical mechanics see Sollich et al. 00. M. R. Grasselli B Department of Mathematics and Statistics, McMaster University, 80 Main Street West, Hamilton, ON L8S 4K, Canada grasselli@math.mcmaster.ca

2 874 M. R. Grasselli It was just over half a century ago that the Fisher information log px,θ g ij = θ i log px,θ θ j px,θdx was independently suggested by Rao 945 and Jeffreys 946 as a Riemannian metric for a parametric statistical model { px,θ,θ = θ,...,θ n }. The Riemannian geometry of statistical models was then studied as a mathematical curiosity for some years, with an emphasis in the geodesic distances associated with the Levi Civita connection for this metric. A greater amount of attention was devoted to the subject after Efron 975 introduced the concept of statistical curvature, pointing out its importance to statistical inference, as well as implicitly using a new affine connection, which would be known as the exponential connection. This exponential connection, together with another connection, later to be called the mixture connection, were further investigated by Dawid 975. The work of several years on the geometric aspects of parametric statistical models culminated with the masterful account in Amari 985, where the whole finite dimensional differential-geometric machinery is employed, including a one-parameter family of α-connections, the essential concept of duality and the notions of statistical divergence, projections and minimization procedures. Among the successes of the research at these early stages one could single out the rigidity of the geometric structures, such as the result concerning the uniqueness of the Fisher metric with respect to monotonicity in Čencov 98 and Amari s result concerning the uniqueness of the α-connections introduced by invariant statistical divergences. The ideas were then extensively used in statistics, in particular higher order asymptotic inference and curved exponential models see Kass and Vos 997. A different line of investigation in Information Geometry took off in the nineties: the search for a fully fledge infinite dimensional manifold of probability measures. As for motivations for this quest, one had, on the practical side, the need to deal with nonparametric models in statistics, where the shape of the underlying distribution is not assumed to be known. On a more fundamental level, there was the desire of having parametric statistical manifolds defined simply as finite dimensional submanifolds of a well defined manifold of all probability measures on a sample space. The motivating idea was already in Dawid 975 and was also addressed by Amari 985. The first sound mathematical construction, however, is due to Pistone and Sempi 995.Given a probability space, F,μ, they showed how to construct a Banach manifold M of all probability measures equivalent to μ. The Banach space used as generalized coordinates was the Orlicz space L, where is an exponential Young function. In a subsequent work, Pistone and Rogantin 999 analyzed further properties of this manifold, in particular the concepts of orthogonality and submanifolds. In Sect. 3, we review their construction and present an alternative proof of the main result in Pistone and Sempi 995, namely that the collection of covering neighborhoods U p and charts e p form an affine C -atlas for M. The crux is Proposition, where we show that the image of overlapping neighborhoods under any chart e p is open in the topology of the target space L.

3 Dual connections in nonparametric classical information geometry 875 The next step in this development was the Gibilisco and Pistone 998 definition of the exponential connection as the natural connection induced by the use of L. These authors then propose a mixture connection acting on the pretangent bundle T M and prove that it is dual to the exponential connection, in the sense of duality for Banach spaces. They further define the α-connections through generalized α-embeddings and show that the formal relation between the exponential, mixture and α-connections are the same as in the parametric case, that is α = + α e + α m. We argue, however, that neither of these two results duality for the exponential and mixture connection and α-connections as convex mixture of them is a proper generalization of the corresponding parametric ones, the reason being twofold. First, Banach space duality is not Amari Nagaoka duality. The latter refers to a metric being preserved by the joint action of two parallel transports, which are then said to be dual see 5. Secondly, all the α-connections in the parametric case act on the tangent bundle, whereas in Gibilisco and Pistone 998 each of them acts on its own bundle-connection pair, making a formula like at least difficult to interpret. In order to address these problems, we define in Sect. 4 an isomorphism τ of tangent spaces, which satisfy the Amari Nagaoka duality relation with respect to the Fisher metric when paired with the exponential parallel transport τ. However, it turns out that our map τ can only be rigorously defined between points q and in M whose ratio is a bounded random variable. Proposition 3 then characterizes the extended convex mixtures between such points. In Sect. 5, we rearrange the definitions of Gibilisco and Pistone 998 in order to have α-derivatives all acting on the same tangent bundle, but defined only for a restricted class of tangent vectors. We then show that the desired relation holds for our definitions. We then finalize the paper by showing that the α-auto-parallel curves between two points whose ratio is a bounded function belong to the connected component Ep. Orlicz spaces We present here the aspects of the theory of Orlicz spaces that will be relevant for the construction of the information manifold. For more comprehensive accounts, as well as for the proofs of all statements in this section, the reader is referred to the monographs of Rao and Ren 99 and Krasnosel skiĭ and Rutickiĭ 96. The general theory of Orlicz spaces is developed around the concept of a Young function, that is, a convex function : R R + satisfying i x = x, x R, ii 0 = 0, iii lim x

4 876 M. R. Grasselli For applications in information geometry, it is enough to consider Young functions of the form x = x 0 φtdt, x 0, 3 where φ :[0, [0, is nondecreasing, continuous and such that φ0 = 0 and lim φx =+. Young functions of this type include the monomials x x r /r, for < r <, and the following examples arising in information geometry: x = cosh x, 4 x = e x x, 5 3 x = + x log + x x 6 in the sequel,, and 3 will always refer to these three particular functions, with other symbols being used to denote generic Young functions. When a Young function is given in the form 3 we can define its complementary conjugate function as the Young function given by y = y 0 ψtdt, y 0, 7 where ψ is the inverse of φ. One can verify that, 3 and x r /r, x s /s, with r + s =, are examples of complementary pairs. For a general Young function, the complementary function is given less constructively by y = sup{x 0 : x y x}. 8 There are many different ways of introducing a partial order on the class of Young functions. A particularly straightforward one is to say that a Young function is stronger than another Young function, denoted by, if there exist a constant a > 0 such that x ax, x x 0, 9 for some x 0 0 depending on a. For example, one can verify that x 3 x r r x s s 0 whenever < r s <. Two Young functions and are said to be equivalent if and, that is, if there exist real numbers 0 < c c < and x 0 0 such that c x x c x, x x 0.

5 Dual connections in nonparametric classical information geometry 877 For example, the functions and are equivalent, both being of exponential type. Now let,, P be a probability space. The Orlicz class associated with a Young function is defined as L P = { f : R, measurable : } f dp <. Since P is a finite measure, the Banach space L,, P of essentially bounded random variables is easily seen to be a subset of L P for any Young function.it is easy to see that L P is a convex set and that h L P and f h imply that f L P. However, in general, L P is not a vector space, which leads to the definition of the Orlicz space associated with a Young function as { } L P = f : R, measurable : αf dp <, for some α>0, 3 furnished with the Luxembourg norm see Rao and Ren 99, p. 67 { } f N f = inf k > 0 : dp. 4 k or with the equivalent Orlicz norm see Rao and Ren 99, p. 6 { } f = sup fg dp : g L P, gdp, 5 where is the complementary Young function to. We observe for later use that f dp iffn f seerao and Ren 99, p. 54. A key ingredient in the analysis of Orlicz spaces is the generalized Hölder inequality see Rao and Ren 99, p. 58. If and are complementary Young functions, f L P, g L P, then fg dp N f N g. 6 It follows that each element f L P defines a continuous linear functional on L P, so that if we denote its topological dual by L we obtain the continuous injection L L for any pair of complementary Young functions. If then there exist a constant k such that N kn and therefore L P L P see Rao and Ren 99, p. 55. For instance, due to 0 we obtain that for < r s < L L s L r L 3 L, 7 where L r, r denote the usual Lebesgue spaces on,, P, which coincide with the Orlicz space defined by the Young functions x r /r, r. If two Young functions are equivalent, then the Orlicz spaces associated with them are isomorphic,

6 878 M. R. Grasselli that is, they coincide as sets and have equivalent norms. For example, we have that L P = L P. 3 The Pistone Sempi information manifold We start by reviewing the construction of an infinite dimensional information manifold along the lines of Pistone and Sempi 995; Pistone and Rogantin 999; Gibilisco and Pistone 998. Consider the set M of all densities of probability measures equivalent to a reference measure μ, that is, M M,,μ={f : R, measurable : f > 0 a.e. and f dμ = }. For each point p M, letl p be the exponential Orlicz space with norm N p over the probability space,, pdμ and consider its closed subspace of p-centred random variables B p ={u L p : updμ = 0} 8 as the coordinate Banach space. In probabilistic terms, the set L p corresponds to random variables whose moment generating function with respect to the probability pdμ is finite on a neighborhood of the origin see Pistone and Sempi 995, proposition.3. In statistics this are exactly the random variables used to define the one dimensional exponential model pt associated with a point p M and a random variable u: pt = etu p, t ε, ε. 9 Z p tu In particular, if we denote by V p the unit ball in B p, then it follows that the moment generating functional Z p u = eu pdμ is finite on V p see Pistone and Sempi 995, proposition.4. The underlying idea for the Pistone Sempi manifold is to parametrize the neighborhoods around points p M by all possible one dimensional exponential models passing through p. As a preliminary result, we mention that if two densities p and q are connected by a one dimensional exponential model, then L p = L q see Pistone and Rogantin 999, proposition 5. Pistone and Sempi define the inverse of a local chart around p M as e p : V p M u eu p. 0 Z p u

7 Dual connections in nonparametric classical information geometry 879 Denote by U p the image of V p under e p. We verify that e p is a bijection from V p to U p, since e u Z p u p = ev Z p v p implies that u v is a constant random variable, which must vanish, since both u,v have zero p-expectation. Then let e p be the inverse of e p on U p. One can check that e p : U p B p q log q q log pdμ. p p and also that, for any p, M, the transition functions are given by e e p : e p U p U p e u u + log U p U p p u + log p dμ. The main result of Pistone and Sempi 995 is to show that the charts defined above lead to a well-defined infinite dimensional manifold. The crucial part of the proof is to show that, for any two points p, M, the image of the overlapping neighborhoods U p U p under e p is open in the topology of the model space B p. To do so they introduce a topology induced by the notion of exponential convergence, with respect to which the sets U p U p are open, and then show that e p is sequentially continuous from exponential convergence to L -convergence. In what follows, we bypass the use of exponential convergence and present a direct proof that the Pistone and Sempi construction yields a Banach manifold. We first need to establish the following proposition. Proposition For any p, M, thesete p U p U p is open in the topology of B p. Proof Suppose that q U p U p for some p, M. Then we can write it as q = for some u V p.using, we find e q = u + log Since e q V p, we have that p N e q = N u + log eu Z p u p, u + log p dμ. p u + log p dμ <.

8 880 M. R. Grasselli Consider an open ball of radius r around u = e p q e p U p U p in the topology of B p, that is, consider the set A r ={v B p : N p v u <r} and let r be small enough so that A r v A r under e p is V p. Then the image in M of each point q = e p v = ev Z p v p. We claim that q U p U p if r is sufficiently small. Indeed, applying e to it we find e q = v + log p so N e q N v u + N u + log +N v u dμ v + log p dμ, p N v u + N e q = N v u + N e q + v u,p K, u + log p dμ + N v u dμ where K = N and we use the notation,p for the L -norm. As we have seen in the previous section, it follows from the growth properties of that there exists c > 0 such that f,p c N f. Moreover, since L p = L since both p and are connected to q by one dimensional exponential models it follows that there exists a constant c > 0 such that N f c N p f. Therefore, the previous inequality becomes N e q c N p v u + N e q + c c KN p v u = c + c K N p v u + N e q. Thus, if we choose r < N e q, c + c K

9 Dual connections in nonparametric classical information geometry 88 we will have that N e q < which proves the claim. What we have just proved is that e p U p U p consists entirely of interior points in the topology of B p, and is therefore open in B p. We then have that the collection {U p, e p, p M} satisfies the three axioms for being a C -atlas for M see Lang 995, p. 0. Moreover, since for each connect component all the spaces B p are isomorphic as topological vector spaces, we can say that M is a C -manifold modeled on B p. As usual, the tangent space at each point p M can be abstractly identified with B p. A concrete realization has been given in Pistone and Rogantin 999, proposition, namely each curve through p M is tangent to a one-dimensional exponential e model tu Z p tu p,sowetakeu as the tangent vector representing the equivalence class of such a curve. Finally, given a point p M, the connected component of M containing p coincides with the maximal exponential model obtained from p see Pistone and Sempi 995, Theorem 4.: { e u } Ep = Z p u p, u B p Z p, 3 where Z p ={f : Z p f < } 0. 4 The Fisher information and dual connections In the parametric version of information geometry, Amari and Nagaoka have introduced the concept of dual connections with respect to a Riemannian metric see Amari and Nagaoka 000 and the references given therein to their earlier work. For finite dimensional manifolds, any continuous assignment of a positive definite symmetric bilinear form to each tangent space determines a Riemannian metric. In infinite dimensions, we need to impose that the tangent space be self-dual and that the bilinear form be bounded. Since our tangent spaces B p are not even reflexive, let alone self-dual, we abandon the idea of having a Riemannian structure on M and propose a weaker version of duality, the duality with respect to a continuous scalar product. When restricted to finite dimensional submanifolds, the scalar product becomes a Riemannian metric and the original definition of duality is recovered. Let, p be a continuous positive definite symmetric bilinear form assigned continuously to each B p T p M. A pair of connections, are said to be dual with respect to, p if τu,τ v q = u,v p 4

10 88 M. R. Grasselli for all u,v T p M and all smooth curves γ :[0, ] M such that γ0 = p, γ = q, where τ and τ denote the parallel transports associated with and, respectively. Equivalently,, are dual with respect to, p if v s, s p = v s, s p + s, v s p 5 for all v T p M and all smooth vector fields s and s. We stress that this is not the kind of duality obtained when a connection on a bundle F is used to construct another connection on the dual bundle F as defined, for instance, in Gibilisco and Pistone 998, Definiton 6. The latter is a construction that does not involve any metric or scalar product and the two connections act on different bundles, while Amari Nagaoka duality is a duality with respect to a specific scalar product or metric, in the finite dimensional case and the dual connections act on the same bundle, the tangent bundle. The infinite dimensional generalization of the Fisher information is given by u,v p = uvpdμ, u,v B p. 6 This is clearly bilinear, symmetric and positive definite. Moreover, continuity follows from that fact that, since L p = L p L 3p, the generalized Hölder inequality gives u,v p KN p un p v, u,v B p. 7 The use of exponential Orlicz space to model the manifold naturally induces a globally flat affine connection on the tangent bundle T M, called the exponential connection and denoted by. It is defined on each connected component of the manifold M, which is equivalent to saying that its parallel transport is defined between points connected by an exponential model. If q and are two such points, then the exponential parallel transport is given by τ q : T q M T q M u u u dμ. 8 It is a well-defined isomorphism, since T q M = B q and T q M = B q are subsets of the same Orlicz space L q = L, so the exponential parallel transport just subtracts a constant from u to make it centred around the right point. We now want to obtain the dual parallel transport to τ with respect to the Fisher information, which in the parametric version of information geometry is called the mixture parallel transport since it is derived from the convex mixture of two densities. We therefore start with a result regarding such mixtures.

11 Dual connections in nonparametric classical information geometry 883 Proposition If q and are two points in U p for some p M, then belongs to Ep for all t [0, ]. Proof We begin by writing qt = tq + t q = eu Z p u p and = eu Z p u p, for some u, u V p L p. Therefore, there exist constants β > and β > such that β u pdμ < and β u pdμ <. To simplify the notation, let us define ũ = u log Z p u and ũ = u log Z p u. We want to show that, if we write eũ p = qt = teũ p + teũ p, then ũ is an element of Z p, so that u =ũ ũpdμ B p Z p and qt = eu Z p u p Ep. For this, let β = minβ,β > and observe that, on account of the inequality a + b β β a β + b β, we have that e βũ = teũ + teũ β β t β e βũ + t β e βũ. Thus e βũ pdμ β t β e βũ pdμ + β t β e βũ pdμ < 9 since both βũ and βũ are in L p. On the other hand, we observe that e βũ = te ũ + teũ β t β e βũ.

12 884 M. R. Grasselli Therefore e βũ pdμ t β e βũ pdμ <, 30 since βũ L p. But this completes the proof, since 9 and 30 together imply that ũ Z p. We now explore the possibility of extending the convex mixture between q and beyond these extreme points while maintaining positivity of qt. This depends on the relative sizes of q and, as shown in the next proposition: Proposition 3 Let q = eu Z p u p and = eu Z p u p be two points in U p. Then there exist constants ε > 0 and ε > 0 such that qt =[tq + t ] E p for all t ε, + ε if and only if u u L. Moreover, if u u L, then L 3q = L 3. Proof Suppose that qt E p for all t ε, + ε. Then since q ε 0 we have that ε q + + ε 0 q + ε ε. Similarly, since q + ε 0 we have that Therefore, the random variable u u = log + ε q ε 0 q ε + ε. q + log q p pdμ log q pdμ p is uniformly bounded from above and below. Conversely, if we have that u, u V p with u u L and K = u u, then Z p u Z p u e K q Z pu Z p u ek. We can then conclude that there exist constants 0 <ξ < and ξ > such that Then observe that for t 0wehave qt = ξ q ξ. t q + t tξ + t.

13 Dual connections in nonparametric classical information geometry 885 Therefore, provided ξ < t 0, the inequality above ensures that qt is strictly positive. Using the same notation as in the proof of Proposition, the same inequality gives e βũ pdμ = Similarly, for t 0wehave qt = t q + t Z p u β tξ + t β q p β pdμ t q + t tξ + t. e βu pdμ <. 3 Therefore, provided 0 t < ξ, this inequality shows that qt is strictly positive and that e βũ pdμ = t q + t Z p u β tξ + t β q p β pdμ e βu pdμ <. 3 Moreover, since the first part of the proof of Proposition holds provided qt is positive, we have that qt = tq + t E p for all t ξ, ξ, which completes the proof for the first statement by setting ε = ξ and ε = ξ ξ. For the second statement in the Proposition, observe that q = q + ε + ε + ε + ε 33 = q ε + ε q, + ε + ε 34 for positive densities q + ε and q ε. Therefore 3 βvq dμ< 3 βv dμ<, which implies that L 3q = L 3. Moreover, equations 33 and 34 can be used to show that the norms N 3,q and N 3, are equivalent. To see this, let u L u 3q and consider v = N 3,q u, so that N 3,q v = and consequently 3 vq dμ.

14 886 M. R. Grasselli Using 33 we see that 3 vq + ε dμ + ε 3 v dμ, + ε + ε which implies that ε + ε 3 v dμ. 35 On the other hand, it follows from convexity of 3 that ε 3 v + ε Inserting this into 35 and denoting K = +ε ε 3 v K ε + ε 3 v. gives dμ, which means that N 3, v K. Consequently we have that N 3, u = N 3, vn 3,q u KN 3,q u. Similarly, let f L f 3 and consider g = N 3, f, so that N 3, g = and consequently 3 g dμ. Using 34 we see that + ε 3 gq ε dμ + ε 3 gq dμ, + ε which implies that ε + ε 3 gq dμ. 36 Again, it follows from convexity of 3 that ε 3 g + ε ε + ε 3 g.

15 Dual connections in nonparametric classical information geometry 887 Inserting this into 36 and denoting k = +ε ε gives 3 g k q dμ, which means that N 3,q g k. Consequently we have that N 3,q f = N 3,q gn 3, f kn 3, f. 37 Proposition 4 Let q = eu Z p u p and = eu Z p u p be two points in U p such that u u L. Then the map τ q : T q M T q M u q u, 38 is an isomorphism of Banach spaces. Proof In view of Proposition 3, we have that L 3q = L 3 and that the norms N,q and N,q are equivalent, from which it follows that, for u B q, { } q u q = sup q uv, q dμ : v L 3, 3 v dμ { } = sup q uv q dμ : v L 3, N 3, v { = k sup q v u dμ : v L 3, N 3,q k v } { } k sup uf q dμ : f L 3 q, N 3,q f = k u,q <, where k is the constant appearing in 37. Thus, q u L, and since q u is centred around we have that q u B q. Therefore u q u is a well-defined continuous bijection from B q to B q whose inverse map v q v is well-defined by the same arguments.

16 888 M. R. Grasselli We denoted the map in the previous proposition by τ since it satisfies the duality relation τ u,τ v q = u u dμ, q v = = u q v dμ uvq dμ = u,v q, u,v B q, u dμ q v dμ where the third equality follows from the fact that v is centred around q. Let us now reflect on the collective results of Propositions to 4. Proposition tells us that the convex mixture of two probability densities in the same U p remains in the connected component E p of M, but not necessarily in the same neighbourhood. Proposition 3 then characterizes exactly those pairs q and for which the convex mixture can be extended beyond the extreme points while remaining in the same connected component E p. For such pairs, Proposition 4 gives an isomorphism of tangent spaces τ which satisfies the duality relation 4 with respect to the Fisher information. However, we refrain from calling τ a parallel transport, since it might fail to be well-defined when the points q and do not satisfy the conditions of Proposition 4. Nevertheless, we can still compute the derivative of τ along curves that satisfy these conditions, as is done in the next proposition. Proposition 5 Let v T p M be a tangent vector at p M and s ST M be a differentiable vector field. If there exist a differentiable curve γ : ε, ε M such that p = γ0, v = γ 0, and whose image consists entirely of points satisfying the hypotheses of Proposition 4, then D v sp := d v sp + spl 0 B p, 39 where d v sp denotes the directional derivative of s in the direction of v in the Banach space L p, and lt = logγ t. Proof For h sufficiently small, it follows from Proposition 4 that τ B γ0. The result then follows from the following calculation: γhγ 0 sγ h [ ] [ ] lim τ h 0 h γhγ 0 sγ h sγ 0 γh = lim sγ h sγ 0 h 0 h γ0 = lim h 0 h + lim h 0 h [ sγ h sγ 0 ] [ γh γ0 γ0 = d v sp + spl 0. ] sγ h

17 Dual connections in nonparametric classical information geometry 889 Despite satisfying all the usual properties of a covariant derivative, such as linearity and Leibniz rule, the differential operator D v might fail to be well-defined when no curve satisfying the conditions of Proposition 5 exists. For this reason, we simply call it the -derivative in the direction of those v for which it is well-defined. In the next section, we will see that D v is part of a one-parameter family of derivatives defined for exactly this class of tangent vectors. 5 α-derivatives In this section, we define an infinite-dimensional analogue of the α-connections introduced in the parametric case independently in Čencov 98 and Amari 985. We use the same technique proposed in Gibilisco and Pistone 998, namely exploring the geometry of spheres in the Lebesgue spaces L r, but modified in such a way that the resulting derivatives all act on sections of the tangent bundle T M. The price we pay is that our derivatives are not defined for all tangent vectors, but only those satisfying the conditions of Proposition 5. We begin with the Amari Nagaoka α-embeddings where r = α. Observe that l α : M L r μ p α p α, α [,, 40 [ /r [ l α p r = l α p dμ] r = r ] /r α p α dμ = r, so l α p S r μ, the sphere of radius r in L r μ we warn the reader that, throughout this paper, the r in S r refers to the fact that this is a sphere of radius r, while the fact that it is a subset of L r is judiciously omitted from the notation. According to Gibilisco and Pistone 998, the tangent space to S r μ at a point f is T f S r μ = where f = sgn f f r. In our case, { } g L r μ : gf dμ = 0, 4 f = l α p = rp /r 4 so that f = rp /r r = r r p /r. 43

18 890 M. R. Grasselli Therefore, the tangent space to S r μ at rp /r is T rp /r S r μ = { } g L r μ : gp /r dμ = We now look for a concrete realization of the push-forward of the map l α when the tangent space T p M is identified with B p as in the previous sections. Since d dt α p α = p α the α-push-forward can be formally implemented as dlogp, dt l α p : T p M = B p T rp /r S r μ u p α u. 45 For this to be well defined, we need to check that p α u is an element of T rp /r S r μ. Indeed, since L p L s p for all s, we have that p /r u r dμ = so p α u L r μ. Moreover p /r up /r dμ = u rf r pdμ <, updμ = 0, which verifies that p /r u T rp /r S r μ. The sphere S r μ inherits a natural connection obtained by projecting the trivial connection on L r μ the one where parallel transport is just the identity map onto its tangent space at each point. For each f S r μ, a canonical projection from the tangent space T f L r μ onto the tangent space T f S r μ can be uniquely defined, since the spaces L r μ are uniformly convex see Gibilisco and Isola 999, and is given by f : T f L r μ T f S r μ g g r r gf dμ When f = rp /r and f = r r p /r, the formula above gives f. 46 rp /r : T rp /r L r μ T rp /r S r μ g g gp /r dμ p /r. 47

19 Dual connections in nonparametric classical information geometry 89 We are now ready to introduce the α-derivative. Suppose that γ : ε, ε M is a smooth curve whose image consists entirely of points satisfying the conditions of Proposition 4. Then the α-push-forward of an arbitrary vector field s ST M along γ is l α γ t s = γt /r sγ t, while the α-push-forward of the tangent vector v = γ0 T M p is l α p v = p /r v. Therefore, the covariant derivative of l α γ t sγ t in the direction of l α p v with respect to the trivial connection on L r μ is given by lα p vl α γ t s = d γt /r sγ t dt t=0 = dlogγ t p/r /r dsγ t r dt sp + p t=0 dt = r p/r l 0sp + p /r d v s p. Using the projection rp /r to obtain a tangent vector in T rp /r S r μ we get t=0 [ rp /r lα p vl α γ t s = p /r r l 0sp + d v sp r l 0sp + d v sp ] pdμ. It then follows from Proposition 5 that r l 0sp + d v sp L p, which implies that the expression above belongs to the image of l α p. Therefore, we can pull it back to T p M using l α p, from which we obtain l α [ p rp /r lα p vl α γ t s ] = r l 0sp + d v sp r l 0sp + d v sp pdμ. 48 As this construction shows, the α-derivatives can be rigorously defined on the tangent bundle T M as follows: Definition For α [,, letγ : ε, ε M be a smooth curve such that p = γ0 and whose image consists entirely of points satisfying the conditions of Proposition 4. The α-derivative of a differentiable vector field s ST M in the direction of v = γ 0 is given by D α v s p = l α [ p rp /r lα p vl α γ t s ]. 49

20 89 M. R. Grasselli Before we proceed, observe that since sγ t B γt for each t ε, ε, we have d sγ tγ tdμ = 0 dt dsγ t γtdμ = sγ t dγt dμ dt dt dsγ t dlogγ t γtdμ = sγ t γtdμ. dt dt In particular, for t = 0, we get d v s ppdμ = s p l0 pdμ 50 Inserting this relation into 48 with α =, corresponding to r =, leads to D v s p = d v sp + spl 0, 5 which coincides with 39. Recall that the covariant derivative associated with the exponential parallel transport 8 was computed in Gibilisco and Pistone 998, proposition 5 as v s p = d v sp d v sppdμ. 5 The next proposition shows that the relation between the exponential connection and the α-derivatives just defined is the same as in the parametric case. Its proof resembles the calculation in the last pages of Gibilisco and Pistone 998, except that all our derivatives act on the same bundle, whereas in Gibilisco and Pistone 998 each one is defined on its own bundle-connection pair. Proposition 6 The exponential connection and the α-derivatives on T M satisfy D α = + α + α D. 53 Proof Let lt = logγ t with γ, s, p and v as in definition. Inserting 50 into 49 gives D α v s p = r l 0sp + d v sp + r d v sppdμ [ ] + α = d v sp d v sp

21 Dual connections in nonparametric classical information geometry Auto-parallel curves α + = + α [d v sp + spl 0] v s p + α D v s p. We now investigate some of the auto-parallel curves associated with the derivatives introduced in the previous sections. First observe that a one-dimensional exponential model of the form qt = etu Z p tu p, u B p, t ε, ε, which obviously belong to the connected component E p, is an auto-parallel curve for, since its tangent vector field st = dt d log qt p according to Pistone and Rogantin 999, proposition satisfies qt st qt = d tu log Z dt p tu E qt = d dt log Z ptu + E qt d tu log Z dt p tu d dt log Z ptu = 0. Next observe that for q and satisfying the conditions of Proposition 4, a mixture model of the form qt = tq + t, q, U p, t [0, ], which belongs to the connected component E p according to Proposition, is an autoparallel curve for D, since the tangent vector field st = dt d log qt p satisfies [ D st qt = d e p q t dt log tq ] + t p + d [ log tq ] + t d dt p dt [log tq + t ] = d [ ] p q dt tq + t p p q q + tq + t p tq + t q q = + = 0. tq + t tq + t The next theorem establishes the corresponding result for the α-derivatives.

22 894 M. R. Grasselli Proposition 7 For α,, theα-auto-parallel curves between two of points q and in U p satisfying the conditions of Proposition 4, forsomep M, belongs to the connected component E p. Proof Using the same notation as in Proposition, we have that the α-auto-parallel curve connecting q, Ep is the pull back of the arc of great circle connecting their images f = l α q and f = l α on the sphere S r μ.nowiftf + t f is the straight line connecting f and f in L r μ, then for each fixed t [0, ] the corresponding point on the sphere of radius r is f t = r kt [tf + t f ], 54 where kt = tf + t f r. Let us write its inverse image with respect to the α-embedding as eũ p = l α f t = kt r [tf + t f ] r, for some random variable ũ. Following the argument in proposition, we see that e βũ = p β kt βr [tf + t f ] βr [ ] r βr [t βr e βũ + t βr e βũ ], kt so that e βũ pdμ <, 55 since both βu and βu are in L p. Furthermore, e βũ = p β kt βr [tf + t f ] βr [ ] kt βr e βũ, tr so that e βũ pdμ <, 56 which together with 55, imply that ũ Z p. To complete the proof we can define u =ũ ũpdμ, 57

23 Dual connections in nonparametric classical information geometry 895 to obtain that u B p and l α f t = eu Z p u p Ep Further developments We have seen that using L as the coordinate space for the infinite-dimensional information manifold leads to a well-defined isomorphism τ between the tangent spaces B q and B q whenever the difference of their log-likelihoods u and u is bounded. Moreover, this isomorphism is dual to the exponential parallel transport with respect to the generalized Fisher metric. The next step in our program is to show that the Kullback Leibler relative entropy is the statistical divergence associated with the dualistic triple τ,τ, g see Amari and Nagaoka 000. In the same vein, since our interpolation family of α-derivatives satisfy the same convex mixture structure as in finite dimensions, we are led to the study of the infinite-dimensional analogues of the α-divergences. The completion of this circle of ideas would be an infinite-dimensional generalization of the projection theorems obtained by Amari in the finite dimensional case. Namely, one seeks to prove that, given a point p M and an α-flat submanifold S, then the point q S with minimal α-divergence from p is obtained by projecting p orthogonally with respect to the Fisher metric onto S following a αgeodesic. An equally ambitious result to be pursued is the infinite-dimensional analogue of Centsov s theorem, which would characterize the generalized Fisher metric as the unique continuous scalar product on M which is reduced by Markov morphisms on the tangent space. Acknowledgments I would like to acknowledge the financial support from CAPES, Brazil, and of an ORS scholarship from the British government received while I was a PhD student at King s College London, where this research was initiated. It is a pleasure to thank R.F. Streater for suggesting the theme and thoroughly discussing the results obtained. I am also grateful to P. Gibilisco and G. Pistone for estimulating discussions during the Information Geometry and Applications conference held in Pescara, July 00. Finally, I am indebted to H. Nagaoka for pointing out a crucial mistake in the previous version of this paper, and to two anonymous referees for marked improvements leading to the current version. References Amari, S.-I Differential-geometrical methods in statistics. New York: Springer. Amari, S.-I., Nagaoka, H Methods of information Geometry. Providence, RI: American Mathematical Society. Translated from the 993 Japanese original by Daishi Harada. Čencov, N.N. 98. Statistical decision rules and optimal inference. Providence, RI: American Mathematical Society. Translation from the Russian edited by Lev J. Leifman. Dawid, A.P On the concepts of sufficiency and ancillarity in the presence of nuisance parameters. Journal of the Royal Statistical Society B, 37, Efron, B Defining the curvature of a statistical problem with applications to second order efficiency. Annals of Statistics, 3, With a discussion by C. R. Rao, Don A. Pierce, D. R. Cox,

24 896 M. R. Grasselli D. V. Lindley, Lucien LeCam, J. K. Ghosh, J. Pfanzagl, Neils Keiding, A. P. Dawid, Jim Reeds and with a reply by the author. Gibilisco, P., Isola, T Connections on statistical manifolds of density operators by geometry of noncommutative L p -spaces. Infinite Dimensional Analysis Quantum Probability and Related Topics,, Gibilisco, P., Pistone, G Connections on non-parametric statistical manifolds by Orlicz space geometry. Infinite Dimensional Analysis Quantum Probability and Related Topics,, Grasselli, M.R. 00. Classical and quantum information geometry. Ph.D. thesis, King s College, London. Jeffreys, H An invariant form for the prior probability in estimation problems. Proceedings of Royal Society A, 86, Kass, R.E., Vos, P.W Geometrical foundations of asymptotic inference. New York: Wiley- Interscience. Krasnosel skiĭ, M.A., Rutickiĭ, J.B. 96. Convex functions and Orlicz spaces. Groningen: P. Noordhoff. Lang, S Differential and Riemannian manifolds 3rd ed.. New York: Springer. Murray, M.K., Rice, J.W Differential geometry and statistics. London: Chapman & Hall. Pistone, G. 00. New ideas in nonparametric estimation. In P. Sollich, et al. Eds., Disordered and complex systems. American Institute of Physics. AIP Conference Proceedings 553. Pistone, G., Rogantin, M.P The exponential statistical manifold: Mean parameters, orthogonality and space transformations. Bernoulli, 5, Pistone, G., Sempi, C An infinite-dimensional geometric structure on the space of all the probability measures equivalent to a given one. Annals of Statistics, 3, Rao, C.R Information and accuracy attainable in the estimation of statistical parameters. Bulletin of the Calcutta Mathematical Society, 37, 8 9. Rao, M.M., Ren, Z.D. 99. Theory of Orlicz spaces. New York: Marcel Dekker. Sollich, P., et al. Eds. 00. Disordered and complex systems. American Institute of Physics. AIP Conference Proceedings 553.

DUAL CONNECTIONS IN NONPARAMETRIC CLASSICAL INFORMATION GEOMETRY

DUAL CONNECTIONS IN NONPARAMETRIC CLASSICAL INFORMATION GEOMETRY DUAL CONNECTIONS IN NONPARAMETRIC CLASSICAL INFORMATION GEOMETRY M. R. GRASSELLI Department of Mathematics and Statistics McMaster University 1280 Main Street West Hamilton ON L8S 4K1, Canada e-mail: grasselli@math.mcmaster.ca

More information

Open Questions in Quantum Information Geometry

Open Questions in Quantum Information Geometry Open Questions in Quantum Information Geometry M. R. Grasselli Department of Mathematics and Statistics McMaster University Imperial College, June 15, 2005 1. Classical Parametric Information Geometry

More information

Classical and Quantum Information Geometry

Classical and Quantum Information Geometry Classical and Quantum Information Geometry Matheus da Rocha Grasselli A dissertation submitted to the University of London in partial fulfillment of the requirements for the degree of Doctor of Philosophy

More information

Applications of Algebraic Geometry IMA Annual Program Year Workshop Applications in Biology, Dynamics, and Statistics March 5-9, 2007

Applications of Algebraic Geometry IMA Annual Program Year Workshop Applications in Biology, Dynamics, and Statistics March 5-9, 2007 Applications of Algebraic Geometry IMA Annual Program Year Workshop Applications in Biology, Dynamics, and Statistics March 5-9, 2007 Information Geometry (IG) and Algebraic Statistics (AS) Giovanni Pistone

More information

Statistics 612: L p spaces, metrics on spaces of probabilites, and connections to estimation

Statistics 612: L p spaces, metrics on spaces of probabilites, and connections to estimation Statistics 62: L p spaces, metrics on spaces of probabilites, and connections to estimation Moulinath Banerjee December 6, 2006 L p spaces and Hilbert spaces We first formally define L p spaces. Consider

More information

Information Geometry and Algebraic Statistics

Information Geometry and Algebraic Statistics Matematikos ir informatikos institutas Vinius Information Geometry and Algebraic Statistics Giovanni Pistone and Eva Riccomagno POLITO, UNIGE Wednesday 30th July, 2008 G.Pistone, E.Riccomagno (POLITO,

More information

CERTAIN PROPERTIES OF GENERALIZED ORLICZ SPACES

CERTAIN PROPERTIES OF GENERALIZED ORLICZ SPACES Volume 10 (2009), Issue 2, Article 37, 10 pp. CERTAIN PROPERTIES OF GENERALIZED ORLICZ SPACES PANKAJ JAIN AND PRITI UPRETI DEPARTMENT OF MATHEMATICS DESHBANDHU COLLEGE (UNIVERSITY OF DELHI) KALKAJI, NEW

More information

Chapter 3. Riemannian Manifolds - I. The subject of this thesis is to extend the combinatorial curve reconstruction approach to curves

Chapter 3. Riemannian Manifolds - I. The subject of this thesis is to extend the combinatorial curve reconstruction approach to curves Chapter 3 Riemannian Manifolds - I The subject of this thesis is to extend the combinatorial curve reconstruction approach to curves embedded in Riemannian manifolds. A Riemannian manifold is an abstraction

More information

CALCULUS ON MANIFOLDS. 1. Riemannian manifolds Recall that for any smooth manifold M, dim M = n, the union T M =

CALCULUS ON MANIFOLDS. 1. Riemannian manifolds Recall that for any smooth manifold M, dim M = n, the union T M = CALCULUS ON MANIFOLDS 1. Riemannian manifolds Recall that for any smooth manifold M, dim M = n, the union T M = a M T am, called the tangent bundle, is itself a smooth manifold, dim T M = 2n. Example 1.

More information

1 Differentiable manifolds and smooth maps

1 Differentiable manifolds and smooth maps 1 Differentiable manifolds and smooth maps Last updated: April 14, 2011. 1.1 Examples and definitions Roughly, manifolds are sets where one can introduce coordinates. An n-dimensional manifold is a set

More information

Introduction to Real Analysis Alternative Chapter 1

Introduction to Real Analysis Alternative Chapter 1 Christopher Heil Introduction to Real Analysis Alternative Chapter 1 A Primer on Norms and Banach Spaces Last Updated: March 10, 2018 c 2018 by Christopher Heil Chapter 1 A Primer on Norms and Banach Spaces

More information

Chapter 2 Exponential Families and Mixture Families of Probability Distributions

Chapter 2 Exponential Families and Mixture Families of Probability Distributions Chapter 2 Exponential Families and Mixture Families of Probability Distributions The present chapter studies the geometry of the exponential family of probability distributions. It is not only a typical

More information

F -Geometry and Amari s α Geometry on a Statistical Manifold

F -Geometry and Amari s α Geometry on a Statistical Manifold Entropy 014, 16, 47-487; doi:10.3390/e160547 OPEN ACCESS entropy ISSN 1099-4300 www.mdpi.com/journal/entropy Article F -Geometry and Amari s α Geometry on a Statistical Manifold Harsha K. V. * and Subrahamanian

More information

fy (X(g)) Y (f)x(g) gy (X(f)) Y (g)x(f)) = fx(y (g)) + gx(y (f)) fy (X(g)) gy (X(f))

fy (X(g)) Y (f)x(g) gy (X(f)) Y (g)x(f)) = fx(y (g)) + gx(y (f)) fy (X(g)) gy (X(f)) 1. Basic algebra of vector fields Let V be a finite dimensional vector space over R. Recall that V = {L : V R} is defined to be the set of all linear maps to R. V is isomorphic to V, but there is no canonical

More information

LECTURE 15: COMPLETENESS AND CONVEXITY

LECTURE 15: COMPLETENESS AND CONVEXITY LECTURE 15: COMPLETENESS AND CONVEXITY 1. The Hopf-Rinow Theorem Recall that a Riemannian manifold (M, g) is called geodesically complete if the maximal defining interval of any geodesic is R. On the other

More information

4.7 The Levi-Civita connection and parallel transport

4.7 The Levi-Civita connection and parallel transport Classnotes: Geometry & Control of Dynamical Systems, M. Kawski. April 21, 2009 138 4.7 The Levi-Civita connection and parallel transport In the earlier investigation, characterizing the shortest curves

More information

Affine Connections: Part 2

Affine Connections: Part 2 Affine Connections: Part 2 Manuscript for Machine Learning Reading Group Talk R. Simon Fong Abstract Note for online manuscript: This is the manuscript of a one hour introductory talk on (affine) connections.

More information

Information Geometry: Background and Applications in Machine Learning

Information Geometry: Background and Applications in Machine Learning Geometry and Computer Science Information Geometry: Background and Applications in Machine Learning Giovanni Pistone www.giannidiorestino.it Pescara IT), February 8 10, 2017 Abstract Information Geometry

More information

Information Geometry on Hierarchy of Probability Distributions

Information Geometry on Hierarchy of Probability Distributions IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 47, NO. 5, JULY 2001 1701 Information Geometry on Hierarchy of Probability Distributions Shun-ichi Amari, Fellow, IEEE Abstract An exponential family or mixture

More information

Differential Geometry MTG 6257 Spring 2018 Problem Set 4 Due-date: Wednesday, 4/25/18

Differential Geometry MTG 6257 Spring 2018 Problem Set 4 Due-date: Wednesday, 4/25/18 Differential Geometry MTG 6257 Spring 2018 Problem Set 4 Due-date: Wednesday, 4/25/18 Required problems (to be handed in): 2bc, 3, 5c, 5d(i). In doing any of these problems, you may assume the results

More information

1 Exponential manifold by reproducing kernel Hilbert spaces

1 Exponential manifold by reproducing kernel Hilbert spaces 1 Exponential manifold by reproducing kernel Hilbert spaces 1.1 Introduction The purpose of this paper is to propose a method of constructing exponential families of Hilbert manifold, on which estimation

More information

Math 6455 Nov 1, Differential Geometry I Fall 2006, Georgia Tech

Math 6455 Nov 1, Differential Geometry I Fall 2006, Georgia Tech Math 6455 Nov 1, 26 1 Differential Geometry I Fall 26, Georgia Tech Lecture Notes 14 Connections Suppose that we have a vector field X on a Riemannian manifold M. How can we measure how much X is changing

More information

A crash course the geometry of hyperbolic surfaces

A crash course the geometry of hyperbolic surfaces Lecture 7 A crash course the geometry of hyperbolic surfaces 7.1 The hyperbolic plane Hyperbolic geometry originally developed in the early 19 th century to prove that the parallel postulate in Euclidean

More information

Part V. 17 Introduction: What are measures and why measurable sets. Lebesgue Integration Theory

Part V. 17 Introduction: What are measures and why measurable sets. Lebesgue Integration Theory Part V 7 Introduction: What are measures and why measurable sets Lebesgue Integration Theory Definition 7. (Preliminary). A measure on a set is a function :2 [ ] such that. () = 2. If { } = is a finite

More information

Welcome to Copenhagen!

Welcome to Copenhagen! Welcome to Copenhagen! Schedule: Monday Tuesday Wednesday Thursday Friday 8 Registration and welcome 9 Crash course on Crash course on Introduction to Differential and Differential and Information Geometry

More information

THE FUNDAMENTAL GROUP OF MANIFOLDS OF POSITIVE ISOTROPIC CURVATURE AND SURFACE GROUPS

THE FUNDAMENTAL GROUP OF MANIFOLDS OF POSITIVE ISOTROPIC CURVATURE AND SURFACE GROUPS THE FUNDAMENTAL GROUP OF MANIFOLDS OF POSITIVE ISOTROPIC CURVATURE AND SURFACE GROUPS AILANA FRASER AND JON WOLFSON Abstract. In this paper we study the topology of compact manifolds of positive isotropic

More information

Generalized Neyman Pearson optimality of empirical likelihood for testing parameter hypotheses

Generalized Neyman Pearson optimality of empirical likelihood for testing parameter hypotheses Ann Inst Stat Math (2009) 61:773 787 DOI 10.1007/s10463-008-0172-6 Generalized Neyman Pearson optimality of empirical likelihood for testing parameter hypotheses Taisuke Otsu Received: 1 June 2007 / Revised:

More information

Loos Symmetric Cones. Jimmie Lawson Louisiana State University. July, 2018

Loos Symmetric Cones. Jimmie Lawson Louisiana State University. July, 2018 Louisiana State University July, 2018 Dedication I would like to dedicate this talk to Joachim Hilgert, whose 60th birthday we celebrate at this conference and with whom I researched and wrote a big blue

More information

On a Class of Multidimensional Optimal Transportation Problems

On a Class of Multidimensional Optimal Transportation Problems Journal of Convex Analysis Volume 10 (2003), No. 2, 517 529 On a Class of Multidimensional Optimal Transportation Problems G. Carlier Université Bordeaux 1, MAB, UMR CNRS 5466, France and Université Bordeaux

More information

class # MATH 7711, AUTUMN 2017 M-W-F 3:00 p.m., BE 128 A DAY-BY-DAY LIST OF TOPICS

class # MATH 7711, AUTUMN 2017 M-W-F 3:00 p.m., BE 128 A DAY-BY-DAY LIST OF TOPICS class # 34477 MATH 7711, AUTUMN 2017 M-W-F 3:00 p.m., BE 128 A DAY-BY-DAY LIST OF TOPICS [DG] stands for Differential Geometry at https://people.math.osu.edu/derdzinski.1/courses/851-852-notes.pdf [DFT]

More information

AN AFFINE EMBEDDING OF THE GAMMA MANIFOLD

AN AFFINE EMBEDDING OF THE GAMMA MANIFOLD AN AFFINE EMBEDDING OF THE GAMMA MANIFOLD C.T.J. DODSON AND HIROSHI MATSUZOE Abstract. For the space of gamma distributions with Fisher metric and exponential connections, natural coordinate systems, potential

More information

FUNCTIONAL ANALYSIS LECTURE NOTES: COMPACT SETS AND FINITE-DIMENSIONAL SPACES. 1. Compact Sets

FUNCTIONAL ANALYSIS LECTURE NOTES: COMPACT SETS AND FINITE-DIMENSIONAL SPACES. 1. Compact Sets FUNCTIONAL ANALYSIS LECTURE NOTES: COMPACT SETS AND FINITE-DIMENSIONAL SPACES CHRISTOPHER HEIL 1. Compact Sets Definition 1.1 (Compact and Totally Bounded Sets). Let X be a metric space, and let E X be

More information

2 (Bonus). Let A X consist of points (x, y) such that either x or y is a rational number. Is A measurable? What is its Lebesgue measure?

2 (Bonus). Let A X consist of points (x, y) such that either x or y is a rational number. Is A measurable? What is its Lebesgue measure? MA 645-4A (Real Analysis), Dr. Chernov Homework assignment 1 (Due 9/5). Prove that every countable set A is measurable and µ(a) = 0. 2 (Bonus). Let A consist of points (x, y) such that either x or y is

More information

DIFFERENTIAL GEOMETRY, LECTURE 16-17, JULY 14-17

DIFFERENTIAL GEOMETRY, LECTURE 16-17, JULY 14-17 DIFFERENTIAL GEOMETRY, LECTURE 16-17, JULY 14-17 6. Geodesics A parametrized line γ : [a, b] R n in R n is straight (and the parametrization is uniform) if the vector γ (t) does not depend on t. Thus,

More information

Information geometry of Bayesian statistics

Information geometry of Bayesian statistics Information geometry of Bayesian statistics Hiroshi Matsuzoe Department of Computer Science and Engineering, Graduate School of Engineering, Nagoya Institute of Technology, Nagoya 466-8555, Japan Abstract.

More information

Tangent bundles, vector fields

Tangent bundles, vector fields Location Department of Mathematical Sciences,, G5-109. Main Reference: [Lee]: J.M. Lee, Introduction to Smooth Manifolds, Graduate Texts in Mathematics 218, Springer-Verlag, 2002. Homepage for the book,

More information

ON SUBSPACES OF NON-REFLEXIVE ORLICZ SPACES

ON SUBSPACES OF NON-REFLEXIVE ORLICZ SPACES ON SUBSPACES OF NON-REFLEXIVE ORLICZ SPACES J. ALEXOPOULOS May 28, 997 ABSTRACT. Kadec and Pelczýnski have shown that every non-reflexive subspace of L (µ) contains a copy of l complemented in L (µ). On

More information

Solutions to the Hamilton-Jacobi equation as Lagrangian submanifolds

Solutions to the Hamilton-Jacobi equation as Lagrangian submanifolds Solutions to the Hamilton-Jacobi equation as Lagrangian submanifolds Matias Dahl January 2004 1 Introduction In this essay we shall study the following problem: Suppose is a smooth -manifold, is a function,

More information

On the distributional divergence of vector fields vanishing at infinity

On the distributional divergence of vector fields vanishing at infinity Proceedings of the Royal Society of Edinburgh, 141A, 65 76, 2011 On the distributional divergence of vector fields vanishing at infinity Thierry De Pauw Institut de Recherches en Mathématiques et Physique,

More information

Lecture 4 Lebesgue spaces and inequalities

Lecture 4 Lebesgue spaces and inequalities Lecture 4: Lebesgue spaces and inequalities 1 of 10 Course: Theory of Probability I Term: Fall 2013 Instructor: Gordan Zitkovic Lecture 4 Lebesgue spaces and inequalities Lebesgue spaces We have seen how

More information

The Skorokhod reflection problem for functions with discontinuities (contractive case)

The Skorokhod reflection problem for functions with discontinuities (contractive case) The Skorokhod reflection problem for functions with discontinuities (contractive case) TAKIS KONSTANTOPOULOS Univ. of Texas at Austin Revised March 1999 Abstract Basic properties of the Skorokhod reflection

More information

Convexity in R n. The following lemma will be needed in a while. Lemma 1 Let x E, u R n. If τ I(x, u), τ 0, define. f(x + τu) f(x). τ.

Convexity in R n. The following lemma will be needed in a while. Lemma 1 Let x E, u R n. If τ I(x, u), τ 0, define. f(x + τu) f(x). τ. Convexity in R n Let E be a convex subset of R n. A function f : E (, ] is convex iff f(tx + (1 t)y) (1 t)f(x) + tf(y) x, y E, t [0, 1]. A similar definition holds in any vector space. A topology is needed

More information

The small ball property in Banach spaces (quantitative results)

The small ball property in Banach spaces (quantitative results) The small ball property in Banach spaces (quantitative results) Ehrhard Behrends Abstract A metric space (M, d) is said to have the small ball property (sbp) if for every ε 0 > 0 there exists a sequence

More information

6 Lecture 6: More constructions with Huber rings

6 Lecture 6: More constructions with Huber rings 6 Lecture 6: More constructions with Huber rings 6.1 Introduction Recall from Definition 5.2.4 that a Huber ring is a commutative topological ring A equipped with an open subring A 0, such that the subspace

More information

June 21, Peking University. Dual Connections. Zhengchao Wan. Overview. Duality of connections. Divergence: general contrast functions

June 21, Peking University. Dual Connections. Zhengchao Wan. Overview. Duality of connections. Divergence: general contrast functions Dual Peking University June 21, 2016 Divergences: Riemannian connection Let M be a manifold on which there is given a Riemannian metric g =,. A connection satisfying Z X, Y = Z X, Y + X, Z Y (1) for all

More information

CHAPTER 1 PRELIMINARIES

CHAPTER 1 PRELIMINARIES CHAPTER 1 PRELIMINARIES 1.1 Introduction The aim of this chapter is to give basic concepts, preliminary notions and some results which we shall use in the subsequent chapters of the thesis. 1.2 Differentiable

More information

Course 212: Academic Year Section 1: Metric Spaces

Course 212: Academic Year Section 1: Metric Spaces Course 212: Academic Year 1991-2 Section 1: Metric Spaces D. R. Wilkins Contents 1 Metric Spaces 3 1.1 Distance Functions and Metric Spaces............. 3 1.2 Convergence and Continuity in Metric Spaces.........

More information

NOTES ON THE REGULARITY OF QUASICONFORMAL HOMEOMORPHISMS

NOTES ON THE REGULARITY OF QUASICONFORMAL HOMEOMORPHISMS NOTES ON THE REGULARITY OF QUASICONFORMAL HOMEOMORPHISMS CLARK BUTLER. Introduction The purpose of these notes is to give a self-contained proof of the following theorem, Theorem.. Let f : S n S n be a

More information

Optimization and Optimal Control in Banach Spaces

Optimization and Optimal Control in Banach Spaces Optimization and Optimal Control in Banach Spaces Bernhard Schmitzer October 19, 2017 1 Convex non-smooth optimization with proximal operators Remark 1.1 (Motivation). Convex optimization: easier to solve,

More information

Functional Analysis. Franck Sueur Metric spaces Definitions Completeness Compactness Separability...

Functional Analysis. Franck Sueur Metric spaces Definitions Completeness Compactness Separability... Functional Analysis Franck Sueur 2018-2019 Contents 1 Metric spaces 1 1.1 Definitions........................................ 1 1.2 Completeness...................................... 3 1.3 Compactness......................................

More information

A brief introduction to N functions and Orlicz function spaces. John Alexopoulos Kent State University, Stark Campus

A brief introduction to N functions and Orlicz function spaces. John Alexopoulos Kent State University, Stark Campus A brief introduction to N functions and Orlicz function spaces John Alexopoulos Kent State University, Stark Campus March 12, 24 2 Contents 1 Introduction: Uniform integrability 5 1.1 Definitions.....................................

More information

1. Geometry of the unit tangent bundle

1. Geometry of the unit tangent bundle 1 1. Geometry of the unit tangent bundle The main reference for this section is [8]. In the following, we consider (M, g) an n-dimensional smooth manifold endowed with a Riemannian metric g. 1.1. Notations

More information

Pseudo-Poincaré Inequalities and Applications to Sobolev Inequalities

Pseudo-Poincaré Inequalities and Applications to Sobolev Inequalities Pseudo-Poincaré Inequalities and Applications to Sobolev Inequalities Laurent Saloff-Coste Abstract Most smoothing procedures are via averaging. Pseudo-Poincaré inequalities give a basic L p -norm control

More information

Symplectic and Kähler Structures on Statistical Manifolds Induced from Divergence Functions

Symplectic and Kähler Structures on Statistical Manifolds Induced from Divergence Functions Symplectic and Kähler Structures on Statistical Manifolds Induced from Divergence Functions Jun Zhang 1, and Fubo Li 1 University of Michigan, Ann Arbor, Michigan, USA Sichuan University, Chengdu, China

More information

Lecture 8. Connections

Lecture 8. Connections Lecture 8. Connections This lecture introduces connections, which are the machinery required to allow differentiation of vector fields. 8.1 Differentiating vector fields. The idea of differentiating vector

More information

Introduction to Information Geometry

Introduction to Information Geometry Introduction to Information Geometry based on the book Methods of Information Geometry written by Shun-Ichi Amari and Hiroshi Nagaoka Yunshu Liu 2012-02-17 Outline 1 Introduction to differential geometry

More information

Implicit Functions, Curves and Surfaces

Implicit Functions, Curves and Surfaces Chapter 11 Implicit Functions, Curves and Surfaces 11.1 Implicit Function Theorem Motivation. In many problems, objects or quantities of interest can only be described indirectly or implicitly. It is then

More information

Hyperplane Margin Classifiers on the Multinomial Manifold

Hyperplane Margin Classifiers on the Multinomial Manifold Hyperplane Margin Classifiers on the Multinomial Manifold Guy Lebanon LEBANON@CS.CMU.EDU John Lafferty LAFFERTY@CS.CMU.EDU School of Computer Science, Carnegie Mellon University, 5000 Forbes Ave., Pittsburgh

More information

CHAPTER 3. Gauss map. In this chapter we will study the Gauss map of surfaces in R 3.

CHAPTER 3. Gauss map. In this chapter we will study the Gauss map of surfaces in R 3. CHAPTER 3 Gauss map In this chapter we will study the Gauss map of surfaces in R 3. 3.1. Surfaces in R 3 Let S R 3 be a submanifold of dimension 2. Let {U i, ϕ i } be a DS on S. For any p U i we have a

More information

CORRIGENDUM: THE SYMPLECTIC SUM FORMULA FOR GROMOV-WITTEN INVARIANTS

CORRIGENDUM: THE SYMPLECTIC SUM FORMULA FOR GROMOV-WITTEN INVARIANTS CORRIGENDUM: THE SYMPLECTIC SUM FORMULA FOR GROMOV-WITTEN INVARIANTS ELENY-NICOLETA IONEL AND THOMAS H. PARKER Abstract. We correct an error and an oversight in [IP]. The sign of the curvature in (8.7)

More information

Part IB GEOMETRY (Lent 2016): Example Sheet 1

Part IB GEOMETRY (Lent 2016): Example Sheet 1 Part IB GEOMETRY (Lent 2016): Example Sheet 1 (a.g.kovalev@dpmms.cam.ac.uk) 1. Suppose that H is a hyperplane in Euclidean n-space R n defined by u x = c for some unit vector u and constant c. The reflection

More information

Set, functions and Euclidean space. Seungjin Han

Set, functions and Euclidean space. Seungjin Han Set, functions and Euclidean space Seungjin Han September, 2018 1 Some Basics LOGIC A is necessary for B : If B holds, then A holds. B A A B is the contraposition of B A. A is sufficient for B: If A holds,

More information

Problem Set 6: Solutions Math 201A: Fall a n x n,

Problem Set 6: Solutions Math 201A: Fall a n x n, Problem Set 6: Solutions Math 201A: Fall 2016 Problem 1. Is (x n ) n=0 a Schauder basis of C([0, 1])? No. If f(x) = a n x n, n=0 where the series converges uniformly on [0, 1], then f has a power series

More information

EXISTENCE OF SET-INTERPOLATING AND ENERGY- MINIMIZING CURVES

EXISTENCE OF SET-INTERPOLATING AND ENERGY- MINIMIZING CURVES EXISTENCE OF SET-INTERPOLATING AND ENERGY- MINIMIZING CURVES JOHANNES WALLNER Abstract. We consider existence of curves c : [0, 1] R n which minimize an energy of the form c (k) p (k = 1, 2,..., 1 < p

More information

3 (Due ). Let A X consist of points (x, y) such that either x or y is a rational number. Is A measurable? What is its Lebesgue measure?

3 (Due ). Let A X consist of points (x, y) such that either x or y is a rational number. Is A measurable? What is its Lebesgue measure? MA 645-4A (Real Analysis), Dr. Chernov Homework assignment 1 (Due ). Show that the open disk x 2 + y 2 < 1 is a countable union of planar elementary sets. Show that the closed disk x 2 + y 2 1 is a countable

More information

Theorem 2. Let n 0 3 be a given integer. is rigid in the sense of Guillemin, so are all the spaces ḠR n,n, with n n 0.

Theorem 2. Let n 0 3 be a given integer. is rigid in the sense of Guillemin, so are all the spaces ḠR n,n, with n n 0. This monograph is motivated by a fundamental rigidity problem in Riemannian geometry: determine whether the metric of a given Riemannian symmetric space of compact type can be characterized by means of

More information

(x, y) = d(x, y) = x y.

(x, y) = d(x, y) = x y. 1 Euclidean geometry 1.1 Euclidean space Our story begins with a geometry which will be familiar to all readers, namely the geometry of Euclidean space. In this first chapter we study the Euclidean distance

More information

of conditional information geometry,

of conditional information geometry, An Extended Čencov-Campbell Characterization of Conditional Information Geometry Guy Lebanon School of Computer Science Carnegie ellon niversity lebanon@cs.cmu.edu Abstract We formulate and prove an axiomatic

More information

Combinatorics in Banach space theory Lecture 12

Combinatorics in Banach space theory Lecture 12 Combinatorics in Banach space theory Lecture The next lemma considerably strengthens the assertion of Lemma.6(b). Lemma.9. For every Banach space X and any n N, either all the numbers n b n (X), c n (X)

More information

Some Properties of the Augmented Lagrangian in Cone Constrained Optimization

Some Properties of the Augmented Lagrangian in Cone Constrained Optimization MATHEMATICS OF OPERATIONS RESEARCH Vol. 29, No. 3, August 2004, pp. 479 491 issn 0364-765X eissn 1526-5471 04 2903 0479 informs doi 10.1287/moor.1040.0103 2004 INFORMS Some Properties of the Augmented

More information

Chapter 16. Manifolds and Geodesics Manifold Theory. Reading: Osserman [7] Pg , 55, 63-65, Do Carmo [2] Pg ,

Chapter 16. Manifolds and Geodesics Manifold Theory. Reading: Osserman [7] Pg , 55, 63-65, Do Carmo [2] Pg , Chapter 16 Manifolds and Geodesics Reading: Osserman [7] Pg. 43-52, 55, 63-65, Do Carmo [2] Pg. 238-247, 325-335. 16.1 Manifold Theory Let us recall the definition of differentiable manifolds Definition

More information

An introduction to Mathematical Theory of Control

An introduction to Mathematical Theory of Control An introduction to Mathematical Theory of Control Vasile Staicu University of Aveiro UNICA, May 2018 Vasile Staicu (University of Aveiro) An introduction to Mathematical Theory of Control UNICA, May 2018

More information

HADAMARD FOLIATIONS OF H n. I

HADAMARD FOLIATIONS OF H n. I HADAMARD FOLIATIONS OF H n. I MACIEJ CZARNECKI Abstract. We introduce the notion of an Hadamard foliation as a foliation of Hadamard manifold which all leaves are Hadamard. We prove that a foliation of

More information

ON A MAXIMAL OPERATOR IN REARRANGEMENT INVARIANT BANACH FUNCTION SPACES ON METRIC SPACES

ON A MAXIMAL OPERATOR IN REARRANGEMENT INVARIANT BANACH FUNCTION SPACES ON METRIC SPACES Vasile Alecsandri University of Bacău Faculty of Sciences Scientific Studies and Research Series Mathematics and Informatics Vol. 27207), No., 49-60 ON A MAXIMAL OPRATOR IN RARRANGMNT INVARIANT BANACH

More information

Boundedly complete weak-cauchy basic sequences in Banach spaces with the PCP

Boundedly complete weak-cauchy basic sequences in Banach spaces with the PCP Journal of Functional Analysis 253 (2007) 772 781 www.elsevier.com/locate/jfa Note Boundedly complete weak-cauchy basic sequences in Banach spaces with the PCP Haskell Rosenthal Department of Mathematics,

More information

Parametrized Measure Models

Parametrized Measure Models Parametrized Measure Models Nihat Ay Jürgen Jost Hông Vân Lê Lorenz Schwachhöfer SFI WORKING PAPER: 2015-10-040 SFI Working Papers contain accounts of scienti5ic work of the author(s) and do not necessarily

More information

L p Spaces and Convexity

L p Spaces and Convexity L p Spaces and Convexity These notes largely follow the treatments in Royden, Real Analysis, and Rudin, Real & Complex Analysis. 1. Convex functions Let I R be an interval. For I open, we say a function

More information

II. DIFFERENTIABLE MANIFOLDS. Washington Mio CENTER FOR APPLIED VISION AND IMAGING SCIENCES

II. DIFFERENTIABLE MANIFOLDS. Washington Mio CENTER FOR APPLIED VISION AND IMAGING SCIENCES II. DIFFERENTIABLE MANIFOLDS Washington Mio Anuj Srivastava and Xiuwen Liu (Illustrations by D. Badlyans) CENTER FOR APPLIED VISION AND IMAGING SCIENCES Florida State University WHY MANIFOLDS? Non-linearity

More information

1 Introduction: connections and fiber bundles

1 Introduction: connections and fiber bundles [under construction] 1 Introduction: connections and fiber bundles Two main concepts of differential geometry are those of a covariant derivative and of a fiber bundle (in particular, a vector bundle).

More information

Geometry in a Fréchet Context: A Projective Limit Approach

Geometry in a Fréchet Context: A Projective Limit Approach Geometry in a Fréchet Context: A Projective Limit Approach Geometry in a Fréchet Context: A Projective Limit Approach by C.T.J. Dodson University of Manchester, Manchester, UK George Galanis Hellenic

More information

LARGE DEVIATIONS OF TYPICAL LINEAR FUNCTIONALS ON A CONVEX BODY WITH UNCONDITIONAL BASIS. S. G. Bobkov and F. L. Nazarov. September 25, 2011

LARGE DEVIATIONS OF TYPICAL LINEAR FUNCTIONALS ON A CONVEX BODY WITH UNCONDITIONAL BASIS. S. G. Bobkov and F. L. Nazarov. September 25, 2011 LARGE DEVIATIONS OF TYPICAL LINEAR FUNCTIONALS ON A CONVEX BODY WITH UNCONDITIONAL BASIS S. G. Bobkov and F. L. Nazarov September 25, 20 Abstract We study large deviations of linear functionals on an isotropic

More information

JUHA KINNUNEN. Harmonic Analysis

JUHA KINNUNEN. Harmonic Analysis JUHA KINNUNEN Harmonic Analysis Department of Mathematics and Systems Analysis, Aalto University 27 Contents Calderón-Zygmund decomposition. Dyadic subcubes of a cube.........................2 Dyadic cubes

More information

CALCULUS ON MANIFOLDS

CALCULUS ON MANIFOLDS CALCULUS ON MANIFOLDS 1. Manifolds Morally, manifolds are topological spaces which locally look like open balls of the Euclidean space R n. One can construct them by piecing together such balls ( cells

More information

KUIPER S THEOREM ON CONFORMALLY FLAT MANIFOLDS

KUIPER S THEOREM ON CONFORMALLY FLAT MANIFOLDS KUIPER S THEOREM ON CONFORMALLY FLAT MANIFOLDS RALPH HOWARD DEPARTMENT OF MATHEMATICS UNIVERSITY OF SOUTH CAROLINA COLUMBIA, S.C. 29208, USA HOWARD@MATH.SC.EDU 1. Introduction These are notes to that show

More information

THE JORDAN-BROUWER SEPARATION THEOREM

THE JORDAN-BROUWER SEPARATION THEOREM THE JORDAN-BROUWER SEPARATION THEOREM WOLFGANG SCHMALTZ Abstract. The Classical Jordan Curve Theorem says that every simple closed curve in R 2 divides the plane into two pieces, an inside and an outside

More information

THE RADON NIKODYM PROPERTY AND LIPSCHITZ EMBEDDINGS

THE RADON NIKODYM PROPERTY AND LIPSCHITZ EMBEDDINGS THE RADON NIKODYM PROPERTY AND LIPSCHITZ EMBEDDINGS Motivation The idea here is simple. Suppose we have a Lipschitz homeomorphism f : X Y where X and Y are Banach spaces, namely c 1 x y f (x) f (y) c 2

More information

SMSTC (2017/18) Geometry and Topology 2.

SMSTC (2017/18) Geometry and Topology 2. SMSTC (2017/18) Geometry and Topology 2 Lecture 1: Differentiable Functions and Manifolds in R n Lecturer: Diletta Martinelli (Notes by Bernd Schroers) a wwwsmstcacuk 11 General remarks In this lecture

More information

Whitney topology and spaces of preference relations. Abstract

Whitney topology and spaces of preference relations. Abstract Whitney topology and spaces of preference relations Oleksandra Hubal Lviv National University Michael Zarichnyi University of Rzeszow, Lviv National University Abstract The strong Whitney topology on the

More information

1.4 The Jacobian of a map

1.4 The Jacobian of a map 1.4 The Jacobian of a map Derivative of a differentiable map Let F : M n N m be a differentiable map between two C 1 manifolds. Given a point p M we define the derivative of F at p by df p df (p) : T p

More information

MATH DIFFERENTIAL GEOMETRY. Contents

MATH DIFFERENTIAL GEOMETRY. Contents MATH 3968 - DIFFERENTIAL GEOMETRY ANDREW TULLOCH Contents 1. Curves in R N 2 2. General Analysis 2 3. Surfaces in R 3 3 3.1. The Gauss Bonnet Theorem 8 4. Abstract Manifolds 9 1 MATH 3968 - DIFFERENTIAL

More information

arxiv:math/ v1 [math.dg] 23 Dec 2001

arxiv:math/ v1 [math.dg] 23 Dec 2001 arxiv:math/0112260v1 [math.dg] 23 Dec 2001 VOLUME PRESERVING EMBEDDINGS OF OPEN SUBSETS OF Ê n INTO MANIFOLDS FELIX SCHLENK Abstract. We consider a connected smooth n-dimensional manifold M endowed with

More information

NORMAL FAMILIES OF HOLOMORPHIC FUNCTIONS ON INFINITE DIMENSIONAL SPACES

NORMAL FAMILIES OF HOLOMORPHIC FUNCTIONS ON INFINITE DIMENSIONAL SPACES PORTUGALIAE MATHEMATICA Vol. 63 Fasc.3 2006 Nova Série NORMAL FAMILIES OF HOLOMORPHIC FUNCTIONS ON INFINITE DIMENSIONAL SPACES Paula Takatsuka * Abstract: The purpose of the present work is to extend some

More information

Strictly convex functions on complete Finsler manifolds

Strictly convex functions on complete Finsler manifolds Proc. Indian Acad. Sci. (Math. Sci.) Vol. 126, No. 4, November 2016, pp. 623 627. DOI 10.1007/s12044-016-0307-2 Strictly convex functions on complete Finsler manifolds YOE ITOKAWA 1, KATSUHIRO SHIOHAMA

More information

Lecture 11. Geodesics and completeness

Lecture 11. Geodesics and completeness Lecture 11. Geodesics and completeness In this lecture we will investigate further the metric properties of geodesics of the Levi-Civita connection, and use this to characterise completeness of a Riemannian

More information

ON THE CONSTRUCTION OF DUALLY FLAT FINSLER METRICS

ON THE CONSTRUCTION OF DUALLY FLAT FINSLER METRICS Huang, L., Liu, H. and Mo, X. Osaka J. Math. 52 (2015), 377 391 ON THE CONSTRUCTION OF DUALLY FLAT FINSLER METRICS LIBING HUANG, HUAIFU LIU and XIAOHUAN MO (Received April 15, 2013, revised November 14,

More information

Normed Vector Spaces and Double Duals

Normed Vector Spaces and Double Duals Normed Vector Spaces and Double Duals Mathematics 481/525 In this note we look at a number of infinite-dimensional R-vector spaces that arise in analysis, and we consider their dual and double dual spaces

More information

The nonsmooth Newton method on Riemannian manifolds

The nonsmooth Newton method on Riemannian manifolds The nonsmooth Newton method on Riemannian manifolds C. Lageman, U. Helmke, J.H. Manton 1 Introduction Solving nonlinear equations in Euclidean space is a frequently occurring problem in optimization and

More information

Choice of Riemannian Metrics for Rigid Body Kinematics

Choice of Riemannian Metrics for Rigid Body Kinematics Choice of Riemannian Metrics for Rigid Body Kinematics Miloš Žefran1, Vijay Kumar 1 and Christopher Croke 2 1 General Robotics and Active Sensory Perception (GRASP) Laboratory 2 Department of Mathematics

More information

Monotone Point-to-Set Vector Fields

Monotone Point-to-Set Vector Fields Monotone Point-to-Set Vector Fields J.X. da Cruz Neto, O.P.Ferreira and L.R. Lucambio Pérez Dedicated to Prof.Dr. Constantin UDRIŞTE on the occasion of his sixtieth birthday Abstract We introduce the concept

More information

ON SUBMAXIMAL DIMENSION OF THE GROUP OF ALMOST ISOMETRIES OF FINSLER METRICS.

ON SUBMAXIMAL DIMENSION OF THE GROUP OF ALMOST ISOMETRIES OF FINSLER METRICS. ON SUBMAXIMAL DIMENSION OF THE GROUP OF ALMOST ISOMETRIES OF FINSLER METRICS. VLADIMIR S. MATVEEV Abstract. We show that the second greatest possible dimension of the group of (local) almost isometries

More information